Tag: algorithm
CudaHull: Fast parallel 3D convex hull on the GPU
We have presented a 3D convex-hull-construction algorithm using the CUDA programming model. It is similar to Quick Hull, in that it starts with an initial polyhedron constructed using four extreme points, discards internal points, and redistributes the remaining points among the faces.
Call For Papers: High Performance Simulation of biological systems
The goal of this track is to explore the use of emerging parallel computing architectures as well as High Performance Computing systems for the simulation of relevant biological systems
MASSIVE 2012: Fourth Workshop on Massive Data Algorithmics
The scope of the workshop includes both fundamental algorithmic problems involving massive data, as well as algorithms for more specialized problems in, e.g., graphics, databases, statistics and bioinformatics.
Anatomy of High-Performance 2D Similarity Calculations
We describe efficient implementations of this primitive as well as example applications using features of modern CPUs that allow 20-40 performance increases relative to typical code
High Precision Integer Multiplication with a GPU
We have improved our prior implementation of Strassen’s algorithm for high performance multiplication of very large integers on a general purpose graphics processor (GPU).
Electronic Structure Calculation Methods on Accelerators workshop at Oak Ridge
This conference seeks to bring together computationally oriented scientists interested in using accelerator coprocessors in electronic structure theory. We will discuss recent advances, barriers and perspectives in modeling electronic structure on accelerators.
Evaluating polynomials in several variables and their derivatives on a GPU computing processor
In this paper we describe algorithms for the massively parallel evaluation and differentiation of polynomials in several variables
Processing piecewise autoregressive model image interpolation algorithm on GPU with CUDA
This paper presents a parallel implementation of piecewise autoregressive modeling image interpolation algorithm, using CUDA (Compute Unified Device Architecture) on GPU
Enhancing data parallelism for ant colony optimisation on GPUs
In this paper, we deal with a GPU implementation of Ant Colony Optimisation (ACO), a population-based optimisation method which comprises two major stages: Tour construction and Pheromone update.





