Tag: GPU
A Parallel Auxiliary Grid AMG Method for GPU
In this paper, we develop a new parallel auxiliary grid algebraic multigrid (AMG) method to leverage the power of graphic processing units (GPUs)
Computing effective properties of heterogeneous materials on heterogeneous parallel processors
With the goal of maximizing the obtained performances and limiting resource consumption, we utilized a software architecture based on stream processing, event-driven scheduling, and dynamic load balancing.
Coupling SIMD and SIMT architectures of phylogeny-aware short-read alignment kernel
Background Aligning short DNA reads to a reference sequence alignment is a prerequisite for detecting their biological origin and analyzing them in a phylogenetic context. With the PaPaRa tool we introduced a dedicated dynamic programming algorithm for simultaneously aligning short reads to reference alignments and corresponding evolutionary reference trees. The algorithm aligns short reads to…
Towards accelerating smoothed particle hydrodynamics simulations
This paper presented a computational methodology to carry out three-dimensional, massively parallel Smoothed Particle Hydrodynamics (SPH) simulations across multiple GPUs
Efficient Method for Calculating Coulombic Interactions in Mass Spectrometry Simulations on GPU
In this study, we tested the parallel hybrid algorithm with a couple of basic models and analyzed the performance by comparing it to that of the original, fully-explicit method written in serial code
GPU Computing Using Concurrent Kernels: A Case Study
We concentrated on two performance factors, namely the launching order of concurrent kernels and the kernel granularity. Extensive experiments show that the launching order of concurrent kernels can hardly affect application performance.
Nvidia Tesla K20 GPU High Resolution pictures
After today’s debut of Cray XK7 supercomputer Titan, NVIDIA released high resolution pictures of Tesla K20 GPU card.
Oak Ridge National Laboratory debuts fastest supercomputer Cray XK7 system Titan
The U.S. Department of Energy’s (DOE) Oak Ridge National Laboratory launched a new era of scientific supercomputing today with Titan, a system capable of churning through more than 20,000 trillion calculations each second-or 20 petaflop
Temporal and spectral imaging with micro-Computer Tomography
Our proposed method produces five-dimensional volumetric images that distinguish different materials at different points in time, and can be used to segment regions containing iodinated blood and compute measures of cardiac function.
A Heterogeneous Accelerated Matrix Multiplication: OpenCL + APU + GPU+ Fast Matrix Multiply
As users and developers, we are witnessing the opening of a new computing scenario: the introduction of hybrid processors into a single die, such as an accelerated processing unit (APU) processor, and the plug-and-play of additional graphics processing units (GPUs) onto a single motherboard






