GTC2013

Tag: algorithm

Webinar: CUDA 5 Math Library Performance Overview

Webinar: CUDA 5 Math Library Performance Overview

| 29 January, 2013 | 0 Comments

Jonathan Cohen and the NVIDIA CUDA Library Team present the latest benchmark results using the extensive numerical libraries included with CUDA 5. This webinar will cover all the data points and the significance of the new Math Library Performance Report

Continue Reading

Pragmatic optimizations for better scientific utilization of large supercomputers

Pragmatic optimizations for better scientific utilization of large supercomputers

| 5 December, 2012 | 0 Comments

In this paper we describe our experiences in running simulations of the explosion phase of Type Ia supernovae on the largest available platforms. The simulations use FLASH, a modular, adaptive mesh, parallel simulation code with a wide user base. The simulations use multiple physics components: hydrodynamics, gravity, a sub-grid flame model, a three-stage burning model, and a degenerate equation of state.

Continue Reading

SC12 Best Paper Award: Framework for Low-Communication 1D FFT

SC12 Best Paper Award: Framework for Low-Communication 1D FFT

| 15 November, 2012 | 1 Comment

Intel’s Software & Intel Labs devised a new framework for distributed 1-D FFT problems which traditionally require three costly all-to-all inter-node data exchanges. The new approach delivers multiple 1D FFT algorithms requiring just a single all-to-all inter-node data exchange.

Continue Reading

Fast box-counting algorithm on GPU

Fast box-counting algorithm on GPU

| 9 October, 2012 | 0 Comments

In this paper we present a fast parallel version of the box-counting algorithm, which has been coded in CUDA for execution on the Graphic Processing Unit (GPU)

Continue Reading

Orthogonalization on a general purpose graphics processing unit

Orthogonalization on a general purpose graphics processing unit

| 4 October, 2012 | 0 Comments

Using a massively parallel algorithm for the modified Gram-Schmidt orthogonalization on a NVIDIA Tesla C2050 Computing Processor we can compensate for the cost of one extra level of precision, even already for modest dimensions.

Continue Reading

A multi-thread scheduling method for 3D CT image reconstruction using multi-GPU

A multi-thread scheduling method for 3D CT image reconstruction using multi-GPU

| 24 September, 2012 | 0 Comments

In this method we use Multi-Threads to control GPUs and a separate thread to accomplish data storage, so that we make the calculation and data storage simultaneously

Continue Reading

On the Future of High Performance Computing: How to Think for Peta and Exascale Computing

On the Future of High Performance Computing: How to Think for Peta and Exascale Computing

| 14 September, 2012 | 0 Comments

Jack Dongarra of Oak Ridge National Laboratory, speaking on 41th SPEEDUP Workshop on High-Performance Computing, ETH Zurich, Switzerland, September 7, 2012.

Continue Reading

Numerical Study of Geometric Multigrid Methods on CPU-GPU Heterogeneous Computers

Numerical Study of Geometric Multigrid Methods on CPU-GPU Heterogeneous Computers

| 24 August, 2012 | 0 Comments

In this work, we studied the performance of GMG on CPU-GPU heterogeneous computers. Our numerical results suggest that in the best-case scenario the GPU version of GMG can achieve 18.5 times speed-up in 2D and 16.0 times speed-up in 3D compared with an efficient implementation of multigrid methods on CPUs.

Continue Reading

Collision Detection Method for High Resolution Objects Using Tessellation Unit on GPU

Collision Detection Method for High Resolution Objects Using Tessellation Unit on GPU

| 5 August, 2012 | 0 Comments

After tessellation for collision detection on GPU, 200-million-face models cannot be computed in real-time. Our algorithm proposed an high-potential-collision area selection.

Continue Reading

Notable SIGGRAPH 2012 technical papers focused on GPU computing

Notable SIGGRAPH 2012 technical papers focused on GPU computing

| 31 July, 2012 | 1 Comment

We collected notable GPU computing talks at upcoming SIGGRAPH 2012 meeting. All DOIs linked to the corresponding technical papers.

Continue Reading