Jonathan Cohen and the NVIDIA CUDA Library Team present the latest benchmark results using the extensive numerical libraries included with CUDA 5. This webinar will cover all the data points and the significance of the new Math Library Performance Report.
In this paper we describe our experiences in running simulations of the explosion phase of Type Ia supernovae on the largest available platforms. The simulations use FLASH, a modular, adaptive mesh, parallel simulation code with a wide user base. The simulations use multiple physics components: hydrodynamics, gravity, a sub-grid flame model, a three-stage burning model, and a degenerate equation of state.
Intel’s Software & Intel Labs devised a new framework for distributed 1-D FFT problems, which traditionally require three costly all-to-all inter-node data exchanges. The new approach delivers multiple 1-D FFT algorithms requiring just a single all-to-all inter-node data exchange.
Using a massively parallel algorithm for the modified Gram-Schmidt orthogonalization on an NVIDIA Tesla C2050 Computing Processor, we can compensate for the cost of one extra level of precision, even for modest dimensions.
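As a rough illustration of the orthogonalization named above, here is a minimal serial sketch of modified Gram-Schmidt in plain Python. It is not the authors' GPU code: it uses ordinary double precision rather than extended precision, and the function name `modified_gram_schmidt` is chosen here for illustration. The inner loop over the remaining vectors has independent iterations, which is the kind of work a massively parallel version could distribute.

```python
import math


def modified_gram_schmidt(columns):
    """Orthonormalize a list of vectors (each a list of floats).

    Illustrative serial sketch; the name and interface are assumptions.
    """
    q = [list(v) for v in columns]  # work on copies
    for i in range(len(q)):
        norm = math.sqrt(sum(x * x for x in q[i]))
        if norm == 0.0:
            raise ValueError("encountered a zero vector")
        q[i] = [x / norm for x in q[i]]
        # Subtract the q[i] component from every remaining vector.
        # Each j is independent of the others, so this loop is the
        # natural candidate for parallel execution.
        for j in range(i + 1, len(q)):
            proj = sum(a * b for a, b in zip(q[i], q[j]))
            q[j] = [b - proj * a for a, b in zip(q[i], q[j])]
    return q


if __name__ == "__main__":
    basis = modified_gram_schmidt([[1.0, 1.0, 0.0],
                                   [1.0, 0.0, 1.0],
                                   [0.0, 1.0, 1.0]])
    for vec in basis:
        print(vec)
```

Unlike classical Gram-Schmidt, each projection here is computed against the already-updated vector, which is what makes the modified variant better behaved in floating-point arithmetic.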
In this method, we use multiple threads to control the GPUs and a separate thread to handle data storage, so that computation and data storage proceed simultaneously.
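The threading pattern described above can be sketched, under assumptions, as a producer-consumer arrangement: one worker thread per device pushes results onto a queue, and a single storage thread drains it. Everything named here (`compute_chunk`, `device_worker`, `storage_writer`, `run`) is hypothetical, and the GPU work is replaced by a plain Python stand-in, so this shows only the overlap of computation and storage, not the original implementation.

```python
import queue
import threading


def compute_chunk(device_id, step):
    # Hypothetical stand-in for a computation launched on one GPU.
    return [device_id * 100 + step + k for k in range(4)]


def device_worker(device_id, steps, out_q):
    # One thread per device: compute, then hand the result to storage.
    for step in range(steps):
        out_q.put((device_id, step, compute_chunk(device_id, step)))


def storage_writer(out_q, store):
    # Dedicated storage thread: runs while the workers keep computing.
    while True:
        item = out_q.get()
        if item is None:  # sentinel: all workers have finished
            break
        device_id, step, data = item
        store[(device_id, step)] = data  # stand-in for a disk write


def run(num_devices=2, steps=3):
    out_q = queue.Queue()
    store = {}
    writer = threading.Thread(target=storage_writer, args=(out_q, store))
    writer.start()
    workers = [threading.Thread(target=device_worker, args=(d, steps, out_q))
               for d in range(num_devices)]
    for t in workers:
        t.start()
    for t in workers:
        t.join()
    out_q.put(None)  # tell the storage thread to stop
    writer.join()
    return store


if __name__ == "__main__":
    results = run()
    print(len(results), "chunks stored")
```

The queue decouples the two sides: workers never wait on storage unless the queue is bounded and full, and only the single storage thread touches the output.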
Jack Dongarra of Oak Ridge National Laboratory, speaking at the 41st SPEEDUP Workshop on High-Performance Computing, ETH Zurich, Switzerland, September 7, 2012.
In this work, we studied the performance of geometric multigrid (GMG) on CPU-GPU heterogeneous computers. Our numerical results suggest that, in the best-case scenario, the GPU version of GMG can achieve an 18.5-times speed-up in 2D and a 16.0-times speed-up in 3D compared with an efficient implementation of multigrid methods on CPUs.
After tessellation for collision detection on the GPU, models with 200 million faces cannot be processed in real time. Our algorithm introduces a selection step for areas with high collision potential.
We collected notable GPU computing talks at the upcoming SIGGRAPH 2012 meeting. All DOIs link to the corresponding technical papers.