The MicroCFD Virtual Wind Tunnel, Educational & Professional Edition, has recently been upgraded. The new version (1.8) supports multi-core CPUs and CUDA core GPUs and runs significantly faster than the previous single-processor version.
Our GPU software delivers haplotyping and imputation accuracies comparable to competing programs at a fraction of the computational cost and peak memory demand.
Logo: High Performance GPU-based TSP Solver. It is an approximate stochastic solver based on Iterative Local Search and 2-opt local search merhod. Despite limited shared memory resources, this implementation is able to solve arbitrarily large instances.
We have therefore implemented a very fast version of TM-score for Graphical Processing Units (TM-score-GPU), using a new and novel hybrid Kabsch/quaternion method for calculating the optimal superposition and RMSD that is designed for parallel applications.
PGCL is an OpenCL framework for compiling and running OpenCL 1.1 embedded profile applications on the ST-Ericsson NovaThor U8500 and follow-on platforms using a single ARM core as the OpenCL host and multiple ARM cores as an OpenCL computing device.
We have developed a novel algorithm called Hybrid Motif Sampler (HMS) that enables more scalable and accurate motif analysis. We proposed a “fragmentation” technique to hide data transfer time between memories.
Portable OpenCL (pocl) aims to be an efficient open source implementation of the OpenCL 1.2 standard. In addition to producing an easily portable open source OpenCL implementation, another major goal of the project is improving performance portability of OpenCL programs with compiler optimizations, reducing the need for target-dependent manual optimizations.
We have provided a hybrid massively parallelized molecular dynamic VASP ab initio software for GPUs clusters. To avoid continuously transferring data from CPUs to GPUs, we have ported some functions in CUDA and achieved a balanced combination between CUFFT, CUBLAS, and CUDA.