Tag: parallelization

Using explicit platform descriptions to support programming of heterogeneous many-core systems

Using explicit platform descriptions to support programming of heterogeneous many-core systems

| 27 January, 2012 | 0 Comments

We show various usage scenarios of our PDL and demonstrate the effectiveness of our framework for a commonly used scientific kernel and a financial application on different configurations of a state-of-the-art CPU/GPU system.

Continue Reading

GRace: A low-overhead mechanism for detecting data races in gpu programs

GRace: A low-overhead mechanism for detecting data races in gpu programs

| 26 October, 2011 | 0 Comments

In this paper, we propose GRace, a new mechanism for detecting races in GPU programs that combines static analysis with a carefully designed dynamic checker for logging and analyzing information at runtime.

Continue Reading

Call for papers-IWOMP 2011

Call for papers-IWOMP 2011

| 27 January, 2011 | 0 Comments

The International Workshop on OpenMP 2011 is seeking submissions of unpublished technical papers detailing innovative, original research and development related to OpenMP. All topics related to OpenMP are of interest, including OpenMP applications in any domain

Continue Reading

Parallel-vector algorithms for particle simulations on shared-memory multiprocessors

Parallel-vector algorithms for particle simulations on shared-memory multiprocessors

| 26 January, 2011 | 0 Comments

Two novel algorithms for shared-memory concurrent computation of particle simulations were developed, and their efficiency, scalability, and compatibility with various shared-memory architectures were measured. It was verified that the algorithms enhance the parallel efficiency on most architectures with scalar, vector, and multithreading processors. The performance on a vector processor is particularly excellent: the vector operating ratio reaches 99.8% and the vector length is almost 256—near the architectural limit.

Continue Reading

CUDA accelerated MJPEG 2000 encoder

CUDA accelerated MJPEG 2000 encoder

| 6 October, 2010 | 0 Comments

This paper presents a portable, fault-tolerant and a novel parallelized software implementation of Motion JPEG 2000 (MJPEG 2000) reference encoder using CUDA. Each major structural/ computational unit of JPEG 2000 is discussed in CUDA framework and the results are provided wherever required.

Continue Reading