Category: Computer Science
Accelerating numerical modeling of wave propagation through 2-D anisotropic materials using OpenCL
We present an implementation of the numerical modeling of elastic waves propagation, in 2D anisotropic materials, using the new parallel computing devices (PCDs)
Novel Dynamic Partial Reconfiguration Implementation of K-Means Clustering on FPGAs: Comparative Results with GPPs and GPUs
In this work, a parameterized implementation of the K-means clustering algorithm in Field Programmable Gate Array (FPGA) is presented and compared with previous FPGA implementation as well as recent implementations on Graphics Processing Units (GPUs) and GPPs.
Use of FPGA or GPU-based architectures for remotely sensed hyperspectral image processing
In this paper we explore the role of hardware accelerators in hyperspectral remote sensing missions and further inter-compare two types of solutions: field programmable gate arrays (FPGAs) and graphics processing units (GPUs)
Research for Chinese Spam Filtering Based on GPU
In order to accelerate the speed of large-scale Chinese spam filtering, we propose a spam filtering solution based on GPU as well as considering classical text classification algorithms
GPU in Physics Computation: Case Geant4 Navigation
The goal was to find out whether Geant4 physics simulations could benefit from GPU acceleration and how difficult it is to modify Geant4 code to run in a GPU
Comparison of OpenMP & OpenCL Parallel Processing Technologies
This paper presents a comparison of OpenMP and OpenCL based on the parallel implementation of algorithms from various fields of computer applications
Genetic Algorithm Modeling with GPU Parallel Computing Technology
We present a multi-purpose genetic algorithm, designed and implemented with GPGPU / CUDA parallel computing technology. The model was derived from a multi-core CPU serial implementation, named GAME, already scientifically successfully tested and validated on astrophysical massive data classification problems, through a web application resource (DAMEWARE), specialized in data mining based on Machine Learning paradigms.…
Implementation of a High Performance GPGPU Compiler
In this paper we present our experience in developing an optimizing compiler for general purpose computation on graphics processing units (GPGPU) based on the Cetus compiler framework.
An efficient implementation of Bailey and Borwein’s algorithm for parallel random number generation on graphics processing units
This paper investigates the serial and parallel implementation of a Linear Congruential Generator for Graphics Processing Units (GPU) based on the binary representation of the normal number
Parallelization of Dynamic Programming in Nussinov RNA Folding Algorithm on the CUDA GPU
In this work, the Nussinov algorithm is analyzed but from the CUDA GPU programming perspective. The algorithm is radically redesigned in order to utilize the highly parallel NUMA architecture of the GPU.






