Tag: parallel programming

Proven Algorithmic Techniques for Many-core Processors Workshop

| 26 July, 2012 | 0 Comments

By studying many current GPU computing applications, we have learned that the limits of an application’s scalability are often related to some combination of memory bandwidth saturation, memory contention, imbalanced data distribution, or data structure/algorithm interactions.

Upcoming Book: C++ AMP Accelerated Massive Parallelism with Microsoft Visual C++

| 18 July, 2012 | 0 Comments

With this practical book, experienced C++ developers will learn parallel programming fundamentals with C++ AMP through detailed examples, code snippets, and case studies.

GPU debugging in Visual Studio 2012 screencast

| 11 July, 2012 | 0 Comments

This screencast assumes knowledge of the C++ AMP API; for example, that you fully understand the matrix multiplication implementation in C++ AMP. It shows which features are available in Visual Studio 2012 for debugging C++ AMP code.

Inside VC++ 2012 Auto-Vectorization

| 30 June, 2012 | 0 Comments

The VC++ 2012 auto-vectorizer tries to make loops in your code run faster by automatically vectorizing your code using the SSE instructions available in all current mainline Intel and AMD chips.

ISC 12 Tutorial: Relative, Reverse & CUDA Debugging for Computationally Intensive Application Development

| 15 June, 2012 | 0 Comments

A significant challenge in developing, maintaining and porting numerical simulations is avoiding subtle errors that undermine the validity of the results without causing an obvious failure. This tutorial will share experiences, best practices and debugging techniques for identifying and resolving such defects in parallel applications.

Free Webinar: Getting Started with Intel SDK for OpenCL Applications

| 14 June, 2012 | 1 Comment

Developing parallel applications that take advantage of all the compute resources available on the underlying system is not a trivial task, and doing that across multiple devices in a standard manner is even more difficult.

On single-walk parallelization of the job shop problem solving algorithms

| 5 June, 2012 | 0 Comments

This paper proposes new parallel methods for determining the objective function of the job shop scheduling problem, considering the makespan and the sum of job execution times as criteria. The proposed methods can also be applied to other popular objective functions, such as job tardiness or flow time.

Algorithmic skeletons for multi-core, multi-GPU systems and clusters

| 25 May, 2012 | 0 Comments

In this paper, we present our skeleton library Muesli, which not only simplifies parallel programming but also allows a single application to be written that can be executed on a variety of parallel machines, ranging from simple multi-core processors with shared memory to clusters of multi- and many-core processors with distributed memory, as well as multi-GPU systems and GPU clusters.

High-performance computing tools for the integrated assessment and modelling of social-ecological systems

| 10 May, 2012 | 0 Comments

Integrated spatio-temporal assessment and modelling of complex social–ecological systems is required to address global environmental challenges.

A Parallel Front Propagation Method: Simulating geological folds on parallel architectures

| 4 May, 2012 | 0 Comments

This thesis presents a novel three-dimensional anisotropic front propagation algorithm for simulating geological folds on parallel architectures. The algorithm's abundant parallelism is demonstrated on multi-core CPUs and GPU architectures.
