CUDA 4.0 and VS 2010 Samples Updated for 4.0 release

Sunday, June 19th, 2011

I finally  got around to updating the RC2 sample/walkthrough to work with the RTM release of CUDA 4.0. No real issues to report thankfully. Nvidia seem to have fixed the issues relating to  conflicts between STL and the Thrust library so things are in better shape now. Maybe I can ...

Visual Studio 2010 and CUDA

Sunday, May 8th, 2011

So I finally got around to taking a stab at getting the CUDA 4.0 RC2 SDK up and running in between talks at ALT.NET Seattle. I’m really hoping this is the last tutorial. It’s gotten a lot simpler to build CUDA on Windows in the last couple of releases. Update June 19th ...

Using CUDA and Thrust with VS 2010 Part 2: x64 Builds

Saturday, April 9th, 2011

In the first part of this tutorial I covered getting started with CUDA on Visual Studio 2010. The post didn’t talk about building x64 targets. A couple of people asked about this so I’ve upgraded the example to build both x86 and x64. This is very easy to do provided ...

Using CUDA and Thrust with Visual Studio 2010

Sunday, March 6th, 2011

Using CUDA 4.0 RC2? Read the update post here. I was working on setting up some new CUDA projects as I’m doing some spiking (prototyping for the not so agile crowd) work to figure out how best to use CUDA 4.0. I’ve turned it in to a quick tutorial on how ...

Using CUDA/Thrust with the Parallel Patterns Library

Friday, December 31st, 2010

I’ve been working on getting my CUDA/Thrust N-body code working with multiple GPUs. The following is a quick code spike showing how to use the Parallel Patterns Library (PPL) to create a task for each CUDA device and execute a CUDA kernel on it using the Thrust library for CUDA. ...

GPU Technology Conference 2010

Saturday, August 28th, 2010

This year’s GPU Technology Conference is in San Jose which is pretty much my part of the World so I thought I’d talk a busman’s holiday in mid-September and check it out. There’s several pre-conference tutorials on the basics of CUDA, DirectCompute, OpenCL as well as a ...