I'm trying to configure CUDA6.0 with my Visual Studio 2010. I created a project using CUDA 6.0 runtime. When compiling, do I need to change/add any options? Since my CUDA code runs slower than the series version, is there any chance that the code is not executed in parallel (say, the GPU becomes a slow CPU...), regardless other possibilities e.g. double precision, overhead, etc.?
Many thanks, XF
From what it sounds like is that you just took some serial code and compiled it thinking it would work.
But with the assumption that you actually have parallel code you might want to make sure you
You should also check out the GTC talks on optimizations [link to pdf] (my old professor). It covers some basic optimizations that you can perform to get your code up to speed.
The talks from the last few years of GTC can be found here [link]. They have multiple updates to optimizations, talks about different tools and so forth.