Reboot …

After a long hiatus from this blog, I am returning to write about programming, C++, C#, PHP, Java, Linux, Windows griping, and other tidbits. Several years have passed, and technologies have matured, ever so slightly, on the desktop, IMHO. I have started a new project related to GPGPU computing: Campy.NET, a “better” API to access the …

Syntactic sugar with C++ AMP

In CUDA, OpenCL, and C++ AMP, a group is a collection of threads that execute in parallel in lock-step fashion.  In CUDA, it is called a block; in OpenCL, it is called a work-group; in C++ AMP, it is called a tile.  The purpose of a group is to allow threads within the group to communicate with …
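
To make the idea concrete, here is a minimal CUDA sketch (the kernel, block size, and data are illustrative placeholders of my own, not code from the post) in which the threads of one block cooperate through shared memory; the same pattern maps onto an OpenCL work-group using local memory or a C++ AMP tile using tile_static storage.

// Threads within one block (a "group") communicate through shared memory
// and synchronize with __syncthreads() to sum 256 elements per block.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void block_sum(const float *in, float *out, int n)
{
    __shared__ float partial[256];            // visible to every thread in the block
    int tid = threadIdx.x;
    int gid = blockIdx.x * blockDim.x + tid;

    partial[tid] = (gid < n) ? in[gid] : 0.0f;
    __syncthreads();                          // all threads reach here before any reads

    for (int stride = blockDim.x / 2; stride > 0; stride /= 2) {
        if (tid < stride)
            partial[tid] += partial[tid + stride];
        __syncthreads();                      // tree reduction, one level per pass
    }
    if (tid == 0)
        out[blockIdx.x] = partial[0];         // one partial sum per block
}

int main()
{
    const int n = 1024, threads = 256, blocks = n / threads;
    float hin[n], hout[blocks];
    for (int i = 0; i < n; ++i) hin[i] = 1.0f;

    float *din, *dout;
    cudaMalloc((void **)&din, n * sizeof(float));
    cudaMalloc((void **)&dout, blocks * sizeof(float));
    cudaMemcpy(din, hin, n * sizeof(float), cudaMemcpyHostToDevice);

    block_sum<<<blocks, threads>>>(din, dout, n);
    cudaMemcpy(hout, dout, blocks * sizeof(float), cudaMemcpyDeviceToHost);

    float total = 0.0f;
    for (int b = 0; b < blocks; ++b) total += hout[b];
    printf("sum = %f\n", total);              // expect 1024
    cudaFree(din);
    cudaFree(dout);
    return 0;
}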

Performance comparison of CUDA, OpenCL, and C++ AMP

Trying to get information about the underlying design of a GPGPU programming language environment and its hardware can be difficult.  Companies will not publish design information because they do not want you or other companies to copy the technology.  But sometimes you need to know details of a technology that are just not published in order …
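
One practical way to learn details that are not published is to measure them yourself.  The sketch below is a hedged illustration of the kind of microbenchmark such a comparison rests on: it times a kernel launch with CUDA events.  The saxpy kernel, data sizes, and launch configuration are placeholders of my own, not taken from the post.

// Timing a kernel with CUDA events -- one building block of a performance
// comparison between CUDA, OpenCL, and C++ AMP. The kernel is a placeholder.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void saxpy(int n, float a, const float *x, float *y)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] = a * x[i] + y[i];
}

int main()
{
    const int n = 1 << 20;
    float *x, *y;
    cudaMalloc((void **)&x, n * sizeof(float));
    cudaMalloc((void **)&y, n * sizeof(float));
    cudaMemset(x, 0, n * sizeof(float));
    cudaMemset(y, 0, n * sizeof(float));

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    cudaEventRecord(start, 0);
    saxpy<<<(n + 255) / 256, 256>>>(n, 2.0f, x, y);
    cudaEventRecord(stop, 0);
    cudaEventSynchronize(stop);               // wait until the kernel has finished

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);   // elapsed GPU time in milliseconds
    printf("kernel time: %f ms\n", ms);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(x);
    cudaFree(y);
    return 0;
}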

Developing CUDA and C++ AMP in Visual Studio 2011

Every time Microsoft releases a new version of Visual Studio C++, NVIDIA releases a new version of CUDA to work with the new version of Visual Studio.  Unfortunately, that seemed to take NVIDIA an incredibly long time last time around. After Visual Studio 2010 was released on April 4, 2010, CUDA integration with Visual …

C++ AMP

At the AMD Fusion Developer Summit 2011 in June, Microsoft announced C++ Accelerated Massive Parallelism (C++ AMP), an extension to C++ for parallel programming of GPUs.  This extension, included in Visual Studio 11, would allow developers to program the GPU with language features that are arguably much more powerful than either CUDA or OpenCL.  In …

Antlr PHP target

My wife recently started a WordPress blog on regulatory affairs of the drug and medical device industries.  When she started it, she decided to add posts from time to time about newly approved drugs and generics, based on information from the FDA website.  So, after proofreading a few of her drafts, I realized that this was a good …

OpenCL vs. CUDA

Today's processors have undergone a huge transformation from those of just 10 years ago.  CPU manufacturers Intel and AMD (and up-and-coming CPU designer ARM) have increased processor speed via greater emphasis on superscalar execution, deeper pipelining, branch prediction, out-of-order execution, and fast multi-level caches.  This design philosophy has resulted in faster response …

WASTE — A CUDA Emulator

CUDA C++ is a parallel programming language for NVIDIA GPUs.  Using the CUDA Toolkit (http://developer.nvidia.com/object/cuda_3_2_toolkit_rc.html), developers can write programs that take advantage of the large parallel computing potential of the GPU, speeding up their programs by several orders of magnitude.  CUDA programs are executed on the GPU, which works as a coprocessor with the CPU, having …
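
As a minimal, hedged sketch of that coprocessor model (my own illustration, not code from WASTE), the host allocates device memory, copies input data to the GPU, launches a kernel there, and copies the result back:

// A bare-bones CUDA program showing the CPU/GPU coprocessor model:
// the host stages data on the device, launches a kernel, and reads results back.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void vec_add(const float *a, const float *b, float *c, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

int main()
{
    const int n = 256;
    float ha[n], hb[n], hc[n];
    for (int i = 0; i < n; ++i) { ha[i] = (float)i; hb[i] = 2.0f * i; }

    float *da, *db, *dc;
    cudaMalloc((void **)&da, n * sizeof(float));
    cudaMalloc((void **)&db, n * sizeof(float));
    cudaMalloc((void **)&dc, n * sizeof(float));

    cudaMemcpy(da, ha, n * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(db, hb, n * sizeof(float), cudaMemcpyHostToDevice);

    vec_add<<<(n + 127) / 128, 128>>>(da, db, dc, n);  // executes on the GPU

    cudaMemcpy(hc, dc, n * sizeof(float), cudaMemcpyDeviceToHost);
    printf("hc[10] = %f\n", hc[10]);                   // expect 30

    cudaFree(da);
    cudaFree(db);
    cudaFree(dc);
    return 0;
}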

Writing a wrapper for the CUDA memory allocation API

As a software developer, you may be faced with the problem of modifying the behavior of a third-party program.  For example, suppose you are writing a debugger that checks how memory allocation is used in the program being debugged.  You often do not have the source for the program, so you cannot change it.  And, …
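
The rest of the post presumably deals with how to intercept those calls when you cannot relink or recompile; as a hedged sketch of the bookkeeping such a wrapper performs (the names wrapped_cudaMalloc and wrapped_cudaFree are mine, not from the post), a wrapper can record every allocation and warn about frees of unknown pointers:

// Sketch of a tracking wrapper around the CUDA memory allocation API.
// It records each allocation and flags frees of untracked pointers; the
// mechanism for redirecting calls to it (relinking, hooking, etc.) is omitted.
#include <cstdio>
#include <map>
#include <cuda_runtime.h>

static std::map<void *, size_t> g_allocations;   // device pointer -> size

cudaError_t wrapped_cudaMalloc(void **devPtr, size_t size)
{
    cudaError_t rc = cudaMalloc(devPtr, size);   // forward to the real API
    if (rc == cudaSuccess) {
        g_allocations[*devPtr] = size;
        printf("cudaMalloc(%lu bytes) -> %p\n", (unsigned long)size, *devPtr);
    }
    return rc;
}

cudaError_t wrapped_cudaFree(void *devPtr)
{
    if (g_allocations.erase(devPtr) == 0)
        printf("warning: freeing untracked pointer %p\n", devPtr);
    return cudaFree(devPtr);                     // forward to the real API
}

int main()
{
    void *p = 0;
    wrapped_cudaMalloc(&p, 1024);
    wrapped_cudaFree(p);
    wrapped_cudaFree(p);                         // double free: the wrapper warns
    return 0;
}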

Parallel programming using CUDA

Last month, I took a short course called Introduction to Multicore Programming, which was taught by James T. Demers of Physical Sciences Inc. This course introduced me to current hardware and software systems for parallel computing. For me, the course was timely: I have been reading about how important parallel programming is becoming; …