[PR] New CUDA 4.0 Release Makes Parallel Programming Easier

March 1, 2011

Unified Virtual Addressing, GPU-to-GPU Communication and Enhanced C++ Template Libraries Enable More Developers to Take Advantage of GPU Computing

SINGAPORE — March 1, 2011 — NVIDIA today announced the latest version of the NVIDIA® CUDA® Toolkit for developing parallel applications using NVIDIA GPUs. The NVIDIA CUDA 4.0 Toolkit was designed to make parallel programming easier, and enable more developers to port their applications to GPUs. This has resulted in three main features:

  • NVIDIA GPUDirect™ 2.0 Technology – Offers support for peer-to-peer communication among GPUs within a single server or workstation. This makes multi-GPU programming easier and faster, and improves application performance.
  • Unified Virtual Addressing (UVA) – Provides a single merged-memory address space for the main system memory and the GPU memories, enabling quicker and easier parallel programming.
  • Thrust C++ Template Performance Primitives Libraries – Provide a collection of powerful open source C++ parallel algorithms and data structures that ease programming for C++ developers. With Thrust, routines such as parallel sorting are 5X to 100X faster than with the Standard Template Library (STL) and Threading Building Blocks (TBB).
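
As a rough illustration of the first two features, the sketch below moves a buffer from the host to one GPU, on to a second GPU, and back again. It relies on `cudaMemcpyDefault`, which under Unified Virtual Addressing lets the runtime infer the copy direction from the pointer values, and it enables peer access where the hardware reports support. The function name `roundTrip` and its parameters are hypothetical, and the code assumes a 64-bit system with UVA-capable GPUs and a non-empty input; it is not taken from the release itself.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical helper: copies `in` host -> srcDev -> dstDev -> host.
// Returns true if every CUDA call succeeded and the data came back unchanged.
// On a single-GPU machine, pass the same device index twice.
bool roundTrip(const std::vector<float>& in, std::vector<float>& out,
               int srcDev, int dstDev) {
    const size_t bytes = in.size() * sizeof(float);
    float* onSrc = nullptr;
    float* onDst = nullptr;
    out.assign(in.size(), 0.0f);

    // Allocate one buffer on each GPU; dstDev is left as the current device.
    bool ok = cudaSetDevice(srcDev) == cudaSuccess &&
              cudaMalloc(&onSrc, bytes) == cudaSuccess &&
              cudaSetDevice(dstDev) == cudaSuccess &&
              cudaMalloc(&onDst, bytes) == cudaSuccess;

    // Optionally let dstDev read srcDev's memory directly (peer-to-peer).
    // Without peer access the copy below still works, staged through the host.
    if (ok && srcDev != dstDev) {
        int canAccess = 0;
        cudaDeviceCanAccessPeer(&canAccess, dstDev, srcDev);
        if (canAccess) {
            cudaDeviceEnablePeerAccess(srcDev, 0);
            cudaGetLastError();  // clear a possible "already enabled" status
        }
    }

    // With UVA, one copy kind covers host-to-device, device-to-device,
    // and device-to-host transfers.
    ok = ok &&
         cudaMemcpy(onSrc, in.data(), bytes, cudaMemcpyDefault) == cudaSuccess &&
         cudaMemcpy(onDst, onSrc, bytes, cudaMemcpyDefault) == cudaSuccess &&
         cudaMemcpy(out.data(), onDst, bytes, cudaMemcpyDefault) == cudaSuccess;

    // Release each buffer with its owning device selected.
    cudaSetDevice(srcDev);
    cudaFree(onSrc);
    cudaSetDevice(dstDev);
    cudaFree(onDst);
    return ok && out == in;
}
```

Calling `roundTrip(in, out, 0, 1)` on a two-GPU workstation exercises the GPU-to-GPU path; `roundTrip(in, out, 0, 0)` runs the same code on a single GPU.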

“Unified virtual addressing and faster GPU-to-GPU communication make it easier for developers to take advantage of the parallel computing capability of GPUs,” said John Stone, senior research programmer, University of Illinois, Urbana-Champaign.

“Having access to GPU computing through the standard template interface greatly increases productivity for a wide range of tasks, from simple cashflow generation to complex computations with Libor market models, variable annuities or CVA adjustments,” said Peter Decrem, director of Rates Products at Quantifi. “The Thrust C++ library has lowered the barrier of entry significantly by taking care of low-level functionality like memory access and allocation, allowing the financial engineer to focus on algorithm development in a GPU-enhanced environment.”
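
To give a sense of what "taking care of memory access and allocation" can look like in practice, here is a minimal Thrust sketch that sorts a vector on the GPU. The wrapper name `sortOnGpu` is hypothetical and chosen for illustration. The sketch makes no claim about the speedups quoted above, which depend on data size and hardware.

```cuda
#include <thrust/copy.h>
#include <thrust/device_vector.h>
#include <thrust/sort.h>
#include <vector>

// Hypothetical wrapper: returns a sorted copy of `values`, sorted on the GPU.
std::vector<int> sortOnGpu(const std::vector<int>& values) {
    // Constructing a device_vector allocates GPU memory and copies the
    // host data into it; no explicit cudaMalloc or cudaMemcpy is needed.
    thrust::device_vector<int> onDevice(values.begin(), values.end());

    // STL-style call; the sort runs in parallel on the device.
    thrust::sort(onDevice.begin(), onDevice.end());

    // Copy the sorted data back into ordinary host memory.
    std::vector<int> sorted(values.size());
    thrust::copy(onDevice.begin(), onDevice.end(), sorted.begin());
    return sorted;  // device memory is freed when onDevice goes out of scope
}
```

The interface deliberately mirrors `std::sort` and `std::vector`, which is the "standard template interface" the quote refers to.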

The CUDA 4.0 architecture release includes a number of other key features and capabilities, including:

  • MPI Integration with CUDA Applications – Modified MPI implementations like Open MPI automatically move data to and from the GPU memory over InfiniBand when an application does an MPI send or receive call.
  • Multi-thread Sharing of GPUs – Multiple CPU host threads can share contexts on a single GPU, making it easier for multi-threaded applications to share a single GPU.
  • Multi-GPU Sharing by Single CPU Thread – A single CPU host thread can access all GPUs in a system. Developers can easily coordinate work across multiple GPUs for tasks such as “halo” exchange in applications.
  • New NPP Image and Computer Vision Library – A rich set of image transformation operations that enable rapid development of imaging and computer vision applications.
  • New and Improved Capabilities:
    • Auto performance analysis in the Visual Profiler
    • New features in cuda-gdb and added support for MacOS
    • Added support for C++ features like new/delete and virtual functions
    • New GPU binary disassembler
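
The "Multi-GPU Sharing by Single CPU Thread" item above can be sketched as follows: one host thread switches between devices with `cudaSetDevice`, queues a kernel on each, and then gathers the results. The kernel `fillWithDeviceId` and the function `fillFromAllGpus` are hypothetical names used only for this example, and error handling is kept minimal for brevity.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Each thread writes the index of the GPU it runs on into the output buffer.
__global__ void fillWithDeviceId(int* out, int n, int id) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = id;
}

// Hypothetical helper: from a single host thread, runs one kernel on every
// visible GPU and returns the concatenated results (perGpu entries per GPU).
// Returns an empty vector if no GPU is available.
std::vector<int> fillFromAllGpus(int perGpu) {
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess || count == 0) return {};

    const size_t bytes = static_cast<size_t>(perGpu) * sizeof(int);
    std::vector<int*> buffers(count, nullptr);

    // Phase 1: queue work on every GPU. Kernel launches are asynchronous,
    // so the devices can run concurrently while this thread moves on.
    for (int d = 0; d < count; ++d) {
        cudaSetDevice(d);
        cudaMalloc(&buffers[d], bytes);
        const int threads = 256;
        const int blocks = (perGpu + threads - 1) / threads;
        fillWithDeviceId<<<blocks, threads>>>(buffers[d], perGpu, d);
    }

    // Phase 2: collect results. Each cudaMemcpy waits for that GPU's
    // kernel to finish before copying.
    std::vector<int> result(static_cast<size_t>(count) * perGpu);
    for (int d = 0; d < count; ++d) {
        cudaSetDevice(d);
        cudaMemcpy(result.data() + static_cast<size_t>(d) * perGpu,
                   buffers[d], bytes, cudaMemcpyDeviceToHost);
        cudaFree(buffers[d]);
    }
    return result;
}
```

Splitting the loop into a launch phase and a collect phase is a design choice: it avoids serializing the GPUs, since no device is waited on until all of them have been given work.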

A release candidate of CUDA Toolkit 4.0 will be available free of charge beginning March 4, 2011, by enrolling in the CUDA Registered Developer Programme at: www.nvidia.com/paralleldeveloper. The CUDA Registered Developer Programme provides a wealth of tools, resources, and information for parallel application developers to maximise the potential of CUDA.

For more information on the features and capabilities of the CUDA Toolkit and on GPGPU applications, please visit: www.nvidia.com/cuda.
