CUDA Toolkit 3.2 RC (September 2010) available for public download

Edison · posted 2010-9-18 15:20

Release Highlights

New and Improved CUDA Libraries

CUSPARSE, a new library of GPU-accelerated sparse matrix routines for sparse/sparse and dense/sparse operations
CURAND, a new library of GPU-accelerated random number generation (RNG) routines, supporting Sobol quasi-random and XORWOW pseudo-random routines in both host and device code
CUFFT performance tuned for radix-3, -5, and -7 transform sizes on Fermi architecture GPUs
CUBLAS performance improved 50% to 300% on Fermi architecture GPUs, for matrix multiplication of all datatypes and transpose variations
H.264 encode/decode libraries that were previously available in the GPU Computing SDK are now part of the CUDA Toolkit
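
For anyone wondering what XORWOW in the CURAND item refers to: the name comes from Marsaglia's "Xorshift RNGs" paper, where a five-word xorshift generator is combined with a simple additive counter. Below is a minimal CPU-only sketch of that recurrence in plain C. It is not CURAND's code or API; the names `xorwow_state`, `xorwow_seed_default`, `xorwow_next` and `xorwow_uniform` are made up for illustration, and the seed values are the example constants from the paper, not anything CURAND is known to use.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical state layout: five xorshift words plus an additive counter. */
typedef struct {
    uint32_t x, y, z, w, v;
    uint32_t d;
} xorwow_state;

/* Load the example seed constants from Marsaglia's paper (an assumption
 * made for this sketch; a real library seeds its state differently). */
void xorwow_seed_default(xorwow_state *s)
{
    assert(s != 0);
    s->x = 123456789u;
    s->y = 362436069u;
    s->z = 521288629u;
    s->w = 88675123u;
    s->v = 5783321u;
    s->d = 6615241u;
}

/* Advance the state by one step and return the next 32-bit value. */
uint32_t xorwow_next(xorwow_state *s)
{
    assert(s != 0);
    uint32_t t = s->x ^ (s->x >> 2);

    /* Shift the five-word window along by one word. */
    s->x = s->y;
    s->y = s->z;
    s->z = s->w;
    s->w = s->v;

    /* Xorshift update of the newest word. */
    s->v = (s->v ^ (s->v << 4)) ^ (t ^ (t << 1));

    /* Add the counter, which steps by a fixed odd constant and wraps
     * modulo 2^32. */
    s->d += 362437u;
    return s->d + s->v;
}

/* Map the next 32-bit value to a double in [0, 1). */
double xorwow_uniform(xorwow_state *s)
{
    return xorwow_next(s) / 4294967296.0;
}
```

To use it, declare an `xorwow_state`, call `xorwow_seed_default` on it once, then call `xorwow_next` or `xorwow_uniform` repeatedly. Two states seeded the same way produce the same sequence, which is the property that makes per-thread generator state practical in device code.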

CUDA Driver & CUDA C Runtime

Support for new 6GB Quadro and Tesla products
Support for debugging GPUs with more than 4GB device memory.
Integrated Tesla Compute Cluster (TCC) support in standard Windows driver packages

Development Tools

Multi-GPU debugging support for both cuda-gdb and Parallel Nsight
Added cuda-memcheck support for Fermi architecture GPUs
NVCC support for Intel C Compiler (ICC) v11.1 on 64-bit Linux distros

Miscellaneous

Support for malloc() and free() in CUDA C compute kernels
NVIDIA System Management Interface (nvidia-smi) support for reporting % GPU busy, and several GPU performance counters

New GPU Computing SDK Code Samples

Several code samples demonstrating how to use the new CURAND library, including MonteCarloCURAND, EstimatePiInlineP, EstimatePiInlineQ, EstimatePiP, EstimatePiQ, and SingleAsianOptionP
Conjugate Gradient Solver, demonstrating the use of CUBLAS and CUSPARSE together
Function Pointers, a sample that shows how to use function pointers to implement the Sobel Edge Detection filter for 8-bit monochrome images
Interval Computing, demonstrating the use of interval arithmetic operators using C++ templates and recursion
Simple Printf, demonstrating best practices for using both printf and cuprintf in compute kernels
Bilateral Filter, an edge-preserving non-linear smoothing filter for image recovery and denoising that is implemented in CUDA C with OpenGL rendering
SLI with Direct3D Texture, a simple example demonstrating the use of SLI and Direct3D interoperability with CUDA C
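
On the Conjugate Gradient Solver sample in the list above: the reason such a solver would draw on both libraries is that each iteration mixes one sparse matrix-vector product with a handful of dense vector operations (dot products and axpy updates). A rough CPU-only sketch in plain C follows, assuming a symmetric positive-definite matrix stored in CSR form. This is not the SDK sample's code; `csr_matrix`, `csr_spmv`, `vec_dot`, `vec_axpy` and `cg_solve` are hypothetical names. Presumably the sparse step is what CUSPARSE would handle on the GPU and the dense helpers are what CUBLAS would handle.

```c
#include <assert.h>

/* Hypothetical n x n sparse matrix in compressed sparse row (CSR) form. */
typedef struct {
    int n;               /* number of rows and columns */
    const int *row_ptr;  /* n + 1 offsets into col_idx and val */
    const int *col_idx;  /* column index of each stored entry */
    const double *val;   /* value of each stored entry */
} csr_matrix;

/* y = A * x: the sparse step of each iteration. */
void csr_spmv(const csr_matrix *a, const double *x, double *y)
{
    for (int i = 0; i < a->n; i++) {
        double sum = 0.0;
        for (int k = a->row_ptr[i]; k < a->row_ptr[i + 1]; k++)
            sum += a->val[k] * x[a->col_idx[k]];
        y[i] = sum;
    }
}

/* Dense helper: returns the dot product of x and y. */
double vec_dot(int n, const double *x, const double *y)
{
    double sum = 0.0;
    for (int i = 0; i < n; i++)
        sum += x[i] * y[i];
    return sum;
}

/* Dense helper: y += alpha * x. */
void vec_axpy(int n, double alpha, const double *x, double *y)
{
    for (int i = 0; i < n; i++)
        y[i] += alpha * x[i];
}

/* Solve A x = b for symmetric positive-definite A.
 * x holds the initial guess on entry and the solution on return.
 * r, p and ap are caller-provided work arrays of length n.
 * Iteration stops when the squared residual norm is <= tol_sq
 * or after max_iter steps. Returns the number of iterations used. */
int cg_solve(const csr_matrix *a, const double *b, double *x,
             double *r, double *p, double *ap,
             int max_iter, double tol_sq)
{
    assert(a != 0 && b != 0 && x != 0 && r != 0 && p != 0 && ap != 0);
    int n = a->n;

    /* r = b - A x, p = r */
    csr_spmv(a, x, ap);
    for (int i = 0; i < n; i++) {
        r[i] = b[i] - ap[i];
        p[i] = r[i];
    }

    double rr = vec_dot(n, r, r);
    int iter = 0;
    while (iter < max_iter && rr > tol_sq) {
        csr_spmv(a, p, ap);                       /* sparse: ap = A p */
        double alpha = rr / vec_dot(n, p, ap);    /* dense: step length */
        vec_axpy(n, alpha, p, x);                 /* dense: x += alpha p */
        vec_axpy(n, -alpha, ap, r);               /* dense: r -= alpha A p */
        double rr_new = vec_dot(n, r, r);
        double beta = rr_new / rr;
        for (int i = 0; i < n; i++)               /* p = r + beta p */
            p[i] = r[i] + beta * p[i];
        rr = rr_new;
        iter++;
    }
    return iter;
}
```

The work arrays are passed in by the caller so the solver itself never allocates. On a GPU the same choice keeps all vectors resident in device memory across iterations.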

http://developer.nvidia.com/object/cuda_3_2_toolkit_rc.html

yyzjp · posted 2010-9-18 15:36

Support for debugging GPUs with more than 4GB device memory
Where are the graphics cards with more than 4GB of memory?

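gzpony · posted 2010-9-18 15:41

Support for debugging GPUs with more than 4GB device memory
Where are the graphics cards with more than 4GB of memory?
yyzjp posted 2010-9-18 15:36

In the professional cards.
I seem to remember seeing a model with 6GB of memory in a news item someone posted here a while back.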

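drhuangyt · posted 2010-9-18 15:41

So there are now libraries for sparse matrix solving, random number generation, fast Fourier transforms, and triangular-factorization matrix operations. I wonder how their performance compares with MKL.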

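越光宝盒 · posted 2010-9-18 18:17

Notice: the author has been banned or deleted; content automatically hidden

未来的水世界 · posted 2010-9-18 18:37

Notice: the author has been banned or deleted; content automatically hidden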

jocover · posted 2010-9-18 20:12

I noticed that 2D_MAX_HEIGHT is now 32768; it used to be 8192.

  CL_DEVICE_IMAGE <dim>
                  2D_MAX_WIDTH    4096
                  2D_MAX_HEIGHT    32768
                  3D_MAX_WIDTH    2048
                  3D_MAX_HEIGHT    2048
                  3D_MAX_DEPTH    2048

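yyzjp · posted 2010-9-18 23:42

Anyone who follows the news even a little should know that the Fermi-based Quadro FX6000 uses 6GB of GDDR5 memory.
未来的水世界 posted 2010-9-18 18:37

Sorry, I forgot about the professional cards.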
