Many 16-SP graphics cards carry only 256MB of video memory, which barely meets the minimum requirement for CUDA decoding. With CoreAVC's CUDA support enabled, selecting the overlay mixer renderer on such cards can even cause frame flips to stall because of insufficient memory.
Video playback involves more than decoding: there is also pre- and post-processing such as color-space conversion and resizing, and in CoreAVC these stages may be performed by the SPs.
http://cbaoth.dk/~cbaoth/nvcuvid.pdf
CUDA Video Decoder
The CUDA Video Decoder API gives developers access to the VP2 video processor on NVIDIA GPUs. This API supports the following video stream formats: MPEG(1/2) and H.264. This API enables developers to decode video streams on the GPU and process the decoded uncompressed surfaces within CUDA programs. The decoded surfaces can be transferred back to system memory using CUDA’s fast asynchronous read-backs, or the application can use CUDA’s 3D interoperability features to render the surfaces using a 3D API (OpenGL or DirectX).
Processing and Displaying Frames
The application's main loop retrieves images from the FrameQueue (copyDecodedFrameToTexture() in videoDecode.cpp) and renders the texture to the screen. The DirectX device is set up to block on the monitor's vsync, throttling rendering to 60Hz for a typical flat-screen display. To handle frame-rate conversion of 3:2 pulldown content, the frame is also rendered multiple times, according to the repeat information passed from the parser.
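The repeat-based frame-rate conversion can be sketched on the CPU. This is an assumption about how the parser's repeat information is used (modeled on MPEG-2's repeat_first_field flag), not the sample's actual code: on a 60Hz progressive display, each frame is shown for 2 vsyncs, plus one extra vsync when the stream flags a repeated first field, so 24fps film with a 3:2 cadence fills exactly 60 refreshes per second.

```cpp
#include <numeric>
#include <vector>

// Sketch (not the sample's exact code): vsyncs a frame occupies on a 60Hz
// progressive display, given the stream's repeat-first-field flag.
int display_count(int repeat_first_field) {
    return 2 + repeat_first_field;
}

// 24fps film carried with 3:2 pulldown alternates the repeat flag 1,0,1,0,...
// yielding the 3,2,3,2 vsync cadence (2.5 vsyncs per frame on average).
std::vector<int> pulldown_schedule(int num_frames) {
    std::vector<int> vsyncs;
    for (int i = 0; i < num_frames; ++i)
        vsyncs.push_back(display_count(i % 2 == 0 ? 1 : 0));
    return vsyncs;
}
```

For example, 4 film frames (4/24 s) occupy 3+2+3+2 = 10 vsyncs, which is exactly 10/60 s at the display's refresh rate.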
copyDecodedFrameToTexture() is the method where the CUDA decoder API is used to map a decoded frame (identified by its picture index) into CUDA device memory. Post-processing is done by passing the frame through cudaPostProcessFrame(), which returns a pointer to the decoded NV12 frame. A CUDA kernel then converts the NV12 surface to an RGBA surface, and the final RGBA surface is copied directly into a DirectX texture and drawn to the screen.
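The NV12-to-RGBA step can be illustrated with a CPU reference of the same conversion math. This is a sketch assuming BT.601 limited-range coefficients; the sample's actual CUDA kernel may use different coefficients and works on device memory. NV12 stores a full-resolution luma plane followed by an interleaved, half-resolution chroma (UV) plane, so each 2x2 block of pixels shares one UV pair.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Clamp a float into the valid 8-bit range after rounding.
static uint8_t clamp_u8(float v) {
    return static_cast<uint8_t>(std::min(255.0f, std::max(0.0f, std::round(v))));
}

// CPU reference for NV12 -> RGBA conversion (BT.601 limited-range assumed).
// y_plane: width*height bytes; uv_plane: interleaved U,V at half resolution.
std::vector<uint8_t> nv12_to_rgba(const uint8_t* y_plane, const uint8_t* uv_plane,
                                  int width, int height) {
    std::vector<uint8_t> rgba(static_cast<size_t>(width) * height * 4);
    for (int row = 0; row < height; ++row) {
        for (int col = 0; col < width; ++col) {
            float y = 1.164f * (y_plane[row * width + col] - 16);
            int uv = (row / 2) * width + (col / 2) * 2;  // one UV pair per 2x2 block
            float u = uv_plane[uv] - 128.0f;
            float v = uv_plane[uv + 1] - 128.0f;
            size_t o = (static_cast<size_t>(row) * width + col) * 4;
            rgba[o + 0] = clamp_u8(y + 1.596f * v);               // R
            rgba[o + 1] = clamp_u8(y - 0.391f * u - 0.813f * v);  // G
            rgba[o + 2] = clamp_u8(y + 2.018f * u);               // B
            rgba[o + 3] = 255;                                    // A (opaque)
        }
    }
    return rgba;
}
```

In the real pipeline this per-pixel arithmetic runs as a CUDA kernel, one thread per output pixel, writing into a surface that DirectX can texture from.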