|
多谢7L和14L。
另外我觉得Anand这一段多少能解释GF104相比GT200的性能增幅并不是一些XD想当然的应该是它们CUDA core数量之比。
“The upside to this is that on average GF104 should be more efficient per clock than GF100, which is quite a remarkable feat. The downside to this is that now NVIDIA has a greater degree of best and worst case scenarios, as requiring superscalar execution to utilize the 3rd CUDA core block means that it’s harder to use that 3rd block than the previous 2. The ability to extract ILP from a warp will result in GF104’s compute abilities performing like a 384 CUDA core part some of the time, and like a 256 CUDA core part at other times. It will be less consistent, but on average faster than a pure 256 CUDA core part would be.”
http://www.anandtech.com/show/38 ... -460-the-200-king/2 |
|