POPPUR爱换

 找回密码
 注册

QQ登录

只需一步,快速开始

手机号码,快捷登录

搜索
查看: 4161|回复: 31
打印 上一主题 下一主题

期待Penryn:一起来等解禁

[复制链接]
跳转到指定楼层
1#
发表于 2007-7-10 11:17 | 只看该作者 回帖奖励 |正序浏览 |阅读模式
据说Penryn在7.15号保密协议解禁,还有不到1周,大家尽情YY吧!
  w00t)
31#
 楼主| 发表于 2007-7-12 12:51 | 只看该作者
原帖由 OHMYGODKING 于 2007-7-12 07:41 发表
p35支持这个版本的cpu吗?


显然P35是支持的
回复 支持 反对

使用道具 举报

30#
发表于 2007-7-12 07:41 | 只看该作者
p35支持这个版本的cpu吗?
回复 支持 反对

使用道具 举报

29#
 楼主| 发表于 2007-7-12 00:09 | 只看该作者
原帖由 the_god_of_pig 于 2007-7-11 20:40 发表
k10比我预计的还差,虽说本来k10从架构上看就该提升不大,但是A社的宣传真是相当之有信心,让偶I饭也心虚了:p

不过现在看来2.6G的FX能不能干过q6600还是个问题:loveliness:


至少年底之前Q6600是杀全家的(_(
我的预言 w00t)
回复 支持 反对

使用道具 举报

28#
发表于 2007-7-11 20:40 | 只看该作者
k10比我预计的还差,虽说本来k10从架构上看就该提升不大,但是A社的宣传真是相当之有信心,让偶I饭也心虚了:p

不过现在看来2.6G的FX能不能干过q6600还是个问题:loveliness:
回复 支持 反对

使用道具 举报

27#
 楼主| 发表于 2007-7-11 19:51 | 只看该作者
原帖由 naze 于 2007-7-11 13:16 发表


说得真好.说出了我们客观饭的心声
最好桌面k10最低频率的卖个不到100美元..
薄利多销嘛.中国都是amd最大市场啦w00t) w00t) w00t)


最好最高型号也不要超过200美元
这样就和谐了
回复 支持 反对

使用道具 举报

26#
发表于 2007-7-11 13:16 | 只看该作者
原帖由 acqwer 于 2007-7-11 08:50 发表

现在AMD的价格还不够好吗,非要回到去年一个闪龙也要600块的时候?
作为中立的性价比论者,应该期待AMD性能被踩。


说得真好.说出了我们客观饭的心声
最好桌面k10最低频率的卖个不到100美元..
薄利多销嘛.中国都是amd最大市场啦w00t) w00t) w00t)
回复 支持 反对

使用道具 举报

25#
发表于 2007-7-11 10:04 | 只看该作者
现在的目标只是等7.22的Q66来替换正在用的E66了.....
回复 支持 反对

使用道具 举报

24#
发表于 2007-7-11 09:20 | 只看该作者
这贴一定要顶~~

强烈期待Penryn解禁。。。
这样才能看到那些Pro A惊慌失措的表情……
才能知道AMD真的快倒了……哈哈哈~

Penryn,揭开你那神秘的面纱;出个必杀技~干死AMD吧!!!w00t)


BTW.Intel万岁~
回复 支持 反对

使用道具 举报

23#
发表于 2007-7-11 08:50 | 只看该作者
原帖由 wl00560 于 2007-7-11 00:22 发表
不关心Intel了,我现在只关心AMD什么时候能出个强悍的U,不要再让Intel在那儿耀武扬威了……
没有竞争的岁月真是讨厌!

现在AMD的价格还不够好吗,非要回到去年一个闪龙也要600块的时候?
作为中立的性价比论者,应该期待AMD性能被踩。
回复 支持 反对

使用道具 举报

22#
发表于 2007-7-11 00:22 | 只看该作者
不关心Intel了,我现在只关心AMD什么时候能出个强悍的U,不要再让Intel在那儿耀武扬威了……
没有竞争的岁月真是讨厌!
回复 支持 反对

使用道具 举报

21#
发表于 2007-7-11 00:13 | 只看该作者
现在大家都等着看好戏了~
回复 支持 反对

使用道具 举报

20#
 楼主| 发表于 2007-7-11 00:05 | 只看该作者
原帖由 enmaai 于 2007-7-10 23:57 发表
w00t) 2407有望无?


2407跟Penryn没啥关系
拿今年出Penryn来赌2407,不是被驴踢了就是被电梯夹了
回复 支持 反对

使用道具 举报

19#
发表于 2007-7-10 23:57 | 只看该作者
w00t) 2407有望无?
回复 支持 反对

使用道具 举报

18#
发表于 2007-7-10 23:24 | 只看该作者
偶蛋疼了一下

本帖子中包含更多资源

您需要 登录 才可以下载或查看,没有帐号?注册

x
回复 支持 反对

使用道具 举报

17#
 楼主| 发表于 2007-7-10 23:16 | 只看该作者
Penryn's Enhanced Core Architecture

The quad core version of Penryn contains 820 million transistors (Kentsfield has 582 million) in two very small dies of 107mm2. That makes the new design 25 percent smaller than Intel's current 65nm Quad core (143 mm2).


The new Penryn CPU also has yet another addition to the x86 ISA: Intel Streaming SIMD Extensions 4 (SSE4) instructions. It has also been confirmed that Penryn will deliver higher IPC and higher clock speeds. Intel wouldn't say more than "more than 3 GHz", but considering that the FSB is bumped up to 1600 MHz, 3.2 GHz is likely. However, several Intel people confirmed that if necessary ("depending on what the competition does"), the 45nm CPUs can go quite a bit higher (3.6 GHz is probably a safe estimate, considering how far current Core 2 CPUs are able to overclock).


With regards to power, Intel will be introducing what it is calling "Deep Power Down Technology", or a new lower power state, C6. The new C6 state reduces core voltage down to the absolute minimum for the given process technology, shuts down the core clock as well as turns off all of the caches. It is the absolute lowest power state that can be attained and will be introduced on Mobile Penryn family processors.

Penryn family processors are supposed to be socket-compatible, meaning that on the desktop we will see them introduced as LGA-775 CPUs. We'd expect that Intel's new lineup of chipsets will be required, but we are not sure if the new chipsets will support the 1600MHz FSB out of the box or if a refresh will be required.


Penryn-based processors also have a much better divider unit, roughly doubling the divider speed using a faster divide technique called Radix 16. Also, the shuffle engine has been improved. Intel's "Super Shuffle Engine" is a 128-bit, single-pass shuffle unit that can perform full-width shuffles in a single cycle, improving performance for SSE2, SSE3 and SSE4 instructions that have shuffle-like operations such as pack, unpack and wider packed shifts.


The last improvement is the "Split Load Cache Enhancement" which lowers the impact of data which is not aligned to cacheline boundaries. This seems to happen in some SSE intensive imaging applications.

The Quad core desktop and the quad core Xeon products will need 120W, 80W and 50W (LV) just like today. The dual core products will get a 40W/65W and 80W TDP.

Better Virtualization

Intel's current hardware support for virtualization in the current Core architecture is lackluster to say the least. To understand this you must understand what happens in a "pure" software-based virtualization solution such as VMware ESX 2.5.3 running on older Intel CPUs.

A technique called "ring deprivileging" is used as the guest OS cannot be allowed to run in the lowest ring 0 where it normally runs; the Virtual Machine Manager or hypervisor now runs there. That means that every time the guest application asks the help of the guest OS, which needs to run instructions which are only available in ring 0, the VMM must intercept that "SYSENTER" and emulate the normal execution. This is quite costly in performance terms.

Hardware assisted virtualization does not have that problem: both the OS and the VMM have their own ring 0. Despite this, Intel's HW assisted solutions didn't give any speed boost. It has not been discussed in detail, but Penryn speeds up virtual machine transition (entry/exit) times by 25% to 75%, and this requires no virtual machine software changes. This might be similar to AMD's nested page technology, although we don't have any clear details at present.

Last but not least, the dual core Penryn processors get a 6 MB shared cache and the quad versions get 12 MB cache. Both new designs will also come with a "higher degree of associativity". Considering the current designs are 16-way set associative, most likely the newer chips will feature a 24-way set associative L2 cache.

Intel EDAT: the End of the Multi-core Clock Speed Disadvantage?

Intel also talked about its "Enhanced Dynamic Acceleration Technology" which is effectively integrated overclocking based on load. If you are running a single threaded application (or a multi-threaded application that's predominantly using a single thread), Intel's EDAT can power down the second core and increase the frequency of the working core to maintain the same thermal envelope at all times.

Intel's EDAT could spell the end of the clock speed differential between single and multi-core processors. With all cores running workloads, the multi-core system would be clocked lower, but when some cores are idle the chip could potentially run at the same speed as a single core solution would. Single core designs have pretty much disappeared from roadmaps already, but considering there are still applications that are single threaded in nature and benefit more from clock speed improvements, future processors will offer both options in a single package.

Performance

Intel hasn't revealed too much about the performance of Penryn but Pat did leave us with a few comments. We don't know anything more about the test conditions than what we are presenting, and we didn't do the measurements ourselves, so take it for what it's worth.

Comparing a 3.2GHz Penryn (1.6GHz FSB) to a 3.0GHz Conroe (1.33GHz FSB), Intel has measured more than 20% increase in gaming performance (with no code changes). For video encoding applications, if SSE4 is utilized, the same Penryn vs. Conroe comparison can offer more than a 40% increase in performance.

Finally, Intel mentioned that in the server space, the fastest quad core Penryn available (>3GHz) vs. a 2.67GHz quad core Xeon resulted in a greater than 45% increase in performance in "bandwidth and FP intensive applications". It's incredibly vague (and oddly similar to AMD's claims of Barcelona vs. Xeon performance), but Pat mentioned that STREAM and certain benchmarks in SpecFP could be considered to be "bandwidth and FP intensive".

Again, we are just reporting what Intel told us. It will be a while before we can actually verify any of these claims or put them in the right context. Given the various enhancements that we've reported on, however, it's only reasonable to expect Penryn to be faster than Conroe, clock-for-clock. Whether that's 10% faster, 20% faster, or something else will be made clear in the future.
回复 支持 反对

使用道具 举报

16#
 楼主| 发表于 2007-7-10 23:14 | 只看该作者
Next up are Intel's Penryn benchmark results revealed at IDF Beijing. The system configuration is a little different, as both Penryn systems run at 3.33GHz and the systems are running Windows Vista Ultimate 32-bit. The exact config is listed below:
Test System Configuration Wolfdale 3.33GHz Yorkfield 3.33GHz Core 2 Extreme QX6800 (2.93GHz)
CPU
Pre-production dual core Penryn 3.33GHz/1333MHz FSB 6MB L2
Pre-production quad core Penryn 3.33GHz/1333MHz 12MB L2
Core 2 Extreme QX6800 quad core 2.93GHz/1066MHz 8MB L2
Motherboard
Pre-production BadAxe2 975X
Pre-production BadAxe2 975X
BadAxe2 975X
BIOS
Pre-production BIOS
Pre-production BIOS
Pre-production BIOS
Chipset Driver
8.1.1.1010
8.1.1.1010
8.1.1.1010
Video Card
GeForce 8800 GTX

Video Driver
NVIDIA 100.65
Memory
2 x 1GB DDR2-800 5-5-5-15

Hard Drive
Seagate 7200.10 320GB

And now the results:
BenchmarkWolfdale 3.33GHz Yorkfield 3.33GHz Core 2 Extreme QX6800 (2.93GHz)
3DMark '06 V1.1.0 Pro CPU (score) :
3061
4957
4070
3DMark '06 V1.1.0 Pro Overall (score) :
11015
11963
11123
Mainconcept H.264 Encoder (seconds) :
119
73
89
Cinebench R9.5 (CPU test)
1134
1935
1549
Cinebench R10 Beta (CPU test)
7045
13068
10416
HL2 Lost Coast Build 2707 (fps) :
210
210
153
DivX 6.6 Alpha w/ VirtualDub 1.7.1 (seconds)
22
18
38

For easier comparison we took the two quad-core chips (Yorkfield vs. Kentsfield) and looked at performance scaling between the two:
BenchmarkYorkfield Performance Advantage
3DMark '06 V1.1.0 Pro CPU (score) :
21.8%
3DMark '06 V1.1.0 Pro Overall (score) :
7.6%
Mainconcept H.264 Encoder (seconds) :
18.0%
Cinebench R9.5 (CPU test)
24.9%
Cinebench R10 Beta (CPU test)
25.5%
HL2 Lost Coast Build 2707 (fps) :
37.3%
DivX 6.6 Alpha w/ VirtualDub 1.7.1 (seconds)
111%

The Yorkfield system runs at a 13.6% higher clock speed than the Kentsfield system giving it an inherent advantage, but that's clearly not all that's making it faster. Half-Life 2 went up an expected 21.8% (we're assuming that Intel ran these numbers at 1024 x 768), and Cinebench saw a 25% increase in performance.
The DivX 6.6 test is particularly strong for Intel because it is using an early alpha version of DivX with support for SSE4. With SSE4 support, the quad-core Yorkfield processor ends up being more than 50% faster than Kentsfield, which bodes very well for Penryn if applications like DivX can bring SSE4 support in time for launch.
Final WordsObviously we'll reserve final judgments on Penryn for our official review of the CPU, but these initial results look very promising. We would expect to see clock for clock Penryn vs. Conroe improvements to be in the 5 - 10% range at minimum depending on the application. Factor in higher clock speeds and you can expect our CPU performance charts to shift up by about 20% by the end of this year.
Intel has shown its cards, now it's time for AMD to respond with those long overdue Barcelona tests...
回复 支持 反对

使用道具 举报

15#
 楼主| 发表于 2007-7-10 23:13 | 只看该作者
You knew it had to be coming. A year ago Intel previewed its first Core 2 processors ahead of their release, and with Penryn due out before the end of the year the boys in blue are back again.
Penryn is still pretty early, although Intel was able to reach over 3GHz on all of the samples we tested. Not surprisingly, the number of benchmarks we were able to run was quite limited. Intel also provided us with a handful of its own test results demonstrated at IDF Beijing which we have reproduced here as well.


Penryn in action
As a recap, Penryn is the 45nm micro-architectural update to Intel's current Core 2 processors. The slide below shows most of the improvements to Penryn:
A faster divider and super shuffle engine both improve IPC in very specific applications. As we mentioned in our IDF day 1 coverage, faster FSB speeds appear to be reserved for Penryn based Xeon processors at this point as desktop Penryn cores will use a 1333MHz FSB. Penryn takes the total amount of L2 cache up to 6MB per two cores, giving the quad core Penryn chips a total of 12MB of on-die L2 cache. Penryn also has improved power management technologies, but only for mobile Penryn chips.

Penryn up and running
First off we'll start with the results we ran ourselves under Intel's supervision. Intel set up three identical systems, one based on a Core 2 Extreme X6800 (dual core, 2.93GHz/1066MHz FSB), one based on a Wolfdale processor (Penryn, dual core, 3.20GHz/1066MHz FSB) and one based on Yorkfield (Penryn, quad core, 3.33GHz/1333MHz FSB).

The modified BadAxe 2 board; can you spot the mod?

Can't find it? It's under that blue heatsink
The processors were plugged into a modified Intel BadAxe2 motherboard, with the modification being necessary to support Penryn. Each system had 2GB of DDR2-800 memory and a GeForce 8800 GTX. All of our tests were run under Windows XP.

Wolfdale - 2 cores

Yorkfield - 4 cores
The Cinebench 9.5 test is the same one we run in our normal CPU reviews, with the dual core Penryn (Wolfdale) scoring about 20% faster than the dual core Conroe. Keep in mind that the Wolfdale core is running at a 9.2% higher clock speed, but even if Cinebench scaled perfectly with clock speed there's still at least a 10% increase in performance due to the micro-architectural improvements found in Penryn.
Next up was Intel's Half Life 2 Lost Coast benchmark which was run with the following settings:
Setting
Model Detail
High
Texture Detail
High
Shader Detail
High
Water Detail
Reflect World
Shadow Detail
High
Texture Filtering
Trilinear
HDR
Full

Half Life 2 performance at a very CPU bound 1024 x 768 has Wolfdale just under 19% faster than Conroe. Once again, clock speed does play a part here but we'd expect at least a 10% increase in performance just due to the advancements in Penryn.
At 1600 x 1200 the performance difference shrinks to 10.6%, still quite respectable:
回复 支持 反对

使用道具 举报

14#
 楼主| 发表于 2007-7-10 23:11 | 只看该作者
原帖由 boris_lee 于 2007-7-10 13:14 发表
2F文章的翻译质量实在是.......


还是转个E文的
不过恐有人看起来费尽~ 唉
回复 支持 反对

使用道具 举报

13#
发表于 2007-7-10 18:50 | 只看该作者
性能如此强悍?还有几天揭底!
人生就在等待中度过了
回复 支持 反对

使用道具 举报

您需要登录后才可以回帖 登录 | 注册

本版积分规则

广告投放或合作|网站地图|处罚通告|

GMT+8, 2024-11-5 18:51

Powered by Discuz! X3.4

© 2001-2017 POPPUR.

快速回复 返回顶部 返回列表