POPPUR爱换

 找回密码
 注册

QQ登录

只需一步,快速开始

手机号码,快捷登录

搜索
查看: 1571|回复: 2
打印 上一主题 下一主题

Par4All 1.0 发布,Tesla C2050 运行 SPEC CPU2006 410.bwaves 为 双 X5670 4.5 倍

[复制链接]
跳转到指定楼层
1#
发表于 2010-8-5 13:29 | 只看该作者 回帖奖励 |倒序浏览 |阅读模式
Hyantes
Description
Hyantes is a library to compute neighbourhood population potential with scale control. It is developed by the Mescal team from the Laboratoire Informatique de Grenoble (France), as a part of Hypercarte project. The
Hypercarte project aims to develop new methods for the cartographic representation of human distributions (population density, population increase, etc.) with various smoothing functions and opportunities for time-scale animations of maps. Hyantes provides one of the smoothing methods related to multiscalar neighbourhood density estimation. It is a C library that takes sets of geographic data as inputs and computes a smoothed representation of this data taking account of neighbourhood's influence.

For more information: http://hyantes.gforge.inria.fr
Results
We measure the wall-clock time that includes startup time, data load time and output write time, that is the real time understood by users. By measuring kernel time only, speed-up would be better but less representative of the real application (Amdahl...).

On one of our WildNode with 2 Intel Xeon X5670 @ 2.93GHz (12 cores) and a Tesla C2050 (Fermi), Linux/Ubuntu 10.04, gcc 4.4.3, CUDA 3.1, we measure in production:

Sequential execution time on CPU: 30.355s
OpenMP parallel execution time on CPUs: 3.859s, speed-up: 7.87
CUDA parallel execution time on GPU: 0.441s, speed-up: 68.8
To test it by yourself on the main computational part, go to the examples/P4A_accel/Hyantes directory of par4all or look in the git here.
SPEC CPU2006 410.bwaves
bwaves is a computational fluid dynamics Fortran program that simulates blast waves in three dimensional transonic transient laminar viscous flow.

More information on the program itself on http://www.spec.org/cpu2006/Docs/410.bwaves.html

On a HPC Project WildNode, we get a speed-up of 4.5 with a 2 Intel Xeon X5670 @ 2.93GHz (12 cores).

Matrix multiplication
The classical **o World in Fortran :-) can be found in examples/F77_matmul_OpenMP directory of par4all or look in the git here so you can try by yourself.

On a HPC Project WildNode, we get a speed-up of 12.1 (thanks to cache effects) with a 2 Intel Xeon X5670 @ 2.93GHz (12 cores).

http://www.par4all.org/documentation/benchmarks
头像被屏蔽
2#
发表于 2010-11-5 02:45 | 只看该作者
提示: 作者被禁止或删除 内容自动屏蔽
回复 支持 反对

使用道具 举报

3#
发表于 2010-11-5 04:12 | 只看该作者

得到的点点滴滴
回复 支持 反对

使用道具 举报

您需要登录后才可以回帖 登录 | 注册

本版积分规则

广告投放或合作|网站地图|处罚通告|

GMT+8, 2025-6-29 12:28

Powered by Discuz! X3.4

© 2001-2017 POPPUR.

快速回复 返回顶部 返回列表