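On Saturday morning I went to a sports day for the overseas Chinese community in the UK, made quite a few good friends, and heard about NVIDIA's CUDA for the first time: you can do computation on the GPU! Any NVIDIA chip from the 8 series onward will work, so my F8Sv is fine! Supposedly the GPU can greatly cut the time MATLAB spends on matrix operations, which got me pretty excited, so of course I dove in to try it out!
But it was easier said than done. If I had just followed the instructions on the NVIDIA website, I think I still wouldn't have it working. In the end I only got it sorted by asking someone more experienced for help. So I'm writing the steps down as the first post on my blog!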
First, my laptop:
ASUS F8Sv
OS: Vista 32-bit
MATLAB R2008a 
Next, check whether your graphics card supports CUDA. The list of supported products is here:
http://www.nvidia.com/object/cuda_learn_products.html
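If yours is on it, congratulations!
Before installing CUDA, install a third-party graphics driver first. Yes, you read that right: with NVIDIA's own driver you can't install the CUDA driver. Pretty lame, right?
An expert found the latest driver for my hardware. The links are below: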
http://www.laptopvideo2go.com/nvidia/175series/17595_vista.exe
http://www.laptopvideo2go.com/infs/175series/17741_vista/nv_disp.inf 
If your graphics card is different from mine, find the right driver yourself at
http://www.laptopvideo2go.com/
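Once the EXE has finished self-extracting, drop the INF file into the extracted folder to replace the one that is there; only then will the installation go through. Remember to uninstall the existing display driver before installing.
Next, go to the NVIDIA website and find CUDA: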
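The INF swap described above can be sketched in Python. This is only an illustration under stated assumptions: the function name `replace_inf` and the folder layout are hypothetical, and the only detail taken from this post is the file name `nv_disp.inf`. It keeps a backup of the original INF so the swap can be undone.

```python
import shutil
import tempfile
from pathlib import Path


def replace_inf(extracted_dir: Path, new_inf: Path) -> Path:
    """Overwrite nv_disp.inf in the extracted driver folder, keeping a backup.

    extracted_dir: folder the driver EXE unpacked itself into (hypothetical path).
    new_inf: the modified nv_disp.inf downloaded separately.
    """
    target = extracted_dir / "nv_disp.inf"
    if target.exists():
        # Keep the original next to it as nv_disp.inf.bak.
        shutil.copy2(target, target.with_name(target.name + ".bak"))
    shutil.copy2(new_inf, target)
    return target


if __name__ == "__main__":
    # Demo on throwaway folders; point these at your real paths instead.
    with tempfile.TemporaryDirectory() as tmp:
        extracted = Path(tmp) / "extracted"
        extracted.mkdir()
        (extracted / "nv_disp.inf").write_text("original inf")
        downloaded = Path(tmp) / "nv_disp.inf"
        downloaded.write_text("modified inf")
        print("Replaced:", replace_inf(extracted, downloaded))
```

Doing the copy by hand in Explorer works just as well; the script only makes the "back up, then overwrite" order explicit.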
http://www.nvidia.com/object/cuda_home.html
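Go to the download section and grab the three things you need. Install the driver first, and drop the INF file from above into it as well, otherwise it won't install properly!
Then install the other two files, the toolkit and the SDK.
Right away you can check out NVIDIA's official demos of what the GPU can do!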
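But that's not enough: what I want is an API for MATLAB! After digging around I found the Jacket project, which provides a library for MATLAB. No need to write your own C code and wrap it in a MEX function, which is really nice. After downloading it, just add the Jacket library to the PATH and you're done!
Below are the results of running the BLAS examples. I won't explain much; see for yourself how powerful it is!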
BLAS Example 1: Square Matrix Multiplication
Computing the 2x2 benchmark...
On average GPU is 0.177807 times faster than the CPU.
Computing the 4x4 benchmark...
On average GPU is 0.143623 times faster than the CPU.
Computing the 8x8 benchmark...
On average GPU is 0.135356 times faster than the CPU.
Computing the 16x16 benchmark...
On average GPU is 0.182954 times faster than the CPU.
Computing the 32x32 benchmark...
On average GPU is 5.28891 times faster than the CPU.
Computing the 64x64 benchmark...
On average GPU is 1.67258 times faster than the CPU.
Computing the 128x128 benchmark...
On average GPU is 4.13088 times faster than the CPU.
Computing the 256x256 benchmark...
On average GPU is 17.4282 times faster than the CPU.
Computing the 512x512 benchmark...
On average GPU is 139.496 times faster than the CPU.
Computing the 1024x1024 benchmark...
On average GPU is 990.547 times faster than the CPU.
Plotting results...End of BLAS Example 1.
BLAS Example 2: Matrix-Vector Multiplication (Transformation)
Computing the 3x1024 benchmark...
On average GPU is 1.60066 times faster than the CPU.
Computing the 3x2048 benchmark...
On average GPU is 4.33732 times faster than the CPU.
Computing the 3x4096 benchmark...
On average GPU is 13.9893 times faster than the CPU.
Computing the 3x8192 benchmark...
On average GPU is 3.1869 times faster than the CPU.
Computing the 3x16384 benchmark...
On average GPU is 5.72559 times faster than the CPU.
Computing the 3x32768 benchmark...
On average GPU is 8.04638 times faster than the CPU. 
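To read the numbers above more easily, here is a small Python sketch that parses the Example 1 log and flags each size as faster or slower. It assumes that "N times faster" is the ratio of CPU time to GPU time, so a value below 1 means the GPU was actually slower; the log format is copied from the output above, while the names `PATTERN` and `parse_speedups` are made up for this illustration.

```python
import re

# Example 1 output, copied from the log above.
LOG = """\
Computing the 2x2 benchmark...
On average GPU is 0.177807 times faster than the CPU.
Computing the 4x4 benchmark...
On average GPU is 0.143623 times faster than the CPU.
Computing the 8x8 benchmark...
On average GPU is 0.135356 times faster than the CPU.
Computing the 16x16 benchmark...
On average GPU is 0.182954 times faster than the CPU.
Computing the 32x32 benchmark...
On average GPU is 5.28891 times faster than the CPU.
Computing the 64x64 benchmark...
On average GPU is 1.67258 times faster than the CPU.
Computing the 128x128 benchmark...
On average GPU is 4.13088 times faster than the CPU.
Computing the 256x256 benchmark...
On average GPU is 17.4282 times faster than the CPU.
Computing the 512x512 benchmark...
On average GPU is 139.496 times faster than the CPU.
Computing the 1024x1024 benchmark...
On average GPU is 990.547 times faster than the CPU.
"""

# One match per "Computing ... / On average ..." pair of lines.
PATTERN = re.compile(
    r"Computing the (\d+x\d+) benchmark\.\.\.\s*"
    r"On average GPU is ([\d.]+) times faster than the CPU\."
)


def parse_speedups(log):
    """Return a list of (size, ratio) tuples in the order they appear."""
    return [(size, float(ratio)) for size, ratio in PATTERN.findall(log)]


results = parse_speedups(LOG)
# Assumption: ratio = CPU time / GPU time, so ratio < 1 means the GPU lost.
slower = [size for size, ratio in results if ratio < 1.0]

for size, ratio in results:
    verdict = "GPU slower" if ratio < 1.0 else "GPU faster"
    print(f"{size:>9}: {ratio:10.3f}x  ({verdict})")
print("Sizes where the GPU was slower:", ", ".join(slower))
```

Read this way, the GPU loses on everything up to 16x16 and only pulls clearly ahead on the larger matrices, which is probably down to fixed overhead on the GPU side dominating tiny problems, though the log itself does not say why.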
More details: http://www.xasun.com/article/f6/624.html