Changes between Version 20 and Version 21 of GEMM_Performance_Cypress


Timestamp:
Aug 19, 2010 5:00:44 AM (14 years ago)
Author:
nakasato
Comment:

--

  • GEMM_Performance_Cypress

    v20 v21  
    11 11 Comment(20100812): With the latest Catalyst 10.7, the slow GPU-to-CPU transfer speed is gone; we now get roughly 6 GB/sec in both directions. However, this only represents the practical maximum transfer speed for the combination of Cypress and the X58 chipset. The actual transfer speed to and from main memory is rather slow, at most ~2.5 GB/sec with pinned memory allocation. This is the problem!
    12 12
       13 Update(20100820): We have implemented proper handling of alpha and beta. In the latest plot, we compare our results with ACML-GPU 1.1 and MAGMA BLAS 0.3 running on Fermi.
       14
    13 15 = Results =
       16 [[Image(DGEMM_ab.png)]]
       17
    14 18 [[Image(DGEMM.png)]]
    15 19
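
For readers unfamiliar with the alpha and beta mentioned in the update above: in the standard BLAS definition, GEMM computes C ← alpha·A·B + beta·C. The following is a minimal pure-Python reference sketch of that definition only. It is not the GPU kernel described on this page; the function name `gemm_reference` and the list-of-lists matrix layout are assumptions made for illustration. It also follows the BLAS convention that C is not read when beta is zero.

```python
def gemm_reference(alpha, a, b, beta, c):
    """Return alpha * (a @ b) + beta * c for matrices given as lists of rows.

    Hypothetical helper for illustration; not taken from the page's code.
    """
    m, k, n = len(a), len(b), len(b[0])
    result = []
    for i in range(m):
        row = []
        for j in range(n):
            # Inner product of row i of A with column j of B.
            acc = 0.0
            for p in range(k):
                acc += a[i][p] * b[p][j]
            if beta == 0.0:
                # BLAS convention: with beta == 0 the input C is never read,
                # so NaN or uninitialized values in C do not propagate.
                row.append(alpha * acc)
            else:
                row.append(alpha * acc + beta * c[i][j])
        result.append(row)
    return result


if __name__ == "__main__":
    a = [[1.0, 2.0], [3.0, 4.0]]
    b = [[5.0, 6.0], [7.0, 8.0]]
    c = [[1.0, 1.0], [1.0, 1.0]]
    print(gemm_reference(2.0, a, b, 0.5, c))
```

A reference like this is mainly useful for checking an optimized kernel on small matrices, including the alpha = 1, beta = 0 case that plain matrix-multiply benchmarks exercise.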