Changes between Version 11 and Version 12 of MatrixMultiply


Ignore:
Timestamp:
Jun 7, 2010 10:14:46 PM (14 years ago)
Author:
nakasato
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • MatrixMultiply

    v11 v12  
    11= Matrix Multiply on GPU = 
    22We have implemented single/double precision matrix multiply program for RV770/Cypress. In our implementation, we use two input streams. One is transposed input matrix A and other is input matrix B in normal format. Output matrix C is also not transposed. We adopted 8x8 block for single precision and 4x4 for double precision. Here is benchmark result for each case. Note only kernel execution time is measured. 
     3 
     4== Source code == 
     5Will be posted later. 
    36 
    47== Single precision ==