(clapack3.1.1 blas testing handtune)
(a recall of all the work that I've done)
For double precision (complex16), a problem remains: the element results are not accurate.
== a recall of all the work that I've done ==
(1) Complex version of SVD for ScaLAPACK
(2) RFP for LAPACK
(3) Performance comparison between PLAPACK and ScaLAPACK
(4) Rewrite of PDGETRF, optimized with look-ahead and other threading methods for better performance on multi-core clusters
(5) CELL learning
(6) Variations of LU, Cholesky and QR for LAPACK
(7) CLAPACK conversion
(8) ScaLAPACK auto-tuning
(9) Automatic DAG generation for LAPACK
(10) Using random butterfly transformation to remove pivoting
(11) ILP64 support for LAPACK and ScaLAPACK