Management of Deep Memory Hierarchies – Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Computations
References
Andersen, B., Gustavson, F., Waśniewski, J.: A recursive formulation of Cholesky factorization of a matrix in packed storage. ACM Trans. Math. Software 27, 214–244 (2001)
Anderson, E., Bai, Z., Bischof, C., Blackford, S., Demmel, J., Dongarra, J., Du Croz, J., Greenbaum, A., Hammarling, S., McKenney, A., Sorensen, D.: LAPACK Users’ Guide, 3rd edn. SIAM, Philadelphia (1999)
Elmroth, E., Gustavson, F.G.: Applying recursion to serial and parallel QR factorization leads to better performance. IBM J. Res. Develop. 44, 605–624 (2000)
Elmroth, E., Gustavson, F.G.: A faster and simpler recursive algorithm for the LAPACK routine DGELS. BIT 41, 936–949 (2001)
Elmroth, E., Gustavson, F.G.: High-performance library software for QR factorization. In: Sørevik, T., Manne, F., Moe, R., Gebremedhin, A.H. (eds.) PARA 2000. LNCS, vol. 1947, pp. 53–63. Springer, Heidelberg (2001)
Elmroth, E., Gustavson, F., Jonsson, I., Kågström, B.: Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library Software. SIAM Review 46(1), 3–45 (2004)
Granat, R., Jonsson, I., Kågström, B.: Combining Explicit and Recursive Blocking for Solving Triangular Sylvester-Type Matrix Equations on Distributed Memory Platforms. In: Danelutto, M., Vanneschi, M., Laforenza, D. (eds.) Euro-Par 2004. LNCS, vol. 3149, pp. 742–750. Springer, Heidelberg (2004)
Gustavson, F.G.: Recursion leads to automatic variable blocking for dense linear-algebra algorithms. IBM J. Res. Develop. 41, 737–755 (1997)
Gustavson, F.G., Henriksson, A., Jonsson, I., Kågström, B., Ling, P.: Recursive blocked data formats and BLAS’s for dense linear algebra algorithms. In: Kågström, B., et al. (eds.) PARA 1998. LNCS, vol. 1541, pp. 195–206. Springer, Heidelberg (1998)
Gustavson, F.G., Jonsson, I.: Minimal-storage high-performance Cholesky factorization via blocking and recursion. IBM J. Res. Develop. 44, 823–849 (2000)
IBM: Engineering and Scientific Subroutine Library, Guide and Reference, Ver. 3, Rel. 3 (2001)
Jonsson, I.: Analysis of Processor and Memory Utilization of Recursive Algorithms for Sylvester-Type Matrix Equations Using Performance Monitoring. Report UMINF-03.16, Dept. of Computing Science, Umeå University, Sweden (2003)
Jonsson, I., Kågström, B.: Recursive blocked algorithms for solving triangular systems—Part I: One-sided and coupled Sylvester-type matrix equations. ACM Trans. Math. Software 28, 392–415 (2002)
Jonsson, I., Kågström, B.: Recursive blocked algorithms for solving triangular systems—Part II: Two-sided and generalized Sylvester and Lyapunov equations. ACM Trans. Math. Software 28, 416–435 (2002)