Hartwig Anzt
- Research Group Leader
hartwig anzt ∂does-not-exist.kit edu
Publication list KIT-Open
Preconditioners for Batched Iterative Linear Solvers on GPUs
Aggarwal, I.; Nayak, P.; Kashi, A.; Anzt, H.
2022. Accelerating Science and Engineering Discoveries Through Integrated Research Infrastructure for Experiment, Big Data, Modeling and Simulation – 22nd Smoky Mountains Computational Sciences and Engineering Conference, SMC 2022, Virtual Event, August 23–25, 2022, Revised Selected Papers. Ed.: K. Doug, 38–53, Springer Nature Switzerland AG. doi:10.1007/978-3-031-23606-8_3
Implementing Asynchronous Jacobi Iteration on GPUs
Tsai, Y.-H. M.; Nayak, P.; Chow, E.; Anzt, H.
2022. 2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH). IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH 2022) Dallas, TX, USA, 13.11.2022–18.11.2022, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ScalAH56622.2022.00006
Prediction of Optimal Solvers for Sparse Linear Systems Using Deep Learning
Funk, Y.; Götz, M.; Anzt, H.
2022. Proceedings of the 2022 SIAM Conference on Parallel Processing for Scientific Computing (PP). Ed.: X. Li, 14–24, Society for Industrial and Applied Mathematics (SIAM). doi:10.1137/1.9781611977141.2
Providing performance portable numerics for Intel GPUs
Tsai, Y.-H. M.; Cojean, T.; Anzt, H.
2022. Concurrency and Computation: Practice and Experience, Art.-Nr.: e7400. doi:10.1002/cpe.7400
Compressed basis GMRES on high-performance graphics processing units
Aliaga, J. I.; Anzt, H.; Grützmacher, T.; Quintana-Ortí, E. S.; Tomás, A. E.
2022. The International Journal of High Performance Computing Applications. doi:10.1177/10943420221115140
Porting Sparse Linear Algebra to Intel GPUs
Tsai, Y. M.; Cojean, T.; Anzt, H.
2022. Euro-Par 2021: Parallel Processing Workshops – Euro-Par 2021 International Workshops, Lisbon, Portugal, August 30-31, 2021, Revised Selected Papers. Ed.: R. Chaves, 57–68, Springer International Publishing. doi:10.1007/978-3-031-06156-1_5
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing
Anzt, H.; Cojean, T.; Flegar, G.; Göbel, F.; Grützmacher, T.; Nayak, P.; Ribizel, T.; Tsai, Y. M.; Quintana-Ortí, E. S.
2022. ACM Transactions on Mathematical Software, 48 (1), Art.-Nr.: 2. doi:10.1145/3480935
Ginkgo—A math library designed for platform portability
Cojean, T.; Tsai, Y.-H. M.; Anzt, H.
2022. Parallel Computing, 111, Art.-Nr.: 102902. doi:10.1016/j.parco.2022.102902
Batched Sparse Iterative Solvers for Computational Chemistry Simulations on GPUs
Aggarwal, I.; Kashi, A.; Nayak, P.; Balos, C. J.; Woodward, C. S.; Anzt, H.
2021. Proceedings of ScalA 2021: 12th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems: Held in conjunction with SC21: The International Conference for High Performance Computing, Networking, Storage and Analysis ; St. Louis, Missouri, USA, November 14-19, 2021, 35–43, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ScalA54577.2021.00010
A Guide for Publishing, Using, and Licensing Research Software in Germany
Struck, A.; Loewe, A.; Achhammer, E.; Rack, F.; Bach, F.; Löffler, F.; Seemann, G.; Anzt, H.; Funk, M.; Unger, S.; Druskat, S.; Friedl, S.
2020. Zenodo. doi:10.5281/zenodo.4327148
Using Ginkgo’s memory accessor for improving the accuracy of memory-bound low precision BLAS
Grützmacher, T.; Anzt, H.; Quintana-Ortí, E. S.
2023. Software - Practice and Experience, 53 (1), 81–98. doi:10.1002/spe.3041
Mixed Precision Incomplete and Factorized Sparse Approximate Inverse Preconditioning on GPUs
Göbel, F.; Grützmacher, T.; Ribizel, T.; Anzt, H.
2021. Euro-Par 2021: Parallel Processing: 27th International Conference on Parallel and Distributed Computing, Lisbon, Portugal, September 1–3, 2021, Proceedings. Ed.: L. Sousa, 550–564, Springer Verlag. doi:10.1007/978-3-030-85665-6_34
A Collaborative Peer Review Process for Grading Coding Assignments
Nayak, P.; Göbel, F.; Anzt, H.
2021. Computational Science – ICCS 2021: 21st International Conference, Krakow, Poland, June 16–18, 2021, Proceedings, Part VI. Ed.: M. Paszynski, 654–660, Springer Verlag. doi:10.1007/978-3-030-77980-1_49
Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software
Flegar, G.; Anzt, H.; Cojean, T.; Quintana-Ortí, E. S.
2021. ACM transactions on mathematical software, 47 (2), 1–28. doi:10.1145/3441850
An environment for sustainable research software in Germany and beyond: current state, open challenges, and call for action
Anzt, H.; Bach, F.; Druskat, S.; Löffler, F.; Loewe, A.; Renard, B. Y.; Seemann, G.; Struck, A.; Achhammer, E.; Aggarwal, P.; Appel, F.; Bader, M.; Brusch, L.; Busse, C.; Chourdakis, G.; Dabrowski, P. W.; Ebert, P.; Flemisch, B.; Friedl, S.; Fritzsch, B.; Funk, M. D.; Gast, V.; Goth, F.; Grad, J.-N.; Hegewald, J.; Hermann, S.; Hohmann, F.; Janosch, S.; Kutra, D.; Linxweiler, J.; Muth, T.; Peters-Kottig, W.; Rack, F.; Raters, F. H. C.; Rave, S.; Reina, G.; Reißig, M.; Ropinski, T.; Schaarschmidt, J.; Seibold, H.; Thiele, J. P.; Uekermann, B.; Unger, S.; Weeber, R.
2021. F1000Research, 9, 295. doi:10.12688/f1000research.23224.2
Preparing Ginkgo for AMD GPUs – A Testimonial on Porting CUDA Code to HIP
Tsai, Y. M.; Cojean, T.; Ribizel, T.; Anzt, H.
2020. doi:10.5445/IR/1000131542
Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs
Aliaga, J. I.; Anzt, H.; Quintana-Ortí, E. S.; Tomás, A. E.; Tsai, Y. M.
2021. Euro-Par 2020: Parallel Processing Workshops: Euro-Par 2020 International Workshops, Warsaw, Poland, August 24–25, 2020, Revised Selected Papers. Ed.: B. Balis, 83–95, Springer Verlag. doi:10.1007/978-3-030-71593-9_7
Preparing Ginkgo for AMD GPUs – A Testimonial on Porting CUDA Code to HIP
Tsai, Y. M.; Cojean, T.; Ribizel, T.; Anzt, H.
2021. Euro-Par 2020: Parallel Processing Workshops: Euro-Par 2020 International Workshops, Warsaw, Poland, August 24–25, 2020, Revised Selected Papers. Ed.: B. Balis, 109–121, Springer Verlag. doi:10.1007/978-3-030-71593-9_9
Crediting pull requests to open source research software as an academic contribution
Anzt, H.; Kuehn, E.; Flegar, G.
2021. Journal of computational science, 49, Art.-Nr.: 101278. doi:10.1016/j.jocs.2020.101278
Sparse Linear Algebra on AMD and NVIDIA GPUs – The Race Is On
Tsai, Y. M.; Cojean, T.; Anzt, H.
2020. High Performance Computing – 35th International Conference, ISC High Performance 2020, Frankfurt/Main, Germany, June 22–25, 2020, Proceedings. Ed.: P. Sadayappan, 309–327, Springer International Publishing. doi:10.1007/978-3-030-50743-5_16
Evaluating asynchronous Schwarz solvers on GPUs
Nayak, P.; Cojean, T.; Anzt, H.
2021. The international journal of high performance computing applications, 35 (3), 226–236. doi:10.1177/1094342020946814
Multiprecision Block-Jacobi for iterative triangular solves
Goebel, F.; Anzt, H.; Cojean, T.; Flegar, G.; Quintana-Ortí, E. S.
2020. Lecture notes in computer science, 546–560, Springer. doi:10.1007/978-3-030-57675-2_34
An environment for sustainable research software in Germany and beyond: current state, open challenges, and call for action
Anzt, H.; Bach, F.; Druskat, S.; Löffler, F.; Loewe, A.; Renard, B. Y.; Seemann, G.; Struck, A.; Achhammer, E.; Aggarwal, P.; Appel, F.; Bader, M.; Brusch, L.; Busse, C.; Chourdakis, G.; Dabrowski, P. W.; Ebert, P.; Flemisch, B.; Friedl, S.; Fritzsch, B.; Funk, M. D.; Gast, V.; Goth, F.; Grad, J.-N.; Hermann, S.; Hohmann, F.; Janosch, S.; Kutra, D.; Linxweiler, J.; Muth, T.; Peters-Kottig, W.; Rack, F.; Raters, F. H. C.; Rave, S.; Reina, G.; Reißig, M.; Ropinski, T.; Schaarschmidt, J.; Seibold, H.; Thiele, J. P.; Uekermann, B.; Unger, S.; Weeber, R.
2020. F1000Research, 9, Article no: 295. doi:10.12688/f1000research.23224.1
Load-balancing Sparse Matrix Vector Product Kernels on GPUs
Anzt, H.; Cojean, T.; Yen-Chen, C.; Dongarra, J.; Flegar, G.; Nayak, P.; Tomov, S.; Tsai, Y. M.; Wang, W.
2020. ACM Transactions on Parallel Computing, 7 (1), Article: 2. doi:10.1145/3380930
Acceleration of PageRank with Customized Precision Based on Mantissa Segmentation
Grützmacher, T.; Cojean, T.; Flegar, G.; Anzt, H.; Quintana-Ortí, E. S.
2020. ACM Transactions on Parallel Computing, 7 (1), Article: 4. doi:10.1145/3380934
Parallel selection on GPUs
Ribizel, T.; Anzt, H.
2019. Parallel computing, 91, Article: 102588. doi:10.1016/j.parco.2019.102588
ParILUT - A parallel threshold ILU for GPUs
Anzt, H.; Ribizel, T.; Flegar, G.; Chow, E.; Dongarra, J.
2019. Proceedings 2019 IEEE 33rd International Parallel and Distributed Processing Symposium, IPDPS 2019: 20-24 May 2019, Rio de Janeiro, Brazil, 231–241, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/IPDPS.2019.00033
PAPI software-defined events for in-depth performance analysis
Jagode, H.; Danalis, A.; Anzt, H.; Dongarra, J.
2019. The international journal of high performance computing applications, 33 (6), 1113–1127. doi:10.1177/1094342019846287
Are we doing the right thing? - A critical analysis of the academic HPC community
Anzt, H.; Flegar, G.
2019. 2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 739–745, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/IPDPSW.2019.00122
Approximate and exact selection on GPUs
Ribizel, T.; Anzt, H.
2019. 2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 471–478, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/IPDPSW.2019.00088
A customized precision format based on mantissa segmentation for accelerating sparse linear algebra
Grützmacher, T.; Cojean, T.; Flegar, G.; Göbel, F.; Anzt, H.
2020. Concurrency and computation, 32 (15), Article: e5418. doi:10.1002/cpe.5418
Toward a modular precision ecosystem for high-performance computing
Anzt, H.; Flegar, G.; Grützmacher, T.; Quintana-Ortí, E. S.
2019. The international journal of high performance computing applications, 33 (6), 1069–1078. doi:10.1177/1094342019846547
Machine learning-aided numerical linear Algebra: Convolutional neural networks for the efficient preconditioner generation
Götz, M.; Anzt, H.
2019. Proceedings of ScalA 2018: 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 49–56, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ScalA.2018.00010
High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation
Grützmacher, T.; Anzt, H.; Scheidegger, F.; Quintana-Ortí, E. S.
2019. Proceedings of IA³ 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms, 61–68, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/IA3.2018.00015
Variable-Size Batched Condition Number Calculation on GPUs
Anzt, H.; Dongarra, J.; Flegar, G.; Grützmacher, T.
2019. 2018 30th International Symposium on Computer Architecture and High Performance Computing: SBAC-PAD 2018 ; Lyon, France, 24-27 September 2018 ; Proceedings, 132–139, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/CAHPC.2018.8645907
A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs
Anzt, H.; Dongarra, J.
2019. 2018 30th International Symposium on Computer Architecture and High Performance Computing: SBAC-PAD 2018 ; Lyon, France, 24-27 September 2018 ; Proceedings, 229–232, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/CAHPC.2018.8645946
Adaptive precision in block-Jacobi preconditioning for iterative sparse linear system solvers
Anzt, H.; Dongarra, J.; Flegar, G.; Higham, N. J.; Quintana-Ortí, E. S.
2019. Concurrency and computation, 31 (6), e4460. doi:10.1002/cpe.4460
Optimization and performance evaluation of the IDR iterative Krylov solver on GPUs
Anzt, H.; Kreutzer, M.; Ponce, E.; Peterson, G. D.; Wellein, G.; Dongarra, J.
2018. The international journal of high performance computing applications, 32 (2), 220–230. doi:10.1177/1094342016646844
Machine learning-aided numerical linear algebra: Convolutional neural network for the efficient preconditioner generation
Götz, M.; Anzt, H.
2018. ScalA18: 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, Dallas, TX, November 12, 2018
ParILUT - A New Parallel Threshold ILU Factorization
Anzt, H.; Chow, E.; Dongarra, J.
2018. SIAM journal on scientific computing, 40 (4), C503–C519. doi:10.1137/16M1079506
Using Jacobi iterations and blocking for solving sparse triangular systems in incomplete factorization preconditioning
Chow, E.; Anzt, H.; Scott, J.; Dongarra, J.
2018. Journal of parallel and distributed computing, 119, 219–230. doi:10.1016/j.jpdc.2018.04.017
Preconditioned Krylov solvers on GPUs
Anzt, H.; Gates, M.; Dongarra, J.; Kreutzer, M.; Wellein, G.; Köhler, M.
2017. Parallel computing, 68 (October), 32–44. doi:10.1016/j.parco.2017.05.006
Variable-size batched Gauss–Jordan elimination for block-Jacobi preconditioning on graphics processors
Anzt, H.; Dongarra, J.; Flegar, G.; Quintana-Ortí, E. S.
2018. Parallel computing, 81, 131–146. doi:10.1016/j.parco.2017.12.006
Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning
Anzt, H.; Dongarra, J.; Flegar, G.; Quintana-Ortí, E. S.; Tomás, A. E.
2017. Procedia computer science, 108, 1783–1792. doi:10.1016/j.procs.2017.05.186
Flexible batched sparse matrix-vector product on GPUs
Anzt, H.; Collins, G.; Dongarra, J.; Flegar, G.; Quintana-Ortí, E. S.
2017. the 8th Workshop, Denver, Colorado, 12–17 November 2017, 1–8, Association for Computing Machinery (ACM). doi:10.1145/3148226.3148230
Overcoming Load Imbalance for Irregular Sparse Matrices
Flegar, G.; Anzt, H.
2017. Proceedings of the 7th Workshop on Irregular Applications : Architectures and Algorithms, IA3’17, Denver, Colorado, USA, 12th - 17th November 2017, Art.Nr. 2, Association for Computing Machinery (ACM). doi:10.1145/3149704.3149767
Incomplete Sparse Approximate Inverses for Parallel Preconditioning
Anzt, H.; Huckle, T. K.; Bräckle, J.; Dongarra, J.
2018. Parallel computing, 71, 1–22. doi:10.1016/j.parco.2017.10.003
Title | Type | Semester | Place |
---|---|---|---|
Numerical Linear Algebra for Scientific High Performance Computing | Lecture (V) | Summer semester 2022 | |
Tutorial for 2400138 (Numerical Linear Algebra for Scientific High Performance Computing) | Exercise (Ü) | Winter semester 2021/22 | |
Numerical Linear Algebra for Scientific High Performance Computing | Lecture (V) | Winter semester 2021/22 | |