TU Home
This publication list has been generated automatically from the publication data of the Faculty of Informatics. Please invoke the page "Publications of the Faculty" directly for more complex searches and queries, or use the global search function of the Publication Database of the Vienna University of Technology!


Publication Database Home  

Publication list for members of
E191 - Institute of Computer Engineering
E191-04 Parallel Computing
as authors or essentially involved persons

197 records (2011 - 2021)


Books and Book Editorships


F. Desprez, P. Dutot, C. Kaklamanis, L. Marchal, K. Molitorisz, L. Ricci, V. Scarano, M. Vega-Rodriguez, A. Varbanescu, S. Hunold, S. Scott, S. Lankes, J. Weidendorfer (ed.):
"Euro-Par 2016: Parallel Processing Workshops - Euro-Par 2016 International Workshops, Revised Selected Papers, LNCS 10104";
Springer Nature Switzerland AG 2021, 2017, ISBN: 978-3-319-58942-8; 850 pages.

T. Hoefler, J. Träff (ed.):
"Proceedings of the 26th European MPI Users' Group Meeting, EuroMPI 2019";
ACM, New York, NY, USA, 2019, ISBN: 978-1-4503-7175-9; 134 pages.

D. Holmes, C. Collis, J. Träff, L. Smith (ed.):
"Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016";
ACM, 2016, ISBN: 978-1-4503-4234-6; 221 pages.

S. Hunold, A. Costan, D. Gimenez, A. Iosup, L. Ricci, G. Gomez Requena, V. Scarano, A. Varbanescu, S. Scott, S. Lankes, J. Weidendorfer, M. Alexander (ed.):
"Euro-Par 2015: Parallel Processing Workshops, Euro-Par 2015 International Workshops, Vienna, Austria, August 24-25, 2015, Revised Selected Papers, LNCS 9523";
Springer International Publishing, 2015, ISBN: 978-3-319-27307-5; 880 pages.

L. Lopes, J. Zilinskas, A. Costan, R. Cascella, G. Kecskemeti, E. Jeannot, M. Cannataro, L. Ricci, S. Benkner, S. Petit, V. Scarano, J. Gracia, S. Hunold, S. Scott, S. Lankes, C. Lengauer, J. Carretero, J. Breitbart, M. Alexander (ed.):
"Euro-Par 2014: Parallel Processing Workshops - Euro-Par 2014 International Workshops, Revised Selected Papers, Part I, LNCS 8805";
Springer, 2014, ISBN: 978-3-319-14324-8; 620 pages.

L. Lopes, J. Zilinskas, A. Costan, R. Cascella, G. Kecskemeti, E. Jeannot, M. Cannataro, L. Ricci, S. Benkner, S. Petit, V. Scarano, J. Gracia, S. Hunold, S. Scott, S. Lankes, C. Lengauer, J. Carretero, J. Breitbart, M. Alexander (ed.):
"Euro-Par 2014: Parallel Processing Workshops - Euro-Par 2014 International Workshops, Revised Selected Papers, Part II, LNCS 8806";
Springer, 2014, ISBN: 978-3-319-14312-5; 590 pages.

J. Träff, S. Benkner, J. Dongarra (ed.):
"Recent Advances in the Message Passing Interface Proceedings of the 19th European MPI Users' Group Meeting, EuroMPI 2012, LNCS 7490";
Springer, 2012, ISBN: 978-3-642-33517-4; 315 pages.

J. Träff, S. Hunold, F. Versaci (ed.):
"Euro-Par 2015: Parallel Processing, 21st International Conference on Parallel and Distributed Computing, Vienna, Austria, August 24-28, 2015, Proceedings, LNCS 9233";
Springer-Verlag Berlin Heidelberg, 2015, ISBN: 978-3-662-48095-3; 732 pages.


Publications in Scientific Journals


S. Benkner, S. Pllana, J. Träff, P. Tsigas, U. Dolinsky, C. Augonnet, B. Bachmayer, C. Kessler, D. Moloney, V. Osipov:
"PEPPHER: Efficient and Productive Usage of Hybrid Computing Systems";
IEEE Micro, Volume 31 (2011), Issue 5; 28 - 41.

R. Bertin, S. Hunold, A. Legrand, C. Touati:
"Fair scheduling of bag-of-tasks applications using distributed Lagrangian optimization";
Journal of Parallel and Distributed Computing, Available online 23 August 2013 (2013), 1 - 16.

R. Bleuse, S. Hunold, S. Kedad-Sidhoum, F. Monna, G. Mounie, D. Trystram:
"Scheduling Independent Moldable Tasks on Multi-Cores with GPUs";
IEEE Transactions on Parallel and Distributed Systems, Volume 28 (2017), Issue 9; 2689 - 2702.

A. Carpen-Amarie, S. Hunold, J. Träff:
"On expected and observed communication performance with MPI derived datatypes";
Parallel Computing, Volume 69 (2017), Issue November; 98 - 117.

M. Forsell, J. Roivainen, V. Leppänen, J. Träff:
"Supporting concurrent memory access in TCF processor architectures";
Microprocessors and Microsystems, Volume 63 (2018), 226 - 236.

S. Hunold:
"One Step towards Bridging the Gap between Theory and Practice in Moldable Task Scheduling with Precedence Constraints";
Concurrency and Computation: Practice and Experience, Volume 27 (2015), Issue 4; 1010 - 1026.

S. Ibrahim, T. Phan, A. Carpen-Amarie, H. Chihoub, D. Moise, G. Antoniu:
"Governing energy consumption in Hadoop through CPU frequency scaling: An analysis";
Future Generation Computer Systems, Volume 54 (2016), 219 - 232.

T. Kainrad, S. Hunold, T. Seidel, T. Langer:
"LigandScout Remote: A New User-Friendly Interface for HPC and Cloud Resources";
Journal of Chemical Information and Modeling, Volume 59 (2019), Issue 1; 31 - 37.

Q. Kang, J. Träff, R. Al-Bahrani, A. Agrawal, A. Choudhary, W. Liao:
"Scalable Algorithms for MPI Intergroup Allgather and Allgatherv";
Parallel Computing, Volume 85 (2019), 220 - 230.

E. Lusk, J. Träff:
"MPI Is 25 Years Old!";
HPCwire (invited), May 1 (2017).

C. Siebert, J. Träff:
"Perfectly Load-Balanced, Stable, Synchronization-Free Parallel Merge";
Parallel Processing Letters, Volume 24 (2014), Issue 1; 1 - 11.

J. Träff:
"Alternative, uniformly expressive and more scalable interfaces for collective communication in MPI";
Parallel Computing (invited), Volume 38 (2012), Issues 1-2; 26 - 36.

J. Träff:
"On Optimal Trees for Irregular Gather and Scatter Collectives";
IEEE Transactions on Parallel and Distributed Systems, Volume 30 (2019), Issue 9; 2060 - 2074.

J. Träff:
"Practical, distributed, low overhead algorithms for irregular gather and scatter collectives";
Parallel Computing, Volume 75 (2018), 100 - 117.

J. Träff:
"Viewpoint: (Mis)Managing Parallel Computing Research through EU Project Funding";
Communications of the ACM, Volume 59 (2016), Issue 12; 46 - 48.

J. Träff, S. Benkner:
"Preface: Selected Papers from EuroMPI 2012";
Computing, Volume 96, Special Issue on EuroMPI 2012 (2014), Issue 4; 259 - 261.

J. Träff, S. Hunold, G. Mercier, D. Holmes:
"MPI collective communication through a single set of interfaces: A case for orthogonality";
Parallel Computing, Volume 107 (2021), 102826:1 - 102826:11.

K. von Kirchbach, C. Schulz, J. Träff:
"Better Process Mapping and Sparse Quadratic Assignment";
ACM Journal of Experimental Algorithmics, Volume 25 (2020), Issue 1; 1.11:1 - 1.11:19.


Editorials in Scientific Journals


C. Lengauer, L. Bougé, J. Träff:
"Editorial: Special Issue: Euro-Par 2015";
Concurrency and Computation: Practice and Experience, Volume 28 (2016), Issue 12; 3445 - 3446.


Contributions to Proceedings


S. Hunold, A. Carpen-Amarie, J. Träff:
"The art of benchmarking MPI libraries";
in: "Austrian HPC Meeting 2016 - AHPC 2016", I. Reichl, C. Blaas-Schenner, J. Zabloudil (ed.); Vienna Scientific Cluster (VSC), 2016, 45.

C. Kessler, U. Dastgeer, M. Majeed, N. Furmento, S. Thibault, R. Namyst, S. Benkner, S. Pllana, J. Träff, M. Wimmer:
"Leveraging PEPPHER Technology for Performance Portable Supercomputing";
in: "Proceedings of the 2012 SC Companion: High Performance Computing, Networking, Storage and Analysis", IEEE Computer Society, 2012, ISBN: 978-0-7695-4956-9, 1395 - 1396.


Editorials in Proceedings


S. Hunold, A. Legrand, L. Nussbaum:
"Introduction to REPPAR Workshop";
in: "Proceedings of the IEEE 31st International Parallel and Distributed Processing Symposium (IPDPS 2017) Workshops", IEEE, 2017, ISBN: 978-1-5386-3408-0, 1559.

J. Träff, T. Hoefler:
"Foreword EuroMPI 2019";
in: "Proceedings of the 26th European MPI Users' Group Meeting, EuroMPI 2019", T. Hoefler, J. Träff (ed.); ACM, New York, NY, USA, 2019, ISBN: 978-1-4503-7175-9, 1:1 - 1:2.


Talks and Poster Presentations (with Proceedings-Entry)


A. Carpen-Amarie, S. Hunold, J. Träff:
"On the Expected and Observed Communication Performance with MPI Derived Datatypes";
Talk: 23rd European MPI Users' Group Meeting, EuroMPI 2016, Edinburgh, United Kingdom; 2016-09-25 - 2016-09-28; in: "Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016", D. Holmes, C. Collis, J. Träff, L. Smith (ed.); ACM, (2016), ISBN: 978-1-4503-4234-6; 108 - 120.

A. Carpen-Amarie, A. Rougier, F. Lübbe:
"Stepping Stones to Reproducible Research: A Study of Current Practices in Parallel Computing";
Talk: 1st International Workshop on Reproducibility in Parallel Computing (REPPAR 2014) in conjunction with Euro-Par 2014, Porto Portugal; 2014-08-25 - 2014-08-29; in: "Euro-Par 2014: Parallel Processing Workshops - Euro-Par 2014 International Workshops, Revised Selected Papers, Part I, LNCS 8805", L. Lopes, J. Zilinskas, A. Costan, R. Cascella, G. Kecskemeti, E. Jeannot, M. Cannataro, L. Ricci, S. Benkner, S. Petit, V. Scarano, J. Gracia, S. Hunold, S. Scott, S. Lankes, C. Lengauer, J. Carretero, J. Breitbart, M. Alexander (ed.); Springer International Publishing, LNCS 8805 (2014), ISBN: 978-3-319-14324-8; 499 - 510.

M. Faraj, A. van der Grinten, H. Meyerhenke, J. Träff, C. Schulz:
"High-Quality Hierarchical Process Mapping";
Talk: 18th International Symposium on Experimental Algorithms, SEA 2020 - Online Conference, Catania, Italy; 2020-06-16 - 2020-06-18; in: "18th International Symposium on Experimental Algorithms, SEA 2020", S. Faro, D. Cantone (ed.); Schloss Dagstuhl - Leibniz-Zentrum für Informatik, LIPIcs 160 (2020), ISBN: 978-3-95977-148-1; 4:1 - 4:15.

M. Forsell, J. Roivainen, V. Leppänen, J. Träff:
"Implementation of Multioperations in Thick Control Flow Processors";
Talk: 20th Workshop on Advances in Parallel and Distributed Computational Models (APDCM 2018) in conjunction with IPDPS 2018, Vancouver, British Columbia, Canada; 2018-05-21 - 2018-05-25; in: "Proceedings of the IEEE 32nd International Parallel and Distributed Processing Symposium Workshops (IPDPSW 2018)", IEEE, (2018), ISBN: 978-1-5386-5556-6; 744 - 752.

M. Forsell, J. Roivainen, V. Leppänen, J. Träff:
"Supporting Concurrent Memory Access in TCF-aware Processor Architectures";
Talk: IEEE Nordic Circuits and Systems Conference (NORCAS 2017), Linköping, Sweden; 2017-10-24 - 2017-10-25; in: "IEEE Nordic Circuits and Systems Conference (NORCAS 2017), Proceedings", J Nurmi, M. Vesterbacka, J. Wikner, A. Alvandpour, M. Nielsen-Lönn, I. Nielsen (ed.); IEEE, (2017), ISBN: 978-1-5386-2845-4; #1 - #6.

M. Forsell, J. Roivainen, J. Träff:
"Optimizing Memory Access in TCF Processors with Compute-Update Operations";
Talk: 22nd Workshop on Advances in Parallel and Distributed Computational Models (APDCM 2020) in conjunction with IPDPS 2020 - Online Conference, New Orleans, Louisiana, USA; 2020-05-18 - 2020-05-22; in: "Proceedings of the IEEE 34th International Parallel and Distributed Processing Symposium Workshops (IPDPSW 2020)", IEEE, (2020), ISBN: 978-1-7281-7457-0; 577 - 586.

R. Ganian, M. Kalany, S. Szeider, J. Träff:
"Polynomial-Time Construction of Optimal MPI Derived Datatype Trees";
Talk: IEEE 30th International Parallel and Distributed Processing Symposium (IPDPS 2016), Chicago, Illinois, USA; 2016-05-23 - 2016-05-27; in: "Proceedings of the IEEE 30th International Parallel and Distributed Processing Symposium (IPDPS 2016)", IEEE Computer Society, (2016), ISBN: 978-1-5090-2140-6; 638 - 647.

J. Gruber, J. Träff, M. Wimmer:
"Brief Announcement: Benchmarking Concurrent Priority Queues";
Talk: 28th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2016), Asilomar State Beach, California, USA; 2016-07-11 - 2016-07-13; in: "Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2016)", ACM, New York, NY, USA (2016), ISBN: 978-1-4503-4210-0; 361 - 362.

F. Heinrich, T. Cornebize, A. Degomme, A. Legrand, A. Carpen-Amarie, S. Hunold, A. Orgerie, M. Quinson:
"Predicting the Energy-Consumption of MPI Applications at Scale Using a Single Node";
Talk: IEEE International Conference on Cluster Computing (CLUSTER 2017), Honolulu, Hawaii, USA; 2017-09-05 - 2017-09-08; in: "Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER 2017)", IEEE, (2017), ISBN: 978-1-5386-2326-8; 92 - 102.

S. Hunold:
"Scheduling Moldable Tasks with Precedence Constraints and Arbitrary Speedup Functions on Multiprocessors";
Talk: Workshop on Scheduling for Parallel Computing (SPC 2013) in conjunction with the 10th International Conference on Parallel Processing and Applied Mathematics, PPAM 2013, Warsaw, Poland; 2013-09-08 - 2013-09-11; in: "Parallel Processing and Applied Mathematics, 10th International Conference, PPAM 2013, Revised Selected Papers, Part II", R. Wyrzykowski, J. Dongarra, K. Karczewski, J. Wasniewski (ed.); Springer, LNCS 8385 (2014), ISBN: 978-3-642-55194-9; 13 - 25.

S. Hunold, J. Ajanohoun, A. Carpen-Amarie:
"MicroBench Maker: Reproduce, Reuse, Improve";
Talk: 12th IEEE International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS 2021) in conjunction with SC 2021 - Hybrid Conference, St. Louis, Missouri, USA; 2021-11-14 - 2021-11-19; in: "Proceedings of the International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS 2021)", IEEE, (2021), ISBN: 978-1-6654-1119-6; 69 - 74.

S. Hunold, A. Bhatele, G. Bosilca, P. Knees:
"Predicting MPI Collective Communication Performance Using Machine Learning";
Talk: IEEE International Conference on Cluster Computing (IEEE Cluster 2020) - Online Conference, Kobe, Japan; 2020-09-14 - 2020-09-17; in: "Proceedings of the IEEE International Conference on Cluster Computing (IEEE Cluster 2020)", IEEE, (2020), ISBN: 978-1-7281-6678-0; 259 - 269.

S. Hunold, A. Carpen-Amarie:
"Algorithm Selection of MPI Collectives using Machine Learning Techniques";
Talk: 9th IEEE International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS 2018) in conjunction with SC 2018, Dallas, Texas, USA; 2018-11-11 - 2018-11-16; in: "Proceedings of the 2018 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS 2018)", IEEE, (2018), ISBN: 978-1-7281-0182-8; 45 - 50.

S. Hunold, A. Carpen-Amarie:
"Autotuning MPI Collectives using Performance Guidelines";
Talk: International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2018), Tokyo, Japan; 2018-01-28 - 2018-01-31; in: "Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2018)", ACM, (2018), ISBN: 978-1-4503-5372-4; 64 - 74.

S. Hunold, A. Carpen-Amarie:
"Hierarchical Clock Synchronization in MPI";
Talk: IEEE International Conference on Cluster Computing, CLUSTER 2018, Belfast, United Kingdom; 2018-09-10 - 2018-09-13; in: "Proceedings of the IEEE International Conference on Cluster Computing, CLUSTER 2018", IEEE, (2018), ISBN: 978-1-5386-8319-4; 325 - 336.

S. Hunold, A. Carpen-Amarie:
"On the Impact of Synchronizing Clocks and Processes on Benchmarking MPI Collectives";
Talk: 22nd European MPI Users' Group Meeting, EuroMPI 2015, Bordeaux, France; 2015-09-21 - 2015-09-23; in: "Proceedings of the 22nd European MPI Users' Group Meeting, EuroMPI 2015", J. Dongarra, A. Denis, B. Goglin, E. Jeannot, G. Mercier (ed.); ACM, (2015), ISBN: 978-1-4503-3795-3; Paper ID 8, 10 pages.

S. Hunold, A. Carpen-Amarie:
"On the Importance of Data Quality when Tuning MPI Libraries";
Talk: Austrian HPC Meeting 2019 - AHPC19, Grundlsee, Austria; 2019-02-25 - 2019-02-27; in: "Austrian HPC Meeting 2019 - AHPC19 (AHPC19 booklet of abstracts)", G. Haase (ed.); Institut für Mathematik und wissenschaftliches Rechnen der Universität Graz, (2019), 15.

S. Hunold, A. Carpen-Amarie, F. Lübbe, J. Träff:
"Automatic Verification of Self-consistent MPI Performance Guidelines";
Talk: 22nd International European Conference on Parallel and Distributed Computing (Euro-Par 2016), Grenoble, France; 2016-08-22 - 2016-08-26; in: "Euro-Par 2016: Parallel Processing, 22nd International Conference on Parallel and Distributed Computing, Proceedings", P. Dutot, D. Trystram (ed.); Springer International Publishing, LNCS 9833 (2016), ISBN: 978-3-319-43658-6; 433 - 446.

S. Hunold, A. Carpen-Amarie, J. Träff:
"Reproducible MPI Micro-Benchmarking Isnīt As Easy As You Think (Best Paper Award)";
Talk: 21st European MPI Users' Group Meeting, EuroMPI/ASIA 2014, Kyoto, Japan; 2014-09-09 - 2014-09-12; in: "Proceedings of the 21st European MPI Users' Group Meeting", J. Dongarra, Y. Ishikawa, A. Hori (ed.); ACM, New York, NY, USA (2014), ISBN: 978-1-4503-2875-3; 69 - 76.

S. Hunold, B. Przybylski:
"Teaching Complex Scheduling Algorithms";
Talk: 11th NSF/TCPP Workshop on Parallel and Distributed Computing Education (EduPar 2021) in conjunction with 35th IEEE IPDPS 2021 - Online Conference, Portland, Oregon, USA; 2021-05-17 - 2021-05-21; in: "IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2021", IEEE, (2021), ISBN: 978-1-6654-1192-9; 321 - 327.

S. Ibrahim, D. Moise, H. Chihoub, A. Carpen-Amarie, L. Bougé, G. Antoniu:
"Towards Efficient Power Management in MapReduce: Investigation of CPU-Frequencies Scaling on Power Efficiency in Hadoop";
Talk: 1st International Workshop on Adaptive Resource Management and Scheduling for Cloud Computing, ARMS-CC 2014 held in conjunction with ACM Symposium on Principles of Distributed Computing, PODC 2014, Paris, France; 2014-07-15; in: "Adaptive Resource Management and Scheduling for Cloud Computing, 1st International Workshop, ARMS-CC 2014 held in conjunction with ACM Symposium on Principles of Distributed Computing, PODC 2014, Revised Selected Papers, LNCS 8907", F. Pop, M. Potop-Butucaru (ed.); Springer International Publishing, LNCS 8907 (2014), ISBN: 978-3-319-13463-5; 147 - 164.

M. Kalany, J. Träff:
"Efficient, Optimal MPI Datatype Reconstruction for Vector and Index Types";
Talk: 22nd European MPI Users' Group Meeting, EuroMPI 2015, Bordeaux, France; 2015-09-21 - 2015-09-23; in: "Proceedings of the 22nd European MPI Users' Group Meeting, EuroMPI 2015", J. Dongarra, A. Denis, B. Goglin, E. Jeannot, G. Mercier (ed.); ACM, (2015), ISBN: 978-1-4503-3795-3; Paper ID 5, 10 pages.

Q. Kang, J. Träff, R. Al-Bahrani, A. Agrawal, A. Choudhary, W. Liao:
"Full-Duplex Inter-Group All-to-All Broadcast Algorithms with Optimal Bandwidth";
Talk: 25th European MPI Users' Group Meeting (EuroMPI 2018), Barcelona, Spain; 2018-09-23 - 2018-09-26; in: "Proceedings of the 25th European MPI Users' Group Meeting (EuroMPI 2018)", ACM, (2018), ISBN: 978-1-4503-6492-8; 1:1 - 1:10.

C. Kessler, U. Dastgeer, M. Majeed, N. Furmento, S. Thibault, R. Namyst, S. Benkner, S. Pllana, J. Träff, M. Wimmer:
"Leveraging PEPPHER Technology for Performance Portable Supercomputing";
Poster: Supercomputing 2012 Conference, Salt Lake City, Utah, USA; 2012-11-10 - 2012-11-16; in: "Proceedings of the 2012 SC Companion: High Performance Computing, Networking, Storage and Analysis", IEEE Computer Society, (2012), ISBN: 978-0-7695-4956-9; 1397.

C. Kessler, U. Dastgeer, S. Thibault, R. Namyst, A. Richards, U. Dolinsky, S. Benkner, J. Träff, S. Pllana:
"Programmability and Performance Portability Aspects of Heterogeneous Multi-/Manycore Systems";
Talk: Design, Automation & Test in Europe Conference & Exhibition (DATE 2012), Dresden, Germany; 2012-03-12 - 2012-03-16; in: "Design, Automation & Test in Europe Conference & Exhibition (DATE 2012) Proceedings", EDAA, (2012), ISBN: 978-3-9810801-8-6; 1403 - 1408.

M. Lehr, K. von Kirchbach:
"Improved Cartesian Topology Mapping in MPI";
Poster: Austrian HPC Meeting 2020 (AHPC 2020), Klosterneuburg, Austria; 2020-02-19 - 2020-02-21; in: "Austrian High-Performance-Computing Meeting (AHPC 2020)", A. Schlögl, J. Kiss, S. Elefante (ed.); IST Austria, (2020), ISBN: 978-3-99078-004-6; 27.

F. Lübbe:
"Micro-benchmarking MPI Neighborhood Collective Operations";
Talk: Euro-Par 2017 23rd International European Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain; 2017-08-28 - 2017-09-01; in: "Euro-Par 2017: Parallel Processing, 23rd International Conference on Parallel and Distributed Computing, Proceedings", F. Rivera, T. Pena, J. Cabaleiro (ed.); Springer, LNCS 10417 (2017), ISBN: 978-3-319-64202-4; 65 - 78.

S. Markidis, I. Peng, J. Träff, A. Rougier, V. Bartsch, R. Machado, M. Rahn, A. Hart, D. Holmes, M. Bull, E. Laure:
"The EPiGRAM Project: Preparing Parallel Programming Models for Exascale";
Talk: Workshop on Exascale Multi/Many Core Computing Systems (E-MuCoCoS 2016) in conjunction with ISC 2016, Frankfurt, Germany (invited); 2016-06-19 - 2016-06-23; in: "High Performance Computing, ISC High Performance, 2016 International Workshops, Revised Selected Papers", M. Taufer, B. Mohr, J. Kunkel (ed.); Springer International Publishing, LNCS 9945 (2016), ISBN: 978-3-319-46078-9; 56 - 68.

S. Mirsadeghi, J. Träff, P. Balaji, A. Afsahi:
"Exploiting Common Neighborhoods to Optimize MPI Neighborhood Collectives";
Talk: 24th IEEE International Conference on High Performance Computing (HiPC 2017), Jaipur, India; 2017-12-18 - 2017-12-21; in: "Proceedings of the 24th IEEE International Conference on High Performance Computing (HiPC 2017)", IEEE, (2017), ISBN: 978-1-5386-2294-0; 348 - 357.

C. Pachajoa, M. Levonyak, W. Gansterer, J. Träff:
"How to Make the Preconditioned Conjugate Gradient Method Resilient Against Multiple Node Failures";
Talk: 48th International Conference on Parallel Processing (ICPP 2019), Kyoto, Japan; 2019-08-05 - 2019-08-08; in: "Proceedings of the 48th International Conference on Parallel Processing (ICPP 2019)", ACM, (2019), ISBN: 978-1-4503-6295-5; 67:1 - 67:10.

C. Pachajoa, M. Levonyak, C. Pacher, J. Träff, W. Gansterer:
"Classical and pipelined preconditioned conjugate gradient methods with node-failure resilience";
Talk: Austrian HPC Meeting 2020 (AHPC 2020), Klosterneuburg, Austria; 2020-02-19 - 2020-02-21; in: "Austrian High-Performance-Computing Meeting (AHPC 2020)", A. Schlögl, J. Kiss, S. Elefante (ed.); IST Austria, (2020), ISBN: 978-3-99078-004-6; 13.

A. Papatriantafyllou:
"Energy Characterization and Optimization of Parallel Prefix-Sums Kernels";
Talk: 3rd Workshop on Runtime and Operating Systems for the Many-core Era (ROME 2015) in conjunction with Euro-Par 2015, Vienna, Austria; 2015-08-24 - 2015-08-28; in: "Euro-Par 2015: Parallel Processing Workshops, Euro-Par 2015 International Workshops, Vienna, Austria, August 24-25, 2015, Revised Selected Papers", S. Hunold, A. Costan, D. Gimenez, A. Iosup, L. Ricci, G. Gomez Requena, V. Scarano, A. Varbanescu, S. Scott, S. Lankes, J. Weidendorfer, M. Alexander (ed.); Springer International Publishing, LNCS 9523 (2015), ISBN: 978-3-319-27307-5; 685 - 696.

A. Papatriantafyllou, D. Sacharidis:
"High Performance Parallel Summed-Area Table Kernels for Multi-core and Many-core Systems";
Talk: 22nd International European Conference on Parallel and Distributed Computing (Euro-Par 2016), Grenoble, France; 2016-08-22 - 2016-08-26; in: "Euro-Par 2016: Parallel Processing, 22nd International Conference on Parallel and Distributed Computing, Proceedings", P. Dutot, D. Trystram (ed.); Springer International Publishing, LNCS 9833 (2016), ISBN: 978-3-319-43658-6; 306 - 318.

M. Pöter, J. Träff:
"Brief Announcement: Stamp-it, a more Thread-efficient, Concurrent Memory Reclamation Scheme in the C++ Memory Model";
Talk: 30th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2018), Vienna, Austria; 2018-07-16 - 2018-07-18; in: "Proceedings of the 30th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2018)", ACM, (2018), ISBN: 978-1-4503-5799-9; 355 - 358.

M. Pöter, J. Träff:
"Stamp-it, Amortized Constant-time Memory Reclamation in Comparison to five other Schemes";
Poster: 23rd Symposium on Principles and Practice of Parallel Programming (PPoPP 2018), Vienna, Austria; 2018-02-24 - 2018-02-28; in: "Proceedings of the 23rd Symposium on Principles and Practice of Parallel Programming (PPoPP 2018)", ACM, (2018), ISBN: 978-1-4503-4982-6; 413 - 414.

C. Schulz, J. Träff:
"Better Process Mapping and Sparse Quadratic Assignment";
Talk: 16th International Symposium on Experimental Algorithms, SEA 2017, London, United Kingdom; 2017-06-21 - 2017-06-23; in: "16th International Symposium on Experimental Algorithms, SEA 2017", C. Iliopoulos, S. Pissis, S. Puglisi, R. Raman (ed.); Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, LIPIcs - Vol. 75 (2017), ISBN: 978-3-95977-036-1; 4:1 - 4:15.

C. Siebert, J. Träff:
"Efficient MPI Implementation of a Parallel, Stable Merge Algorithm";
Talk: 19th European MPI Users' Group Meeting, EuroMPI 2012, Vienna, Austria; 2012-09-23 - 2012-09-26; in: "Recent Advances in the Message Passing Interface Proceedings of the 19th European MPI Users' Group Meeting, EuroMPI 2012", J. Träff, S. Benkner, J. Dongarra (ed.); Springer, LNCS 7490 (2012), ISBN: 978-3-642-33517-4; 204 - 213.

J. Träff:
"A Library for Advanced Datatype Programming";
Talk: 23rd European MPI Users' Group Meeting, EuroMPI 2016, Edinburgh, United Kingdom; 2016-09-25 - 2016-09-28; in: "Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016", D. Holmes, C. Collis, J. Träff, L. Smith (ed.); ACM, (2016), ISBN: 978-1-4503-4234-6; 98 - 107.

J. Träff:
"Exploiting Multi-lane Communication in MPI Collectives";
Talk: Austrian HPC Meeting 2020 (AHPC 2020), Klosterneuburg, Austria; 2020-02-19 - 2020-02-21; in: "Austrian High-Performance-Computing Meeting (AHPC 2020)", A. Schlögl, J. Kiss, S. Elefante (ed.); IST Austria, (2020), ISBN: 978-3-99078-004-6; 30.

J. Träff:
"High Performance Expectations for MPI";
Keynote Lecture: Austrian HPC Meeting 2017 - AHPC 2017, Grundlsee, Austria (invited); 2017-03-01 - 2017-03-03; in: "AHPC 2017, Austrian HPC Meeting 2017", G. Baumgartner, J. Courian (ed.); FSP Scientific Computing, University of Innsbruck, (2017), 33.

J. Träff:
"mpicroscope: Towards an MPI Benchmark Tool for Performance Guideline Verification";
Talk: 19th European MPI Users' Group Meeting, EuroMPI 2012, Vienna, Austria; 2012-09-23 - 2012-09-26; in: "Recent Advances in the Message Passing Interface Proceedings of the 19th European MPI Users' Group Meeting, EuroMPI 2012", J. Träff, S. Benkner, J. Dongarra (ed.); Springer, LNCS 7490 (2012), ISBN: 978-3-642-33517-4; 100 - 109.

J. Träff:
"Optimal MPI Datatype Normalization for Vector and Index-block Types";
Talk: 21st European MPI Users' Group Meeting, EuroMPI/ASIA 2014, Kyoto, Japan; 2014-09-09 - 2014-09-12; in: "Proceedings of the 21st European MPI Users' Group Meeting", J. Dongarra, Y. Ishikawa, A. Hori (ed.); ACM, New York, NY, USA (2014), ISBN: 978-1-4503-2875-3; 33 - 38.

J. Träff:
"Practical, Linear-time, Fully Distributed Algorithms for Irregular Gather and Scatter";
Talk: 24th European MPI Users' Group Meeting (EuroMPI/USA 2017), Chicago, IL, USA; 2017-09-25 - 2017-09-28; in: "Proceedings of the 24th European MPI Users' Group Meeting (EuroMPI/USA 2017)", ACM, (2017), ISBN: 978-1-4503-4849-2; 1:1 - 1:10.

J. Träff:
"Signature Datatypes for Type Correct Collective Operations, Revisited";
Talk: 27th European MPI Users' Group Meeting (EuroMPI/USA 2020) - Online Conference, Austin, Texas, USA; 2020-09-21 - 2020-09-24; in: "Proceedings of the 27th European MPI Users' Group Meeting (EuroMPI/USA 2020)", IEEE, (2020), ISBN: 978-1-4503-8880-1; 81 - 88.

J. Träff, S. Hunold:
"Cartesian Collective Communication";
Talk: 48th International Conference on Parallel Processing (ICPP 2019), Kyoto, Japan; 2019-08-05 - 2019-08-08; in: "Proceedings of the 48th International Conference on Parallel Processing (ICPP 2019)", ACM, (2019), ISBN: 978-1-4503-6295-5; 48:1 - 48:11.

J. Träff, S. Hunold:
"Decomposing MPI Collectives for Exploiting Multi-lane Communication";
Talk: IEEE International Conference on Cluster Computing (IEEE Cluster 2020) - Online Conference, Kobe, Japan; 2020-09-14 - 2020-09-17; in: "Proceedings of the IEEE International Conference on Cluster Computing (IEEE Cluster 2020)", IEEE, (2020), ISBN: 978-1-7281-6678-0; 270 - 280.

J. Träff, S. Hunold, G. Mercier, D. Holmes:
"Collectives and Communicators: A Case for Orthogonality: (Or: How to get rid of MPI neighbor and enhance Cartesian collectives)";
Talk: 27th European MPI Users' Group Meeting (EuroMPI/USA 2020) - Online Conference, Austin, Texas, USA; 2020-09-21 - 2020-09-24; in: "Proceedings of the 27th European MPI Users' Group Meeting (EuroMPI/USA 2020)", IEEE, (2020), ISBN: 978-1-4503-8880-1; 31 - 38.

J. Träff, F. Lübbe:
"Specification Guideline Violations by MPI_Dims_create";
Poster: 22nd European MPI Users' Group Meeting, EuroMPI 2015, Bordeaux, France; 2015-09-21 - 2015-09-23; in: "Proceedings of the 22nd European MPI Users' Group Meeting, EuroMPI 2015", J. Dongarra, A. Denis, B. Goglin, E. Jeannot, G. Mercier (ed.); ACM, (2015), ISBN: 978-1-4503-3795-3; Paper ID 19, 2 pages.

J. Träff, F. Lübbe, A. Rougier, S. Hunold:
"Isomorphic, Sparse MPI-like Collective Communication Operations for Parallel Stencil Computations";
Talk: 22nd European MPI Users' Group Meeting, EuroMPI 2015, Bordeaux, France; 2015-09-21 - 2015-09-23; in: "Proceedings of the 22nd European MPI Users' Group Meeting, EuroMPI 2015", J. Dongarra, A. Denis, B. Goglin, E. Jeannot, G. Mercier (ed.); ACM, (2015), ISBN: 978-1-4503-3795-3; Paper ID 10, 10 pages.

J. Träff, M. Pöter:
"POSTER: A more Pragmatic Implementation of the Lock-free, Ordered, Linked List";
Poster: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2021) - Online Conference, Seoul, South Korea; 2021-02-27 - 2021-03-03; in: "Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2021)", J. Lee, E. Petrank (ed.); ACM, (2021), ISBN: 978-1-4503-8294-6; 457 - 459.

J. Träff, A. Rougier:
"MPI Collectives and Datatypes for Hierarchical All-to-all Communication";
Talk: 21st European MPI Users' Group Meeting, EuroMPI/ASIA 2014, Kyoto, Japan; 2014-09-09 - 2014-09-12; in: "Proceedings of the 21st European MPI Users' Group Meeting", J. Dongarra, Y. Ishikawa, A. Hori (ed.); ACM, New York, NY, USA (2014), ISBN: 978-1-4503-2875-3; 27 - 32.

J. Träff, A. Rougier:
"Zero-copy, Hierarchical Gather is not possible with MPI Datatypes and Collectives";
Talk: 21st European MPI Users' Group Meeting, EuroMPI/ASIA 2014, Kyoto, Japan; 2014-09-09 - 2014-09-12; in: "Proceedings of the 21st European MPI Users' Group Meeting", J. Dongarra, Y. Ishikawa, A. Hori (ed.); ACM, New York, NY, USA (2014), ISBN: 978-1-4503-2875-3; 39 - 44.

J. Träff, A. Rougier, S. Hunold:
"Implementing a Classic: Zero-copy All-to-all Communication with MPI Datatypes";
Talk: 28th ACM International Conference on Supercomputing, ICS 2014, Munich, Germany; 2014-06-10 - 2014-06-13; in: "Proceedings of the 28th ACM International Conference on Supercomputing, ICS 2014", M. Gerndt, P. Stenström, L. Rauchwerger, B. Miller, M. Schulz (ed.); ACM, (2014), ISBN: 978-1-4503-2642-1; 135 - 144.

G. Tzenakis, A. Papatriantafyllou, H. Vandierendonck, P. Pratikakis, D. Nikolopoulos:
"BDDT: Block-Level Dynamic Dependence Analysis for Task-Based Parallelism";
Talk: 10th International Symposium on Advanced Parallel Processing Technologies, APPT 2013, Stockholm, Sweden; 2013-08-27 - 2013-08-28; in: "Advanced Parallel Processing Technologies, 10th International Symposium, APPT 2013, Revised Selected Papers", C. Wu, A. Cohen (ed.); Springer, LNCS 8299 (2013), ISBN: 978-3-642-45292-5; 17 - 31.

F. Versaci:
"OutFlank Routing: Increasing Throughput in Toroidal Interconnection Networks";
Talk: 19th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2013, Seoul, Korea; 2013-12-15 - 2013-12-18; in: "Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2013", IEEE Computer Society, (2013), ISBN: 978-1-4799-2081-5; 216 - 223.

F. Versaci, K. Pingali:
"Processor Allocation for Optimistic Parallelization of Irregular Programs";
Talk: 12th International Conference on Computational Science and Its Applications, ICCSA 2012, Salvador de Bahia, Brazil; 2012-06-18 - 2012-06-21; in: "Computational Science and Its Applications, ICCSA 2012, Proceedings of the 12th International Conference, Part I", B. Murgante, O. Gervasi, S. Misra, N. Nedjah, A. Rocha, D. Taniar, B. Apduhan (ed.); Springer, LNCS 7333 (2012), ISBN: 978-3-642-31124-6; 1 - 14.

K. von Kirchbach, M. Lehr, S. Hunold, C. Schulz, J. Träff:
"Efficient Process-to-Node Mapping Algorithms for Stencil Computations (Best Paper Award)";
Talk: IEEE International Conference on Cluster Computing (IEEE Cluster 2020) - Online Conference, Kobe, Japan; 2020-09-14 - 2020-09-17; in: "Proceedings of the IEEE International Conference on Cluster Computing (IEEE Cluster 2020)", IEEE, (2020), ISBN: 978-1-7281-6678-0; 1 - 11.

M. Wimmer:
"Wait-free Hyperobjects for Task-parallel Programming Systems";
Talk: IEEE 27th International Symposium on Parallel and Distributed Processing (IPDPS 2013), Boston, Massachusetts, USA; 2013-05-20 - 2013-05-24; in: "Proceedings of the IEEE 27th International Symposium on Parallel and Distributed Processing (IPDPS 2013)", IEEE Computer Society, (2013), ISBN: 978-1-4673-6066-1; 803 - 812.

M. Wimmer, D. Cederman, J. Träff, P. Tsigas:
"Work-stealing with Configurable Scheduling Strategies";
Poster: ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2013, Shenzhen, China; 2013-02-23 - 2013-02-27; in: "Proceedings of the 2013 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2013", ACM, (2013), ISBN: 978-1-4503-1922-5; 315 - 316.

M. Wimmer, J. Gruber, J. Träff, P. Tsigas:
"The Lock-Free k-LSM Relaxed Priority Queue";
Poster: 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2015, San Francisco, CA, USA; 2015-02-07 - 2015-02-11; in: "Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2015", A. Cohen, D. Grove (ed.); ACM, (2015), ISBN: 978-1-4503-3205-7; 277 - 278.

M. Wimmer, M. Pöter, J. Träff:
"The Pheet Task-Scheduling Framework on the Intel(R) Xeon Phi(TM)Coprocessor and other Multicore Architectures";
Talk: Workshop on Multithreaded Architectures and Applications (MTAAP 2013) in conjunction with IPDPS 2013, Boston, Massachusetts, USA; 2013-05-20 - 2013-05-24; in: "Proceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum", IEEE Computer Society, (2013), ISBN: 978-0-7695-4979-8; 1587 - 1596.

M. Wimmer, F. Versaci, J. Träff, D. Cederman, P. Tsigas:
"Data Structures for Task-based Priority Scheduling";
Poster: 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2014, Orlando, Florida, USA; 2014-02-15 - 2014-02-19; in: "Proceedings of the 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2014", ACM, New York, NY, USA (2014), ISBN: 978-1-4503-2656-8; 379 - 380.


Talks and Poster Presentations (without Proceedings-Entry)


E. Bajrovic, J. Träff:
"Using MPI Derived Datatypes in Numerical Libraries";
Talk: EuroMPI 2011, Santorini, Greece; 2011-09-18 - 2011-09-21.

W. Gropp, T. Hoefler, R. Thakur, J. Träff:
"Performance Expectations and Guidelines for MPI Derived Datatypes";
Talk: EuroMPI 2011, Santorini, Greece; 2011-09-18 - 2011-09-21.

S. Hunold:
"Accurately Measuring MPI Collectives with Synchronized Clocks";
Talk: Dagstuhl Seminar 15281: Algorithms and Scheduling Techniques to Manage Resilience and Power Consumption in Distributed Systems, Schloss Dagstuhl, Wadern, Germany (invited); 2015-07-05 - 2015-07-10.

S. Hunold:
"Can I repeat your parallel computing experiment? Yes, you canīt";
Talk: Technische Universität Dresden, Zentrale für Informationsdienste und Hochleistungsrechnen (ZIH), Dresden, Deutschland (invited); 2013-11-07.

S. Hunold:
"Clock Synchronization Algorithms and SimGrid";
Talk: SimGrid User Days, CNRS center Villa Clythia, Fréjus, France (invited); 2016-01-18 - 2016-01-21.

S. Hunold:
"Moldable Task Scheduling: Theory and Practice";
Talk: Workshop on New Challenges in Scheduling Theory, Aussois, France (invited); 2014-03-31 - 2014-04-04.

S. Hunold:
"On the Scalability of Moldable Task Scheduling Algorithms";
Talk: Dagstuhl Seminar 13381: Algorithms and Scheduling Techniques for Exascale Systems, Schloss Dagstuhl, Wadern, Germany (invited); 2013-09-15 - 2013-09-20.

S. Hunold:
"One Step towards Bridging the Gap between Theory and Practice in Moldable Task Scheduling with Precedence Constraints";
Talk: 9th Scheduling for Large Scale Systems Workshop, Lyon, France (invited); 2014-07-01 - 2014-07-04.

S. Hunold:
"One Step towards Bridging the Gap between Theory and Practice in Moldable Task Scheduling with Precedence Constraints";
Talk: AIT Austrian Institute of Technology, Seibersdorf, Austria (invited); 2014-11-25.

S. Hunold:
"One Step towards Bridging the Gap between Theory and Practice in Moldable Task Scheduling with Precedence Constraints";
Talk: Wirtschaftswissenschaftliche Fakultät, Universität Augsburg, Augsburg, Deutschland (invited); 2015-05-12.

S. Hunold:
"Reproducibility and Data Provenance with VisTrails";
Talk: WP8 meeting, ANR SONGS project, INRIA, Paris, France (invited); 2012-10-04.

S. Hunold:
"Reproducibility in Parallel Computing";
Talk: Session: Performance Reproducibility in HPC - Challenges and State-of-the-Art at the 27th International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2015), Austin, Texas (invited); 2015-11-15 - 2015-11-20.

S. Hunold:
"Reproducibility of Experiments: Itīs about the WHO and less the HOW";
Talk: Panel on reproducible research methodologies and new publication models, 4th International Workshop on Adaptive Self-tuning Computing Systems (ADAPT 2014) co-located with HiPEAC 2014, Vienna, Austria (invited); 2014-01-20 - 2014-01-22.

S. Hunold:
"The art of benchmarking MPI libraries";
Talk: Austrian HPC Meeting 2016 - AHPC16, Grundlsee, Austria (invited); 2016-02-22 - 2016-02-24.

S. Hunold:
"The Art of MPI Benchmarking";
Talk: Lunchtime Seminar, Department of Computer Science, University of Innsbruck, Innsbruck, Austria (invited); 2016-06-09.

S. Hunold:
"The Art of MPI Benchmarking";
Talk: 45th SPEEDUP Workshop on High-Performance Computing, Basel, Switzerland (invited); 2016-09-15 - 2016-09-16.

S. Hunold, A. Carpen-Amarie:
"Autotuning MPI Collectives using Performance Guidelines";
Talk: LIG - Bâtiment IMAG, St Martin d'Hères, France (invited); 2017-12-18.

S. Hunold, A. Carpen-Amarie, J. Träff:
"Reproducible MPI Micro-Benchmarking Isnīt As Easy As You Think";
Talk: Research Group Theory and Applications of Algorithms, University of Vienna, Vienna, Austria (invited); 2014-12-02.

S. Hunold, J. Lepping:
"Evolutionary Scheduling of Parallel Tasks Graphs onto Homogeneous Clusters";
Talk: New Challenges in Scheduling Theory, Centre CNRS, Frejus, France (invited); 2012-10-21 - 2012-10-27.

M. Levonyak:
"MPI related experience";
Talk: NEC User Group: NUG XXIV, Potsdam, Germany (invited); 2012-06-12 - 2012-06-15.

A. Papatriantafyllou:
"Multi-core prefix-sums";
Talk: 3rd Vienna Scientific Cluster User Workshop, Neusiedl am See, Austria (invited); 2014-02-24 - 2014-02-25.

A. Rougier:
"Datatypes in Exascale message-passing";
Talk: 3rd Vienna Scientific Cluster User Workshop, Neusiedl am See, Austria (invited); 2014-02-24 - 2014-02-25.

J. Träff:
"Cartesian Collective Communication: "Advice to users", "Advice to implementers", and "Advice to Standardizers"";
Talk: University of Bordeaux, Bordeaux, France (invited); 2019-12-03.

J. Träff:
"Challenges in Message-Passing Interfaces for Large-Scale Parallel Systems";
Talk: UPMARC Summer School on Multicore Computing, Uppsala University, Uppsala, Sweden (invited); 2013-06-10 - 2013-06-12.

J. Träff:
"Decomposing MPI Collectives for Exploiting Multi-lane Communication";
Talk: SPCL_Bcast, ETH Zürich - Online, Zurich, Switzerland (invited); 2020-12-03.

J. Träff:
"Effective MPI Programming: Concepts, Advanced Features, Do's and Don'ts";
Talk: Vienna Scientific Cluster: VSC School Seminar, TU Wien, Vienna, Austria (invited); 2016-01-13.

J. Träff:
"Fast Processing of MPI Derived Datatypes?";
Talk: Mini Workshop Algorithms Engineering, Uni Wien, Vienna, Austria (invited); 2017-09-07.

J. Träff:
"High Performance Expectations for MPI";
Talk: Friedrich-Alexander-Universität Erlangen-Nürnberg, Prof. Dr. Gerhard Wellein, Erlangen, Germany (invited); 2017-05-18.

J. Träff:
"History and Development of MPI: the Message-Passing Interface";
Talk: CSE Day KTH Stockholm, Stockholm, Sweden (invited); 2012-11-21.

J. Träff:
"History and development of the MPI standard";
Talk: AIT Austrian Institute of Technology, Seibersdorf, Austria (invited); 2012-12-05.

J. Träff:
"History of MPI";
Keynote Lecture: UPMARC Summer School on Multicore Computing, Uppsala University, Uppsala, Sweden (invited); 2013-06-10 - 2013-06-12.

J. Träff:
"Implementing a classic: zero-copy all-to-all communication with MPI datatypes";
Talk: Department of Computer Science, University of Copenhagen, Copenhagen, Denmark (invited); 2014-06-06.

J. Träff:
"Large-scale message passing concepts in EPiGRAM";
Talk: Workshop on Exascale MPI (ExaMPI 2013) at Supercomputing Conference 2013, Denver, Colorado, USA (invited); 2013-11-17 - 2013-11-23.

J. Träff:
"MPI Datatype reconstruction (for vector and index types)";
Talk: Compilers and Languages Group, Institute of Computer Languages, TU Wien, Vienna, Austria (invited); 2015-12-01.

J. Träff:
"On Optimal trees for Irregular Gather and Scatter Collectives";
Talk: Uppsala University, Uppsala, Sweden (invited); 2018-09-13.

J. Träff:
"On Optimal Trees for Irregular Gather and Scatter Collectives";
Talk: Kolloquium Mathematische Informatik, Goethe-Universität Frankfurt am Main, Frankfurt am Main, Germany (invited); 2019-02-11.

J. Träff:
"On optimal Trees for irregular gather and scatter collectives?";
Talk: Humboldt-Universität zu Berlin, Research Group on Modeling and Analysis of Complex Systems, Berlin, Germany (invited); 2019-07-24.

J. Träff:
"On optimal Trees for irregular gather and scatter collectives?";
Talk: FernUniversität in Hagen, Prof. Dr. Jörg Keller, Hagen, Germany (invited); 2019-10-23.

J. Träff:
"On The Power of Structured Data in MPI";
Talk: Guest Lecture of the course: Parallel and High Performance Computing, LMU Munich, Munich, Germany (invited); 2016-12-16.

J. Träff:
"Polynomial-Time Construction of Optimal MPI Derived Datatype Trees";
Talk: Leibniz-Rechenzentrum (LRZ), Garching bei München, Germany (invited); 2016-12-16.

J. Träff:
"Scalability, Expressivity and Performance Portability of Message-Passing Interface(s)";
Talk: VSC Workshop Vienna Scientific Cluster, Neusiedl/See, Austria (invited); 2012-02-27 - 2012-02-28.

J. Träff:
"The past 25 years of MPI";
Talk: Panel at ISC High Performance Conference 2017 - The HPC Event, Intel booth, Frankfurt, Germany (invited); 2017-06-18 - 2017-06-22.

J. Träff:
"The Power of Structured Data in MPI";
Talk: IģMS Seminar Series, Aachen GRS, RWTH Aachen, Aachen, Germany (invited); 2014-06-30.

J. Träff:
"The Power of Structured Data in MPI";
Talk: Research Group Theory and Applications of Algorithms and Research Group Scientific Computing, University of Vienna, Vienna, Austria (invited); 2014-10-07.

J. Träff:
"The Power of Structured Data in MPI";
Talk: Compiler Technology and Computer Architecure Group at the University of Hertfordshire, Hertfordshire, United Kingdom (invited); 2014-10-29.

J. Träff:
"The Power of Structured Data in MPI";
Talk: The University of Texas at Austin, Prof. Robert A. van de Geijn, Austin, Texas (invited); 2015-06-23.

J. Träff:
"The Relative Power of Synchronization Primitives";
Talk: Computational Mathematics in Engineering Group - Prof. Dr. Joachim Schöberl, Institute for Analysis and Scientific Computing, TU Wien, Vienna, Austria (invited); 2015-06-08.

J. Träff:
"Tutorial: Effective MPI Programming: concepts, advanced features, doīs and dontīs";
Talk: Tutorial on MPI at the 22nd International European Conference on Parallel and Distributed Computing (Euro-Par 2016), Grenoble, France (invited); 2016-08-22 - 2016-08-26.

J. Träff:
"Unique Features of MPI: Collective Operations on Structured Data";
Keynote Lecture: 20th European MPI Users' Group Meeting, EuroMPI 2013, Madrid, Spain (invited); 2013-09-15 - 2013-09-18.

F. Versaci:
"OutFlank routing: increasing throughput in toroidal interconnection networks";
Talk: Scalable Approaches to High Performance and High Productivity Computing, ScalPerf'13, Bertinoro, Italy (invited); 2013-09-22 - 2013-09-27.

F. Versaci, G. Bilardi:
"Stochastic optimization and memory management";
Talk: Scalable Approaches to High Performance and High Productivity Computing, ScalPerf'12, Bertinoro, Italy (invited); 2012-09-23 - 2012-09-27.

E. Wimmer:
"Is Gossip-inspired reduction competitive in high performance computing?";
Talk: International Workshop on Parallel Numerics (PARNUM 2017), Waischenfeld, Germany; 2017-04-19 - 2017-04-21.

M. Wimmer:
"Paralleles Rechnen für wissenschaftliche Anwendungen";
Talk: International Summer School Lower Austria, Waidhofen/Ybbs, Austria (invited); 2012-07-24 - 2012-07-27.

M. Wimmer:
"The Pheet Task-Scheduling Framework";
Talk: Massachusetts Institute of Technology - Group CSAIL, Cambridge, Massachusetts, USA (invited); 2013-05-20.

M. Wimmer, D. Cederman, J. Träff, P. Tsigas:
"Work-stealing with Configurable Scheduling Strategies";
Poster: MADALGO Summer School on DATA STRUCTURES, Aarhus University, Denmark; 2013-08-19 - 2013-08-22.


Doctor's Theses (authored and supervised)


M. Wimmer:
"Variations on Task Scheduling for Shared Memory Systems";
Supervisor, Reviewer: J. Träff, K. Agrawal, M. Papatriantafilou; Institut für Informationssysteme, Research Group Parallel Computing, 2014; oral examination: 2014-06-26.


Diploma and Master Theses (authored and supervised)


C. Eisserer:
"Embedded Real-Time 3D Stereo Vision on Multicore Digital Signal Processors";
Supervisor: J. Träff; Institut für Informationssysteme, Arbeitsbereich Parallel Computing, 2013; final examination: 2013-06-07.

J. Gruber:
"KLSM: A Relaxed Lock-Free Priority Queue";
Supervisor: J. Träff; Institute of Information Systems, Parallel Computing Group, 2015; final examination: 2016-02-29.

L. Haselsteiner:
"Adaptive Work-Stealing Techniques";
Supervisor: J. Träff, M. Wimmer; Institut für Informationssysteme, Parallel Computing Group, 2015; final examination: 2015-06-03.

M. Kainer:
"More Parallelism in Single-Source Shortest Path Algorithms: Simulation and Implementation";
Supervisor: J. Träff; Institute of Computer Engineering, Research Division of Parallel Computing, 2018; final examination: 2019-01-24.

T. Kainrad:
"Providing Transparent Remote Access to HPC Resources for Graphical Desktop Applications";
Supervisor: S. Hunold; Institute of Computer Engineering, Research Division of Parallel Computing, 2018; final examination: 2018-06-12.

M. Kalany:
"Efficient Construction of Provably Optimal MPI Datatype Representations";
Supervisor: J. Träff; Institut für Informationssysteme, Parallel Computing Group, 2015; final examination: 2015-11-10.

M. Lehr:
"Efficient Process Mapping for Cartesian Topologies";
Supervisor: J. Träff; Institute of Computer Engineering, Research Division of Parallel Computing, 2019; final examination: 2020-01-20.

F. Lübbe:
"Planung und automatische Verfeinerung der Laufzeitmessung von MPI-Operationen";
Supervisor: R. Vollmar, J. Träff, T. Worsch; KIT Karlsruhe und AB Parallel Computing, Institut für Informationssysteme, TU Wien, 2014.

N. Mayerhofer:
"Untersuchung der Implementierbarkeit eines lock-freien binären Suchbaumes";
Supervisor: J. Träff; Institute of Information Systems, Parallel Computing Group, 2016; final examination: 2016-10-04.

M. Pöter:
"Effective Memory Reclamation for Lock-Free Data Structures in C++";
Supervisor: J. Träff; Institute of Computer Engineering, Research Division of Parallel Computing, 2018; final examination: 2018-03-01.

B. Redl:
"Extending the Pheet framework for Parallel Pipelined Applications";
Supervisor: J. Träff; Institute of Information Systems, Parallel Computing Group, 2016; final examination: 2016-10-04.

N. Roth:
"The Causes of Run Time Variability in HPC, How to Pin Them down, and How to Handle Them";
Supervisor: S. Hunold; Institute of Computer Engineering, Research Division of Parallel Computing, 2021; final examination: 2021-03-10.

B. Sarközi:
"To Co-Schedule or Not To Co-Schedule? Efficiently Utilizing Large Multicore Machines";
Supervisor: S. Hunold; Institute of Computer Engineering, Research Unit Parallel Computing, 2021; final examination: 2021-11-05.


Scientific Reports


A. Carpen-Amarie, S. Hunold, J. Träff:
"MPI Derived Datatypes: Performance Expectations and Status Quo";
Report for CoRR - Computing Research Repository; Report No. arXiv:1607.00178, 2016; 46 pages.

M. Faraj, A. van der Grinten, H. Meyerhenke, J. Träff, C. Schulz:
"High-Quality Hierarchical Process Mapping";
Report for CoRR - Computing Research Repository; Report No. arXiv:2001.07134, 2020; 21 pages.

R. Ganian, M. Kalany, St. Szeider, J. Träff:
"Polynomial-time Construction of Optimal Tree-structured Communication Data Layout Descriptions";
Report for CoRR - Computing Research Repository; Report No. arXiv:1506.09100, 2015; 18 pages.

J. Gruber, J. Träff, M. Wimmer:
"Benchmarking Concurrent Priority Queues: Performance of k-LSM and Related Data Structures";
Report for CoRR - Computing Research Repository; Report No. arXiv:1603.05047, 2016; 17 pages.

S. Hunold:
"A Survey on Reproducibility in Parallel Computing";
Report for CoRR - Computing Research Repository; Report No. arXiv:1511.04217, 2015; 15 pages.

S. Hunold, A. Carpen-Amarie:
"MPI Benchmarking Revisited: Experimental Design and Reproducibility";
Report for CoRR - Computing Research Repository; Report No. arXiv:1505.07734, 2015; 40 pages.

S. Hunold, A. Carpen-Amarie:
"Tuning MPI Collectives by Verifying Performance Guidelines";
Report for CoRR - Computing Research Repository; Report No. arXiv:1707.09965, 2017; 16 pages.

S. Hunold, A. Carpen-Amarie, F. Lübbe, J. Träff:
"PGMPI: Automatically Verifying Self-Consistent MPI Performance Guidelines";
Report for CoRR - Computing Research Repository; Report No. arXiv:1606.00215, 2016; 13 pages.

S. Hunold, B. Przybylski:
"Scheduling.jl - Collaborative and Reproducible Scheduling Research with Julia";
Report for CoRR - Computing Research Repository; Report No. arXiv:2003.05217, 2020; 5 pages.

S. Hunold, J. Träff:
"On the State and Importance of Reproducible Experimental Research in Parallel Computing";
Report for CoRR - Computing Research Repository; Report No. arXiv:1308.3648, 2013; 15 pages.

S. Hunold, K. von Kirchbach, M. Lehr, C. Schulz, J. Träff:
"Efficient Process-to-Node Mapping Algorithms for Stencil Computations";
Report for CoRR - Computing Research Repository; Report No. arXiv:2005.09521, 2020; 18 pages.

M. Kainer, J. Träff:
"More Parallelism in Dijkstra's Single-Source Shortest Path Algorithm";
Report for CoRR - Computing Research Repository; Report No. arXiv:1903.12085, 2019; 29 pages.

C. Pachajoa, M. Levonyak, W. Gansterer, J. Träff:
"How to Make the Preconditioned Conjugate Gradient Method Resilient Against Multiple Node Failures";
Report for CoRR - Computing Research Repository; Report No. arXiv:1907.13077, 2019; 10 pages.

M. Pöter:
"Pheet meets C++11";
Report for CoRR - Computing Research Repository; Report No. arXiv:1411.1951, 2014; 19 pages.

M. Pöter, J. Träff:
"A new and five older Concurrent Memory Reclamation Schemes in Comparison (Stamp-it)";
Report for CoRR - Computing Research Repository; Report No. arXiv:1712.06134, 2017; 34 pages.

M. Pöter, J. Träff:
"Memory Models for C/C++ Programmers";
Report for CoRR - Computing Research Repository; Report No. arXiv:1803.04432, 2018; 15 pages.

M. Pöter, J. Träff:
"Stamp-it: A more Thread-efficient, Concurrent Memory Reclamation Scheme in the C++ Memory Model";
Report for CoRR - Computing Research Repository; Report No. arXiv:1805.08639, 2018; 27 pages.

C. Schulz, J. Träff:
"Better Process Mapping and Sparse Quadratic Assignment";
Report for CoRR - Computing Research Repository; Report No. arXiv:1702.04164, 2017; 13 pages.

C. Schulz, J. Träff:
"VieM v1.00 - Vienna Mapping and Sparse Quadratic Assignment User Guide";
Report for CoRR - Computing Research Repository; Report No. arXiv:1703.05509, 2017; 9 pages.

C. Siebert, J. Träff:
"Perfectly load-balanced, optimal, stable, parallel merge";
Report for CoRR - Computing Research Repository; Report No. arXiv:1303.4312, 2013; 8 pages.

J. Träff:
"A Doubly-pipelined, Dual-root Reduction-to-all Algorithm and Implementation";
Report for CoRR - Computing Research Repository; Report No. arXiv:2109.12626, 2021; 8 pages.

J. Träff:
"A Note on (Parallel) Depth- and Breadth-First Search by Arc Elimination";
Report for CoRR - Computing Research Repository; Report No. arXiv:1305.1222, 2013; 7 pages.

J. Träff:
"Decomposing Collectives for Exploiting Multi-lane Communication";
Report for CoRR - Computing Research Repository; Report No. arXiv:1910.13373, 2019; 77 pages.

J. Träff:
"k-ported vs. k-lane Broadcast, Scatter, and Alltoall Algorithms";
Report for CoRR - Computing Research Repository; Report No. arXiv:2008.12144, 2020; 55 pages.

J. Träff:
"On Optimal Trees for Irregular Gather and Scatter Collectives";
Report for CoRR - Computing Research Repository; Report No. arXiv:1711.08731, 2017; 12 pages.

J. Träff:
"Parallel Quicksort without Pairwise Element Exchange";
Report for CoRR - Computing Research Repository; Report No. arXiv:1804.07494, 2018; 4 pages.

J. Träff:
"Practical, Linear-time, Fully Distributed Algorithms for Irregular Gather and Scatter";
Report for CoRR - Computing Research Repository; Report No. arXiv:1702.05967, 2017; 17 pages.

J. Träff:
"Simplified, stable parallel merging";
Report for CoRR - Computing Research Repository; Report No. arXiv:1202.6575, 2012; 6 pages.

J. Träff:
"The Shortest Path Problem with Edge Information Reuse is NP-Complete";
Report for CoRR - Computing Research Repository; Report No. arXiv:1509.05637, 2015; 3 pages.

J. Träff, A. Carpen-Amarie, S. Hunold, A. Rougier:
"Message-Combining Algorithms for Isomorphic, Sparse Collective Communication";
Report for CoRR - Computing Research Repository; Report No. arXiv:1606.07676, 2016; 12 pages.

J. Träff, M. Pöter:
"A more Pragmatic Implementation of the Lock-free, Ordered, Linked List";
Report for CoRR - Computing Research Repository; Report No. arXiv:2010.15755, 2020; 14 pages.

J. Träff, M. Wimmer:
"An improved, easily computable combinatorial lower bound for weighted graph bipartitioning";
Report for CoRR - Computing Research Repository; Report No. arXiv:1410.0462, 2014; 33 pages.

M. Wimmer, D. Cederman, J. Träff, P. Tsigas:
"Configurable Strategies for Work-stealing";
Report for CoRR - Computing Research Repository; Report No. arXiv:1305.6474, 2013; 17 pages.

M. Wimmer, D. Cederman, F. Versaci, J. Träff, P. Tsigas:
"Data Structures for Task-based Priority Scheduling";
Report for CoRR - Computing Research Repository; Report No. arXiv:1312.2501, 2013; 19 pages.

M. Wimmer, J. Gruber, J. Träff, P. Tsigas:
"The Lock-free k-LSM Relaxed Priority Queue";
Report for CoRR - Computing Research Repository; Report No. arXiv:1503.05698, 2015; 18 pages.