Publications

This page gives an overview on all academic publications related to the DEEP projects.

O. Rausch, T. Ben-Nun, N. Dryden, A. Ivanov, S. Li, T. Hoefler
28 June, 2022

A Data-Centric Optimization Framework for Machine Learning

A. Calotoiu, T. Ben-Nun, T. Schneider, J. de Fine Licht, T. Hoefler
22 June, 2022

Lifting C Semantics for Dataflow Optimization

P. Schaad, T. Ben-Nun, T. Hoefler
21 June, 2022

Boosting Performance Optimization with Interactive Data Movement Visualization

S. Di Girolamo, D. De Sensi, K. Taranov, M. Malesevic, M. Besta, T. Schneider, S. Kistler, T. Hoefler
20 June, 2022

Building Blocks for Network-Accelerated Distributed File Systems

A. Nikolaos Ziogas, G. Zygmunt Kwasniewski, T. Ben-Nun, T. Schneider, T. Hoefler
16 June, 2022

Deinsum: Practically I/O Optimal Multilinear Algebra

T. Ben-Nun, L. Groner, F. Deconinck, T. Wicky, E. Davis, J. Dahm, O. Elbert, R. George, J. McGibbon, L. Trümper, E. Wu, O. Fuhrer, T. Schulthess and T. Hoefler
9 May, 2022

Productive Performance Engineering for Weather and Climate Modeling with Python

Marcin Copik, Alexandru Calotoiu, Konstantin Taranov, Torsten Hoefler
28 March, 2022

FaaSKeeper: Learning from Building Serverless Services with ZooKeeper as an Example

Kévin Mambu, Henri-Pierre Charles, Maha Kooli, Julie Dumas
16 March, 2022

Towards Integration of a Dedicated Memory Controller and Its Instruction Set to Improve Performance of Systems Containing Computational SRAM

A. Netti, M. Ott, C. Guillen, D. Tafani, M. Schulz
1 March, 2022

Operational Data Analytics in practice: Experiences from design to deployment in production HPC environments

S. Bouhrour, T. Pepin, J. Jaeger

Towards leveraging collective performance with the support of MPI 4.0 features in MPC

M. Marazakis, M. Duranton, D. Pleiter, G. Taffoni, and H.C. Hoppe
15 February, 2022

HPC for Urgent Decision-Making

P. Carpenter, U.-U. Haus, E. Laure

Heterogeneous High Performance Computing

A. N. Ziogas, T. Schneider, T. Ben-Nun, A. Calotoiu, T. De Matteis, J. de Fine Licht, L. Lavarini, T. Hoefler
13 November, 2021

Productivity, portability, performance: data-centric Python

O. Aumage, P. Carpenter, S. Benkner
5 October, 2021

Task-Based Performance Portability in HPC

V. Lopez, J. Criado, R. Peñacoba, R. Ferrer, X. Teruel, M. Garcia-Gasulla
8 September, 2021

An OpenMP free agent threads implementation

M. Karp, A. Podobas, T. Kenter, N. Jansson, Ch. Plessl, Ph. Schlatter, and S. Markidis
27 August, 2021

A High-Fidelity Flow Solver for Unstructured Meshes on Field-Programmable Gate Arrays: Design, Evaluation, and Future Challenges

I. Vardas, M. Ploumidis, M. Marazakis
1 August, 2021

Exploring the impact of node failures on the resource allocation for parallel jobs

P. Radojković, P. Carpenter, P. Esmaili-Dokht, R. Cimadomo, H.-P. Charles, A. Sebastian, P. Amato
29 July, 2021

Processing in Memory: The Tipping Point

C. Cummins, Z. V. Fisches, T. Ben-Nun, T. Hoefler, M. O’Boyle, H. Leather
18 July, 2021

PROGRAML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations

M. Copik, K. Taranov, A. Calotoiu, T. Hoefler
25 June, 2021

rFaaS: Enabling High Performance Serverless with RDMA and Leases