Christian Engelmann
Biography
Dr. Christian Engelmann is an R&D Staff Scientist in the
Computer Science Research Group at Oak Ridge National
Laboratory. He has 15 years of experience in software
research and development for extreme-scale
high-performance computing (HPC) systems, with a strong
funding and publication record. In collaboration with
other laboratories and universities, Dr. Engelmann’s
research addresses computer science challenges in HPC
software, such as scalability, dependability, energy
efficiency, and portability. His primary expertise is in
HPC resilience, i.e., providing efficiency and correctness
in the presence of faults, errors, and failures through
avoidance, masking, and recovery. Dr. Engelmann is a
leading expert in HPC resilience and a member of the DOE
Technical Council on HPC Resilience. He received the 2015
DOE Early Career Award for research in resilience design
patterns for extreme-scale HPC. His secondary expertise is
in lightweight simulation of future-generation
extreme-scale supercomputers with millions of processors,
studying the impact of hardware and software properties on
the key HPC system design factors: performance,
resilience, and power consumption. Dr. Engelmann is a
member of the Association for Computing Machinery (ACM),
the Institute of Electrical and Electronics Engineers
(IEEE), and the Advanced Computing Systems Association
(USENIX).
Presentations
Birds of a Feather: HPC Center Planning and Operations, Reliability, Resiliency
Workshop: Algorithms, Exascale, Resiliency, SIGHPC Workshop
Paper: State of the Practice




