P30: MPI/OpenMP Parallelization of the Hartree-Fock
Method for the Second Generation Intel Xeon Phi
SessionPoster Reception
Authors
Event Type
ACM Student Research Competition
Poster
Reception
TimeTuesday, November 14th5:15pm -
7pm
LocationFour Seasons Ballroom
DescriptionReplication of critical data structures in the MPI-only
GAMESS Hartree-Fock algorithm limits the full
utilization of the manycore Intel Xeon Phi processor. In
this work, modern OpenMP threading techniques are used
to implement hybrid MPI/OpenMP algorithms. Two separate
implementations that differ by the sharing and
replication details of key data structures among threads
are considered. The hybrid MPI/OpenMP implementations
reduce the memory footprint by approximately 200 times
compared to the legacy code. The MPI/OpenMP code was
shown to run up to six times faster than the original
for a range of molecular system sizes. The
implementation details and stratgeies will be presented
for both hybrid algorithms. Benchmark scaling results
results utilizing up to 3000 Intel Xeon Phi processors
will also be discussed.




