Presentation

· Presenters · Organizations · Search Program

Paper

: Toward Standardized Near-Data Processing with Unrestricted Data Placement for GPUs

SessionGPUs and Communication

Authors

Gwangsun Kim

Niladrish Chatterjee

Mike O'Connor

Kevin Hsieh

Event Type

Paper

Tags

TimeWednesday, November 15th11:30am - 12pm

Location402-403-404

Description3D-stacked memory devices with processing logic can help alleviate the memory bandwidth bottleneck in GPUs. However, in order for such Near-Data Processing (NDP) memory stacks to be used for different GPU architectures, it is desirable to standardize the NDP architecture. Our proposal enables this standardization by allowing data to be spread across multiple memory stacks as is the norm in high-performance systems without an MMU on the NDP stack. The keys to this architecture are the ability to move data between memory stacks as required for computation, and a partitioned execution mechanism that offloads memory-intensive application segments onto the NDP stack and decouples address translation from DRAM accesses. By enhancing this system with a smart offload selection mechanism that is cognizant of the compute capability of the NDP and cache locality on the host processor, system performance and energy are improved by up to 66.8% and 37.6%, respectively.

Download PDF: here

Authors

Gwangsun Kim

ARM Ltd

Niladrish Chatterjee

Nvidia Corporation

Mike O'Connor

Nvidia Corporation

Kevin Hsieh

Carnegie Mellon University

Navigation