|
Monday, February 27 - Queen Anne Ballroom (except for poster session)
|
|
|
Opening Remarks, PPoPP Best Paper Award (8:30 - 9 AM)
|
|
|
|
Joint HPCA/PPoPP Keynote 1 (9 - 10 AM)
|
|
|
|
Sanjeev Kumar (Facebook): Social Networking at Scale
|
|
|
|
Session 1: GPU Tools (10:30 AM - Noon)
Chair: Michael Garland (NVIDIA)
|
|
|
|
Scalable Framework for Mapping Streaming Applications onto Multi-GPU Systems
Huynh Phung Huynh¹, Andrei Hagiescu², Weng-Fai Wong², Rick Siow Mong Goh¹
¹A*STAR Institute of High Performance Computing, ²National University of Singapore
|
|
|
|
A Performance Analysis Framework for Identifying Potential Benefits in GPGPU Applications
Jae Woong Sim¹, Aniruddha Dasgupta², Hyesoon Kim¹, Richard Vuduc¹
¹Georgia Institute of Technology, ²Advanced Micro Devices
|
|
|
|
Efficient Performance Evaluation of Memory Hierarchy for Highly Multithreaded Graphics Processors
Sara Baghsorkhi, Isaac Gelado, Matthieu Delahaye, Wen-mei Hwu
University of Illinois at Urbana-Champaign
|
|
|
|
Session 2: Communication & SIMD Optimization (1:30 - 3:30 PM)
Chair: Samuel Midkiff (Purdue University)
|
|
|
|
Communication Avoiding Symmetric Band Reduction
Grey Ballard, James Demmel, Nicholas Knight
University of California, Berkeley
|
|
|
|
Faster Topology-Aware Collective Algorithms Through Non-Minimal Communication (BEST PAPER AWARD)
Paul Sack and William Gropp
University of Illinois at Urbana-Champaign
|
|
|
|
Efficient SIMD Code Generation for Irregular Kernels
Seonggun Kim and Hwansoo Han
Sungkyunkwan University
|
|
|
|
Extending a C-like Language for Portable SIMD Programming
Roland Leißa¹, Sebastian Hack¹, Ingo Wald²
¹Saarland University, ²Intel
|
|
|
|
|
|
Session 3: Programming Models (4:30 - 6 PM)
Chair: Jaejin Lee (Seoul National University)
|
|
|
|
A Hybrid Approach of OpenMP for Clusters
Okwan Kwon, Fahed Jubair, Rudolf Eigenmann, Samuel Midkiff
Purdue University
|
|
|
|
DOJ: Dynamically Parallelizing Object-Oriented Programs
Yong hun Eom, Stephen Yang, James Jenista, Brian Demsky
University of California, Irvine
|
|
|
|
S: A Scripting Language for High-Performance RESTful Web Services
Daniele Bonetta, Achille Peternier, Cesare Pautasso, Walter Binder
University of Lugano
|
|
|
Tuesday, February 28 - Queen Anne Ballroom (except for poster session)
|
|
|
Joint HPCA/PPoPP Keynote 2 (9 - 10 AM)
|
|
|
|
Keshav Pingali (University of Texas, Austin): Parallel Programming Needs Data-Centric Abstractions
Chair: Charles Leiserson (MIT)
|
|
|
|
Session 4: GPU Algorithms (10:30 AM - Noon)
Chair: Rudolf Eigenmann (Purdue University)
|
|
|
|
A GPU Implementation of Inclusion-Based Points-to Analysis
Mario Mendez-Lojo¹, Martin Burtscher², Keshav Pingali¹
¹University of Texas, Austin, ²Texas State University
|
|
|
|
Scalable GPU Graph Traversal
Duane Merrill¹, Michael Garland², Andrew Grimshaw¹
¹University of Virginia, ²NVIDIA
|
|
|
|
GPU-based NFA Implementation for Memory Efficient High Speed Regular Expression Matching
Kunyang Peng, Yuan Zu, Lin Wang, Qunfeng Dong
University of Science and Technology of China
|
|
|
|
Session 5: Concurrent Data Structures (1:30 - 3:30 PM)
Chair: Michael Scott (University of Rochester)
|
|
|
|
A Methodology for Creating Fast Wait-Free Data Structures
Alex Kogan and Erez Petrank
Technion, Israel Institute of Technology
|
|
|
|
Concurrent Tries with Efficient Non-Blocking Snapshots
Aleksandar Prokopec¹, Nathan Bronson², Phil Bagwell³, Martin Odersky¹
¹EPFL, ²Stanford University, ³Typesafe
|
|
|
|
A Speculation-Friendly Binary Search Tree
Tyler Crain¹, Vincent Gramoli², Michel Raynal¹
¹University of Rennes, ²EPFL, Switzerland
|
|
|
|
PARRAY: A Unifying Array Representation for Heterogeneous Parallelism
Yifeng Chen, Xiang Cui, Hong Mei
Peking University
|
|
|
|
|
|
Session 6: Parallel Algorithms (4:30 - 6 PM)
Chair: Michel Raynal (University of Rennes)
|
|
|
|
Internally Deterministic Parallel Algorithms Can Be Fast
Guy E. Blelloch¹, Jeremy T. Fineman², Phillip B. Gibbons³, Julian Shun¹
¹Carnegie Mellon University, ²Georgetown University, ³Intel
|
|
|
|
Deterministic Parallel Random-Number Generation for Dynamic Multithreading Platforms
Charles E. Leiserson, Tao B. Schardl, Jim Sukha
Massachusetts Institute of Technology
|
|
|
|
Scalable Parallel Minimum Spanning Forest Computation
Sadegh Nobari¹, Thanh-Tung Cao¹, Panagiotis Karras², Stephane Bressan¹
¹National University of Singapore, ²Rutgers University
|
|
|
|
|
Wednesday, February 29 - Queen Anne Ballroom
|
|
|
Session 7: Correctness and Fault Tolerance (8:30 - 10 AM)
Chair: Mario Mendez-Lojo (Advanced Micro Devices)
|
|
|
|
GKLEE: Concolic Verification and Test Generation for GPUs
Guodong Li¹, Peng Li², Geof Sawaya², Ganesh Gopalakrishnan², Sreeranga Rajan¹, Indradeep Ghosh¹
¹Fujitsu Labs of America, ²University of Utah
|
|
|
|
Algorithm-based Fault Tolerance for Dense Matrix Factorizations
Peng Du, Aurelien Bouteiller, George Bosilca, Thomas Herault, Jack Dongarra
University of Tennessee, Knoxville
|
|
|
|
Efficient Deadlock Avoidance for Streaming Computation with Filtering
Jeremy Buhler, Kunal Agrawal, Peng Li, Roger Chamberlain
Washington University in St. Louis
|
|
|
|
Session 8: Scheduling and Synchronization (10:30 AM - Noon)
Chair: Kunal Agrawal (Washington University in St. Louis)
|
|
|
|
Lock Cohorting: A General Technique for Designing NUMA Locks
David Dice¹, Virendra Marathe¹, Nir Shavit²
¹Oracle Labs, ²MIT & Tel-Aviv University
|
|
|
|
Revisiting the Combining Synchronization Technique
Nikolaos Kallimanis¹ and Panagiota Fatourou²
¹University of Ioannina, ²University of Crete
|
|
|
|
A Work-Stealing Scheduler for X10's Task Parallelism with Suspension
Olivier Tardieu¹, Haichuan Wang², Haibo Lin³
¹IBM T.J. Watson Research Center, ²University of Illinois at Urbana-Champaign, ³IBM Research, China
|
|