Accepted Papers
Paper | Authors |
---|---|
Making Strassen Matrix Multiplication Safe | Himeshi De Silva, John Gustafson and Weng-Fai Wong |
Synchronization-Avoiding Graph Algorithms | Jesun Firoz, Marcin Zalewski, Thejaka Amila Kanewala and Andrew Lumsdaine |
Workflow Simulation Aware and Multi-Threading Effective Task Scheduling for Heterogeneous Computing | Vasilios Kelefouras and Karim Djemame |
Why do Users Kill HPC Jobs? | Venkatesh-Prasad Ranganath and Daniel Andresen |
Dynamic Count-Min Sketch for Analytical Queries over Continuous Data Streams | Xiaobo Zhu, Hong Zhang, Guangjun Wu and Shupeng Wang |
Shared-Memory Parallel Maximal Clique Enumeration | Apurba Das, Seyed-Vahid Sanei-Mehri and Srikanta Tirthapura |
Quantification, Trade-off Analysis, and Optimal Checkpoint Placement for Reliability and Availability | Omer Subasi, Ramakrishna Tipireddy and Sriram Krishnamoorthy |
Accelerating TensorFlow with Adaptive RDMA-based gRPC | Rajarshi Biswas, Xiaoyi Lu and Dhabaleswar Panda |
Expediting Parallel Graph Connectivity Algorithms | Mihir Wadwekar and Kishore Kothapalli |
Adaptive Runtime Features for Distributed Graph Algorithms | Jesun Firoz, Marcin Zalewski and Andrew Lumsdaine |
Code and Data Transformations to Address Garbage Collector Performance in Big Data Processing | Damon Fenacci, Hans Vandierendonck and Dimitrios Nikolopoulos |
Balancing Stragglers Against Staleness in Distributed Deep Learning | Saurav Basu, Vaibhav Saxena, Rintu Panja and Ashish Verma |
Share-a-GPU: Providing Simple and Effective Time-Sharing on GPUs | Shaleen Garg, Kishore Kothapalli and Suresh Purini |
Parallel Nonnegative CP Decomposition of Dense Tensors | Koby Hayashi, Grey Ballard and Ramakrishnan Kannan |
Improving Provisioned Power Efficiency in HPC Systems with GPU-CAPP | Kramer Straube, Jason Lowe-Power, Christopher Nitta, Matthew Farrens and Venkatesh Akella |
Adaptive Pattern Matching with Reinforcement Learning for Dynamic Graphs | Hiroki Kanezashi, Toyotaro Suzumura, Dario Garcia-Gasulla, Satoshi Matsuoka and Min-Hwan Oh |
A Performance Prediction Framework for Irregular Applications | Gangyi Zhu and Gagan Agrawal |
Sampled Dense Matrix Multiplication for High-Performance Machine Learning | Israt Nisa, Aravind Sukumaran Rajam, Süreyya Emre Kurt, Changwan Hong and P Sadayappan |
Compiling SIMT Programs on Multi- and Many-core Processors with Wide Vector Units: A Case Study with CUDA | Hancheng Wu, John Ravi and Michela Becchi |
Probabilistic Sequential Consistency in Social Networks | Priyanka Singla, Shubhankar Suman Singh, K Gopinath and Smruti Sarangi |
Lossless Parallel Implementation of a Turbo Decoder on GPU | Karthikeyan Natarajan and Nitin Chandrachoodan |
OC-DNN: Exploiting Advanced Unified Memory Capabilities in CUDA 9 and Volta GPUs for Out-of-Core DNN Training | Ammar Ahmad Awan, Ching-Hsiang Chu, Hari Subramoni, Xiaoyi Lu and Dhabaleswar Panda |
Scalable Proximity-Based Methods for Large-Scale Analysis of Atom Probe Data | Hao Lu, Sudip Seal and Jonathan D Poplawsky |
A Shared-Memory Parallel Algorithm for Updating Single-Source Shortest Paths in Large Dynamic Networks | Sriram Srinivasan, Sara Riazi, Sajal Das, Boyana Norris and Sanjukta Bhowmick |
A Novel Approach for Handling Soft Error in Conjugate Gradients | Marissa Renardy, Muhammed Emin Ozturk, Yukun Li, Gagan Agrawal and Ching-Shan Chou |
Achieving Performance and Programmability for MapReduce(-like) Frameworks | Jia Guo and Gagan Agrawal |
Vidya: Performing Code-Block I/O Characterization for Data Access Optimization | Hariharan Devarajan, Anthony Kougkas, Prajwal Challa and Xian-He Sun |
Parallel Read Partitioning for Concurrent Assembly of Metagenomic Data | Vasudevan Rengasamy, Mahmut Kandemir, Paul Medvedev and Kamesh Madduri |
Acceleration of an Adaptive Cartesian Mesh CFD Solver in the Current Generation Processor Architectures | Harichand M V, Bharatkumar Sharma, Sudhakaran G and Ashok V |
DeepHyper: Asynchronous Hyperparameter Search for Deep Neural Networks | Prasanna Balaprakash, Misha Salim, Tom Uram, Venkatram Vishwanath and Stefan Wild |
Characterization of the Impact of Soft Errors on Iterative Methods | Burcu Mutlu, Gokcen Kestor, Joseph Manzano, Osman Unsal, Samrat Chatterjee and Sriram Krishnamoorthy |
Decentralized Privacy-preserving Timed Execution in Blockchain-based Smart Contract Platforms | Chao Li and Balaji Palanisamy |
Data-parallel Training of Generative Adversarial Networks on HPC Systems for HEP Simulations | Sofia Vallecorsa, Diana Moise, Federico Carminati and Gul Rukh Khattak |