PPoPP 2025
Sat 1 - Wed 5 March 2025
Las Vegas, Nevada, United States
Toggle navigation
Attending
Venue: The Westin Las Vegas Hotel & Spa
Registration
Accommodation Options
Student Travel Grants
Visa
Code of Conduct
Program
PPoPP Program
Your Program
Sat 1 Mar
Sun 2 Mar
Mon 3 Mar
Tue 4 Mar
Wed 5 Mar
Tracks
PPoPP 2025
Main Conference
Workshops and Tutorials
Artifact Evaluation
Organization
PPoPP 2025 Committees
Organizing Committee
Steering Committee
Track Committees
Main Conference
Workshops and Tutorials
Artifact Evaluation
Contributors
People Index
PPoPP Mailing List
Search
Series
Series
PPoPP 2025
PPoPP 2024
PPoPP 2023
PPoPP 2022
PPoPP 2021
PPoPP 2020
PPoPP 2019
PPoPP 2018
PPoPP 2017
PPoPP 2016
PPoPP 2015
PPoPP 2014
PPoPP 2013
PPoPP 2012
PPoPP 2011
PPoPP 2010
PPoPP 2009
Sign in
Sign up
PPoPP 2025
(
series
) /
The Westin Las Vegas Hotel & Spa
/
Room information: Acacia D
Venue
The Westin Las Vegas Hotel & Spa
Room name
Acacia D
Room Information
No extra information available
Program
Detailed Table
Session Timeline
Detailed Timeline
This program is tentative and subject to change.
Program Display Configuration
Time Zone
The program is currently displayed in
(GMT-08:00) Pacific Time (US & Canada)
.
Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)
Select other time zone
(GMT-12:00) AoE (Anywhere On Earth)
(GMT-11:00) Midway Island, Samoa
(GMT-10:00) Hawaii-Aleutian
(GMT-10:00) Hawaii
(GMT-09:30) Marquesas Islands
(GMT-09:00) Gambier Islands
(GMT-09:00) Alaska
(GMT-08:00) Tijuana, Baja California
(GMT-08:00) Pitcairn Islands
(GMT-08:00) Pacific Time (US & Canada)
(GMT-07:00) Mountain Time (US & Canada)
(GMT-06:00) Chihuahua, La Paz, Mazatlan
(GMT-07:00) Arizona
(GMT-06:00) Saskatchewan, Central America
(GMT-05:00) Guadalajara, Mexico City, Monterrey
(GMT-05:00) Easter Island
(GMT-06:00) Central Time (US & Canada)
(GMT-05:00) Eastern Time (US & Canada)
(GMT-05:00) Cuba
(GMT-05:00) Bogota, Lima, Quito, Rio Branco
(GMT-04:00) Caracas
(GMT-03:00) Santiago
(GMT-04:00) La Paz
(GMT-03:00) Faukland Islands
(GMT-04:00) Manaus, Amazonas, Brazil
(GMT-04:00) Atlantic Time (Goose Bay)
(GMT-04:00) Atlantic Time (Canada)
(GMT-03:30) Newfoundland
(GMT-03:00) UTC-3
(GMT-03:00) Montevideo
(GMT-03:00) Miquelon, St. Pierre
(GMT-03:00) Greenland
(GMT-03:00) Buenos Aires
(GMT-03:00) Brasilia, Distrito Federal, Brazil
(GMT-02:00) Mid-Atlantic
(GMT-01:00) Cape Verde Is.
(GMT-01:00) Azores
(UTC) Coordinated Universal Time
(GMT) Belfast
(GMT) Dublin
(GMT) Lisbon
(GMT) London
(GMT) Monrovia, Reykjavik
(GMT+01:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna
(GMT+01:00) Belgrade, Bratislava, Budapest, Ljubljana, Prague
(GMT+01:00) Brussels, Copenhagen, Madrid, Paris
(GMT+01:00) West Central Africa
(GMT+02:00) Windhoek
(GMT+02:00) Athens
(GMT+02:00) Beirut
(GMT+02:00) Cairo
(GMT+02:00) Gaza
(GMT+02:00) Harare, Pretoria
(GMT+02:00) Jerusalem
(GMT+03:00) Minsk
(GMT+03:00) Syria
(GMT+03:00) Moscow, St. Petersburg, Volgograd
(GMT+03:00) Nairobi
(GMT+03:30) Tehran
(GMT+04:00) Abu Dhabi, Muscat
(GMT+04:00) Yerevan
(GMT+04:30) Kabul
(GMT+05:00) Ekaterinburg
(GMT+05:00) Tashkent
(GMT+05:30) Chennai, Kolkata, Mumbai, New Delhi
(GMT+05:45) Kathmandu
(GMT+06:00) Astana, Dhaka
(GMT+07:00) Novosibirsk
(GMT+06:30) Yangon (Rangoon)
(GMT+07:00) Bangkok, Hanoi, Jakarta
(GMT+07:00) Krasnoyarsk
(GMT+08:00) Beijing, Chongqing, Hong Kong, Urumqi
(GMT+08:00) Irkutsk, Ulaan Bataar
(GMT+08:00) Perth
(GMT+08:45) Eucla
(GMT+09:00) Osaka, Sapporo, Tokyo
(GMT+09:00) Seoul
(GMT+09:00) Yakutsk
(GMT+10:30) Adelaide
(GMT+09:30) Darwin
(GMT+10:00) Brisbane
(GMT+11:00) Hobart
(GMT+10:00) Vladivostok
(GMT+11:00) Lord Howe Island
(GMT+11:00) Solomon Is., New Caledonia
(GMT+11:00) Magadan
(GMT+12:00) Norfolk Island
(GMT+12:00) Anadyr, Kamchatka
(GMT+13:00) Auckland, Wellington
(GMT+12:00) Fiji, Kamchatka, Marshall Is.
(GMT+13:45) Chatham Islands
(GMT+13:00) Nuku'alofa
(GMT+14:00) Kiritimati
The GMT offsets shown reflect the offsets
at the moment of the conference
.
Time Band
By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.
Display full program
Specify a time band
-
Save
×
You're viewing the program in a time zone which is different from your device's time zone
change time zone
Mon 3 Mar
Displayed time zone:
Pacific Time (US & Canada)
change
09:30 - 10:00
Break
Catering
/
Main Conference
at
Acacia D
09:30
30m
Coffee break
Break
Catering
10:00 - 11:00
Session 1: Graph Neural Networks (Session Chair: TBA)
Main Conference
at
Acacia D
10:00
20m
Talk
Helios: Efficient Distributed Dynamic Graph Sampling for Online GNN Inference
Main Conference
Jie Sun
Zhejiang University
,
Zuocheng Shi
Zhejiang University
,
Li Su
Alibaba Group
,
Wenting Shen
Alibaba Group
,
Zeke Wang
Zhejiang University
,
Yong Li
Alibaba Group
,
Wenyuan Yu
Alibaba Group
,
Wei Lin
Alibaba Group
,
Fei Wu
College of Computer Science and Technology in Zhejiang University
,
Jingren Zhou
Alibaba Group
,
Bingsheng He
National University of Singapore
10:20
20m
Talk
Accelerating GNNs on GPU Sparse Tensor Cores through N:M Sparsity-Oriented Graph Reordering
Main Conference
Jou-An Chen
North Carolina State University
,
Hsin-Hsuan Sung
North Carolina State University
,
Ruifeng Zhang
North Carolina State University
,
Ang Li
Pacific Northwest National Laboratory
,
Xipeng Shen
North Carolina State University
10:40
20m
Talk
Adaptive Parallel Training for Graph Neural Networks
Main Conference
Kaihao Ma
The Chinese University of Hong Kong
,
Renjie Liu
Southern University of Science and Technology
,
Xiao Yan
Centre for Perceptual and Interactive Intelligence (CPII)
,
Zhenkun Cai
Amazon
,
Xiang Song
Amazon Web Services
,
Minjie Wang
Amazon Web Services
,
Yichao Li
The Chinese University of Hong Kong
,
James Cheng
The Chinese University of Hong Kong
11:00 - 11:20
Break
Catering
/
Main Conference
at
Acacia D
11:00
20m
Coffee break
Break
Catering
11:20 - 12:20
Session 2: GPU I (Session Chair: TBA)
Main Conference
at
Acacia D
11:20
20m
Talk
RT–BarnesHut: Accelerating Barnes–Hut Using Ray-Tracing Hardware
Main Conference
Vani Nagarajan
Purdue University
,
Rohan Gangaraju
Purdue University
,
Kirshanthan Sundararajah
Virginia Tech
,
Artem Pelenitsyn
Purdue University
,
Milind Kulkarni
Purdue University
11:40
20m
Talk
EVeREST: An Effective and Versatile Runtime Energy Saving Tool for GPUs
Main Conference
Anna Yue
University of Minnesota at Twin Cities
,
Pen-Chung Yew
University of Minnesota at Twin Cities
,
Sanyam Mehta
HPE
12:00
20m
Talk
TurboFFT: Co-Designed High-Performance and Fault-Tolerant Fast Fourier Transform on GPUs
Main Conference
Shixun Wu
University of California, Riverside
,
Yujia Zhai
NVIDIA Corporation
,
Jinyang Liu
University of California, Riverside
,
Jiajun Huang
University of California, Riverside
,
Zizhe Jian
University of California, Riverside
,
Huangliang Dai
University of California, Riverside
,
Sheng Di
Argonne National Laboratory
,
Franck Cappello
Argonne National Laboratory
,
zizhong chen
University of California, Riverside
12:20 - 14:00
Lunch
Catering
/
Main Conference
at
Acacia D
12:20
1h40m
Lunch
Lunch
Catering
14:00 - 15:20
Session 3: Concurrent Data Structures and Synchronization I (Session Chair: TBA)
Main Conference
at
Acacia D
14:00
20m
Talk
Reciprocating Locks
Main Conference
Dave Dice
Oracle Labs
,
Alex Kogan
Oracle Labs, USA
14:20
20m
Talk
Aggregating Funnels for Faster Fetch&Add and Queues
Main Conference
Younghun Roh
MIT
,
Yuanhao Wei
University of British Columbia
,
Eric Ruppert
York University
,
Panagiota Fatourou
FORTH ICS and University of Crete, Greece
,
Siddhartha Jayanti
Google Research
,
Julian Shun
MIT
14:40
20m
Talk
Fairer and More Scalable Reader-Writer Locks by Optimizing Queue Management
Main Conference
Takashi Hoshino
Cybozu Labs, Inc.
,
Kenjiro Taura
The University of Tokyo
15:00
20m
Talk
Publish on Ping: A Better Way to Publish Reservations in Memory Reclamation for Concurrent Data Structures
Main Conference
Ajay Singh
University of Waterloo
,
Trevor Brown
University of Toronto
15:20 - 15:40
Break
Catering
/
Main Conference
at
Acacia D
15:20
20m
Coffee break
Break
Catering
15:40 - 16:40
Session 4: Memory (Session Chair: TBA)
Main Conference
at
Acacia D
15:40
20m
Talk
AC-Cache: A Memory-Efficient Caching System for Small Objects via Exploiting Access Correlations
Main Conference
Fulin Nan
Xiamen Univeristy
,
Zhirong Shen
Xiamen University
16:00
20m
Talk
Effectively Virtual Page Prefetching via Spatial-Temporal Patterns for Memory-intensive Cloud Applications
Main Conference
Yun Wang
Shanghai Jiao Tong University
,
Liang Chen
,
Tianmai Deng
Shanghai Jiao Tong University
,
Ben Luo
Alibaba Group
,
Yibin Shen
Alibaba Cloud
,
Zhixiang Wei
Shanghai Jiao Tong University
,
Yixiao Xu
Shanghai Jiao Tong University
,
Minglang Huang
Shanghai Jiao Tong University
,
Zhengwei Qi
Shanghai Jiao Tong University
16:20
20m
Talk
Harnessing Inter-GPU Shared Memory for Seamless MoE Communication-Computation Fusion
Main Conference
Hulin Wang
,
Yaqi Xia
Wuhan University
,
Donglin Yang
Nvidia Corporation
,
Xiaobo Zhou
University of Macau
,
Dazhao Cheng
WuHan University
16:40 - 17:00
Break
Catering
/
Main Conference
at
Acacia D
16:40
20m
Coffee break
Break
Catering
17:00 - 18:00
Session 5: Deep Neural Networks (Session Chair: TBA)
Main Conference
at
Acacia D
17:00
20m
Talk
FlashTensor: Optimizing Tensor Programs by Leveraging Fine-grained Tensor Property
Main Conference
Runxin Zhong
Tsinghua University
,
Yuyang Jin
Tsinghua University
,
Chen Zhang
Tsinghua University
,
Kinman Lei
,
Shuangyu Li
Tsinghua University
,
Jidong Zhai
Tsinghua University
17:20
20m
Talk
Mario: Near Zero-cost Activation Checkpointing in Pipeline Parallelism
Main Conference
Weijia Liu
Institute of Computing Technology, Chinese Academy of Sciences
,
Mingzhen Li
Institute of Computing Technology, Chinese Academy of Sciences
,
Guangming Tan
Chinese Academy of Sciences(CAS)
,
Weile Jia
Institute of Computing Technology, Chinese Academy of Sciences
17:40
20m
Talk
COMPSO: Optimizing Gradient Compression for Distributed Training with Second-Order Optimizers
Main Conference
Baixi Sun
Indiana University Bloomington
,
Weijin Liu
Stevens Institute of Technology
,
J. Gregory Pauloski
University of Chicago
,
Jiannan Tian
Indiana University
,
Jinda Jia
Indiana University
,
Daoce Wang
Indiana University
,
Boyuan Zhang
Indiana University
,
Mingkai Zheng
Department of Electrical and Computer Engineering at Rutgers University
,
Sheng Di
Argonne National Laboratory
,
Sian Jin
Temple University
,
Zhao Zhang
Peking University
,
Xiaodong Yu
Stevens Institute of Technology
,
Kamil A. Iskra
Argonne National Laboratory
,
Pete Beckman
Northwestern University and Argonne National Laboratory
,
Guangming Tan
Chinese Academy of Sciences(CAS)
,
Dingwen Tao
Indiana University
Tue 4 Mar
Displayed time zone:
Pacific Time (US & Canada)
change
09:30 - 10:00
Break
Catering
/
Main Conference
at
Acacia D
09:30
30m
Coffee break
Break
Catering
10:00 - 11:00
Session 6: Large Language Models (Session Chair: TBA)
Main Conference
at
Acacia D
10:00
20m
Talk
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
Main Conference
Elias Frantar
ISTA
,
Roberto López Castro
Universidade da Coruña
,
Jiale Chen
ISTA
,
Torsten Hoefler
ETH Zurich
,
Dan Alistarh
IST Austria
10:20
20m
Talk
WeiPipe: Weight Pipeline Parallelism for Communication-Effective Long-Context Large Model Training
Main Conference
Junfeng Lin
Tsinghua University
,
Ziming Liu
National University of Singapore
,
Yang You
National University of Singapore
,
Jun Wang
CETHIK Group Co. Ltd.
,
Weihao Zhang
Lynxi Technologies Co. Ltd
,
Rong Zhao
Tsinghua University
10:40
20m
Talk
ATTNChecker: Highly-Optimized Fault Tolerant Attention for Large Language Model Training
Main Conference
Yuhang Liang
University of Oregon
,
Xinyi Li
Pacific Northwest National Laboratory(PNNL)
,
Jie Ren
William & Mary
,
Ang Li
Pacific Northwest National Laboratory
,
Bo Fang
Pacific Northwest National Laboratory(PNNL)
,
Jieyang Chen
University of Oregon
11:00 - 11:20
Break
Catering
/
Main Conference
at
Acacia D
11:00
20m
Coffee break
Break
Catering
11:20 - 12:20
Session 7: Scheduling and Resource Management (Session Chair: TBA)
Main Conference
at
Acacia D
11:20
20m
Talk
SGDRC: Software-Defined Dynamic Resource Control for Concurrent DNN Inference on NVIDIA GPUs
Main Conference
Yongkang Zhang
HKUST
,
Haoxuan Yu
HKUST
,
Chenxia Han
CUHK
,
Cheng Wang
Alibaba Group
,
Baotong Lu
Microsoft Research
,
Yunzhe Li
Shanghai Jiaotong University
,
Zhifeng Jiang
HKUST
,
Yang Li
China University of Geosciences
,
Xiaowen Chu
Data Science and Analytics Thrust, HKUST(GZ)
,
Huaicheng Li
Virginia Tech
11:40
20m
Talk
DORADD: Deterministic Parallel Execution in the Era of Microsecond-Scale Computing
Main Conference
Scofield Liu
Imperial College London
,
Musa Unal
EPFL
,
Matthew J. Parkinson
Microsoft Azure Research
,
Marios Kogias
Imperial College London; Microsoft Research
12:00
20m
Talk
WaterWise: Co-optimizing Carbon- and Water-Footprint Toward Environmentally Sustainable Cloud Computing
Main Conference
Yankai Jiang
Northeastern University
,
Rohan Basu Roy
Northeastern University
,
Raghavendra Kanakagiri
Indian Institute of Technology Tirupati
,
Devesh Tiwari
Northeastern University
12:20 - 14:00
Lunch
Catering
/
Main Conference
at
Acacia D
12:20
1h40m
Lunch
Lunch
Catering
14:00 - 15:20
Session 8: Tensor Cores (Session Chair: TBA)
Main Conference
at
Acacia D
14:00
20m
Talk
FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores
Main Conference
Jinliang Shi
Beijing University of Posts and Telecommunications
,
Shigang Li
Beijing University of Posts and Telecommunications
,
Youxuan Xu
Beijing University of Posts and Telecommunications
,
Rongtian Fu
Beijing University of Posts and Telecommunications
,
Xueying Wang
Beijing University of Posts and Telecommunications
,
Tong Wu
Beijing University of Posts and Telecommunications
14:20
20m
Talk
Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplication with GPU Tensor Cores
Main Conference
Haisha Zhao
Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences
,
Li San
Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences
,
Jiaheng Wang
Renmin University of China
,
Chunbao Zhou
Computer Network Information Center, Chinese Academy of Sciences
,
Jue Wang
Computer Network Information Center, Chinese Academy of Sciences
,
Zhikuang Xin
Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences
,
lishunde
Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences
,
ZhiQiang Liang
Computer Network Information Center, Chinese Academy of Sciences
,
Zhijie Pan
Hangzhou Dianzi University
,
Fang Liu
Computer Network Information Center, Chinese Academy of Sciences,University of Chinese Academy of Sciences
,
Yan Zeng
Hangzhou Dianzi University
,
Yangang Wang
Computer Network Information Center, Chinese Academy of Sciences
,
Xuebin Chi
Computer Network Information Center, Chinese Academy of Sciences; University of Chinese Academy of Sciences
14:40
20m
Talk
BerryBees: Breadth First Search by Bit-Tensor-Cores
Main Conference
Yuyao Niu
Barcelona Supercomputing Center (BSC) - Universitat Politècnica de Catalunya (UPC)
,
Marc Casas
Barcelona Supercomputing Center
15:00
20m
Talk
FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient Stencil Computations on Tensor Core Units
Main Conference
Haozhi Han
Microsoft Research; Peking University
,
Kun Li
Microsoft Research
,
Wei Cui
Microsoft Research
,
Donglin Bai
Microsoft Research
,
Yiwei Zhang
UCAS; Microsoft Research
,
Liang Yuan
Chinese Academy of Sciences
,
Yifeng Cheng
Peking University
,
Yunquan Zhang
Zhang
,
Ting Cao
Microsoft Research
,
Mao Yang
Microsoft Research
15:20 - 15:40
Break
Catering
/
Main Conference
at
Acacia D
15:20
20m
Coffee break
Break
Catering
15:40 - 17:00
Session 9: Concurrent Data Structures and Synchronization II (Session Chair: TBA)
Main Conference
at
Acacia D
15:40
20m
Talk
PANNS: Enhancing Graph-based Approximate Nearest Neighbor Search through Recency-aware Construction and Parameterized Search
Main Conference
Xizhe Yin
University of California, Riverside
,
Chao Gao
University of California Riverside
,
Zhijia Zhao
University of California at Riverside
,
Rajiv Gupta
University of California at Riverside (UCR)
16:00
20m
Talk
Balanced Allocations over Efficient Queues: A Fast Relaxed FIFO Queue
Main Conference
Kåre von Geijer
Chalmers University of Technology
,
Philippas Tsigas
Chalmers University of Technology
,
Elias Johansson
Chalmers University of Technology
,
Sebastian Hermansson
Chalmers University of Technology
16:20
20m
Talk
LibRTS: A Spatial Indexing Library by Ray Tracing
Main Conference
Liang Geng
The Ohio State University, USA
,
Rubao Lee
,
Xiaodong Zhang
The Ohio State University
16:40
20m
Talk
Crystality: A Programming Model for Smart Contracts on Parallel EVMs
Main Conference
Hao Wang
International Digital Economy Academy (IDEA), Shenzhen, China; and Fullnodes Labs
,
Minghao Pan
International Digital Economy Academy (IDEA), Shenzhen, China; and Fullnodes Labs
,
Jiaping Wang
International Digital Economy Academy (IDEA), Shenzhen, China; and Fullnodes Labs
Wed 5 Mar
Displayed time zone:
Pacific Time (US & Canada)
change
09:30 - 10:00
Break
Catering
/
Main Conference
at
Acacia D
09:30
30m
Coffee break
Break
Catering
10:00 - 11:20
Session 10: GPU II (Session Chair: TBA)
Main Conference
at
Acacia D
10:00
20m
Talk
Popcorn: Accelerating Kernel K-means on GPUs through Sparse Linear Algebra
Main Conference
Julian Bellavita
Cornell University
,
Thomas Pasquali
University of Trento
,
Laura Del Rio
University of Trento
,
Flavio Vella
Free University of Bozen
,
Giulia Guidi
Cornell University
10:20
20m
Talk
Swift Unfolding of Communities: GPU-Accelerated Louvain Algorithm
Main Conference
Zhibin Wang
Nanjing University
,
Xi Lin
Nanjing University
,
Xue Li
Alibaba Group
,
Pinhuan Wang
Rutgers, The State University of New Jersey
,
Ziheng Meng
Nanjing University
,
Hang Liu
Rutgers, The State University of New Jersey
,
Chen Tian
Nanjing University
,
Sheng Zhong
Nanjing University
10:40
20m
Talk
GLUMIN: Fast Connectivity Check Based on LUTs For Efficient Graph Pattern Mining
Main Conference
Weichen Cao
Institute of Computing Technology, Chinese Academy of Sciences
,
Ke Meng
Chinese Academy of Sciences
,
linzhiheng
Institute of Computing Technology, Chinese Academy of Sciences
,
Guangming Tan
Chinese Academy of Sciences(CAS)
11:00
20m
Talk
Improving Tridiagonalization Performance on GPU Architectures
Main Conference
WangHansheng
University of Electronic Science and Technology of China
,
Zhekai Duan
University of Edinburgh
,
Zitian Zhao
University of Electronic Science and Technology of China
,
Siqi Wu
University of Electronic Science and Technology of China
,
Saiqi Zheng
Xi'an Jiaotong-Liverpool University
,
Qiao Li
University of Electronic Science and Technology of China
,
Xu Jiang
University of Electronic Science and Technology of China
,
Shaoshuai Zhang
11:20 - 11:40
Break
Catering
/
Main Conference
at
Acacia D
11:20
20m
Coffee break
Break
Catering
11:40 - 13:00
Session 11: Parallel Algorithms and Applications (Session Chair: TBA)
Main Conference
at
Acacia D
11:40
20m
Talk
Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessellating Swizzled Registers
Main Conference
Yiwei Zhang
UCAS; Microsoft Research
,
Kun Li
Microsoft Research
,
Liang Yuan
Chinese Academy of Sciences
,
Haozhi Han
Microsoft Research; Peking University
,
Yunquan Zhang
Zhang
,
Ting Cao
Microsoft Research
,
Mao Yang
Microsoft Research
12:00
20m
Talk
Semi-StructMG: A Fast and Scalable Semi-Structured Algebraic Multigrid
Main Conference
Yi Zong
Tsinghua University
,
Chensong Zhang
Academy of Mathematics and Systems Science
,
Longjiang Mu
Laoshan Laboratory
,
Jianchun Wang
China Ship Scientific Research Center
,
Jian Sun
CMA Earth System Modeling and Prediction Center
,
Xiaowen Xu
Institute of Applied Physics and Computational Mathematics
,
Xinliang Wang
Huawei Technologies Co., Ltd
,
Peinan Yu
Tsinghua University
,
Wei Xue
Tsinghua University
12:20
20m
Talk
SBMGT: Scaling Bayesian Multinomial Group Testing
Main Conference
Weicong Chen
University of California, Merced
,
Hao Qi
University of California, Merced
,
Curtis Tatsuoka
University of Pittsburgh
,
Xiaoyi Lu
UC Merced
12:40
20m
Talk
An AI-Enhanced 1km-Resolution Seamless Global Weather and Climate Model to Achieve Year-Scale Simulation Speed using 34 Million Cores
Main Conference
Xiaohui Duan
Shandong University
,
Yi Zhang
PIESAT Information Technology,Co. Ltd.
,
Kai Xu
Laoshan Laboratory
,
Haohuan Fu
Tsinghua University
,
Bin Yang
Tianjin University
,
Yiming Wang
PIESAT Information Technology,Co. Ltd.
,
Yilun Han
Tsinghua University
,
Siyuan Chen
PIESAT Information Technology,Co. Ltd.
,
Zhuangzhuang Zhou
National Supercomputing Center in Wuxi
,
Chenyu Wang
National Supercomputing Center in Wuxi
,
Dongqiang Huang
National Supercomputing Center in Wuxi
,
Huihai An
Shandong University
,
Xiting Ju
Tsinghua University
,
Haopeng Huang
Tsinghua University
,
Zhuang Liu
Tsinghua University
,
Wei Xue
Tsinghua
,
Weiguo Liu
Shandong University
,
Bowen Yan
Tsinghua University
,
Jianye Hou
The Chinese University of Hong Kong
,
Maoxue Yu
Laoshan Laboratory
,
Wenguang Chen
Tsinghua University; Pengcheng Laboratory
,
Jian Li
Chinese Academy of Meteorological Sciences
,
Zhao Jing
Laoshan Laboratory
,
Hailong Liu
Laoshan Laboratory
,
Lixin Wu
Laoshan Laboratory
Mon 3 Mar
Displayed time zone:
Pacific Time (US & Canada)
change
Room
9:00
30
10:00
30
11:00
30
12:00
30
13:00
30
14:00
30
15:00
30
16:00
30
17:00
30
Acacia D
Catering + Main Conference
Break
Main Conference
Session 1: Graph Neural Networks (Session Chair: TBA)
Catering + Main Conference
Break
Main Conference
Session 2: GPU I (Session Chair: TBA)
Catering + Main Conference
Lunch
Main Conference
Session 3: Concurrent Data Structures and Synchronization I (Session Chair: TBA)
Catering + Main Conference
Break
Main Conference
Session 4: Memory (Session Chair: TBA)
Catering + Main Conference
Break
Main Conference
Session 5: Deep Neural Networks (Session Chair: TBA)
Tue 4 Mar
Displayed time zone:
Pacific Time (US & Canada)
change
Room
9:00
30
10:00
30
11:00
30
12:00
30
13:00
30
14:00
30
15:00
30
16:00
30
Acacia D
Catering + Main Conference
Break
Main Conference
Session 6: Large Language Models (Session Chair: TBA)
Catering + Main Conference
Break
Main Conference
Session 7: Scheduling and Resource Management (Session Chair: TBA)
Catering + Main Conference
Lunch
Main Conference
Session 8: Tensor Cores (Session Chair: TBA)
Catering + Main Conference
Break
Main Conference
Session 9: Concurrent Data Structures and Synchronization II (Session Chair: TBA)
Wed 5 Mar
Displayed time zone:
Pacific Time (US & Canada)
change
Room
9:00
30
10:00
30
11:00
30
12:00
30
Acacia D
Catering + Main Conference
Break
Main Conference
Session 10: GPU II (Session Chair: TBA)
Catering + Main Conference
Break
Main Conference
Session 11: Parallel Algorithms and Applications (Session Chair: TBA)
Mon 3 Mar
Displayed time zone:
Pacific Time (US & Canada)
change
Room
9:00
15
30
45
10:00
15
30
45
11:00
15
30
45
12:00
15
30
45
13:00
15
30
45
14:00
15
30
45
15:00
15
30
45
16:00
15
30
45
17:00
15
30
45
Acacia D
PPoPP Catering
Break
09:30 - 10:00
PPoPP Main Conference
Helios: Efficient Distributed Dynamic Graph Sampling for Online GNN Inf ...
10:00 - 10:20
PPoPP Main Conference
Accelerating GNNs on GPU Sparse Tensor Cores through N:M Sparsity-Orien ...
10:20 - 10:40
PPoPP Main Conference
Adaptive Parallel Training for Graph Neural Networks
10:40 - 11:00
PPoPP Catering
Break
11:00 - 11:20
PPoPP Main Conference
RT–BarnesHut: Accelerating Barnes–Hut Using Ray-Tracing Hardware
11:20 - 11:40
PPoPP Main Conference
EVeREST: An Effective and Versatile Runtime Energy Saving Tool for GPUs
11:40 - 12:00
PPoPP Main Conference
TurboFFT: Co-Designed High-Performance and Fault-Tolerant Fast Fourier ...
12:00 - 12:20
PPoPP Catering
Lunch
12:20 - 14:00
PPoPP Main Conference
Reciprocating Locks
14:00 - 14:20
PPoPP Main Conference
Aggregating Funnels for Faster Fetch&Add and Queues
14:20 - 14:40
PPoPP Main Conference
Fairer and More Scalable Reader-Writer Locks by Optimizing Queue Management
14:40 - 15:00
PPoPP Main Conference
Publish on Ping: A Better Way to Publish Reservations in Memory Reclama ...
15:00 - 15:20
PPoPP Catering
Break
15:20 - 15:40
PPoPP Main Conference
AC-Cache: A Memory-Efficient Caching System for Small Objects via Explo ...
15:40 - 16:00
PPoPP Main Conference
Effectively Virtual Page Prefetching via Spatial-Temporal Patterns for ...
16:00 - 16:20
PPoPP Main Conference
Harnessing Inter-GPU Shared Memory for Seamless MoE Communication-Compu ...
16:20 - 16:40
PPoPP Catering
Break
16:40 - 17:00
PPoPP Main Conference
FlashTensor: Optimizing Tensor Programs by Leveraging Fine-grained Tens ...
17:00 - 17:20
PPoPP Main Conference
Mario: Near Zero-cost Activation Checkpointing in Pipeline Parallelism
17:20 - 17:40
PPoPP Main Conference
COMPSO: Optimizing Gradient Compression for Distributed Training with S ...
17:40 - 18:00
Tue 4 Mar
Displayed time zone:
Pacific Time (US & Canada)
change
Room
9:00
15
30
45
10:00
15
30
45
11:00
15
30
45
12:00
15
30
45
13:00
15
30
45
14:00
15
30
45
15:00
15
30
45
16:00
15
30
45
Acacia D
PPoPP Catering
Break
09:30 - 10:00
PPoPP Main Conference
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Lan ...
10:00 - 10:20
PPoPP Main Conference
WeiPipe: Weight Pipeline Parallelism for Communication-Effective Long-C ...
10:20 - 10:40
PPoPP Main Conference
ATTNChecker: Highly-Optimized Fault Tolerant Attention for Large Langua ...
10:40 - 11:00
PPoPP Catering
Break
11:00 - 11:20
PPoPP Main Conference
SGDRC: Software-Defined Dynamic Resource Control for Concurrent DNN Inf ...
11:20 - 11:40
PPoPP Main Conference
DORADD: Deterministic Parallel Execution in the Era of Microsecond-Scal ...
11:40 - 12:00
PPoPP Main Conference
WaterWise: Co-optimizing Carbon- and Water-Footprint Toward Environment ...
12:00 - 12:20
PPoPP Catering
Lunch
12:20 - 14:00
PPoPP Main Conference
FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix M ...
14:00 - 14:20
PPoPP Main Conference
Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplicat ...
14:20 - 14:40
PPoPP Main Conference
BerryBees: Breadth First Search by Bit-Tensor-Cores
14:40 - 15:00
PPoPP Main Conference
FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient S ...
15:00 - 15:20
PPoPP Catering
Break
15:20 - 15:40
PPoPP Main Conference
PANNS: Enhancing Graph-based Approximate Nearest Neighbor Search throug ...
15:40 - 16:00
PPoPP Main Conference
Balanced Allocations over Efficient Queues: A Fast Relaxed FIFO Queue
16:00 - 16:20
PPoPP Main Conference
LibRTS: A Spatial Indexing Library by Ray Tracing
16:20 - 16:40
PPoPP Main Conference
Crystality: A Programming Model for Smart Contracts on Parallel EVMs
16:40 - 17:00
Wed 5 Mar
Displayed time zone:
Pacific Time (US & Canada)
change
Room
9:00
15
30
45
10:00
15
30
45
11:00
15
30
45
12:00
15
30
45
Acacia D
PPoPP Catering
Break
09:30 - 10:00
PPoPP Main Conference
Popcorn: Accelerating Kernel K-means on GPUs through Sparse Linear Algebra
10:00 - 10:20
PPoPP Main Conference
Swift Unfolding of Communities: GPU-Accelerated Louvain Algorithm
10:20 - 10:40
PPoPP Main Conference
GLUMIN: Fast Connectivity Check Based on LUTs For Efficient Graph Patte ...
10:40 - 11:00
PPoPP Main Conference
Improving Tridiagonalization Performance on GPU Architectures
11:00 - 11:20
PPoPP Catering
Break
11:20 - 11:40
PPoPP Main Conference
Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessella ...
11:40 - 12:00
PPoPP Main Conference
Semi-StructMG: A Fast and Scalable Semi-Structured Algebraic Multigrid
12:00 - 12:20
PPoPP Main Conference
SBMGT: Scaling Bayesian Multinomial Group Testing
12:20 - 12:40
PPoPP Main Conference
An AI-Enhanced 1km-Resolution Seamless Global Weather and Climate Model ...
12:40 - 13:00
x
Wed 5 Feb 10:37