POSTER: Frontier-guided Graph Reordering (PPoPP 2025 - Main Conference)

Who

Xinmiao Zhang, Cheng Liu, Shengwen Liang, Chenwei Xiong, Yu Zhang, Lei Zhang, Huawei Li, Xiaowei Li

Track

PPoPP 2025 Main Conference

Time Zone

The program is currently displayed in (GMT-08:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sun 2 Mar 2025 18:00 - 20:00 at Acacia C&D - Poster Session

Abstract

Graph processing suffers from severe locality challenges due to considerable inefficient irregular memory accesses, which mainly originate from random accesses to neighbors of active vertices (a.k.a frontiers). Graph reordering, which assigns continuous IDs to vertices that are more likely to be accessed consecutively, can improve access locality effectively and has demonstrated significant speedups across various architectures and systems. Existing graph reordering methods primarily explore the overlapping intensity of in-neighbor vertices for the data access locality characterization. However, many graph algorithms often activate a fraction of the vertices across the graphs, which vary substantially over different inputs and processing iterations. Many of these vertices are neither connected nor have any shared neighbors, but they are actually processed at the same time and exhibit potential data access locality, which is generally overlooked in prior graph reordering methods.

We notice that the data locality between concurrently activated vertices are usually attributed to the overlapped $k$-order in-neighbors. As the number of $k$-order in-neighbors grows explosively, it is unacceptably time-consuming to analyze the overlapping of $k$-order in-neighbors for graph reordering directly. In this case, we propose to replace the overlapping calculation of $k$-order in-neighbors with frontier distribution analysis of a few BFS samplings. Specifically, we profile the frontiers distributed across iterations of different BFS samplings first and build a feature vector based on the activated iteration order of each vertex in the BFS samplings. On top of the feature vectors, we propose FrontOrder, which has a customized distance metric to characterize the locality between different vertices and leverages $K$-means to cluster vertices with high locality to guide graph reordering. In addition, FrontOrder also takes the load balance into consideration by predicting the runtime computing intensity with the learned clusters of vertices. According to our experiments, FrontOrder delivers an average performance speedup of 2.65$\times$ and 1.73$\times$ on Ligra and GPOP, respectively, and consistently outperforms the state-of-the-art graph reordering methods on a set of representative graph algorithms and datasets with moderate preprocessing overhead.

Xinmiao Zhang

SKLP, Institute of Computing Technology, CAS

China

Cheng Liu

ICT CAS

Shengwen Liang

SKLP, Institute of Computing Technology, CAS

Chenwei Xiong

SKLP, Institute of Computing Technology, CAS

Yu Zhang

School of Computer Science and Technology, Huazhong University of Science and Technology

Lei Zhang

ICT CAS

Huawei Li

SKLP, Institute of Computing Technology, CAS

Xiaowei Li

SKLP, Institute of Computing Technology, CAS

Time Zone

The program is currently displayed in (GMT-08:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Sun 2 Mar
Displayed time zone: Pacific Time (US & Canada) change

18:00 - 20:00	Poster SessionMain Conference at Acacia C&D

18:00 2h Poster		POSTER: A General and Scalable GCN Training Framework on CPU Supercomputers Main Conference Chen Zhuang Tokyo Institute of Technology, Riken Center for Computational Science, Peng Chen National Institute of Advanced Industrial Science and Technology, Xin Liu National Institute of Advanced Industrial Science & Technology, Rio Yokota Tokyo Institute of Technology, Nikoli Dryden Lawrence Livermore National Laboratory, Toshio Endo Tokyo Institute of Technology, Satoshi Matsuoka RIKEN, Mohamed Wahib RIKEN Center for Computational Science
18:00 2h Poster		POSTER: Triangle Counting on Tensor Cores Main Conference YuAng Chen The Chinese University of Hong Kong, Jeffrey Xu Yu The Chinese University of Hong Kong
18:00 2h Poster		POSTER: Minimizing speculation overhead in a parallel recognizer for regular texts Main Conference Angelo Borsotti Politecnico di Milano, Luca Breveglieri Politecnico di Milano, Stefano Crespi Reghizzi Politecnico di Milano and CNR-EIIT, Angelo Morzenti Politecnico di Milano
18:00 2h Poster		POSTER: Boost Lock-free Queue and Stack with Batching Main Conference Ao Li Wuhan University, Wenhai Li Wuhan University, Yuan Chen Wuhan University, Lingfeng Deng Wuhan University
18:00 2h Poster		POSTER: Frontier-guided Graph Reordering Main Conference Xinmiao Zhang SKLP, Institute of Computing Technology, CAS, Cheng Liu ICT CAS, Shengwen Liang SKLP, Institute of Computing Technology, CAS, Chenwei Xiong SKLP, Institute of Computing Technology, CAS, Yu Zhang School of Computer Science and Technology, Huazhong University of Science and Technology, Lei Zhang ICT CAS, Huawei Li SKLP, Institute of Computing Technology, CAS, Xiaowei Li SKLP, Institute of Computing Technology, CAS
18:00 2h Poster		POSTER: Big Atomics and Fast Concurrent Hash Tables Main Conference Daniel Anderson Carnegie Mellon University, Guy E. Blelloch Carnegie Mellon University, USA, Siddhartha Jayanti Google Research
18:00 2h Poster		POSTER: FastBWA: Practical and Cost-Efficient Genome Sequence Alignment Pipeline Main Conference Zhonghai Zhang Institute of Computing Technology, Chinese Academy of Sciences / University of Chinese Academy of Sciences, Yewen Li Institute of Computing Technology, Chinese Academy of Sciences / University of Chinese Academy of Sciences, Ke Meng Chinese Academy of Sciences, Chunming Zhang Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan Chinese Academy of Sciences(CAS)
18:00 2h Poster		POSTER: Transactional Data Structures with Orthogonal Metadata Main Conference Yaodong Sheng Lehigh University, Ahmed Hassan Lehigh University, Michael Spear Lehigh University
18:00 2h Poster		POSTER: High-performance Visual Semantics Compression for AI-Driven Science Main Conference Boyuan Zhang Indiana University, Luanzheng Guo Pacific Northwest National Laboratory, Jiannan Tian Indiana University, Jinyang Liu University of California, Riverside, Daoce Wang Indiana University, Fanjiang Ye Indiana University, Chengming Zhang University of Alabama, Jan Strube Pacific Northwest National Laboratory, Nathan R. Tallent Pacific Northwest National Laboratory, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences
18:00 2h Poster		POSTER: Magneto: Accelerating Parallel Structures in DNNs via Co-Optimization of Operators Main Conference Zhanyuan Di State Key Lab of Processors, Institute of Computing Technology, CAS, Leping Wang State Key Lab of Processors, Institute of Computing Technology, CAS, Beijing, Ziyi Ren State Key Lab of Processors, Institute of Computing Technology, CAS, En Shao State Key Lab of Processors, Institute of Computing Technology, CAS, Beijing, Jie Zhao Hunan University, Siyuan Feng Shanghai Jiao Tong University, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan Chinese Academy of Sciences(CAS), Ninghui Sun State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, University of Chinese Academy of Sciences
18:00 2h Poster		POSTER: TENSORMD: Molecular Dynamics Simulation with Ab Initio Accuracy of 50 Billion Atoms Main Conference Yucheng Ouyang Institute of Computing Technology, Chinese Academy of Sciences, Xin Chen Institute of Applied Physics and Computational Mathematics, Ying Liu Institute of Computing Technology, Chinese Academy of Sciences, Xin Chen , Honghui Shang Institute of Computing Technology, Chinese Academy of Sciences, Zhenchuan Chen Institute of Computing Technology, Chinese Academy of Sciences, Rongfen Lin National Research Center of Parallel Computer Engineering and Technology, Xingyu Gao Institute of Applied Physics and Computational Mathematics, Lifang Wang Institute of Applied Physics and Computational Mathematics, Fang Li National Research Center of Parallel Computer Engineering and Technology, Jiahao Shan Institute of Computing Technology, Chinese Academy of Sciences, Haifeng Song Institute of Applied Physics and Computational Mathematics, Huimin Cui Institute of Computing Technology, Chinese Academy of Sciences, Xiaobing Feng ICT CAS