Sun 2 Mar 2025 18:00 - 20:00 at Acacia C&D - Poster Session

Concurrent queues and stacks are important components of many software systems.
They are well-known contended data structures due to intense contention on hotspots, resulting in limited performance and poor scalability. To ensure robust performance, they pursue lock-free progress to avoid adverse effects brought by complex systems environments, such as scheduling. To achieve higher performance, several attempts applied the batching technique, that packs a group of standard operations into a single batch for execution, to lock-free queues and stacks bulit upon the linked list, but still inherited the contended compare-and-swap (\texttt{CAS}). Especially for concurrent queues, these attempts fail to outperform modern queues constructed by fetch-and-add (\texttt{FAA}) under high contention. Unfortunately, it can be very difficult to integrate the batching technique into \texttt{FAA}-based queues for improvement due to their array nature. In this paper, we creatively employ an auxiliary version log to construct multiple versions of the queue or stack to make the batching technique benefit from \texttt{FAA}, getting rid of the inherent \texttt{CAS} contention. In our evaluation, our new design can achieve 12.85x throughput of the state-of-the-art lock-free queue supporting batching, and achieve 4.19x throughput of the fastest lock-free queue when the batch size is 50. Also, for concurrent stacks, our design can achieve 2.9x throughput of the existing work supporting batching and 46.41x throughput of the classic lock-free stack when the batch size is 50.

Sun 2 Mar

Displayed time zone: Pacific Time (US & Canada) change

18:00 - 20:00
Poster SessionMain Conference at Acacia C&D
18:00
2h
Poster
POSTER: A General and Scalable GCN Training Framework on CPU Supercomputers
Main Conference
Chen Zhuang Tokyo Institute of Technology, Riken Center for Computational Science, Peng Chen National Institute of Advanced Industrial Science and Technology, Xin Liu National Institute of Advanced Industrial Science & Technology, Rio Yokota Tokyo Institute of Technology, Nikoli Dryden Lawrence Livermore National Laboratory, Toshio Endo Tokyo Institute of Technology, Satoshi Matsuoka RIKEN, Mohamed Wahib RIKEN Center for Computational Science
18:00
2h
Poster
POSTER: Triangle Counting on Tensor Cores
Main Conference
YuAng Chen The Chinese University of Hong Kong, Jeffrey Xu Yu The Chinese University of Hong Kong
18:00
2h
Poster
POSTER: Minimizing speculation overhead in a parallel recognizer for regular texts
Main Conference
Angelo Borsotti Politecnico di Milano, Luca Breveglieri Politecnico di Milano, Stefano Crespi Reghizzi Politecnico di Milano and CNR-EIIT, Angelo Morzenti Politecnico di Milano
18:00
2h
Poster
POSTER: Boost Lock-free Queue and Stack with Batching
Main Conference
Ao Li Wuhan University, Wenhai Li Wuhan University, Yuan Chen Wuhan University, Lingfeng Deng Wuhan University
18:00
2h
Poster
POSTER: Frontier-guided Graph Reordering
Main Conference
Xinmiao Zhang SKLP, Institute of Computing Technology, CAS, Cheng Liu ICT CAS, Shengwen Liang SKLP, Institute of Computing Technology, CAS, Chenwei Xiong SKLP, Institute of Computing Technology, CAS, Yu Zhang School of Computer Science and Technology, Huazhong University of Science and Technology, Lei Zhang ICT CAS, Huawei Li SKLP, Institute of Computing Technology, CAS, Xiaowei Li SKLP, Institute of Computing Technology, CAS
18:00
2h
Poster
POSTER: Big Atomics and Fast Concurrent Hash Tables
Main Conference
Daniel Anderson Carnegie Mellon University, Guy E. Blelloch Carnegie Mellon University, USA, Siddhartha Jayanti Google Research
18:00
2h
Poster
POSTER: FastBWA: Practical and Cost-Efficient Genome Sequence Alignment Pipeline
Main Conference
Zhonghai Zhang Institute of Computing Technology, Chinese Academy of Sciences / University of Chinese Academy of Sciences, Yewen Li Institute of Computing Technology, Chinese Academy of Sciences / University of Chinese Academy of Sciences, Ke Meng Chinese Academy of Sciences, Chunming Zhang Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan Chinese Academy of Sciences(CAS)
18:00
2h
Poster
POSTER: Transactional Data Structures with Orthogonal Metadata
Main Conference
Yaodong Sheng Lehigh University, Ahmed Hassan Lehigh University, Michael Spear Lehigh University
18:00
2h
Poster
POSTER: High-performance Visual Semantics Compression for AI-Driven Science
Main Conference
Boyuan Zhang Indiana University, Luanzheng Guo Pacific Northwest National Laboratory, Jiannan Tian Indiana University, Jinyang Liu University of California, Riverside, Daoce Wang Indiana University, Fanjiang Ye Indiana University, Chengming Zhang University of Alabama, Jan Strube Pacific Northwest National Laboratory, Nathan R. Tallent Pacific Northwest National Laboratory, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences
18:00
2h
Poster
POSTER: Magneto: Accelerating Parallel Structures in DNNs via Co-Optimization of Operators
Main Conference
Zhanyuan Di State Key Lab of Processors, Institute of Computing Technology, CAS, Leping Wang State Key Lab of Processors, Institute of Computing Technology, CAS, Beijing, Ziyi Ren State Key Lab of Processors, Institute of Computing Technology, CAS, En Shao State Key Lab of Processors, Institute of Computing Technology, CAS, Beijing, Jie Zhao Hunan University, Siyuan Feng Shanghai Jiao Tong University, Dingwen Tao Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan Chinese Academy of Sciences(CAS), Ninghui Sun State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, University of Chinese Academy of Sciences
18:00
2h
Poster
POSTER: TENSORMD: Molecular Dynamics Simulation with Ab Initio Accuracy of 50 Billion Atoms
Main Conference
Yucheng Ouyang Institute of Computing Technology, Chinese Academy of Sciences, Xin Chen Institute of Applied Physics and Computational Mathematics, Ying Liu Institute of Computing Technology, Chinese Academy of Sciences, Xin Chen , Honghui Shang Institute of Computing Technology, Chinese Academy of Sciences, Zhenchuan Chen Institute of Computing Technology, Chinese Academy of Sciences, Rongfen Lin National Research Center of Parallel Computer Engineering and Technology, Xingyu Gao Institute of Applied Physics and Computational Mathematics, Lifang Wang Institute of Applied Physics and Computational Mathematics, Fang Li National Research Center of Parallel Computer Engineering and Technology, Jiahao Shan Institute of Computing Technology, Chinese Academy of Sciences, Haifeng Song Institute of Applied Physics and Computational Mathematics, Huimin Cui Institute of Computing Technology, Chinese Academy of Sciences, Xiaobing Feng ICT CAS