K-means is a versatile and popular clustering algorithm with appli- cations in numerous scientific and engineering areas, such as com- putational biology, economics, and machine learning. One drawback of K-means is its inability to identify non-linearly separable clusters, which may lead to inaccurate solutions in certain cases. Kernel K- means is a variant of classical K-means that can find non-linearly separable clusters. However, it scales quadratically with respect to the size of the dataset, taking several minutes to cluster even medium-sized datasets on traditional CPU-based machines. In this paper, we present a formulation of Kernel K-means us- ing sparse-dense matrix multiplication (SpMM) and sparse matrix- vector multiplication (SpMV), and we show that our formulation enables the rapid implementation of a performant GPU-based ver- sion of Kernel K-means with minimal programming effort. Our implementation, named Popcorn, is the first open-source GPU- based implementation of Kernel K-means. Popcorn achieves a speedup of up to 123.8× over a CPU imple- mentation of Kernel K-means and a speedup of up to 2.6× over a GPU implementation of Kernel K-means that does not use sparse matrix computations. Our results support the effectiveness of sparse matrices as tools for efficient parallel programming.

Wed 5 Mar

Displayed time zone: Pacific Time (US & Canada) change

10:00 - 11:20
Session 10: GPU II (Session Chair: Zhijia Zhao)Main Conference at Acacia D
10:00
20m
Talk
Popcorn: Accelerating Kernel K-means on GPUs through Sparse Linear Algebra
Main Conference
Julian Bellavita Cornell University, Thomas Pasquali University of Trento, Laura Del Rio University of Trento, Flavio Vella Free University of Bozen, Giulia Guidi Cornell University
10:20
20m
Talk
Swift Unfolding of Communities: GPU-Accelerated Louvain Algorithm
Main Conference
Zhibin Wang Nanjing University, Xi Lin Nanjing University, Xue Li Alibaba Group, Pinhuan Wang Rutgers, The State University of New Jersey, Ziheng Meng Nanjing University, Hang Liu Rutgers, The State University of New Jersey, Chen Tian Nanjing University, Sheng Zhong Nanjing University
10:40
20m
Talk
GLUMIN: Fast Connectivity Check Based on LUTs For Efficient Graph Pattern Mining
Main Conference
Weichen Cao Institute of Computing Technology, Chinese Academy of Sciences, Ke Meng Chinese Academy of Sciences, linzhiheng Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan Chinese Academy of Sciences(CAS)
11:00
20m
Talk
Improving Tridiagonalization Performance on GPU Architectures
Main Conference
WangHansheng University of Electronic Science and Technology of China, Zhekai Duan University of Edinburgh, Zitian Zhao University of Electronic Science and Technology of China, Siqi Wu University of Electronic Science and Technology of China, Saiqi Zheng Xi'an Jiaotong-Liverpool University, Qiao Li University of Electronic Science and Technology of China, Xu Jiang University of Electronic Science and Technology of China, Shaoshuai Zhang