Popcorn: Accelerating Kernel K-means on GPUs through Sparse Linear Algebra (PPoPP 2025 - Main Conference)

Who

Julian Bellavita, Thomas Pasquali, Laura Del Rio, Flavio Vella, Giulia Guidi

Track

PPoPP 2025 Main Conference

Time Zone

The program is currently displayed in (GMT-08:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 5 Mar 2025 10:00 - 10:20 at Acacia D - Session 10: GPU II (Session Chair: Zhijia Zhao)

Abstract

K-means is a versatile and popular clustering algorithm with appli- cations in numerous scientific and engineering areas, such as com- putational biology, economics, and machine learning. One drawback of K-means is its inability to identify non-linearly separable clusters, which may lead to inaccurate solutions in certain cases. Kernel K- means is a variant of classical K-means that can find non-linearly separable clusters. However, it scales quadratically with respect to the size of the dataset, taking several minutes to cluster even medium-sized datasets on traditional CPU-based machines. In this paper, we present a formulation of Kernel K-means us- ing sparse-dense matrix multiplication (SpMM) and sparse matrix- vector multiplication (SpMV), and we show that our formulation enables the rapid implementation of a performant GPU-based ver- sion of Kernel K-means with minimal programming effort. Our implementation, named Popcorn, is the first open-source GPU- based implementation of Kernel K-means. Popcorn achieves a speedup of up to 123.8× over a CPU imple- mentation of Kernel K-means and a speedup of up to 2.6× over a GPU implementation of Kernel K-means that does not use sparse matrix computations. Our results support the effectiveness of sparse matrices as tools for efficient parallel programming.

Julian Bellavita

Cornell University

Thomas Pasquali

University of Trento

Laura Del Rio

University of Trento

Flavio Vella

Free University of Bozen

Italy

Giulia Guidi