This package consists of a small extension library of highly optimized graph cluster algorithms for the use in PyTorch.
The package consists of the following clustering algorithms:
where ${CUDA} should be replaced by either cpu, cu126, cu128, or cu130 depending on your PyTorch installation.
cpu
cu126
cu128
cu130
Linux
✅
✅
✅
✅
Windows
✅
✅
✅
✅
macOS
✅
Note: Binaries of older versions are also provided for PyTorch 1.4.0, PyTorch 1.5.0, PyTorch 1.6.0, PyTorch 1.7.0/1.7.1, PyTorch 1.8.0/1.8.1, PyTorch 1.9.0, PyTorch 1.10.0/1.10.1/1.10.2, PyTorch 1.11.0, PyTorch 1.12.0/1.12.1, PyTorch 1.13.0/1.13.1, PyTorch 2.0.0/2.0.1, PyTorch 2.1.0/2.1.1/2.1.2, PyTorch 2.2.0/2.2.1/2.2.2, PyTorch 2.3.0/2.3.1, PyTorch 2.4.0/2.4.1, PyTorch 2.5.0/2.5.1, PyTorch 2.6.0, PyTorch 2.7.0/2.7.1, and PyTorch 2.8.0 (following the same procedure).
For older versions, you need to explicitly specify the latest supported version number or install via pip install --no-index in order to prevent a manual installation from source.
You can look up the latest supported version number here.
From source
Ensure that at least PyTorch 1.4.0 is installed and verify that cuda/bin and cuda/include are in your $PATH and $CPATH respectively, e.g.:
When running in a docker container without NVIDIA driver, PyTorch needs to evaluate the compute capabilities and may fail.
In this case, ensure that the compute capabilities are set via TORCH_CUDA_ARCH_LIST, e.g.:
A greedy clustering algorithm of picking an unmarked vertex and matching it with one its unmarked neighbors (that maximizes its edge weight).
The GPU algorithm is adapted from Fagginger Auer and Bisseling: A GPU Algorithm for Greedy Graph Matching (LNCS 2012)
A sampling algorithm, which iteratively samples the most distant point with regard to the rest points.
import torch
from torch_cluster import fps
x = torch.tensor([[-1., -1.], [-1., 1.], [1., -1.], [1., 1.]])
batch = torch.tensor([0, 0, 0, 0])
index = fps(x, batch, ratio=0.5, random_start=False)
print(index)
tensor([0, 3])
kNN-Graph
Computes graph edges to the nearest k points.
Args:
x(Tensor): Node feature matrix of shape [N, F].
k(int): The number of neighbors.
batch(LongTensor, optional): Batch vector of shape [N], which assigns each node to a specific example. batch needs to be sorted. (default: None)
loop(bool, optional): If True, the graph will contain self-loops. (default: False)
flow(string, optional): The flow direction when using in combination with message passing ("source_to_target" or "target_to_source"). (default: "source_to_target")
cosine(boolean, optional): If True, will use the Cosine distance instead of Euclidean distance to find nearest neighbors. (default: False)
num_workers(int): Number of workers to use for computation. Has no effect in case batch is not None, or the input lies on the GPU. (default: 1)
Computes graph edges to all points within a given distance.
Args:
x(Tensor): Node feature matrix of shape [N, F].
r(float): The radius.
batch(LongTensor, optional): Batch vector of shape [N], which assigns each node to a specific example. batch needs to be sorted. (default: None)
loop(bool, optional): If True, the graph will contain self-loops. (default: False)
max_num_neighbors(int, optional): The maximum number of neighbors to return for each element. If the number of actual neighbors is greater than max_num_neighbors, returned neighbors are picked randomly. (default: 32)
flow(string, optional): The flow direction when using in combination with message passing ("source_to_target" or "target_to_source"). (default: "source_to_target")
num_workers(int): Number of workers to use for computation. Has no effect in case batch is not None, or the input lies on the GPU. (default: 1)
torch-cluster also offers a C++ API that contains C++ equivalent of python models.
export Torch_DIR=`python -c 'import torch;print(torch.utils.cmake_prefix_path)'`
mkdir build
cd build
# Add -DWITH_CUDA=on support for the CUDA if needed
cmake ..
make
make install
PyTorch Cluster
This package consists of a small extension library of highly optimized graph cluster algorithms for the use in PyTorch. The package consists of the following clustering algorithms:
All included operations work on varying data types and are implemented both for CPU and GPU.
Installation
Binaries
We provide pip wheels for all major OS/PyTorch/CUDA combinations, see here.
PyTorch 2.11
To install the binaries for PyTorch 2.11, simply run
where
${CUDA}should be replaced by eithercpu,cu126,cu128, orcu130depending on your PyTorch installation.cpucu126cu128cu130PyTorch 2.10
To install the binaries for PyTorch 2.10, simply run
where
${CUDA}should be replaced by eithercpu,cu126,cu128, orcu130depending on your PyTorch installation.cpucu126cu128cu130PyTorch 2.9
To install the binaries for PyTorch 2.9, simply run
where
${CUDA}should be replaced by eithercpu,cu126,cu128, orcu130depending on your PyTorch installation.cpucu126cu128cu130Note: Binaries of older versions are also provided for PyTorch 1.4.0, PyTorch 1.5.0, PyTorch 1.6.0, PyTorch 1.7.0/1.7.1, PyTorch 1.8.0/1.8.1, PyTorch 1.9.0, PyTorch 1.10.0/1.10.1/1.10.2, PyTorch 1.11.0, PyTorch 1.12.0/1.12.1, PyTorch 1.13.0/1.13.1, PyTorch 2.0.0/2.0.1, PyTorch 2.1.0/2.1.1/2.1.2, PyTorch 2.2.0/2.2.1/2.2.2, PyTorch 2.3.0/2.3.1, PyTorch 2.4.0/2.4.1, PyTorch 2.5.0/2.5.1, PyTorch 2.6.0, PyTorch 2.7.0/2.7.1, and PyTorch 2.8.0 (following the same procedure). For older versions, you need to explicitly specify the latest supported version number or install via
pip install --no-indexin order to prevent a manual installation from source. You can look up the latest supported version number here.From source
Ensure that at least PyTorch 1.4.0 is installed and verify that
cuda/binandcuda/includeare in your$PATHand$CPATHrespectively, e.g.:Then run:
When running in a docker container without NVIDIA driver, PyTorch needs to evaluate the compute capabilities and may fail. In this case, ensure that the compute capabilities are set via
TORCH_CUDA_ARCH_LIST, e.g.:Functions
Graclus
A greedy clustering algorithm of picking an unmarked vertex and matching it with one its unmarked neighbors (that maximizes its edge weight). The GPU algorithm is adapted from Fagginger Auer and Bisseling: A GPU Algorithm for Greedy Graph Matching (LNCS 2012)
VoxelGrid
A clustering algorithm, which overlays a regular grid of user-defined size over a point cloud and clusters all points within a voxel.
FarthestPointSampling
A sampling algorithm, which iteratively samples the most distant point with regard to the rest points.
kNN-Graph
Computes graph edges to the nearest k points.
Args:
[N, F].[N], which assigns each node to a specific example.batchneeds to be sorted. (default:None)True, the graph will contain self-loops. (default:False)"source_to_target"or"target_to_source"). (default:"source_to_target")True, will use the Cosine distance instead of Euclidean distance to find nearest neighbors. (default:False)batchis notNone, or the input lies on the GPU. (default:1)Radius-Graph
Computes graph edges to all points within a given distance.
Args:
[N, F].[N], which assigns each node to a specific example.batchneeds to be sorted. (default:None)True, the graph will contain self-loops. (default:False)max_num_neighbors, returned neighbors are picked randomly. (default:32)"source_to_target"or"target_to_source"). (default:"source_to_target")batchis notNone, or the input lies on the GPU. (default:1)Nearest
Clusters points in x together which are nearest to a given query point in y.
batch_{x,y}vectors need to be sorted.RandomWalk-Sampling
Samples random walks of length
walk_lengthfrom all node indices instartin the graph given by(row, col).Running tests
C++ API
torch-clusteralso offers a C++ API that contains C++ equivalent of python models.