Curated List of Clustering Resources
Last Updated on February 2, 2021 by Editorial Team
Author(s): Durgesh Samariya
Last updated on July 31, 2020. I am continuously updating this post.
Clustering or Cluster analysis is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). — Wikipedia
This post contents:
- Books
- Research Papers
- Online Courses and Videos
- Clustering Datasets
- List of Journals
Books
Data Clustering by Chandan K. Reddy and Charu C. Aggarwal. This textbook covers most of the clustering techniques. Highly recommended to people working in clustering.
Data Clustering: Theory, Algorithms, and Applications by Guojun Gan, Chaoqun Ma, and Jianhong Wu. This is a useful compendium of a variety of clustering methods, for a variety of data types, with numerous measures of similarity and many examples of algorithms. The ultimate emphasis is on the algorithms, even the implementation in MATLAB or C++.
Research Papers
Survey Papers
Survey of clustering algorithms
A Survey of Clustering Data Mining Techniques
Clustering high-dimensional data: A survey on subspace clustering
A Survey of Text Clustering Algorithms
A Survey of Recent Advances in Hierarchical Clustering Algorithms
Subspace Clustering for High Dimensional Data: A Review
State-of-the-art Methods
Density-Based Clustering Algorithms
Hierarchical Clustering Algorithms
Subspace Clustering
SUBCLU(density-connected Subspace Clustering)
FIRES(FIlter REfinement Subspace clustering)
CSSub (Clustering by Shared Subspaces)
Online Courses and Videos
Coursera Machine Learning course by Andrew Ng, Stanford University
Coursera Machine Learning with Python by IBM
Course on Clustering in Machine Learning by Google
Coursera Clustering Analysis in Data Mining course by University of Illinois at Urbana-Champaign
Clustering Datasets
Clustering basic benchmark dataset
List of Journals for Clustering
Journal of Machine Learning Research
IEEE Transactions on Pattern Analysis and Machine Intelligence
Data Mining and Knowledge Discovery
Check out my git repository awesome-clustering-resources.
More to come…
More items will be added to the repository. Please feel free to suggest other key resources by opening an issue report, submitting a pull request, or dropping me an email @ (samariya.durgesh@gmail.com).
Enjoy reading!
Curated List of Clustering Resources was originally published in Towards AI — Multidisciplinary Science Journal on Medium, where people are continuing the conversation by highlighting and responding to this story.
Published via Towards AI
Towards AI Academy
We Build Enterprise-Grade AI. We'll Teach You to Master It Too.
15 engineers. 100,000+ students. Towards AI Academy teaches what actually survives production.
Start free — no commitment:
→ 6-Day Agentic AI Engineering Email Guide — one practical lesson per day
→ Agents Architecture Cheatsheet — 3 years of architecture decisions in 6 pages
Our courses:
→ AI Engineering Certification — 90+ lessons from project selection to deployed product. The most comprehensive practical LLM course out there.
→ Agent Engineering Course — Hands on with production agent architectures, memory, routing, and eval frameworks — built from real enterprise engagements.
→ AI for Work — Understand, evaluate, and apply AI for complex work tasks.
Note: Article content contains the views of the contributing authors and not Towards AI.