A Must-Know New Clustering Algorithm for Disease Modelling

This article explains a new method for clustering disease data by** both subtype and stage** called SuStaIn (Subtype & Stage Inference). It explains the concept, summarises the maths, and provides a link to the python code.

Classic clustering algorithms like K-Means and Gaussian Mixture Model (GMM) are great for modelling data when we want to find cross-sectional subtypes (aka clusters). This kind of subtyping is used a lot in medicine. A well-known general example is that of subtyping diabetes into “Type I” and “Type **II” **using a single blood sugar measurement. This can help doctors decide whether to prescribe insulin injections or lifestyle changes.

#machine-learning #clustering #disease #unsupervised-learning #biomarker

towardsdatascience.com

A Must-Know New Clustering Algorithm for Disease Modelling