SMOTE for oversampling minority class in an Imbalanced Dataset

SMOTE stands for Synthetic Minority Oversampling Technique. It uses k-nearest neighbors of the minority class to create a synthetic row for the class. We can specify how much data to synthesize by mentioning the ratio to the majority class. We can also specify which classes to resample - majority, minority, not majority, not minority, all classes.

There are other variants of SMOTE in which minority class is synthesized using SVM algorithm or KMeans clustering.


Python Implementation : pip install imbalanced-learn ; from imblearn.over_sampling import SMOTE ; <SMOTE object>.fit_resample(X,Y)


Comments

Popular posts from this blog

Principal Component Analysis (PCA)

Transfer Learning

Receiver Operating Characteristic (ROC) Curve