This package provides resampling strategies for making imbalanced binary datasets more balanced. Rebalancing is an important step before applying a classification algorithm, because class imbalance often degrades classifier performance, particularly on the minority class.
RSBID is available on GitHub.
# install.packages("devtools")
devtools::install_github("dongyuanwu/RSBID")
# To also build the vignettes so you can view them:
devtools::install_github("dongyuanwu/RSBID", build_vignettes = TRUE)
RSBID currently provides five strategies:
- Random Over-Sampling Algorithm (ROS)
- Synthetic Minority Over-sampling TEchnique (SMOTE)
- Synthetic Minority Over-sampling TEchnique-Nominal Continuous (SMOTE_NC)
- Random Under-Sampling Algorithm (RUS)
- Under-Sampling Based on Clustering Algorithm (SBC)
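
To give a feel for what the two simplest strategies do, here is a minimal base-R sketch of random over-sampling and random under-sampling. It is only an illustration of the idea, not RSBID's implementation: the function names `random_oversample` and `random_undersample` and the toy data are hypothetical, and the sketch assumes a binary outcome column with no empty factor levels.

```r
# Illustrative sketch only: these helpers are hypothetical and are not
# part of RSBID. They assume `outcome` names a binary column in `data`.

random_oversample <- function(data, outcome) {
  counts <- table(data[[outcome]])
  minority <- names(counts)[which.min(counts)]
  n_extra <- max(counts) - min(counts)
  min_rows <- which(data[[outcome]] == minority)
  # Draw minority rows with replacement until both classes are the same size
  extra <- min_rows[sample.int(length(min_rows), n_extra, replace = TRUE)]
  data[c(seq_len(nrow(data)), extra), , drop = FALSE]
}

random_undersample <- function(data, outcome) {
  counts <- table(data[[outcome]])
  majority <- names(counts)[which.max(counts)]
  maj_rows <- which(data[[outcome]] == majority)
  min_rows <- which(data[[outcome]] != majority)
  # Keep a random subset of majority rows, one per minority row
  keep <- maj_rows[sample.int(length(maj_rows), length(min_rows))]
  data[sort(c(min_rows, keep)), , drop = FALSE]
}

# Toy dataset: 90 "neg" rows and 10 "pos" rows
set.seed(1)
toy <- data.frame(
  x = rnorm(100),
  y = factor(rep(c("neg", "pos"), times = c(90, 10)))
)

over <- random_oversample(toy, "y")
under <- random_undersample(toy, "y")

table(over$y)   # both classes have 90 rows
table(under$y)  # both classes have 10 rows
```

Over-sampling keeps every original row but duplicates minority observations, while under-sampling discards majority observations. The synthetic and clustering-based strategies in the list above aim to reduce the drawbacks of these two extremes.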
An online Shiny app is also available.