Over sampling and under sampling

Over sampling and under sampling are two common methods used to deal with imbalanced data sets, where one class is much more represented than the other. Over sampling involves duplicating minority class examples until the class is balanced, while under sampling involves removing majority class examples until the class is balanced. Both methods have advantages … Read more