Exploring the class imbalance problem in text classification