Text this: Hierarchical growth self-generating prototype: an information entropy-based approach for dataset reduction