Text this: A novel and efficient feature extraction algorithm using kmer-derived mutation signal