Text this: Machine learning approach for site classification