A computer vision framework towards automated scene understanding & analysis