Text this: A framework for evaluating semi-structured hierarchical data using language models