A theoretical and empirical comparison of various statistical techniques as applied to a multicentre dataset