Text this: Long Context Pre-Training with Lighthouse Attention