Text this: Themis: Training Robust Multilingual Code Reward Models for Flexible Multi-Criteria Scoring