Text this: FAIR Enough: Develop and Assess a FAIR-Compliant Dataset for Large Language Model Training?