Similar Items: A database-driven research data framework for integrating and processing high-dimensional geoscientific data