Similar Items: Enhanced semantic classification of microbiome sample origins using large language models (LLMs)