Similar Items: TokenStack: A Heterogeneous HBM-PIM Architecture and Runtime for Efficient LLM Inference