APA (7th ed.) Citation
(2026). Cross-layer Attention Sharing for Pre-trained Large Language Models. Transactions of the Association for Computational Linguistics.
Chicago Style (17th ed.) Citation
"Cross-layer Attention Sharing for Pre-trained Large Language Models." Transactions of the Association for Computational Linguistics 2026.
MLA (9th ed.) Citation
"Cross-layer Attention Sharing for Pre-trained Large Language Models." Transactions of the Association for Computational Linguistics, 2026.
Warning: These citations may not always be 100% accurate.