Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Cross-layer Attention Sharing for Pre-trained Large Language Models

Saved in:
Bibliographic Details
Published in:Transactions of the Association for Computational Linguistics
Format: Online Article RSS Article
Published: 2026
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867301676809977858
collection WordPress RSS
FRELIP Feed Integration
container_title Transactions of the Association for Computational Linguistics
description
discipline_display Linguistics and Philology
discipline_facet Linguistics and Philology
format Online Article
RSS Article
genre Journal Article
id rss_article:64529
institution FRELIP
journal_source_facet Transactions of the Association for Computational Linguistics
publishDate 2026
publishDateSort 2026
record_format rss_article
spellingShingle Cross-layer Attention Sharing for Pre-trained Large Language Models
Linguistics and Philology
General
Linguistics and Philology
sub_discipline_display General
sub_discipline_facet General
subject_display Linguistics and Philology
General
Linguistics and Philology
Linguistics and Philology
General
Linguistics and Philology
subject_facet Linguistics and Philology
General
Linguistics and Philology
title Cross-layer Attention Sharing for Pre-trained Large Language Models
title_auth Cross-layer Attention Sharing for Pre-trained Large Language Models
title_full Cross-layer Attention Sharing for Pre-trained Large Language Models
title_fullStr Cross-layer Attention Sharing for Pre-trained Large Language Models
title_full_unstemmed Cross-layer Attention Sharing for Pre-trained Large Language Models
title_short Cross-layer Attention Sharing for Pre-trained Large Language Models
title_sort cross-layer attention sharing for pre-trained large language models
topic Linguistics and Philology
General
Linguistics and Philology
url https://direct.mit.edu/tacl/article/doi/10.1162/TACL.a.616/136548/Cross-layer-Attention-Sharing-for-Pre-trained