Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

Cross-layer Attention Sharing for Pre-trained Large Language Models

Saved in:
Bibliographic Details
Published in:Transactions of the Association for Computational Linguistics
Format: Online Article RSS Article
Published: 2026
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1865670897134731264
collection WordPress RSS
FRELIP Feed Integration
container_title Transactions of the Association for Computational Linguistics
description
discipline_display Arts & Humanities
discipline_facet Arts & Humanities
format Online Article
RSS Article
genre Journal Article
id rss_article:52931
institution FRELIP
journal_source_facet Transactions of the Association for Computational Linguistics
publishDate 2026
publishDateSort 2026
record_format rss_article
spellingShingle Cross-layer Attention Sharing for Pre-trained Large Language Models
— — — — — Linguistics and Philology
Language & Literature
Arts & Humanities
sub_discipline_display Language & Literature
sub_discipline_facet Language & Literature
subject_display — — — — — Linguistics and Philology
Language & Literature
Arts & Humanities
— — — — — Linguistics and Philology
Language & Literature
Arts & Humanities
subject_facet — — — — — Linguistics and Philology
Language & Literature
Arts & Humanities
title Cross-layer Attention Sharing for Pre-trained Large Language Models
title_auth Cross-layer Attention Sharing for Pre-trained Large Language Models
title_full Cross-layer Attention Sharing for Pre-trained Large Language Models
title_fullStr Cross-layer Attention Sharing for Pre-trained Large Language Models
title_full_unstemmed Cross-layer Attention Sharing for Pre-trained Large Language Models
title_short Cross-layer Attention Sharing for Pre-trained Large Language Models
title_sort cross-layer attention sharing for pre-trained large language models
topic — — — — — Linguistics and Philology
Language & Literature
Arts & Humanities
url https://direct.mit.edu/tacl/article/doi/10.1162/TACL.a.616/136548/Cross-layer-Attention-Sharing-for-Pre-trained