Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
| Published in: | ArXiv cs.DC Recent Papers |
|---|---|
| Format: | Online Article RSS Article |
| Published: |
2026
|
| Subjects: | |
| Tags: |
No Tags, Be the first to tag this record!
|
| _version_ | 1864764977693327361 |
|---|---|
| collection | WordPress RSS FRELIP Feed Integration |
| container_title | ArXiv cs.DC Recent Papers |
| description | |
| discipline_display | Engineering & Technology |
| discipline_facet | Engineering & Technology |
| format | Online Article RSS Article |
| genre | Journal Article |
| id | rss_article:50423 |
| institution | FRELIP |
| journal_source_facet | ArXiv cs.DC Recent Papers |
| publishDate | 2026 |
| publishDateSort | 2026 |
| record_format | rss_article |
| spellingShingle | Tackling the Data-Parallel Load Balancing Bottleneck in LLM Serving: Practical Online Routing at Scale ArXiv cs.DC Recent Papers Computer Science & IT Engineering & Technology |
| sub_discipline_display | Computer Science & IT |
| sub_discipline_facet | Computer Science & IT |
| subject_display | ArXiv cs.DC Recent Papers Computer Science & IT Engineering & Technology ArXiv cs.DC Recent Papers Computer Science & IT Engineering & Technology |
| subject_facet | ArXiv cs.DC Recent Papers Computer Science & IT Engineering & Technology |
| title | Tackling the Data-Parallel Load Balancing Bottleneck in LLM Serving: Practical Online Routing at Scale |
| title_auth | Tackling the Data-Parallel Load Balancing Bottleneck in LLM Serving: Practical Online Routing at Scale |
| title_full | Tackling the Data-Parallel Load Balancing Bottleneck in LLM Serving: Practical Online Routing at Scale |
| title_fullStr | Tackling the Data-Parallel Load Balancing Bottleneck in LLM Serving: Practical Online Routing at Scale |
| title_full_unstemmed | Tackling the Data-Parallel Load Balancing Bottleneck in LLM Serving: Practical Online Routing at Scale |
| title_short | Tackling the Data-Parallel Load Balancing Bottleneck in LLM Serving: Practical Online Routing at Scale |
| title_sort | tackling the data-parallel load balancing bottleneck in llm serving: practical online routing at scale |
| topic | ArXiv cs.DC Recent Papers Computer Science & IT Engineering & Technology |
| url | https://arxiv.org/abs/2605.06113v1 |