-
-
Notifications
You must be signed in to change notification settings - Fork 12k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Don'e assume
position_embedding_type will be present for BERT and RoBERTa models
#30770
opened Dec 16, 2025 by
hmellor
Loading…
[Frontend] Add
max-completion-token option to transcription/translation endpoints
frontend
#30769
opened Dec 16, 2025 by
NickLucche
Loading…
[Doc][CPU] Update CPU doc
ci/build
documentation
Improvements or additions to documentation
#30765
opened Dec 16, 2025 by
bigPYJ1151
Loading…
1 of 5 tasks
Remove ONLY add when PR is ready to merge/full CI is needed
head_mask from Ultravox and Swin
ready
#30764
opened Dec 16, 2025 by
hmellor
Loading…
[KVConnector]: Enable Cross-layers KV cache layout for MultiConnector
kv-connector
#30761
opened Dec 16, 2025 by
kfirtoledo
Loading…
[MM] Pass FA version in ViT Attn
ready
ONLY add when PR is ready to merge/full CI is needed
#30756
opened Dec 16, 2025 by
NickLucche
Loading…
[Core][KVConnector] Propagate block hashes in SchedulerOutput
tpu
Related to Google TPUs
v1
#30753
opened Dec 16, 2025 by
QierLi
Loading…
3 of 5 tasks
[Bugfix]: prevent leaking tokens in crash log
v1
#30751
opened Dec 16, 2025 by
dr75
Loading…
5 tasks
[Refactor] [4/N] Move VLLM_SERVER_DEV endpoints into the serve directory
ci/build
frontend
#30749
opened Dec 16, 2025 by
chaunceyjiang
Loading…
5 tasks
[Docs] fix function name
documentation
Improvements or additions to documentation
#30748
opened Dec 16, 2025 by
lengrongfu
Loading…
5 tasks
[Draft][SM100] Enable fp8 compute for prefill MLA
v1
#30746
opened Dec 16, 2025 by
pavanimajety
•
Draft
2 of 3 tasks
[BugFix]Reclaim resources to prevent memory leaks when use LMCacheMPConnector
kv-connector
#30745
opened Dec 16, 2025 by
wz1qqx
Loading…
5 tasks
[BugFix] Fix memory spike in workspace allocation
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#30744
opened Dec 16, 2025 by
LucasWilkinson
Loading…
Support LoRA of PLaMo 2/3
documentation
Improvements or additions to documentation
#30742
opened Dec 16, 2025 by
Alnusjaponica
•
Draft
5 tasks
Properly handle
packed_modules_mapping of PLaMo2
#30740
opened Dec 16, 2025 by
Alnusjaponica
•
Draft
1 of 5 tasks
[BugFix] Support online dense model DP without overhead
kv-connector
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#30739
opened Dec 16, 2025 by
njhill
Loading…
Add mfu stats logging
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#30738
opened Dec 16, 2025 by
SungMinCho
Loading…
add Qwen3OmniMoeAudioEncoder and support torch compile
qwen
Related to Qwen models
#30735
opened Dec 16, 2025 by
XiaobingSuper
Loading…
2 of 5 tasks
[CI/Build] Allow user to configure NVSHMEM version via ENV or command line
ci/build
#30732
opened Dec 16, 2025 by
eicherseiji
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-12-13.