Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

update pre-commit config
#2683 opened Oct 30, 2024 by lvhan028 Loading…
Better tp exit log. Bug:P2
#2677 opened Oct 29, 2024 by grimoire Loading…
Flatten cache and add flashattention improvement
#2676 opened Oct 29, 2024 by grimoire Loading…
add linear op on dlinfer platform enhancement New feature or request
#2627 opened Oct 21, 2024 by yao-fengchen Loading…
support release pipeline improvement
#2581 opened Oct 11, 2024 by irexyc Loading…
support yarn in turbomind backend enhancement New feature or request
#2519 opened Sep 26, 2024 by irexyc Loading…
Torchrun launching multiple api_server
#2402 opened Aug 30, 2024 by AllentDan Loading…
More w8a8 models
#2373 opened Aug 26, 2024 by AllentDan Draft
[Feature] support qqq(w4a8) for lmdeploy
#2274 opened Aug 9, 2024 by HandH1998 Loading…
6 tasks done
[Feature] Support XTuner Lite Llava enhancement New feature or request
#2191 opened Jul 31, 2024 by pppppM Loading…
Add prefix cache stats to usage
#2018 opened Jul 13, 2024 by ispobock Loading…
feat: decouple input_ids and output_ids
#1855 opened Jun 25, 2024 by zhyncs Loading…
Add Jetson platform support (by docker)
#1820 opened Jun 21, 2024 by BestAnHongjun Loading…
support vl benchmark
#1662 opened May 27, 2024 by AllentDan Loading…
support AI4Chem/ChemLLM-7B-Chat-1_5-SFT WIP
#1552 opened May 7, 2024 by lvhan028 Loading…
Log stats enhancement New feature or request
#1423 opened Apr 11, 2024 by AllentDan Loading…
support frequency penalty
#713 opened Nov 20, 2023 by RytonLi Loading…
ProTip! Updated in the last three days: updated:>2024-10-28.