Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo... (#815, opened Dec 11, 2023 by lvhan028, open)
[Bug] PyTorch backend loses 1.0-2.5 precision points on some models between the main branch and v0.6.1 (#2679, opened Oct 29, 2024 by zhulinJulia24)
[Bug] No user input makes the API server throw an exception with MLLM (#2658, opened Oct 25, 2024 by gaord)
[Bug] With new 4-bit quantized models of internlm2, the decoded word starts with a blank (#2651, opened Oct 24, 2024 by zhulinJulia24)
[Bug] When deploying minicpm-v-2_6 with Triton, GPU memory keeps increasing until it overflows (#2642, opened Oct 24, 2024 by LinJianping)
[Bug] Phi-3-vision-128k-instruct raises "Expected all tensors to be on the same device, but found at least two devices" when run on 8 GPUs [mllm] (#2633, opened Oct 22, 2024 by dreamerlin)
[Bug] qwen2-vl-7b Docker deployment bugs [awaiting response] (#2629, opened Oct 22, 2024 by jnzbfgjd)
[Feature] Combine Batched Inference and Chat Conversation in VLMs Deployment (#2628, opened Oct 21, 2024 by Yusepp)
[Bug] When TP = 4 and prefix cache is enabled, no result is generated (#2611, opened Oct 17, 2024 by rbao2018)
[Bug] InternVL2-26B model loads extremely slowly (#2608, opened Oct 16, 2024 by HappyNotHappy)
ProTip! Find all open issues with in-progress development work using the search qualifier linked:pr.
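The same linked:pr search can be run from the command line. A minimal sketch, assuming the GitHub CLI (`gh`) is installed and authenticated; the repository name comes from the page above, and the flags shown are standard `gh issue list` options:

```shell
# List open issues in InternLM/lmdeploy that have a linked pull request,
# using GitHub's issue-search qualifier "linked:pr".
gh issue list --repo InternLM/lmdeploy --state open --search "linked:pr"
```

The `--search` flag accepts any qualifier from GitHub's issue search syntax, so the same command works with filters like `label:mllm` as well.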