-
Notifications
You must be signed in to change notification settings - Fork 383
Issues: modelscope/ms-swift
Fine-tuning best practices for qwen2.5-72b-instruct and qwen2...
#2064
opened Sep 18, 2024 by
Jintao-Huang
Open
19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
A Problem with parameters --sequence_parallel_size with torch==2.5.1
#2493
opened Nov 23, 2024 by
Gary-code
readme里面多节点训练的例子好像有问题,master节点的master_addr设置成127.0.0.1会报错,设置成IP就没问题
#2485
opened Nov 21, 2024 by
tppppppppp
MLLM的微调,除了Qwen2-VL之外,其他模型是否支持限制图像的像素以及图像slide个数
enhancement
New feature or request
#2480
opened Nov 19, 2024 by
Gary-code
Add Multimodal Input Support (Image, Audio, Video) to App-UI in MS-Swift Library
#2469
opened Nov 18, 2024 by
SushantGautam
请问如果我在sft的参数里添加warmup_ratio, 然后用deepspeed去训练,这个warmup_ratio参数会生效么?
#2467
opened Nov 18, 2024 by
samaritan1998
ValueError: Image features and image tokens do not match: tokens: 5589, features 5805
#2460
opened Nov 16, 2024 by
Gary-code
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.