-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Pull requests: modelscope/ms-swift
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] [v4] refactor megatron-swift (use megatron-core)
#7945
opened Jan 30, 2026 by
Jintao-Huang
Loading…
[feat] Support ProFit: Extend DFT with Probability Threshold-based Token Filtering
#7921
opened Jan 28, 2026 by
maybefunctionname
Loading…
1 of 4 tasks
feat: add greedy packing, MiniCPM packing support, and dataset progress tracking
#7904
opened Jan 26, 2026 by
Lollipop
Loading…
fix(megatron): disable checkpointing when calculate KL
#7828
opened Jan 20, 2026 by
zzc0430
Loading…
1 of 4 tasks
[template] Support HunyuanMT1.5-1.8B and HunyuanMT1.5-7B templates
#7351
opened Jan 10, 2026 by
rinne1998
Loading…
feat(cli): add setproctitle support to customize process name
#7278
opened Jan 4, 2026 by
ciaoyizhen
Loading…
1 task done
[feat] support activation cpu offload in fsdp and fsdp2
#7201
opened Dec 24, 2025 by
meichangsu1
Loading…
1 of 4 tasks
support cce、tiledmlp、activation cpu offload
#7169
opened Dec 23, 2025 by
meichangsu1
Loading…
1 of 4 tasks
Improve vLLM examples regarding vllm_engine_kwargs use
#7133
opened Dec 19, 2025 by
3manifold
Loading…
1 task done
[feat] support TiledMLP in Deepspeed and FSDP2
#7090
opened Dec 17, 2025 by
kevssim
Loading…
2 of 4 tasks
[bugfix] fix missing generate method for InternVL-2.5
#7019
opened Dec 12, 2025 by
xwy-bit
Loading…
1 of 4 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-01-02.