Skip to content

Pull requests: sgl-project/SpecForge

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Supports eagle3 training for Gemma3 27B and Gemma4 26B.
#553 opened May 1, 2026 by pyc96 Collaborator Loading…
6 tasks
fix benchmark template
#552 opened Apr 30, 2026 by Dogacel Loading…
6 tasks
Add MBPP benchmark
#548 opened Apr 27, 2026 by tugot17 Loading…
chore: regenerate_train_data accepts API key and https URL
#543 opened Apr 24, 2026 by lianakoleva Loading…
1 of 6 tasks
add the configs for qwen3-vl-8b-instruct model
#542 opened Apr 23, 2026 by sunny-infra Loading…
1 of 6 tasks
support quant
#540 opened Apr 17, 2026 by liusy58 Loading…
6 tasks
Fix/gemma3 eagle3 hooks
#538 opened Apr 17, 2026 by tcligg Draft
6 tasks
fix: correct vocab_size to 262144 in gemma3-1b-eagle3 configuration
#537 opened Apr 16, 2026 by javierlimt6 Loading…
1 of 6 tasks
Fix incorrect LSE gradients in cached FlashAttention for Eagle training
#536 opened Apr 16, 2026 by uygnef Collaborator Loading…
6 tasks
feat: add EAGLE3 support for Step-3.5-Flash
#530 opened Apr 13, 2026 by zijiexia Loading…
3 tasks
fix: Bump sglang version from 0.5.9 to 0.5.10
#529 opened Apr 13, 2026 by moehanabi Contributor Loading…
1 of 6 tasks
Fix multimodal hidden-state preparation for Qwen3-VL models
#526 opened Apr 8, 2026 by liusy58 Loading…
6 tasks
support qwen3vl-235b
#525 opened Apr 8, 2026 by liusy58 Loading…
6 tasks
[Feature] Train infer disaggregated
#523 opened Apr 2, 2026 by jiapingW Collaborator Loading…
5 tasks
fix: Make template override arg work correctly
#522 opened Apr 1, 2026 by moehanabi Contributor Loading…
1 of 6 tasks
feat: add Qwen3.5 Dense Model EAGLE3 training support
#516 opened Mar 28, 2026 by 36330 Loading…
ProTip! Follow long discussions with comments:>50.