lichangye.lcy
|
c3c986ae76
|
0731_4
|
2025-07-31 18:36:31 +08:00 |
antoinegg1
|
2e0af5dd87
|
0731_3
|
2025-07-31 16:53:06 +08:00 |
lichangye.lcy
|
78d0367ff2
|
0731_2
|
2025-07-31 16:35:13 +08:00 |
antoinegg1
|
c5cd21d5db
|
0731_2
|
2025-07-31 15:57:07 +08:00 |
朱晗
|
d6a6240655
|
0731_1
|
2025-07-31 14:37:55 +08:00 |
朱晗
|
1ae006caf4
|
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AReaL into lcy/refactor
|
2025-07-31 13:19:48 +08:00 |
lichangye.lcy
|
68b4b0266d
|
0731
|
2025-07-31 13:18:50 +08:00 |
朱晗
|
4ff813ae9f
|
0725_format
|
2025-07-25 19:17:26 +08:00 |
lichangye.lcy
|
6b8bfcf9a4
|
0725_12
|
2025-07-25 19:13:33 +08:00 |
lichangye.lcy
|
a4ad671d3b
|
0725
|
2025-07-25 19:12:25 +08:00 |
lichangye.lcy
|
fb1796d941
|
0725_11
|
2025-07-25 18:52:29 +08:00 |
朱晗
|
4f8b17feb9
|
0725_4padded_image
|
2025-07-25 15:48:54 +08:00 |
朱晗
|
8eaced4036
|
0725_7
|
2025-07-25 14:10:28 +08:00 |
朱晗
|
3c272ff5b4
|
0725_6
|
2025-07-25 14:04:10 +08:00 |
lichangye.lcy
|
4198cd695c
|
0725_1
|
2025-07-25 13:42:42 +08:00 |
朱晗
|
b8549ac48a
|
0724_merge8
|
2025-07-24 19:54:15 +08:00 |
朱晗
|
6255ad5aa7
|
0724-merge8
|
2025-07-24 19:36:10 +08:00 |
朱晗
|
f5924b1851
|
0724_merge7
|
2025-07-24 19:34:22 +08:00 |
朱晗
|
6aeeabf7b9
|
0724_4
|
2025-07-24 19:29:46 +08:00 |
朱晗
|
27fd51aa15
|
0724_merge7
|
2025-07-24 15:46:15 +08:00 |
朱晗
|
13fc236c99
|
0724_merge6
|
2025-07-24 15:45:09 +08:00 |
朱晗
|
1bc9310252
|
0724_merge5
|
2025-07-24 15:38:24 +08:00 |
Wei Fu
|
311bcd7697
|
[lite] [feature] Bump to SGLang v0.4.9.post2 and use NCCL to update weights (#196)
* PullRequest: 353 [Lite] Add gradient checkpointing to FSDPEngine
Merge branch mzy/add-gradient-ckpt of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/353
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* add gradient checkpointing
* PullRequest: 354 [lite] GRPO pre-commit: minor changes in FSDP engine
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/354
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* PullRequest: 355 [Lite] GRPO pre-commit 2: Refactor RemoteSGLangEngine thread and SGLang configuration
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/355?tab=commit
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* .
* fix
* .
* PullRequest: 357 [lite] GRPO pre-commit 3: Fix typos and experiment utilities
Merge branch fw/lite-fix2 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/357?tab=comment
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* fix destroy process group
* PullRequest: 358 [lite] Support GRPO training locally with the GSM8k dataset
Merge branch fw/lite-fix3 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/358
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* fix loss mask
* fix
* .
* PullRequest: 368 [lite] Refactor train engine after merging contributions from GitHub
Merge branch fw/lite-train-engine of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/368
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* PullRequest: 371 [lite] [fix] fix misc bugs in GRPO implementation
Merge branch fw/lite-fix0716 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/371
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* PullRequest: 370 [lite] Add Slurm Launcher and Ray Launcher
Merge branch mzy/lite/launcher of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/370
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* .
* .
* .
* fix
* PullRequest: 392 [lite] Fix several bugs regarding RL learning and add an example to reproduce boba-math results.
Merge branch fw/lite-boba of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/392
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* support fsdp engine and sglang remote engine
* minor fix
* .
* refactor trainer
* add close
* rm mb_spec
* .
* fix
* .
* qwen2 grpo works
* fix
* fix
* async works
* fix
* slurm launcher not tested
* fix arg parse
* .
* sglang server wrapper
* .
* .
* slurm run
* ready for boba
* debug
* 32k run
* .
* .
* fix
* .
* .
* .
* .
* .
* fix
* .
* fix
* .
* .
* .
* .
* fix
* .
* .
* .
* .
* .
* .
* .
* refactor train engine
* refactor train engine
* .
* fix update weight error
* .
* .
* match train
* format
* .
* fix
* seems to work
* .
* .
* .
* .
* .
* PullRequest: 408 [Feature] Bump SGLang version to v0.4.9.post2
Merge branch fw/sgl049 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/408
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* bump arealite to sglang 0.4.9.post2
* .
* PullRequest: 412 [lite] Minor refactor on `UpdateWeightMeta`
* PullRequest: 422 [lite] Fix tests and scripts after updating sgl to 0.4.9
Merge branch fw/sgl049 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/422
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* bump arealite to sglang 0.4.9.post2
* .
* PullRequest: 412 [lite] Minor refactor on `UpdateWeightMeta`
* .
* PullRequest: 423 [lite] Remove the boba example for github release.
Merge branch fw/remove-boba of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/423
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
---------
Co-authored-by: 晓雷 <meizhiyu.mzy@antgroup.com>
|
2025-07-24 15:34:52 +08:00 |
lichangye.lcy
|
5690b52d42
|
0724_merge4
|
2025-07-24 15:33:53 +08:00 |
朱晗
|
5118cfaea2
|
0724_merge3
|
2025-07-24 15:22:12 +08:00 |
朱晗
|
176ec4bb23
|
0724_merge3
|
2025-07-24 14:29:29 +08:00 |
朱晗
|
c816a3ce44
|
0724_merge1
|
2025-07-24 14:18:42 +08:00 |
lichangye.lcy
|
6c28d52387
|
0724_1
|
2025-07-24 13:30:20 +08:00 |
朱晗
|
9a166053ee
|
0724_1
|
2025-07-24 11:15:54 +08:00 |
朱晗
|
82442b86fd
|
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AReaL into lcy/refactor
|
2025-07-23 17:12:51 +08:00 |
lichangye.lcy
|
d12dec22c9
|
0723_4
|
2025-07-23 17:12:32 +08:00 |
朱晗
|
8e82c59faa
|
0723_merge3
|
2025-07-23 17:06:47 +08:00 |
朱晗
|
2a2e2fe360
|
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AReaL into lcy/refactor
|
2025-07-23 16:56:04 +08:00 |
lichangye.lcy
|
4041afb97d
|
0723_1
|
2025-07-23 16:55:20 +08:00 |
lichangye.lcy
|
9ec2c3f640
|
0723_1
|
2025-07-23 16:54:17 +08:00 |
朱晗
|
dd14838c6d
|
0723_1
|
2025-07-23 14:54:35 +08:00 |
lichangye.lcy
|
391bd85e44
|
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AReaL into lcy/refactor
|
2025-07-23 14:44:36 +08:00 |
朱晗
|
6bde86a934
|
reformatted
|
2025-07-23 13:58:25 +08:00 |
朱晗
|
eff8f09149
|
0723_1
|
2025-07-23 11:02:36 +08:00 |
lichangye.lcy
|
8815be68e7
|
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AReaL into lcy/refactor
|
2025-07-22 18:00:57 +08:00 |
lichangye.lcy
|
af2f80c346
|
0722_7
|
2025-07-22 17:52:03 +08:00 |
朱晗
|
5c0662fdc1
|
0722_6
|
2025-07-22 16:23:56 +08:00 |
lichangye.lcy
|
ea12141f2b
|
0722_4
|
2025-07-22 16:03:29 +08:00 |
lichangye.lcy
|
229f101b44
|
0722_4
|
2025-07-22 15:58:19 +08:00 |
朱晗
|
eba0b5fd72
|
0722_3
|
2025-07-22 15:19:31 +08:00 |
lichangye.lcy
|
7199ce25b6
|
0722_2
|
2025-07-22 14:55:13 +08:00 |
朱晗
|
79e2a81762
|
0722_1
|
2025-07-22 11:09:24 +08:00 |
lichangye.lcy
|
475c35c156
|
0721_6
|
2025-07-21 17:43:12 +08:00 |
朱晗
|
b4e8215f55
|
0721_merge5
|
2025-07-21 17:17:32 +08:00 |
朱晗
|
60ac722779
|
0721_merge3
|
2025-07-21 16:58:42 +08:00 |