朱晗
|
eff8f09149
|
0723_1
|
2025-07-23 11:02:36 +08:00 |
lichangye.lcy
|
8815be68e7
|
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AReaL into lcy/refactor
|
2025-07-22 18:00:57 +08:00 |
lichangye.lcy
|
af2f80c346
|
0722_7
|
2025-07-22 17:52:03 +08:00 |
朱晗
|
5c0662fdc1
|
0722_6
|
2025-07-22 16:23:56 +08:00 |
朱晗
|
c27a51bc1e
|
0722_5
|
2025-07-22 16:22:59 +08:00 |
lichangye.lcy
|
ea12141f2b
|
0722_4
|
2025-07-22 16:03:29 +08:00 |
lichangye.lcy
|
229f101b44
|
0722_4
|
2025-07-22 15:58:19 +08:00 |
Wei Fu
|
6239633213
|
[doc] [lite] Add customization docs for AReaLite. (#191)
* PullRequest: 353 [Lite] Add gradient checkpointing to FSDPEngine
Merge branch mzy/add-gradient-ckpt of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/353
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* add gradient checkpointing
* PullRequest: 354 [lite] GRPO pre-commit: minor changes in FSDP engine
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/354
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* PullRequest: 355 [Lite] GRPO pre-commit 2: Refactor RemoteSGLangEngine thread and SGLang configuration
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/355?tab=commit
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* .
* fix
* .
* PullRequest: 357 [lite] GRPO pre-commit 3: Fix typos and experiment utilities
Merge branch fw/lite-fix2 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/357?tab=comment
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* fix destroy process group
* PullRequest: 358 [lite] Support GRPO training locally with the GSM8k dataset
Merge branch fw/lite-fix3 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/358
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* fix loss mask
* fix
* .
* PullRequest: 368 [lite] Refactor train engine after merging contributions from GitHub
Merge branch fw/lite-train-engine of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/368
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* PullRequest: 371 [lite] [fix] fix misc bugs in GRPO implementation
Merge branch fw/lite-fix0716 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/371
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* PullRequest: 370 [lite] Add Slurm Launcher and Ray Launcher
Merge branch mzy/lite/launcher of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/370
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* .
* .
* .
* fix
* PullRequest: 392 [lite] Fix several bugs regarding RL learning and add an example to reproduce boba-math results.
Merge branch fw/lite-boba of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/392
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* support fsdp engine and sglang remote engine
* minor fix
* .
* refactor trainer
* add close
* rm mb_spec
* .
* fix
* .
* qwen2 grpo works
* fix
* fix
* async works
* fix
* slurm launcher not tested
* fix arg parse
* .
* sglang server wrapper
* .
* .
* slurm run
* ready for boba
* debug
* 32k run
* .
* .
* fix
* .
* .
* .
* .
* .
* fix
* .
* fix
* .
* .
* .
* .
* fix
* .
* .
* .
* .
* .
* .
* .
* refactor train engine
* refactor train engine
* .
* fix update weight error
* .
* .
* match train
* format
* .
* fix
* seems to work
* .
* .
* .
* .
* .
* .
* .
* .
* .
* .
* .
---------
Co-authored-by: 晓雷 <meizhiyu.mzy@antgroup.com>
|
2025-07-22 15:43:31 +08:00 |
nuzant
|
ba16d4ef44
|
add quickstart (#194)
|
2025-07-22 15:30:59 +08:00 |
朱晗
|
eba0b5fd72
|
0722_3
|
2025-07-22 15:19:31 +08:00 |
lichangye.lcy
|
7199ce25b6
|
0722_2
|
2025-07-22 14:55:13 +08:00 |
lichangye.lcy
|
3d2f7a9278
|
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AReaL into lcy/refactor
|
2025-07-22 11:10:45 +08:00 |
朱晗
|
79e2a81762
|
0722_1
|
2025-07-22 11:09:24 +08:00 |
朱晗
|
80862b7016
|
0721_8
|
2025-07-21 18:30:47 +08:00 |
朱晗
|
f451dbd692
|
0721_merge7
|
2025-07-21 18:10:24 +08:00 |
朱晗
|
c29561498e
|
0721_merge6
|
2025-07-21 17:57:29 +08:00 |
朱晗
|
aed6a9013c
|
Merge remote-tracking branch 'backup/lite' into lcy/refactor
|
2025-07-21 17:44:37 +08:00 |
lichangye.lcy
|
475c35c156
|
0721_6
|
2025-07-21 17:43:12 +08:00 |
朱晗
|
b4e8215f55
|
0721_merge5
|
2025-07-21 17:17:32 +08:00 |
朱晗
|
a2d6d21810
|
0721_merge4
|
2025-07-21 17:02:32 +08:00 |
朱晗
|
60ac722779
|
0721_merge3
|
2025-07-21 16:58:42 +08:00 |
lichangye.lcy
|
67760d3389
|
0721_formal
|
2025-07-21 16:53:23 +08:00 |
lichangye.lcy
|
339e87ab80
|
0721_formal
|
2025-07-21 16:52:00 +08:00 |
博惟
|
4dd4a2249f
|
.
|
2025-07-21 15:11:39 +08:00 |
博惟
|
ab5db3fa55
|
.
|
2025-07-21 15:09:50 +08:00 |
lichangye.lcy
|
9fcc177237
|
0721_4
|
2025-07-21 13:28:23 +08:00 |
Wei Fu
|
4804b05637
|
[Refactor] Rename files in arealite before release. (#190)
* PullRequest: 353 [Lite] Add gradient checkpointing to FSDPEngine
Merge branch mzy/add-gradient-ckpt of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/353
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* add gradient checkpointing
* PullRequest: 354 [lite] GRPO pre-commit: minor changes in FSDP engine
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/354
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* PullRequest: 355 [Lite] GRPO pre-commit 2: Refactor RemoteSGLangEngine thread and SGLang configuration
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/355?tab=commit
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* .
* fix
* .
* PullRequest: 357 [lite] GRPO pre-commit 3: Fix typos and experiment utilities
Merge branch fw/lite-fix2 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/357?tab=comment
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* fix destroy process group
* PullRequest: 358 [lite] Support GRPO training locally with the GSM8k dataset
Merge branch fw/lite-fix3 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/358
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* fix loss mask
* fix
* .
* PullRequest: 368 [lite] Refactor train engine after merging contributions from GitHub
Merge branch fw/lite-train-engine of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/368
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* PullRequest: 371 [lite] [fix] fix misc bugs in GRPO implementation
Merge branch fw/lite-fix0716 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/371
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* PullRequest: 370 [lite] Add Slurm Launcher and Ray Launcher
Merge branch mzy/lite/launcher of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/370
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* .
* .
* .
* fix
* PullRequest: 392 [lite] Fix several bugs regarding RL learning and add an example to reproduce boba-math results.
Merge branch fw/lite-boba of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/392
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* support fsdp engine and sglang remote engine
* minor fix
* .
* refactor trainer
* add close
* rm mb_spec
* .
* fix
* .
* qwen2 grpo works
* fix
* fix
* async works
* fix
* slurm launcher not tested
* fix arg parse
* .
* sglang server wrapper
* .
* .
* slurm run
* ready for boba
* debug
* 32k run
* .
* .
* fix
* .
* .
* .
* .
* .
* fix
* .
* fix
* .
* .
* .
* .
* fix
* .
* .
* .
* .
* .
* .
* .
* refactor train engine
* refactor train engine
* .
* fix update weight error
* .
* .
* match train
* format
* .
* fix
* seems to work
* .
* .
* .
* .
* format
* format
* .
* rename
* rename
---------
Co-authored-by: 晓雷 <meizhiyu.mzy@antgroup.com>
|
2025-07-21 12:57:50 +08:00 |
Wei Fu
|
18f8a056b6
|
[Feature] [lite] Merge from internal dev repo (#189)
* PullRequest: 353 [Lite] Add gradient checkpointing to FSDPEngine
Merge branch mzy/add-gradient-ckpt of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/353
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* add gradient checkpointing
* PullRequest: 354 [lite] GRPO pre-commit: minor changes in FSDP engine
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/354
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* PullRequest: 355 [Lite] GRPO pre-commit 2: Refactor RemoteSGLangEngine thread and SGLang configuration
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/355?tab=commit
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* .
* fix
* .
* PullRequest: 357 [lite] GRPO pre-commit 3: Fix typos and experiment utilities
Merge branch fw/lite-fix2 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/357?tab=comment
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* fix destroy process group
* PullRequest: 358 [lite] Support GRPO training locally with the GSM8k dataset
Merge branch fw/lite-fix3 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/358
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* fix loss mask
* fix
* .
* PullRequest: 368 [lite] Refactor train engine after merging contributions from GitHub
Merge branch fw/lite-train-engine of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/368
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* PullRequest: 371 [lite] [fix] fix misc bugs in GRPO implementation
Merge branch fw/lite-fix0716 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/371
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* PullRequest: 370 [lite] Add Slurm Launcher and Ray Launcher
Merge branch mzy/lite/launcher of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/370
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* .
* .
* .
* fix
* PullRequest: 392 [lite] Fix several bugs regarding RL learning and add an example to reproduce boba-math results.
Merge branch fw/lite-boba of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/392
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* support fsdp engine and sglang remote engine
* minor fix
* .
* refactor trainer
* add close
* rm mb_spec
* .
* fix
* .
* qwen2 grpo works
* fix
* fix
* async works
* fix
* slurm launcher not tested
* fix arg parse
* .
* sglang server wrapper
* .
* .
* slurm run
* ready for boba
* debug
* 32k run
* .
* .
* fix
* .
* .
* .
* .
* .
* fix
* .
* fix
* .
* .
* .
* .
* fix
* .
* .
* .
* .
* .
* .
* .
* refactor train engine
* refactor train engine
* .
* fix update weight error
* .
* .
* match train
* format
* .
* fix
* seems to work
* .
* .
* .
* .
* format
* format
* .
---------
Co-authored-by: 晓雷 <meizhiyu.mzy@antgroup.com>
|
2025-07-21 12:52:43 +08:00 |
博惟
|
9c4da33f61
|
Merge branch 'lite' of https://github.com/inclusionAI/AReaL into lite
|
2025-07-21 12:44:55 +08:00 |
Night
|
f68a4f677d
|
Implement fsdp distributed update (#183)
* PullRequest: 353 [Lite] Add gradient checkpointing to FSDPEngine
Merge branch mzy/add-gradient-ckpt of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/353
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* add gradient checkpointing
* PullRequest: 354 [lite] GRPO pre-commit: minor changes in FSDP engine
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/354
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* PullRequest: 355 [Lite] GRPO pre-commit 2: Refactor RemoteSGLangEngine thread and SGLang configuration
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/355?tab=commit
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* .
* fix
* .
* PullRequest: 357 [lite] GRPO pre-commit 3: Fix typos and experiment utilities
Merge branch fw/lite-fix2 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/357?tab=comment
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* fix destroy process group
* PullRequest: 358 [lite] Support GRPO training locally with the GSM8k dataset
Merge branch fw/lite-fix3 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/358
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* fix loss mask
* fix
* .
* PullRequest: 368 [lite] Refactor train engine after merging contributions from GitHub
Merge branch fw/lite-train-engine of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/368
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* PullRequest: 371 [lite] [fix] fix misc bugs in GRPO implementation
Merge branch fw/lite-fix0716 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/371
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* added remote nccl weight update
feat: implement update_weights_from_distributed in fsdp_engine.py
unfinishd test, raise PR first
coroutine for each server
chore: change uploads weights behavior, change test order
fix small bug
fixed test
* fix rebase
* add test.sh
* updated, test stil fails
* .
* .
* .
* fix: full_tensor() should happen in all rank (#187)
Co-authored-by: ChangyiYang <changyiyang2023@gmail.com>
---------
Co-authored-by: 晓雷 <meizhiyu.mzy@antgroup.com>
Co-authored-by: 博惟 <bowei.fw@antgroup.com>
Co-authored-by: ChangyiYang <changyiyang2023@gmail.com>
Co-authored-by: ChangyiYang <112288487+ChangyiYang@users.noreply.github.com>
|
2025-07-21 12:42:00 +08:00 |
朱晗
|
8f263713eb
|
merge_0721_2
|
2025-07-21 11:42:18 +08:00 |
朱晗
|
a157510799
|
0721_3
|
2025-07-21 11:34:28 +08:00 |
lichangye.lcy
|
588ffd2d91
|
0721_2
|
2025-07-21 11:30:50 +08:00 |
博惟
|
2f1b679ad6
|
PullRequest: 392 [lite] Fix several bugs regarding RL learning and add an example to reproduce boba-math results.
Merge branch fw/lite-boba of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/392
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* support fsdp engine and sglang remote engine
* minor fix
* .
* refactor trainer
* add close
* rm mb_spec
* .
* fix
* .
* qwen2 grpo works
* fix
* fix
* async works
* fix
* slurm launcher not tested
* fix arg parse
* .
* sglang server wrapper
* .
* .
* slurm run
* ready for boba
* debug
* 32k run
* .
* .
* fix
* .
* .
* .
* .
* .
* fix
* .
* fix
* .
* .
* .
* .
* fix
* .
* .
* .
* .
* .
* .
* .
* refactor train engine
* refactor train engine
* .
* fix update weight error
* .
* .
* match train
* format
* .
* fix
* seems to work
* .
* .
* .
* .
|
2025-07-21 11:26:36 +08:00 |
lichangye.lcy
|
25b65a2119
|
0721_1
|
2025-07-21 11:16:51 +08:00 |
Wei Fu
|
40b0070f97
|
[Fix] Add a title for the v0.3 blog (#186)
|
2025-07-21 10:54:41 +08:00 |
朱晗
|
c8952f0da3
|
merge_0721
|
2025-07-21 10:32:38 +08:00 |
朱晗
|
ade6a1d95c
|
Merge remote-tracking branch 'origin/lite' into lcy/refactor
|
2025-07-21 10:31:13 +08:00 |
晓雷
|
ddabd9cc9d
|
PullRequest: 370 [lite] Add Slurm Launcher and Ray Launcher
Merge branch mzy/lite/launcher of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/370
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* .
* .
* .
* fix
|
2025-07-21 10:29:51 +08:00 |
朱晗
|
090850abb4
|
0718_5
|
2025-07-18 13:27:37 +08:00 |
lichangye.lcy
|
a08043e5d0
|
0718_4
|
2025-07-18 13:21:51 +08:00 |
lichangye.lcy
|
0e27a105d9
|
0718_2
|
2025-07-18 11:27:19 +08:00 |
nuzant
|
71c47c5f17
|
fix single node bug (#185)
|
2025-07-18 10:54:49 +08:00 |
朱晗
|
c0176b5bee
|
0718_1
|
2025-07-18 10:25:06 +08:00 |
lichangye.lcy
|
a0323335d1
|
0717_6
|
2025-07-17 18:20:01 +08:00 |
lichangye.lcy
|
587544b2e0
|
0717_6
|
2025-07-17 17:57:40 +08:00 |
朱晗
|
e9dc11283b
|
0717_6
|
2025-07-17 16:50:56 +08:00 |
朱晗
|
ce796f286d
|
0717_5
|
2025-07-17 16:48:29 +08:00 |
朱晗
|
510313b449
|
0717_4
|
2025-07-17 16:31:47 +08:00 |
nuzant
|
0d45f43285
|
[Bug] Fix a bug in ray training utils. (#184)
* fix ray schedule
* .
* .
* remove debug info
|
2025-07-17 16:30:10 +08:00 |