bowei.fw
8208f986b9
.
2025-07-14 16:44:32 +08:00
bowei.fw
d8bd161b1d
.
2025-07-14 16:39:59 +08:00
博惟
724628eaf0
PullRequest: 355 [Lite] GRPO pre-commit 2: Refactor RemoteSGLangEngine thread and SGLang configuration
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/355?tab=commit
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
* .
* .
* fix
* .
2025-07-14 15:20:17 +08:00
bowei.fw
938c06a652
fix
2025-07-14 14:58:28 +08:00
bowei.fw
037adedc70
.
2025-07-14 11:27:05 +08:00
博惟
d8038b2669
PullRequest: 354 [lite] GRPO pre-commit: minor changes in FSDP engine
Merge branch fw/lite-fix1 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/354
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* .
* .
* .
2025-07-14 11:19:40 +08:00
晓雷
434d2f5064
PullRequest: 353 [Lite] Add gradient checkpointing to FSDPEngine
Merge branch mzy/add-gradient-ckpt of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/353
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* add gradient checkpointing
2025-07-14 10:39:15 +08:00
bowei.fw
29172e0e10
.
2025-07-14 10:09:24 +08:00
bowei.fw
ae20d51cce
.
2025-07-13 22:12:54 +08:00
bowei.fw
932f9b9232
32k run
2025-07-13 21:48:48 +08:00
bowei.fw
eda0e79725
debug
2025-07-13 15:27:49 +08:00
bowei.fw
34b6941a2f
ready for boba
2025-07-12 19:20:02 +08:00
bowei.fw
43b3c3f8d0
slurm run
2025-07-12 17:57:58 +08:00
bowei.fw
0d03141cbc
merge slurm launcher
2025-07-12 14:54:49 +08:00
bowei.fw
d65df45780
fix
2025-07-12 14:53:43 +08:00
bowei.fw
fda19261cd
async works
2025-07-12 13:43:57 +08:00
meizhiyu.mzy
1af70c332c
.
2025-07-12 12:27:06 +08:00
晓雷
e222cea659
.
2025-07-12 11:43:29 +08:00
meizhiyu.mzy
805437463f
sglang server wrapper
2025-07-12 11:08:21 +08:00
bowei.fw
bb67116f51
fix
2025-07-12 09:59:09 +08:00
bowei.fw
d37405081b
fix
2025-07-12 09:45:42 +08:00
bowei.fw
888751da38
qwen2 grpo works
2025-07-12 09:44:50 +08:00
meizhiyu.mzy
4a26f28adf
.
2025-07-11 22:56:59 +08:00
晓雷
97511e43ff
fix arg parse
2025-07-11 16:47:29 +08:00
晓雷
57ce1213ae
merge ppo
2025-07-11 16:24:07 +08:00
晓雷
6acd1696f6
slurm launcher not tested
2025-07-11 16:14:59 +08:00
bowei.fw
0cbddb8aba
.
2025-07-11 11:10:55 +08:00
Wei Fu
8d4b8dc90f
[Doc] Add an instruction about how to run the SFT example. (#164)
2025-07-10 13:14:10 +08:00
Wei Fu
3bf9c85e40
[Fix] Merge previous contributions from fw/refactor to lite (#163)
* initial proposal
* add arealite
* .
* change api
* .
* remove LOG_ROOT
* remove MODEL_SAVE_PATH
* remove PARAM_REALLOC_PATH, DATASET_CACHE
* prepare for testing
* prepare for testing
* ready for run
* local run
* tests mainly pass
* format
* .
* amend cluster.py
* .
* .
* client test pass
* pass rollout test
* remove unused imports
* add arealite readme
* change api
* .
* .
* .
* .
* .
* .
* .
* .
* format
* .
* implement interruptible generation (#112)
Co-authored-by: zhaochenyang <zhaochenyang20@gmail.com>
* .
* fix
* .
* .
* .
* pass controller generate batch test
* .
* refactor rollout controller into worker and controller
* .
* .
* .
* change to async rollout
* pass rollout controller test
* pass test
* .
* update readme
* .
* sft debug
* .
* add license
* remove unused files
* remove unused args in ppo
* add hf engine wrapper (#116)
* add hf engine
* fix issues
* fix ppo bugs and add test
* add hf client interface and modify cli args
* fix bugs
* fix issues
* Merge fw/refactor
* Finish hf wrapper test
* add test
---------
Co-authored-by: Wei Fu <36355462+garrett4wade@users.noreply.github.com>
* format
* format
* .
* refine hf engine
* .
* fix
* add fsdp engine and sft tests
* .
* .
* .
* pass ppo unittest
* pass ppo and rollout controller tests
* clear unused imports
* rename ppo to grpo
* change reward function organization
* reorganize code
* add dataset api
* .
* .
* .
* format
* chmod fix
* .
* rename workflow to collector
* refactor llm_client location
* .
* .
* fix llm server api
* refactor config structure
* .
* fix tests
* .
* .
* .
* Fix unresolved issue in SFTTrainer PR (#139)
* .
* .
* efficient loading
* format
* .
* .
* .
* .
* .
* .
* Add CI for testing AReaLite (#150)
* ci: add test-arealite
* ci: add checkout before running test-arealite
* ci: add USERNAME
* ci: add test script
* ci: add GitHub mirror
* ci: fix typo
* ci: clone one commit
* ci: fix condition
* ci: set command timeout to 60m
* ci: enable pip cache
* ci: optimize container lifecycle
* ci: split into many stages
* ci(test-arealite): fix typo
* ci: fix wrong env
* ci: fix pytest
* ci: uninstall transformer-engine
* ci: uninstall transformer-engine
* ci: fix model paths
* ci: show stdout/stderr
* ci: fix not clean up
* ci: backup sglang
* ci: remove tmp repo dir when run
* ci: fix docker run exit 1 condition
* ci(test-arealite): limit the concurrency and extend command timeout
* .
* merge fw/refactor
* revert some changes
* fix
---------
Co-authored-by: meizhiyu.mzy <meizhiyu.mzy@antgroup.com>
Co-authored-by: Chayenne <zhaochen20@outlook.com>
Co-authored-by: zhaochenyang <zhaochenyang20@gmail.com>
Co-authored-by: Jayon02 <qiujiangc@outlook.com>
Co-authored-by: root <meizhiyu.mzy>
Co-authored-by: Zijian Zhang <futrime@outlook.com>
2025-07-10 12:56:24 +08:00
博惟
d48bf007cf
Merge branch 'main' of https://github.com/inclusionAI/AReaL into lite
2025-07-10 12:53:30 +08:00
博惟
42c717b6e4
Merge branch 'lite' of https://github.com/inclusionAI/AReaL into lite
2025-07-10 11:15:01 +08:00
博惟
c38cffc023
PullRequest: 340 [lite] Refactor trainer API into utilities and remove mb_spec in engine methods
Merge branch fw/lite-dev of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/340
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* support fsdp engine and sglang remote engine
* minor fix
* .
* refactor trainer
* add close
* rm mb_spec
* fix
2025-07-10 11:10:10 +08:00
bowei.fw
1bb23f2399
Merge branch 'fw/lite-dev' of code.alipay.com:inclusionAI/AReaL into fw/ppo
2025-07-10 11:08:47 +08:00
bowei.fw
520e38b1ee
fix
2025-07-10 11:06:57 +08:00
bowei.fw
347bcc07a6
.
2025-07-10 11:05:08 +08:00
Zijian Zhang
a203c7c750
ci: fix env sha
2025-07-09 22:31:46 +08:00
bowei.fw
a78fd2dd24
rm mb_spec
2025-07-09 22:16:31 +08:00
bowei.fw
32077b02ed
add close
2025-07-09 21:56:27 +08:00
bowei.fw
b75aaaf130
refactor trainer
2025-07-09 21:48:28 +08:00
Zijian Zhang
a70cd28f8f
ci: fix on demand condition
2025-07-09 21:19:15 +08:00
Zijian Zhang
8e201ef3e2
ci: build images on demand
2025-07-09 21:16:23 +08:00
Zijian Zhang
ee6f5a89f3
chore: empty commit
2025-07-09 18:52:38 +08:00
bowei.fw
9d756686cb
Merge branch 'lite' of code.alipay.com:inclusionAI/AReaL into fw/lite-dev
2025-07-09 16:57:36 +08:00
博惟
7be4ab0d18
PullRequest: 339 [Fix] Fix some minor issues to pass all tests.
Merge branch fw/lite of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/339
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* support fsdp engine and sglang remote engine
* minor fix
* .
2025-07-09 16:51:26 +08:00
bowei.fw
fe1f622841
.
2025-07-09 16:48:57 +08:00
bowei.fw
124184a7b7
minor fix
2025-07-09 16:48:11 +08:00
bowei.fw
a6bcab22ba
support fsdp engine and sglang remote engine
2025-07-09 16:45:29 +08:00
博惟
15dfbe837c
PullRequest: 332 [lite] Support FSDP engines
Merge branch mzy/lite/engines of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/332
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* fsdp2 engine
* fix utils
* add fsdp engine test
* .
* fsdp engine test passed
* unsqueeze immediately before model inputs and after model outputs
* add optimizer save/load, add position id calculation for input
* .
* format
* not to squeeze
* add train and eval api
* .
* .
* improve fsdp engine data preprocessing
* format
* PullRequest: 337 [lite] Add SFT trainer example.
* trainer log
* minor changes
* add update weights from disk
* fix type annotation
2025-07-09 16:24:25 +08:00
郭唯
7a438c0650
PullRequest: 336 add wrapper
Merge branch lite-util-wrapper of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/336
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* add wrapper
2025-07-09 15:44:27 +08:00
博惟
8771778995
PullRequest: 331 [lite] Support remote sglang engine with corresponding testcases.
Merge branch fw/lite of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/331
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* add test for sglang remote engine
* fix
2025-07-09 14:18:55 +08:00