Commit Graph

882 Commits

Author SHA1 Message Date
朱晗 8c7affe108 merge_lite 2025-07-10 15:59:25 +08:00
antoinegg1 e1f28531e8 0710_4 2025-07-10 14:14:16 +08:00
朱晗 0640d5aa47 0710_5 2025-07-10 14:00:31 +08:00
朱晗 0d3c579a20 0710_3 2025-07-10 13:56:26 +08:00
antoinegg1 622781d4bb 0710_3 2025-07-10 13:40:54 +08:00
Wei Fu 8d4b8dc90f [Doc] Add an instruction about how to run the SFT example. (#164) 2025-07-10 13:14:10 +08:00
antoinegg1 27c06b9d48 Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AReaL into lcy/refactor 2025-07-10 13:05:07 +08:00
朱晗 50cf951a1b 0710_3 2025-07-10 13:04:52 +08:00
antoinegg1 3122d900a6 Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AReaL into lcy/refactor 2025-07-10 13:01:44 +08:00
朱晗 e57cb20855 0710_2 2025-07-10 12:58:23 +08:00
Wei Fu 3bf9c85e40 [Fix] Merge previous contributions from fw/refactor to lite (#163)
* initial proposal

* add arealite

* .

* change api

* .

* remove LOG_ROOT

* remove MODEL_SAVE_PATH

* remove PARAM_REALLOC_PATH, DATASET_CACHE

* prepare for testing

* prepare for testing

* ready for run

* local run

* tests mainly pass

* format

* .

* amend cluster.py

* .

* .

* client test pass

* pass rollout test

* remove unused imports

* add arealite readme

* change api

* .

* .

* .

* .

* .

* .

* .

* .

* format

* .

* implement iteraptable generation (#112)

Co-authored-by: zhaochenyang <zhaochenyang20@gmail.com>

* .

* fix

* .

* .

* .

* pass controller generate batch test

* .

* refactor rollout controller into worker and controller

* .

* .

* .

* change to async rollout

* pass rollout controller test

* pass test

* .

* update readme

* .

* sft debug

* .

* add license

* remove unused files

* remove unused args in ppo

* add hf engine wrapper  (#116)

* add hf engine

* fix issues

* fix ppo bugs and add test

* add hf client interface and modify cli args

* fix bugs

* fix issues

* Merge fw/refactor

* Finish hf wrapper test

* add test

---------

Co-authored-by: Wei Fu <36355462+garrett4wade@users.noreply.github.com>

* format

* format

* .

* refine hf engine

* .

* fix

* add fsdp engine and sft tests

* .

* .

* .

* pass ppo unittest

* pass ppo and rollout controller tests

* clear unused imports

* rename ppo to grpo

* change reward function organization

* reorganize code

* add dataset api

* .

* .

* .

* format

* chmod fix

* .

* rename workflow to collector

* refactor llm_client location

* .

* .

* fix llm server api

* refactor config structure

* .

* fix tests

* .

* .

* .

* Fix unresolved issue in SFTTrainer PR (#139)

* .

* .

* efficient loading

* format

* .

* .

* .

* .

* .

* .

* Add CI for testing AReaLite (#150)

* ci: add test-arealite

* ci: add checkout before running test-arealite

* ci: add USERNAME

* ci: add test script

* ci: add GitHub mirror

* ci: fix typo

* ci: clone one commit

* ci: fix condition

* ci: set command timeout to 60m

* ci: enable pip cache

* ci: optimize container lifecycle

* ci: split into many stages

* ci(test-arealite): fix typo

* ci: fix wrong env

* ci: fix pytest

* ci: uninstall transformer-engine

* ci: uninstall transformer-engine

* ci: fix model paths

* ci: show stdout/stderr

* ci: fix not clean up

* ci: backup sglang

* ci: remove tmp repo dir when run

* ci: fix docker run exit 1 condition

* ci(test-arealite): limit the concurrency and extend command timeout

* .

* merge fw/refactor

* revert some changes

* fix

---------

Co-authored-by: meizhiyu.mzy <meizhiyu.mzy@antgroup.com>
Co-authored-by: Chayenne <zhaochen20@outlook.com>
Co-authored-by: zhaochenyang <zhaochenyang20@gmail.com>
Co-authored-by: Jayon02 <qiujiangc@outlook.com>
Co-authored-by: root <meizhiyu.mzy>
Co-authored-by: Zijian Zhang <futrime@outlook.com>
2025-07-10 12:56:24 +08:00
antoinegg1 496413f2ac 0710_2 2025-07-10 12:54:12 +08:00
博惟 d48bf007cf Merge branch 'main' of https://github.com/inclusionAI/AReaL into lite 2025-07-10 12:53:30 +08:00
antoinegg1 0cd58b5f38 0710_1 2025-07-10 12:43:32 +08:00
博惟 42c717b6e4 Merge branch 'lite' of https://github.com/inclusionAI/AReaL into lite 2025-07-10 11:15:01 +08:00
博惟 c38cffc023 PullRequest: 340 [lite] Refactor trainer API into utilities and remove mb_spec in engine methods
Merge branch fw/lite-dev of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/340

Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* support fsdp engine and sglang remote engine
* minor fix
* .
* refactor trainer
* add close
* rm mb_spec
* fix
2025-07-10 11:10:10 +08:00
Zijian Zhang a203c7c750 ci: fix env sha 2025-07-09 22:31:46 +08:00
Zijian Zhang a70cd28f8f ci: fix on demand condition 2025-07-09 21:19:15 +08:00
Zijian Zhang 8e201ef3e2 ci: build images on demand 2025-07-09 21:16:23 +08:00
Zijian Zhang ee6f5a89f3 chore: empty commit 2025-07-09 18:52:38 +08:00
博惟 7be4ab0d18 PullRequest: 339 [Fix] Fix some minor issues to pass all tests.
Merge branch fw/lite of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/339

Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* support fsdp engine and sglang remote engine
* minor fix
* .
2025-07-09 16:51:26 +08:00
博惟 15dfbe837c PullRequest: 332 [lite] Support FSDP engines
Merge branch mzy/lite/engines of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/332

Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* fsdp2 engine
* fix utils
* add fsdp engine test
* .
* fsdp engine test passed
* unsqueeze immediately before model inputs and after model outputs
* add optimizer save/load, add position id calculation for input
* .
* format
* not to squeeze
* add train and eval api
* .
* .
* improve fsdp engine data preprocessing
* format
* PullRequest: 337 [lite] Add SFT trainer example.
* trainer log
* minor changes
* add update weights from disk
* fix type annotation
2025-07-09 16:24:25 +08:00
朱晗 2edcd2a92c 0709_9 2025-07-09 16:06:26 +08:00
朱晗 3eaf620cc9 0709_8 2025-07-09 16:03:32 +08:00
antoinegg1 92f144eee9 0709_7 2025-07-09 16:00:20 +08:00
郭唯 7a438c0650 PullRequest: 336 add wrapper
Merge branch lite-util-wrapper of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/336

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* add wrapper
2025-07-09 15:44:27 +08:00
朱晗 8a7d6566b6 0709_7 2025-07-09 15:40:47 +08:00
朱晗 7379a9d7eb 0709_6 2025-07-09 15:15:38 +08:00
antoinegg1 605342db4d 0709_ 2025-07-09 15:09:53 +08:00
朱晗 c01052a016 0709_5 2025-07-09 14:51:55 +08:00
朱晗 223cafd6ea 0709_4 2025-07-09 14:48:17 +08:00
博惟 8771778995 PullRequest: 331 [lite] Support remote sglang engine with corresponding testcases.
Merge branch fw/lite of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/331

Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* .
* add test for sglang remote engine
* fix
2025-07-09 14:18:55 +08:00
朱晗 e7991fceb3 0709_3 2025-07-09 14:04:53 +08:00
朱晗 2b6f962743 0709_2 2025-07-09 11:42:29 +08:00
朱晗 184f9e88ac 0709_1 2025-07-09 10:55:18 +08:00
Wei Fu b9dbd4a2c1 Update to persistent wechat QR code. (#159) 2025-07-09 10:50:19 +08:00
朱晗 3d3f682f7a 0708_4 2025-07-08 17:42:13 +08:00
antoinegg1 b584cd25cd 0708_7 2025-07-08 12:39:01 +08:00
朱晗 fcfa067cdf 0708_3 2025-07-08 12:11:59 +08:00
朱晗 74a2ebaf74 0708_2 2025-07-08 11:27:59 +08:00
antoinegg1 6018376691 Merge remote-tracking branch 'origin/lcy/refactor' into lcy/refactor 2025-07-08 11:13:03 +08:00
antoinegg1 aced39b894 0708_1 2025-07-08 11:12:10 +08:00
朱晗 b006b31161 f 2025-07-07 18:41:27 +08:00
antoinegg1 90f4cf0182 0707_undone 2025-07-07 18:40:17 +08:00
朱晗 645b58c69e refactor1 2025-07-07 18:03:35 +08:00
博惟 57b9b945ab add readme 2025-07-07 17:46:46 +08:00
朱晗 9dd893cce4 0707_7 2025-07-07 17:45:01 +08:00
antoinegg1 34a64a9c06 0707_6 2025-07-07 17:35:05 +08:00
garrett4wade cf0db6a43c format 2025-07-07 17:02:32 +08:00
garrett4wade 7ab6755379 Merge branch 'lite' of https://github.com/inclusionAI/AReaL into lite 2025-07-07 17:02:21 +08:00