Commit Graph

8 Commits

Author SHA1 Message Date
bowei.fw de8243cc78 . 2025-03-21 21:22:50 +08:00
博惟 69681d9fe4 PullRequest: 35 Support log probability recomputation in PPO.
Merge branch fw/recompute-logprob of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/35

Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* .
* .
* .
* .
2025-03-17 14:38:53 +08:00
博惟 26a48be73e PullRequest: 29 fix the dataloading bug during recover
Merge branch fw/fix-recover-dataloading of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/29

Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* fix the dataloading bug during recover
2025-03-12 16:00:49 +08:00
博惟 fb23009e99 PullRequest: 27 support bf16 training
Merge branch fw/bf16 of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/27

Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* support bf16 training
2025-03-12 13:04:30 +08:00
君末 b3bedd7b9d PullRequest: 17 Support functioncall for math and code verify
Merge branch functioncall-code of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/17

Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* test code evaluation with faas
* support functioncall for code
* fix code crash bug
* format
* .
2025-03-07 11:26:57 +08:00
博惟 46c5a10eb9 PullRequest: 5 修改微批次分割逻辑
Merge branch fw/balanced-datapck of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/5

Signed-off-by: 温差 <xushusheng.xss@antgroup.com>


* .
* fw/fix-dataloading-not-shuffle
* .
* .
* .
* .
* .
2025-03-03 09:10:04 +08:00
meizhiyu.mzy 2436ce519e normalize loss scale by tokens 2025-02-27 11:36:26 +08:00
晓雷 2963a67311 Initial commit. 2025-02-24 18:58:19 +08:00