Commit Graph

9 Commits

Author SHA1 Message Date
bowei.fw 25c45c7e83 . 2025-03-22 12:22:34 +08:00
bowei.fw 9dcdb7a684 . 2025-03-21 22:38:21 +08:00
博惟 f90fe19e00 PullRequest: 53 Fix a potential reward hacking issue related to "emptyset"
Merge branch fw/fix-rwd-hacking of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/53

Signed-off-by: 温差 <xushusheng.xss@antgroup.com>


* .
* .
2025-03-21 16:42:24 +08:00
晓雷 4ac9595295 PullRequest: 43 Reduce GPU memory used by data transfer.
Merge branch mzy/fix-data-transfer-oom of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/43

Signed-off-by: 博惟 <bowei.fw@antgroup.com>


* add oom observe logs
* tested
* format and clear code
* .
* format
* remove logging
* .
* add comments
2025-03-18 15:20:57 +08:00
Jun Mo 8310d7beb7 code format 2025-03-18 09:00:13 +08:00
meijun.mei 92e0777178 optimize code/math functioncall param 2025-03-17 15:45:52 +08:00
博惟 fb23009e99 PullRequest: 27 support bf16 training
Merge branch fw/bf16 of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/27

Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* support bf16 training
2025-03-12 13:04:30 +08:00
meijun.mei 6fd0db01c7 add functioncall switch 2025-03-11 18:41:46 +08:00
君末 b3bedd7b9d PullRequest: 17 Support functioncall for math and code verify
Merge branch functioncall-code of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/17

Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* test code evaluation with faas
* support functioncall for code
* fix code crash bug
* format
* .
2025-03-07 11:26:57 +08:00