bowei.fw
25c45c7e83
.
2025-03-22 12:22:34 +08:00
bowei.fw
9dcdb7a684
.
2025-03-21 22:38:21 +08:00
博惟
f90fe19e00
PullRequest: 53 Fix a potential reward hacking issue related to "emptyset"
...
Merge branch fw/fix-rwd-hacking of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/53
Signed-off-by: 温差 <xushusheng.xss@antgroup.com>
* .
* .
2025-03-21 16:42:24 +08:00
晓雷
4ac9595295
PullRequest: 43 Reduce GPU memory used by data transfer.
...
Merge branch mzy/fix-data-transfer-oom of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/43
Signed-off-by: 博惟 <bowei.fw@antgroup.com>
* add oom observe logs
* tested
* format and clear code
* .
* format
* remove logging
* .
* add comments
2025-03-18 15:20:57 +08:00
Jun Mo
8310d7beb7
code format
2025-03-18 09:00:13 +08:00
meijun.mei
92e0777178
optimize code/math functioncall param
2025-03-17 15:45:52 +08:00
博惟
fb23009e99
PullRequest: 27 support bf16 training
...
Merge branch fw/bf16 of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/27
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* support bf16 training
2025-03-12 13:04:30 +08:00
meijun.mei
6fd0db01c7
add functioncall switch
2025-03-11 18:41:46 +08:00
君末
b3bedd7b9d
PullRequest: 17 Support functioncall for math and code verify
...
Merge branch functioncall-code of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/17
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* test code evaluation with faas
* support functioncall for code
* fix code crash bug
* format
* .
2025-03-07 11:26:57 +08:00