Wei Fu
c0200f10d0
[Feature] Support behavior importance weight capping and update evaluation scripts ( #59 )
...
* PullRequest: 168 添加Codeforces测试,修复其它测试问题
Merge branch areal-eval-0.3 of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/168
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* fix eval and add codeforces elo calc
* fix codeforce test
* fix qwen3 prompt
* change annotations to eng
* add code verify files
* PullRequest: 173 [FIX} format code and fix a recover error in rollout worker
Merge branch fw/fix-rollout-recover of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/173
Reviewed-by: 温差 <xushusheng.xss@antgroup.com>
* format code and fix a recover error in rollout worker
* PullRequest: 171 更新评估文档
Merge branch eval-doc of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/171?tab=diff
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* update eval doc
* complete eval doc
* complete eval doc
* fix ood info
* add data obtaining guide
* fix supported datasets
* PullRequest: 174 decouple max_behav_imp_weight and c_clip & track entropy, positve_seq_len and negative_seq_len
Merge branch xss/max_behav_imp_weight of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/174
Reviewed-by: 博惟 <bowei.fw@antgroup.com>
* decouple max_behav_imp_weight and c_clip
* rename log: positve_* -> correct_*, negative_* -> incorrect_*
* rename hyper-parameter: max_behav_imp_weight -> behav_imp_weight_cap
* PullRequest: 175 [Fix] Fix the "event loop is already running" error in ray scripts
Merge branch fw/fix-ray-asyncio of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/175
Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* format code and fix a recover error in rollout worker
* .
---------
Co-authored-by: 乘鹭 <hechuyi.hcy@antgroup.com>
Co-authored-by: 温差 <xushusheng.xss@antgroup.com>
2025-05-30 10:29:21 +08:00
温差
1b0306631b
PullRequest: 70 update evalution: aime25, gpqa
...
Merge branch xss/eval of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/70
Signed-off-by: 博惟 <bowei.fw@antgroup.com>
* update evalution: aime25, gpqa
* aime25 dataset
* format code
2025-03-30 20:16:08 +08:00
晓雷
e9bf229581
PullRequest: 11 支持训练时自动拉起evaluate任务
...
Merge branch mzy/auto-eval of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/11
Signed-off-by: 博惟 <bowei.fw@antgroup.com>
* test
* move evaluator to main process
* .
* clear codes
* add docstring
* .
* separate wandb groups
* .
2025-03-05 18:06:40 +08:00
Wei Guo
82c9b08e4b
Fix dp size for evaluation script
2025-02-25 20:55:08 +08:00
晓雷
2963a67311
Initial commit.
2025-02-24 18:58:19 +08:00