AReaL/realhf
xssstory 17ea7fe94d
fix math reward verifier (#156)
* PullRequest: 293 fix get_param_realloc_path

Merge branch xss/debug of git@code.alipay.com:inclusionAI/AReaL.git into gh
https://code.alipay.com/inclusionAI/AReaL/pull_requests/293

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* fix get_param_realloc_path

* PullRequest: 297 bugfix: reward is always -5

Merge branch xss/debug of git@code.alipay.com:inclusionAI/AReaL.git into gh
https://code.alipay.com/inclusionAI/AReaL/pull_requests/297

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* bugfix: reward is always -5

* PullRequest: 321 fix checkpoint save dir

Merge branch xss/debug of git@code.alipay.com:inclusionAI/AReaL.git into gh
https://code.alipay.com/inclusionAI/AReaL/pull_requests/321

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* fix checkpoint save dir

* PullRequest: 328 [Doc] update installation

Merge branch sxj/doc of git@code.alipay.com:inclusionAI/AReaL.git into gh
https://code.alipay.com/inclusionAI/AReaL/pull_requests/328

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* [Doc] update installation

* PullRequest: 329 bugfix: math verifier blocks the async training

Merge branch xss/debug of git@code.alipay.com:inclusionAI/AReaL.git into gh
https://code.alipay.com/inclusionAI/AReaL/pull_requests/329

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* bugfix: math verifier block the async training

* format

---------

Co-authored-by: 冰临 <shenxujie.sxj@antgroup.com>
Co-authored-by: garrett4wade <fuwth17@gmail.com>
2025-07-07 15:49:13 +08:00
..
api PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
apps PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
base [Fix] Merge error fixes. (#152) 2025-07-07 10:30:27 +08:00
experiments PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
impl fix math reward verifier (#156) 2025-07-07 15:49:13 +08:00
scheduler [Fix] Fix yaml configurations for v0.2 experiments. (#129) 2025-06-24 13:48:02 +08:00
system [Fix] Merge error fixes. (#152) 2025-07-07 10:30:27 +08:00
__init__.py [Feature & Doc & Bug Fix] Add docs, simplified ray-based scripts, and fix issues to stablize asynchronous experiments (#52) 2025-05-28 19:18:05 +08:00
utils.py Fix formatting (#90) 2025-06-06 21:42:12 +08:00
version.py Merge updates from ant repository. (#34) 2025-04-27 11:09:25 +08:00