AReaL/realhf/base
Wei Fu 0ff8c59435
[Fix] Merge error fixes. (#152)
* PullRequest: 293 fix get_param_realloc_path

Merge branch xss/debug of git@code.alipay.com:inclusionAI/AReaL.git into gh
https://code.alipay.com/inclusionAI/AReaL/pull_requests/293

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* fix get_param_realloc_path

* PullRequest: 297 bugfix: reward is always -5

Merge branch xss/debug of git@code.alipay.com:inclusionAI/AReaL.git into gh
https://code.alipay.com/inclusionAI/AReaL/pull_requests/297

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* bugfix: reward is always -5

* PullRequest: 321 fix checkpoint save dir

Merge branch xss/debug of git@code.alipay.com:inclusionAI/AReaL.git into gh
https://code.alipay.com/inclusionAI/AReaL/pull_requests/321

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* fix checkpoint save dir

* PullRequest: 328 [Doc] update installation

Merge branch sxj/doc of git@code.alipay.com:inclusionAI/AReaL.git into gh
https://code.alipay.com/inclusionAI/AReaL/pull_requests/328

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* [Doc] update installation

---------

Co-authored-by: 温差 <xushusheng.xss@antgroup.com>
Co-authored-by: 冰临 <shenxujie.sxj@antgroup.com>
2025-07-07 10:30:27 +08:00
..
__init__.py [Feature & Doc & Bug Fix] Add docs, simplified ray-based scripts, and fix issues to stablize asynchronous experiments (#52) 2025-05-28 19:18:05 +08:00
cluster.py PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
constants.py [Fix] Merge error fixes. (#152) 2025-07-07 10:30:27 +08:00
datapack.py PullRequest: 33 Force the partition to be balanced when the capacity is a large number 2025-03-17 10:47:29 +08:00
gpu_utils.py Support asynchronous RL training, Qwen3, and the latest SGLang (#47) 2025-05-26 09:45:13 +08:00
importing.py Initial commit. 2025-02-24 18:58:19 +08:00
logging.py [Fix] Fix yaml configurations for v0.2 experiments. (#129) 2025-06-24 13:48:02 +08:00
monitor.py PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
name_resolve.py PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
names.py [Feature & Doc & Bug Fix] Add docs, simplified ray-based scripts, and fix issues to stablize asynchronous experiments (#52) 2025-05-28 19:18:05 +08:00
network.py PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
numpy_utils.py Initial commit. 2025-02-24 18:58:19 +08:00
pkg_version.py PullRequest: 61 [Patch v0.2.0] Fix the PPO bug in old environments. 2025-03-28 09:58:26 +08:00
prologue.py PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
ray_utils.py Initial commit. 2025-02-24 18:58:19 +08:00
recover.py PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
saveload_utils.py Initial commit. 2025-02-24 18:58:19 +08:00
security.py Initial commit. 2025-02-24 18:58:19 +08:00
seeding.py Merge updates from ant repository. (#34) 2025-04-27 11:09:25 +08:00
slurm_utils.py PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
stats_tracker.py Support asynchronous RL training, Qwen3, and the latest SGLang (#47) 2025-05-26 09:45:13 +08:00
testing.py PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00
timeutil.py PullRequest: 21 fix timeutil consistency during recover 2025-03-10 16:27:55 +08:00
topology.py PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00