AReaL/examples
博惟 b56f5998ec PullRequest: 371 [lite] [fix] fix misc bugs in GRPO implementation
Merge branch fw/lite-fix0716 of git@code.alipay.com:inclusionAI/AReaL.git into lite
https://code.alipay.com/inclusionAI/AReaL/pull_requests/371

Reviewed-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* .
2025-07-16 17:22:54 +08:00
..
arealite PullRequest: 371 [lite] [fix] fix misc bugs in GRPO implementation 2025-07-16 17:22:54 +08:00
configs [Fix] Fix yaml configurations for v0.2 experiments. (#129) 2025-06-24 13:48:02 +08:00
data_preprocess add a preprocessing script for code training data and update readme (#126) 2025-06-24 09:44:15 +08:00
env format (#174) 2025-07-15 10:24:48 +08:00
run_async_ppo.sh [Feature] Switch dataset path / model path to HF location to ease community usage (#82) 2025-06-06 21:38:06 +08:00
run_sft.sh [Doc & Fix] Simplify the environment setup procedure (#62) 2025-06-01 14:57:21 +08:00
run_sync_ppo.sh PullRequest: 252 [Feature] Fix constants initialization. (#122) 2025-06-23 12:52:49 +08:00