AReaL

History

Wei Fu 2d4d937d10 [Doc] Add doc for reproducing released models (#73 ) * update benchmark script * . * add benchmark docs * add v0.3.0 configs * . * PullRequest: 178 multi turn math agent training Merge branch gjx/multi-turn-math of git@code.alipay.com:inclusionAI/AReaL.git into main https://code.alipay.com/inclusionAI/AReaL/pull_requests/178?tab=diff Reviewed-by: 博惟 <bowei.fw@antgroup.com> * multi turn math agent training * training data logging and clean math multi-turn exp * fix * . * fix * add docs and config * format * revert multi-turn agent * add config --------- Co-authored-by: 步偶 <sam.gjx@antgroup.com>		2025-06-03 20:33:48 +08:00
..
configs	[Doc] Add doc for reproducing released models (#73 )	2025-06-03 20:33:48 +08:00
main_async_ppo.py	[Doc] Fix documentation for using Docker containers and customized agents (#64 )	2025-06-01 16:33:29 +08:00
main_sft.py	[Doc] Fix documentation for using Docker containers and customized agents (#64 )	2025-06-01 16:33:29 +08:00
main_sync_ppo.py	[Doc] Mark the equivalent between zero-staleness and synchronous PPO (#69 )	2025-06-02 21:44:35 +08:00
utils.py	[Doc & Fix] Simplify the environment setup procedure (#62 )	2025-06-01 14:57:21 +08:00