AReaL/training
Wei Fu 2d4d937d10
[Doc] Add doc for reproducing released models (#73)
* update benchmark script

* .

* add benchmark docs

* add v0.3.0 configs

* .

* PullRequest: 178 multi turn math agent training

Merge branch gjx/multi-turn-math of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/178?tab=diff

Reviewed-by: 博惟 <bowei.fw@antgroup.com>


* multi turn math agent training
* training data logging and clean math multi-turn exp
* fix
* .

* fix

* add docs and config

* format

* revert multi-turn agent

* add config

---------

Co-authored-by: 步偶 <sam.gjx@antgroup.com>
2025-06-03 20:33:48 +08:00
..
configs [Doc] Add doc for reproducing released models (#73) 2025-06-03 20:33:48 +08:00
main_async_ppo.py [Doc] Fix documentation for using Docker containers and customized agents (#64) 2025-06-01 16:33:29 +08:00
main_sft.py [Doc] Fix documentation for using Docker containers and customized agents (#64) 2025-06-01 16:33:29 +08:00
main_sync_ppo.py [Doc] Mark the equivalent between zero-staleness and synchronous PPO (#69) 2025-06-02 21:44:35 +08:00
utils.py [Doc & Fix] Simplify the environment setup procedure (#62) 2025-06-01 14:57:21 +08:00