Merge branch fw/fix-fp16 of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/34
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
Signed-off-by: 步偶 <sam.gjx@antgroup.com>
* fix bf16 training issue
Merge branch fw/topo of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/30?tab=comment
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* change non-training topo order
* .
* fix the data-loading bug during recovery
* fix typo
Merge branch fw/fix-recover-dataloading of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/29
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* fix the data-loading bug during recovery
Merge branch rw_save_version of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/25
Signed-off-by: 博惟 <bowei.fw@antgroup.com>
* fix the save version in rw interface
* format
Merge branch fw/fix-timeutil-recover of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/21
Signed-off-by: 步偶 <sam.gjx@antgroup.com>
* fix timeutil consistency during recovery
Merge branch fw/datatransfer-v2 of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/9
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* fw/fix-dataloading-not-shuffle
* .
* .
* .
* .
* .
* add v2 master worker
* cpu test pass
* ppo run
* .
* pass sft test
* pass ppo dp test
* format
* fix
* run
* .
* cleanup
* .
* format
* run
* merge and format
* refactor
* sft pass
* .
* format
* format
* format
* .
* .
Merge branch functioncall-code of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/17
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* test code evaluation with faas
* support functioncall for code
* fix code crash bug
* format
* .
Merge branch fw/uvloop of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/6
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* .
* fw/fix-dataloading-not-shuffle
* .
* .
* .
* .
* .
* add v2 master worker
* cpu test pass
* ppo run
* .
* format
* fix
* merge and format
* change default env vars to v1 worker
Merge branch ranghou-math of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/18
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
Signed-off-by: 博惟 <bowei.fw@antgroup.com>
* support specifying the number of GPUs and the amount of memory for actors
* PR fix
* support tensorboard
* bug fix
* add doc
Merge branch mzy/auto-eval of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/11
Signed-off-by: 博惟 <bowei.fw@antgroup.com>
* test
* move evaluator to main process
* .
* clean up code
* add docstring
* .
* separate wandb groups
* .
Merge branch fw/update-requirements of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/14
Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>
* format and update requirements.txt
* cleanup
Merge branch math_verifier_timeout of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/8
Signed-off-by: 博惟 <bowei.fw@antgroup.com>
* debug: math verifier timeout
* timeout in math_verify_utils
* format