AReaL/examples/arealite
朱晗 6aeeabf7b9 0724_4 2025-07-24 19:29:46 +08:00
..
configs 0724_merge3 2025-07-24 15:22:12 +08:00
dataset 0724_1 2025-07-24 13:30:20 +08:00
boba.py PullRequest: 392 [lite] Fix several bugs regarding RL learning and add an example to reproduce boba-math results. 2025-07-21 11:26:36 +08:00
clevr_count_70k_grpo.py 0724_4 2025-07-24 19:29:46 +08:00
clevr_count_70k_sft.py 0724_merge3 2025-07-24 15:22:12 +08:00
gsm8k_grpo.py 0724_merge5 2025-07-24 15:38:24 +08:00
gsm8k_sft.py 0724_merge5 2025-07-24 15:38:24 +08:00