This commit is contained in:
晓雷 2025-07-30 16:38:26 +08:00
parent e6bf47f7b0
commit 5ad65846c7
2 changed files with 6 additions and 2 deletions

View File

@ -116,6 +116,9 @@ Evaluation (on a single node):
python3 -m arealite.launcher.local examples/arealite/eval.py --config examples/arealite/configs/eval.yaml
```
For more detailed guide on how to run experiments in AReaLite, please check out
[our quickstart guide](/docs/tutorial/quickstart.md)!
## Switching from AReaL to AReaLite
We also provide a convenient script to convert your AReaL YAML config into AReaLite

View File

@ -253,7 +253,8 @@ tasks. The following code shows the simplified version of rollout thread impleme
which iteratively:
- Checks available capacity. The capacity controls current number of rollout workflows
to limit concurrency and data off-policyness.
to limit concurrency and **data off-policyness** (The difference between the model
version used by generation and the model version updated by the trainer).
- If there is capacity left and rollout is not paused for weight update, continuously
obtains data from `input_queue` and creates `asyncio` tasks to run the workflows.
- Waits for rollout workflows to finish.
@ -435,7 +436,7 @@ After a training step is finished, we transfer new weights from actor engine to
inference servers:
1. The rollout engine needs to stop sending generation requests to remote servers
(`rollout.pause()`) to avoid server-side congestion.
(`rollout.pause()`) before weight update to avoid server-side congestion.
1. Since we need to invoke weight update on the trainer engine and remote inference
servers at the same time, in the training script, we asynchronously send requests to
remote inference servers, and then immediately upload weights on the trainer engine.