.

2025-07-30 16:38:26 +08:00 · 2025-07-30 16:38:26 +08:00 · 5ad65846c7
parent e6bf47f7b0
commit 5ad65846c7
2 changed files with 6 additions and 2 deletions
--- a/README.md
+++ b/README.md
@ -116,6 +116,9 @@ Evaluation (on a single node):
 python3 -m arealite.launcher.local examples/arealite/eval.py --config examples/arealite/configs/eval.yaml
 ```

+For more detailed guide on how to run experiments in AReaLite, please check out
+[our quickstart guide](/docs/tutorial/quickstart.md)!
+
 ## Switching from AReaL to AReaLite

 We also provide a convenient script to convert your AReaL YAML config into AReaLite
--- a/docs/arealite/gsm8k_grpo.md
+++ b/docs/arealite/gsm8k_grpo.md
@ -253,7 +253,8 @@ tasks. The following code shows the simplified version of rollout thread impleme
 which iteratively:

 - Checks available capacity. The capacity controls current number of rollout workflows
-  to limit concurrency and data off-policyness.
+  to limit concurrency and **data off-policyness** (The difference between the model
+  version used by generation and the model version updated by the trainer).
 - If there is capacity left and rollout is not paused for weight update, continuously
  obtains data from `input_queue` and creates `asyncio` tasks to run the workflows.
 - Waits for rollout workflows to finish.
@ -435,7 +436,7 @@ After a training step is finished, we transfer new weights from actor engine to
 inference servers:

 1. The rollout engine needs to stop sending generation requests to remote servers
-   (`rollout.pause()`) to avoid server-side congestion.
+   (`rollout.pause()`) before weight update to avoid server-side congestion.
 1. Since we need to invoke weight update on the trainer engine and remote inference
   servers at the same time, in the training script, we asynchronously send requests to
   remote inference servers, and then immediately upload weights on the trainer engine.