Commit Graph

882 Commits

Author SHA1 Message Date
博惟 8b1b2aa494 PullRequest: 2 Add CLI options such that users can control loss scale window and initial loss scale.
Merge branch fw/loss-scale of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/2

Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* .
2025-02-27 16:48:12 +08:00
博惟 4b11997721 PullRequest: 2 Add CLI options such that users can control loss scale window and initial loss scale.
Merge branch fw/loss-scale of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/2

Signed-off-by: 晓雷 <meizhiyu.mzy@antgroup.com>


* .
2025-02-27 16:48:12 +08:00
晓雷 0aaf6d6c6d PullRequest: 1 Normalize loss scale by tokens across micro batches
Merge branch fix-loss-scale of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/1

Signed-off-by: 博惟 <bowei.fw@antgroup.com>
2025-02-27 16:47:04 +08:00
晓雷 db3e3ded9c PullRequest: 1 Normalize loss scale by tokens across micro batches
Merge branch fix-loss-scale of git@code.alipay.com:inclusionAI/AReaL.git into main
https://code.alipay.com/inclusionAI/AReaL/pull_requests/1

Signed-off-by: 博惟 <bowei.fw@antgroup.com>
2025-02-27 16:47:04 +08:00
meizhiyu.mzy e60e68244e . 2025-02-27 16:38:42 +08:00
meizhiyu.mzy b7815bb2c3 . 2025-02-27 16:38:42 +08:00
meizhiyu.mzy 30852feb96 . 2025-02-27 16:06:19 +08:00
meizhiyu.mzy e2e1b231d8 . 2025-02-27 16:06:19 +08:00
meizhiyu.mzy 75d745ee54 . 2025-02-27 15:51:17 +08:00
meizhiyu.mzy 40b5d12490 . 2025-02-27 15:51:17 +08:00
bowei.fw 63b6f286d8 fix 2025-02-27 15:47:27 +08:00
bowei.fw e30790db11 fix 2025-02-27 15:47:27 +08:00
bowei.fw 78b724f5ca updata doc 2025-02-27 15:44:39 +08:00
bowei.fw 3c5f603f59 updata doc 2025-02-27 15:44:39 +08:00
meizhiyu.mzy 50a4e5dae8 . 2025-02-27 15:40:21 +08:00
meizhiyu.mzy 35f50e3f53 . 2025-02-27 15:40:21 +08:00
meizhiyu.mzy 37fdf91843 normalize loss scale by tokens 2025-02-27 11:36:26 +08:00
meizhiyu.mzy 2436ce519e normalize loss scale by tokens 2025-02-27 11:36:26 +08:00
Wei Guo 944a1dec6e Merge pull request #2 from kdada/main
Fix evaluation script and update tutorials
2025-02-25 21:23:54 +08:00
Wei Guo 241185227d Merge pull request #2 from kdada/main
Fix evaluation script and update tutorials
2025-02-25 21:23:54 +08:00
Wei Guo 5cc4516d85 Update tutorials 2025-02-25 20:55:52 +08:00
Wei Guo 3b2788d32f Update tutorials 2025-02-25 20:55:52 +08:00
Wei Guo 1fb27bd6d1 Fix dp size for evaluation script 2025-02-25 20:55:08 +08:00
Wei Guo 82c9b08e4b Fix dp size for evaluation script 2025-02-25 20:55:08 +08:00
Wei Fu ea0096a26f Merge pull request #1 from inclusionAI/fw/sct-ack
Update README.md
2025-02-25 10:39:01 +08:00
Wei Fu 8fc76fadf0 Merge pull request #1 from inclusionAI/fw/sct-ack
Update README.md
2025-02-25 10:39:01 +08:00
Wei Fu d48dc0315f Update README.md 2025-02-25 10:37:37 +08:00
Wei Fu c8a3090acb Update README.md 2025-02-25 10:37:37 +08:00
Wei Fu c513f0562a Update README.md 2025-02-25 09:55:34 +08:00
Wei Fu 0f2c5c564d Update README.md 2025-02-25 09:55:34 +08:00
Wei Guo 4c235a1d51 Remove deleted parameters 2025-02-25 01:08:40 +08:00
晓雷 2963a67311 Initial commit. 2025-02-24 18:58:19 +08:00