lzhengning
  • Joined on 2022-03-22
lzhengning synced commits to refs/pull/2167/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:57 +08:00
856fccc1ef Merge bb31006f89491b6da74a53714b4f077651100436 into 0e026982ce2ed10b27ec569c6e42035cb9118f62
0e026982ce Example 77 add blackwell fmha bwd for MLA shape (#2466)
9a9a579714 Merge pull request #2489 from NVIDIA/update_workflow_script
51d730b8be Support "CuTe DSL" auto-labeling in workflow
6c0c8b7484 1. Update bug/feature report template to add component selection. (#2485)
Compare 10 commits »
lzhengning synced commits to refs/pull/2160/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:57 +08:00
9312fa5145 Merge 9d9fd239a13f7bd3abb8e136026daf23dc9ef7bf into 6c0c8b7484870da009d580060a4918550eba7bb2
6c0c8b7484 1. Update bug/feature report template to add component selection. (#2485)
e51efbfe18 Update CHANGELOG.md
fd6cfe1ed0 v4.1 release update v2. (#2481)
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
Compare 7 commits »
lzhengning synced commits to refs/pull/2078/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:56 +08:00
f2a6bf1cad Merge a51e7794fcbae60145f68634fea697bbc28f594e into 6c0c8b7484870da009d580060a4918550eba7bb2
6c0c8b7484 1. Update bug/feature report template to add component selection. (#2485)
e51efbfe18 Update CHANGELOG.md
fd6cfe1ed0 v4.1 release update v2. (#2481)
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
Compare 7 commits »
lzhengning synced commits to refs/pull/2035/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:56 +08:00
352018611e Merge 3b34c090332aba0c03e3ad338a0f883e528a6fa1 into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
Compare 4 commits »
lzhengning synced commits to refs/pull/1891/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:56 +08:00
5a5ba20cd2 Merge 105cf9c8691f3317765a64966cc3240e4da601db into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
Compare 4 commits »
lzhengning synced commits to refs/pull/1832/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:56 +08:00
be7f803fb2 Merge b75dfe0a1e5a3ca71ca71fc06c5347a1906f4922 into 9a9a579714a7075546be7aa20af89fb17d0cd56f
9a9a579714 Merge pull request #2489 from NVIDIA/update_workflow_script
51d730b8be Support "CuTe DSL" auto-labeling in workflow
6c0c8b7484 1. Update bug/feature report template to add component selection. (#2485)
e51efbfe18 Update CHANGELOG.md
Compare 10 commits »
lzhengning synced commits to refs/pull/1702/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:56 +08:00
98eec81bcc Merge dbf42ae1c71216300ecc92df120296879fcfee2e into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
a1aaf2300a v4.1 release
Compare 24 commits »
lzhengning synced commits to refs/pull/1653/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:56 +08:00
228e770181 Merge f72fe012f61ea31f5adbeffadd000c27c6c2e93f into 664c4f7b3ed1959414905025728eef5568209479
664c4f7b3e Update CUTLASS version to 4.1
0e026982ce Example 77 add blackwell fmha bwd for MLA shape (#2466)
9a9a579714 Merge pull request #2489 from NVIDIA/update_workflow_script
51d730b8be Support "CuTe DSL" auto-labeling in workflow
Compare 14 commits »
lzhengning synced commits to refs/pull/1618/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:56 +08:00
194fb5568a Merge e8de90ef2d2b4f3d0547a9f8ea49f32952d98c4c into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
a1aaf2300a v4.1 release
Compare 24 commits »
lzhengning synced commits to refs/pull/1604/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:56 +08:00
ab06e3f8f9 Merge 205e4f3942738142eb7982884c3258066dc9741d into 0e026982ce2ed10b27ec569c6e42035cb9118f62
0e026982ce Example 77 add blackwell fmha bwd for MLA shape (#2466)
9a9a579714 Merge pull request #2489 from NVIDIA/update_workflow_script
51d730b8be Support "CuTe DSL" auto-labeling in workflow
6c0c8b7484 1. Update bug/feature report template to add component selection. (#2485)
Compare 31 commits »
lzhengning synced commits to refs/pull/1593/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:55 +08:00
aa1c32cf7c Merge 77bb92113dfd3ccd777a8e31e1db6e841849b8be into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
a1aaf2300a v4.1 release
Compare 5 commits »
lzhengning synced commits to refs/pull/1584/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:55 +08:00
6abf8fde94 Merge 53556696d1935ada5df21d1f2a53a783feaf2e71 into 664c4f7b3ed1959414905025728eef5568209479
664c4f7b3e Update CUTLASS version to 4.1
0e026982ce Example 77 add blackwell fmha bwd for MLA shape (#2466)
9a9a579714 Merge pull request #2489 from NVIDIA/update_workflow_script
51d730b8be Support "CuTe DSL" auto-labeling in workflow
Compare 32 commits »
lzhengning synced commits to refs/pull/1554/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:55 +08:00
9dfbdd837d Merge 9e1b7ef12fb02c4e18673aae92419fcc1164a65b into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
a1aaf2300a v4.1 release
Compare 25 commits »
lzhengning synced commits to refs/pull/1534/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:55 +08:00
ce87d8f52e Merge cf8290eb6ddbab656b6be7def12ebef1f2e226f9 into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
a1aaf2300a v4.1 release
Compare 25 commits »
lzhengning synced commits to refs/pull/1528/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:55 +08:00
2111c3347d Merge b9f2ada561ff9bfc781ab6f7fb0cdcd0fb43601a into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
a1aaf2300a v4.1 release
Compare 25 commits »
lzhengning synced commits to refs/pull/1470/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:55 +08:00
ab34b18b38 Merge d8b05ed1fa3ea5a1e5445f30ca22c1c875a81317 into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
a1aaf2300a v4.1 release
Compare 25 commits »
lzhengning synced commits to refs/pull/1453/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:54 +08:00
3586824df0 Merge be88f24ad42ec4a8e3b3a52e1f72112f0437ec00 into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
a1aaf2300a v4.1 release
Compare 5 commits »
lzhengning synced commits to refs/pull/1380/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:54 +08:00
96bd2a705c Merge d99c9bcf398c5dd1f704128d56e06a40f5c0d599 into 9baa06dd57804ce8fb5efe9e471b3451341522c6
9baa06dd57 Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
ebe98c549a cache procedural_name in GemmOperation (#2317)
9892624b66 Fix typos in the text (#2417)
Compare 4 commits »
lzhengning synced commits to refs/pull/1218/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:54 +08:00
0bfc86935a Merge a6c3a24dd573175c1902e9e700d33eb65e57d186 into 664c4f7b3ed1959414905025728eef5568209479
664c4f7b3e Update CUTLASS version to 4.1
0e026982ce Example 77 add blackwell fmha bwd for MLA shape (#2466)
9a9a579714 Merge pull request #2489 from NVIDIA/update_workflow_script
51d730b8be Support "CuTe DSL" auto-labeling in workflow
Compare 12 commits »
lzhengning synced commits to refs/pull/1190/merge at lzhengning/cutlass from mirror 2025-07-29 22:33:54 +08:00
9a401432fa Merge ea7888cfbd1541bb7cfe86720f1fc3c2dd1c2810 into 664c4f7b3ed1959414905025728eef5568209479
664c4f7b3e Update CUTLASS version to 4.1
0e026982ce Example 77 add blackwell fmha bwd for MLA shape (#2466)
9a9a579714 Merge pull request #2489 from NVIDIA/update_workflow_script
51d730b8be Support "CuTe DSL" auto-labeling in workflow
Compare 12 commits »