Commit Graph

15 Commits

Author SHA1 Message Date
zjp_shadow e4be9b1f78 Update HCCL to support multi npus 2025-02-10 16:18:34 +08:00
Yuxuan Han 2580a98710 add isnan, isinf 2024-11-26 10:01:58 +08:00
Yuxuan Han 66e18b85a8 add stack 2024-11-24 09:46:23 +08:00
张仪 2e137f73b0 big op and recompile bug 2024-11-21 23:59:41 +08:00
Yuxuan Han 0ff1deccb7 fix flip, add softmax 2024-11-21 15:12:12 +08:00
CHEN Xinsheng 19d2e2e912 add ACL op `nonzero`, a temporary implementation, a bit slow 2024-11-04 22:38:29 +08:00
uyzhang c268a0bfaf Refactor aclnn.h and acl_op.h to add support for FlashAttention and FlashAttentionBackward 2024-09-29 12:29:41 +08:00
zjp_shadow d648713ec5 Update concat 2024-09-25 00:37:36 +08:00
lidongyang 464009af42 add getitem&setitem mask 2024-09-21 22:57:54 +08:00
lidongyang 651b24e634 add sigmoid embedding silu 2024-09-13 03:19:25 +08:00
张仪 c55d49a8de add new aclop 2024-09-12 20:14:22 +08:00
张仪 3beeec78b1 add new aclop & fixed some bugs 2024-09-12 17:11:23 +08:00
张仪 eb89ae19ed add new aclop 2024-09-07 22:11:39 +08:00
张仪 21580ce80e update aclnn 2024-09-07 18:18:00 +08:00
张仪 b4244090ae first commit 2024-08-21 22:15:12 +08:00