Commit Graph

1596 Commits

Author SHA1 Message Date
Exusial c3b1f380eb Merge branch 'main' of https://github.com/CSCG-Lab/JittorHW 2024-12-12 16:31:40 +08:00
Exusial 14af6f0980 Merge branch 'ddd' 2024-12-12 16:27:07 +08:00
Exusial 2b63a07aa0 udpate base. 2024-12-12 16:26:38 +08:00
Exusial c9c02508d4 Debug nan. 2024-12-12 14:49:18 +08:00
lusz a5fdfd1408 cusparse 2024-12-11 21:38:17 +08:00
lusz 2d93b36cbb cusparse 2024-12-11 21:33:33 +08:00
Exusial 722cb8e3fc add sync in broadcast_to when shape is [1] 2024-12-10 19:20:18 +08:00
Exusial da6acc6cc3 Add flags for sync. 2024-12-10 10:17:58 +08:00
Exusial d1b313bf1d add random 2024-12-09 17:18:27 +08:00
Exusial ce533cbeb3 fix setitem 2024-12-09 16:22:37 +08:00
Exusial 99413285cb modify getitem & setitem 2024-12-07 18:52:34 +08:00
邓一轩 1db9bc2993 splite matmul and bmm from acl_op 2024-12-07 11:16:49 +08:00
Exusial 420f94f283 update 2024-12-06 14:23:08 +08:00
Exusial 86331a8d8f add setitem & getitem op 2024-12-06 11:21:55 +08:00
Exusial fb00b8a558 add conv_forward op. 2024-12-04 15:45:48 +08:00
Exusial 15a7fba3da add conv_op. 2024-12-04 15:41:38 +08:00
Exusial 5544147573 Merge branch 'main' of https://github.com/CSCG-Lab/JittorHW 2024-12-04 15:40:05 +08:00
Exusial d71e59b262 fixed bug of cpp 2024-12-04 15:36:25 +08:00
CHEN Xinsheng 135446ca59 improve reduce op output 2024-12-03 11:12:56 +08:00
Exusial 3bea663698 fixed the bug of not recompile 2024-12-02 22:24:01 +08:00
Exusial f7edd32327 fix bug 2024-12-02 17:53:42 +08:00
Exusial e24a37f5ce add base op class 2024-12-01 23:36:28 +08:00
DongYang Li 63d9392e49 update version 1.3.9.13 2024-11-28 22:13:06 +08:00
CHEN Xinsheng acf5d1a05e add `jt.Var.isnan` and `jt.Var.isinf` 2024-11-28 22:04:42 +08:00
514flowey 421d5a4fa4 Merge branch 'Jittor:master' into master 2024-11-28 19:40:55 +08:00
514flowey 9ee61d26f1 fix rnn op bug 2024-11-28 19:40:08 +08:00
Exusial 352bb8d6a7 update reduce. 2024-11-28 16:24:07 +08:00
Exusial ca712e241b update 2024-11-28 15:45:15 +08:00
Exusial a1add64d6c fix compile include aclops.h in aclops 2024-11-28 13:24:47 +08:00
Exusial 0dc84ebed8 update get_dtype 2024-11-27 20:21:34 +08:00
Exusial 8e5ee574f5 merge reduce. 2024-11-27 19:31:48 +08:00
Exusial edf2755cb5 Merge branch 'main' into dev 2024-11-27 11:18:47 +08:00
Exusial 8c33770036 update. 2024-11-27 11:18:19 +08:00
张仪 89f5b98741 split binary and unary op by hy 2024-11-26 23:22:41 +08:00
张仪 e8ae65d797 update 2024-11-26 21:27:06 +08:00
Xinsheng Chen 2bab0bb8dd fix `nn.Dropout` (dtype convert) 2024-11-26 19:07:20 +08:00
CHEN Xinsheng b37fae105b fix `nn.Dropout` (dtype) 2024-11-26 18:59:12 +08:00
Yuxuan Han 2580a98710 add isnan, isinf 2024-11-26 10:01:58 +08:00
Yuxuan Han 66e18b85a8 add stack 2024-11-24 09:46:23 +08:00
张仪 2e137f73b0 big op and recompile bug 2024-11-21 23:59:41 +08:00
Yuxuan Han d6917eda4c fix relu grad, add more unnittest 2024-11-21 15:59:40 +08:00
Yuxuan Han 0ff1deccb7 fix flip, add softmax 2024-11-21 15:12:12 +08:00
CHEN Xinsheng 8159093262 fix `getitem` (list case) 2024-11-18 20:15:39 +08:00
CHEN Xinsheng d3f2dc5606 add: support numpy int as an index for `getitem` 2024-11-18 17:36:22 +08:00
张仪 47f0c8acda update unit test 2024-11-18 11:54:32 +08:00
CHEN Xinsheng 1d0602ae32 fix `stack` 2024-11-14 19:30:22 +08:00
CHEN Xinsheng 19b7bbbe57 fix `stack` 2024-11-14 18:01:25 +08:00
CHEN Xinsheng a5b16925e8 add unit test for `any` 2024-11-14 17:35:25 +08:00
CHEN Xinsheng 9b6fd17e20 add `jt.Var.cumsum` and `jt.Var.cub_cumsum` 2024-11-14 17:34:48 +08:00
DongYang Li c10acf34bc Update version 1.3.9.12 2024-11-14 16:17:33 +08:00
CHEN Xinsheng 3d06d25077 add ACL op `any` 2024-11-12 10:32:07 +08:00
CHEN Xinsheng a0dfdc5ff0 add `getitem` (`None` case) 2024-11-11 21:33:55 +08:00
邓一轩 e0537e5c1a concat 2024-11-11 19:58:31 +08:00
邓一轩 58fc5a9b35 fix conv 2024-11-11 19:43:56 +08:00
邓一轩 495a26a458 add sync at end of all op 2024-11-11 17:09:22 +08:00
邓一轩 b9986ac53b use switch 2024-11-11 15:52:35 +08:00
CHEN Xinsheng c747053a54 fix `concat` 2024-11-09 20:13:47 +08:00
514flowey 382bd3f0e5 update optimizer 2024-11-05 14:14:10 +08:00
CHEN Xinsheng 4e4e67dfd5 fix `nonzero` 2024-11-05 12:46:00 +08:00
dengyx21 810af5953b Revert "sync only on broadcast_to from [1]" (This reverts commit 1439a03fca.) 2024-11-05 11:29:10 +08:00
CHEN Xinsheng 33bd28fdb3 add ACL op `where` (unary case) 2024-11-04 22:41:16 +08:00
CHEN Xinsheng 19d2e2e912 add ACL op `nonzero`, a temporary implementation, a bit slow 2024-11-04 22:38:29 +08:00
dengyx21 1902dab9c5 sync only on broadcast_to from [1] 2024-11-04 19:42:09 +08:00
CHEN Xinsheng f79e2908ed Merge pull request #3 from CSCG-Lab/jtorch: fix some bugs for jtorch 2024-10-30 12:34:08 +08:00
dengyx21 158ec0756c shut off a stream 2024-10-29 20:14:41 +08:00
CHEN Xinsheng b279960344 Merge branch 'main' 2024-10-29 15:42:39 +08:00
DongYang Li 56bc5f65be fix jt.index error 2024-10-24 02:32:02 +08:00
CHEN Xinsheng f34e1beafa fix warp (class case) 2024-10-21 20:39:15 +08:00
CHEN Xinsheng 1776dd4da9 fix warp (class case) 2024-10-21 17:08:01 +08:00
CHEN Xinsheng 8a31c402de fix `jt.index` 2024-10-18 17:18:06 +08:00
CHEN Xinsheng 811dc241d4 fix `jt.Var.triu_` 2024-10-18 11:13:32 +08:00
CHEN Xinsheng 17048da065 fix finfo & iinfo 2024-10-17 22:00:22 +08:00
CHEN Xinsheng cf4ce2c95e fix finfo bug in jittor 2024-10-17 21:36:00 +08:00
CHEN Xinsheng 89010f5475 fix `Var.triu` & `Var.triu_` 2024-10-17 21:24:19 +08:00
CHEN Xinsheng 6edc1f74a3 add cub_cumsum & cumprod 2024-10-17 21:23:32 +08:00
CHEN Xinsheng c886f01b53 fix warp (class case) 2024-10-17 21:22:26 +08:00
lidongyang fc1fff8c0e update version to 1.3.9.11 2024-10-08 23:03:24 +08:00
lidongyang 72be1396d9 fix: jupyter restart error 2024-10-08 23:02:27 +08:00
zjp_shadow 8966ca4320 fix transpose 2024-10-06 21:20:12 +08:00
uyzhang a078268e18 polish 2024-10-01 19:34:51 +08:00
uyzhang 33898421e4 Merge branch 'main' of https://github.com/CSCG-Lab/JittorHW into main 2024-10-01 18:16:47 +08:00
uyzhang 4c6d726a4c Refactor transpose_acl function and fix bug in matmul_acl 2024-10-01 18:14:08 +08:00
张仪 cb75c8dedd format 2024-09-29 13:47:22 +08:00
uyzhang c268a0bfaf Refactor aclnn.h and acl_op.h to add support for FlashAttention and FlashAttentionBackward 2024-09-29 12:29:41 +08:00
uyzhang 146574d7d1 Refactor transpose_acl function and fix bug in matmul_acl 2024-09-27 19:47:13 +08:00
uyzhang 4329f3b287 Refactor transpose_acl function and fix bug in matmul_acl 2024-09-27 19:44:04 +08:00
zjp_shadow b48d8664a1 add transpose 2024-09-27 19:37:19 +08:00
uyzhang c7c7326456 fixed the bug in matmul 2024-09-27 16:54:43 +08:00
zjp_shadow d648713ec5 Update concat 2024-09-25 00:37:36 +08:00
uyzhang 934885c96e Merge branch 'main' of https://github.com/CSCG-Lab/JittorHW into main 2024-09-23 23:12:49 +08:00
uyzhang dc29fa69dc FEAT! opt transpose in matmul and bmm 2024-09-23 23:12:44 +08:00
uyzhang c3df41e77b Refactor acl_compiler.py to handle gradient accumulation in bmm_acl and matmul_acl functions 2024-09-23 23:11:04 +08:00
uyzhang d092b83d0b Merge branch 'main' of https://github.com/CSCG-Lab/JittorHW into main 2024-09-23 22:43:47 +08:00
uyzhang 74aa4e68c2 Refactor acl_compiler.py to handle gradient accumulation in bmm_acl and matmul_acl functions 2024-09-23 22:43:44 +08:00
uyzhang 7fa22e2e32 add Ellipsis 2024-09-23 22:27:44 +08:00
uyzhang 9578e30972 Refactor acl_compiler.py to handle gradient accumulation in bmm_acl and matmul_acl functions 2024-09-23 20:40:53 +08:00
uyzhang 37671ccec1 Refactor acl_compiler.py to handle gradient accumulation in bmm_acl and matmul_acl functions 2024-09-23 20:26:46 +08:00
uyzhang 657687e0c0 Refactor acl_compiler.py to handle gradient accumulation in bmm_acl and matmul_acl functions 2024-09-23 16:09:42 +08:00
uyzhang 2a142ae73d fix bug of setitem cpu when use acl 2024-09-23 15:34:45 +08:00
uyzhang 9907aad7de fix getitem&setitem slice bug 2024-09-23 13:58:37 +08:00