llvm-project

Commit Graph

Author	SHA1	Message	Date
Morten Borup Petersen	6c1436a9b0	[MLIR][SCF] Parenthesize multiple return types in scf.execute_region asm op Previously, ExecuteRegionOps with multiple return values would fail a round-trip test due to missing parentheses around the types. Differential Revision: https://reviews.llvm.org/D108402	2021-08-19 21:31:51 +01:00
Matthias Springer	76a1861816	[mlir][SparseTensor] Split scf.for loop into masked/unmasked parts Apply the "for loop peeling" pattern from SCF dialect transforms. This pattern splits scf.for loops into full and partial iterations. In the full iteration, all masked loads/stores are canonicalized to unmasked loads/stores. Differential Revision: https://reviews.llvm.org/D107733	2021-08-19 21:53:11 +09:00
Matthias Springer	8e8b70aa84	[mlir][scf] Simplify affine.min ops after loop peeling Simplify affine.min ops, enabling various other canonicalizations inside the peeled loop body. affine.min ops such as: ``` #map = affine_map<(d0)[s0, s1] -> (s0, -d0 + s1)> %r = affine.min #map(%iv)[%step, %ub] ``` are rewritten into (in the case of the peeled loop): ``` %r = %step ``` To determine how an affine.min op should be rewritten and to prove its correctness, FlatAffineConstraints is utilized. Differential Revision: https://reviews.llvm.org/D107222	2021-08-19 17:24:53 +09:00
Matthias Springer	08dbed8a57	[mlir][linalg] Canonicalize dim ops of tiled_loop block args E.g.: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %x, %c0 : tensor<...> } ``` is rewritten to: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %y, %c0 : tensor<...> } ``` Differential Revision: https://reviews.llvm.org/D108272	2021-08-19 11:24:33 +09:00
Aart Bik	d37d72eaf8	[mlir][sparse] use shared util for DimOp generation This shares more code with existing utilities. Also, to be consistent, we moved dimension permutation on the DimOp to the tensor lowering phase. This way, both pre-existing DimOps on sparse tensors (not likely but possible) as well as compiler generated DimOps are handled consistently. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108309	2021-08-18 17:12:32 -07:00
Chia-hung Duan	41e5dbe0fa	Enables inferring return types for Shape op if possible Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D102565	2021-08-18 21:36:55 +00:00
Butygin	ddc3d51d58	[mlir][spirv] Add (InBounds)PtrAccessChain ops Differential Revision: https://reviews.llvm.org/D108070	2021-08-18 17:59:21 +03:00
Lei Zhang	4c15ad2321	[mlir][linalg] Don't drop existing attributes when creating ops Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D108219	2021-08-17 15:44:56 -04:00
Robert Suderman	65532ea6dd	[mlir][linalg] Clear unused linalg tc operations These operations are not lowered to from any source dialect and are only used for redundant tests. Removing these named ops, along with their associated tests, will make migration to YAML operations much more convenient. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D107993	2021-08-16 11:55:45 -07:00
tashuang.zk	2d45e332ba	[MLIR][DISC] Revise ParallelLoopTilingPass with inbound_check mode Expand ParallelLoopTilingPass with an inbound_check mode. In default mode, the upper bound of the inner loop is from the min op; in inbound_check mode, the upper bound of the inner loop is the step of the outer loop and an additional inbound check will be emitted inside of the inner loop. This was a 'FIXME' in the original code, and a typical usage is for GPU backends, so that the outer loop and inner loop can be mapped to blocks/threads separately. Differential Revision: https://reviews.llvm.org/D105455	2021-08-16 14:02:53 +02:00
harsh-nod	e33f301ec2	[mlir] Add support for moving reductions to outermost dimensions in vector.multi_reduction The approach for handling reductions in the outermost dimension follows that for innermost dimensions, outlined below. First, transpose to move reduction dims, if needed. Convert the reduction from n-d to 2-d canonical form. Then, for outer reductions, we emit the appropriate op (add/mul/min/max/or/and/xor) and combine the results. Differential Revision: https://reviews.llvm.org/D107675	2021-08-13 12:59:50 -07:00
Tyler Augustine	3a2ff982d7	Support post-processing Ops in unrolled loop iterations This can be useful when one needs to know which unrolled iteration an Op belongs to, for example, conveying noalias information among memory-affecting ops in parallel-access loops. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107789	2021-08-11 23:11:10 +00:00
Rob Suderman	7de439b2be	[mlir][tosa] Migrate tosa to more efficient linalg.conv Existing linalg.conv2d is not well optimized for performance. Changed to a version that is more aligned for optimization. Include the corresponding transposes to use this optimized version. This also splits the conv and depthwise conv into separate implementations to avoid overly complex lowerings. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D107504	2021-08-11 11:05:12 -07:00
Benjamin Kramer	c1ebefdf77	[mlir] Make polynomial approximation emit std instead of LLVM ops This is a bit cleaner and removes issues with 2d vectors. It also has a big impact on constant folding, hence the test changes. Differential Revision: https://reviews.llvm.org/D107896	2021-08-11 16:37:21 +02:00
Alex Zinenko	79b0576dd4	[mlir] Tighten LLVM_AnyNonAggregate ODS type constraint The constraint was checking that the type is not an LLVM structure or array type, but was not checking that it is an LLVM-compatible type, making it accept incorrect types. As a result, some LLVM dialect ops could process values that are not compatible with the LLVM dialect leading to further issues with conversions and translations that assume all values are LLVM-compatible. Make LLVM_AnyNonAggregate only accept LLVM-compatible types. Reviewed By: cota, akuegel Differential Revision: https://reviews.llvm.org/D107889	2021-08-11 16:30:19 +02:00
Alexander Belyaev	1e733a8c04	Revert "Bufferization for tiled loop." This reverts commit `edaffebcb2`.	2021-08-11 10:04:12 +02:00
Alexander Belyaev	967578f0b8	Revert "[mlir] Change the pattern for TiledLoopOp bufferization." This reverts commit `2f946eaa9d`.	2021-08-11 10:01:36 +02:00
Rob Suderman	2b2ebb6f98	[mlir][tosa] Add folders for trivial tosa operation cases Some folding cases are trivial to fold away, specifically no-op cases where an operation's input and output are the same. Canonicalizing these away removes unneeded operations. The current version includes tensor cast operations to resolve shape discrepancies that occur when an operation's result type differs from the input type. These are resolved during a tosa shape propagation pass. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D107321	2021-08-10 14:43:00 -07:00
Alexander Belyaev	2f946eaa9d	[mlir] Change the pattern for TiledLoopOp bufferization. This version does not affect the patterns for Extract/InsertSliceOp and LinalgOps. Differential Revision: https://reviews.llvm.org/D107858	2021-08-10 21:27:02 +02:00
bakhtiyar	391456f33c	Fix a bug in algebraic simplification, and enable the tests. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D107788	2021-08-10 04:15:56 -07:00
Alexander Belyaev	edaffebcb2	Cloned from CL 389610703 by 'g4 patch'. Original change by pifon@pifon:tfrt_clean:6896:citc on 2021/08/09 05:30:17. Ad b Differential Revision: https://reviews.llvm.org/D107762	2021-08-09 21:57:06 +02:00
Aart Bik	05c7f450df	[mlir][sparse] add dense to sparse conversion implementation Implements lowering dense to sparse conversion, for static tensor types only. First step towards general sparse_tensor.convert support. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D107681	2021-08-09 12:12:39 -07:00
Max Kudryavtsev	0b8cb87e0d	[MLIR][STD] Add safe scalar constant propagation for FPTruncOp Perform scalar constant propagation for FPTruncOp only if the resulting value can be represented without precision loss or rounding. Example: %cst = constant 1.000000e+00 : f32 %0 = fptrunc %cst : f32 to bf16 --> %cst = constant 1.000000e+00 : bf16 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107518	2021-08-06 16:31:29 -07:00
Alexander Belyaev	a552debdcf	[mlir] Add patterns for vector.transfer_read/write to Linalg bufferization. Differential Revision: https://reviews.llvm.org/D107643	2021-08-06 20:24:44 +02:00
Geoffrey Martin-Noble	ca6baf1e1d	[MLIR][std] Introduce bitcast operation This patch introduces a bitcast operation to the standard dialect. RFC: https://llvm.discourse.group/t/rfc-introduce-a-bitcast-op/3774 Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D105376	2021-08-06 08:47:51 -07:00
Adrian Kuegel	d6b4993736	[mlir][MemRef] Fix canonicalization of BufferCast(TensorLoad). CastOp::areCastCompatible does not check whether casts are definitely compatible. When going from dynamic to static offset or stride, the canonicalization cannot know whether it is really cast compatible. In that case, it can only canonicalize to an alloc plus copy. Differential Revision: https://reviews.llvm.org/D107545	2021-08-06 08:32:35 +02:00
Jacques Pienaar	9d10be70a8	[mlir] std.call reference function return types in failure Makes it easier to see type mismatch from failure locally. Differential Revision: https://reviews.llvm.org/D107288	2021-08-05 19:51:48 -07:00
Stephen Neuendorffer	432341d8a8	[mlir] Handle cases where transfer_read should turn into a scalar load The existing vector transforms reduce the dimension of transfer_read ops. However, beyond a certain point, the vector op actually has to be reduced to a scalar load, since we can't load a zero-dimension vector. This handles this case. Note that in the longer term, it may be preferable to support zero-dimension vectors. See https://llvm.discourse.group/t/should-we-have-0-d-vectors/3097. Differential Revision: https://reviews.llvm.org/D103432	2021-08-03 22:53:40 -07:00
Rob Suderman	1b00b94ffc	[mlir][tosa] Tosa shape propagation for tosa.cond_if We can propagate the shape from tosa.cond_if operands into the true/false regions then through the connected blocks. Then, using the tosa.yield ops we can determine what all possible return types are. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D105940	2021-08-03 17:54:54 -07:00
Rob Suderman	143edeca6d	[mlir][tosa] Shape inference for a few remaining easy cases: Handles shape inference for identity, cast, and rescale. These were missed during the initial elementwise work. This includes resize shape propagation which includes both attribute and input type based propagation. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D105845	2021-08-03 17:20:32 -07:00
Aart Bik	817303ef34	[mlir][sparse] fix bug in permuting data structure Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D107379	2021-08-03 14:27:43 -07:00
KareemErgawy-TomTom	f984a805f3	[MLIR][Linalg] Extend detensoring control flow model. This patch extends the PureControlFlowDetectionModel to consider detensoring br and cond_br operands. See https://github.com/google/iree/issues/1159#issuecomment-884322687 for a discussion of the need for such an extension. Reviewed By: silvas Differential Revision: https://reviews.llvm.org/D107358	2021-08-03 18:08:13 +02:00
Kiran Chandramohan	59989d68ba	[MLIR][OpenMP] Add support for critical construct This patch adds the critical construct to the OpenMP dialect. The implementation models the definition in 2.17.1 of the OpenMP 5 standard. A name and hint can be specified. The name is a global entity or has external linkage, it is modelled as a FlatSymbolRefAttr. Hint is modelled as an integer enum attribute. Also lowering to LLVM IR using the OpenMP IRBuilder. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D107135	2021-08-03 10:50:21 +01:00
Matthias Springer	3a41ff4883	[mlir][SCF] Peel scf.for loops for even step division Add ForLoopBoundSpecialization pass, which specializes scf.for loops into a "main loop" where `step` divides the iteration space evenly and into an scf.if that handles the last iteration. This transformation is useful for vectorization and loop tiling. E.g., when vectorizing loads/stores, programs will spend most of their time in the main loop, in which only unmasked loads/stores are used. Only in the last iteration (scf.if) are slower masked loads/stores used. Subsequent commits will apply this transformation in the SparseDialect and in Linalg's loop tiling. Differential Revision: https://reviews.llvm.org/D105804	2021-08-03 10:21:38 +09:00
Eugene Zhulenev	b537c5b414	[mlir] Async: clone constants into async.execute functions and parallel compute functions Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107007	2021-08-02 12:17:41 -07:00
Aart Bik	697ea09d47	[mlir][sparse] add sparse tensor type conversion operation Introduces a conversion from one (sparse) tensor type to another (sparse) tensor type. See the operation doc for details. Actual codegen for all cases is still TBD. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D107205	2021-07-31 12:53:31 -07:00
Nicolas Vasilache	14c1450d5c	[mlir][Vector] Add vector to outerproduct lowering for the [reduction, parallel] case. Differential Revision: https://reviews.llvm.org/D105373	2021-07-30 14:32:57 +00:00
Amy Zhuang	a8b7e56f65	[mlir] Set insertion point of vector constant to the top of the vectorized loop body When we vectorize a scalar constant, the vector constant is inserted before its first user if the scalar constant is defined outside the loops to be vectorized. It is possible that the vector constant does not dominate all its users. To fix the problem, we find the innermost vectorized loop that encloses that first user and insert the vector constant at the top of the loop body. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D106609	2021-07-29 15:42:23 -07:00
Rob Suderman	2d0ba5e144	[mlir][tosa] Fix tosa.reshape failures due to implicit broadcasting Make broadcastable needs the output shape to determine whether the operation includes additional broadcasting. Include some canonicalizations for TOSA to remove unneeded reshape. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D106846	2021-07-29 15:21:57 -07:00
Yi Zhang	9a82482313	[mlir][linalg] Fix pad tensor cast folding with changed type `PadTensorOp` has verification logic to make sure result dim must be static if all the padding values are static. Cast folding might add more static information for the src operand of `PadTensorOp` which might change a valid operation to be invalid. Change the canonicalizing pattern to fix this.	2021-07-29 17:47:01 -04:00
bakhtiyar	1c144410e7	Refactor AsyncToAsyncRuntime pass to boost understandability. Depends On D106730 Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D106731	2021-07-29 12:01:07 -07:00
bakhtiyar	9a5bc83660	Add an escape-hatch for conversion of funcs with blocking awaits to coroutines. Currently TFRT does not support top-level coroutines, so this functionality allows a single blocking await at the top level until TFRT implements the necessary functionality. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D106730	2021-07-29 08:52:28 -07:00
River Riddle	f8479d9de5	[mlir] Set the namespace of the BuiltinDialect to 'builtin' Historically the builtin dialect has had an empty namespace. This has unfortunately created a very awkward situation, where many utilities either have to special case the empty namespace, or just don't work at all right now. This revision adds a namespace to the builtin dialect, and starts to clean up some of the utilities to no longer handle empty namespaces. For now, the assembly form of builtin operations does not require the `builtin.` prefix. (This should likely be re-evaluated though.) Differential Revision: https://reviews.llvm.org/D105149	2021-07-28 21:00:10 +00:00
bakhtiyar	6ea22d4626	Optionally eliminate blocking runtime.await calls by converting functions to coroutines. Interop parallelism requires awaiting on results. Blocking awaits are bad for performance. TFRT supports lightweight resumption on threads, and coroutines are an abstraction that can be used to lower the kernels onto TFRT threads. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D106508	2021-07-28 12:37:05 -07:00
Alex Zinenko	c1f719d1a7	[mlir] harden result type verification in llvm.call The verifier of the llvm.call operation was not checking for mismatches between the number of operation results and the number of results in the signature of the callee. Furthermore, it was possible to construct an llvm.call operation producing an SSA value of !llvm.void type, which should not exist. Add the verification and treat !llvm.void result type as absence of call results. Update the GPU conversions to LLVM that were mistakenly assuming that it was fine for llvm.call to produce values of !llvm.void type and ensure these calls do not produce results. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D106937	2021-07-28 18:15:56 +02:00
Lei Zhang	23326b9f17	[mlir][spirv] Fix a few issues in ModuleCombiner - Fixed symbol insertion into `symNameToModuleMap`. Insertion needs to happen whether symbols are renamed or not. - Added check for the VCE triple and avoid dropping it. - Disabled function deduplication. It requires more careful rules. Right now it can remove different functions. - Added tests for symbol rename listener. - And some other code/comment cleanups. Reviewed By: ergawy Differential Revision: https://reviews.llvm.org/D106886	2021-07-28 10:31:01 -04:00
Tobias Gysi	ca0d244e99	[mlir][linalg] Introduce a separate EraseIdentityCopyOp Pattern. Split out an EraseIdentityCopyOp from the existing RemoveIdentityLinalgOps pattern. Introduce an additional check to ensure the pattern checks the permutation maps match. This is a preparation step to specialize RemoveIdentityLinalgOps to GenericOp only. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D105622	2021-07-28 11:18:22 +00:00
Yi Zhang	8ed66cb88b	[mlir][memref] Fix collapsed shape ops memref.cast folding with changed type `memref.collapse_shape` has verification logic to make sure result dim must be static if all the collapsing src dims are static. Cast folding might add more static information for the src operand of `memref.collapse_shape` which might change a valid collapsing operation to be invalid. Add `CollapseShapeOpMemRefCastFolder` pattern to fix this. Minor changes to `convertReassociationIndicesToExprs` to use `context` instead of `builder` to avoid extra steps to construct temporary builders. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D106670	2021-07-28 10:19:20 +00:00
River Riddle	ddd8482117	[PDL] Remove RewriteEndOp and mark RewriteOp as NoTerminator RewriteEndOp was a fake terminator operation that is no longer needed now that blocks are not required to have terminators. Differential Revision: https://reviews.llvm.org/D106911	2021-07-27 20:45:10 +00:00
Eugene Zhulenev	d94426d22a	[mlir] Math: add algebraic simplification patterns to math transforms Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D106822	2021-07-27 09:22:33 -07:00

1607 Commits