llvm-project

Commit Graph

Author	SHA1	Message	Date
Nuno Lopes	e5e844b37e	[NFC] Pre-commit test for InstSimplify phi(poison)	2021-12-30 12:37:20 +00:00
Sanjay Patel	0edf99950e	[Analysis] allow caller to choose signed/unsigned when computing constant range We should not lose analysis precision if an 'add' has both no-wrap flags (nsw and nuw) compared to just one or the other. This patch is modeled on a similar construct that was added with D59386. I don't think it is possible to expose a problem with an unsigned compare because of the way this was coded (nuw is handled first). InstCombine has an assert that fires with the example from: https://github.com/llvm/llvm-project/issues/52884 ...because it was expecting InstSimplify to handle this kind of pattern with an smax. Fixes #52884 Differential Revision: https://reviews.llvm.org/D116322	2021-12-28 09:45:37 -05:00
Sanjay Patel	aaeae842ef	[InstSimplify] add tests for icmp with no-wrap add operand; NFC	2021-12-28 09:45:37 -05:00
Nikita Popov	3bfe0962ba	[ConstFold] Add another icmp of gep of global test (NFC) This time with some complex arithmetic involving bitcasts.	2021-12-28 14:28:28 +01:00
Nikita Popov	23de66d163	[ConstFold] Don't fold signed comparison of gep of global An inbounds GEP may still cross the sign boundary, so signed icmps cannot be folded (https://alive2.llvm.org/ce/z/XSgi4D). This was previously fixed for other folds in this function, but this one was missed.	2021-12-28 14:13:33 +01:00
Nikita Popov	1bd11d34fe	[ConstFold] Add additional icmp of gep of global tests (NFC) The fold is incorrect for the sgt case, as gep inbounds is allowed to cross the sign boundary.	2021-12-28 14:07:15 +01:00
Mehrnoosh Heidarpour	0ff20f2f44	[InstSimplify] Fold logic AND to zero Adding following fold opportunity: ((A \| B) ^ A) & ((A \| B) ^ B) --> 0 Reviewed By: spatel, rampitec Differential Revision: https://reviews.llvm.org/D115755	2021-12-23 10:06:26 -05:00
Serge Pavlov	185c80b89a	[ConstantFolding] Tests for constrained compare intrinsics This are tests extracted from https://reviews.llvm.org/D110322, committed prior to that patch to show the change in behavior.	2021-12-22 18:08:30 +07:00
Serge Pavlov	77b923d0db	[ConstantFolding] Do not remove side effect from constrained functions According to the discussion in https://reviews.llvm.org/D110322 the code that removes side effect from replaced function call is deleted. Differential Revision: https://reviews.llvm.org/D115870	2021-12-22 13:45:49 +07:00
Nikita Popov	2926d6d335	[ConstantFold][GlobalOpt] Don't create x86_mmx null value This fixes the assertion failure reported at https://reviews.llvm.org/D114889#3198921 with a straightforward check, until the cleaner fix in D115924 can be reapplied.	2021-12-21 09:11:41 +01:00
Nikita Popov	aeb36ae0f4	Revert "[ConstantFolding] Unify handling of load from uniform value" This reverts commit `9fd4f80e33`. This breaks SingleSource/Regression/C/gcc-c-torture/execute/pr19687.c in test-suite. Either the test is incorrect, or clang is generating incorrect union initialization code. I've submitted https://reviews.llvm.org/D115994 to fix the test, assuming my interpretation is correct. Reverting this in the meantime as it may take some time to resolve.	2021-12-18 20:46:52 +01:00
Nikita Popov	9fd4f80e33	[ConstantFolding] Unify handling of load from uniform value There are a number of places that specially handle loads from a uniform value where all the bits are the same (zero, one, undef, poison), because we a) don't care about the load offset in that case and b) it bypasses casts that might not be legal generally but do work with uniform values. We had multiple implementations of this, with a different set of supported values each time, as well as incomplete type checks in some cases. In particular, this fixes the assertion reported in https://reviews.llvm.org/D114889#3198921, as well as a similar assertion that could be triggered via constant folding. Differential Revision: https://reviews.llvm.org/D115924	2021-12-17 17:05:06 +01:00
Mehrnoosh Heidarpour	b5c49b62a2	[InstSimplify] Add tests for logic AND; NFC	2021-12-14 15:50:07 -05:00
Nikita Popov	65bec04295	[ConstantFold] Handle same type in ConstantFoldLoadThroughBitcast Usually the case where the types are the same ends up being handled fine because it's legal to do a trivial bitcast to the same type. However, this is not true for aggregate types. Short-circuit the whole code if the types match exactly to account for this.	2021-12-10 16:39:50 +01:00
Nikita Popov	9c244a33e7	[InstSimplify] Add test for load of aggregate (NFC) The test is switched to use -instsimplify as it is in the InstSimplify directory. In this particular case InstCombine does fold the load (in a very roundabout way), but InstSimplify does not.	2021-12-10 16:18:18 +01:00
Hasyimi Bahrudin	c1cd698a52	[InstSimplify] Simplify bool icmp with not in LHS Refer to https://llvm.org/PR52546. Simplifies the following cases: not(X) == 0 -> X != 0 -> X not(X) <=u 0 -> X >u 0 -> X not(X) >=s 0 -> X <s 0 -> X not(X) != 1 -> X == 1 -> X not(X) <=u 1 -> X >=u 1 -> X not(X) >s 1 -> X <=s -1 -> X Differential Revision: https://reviews.llvm.org/D114666	2021-12-09 16:26:46 -05:00
Hasyimi Bahrudin	64d4bd02dc	[InstCombine][InstSimplify] Add baseline tests for icmp bool with not on LHS; NFC See D114666 for proposed code change to instsimplify. The difference between the CHECK result of these 2 tests highlights missed folds in instsimplify (e.g. (icmp eq (xor X, true), false) -> X) that are already being handled by instcombine. The tests are based on: llvm/test/Transforms/InstSimplify/icmp-bool-constant.ll Differential Revision: https://reviews.llvm.org/D115209	2021-12-08 08:48:03 -05:00
Sanjay Patel	8a69b04478	[InstSimplify] add logic fold for 'or' with 'xor'+'and' This replaces the 'or' from `4b30076f16` with an 'and'. We have to guard against propagating undef elements from vector 'not' values: https://alive2.llvm.org/ce/z/irMwRc	2021-12-07 11:08:26 -05:00
Sanjay Patel	c65e651e60	[InstSimplify] fix logic fold of 'or' for vectors Reduce code duplication for commutative pattern matching and fix a miscompile. We can't safely propagate an undef element in this transform: https://alive2.llvm.org/ce/z/s5xy55	2021-12-05 09:57:07 -05:00
Sanjay Patel	0bb8a97b41	[InstSimplify] add/adjust tests for 'or' logic fold; NFC The last test shows a miscompile: https://alive2.llvm.org/ce/z/s5xy55	2021-12-05 09:28:11 -05:00
Mehrnoosh Heidarpour	e94134052f	[InstSimplify] Add logic 'or' fold to -1 Adding the following folding opportunity: (~A \| B) \| (A ^ B) --> -1 https://alive2.llvm.org/ce/z/PMtdYB Differential revision: https://reviews.llvm.org/D114996	2021-12-04 15:04:18 -05:00
David Green	ab0c5cea0b	[ARM] Use v2i1 for MVE and CDE intrinsics This adjusts all the MVE and CDE intrinsics now that v2i1 is a legal type, to use a <2 x i1> as opposed to emulating the predicate with a <4 x i1>. The v4i1 workarounds have been removed leaving the natural v2i1 types, notably in vctp64 which now generates a v2i1 type. AutoUpgrade code has been added to upgrade old IR, which needs to convert the old v4i1 to a v2i1 be converting it back and forth to an integer with arm.mve.v2i and arm.mve.i2v intrinsics. These should be optimized away in the final assembly. Differential Revision: https://reviews.llvm.org/D114455	2021-12-03 15:27:58 +00:00
Mehrnoosh Heidarpour	54dc03b97b	[InstSimplify] Add test case for logic 'or' fold; NFC	2021-12-03 09:29:43 -05:00
Sanjay Patel	4b30076f16	[InstSimplify] add logic fold for 'or' https://alive2.llvm.org/ce/z/4PaPDy There's a related fold where the inner 'or' is replaced by 'and', but that needs to be more careful about matching a 'not'.	2021-11-30 14:08:54 -05:00
Sanjay Patel	6076c1dc1c	[InstSimplify] make 'or' test names more descriptive; NFC Also, vary the types in a couple of tests for better coverage.	2021-11-30 14:08:54 -05:00
Sanjay Patel	33f8c1168f	[InstSimplify] adjust tests for 'or' of logic ops; NFC Half of the tests had an extra instruction so were not testing the minimal patterns.	2021-11-30 12:55:37 -05:00
Sanjay Patel	1fdb0f6ffd	[InstSimplify] add tests for 'or' with logic ops; NFC The code for these transforms can be refactored, but the existing tests are incomplete.	2021-11-30 12:55:36 -05:00
Sanjay Patel	746e632daf	[InstSimplify] add tests for 'or' logic folds; NFC The tests are adapted from the xor patterns used with: `892648b18a` `b326c05814`	2021-11-30 12:55:36 -05:00
Erik Desjardins	53b00b8215	[InstSimplify] Fold X {lshr,udiv} C <u X --> true for nonzero X, non-identity C This eliminates the bounds check in Rust code like pub fn mid(data: &[i32]) -> i32 { if data.is_empty() { return 0; } return data[data.len()/2]; } (from https://blog.sigplan.org/2021/11/18/undefined-behavior-deserves-a-better-reputation/) Alive proofs: lshr https://alive2.llvm.org/ce/z/nyTu8D udiv https://alive2.llvm.org/ce/z/CNUZH7 Differential Revision: https://reviews.llvm.org/D114279	2021-11-26 16:48:33 -05:00
Erik Desjardins	a68af62b42	[InstSimplify] baseline tests for icmp of lshr/udiv fold (NFC) Precommits tests for https://reviews.llvm.org/D114279 Differential Revision: https://reviews.llvm.org/D114280	2021-11-26 15:57:04 -05:00
Zarko Todorovski	7f7dac7126	[NFC][llvm] Inclusive language: reword uses of sanity test and check Part of continuing work to use more inclusive language. Reworded uses of sanity check and sanity test in llvm/test/	2021-11-25 07:21:42 -05:00
Sanjay Patel	b326c05814	[InstSimplify] fold xor logic of 2 variables, part 2 (~a & b) ^ (a \| b) --> a This is the swapped and/or (Demorgan?) sibling fold for the fold added with D114462 ( `892648b18a` ). This case is easier to specify because we are returning a root value, not a 'not': https://alive2.llvm.org/ce/z/SRzj4f	2021-11-24 08:15:47 -05:00
Sanjay Patel	823fc8aa06	[InstSimplify] add tests for xor logic; NFC	2021-11-24 08:15:47 -05:00
Sanjay Patel	892648b18a	[InstSimplify] fold xor logic of 2 variables (a & b) ^ (~a \| b) --> ~a I was looking for a shortcut to reduce some of the complex logic folds that are currently up for review (D113216 and others in that stack), and I found this missing from instcombine/instsimplify. There is a trade-off in putting it into instsimplify: because we can't create new values here, we need a strict 'not' op (no undef elements). Otherwise, the fold is not valid: https://alive2.llvm.org/ce/z/k_AGGj If this was in instcombine instead, we could create the proper 'not'. But having the fold here benefits other passes like GVN that use instsimplify as an analysis. There is a related fold where 'and' and 'or' are swapped, and that is planned as a follow-up commit. Differential Revision: https://reviews.llvm.org/D114462	2021-11-23 16:50:23 -05:00
Sanjay Patel	14d743457c	[InstSimplify] add tests for xor logic fold; NFC	2021-11-23 15:35:52 -05:00
Mehrnoosh Heidarpour	62c51a72f9	[InstSimplify] Fold A\|B \| (A^B) --> A\|B This patch adds the following fold opportunity: A\|B \| (A^B) --> A\|B that is reported here : https://bugs.llvm.org/show_bug.cgi?id=52479 https://alive2.llvm.org/ce/z/33-My- Test cases with base results are added in D113860 Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D113861	2021-11-15 18:55:04 -05:00
Stanislav Mekhanoshin	833cdb0a07	Revert "[InstSimplify] Fold A\|B \| (A^B) --> A\|B" This reverts commit `193c40e966`.	2021-11-15 14:56:20 -08:00
Stanislav Mekhanoshin	193c40e966	[InstSimplify] Fold A\|B \| (A^B) --> A\|B This patch adds the following fold opportunity: A\|B \| (A^B) --> A\|B that is reported here : https://bugs.llvm.org/show_bug.cgi?id=52479 https://alive2.llvm.org/ce/z/33-My- Test cases with base results are added in D113860 (authored by MehrHeidar, committed by rampitec). Differential Revision: https://reviews.llvm.org/D113861	2021-11-15 13:49:20 -08:00
Mehrnoosh Heidarpour	a7f7cf115b	[NFC][InstSimplify] add test cases with base results for or-xor fold This patch adds tests with baseline results as a pre-commit for D113861 Differential Revision: https://reviews.llvm.org/D113860	2021-11-15 10:08:31 -05:00
Nikita Popov	e3cec17b2d	[InstSimplify] Remove incorrect icmp of gep fold (PR52429) As described in https://bugs.llvm.org/show_bug.cgi?id=52429 this fold is incorrect, because inbounds only guarantees that the pointers don't wrap in the unsigned space: It is possible that the sign boundary is crossed by an object. I'm dropping the fold entirely rather than adjusting it, because computePointerICmp() fully subsumes it (just with correct predicate handling). Differential Revision: https://reviews.llvm.org/D113343	2021-11-06 21:03:21 +01:00
David Green	2c4a9e830c	[ValueTracking] Teach computeConstantRange that the maximum value of a half is 65504 The maximal value of a half is 0x7bff, which is 65504 when converted to an integer. This patch teaches that to computeConstantRange to compute a constant range with the correct maximum value. https://alive2.llvm.org/ce/z/BV_Spb https://alive2.llvm.org/ce/z/Nwuqvb The maximum value for a float converted in the same way is 3.4e38, which requires 129bits of data. I have not added that here as integer types so larger are rare, compared to integers types larger than 17 bits require for half floats. The MVE tests change because instsimplify happens to be run as a part of the backend, where it doesn't tend to for other backends. Differential Revision: https://reviews.llvm.org/D112694	2021-10-30 14:27:38 +01:00
David Green	0a2708d2ae	[InstSimplify] Add tests for the range of a half float. NFC	2021-10-28 12:58:13 +01:00
Kevin P. Neal	727a891ec8	[FPEnv][InstSimplify] Fold fadd X, 0 ==> X, when we know X is not -0 Currently the fadd optimizations in InstSimplify don't know how to do this NoSignedZeros "X + 0.0 ==> X" fold when using the constrained intrinsics. This adds the support. This review is derived from D106362 with some improvements from D107285 and is a follow-on to D111085. Differential Revision: https://reviews.llvm.org/D111450	2021-10-14 12:32:45 -04:00
Kevin P. Neal	bdf6ba2d30	[FPEnv][InstSimplify] Precommit tests: Enable more folds for constrained fsub Precommit tests for D107285 as requested. TODO notes left at individual functions also as requested.	2021-10-12 13:47:28 -04:00
Sanjay Patel	fdbf2bb4ee	[InstSimplify] (x \|\| y) && (x \|\| !y) --> x https://alive2.llvm.org/ce/z/4BE33w This is the logical (select-form) equivalent of the bitwise logic fold: `e36d351d19` This is another part of solving the regression from: https://llvm.org/PR52077	2021-10-07 12:25:25 -04:00
Sanjay Patel	5ae6df1fea	[InstSimplify] add tests for (x \|\| y) && (x \|\| !y); NFC	2021-10-07 10:37:34 -04:00
Kevin P. Neal	f86c930cc9	[FPEnv][InstSimplify] Fold constrained X + -0.0 ==> X Currently the fadd optimizations in InstSimplify don't know how to do this "X + -0.0 ==> X" fold when using the constrained intrinsics. This adds the support. This commit is derived from D106362 with some improvements from D107285. Differential Revision: https://reviews.llvm.org/D111085	2021-10-06 13:52:31 -04:00
Sanjay Patel	e36d351d19	[InstSimplify] (x \| y) & (x \| !y) --> x https://alive2.llvm.org/ce/z/QagQMn This fold is handled by instcombine via SimplifyUsingDistributiveLaws(), but we are missing the sibliing fold for 'logical and' (implemented with 'select'). Retrofitting the code in instcombine looks much harder than just adding a small adjustment here, and this is potentially more efficient and beneficial to other passes.	2021-10-06 12:31:25 -04:00
Sanjay Patel	4666324f2b	[InstSimplify] add tests for bitwise logic fold of 'and'; NFC	2021-10-06 12:31:25 -04:00
Nikita Popov	c117d77e93	[ConstantFold] Refactor load folding This refactors load folding to happen in two cleanly separated steps: ConstantFoldLoadFromConstPtr() takes a pointer to load from and decomposes it into a constant initializer base and an offset. Then ConstantFoldLoadFromConst() loads from that initializer at the given offset. This makes the core logic independent of having actual GEP expressions (and those GEP expressions having certain structure) and will allow exposing ConstantFoldLoadFromConst() as an independent API in the future. This is mostly only a refactoring, but it does make the folding logic slightly more powerful. Differential Revision: https://reviews.llvm.org/D111023	2021-10-05 18:07:57 +02:00
Kevin P. Neal	770c57898e	[FPEnv][InstSimplify] Prepush more tests for D106362. In working on D106362 I found that a few more tests were needed. I've been asked to pre-push the tests for that ticket. This should complete the tests needed for now.	2021-10-04 13:48:34 -04:00
Nikita Popov	3be4acbaa3	[InstSimplify] Add additional load from constant test (NFC) This case does not get folded, because the GEP indexes too deeply (to the i8), making the bitcast logic not apply (on the [8 x i8]).	2021-10-03 15:52:36 +02:00
Sanjay Patel	4414e2ad97	[InstSimplify] (-1 << x) s>> x --> -1 This was noticed in: https://llvm.org/PR51351 https://alive2.llvm.org/ce/z/aLxunD	2021-09-29 13:03:12 -04:00
Sanjay Patel	d3e2067c7c	[InstSimplify] add tests for (-1 << x) s>> x; NFC	2021-09-29 11:43:18 -04:00
Alex Richardson	05663dc146	[InstSimplify] Don't lose inbounds when simplifying a GEP I noticed this while working on a (ptrtoint (gep null, x)) -> x fold. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D110168	2021-09-23 09:25:06 +01:00
David Sherwood	f988f68064	[Analysis] Add support for vscale in computeKnownBitsFromOperator In ValueTracking.cpp we use a function called computeKnownBitsFromOperator to determine the known bits of a value. For the vscale intrinsic if the function contains the vscale_range attribute we can use the maximum and minimum values of vscale to determine some known zero and one bits. This should help to improve code quality by allowing certain optimisations to take place. Tests added here: Transforms/InstCombine/icmp-vscale.ll Differential Revision: https://reviews.llvm.org/D109883	2021-09-20 15:01:59 +01:00
Nikita Popov	af382b9383	[IR] Handle constant expressions in containsUndefinedElement() If the constant is a constant expression, then getAggregateElement() will return null. Guard against this before calling HasFn().	2021-09-09 22:04:12 +02:00
Roman Lebedev	3f1f08f0ed	Revert @llvm.isnan intrinsic patchset. Please refer to https://lists.llvm.org/pipermail/llvm-dev/2021-September/152440.html (and that whole thread.) TLDR: the original patch had no prior RFC, yet it had some changes that really need a proper RFC discussion. It won't be productive to discuss such an RFC, once it's actually posted, while said patch is already committed, because that introduces bias towards already-committed stuff, and the tree is potentially in broken state meanwhile. While the end result of discussion may lead back to the current design, it may also not lead to the current design. Therefore i take it upon myself to revert the tree back to last known good state. This reverts commit `4c4093e6e3`. This reverts commit `0a2b1ba33a`. This reverts commit `d9873711cb`. This reverts commit `791006fb8c`. This reverts commit `c22b64ef66`. This reverts commit `72ebcd3198`. This reverts commit `5fa6039a5f`. This reverts commit `9efda541bf`. This reverts commit `94d3ff09cf`.	2021-09-02 13:53:56 +03:00
Sanjay Patel	204038d52e	[InstSimplify] fold or+shifted -1 to -1 These are similar to the rotate pattern added with: `dcf659e821` ...but we don't have guard ops on the shift amount, so we don't canonicalize to the intrinsic. declare void @llvm.assume(i1) define i32 @src(i32 %shamt, i32 %bitwidth) { ; subtract must be in range of bitwidth %lt = icmp ule i32 %bitwidth, 32 call void @llvm.assume(i1 %lt) %r = lshr i32 -1, %shamt %s = sub i32 %bitwidth, %shamt %l = shl i32 -1, %s %o = or i32 %r, %l ret i32 %o } define i32 @tgt(i32 %shamt, i32 %bitwidth) { ret i32 -1 } https://alive2.llvm.org/ce/z/aF7WHx	2021-08-24 15:38:38 -04:00
Sanjay Patel	ac7694596d	[InstSimplify] add tests for or-of-shifted-one-bits; NFC	2021-08-24 15:38:38 -04:00
Sanjay Patel	dcf659e821	[InstSimplify] fold rotate of -1 to -1 This is part of solving more general rotate patterns seen in bugs related to: https://llvm.org/PR51575 https://alive2.llvm.org/ce/z/GpkFCt	2021-08-22 09:15:48 -04:00
Sanjay Patel	d41e308f10	[InstSimplify] fold rotate of zero to zero This is part of solving more general rotate patterns seen in bugs related to: https://llvm.org/PR51575 https://alive2.llvm.org/ce/z/fjKwqv	2021-08-22 09:15:48 -04:00
Sanjay Patel	a0ebac4466	[InstSimplify] add tests for rotates of 0/-1; NFC	2021-08-22 09:15:48 -04:00
Usman Nadeem	a7c4e9b1f7	[InstSimplify] Eliminate vector reverse of a splat vector experimental.vector.reverse(splat(X)) -> splat(X) Differential Revision: https://reviews.llvm.org/D107793 Change-Id: Id29ba88fd669ff8686712e96b1bdc46dda5b853c	2021-08-11 11:27:58 -07:00
Sanjay Patel	e260e10c4a	[InstSimplify] fold min/max with limit constant This is already done within InstCombine: https://alive2.llvm.org/ce/z/MiGE22 ...but leaving it out of analysis makes it harder to avoid infinite loops there.	2021-08-10 10:57:25 -04:00
Sanjay Patel	188832f419	Revert "[InstSimplify] fold min/max with limit constant; NFC" This reverts commit `f43859b437`. This is not NFC, so I'll try again without that mistake in the commit message.	2021-08-10 10:50:09 -04:00
Sanjay Patel	f43859b437	[InstSimplify] fold min/max with limit constant; NFC This is already done within InstCombine: https://alive2.llvm.org/ce/z/MiGE22 ...but leaving it out of analysis makes it harder to avoid infinite loops there.	2021-08-10 10:43:07 -04:00
Sanjay Patel	9b942a545c	[InstSimplify] add tests for min/max idioms; NFC	2021-08-10 10:43:07 -04:00
David Sherwood	8439415333	[IR] Let ConstantVector::getSplat use poison instead of undef This patch updates ConstantVector::getSplat to use poison instead of undef when using insertelement/shufflevector to splat. This follows on from D93793. Differential Revision: https://reviews.llvm.org/D107751	2021-08-10 08:27:43 +01:00
Serge Pavlov	4c4093e6e3	Introduce intrinsic llvm.isnan This is recommit of the patch `16ff91ebcc`, reverted in `0c28a7c990` because it had an error in call of getFastMathFlags (base type should be FPMathOperator but not Instruction). The original commit message is duplicated below: Clang has builtin function '__builtin_isnan', which implements C library function 'isnan'. This function now is implemented entirely in clang codegen, which expands the function into set of IR operations. There are three mechanisms by which the expansion can be made. * The most common mechanism is using an unordered comparison made by instruction 'fcmp uno'. This simple solution is target-independent and works well in most cases. It however is not suitable if floating point exceptions are tracked. Corresponding IEEE 754 operation and C function must never raise FP exception, even if the argument is a signaling NaN. Compare instructions usually does not have such property, they raise 'invalid' exception in such case. So this mechanism is unsuitable when exception behavior is strict. In particular it could result in unexpected trapping if argument is SNaN. * Another solution was implemented in https://reviews.llvm.org/D95948. It is used in the cases when raising FP exceptions by 'isnan' is not allowed. This solution implements 'isnan' using integer operations. It solves the problem of exceptions, but offers one solution for all targets, however some can do the check in more efficient way. * Solution implemented by https://reviews.llvm.org/D96568 introduced a hook 'clang::TargetCodeGenInfo::testFPKind', which injects target specific code into IR. Now only SystemZ implements this hook and it generates a call to target specific intrinsic function. Although these mechanisms allow to implement 'isnan' with enough efficiency, expanding 'isnan' in clang has drawbacks: * The operation 'isnan' is hidden behind generic integer operations or target-specific intrinsics. It complicates analysis and can prevent some optimizations. * IR can be created by tools other than clang, in this case treatment of 'isnan' has to be duplicated in that tool. Another issue with the current implementation of 'isnan' comes from the use of options '-ffast-math' or '-fno-honor-nans'. If such option is specified, 'fcmp uno' may be optimized to 'false'. It is valid optimization in general, but it results in 'isnan' always returning 'false'. For example, in some libc++ implementations the following code returns 'false': std::isnan(std::numeric_limits<float>::quiet_NaN()) The options '-ffast-math' and '-fno-honor-nans' imply that FP operation operands are never NaNs. This assumption however should not be applied to the functions that check FP number properties, including 'isnan'. If such function returns expected result instead of actually making checks, it becomes useless in many cases. The option '-ffast-math' is often used for performance critical code, as it can speed up execution by the expense of manual treatment of corner cases. If 'isnan' returns assumed result, a user cannot use it in the manual treatment of NaNs and has to invent replacements, like making the check using integer operations. There is a discussion in https://reviews.llvm.org/D18513#387418, which also expresses the opinion, that limitations imposed by '-ffast-math' should be applied only to 'math' functions but not to 'tests'. To overcome these drawbacks, this change introduces a new IR intrinsic function 'llvm.isnan', which realizes the check as specified by IEEE-754 and C standards in target-agnostic way. During IR transformations it does not undergo undesirable optimizations. It reaches instruction selection, where is lowered in target-dependent way. The lowering can vary depending on options like '-ffast-math' or '-ffp-model' so the resulting code satisfies requested semantics. Differential Revision: https://reviews.llvm.org/D104854	2021-08-06 14:32:27 +07:00
Serge Pavlov	0c28a7c990	Revert "Introduce intrinsic llvm.isnan" This reverts commit `16ff91ebcc`. Several errors were reported mainly test-suite execution time. Reverted for investigation.	2021-08-04 17:18:15 +07:00
Serge Pavlov	16ff91ebcc	Introduce intrinsic llvm.isnan Clang has builtin function '__builtin_isnan', which implements C library function 'isnan'. This function now is implemented entirely in clang codegen, which expands the function into set of IR operations. There are three mechanisms by which the expansion can be made. * The most common mechanism is using an unordered comparison made by instruction 'fcmp uno'. This simple solution is target-independent and works well in most cases. It however is not suitable if floating point exceptions are tracked. Corresponding IEEE 754 operation and C function must never raise FP exception, even if the argument is a signaling NaN. Compare instructions usually does not have such property, they raise 'invalid' exception in such case. So this mechanism is unsuitable when exception behavior is strict. In particular it could result in unexpected trapping if argument is SNaN. * Another solution was implemented in https://reviews.llvm.org/D95948. It is used in the cases when raising FP exceptions by 'isnan' is not allowed. This solution implements 'isnan' using integer operations. It solves the problem of exceptions, but offers one solution for all targets, however some can do the check in more efficient way. * Solution implemented by https://reviews.llvm.org/D96568 introduced a hook 'clang::TargetCodeGenInfo::testFPKind', which injects target specific code into IR. Now only SystemZ implements this hook and it generates a call to target specific intrinsic function. Although these mechanisms allow to implement 'isnan' with enough efficiency, expanding 'isnan' in clang has drawbacks: * The operation 'isnan' is hidden behind generic integer operations or target-specific intrinsics. It complicates analysis and can prevent some optimizations. * IR can be created by tools other than clang, in this case treatment of 'isnan' has to be duplicated in that tool. Another issue with the current implementation of 'isnan' comes from the use of options '-ffast-math' or '-fno-honor-nans'. If such option is specified, 'fcmp uno' may be optimized to 'false'. It is valid optimization in general, but it results in 'isnan' always returning 'false'. For example, in some libc++ implementations the following code returns 'false': std::isnan(std::numeric_limits<float>::quiet_NaN()) The options '-ffast-math' and '-fno-honor-nans' imply that FP operation operands are never NaNs. This assumption however should not be applied to the functions that check FP number properties, including 'isnan'. If such function returns expected result instead of actually making checks, it becomes useless in many cases. The option '-ffast-math' is often used for performance critical code, as it can speed up execution by the expense of manual treatment of corner cases. If 'isnan' returns assumed result, a user cannot use it in the manual treatment of NaNs and has to invent replacements, like making the check using integer operations. There is a discussion in https://reviews.llvm.org/D18513#387418, which also expresses the opinion, that limitations imposed by '-ffast-math' should be applied only to 'math' functions but not to 'tests'. To overcome these drawbacks, this change introduces a new IR intrinsic function 'llvm.isnan', which realizes the check as specified by IEEE-754 and C standards in target-agnostic way. During IR transformations it does not undergo undesirable optimizations. It reaches instruction selection, where is lowered in target-dependent way. The lowering can vary depending on options like '-ffast-math' or '-ffp-model' so the resulting code satisfies requested semantics. Differential Revision: https://reviews.llvm.org/D104854	2021-08-04 15:27:49 +07:00
Sander de Smalen	84a4caeb84	[InstSimplify] Don't assume parent function when simplifying llvm.vscale. D106850 introduced a simplification for llvm.vscale by looking at the surrounding function's vscale_range attributes. The call that's being simplified may not yet have been inserted into the IR. This happens for example during function cloning. This patch fixes the issue by checking if the instruction is in a parent basic block.	2021-07-29 20:08:08 +01:00
Jun Ma	e2fe26e77b	[NFC][InstSimplify] Use more intuitive variable names.	2021-07-29 13:55:47 +08:00
Jun Ma	ca0fe3447f	[InstSimplify] Simplify llvm.vscale when vscale_range attribute exists Reduce llvm.vscale to constant based on vscale_range attribute. Differential Revision: https://reviews.llvm.org/D106850	2021-07-28 21:41:52 +08:00
Kevin P. Neal	2a7ee6b5c1	[FPEnv][InstSimplify] Enable more folds for constrained fadd Precommit tests, try 2. My tree is up-to-date as of this morning so this should go better than my first try.	2021-07-26 14:06:21 -04:00
Kevin P. Neal	aee8457b8d	Revert "[FPEnv][InstSimplify] Enable more folds for constrained fadd" Build bots have started failing. This reverts commit `64c2b2c69d`.	2021-07-23 15:09:05 -04:00
Kevin P. Neal	64c2b2c69d	[FPEnv][InstSimplify] Enable more folds for constrained fadd Precommit tests.	2021-07-23 14:59:38 -04:00
Serge Pavlov	1c64b5dc5e	[ConstantFolding] Fold constrained arithmetic intrinsics Constfold constrained variants of operations fadd, fsub, fmul, fdiv, frem, fma and fmuladd. The change also sets up some means to support for removal of unused constrained intrinsics. They are declared as accessing memory to model interaction with floating point environment, so they were not removed, as they have side effect. Now constrained intrinsics that have "fpexcept.ignore" as exception behavior are removed if they have no uses. As for intrinsics that have exception behavior other than "fpexcept.ignore", they can be removed if it is known that they do not raise floating point exceptions. It happens when doing constant folding, attributes of such intrinsic are changed so that the intrinsic is not claimed as accessing memory. Differential Revision: https://reviews.llvm.org/D102673	2021-07-23 14:39:51 +07:00
Sanjay Patel	13302c06cd	[ConstantFolding] avoid crashing on a fake math library call https://llvm.org/PR50960	2021-07-20 18:25:21 -04:00
Serge Pavlov	a0b4f424f5	Use update_test_checks.py to auto-generate check lines	2021-07-16 18:20:08 +07:00
Serge Pavlov	39a36999f9	Fix typo in test	2021-07-16 12:43:57 +07:00
Kevin P. Neal	52900486a1	[FPEnv][InstSimplify] Constrained FP support for NaN Currently InstructionSimplify.cpp knows how to simplify floating point instructions that have a NaN operand. It does not know how to handle the matching constrained FP intrinsic. This patch teaches it how to simplify so long as the exception handling is not "fpexcept.strict". Differential Revision: https://reviews.llvm.org/D103169	2021-07-09 11:26:28 -04:00
Sanjay Patel	4ec7c02197	[InstSimplify] fix bug in poison propagation for FP ops If any operand of a math op is poison, that takes precedence over general undef/NaN. This should not be visible with binary ops because it requires 2 constant operands to trigger (and if both operands of a binop are constant, that should get handled first in ConstantFolding).	2021-07-06 14:06:50 -04:00
Sanjay Patel	35e8cc4979	[InstSimplify][test] add tests for poison propagation through FP calls; NFC	2021-07-06 14:06:50 -04:00
Sanjay Patel	3d3c0ed932	[InstSimplify] fold extractelement of splat with variable extract index We already have a fold for variable index with constant vector, but if we can determine a scalar splat value, then it does not matter whether that value is constant or not. We overlooked this fold in D102404 and earlier patches, but the fixed vector variant is shown in: https://llvm.org/PR50817 Alive2 agrees on that: https://alive2.llvm.org/ce/z/HpijPC The same logic applies to scalable vectors. Differential Revision: https://reviews.llvm.org/D104867	2021-07-05 08:19:40 -04:00
Sanjay Patel	3ba090e5f6	[InstSimplify][test] add test for extract of splat; NFC This is shown in: https://llvm.org/PR50817	2021-06-24 13:13:35 -04:00
Sanjay Patel	e13c62a103	[InstSimplify][test] move tests that don't require InstCombine; NFC These are existing/missing simplifications, so the tests don't need the full power of InstCombine.	2021-06-24 13:13:34 -04:00
Sanjay Patel	656001e7b2	[ValueTracking] look through bitcast of vector in computeKnownBits This borrows as much as possible from the SDAG version of the code (originally added with D27129 and since updated with big endian support). In IR, we can test more easily for correctness than we did in the original patch. I'm using the simplest cases that I could find for InstSimplify: we computeKnownBits on variable shift amounts to see if they are zero or in range. So shuffle constant elements into a vector, cast it, and shift it. The motivating x86 example from https://llvm.org/PR50123 is also here. We computeKnownBits in the caller code, but we only check if the shift amount is in range. That could be enhanced to catch the 2nd x86 test - if the shift amount is known too big, the result is 0. Alive2 understands the datalayout and agrees that the tests here are correct - example: https://alive2.llvm.org/ce/z/KZJFMZ Differential Revision: https://reviews.llvm.org/D104472	2021-06-23 11:46:46 -04:00
Juneyoung Lee	5af8bacc94	[InstSimplify] Add more poison folding optimizations This adds more poison folding optimizations to InstSimplify. Since all binary operators propagate poison, these are fine. Also, the precondition of `select cond, undef, x` -> `x` is relaxed to allow the case when `x` is undef. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D104661	2021-06-23 20:25:24 +09:00
Juneyoung Lee	09e8c0d5aa	[InstSimplify] icmp poison, X -> poison This adds a simple transformation from icmp with poison constant to poison. Comparing poison with something else is poison, so this is okay. https://alive2.llvm.org/ce/z/e8iReb https://alive2.llvm.org/ce/z/q4MurY	2021-06-20 15:39:07 +09:00
Sanjay Patel	61196f855c	[InstSimplify] add tests for computeKnownBits of shift-with-bitcast op; NFC	2021-06-17 12:39:16 -04:00
Kevin P. Neal	60a8edf30d	[FPEnv][InstSimplify] Precommit tests for D103169. In D103169 I'm adding to InstSimplify support for NaN to constrained intrinsics that have a regular FP IR instruction counterpart. Precommit the tests for clarity when that ticket lands.	2021-06-17 10:34:39 -04:00
Bjorn Pettersson	4c7f820b2b	Update @llvm.powi to handle different int sizes for the exponent This can be seen as a follow up to commit `0ee439b705`, that changed the second argument of __powidf2, __powisf2 and __powitf2 in compiler-rt from si_int to int. That was to align with how those runtimes are defined in libgcc. One thing that seem to have been missing in that patch was to make sure that the rest of LLVM also handle that the argument now depends on the size of int (not using the si_int machine mode for 32-bit). When using __builtin_powi for a target with 16-bit int clang crashed. And when emitting libcalls to those rtlib functions, typically when lowering @llvm.powi), the backend would always prepare the exponent argument as an i32 which caused miscompiles when the rtlib was compiled with 16-bit int. The solution used here is to use an overloaded type for the second argument in @llvm.powi. This way clang can use the "correct" type when lowering __builtin_powi, and then later when emitting the libcall it is assumed that the type used in @llvm.powi matches the rtlib function. One thing that needed some extra attention was that when vectorizing calls several passes did not support that several arguments could be overloaded in the intrinsics. This patch allows overload of a scalar operand by adding hasVectorInstrinsicOverloadedScalarOpd, with an entry for powi. Differential Revision: https://reviews.llvm.org/D99439	2021-06-17 09:38:28 +02:00
Sanjay Patel	ce95200b79	[InstSimplify] propagate poison through FP ops We already have this fold: fadd float poison, 1.0 --> poison ...via ConstantFolding, so this makes the behavior consistent if the other operand(s) are non-constant. The fold for undef was added before poison existed as a value/type in IR. This came up in D102673 / D103169 because we're trying to sort out the more complicated handling for constrained math ops. We should have the handling for the regular instructions done first, so we can build on that (or diverge as needed). Differential Revision: https://reviews.llvm.org/D104383	2021-06-16 11:31:58 -04:00
Arthur Eubanks	9aa1428174	[InstSimplify] Treat invariant group insts as bitcasts for load operands We can look through invariant group intrinsics for the purposes of simplifying the result of a load. Since intrinsics can't be constants, but we also don't want to completely rewrite load constant folding, we convert the load operand to a constant. For GEPs and bitcasts we just treat them as constants. For invariant group intrinsics, we treat them as a bitcast. Relanding with a check for self-referential values. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D101103	2021-06-15 12:59:43 -07:00
Caroline Concatto	3c1f0e9ef8	[InstSimplify] Add constant fold for extractelement + splat for scalable vectors This patch allows that scalable vector can fold extractelement and constant splat only when the lane index is lower than the minimum number of elements of the vector. Differential Revision: https://reviews.llvm.org/D103180	2021-06-10 12:41:40 +01:00
Serge Pavlov	8ff36aab69	[ConstantFolding] Enable folding of min/max/copysign for all floats Previously such folding was enabled for half, float and double values only. With this change it is allowed for other floating point values also. Differential Revision: https://reviews.llvm.org/D103956	2021-06-10 11:57:51 +07:00
Arthur Eubanks	222cce3828	Revert "[InstSimplify] Treat invariant group insts as bitcasts for load operands" This reverts commit `26044c6a54`. Breaks on invalid IR (see D101103).	2021-06-09 11:46:10 -07:00
Sanjay Patel	8a4d05ddb3	[ConstantFolding] add copysign tests for more FP types; NFC D102673 proposes to ease the current type check, but there doesn't appear to be any test coverage for that.	2021-06-04 11:42:53 -04:00
Arthur Eubanks	26044c6a54	[InstSimplify] Treat invariant group insts as bitcasts for load operands We can look through invariant group intrinsics for the purposes of simplifying the result of a load. Since intrinsics can't be constants, but we also don't want to completely rewrite load constant folding, we convert the load operand to a constant. For GEPs and bitcasts we just treat them as constants. For invariant group intrinsics, we treat them as a bitcast. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D101103	2021-06-01 16:33:06 -07:00
Arthur Eubanks	3aa943070c	[test] Precommit test for D101103	2021-06-01 16:31:02 -07:00
Arthur Eubanks	8086f9d87e	[ConstFold] Simplify a load's GEP operand through local aliases MSVC-style RTTI produces loads through a GEP of a local alias which itself is a GEP. Currently we aren't able to devirtualize any virtual calls when MSVC RTTI is enabled. This patch attempts to simplify a load's GEP operand by calling SymbolicallyEvaluateGEP() with an option to look through local aliases. Differential Revision: https://reviews.llvm.org/D101100	2021-05-27 16:04:19 -07:00
Sanjay Patel	ca7eaa0a54	[InstSimplify] allow undef element match in vector select condition value The semantics of select with undefined/poison condition are not explicitly stated in the LangRef, but this matches comments in the code and Alive2 appears to concur: https://alive2.llvm.org/ce/z/KXytmd We can find this pattern after demanded elements transforms. As noted in D101191, fuzzers are finding infinite loops because we may not account for this pattern in other passes.	2021-05-25 14:25:34 -04:00
David Goldblatt	8607a02357	[InstSimplify] Transform X * Y % Y --> 0 simplifyDiv already handles the case X * Y / Y --> X (barring overflow). This adds the equivalent handling to simplifyRem. Correctness: https://alive2.llvm.org/ce/z/J2cUbS https://alive2.llvm.org/ce/z/us9NUM https://alive2.llvm.org/ce/z/AvaDGJ https://alive2.llvm.org/ce/z/kq9ige Extending the situations in which we apply this transform would not be correct: https://alive2.llvm.org/ce/z/Lf9V63 https://alive2.llvm.org/ce/z/6RPQK3 https://alive2.llvm.org/ce/z/p9UdxC https://alive2.llvm.org/ce/z/A2zlhE https://alive2.llvm.org/ce/z/vHTtLw https://alive2.llvm.org/ce/z/lvpH42 Differential Revision: https://reviews.llvm.org/D102864	2021-05-25 10:16:04 -04:00
Sanjay Patel	a0e71f1832	[ConstProp] propagate poison from vector reduction element(s) to result This follows from the underlying logic for binops and min/max. Although it does not appear that we handle this for min/max intrinsics currently. https://alive2.llvm.org/ce/z/Kq9Xnh	2021-05-24 10:34:40 -04:00
Sanjay Patel	3dd2063671	[ConstProp] add tests for vector reductions with poison elements; NFC	2021-05-24 10:34:40 -04:00
Sanjay Patel	cb3bc9d81d	[InstSimplify] add more tests for rem-mul-div; NFC See D102864 for discussion.	2021-05-23 09:46:29 -04:00
David Goldblatt	3c4b79481d	[InstSimplify] add tests for rem-of-mul; NFC These are baseline tests for D102864	2021-05-21 15:46:39 -04:00
Joe Ellis	5a476987f7	[InstSimplify] Properly constrain {insert,extract}_subvector intrinsic fold The previous rule: (insert_vector _, (extract_vector X, 0), 0) -> X is not quite correct. The correct fold should be: (insert_vector Y, (extract_vector X, 0), 0) -> X where: Y is X, or Y is undef This commit updates the pattern. Reviewed By: peterwaller-arm, paulwalker-arm Differential Revision: https://reviews.llvm.org/D102699	2021-05-21 10:05:03 +00:00
Joe Ellis	2ed7db0d20	[InstSimplify] Remove redundant {insert,extract}_vector intrinsic chains This commit removes some redundant {insert,extract}_vector intrinsic chains by implementing the following patterns as instsimplifies: (insert_vector _, (extract_vector X, 0), 0) -> X (extract_vector (insert_vector _, X, 0), 0) -> X Reviewed By: peterwaller-arm Differential Revision: https://reviews.llvm.org/D101986	2021-05-13 16:09:50 +00:00
Juneyoung Lee	395607af3c	Reapply [ConstantFold] Fold more operations to poison This was reverted to mitigate mitigate miscompiles caused by the logical and/or to bitwise and/or fold. Reapply it now that the underlying issue has been fixed by D101191. ----- This patch folds more operations to poison. Alive2 proof: https://alive2.llvm.org/ce/z/mxcb9G (it does not contain tests about div/rem because they fold to poison when raising UB) Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D92270	2021-05-13 16:04:12 +02:00
Stanislav Mekhanoshin	22d295f695	[AMDGPU] Constant fold Intrinsic::amdgcn_perm Differential Revision: https://reviews.llvm.org/D102203	2021-05-10 16:23:11 -07:00
Dávid Bolvanský	62fcda9378	Revert "[InstSimplify] Added tests for PR50173, NFC" This reverts commit `4e7a4c73da`. Not needed, pattern is handled by instcombine already.	2021-05-04 23:04:05 +02:00
Dávid Bolvanský	4e7a4c73da	[InstSimplify] Added tests for PR50173, NFC	2021-05-04 19:32:44 +02:00
Sanjay Patel	15a42339fe	[ValueTracking] soften assert for invertible recurrence matching There's a TODO comment in the code and discussion in D99912 about generalizing this, but I wasn't sure how to implement that, so just going with a potential minimal fix to avoid crashing. The test is a reduction beyond useful code (there's no user of %user...), but it is based on https://llvm.org/PR50191, so this is asserting on real code. Differential Revision: https://reviews.llvm.org/D101772	2021-05-03 15:57:40 -04:00
Sanjay Patel	1089158c5a	[ConstantFolding] propagate poison through vector reduction intrinsics	2021-04-29 12:54:20 -04:00
Sanjay Patel	678018138d	[ConstProp] add tests for vector reductions of poison; NFC	2021-04-29 12:20:59 -04:00
Sanjay Patel	5e6dc5e404	[InstSimplify] generalize ctlz-of-shifted-constant https://alive2.llvm.org/ce/z/zWL_VQ	2021-04-21 14:23:55 -04:00
Sanjay Patel	859e1f420d	[InstSimplify] add tests for ctlz-of-shift-constant; NFC	2021-04-21 14:23:55 -04:00
Nikita Popov	de18fa9e52	Revert "[InstSimplify] Bypass no-op `and`-mask, using known bits (PR49543)" This reverts commit `ea1a0d7c9a`. While this is strictly more powerful, it is also strictly slower. InstSimplify intentionally does not perform many folds that it is allowed to perform, if doing so requires a KnownBits calculation that will be repeated in InstCombine. Maybe it's worthwhile to do this here, but that needs a more explicitly stated motivation, evaluated in a review.	2021-04-21 09:55:25 +02:00
Roman Lebedev	ea1a0d7c9a	[InstSimplify] Bypass no-op `and`-mask, using known bits (PR49543) We already special-cased a few interesting patterns, but that is strictly less powerful than using KnownBits. So instead get the known bits for the operand of `and`, and iff all the unset bits of the `and`-mask are known to be zeros in the operand, we can omit said `and`.	2021-04-21 00:31:46 +03:00
Roman Lebedev	8cff391995	[NFC][InstSimplify] Add one more test for unneeded 'and'	2021-04-21 00:31:46 +03:00
Joe Ellis	effacc1599	[AArch64] Constant fold sve_convert_from_svbool(zero) to zero Co-authored-by: Paul Walker <paul.walker@arm.com> Differential Revision: https://reviews.llvm.org/D100463	2021-04-20 10:02:49 +00:00
Juneyoung Lee	2813acb7d1	Update m_Undef to match vectors/aggrs with undefs and poisons mixed This fixes https://reviews.llvm.org/D93990#2666922 by teaching `m_Undef` to match vectors/aggrs with poison elements. As suggested, fixes in InstCombine files to use the `m_Undef` matcher instead of `isa<UndefValue>` will be followed. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D100122	2021-04-18 10:57:04 +09:00
Thomas Lively	5c729750a6	[WebAssembly] Remove saturating fp-to-int target intrinsics Use the target-independent @llvm.fptosi and @llvm.fptoui intrinsics instead. This includes removing the instrinsics for i32x4.trunc_sat_zero_f64x2_{s,u}, which are now represented in IR as a saturating truncation to a v2i32 followed by a concatenation with a zero vector. Differential Revision: https://reviews.llvm.org/D100596	2021-04-16 12:11:20 -07:00
Sander de Smalen	672f673004	[SVE] Remove checks for warnings in scalable-vector tests. After D98856 these tests will by default break (fatal_error) if any of the wrong interfaces are used, so there's no longer a need to have a RUN line that checks for a warning message emitted by the compiler.	2021-04-07 15:59:32 +01:00
Florian Hahn	4059c1c32d	[SimplifyInst] Use correct type for GEPs with vector indices. The current code does not properly handle vector indices unless they are the first index. At the moment LangRef gives the impression that the vector index must be the one and only index (https://llvm.org/docs/LangRef.html#getelementptr-instruction). But vector indices can appear at any position and according to the verifier there may be multiple vector indices. If that's the case, the number of elements must match. This patch updates SimplifyGEPInst to properly handle those additional cases. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D99961	2021-04-06 17:56:10 +01:00
Sanjay Patel	e2a0f512ea	[InstSimplify] fix potential miscompile in select value equivalence This is the sibling fix to `c590a9880d` - as there, we can't subsitute a vector value the equality compare replacement that we are trying requires that the comparison is true for the entire value. Vector select can be partly true/false.	2021-04-05 16:52:34 -04:00
Sanjay Patel	78e5cf66fe	[InstSimplify] add test for vector select with operand replacement; NFC We need a sibling fix to `c590a9880d` ( https://llvm.org/PR49832 ) to avoid miscompiling.	2021-04-05 16:52:34 -04:00
Juneyoung Lee	6147501617	[InstSimplify] Add a test for folding comparison with a undef vector (NFC) This is to fix https://reviews.llvm.org/D93990#2666922	2021-04-04 13:46:56 +09:00
George Mitenkov	807b019ca2	[ConstantFolding] Fixing addo/subo with undef When folding addo/subo with undef, the current convention is to use { -1, false } for addo and { 0, false } for subo. This was fixed for InstSimplify in https://reviews.llvm.org/rGf094d65beaa492e845b03561eddd75b5be653a01, but not in ConstantFolding. Reviewed By: nikic, lebedev.ri Differential Revision: https://reviews.llvm.org/D99564	2021-03-31 21:47:29 +03:00
Sanjay Patel	adf42dff42	[ValueTracking] peek through min/max to find isKnownToBeAPowerOfTwo This is similar to the select logic just ahead of the new code. Min/max choose exactly one value from the inputs, so if both of those are a power-of-2, then the result must be a power-of-2. This might help with D98152, but we likely still need other pieces of the puzzle to avoid regressions. The change in PatternMatch.h is needed to build with clang. It's possible there is a better way to deal with the 'const' incompatibities. Differential Revision: https://reviews.llvm.org/D99276	2021-03-24 17:54:38 -04:00
Sanjay Patel	a8708708cf	[InstSimplify] add tests for min/max intrinsic analysis; NFC	2021-03-24 12:21:59 -04:00
Craig Topper	4c38c35c8d	[ValueTracking] Teach canCreateUndefOrPoison that ctpop does not create undef or poison. This select of ctpop with 0 pattern can get left behind after loop idiom recognize converts a loop to ctpop. LLVM 10 was able to optimize this, but LLVM 11 and later is not. The difference seems to be that some select transforms are now limited based on canCreateUndefOrPoison. Teaching canCreateUndefOrPoison about ctpop restores the LLVM 10 codegen. Differential Revision: https://reviews.llvm.org/D99207	2021-03-23 12:42:18 -07:00
Nikita Popov	7e18cd887c	[InstCombine] Whitelist non-refining folds in SimplifyWithOpReplaced This is an alternative to D98391/D98585, playing things more conservatively. If AllowRefinement == false, then we don't use InstSimplify methods at all, and instead explicitly implement a small number of non-refining folds. Most cases are handled by constant folding, and I only had to add three folds to cover our unit tests / test-suite. While this may lose some optimization power, I think it is safer to approach from this direction, given how many issues this code has already caused. Differential Revision: https://reviews.llvm.org/D99027	2021-03-22 22:12:56 +01:00
Nikita Popov	59dbf4d516	[InstSimplify] Add load of undef aggregate test (NFC) To make sure this doesn't crash the following commit.	2021-03-21 17:42:26 +01:00
Nikita Popov	b32f5d5045	[InstSimplify] Regenerate test checks (NFC)	2021-03-21 17:41:21 +01:00
Nikita Popov	ece1403aca	[InstSimplify] Add additional select operand replacement tests (NFC) This tests for binops with identity elements.	2021-03-21 15:30:30 +01:00
Simonas Kazlauskas	6513995be3	[InstSimplify] Restrict a GEP transform to avoid provenance changes This is a follow-up to D98588, and fixes the inline `FIXME` about a GEP-related simplification not preserving the provenance. https://alive2.llvm.org/ce/z/qbQoAY Additional tests were added in {rGf125f28afdb59eba29d2491dac0dfc0a7bf1b60b} Depends on D98672 Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D98611	2021-03-16 18:53:05 +02:00
Sanjay Patel	660728acd4	[InstSimplify] ctlz({signbit} >>u x) --> x The motivating pattern was handled in `0a2d69480d` , but we should have this for symmetry. But this really highlights that we could generalize for any shifted constant if we match this in instcombine. https://alive2.llvm.org/ce/z/MrmVNt	2021-03-15 12:03:35 -04:00
Sanjay Patel	3c93852a78	[InstSimplify] add tests for ctlz of shifted constant; NFC	2021-03-15 12:03:35 -04:00
Simonas Kazlauskas	f125f28afd	[InstSimplify] Add additional GEP transform tests & regenerate	2021-03-14 23:18:26 +02:00
Bjorn Pettersson	529c8e8dc6	[InstSimplify] Simplify smul.fix and smul.fix.sat Add simplification of smul.fix and smul.fix.sat according to X * 0 -> 0 X * undef -> 0 X * (1 << scale) -> X This includes the commuted patterns and splatted vectors. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D98299	2021-03-12 09:09:58 +01:00
Bjorn Pettersson	3638bdfbda	[ConstantFold] Handle undef/poison when constant folding smul_fix/smul_fix_sat Do constant folding according to posion * C -> poison C * poison -> poison undef * C -> 0 C * undef -> 0 for smul_fix and smul_fix_sat intrinsics (for any scale). Reviewed By: nikic, aqjune, nagisa Differential Revision: https://reviews.llvm.org/D98410	2021-03-12 09:09:58 +01:00
Juneyoung Lee	3170978173	[InstSimplify] Add tests for pr49495 (NFC)	2021-03-10 17:54:46 +09:00
Sanjay Patel	0a2d69480d	[InstSimplify] cttz(1<<x) --> x https://alive2.llvm.org/ce/z/TDacYu https://alive2.llvm.org/ce/z/KF84S3	2021-03-08 16:30:14 -05:00
Sanjay Patel	afa443831b	[InstSimplify] add tests for cttz of shifted-1; NFC	2021-03-08 16:30:13 -05:00
Nikita Popov	7faad5c900	[ConstantFold] Handle icmp of global and null consistently Return UGT rather than NE for icmp @g, null, which is slightly stronger. This is consistent with what we do for more complex folds. It is somewhat silly that @g ugt null does not get folded while (gep @g) ugt null does.	2021-03-08 17:18:01 +01:00
Nikita Popov	f08148e874	[ConstProp] Fix folding of pointer icmp with signed predicates While @g ugt null is always true (ignoring weak symbols), @g sgt null is not necessarily the case -- that would imply that it is forbidden to place globals in the high half of the address space.	2021-03-08 17:12:12 +01:00

1 2 3 4 5 ...

1121 Commits