llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	2e87333bfe	[InstCombine] convert mul by negative-pow2 to negate and shift This is an unusual canonicalization because we create an extra instruction, but it's likely better for analysis and codegen (similar reasoning as D133399). InstCombine::Negator may create this kind of multiply from negate and shift, but this should not conflict because of the narrow negation. I don't know how to create a fully general proof for this kind of transform in Alive2, but here's an example with bitwidths similar to one of the regression tests: https://alive2.llvm.org/ce/z/J3jTjR Differential Revision: https://reviews.llvm.org/D133667	2022-10-02 12:22:25 -04:00
Sanjay Patel	e239198cdb	[InstCombine] fold select shuffles with shared operand together We don't combine generic shuffles together in IR, but select shuffles are a special-case because a select shuffle of a select shuffle is just another select shuffle; codegen is expected to efficiently lower those (select shuffles are also the canonical form of a vector select with constant condition).	2022-09-28 11:56:27 -04:00
Sanjay Patel	def6cbd2bd	[InstCombine] add assert/test for zext to i1 This is a test to verify that we do not crash with the problem noted in issue #57986. The root problem should be fixed with a prior change to InstSimplify.	2022-09-26 16:01:25 -04:00
Nikita Popov	8df376db72	[InstCombine] Remove buggy zext of icmp eq with pow2 fold (PR57899) For the case where the constant is a power of two rather than zero, the fold is incorrect, because it fails to check that the bit set in the LHS matches the bit in the RHS. Rather than fixing this, remove the power of two handling entirely, as a different fold will already canonicalize such comparisons to use a zero constant. Fixes https://github.com/llvm/llvm-project/issues/57899.	2022-09-22 16:37:10 +02:00
Nikita Popov	c2e76f914c	[InstCombine] Use simplifyWithOpReplaced() for non-bool selects Perform the simplifyWithOpReplaced() fold even for non-bool selects. This subsumes a number of recently added folds for zext/sext of the condition. We still need to manually handle variations with both sext/zext and not, because simplifyWithOpReplaced() only performs one level of replacements.	2022-09-22 15:46:00 +02:00
Nikita Popov	41dde5d858	[InstSimplify] Support vectors in simplifyWithOpReplaced() We can handle vectors inside simplifyWithOpReplaced(), as long as cross-lane operations are excluded. The equality can hold (or not hold) for each vector lane independently, so we shouldn't use the replacement value from other lanes. I believe the only operations relevant here are shufflevector (where all previous bugs were seen) and calls (which might use shuffle-like intrinsics and would require more careful classification). Differential Revision: https://reviews.llvm.org/D134348	2022-09-22 10:45:42 +02:00
Sanjay Patel	ee0bf64722	[InstCombine] try to fold mul by neg-power-of-2 to shl `(A * -2**C) + B --> B - (A << C)` https://alive2.llvm.org/ce/z/A6BWkf This inverts what Negator was doing before: D134310 / `0f32a5dea0` Analysis and codegen are generally better without multiply, so we should favor this form even if we trade add for sub (because those are generally equivalent cost operations).	2022-09-21 15:09:39 -04:00
Sanjay Patel	64d309131a	[InstCombine] try multi-use demanded bits fold for 'sub' This is similar to D133788 / `73919a87e9`, but for sub the transform is valid only for low zeros in operand 1. https://alive2.llvm.org/ce/z/EmRsXC	2022-09-21 14:13:05 -04:00
Sanjay Patel	0f32a5dea0	[InstCombine] don't canonicalize shl+sub to mul+add This stops Negator from transforming: `C1 - shl X, C2 --> mul X, (1<<C2) + C1` ...in the general case. There does not seem to be any analysis benefit to using mul in IR, and there's definitely downside in codegen (particularly when the multiply has to be expanded). If `C1` is 0, then there's a stronger argument that the single mul is a better canonicalization than negate-of-shl, but we may want to remove that too. This was noted as a potential conflict for D133667. Differential Revision: https://reviews.llvm.org/D134310	2022-09-21 08:39:07 -04:00
Markus Böck	b751da43b2	[InstCombine] Handle integer extension in `select` patterns using the condition as value These patterns were previously only implemented for i1 type but can be extended for any integer type by also handling zext and sext operands. Differential Revision: https://reviews.llvm.org/D134142	2022-09-20 22:25:13 +02:00
Zain Jaffal	68cc35d52c	[InstCombine] Matrix multiplication negation optimisation If one of the operands in a matrix multiplication is negated we can optimise the equation by moving the negation to the smallest element of the operands or the result. Reviewed By: spatel, fhahn Differential Revision: https://reviews.llvm.org/D133300	2022-09-20 19:50:39 +01:00
Simon Pilgrim	09cb9fdef9	[InstCombine] Fold ult(add(x,-1),c) -> ule(x,c) iff x != 0 (PR57635) Alive2: https://alive2.llvm.org/ce/z/sZ6wwS As detailed on Issue #57635 and #37628 - for unsigned comparisons, we can compare prior to a decrement iff the value is known never to be zero. Differential Revision: https://reviews.llvm.org/D134172	2022-09-20 16:44:41 +01:00
Matt Arsenault	fd37ab6cf6	InstCombine: Pass AssumptionCache through isDereferenceablePointer	2022-09-19 19:10:51 -04:00
Sanjay Patel	d6498abc24	[InstCombine] remove multi-use add demanded constant fold This was originally part of D133788. There are no visible regressions. All of the diffs show a large unsigned constant becoming a small negative constant. This should be better for analysis (and slightly less compile-time) and codegen.	2022-09-18 14:23:43 -04:00
Marc Auberer	f52dd920d4	[InstCombine] Fix bug when folding x + (x \| -x) to x & (x - 1) Addresses concern: https://reviews.llvm.org/rG09cdddea0c4d284c2c22f5dfade40a60850c5ea7 There was a copy/paste mistake in the code. Updated code and test ref. Differential Revision: https://reviews.llvm.org/D134135	2022-09-18 13:16:12 -04:00
Sanjay Patel	1d1d1e6f22	[InstCombine] fold full-shift of sdiv to icmp+extend This is a disguised sign-bit test with offset: (X / +DivC) >> (Width - 1) --> ext (X <= -DivC) (X / -DivC) >> (Width - 1) --> ext (X >= +DivC) https://alive2.llvm.org/ce/z/cO8JO4 We don't match/test poison in the sdiv constant because that would be immediate undefined behavior.	2022-09-18 13:13:14 -04:00
Sanjay Patel	6174da2299	[InstCombine] reduce code duplication in foldICmpMulConstant(); NFC	2022-09-16 10:39:54 -04:00
Sanjay Patel	02a27b3890	[InstCombine] fold X*X == 0 --> X == 0 This is safe when the mul does not overflow: https://alive2.llvm.org/ce/z/LedVVP This could be extended to handle non-zero compare constants and non-squared multiplies.	2022-09-15 12:02:50 -04:00
Zain Jaffal	8253f7e286	[InstCombine] Optimize multiplication where both operands are negated Handle the case where both operands are negated in matrix multiplication Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D133695	2022-09-14 16:29:39 +01:00
Sanjay Patel	73919a87e9	[InstCombine] try multi-use demanded bits folds for 'add' This patch enables a multi-use demanded bits fold (motivated by issue #57576): https://alive2.llvm.org/ce/z/DsZakh This mimics transforms that we already do on the single-use path. Originally, this patch did not include the last part to form a constant, but that can be removed independently to reduce risk. It's not clear what the effect of either change will be when viewed end-to-end. This is expected to be neutral or a slight win for compile-time. See the "add-demand2" series for experimental timing results: https://llvm-compile-time-tracker.com/?config=NewPM-O3&stat=instructions&remote=rotateright Differential Revision: https://reviews.llvm.org/D133788	2022-09-14 09:30:59 -04:00
Sanjay Patel	53eede597e	[InstCombine] look through 'not' of ctlz/cttz op with 0-is-undef https://alive2.llvm.org/ce/z/MNsC1S This pattern was flagged at: https://discourse.llvm.org/t/instcombines-select-optimizations-dont-trigger-reliably/64927	2022-09-12 15:06:21 -04:00
Marc Auberer	09cdddea0c	[InstCombine] Fold x + (x \| -x) to x & (x - 1) Fixes #57531 This transformation may be particularly useful on x86-64, because x & (x - 1) can be performed by a single blsr instruction. Differential Revision: https://reviews.llvm.org/D133362	2022-09-11 06:14:24 -04:00
Sanjay Patel	6113e6738d	[InstCombine] move/adjust comments about demanded bits; NFC The code has been moved/copied around, but the comments were not updated to match.	2022-09-09 11:48:20 -04:00
Sebastian Neubauer	c7750c522e	Add helper func to get first non-alloca position The LLVM performance tips suggest that allocas should be placed at the beginning of the entry block. So far, llvm doesn’t provide any helper to find that position. Add BasicBlock::getFirstNonPHIOrDbgOrAlloca and IRBuilder::SetInsertPointPastAllocas(Function*) that get an insert position after the (static) allocas at the start of a function and use it in ShadowStackGCLowering. Differential Revision: https://reviews.llvm.org/D132554	2022-09-09 15:39:53 +02:00
Sanjay Patel	444f08c832	[InstCombine] fold icmp of truncated left shift, part 2 (trunc (1 << Y) to iN) == 2C --> Y == C (trunc (1 << Y) to iN) != 2C --> Y != C https://alive2.llvm.org/ce/z/xnFPo5 Follow-up to `d9e1f9d759`. This was a suggested enhancement mentioned in issue #51889.	2022-09-08 12:44:02 -04:00
Joe Loser	5e96cea1db	[llvm] Use std::size instead of llvm::array_lengthof LLVM contains a helpful function for getting the size of a C-style array: `llvm::array_lengthof`. This is useful prior to C++17, but not as helpful for C++17 or later: `std::size` already has support for C-style arrays. Change call sites to use `std::size` instead. Differential Revision: https://reviews.llvm.org/D133429	2022-09-08 09:01:53 -06:00
Sanjay Patel	d9e1f9d759	[InstCombine] Fold icmp of truncated left shift (trunc (1 << Y) to iN) == 0 --> Y u>= N (trunc (1 << Y) to iN) != 0 --> Y u< N These can be generalized in several ways as noted by the TODO items, but this handles the pattern in the motivating bug report. Fixes #51889 Differential Revision: https://reviews.llvm.org/D115480	2022-09-08 10:48:14 -04:00
Chenbing Zheng	01cea7ac10	[InstCombine] extractvalue (any_mul_with_overflow X, 2^n), 0 -> X << n Alive2: https://alive2.llvm.org/ce/z/JLmabt (umul) https://alive2.llvm.org/ce/z/J_ruXR (smul) https://alive2.llvm.org/ce/z/o9SVSz (vector) Reviewed By: spatel, RKSimon Differential Revision: https://reviews.llvm.org/D133188	2022-09-08 11:12:55 +08:00
Sami Tolvanen	52967a5306	[InstCombine] Fix a crash in -kcfi debug block Don't attempt to print out DebugLoc as we may not have one.	2022-09-07 22:59:12 +00:00
Sanjay Patel	7c57180900	[InstCombine] fold add+negate through select into sub This transform came up as a potential DAGCombine in D133282, so I wanted to see how it escaped in IR too. We do general folds in InstCombiner::SimplifySelectsFeedingBinaryOp() by checking if either arm of a select simplifies when the trailing binop is threaded into the select. So as long as one side simplifies, it's a good fold to combine a negate and add into 1 subtract. This is an example with a zero arm in the select: https://alive2.llvm.org/ce/z/Hgu_Tj And this models the tests with a cancelling 'not' op: https://alive2.llvm.org/ce/z/BuzVV_ Differential Revision: https://reviews.llvm.org/D133369	2022-09-07 08:23:35 -04:00
Sanjay Patel	ae117e1c1b	[InstCombine] remove dead code for add (select cond, (sub), 0); NFC This pattern is handled more generally in SimplifySelectsFeedingBinaryOp(). Tests to confirm that added to the add.ll test file in the previous commit.	2022-09-06 12:19:50 -04:00
Sanjay Patel	dd6eb4d67f	[InstCombine] reduce code duplication; NFC	2022-09-06 08:19:30 -04:00
Tian Zhou	8fa432be4f	[InstCombine] reduce test-for-overflow of shifted value Fixes #57338. The added code makes the following transformations: For unsigned predicates / eq / ne: icmp pred (x << 1), x --> icmp getSignedPredicate(pred) x, 0 icmp pred x, (x << 1) --> icmp getSignedPredicate(pred) 0, x Some examples: https://alive2.llvm.org/ce/z/ckn4cj https://alive2.llvm.org/ce/z/h-4bAQ Differential Revision: https://reviews.llvm.org/D132888	2022-09-05 09:51:51 -04:00
Sanjay Patel	5c759edc57	[InstCombine] reduce another or-xor bitwise logic pattern ~(A & ?) \| (A ^ B) --> ~((A & ?) & B) https://alive2.llvm.org/ce/z/mxex6V This is similar to `9d218b61cc` where we peeked through another logic op to find a common operand.	2022-09-03 09:32:08 -04:00
Muhammad Omair Javaid	18de7c6a3b	Revert "[InstCombine] Treat passing undef to noundef params as UB" This reverts commit `c911befaec`. It has broken LLDB Arm/AArch64 Linux buildbots. I dont really understand the underlying reason. Reverting for now make buildbot green. https://reviews.llvm.org/D133036	2022-09-02 16:09:50 +05:00
Chenbing Zheng	d30cf77cb1	[InstCombine] complete fold extractvalue (any_mul_with_overflow X, -1) When we do extractvalue (any_mul_with_overflow X, -1) --> (-X and icmp), which left partly failed to match vector constant with poison element. This patch try to fix it. Alive2: https://alive2.llvm.org/ce/z/2rGp_3 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D132996	2022-09-02 10:58:42 +08:00
Arthur Eubanks	c911befaec	[InstCombine] Treat passing undef to noundef params as UB Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D133036	2022-09-01 15:16:45 -07:00
Sanjay Patel	c3d1504d63	[InstCombine] fix crash on type mismatch with fcmp fold The existing predicate doesn't work for a single-element vector, so make sure we are not crossing scalar/vector types. Test (was crashing) based on the post-commit example for: `4827771234`	2022-09-01 08:57:55 -04:00
Sanjay Patel	addbdac5d5	[InstCombine] fold power-of-2 ctlz/cttz with inverted result When X is a power-of-two or zero and zero input is poison: ctlz(i32 X) ^ 31 --> cttz(X) cttz(i32 X) ^ 31 --> ctlz(X) https://alive2.llvm.org/ce/z/Cs7sFE	2022-09-01 08:57:55 -04:00
Nikita Popov	43e7d9af1d	[InstCombine] Fold extractvalue of phi Just as we do for most other operations, we should push extractvalue instructions through phis, if this does not increase unfolded instruction count.	2022-09-01 10:51:54 +02:00
Nikita Popov	ad66bc42b0	[InstCombine] Use getInsertionPointAfterDef() in freeze fold This simplifies the code and fixes handling of catchswitch, in which case we have no insertion point for the freeze. Originally part of D129660.	2022-08-31 11:32:57 +02:00
Nikita Popov	972840aa3b	[IR] Add Instruction::getInsertionPointAfterDef() Transforms occasionally want to insert an instruction directly after the definition point of a value. This involves quite a few different edge cases, e.g. for phi nodes the next insertion point is not the next instruction, and for invokes and callbrs its not even in the same block. Additionally, the insertion point may not exist at all if catchswitch is involved. This adds a general Instruction::getInsertionPointAfterDef() API to implement the necessary logic. For now it is used in two places where this should be mostly NFC. I will follow up with additional uses where this fixes specific bugs in the existing implementations. Differential Revision: https://reviews.llvm.org/D129660	2022-08-31 10:50:10 +02:00
Chenbing Zheng	35a3048c25	[InstCombine] add support for multi-use Y of (X op Y) op Z --> (Y op Z) op X For (X op Y) op Z --> (Y op Z) op X we can still do transform when Y is multi-use. In D131356 limit it to one-use, this patch remove this limit. This is still not a complete solution, I add a todo test to show it. In this case, X and Y are both multi use, we can't differentiate how to convert based on this. But at least we don't make the code worse，and it can solve half the scenarios.	2022-08-31 10:55:05 +08:00
Sanjay Patel	8a19842c0e	[InstCombine] delete redundant folds; NFC InstSimplify does this via isKnownNonEqual(), so it's already using knownbits on these patterns and trying other folds.	2022-08-30 14:21:29 -04:00
zhongyunde	23a5de4294	[InstCombine] Distributive or+mul with const operand We aleady support the transform: `(X+C1)CI -> XCI+C1CI` Here the case is a little special as the form of `(X+C1)CI` is transformed into `(X\|C1)CI`, so we should also support the transform: `(X\|C1)CI -> XCI+C1CI` Fixes https://github.com/llvm/llvm-project/issues/57278 Reviewed By: bcl5980, spatel, RKSimon Differential Revision: https://reviews.llvm.org/D132658	2022-08-30 20:36:52 +08:00
jacquesguan	df525c7705	[InstCombine] fold fake floating point vector extract to shift+trunc. This patch supports the FP part of D111082. Differential Revision: https://reviews.llvm.org/D125750	2022-08-30 10:12:16 +08:00
Sanjay Patel	6c39a3aae1	[InstCombine] fold not-shift of signbit to icmp+zext https://alive2.llvm.org/ce/z/j_8Wz9 The arithmetic shift was converted to logical shift with: `246078604c` That does not seem to uncover any other missing/conflicting folds, so convert directly to signbit test + cast. We still need to fold the pattern with logical shift to test + cast. This allows reducing patterns where the output type is not the same as the input value: https://alive2.llvm.org/ce/z/nydwFV Fixes #57394	2022-08-29 10:06:31 -04:00
Sanjay Patel	246078604c	[InstCombine] fold inc-of-signbit-splat to not+lshr (iN X s>> (N - 1)) + 1 --> (~X) u>> (N - 1) https://alive2.llvm.org/ce/z/wzS474	2022-08-29 08:48:22 -04:00
Kazu Hirata	c63f823875	[llvm] Use range-based for loops (NFC)	2022-08-28 17:35:04 -07:00
Sanjay Patel	ab6892967c	[InstCombine] allow sext in fold of mask using signbit, part 2 https://alive2.llvm.org/ce/z/rcbZmx Sibling tranform to `275aa24c0a` This pattern is seen in the examples in issue #57381.	2022-08-28 11:50:52 -04:00

1 2 3 4 5 ...

5134 Commits