llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	56c9976d46	[IndVarSimplify] Don't assert that terminator is not SCEVable (PR55925) The IV widening code currently asserts that terminators aren't SCEVable -- however, this is not the case for invokes with a returned attribute. As far as I can tell, this assertion is not necessary -- even if we have a critical edge (the second test case), the trunc gets inserted in a legal position. Fixes https://github.com/llvm/llvm-project/issues/55925. Differential Revision: https://reviews.llvm.org/D127288	2022-06-09 10:12:13 +02:00
Max Kazantsev	16c028a8c8	[Test] Add XFAIL test for PR55689 SCEV issues in dynamically unreached code, see details at https://github.com/llvm/llvm-project/issues/55689 1st reduced test by Nikic!	2022-06-08 16:01:29 +07:00
Nikita Popov	03aceab08b	[ValueTracking] Enable -branch-on-poison-as-ub by default Now that SimpleLoopUnswitch and other transforms no longer introduce branch on poison, enable the -branch-on-poison-as-ub option by default. The practical impact of this is mostly better flag preservation in SCEV, and some freeze instructions no longer being necessary. Differential Revision: https://reviews.llvm.org/D125299	2022-06-01 10:46:06 +02:00
Nikita Popov	c8b675eaa1	[SCEV] Use umin_seq for BECount of multi-exit loops When computing the BECount for multi-exit loops, we need to combine individual exit counts using umin_seq rather than umin. This is because an earlier exit may exit on the first iteration, in which case later exit expressions will not be evaluated and could be poisonous. We cannot propagate potential poison values from later exits. In particular, this avoids the introduction of "branch on poison" UB when optimizing multi-exit loops. Differential Revision: https://reviews.llvm.org/D124910	2022-05-21 15:48:14 +02:00
Nikita Popov	e9a1c82d69	[SCEVExpander] Expand umin_seq using freeze %x umin_seq %y is currently expanded to %x == 0 ? 0 : umin(%x, %y). This patch changes the expansion to umin(%x, freeze %y) instead (https://alive2.llvm.org/ce/z/wujUhp). The motivation for this change is the test cases affected by D124910, where the freeze expansion ultimately produces better optimization results. This is largely because `(%x umin_seq %y) == %x` is a common expansion pattern, which reliably optimizes in freeze representation, but only sometimes with the zero comparison (in particular, if %x == 0 can fold to something else, we generally won't be able to recover reasonable code from this.) Differential Revision: https://reviews.llvm.org/D125372	2022-05-18 09:53:07 +02:00
Nikita Popov	b4a5340be5	[IndVarSimplify] Regenerate test checks (NFC)	2022-05-11 12:20:53 +02:00
Nikita Popov	3898759538	[IndVarSimplify] Regenerate test checks (NFC)	2022-05-10 14:51:09 +02:00
Florian Hahn	76b57f49a2	[IndVarSimplify] Replace legacy loop-unswitch with SimpleLoopUnswitch. Also moves the test to use the new pass manager.	2022-04-23 14:08:47 +01:00
Florian Hahn	85eb8b7244	[IndVars] Add test for crash exposed by D114650.	2022-04-22 10:44:29 +01:00
Nikita Popov	3c9f3f76f1	[ConstantFold] Fold zero-index GEPs with opaque pointers With opaque pointers, we can eliminate zero-index GEPs even if they have multiple indices, as this no longer impacts the result type of the GEP. This optimization is already done for instructions in InstSimplify, but we were missing the corresponding constant expression handling. The constexpr transform is a bit more powerful, because it can produce a vector splat constant and also handles undef values -- it is an extension of an existing single-index transform.	2022-04-04 13:04:27 +02:00
Nikita Popov	6ffb3ad631	[SCEV] Use constant ranges when determining reachable blocks (PR54434) This avoids false positive verification failures if the condition is not literally true/false, but SCEV still makes use of the fact that a loop is not reachable through more complex reasoning. Fixes https://github.com/llvm/llvm-project/issues/54434.	2022-03-18 12:04:35 +01:00
Eli Friedman	523c572c37	[IndVars] Add a new test affected by `62f86d4f`	2022-03-17 13:49:07 -07:00
Nikita Popov	d1e880acaa	[SCEV] Enable verification in LoopPM Currently, we hardly ever actually run SCEV verification, even in tests with -verify-scev. This is because the NewPM LPM does not verify SCEV. The reason for this is that SCEV verification can actually change the result of subsequent SCEV queries, which means that you see different transformations depending on whether verification is enabled or not. To allow verification in the LPM, this limits verification to BECounts that have actually been cached. It will not calculate new BECounts. BackedgeTakenInfo::getExact() is still not entirely readonly, it still calls getUMinFromMismatchedTypes(). But I hope that this is not problematic in the same way. (This could be avoided by performing the umin in the other SCEV instance, but this would require duplicating some of the code.) Differential Revision: https://reviews.llvm.org/D120551	2022-03-07 09:46:20 +01:00
Nikita Popov	aeab6167b0	[SCEV] Only verify BECounts for reachable loops (PR50523) For unreachable loops, any BECount is legal, and since D98706 SCEV can make use of this for loops that are unreachable due to constant branches. To avoid false positives, adjust SCEV verification to only check BECounts in reachable loops. Fixes https://github.com/llvm/llvm-project/issues/50523. Differential Revision: https://reviews.llvm.org/D120651	2022-03-01 11:52:35 +01:00
Max Kazantsev	9453cda088	[Test] Move test for PR53969 to LoopDeletion folder where it truly belongs	2022-02-21 18:39:31 +07:00
Max Kazantsev	18bfc57708	[Test] Add failing test for PR53969	2022-02-21 17:48:46 +07:00
Arthur Eubanks	129af4daa7	[SCEVExpander][OpaquePtr] Check GEP source type when finding identical GEP Fixes an opaque pointers miscompile. Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D120004	2022-02-17 08:48:11 -08:00
Arthur Eubanks	edde46b5d0	[test][IndVarSimplify][OpaquePtr] Precommit test	2022-02-17 08:48:07 -08:00
Nikita Popov	859567725d	[IndVars] Don't run full optimization pipeline in test (NFC) This extracts the IR prior to IndVarSimplify and only runs the single pass.	2022-02-17 09:28:33 +01:00
Max Kazantsev	c913dccfde	[SCEV] Use lshr in implications This patch adds support for implication inference logic for the following pattern: ``` lhs < (y >> z) <= y, y <= rhs --> lhs < rhs ``` We should be able to use the fact that value shifted to right is not greater than the original value (provided it is non-negative). Differential Revision: https://reviews.llvm.org/D116150 Reviewed-By: apilipenko	2022-01-25 13:25:19 +07:00
Roman Lebedev	76a0abbc13	[SCEV] Reenable umin_seq support and fix the `computeSCEVAtScope()` This reverts commit `f62f47f5e1`.	2022-01-11 16:03:35 +03:00
Philip Reames	f62f47f5e1	Partial revert of `82fb4f4` Two crashes have been reported. This change disables the new logic while leaving the new node in tree. Hopefully, that's enough to allow investigation without breakage while avoiding massive churn.	2022-01-10 18:18:34 -08:00
Roman Lebedev	82fb4f4b22	[SCEV] Sequential/in-order `UMin` expression As discussed in https://github.com/llvm/llvm-project/issues/53020 / https://reviews.llvm.org/D116692, SCEV is forbidden from reasoning about 'backedge taken count' if the branch condition is a poison-safe logical operation, which is conservatively correct, but is severely limiting. Instead, we should have a way to express those poison blocking properties in SCEV expressions. The proposed semantics is: ``` Sequential/in-order min/max SCEV expressions are non-commutative variants of commutative min/max SCEV expressions. If none of their operands are poison, then they are functionally equivalent, otherwise, if the operand that represents the saturation point* of given expression, comes before the first poison operand, then the whole expression is not poison, but is said saturation point. ``` * saturation point - the maximal/minimal possible integer value for the given type The lowering is straight-forward: ``` compare each operand to the saturation point, perform sequential in-order logical-or (poison-safe!) ordered reduction over those checks, and if reduction returned true then return saturation point else return the naive min/max reduction over the operands ``` https://alive2.llvm.org/ce/z/Q7jxvH (2 ops) https://alive2.llvm.org/ce/z/QCRrhk (3 ops) Note that we don't need to check the last operand: https://alive2.llvm.org/ce/z/abvHQS Note that this is not commutative: https://alive2.llvm.org/ce/z/FK9e97 That allows us to handle the patterns in question. Reviewed By: nikic, reames Differential Revision: https://reviews.llvm.org/D116766	2022-01-10 20:51:26 +03:00
Roman Lebedev	6a563e2570	[NFC][SCEV][IndVars] Add more tests for exit count w/ `select` See https://github.com/llvm/llvm-project/issues/53020	2022-01-07 01:30:21 +03:00
Nikita Popov	daf32b13d7	[IndVars] Support opaque pointers in LFTR Remove the assertion about the pointer element type, only check that the stride is one. Ultimately, the actual pointer type here doesn't matter, because SCEVExpander would insert appropriate casts if necessary.	2021-12-27 12:32:50 +01:00
Max Kazantsev	958e7a284d	[Test] Add test showing missing opportunity in IndVar's handling of lshr	2021-12-22 15:08:22 +07:00
Nikita Popov	37d72991c1	[SCEV] Track and invalidate ValuesAtScopes users ValuesAtScopes maps a SCEV and a Loop to another SCEV. While we invalidate entries if the left-hand SCEV is invalidated, we currently don't do this for the right-hand SCEV. Fix this by tracking users in a reverse map and using it for invalidation. This is conceptually the same change as D114738, but using the reverse map to avoid performance issues. Differential Revision: https://reviews.llvm.org/D114788	2021-11-30 18:21:14 +01:00
Philip Reames	8906a0fe64	[SCEVExpander] Drop poison generating flags when reusing instructions The basic problem we have is that we're trying to reuse an instruction which is mapped to some SCEV. Since we can have multiple such instructions (potentially with different flags), this is analogous to our need to drop flags when performing CSE. A trivial implementation would simply drop flags on any instruction we decided to reuse, and that would be correct. This patch is almost that trivial patch except that we preserve flags on the reused instruction when existing users would imply UB on overflow already. Adding new users can, at most, refine this program to one which doesn't execute UB which is valid. In practice, this fixes two conceptual problems with the previous code: 1) a binop could have been canonicalized into a form with different opcode or operands, or 2) the inbounds GEP case which was simply unhandled. On the test changes, most are pretty straightforward. We lose some flags (in some cases, they'd have been dropped on the next CSE pass anyways). The one that took me the longest to understand was the ashr-expansion test. What's happening there is that we're considering reuse of the mul, previously we disallowed it entirely, now we allow it with no flags. The surrounding diffs are all effects of generating the same mul with a different operand order, and then doing simple DCE. The loss of the inbounds is unfortunate, but even there, we can recover most of those once we actually treat branch-on-poison as immediate UB. Differential Revision: https://reviews.llvm.org/D112734	2021-11-29 15:23:34 -08:00
Zarko Todorovski	7f7dac7126	[NFC][llvm] Inclusive language: reword uses of sanity test and check Part of continuing work to use more inclusive language. Reworded uses of sanity check and sanity test in llvm/test/	2021-11-25 07:21:42 -05:00
Philip Reames	03d8bc184a	[indvars] Fix lftr crash when preheader is terminated by switch This was found by oss-fuzz. The switch will get canonicalized to a branch, but if it hasn't been when we run LFTR, we crashed on an unneeded assert.	2021-11-23 09:58:46 -08:00
Sander.DeSmalen@arm.com	305816ff1e	[IndVarSimplify] Reduce nondeterministic behaviour in visitIVCast. rGf39978b84f1d3a1da6c32db48f64c8daae64b3ad led to and/or exposed an issue with IndVarSimplification for a loop where a i32 phi node is no longer replaced by a widened (i64) phi node, because the SCEVs of a sign-extend no longer folded the same way. I'm unsure how to properly explain this because it's all rather complicated, but in short: SCEVs don't fold as nicely as they used to and this caused a difference. While investigating this, I found that IndVarSimplify can actually optimise the case in the way we want to if it chooses the widened IV to be 'signed' (the i32 IV is both sign and zero-extended). Oddly enough, there is some level of indeterminism in the way the algorithm works, it just picks the sign of the 'first' zext/sext user, where the order of the users-iterator is not guaranteed to be the same on each invocation of the pass (e.g. shown by first running loop-rotate, which puts the users in a different order). While I think the fix is valid in the sense that consistently picking _any_ order is better than having a nondeterministic order, I can use a bit of advice from people more familiar in this area of the code-base. For example, I'm not sure if this fix is hiding another issue where the IndVarSimplify pass could actually draw the same conclusions (i.e. that it only needs an i64 phi node) if it does a bit more work, regardless of whether it chooses the induction variable to be signed or unsigned. I'm also not sure if choosing signed is better than unsigned, or whether that just happens to be beneficial only in this individual case. Any feedback would be much appreciated! Reviewed By: reames Differential Revision: https://reviews.llvm.org/D112573	2021-11-16 12:41:04 +00:00
Dmitry Makogon	62f86d4f95	Reapply `5ec2386` "Reapply `db28934` "[IndVars] Pass TTI to replaceCongruentIVs"" This reverts commit `7cd273c339`. Several patches with tests fixes have been applied: `0cada82f0a` "[Test] Remove incorrect test in GVN" `97cb13615d` "[Test] Separate IndVars test into AArch64 and X86 parts" `985cc490f1` "[Test] Remove separated test in IndVars", and test failures caused by `5ec2386` should be resolved now.	2021-11-10 17:36:14 +07:00
Dmitry Makogon	985cc490f1	[Test] Remove separated test in IndVars This patch removes a test file, which was forgotten to be removed in `97cb13615d`. The deleted test is separated into 2 parts by that patch.	2021-11-10 16:19:11 +07:00
Dmitry Makogon	97cb13615d	[Test] Separate IndVars test into AArch64 and X86 parts The widen-loop-comp.ll in indvars has a target triple with specified aarch64 architecture. This caused test failures with `db28934` "[IndVars] Pass TTI to replaceCongruentIVs" applied, because with the patch indvars performed some target-specific transforms, and for example if a build supported only X86, then indvars would not have applied those transforms. However, the checks in the test were generated as for aarch64. Thus the test failures on such builds. This patch separates widen-loop-comp.ll into two parts. The first one is intended to be run only if a build supports aarch64. This is now in AArch64 directory with a lit config. The second one was added recently to show `db28934` improvements. This one is now in X86 directory. This patch should resolve build issues caused by `5ec2386332`.	2021-11-10 16:15:20 +07:00
Douglas Yung	7cd273c339	Revert "Reapply `db28934` "[IndVars] Pass TTI to replaceCongruentIVs"" This reverts commit `5ec2386332`. This change is causing test failures on the PS4 linux build bot: https://lab.llvm.org/buildbot/#/builders/139/builds/12871	2021-11-09 10:28:41 -08:00
Dmitry Makogon	5ec2386332	Reapply `db28934` "[IndVars] Pass TTI to replaceCongruentIVs" This reapplies patch `db289340c8`. The test failures on build with expensive checks caused by the patch happened due to the fact that we sorted loop Phis in replaceCongruentIVs using llvm::sort, which shuffles the given container if the expensive checks are enabled, so equivalent Phis in the sorted vector had different mutual order from run to run. replaceCongruentIVs tries to replace narrow Phis with truncations of wide ones. In some test cases there were several Phis with the same width, so if their order differs from run to run, the narrow Phis would be replaced with a different Phi, depending on the shuffling result. The patch `ae14fae0ff` fixed this issue by replacing llvm::sort with llvm::stable_sort.	2021-11-09 17:42:29 +07:00
Dmitry Makogon	8d4eba6c0d	Revert "[IndVars] Pass TTI to replaceCongruentIVs" This reverts commit `db289340c8`. The patch caused 2 crashes with expensive checks enabled.	2021-11-08 19:35:14 +07:00
Dmitry Makogon	db289340c8	[IndVars] Pass TTI to replaceCongruentIVs In IndVarSimplify after simplifying and extending loop IVs we call 'replaceCongruentIVs'. This function optionally takes a TTI argument to be able to replace narrow IVs uses with truncates of the widest one. For some reason the TTI wasn't passed to the function, so it couldn't perform such transform. This patch fixes it. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D113024	2021-11-08 19:20:53 +07:00
Philip Reames	d24a0e8857	[SCEV] Use constant range of RHS to prove NUW on narrow IV in trip count logic The basic idea here is that given a zero extended narrow IV, we can prove the inner IV to be NUW if we can prove there's a value the inner IV must take before overflow which must exit the loop. Differential Revision: https://reviews.llvm.org/D109457	2021-11-05 15:36:47 -07:00
Philip Reames	e69f6476a8	Autogen tests for ease of future update	2021-11-05 12:46:07 -07:00
Philip Reames	dec15d9a0a	[indvars] Use loop guards when canonicalizing exit conditions This extends the logic in canonicalizeExitConditions to use loop guards to specialize the SCEV of the loop invariant term before querying its range.	2021-11-04 15:23:34 -07:00
Philip Reames	c0d9bf2f6a	[indvars] Allow rotation (narrowing) of exit test when discovering trip count This relaxes the one-use requirement on the rotation transform specifically for the case where we know we're zexting an IV of the loop. This allows us to discover trip count information in SCEV, which seems worth a single extra loop invariant truncate. Honestly, I'd prefer if SCEV could just compute the trip count directly (e.g. D109457), but this unblocks practical benefit.	2021-11-04 14:49:24 -07:00
Philip Reames	453fdebd48	[indvars] Extend canonicalizeExitConditions to inverted operands As discussed in the original reviews, but done in a follow on.	2021-11-04 14:20:37 -07:00
Philip Reames	d4708fa480	Backout must-exit based parts of `3fc9882e`, and 412eb0 Not sure these are correct. I think I missed a case when porting this from the original SCEV change to the IndVar changes. I may end up reapplying this later with a comment about how this is correct, but in case the current bad feeling turns out to be true, I'm removing from tree while investigating further.	2021-11-03 15:19:49 -07:00
Philip Reames	c97bb5d19d	[tests] Precommit for generalization of D112262	2021-11-03 14:33:16 -07:00
Philip Reames	3fc9882e88	[indvars] Rotate zext through icmp to reduce loop varying computation This change looks for cases where we can prove that an exit test of a loop can be performed in a narrower bitwidth, and that by doing so we can replace a loop-varying extend with a loop-invariant truncate. The motivation here is that doing this unblocks the trip count analysis for narrow IVs involved in extended compare exit tests. It also has the nice side effect of simply making the code faster, even if we gain no other benefit from the improved analysis ability. I've noted a few places this could be extended, but I think this stands reasonable on its own as well. Differential Revision: https://reviews.llvm.org/D112262	2021-11-03 12:09:20 -07:00
Dmitry Makogon	dd000e67f0	[Test] Regenerate IndVars test's checks This just regenerates a certain IndVars test's checks.	2021-11-02 22:03:58 +07:00
Dmitry Makogon	94128f04d6	[Test] Add tests showing congruent IVs not removed by IndVars In the added cases we have two congruent IVs. IndVars widens at least one of them. If they are both widened, then one of them is erased as they stay congruent after widening. However if only one IV is widened, the other one stays in the loop. We can simply erase the narrow IV and replace its uses with truncates of the widest IV.	2021-11-02 21:46:56 +07:00
Philip Reames	6caff716da	Regen some autogen tests to account for format change	2021-10-28 09:22:20 -07:00
Philip Reames	9ed528e089	Autogen a test for ease of update	2021-10-28 09:07:15 -07:00

660 Commits
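
Several of the messages above (82fb4f4b22, c8b675eaa1, e9a1c82d69) rely on the difference between the commutative `umin` and the sequential `umin_seq`. The following is a rough Python model of that difference for the two-operand case, not LLVM code: `POISON`, `freeze`, and the helper names are hypothetical stand-ins invented for illustration, and the "arbitrary" value that `freeze` picks is passed in explicitly so the behaviour can be checked.

```python
class _Poison:
    """Hypothetical stand-in for an LLVM poison value (not an LLVM API)."""
    def __repr__(self):
        return "poison"

POISON = _Poison()

def umin(x, y):
    """Commutative unsigned min: poison in either operand poisons the result."""
    if x is POISON or y is POISON:
        return POISON
    return min(x, y)

def umin_seq(x, y):
    """Sequential umin: once x hits the saturation point (0), y is not inspected."""
    if x is POISON:
        return POISON
    if x == 0:
        return 0  # saturation point comes before any poison in y
    return umin(x, y)

def freeze(v, arbitrary=0):
    """Model of freeze: poison becomes some fixed but arbitrary value."""
    return arbitrary if v is POISON else v

def umin_seq_via_select(x, y):
    """Older expansion quoted in e9a1c82d69: x == 0 ? 0 : umin(x, y)."""
    if x is POISON:
        return POISON
    return 0 if x == 0 else umin(x, y)

def umin_seq_via_freeze(x, y, arbitrary=0):
    """Newer expansion quoted in e9a1c82d69: umin(x, freeze y)."""
    return umin(x, freeze(y, arbitrary))

if __name__ == "__main__":
    # An earlier exit count of 0 hides a poisonous later exit count.
    print(umin(0, POISON), umin_seq(0, POISON))
    # Operand order matters for the sequential form.
    print(umin_seq(POISON, 0))
```

In this model the freeze-based expansion returns 0 whenever the first operand is 0, whatever value `freeze` happens to choose, which is the case the multi-exit backedge-taken-count change needs. When the first operand is non-zero and the second is poison, `umin_seq` is itself poison, so any concrete result from the freeze form counts as a refinement.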