llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	ba8ee4a43e	[SCEV] Verify all IR -> SCEV mappings This extends SCEV verification to check not only backedge-taken counts, but all entries in the IR -> SCEV cache. The restrictions are the same as for the BECount case, i.e. we ignore expressions based on undef, we only diagnose constant deltas (there are way too many false positives otherwise) and we limit to reachable code. Differential Revision: https://reviews.llvm.org/D121104	2022-03-09 09:33:22 +01:00
Nikita Popov	81b43b23e4	[SCEV] Enable verification under EXPENSIVE_CHECKS SCEV verification should no longer affect results of subsequent queries, and our lit tests as well as llvm-test-suite pass with SCEV verification enabled, so I think we can enable it by default under EXPENSIVE_CHECKS now. Differential Revision: https://reviews.llvm.org/D120708	2022-03-07 09:53:00 +01:00
Nikita Popov	d1e880acaa	[SCEV] Enable verification in LoopPM Currently, we hardly ever actually run SCEV verification, even in tests with -verify-scev. This is because the NewPM LPM does not verify SCEV. The reason for this is that SCEV verification can actually change the result of subsequent SCEV queries, which means that you see different transformations depending on whether verification is enabled or not. To allow verification in the LPM, this limits verification to BECounts that have actually been cached. It will not calculate new BECounts. BackedgeTakenInfo::getExact() is still not entirely readonly, it still calls getUMinFromMismatchedTypes(). But I hope that this is not problematic in the same way. (This could be avoided by performing the umin in the other SCEV instance, but this would require duplicating some of the code.) Differential Revision: https://reviews.llvm.org/D120551	2022-03-07 09:46:20 +01:00
Nikita Popov	8133778d3c	[SCEV] Fully invalidate SCEVUnknown on RAUW When a SCEVUnknown gets RAUWd, we currently drop it from the folding set, but don't forget memoized values. I believe we should be treating RAUW the same way as deletion here and invalidate all caches and dependent expressions. I don't have any specific cases where this causes issues right now, but it does address the FIXME in https://reviews.llvm.org/D119488. Differential Revision: https://reviews.llvm.org/D120033	2022-03-07 09:28:28 +01:00
Arthur Eubanks	f909aed671	Revert "[SCEV] Infer ranges for SCC consisting of cycled Phis" This reverts commit `fc539b0004`. Causes miscompiles, see D110620.	2022-03-04 19:52:44 -08:00
serge-sans-paille	71c3a5519d	Cleanup includes: LLVMAnalysis Number of lines output by preprocessor: before: 1065940348 after: 1065307662 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120659	2022-03-01 18:01:54 +01:00
Nikita Popov	aeab6167b0	[SCEV] Only verify BECounts for reachable loops (PR50523) For unreachable loops, any BECount is legal, and since D98706 SCEV can make use of this for loops that are unreachable due to constant branches. To avoid false positives, adjust SCEV verification to only check BECounts in reachable loops. Fixes https://github.com/llvm/llvm-project/issues/50523. Differential Revision: https://reviews.llvm.org/D120651	2022-03-01 11:52:35 +01:00
Nikita Popov	2d0fc3e46f	[SCEV] Return ArrayRef from getSCEVValues() (NFC) Return a read-only view on this set. For the one internal use, directly access ExprValueMap.	2022-02-25 09:32:22 +01:00
Nikita Popov	d9715a7266	[SCEV] Don't try to reuse expressions with offset SCEVs ExprValueMap currently tracks not only which IR Values correspond to a given SCEV expression, but additionally stores that it may be expanded in the form X+Offset. In theory, this allows reusing existing IR Values in more cases. In practice, this doesn't seem to be particularly useful (the test changes are rather underwhelming) and adds a good bit of complexity. Per https://github.com/llvm/llvm-project/issues/53905, we have an invalidation issue with these offseted expressions. Differential Revision: https://reviews.llvm.org/D120311	2022-02-25 09:16:48 +01:00
Mircea Trofin	7e3606f43c	[ScalarEvolution] Control flag for nonstrict inequalities in finite loops D118090 causes a pretty significant (19%) regression in some Eigen benchmarks. Investigating is a bit time consuming as the compilation unit where this occurs is large. Rather than revert, this patch adds a flag controlling that behavior (enabled by default).	2022-02-23 17:56:35 -08:00
Max Kazantsev	ad3b1fe472	[SCEV] Do not erase LoopUsers. PR53969 This patch fixes a logical error in how we work with `LoopUsers` map. It maps a loop onto a set of AddRecs that depend on it. The Addrecs are added to this map only once when they are created and put to the UniqueSCEVs` map. The only purpose of this map is to make sure that, whenever we forget a loop, all (directly or indirectly) dependent SCEVs get forgotten too. Current code erases SCEVs from dependent set of a given loop whenever we forget this loop. This is not a correct behavior due to the following scenario: 1. We have a loop `L` and an AddRec `AR` that depends on it; 2. We modify something in the loop, but don't destroy it. We still call forgetLoop on it; 3. `AR` is no longer dependent on `L` according to `LoopUsers`. It is erased from ValueExprMap` and `ExprValue map, but still exists in UniqueSCEVs; 4. We can later request the very same AddRec for the very same loop again, and get existing SCEV `AR`. 5. Now, `AR` exists and is used again, but its notion that it depends on `L` is lost; 6. Then we decide to delete `L`. `AR` will not be forgotten because we have lost it; 7. Just you wait when you run into a dangling pointer problem, or any other kind of problem because an active SCEV is now referecing a non-existent loop. The solution to this is to stop erasing values from `LoopUsers`. Yes, we will maybe forget something that is already not used, but it's cheap. This fixes a functional bug and potentially may have negative compile time impact on methods with huge or numerous loops. Differential Revision: https://reviews.llvm.org/D120303 Reviewed By: nikic	2022-02-22 17:24:39 +07:00
Max Kazantsev	40d06c4ce9	[SCEV][NFC] Replace contains+insert check with insert.second	2022-02-21 20:11:13 +07:00
Max Kazantsev	fc539b0004	[SCEV] Infer ranges for SCC consisting of cycled Phis Our current strategy of computing ranges of SCEVUnknown Phis was to simply compute the union of ranges of all its inputs. In order to avoid infinite recursion, we mark Phis as pending and conservatively return full set for them. As result, even simplest patterns of cycled phis always have a range of full set. This patch makes this logic a bit smarter. We basically do the same, but instead of taking inputs of single Phi we find its strongly connected component (SCC) and compute the union of all inputs that come into this SCC from outside. Processing entire SCC together has one more advantage: we can set range for all of them at once, because the only thing that happens to them is the same value is being passed between those Phis. So, despite we spend more time analyzing a single Phi, overall we may save time by not processing other SCC members, so amortized compile time spent should be approximately the same. Differential Revision: https://reviews.llvm.org/D110620 Reviewed By: reames	2022-02-17 18:03:52 +07:00
Roman Lebedev	ae48af582b	[NFC][SCEV] Recognize umin_seq when operand is zext'ed in zero-check zext(umin(x,y)) == umin(zext(x),zext(y)) zext(x) == 0 -> x == 0 While it is not a very likely scenario, we probably should not expect that instcombine already dropped such a redundant zext, but handle directly. Moreover, perhaps there was no ZExtInst, and SCEV somehow managed to pull out said zext out of the SCEV expression.	2022-02-16 22:16:02 +03:00
Roman Lebedev	3c7d48ed90	[NFC][SCEV] Recognize umin_seq when operand is zext'ed in umin but not in zero-check zext(umin(x,y)) == umin(zext(x),zext(y)) zext(x) == 0 -> x == 0 Extra leading zeros do not affect the result of comparison with zero, nor do they matter for the unsigned min/max, so we should not be dissuaded when we find a zero-extensions, but instead we should just skip it.	2022-02-16 22:16:02 +03:00
Philip Reames	c02deae18c	[SCEVPredicate] Remove getExpr mechanism [NFC] This mechanism was used for a couple of purposes, but the primary one was keeping track of which predicates in a union might apply to an expression. As these sets are small and agressively deduped, this has little value.	2022-02-11 11:35:58 -08:00
Roman Lebedev	97484f46eb	[NFCI][SCEV] `SCEVTraversal`: if search terminated, don't push further ops of nary Even if the search is marked as terminated after only looking at the first operand, we'd still look at the remaining operands before actually ending the search. This seems pointless and wasteful, let's not do that.	2022-02-11 21:58:19 +03:00
Roman Lebedev	65715ac72a	[SCEV] Generalize umin_seq matching Since we don't greedily flatten `umin_seq(a, umin(b, c))` into `umin_seq(a, b, c)`, just looking at the operands of the outer-level `umin` is not sufficient, and we need to recurse into all same-typed `umin`'s.	2022-02-11 21:58:19 +03:00
Roman Lebedev	c234809ff8	[SCEV] Recognize `x == 0 ? 0 : umin_seq(..., x, ...) -> umin_seq(x, umin_seq(...))`	2022-02-11 21:58:19 +03:00
Roman Lebedev	281421693b	[SCEV] Recognize `x == 0 ? 0 : umin(..., x, ...) -> umin_seq(x, umin(...))` That is the canonical expansion for umin_seq, so we really should roundtrip it.	2022-02-11 21:58:19 +03:00
Roman Lebedev	4d0c0e6cc2	[SCEV] `createNodeForSelectOrPHIInstWithICmpInstCond()`: generalize eq handling The current logic was: https://alive2.llvm.org/ce/z/j8muXk but in reality the offset to the Y in the 'true' hand does not need to exist: https://alive2.llvm.org/ce/z/MNQ7DZ https://alive2.llvm.org/ce/z/S2pMQD To catch that, instead of computing the Y's in both hands and checking their equality, compute Y and C, and check that C is 0 or 1.	2022-02-11 21:58:19 +03:00
Roman Lebedev	a473c457f6	[NFC][SCEV] `createNodeForSelectOrPHIInstWithICmpInstCond()`: dedup eq/ne pred handling	2022-02-11 21:58:19 +03:00
Philip Reames	3e27fb8590	[PSE] Allow duplicate predicates in debug output This lets us avoid redundant implication work in the constructor of SCEVUnionPredicate which simplifies an upcoming change. If we're actually building a predicate via PSE, that goes through addPredicate which does include the implication check.	2022-02-11 10:39:01 -08:00
Philip Reames	5ba115031d	[PSE] Remove assumption that top level predicate is union from public interface [NFC] Note that this doesn't actually cause the top level predicate to become a non-union just yet. The above comes from a case in the LoopVectorizer where a predicate which is later proven no longer blocks vectorization due to a change from checking if predicates exists to whether the predicate is possibly false.	2022-02-10 16:14:52 -08:00
Philip Reames	01b56b8bdd	[SCEVPredicateRewriter] Remove assumption top level predicate is a union [NFC]	2022-02-10 15:51:15 -08:00
Roman Lebedev	c94ec7997a	[NFC][SCEV] `createNodeForSelectOrPHIViaUMinSeq()`: use sub instead of add For booleans, xor/add/sub are interchangeable: https://alive2.llvm.org/ce/z/ziav3d But for larger bitwidths, we'll need sub, so change it now.	2022-02-11 01:21:45 +03:00
Philip Reames	e43b1ce4d5	[SCEV] Constify some uses of SCEVUnionPredicate* [NFC] This exploits the immutability introduced in `d334fec`.	2022-02-10 12:42:19 -08:00
Roman Lebedev	580d3a14b2	[NFC][SCEV] `createNodeForSelectOrPHIViaUMinSeq()`: refactor `i1 cond ? i1 x : i1 y` handling While that effectively concludes i1 select handling, that boolean restriction can be lifted later.	2022-02-10 17:42:56 +03:00
Roman Lebedev	9a322e430f	[NFC][SCEV] `createNodeForSelectOrPHIViaUMinSeq()`: refactor `i1 cond ? i1 C : i1 y` pattern https://alive2.llvm.org/ce/z/uRvVtN	2022-02-10 17:42:56 +03:00
Roman Lebedev	576a45f20d	[NFC][SCEV] `createNodeForSelectOrPHIViaUMinSeq()`: refactor `i1 cond ? i1 x : i1 C` pattern https://alive2.llvm.org/ce/z/2Q7Du_	2022-02-10 17:42:55 +03:00
Roman Lebedev	9766a0cca0	[SCEV] Recognize `cond ? i1 0 : i1 y` as `umin_seq ~cond, x` By definition, `umin_seq` has the exact same poison stopping properties the original `select` had: https://alive2.llvm.org/ce/z/N6XwV-	2022-02-10 17:42:55 +03:00
Roman Lebedev	418604fd90	[SCEV] Recognize `cond ? i1 x : i1 1` as `~umin_seq cond, ~x` By definition, `umin_seq` has the exact same poison stopping properties the original `select` had: https://alive2.llvm.org/ce/z/aqe9GK	2022-02-10 17:42:55 +03:00
Roman Lebedev	49d9acc242	[SCEV] Recognize logical `or` as `not umin_seq (not, not)` By definition, `umin_seq` has the exact same poison stopping properties the original `select` had: https://alive2.llvm.org/ce/z/MUfbTL	2022-02-10 17:42:55 +03:00
Roman Lebedev	16bc24e7be	[SCEV] Recognize logical `and` as `umin_seq` By definition, `umin_seq` has the exact same poison stopping properties the original `select` had: https://alive2.llvm.org/ce/z/59KuZZ	2022-02-10 17:42:55 +03:00
Roman Lebedev	1c69444863	[SCEV] `createNodeForSelectOrPHI()`: try constant-folding even if not an Instruction We'd catch the tautological select pattern later anyways due to constant folding, so that leaves PHI-like select, but it does not appear to fire there.	2022-02-10 17:42:55 +03:00
Roman Lebedev	97930f85af	[NFC][SCEV] Prepare `createNodeForSelectOrPHI()` for gaining additional strategy Currently `createNodeForSelectOrPHI()` takes an Instruction, and only works on the Cond that is an ICmpInst, but that can be relaxed somewhat. For now, simply rename the existing function, and add a thin wrapper ontop that still does the same thing as it used to.	2022-02-10 17:42:55 +03:00
Roman Lebedev	73990ff8a7	[SCEV] Recognize binary `xor` as bit-wise `add` https://alive2.llvm.org/ce/z/ULuZxB We could transparently handle wider bitwidths, by effectively casting iN to <N x i1> and performing the `add` bit/element -wise, the expression will be rather large, so let's not do that for now.	2022-02-10 17:42:55 +03:00
Roman Lebedev	503541fa93	[SCEV] Recognize binary `and` as bit-wise `umin` https://alive2.llvm.org/ce/z/aKAr94 We could transparently handle wider bitwidths, by effectively casting iN to <N x i1> and performing the `umin` bit/element -wise, the expression will be rather large, so let's not do that for now.	2022-02-10 17:42:54 +03:00
Roman Lebedev	e7e0834f07	[SCEV] Recognize binary `or` as bit-wise `umax` https://alive2.llvm.org/ce/z/SMEaoc We could transparently handle wider bitwidths, by effectively casting iN to <N x i1> and performing the `umax` bit/element -wise, the expression will be rather large, so let's not do that for now.	2022-02-10 17:42:54 +03:00
Philip Reames	d334fec140	[SCEV] Make SCEVUnionPredicate externally immutable [NFC] This is the last major stepping stone before being able to allocate the node via the folding set allocator. That will in turn allow more general SCEV predicate expression trees.	2022-02-09 13:47:28 -08:00
Philip Reames	e6d9bab558	[SCEV] Remove a direct call to SCEVUnionPredicate::add [NFC]	2022-02-09 13:04:12 -08:00
Philip Reames	d39f4ac494	[SCEV] Unwind SCEVUnionPredicate from getPredicatedBackedgeTakenCount [NFC] For those curious, the whole reason for tracking the predicate set seperately as opposed to just immediately registering the dependencies appears to be allowing the printing code to print a result without changing the PSE state. It's slightly questionable if this justifies the complexity, but since we can preserve it with local ugliness, I did so.	2022-02-09 12:55:40 -08:00
Philip Reames	aa845d7a24	[SCEV] Remove conversion to SCEVUnionPredicate in ExitNotTakenInfo [NFC] This removes one of the places where we mutate an existing union predicate.	2022-02-09 12:10:23 -08:00
Philip Reames	83f895d952	[SCEV] Add interface for constructing generic SCEVComparePredicate [NFC}	2022-02-09 10:29:04 -08:00
Philip Reames	c302f1e677	[SCEV] Generalize SCEVEqualsPredicate to any compare [NFC] PredicatedScalarEvolution has a predicate type for representing A == B. This change generalizes it into something which can represent a A <pred> B. This generality is currently unused, but is motivated by a couple of recent cases which have come up. In particular, I'm currently playing around with using this to simplify the runtime checking code in LoopVectorizer. Regardless of the outcome of that prototyping, generalizing the compare node seemed useful.	2022-02-08 08:18:09 -08:00
Eli Friedman	b2837bf2f2	[ScalarEvolution] Add bailout to avoid zext of pointer. The RHS of an isImpliedCond call can be a pointer even if the LHS is not. This is similar to `bfa2a81e`. Not going to include a testcase; an IR testcase would be extremely complicated and fragile. Fixes https://github.com/llvm/llvm-project/issues/51936 . Differential Revision: https://reviews.llvm.org/D114555	2022-01-31 11:41:39 -08:00
Kazu Hirata	cda7b6aaf3	[Analysis] Drop an unnecessary const from a return type (NFC) Identified with readability-const-return-type.	2022-01-30 16:04:58 -08:00
William S. Moses	99d2582164	[ScalarEvolution] Handle <= and >= in non infinite loops Extend scalar evolution to handle >= and <= if a loop is known to be finite and the induction variable guards the condition. Specifically, with these assumptions lhs <= rhs is equivalent to lhs < rhs + 1 and lhs >= rhs to lhs > rhs -1. In the case of lhs <= rhs, this is true since the only case these are not equivalent is when rhs == unsigned/signed intmax, which would have resulted in an infinite loop. In the case of lhs >= rhs, this is true since the only case these are not equivalent is when rhs == unsigned/signed intmin, which would again have resulted in an infinite loop. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D118090	2022-01-28 17:41:08 -05:00
William S. Moses	0d04c77856	[ScalarEvolution] Mark a loop as finite if in a willreturn function A limited version of (https://reviews.llvm.org/D118090) that only marks a loop as finite if in a willreturn function. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D118429	2022-01-28 14:17:05 -05:00
Nikita Popov	3e2ae92d3f	[SCEV] Remove an unnecessary GEP type check The code already checked that the addrec step size and type alloc size are the same. The actual pointer element type is irrelevant here.	2022-01-25 12:56:46 +01:00

1 2 3 4 5 ...

1759 Commits