llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	a5ee62a141	[IndVars] Call replaceLoopPHINodesWithPreheaderValues() for already constant exits Currently we only call replaceLoopPHINodesWithPreheaderValues() if optimizeLoopExits() replaces the exit with an unconditional exit. However, it is very common that this already happens as part of eliminateIVComparison(), in which case we're leaving behind the dead header phi. Tweak the early bailout for already-constant exits to also call replaceLoopPHINodesWithPreheaderValues(). Differential Revision: https://reviews.llvm.org/D129214	2022-07-13 09:43:21 +02:00
ChenYang Li	6d036b83d1	[JumpThreading] Avoid threadThroughTwoBasicBlocks when PredPred BB ends with indirectbranch Since we can't change the destination of indirectbr, so when encounter indirectbr as PredPredBB terminator, we should pass it. Differential Revision: https://reviews.llvm.org/D129193	2022-07-08 09:29:17 +02:00
Nikita Popov	34a5c2bcf2	[BasicBlockUtils] Allow critical edge splitting with callbr terminators After D129205, we support SplitBlockPredecessors() for predecessors with callbr terminators. This means that it is now also safe to invoke critical edge splitting for an edge coming from a callbr terminator. Remove checks in various passes that were protecting against that. Differential Revision: https://reviews.llvm.org/D129256	2022-07-08 09:20:44 +02:00
Nikita Popov	40a4078e14	[BasicBlockUtils] Allow splitting predecessors with callbr terminators SplitBlockPredecessors currently asserts if one of the predecessor terminators is a callbr. This limitation was originally necessary, because just like with indirectbr, it was not possible to replace successors of a callbr. However, this is no longer the case since D67252. As the requirement nowadays is that callbr must reference all blockaddrs directly in the call arguments, and these get automatically updated when setSuccessor() is called, we no longer need this limitation. The only thing we need to do here is use replaceSuccessorWith() instead of replaceUsesOfWith(), because only the former does the necessary blockaddr updating magic. I believe there's other similar limitations that can be removed, e.g. related to critical edge splitting. Differential Revision: https://reviews.llvm.org/D129205	2022-07-07 09:13:25 +02:00
Vir Narula	89a99ec900	[GVN] Bug fix to reportMayClobberedLoad remark Bug fix to avoid assert crashing when generating remarks for GVN crashing. Intention of assert is correct but ignores edge case of instructions being equivalent. Reduced input that causes crash when remarks are turned on: ``` target datalayout = "e-m:o-i64:64-i128:128-n32:64-S128" target triple = "arm64-apple-macosx12.0.0" define ptr @ReplaceWithTidy(ptr %zz_hold) { cond.end480.us: %0 = load ptr, ptr null, align 8 store ptr %0, ptr %0, align 8 store ptr null, ptr %zz_hold, align 8 %1 = load ptr, ptr %0, align 8 store ptr %1, ptr null, align 8 ret ptr null } ``` Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D129235	2022-07-06 17:42:05 -07:00
Zaara Syeda	dbf6ab5ef9	[LSR] Fix bug for optimizing unused IVs to final values This is a fix for a crash reported for https://reviews.llvm.org/D118808 The fix is to only consider PHINodes which are induction phis. Fixes #55529 Differential Revision: https://reviews.llvm.org/D125990	2022-07-05 12:30:58 -04:00
Nikita Popov	93cbdaef04	[Reassociate] Avoid ConstantExpr::get() Use ConstantFoldBinaryOpOperands() instead, to handle the case where not all binary ops have a constant expression variant. This is a bit awkward because we only want to pop the element from Ops once we're sure that it has folded.	2022-07-04 15:17:22 +02:00
Nuno Lopes	53dc0f1078	[NFC] Switch a few uses of undef to poison as placeholders for unreachble code	2022-07-03 14:34:03 +01:00
Nuno Lopes	022bd92c78	[LowerMatrixMultiplication] Switch dummy values from undef to poison [NFC]	2022-07-03 12:32:19 +01:00
Nuno Lopes	7c4f45f87a	Revert [LowerMatrixMultiplication] Switch dummy values from undef to poison [NFC] This reverts commits `47e6f98f84` and `3e701bcd2a`	2022-07-01 23:53:41 +01:00
Nuno Lopes	47e6f98f84	[LowerMatrixMultiplication] Switch dummy values from undef to poison [NFC]	2022-07-01 23:31:31 +01:00
Nuno Lopes	373571dbb4	[NFC] Switch a few uses of undef to poison as placeholders for unreachble code	2022-06-30 23:01:43 +01:00
Nuno Lopes	0586d1cac2	[NFC] Switch a few uses of undef to poison as placeholders for unreachble code	2022-06-30 21:47:31 +01:00
Nikita Popov	014c4bdb9d	[VNCoercion] Use ConstantFoldLoadFromConst API (NFCI) Nowdays we have a generic constant folding API to load a type from an offset. It should be able to do anything that VNCoercion can do. This avoids the weird templating between IRBuilder and ConstantFolder in one function, which is will stop working as the IRBuilderFolder moves from CreateXYZ to FoldXYZ APIs. Unfortunately, this doesn't eliminate this pattern from VNCoercion entirely yet.	2022-06-30 14:52:27 +02:00
Nikita Popov	10c531cd5b	[SCCP] Simplify CFG in SCCP as well Currently, we only remove dead blocks and non-feasible edges in IPSCCP, but not in SCCP. I'm not aware of any strong reason for that difference, so this patch updates SCCP to perform the CFG cleanup as well. Compile-time impact seems to be pretty minimal, in the 0.05% geomean range on CTMark. For the test case from https://reviews.llvm.org/D126962#3611579 the result after -sccp now looks like this: define void @test(i1 %c) { entry: br i1 %c, label %unreachable, label %next next: unreachable unreachable: call void @bar() unreachable } -jump-threading does nothing on this, but -simplifycfg will produce the optimal result. Differential Revision: https://reviews.llvm.org/D128796	2022-06-30 09:25:03 +02:00
Nikita Popov	2124b2f0e6	[JumpThreading] Avoid ConstantExpr::get() (NFCI) This code requires the result to be an UndefValue/ConstantInt anyway (checked by getKnownConstant), so we are only interested in the case where this folds.	2022-06-29 16:43:05 +02:00
Nikita Popov	0af53fcb99	[SROA] Don't create constant expressions (NFC) Use IRBuilder instead, which will fold these. Just to clarify that this does not actually create any udiv expression.	2022-06-29 11:51:22 +02:00
Guillaume Chatelet	3c126d5fe4	[Alignment] Replace commonAlignment with std::min `commonAlignment` is a shortcut to pick the smallest of two `Align` objects. As-is it doesn't bring much value compared to `std::min`. Differential Revision: https://reviews.llvm.org/D128345	2022-06-28 07:15:02 +00:00
Congzhe Cao	b941857b40	[LoopInterchange] New cost model for loop interchange This is another attempt to land this patch. The patch proposed to use a new cost model for loop interchange, which is obtained from loop cache analysis. Given a loopnest, what loop cache analysis returns is a vector of loops [loop0, loop1, loop2, ...] where loop0 should be replaced as the outermost loop, loop1 should be placed one more level inside, and loop2 one more level inside, etc. What loop cache analysis does is not only more comprehensive than the current cost model, it is also a "one-shot" query which means that we only need to query it once during the entire loop interchange pass, which is better than the current cost model where we query it every time we check whether it is profitable to interchange two loops. Thus complexity is reduced, especially after D120386 where we do more interchanges to get the globally optimal loop access pattern. Updates made to test cases are mostly minor changes and some corrections. One change that applies to all tests is that we added an option `-cache-line-size=64` to the RUN lines. This is ensure that loop cache analysis receives a valid number of cache line size for correct analysis. Test coverage for loop interchange is not reduced. Currently we did not completely remove the legacy cost model, but keep it as fall-back in case the new cost model did not run successfully. This is because currently we have some limitations in delinearization, which sometimes makes loop cache analysis bail out. The longer term goal is to enhance delinearization and eventually remove the legacy cost model compeletely. Reviewed By: bmahjour, #loopoptwg Differential Revision: https://reviews.llvm.org/D124926	2022-06-28 00:08:37 -04:00
Kazu Hirata	d08f34b592	[llvm] Don't use Optional::hasValue (NFC) This patch replaces Optional::hasValue with the implicit cast to bool in conditionals only.	2022-06-26 18:31:51 -07:00
Kazu Hirata	a81b64a1fb	[llvm] Use Optional::has_value instead of Optional::hasValue (NFC) This patch replaces x.hasValue() with x.has_value() where x is not contextually convertible to bool.	2022-06-26 16:10:42 -07:00
Nuno Lopes	6ef9a2ad01	[LICM] Use poison to replace unreachable values instead of undef [NFC]	2022-06-26 14:56:35 +01:00
Nuno Lopes	3fa2411dc5	[LoopSimplifyCFG] use poison when replacing dead instructions instead of undef [NFC]	2022-06-26 14:15:55 +01:00
Kazu Hirata	a7938c74f1	[llvm] Don't use Optional::hasValue (NFC) This patch replaces Optional::hasValue with the implicit cast to bool in conditionals only.	2022-06-25 21:42:52 -07:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Nikita Popov	871197d0a3	[MemoryBuiltins] Accept any value in getInitialValueOfAllocation() (NFC) Drop the requirement that getInitialValueOfAllocation() must be passed an allocator function, shifting the responsibility for checking that into the function (which it does anyway). The motivation is to avoid some calls to isAllocationFn(), which has somewhat ill-defined semantics (given the number of allocator-related attributes we have floating around...) (For this function, all we eventually need is an allockind of zeroed or uninitialized.) Differential Revision: https://reviews.llvm.org/D127274	2022-06-24 16:08:07 +02:00
Florian Hahn	92f87787b3	Recommit "[ConstraintElimination] Transfer info from ULT to signed system." This reverts commit `94ed2caf70`. The issue with no-determinism with the test has been fixed in `d9526e8a52`.	2022-06-24 09:27:14 +02:00
Evgenii Stepanov	878309cc54	Revert "[LoopInterchange] New cost model for loop interchange" llvm/lib/Analysis/LoopCacheAnalysis.cpp:702:30: runtime error: signed integer overflow: 6148914691236517209 * 100 cannot be represented in type 'long' https://lab.llvm.org/buildbot/#/builders/5/builds/25185 This reverts commit `1b24fe34b0`.	2022-06-23 16:10:53 -07:00
Congzhe Cao	1b24fe34b0	[LoopInterchange] New cost model for loop interchange This is the second attempt to land this patch. The patch proposed to use a new cost model for loop interchange, which is obtained from loop cache analysis. Given a loopnest, what loop cache analysis returns is a vector of loops [loop0, loop1, loop2, ...] where loop0 should be replaced as the outermost loop, loop1 should be placed one more level inside, and loop2 one more level inside, etc. What loop cache analysis does is not only more comprehensive than the current cost model, it is also a "one-shot" query which means that we only need to query it once during the entire loop interchange pass, which is better than the current cost model where we query it every time we check whether it is profitable to interchange two loops. Thus complexity is reduced, especially after D120386 where we do more interchanges to get the globally optimal loop access pattern. Updates made to test cases are mostly minor changes and some corrections. One change that applies to all tests is that we added an option `-cache-line-size=64` to the RUN lines. This is ensure that loop cache analysis receives a valid number of cache line size for correct analysis. Test coverage for loop interchange is not reduced. Currently we did not completely remove the legacy cost model, but keep it as fall-back in case the new cost model did not run successfully. This is because currently we have some limitations in delinearization, which sometimes makes loop cache analysis bail out. The longer term goal is to enhance delinearization and eventually remove the legacy cost model compeletely. Reviewed By: bmahjour, #loopoptwg Differential Revision: https://reviews.llvm.org/D124926	2022-06-23 16:34:57 -04:00
Florian Hahn	d9526e8a52	[ConstraintElimination] Use stable_sort to sort worklist. If there are multiple constraints in the same block, at the moment the order they are processed may be different depending on the sort implementation. Use stable_sort to ensure consistent ordering.	2022-06-23 19:22:15 +02:00
Florian Hahn	94ed2caf70	Revert "[ConstraintElimination] Transfer info from ULT to signed system." This reverts commit `316e106f49`. This breaks a bot with expensive checks.	2022-06-23 17:27:33 +02:00
Florian Hahn	316e106f49	[ConstraintElimination] Transfer info from ULT to signed system. If A u< B holds, then A s>= 0 && A s< B holds if B s>= 0. https://alive2.llvm.org/ce/z/RrNxHh	2022-06-23 17:17:01 +02:00
Florian Hahn	9a33f3975e	[ConstraintElimination] Transfer info from SLT to unsigned system. If A s< B holds, then A u< also holds, if A s>= 0. https://alive2.llvm.org/ce/z/J4JZuN	2022-06-23 15:57:59 +02:00
Florian Hahn	24a98881cd	[ConstraintElimination] Transfer info from SGT to unsigned system. If A >s B then A >=u 0, if B >=s -1. https://alive2.llvm.org/ce/z/cncGKi	2022-06-23 11:04:51 +02:00
Serguei Katkov	5e1ccdf960	[RS4GC] Handle freeze case for vector Finding BDV for vector value does not handle freeze instruction. Adding its handling as it is done for scalar case. Reviewed By: apilipenko Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D128254	2022-06-23 11:58:41 +07:00
Adrian Tong	4e555a3df4	Fix a misspell. NFC	2022-06-22 21:23:21 +00:00
Brendon Cahoon	f1b05a0a2b	[StructurizeCFG] Improve basic block ordering StructurizeCFG linearizes the successors of branching basic block by adding Flow blocks to record the true/false path for branches and back edges. This patch reduces the number of Phi values needed to capture the control flow path by improving the basic block ordering. Previously, StructurizeCFG adds loop exit blocks outside of the loop. StructurizeCFG sets a boolean value to indicate the path taken, and all exit block live values extend to after the loop. For loops with a large number of exits blocks, this creates a huge number of values that are maintained, which increases compilation time and register pressure. This is problem especially with ASAN, which adds early exits to blocks with unreachable instructions for each instrumented check in the loop. In specific cases, this patch reduces the number of values needed after the loop by moving the exit block into the loop. This is done for blocks that have a single predecessor and single successor by moving the block to appear just after the predecessor. Differential Revision: https://reviews.llvm.org/D123231	2022-06-22 16:10:41 -05:00
Mingming Liu	67dc8021a1	[Support] Change TrackingStatistic and NoopStatistic to use uint64_t instead of unsigned. Binary size of `clang` is trivial; namely, numerical value doesn't change when measured in MiB, and `.data` section increases from 139Ki to 173 Ki. Differential Revision: https://reviews.llvm.org/D128070	2022-06-22 10:11:40 -07:00
Max Kazantsev	cff4f04e2e	[LSR] Don't allow zero quotient as scale ref. PR56160 Scale reg should never be zero, so when the quotient is zero, we cannot assign it there. Limit this transform to avoid this situation. Differential Revision: https://reviews.llvm.org/D128339 Reviewed By: eopXD	2022-06-22 23:33:57 +07:00
Guillaume Chatelet	57ffff6db0	Revert "[NFC] Remove dead code" This reverts commit `8ba2cbff70`.	2022-06-22 14:55:47 +00:00
Guillaume Chatelet	8ba2cbff70	[NFC] Remove dead code	2022-06-22 13:33:58 +00:00
Florian Hahn	098b0b18a7	[ConstraintElimination] Transfer info from SGE to unsigned system. This patch adds a new transferToOtherSystem helper that tries to transfer information from signed predicates to the unsigned system and vice versa. The initial version adds A >=u B for A >=s B && B >=s 0 https://alive2.llvm.org/ce/z/8b6F9i	2022-06-22 15:27:59 +02:00
Nikita Popov	1f88d80408	[SCCP] Don't mark edges feasible when resolving undefs As branch on undef is immediate undefined behavior, there is no need to mark one of the edges as feasible. We can leave all the edges non-feasible. In IPSCCP, we can replace the branch with an unreachable terminator. Differential Revision: https://reviews.llvm.org/D126962	2022-06-22 10:28:27 +02:00
Florian Hahn	ac62b8f704	[ConstraintElimination] Update addFact to take Predicate and ops (NFC). This allows adding facts without necessarily having a corresponding CmpInst.	2022-06-22 08:36:41 +02:00
chenglin.bi	810b5c471f	[NewGVN] add context instruction for SimplifyQuery NewGVN will find operator from other context. ValueTracking currently doesn't have a way to run completely without context instruction. So it will use operator itself as conext instruction. If the operator in another branch will never be executed but it has an assume, it may caused value tracking use the assume to do wrong simpilfy. It would be better to make these simplification queries not use context at all, but that would require some API changes. For now we just use the orignial instruction as context instruction to fix the issue. Fix #56039 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D127942	2022-06-22 12:25:24 +08:00
Heejin Ahn	27e4afcea7	[DSE] Don't remove nounwind invokes For non-mem-intrinsic and non-lifetime `CallBase`s, the current `isRemovable` function only checks if the `CallBase` 1. has no uses 2. will return 3. does not throw: `80fb782336/llvm/lib/Transforms/Scalar/DeadStoreElimination.cpp (L1017)` But we should also exclude invokes even in case they don't throw, because they are terminators and thus cannot be removed. While it doesn't seem to make much sense for `invoke`s to have an `nounwind` target, this kind of code can be generated and is also valid bitcode. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D128224	2022-06-21 11:54:09 -07:00
serge-sans-paille	aaf1630ac3	[Scalarizer] No need to gather a scattered extracted element ExtractElement does not produce a vector out of a vector, so there's no need to call a gather once done. Fix #54469 Credits to npopov@redhat.com for the original approach. Differential Revision: https://reviews.llvm.org/D126012	2022-06-21 18:43:54 +02:00
Florian Hahn	4ea6891f95	[ConstraintElimination] Remove unneeded StackEntry::Condition (NFC). The field was only used for debug printing. Print constraint from the system instead.	2022-06-21 15:57:29 +02:00
Florian Hahn	2a9313ee0b	[ConstraintElimination] Move logic to check condition to helper (NFC).	2022-06-21 11:50:33 +02:00
Kazu Hirata	7a47ee51a1	[llvm] Don't use Optional::getValue (NFC)	2022-06-20 22:45:45 -07:00
Kazu Hirata	d66cbc565a	Don't use Optional::hasValue (NFC)	2022-06-20 20:26:05 -07:00
Kazu Hirata	0916d96d12	Don't use Optional::hasValue (NFC)	2022-06-20 20:17:57 -07:00
Florian Hahn	6dd772d348	[ConstraintElimination] Move logic to get a constraint to helper (NFC).	2022-06-20 21:34:07 +02:00
Kazu Hirata	e0e687a615	[llvm] Don't use Optional::hasValue (NFC)	2022-06-20 10:38:12 -07:00
Florian Hahn	cebe7ae881	[ConstraintElimination] Move logic to add constraint to helper (NFC).	2022-06-20 17:08:35 +02:00
Florian Hahn	bd9632afd2	[ConstraintElimination] Move StackEntry up, to allow use earlier (NFC).	2022-06-20 16:40:42 +02:00
Kazu Hirata	129b531c9c	[llvm] Use value_or instead of getValueOr (NFC)	2022-06-18 23:07:11 -07:00
Kazu Hirata	b254d67160	[llvm] Call *set::insert without checking membership first (NFC)	2022-06-18 08:32:54 -07:00
Florian Hahn	7c0089d735	[Matrix] Check if iterator is at beginning of BB in optimizeTranspose. If an instruction at the beginning of a block is erased, this may trigger crash due to dereferencing an invalid iterator. Check if II is at the end before dereferencing it. Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D127736	2022-06-14 21:37:02 +01:00
Florian Hahn	782e912246	[ConstraintElimination] Support constraints with only const ops. Remove the early exit if both constraints contain no variables. This restriction is unnecessayr for correctness and removing it simplifies handling of trivial constant conditions in follow-up changes.	2022-06-14 10:37:12 +01:00
Guillaume Chatelet	f9bb8c24ac	[NFC][Alignment] Convert MemCpyOptimizer.cpp	2022-06-13 10:07:09 +00:00
Kazu Hirata	5d7b1a5f1b	[Scalar] Use llvm::append_range (NFC)	2022-06-10 23:09:01 -07:00
Guillaume Chatelet	dc9c2eac98	[NFC][Alignment] Simplify code	2022-06-10 15:25:28 +00:00
Guillaume Chatelet	12ccdd67aa	[NFC] Use proper getSliceAlign type in SROA	2022-06-10 12:37:41 +00:00
Philip Reames	206f10d3f6	Plumb InstructionCost through unroll costing Teach the unroller(s) how to handle an invalid cost. This avoids crashes when the backend can't provide a cost due to either a fundemental limitation or an unimplemented cost model case. Differential Revision: https://reviews.llvm.org/D127305	2022-06-09 15:42:53 -07:00
Philip Reames	f85c5079b8	Pipe potentially invalid InstructionCost through CodeMetrics Per the documentation in Support/InstructionCost.h, the purpose of an invalid cost is so that clients can change behavior on impossible to cost inputs. CodeMetrics was instead asserting that invalid costs never occurred. On a target with an incomplete cost model - e.g. RISCV - this means that transformations would crash on (falsely) invalid constructs - e.g. scalable vectors. While we certainly should improve the cost model - and I plan to do so in the near future - we also shouldn't be crashing. This violates the explicitly stated purpose of an invalid InstructionCost. I updated all of the "easy" consumers where bailouts were locally obvious. I plan to follow up with loop unroll in a following change. Differential Revision: https://reviews.llvm.org/D127131	2022-06-09 15:17:24 -07:00
Simon Moll	b8c2781ff6	[NFC] format InstructionSimplify & lowerCaseFunctionNames Clang-format InstructionSimplify and convert all "FunctionName"s to "functionName". This patch does touch a lot of files but gets done with the cleanup of InstructionSimplify in one commit. This is the alternative to the less invasive clang-format only patch: D126783 Reviewed By: spatel, rengolin Differential Revision: https://reviews.llvm.org/D126889	2022-06-09 16:10:08 +02:00
Philip Reames	89c4b29e8d	[GuardWidening] Fix a nasty cast bug in `c2eccc6` `c2eccc6` introduced a call to etHasNoUnsignedWrap which implicitly assumes that Inst is a OverflowingBinaryOperator. This is frequently untrue, but was not caught because cast<Ty>(X) has been broken, see https://discourse.llvm.org/t/cast-x-is-broken-implications-and-proposal-to-address/63033 for context. I considered reverting this, but since doing so re-introduces a nasty miscompile of its own, I decided to fix forward instead. I'll note that this is a particularly nasty form of the cast<Ty>(X) issue. Because the cast was succeeding unexpected, we were writing data to instructions which weren't OBOs. This could result in near arbitrary data or memory corruption. I'm a bit shocked that the sanitizers didn't find this TBH.	2022-06-07 13:27:13 -07:00
Craig Topper	d73684e223	[LoopFlatten] Fix crash if the inner loop trip count comes from a sext instruction. If we look through a truncate in matchLinearIVUser, it's possible we find a sext/zext instruction that didn't come from widening. This will fail the MatchedItCount->getType() == InnerInductionPHI->getType() assertion. Fix this by checking that we did not look through a truncate already. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D127149	2022-06-07 08:21:21 -07:00
Craig Topper	fdd5843572	[LoopFlatten] Replace unchecked dyn_cast with cast. Spotted while reading through the code. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D127146	2022-06-07 08:21:00 -07:00
Kevin P. Neal	a1f1bd547b	[IPSCCP] Switch away from Instruction::isSafeToRemove() In D115737 I found that I needed to teach Instruction::isSafeToRemove() about strictfp/constrained intrinsics. It was pointed out that this is probably the wrong function to use isInstructionTriviallyDead(). It doesn't make sense to have a "second, worse implementation". I also believe that the Instruction class is the wrong place for this functionality. The information about whether or not an instruction can be removed is in the transform passes and should stay there. Differential Revision: https://reviews.llvm.org/D118387	2022-06-06 09:24:11 -04:00
Kazu Hirata	8daf23d364	[Scalar] Use llvm::make_early_inc_range (NFC)	2022-06-05 23:53:18 -07:00
Kazu Hirata	30f19382c6	[Scalar] Remove isValidSingle (NFC) The last use was removed on Feb 18, 2022 in commit `00ab91b70d`.	2022-06-05 08:45:11 -07:00
Fangrui Song	95a134254a	Remove unneeded cl::ZeroOrMore for cl::opt/cl::list options	2022-06-05 01:07:51 -07:00
Fangrui Song	d86a206f06	Remove unneeded cl::ZeroOrMore for cl::opt/cl::list options	2022-06-05 00:31:44 -07:00
Kazu Hirata	e0039b8d6a	Use llvm::less_second (NFC)	2022-06-04 22:48:32 -07:00
Kazu Hirata	f83a88a179	[Transforms] Use llvm::is_contained (NFC)	2022-06-04 20:48:26 -07:00
Fangrui Song	36c7d79dc4	Remove unneeded cl::ZeroOrMore for cl::opt options Similar to `557efc9a8b`. This commit handles options where cl::ZeroOrMore is more than one line below cl::opt.	2022-06-04 00:10:42 -07:00
Fangrui Song	557efc9a8b	[llvm] Remove unneeded cl::ZeroOrMore for cl::opt options. NFC Some cl::ZeroOrMore were added to avoid the `may only occur zero or one times!` error. More were added due to cargo cult. Since the error has been removed, cl::ZeroOrMore is unneeded. Also remove cl::init(false) while touching the lines.	2022-06-03 21:59:05 -07:00
Daniil Suchkov	f1940a5895	Revert "[LoopInterchange] New cost model for loop interchange" Reverting the commit due to numerous buildbot failures. This reverts commit `006334470d`.	2022-06-03 00:52:08 +00:00
Congzhe Cao	006334470d	[LoopInterchange] New cost model for loop interchange This patch proposed to use a new cost model for loop interchange, which is obtained from loop cache analysis. Given a loopnest, what loop cache analysis returns is a vector of loops [loop0, loop1, loop2, ...] where loop0 should be replaced as the outermost loop, loop1 should be placed one more level inside, and loop2 one more level inside, etc. What loop cache analysis does is not only more comprehensive than the current cost model, it is also a "one-shot" query which means that we only need to query it once during the entire loop interchange pass, which is better than the current cost model where we query it every time we check whether it is profitable to interchange two loops. Thus complexity is reduced, especially after D120386 where we do more interchanges to get the globally optimal loop access pattern. Updates made to test cases are mostly minor changes and some corrections. Test coverage for loop interchange is not reduced. Currently we did not completely remove the legacy cost model, but keep it as fall-back in case the new cost model did not run successfully. This is because currently we have some limitations in delinearization, which sometimes makes loop cache analysis bail out. The longer term goal is to enhance delinearization and eventually remove the legacy cost model compeletely. Reviewed By: bmahjour, #loopoptwg Differential Revision: https://reviews.llvm.org/D124926	2022-06-02 19:07:14 -04:00
eopXD	6eab5cade7	[LSR] Early exit for RateFormula when it is already losing. NFC This patch does not effect any behavior of the current code. The codebase implicitly implies that `Cost::RateFormula` is only called when the `Cost` is not in losing status, or else there may be possible to trigger the assertion of `Cost::isValid`. The intention here is to prevent mis-use where future development allow `Cost` that is already loser to call `Cost::RateFormula` - Early exit when `Cost` is already losing. Reviewed By: Meinersbur, #loopoptwg Differential Revision: https://reviews.llvm.org/D125670	2022-06-01 21:02:40 -07:00
Eli Friedman	abdf0da800	[LoopIdiom] Fix bailout for aliasing in memcpy transform. Commit `dd5991cc` modified the aliasing checks here to allow transforming a memcpy where the source and destination point into the same object. However, the change accidentally made the code skip the alias check for other operations in the loop. Instead of completely skipping the alias check, just skip the check for whether the memcpy aliases itself. Differential Revision: https://reviews.llvm.org/D126486	2022-05-31 17:24:23 -07:00
Nuno Lopes	80b3dcc045	[Support] Make report_fatal_error respect its GenCrashDiag argument so it doesn't generate a backtrace There are a few places where we use report_fatal_error when the input is broken. Currently, this function always crashes LLVM with an abort signal, which then triggers the backtrace printing code. I think this is excessive, as wrong input shouldn't give a link to LLVM's github issue URL and tell users to file a bug report. We shouldn't print a stack trace either. This patch changes report_fatal_error so it uses exit() rather than abort() when its argument GenCrashDiag=false. Reviewed by: nikic, MaskRay, RKSimon Differential Revision: https://reviews.llvm.org/D126550	2022-05-30 19:19:23 +01:00
Nikita Popov	1721ff1dfd	[GVN] Enable enable-split-backedge-in-load-pre option by default This option was added in D89854. It prevents GVN from performing load PRE in a loop, if doing so would require critical edge splitting on the backedge. From the review: > I know that GVN Load PRE negatively impacts peeling, > loop predication, so the passes expecting that latch has > a conditional branch. In the PhaseOrdering test in this patch, splitting the backedge negatively affects vectorization: After critical edge splitting, the loop gets rotated, effectively peeling off the first loop iteration. The effect is that the first element is handled separately, then the bulk of the elements use a vectorized reduction (but using unaligned, off-by-one memory accesses) and then a tail of 15 elements is handled separately again. It's probably worth noting that the loop load PRE from D99926 is not affected by this change (as it does not need backedge splitting). This is about normal load PRE that happens to occur inside a loop. Differential Revision: https://reviews.llvm.org/D126382	2022-05-30 09:55:58 +02:00
Max Kazantsev	503d5771b6	[JumpThreading][NFCI] Reuse existing DT instead of recomputation This whole part with recomputation of BPI and BFI looks redundant, and we tried to get rid of it in D124439. Unfortunately, it causes some hard-to-reproduce failures due to invalid state of analysis. Until this is investigated and fixed, let's try to reuse at least part of available analyzes. DT is available at this point, and there is no need to recompute it. Please revert if you see it causing any behavior changes.	2022-05-30 12:48:10 +07:00
Florian Hahn	0776c48f9b	Recommit "[LICM] Only create load in ph when promoting load or store doesn't exec." This reverts the revert commit `ad95255b92`. The updated version also creates a load when the store may not execute. In those cases, we still need to introduce a load in a function where there may not have been one before, so this doesn't completely resolve issue #51248. Original message: When only a store is sunk, there is no need to create a load in the pre-header, as the result of the load will never get used. The dead load can can introduce UB, if the function is marked as writeonly. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D123473	2022-05-29 21:57:14 +01:00
eopXD	6a84579243	[LSR][TTI][PowerPC][SystemZ][X86] Add const-ness to TTI::isLSRCostLess. NFC Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D126350	2022-05-27 15:22:23 -07:00
Arthur Eubanks	36096c2b38	[NFC][JumpThreading] Remove InsertFreezeWhenUnfoldingSelect pass parameter All callers pass true. select-unfold-freeze.ll is now a subset of select.ll so delete it. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D126501	2022-05-26 16:13:34 -07:00
Owen Anderson	939a43461b	Revert "Replace the custom linked list in LeaderTableEntry with TinyPtrVector." This reverts commit `1e91149844`. Pending further discussion.	2022-05-26 09:50:36 -07:00
Alex Zhikhartsev	8b0d763474	[DFAJumpThreading] Relax analysis to handle unpredictable initial values Responding to a feature request from the Rust community: https://github.com/rust-lang/rust/issues/80630 void foo(X) { for (...) switch (X) case A X = B case B X = C } Even though the initial switch value is non-constant, the switch statement can still be threaded: the initial value will hit the switch statement but the rest of the state changes will proceed by jumping unconditionally. The early predictability check is relaxed to allow unpredictable values anywhere, but later, after the paths through the switch statement have been enumerated, no non-constant state values are allowed along the paths. Any state value not along a path will be an initial switch value, which can be safely ignored. Differential Revision: https://reviews.llvm.org/D124394	2022-05-26 11:29:54 -04:00
Florian Hahn	f96aa493f0	[SimpleLoopUnswitch] Always skip trivial select and set condition. When updating the branch instruction outside the loopduring non-trivial unswitching, always skip trivial selects and update the condition. Otherwise we might create invalid IR, because the trivial select is inside the loop, while the condition is outside the loop. Fixes #55697.	2022-05-26 09:46:24 +01:00
Owen Anderson	1e91149844	Replace the custom linked list in LeaderTableEntry with TinyPtrVector. The purpose of the custom linked list was to optimize for the case of a single-element list. It turns out that TinyPtrVector handles the same basic scenario even better, reducing the size of LeaderTableEntry by 33%, and requiring only log2(N) allocations as the size of the list grows. The only downside is that we have to store the Value's and BasicBlock's in separate vectors, which is slightly awkward in a few cases. Fortunately that ends up being entirely encapsulated inside helper functions. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D125205	2022-05-25 23:52:44 -07:00
Serguei Katkov	c2eccc67ce	[GuardWidening] Remove nuw/nsw flags for hoisted instructions When we hoist instructions over guard we must clear flags due to these flags might be implied using this guard, so they make sense only after the guard. As an example of the bug due to current behavior. L is known to be in range say [0, 100) c1 = x u< L guard (c1) x1 = add x, 1 c2 = x1 u< L guard(c2) basing on guard(c1) we can say that x1 = add nuw nsw x, 1 after guard widening we get c1 = x u< L x1 = add nuw nsw x, 1 c2 = x1 u< L c = and c1, c2 guard(c) now, basing on fact that x + 1 < L and x >= 0 due to x + 1 is nuw we can prove that x + 1 u< L implies that x u< L, so we can just remove c1 x1 = add nuw nsw x, 1 c2 = x1 u< L guard(c2) But that is not correct due to we will pass x == -1 value. Reviewed By: mkazantsev Subscribers: llvm-commits, nikic Differential Revision: https://reviews.llvm.org/D126354	2022-05-26 13:20:55 +07:00
Nikita Popov	6f0ca6fd23	[JumpThreading] Insert freeze when unfolding select JumpThreading may convert selects into branch instructions, in which case the condition needs to be frozen (as branch on poison is immediate undefined behavior, unlike select on poison). The necessary code for this is already in place, this just enables the option. Differential Revision: https://reviews.llvm.org/D125869	2022-05-21 11:24:27 +02:00
Florian Hahn	32d6ef36d6	[SimpleLoopUnswitch] Skip trivial selects during trivial unswitching. Update the remaining places in unswitchTrivialBranch to properly skip trivial selects. Fixes #55526.	2022-05-19 17:01:13 +01:00
Jay Foad	6bec3e9303	[APInt] Remove all uses of zextOrSelf, sextOrSelf and truncOrSelf Most clients only used these methods because they wanted to be able to extend or truncate to the same bit width (which is a no-op). Now that the standard zext, sext and trunc allow this, there is no reason to use the OrSelf versions. The OrSelf versions additionally have the strange behaviour of allowing extending to a smaller width, or truncating to a larger width, which are also treated as no-ops. A small amount of client code relied on this (ConstantRange::castOp and MicrosoftCXXNameMangler::mangleNumber) and needed rewriting. Differential Revision: https://reviews.llvm.org/D125557	2022-05-19 11:23:13 +01:00
Nikita Popov	c9e7049754	[JumpThreading] Look through freeze in getPredicateAt() fold This code is valid for any icmp, so we can safely look through a freeze when trying to find one. A caveat here is that replaceFoldableUses() may not end up replacing any uses in this case. It might make sense to use the freeze as the context instruction (rather than the terminator) if there is a freeze, to ensure that it always gets folded. This would require some changes to how replaceFoldedUses() works though, as it currently assumes that the value is valid at the end of the block.	2022-05-18 12:09:59 +02:00
Nikita Popov	18c70a7bd9	[JumpThreading] Simplify getPredicateAt() based folding It's sufficient to just fold the icmp to true/false here, and then let constant terminator folding take care of the rest. It should be noted that while replaceFoldableUses() may not replace all uses of the icmp, at least the use in the terminator we're working on is always replaceable, so terminator constant folding should be reliably enabled as a subsequent step.	2022-05-18 11:24:52 +02:00

1 2 3 4 5 ...

11781 Commits