llvm-project

Commit Graph

Author	SHA1	Message	Date
Alex Brachet	ecac223b0e	[PGO] Make emitted symbols hidden This was reverted because it was breaking when targeting Darwin which tried to export these symbols which are now hidden. It should be safe to just stop attempting to export these symbols in the clang driver, though Apple folks will need to change their TAPI allow list described in the commit where these symbols were originally exported `f538018562` Bug: https://github.com/llvm/llvm-project/issues/58265 Differential Revision: https://reviews.llvm.org/D135340	2022-10-13 19:47:15 +00:00
Florian Hahn	71c49d189a	[ConstraintElim] Move check-and-replace logic to helper function (NFC). Move logic to check and replace conditions to a helper function. This isolates the code, allows using early returns, reduces the indentation and simplifies eliminateConstraints.	2022-10-13 18:58:37 +01:00
Nikita Popov	b54b84fde6	[MemCpyOpt] Add additional debug output (NFC)	2022-10-13 17:03:44 +02:00
Alexey Bataev	c787986cdd	[SLP]Improve costs of vectorized loads/stores by analyzing GEPs. When generating masked gather nodes, SLP vectorizer accounts for the cost of the GEPs for loads as part of the scalar-vector transformation cost estimation. But it does not do so for vectorized loads/stores, even though vectorization may completely remove some of the GEPs. Because of this, in some cases a masked gather operation can be much more profitable than regular vectorization (masked-gather cost + vector GEP - scalar loads + GEPs compared to vectorized loads - scalar loads). Added the analysis of the removed scalar GEPs for vectorized load/store nodes for better cost estimation. Differential Revision: https://reviews.llvm.org/D135282	2022-10-13 07:20:41 -07:00
Philip Reames	fe755af3a9	Revert "Remove PlaceSafepoints pass" This reverts commit `cb66e123c6`. It was reported via https://reviews.llvm.org/rGcb66e123c6bc82a793300b6fb3ecbed79c58f557#1132969 that the Microsoft.NET compiler is still using this pass.	2022-10-13 07:17:25 -07:00
Florian Hahn	019049a1ca	[ConstraintElim] Use MulOverflow to avoid UB on signed overflow. This fixes a UBSan failure after `359bc5c541`. For inbounds GEP with index sizes <= 64, having the coefficients overflowing is fine.	2022-10-13 13:57:43 +01:00
Nikita Popov	d44cd1bbeb	Revert "[FunctionAttrs] Make location classification more precise" This reverts commit `b05f5b90a1`. There are thread sanitizer buildbot failures in simple_stack.c. I think that's because this ended up affecting the handling of volatile accesses to allocas. Reverting for now.	2022-10-13 12:11:04 +02:00
Nikita Popov	b05f5b90a1	[FunctionAttrs] Make location classification more precise Don't add argmem if the pointer is clearly not an argument (e.g. a global). I don't think this makes a difference right now, but gives more obvious results with D135780.	2022-10-13 11:24:23 +02:00
Florian Hahn	359bc5c541	[ConstraintElim] Bail out for GEPs when index size > 64 bits. Limit pointer decomposition to pointers with index sizes of at most 64 bits. int64_t is used for coefficients, so as long as the index size <= 64 bits we should be able to represent all pointer offsets. Pointer decomposition is limited to inbounds GEPs, so if an index computation would overflow the result is poison, so it doesn't matter that the coefficient overflows. This allows replacing MulOverflow with regular multiplications.	2022-10-13 10:19:30 +01:00
Nikita Popov	440ce05fbf	[FunctionAttrs] Handle potential access of captured argument We have to account for accesses to argument memory via captures. I don't think there's any way to make this produce incorrect results right now (because as soon as "other" is set, we lose the ability to infer argmemonly), but this avoids incorrect results once we have more precise representation.	2022-10-13 11:15:36 +02:00
Nikita Popov	5b3776842f	[FunctionAttrs] Account for memory effects of inalloca/preallocated The code for inferring memory attributes on arguments claims that inalloca/preallocated arguments are always clobbered: `d71ad41080/llvm/lib/Transforms/IPO/FunctionAttrs.cpp (L640-L642)` However, we would still infer memory attributes for the whole function without taking this into account, so we could still end up inferring readnone for the function. This adds an argument clobber if there are any inalloca/preallocated arguments. Differential Revision: https://reviews.llvm.org/D135783	2022-10-13 10:20:17 +02:00
Florian Hahn	0ebd288338	[ConstraintElim] Move GEP decomposition code to separate fn (NFC). Breaks up a large function and allows the use of early exits.	2022-10-12 20:39:05 +01:00
Arthur Eubanks	f59e1bcc22	[PrintPipeline] Handle CoroConditionalWrapper and add more verification Add a check (can be disabled via a flag) that the pipeline we generate is actually parsable. Can be disabled because we don't expect to handle every pass in -print-pipeline-passes. Fixes #58280. Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D135703	2022-10-12 09:36:45 -07:00
Sanjay Patel	7b9482df3d	[InstCombine] fold sdiv with common shl amount in operands (X << Z) / (Y << Z) --> X / Y https://alive2.llvm.org/ce/z/CLKzqT This requires a surprising "nuw" constraint because we have to guard against immediate UB via signed-div overflow with -1 divisor. This extends `008a89037a` and is another transform derived from issue #58137.	2022-10-12 11:32:15 -04:00
Alexey Bataev	d71ad41080	[SLP]Fix insertpoint of the extractelement instructions to avoid reshuffle crash. Need to set the insertpoint for extractelement to point to the first instruction in the node to avoid a possible crash during the external uses combine process. Without it we may end up with an incorrect transformation. Differential Revision: https://reviews.llvm.org/D135591	2022-10-12 08:18:30 -07:00
Sanjay Patel	008a89037a	[InstCombine] fold udiv with common shl amount in operands (X << Z) / (Y << Z) --> X / Y https://alive2.llvm.org/ce/z/E5eaxU This fixes the motivating example from issue #58137, but it is not the most general transform. We should probably also convert left-shift in the divisor to right-shift in the dividend for that, but that exposes another missed canonicalization for shifts and adds.	2022-10-12 11:12:26 -04:00
Jordan Rupprecht	cbae57c0e1	[NFC] Ignore unused var in no-asserts builds	2022-10-12 08:11:10 -07:00
Alexey Bataev	1be3428ea0	[SLP]Fix PR58177: Improve isUndefVector function to avoid extra freeze. Freeze instruction in some cases makes codegen worse, so need to be very careful when emitting it. Instead improve analysis in isUndefVector function to generate mask of unused elements and use it in the analysis. Differential Revision: https://reviews.llvm.org/D135382	2022-10-12 07:32:54 -07:00
Sanjay Patel	fe97f95036	[InstCombine] propagate "exact" through folds of div These folds were added recently with: `6b869be810` `8da2fa856f` ...but they didn't account for the "exact" attribute, and that can be safely propagated: https://alive2.llvm.org/ce/z/F_WhnR https://alive2.llvm.org/ce/z/ft9Cgr	2022-10-12 09:25:05 -04:00
Sanjay Patel	d117ee25b8	[InstCombine] add helper function for div+shl folds; NFC There are at least 2 similar patterns that could be added here, and the existing fold can be improved because it fails to propagate "exact".	2022-10-12 09:25:04 -04:00
Florian Hahn	c1fe52bfa6	[VPlan] Remove dead recipes before sinking. optimizeInductions may leave dead recipes which can prevent sinking. Sinking on the other hand should not introduce new dead recipes, so clean up dead recipes before sinking. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D133762	2022-10-12 12:49:42 +01:00
Max Kazantsev	fbad5fdc03	[NFC] Perform all legality checks for non-trivial unswitch in one function They have been scattered over the code. For better structuring, perform them in one place. Potential CT drop is possible because we collect exit blocks twice, but it's a small price to pay for much better code structure.	2022-10-12 18:35:12 +07:00
Max Kazantsev	6bfcac612f	[SimpleLoopUnswitch][NFC] Separate legality checks from cost computation These are semantically two different stages, but were entwined in the old implementation. Now cost computation does not do legality checks, and they are all done beforehand.	2022-10-12 13:31:36 +07:00
Max Kazantsev	421728b40c	[NFC] Factor out computation of best unswitch cost candidate Split out a major piece of this method to make code more readable.	2022-10-12 12:36:46 +07:00
Fangrui Song	8ef3fd8d59	[LTO] Make local linkage GlobalValue in non-prevailing COMDAT available_externally For a local linkage GlobalObject in a non-prevailing COMDAT, it remains defined while its leader has been made available_externally. This violates the COMDAT rule that its members must be retained or discarded as a unit. To fix this, update the regular LTO change D34803 to track local linkage GlobalValues, and port the code to ThinLTO (GlobalAliases are not handled.) This fixes two problems. (a) `__cxx_global_var_init` in a non-prevailing COMDAT group used to linger around (unreferenced, hence benign), and is now correctly discarded. ``` int foo(); inline int v = foo(); ``` (b) Fix https://github.com/llvm/llvm-project/issues/58215: as a size optimization, we place private `__profd_` in a COMDAT with a `__profc_` key. When FuncImport.cpp makes `__profc_` available_externally due to a non-prevailing COMDAT, `__profd_` incorrectly remains private. This change makes the `__profd_` available_externally. ``` cat > c.h <<'eof' extern void bar(); inline __attribute__((noinline)) void foo() {} eof cat > m1.cc <<'eof' int main() { bar(); foo(); } eof cat > m2.cc <<'eof' __attribute__((noinline)) void bar() { foo(); } eof clang -O2 -fprofile-generate=./t m1.cc m2.cc -flto -fuse-ld=lld -o t_gen rm -fr t && ./t_gen && llvm-profdata show -function=foo t/default_.profraw clang -O2 -fprofile-generate=./t m1.cc m2.cc -flto=thin -fuse-ld=lld -o t_gen rm -fr t && ./t_gen && llvm-profdata show -function=foo t/default_.profraw ``` Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D135427	2022-10-11 15:30:07 -07:00
Florian Hahn	be611ef7fa	[LoopRotation] Also drop block dispositions. LoopRotation may also fold basic blocks, so cached block dispositions also need to be dropped. Fixes #58291.	2022-10-11 15:25:27 +01:00
Sanjay Patel	7ec604a317	[InstCombine] try harder to cancel out mul/div ((Op1 * X) / Y) / Op1 --> X / Y https://alive2.llvm.org/ce/z/JYxWjA InstSimplify handles the more basic mul+div pattern with shared operand, but we don't seem to have any reassociation folds to handle cases where the common op is further away. This is a generalization of `9cff4711ac` and another transform derived from issue #58137.	2022-10-11 09:51:51 -04:00
Max Kazantsev	91aa9097ae	[NFC] Factor out collection of unswitch candidate to a separate function Just to make the code more structured and easier to understand.	2022-10-11 19:35:16 +07:00
Max Kazantsev	f18979912d	[NFC] Refine API in SimpleLoopUnswitch: add missing const notions	2022-10-11 19:35:16 +07:00
Max Kazantsev	a8a07890aa	[NFC] Refine API: add missing const notion in hasPartialIVCondition	2022-10-11 19:35:16 +07:00
Nikita Popov	df8264c46a	[SimplifyLibCalls] Use helper methods to query attributes (NFC)	2022-10-11 11:41:28 +02:00
Daniel Sanders	4a95a64e4a	[instcombine] (extelt (inselt Vec, Value, Index), Index) -> Value When Index is variable but still trivially known to be equal we can use Value from before the insertion, possibly eliminating the vector. Reverts a functional change from: Author: Philip Reames <listmail@philipreames.com> Date: Wed Dec 8 12:21:10 2021 -0800 [instcombine] A couple style tweaks to visitExtractElementInst [nfc] Thanks to Michele Scandale for identifying the bug Differential Revision: https://reviews.llvm.org/D135625	2022-10-10 15:41:53 -07:00
Sanjay Patel	baab4aa1ba	[VectorCombine] convert scalar fneg with insert/extract to vector fneg insertelt DestVec, (fneg (extractelt SrcVec, Index)), Index --> shuffle DestVec, (fneg SrcVec), Mask This is a specialized form of what could be a more general fold for a binop. It's also possible that fneg is overlooked by SLP in this kind of insert/extract pattern since it's a unary op. This shows up in the motivating example from issue #58139, but it won't solve it (that probably requires some x86-specific backend changes). There are also some small enhancements (see TODO comments) that can be done as follow-up patches. Differential Revision: https://reviews.llvm.org/D135278	2022-10-10 14:59:56 -04:00
Jordan Rupprecht	fb27fd5f88	Revert "[LTO] Make local linkage GlobalValue in non-prevailing COMDAT available_externally" This reverts commit `4fbe33593c`. It causes linking errors, with details provided internally. (Hopefully the author/reviewers will be able to upstream the internal repro).	2022-10-10 11:40:45 -07:00
Florian Hahn	4b6bd1c9d5	[LoopSimplifyCFG] Clear SCEV dispositions when removing dead blocks. When removing loops & blocks we also need to clear the SCEV dispositions as they may now contain incorrect values. Fixes #58262.	2022-10-10 18:08:35 +01:00
Florian Hahn	80e49f49e4	[ConstraintElimination] Bail out for GEPs with scalable vectors. This fixes a crash with scalable vectors, thanks @nikic for spotting this!	2022-10-10 16:01:20 +01:00
Shubham Narlawar	b920407cf5	[LICM] Disable thread-safety checks in single-thread model If the single-thread model is used, or the -licm-force-thread-model-single flag is specified, skip checks related to thread-safety. This means that store promotion for conditionally executed stores only requires proof of dereferenceability and writability, but not of thread-safety. For example, this enables promotion of stores to (non-constant) globals, as well as captured allocas. Fixes https://github.com/llvm/llvm-project/issues/50537. Differential Revision: https://reviews.llvm.org/D130466	2022-10-10 16:51:16 +02:00
Alex Brachet	deb82d4a20	Revert "[PGO] Make emitted symbols hidden" This reverts commit `4ea1a647ff`. This breaks on Darwin, which tries to export these symbols `ebb258d3b0/clang/lib/Driver/ToolChains/Darwin.cpp (L1363)` I'll try to reland with that removed and with approval from Apple folks.	2022-10-10 14:37:59 +00:00
Sanjay Patel	9cff4711ac	[InstCombine] fold udiv with common factor ((X *nuw Y) >> Z) / X --> Y >> Z https://alive2.llvm.org/ce/z/x3kKnq This is similar to `6b869be810` / `8da2fa856f`, but I have not found a signed equivalent, so it's just an unsigned match for now.	2022-10-10 08:12:06 -04:00
Nikita Popov	874c0327e7	[Attributor] Use ConstantFoldLoadFromConst() When determining the initial value of the object, use the constant folding API to load a given type at a given offset in the global initializer. This makes it work for cases where the load doesn't directly correspond to an aggregate member. Differential Revision: https://reviews.llvm.org/D135435	2022-10-10 10:17:37 +02:00
Florian Hahn	fee8f561bd	[ConstraintElimination] Include index type scale. The current decomposition for GEPs did not correctly handle cases where GEPs access different source types. Adjust the constraints by including the indexed type-size as coefficients. Further generalization to allow GEPs with more than one index is a needed general follow-up improvement.	2022-10-09 21:53:30 +01:00
luxufan	eaf6e2fc33	[DSE] Relax constraint on isGuaranteedLoopInvariant If the location ptr to be killed is in no loop and the Function does not have irreducible loops, then we can regard it as loop invariant. Differential Revision: https://reviews.llvm.org/D135369	2022-10-06 03:01:21 +00:00
Florian Hahn	11a6e64ba7	[ConstraintElim] Move logic to get constraint for solving to helper. Move common logic shared by callers of getConstraint that use the result to query the constraint system to a new helper getConstraintForSolving. This includes common legality checks (i.e. not an equality constraint, no new variables) and the logic to query the unsigned system if possible for signed predicates.	2022-10-09 10:44:36 +01:00
Fangrui Song	4fbe33593c	[LTO] Make local linkage GlobalValue in non-prevailing COMDAT available_externally See the updated linkonce_resolution_comdat.ll. For a local linkage GV in a non-prevailing COMDAT, it remains defined while its leader has been made available_externally. This violates the COMDAT rule that its members must be retained or discarded as a unit. To fix this, update the regular LTO change D34803 to track local linkage GlobalValues, and port the code to ThinLTO (GlobalAliases are not handled.) Fix https://github.com/llvm/llvm-project/issues/58215: as a size optimization, we place private `__profd_` in a COMDAT with a `__profc_` key. When FuncImport.cpp makes `__profc_` available_externally due to a non-prevailing COMDAT, `__profd_` incorrectly remains private. This change makes the `__profd_` available_externally. ``` cat > c.h <<'eof' extern void bar(); inline __attribute__((noinline)) void foo() {} eof cat > m1.cc <<'eof' #include "c.h" int main() { bar(); foo(); } eof cat > m2.cc <<'eof' #include "c.h" __attribute__((noinline)) void bar() { foo(); } eof clang -O2 -fprofile-generate=./t m1.cc m2.cc -flto -fuse-ld=lld -o t_gen rm -fr t && ./t_gen && llvm-profdata show -function=foo t/default_.profraw # one _Z3foov clang -O2 -fprofile-generate=./t m1.cc m2.cc -flto=thin -fuse-ld=lld -o t_gen rm -fr t && ./t_gen && llvm-profdata show -function=foo t/default_.profraw # one _Z3foov ``` Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D135427	2022-10-08 11:09:43 -07:00
Florian Hahn	e0136a62cc	[ConstraintElimination] Support chained GEPs with constant offsets. Handle the (gep (gep ....), C) case by incrementing the constant coefficient of the inner GEP, if C is a constant.	2022-10-08 16:59:27 +01:00
Florian Hahn	73950f26f5	[LV] Replace check with assert for reduction resume values (NFC). At this point, we need to have resume values for all inductions. If not, this would result in silent mis-compiles.	2022-10-08 16:26:10 +01:00
Florian Hahn	be858bda69	[ConstraintElimination] Remove unused function (NFC).	2022-10-08 16:05:56 +01:00
Sanjay Patel	eccb9a77c6	[InstCombine] fold exact sdiv to ashr (2nd try) The 1st attempt failed to update the test checks as expected. Original commit message: sdiv exact X, (1<<ShAmt) --> ashr exact X, ShAmt (if shl is non-negative) https://alive2.llvm.org/ce/z/kB6VF7 It would probably be better to use ValueTracking to replace this and the existing transform above it, but the analysis does not account for the no-wrap properly, and it's not immediately clear to me how to fix it.	2022-10-08 10:09:44 -04:00
Sanjay Patel	68d4dbc2c1	Revert "[InstCombine] fold exact sdiv to ashr" This reverts commit `fe15290e0c`. The test checks were not updated as expected.	2022-10-08 10:02:03 -04:00
Sanjay Patel	fe15290e0c	[InstCombine] fold exact sdiv to ashr sdiv exact X, (1<<ShAmt) --> ashr exact X, ShAmt (if shl is non-negative) https://alive2.llvm.org/ce/z/kB6VF7 It would probably be better to use ValueTracking to replace this and the existing transform above it, but the analysis does not account for the no-wrap properly, and it's not immediately clear to me how to fix it.	2022-10-08 09:23:46 -04:00
Florian Hahn	9d31d1c214	[ConstraintElimination] Use logic from `3771310eed` for queries only. The logic added in `3771310eed` was placed sub-optimally. Applying the transform in ::getConstraint meant that it would also impact conditions that are added to the system by the signed <-> unsigned transfer logic. This meant we failed to add some signed facts to the signed system. To make sure we still add as many useful facts to the signed/unsigned systems, move the logic to the point where we query the system.	2022-10-08 11:03:45 +01:00
Florian Hahn	13ac102726	[LoopSimplifyCFG] Invalidate SCEV dispositions. Clear all dispositions if there are any dead blocks (which will get removed later) and also clear dispositions for removed instructions. Clearing all dispositions in case there are dead blocks happens first, which should avoid traversing SCEV use-lists for invalidating dispositions for individual values. Fixes #58179.	2022-10-07 21:35:42 +01:00
Florian Hahn	19ad1cd5ce	Recommit "[SCEV] Support clearing Block/LoopDispositions for a single value." This reverts commit `92f698f01f`. The updated version of the patch includes handling for non-SCEVable types. A test case has been added in `ec86e9a99b`.	2022-10-07 20:15:44 +01:00
Philip Reames	cb66e123c6	Remove PlaceSafepoints pass This patch was added way back in the beginning of the work which became the statepoint infrastructure. The idea was that safepoints could be inserted late in the optimization pipeline. This is true if the only concern is garbage collection, but this approach turned out to be incompatible with the requirement to also support deoptimization at safepoints. In theory, this pass would still be quite useful for an AOT compiled language which wants to support garbage collection, but we have no known users, and haven't for over 5 years. Time to remove unused code. If someone wants to use this, restoring it would not be hard. The immediate motivation for removal is that this is one of the last passes remaining which hasn't been ported to the new pass manager and the (straightforward) work to do so is not justified for unused code. Differential Revision: https://reviews.llvm.org/D135371	2022-10-07 11:51:00 -07:00
Sanjay Patel	3e6767ed5f	[InstCombine] propagate 'exact' when converting ashr to lshr The shift amount is not changing, so if we guaranteed shifting out zeros before, those bits are still zeros. https://alive2.llvm.org/ce/z/sokQca	2022-10-07 13:17:19 -04:00
Florian Hahn	92f698f01f	Revert "[SCEV] Support clearing Block/LoopDispositions for a single value." This reverts commit `9e931439dd`. This commit causes a crash with TSan, e.g. https://lab.llvm.org/buildbot/#/builders/70/builds/28309/steps/10/logs/stdio Reverting while I extract a reproducer and submit a fix.	2022-10-07 17:58:54 +01:00
Sanjay Patel	bdfefac9a4	[InstCombine] refactor sdiv by (negative) power-of-2 folds; NFCI It's probably better to try harder on this kind of pattern by using ValueTracking.	2022-10-07 11:35:17 -04:00
Florian Hahn	9e931439dd	[SCEV] Support clearing Block/LoopDispositions for a single value. Extend forgetBlockAndLoopDisposition to allow clearing information for a single value. This can be useful when only a single value is changed, e.g. because the instruction is moved. We also need to clear the cached values for all SCEV users, because they may depend on the starting value's disposition. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D134614	2022-10-07 16:07:17 +01:00
Florian Hahn	3771310eed	[ConstraintElimination] Convert to unsigned Pred if possible. Convert SLE/SLT predicates to unsigned equivalents if both operands are known to be signed-positive. https://alive2.llvm.org/ce/z/tBeiZr	2022-10-07 12:27:36 +01:00
Nikita Popov	b43a4d0850	[LoopPeeling] Support peeling loops with non-latch exits Loop peeling currently requires that a) the latch is exiting b) a branch and c) other exits are unreachable/deopt. This patch removes all of these limitations, and adds the necessary branch weight updating support. It essentially works the same way as before with latch -> exiting terminator and loop trip count -> per exit trip count. It's worth noting that there are still other limitations in profitability heuristics: This patch enables peeling of loops to make conditions invariant (which is pretty much always highly profitable if possible), while peeling to make loads dereferenceable still checks that non-latch exits are unreachable and PGO-based peeling has even more conditions. Those checks could be relaxed later if we consider those cases profitable. The motivation for this change is that loops using iterator adaptors in Rust often optimize very badly, and end up with a loop phi of the form phi(true, false) in the final result. Peeling eliminates that phi and conditions based on it, which enables a lot of follow-on simplification. Differential Revision: https://reviews.llvm.org/D134803	2022-10-07 12:35:52 +02:00
Nikita Popov	ccf53cae32	[ValueTracking] Remove unused Offset argument in getConstantStringInfo() (NFC)	2022-10-07 11:35:55 +02:00
Dmitry Makogon	8307f6c854	[LoopPredication] Insert assumes of conditions of predicated guards As LoopPredication performs non-equivalent transforms removing some checks from loops, other passes may not be able to perform transforms they'd be able to do if the checks were left in loops. This patch makes LoopPredication insert assumes of the replaced conditions either after a guard call or in the true block of widenable condition branch. Differential Revision: https://reviews.llvm.org/D135354	2022-10-07 16:10:24 +07:00
Nikita Popov	333246b48e	Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify Relative to the previous attempt, this adjusts simplification to use the correct context instruction: We need to use the terminator of the incoming block, not the original instruction. ----- foldOpIntoPhi() currently only folds operations into the phi if all but one operands constant-fold. The two exceptions to this are freeze and select, where we allow more general simplification. This patch makes foldOpIntoPhi() generally simplification based and removes all the instruction-specific logic. We just try to simplify the instruction for each operand, and for the (potentially) one non-simplified operand, we move it into the new block with adjusted operands. This fixes https://github.com/llvm/llvm-project/issues/57448, which was my original motivation for the change. Differential Revision: https://reviews.llvm.org/D134954	2022-10-07 11:04:19 +02:00
Alina Sbirlea	b9898e7ed1	Revert "Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify" This reverts commit `e94619b955`.	2022-10-06 13:12:24 -07:00
Alexey Bataev	323ed2308a	[SLP]Improve/fix CSE analysis of the blocks/instructions. Added analysis for invariant extractelement instructions and improved detection of the CSE blocks for generated extractelement instructions. Differential Revision: https://reviews.llvm.org/D135279	2022-10-06 12:08:48 -07:00
Alex Brachet	4ea1a647ff	[PGO] Make emitted symbols hidden Differential Revision: https://reviews.llvm.org/D135340	2022-10-06 18:28:16 +00:00
Bjorn Pettersson	0db4b1d1a8	[SimplifyLibCalls] Adjust code comment in optimizeStringLength. NFC The limitation in LibCallSimplifier::optimizeStringLength to only optimize when the string is an i8 array was changed already in commit `50ec0b5dce` back in 2017. We still only simplify when 's' points at an array of 'CharSize', so the comment is still valid in the sense that we do not support arbitrary array types. Differential Revision: https://reviews.llvm.org/D135261	2022-10-06 20:00:27 +02:00
Arthur Eubanks	ae5733346f	Revert "[DSE] Eliminate noop store even through has clobbering between LoadI and StoreI" This reverts commit `cd8f3e7581`. Causes miscompiles, see D132657	2022-10-06 10:36:02 -07:00
Sanjay Patel	8da2fa856f	[InstCombine] fold sdiv with hidden common factor (X * Y) s/ (X << Z) --> Y s/ (1 << Z) https://alive2.llvm.org/ce/z/yRSddG issue #58137	2022-10-06 13:11:50 -04:00
Florian Hahn	a7ac0dd0cf	[ConstraintElimination] Generalize AND matching. Extend more general matching used for chains of ORs to also support chains of ANDs.	2022-10-06 17:17:38 +01:00
Sanjay Patel	6b869be810	[InstCombine] fold udiv with hidden common factor (X * Y) u/ (X << Z) --> Y u>> Z https://alive2.llvm.org/ce/z/4G9D_W	2022-10-06 11:35:27 -04:00
Florian Hahn	8e3e96298f	[ConstraintElimination] Order cmps for signed <-> unsigned transfer first. Make sure conditions with constant operands come before conditions without constant operands. This increases the effectiveness of the current signed <-> unsigned fact transfer logic.	2022-10-06 15:56:25 +01:00
Florian Hahn	349375d093	[ConstraintElimination] Generalize OR matching. Extend OR handling to traverse chains of ORs.	2022-10-06 11:56:22 +01:00
Nikita Popov	028874dd61	[Local] Fix unused variable warnings (NFC)	2022-10-06 10:30:59 +02:00
Florian Hahn	7449570ff7	[ConstraintElimination] Use ConstraintTy::IsSigned instead of Predicate. This should be NFC and ensure the sign of the constraint is used consistently in the future.	2022-10-06 07:51:49 +01:00
Carl Ritson	c316332e17	[Sink] Allow sinking of invariant loads across critical edges Invariant loads can always be sunk. Reviewed By: foad, arsenm Differential Revision: https://reviews.llvm.org/D135133	2022-10-06 09:21:12 +09:00
Florian Hahn	9aa004a04c	[ConstraintElimination] Convert NewIndices to vector and rename (NFCI). The callers of getConstraint only require a list of new variables. Update the naming and types to make this clearer.	2022-10-05 16:25:00 +01:00
Johannes Doerfert	e18736149c	[Attributor] Teach AAPointerInfo about atomic cmpxchg and rmw The atomic operations behave similarly to a store except that we don't know the new value and we read the result first.	2022-10-05 06:48:00 -07:00
Johannes Doerfert	93e51fa444	[Attributor] AAPointerInfo can model non-escaping call uses If a call base use will not capture a pointer we can approximate the effects. This is important especially for readnone/only uses. Even may-write uses are not too bad with reachability in place. Capturing is the problem as we lose track of update sides.	2022-10-05 06:29:14 -07:00
Johannes Doerfert	477e8e10f0	[Attributor] Teach AAPointerInfo to look into aggregates If we have a constant aggregate, e.g., as an initializer, we usually failed to extract the proper value/type from it. This patch provides the size and offset information necessary to extract the right part of the constant.	2022-10-05 06:19:47 -07:00
Nikita Popov	5fa14ee835	[MemCpyOpt] Don't hoist above producer of pointer operand This was already handled correctly below, but not checked for the original store pointer operand. Encountered when converting tests to opaque pointers, where the intermediate bitcast goes away.	2022-10-05 14:52:33 +02:00
David Stuttard	d1d7d2235c	[AggressiveInstCombine] Fix cases where non-opaque pointers are used In the case of non-opaque pointers, when combining consecutive loads, need to bitcast the pointer source to the combined type size, otherwise asserts are triggered. Differential Revision: https://reviews.llvm.org/D135249	2022-10-05 13:42:46 +01:00
Nikita Popov	e94619b955	Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify The infinite loop seen on buildbots should be fixed by `11897708c0` (assuming there are not multiple infinite combine loops...) ----- foldOpIntoPhi() currently only folds operations into the phi if all but one operands constant-fold. The two exceptions to this are freeze and select, where we allow more general simplification. This patch makes foldOpIntoPhi() generally simplification based and removes all the instruction-specific logic. We just try to simplify the instruction for each operand, and for the (potentially) one non-simplified operand, we move it into the new block with adjusted operands. This fixes https://github.com/llvm/llvm-project/issues/57448, which was my original motivation for the change. Differential Revision: https://reviews.llvm.org/D134954	2022-10-05 14:00:19 +02:00
Nikita Popov	11897708c0	[InstCombine] Directly replace instr in foldIntegerTypedPHI() (NFCI) Rather than inserting a ptrtoint + inttoptr pair, directly replace the inttoptr with the new phi node. This ensures that no other transform can undo it before the pair gets folded away. This avoids the infinite loop when combined with D134954. This is NFCI in the sense that it shouldn't make a difference, but could due to different worklist order.	2022-10-05 13:28:23 +02:00
Florian Hahn	469f0fc6a6	[SimpleLoopUnswitch] Clear dispos in deleteDeadBlocksFromLoop. SimpleLoopUnswitch may remove blocks from loops. Clear block and loop dispositions in that case, to clean up invalid entries in the cache. Fixes #58158. Fixes #58159.	2022-10-05 10:28:15 +01:00
Johannes Doerfert	a9557115b4	[Attributor] Qualify variables to avoid clashes in the future	2022-10-04 19:43:04 -07:00
Gulfem Savrun Yeniceri	d7592bbb03	Revert "Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify" This reverts commit `e1dd2cd063` because the original commit `b20e34b39f` caused a dramatic increase in the build time of RTfuzzer, which caused Fuchsia Clang toolchain builders to time out: https://luci-milo.appspot.com/ui/p/fuchsia/builders/toolchain.ci/clang-linux-x64/b8801248587754572961/overview	2022-10-04 20:57:34 +00:00
Florian Hahn	4f827318e3	[LoopVersioning,LLE] Clear LoopAccessInfoManager after making changes. Loop versioning changes the control-flow, which may impact SCEVs cached for other loops in LoopAccessInfoManager. Clear the manager after making changes. Fixes #57825. Depends on D134609. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D134611	2022-10-04 21:35:42 +01:00
Ram-NK	a58b6acf1f	[NFC][LoopInterchange] Clean up of irrelevant dependency checking with isOuterMostDepPositive() The function isOuterMostDepPositive() is checked after negative dependence vectors are normalized to be non-negative, so there will not be any negative dependency ('>' as the outermost non-equal sign) after normalization. Therefore the check in isOuterMostDepPositive() is irrelevant and redundant. Reviewed By: congzhe Differential Revision: https://reviews.llvm.org/D132982	2022-10-04 14:54:08 -04:00
Alexey Bataev	ab9a81f736	[SLP]Try to emit canonical shuffle with undef operand. In the canonical form of the shuffle the poison/undef operand is the second operand, so the patch tries to emit the canonical form for partial vectorization of the buildvector sequence. Also, this patch starts emitting a freeze instruction for shuffles with undef indices if the second shuffle operand is undef, not poison. It is an initial step to D93818, where undef mask elements are treated as returning a poison value. Differential Revision: https://reviews.llvm.org/D134377	2022-10-04 08:16:07 -07:00
Nikita Popov	e1dd2cd063	Reapply [InstCombine] Switch foldOpIntoPhi() to use InstSimplify Reapply with a fix for the case where an operand simplified back to the original phi: We need to map this case to the new phi node. ----- foldOpIntoPhi() currently only folds operations into the phi if all but one operands constant-fold. The two exceptions to this are freeze and select, where we allow more general simplification. This patch makes foldOpIntoPhi() generally simplification based and removes all the instruction-specific logic. We just try to simplify the instruction for each operand, and for the (potentially) one non-simplified operand, we move it into the new block with adjusted operands. This fixes https://github.com/llvm/llvm-project/issues/57448, which was my original motivation for the change.	2022-10-04 15:18:34 +02:00
Alex Richardson	16f9c5577d	[SimplifyLibCalls] Retain attributes added by Builder.CreateMem* This currently does not make much of a difference (only one test is affected), but it is helpful e.g. for the out-of-tree CHERI target where Builder.CreateMemCpy() can add attributes other than parameter alignment. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D135075	2022-10-04 13:11:34 +00:00
Bjorn Pettersson	491ac8f3e8	[LibCalls] Cast Char argument to 'int' before calling emitFPutC The helpers in BuildLibCalls normally expect that the Value arguments already have the correct type (matching the lib call signature). An exception has been emitFPutC, which cast the Char argument to 'int' using CreateIntCast. This patch moves the cast to the caller instead of doing it inside emitFPutC. I think it makes sense to make the BuildLibCall APIs a bit more consistent this way, despite the need to handle the int cast in two different places now. Differential Revision: https://reviews.llvm.org/D135066	2022-10-04 12:52:05 +02:00
Bjorn Pettersson	aa1b64cc42	[BuildLibCalls] Use TLI to get 'int' and 'size_t' type sizes Stop assuming that an 'int' is 32 bits in helpers that emit libcalls to lib functions that have 'int' in the signature. For most targets this is NFC. For a target with a 16-bit 'int' type this could help detect attempts to emit a libcall with an incorrect signature. Similarly we now derive the type mapping to 'size_t' by asking TLI about the size of 'size_t'. This should be NFC (at least for in-tree targets) since getSizeTSize(), in TLI, derives the size in the same way as DataLayout::getIntPtrType(). Differential Revision: https://reviews.llvm.org/D135065	2022-10-04 12:52:05 +02:00
Bjorn Pettersson	73e8d95d28	[BuildLibCalls] Name types to identify when 'int' and 'size_t' is assumed. NFC Lots of BuildLibCalls helpers are using Builder::getInt32Ty to get a type matching an 'int', and DataLayout::getIntPtrType to get a type matching 'size_t'. The former is not true for all targets, since an 'int' isn't always 32 bits. And the latter is a bit weird as well, as the definition of DataLayout::getIntPtrType doesn't clearly map it to 'size_t'. This patch is not aiming at solving any such problems. It is merely highlighting when a libcall is expecting to use 'int' and 'size_t' by naming the types as IntTy and SizeTTy when preparing the type signatures for the emitted libcalls. Differential Revision: https://reviews.llvm.org/D135064	2022-10-04 12:52:05 +02:00
Florian Hahn	825e16969e	[LAA] Pass LoopAccessInfoManager instead of GetLAA function. Use LoopAccessInfoManager directly instead of various GetLAA lambdas. Depends on D134608. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D134609	2022-10-04 11:51:25 +01:00
Florian Hahn	e399dd601f	[SimpleLoopUnswitch] Clear block and loop dispos after destroying loop. SimpleLoopUnswitch may remove loops. Clear block and loop dispositions, to clean up invalid entries in the cache. Fixes #58136.	2022-10-04 10:27:52 +01:00
Nikita Popov	635f93dff7	[SimplifyLibCalls] Place deref attr even if nonnull already set If nonnull is already set, we currently skip setting both nonnull and dereferenceable. Make these independent, to avoid regressions when additional nonnull attributes are inferred earlier.	2022-10-04 11:26:15 +02:00
Nikita Popov	0f32f0e147	Revert "[InstCombine] Switch foldOpIntoPhi() to use InstSimplify" This reverts commit `b20e34b39f`. This causes RAUW type mismatch assertions on some buildbots, reverting for now.	2022-10-04 11:17:09 +02:00
Nikita Popov	b20e34b39f	[InstCombine] Switch foldOpIntoPhi() to use InstSimplify foldOpIntoPhi() currently only folds operations into the phi if all but one operands constant-fold. The two exceptions to this are freeze and select, where we allow more general simplification. This patch makes foldOpIntoPhi() generally simplification based and removes all the instruction-specific logic. We just try to simplify the instruction for each operand, and for the (potentially) one non-simplified operand, we move it into the new block with adjusted operands. This fixes https://github.com/llvm/llvm-project/issues/57448, which was my original motivation for the change.	2022-10-04 10:12:14 +02:00

31780 Commits