llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	89fae41ef1	[IR] llvm::Optional => std::optional Many llvm/IR/* files have been migrated by other contributors. This migrates most remaining files.	2022-12-05 04:13:11 +00:00
Krzysztof Parzyszek	f3b6dbfda8	Instructions: convert Optional to std::optional	2022-12-04 14:25:11 -06:00
Benjamin Kramer	856f7937c7	Compress a few pairs using PointerIntPairs Use the uniform structured bindings interface where possible. NFCI.	2022-12-04 16:55:16 +01:00
Kazu Hirata	343de6856e	[Transforms] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 21:11:37 -08:00
chenglin.bi	e719550e6f	[InstCombine] fold icmp + select pattern by distributive laws `C ? (Y != X) : (Z != X) --> (C ? Y : Z) != X` `C ? (Y == X) : (Z == X) --> (C ? Y : Z) == X` https://alive2.llvm.org/ce/z/-frXfs Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D139076	2022-12-03 07:56:19 +08:00
Krzysztof Parzyszek	86fe4dfdb6	TargetTransformInfo: convert Optional to std::optional Recommit: added missing "#include <cstdint>".	2022-12-02 11:42:15 -08:00
Krzysztof Parzyszek	4e12d1836a	Revert "TargetTransformInfo: convert Optional to std::optional" This reverts commit `b83711248c`. Some buildbots are failing.	2022-12-02 11:34:04 -08:00
Krzysztof Parzyszek	b83711248c	TargetTransformInfo: convert Optional to std::optional	2022-12-02 11:27:12 -08:00
chenglin.bi	683b9fc7bd	[Instcombine] Code refactors for foldSelectOpOp; NFC Reuse the code about find common operator. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D139075	2022-12-02 22:27:10 +08:00
Krzysztof Parzyszek	26424c96c0	Attributes: convert Optional to std::optional	2022-12-02 08:15:45 -06:00
Krzysztof Parzyszek	467432899b	MemoryLocation: convert Optional to std::optional	2022-12-01 15:36:20 -08:00
Sanjay Patel	dd8d0d21ce	[InstCombine] canonicalize trunc + insert as bitcast + shuffle, part 2 This enhances the base fold from part 1 to allow mapping a right-shift to an insert index. Example of translating a middle chunk of the scalar to vector for either endian: https://alive2.llvm.org/ce/z/fRXCOZ This only allows creating an identity shuffle (with optional shortening/lengthening) because that is considered the safe baseline for any target (can be inverted if needed). If we tried this fold with target-specific costs/legality, then we could do the transform more generally. Differential Revision: https://reviews.llvm.org/D138873	2022-12-01 14:47:37 -05:00
Sanjay Patel	b7c7fe3d07	[InstCombine] improve efficiency of bool logic; NFC As noted in issue #59266, the logic reduction could be beyond the capabilities of an optimizing compiler, and the code with ternary op is easier to read either way.	2022-12-01 14:47:37 -05:00
Sanjay Patel	e71b81cab0	[InstCombine] canonicalize trunc + insert as bitcast + shuffle, part 1 (2nd try) The first attempt was reverted because a clang test changed unexpectedly - the file is already marked with a FIXME, so I just updated it this time to pass. Original commit message: This is the main patch for converting a truncated scalar that is inserted into a vector to bitcast+shuffle. We could go either way on patterns like this, but this direction will allow collapsing a pair of these sequences on the motivating example from issue The patch is split into 3 parts to make it easier to see the progression of tests diffs. We allow inserting/shuffling into a different size vector for flexibility, so there are several test variations. The length-changing is handled by shortening/padding the shuffle mask with undef elements. In part 1, handle the basic pattern: inselt undef, (trunc T), IndexC --> shuffle (bitcast T), IdentityMask Proof for the endian-dependency behaving as expected: https://alive2.llvm.org/ce/z/BsA7yC The TODO items for handling shifts and insert into an arbitrary base vector value are implemented as follow-ups. Differential Revision: https://reviews.llvm.org/D138872	2022-11-30 14:52:20 -05:00
Sanjay Patel	5eacdcff06	Revert "[InstCombine] canonicalize trunc + insert as bitcast + shuffle, part 1" This reverts commit `a4c466766d`. This broke clang tests that are wrongly dependent on the optimizer.	2022-11-30 14:10:50 -05:00
Sanjay Patel	a4c466766d	[InstCombine] canonicalize trunc + insert as bitcast + shuffle, part 1 This is the main patch for converting a truncated scalar that is inserted into a vector to bitcast+shuffle. We could go either way on patterns like this, but this direction will allow collapsing a pair of these sequences on the motivating example from issue The patch is split into 3 parts to make it easier to see the progression of tests diffs. We allow inserting/shuffling into a different size vector for flexibility, so there are several test variations. The length-changing is handled by shortening/padding the shuffle mask with undef elements. In part 1, handle the basic pattern: inselt undef, (trunc T), IndexC --> shuffle (bitcast T), IdentityMask Proof for the endian-dependency behaving as expected: https://alive2.llvm.org/ce/z/BsA7yC The TODO items for handling shifts and insert into an arbitrary base vector value are implemented as follow-ups. Differential Revision: https://reviews.llvm.org/D138872	2022-11-30 13:22:04 -05:00
William Huang	be4b1dd35b	[InstCombine] Revert D125845 Reverting D125845 `[InstCombine] Canonicalize GEP of GEP by swapping constant-indexed GEP to the back` because multiple users reported performance regression Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D138950	2022-11-29 22:02:40 +00:00
Sanjay Patel	a00936484b	[InstCombine] improve readability of combineLoadToOperationType(); NFC	2022-11-28 16:00:06 -05:00
Kazu Hirata	3da96e0361	[InstCombine] Use std::optional in InstructionCombining.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:49:50 -08:00
Kazu Hirata	881076cde2	[InstCombine] Use std::optional in InstCombinePHI.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:46:38 -08:00
Matthias Gehre	5a1d92fa3e	[InstCombine] Update debug intrinsics when rewriting allocas	2022-11-25 08:20:54 +01:00
Sanjay Patel	535c5d56a7	[InstCombine] ease restriction for extractelt (bitcast X) fold We were checking for a desirable integer type even when there is no shift in the transform. This is unnecessary since we are truncating directly to the destination type. This removes an extractelt in more cases and seems to make the canonicalization more uniform overall. There's still a potential difference between patterns that need a shift vs. trunc-only. I'm not sure if that is worth keeping at this point, but it can be adjusted in another step (assuming this change does not cause trouble). In the most basic case where I noticed this, we missed a fold that would have completely removed vector ops from a pattern like: https://alive2.llvm.org/ce/z/y4Qdte	2022-11-24 13:27:19 -05:00
Sanjay Patel	bf7f87e62c	[InstCombine] reduce code duplication in foldBitcastExtElt(); NFC	2022-11-24 10:16:37 -05:00
Matt Arsenault	6463961941	InstCombine: Fold some identities for canonicalize Equality is directly stated as true in the LangRef, and I believe this works for every compare type.	2022-11-22 21:42:44 -05:00
Thomas Symalla	470aea5ed4	[InstCombine] Fold extractelt with select of constants An extractelt with a constant index which extracts an element from the two vector operands of a select can be directly folded into a select. extractelt (select %x, %vec1, %vec2), %const -> select %x, %vec1[%const], %vec2[%const] Note: the implementation currently only works for constant vector operands. Reviewed By: foad, spatel Differential Revision: https://reviews.llvm.org/D137934	2022-11-22 14:07:06 +01:00
OCHyams	4ba08d512c	[Assignment Tracking][24/*] Always RemoveRedundantDbgInstrs in instcombine in assignment tracking builds The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir This reduces peak memory overhead by 15% when building CTMark's tramp3d-v4 with -O2 -g with assignment tracking enabled. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D133321	2022-11-18 12:36:41 +00:00
OCHyams	fcd5098a03	[Assignment Tracking][14/*] Account for assignment tracking in instcombine The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir Most of the updates here are just to ensure DIAssignID attachments are maintained and propagated correctly. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D133307	2022-11-18 09:25:33 +00:00
Sanjay Patel	362c23500a	Revert "[InstCombine] allow more folds for multi-use selects (2nd try)" This reverts commit `6eae6b3722`. This version of the patch results in the same DFSAN bot failure as before, so my guess about the SimplifyQuery context instruction was wrong. I don't know what the real bug is.	2022-11-13 11:47:21 -05:00
Sanjay Patel	6eae6b3722	[InstCombine] allow more folds for multi-use selects (2nd try) The 1st try ( `681a6a3990` ) was reverted because it caused a DataFlowSanitizer bot failure. This try modifies the existing calls to simplifyBinOp() to not use a query that sets the context instruction because that seems like a likely source of failure. Since we already try those simplifies with multi-use patterns in some cases, that means the bug is likely present even without this patch. However, I have not been able to reduce a test to prove that this was the bug, so if we see any bot failures with this patch, then it should be reverted again. The reduced simplify power does not affect any optimizations in existing, motivating regression tests. Original commit message: The 'and' case showed up in a recent bug report and prevented more follow-on transforms from happening. We could handle more patterns (for example, the select arms simplified, but not to constant values), but this seems like a safe, conservative enhancement. The backend can convert select-of-constants to math/logic in many cases if it is profitable. There is a lot of overlapping logic for these kinds of patterns (see SimplifySelectsFeedingBinaryOp() and FoldOpIntoSelect()), so there may be some opportunity to improve efficiency. There are also optimization gaps/inconsistency because we do not call this code for all bin-opcodes (see TODO for ashr test).	2022-11-13 10:28:06 -05:00
Michał Górny	f6f1fd443f	Revert "[InstCombine] allow more folds more multi-use selects" This reverts commit `681a6a3990`. It broke sanitizer tests (as seen on buildbots), see: https://reviews.llvm.org/rG681a6a399022#1143137	2022-11-13 07:27:01 +01:00
Sanjay Patel	681a6a3990	[InstCombine] allow more folds more multi-use selects The 'and' case showed up in a recent bug report and prevented more follow-on transforms from happening. We could handle more patterns (for example, the select arms simplified, but not to constant values), but this seems like a safe, conservative enhancement. The backend can convert select-of-constants to math/logic in many cases if it is profitable. There is a lot of overlapping logic for these kinds of patterns (see SimplifySelectsFeedingBinaryOp() and FoldOpIntoSelect()), so there may be some opportunity to improve efficiency. There are also optimization gaps/inconsistency because we do not call this code for all bin-opcodes (see TODO for ashr test).	2022-11-11 15:26:54 -05:00
William Huang	bd2b5ec803	[InstCombine] PR58901 - fix bug with swapping GEP of different types Fix https://github.com/llvm/llvm-project/issues/58901 by adding stricter check whether non-opaque GEP can be swapped. This will not affect GEP swapping optimization in the future since we are switching to opaque GEP Reviewed By: clin1 Differential Revision: https://reviews.llvm.org/D137752	2022-11-10 20:24:41 +00:00
Matt Arsenault	e661185fb3	InstCombine: Fold fdiv nnan x, 0 -> copysign(inf, x) https://alive2.llvm.org/ce/z/gLBFKB	2022-11-07 22:00:15 -08:00
Matt Devereau	a8c24d57b8	[InstCombine] Remove redundant splats in InstCombineVectorOps Splatting the first vector element of the result of a BinOp, where any of the BinOp's operands are the result of a first vector element splat can be simplified to splatting the first vector element of the result of the BinOp Differential Revision: https://reviews.llvm.org/D135876	2022-11-07 15:39:05 +00:00
Matt Arsenault	0f68ffe1e2	InstCombine: Fold compare with smallest normal if input denormals are flushed Try to simplify comparisons with the smallest normalized value. If denormals will be treated as 0, we can simplify by using an equality comparison with 0. fcmp olt fabs(x), smallest_normalized_number -> fcmp oeq x, 0.0 fcmp ult fabs(x), smallest_normalized_number -> fcmp ueq x, 0.0 fcmp oge fabs(x), smallest_normalized_number -> fcmp one x, 0.0 fcmp ult fabs(x), smallest_normalized_number -> fcmp ueq x, 0.0 The device libraries have a few range checks that look like this for denormal handling paths.	2022-11-07 07:16:47 -08:00
Sanjay Patel	1c6ebe29d3	[InstCombine] reduce multi-use casts+masks As noted in the code comment, we could generalize this: https://alive2.llvm.org/ce/z/N5m-eZ It saves an instruction even without a constant operand, but the 'and' is wider. We can do that as another step if it doesn't harm anything. I noticed that this missing pattern with a constant operand inhibited other transforms in a recent bug report, so this is enough to solve that case.	2022-11-06 09:07:17 -05:00
chenglin.bi	6703290361	[InstCombine] fold `sub + and` pattern with specific const value `C1 - ((C3 - X) & C2) --> (X & C2) + (C1 - (C2 & C3))` when: (C3 - ((C2 & C3) - 1)) is pow2 && ((C2 + C3) & ((C2 & C3) - 1)) == ((C2 & C3) - 1) && C2 is negative pow2 \|\| (C3 - X) is nuw https://alive2.llvm.org/ce/z/HXQJV- Fix: #58523 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D136582	2022-11-05 12:58:45 +08:00
Peter Waller	e1790c8c29	Revert "[InstCombine] Remove redundant splats in InstCombineVectorOps" This reverts commit `957eed0b1a`.	2022-11-03 07:56:03 +00:00
Sanjay Patel	f03b069c5b	[InstCombine] fold mul with decremented "shl -1" factor (2nd try) This is a corrected version of: `bc886e9b58` I made a copy-paste error that created an "add" instead of the intended "sub" on that attempt. The regression tests showed the bug, but I overlooked that. As I said in a comment on issue #58717, the bug reports resulting from the botched patch confirm that the pattern does occur in many real-world applications, so hopefully eliminating the multiply results in better code. I added one more regression test in this version of the patch, and here's an Alive2 proof to show that exact example: https://alive2.llvm.org/ce/z/dge7VC Original commit message: This is a sibling to: `6064e92b0a` ...but we canonicalize the shl+add to shl+xor, so the pattern is different than I expected: https://alive2.llvm.org/ce/z/8CX16e I have not found any patterns that are safe to propagate no-wrap, so that is not included here. Differential Revision: https://reviews.llvm.org/D137157	2022-11-02 09:30:01 -04:00
Sanjay Patel	b24e2f6ef6	[InstCombine] use logical-and matcher to avoid crash Follow-on to: `ec0b406e16` This should prevent crashing for example like issue #58552 by not matching a select-of-vectors-with-scalar-condition. The test that shows a regression seems unlikely to occur in real code. This also picks up an optimization in the case where a real (bitwise) logic op is used. We could already convert some similar select ops to real logic via impliesPoison(), so we don't see more diffs on commuted tests. Using commutative matchers (when safe) might also handle one of the TODO tests.	2022-11-02 08:23:52 -04:00
Matt Devereau	957eed0b1a	[InstCombine] Remove redundant splats in InstCombineVectorOps Splatting the first vector element of the result of a BinOp, where any of the BinOp's operands are the result of a first vector element splat can be simplified to splatting the first vector element of the result of the BinOp Differential Revision: https://reviews.llvm.org/D135876	2022-11-02 11:57:05 +00:00
Sanjay Patel	ec0b406e16	[InstCombine] use logical-or matcher to avoid crash This should prevent crashing for the example in issue #58552 by not matching a select-of-vectors-with-scalar-condition. A similar change is likely needed for the related fold to properly fix that kind of bug. The test that shows a regression seems unlikely to occur in real code. This also picks up an optimization in the case where a real (bitwise) logic op is used. We could already convert some similar select ops to real logic via impliesPoison(), so we don't see more diffs on commuted tests. Using commutative matchers (when safe) might also handle one of the TODO tests.	2022-11-01 16:47:41 -04:00
Sanjay Patel	4299b28a9b	[InstCombine] add helper function for select-of-bools folds; NFC This set of folds keeps growing, and it contains bugs like issue #58552, so make it easier to spot those via backtrace.	2022-11-01 11:06:18 -04:00
Florian Mayer	e1de7ac20f	Revert "[InstCombine] fold mul with decremented "shl -1" factor" This reverts commit `bc886e9b58`. Broke MSAN bootstrap buildbots with Assertion `RangeAfterCopy % ExtraScale == 0 && "Extra instruction requires immediate to be aligned"' failed.	2022-10-31 17:39:05 -07:00
Sanjay Patel	1f8ac37e2d	[InstCombine] adjust branch on logical-and fold The transform was just added with: `115d2f69a5` ...but as noted in post-commit feedback, it was confusingly coded. Now, we create the final expected canonicalized form directly and put an extra use check on the match, so we should not ever end up with more instructions.	2022-10-31 17:39:29 -04:00
Philip Reames	a819f6c8d1	[InstCombine] Allow simplify demanded transformations on scalable vectors Differential Revision: https://reviews.llvm.org/D136475	2022-10-31 13:39:36 -07:00
Patrick Walton	01859da84b	[AliasAnalysis] Introduce getModRefInfoMask() as a generalization of pointsToConstantMemory(). The pointsToConstantMemory() method returns true only if the memory pointed to by the memory location is globally invariant. However, the LLVM memory model also has the semantic notion of locally-invariant: memory that is known to be invariant for the life of the SSA value representing that pointer. The most common example of this is a pointer argument that is marked readonly noalias, which the Rust compiler frequently emits. It'd be desirable for LLVM to treat locally-invariant memory the same way as globally-invariant memory when it's safe to do so. This patch implements that, by introducing the concept of a ModRefInfo mask. A ModRefInfo mask is a bound on the Mod/Ref behavior of an instruction that writes to a memory location, based on the knowledge that the memory is globally-constant memory (in which case the mask is NoModRef) or locally-constant memory (in which case the mask is Ref). ModRefInfo values for an instruction can be combined with the ModRefInfo mask by simply using the & operator. Where appropriate, this patch has modified uses of pointsToConstantMemory() to instead examine the mask. The most notable optimization change I noticed with this patch is that now redundant loads from readonly noalias pointers can be eliminated across calls, even when the pointer is captured. Internally, before this patch, AliasAnalysis was assigning Ref to reads from constant memory; now AA can assign NoModRef, which is a tighter bound. Differential Revision: https://reviews.llvm.org/D136659	2022-10-31 13:03:41 -07:00
Sanjay Patel	115d2f69a5	[InstCombine] canonicalize branch with logical-and-not condition https://alive2.llvm.org/ce/z/EfHlWN In the motivating case from issue #58313, this allows forming a duplicate 'not' op which then gets CSE'd and simplifyCFG'd and combined into the expected 'xor'.	2022-10-31 15:51:45 -04:00
Sanjay Patel	bc886e9b58	[InstCombine] fold mul with decremented "shl -1" factor This is a sibling to: `6064e92b0a` ...but we canonicalize the shl+add to shl+xor, so the pattern is different than I expected: https://alive2.llvm.org/ce/z/8CX16e I have not found any patterns that are safe to propagate no-wrap, so that is not included here.	2022-10-31 09:06:55 -04:00
Patrick Walton	36bbd68667	[InstCombine] Allow memcpys from constant memory to readonly nocapture parameters to be elided. Currently, InstCombine can elide a memcpy from a constant to a local alloca if that alloca is passed as a nocapture parameter to a function that's readnone or readonly, but it can't forward the memcpy if the argument is marked readonly nocapture, even though readonly guarantees that the callee won't mutate the pointee through that pointer. This patch adds support for detecting and handling such situations, which arise relatively frequently in Rust, a frontend that liberally emits readonly. A more general version of this optimization would use alias analysis to check the call's ModRef info for the pointee, but I was concerned about blowing up compile time, so for now I'm just checking for one of readnone on the function, readonly on the function, or readonly on the parameter. Differential Revision: https://reviews.llvm.org/D136822	2022-10-30 14:41:03 -07:00

1 2 3 4 5 ...

5236 Commits