llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	1a60ae02c6	[InstCombine] fold mask-with-signbit-splat to icmp+select ~(iN X s>> (N-1)) & Y --> (X s< 0) ? 0 : Y https://alive2.llvm.org/ce/z/JKlQ9x This is similar to D111410 / `727e642e97` , but it includes a 'not' of the signbit and so it saves an instruction in the basic pattern. DAGCombiner or target-specific folds can expand this back into bit-hacks. The diffs in the logical-select tests are not true regressions - running early-cse and another round of instcombine is expected in a normal opt pipeline, and that reduces back to a minimal form as shown in the duplicated PhaseOrdering test. I have no understanding of the SystemZ diffs, so I made the minimal edits suggested by FileCheck to make that test pass again. That whole test file is wrong though. It is running the entire optimizer (-O2) to check IR, and then topping that by even running codegen and checking asm. It needs to be split up. Fixes #52631	2021-12-14 16:00:42 -05:00
Sanjay Patel	fd5e493874	[PhaseOrdering] add tests for vector select; NFC The 1st test corresponds to a minimally optimized (mem2reg) version of the example in: issue #52631 The 2nd test copies an existing instcombine test with the same pattern. If we canonicalize differently, we can miss reducing to minimal form in a single invocation of -instcombine, but that should not escape the normal opt pipeline.	2021-12-14 14:35:10 -05:00
Sanjay Patel	8bfcf1ab6c	[InstCombine] move/add tests for binops with sext operand; NFC	2021-11-22 14:43:57 -05:00
Sanjay Patel	337948ac6e	[InstCombine] add folds for binop with sexted bool and constant operands This is a generalization/extension of the existing and/or folds noted with TODO comments. Those have a one-use constraint that is not necessary. Potential follow-ups are noted by the TODO comments in the new function. We can also call this function from other binop visit* functions, but we need to add tests first. This solves: https://llvm.org/PR52543 https://alive2.llvm.org/ce/z/NWuCR5	2021-11-20 12:33:00 -05:00
Sanjay Patel	1d007d0e5a	[InstCombine] add tests for bitwise logic with bool op; NFC	2021-11-20 12:32:55 -05:00
Sanjay Patel	491efa7f31	[InstCombine] add/adjust tests for mask of sext i1; NFC These are sibling transforms, but the test coverage was uneven and incomplete.	2021-11-19 16:07:18 -05:00
Nikita Popov	861adaf2ad	[InstCombine] Support splat vectors in some and of icmp folds Replace m_ConstantInt() with m_APInt() to support splat vectors in addition to scalar integers.	2021-11-10 22:37:54 +01:00
Nikita Popov	fa4e9e64e2	[InstCombine] Add vector variants to merge-icmps.ll (NFC) And regenerate test checks.	2021-11-10 22:36:06 +01:00
Sanjay Patel	727e642e97	[InstCombine] generalize fold for mask-with-signbit-splat (iN X s>> (N-1)) & Y --> (X < 0) ? Y : 0 https://alive2.llvm.org/ce/z/qeYhdz I was looking at a missing abs() transform and found my way to this generalization of an existing fold that was added with D67799. As discussed in that review, we want to make sure codegen handles this difference well, and for all of the targets/types that I spot-checked, it looks good. I am leaving the existing fold in place in this commit because it covers a potentially missing icmp fold, but I plan to remove that as a follow-up commit as suggested during review. Differential Revision: https://reviews.llvm.org/D111410	2021-10-15 16:25:48 -04:00
Sanjay Patel	a35673f4cf	[InstCombine] add tests for (i32 X s>> 31) & Y; NFC Also regenerate some check lines to more accurately show current transforms via name changes.	2021-10-08 11:07:57 -04:00
Sanjay Patel	41ff7612b3	[InstCombine] allow splat vectors for narrowing masked fold Mostly cosmetic diffs, but the use of m_APInt matches splat constants.	2021-09-17 11:24:16 -04:00
Sanjay Patel	3a587ed20f	[InstCombine] add vector tests for 'and' folds; NFC	2021-09-17 11:24:16 -04:00
Juneyoung Lee	8a156d1c27	[InstCombine] Fully disable select to and/or i1 folding This is a patch that disables the poison-unsafe select -> and/or i1 folding. It has been blocking D72396 and also has been the source of a few miscompilations described in llvm.org/pr49688 . D99674 conditionally blocked this folding and successfully fixed the latter one. The former one was still blocked, and this patch addresses it. Note that a few test functions that has `_logical` suffix are now deoptimized. These are created by @nikic to check the impact of disabling this optimization by copying existing original functions and replacing and/or with select. I can see that most of these are poison-unsafe; they can be revived by introducing freeze instruction. I left comments at fcmp + select optimizations (or-fcmp.ll, and-fcmp.ll) because I think they are good targets for freeze fix. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D101191	2021-05-06 09:29:52 +09:00
Nikita Popov	fb063c933f	[InstCombine] Duplicate tests for logical and/or (NFC) This replicates existing and/or tests to also test variants using select. This should help us get a more accurate view on which optimizations we're missing if we disable the select -> and/or fold.	2021-01-12 21:50:41 +01:00
Dávid Bolvanský	c1937c2af2	[NFC] Added/adjusted tests for PR48604; second pattern	2020-12-31 14:59:15 +01:00
Dávid Bolvanský	742ea77ca4	[InstCombine] Transform (A + B) - (A \| B) to A & B (PR48604) define i32 @src(i32 %x, i32 %y) { %0: %a = add i32 %x, %y %o = or i32 %x, %y %r = sub i32 %a, %o ret i32 %r } => define i32 @tgt(i32 %x, i32 %y) { %0: %b = and i32 %x, %y ret i32 %b } Transformation seems to be correct! https://alive2.llvm.org/ce/z/aQRh2j	2020-12-31 14:03:20 +01:00
Dávid Bolvanský	9b64939463	[NFC] Added tests for PR48604	2020-12-31 14:03:20 +01:00
Bhramar Vatsa	fd679107d6	[InstCombine] Optimize away the unnecessary multi-use sign-extend C.f. https://bugs.llvm.org/show_bug.cgi?id=47765 Added a case for handling the sign-extend (Shl+AShr) for multiple uses, to optimize it away for an individual use, when the demanded bits aren't affected by sign-extend. https://rise4fun.com/Alive/lgf Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D91343	2020-12-01 16:54:00 +03:00
Sanjay Patel	dfd2858b7f	[InstCombine] add test comments for negative tests; NFC	2020-11-20 07:59:46 -05:00
Sanjay Patel	4a66a1d17a	[InstCombine] allow vectors for masked-add -> xor fold https://rise4fun.com/Alive/I4Ge Name: add with pow2 mask Pre: isPowerOf2(C2) && (C1 & C2) != 0 && (C1 & (C2-1)) == 0 %a = add i8 %x, C1 %r = and i8 %a, C2 => %n = and i8 %x, C2 %r = xor i8 %n, C2	2020-11-17 13:36:08 -05:00
Sanjay Patel	f791ad7e1e	[InstCombine] remove scalar constraint for mask-of-add fold https://rise4fun.com/Alive/V6fP Name: add with low mask Pre: (C1 & (-1 u>> countLeadingZeros(C2))) == 0 %a = add i8 %x, C1 %r = and i8 %a, C2 => %r = and i8 %x, C2	2020-11-17 12:13:45 -05:00
Sanjay Patel	c15e5bdfb7	[InstCombine] add vector test for mask of add; NFC	2020-11-17 12:13:45 -05:00
Sanjay Patel	433696911a	[InstCombine] relax constraints on mask-of-add There are 2 changes: 1. Remove the unnecessary one-use check. 2. Remove the unnecessary power-of-2 check. https://rise4fun.com/Alive/V6fP Name: add with low mask Pre: (C1 & (-1 u>> countLeadingZeros(C2))) == 0 %a = add i8 %x, C1 %r = and i8 %a, C2 => %r = and i8 %x, C2	2020-11-17 12:13:44 -05:00
Sanjay Patel	1d7d9d6685	[InstCombine] add tests for masked add; NFC	2020-11-17 12:13:44 -05:00
Sanjay Patel	4e68bc0999	Revert "[InstCombine] add multi-use demanded bits fold for add with low-bit mask" This reverts commit `e56103d250`. There is a stage2 msan failure blamed on this commit: http://lab.llvm.org:8011/#/builders/74/builds/888/steps/9/logs/stdio	2020-11-16 14:48:09 -05:00
Sanjay Patel	6ddc237766	[InstCombine] reduce code for flip of masked bit; NFC There are 1-2 potential follow-up NFC commits to reduce this further on the way to generalizing this for vectors. The operand replacing path should be dead code because demanded bits handles that more generally (D91415).	2020-11-15 15:43:34 -05:00
Sanjay Patel	e56103d250	[InstCombine] add multi-use demanded bits fold for add with low-bit mask I noticed an add example like the one from D91343, so here's a similar patch. The logic is based on existing code for the single-use demanded bits fold. But I only matched a constant instead of using compute known bits on the operands because that was the motivating patterni that I noticed. I think this will allow removing a special-case (but incomplete) dedicated fold within visitAnd(), but I need to untangle the existing code to be sure. https://rise4fun.com/Alive/V6fP Name: add with low mask Pre: (C1 & (-1 u>> countLeadingZeros(C2))) == 0 %a = add i8 %x, C1 %r = and i8 %a, C2 => %r = and i8 %x, C2 Differential Revision: https://reviews.llvm.org/D91415	2020-11-15 15:09:49 -05:00
Sanjay Patel	91aa211ea1	[InstCombine] add vector tests for multi-use demanded bits; NFC See D91415.	2020-11-15 15:09:49 -05:00
Sanjay Patel	96f4aa6765	[InstCombine] add tests for low-mask-of-add; NFC	2020-11-12 17:02:14 -05:00
Sanjay Patel	8a1e6366d0	[InstCombine] add tests for mask of sext-in-reg; NFC	2020-11-12 12:42:54 -05:00
Simon Pilgrim	fe8281e2d0	[InstCombine] visitAnd - add some ((val OP C1) & C2) vector test coverage	2020-10-16 15:43:11 +01:00
Sanjay Patel	2552f65183	[InstCombine] fold mask op into casted shift (PR46013) https://rise4fun.com/Alive/Qply8 Pre: C2 == (-1 u>> zext(C1)) %a = ashr %x, C1 %s = sext %a to i16 %r = and i16 %s, C2 => %s2 = sext %x to i16 %r = lshr i16 %s2, zext(C1) https://bugs.llvm.org/show_bug.cgi?id=46013	2020-06-07 09:33:18 -04:00
Sanjay Patel	c6719d0b47	[InstCombine] add tests for bitmask of casted shift; NFC (PR46013)	2020-06-07 09:33:18 -04:00
Sanjay Patel	455ce7816c	[InstCombine] fold a shifted bool zext to a select (2nd try) The 1st attempt at rL374828 inserted the code at the wrong position (outside of the constant-shift-amount block). Trying again with an additional test to verify const-ness. For a constant shift amount, add the following fold. shl (zext (i1 X)), ShAmt --> select (X, 1 << ShAmt, 0) https://rise4fun.com/Alive/IZ9 Fixes PR42257. Based on original patch by @zvi (Zvi Rackover) Differential Revision: https://reviews.llvm.org/D63382 llvm-svn: 374886	2019-10-15 13:12:44 +00:00
Sanjay Patel	4335d8f0e8	Revert [InstCombine] fold a shifted bool zext to a select This reverts r374828 (git commit `1f40f15d54`) due to bot breakage llvm-svn: 374851	2019-10-14 23:55:39 +00:00
Sanjay Patel	1f40f15d54	[InstCombine] fold a shifted bool zext to a select For a constant shift amount, add the following fold. shl (zext (i1 X)), ShAmt --> select (X, 1 << ShAmt, 0) https://rise4fun.com/Alive/IZ9 Fixes PR42257. Based on original patch by @zvi (Zvi Rackover) Differential Revision: https://reviews.llvm.org/D63382 llvm-svn: 374828	2019-10-14 21:56:40 +00:00
Eric Christopher	cee313d288	Revert "Temporarily Revert "Add basic loop fusion pass."" The reversion apparently deleted the test/Transforms directory. Will be re-reverting again. llvm-svn: 358552	2019-04-17 04:52:47 +00:00
Eric Christopher	a863435128	Temporarily Revert "Add basic loop fusion pass." As it's causing some bot failures (and per request from kbarton). This reverts commit r358543/ab70da07286e618016e78247e4a24fcb84077fda. llvm-svn: 358546	2019-04-17 02:12:23 +00:00
Sanjay Patel	aa766efd09	[InstCombine] fix demanded-bits propagation for zext/trunc I was comparing the demanded-bits implementations between InstCombine and TargetLowering as part of investigating questions in D42088 and noticed that this was wrong in IR. We were losing all of the prior known bits when we got back to the 'zext'. llvm-svn: 322662	2018-01-17 14:39:28 +00:00
Sanjay Patel	178deccb63	[InstCombine] add test to show hole in demanded bits; NFC llvm-svn: 322660	2018-01-17 14:27:35 +00:00
Craig Topper	d918d5b36b	[InstCombine] Improve the expansion in SimplifyUsingDistributiveLaws to handle cases where one side doesn't simplify, but the other side resolves to an identity value Summary: If one side simplifies to the identity value for inner opcode, we can replace the value with just the operation that can't be simplified. I've removed a couple now unneeded special cases in visitAnd and visitOr. There are probably other cases I missed. Reviewers: spatel, majnemer, hfinkel, dberlin Reviewed By: spatel Subscribers: grandinj, llvm-commits, spatel Differential Revision: https://reviews.llvm.org/D35451 llvm-svn: 308111	2017-07-15 21:49:49 +00:00
Craig Topper	e29b112f49	[InstCombine] Add test cases for (X & (Y \| ~X)) -> (X & Y) where the not is an inverted compare. NFC Do the same for (X \| (Y & ~X)) -> (X \| Y) llvm-svn: 308104	2017-07-15 17:09:23 +00:00
Craig Topper	860e0ba5da	[InstCombine] Move 4 test cases from a test that didn't use FileCheck and merge them into a existing test file. NFC llvm-svn: 308103	2017-07-15 17:09:22 +00:00
Sanjay Patel	3d8905f3a0	[InstCombine] fix typo in test comment; NFC llvm-svn: 302669	2017-05-10 14:25:23 +00:00
Sanjay Patel	4e312203af	[InstCombine] consolidate more DeMorgan tests; NFC llvm-svn: 301800	2017-05-01 14:10:59 +00:00
Craig Topper	ba01143193	[InstCombine] Add missing commute handling to (A \| B) & (B ^ (~A)) -> (A & B) The matching here wasn't able to handle all the possible commutes. It always assumed the not would be on the left of the xor, but that's not guaranteed. Differential Revision: https://reviews.llvm.org/D32474 llvm-svn: 301316	2017-04-25 15:19:04 +00:00
Craig Topper	e4d7ac4cb1	[InstCombine] Add test cases showing failures to handle commuted patterns after tricking the operand complexity sorting. llvm-svn: 301296	2017-04-25 06:22:17 +00:00
Sanjay Patel	9d39a9d860	[InstCombine] add/move tests for and/or-of-icmps equality folds; NFC llvm-svn: 300357	2017-04-14 18:19:27 +00:00
Craig Topper	afa07c5ef6	[InstCombine] Extend some OR combines to support vectors. This adds support for these combines for vectors (X^C)\|Y -> (X\|Y)^C iff Y&C == 0 Y\|(X^C) -> (X\|Y)^C iff Y&C == 0 llvm-svn: 299822	2017-04-09 06:12:41 +00:00
Craig Topper	e63c21b1ba	[InstCombine] Extend a canonicalization check to apply to vector constants too. llvm-svn: 299821	2017-04-09 06:12:39 +00:00

1 2

71 Commits