llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	0f29e953b7	[InstCombine] canonicalize fneg with llvm.sin This is a follow-up to rL339604 which did the same transform for a sin libcall. The handling of intrinsics vs. libcalls is unfortunately scattered, so I'm just adding this next to the existing transform for llvm.cos for now. This should resolve PR38458: https://bugs.llvm.org/show_bug.cgi?id=38458 If the call was already negated, the negates will cancel each other out. llvm-svn: 340952	2018-08-29 18:27:49 +00:00
Sanjay Patel	12a7ea44ed	[InstCombine] add tests for llvm.sin(-x); NFC Also add a corresponding test for llvm.cos with FMF to make sure that was handled correctly. llvm-svn: 340950	2018-08-29 18:11:42 +00:00
Evandro Menezes	22e0bdf4ed	[InstCombine] Expand the simplification of pow() with nested exp{,2}() Expand the simplification of `pow(exp{,2}(x), y)` to all FP types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D51195 llvm-svn: 340948	2018-08-29 17:59:48 +00:00
Evandro Menezes	a3a7b53571	[InstCombine] Expand the simplification of pow() into exp2() Generalize the simplification of `pow(2.0, y)` to `pow(2.0 ** n, y)` for all scalar and vector types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D49273 llvm-svn: 340947	2018-08-29 17:59:34 +00:00
Sanjay Patel	3abd9f6bdc	[InstCombine] add test for vector demanded elements + shrinking; NFC llvm-svn: 340933	2018-08-29 15:34:19 +00:00
Matt Arsenault	10de2775bd	AMDGPU: Remove nan tests in class if src is nnan llvm-svn: 340850	2018-08-28 18:10:02 +00:00
Sanjay Patel	60ffc2e9a4	[InstCombine] fix baseline assertions rL340842 contained the wrong version of the check lines. llvm-svn: 340846	2018-08-28 17:23:20 +00:00
Sanjay Patel	c9756e5a23	[InstCombine] add tests for select narrowing (PR38691); NFC llvm-svn: 340842	2018-08-28 16:45:00 +00:00
Craig Topper	a6cd4b9bce	[InstCombine] Extend (add (sext x), cst) --> (sext (add x, cst')) and (add (zext x), cst) --> (zext (add x, cst')) to work for vectors Differential Revision: https://reviews.llvm.org/D51236 llvm-svn: 340796	2018-08-28 02:02:29 +00:00
Kit Barton	7c80f98b69	[PPC] Remove Darwin support from POWER backend. This patch issues an error message if Darwin ABI is attempted with the PPC backend. It also cleans up existing test cases, either converting the test to use an alternative triple or removing the test if the coverage is no longer needed. Updated Tests ------------- The majority of test cases were updated to use a different triple that does not include the Darwin ABI. Many tests were also updated to use FileCheck, in place of grep. Deleted Tests ------------- llvm/test/tools/dsymutil/PowerPC/sibling.test was originally added to test specific functionality of dsymutil using an object file created with an old version of llvm-gcc for a Powerbook G4. After a discussion with @JDevlieghere he suggested removing the test. llvm/test/CodeGen/PowerPC/combine_loads_from_build_pair.ll was converted from a PPC test to a SystemZ test, as the behavior is also reproducible there. All other tests that were deleted were specific to the darwin/ppc ABI and no longer necessary. Phabricator Review: https://reviews.llvm.org/D50988 llvm-svn: 340795	2018-08-28 01:18:29 +00:00
Craig Topper	e23e8a4f53	[InstCombine] Add test cases for D51236. NFC llvm-svn: 340789	2018-08-27 22:55:49 +00:00
Sanjay Patel	42d31c20a8	[InstCombine] allow shuffle+binop canonicalization with widening shuffles This lines up with the behavior of an existing transform where if both operands of the binop are shuffled, we allow moving the binop before the shuffle regardless of whether the shuffle changes the size of the vector. llvm-svn: 340787	2018-08-27 22:41:44 +00:00
Evandro Menezes	253991cfaf	[PATCH] [InstCombine] Fix issue in the simplification of pow() with nested exp{,2}() Fix the issue of duplicating the call to `exp{,2}()` when it's nested in `pow()`, as exposed by rL340462. Differential revision: https://reviews.llvm.org/D51194 llvm-svn: 340784	2018-08-27 22:11:15 +00:00
Sanjay Patel	57a0b4edd7	[InstCombine] add tests for shuffle+binop transform; NFC llvm-svn: 340683	2018-08-25 14:37:08 +00:00
Florian Hahn	406f1ff1cd	[Local] Make DoesKMove required for combineMetadata. This patch makes the DoesKMove argument non-optional, to force people to think about it. Most cases where it is false are either code hoisting or code sinking, where we pick one instruction from a set of equal instructions among different code paths. Reviewers: dberlin, nlopes, efriedma, davide Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D47475 llvm-svn: 340606	2018-08-24 11:40:04 +00:00
Craig Topper	dfa176e813	[ValueTracking] Fix assert message and add test case for r340546 and PR38677. The bug was already fixed. This just adds a test case for it. llvm-svn: 340556	2018-08-23 17:45:53 +00:00
David Bolvansky	43b0e25847	[InstCombine] Fold Select with binary op - FP opcodes Summary: Follow up for https://reviews.llvm.org/rL339520 and https://reviews.llvm.org/rL338300 Alive: ``` %A = fcmp oeq float %x, 0.0 %B = fadd nsz float %x, %z %C = select i1 %A, float %B, float %y => %C = select i1 %A, float %z, float %y ---------- %A = fcmp oeq float %x, 0.0 %B = fadd nsz float %x, %z %C = select %A, float %B, float %y => %C = select %A, float %z, float %y Done: 1 Optimization is correct %A = fcmp une float %x, -0.0 %B = fadd nsz float %x, %z %C = select i1 %A, float %y, float %B => %C = select i1 %A, float %y, float %z ---------- %A = fcmp une float %x, -0.0 %B = fadd nsz float %x, %z %C = select %A, float %y, float %B => %C = select %A, float %y, float %z Done: 1 Optimization is correct ``` Reviewers: spatel, lebedev.ri Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50714 llvm-svn: 340538	2018-08-23 15:22:15 +00:00
Craig Topper	bec15b6516	[ValueTracking] Teach computeNumSignBits to understand min/max clamp patterns with constant/splat values If we have a min/max pair we can do a better job of counting sign bits if we look at them together. This is similar to what is done in the SelectionDAG version of computeNumSignBits for ISD::SMAX/SMIN. Differential Revision: https://reviews.llvm.org/D51112 llvm-svn: 340480	2018-08-22 23:27:50 +00:00
Evandro Menezes	74135fc79f	[NFC] Expand test cases for simplifying pow() llvm-svn: 340462	2018-08-22 22:44:06 +00:00
Nicola Zaghen	8a012cbabf	[InstCombine] Add new tests for icmp ugt/ult (add nuw X, C2), C Differential Revision: https://reviews.llvm.org/D51040 llvm-svn: 340284	2018-08-21 15:27:32 +00:00
Sanjay Patel	f3ae9cc33e	[InstSimplify] use isKnownNeverNaN to fold more fcmp ord/uno Remove duplicate tests from InstCombine that were added with D50582. I left negative tests there to verify that nothing in InstCombine tries to go overboard. If isKnownNeverNaN is improved to handle the FP binops or other cases, we should have coverage under InstSimplify, so we could remove more duplicate tests from InstCombine at that time. llvm-svn: 340279	2018-08-21 14:45:13 +00:00
Craig Topper	bee74793a3	[InstCombine] Add splat vector constant support to foldICmpAddOpConst. Differential Revision: https://reviews.llvm.org/D50946 llvm-svn: 340231	2018-08-20 23:04:25 +00:00
Michael Berg	0b838deddc	extend binop folds for selects to include true and false binops flag intersection Summary: This change addresses bug 38641 Reviewers: spatel, wristow Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D50996 llvm-svn: 340222	2018-08-20 22:26:58 +00:00
Matt Arsenault	450fcc77a7	ValueTracking: Handle more instructions in isKnownNeverNaN llvm-svn: 340187	2018-08-20 16:51:00 +00:00
Sanjay Patel	5ae83a21b5	[InstCombine] add tests for insertelement+binop; NFC llvm-svn: 340184	2018-08-20 16:49:08 +00:00
Craig Topper	5f695cc1e9	[InstCombine] Add test cases for an icmp combine that is missing support for splat vector constants. llvm-svn: 340144	2018-08-19 18:03:34 +00:00
Matt Arsenault	ea4b476a30	ValueTracking: Add tests for isKnownNeverNaN llvm-svn: 340090	2018-08-17 21:39:52 +00:00
Evandro Menezes	e219d384f9	[NFC] Expand test cases for simplifying pow() In preparation for the improvements that D49273 enables. llvm-svn: 340060	2018-08-17 17:59:38 +00:00
Sanjay Patel	8ba631d9c8	[InstCombine] add reflection fold for tan(-x) This is a follow-up suggested with rL339604. For tan(), we don't have a corresponding LLVM intrinsic -- unlike sin/cos -- so this is the only way/place that we can do this fold currently. llvm-svn: 339958	2018-08-16 22:46:20 +00:00
Sanjay Patel	75714b598d	[InstCombine] add tests for tan with negated arg; NFC llvm-svn: 339953	2018-08-16 22:05:51 +00:00
Michael Berg	ed89d069f4	add a missed case for binary op FMF propagation under select folds llvm-svn: 339938	2018-08-16 20:59:45 +00:00
Evandro Menezes	42422b33cf	[NFC] Fix typo in test cases llvm-svn: 339900	2018-08-16 17:03:22 +00:00
Evandro Menezes	c05c7e11bb	[InstCombine] Expand the simplification of pow(x, 0.5) to sqrt(x) Expand the number of cases when `pow(x, 0.5)` is simplified into `sqrt(x)` by considering the math semantics with more granularity. Differential revision: https://reviews.llvm.org/D50036 llvm-svn: 339887	2018-08-16 15:58:08 +00:00
Sanjay Patel	039f556f44	[InstCombine] move vector compare before same-shuffled ops This is a step towards fixing PR37463: https://bugs.llvm.org/show_bug.cgi?id=37463 llvm-svn: 339875	2018-08-16 12:52:17 +00:00
Matt Arsenault	9a389fbd79	AMDGPU: Stop producing icmp/fcmp intrinsics with invalid types llvm-svn: 339815	2018-08-15 21:14:25 +00:00
Amara Emerson	070ac768ff	[InstCombine] Fix IC trying to create a xor of pointer types. rdar://42473741 Differential Revision: https://reviews.llvm.org/D50775 llvm-svn: 339796	2018-08-15 17:46:22 +00:00
Sanjay Patel	b1546da0e8	[InstCombine] fix typos in tests; NFC See D50036. llvm-svn: 339713	2018-08-14 19:13:07 +00:00
Sanjay Patel	73b7e9f65e	[InstCombine] add tests for pow->sqrt; NFC D50036 should fix the missed optimizations. llvm-svn: 339711	2018-08-14 19:05:37 +00:00
David Bolvansky	ba74d1c4ea	[NFC] Tests for select with binop fold - FP opcodes llvm-svn: 339692	2018-08-14 17:03:47 +00:00
Sanjay Patel	c8e3943e89	[InstCombine] regenerate checks; NFC llvm-svn: 339683	2018-08-14 15:21:13 +00:00
Sanjay Patel	19c7e7dab4	[InstCombine] regenerate checks; NFC llvm-svn: 339681	2018-08-14 15:18:52 +00:00
Tomasz Krupa	e766e5f636	[X86] Constant folding of adds/subs intrinsics Summary: This adds constant folding of signed add/sub with saturation intrinsics. Reviewers: craig.topper, spatel, RKSimon, chandlerc, efriedma Reviewed By: craig.topper Subscribers: rnk, llvm-commits Differential Revision: https://reviews.llvm.org/D50499 llvm-svn: 339659	2018-08-14 09:04:01 +00:00
Roman Lebedev	3534874fbf	[InstCombine] Re-land: Optimize redundant 'signed truncation check pattern'. Summary: This comes with `Implicit Conversion Sanitizer - integer sign change` (D50250): ``` signed char test(unsigned int x) { return x; } ``` `clang++ -fsanitize=implicit-conversion -S -emit-llvm -o - /tmp/test.cpp -O3` * Old: {F6904292} * With this patch: {F6904294} General pattern: X & Y Where `Y` is checking that all the high bits (covered by a mask `4294967168`) are uniform, i.e. `%arg & 4294967168` can be either `4294967168` or `0` Pattern can be one of: %t = add i32 %arg, 128 %r = icmp ult i32 %t, 256 Or %t0 = shl i32 %arg, 24 %t1 = ashr i32 %t0, 24 %r = icmp eq i32 %t1, %arg Or %t0 = trunc i32 %arg to i8 %t1 = sext i8 %t0 to i32 %r = icmp eq i32 %t1, %arg This pattern is a signed truncation check. And `X` is checking that some bit in that same mask is zero. I.e. can be one of: %r = icmp sgt i32 %arg, -1 Or %t = and i32 %arg, 2147483648 %r = icmp eq i32 %t, 0 Since we are checking that all the bits in that mask are the same, and a particular bit is zero, what we are really checking is that all the masked bits are zero. So this should be transformed to: %r = icmp ult i32 %arg, 128 The transform itself ended up being rather horrible, even though i omitted some cases. Surely there is some infrastructure that can help clean this up that i missed? https://rise4fun.com/Alive/3Ou The initial commit (rL339610) was reverted, since the first assert was being triggered. The @positive_with_extra_and test now has coverage for that case. Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: RKSimon, erichkeane, vsk, llvm-commits Differential Revision: https://reviews.llvm.org/D50465 llvm-svn: 339621	2018-08-13 21:54:37 +00:00
Roman Lebedev	93f7e7f03e	[NFC][InstCombine] Add a test for D50465 that used to assert This is valid to fold, too. https://rise4fun.com/Alive/0lz llvm-svn: 339619	2018-08-13 21:49:33 +00:00
Sanjay Patel	15bff18c6f	[SimplifyLibCalls] don't drop fast-math-flags on trig reflection folds (retry r339608) Even though this code is below a function called optimizeFloatingPointLibCall(), we apparently can't guarantee that we're dealing with FPMathOperators, so bail out immediately if that's not true. llvm-svn: 339618	2018-08-13 21:49:19 +00:00
Roman Lebedev	28a42c7706	Revert "[InstCombine] Optimize redundant 'signed truncation check pattern'." At least one buildbot was able to actually trigger that assert on the top of the function. Will investigate. This reverts commit r339610. llvm-svn: 339612	2018-08-13 20:46:22 +00:00
Roman Lebedev	4c4750771f	[InstCombine] Optimize redundant 'signed truncation check pattern'. Summary: This comes with `Implicit Conversion Sanitizer - integer sign change` (D50250): ``` signed char test(unsigned int x) { return x; } ``` `clang++ -fsanitize=implicit-conversion -S -emit-llvm -o - /tmp/test.cpp -O3` * Old: {F6904292} * With this patch: {F6904294} General pattern: X & Y Where `Y` is checking that all the high bits (covered by a mask `4294967168`) are uniform, i.e. `%arg & 4294967168` can be either `4294967168` or `0` Pattern can be one of: %t = add i32 %arg, 128 %r = icmp ult i32 %t, 256 Or %t0 = shl i32 %arg, 24 %t1 = ashr i32 %t0, 24 %r = icmp eq i32 %t1, %arg Or %t0 = trunc i32 %arg to i8 %t1 = sext i8 %t0 to i32 %r = icmp eq i32 %t1, %arg This pattern is a signed truncation check. And `X` is checking that some bit in that same mask is zero. I.e. can be one of: %r = icmp sgt i32 %arg, -1 Or %t = and i32 %arg, 2147483648 %r = icmp eq i32 %t, 0 Since we are checking that all the bits in that mask are the same, and a particular bit is zero, what we are really checking is that all the masked bits are zero. So this should be transformed to: %r = icmp ult i32 %arg, 128 https://rise4fun.com/Alive/3Ou Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: RKSimon, erichkeane, vsk, llvm-commits Differential Revision: https://reviews.llvm.org/D50465 llvm-svn: 339610	2018-08-13 20:33:08 +00:00
Sanjay Patel	66c6fe6534	revert r339608 - [SimplifyLibCalls] don't drop fast-math-flags on trig reflection folds Can't set the builder flags without knowing this is an FPMathOperator. I'll add a test for that and try again. llvm-svn: 339609	2018-08-13 20:20:38 +00:00
Sanjay Patel	981f50919e	[SimplifyLibCalls] don't drop fast-math-flags on trig reflection folds llvm-svn: 339608	2018-08-13 20:14:27 +00:00
Sanjay Patel	e45a83d447	[SimplifyLibCalls] add reflection fold for -sin(-x) (PR38458) This is a very partial fix for the reported problem. I suspect we do not get this fold in most motivating cases because most of the time, the libcall would have been replaced by an intrinsic, and that optimization is handled elsewhere...but maybe it should be handled here? llvm-svn: 339604	2018-08-13 19:24:41 +00:00
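
The reasoning in the 'signed truncation check pattern' commits (4c4750771f, re-landed as 3534874fbf) can be sanity-checked by brute force. The sketch below is not LLVM code. It is a scaled-down Python model that assumes a 16-bit `%arg` instead of the i32 in the commit message so that every input can be enumerated, and the helper names (`signed_trunc_check`, `is_non_negative`, `folded`, `check_all`) are made up for illustration.

```python
BITS = 16                 # assumed width of %arg in this model (the commit uses i32)
MASK = (1 << BITS) - 1    # wrap-around mask for BITS-wide unsigned arithmetic


def to_signed(x):
    """Reinterpret a BITS-wide unsigned value as two's-complement signed."""
    return x - (1 << BITS) if x & (1 << (BITS - 1)) else x


def signed_trunc_check(x):
    # Models: %t = add %arg, 128 ; %r = icmp ult %t, 256
    return ((x + 128) & MASK) < 256


def is_non_negative(x):
    # Models: %r = icmp sgt %arg, -1
    return to_signed(x) > -1


def folded(x):
    # Models the replacement: %r = icmp ult %arg, 128
    return x < 128


def check_all():
    """Compare the original conjunction with the folded form on every input."""
    return all(
        (signed_trunc_check(x) and is_non_negative(x)) == folded(x)
        for x in range(1 << BITS)
    )


if __name__ == "__main__":
    print("fold holds for all inputs:", check_all())
```

An exhaustive check at a reduced width only illustrates the argument. The Alive link in the commit message is the actual proof for the i32 case.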


3658 Commits