llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	433897da4a	[InstCombine][X86] simplifyX86immShift - convert variable in-range vector shift by immediate amounts to generic shifts (PR40391) The slli/srli/srai 'immediate' vector shifts (although its not immediate anymore to match gcc) can be replaced with generic shifts if the shift amount is known to be in range.	2020-03-19 15:44:24 +00:00
Simon Pilgrim	fb11455038	[InstCombine][X86] Tests for variable but in-range vector-by-scalar shift amounts (PR40391) These shifts are masked to be inrange so we should be able to replace them with generic shifts.	2020-03-19 13:11:06 +00:00
Florian Hahn	4a58996dd2	[SCCP] Use constant ranges for PHI nodes. For PHIs with multiple incoming values, we can improve precision by using constant ranges for integers. We can over-approximate phis by merging the incoming values. Reviewers: davide, efriedma, mssimpso Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D71933	2020-03-19 12:45:33 +00:00
Simon Pilgrim	0b458d4dca	[ValueTracking] Add computeKnownBits DemandedElts support to ADD/SUB/MUL instructions (PR36319)	2020-03-19 12:41:29 +00:00
Simon Pilgrim	7ce7f78963	[InstSimplify] Add missing vector ADD+SUB tests to show lack of DemandedElts support	2020-03-19 11:27:27 +00:00
Simon Pilgrim	d259e31a17	[InstSimplify] Add missing vector MUL tests to show lack of DemandedElts support	2020-03-19 11:27:27 +00:00
Florian Hahn	8a36594a7e	[SCCP] Use constant ranges for binary operators. If one of the operands of a binary operator is a constant range, we can use ConstantRange::binaryOp to approximate the result. We still handle single element constant ranges as we did previously, with ConstantExpr::get(), because ConstantRange::binaryOp still gives worse results in a few cases for single element ranges. Also note that we bail out early if any of the operands is still unknown. Reviewers: davide, efriedma, mssimpso Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D71936	2020-03-19 09:35:48 +00:00
Chen Zheng	d8fcdcdf68	[Reassociate] add testcases for more than 1 pairs - NFC	2020-03-19 05:21:24 -04:00
Chen Zheng	3f85134d71	[PowerPC] implement target hook isProfitableToHoist On Powerpc fma is faster than fadd + fmul for some types, (PPCTargetLowering::isFMAFasterThanFMulAndFAdd). we should implement target hook isProfitableToHoist to prevent simplifyCFGpass from breaking fma pattern by hoisting fmul to predecessor block. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D76207	2020-03-19 00:17:25 -04:00
Huihui Zhang	2ea5495759	[InstCombine][SVE] Fix InstCombiner::visitAllocaInst for scalable vector. Summary: DataLayout::getTypeAllocSize() return TypeSize. For cases where scalable property doesn't matter (check for zero-sized alloca), we should explicitly call getKnownMinSize() to avoid implicit type conversion to uint64_t, which is invalid for scalable vector type. Reviewers: sdesmalen, efriedma, spatel, apazos Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76386	2020-03-18 20:57:14 -07:00
Simon Pilgrim	99336bf95a	[ValueTracking] Add computeKnownBits DemandedElts support to masked add instructions (PR36319)	2020-03-18 21:50:56 +00:00
Simon Pilgrim	49bdfd888d	[InstSimplify] Add missing vector masked add tests to show lack of DemandedElts support	2020-03-18 21:04:54 +00:00
Sanjay Patel	22c66c1a28	[JumpThreading] add a miscompile test based on discussion in D76332; NFC	2020-03-18 16:46:18 -04:00
Simon Pilgrim	9d40292a64	[ValueTracking] Add computeKnownBits DemandedElts support to XOR instructions (PR36319)	2020-03-18 20:24:14 +00:00
Simon Pilgrim	47ce1406c8	[InstSimplify] Add missing vector OR test to show lack of DemandedElts support	2020-03-18 20:24:14 +00:00
Simon Pilgrim	6bdb0efa42	[InstSimplify] Regenerate OR tests	2020-03-18 20:24:13 +00:00
Simon Pilgrim	1010c44b4c	[ValueTracking] Add computeKnownBits DemandedElts support to EXTRACTELEMENT/OR/BSWAP/BITREVERSE instructions (PR36319) These are all covered by the bswap/bitreverse vector tests.	2020-03-18 18:49:58 +00:00
Simon Pilgrim	9c6458ecf8	[InstSimplify] Add bitreverse/bswap vector tests Shows missing DemandedElts support (PR36319)	2020-03-18 18:17:10 +00:00
Sam Parker	fc2a5ef9c8	[NFC][PowerPC] Update test Run the update script on one of the loop unroll tests.	2020-03-18 16:21:37 +00:00
Simon Pilgrim	06150e8356	[ValueTracking] Add computeKnownBits DemandedElts support to AND instructions (PR36319)	2020-03-18 15:38:15 +00:00
Sander de Smalen	ef64ba8311	[InstCombine] GEPOperator::accumulateConstantOffset does not support scalable vectors Avoid transforming: %0 = bitcast i8* %base to <vscale x 16 x i8>* %1 = getelementptr <vscale x 16 x i8>, <vscale x 16 x i8>* %0, i64 1 into: %0 = getelementptr i8, i8* %base, i64 16 %1 = bitcast i8* %0 to <vscale x 16 x i8>* Reviewers: efriedma, ctetreau Reviewed By: efriedma Tags: #llvm Differential Revision: https://reviews.llvm.org/D76236	2020-03-18 14:58:46 +00:00
Simon Pilgrim	24c2e61362	[InstCombine][X86] Add additional demandedelts style test for in-range variable per-element shift amounts (PR40391) If we've shuffled the shift amount some of the (undemanded) elements may have become undef - this should be handled by the missing support in PR36319.	2020-03-18 14:36:34 +00:00
Florian Hahn	0db7244295	[SCCP] Precommit some additional tests for integer ranges.	2020-03-18 11:34:04 +00:00
Simon Pilgrim	f4e495a18e	[InstCombine][X86] simplifyX86varShift - convert variable in-range per-element shift amounts to generic shifts (PR40391) AVX2/AVX512 per-element shifts can be replaced with generic shifts if the shift amounts are guaranteed to be in-range (upper bits are known zero).	2020-03-18 11:26:54 +00:00
Simon Pilgrim	cda2b0769f	[InstCombine][X86] Tests for variable but in-range per-element shift amounts (PR40391) These shifts are masked to be inrange so we should be able to replace them with generic shifts.	2020-03-18 10:29:47 +00:00
Florian Hahn	5672ae8d86	[SCCP] Use constant ranges for select, if cond is overdefined. For selects with an unknown condition, we can approximate the result by merging the state of both options. This automatically takes care of the case where on operand is undef. Reviewers: davide, efriedma, mssimpso Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D71935	2020-03-18 09:26:02 +00:00
Sanjay Patel	be9e3d9416	[InstCombine] reduce demand-limited bool math to logic, part 2 Follow-on suggested in: D75961	2020-03-17 15:18:18 -04:00
Sanjay Patel	586565c514	[InstCombine] add tests for bool math; NFC	2020-03-17 15:18:18 -04:00
Huihui Zhang	1bf0c99375	[ValueTracking][SVE] Fix isGEPKnownNonNull for scalable vector. Summary: DataLayout::getTypeAllocSize() return TypeSize. For cases where the scalable property doesn't matter, we should explicitly call getKnownMinSize() to avoid implicit type conversion to uint64_t, which is not valid for scalable vector type. Reviewers: sdesmalen, efriedma, apazos, reames Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76260	2020-03-17 11:31:30 -07:00
Tyker	e8ac825f5b	[AssumeBundles] Detection of Empty bundles Summary: Prevent InstCombine from removing llvm.assume for which the arguement is true when they have operand bundles with usefull information. Reviewers: jdoerfert, nikic, lebedev.ri Reviewed By: jdoerfert Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76147	2020-03-17 15:50:15 +01:00
Florian Hahn	1d6f919df2	[SCCP] Explicitly mark values as overdefined (NFC). This was part of D60582 but can be committed separately.	2020-03-17 12:13:30 +00:00
Serguei Katkov	80c351cdb6	[InstCombine] Transform to undef incorrect atomic unordered mem intrinsics According to LangRef: If len is not a positive integer multiple of element_size, then the behaviour of the intrinsic is undefined. Add InstCombine rule to transform intrinsic to undef operation. This is a follow-up for D76116. Reviewers: reames Reviewed By: reames Subscribers: hiraditya, jfb, dantrushin, llvm-commits Differential Revision: https://reviews.llvm.org/D76215	2020-03-17 10:20:16 +07:00
Chen Zheng	fa72b29bec	[PowerPC] add test cases for target hook isProfitableToHoist - NFC	2020-03-16 23:07:30 -04:00
Nico Weber	623cb95eb3	Revert "[InstSimplify] Simplify calls with "returned" attribute" This reverts commit `45555c3819`. Causes clang crashes in some causes, see comments on https://reviews.llvm.org/D75815 for details (including repro steps).	2020-03-16 15:21:30 -04:00
Huihui Zhang	0616e9964b	[InstSimplify][SVE] Fix SimplifyGEPInst for scalable vector. Summary: Skip folds that rely on DataLayout::getTypeAllocSize(). For scalable vector, only minimal type alloc size is known at compile-time. Reviewers: sdesmalen, efriedma, spatel, apazos Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75892	2020-03-16 11:46:12 -07:00
Juneyoung Lee	7aecf2323c	[ExpandMemCmp] Correctly set alignment of generated loads Summary: This is a part of the series of efforts for correcting alignment of memory operations. (Another related bugs: https://bugs.llvm.org/show_bug.cgi?id=44388 , https://bugs.llvm.org/show_bug.cgi?id=44543 ) This fixes https://bugs.llvm.org/show_bug.cgi?id=43880 by giving default alignment of loads to 1. The test CodeGen/AArch64/bcmp-inline-small.ll should have been changed; it was introduced by https://reviews.llvm.org/D64805 . I talked with @evandro, and confirmed that the test is okay to be changed. Other two tests from PowerPC needed changes as well, but fixes were straightforward. Reviewers: courbet Reviewed By: courbet Subscribers: nlopes, gchatelet, wuzish, nemanjai, kristof.beyls, hiraditya, steven.zhang, danielkiss, llvm-commits, evandro Tags: #llvm Differential Revision: https://reviews.llvm.org/D76113	2020-03-16 22:39:48 +09:00
Juneyoung Lee	acdcd23b7b	Add tests to ExpandMemCmp/X86/memcmp.ll before submitting D76113	2020-03-16 22:19:37 +09:00
Juneyoung Lee	6ad63606ea	[CodeGenPrepare] Freeze condition when transforming select to br Summary: This is a simple fix for CodeGenPrepare that freezes branch condition when transforming select to branch. If it is not frozen, instsimplify or the later pipeline can potentially exploit undefined behavior. The diff shows optimized form becase D75859 and D76048 already made a few changes to CodeGenPrepare for optimizing freeze(cmp). Reviewers: jdoerfert, spatel, lebedev.ri, efriedma Reviewed By: lebedev.ri Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76179	2020-03-16 12:46:20 +09:00
Juneyoung Lee	4ffe3ac729	Revert "[CodeGenPrepare] Freeze condition when transforming select to br" This reverts commit `10aa7ea951`.	2020-03-16 12:45:54 +09:00
Florian Hahn	650f363bd7	[ValueLattice] Add singlecrfromundef lattice value. This patch adds a new singlecrfromundef lattice value, indicating a single element constant range which was merge with undef at some point. Merging it with another constant range results in overdefined, as we won't be able to replace all users with a single value. This patch uses a ConstantRange instead of a Constant*, because regular integer constants are represented as single element constant ranges as well and this allows the existing code working without additional changes. Reviewers: efriedma, nikic, reames, davide Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D75845	2020-03-15 11:23:46 +00:00
Juneyoung Lee	10aa7ea951	[CodeGenPrepare] Freeze condition when transforming select to br Summary: This is a simple fix for CodeGenPrepare that freezes branch condition when transforming select to branch. If it is not freezed, instsimplify or the later pipeline can potentially exploit undefined behavior. The diff shows optimized form becase D75859 and D76048 already made a few changes to CodeGenPrepare for optimizing freeze(cmp). Reviewers: jdoerfert, spatel, lebedev.ri, efriedma Reviewed By: lebedev.ri Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76179	2020-03-15 11:10:46 +09:00
Florian Hahn	4878aa36d4	[ValueLattice] Add new state for undef constants. This patch adds a new undef lattice state, which is used to represent UndefValue constants or instructions producing undef. The main difference to the unknown state is that merging undef values with constants (or single element constant ranges) produces the constant/constant range, assuming all uses of the merge result will be replaced by the found constant. Contrary, merging non-single element ranges with undef needs to go to overdefined. Using unknown for UndefValues currently causes mis-compiles in CVP/LVI (PR44949) and will become problematic once we use ValueLatticeElement for SCCP. Reviewers: efriedma, reames, davide, nikic Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D75120	2020-03-14 17:19:59 +00:00
Akira Hatanaka	c6f1713c46	[ObjC][ARC] Don't remove autoreleaseRV/retainRV pairs if the call isn't a tail call This reapplies the patch in https://reviews.llvm.org/rG1f5b471b8bf4, which was reverted because it was causing crashes. https://bugs.chromium.org/p/chromium/issues/detail?id=1061289#c2 Check that HasSafePathToCall is true before checking the call is a tail call. Original commit message: Previosly ARC optimizer removed the autoreleaseRV/retainRV pair in the following code, which caused the object returned by @something to be placed in the autorelease pool because the call to @something isn't a tail call: ``` %call = call i8* @something(...) %2 = call i8* @objc_retainAutoreleasedReturnValue(i8* %call) %3 = call i8* @objc_autoreleaseReturnValue(i8* %2) ret i8* %3 ``` Fix the bug by checking whether @something is a tail call. rdar://problem/59275894	2020-03-13 13:52:14 -07:00
Alexey Zhikhartsev	f71abec661	[LoopInterchange] Fix interchanging contents of preheader BBs Summary: Previously LCSSA was getting broken by placing instructions into the (newly) inner header instead of the preheader. Fixes PR43474 Reviewers: fhahn Reviewed By: fhahn Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75943	2020-03-13 15:59:37 -04:00
Reid Kleckner	478b06e687	Revert "[ObjC][ARC] Check the basic block size before calling DominatorTree::dominate" This reverts commit `5c3117b0a9` This should not be necessary after `7593a480db`, and Florian Hahn has confirmed that the problem no longer reproduces with this patch. I happened to notice this code because the FIXME talks about OrderedBasicBlock. Reviewed By: fhahn, dexonsmith Differential Revision: https://reviews.llvm.org/D76075	2020-03-13 11:57:55 -07:00
Sanjay Patel	89b19e8959	[SimplifyCFG] add test for chain of empty block conditional branches; NFC	2020-03-13 14:39:31 -04:00
Huihui Zhang	fc1f205745	[SLPVectorizer][SVE] Bail out early for scalable vector. Summary: SLPVectorizer try to vectorize list of scalar instructions of the same type, instructions already vectorized are rejected through isValidElementType(). Without this patch, tryToVectorizeList() will first try to determine vectorization factor of a list of Instructions before checking whether each instruction has unsupported type or not. For instructions already vectorized for SVE, it will crash at getVectorElementSize(), where it try to return a fixed size. This patch make sure invalid element types are rejected before trying to get vectorization factor. This make sure we are not trying to vectorize instructions already vectorized. Reviewers: sdesmalen, efriedma, spatel, RKSimon, ABataev, apazos, rengolin Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76017	2020-03-13 11:23:31 -07:00
Sanjay Patel	afc4dcee83	[SimplifyCFG] regenerate complete test checks; NFC	2020-03-13 14:12:28 -04:00
Sanjay Patel	7fe0e70ecc	[SimplifyCFG] regenerate test checks; NFC	2020-03-13 14:12:28 -04:00
Florian Hahn	e30c257811	[CVP,SCCP] Precommit test for D75055. Test case for PR44949.	2020-03-13 17:53:39 +00:00

1 2 3 4 5 ...

14492 Commits