llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	2e069f250a	[InstCombine] add (ashr (shl i32 X, 31), 31), 1 --> and (not X), 1 This is another step towards favoring 'not' ops over random 'xor' in IR: https://bugs.llvm.org/show_bug.cgi?id=32706 This transformation may have occurred in longer IR sequences using computeKnownBits, but that could be much more expensive to calculate. As the scalar result shows, we do not currently favor 'not' in all cases. The 'not' created by the transform is transformed again (unnecessarily). Vectors don't have this problem because vectors are (wrongly) excluded from several other combines. llvm-svn: 302659	2017-05-10 13:56:52 +00:00
Serge Guelton	778ece82ae	Use explicit false instead of casted nullptr. NFC. llvm-svn: 302656	2017-05-10 13:24:17 +00:00
Chandler Carruth	f3bd8ddedb	Revert r301950: SpeculativeExecution: Stop using whitelist for costs This pass doesn't correctly handle testing for when it is legal to hoist arbitrary instructions. The whitelist happens to make it safe, so before it is removed the pass's legality checks will need to be enhanced. Details have been added to the code review thread for the patch. llvm-svn: 302640	2017-05-10 12:30:07 +00:00
Amara Emerson	836b0f48c1	Add a late IR expansion pass for the experimental reduction intrinsics. This pass uses a new target hook to decide whether or not to expand a particular intrinsic to the shuffevector sequence. Differential Revision: https://reviews.llvm.org/D32245 llvm-svn: 302631	2017-05-10 09:42:49 +00:00
Sanjay Patel	4133d4a56e	[InstCombine] add helper function for add X, C folds; NFCI llvm-svn: 302605	2017-05-10 00:07:16 +00:00
Easwaran Raman	f5f9160072	[ProfileSummary] Make getProfileCount a non-static member function. This change is required because the notion of count is different for sample profiling and getProfileCount will need to determine the underlying profile type. Differential revision: https://reviews.llvm.org/D33012 llvm-svn: 302597	2017-05-09 23:21:10 +00:00
Peter Collingbourne	c3d677f9d9	FunctionImport: Simplify function llvm::thinLTOInternalizeModule. NFCI. llvm-svn: 302595	2017-05-09 22:43:31 +00:00
Keno Fischer	06f962c1e8	[GVN] Fix a crash on encountering non-integral pointers Summary: This fixes the immediate crash caused by introducing an incorrect inttoptr before attempting the conversion. There may still be a legality check missing somewhere earlier for non-integral pointers, but this change seems necessary in any case. Reviewers: sanjoy, dberlin Reviewed By: dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32623 llvm-svn: 302587	2017-05-09 21:07:20 +00:00
Matthew Simpson	78fd46b230	[AArch64] Consider widening instructions in cost calculations The AArch64 instruction set has a few "widening" instructions (e.g., uaddl, saddl, uaddw, etc.) that take one or more doubleword operands and produce quadword results. The operands are automatically sign- or zero-extended as appropriate. However, in LLVM IR, these extends are explicit. This patch updates TTI to consider these widening instructions as single operations whose cost is attached to the arithmetic instruction. It marks extends that are part of a widening operation "free" and applies a sub-target specified overhead (zero by default) to the arithmetic instructions. Differential Revision: https://reviews.llvm.org/D32706 llvm-svn: 302582	2017-05-09 20:18:12 +00:00
Sanjay Patel	7caaa79879	[InstCombine] clean up matchDeMorgansLaws(); NFCI The motivation for getting rid of dyn_castNotVal is to allow fixing: https://bugs.llvm.org/show_bug.cgi?id=32706 So this was supposed to be functional-change-intended for the case of inverting constants and applying DeMorgan. However, I can't find any cases where that pattern will actually get to matchDeMorgansLaws() because we have other folds in visitAnd/visitOr that do the same thing. So this ends up just being a clean-up patch with slight efficiency improvement, but no-functional-change-intended. llvm-svn: 302581	2017-05-09 20:05:05 +00:00
Davide Italiano	b7a6698ae9	[NewGVN] Simplify a DEBUG() statement. NFCI. llvm-svn: 302579	2017-05-09 20:02:48 +00:00
Adrian Prantl	c10d0e5ccd	Make it illegal for two Functions to point to the same DISubprogram As recently discussed on llvm-dev [1], this patch makes it illegal for two Functions to point to the same DISubprogram and updates FunctionCloner to also clone the debug info of a function to conform to the new requirement. To simplify the implementation it also factors out the creation of inlineAt locations from the Inliner into a general-purpose utility in DILocation. [1] http://lists.llvm.org/pipermail/llvm-dev/2017-May/112661.html <rdar://problem/31926379> Differential Revision: https://reviews.llvm.org/D32975 This reapplies r302469 with a fix for a bot failure (reparentDebugInfo now checks for the case the orig and new function are identical). llvm-svn: 302576	2017-05-09 19:47:37 +00:00
Piotr Padlewski	d979c1f806	NFC: refactor replaceDominatedUsesWith Summary: Since I will post patch with some changes to replaceDominatedUsesWith, it would be good to avoid duplicating code again. Reviewers: davide, dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32798 llvm-svn: 302575	2017-05-09 19:39:44 +00:00
Serge Guelton	e38003f839	Suppress all uses of LLVM_END_WITH_NULL. NFC. Use variadic templates instead of relying on <cstdarg> + sentinel. This enforces better type checking and makes code more readable. Differential Revision: https://reviews.llvm.org/D32541 llvm-svn: 302571	2017-05-09 19:31:13 +00:00
Davide Italiano	63998ec3c8	[NewGVN] Explain why sorting by pointer values doesn't introduce non-determinism. Thanks to Eli for pointing out in a post-commit review comment. llvm-svn: 302566	2017-05-09 18:29:37 +00:00
Davide Italiano	d6bb8cab03	[NewGVN] Fix a consistent order for phi nodes operands. The way we currently define congruency for two PHIExpression(s) is: 1) The operands to the phi functions are congruent 2) The PHIs are defined in the same BasicBlock. NewGVN works under the assumption that phi operands are in predecessor order, or at least in some consistent order. OTOH, is valid IR: patatino: %meh = phi i16 [ %0, %winky ], [ %conv1, %tinky ] %banana = phi i16 [ %0, %tinky ], [ %conv1, %winky ] br label %end and the in-memory representations of the two SSA registers have an inconsistent order. This violation of NewGVN assumptions results into two PHIs found congruent when they're not. While we think it's useful to have always a consistent order enforced, let's fix this in NewGVN sorting uses in predecessor order before creating a PHI expression. Differential Revision: https://reviews.llvm.org/D32990 llvm-svn: 302552	2017-05-09 16:58:28 +00:00
Daniel Berlin	6604a2ffbb	NewGVN: Make all of symbolic evaluation logically const. llvm-svn: 302550	2017-05-09 16:40:04 +00:00
Sanjay Patel	6844e21f59	[InstCombineCasts] Fix checks in sext->lshr->trunc pattern. The comment says to avoid the case where zero bits are shifted into the truncated value, but the code checks that the shift is smaller than the truncated value instead of the number of bits added by the sign extension. Fixing this allows a shift by more than the value size to be introduced, which is undefined behavior, so the shift is capped at the value size minus one, which has the expected behavior of filling the value with the sign bit. Patch by Jacob Young! Differential Revision: https://reviews.llvm.org/D32285 llvm-svn: 302548	2017-05-09 16:24:59 +00:00
Hans Wennborg	66fb0d9768	Revert r302469 "Make it illegal for two Functions to point to the same DISubprogram" This caused PR32977. Original commit message: > Make it illegal for two Functions to point to the same DISubprogram > > As recently discussed on llvm-dev [1], this patch makes it illegal for > two Functions to point to the same DISubprogram and updates > FunctionCloner to also clone the debug info of a function to conform > to the new requirement. To simplify the implementation it also factors > out the creation of inlineAt locations from the Inliner into a > general-purpose utility in DILocation. > > [1] http://lists.llvm.org/pipermail/llvm-dev/2017-May/112661.html > <rdar://problem/31926379> > > Differential Revision: https://reviews.llvm.org/D32975 llvm-svn: 302533	2017-05-09 14:44:15 +00:00
Anna Thomas	0691483435	[LV] Fix insertion point for shuffle vectors in first order recurrence Summary: In first order recurrence vectorization, when the previous value is a phi node, we need to set the insertion point to the first non-phi node. We can have the previous value being a phi node, due to the generation of new IVs as part of trunc optimization [1]. [1] https://reviews.llvm.org/rL294967 Reviewers: mssimpso, mkuper Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D32969 llvm-svn: 302532	2017-05-09 14:29:33 +00:00
Amara Emerson	cf9daa33a7	Introduce experimental generic intrinsics for horizontal vector reductions. - This change allows targets to opt-in to using them instead of the log2 shufflevector algorithm. - The SLP and Loop vectorizers have the common code to do shuffle reductions factored out into LoopUtils, and now have a unified interface for generating reductions regardless of the preference of the target. LoopUtils now uses TTI to determine what kind of reductions the target wants to handle. - For CodeGen, basic legalization support is added. Differential Revision: https://reviews.llvm.org/D30086 llvm-svn: 302514	2017-05-09 10:43:25 +00:00
Sanjoy Das	76bbdd1a16	[InstNamer] Use range-for llvm-svn: 302481	2017-05-08 23:18:43 +00:00
Sanjoy Das	0ac3bf1cc8	[InstNamer] Don't check type of arguments (they're never void) llvm-svn: 302480	2017-05-08 23:18:39 +00:00
Sanjoy Das	7be961b081	Delete trailing whitespace llvm-svn: 302479	2017-05-08 23:18:36 +00:00
Adrian Prantl	200a5ef526	Make it illegal for two Functions to point to the same DISubprogram As recently discussed on llvm-dev [1], this patch makes it illegal for two Functions to point to the same DISubprogram and updates FunctionCloner to also clone the debug info of a function to conform to the new requirement. To simplify the implementation it also factors out the creation of inlineAt locations from the Inliner into a general-purpose utility in DILocation. [1] http://lists.llvm.org/pipermail/llvm-dev/2017-May/112661.html <rdar://problem/31926379> Differential Revision: https://reviews.llvm.org/D32975 llvm-svn: 302469	2017-05-08 21:17:08 +00:00
Sanjay Patel	a1c8814891	[InstCombine] add folds for not-of-shift-right This is another step towards getting rid of dyn_castNotVal, so we can recommit: https://reviews.llvm.org/rL300977 As the tests show, we were missing the lshr case for constants and both ashr/lshr vector splat folds. The ashr case with constant was being performed inefficiently in 2 steps. It's also possible there was a latent bug in that case because we can't do that fold if the constant is positive: http://rise4fun.com/Alive/Bge llvm-svn: 302465	2017-05-08 20:49:59 +00:00
Davide Italiano	aa42a10051	[PartialInlining] Capture by reference rather than by value. llvm-svn: 302464	2017-05-08 20:44:01 +00:00
Sanjay Patel	2a06263036	[InstCombine] use local variable to reduce code duplication; NFCI llvm-svn: 302438	2017-05-08 16:33:42 +00:00
Sanjay Patel	2df38a80f1	[InstCombine/InstSimplify] add comments about code duplication; NFC llvm-svn: 302436	2017-05-08 16:21:55 +00:00
Craig Topper	7e3e7afca8	[ConstantRange][SimplifyCFG] Add a helper method to allow SimplifyCFG to determine if a ConstantRange has more than 8 elements without requiring an allocation if the ConstantRange is 64-bits wide. Previously SimplifyCFG used getSetSize which returns an APInt that is 1 bit wider than the ConstantRange's bit width. In the reasonably common case that the ConstantRange is 64-bits wide, this requires returning a 65-bit APInt. APInt's can only store 64-bits without a memory allocation so this is inefficient. The new method takes the 8 as an input and tells if the range contains more than that many elements without requiring any wider math. llvm-svn: 302385	2017-05-07 22:22:11 +00:00
Sanjay Patel	599e65b1ff	[InstSimplify] use ConstantRange to simplify or-of-icmps We can simplify (or (icmp X, C1), (icmp X, C2)) to 'true' or one of the icmps in many cases. I had to check some of these with Alive to prove to myself it's right, but everything seems to check out. Eg, the deleted code in instcombine was completely ignoring predicates with mismatched signedness. This is a follow-up to: https://reviews.llvm.org/rL301260 https://reviews.llvm.org/D32143 llvm-svn: 302370	2017-05-07 15:11:40 +00:00
Kostya Serebryany	424bfed693	[sanitizer-coverage] implement -fsanitize-coverage=no-prune,... instead of a hidden -mllvm flag. llvm part. llvm-svn: 302319	2017-05-05 23:14:40 +00:00
Craig Topper	a49e768977	Fix spelling error in command line option description. NFC llvm-svn: 302311	2017-05-05 22:31:11 +00:00
Matthias Braun	60b40b8fec	TargetLibraryInfo: Introduce wcslen wcslen is part of the C99 and C++98 standards. - This introduces the function to TargetLibraryInfo. - Also set attributes for wcslen in llvm::inferLibFuncAttributes(). Differential Revision: https://reviews.llvm.org/D32837 llvm-svn: 302278	2017-05-05 20:25:50 +00:00
Craig Topper	f0aeee01c3	[KnownBits] Add wrapper methods for setting and clear all bits in the underlying APInts in KnownBits. This adds routines for reseting KnownBits to unknown, making the value all zeros or all ones. It also adds methods for querying if the value is zero, all ones or unknown. Differential Revision: https://reviews.llvm.org/D32637 llvm-svn: 302262	2017-05-05 17:36:09 +00:00
Craig Topper	fc481e5eb7	[Float2Int] Replace a ConstantRange copy with a move. Remove an extra call to MapVector::find. llvm-svn: 302256	2017-05-05 17:09:29 +00:00
Aditya Kumar	1c42d135e1	[LoopIdiom] check for safety while expanding Loop Idiom recognition was generating memset in a case that would result generating a division operation to an unsafe location. Differential Revision: https://reviews.llvm.org/D32674 llvm-svn: 302238	2017-05-05 14:49:45 +00:00
Evgeniy Stepanov	9aff829f78	Remap metadata attached to global variables. Fix for PR32577. Global variables may have !associated metadata, which includes a reference to another global. It needs remapping. llvm-svn: 302203	2017-05-04 23:29:39 +00:00
Craig Topper	1f673d4450	[JumpThreading] When processing compares, explicitly check that the result type is not a vector rather than check for it being an integer. Compares always return a scalar integer or vector of integers. isIntegerTy returns false for vectors, but that's not completely obvious. So using isVectorTy is less confusing. llvm-svn: 302198	2017-05-04 21:45:49 +00:00
Craig Topper	930689ada4	[JumpThreading] Change a dyn_cast that is already protected by an isa check to a static cast. Combine the with another static cast. NFC Differential Revision: https://reviews.llvm.org/D32874 llvm-svn: 302197	2017-05-04 21:45:45 +00:00
Craig Topper	5974dadc69	[Float2Int] Remove return of ConstantRange from seen method. Nothing uses it so it just creates and discards a ConstantRange object for no reason. llvm-svn: 302193	2017-05-04 21:29:45 +00:00
Peter Collingbourne	9667b91b13	Re-apply r302108, "IR: Use pointers instead of GUIDs to represent edges in the module summary. NFCI." with a fix for the clang backend. llvm-svn: 302176	2017-05-04 18:03:25 +00:00
Davide Italiano	94bf7846fd	[NewGVN] Remove unneeded newline and format assertions. NFCI. llvm-svn: 302173	2017-05-04 17:26:15 +00:00
Eric Liu	f6039f255e	Revert "IR: Use pointers instead of GUIDs to represent edges in the module summary. NFCI." This reverts commit r302108. This causes crash in clang bootstrap with LTO. Contacted the auther in the original commit. llvm-svn: 302140	2017-05-04 11:49:39 +00:00
Martin Storsjo	e81233d0ed	[ArgPromotion] Fix a truncated variable This fixes a regression since SVN rev 273808 (which was supposed to not change functionality). The regression caused miscompilations (noted in the wild when targeting AArch64) on platforms with 32 bit long. Differential Revision: https://reviews.llvm.org/D32850 llvm-svn: 302137	2017-05-04 10:54:35 +00:00
Jonas Paulsson	8bf1fdcc91	Use right function in LoopVectorize. - unsigned AS = getMemInstAlignment(I); + unsigned AS = getMemInstAddressSpace(I); Review: Hal Finkel llvm-svn: 302114	2017-05-04 05:31:56 +00:00
Peter Collingbourne	5f85a9deda	IR: Use pointers instead of GUIDs to represent edges in the module summary. NFCI. When profiling a no-op incremental link of Chromium I found that the functions computeImportForFunction and computeDeadSymbols were consuming roughly 10% of the profile. The goal of this change is to improve the performance of those functions by changing the map lookups that they were previously doing into pointer dereferences. This is achieved by changing the ValueInfo data structure to be a pointer to an element of the global value map owned by ModuleSummaryIndex, and changing reference lists in the GlobalValueSummary to hold ValueInfos instead of GUIDs. This means that a ValueInfo will take a client directly to the summary list for a given GUID. Differential Revision: https://reviews.llvm.org/D32471 llvm-svn: 302108	2017-05-04 03:36:16 +00:00
Craig Topper	cff357c322	[InstCombine][KnownBits] Use KnownBits better to detect nsw adds Change checkRippleForAdd from a heuristic to a full check - if it is provable that the add does not overflow return true, otherwise false. Patch by Yoav Ben-Shalom Differential Revision: https://reviews.llvm.org/D32686 llvm-svn: 302093	2017-05-03 23:22:46 +00:00
Craig Topper	8189a87a1e	[KnownBits] Add methods for determining if KnownBits is a constant value This patch adds isConstant and getConstant for determining if KnownBits represents a constant value and to retrieve the value. Use them to simplify code. Differential Revision: https://reviews.llvm.org/D32785 llvm-svn: 302091	2017-05-03 23:12:29 +00:00
Craig Topper	d938fd1397	[KnownBits] Add zext, sext, and trunc methods to KnownBits This patch adds zext, sext, and trunc methods to KnownBits and uses them where possible. Differential Revision: https://reviews.llvm.org/D32784 llvm-svn: 302088	2017-05-03 22:07:25 +00:00
Xin Tong	46fb813ac3	[TailCallElim] Remove an unused argument. NFCI llvm-svn: 302080	2017-05-03 20:37:07 +00:00
Anna Thomas	f475fa3575	Avoid warning of unused variable in release builds. NFC llvm-svn: 302068	2017-05-03 19:25:04 +00:00
Sanjoy Das	23f314d04f	Fix typos in comment llvm-svn: 302063	2017-05-03 18:29:34 +00:00
Anna Thomas	d4c0295cc8	Fix PPC64 warning for missing parantheses. NFC. llvm-svn: 302061	2017-05-03 18:25:43 +00:00
Reid Kleckner	a0b45f4bfc	[IR] Abstract away ArgNo+1 attribute indexing as much as possible Summary: Do three things to help with that: - Add AttributeList::FirstArgIndex, which is an enumerator currently set to 1. It allows us to change the indexing scheme with fewer changes. - Add addParamAttr/removeParamAttr. This just shortens addAttribute call sites that would otherwise need to spell out FirstArgIndex. - Remove some attribute-specific getters and setters from Function that take attribute list indices. Most of these were only used from BuildLibCalls, and doesNotAlias was only used to test or set if the return value is malloc-like. I'm happy to split the patch, but I think they are probably easier to review when taken together. This patch should be NFC, but it sets the stage to change the indexing scheme to this, which is more convenient when indexing into an array: 0: func attrs 1: retattrs 2...: arg attrs Reviewers: chandlerc, pete, javed.absar Subscribers: david2050, llvm-commits Differential Revision: https://reviews.llvm.org/D32811 llvm-svn: 302060	2017-05-03 18:17:31 +00:00
Anna Thomas	ac0ec2240b	[RuntimeLoopUnroller] Add assert that we dont unroll non-rotated loops Summary: Cloning basic blocks in the loop for runtime loop unroller depends on loop being in rotated form (i.e. loop latch target is the exit block). Assert that this is true, so that callers of runtime loop unroller pass in canonical loops. The single caller of this function has that check recently added: https://reviews.llvm.org/rL301239 Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32801 llvm-svn: 302058	2017-05-03 17:43:59 +00:00
Anna Thomas	53c8d95c85	[Loop Deletion] Delete loops that are never executed Summary: Currently, loop deletion deletes loop where the only values that are used outside the loop are loop-invariant. This patch adds logic to delete loops where the loop is proven to be never executed (i.e. the only predecessor of the loop preheader has a constant conditional branch as terminator, and the preheader is not the taken target). This will remove loops that become dead after loop-unswitching generates constant conditional branches. The next steps are: 1. moving the loop deletion implementation to LoopUtils. 2. Add logic in loop-simplifyCFG which will support changing conditional constant branches to unconditional branches. If loops become unreachable in this process, they can be removed using `deleteDeadLoop` function. Reviewers: chandlerc, efriedma, sanjoy, reames Reviewed by: sanjoy Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D32494 llvm-svn: 302015	2017-05-03 11:47:11 +00:00
Matt Arsenault	6a288c1e32	Replace hardcoded intrinsic list with speculatable attribute. No change in which intrinsics should be speculated. llvm-svn: 301995	2017-05-03 02:26:10 +00:00
Reid Kleckner	ee4930b688	Re-land r301697 "[IR] Make add/remove Attributes use AttrBuilder instead of AttributeList" This time, I fixed, built, and tested clang. This reverts r301712. llvm-svn: 301981	2017-05-02 22:07:37 +00:00
Davide Italiano	839c7e6cfb	[NewGVN] Fix typo and format comment. NFCI. llvm-svn: 301974	2017-05-02 21:11:40 +00:00
Xinliang David Li	ab8722f80a	[PartialInlining] Add more early filtering This is a follow up to the previous inline cost patch for quicker filtering. llvm-svn: 301959	2017-05-02 18:43:21 +00:00
Matt Arsenault	9ac7d6be3c	SpeculativeExecution: Stop using whitelist for costs Just let TTI's cost do this instead of arbitrarily restricting this. llvm-svn: 301950	2017-05-02 18:02:18 +00:00
Sanjay Patel	6381db18fe	[InstCombine] don't use DeMorgan's Law on integer constants (2nd try) This was originally checked in here: https://reviews.llvm.org/rL301923 And reverted here: https://reviews.llvm.org/rL301924 Because there's a clang test that would fail after this. I fixed/removed the offending CHECK lines in: https://reviews.llvm.org/rL301928 So let's try this again. Original commit message: This is the fold that causes the infinite loop in BoringSSL (https://github.com/google/boringssl/blob/master/crypto/cipher/e_rc2.c) when we fix instcombine demanded bits to prefer 'not' ops as in https://reviews.llvm.org/D32255. There are 2 or 3 problems with dyn_castNotVal, and I don't think we can reinstate https://reviews.llvm.org/D32255 until dyn_castNotVal is completely eliminated. 1. As shown here, it transforms 'not' into random xor. This transform is harmful to SCEV and codegen because 'not' can often be folded while random xor cannot. 2. It does not transform vector constants. This is actually a good thing, but if you don't believe the above argument, then we shouldn't have excluded vectors. 3. It tries to avoid transforming not(not(X)). That's nice, but it doesn't match the greedy nature of instcombine. If we DeMorganize a pattern that has an extra 'not' in it: ~(~(~X) & Y) --> (~X \| ~Y) That's just another case of DeMorgan, so we should trust that we'll fold that pattern too: (~X \| ~ Y) --> ~(X & Y) Differential Revision: https://reviews.llvm.org/D32665 llvm-svn: 301929	2017-05-02 15:31:40 +00:00
Sanjay Patel	da0b4deafa	revert r301923 : [InstCombine] don't use DeMorgan's Law on integer constants There's a clang test that is wrongly using -O1 and failing after this commit. llvm-svn: 301924	2017-05-02 14:48:23 +00:00
Sanjay Patel	096a981982	[InstCombine] don't use DeMorgan's Law on integer constants This is the fold that causes the infinite loop in BoringSSL (https://github.com/google/boringssl/blob/master/crypto/cipher/e_rc2.c) when we fix instcombine demanded bits to prefer 'not' ops as in D32255. There are 2 or 3 problems with dyn_castNotVal, and I don't think we can reinstate D32255 until dyn_castNotVal is completely eliminated. 1. As shown here, it transforms 'not' into random xor. This transform is harmful to SCEV and codegen because 'not' can often be folded while random xor cannot. 2. It does not transform vector constants. This is actually a good thing, but if you don't believe the above argument, then we shouldn't have excluded vectors. 3. It tries to avoid transforming not(not(X)). That's nice, but it doesn't match the greedy nature of instcombine. If we DeMorganize a pattern that has an extra 'not' in it: ~(~(~X) & Y) --> (~X \| ~Y) That's just another case of DeMorgan, so we should trust that we'll fold that pattern too: (~X \| ~ Y) --> ~(X & Y) Differential Revision: https://reviews.llvm.org/D32665 llvm-svn: 301923	2017-05-02 14:31:30 +00:00
Xinliang David Li	6133846be1	[PartialInlining] Hook up inline cost analysis Differential Revision: http://reviews.llvm.org/D32666 llvm-svn: 301894	2017-05-02 02:44:14 +00:00
Xin Tong	a41bf70bea	Empty Space. NFC llvm-svn: 301878	2017-05-01 23:08:19 +00:00
Davide Italiano	2dfd46bf08	[NewGVN] Don't derive incorrect implications. In the testcase attached, we believe %tmp1 implies %tmp4. where: br i1 %tmp1, label %bb2, label %bb7 br i1 %tmp4, label %bb5, label %bb7 because Wwhile looking at PredicateInfo stuffs we end up calling isImpliedTrueByMatchingCmp() with the arguments backwards. Differential Revision: https://reviews.llvm.org/D32718 llvm-svn: 301849	2017-05-01 22:26:28 +00:00
Sanjay Patel	59d0aeaafe	[InstCombine] check one-use before applying DeMorgan nor/nand folds If we have ~(~X & Y), it only makes sense to transform it to (X \| ~Y) when we do not need the intermediate (~X & Y) value. In that case, we would need an extra instruction to generate ~Y + 'or' (as shown in the test changes). It's ok if we have multiple uses of ~X or Y, however. In those cases, we may not reduce the instruction count or critical path, but we might improve throughput because we can generate ~X and ~Y in parallel. Whether that actually makes perf sense or not for a target is something we can't answer in IR. Differential Revision: https://reviews.llvm.org/D32703 llvm-svn: 301848	2017-05-01 22:25:42 +00:00
Peter Collingbourne	a992f53099	IPO: Add missing build dep. llvm-svn: 301835	2017-05-01 20:57:20 +00:00
Peter Collingbourne	c15d60b772	Object: Remove ModuleSummaryIndexObjectFile class. Differential Revision: https://reviews.llvm.org/D32195 llvm-svn: 301832	2017-05-01 20:42:32 +00:00
Xin Tong	a4b9b9f42a	Take indirect branch into account as well when folding. We may not be able to rewrite indirect branch target, but we also want to take it into account when folding, i.e. if it and all its successor's predecessors go to the same destination, we can fold, i.e. no need to thread. llvm-svn: 301816	2017-05-01 17:15:37 +00:00
Sanjoy Das	e6bca0eecb	Rename WeakVH to WeakTrackingVH; NFC This relands r301424. llvm-svn: 301812	2017-05-01 17:07:49 +00:00
Xin Tong	99dce428bc	[JumpThread] Add some assertions for expected ConstantInt/BlockAddress llvm-svn: 301808	2017-05-01 16:19:59 +00:00
Xin Tong	21f8ac235e	[JumpThread] Do RAUW in case Cond folds to a constant in the CFG Summary: [JumpThread] Do RAUW in case Cond folds to a constant in the CFG Reviewers: sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32407 llvm-svn: 301804	2017-05-01 15:34:17 +00:00
Sanjoy Das	08989c7ecd	Rename isKnownNotFullPoison to programUndefinedIfPoison; NFC Summary: programUndefinedIfPoison makes more sense, given what the function does; and I'm about to add a function with a name similar to isKnownNotFullPoison (so do the rename to avoid confusion). Reviewers: broune, majnemer, bjarke.roune Reviewed By: broune Subscribers: mcrosier, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D30444 llvm-svn: 301776	2017-04-30 19:41:19 +00:00
Craig Topper	ca48af3c87	[KnownBits] Add methods for determining if the known bits represent a negative/nonnegative number and add methods for changing the negative/nonnegative state Summary: This patch adds isNegative, isNonNegative for querying whether the sign bit is known. It also adds makeNegative and makeNonNegative for controlling the sign bit. Reviewers: RKSimon, spatel, davide Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32651 llvm-svn: 301747	2017-04-29 16:43:11 +00:00
Akira Hatanaka	6fdcb3c2ce	[ObjCARC] Do not move a release between a call and a retainAutoreleasedReturnValue that retains the returned value. This commit fixes a bug in ARC optimizer where it moves a release between a call and a retainAutoreleasedReturnValue, causing the returned object to be released before the retainAutoreleasedReturnValue can retain it. This commit accomplishes that by doing a lookahead and checking whether the call prevents the release from moving upwards. In the long term, we should treat the region between the retainAutoreleasedReturnValue and the call as a critical section and disallow moving anything there (possibly using operand bundles). rdar://problem/20449878 llvm-svn: 301724	2017-04-29 00:23:11 +00:00
Davide Italiano	0aaa96a07b	[LoopUnswitch] Make DEBUG output more readable (part 2). I fixed my miscompile in r301722 and I hope I don't have to take a look at this code again now that Chandler has a new LoopUnswitch pass, but maybe this could be of use for somebody else in the meanwhile. llvm-svn: 301723	2017-04-29 00:18:26 +00:00
Davide Italiano	534e314356	[LoopUnswitch] Don't remove instructions with side effects. This fixes PR32818. Differential Revision: https://reviews.llvm.org/D32664 llvm-svn: 301722	2017-04-29 00:12:18 +00:00
Hans Wennborg	0f88d863b4	Revert r301697 "[IR] Make add/remove Attributes use AttrBuilder instead of AttributeList" This broke the Clang build. (Clang-side patch missing?) Original commit message: > [IR] Make add/remove Attributes use AttrBuilder instead of > AttributeList > > This change cleans up call sites and avoids creating temporary > AttributeList objects. > > NFC llvm-svn: 301712	2017-04-28 23:01:32 +00:00
Matt Arsenault	e0f9e984fd	InferAddressSpaces: Search constant expressions for addrspacecasts These are pretty common when using local memory, and the 64-bit generic addressing is much more expensive to compute. llvm-svn: 301711	2017-04-28 22:52:41 +00:00
Matt Arsenault	c20ccd2c02	InferAddressSpaces: Avoid looking up deleted values While looking at pure addressing expressions, it's possible for the value to appear later in Postorder. I haven't been able to come up with a testcase where this exhibits an actual issue, but if you insert a dump before the value map lookup, a few testcases crash. llvm-svn: 301705	2017-04-28 22:18:19 +00:00
Matt Arsenault	a1e734050c	InferAddressSpaces: Infer from just addrspacecasts Eliminates some more cases where some subset of the addressing computation remains flat. Some cases with addrspacecasts in nested constant expressions are still left behind however. llvm-svn: 301704	2017-04-28 22:18:08 +00:00
Daniel Berlin	98a1de85cb	LoopRotate: Fix use after scope bug llvm-svn: 301702	2017-04-28 22:05:55 +00:00
Reid Kleckner	608c8b63b3	[IR] Make add/remove Attributes use AttrBuilder instead of AttributeList This change cleans up call sites and avoids creating temporary AttributeList objects. NFC llvm-svn: 301697	2017-04-28 21:48:28 +00:00
Davide Italiano	e27cb87754	[LoopUnswitch] Make DEBUG output more readable. While debugging a miscompile I realized loopunswitch doesn't put newlines when printing the instruction being replacement. Ending up with a single line with many instruction replaced isn't the best for readability and/or mental sanity. llvm-svn: 301692	2017-04-28 21:30:50 +00:00
Reid Kleckner	859f8b544a	Make getParamAlignment use argument numbers The method is called "get Param Alignment", and is only used for return values exactly once, so it should take argument indices, not attribute indices. Avoids confusing code like: IsSwiftError = CS->paramHasAttr(ArgIdx, Attribute::SwiftError); Alignment = CS->getParamAlignment(ArgIdx + 1); Add getRetAlignment to handle the one case in Value.cpp that wants the return value alignment. This is a potentially breaking change for out-of-tree backends that do their own call lowering. llvm-svn: 301682	2017-04-28 20:34:27 +00:00
Daniel Berlin	4d0fe64ae3	Kill off the old SimplifyInstruction API by converting remaining users. llvm-svn: 301673	2017-04-28 19:55:38 +00:00
Davide Italiano	b6681e2b4e	[IPO/MergeFunctions] This function is used only under DEBUG(). llvm-svn: 301672	2017-04-28 19:39:45 +00:00
Reid Kleckner	99351967c7	[RS4GC] Simplify attribute handling code NFC Avoids use of AttributeList::getNumSlots, making it easier to change the underlying implementation. llvm-svn: 301671	2017-04-28 19:22:40 +00:00
Reid Kleckner	6652a52e2b	Use Argument::hasAttribute and AttributeList::ReturnIndex more This eliminates many extra 'Idx' induction variables in loops over arguments in CodeGen/ and Target/. It also reduces the number of places where we assume that ReturnIndex is 0 and that we should add one to argument numbers to get the corresponding attribute list index. NFC llvm-svn: 301666	2017-04-28 18:37:16 +00:00
Adrian Prantl	109b236850	Clean up DIExpression::prependDIExpr a little. (NFC) llvm-svn: 301662	2017-04-28 17:51:05 +00:00
Craig Topper	24db6b800f	[APInt] Add clearSignBit method. Use it and setSignBit in a few places. NFCI llvm-svn: 301656	2017-04-28 16:58:05 +00:00
Teresa Johnson	51177295c4	Memory intrinsic value profile optimization: Avoid divide by 0 Summary: Skip memops if the total value profiled count is 0, we can't correctly scale up the counts and there is no point anyway. Reviewers: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32624 llvm-svn: 301645	2017-04-28 14:30:54 +00:00
Andrew Ng	03e35b6bc0	[DebugInfo][X86] Improve X86 Optimize LEAs handling of debug values. This is a follow up to the fix in r298360 to improve the handling of debug values when redundant LEAs are removed. The fix in r298360 effectively discarded the debug values. This patch now attempts to preserve the debug values by using the DWARF DW_OP_stack_value operation via prependDIExpr. Moved functions appendOffset and prependDIExpr from Local.cpp to DebugInfoMetadata.cpp and made them available as static member functions of DIExpression. Differential Revision: https://reviews.llvm.org/D31604 llvm-svn: 301630	2017-04-28 08:44:30 +00:00
Max Kazantsev	531db9a504	[EarlyCSE] Mark the condition of assume intrinsic as true EarlyCSE should not just ignore assumes. It should use the fact that its condition is true for all dominated instructions. Reviewers: sanjoy, reames, apilipenko, anna, skatkov Reviewed By: reames, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32482 llvm-svn: 301625	2017-04-28 06:25:39 +00:00
Max Kazantsev	0589d9fa0f	[EarlyCSE] Remove guards with conditions known to be true If a condition is calculated only once, and there are multiple guards on this condition, we should be able to remove all guards dominated by the first of them. This patch allows EarlyCSE to try to find the condition of a guard among the known values, and if it is true, remove the guard. Otherwise we keep the guard and mark its condition as 'true' for future consideration. Reviewers: sanjoy, reames, apilipenko, skatkov, anna, dberlin Reviewed By: reames, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32476 llvm-svn: 301623	2017-04-28 06:05:48 +00:00
Craig Topper	24e71017aa	[APInt] Use inplace shift methods where possible. NFCI llvm-svn: 301612	2017-04-28 03:36:24 +00:00
Davide Italiano	81a26da1e5	[SROA] Fix nondeterminism exposed by Simon's r299221. Use a SmallSetSetVector instead of a SmallPtrSet as iterating over the latter is not stable ('<' relies on addresses). llvm-svn: 301599	2017-04-27 23:09:01 +00:00
Sanjay Patel	73d8c43da8	[InstCombine] fix matcher to bind to specific operand (PR32830) Matching any random value would be very wrong: https://bugs.llvm.org/show_bug.cgi?id=32830 llvm-svn: 301594	2017-04-27 21:55:03 +00:00
Evgeniy Stepanov	964f4663c4	[asan] Fix dead stripping of globals on Linux. Use a combination of !associated, comdat, @llvm.compiler.used and custom sections to allow dead stripping of globals and their asan metadata. Sometimes. Currently this works on LLD, which supports SHF_LINK_ORDER with sh_link pointing to the associated section. This also works on BFD, which seems to treat comdats as all-or-nothing with respect to linker GC. There is a weird quirk where the "first" global in each link is never GC-ed because of the section symbols. At this moment it does not work on Gold (as in the globals are never stripped). This is a second re-land of r298158. This time, this feature is limited to -fdata-sections builds. llvm-svn: 301587	2017-04-27 20:27:27 +00:00
Evgeniy Stepanov	716f0ff222	[asan] Put ctor/dtor in comdat. When possible, put ASan ctor/dtor in comdat. The only reason not to is global registration, which can be TU-specific. This is not the case when there are no instrumented globals. This is also limited to ELF targets, because MachO does not have comdat, and COFF linkers may GC comdat constructors. The benefit of this is a lot less __asan_init() calls: one per DSO instead of one per TU. It's also necessary for the upcoming gc-sections-for-globals change on Linux, where multiple references to section start symbols trigger quadratic behaviour in gold linker. This is a second re-land of r298756. This time with a flag to disable the whole thing to avoid a bug in the gold linker: https://sourceware.org/bugzilla/show_bug.cgi?id=19002 llvm-svn: 301586	2017-04-27 20:27:23 +00:00
Chandler Carruth	1353f9a48b	[PM/LoopUnswitch] Introduce a new, simpler loop unswitch pass. Currently, this pass only focuses on trivial loop unswitching. At that reduced problem it remains significantly better than the current loop unswitch: - Old pass is worse than cubic complexity. New pass is (I think) linear. - New pass is much simpler in its design by focusing on full unswitching. (See below for details on this). - New pass doesn't carry state for thresholds between pass iterations. - New pass doesn't carry state for correctness (both miscompile and infloop) between pass iterations. - New pass produces substantially better code after unswitching. - New pass can handle more trivial unswitch cases. - New pass doesn't recompute the dominator tree for the entire function and instead incrementally updates it. I've ported all of the trivial unswitching test cases from the old pass to the new one to make sure that major functionality isn't lost in the process. For several of the test cases I've worked to improve the precision and rigor of the CHECKs, but for many I've just updated them to handle the new IR produced. My initial motivation was the fact that the old pass carried state in very unreliable ways between pass iterations, and these mechansims were incompatible with the new pass manager. However, I discovered many more improvements to make along the way. This pass makes two very significant assumptions that enable most of these improvements: 1) Focus on full unswitching -- that is, completely removing whatever control flow construct is being unswitched from the loop. In the case of trivial unswitching, this means removing the trivial (exiting) edge. In non-trivial unswitching, this means removing the branch or switch itself. This is in opposition to partial unswitching where some part of the unswitched control flow remains in the loop. Partial unswitching only really applies to switches and to folded branches. These are very similar to full unrolling and partial unrolling. The full form is an effective canonicalization, the partial form needs a complex cost model, cannot be iterated, isn't canonicalizing, and should be a separate pass that runs very late (much like unrolling). 2) Leverage LLVM's Loop machinery to the fullest. The original unswitch dates from a time when a great deal of LLVM's loop infrastructure was missing, ineffective, and/or unreliable. As a consequence, a lot of complexity was added which we no longer need. With these two overarching principles, I think we can build a fast and effective unswitcher that fits in well in the new PM and in the canonicalization pipeline. Some of the remaining functionality around partial unswitching may not be relevant today (not many test cases or benchmarks I can find) but if they are I'd like to add support for them as a separate layer that runs very late in the pipeline. Purely to make reviewing and introducing this code more manageable, I've split this into first a trivial-unswitch-only pass and in the next patch I'll add support for full non-trivial unswitching against a fixed threshold, exactly like full unrolling. I even plan to re-use the unrolling thresholds, as these are incredibly similar cost tradeoffs: we're cloning a loop body in order to end up with simplified control flow. We should only do that when the total growth is reasonably small. One of the biggest changes with this pass compared to the previous one is that previously, each individual trivial exiting edge from a switch was unswitched separately as a branch. Now, we unswitch the entire switch at once, with cases going to the various destinations. This lets us unswitch multiple exiting edges in a single operation and also avoids numerous extremely bad behaviors, where we would introduce 1000s of branches to test for thousands of possible values, all of which would take the exact same exit path bypassing the loop. Now we will use a switch with 1000s of cases that can be efficiently lowered into a jumptable. This avoids relying on somehow forming a switch out of the branches or getting horrible code if that fails for any reason. Another significant change is that this pass actively updates the CFG based on unswitching. For trivial unswitching, this is actually very easy because of the definition of loop simplified form. Doing this makes the code coming out of loop unswitch dramatically more friendly. We still should run loop-simplifycfg (at the least) after this to clean up, but it will have to do a lot less work. Finally, this pass makes much fewer attempts to simplify instructions based on the unswitch. Something like loop-instsimplify, instcombine, or GVN can be used to do increasingly powerful simplifications based on the now dominating predicate. The old simplifications are things that something like loop-instsimplify should get today or a very, very basic loop-instcombine could get. Keeping that logic separate is a big simplifying technique. Most of the code in this pass that isn't in the old one has to do with achieving specific goals: - Updating the dominator tree as we go - Unswitching all cases in a switch in a single step. I think it is still shorter than just the trivial unswitching code in the old pass despite having this functionality. Differential Revision: https://reviews.llvm.org/D32409 llvm-svn: 301576	2017-04-27 18:45:20 +00:00
Eli Friedman	10ab923b32	[GlobalOpt] Correctly update metadata when localizing a global. Just calling dropAllReferences leaves pointers to the ConstantExpr behind, so we would eventually crash with a null pointer dereference. Differential Revision: https://reviews.llvm.org/D32551 llvm-svn: 301575	2017-04-27 18:39:08 +00:00
Teresa Johnson	f9ea176f05	Memory intrinsic value profile optimization: Improve debug output (NFC) Summary: Misc improvements to debug output. Fix a couple typos and also dump the value profile before we make any profitability checks. Reviewers: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32607 llvm-svn: 301574	2017-04-27 18:25:22 +00:00
Xinliang David Li	d21601a929	[PartialInlining]: Improve partial inlining to handle complex conditions Differential Revision: http://reviews.llvm.org/D32249 llvm-svn: 301561	2017-04-27 16:34:00 +00:00
Craig Topper	9474e9b6c8	[InstCombine] Use APInt bit counting methods to avoid a temporary APInt. NFC llvm-svn: 301516	2017-04-27 04:51:25 +00:00
Chandler Carruth	c246a4c973	Disable GVN Hoist due to still more bugs being found in it. There is also a discussion about exactly what we should do prior to re-enabling it. The current bug is http://llvm.org/PR32821 and the discussion about this is in the review thread for r300200. llvm-svn: 301505	2017-04-27 00:28:03 +00:00
Davide Italiano	d7b2a9981c	[LibCallsShrinkWrap] Remove an unnecessary class member variable. llvm-svn: 301477	2017-04-26 21:28:40 +00:00
Davide Italiano	11817ba2ea	[LibCallsShrinkWrap] More descriptive assertion messages. Fix a typo while I'm here. llvm-svn: 301474	2017-04-26 21:21:02 +00:00
Davide Italiano	3c3785fd1f	[LibCallsShrinkWrap] Remove some temporary cl::opt(s). The pass has been on and working for a while. llvm-svn: 301473	2017-04-26 21:19:05 +00:00
Davide Italiano	6abada8ab8	[LibCallsShrinkWrap] Teach the pass how to preserve the dominator. llvm-svn: 301471	2017-04-26 21:05:40 +00:00
Daniel Berlin	ede130d490	NewGVN: Use new SimplifyQuery based API llvm-svn: 301466	2017-04-26 20:56:14 +00:00
Daniel Berlin	2c75c63063	InstCombine: Use the new SimplifyQuery versions of Simplify*. Use AssumptionCache, DominatorTree, TargetLibraryInfo everywhere. llvm-svn: 301464	2017-04-26 20:56:07 +00:00
Daniel Berlin	c9f0a4f1ec	CorrelatedValuePropagation: Rename a variable for consistency llvm-svn: 301435	2017-04-26 17:41:46 +00:00
Craig Topper	b45eabcf82	[ValueTracking] Introduce a KnownBits struct to wrap the two APInts for computeKnownBits This patch introduces a new KnownBits struct that wraps the two APInt used by computeKnownBits. This allows us to treat them as more of a unit. Initially I've just altered the signatures of computeKnownBits and InstCombine's simplifyDemandedBits to pass a KnownBits reference instead of two separate APInt references. I'll do similar to the SelectionDAG version of computeKnownBits/simplifyDemandedBits as a separate patch. I've added a constructor that allows initializing both APInts to the same bit width with a starting value of 0. This reduces the repeated pattern of initializing both APInts. Once place default constructed the APInts so I added a default constructor for those cases. Going forward I would like to add more methods that will work on the pairs. For example trunc, zext, and sext occur on both APInts together in several places. We should probably add a clear method that can be used to clear both pieces. Maybe a method to check for conflicting information. A method to return (Zero\|One) so we don't write it out everywhere. Maybe a method for (Zero\|One).isAllOnesValue() to determine if all bits are known. I'm sure there are many other methods we can come up with. Differential Revision: https://reviews.llvm.org/D32376 llvm-svn: 301432	2017-04-26 16:39:58 +00:00
Sanjoy Das	2cbeb00f38	Reverts commit r301424, r301425 and r301426 Commits were: "Use WeakVH instead of WeakTrackingVH in AliasSetTracker's UnkownInsts" "Add a new WeakVH value handle; NFC" "Rename WeakVH to WeakTrackingVH; NFC" The changes assumed pointers are 8 byte aligned on all architectures. llvm-svn: 301429	2017-04-26 16:37:05 +00:00
Matthew Simpson	9eed0bee3d	[LV] Handle external uses of floating-point induction variables Reference: https://bugs.llvm.org/show_bug.cgi?id=32758 Differential Revision: https://reviews.llvm.org/D32445 llvm-svn: 301428	2017-04-26 16:23:02 +00:00
Sanjoy Das	01de557738	Rename WeakVH to WeakTrackingVH; NFC Summary: I plan to use WeakVH to mean "nulls itself out on deletion, but does not track RAUW" in a subsequent commit. Reviewers: dblaikie, davide Reviewed By: davide Subscribers: arsenm, mehdi_amini, mcrosier, mzolotukhin, jfb, llvm-commits, nhaehnle Differential Revision: https://reviews.llvm.org/D32266 llvm-svn: 301424	2017-04-26 16:20:52 +00:00
Haojian Wu	e43db0a834	Fix unused-variable warning caused by r301407. llvm-svn: 301411	2017-04-26 14:31:05 +00:00
Daniel Berlin	62aee14978	Convert LoopRotation to use SimplifyQuery version of SimplifyInstruction. Add AssumptionCache, DominatorTree, TLI if available. llvm-svn: 301407	2017-04-26 13:52:18 +00:00
Daniel Berlin	954006fde8	Convert SimplifyInstructions to use the SimplifyQuery version of SimplifyInstruction llvm-svn: 301406	2017-04-26 13:52:16 +00:00
Daniel Berlin	9bae449d78	Convert CVP to use SimplifyQuery version of SimplifyInstruction. Add AssumptionCache, DominatorTree, TLI if available. llvm-svn: 301405	2017-04-26 13:52:13 +00:00
Filipe Cabecinhas	92dc348773	Simplify the CFG after loop pass cleanup. Summary: Otherwise we might end up with some empty basic blocks or single-entry-single-exit basic blocks. This fixes PR32085 Reviewers: chandlerc, danielcdh Subscribers: mehdi_amini, RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D30468 llvm-svn: 301395	2017-04-26 12:02:41 +00:00
Matthias Braun	c36a78c3f3	SimplifyLibCalls: Fix crash on memset(notmalloc()) rdar://31520787 llvm-svn: 301352	2017-04-25 19:44:25 +00:00
Stanislav Mekhanoshin	f2db5434be	Skip bitcasts while looking for GEP in LoadStoreVectorizer Differential Revisison: https://reviews.llvm.org/D32101 llvm-svn: 301343	2017-04-25 18:00:08 +00:00
Craig Topper	09a5878d33	[InstCombine] Remove redundant code from SimplifyUsingDistributiveLaws The code I've removed here exists in ExpandBinOp in InstSimplify which we call into before SimplifyUsingDistributiveLaws. The code in InstSimplify looks to have been copied from here. I verified this code doesn't fire on any lit tests. Not that that proves its definitely dead. Differential Revision: https://reviews.llvm.org/D32472 llvm-svn: 301341	2017-04-25 17:54:12 +00:00
Craig Topper	f3dbd17d0a	[APInt] Use isSubsetOf, intersects, and bit counting methods to reduce temporary APInts This patch uses various APInt methods to reduce temporary APInt creation. This should be all of the unrelated cleanups that got buried in D32376(creating a KnownBits struct) as well as some pointed out by Simon during the review of that. Plus a few improvements to use counting instead of masking. I've left out any places where we do something like (KnownZero & KnownOne) != 0 as I plan to add a helper method to KnownBits to ask that question and didn't want to thrash that code an additional time. Differential Revision: https://reviews.llvm.org/D32495 llvm-svn: 301338	2017-04-25 17:46:30 +00:00
Davide Italiano	058abf1f61	[PM] Run IndirectCallPromotion only when PGO is enabled. Differential Revision: https://reviews.llvm.org/D32465 llvm-svn: 301327	2017-04-25 16:54:45 +00:00
Craig Topper	7603dce6b2	[InstCombine] Remove superfluous curly braces around a single line if body. NFC llvm-svn: 301326	2017-04-25 16:48:19 +00:00
Craig Topper	ba01143193	[InstCombine] Add missing commute handling to (A \| B) & (B ^ (~A)) -> (A & B) The matching here wasn't able to handle all the possible commutes. It always assumed the not would be on the left of the xor, but that's not guaranteed. Differential Revision: https://reviews.llvm.org/D32474 llvm-svn: 301316	2017-04-25 15:19:04 +00:00
Andrew Ng	1606fc0bf9	[SimplifyLibCalls] Fix infinite loop with fast-math optimization. One of the fast-math optimizations is to replace calls to standard double functions with their float equivalents, e.g. exp -> expf. However, this can cause infinite loops for the following: float expf(float val) { return (float) exp((double) val); } A similar inline declaration exists in the MinGW-w64 math.h header file which when compiled with -O2/3 and fast-math generates infinite loops. So this fix checks that the calling function to the standard double function that is being replaced does not match the float equivalent. Differential Revision: https://reviews.llvm.org/D31806 llvm-svn: 301304	2017-04-25 12:36:14 +00:00
Craig Topper	c4b48a32f0	[InstCombine] Use commutable matchers to reduce some code. NFC llvm-svn: 301294	2017-04-25 06:02:11 +00:00
Gil Rapaport	860f0a2bad	[LV] Remove redundant basic block split This patch is part of D28975's breakdown. Genreating the control-flow to guard predicated instructions modified to only use SplitBlockAndInsertIfThen() for producing the if-then construct. Differential Revision: https://reviews.llvm.org/D32224 llvm-svn: 301293	2017-04-25 05:57:22 +00:00
Xinliang David Li	f12a0faf88	[CodeExtractor]: Fixup use refs of the old phi. Differential Revision: http://reviews.llvm.org/D32468 llvm-svn: 301291	2017-04-25 04:51:19 +00:00
Akira Hatanaka	490397fc08	[ObjCARC] Do not sink an objc_retain past a clang.arc.use. We need to do this to prevent a miscompile which sinks an objc_retain past an objc_release that releases the object objc_retain retains. This happens because the top-down and bottom-up traversals each determines the insert point for retain or release individually without knowing where the other instruction is moved. For example, when the following IR is fed to the ARC optimizer, the top-down traversal decides to insert objc_retain right before objc_release and the bottom-up traversal decides to insert objc_release right after clang.arc.use. (IR before ARC optimizer) %11 = call i8* @objc_retain(i8* %10) call void (...) @clang.arc.use(%0* %5) call void @llvm.dbg.value(...) call void @objc_release(i8* %6) This reverses the order of objc_release and objc_retain, which causes the object to be destructed prematurely. (IR after ARC optimizer) call void (...) @clang.arc.use(%0* %5) call void @objc_release(i8* %6) call void @llvm.dbg.value(...) %11 = call i8* @objc_retain(i8* %10) rdar://problem/30530580 llvm-svn: 301289	2017-04-25 04:06:35 +00:00
Davide Italiano	5b65f12bfa	[SimplifyLibCalls] Remove a cl::opt that's been `true` for a long time. llvm-svn: 301288	2017-04-25 03:48:47 +00:00
Matt Arsenault	6d7f01e3d8	InferAddressSpaces: Use reference arguments instead of pointers llvm-svn: 301276	2017-04-24 23:42:41 +00:00
Matt Arsenault	e8d0539f20	InferAddressSpaces: Remove redundant assert This is just asserting all the operations are handled in the switch, which the unreachable already handles. llvm-svn: 301270	2017-04-24 23:02:57 +00:00
Sanjay Patel	35c362ebbb	[InstSimplify] use ConstantRange to simplify more and-of-icmps We can simplify (and (icmp X, C1), (icmp X, C2)) to one of the icmps in many cases. I had to check some of these with Alive to prove to myself it's right, but everything seems to check out. Eg, the code in instcombine was completely ignoring predicates with mismatched signedness. Handling or-of-icmps would be a follow-up step. Differential Revision: https://reviews.llvm.org/D32143 llvm-svn: 301260	2017-04-24 21:52:39 +00:00
Teresa Johnson	b2c390e9f5	Update profile during memory instrinsic optimization Summary: Ensure that the new merge BB (which contains the rest of the original BB after the mem op being optimized) gets a profile frequency, in case there are additional mem ops later in the BB. Otherwise they get skipped as the merge BB looks cold. Reviewers: davidxl, xur Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32447 llvm-svn: 301244	2017-04-24 20:30:42 +00:00
Matt Arsenault	4474652c95	Revert "StructurizeCFG: Directly invert cmp instructions" This reverts commit r300732. This breaks a few tests. I think the problem is related to adding more uses of the condition that don't yet exist at this point. llvm-svn: 301242	2017-04-24 20:25:01 +00:00
Davide Italiano	ca81fbcadb	[LoopUnroll] Remove spurious newline. Eli pointed out in the review, but I didn't squash the two commits correctly. Pointy-hat to me. llvm-svn: 301241	2017-04-24 20:17:38 +00:00
Davide Italiano	0f62eea7ff	[LoopUnroll] Don't try to unroll non canonical loops. The current Loop Unroll implementation works with loops having a single latch that contains a conditional branch to a block outside the loop (the other successor is, by defition of latch, the header). If this precondition doesn't hold, avoid unrolling the loop as the code is not ready to handle such circumstances. Differential Revision: https://reviews.llvm.org/D32261 llvm-svn: 301239	2017-04-24 20:14:11 +00:00
Sanjoy Das	206f65c049	[LIR] Obey non-integral pointer semantics Summary: See http://llvm.org/docs/LangRef.html#non-integral-pointer-type Reviewers: haicheng Reviewed By: haicheng Subscribers: mcrosier, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D32196 llvm-svn: 301238	2017-04-24 20:12:10 +00:00
Evgeniy Stepanov	9e536081fe	[asan] Let the frontend disable gc-sections optimization for asan globals. Also extend -asan-globals-live-support flag to all binary formats. llvm-svn: 301226	2017-04-24 19:34:13 +00:00
Mandeep Singh Grang	799a2edb3d	[SimplifyCFG] Fix for non-determinism in codegen Summary: This patch fixes issues in codegen uncovered due to https://reviews.llvm.org/D26718 Reviewers: majnemer, chenli, davide Reviewed By: davide Subscribers: davide, arsenm, llvm-commits Differential Revision: https://reviews.llvm.org/D26726 llvm-svn: 301222	2017-04-24 19:20:45 +00:00
Evgeniy Stepanov	58ccc0949a	Revert "Compute safety information in a much finer granularity." Use-after-free in llvm::isGuaranteedToExecute. llvm-svn: 301214	2017-04-24 18:25:07 +00:00
Sanjay Patel	0889225f51	[InstSimplify] move (A & ~B) \| (A ^ B) -> (A ^ B) from InstCombine This is a straight cut and paste, but there's a bigger problem: if this fold exists for simplifyOr, there should be a DeMorganized version for simplifyAnd. But more than that, we have a patchwork of ad hoc logic optimizations in InstCombine. There should be some structure to ensure that we're not missing sibling folds across and/or/xor. llvm-svn: 301213	2017-04-24 18:24:36 +00:00
Adrian Prantl	f2c7997013	Use DW_OP_stack_value when reconstructing variable values with arithmetic. When the location description of a source variable involves arithmetic on the value itself, it needs to be marked with DW_OP_stack_value since it is not describing the variable's location, but rather its value. This is a follow-up to r297971 and fixes the source testcase quoted in the comment in debuginfo-dce.ll. rdar://problem/30725338 This reapplies r301093 without modifications. llvm-svn: 301210	2017-04-24 18:11:42 +00:00
Matt Arsenault	02907f3039	InstCombine: Fix assert when reassociating fsub with undef There is logic to track the expected number of instructions produced. It thought in this case an instruction would be necessary to negate the result, but here it folded into a ConstantExpr fneg when the non-undef value operand was cancelled out by the second fsub. I'm not sure why we don't fold constant FP ops with undef currently, but I think that would also avoid this problem. llvm-svn: 301199	2017-04-24 17:24:37 +00:00
Xin Tong	a266923d57	Compute safety information in a much finer granularity. Summary: Instead of keeping a variable indicating whether there are early exits in the loop. We keep all the early exits. This improves LICM's ability to move instructions out of the loop based on is-guaranteed-to-execute. I am going to update compilation time as well soon. Reviewers: hfinkel, sanjoy, efriedma, mkuper Reviewed By: hfinkel Subscribers: llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D32433 llvm-svn: 301196	2017-04-24 17:12:22 +00:00
Nicolai Haehnle	9c66185315	InstCombine/AMDGPU: Fix constant folding of llvm.amdgcn.{icmp,fcmp} Summary: The return value of these intrinsics should always have 0 bits for inactive threads. This means that when all arguments are constant and the comparison evaluates to true, the intrinsic should return the current exec mask. Fixes some GL_ARB_shader_ballot tests. Reviewers: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D32344 llvm-svn: 301195	2017-04-24 17:08:43 +00:00
Xinliang David Li	db8d09b6c2	[PartialInine]: add triaging options There are more bugs (runtime failures) triggered when partial inlining is turned on. Add options to help triaging problems. llvm-svn: 301148	2017-04-23 23:39:04 +00:00
Sanjay Patel	e0c26e0640	[InstCombine] add/move folds for [not]-xor We handled all of the commuted variants for plain xor already, although they were scattered around and sometimes folded less efficiently using distributive laws. We had no folds for not-xor. Handling all of these patterns consistently is part of trying to reinstate: https://reviews.llvm.org/rL300977 llvm-svn: 301144	2017-04-23 22:00:02 +00:00
Xinliang David Li	15744ad87b	[PartialInlining] Add optimization remark support Differential Revision: http://reviews.llvm.org/D32387 llvm-svn: 301143	2017-04-23 21:40:58 +00:00
Xin Tong	f98602a1ab	[JumpThread] We want to fold (not thread) when all predecessor go to single BB's successor. Summary: In case all predecessor go to a single successor of current BB. We want to fold (not thread). I failed to update the phi nodes properly in the last patch https://reviews.llvm.org/rL300657. Phi nodes values are per predecessor in LLVM. Reviewers: sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32400 llvm-svn: 301139	2017-04-23 20:56:29 +00:00
Xin Tong	b7b081262a	Correct grammar. NFC llvm-svn: 301135	2017-04-23 17:36:25 +00:00
Sanjay Patel	d13b0bfdac	[InstCombine] add pattern matches for commuted variants of xor-to-xor There's probably some better way to write this that eliminates the code duplication without hurting readability, but at least this eliminates the logic holes and is hopefully slightly more efficient than creating new instructions. llvm-svn: 301129	2017-04-23 16:03:00 +00:00
Renato Golin	4abfb3d741	Revert "[APInt] Fix a few places that use APInt::getRawData to operate within the normal API." This reverts commit r301105, 4, 3 and 1, as a follow up of the previous revert, which broke even more bots. For reference: Revert "[APInt] Use operator<<= where possible. NFC" Revert "[APInt] Use operator<<= instead of shl where possible. NFC" Revert "[APInt] Use ashInPlace where possible." PR32754. llvm-svn: 301111	2017-04-23 12:15:30 +00:00
Craig Topper	5f68af0806	[APInt] Use operator<<= instead of shl where possible. NFC llvm-svn: 301103	2017-04-23 05:18:31 +00:00
Davide Italiano	5da7090256	[ThinLTO/Summary] Rename anonymous globals as last action ... ... in the per-TU -O0 pipeline. The problem is that there could be passes registered using `addExtensionsToPM()` introducing unnamed globals. Asan is an example, but there may be others. Building cppcheck with `-flto=thin` and `-fsanitize=address` triggers an assertion while we're reading bitcode (in lib/LTO), as the BitcodeReader assumes there are no unnamed globals (because the namer has run). Unfortunately I wasn't able to find an easy way to test this. I added a comment in the hope nobody moves this again. llvm-svn: 301102	2017-04-23 04:49:34 +00:00
Adrian Prantl	4677205010	Revert "Use DW_OP_stack_value when reconstructing variable values with arithmetic." This reverts commit r301093 while investigating stage2 bot breakage. llvm-svn: 301099	2017-04-23 00:44:40 +00:00
Adrian Prantl	a2d25ac14a	Use DW_OP_stack_value when reconstructing variable values with arithmetic. When the location description of a source variable involves arithmetic on the value itself, it needs to be marked with DW_OP_stack_value since it is not describing the variable's location, but rather its value. This is a follow-up to r297971 and fixes the source testcase quoted in the comment in debuginfo-dce.ll. rdar://problem/30725338 llvm-svn: 301093	2017-04-22 20:54:06 +00:00
Xinliang David Li	016a82ba51	[PartialInlining] Using existing hasAddressTaken interface to legality check/NFC llvm-svn: 301090	2017-04-22 19:24:19 +00:00
Sanjay Patel	3b863f8a1e	[InstCombine] use 'match' to reduce code; NFCI The later uses of dyn_castNotVal in this block are either incomplete (doesn't handle vector constants) or overstepping (shouldn't handle constants at all), but this first use is just unnecessary. 'I' is obviously not a constant, and it can't be a not-of-a-not because that would already be instsimplified. llvm-svn: 301088	2017-04-22 18:05:35 +00:00
Artur Pilipenko	0632bdc648	Fix for PR32740 - Invalid floating type, unreachable between r300969 and r301029 The bug was introduced by r301018 "[InstCombine] fadd double (sitofp x), y check that the promotion is valid". The patch didn't expect that fadd can be on vectors not necessarily scalars. Add vector support along with the test. llvm-svn: 301070	2017-04-22 07:24:52 +00:00
Matt Arsenault	01d17e7c5f	LowerSwitch: Fix producing invalid IR on unreachable code If a switch was in an unreachable block that branched to a block with a phi, it would leave phis with missing predecessors. llvm-svn: 301064	2017-04-21 23:54:12 +00:00
Matt Arsenault	c07bda7b87	InferAddressSpaces: Infer for just GEPs Fixes leaving intermediate flat addressing computations where a GEP instruction's source is a constant expression. Still leaves behind a trivial addrspacecast + gep pair that instcombine is able to handle, which ideally could be folded here directly. llvm-svn: 301044	2017-04-21 21:35:04 +00:00
Xinliang David Li	0e9f6df169	[PartialInliner] Partial inliner needs to check use kind before transformation Differential Revision: https://reviews.llvm.org/D32373 llvm-svn: 301042	2017-04-21 21:20:56 +00:00
Sanjay Patel	8ce1d4cbe1	[InstCombine] revert r300977 and r301021 This can cause an inf-loop. Investigating... llvm-svn: 301035	2017-04-21 20:29:17 +00:00
Adrian Prantl	1a18f1ad10	typo llvm-svn: 301030	2017-04-21 20:06:41 +00:00
Sanjay Patel	0f001a4701	[InstCombine] use isSubsetOf() for efficiency C \| ~D == -1 ~(C \| ~D) == 0 ~C & D == 0 D & ~C == 0 D.isSubsetOf(C) llvm-svn: 301021	2017-04-21 19:16:52 +00:00
Artur Pilipenko	134d94f9a3	[InstCombine] fadd double (sitofp x), y check that the promotion is valid Doing these transformations check that the result of integer addition is representable in the FP type. (fadd double (sitofp x), fpcst) --> (sitofp (add int x, intcst)) (fadd double (sitofp x), (sitofp y)) --> (sitofp (add int x, y)) This is a fix for https://bugs.llvm.org//show_bug.cgi?id=27036 Reviewed By: andrew.w.kaylor, scanon, spatel Differential Revision: https://reviews.llvm.org/D31182 llvm-svn: 301018	2017-04-21 18:45:25 +00:00
Craig Topper	7af078847c	[SimplifyCFG] Fix the determination of PostBB in conditional store merging to handle the targets on the second branch being commuted Currently we choose PostBB as the single successor of QFB, but its possible that QTB's single successor is QFB which would make QFB the correct choice. Differential Revision: https://reviews.llvm.org/D32323 llvm-svn: 300992	2017-04-21 15:53:42 +00:00
Wei Mi	337d4d95c2	[ConstHoisting] Add BFI in constanthoisting pass and select the best insertion places based on it. Existing constant hoisting pass will merge a group of contants in a small range and hoist the const materialization code to the common dominator of their uses. However, if the uses are all in cold pathes, existing implementation may hoist the materialization code from cold pathes to a hot place. This may hurt performance. The patch introduces BFI to the pass and selects the best insertion places based on it. The change is controlled by an option consthoist-with-block-frequency which is off by default for now. Differential Revision: https://reviews.llvm.org/D28962 llvm-svn: 300989	2017-04-21 15:50:16 +00:00
Matthew Simpson	e2037d24f9	[LV] Model if-converted phi node costs Phi nodes in non-header blocks are converted to select instructions after if-conversion. This patch updates the cost model to account for the selects. Differential Revision: https://reviews.llvm.org/D31906 llvm-svn: 300980	2017-04-21 14:14:54 +00:00
Sanjay Patel	347b54b093	[InstCombine] prefer xor with -1 because 'not' is easier to understand (PR32706) This matches the demanded bits behavior in the DAG and should fix: https://bugs.llvm.org/show_bug.cgi?id=32706 Differential Revision: https://reviews.llvm.org/D32255 llvm-svn: 300977	2017-04-21 14:03:54 +00:00
Davide Italiano	fa15de34b7	[PartialInliner] Fix crash when inlining functions with unreachable blocks. CodeExtractor looks up the dominator node corresponding to return blocks when splitting them. If one of these blocks is unreachable, there's no node in the Dom and CodeExtractor crashes because it doesn't check for domtree node validity. In theory, we could add just a check for skipping null DTNodes in `splitReturnBlock` but the fix I propose here is slightly different. To the best of my knowledge, unreachable blocks are irrelevant for the algorithm, therefore we can just skip them when building the candidate set in the constructor. Differential Revision: https://reviews.llvm.org/D32335 llvm-svn: 300946	2017-04-21 04:25:00 +00:00
Davide Italiano	059574c537	[CodeExtractor] Remove an unneeded level of indirection. NFCI. llvm-svn: 300931	2017-04-21 00:21:09 +00:00
Craig Topper	358cd9ae3a	[InstCombine] Remove the zextOrTrunc from ShrinkDemandedConstant. The demanded mask and the constant should always be the same width for all callers today. Also stop copying the demanded mask as its passed in. We should avoid allocating memory unless we are going to do something. The final AND to create the new constant will take care of it. llvm-svn: 300927	2017-04-20 23:58:27 +00:00
Sanjay Patel	cc663b82fa	[InstCombine] function names start with lower-case letter; NFC Forgot to make this fix with the signature change in r300911. llvm-svn: 300912	2017-04-20 22:37:01 +00:00
Sanjay Patel	c9485ca895	[InstCombine] allow shl+shr demanded bits folds with splat constants llvm-svn: 300911	2017-04-20 22:33:54 +00:00
Xinliang David Li	99e3ca1526	Use basicblock split block utility function Instead of calling BasicBlock::SplitBasicBlock directly in CodeExtractor. Differential Revision: https://reviews.llvm.org/D32308 llvm-svn: 300899	2017-04-20 21:40:22 +00:00
Sanjay Patel	3e1ae72fcf	[InstCombine] allow shl demanded bits folds with splat constants More fixes are needed to enable the helper SimplifyShrShlDemandedBits(). llvm-svn: 300898	2017-04-20 21:33:02 +00:00
Craig Topper	ff23889609	[InstCombine] Use APInt::intersects and APInt::isSubsetOf to improve a few more places in SimplifyDemandedBits. llvm-svn: 300896	2017-04-20 21:24:37 +00:00
Sanjay Patel	fb5b3e773a	[InstCombine] allow ashr/lshr demanded bits folds with splat constants llvm-svn: 300888	2017-04-20 20:59:02 +00:00
Craig Topper	17f37ba3b9	[InstCombine] Use APInt::isSubsetOf to simplify some code in SimplifyDemandedBits. NFC This allows us to use less temporary APInt for And and Invert operations. llvm-svn: 300885	2017-04-20 20:47:35 +00:00
Craig Topper	0ec3f2f39a	[InstCombine] Remove redundant code from SimplifyDemandedBits handling for Or. The code above it is equivalent if you work through the bitwise math. llvm-svn: 300876	2017-04-20 19:31:22 +00:00
Davide Italiano	b965121ba8	[CodeExtractor] Remove a bunch of unneeded constructors. Differential Revision: https://reviews.llvm.org/D32305 llvm-svn: 300869	2017-04-20 18:33:40 +00:00
Craig Topper	bcfd2d1789	[APInt] Rename getSignBit to getSignMask getSignBit is a static function that creates an APInt with only the sign bit set. getSignMask seems like a better name to convey its functionality. In fact several places use it and then store in an APInt named SignMask. Differential Revision: https://reviews.llvm.org/D32108 llvm-svn: 300856	2017-04-20 16:56:25 +00:00
Craig Topper	a8129a1122	[APInt] Add isSubsetOf method that can check if one APInt is a subset of another without creating temporary APInts This question comes up in many places in SimplifyDemandedBits. This makes it easy to ask without allocating additional temporary APInts. The BitVector class provides a similar functionality through its (IMHO badly named) test(const BitVector&) method. Though its output polarity is reversed. I've provided one example use case in this patch. I plan to do more as a follow up. Differential Revision: https://reviews.llvm.org/D32258 llvm-svn: 300851	2017-04-20 16:17:13 +00:00
Craig Topper	83dc1c60aa	In SimplifyDemandedUseBits, use computeKnownBits directly to handle Constants Currently we don't explicitly process ConstantDataSequential, ConstantAggregateZero, or ConstantVector, or Undef before applying the Depth limit. Instead they occur after the depth check in the non-instruction path. For the constant types that we do handle, the code is replicated from computeKnownBits. This patch fixes the missing constant handling and the reduces the amount of code by just using computeKnownBits directly for any type of Constant. Differential Revision: https://reviews.llvm.org/D32123 llvm-svn: 300849	2017-04-20 16:14:58 +00:00
Reid Kleckner	f1de9e83c2	[DAE] Simplify attribute list creation, NFC Removes a use of getSlotAttributes, which I intend to change. llvm-svn: 300795	2017-04-19 23:45:45 +00:00
Reid Kleckner	0a5ed3d5dc	[GlobalOpt] Simplify attribute code stripping nest, NFC llvm-svn: 300787	2017-04-19 23:26:44 +00:00
Reid Kleckner	aa0cec7d6d	Simplify test for sret attribute in instcombine This change is correct because the verifier requires that at most one argument be marked 'sret'. NFC, removes a use of AttributeList slot APIs. llvm-svn: 300784	2017-04-19 23:17:47 +00:00
Kostya Serebryany	c5d3d49034	[sanitizer-coverage] remove some more stale code llvm-svn: 300778	2017-04-19 22:42:11 +00:00
Evgeniy Stepanov	7c9b086ef5	Remove two unused variables (-Werror). llvm-svn: 300777	2017-04-19 22:27:23 +00:00
Kostya Serebryany	be87d480ff	[sanitizer-coverage] remove stale code llvm-svn: 300769	2017-04-19 21:48:09 +00:00
Craig Topper	9b71a402c2	[APInt] Cast calls to add/sub/mul overflow methods to void if only their overflow bool out param is used. This is preparation for a clang change to improve the [[nodiscard]] warning to not be ignored on methods that return a class marked [[nodiscard]] that are defined in the class itself. See D32207. We should consider adding wrapper methods to APInt that return the overflow flag directly and discard the APInt result. This would eliminate the void casts and the need to create a bool before the call to pass to the out param. llvm-svn: 300758	2017-04-19 21:09:45 +00:00
Matt Arsenault	d3406bc45c	StructurizeCFG: Directly invert cmp instructions The most common case for a branch condition is a single use compare. Directly invert the branch predicate rather than adding a lot of xor i1 true which the DAG will have to fold later. This produces nicer to read structurizer output. This produces some random changes in codegen due to the DAG swapping branch conditions itself, and then does a poor job of dealing with those inverts. llvm-svn: 300732	2017-04-19 18:29:07 +00:00
Sanjoy Das	5945447d84	[GVN] Don't coerce non-integral pointers to integers or vice versa Summary: See http://llvm.org/docs/LangRef.html#non-integral-pointer-type The NewGVN test does not fail without these changes (perhaps it does try to coerce pointers <-> integers to begin with?), but I added the test case anyway. Reviewers: dberlin Subscribers: mcrosier, llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D32208 llvm-svn: 300730	2017-04-19 18:21:09 +00:00
Reid Kleckner	9d16fa09c6	Prefer addAttr(Attribute::AttrKind) over the AttributeList overload This should simplify the call sites, which typically want to tweak one attribute at a time. It should also avoid creating ephemeral AttributeLists that live forever. llvm-svn: 300718	2017-04-19 17:28:52 +00:00
Davide Italiano	ffcb4df204	[InstCombine] Reduce visitLoadInst() code duplication. NFCI. llvm-svn: 300717	2017-04-19 17:26:57 +00:00
Chandler Carruth	ae3386aa74	Revert r300657 due to crashes in stage2 of bootstraps: http://lab.llvm.org:8011/builders/clang-with-lto-ubuntu/builds/2476/steps/build-stage2-LLVMgold.so/logs/stdio http://bb.pgr.jp/builders/clang-3stage-x86_64-linux/builds/15036/steps/build_llvmclang/logs/stdio I've updated the commit thread, reverting to get the bots back to green. Original commit summary: [JumpThread] We want to fold (not thread) when all predecessor go to single BB's successor. llvm-svn: 300662	2017-04-19 06:23:20 +00:00
Xin Tong	636a332906	[JumpThread] We want to fold (not thread) when all predecessor go to single BB's successor. . Summary: In case all predecessor go to a single successor of current BB. We want to fold (not thread). Reviewers: efriedma, sanjoy Reviewed By: sanjoy Subscribers: dberlin, majnemer, llvm-commits Differential Revision: https://reviews.llvm.org/D30869 llvm-svn: 300657	2017-04-19 05:15:57 +00:00
Sanjoy Das	f09c1e346e	Add a getPointerOperandType() helper to LoadInst and StoreInst; NFC I will use this in a later change. llvm-svn: 300613	2017-04-18 22:00:54 +00:00
Davide Italiano	80fe987b42	[LoopReroll] Prefer hasNUses/hasNUses or more as they're cheaper. NFCI. llvm-svn: 300607	2017-04-18 21:42:21 +00:00
Daniel Berlin	9d0042b47c	NewGVN: Fix memory congruence verification. The return true should be a return false. Merge the appropriate if statements so it doesn't happen again. llvm-svn: 300584	2017-04-18 20:15:47 +00:00
Easwaran Raman	76aba5f6d7	[SLP vectorizer] Allow phi node reordering in tryToVectorizeList. In tryToVectorizeList, under a very limited circumstance (when entered from tryToVectorizePair), the values may be reordered (swapped) and the SLP tree is built with the new order. This extends that to the case when starting from phis in vectorizeChainsInBlock when there are exactly two phis. The textual order of phi nodes shouldn't really matter. Without this change, the loop body in the accompnaying test case is fully vectorized when we swap the orde of the phis but not with this order. While this doesn't solve the phi-ordering problem in a general way (for more than 2 phis), this is simple fix that piggybacks on an existing mechanism and is useful in cases like multiplying two complex numbers. Differential revision: https://reviews.llvm.org/D32065 llvm-svn: 300574	2017-04-18 18:16:57 +00:00
Craig Topper	fc947bcfba	[APInt] Use lshrInPlace to replace lshr where possible This patch uses lshrInPlace to replace code where the object that lshr is called on is being overwritten with the result. This adds an lshrInPlace(const APInt &) version as well. Differential Revision: https://reviews.llvm.org/D32155 llvm-svn: 300566	2017-04-18 17:14:21 +00:00
Daniel Berlin	ec9deb7f54	NewGVN: Don't waste time value numbering unreachable blocks llvm-svn: 300565	2017-04-18 17:06:11 +00:00
Zvi Rackover	d942397e24	LoopRerollPass: Prefer Value::hasOneUse() over Value::getNumUses(). NFC. getNumUses() can be more expensive as it iterates over all list's elements. llvm-svn: 300558	2017-04-18 14:55:43 +00:00
Gil Rapaport	fb1d915ab2	[LV] Cache block mask values This patch is part of D28975's breakdown. Add caching for block masks similar to the cache already used for edge masks, replacing generation per user with reusing the first generated value which dominates all uses. Differential Revision: https://reviews.llvm.org/D32054 llvm-svn: 300557	2017-04-18 14:43:43 +00:00
Nikolai Bozhenov	9e4a1c39db	[GVNHoist] Mark GlobalsAA as preserved by GVNHoist. Reviewers: sebpop, hiraditya Reviewed By: sebpop Subscribers: n.bozhenov, llvm-commits Differential Revision: https://reviews.llvm.org/D32158 Patch by Andrei Elovikov <andrei.elovikov@intel.com> llvm-svn: 300552	2017-04-18 13:25:49 +00:00
Andrea Di Biagio	517e3fc34c	[SampleProfile] Don't assert when printing the DebugLoc of a branch. NFC. llvm-svn: 300544	2017-04-18 11:27:58 +00:00
Andrea Di Biagio	e3edef0977	[SampleProfile] Skip intrinsic calls when visiting callsites in InlineHotFunctions. Before this patch, we always called method 'findCalleeFunctionSamples()' on intrinsic calls. However, intrinsic calls like llvm.dbg.value() are not viable candidates for obvious reasons. No functional change intended. Differential Revision: https://reviews.llvm.org/D32008 llvm-svn: 300541	2017-04-18 10:08:53 +00:00
Adrian Prantl	6825fb64e9	PR32382: Fix emitting complex DWARF expressions. The DWARF specification knows 3 kinds of non-empty simple location descriptions: 1. Register location descriptions - describe a variable in a register - consist of only a DW_OP_reg 2. Memory location descriptions - describe the address of a variable 3. Implicit location descriptions - describe the value of a variable - end with DW_OP_stack_value & friends The existing DwarfExpression code is pretty much ignorant of these restrictions. This used to not matter because we only emitted very short expressions that we happened to get right by accident. This patch makes DwarfExpression aware of the rules defined by the DWARF standard and now chooses the right kind of location description for each expression being emitted. This would have been an NFC commit (for the existing testsuite) if not for the way that clang describes captured block variables. Based on how the previous code in LLVM emitted locations, DW_OP_deref operations that should have come at the end of the expression are put at its beginning. Fixing this means changing the semantics of DIExpression, so this patch bumps the version number of DIExpression and implements a bitcode upgrade. There are two major changes in this patch: I had to fix the semantics of dbg.declare for describing function arguments. After this patch a dbg.declare always takes the address of a variable as the first argument, even if the argument is not an alloca. When lowering a DBG_VALUE, the decision of whether to emit a register location description or a memory location description depends on the MachineLocation — register machine locations may get promoted to memory locations based on their DIExpression. (Future) optimization passes that want to salvage implicit debug location for variables may do so by appending a DW_OP_stack_value. For example: DBG_VALUE, [RBP-8] --> DW_OP_fbreg -8 DBG_VALUE, RAX --> DW_OP_reg0 +0 DBG_VALUE, RAX, DIExpression(DW_OP_deref) --> DW_OP_reg0 +0 All testcases that were modified were regenerated from clang. I also added source-based testcases for each of these to the debuginfo-tests repository over the last week to make sure that no synchronized bugs slip in. The debuginfo-tests compile from source and run the debugger. https://bugs.llvm.org/show_bug.cgi?id=32382 <rdar://problem/31205000> Differential Revision: https://reviews.llvm.org/D31439 llvm-svn: 300522	2017-04-18 01:21:53 +00:00
Dehao Chen	1ea8bd8109	Build SymbolMap in SampleProfileLoader to help matchin function names with suffix. Summary: If there is suffix added in the function name (e.g. module hash added by thinLTO), we will not be able to find a match in profile as the suffix does not exist in profile. This patch build a map from function name to Function *. The map includes the entry for the stripped function name so that inlineHotFunctions can find the corresponding function to promote/inline. Reviewers: davidxl, dnovillo, tejohnson Reviewed By: davidxl Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D31952 llvm-svn: 300507	2017-04-17 22:23:05 +00:00
Craig Topper	c228068d90	[SimplifyCFG] Use hasNUses instead of comparing getNumUses to a constant." The use list is a linked list so getNumUses requires a linear scan through the whole list. hasNUses will stop scanning at N and see if that is the end. llvm-svn: 300505	2017-04-17 22:13:00 +00:00
Davide Italiano	cdc937d0fc	[InstCombine] Matchers work with both ConstExpr and Instructions. So, `cast<Instruction>` is not guaranteed to succeed. Change the code so that we create a new constant and use it in the newly created instruction, as it's done in other places in InstCombine. OK'ed by Sanjay/Craig. Fixes PR32686. llvm-svn: 300495	2017-04-17 20:49:50 +00:00
Peter Collingbourne	a0f371a106	Bitcode: Add a string table to the bitcode format. Add a top-level STRTAB block containing a string table blob, and start storing strings for module codes FUNCTION, GLOBALVAR, ALIAS, IFUNC and COMDAT in the string table. This change allows us to share names between globals and comdats as well as between modules, and improves the efficiency of loading bitcode files by no longer using a bit encoding for symbol names. Once we start writing the irsymtab to the bitcode file we will also be able to share strings between it and the module. On my machine, link time for Chromium for Linux with ThinLTO decreases by about 7% for no-op incremental builds or about 1% for full builds. Total bitcode file size decreases by about 3%. As discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2017-April/111732.html Differential Revision: https://reviews.llvm.org/D31838 llvm-svn: 300464	2017-04-17 17:51:36 +00:00
Craig Topper	d23004c37b	Introduce APInt::isSignBitSet/isSignBitClear. Use in place isSignBitSet in place of isNegative in known bits tracking. This makes statements like KnownZero.isNegative() (which means the value we're tracking is positive) less confusing. llvm-svn: 300457	2017-04-17 16:38:20 +00:00
Matt Arsenault	7205f3c2e4	AMDGPU: SimplifyDemandedElts for image intrinsics Causes some VGPR usage improvements in shaderdb, but introduces some SGPR spilling regressions due to random scheduling changes later. llvm-svn: 300453	2017-04-17 15:12:44 +00:00
Davide Italiano	ce161a7812	[LCSSA] Don't insert tokens into the worklist at all. We're gonna skip them anyway, so there's no point in inserting them in the first place. llvm-svn: 300452	2017-04-17 14:32:05 +00:00
Max Kazantsev	751579cac0	[LoopPeeling] Get rid of Phis that become invariant after N steps This patch is a generalization of the improvement introduced in rL296898. Previously, we were able to peel one iteration of a loop to get rid of a Phi that becomes an invariant on the 2nd iteration. In more general case, if a Phi becomes invariant after N iterations, we can peel N times and turn it into invariant. In order to do this, we for every Phi in loop's header we define the Invariant Depth value which is calculated as follows: Given %x = phi <Inputs from above the loop>, ..., [%y, %back.edge]. If %y is a loop invariant, then Depth(%x) = 1. If %y is a Phi from the loop header, Depth(%x) = Depth(%y) + 1. Otherwise, Depth(%x) is infinite. Notice that if we peel a loop, all Phis with Depth = 1 become invariants, and all other Phis with finite depth decrease the depth by 1. Thus, peeling N first iterations allows us to turn all Phis with Depth <= N into invariants. Reviewers: reames, apilipenko, mkuper, skatkov, anna, sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31613 llvm-svn: 300446	2017-04-17 09:52:02 +00:00
Max Kazantsev	8ed6b66d85	[LoopPeeling] Fix condition for phi-eliminating peeling When peeling loops basing on phis becoming invariants, we make a wrong loop size check. UP.Threshold should be compared against the total numbers of instructions after the transformation, which is equal to 2 * LoopSize in case of peeling one iteration. We should also check that the maximum allowed number of peeled iterations is not zero. Reviewers: sanjoy, anna, reames, mkuper Reviewed By: mkuper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31753 llvm-svn: 300441	2017-04-17 05:38:28 +00:00
Craig Topper	218a359fbd	[InstCombine] Simplify 1/X for vectors. llvm-svn: 300439	2017-04-17 03:41:47 +00:00
Craig Topper	1a18a7c51e	[InstCombine] Add support for vector srem->urem. llvm-svn: 300437	2017-04-17 01:51:24 +00:00
Craig Topper	f248468359	[InstCombine] Add support for turning vector sdiv into udiv. llvm-svn: 300435	2017-04-17 01:51:19 +00:00
Davide Italiano	ee654bf5f1	[LCSSA] Simplify a loop. NFCI. llvm-svn: 300433	2017-04-17 00:02:45 +00:00
Craig Topper	da886c665b	[InstCombine][ValueTracking] When computing known bits for Srem make sure we don't compute known bits for the LHS twice. If we already called computeKnownBits for the RHS being a constant power of 2, we've already computed everything we can and should just stop. I think previously we would still recurse if we had determined the result was negative or had not determined the sign bit at all. llvm-svn: 300432	2017-04-16 21:46:12 +00:00
Davide Italiano	dd37c67d81	[LCSSA] Fix non-determinism due to iterating over a SmallPtrSet. Use a SmallSetVector instead. llvm-svn: 300431	2017-04-16 21:07:04 +00:00
Craig Topper	0d304f01b4	[InstCombine] In SimplifyDemandedUseBits, don't bother to mask known bits of constants with DemandedMask. Just because we didn't demand them doesn't mean they aren't known. llvm-svn: 300430	2017-04-16 20:55:58 +00:00
Michael Zuckerman	16b20d2fc5	[X86][X86 intrinsics]Folding cmp(sub(a,b),0) into cmp(a,b) optimization This patch adds new optimization (Folding cmp(sub(a,b),0) into cmp(a,b)) to instCombineCall pass and was written specific for X86 CMP intrinsics. Differential Revision: https://reviews.llvm.org/D31398 llvm-svn: 300422	2017-04-16 13:26:08 +00:00
Sanjay Patel	ef9f586bb2	[InstCombine] allow (X != C1 && X != C2) and similar patterns to match splat vector constants llvm-svn: 300402	2017-04-15 17:55:06 +00:00
Vedant Kumar	1a6a2b642b	[ProfileData] Unify getInstrProfSectionName helpers This is a version of D32090 that unifies all of the `getInstrProfSectionName` helper functions. (Note: the build failures which D32090 would have addressed were fixed with r300352.) We should unify these helper functions because they are hard to use in their current form. E.g we recently introduced more helpers to fix section naming for COFF files. This scheme doesn't totally succeed at hiding low-level details about section naming, so we should switch to an API that is easier to maintain. This is not an NFC commit because it fixes llvm-cov's testing support for COFF files (this falls out of the API change naturally). This is an area where we lack tests -- I will see about adding one as a follow up. Testing: check-clang, check-profile, check-llvm. Differential Revision: https://reviews.llvm.org/D32097 llvm-svn: 300381	2017-04-15 00:09:57 +00:00
Craig Topper	9a458cd517	[InstCombine] MakeAnd/Or/Xor handling to reuse previous APInt computations When checking if we should return a constant, we create some temporary APInts to see if we know all bits. But the exact computations we do are needed in several other locations in the same code. This patch moves them to named temporaries so we can reuse them. Ideally we'd write directly to KnownZero/One, but we currently seem to only write those variables after all the simplifications checks and I didn't want to change that with this patch. Differential Revision: https://reviews.llvm.org/D32094 llvm-svn: 300376	2017-04-14 22:34:14 +00:00
Reid Kleckner	fb502d2f5e	[IR] Make paramHasAttr to use arg indices instead of attr indices This avoids the confusing 'CS.paramHasAttr(ArgNo + 1, Foo)' pattern. Previously we were testing return value attributes with index 0, so I introduced hasReturnAttr() for that use case. llvm-svn: 300367	2017-04-14 20:19:02 +00:00
Sanjay Patel	7cfe41659c	[InstCombine] (X != C1 && X != C2) --> (X \| (C1 ^ C2)) != C2 ...when C1 differs from C2 by one bit and C1 <u C2: http://rise4fun.com/Alive/Vuo And move related folds to a helper function. This reduces code duplication and will make it easier to remove the scalar-only restriction as a follow-up step. llvm-svn: 300364	2017-04-14 19:23:50 +00:00
Craig Topper	fb71b7d3e0	[InstCombine] Support folding a subtract with a constant LHS into a phi node We currently only support folding a subtract into a select but not a PHI. This fixes that. I had to fix an assumption in FoldOpIntoPhi that assumed the PHI node was always in operand 0. Now we pass it in like we do for FoldOpIntoSelect. But we still require some dancing to find the Constant when we create the BinOp or ConstantExpr. This is based code is similar to what we do for selects. Since I touched all call sites, this also renames FoldOpIntoPhi to foldOpIntoPhi to match coding standards. Differential Revision: https://reviews.llvm.org/D31686 llvm-svn: 300363	2017-04-14 19:20:12 +00:00
Craig Topper	c22c7b1459	[InstCombine] Refactor SimplifyUsingDistributiveLaws to more explicitly skip code when LHS/RHS aren't BinaryOperators Currently this code always makes 2 or 3 calls to tryFactorization regardless of whether the LHS/RHS are BinaryOperators. We make 3 calls when both operands are BinaryOperators with the same opcode. Or surprisingly, when neither are BinaryOperators. This is because getBinOpsForFactorization returns Instruction::BinaryOpsEnd when the operand is not a BinaryOperator. If both LHS and RHS are not BinaryOperators then they both have an Opcode of Instruction::BinaryOpsEnd. When this happens we rely on tryFactorization to early out due to A/B/C/D being null. Similar behavior occurs for the other calls, we rely on getBinOpsForFactorization having made A/B or C/D null to get tryFactorization to early out. We also rely on these null checks to check the result of getIdentityValue and early out for it. This patches refactors this to pull these checks up to SimplifyUsingDistributiveLaws so we don't rely on BinaryOpsEnd as a sentinel or this A/B/C/D null behavior. I think this makes this code easier to reason about. Should also give a tiny performance improvement for cases where the LHS or RHS isn't a BinaryOperator. Differential Revision: https://reviews.llvm.org/D31913 llvm-svn: 300353	2017-04-14 17:55:41 +00:00
Davide Italiano	91239088a1	[FunctionImport] assert(false) -> llvm_unreachable(). NFCI. llvm-svn: 300344	2017-04-14 17:22:02 +00:00
Sanjoy Das	e3a15e832c	Tighten the API for ScalarEvolutionNormalization llvm-svn: 300331	2017-04-14 15:49:59 +00:00
Sanjoy Das	ac9f3ea0b4	Remove NormalizeAutodetect; NFC It is cleaner to have a callback based system where the logic of whether an add recurrence is normalized or not lives on IVUsers. This is one step in a multi-step cleanup. llvm-svn: 300330	2017-04-14 15:49:53 +00:00
Gil Rapaport	334f8fbe47	[LV] Remove implicit single basic block assumption This patch is part of D28975's breakdown - no change in output intended. LV's code currently assumes the vectorized loop is a single basic block up until predicateInstructions() is called. This patch removes two manifestations of this assumption (loop phi incoming values, dominator tree update) by replacing the use of vectorLoopBody with the vectorized loop's latch/header. Differential Revision: https://reviews.llvm.org/D32040 llvm-svn: 300310	2017-04-14 07:30:23 +00:00
Craig Topper	c9a4fc0750	[InstCombine] Use APInt::setSignBit and APInt::isNegative(). NFC llvm-svn: 300305	2017-04-14 05:09:04 +00:00
Xinliang David Li	9a71766751	Fix test failure on windows: pass module to getInstrProfXXName calls llvm-svn: 300302	2017-04-14 03:03:24 +00:00
Daniel Berlin	2f72b19b05	NewGVN: Don't propagate over phi backedges where undef causes us to have >1 value, unless we can prove the phi node is cycle free. Fixes PR 32607. llvm-svn: 300299	2017-04-14 02:53:37 +00:00
Xinliang David Li	57dea2d359	[Profile] PE binary coverage bug fix PR/32584 Differential Revision: https://reviews.llvm.org/D32023 llvm-svn: 300277	2017-04-13 23:37:12 +00:00
Reid Kleckner	f021fab2af	[IR] Make getParamAttributes take argument numbers, not ArgNo+1 Add hasParamAttribute() and use it instead of hasAttribute(ArgNo+1, Kind) everywhere. The fact that the AttributeList index for an argument is ArgNo+1 should be a hidden implementation detail. NFC llvm-svn: 300272	2017-04-13 23:12:13 +00:00
Craig Topper	e7563f8dda	[InstCombine] Use APInt::getBitsSetFrom instead of inverting the result of getLowBitsSet. NFC llvm-svn: 300265	2017-04-13 21:49:48 +00:00
Davide Italiano	af36d02430	[LCSSA] Efficiently compute blocks dominating at least one exit. For LCSSA purposes, loop BBs not dominating any of the exits aren't interesting, as none of the values defined in these blocks can be used outside the loop. The way the code computed this information was by comparing each BB of the loop with each of the exit blocks and ask the dominator tree about their dominance relation. This is slow. A more efficient way, implemented here, is that of starting from the exit blocks and walking the dom upwards until we hit an header. By transitivity, all the blocks we encounter in our path dominate an exit. For the testcase provided in PR31851, this reduces compile time on `opt -O2` by ~25%, going from 1m47s to 1m22s. Thanks to Dan/MichaelZ for discussions/suggesting the approach/review. Differential Revision: https://reviews.llvm.org/D31843 llvm-svn: 300255	2017-04-13 20:36:59 +00:00
Richard Smith	6c2615177b	Revert accidentally-committed files in r300252. llvm-svn: 300253	2017-04-13 20:31:21 +00:00
Richard Smith	55bd375b69	Remove all allocation and divisions from GreatestCommonDivisor Switch from Euclid's algorithm to Stein's algorithm for computing GCD. This avoids the (expensive) APInt division operation in favour of bit operations. Remove all memory allocation from within the GCD loop by tweaking our `lshr` implementation so it can operate in-place. Differential Revision: https://reviews.llvm.org/D31968 llvm-svn: 300252	2017-04-13 20:29:59 +00:00
Reid Kleckner	257cb4e099	[InstCombine] Fix !prof metadata preservation for invokes Summary: Bug noticed by inspection. Extend the test to handle invokes as well as calls, and rewrite it to not depend on the inliner and other passes. Also simplify the call site replacement code with CallSite, similar to what I did to dead arg elimination and arg promotion (rL300235 and rL300229). Reviewers: danielcdh, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32041 llvm-svn: 300251	2017-04-13 20:26:38 +00:00
Davide Italiano	0b30227f75	[LCSSA] Assert that we always have a valid loop. We could otherwise add BBs not belonging to a loop in `formLCSSA` and later crash when trying to iterate the loop blocks. llvm-svn: 300244	2017-04-13 20:05:37 +00:00
Davide Italiano	549078d1ab	[LCSSA] Remove spurious whitespaces. NFCI. llvm-svn: 300243	2017-04-13 20:02:27 +00:00
Davide Italiano	5129951296	[LCSSA] Use `auto` when the type is obvious. NFCI. llvm-svn: 300242	2017-04-13 20:01:30 +00:00
Dehao Chen	2c7ca9b5df	SamplePGO: convert callsite samples map key from callsite_location to callsite_location+callee_name Summary: For iterative SamplePGO, an indirect call can be speculatively promoted to multiple direct calls and get inlined. All these promoted direct calls will share the same callsite location (offset+discriminator). With the current implementation, we cannot distinguish between different promotion candidates and its inlined instance. This patch adds callee_name to the key of the callsite sample map. And added helper functions to get all inlined callee samples for a given callsite location. This helps the profile annotator promote correct targets and inline it before annotation, and ensures all indirect call targets to be annotated correctly. Reviewers: davidxl, dnovillo Reviewed By: davidxl Subscribers: andreadb, llvm-commits Differential Revision: https://reviews.llvm.org/D31950 llvm-svn: 300240	2017-04-13 19:52:10 +00:00
Anna Thomas	dcdb325fee	[LV] Fix the vector code generation for first order recurrence Summary: In first order recurrences where phi's are used outside the loop, we should generate an additional vector.extract of the second last element from the vectorized phi update. This is because we require the phi itself (which is the value at the second last iteration of the vector loop) and not the phi's update within the loop. Also fix the code gen when we just unroll, but don't vectorize. Fixes PR32396. Reviewers: mssimpso, mkuper, anemet Subscribers: llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D31979 llvm-svn: 300238	2017-04-13 18:59:25 +00:00
Sanjay Patel	445d03bf00	[InstCombine] fold X == 0 \|\| X == -1 to one compare (PR32524) This is effectively a retry of: https://reviews.llvm.org/rL299851 but now we have tests and an assert to make sure the bug that was exposed with that attempt will not happen again. I'll fix the code duplication and missing sibling fold next, but I want to make this change as small as possible to reduce risk since I messed it up last time. This should fix: https://bugs.llvm.org/show_bug.cgi?id=32524 llvm-svn: 300236	2017-04-13 18:47:06 +00:00
Reid Kleckner	aea2a28098	[DAE] Simplify call site replacement code with CallSite NFC llvm-svn: 300235	2017-04-13 18:42:03 +00:00
Reid Kleckner	c3fae796fd	[InstCombine] Simplify attribute code with new AttributeList::get NFC llvm-svn: 300230	2017-04-13 18:11:03 +00:00
Reid Kleckner	3a1150352d	[ArgPromotion] Don't drop !prof metadata on promoted calls Noticed by inspection while doing attribute work. DAE, InstCombineCalls, and ArgPromotion have a fair amount of duplicated code for hacking on call sites, and you can find bugs by comparing them. Add a test case for this. llvm-svn: 300229	2017-04-13 18:10:30 +00:00
Sanjay Patel	9745d24a66	[InstCombine] use similar ops for related folds; NFCI It's less efficient to produce 'ule' than 'ult' since we know we're going to canonicalize to 'ult', but we shouldn't have duplicated code for these folds. As a trade-off, this was a pretty terrible way to make a '2'. :) if (LHSC == SubOne(RHSC)) AddC = ConstantExpr::getSub(AddOne(RHSC), LHSC); The next steps are to share the code to fix PR32524 and add the missing 'and' fold that was left out when PR14708 was fixed: https://bugs.llvm.org/show_bug.cgi?id=14708 llvm-svn: 300222	2017-04-13 17:36:24 +00:00
Sanjay Patel	a8ebb46e0e	[InstCombine] fix assert to not always be true llvm-svn: 300202	2017-04-13 16:05:01 +00:00
Geoff Berry	85a530fb59	Re-apply "[GVNHoist] Move GVNHoist to function simplification part of pipeline." This reverts commit r296872 now that PR32153 has been fixed. llvm-svn: 300200	2017-04-13 15:36:25 +00:00
Ayal Zaks	cd712b6c49	[LV] Refactor ILV to provide vectorizeInstruction(); NFC Refactoring InnerLoopVectorizer's vectorizeBlockInLoop() to provide vectorizeInstruction(). Aligning DeadInstructions with its only user. Facilitates driving the transformation by VPlan - follows https://reviews.llvm.org/D28975 and its tentative breakdown. Differential Revision: https://reviews.llvm.org/D31997 llvm-svn: 300183	2017-04-13 09:07:23 +00:00
Reid Kleckner	7f72033e1c	[IR] Take func, ret, and arg attrs separately in AttributeList::get This seems like a much more natural API, based on Derek Schuff's comments on r300015. It further hides the implementation detail of AttributeList that function attributes come last and appear at index ~0U, which is easy for the user to screw up. git diff says it saves code as well: 97 insertions(+), 137 deletions(-) This also makes it easier to change the implementation, which I want to do next. llvm-svn: 300153	2017-04-13 00:58:09 +00:00
Craig Topper	c75f94bfa5	[InstCombine] Teach SimplifyMultipleUseDemandedBits to handle And/Or/Xor known bits using the LHS/RHS known bits it already acquired without recursing back into computeKnownBits. This replicates the known bits and constant creation code from the single use case for these instructions and adds it here. The computeKnownBits and constant creation code for other instructions is now in the default case of the opcode switch. llvm-svn: 300094	2017-04-12 19:32:47 +00:00
Craig Topper	cf3641fd57	[InstCombine] Remove unreachable code for turning an And where all demanded bits on both sides are known to be zero into a constant 0. We already handled a superset check that included the known ones too and folded to a constant that may include ones. But it can also handle the case of no ones. llvm-svn: 300093	2017-04-12 19:08:03 +00:00
Sanjay Patel	6e41018942	[InstCombine] fix wrong undef handling when converting select to shuffle As discussed in: https://bugs.llvm.org/show_bug.cgi?id=32486 ...the canonicalization of vector select to shufflevector does not hold up when undef elements are present in the condition vector. Try to make the undef handling clear in the code and the LangRef. Differential Revision: https://reviews.llvm.org/D31980 llvm-svn: 300092	2017-04-12 18:39:53 +00:00
Craig Topper	f35a7f7b49	[InstCombine] In SimplifyMultipleUseDemandedBits, use a switch instead of cascaded ifs on opcode. NFC llvm-svn: 300085	2017-04-12 18:25:25 +00:00
Craig Topper	9a51c7f343	[InstCombine] Teach SimplifyDemandedInstructionBits that even if we reach an instruction that has multiple uses, if we know all the bits for the demanded bits for this context we can go ahead and create a constant. Currently if we reach an instruction with multiples uses we know we can't do any optimizations to that instruction itself since we only have the demanded bits for one of the users. But if we know all of the bits are zero/one for that one user we can still go ahead and create a constant to give to that user. This might then reduce the instruction to having a single use and allow additional optimizations on the other path. This picks up an additional case that r300075 didn't catch. Differential Revision: https://reviews.llvm.org/D31552 llvm-svn: 300084	2017-04-12 18:17:46 +00:00
Craig Topper	b0076fe8b4	[InstCombine] Move portion of SimplifyDemandedUseBits that deals with instructions with multiple uses out to a separate method. NFCI llvm-svn: 300082	2017-04-12 18:05:21 +00:00
Craig Topper	845033a6c9	Teach SimplifyDemandedUseBits that adding or subtractings 0s from every bit below the highest demanded bit can be simplified If we are adding/subtractings 0s below the highest demanded bit we can just use the other operand and remove the operation. My primary motivation is observing that we can call ShrinkDemandedConstant for the add/sub and create a 0 constant, rather than removing the add completely. In the case I saw, we modified the constant on an add instruction to a 0, but the add is not put into the worklist. So we didn't revisit it until the next InstCombine iteration. This caused an IR modification to remove add and a subsequent iteration to be ran. With this change we get bypass the add in the first iteration and prevent the second iteration from changing anything. Differential Revision: https://reviews.llvm.org/D31120 llvm-svn: 300075	2017-04-12 16:49:59 +00:00
Sanjay Patel	33439f982b	[InstCombine] morph an existing instruction instead of creating a new one One potential way to make InstCombine (very slightly?) faster is to recycle instructions when possible instead of creating new ones. It's not explicitly stated AFAIK, but we don't consider this an "InstSimplify". We could, however, make a new layer to house transforms like this if that makes InstCombine more manageable (just throwing out an idea; not sure how much opportunity is actually here). Differential Revision: https://reviews.llvm.org/D31863 llvm-svn: 300067	2017-04-12 15:11:33 +00:00
Jonas Paulsson	22776892c9	[SLPVectorizer] Pass the right type argument to getCmpSelInstrCost() In getEntryCost(), make the scalar type for a compare instruction that of the operands, not i1. This is needed in order to call getCmpSelInstrCost() for a compare in a sensible way, the same way as the LoopVectorizer does. New test: test/Transforms/SLPVectorizer/SystemZ/SLP-cmp-cost-query.ll Review: Matthew Simpson https://reviews.llvm.org/D31601 llvm-svn: 300061	2017-04-12 13:29:25 +00:00
Jonas Paulsson	592dbea779	[LoopVectorizer] Improve handling of branches during cost estimation. The cost for a branch after vectorization is very different depending on if the vectorizer will if-convert the block (branch is eliminated), or if scalarized and predicated blocks will be produced (branch duplicated before each block). There is also the case of remaining scalar branches, such as the back-edge branch. This patch handles these cases differently with TTI based cost estimates. Review: Matthew Simpson https://reviews.llvm.org/D31175 llvm-svn: 300058	2017-04-12 13:13:15 +00:00
Jonas Paulsson	da74ed42da	[LoopVectorizer, TTI] New method supportsEfficientVectorElementLoadStore() Since SystemZ supports vector element load/store instructions, there is no need for extracts/inserts if a vector load/store gets scalarized. This patch lets Target specify that it supports such instructions by means of a new TTI hook that defaults to false. The use for this is in the LoopVectorizer getScalarizationOverhead() method, which will with this patch produce a smaller sum for a vector load/store on SystemZ. New test: test/Transforms/LoopVectorize/SystemZ/load-store-scalarization-cost.ll Review: Adam Nemet https://reviews.llvm.org/D30680 llvm-svn: 300056	2017-04-12 12:41:37 +00:00
Jonas Paulsson	fccc7d66c3	[SystemZ] TargetTransformInfo cost functions implemented. getArithmeticInstrCost(), getShuffleCost(), getCastInstrCost(), getCmpSelInstrCost(), getVectorInstrCost(), getMemoryOpCost(), getInterleavedMemoryOpCost() implemented. Interleaved access vectorization enabled. BasicTTIImpl::getCastInstrCost() improved to check for legal extending loads, in which case the cost of the z/sext instruction becomes 0. Review: Ulrich Weigand, Renato Golin. https://reviews.llvm.org/D29631 llvm-svn: 300052	2017-04-12 11:49:08 +00:00
Bjorn Pettersson	4af0593ecc	[LoadCombine] Avoid analysing dead basic blocks Summary: Dead basic blocks may be forming a loop, for which SSA form is fulfilled, but with a circular def-use chain. LoadCombine could enter an infinite loop when analysing such dead code. This patch solves the problem by simply avoiding to analyse all basic blocks that aren't forward reachable, from function entry, in LoadCombine. Fixes https://bugs.llvm.org/show_bug.cgi?id=27065 Reviewers: mehdi_amini, chandlerc, grosser, Bigcheese, davide Reviewed By: davide Subscribers: dberlin, zzheng, bjope, grandinj, Ka-Ka, materi, jholewinski, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D31032 llvm-svn: 300034	2017-04-12 08:07:55 +00:00
Chandler Carruth	927d8e610a	[IR] Redesign the case iterator in SwitchInst to actually be an iterator and to expose a handle to represent the actual case rather than having the iterator return a reference to itself. All of this allows the iterator to be used with common STL facilities, standard algorithms, etc. Doing this exposed some missing facilities in the iterator facade that I've fixed and required some work to the actual iterator to fully support the necessary API. Differential Revision: https://reviews.llvm.org/D31548 llvm-svn: 300032	2017-04-12 07:27:28 +00:00
Craig Topper	b5194eeebf	[InstCombine][IR] Add a commutable BinOp matcher. Use it to reduce some code. NFC llvm-svn: 300030	2017-04-12 05:49:28 +00:00
Bob Haarman	4075ccc717	ThinLTOBitcodeWriter: keep comdats together, rename if leader is renamed Summary: COFF requires that every comdat contain a symbol with the same name as the comdat. ThinLTOBitcodeWriter renames symbols, which may cause this requirement to be violated. This change avoids such violations by renaming comdats if their leaders are renamed. It also keeps comdats together when splitting modules. Reviewers: pcc, mehdi_amini, tejohnson Reviewed By: pcc Subscribers: rnk, Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31963 llvm-svn: 300019	2017-04-12 01:43:07 +00:00
Reid Kleckner	c2cb560045	[IR] Add AttributeSet to hide AttributeSetNode* again, NFC Summary: For now, it just wraps AttributeSetNode*. Eventually, it will hold AvailableAttrs as an inline bitset, and adding and removing enum attributes will be super cheap. This sinks AttributeSetNode back down to lib/IR/AttributeImpl.h. Reviewers: pete, chandlerc Subscribers: llvm-commits, jfb Differential Revision: https://reviews.llvm.org/D31940 llvm-svn: 300014	2017-04-12 00:38:00 +00:00
Evgeniy Stepanov	90fd87303c	[asan] Give global metadata private linkage. Internal linkage preserves names like "__asan_global_foo" which may account to 2% of unstripped binary size. llvm-svn: 299995	2017-04-11 22:28:13 +00:00
Anna Thomas	00dc1b74b7	[LV] Avoid vectorizing first order recurrence when phi uses are outside loop In the vectorization of first order recurrence, we vectorize such that the last element in the vector will be the one extracted to pass into the scalar remainder loop. However, this is not true when there is a phi (other than the primary induction variable) is used outside the loop. In such a case, we need the value from the second last iteration (i.e. the phi value), not the last iteration (which would be the phi update). I've added a test case for this. Also see PR32396. A follow up patch would generate the correct code gen for such cases, and turn this vectorization on. Differential Revision: https://reviews.llvm.org/D31910 Reviewers: mssimpso llvm-svn: 299985	2017-04-11 21:02:00 +00:00
Daniel Berlin	554dcd8c89	MemorySSA: Move to Analysis, from Transforms/Utils. It's used as Analysis, it has Analysis passes, and once NewGVN is made an Analysis, this removes the cross dependency from Analysis to Transform/Utils. NFC. llvm-svn: 299980	2017-04-11 20:06:36 +00:00
Andrea Di Biagio	8e26936bfd	[AddDiscriminators] Assign discriminators to MemIntrinsic calls. Before this patch, pass AddDiscriminators always avoided to assign discriminators to intrinsic calls. This was done mainly for two reasons: 1) We wanted to minimize the number of based discriminators used. 2) We wanted to avoid non-deterministic discriminator assignment for different debug levels. Unfortunately, that approach was problematic for MemIntrinsic calls. MemIntrinsic calls can be split by SROA into loads and stores, and each new load/store instruction would obtain the debug location from the original intrinsic call. If we don't assign a discriminator to MemIntrinsic calls, then we cannot correctly set the discriminator for the newly created loads and stores. This may have a negative impact on the basic block weight computation performed by the SampleLoader. This patch fixes the issue by letting MemIntrinsic calls have a discriminator. Differential Revision: https://reviews.llvm.org/D31900 llvm-svn: 299972	2017-04-11 19:07:30 +00:00
Craig Topper	957a94cc03	Fix spelling compliment->complement. Mostly refering to 2s complement. NFC llvm-svn: 299970	2017-04-11 18:47:58 +00:00
Craig Topper	271b2245f4	[InstCombine] Use ConstantExpr::getBinOpIdentity to implement getIdentityValue. This removes a TODO in getIdentityValue and may allow some transforms to occur earlier. But I was unable to find any transforms we didn't already handle. llvm-svn: 299966	2017-04-11 17:42:40 +00:00
Sanjay Patel	28611acef9	revert r299851 - [InstCombine] fix matching of or-of-icmps constants (PR32524) This is a candidate culprit for multiple bot fails, so reverting pending investigation. llvm-svn: 299955	2017-04-11 15:57:32 +00:00
Serge Guelton	59a2d7b909	Module::getOrInsertFunction is using C-style vararg instead of variadic templates. From a user prospective, it forces the use of an annoying nullptr to mark the end of the vararg, and there's not type checking on the arguments. The variadic template is an obvious solution to both issues. Differential Revision: https://reviews.llvm.org/D31070 llvm-svn: 299949	2017-04-11 15:01:18 +00:00
Geoff Berry	9d597adde4	[GVNHoist] Re-enable GVNHoist by default Turn GVNHoist back on by default now that PR32153 has been fixed. llvm-svn: 299944	2017-04-11 14:36:30 +00:00
Keno Fischer	30779772cf	[StripDeadDebug/DIFinder] Track inlined SPs Summary: In rL299692 I improved strip-dead-debug-info's ability to drop CUs that are not referenced from the current module. However, in doing so I neglected to realize that some SPs could be referenced entirely from inlined functions. It appears I was not the only one to make this mistake, because DebugInfoFinder, doesn't find those SPs either. Fix this in DebugInfoFinder and then use that to make sure not to drop those CUs in strip-dead-debug-info. Reviewers: aprantl Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31904 llvm-svn: 299936	2017-04-11 13:32:11 +00:00
Diana Picus	b050c7fbe0	Revert "Turn some C-style vararg into variadic templates" This reverts commit r299925 because it broke the buildbots. See e.g. http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15/builds/6008 llvm-svn: 299928	2017-04-11 10:07:12 +00:00
Serge Guelton	5fd75fb72e	Turn some C-style vararg into variadic templates Module::getOrInsertFunction is using C-style vararg instead of variadic templates. From a user prospective, it forces the use of an annoying nullptr to mark the end of the vararg, and there's not type checking on the arguments. The variadic template is an obvious solution to both issues. llvm-svn: 299925	2017-04-11 08:36:52 +00:00
Sylvestre Ledru	06faa9bf32	Simplify the code and remove dead code Summary: Fix coverity cid 1374240 Reviewers: dberlin Reviewed By: dberlin Differential Revision: https://reviews.llvm.org/D31928 llvm-svn: 299924	2017-04-11 08:21:27 +00:00
Craig Topper	8c75adf95b	[InstCombine] Refinement of r299915. Only consider a ConstantVector for Neg if all the elements are Undef or ConstantInt. llvm-svn: 299917	2017-04-11 06:32:48 +00:00
Craig Topper	18f9e424e7	[InstCombine] Support weird size element types in dyn_castNegVal. llvm-svn: 299915	2017-04-11 05:42:47 +00:00
Hal Finkel	b63ed91549	[LICM] Hoist fp division from the loops and replace by a reciprocal When allowed, we can hoist a division out of a loop in favor of a multiplication by the reciprocal. Fixes PR32157. Patch by vit9696! Differential Revision: https://reviews.llvm.org/D30819 llvm-svn: 299911	2017-04-11 02:22:54 +00:00
Daniel Berlin	bf80cfe6b6	Revert "NewGVN: Don't propagate over phi backedges where undef causes us to have >1 value." It's not ready yet this was an accidental commit :( This reverts r299903 llvm-svn: 299904	2017-04-11 00:07:26 +00:00
Daniel Berlin	3938111fe7	NewGVN: Don't propagate over phi backedges where undef causes us to have >1 value. Fixes PR 32607. llvm-svn: 299903	2017-04-11 00:02:38 +00:00
Reid Kleckner	eb9dd5b87f	Reland "[IR] Make AttributeSetNode public, avoid temporary AttributeList copies" This re-lands r299875. I introduced a bug in Clang code responsible for replacing K&R, no prototype declarations with a real function definition with a prototype. The bug was here: // Collect any return attributes from the call. - if (oldAttrs.hasAttributes(llvm::AttributeList::ReturnIndex)) - newAttrs.push_back(llvm::AttributeList::get(newFn->getContext(), - oldAttrs.getRetAttributes())); + newAttrs.push_back(oldAttrs.getRetAttributes()); Previously getRetAttributes() carried AttributeList::ReturnIndex in its AttributeList. Now that we return the AttributeSetNode* directly, it no longer carries that index, and we call this overload with a single node: AttributeList::get(LLVMContext&, ArrayRef<AttributeSetNode*>) That aborted with an assertion on x86_32 targets. I added an explicit triple to the test and added CHECKs to help find issues like this in the future sooner. llvm-svn: 299899	2017-04-10 23:31:05 +00:00
Davide Italiano	f58a30236b	[NewGVN] Surround with parens to clarify allegedly ambiguous precedence. This Placates GCC7 with -Werror. Also, clang-format the assertions while I'm here. llvm-svn: 299895	2017-04-10 23:08:35 +00:00
Davide Italiano	fa6a0a819d	[MemorySSA] We don't need to compute dominator levels anymore. Differential Revision: https://reviews.llvm.org/D31818 llvm-svn: 299893	2017-04-10 22:44:46 +00:00
Matt Arsenault	3c1fc768ed	Allow DataLayout to specify addrspace for allocas. LLVM makes several assumptions about address space 0. However, alloca is presently constrained to always return this address space. There's no real way to avoid using alloca, so without this there is no way to opt out of these assumptions. The problematic assumptions include: - That the pointer size used for the stack is the same size as the code size pointer, which is also the maximum sized pointer. - That 0 is an invalid, non-dereferencable pointer value. These are problems for AMDGPU because alloca is used to implement the private address space, which uses a 32-bit index as the pointer value. Other pointers are 64-bit and behave more like LLVM's notion of generic address space. By changing the address space used for allocas, we can change our generic pointer type to be LLVM's generic pointer type which does have similar properties. llvm-svn: 299888	2017-04-10 22:27:50 +00:00
Dehao Chen	d4a3397861	Emit less compiler optimization remarks in samplepgo to reduce a call to findCalleeFunctionSamples which is going to be refactored. Summary: Now the SamplePGO support is more stable, we do not need so many verbose optimization remarks emitted. Reviewers: dnovillo, davidxl Reviewed By: davidxl Subscribers: fhahn, llvm-commits Differential Revision: https://reviews.llvm.org/D31826 llvm-svn: 299883	2017-04-10 20:49:16 +00:00
Geoff Berry	635e505675	[GVNHoist] Call isGuaranteedToTransferExecutionToSuccessor on each instruction w.r.t. https://bugs.llvm.org/show_bug.cgi?id=32153 The consensus seems to be isGuaranteedToTransferExecutionToSuccessor should be called for each function. Patch by Aditya Kumar Differential Revision: https://reviews.llvm.org/D31035 llvm-svn: 299882	2017-04-10 20:45:17 +00:00
Evgeniy Stepanov	ed7fce7c84	Revert "[asan] Put ctor/dtor in comdat." This reverts commit r299696, which is causing mysterious test failures. llvm-svn: 299880	2017-04-10 20:36:36 +00:00
Evgeniy Stepanov	ba7c2e9661	Revert "[asan] Fix dead stripping of globals on Linux." This reverts commit r299697, which caused a big increase in object file size. llvm-svn: 299879	2017-04-10 20:36:30 +00:00
Reid Kleckner	211b1f324f	Revert "[IR] Make AttributeSetNode public, avoid temporary AttributeList copies" This reverts r299875. A Linux bot came back with a test failure: http://bb.pgr.jp/builders/test-clang-i686-linux-RA/builds/741/steps/test_clang/logs/Clang%20%3A%3A%20CodeGen__2006-05-19-SingleEltReturn.c llvm-svn: 299878	2017-04-10 20:34:19 +00:00
Reid Kleckner	324c99dee5	[IR] Make AttributeSetNode public, avoid temporary AttributeList copies Summary: AttributeList::get(Fn\|Ret\|Param)Attributes no longer creates a temporary AttributeList just to hide the AttributeSetNode type. I've also added a factory method to create AttributeLists from a parallel array of AttributeSetNodes. I think this simplifies construction of AttributeLists when rewriting function prototypes. Previously we would test if a particular index had attributes, and conditionally add a temporary attribute list to a vector. Now the attribute set vector is parallel to the argument vector already that these passes already construct. My long term vision is to wrap AttributeSetNode* inside an AttributeSet type that holds the enum attributes, but that will come in a follow up change. I haven't done any performance measurements for this change because profiling hasn't shown that any of the affected code is hot. Reviewers: pete, chandlerc, sanjoy, hfinkel Reviewed By: pete Subscribers: jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D31198 llvm-svn: 299875	2017-04-10 20:18:10 +00:00
Sanjay Patel	e4159d2238	[InstCombine] improve variable names; NFCI llvm-svn: 299871	2017-04-10 19:38:36 +00:00
Matt Arsenault	daa08875b3	[MemCpyOpt] Only replace memcpy with bitcast if address spaces match Patch by James Price llvm-svn: 299866	2017-04-10 19:00:25 +00:00
Daniel Berlin	74603a68ef	MemorySSA: Make lifetime starts defs for mustaliased pointers Summary: While we don't want them aliasing with other pointers, there seems to be no point in not having them clobber must-aliased'd pointers. If some day, we split the aliasing and ordering chains, we'd make this not aliasing but an ordering barrier (IE it doesn't affect it's memory, but we can't hoist it above it). Reviewers: hfinkel, george.burgess.iv Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31865 llvm-svn: 299865	2017-04-10 18:46:00 +00:00
Craig Topper	0d830ff7bf	[InstCombine] Use commutable matchers and m_OneUse in visitSub to shorten code. Add missing test cases. In one case I removed commute handling for a multiply with a constant since we'll eventually get the constant on the right hand side. llvm-svn: 299863	2017-04-10 18:09:25 +00:00
Craig Topper	98851adc2a	[InstCombine] Use m_c_Add to shorten some code. Add testcases for this fold since they were missing. NFC llvm-svn: 299853	2017-04-10 16:59:40 +00:00
Sanjay Patel	570e35c157	[InstCombine] fix matching of or-of-icmps constants (PR32524) Also, make the same change in and-of-icmps and remove a hack for detecting that case. Finally, add some FIXME comments because the code duplication here is awful. This should fix the remaining IR problem noted in: https://bugs.llvm.org/show_bug.cgi?id=32524 llvm-svn: 299851	2017-04-10 16:55:57 +00:00
Craig Topper	3eec73e20b	[InstCombine] Support folding of add instructions with vector constants into select operations We currently only fold scalar add of constants into selects. This improves this to support vectors too. Differential Revision: https://reviews.llvm.org/D31683 llvm-svn: 299847	2017-04-10 16:40:00 +00:00
Craig Topper	31cc143b51	[InstCombine] Use commutable and/or/xor matchers to simplify some code Summary: This is my first time using the commutable matchers so wanted to make sure I was doing it right. Are there any other matcher tricks to further shrink this? Can we commute the whole match so we don't have to LHS and RHS separately? Reviewers: davide, spatel Reviewed By: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31680 llvm-svn: 299840	2017-04-10 07:13:40 +00:00
Craig Topper	838d13e7ee	[InstCombine] Make sure we preserve fast math flags when folding fp instructions into phi nodes Summary: I noticed in the select folding code that we copied fast math flags, but did not do the same for the similar handling in phi nodes. This patch fixes that to do the same thing as select Reviewers: spatel, davide, majnemer, hfinkel Reviewed By: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31690 llvm-svn: 299838	2017-04-10 07:00:10 +00:00
Craig Topper	d8840d7b10	[InstCombine] use m_c_And and m_c_Xor to handle commuted versions of a transform. llvm-svn: 299837	2017-04-10 06:53:28 +00:00
Craig Topper	7639460367	[InstCombine] Remove unnecessary dyn_cast to BinaryOperator around some matcher checks in visitXor. The matchers themselves should be enough. llvm-svn: 299835	2017-04-10 06:53:23 +00:00
Craig Topper	4738321f0c	[InstCombine] Make the (A\|B)^B -> A & ~B transform code consistent with the very similar (A&B)^B -> ~A & B code. This should be NFC except for the addition of hasOneUse check. I think this code is still overly complicated and should use matchers, but first I wanted to make it consistent. llvm-svn: 299834	2017-04-10 06:53:21 +00:00
Craig Topper	4f16d82d6b	[InstCombine] Use m_OneUse to shorten some code. NFC llvm-svn: 299833	2017-04-10 06:53:19 +00:00
Xin Tong	34888c08bc	[SCCP] Resolve indirect branch target when possible. Summary: Resolve indirect branch target when possible. This potentially eliminates more basicblocks and result in better evaluation for phi and other things. Reviewers: davide, efriedma, sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30322 llvm-svn: 299830	2017-04-10 00:33:25 +00:00
Sanjay Patel	16a054d5c7	[InstCombine] remove dead cases from icmp pair switches; NFCI "PredicatesFoldable" returns false for signed/unsigned mismatched pairs, so these cases should never exist. We'll default to 'unreachable' on those predicate combos instead. Most of what's left in these switches belongs in InstSimplify (and may already be there), so there's probably more that can be done to reduce this code. llvm-svn: 299829	2017-04-09 21:51:34 +00:00
Davide Italiano	612d5a9c5c	[Mem2Reg] Remove AliasSetTracker updating logic from the pass. No caller has been passing it for a long time. llvm-svn: 299827	2017-04-09 20:47:14 +00:00
Hal Finkel	a9d67cf601	[MemorySSA] Fix use of pointsToConstantMemory in isUseTriviallyOptimizableToLiveOnEntry In isUseTriviallyOptimizableToLiveOnEntry, pointsToConstantMemory needs to be called on the load's pointer operand, not on the result of the load (which might not even be a pointer). llvm-svn: 299823	2017-04-09 12:57:50 +00:00
Craig Topper	afa07c5ef6	[InstCombine] Extend some OR combines to support vectors. This adds support for these combines for vectors (X^C)\|Y -> (X\|Y)^C iff Y&C == 0 Y\|(X^C) -> (X\|Y)^C iff Y&C == 0 llvm-svn: 299822	2017-04-09 06:12:41 +00:00
Craig Topper	e63c21b1ba	[InstCombine] Extend a canonicalization check to apply to vector constants too. llvm-svn: 299821	2017-04-09 06:12:39 +00:00
Craig Topper	437c97622b	[InstCombine] Use the SubOne helper function to shorten some code. NFC llvm-svn: 299819	2017-04-09 06:12:34 +00:00
Craig Topper	9d1821b262	[InstCombine] rename variable for easier reading; NFC We usually give constants a 'C' somewhere in the name... llvm-svn: 299818	2017-04-09 06:12:31 +00:00
Gor Nishanov	bfb2a9db31	[coroutines] Make CoroSplit pass deterministic coro-split-after-phi.ll test was flaky due to non-determinism in the coroutine frame construction that was sorting the spill vector using a pointer to a def as a part of the key. The sorting was intended to make sure that spills for the same def are kept together, however, we populate the vector by processing defs in order, so the spill entires will end up together anyways. This change removes spill sorting and restores the determinism in the test. llvm-svn: 299809	2017-04-08 00:49:46 +00:00
Evgeniy Stepanov	349adbacca	[cfi] Take over existing __cfi_check in CrossDSOCFI. https://reviews.llvm.org/D31796 will emit a dummy __cfi_check in the frontend. llvm-svn: 299805	2017-04-07 23:00:20 +00:00
Daniel Berlin	a823656ce7	NewGVN: Make CongruenceClass a real class in preparation for splitting NewGVN into analysis and eliminator. llvm-svn: 299792	2017-04-07 18:38:09 +00:00
Gor Nishanov	138ad6c9c0	[coroutines] Insert spills of PHI instructions correctly Summary: Fix a bug where we were inserting a spill in between the PHIs in the beginning of the block. Consider this fragment: ``` begin: %phi1 = phi i32 [ 0, %entry ], [ 2, %alt ] %phi2 = phi i32 [ 1, %entry ], [ 3, %alt ] %sp1 = call i8 @llvm.coro.suspend(token none, i1 false) switch i8 %sp1, label %suspend [i8 0, label %resume i8 1, label %cleanup] resume: call i32 @print(i32 %phi1) ``` Unless we are spilling the argument or result of the invoke, we were always inserting the spill immediately following the instruction. The fix adds a check that if the spilled instruction is a PHI Node, select an appropriate insert point with `getFirstInsertionPt()` that skips all the PHI Nodes and EH pads. Reviewers: majnemer, rnk Reviewed By: rnk Subscribers: qcolombet, EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D31799 llvm-svn: 299771	2017-04-07 14:16:49 +00:00
Matthew Simpson	11fe2e9f2b	Reapply r298620: [LV] Vectorize GEPs This patch reapplies r298620. The original patch was reverted because of two issues. First, the patch exposed a bug in InstCombine that caused the Chromium builds to fail (PR32414). This issue was fixed in r299017. Second, the patch introduced a bug in the vectorizer's scalars analysis that caused test suite builds to fail on SystemZ. The scalars analysis was too aggressive and marked a memory instruction scalar, even though it was going to be vectorized. This issue has been fixed in the current patch and several new test cases for the scalars analysis have been added. llvm-svn: 299770	2017-04-07 14:15:34 +00:00
Craig Topper	33e0dbcc58	[InstCombine] Handle more commuted cases of ((A & B) \| ~A) -> (~A \| B) llvm-svn: 299747	2017-04-07 07:32:00 +00:00
Daniel Berlin	d952ceae2f	AliasAnalysis: Be less conservative about volatile than atomic. Summary: getModRefInfo is meant to answer the question "what impact does this instruction have on a given memory location" (not even another instruction). Long debate on this on IRC comes to the conclusion the answer should be "nothing special". That is, a noalias volatile store does not affect a memory location just by being volatile. Note: DSE and GVN and memdep currently believe this, because memdep just goes behind AA's back after it says "modref" right now. see line 635 of memdep. Prior to this patch we would get modref there, then check aliasing, and if it said noalias, we would continue. getModRefInfo already has this same AA check, it just wasn't being used because volatile was lumped in with ordering. (I am separately testing whether this code in memdep is now dead except for the invariant load case) Reviewers: jyknight, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31726 llvm-svn: 299741	2017-04-07 01:28:36 +00:00
Craig Topper	72a622cac7	[InstCombine] Add more commuted patterns to support folding ((~A & B) \| A) -> (A \| B). llvm-svn: 299737	2017-04-07 00:29:47 +00:00
Craig Topper	a521c30dc6	[InstCombine] Remove testing assert I accidentally left in r299710. llvm-svn: 299715	2017-04-06 21:29:43 +00:00
Craig Topper	b4da6840d8	[InstCombine] When checking to see if we can turn subtracts of 2^n - 1 into xor, we only need to call computeKnownBits on the RHS not the whole subtract. While there use isMask instead of isPowerOf2(C+1) Calling computeKnownBits on the RHS should allows us to recurse one step further. isMask is equivalent to the isPowerOf2(C+1) except in the case where C is all ones. But that was already handled earlier by creating a not which is an Xor with all ones. So this should be fine. llvm-svn: 299710	2017-04-06 21:06:03 +00:00
Rong Xu	2bf4c59025	[PGO] Preserve GlobalsAA in pgo-memop-opt pass. Preserve GlobalsAA analysis in memory intrinsic calls optimization based on profiled size. llvm-svn: 299707	2017-04-06 20:56:00 +00:00
Craig Topper	7226d796aa	[InstCombine] Remove redundant combine from visitAnd This combine is fully handled by SimplifyDemandedInstructionBits as of r299658 where I fixed this code to ensure the Add/Sub had only a single user. Otherwise it would fire and create additional instructions. That fix resulted in an improvement to code generated for tsan which is why I committed it before deleting. Differential Revision: https://reviews.llvm.org/D31543 llvm-svn: 299704	2017-04-06 20:41:48 +00:00
Mehdi Amini	db11fdfda5	Revert "Turn some C-style vararg into variadic templates" This reverts commit r299699, the examples needs to be updated. llvm-svn: 299702	2017-04-06 20:23:57 +00:00
Mehdi Amini	579540a8f7	Turn some C-style vararg into variadic templates Module::getOrInsertFunction is using C-style vararg instead of variadic templates. From a user prospective, it forces the use of an annoying nullptr to mark the end of the vararg, and there's not type checking on the arguments. The variadic template is an obvious solution to both issues. Patch by: Serge Guelton <serge.guelton@telecom-bretagne.eu> Differential Revision: https://reviews.llvm.org/D31070 llvm-svn: 299699	2017-04-06 20:09:31 +00:00
Evgeniy Stepanov	6c3a8cbc4d	[asan] Fix dead stripping of globals on Linux. Use a combination of !associated, comdat, @llvm.compiler.used and custom sections to allow dead stripping of globals and their asan metadata. Sometimes. Currently this works on LLD, which supports SHF_LINK_ORDER with sh_link pointing to the associated section. This also works on BFD, which seems to treat comdats as all-or-nothing with respect to linker GC. There is a weird quirk where the "first" global in each link is never GC-ed because of the section symbols. At this moment it does not work on Gold (as in the globals are never stripped). This is a re-land of r298158 rebased on D31358. This time, asan.module_ctor is put in a comdat as well to avoid quadratic behavior in Gold. llvm-svn: 299697	2017-04-06 19:55:17 +00:00
Evgeniy Stepanov	5dfe420d10	[asan] Put ctor/dtor in comdat. When possible, put ASan ctor/dtor in comdat. The only reason not to is global registration, which can be TU-specific. This is not the case when there are no instrumented globals. This is also limited to ELF targets, because MachO does not have comdat, and COFF linkers may GC comdat constructors. The benefit of this is a lot less __asan_init() calls: one per DSO instead of one per TU. It's also necessary for the upcoming gc-sections-for-globals change on Linux, where multiple references to section start symbols trigger quadratic behaviour in gold linker. This is a rebase of r298756. llvm-svn: 299696	2017-04-06 19:55:13 +00:00
Evgeniy Stepanov	039af609f1	[asan] Delay creation of asan ctor. Create the constructor in the module pass. This in needed for the GC-friendly globals change, where the constructor can be put in a comdat in some cases, but we don't know about that in the function pass. This is a rebase of r298731 which was reverted due to a false alarm. llvm-svn: 299695	2017-04-06 19:55:09 +00:00
Keno Fischer	bacc64b5fa	[StripDeadDebugInfo] Drop dead CUs entirely Summary: Prior to this while it would delete the dead DIGlobalVariables, it would leave dead DICompileUnits and everything referenced therefrom. For a bit bitcode file with thousands of compile units those dead nodes easily outnumbered the real ones. Clean that up. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D31720 llvm-svn: 299692	2017-04-06 19:26:22 +00:00
Daniel Berlin	21279bd37a	NewGVN: Rename some functions for consistency llvm-svn: 299685	2017-04-06 18:52:58 +00:00
Daniel Berlin	08fe6e0f74	NewGVN: Fixup some small issues llvm-svn: 299684	2017-04-06 18:52:55 +00:00
Daniel Berlin	5845e0549e	NewGVN: Fix a small formatting issue in performSymbolicLoadEvaluation. llvm-svn: 299683	2017-04-06 18:52:53 +00:00
Daniel Berlin	1316a94ebc	NewGVN: This patch makes memory congruence work for all types of memorydefs, not just stores. Along the way, we audit and fixup issues about how we were tracking memory leaders, and improve the verifier to notice more memory congruency issues. llvm-svn: 299682	2017-04-06 18:52:50 +00:00
Craig Topper	3fc1225c18	[InstCombine] Fix a case where we weren't checking that an instruction had a single use resulting in extra instructions being created. llvm-svn: 299658	2017-04-06 16:42:46 +00:00
Daniel Berlin	d7a7ae061f	MemorySSA: Remove MemorySSA walker caching. Summary: Remove all the caching the clobber walker does, and that the caching walker does. With the patch to enable storing clobbering access results for stores, i can find no improvement with the cache turned on (and a number of degradations, both time and memory, from the cost of caching. For a large program i have, we do millions of lookups and inserts with zero hits). I haven't tried to rename or simplify the walker otherwise yet. (Appreciate some perf testing on this past my own testing) Reviewers: george.burgess.iv, davide Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31576 llvm-svn: 299578	2017-04-05 19:01:58 +00:00
Sanjay Patel	50c82c4395	[InstCombine] add fold for icmp with or mask of low bits (PR32542) We already have these 'and' folds: // X & -C == -C -> X > u ~C // X & -C != -C -> X <= u ~C // iff C is a power of 2 ...but we were missing the 'or' siblings. http://rise4fun.com/Alive/n6 This should improve: https://bugs.llvm.org/show_bug.cgi?id=32524 ...but there are 2 or more other pieces to fix still. Differential Revision: https://reviews.llvm.org/D31712 llvm-svn: 299570	2017-04-05 17:57:05 +00:00
Sanjay Patel	519a87a468	[InstCombine] fix formatting and variable names; NFCI There must be some opportunity to refactor big chunks of nearly duplicated code in FoldOrOfICmps / FoldAndOfICmps. Also, none of this works with vectors, but it should. llvm-svn: 299568	2017-04-05 17:38:34 +00:00
Daniel Berlin	3082b8e062	MemorySSA: Fix and use optimized_def_chain llvm-svn: 299566	2017-04-05 17:26:25 +00:00
Akira Hatanaka	75be84f3c2	[ObjCArc] Do not dereference an invalidated iterator. Fix a bug in ARC contract pass where an iterator that pointed to a deleted instruction was dereferenced. It appears that tryToContractReleaseIntoStoreStrong was incorrectly assuming that a call to objc_retain would not immediately follow a call to objc_release. rdar://problem/25276306 llvm-svn: 299507	2017-04-05 03:44:09 +00:00
Bob Haarman	6de8134784	ThinLTOBitcodeWriter: handle aliases first in filterModule Summary: This change fixes a "local linkage requires default visibility" assert when attempting to build LLVM with ThinLTO on Windows. Reviewers: pcc, tejohnson, mehdi_amini Reviewed By: pcc Subscribers: llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D31632 llvm-svn: 299491	2017-04-05 00:42:07 +00:00
Daniel Berlin	e33bc31df4	Re-apply MemorySSA: Add support for caching clobbering access in stores with some fixes. Summary: This enables us to cache the clobbering access for stores, despite the fact that we can't rewrite the use-def chains themselves. Early testing shows that, after this change, for larger testcases, it will be a significant net positive (memory and time) to remove the walker caching. Reviewers: george.burgess.iv, davide Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31567 llvm-svn: 299486	2017-04-04 23:43:10 +00:00
Daniel Berlin	f49d4c45a1	Revert "MemorySSA: Add support for caching clobbering access in stores" This reverts revision r299322. llvm-svn: 299485	2017-04-04 23:43:04 +00:00
Sanjay Patel	0bf0abedf6	[InstCombine] rename variable for easier reading; NFC We usually give constants a 'C' somewhere in the name... llvm-svn: 299474	2017-04-04 22:06:03 +00:00
Craig Topper	c745b6a1f6	[InstCombine] Turn subtract of vectors of i1 into xor like we do for scalar i1. Matches what we already do for add. llvm-svn: 299472	2017-04-04 21:44:56 +00:00
Craig Topper	86173600ec	[InstCombine] Support folding and/or/xor with a constant vector RHS into selects and phis Currently we only fold with ConstantInt RHS. This generalizes to any Constant RHS. Differential Revision: https://reviews.llvm.org/D31610 llvm-svn: 299466	2017-04-04 20:26:25 +00:00
Rong Xu	48596b6f7a	[PGO] Memory intrinsic calls optimization based on profiled size This patch optimizes two memory intrinsic operations: memset and memcpy based on the profiled size of the operation. The high level transformation is like: mem_op(..., size) ==> switch (size) { case s1: mem_op(..., s1); goto merge_bb; case s2: mem_op(..., s2); goto merge_bb; ... default: mem_op(..., size); goto merge_bb; } merge_bb: Differential Revision: http://reviews.llvm.org/D28966 llvm-svn: 299446	2017-04-04 16:42:20 +00:00
Craig Topper	e06b6bcfa1	[InstCombine] Use setAllBits in place of getAllOnesValue since we know the bitwidths are the same. NFCI llvm-svn: 299413	2017-04-04 05:03:02 +00:00
Zvi Rackover	82bf48d8b9	InstCombine: Use the InstSimplify hook for shufflevector Summary: Start using the recently added InstSimplify hook for shuffles in the respective InstCombine visitor. Reviewers: spatel, RKSimon, craig.topper, majnemer Reviewed By: majnemer Subscribers: majnemer, llvm-commits Differential Revision: https://reviews.llvm.org/D31526 llvm-svn: 299412	2017-04-04 04:47:57 +00:00
Craig Topper	1604f0773b	[InstCombine] Remove canonicalization for (X & C1) \| C2 --> (X \| C2) & (C1\|C2) when C1 & C2 have common bits. It turns out that SimplifyDemandedInstructionBits will get called earlier and remove bits from C1 first. Effectively doing (X & (C1&C2)) \| C2. So by the time it got to this check there could be no common bits. I think the DAGCombiner has the same check but its check can be executed because it handles demanded bits later. I'll look at it next. llvm-svn: 299384	2017-04-03 20:41:47 +00:00
Craig Topper	3882613956	[DAGCombine][InstCombine] Fix inverted if condition in equivalent comments in DAGCombine and InstCombine. NFC llvm-svn: 299378	2017-04-03 19:18:48 +00:00
Craig Topper	79120e80b8	Revert r299337 "[InstCombine] Remove redundant combine from visitAnd" One of the tsan bots started failing at this commit. I don't see anything obviously wrong with the commit so trying this to see if it recovers. Failing log: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-autoconf/builds/6792 llvm-svn: 299366	2017-04-03 17:22:23 +00:00
Sanjay Patel	77bf622db6	[InstCombine] fix formatting for foldLogOpOfMaskedICmps and related bits; NFCI 1. Improve enum, function, and variable names. 2. Improve comments. 3. Fix variable capitalization. 4. Run clang-format. As an existing code comment suggests, this should work with vector types / splat constants too, so making this look right first will reduce the diffs needed for that change. llvm-svn: 299365	2017-04-03 16:53:12 +00:00
Craig Topper	d33ee1b960	[APInt] Move isMask and isShiftedMask out of APIntOps and into the APInt class. Implement them without memory allocation for multiword This moves the isMask and isShiftedMask functions to be class methods. They now use the MathExtras.h function for single word size and leading/trailing zeros/ones or countPopulation for the multiword size. The previous implementation made multiple temorary memory allocations to do the bitwise arithmetic operations to match the MathExtras.h implementation. Differential Revision: https://reviews.llvm.org/D31565 llvm-svn: 299362	2017-04-03 16:34:59 +00:00
Craig Topper	d0b053d229	[InstCombine] Make foldOpWithConstantIntoOperand take a BinaryOperator instead of a generic Instruction. It blindly assumes there are two operands so make it explicit. llvm-svn: 299351	2017-04-03 07:08:08 +00:00
Craig Topper	07944f891c	[InstCombine] Remove a And transform that should be handled by SimplifyDemandedInstructionBits. NFCI llvm-svn: 299349	2017-04-03 06:02:09 +00:00
Craig Topper	70e4f434ae	[InstCombine] Make InstCombiner::OptAndOp take a BinaryOperator instead of an Instruction. The callers have already performed the necessary cast before calling. This allows us to remove a comment that says the instruction must be a BinaryOperator and make it explicit in the argument type. Had to add a default case to the switch because BinaryOperator::getOpcode() returns a BinaryOps enum. llvm-svn: 299339	2017-04-02 17:57:30 +00:00
Craig Topper	d133591a7e	[InstCombine] Remove redundant combine from visitAnd As far as I can tell this combine is fully handled by SimplifyDemandedInstructionBits. I was only looking at this because it is the only user of APIntOps::isShiftedMask which is itself broken. As demonstrated by r299187. I was going to fix isShiftedMask and needed to make sure we had coverage for the new cases it would expose to this combine. But looks like we can nuke it instead. Differential Revision: https://reviews.llvm.org/D31543 llvm-svn: 299337	2017-04-02 17:34:30 +00:00
Daniel Berlin	07daac8a36	NewGVN: Handle coercion of constant stores, loads, memory insts. Summary: Depends on D30928. This adds support for coercion of stores and memory instructions that do not require insertion to process. Another few tests down. I added the relevant tests from rle.ll Reviewers: davide Subscribers: llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D30929 llvm-svn: 299330	2017-04-02 13:23:44 +00:00
Nikolai Bozhenov	fca527af5c	[BypassSlowDivision] Do not bypass division of hash-like values Disable bypassing if one of the operands looks like a hash value. Slow division often occurs in hashtable implementations and fast division is never taken there because a hash value is extremely unlikely to have enough upper bits set to zero. A value is considered to be hash-like if it is produced by 1) XOR operation 2) Multiplication by a constant wider than the shorter type 3) PHI node with all incoming values being hash-like Differential Revision: https://reviews.llvm.org/D28200 llvm-svn: 299329	2017-04-02 13:14:30 +00:00
Daniel Berlin	8a00270838	MemorySSA: Add support for caching clobbering access in stores Summary: This enables us to cache the clobbering access for stores, despite the fact that we can't rewrite the use-def chains themselves. Early testing shows that, after this change, for larger testcases, it will be a significant net positive (memory and time) to remove the walker caching. Reviewers: george.burgess.iv, davide Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31567 llvm-svn: 299322	2017-04-02 05:09:15 +00:00
Daniel Berlin	9a9c9ff260	NewGVN: Don't try to kill off the stored value of stores when processing the congruence class of the store. Because we use the stored value of a store as the def, it isn't dead just because it appears as a def when it comes from a store. Note: I have not hit any cases with the memory code as it is where this breaks anything, just because of what memory congruences we actually allow. In a followup that improves memory congruence, this bug actually breaks real stuff (but the verifier catches it). llvm-svn: 299300	2017-04-01 09:44:33 +00:00
Daniel Berlin	9b4984926c	NewGVN: Clean up GVNExpression memory hierarchy, restructure hash computation a bit so we don't have to redefine it for loads, stores, and calls llvm-svn: 299299	2017-04-01 09:44:29 +00:00
Daniel Berlin	871ecd90ca	NewGVN: Use def_chain iterator in singleReachablePhiPath instead of recursion llvm-svn: 299298	2017-04-01 09:44:24 +00:00
Daniel Berlin	07275c3065	Move def_chain iterator to MemorySSA.h so it can be reused llvm-svn: 299297	2017-04-01 09:44:19 +00:00
Daniel Berlin	d042031f0f	MemorySSA: Push const correctness further. llvm-svn: 299295	2017-04-01 09:01:12 +00:00
Daniel Berlin	7500c5641e	MemorySSA: Kill the WalkTargetCache now that we have getBlockDefs. llvm-svn: 299294	2017-04-01 08:59:45 +00:00
Craig Topper	47fd2de304	[APInt] Fix bugs in isShiftedMask to match behavior of the similar function in MathExtras.h This removes a parameter from the routine that was responsible for a lot of the issue. It was a bit count that had to be set to the BitWidth of the APInt and would get passed to getLowBitsSet. This guaranteed the call to getLowBitsSet would create an all ones value. This was then compared to (V \| (V-1)). So the only shifted masks we detected had to have the MSB set. The one in tree user is a transform in InstCombine that never fires due to earlier transforms covering the case better. I've submitted a patch to remove it completely, but for now I've just adapted it to the new interface for isShiftedMask. llvm-svn: 299273	2017-03-31 22:23:42 +00:00
Craig Topper	e625d74271	[InstCombine] When adding an Instruction and its Users to the worklist at the same time, make sure we put the Users in first. Then put in the instruction. This way we ensure we immediately revisit the instruction and do any additional optimizations before visiting the users. Otherwise we might visit the users, then the instruction, then users again, then instruction again. llvm-svn: 299267	2017-03-31 21:35:30 +00:00
Craig Topper	885fa12e8a	[APInt] Remove shift functions from APIntOps namespace. Replace the few users with the APInt class methods. NFCI llvm-svn: 299248	2017-03-31 20:01:16 +00:00
Joerg Sonnenberger	28bed106e0	Do not translate rint into nearbyint, but truncate it like nearbyint. A common way to implement nearbyint is by fiddling with the floating point environment and calling rint. This is used at least by the BSD libm and musl. As such, canonicalizing the latter to the former will create infinite loops for libm and generally pessimize performance, at least when the generic C versions are used. This change preserves the rint in the libcall translation and also handles the domain truncation logic, so that rint with float argument will be reduced to rintf etc. llvm-svn: 299247	2017-03-31 19:58:07 +00:00
Dehao Chen	fed890ea3a	Fix the InstCombine to reserve the VP metadata and sets correct call count. Summary: Currently the VP metadata was dropped when InstCombine converts a call to direct call. This patch converts the VP metadata to branch_weights so that its hotness is recorded. Reviewers: eraman, davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31344 llvm-svn: 299228	2017-03-31 15:59:52 +00:00
Mikael Holmen	79235bd4d8	[Scalarizer] Handle scalar arguments in vector GEP Summary: Triggered by commit r298620: "[LV] Vectorize GEPs". If we encounter a vector GEP with scalar arguments, we splat the scalar into a vector of appropriate size before we scatter the argument. Reviewers: arsenm, mehdi_amini, bkramer Reviewed By: arsenm Subscribers: bjope, mssimpso, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D31416 llvm-svn: 299186	2017-03-31 06:29:49 +00:00
Peter Collingbourne	6b193966ac	ThinLTOBitcodeWriter: Use Module::global_values(). NFCI. llvm-svn: 299132	2017-03-30 23:43:08 +00:00
Craig Topper	79e5bc528d	[InstCombine] Fix typo last->least. NFC llvm-svn: 299123	2017-03-30 22:28:55 +00:00
Matt Arsenault	79f837c254	AMDGPU: Add all atomicrmw fields to atomic.inc/dec Add scope, order, isVolatile llvm-svn: 299122	2017-03-30 22:21:40 +00:00
Hongbin Zheng	bfd7c38de7	[SimplifyIndvar] Replace the sdiv used by IV if we can prove both of its operands are non-negative Since there is no sdiv in SCEV, an 'udiv' is a better canonical form than an 'sdiv' as the user of induction variable Differential Revision: https://reviews.llvm.org/D31488 llvm-svn: 299118	2017-03-30 21:56:56 +00:00
Simon Pilgrim	68168d17b9	Spelling mistakes in comments. NFCI. Based on corrections mentioned in patch for clang for PR27635 llvm-svn: 299072	2017-03-30 12:59:53 +00:00
Matthew Simpson	c8f0aeccda	[InstCombine] Correct the check for vector GEPs Some of the GEP combines (e.g., descaling) can't handle vector GEPs. We have an existing check that attempts to bail out if given a vector GEP. However, the check only tests the GEP's pointer operand. A GEP results in a vector of pointers if at least one of its operands is vector-typed (e.g., its pointer operand could be a scalar, but its index could be a vector). We should just check the type of the GEP itself. This should fix PR32414. Reference: https://bugs.llvm.org/show_bug.cgi?id=32414 Differential Revision: https://reviews.llvm.org/D31470 llvm-svn: 299017	2017-03-29 18:23:08 +00:00
Filipe Cabecinhas	8b94273fe6	Cleanup in preparation for D30703. NFCI Make the enumerators follow the coding convention and start with OW_... llvm-svn: 298996	2017-03-29 14:42:27 +00:00
Anna Thomas	923e574bff	[InstCombine] For select rule, use positive check of constant int for select operand. NFCI llvm-svn: 298906	2017-03-28 09:32:24 +00:00
Alex Shlyapnikov	bbd5cc63d7	Revert "[asan] Delay creation of asan ctor." Speculative revert. Some libfuzzer tests are affected. This reverts commit r298731. llvm-svn: 298890	2017-03-27 23:11:50 +00:00
Alex Shlyapnikov	09171aa31f	Revert "[asan] Put ctor/dtor in comdat." Speculative revert, some libfuzzer tests are affected. This reverts commit r298756. llvm-svn: 298889	2017-03-27 23:11:47 +00:00
Matthew Simpson	b8ff4a4a70	[LV] Transform truncations of non-primary induction variables The vectorizer tries to replace truncations of induction variables with new induction variables having the smaller type. After r295063, this optimization was applied to all integer induction variables, including non-primary ones. When optimizing the truncation of a non-primary induction variable, we still need to transform the new induction so that it has the correct start value. This should fix PR32419. Reference: https://bugs.llvm.org/show_bug.cgi?id=32419 llvm-svn: 298882	2017-03-27 20:07:38 +00:00
Anna Thomas	f57ae33381	[InstCombine] Avoid incorrect folding of select into phi nodes when incoming element is a vector type Summary: We are incorrectly folding selects into phi nodes when the incoming value of a phi node is a constant vector. This optimization is done in `FoldOpIntoPhi` when the select condition is a phi node with constant incoming values. Without the fix, we are miscompiling (i.e. incorrectly folding the select into the phi node) when the vector contains non-zero elements. This patch fixes the miscompile and we will correctly fold based on the select vector operand (see added test cases). Reviewers: majnemer, sanjoy, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31189 llvm-svn: 298845	2017-03-27 13:52:51 +00:00
Serge Pavlov	b71bb80c2d	[LoopUnroll] Remap references in peeled iteration References in cloned blocks must be remapped prior to dominator calculation. Differential Revision: https://reviews.llvm.org/D31281 llvm-svn: 298811	2017-03-26 16:46:53 +00:00
Joerg Sonnenberger	fa7367428a	Split the SimplifyCFG pass into two variants. The first variant contains all current transformations except transforming switches into lookup tables. The second variant contains all current transformations. The switch-to-lookup-table conversion results in code that is more difficult to analyze and optimize by other passes. Most importantly, it can inhibit Dead Code Elimination. As such it is often beneficial to only apply this transformation very late. A common example is inlining, which can often result in range restrictions for the switch expression. Changes in execution time according to LNT: SingleSource/Benchmarks/Misc/fp-convert +3.03% MultiSource/Benchmarks/ASC_Sequoia/CrystalMk/CrystalMk -11.20% MultiSource/Benchmarks/Olden/perimeter/perimeter -10.43% and a couple of smaller changes. For perimeter it also results 2.6% a smaller binary. Differential Revision: https://reviews.llvm.org/D30333 llvm-svn: 298799	2017-03-26 06:44:08 +00:00
Chandler Carruth	0d256c0f5d	[IR] Make SwitchInst::CaseIt almost a normal iterator. This moves it to the iterator facade utilities giving it full random access semantics, etc. It can also now be used with standard algorithms like std::all_of and std::any_of and range adaptors like llvm::reverse. Also make the semantics of iterating match what every other iterator uses and forbid decrementing past the begin iterator. This was used as a hacky way to work around iterator invalidation. However, every instance trying to do this failed to actually avoid touching invalid iterators despite the clear documentation that the removed and all subsequent iterators become invalid including the end iterator. So I've added a return of the next iterator to removeCase and rewritten the loops that were doing this to correctly follow the iterator pattern of either incremneting or removing and assigning fresh values to the iterator and the end. In one case we were trying to go backwards to make this cleaner but it doesn't actually work. I've made that code match the code we use everywhere else to remove cases as we iterate. This changes the order of cases in one test output and I moved that test to CHECK-DAG so it wouldn't care -- the order isn't semantically meaningful anyways. llvm-svn: 298791	2017-03-26 02:49:23 +00:00
Craig Topper	47596dd4cc	[InstCombine] Change the interface of SimplifyDemandedBits so that it takes the instruction and operand instead of the Use. The first thing it did was get the User for the Use to get the instruction back. This requires looking through the Uses for the User using the waymarking walk. That's pretty fast, but its probably still better to just pass the Instruction we already had. llvm-svn: 298772	2017-03-25 06:52:52 +00:00
Davide Italiano	e9781e7b2f	[NewGVN] Adjust NDEBUG markers. This avoids 'used but not defined' warnings in Release builds with GCC. llvm-svn: 298760	2017-03-25 02:40:02 +00:00
Evgeniy Stepanov	71bb8f1ad0	[asan] Put ctor/dtor in comdat. When possible, put ASan ctor/dtor in comdat. The only reason not to is global registration, which can be TU-specific. This is not the case when there are no instrumented globals. This is also limited to ELF targets, because MachO does not have comdat, and COFF linkers may GC comdat constructors. The benefit of this is a lot less __asan_init() calls: one per DSO instead of one per TU. It's also necessary for the upcoming gc-sections-for-globals change on Linux, where multiple references to section start symbols trigger quadratic behaviour in gold linker. llvm-svn: 298756	2017-03-25 01:01:11 +00:00
Craig Topper	8fbb74b5b2	Revert r298711 "[InstCombine] Provide a way to calculate KnownZero/One for Add/Sub in SimplifyDemandedUseBits without recursing into ComputeKnownBits" Tsan bot is failing. llvm-svn: 298745	2017-03-24 22:12:10 +00:00
Ivan Krasin	c2124e185c	Revert r298620: [LV] Vectorize GEPs Reason: breaks linking Chromium with LLD + ThinLTO (a pass crashes) LLVM bug: https://bugs.llvm.org//show_bug.cgi?id=32413 Original change description: [LV] Vectorize GEPs This patch adds support for vectorizing GEPs. Previously, we only generated vector GEPs on-demand when creating gather or scatter operations. All GEPs from the original loop were scalarized by default, and if a pointer was to be stored to memory, we would have to build up the pointer vector with insertelement instructions. With this patch, we will vectorize all GEPs that haven't already been marked for scalarization. The patch refines collectLoopScalars to more exactly identify the scalar GEPs. The function now more closely resembles collectLoopUniforms. And the patch moves vector GEP creation out of vectorizeMemoryInstruction and into the main vectorization loop. The vector GEPs needed for gather and scatter operations will have already been generated before vectoring the memory accesses. Original Differential Revision: https://reviews.llvm.org/D30710 llvm-svn: 298735	2017-03-24 20:49:43 +00:00
Evgeniy Stepanov	64e872a91f	[asan] Delay creation of asan ctor. Create the constructor in the module pass. This in needed for the GC-friendly globals change, where the constructor can be put in a comdat in some cases, but we don't know about that in the function pass. llvm-svn: 298731	2017-03-24 20:42:15 +00:00
Matt Arsenault	4c7795dd31	AMDGPU: Fold rcp/rsq of undef to undef llvm-svn: 298725	2017-03-24 19:04:57 +00:00
Matt Arsenault	18bb24a1be	TTI: Split IsSimple in MemIntrinsicInfo All this did before was assert in EarlyCSE. llvm-svn: 298724	2017-03-24 18:56:43 +00:00
Teresa Johnson	428b9e0627	[ThinLTO] Correct counting of functions in inliner stats Summary: Declarations need to be filtered out when counting functions. Reviewers: eraman Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31336 llvm-svn: 298720	2017-03-24 17:59:06 +00:00
Craig Topper	d4521c2fc2	[InstCombine] Provide a way to calculate KnownZero/One for Add/Sub in SimplifyDemandedUseBits without recursing into ComputeKnownBits SimplifyDemandedUseBits for Add/Sub already recursed down LHS and RHS for simplifying bits. If that didn't provide any simplifications we fall back to calling computeKnownBits which will recurse again. Instead just take the known bits for LHS and RHS we already have and call into a new function in ValueTracking that can calculate the known bits given the LHS/RHS bits. llvm-svn: 298711	2017-03-24 16:56:51 +00:00
Benjamin Kramer	46f5e2c47b	Make GCC happy again. llvm-svn: 298702	2017-03-24 14:15:35 +00:00
Daniel Berlin	ffc30781f4	NewGVN: Small cleanup of two dominance related functions to make them easier to understand. llvm-svn: 298692	2017-03-24 06:33:51 +00:00
Daniel Berlin	0e9001131d	NewGVN: Small cleanup of useless expression deletion, and don't uselessly create two expressions in symbolic store evaluation. llvm-svn: 298691	2017-03-24 06:33:48 +00:00
Daniel Berlin	9d0796e5d0	NewGVN: Fix PR32403 - Handling of undef in phis was not quite correct due to LLVM's view of phi nodes. It would cause NewGVN not to fixpoint in some interesting edge cases. llvm-svn: 298687	2017-03-24 05:30:34 +00:00
Craig Topper	36f2e0eee8	[InstCombine] Use range-based for loop. NFC llvm-svn: 298680	2017-03-24 02:58:02 +00:00
Craig Topper	df73e7c5b7	[InstCombine] Fix 80 column violation I accidentally introduced. NFC llvm-svn: 298679	2017-03-24 02:57:59 +00:00
Reid Kleckner	392f062675	[sancov] Don't instrument blocks with no insertion point This prevents crashes when attempting to instrument functions containing C++ try. Sanitizer coverage will still fail at runtime when an exception is thrown through a sancov instrumented function, but that seems marginally better than what we have now. The full solution is to color the blocks in LLVM IR and only instrument blocks that have an unambiguous color, using the appropriate token. llvm-svn: 298662	2017-03-23 23:30:41 +00:00
Dehao Chen	722e94061b	Set the prof weight correctly for call instructions in DeadArgumentElimination. Summary: In DeadArgumentElimination, the call instructions will be replaced. We also need to set the prof weights so that function inlining can find the correct profile. Reviewers: eraman Reviewed By: eraman Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31143 llvm-svn: 298660	2017-03-23 23:26:00 +00:00
Bryant Wong	def79b21e4	[MetaRenamer] Don't rename library functions. Library functions can have specific semantics that affect the behavior of certain passes. DSE, for instance, gives special treatment to malloc-ed pointers but not to pointers returned from an equivalently typed (but differently named) function. MetaRenamer ought not to alter program semantics, so library functions must remain untouched. Reviewers: mehdi_amini, majnemer, chandlerc, davide Reviewed By: davide Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D31304 llvm-svn: 298659	2017-03-23 23:21:07 +00:00
Dehao Chen	8c88671985	Disable loop unrolling and icp in SamplePGO ThinLTO compile phase Summary: loop unrolling and icp will make the sample profile annotation much harder in the backend. So disable these 2 optimization in the ThinLTO compile phase. Will add a test in cfe in a separate patch. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: mehdi_amini, llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D31217 llvm-svn: 298646	2017-03-23 21:20:05 +00:00
Craig Topper	74494d0179	[InstCombine] Remove some code from visitAnd that dealt with trying to reduce the LHS of a sub to 0. This should now be fully handled by SimplifyDemandedInstructionBits now. Now that we call ShrinkDemandedConstant on the RHS of sub this should be taken care of. This code doesn't trigger on any in tree regressions, but did before ShrinkDemandedConstant was added to the RHS. llvm-svn: 298644	2017-03-23 21:00:13 +00:00
Teresa Johnson	0c6a4ff8dc	[ThinLTO] Add support for emitting minimized bitcode for thin link Summary: The cumulative size of the bitcode files for a very large application can be huge, particularly with -g. In a distributed build environment, all of these files must be sent to the remote build node that performs the thin link step, and this can exceed size limits. The thin link actually only needs the summary along with a bitcode symbol table. Until we have a proper bitcode symbol table, simply stripping the debug metadata results in significant size reduction. Add support for an option to additionally emit minimized bitcode modules, just for use in the thin link step, which for now just strips all debug metadata. I plan to add a cc1 option so this can be invoked easily during the compile step. However, care must be taken to ensure that these minimized thin link bitcode files produce the same index as with the original bitcode files, as these original bitcode files will be used in the backends. Specifically: 1) The module hash used for caching is typically produced by hashing the written bitcode, and we want to include the hash that would correspond to the original bitcode file. This is because we want to ensure that changes in the stripped portions affect caching. Added plumbing to emit the same module hash in the minimized thin link bitcode file. 2) The module paths in the index are constructed from the module ID of each thin linked bitcode, and typically is automatically generated from the input file path. This is the path used for finding the modules to import from, and obviously we need this to point to the original bitcode files. Added gold-plugin support to take a suffix replacement during the thin link that is used to override the identifier on the MemoryBufferRef constructed from the loaded thin link bitcode file. The assumption is that the build system can specify that the minimized bitcode file has a name that is similar but uses a different suffix (e.g. out.thinlink.bc instead of out.o). Added various tests to ensure that we get identical index files out of the thin link step. Reviewers: mehdi_amini, pcc Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31027 llvm-svn: 298638	2017-03-23 19:47:39 +00:00
Matthew Simpson	4e7b71bc86	[LV] Vectorize GEPs This patch adds support for vectorizing GEPs. Previously, we only generated vector GEPs on-demand when creating gather or scatter operations. All GEPs from the original loop were scalarized by default, and if a pointer was to be stored to memory, we would have to build up the pointer vector with insertelement instructions. With this patch, we will vectorize all GEPs that haven't already been marked for scalarization. The patch refines collectLoopScalars to more exactly identify the scalar GEPs. The function now more closely resembles collectLoopUniforms. And the patch moves vector GEP creation out of vectorizeMemoryInstruction and into the main vectorization loop. The vector GEPs needed for gather and scatter operations will have already been generated before vectoring the memory accesses. Differential Revision: https://reviews.llvm.org/D30710 llvm-svn: 298620	2017-03-23 16:29:58 +00:00
Matthew Simpson	1fb4064531	[LV] Delete unneeded scalar GEP creation code The code for generating scalar base pointers in vectorizeMemoryInstruction is not needed. We currently scalarize all GEPs and maintain the scalarized values in VectorLoopValueMap. The GEP cloning in this unneeded code is the same as that in scalarizeInstruction. The test cases that changed as a result of this patch changed because we were able to reuse the scalarized GEP that we previously generated instead of cloning a new one. Differential Revision: https://reviews.llvm.org/D30587 llvm-svn: 298615	2017-03-23 16:07:21 +00:00
Dehao Chen	53a0c082d2	Do not set branch weight if the branch weight annotation is present. Summary: ThinLTO will annotate the CFG twice. If the branch weight is set by the first annotation, we should not set the branch weight again in the second annotation because the first annotation is more accurate as there is less optimization that could affect debug info accuracy. Reviewers: tejohnson, davidxl Reviewed By: tejohnson Subscribers: mehdi_amini, aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D31228 llvm-svn: 298602	2017-03-23 14:43:10 +00:00
Luqman Aden	3f807c91dc	Preserve nonnull metadata on Loads through SROA & mem2reg. Summary: https://llvm.org/bugs/show_bug.cgi?id=31142 : SROA was dropping the nonnull metadata on loads from allocas that got optimized out. This patch simply preserves nonnull metadata on loads through SROA and mem2reg. Reviewers: chandlerc, efriedma Reviewed By: efriedma Subscribers: hfinkel, spatel, efriedma, arielb1, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D27114 llvm-svn: 298540	2017-03-22 19:16:39 +00:00
Peter Collingbourne	f7691d8b41	IPO: Const correctness for summaries passed into passes. Pass const qualified summaries into importers and unqualified summaries into exporters. This lets us const-qualify the summary argument to thinBackend. Differential Revision: https://reviews.llvm.org/D31230 llvm-svn: 298534	2017-03-22 18:22:59 +00:00
Peter Collingbourne	9a3f97977f	IR: Fix a race condition in type id clients of ModuleSummaryIndex. Add a const version of the getTypeIdSummary accessor that avoids mutating the TypeIdMap. Differential Revision: https://reviews.llvm.org/D31226 llvm-svn: 298531	2017-03-22 18:04:39 +00:00
Sanjay Patel	2f602cea41	[InstCombine] canonicalize insertelement of scalar constant ahead of insertelement of variable insertelement (insertelement X, Y, IdxC1), ScalarC, IdxC2 --> insertelement (insertelement X, ScalarC, IdxC2), Y, IdxC1 As noted in the code comment and seen in the test changes, the motivation is that by pulling constant insertion up, we may be able to constant fold some insertelement instructions. Differential Revision: https://reviews.llvm.org/D31196 llvm-svn: 298520	2017-03-22 17:10:44 +00:00
Evgeny Astigeevich	7823c66e05	r286814 resulted that CallPenalty can be subtracted twice: - First time, during calculation of the cost in InlineCost.cpp - Second time, during calculation of the cost in Inliner.cpp This patches fixes this. Differential Revision: https://reviews.llvm.org/D31137 llvm-svn: 298496	2017-03-22 12:01:57 +00:00
Craig Topper	07f2915ad8	[InstCombine] Teach SimplifyDemandedUseBits to shrink Constants on the left side of subtracts Summary: Subtracts can have constants on the left side, but we don't shrink them based on demanded bits. This patch fixes that to match the right hand side. Reviewers: davide, majnemer, spatel, sanjoy, hfinkel Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31119 llvm-svn: 298478	2017-03-22 04:03:53 +00:00
George Burgess IV	56c7e88c2c	Let llvm.objectsize be conservative with null pointers This adds a parameter to @llvm.objectsize that makes it return conservative values if it's given null. This fixes PR23277. Differential Revision: https://reviews.llvm.org/D28494 llvm-svn: 298430	2017-03-21 20:08:59 +00:00
Dehao Chen	9907e9d860	Do not inline hot callsites for samplepgo in thinlto compile phase. Summary: Because SamplePGO passes will be invoked twice in ThinLTO build: once at compile phase, the other at backend. We want to make sure the IR at the 2nd phase matches the hot part in profile, thus we do not want to inline hot callsites in the first phase. Reviewers: tejohnson, eraman Reviewed By: tejohnson Subscribers: mehdi_amini, llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D31201 llvm-svn: 298428	2017-03-21 19:55:36 +00:00
Reid Kleckner	b518054b87	Rename AttributeSet to AttributeList Summary: This class is a list of AttributeSetNodes corresponding the function prototype of a call or function declaration. This class used to be called ParamAttrListPtr, then AttrListPtr, then AttributeSet. It is typically accessed by parameter and return value index, so "AttributeList" seems like a more intuitive name. Rename AttributeSetImpl to AttributeListImpl to follow suit. It's useful to rename this class so that we can rename AttributeSetNode to AttributeSet later. AttributeSet is the set of attributes that apply to a single function, argument, or return value. Reviewers: sanjoy, javed.absar, chandlerc, pete Reviewed By: pete Subscribers: pete, jholewinski, arsenm, dschuff, mehdi_amini, jfb, nhaehnle, sbc100, void, llvm-commits Differential Revision: https://reviews.llvm.org/D31102 llvm-svn: 298393	2017-03-21 16:57:19 +00:00
Yi Kong	019e1c4f99	Test commit access Remove some trailing whitespaces. llvm-svn: 298379	2017-03-21 14:49:19 +00:00
Artur Pilipenko	4cc6130f52	NFC. InstCombiner::visitFAdd extract LHSIntVal/RHSIntVal local variables llvm-svn: 298359	2017-03-21 11:32:15 +00:00
Matt Arsenault	6b00d40900	InstCombine: Check source value precision when reducing cast intrinsic Missed this check when porting from the libcall version. llvm-svn: 298312	2017-03-20 21:59:24 +00:00

... 7 8 9 10 11 ...

18307 Commits