llvm-project

Commit Graph

Author	SHA1	Message	Date
Nick Lewycky	c7f1e7993c	Merge isObjectPointerWithTrustworthySize with getPointerSize. Use it when looking at the size of the pointee. Fixes PR11390! llvm-svn: 144773	2011-11-16 03:49:48 +00:00
Andrew Trick	90c7a108ca	Fix SCEV overly optimistic back edge taken count for multi-exit loops. Fixes PR11375: Different results for 'clang++ huh.cpp'... llvm-svn: 144746	2011-11-16 00:52:40 +00:00
Nick Lewycky	7013a19e8a	Refactor capture tracking (which already had a couple flags for whether returns and stores capture) to permit the caller to see each capture point and decide whether to continue looking. Use this inside memdep to do an analysis that basicaa won't do. This lets us solve another devirtualization case, fixing PR8908! llvm-svn: 144580	2011-11-14 22:49:42 +00:00
Nick Lewycky	d48ab84556	Don't try to loop on iterators that are potentially invalidated inside the loop. Fixes PR11361! llvm-svn: 144454	2011-11-12 03:09:12 +00:00
Eli Friedman	ecb453805d	Make sure scalarrepl picks the correct alloca when it rewrites a bitcast. Fixes PR11353. llvm-svn: 144442	2011-11-12 02:07:50 +00:00
Eli Friedman	0a309292c4	Get rid of an optimization in SCCP which appears to have many issues. Specifically, it doesn't handle many cases involving undef correctly, and it is missing other checks which lead to it trying to re-mark a value marked as a constant with a different value. It also appears to trigger very rarely. Fixes PR11357. llvm-svn: 144352	2011-11-11 01:16:15 +00:00
Pete Cooper	856977cb15	DeadStoreElimination can now trim the size of a store if the end of the store is dead. Currently checks alignment and killing stores on a power of 2 boundary as this is likely to trim the size of the earlier store without breaking large vector stores into scalar ones. Fixes <rdar://problem/10140300> llvm-svn: 144239	2011-11-09 23:07:35 +00:00
Eli Friedman	0bae8b2cfb	Fix code to match comment. Fixes PR11340, a regression from r143209. llvm-svn: 144121	2011-11-08 21:08:02 +00:00
Pete Cooper	9ee220915b	LICM pass now understands invariant load metadata. Nothing generates this yet so it will currently never get used in real tests llvm-svn: 144107	2011-11-08 19:30:00 +00:00
Bill Wendling	2a917595d2	Convert to the new EH model. llvm-svn: 144050	2011-11-08 00:23:01 +00:00
Nick Lewycky	f2905afe62	Do simple cross-block DSE when we encounter a free statement. Fixes PR11240. llvm-svn: 143808	2011-11-05 10:48:42 +00:00
Dan Gohman	ce3d6248b2	Add tests for existing InstSimplify features. llvm-svn: 143721	2011-11-04 18:39:16 +00:00
Dan Gohman	85977e6ab4	Teach instsimplify to simplify calls to undef. llvm-svn: 143719	2011-11-04 18:32:42 +00:00
Daniel Dunbar	e6d40de414	Speculatively revert "DeadStoreElimination can now trim the size of a store if the end of it is dead.", which appears to break bootstrapping LLVM. llvm-svn: 143668	2011-11-04 00:48:26 +00:00
Pete Cooper	8a95aedb5d	DeadStoreElimination can now trim the size of a store if the end of it is dead. Only currently done if the later store is writing to a power of 2 address or has the same alignment as the earlier store as then its likely to not break up large stores into smaller ones Fixes <rdar://problem/10140300> llvm-svn: 143630	2011-11-03 18:01:56 +00:00
Andrew Trick	c2c79c90f2	Rewrite LinearFunctionTestReplace to handle pointer-type IVs. We've been hitting asserts in this code due to the many supported combintions of modes (iv-rewrite/no-iv-rewrite) and IV types. This second rewrite of the code attempts to deal with these cases systematically. llvm-svn: 143546	2011-11-02 17:19:57 +00:00
Andrew Trick	0dae890346	Broaden an assert to handle enable-iv-rewrite=true following r143183. Narrowest possible fix for PR11279. llvm-svn: 143522	2011-11-02 00:02:45 +00:00
Eli Friedman	a49b828f8f	Make sure we use the right insertion point when instcombine replaces a PHI with another instruction. (Specifically, don't insert an arbitrary instruction before a PHI.) Fixes PR11275. llvm-svn: 143437	2011-11-01 04:49:29 +00:00
Duncan Sands	3d5692a475	Reapply commit 143214 with a fix: m_ICmp doesn't match conditions with the given predicate, it matches any condition and returns the predicate - d'oh! Original commit message: The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143318	2011-10-30 19:56:36 +00:00
Benjamin Kramer	594ee77964	SimplifyLibCalls: Use IRBuilder.CreateGlobalString when creating a string for printf->puts, which correctly sets the unnamed_addr bit on the resulting GlobalVariable. Fixes PR11264. llvm-svn: 143289	2011-10-29 19:43:31 +00:00
Eli Friedman	3af3c046a9	Revert r143214; it's breaking a bunch of stuff. llvm-svn: 143265	2011-10-29 00:56:07 +00:00
Duncan Sands	280bc553b3	The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143214	2011-10-28 19:01:20 +00:00
Duncan Sands	985ba6386d	A shift of a power of two is a power of two or zero. For completeness - not spotted in the wild. llvm-svn: 143211	2011-10-28 18:30:05 +00:00
Duncan Sands	92af0a8a7f	Fold icmp ugt (udiv X, Y), X to false. Spotted by my super-optimizer in 186.crafty. llvm-svn: 143209	2011-10-28 18:17:44 +00:00
Andrew Trick	effdca9441	LFTR should avoid a type mismatch with null pointer IVs. Fixes rdar://10359193 Indvar LinearFunctionTestReplace assertion llvm-svn: 143183	2011-10-28 03:45:11 +00:00
Duncan Sands	7cb61e5a0e	Reapply commit 143028 with a fix: the problem was casting a ConstantExpr Mul using BinaryOperator (which only works for instructions) when it should have been a cast to OverflowingBinaryOperator (which also works for constants). While there, correct a few other dubious looking uses of BinaryOperator. Thanks to Chad Rosier for the testcase. Original commit message: My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y \| 1). This occurs in 464.h264ref. llvm-svn: 143125	2011-10-27 19:16:21 +00:00
Bob Wilson	1455ce27e4	Revert Duncan's r143028 expression folding which appears to be the culprit behind a compile failure on 483.xalancbmk. llvm-svn: 143102	2011-10-27 15:47:25 +00:00
Eli Friedman	73beaf7bbc	It is not safe to sink an alloca into a stacksave/stackrestore pair, so don't do that. <rdar://problem/10352360> llvm-svn: 143093	2011-10-27 01:33:51 +00:00
Duncan Sands	ba286d7c73	The maximum power of 2 dividing a power of 2 is itself. This occurs in 403.gcc and was spotted by my super-optimizer. llvm-svn: 143054	2011-10-26 20:55:21 +00:00
Duncan Sands	1d2bb9882d	My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y \| 1). This occurs in 464.h264ref. llvm-svn: 143028	2011-10-26 15:31:51 +00:00
Nick Lewycky	dd1d3df524	A dead malloc, a free(NULL) and a free(undef) are all trivially dead instructions. This doesn't introduce any optimizations we weren't doing before (except potentially due to pass ordering issues), now passes will eliminate them sooner as part of their own cleanups. llvm-svn: 142787	2011-10-24 04:35:36 +00:00
Cameron Zwarich	057fbb1a10	The element insertion code in scalar replacement doesn't handle incorrect element types, even though the element extraction code does. It is surprising that this bug has been here for so long. Fixes <rdar://problem/10318778>. llvm-svn: 142740	2011-10-23 07:02:10 +00:00
Nick Lewycky	52340ac5f8	Oops! Fix test I forgot to submit as part of r142735. llvm-svn: 142736	2011-10-22 22:07:31 +00:00
Nick Lewycky	32f8051d66	A non-escaping malloc in the entry block is not unlike an alloca. Do dead-store elimination on them too. llvm-svn: 142735	2011-10-22 21:59:35 +00:00
Eli Friedman	688db1d6d0	Remap blockaddress correctly when inlining a function. Fixes PR10162. llvm-svn: 142684	2011-10-21 20:45:19 +00:00
Eli Friedman	ce818277fc	Extend instcombine's shufflevector simplification to handle more cases where the input and output vectors have different sizes. Patch by Xiaoyi Guo. llvm-svn: 142671	2011-10-21 19:06:29 +00:00
Eli Friedman	1923a330e6	Refactor code from inlining and globalopt that checks whether a function definition is unused, and enhance it so it can tell that functions which are only used by a blockaddress are in fact dead. This probably doesn't happen much on most code, but the Linux kernel's _THIS_IP_ can trigger this issue with blockaddress. (GlobalDCE can also handle the given tescase, but we only run that at -O3.) Found while looking at PR11180. llvm-svn: 142572	2011-10-20 05:23:42 +00:00
Nick Lewycky	462098824f	"@string = constant i8 0" is a value i8* string of length zero. Analyze that correctly in GetStringLength, fixing PR11181! llvm-svn: 142558	2011-10-20 00:34:35 +00:00
Dan Gohman	a7107f992e	Teach the ARC optimizer about the !clang.arc.copy_on_escape metadata tag on objc_retainBlock calls, which indicates that they may be optimized away. rdar://10211286. llvm-svn: 142298	2011-10-17 22:53:25 +00:00
Lang Hames	e7594abd87	Fixed quoting on default data layout option. llvm-svn: 142286	2011-10-17 21:54:43 +00:00
Bill Wendling	c68c8cb8d4	Add support for the Objective-C personality function to the instruction combining of the landingpad instruction. The ObjC personality function acts almost identically to the C++ personality function. In particular, it uses "null" as a "catch-all" value. llvm-svn: 142256	2011-10-17 21:20:24 +00:00
Dan Gohman	1736c14b85	Suppress partial retain+release elimination when there's a possibility that it will span multiple CFG diamonds/triangles which could have different controlling predicates. rdar://10282956 llvm-svn: 142222	2011-10-17 18:48:25 +00:00
Bill Wendling	63a4ea1859	Correct over-zealous removal of hack. Some code want to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	42cf65fe51	Temporarily XFAIL waiting for a fix. llvm-svn: 142215	2011-10-17 18:25:32 +00:00
Chandler Carruth	3e8aa65bc2	Add a routine to swap branch instruction operands, and update any profile metadata at the same time. Use it to preserve metadata attached to a branch when re-writing it in InstCombine. Add metadata to the canonicalize_branch InstCombine test, and check that it is tranformed correctly. Reviewed by Nick Lewycky! llvm-svn: 142168	2011-10-17 01:11:57 +00:00
Nick Lewycky	84baea77ea	Oops! Fix testcase. llvm-svn: 142151	2011-10-16 20:20:15 +00:00
Nick Lewycky	0a7e9ccf04	When looking for dependencies on the src pointer, scan the src pointer. Scanning on the memcpy call will pull up other unrelated stuff. Fixes PR11142. llvm-svn: 142150	2011-10-16 20:13:32 +00:00
Andrew Trick	fd4ca0f4ac	Fix SCEVExpander assert during LSR: "argument of incompatible type". Just because we're dealing with a GEP doesn't mean we can assert the SCEV has a pointer type. The fix is simply to ignore the SCEV pointer type, which we really didn't need. Fixes PR11138 webkit crash. llvm-svn: 142058	2011-10-15 06:19:55 +00:00
Andrew Trick	870c1a3f15	Reapply r141870, SCEV expansion of post-inc. Speculatively reapply to see if this test case still crashes on linux. I may have fixed it in my last checkin. llvm-svn: 141895	2011-10-13 21:55:29 +00:00
Andrew Trick	41c253c35c	Revert r141870. The test case crashes on linux with data corruption. A deeper issue was exposed. llvm-svn: 141873	2011-10-13 17:58:24 +00:00
Andrew Trick	e15d6e14e3	LSR: Reuse the post-inc expansion of expressions. This avoids unnecessary expansion of expressions and allows the SCEV expander to work on expression DAGs, not just trees. Fixes PR11090. llvm-svn: 141870	2011-10-13 17:31:47 +00:00
Lang Hames	850f7b3cdc	Removed colons from some target datalayout strings in test, since they don't match the required format. llvm-svn: 141825	2011-10-12 22:24:17 +00:00
Cameron Zwarich	1a761dcfbd	Fix PR11106 by correcting a typo that has been in the code for over a year. This would have never worked, since the element type of a vector type is never a vector type. Also fix the conditional to be more direct in checking whether EltTy is a vector type. llvm-svn: 141713	2011-10-11 21:26:40 +00:00
Cameron Zwarich	ab3a9b3baf	Add a test for PR10565. llvm-svn: 141647	2011-10-11 06:10:37 +00:00
Cameron Zwarich	d7515ccc47	Remove a lot of the fancy scalar replacement code for dealing with llvm-gcc's lowering of NEON code. It provides little-to-no benefit now and only introduces additional complexity. llvm-svn: 141646	2011-10-11 06:10:30 +00:00
Andrew Trick	f9201c572e	Move replaceCongruentIVs into SCEVExapander and bias toward "expanded" IVs. Indvars previously chose randomly between congruent IVs. Now it will bias the decision toward IVs that SCEVExpander likes to create. This was not done to fix any problem, it's just a welcome side effect of factoring code. llvm-svn: 141633	2011-10-11 02:28:51 +00:00
Lang Hames	44c78f809b	Added a testcase for r141599, rdar://problem/10063881. llvm-svn: 141628	2011-10-11 01:32:10 +00:00
Andrew Trick	ce0cb3a101	Unit test for LSR phi reuse in r141442. llvm-svn: 141472	2011-10-08 02:34:51 +00:00
Duncan Sands	c52af46484	Teach GVN to also propagate switch cases. For example, in this code switch (n) { case 27: do_something(x); ... } the call do_something(x) will be replaced with do_something(27). In gcc-as-one-big-file this results in the removal of about 500 lines of bitcode (about 0.02%), so has about 1/10 of the effect of propagating branch conditions. llvm-svn: 141360	2011-10-07 08:29:06 +00:00
Eli Friedman	3e3aecbc2c	PR11061: Make simplifylibcalls fold strcmp("", x) correctly. While I'm here, fix the related issue with strncmp, add some actual tests for strcmp and strncmp, and start using StringRef::compare for constant folding instead of using strcmp/strncmp so that the optimized IR isn't dependent on the host's implementation of strcmp. llvm-svn: 141227	2011-10-05 22:27:16 +00:00
Jim Grosbach	8f9acfac89	Revert 141203. InstCombine is looping on unit tests. llvm-svn: 141209	2011-10-05 20:44:29 +00:00
Rafael Espindola	79d0c4f4b0	Check for the returns_twice attribute in callsFunctionThatReturnsTwice. This fixes PR11038, but there are still some cleanups to be done. llvm-svn: 141204	2011-10-05 20:05:13 +00:00
Jim Grosbach	e37e030137	Update InstCombine worklist after instruction transform is complete. When updating the worklist for InstCombine, the Add/AddUsersToWorklist functions may access the instruction(s) being added, for debug output for example. If the instructions aren't yet added to the basic block, this can result in a crash. Finish the instruction transformation before adjusting the worklist instead. rdar://10238555 llvm-svn: 141203	2011-10-05 20:05:00 +00:00
Duncan Sands	f4f47ccd12	GVN does simple propagation of conditions: when it sees a conditional branch "br i1 %x, label %if_true, label %if_false" then it replaces "%x" with "true" in places only reachable via the %if_true arm, and with "false" in places only reachable via the %if_false arm. Except that actually it doesn't: if value numbering shows that %y is equal to %x then, yes, %y will be turned into true/false in this way, but any occurrences of %x itself are not transformed. Fix this. What's more, it's often the case that %x is an equality comparison such as "%x = icmp eq %A, 0", in which case every occurrence of %A that is only reachable via the %if_true arm can be replaced with 0. Implement this and a few other variations on this theme. This reduces the number of lines of LLVM IR in "GCC as one big file" by 0.2%. It has a bigger impact on Ada code, typically reducing the number of lines of bitcode by around 0.4% by removing repeated compiler generated checks. Passes the LLVM nightly testsuite and the Ada ACATS testsuite. llvm-svn: 141177	2011-10-05 14:28:49 +00:00
Duncan Sands	e90dd0587e	Generalize GVN's conditional propagation logic slightly: it's OK for the false/true destination to have multiple predecessors as long as the extra ones are dominated by the branch destination. llvm-svn: 141176	2011-10-05 14:17:01 +00:00
Andrew Trick	8de329a9fc	LSR should avoid redundant edge splitting. This handles the case in which LSR rewrites an IV user that is a phi and splits critical edges originating from a switch. Fixes <rdar://problem/6453893> LSR is not splitting edges "nicely" llvm-svn: 141059	2011-10-04 03:50:44 +00:00
Andrew Trick	bf51f97c28	Unit test for r140919, loop unroll heuristics. llvm-svn: 141049	2011-10-04 00:07:02 +00:00
Rafael Espindola	cc349c8dd8	Add the returns_twice attribute to LLVM. llvm-svn: 141001	2011-10-03 14:45:37 +00:00
Nick Lewycky	99fb091f65	Add a new icmp+select optz'n. Also shows off the load(cst) folding added in r140966. llvm-svn: 140969	2011-10-02 10:37:37 +00:00
Jim Grosbach	24ff834671	float comparison to double 'zero' constant can just be a float 'zero.' InstCombine was incorrectly considering the conversion of the constant zero to be unsafe. We want to transform: define float @bar(float %x) nounwind readnone optsize ssp { %conv = fpext float %x to double %cmp = fcmp olt double %conv, 0.000000e+00 %conv1 = zext i1 %cmp to i32 %conv2 = sitofp i32 %conv1 to float ret float %conv2 } Into: define float @bar(float %x) nounwind readnone optsize ssp { %cmp = fcmp olt float %x, 0.000000e+00 ; <---- This %conv1 = zext i1 %cmp to i32 %conv2 = sitofp i32 %conv1 to float ret float %conv2 } rdar://10215914 llvm-svn: 140869	2011-09-30 18:45:50 +00:00
Duncan Sands	d6c0011d92	Add forgotten tests that the cleanup flag is cleared if there is a catch-all landingpad clause. llvm-svn: 140858	2011-09-30 17:00:34 +00:00
Duncan Sands	5c05579f94	Inlining often produces landingpad instructions with repeated catch or repeated filter clauses. Teach instcombine a bunch of tricks for simplifying landingpad clauses. Currently the code only recognizes the GNU C++ and Ada personality functions, but that doesn't stop it doing a bunch of "generic" transforms which are hopefully fine for any real-world personality function. If these "generic" transforms turn out not to be generic, they can always be conditioned on the personality function. Probably someone should add the ObjC++ personality function. I didn't as I don't know anything about it. llvm-svn: 140852	2011-09-30 13:12:16 +00:00
Dan Gohman	4ac148dcbc	When eliminating unnecessary retain+autorelease on return values, handle the case where the retain is in a different basic block. rdar://10210274. llvm-svn: 140815	2011-09-29 22:27:34 +00:00
Dan Gohman	2053a5dd64	Don't eliminate objc_retainBlock calls on stack objects if the objc_retainBlock call is potentially responsible for copying the block to the heap to extend its lifetime. rdar://10209613. llvm-svn: 140814	2011-09-29 22:25:23 +00:00
Andrew Trick	8c219ecd1a	Test case for r140670: indvars should hoist sext. llvm-svn: 140671	2011-09-28 02:13:32 +00:00
Benjamin Kramer	547b6c5ecd	Stop emitting instructions with the name "tmp" they eat up memory and have to be uniqued, without any benefit. If someone prefers %tmp42 to %42, run instnamer. llvm-svn: 140634	2011-09-27 20:39:19 +00:00
Bill Wendling	90f90da156	Split the landing pad basic block with the correct function. Also merge the split landingpad instructions into a PHI node. PR11016 llvm-svn: 140592	2011-09-27 00:59:31 +00:00
Eli Friedman	c064f2c33e	Convert more tests over to the new atomic instructions. llvm-svn: 140559	2011-09-26 20:27:49 +00:00
Eli Friedman	bda9e7af58	Upgrade a couple more tests to the new atomic instructions. llvm-svn: 140558	2011-09-26 20:15:56 +00:00
Eli Friedman	f9b785f185	PR10987: add a missed safety check to isSafePHIToSpeculate in scalarrepl. llvm-svn: 140327	2011-09-22 18:56:30 +00:00
Eli Friedman	1815b688cc	Make sure IPSCCP never marks a tracked call as overdefined in SCCPSolver::ResolvedUndefsIn. If we do, we can end up in a situation where a function is resolved to return a constant, but the caller is marked overdefined, which confuses the code later. <rdar://problem/9956541> (again). llvm-svn: 140210	2011-09-20 23:28:51 +00:00
Bill Wendling	cf1372d183	Update this test to the new EH model. Though I think it may be obsolete with the loop extract changes. And I couldn't get the old version of LLVM to compile so that I could reduce this testcase. llvm-svn: 140197	2011-09-20 22:29:43 +00:00
Bill Wendling	626a6f4160	Update to new EH model. llvm-svn: 140177	2011-09-20 20:21:16 +00:00
Bill Wendling	708d38e06a	Update to the new EH syntax. llvm-svn: 140103	2011-09-19 23:50:34 +00:00
Bill Wendling	ff5d9cc389	Dramatically reduce this testcase. llvm-svn: 140101	2011-09-19 23:47:06 +00:00
Bill Wendling	7cdaa3a1a8	Revert r140083 and r140084 until buildbots can be fixed. llvm-svn: 140094	2011-09-19 23:30:41 +00:00
Bill Wendling	d01aee587f	Update test to remove the 'unwind' instruction. llvm-svn: 140084	2011-09-19 23:01:11 +00:00
Bill Wendling	8bf0eab34d	Replace uses of unwind with unreachable for the same effect. llvm-svn: 140077	2011-09-19 22:16:15 +00:00
Bill Wendling	e7b02b8170	Replace more uses of 'unwind' in the tests with calls to landingpad and resume. Note that some of these tests were basically dead. llvm-svn: 140076	2011-09-19 22:11:35 +00:00
Eli Friedman	61d7c8a065	Fix an infinite loop where a transform in InstCombiner::visitAnd claims a construct is changed when it is not. (See included testcase.) Patch by Xiaoyi Guo. llvm-svn: 140072	2011-09-19 21:58:15 +00:00
Bill Wendling	c945f54ea5	This testcase is dead. It doesn't inline even if I add the 'alwaysinline' attribute to the @foo function. llvm-svn: 140067	2011-09-19 21:14:33 +00:00
Andrew Trick	7251e41b16	[indvars] Fix PR10946: SCEV cannot handle Vector IVs. llvm-svn: 140026	2011-09-19 17:54:39 +00:00
Andrew Trick	74111ee07f	Reapply r139759. Disable IV rewriting by default. See PR10916. llvm-svn: 139842	2011-09-15 20:58:37 +00:00
Eli Friedman	888bea0b95	Make demanded-elt simplification for shufflevector slightly stronger. Spotted by inspection. llvm-svn: 139768	2011-09-15 01:14:29 +00:00
Dan Gohman	d4b5e3a4d9	objc_retainBlock is not NoModRef because it can update forwarding pointers in memory relevant to the optimizer. rdar://10050579. llvm-svn: 139708	2011-09-14 18:13:00 +00:00
Andrew Trick	f9f68b816b	[indvars] Revert r139579 until 401.bzip -arch i386 miscompilation is fixed. PR10920. llvm-svn: 139583	2011-09-13 05:23:49 +00:00
Andrew Trick	061d811c51	Disable IV rewriting by default. See PR10916. llvm-svn: 139579	2011-09-13 03:23:21 +00:00
Andrew Trick	5b28cc84f0	Generalize test case to handle multiple indvars modes. llvm-svn: 139578	2011-09-13 03:17:25 +00:00
Andrew Trick	3de5b8e4c1	[indvars] Fix bugs in floating point IV range checks noticed by inspection. llvm-svn: 139574	2011-09-13 01:59:32 +00:00
Andrew Trick	54a109845d	Conditionalize indvars test that relies on SCEV expansion of geps, which is only relevant with canonical IVs llvm-svn: 139556	2011-09-12 23:13:57 +00:00
Andrew Trick	a8315c3f2b	indvars test only relevant for -enable-iv-rewrite. Otherwise this case is now covered by no-iv-rewrite.ll. llvm-svn: 139552	2011-09-12 22:59:00 +00:00
Andrew Trick	d2e61e1f70	Conditionalize indvars tests that rely on SCEV expansion of geps, which is relevant with canonical IVs. Anything else being checked by these tests is already covered by early CSE. llvm-svn: 139535	2011-09-12 20:26:34 +00:00
Andrew Trick	30e8db98b8	Removing indvars tests that directly test canonical IVs and nothing else. llvm-svn: 139518	2011-09-12 18:33:08 +00:00
Andrew Trick	183013d8d4	Rename -disable-iv-rewrite to -enable-iv-rewrite=false in preparation for default change. llvm-svn: 139517	2011-09-12 18:28:44 +00:00
Andrew Trick	8c6fb3af6e	Test case for r139453, WidenIV::GetExtendedOperandRecurrence. llvm-svn: 139504	2011-09-12 17:20:57 +00:00
Duncan Sands	ba60b04148	Mark the eh.typeid.for intrinsic as being 'const', which it is inside any given function. As pointed out by John McCall, this is needed to have redundant eh.typeid.for tests be eliminated in the presence of cleanups. llvm-svn: 139360	2011-09-09 07:50:37 +00:00
Duncan Sands	524c33a27f	When inlining exception handling code into another function, ensure that duplicate tests are eliminated (for example if the two functions both have a catch clause catching the same type, ensure the redundant one is removed). Note that it would probably be safe to say that eh.typeid.for is 'const', but since two calls to it with the same argument can give different results (but only if the calls are in different functions), it seems more correct to mark it only 'pure'; this doesn't get in the way of the optimization. llvm-svn: 139236	2011-09-07 16:44:14 +00:00
Duncan Sands	d7430cea10	Forgot to add this trampoline testcase. llvm-svn: 139229	2011-09-07 09:21:38 +00:00
Owen Anderson	58704ee442	Try again at r138809 (make DSE more aggressive in removing dead stores at the end of a function), now with less deleting stores before memcpy's. llvm-svn: 139150	2011-09-06 18:14:09 +00:00
Duncan Sands	a098436b32	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. llvm-svn: 139140	2011-09-06 13:37:06 +00:00
Benjamin Kramer	4b79c21ef2	InstSimplify: Don't try to replace an extractvalue/insertvalue pair with the original value if types don't match. Fixes clang selfhost. llvm-svn: 139120	2011-09-05 18:16:19 +00:00
Duncan Sands	29192d042e	Delete trivial landing pads that just continue unwinding the caught exception. llvm-svn: 139117	2011-09-05 12:57:57 +00:00
Duncan Sands	fd26a954a8	Add some simple insertvalue simplifications, for the purpose of cleaning up do-nothing exception handling code produced by dragonegg. llvm-svn: 139113	2011-09-05 06:52:48 +00:00
Bill Wendling	4aa2573748	Try to eliminate the use of the 'unwind' instruction. llvm-svn: 139046	2011-09-02 22:41:11 +00:00
Andrew Trick	310a448cfe	Test case update for unroll-scev. llvm-svn: 139037	2011-09-02 21:21:03 +00:00
Jakub Staszak	63a3a0e010	Extra CHECK-NOT to make sure that GVN transform works properly. llvm-svn: 139012	2011-09-02 17:40:39 +00:00
Andrew Trick	4a31ba3bae	-unroll-scev flag removal llvm-svn: 139010	2011-09-02 17:36:14 +00:00
Jakub Staszak	057d423e4b	ConstantVector returns arbitrary value for the wrong index. This fixes PR10813. llvm-svn: 139006	2011-09-02 15:43:43 +00:00
Jakub Staszak	7470fb01d0	Compare type size instead of type _store_ size to make sure that BitCastInst will be valid. This fixes PR10820. llvm-svn: 139005	2011-09-02 14:57:37 +00:00
Bill Wendling	723cec7a5f	Update to new EH scheme. llvm-svn: 138989	2011-09-02 01:25:11 +00:00
Eli Friedman	4028a51c74	Fix test; sorry for any inconvenience. llvm-svn: 138966	2011-09-01 21:25:42 +00:00
Eli Friedman	71f5c2f158	Fix an issue with the IR sink pass found by inspection. (I'm not sure anyone is actually using this, but might as well fix it since I found the issue.) llvm-svn: 138965	2011-09-01 21:21:24 +00:00
Eli Friedman	cc6e92892f	Add missing newline. llvm-svn: 138964	2011-09-01 21:20:11 +00:00
Bill Wendling	185d377597	Update to new EH scheme. llvm-svn: 138933	2011-09-01 01:28:25 +00:00
Bill Wendling	d33e3007fa	Update to new EH scheme. llvm-svn: 138928	2011-09-01 01:08:21 +00:00
Bill Wendling	3b17c1b48d	Update to new EH scheme. llvm-svn: 138927	2011-09-01 01:02:41 +00:00
Bill Wendling	e88632d667	Update some tests to the new EH scheme. llvm-svn: 138925	2011-09-01 00:58:03 +00:00
Bill Wendling	080f40a49d	Reenable test. llvm-svn: 138916	2011-08-31 23:08:05 +00:00
Bill Wendling	54b91028f0	Revert accidental commit llvm-svn: 138915	2011-08-31 23:07:46 +00:00
Eli Friedman	c472975fd9	Disable this test until Bill fixes it properly. llvm-svn: 138914	2011-08-31 23:03:30 +00:00
Bill Wendling	5624fe0cff	Update to new EH scheme. llvm-svn: 138908	2011-08-31 21:50:07 +00:00
Bill Wendling	d4e871404d	Update more tests to the new EH scheme. llvm-svn: 138903	2011-08-31 21:39:05 +00:00
Eli Friedman	293c31b81c	Add tests for the transformations SCCP can do on atomic loads and stores (which are safe without any modifications). llvm-svn: 138902	2011-08-31 21:37:06 +00:00
Bill Wendling	e6174a2c85	Update more tests to the new EH scheme. llvm-svn: 138894	2011-08-31 21:04:11 +00:00
Bill Wendling	b1e680fd3f	Update the tests to the new EH scheme. llvm-svn: 138891	2011-08-31 20:55:40 +00:00
Rafael Espindola	a45c20b049	Remove the old tail duplication pass. It is not used and is unable to update ssa, so it has to be run really early in the pipeline. Any replacement should probably use the SSAUpdater. llvm-svn: 138841	2011-08-30 23:03:45 +00:00
Owen Anderson	b359367a3d	Remove empty file. llvm-svn: 138830	2011-08-30 21:17:20 +00:00
Owen Anderson	e316e5b2ad	Speculatively revert r138809 in an attempt to fix DragonEgg. llvm-svn: 138829	2011-08-30 21:11:06 +00:00
Owen Anderson	d708ec4c6a	When walking backwards to eliminate final stores to allocas at the end of a function, encountering an unrelated store should not cause us to give up like encountering a load does. llvm-svn: 138809	2011-08-30 18:51:55 +00:00
Bill Wendling	e79ce47ad7	Update tests to new EH model. Add landingpad instructions to landing pads. llvm-svn: 138759	2011-08-29 20:39:23 +00:00
Nadav Rotem	5fc81ffbac	Fixes following the CR by Chris and Duncan: Optimize chained bitcasts of the form A->B->A. Undo r138722 and change isEliminableCastPair to allow this case. llvm-svn: 138756	2011-08-29 19:58:36 +00:00
Nadav Rotem	52600ee8c3	Bitcasts are transitive. Bitcast-Bitcast-X becomes Bitcast-X. llvm-svn: 138722	2011-08-28 11:51:08 +00:00
Bill Wendling	ba198e661e	Auto upgrade the old EH scheme to use the new one. This is on a trial basis. If things to disasterously over night, this can be reverted. llvm-svn: 138702	2011-08-27 06:11:03 +00:00
Bill Wendling	eed1e8905a	Don't sink landingpad instructions during ind-var simplification. llvm-svn: 138651	2011-08-26 20:40:15 +00:00
Andrew Trick	07aeb629ec	Use %% for literals in RUN lines. llvm-svn: 138647	2011-08-26 20:09:48 +00:00
Benjamin Kramer	fb212a6309	SimplifyCFG: If we have a PHI node that can evaluate to NULL and do a load or store to the address returned by the PHI node then we can consider this incoming value as dead and remove the edge pointing there, unless there are instructions that can affect control flow executed in between. In theory this could be extended to other instructions, eg. division by zero, but it's likely that it will "miscompile" some code because people depend on div by zero not trapping. NULL pointer dereference usually leads to a crash so we should be on the safe side. This shrinks the size of a Release clang by 16k on x86_64. llvm-svn: 138618	2011-08-26 01:22:29 +00:00
Nadav Rotem	365af6f17b	Implement Constant::isAllOnesValue(). Fix ConstantFolding to use the new api. llvm-svn: 138469	2011-08-24 20:18:38 +00:00
Eric Christopher	7bc78f692c	Revert "Address Duncan's CR request:" This reverts commit 20a05be15ea5271ab6185b83200fa88263362400. (svn rev 138340) Conflicts: test/Transforms/InstCombine/bitcast.ll llvm-svn: 138366	2011-08-23 20:11:10 +00:00
Nadav Rotem	7d3effa389	Fix a typo in the test from the previous commit. llvm-svn: 138342	2011-08-23 17:56:54 +00:00
Nadav Rotem	c78e6607b5	Address Duncan's CR request: 1. Cleanup the tests in ConstantFolding.cpp 2. Implement isAllOnes for Constant, ConstantFP, ConstantVector llvm-svn: 138340	2011-08-23 17:48:43 +00:00
Dan Gohman	56e1cef705	Constant pointers to objects don't need reference counting. llvm-svn: 138242	2011-08-22 17:29:11 +00:00
Dan Gohman	bce94fded8	Make a few tests slightly more strict. llvm-svn: 138241	2011-08-22 17:27:02 +00:00
Nadav Rotem	ad4a70ad3e	Add constant folding support for bitcasts of splat vectors to integers. llvm-svn: 138206	2011-08-20 14:02:29 +00:00
Eric Christopher	8af4e41734	Add file. llvm-svn: 138083	2011-08-19 21:21:20 +00:00
Dan Gohman	b38940135b	Track a retain+release nesting level independently of the known-incremented level, because the two concepts can be used to prove the saftey of a retain+release removal in different ways. llvm-svn: 138016	2011-08-19 00:26:36 +00:00
Eli Friedman	9a468153e1	Atomic load/store handling for the passes using memdep (GVN, DSE, memcpyopt). llvm-svn: 137888	2011-08-17 22:22:24 +00:00
Eli Friedman	d7749be2d7	Silly mistake from r137777; restore significant isStructTy() checks. While here, be a bit more defensive with unknown instructions. Fixes PR10687. llvm-svn: 137836	2011-08-17 18:10:43 +00:00
Eli Friedman	e1df253200	An additional atomic test; related to r137662. llvm-svn: 137786	2011-08-16 23:29:17 +00:00
Eli Friedman	0793eb4c46	A bunch of misc fixes to SCCPSolver::ResolvedUndefsIn, including a fix to stop making random bad assumptions about instructions which are not explicitly listed. Includes fix for rdar://9956541, a version of "undef ^ undef should return 0 because it's easier than arguing with users". llvm-svn: 137777	2011-08-16 22:06:31 +00:00
Eli Friedman	56f2f21254	Minor bug in SCCP found by inspection. (I don't think it's possible to hit this with a normal pass pipeline, but fixing for completeness.) llvm-svn: 137755	2011-08-16 21:12:35 +00:00
Eli Friedman	ac992afd93	Fix test. llvm-svn: 137703	2011-08-16 01:42:56 +00:00
Eli Friedman	a917d4f9b4	Revert a bit of r137667; the logic in question can safely handle atomic load/store. llvm-svn: 137702	2011-08-16 01:28:22 +00:00
Eric Christopher	5403862ab7	Migrate this test from llvm/test/FrontendC++/ptr-to-method-devirt.cpp and FileCheckize. It is more properly an optimizer test. llvm-svn: 137700	2011-08-16 01:17:17 +00:00
Eli Friedman	0ffdf2ea0b	Update SimplifyCFG for atomic operations. This commit includes a mention of the landingpad instruction, but it's not changing the behavior around it. I think the current behavior is correct, though. Bill, can you double-check that? llvm-svn: 137691	2011-08-15 23:59:28 +00:00
Eli Friedman	01a67111d1	Add comments and test for atomic load/store and mem2reg. llvm-svn: 137690	2011-08-15 23:55:52 +00:00
Eli Friedman	8bc586e770	Update instcombine for atomic load/store. llvm-svn: 137664	2011-08-15 22:09:40 +00:00
Eli Friedman	91386c7be4	Atomic load/store support in LICM. llvm-svn: 137648	2011-08-15 20:52:09 +00:00
Eric Christopher	990dd3d0fb	Add an ipsccp test. Migrated from test/FrontendC++. llvm-svn: 137646	2011-08-15 20:50:36 +00:00
Nick Lewycky	746e317953	This transform is not safe. Thanks to Eli for pointing that out! llvm-svn: 137575	2011-08-14 04:51:49 +00:00
Nick Lewycky	ae13df60a6	Don't attempt to add 'nsw' when intermediate instructions had no such guarantee. llvm-svn: 137572	2011-08-14 03:41:33 +00:00
Nick Lewycky	de49278c26	Teach instcombine to preserve the nsw bit by doing an after-the-fact analysis when combining add and sub instructions. Patch by Pranav Bhandarkar! llvm-svn: 137570	2011-08-14 01:45:19 +00:00
Eli Friedman	02e737b08e	Move "atomic" and "volatile" designations on instructions after the opcode of the instruction. Note that this change affects the existing non-atomic load and store instructions; the parser now accepts both forms, and the change is noted in the release notes. llvm-svn: 137527	2011-08-12 22:50:01 +00:00
Dan Gohman	10a18d55ce	Don't convert objc_autoreleaseReturnValue to objc_autorelease if the result is returned through a bitcast. llvm-svn: 137402	2011-08-12 00:36:31 +00:00
Dan Gohman	121302772d	Don't let arbitrary calls disrupt nested retain+release pairs if the retains and releases all use the same SSA pointer value. Also, don't let CFG hazards disrupt nested retain+release pair optimizations. llvm-svn: 137399	2011-08-12 00:26:31 +00:00
Andrew Trick	2d8494a030	A slew of unit tests for the recent LoopInfo::updateUnloop feature checked in at r137276 and r137341. llvm-svn: 137385	2011-08-11 23:38:09 +00:00
Andrew Trick	2b6860f0a1	Allow loop unrolling to get known trip counts from ScalarEvolution. SCEV unrolling can unroll loops with arbitrary induction variables. It is a prerequisite for -disable-iv-rewrite performance. It is also easily handles loops of arbitrary structure including multiple exits and is generally more robust. This is under a temporary option to avoid affecting default behavior for the next couple of weeks. It is needed so that I can checkin unit tests for updateUnloop. llvm-svn: 137384	2011-08-11 23:36:16 +00:00
Andrew Trick	4d0040baf8	Invoke SimplifyIndVar when we partially unroll a loop. Fixes PR10534. llvm-svn: 137203	2011-08-10 04:29:49 +00:00
Andrew Trick	b72bbe2a92	Fix the LoopUnroller to handle nontrivial loops and partial unrolling. These are not individual bug fixes. I had to rewrite a good chunk of the unroller to make it sane. I think it was getting lucky on trivial completely unrolled loops with no early exits. I included some fairly simple unit tests for partial unrolling. I didn't do much stress testing, so it may not be perfect, but should be usable now. llvm-svn: 137190	2011-08-10 00:28:10 +00:00
Dan Gohman	b24a1d29cb	Tidy up these testcases to look more like real code does. llvm-svn: 137085	2011-08-09 00:33:11 +00:00
Andrew Trick	6d45a01b67	Made SCEV's UDiv expressions more canonical. When dividing a recurrence, the initial values low bits can sometimes be ignored. To take advantage of this, added FoldIVUser to IndVarSimplify to fold an IV operand into a udiv/lshr if the operator doesn't affect the result. -indvars -disable-iv-rewrite now transforms i = phi i4 i1 = i0 + 1 idx = i1 >> (2 or more) i4 = i + 4 into i = phi i4 idx = i0 >> ... i4 = i + 4 llvm-svn: 137013	2011-08-06 07:00:37 +00:00
Bill Wendling	2d3138c112	Remove the LowerSetJmp pass. It wasn't used effectively by any of the targets. This is some of my original LLVM code. wipes tear llvm-svn: 136821	2011-08-03 22:18:20 +00:00
Andrew Trick	bf69d03382	SCEV: Use AssertingVH to catch dangling BasicBlock* when passes forget to notify SCEV of a change. Add forgetLoop in a couple of those places. llvm-svn: 136797	2011-08-03 18:32:11 +00:00
Chris Lattner	5b82a0ac0c	fix PR10286, a problem with the .ll printer handling block addresses that are out-of-scope. llvm-svn: 136768	2011-08-03 06:15:41 +00:00
Nick Lewycky	50f4966ceb	Fix logical error when detecting lifetime intrinsics. Don't replace a gep/bitcast with 'undef' because that will form a "free(undef)" which in turn means "unreachable". What we wanted was a no-op. Instead, analyze the whole tree and look for all the instructions we need to delete first, then delete them second, not relying on the use_list to stay consistent. llvm-svn: 136752	2011-08-03 00:43:35 +00:00
Nick Lewycky	e8ae02dfb9	Teach InstCombine that lifetime intrincs aren't a real user on the result of a malloc call. llvm-svn: 136732	2011-08-02 22:08:01 +00:00
Nick Lewycky	99890a225f	Lifetime intrinsics on undef are dead. llvm-svn: 136722	2011-08-02 21:19:27 +00:00
Benjamin Kramer	c4189ff0fc	Remove empty test. llvm-svn: 136675	2011-08-02 02:47:45 +00:00
Owen Anderson	bddf40e082	Revert r136503 and r136480 in an effort to fix non-determinism in the llvm-gcc buildbots on i386. Devang is looking into the root cause. llvm-svn: 136674	2011-08-02 02:23:42 +00:00
Rafael Espindola	a3a44f3fc3	Add a small gep optimization I noticed was missing while reading some IL. llvm-svn: 136585	2011-07-31 04:43:41 +00:00
Devang Patel	3e02522fee	Clean up debug info after reassociation. llvm-svn: 136480	2011-07-29 19:00:35 +00:00
Eli Friedman	530341d748	Make sure to correctly clear the exact/nuw/nsw flags off of shifts when they are combined together. <rdar://problem/9859829> llvm-svn: 136435	2011-07-29 00:18:19 +00:00
Nick Lewycky	8ac9ecedfd	Teach the ConstantMerge pass about alignment. Fixes PR10514! llvm-svn: 136250	2011-07-27 19:47:34 +00:00
Nick Lewycky	15e2d90746	Finish adding support for lifetime intrinsics to SROA. Fixes PR10121! llvm-svn: 136008	2011-07-25 23:14:22 +00:00
Dan Gohman	6320f52ff4	Move the last uses of RetainFunc etc. over to using getRetainCallee() etc. so that a declaration for objc_retain is created when needed if it doesn't already exist. rdar://9825114. llvm-svn: 135821	2011-07-22 22:29:21 +00:00
Dan Gohman	e106aee6f5	Fix MergeInVectorType to check for vector types with the same alloc size but different element types, so that it filters out the cases that CreateShuffleVectorCast doesn't handle. This fixes rdar://9786827. llvm-svn: 135721	2011-07-21 23:30:09 +00:00
Andrew Trick	bd243d0dfe	LSR, correct fix for rdar://9786536. Silly casting bug. llvm-svn: 135654	2011-07-21 01:45:54 +00:00
Andrew Trick	858e9f083d	LSR must sometimes sign-extend before generating double constants. rdar://9786536 llvm-svn: 135650	2011-07-21 01:05:01 +00:00
Andrew Trick	8acb434402	LSR crashes on an empty IVUsers list. rdar://9786536 llvm-svn: 135644	2011-07-21 00:40:04 +00:00
Eli Friedman	0cdc148ab8	Bring LICM into compliance with the new "Memory Model for Concurrent Operations" in LangRef. llvm-svn: 135625	2011-07-20 21:37:47 +00:00
Andrew Trick	638b355a16	indvars: Added getInsertPointForUses to find a valid place to truncate the IV. llvm-svn: 135568	2011-07-20 05:32:06 +00:00
Andrew Trick	f53622e129	indvars test case for r135558. llvm-svn: 135559	2011-07-20 02:14:37 +00:00
Andrew Trick	c5dd3e976a	indvars -disable-iv-rewrite fix: derived GEP IVs llvm-svn: 135558	2011-07-20 02:08:58 +00:00
Eli Friedman	55d6ccbb79	PR10386: Don't try to split an edge from an indirectbr. llvm-svn: 135534	2011-07-19 22:59:41 +00:00
Nick Lewycky	56e99c7933	Remove bogus test: for all possible inputs of %X, the 'sub nsw' is guaranteed to perform a signed wrap. Don't rely on any particular handling of that case. llvm-svn: 135471	2011-07-19 08:22:57 +00:00
Andrew Trick	7da2417c8a	indvars: LinearFunctionTestReplace for non-canonical IVs. For -disable-iv-rewrite, perform LFTR without generating a new "canonical" induction variable. Instead find the "best" existing induction variable for use in the loop exit test and compute the final value of that IV for use in the new loop exit test. In short, convert to a simple eq/ne exit test as long as it's cheap to do so. llvm-svn: 135420	2011-07-18 20:32:31 +00:00
Chad Rosier	c1e40f8d26	A real testcase for r135286. llvm-svn: 135299	2011-07-15 20:58:38 +00:00
Chad Rosier	b45111556d	Add testcase for r135286. llvm-svn: 135291	2011-07-15 19:06:58 +00:00
Evan Cheng	4a40a747ba	Change test case, one that actually failed before my commit. llvm-svn: 135064	2011-07-13 19:19:44 +00:00
Evan Cheng	b94674b325	It's not safe to fold (fptrunc (sqrt (fpext x))) to (sqrtf x) if there is another use of sqrt. rdar://9763193 llvm-svn: 135058	2011-07-13 19:08:16 +00:00
Rafael Espindola	403256763f	Don't duplicate the work done by a gep into a "bitcast" if the gep has more than one use. Fixes PR10322. llvm-svn: 134883	2011-07-11 03:43:47 +00:00
Chris Lattner	b1ed91f397	Land the long talked about "type system rewrite" patch. This patch brings numerous advantages to LLVM. One way to look at it is through diffstat: 109 files changed, 3005 insertions(+), 5906 deletions(-) Removing almost 3K lines of code is a good thing. Other advantages include: 1. Value::getType() is a simple load that can be CSE'd, not a mutating union-find operation. 2. Types a uniqued and never move once created, defining away PATypeHolder. 3. Structs can be "named" now, and their name is part of the identity that uniques them. This means that the compiler doesn't merge them structurally which makes the IR much less confusing. 4. Now that there is no way to get a cycle in a type graph without a named struct type, "upreferences" go away. 5. Type refinement is completely gone, which should make LTO much MUCH faster in some common cases with C++ code. 6. Types are now generally immutable, so we can use "Type " instead "const Type " everywhere. Downsides of this patch are that it removes some functions from the C API, so people using those will have to upgrade to (not yet added) new API. "LLVM 3.0" is the right time to do this. There are still some cleanups pending after this, this patch is large enough as-is. llvm-svn: 134829	2011-07-09 17:41:24 +00:00
Chris Lattner	2522d2df06	more tests not making the jump into the brave new world. llvm-svn: 134820	2011-07-09 16:57:10 +00:00
Lang Hames	c5c191b0a4	Added test cases for GVN signed intrinsics recognition, r134777. llvm-svn: 134778	2011-07-09 00:36:54 +00:00
Lang Hames	29cd98fd52	Make GVN look through extractvalues for recognised intrinsics. GVN can then CSE ops that match values produced by the intrinsics. llvm-svn: 134677	2011-07-08 01:50:54 +00:00
Andrew Trick	3239055dee	indvars -disable-iv-rewrite: Added SimplifyCongruentIVs. llvm-svn: 134530	2011-07-06 20:50:43 +00:00
Tobias Grosser	4a5d9a9c20	LICM: Do not loose alignment on promotion The promotion code lost any alignment information, when hoisting loads and stores out of the loop. This lead to incorrect aligned memory accesses. We now use the largest alignment we can prove to be correct. llvm-svn: 134520	2011-07-06 19:19:55 +00:00
Jakub Staszak	3f158fdf6e	Introduce "expect" intrinsic instructions. llvm-svn: 134516	2011-07-06 18:22:43 +00:00
Benjamin Kramer	9eca5feff1	PR10267: Don't combine an equality compare with an AND into an inequality compare when the AND has more than one use. This can pessimize code, inequalities are generally more expensive. llvm-svn: 134379	2011-07-04 20:16:36 +00:00
Andrew Trick	6d12309475	indvars -disable-iv-rewrite: bug fix involving weird geps and related cleanup. llvm-svn: 134306	2011-07-02 02:34:25 +00:00
Dan Gohman	54664ed714	Improve constant folding of undef for cmp and select operators. llvm-svn: 134223	2011-07-01 01:03:43 +00:00
Dan Gohman	ca8d9e1341	Improve constant folding of undef for binary operators. llvm-svn: 134221	2011-07-01 00:42:17 +00:00
Rafael Espindola	b10a0f223a	Add r134057 back, but splice the predecessor after the successors phi nodes. Original message: Let simplify cfg simplify bb with only debug and lifetime intrinsics. llvm-svn: 134182	2011-06-30 20:14:24 +00:00
Andrew Trick	efe89ad414	indvars -disable-iv-rewrite: handle cloning binary operators that cannot overflow. llvm-svn: 134177	2011-06-30 19:02:17 +00:00
Andrew Trick	cc68605353	indvars -disable-iv-rewrite: handle an edge case involving identity phis. llvm-svn: 134124	2011-06-30 01:27:23 +00:00
Andrew Trick	ecdd6e4c67	indvars -disable-iv-rewrite: insert new trunc instructions carefully. llvm-svn: 134112	2011-06-29 23:03:57 +00:00
Chad Rosier	96ed721d9b	Temporarily revert r134057: "Let simplify cfg simplify bb with only debug and lifetime intrinsics" due to buildbot failures. llvm-svn: 134071	2011-06-29 16:22:11 +00:00
Rafael Espindola	4c0dfcec7e	Let simplify cfg simplify bb with only debug and lifetime intrinsics. llvm-svn: 134057	2011-06-29 05:25:47 +00:00
Andrew Trick	efe2b1963d	indvars -disable-iv-rewrite: just because SCEV ignores casts doesn't mean they can be removed. llvm-svn: 134054	2011-06-29 03:13:40 +00:00
Andrew Trick	9083ef1918	FileCheckify and prepare for -disable-iv-rewrite. llvm-svn: 133998	2011-06-28 06:34:10 +00:00
Nick Lewycky	a61df3f843	Teach one piece of scalarrepl to handle lifetime markers. When transforming an alloca that only holds a copy of a global and we're going to replace the users of the alloca with that global, just nuke the lifetime intrinsics. Part of PR10121. llvm-svn: 133905	2011-06-27 05:40:02 +00:00
Eli Friedman	2c980fafff	PR10180: Fix a instcombine crash with FP vectors. llvm-svn: 133756	2011-06-23 20:40:23 +00:00
Jay Foad	165910fa6d	Add a reduced test case for the buildbot failure (clang self-hosted build) caused by r133435. llvm-svn: 133509	2011-06-21 08:33:49 +00:00
Andrew Trick	69d4452f2e	indvars -disable-iv-rewrite: Adds support for eliminating identity ops. This is a rewrite of the IV simplification algorithm used by -disable-iv-rewrite. To avoid perturbing the default mode, I temporarily split the driver and created SimplifyIVUsersNoRewrite. The idea is to avoid doing opcode/pattern matching inside IndVarSimplify. SCEV already does it. We want to optimize with the full generality of SCEV, but optimize def-use chains top down on-demand rather than rewriting the entire expression bottom-up. This was easy to do for operations that SCEV can prove are identity function. So we're now eliminating bitmasks and zero extends this way. A result of this rewrite is that indvars -disable-iv-rewrite no longer requires IVUsers. llvm-svn: 133502	2011-06-21 03:22:38 +00:00
Jay Foad	29ed2e3bdc	This is an automatically reduced test case that crashed in GVN, at some point during the development of the phi operand changes. llvm-svn: 133436	2011-06-20 14:46:47 +00:00
Chris Lattner	8936d2bfbc	Remove support for parsing the "type i32" syntax for defining a numbered top level type without a specified number. This syntax isn't documented and blocks forward progress. llvm-svn: 133371	2011-06-19 00:03:46 +00:00
Hans Wennborg	4ab4a8e63a	Fix PR10103: Less code for enum type translation. In cases such as the attached test, where the case value for a switch destination is used in a phi node that follows the destination, it might be better to replace that value with the condition value of the switch, so that more blocks can be folded away with TryToSimplifyUncondBranchFromEmptyBlock because there are less conflicts in the phi node. llvm-svn: 133344	2011-06-18 10:28:47 +00:00
Nick Lewycky	4c94631505	Add test for r133251. llvm-svn: 133339	2011-06-18 07:23:25 +00:00
Cameron Zwarich	9601ddb2f3	When scalar replacement returns a vector type, only accept it if the vector type's bitwidth matches the (allocated) size of the alloca. This severely pessimizes vector scalar replacement when the only vector type being used is something like <3 x float> on x86 or ARM whose allocated size matches a <4 x float>. I hope to fix some of the flawed assumptions about allocated size throughout scalar replacement and reenable this in most cases. llvm-svn: 133338	2011-06-18 06:17:51 +00:00
Chris Lattner	80ed9dc9e5	rip out a ton of intrinsic modernization logic from AutoUpgrade.cpp, which is for pre-2.9 bitcode files. We keep x86 unaligned loads, movnt, crc32, and the target indep prefetch change. As usual, updating the testsuite is a PITA. llvm-svn: 133337	2011-06-18 06:05:24 +00:00
Cameron Zwarich	2a26100c87	Fix an invalid bitcast crash that occurs when doing a partial memset of a vector alloca. Fixes part of <rdar://problem/9580800>. llvm-svn: 133336	2011-06-18 05:47:49 +00:00
Chris Lattner	6bc5c89093	Stop accepting and ignoring attributes in function types. Attributes are applied to functions and call/invokes, not to types. llvm-svn: 133266	2011-06-17 17:37:13 +00:00
Chris Lattner	5756c16cdf	make the asmparser reject function and type redefinitions. 'Merging' hasn't been needed since llvm-gcc 3.4 days. llvm-svn: 133248	2011-06-17 07:06:44 +00:00
Chris Lattner	59345c8b65	remove asmparser support for the old getresult instruction, which has been subsumed by extractvalue. llvm-svn: 133247	2011-06-17 06:57:15 +00:00
Chris Lattner	33de427cd6	remove parser support for the obsolete "multiple return values" syntax, which was replaced with return of a "first class aggregate". llvm-svn: 133245	2011-06-17 06:49:41 +00:00
Chris Lattner	4649a73cc3	stop accepting begin/end around function bodies in the .ll parser, this isn't pascal anymore. llvm-svn: 133244	2011-06-17 06:42:57 +00:00
Chris Lattner	def1949c00	Remove support for using "foo" as symbols instead of %"foo". This is ancient syntax and has been long obsolete. As usual, updating the tests is the nasty part of this. llvm-svn: 133242	2011-06-17 06:36:20 +00:00
Chris Lattner	b90ed2233c	manually upgrade a bunch of tests to modern syntax, and remove some that are either unreduced or only test old syntax. llvm-svn: 133228	2011-06-17 03:14:27 +00:00
Dan Gohman	00fa9634d5	Fix ARCOpt to insert releases on both successors of an invoke rather than trying to insert them immediately after the invoke. llvm-svn: 133188	2011-06-16 20:57:14 +00:00
John McCall	d935e9c359	The ARC language-specific optimizer. Credit to Dan Gohman. llvm-svn: 133108	2011-06-15 23:37:01 +00:00
Stuart Hastings	351a3f881f	Avoid fusing bitcasts with dynamic allocas if the amount-to-allocate might overflow. Re-typing the alloca to a larger type (e.g. double) hoists a shift into the alloca, potentially exposing overflow in the expression. rdar://problem/9265821 llvm-svn: 132926	2011-06-13 18:48:49 +00:00
Benjamin Kramer	c970849ea0	InstCombine: Fold A-b == C --> b == A-C if A and C are constants. The backend already knew this trick. llvm-svn: 132915	2011-06-13 15:24:24 +00:00
Benjamin Kramer	91f914ce21	InstCombine: Shrink ((zext X) & C1) == C2 to fold away the cast if the "zext" and the "and" have one use. llvm-svn: 132897	2011-06-12 22:48:00 +00:00
Benjamin Kramer	35159c114c	Simplify code. No functionality changes, name changes aside. llvm-svn: 132896	2011-06-12 22:47:53 +00:00
John McCall	fc1ca36866	SplitCriticalEdge can sometimes split the edge from an invoke to a landing pad, separating the exception and selector calls from the new lpad. Teaching it not to do that, or to properly adjust the CFG afterwards, is out of scope because it would require the other edges to the landing pad to be split as well (effectively). Instead, just recover from the most likely cases during inlining. The best long-term solution is to change the exception representation and commit to either requiring or not requiring the more complex edge-splitting logic; this is just a shorter-term hack. llvm-svn: 132799	2011-06-09 20:06:24 +00:00
Cameron Zwarich	77a699a829	Fix PR10104 by adding a bounds check on a vector element access check. It was assuming that all offsets are legal vector accesses, and thus trying to access the float member of { <2 x float>, float } as the 3rd element of the first member. llvm-svn: 132766	2011-06-09 01:45:33 +00:00
Cameron Zwarich	c3b1cc9aca	Fix an assymmetry between ConvertScalar_ExtractValue and ConvertScalar_InsertValue. The former was using the size of the entire alloca, whereas the latter was correctly using the allocated size of the immediate type being converted (which may differ from the size of the alloca). This fixes PR10082. llvm-svn: 132759	2011-06-08 22:08:31 +00:00
Nick Lewycky	40b4e80ce8	This directory was missing the dg.exp to cause the tests to run. Some time since it was added, the test has regressed, so XFAIL it. llvm-svn: 132686	2011-06-06 20:23:00 +00:00
Bill Wendling	4f163dfed1	If the block that we're threading through is jumped to by an indirect branch, then we don't want to set the destination in the indirect branch to the destination. This is because the indirect branch needs its destinations to have had their block addresses taken. This isn't so of the new critical edge that's split during this process. If it turns out that the destination block has only one predecessor, and that being a BB with an indirect branch, then it won't be marked as 'used' and may be removed. PR10072 llvm-svn: 132638	2011-06-04 09:42:04 +00:00
Dan Gohman	baf1afb289	Add a testcase to demonstrate the problem where phi translation is ignored for clobbering partial-alias loads. llvm-svn: 132633	2011-06-04 07:05:05 +00:00
Dan Gohman	a471751c24	Disable the main feature of 130180, the elimination of loads that are redundant with partially-aliasing loads. When computing what portion of a clobbering load value is needed, it doesn't consider phi-translation which may have occurred between the clobbing load and the redundant load. llvm-svn: 132631	2011-06-04 06:48:50 +00:00
Nick Lewycky	611582401f	Bail on unswitching a switch statement for a case with a critical edge. We name which edge to split by pred/succ pair, which means that we can end up splitting the wrong edge (by case value) in the switch statement entirely. Fixes PR10031! llvm-svn: 132535	2011-06-03 06:27:15 +00:00
Andrew Trick	443332deca	Test case pasto (failed when run with IR verifier). llvm-svn: 132516	2011-06-02 23:57:27 +00:00
Eli Friedman	5da0ff41d7	PR10067: Add missing safety check to call return transformation in MemCpyOpt::processStore. If something accesses the dest of the "copy" between the call and the copy, the performCallSlotOptzn transformation is not valid. llvm-svn: 132485	2011-06-02 21:24:42 +00:00
Eli Friedman	b576b1675c	When marking a block as being unanalyzable, use "Clobber" on the terminator instead of the first instruction in the block. This is a bit of a hack; "Clobber" isn't really the right marking in the first place. memdep doesn't really have any way of properly expressing "unanalyzable" at the moment. Using it on the terminator is much less ambiguous than using it on an arbitrary instruction, though. In the given testcase, the "Clobber" was pointing to a load, and GVN was incorrectly assuming that meant that the "Clobber" load overlapped the load being analyzed (when they are actually unrelated). The included testcase tests both this commit and r132434. Part two of rdar://9429882. (r132434 was mislabeled.) llvm-svn: 132442	2011-06-02 00:08:52 +00:00
Stuart Hastings	2380483355	Reapply 132348 with fixes. rdar://problem/6501862 llvm-svn: 132402	2011-06-01 16:42:47 +00:00
John McCall	fca7786267	First, do no harm -- even if we can't find a selector for an enclosing landing pad, forward llvm.eh.resume calls to it instead of turning them invalidly into invokes. llvm-svn: 132382	2011-06-01 02:17:11 +00:00
Andrew Trick	812276eed4	scev: Better sign-extend removal. Normalize postincrement recurrences so that their sign extended forms are congruent when no overflow occurs. llvm-svn: 132360	2011-05-31 21:17:47 +00:00
Stuart Hastings	9d6a06d536	Revert to pacify a buildbot. rdar://problem/6501862 llvm-svn: 132351	2011-05-31 19:56:35 +00:00
Stuart Hastings	780f723309	Followup to 132316; accept arbitrary constants, add with a constant, sub with a non-constant. Fix comments, enlarge test case. rdar://problem/6501862 llvm-svn: 132348	2011-05-31 19:29:55 +00:00
Stuart Hastings	8284374b07	(1 - X) * (-2) -> (x - 1) * 2, for all positive nonzero powers of 2 rdar://problem/6501862 llvm-svn: 132316	2011-05-30 20:00:33 +00:00
John McCall	f19cf99097	Add the test case for phis in the outer landing pad during the inliner's forwarding of eh.resume that I promised yesterday. llvm-svn: 132307	2011-05-30 01:08:04 +00:00
Nick Lewycky	63353933c6	Add testcase for r132290, to check for the crasher caught by the buildbots doing llvm-gcc selfhost (or cross). llvm-svn: 132292	2011-05-29 19:41:14 +00:00
Nick Lewycky	a3bb03e400	Obey the isVolatile bit on memory intrinsics when analyzing uses of a global variable. Noticed by inspection. Simulate memset in EvaluateFunction where the target of the memset and the value we're setting are both the null value. Fixes PR10047! llvm-svn: 132288	2011-05-29 18:41:56 +00:00
Benjamin Kramer	fd53a27f99	ConstantFoldInstOperands doesn't like compares, hand it off to instsimplify instead. Fixes PR10040. llvm-svn: 132254	2011-05-28 10:16:58 +00:00
John McCall	046c47e970	Implement and document the llvm.eh.resume intrinsic, which is transformed by the inliner into a branch to the enclosing landing pad (when inlined through an invoke). If not so optimized, it is lowered DWARF EH preparation into a call to _Unwind_Resume (or _Unwind_SjLj_Resume as appropriate). Its chief advantage is that it takes both the exception value and the selector value as arguments, meaning that there is zero effort in recovering these; however, the frontend is required to pass these down, which is not actually particularly difficult. Also document the behavior of landing pads a bit better, and make it clearer that it's okay that personality functions don't always land at landing pads. This is just a fact of life. Don't write optimizations that rely on pushing things over an unwind edge. llvm-svn: 132253	2011-05-28 07:45:59 +00:00
John McCall	bd04b74bb2	Fix the inliner to maintain the current de facto invoke semantics: - the selector for the landing pad must provide all available information about the handlers, filters, and cleanups within that landing pad - calls to _Unwind_Resume must be converted to branches to the enclosing lpad so as to avoid re-entering the unwinder when the lpad claimed it was going to handle the exception in some way This is quite specific to libUnwind-based unwinding. In an effort to not interfere too badly with other unwinders, and with existing hacks in frontends, this only triggers on _Unwind_Resume (not _Unwind_Resume_or_Rethrow) and does nothing with selectors if it cannot find a selector call for either lpad. llvm-svn: 132200	2011-05-27 18:34:38 +00:00
Benjamin Kramer	749ef5f420	InstCombine: Make switch folding with equality compares more aggressive by trying instsimplify on the arm where we know the compared value. Stuff like "x == y ? y : x&y" now folds into "x&y". llvm-svn: 132185	2011-05-27 13:00:16 +00:00
Chad Rosier	b362884ca9	Renamed llvm.x86.sse42.crc32 intrinsics; crc64 doesn't exist. crc32.[8\|16\|32] have been renamed to .crc32.32.[8\|16\|32] and crc64.[8\|16\|32] have been renamed to .crc32.64.[8\|64]. llvm-svn: 132163	2011-05-26 23:13:19 +00:00
Andrew Trick	7fac79e255	indvars: incremental fixes for -disable-iv-rewrite and testcases. Use a proper worklist for use-def traversal without holding onto an iterator. Now that we process all IV uses, we need complete logic for resusing existing derived IV defs. See HoistStep. llvm-svn: 132103	2011-05-26 00:46:11 +00:00
Eli Friedman	865866e7fe	PR9998: ashr exact %x, 31 is not equivalent to sdiv exact %x, -2147483648. llvm-svn: 132097	2011-05-25 23:26:20 +00:00
Andrew Trick	eb3c36e69c	indvars: fixed IV cloning in -disable-iv-rewrite mode with associated cleanup and overdue test cases. llvm-svn: 132038	2011-05-25 04:42:22 +00:00
Cameron Zwarich	d7707fc911	Fix "make check" in Release by removing debug-only options from an 'opt' invocation. llvm-svn: 131972	2011-05-24 18:26:09 +00:00
Cameron Zwarich	843bc7d673	Make LoadAndStorePromoter preserve debug info and create llvm.dbg.values when promoting allocas to SSA variables. Fixes <rdar://problem/9479036>. llvm-svn: 131953	2011-05-24 03:10:43 +00:00
Andrew Trick	37f0082804	FileCheck-ize a couple of IV unit tests. llvm-svn: 131946	2011-05-24 01:02:49 +00:00
Andrew Trick	1ea0243bd0	Test case for r130799 - indvars: Added canExpandBackEdgeTakenCount. llvm-svn: 131939	2011-05-24 00:17:53 +00:00
Chris Lattner	026f5e61f0	fix a really nasty basicaa mod/ref calculation bug that was causing miscompilation of UnitTests/ObjC/messages-2.m with the recent optimizer improvements. llvm-svn: 131897	2011-05-23 05:15:43 +00:00
Chris Lattner	8aff4f8efc	Transform any logical shift of a power of two into an exact/NUW shift when in a known-non-zero context. llvm-svn: 131887	2011-05-23 00:21:50 +00:00
Chris Lattner	83791ced7b	Teach valuetracking that byval arguments with a specified alignment are aligned. Teach memcpyopt to not give up all hope when confonted with an underaligned memcpy feeding an overaligned byval. If the source of the memcpy can be determined to be adequeately aligned, or if it can be forced to be, we can eliminate the memcpy. This addresses PR9794. We now compile the example into: define i32 @f(%struct.p* nocapture byval align 8 %q) nounwind ssp { entry: %call = call i32 @g(%struct.p* byval align 8 %q) nounwind ret i32 %call } in both x86-64 and x86-32 mode. We still don't get a tailcall though, because tailcalls apparently can't handle byval. llvm-svn: 131884	2011-05-23 00:03:39 +00:00
Chris Lattner	713d52364f	implement PR9315, constant folding exp2 in terms of pow (since hosts without C99 runtimes don't have exp2). llvm-svn: 131872	2011-05-22 22:22:35 +00:00
Chris Lattner	7c99f19d9f	Carve out a place in instcombine to put transformations which work knowing that their result is non-zero. Implement an example optimization (PR9814), which allows us to transform: A / ((1 << B) >>u 2) into: A >>u (B-2) which we compile into: _divu3: ## @divu3 leal -2(%rsi), %ecx shrl %cl, %edi movl %edi, %eax ret instead of: _divu3: ## @divu3 movb %sil, %cl movl $1, %esi shll %cl, %esi shrl $2, %esi movl %edi, %eax xorl %edx, %edx divl %esi, %eax ret llvm-svn: 131860	2011-05-22 18:18:41 +00:00
Chris Lattner	c4ca7ab7e7	Fix PR9815: I was trying to get out of "generating code and then failing to form a memset, then having to delete it" but my approximation isn't safe for self recurrent loops. Instead of doign a hack, just do it the right way. llvm-svn: 131858	2011-05-22 17:39:56 +00:00
Frits van Bommel	ad964559ef	Add a parameter to ConstantFoldTerminator() that callers can use to ask it to also clean up the condition of any conditional terminator it folds to be unconditional, if that turns the condition into dead code. This just means it calls RecursivelyDeleteTriviallyDeadInstructions() in strategic spots. It defaults to the old behavior. I also changed -simplifycfg, -jump-threading and -codegenprepare to use this to produce slightly better code without any extra cleanup passes (AFAICT this was the only place in -simplifycfg where now-dead conditions of replaced terminators weren't being cleaned up). The only other user of this function is -sccp, but I didn't read that thoroughly enough to figure out whether it might be holding pointers to instructions that could be deleted by this. llvm-svn: 131855	2011-05-22 16:24:18 +00:00
Chris Lattner	1a1acc2191	fix PR9856, an incorrectly conservative assertion: a global can be "stored once" even if its address is compared. llvm-svn: 131849	2011-05-22 07:15:13 +00:00
Chris Lattner	f0d59072de	fix PR9841 by having GVN not process dead loads. This was causing it to get into infinite loops when it would widen a load (which can necessarily leave around dead loads). llvm-svn: 131847	2011-05-22 07:03:34 +00:00
Chris Lattner	a10327f531	remove a trivial test, make some other tests less trivial. llvm-svn: 131846	2011-05-22 07:02:43 +00:00
Chris Lattner	cc87723178	make this test less trivial. llvm-svn: 131845	2011-05-22 06:59:33 +00:00
Nick Lewycky	d60e135cfe	Commit test change, forgotten as part of r131838. llvm-svn: 131839	2011-05-22 05:31:47 +00:00
Nick Lewycky	a68ec83b36	Teach the inliner to emit llvm.lifetime.start/end, to scope the local variables of the inlinee to the code representing the original function. llvm-svn: 131838	2011-05-22 05:22:10 +00:00
Nick Lewycky	1c8af13719	Fix grammar in test. llvm-svn: 131831	2011-05-22 01:16:00 +00:00
Benjamin Kramer	fda5dc4968	Revert "InstCombine: Turn mul.with.overflow(X, 2) into the cheaper add.with.overflow(X, X)" It's better to do this in codegen, mul.with.overflow(X, 2) is more canonical because it has only one use on "X". llvm-svn: 131798	2011-05-21 18:31:42 +00:00
Benjamin Kramer	691731eb9c	InstCombine: Turn mul.with.overflow(X, 2) into the cheaper add.with.overflow(X, X) llvm-svn: 131789	2011-05-21 09:22:06 +00:00
Evan Cheng	e8d2e9eb35	Revert r131664 and fix it in instcombine instead. rdar://9467055 llvm-svn: 131708	2011-05-20 00:54:37 +00:00
Stuart Hastings	ae012a7525	Move test to Transforms/InstCombine. llvm-svn: 131634	2011-05-19 05:53:22 +00:00
Rafael Espindola	3f60a0b411	Add test for PR9946. llvm-svn: 131621	2011-05-19 02:35:26 +00:00
Eli Friedman	41e509a33d	More instcombine cleanup, towards improving debug line info. llvm-svn: 131604	2011-05-18 23:58:37 +00:00
Dan Gohman	3268e4d692	When forming an ICmpZero LSRUse, normalize the non-IV operand of the comparison, so that the resulting expression is fully normalized. This fixes PR9939. llvm-svn: 131576	2011-05-18 21:02:18 +00:00
Eli Friedman	49346010f8	More instcombine cleanup aimed towards improving debug line info. llvm-svn: 131559	2011-05-18 19:57:14 +00:00
Eli Friedman	96254a0d53	Start trying to make InstCombine preserve more debug info. The idea here is to set the debug location on the IRBuilder, which will be then right location in most cases. This should magically give many transformations debug locations, and fixing places which are missing a debug location will usually just means changing the code creating it to use the IRBuilder. As an example, the change to InstCombineCalls catches a common case where a call to a bitcast of a function is rewritten. Chris, does this approach look reasonable? llvm-svn: 131516	2011-05-18 01:28:27 +00:00
Stuart Hastings	a7ae4552af	Drop lli, revise test. llvm-svn: 131452	2011-05-17 02:38:59 +00:00
Rafael Espindola	2050af838d	Don't do tail calls in a function that call setjmp. The stack might be corrupted when setjmp returns again. llvm-svn: 131399	2011-05-16 03:05:33 +00:00
Benjamin Kramer	cb7e56e592	Disable test harder. llvm-svn: 131363	2011-05-14 19:30:39 +00:00
Stuart Hastings	3c2fd1cf62	Disable this test while I revise it. rdar://problem/9267970 llvm-svn: 131350	2011-05-14 18:39:05 +00:00
Benjamin Kramer	d96205c4e5	SimplifyCFG: Use ComputeMaskedBits to prune dead cases from switch instructions. llvm-svn: 131345	2011-05-14 15:57:25 +00:00
Stuart Hastings	66a82b966e	Avoid combining GEPs that might overflow at runtime. rdar://problem/9267970 Patch by Julien Lerouge! llvm-svn: 131339	2011-05-14 05:55:10 +00:00
Duncan Sands	af32728a57	The comparision "max(x,y)==x" is equivalent to "x>=y". Since the max is often expressed as "x >= y ? x : y", there is a good chance we can extract the existing "x >= y" from it and use that as a replacement for "max(x,y)==x". llvm-svn: 131049	2011-05-07 16:56:49 +00:00
Galina Kistanova	a335f5aeeb	Move few target-dependant tests to appropriate directories. llvm-svn: 131002	2011-05-06 18:24:46 +00:00
Duncan Sands	a071c82900	Fix PR9820: a read-only call differs from a load in that a load doesn't return the pointer being dereferenced, it returns the pointee, but a call might return the pointer itself. llvm-svn: 130979	2011-05-06 10:30:37 +00:00
Eli Friedman	8a20e66926	PR9838: Fix transform introduced in r127064 to not trigger when only one side of the icmp is an exact shift. llvm-svn: 130954	2011-05-05 21:59:18 +00:00
Duncan Sands	a228785526	Add variations on: max(x,y) >= min(x,z) folds to true. This isn't that common, but according to my super-optimizer there are only two missed simplifications of -instsimplify kind when compiling bzip2, and this is one of them. It amuses me to have bzip2 be perfectly optimized as far as instsimplify goes! llvm-svn: 130840	2011-05-04 16:05:05 +00:00
Duncan Sands	0a9c1246d7	Implement some basic simplifications involving min/max, for example max(a,b) >= a -> true. According to my super-optimizer, these are by far the most common simplifications (of the -instsimplify kind) that occur in the testsuite and aren't caught by -std-compile-opts. llvm-svn: 130780	2011-05-03 19:53:10 +00:00
Duncan Sands	f91c5ab341	Fix PR9579: when simplifying a compare to "true" or "false", and it was a vector compare, generate a vector result rather than i1 (and crashing). llvm-svn: 130706	2011-05-02 18:51:41 +00:00
Duncan Sands	a3e3699c88	Move some rem transforms out of instcombine and into instsimplify. This automagically provides a transform noticed by my super-optimizer as occurring quite often: "rem x, (select cond, x, 1)" -> 0. llvm-svn: 130694	2011-05-02 16:27:02 +00:00
Benjamin Kramer	9aa91b1f4e	InstCombine: Turn (zext A) udiv (zext B) into (zext (A udiv B)). Same for urem or constant B. This obviously helps a lot if the division would be turned into a libcall (think i64 udiv on i386), but div is also one of the few remaining instructions on modern CPUs that become more expensive when the bitwidth gets bigger. This also helps register pressure on i386 when dividing chars, divb needs two 8-bit parts of a 16 bit register as input where divl uses two registers. int foo(unsigned char a) { return a/10; } int bar(unsigned char a, unsigned char b) { return a/b; } compiles into (x86_64) _foo: imull $205, %edi, %eax shrl $11, %eax ret _bar: movzbl %dil, %eax divb %sil, %al movzbl %al, %eax ret llvm-svn: 130615	2011-04-30 18:16:07 +00:00
Benjamin Kramer	57b3df59b9	Use SimplifyDemandedBits on div instructions. This folds away silly stuff like (a&255)/1000 -> 0. llvm-svn: 130614	2011-04-30 18:16:00 +00:00
Benjamin Kramer	6a50bbd284	FileCheckize. llvm-svn: 130613	2011-04-30 18:15:53 +00:00
Peter Collingbourne	616044acd5	SimplifyCFG: Expose phi node folding cost threshold as command line parameter llvm-svn: 130528	2011-04-29 18:47:38 +00:00
Peter Collingbourne	e3511e15e0	SimplifyCFG: Add CostRemaining parameter to DominatesMergePoint llvm-svn: 130527	2011-04-29 18:47:31 +00:00
Peter Collingbourne	61f6602acd	SimplifyCFG: Add Trunc, ZExt and SExt to the list of cheap instructions for phi node folding llvm-svn: 130526	2011-04-29 18:47:25 +00:00
Benjamin Kramer	16f18ed7b5	InstCombine: turn (C1 << A) << C2) into (C1 << C2) << A) Fixes PR9809. llvm-svn: 130485	2011-04-29 08:15:41 +00:00
Chris Lattner	1777601a74	final step needed to resolve PR6627, which allows us to flatten the code down to a nice and tidy: %x1 = load i32* %0, align 4 %1 = icmp eq i32 %x1, 1179403647 br i1 %1, label %if.then, label %if.end instead of doing lots of loads and branches. May the FreeBSD bootloader long fit in its allocated space. llvm-svn: 130416	2011-04-28 18:15:47 +00:00
Benjamin Kramer	4145c0d3b1	InstCombine: Merge "(trunc x) == C1 & (and x, CA) == C2" into a single and+icmp. This happens when GVN widens loads. Part of PR6627. llvm-svn: 130405	2011-04-28 16:58:40 +00:00
Chris Lattner	827a270a2a	teach GVN to widen integer loads when they are overaligned, when doing an wider load would allow elimination of subsequent loads, and when the wider load is still a native integer type. This eliminates a ton of loads on various benchmarks involving struct fields, though it is somewhat hobbled by clang not being very aggressive about field alignment. This is yet another step along the way towards resolving PR6627. llvm-svn: 130390	2011-04-28 07:29:08 +00:00
Andrew Trick	29ac7b8858	Fixes PR9730: indvars: An asserting value handle still pointed to this value Modified LinearFunctionTestReplace to push the condition on the dead list instead of eagerly deleting it. This can cause unnecessary IV rewrites, which should have no effect on codegen and will not be an issue once we stop generating canonical IVs. llvm-svn: 130340	2011-04-27 23:00:03 +00:00
Devang Patel	12bf0ab4b5	Simplify cfg inserts a call to trap when unreachable code is detected. Assign DebugLoc to this new trap instruction. llvm-svn: 130315	2011-04-27 17:59:27 +00:00
Chris Lattner	6b96621a8a	remove support for llvm.invariant.end from memdep. It is a work-in-progress that is not progressing, and it has issues. llvm-svn: 130247	2011-04-26 21:50:51 +00:00
Chris Lattner	029afe4787	make a couple of changes to the standard pass pipeline: 1. Only run the early (in the module pass pipe) instcombine/simplifycfg if the "unit at a time" passes they are cleaning up after runs. 2. Move the "clean up after the unroller" pass to the very end of the function-level pass pipeline. Loop unroll uses instsimplify now, so it doesn't create a ton of trash. Moving instcombine later allows it to clean up after opportunities are exposed by GVN, DSE, etc. 3. Introduce some phase ordering tests for things that are specifically intended to be simplified by the full optimizer as a whole. This resolves PR2338, and is progress towards PR6627, which will be generating code that looks similar to test2. llvm-svn: 130241	2011-04-26 20:45:33 +00:00
Chris Lattner	1b06c71668	Transform: "icmp eq (trunc (lshr(X, cst1)), cst" to "icmp (and X, mask), cst" when X has multiple uses. This is useful for exposing secondary optimizations, but the X86 backend isn't ready for this when X has a single use. For example, this can disable load folding. This is inching towards resolving PR6627. llvm-svn: 130238	2011-04-26 20:18:20 +00:00
Chris Lattner	eb045f9c02	Improve the bail-out predicate to really only kick in when phi translation fails. We were bailing out in some cases that would cause us to miss GVN'ing some non-local cases away. llvm-svn: 130206	2011-04-26 17:41:02 +00:00
Chris Lattner	6f83d06ffa	Enhance MemDep: When alias analysis returns a partial alias result, return it as a clobber. This allows GVN to do smart things. Enhance GVN to be smart about the case when a small load is clobbered by a larger overlapping load. In this case, forward the value. This allows us to compile stuff like this: int test(void P) { int tmp = (unsigned int)P; return tmp+((unsigned char*)P+1); } into: _test: ## @test movl (%rdi), %ecx movzbl %ch, %eax addl %ecx, %eax ret which has one load. We already handled the case where the smaller load was from a must-aliased base pointer. llvm-svn: 130180	2011-04-26 01:21:15 +00:00
Cameron Zwarich	ca4c633489	Fix another case of <rdar://problem/9184212> that only occurs with code generated by llvm-gcc, since llvm-gcc uses 2 i64s for passing a 4 x float vector on ARM rather than an i64 array like Clang. llvm-svn: 129878	2011-04-20 21:48:38 +00:00
Frits van Bommel	d097212a08	Add test cases for Jay's r129641 and fix a 32-bit-centric testcase in a file with a 64-bit datalayout. llvm-svn: 129643	2011-04-16 14:31:50 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Eli Friedman	2395626605	Add an instcombine for constructs like a \| -(b != c); a select is more canonical, and generally leads to better code. Found while looking at an article about saturating arithmetic. llvm-svn: 129545	2011-04-14 22:41:27 +00:00
Owen Anderson	92651ec374	Fix an infinite alternation in JumpThreading where two transforms would repeatedly undo each other. The solution is to perform more aggressive constant folding to make one of the edges just folded away rather than trying to thread it. Fixes <rdar://problem/9284786>. Discovered with CSmith. llvm-svn: 129538	2011-04-14 21:35:50 +00:00
Mon P Wang	2e5528f0b2	Vectors with different number of elements of the same element type can have the same allocation size but different primitive sizes(e.g., <3xi32> and <4xi32>). When ScalarRepl promotes them, it can't use a bit cast but should use a shuffle vector instead. llvm-svn: 129472	2011-04-13 21:40:02 +00:00
Dan Gohman	1c6c34834b	Fix reassociate to use a worklist instead of recursing when new reassociation opportunities are exposed. This fixes a bug where the nested reassociation expects to be the IR to be consistent, but it isn't, because the outer reassociation has disconnected some of the operands. rdar://9167457 llvm-svn: 129324	2011-04-12 00:11:56 +00:00
Chris Lattner	e81d045d94	remove the StructRetPromotion pass. It is unused, not maintained and has some bugs. If this is interesting functionality, it should be reimplemented in the argpromotion pass. llvm-svn: 129314	2011-04-11 23:09:44 +00:00
Eli Friedman	9cca0715aa	Add back a couple checks removed by r129128; the fact that an intitializer is an array of structures doesn't imply it's a ConstantArray of ConstantStruct. llvm-svn: 129207	2011-04-09 09:11:09 +00:00
Chris Lattner	88974f4625	fix PR9523, a crash in looprotate on a non-canonical loop made out of indirectbr. llvm-svn: 129203	2011-04-09 07:25:58 +00:00
Eli Friedman	17822fcde9	PR9604; try to deal with RAUW updates correctly in the AST. I'm not convinced it's completely safe to cache the AST across LICM runs even with this fix, but this fix can't hurt. llvm-svn: 129198	2011-04-09 06:55:46 +00:00
Eli Friedman	4db39cefdb	Test for r129190. llvm-svn: 129197	2011-04-09 06:39:43 +00:00
Devang Patel	bc3d8b212f	Do not let debug info interfer with branch folding. llvm-svn: 129114	2011-04-07 23:11:25 +00:00
Devang Patel	197c35298a	While hoisting common code from if/else, hoist debug info intrinsics if they match. llvm-svn: 129078	2011-04-07 17:27:36 +00:00
Eli Friedman	c5f22a7815	PR9634: Don't unconditionally tell the AliasSetTracker that the PreheaderLoad is equivalent to any other relevant value; it isn't true in general. If it is equivalent, the LoopPromoter will tell the AST the equivalence. Also, delete the PreheaderLoad if it is unused. Chris, since you were the last one to make major changes here, can you check that this is sane? llvm-svn: 129049	2011-04-07 01:35:06 +00:00
Nadav Rotem	cc771acd77	This testcase passed even without the fix. Added the target info to make the test fail (without the fix). Thanks Dan. llvm-svn: 128999	2011-04-06 11:18:29 +00:00
Nadav Rotem	a069c6ce05	InstCombine optimizes gep(bitcast(x)) even when the bitcasts casts away address space info. We crash with an assert in this case. This change checks that the address space of the bitcasted pointer is the same as the gep ptr. llvm-svn: 128884	2011-04-05 14:29:52 +00:00
Eli Friedman	17bf4922c9	PR9446: RecursivelyDeleteTriviallyDeadInstructions can delete the instruction after the given instruction; make sure to handle that case correctly. (It's difficult to trigger; the included testcase involves a dead block, but I don't think that's a requirement.) While I'm here, get rid of the unnecessary warning about SimplifyInstructionsInBlock, since it should work correctly as far as I know. llvm-svn: 128782	2011-04-02 22:45:17 +00:00
Benjamin Kramer	d121765e64	InstCombine: Turn icmp + sext into bitwise/integer ops when the input has only one unknown bit. int test1(unsigned x) { return (x&8) ? 0 : -1; } int test3(unsigned x) { return (x&8) ? -1 : 0; } before (x86_64): _test1: andl $8, %edi cmpl $1, %edi sbbl %eax, %eax ret _test3: andl $8, %edi cmpl $1, %edi sbbl %eax, %eax notl %eax ret after: _test1: shrl $3, %edi andl $1, %edi leal -1(%rdi), %eax ret _test3: shll $28, %edi movl %edi, %eax sarl $31, %eax ret llvm-svn: 128732	2011-04-01 20:09:10 +00:00
Nadav Rotem	d74b72b8a9	Instcombile optimization: extractelement(cast) -> cast(extractelement) llvm-svn: 128683	2011-03-31 22:57:29 +00:00
Benjamin Kramer	5291054ef1	InstCombine: APFloat can't perform arithmetic on PPC double doubles, don't even try. Thanks Eli! llvm-svn: 128676	2011-03-31 21:35:49 +00:00
Benjamin Kramer	be209ab8a2	InstCombine: Fix transform to use the swapped predicate. Thanks Frits! llvm-svn: 128628	2011-03-31 10:46:03 +00:00
Benjamin Kramer	d159d94644	InstCombine: fold fcmp (fneg x), (fneg y) -> fcmp x, y llvm-svn: 128627	2011-03-31 10:12:22 +00:00
Benjamin Kramer	a8c5d0872d	InstCombine: fold fcmp pred (fneg x), C -> fcmp swap(pred) x, -C llvm-svn: 128626	2011-03-31 10:12:15 +00:00
Benjamin Kramer	cbb18e91a8	InstCombine: Shrink "fcmp (fpext x), C" to "fcmp x, C" if C can be losslessly converted to the type of x. Fixes PR9592. llvm-svn: 128625	2011-03-31 10:12:07 +00:00
Benjamin Kramer	2ccfbc8b71	InstCombine: fold fcmp (fpext x), (fpext y) -> fcmp x, y. llvm-svn: 128624	2011-03-31 10:11:58 +00:00
Bill Wendling	5034159c5f	* The DSE code that tested for overlapping needed to take into account the fact that one of the numbers is signed while the other is unsigned. This could lead to a wrong result when the signed was promoted to an unsigned int. * Add the data layout line to the testcase so that it will test the appropriate thing. Patch by David Terei! llvm-svn: 128577	2011-03-30 21:37:19 +00:00
Benjamin Kramer	af0ed953c5	Avoid turning a floating point division with a constant power of two into a denormal multiplication. Some platforms may treat denormals as zero, on other platforms multiplication with a subnormal is slower than dividing by a normal. llvm-svn: 128555	2011-03-30 17:02:54 +00:00
Benjamin Kramer	8564e0de96	InstCombine: If the divisor of an fdiv has an exact inverse, turn it into an fmul. Fixes PR9587. llvm-svn: 128546	2011-03-30 15:42:35 +00:00
Benjamin Kramer	272f2b0044	InstCombine: Add a few missing combines for ANDs and ORs of sign bit tests. On x86 we now compile "if (a < 0 && b < 0)" into testl %edi, %esi js IF.THEN llvm-svn: 128496	2011-03-29 22:06:41 +00:00
Cameron Zwarich	ff811cc475	Do some simple copy propagation through integer loads and stores when promoting vector types. This helps a lot with inlined functions when using the ARM soft float ABI. Fixes <rdar://problem/9184212>. llvm-svn: 128453	2011-03-29 05:19:52 +00:00
Nick Lewycky	8544228d5a	Teach the transformation that moves binary operators around selects to preserve the subclass optional data. llvm-svn: 128388	2011-03-27 19:51:23 +00:00
Frits van Bommel	0bb2ad2cf7	Constant folding support for calls to umul.with.overflow(), basically identical to the smul.with.overflow() code. llvm-svn: 128379	2011-03-27 14:26:13 +00:00
Nick Lewycky	83167df787	Add a small missed optimization: turn X == C ? X : Y into X == C ? C : Y. This removes one use of X which helps it pass the many hasOneUse() checks. In my analysis, this turns up very often where X = A >>exact B and that can't be simplified unless X has one use (except by increasing the lifetime of A which is generally a performance loss). llvm-svn: 128373	2011-03-27 07:30:57 +00:00
Cameron Zwarich	d4174ee43e	Fix a typo and add a test. llvm-svn: 128331	2011-03-26 04:58:50 +00:00
Bill Wendling	db40b5c899	PR9561: A store with a negative offset (via GEP) could erroniously say that it completely overlaps a previous store, thus mistakenly deleting that store. Check for this condition. llvm-svn: 128319	2011-03-26 01:20:37 +00:00
Cameron Zwarich	10ebc189ee	Fix PR9464 by correcting some math that just happened to be right in most cases that were hit in practice. llvm-svn: 128146	2011-03-23 05:25:55 +00:00
Anders Carlsson	ee6bc70d2f	Add an optimization to GlobalOpt that eliminates calls to __cxa_atexit, if the function passed is empty. llvm-svn: 127970	2011-03-20 17:59:11 +00:00
Andrew Trick	1c4b42d00f	Avoid creating canonical induction variables for non-native types. For example, on 32-bit architecture, don't promote all uses of the IV to 64-bits just because one use is a 64-bit cast. Alternate implementation of the patch by Arnaud de Grandmaison. llvm-svn: 127884	2011-03-18 16:50:32 +00:00
Eli Friedman	c17c9a78aa	FileCheck-ize and update test. llvm-svn: 127845	2011-03-18 01:10:31 +00:00
Devang Patel	aad34d882d	Try to not lose variable's debug info during instcombine. This is done by lowering dbg.declare intrinsic into dbg.value intrinsic. Radar 9143931. llvm-svn: 127834	2011-03-17 22:18:16 +00:00
Cameron Zwarich	0454253d7a	Only convert allocas to scalars if it is profitable. The profitability metric I chose is having a non-memcpy/memset use and being larger than any native integer type. Originally I chose having an access of a size smaller than the total size of the alloca, but this caused some minor issues on the spirit benchmark where SRoA runs again after some inlining. This fixes <rdar://problem/8613163>. llvm-svn: 127718	2011-03-16 00:13:44 +00:00
Cameron Zwarich	7b0f3c6a1a	Add native integer type TargetData to some existing tests. llvm-svn: 127717	2011-03-16 00:13:40 +00:00
Cameron Zwarich	0b8cdfb6ec	Do not add PHIs with no users when creating LCSSA form. Patch by Andrew Clinton. llvm-svn: 127674	2011-03-15 07:41:25 +00:00
Eli Friedman	c4414c6e92	PR9450: Make switch optimization in SimplifyCFG not dependent on the ordering of pointers in an std::map. llvm-svn: 127650	2011-03-15 02:23:35 +00:00
Eric Christopher	2139d3148f	If we don't know how long a string is we can't fold an _chk version to the normal version. Fixes rdar://9123638 llvm-svn: 127636	2011-03-15 00:25:41 +00:00
Benjamin Kramer	5acc751b6f	Teach ComputeMaskedBits about sub nsw. llvm-svn: 127548	2011-03-12 17:18:11 +00:00
Cameron Zwarich	338d362200	Roll r127459 back in: Optimize trivial branches in CodeGenPrepare, which often get created from the lowering of objectsize intrinsics. Unfortunately, a number of tests were relying on llc not optimizing trivial branches, so I had to add an option to allow them to continue to test what they originally tested. This fixes <rdar://problem/8785296> and <rdar://problem/9112893>. llvm-svn: 127498	2011-03-11 21:52:04 +00:00
Daniel Dunbar	94ccb27b43	Revert r127459, "Optimize trivial branches in CodeGenPrepare, which often get created from the", it broke some GCC test suite tests. llvm-svn: 127477	2011-03-11 19:30:30 +00:00
Benjamin Kramer	391a946fa9	ComputeMaskedBits: sub falls through to add, and sub doesn't have the same overflow semantics as add. Should fix the selfhost failures that started with r127463. llvm-svn: 127465	2011-03-11 14:46:49 +00:00
Benjamin Kramer	51897bcd3e	InstCombine: Fix a thinko where transform an icmp under the assumption that it's a zero comparison when it's not. Fixes PR9454. llvm-svn: 127464	2011-03-11 11:37:40 +00:00
Nick Lewycky	cc79973856	Teach ComputeMaskedBits about nsw on add. I don't think there's anything we can do with nuw here, but sub and mul should be given similar treatment. Fixes PR9343 #15! llvm-svn: 127463	2011-03-11 09:00:19 +00:00
Cameron Zwarich	cc27b3acc4	Optimize trivial branches in CodeGenPrepare, which often get created from the lowering of objectsize intrinsics. Unfortunately, a number of tests were relying on llc not optimizing trivial branches, so I had to add an option to allow them to continue to test what they originally tested. This fixes <rdar://problem/8785296> and <rdar://problem/9112893>. llvm-svn: 127459	2011-03-11 04:54:27 +00:00
Dan Gohman	154ed49784	Fix reassociate to postpone certain instruction deletions until after it has finished all of its reassociations, because its habit of unlinking operands and holding them in a datastructure while working means that it's not easy to determine when an instruction is really dead until after all its regular work is done. rdar://9096268. llvm-svn: 127424	2011-03-10 19:51:54 +00:00
Benjamin Kramer	b49b964b98	InstCombine: Turn umul_with_overflow into mul nuw if we can prove that it cannot overflow. This happens a lot in clang-compiled C++ code because it adds overflow checks to operator new[]: unsigned foo(unsigned n) { return new unsigned[n]; } We can optimize away the overflow check on 64 bit targets because (uint64_t)n4 cannot overflow. llvm-svn: 127418	2011-03-10 18:40:14 +00:00
Benjamin Kramer	1885d21700	Fix mistyped CHECK lines. llvm-svn: 127366	2011-03-09 22:07:31 +00:00
Devang Patel	13f8c7d48e	Preserve line number information while simplifying libcalls. llvm-svn: 127362	2011-03-09 21:27:52 +00:00
Cameron Zwarich	718918b07a	Add a test case for r127320. llvm-svn: 127321	2011-03-09 08:11:02 +00:00
Nick Lewycky	980104d1d6	Add another micro-optimization. Apologies for the lack of refactoring, but I gave up when I realized I couldn't come up with a good name for what the refactored function would be, to describe what it does. This is PR9343 test12, which is test3 with arguments reordered. Whoops! llvm-svn: 127318	2011-03-09 06:26:03 +00:00
Cameron Zwarich	3b649f4d01	Add support to scalar replacement for partial vector accesses of an alloca, e.g. a union of a float, <2 x float>, and <4 x float>. This mostly comes up with the use of vector intrinsics, especially in NEON when programmers know the layout of the register file. This enables codegen to eliminate a lot of the subregister traffic it would otherwise generate. This commit only enables this for a small number of floating-point cases, but a lot more integer cases. I assume this is okay for all ports, but I did not do extensive testing of the quality of code involving i512 vectors and the like. If there is a use case where this generates worse code than before, let me know and we can scale it back. This fixes <rdar://problem/9036264>. llvm-svn: 127317	2011-03-09 05:43:05 +00:00
Eli Friedman	a81a82dcaf	PR9346: Prevent SimplifyDemandedBits from incorrectly introducing INT_MIN % -1. llvm-svn: 127306	2011-03-09 01:28:35 +00:00
Eli Friedman	aac35b3fbb	PR9420; an instruction before an unreachable is guaranteed not to have any reachable uses, but there still might be uses in dead blocks. Use the standard solution of replacing all the uses with undef. This is a rare case because it's very sensitive to phase ordering in SimplifyCFG. llvm-svn: 127299	2011-03-09 00:48:33 +00:00
Duncan Sands	7dc3d47c34	Fix PR9331. Simplified version of a patch by Jakub Staszak. llvm-svn: 127243	2011-03-08 12:39:03 +00:00
Devang Patel	97d0be8ee1	While sinking an instruction, do not lose llvm.dbg.value intrinsic. llvm-svn: 127214	2011-03-08 03:06:19 +00:00
Devang Patel	d00c628f8f	Preserve line no. info. Radar `9097659` llvm-svn: 127182	2011-03-07 22:43:45 +00:00
Rafael Espindola	15a29867ed	Add test for r127138. llvm-svn: 127172	2011-03-07 21:28:14 +00:00
Nick Lewycky	ac55c79dd6	Tweak this test. We can analyze what happens and show that we still do the right thing, instead of merely being unable to analyze and the transform doesn't occur. llvm-svn: 127149	2011-03-07 02:10:18 +00:00
Nick Lewycky	e467979d0a	Add more analysis of the sign bit of an srem instruction. If the LHS is negative then the result could go either way. If it's provably positive then so is the srem. Fixes PR9343 #7! llvm-svn: 127146	2011-03-07 01:50:10 +00:00
Nick Lewycky	92db8e8e39	ConstantInt has some getters which return ConstantInt's or ConstantVector's of the value splatted into every element. Extend this to getTrue and getFalse which by providing new overloads that take Types that are either i1 or <N x i1>. Use it in InstCombine to add vector support to some code, fixing PR8469! llvm-svn: 127116	2011-03-06 03:36:19 +00:00
Nick Lewycky	9719a719c7	Thread comparisons over udiv/sdiv/ashr/lshr exact and lshr nuw/nsw whenever possible. This goes into instcombine and instsimplify because instsimplify doesn't need to check hasOneUse since it returns (almost exclusively) constants. This fixes PR9343 #4 #5 and #8! llvm-svn: 127064	2011-03-05 05:19:11 +00:00
Nick Lewycky	25cc338d88	Try once again to optimize "icmp (srem X, Y), Y" by turning the comparison into true/false or "icmp slt/sge Y, 0". llvm-svn: 127063	2011-03-05 04:28:48 +00:00
Nick Lewycky	41c529bd09	Revert broken srem logic from r126991. llvm-svn: 127021	2011-03-04 19:26:08 +00:00
Nick Lewycky	8e3a79da9f	Fold "icmp pred (srem X, Y), Y" like we do for urem. Handle signed comparisons in the urem case, though not the other way around. This is enough to get #3 from PR9343! llvm-svn: 126991	2011-03-04 10:06:52 +00:00
Nick Lewycky	3cec6f5563	Teach instruction simplify to use constant ranges to solve problems of the form "icmp pred %X, CI" and a number of examples where "%X = binop %Y, CI2". Some of these cases (div and rem) used to make it through opt -O2, but the others are probably now making code elsewhere redundant (probably instcombine). llvm-svn: 126988	2011-03-04 07:00:57 +00:00
Richard Osborne	af52c52569	Optimize fprintf -> iprintf if there are no floating point arguments and siprintf is available on the target. llvm-svn: 126940	2011-03-03 14:20:22 +00:00
Richard Osborne	2dfb888392	Optimize sprintf -> siprintf if there are no floating point arguments and siprintf is available on the target. llvm-svn: 126937	2011-03-03 14:09:28 +00:00
Richard Osborne	815de536e5	Optimize printf -> iprintf if there are no floating point arguments and iprintf is available on the target. Currently iprintf is only marked as being available on the XCore. llvm-svn: 126935	2011-03-03 13:17:51 +00:00
Anders Carlsson	da80afef99	Make InstCombiner::FoldAndOfICmps create a ConstantRange that's the intersection of the LHS and RHS ConstantRanges and return "false" when the range is empty. This simplifies some code and catches some extra cases. llvm-svn: 126744	2011-03-01 15:05:01 +00:00
Nick Lewycky	c9d20067cd	Optimize "icmp pred (urem X, Y), Y" --> true/false depending on pred. There's more work to do here, "icmp ult (urem X, 10), 11" doesn't optimize away yet. Fixes example 3 from PR9343! llvm-svn: 126741	2011-03-01 08:15:50 +00:00
Eli Friedman	683bbc16c4	Add an obvious missing safety check to DAE::RemoveDeadArgumentsFromCallers. llvm-svn: 126720	2011-03-01 00:33:47 +00:00
Dan Gohman	6564ca0c23	Delete obsolete test. llvm-svn: 126680	2011-02-28 19:58:14 +00:00
Frits van Bommel	8ae07996c9	Teach SimplifyCFG that (switch (select cond, X, Y)) is better expressed as a branch. Based on a patch by Alistair Lynn. llvm-svn: 126647	2011-02-28 09:44:07 +00:00
Nick Lewycky	66f4f22f7b	srem doesn't actually have the same resulting sign as its numerator, you could also have a zero when numerator = denominator. Reverts parts of r126635 and r126637. llvm-svn: 126644	2011-02-28 09:17:39 +00:00
Nick Lewycky	174a705497	Teach InstCombine to fold "(shr exact X, Y) == 0" --> X == 0, fixing #1 from PR9343. llvm-svn: 126643	2011-02-28 08:31:40 +00:00
Nick Lewycky	6b445419b0	The sign of an srem instruction is the sign of its dividend (the first argument), regardless of the divisor. Teach instcombine about this and fix test7 in PR9343! llvm-svn: 126635	2011-02-28 06:20:05 +00:00
Benjamin Kramer	ceb5daa567	Revert "SimplifyCFG: GEPs with just one non-constant index are also cheap." Yes, there are other types than i8* and GEPs on them can produce an add+multiply. We don't consider that cheap enough to be speculatively executed. llvm-svn: 126481	2011-02-25 10:33:33 +00:00
Benjamin Kramer	dfdca1a14d	SimplifyCFG: GEPs with just one non-constant index are also cheap. llvm-svn: 126452	2011-02-24 23:26:09 +00:00
Benjamin Kramer	27361a7124	SimplifyCFG: GEPs with constant indices are cheap enough to be executed unconditionally. llvm-svn: 126445	2011-02-24 22:46:11 +00:00
Chris Lattner	adf38b3e09	change instcombine to not turn a call to non-varargs bitcast of function prototype into a call to a varargs prototype. We do allow the xform if we have a definition, but otherwise we don't want to risk that we're changing the abi in a subtle way. On X86-64, for example, varargs require passing stuff in %al. llvm-svn: 126363	2011-02-24 05:10:56 +00:00
Cameron Zwarich	826308586c	Make LoopDeletion work on loops with multiple edges, as long as the incoming values from all of the loop's exiting blocks are equal. Patch by Andrew Clinton. llvm-svn: 126253	2011-02-22 22:25:39 +00:00
Benjamin Kramer	d5d7f37beb	InstCombine: Add a bunch of combines of the form x \| (y ^ z). We usually catch this kind of optimization through InstSimplify's distributive magic, but or doesn't distribute over xor in general. "A \| ~(A \| B) -> A \| ~B" hits 24 times on gcc.c. llvm-svn: 126081	2011-02-20 13:23:43 +00:00
Nick Lewycky	c8a1569950	Teach RecursivelyDeleteDeadPHINodes to handle multiple self-references. Patch by Andrew Clinton! llvm-svn: 126077	2011-02-20 08:38:20 +00:00
Eli Friedman	ef200db4fd	PR9218: SimplifyDemandedVectorElts can return a non-null value that is not the instruction passed in. Make sure to account for this correctly, instead of looping infinitely. llvm-svn: 126058	2011-02-19 22:42:40 +00:00
Chris Lattner	72a35fb974	rewrite the memset_pattern pattern generation stuff to accept any 2/4/8/16-byte constant, including globals. This makes us generate much more "pretty" pattern globals as well because it doesn't break it down to an array of bytes all the time. This enables us to handle stores of relocatable globals. This kicks in about 48 times in 254.gap, giving us stuff like this: @.memset_pattern40 = internal constant [2 x %struct.TypHeader* (%struct.TypHeader, %struct.TypHeader)] [%struct.TypHeader (%struct.TypHeader, %struct .TypHeader)* @IsFalse, %struct.TypHeader* (%struct.TypHeader, %struct.TypHeader)* @IsFalse], align 16 ... call void @memset_pattern16(i8* %scevgep5859, i8* bitcast ([2 x %struct.TypHeader* (%struct.TypHeader, %struct.TypHeader)] @.memset_pattern40 to i8* ), i64 %tmp75) nounwind llvm-svn: 126044	2011-02-19 19:56:44 +00:00
Chris Lattner	acf6b0776a	Stores of null pointers should turn into memset, we weren't recognizing them as splat values. llvm-svn: 126041	2011-02-19 19:35:49 +00:00
Chris Lattner	0f4a64011e	Implement rdar://9009151, transforming strided loop stores of unsplatable values into memset_pattern16 when it is available (recent darwins). This transforms lots of strided loop stores of ints for example, like 5 in vpr: Formed memset: call void @memset_pattern16(i8* %4, i8* getelementptr inbounds ([16 x i8]* @.memset_pattern9, i32 0, i32 0), i64 %tmp25) from store to: {%3,+,4}<%11> at: store i32 3, i32* %scevgep, align 4, !tbaa !4 llvm-svn: 126040	2011-02-19 19:31:39 +00:00
Duncan Sands	84653b3674	Add some transforms of the kind X-Y>X -> 0>Y which are valid when there is no overflow. These subsume some existing equality transforms, so zap those. llvm-svn: 125843	2011-02-18 16:25:37 +00:00
Chris Lattner	6b88c76f13	add a testcase for r125827 llvm-svn: 125831	2011-02-18 05:05:01 +00:00
Chris Lattner	1a924e770a	prevent jump threading from merging blocks when their address is taken (and used!). This prevents merging the blocks (invalidating the block addresses) in a case like this: #define _THIS_IP_ ({ __label__ __here; __here: (unsigned long)&&__here; }) void foo() { printf("%p\n", _THIS_IP_); printf("%p\n", _THIS_IP_); printf("%p\n", _THIS_IP_); } which fixes PR4151. llvm-svn: 125829	2011-02-18 04:43:06 +00:00
Chris Lattner	a8fed47eed	have instcombine preserve nsw/nuw/exact when sinking common operations through a phi. llvm-svn: 125790	2011-02-17 23:01:49 +00:00
Chris Lattner	abb8eb2c63	fix instcombine merging GEPs through a PHI to only make the result inbounds if all of the inputs are inbounds. llvm-svn: 125785	2011-02-17 22:21:26 +00:00
Nadav Rotem	7cc6d12ad0	Enhance constant folding of bitcast operations on vectors of floats. Add getAllOnesValue of FP numbers to Constants and APFloat. Add more tests. llvm-svn: 125776	2011-02-17 21:22:27 +00:00
Duncan Sands	e522001171	Transform "A + B >= A + C" into "B >= C" if the adds do not wrap. Likewise for some variations (some of these were already present so I unified the code). Spotted by my auto-simplifier as occurring a lot. llvm-svn: 125734	2011-02-17 07:46:37 +00:00
Chris Lattner	5592071768	preserve NUW/NSW when transforming add x,x llvm-svn: 125711	2011-02-17 02:23:02 +00:00
Chris Lattner	0ad64291d8	filecheckize llvm-svn: 125710	2011-02-17 02:21:03 +00:00
Chris Lattner	3eb0af94c4	fix PR9215, preventing -reassociate from clearing nsw/nuw when it swaps the LHS/RHS of a single binop. llvm-svn: 125700	2011-02-17 01:29:24 +00:00
Nick Lewycky	038124b671	Teach PatternMatch that splat vectors could be floating point as well as integer. Fixes PR9228! llvm-svn: 125613	2011-02-15 23:13:23 +00:00
Nadav Rotem	67d67a0385	Fix 9216 - Endless loop in InstCombine pass. The pattern "A&(A^B) -> A & ~B" recreated itself because ~B is actually a xor -1. llvm-svn: 125557	2011-02-15 07:13:48 +00:00
Devang Patel	3058398655	Do not hoist @llvm.dbg.value. Here, @llvm.dbg.value is "referring" a value that is modified inside loop. llvm-svn: 125529	2011-02-14 23:03:23 +00:00
Duncan Sands	d114ab331c	Teach instsimplify that X+Y>=X+Z is the same as Y>=Z if neither side overflows, plus some variations of this. According to my auto-simplifier this occurs a lot but usually in combination with max/min idioms. Because max/min aren't handled yet this unfortunately doesn't have much effect in the testsuite. llvm-svn: 125462	2011-02-13 17:15:40 +00:00
Nadav Rotem	0e162c57f8	Fix test llvm-svn: 125460	2011-02-13 16:13:16 +00:00

... 7 8 9 10 11 ...

3029 Commits