llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	11743249e6	Move optimization to avoid redundant matching. llvm-svn: 108140	2010-07-12 13:34:22 +00:00
Benjamin Kramer	9675e759cf	Revert r108136 until I figure out why it broke selfhost. llvm-svn: 108139	2010-07-12 12:35:49 +00:00
Gabor Greif	782f62412f	cache dereferenced iterators llvm-svn: 108138	2010-07-12 12:03:02 +00:00
Gabor Greif	433b975fe2	recommit r108131 (hich has been backed out in r108135) with a fix llvm-svn: 108137	2010-07-12 12:02:10 +00:00
Benjamin Kramer	35473faa50	instcombine: fold (x & y) \| (~x & z) and (x & y) ^ (~x & z) into ((y ^ z) & x) ^ z which is one instruction shorter. (PR6773) before: %and = and i32 %y, %x %neg = xor i32 %x, -1 %and4 = and i32 %z, %neg %xor = xor i32 %and4, %and after: %xor1 = xor i32 %z, %y %and2 = and i32 %xor1, %x %xor = xor i32 %and2, %z llvm-svn: 108136	2010-07-12 11:54:45 +00:00
Gabor Greif	f9610827ce	back out r108131 (of TailDuplication.cpp) for now, it causes a buildbot failure llvm-svn: 108135	2010-07-12 11:32:39 +00:00
Gabor Greif	6143704ac5	cache dereferenced iterators llvm-svn: 108134	2010-07-12 11:19:24 +00:00
Gabor Greif	8629f12bb8	cache dereferenced iterators llvm-svn: 108133	2010-07-12 10:59:23 +00:00
Gabor Greif	d993402df3	cache dereferenced iterators llvm-svn: 108132	2010-07-12 10:49:54 +00:00
Gabor Greif	2a464d7308	cache dereferenced iterators llvm-svn: 108131	2010-07-12 10:36:48 +00:00
Duncan Sands	41b4a6b36a	Convert some tab stops into spaces. llvm-svn: 108130	2010-07-12 08:16:59 +00:00
Chris Lattner	601e390a3b	make the prototypes for CreateMalloc and CreateFree more consistent. Patch by Hans Vandierendonck from PR7605 llvm-svn: 108116	2010-07-12 00:57:28 +00:00
Chris Lattner	bbc25ff5cc	if jump threading is able to infer interesting values on both the LHS and RHS of an and/or instruction, don't multiply add known predecessor values. This fixes the crash on testcase from PR7498 llvm-svn: 108114	2010-07-12 00:47:34 +00:00
Duncan Sands	82b21c086e	The accumulator tail recursion transform claims to work for any associative operation, but the way it's implemented requires the operation to also be commutative. So add a check for commutativity (and tweak the corresponding comments). This makes no difference in practice since every associative LLVM instruction is also commutative! Here's an example to show the need for commutativity: the accum_recursion.ll testcase calculates the factorial function. Before the transformation the result of a call is ((((11)2)3)...)x while afterwards it is (((1x)(x-1))...2)1 which clearly requires both associativity and commutativity of * to be equal to the original. llvm-svn: 108056	2010-07-10 20:31:42 +00:00
Gabor Greif	9d5ae03404	cache result of operator* llvm-svn: 107990	2010-07-09 16:51:20 +00:00
Gabor Greif	fd8e7d4a0f	cache result of operator* llvm-svn: 107984	2010-07-09 16:31:08 +00:00
Gabor Greif	e7650c7c29	cache result of operator* llvm-svn: 107983	2010-07-09 16:26:41 +00:00
Gabor Greif	04af1e4f65	cache result of operator* llvm-svn: 107981	2010-07-09 16:17:52 +00:00
Gabor Greif	e82532a1c5	cache result of operator* llvm-svn: 107976	2010-07-09 15:40:10 +00:00
Gabor Greif	6d8870fc35	cache result of operator* llvm-svn: 107975	2010-07-09 15:25:42 +00:00
Gabor Greif	329c4d8ed9	cache result of operator* llvm-svn: 107974	2010-07-09 15:25:09 +00:00
Gabor Greif	0028cc6730	cache result of operator* llvm-svn: 107972	2010-07-09 15:01:36 +00:00
Gabor Greif	d323f5e161	cache result of operator* (found by inspection) llvm-svn: 107971	2010-07-09 14:48:08 +00:00
Gabor Greif	b0d56ffc85	cache result of operator* llvm-svn: 107969	2010-07-09 14:36:49 +00:00
Gabor Greif	4247949ce9	cache result of operator* llvm-svn: 107968	2010-07-09 14:29:14 +00:00
Gabor Greif	a02f232c1b	cache result of operator* llvm-svn: 107966	2010-07-09 14:18:23 +00:00
Gabor Greif	f0821f39ee	cache operator*'s result (in multiple functions) llvm-svn: 107965	2010-07-09 14:02:13 +00:00
Gabor Greif	60a346d0f1	do not repeatedly dereference use_iterator llvm-svn: 107962	2010-07-09 12:23:50 +00:00
Benjamin Kramer	2321e6a4d4	Teach instcombine to transform (X >s -1) ? C1 : C2 and (X <s 0) ? C2 : C1 into ((X >>s 31) & (C2 - C1)) + C1, avoiding the conditional. This optimization could be extended to take non-const C1 and C2 but we better stay conservative to avoid code size bloat for now. for int sel(int n) { return n >= 0 ? 60 : 100; } we now generate sarl $31, %edi andl $40, %edi leal 60(%rdi), %eax instead of testl %edi, %edi movl $60, %ecx movl $100, %eax cmovnsl %ecx, %eax llvm-svn: 107866	2010-07-08 11:39:10 +00:00
Chris Lattner	efa3c824cc	Fix the second half of PR7437: scalarrepl wasn't preserving address spaces when SRoA'ing memcpy's. llvm-svn: 107846	2010-07-08 00:27:05 +00:00
Duncan Sands	408bb192de	Rename "Release" builds as "Release+Asserts"; rename "Release-Asserts" builds to "Release". The default build is unchanged (optimization on, assertions on), however it is now called Release+Asserts. The intent is that future LLVM releases released via llvm.org will be Release builds in the new sense, i.e. will have assertions disabled (currently they have assertions enabled, for a more than 20% slowdown). This will bring them in line with MacOS releases, which ship with assertions disabled. It also means that "Release" now means the same things in make and cmake builds: cmake already disables assertions for "Release" builds AFAICS. llvm-svn: 107758	2010-07-07 07:48:00 +00:00
Nick Lewycky	dace239949	Detabify this file. llvm-svn: 107637	2010-07-06 03:53:43 +00:00
Devang Patel	cefe3831b7	MDString is already checked earlier. llvm-svn: 107516	2010-07-02 21:13:23 +00:00
Dan Gohman	832282e061	Don't claim to preserve AliasAnalysis. First, this is doesn't actually have any effect, and second, deleting stores can potentially invalidate an AliasAnalysis, and there's currently no notification for this. llvm-svn: 107496	2010-07-02 18:43:05 +00:00
Bill Wendling	03bcd6ecc8	Implement the "linker_private_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. However, this linkage isn't specific to Objective-C. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". Currently only supported on Darwin platforms. llvm-svn: 107433	2010-07-01 21:55:59 +00:00
Devang Patel	2b434e12cd	Debugging infomration is encoded in llvm IR using metadata. This is designed such a way that debug info for symbols preserved even if symbols are optimized away by the optimizer. Add new special pass to remove debug info for such symbols. llvm-svn: 107416	2010-07-01 19:49:20 +00:00
Devang Patel	b9e2e4b762	If a named mdnode is removed then mark module as changed. llvm-svn: 107412	2010-07-01 18:27:46 +00:00
Jim Grosbach	e74c78d539	lowerinvoke needs to handle aggregate function args like sjlj eh does. llvm-svn: 107335	2010-06-30 22:22:59 +00:00
Devang Patel	db735cbbab	Remove all debug info related named mdnodes. llvm-svn: 107323	2010-06-30 21:29:00 +00:00
Gabor Greif	74470192d7	use ArgOperand API llvm-svn: 107278	2010-06-30 12:42:43 +00:00
Gabor Greif	d50572802e	use ArgOperand API llvm-svn: 107277	2010-06-30 12:40:35 +00:00
Gabor Greif	3abd881bea	use getArgOperand (corrected by CallInst::ArgOffset) instead of getOperand llvm-svn: 107275	2010-06-30 12:38:26 +00:00
Gabor Greif	743b3fd196	use getArgOperand (corrected by CallInst::ArgOffset) instead of getOperand llvm-svn: 107273	2010-06-30 09:19:23 +00:00
Gabor Greif	f628ecd15f	use getNumArgOperands instead of getNumOperands llvm-svn: 107272	2010-06-30 09:17:53 +00:00
Gabor Greif	fe252e6fa0	use getArgOperand instead of getOperand llvm-svn: 107271	2010-06-30 09:16:16 +00:00
Gabor Greif	8ae3095286	use getArgOperand instead of getOperand llvm-svn: 107270	2010-06-30 09:15:28 +00:00
Gabor Greif	e9acc46f65	use getArgOperand instead of getOperand llvm-svn: 107269	2010-06-30 09:14:26 +00:00
Bill Wendling	3632171750	Revert r107205 and r107207. llvm-svn: 107215	2010-06-29 22:34:52 +00:00
Bill Wendling	1767723dbe	Introducing the "linker_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". llvm-svn: 107205	2010-06-29 21:24:00 +00:00
Duncan Sands	17f1ca8793	Return Changed. This required setting Changed if dbg metadata is stripped off. Currently set unconditionally, since the API does not provide a way of working out if anything was actually stripped off. llvm-svn: 107142	2010-06-29 14:52:10 +00:00
Gabor Greif	5b1370ee80	use ArgOperand API llvm-svn: 107017	2010-06-28 16:50:57 +00:00
Gabor Greif	e23efeef10	use ArgOperand API llvm-svn: 107016	2010-06-28 16:45:00 +00:00
Gabor Greif	18c5bae727	employ CallInst::ArgOffset (for now) llvm-svn: 107015	2010-06-28 16:43:57 +00:00
Gabor Greif	2dd4307e45	use setArgOperand llvm-svn: 107004	2010-06-28 12:31:35 +00:00
Gabor Greif	ec60adf161	use CallInst::ArgOffset llvm-svn: 107003	2010-06-28 12:30:07 +00:00
Gabor Greif	2de43a7c5c	use ArgOperand API and CallInst::ArgOffset llvm-svn: 107002	2010-06-28 12:29:20 +00:00
Gabor Greif	4300fc77ae	use cached value llvm-svn: 107000	2010-06-28 11:20:42 +00:00
Chris Lattner	25a843fcd2	minor cleanup to SROA: when lowering type unsafe accesses to large integers, the first inserted value would always create an 'or X, 0'. Even though this is trivially zapped by instcombine, don't bother creating this pointless instruction. llvm-svn: 106979	2010-06-27 07:58:26 +00:00
Duncan Sands	3a5cb69cb8	Fix PR7328: when turning a tail recursion into a loop, need to preserve the returned value after the tail call if it differs from other return values. The optimal thing to do would be to introduce a phi node for the return value, but for the moment just fix the miscompile. llvm-svn: 106947	2010-06-26 12:53:31 +00:00
Dan Gohman	fb9712bdae	In GenerateReassociations, don't bother thinking about individual SCEVUnknown values which are loop-variant, as LSR can't do anything interesting with these values in any case. This fixes very slow compile times on loops which have large numbers of such values. llvm-svn: 106897	2010-06-25 22:32:18 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Gabor Greif	e3ba486c9f	use ArgOperand API (one more hunk I could split) llvm-svn: 106825	2010-06-25 07:58:41 +00:00
Gabor Greif	5f3e656a1b	use ArgOperand API (some hunks I could split) llvm-svn: 106824	2010-06-25 07:57:14 +00:00
Gabor Greif	07e9284c75	use ArgOperand API; tighten type of handleFreeWithNonTrivialDependency to be able to use isFreeCall whithout a cast or new overload llvm-svn: 106823	2010-06-25 07:40:32 +00:00
Dan Gohman	4143e9deeb	Add an exports file for the Hello example plugin. llvm-svn: 106768	2010-06-24 17:36:51 +00:00
Dan Gohman	963b1c142e	A few minor micro-optimizations. llvm-svn: 106764	2010-06-24 16:57:52 +00:00
Dan Gohman	47ddf76d89	Teach getExactSDiv to evaluate x/1 to x up front, as it's a common enough special case, and it theoretically allows more folding because it works even when x is unanalyzable. llvm-svn: 106763	2010-06-24 16:51:25 +00:00
Dan Gohman	ab5422200b	Fix copy+pasto issues in isMulSExtable. llvm-svn: 106759	2010-06-24 16:45:11 +00:00
Gabor Greif	7ccec09252	use ArgOperand API llvm-svn: 106752	2010-06-24 16:11:44 +00:00
Gabor Greif	a6d75e2cf7	use (even more, still) ArgOperand API llvm-svn: 106750	2010-06-24 15:51:11 +00:00
Gabor Greif	218f5541b2	use ArgOperand API and CallSite for arg range; add necessary casts and perform some cosmetics llvm-svn: 106747	2010-06-24 14:42:01 +00:00
Gabor Greif	5aafdf1e43	use ArgOperand API and CallSite for arg range llvm-svn: 106745	2010-06-24 14:13:36 +00:00
Gabor Greif	0a136c9b53	use (even more) ArgOperand API llvm-svn: 106744	2010-06-24 13:54:33 +00:00
Gabor Greif	590d95ed18	use ArgOperand API llvm-svn: 106743	2010-06-24 13:42:49 +00:00
Gabor Greif	589a0b950a	use ArgOperand API llvm-svn: 106740	2010-06-24 12:58:35 +00:00
Gabor Greif	7943017490	use ArgOperand API llvm-svn: 106737	2010-06-24 12:35:13 +00:00
Gabor Greif	75f6943c95	use ArgOperand API, also tighten the type of visitFree to make this work out smoothly llvm-svn: 106736	2010-06-24 12:21:15 +00:00
Gabor Greif	91f9589057	use ArgOperand API; introduce downcasted pointers into scope to facilitate this llvm-svn: 106734	2010-06-24 12:03:56 +00:00
Gabor Greif	e2f482ca0b	use ArgOperand API llvm-svn: 106731	2010-06-24 10:42:46 +00:00
Gabor Greif	2d958d4db5	use ArgOperand API llvm-svn: 106730	2010-06-24 10:17:17 +00:00
Gabor Greif	5bcaa55761	use callsite to obtain all arguments llvm-svn: 106729	2010-06-24 10:04:07 +00:00
Gabor Greif	42f620cc55	use callsite to obtain all arguments llvm-svn: 106728	2010-06-24 09:56:43 +00:00
Gabor Greif	0f60709f0e	use getNumArgOperands llvm-svn: 106709	2010-06-24 00:48:48 +00:00
Gabor Greif	4a39b84a9d	use ArgOperand API llvm-svn: 106707	2010-06-24 00:44:01 +00:00
Devang Patel	0dc3c2d37e	Use ValueMap instead of DenseMap. The ValueMapper used by various cloning utility maps MDNodes also. llvm-svn: 106706	2010-06-24 00:33:28 +00:00
Devang Patel	d8dedee96d	Use available typedef for " DenseMap<const Value, Value>". llvm-svn: 106699	2010-06-24 00:00:42 +00:00
Devang Patel	b8f11de105	Cosmetic change. Do not use "ValueMap" as a name for a local variable or an argument. llvm-svn: 106698	2010-06-23 23:55:51 +00:00
Devang Patel	9ad629367d	Revert 106592 for now. It causes clang-selfhost build failure. llvm-svn: 106598	2010-06-22 23:29:55 +00:00
Dan Gohman	1081f1a0f5	Fix OptimizeMax to handle an odd case where one of the max operands is another max which folds. This fixes PR7454. llvm-svn: 106594	2010-06-22 23:07:13 +00:00
Devang Patel	87f75f75be	If a metadata operand is seeded in value map and the metadata should also be seeded in value map. This is not limited to function local metadata. Failure to seed metdata in such cases causes troubles when in a cloned module, metadata from a new module refers to values in old module. Usually this results in mysterious bugpoint crashes. For example, Checking to see if we can delete global inits: Unknown constant! UNREACHABLE executed at /d/g/llvm/lib/Bitcode/Writer/BitcodeWriter.cpp:904! llvm-svn: 106592	2010-06-22 22:53:21 +00:00
Devang Patel	e43c6487da	While cloning a module, clone metadata attached with instructions. llvm-svn: 106591	2010-06-22 22:50:42 +00:00
Devang Patel	e3fbbd19ed	Clone named metadata while cloning a module. Reapply Bob's patch. llvm-svn: 106560	2010-06-22 18:52:38 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Devang Patel	f040dec68a	Revert 106528. It is causing self host failures. llvm-svn: 106529	2010-06-22 06:14:09 +00:00
Devang Patel	b195eb4acf	Do not rely on DenseMap slot which can be easily invalidated when DenseMap grows. llvm-svn: 106528	2010-06-22 05:16:56 +00:00
Bob Wilson	6c1fc79cab	Revert my change to clone named metadata. Buildbots are complaining. --- Reverse-merging r106508 into '.': U lib/Transforms/Utils/CloneModule.cpp llvm-svn: 106521	2010-06-22 02:08:51 +00:00
Bob Wilson	5f9575c1cd	Include named metadata when cloning a module. llvm-svn: 106508	2010-06-22 00:11:03 +00:00
Dan Gohman	dd41bba517	Use A.append(...) instead of A.insert(A.end(), ...) when A is a SmallVector, and other SmallVector simplifications. llvm-svn: 106452	2010-06-21 19:47:52 +00:00
Dan Gohman	32655906e4	Add a TODO comment. llvm-svn: 106397	2010-06-19 21:30:18 +00:00
Dan Gohman	51d00092b6	Include the use kind along with the expression in the key of the use sharing map. The reconcileNewOffset logic already forces a separate use if the kinds differ, so incorporating the kind in the key means we can track more sharing opportunities. More sharing means fewer total uses to track, which means smaller problem sizes, which means the conservative throttles don't kick in as often. llvm-svn: 106396	2010-06-19 21:29:59 +00:00
Dan Gohman	297fb8b9fc	Don't include things in anonymous namespaces that don't need it. llvm-svn: 106395	2010-06-19 21:21:39 +00:00
Dan Gohman	f3aea7aecf	Disable indvars on loops when LoopSimplify form is not available. This fixes PR7333. llvm-svn: 106267	2010-06-18 01:35:11 +00:00
Jim Grosbach	e94f1ded24	remove trailing whitespace llvm-svn: 106164	2010-06-16 22:41:09 +00:00
Rafael Espindola	a20e2dfe86	Make sure that simplify libcalls does not replace a call with one calling convention with a new call with a different calling convention. llvm-svn: 106134	2010-06-16 19:34:01 +00:00
Benjamin Kramer	a13bd20396	simplify-libcalls: fold strncmp(x, y, 1) -> memcmp(x, y, 1) The memcmp will be optimized further and even the pathological case 'strstr(x, "x") == x' generates optimal code now. llvm-svn: 106097	2010-06-16 10:30:29 +00:00
Benjamin Kramer	1118860e3a	simplify-libcalls: fold strstr(a, b) == a -> strncmp(a, b, strlen(b)) == 0 llvm-svn: 106047	2010-06-15 21:34:25 +00:00
Chris Lattner	329ea064ed	jump threading can't split a critical edge from an indirectbr. This fixes PR7356. llvm-svn: 105950	2010-06-14 19:45:43 +00:00
Benjamin Kramer	b82de426de	SimplifyCFG: don't turn volatile stores to null/undef into unreachable. Fixes PR7369. llvm-svn: 105914	2010-06-13 14:35:54 +00:00
Kenneth Uildriks	9b21208bfb	Pulled CodeMetrics out of InlineCost.h and made it a bit more general, so it can be reused from PartialSpecializationCost llvm-svn: 105725	2010-06-09 15:11:37 +00:00
Dan Gohman	fb8ed43349	Make bugpoint dead-argument-hacking actually work, and actually test it. llvm-svn: 105551	2010-06-07 20:20:33 +00:00
Kenneth Uildriks	1850444000	Partial specialization was not checking the callsite to make sure it was using the same constants as the specialization, leading to calls to the wrong specialization. Patch by Takumi Nakamura\! llvm-svn: 105528	2010-06-05 14:50:21 +00:00
Dan Gohman	67b4403101	Don't track users of undef values; they aren't interesting for register pressure. llvm-svn: 105501	2010-06-04 23:16:05 +00:00
Devang Patel	36da24b546	Copy location info for current function argument from dbg.declare if respective store instruction does not have any location info. llvm-svn: 105490	2010-06-04 22:27:30 +00:00
Jim Grosbach	5ba76b94f8	Remove unused code llvm-svn: 105293	2010-06-01 21:56:30 +00:00
Jim Grosbach	0e20dc5cd6	fix think-o llvm-svn: 105291	2010-06-01 21:35:50 +00:00
Jim Grosbach	b69c68742a	Simplify things a bit more. Fix prototype to use SmallVectorImpl and change a few SmallVectors to vanilla C arrays. llvm-svn: 105289	2010-06-01 21:06:46 +00:00
Jim Grosbach	a37af16221	mirror of r105280 changes for LowerInvoke, which uses the same basic logic here llvm-svn: 105281	2010-06-01 18:04:56 +00:00
Jim Grosbach	7352167560	Use SmallVector instead of std::vector. llvm-svn: 105279	2010-06-01 17:56:41 +00:00
Duncan Sands	4c904fa797	Fix PR7272: when inlining through a callsite with byval arguments, the newly created allocas may be used by inlined calls, so these need to have their tail call flags cleared. Fixes PR7272. llvm-svn: 105255	2010-05-31 21:00:26 +00:00
Benjamin Kramer	5ac57e3440	Avoid swap when a copy suffices. llvm-svn: 105220	2010-05-31 12:50:41 +00:00
Nick Lewycky	aee2632be3	The memcpy intrinsic only takes i8* for %src and %dst, so cast them to that first. Fixes PR7265. llvm-svn: 105206	2010-05-31 06:16:35 +00:00
Dan Gohman	826bdf8c10	Move FindAvailableLoadedValue isSafeToLoadUnconditionally out of lib/Transforms/Utils and into lib/Analysis so that Analysis passes can use them. llvm-svn: 104949	2010-05-28 16:19:17 +00:00
Dan Gohman	df5d7dcef1	Teach instcombine to promote alloca array sizes. llvm-svn: 104945	2010-05-28 15:09:00 +00:00
Dan Gohman	05a6555acb	Fix instcombine's handling of alloca to accept non-i32 types. llvm-svn: 104935	2010-05-28 04:33:04 +00:00
Devang Patel	3e0fbafab2	Fix typo. llvm-svn: 104914	2010-05-28 01:29:50 +00:00
Devang Patel	e2099e8088	Fix typo. llvm-svn: 104913	2010-05-28 01:17:51 +00:00
Devang Patel	7a9dedf0ab	Do not drop location info for inlined function args. llvm-svn: 104884	2010-05-27 20:25:04 +00:00
Duncan Sands	f162eace49	Teach instCombine to remove malloc+free if malloc's only uses are comparisons to null. Patch by Matti Niemenmaa. llvm-svn: 104871	2010-05-27 19:09:06 +00:00
Benjamin Kramer	6877119ef3	Kill unneeded SExt. llvm-svn: 104692	2010-05-26 09:45:04 +00:00
Benjamin Kramer	9439084cea	Properly promote operands when optimizing a single-character memcmp. llvm-svn: 104648	2010-05-25 22:53:43 +00:00
Dan Gohman	a4abd035ea	Fix a missing newline in debug output. llvm-svn: 104644	2010-05-25 21:50:35 +00:00
Dan Gohman	9b48b856ea	DominatorTree.getNode can return null for unreachable blocks. llvm-svn: 104290	2010-05-20 22:46:54 +00:00
Dan Gohman	86110fa2bb	Minor code cleanups. llvm-svn: 104287	2010-05-20 22:25:20 +00:00
Dan Gohman	6295f2ebb8	Make Solve check its own post-condition, to reduce clutter in the top-level LSRInstance logic. llvm-svn: 104278	2010-05-20 20:59:23 +00:00
Dan Gohman	a4ca28a3ae	Add comments. llvm-svn: 104276	2010-05-20 20:52:00 +00:00
Dan Gohman	927bcaadda	More code cleanups. Use iterators instead of indices when indices aren't needed. llvm-svn: 104273	2010-05-20 20:33:18 +00:00
Dan Gohman	4c4043cf34	Fix OptimizeShadowIV to set Changed. Change OptimizeLoopTermCond to set Changed directly instead of using a return value. Rename FilterOutUndesirableDedicatedRegisters's Changed variable to distinguish it from LSRInstance's Changed member. llvm-svn: 104269	2010-05-20 20:05:31 +00:00
Dan Gohman	8ec018cedf	Add some comments. llvm-svn: 104268	2010-05-20 20:00:41 +00:00
Dan Gohman	8ce95cc3c5	Simplify this code. Don't do a DomTreeNode lookup for each visited block. llvm-svn: 104267	2010-05-20 20:00:25 +00:00
Dan Gohman	ab5fb7f559	Minor code cleanups. llvm-svn: 104263	2010-05-20 19:44:23 +00:00
Dan Gohman	ee2fea3cd7	When canonicalizing icmp operand order to put the loop invariant operand on the left, the interesting operand is on the right. This fixes a bug where LSR was failing to recognize ICmpZero uses, which led it to be unable to reverse the induction variable in the attached testcase. Delete test/CodeGen/X86/stack-color-with-reg-2.ll, because its test is extremely fragile and hard to meaningfully update. llvm-svn: 104262	2010-05-20 19:26:52 +00:00
Dan Gohman	fdf9874ba7	Set Changed to true when canonicalizing ICmp operand order; even though it isn't a very interesting change, it's a change nonetheless. llvm-svn: 104260	2010-05-20 19:16:03 +00:00
Devang Patel	e2ff7f3a7d	Strip llvm.dbg.lv also. llvm-svn: 104236	2010-05-20 16:49:22 +00:00
Dan Gohman	981563d0ba	Rename a variable to avoid shadowing. llvm-svn: 104234	2010-05-20 16:41:11 +00:00
Dan Gohman	6b733fc189	Minor code simplification. llvm-svn: 104232	2010-05-20 16:23:28 +00:00
Dan Gohman	80a9608442	Move the code for deleting BaseRegs and LSRUses into helper functions, and fix a bug that valgrind noticed where the code would std::swap an element with itself. llvm-svn: 104225	2010-05-20 15:17:54 +00:00
Dan Gohman	20fab456da	Teach LSR how to cope better with unrolled loops on targets where the addressing modes don't make this trivially easy. This allows it to avoid falling into the less precise heuristics in more cases. llvm-svn: 104186	2010-05-19 23:43:12 +00:00
Dan Gohman	beebef4137	Add a comment. llvm-svn: 104089	2010-05-18 23:55:57 +00:00
Dan Gohman	50f8f2c23d	Fix the predicate which checks for non-sensical formulae which have constants in registers which partially cancel out their immediate fields. llvm-svn: 104088	2010-05-18 23:48:08 +00:00
Dan Gohman	4cf99b5303	Factor out the code for recomputing an LSRUse's Regs set after some of its formulae have been removed into a helper function, and also teach it how to update the RegUseTracker. llvm-svn: 104087	2010-05-18 23:42:37 +00:00
Dan Gohman	a4eca05174	Factor out code for estimating search space complexity into a helper function. llvm-svn: 104082	2010-05-18 22:51:59 +00:00
Dan Gohman	63e9015248	Add some more debug output. llvm-svn: 104080	2010-05-18 22:41:32 +00:00
Dan Gohman	f1c7b1b42f	Factor out the code for deleting a formula from an LSRUse into a helper function. llvm-svn: 104079	2010-05-18 22:39:15 +00:00
Dan Gohman	8aca7ef903	Make some debug output more informative. llvm-svn: 104078	2010-05-18 22:37:37 +00:00
Dan Gohman	06ab08f795	Print an error message in Formula::print if the HasBaseReg flag is inconsistent with the BaseRegs field. It's not print's job to assert on an invalid condition, but it can make one more obvious. llvm-svn: 104077	2010-05-18 22:35:55 +00:00
Dan Gohman	248c41d108	Rename RegUseTracker's RegUses member to RegUsesMap to avoid confusion with LSRInstance's RegUses member. llvm-svn: 104076	2010-05-18 22:33:00 +00:00
Nick Lewycky	b35818eb25	Teach the always inliner to release its inline cost estimates, like the basic inliner did in r103653. Why does the always inliner even bother with cost estimates anyways? llvm-svn: 103858	2010-05-15 04:26:25 +00:00
Nick Lewycky	002a45eb64	Clean up, no functional change. llvm-svn: 103857	2010-05-15 03:41:58 +00:00
Nick Lewycky	2b3cbac0ee	Remove heinous tabs. llvm-svn: 103700	2010-05-13 06:45:13 +00:00
Nick Lewycky	d3c6dfe853	Replace the core comparison login in merge functions. We can now merge vector<>::push_back() in: int foo(vector<int> &a, vector<unsigned> &b) { a.push_back(10); b.push_back(11); } to two calls to the same push_back function, or fold away the two copies of push_back() in: struct T { int; }; struct S { char; }; vector<T> t; vector<S> s; void f(T x) { t.push_back(x); } void g(S x) { s.push_back(x); } but leave f() and g() separate, since they refer to two different global variables. llvm-svn: 103698	2010-05-13 05:48:45 +00:00
Nick Lewycky	c63aa1e8ab	Clear CachedFunctionInfo upon Pass::releaseMemory. Because ValueMap will abort on RAUW of functions, this is a correctness issue instead of a mere memory usage problem. No testcase until the new MergeFunctions can land. llvm-svn: 103653	2010-05-12 21:48:15 +00:00
Duncan Sands	6c5e4355bb	I got tired of VISIBILITY_HIDDEN colliding with the gcc enum. Rename it to LLVM_LIBRARY_VISIBILITY and introduce LLVM_GLOBAL_VISIBILITY, which is the opposite, for future use by dragonegg. llvm-svn: 103495	2010-05-11 20:16:09 +00:00
Douglas Gregor	6739a89117	Fixes for Microsoft Visual Studio 2010, from Steven Watanabe! llvm-svn: 103457	2010-05-11 06:17:44 +00:00
Chris Lattner	84d4618659	make simplifycfg insert an llvm.trap before the 'unreachable' it introduces when it detects undefined behavior. llvm.trap generally codegens into some thing really small (e.g. a 2 byte ud2 instruction on x86) and debugging this sort of thing is "nontrivial". For example, we now compile: void foo() { (int)0 = 42; } into: _foo: pushl %ebp movl %esp, %ebp ud2 Some may even claim that this is a security hole, though that seems dubious to me. This addresses rdar://7958343 - Optimizing away null dereference potentially allows arbitrary code execution llvm-svn: 103356	2010-05-08 22:15:59 +00:00
Chris Lattner	02b0df5338	Teach instcombine to transform a bitcast/(zext\|trunc)/bitcast sequence with a vector input and output into a shuffle vector. This sort of sequence happens when the input code stores with one type and reloads with another type and then SROA promotes to i96 integers, which make everyone sad. This fixes rdar://7896024 llvm-svn: 103354	2010-05-08 21:50:26 +00:00
Chris Lattner	5a62d6e578	Fix PR7052, patch by Jakub Staszak! llvm-svn: 103347	2010-05-08 20:01:44 +00:00
Dan Gohman	d0800241d2	When pruning candidate formulae out of an LSRUse, update the LSRUse's Regs set after all pruning is done, rather than trying to do it on the fly, which can produce an incomplete result. This fixes a case where heuristic pruning was stripping all formulae from a use, which led the solver to enter an infinite loop. Also, add a few asserts to diagnose this kind of situation. llvm-svn: 103328	2010-05-07 23:36:59 +00:00
Devang Patel	32cc43c242	Wrap const MDNode * inside DIDescriptor. llvm-svn: 103295	2010-05-07 20:54:48 +00:00
Devang Patel	4423abd734	Use overloaded operators instead of DIDescriptor::getNode() llvm-svn: 103276	2010-05-07 18:19:32 +00:00
Ted Kremenek	d90773ebe0	Update CMake build. llvm-svn: 103266	2010-05-07 17:13:20 +00:00
Dan Gohman	5d5b8b1b8c	Add an LLVM IR version of code sinking. This uses the same simple algorithm as MachineSink, but it isn't constrained by MachineInstr-level details. llvm-svn: 103257	2010-05-07 15:40:13 +00:00
Bob Wilson	0c8b29bcdb	Use the right version of "append" to combine two SmallVectors. This fixes the compile-time regressions seen in last night's tests. llvm-svn: 103118	2010-05-05 20:44:15 +00:00
Bob Wilson	d1b38e317d	Combine the implementations of the core part of the SSAUpdater and MachineSSAUpdater to avoid duplicating all the code. llvm-svn: 103060	2010-05-04 23:18:19 +00:00
Bob Wilson	a2fda8b648	Defer adding critical edges to the "toSplit" list until after checking for indirect branches in all the predecessors. This avoids unnecessarily splitting edges in cases where load PRE is not possible anyway. Thanks to Jakub Staszak for pointing this out. llvm-svn: 103034	2010-05-04 20:03:21 +00:00
Dan Gohman	1d2ded75e2	Use getConstant instead of getIntegerSCEV. The two are basically the same, now that getConstant has overloads consistent with ConstantInt::get. llvm-svn: 102965	2010-05-03 22:09:21 +00:00
Devang Patel	9f5200a122	Check for side effects before splitting loop. Patch by Jakub Staszak! llvm-svn: 102928	2010-05-03 18:06:58 +00:00
Chris Lattner	b49a622fe9	revert r102831. We already delete dead readonly calls in other places, killing a valid transformation is not the right answer. llvm-svn: 102850	2010-05-01 17:19:38 +00:00
Owen Anderson	550986ea90	Disable the call-deletion transformation introduced in r86975. Without halting analysis, it is illegal to delete a call to a read-only function. The correct solution is almost certainly to add a "must halt" attribute and only allow deletions in its presence. XFAIL the relevant testcase for now. llvm-svn: 102831	2010-05-01 08:34:28 +00:00
Chris Lattner	c2432b9d44	rename InlineInfo.DevirtualizedCalls -> InlinedCalls to reflect that it includes all inlined calls now, not just devirtualized ones. llvm-svn: 102824	2010-05-01 01:26:13 +00:00
Chris Lattner	fc8d9ee6c3	Implement rdar://6295824 and PR6724 with two tiny changes that can have a big effect :). The first is to enable the iterative SCC passmanager juice that kicks in when the scc passmgr detects that a function pass has devirtualized a call. In this case, it will rerun all the passes it manages on the SCC, up to the iteration count limit (4). This is useful because a function pass may devirualize a call, and we want the inliner to inline it, or pruneeh to infer stuff about it, etc. The second patch is to add all call sites to the DevirtualizedCalls list the inliner uses. This list is about to get renamed, but the jist of this is that the inliner now reconsiders all inlined call sites as candidates for further inlining. The intuition is this that in cases like this: f() { g(1); } g(int x) { h(x); } We analyze this bottom up, and may decide that it isn't profitable to inline H into G. Next step, we decide that it is profitable to inline G into F, and do so, which means that F now calls H. Even though the call from G -> H may not have been profitable to inline, the call from F -> H may be (in this case because a constant allows folding etc). In my spot checks, this doesn't have a big impact on code. For example, the LLC output for 252.eon grew from 0.02% (from 317252 to 317308) and 176.gcc actually shrunk by .3% (from 1525612 to 1520964 bytes). 252.eon never iterated in the SCC Passmgr, 176.gcc iterated at most 1 time. llvm-svn: 102823	2010-05-01 01:15:56 +00:00
Chris Lattner	e8262675a3	The inliner has traditionally not considered call sites that appear due to inlining a callee as candidates for futher inlining, but a recent patch made it do this if those call sites were indirect and became direct. Unfortunately, in bizarre cases (see testcase) doing this can cause us to infinitely inline mutually recursive functions into callers not in the cycle. Fix this by keeping track of the inline history from which callsite inline candidates got inlined from. This shouldn't affect any "real world" code, but is required for a follow on patch that is coming up next. llvm-svn: 102822	2010-05-01 01:05:10 +00:00
Devang Patel	3ca9a9b59c	Preserve debug info attached with call instruction while eliminating dead argument. Radar 7927803 llvm-svn: 102760	2010-04-30 20:23:54 +00:00
Chris Lattner	4bd85e47bf	further clarify alignment of globals, fix instcombine to not increase the alignment of globals with an assigned alignment and section. llvm-svn: 102476	2010-04-28 00:31:12 +00:00
Chris Lattner	44a27efdf9	Fix a problem that lower invoke has with allocas (PR6694), and add a version of createLowerInvokePass that allows the client to specify whether it wants "expensive" or "cheap" lowering. Patch by Alex Mac! llvm-svn: 102402	2010-04-26 23:49:32 +00:00
Chris Lattner	87aa2243e2	fix PR6940: sitofp(undef) folds to 0.0, not undef. llvm-svn: 102358	2010-04-26 18:21:23 +00:00
Chris Lattner	b34ffe36ae	remove #if 1's. llvm-svn: 102296	2010-04-25 04:43:02 +00:00
Dan Gohman	534ba376f6	Generalize LSR's OptimizeMax to handle the new kinds of max expressions that indvars may use, now that indvars is recognizing le and ge loops. llvm-svn: 102235	2010-04-24 03:13:44 +00:00
Chris Lattner	d3b361d1b6	enable my inliner change: add newly devirtualized call sites to the worklist, making them inline candidates. llvm-svn: 102213	2010-04-23 21:16:07 +00:00
Chris Lattner	c691de3b4e	switch InlineInfo.DevirtualizedCalls's list to be of WeakVH. This fixes a bug where calls inlined into an invoke would get changed into an invoke but the array would keep pointing to the (now dead) call. The improved inliner behavior is still disabled for now. llvm-svn: 102196	2010-04-23 18:37:01 +00:00
Dan Gohman	997bbc54d6	Fix LSR to tolerate cases where ScalarEvolution initially misses an opportunity to fold add operands, but folds them after LSR has separated them out. This fixes rdar://7886751. llvm-svn: 102157	2010-04-23 01:55:05 +00:00
Chris Lattner	d8d898dbd3	disable my previous inliner patch, it appears to be busting self-host. llvm-svn: 102153	2010-04-23 00:41:03 +00:00
Chris Lattner	2eee5d3467	The inliner was choosing to not consider call sites that appear in the SCC as a result of inlining as candidates for inlining. Change this so that it does consider call sites that change from being indirect to being direct as a result of inlining. This allows it to completely "devirtualize" the testcase. llvm-svn: 102146	2010-04-22 23:37:35 +00:00
Chris Lattner	4ba01ec869	refactor the interface to InlineFunction so that most of the in/out arguments are handled with a new InlineFunctionInfo class. This makes it easier to extend InlineFunction to return more info in the future. llvm-svn: 102137	2010-04-22 23:07:58 +00:00
Chris Lattner	016c00a311	when inlining something like this: define void @f3(void (i8) %__f) ssp { entry: call void %__f(i8* undef) unreachable } define void @f4(i8* %this) ssp align 2 { entry: call void @f3(void (i8) @f2) ssp ret void } The inliner is turning the indirect call to %__f into a direct call to F2. Make the call graph more precise when this happens. The inliner doesn't revisit call sites introduced by inlining, so there isn't an easy way to test for this, but a more precise callgraph is a good thing. llvm-svn: 102131	2010-04-22 21:31:00 +00:00
Chris Lattner	0a3b5b4e39	eliminate dead #include. llvm-svn: 102119	2010-04-22 20:41:10 +00:00
Bob Wilson	4c7f50afb8	Fix a performance problem with the new SSAUpdater. This showed up in the GCCAS time for MultiSource/Benchmarks/ASCI_Purple/SMG2000. llvm-svn: 102009	2010-04-21 18:39:03 +00:00
Devang Patel	2176643241	Rename ValueMapTy as ValueToValueMapTy to clearly indicate that this has no replationship with ADT/ValueMap. llvm-svn: 101950	2010-04-20 22:24:18 +00:00
Devang Patel	382b969647	There is no need to install ValueMapper.h header. llvm-svn: 101949	2010-04-20 22:18:31 +00:00
Gabor Greif	27b3d55194	use abstract accessors to CallInst llvm-svn: 101899	2010-04-20 13:13:04 +00:00
Chris Lattner	66e809acc0	remove a bunch of ad-hoc code to simplify instructions from loop unswitch, and use inst simplify instead. It is more powerful and less duplication. llvm-svn: 101874	2010-04-20 05:33:18 +00:00
Chris Lattner	c707fa9651	move some select simplifications out out instcombine into inst simplify. No functionality change. llvm-svn: 101873	2010-04-20 05:32:14 +00:00
Chris Lattner	5814d9d9da	RewriteLoopBodyWithConditionConstant can end up rewriting the condition we're unswitching on. In this case, don't try to simplify the second copy of the loop which may be dead or not, but is probably a constant now. This fixes PR6879 llvm-svn: 101870	2010-04-20 05:09:16 +00:00
Chris Lattner	a5cdd5e6a2	make the inliner do less work for leaf functions. llvm-svn: 101846	2010-04-20 00:47:08 +00:00
Chris Lattner	e93846762a	Fix rdar://7879828 - crash in CallGraph, a self host issue. Arg promotion was deleting call graph nodes that still had references from the 'indirect' CGN. Like the inliner, it should only delete the function if all references are gone. llvm-svn: 101845	2010-04-20 00:46:50 +00:00
Dan Gohman	e637ff5e9a	Remove the Expr member from IVUsers. Instead of remembering the expression, just ask ScalarEvolution for it on demand. This helps IVUsers be more robust in the case of expressions changing underneath it. This fixes PR6862. llvm-svn: 101819	2010-04-19 21:48:58 +00:00
Bob Wilson	ca51425d94	Re-commit my previous SSAUpdater changes. The previous version naively tried to determine where to place PHIs by iteratively comparing reaching definitions at each block. That was just plain wrong. This version now computes the dominator tree within the subset of the CFG where PHIs may need to be placed, and then places the PHIs in the iterated dominance frontier of each definition. The rest of the patch is mostly the same, with a few more performance improvements added in. llvm-svn: 101612	2010-04-17 03:08:24 +00:00
Eric Christopher	7258dcd77f	Revert 101465, it broke internal OpenGL testing. Probably the best way to know that all getOperand() calls have been handled is to replace that API instead of updating. llvm-svn: 101579	2010-04-16 23:37:20 +00:00
Chris Lattner	4422d31b84	introduce a new CallGraphSCC class, and pass it around to CallGraphSCCPass's instead of passing around a std::vector<CallGraphNode*>. No functionality change, but now we have a much tidier interface. llvm-svn: 101558	2010-04-16 22:42:17 +00:00
Dan Gohman	99e5327bfd	Refine the detection of seemingly infinitely recursive calls where the callee is expected to be expanded to something else by codegen, so that normal infinitely recursive calls are still transformed. llvm-svn: 101468	2010-04-16 15:57:50 +00:00
Gabor Greif	f375520f7b	reapply r101434 with a fix for self-hosting rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101465	2010-04-16 15:33:14 +00:00
Chris Lattner	bd2d9430d6	fix comment noticed by Bob llvm-svn: 101437	2010-04-16 02:32:17 +00:00
Gabor Greif	403e9694f9	back out r101423 and r101397, they break llvm-gcc self-host on darwin10 llvm-svn: 101434	2010-04-16 01:16:20 +00:00
Chris Lattner	1146d326a7	fix PR6832: we were using the alignment of a pointer when we wanted the alignment of the pointee. llvm-svn: 101432	2010-04-16 01:05:38 +00:00
Chris Lattner	b73552908e	improve comments. llvm-svn: 101429	2010-04-16 00:38:19 +00:00
Chris Lattner	78d7dbbc30	pull all the ConvertToScalarInfo code together into one place. llvm-svn: 101427	2010-04-16 00:24:57 +00:00
Chris Lattner	d69c3ee958	more refactoring: suck some stuff out of SRoA into ConvertToScalarInfo. llvm-svn: 101425	2010-04-16 00:20:00 +00:00
Gabor Greif	6af0ad846e	shift intrinsic operand llvm-svn: 101423	2010-04-16 00:06:45 +00:00
Chris Lattner	9ef4eae6e6	introduce a new ConvertToScalarInfo struct to simplify CanConvertToScalar/MergeInType. Eliminate a pointless LLVMContext argument to MergeInType. llvm-svn: 101422	2010-04-15 23:50:26 +00:00
Chris Lattner	9c1172d848	tidy interface to isOnlyCopiedFromConstantGlobal llvm-svn: 101405	2010-04-15 21:59:20 +00:00
Gabor Greif	33ae80bff7	reapply r101364, which has been backed out in r101368 with a fix rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101397	2010-04-15 20:51:13 +00:00
Anton Korobeynikov	839cdaa70a	Revert r100896 and around - this breaks the only mingw32 buildbot we have. llvm-svn: 101387	2010-04-15 19:51:42 +00:00
Dan Gohman	b29cda9b3c	Fix a bunch of namespace polution. llvm-svn: 101376	2010-04-15 17:08:50 +00:00
Gabor Greif	9fd00c7d25	back out r101364, as it trips the linux nightlybot on some clang C++ tests llvm-svn: 101368	2010-04-15 12:46:56 +00:00
Gabor Greif	aafd209632	rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101364	2010-04-15 10:49:53 +00:00
Tobias Grosser	de1a37b872	IPO needs ScalarOpts and InstCombine in its libs The commit "Adding IPSCCP and Internalize passes to the C-bindings" introduced new dependencies for IPO. Add these to the CMAKE build as otherwise the BUILD_SHARED_LIBS=1 build fails. llvm-svn: 101313	2010-04-14 23:42:23 +00:00
Evan Cheng	21b588b678	- Code clean up to reduce indentation. - TryToOptimizeStoreOfMallocToGlobal should check if TargetData is available and bail out if it is not. The transformations being done requires TD. llvm-svn: 101285	2010-04-14 20:52:55 +00:00
Gabor Greif	c08e5df836	performance: cache the dereferenced use_iterator llvm-svn: 101253	2010-04-14 16:48:56 +00:00
Gabor Greif	a49686fa3e	performance: cache the dereferenced use_iterator llvm-svn: 101250	2010-04-14 16:13:56 +00:00
Nick Lewycky	163a743b51	I don't know how, but I managed to goof the revert. Remove function that should have been removed in r101231. llvm-svn: 101232	2010-04-14 05:03:50 +00:00
Nick Lewycky	ca615eb0d6	Revert r101213. llvm-svn: 101231	2010-04-14 04:51:58 +00:00
Nick Lewycky	087d59cf25	Remove tab. llvm-svn: 101223	2010-04-14 04:19:05 +00:00
Nick Lewycky	3cdae269f0	While DAE can't modify the function signature of an externally visible function, it can check whether the visible direct callers are passing in parameters to dead arguments and replace those with undef. This reinstates r94322 with bugs fixed. llvm-svn: 101213	2010-04-14 03:38:11 +00:00
Eric Christopher	4016dcd625	Actually... return after the check for invalid input. llvm-svn: 101139	2010-04-13 16:41:29 +00:00
Owen Anderson	b516f1c6cc	Remove SCCVN from the CMake build system. llvm-svn: 101125	2010-04-13 08:33:09 +00:00
Owen Anderson	9ed6abfe0b	SCCVN, we hardly knew ye! llvm-svn: 101117	2010-04-13 05:24:08 +00:00
Dan Gohman	5867a56db8	Teach IndVarSimplify how to eliminate remainder operators where the numerator is an induction variable. For example, with code like this: for (i=0;i<n;++i) x[i%n] = 0; IndVarSimplify will now recognize that i is always less than n inside the loop, and eliminate the remainder. llvm-svn: 101113	2010-04-13 01:46:36 +00:00
Dan Gohman	4a645b88ef	Suppress LinearFunctionTestReplace when the computed backedge-taken expression is a UDiv and it doesn't appear that the UDiv came from the user's source. ScalarEvolution has recently figured out how to compute a tripcount expression for the inner loop in SingleSource/Benchmarks/Shootout/sieve.c, using a udiv. Emitting a udiv instruction dramatically slows down the enclosing loop. llvm-svn: 101068	2010-04-12 21:13:43 +00:00
Dan Gohman	27c8e79839	Delete this code, which is no longer needed. llvm-svn: 101033	2010-04-12 08:00:22 +00:00
Dan Gohman	07f6563e81	Move the EliminateIVUsers call back out to its original location. Now that a ScalarEvolution bug with overflow handling is fixed, the normal analysis code will automatically decline to operate on the icmp instructions which are responsible for the loop exit. llvm-svn: 101032	2010-04-12 07:56:56 +00:00
Dan Gohman	15f90c294c	Use RecursivelyDeleteTriviallyDeadInstructions in EliminateIVComparisons, instead of deleting just the user. This makes it more consistent with other code in IndVarSimplify, and theoretically can eliminate more users earlier. llvm-svn: 101027	2010-04-12 07:29:15 +00:00
Eric Christopher	1f272f7fd8	Verify function prototypes before trying to optimize functions. We also need TargetData, just return false if we don't have it. Update testcases accordingly. Fixes PR6807. llvm-svn: 101011	2010-04-12 04:48:00 +00:00
Dan Gohman	fa5ad797e3	Re-apply r101000, with a fix: Don't eliminate an icmp which is part of the loop exit test. This usually doesn't come up for a variety of reasons, but it isn't impossible, so make IndVarSimplify handle it conservatively. llvm-svn: 101008	2010-04-12 02:21:50 +00:00
Dan Gohman	c0f1efaf8d	Revert 101000, which is breaking self-host builds. llvm-svn: 101002	2010-04-12 00:17:10 +00:00
Dan Gohman	af4ab1b681	Teach IndVarSimplify how to eliminate comparisons involving induction variables. For example, with code like this: for (i=0;i<n;++i) if (i<n) x[i] = 0; IndVarSimplify will now recognize that i is always less than n inside the loop, and eliminate the if. llvm-svn: 101000	2010-04-11 23:10:12 +00:00
Dan Gohman	b50349a979	Rename isLoopGuardedByCond to isLoopEntryGuardedByCond, to emphasise that it's only testing for the entry condition, not full loop-invariant conditions. llvm-svn: 100979	2010-04-11 19:27:13 +00:00
Chris Lattner	4568ed7893	Implement support for varargs functions without any fixed parameters in the CBE by implicitly adding a fixed argument. This allows eliminating a work-around from DAE. Patch by Sylvere Teissier! llvm-svn: 100944	2010-04-10 19:12:44 +00:00
Chris Lattner	9ae28b141f	fix PR6743, a case where we'd delete an instruction before using it in some cases. llvm-svn: 100937	2010-04-10 18:26:57 +00:00
Chris Lattner	b9801ffcb5	fix PR6760, a missing check in heap SRoA. llvm-svn: 100936	2010-04-10 18:19:22 +00:00
Dan Gohman	607e02b33a	When determining a canonical insert position, don't climb deeper into adjacent loops. Also, ensure that the insert position is dominated by the loop latch of any loop in the post-inc set which has a latch. llvm-svn: 100906	2010-04-09 22:07:05 +00:00
Chris Lattner	74e2ef68b9	suck the propagating "has dynamic libs" check into a single makefile variable TARGET_HAS_DYNAMIC_LIBS llvm-svn: 100896	2010-04-09 20:51:47 +00:00
Chris Lattner	c86cdc7d47	add minix support, patch by Kees van Reeuwijk! PR6797 llvm-svn: 100895	2010-04-09 20:45:04 +00:00
Wesley Peck	a2ca3fa781	Adding IPSCCP and Internalize passes to the C-bindings llvm-svn: 100893	2010-04-09 20:43:20 +00:00
Dan Gohman	42ec4eb351	When looking for loop-invariant users, look through no-op instructions, so that an unfortunately placed bitcast doesn't pin a value in a register. llvm-svn: 100883	2010-04-09 19:12:34 +00:00
Gabor Greif	ef60190a00	performance: cache result of looking up user llvm-svn: 100862	2010-04-09 15:18:34 +00:00
Dan Gohman	0a8175d1db	Minor code simplification. llvm-svn: 100859	2010-04-09 14:53:59 +00:00
Gabor Greif	ce6dd889ec	const-ize a predicate llvm-svn: 100856	2010-04-09 10:57:00 +00:00
Dan Gohman	d2df643ddb	Refactor the code for computing the insertion point for an expression into a separate function. llvm-svn: 100845	2010-04-09 02:00:38 +00:00
Chris Lattner	c6c153be45	fix a SCCP miscompilation that could happen when a forced constant is changed to a constant, we would end up adding the instruction to the wrong worklist, preventing it from being properly revisited. This fixes rdar://7832370 llvm-svn: 100837	2010-04-09 01:14:31 +00:00
Dan Gohman	9b5d0bb774	Avoid allocating a value of zero in a register if the initial formula inputs happen to negate each other. llvm-svn: 100828	2010-04-08 23:36:27 +00:00
Dan Gohman	4ce1fb1448	Add variants of ult, ule, etc. which take a uint64_t RHS, for convenience. llvm-svn: 100824	2010-04-08 23:03:40 +00:00
Dan Gohman	4506539d84	When expanding expressions which are using post-inc mode for multiple loops, ensure that the expansion is dominated by the increments of those loops. llvm-svn: 100748	2010-04-08 05:57:57 +00:00
Dan Gohman	eb7111b98f	Say bitcast instead of bitconvert. llvm-svn: 100720	2010-04-07 23:22:42 +00:00
Eric Christopher	e8b281c3c3	Add support for stpncpy_chk. llvm-svn: 100710	2010-04-07 23:00:07 +00:00
Chris Lattner	2104b8d36e	rename llvm::llvm_report_error -> llvm::report_fatal_error llvm-svn: 100709	2010-04-07 22:58:41 +00:00
Dan Gohman	d006ab90dd	Generalize IVUsers to track arbitrary expressions rather than expressions explicitly split into stride-and-offset pairs. Also, add the ability to track multiple post-increment loops on the same expression. This refines the concept of "normalizing" SCEV expressions used for to post-increment uses, and introduces a dedicated utility routine for normalizing and denormalizing expressions. This fixes the expansion of expressions which are post-increment users of more than one loop at a time. More broadly, this takes LSR another step closer to being able to reason about more than one loop at a time. llvm-svn: 100699	2010-04-07 22:27:08 +00:00
Gabor Greif	08d85da6cc	fix 80-col violations llvm-svn: 100677	2010-04-07 18:59:26 +00:00
Gabor Greif	df323a51f5	performance: get rid of repeated dereferencing of use_iterator by caching its result llvm-svn: 100550	2010-04-06 19:32:30 +00:00
Gabor Greif	679728790b	make more two predicates constant llvm-svn: 100549	2010-04-06 19:24:18 +00:00
Gabor Greif	08355d6cda	performance: get rid of repeated dereferencing of use_iterator by caching its result llvm-svn: 100547	2010-04-06 19:14:05 +00:00
Gabor Greif	a21bc0fbd5	const-ize predicate ValueIsOnlyUsedLocallyOrStoredToOneGlobal llvm-svn: 100546	2010-04-06 18:58:22 +00:00
Gabor Greif	0439789023	use CallSite to access calls vs. invokes uniformly and remove assumptions about operand order llvm-svn: 100544	2010-04-06 18:45:08 +00:00
Chris Lattner	adca608281	fix a really nasty bug that Evan was tracking in SCCP. When resolving undefs in branches/switches, we have two cases: a branch on a literal undef or a branch on a symbolic value which is undef. If we have a literal undef, the code was correct: forcing it to a constant is the right thing to do. If we have a branch on a symbolic value that is undef, we should force the symbolic value to a constant, which then makes the successor block live. Forcing the condition of the branch to being a constant isn't safe if later paths become live and the value becomes overdefined. This is the case that 'forcedconstant' is designed to handle, so just use it. This fixes rdar://7765019 but there is no good testcase for this, the one I have is too insane to be useful in the future. llvm-svn: 100478	2010-04-05 22:14:48 +00:00
Chris Lattner	c832c1bf69	some code cleanups, use SwitchInst::findCaseValue, reduce indentation llvm-svn: 100468	2010-04-05 21:18:32 +00:00
Evan Cheng	ba930449a9	Code clean up. llvm-svn: 100467	2010-04-05 21:16:25 +00:00
Mon P Wang	c576ee9040	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Chris Lattner	ecb536313f	require that the branch being controlled by the IV exits the loop. With this information we can guarantee the iteration count of the loop is bounded by the compare. I think this xforms is finally safe now. llvm-svn: 100285	2010-04-03 07:21:39 +00:00
Chris Lattner	40060d33f6	add integer overflow check for the fp induction variable checker. Amusingly, we already had tests that we should have rejects because they would be miscompiled in the testsuite. The remaining issue with this is that we don't check that the branch causes us to exit the loop if it fails, so we don't actually know if we remain in bounds. llvm-svn: 100284	2010-04-03 07:18:48 +00:00
Chris Lattner	69913466cb	add a comment and fix some consistency issues, converting to a signed vs unsigned value depending on the sign of the constant fp means that we can't distinguish between a truly negative number and a positive number so large the 32nd bit is set. So, do don't this! llvm-svn: 100283	2010-04-03 06:41:49 +00:00
Chris Lattner	40ea690f39	fix PR6761, a miscompilation due to the fp->int IV conversion stuff. More bugs remain though. llvm-svn: 100282	2010-04-03 06:30:03 +00:00
Chris Lattner	42202868c3	just eliminate the uitofp checks. This code isn't doing the required validity checks in the first place, and supporting a condition large enough to require the 32'nd bit isn't worth it. llvm-svn: 100280	2010-04-03 06:25:21 +00:00
Chris Lattner	ca25b60f4e	rename PH -> PN to be consistent with WeakPN and the rest of llvm. llvm-svn: 100276	2010-04-03 06:17:08 +00:00
Chris Lattner	774858fc38	improve comment and drop a dead check. If PH had no uses, it would have been deleted by RecursivelyDeleteTriviallyDeadInstructions llvm-svn: 100275	2010-04-03 06:16:22 +00:00
Chris Lattner	915322bc4a	strength reduce a ridiculous use of APInt. llvm-svn: 100274	2010-04-03 06:13:12 +00:00
Chris Lattner	0b941347f9	rename stuff improve comment grammar. llvm-svn: 100273	2010-04-03 06:11:07 +00:00
Chris Lattner	d77bde5f94	simplify some code and resolve a fixme. llvm-svn: 100272	2010-04-03 06:06:59 +00:00
Chris Lattner	2ff33f91d5	There is no guarantee that the increment and the branch are in the same block. Insert the new increment in the correct location. Also, more cleanups. llvm-svn: 100271	2010-04-03 06:05:10 +00:00
Chris Lattner	c558b49f14	first half of a pass through IndVarSimplify::HandleFloatingPointIV, this cleans up a bunch of code and also fixes several crashes and miscompiles. More to come unfortunately, this optimization is quite broken. llvm-svn: 100270	2010-04-03 05:54:59 +00:00
Chris Lattner	2e23e5284c	don't internalize available_externally functions, they are really just declarations. This is related to PR6524 llvm-svn: 100269	2010-04-03 05:24:50 +00:00
Bob Wilson	f1aa4743d9	Revert all my SSAUpdater patches. The PHI placement algorithm is not correct (what was I thinking?) and there's also a problem with LCSSA. I'll try again later with fixes. --- Reverse-merging r100263 into '.': U lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100177 into '.': G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100148 into '.': G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100147 into '.': U include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100131 into '.': G include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100130 into '.': G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100126 into '.': G include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100050 into '.': D test/Transforms/GVN/2010-03-31-RedundantPHIs.ll --- Reverse-merging r100047 into '.': G include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp llvm-svn: 100264	2010-04-03 03:50:38 +00:00
Bob Wilson	25f1aefd5b	Add a DEBUG_TYPE for the SSAUpdater. llvm-svn: 100263	2010-04-03 03:28:44 +00:00
Evan Cheng	ed66db3f9b	Code refactoring. llvm-svn: 100262	2010-04-03 02:23:43 +00:00
Mon P Wang	999c1b927b	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a972ab8564	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Dan Gohman	f7239102fe	Manually notify ScalarEvolution before making an operand replacement, since it can't currently observe such changes automatically. llvm-svn: 100186	2010-04-02 14:48:31 +00:00
Bob Wilson	3c54edf9b3	Recommit 100158 now that the buildbots are happy again. llvm-svn: 100177	2010-04-02 05:09:46 +00:00
Dan Gohman	4bd755419f	Revert the recent alignment changes. They're broken for -Os because, in particular, they end up aligning strings at 16-byte boundaries, and there's no way for GlobalOpt to check OptForSize. llvm-svn: 100172	2010-04-02 03:04:37 +00:00
Bob Wilson	0389adcd73	Revert 100158 in case it is causing some of the buildbot problems. llvm-svn: 100164	2010-04-02 01:22:49 +00:00
Dan Gohman	c671347fcb	Make globalopt refine global variable alignment. llvm-svn: 100160	2010-04-02 00:14:16 +00:00
Bob Wilson	9af4e118c6	Check for terminating conditions before adding PHIs to the worklists. This is more efficient than adding them to the worklist and then ignoring them. llvm-svn: 100158	2010-04-02 00:10:41 +00:00
Bob Wilson	737195069a	Remove trailing whitespace. llvm-svn: 100148	2010-04-01 23:06:38 +00:00
Bob Wilson	37b73d9d3e	Rewrite another SSAUpdater function to avoid recursion. llvm-svn: 100147	2010-04-01 23:05:58 +00:00
Bob Wilson	8409feadf0	Change another SSAUpdater function to avoid recursion. llvm-svn: 100131	2010-04-01 20:04:30 +00:00
Bob Wilson	043c0406f7	Simplify the code to check for existing PHIs, now that it is only used in one place. This removes the template function added in svn 94690. llvm-svn: 100130	2010-04-01 19:53:48 +00:00
Bob Wilson	38fc88ee5d	The SSAUpdater should avoid recursive traversals of the CFG, since that may blow out the stack for really big functions. Start by fixing an easy case. llvm-svn: 100126	2010-04-01 18:46:59 +00:00
Gabor Greif	5d5db5342b	Introduce ImmutableCallSite, useful for contexts where no mutation is necessary. Inherits from new templated baseclass CallSiteBase<> which is highly customizable. Base CallSite on it too, in a configuration that allows full mutation. Adapt some call sites in analyses to employ ImmutableCallSite. llvm-svn: 100100	2010-04-01 08:21:08 +00:00
Nick Lewycky	bfb50a0d43	Clean up this file a little, no functionality change. This is a subset of my patch back in r94322. llvm-svn: 100097	2010-04-01 07:34:00 +00:00
Bob Wilson	ac229124f4	Rewrite part of the SSAUpdater to be more careful about inserting redundant PHIs. The previous algorithm was unable to reliably detect when existing PHIs in a cycle can be reused. I'm still working on reducing a testcase. Radar 7711900. llvm-svn: 100047	2010-03-31 20:51:00 +00:00
Dale Johannesen	b67a6e6620	Fix a nasty dangling-pointer heisenbug that could generate wrong code pretty much anywhere AFAICT. A case that hits the bug reproducibly is impossible, but the situation was like this: Addr = ... Store -> Addr Addr2 = GEP , 0, 0 Store -> Addr2 Handling the first store, the code changed replaced Addr with a sunkaddr and deleted Addr, but not its table entry. Code in OptimizedBlock replaced Addr2 with a bitcast; if that happened to reuse the memory of Addr, the old table entry was erroneously found when handling the second store. llvm-svn: 100044	2010-03-31 20:37:15 +00:00
Bob Wilson	6f7fd28824	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	7460571381	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Dan Gohman	39027c403c	Fix a grammaro. llvm-svn: 99917	2010-03-30 20:04:57 +00:00
Gabor Greif	b469818279	fix two cases where the arguments were extracted from the wrong range out of the InvokeInst spotted by baldrick -- thanks\! llvm-svn: 99914	2010-03-30 19:20:53 +00:00
Jeffrey Yasskin	12fd516e51	Remove another memory leak from ABCD by using Edges by value instead of pointer. There was also a SmallPtrSet whose settiness wasn't being used, so I changed it to a SmallVector. llvm-svn: 99713	2010-03-27 09:09:17 +00:00
Jeffrey Yasskin	97e613b6da	In ABCD, change the non-null Bound*s to Bound&s. llvm-svn: 99711	2010-03-27 08:15:46 +00:00
Jeffrey Yasskin	33bc7e4cb5	Fix a memory leak in ABCD by giving ownership of Bound objects to the MemoizedResultChart. llvm-svn: 99710	2010-03-27 08:09:24 +00:00
Eric Christopher	81c03447fc	When we promote a load of an argument make sure to take the alignment of the previous load - it's usually important. For example, we don't want to blindly turn an unaligned load into an aligned one. llvm-svn: 99699	2010-03-27 01:54:00 +00:00
Dan Gohman	d42e09d91e	Ignore debug intrinsics in yet more places. llvm-svn: 99580	2010-03-26 00:33:27 +00:00
Gabor Greif	6c6b2fd2b2	rename pred_const_iterator to const_pred_iterator for consistency's sake llvm-svn: 99567	2010-03-25 23:25:28 +00:00
Gabor Greif	c78d720f02	rename use_const_iterator to const_use_iterator for consistency's sake llvm-svn: 99564	2010-03-25 23:06:16 +00:00
Chris Lattner	0563804982	fix PR6642, GVN forwarding from memset to load of the base of the memset. llvm-svn: 99488	2010-03-25 05:58:19 +00:00
Eric Christopher	1d38538fb6	Temporarily revert this, it's causing an issue with an internal project. llvm-svn: 99451	2010-03-24 23:35:21 +00:00
Evan Cheng	c12c2d9bb4	Move OptChkCall off LibCallOptimization into StrCpyOpt. llvm-svn: 99418	2010-03-24 20:19:04 +00:00
Gabor Greif	a2fbc0ae1b	Finally land the InvokeInst operand reordering. I have audited all getOperandNo calls now, fixing hidden assumptions. CallSite related uglyness will be eliminated successively. Note this patch has a long and griveous history, for all the back-and-forths have a look at CallSite.h's log. llvm-svn: 99399	2010-03-24 13:21:49 +00:00
Gabor Greif	be18ae6781	tighten a type and remove trailing whitespace, no functional changes llvm-svn: 99398	2010-03-24 11:58:07 +00:00
Gabor Greif	9027ffb918	increase const goodness and remove pointless getUser() calls llvm-svn: 99395	2010-03-24 10:29:52 +00:00
Gabor Greif	11ff53146f	cache result of UI.getOperandNo() instead of calling it twice, it is cheaper this way llvm-svn: 99394	2010-03-24 10:12:54 +00:00
Chris Lattner	00eeac4179	add some accessors to callsite/callinst/invokeinst to check for the noinline attribute, and make the inliner refuse to inline a call site when the call site is marked noinline even if the callee isn't. This fixes PR6682. llvm-svn: 99341	2010-03-23 22:59:07 +00:00
Bill Wendling	04803e8ef6	Skip debugging intrinsics when sinking unused invariants. llvm-svn: 99324	2010-03-23 21:15:59 +00:00
Evan Cheng	d9e822345c	Teach simplify libcall to transform __strcpy_chk to __memcpy_chk to enable optimizations down stream. llvm-svn: 99282	2010-03-23 15:48:04 +00:00
Gabor Greif	161cb044f3	add assert in argpromotion, which cannot trigger if Function::hasAddressTaken works as advertised also included some cosmetic cleanups llvm-svn: 99276	2010-03-23 14:40:20 +00:00
Evan Cheng	3f7842232e	Fix an incorrect logic causing instcombine to miss some _chk -> non-chk transformations. llvm-svn: 99263	2010-03-23 06:06:09 +00:00
Evan Cheng	9a7b270825	Fix 80 col violation. llvm-svn: 99224	2010-03-22 22:44:31 +00:00
Gabor Greif	e1517a084f	backing out r99170 because it still fails on clang-x86_64-darwin10-fnt llvm-svn: 99171	2010-03-22 09:11:00 +00:00
Gabor Greif	7a743e15e3	Now that hopefully all direct accesses to InvokeInst operands are fixed we can reapply the InvokeInst operand reordering patch. (see r98957). llvm-svn: 99170	2010-03-22 08:28:00 +00:00
Gabor Greif	febf6ab718	Add a setCalledFunction member to InvokeInst (like in CallInst) and use this (as well as getCalledValue) to access the callee, instead of {g\|s}etOperand(0). llvm-svn: 99084	2010-03-20 21:00:25 +00:00
Dan Gohman	1a2abe5580	Clear the SCEVExpander's insertion point after making deletions, so that the SCEVExpander doesn't retain a dangling pointer as its insert position. The dangling pointer in this case wasn't ever used to insert new instructions, but it was causing trouble with SCEVExpander's code for automatically advancing its insert position past debug intrinsics. This fixes use-after-free errors that valgrind noticed in test/Transforms/IndVarSimplify/2007-06-06-DeleteDanglesPtr.ll and test/Transforms/IndVarSimplify/exit_value_tests.ll. llvm-svn: 99036	2010-03-20 03:53:53 +00:00
Gabor Greif	6c56ed847e	back out r98957, it broke http://smooshlab.apple.com:8010/builders/clang-x86_64-darwin10-fnt/builds/703 in the nightly test suite llvm-svn: 98958	2010-03-19 13:50:02 +00:00
Gabor Greif	8335f9c0bf	Recommit r80858 again (which has been backed out in r80871). This time I did a self-hosted bootstrap on Linux x86-64, with no problems. Let's see how darwin 64-bit self-hosting goes. At the first sign of failure I'll back this out. Maybe the valgrind bots give me a hint of what may be wrong (it at all). llvm-svn: 98957	2010-03-19 11:55:53 +00:00
Benjamin Kramer	f2e4b5dd7f	str[r]chr returns its pointer argument so we cannot mark it as nocapture. Thanks to Duncan for spotting my mistake. llvm-svn: 98671	2010-03-16 20:33:15 +00:00
Benjamin Kramer	5cf5fd2ffa	Mark str[r]chr readonly. llvm-svn: 98663	2010-03-16 19:36:43 +00:00
Devang Patel	45c1505bf6	Skip debug info intrinsics. llvm-svn: 98584	2010-03-15 22:23:03 +00:00
Devang Patel	b21991c4f5	Skip debug info intrinsics. llvm-svn: 98581	2010-03-15 21:25:29 +00:00
Devang Patel	d3f41e8939	In "empty" bb, the return instruction may not be first instruction, if dbg value intrinsics are present in this bb. Use terminator to find return instructions. llvm-svn: 98565	2010-03-15 19:05:46 +00:00
Bill Wendling	55e69d179b	Skip over debug info when trying to merge two return BBs. llvm-svn: 98491	2010-03-14 10:40:55 +00:00
Bill Wendling	ee84f27536	Make returns more consistent with others. llvm-svn: 98490	2010-03-14 10:40:28 +00:00
Benjamin Kramer	a956527c92	Add a virtual destructor and give vtable a home. llvm-svn: 98376	2010-03-12 20:41:29 +00:00
Benjamin Kramer	7b88a49f3e	Factor checked library call optimization into a common helper class and use it to unify the almost identical code in CodeGenPrepare and InstCombineCalls. llvm-svn: 98338	2010-03-12 09:27:41 +00:00
Nate Begeman	2e41605d4f	Whoops this already existed. llvm-svn: 98297	2010-03-11 23:21:19 +00:00
Nate Begeman	5daa235c91	Add a handful of additional useful pass manager things to the C API llvm-svn: 98296	2010-03-11 23:06:07 +00:00
Benjamin Kramer	2fc395659c	stpcpy is so similar to strcpy, it doesn't deserve a complete copy of the __strcpy_chk -> strcpy code. llvm-svn: 98284	2010-03-11 20:45:13 +00:00
Eric Christopher	607de1de53	Lower stpcpy_chk when possible. llvm-svn: 98274	2010-03-11 19:24:34 +00:00
Eric Christopher	103e3ef893	Fix typo. llvm-svn: 98260	2010-03-11 17:45:38 +00:00
Eric Christopher	4b7948e09e	Do some final lowering in CodeGenPrepare of _chk calls similar to that in InstCombineCalls. More call lowering needed. llvm-svn: 98228	2010-03-11 02:41:03 +00:00
Eric Christopher	43dc11c525	Add strncpy libcall creator. Use it when it should be used. llvm-svn: 98219	2010-03-11 01:25:07 +00:00
Dan Gohman	2734ebd37f	Add a DominatorTree argument to isLCSSA so that it doesn't have to compute a set of reachable blocks for itself each time it is called, which is fairly frequently. llvm-svn: 98179	2010-03-10 19:38:49 +00:00
Dan Gohman	b7e0b87441	Fix a comment. llvm-svn: 98122	2010-03-10 02:18:48 +00:00
Jakob Stoklund Olesen	b495cad7ca	Try to keep the cached inliner costs around for a bit longer for big functions. The Caller cost info would be reset everytime a callee was inlined. If the caller has lots of calls and there is some mutual recursion going on, the caller cost info could be calculated many times. This patch reduces inliner runtime from 240s to 0.5s for a function with 20000 small function calls. This is a more conservative version of r98089 that doesn't break the clang test CodeGenCXX/temp-order.cpp. That test relies on rather extreme inlining for constant folding. llvm-svn: 98099	2010-03-09 23:02:17 +00:00
Jakob Stoklund Olesen	4497475905	Revert r98089, it was breaking a clang test. llvm-svn: 98094	2010-03-09 22:43:37 +00:00
Jakob Stoklund Olesen	741dec43e4	Try to keep the cached inliner costs around for a bit longer for big functions. The Caller cost info would be reset everytime a callee was inlined. If the caller has lots of calls and there is some mutual recursion going on, the caller cost info could be calculated many times. This patch reduces inliner runtime from 240s to 0.5s for a function with 20000 small function calls. llvm-svn: 98089	2010-03-09 22:17:11 +00:00
Jakob Stoklund Olesen	d62c2f554c	Add inlining threshold to log output. llvm-svn: 98024	2010-03-09 00:59:53 +00:00
Evan Cheng	4f2fd2d2be	Re-commit 97860 with fix. getMallocAllocatedType may return null. llvm-svn: 98000	2010-03-08 22:54:36 +00:00
Devang Patel	3b548aa8e2	Avoid using DIDescriptor.isNull(). This is a first step towards eliminating checks in Descriptor constructors. llvm-svn: 97975	2010-03-08 20:52:55 +00:00
Devang Patel	bc97f6b757	Revert r97947. llvm-svn: 97963	2010-03-08 19:20:38 +00:00
Devang Patel	fe28599f6f	Avoid using DIDescriptor.isNull(). This is a first step towards eliminating unncessary constructor checks in light weight DIDescriptor wrappers. llvm-svn: 97947	2010-03-08 18:25:48 +00:00
Eric Christopher	1810d77cb4	Let the fallthrough handle whether or not we've changed anything before we try to optimize. llvm-svn: 97876	2010-03-06 10:59:25 +00:00
Eric Christopher	a7fb58f5f5	Migrate _chk call lowering from SimplifyLibCalls to InstCombine. Stub out the remainder of the calls that we should lower in some way and move the tests to the new correct directory. Fix up tests that are now optimized more than they were before by -instcombine. llvm-svn: 97875	2010-03-06 10:50:38 +00:00
Eric Christopher	d8b43d0e59	Temporarily revert: Log: Transform @llvm.objectsize to integer if the argument is a result of malloc of known size. Modified: llvm/trunk/lib/Transforms/InstCombine/InstCombineCalls.cpp llvm/trunk/test/Transforms/InstCombine/objsize.ll It appears to be causing swb and nightly test failures. llvm-svn: 97866	2010-03-06 03:11:35 +00:00
Evan Cheng	afdc7d3aab	Transform @llvm.objectsize to integer if the argument is a result of malloc of known size. llvm-svn: 97860	2010-03-06 01:01:42 +00:00
Ted Kremenek	65bb311629	Update CMake build. llvm-svn: 97846	2010-03-05 22:34:16 +00:00
Eric Christopher	87abfc506f	Move SimplifyLibCalls's LibCall builders to a separate file so they can be used in more places. Add an argument for the TargetData that most of them need. Update for the getInt8PtrTy() change. Should be no functionality change. llvm-svn: 97844	2010-03-05 22:25:30 +00:00
Evan Cheng	d214ed0e75	Safely turn memset_chk etc. to non-chk variant if the known object size is >= memset / memcpy / memmove size. llvm-svn: 97828	2010-03-05 20:59:47 +00:00
Evan Cheng	fffdad58ac	Instcombine should turn llvm.objectsize of a alloca with static size to an integer. llvm-svn: 97827	2010-03-05 20:47:23 +00:00
Chris Lattner	f6befffbb2	fix PR6512, a case where instcombine would incorrectly merge loads from different addr spaces. llvm-svn: 97813	2010-03-05 18:53:28 +00:00
Chris Lattner	067459c62b	Fix PR6503. This turned into a much more interesting and nasty bug. Various parts of the cmp\|cmp and cmp&cmp folding logic wasn't prepared for vectors (unrelated to the bug but noticed while in the code) and the code was definitely not safe to use by the (cast icmp)\|(cast icmp) handling logic that I added in r95855. Fix all this up by changing the various routines to more consistently use IRBuilder and not pass in the I which had the wrong type. llvm-svn: 97801	2010-03-05 08:46:26 +00:00
Chris Lattner	343d2e48b2	simplify some functions and make them work with vector compares, noticed by inspection. llvm-svn: 97795	2010-03-05 07:47:57 +00:00
Chris Lattner	c6c1523f59	fix a nice subtle reassociate bug which would only occur in a very specific use pattern embodied in the carefully reduced testcase. llvm-svn: 97794	2010-03-05 07:18:54 +00:00
Eric Christopher	4899cbc77d	Move GetStringLength and helper from SimplifyLibCalls to ValueTracking. No functionality change. llvm-svn: 97793	2010-03-05 06:58:57 +00:00
Evan Cheng	43d6ff7701	Add missing break for Intrinsic::objectsize case. It was falling through to the following Intrinsic::bswap code. I have no idea why it wasn't breaking stuff. llvm-svn: 97774	2010-03-05 01:22:47 +00:00
Dan Gohman	29707de4fe	Make SCEVExpander and LSR more aggressive about hoisting expressions out of loops. llvm-svn: 97642	2010-03-03 05:29:13 +00:00
Bill Wendling	af13d82945	This test case: long test(long x) { return (x & 123124) \| 3; } Currently compiles to: _test: orl $3, %edi movq %rdi, %rax andq $123127, %rax ret This is because instruction and DAG combiners canonicalize (or (and x, C), D) -> (and (or, D), (C \| D)) However, this is only profitable if (C & D) != 0. It gets in the way of the 3-addressification because the input bits are known to be zero. llvm-svn: 97616	2010-03-03 00:35:56 +00:00
Dan Gohman	52f5563973	Non-affine post-inc SCEV expansions have more code which must be emitted after the increment. Make sure the insert position reflects this. This fixes PR6453. llvm-svn: 97537	2010-03-02 01:59:21 +00:00
Dan Gohman	6f34abd092	Floating-point add, sub, and mul are now spelled fadd, fsub, and fmul, respectively. llvm-svn: 97531	2010-03-02 01:11:08 +00:00
Bob Wilson	0fd415820b	Don't attempt load PRE when there is no real redundancy (i.e., the load is in a loop and is itself the only dependency). llvm-svn: 97526	2010-03-02 00:09:29 +00:00
Bob Wilson	892432b7ef	When GVN needs to split critical edges for load PRE, check all of the predecessors before returning. Otherwise, if multiple predecessor edges need splitting, we only get one of them per iteration. This makes a small but measurable compile time improvement with -enable-full-load-pre. llvm-svn: 97521	2010-03-01 23:37:32 +00:00
Evan Cheng	7263cf8431	MemoryDepAnalysis is not used if redundant load processing is disabled. llvm-svn: 97512	2010-03-01 22:23:12 +00:00
Dan Gohman	39917c7c81	Add some debug output to LoopSimplify. llvm-svn: 97458	2010-03-01 17:55:27 +00:00
Dan Gohman	8b0a419eb1	Spelling fixes. llvm-svn: 97453	2010-03-01 17:49:51 +00:00
Dan Gohman	0c39a35457	Prune #includes. llvm-svn: 97448	2010-03-01 17:42:17 +00:00
Bob Wilson	1136166ee9	Revert r97245 which seems to be causing performance problems. llvm-svn: 97366	2010-02-28 05:34:05 +00:00
Chris Lattner	2af7e3dceb	fix grammaro's pointed out by daniel llvm-svn: 97313	2010-02-27 07:50:40 +00:00
Chris Lattner	d887f1da73	fix PR6414, a nondeterminism issue in IPSCCP which was because of a subtle interation in a loop operating in densemap order. llvm-svn: 97288	2010-02-27 00:07:42 +00:00
Chris Lattner	65d3a0a5f8	Fix rdar://7694996 a miscompile of 183.equake from my patch yesterday, confusing the old MAT variable with the new GlobalType one. This caused us to promote the @disp global pointer into: @disp.body = internal global double* undef instead of: @disp.body = internal global [3 x double] undef llvm-svn: 97285	2010-02-26 23:42:13 +00:00
Chris Lattner	da5fcdace0	remove dead code, by this point all uses of CI are gone. llvm-svn: 97283	2010-02-26 23:35:25 +00:00
Bob Wilson	ed1b0c31a7	Move the EnableFullLoadPRE flag from a separate command-line option to an argument of createGVNPass and set it automatically for -O3. llvm-svn: 97245	2010-02-26 19:09:47 +00:00
Bob Wilson	d4655991c3	Remove unused "NoPRE" parameter in GVN and createGVNPass(). llvm-svn: 97235	2010-02-26 18:35:19 +00:00
Chris Lattner	0521c09d97	fix PR6435 another bug from the MallocInst elimination work. llvm-svn: 97231	2010-02-26 18:23:13 +00:00
Chris Lattner	7939f795f5	rewrite OptimizeGlobalAddressOfMalloc to fix PR6422, some bugs introduced when mallocinst was eliminated. llvm-svn: 97178	2010-02-25 22:33:52 +00:00
Dan Gohman	a9c205cc88	Make LoopSimplify change conditional branches in loop exiting blocks which branch on undef to branch on a boolean constant for the edge exiting the loop. This helps ScalarEvolution compute trip counts for loops. Teach ScalarEvolution to recognize single-value PHIs, when safe, and ForgetSymbolicName to forget such single-value PHI nodes as apprpriate in ForgetSymbolicName. llvm-svn: 97126	2010-02-25 06:57:05 +00:00
Nick Lewycky	614fb949b9	Modernize comment. llvm-svn: 97121	2010-02-25 06:39:10 +00:00
Nick Lewycky	dc835c4361	Correct whitespace. llvm-svn: 97120	2010-02-25 06:38:51 +00:00
Daniel Dunbar	693ea89214	Reapply r97010, the speculative revert failed. llvm-svn: 97036	2010-02-24 08:48:04 +00:00
Daniel Dunbar	0a2031e5b6	Speculatively revert r97010, "Add an argument to PHITranslateValue to specify the DominatorTree. ...", in hopes of restoring poor old PPC bootstrap. llvm-svn: 97027	2010-02-24 06:55:22 +00:00
Dan Gohman	94732024eb	Fix indentation. llvm-svn: 97024	2010-02-24 06:46:09 +00:00
Bob Wilson	66e58ac742	Add an argument to PHITranslateValue to specify the DominatorTree. If this argument is non-null, pass it along to PHITranslateSubExpr so that it can prefer using existing values that dominate the PredBB, instead of just blindly picking the first equivalent value that it finds on a uselist. Also when the DominatorTree is specified, have PHITranslateValue filter out any result that does not dominate the PredBB. This is basically just refactoring the check that used to be in GetAvailablePHITranslatedSubExpr and also in GVN. Despite my initial expectations, this change does not affect the results of GVN for any testcases that I could find, but it should help compile time. Before this change, if PHITranslateSubExpr picked a value that does not dominate, PHITranslateWithInsertion would then insert a new value, which GVN would later determine to be redundant and would replace. By picking a good value to begin with, we save GVN the extra work of inserting and then replacing a new value. llvm-svn: 97010	2010-02-24 01:39:00 +00:00
Dan Gohman	cd4c03e886	Don't do (X != Y) ? X : Y -> X for floating-point values; it doesn't handle NaN properly. Do (X une Y) ? X : Y -> X if one of X and Y is not zero. llvm-svn: 96955	2010-02-23 17:17:57 +00:00
Bob Wilson	923261bbe9	Update memdep when load PRE inserts a new load, and add some debug output. I don't have a small testcase for this. llvm-svn: 96890	2010-02-23 05:55:00 +00:00
Evan Cheng	3688b8fa68	Instcombine constant folding can normalize gep with negative index to index with large offset. When instcombine objsize checking transformation sees these geps where the offset seemingly point out of bound, it should just return "i don't know" rather than asserting. llvm-svn: 96825	2010-02-22 23:34:00 +00:00
Bob Wilson	1da9041913	Erase deleted instructions from GVN's ValueTable. This fixes assertion failures from ValueTable::verifyRemoved() when using -debug. llvm-svn: 96805	2010-02-22 21:39:41 +00:00
Dan Gohman	8c16b38262	Remove unused variables and parameters. llvm-svn: 96780	2010-02-22 04:11:59 +00:00
Dan Gohman	4506fcb3c2	When emitting an instruction which depends on both a post-incremented induction variable value and a loop-variant value, don't force the insert position to be at the post-increment position, because it may not be dominated by the loop-variant value. This fixes a use-before-def problem noticed on PPC. llvm-svn: 96774	2010-02-22 03:59:54 +00:00
Dan Gohman	740909be2d	This cast<Instruction> is unnecessary. llvm-svn: 96771	2010-02-22 02:07:36 +00:00
Dan Gohman	4eebb94094	Rename getSDiv to getExactSDiv to reflect its behavior in cases where the division would have a remainder. llvm-svn: 96693	2010-02-19 19:35:48 +00:00
Dan Gohman	85af256779	Check for overflow when scaling up an add or an addrec for scaled reuse. llvm-svn: 96692	2010-02-19 19:32:49 +00:00
Dale Johannesen	1d6827adef	recommit 96626, evidence that it broke things appears to be spurious llvm-svn: 96662	2010-02-19 07:14:22 +00:00
Dale Johannesen	1f790c28d0	Revert 96626, which causes build failure on ppc Darwin. llvm-svn: 96653	2010-02-19 01:54:37 +00:00
Dan Gohman	2446f57503	When determining the set of interesting reuse factors, consider strides in foreign loops. This helps locate reuse opportunities with existing induction variables in foreign loops and reduces the need for inserting new ones. This fixes rdar://7657764. llvm-svn: 96629	2010-02-19 00:05:23 +00:00
Dan Gohman	60b3326435	Indvars needs to explicitly notify ScalarEvolution when it is replacing a loop exit value, so that if a loop gets deleted, ScalarEvolution isn't stick holding on to dangling SCEVAddRecExprs for that loop. This fixes PR6339. llvm-svn: 96626	2010-02-18 23:26:33 +00:00
Dan Gohman	c43d264cc0	Hoist this loop-invariant logic out of the loop. llvm-svn: 96614	2010-02-18 21:34:02 +00:00
Dan Gohman	13ac3b2139	Delete some unneeded casts. llvm-svn: 96429	2010-02-17 00:42:19 +00:00
Dan Gohman	5f10d6c52c	Don't attempt to divide INT_MIN by -1; consider such cases to have overflowed. llvm-svn: 96428	2010-02-17 00:41:53 +00:00
Bob Wilson	aff96b2132	Rename SuccessorNumber to GetSuccessorNumber. llvm-svn: 96387	2010-02-16 21:06:42 +00:00
Dan Gohman	6deab96c81	Refactor rewriting for PHI nodes into a separate function. llvm-svn: 96382	2010-02-16 20:25:07 +00:00
Bob Wilson	92cdb6eec5	Split critical edges as needed for load PRE. llvm-svn: 96378	2010-02-16 19:51:59 +00:00
Bob Wilson	3de492ec35	Refactor to share code to find the position of a basic block successor in the terminator's list of successors. llvm-svn: 96377	2010-02-16 19:49:17 +00:00
Dan Gohman	0849ed5e26	Fix whitespace. llvm-svn: 96372	2010-02-16 19:42:34 +00:00
Duncan Sands	19d0b47b1f	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
Dan Gohman	521efe68ab	Split the main for-each-use loop again, this time for GenerateTruncates, as it also peeks at which registers are being used by other uses. This makes LSR less sensitive to use-list order. llvm-svn: 96308	2010-02-16 01:42:53 +00:00
Chris Lattner	6fbfe5897c	fix PR6305 by handling BlockAddress in a helper function called by jump threading. llvm-svn: 96263	2010-02-15 20:47:49 +00:00
Duncan Sands	9dff9bec31	Uniformize the names of type predicates: rather than having isFloatTy and isInteger, we now have isFloatTy and isIntegerTy. Requested by Chris! llvm-svn: 96223	2010-02-15 16:12:20 +00:00
Dan Gohman	e4e51a63da	Fix whitespace. llvm-svn: 96179	2010-02-14 18:51:39 +00:00
Dan Gohman	e7f74bb16c	Fix a comment. llvm-svn: 96178	2010-02-14 18:51:20 +00:00
Dan Gohman	bb7d52213c	When complicated expressions are broken down into subexpressions with multiplication by constants distributed through, occasionally those subexpressions can include both x and -x. For now, if this condition is discovered within LSR, just prune such cases away, as they won't be profitable. This fixes a "zero allocated in a base register" assertion failure. llvm-svn: 96177	2010-02-14 18:50:49 +00:00
Dan Gohman	2d0f96d49a	Actually, this code doesn't have to be quite so conservative in the no-TLI case. But it should still default to declining the transformation. llvm-svn: 96152	2010-02-14 03:21:49 +00:00
Dan Gohman	cb76a806f0	Don't attempt aggressive post-inc uses if TargetLowering is not available, because profitability can't be sufficiently approximated. llvm-svn: 96148	2010-02-14 02:45:21 +00:00
John McCall	0daaf13b97	Make LSR not crash if invoked without target lowering info, e.g. if invoked from opt. llvm-svn: 96135	2010-02-13 23:40:16 +00:00
Eric Christopher	843a4cc43c	Fix a problem where we had bitcasted operands that gave us odd offsets since the bitcasted pointer size and the offset pointer size are going to be different types for the GEP vs base object. llvm-svn: 96134	2010-02-13 23:38:01 +00:00
Chris Lattner	b8639bc2d1	remove dead code. llvm-svn: 96109	2010-02-13 19:07:06 +00:00
Chris Lattner	42c66b7270	Split some code out to a helper function (FindReusablePredBB) and add a doxygen comment. Cache the phi entry to avoid doing tons of PHINode::getBasicBlockIndex calls in the common case. On my insane testcase from re2c, this speeds up CGP from 617.4s to 7.9s (78x). llvm-svn: 96083	2010-02-13 05:35:08 +00:00
Chris Lattner	5e7f705934	Speed up codegen prepare from 3.58s to 0.488s. llvm-svn: 96081	2010-02-13 05:01:14 +00:00
Chris Lattner	72c4dce884	PHINode::getBasicBlockIndex is O(n) in the number of inputs to a PHI, avoid it in the common case where the BB occurs in the same index for multiple phis. This speeds up CGP on an insane testcase from 8.35 to 3.58s. llvm-svn: 96080	2010-02-13 04:24:19 +00:00
Chris Lattner	b0ebb65ab0	iterate over preds using PHI information when available instead of using pred_begin/end. It is much faster. llvm-svn: 96079	2010-02-13 04:15:26 +00:00
Chris Lattner	96b8826542	speed up CGP a bit by scanning predecessors through phi operands instead of with pred_begin/end. llvm-svn: 96078	2010-02-13 04:04:42 +00:00
Dan Gohman	5b18f039eb	Fix a pruning heuristic which implicitly assumed that SmallPtrSet is deterministically sorted. llvm-svn: 96071	2010-02-13 02:06:02 +00:00
Jakob Stoklund Olesen	492b8b42cd	Enable the inlinehint attribute in the Inliner. Functions explicitly marked inline will get an inlining threshold slightly more aggressive than the default for -O3. This means than -O3 builds are mostly unaffected while -Os builds will be a bit bigger and faster. The difference depends entirely on how many 'inline's are sprinkled on the source. In the CINT2006 suite, only these tests are significantly affected under -Os: Size Time 471.omnetpp +1.63% -1.85% 473.astar +4.01% -6.02% 483.xalancbmk +4.60% 0.00% Note that 483.xalancbmk runs too quickly to give useful timing results. llvm-svn: 96066	2010-02-13 01:51:53 +00:00
Dan Gohman	2b75de97c0	Reapply 95979, a compile-time speedup, now that the bug it exposed is fixed. llvm-svn: 96005	2010-02-12 19:35:25 +00:00
Dan Gohman	363f847ec6	Fix this code to avoid dereferencing an end() iterator in offset distributions it doesn't expect. llvm-svn: 96002	2010-02-12 19:20:37 +00:00
Chris Lattner	75879be9d8	1. modernize the constantmerge pass, using densemap/smallvector. 2. don't bother trying to merge globals in non-default sections, doing so is quite dubious at best anyway. 3. fix a bug reported by Arnaud de Grandmaison where we'd try to merge two globals in different address spaces. llvm-svn: 95995	2010-02-12 18:17:23 +00:00
Daniel Dunbar	e0b2c69d3c	Revert "Reverse the order for collecting the parts of an addrec. The order", it is breaking llvm-gcc bootstrap. llvm-svn: 95988	2010-02-12 17:27:08 +00:00
Dan Gohman	0194f58047	Reverse the order for collecting the parts of an addrec. The order doesn't matter, except that ScalarEvolution tends to need less time to fold the results this way. llvm-svn: 95979	2010-02-12 11:08:26 +00:00
Dan Gohman	45774ce0ad	Reapply the new LoopStrengthReduction code, with compile time and bug fixes, and with improved heuristics for analyzing foreign-loop addrecs. This change also flattens IVUsers, eliminating the stride-oriented groupings, which makes it easier to work with. llvm-svn: 95975	2010-02-12 10:34:29 +00:00
Eric Christopher	cccdc13662	Make sure that ConstantExpr offsets also aren't off of extern symbols. Thanks to Duncan Sands for the testcase! llvm-svn: 95877	2010-02-11 17:44:04 +00:00
Chris Lattner	4e8137d678	Rename ValueRequiresCast to ShouldOptimizeCast, to better reflect what it does. Enhance it to return false to optimizing vector sign extensions from vector comparisions, which is the idiom used to get a splatted vector for a vector comparison. Doing this breaks vector-casts.ll, add some compensating transformations to handle the important case they cover without depending on this canonicalization. This fixes rdar://7434900 a serious pessimization of vector compares. llvm-svn: 95855	2010-02-11 06:26:33 +00:00
Chris Lattner	c053cbbc4d	Make DSE only scan blocks that are reachable from the entry block. Other blocks may have pointer cycles that will crash basicaa and other alias analyses. In any case, there is no point wasting cycles optimizing dead blocks. This fixes rdar://7635088 llvm-svn: 95852	2010-02-11 05:11:54 +00:00
Chris Lattner	d924f63692	Make jump threading honor x\|undef -> true and x&undef -> false, instead of considering x\|undef -> x, which may not be true. llvm-svn: 95850	2010-02-11 04:40:44 +00:00
Eric Christopher	531ea566a6	Add ConstantExpr handling to Intrinsic::objectsize lowering. Update testcase accordingly now that we can optimize another section. llvm-svn: 95846	2010-02-11 01:48:54 +00:00
Devang Patel	03936a1880	Ignore dbg info intrinsics. llvm-svn: 95828	2010-02-11 00:20:49 +00:00
Devang Patel	211746a69a	Strip new llvm.dbg.value intrinsic. llvm-svn: 95807	2010-02-10 21:19:56 +00:00
Dan Gohman	4a618827de	Fix "the the" and similar typos. llvm-svn: 95781	2010-02-10 16:03:48 +00:00
Eric Christopher	7b7028fd24	Move Intrinsic::objectsize lowering back to InstCombineCalls and enable constant 0 offset lowering. llvm-svn: 95691	2010-02-09 21:24:27 +00:00
Eric Christopher	ad1aa86276	Pull these back out, they're a little too aggressive and time consuming for a simple optimization. llvm-svn: 95671	2010-02-09 17:29:18 +00:00
Chris Lattner	f4c8d3cea9	simplify this code, duh. llvm-svn: 95643	2010-02-09 01:14:06 +00:00
Chris Lattner	9b6a1789e5	fix PR6193, only considering sign extensions from i1 for this xform. llvm-svn: 95642	2010-02-09 01:12:41 +00:00
Eric Christopher	be2f0b2b7b	Add file in here too. llvm-svn: 95641	2010-02-09 01:11:03 +00:00
Eric Christopher	9f85e7eb16	Add a new pass to do llvm.objsize lowering using SCEV. Initial skeleton and SCEVUnknown lowering implemented, the rest should come relatively quickly. Move testcase to new directory. Move pass to right before SimplifyLibCalls - which is moved down a bit so we can take advantage of a few opts. llvm-svn: 95628	2010-02-09 00:35:38 +00:00
Chris Lattner	b22423c89a	fix some problems handling large vectors reported in PR6230 llvm-svn: 95616	2010-02-08 23:56:03 +00:00
Jakob Stoklund Olesen	74bb06c0f0	Reintroduce the InlineHint function attribute. This time it's for real! I am going to hook this up in the frontends as well. The inliner has some experimental heuristics for dealing with the inline hint. When given a -respect-inlinehint option, functions marked with the inline keyword are given a threshold just above the default for -O3. We need some experiments to determine if that is the right thing to do. llvm-svn: 95466	2010-02-06 01:16:28 +00:00
Jakob Stoklund Olesen	5f9ead2714	Don't unroll loops containing function calls. llvm-svn: 95454	2010-02-05 23:21:31 +00:00
Jakob Stoklund Olesen	916f48a054	Teach SimplifyCFG about magic pointer constants. Weird code sometimes uses pointer constants other than null. This patch teaches SimplifyCFG to build switch instructions in those cases. Code like this: void f(const char x) { if (!x) puts("null"); else if ((uintptr_t)x == 1) puts("one"); else if (x == (char)2 \|\| x == (char)3) puts("two"); else if ((intptr_t)x == 4) puts("four"); else puts(x); } Now becomes a switch: define void @f(i8 %x) nounwind ssp { entry: %magicptr23 = ptrtoint i8* %x to i64 ; <i64> [#uses=1] switch i64 %magicptr23, label %if.else16 [ i64 0, label %if.then i64 1, label %if.then2 i64 2, label %if.then9 i64 3, label %if.then9 i64 4, label %if.then14 ] Note that LLVM's own DenseMap uses magic pointers. llvm-svn: 95439	2010-02-05 22:03:18 +00:00
Chris Lattner	64ffd11d49	fix logical-select to invoke filecheck right, and fix hte instcombine xform it is checking to actually pass. There is no need to match m_SelectCst<0, -1> since instcombine canonicalizes that into not(sext). Add matches for sext(not(x)) in addition to not(sext(x)). llvm-svn: 95420	2010-02-05 19:53:02 +00:00
Dan Gohman	4739e41ce9	Implement releaseMemory in CodeGenPrepare and free the BackEdges container data. This prevents it from holding onto dangling pointers and potentially behaving unpredictably. llvm-svn: 95409	2010-02-05 19:24:11 +00:00
Dan Gohman	8abb67df63	Use a SmallSetVector instead of a SetVector; this code showed up as a malloc caller in a profile. llvm-svn: 95407	2010-02-05 19:20:15 +00:00
Eric Christopher	04371b4f12	Remove this code for now. I have a better idea and will rewrite with that in mind. llvm-svn: 95402	2010-02-05 19:04:06 +00:00
Bob Wilson	27dfb1e1a4	Do not reassociate expressions with i1 type. SimplifyCFG converts some short-circuited conditions to AND/OR expressions, and those expressions are often converted back to a short-circuited form in code gen. The original source order may have been optimized to take advantage of the expected values, and if we reassociate them, we change the order and subvert that optimization. Radar 7497329. llvm-svn: 95333	2010-02-04 23:32:37 +00:00
Jakob Stoklund Olesen	113fb54bcb	Increase inliner thresholds by 25. This makes the inliner about as agressive as it was before my changes to the inliner cost calculations. These levels give the same performance and slightly smaller code than before. llvm-svn: 95320	2010-02-04 18:48:20 +00:00
Eric Christopher	107a1fbf61	Temporarily revert this since it appears to have caused a build failure. llvm-svn: 95294	2010-02-04 06:41:27 +00:00
Eric Christopher	42fa84a880	Rework constant expr and array handling for objectsize instcombining. Fix bugs where we would compute out of bounds as in bounds, and where we couldn't know that the linker could override the size of an array. Add a few new testcases, change existing testcase to use a private global array instead of extern. llvm-svn: 95283	2010-02-04 02:55:34 +00:00
Eric Christopher	f12e18db21	If we're dealing with a zero-length array, don't lower to any particular size, we just don't know what the length is yet. llvm-svn: 95266	2010-02-03 23:56:07 +00:00
Bob Wilson	04365c5f72	Adjust the heuristics used to decide when SROA is likely to be profitable. The SRThreshold value makes perfect sense for checking if an entire aggregate should be promoted to a scalar integer, but it is not so good for splitting an aggregate into its separate elements. A struct may contain a large embedded array along with some scalar fields that would benefit from being split apart by SROA. Even if the total aggregate size is large, it may still be good to perform SROA. Thus, the most important piece of this patch is simply moving the aggregate size comparison vs. SRThreshold so that it guards only the aggregate promotion. We have also been checking the number of elements to decide if an aggregate should be split up. The limit of "SRThreshold/4" seemed rather arbitrary, and I don't think it's very useful to derive this limit from SRThreshold anyway. I've collected some data showing that the current default limit of 32 (since SRThreshold defaults to 128) is a reasonable cutoff for struct types. One thing suggested by the data is that distinguishing between structs and arrays might be useful. There are (obviously) a lot more large arrays than large structs (as measured by the number of elements and not the total size -- a large array inside a struct still counts as a single element given the way we do SROA right now). Out of 8377 arrays where we successfully performed SROA while compiling a large set of benchmarks, only 16 of them had more than 8 elements. And, for those 16 arrays, it's not at all clear that SROA was actually beneficial. So, to offset the compile time cost of investigating more large structs for SROA, the patch lowers the limit on array elements to 8. This fixes Apple Radar 7563690. llvm-svn: 95224	2010-02-03 17:23:56 +00:00
Evan Cheng	27a41d5473	Revert 94937 and move the noreturn check to codegen. llvm-svn: 95198	2010-02-03 03:55:59 +00:00
Bob Wilson	76e8c59509	Fix some comment typos. llvm-svn: 95170	2010-02-03 00:33:21 +00:00
Eric Christopher	d86233c118	Recommit this, looks like it wasn't the cause. llvm-svn: 95165	2010-02-03 00:21:58 +00:00
Eric Christopher	e67d01a9a8	Hopefully temporarily revert this. llvm-svn: 95154	2010-02-02 23:01:31 +00:00
Eric Christopher	f9553572b7	Reformat my last patch slightly. llvm-svn: 95147	2010-02-02 22:29:26 +00:00
Eric Christopher	4264e7e46f	Re-add strcmp and known size object size checking optimization. Passed bootstrap and nightly test run here. llvm-svn: 95145	2010-02-02 22:10:43 +00:00
Chris Lattner	8e2c471614	don't turn (A & (C0?-1:0)) \| (B & ~(C0?-1:0)) -> C0 ? A : B for vectors. Codegen is generating awful code or segfaulting in various cases (e.g. PR6204). llvm-svn: 95058	2010-02-02 02:43:51 +00:00
Chris Lattner	302240d73e	fix a crash in loop unswitch on a loop invariant vector condition. llvm-svn: 95055	2010-02-02 02:26:54 +00:00
Dan Gohman	949458d014	LangRef.html says that inttoptr and ptrtoint always use zero-extension when the cast is extending. llvm-svn: 95046	2010-02-02 01:44:02 +00:00
Eric Christopher	14dfc3f6df	Don't need to check the last argument since it'll always be bool. We also don't use TargetData here. llvm-svn: 95040	2010-02-02 00:51:45 +00:00
Eric Christopher	9afa973203	More indentation/tabification fixes. llvm-svn: 95036	2010-02-02 00:13:06 +00:00
Eric Christopher	1408234753	Untabify previous commit. llvm-svn: 95035	2010-02-02 00:06:55 +00:00
Eric Christopher	56e4182c49	Formatting. llvm-svn: 95027	2010-02-01 23:25:03 +00:00
Bob Wilson	d517b52012	Add an option to GVN to remove all partially redundant loads. This is currently disabled by default. This divides the existing load PRE code into 2 phases: first it checks that it is safe to move the load to each of the predecessors where it is unavailable, and then if it is safe, the code is changed to move the load. Radar 7571861. llvm-svn: 95007	2010-02-01 21:17:14 +00:00
Chris Lattner	9306ffa05a	cleanups. llvm-svn: 94995	2010-02-01 19:54:45 +00:00
Chris Lattner	846a52e228	fix rdar://7590304, a miscompilation of objc apps on arm. The caller of objc message send was getting marked arm_apcscc, but the prototype isn't. This is fine at runtime because objcmsgsend is implemented in assembly. Only turn a mismatched caller and callee into 'unreachable' if the callee is a definition. llvm-svn: 94986	2010-02-01 18:11:34 +00:00
Chris Lattner	2cecedf081	fix rdar://7590304, an infinite loop in instcombine. In the invoke case, instcombine can't zap the invoke for fear of changing the CFG. However, we have to do something to prevent the next iteration of instcombine from inserting another store -> undef before the invoke thereby getting into infinite iteration between dead store elim and store insertion. Just zap the callee to null, which will prevent the next iteration from doing anything. llvm-svn: 94985	2010-02-01 18:04:58 +00:00
Bob Wilson	f65ba356e1	Fix pr6198 by moving the isSized() check to an outer conditional. The testcase from pr6198 does not crash for me -- I don't know what's up with that -- so I'm not adding it to the tests. llvm-svn: 94984	2010-02-01 17:41:44 +00:00
Eli Friedman	a2cc2875fc	Simplify/generalize the xor+add->sign-extend instcombine. llvm-svn: 94943	2010-01-31 04:29:12 +00:00
Eli Friedman	37a8197b61	Add a small transform: transform -(X<<Y) to (-X<<Y) when the shift has a single use and X is free to negate. llvm-svn: 94941	2010-01-31 02:30:23 +00:00
Evan Cheng	d86d3fe0c3	Do not mark no-return calls tail calls. It'll screw up special calls like longjmp and it doesn't make much sense for performance reason. If my logic is faulty, please let me know. llvm-svn: 94937	2010-01-31 00:59:31 +00:00
Bob Wilson	56600a15ad	Check alignment of loads when deciding whether it is safe to execute them unconditionally. Besides checking the offset, also check that the underlying object is aligned as much as the load itself. llvm-svn: 94875	2010-01-30 04:42:39 +00:00
Bob Wilson	4b71b6c179	Use more specific types to avoid casts. No functionality change. llvm-svn: 94863	2010-01-30 00:41:10 +00:00
Jakob Stoklund Olesen	e27dc727e2	Keep iterating over all uses when meeting a phi node in AllUsesOfValueWillTrapIfNull(). This bug was exposed by my inliner cost changes in r94615, and caused failures of lencod on most architectures when building with LTO. This patch fixes lencod and 464.h264ref on x86-64 (and likely others). llvm-svn: 94858	2010-01-29 23:54:14 +00:00
Bob Wilson	1b8453067b	Preserve load alignment in instcombine transformations. I've been unable to create a testcase where this matters. The select+load transformation only occurs when isSafeToLoadUnconditionally is true, and in those situations, instcombine also changes the underlying objects to be aligned. This seems like a good idea regardless, and I've verified that it doesn't pessimize the subsequent realignment. llvm-svn: 94850	2010-01-29 22:39:21 +00:00
Eric Christopher	5a0e174863	Revert my last couple of patches. They appear to have broken bison. llvm-svn: 94841	2010-01-29 21:16:24 +00:00
Bob Wilson	34e10c2218	Use uint64_t instead of unsigned for offsets and sizes. llvm-svn: 94835	2010-01-29 20:34:28 +00:00
Bob Wilson	7c42b9d51e	Improve isSafeToLoadUnconditionally to recognize that GEPs with constant indices are safe if the result is known to be within the bounds of the underlying object. llvm-svn: 94829	2010-01-29 19:19:08 +00:00
Duncan Sands	c8a3e56870	Having RHSKnownZero and RHSKnownOne be alternative names for KnownZero and KnownOne (via APInt &RHSKnownZero = KnownZero, etc) seems dangerous and confusing to me: it is easy not to notice this, and then wonder why KnownZero/RHSKnownZero changed underneath you when you modified RHSKnownZero/KnownZero etc. So get rid of this. No intended functionality change (tested with "make check" + llvm-gcc bootstrap). llvm-svn: 94802	2010-01-29 06:18:46 +00:00
Eric Christopher	9b3c02b7da	Make strcpy_chk lower to strcpy if we have a safe size. llvm-svn: 94783	2010-01-29 01:37:11 +00:00
Eric Christopher	997f7ca8c5	Add constant support to object size handling and remove default lowering. We'll either figure it out, or not and be lowered by SelectionDAGBuild. Add test. llvm-svn: 94775	2010-01-29 01:09:57 +00:00
Bill Wendling	48816a0b3f	Generic reformatting and comment fixing. No functionality change. llvm-svn: 94771	2010-01-29 00:52:43 +00:00
Bill Wendling	8277838cf8	Add newline to debugging output, and fix some grammar-os in comment. llvm-svn: 94765	2010-01-29 00:27:39 +00:00
Victor Hernandez	006b53f199	mem2reg erases the dbg.declare intrinsics that it converts to dbg.val intrinsics llvm-svn: 94763	2010-01-29 00:01:35 +00:00
Duncan Sands	3a48b87c54	Fix PR6165. The bug was that LHSKnownZero was being and'd with DemandedMask when it should have been and'd with LowBits. Fix that and while there beef up the logic in the case of a negative LHS. llvm-svn: 94745	2010-01-28 17:22:42 +00:00
Bob Wilson	7577e948e4	Avoid creating redundant PHIs in SSAUpdater::GetValueInMiddleOfBlock. This was already being done in SSAUpdater::GetValueAtEndOfBlock so I've just changed SSAUpdater to check for existing PHIs in both places. llvm-svn: 94690	2010-01-27 22:01:02 +00:00
Jeffrey Yasskin	091217be6f	Kill ModuleProvider and ghost linkage by inverting the relationship between Modules and ModuleProviders. Because the "ModuleProvider" simply materializes GlobalValues now, and doesn't provide modules, it's renamed to "GVMaterializer". Code that used to need a ModuleProvider to materialize Functions can now materialize the Functions directly. Functions no longer use a magic linkage to record that they're materializable; they simply ask the GVMaterializer. Because the C ABI must never change, we can't remove LLVMModuleProviderRef or the functions that refer to it. Instead, because Module now exposes the same functionality ModuleProvider used to, we store a Module* in any LLVMModuleProviderRef and translate in the wrapper methods. The bindings to other languages still use the ModuleProvider concept. It would probably be worth some time to update them to follow the C++ more closely, but I don't intend to do it. Fixes http://llvm.org/PR5737 and http://llvm.org/PR5735. llvm-svn: 94686	2010-01-27 20:34:15 +00:00
Benjamin Kramer	1266d46d32	Don't bother with sprintf, just pass the Twine through. llvm-svn: 94684	2010-01-27 19:58:47 +00:00
Benjamin Kramer	40582a891c	Use the less expensive getName function instead of getNameStr. llvm-svn: 94683	2010-01-27 19:46:52 +00:00
Chris Lattner	65f4733b77	some cleanups. llvm-svn: 94649	2010-01-27 02:12:20 +00:00
Chris Lattner	711e701f1c	no need to check for null llvm-svn: 94648	2010-01-27 02:04:20 +00:00
Victor Hernandez	477d9274bb	When converting dbg.declare to dbg.value, attach promoted store's debug metadata to dbg.value llvm-svn: 94634	2010-01-27 00:44:36 +00:00
Victor Hernandez	2b17e2a452	Avoid extra calls to MD->getNumOperands() llvm-svn: 94618	2010-01-26 23:29:09 +00:00
Victor Hernandez	9ecd2f039f	Switch AllocaDbgDeclares to SmallVector and don't leak DIFactory llvm-svn: 94567	2010-01-26 18:57:53 +00:00
Victor Hernandez	cd94410152	In mem2reg, for all alloca/stores that get promoted where the alloca has an associated llvm.dbg.declare instrinsic, insert an llvm.dbg.var intrinsic before each store. llvm-svn: 94493	2010-01-26 02:42:15 +00:00
Bob Wilson	70c8fe5e4e	Remove check for an impossible condition: the condition of the while loop has already checked that TmpBB->getSinglePredecessor() is non-null. llvm-svn: 94451	2010-01-25 21:28:05 +00:00
Bob Wilson	fc060e4337	Change Value::getUnderlyingObject to have the MaxLookup value specified as a parameter with a default value, instead of just hardcoding it in the implementation. The limit of MaxLookup = 6 was introduced in r69151 to fix a performance problem with O(n^2) behavior in instcombine, but the scalarrepl pass is relying on getUnderlyingObject to go all the way back to an AllocaInst. Making the limit part of the method signature makes it clear that by default the result is limited and should help avoid similar problems in the future. This fixes pr6126. llvm-svn: 94433	2010-01-25 18:26:54 +00:00
Victor Hernandez	8a588e1444	Revert r94260 until findDbgDeclare() is made more efficient llvm-svn: 94432	2010-01-25 17:52:13 +00:00
Chris Lattner	823aed16f9	make -fno-rtti the default unless a directory builds with REQUIRES_RTTI. llvm-svn: 94378	2010-01-24 20:43:08 +00:00
Chris Lattner	1b35bbe813	change the canonical form of "cond ? -1 : 0" to be "sext cond" instead of a select. This simplifies some instcombine code, matches the policy for zext (cond ? 1 : 0 -> zext), and allows us to generate better code for a testcase on ppc. llvm-svn: 94339	2010-01-24 00:09:49 +00:00
Chris Lattner	e112ff64c5	fix a potential overflow issue Eli pointed out. llvm-svn: 94336	2010-01-23 23:31:46 +00:00
Nick Lewycky	7e7ed8b9e5	Speculatively revert r94322 to see if it fixes darwin selfhost buildbot. llvm-svn: 94331	2010-01-23 20:32:12 +00:00
Chris Lattner	29b15c5cfd	third bug from PR6119: the xor dupe extension allows for arbitrary terminators in predecessors, don't assume it is a conditional or uncond branch. The testcase shows an example where they can happen with switches. llvm-svn: 94323	2010-01-23 19:21:31 +00:00
Nick Lewycky	32966aed9d	Teach DAE that even though it can't modify the function signature of an externally visible function, it can still find all callers of it and replace the parameters to a dead argument with undef. llvm-svn: 94322	2010-01-23 19:19:34 +00:00
Chris Lattner	ba2d0b89ff	add an early out to ProcessBranchOnXOR to speed it up, handle the case when we can infer an input to the xor from all inputs that agree, instead of going into an infinite loop. Another part of PR6199 llvm-svn: 94321	2010-01-23 19:16:25 +00:00
Chris Lattner	de5ab4860f	fix a crash in jump threading, PR6119 llvm-svn: 94319	2010-01-23 18:56:07 +00:00
Chris Lattner	249da5cb73	implement a simple instcombine xform that has been in the readme forever. llvm-svn: 94318	2010-01-23 18:49:30 +00:00
Eric Christopher	ba7cd4c393	Reapply 94059 while fixing the calling convention setup for strcpy. llvm-svn: 94287	2010-01-23 05:29:06 +00:00
Victor Hernandez	5006e43faf	In mem2reg, for all alloca/stores that get promoted where the alloca has an associated llvm.dbg.declare instrinsic, insert an llvm.dbg.var intrinsic before each store llvm-svn: 94260	2010-01-23 00:17:34 +00:00
Benjamin Kramer	3838dfbaea	Another strncmp -> StringRef.startswith simplification. llvm-svn: 94203	2010-01-22 20:00:21 +00:00
Bob Wilson	6c0c8d41b4	Revert 94059. It is breaking the MultiSource/Benchmarks/Prolangs-C/bison test on ARM. llvm-svn: 94198	2010-01-22 19:16:40 +00:00
Victor Hernandez	5f8c8c034a	Keep ignoring pointer-to-pointer bitcasts llvm-svn: 94194	2010-01-22 19:05:05 +00:00
Chris Lattner	7ba0661f27	Stop building RTTI information for most llvm libraries. Notable missing ones are libsupport, libsystem and libvmcore. libvmcore is currently blocked on bugpoint, which uses EH. Once it stops using EH, we can switch it off. This #if 0's out 3 unit tests, because gtest requires RTTI information. Suggestions welcome on how to fix this. llvm-svn: 94164	2010-01-22 06:49:46 +00:00
Dan Gohman	045f81981a	Revert LoopStrengthReduce.cpp to pre-r94061 for now. llvm-svn: 94123	2010-01-22 00:46:49 +00:00
Victor Hernandez	7b151e9f06	No need to look through bitcasts for DbgInfoIntrinsic llvm-svn: 94114	2010-01-21 23:09:12 +00:00
Victor Hernandez	ae4d949721	DbgInfoIntrinsic no longer appear in an instruction's use list llvm-svn: 94113	2010-01-21 23:08:36 +00:00
Victor Hernandez	5f5abd598c	No need to look through bitcasts for DbgInfoIntrinsic llvm-svn: 94112	2010-01-21 23:07:15 +00:00
Victor Hernandez	1df65186d1	DbgInfoIntrinsics no longer appear in an instruction's use list; so clean up looking for them in use iterations and remove OnlyUsedByDbgInfoIntrinsics() llvm-svn: 94111	2010-01-21 23:05:53 +00:00
Dan Gohman	b1ee154b6b	When inserting expressions for post-increment users which contain loop-variant components, adds must be inserted after the increment. Keep track of the increment position for this case, and insert these adds in the correct location. llvm-svn: 94110	2010-01-21 23:01:22 +00:00
Dan Gohman	cb8d577eb2	Include IVUsers information in LSR's debug output. llvm-svn: 94108	2010-01-21 22:46:32 +00:00
Dan Gohman	29916e023d	Prune the search for candidate formulae if the number of register operands exceeds the number of registers used in the initial solution, as that wouldn't lead to a profitable solution anyway. llvm-svn: 94107	2010-01-21 22:42:49 +00:00
Dan Gohman	c903499ff8	Add a comment. llvm-svn: 94104	2010-01-21 21:31:09 +00:00
Chris Lattner	24716b6c63	It turns out that this #include is needed because otherwise ValueMapper.cpp ends up calling an out of line __ZNK4llvm12PATypeHolder3getEv, which is a template and llvm-config determines arbitrarily to use the one in libipo. This sucks, but keeping the #include is a reasonable workaround. llvm-svn: 94103	2010-01-21 21:29:25 +00:00
Chris Lattner	9889b4be04	unbreak the build, apparently without this transformutils starts depending on libipa? llvm-svn: 94102	2010-01-21 21:20:51 +00:00
Chris Lattner	e39837d5ee	tidy up llvm-svn: 94101	2010-01-21 21:05:54 +00:00
Victor Hernandez	a9ad174b49	Don't need to include IntrinsicInst.h any more llvm-svn: 94092	2010-01-21 19:33:59 +00:00
Victor Hernandez	d089f4e10b	No need to map NULL operands of metadata llvm-svn: 94091	2010-01-21 19:26:20 +00:00
Dan Gohman	51ad99d2c5	Re-implement the main strength-reduction portion of LoopStrengthReduction. This new version is much more aggressive about doing "full" reduction in cases where it reduces register pressure, and also more aggressive about rewriting induction variables to count down (or up) to zero when doing so reduces register pressure. It currently uses fairly simplistic algorithms for finding reuse opportunities, but it introduces a new framework allows it to combine multiple strategies at once to form hybrid solutions, instead of doing all full-reduction or all base+index. llvm-svn: 94061	2010-01-21 02:09:26 +00:00
Eric Christopher	fa863258d0	Add strcpy_chk -> strcpy support for "don't know" object size answers. This will update as object size checking gets better information. llvm-svn: 94059	2010-01-21 01:04:38 +00:00
Chris Lattner	3c5bf71353	simplify this code. llvm-svn: 94048	2010-01-20 23:30:28 +00:00
Jakob Stoklund Olesen	8a19d3c96c	Move per-function inline threshold calculation to a method. No functional change except the forgotten test for InlineLimit.getNumOccurrences() == 0 in the CurrentThreshold2 calculation. llvm-svn: 94007	2010-01-20 17:51:28 +00:00
Victor Hernandez	f2462407ee	Switch Elts from vector to SmallVector llvm-svn: 93989	2010-01-20 06:56:16 +00:00
Victor Hernandez	5fa88d4e30	Map operands of all function-local metadata, not just metadata passed to llvm.dbg.declare intrinsics llvm-svn: 93979	2010-01-20 05:49:59 +00:00
Dan Gohman	ca19445d08	When doing address-mode sinking, expand the base register first, rather than the scaled register. This makes it more likely that subsequent AddrModeMatcher queries will match the new address the same way as the old, instead of accidentally matching what had been the base register as the new scaled register, and then failing to match the scaled register. This fixes some problems with address-mode sinking multiple muls into a block, which will be a lot more common with some upcoming LoopStrengthReduction changes. llvm-svn: 93935	2010-01-19 22:45:06 +00:00
Chris Lattner	18f49ce2d3	optimize ~(~X >>s Y) --> (X >>s Y), patch by Edmund Grimley Evans! llvm-svn: 93884	2010-01-19 18:16:19 +00:00
Bob Wilson	58d59fe394	Fix a crash in scalarrepl for memcpy/memmove where the source and destination are the same. I had already fixed a similar problem where the source and destination were different bitcasts derived from the same alloca, but the previous fix still did not handle the case where both operands are exactly the same value. Radar 7552893. llvm-svn: 93848	2010-01-19 04:32:48 +00:00
Eric Christopher	84bd316bd6	Fix comment. llvm-svn: 93831	2010-01-19 01:20:15 +00:00
Chris Lattner	43f2fa6201	my instcombine transformations to make extension elimination more aggressive changed the canonical form from sext(trunc(x)) to ashr(lshr(x)), make sure to transform a couple more things into that canonical form, and catch a case where we missed turning zext/shl/ashr into a single sext. llvm-svn: 93787	2010-01-18 22:19:16 +00:00
Devang Patel	696cb8d410	While mapping llvm.dbg.declare intrinsic manually map its operand, if possible, because it points to an alloca instruction through metadata. llvm-svn: 93757	2010-01-18 19:52:14 +00:00
Owen Anderson	cdea3572fa	Convert some of the dynamic opcode lookups into static ones. llvm-svn: 93693	2010-01-17 19:33:27 +00:00
Owen Anderson	fa1edea9ce	Fix comment. llvm-svn: 93679	2010-01-17 06:49:03 +00:00
Bob Wilson	e0da4b6cff	Fix a comment typo. llvm-svn: 93560	2010-01-15 21:55:02 +00:00
Bill Wendling	ad7a5b07a7	When the visitSub method was split into visitSub and visitFSub, this xform was added to the FSub version. However, the original version of this xform guarded against doing this for floating point (!Op0->getType()->isFPOrFPVector()). This is causing LLVM to perform incorrect xforms for code like: void func(double rhi, double rlo, double xh, double xl, double yh, double yl){ double mh, ml; double c = 134217729.0; double up, u1, u2, vp, v1, v2; up = xhc; u1 = (xh - up) + up; u2 = xh - u1; vp = yhc; v1 = (yh - vp) + vp; v2 = yh - v1; mh = xhyh; ml = (((u1v1 - mh) + (u1v2)) + (u2v1)) + (u2v2); ml += xhyl + xlyh; rhi = mh + ml; rlo = (mh - (rhi)) + ml; } The last line was optimized away, but rl is intended to be the difference between the infinitely precise result of mh + ml and after it has been rounded to double precision. llvm-svn: 93369	2010-01-13 23:23:17 +00:00
Chris Lattner	573da8ac90	1) Use the new SimplifyInstructionsInBlock routine instead of the copy in JT. 2) When cloning blocks for PHI or xor conditions, use instsimplify to simplify the code as we go. This allows us to squish common cases early in JT which opens up opportunities for subsequent iterations, and allows it to completely simplify the testcase. llvm-svn: 93253	2010-01-12 20:41:47 +00:00
Chris Lattner	7c743f2c74	add a helper function. llvm-svn: 93251	2010-01-12 19:40:54 +00:00
Chris Lattner	af7855d571	tidy up llvm-svn: 93222	2010-01-12 02:07:50 +00:00
Chris Lattner	eb73bdb2e1	Teach jump threading to duplicate small blocks when the branch condition is a xor with a phi node. This eliminates nonsense like this from 176.gcc in several places: LBB166_84: testl %eax, %eax - setne %al - xorb %cl, %al - notb %al - testb $1, %al - je LBB166_85 + je LBB166_69 + jmp LBB166_85 This is rdar://7391699 llvm-svn: 93221	2010-01-12 02:07:17 +00:00
Chris Lattner	6a19ed0b86	some cleanup, and make it obvious that ProcessJumpOnPHI only works on branches by renaming it and checking for a branch at the call site. llvm-svn: 93208	2010-01-11 23:41:09 +00:00
Chris Lattner	d1a3efedd8	reenable the piece that turns trunc(zext(x)) -> x even if zext has multiple uses, codegen has no apparent problem with the trunc version of this, because it turns into a simple subreg idiom llvm-svn: 93202	2010-01-11 22:49:40 +00:00
Chris Lattner	a6b1356cf9	Disable folding sext(trunc(x)) -> x (and other similar cast/cast cases) when the trunc has multiple uses. Codegen is not able to coalesce the subreg case correctly and so this leads to higher register pressure and spilling (see PR5997). This speeds up 256.bzip2 from 8.60 -> 8.04s on my machine, ~7%. llvm-svn: 93200	2010-01-11 22:45:25 +00:00
Chris Lattner	9518869423	add one more bitfield optimization, allowing clang to generate good code on PR4216: _test_bitfield: ## @test_bitfield orl $32962, %edi movl $4294941946, %eax andq %rdi, %rax ret instead of: _test_bitfield: movl $4294941696, %ecx movl %edi, %eax orl $194, %edi orl $32768, %eax andq $250, %rdi andq %rax, %rcx movq %rdi, %rax orq %rcx, %rax ret Evan is looking into the remaining andq+imm -> andl optimization. llvm-svn: 93147	2010-01-11 06:55:24 +00:00
Chris Lattner	0a85420409	Extend CanEvaluateZExtd to handle and/or/xor more aggressively in the BitsToClear case. This allows it to promote expressions which have an and/or/xor after the lshr, promoting cases like test2 (from PR4216) and test3 (random extample extracted from a spec benchmark). clang now compiles the code in PR4216 into: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx orl $32768, %edi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret instead of: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx shrl $8, %edi orl $128, %edi shlq $8, %rdi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret which is still not great, but is progress. llvm-svn: 93145	2010-01-11 04:05:13 +00:00
Chris Lattner	12bd8992b3	Remove the dead TD argument to CanEvaluateZExtd, and add a new BitsToClear result which allows us to start promoting expressions that end with a lshr-by-constant. This is conservatively correct and better than what we had before (see testcases) but still needs to be extended further. llvm-svn: 93144	2010-01-11 03:32:00 +00:00
Chris Lattner	172630abd2	improve comments, remove dead TD argument to CanEvaluateSExtd. llvm-svn: 93143	2010-01-11 02:43:35 +00:00
Chris Lattner	7dd540ee24	teach sext optimization to handle truncs from types that are not the dest of the sext. llvm-svn: 93128	2010-01-10 20:30:41 +00:00
Chris Lattner	39d2daa94c	teach zext optimization how to deal with truncs that don't come from the zext dest type. This allows us to handle test52/53 in cast.ll, and allows llvm-gcc to generate much better code for PR4216 in -m64 mode: _test_bitfield: ## @test_bitfield orl $32962, %edi movl %edi, %eax andl $-25350, %eax ret This also fixes a bug handling vector extends, ensuring that the mask produced is a vector constant, not an integer constant. llvm-svn: 93127	2010-01-10 20:25:54 +00:00
Chris Lattner	1a05fddcdc	simplify CanEvaluateSExtd to return a bool now that we have a simpler profitability predicate. llvm-svn: 93111	2010-01-10 07:57:20 +00:00
Chris Lattner	d7816780e2	the NumCastsRemoved argument to CanEvaluateSExtd is dead, remove it. llvm-svn: 93110	2010-01-10 07:42:21 +00:00
Chris Lattner	2fff10c424	now that the cost model has changed, we can always consider elimination of a sign extend to be a win, which simplifies the client of CanEvaluateSExtd, and allows us to eliminate more casts (examples taken from real code). llvm-svn: 93109	2010-01-10 07:40:50 +00:00
Chris Lattner	d8509424a4	change the preferred canonical form for a sign extension to be lshr+ashr instead of trunc+sext. We want to avoid type conversions whenever possible, it is easier to codegen expressions without truncates and extensions. llvm-svn: 93107	2010-01-10 07:08:30 +00:00
Chris Lattner	2b459fe7e1	fix indentation of switch statements, no functionality change. llvm-svn: 93106	2010-01-10 06:59:55 +00:00
Chris Lattner	127bbc715e	fix pasto that broke bootstrap. llvm-svn: 93105	2010-01-10 06:50:04 +00:00
Chris Lattner	b7be7cc486	simplify CanEvaluateZExtd now that we don't care about the number of bits known clear in the result and don't care about the # casts eliminated. TD is also dead but keeping it for now. llvm-svn: 93098	2010-01-10 02:50:04 +00:00
Chris Lattner	49d2c9764d	two changes: 1) don't try to optimize a sext or zext that is only used by a trunc, let the trunc get optimized first. This avoids some pointless effort in some common cases since instcombine scans down a block in the first pass. 2) Change the cost model for zext elimination to consider an 'and' cheaper than a zext. This allows us to do it more aggressively, and for the next patch to simplify the code quite a bit. llvm-svn: 93097	2010-01-10 02:39:31 +00:00
Chris Lattner	f0af17dab3	enhance CanEvaluateZExtd to handle shift left and sext, allowing more expressions to be promoted and casts eliminated. llvm-svn: 93096	2010-01-10 02:22:12 +00:00
Chris Lattner	7723e2b10f	remove an xform subsumed by EvaluateInDifferentType. llvm-svn: 93095	2010-01-10 01:35:55 +00:00
Julien Lerouge	321098ebec	Fix nondeterministic behavior. llvm-svn: 93093	2010-01-10 01:07:22 +00:00
Chris Lattner	c95a7a21b7	clean up this xform by using m_Trunc. llvm-svn: 93092	2010-01-10 01:04:31 +00:00
Chris Lattner	883550afe8	inline and remove the rest of commonIntCastTransforms. llvm-svn: 93091	2010-01-10 01:00:46 +00:00
Chris Lattner	c3aca38468	Inline the expression type promotion/demotion stuff out of commonIntCastTransforms into the callers, eliminating a switch, and allowing the static predicate methods to be moved down to live next to the corresponding function. No functionality change. llvm-svn: 93089	2010-01-10 00:58:42 +00:00
Chris Lattner	ab7087ad66	only factor from expressions whose uses are empty and whose base is the right expression type. This fixes PR5981. llvm-svn: 93045	2010-01-09 06:01:36 +00:00
Julien Lerouge	f50a3f19da	Fix nondeterministic behavior. llvm-svn: 93038	2010-01-09 01:06:49 +00:00
Eric Christopher	4a1d7e1506	Remove unnecessary dyn_cast and add a comment. Part of a WIP. llvm-svn: 93026	2010-01-08 21:37:11 +00:00
Chris Lattner	9242ae047c	mplement a theoretical fixme. llvm-svn: 93024	2010-01-08 19:28:47 +00:00
Chris Lattner	10840e9e13	rename CanEvaluateInDifferentType -> CanEvaluateTruncated and simplify it now that it is only used for truncates. llvm-svn: 93021	2010-01-08 19:19:23 +00:00
Chris Lattner	a1e223ea10	teach instcombine to delete sign extending shift pairs (sra(shl X, C), C) when the input is already sign extended. llvm-svn: 93019	2010-01-08 19:04:21 +00:00
Duncan Sands	4a8b15dc74	Suppress an unused variable warning when assertions are off; remove some trailing whitespace while there. llvm-svn: 93008	2010-01-08 17:51:48 +00:00
Chris Lattner	8c92b57df9	tidy up some stuff duncan pointed out. llvm-svn: 93007	2010-01-08 17:48:19 +00:00
Chris Lattner	35d3b9dcd0	teach ComputeNumSignBits to look through PHI nodes. llvm-svn: 92964	2010-01-07 23:44:37 +00:00
Chris Lattner	3057c37959	Enhance instcombine to reason more strongly about promoting computation that feeds into a zext, similar to the patch I did yesterday for sext. There is a lot of room for extension beyond this patch. llvm-svn: 92962	2010-01-07 23:41:00 +00:00
Benjamin Kramer	76e2766442	Use a do-while loop instead of while + boolean. llvm-svn: 92912	2010-01-07 13:50:07 +00:00
Duncan Sands	f117880ab0	Be less stingy as to how many selects and phi nodes we are prepared to look through. llvm-svn: 92898	2010-01-07 05:48:42 +00:00
Chris Lattner	9855a6bb7c	handle ConstantVector while I'm in here. llvm-svn: 92892	2010-01-07 01:20:20 +00:00
Chris Lattner	64ecc468bd	fix a globalopt crash on 'bullet' (handling evaluation of a store to an element of a vector in a static ctor) which occurs with an unrelated patch I'm testing. Annoyingly, EvaluateStoreInto basically does exactly the same stuff as InsertElement constant folding, but it now handles vectors, and you can't insertelement into a vector. It would be 'really nice' if GEP into a vector were not legal. llvm-svn: 92889	2010-01-07 01:16:21 +00:00
Eric Christopher	2cdb806fd8	Move the object size intrinsic optimization to inst-combine and make it work for any integer size return type. llvm-svn: 92853	2010-01-06 20:04:44 +00:00
Duncan Sands	c8493da5b1	Fix a README item: have functionattrs look through selects and phi nodes when deciding which pointers point to local memory. I actually checked long ago how useful this is, and it isn't very: it hardly ever fires in the testsuite, but since Chris wants it here it is! llvm-svn: 92836	2010-01-06 15:37:47 +00:00
Mikhail Glushenkov	40d2429b28	Formatting. llvm-svn: 92831	2010-01-06 09:20:39 +00:00
Duncan Sands	78376ad7e1	Partially address a README by having functionattrs consider calls to memcpy, memset and other intrinsics that only access their arguments to be readnone if the intrinsic's arguments all point to local memory. This improves the testcase in the README to readonly, but it could in theory be made readnone, however this would involve more sophisticated analysis that looks through the memcpy. llvm-svn: 92829	2010-01-06 08:45:52 +00:00
Chris Lattner	4339f2abdb	tweaks suggested by Duncan llvm-svn: 92824	2010-01-06 05:32:15 +00:00
Chris Lattner	98748c0964	Teach instcombine's sext elimination logic to be more aggressive. Previously, instcombine would only promote an expression tree to the larger type if doing so eliminated two casts. This is because a need to manually do the sign extend after the promoted expression tree with two shifts. Now, we keep track of whether the result of the computation is going to be properly sign extended already. If so, we can unconditionally promote the expression, which allows us to zap more sext's. This implements rdar://6598839 (aka gcc pr38751) llvm-svn: 92815	2010-01-06 01:56:21 +00:00
Chris Lattner	8600dd3d7c	simplify this code. llvm-svn: 92800	2010-01-05 23:00:30 +00:00
Chris Lattner	554d0564ff	make this a static function instead of a method. llvm-svn: 92795	2010-01-05 22:30:42 +00:00
Chris Lattner	a93c63c22d	more rearrangement and cleanup, fix my test failure. llvm-svn: 92792	2010-01-05 22:21:18 +00:00
Chris Lattner	f476ef502c	cleanup llvm-svn: 92790	2010-01-05 22:07:33 +00:00
Chris Lattner	f88dd5ed64	remove two trunc xforms that are subsumed by EvaluateInDifferentType. The only difference is that EvaluateInDifferentType checks to ensure they are profitable before doing them :) llvm-svn: 92788	2010-01-05 22:01:41 +00:00
Chris Lattner	44a63815b9	just remove this xform which is subsumed by others. llvm-svn: 92775	2010-01-05 21:16:30 +00:00
Chris Lattner	b82a840eb2	move a trunc-specific transform out of commonIntCastTransforms into visitTrunc. llvm-svn: 92773	2010-01-05 21:11:17 +00:00
Benjamin Kramer	d2564e3afb	Move remaining stuff to the isInteger predicate. llvm-svn: 92771	2010-01-05 21:05:54 +00:00
Chris Lattner	fd7e42b65d	move a zext specific xform out of commonIntCastTransforms into visitZExt and modernize it. llvm-svn: 92770	2010-01-05 21:04:47 +00:00
Chris Lattner	aaccc8de62	move a trunc-specific xform out of commonIntCastTransforms into visitTrunc llvm-svn: 92768	2010-01-05 20:57:30 +00:00
Chris Lattner	dec6847bf6	reduce indentation llvm-svn: 92766	2010-01-05 20:56:24 +00:00
Benjamin Kramer	a81a6dff0d	Convert a ton of simple integer type equality tests to the new predicate. llvm-svn: 92760	2010-01-05 20:07:06 +00:00
Chris Lattner	54f4e39956	optimize comparisons against cttz/ctlz/ctpop, patch by Alastair Lynn! llvm-svn: 92745	2010-01-05 18:09:56 +00:00
Dan Gohman	c3c031bb37	Nick Lewycky pointed out that this code makes changes unconditionally. llvm-svn: 92739	2010-01-05 17:50:58 +00:00
Dan Gohman	b5358003fb	Set Changed properly after calling DeleteDeadPHIs. llvm-svn: 92735	2010-01-05 16:31:45 +00:00
Dan Gohman	28943873e6	Use do+while instead of while for loops which obviously have a non-zero trip count. Use SmallVector's pop_back_val(). llvm-svn: 92734	2010-01-05 16:27:25 +00:00
Dan Gohman	92fdb96474	Fix indentation. llvm-svn: 92733	2010-01-05 16:20:55 +00:00
Dan Gohman	cb99fe9839	Make RecursivelyDeleteTriviallyDeadInstructions, RecursivelyDeleteDeadPHINode, and DeleteDeadPHIs return a flag indicating whether they made any changes. llvm-svn: 92732	2010-01-05 15:45:31 +00:00
Benjamin Kramer	f7cc698b69	Add newline at EOF. llvm-svn: 92727	2010-01-05 13:32:48 +00:00
Benjamin Kramer	ccce8bae14	Avoid going through the LLVMContext for type equality where it's safe to dereference the type pointer. llvm-svn: 92726	2010-01-05 13:12:22 +00:00
Chris Lattner	223812d547	prune some #includes. llvm-svn: 92712	2010-01-05 07:54:43 +00:00
Chris Lattner	0a8191ee88	split and/or/xor out into one overly-large (2000LOC) file. However, I think it does make sense to keep them together, at least for now. llvm-svn: 92711	2010-01-05 07:50:36 +00:00
Chris Lattner	ed41b14f54	missed file with previous commit. llvm-svn: 92710	2010-01-05 07:45:02 +00:00
Chris Lattner	dc67e13442	split instcombine of shifts out to its own file. llvm-svn: 92709	2010-01-05 07:44:46 +00:00
Chris Lattner	e903f38b4d	eliminate getBitCastOperand and simplify some over-complex inbounds stuff. llvm-svn: 92708	2010-01-05 07:42:10 +00:00
Chris Lattner	7a9e47ac4b	split call handling out to InstCombineCalls.cpp llvm-svn: 92707	2010-01-05 07:32:13 +00:00
Chris Lattner	9da1cb243b	optimize cttz and ctlz when we can prove something about the leading/trailing bits. Patch by Alastair Lynn! llvm-svn: 92706	2010-01-05 07:23:56 +00:00
Chris Lattner	85e65e58ac	this inline function moved to addsub llvm-svn: 92705	2010-01-05 07:20:54 +00:00
Chris Lattner	82aa888e8c	split add/sub out to its own file. Eliminate use of dyn_castNotVal in the X+~X transform. dyn_castNotVal is dramatic overkill for what the xform needed. llvm-svn: 92704	2010-01-05 07:18:46 +00:00
Chris Lattner	c7de92ae15	all the places we use hasOneUse() we know are instructions, so inline and simplify. llvm-svn: 92700	2010-01-05 07:04:23 +00:00
Chris Lattner	c6493f070e	eliminate AssociativeOpt and its last uses. llvm-svn: 92697	2010-01-05 07:01:16 +00:00
Chris Lattner	94694c7f0b	inline the FoldICmpLogical functor. llvm-svn: 92695	2010-01-05 06:59:49 +00:00

... 11 12 13 14 15 ...

7461 Commits