llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	24716b6c63	It turns out that this #include is needed because otherwise ValueMapper.cpp ends up calling an out of line __ZNK4llvm12PATypeHolder3getEv, which is a template and llvm-config determines arbitrarily to use the one in libipo. This sucks, but keeping the #include is a reasonable workaround. llvm-svn: 94103	2010-01-21 21:29:25 +00:00
Chris Lattner	9889b4be04	unbreak the build, apparently without this transformutils starts depending on libipa? llvm-svn: 94102	2010-01-21 21:20:51 +00:00
Chris Lattner	e39837d5ee	tidy up llvm-svn: 94101	2010-01-21 21:05:54 +00:00
Victor Hernandez	a9ad174b49	Don't need to include IntrinsicInst.h any more llvm-svn: 94092	2010-01-21 19:33:59 +00:00
Victor Hernandez	d089f4e10b	No need to map NULL operands of metadata llvm-svn: 94091	2010-01-21 19:26:20 +00:00
Dan Gohman	51ad99d2c5	Re-implement the main strength-reduction portion of LoopStrengthReduction. This new version is much more aggressive about doing "full" reduction in cases where it reduces register pressure, and also more aggressive about rewriting induction variables to count down (or up) to zero when doing so reduces register pressure. It currently uses fairly simplistic algorithms for finding reuse opportunities, but it introduces a new framework allows it to combine multiple strategies at once to form hybrid solutions, instead of doing all full-reduction or all base+index. llvm-svn: 94061	2010-01-21 02:09:26 +00:00
Eric Christopher	fa863258d0	Add strcpy_chk -> strcpy support for "don't know" object size answers. This will update as object size checking gets better information. llvm-svn: 94059	2010-01-21 01:04:38 +00:00
Chris Lattner	3c5bf71353	simplify this code. llvm-svn: 94048	2010-01-20 23:30:28 +00:00
Jakob Stoklund Olesen	8a19d3c96c	Move per-function inline threshold calculation to a method. No functional change except the forgotten test for InlineLimit.getNumOccurrences() == 0 in the CurrentThreshold2 calculation. llvm-svn: 94007	2010-01-20 17:51:28 +00:00
Victor Hernandez	f2462407ee	Switch Elts from vector to SmallVector llvm-svn: 93989	2010-01-20 06:56:16 +00:00
Victor Hernandez	5fa88d4e30	Map operands of all function-local metadata, not just metadata passed to llvm.dbg.declare intrinsics llvm-svn: 93979	2010-01-20 05:49:59 +00:00
Dan Gohman	ca19445d08	When doing address-mode sinking, expand the base register first, rather than the scaled register. This makes it more likely that subsequent AddrModeMatcher queries will match the new address the same way as the old, instead of accidentally matching what had been the base register as the new scaled register, and then failing to match the scaled register. This fixes some problems with address-mode sinking multiple muls into a block, which will be a lot more common with some upcoming LoopStrengthReduction changes. llvm-svn: 93935	2010-01-19 22:45:06 +00:00
Chris Lattner	18f49ce2d3	optimize ~(~X >>s Y) --> (X >>s Y), patch by Edmund Grimley Evans! llvm-svn: 93884	2010-01-19 18:16:19 +00:00
Bob Wilson	58d59fe394	Fix a crash in scalarrepl for memcpy/memmove where the source and destination are the same. I had already fixed a similar problem where the source and destination were different bitcasts derived from the same alloca, but the previous fix still did not handle the case where both operands are exactly the same value. Radar 7552893. llvm-svn: 93848	2010-01-19 04:32:48 +00:00
Eric Christopher	84bd316bd6	Fix comment. llvm-svn: 93831	2010-01-19 01:20:15 +00:00
Chris Lattner	43f2fa6201	my instcombine transformations to make extension elimination more aggressive changed the canonical form from sext(trunc(x)) to ashr(lshr(x)), make sure to transform a couple more things into that canonical form, and catch a case where we missed turning zext/shl/ashr into a single sext. llvm-svn: 93787	2010-01-18 22:19:16 +00:00
Devang Patel	696cb8d410	While mapping llvm.dbg.declare intrinsic manually map its operand, if possible, because it points to an alloca instruction through metadata. llvm-svn: 93757	2010-01-18 19:52:14 +00:00
Owen Anderson	cdea3572fa	Convert some of the dynamic opcode lookups into static ones. llvm-svn: 93693	2010-01-17 19:33:27 +00:00
Owen Anderson	fa1edea9ce	Fix comment. llvm-svn: 93679	2010-01-17 06:49:03 +00:00
Bob Wilson	e0da4b6cff	Fix a comment typo. llvm-svn: 93560	2010-01-15 21:55:02 +00:00
Bill Wendling	ad7a5b07a7	When the visitSub method was split into visitSub and visitFSub, this xform was added to the FSub version. However, the original version of this xform guarded against doing this for floating point (!Op0->getType()->isFPOrFPVector()). This is causing LLVM to perform incorrect xforms for code like: void func(double rhi, double rlo, double xh, double xl, double yh, double yl){ double mh, ml; double c = 134217729.0; double up, u1, u2, vp, v1, v2; up = xhc; u1 = (xh - up) + up; u2 = xh - u1; vp = yhc; v1 = (yh - vp) + vp; v2 = yh - v1; mh = xhyh; ml = (((u1v1 - mh) + (u1v2)) + (u2v1)) + (u2v2); ml += xhyl + xlyh; rhi = mh + ml; rlo = (mh - (rhi)) + ml; } The last line was optimized away, but rl is intended to be the difference between the infinitely precise result of mh + ml and after it has been rounded to double precision. llvm-svn: 93369	2010-01-13 23:23:17 +00:00
Chris Lattner	573da8ac90	1) Use the new SimplifyInstructionsInBlock routine instead of the copy in JT. 2) When cloning blocks for PHI or xor conditions, use instsimplify to simplify the code as we go. This allows us to squish common cases early in JT which opens up opportunities for subsequent iterations, and allows it to completely simplify the testcase. llvm-svn: 93253	2010-01-12 20:41:47 +00:00
Chris Lattner	7c743f2c74	add a helper function. llvm-svn: 93251	2010-01-12 19:40:54 +00:00
Chris Lattner	af7855d571	tidy up llvm-svn: 93222	2010-01-12 02:07:50 +00:00
Chris Lattner	eb73bdb2e1	Teach jump threading to duplicate small blocks when the branch condition is a xor with a phi node. This eliminates nonsense like this from 176.gcc in several places: LBB166_84: testl %eax, %eax - setne %al - xorb %cl, %al - notb %al - testb $1, %al - je LBB166_85 + je LBB166_69 + jmp LBB166_85 This is rdar://7391699 llvm-svn: 93221	2010-01-12 02:07:17 +00:00
Chris Lattner	6a19ed0b86	some cleanup, and make it obvious that ProcessJumpOnPHI only works on branches by renaming it and checking for a branch at the call site. llvm-svn: 93208	2010-01-11 23:41:09 +00:00
Chris Lattner	d1a3efedd8	reenable the piece that turns trunc(zext(x)) -> x even if zext has multiple uses, codegen has no apparent problem with the trunc version of this, because it turns into a simple subreg idiom llvm-svn: 93202	2010-01-11 22:49:40 +00:00
Chris Lattner	a6b1356cf9	Disable folding sext(trunc(x)) -> x (and other similar cast/cast cases) when the trunc has multiple uses. Codegen is not able to coalesce the subreg case correctly and so this leads to higher register pressure and spilling (see PR5997). This speeds up 256.bzip2 from 8.60 -> 8.04s on my machine, ~7%. llvm-svn: 93200	2010-01-11 22:45:25 +00:00
Chris Lattner	9518869423	add one more bitfield optimization, allowing clang to generate good code on PR4216: _test_bitfield: ## @test_bitfield orl $32962, %edi movl $4294941946, %eax andq %rdi, %rax ret instead of: _test_bitfield: movl $4294941696, %ecx movl %edi, %eax orl $194, %edi orl $32768, %eax andq $250, %rdi andq %rax, %rcx movq %rdi, %rax orq %rcx, %rax ret Evan is looking into the remaining andq+imm -> andl optimization. llvm-svn: 93147	2010-01-11 06:55:24 +00:00
Chris Lattner	0a85420409	Extend CanEvaluateZExtd to handle and/or/xor more aggressively in the BitsToClear case. This allows it to promote expressions which have an and/or/xor after the lshr, promoting cases like test2 (from PR4216) and test3 (random extample extracted from a spec benchmark). clang now compiles the code in PR4216 into: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx orl $32768, %edi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret instead of: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx shrl $8, %edi orl $128, %edi shlq $8, %rdi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret which is still not great, but is progress. llvm-svn: 93145	2010-01-11 04:05:13 +00:00
Chris Lattner	12bd8992b3	Remove the dead TD argument to CanEvaluateZExtd, and add a new BitsToClear result which allows us to start promoting expressions that end with a lshr-by-constant. This is conservatively correct and better than what we had before (see testcases) but still needs to be extended further. llvm-svn: 93144	2010-01-11 03:32:00 +00:00
Chris Lattner	172630abd2	improve comments, remove dead TD argument to CanEvaluateSExtd. llvm-svn: 93143	2010-01-11 02:43:35 +00:00
Chris Lattner	7dd540ee24	teach sext optimization to handle truncs from types that are not the dest of the sext. llvm-svn: 93128	2010-01-10 20:30:41 +00:00
Chris Lattner	39d2daa94c	teach zext optimization how to deal with truncs that don't come from the zext dest type. This allows us to handle test52/53 in cast.ll, and allows llvm-gcc to generate much better code for PR4216 in -m64 mode: _test_bitfield: ## @test_bitfield orl $32962, %edi movl %edi, %eax andl $-25350, %eax ret This also fixes a bug handling vector extends, ensuring that the mask produced is a vector constant, not an integer constant. llvm-svn: 93127	2010-01-10 20:25:54 +00:00
Chris Lattner	1a05fddcdc	simplify CanEvaluateSExtd to return a bool now that we have a simpler profitability predicate. llvm-svn: 93111	2010-01-10 07:57:20 +00:00
Chris Lattner	d7816780e2	the NumCastsRemoved argument to CanEvaluateSExtd is dead, remove it. llvm-svn: 93110	2010-01-10 07:42:21 +00:00
Chris Lattner	2fff10c424	now that the cost model has changed, we can always consider elimination of a sign extend to be a win, which simplifies the client of CanEvaluateSExtd, and allows us to eliminate more casts (examples taken from real code). llvm-svn: 93109	2010-01-10 07:40:50 +00:00
Chris Lattner	d8509424a4	change the preferred canonical form for a sign extension to be lshr+ashr instead of trunc+sext. We want to avoid type conversions whenever possible, it is easier to codegen expressions without truncates and extensions. llvm-svn: 93107	2010-01-10 07:08:30 +00:00
Chris Lattner	2b459fe7e1	fix indentation of switch statements, no functionality change. llvm-svn: 93106	2010-01-10 06:59:55 +00:00
Chris Lattner	127bbc715e	fix pasto that broke bootstrap. llvm-svn: 93105	2010-01-10 06:50:04 +00:00
Chris Lattner	b7be7cc486	simplify CanEvaluateZExtd now that we don't care about the number of bits known clear in the result and don't care about the # casts eliminated. TD is also dead but keeping it for now. llvm-svn: 93098	2010-01-10 02:50:04 +00:00
Chris Lattner	49d2c9764d	two changes: 1) don't try to optimize a sext or zext that is only used by a trunc, let the trunc get optimized first. This avoids some pointless effort in some common cases since instcombine scans down a block in the first pass. 2) Change the cost model for zext elimination to consider an 'and' cheaper than a zext. This allows us to do it more aggressively, and for the next patch to simplify the code quite a bit. llvm-svn: 93097	2010-01-10 02:39:31 +00:00
Chris Lattner	f0af17dab3	enhance CanEvaluateZExtd to handle shift left and sext, allowing more expressions to be promoted and casts eliminated. llvm-svn: 93096	2010-01-10 02:22:12 +00:00
Chris Lattner	7723e2b10f	remove an xform subsumed by EvaluateInDifferentType. llvm-svn: 93095	2010-01-10 01:35:55 +00:00
Julien Lerouge	321098ebec	Fix nondeterministic behavior. llvm-svn: 93093	2010-01-10 01:07:22 +00:00
Chris Lattner	c95a7a21b7	clean up this xform by using m_Trunc. llvm-svn: 93092	2010-01-10 01:04:31 +00:00
Chris Lattner	883550afe8	inline and remove the rest of commonIntCastTransforms. llvm-svn: 93091	2010-01-10 01:00:46 +00:00
Chris Lattner	c3aca38468	Inline the expression type promotion/demotion stuff out of commonIntCastTransforms into the callers, eliminating a switch, and allowing the static predicate methods to be moved down to live next to the corresponding function. No functionality change. llvm-svn: 93089	2010-01-10 00:58:42 +00:00
Chris Lattner	ab7087ad66	only factor from expressions whose uses are empty and whose base is the right expression type. This fixes PR5981. llvm-svn: 93045	2010-01-09 06:01:36 +00:00
Julien Lerouge	f50a3f19da	Fix nondeterministic behavior. llvm-svn: 93038	2010-01-09 01:06:49 +00:00
Eric Christopher	4a1d7e1506	Remove unnecessary dyn_cast and add a comment. Part of a WIP. llvm-svn: 93026	2010-01-08 21:37:11 +00:00
Chris Lattner	9242ae047c	mplement a theoretical fixme. llvm-svn: 93024	2010-01-08 19:28:47 +00:00
Chris Lattner	10840e9e13	rename CanEvaluateInDifferentType -> CanEvaluateTruncated and simplify it now that it is only used for truncates. llvm-svn: 93021	2010-01-08 19:19:23 +00:00
Chris Lattner	a1e223ea10	teach instcombine to delete sign extending shift pairs (sra(shl X, C), C) when the input is already sign extended. llvm-svn: 93019	2010-01-08 19:04:21 +00:00
Duncan Sands	4a8b15dc74	Suppress an unused variable warning when assertions are off; remove some trailing whitespace while there. llvm-svn: 93008	2010-01-08 17:51:48 +00:00
Chris Lattner	8c92b57df9	tidy up some stuff duncan pointed out. llvm-svn: 93007	2010-01-08 17:48:19 +00:00
Chris Lattner	35d3b9dcd0	teach ComputeNumSignBits to look through PHI nodes. llvm-svn: 92964	2010-01-07 23:44:37 +00:00
Chris Lattner	3057c37959	Enhance instcombine to reason more strongly about promoting computation that feeds into a zext, similar to the patch I did yesterday for sext. There is a lot of room for extension beyond this patch. llvm-svn: 92962	2010-01-07 23:41:00 +00:00
Benjamin Kramer	76e2766442	Use a do-while loop instead of while + boolean. llvm-svn: 92912	2010-01-07 13:50:07 +00:00
Duncan Sands	f117880ab0	Be less stingy as to how many selects and phi nodes we are prepared to look through. llvm-svn: 92898	2010-01-07 05:48:42 +00:00
Chris Lattner	9855a6bb7c	handle ConstantVector while I'm in here. llvm-svn: 92892	2010-01-07 01:20:20 +00:00
Chris Lattner	64ecc468bd	fix a globalopt crash on 'bullet' (handling evaluation of a store to an element of a vector in a static ctor) which occurs with an unrelated patch I'm testing. Annoyingly, EvaluateStoreInto basically does exactly the same stuff as InsertElement constant folding, but it now handles vectors, and you can't insertelement into a vector. It would be 'really nice' if GEP into a vector were not legal. llvm-svn: 92889	2010-01-07 01:16:21 +00:00
Eric Christopher	2cdb806fd8	Move the object size intrinsic optimization to inst-combine and make it work for any integer size return type. llvm-svn: 92853	2010-01-06 20:04:44 +00:00
Duncan Sands	c8493da5b1	Fix a README item: have functionattrs look through selects and phi nodes when deciding which pointers point to local memory. I actually checked long ago how useful this is, and it isn't very: it hardly ever fires in the testsuite, but since Chris wants it here it is! llvm-svn: 92836	2010-01-06 15:37:47 +00:00
Mikhail Glushenkov	40d2429b28	Formatting. llvm-svn: 92831	2010-01-06 09:20:39 +00:00
Duncan Sands	78376ad7e1	Partially address a README by having functionattrs consider calls to memcpy, memset and other intrinsics that only access their arguments to be readnone if the intrinsic's arguments all point to local memory. This improves the testcase in the README to readonly, but it could in theory be made readnone, however this would involve more sophisticated analysis that looks through the memcpy. llvm-svn: 92829	2010-01-06 08:45:52 +00:00
Chris Lattner	4339f2abdb	tweaks suggested by Duncan llvm-svn: 92824	2010-01-06 05:32:15 +00:00
Chris Lattner	98748c0964	Teach instcombine's sext elimination logic to be more aggressive. Previously, instcombine would only promote an expression tree to the larger type if doing so eliminated two casts. This is because a need to manually do the sign extend after the promoted expression tree with two shifts. Now, we keep track of whether the result of the computation is going to be properly sign extended already. If so, we can unconditionally promote the expression, which allows us to zap more sext's. This implements rdar://6598839 (aka gcc pr38751) llvm-svn: 92815	2010-01-06 01:56:21 +00:00
Chris Lattner	8600dd3d7c	simplify this code. llvm-svn: 92800	2010-01-05 23:00:30 +00:00
Chris Lattner	554d0564ff	make this a static function instead of a method. llvm-svn: 92795	2010-01-05 22:30:42 +00:00
Chris Lattner	a93c63c22d	more rearrangement and cleanup, fix my test failure. llvm-svn: 92792	2010-01-05 22:21:18 +00:00
Chris Lattner	f476ef502c	cleanup llvm-svn: 92790	2010-01-05 22:07:33 +00:00
Chris Lattner	f88dd5ed64	remove two trunc xforms that are subsumed by EvaluateInDifferentType. The only difference is that EvaluateInDifferentType checks to ensure they are profitable before doing them :) llvm-svn: 92788	2010-01-05 22:01:41 +00:00
Chris Lattner	44a63815b9	just remove this xform which is subsumed by others. llvm-svn: 92775	2010-01-05 21:16:30 +00:00
Chris Lattner	b82a840eb2	move a trunc-specific transform out of commonIntCastTransforms into visitTrunc. llvm-svn: 92773	2010-01-05 21:11:17 +00:00
Benjamin Kramer	d2564e3afb	Move remaining stuff to the isInteger predicate. llvm-svn: 92771	2010-01-05 21:05:54 +00:00
Chris Lattner	fd7e42b65d	move a zext specific xform out of commonIntCastTransforms into visitZExt and modernize it. llvm-svn: 92770	2010-01-05 21:04:47 +00:00
Chris Lattner	aaccc8de62	move a trunc-specific xform out of commonIntCastTransforms into visitTrunc llvm-svn: 92768	2010-01-05 20:57:30 +00:00
Chris Lattner	dec6847bf6	reduce indentation llvm-svn: 92766	2010-01-05 20:56:24 +00:00
Benjamin Kramer	a81a6dff0d	Convert a ton of simple integer type equality tests to the new predicate. llvm-svn: 92760	2010-01-05 20:07:06 +00:00
Chris Lattner	54f4e39956	optimize comparisons against cttz/ctlz/ctpop, patch by Alastair Lynn! llvm-svn: 92745	2010-01-05 18:09:56 +00:00
Dan Gohman	c3c031bb37	Nick Lewycky pointed out that this code makes changes unconditionally. llvm-svn: 92739	2010-01-05 17:50:58 +00:00
Dan Gohman	b5358003fb	Set Changed properly after calling DeleteDeadPHIs. llvm-svn: 92735	2010-01-05 16:31:45 +00:00
Dan Gohman	28943873e6	Use do+while instead of while for loops which obviously have a non-zero trip count. Use SmallVector's pop_back_val(). llvm-svn: 92734	2010-01-05 16:27:25 +00:00
Dan Gohman	92fdb96474	Fix indentation. llvm-svn: 92733	2010-01-05 16:20:55 +00:00
Dan Gohman	cb99fe9839	Make RecursivelyDeleteTriviallyDeadInstructions, RecursivelyDeleteDeadPHINode, and DeleteDeadPHIs return a flag indicating whether they made any changes. llvm-svn: 92732	2010-01-05 15:45:31 +00:00
Benjamin Kramer	f7cc698b69	Add newline at EOF. llvm-svn: 92727	2010-01-05 13:32:48 +00:00
Benjamin Kramer	ccce8bae14	Avoid going through the LLVMContext for type equality where it's safe to dereference the type pointer. llvm-svn: 92726	2010-01-05 13:12:22 +00:00
Chris Lattner	223812d547	prune some #includes. llvm-svn: 92712	2010-01-05 07:54:43 +00:00
Chris Lattner	0a8191ee88	split and/or/xor out into one overly-large (2000LOC) file. However, I think it does make sense to keep them together, at least for now. llvm-svn: 92711	2010-01-05 07:50:36 +00:00
Chris Lattner	ed41b14f54	missed file with previous commit. llvm-svn: 92710	2010-01-05 07:45:02 +00:00
Chris Lattner	dc67e13442	split instcombine of shifts out to its own file. llvm-svn: 92709	2010-01-05 07:44:46 +00:00
Chris Lattner	e903f38b4d	eliminate getBitCastOperand and simplify some over-complex inbounds stuff. llvm-svn: 92708	2010-01-05 07:42:10 +00:00
Chris Lattner	7a9e47ac4b	split call handling out to InstCombineCalls.cpp llvm-svn: 92707	2010-01-05 07:32:13 +00:00
Chris Lattner	9da1cb243b	optimize cttz and ctlz when we can prove something about the leading/trailing bits. Patch by Alastair Lynn! llvm-svn: 92706	2010-01-05 07:23:56 +00:00
Chris Lattner	85e65e58ac	this inline function moved to addsub llvm-svn: 92705	2010-01-05 07:20:54 +00:00
Chris Lattner	82aa888e8c	split add/sub out to its own file. Eliminate use of dyn_castNotVal in the X+~X transform. dyn_castNotVal is dramatic overkill for what the xform needed. llvm-svn: 92704	2010-01-05 07:18:46 +00:00
Chris Lattner	c7de92ae15	all the places we use hasOneUse() we know are instructions, so inline and simplify. llvm-svn: 92700	2010-01-05 07:04:23 +00:00
Chris Lattner	c6493f070e	eliminate AssociativeOpt and its last uses. llvm-svn: 92697	2010-01-05 07:01:16 +00:00
Chris Lattner	94694c7f0b	inline the FoldICmpLogical functor. llvm-svn: 92695	2010-01-05 06:59:49 +00:00
Chris Lattner	98d48a0b76	inline the 'AddRHS' transformation, simplifying things significantly. Eliminate the 'AddMaskingAnd' transformation, it is redundant with this more general code right below it: // A+B --> A\|B iff A and B have no bits set in common. llvm-svn: 92693	2010-01-05 06:29:13 +00:00
Chris Lattner	39b063bf37	remove massive over-genality manifested as a big template that got instantiated. There is no reason for instcombine to try this hard for simple associative optimizations. Next up, eliminate the template completely. llvm-svn: 92692	2010-01-05 06:24:06 +00:00
Chris Lattner	dc054bf39a	split mul/div/rem instructions out to their own file. llvm-svn: 92689	2010-01-05 06:09:35 +00:00
Chris Lattner	1e7b7b50b1	clean up header. llvm-svn: 92688	2010-01-05 06:05:07 +00:00
Chris Lattner	8f771cb78f	split select out to its own file. llvm-svn: 92687	2010-01-05 06:03:12 +00:00
Chris Lattner	a65e2f7304	split out load/store/alloca. llvm-svn: 92685	2010-01-05 05:57:49 +00:00
Chris Lattner	841af4f03d	reduce indentation llvm-svn: 92684	2010-01-05 05:42:08 +00:00
Chris Lattner	ec97a90221	split vector stuff out to InstCombineVectorOps.cpp llvm-svn: 92683	2010-01-05 05:36:20 +00:00
Chris Lattner	de1feded32	split PHI node stuff out to InstCombinePHI.cpp llvm-svn: 92682	2010-01-05 05:31:55 +00:00
Chris Lattner	27acfcd1c4	convert various IntrinsicInst's to use class instead of struct. llvm-svn: 92681	2010-01-05 05:21:26 +00:00
Chris Lattner	f741d72b84	fix an infinite loop in reassociate building emacs. llvm-svn: 92679	2010-01-05 04:55:35 +00:00
David Greene	cf0addf927	Change errs() to dbgs(). llvm-svn: 92639	2010-01-05 01:28:37 +00:00
David Greene	6ef94ad615	Change errs() to dbgs(). llvm-svn: 92636	2010-01-05 01:28:29 +00:00
David Greene	74e8bd05cc	Change errs() to dbgs(). llvm-svn: 92633	2010-01-05 01:28:12 +00:00
David Greene	9fcfd96da9	Change errs() to dbgs(). llvm-svn: 92631	2010-01-05 01:28:07 +00:00
David Greene	44cb8ade45	Change errs() to dbgs(). llvm-svn: 92629	2010-01-05 01:28:05 +00:00
David Greene	8306b60d56	Change errs() to dbgs(). llvm-svn: 92627	2010-01-05 01:27:54 +00:00
David Greene	0122fc495d	Change errs() to dbgs(). llvm-svn: 92625	2010-01-05 01:27:51 +00:00
David Greene	241992382e	Change errs() to dbgs(). llvm-svn: 92624	2010-01-05 01:27:47 +00:00
David Greene	e0b9789593	Change errs() to dbgs(). llvm-svn: 92623	2010-01-05 01:27:44 +00:00
David Greene	6bc0776343	Change errs() to dbgs(). llvm-svn: 92622	2010-01-05 01:27:39 +00:00
David Greene	3a79df0993	Change errs() to dbgs(). llvm-svn: 92620	2010-01-05 01:27:33 +00:00
David Greene	0fd862254e	Change errs() to dbgs(). llvm-svn: 92619	2010-01-05 01:27:30 +00:00
David Greene	d17c3916d0	Change errs() to dbgs(). llvm-svn: 92617	2010-01-05 01:27:24 +00:00
David Greene	9ddc6e2e12	Change errs() to dbgs(). llvm-svn: 92615	2010-01-05 01:27:21 +00:00
David Greene	1efdb45562	Change errs() to dbgs(). llvm-svn: 92614	2010-01-05 01:27:19 +00:00
David Greene	2e6efc441f	Change errs() to dbgs(). llvm-svn: 92613	2010-01-05 01:27:17 +00:00
David Greene	389fc3b9f6	Change errs() to dbgs(). llvm-svn: 92612	2010-01-05 01:27:15 +00:00
David Greene	74e2d4917d	Change errs() to dbgs(). llvm-svn: 92611	2010-01-05 01:27:11 +00:00
David Greene	48c86bedbd	Change errs() to dbgs(). llvm-svn: 92610	2010-01-05 01:27:09 +00:00
David Greene	0dd384cfd0	Change errs() to dbgs(). llvm-svn: 92609	2010-01-05 01:27:06 +00:00
David Greene	d9c355d590	Change errs() to dbgs(). llvm-svn: 92608	2010-01-05 01:27:04 +00:00
David Greene	b72ad95ecf	Change errs() to dbgs(). llvm-svn: 92607	2010-01-05 01:27:01 +00:00
David Greene	084b0dde9d	Change errs() to dbgs(). llvm-svn: 92606	2010-01-05 01:26:57 +00:00
David Greene	76a4e852f8	Change errs() to dbgs(). llvm-svn: 92605	2010-01-05 01:26:54 +00:00
David Greene	725c7c3f2e	Change errs() to dbgs(). llvm-svn: 92604	2010-01-05 01:26:52 +00:00
David Greene	3774a38fdf	Change errs() to dbgs(). llvm-svn: 92603	2010-01-05 01:26:49 +00:00
David Greene	50c54238e4	Change errs() to dbgs(). llvm-svn: 92602	2010-01-05 01:26:45 +00:00
David Greene	0ad6dce031	Change errs() to dbgs(). llvm-svn: 92601	2010-01-05 01:26:44 +00:00
David Greene	627f40a9f2	Change errs() to dbgs(). llvm-svn: 92600	2010-01-05 01:26:41 +00:00
David Greene	a8a32dd987	Change errs() to dbgs(). llvm-svn: 92599	2010-01-05 01:26:39 +00:00
Devang Patel	be94f23992	Remove dead debug info intrinsics. Intrinsic::dbg_stoppoint Intrinsic::dbg_region_start Intrinsic::dbg_region_end Intrinsic::dbg_func_start AutoUpgrade simply ignores these intrinsics now. llvm-svn: 92557	2010-01-05 01:10:40 +00:00
Daniel Dunbar	72a87448c1	Fix some struct/class specifier mismatches. llvm-svn: 92550	2010-01-05 00:15:58 +00:00
Chris Lattner	a751d09c08	Truncate GEP indexes larger than the pointer size down to pointer size when doing this transform if the GEP is not inbounds. No testcase because it is very difficult to trigger this: instcombine already canonicalizes GEP indices to pointer size, so it relies specific permutations of the instcombine worklist. Thanks to Duncan for pointing this possible problem out. llvm-svn: 92495	2010-01-04 18:57:15 +00:00
Chris Lattner	2cb08e69b1	silence a bogus 'might be used uninit' warning from GCC. llvm-svn: 92494	2010-01-04 18:48:26 +00:00
Chris Lattner	59d95743c8	move some more cast-related stuff llvm-svn: 92471	2010-01-04 07:59:07 +00:00
Mikhail Glushenkov	6a8ac8ce8f	80-col violations, trailing whitespace. llvm-svn: 92470	2010-01-04 07:55:25 +00:00
Chris Lattner	92be2adba6	move the [Can]EvaluateInDifferentType functions out to InstCombineCasts.cpp llvm-svn: 92469	2010-01-04 07:54:59 +00:00
Chris Lattner	2b295a0eba	split 943 lines of instcombine out to a new InstCombineCasts.cpp file. InstructionCombining.cpp is now down to a svelte 9300 lines :) llvm-svn: 92468	2010-01-04 07:53:58 +00:00
Chris Lattner	2188e40e4c	split instcombine of compares (visit[FI]Cmp) out to a new InstCombineCompares.cpp file. llvm-svn: 92467	2010-01-04 07:37:31 +00:00
Chris Lattner	6ea40f1542	update cmakefile llvm-svn: 92466	2010-01-04 07:19:55 +00:00
Chris Lattner	7e0449172c	move the 'SimplifyDemandedFoo' methods out to their own file, cutting 1K lines out of instcombine.cpp llvm-svn: 92465	2010-01-04 07:17:19 +00:00
Chris Lattner	35522b7465	split the instcombine class definition out to a header shared among the instcombine library. llvm-svn: 92463	2010-01-04 07:12:23 +00:00
Chris Lattner	b8906bda13	remove a ton of unneeded LLVMContext stuff. llvm-svn: 92462	2010-01-04 07:02:48 +00:00
Chris Lattner	66c2e54bcd	move InstCombineWorklist out to its own header. llvm-svn: 92461	2010-01-04 06:30:00 +00:00
Chris Lattner	e2b9da98b0	forgot to svn add these. llvm-svn: 92460	2010-01-04 06:28:20 +00:00
Chris Lattner	c0e6640d3a	move instcombine to its own library, it's past time. llvm-svn: 92459	2010-01-04 06:23:24 +00:00
Chris Lattner	2d91231d82	implement an instcombine xform needed by clang's codegen on the example in PR4216. This doesn't trigger in the testsuite, so I'd really appreciate someone scrutinizing the logic for correctness. llvm-svn: 92458	2010-01-04 06:03:59 +00:00
Chris Lattner	48218e42cd	pull my debug hooks out, I'm done with this xform for now. llvm-svn: 92446	2010-01-03 06:58:48 +00:00
Nick Lewycky	475d3d1215	Small cleanups, refactor some duplicated code into a single method. No functionality change. llvm-svn: 92445	2010-01-03 04:39:07 +00:00
Chris Lattner	fca0c8f93a	generalize the previous transformation to handle indexing into arrays of structs and other arrays, so long as all the subsequent indexes are constants. This triggers frequently for stuff like: @divisions = internal constant [29 x [2 x i32]] [[2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 2], [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2]], align 32 ; <[29 x [2 x i32]]> [#uses=50] %623 = getelementptr inbounds [29 x [2 x i32]] @divisions, i64 0, i64 %619, i64 0 ; <i32*> [#uses=1] %684 = icmp eq i32 %683, 999 also for the "my_defs" table in 'gs', etc. llvm-svn: 92444	2010-01-03 03:03:27 +00:00
Nick Lewycky	ff9cd7ace7	Cleanup. llvm-svn: 92436	2010-01-03 00:55:31 +00:00
Chris Lattner	98ad2b56cc	teach instcombine to optimize idioms like A[i]&42 == 0. This occurs in 403.gcc in mode_mask_array, in safe-ctype.c (which is copied in multiple apps) in _sch_istable, etc. llvm-svn: 92427	2010-01-02 22:08:28 +00:00
Chris Lattner	b56bef45f8	Teach the table lookup optimization to generate range compares when a consequtive sequence of elements all satisfies the predicate. Like the double compare case, this generates better code than the magic constant case and generalizes to more than 32/64 element array lookups. Here are some examples where it triggers. From 403.gcc, most accesses to the rtx_class array are handled, e.g.: @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=547] %142 = icmp eq i8 %141, 105 @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=543] %165 = icmp eq i8 %164, 60 Also, most of the 59-element arrays (mode_class/rid_to_yy, etc) optimized before are actually range compares. This lets 32-bit machines optimize them. 400.perlbmk has stuff like this: 400.perlbmk: PL_regkind, even for 32-bit: @PL_regkind = constant [62 x i8] c"\00\00\02\02\02\06\06\06\06\09\09\0B\0B\0D\0E\0E\0E\11\12\12\14\14\16\16\18\18\1A\1A\1C\1C\1E\1F !!!$$&'((((,-.///88886789:;8$", align 32 ; <[62 x i8]> [#uses=4] %811 = icmp ne i8 %810, 33 @PL_utf8skip = constant [256 x i8] c"\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\04\04\04\04\04\04\04\04\05\05\05\05\06\06\07\0D", align 32 ; <[256 x i8]> [#uses=94] %12 = icmp ult i8 %10, 2 etc. llvm-svn: 92426	2010-01-02 21:50:18 +00:00
Chris Lattner	e199d2df80	theoretically the negate we find could be in a different function, check for this case. llvm-svn: 92425	2010-01-02 21:46:33 +00:00
Chris Lattner	2fa4ec70fc	use enums for the over/underdefined markers for clarity. Switch to using -2/-3 instead of -1/-2 for a future xform. llvm-svn: 92423	2010-01-02 20:20:33 +00:00
Chris Lattner	351e22aa36	remove the random sampling framework, which is not maintained anymore. If there is interest, it can be resurrected from SVN. PR4912. llvm-svn: 92422	2010-01-02 20:07:03 +00:00
Nick Lewycky	a67519be12	Fix logic error in previous commit. The != case needs to become an or, not an and. llvm-svn: 92419	2010-01-02 16:14:56 +00:00
Nick Lewycky	357d41b3c1	Optimize pointer comparison into the typesafe form, now that the backends will handle them efficiently. This is the opposite direction of the transformation we used to have here. llvm-svn: 92418	2010-01-02 15:25:44 +00:00
Chris Lattner	cfda435c73	Generalize the previous xform to handle cases where exactly two elements match or don't match with two comparisons. For example, the testcase compiles into: define i1 @test5(i32 %X) { %1 = icmp eq i32 %X, 2 ; <i1> [#uses=1] %2 = icmp eq i32 %X, 7 ; <i1> [#uses=1] %R = or i1 %1, %2 ; <i1> [#uses=1] ret i1 %R } This generalizes the previous xforms when the array is larger than 64 elements (and this case matches) and generates better code for cases where it overlaps with the magic bitshift case. This generalizes more cases than you might expect. For example, 400.perlbmk has: @PL_utf8skip = constant [256 x i8] c"\01\01\01\... %15 = icmp ult i8 %7, 7 403.gcc has: @rid_to_yy = internal constant [114 x i16] [i16 259, i16 260, ... %18 = icmp eq i16 %16, 295 and xalancbmk has a bunch of examples, such as _ZN11xercesc_2_5L15gCombiningCharsE and _ZN11xercesc_2_5L10gBaseCharsE. llvm-svn: 92417	2010-01-02 09:35:17 +00:00
Chris Lattner	c6ac078423	fix a miscompilation I introduced of cdecl with a late change. llvm-svn: 92416	2010-01-02 09:22:13 +00:00
Chris Lattner	935a4a606a	enhance the compare/load/index optimization to work on any load from a global with 32/64 elements or less (depending on whether i64 is native on the target), generating a bitshift idiom to determine the result. For example, on test4 we produce: define i1 @test4(i32 %X) { %1 = lshr i32 933, %X ; <i32> [#uses=1] %2 = and i32 %1, 1 ; <i32> [#uses=1] %R = icmp ne i32 %2, 0 ; <i1> [#uses=1] ret i1 %R } This triggers in a number of interesting cases, for example, here's an fp case: @A.3255 = internal constant [4 x double] [double 4.100000e+00, double -3.900000e+00, double -1.000000e+00, double 1.000000e+00], align 32 ; <[4 x double]> [#uses=7] ... %7 = fcmp olt double %3, 0.000000e+00 In this case we make the slen2_tab global dead, which is nice: @slen2_tab = internal constant [16 x i32] [i32 0, i32 1, i32 2, i32 3, i32 0, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 2, i32 3], align 32 ; <[16 x i32]> [#uses=1] ... %204 = icmp eq i32 %46, 0 Perl has a bunch of these, also on the 'Perl_regkind' array: @Perl_yygindex = internal constant [51 x i16] [i16 0, i16 0, i16 0, i16 0, i16 374, i16 351, i16 0, i16 -12, i16 0, i16 946, i16 413, i16 -83, i16 0, i16 0, i16 0, i16 -311, i16 -13, i16 4007, i16 2893, i16 0, i16 0, i16 0, i16 0, i16 0, i16 372, i16 -8, i16 0, i16 0, i16 246, i16 -131, i16 43, i16 86, i16 208, i16 -45, i16 -169, i16 987, i16 0, i16 0, i16 0, i16 0, i16 308, i16 0, i16 -271, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0], align 32 ; <[51 x i16]> [#uses=1] ... %1364 = icmp eq i16 %1361, 0 186.crafty really likes this on 64-bit machines, because it triggers on a bunch of globals like this: @white_outpost = internal constant [64 x i8] c"\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\02\02\00\00\00\00\00\04\05\05\04\00\00\00\00\03\06\06\03\00\00\00\00\00\01\01\00\00\00\00\00\00\00\00\00\00\00", align 32 ; <[64 x i8]> [#uses=2] However the big winner is 403.gcc, which triggers hundreds of times, eliminating all the accesses to the 57-element arrays 'mode_class', mode_unit_size, mode_bitsize, regclass_map, etc. go 64-bit machines :) llvm-svn: 92415	2010-01-02 08:56:52 +00:00
Chris Lattner	b1567bd584	enhance the previous optimization to work with fcmp in addition to icmp. llvm-svn: 92412	2010-01-02 08:20:51 +00:00
Chris Lattner	a061859ccc	Teach instcombine to fold compares of loads from constant arrays with variable indices into a comparison of the index with a constant. The most common occurrence of this that I see by far is stuff like: if ("foobar"[i] == '\0') ... which we compile into: if (i == 6), saving a load and materialization of the global address. This also exposes loop trip count information to later passes in many cases. This triggers hundreds of times in xalancbmk, which is where I first noticed it, but it also triggers in many other apps. Here are a few interesting ones from various apps: @must_be_connected_without = internal constant [8 x i8] [i8 getelementptr inbounds ([3 x i8]* @.str64320, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str27283, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str71327, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str72328, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str18274, i64 0, i64 0), i8* getelementptr inbounds ([6 x i8]* @.str11267, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str32288, i64 0, i64 0), i8* null], align 32 ; <[8 x i8]> [#uses=2] %scevgep.i = getelementptr [8 x i8] @must_be_connected_without, i64 0, i64 %indvar.i ; <i8*> [#uses=1] %17 = load ... %18 = icmp eq i8 %17, null ; <i1> [#uses=1] -> icmp eq i64 %indvar.i, 7 @yytable1095 = internal constant [84 x i8] c"\12\01(\05\06\07\08\09\0A\0B\0C\0D\0E1\0F\10\11266\1D: \10\11,-,0\03'\10\11B6\04\17&\18\1945\05\06\07\08\09\0A\0B\0C\0D\0E\1E\0F\10\11\1A\1B\1C$3+>#%;<IJ=ADFEGH9KL\00\00\00C", align 32 ; <[84 x i8]> [#uses=2] %57 = getelementptr inbounds [84 x i8]* @yytable1095, i64 0, i64 %56 ; <i8> [#uses=1] %mode.0.in = getelementptr inbounds [9 x i32] @mb_mode_table, i64 0, i64 %.pn ; <i32> [#uses=1] load ... %64 = icmp eq i8 %58, 4 ; <i1> [#uses=1] -> icmp eq i64 %.pn, 35 ; <i1> [#uses=0] @gsm_DLB = internal constant [4 x i16] [i16 6554, i16 16384, i16 26214, i16 32767] %scevgep.i = getelementptr [4 x i16] @gsm_DLB, i64 0, i64 %indvar.i ; <i16*> [#uses=1] %425 = load %scevgep.i %426 = icmp eq i16 %425, -32768 ; <i1> [#uses=0] -> false llvm-svn: 92411	2010-01-02 08:12:04 +00:00
Chris Lattner	2e4be2c340	remove the instcombine transformations that are inserting nasty pointer to int casts that confuse later optimizations. See PR3351 for details. This improves but doesn't complete fix 483.xalancbmk because llvm-gcc does this xform in GCC's "fold" routine as well. Clang++ will do better I guess. llvm-svn: 92408	2010-01-02 00:31:05 +00:00
Chris Lattner	faf1337acb	add a simple instcombine xform, simplify another one to use hasAllZeroIndices() instead of hand rolling a loop. llvm-svn: 92403	2010-01-01 23:09:08 +00:00
Chris Lattner	30c0a2833d	generalize the pointer difference optimization to handle a constantexpr gep on the 'base' side of the expression. This completes comment #4 in PR3351, which comes from 483.xalancbmk. llvm-svn: 92402	2010-01-01 22:42:29 +00:00
Chris Lattner	4394f71752	teach instcombine to optimize pointer difference idioms involving constant expressions. This is a step towards comment #4 in PR3351. llvm-svn: 92401	2010-01-01 22:29:12 +00:00
Chris Lattner	9d4c5414bb	use 'match' to simplify some code. llvm-svn: 92400	2010-01-01 22:12:03 +00:00
Chris Lattner	25c87e9cf9	implement the transform requested in PR5284 llvm-svn: 92398	2010-01-01 18:34:40 +00:00
Chris Lattner	ee1f861d81	add missing line. llvm-svn: 92384	2010-01-01 01:54:08 +00:00
Chris Lattner	8330daf733	add a few trivial instcombines for llvm.powi. llvm-svn: 92383	2010-01-01 01:52:15 +00:00
Chris Lattner	0c59ac3f41	When factoring multiply expressions across adds, factor both positive and negative forms of constants together. This allows us to compile: int foo(int x, int y) { return (x-y) + (x-y) + (x-y); } into: _foo: ## @foo subl %esi, %edi leal (%rdi,%rdi,2), %eax ret instead of (where the 3 and -3 were not factored): _foo: imull $-3, 8(%esp), %ecx imull $3, 4(%esp), %eax addl %ecx, %eax ret this started out as: movl 12(%ebp), %ecx imull $3, 8(%ebp), %eax subl %ecx, %eax subl %ecx, %eax subl %ecx, %eax ret This comes from PR5359. llvm-svn: 92381	2010-01-01 01:13:15 +00:00
Chris Lattner	a552683fd4	clean up some comments. llvm-svn: 92377	2010-01-01 00:04:26 +00:00
Chris Lattner	17229a7cb8	switch from std::map to DenseMap for rank data structures. llvm-svn: 92375	2010-01-01 00:01:34 +00:00
Chris Lattner	fed3397654	reuse negates where possible instead of always creating them from scratch. This allows us to optimize test12 into: define i32 @test12(i32 %X) { %factor = mul i32 %X, -3 ; <i32> [#uses=1] %Z = add i32 %factor, 6 ; <i32> [#uses=1] ret i32 %Z } instead of: define i32 @test12(i32 %X) { %Y = sub i32 6, %X ; <i32> [#uses=1] %C = sub i32 %Y, %X ; <i32> [#uses=1] %Z = sub i32 %C, %X ; <i32> [#uses=1] ret i32 %Z } llvm-svn: 92373	2009-12-31 20:34:32 +00:00
Chris Lattner	60c2ca743d	we don't need a smallptrset to detect duplicates, the values are sorted, so we can just do a linear scan. llvm-svn: 92372	2009-12-31 19:49:01 +00:00
Chris Lattner	1d8979422a	make reassociate more careful about not leaving around dead mul's llvm-svn: 92370	2009-12-31 19:34:45 +00:00
Chris Lattner	ed18917665	remove debug llvm-svn: 92369	2009-12-31 19:25:19 +00:00
Chris Lattner	60b71b5c4d	teach reassociate to factor x+x+x -> x*3. While I'm at it, fix RemoveDeadBinaryOp to actually do something. llvm-svn: 92368	2009-12-31 19:24:52 +00:00
Chris Lattner	38abecbad0	change reassociate to use SmallVector for its key datastructures instead of std::vector. llvm-svn: 92366	2009-12-31 18:40:32 +00:00
Chris Lattner	ac61550504	change an if to an assert, fix comment. llvm-svn: 92364	2009-12-31 18:18:46 +00:00
Chris Lattner	177140ad12	move the rest of the add optimization code out to OptimizeAdd, improve some comments, simplify a bit of code. llvm-svn: 92363	2009-12-31 18:17:13 +00:00
Chris Lattner	ba1f36aa99	factor statistic updating better. llvm-svn: 92362	2009-12-31 17:51:05 +00:00
Chris Lattner	4e3a5678af	simple fix for an incorrect factoring which causes a miscompilation, PR5458. llvm-svn: 92354	2009-12-31 08:33:49 +00:00
Chris Lattner	5f8a005d38	factor code out into helper functions. llvm-svn: 92347	2009-12-31 07:59:34 +00:00
Chris Lattner	f5c2b8b8d7	switch some std::vector's to smallvector. Reduce nesting. llvm-svn: 92346	2009-12-31 07:48:51 +00:00
Chris Lattner	9039ff8912	use more modern datastructures. llvm-svn: 92344	2009-12-31 07:33:14 +00:00
Chris Lattner	bc1512c8d1	clean up -debug output. llvm-svn: 92343	2009-12-31 07:17:37 +00:00
Chris Lattner	6a0ca6aa90	fix Analysis/DebugInfo.h to not include Metadata.h. Do this by moving one method out of line and eliminating redundant checks from other methods. llvm-svn: 92337	2009-12-31 03:02:08 +00:00
Chris Lattner	9b493028df	rename "elements" of metadata to "operands". "Elements" are things that occur in types. "operands" are things that occur in values. llvm-svn: 92322	2009-12-31 01:22:29 +00:00
Benjamin Kramer	756d7086c1	Use an array instead of a SmallVector. llvm-svn: 92264	2009-12-29 11:04:52 +00:00
Chris Lattner	22e13ba4e5	prune #includes. llvm-svn: 92260	2009-12-29 09:12:29 +00:00
Chris Lattner	a0566979b7	Final step in the metadata API restructuring: move the getMDKindID/getMDKindNames methods to LLVMContext (and add convenience methods to Module), eliminating MetadataContext. Move the state that it maintains out to LLVMContext. llvm-svn: 92259	2009-12-29 09:01:33 +00:00
Chris Lattner	6311212bf9	remove useless argument. llvm-svn: 92256	2009-12-29 08:03:58 +00:00
Chris Lattner	2f2aa2b067	This is a major cleanup of the instruction metadata interfaces that I asked Devang to do back on Sep 27. Instead of going through the MetadataContext class with methods like getMD() and getMDs(), just ask the instruction directly for its metadata with getMetadata() and getAllMetadata(). This includes a variety of other fixes and improvements: previously all Value*'s were bloated because the HasMetadata bit was thrown into value, adding a 9th bit to a byte. Now this is properly sunk down to the Instruction class (the only place where it makes sense) and it will be folded away somewhere soon. This also fixes some confusion in getMDs and its clients about whether the returned list is indexed by the MDID or densely packed. This is now returned sorted and densely packed and the comments make this clear. This introduces a number of fixme's which I'll follow up on. llvm-svn: 92235	2009-12-28 23:41:32 +00:00
Chris Lattner	17079fc0fa	split code that doesn't need to be templated out of IRBuilder into a new non-templated IRBuilderBase class. Move that large CreateGlobalString out of line, eliminating the need to #include GlobalVariable.h in IRBuilder.h llvm-svn: 92227	2009-12-28 21:28:46 +00:00
Chris Lattner	7093946ab1	rename getMDKind -> getMDKindID, make it autoinsert if an MD Kind doesn't exist already, eliminate registerMDKind. Tidy up a bunch of random stuff. llvm-svn: 92225	2009-12-28 20:45:51 +00:00
Chris Lattner	f8d22fc77d	Metadata.h doesn't need to include ValueHandle.h anymore. llvm-svn: 92211	2009-12-28 08:20:46 +00:00
Chris Lattner	1a32ede6fd	move an optimization for memcmp out of simplifylibcalls and into SDISel. This optimization was causing simplifylibcalls to introduce type-unsafe nastiness. This is the first step, I'll be expanding the memcmp optimizations shortly, covering things that we really really wouldn't want simplifylibcalls to do. llvm-svn: 92098	2009-12-24 00:37:38 +00:00
Chris Lattner	efebb234b7	reorder to follow a normal fall-through style, no functionality change. llvm-svn: 92084	2009-12-23 23:24:51 +00:00
David Greene	2330f78075	Remove dump routine and the associated Debug.h from a header. Patch up other files to compensate. llvm-svn: 92075	2009-12-23 22:58:38 +00:00
Eric Christopher	fdb33458fc	Update objectsize intrinsic and associated dependencies. Fix lowering code and update testcases. llvm-svn: 91979	2009-12-23 02:51:48 +00:00
Chris Lattner	c0f6402a94	Fix the Convert to scalar to not insert dead loads in the store case. The load is needed when we have a small store into a large alloca (at which point we get a load/insert/store sequence), but when you do a full-sized store, this load ends up being dead. This dead load is bad in really large nasty testcases where the load ends up causing mem2reg to insert large chains of dependent phi nodes which only ADCE can delete. Instead of doing this, just don't insert the dead load. This fixes rdar://6864035 llvm-svn: 91917	2009-12-22 19:33:28 +00:00
Chris Lattner	fda3b559e6	fix some fixme's by using twines llvm-svn: 91916	2009-12-22 19:23:33 +00:00
Bob Wilson	62a84ea8e3	Generalize SROA to allow the first index of a GEP to be non-zero. Add a missing check that an array reference doesn't go past the end of the array, and remove some redundant checks for in-bound array and vector references that are no longer needed. llvm-svn: 91897	2009-12-22 06:57:14 +00:00
Chris Lattner	f21a220bcd	Implement PR5795 by merging duplicated return blocks. This could go further by merging all returns in a function into a single one, but simplifycfg currently likes to duplicate the return (an unfortunate choice!) llvm-svn: 91890	2009-12-22 06:07:30 +00:00
Chris Lattner	9b7d99eb76	The phi translated pointer can be computed when returning a partially cached result instead of stored. This reduces memdep memory usage, and also eliminates a bunch of weakvh's. This speeds up gvn on gcc.c-torture/20001226-1.c from 23.9s to 8.45s (2.8x) on a different machine than earlier. llvm-svn: 91885	2009-12-22 04:25:02 +00:00
Eric Christopher	ab6a0d60d5	Whitespace fixes. llvm-svn: 91875	2009-12-22 01:23:51 +00:00
Daniel Dunbar	c661a2d4d8	Add suggested parentheses. llvm-svn: 91853	2009-12-21 23:27:57 +00:00
Chris Lattner	bf20018423	Add a fastpath to Load GVN to special case when we have exactly one dominating load to avoid even messing around with SSAUpdate at all. In this case (which is very common, we can just use the input value directly). This speeds up GVN time on gcc.c-torture/20001226-1.c from 36.4s to 16.3s, which still isn't great, but substantially better and this is a simple speedup that applies to lots of different cases. llvm-svn: 91851	2009-12-21 23:15:48 +00:00
Chris Lattner	927b0ac4b2	refactor some code out to a new helper method. llvm-svn: 91849	2009-12-21 23:04:33 +00:00
Chris Lattner	eaa25da8bb	improve indentation avoid a pointless conversion from weakvh to trackingvh, no functionality change. llvm-svn: 91848	2009-12-21 22:43:03 +00:00
Bob Wilson	88a0598fe8	Remove special-case SROA optimization of variable indexes to one-element and two-element arrays. After restructuring the SROA code, it was not safe to do this without adding more checking. It is not clear that this special-case has really been useful, and removing this simplifies the code quite a bit. llvm-svn: 91828	2009-12-21 18:39:47 +00:00
Chris Lattner	4edfcb88e1	revert r89298, which was committed without a testcase. I think the underlying PHI node insertion issue in SSAUpdate is fixed. llvm-svn: 91821	2009-12-21 07:45:57 +00:00
Chris Lattner	8fb07c5a21	fix PR5837 by having SSAUpdate reuse phi nodes for the 'GetValueInMiddleOfBlock' case, instead of inserting duplicates. A similar fix is almost certainly needed by the machine-level SSAUpdate implementation. llvm-svn: 91820	2009-12-21 07:16:11 +00:00
Chris Lattner	d4fb4296df	give instcombine some helper functions for matching MIN and MAX, and implement some optimizations for MIN(MIN()) and MAX(MAX()) and MIN(MAX()) etc. This substantially improves the code in PR5822 but doesn't kick in much elsewhere. 2 max's were optimized in pairlocalalign and one in smg2000. llvm-svn: 91814	2009-12-21 06:03:05 +00:00
Chris Lattner	ffbd02829c	enhance x-(-A) -> x+A to preserve NUW/NSW. Use the presence of NSW/NUW to fold "icmp (x+cst), x" to a constant in cases where it would otherwise be undefined behavior. Surprisingly (to me at least), this triggers hundreds of the times in a few benchmarks: lencode, ldecode, and 466.h264ref seem to really like this. llvm-svn: 91812	2009-12-21 04:04:05 +00:00
Chris Lattner	900ce231f9	Optimize all cases of "icmp (X+Cst), X" to something simpler. This triggers a bunch in lencode, ldecod, spass, 176.gcc, 252.eon, among others. It is also the first part of PR5822 llvm-svn: 91811	2009-12-21 03:19:28 +00:00
Douglas Gregor	740ab38bb7	Fix a bunch of little errors that Clang complains about when its being pedantic llvm-svn: 91764	2009-12-19 07:05:23 +00:00
Chris Lattner	4ad5eba568	fix PR5827 by disabling the phi slicing transformation in a case where instcombine would have to split a critical edge due to a phi node of an invoke. Since instcombine can't change the CFG, it has to bail out from doing the transformation. llvm-svn: 91763	2009-12-19 07:01:15 +00:00
Bob Wilson	c16811b575	Update my SROA changes in response to review. * change FindElementAndOffset to return a uint64_t instead of unsigned, and to identify the type to be used for that result in a GEP instruction. * move "isa<ConstantInt>" to be first in conditional. * replace some dyn_casts with casts. * add a comment about handling mem intrinsics. llvm-svn: 91762	2009-12-19 06:53:17 +00:00
Bob Wilson	532cd232fb	Reapply 91459 with a simple fix for the problem that broke the x86_64-darwin bootstrap. This also replaces the WeakVH references that Chris objected to with normal Value references. llvm-svn: 91711	2009-12-18 20:14:40 +00:00
Eli Friedman	86b9d75dc8	Optimize icmp of null and select of two constants even if the select has multiple uses. (The construct in question was found in gcc.) llvm-svn: 91675	2009-12-18 08:22:35 +00:00
Dan Gohman	57e808628c	Eliminte unnecessary uses of <cstdio>. llvm-svn: 91666	2009-12-18 03:25:51 +00:00
Dan Gohman	18fa5686f6	Add Loop contains utility methods for testing whether a loop contains another loop, or an instruction. The loop form is substantially more efficient on large loops than the typical code it replaces. llvm-svn: 91654	2009-12-18 01:24:09 +00:00
Dan Gohman	fd7231f1fe	Minor code simplification. llvm-svn: 91653	2009-12-18 01:20:44 +00:00
Dan Gohman	b1924e8a0f	Don't pass const pointers by reference. llvm-svn: 91647	2009-12-18 00:38:08 +00:00
Dan Gohman	1af1954852	Update a comment. llvm-svn: 91645	2009-12-18 00:28:43 +00:00
Dan Gohman	92c3696524	Reapply LoopStrengthReduce and IVUsers cleanups, excluding the part of 91296 that caused trouble -- the Processed list needs to be preserved for the livetime of the pass, as AddUsersIfInteresting is called from other passes. llvm-svn: 91641	2009-12-18 00:06:20 +00:00
Eli Friedman	250b119d98	Allow instcombine to combine "sext(a) >u const" to "a >u trunc(const)". llvm-svn: 91631	2009-12-17 22:42:29 +00:00
Eli Friedman	7cc86b4cc6	Make the ptrtoint comparison simplification work if one side is a global. llvm-svn: 91624	2009-12-17 21:27:47 +00:00
Eli Friedman	5842c9968a	Slightly generalize transformation of memmove(a,a,n) so that it also applies to memcpy. (Such a memcpy is technically illegal, but in practice is safe and is generated by struct self-assignment in C code.) llvm-svn: 91621	2009-12-17 21:07:31 +00:00
Bob Wilson	f3927b7994	Re-revert 91459. It's breaking the x86_64 darwin bootstrap. llvm-svn: 91607	2009-12-17 18:34:24 +00:00
Evan Cheng	090ac0865a	Revert 91280-91283, 91286-91289, 91291, 91293, 91295-91296. It apparently introduced a non-deterministic behavior in the optimizer somewhere. llvm-svn: 91598	2009-12-17 09:39:49 +00:00
Daniel Dunbar	ab42d42390	Reapply r91459, it was only unmasking the bug, and since TOT is still broken having it reverted does no good. llvm-svn: 91559	2009-12-16 20:09:53 +00:00
Daniel Dunbar	133efc317e	Revert "Reapply 91184 with fixes and an addition to the testcase to cover the problem", this broke llvm-gcc bootstrap for release builds on x86_64-apple-darwin10. This reverts commit db22309800b224a9f5f51baf76071d7a93ce59c9. llvm-svn: 91534	2009-12-16 10:56:17 +00:00
Chris Lattner	f278addbdc	reapply my strstr optimization. I have reproduced the x86-64 bootstrap miscompile (i386.o miscompares) but it happens both with and without this patch. llvm-svn: 91532	2009-12-16 09:32:05 +00:00
Chris Lattner	177be32334	revert my strstr optimization, I'm told it breaks x86-64 bootstrap. Will reapply with a fix when I get a chance. llvm-svn: 91486	2009-12-16 00:46:02 +00:00
Bob Wilson	e44756d7c2	Reapply 91184 with fixes and an addition to the testcase to cover the problem found last time. Instead of trying to modify the IR while iterating over it, I've change it to keep a list of WeakVH references to dead instructions, and then delete those instructions later. I also added some special case code to detect and handle the situation when both operands of a memcpy intrinsic are referencing the same alloca. llvm-svn: 91459	2009-12-15 22:00:51 +00:00
Chris Lattner	26ab363361	optimize strstr, PR5783 llvm-svn: 91438	2009-12-15 19:14:40 +00:00
Dan Gohman	265ce318b8	Delete an unused function. llvm-svn: 91432	2009-12-15 16:30:09 +00:00
Chris Lattner	24aba42d04	add some other xforms that should be done as part of PR5783 llvm-svn: 91428	2009-12-15 09:05:13 +00:00
Chris Lattner	45d040bd85	Remove isPod() from DenseMapInfo, splitting it out to its own isPodLike type trait. This is a generally useful type trait for more than just DenseMap, and we really care about whether something acts like a pod, not whether it really is a pod. llvm-svn: 91421	2009-12-15 07:26:43 +00:00
Dan Gohman	fbeec7270c	Fix a thinko; isNotAlreadyContainedIn had a built-in negative, so the condition was inverted when the code was converted to contains(). llvm-svn: 91295	2009-12-14 17:31:01 +00:00
Dan Gohman	416d5b7361	Remove unnecessary #includes. llvm-svn: 91293	2009-12-14 17:19:06 +00:00
Dan Gohman	163fb26927	Instead of having a ScalarEvolution pointer member in BasedUser, just pass the ScalarEvolution pointer into the functions which need it. llvm-svn: 91289	2009-12-14 17:12:51 +00:00
Dan Gohman	8dbd4e3d16	Don't bother cleaning up if there's nothing to clean up. llvm-svn: 91288	2009-12-14 17:10:44 +00:00
Dan Gohman	88c7e61c5b	Delete an unused variable. llvm-svn: 91287	2009-12-14 17:08:09 +00:00
Dan Gohman	838f604543	LSR itself doesn't need LoopInfo. llvm-svn: 91283	2009-12-14 17:02:34 +00:00
Dan Gohman	273e692952	LSR itself doesn't need DominatorTree. llvm-svn: 91282	2009-12-14 16:57:08 +00:00
Dan Gohman	c3513095cf	Remove the code in LSR that manually hoists expansions out of loops; SCEVExpander does this automatically. llvm-svn: 91281	2009-12-14 16:52:55 +00:00
Dan Gohman	ec2a7c58e8	Minor code cleanups. llvm-svn: 91280	2009-12-14 16:37:29 +00:00
Chris Lattner	aaa6ac10a6	revert r91184, because it causes a crash on a .bc file I just sent to Bob. llvm-svn: 91268	2009-12-14 05:11:02 +00:00
Chandler Carruth	dcf5dacb2c	Don't leave pointers uninitialized in the default constructor. GCC complains about the potential use of these uninitialized members under certain conditions. llvm-svn: 91239	2009-12-13 07:04:45 +00:00
Bob Wilson	895f364ae6	Revise scalar replacement to be more flexible about handle bitcasts and GEPs. While scanning through the uses of an alloca, keep track of the current offset relative to the start of the alloca, and check memory references to see if the offset & size correspond to a component within the alloca. This has the nice benefit of unifying much of the code from isSafeUseOfAllocation, isSafeElementUse, and isSafeUseOfBitCastedAllocation. The code to rewrite the uses of a promoted alloca, after it is determined to be safe, is reorganized in the same way. Also, when rewriting GEP instructions, mark them as "in-bounds" since all the indices are known to be safe. llvm-svn: 91184	2009-12-11 23:47:40 +00:00
Eric Christopher	22889c049d	Make sure the immediate dominator isn't NULL through iterations of the loop. We could get to this condition via indirect branches. llvm-svn: 91009	2009-12-10 00:25:41 +00:00
Chris Lattner	9ccc879006	Fix PR5744, a case where we were getting the pointer size instead of the value size. This only manifested when memdep inprecisely returns clobber, which is do to a caching issue in the PR5744 testcase. We can 'efficiently emulate' this by using '-no-aa' llvm-svn: 91004	2009-12-10 00:11:45 +00:00
Chris Lattner	3ddf804f78	allow this to build when the #if 0's are enabled. No functionality change. llvm-svn: 90999	2009-12-10 00:04:46 +00:00
Dan Gohman	72c367fb52	Dereference loopHeader after checking for null rather than before. llvm-svn: 90990	2009-12-09 22:55:01 +00:00
Chris Lattner	ca5f9cb18b	fix hte last remaining known (by me) phi translation bug. When we reanalyze clobbers to forward pieces of large stores to small loads, we need to consider the properly phi translated pointer in the store block. llvm-svn: 90978	2009-12-09 18:21:46 +00:00
Chris Lattner	f8ba1253f1	change GetStoreValueForLoad to use IRBuilder, which is cleaner and implicitly constant folds. llvm-svn: 90977	2009-12-09 18:13:28 +00:00
Bob Wilson	1c5a6fb299	Fix a comment. llvm-svn: 90975	2009-12-09 18:05:27 +00:00
Chris Lattner	07df9efb35	change AnalyzeLoadFromClobberingMemInst/AnalyzeLoadFromClobberingStore to require the load ty/ptr to be passed in, no functionality change. llvm-svn: 90960	2009-12-09 07:37:07 +00:00
Chris Lattner	0def861ee9	change AnalyzeLoadFromClobberingWrite and clients to pass in type and pointer instead of the load. No functionality change. llvm-svn: 90959	2009-12-09 07:34:10 +00:00
Chris Lattner	0c31547168	change NonLocalDepEntry from being a typedef for an std::pair to be its own small class. No functionality change. llvm-svn: 90956	2009-12-09 07:08:01 +00:00
Chris Lattner	946b58dd90	add some aborts to #if 0's. llvm-svn: 90929	2009-12-09 02:41:54 +00:00
Chris Lattner	972e6d8d00	Switch GVN and memdep to use PHITransAddr, which correctly handles phi translation of complex expressions like &A[i+1]. This has the following benefits: 1. The phi translation logic is all contained in its own class with a strong interface and verification that it is self consistent. 2. The logic is more correct than before. Previously, if intermediate expressions got PHI translated, we'd miss the update and scan for the wrong pointers in predecessor blocks. @phi_trans2 is a testcase for this. 3. We have a lot less code in memdep. We can handle phi translation across blocks of things like @phi_trans3, which is pretty insane :). This patch should fix the miscompiles of 255.vortex, and I tested it with a bootstrap of llvm-gcc, llvm-test and dejagnu of course. llvm-svn: 90926	2009-12-09 01:59:31 +00:00
Bob Wilson	c5d082fd5d	Some superficial cleanups. llvm-svn: 90866	2009-12-08 18:27:03 +00:00
Bob Wilson	2029ea04f9	Clean up dead operands left around after SROA replaces a mem intrinsic. I'm not aware that this does anything significant on its own, but it's needed for another patch that I'm working on. llvm-svn: 90864	2009-12-08 18:22:03 +00:00
Duncan Sands	6a3df7b0c7	Teach GlobalOpt to delete aliases with internal linkage (after forwarding any uses). GlobalDCE can also do this, but is only run at -O3. llvm-svn: 90850	2009-12-08 10:10:20 +00:00
Nick Lewycky	8bca014d7f	Remove unnecessary #include "llvm/LLVMContext.h". llvm-svn: 90836	2009-12-08 05:45:41 +00:00
Chris Lattner	6d6f10fe91	fix PR5698 llvm-svn: 90708	2009-12-06 17:17:23 +00:00
Chris Lattner	778cb92235	constant fold loads from memcpy's from global constants. This is important because clang lowers nontrivial automatic struct/array inits to memcpy from a global array. llvm-svn: 90698	2009-12-06 05:29:56 +00:00
Chris Lattner	93236ba327	add support for forwarding mem intrinsic values to non-local loads. llvm-svn: 90697	2009-12-06 04:54:31 +00:00
Chris Lattner	42376066eb	Handle forwarding local memsets to loads. For example, we optimize this: short x(short A) { memset(A, 1, sizeof(A)*100); return A[42]; } to 'return 257' instead of doing the load. llvm-svn: 90695	2009-12-06 01:57:02 +00:00
Nick Lewycky	a0e9d700dc	Generalize this optimization to work on equality comparisons between any two integers that are constant except for a single bit (the same n-th bit in each). llvm-svn: 90646	2009-12-05 05:00:00 +00:00
Bob Wilson	050b812fe7	Fix up some comments. llvm-svn: 90603	2009-12-04 21:57:37 +00:00
Bob Wilson	5ca37b274c	Fix 80-column violations. llvm-svn: 90601	2009-12-04 21:51:35 +00:00
Chris Lattner	2bd9609992	add an assert to make it really clear what this is doing. Return singularval as a compile time perf optimization to avoid a load. llvm-svn: 90507	2009-12-04 01:03:32 +00:00
Bob Wilson	53bdae3802	Fix a comment typo. llvm-svn: 90487	2009-12-03 21:47:07 +00:00
Owen Anderson	0b6e260066	Fix this crasher, and add a FIXME for a missed optimization. llvm-svn: 90408	2009-12-03 03:43:29 +00:00
Chris Lattner	a48f44d9ee	improve portability to avoid conflicting with std::next in c++'0x. Patch by Howard Hinnant! llvm-svn: 90365	2009-12-03 00:50:42 +00:00
Jim Grosbach	d831ef4945	Move EliminateDuplicatePHINodes() from SimplifyCFG.cpp to Local.cpp llvm-svn: 90324	2009-12-02 17:06:45 +00:00
Andreas Neustifter	3d207290fe	Cheap, mostly strict, stable sorting. This is necessary for tests so the results are comparable. llvm-svn: 90320	2009-12-02 15:57:15 +00:00
Owen Anderson	b9878ee6b6	Cleanup/remove some parts of the lifetime region handling code in memdep and GVN, per Chris' comments. Adjust testcases to match. llvm-svn: 90304	2009-12-02 07:35:19 +00:00
Chris Lattner	c468025ac9	factor some code better. llvm-svn: 90299	2009-12-02 06:44:58 +00:00
Chris Lattner	2764b4dc55	formatting cleanups. llvm-svn: 90298	2009-12-02 06:35:55 +00:00
Chris Lattner	eea42c7b51	tidy up, remove dependence on order of evaluation of function args from EmitMemCpy. llvm-svn: 90297	2009-12-02 06:05:42 +00:00
Chris Lattner	3c9aca9079	fix PR5640 by tracking whether a block is the header of a loop more precisely, which prevents us from infinitely peeling the loop. llvm-svn: 90211	2009-12-01 06:04:43 +00:00
Benjamin Kramer	3efc050ac4	Revert r90089 for now, it's breaking selfhost. llvm-svn: 90097	2009-11-29 21:17:48 +00:00
Benjamin Kramer	bfa993ab20	Fix two FIXMEs. llvm-svn: 90089	2009-11-29 20:29:30 +00:00
Chris Lattner	1cc4cca193	add testcases for the foo_with_overflow op xforms added recently and fix bugs exposed by the tests. Testcases from Alastair Lynn! llvm-svn: 90056	2009-11-29 02:57:29 +00:00
Chris Lattner	cd261c9c26	Implement PR5634. llvm-svn: 90046	2009-11-29 00:51:17 +00:00
Chris Lattner	32140312ca	reenable load address insertion in load pre. This allows us to handle cases like this: void test(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } where G[1] isn't live into the loop. llvm-svn: 90041	2009-11-28 16:08:18 +00:00
Chris Lattner	44da5bd837	Enhance InsertPHITranslatedPointer to be able to return a list of newly inserted instructions. No functionality change until someone starts using it. llvm-svn: 90039	2009-11-28 15:39:14 +00:00
Chris Lattner	cf0b198827	disable value insertion for now, I need to figure out how to inform GVN about the newly inserted values. This fixes PR5631. llvm-svn: 90022	2009-11-27 22:50:07 +00:00
Chris Lattner	2be52e72ae	Rework InsertPHITranslatedPointer to handle the recursive case, this fixes PR5630 and sets the stage for the next phase of goodness (testcase pending). llvm-svn: 90019	2009-11-27 22:05:15 +00:00
Chris Lattner	3d9823b9cf	factor some logic out of instcombine into a new SimplifyAddInst method. llvm-svn: 90011	2009-11-27 17:42:22 +00:00
Chris Lattner	2226db66ab	fix PR5436 by making the 'simple' case of SRoA not promote out of range array indexes. The "complex" case of SRoA still handles them, and correctly. This fixes a weirdness where we'd correctly avoid transforming A[0][42] if the 42 was too large, but we'd only do it if it was one gep, not two separate ones. llvm-svn: 90007	2009-11-27 16:37:41 +00:00
Chris Lattner	25be93dfed	teach GVN's load PRE to insert computations of the address in predecessors where it is not available. It's unclear how to get this inserted computation into GVN's scalar availability sets, Owen, help? :) llvm-svn: 89997	2009-11-27 08:25:10 +00:00
Chris Lattner	a9a76ccf56	Fix phi translation in load PRE to agree with the phi translation done by memdep, and reenable gep translation again. llvm-svn: 89992	2009-11-27 06:31:14 +00:00
Chris Lattner	8574aba4ea	factor some instcombine simplifications for getelementptr out to a new SimplifyGEPInst method in InstructionSimplify.h. No functionality change. llvm-svn: 89980	2009-11-27 00:29:05 +00:00
Chris Lattner	a5bc618a91	fix crash on Transforms/InstCombine/intrinsics.ll introduced by r89970 llvm-svn: 89972	2009-11-26 22:08:06 +00:00
Chris Lattner	a73ecf0b00	Fix PR5471 by removing an instcombine xform. Some pieces of the code generates store to undef and some generates store to null as the idiom for undefined behavior. Since simplifycfg zaps both, don't remove the undefined behavior in instcombine. llvm-svn: 89971	2009-11-26 22:04:42 +00:00
Chris Lattner	5b83ba215d	implement a bunch of xforms for overflow intrinsics, based on a patch by Alastair Lynn. llvm-svn: 89970	2009-11-26 21:42:47 +00:00
Edward O'Callaghan	2b8fed15e0	Reverting patch in revision 89758, initial attempt at fixing PR5373 has proven to be bogus. llvm-svn: 89844	2009-11-25 05:38:41 +00:00
Edward O'Callaghan	5fd452d596	Fix for PR5373, Credit to Jakub Staszak. llvm-svn: 89758	2009-11-24 11:51:52 +00:00
Dan Gohman	580b80d6d9	Make ConstantFoldConstantExpression recursively visit the entire ConstantExpr, not just the top-level operator. This allows it to fold many more constants. Also, make GlobalOpt call ConstantFoldConstantExpression on GlobalVariable initializers. llvm-svn: 89659	2009-11-23 16:22:21 +00:00
Dan Gohman	1f522d98f8	Fix a use of an invalidated iterator in the case where there are multiple adjacent uses of a dead basic block from the same user. This fixes PR5596. llvm-svn: 89658	2009-11-23 16:13:39 +00:00
Nick Lewycky	15a1287c1f	Pull LLVMContext out of PromoteMemToReg. llvm-svn: 89645	2009-11-23 03:50:44 +00:00
Nick Lewycky	621fe5614e	Remove LLVMContext and its include. llvm-svn: 89644	2009-11-23 03:34:29 +00:00
Nick Lewycky	39dbfd3c58	Remove unused LLVMContext. llvm-svn: 89642	2009-11-23 03:29:18 +00:00
Nick Lewycky	922d4ab574	Reapply r88830 with a bugfix: this transform only applies to icmp eq/ne. This fixes part of PR5438. llvm-svn: 89639	2009-11-23 03:17:33 +00:00
Eric Christopher	0c7bd96de2	Add more optimizations for object size checking, enable handling of object size intrinsic and verify return type is correct. Collect various code in one place. llvm-svn: 89523	2009-11-21 01:01:30 +00:00
Dan Gohman	fbffe63528	Make Loop::getLoopLatch() work on loops which don't have preheaders, as it may be used in contexts where preheader insertion may have failed due to an indirectbr. Make LoopSimplify's LoopSimplify::SeparateNestedLoop properly fail in the case that it would require splitting an indirectbr edge. These fix PR5502. llvm-svn: 89484	2009-11-20 20:51:18 +00:00
Dan Gohman	d15302afa0	Fix IPSCCP's code for deleting dead blocks to tolerate outstanding blockaddress users. This fixes PR5569. llvm-svn: 89483	2009-11-20 20:19:14 +00:00
Daniel Dunbar	f87c75706f	Revert "Add some rough optimizations for checking routines.", it buildeth not. llvm-svn: 89482	2009-11-20 20:17:30 +00:00
Eric Christopher	cf97d01dff	Add some rough optimizations for checking routines. llvm-svn: 89479	2009-11-20 19:57:37 +00:00
Duncan Sands	9e26aac773	Fix PR5563, an expensive checks failure when running on tests/Transforms/InstCombine/shufflemask-undef.ll. If anyone cares, the use of 2*e here (and the equivalent all over the place in instcombine) seems wrong, though harmless: it should really be twice the length of the input vector. I think shufflevector used to require that the mask have the same length as the input, but I don't think that's true any more. I don't care enough about vectors to do anything about this... llvm-svn: 89456	2009-11-20 13:19:51 +00:00
Dan Gohman	94e617627d	Extend CaptureTracking to indicate when a value is never stored, even if it is not ultimately captured. Teach BasicAliasAnalysis that a local object address which does not escape and is never stored does not alias with a value resulting from a load. llvm-svn: 89398	2009-11-19 21:57:48 +00:00
Dan Gohman	cbc6ebb6fd	Enable hoisting of loads from constant memory by default. In cases where they are lowered to instruction sequences more complex than a simple load, such that CodeGen cannot rematerialize them, a reload from a spill slot is likely to be cheaper than the complex sequence. llvm-svn: 89374	2009-11-19 19:00:10 +00:00
Jim Grosbach	dcef55b2ef	Eliminate duplicate phi nodes in loops. Loop rotation, for example, can introduce these, and it's beneficial to later passes to clean them up. llvm-svn: 89298	2009-11-19 02:03:18 +00:00
Jim Grosbach	cc69a1ba9a	Make EliminateDuplicatePHINodes() available as a utility function llvm-svn: 89297	2009-11-19 02:02:10 +00:00
Jim Grosbach	6bf5305f5d	grammar llvm-svn: 89145	2009-11-17 21:37:04 +00:00
Jim Grosbach	e4e018ae67	80-column violations llvm-svn: 89123	2009-11-17 19:05:35 +00:00
Evan Cheng	ba4e5da727	Generalize OptimizeLoopTermCond to optimize more loop terminating icmp to use postinc iv. llvm-svn: 89116	2009-11-17 18:10:11 +00:00
Jim Grosbach	60f4854c76	Remove trailing whitespace llvm-svn: 89110	2009-11-17 17:53:56 +00:00
Devang Patel	12144a2348	Remove debug info attached with an instruction. llvm-svn: 89016	2009-11-17 00:47:06 +00:00
David Greene	a3ce7828b2	Fix an expensive-checks error. The Mask and LHSMask may not be of the same size, so don't do the transformation if they're different. llvm-svn: 88972	2009-11-16 21:52:23 +00:00
Duncan Sands	e5de4a9ad6	CreateIntCast takes an "isSigned" parameter. Pass "true" for it, rather than a name. llvm-svn: 88908	2009-11-16 12:32:28 +00:00
Chris Lattner	9d9812a636	make PRE of loads preserve the alignment of the moved load instruction. llvm-svn: 88865	2009-11-15 19:58:31 +00:00
Chris Lattner	5f037b6439	fix a bug handling 'not x' when x is undef. llvm-svn: 88864	2009-11-15 19:57:43 +00:00
Nick Lewycky	95148689c9	Revert r88830 and r88831 which appear to have caused a selfhost buildbot some grief. I suspect this patch merely exposed a bug else. llvm-svn: 88841	2009-11-15 07:47:32 +00:00
Nick Lewycky	e29fa4c7a1	Teach instcombine to look for booleans in wider integers when it encounters a zext(icmp). It may be able to optimize that away. This fixes one of the cases in PR5438. llvm-svn: 88830	2009-11-15 05:55:17 +00:00
Nick Lewycky	7935bcb0fe	Remove LLVMContext from reassociate. It was threaded through every function but ultimately never used. llvm-svn: 88763	2009-11-14 07:25:54 +00:00
Dan Gohman	81132465d3	Add an option for running GVN with redundant load processing disabled. llvm-svn: 88742	2009-11-14 02:27:51 +00:00
Owen Anderson	e96b2111b1	Re-enable this code, since redundant PHIs are now being better nuked. llvm-svn: 87042	2009-11-12 23:22:41 +00:00
Chris Lattner	5c89f4b4ef	use isInstructionTriviallyDead, as pointed out by Duncan llvm-svn: 87035	2009-11-12 21:58:18 +00:00
Chris Lattner	eb9acbfb05	implement a nice little efficiency hack in the inliner. Since we're now running IPSCCP early, and we run functionattrs interlaced with the inliner, we often (particularly for small or noop functions) completely propagate all of the information about a call to its call site in IPSSCP (making a call dead) and functionattrs is smart enough to realize that the function is readonly (because it is interlaced with inliner). To improve compile time and make the inliner threshold more accurate, realize that we don't have to inline dead readonly function calls. Instead, just delete the call. This happens all the time for C++ codes, here are some counters from opt/llvm-ld counting the number of times calls were deleted vs inlined on various apps: Tramp3d opt: 5033 inline - Number of call sites deleted, not inlined 24596 inline - Number of functions inlined llvm-ld: 667 inline - Number of functions deleted because all callers found 699 inline - Number of functions inlined 483.xalancbmk opt: 8096 inline - Number of call sites deleted, not inlined 62528 inline - Number of functions inlined llvm-ld: 217 inline - Number of allocas merged together 2158 inline - Number of functions inlined 471.omnetpp: 331 inline - Number of call sites deleted, not inlined 8981 inline - Number of functions inlined llvm-ld: 171 inline - Number of functions deleted because all callers found 629 inline - Number of functions inlined Deleting a call is much faster than inlining it, and is insensitive to the size of the callee. :) llvm-svn: 86975	2009-11-12 07:56:08 +00:00
Evan Cheng	85a9f430e9	- Teach LSR to avoid changing cmp iv stride if it will create an immediate that cannot be folded into target cmp instruction. - Avoid a phase ordering issue where early cmp optimization would prevent the later count-to-zero optimization. - Add missing checks which could cause LSR to reuse stride that does not have users. - Fix a bug in count-to-zero optimization code which failed to find the pre-inc iv's phi node. - Remove, tighten, loosen some incorrect checks disable valid transformations. - Quite a bit of code clean up. llvm-svn: 86969	2009-11-12 07:35:05 +00:00
Chris Lattner	5f6b8b2bcb	use getPredicateOnEdge to fold comparisons through PHI nodes, which implements GCC PR18046. This also gets us 360 more jump threads on 176.gcc. llvm-svn: 86953	2009-11-12 05:24:05 +00:00
Chris Lattner	22db4b5e0c	various fixes to the lattice transfer functions. llvm-svn: 86952	2009-11-12 04:57:13 +00:00
Chris Lattner	c893c4ed10	switch jump threading to use getPredicateOnEdge in one place making the new LVI stuff smart enough to subsume some special cases in the old code. Disable them when LVI is around, the testcase still passes. llvm-svn: 86951	2009-11-12 04:37:50 +00:00
Daniel Dunbar	11881e2283	Add the braces gcc suggested. llvm-svn: 86933	2009-11-12 02:52:56 +00:00
Chris Lattner	ba45616958	with the new code we can thread non-instruction values. This allows us to handle the test10 testcase. llvm-svn: 86924	2009-11-12 01:41:34 +00:00
Chris Lattner	3f80d85191	this argument can be an arbitrary value, it doesn't need to be an instruction. llvm-svn: 86923	2009-11-12 01:37:43 +00:00
Chris Lattner	d5e25436a1	expose edge information and switch j-t to use it. llvm-svn: 86920	2009-11-12 01:29:10 +00:00
Chris Lattner	67146695b6	pass TD into a SimplifyCmpInst call. Add another case that uses LVI info when -enable-jump-threading-lvi is passed. llvm-svn: 86886	2009-11-11 22:31:38 +00:00
Duncan Sands	ba61fed5d3	Don't trivially delete unused calls to llvm.invariant.start. This allows llvm.invariant.start to be used without necessarily being paired with a call to llvm.invariant.end. If you run the entire optimization pipeline then such calls are in fact deleted (adce does it), but that's actually a good thing since we probably do want them to be zapped late in the game. There should really be an integration test that checks that the llvm.invariant.start call lasts long enough that all passes that do interesting things with it get to do their stuff before it is deleted. But since no passes do anything interesting with it yet this will have to wait for later. llvm-svn: 86840	2009-11-11 15:34:13 +00:00
Chris Lattner	852f2653c4	remove the now dead condprop pass, PR3906. llvm-svn: 86810	2009-11-11 05:56:35 +00:00
Chris Lattner	fde1f8d0d8	stub out some LazyValueInfo interfaces, and have JumpThreading start using them in a trivial way when -enable-jump-threading-lvi is passed. enable-jump-threading-lvi will be my playground for awhile. llvm-svn: 86789	2009-11-11 02:08:33 +00:00
Chris Lattner	3a2ae908fe	add a fixme llvm-svn: 86766	2009-11-11 00:21:58 +00:00
Evan Cheng	12f146d8f7	Block terminator may be a switch. llvm-svn: 86761	2009-11-11 00:00:21 +00:00
Devang Patel	f6eeaebd76	Implement support to debug inlined functions. llvm-svn: 86748	2009-11-10 23:06:00 +00:00
Chris Lattner	9518fbb54e	implement a TODO by teaching jump threading about "xor x, 1". llvm-svn: 86739	2009-11-10 22:39:16 +00:00
Chris Lattner	852d6d64ff	move some generally useful functions out of jump threading into libanalysis and transformutils. llvm-svn: 86735	2009-11-10 22:26:15 +00:00
Chris Lattner	02e2cee7dc	fix a crash in SCCP handling extractvalue of an array, pointed out and tracked down by Stephan Reiter! llvm-svn: 86726	2009-11-10 22:02:09 +00:00
Chris Lattner	40b15f220d	improve comment. llvm-svn: 86723	2009-11-10 21:45:09 +00:00
Chris Lattner	80e7e5a429	Make jump threading eliminate blocks that just contain phi nodes, debug intrinsics, and an unconditional branch when possible. This reuses the TryToSimplifyUncondBranchFromEmptyBlock function split out of simplifycfg. llvm-svn: 86722	2009-11-10 21:40:01 +00:00
Evan Cheng	87fe40b32d	Generalize lsr code that optimize loop to count down towards zero. llvm-svn: 86715	2009-11-10 21:14:05 +00:00
Duncan Sands	23344095de	Add defensive break. llvm-svn: 86705	2009-11-10 19:36:40 +00:00
Duncan Sands	8d4cde2b55	Fix obvious typo. llvm-svn: 86694	2009-11-10 18:21:37 +00:00
Chris Lattner	b8f79ba10e	clarify logic. llvm-svn: 86689	2009-11-10 17:00:47 +00:00
Duncan Sands	1925d3a1d1	Teach DSE to eliminate useless trampolines. llvm-svn: 86683	2009-11-10 13:49:50 +00:00
Duncan Sands	04e0c95248	Add brackets to make gcc-4.4 happy. llvm-svn: 86681	2009-11-10 09:32:10 +00:00
Victor Hernandez	fcc77b1c02	Update computeArraySize() to use ComputeMultiple() to determine the array size associated with a malloc; also extend PerformHeapAllocSRoA() to check if the optimized malloc's arg had its highest bit set, so that it is safe for ComputeMultiple() to look through sext instructions while determining the optimized malloc's array size llvm-svn: 86676	2009-11-10 08:32:25 +00:00
Chris Lattner	1559bedcc7	unify the code that determines whether it is a good idea to change the type of a computation. This fixes some infinite loops when dealing with TD that has no native types. llvm-svn: 86670	2009-11-10 07:23:37 +00:00
Nick Lewycky	5b3def9b86	Simplify. llvm-svn: 86668	2009-11-10 07:00:43 +00:00
Nick Lewycky	9027147fb1	Reapply r86359, "Teach dead store elimination that certain intrinsics write to memory just like a store" with bug fixed (partial-overwrite.ll is the regression test). llvm-svn: 86667	2009-11-10 06:46:40 +00:00
Chris Lattner	cbd18fc93d	refactor TryToSimplifyUncondBranchFromEmptyBlock out of SimplifyCFG. llvm-svn: 86666	2009-11-10 05:59:26 +00:00
Oscar Fuentes	bbc1067001	CMake: Support for building llvm loadable modules. llvm-svn: 86656	2009-11-10 02:45:37 +00:00
Chris Lattner	38c44ea6b0	make jump threading recursively simplify expressions instead of doing it just one level deep. On the testcase we go from getting this: F1: ; preds = %T2 %F = and i1 true, %cond ; <i1> [#uses=1] br i1 %F, label %X, label %Y to a fully threaded: F1: ; preds = %T2 br label %Y This changes gets us to the point where we're forming (too many) switch instructions on doug's strswitch testcase. llvm-svn: 86646	2009-11-10 01:57:31 +00:00
Chris Lattner	be11db6894	don't invalidate PN, rewrite of this code is in progress anyway. llvm-svn: 86639	2009-11-10 01:19:06 +00:00
Chris Lattner	fb7f87d5a3	add a new SimplifyInstruction API, which is like ConstantFoldInstruction, except that the result may not be a constant. Switch jump threading to use it so that it gets things like (X & 0) -> 0, which occur when phi preds are deleted and the remaining phi pred was a zero. llvm-svn: 86637	2009-11-10 01:08:51 +00:00
Jeffrey Yasskin	b40d3f76a0	Fix DenseMap iterator constness. This patch forbids implicit conversion of DenseMap::const_iterator to DenseMap::iterator which was possible because DenseMapIterator inherited (publicly) from DenseMapConstIterator. Conversion the other way around is now allowed as one may expect. The template DenseMapConstIterator is removed and the template parameter IsConst which specifies whether the iterator is constant is added to DenseMapIterator. Actually IsConst parameter is not necessary since the constness can be determined from KeyT but this is not relevant to the fix and can be addressed later. Patch by Victor Zverovich! llvm-svn: 86636	2009-11-10 01:02:17 +00:00
Chris Lattner	a71e9d61be	factor simplification logic for AND and OR out to InstSimplify from instcombine. llvm-svn: 86635	2009-11-10 00:55:12 +00:00
Chris Lattner	ccfdceb22c	pull a bunch of logic out of instcombine into instsimplify for compare simplification, this handles the foldable fcmp x,x cases among many others. llvm-svn: 86627	2009-11-09 23:55:12 +00:00
Chris Lattner	beadc6e8c7	inline a simple function. llvm-svn: 86625	2009-11-09 23:31:49 +00:00
Chris Lattner	c1f19071f8	rename SimplifyCompare -> SimplifyCmpInst and split it into Simplify[IF]Cmp pieces. Add some predicates to CmpInst to determine whether a predicate is fp or int. llvm-svn: 86624	2009-11-09 23:28:39 +00:00
Chris Lattner	cdfb80de16	fix ConstantFoldCompareInstOperands to take the LHS/RHS as individual operands instead of taking a temporary array llvm-svn: 86619	2009-11-09 23:06:58 +00:00
Chris Lattner	800aad3dda	use instructionsimplify instead of a weak clone of ad-hoc folding stuff. llvm-svn: 86616	2009-11-09 23:00:14 +00:00
Chris Lattner	2978ca7b79	stub out a new form of BasicBlock::RemovePredecessorAndSimplify which simplifies instruction users of PHIs when the phi is eliminated. This will be moved to transforms/utils after some other refactoring. llvm-svn: 86603	2009-11-09 22:32:36 +00:00
Dan Gohman	f324dd65f8	Fix a comment in a typo that Duncan noticed. llvm-svn: 86575	2009-11-09 18:59:22 +00:00
Dan Gohman	c146c78060	Generalize LCSSA to handle loops with exits with predecessors outside the loop. This is needed because with indirectbr it may not be possible for LoopSimplify to guarantee that all loop exit predecessors are inside the loop. This fixes PR5437. LCCSA no longer actually requires LoopSimplify form, but for now it must still have the dependency because the PassManager doesn't know how to schedule LoopSimplify otherwise. llvm-svn: 86569	2009-11-09 18:28:24 +00:00
Chris Lattner	39c07b2eef	if a 'with overflow' intrinsic just has the normal result used, simplify it to a normal binop. Patch by Alastair Lynn, testcase by me. llvm-svn: 86524	2009-11-09 07:07:56 +00:00
Chris Lattner	feeabde753	fix PR5104: when printing a single character, return the result of putchar in case there is an error. llvm-svn: 86515	2009-11-09 04:57:04 +00:00
Chris Lattner	0685be3441	enhance PHI slicing to handle the case when a slicable PHI is begin used by a chain of other PHIs. llvm-svn: 86503	2009-11-09 01:38:00 +00:00
Owen Anderson	939ea35244	Small cleanups. llvm-svn: 86499	2009-11-09 00:48:15 +00:00
Owen Anderson	73fc616838	Revert my previous patch to ABCD and fix things the right way. There are two problems addressed here: 1) We need to avoid processing sigma nodes as phi nodes for constraint generation. 2) We need to generate constraints for comparisons against constants properly. This includes our first working ABCD test! llvm-svn: 86498	2009-11-09 00:44:44 +00:00
Chris Lattner	ea465e221e	comment typos pointed out by Duncan llvm-svn: 86497	2009-11-09 00:41:49 +00:00
Owen Anderson	058088f219	Fix an issue where the ordering of blocks within a function could lead to different constraint graphs being produced. The cause was that we were incorrectly marking sigma instructions as processed after handling the sigma-specific constraints for them, potentially neglecting to process them as normal instructions as well. Unfortunately, the testcase that inspired this still doesn't work because of a bug in the solver, which is next on the list to debug. llvm-svn: 86486	2009-11-08 22:36:55 +00:00
Chris Lattner	2299d4b6d8	Teach an instcombine to not pull trunc instructions through PHI nodes when both the source and dest are illegal types, since it would cause the phi to grow (for example, we shouldn't transform test14b's phi to a phi on i320). This fixes an infinite loop on i686 bootstrap with phi slicing turned on, so turn it back on. llvm-svn: 86483	2009-11-08 21:20:06 +00:00
Chris Lattner	a837e4db6b	reapply r8644[3-5] with only the scary part (SliceUpIllegalIntegerPHI) disabled. llvm-svn: 86480	2009-11-08 19:23:30 +00:00
Daniel Dunbar	4c41373c56	Speculatively revert r8644[3-5], they seem to be leading to infinite loops in llvm-gcc bootstrap. llvm-svn: 86478	2009-11-08 17:52:47 +00:00
Chris Lattner	c7a450b5b2	teach a couple of instcombine transformations involving PHIs to not turn a PHI in a legal type into a PHI of an illegal type, and add a new optimization that breaks up insane integer PHI nodes into small pieces (PR3451). llvm-svn: 86443	2009-11-08 08:21:13 +00:00
Nick Lewycky	b9397262b7	Improve tail call elimination to handle the switch statement. llvm-svn: 86403	2009-11-07 21:10:15 +00:00
Chris Lattner	c77d24b792	make instcombine only rewrite a chain of computation (eliminating some extends) if the new type of the computation is legal or if both the source and dest are illegal. This prevents instcombine from changing big chains of computation into i64 on 32-bit targets for example. llvm-svn: 86398	2009-11-07 19:11:46 +00:00
Chris Lattner	431000da21	Revert r86359, it is breaking the self host on the llvm-gcc-i386-darwin9 build bot. llvm-svn: 86391	2009-11-07 17:59:32 +00:00
Nick Lewycky	b6a3dd48f4	Teach dead store elimination that certain intrinsics write to memory just like a store. llvm-svn: 86359	2009-11-07 08:34:40 +00:00
Chris Lattner	5ff7f5672e	reapply 86289, 86278, 86270, 86267, 86266 & 86264 plus a fix (making pred factoring only happen if threading is guaranteed to be successful). This now survives an X86-64 bootstrap of llvm-gcc. llvm-svn: 86355	2009-11-07 08:05:03 +00:00
Nick Lewycky	9b669b3c4f	Oops, FunctionContainsEscapingAllocas is really used to mean two different things. Back out part of r86349 for a moment. llvm-svn: 86353	2009-11-07 07:42:38 +00:00
Nick Lewycky	5091272fdf	Dust off tail recursion elimination. Fix a fixme by applying CaptureTracking and add a .ll to demo the new capability. llvm-svn: 86349	2009-11-07 07:10:01 +00:00
Devang Patel	3a42e7ac65	Revert following patches to fix llvmgcc bootstrap. 86289, 86278, 86270, 86267, 86266 & 86264 Chris, please take a look. llvm-svn: 86321	2009-11-07 01:32:59 +00:00
Victor Hernandez	bde558c536	- new SROA mallocs should have the mallocs running-or'ed, not the malloc's bitcast - fix ProcessInternalGlobal() debug output llvm-svn: 86317	2009-11-07 00:41:19 +00:00
Jeffrey Yasskin	8f77e948e5	Avoid "ambiguous 'else'" warning from gcc. llvm-svn: 86314	2009-11-07 00:26:47 +00:00
Victor Hernandez	f3db915294	Re-commit r86077 now that r86290 fixes the 179.art and 175.vpr ARM regressions. Here is the original commit message: This commit updates malloc optimizations to operate on malloc calls that have constant int size arguments. Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86311	2009-11-07 00:16:28 +00:00
Chris Lattner	eb690feaef	Fix a bug where we'd call SplitBlockPredecessors with a pred in the set only once even if it has multiple edges to BB. llvm-svn: 86299	2009-11-06 23:19:58 +00:00
Eli Friedman	a70917b2f4	Remove function left over from other jump threading cleanup. llvm-svn: 86289	2009-11-06 21:24:57 +00:00
Chris Lattner	a8b9ce3f07	Fix a problem discovered on self host. llvm-svn: 86278	2009-11-06 19:21:48 +00:00
Chris Lattner	d91a7960bf	remove more code subsumed by r86264 llvm-svn: 86270	2009-11-06 18:24:32 +00:00
Chris Lattner	899ef22acb	eliminate some more code subsumed by r86264 llvm-svn: 86267	2009-11-06 18:22:54 +00:00
Chris Lattner	2f6184f6aa	remove now redundant code, r86264 handles this case. llvm-svn: 86266	2009-11-06 18:20:58 +00:00
Chris Lattner	68d2417e05	Extend jump threading to support much more general threading predicates. This allows us to jump thread things like: _ZN12StringSwitchI5ColorE4CaseILj7EEERS1_RAT__KcRKS0_.exit119: %tmp1.i24166 = phi i8 [ 1, %bb5.i117 ], [ %tmp1.i24165, %_Z....exit ], [ %tmp1.i24165, %bb4.i114 ] %toBoolnot.i87 = icmp eq i8 %tmp1.i24166, 0 ; <i1> [#uses=1] %tmp4.i90 = icmp eq i32 %tmp2.i, 6 ; <i1> [#uses=1] %or.cond173 = and i1 %toBoolnot.i87, %tmp4.i90 ; <i1> [#uses=1] br i1 %or.cond173, label %bb4.i96, label %_ZN12... Where it is "obvious" that when coming from %bb5.i117 that the 'and' is always false. This triggers a surprisingly high number of times in the testsuite, and gets us closer to generating good code for doug's strswitch testcase. This also make a bunch of other code in jump threading redundant, I'll rip out in the next patch. This survived an enable-checking llvm-gcc bootstrap. llvm-svn: 86264	2009-11-06 18:15:14 +00:00
Chris Lattner	8c12bb8cd7	remove some more Context arguments. llvm-svn: 86235	2009-11-06 05:59:53 +00:00
Chris Lattner	46b5c642b9	remove a bunch of extraneous LLVMContext arguments from various APIs, addressing PR5325. llvm-svn: 86231	2009-11-06 04:27:31 +00:00
Victor Hernandez	b9f5899779	Revert r86077 because it caused crashes in 179.art and 175.vpr on ARM llvm-svn: 86213	2009-11-06 01:33:24 +00:00
Dan Gohman	a1bf0c0acc	Teach LSR to avoid calling SplitCriticalEdge on edges with indirectbr. llvm-svn: 86193	2009-11-05 23:34:59 +00:00
Dan Gohman	928068a886	Avoid calling getUniqueExitBlocks from within LoopSimplify, as it depends on loops having dedicated exits, which LoopSimplify can no longer always guarantee. llvm-svn: 86181	2009-11-05 21:48:32 +00:00
Dan Gohman	dca7ac335b	LoopDeletion depends on loops having dedicated exits. llvm-svn: 86180	2009-11-05 21:47:04 +00:00
Dan Gohman	1ef784db67	The introduction of indirectbr meant the introduction of unsplittable critical edges, which means the introduction of loops which cannot be transformed to LoopSimplify form. Fix LoopSimplify to avoid transforming such loops into invalid code. llvm-svn: 86176	2009-11-05 21:14:46 +00:00
Dan Gohman	a83ac2d9e7	Update various Loop optimization passes to cope with the possibility that LoopSimplify form may not be available. llvm-svn: 86175	2009-11-05 21:11:53 +00:00
Dan Gohman	415c64ea3f	Teach LoopUnroll how to bail if LoopSimplify can't give it what it needs. llvm-svn: 86164	2009-11-05 19:44:06 +00:00
Dan Gohman	d9fa1c9c1e	Call getAnalysis<LoopInfo> the normal way, instead of asking passed-in LoopPassManager for it. llvm-svn: 86163	2009-11-05 19:43:25 +00:00
Dan Gohman	885c46e387	Delete an unused member variable. llvm-svn: 86160	2009-11-05 19:33:15 +00:00
Dan Gohman	00c793822e	Add an assertion to catch indirectbr in SplitBlockPredecessors. This makes several optimization passes abort in cases where they're currently silently miscompiling code. Remove the indirectbr assertion from SplitEdge. Indirectbr is only a problem for critical edges, and SplitEdge defers to SplitCriticalEdge to handle those, and SplitCriticalEdge has its own assertion for indirectbr. llvm-svn: 86147	2009-11-05 18:25:44 +00:00
Benjamin Kramer	b971445ab7	Teach SimplifyLibCalls to fold memcmp calls with constant arguments. llvm-svn: 86141	2009-11-05 17:44:22 +00:00
Benjamin Kramer	3fcbb82151	Do map insert+find in one step. TODO -= 2. llvm-svn: 86133	2009-11-05 14:33:27 +00:00
Victor Hernandez	492ed30a32	Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86077	2009-11-05 00:03:03 +00:00
Chris Lattner	a09062758b	improve DSE when TargetData is not around, based on work by Hans Wennborg! llvm-svn: 86067	2009-11-04 23:20:12 +00:00
Chris Lattner	762b56fa8c	Fix an iterator invalidation bug that happens when a hashtable resizes in IPSCCP. This fixes PR5394. llvm-svn: 86036	2009-11-04 18:57:42 +00:00
Chris Lattner	cb3c64ee3c	move two functions up higher in the file. Delete a useless argument to EmitGEPOffset. Implement some new transforms for optimizing subtracts of two pointer to ints into the same vector. This happens for C++ iterator idioms for example, stringmap takes a const char* that points to the start and end of a string. Once inlined, we want the pointer difference to turn back into a length. This is rdar://7362831. llvm-svn: 86021	2009-11-04 08:05:20 +00:00
Chris Lattner	156b8c7109	reimplement multiple return value handling in IPSCCP, making it more aggressive an correct. This survives building llvm in 64-bit mode with optimizations and the built llvm passes make check. llvm-svn: 85973	2009-11-03 23:40:48 +00:00
Chris Lattner	2c427233d4	finish half thunk thought llvm-svn: 85937	2009-11-03 20:52:57 +00:00
Chris Lattner	cde8de519d	fix an IPSCCP bug I introduced when I changed IPSCCP to start working on functions that don't have local linkage. Basically, we need to be more careful about propagating argument information to functions whose results we aren't tracking. This fixes a miscompilation of LLVMCConfigurationEmitter.cpp when built with an llvm-gcc that has ipsccp enabled. llvm-svn: 85923	2009-11-03 19:24:51 +00:00
Chris Lattner	e1d5cd9f48	fix a subtle bug I introduced when refactoring SCCP. Testcase to follow. llvm-svn: 85903	2009-11-03 16:50:11 +00:00
Benjamin Kramer	5573971453	Eliminate some temporaries. llvm-svn: 85896	2009-11-03 12:52:50 +00:00
Chris Lattner	5a3832496a	remove a isFreeCall check: it is a callinst that can write to memory already. llvm-svn: 85863	2009-11-03 05:33:46 +00:00
Ted Kremenek	2124f0d43f	Alphabetize. llvm-svn: 85859	2009-11-03 04:01:53 +00:00
Chris Lattner	fb14181b18	turn IPSCCP back on now that the iterator invalidation bug is fixed. llvm-svn: 85858	2009-11-03 03:42:51 +00:00
Chris Lattner	b70ef3c8c7	fix a nasty iterator invalidation bug from my conversion from std::map to DenseMap, exposed on release llvm-gcc bootstrap. llvm-svn: 85840	2009-11-02 23:25:39 +00:00
Chris Lattner	a15cc59dcb	revert r8579[56], which are causing unhappiness in buildbot land. llvm-svn: 85818	2009-11-02 19:31:10 +00:00
Chris Lattner	a3d794ebbb	disable IPSCCP support for multiple return values, it is buggy, so just disable it until I can fix it. llvm-svn: 85810	2009-11-02 18:22:51 +00:00
Chris Lattner	9d49f0c858	improve IPSCCP to be able to propagate the result of "!mayBeOverridden" function to calls of that function, regardless of whether it has local linkage or has its address taken. Not escaping should only affect whether we make an aggressive assumption about the arguments to a function, not whether we can track the result of it. llvm-svn: 85795	2009-11-02 07:33:59 +00:00
Chris Lattner	47837c5182	don't mark the arguments of prototype overdefined, they will never be queried. llvm-svn: 85793	2009-11-02 06:34:04 +00:00
Chris Lattner	5503328332	restore some code I removed in r85788, refactor it into a shared place instead of duplicating it 4 times. llvm-svn: 85792	2009-11-02 06:28:16 +00:00
Chris Lattner	4910b656b2	remove some confused code that dates from when we had "multiple return values" but not "first class aggregates" llvm-svn: 85791	2009-11-02 06:17:06 +00:00
Chris Lattner	809aee2f40	avoid redundant lookups in BBExecutable, and make it a SmallPtrSet. llvm-svn: 85790	2009-11-02 06:11:23 +00:00
Chris Lattner	e77c9aa04a	Use the libanalysis 'ConstantFoldLoadFromConstPtr' function instead of reinventing SCCP-specific logic. This gives us new powers. llvm-svn: 85789	2009-11-02 06:06:14 +00:00
Chris Lattner	f548403989	switch the main 'ValueState' map from being an std::map to being a DenseMap. Doing this required being aware of subtle iterator invalidation issues, but it provides a big speedup. In a release-asserts build, this sped up optimizing 403.gcc from 1.34s -> 0.79s (IPSCCP) and 1.11s -> 0.44s (SCCP). This commit also conflates in a bunch of general cleanups, sorry. llvm-svn: 85788	2009-11-02 05:55:40 +00:00
Chris Lattner	4e849162ef	fix a bug exposed by moving SRoA earlier which caused a crash building kc++ llvm-svn: 85786	2009-11-02 04:37:17 +00:00
Chris Lattner	e82b087ae6	only IPSCCP incoming arguments if the function is executable, this fixes an assertion on the buildbot. llvm-svn: 85784	2009-11-02 03:25:55 +00:00
Chris Lattner	9e97fbe114	add a new ValueState::getConstantInt() helper, use it to simplify some code. llvm-svn: 85783	2009-11-02 03:21:36 +00:00
Chris Lattner	7ccf1a6df6	tidy up some more: remove some extraneous inline specifiers, return harder. llvm-svn: 85780	2009-11-02 03:03:42 +00:00
Chris Lattner	b5a13d4c90	eliminate the SCCPSolver::getValueMapping method. llvm-svn: 85778	2009-11-02 02:54:24 +00:00
Chris Lattner	c49ae9912a	fix failures introduced in r85774 llvm-svn: 85777	2009-11-02 02:48:17 +00:00
Chris Lattner	e405ed9651	factor duplicated code into a new DeleteInstructionInBlock function, eliminate temporary (and pointless) smallvector. llvm-svn: 85776	2009-11-02 02:47:51 +00:00
Chris Lattner	a3c39d394d	Chris used to use '...' instead of proper grammar. llvm-svn: 85775	2009-11-02 02:33:50 +00:00
Chris Lattner	6df5cec72f	remove some extraneous llvmcontext stuff. llvm-svn: 85774	2009-11-02 02:30:06 +00:00
Chris Lattner	efdd2bbce6	change LatticeVal to use PointerIntPair to save some space. llvm-svn: 85773	2009-11-02 02:20:32 +00:00
Chris Lattner	3cd6a61b27	fix instcombine to only do store sinking when the alignments of the two loads agree. Propagate that onto the new store. llvm-svn: 85772	2009-11-02 02:06:37 +00:00
Chris Lattner	328ef89bd1	when merging two loads, make sure to take the min of their alignment, not the max. This didn't matter until the previous patch because instcombine would refuse to sink loads with differenting alignments. llvm-svn: 85738	2009-11-01 20:07:07 +00:00
Chris Lattner	2a249e267a	split load sinking out to its own function, like gep sinking. llvm-svn: 85737	2009-11-01 20:04:24 +00:00
Chris Lattner	0b40a8bc0e	fix a bug noticed by inspection: when instcombine sinks loads through phis, it didn't preserve the alignment of the load. This is a missed optimization of the alignment is high and a miscompilation when the alignment is low. llvm-svn: 85736	2009-11-01 19:50:13 +00:00
Chris Lattner	b5d9c8c708	cleanups, switch GlobalDCE to SmallPtrSet instead of std::set llvm-svn: 85730	2009-11-01 19:03:42 +00:00
Chris Lattner	37536b90e1	remove a bunch of locking from LLVMContextImpl. Since only one thread can be banging on a context at a time, this isn't needed. Owen, please review. llvm-svn: 85728	2009-11-01 18:42:03 +00:00
Chris Lattner	249f96e339	improve comment. llvm-svn: 85725	2009-11-01 18:17:37 +00:00
Douglas Gregor	291f6145b8	Reverting 85714, 85715, 85716, which are breaking the build llvm-svn: 85717	2009-11-01 16:42:53 +00:00
Dan Gohman	576ac96367	Remove the #include of Pass.h from PassManager.h. This breaks a significant #include dependency, as frontends commonly pull in PassManager.h. llvm-svn: 85714	2009-11-01 15:20:19 +00:00
Chris Lattner	1a8b80ed5a	teach ipsccp and ipconstprop that a blockaddress doesn't 'take the address' of a function in a way that should prevent ip constprop. This allows clang/test/CodeGen/indirect-goto.c to pass with the new indirect goto lowering. llvm-svn: 85709	2009-11-01 06:11:53 +00:00
Chris Lattner	a1dc101f66	change llvm::MergeBlockIntoPredecessor to not merge two blocks BB1->BB2 when BB2 has its address taken. Since it ends up doing BB2->rauw(BB1), this can cause the address of the entry block to be taken. Since it is generally undesirable to nuke blocks whose address is taken, even when we can, just unconditionally stop this xform. llvm-svn: 85708	2009-11-01 04:57:33 +00:00
Chris Lattner	746139b736	strengthen an assumption: RevectorBlockTo knows that PredBB ended in an uncond branch because the pass requires BreakCriticalEdges. However, BCE doesn't eliminate critical adges from indbrs. llvm-svn: 85707	2009-11-01 04:23:20 +00:00
Chris Lattner	7a8db3a41a	if CostMetrics says to never duplicate some code, don't unswitch a loop. This prevents unswitching from duplicating indbr's. llvm-svn: 85705	2009-11-01 03:42:55 +00:00
Chris Lattner	54a4b84012	constant fold indirectbr(blockaddress(%bb)) -> br label %bb. llvm-svn: 85704	2009-11-01 03:40:38 +00:00
Chris Lattner	aa99c94e2a	Revert 85678/85680. The decision is to stay with the current form of indirectbr, thus we don't need "blockaddr(@func, null)". Eliminate it for simplicity. llvm-svn: 85699	2009-11-01 01:27:45 +00:00
Chris Lattner	a546dcf418	Make sure PRE doesn't split crit edges from indirectbr. llvm-svn: 85692	2009-10-31 22:11:15 +00:00
Chris Lattner	c872b09676	llvm::SplitEdge should refuse to split an edge from an indirectbr. Fix CodeGenPrepare to not try to split edges from indirectbr. llvm-svn: 85690	2009-10-31 22:04:43 +00:00
Chris Lattner	ba364b0a9a	update the comment above llvm::SplitCriticalEdge, and make it abort on IndirectBrInst as describe in the comment. llvm-svn: 85688	2009-10-31 21:51:10 +00:00
Chris Lattner	3c89c53f35	adjust a couple xforms to work with null bb's in BlockAddress. llvm-svn: 85680	2009-10-31 20:13:24 +00:00
Chris Lattner	a742b8f94f	add a comment. llvm-svn: 85671	2009-10-31 17:48:31 +00:00
Dan Gohman	2d02ff8cbb	Revert r85667. LoopUnroll currently can't call utility functions which auto-update the DominatorTree because it doesn't keep the DominatorTree current while it works. llvm-svn: 85670	2009-10-31 17:33:01 +00:00
Dan Gohman	144694bcb7	Remove redundant code. llvm-svn: 85668	2009-10-31 16:16:41 +00:00
Dan Gohman	041e2dbad1	Merge the enhancements from LoopUnroll's FoldBlockIntoPredecessor into MergeBlockIntoPredecessor. This makes SimplifyCFG slightly more aggressive, and makes it unnecessary for LoopUnroll to have its own copy of this code. llvm-svn: 85667	2009-10-31 16:08:00 +00:00
Dan Gohman	880c92ac1c	Rename forgetLoopBackedgeTakenCount to forgetLoop, because it clears out more information than just the stored backedge taken count. llvm-svn: 85664	2009-10-31 15:04:55 +00:00
Dan Gohman	969e83a4ff	Replace LoopUnrollPass.cpp's custom code-size estimation code using the new common CodeMetrics code. llvm-svn: 85663	2009-10-31 14:54:17 +00:00
Dan Gohman	fa8969f70e	Simplify this code. llvm-svn: 85662	2009-10-31 14:46:50 +00:00
Dan Gohman	af94015c18	Remove an unnecessary #include. llvm-svn: 85661	2009-10-31 14:39:43 +00:00
Dan Gohman	f35b6640f6	Update CMakeLists for recent renames. llvm-svn: 85660	2009-10-31 14:38:25 +00:00
Dan Gohman	f70e76c435	Rename UnrollLoop.cpp to LoopUnroll.cpp, and LoopUnroll.cpp to LoopUnrollPass.cpp, for consistency with other passes which are similarly split. llvm-svn: 85659	2009-10-31 14:37:31 +00:00
Dan Gohman	fb7f0e57b6	Remove CodeGenLICM. It's largely obsoleted by MachineLICM's new ability to unfold loop-invariant loads. llvm-svn: 85657	2009-10-31 14:35:41 +00:00
Dan Gohman	930aa9d3d2	Reapply r85634, with the bug fixed. llvm-svn: 85655	2009-10-31 14:22:52 +00:00
Evan Cheng	c16d8f2054	Revert 85634. It's breaking consumer-typeset (and others). llvm-svn: 85641	2009-10-31 01:28:06 +00:00
Dan Gohman	7f7d97eb73	Add a comment about a missed opportunity. llvm-svn: 85635	2009-10-30 23:15:43 +00:00
Dan Gohman	5bec30ca5d	Optimize around the fact that pred_iterator is slow: instead of sorting PHI operands by the predecessor order, sort them by the order used by the first PHI in the block. This is still suffucient to expose duplicates. llvm-svn: 85634	2009-10-30 23:15:21 +00:00
Dan Gohman	1a95106602	Teach SimplifyCFG how to eliminate duplicate PHI nodes within a block. This reduces codesize on a variety of codes by 1-2% on x86-64. It also helps clean up after SSAUpdater. llvm-svn: 85626	2009-10-30 22:39:04 +00:00
Dan Gohman	13e41edc71	Sort the incoming values in PHI nodes to match the predecessor order. This helps expose duplicate PHIs, which will make it easier for them to be eliminated. llvm-svn: 85623	2009-10-30 22:22:22 +00:00
Evan Cheng	5a6b9c40d6	Add option to createGVNPass to disable PRE. llvm-svn: 85609	2009-10-30 20:12:24 +00:00
Nick Lewycky	b43a43a8fd	Apply some cleanups. No functionality changes. llvm-svn: 85498	2009-10-29 07:35:15 +00:00
Chris Lattner	312748848f	just for the hell of it, allow globalopt to statically evaluate static constructors with indirect gotos :) llvm-svn: 85495	2009-10-29 05:51:50 +00:00
Chris Lattner	ee8b951e73	teach various passes about blockaddress. We no longer crash on any clang tests. llvm-svn: 85465	2009-10-29 01:21:20 +00:00
Chris Lattner	be060382e9	teach ValueMapper about BlockAddress', making bugpoint a lot more useful. llvm-svn: 85458	2009-10-29 00:31:02 +00:00
Chris Lattner	cf5a47d63d	unindent massive blocks, no functionality change. llvm-svn: 85457	2009-10-29 00:28:30 +00:00
Victor Hernandez	0d025421cd	Extend getMallocArraySize() to determine the array size if the malloc argument is: ArraySize * ElementSize ElementSize * ArraySize ArraySize << log2(ElementSize) ElementSize << log2(ArraySize) Refactor isArrayMallocHelper and delete isSafeToGetMallocArraySize, so that there is only 1 copy of the malloc array determining logic. Update users of getMallocArraySize() to not bother calling isArrayMalloc() as well. llvm-svn: 85421	2009-10-28 20:18:55 +00:00
Devang Patel	ffd561bc2d	llvm.dbg.global_variables do not exist anymore. llvm-svn: 85402	2009-10-28 16:51:52 +00:00
Edward O'Callaghan	1042ca112f	No newline at end of file. llvm-svn: 85390	2009-10-28 15:04:53 +00:00
Benjamin Kramer	ecc60b80b0	Update CMake file. llvm-svn: 85389	2009-10-28 13:29:18 +00:00
Owen Anderson	2b2bd28973	Treat lifetime begin/end markers as allocations/frees respectively for the purposes for GVN/DSE. llvm-svn: 85383	2009-10-28 07:05:35 +00:00
Nick Lewycky	175308c43e	Add ABCD, a generalized implementation of the Elimination of Array Bounds Checks on Demand algorithm which looks at arbitrary branches instead of loop iterations. This is GSoC work by Andre Tavares with only editorial changes applied! llvm-svn: 85382	2009-10-28 07:03:15 +00:00
Chris Lattner	a91a563530	Previously, all operands to Constant were themselves constant. In the new world order, BlockAddress can have a BasicBlock operand. This doesn't permute much, because if you have a ConstantExpr (or anything more specific than Constant) we still know the operand has to be a Constant. llvm-svn: 85375	2009-10-28 05:14:34 +00:00
Devang Patel	11cf3f4a27	Factor out redundancy from clone() implementations. llvm-svn: 85327	2009-10-27 22:16:29 +00:00
Victor Hernandez	f390e04a47	Rename MallocFreeHelper as MemoryBuiltins llvm-svn: 85286	2009-10-27 20:05:49 +00:00
Chris Lattner	c6b3b25f94	Fix a pretty serious misfeature of the inliner: if it inlines a function with multiple return values it inserts a PHI to merge them all together. However, if the return values are all the same, it ends up with a pointless PHI and this pointless PHI happens to really block SRoA from happening in at least a silly C++ example written by Doug, but probably others. This fixes rdar://7339069. llvm-svn: 85206	2009-10-27 05:39:41 +00:00
Mike Stump	2b0a49a682	VS build fix, patch by Marius Wachtler. llvm-svn: 85197	2009-10-27 02:14:13 +00:00
Eric Christopher	7a50b280c1	Add objectsize intrinsic and hook it up through codegen. Doesn't do anything than return "I don't know" at the moment. llvm-svn: 85189	2009-10-27 00:52:25 +00:00
Dan Gohman	f808106bbe	Add braces to avoid ambiguous else. llvm-svn: 85185	2009-10-27 00:11:02 +00:00
Victor Hernandez	762195bd01	Rename MallocHelper as MallocFreeHelper, since it now also identifies calls to free() llvm-svn: 85181	2009-10-26 23:58:56 +00:00
Owen Anderson	03b5de67b0	Add a straight-forward implementation of SCCVN for aggressively eliminating scalar redundancies. llvm-svn: 85179	2009-10-26 23:55:47 +00:00
Victor Hernandez	de5ad42aa1	Remove FreeInst. Remove LowerAllocations pass. Update some more passes to treate free calls just like they were treating FreeInst. llvm-svn: 85176	2009-10-26 23:43:48 +00:00
Dan Gohman	34e38afa96	Simplify this code. LoopDeletion doesn't need to explicit check that the loop exiting block dominates the latch block; if ScalarEvolution can prove that the trip-count is finite, that's sufficient. llvm-svn: 85165	2009-10-26 22:18:58 +00:00
Dan Gohman	672927f393	Code that checks WillNotOverflowSignedAdd before creating an Add can safely use the NSW bit on the Add. llvm-svn: 85164	2009-10-26 22:14:22 +00:00
Ted Kremenek	ce8f626f82	Update CMake files. llvm-svn: 85161	2009-10-26 22:06:01 +00:00
Dan Gohman	6a1d9eace9	Check in the experimental GEP splitter pass. This pass splits complex GEPs (more than one non-zero index) into simple GEPs (at most one non-zero index). In some simple experiments using this it's not uncommon to see 3% overall code size wins, because it exposes redundancies that can be eliminated, however it's tricky to use because instcombine aggressively undoes the work that this pass does. llvm-svn: 85144	2009-10-26 19:12:14 +00:00
Dan Gohman	6a10d5ebd3	Fix a typo in a comment. llvm-svn: 85120	2009-10-26 15:55:24 +00:00
Chris Lattner	683eed3286	reapply r85085 with a bugfix to avoid infinite looping. All of the 'demorgan' related xforms need to use dyn_castNotVal, not m_Not. llvm-svn: 85119	2009-10-26 15:40:07 +00:00
Dan Gohman	d632f89596	Make LSR's OptimizeShadowIV ignore induction variables with negative strides for now, because it doesn't handle them correctly. This fixes a miscompile of SingleSource/Benchmarks/Misc-C++/ray. This problem was usually hidden because indvars transforms such induction variables into negations of canonical induction variables. llvm-svn: 85118	2009-10-26 15:32:57 +00:00
Evan Cheng	8014a728b9	Revert 85085. It causes infinite looping during llvm-gcc build. llvm-svn: 85090	2009-10-26 03:51:32 +00:00
Chris Lattner	2e6564d6ff	Implement PR3266 & PR5276, folding: not (or (icmp, icmp)) -> and(icmp, icmp) llvm-svn: 85085	2009-10-26 01:06:31 +00:00
Nick Lewycky	974e12b2d3	Remove includes of Support/Compiler.h that are no longer needed after the VISIBILITY_HIDDEN removal. llvm-svn: 85043	2009-10-25 06:57:41 +00:00
Nick Lewycky	02d5f77d26	Remove VISIBILITY_HIDDEN from class/struct found inside anonymous namespaces. Chris claims we should never have visibility_hidden inside any .cpp file but that's still not true even after this commit. llvm-svn: 85042	2009-10-25 06:33:48 +00:00
Nick Lewycky	54d7179a25	Remove ICmpInst::isSignedPredicate which was a reimplementation CmpInst::isSigned. llvm-svn: 85037	2009-10-25 05:20:17 +00:00
Dan Gohman	ef41a1ce3c	MapValue doesn't needs its LLVMContext argument. llvm-svn: 85020	2009-10-24 23:37:16 +00:00
Dan Gohman	8f4078ba39	Rename isLoopExit to isLoopExiting, for consistency with the wording used elsewhere - an exit block is a block outside the loop branched to from within the loop. An exiting block is a block inside the loop that branches out. llvm-svn: 85019	2009-10-24 23:34:26 +00:00
Dan Gohman	b979794e4b	Rewrite LoopRotation's SSA updating code using SSAUpdater. llvm-svn: 85016	2009-10-24 23:19:52 +00:00
Victor Hernandez	e297149e26	Auto-upgrade free instructions to calls to the builtin free function. Update all analysis passes and transforms to treat free calls just like FreeInst. Remove RaiseAllocations and all its tests since FreeInst no longer needs to be raised. llvm-svn: 84987	2009-10-24 04:23:03 +00:00
Victor Hernandez	8acf2956b8	Remove AllocationInst. Since MallocInst went away, AllocaInst is the only subclass of AllocationInst, so it no longer is necessary. llvm-svn: 84969	2009-10-23 21:09:37 +00:00
Dan Gohman	41d00ac45b	Make LoopDeletion check the maximum backedge taken count, rather than the exact backedge taken count, when checking for infinite loops. This allows it to delete loops with multiple exit conditions. llvm-svn: 84952	2009-10-23 17:10:01 +00:00
Chris Lattner	cf7e8947e9	move another load optimization from instcombine -> libanalysis. llvm-svn: 84841	2009-10-22 06:44:07 +00:00
Chris Lattner	51d2f70e32	move 'loading i32 from string' optimization from instcombine to libanalysis. Instcombine shrinking... does this even make sense??? llvm-svn: 84840	2009-10-22 06:38:35 +00:00
Chris Lattner	1664a4fd86	Move some constant folding logic for loads out of instcombine into Analysis/ConstantFolding.cpp. This doesn't change the behavior of instcombine but makes other clients of ConstantFoldInstruction able to handle loads. This was partially extracted from Eli's patch in PR3152. llvm-svn: 84836	2009-10-22 06:25:11 +00:00
Chris Lattner	c7a962d3b3	fix PR5262. llvm-svn: 84810	2009-10-22 00:17:26 +00:00
Devang Patel	27e0be274e	Derive metadata hierarchy from Value instead of User. llvm-svn: 84801	2009-10-21 23:57:35 +00:00
Chris Lattner	966526cbfb	revert r84754, it isn't the right approach. Edwin, please propose patches for fixes like this instead of committing them directly. llvm-svn: 84799	2009-10-21 23:41:58 +00:00
Victor Hernandez	be9e179104	Make changes to rev 84292 as requested by Chris Lattner. Most changes are cleanup, but there is 1 correctness fix: I fixed InstCombine so that the icmp is removed only if the malloc call is removed (which requires explicit removal because the Worklist won't DCE any calls since they can have side-effects). llvm-svn: 84772	2009-10-21 19:11:40 +00:00
Torok Edwin	1539a352a6	Fix PR5262: when folding select into PHI, make sure all operands are available in the PHI's Basic Block. This uses a conservative approach, because we don't have dominator info in instcombine. llvm-svn: 84754	2009-10-21 10:49:00 +00:00
Chris Lattner	8ed7bef409	make GVN work better when TD is not around: "In the existing code, if the load and the value to replace it with are of different types and target data is available, it tries to use the target data to coerce the replacement value to the type of the load. Otherwise, it skips all effort to handle the type mismatch and just feeds the wrongly-typed replacement value to replaceAllUsesWith, which triggers an assertion. The patch replaces it with an outer if checking for type mismatch, and an inner if-else that checks whether target data is available and, if not, returns false rather than trying to replace the load." Patch by Kenneth Uildriks! llvm-svn: 84739	2009-10-21 04:11:19 +00:00
Devang Patel	1d7f7d21dc	Do not remove dead metadata for now. llvm-svn: 84731	2009-10-21 02:21:34 +00:00
Chris Lattner	7f903681ac	alternate fix for PR5258 which avoids worklist problems, with reduced testcase. llvm-svn: 84667	2009-10-20 20:27:49 +00:00
Dan Gohman	b6b8ec769c	Restore LoopUnswitch's block-oriented threshold. LoopUnswitch now checks both the estimated code size and the number of blocks when deciding whether to do a non-trivial unswitch. This protects it from some very undesirable worst-case behavior on large numbers of loop-unswitchable conditions, such as in the testcase in PR5259. llvm-svn: 84661	2009-10-20 20:06:09 +00:00
Torok Edwin	cf10ec951d	Fix PR5258, jump-threading creating invalid PHIs. When an incoming value for a PHI is updated, we must also updated all other incoming values for the same BB to match, otherwise we create invalid PHIs. llvm-svn: 84638	2009-10-20 15:42:00 +00:00
Torok Edwin	729d92bd74	Fix PR4313: IPSCCP was not setting the lattice value for the invoke instruction when the invoke had multiple return values: it set the lattice value only on the extractvalue. This caused the invoke's lattice value to remain the default (undefined), and later propagated to extractvalue's operand, which incorrectly introduces undefined behavior. llvm-svn: 84637	2009-10-20 15:15:09 +00:00
Owen Anderson	168ad6985e	Refactor lookup_or_add to contain _MUCH_ less duplicated code. Add support for numbering first class aggregate instructions while we're at it. llvm-svn: 84547	2009-10-19 22:14:22 +00:00
Victor Hernandez	5c704d505c	Malloc calls are marked NoAlias, so the code below the isMalloc() check makes it redundant. Removing the isMalloc() check. llvm-svn: 84541	2009-10-19 21:47:22 +00:00
Owen Anderson	1059b5b32d	Simplify some code. llvm-svn: 84533	2009-10-19 21:14:57 +00:00
Dan Gohman	8f986672a1	Fix SplitBlockPredecessors' LoopInfo updating code to handle the case where a loop's header is being split and it has predecessors which are not contained by the most-nested loop which contains the loop. This fixes PR5235. llvm-svn: 84505	2009-10-19 16:04:50 +00:00
Dan Gohman	511d2e26dd	Change instnamer to name arguments "arg" instead of "tmp" for clarity, and to name basic blocks "bb" instead of "BB", for consistency. llvm-svn: 84502	2009-10-19 14:47:32 +00:00
Chris Lattner	1fa98f0d74	remove the IndMemRemPass, which only made sense for when malloc/free were intrinsic instructions. llvm-svn: 84404	2009-10-18 05:02:09 +00:00
Daniel Dunbar	8eff29d805	Use raw_ostream::write_escaped instead of EscapeString. llvm-svn: 84356	2009-10-17 20:43:19 +00:00
Chris Lattner	88b36f1140	Simplify some code (first hunk) and fix PR5208 (second hunk) by updating the callgraph when introducing a call. llvm-svn: 84310	2009-10-17 05:39:39 +00:00
Victor Hernandez	a3aaf85e23	Remove MallocInst from LLVM Instructions. llvm-svn: 84299	2009-10-17 01:18:07 +00:00
Victor Hernandez	c7d6a8327c	Autoupgrade malloc insts to malloc calls. Update testcases that rely on malloc insts being present. Also prematurely remove MallocInst handling from IndMemRemoval and RaiseAllocations to help pass tests in this incremental step. llvm-svn: 84292	2009-10-17 00:00:19 +00:00
Victor Hernandez	264da3274e	HeapAllocSRoA also needs to check if malloc array size can be computed. llvm-svn: 84288	2009-10-16 23:12:25 +00:00
Dan Gohman	99429a00ff	Move zext and sext casts fed by loads into the same block as the load, to help SelectionDAG fold them into the loads, unless conditions are unfavorable. llvm-svn: 84271	2009-10-16 20:59:35 +00:00
Duncan Sands	0058c7bcb0	Strip trailing white space. llvm-svn: 84256	2009-10-16 15:20:13 +00:00
Victor Hernandez	13020b1faf	Fix bug where array malloc with unexpected computation of the size argument resulted in MallocHelper identifying the malloc as a non-array malloc. This broke GlobalOpt's optimization of stores of mallocs to global variables. The fix is to classify malloc's into 3 categories: 1. non-array mallocs 2. array mallocs whose array size can be determined 3. mallocs that cannot be determined to be of type 1 or 2 and cannot be optimized getMallocArraySize() returns NULL for category 3, and all users of this function must avoid their malloc optimization if this function returns NULL. Eventually, currently unexpected codegen for computing the malloc's size argument will be supported in isArrayMalloc() and getMallocArraySize(), extending malloc optimizations to those examples. llvm-svn: 84199	2009-10-15 20:14:52 +00:00
Chris Lattner	c855b45b78	only try to fold constantexpr operands when the worklist is first populated, don't bother every time going around the main worklist. This speeds up a release-asserts opt -std-compile-opts on 403.gcc by about 4% (1.5s). It seems to speed up the most expensive instances of instcombine by ~10%. llvm-svn: 84171	2009-10-15 04:59:28 +00:00
Chris Lattner	dd1f68a10c	don't bother calling ConstantFoldInstruction unless there is a use of the instruction (which disqualifies stores, unreachable, etc) and at least the first operand is a constant. This filters out a lot of obvious cases that can't be folded. Also, switch the IRBuilder to a TargetFolder, which tries harder. llvm-svn: 84170	2009-10-15 04:13:44 +00:00
Devang Patel	92f8619923	Use isVoidTy() llvm-svn: 84118	2009-10-14 17:29:00 +00:00
Chris Lattner	6b9044db01	make instcombine's instruction sinking more aggressive in the presence of PHI nodes. llvm-svn: 84103	2009-10-14 15:21:58 +00:00
Devang Patel	a677136900	Check void type before using RAUWd. llvm-svn: 84049	2009-10-13 22:56:32 +00:00
Devang Patel	115741ba79	Do not check use_empty() before replaceAllUsesWith(). This gives ValueHandles a chance to get properly updated. llvm-svn: 84033	2009-10-13 21:41:20 +00:00
Dan Gohman	2dc6f8de03	Use the new CodeMetrics class to compute code size instead of manually counting instructions. llvm-svn: 84016	2009-10-13 20:12:23 +00:00
Ted Kremenek	113d959f1b	Update CMake file. llvm-svn: 84001	2009-10-13 18:48:07 +00:00
Dan Gohman	54463e837a	Commit the removal of this file, which is now moved to lib/Analysis. llvm-svn: 83999	2009-10-13 18:37:20 +00:00
Dan Gohman	4552e3cd73	Move the InlineCost code from Transforms/Utils to Analysis. llvm-svn: 83998	2009-10-13 18:30:07 +00:00
Dan Gohman	5b3e05bcaa	Start refactoring the inline cost estimation code so that it can be used for purposes other than inlining. llvm-svn: 83997	2009-10-13 18:24:11 +00:00
Chris Lattner	19788ca686	change simplifycfg to not duplicate 'unwind' instructions. Hopefully this will increase the likelihood of common code getting sunk towards the unwind. llvm-svn: 83996	2009-10-13 18:13:05 +00:00
Dan Gohman	71ca652475	Make LoopUnswitch's cost estimation count Instructions, rather than BasicBlocks, so that it doesn't blindly procede in the presence of large individual BasicBlocks. This addresses a class of code-size expansion problems. llvm-svn: 83992	2009-10-13 17:50:43 +00:00
Evan Cheng	f815861591	Make licm debug message readable. llvm-svn: 83908	2009-10-12 22:25:23 +00:00
Dale Johannesen	4c9f0e8f53	Fix warning. llvm-svn: 83870	2009-10-12 18:45:32 +00:00
Chris Lattner	8abd572dae	populate instcombine's initial worklist more carefully, causing it to visit instructions from the start of the function to the end of the function in the first path. This greatly speeds up some pathological cases (e.g. PR5150). Try #3, this time with some unneeded debug info stuff removed which was causing dead pointers to be added to the worklist. llvm-svn: 83818	2009-10-12 03:58:40 +00:00
Chris Lattner	8ce6b36c86	revert r83814 for now, it is making the llvm-gcc bootstrap unhappy. llvm-svn: 83817	2009-10-11 23:56:08 +00:00
Chris Lattner	78d6310429	populate instcombine's initial worklist more carefully, causing it to visit instructions from the start of the function to the end of the function in the first path. This greatly speeds up some pathological cases (e.g. PR5150). llvm-svn: 83814	2009-10-11 23:17:43 +00:00
Chris Lattner	2c2deae5ac	remove some harmful code that would turn an insertelement on an undef into a shuffle even if it was used by another insertelement. If the visitation order of instcombine was wrong, this would turn a chain of insertelements into a chain of shufflevectors, which was quite painful. Since CollectShuffleElements handles these cases, the code can just be nuked. llvm-svn: 83810	2009-10-11 23:02:46 +00:00
Chris Lattner	c6cdbfbfdd	teach instcombine to simplify xor's harder, catching the new testcase. llvm-svn: 83799	2009-10-11 22:22:13 +00:00
Chris Lattner	6e6ac47125	cleanups llvm-svn: 83797	2009-10-11 22:00:32 +00:00
Chris Lattner	1639234775	cleanup, no functionality change. llvm-svn: 83795	2009-10-11 21:36:10 +00:00
Chris Lattner	fd27f8a5b3	generalize a transformation even more: we don't care whether the input the the mul is a zext from bool, just that it is all zeros other than the low bit. This fixes some phase ordering issues that would cause us to miss some xforms in mul.ll when the worklist is visited differently. llvm-svn: 83794	2009-10-11 21:29:45 +00:00
Chris Lattner	406cb75c6b	simplify a transformation by making it more general. llvm-svn: 83792	2009-10-11 21:22:21 +00:00
Chris Lattner	f39f4f928a	temporarily revert previous patch llvm-svn: 83791	2009-10-11 21:05:34 +00:00
Chris Lattner	bb058d3a23	populate instcombine's initial worklist more carefully, causing it to visit instructions from the start of the function to the end of the function in the first path. This greatly speeds up some pathological cases (e.g. PR5150). llvm-svn: 83790	2009-10-11 21:04:37 +00:00
Torok Edwin	8b3081350e	Remove CleanupDbgInfo, instcombine does this and its not worth duplicating it here. llvm-svn: 83789	2009-10-11 19:58:35 +00:00
Torok Edwin	907ec36943	LICM shouldn't sink/delete debug information. Fix this and add a testcase. For now the metadata of sinked/hoisted instructions is still wrong, but that'll be fixed when instructions will have debug metadata directly attached. llvm-svn: 83786	2009-10-11 19:15:54 +00:00
Chris Lattner	85c85c5e04	when folding duplicate conditions, delete the now-probably-dead instruction tree feeding it. llvm-svn: 83778	2009-10-11 18:39:58 +00:00
Chris Lattner	e374382b8f	implement rdar://7293527, a trivial instcombine that llvm-gcc gets but clang doesn't, because it is implemented in GCC's fold routine. llvm-svn: 83761	2009-10-11 07:53:15 +00:00
Chris Lattner	97b1405207	implement a transformation in jump threading that is currently done by condprop, but do it in a much more general form. The basic idea is that we can do a limited form of tail duplication in the case when we have a branch on a phi. Moving the branch up in to the predecessor block makes instruction selection much easier and encourages chained jump threadings. llvm-svn: 83759	2009-10-11 07:24:57 +00:00
Chris Lattner	6ce85e85f5	restructure some code, no functionality change. llvm-svn: 83756	2009-10-11 04:40:21 +00:00
Chris Lattner	f466bc84c9	factor some code better and move a function, no functionality change. llvm-svn: 83755	2009-10-11 04:33:43 +00:00
Chris Lattner	f99a74e24b	make jump threading on a phi with undef inputs happen. llvm-svn: 83754	2009-10-11 04:18:15 +00:00
Chris Lattner	71d353dd48	rewrite LCSSA to use SSAUpdate, to only return true if it modifies the IR, and to implement the FIXME'd optimization. llvm-svn: 83748	2009-10-11 02:53:37 +00:00
Chris Lattner	101dde30ed	clean up and simplify some code. Don't use setvector when things will be inserted only once, just use vector. Don't compute ExitBlocks unless we need it, change std::sort to array_pod_sort. llvm-svn: 83747	2009-10-11 01:07:15 +00:00
Chris Lattner	b6c65faa64	switch GVN to use SSAUpdater. Besides removing a lot of complexity from GVN, this also speeds it up, inserts fewer PHI nodes (see the testcase) and allows it to remove more loads (due to fewer PHI nodes standing in the way). llvm-svn: 83746	2009-10-10 23:50:30 +00:00
Chris Lattner	9c382cebc5	add a simple helper method. llvm-svn: 83745	2009-10-10 23:41:48 +00:00
Chris Lattner	249265de06	add ability for clients of SSAUpdater to find out about the PHI nodes inserted. llvm-svn: 83744	2009-10-10 23:15:24 +00:00
Chris Lattner	89d2a5c4f3	remove dead code llvm-svn: 83742	2009-10-10 23:04:12 +00:00
Chris Lattner	67cdd8b567	add the ability to get a rewritten value from the middle of a block, not just at the end. Add a big comment explaining when this could be useful (which never happens for jump threading). llvm-svn: 83741	2009-10-10 23:00:11 +00:00
Chris Lattner	e474a8d3a7	rename GetValueInBlock -> GetValueAtEndOfBlock to better reflect what it does. llvm-svn: 83740	2009-10-10 22:41:58 +00:00
Chris Lattner	65e69a77e1	use a typedef instead of spelling out an insane type. Yay for auto someday. llvm-svn: 83707	2009-10-10 09:09:20 +00:00
Chris Lattner	84095071ea	Change jump threading to use the new SSAUpdater class instead of DemoteRegToStack. This makes it more efficient (because it isn't creating a ton of load/stores that are eventually removed by a later mem2reg), and more slightly more effective (because those load/stores don't get in the way of threading). llvm-svn: 83706	2009-10-10 09:05:58 +00:00
Chris Lattner	60d4e69c81	Implement an efficient and fully general SSA update mechanism that works on unstructured CFGs. This implements PR217, our oldest open PR. llvm-svn: 83705	2009-10-10 09:04:27 +00:00
Chris Lattner	f30a2b0c86	random tidying llvm-svn: 83701	2009-10-10 06:22:45 +00:00
Dale Johannesen	96a5b87ae2	Use names instead of numbers for some of the magic constants used in inlining heuristics (especially those used in more than one file). No functional change. llvm-svn: 83675	2009-10-09 21:42:02 +00:00
Dale Johannesen	3059924bdd	When considering whether to inline Callee into Caller, and that will make Caller too big to inline, see if it might be better to inline Caller into its callers instead. This situation is described in PR 2973, although I haven't tried the specific case in SPASS. llvm-svn: 83602	2009-10-09 00:11:32 +00:00
Dan Gohman	09984279fd	Add a form of addPreserved which takes a string argument, to allow passes to declare that they preserve other passes without needing to pull in additional header file or library dependencies. Convert MachineFunctionPass and CodeGenLICM to make use of this. llvm-svn: 83555	2009-10-08 17:00:02 +00:00
Jeffrey Yasskin	dafd08ea7e	In instcombine's debug output, avoid printing ADD for instructions that are already on the worklist, and print Visited when an instruction is about to be visited. Net, on one input, this reduced the output size by at least 9x. llvm-svn: 83510	2009-10-08 00:12:24 +00:00
Eric Christopher	5b741f3d14	80-column and whitespace fixes. llvm-svn: 83489	2009-10-07 21:14:25 +00:00
Eric Christopher	e666bc9f64	Add FreeInst to the "is a call" check for Insts that are calls, but not intrinsics. llvm-svn: 83441	2009-10-07 00:54:08 +00:00
Eric Christopher	6ba26317ce	While we still have a MallocInst treat it as a call like any other for inlining. When MallocInst goes away this code will be subsumed as part of calls and work just fine... llvm-svn: 83434	2009-10-07 00:02:18 +00:00
Ted Kremenek	2275a7dfef	Update CMake file. llvm-svn: 83404	2009-10-06 19:45:38 +00:00
Chris Lattner	a893f5bdf5	remove predicate simplifier, it never got the last bugs beaten out of it, and jump threading, condprop and gvn are now getting most of the benefit. This was approved by Nicholas and Nicolas. llvm-svn: 83390	2009-10-06 16:59:46 +00:00
Duncan Sands	9ed7b16bf3	Introduce and use convenience methods for getting pointer types where the element is of a basic builtin type. For example, to get an i8* use getInt8PtrTy. llvm-svn: 83379	2009-10-06 15:40:36 +00:00
Dan Gohman	e525d9ddc0	Remove an unnnecessary LLVMContext argument in ConstantFoldLoadThroughGEPConstantExpr. llvm-svn: 83311	2009-10-05 16:36:26 +00:00
Dan Gohman	238cf49812	Use Use::operator= instead of Use::set, for consistency. llvm-svn: 83310	2009-10-05 16:31:55 +00:00
Chris Lattner	fdd8790718	strength reduce a ton of type equality tests to check the typeid (Through the new predicates I added) instead of going through a context and doing a pointer comparison. Besides being cheaper, this allows a smart compiler to turn the if sequence into a switch. llvm-svn: 83297	2009-10-05 05:54:46 +00:00
Chris Lattner	463716d559	instcombine shouldn't delete all null checks for mallocs. This fixes PR5130. llvm-svn: 83290	2009-10-05 02:47:47 +00:00
Owen Anderson	b5049bebb3	Do away with the strange use of BitVectors in SSI, and just use normal sets. This makes the code much more C++/LLVM-ish. llvm-svn: 83286	2009-10-04 18:49:55 +00:00
Owen Anderson	286feb16a9	Fix a typo in the comment. llvm-svn: 83283	2009-10-04 17:52:13 +00:00
Owen Anderson	a62bf10651	SSI needs to require DT and DF transitively, since it uses them outside of its runOnFunction. Similarly, it can be marked setPreservesAll, since it does no work in its runOnFunction. llvm-svn: 83282	2009-10-04 17:47:39 +00:00
Evan Cheng	bb4ed2394b	Allow -inline-threshold override default threshold even if compiling to optimize for size. llvm-svn: 83274	2009-10-04 06:13:54 +00:00
Douglas Gregor	d846fbf20d	Remove GVNPRE.cpp from the CMake makefile llvm-svn: 83194	2009-10-01 05:30:05 +00:00
Chris Lattner	5f3cc06cd2	remove the GVNPRE pass. It has been subsumed by the GVN pass. Ok'd by Owen. llvm-svn: 83193	2009-10-01 02:18:36 +00:00
Dan Gohman	ea0bb8f555	Fix this code so that it doesn't try to iterate through a std::vector while calling changeImmediateDominator, which removes elements from the vector. This fixes PR5097. llvm-svn: 83166	2009-09-30 20:54:16 +00:00
Dan Gohman	7d3b0be05b	Remove a redundant #ifndef and add an assertion string. llvm-svn: 82991	2009-09-28 14:38:19 +00:00
Dan Gohman	9a7320c711	Convert LoopSimplify and LoopExtractor from FunctionPass to LoopPass. llvm-svn: 82990	2009-09-28 14:37:51 +00:00
Chris Lattner	0261b5d2d2	The select instruction is not neccesarily in the same block as the phi nodes. Make sure to phi translate from the right block. This fixes a llvm-building-llvm failure on GVN-PRE.cpp llvm-svn: 82970	2009-09-28 06:49:44 +00:00
Chris Lattner	4425660b1f	simplify some code. llvm-svn: 82936	2009-09-27 21:46:50 +00:00
Chris Lattner	b2e88cd01c	The bitcast case is not needed here: instcombine turns icmp(bitcast(x), null) -> icmp(x, null) already. llvm-svn: 82935	2009-09-27 21:42:46 +00:00
Chris Lattner	8b4d3dfbbf	calls are already unmovable, malloc doesn't need a special case. llvm-svn: 82933	2009-09-27 21:36:19 +00:00
Chris Lattner	f9e0c7f84b	calls to external functions are already marked overdefined, special casing malloc isn't needed. llvm-svn: 82932	2009-09-27 21:35:11 +00:00
Chris Lattner	5abb1e4cd2	calls are already handled, malloc doesn't need a special case. llvm-svn: 82931	2009-09-27 21:33:46 +00:00
Chris Lattner	466d57f6c1	calls are rejected above, no need to special case malloc here. llvm-svn: 82929	2009-09-27 21:31:39 +00:00
Chris Lattner	43d0db70ac	remove special handling of bitcast(malloc), it will be handled when the loop inspects the bitcast operand. llvm-svn: 82928	2009-09-27 21:29:28 +00:00
Chris Lattner	a8627272c1	unlike the malloc instruction, "malloc" calls do not claim to be readonly, just nounwind. llvm-svn: 82927	2009-09-27 21:23:38 +00:00
Chris Lattner	b391e87263	allow pushing icmps through phis with multiple uses and across critical edges. These are important to push up to encourage jump threading. This shrinks 176.gcc a bit. llvm-svn: 82923	2009-09-27 20:46:36 +00:00
Chris Lattner	ae289632ef	Enhance the previous fix for PR4895 to allow more values than just simple constants for the true/false value of the select. We now do phi translation etc. This really fixes PR4895 :) llvm-svn: 82917	2009-09-27 20:18:49 +00:00
Chris Lattner	facb867af3	implement PR4895, by making FoldOpIntoPhi handle select conditions that are phi nodes. Also tighten up FoldOpIntoPhi to treat constantexpr operands to phis just like other variables, avoiding moving constantexpr computations around. Patch by Daniel Dunbar. llvm-svn: 82913	2009-09-27 19:57:57 +00:00
Dan Gohman	0e70af36c0	Grab an LLVM Context from an instruction that exists rather than one that is deleted in some situations. This fixes a use-after-free. llvm-svn: 82903	2009-09-27 16:10:30 +00:00
Dan Gohman	fc20b67e80	Tell ScalarEvolution to forget everything it knows about a loop before rotating the loop, since loop rotation is a very significant change. llvm-svn: 82901	2009-09-27 15:37:03 +00:00
Nick Lewycky	42fb7452df	Instruction::clone does not need to take an LLVMContext&. Remove that and update all the callers. llvm-svn: 82889	2009-09-27 07:38:41 +00:00
Dan Gohman	62995c71a2	Fix SimplifyLibCalls to transfer attributes from callees rather than calls, since direct calls don't always reflect the attributes of their callees. llvm-svn: 82867	2009-09-26 18:10:13 +00:00
Dan Gohman	394468dc8e	Rename ConstantFP's getInf to getInfinity. llvm-svn: 82823	2009-09-25 23:40:21 +00:00
Dan Gohman	5ffd53892d	Transform pow(x, 0.5) to (x == -inf ? inf : fabs(sqrt(x))), which is typically faster then doing a general pow. llvm-svn: 82819	2009-09-25 23:10:17 +00:00
Torok Edwin	21bd8c9fc5	Constant propagating byval pointer is safe if function is readonly. llvm-svn: 82700	2009-09-24 18:33:42 +00:00
Torok Edwin	f95a450ef9	Don't constant propagate byval pointers, since they are not really pointers, but rather structs passed by value. This fixes PR5038. llvm-svn: 82689	2009-09-24 09:47:18 +00:00
Dale Johannesen	fb1b55bc9c	A minor improvment in accuracy to inline cost computation, and some cosmetics. llvm-svn: 82660	2009-09-23 22:05:24 +00:00
Chris Lattner	e3ce1e2a37	tidy up llvm-svn: 82488	2009-09-21 22:26:02 +00:00
Chris Lattner	247053867e	big endian systems shift by bits too, hopefully this will fix the ppc bootstrap problems. llvm-svn: 82464	2009-09-21 17:55:47 +00:00
Dan Gohman	43d6830ea0	Nick pointed out that DominanceFrontier and DominanceTree are preserved by setPreservesCFG(). llvm-svn: 82463	2009-09-21 17:54:42 +00:00
Dan Gohman	af57ae3da4	Remove the special-case for constants in PHI nodes; it's not really helpful, and it didn't correctly handle the case of constants input to PHIs for backedges. llvm-svn: 82462	2009-09-21 17:53:35 +00:00
Chris Lattner	9045f235d2	fix PR5016, a crash I introduced in GVN handing first class arrays and structs, which cannot be bitcast to integers. llvm-svn: 82460	2009-09-21 17:24:04 +00:00
Chris Lattner	4d8af2f1ae	enable non-local analysis and PRE of large store -> little load. This doesn't kick in too much because of phi translation issues, but this can be resolved in the future. llvm-svn: 82447	2009-09-21 06:48:08 +00:00
Chris Lattner	0cdc17eb50	convert an std::pair to an explicit struct. llvm-svn: 82446	2009-09-21 06:30:24 +00:00
Chris Lattner	d28f90897a	move some functions, add a comment. llvm-svn: 82444	2009-09-21 06:24:16 +00:00
Chris Lattner	9d7fb29522	split HandleLoadFromClobberingStore in two pieces: one that does the analysis, one that does the xform. llvm-svn: 82443	2009-09-21 06:22:46 +00:00
Chris Lattner	0a9616d906	Improve GVN to be able to forward substitute a small load from a piece of a large store when both are in the same block. This allows clang to compile the testcase in PR4216 to this code: _test_bitfield: movl 4(%esp), %eax movl %eax, %ecx andl $-65536, %ecx orl $32962, %eax andl $40186, %eax orl %ecx, %eax ret This is not ideal, but is a whole lot better than the code produced by llvm-gcc: _test_bitfield: movw $-32574, %ax orw 4(%esp), %ax andw $-25350, %ax movw %ax, 4(%esp) movw 7(%esp), %cx shlw $8, %cx movzbl 6(%esp), %edx orw %cx, %dx movzwl %dx, %ecx shll $16, %ecx movzwl %ax, %eax orl %ecx, %eax ret and dramatically better than that produced by gcc 4.2: _test_bitfield: pushl %ebx call L3 "L00000000001$pb": L3: popl %ebx movl 8(%esp), %eax leal 0(,%eax,4), %edx sarb $7, %dl movl %eax, %ecx andl $7168, %ecx andl $-7201, %ebx movzbl %dl, %edx andl $1, %edx sall $5, %edx orl %ecx, %ebx orl %edx, %ebx andl $24, %eax andl $-58336, %ebx orl %eax, %ebx orl $32962, %ebx movl %ebx, %eax popl %ebx ret llvm-svn: 82439	2009-09-21 05:57:11 +00:00
Chris Lattner	1eefa9c427	formatting cleanups, no functionality change. llvm-svn: 82426	2009-09-21 02:42:51 +00:00
Chris Lattner	a0aa8fb6a6	Move CoerceAvailableValueToLoadType earlier in GVN.cpp. Hook it up so that nonlocal and partially redundant loads can use it as well. The testcase shows examples of craziness this can handle. This triggers many times in 176.gcc. llvm-svn: 82403	2009-09-20 20:09:34 +00:00
Chris Lattner	7c62d8a1a8	change the interface to CoerceAvailableValueToLoadType to be more generic. llvm-svn: 82402	2009-09-20 19:31:14 +00:00
Chris Lattner	1dd48c34e5	enhance GVN to forward substitute a stored value to a load (and load -> load) when the base pointers must alias but when they are different types. This occurs very very frequently in 176.gcc and other code that uses bitfields a lot. llvm-svn: 82399	2009-09-20 19:03:47 +00:00
Daniel Dunbar	7d6781b0fe	Tabs -> spaces, and remove trailing whitespace. llvm-svn: 82355	2009-09-20 02:20:51 +00:00
Nick Lewycky	1303c0ab86	Remove the default value for ConstantStruct::get's isPacked parameter and update the code which was broken by this. llvm-svn: 82327	2009-09-19 20:30:26 +00:00
Victor Hernandez	5d034499ad	Enhance transform passes so that they apply the same tranforms to malloc calls as to MallocInst. Reviewed by Dan Gohman. llvm-svn: 82300	2009-09-18 22:35:49 +00:00
Victor Hernandez	788eaabd18	Update malloc call creation code (AllocType is now the element type of the malloc, not the resulting type). In getMallocArraySize(), fix bug in the case that array size is the product of 2 constants. Extend isArrayMalloc() and getMallocArraySize() to handle case where malloc is used as char array. Ensure that ArraySize in LowerAllocations::runOnBasicBlock() is correct type. Extend Instruction::isSafeToSpeculativelyExecute() to handle malloc calls. Add verification for malloc calls. Reviewed by Dan Gohman. llvm-svn: 82257	2009-09-18 19:20:02 +00:00
Daniel Dunbar	487d1c8138	Update CMake. llvm-svn: 82097	2009-09-17 00:06:48 +00:00
Dan Gohman	0f64d71d99	Add a new pass for doing late hoisting of floating-point and vector constants out of loops. These aren't covered by the regular LICM pass, because in LLVM IR constants don't require separate instructions. They're not always covered by the MachineLICM pass either, because it doesn't know how to unfold folded constant-pool loads. This is somewhat experimental at this point, and off by default. llvm-svn: 82076	2009-09-16 20:25:11 +00:00
Dan Gohman	bd0050810c	Change FoldPHIArgBinOpIntoPHI to decline folding if it would introduce two phis, similar to the FoldPHIArgGEPIntoPHI change. Also, delete some comments that don't reflect the code. llvm-svn: 82053	2009-09-16 16:50:24 +00:00
Andreas Neustifter	41c1103273	Reapplied r81355 with the problems fixed. (See http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20090907/086737.html and http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20090907/086746.html) llvm-svn: 82039	2009-09-16 11:35:50 +00:00
Andreas Neustifter	f8cb758ba8	Preserve ProfileInfo during CodeGenPrepare. llvm-svn: 82034	2009-09-16 09:26:52 +00:00
Dan Gohman	3b7ce109ec	Don't sink gep operators through phi nodes if the result would require more than one phi, since that leads to higher register pressure on entry to the phi. This is especially problematic when the phi is in a loop header, as it increases register pressure throughout the loop. llvm-svn: 81993	2009-09-16 02:01:52 +00:00
Nick Lewycky	7465cd769c	Add more newlines to make up for the ones removed from the end of instructions. llvm-svn: 81851	2009-09-15 07:08:25 +00:00
Chris Lattner	e0987215f0	add a new CallGraphNode::replaceCallEdge method and use it from argpromote to avoid invalidating an iterator. This fixes PR4977. All clang tests now pass with expensive checking (on my system at least). llvm-svn: 81843	2009-09-15 05:40:35 +00:00
Chris Lattner	e9a4992399	add newline to debug dump llvm-svn: 81840	2009-09-15 05:14:57 +00:00
Dan Gohman	f9eafce3af	When extending a memset range past the front, set the alignment of the memset region to the alignment of the new start address. llvm-svn: 81810	2009-09-14 23:39:10 +00:00
Dan Gohman	7190d48075	Factor out the code for checking that all indices in a getelementptr are within the notional bounds of the static type of the getelementptr (which is not the same as "inbounds") from GlobalOpt into a utility routine, and use it in ConstantFold.cpp to check whether there are any mis-behaved indices. llvm-svn: 81478	2009-09-10 23:37:55 +00:00
Nick Lewycky	dddf5dcdaf	Correctly handle the case where a comparison is created in one BasicBlock and used by a terminator in another. llvm-svn: 81437	2009-09-10 07:02:09 +00:00
Evan Cheng	1d9d4bdc99	Add malloc call utility functions. Patch by Victor Hernandez. llvm-svn: 81426	2009-09-10 04:36:43 +00:00
Dan Gohman	ec4557f324	Fix SplitCriticalEdge to properly update LCSSA form when splitting a loop exit edge -- new PHIs may be needed not only for the additional splits that are made to preserve LoopSimplify form, but also for the original split. Factor out the code that inserts new PHIs so that it can be used for both. Remove LoopRotation.cpp's code for manually updating LCSSA form, as it is now redundant. This fixes PR4934. llvm-svn: 81363	2009-09-09 18:18:18 +00:00
Mike Stump	deaf572ca8	Reflow comment. llvm-svn: 81361	2009-09-09 17:57:16 +00:00
Andreas Neustifter	4c0b2847ef	Preserve ProfileInfo. llvm-svn: 81360	2009-09-09 17:53:39 +00:00
Dan Gohman	c56af25c01	Fix an 80-column violation. llvm-svn: 81354	2009-09-09 17:17:19 +00:00
Chris Lattner	9ded9ac8af	revert r81335, which breaks the build. llvm-svn: 81347	2009-09-09 16:00:57 +00:00
Andreas Neustifter	0bd472dc33	Updated ProfileInfo to have clean seperation between different sentinels. llvm-svn: 81335	2009-09-09 12:48:26 +00:00
Owen Anderson	f0081db7e8	Fix PR4909, patch by Jakub Staszak. llvm-svn: 81250	2009-09-08 19:53:15 +00:00
Chris Lattner	9ce1781ef4	remove an extremely dubious instcombine transformation of extractelement(load). llvm-svn: 81239	2009-09-08 18:48:01 +00:00
Dan Gohman	3ddbc242fb	Re-apply r80926, with fixes: keep the domtree informed of new blocks that get created during loop unswitching, and fix SplitBlockPredecessors' LCSSA updating code to create new PHIs instead of trying to just move existing ones. Also, optimize Loop::verifyLoop, since it gets called a lot. Use searches on a sorted list of blocks instead of calling the "contains" function, as is done in other places in the Loop class, since "contains" does a linear search. Also, don't call verifyLoop from LoopSimplify or LCSSA, as the PassManager is already calling verifyLoop as part of LoopInfo's verifyAnalysis. llvm-svn: 81221	2009-09-08 15:45:00 +00:00
Chris Lattner	d1b21c6092	remove a turd llvm-svn: 81186	2009-09-08 03:47:41 +00:00
Chris Lattner	d3210e1a20	instcombine transforms vector loads that are only used by extractelement operations into a bitcast of the pointer, then a gep, then a scalar load. Disable this when the vector only has one element, because it leads to infinite loops in instcombine (PR4908). This transformation seems like a really bad idea to me, as it will likely disable CSE of vector load/stores etc and can be better done in the code generator when profitable. This goes all the way back to the first days of packed types, r25299 specifically. I'll let those people who care about the performance of vector code decide what to do with this. llvm-svn: 81185	2009-09-08 03:44:51 +00:00
Chris Lattner	f2ab40a46f	Fix PR4882, by making MemCpyOpt not dereference removed stores to get the context for the newly created operations. Patch by Jakub Staszak! llvm-svn: 81175	2009-09-08 00:27:14 +00:00
Dan Gohman	1b84908f92	Reappy r80998, now that the GlobalOpt bug that it exposed on MiniSAT is fixed. llvm-svn: 81172	2009-09-07 23:54:19 +00:00
Dan Gohman	161429fe7e	Don't commit stores with addresses that have indices that are not compile-time constant integers or that are out of bounds for their corresponding static array types. These can cause aliasing that GlobalOpt assumes won't happen. llvm-svn: 81165	2009-09-07 22:44:55 +00:00
Dan Gohman	82e747580f	Don't commit addresses of aggregate values. This avoids problems with an aggregate store overlapping a different aggregate store, despite the stores having distinct addresses. llvm-svn: 81164	2009-09-07 22:42:05 +00:00
Dan Gohman	beee35a277	Fix GlobalOpt to avoid committing a store if the address getelementptr is missing the inbounds flag. This is slightly conservative, but it avoids problems with two constants pointing to the same address but getting distinct entries in the Memory DenseMap. llvm-svn: 81163	2009-09-07 22:40:13 +00:00
Dan Gohman	19244eaa4a	Preserve the InBounds flag when evaluating a getelementptr instruction into a getelementptr ConstantExpr. llvm-svn: 81162	2009-09-07 22:34:43 +00:00
Dan Gohman	f7f3fb1133	Simplify this code by using hasDefinitiveInitializer(). llvm-svn: 81161	2009-09-07 22:31:26 +00:00
Eric Christopher	66d8555f7e	Fix comment. llvm-svn: 81138	2009-09-06 22:20:54 +00:00
Duncan Sands	89720bbd11	Remove some not-really-used variables, as warned about by icc (#593, partial). Patch by Erick Tryzelaar. llvm-svn: 81115	2009-09-06 12:41:19 +00:00
Daniel Dunbar	86c6a6ef0f	Fix a possible crash call setIsInBounds. - I think there are more instances of this, but I think they are fixed in Dan's incoming patch. This one was preventing me from doing a bugpoint reduction though. llvm-svn: 81103	2009-09-06 02:31:36 +00:00
Evan Cheng	904199547b	Revert r80926. It causes loop unswitch assertion and slow down some JIT tests significantly. llvm-svn: 81101	2009-09-06 02:26:10 +00:00
Daniel Dunbar	10ea8bb8e0	Revert "Include optional subclass flags, such as inbounds, nsw, etc., ...", this breaks MiniSAT on x86_64. llvm-svn: 81098	2009-09-06 00:11:24 +00:00
Andreas Neustifter	18156bd75c	Converted MaximumSpanningTree algorithm to a generic template, this could go into llvm/ADT. llvm-svn: 81001	2009-09-04 12:34:44 +00:00
Dan Gohman	0c2477c26b	Include optional subclass flags, such as inbounds, nsw, etc., in the Constant uniquing tables. This allows distinct ConstantExpr objects with the same operation and different flags. Even though a ConstantExpr "a + b" is either always overflowing or never overflowing (due to being a ConstantExpr), it's still necessary to be able to represent it both with and without overflow flags at the same time within the IR, because the safety of the flag may depend on the context of the use. If the constant really does overflow, it wouldn't ever be safe to use with the flag set, however the use may be in code that is never actually executed. This also makes it possible to merge all the flags tests into a single test. llvm-svn: 80998	2009-09-04 12:08:11 +00:00
Dan Gohman	4c1bdcf5d7	Add a verifyAnalysis to LoopInfo, LoopSimplify, and LCSSA form that verify that these passes are properly preserved. Fix several transformation passes that claimed to preserve LoopSimplify form but weren't. llvm-svn: 80926	2009-09-03 16:31:42 +00:00
Dan Gohman	22571485b3	Change PHINode::hasConstantValue to have a DominatorTree argument instead of a bool argument, and to do the dominator check itself. This makes it eaiser to use when DominatorTree information is available. llvm-svn: 80920	2009-09-03 15:34:35 +00:00
Duncan Sands	0edc7100ba	Keep track of how many memmove calls were turned into memcpy calls. llvm-svn: 80915	2009-09-03 13:37:16 +00:00
Andreas Neustifter	7e86c3856b	Code Cleanup. Removed inverted flag form MaximumSpanningTree, also do not handle so much information to MaximumSpanningTree. llvm-svn: 80911	2009-09-03 08:52:52 +00:00
Nick Lewycky	88214fbd12	Remove VISIBILITY_HIDDEN from this file. llvm-svn: 80903	2009-09-03 06:43:15 +00:00
Chris Lattner	27266f164f	In C++, code is not allowed to call main. In C it is, this simplifylibcalls optimization is thus valid for C++ but not C. It's not important enough to worry about for C++ apps, so just remove it. rdar://7191924 llvm-svn: 80887	2009-09-03 05:19:59 +00:00
Gabor Greif	2d60e1ec0c	back out my recent commit (r80858), it seems to break self-hosting buildbot's stage 2 configure llvm-svn: 80871	2009-09-03 02:02:59 +00:00
Gabor Greif	14dfba6d66	re-commit r66920 (which has been backed out in r66953) I may have more luck this time. I'll back out if needed... llvm-svn: 80858	2009-09-03 00:18:58 +00:00
Andreas Neustifter	ae866b0c66	Sort edges in MaximumSpanningTree more stable in case of equal weight. (See http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20090824/085890.html) llvm-svn: 80789	2009-09-02 14:03:11 +00:00
Andreas Neustifter	964fa2bdac	Changed set of BlocksToInstrument to set of InsertedBlocks that do not have to be instrumented. llvm-svn: 80788	2009-09-02 13:59:05 +00:00
Andreas Neustifter	4469c164d0	Code cleanups and added comments. llvm-svn: 80781	2009-09-02 12:38:39 +00:00
Chris Lattner	4916267c97	fix PR4815: some cases where DeleteDeadInstruction can delete the instruction BBI points to. llvm-svn: 80768	2009-09-02 06:31:02 +00:00
Chris Lattner	09a79dcfdf	clean up this code a bit. llvm-svn: 80767	2009-09-02 06:15:37 +00:00
Chris Lattner	2dd09dbdf7	eliminate VISIBILITY_HIDDEN from Transforms/Scalar. PR4861 llvm-svn: 80766	2009-09-02 06:11:42 +00:00
Chris Lattner	64b5842986	fix PR4837, some bugs folding vector compares. These return a vector of i1, not i1 itself. llvm-svn: 80761	2009-09-02 05:12:37 +00:00
Andreas Neustifter	759094e323	OptimalEdgeProfiling: Creation of profiles. This adds the instrumentation and runtime part of OptimalEdgeProfiling. llvm-svn: 80712	2009-09-01 19:03:44 +00:00
Chris Lattner	9b463729d7	remove CallGraphNode::replaceCallSite, it is redundant with other APIs. llvm-svn: 80708	2009-09-01 18:52:39 +00:00
Chris Lattner	f61b0fb5d0	cleanup/simplify llvm-svn: 80706	2009-09-01 18:50:55 +00:00
Chris Lattner	8900f3ec57	remove a bunch of explicit code previously needed to update the callgraph. This is now dead because RAUW does the job. llvm-svn: 80703	2009-09-01 18:44:06 +00:00
Chris Lattner	1145e33bc6	enhance memcpy opt to turn memmoves into memcpy when the src/dest don't alias. Remove an old and poorly reduced testcase that fails with this transform for reasons unrelated to the original test. llvm-svn: 80693	2009-09-01 17:56:32 +00:00
Chris Lattner	b5557a7b42	random code cleanups, no functionality change. llvm-svn: 80682	2009-09-01 17:09:55 +00:00
Ted Kremenek	1543d133db	Update CMake files. llvm-svn: 80680	2009-09-01 17:01:02 +00:00
Andreas Neustifter	eb5a9d34d6	Preparation for Optimal Edge Profiling: Add statistics for regular edge profiling, this enables the comparation of the number of edges inserted by regular and optimal edge profiling. llvm-svn: 80668	2009-09-01 10:08:39 +00:00
Chris Lattner	063d06527e	Change CallGraphNode to maintain it's Function as an AssertingVH for sanity. This didn't turn up any bugs. Change CallGraphNode to maintain its "callsite" information in the call edges list as a WeakVH instead of as an instruction*. This fixes a broad class of dangling pointer bugs, and makes CallGraph have a number of useful invariants again. This fixes the class of problem indicated by PR4029 and PR3601. llvm-svn: 80663	2009-09-01 06:31:31 +00:00
Chris Lattner	ff5f1e4d70	fix some cases where instcombine would change hte IR but not return true from runOnFunction llvm-svn: 80562	2009-08-31 06:57:37 +00:00
Chris Lattner	9e50747958	comment and simplify some code. llvm-svn: 80540	2009-08-31 05:34:32 +00:00
Chris Lattner	70ebbc59f3	add -debug output llvm-svn: 80539	2009-08-31 05:22:48 +00:00
Chris Lattner	19dd315e67	improve -debug output, so that -debug is more likely to print when instcombine is changing stuff. llvm-svn: 80538	2009-08-31 05:17:58 +00:00
Chris Lattner	4e3e930743	fix a bug I introduced with my 'instcombine builder' refactoring changes: SimplifyDemandedBits can't use the builder yet because it has the wrong insertion point. This fixes a crash building MultiSource/Benchmarks/PAQ8p llvm-svn: 80537	2009-08-31 04:36:22 +00:00
Chris Lattner	2f2110affa	simplify some code by making the SCCNodes set contain Function's instead of CallGraphNode's. This also papers over a callgraph problem where a pass (in this case, MemCpyOpt) introduces a new function into the module (llvm.memset.i64) but doesn't add it to the call graph (nor should it, since it is a function pass). While it might be a good idea for MemCpyOpt to not synthesize functions in a runOnFunction(), there is no need for FunctionAttrs to be boneheaded, so fix it there. This fixes an assertion building 176.gcc. llvm-svn: 80535	2009-08-31 04:09:04 +00:00
Chris Lattner	081375bb08	Fix PR4834, a tricky case where the inliner would resolve an indirect function pointer, inline it, then go to delete the body. The problem is that the callgraph had other references to the function, though the inliner had no way to know it, so we got a dangling pointer and an invalid iterator out of the deal. The fix to this is pretty simple: stop the inliner from deleting the function by knowing that there are references to it. Do this by making CallGraphNodes contain a refcount. This requires moving deletion of available_externally functions to the module-level cleanup sweep where it belongs. llvm-svn: 80533	2009-08-31 03:15:49 +00:00
Chris Lattner	305b115a87	Fix some nasty callgraph dangling pointer problems in argpromotion and structretpromote. Basically, when replacing a function, they used the 'changeFunction' api which changes the entry in the function map (and steals/reuses the callgraph node). This has some interesting effects: first, the problem is that it doesn't update the "callee" edges in any callees of the function in the call graph. Second, this covers for a major problem in all the CGSCC pass stuff, which is that it is completely broken when functions are deleted if they don't reuse a CGN. (there is a cute little fixme about this though :). This patch changes the protocol that CGSCC passes must obey: now the CGSCC pass manager copies the SCC and preincrements its iterator to avoid passes invalidating it. This allows CGSCC passes to mutate the current SCC. However multiple passes may be run on that SCC, so if passes do this, they are now required to update the SCC to be current when they return. Other less interesting parts of this patch are that it makes passes update the CG more directly, eliminates changeFunction, and requires clients of replaceCallSite to specify the new callee CGN if they are changing it. llvm-svn: 80527	2009-08-31 00:19:58 +00:00
Chris Lattner	73913f4cd3	Fix PR4748: don't fold gep(bitcast(x)) into bitcast(gep) when x is itself a bitcast. Since we have gep(bitcast(bitcast(y))) in this case, just wait for the two bitcasts to get zapped. This prevents instcombine from confusing some aliasing stuff, and allows it to directly eliminate the load in the testcase. llvm-svn: 80508	2009-08-30 20:38:21 +00:00
Chris Lattner	c2f2cf896e	misc cleanup llvm-svn: 80507	2009-08-30 20:36:46 +00:00
Chris Lattner	a3e620caba	add getPointerAddressSpace() to GEP instruction, use the method in a few scalar xforms to simplify things. llvm-svn: 80506	2009-08-30 20:06:40 +00:00
Chris Lattner	c856539edf	eliminate InsertCastBefore, use the builder instead. llvm-svn: 80505	2009-08-30 20:01:10 +00:00
Chris Lattner	606da5fed8	eliminate InsertBitCastBefore, just use the builder instead. llvm-svn: 80504	2009-08-30 19:47:22 +00:00
Chris Lattner	5966341a2e	convert a bunch more calls to InsertNewInstBefore to use the new Instcombine builder. llvm-svn: 80501	2009-08-30 18:50:58 +00:00
Chris Lattner	8326d529da	fix typo llvm-svn: 80500	2009-08-30 17:53:59 +00:00
Chris Lattner	022a582de2	give instcombine a custom IRBuilder that adds new instructions to the workslist and is set to insert new instructions before the current one. Convert a bunch of stuff that used to call InsertNewInstBefore over to use it, greatly simplifying code and making it more natural. There is still a lot more to go, but this is a good start. llvm-svn: 80492	2009-08-30 07:44:24 +00:00
Chris Lattner	a0c89ee1da	add a new InstCombineWorklist::AddValue method that works even if the operand is not an instruction. Simplify most uses of AddOperandsToWorkList to use AddValue and inline it into the one remaining callsite. llvm-svn: 80488	2009-08-30 06:27:41 +00:00
Chris Lattner	bacd05c2eb	move AddUsersToWorkList to the worklist processing class, make the argument stronger typed. llvm-svn: 80487	2009-08-30 06:22:51 +00:00
Chris Lattner	795bfdbb55	rename AddUsesToWorkList -> AddOperandsToWorkList. The former looks too much like AddUsersToWorkList and keeps confusing me. Remove AddSoonDeadInstToWorklist and change its two callers to do the same thing in a simpler way. llvm-svn: 80486	2009-08-30 06:20:05 +00:00
Chris Lattner	905976b1db	inline the trivial AddToWorkList/RemoveFromWorkList methods into their callers. simplify ReplaceInstUsesWith. Make EraseInstFromFunction only add operands to the worklist if there aren't too many of them (this was a scalability win for crazy programs that was only infrequently enforced). Switch more code to using EraseInstFromFunction instead of duplicating it inline. Change some fcmp/icmp optimizations to modify fcmp/icmp in place instead of creating a new one and deleting the old one just to change the predicate. llvm-svn: 80483	2009-08-30 06:13:40 +00:00
Chris Lattner	93ad6170fd	fix a bug I introduced in r80478 found by the build bot. llvm-svn: 80482	2009-08-30 05:56:44 +00:00
Chris Lattner	97fd3599e1	refactor instcombine's worklist processing stuff out to its own class. llvm-svn: 80481	2009-08-30 05:55:36 +00:00
Chris Lattner	b2995e1eb1	more cleanups: remove some redundant code, and simplify some other places. llvm-svn: 80478	2009-08-30 05:30:55 +00:00
Chris Lattner	06c687b59e	eliminate the temporary SrcGEPOperands smallvector. llvm-svn: 80477	2009-08-30 05:08:50 +00:00
Chris Lattner	e26bf17423	simplify/detangle some control flow. llvm-svn: 80476	2009-08-30 05:00:50 +00:00
Chris Lattner	d7b6e913fe	simplify and cleanup some code, remove some code that just does constant folding of gep's: this is already handled in a more general way. No functionality change. llvm-svn: 80475	2009-08-30 04:49:01 +00:00
Dan Gohman	0dfe73ac9e	Remove an unnecessary Context argument. llvm-svn: 80454	2009-08-29 23:39:38 +00:00
Benjamin Kramer	b83f691931	Inline empty destructor. llvm-svn: 80431	2009-08-29 13:38:21 +00:00
Bill Wendling	06a6057bbe	Fix warning about non-virtual destructor. llvm-svn: 80429	2009-08-29 12:31:38 +00:00
Devang Patel	80ae34974b	Reapply 79977. Use MDNodes to encode debug info in llvm IR. llvm-svn: 80406	2009-08-28 23:24:31 +00:00
Andreas Neustifter	991beb9aaf	Preparation for Optimal Edge Profiling: This implements the maximum spanning tree algorithm on CFGs according to weights given by the ProfileEstimator. This is then used to implement Optimal Edge Profiling. llvm-svn: 80358	2009-08-28 11:28:24 +00:00
Chris Lattner	0e8901803c	finish a half formed thought :) llvm-svn: 80334	2009-08-28 04:48:54 +00:00
Chris Lattner	bda82c20f3	Fix PR3913, patch by Jakub Staszak! llvm-svn: 80327	2009-08-28 00:43:14 +00:00
Chris Lattner	d3374e8dfd	Implement a new optimization in the inliner: if inlining multiple calls into a function and if the calls bring in arrays, try to merge them together to reduce stack size. For example, in the testcase we'd previously end up with 4 allocas, now we end up with 2 allocas. As described in the comments, this is not really the ideal solution to this problem, but it is surprisingly effective. For example, on 176.gcc, we end up eliminating 67 arrays at "gccas" time and another 24 at "llvm-ld" time. One piece of concern that I didn't look into: at -O0 -g with forced inlining this will almost certainly result in worse debug info. I think this is acceptable though given that this is a case of "debugging optimized code", and we don't want debug info to prevent the optimizer from doing things anyway. llvm-svn: 80215	2009-08-27 06:29:33 +00:00
Chris Lattner	1ce61b82ac	unbreak the build, yay for symlinks + makefiles. :( llvm-svn: 80205	2009-08-27 04:43:05 +00:00
Chris Lattner	b9d0a961f9	reduce header #include'age llvm-svn: 80204	2009-08-27 04:32:07 +00:00
Chris Lattner	b1cba3f91e	enhance InlineFunction to be able to optionally return a the list of static allocas that it inlined. llvm-svn: 80203	2009-08-27 04:20:52 +00:00
Chris Lattner	d84dbb3443	smallvectorize the list of returns built by CloneAndPruneFunctionInto. llvm-svn: 80202	2009-08-27 04:02:30 +00:00
Chris Lattner	9d0235dc6b	remove CloneTrace, which appears to be dead since 2004. llvm-svn: 80201	2009-08-27 03:56:43 +00:00
Chris Lattner	5eef6ad6a9	reduce inlining factor some stuff out to a static helper function, and other code cleanups. No functionality change. llvm-svn: 80199	2009-08-27 03:51:50 +00:00
Owen Anderson	109ca5a14a	Make this into a static method. llvm-svn: 80170	2009-08-26 22:55:11 +00:00
Devang Patel	f08e35d9dc	Revert 79977. It causes llvm-gcc bootstrap failures on some platforms. llvm-svn: 80073	2009-08-26 05:01:18 +00:00
Dan Gohman	3b1938dda4	Remove unused variables. llvm-svn: 80058	2009-08-26 00:13:22 +00:00
Dan Gohman	ad1f0a1101	Eliminate the unused Context argument on one of the ICmpInst and FCmpInst constructors. llvm-svn: 80049	2009-08-25 23:17:54 +00:00
Dan Gohman	c8a27f2a5c	Rename Instruction::isIdenticalTo to Instruction::isIdenticalToWhenDefined, and introduce a new Instruction::isIdenticalTo which tests for full identity, including the SubclassOptionalData flags. Also, fix the Instruction::clone implementations to preserve the SubclassOptionalData flags. Finally, teach several optimizations how to handle SubclassOptionalData correctly, given these changes. This fixes the counterintuitive behavior of isIdenticalTo not comparing the full value, and clone not returning an identical clone, as well as some subtle bugs that could be caused by these. Thanks to Nick Lewycky for reporting this, and for an initial patch! llvm-svn: 80038	2009-08-25 22:11:20 +00:00
Dan Gohman	337d56110e	Special-case static allocas in IndVarSimplify's loop invariant sinking code, since they are special. If the loop preheader happens to be the entry block of a function, don't sink static allocas out of it. This fixes PR4775. llvm-svn: 80010	2009-08-25 17:42:10 +00:00
Owen Anderson	4e9ac2a34b	Comment-ify. llvm-svn: 80009	2009-08-25 17:42:07 +00:00
Owen Anderson	f18cae4979	Switch to SmallVector. llvm-svn: 80007	2009-08-25 17:35:37 +00:00
Owen Anderson	5e39d1deec	Pull out this predicate loop into a helper function. llvm-svn: 80006	2009-08-25 17:26:32 +00:00
Devang Patel	02aac922b4	Update DebugInfo interface to use metadata, instead of special named llvm.dbg.... global variables, to encode debugging information in llvm IR. This is mostly a mechanical change that tests metadata support very well. This change speeds up llvm-gcc by more then 6% at "-O0 -g" (measured by compiling InstructionCombining.cpp!) llvm-svn: 79977	2009-08-25 05:24:07 +00:00
Dale Johannesen	c221a55f58	Allow multiple occurrences of -inline-threshold on the command line. This gives llvm-gcc developers a way to control inlining (documented as "not intended for end users"). llvm-svn: 79966	2009-08-25 01:13:58 +00:00
Owen Anderson	34e6148dc8	Handle a corner case when extracing code regions where one of the immediate successor of an extracted block contains a PHI using a value defined in the extracted region. With this patch, the partial inliner now passes MultiSource/Applications. llvm-svn: 79963	2009-08-25 00:54:39 +00:00
Owen Anderson	b4aa5b1511	When extracting SEME regions of code, the extractor needs to update the dominator tree for split return blocks. llvm-svn: 79957	2009-08-24 23:32:14 +00:00
Chris Lattner	06fa176862	prune the #includes in raw_ostream.h by moving a member out of line. ftostr is not particularly speedy, so that method is presumably not perf sensitive. llvm-svn: 79885	2009-08-24 03:52:50 +00:00
Benjamin Kramer	1a25d733f9	Kill off more cerr/cout uses and prune includes a bit. llvm-svn: 79852	2009-08-23 11:37:21 +00:00
Daniel Dunbar	5e0a58bef4	Fix -Asserts warnings. llvm-svn: 79849	2009-08-23 10:29:55 +00:00
Chris Lattner	4883d90396	convert LoopInfo.h and GraphWriter.h to use raw_ostream llvm-svn: 79836	2009-08-23 07:19:13 +00:00
Chris Lattner	317dbbcfb1	eliminate uses of cerr() llvm-svn: 79834	2009-08-23 07:05:07 +00:00
Chris Lattner	4dc3edde9f	remove a few DOUTs here and there. llvm-svn: 79832	2009-08-23 06:35:02 +00:00
Chris Lattner	1362602eb2	Change Pass::print to take a raw ostream instead of std::ostream, update all code that this affects. llvm-svn: 79830	2009-08-23 06:03:38 +00:00
Chris Lattner	b1d782bec9	eliminate the std::ostream form of WriteAsOperand and update clients. This also updates dominator related stuff. llvm-svn: 79825	2009-08-23 05:17:37 +00:00
Chris Lattner	3924bb5792	remove the std::ostream version of module and type printing. llvm-svn: 79823	2009-08-23 04:52:46 +00:00
Chris Lattner	b25de3ff60	eliminate the "Value" printing methods that print to a std::ostream. This required converting a bunch of stuff off DOUT and other cleanups. llvm-svn: 79819	2009-08-23 04:37:46 +00:00
Dan Gohman	16f5415f5b	Rename hasNoUnsignedOverflow and hasNoSignedOverflow to hasNoUnsignedWrap and hasNoSignedWrap, for consistency with the nuw and nsw properties. llvm-svn: 79539	2009-08-20 17:11:38 +00:00
Dan Gohman	7167f42769	Fix a few places to check if TargetData is available before using it. llvm-svn: 79493	2009-08-19 23:38:22 +00:00
Dan Gohman	915302c605	Make SROA and PredicateSimplifier cope if TargetData is not available. This is very conservative for now. llvm-svn: 79442	2009-08-19 18:22:18 +00:00
Dan Gohman	5d5bc6d000	Use hasDefinitiveInitializer() instead of testing the same thing by hand, and fix a few places that were using hasInitializer() that appear to depend on the initializer value. llvm-svn: 79441	2009-08-19 18:20:44 +00:00
Nick Lewycky	cbfe9b195c	Fix up PHI nodes correctly in the presence of unreachable BBs, part two. Also delete a newed pointer, and improve readability a little bit. llvm-svn: 79411	2009-08-19 07:16:57 +00:00
Nick Lewycky	9ddc52d5b8	Be more careful when modifying PHI nodes. Patch by Andre Tavares. llvm-svn: 79407	2009-08-19 06:24:33 +00:00
Dan Gohman	dea2358c68	Fix SimplifyLibcalls and ValueTracking to check mayBeOverridden before performing optimizations based on constant string values. llvm-svn: 79384	2009-08-19 00:11:12 +00:00
Dan Gohman	82ac81b1cc	Fix a bug that caused globalopt to miscompile tramp3d: don't miss unruly indices for arrays that are members of structs. llvm-svn: 79337	2009-08-18 14:58:19 +00:00
Dan Gohman	10f1471e2f	Make TargetData optional in MemCpyOptimizer. llvm-svn: 79306	2009-08-18 01:17:52 +00:00
Dan Gohman	9f2b3db428	Make TargetData optional in SimplifyLibCalls. llvm-svn: 79298	2009-08-18 00:48:13 +00:00
Anton Korobeynikov	90e17e787f	The attached patches attempt to fix cross builds. For example, if you try to use i686-darwin to build for arm-eabi, you'll quickly run into several false assumptions that the target OS must be the same as the host OS. These patches split $(OS) into $(HOST_OS) and $(TARGET_OS) to help builds like "make check" and the test-suite able to cross compile. Along the way a target of *-unknown-eabi is defined as "Freestanding" so that TARGET_OS checks have something to work with. Patch by Sandeep Patel! llvm-svn: 79296	2009-08-18 00:40:33 +00:00
Dan Gohman	7cb92a1e3d	Update comments to new-style syntax. llvm-svn: 79263	2009-08-17 18:45:31 +00:00
Nick Lewycky	4c737147e1	Don't crash on critical edge. Patch by Andre Tavares. llvm-svn: 79252	2009-08-17 17:00:57 +00:00
Dan Gohman	8dd69f88ea	Fix debug output to include a newline after printing a Value, now that Value's operator<< doesn't include one. llvm-svn: 79240	2009-08-17 15:25:05 +00:00
Duncan Sands	c4ce58d8fe	Don't access the first element of a potentially empty vector (&Formals[0]). With this change llvm-gcc builds with expensive checking enabled for C, C++ and Fortran. While there, change a std::vector into a SmallVector. This is partly gratuitous, but mostly because not all STL vector implementations define the data method (and it should be faster). llvm-svn: 79237	2009-08-17 14:33:27 +00:00
Nick Lewycky	aa464002f0	Don't crash trying to promote VLAs. llvm-svn: 79226	2009-08-17 05:37:31 +00:00
Eli Friedman	d56fca4708	Fix for PR3016: detect the tricky case, where there are unfoldable references to a PHI node in the block being folded, and disable the transformation in that case. The correct transformation of such PHI nodes depends on whether BB dominates Succ, and dominance is expensive to compute here. (Alternatively, it's possible to check whether any uses are live, but that's also essentially a dominance calculation. Another alternative is to use reg2mem, but it probably isn't a good idea to use that in simplifycfg.) Also, remove some incorrect code from CanPropagatePredecessorsForPHIs which is made unnecessary with this patch: it didn't consider the case where a PHI node in BB has multiple uses. llvm-svn: 79174	2009-08-16 04:23:49 +00:00
Benjamin Kramer	693a9c57a6	Don't try to get the context from an erased Instruction. llvm-svn: 79134	2009-08-15 21:07:49 +00:00
Nick Lewycky	dd0e74ceee	SSI construction should just go ahead and ignore instructions in unreachable blocks. llvm-svn: 79132	2009-08-15 20:12:18 +00:00
Dan Gohman	dbeb33936f	Make TargetData optional in GlobalOpt and ArgumentPromotion. llvm-svn: 78967	2009-08-14 00:11:03 +00:00
Owen Anderson	a42ac6953b	Actually privatize a IntegerTypes, and fix a few bugs exposed by this. llvm-svn: 78955	2009-08-13 23:27:32 +00:00
Owen Anderson	55f1c09e31	Push LLVMContexts through the IntegerType APIs. llvm-svn: 78948	2009-08-13 21:58:54 +00:00
Mon P Wang	a95379d165	When InstCombine simplifies a load -> extract element to gep -> load, place the new load by the old load instead of by the extract element because a store could have occurred between the load and extract element. llvm-svn: 78891	2009-08-13 05:12:13 +00:00
Andreas Bolka	5c2764b3e9	Simplify conditional. llvm-svn: 78889	2009-08-13 03:05:20 +00:00
Andreas Bolka	aef432505b	Simplify and reduce indentation using early exits. No intended functionality change. llvm-svn: 78888	2009-08-13 03:00:57 +00:00
Andreas Bolka	438ba80afa	DEBUGify some DOUTs. llvm-svn: 78887	2009-08-13 02:45:03 +00:00
Andreas Bolka	177a2f5313	Prune trailing whitespace. llvm-svn: 78886	2009-08-13 02:40:50 +00:00
Dan Gohman	4ac2f639cd	Transform -X/C to X/-C, implementing a README.txt entry. llvm-svn: 78812	2009-08-12 16:37:02 +00:00
Dan Gohman	908da3d97e	Optimize (x/C)*C to x if the division is exact. llvm-svn: 78811	2009-08-12 16:33:09 +00:00
Dan Gohman	43103abef0	Update instcombine's debug output to account for Value*'s operator<< not appending its own newline. llvm-svn: 78810	2009-08-12 16:28:31 +00:00
Dan Gohman	5476cfdb15	Remove a bunch more now-unnecessary Context arguments. llvm-svn: 78809	2009-08-12 16:23:25 +00:00
Dan Gohman	6b490ce4c7	Eliminate a bunch of now unnecessary explicit Context variables. llvm-svn: 78808	2009-08-12 16:04:34 +00:00
Owen Anderson	117c9e8497	Add contexts to some of the MVT APIs. No functionality change yet, just the infrastructure work needed to get the contexts to where they need to be first. llvm-svn: 78759	2009-08-12 00:36:31 +00:00
Dan Gohman	dbae4db67a	Optimize exact sdiv by a constant power of 2 to ashr. llvm-svn: 78714	2009-08-11 20:47:47 +00:00
Dan Gohman	38484ceec9	Remove unnecessary casts. llvm-svn: 78664	2009-08-11 15:15:10 +00:00
Devang Patel	c5aa8c6d29	Remove dead metadata. llvm-svn: 78651	2009-08-11 06:31:57 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Daniel Dunbar	3b5008e23a	More ProfileInfo improvements. - Part of optimal static profiling patch sequence by Andreas Neustifter. - Store edge, block, and function information separately for each functions (instead of in one giant map). - Return frequencies as double instead of int, and use a sentinel value for missing information. llvm-svn: 78477	2009-08-08 17:43:09 +00:00
Devang Patel	b1106fbdbc	Fix dom frontier update. This fixes PR4667. Patch by Jakub Staszak. llvm-svn: 78388	2009-08-07 17:16:44 +00:00
Dan Gohman	a6d0afcb74	Fix a bunch of namespace pollution. llvm-svn: 78363	2009-08-07 01:32:21 +00:00
Devang Patel	ae5ead6df4	Use DebugInfoFinder. llvm-svn: 78333	2009-08-06 20:53:06 +00:00
Owen Anderson	03cb69fbd1	Privatize the StructType table, which unfortunately involves routing contexts through a number of APIs. llvm-svn: 78258	2009-08-05 23:16:16 +00:00
Dan Gohman	298bce2aa9	Check for !isa<Constant> instead of isa<Instruction>. This matches what the comment says, and it avoids spurious BitCast instructions for Argument values. llvm-svn: 78121	2009-08-04 23:23:56 +00:00
Dan Gohman	f011f5a8a2	Add a new Constant::getIntegerValue helper function, and convert a few places in InstCombine to use it, to fix problems handling pointer types. This fixes the recent llvm-gcc bootstrap error. llvm-svn: 78005	2009-08-03 22:07:33 +00:00
Eli Friedman	cfd3bbe643	Make SimplifyDemandedUseBits generate vector constants where appropriate. Patch per report on llvmdev. No testcase because the original report didn't come with a testcase, and I can't come up with a case that actually fails. llvm-svn: 77986	2009-08-03 19:15:42 +00:00
Owen Anderson	5a1acd9912	Move a few more APIs back to 2.5 forms. The only remaining ones left to change back are metadata related, which I'm waiting on to avoid conflicting with Devang. llvm-svn: 77721	2009-07-31 20:28:14 +00:00
Dan Gohman	ef3ef7f645	Fix GVN's debug output, now that operator<< on Value* doesn't print a trailing newline. llvm-svn: 77719	2009-07-31 20:24:18 +00:00
Bill Wendling	2602bb4cdc	- Convert the rest of the DOUTs to DEBUG+errs(). - One formatting change. No intended functionality change. llvm-svn: 77717	2009-07-31 19:52:24 +00:00
Eli Friedman	ca9a4f1045	PR4662: Fix a crash introduced by the recent LLVMContext changes. llvm-svn: 77716	2009-07-31 19:36:47 +00:00
Owen Anderson	23a204d91b	Move getTrue() and getFalse() to 2.5-like APIs. llvm-svn: 77685	2009-07-31 17:39:07 +00:00
Owen Anderson	b292b8ce70	Move more code back to 2.5 APIs. llvm-svn: 77635	2009-07-30 23:03:37 +00:00
Daniel Dunbar	132f78395a	Twines: Don't allow implicit conversion from integers, this is too tricky. llvm-svn: 77605	2009-07-30 17:37:43 +00:00
Daniel Dunbar	6afdc5e694	Switch obvious clients to Twine instead of utostr (when they were already using a Twine, e.g., for names). - I am a little ambivalent about this; we don't want the string conversion of utostr, but using overload '+' mixed with string and integer arguments is sketchy. On the other hand, this particular usage is something of an idiom. llvm-svn: 77579	2009-07-30 04:20:37 +00:00
Douglas Gregor	47d02732e0	Eliminate a few unused-variable warnings llvm-svn: 77519	2009-07-29 22:41:10 +00:00
Owen Anderson	4056ca9568	Move types back to the 2.5 API. llvm-svn: 77516	2009-07-29 22:17:13 +00:00
Daniel Dunbar	98ddd164d8	Fix PR4645 which was fallout from the fix for PR4641. - Call RAUW to delete all instructions (this is a patch from Nick Lewycky). llvm-svn: 77512	2009-07-29 22:00:43 +00:00
Benjamin Kramer	21d75078b5	Remove now unused Context variables. llvm-svn: 77495	2009-07-29 19:14:17 +00:00
Owen Anderson	487375e9a2	Move ConstantExpr to 2.5 API. llvm-svn: 77494	2009-07-29 18:55:55 +00:00
Nick Lewycky	f82326b984	Bulk erasing instructions without RAUWing them is unsafe. Instead, break them into a new BB that has no predecessors. llvm-svn: 77433	2009-07-29 05:17:50 +00:00
Devang Patel	a4f43fb5dd	Rename MDNode.h header. It defines MDnode and other metadata classes. New name is Metadata.h. llvm-svn: 77370	2009-07-28 21:49:47 +00:00
Owen Anderson	4aa3295a65	Return ConstantVector to 2.5 API. llvm-svn: 77366	2009-07-28 21:19:26 +00:00
Owen Anderson	c2c7932c64	Change ConstantArray to 2.5 API. llvm-svn: 77347	2009-07-28 18:32:17 +00:00
Dan Gohman	31a9b9880b	Teach instcombine to respect and preserve inbounds. Add inbounds to a few tests where it is required for the expected transformation. llvm-svn: 77290	2009-07-28 01:40:03 +00:00
Mike Stump	38a579fe5a	Fix a small little typo. llvm-svn: 77289	2009-07-28 01:35:34 +00:00
Dan Gohman	9ba43abc70	Replace dyn_castGetElementPtr with dyn_cast<GEPOperator>. llvm-svn: 77286	2009-07-28 00:37:50 +00:00
Dan Gohman	a3dcff5900	Grab the LLVMContext and parent Module of SI ahead of the point where SI can get deleted. This fixes a use of free'd memory. This fixes Externals/Povray. llvm-svn: 77285	2009-07-28 00:37:06 +00:00
Mike Stump	4798763e14	Fix a release-asserts warning. Debug functions should be marked used, if there are no other uses. If people don't need this routine anymore, if should be deleted. llvm-svn: 77274	2009-07-27 23:33:34 +00:00
Mike Stump	d934cc06c6	Avoid build warnings. llvm-svn: 77271	2009-07-27 23:14:11 +00:00
Owen Anderson	45308b578b	Move ConstantStruct back to 2.5 API. llvm-svn: 77266	2009-07-27 22:29:26 +00:00
Owen Anderson	69c464dec4	Move ConstantFP construction back to the 2.5-ish API. llvm-svn: 77247	2009-07-27 20:59:43 +00:00
Devang Patel	de6f46c32e	Do not seed mstadata into the value map. llvm-svn: 77208	2009-07-27 17:17:04 +00:00
Daniel Dunbar	6115b39ffd	Remove Value::getName{Start,End}, the last of the old Name APIs. llvm-svn: 77152	2009-07-26 09:48:23 +00:00
Daniel Dunbar	ca414c7cae	Remove Value::getNameLen llvm-svn: 77148	2009-07-26 08:34:35 +00:00
Daniel Dunbar	9813b0b025	Eliminate some uses of DOUT, cerr, and getNameStart(). llvm-svn: 77145	2009-07-26 07:49:05 +00:00
Daniel Dunbar	e03eecb75f	Remove Value::{isName, getNameRef}. Also, change MDString to use a StringRef. llvm-svn: 77098	2009-07-25 23:55:21 +00:00
Daniel Dunbar	4975db6276	Initial update to VMCore to use Twines for string arguments. - The only meat here is in Value.{h,cpp} the rest is essential 'const std::string &' -> 'const Twine &'. llvm-svn: 77048	2009-07-25 04:41:11 +00:00
Eric Christopher	53e1cd7254	Fix 80-col violations. llvm-svn: 77045	2009-07-25 02:45:27 +00:00
Eric Christopher	c974225976	Move ExtractElementInst to ::Create instead of new. Update all uses. llvm-svn: 77044	2009-07-25 02:28:41 +00:00
Dan Gohman	1ddf98ad8e	Convert a few more things to use raw_ostream. llvm-svn: 77039	2009-07-25 01:43:01 +00:00
Dan Gohman	29f2baf3b3	Convert a few more uses of llvm/Support/Streams.h to raw_ostream. llvm-svn: 77033	2009-07-25 01:13:51 +00:00
Dan Gohman	43d19d61d4	Make AliasAnalysis and related classes use getAnalysisIfAvailable<TargetData>(). llvm-svn: 77028	2009-07-25 00:48:42 +00:00
Daniel Dunbar	0dd5e1ed39	More migration to raw_ostream, the water has dried up around the iostream hole. - Some clients which used DOUT have moved to DEBUG. We are deprecating the "magic" DOUT behavior which avoided calling printing functions when the statement was disabled. In addition to being unnecessary magic, it had the downside of leaving code in -Asserts builds, and of hiding potentially unnecessary computations. llvm-svn: 77019	2009-07-25 00:23:56 +00:00
Owen Anderson	edb4a70325	Revert the ConstantInt constructors back to their 2.5 forms where possible, thanks to contexts-on-types. More to come. llvm-svn: 77011	2009-07-24 23:12:02 +00:00
Dan Gohman	85a791ef7a	AliasAnalysis wants sizes in address-units, not bits. llvm-svn: 77009	2009-07-24 23:01:30 +00:00
Dan Gohman	0b5be94c79	Fix this condition I accidentally inverted. llvm-svn: 76988	2009-07-24 18:31:07 +00:00
Dan Gohman	67243a4bec	Convert several more passes to use getAnalysisIfAvailable<TargetData>() instead of getAnalysis<TargetData>(). llvm-svn: 76982	2009-07-24 18:13:53 +00:00
Daniel Dunbar	796e43eede	Move more to raw_ostream, provide support for writing MachineBasicBlock, LiveInterval, etc to raw_ostream. llvm-svn: 76965	2009-07-24 10:36:58 +00:00
Daniel Dunbar	12368685d8	Switch to getNameStr(). llvm-svn: 76962	2009-07-24 08:24:36 +00:00
Daniel Dunbar	5bf72e20eb	Convert StringMap to using StringRef for its APIs. - Yay for '-'s and simplifications! - I kept StringMap::GetOrCreateValue for compatibility purposes, this can eventually go away. Likewise the StringMapEntry Create functions still follow the old style. - NIFC. llvm-svn: 76888	2009-07-23 18:17:34 +00:00
Chris Lattner	88ab854873	refactor a blob of code out to a new 'FoldOrOfFCmps' function and simplify it. llvm-svn: 76866	2009-07-23 05:46:22 +00:00
Chris Lattner	7d55541e56	Make some existing optimizations that would only trigger on scalars also apply to vectors. This allows us to compile this: #include <emmintrin.h> __m128i a(__m128 a, __m128 b) { return a==a & b==b; } __m128i b(__m128 a, __m128 b) { return a!=a \| b!=b; } to: _a: cmpordps %xmm1, %xmm0 ret _b: cmpunordps %xmm1, %xmm0 ret with clang instead of to a ton of horrible code. llvm-svn: 76863	2009-07-23 05:32:17 +00:00
Chris Lattner	9085438e4b	refactor a bunch of code out into a helper function, no functionality change. llvm-svn: 76859	2009-07-23 05:14:02 +00:00
Daniel Dunbar	0989a9a338	Remove unnecessary store to temporary std::string. llvm-svn: 76782	2009-07-22 20:46:46 +00:00
Eli Friedman	315596c39c	Don't give a massive inlining cost bonus to available_externally functions with a single use; eliminating the single use may eliminate the function from the current module, but usually doesn't eliminate it from the final program. llvm-svn: 76730	2009-07-22 08:12:59 +00:00
Owen Anderson	47db941fd3	Get rid of the Pass+Context magic. llvm-svn: 76702	2009-07-22 00:24:57 +00:00
Dan Gohman	3666c34db8	Convert instcombine from using using getAnalysis<TargetData> to getAnalysisIfAvailable<TargetData>. llvm-svn: 76676	2009-07-21 23:21:54 +00:00
Owen Anderson	c37bc69e91	Rename getConstantInt{True\|False} to get{True\|False} at Chris' behest. llvm-svn: 76598	2009-07-21 18:03:38 +00:00
Ted Kremenek	d0014cf36d	Update CMake files. llvm-svn: 76595	2009-07-21 17:43:20 +00:00
Owen Anderson	2ad52176f9	Move a bit more state over to the LLVMContext. llvm-svn: 76533	2009-07-21 02:47:59 +00:00
Chris Lattner	470a8da807	use ExpandInlineAsm on TargetLowering instead of TargetAsmInfo. llvm-svn: 76442	2009-07-20 17:52:52 +00:00
Dan Gohman	33a3fd0b9c	Revert the addition of hasNoPointerOverflow to GEPOperator. Getelementptrs that are defined to wrap are virtually useless to optimization, and getelementptrs that are undefined on any kind of overflow are too restrictive -- it's difficult to ensure that all intermediate addresses are within bounds. I'm going to take a different approach. Remove a few optimizations that depended on this flag. llvm-svn: 76437	2009-07-20 17:43:30 +00:00
Chris Lattner	58f9bb2ccd	implement a new magic global "llvm.compiler.used" which is like llvm.used, but doesn't cause ".no_dead_strip" to be emitted on darwin. llvm-svn: 76399	2009-07-20 06:14:25 +00:00
Bill Wendling	a3c6f6bffa	Add plumbing for the `linker_private' linkage type. This type is meant for "private" symbols which the assember shouldn't strip, but which the linker may remove after evaluation. This is mostly useful for Objective-C metadata. This is plumbing, so we don't have a use of it yet. More to come, etc. llvm-svn: 76385	2009-07-20 01:03:30 +00:00
Eli Friedman	048e78fc5b	Canonicalize bitcasts between types like <1 x i64> and i64 to insertelement/extractelement. I'm not entirely sure this is precisely what we want to do: should we prefer bitcast(insertelement) or insertelement(bitcast)? Similarly. should we prefer extractelement(bitcast) or bitcast(extractelement)? llvm-svn: 76345	2009-07-18 23:06:53 +00:00
Eli Friedman	eb6bcf3462	Back out 76300; apparently the preference is to canonicalize the other way (bitcast -> insert/extractelement). llvm-svn: 76325	2009-07-18 19:04:16 +00:00
Chris Lattner	1c71fd646b	add a fixme llvm-svn: 76324	2009-07-18 18:49:04 +00:00
Eli Friedman	52dbfc21c5	Add combine: X sdiv (1 << Y) -> X udiv (1 << Y) when X doesn't have the sign bit set. llvm-svn: 76304	2009-07-18 09:53:21 +00:00
Eli Friedman	992d0e0b74	Remove no-op check. llvm-svn: 76302	2009-07-18 09:21:25 +00:00
Eli Friedman	44e9836b17	Remove dead check. llvm-svn: 76301	2009-07-18 09:12:15 +00:00
Eli Friedman	a807aae226	Canonicalize insert/extractelement from single-element vectors into bitcasts. It would also be possible to canonicalize the other way; does anyone have a preference? llvm-svn: 76300	2009-07-18 09:07:47 +00:00
Eli Friedman	ff9bf97ceb	Fix simplifylibcalls memset recognition to work on 64-bit platforms where int is 32 bits. llvm-svn: 76293	2009-07-18 08:34:51 +00:00
Nick Lewycky	0d13903563	Replace intersectWith with maximalIntersectWith. The latter guarantees that all values belonging to the intersection will belong to the resulting range. The former was inconsistent about that point (either way is fine, just pick one.) This is part of PR4545. llvm-svn: 76289	2009-07-18 06:34:42 +00:00
Eli Friedman	e1b9216bc3	Fix the inline cost calculation to take into account instructions which cannot be folded even if they have constant operands. Significantly helps if_spppsubr.c attached to PR4573. llvm-svn: 76285	2009-07-18 05:26:06 +00:00
Eli Friedman	f13b36ddc5	Add line breaks to make the debug output a bit more readable. llvm-svn: 76284	2009-07-18 05:12:58 +00:00
Dan Gohman	e1019db658	Convert more code to use Operator instead of explicitly handling both ConstantExpr and Instruction. This involves duplicating some code between GetElementPtrInst and GEPOperator, but it's not a lot. llvm-svn: 76265	2009-07-17 23:55:56 +00:00
Dan Gohman	1d548d851a	Make BasicAliasAnalysis and Value::getUnderlyingObject use GEPOperator's hasNoPointer0verflow(), and make a few places in instcombine that create GEPs that may overflow clear the NoOverflow value. Among other things, this partially addresses PR2831. llvm-svn: 76252	2009-07-17 22:25:10 +00:00
Dan Gohman	a565d4f937	Fix some typos in a comment. llvm-svn: 76249	2009-07-17 22:16:21 +00:00
Dan Gohman	80ca01c466	Add a new Operator class, for handling Instructions and ConstantExprs in a convenient manner, factoring out some common code from InstructionCombining and ValueTracking. Move the contents of BinaryOperators.h into Operator.h and use Operator to generalize them to support ConstantExprs as well as Instructions. llvm-svn: 76232	2009-07-17 20:47:02 +00:00
Daniel Dunbar	482bd9dcb8	Initialize another Context, in the hopes of unbreaking CBE. llvm-svn: 76184	2009-07-17 16:20:23 +00:00
Eli Friedman	b8f6a4fc8e	Replace isTrapping with a new, similar method called isSafeToSpeculativelyExecute. The new method is a bit closer to what the callers actually care about in that it rejects more things callers don't want. It also adds more precise handling for integer division, and unifies code for analyzing the legality of a speculative load. llvm-svn: 76150	2009-07-17 04:28:42 +00:00
Owen Anderson	20b34ac794	Move the ConstantInt uniquing table into LLVMContextImpl. This exposed a number of issues in our current context-passing stuff, which is also fixed here llvm-svn: 76089	2009-07-16 18:04:31 +00:00
Owen Anderson	4fdeba9706	Revert yesterday's change by removing the LLVMContext parameter to AllocaInst and MallocInst. llvm-svn: 75863	2009-07-15 23:53:25 +00:00
Eli Friedman	662da55c5f	Switch invars away from using isTrapping when it really shouldn't be using it. llvm-svn: 75852	2009-07-15 22:48:29 +00:00
Eli Friedman	ebe66ab13b	Don't restrict the set of instructions where we try to constant-fold the operands; it's possible to end up with a constant-foldable operand to most instructions, even those which can't trap. llvm-svn: 75845	2009-07-15 22:13:34 +00:00
Dan Gohman	b0f8e9960d	Fix indentation. llvm-svn: 75723	2009-07-15 01:26:32 +00:00
Dan Gohman	c43e47938a	Make makeLoopInvariant report whether it made any changes or not, and use this to simplify more code. llvm-svn: 75722	2009-07-15 01:25:43 +00:00
Owen Anderson	b6b2530000	Move EVER MORE stuff over to LLVMContext. llvm-svn: 75703	2009-07-14 23:09:55 +00:00
Dale Johannesen	3be62697df	Revert 75571; I'm convinced this isn't the right thing to do. llvm-svn: 75642	2009-07-14 17:48:25 +00:00
Torok Edwin	fbcc663cbf	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Dan Gohman	e141364e5c	Require IVUsers after LCSSA, since LCSSA does not preserve IVUsers. This results in the pass manager running IVUsers only once for indvars, instead of twice. llvm-svn: 75633	2009-07-14 14:26:23 +00:00
Eli Friedman	14379df4e6	Fix trivial todo in instcombine. llvm-svn: 75586	2009-07-14 02:01:53 +00:00
Dan Gohman	4d6149f356	Update LoopSimplify and LoopUnswitch to use the new makeLoopInvariant function. llvm-svn: 75584	2009-07-14 01:37:59 +00:00
Dan Gohman	03d5d0f451	Fix indvars to not assume that a loop with a single unique exit block has a single unique exiting block. llvm-svn: 75579	2009-07-14 01:09:02 +00:00
Dale Johannesen	85ae7480d9	Don't delete asm's just because their inputs are undefined; xor R, R is a common and valid idiom for zeroing a register, for example. llvm-svn: 75571	2009-07-14 00:45:38 +00:00
Eli Friedman	4b95026194	PR4548: optimize zext+udiv+trunc to udiv. llvm-svn: 75539	2009-07-13 22:46:01 +00:00
Eli Friedman	7e1716dc9d	Canonicalize boolean +/- a constant to a select. (I think it's reasonably clear that we want to have a canonical form for constructs like this; if anyone thinks that a select is not the best canonical form, please tell me.) llvm-svn: 75531	2009-07-13 22:27:52 +00:00
Owen Anderson	bb2501bbbe	These don't really need contexts either. llvm-svn: 75528	2009-07-13 22:18:28 +00:00
Dan Gohman	cc85ae132c	Make Loop and MachineLoop be subclasses of LoopBase, rather than typedefs, using the Curiously Recurring Template Pattern with LoopBase. This will help further refactoring, and future functionality for Loop. Also, Headers can now foward-declare Loop, instead of pulling in LoopInfo.h or doing tricks. llvm-svn: 75519	2009-07-13 21:51:15 +00:00
Eli Friedman	42170b0a9e	Misc simplifications to InstCombiner::commonIntCastTransforms. Most of the changes are allowed by not calling this function for bitcasts. The Instruction::AShr case is dead because SimplifyDemandedInstructionBits handles that case. llvm-svn: 75514	2009-07-13 21:45:57 +00:00
Eli Friedman	7f3a529ae9	Fix comment. llvm-svn: 75499	2009-07-13 20:58:59 +00:00
Owen Anderson	542619e6d5	Move more functionality over to LLVMContext. llvm-svn: 75497	2009-07-13 20:58:05 +00:00
Eli Friedman	f13aa44d4f	Don't bother to call commonIntCastTransforms for bitcasts; int->int bitcasts will always be eliminated anyway. llvm-svn: 75495	2009-07-13 20:53:00 +00:00
Owen Anderson	53a52215b5	Begin the painful process of tearing apart the rat'ss nest that is Constants.cpp and ConstantFold.cpp. This involves temporarily hard wiring some parts to use the global context. This isn't ideal, but it's the only way I could figure out to make this process vaguely incremental. llvm-svn: 75445	2009-07-13 04:09:18 +00:00
Eli Friedman	575db66e1b	Remove check which is duplicated in InstCombiner::visitSelectInstWithICmp. llvm-svn: 75409	2009-07-12 02:00:05 +00:00
Chris Lattner	2f67295aac	silence a vc++ warning. llvm-svn: 75393	2009-07-11 22:31:59 +00:00
Torok Edwin	56d0659726	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Torok Edwin	ccb29cd290	Convert more assert(0)+abort() -> LLVM_UNREACHABLE, and abort()/exit() -> llvm_report_error(). llvm-svn: 75363	2009-07-11 13:10:19 +00:00
Nick Lewycky	dcfdce9067	Move a method that creates constant ranges relative to another constant range per icmp predicate out of predsimplify and into ConstantRange. Add another utility method that determines whether one range is a subset of another. Combine with the former to determine whether icmp pred range, range is known to be true or not. llvm-svn: 75357	2009-07-11 06:15:39 +00:00
Owen Anderson	16e7674f4b	Push LLVMContext through the PatternMatch API. llvm-svn: 75255	2009-07-10 17:35:01 +00:00
Owen Anderson	1e5f00e7a7	This started as a small change, I swear. Unfortunately, lots of things call the [I\|F]CmpInst constructors. Who knew!? llvm-svn: 75200	2009-07-09 23:48:35 +00:00
Owen Anderson	29fd313e9e	A little bit more LLVMContextification. llvm-svn: 75159	2009-07-09 18:36:20 +00:00
Nick Lewycky	ab81d2f5e9	There's no need to consider PHI nodes in the same block as the instruction we're inserting sigma/phi functions for. Patch by Andre Tavares. llvm-svn: 75138	2009-07-09 15:59:27 +00:00
Nick Lewycky	eb373ad2af	Add some statistics to SSI so we can see what it's up to. Add an -ssi-everything pass which calls createSSI on everything in the function. llvm-svn: 75135	2009-07-09 15:33:14 +00:00
Owen Anderson	a771459bb1	Push LLVMContext _back_ through IRBuilder. llvm-svn: 75040	2009-07-08 20:50:47 +00:00
Dan Gohman	7bb3173ff7	Tell ScalarEvolution to forget a loop before starting to delete it. This way ScalarEvolution can examine the loop to determine what state it needs to update, if it chooses. llvm-svn: 75029	2009-07-08 19:14:29 +00:00
Owen Anderson	b17f32945f	Switch GlobalVariable ctors to a sane API, where either a context or a module is required. llvm-svn: 75025	2009-07-08 19:03:57 +00:00
Nick Lewycky	a21d3daadc	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Owen Anderson	5948fdf68b	Push LLVMContext through GlobalVariables and IRBuilder. llvm-svn: 74985	2009-07-08 01:26:06 +00:00
Dan Gohman	af75234955	Change all SCEV* to SCEV *. llvm-svn: 74918	2009-07-07 17:06:11 +00:00
Owen Anderson	38264b1554	"LLVMContext* " --> "LLVMContext *" llvm-svn: 74878	2009-07-06 23:00:19 +00:00
Owen Anderson	f1f1743b2e	Finish LLVMContext-ing lib/Analysis. This required pushing LLVMContext's through the ValueTracking API. llvm-svn: 74873	2009-07-06 22:37:39 +00:00
Owen Anderson	39f00cc1d4	Thread LLVMContext through the constant folding APIs, which touches a lot of files. llvm-svn: 74844	2009-07-06 18:42:36 +00:00
Owen Anderson	605a8c743f	More LLVMContext-ification. llvm-svn: 74811	2009-07-06 01:34:54 +00:00
Owen Anderson	e70b637033	More LLVMContext-ification. llvm-svn: 74807	2009-07-05 22:41:43 +00:00
Mike Stump	bbd8707f6e	Fix build. llvm-svn: 74782	2009-07-03 22:11:58 +00:00
Owen Anderson	340288c621	Even more passes being LLVMContext'd. llvm-svn: 74781	2009-07-03 19:42:02 +00:00
Nick Lewycky	cb23509546	Add Static Single Information construction pass written by André Tavares! Use it by requiring it through the pass manager, then calling its createSSI method on the variables that you want in SSI form. llvm-svn: 74780	2009-07-03 19:28:36 +00:00
Duncan Sands	29c8efce31	Add newline at end of file. llvm-svn: 74773	2009-07-03 15:30:58 +00:00
Owen Anderson	80baed63b4	Second batch of passes using LLVMContext. llvm-svn: 74753	2009-07-03 00:54:20 +00:00
Owen Anderson	b5618da226	Convert the first batch of passes to use LLVMContext. llvm-svn: 74748	2009-07-03 00:17:18 +00:00
Chris Lattner	f3f6aaa2c3	fix inverted logic pointed out by John McCall, noticed by inspection. This was considering vector intrinsics to have cost 2, but non-vector intrinsics to have cost 1, which is backward. llvm-svn: 74698	2009-07-02 15:39:39 +00:00
Dan Gohman	43f33dd550	Fix a bunch of other places that used operator[] to test whether a key is present in a std::map or DenseMap to use find instead. llvm-svn: 74676	2009-07-02 00:17:47 +00:00
Dan Gohman	cf092389a9	Request LCSSA after LoopSimplify. This fixes a problem in which the PassManager was scheduling LCSSA before LoopSimplify, which does not preserve LCSSA. llvm-svn: 74661	2009-07-01 23:21:38 +00:00
Dan Gohman	83348f80b6	Fix an instcombine abort on a scalar-to-vector bitcast. This fixes PR4487. llvm-svn: 74646	2009-07-01 21:38:46 +00:00
Owen Anderson	6773d388aa	Add a pointer to the owning LLVMContext to Module. This requires threading LLVMContext through a lot of the bitcode reader and ASM parser APIs, as well as supporting it in all of the tools. Patches for Clang and LLVM-GCC to follow. llvm-svn: 74614	2009-07-01 16:58:40 +00:00
Chris Lattner	96122debc1	improve the APIs for creating struct and function types with no arguments/elements to not have to create a temporary vector (in the API at least). Patch by Jay Foad! llvm-svn: 74584	2009-07-01 04:13:31 +00:00
Dan Gohman	4dfc680059	Minor code simplification. llvm-svn: 74491	2009-06-30 01:24:43 +00:00
Dan Gohman	317f054531	Don't try to split a loop when the controlling icmp instruction doesn't have an IV-based operand. This fixes PR4471. llvm-svn: 74399	2009-06-27 22:58:27 +00:00
Dan Gohman	ffdcba3dbd	Remove the block from the LoopInfo, rather than just the Loop. LoopInfo will handle removing it from the Loop, as well as updating its own tables. llvm-svn: 74398	2009-06-27 22:32:36 +00:00
Dan Gohman	c8ca49659a	Teach LoopSimplify how to merge multiple loop exits into a single exit, when one of them can be converted to a trivial icmp and conditional branch. This addresses what is essentially a phase ordering problem. SimplifyCFG knows how to do this transformation, but it doesn't do so if the primary block has any instructions in it other than an icmp and a branch. In the given testcase, the block contains other instructions, however they are loop-invariant and can be hoisted. SimplifyCFG doesn't have LoopInfo though, so it can't hoist them. And, it's important that the blocks be merged before LoopRotation, as it doesn't support multiple-exit loops. llvm-svn: 74396	2009-06-27 21:30:38 +00:00
Dan Gohman	8918b481bf	More minor code simplifications. llvm-svn: 74395	2009-06-27 21:23:40 +00:00
Dan Gohman	fe174b6952	When a value is used multiple times within a single PHI, instructions inserted to replace that value must dominate all of of the basic blocks associated with the uses of the value in the PHI, not just one of them. llvm-svn: 74376	2009-06-27 05:16:57 +00:00
Dan Gohman	daafbe6168	Incorporate the insertion point into the key of SCEVExpander's CSE map. This helps it avoid reusing an instruction that doesn't dominate all of the users, in cases where the original instruction was inserted before all of the users were known. This may result in redundant expansions of sub-expressions that depend on loop-unpredictable values in some cases, however this isn't very common, and it primarily impacts IndVarSimplify, so GVN can be expected to clean these up. This eliminates the need for IndVarSimplify's FixUsesBeforeDefs, which fixes several bugs. llvm-svn: 74352	2009-06-26 22:53:46 +00:00
Devang Patel	0f2eb5b9f7	Remove unused routines. llvm-svn: 74351	2009-06-26 22:53:22 +00:00
Owen Anderson	01ad6605c0	Constify this value. llvm-svn: 74330	2009-06-26 21:39:56 +00:00
Douglas Gregor	6d94e6a5f3	Fix linking of llvm-ld and lli with CMake, from Xerxes Rånby llvm-svn: 74285	2009-06-26 15:37:00 +00:00
Devang Patel	0751a28888	Remove debug info anchors - llvm.dbg.compile_units, llvm.dbg.subprograms and llvm.dbg.global_variables. llvm-svn: 74251	2009-06-26 01:49:18 +00:00
Dan Gohman	ac3b5382b8	Change this code to a form about which VC++ reportedly isn't unhappy. llvm-svn: 74243	2009-06-26 00:35:12 +00:00
Dan Gohman	7eaf50ecac	Fix LCSSA to avoid emitting a PHI node for the unwind destination of an invoke instruction, since the value isn't really live across that edge. llvm-svn: 74242	2009-06-26 00:31:13 +00:00
Dan Gohman	31167c61d5	Minor code simplification. llvm-svn: 74240	2009-06-26 00:26:03 +00:00
Dan Gohman	091e440568	Reword a few comments. llvm-svn: 74146	2009-06-25 00:22:44 +00:00
Dan Gohman	929fa7b0f4	When inserting code into a loop preheader, insert it before the terminator, instead of after the last phi. This fixes a bug exposed by ScalarEvolution analyzing more kinds of loops. This fixes PR4436. llvm-svn: 74072	2009-06-24 14:31:06 +00:00
Dan Gohman	f19aeec3f5	Extend ScalarEvolution's multiple-exit support to compute exact trip counts in more cases. Generalize ScalarEvolution's isLoopGuardedByCond code to recognize And and Or conditions, splitting the code out into an isNecessaryCond helper function so that it can evaluate Ands and Ors recursively, and make SCEVExpander be much more aggressive about hoisting instructions out of loops. test/CodeGen/X86/pr3495.ll has an additional instruction now, but it appears to be due to an arbitrary register allocation difference. llvm-svn: 74048	2009-06-24 01:18:18 +00:00
Dan Gohman	f522a4e034	Don't emit a redundant BitCastInst if the value to be defined in the preheader is already an instruction. llvm-svn: 74031	2009-06-24 00:28:59 +00:00
Dan Gohman	fd76113e28	Fix a few minor issues that were exposed by the removal of SCEVHandle. llvm-svn: 73910	2009-06-22 22:08:45 +00:00
Owen Anderson	65b6056e37	SCEVHandle is no more! llvm-svn: 73906	2009-06-22 21:39:50 +00:00
Dan Gohman	78ea89e161	Fix this code to correctly handle loops with multiple exits. Until now, this hasn't mattered, because ScalarEvolution hasn't been able to compute trip counts for loops with multiple exits. But it will soon. llvm-svn: 73864	2009-06-22 00:15:15 +00:00
Dan Gohman	860379bcc2	Rename a variable for consistency with the ExitBlock vs ExitingBlock terminology that LoopInfo uses. llvm-svn: 73863	2009-06-21 23:48:38 +00:00
Dan Gohman	724f825f96	Fix a typo in a comment that Frits von Bommel noticed. llvm-svn: 73796	2009-06-19 23:41:37 +00:00
Dan Gohman	cc31110b95	Re-apply r73718, now that the fix in r73787 is in, and add a hand-crafted testcase which demonstrates the bug that was exposed in 254.gap. llvm-svn: 73793	2009-06-19 23:23:27 +00:00
Dan Gohman	55e3dd9174	Fix LSR's OptimizeSMax to ignore max operators with more than 2 operands, which it isn't prepared to handle. llvm-svn: 73787	2009-06-19 23:03:46 +00:00
Evan Cheng	86076c9e30	Revert 73718. It's breaking 254.gap. llvm-svn: 73783	2009-06-19 21:15:06 +00:00
Chris Lattner	d0a363e03b	make jump threading handle lexically identical compare instructions as if they were multiple uses of the same instruction. This interacts well with the existing loadpre that j-t does to open up many new jump threads earlier. llvm-svn: 73768	2009-06-19 16:27:56 +00:00
Nick Lewycky	77585a24ac	Teach jump threading to look at comparisons between phi nodes and non-constants. llvm-svn: 73755	2009-06-19 04:56:29 +00:00
Chris Lattner	5ca4197829	Improve tail call elim to move loads above readonly calls when it allows forming a tail call. Patch by Frits van Bommel. This implements PR4323. llvm-svn: 73752	2009-06-19 04:22:16 +00:00
Chris Lattner	87a222c5c8	part of PR4405: disable a contentious optimization for strcmp -> memcmp when the lengths of the strings are unknown. Patch by Nick Lewycky! llvm-svn: 73751	2009-06-19 04:17:36 +00:00
Dan Gohman	8c9ac59455	Generalize LSR's OptimizeSMax to handle unsigned max tests as well as signed max tests. Along with r73717, this helps CodeGen avoid emitting code for a maximum operation for this class of loop. llvm-svn: 73718	2009-06-18 20:23:18 +00:00
Anton Korobeynikov	6ee547bb1b	Revert IRBuilder CC propagation. Fix SimplifyLibCalls instead. llvm-svn: 73715	2009-06-18 20:05:31 +00:00
Dan Gohman	a0348809b6	Remove the code from IVUsers that attempted to handle casted induction variables in cases where the cast isn't foldable. It ended up being a pessimization in many cases. This could be fixed, but it would require a bunch of complicated code in IVUsers' clients. The advantages of this approach aren't visible enough to justify it at this time. llvm-svn: 73706	2009-06-18 16:54:06 +00:00
Dan Gohman	56bd02c55c	Generalize the zext(trunc(t) & C) instcombine to work even with C is not a low-bits mask, and add a similar instcombine for zext((trunc(t) & C) ^ C). llvm-svn: 73705	2009-06-18 16:30:21 +00:00
Dan Gohman	7f836c7c61	Instcombine zext(trunc(x) & mask) to x&mask, even if the trunc has multiple users. llvm-svn: 73656	2009-06-17 23:17:05 +00:00
Dale Johannesen	81b6463ed0	This fixes a bug introduced in 72661, which can move loads back past a check that the load address is valid, see new testcase. The test that went in with 72661 has exactly this case, except that the conditional it's moving past is checking something else; I've settled for changing that test to reference a global, not a pointer. It may be possible to scan all the tests you pass and make sure none of them are checking any component of the address, but it's not trivial and I'm not trying to do that here. llvm-svn: 73632	2009-06-17 20:48:23 +00:00
Torok Edwin	ba93ea7632	Add debug message about non-local loads being clobbered. llvm-svn: 73625	2009-06-17 18:48:18 +00:00
Dan Gohman	d8329e8378	Update comments to use doxygen syntax. llvm-svn: 73621	2009-06-17 17:51:33 +00:00
Sanjiv Gupta	2f2b0a1985	>> What if my global variable was into a different address space than stack? >> > > It doesn't matter in terms of semantics: because AnalyzeGlobal > returned false, we're guaranteed the address of the global is never > taken. I wouldn't be surprised if we end up generating invalid IR in > some cases, though, because of the semantics of replaceAllUsesWith. > Do you have a testcase that breaks? > > The problem is replaceAllUsesWith asserts for type mismatch here. Try attached .bc with llvm-ld. assert(New->getType() == getType() && "replaceAllUses of value with new value of different type!"); Since stack is always on address space zero, I don't think that type of GV in a different address space is ever going to match. The other way is to allow replaceAllUsesWith to ignore address spaces while comparing types. (do we have a way to do that ?). But then such an optimization may fail the entire idea of user wanting to place a variable into different memory space. The original idea of user might be to save on the stack space (data memory) and hence he asked the variable to be placed into different memory space (program memory). So the best bet here is to deny this optimization by checking GV->getType()->getAddressSpace() == 0. llvm-svn: 73605	2009-06-17 06:47:15 +00:00
Eli Friedman	a0fba5319d	PR3439: Correct a silly mistake in the SimplifyDemandedUseBits code for SRem. llvm-svn: 73598	2009-06-17 02:57:36 +00:00
Dan Gohman	0ed7756fbe	Generalize a few more instcombines to be vector/scalar-independent. llvm-svn: 73541	2009-06-16 19:55:29 +00:00
Chris Lattner	945d08d76f	Generalize instcombine's isSafeToLoadUnconditionally() function to ignore readonly calls, and factor it out of instcombine so that it can be used by other passes. Patch by Frits van Bommel! llvm-svn: 73506	2009-06-16 17:23:12 +00:00
Dan Gohman	adfd42a3c8	Use Type::getScalarType. llvm-svn: 73451	2009-06-16 00:20:26 +00:00
Dan Gohman	7ccc52f131	Support vector casts in more places, fixing a variety of assertion failures. To support this, add some utility functions to Type to help support vector/scalar-independent code. Change ConstantInt::get and ConstantFP::get to support vector types, and add an overload to ConstantInt::get that uses a static IntegerType type, for convenience. Introduce a new getConstant method for ScalarEvolution, to simplify common use cases. llvm-svn: 73431	2009-06-15 22:12:54 +00:00
Dale Johannesen	9df78ee1ae	Fix the crash in this test. This is basically the same problem addressed in 31284, but the patch there only addressed the case where an invoke is the first thing in a block. llvm-svn: 73416	2009-06-15 20:59:27 +00:00
Owen Anderson	bd6a213725	Merge PartialInliner changes. llvm-svn: 73412	2009-06-15 20:50:26 +00:00
Dan Gohman	a8f8a85388	Make the EnableLoadPRE variable static. llvm-svn: 73398	2009-06-15 18:30:15 +00:00
Dan Gohman	4fe64deb7b	Fix old-style type names in comments. llvm-svn: 73362	2009-06-14 23:30:43 +00:00
Dan Gohman	0652fd59ff	Convert several parts of the ScalarEvolution framework to use SmallVector instead of std::vector. llvm-svn: 73357	2009-06-14 22:47:23 +00:00
Dan Gohman	9b4c85ff62	Add another item to the list of things that indvars does. llvm-svn: 73355	2009-06-14 22:38:41 +00:00
Torok Edwin	74d21958a2	Fix CMake build. Patch from Ingmar Vanhassel. llvm-svn: 73342	2009-06-14 13:39:56 +00:00
Owen Anderson	2f82e2735a	Add an early implementation of a partial inlining pass. The idea behind this is that, for functions whose bodies are entirely guarded by an if-statement, it can be profitable to pull the test out of the callee and into the caller. This code has had some cursory testing, but still has a number of known issues on the LLVM test suite. llvm-svn: 73338	2009-06-14 08:26:32 +00:00
Nick Lewycky	47b71c5844	Unlike the other instructions, GEP really does need to look at the type of a pointer. This fixes kimwitu++. Pointed out by Frits van Bommel on review! llvm-svn: 73299	2009-06-13 19:09:52 +00:00
Dan Gohman	426901aa19	Teach SCEVExpander's visitAddRecExpr to reuse an existing canonical induction variable when the addrec to be expanded does not require a wider type. This eliminates the need for IndVarSimplify to micro-manage SCEV expansions, because SCEVExpander now automatically expands them in the form that IndVarSimplify considers to be canonical. (LSR still micro-manages its SCEV expansions, because it's optimizing for the target, rather than for other optimizations.) Also, this uses the new getAnyExtendExpr, which has more clever expression simplification logic than the IndVarSimplify code it replaces, and this cleans up some ugly expansions in code such as the included masked-iv.ll testcase. llvm-svn: 73294	2009-06-13 16:25:49 +00:00
Chris Lattner	3dd5c5d28a	second half of fix for PR4366: don't zap store to null of non-default addrspaces. llvm-svn: 73253	2009-06-12 21:01:07 +00:00
Dan Gohman	9377b086f5	Don't do (x - (y - z)) --> (x + (z - y)) on floating-point types, because it may round differently. This fixes PR4374. llvm-svn: 73243	2009-06-12 19:23:25 +00:00
Dan Gohman	17fb0d24eb	Give Instruction::isSameOperationAs a corresponding comment to note the relationship with MergeFunctions.cpp's isEquivalentOperation, and make a trivial code reordering so that the two functions are easier to compare. Fix the name of Instruction::isSameOperationAs in MergeFunction.cpp's isEquivalentOperation's comment, and fix a nearby 80-column violation. llvm-svn: 73241	2009-06-12 19:03:05 +00:00
Nick Lewycky	ec06695579	Keep callers of a weak function calling it, instead of the non-weak equivalent. llvm-svn: 73235	2009-06-12 17:16:48 +00:00
Nick Lewycky	d5bf51faa2	Don't forget to match the calling convention when producing a thunk. llvm-svn: 73231	2009-06-12 16:04:00 +00:00
Nick Lewycky	25675ac14a	Given two identical weak functions, produce one internal function and two weak thunks. llvm-svn: 73230	2009-06-12 15:56:56 +00:00
Nick Lewycky	e04dc22ebd	Add an "are types equivalent" operation that ignores the types that a pointer points to while analyzing all other fields. Use FoldingSetNodeID to produce a good hash. This dramatically decreases run times. Emit thunks. This means that it can look at all functions regardless of what the linkage is or if the address is taken, but unfortunately some small functions can be even shorter than the thunk because our backend doesn't yet realize it can just turn these into jumps. This means that this pass will pessimize code on average. llvm-svn: 73222	2009-06-12 08:04:51 +00:00
Chris Lattner	61797e3291	Fix 4366: store to null in non-default addr space should not be turned into unreachable. llvm-svn: 73195	2009-06-11 17:54:56 +00:00
Jay Foad	557169d923	Implement and use new method Function::hasAddressTaken(). llvm-svn: 73164	2009-06-10 08:41:11 +00:00
Jay Foad	edea37d801	Remove an unused function SafeToDestroyConstant(). Rename an almost identical function ConstantIsDead() to SafeToDestroyConstant(), to emphasise the connection with Constant::destroyConstant(). llvm-svn: 73149	2009-06-09 21:37:11 +00:00
Nick Lewycky	7ea68536b5	Don't crash on multiple return value with no obvious inserted value. Fixes PR4314. llvm-svn: 73007	2009-06-06 23:13:08 +00:00
Eli Friedman	73a83066d5	PR4340: Run SimplifyDemandedVectorElts on insertelement instructions; sometimes it can find simplifications that won't be found otherwise. llvm-svn: 73006	2009-06-06 20:08:03 +00:00
Jay Foad	e57ba2eab5	Use cast<> instead of dyn_cast<> for things that are known to be Instructions. llvm-svn: 73002	2009-06-06 17:49:35 +00:00
Devang Patel	50fc5a3cd7	Simplify. llvm-svn: 72965	2009-06-05 22:39:21 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Dan Gohman	7b6b5dd954	Don't do the X * 0.0 -> 0.0 transformation in instcombine, because instcombine doesn't know when it's safe. To partially compensate for this, introduce new code to do this transformation in dagcombine, which can use UnsafeFPMath. llvm-svn: 72872	2009-06-04 17:12:12 +00:00
Dan Gohman	c380cca7ae	Don't attempt to simplify an non-affine IV expression if it can't be simplified to a loop-invariant value. This fixes PR4315. llvm-svn: 72798	2009-06-03 19:11:31 +00:00
Dan Gohman	760377effc	Fix CodeGenPrepare's address-mode sinking to handle unusual addresses, involving Base values which do not have Pointer type. This fixes PR4297. llvm-svn: 72739	2009-06-02 21:29:13 +00:00
Evan Cheng	836894405f	Avoid infinite looping in AllGlobalLoadUsesSimpleEnoughForHeapSRA(). This can happen when PHI uses are recursively dependent on each other. llvm-svn: 72710	2009-06-02 00:56:07 +00:00
Eli Friedman	ee94e3cc9e	PR4286: Make RewriteLoadUserOfWholeAlloca and RewriteStoreUserOfWholeAlloca deal with tail padding because isSafeUseOfBitCastedAllocation expects them to. Otherwise, we crash trying to erase the bitcast. llvm-svn: 72688	2009-06-01 09:14:32 +00:00
Owen Anderson	cc0c75c74d	Be more aggressive in doing LoadPRE by tracing backwards when a block only has a single predecessor. Patch by Jakub Staszak. llvm-svn: 72661	2009-05-31 09:03:40 +00:00
Chris Lattner	221895303c	fix PR4284, a bug in simplifylibcalls handling memcmp. Patch by Benjamin Kramer! llvm-svn: 72625	2009-05-30 18:43:04 +00:00
Nick Lewycky	adbc284666	Give embedded metadata its own type instead of relying on EmptyStructTy. llvm-svn: 72610	2009-05-30 05:06:04 +00:00
Bill Wendling	006459ecd4	Enable GVN Load PRE. llvm-svn: 72589	2009-05-29 20:38:16 +00:00
Torok Edwin	0b0ddb21fe	just show the instruction, its not that slow. llvm-svn: 72577	2009-05-29 16:58:36 +00:00
Torok Edwin	6a94624a1b	for instructions with void type we have no choice but print the instruction as is, otherwise we get a <badref>. llvm-svn: 72567	2009-05-29 10:28:44 +00:00
Torok Edwin	72070282eb	Add a DEBUG() output to GVN that prints the instruction clobbering a load. This is useful when trying to figure out why GVN didn't eliminate redundant loads. llvm-svn: 72565	2009-05-29 09:46:03 +00:00
Owen Anderson	04cfdd38a2	Fix an issue where phiMap was not being updated properly when doing load PRE. Diagnosis and patch thanks to Jakub Staszak. llvm-svn: 72562	2009-05-29 05:37:54 +00:00
Nick Lewycky	206876e2da	Use Operands.data() instead of &Operands[0] where Operands is a potentially empty SmallVector. llvm-svn: 72512	2009-05-28 04:08:10 +00:00
Dan Gohman	4d1823680d	Revert 72493 and replace it with a more conservative fix, for now: don't rewrite the comparison if there is any implicit extension or truncation on the induction variable. I'm planning for IVUsers to eventually take over some of the work of this code, and for it to be generalized. llvm-svn: 72496	2009-05-27 21:10:47 +00:00
Dan Gohman	f4d85325c0	In ChangeCompareStride, when the stride to be reused is truncated to a smaller type, promoted its offset back up to the type of the new comparison. This fixes PR4222. llvm-svn: 72493	2009-05-27 20:00:18 +00:00
Dan Gohman	8ca0885d69	Change ScalarEvolution::getSCEVAtScope to always return the original value in the case where a loop exit value cannot be computed, instead of only in some cases while using SCEVCouldNotCompute in others. This simplifies getSCEVAtScope's callers. llvm-svn: 72375	2009-05-24 23:25:42 +00:00
Torok Edwin	26895b518b	Move Rewriter.clear() earlier, to avoid triggerring the AssertingVH by one of the RecursivelyDeleteTriviallyDeadInstructions. Add a comment explaining why the cache needs to be cleared. llvm-svn: 72372	2009-05-24 20:08:21 +00:00
Torok Edwin	5349cf5f4b	Instead of clearing the rewriter, don't attempt to rewrite dead phi nodes. Also fix 80 column violation. llvm-svn: 72371	2009-05-24 19:36:09 +00:00
Dan Gohman	4486da5b78	When rewriting the loop exit test with the canonical induction variable, leave the original comparison in place if it has other uses, since the other uses won't be dominated by the new comparison instruction. llvm-svn: 72369	2009-05-24 19:11:38 +00:00
Dan Gohman	fb56cf1b1d	When replacing a floating-point comparison with an integer comparison, use takeName to give the integer comparison a name. llvm-svn: 72367	2009-05-24 18:09:01 +00:00
Torok Edwin	d184bc209c	The rewriter may hold references to instructions that are deleted because they are trivially dead. Fix by clearing the rewriter cache before deleting the trivially dead instructions. Also make InsertedExpressions use an AssertingVH to catch these bugs easier. llvm-svn: 72364	2009-05-24 14:23:16 +00:00
Torok Edwin	7996339dd8	available_externall linkage is not local, this was confusing the codegenerator, and it wasn't generating calls through @PLT for these functions. hasLocalLinkage() is now false for available_externally, I attempted to fix the inliner and dce to handle available_externally properly. It passed make check. llvm-svn: 72328	2009-05-23 14:06:57 +00:00
Evan Cheng	a838a40bc4	Fix bug in FoldFCmp_IntToFP_Cst. If inttofp is a uintofp, use unsigned instead of signed integer constant. llvm-svn: 72300	2009-05-22 23:10:53 +00:00
Dan Gohman	781b75a7df	Teach IndVarSimplify's FixUsesBeforeDefs to handle InvokeInsts by assuming that the use of the value is in a block dominated by the "normal" destination. LangRef.html and other documentation sources don't explicitly guarantee this, but it seems to be assumed in other places in LLVM at least. This fixes an assertion failure on the included testcase, which is derived from the Ada testsuite. FixUsesBeforeDefs is a temporary measure which I'm looking to replace with a more capable solution. llvm-svn: 72266	2009-05-22 16:47:11 +00:00
Eli Friedman	0cf811df82	Fix loop-index-split to correctly preserve dominance frontiers. Part of PR4238. llvm-svn: 72244	2009-05-22 03:22:46 +00:00
Dan Gohman	bf0002e7c1	Teach ValueTracking a new way to analyze PHI nodes, and and teach Instcombine to be more aggressive about using SimplifyDemandedBits on shift nodes. This allows a shift to be simplified to zero in the included test case. llvm-svn: 72204	2009-05-21 02:28:33 +00:00
Dan Gohman	7248923a5d	Suppress the IV reversal transformation in the case that the RHS of the comparison is defined inside the loop. This fixes a use-before-def problem, because the transformation puts a use of the RHS outside the loop. llvm-svn: 72149	2009-05-20 00:34:08 +00:00
Dan Gohman	67587ce2e9	Remove an irrelevant comment. llvm-svn: 72132	2009-05-19 20:38:47 +00:00
Dan Gohman	97f70add3c	Add some more comments to the top of this file. llvm-svn: 72131	2009-05-19 20:37:36 +00:00
Dan Gohman	adc70d6806	Trim unneeded #includes. llvm-svn: 72130	2009-05-19 20:35:26 +00:00
Dan Gohman	2649491f9c	Teach SCEVExpander to expand arithmetic involving pointers into GEP instructions. It attempts to create high-level multi-operand GEPs, though in cases where this isn't possible it falls back to casting the pointer to i8* and emitting a GEP with that. Using GEP instructions instead of ptrtoint+arithmetic+inttoptr helps pointer analyses that don't use ScalarEvolution, such as BasicAliasAnalysis. Also, make the AddrModeMatcher more aggressive in handling GEPs. Previously it assumed that operand 0 of a GEP would require a register in almost all cases. It now does extra checking and can do more matching if operand 0 of the GEP is foldable. This fixes a problem that was exposed by SCEVExpander using GEPs. llvm-svn: 72093	2009-05-19 02:15:55 +00:00
Dan Gohman	14d1339579	Rename UseTy to AccessTy, for consistency with getAccessType, and to avoid ambiguity with the word "use" in IVStrideUse. llvm-svn: 72012	2009-05-18 16:45:28 +00:00
Dale Johannesen	1ac1969e09	Reuse existing getUnderlyingObject instead of adding another copy. llvm-svn: 71783	2009-05-14 18:41:18 +00:00
Dale Johannesen	f241df9abe	Use abs64 in one more place. llvm-svn: 71775	2009-05-14 16:47:34 +00:00
Dale Johannesen	3181652363	Handle some additonal cases of external weak globals. llvm-svn: 71717	2009-05-13 20:55:30 +00:00
Dale Johannesen	69921959b4	Don't generate a select whose operand is load of a weak external. These may have address 0 and are not safe to execute unconditionally. llvm-svn: 71688	2009-05-13 18:25:07 +00:00
Chris Lattner	149546a6a0	calls in nothrow functions can be marked nothrow even if the callee is not known to be nothrow. This allows readnone/readonly functions to be deleted even if we don't know whether the callee can throw. llvm-svn: 71676	2009-05-13 17:39:14 +00:00
Chris Lattner	7e335a763a	Fix PR4206 - crash in simplify lib calls llvm-svn: 71644	2009-05-13 06:26:11 +00:00
Dale Johannesen	536de01bcf	Add an int64_t variant of abs, for host environments without one. Use it where we were using abs on int64_t objects. (I strongly suspect the casts to unsigned in the fragments in LoopStrengthReduce are not doing whatever the original intent was, but the obvious change to uint64_t doesn't work. Maybe later.) llvm-svn: 71612	2009-05-13 00:24:22 +00:00
Dan Gohman	d76d71a291	Factor the code for collecting IV users out of LSR into an IVUsers class, and generalize it so that it can be used by IndVarSimplify. Implement the base IndVarSimplify transformation code using IVUsers. This removes TestOrigIVForWrap and associated code, as ScalarEvolution now has enough builtin overflow detection and folding logic to handle all the same cases, and more. Run "opt -iv-users -analyze -disable-output" on your favorite loop for an example of what IVUsers does. This lets IndVarSimplify eliminate IV casts and compute trip counts in more cases. Also, this happens to finally fix the remaining testcases in PR1301. Now that IndVarSimplify is being more aggressive, it occasionally runs into the problem where ScalarEvolutionExpander's code for avoiding duplicate expansions makes it difficult to ensure that all expanded instructions dominate all the instructions that will use them. As a temporary measure, IndVarSimplify now uses a FixUsesBeforeDefs function to fix up instructions inserted by SCEVExpander. Fortunately, this code is contained, and can be easily removed once a more comprehensive solution is available. llvm-svn: 71535	2009-05-12 02:17:14 +00:00
Evan Cheng	78a4eb844b	Teach LSR to optimize more loop exit compares, i.e. change them to use postinc iv value. Previously LSR would only optimize those which are in the loop latch block. However, if LSR can prove it is safe (and profitable), it's now possible to change those not in the latch blocks to use postinc values. Also, if the compare is the only use, LSR would place the iv increment instruction before the compare instead in the latch. llvm-svn: 71485	2009-05-11 22:33:01 +00:00
Dale Johannesen	02cb2bf2e3	Reverse a loop that is counting up to a maximum to count down to 0 instead, under very restricted circumstances. Adjust 4 testcases in which this optimization fires. llvm-svn: 71439	2009-05-11 17:15:42 +00:00
Duncan Sands	af9eaa830a	Rename PaddedSize to AllocSize, in the hope that this will make it more obvious what it represents, and stop it being confused with the StoreSize. llvm-svn: 71349	2009-05-09 07:06:46 +00:00
Evan Cheng	b9dcc2c0c9	Factor out code that optimize loop terminating condition. llvm-svn: 71305	2009-05-09 01:08:24 +00:00
Chris Lattner	c48091f141	fix RewriteStoreUserOfWholeAlloca to use the correct type size method, fixing a crash on PR4146. While the store will ultimately overwrite the "padded size" number of bits in memory, the stored value may be a subset of this size. This function only wants to handle the case where all bits are stored. llvm-svn: 71224	2009-05-08 15:54:41 +00:00
Nick Lewycky	702fbf94a0	This transform requires valid TargetData info. Wrap it in 'if (TD)' in preparation for the day we use null TargetData when no target is specified. llvm-svn: 71210	2009-05-08 06:47:37 +00:00
Eli Friedman	36b9026fa7	PR4123: don't crash when inlining a call which uses its own result. llvm-svn: 71199	2009-05-08 00:22:04 +00:00
Dan Gohman	140a6f24f0	Perform constant folding on operands of instructions with non-void types, such as loads and calls. llvm-svn: 71175	2009-05-07 19:43:39 +00:00
Evan Cheng	342053cd27	Unbreak the build. llvm-svn: 71091	2009-05-06 18:00:56 +00:00
David Greene	0dec5b9a75	Make sure to use signed arithmetic in APInt to fix a regression. llvm-svn: 71090	2009-05-06 17:39:26 +00:00
Dan Gohman	9a6fef0a52	Simplify code by using SmallVector's pop_back_val() instead of separate back() and pop_back() calls. llvm-svn: 71089	2009-05-06 17:22:41 +00:00
Duncan Sands	9759f2e063	Fix PR3754: don't mark functions that wrap MallocInst with the readnone. Since MallocInst is scheduled for deletion it doesn't seem worth doing anything more subtle, such as having mayWriteToMemory return true for MallocInst. llvm-svn: 71077	2009-05-06 08:42:00 +00:00
Duncan Sands	1efabaaa2a	Allow readonly functions to unwind exceptions. Teach the optimizers about this. For example, a readonly function with no uses cannot be removed unless it is also marked nounwind. llvm-svn: 71071	2009-05-06 06:49:50 +00:00
Dan Gohman	e58fc20f8d	Fix a copy+pasto in a comment. llvm-svn: 71035	2009-05-05 23:02:38 +00:00
Dan Gohman	96b18ccdd3	Delete a FIXME which is no longer relevant, and add a FIXME that is. llvm-svn: 71033	2009-05-05 22:59:55 +00:00
Bill Wendling	5e2ac0cd9c	Temporarily reverting r71008. It was causing this failure: Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/change-compare-stride-1.ll Failed with exit(1) at line 2 while running: grep {cmpq $-478,} change-compare-stride-1.ll.tmp child process exited abnormally llvm-svn: 71013	2009-05-05 20:49:46 +00:00
David Greene	246a3dfb10	Handle overflow of 64-bit loop conditions. llvm-svn: 71008	2009-05-05 20:22:36 +00:00
Dan Gohman	48f8222293	Re-apply 70645, converting ScalarEvolution to use CallbackVH, with fixes. allUsesReplacedWith need to walk the def-use chains and invalidate all users of a value that is replaced. SCEVs of users need to be recalcualted even if the new value is equivalent. Also, make forgetLoopPHIs walk def-use chains, since any SCEV that depends on a PHI should be recalculated when more information about that PHI becomes available. llvm-svn: 70927	2009-05-04 22:30:44 +00:00
Dan Gohman	a30370bc33	Constify a bunch of SCEV-using code. llvm-svn: 70919	2009-05-04 22:02:23 +00:00
Chris Lattner	fa552d728d	fix some problems spotted by Duncan and Nicolas Geoffray llvm-svn: 70872	2009-05-04 16:29:24 +00:00
Chris Lattner	d579cb1167	* Sink 4 duplicates of edge threading validity checks and DOUT prints into ThreadEdge directly. This shares the code, but is just a refactoring. * Make JumpThreading compute the set of loop headers and avoid threading across them. This prevents jump threading from forming irreducible loops (goodness) but also prevents it from threading in other cases that are beneficial (see the comment above FindFunctionBackedges). llvm-svn: 70820	2009-05-04 02:28:08 +00:00
Chris Lattner	351134ba93	Factor loop backedge finding out of CodeGenPrepare into a new FindFunctionBackedges function. llvm-svn: 70819	2009-05-04 02:25:58 +00:00
Dan Gohman	5036695c32	Revert r70645 for now; it's causing a variety of regressions. llvm-svn: 70661	2009-05-03 05:46:20 +00:00
Nick Lewycky	431f97e4f0	Revert r70630. Go back to appending ".b" to internal globals when shrinking them to bool. llvm-svn: 70653	2009-05-03 03:49:08 +00:00
Dan Gohman	e9a38d16fe	Convert ScalarEvolution to use CallbackVH for its internal map. This makes ScalarEvolution::deleteValueFromRecords, and it's code that subtly needed to be called before ReplaceAllUsesWith, unnecessary. It also makes ValueDeletionListener unnecessary. llvm-svn: 70645	2009-05-02 21:19:20 +00:00
Dan Gohman	b17dcbdadc	The second argument to RecursivelyDeleteTriviallyDeadInstructions has a default value, and will hopefully be going away soon. llvm-svn: 70642	2009-05-02 20:22:10 +00:00
Dan Gohman	ff08995589	Previously, RecursivelyDeleteDeadInstructions provided an option of returning a list of pointers to Values that are deleted. This was unsafe, because the pointers in the list are, by nature of what RecursivelyDeleteDeadInstructions does, always dangling. Replace this with a simple callback mechanism. This may eventually be removed if all clients can reasonably be expected to use CallbackVH. Use this to factor out the dead-phi-cycle-elimination code from LSR utility function, and generalize it to use the RecursivelyDeleteTriviallyDeadInstructions utility function. This makes LSR more aggressive about eliminating dead PHI cycles; adjust tests to either be less trivial or to simply expect fewer instructions. llvm-svn: 70636	2009-05-02 18:29:22 +00:00
Dan Gohman	c27345f0b4	Tell ScalarEvolution that the loop is being deleted before actually deleting it. This will let ScalarEvolution be more complete about updating its records. llvm-svn: 70632	2009-05-02 17:29:26 +00:00
Nick Lewycky	462cd34332	Don't append ".b" to the names of globals that are being shrunk to booleans. llvm-svn: 70630	2009-05-02 16:21:50 +00:00
Dan Gohman	6409e7d4e9	Don't split critical edges during the AddUsersIfInteresting phase of LSR. This makes the AddUsersIfInteresting phase of LSR a pure analysis instead of a phase that potentially does CFG modifications. The conditions where this code would actually perform a split are rare, and in the cases where it actually would do a split the split is usually undone by CodeGenPrepare, and in cases where splits actually survive into codegen, they appear to hurt more often than they help. llvm-svn: 70625	2009-05-02 05:36:01 +00:00
Dan Gohman	65dbe7874f	Make RequiresTypeConversion canonicalize the types before calling the target hooks canLosslesslyBitCastTo and isTruncateFree. This allows targets to avoid worrying about handling all combinations of integer and pointer types. llvm-svn: 70555	2009-05-01 17:07:43 +00:00
Dan Gohman	d3aa4215ef	Minor whitespace fix. llvm-svn: 70551	2009-05-01 16:56:32 +00:00
Dan Gohman	6be8530158	Fix some code to work if TargetLowering is not available. llvm-svn: 70546	2009-05-01 16:29:14 +00:00
Dale Johannesen	f4031bd01e	Print correct instruction in dump. llvm-svn: 70427	2009-04-29 22:57:20 +00:00
Dan Gohman	8ddd0b3599	Reword and tidy up some comments. llvm-svn: 70416	2009-04-29 22:01:05 +00:00
Dan Gohman	3e6e188ee3	Remove an obsolete comment. llvm-svn: 70262	2009-04-27 22:12:34 +00:00
Dale Johannesen	27b4f222cf	Fix PR 4086, a bug in FP IV elimination. llvm-svn: 70247	2009-04-27 21:03:15 +00:00
Dan Gohman	e99f98262c	Permit ChangeCompareStride to rewrite a comparison when the factor between the comparison's iv stride and the candidate stride is exactly -1. llvm-svn: 70244	2009-04-27 20:35:32 +00:00
Dan Gohman	1b5055ab7f	Return null instead of false, as appropriate. llvm-svn: 70054	2009-04-25 17:28:45 +00:00
Dan Gohman	5638e0d642	Add several more icmp simplifications. Transform signed comparisons into unsigned ones when the operands are known to have the same sign bit value. llvm-svn: 70053	2009-04-25 17:12:48 +00:00
Sanjiv Gupta	46c97e626f	Allow i16 type indices to gep. llvm-svn: 69946	2009-04-24 02:37:54 +00:00
Dan Gohman	86bcd97014	Change SCEVExpander's expandCodeFor to provide more flexibility with the persistent insertion point, and change IndVars to make use of it. This fixes a bug where IndVars was holding on to a stale insertion point and forcing the SCEVExpander to continue to use it. This fixes PR4038. llvm-svn: 69892	2009-04-23 15:16:49 +00:00
Evan Cheng	d8174d3d09	Make sure both operands have binary instructions have the same type. llvm-svn: 69844	2009-04-22 23:39:28 +00:00
Evan Cheng	59ca33053b	A few more places where the check of use_empty is needed. llvm-svn: 69842	2009-04-22 23:09:16 +00:00
Evan Cheng	cbfe9df096	Avoid deferencing use_begin() if value does not have a use. llvm-svn: 69836	2009-04-22 22:45:37 +00:00
Owen Anderson	6cbf5bb9bb	Real fix for PR3549, by using caching for predecessor counts in addition to the predecessors themselves. This halves the time to optimize the testcase, beyond what my previous patch did. llvm-svn: 69792	2009-04-22 08:50:12 +00:00
Owen Anderson	bb754826c9	Use PredIteratorCache in LCSSA, which gives a 37% overall speedup on the testcase from PR3549. More improvements to come. llvm-svn: 69788	2009-04-22 08:09:13 +00:00
Chris Lattner	58be2d4413	use predicate instead of hand-rolled loop llvm-svn: 69752	2009-04-21 23:37:18 +00:00
Chris Lattner	69223bb7f5	fix a crash on a pointless but valid zero-length memset, rdar://6808691 llvm-svn: 69680	2009-04-21 16:52:12 +00:00
Dan Gohman	4860db61be	Factor out a common base class from SCEVTruncateExpr, SCEVZeroExtendExpr, and SCEVSignExtendExpr. llvm-svn: 69649	2009-04-21 01:25:57 +00:00
Dan Gohman	b397e1a7a2	Introduce encapsulation for ScalarEvolution's TargetData object, and refactor the code to minimize dependencies on TargetData. llvm-svn: 69644	2009-04-21 01:07:12 +00:00
Dale Johannesen	1238220473	Adjust loop size estimate for full unrolling; GEP's don't usually become instructions. llvm-svn: 69631	2009-04-20 22:19:33 +00:00
Sanjiv Gupta	428d490332	Before trying to introduce/eliminate cast/ext/trunc to make indices type as pointer type, make sure that the pointer size is a valid sequential index type. llvm-svn: 69574	2009-04-20 06:05:54 +00:00
Dan Gohman	056857aa21	Use more const qualifiers with SCEV interfaces. llvm-svn: 69450	2009-04-18 17:56:28 +00:00
Jim Grosbach	8d62763779	remove trailing whitespace llvm-svn: 69402	2009-04-17 23:30:55 +00:00
David Greene	22fa407ed7	Use a safer iterator interface and get rid of std C++ library misuse. This fixes a --enable-expensive-checks problem. llvm-svn: 69353	2009-04-17 14:56:18 +00:00
Dan Gohman	d2d6fd806c	Don't create ConstantInts with pointer type. This fixes a regression in 403.gcc in PIC_CODEGEN=1 and DISABLE_LTO=1 mode. llvm-svn: 69344	2009-04-17 02:02:52 +00:00
Dan Gohman	fec1d086e0	Use TargetData::getTypeSizeInBits instead of getPrimitiveSizeInBits() to get the correct answer for pointer types. llvm-svn: 69321	2009-04-16 22:35:57 +00:00
Eli Friedman	929207fd1d	Fix for PR3944: make mem2reg O(N) instead of O(N^2) in the number of incoming edges for a block with many predecessors. llvm-svn: 69312	2009-04-16 21:40:28 +00:00
Dan Gohman	8b6ebb1112	Minor code simplifications. Don't attempt LSR on theoretical targets with pointers larger than 64 bits, due to the code not yet being APInt clean. llvm-svn: 69296	2009-04-16 16:49:48 +00:00
Dan Gohman	e2ead2c328	LSR is no longer a GEP optimizer. It is now an IV expression optimizer, which just happen to frequently involve optimizing GEPs. llvm-svn: 69295	2009-04-16 16:46:01 +00:00
Dan Gohman	a8be04b2db	Use ConstantExpr::getIntToPtr instead of SCEVExpander::InsertCastOfTo, since the operand is always a constant. llvm-svn: 69291	2009-04-16 15:48:38 +00:00
Dan Gohman	71bccd3e0e	Use a SCEV expression cast instead of immediately inserting a new instruction with SCEVExpander::InsertCastOfTo. llvm-svn: 69290	2009-04-16 15:47:35 +00:00
Dan Gohman	0a40ad93a9	Expand GEPs in ScalarEvolution expressions. SCEV expressions can now have pointer types, though in contrast to C pointer types, SCEV addition is never implicitly scaled. This not only eliminates the need for special code like IndVars' EliminatePointerRecurrence and LSR's own GEP expansion code, it also does a better job because it lets the normal optimizations handle pointer expressions just like integer expressions. Also, since LLVM IR GEPs can't directly index into multi-dimensional VLAs, moving the GEP analysis out of client code and into the SCEV framework makes it easier for clients to handle multi-dimensional VLAs the same way as other arrays. Some existing regression tests show improved optimization. test/CodeGen/ARM/2007-03-13-InstrSched.ll in particular improved to the point where if-conversion started kicking in; I turned it off for this test to preserve the intent of the test. llvm-svn: 69258	2009-04-16 03:18:22 +00:00
Dale Johannesen	a71daa83c6	Eliminate zext over (iv \| const) or (signed iv), and sext over (iv \| const), if a longer iv is available. Allow expressions to have more than one zext/sext parent. All from OpenSSL. llvm-svn: 69241	2009-04-15 23:31:51 +00:00
Dale Johannesen	82230b5b17	Eliminate zext over (iv & const) or ((iv+const)&const) if a longer iv is available. These subscript forms are not common; they're a bottleneck in OpenSSL. llvm-svn: 69215	2009-04-15 20:41:02 +00:00
Dale Johannesen	7ffb7d5728	Enhance induction variable code to remove the sext around sext(shorter IV + constant), using a longer IV instead, when it can figure out the add can't overflow. This comes up a lot in subscripting; mainly affects 64 bit. llvm-svn: 69123	2009-04-15 01:10:12 +00:00
Evan Cheng	ffb83a155e	Avoid making the transformation enabled by my last patch if the new destinations have phi nodes. llvm-svn: 69121	2009-04-15 00:43:54 +00:00
Devang Patel	046bf624b9	While inlining, clone llvm.dbg.func.start intrinsic and adjust llvm.dbg.region.end instrinsic. This nested llvm.dbg.func.start/llvm.dbg.region.end pair now enables DW_TAG_inlined_subroutine support in code generator. llvm-svn: 69118	2009-04-15 00:17:06 +00:00
Evan Cheng	5ebf2acd84	Optimize conditional branch on i1 phis with non-constant inputs. This turns: eq: %3 = icmp eq i32 %1, %2 br label %join ne: %4 = icmp ne i32 %1, %2 br label %join join: %5 = phi i1 [%3, %eq], [%4, %ne] br i1 %5, label %yes, label %no => eq: %3 = icmp eq i32 %1, %2 br i1 %3, label %yes, label %no ne: %4 = icmp ne i32 %1, %2 br i1 %4, label %yes, label %no llvm-svn: 69102	2009-04-14 23:40:03 +00:00
Owen Anderson	a1902318e3	LoopIndexSplit needs to inform the loop pass manager of the instructions it is deleting, not just the basic block. llvm-svn: 69011	2009-04-14 01:04:19 +00:00
Chris Lattner	836e77d161	eliminate unneeded parens. llvm-svn: 68939	2009-04-13 05:38:23 +00:00
Chris Lattner	6cd82fb430	"There was a typo in my previous patch which leads to miscompilation of strncat :( strncat(foo, "bar", 99) would be optimized to memcpy(foo+strlen(foo), "bar", 100, 1) instead of memcpy(foo+strlen(foo), "bar", 4, 1)" Patch by Benjamin Kramer! llvm-svn: 68905	2009-04-12 18:22:33 +00:00
Chris Lattner	91b6af24ac	add some optimizations for strncpy/strncat and factor some code. Patch by Benjamin Kramer! llvm-svn: 68885	2009-04-12 05:06:39 +00:00
Chris Lattner	eb510d6b3d	Instcombine should not promote whole computation trees to "strange" integer types, unless they are already strange. This prevents it from turning the code produced by SROA into crazy libcalls and stuff that the code generator can't handle. In the attached example, the result was an i96 multiply that caused the x86 backend to assert. Note that if TargetData had an idea of what the legal types are for a target that this could be used to stop instcombine from introducing i64 muls, as Scott wanted. llvm-svn: 68598	2009-04-08 05:41:03 +00:00
Chris Lattner	321741af5f	fix rdar://6762290, a crash compiling cxx filt with clang. llvm-svn: 68500	2009-04-07 05:03:34 +00:00
Chris Lattner	47d6e7b93e	remove empty section llvm-svn: 68485	2009-04-07 02:55:53 +00:00
Ed Schouten	01aa6ec97a	Let the strcat optimizer return the pointer to the start of the buffer, instead of the place where it started to perform the string copy. - PR3661 - Patch by Benjamin Kramer! llvm-svn: 68443	2009-04-06 13:06:48 +00:00
Owen Anderson	98f912bf13	Reapply r68211, with the miscompilations it caused fixed. llvm-svn: 68262	2009-04-01 23:53:49 +00:00
Dan Gohman	c4971721ea	Revert r68172. It caused regressions in Applications/Burg/burg Applications/ClamAV/clamscan and many other tests. llvm-svn: 68211	2009-04-01 16:37:47 +00:00
Owen Anderson	ff5961b46c	Enhance GVN to propagate simple conditionals. This fixes PR3921. llvm-svn: 68172	2009-04-01 01:20:45 +00:00
Chris Lattner	f72ce6ea8b	Make the key of ValueRankMap an AssertingVH, so that we die violently if it dangles. llvm-svn: 68150	2009-03-31 22:13:29 +00:00
Evan Cheng	826b6f0f7c	Throttle back "fold select into operand" transformation. InstCombine should not generate selects of two constants unless they are selects of 0 and 1. e.g. define i32 @t1(i32 %c, i32 %x) nounwind { %t1 = icmp eq i32 %c, 0 %t2 = lshr i32 %x, 18 %t3 = select i1 %t1, i32 %t2, i32 %x ret i32 %t3 } was turned into define i32 @t2(i32 %c, i32 %x) nounwind { %t1 = icmp eq i32 %c, 0 %t2 = select i1 %t1, i32 18, i32 0 %t3 = lshr i32 %x, %t2 ret i32 %t3 } For most targets, that means materializing two constants and then a select. e.g. On x86-64 movl %esi, %eax shrl $18, %eax testl %edi, %edi cmovne %esi, %eax ret => xorl %eax, %eax testl %edi, %edi movl $18, %ecx cmovne %eax, %ecx movl %esi, %eax shrl %cl, %eax ret Also, the optimizer and codegen can reason about shl / and / add, etc. by a constant. This optimization will hinder optimizations using ComputeMaskedBits. llvm-svn: 68142	2009-03-31 20:42:45 +00:00
Devang Patel	4ce6e69022	Update call graph after inlining invoke. Patch by Jay Foad. llvm-svn: 68120	2009-03-31 17:36:12 +00:00
Devang Patel	6e68bd007a	Loop Index Split can eliminate a loop if it can determin if loop body is executed only once. There was a bug in determining IV based value of the iteration for which the loop body is executed. Fix it. llvm-svn: 68071	2009-03-30 22:24:10 +00:00
Duncan Sands	3241b74f69	Revert r67798: it breaks llvm-gcc bootstrap on x86-64-linux, presumably due to a miscompilation. make[4]: Entering directory `gcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include' if [ ! -d "./x86_64-unknown-linux-gnu/bits/stdtr1c++.h.gch" ]; then \ mkdir -p ./x86_64-unknown-linux-gnu/bits/stdtr1c++.h.gch; \ fi; \ gcc-4.2.llvm-objects/./gcc/xgcc -shared-libgcc -Bgcc-4.2.llvm-objects/./gcc -nostdinc++ -Lgcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/src -Lgcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/src/.libs -B/usr/local/gnat-llvm/x86_64-unknown-linux-gnu/bin/ -B/usr/local/gnat-llvm/x86_64-unknown-linux-gnu/lib/ -isystem /usr/local/gnat-llvm/x86_64-unknown-linux-gnu/include -isystem /usr/local/gnat-llvm/x86_64-unknown-linux-gnu/sys-include -Winvalid-pch -Wno-deprecated -x c++-header -g -O2 -D_GNU_SOURCE -Igcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include/x86_64-unknown-linux-gnu -Igcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include -Igcc-4.2.llvm/libstdc++-v3/libsupc++ -O2 -g gcc-4.2.llvm/libstdc++-v3/include/precompiled/stdtr1c++.h -o x86_64-unknown-linux-gnu/bits/stdtr1c++.h.gch/O2g.gch In file included from gcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include/tr1/repeat.h:247, from gcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include/tr1/functional:1098, from gcc-4.2.llvm/libstdc++-v3/include/precompiled/stdtr1c++.h:53: gcc-4.2.llvm-objects/x86_64-unknown-linux-gnu/libstdc++-v3/include/tr1/functional_iterate.h:417: internal compiler error: in ggc_recalculate_in_use_p, at ggc-page.c:1602 Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://llvm.org/bugs/> for instructions. make[4]: *** [x86_64-unknown-linux-gnu/bits/stdtr1c++.h.gch/O2g.gch] Error 1 llvm-svn: 67839	2009-03-27 14:56:47 +00:00
Dale Johannesen	4026b041ce	One more place to skip debug info. llvm-svn: 67811	2009-03-27 01:13:37 +00:00
Devang Patel	fe7c0492a0	While hoisting an instruction, update alias info set tracker. llvm-svn: 67798	2009-03-26 23:48:52 +00:00
Dale Johannesen	db90560c1c	Skip debug info one more place. (This one gets called from llc, not opt, but it's an IR level optimization nevertheless.) llvm-svn: 67724	2009-03-26 01:15:07 +00:00
Devang Patel	4555618854	Before deleting a basic block, give other loop passes a chance cleanup analysis values, related to the instructions in the basic block. llvm-svn: 67719	2009-03-25 23:57:48 +00:00
Chris Lattner	c3b2111d97	Fix PR3874 by restoring a condition I removed, but making it more precise than it used to be. llvm-svn: 67662	2009-03-25 00:28:58 +00:00
Chris Lattner	9e94538005	oops, I intended to remove this, not comment it out. Thanks Duncan! llvm-svn: 67657	2009-03-24 23:48:25 +00:00
Chris Lattner	306813cbbb	canonicalize inttoptr and ptrtoint instructions which cast pointers to/from integer types that are not intptr_t to convert to intptr_t then do an integer conversion to the dest type. This exposes the cast to the optimizer. llvm-svn: 67638	2009-03-24 18:35:40 +00:00
Chris Lattner	d9eb41177a	two changes: 1. Make instcombine always canonicalize trunc x to i1 into an icmp(x&1). This exposes the AND to other instcombine xforms and is more of what the code generator expects. 2. Rewrite the remaining trunc pattern match to use 'match', which simplifies it a lot. llvm-svn: 67635	2009-03-24 18:15:30 +00:00
Dale Johannesen	32dfb35281	Use a SmallPtrSet instead of std::set. llvm-svn: 67578	2009-03-23 23:39:20 +00:00
Dan Gohman	4f2fea1a21	Now that errs() is properly non-buffered, there's no need to explicitly flush it. llvm-svn: 67526	2009-03-23 15:57:19 +00:00
Duncan Sands	1f15ca7c7a	Factorize out a concept - no functionality change. llvm-svn: 67454	2009-03-21 21:27:31 +00:00
Chris Lattner	0a981d1d36	Fix instcombine to not introduce undefined shifts when merging two shifts together. This fixes PR3851. llvm-svn: 67411	2009-03-20 22:41:15 +00:00
Duncan Sands	a09e0afe74	Don't load values out of global constants with weak linkage: the value may be replaced with something different at link time. (Frontends that want to allow values to be loaded out of weak constants can give their constants weak_odr linkage). llvm-svn: 67407	2009-03-20 21:53:29 +00:00
Dale Johannesen	2050968df9	Clear the cached cost when removing a function in the inliner; prevents nondeterministic behavior when the same address is reallocated. Don't build call graph nodes for debug intrinsic calls; they're useless, and there were typically a lot of them. llvm-svn: 67311	2009-03-19 18:03:56 +00:00
Dale Johannesen	e4f361212b	Fix comment typo. llvm-svn: 67307	2009-03-19 17:23:29 +00:00
Dale Johannesen	52bc2aac8a	This pass keeps a map of Instructions to Rank numbers, and was deleting Instructions without clearing the corresponding map entry. This led to nondeterministic behavior if the same address got allocated to another Instruction within a short time. llvm-svn: 67306	2009-03-19 17:22:53 +00:00
Nick Lewycky	bfd4ad67c7	Remove strange extra semicolons. llvm-svn: 67287	2009-03-19 05:51:39 +00:00
Chris Lattner	514fc5b143	aha, DAE does have to think about PHI nodes. Many thanks to "Dr Evil" (aka Duncan) for pointing this out :) llvm-svn: 67212	2009-03-18 16:48:45 +00:00
Chris Lattner	595923ff75	Fix PR3826 - InstComb assert with vector shift, by not calling ComputeNumSignBits on a vector. llvm-svn: 67211	2009-03-18 16:32:19 +00:00
Chris Lattner	ab8022055a	add an assertion to make it clear that PHI nodes are not allowed. llvm-svn: 67210	2009-03-18 16:23:56 +00:00
Zhou Sheng	4e2af3cb55	Explicitly check for StoreInst, do not lose the chance to delete unused loads or bitcasts. llvm-svn: 67202	2009-03-18 12:48:48 +00:00

... 22 23 24 25 26 ...

7461 Commits