llvm-project

Commit Graph

Author	SHA1	Message	Date
Owen Anderson	3997a07fb9	More Chris-inspired JumpThreading fixes: use ConstantExpr to correctly constant-fold undef, and be more careful with its return value. This actually exposed an infinite recursion bug in ComputeValueKnownInPredecessors which theoretically already existed (in JumpThreading's handling of and/or of i1's), but never manifested before. This patch adds a tracking set to prevent this case. llvm-svn: 112589	2010-08-31 07:36:34 +00:00
Owen Anderson	b58b3c0dda	Fix a typo. llvm-svn: 112560	2010-08-30 23:59:30 +00:00
Owen Anderson	b974dbbdd7	Cleanups suggested by Chris. llvm-svn: 112553	2010-08-30 23:34:17 +00:00
Owen Anderson	c910acb54a	Re-apply r112539, being more careful to respect the return values of the constant folding methods. Additionally, use the ConstantExpr::get*() methods to simplify some constant folding. llvm-svn: 112550	2010-08-30 23:22:36 +00:00
Owen Anderson	30bacbdfdf	Add statistics to evaluate this pass. llvm-svn: 112545	2010-08-30 22:45:55 +00:00
Owen Anderson	1ddcbbe49c	Revert r112539. It accidentally introduced a miscompilation. llvm-svn: 112543	2010-08-30 22:33:41 +00:00
Owen Anderson	75f6037c7c	Fixes and cleanups pointed out by Chris. In general, be careful to handle 0 results from ComputeValueKnownInPredecessors (indicating undef), and re-use existing constant folding APIs. llvm-svn: 112539	2010-08-30 22:07:52 +00:00
Chris Lattner	c843fca2fd	rewrite DwarfEHPrepare to use SSAUpdater to promote its allocas instead of PromoteMemToReg. This allows it to stop using DF and DT, eliminating a computation of DT and DF from clang -O3. Clang is now down to 2 runs of DomFrontier. llvm-svn: 112457	2010-08-29 19:54:28 +00:00
Chris Lattner	f58382ed87	two changes: 1) make AliasSet hold the list of call sites with an assertingvh so we get a violent explosion if the pointer dangles. 2) Fix AliasSetTracker::deleteValue to remove call sites with by-pointer comparisons instead of by-alias queries. Using findAliasSetForCallSite can cause alias sets to get merged when they shouldn't, and can also miss alias sets when the call is readonly. #2 fixes PR6889, which only repros with a .c file :( llvm-svn: 112452	2010-08-29 18:42:23 +00:00
Chris Lattner	263f804699	LICM does get dead instructions input to it. Instead of sinking them out of loops, just delete them. llvm-svn: 112451	2010-08-29 18:22:25 +00:00
Chris Lattner	6ac0659a1c	use moveBefore instead of remove+insert, it avoids some symtab manipulation, so its faster (in addition to being more elegant) llvm-svn: 112450	2010-08-29 18:18:40 +00:00
Chris Lattner	f03b4eac48	revert 112448 for now. llvm-svn: 112449	2010-08-29 18:11:16 +00:00
Chris Lattner	11f8ad8211	optimize LICM::hoist to use moveBefore. Correct its updating of AST to remove the hoisted instruction from the AST, since it is no longer in the loop. llvm-svn: 112448	2010-08-29 18:03:33 +00:00
Chris Lattner	1a1ed69435	fix some bugs (found by inspection) where LICM would not update LICM correctly. When sinking an instruction, it should not add entries for the sunk instruction to the AST, it should remove the entry for the sunk instruction. The blocks being sunk to are not in the loop, so their instructions shouldn't be in the AST (yet)! llvm-svn: 112447	2010-08-29 18:00:00 +00:00
Chris Lattner	cc9cbc66a3	rework the ownership of subloop alias information: instead of keeping them around until the pass is destroyed, keep them around a) just when useful (not for outer loops) and b) destroy them right after we use them. This should reduce memory use and fixes potential bugs where a loop is deleted and another loop gets allocated to the same address. llvm-svn: 112446	2010-08-29 17:46:00 +00:00
Chris Lattner	bc1a65ac6c	apparently unswitch had the same "Feature". Stop its claims that it preserves domfrontier if it doesn't really. llvm-svn: 112445	2010-08-29 17:23:19 +00:00
Chris Lattner	d6f46b8af8	now that loop passes don't use DomFrontier, there is no reason for the unroller to pretend it supports updating it. It still has a horrible hack for DomTree. llvm-svn: 112444	2010-08-29 17:21:35 +00:00
Dan Gohman	002ff89cbd	Optionally rerun dedicated-register filtering after applying other filtering techniques, as those may allow it to filter out more obviously unprofitable candidates. llvm-svn: 112441	2010-08-29 16:39:22 +00:00
Dan Gohman	f031792cc6	Fix several areas in LSR to do a better job keeping the main LSRInstance data structures up to date. This fixes some pessimizations caused by stale data which will be exposed in an upcoming change. llvm-svn: 112440	2010-08-29 16:32:54 +00:00
Dan Gohman	e9e0873b08	Refactor the three main groups of code out of NarrowSearchSpaceUsingHeuristics into separate functions. llvm-svn: 112439	2010-08-29 16:09:42 +00:00
Dan Gohman	37a0f68036	Delete a bogus check. llvm-svn: 112438	2010-08-29 15:30:29 +00:00
Dan Gohman	b6a520d63c	Add some comments. llvm-svn: 112437	2010-08-29 15:27:08 +00:00
Dan Gohman	bf673e0652	Move this debug output into GenerateAllReuseFormula, to declutter the high-level logic. llvm-svn: 112436	2010-08-29 15:21:38 +00:00
Dan Gohman	d366b6d5c8	Delete an unused declaration. llvm-svn: 112435	2010-08-29 15:19:11 +00:00
Dan Gohman	4f13bbfefc	Do one lookup instead of two. llvm-svn: 112434	2010-08-29 15:18:49 +00:00
Chris Lattner	f94f6bb0ba	licm preserves the cfg, it doesn't have to explicitly say it preserves domfrontier. It does preserve AA though. llvm-svn: 112419	2010-08-29 07:02:56 +00:00
Chris Lattner	abe61ef3b4	now that it doesn't use the PromoteMemToReg function, LICM doesn't require DomFrontier. Dropping this doesn't actually save any runs of the pass though. llvm-svn: 112418	2010-08-29 06:49:44 +00:00
Chris Lattner	1dc98b47b5	completely rewrite the memory promotion algorithm in LICM. Among other things, this uses SSAUpdater instead of PromoteMemToReg. llvm-svn: 112417	2010-08-29 06:43:52 +00:00
Chris Lattner	9c3931a544	use getUniqueExitBlocks instead of a manual set. llvm-svn: 112412	2010-08-29 05:12:21 +00:00
Chris Lattner	85bf5421e1	reimplement LICM::sink to use SSAUpdater instead of PromoteMemToReg. This leads to much simpler code. llvm-svn: 112410	2010-08-29 04:55:06 +00:00
Chris Lattner	b50407f104	remove dead proto llvm-svn: 112408	2010-08-29 04:53:24 +00:00
Chris Lattner	cd96b4df56	reduce indentation in LICM::sink by using early exits, use getUniqueExitBlocks instead of getExitBlocks and a manual set to eliminate dupes. llvm-svn: 112405	2010-08-29 04:28:20 +00:00
Chris Lattner	188cc5a0fc	modernize this pass a bit: use efficient set/map and reduce indentation. llvm-svn: 112404	2010-08-29 04:23:04 +00:00
Chris Lattner	504e5100d3	remove the ABCD and SSI passes. They don't have any clients that I'm aware of, aren't maintained, and LVI will be replacing their value. nlewycky approved this on irc. llvm-svn: 112355	2010-08-28 03:51:24 +00:00
Chris Lattner	95bb297c26	squish dead code. llvm-svn: 112350	2010-08-28 03:21:03 +00:00
Benjamin Kramer	83f9ff0452	Update CMake build. Add newline at end of file. llvm-svn: 112332	2010-08-28 00:11:12 +00:00
Owen Anderson	cf7f941121	Add a prototype of a new peephole optimizing pass that uses LazyValue info to simplify PHIs and select's. This pass addresses the missed optimizations from PR2581 and PR4420. llvm-svn: 112325	2010-08-27 23:31:36 +00:00
Owen Anderson	99d4cb861b	Fix typos in comments. llvm-svn: 112286	2010-08-27 20:32:56 +00:00
Owen Anderson	6ebbd92380	Use LVI to eliminate conditional branches where we've tested a related condition previously. Update tests for this change. This fixes PR5652. llvm-svn: 112270	2010-08-27 17:12:29 +00:00
Owen Anderson	bd2ecc7e68	Make JumpThreading smart enough to properly thread StrSwitch when it's compiled with clang++. llvm-svn: 112198	2010-08-26 17:40:24 +00:00
Chris Lattner	8df99b523e	remove some llvmcontext arguments that are now dead post-refactoring. llvm-svn: 112104	2010-08-25 23:00:45 +00:00
Owen Anderson	7c853e877e	Turn LVI on, previously detected failures should be fixed now. llvm-svn: 111923	2010-08-24 17:21:18 +00:00
Owen Anderson	6ffa3f2aea	Turn LVI back off, I have a testcase now. llvm-svn: 111834	2010-08-23 19:59:27 +00:00
Owen Anderson	630add39a6	Re-enable LazyValueInfo. Monitoring for failures. llvm-svn: 111816	2010-08-23 18:12:23 +00:00
Owen Anderson	d31d82d75c	Now that PassInfo and Pass::ID have been separated, move the rest of the passes over to the new registration API. llvm-svn: 111815	2010-08-23 17:52:01 +00:00
Owen Anderson	aac8cbb261	Disable LVI while I evaluate a failure. llvm-svn: 111551	2010-08-19 19:47:08 +00:00
Owen Anderson	5c87dd55d3	Tentatively enabled LVI by default. I'll be monitoring for any failures. llvm-svn: 111543	2010-08-19 19:04:40 +00:00
Dan Gohman	129a816ee6	Process the step before the start, because it's usually the simpler of the two. llvm-svn: 111495	2010-08-19 01:02:31 +00:00
Owen Anderson	208636fa33	Inform LazyValueInfo whenever a block is deleted, to avoid dangling pointer issues. llvm-svn: 111382	2010-08-18 18:39:01 +00:00
Chris Lattner	3c603024bb	Fix PR7755: knowing something about an inval for a pred from the LHS should disable reconsidering that pred on the RHS. However, knowing something about the pred on the RHS shouldn't disable subsequent additions on the RHS from happening. llvm-svn: 111349	2010-08-18 03:14:36 +00:00
Chris Lattner	b45de95345	remove some dead code. llvm-svn: 111344	2010-08-18 02:41:56 +00:00
Chris Lattner	6aabb66139	remove dead prototype. llvm-svn: 111342	2010-08-18 02:37:06 +00:00
Dan Gohman	5047ca0c02	When rotating loops, put the original header at the bottom of the loop, making the resulting loop significantly less ugly. Also, zap its trivial PHI nodes, since it's easy. llvm-svn: 111255	2010-08-17 17:39:21 +00:00
Evan Cheng	8b637b177c	Add an option to disable codegen prepare critical edge splitting. In theory, PHI elimination is already doing all (most?) of the splitting needed. But machine-licm and machine-sink seem to miss some important optimizations when splitting is disabled. llvm-svn: 111224	2010-08-17 01:34:49 +00:00
Dan Gohman	89fdbaf99a	Instead of having CollectSubexpr's categorize operands as interesting or uninteresting, just put all the operands on one list and make GenerateReassociations make the decision about what's interesting. This is simpler, and it avoids an extra ScalarEvolution::getAddExpr call. llvm-svn: 111133	2010-08-16 15:50:00 +00:00
Dan Gohman	9b7632df26	Put add operands in ScalarEvolution-canonical order, when convenient. This isn't necessary, because ScalarEvolution sorts them anyway, but it's tidier this way. llvm-svn: 111132	2010-08-16 15:39:27 +00:00
Dan Gohman	4a63fad976	Teach SimplifyCFG how to simplify indirectbr instructions. - Eliminate redundant successors. - Convert an indirectbr with one successor into a direct branch. Also, generalize SimplifyCFG to be able to be run on a function entry block. It knows quite a few simplifications which are applicable to the entry block, and it only needs a few checks to avoid trouble with the entry block. llvm-svn: 111060	2010-08-14 00:29:42 +00:00
Dan Gohman	081ffcd00b	Fix LSR's ExtractImmediate and ExtractSymbol to avoid calling ScalarEvolution::getAddExpr, which can be pretty expensive, when nothing has changed, which is pretty common. llvm-svn: 111042	2010-08-13 21:17:19 +00:00
Chris Lattner	363226dfe8	fix PR7876: If ipsccp decides that a function's address is taken before it rewrites the code, we need to use that in the post-rewrite pass. llvm-svn: 110962	2010-08-12 22:25:23 +00:00
Owen Anderson	0398607714	Don't attempt the PRE inline asm calls, since we don't value number them yet. Fixes PR7835. llvm-svn: 110489	2010-08-07 00:20:35 +00:00
Owen Anderson	a7aed18624	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Nick Lewycky	5a2849e166	Fix uninitialized variable warning. Also move 'default' case next to a real case to help compiler optimize in non-Debug builds. No functionality change. llvm-svn: 110435	2010-08-06 07:43:46 +00:00
Owen Anderson	bda59bd247	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Owen Anderson	755aceb5d0	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Owen Anderson	4674dd6cf5	Give JumpThreading+LVI a long-form cl::opt so that it's easier to toggle the default. llvm-svn: 110384	2010-08-05 22:11:31 +00:00
Owen Anderson	9f2bca02d7	Experiments show that we can safely increase our unrolling threshold without unduly impacting code size, particularly since unrolling is not enabled at -Os. llvm-svn: 110233	2010-08-04 18:32:46 +00:00
Dan Gohman	ba81fc16a5	Fix whitespace. llvm-svn: 110223	2010-08-04 17:43:57 +00:00
Dan Gohman	839c972102	Fix a comment. llvm-svn: 110181	2010-08-04 01:16:35 +00:00
Peter Collingbourne	ddaaf40d24	Add an atomic lowering pass llvm-svn: 110113	2010-08-03 16:19:16 +00:00
Oscar Fuentes	40b31ad3ee	Prefix `next' iterator operation with `llvm::'. Fixes potential ambiguity problems on VS 2010. Patch by nobled! llvm-svn: 110029	2010-08-02 06:00:15 +00:00
Nick Lewycky	299c6dfcbf	Add missing newline to debug statement. llvm-svn: 109886	2010-07-30 20:27:01 +00:00
Gabor Greif	62f0aac99d	simplify by using CallSite constructors; virtually eliminates CallSite::get from the tree llvm-svn: 109687	2010-07-28 22:50:26 +00:00
Gabor Greif	0a970698da	use Value* constructor of CallSite to create potentially improper site, and test that llvm-svn: 109581	2010-07-28 14:28:18 +00:00
Gabor Greif	f159085414	recommit simplification (r109502, backed out r109509); seems to innocent llvm-svn: 109510	2010-07-27 16:44:23 +00:00
Gabor Greif	5f91b7cf3e	back out this too to restore the bots llvm-svn: 109509	2010-07-27 15:56:07 +00:00
Gabor Greif	7527b2ed5c	simplify llvm-svn: 109502	2010-07-27 13:31:22 +00:00
Owen Anderson	aa7f66ba67	Add an initial implementation of LazyValueInfo updating for JumpThreading. Disabled for now. llvm-svn: 109424	2010-07-26 18:48:03 +00:00
Dan Gohman	0141c13b22	Remove LCSSA's bogus dependence on LoopSimplify and LoopSimplify's bogus dependence on DominanceFrontier. Instead, add an explicit DominanceFrontier pass in StandardPasses.h to ensure that it gets scheduled at the right time. Declare that loop unrolling preserves ScalarEvolution, and shuffle some getAnalysisUsages. This eliminates one LoopSimplify and one LCCSA run in the standard compile opts sequence. llvm-svn: 109413	2010-07-26 18:11:16 +00:00
Dan Gohman	65b257c9d2	Use DominatorTree::properlyDominates instead of dominates with an explicit inequality check. llvm-svn: 109401	2010-07-26 17:37:36 +00:00
Dan Gohman	31f73ef210	A block dominates itself, by definition. llvm-svn: 109400	2010-07-26 17:35:32 +00:00
Gabor Greif	dde79d8f1a	mass elimination of reliance on automatic iterator dereferencing llvm-svn: 109103	2010-07-22 13:36:47 +00:00
Gabor Greif	3e44ea1917	undo 80 column trespassing I caused llvm-svn: 109092	2010-07-22 10:37:47 +00:00
Owen Anderson	a57b97e7e7	Fix batch of converting RegisterPass<> to INTIALIZE_PASS(). llvm-svn: 109045	2010-07-21 22:09:45 +00:00
Dan Gohman	12725c7d46	Remember that the induction variable is always a PHINode and use getIncomingValueForBlock instead of LoopInfo::getCanonicalInductionVariableIncrement. llvm-svn: 108865	2010-07-20 17:18:52 +00:00
Dan Gohman	efd7f9c360	Reorder the contents of various getAnalysisUsage functions, eliminating a redundant loopsimplify run from the default -O2 sequence. llvm-svn: 108539	2010-07-16 17:58:45 +00:00
Gabor Greif	6d673953e3	eliminate CallInst::ArgOffset llvm-svn: 108522	2010-07-16 09:38:02 +00:00
Dan Gohman	1415208292	Don't merge uses when they are targetting fixup sites with different widths. In a use with a narrower fixup, formulae may be wider than the fixup, in which case the high bits aren't necessarily meaningful, so it isn't safe to reuse them for uses with wider fixups. This fixes PR7618, though the testcase is too large for a reasonable regression test, since it heavily dependes on hitting LSR's heuristics in a certain way. llvm-svn: 108455	2010-07-15 20:24:58 +00:00
Dan Gohman	a1501b9c50	Use dbgs() instead of errs() in a DEBUG. llvm-svn: 108453	2010-07-15 20:12:42 +00:00
Dan Gohman	4afd412d6b	Watch out for a constant offset cancelling out a base register, forming a zero. This situation arrises in Fortran code with induction variables that start at 1 instead of 0. This fixes PR7651. llvm-svn: 108424	2010-07-15 15:14:45 +00:00
Duncan Sands	f88a284579	Handle the case of a tail recursion in which the tail call is followed by a return that returns a constant, while elsewhere in the function another return instruction returns a different constant. This is a special case of accumulator recursion, so just generalize the existing logic a bit. llvm-svn: 108241	2010-07-13 15:41:41 +00:00
Gabor Greif	a5fa885d47	cache results of operator* llvm-svn: 108142	2010-07-12 14:10:24 +00:00
Gabor Greif	782f62412f	cache dereferenced iterators llvm-svn: 108138	2010-07-12 12:03:02 +00:00
Gabor Greif	433b975fe2	recommit r108131 (hich has been backed out in r108135) with a fix llvm-svn: 108137	2010-07-12 12:02:10 +00:00
Gabor Greif	f9610827ce	back out r108131 (of TailDuplication.cpp) for now, it causes a buildbot failure llvm-svn: 108135	2010-07-12 11:32:39 +00:00
Gabor Greif	2a464d7308	cache dereferenced iterators llvm-svn: 108131	2010-07-12 10:36:48 +00:00
Duncan Sands	41b4a6b36a	Convert some tab stops into spaces. llvm-svn: 108130	2010-07-12 08:16:59 +00:00
Chris Lattner	bbc25ff5cc	if jump threading is able to infer interesting values on both the LHS and RHS of an and/or instruction, don't multiply add known predecessor values. This fixes the crash on testcase from PR7498 llvm-svn: 108114	2010-07-12 00:47:34 +00:00
Duncan Sands	82b21c086e	The accumulator tail recursion transform claims to work for any associative operation, but the way it's implemented requires the operation to also be commutative. So add a check for commutativity (and tweak the corresponding comments). This makes no difference in practice since every associative LLVM instruction is also commutative! Here's an example to show the need for commutativity: the accum_recursion.ll testcase calculates the factorial function. Before the transformation the result of a call is ((((11)2)3)...)x while afterwards it is (((1x)(x-1))...2)1 which clearly requires both associativity and commutativity of * to be equal to the original. llvm-svn: 108056	2010-07-10 20:31:42 +00:00
Gabor Greif	e82532a1c5	cache result of operator* llvm-svn: 107976	2010-07-09 15:40:10 +00:00
Gabor Greif	d323f5e161	cache result of operator* (found by inspection) llvm-svn: 107971	2010-07-09 14:48:08 +00:00
Gabor Greif	b0d56ffc85	cache result of operator* llvm-svn: 107969	2010-07-09 14:36:49 +00:00
Chris Lattner	efa3c824cc	Fix the second half of PR7437: scalarrepl wasn't preserving address spaces when SRoA'ing memcpy's. llvm-svn: 107846	2010-07-08 00:27:05 +00:00
Nick Lewycky	dace239949	Detabify this file. llvm-svn: 107637	2010-07-06 03:53:43 +00:00
Dan Gohman	832282e061	Don't claim to preserve AliasAnalysis. First, this is doesn't actually have any effect, and second, deleting stores can potentially invalidate an AliasAnalysis, and there's currently no notification for this. llvm-svn: 107496	2010-07-02 18:43:05 +00:00
Gabor Greif	74470192d7	use ArgOperand API llvm-svn: 107278	2010-06-30 12:42:43 +00:00
Gabor Greif	743b3fd196	use getArgOperand (corrected by CallInst::ArgOffset) instead of getOperand llvm-svn: 107273	2010-06-30 09:19:23 +00:00
Gabor Greif	f628ecd15f	use getNumArgOperands instead of getNumOperands llvm-svn: 107272	2010-06-30 09:17:53 +00:00
Gabor Greif	fe252e6fa0	use getArgOperand instead of getOperand llvm-svn: 107271	2010-06-30 09:16:16 +00:00
Gabor Greif	8ae3095286	use getArgOperand instead of getOperand llvm-svn: 107270	2010-06-30 09:15:28 +00:00
Gabor Greif	18c5bae727	employ CallInst::ArgOffset (for now) llvm-svn: 107015	2010-06-28 16:43:57 +00:00
Gabor Greif	4300fc77ae	use cached value llvm-svn: 107000	2010-06-28 11:20:42 +00:00
Chris Lattner	25a843fcd2	minor cleanup to SROA: when lowering type unsafe accesses to large integers, the first inserted value would always create an 'or X, 0'. Even though this is trivially zapped by instcombine, don't bother creating this pointless instruction. llvm-svn: 106979	2010-06-27 07:58:26 +00:00
Duncan Sands	3a5cb69cb8	Fix PR7328: when turning a tail recursion into a loop, need to preserve the returned value after the tail call if it differs from other return values. The optimal thing to do would be to introduce a phi node for the return value, but for the moment just fix the miscompile. llvm-svn: 106947	2010-06-26 12:53:31 +00:00
Dan Gohman	fb9712bdae	In GenerateReassociations, don't bother thinking about individual SCEVUnknown values which are loop-variant, as LSR can't do anything interesting with these values in any case. This fixes very slow compile times on loops which have large numbers of such values. llvm-svn: 106897	2010-06-25 22:32:18 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Gabor Greif	07e9284c75	use ArgOperand API; tighten type of handleFreeWithNonTrivialDependency to be able to use isFreeCall whithout a cast or new overload llvm-svn: 106823	2010-06-25 07:40:32 +00:00
Dan Gohman	963b1c142e	A few minor micro-optimizations. llvm-svn: 106764	2010-06-24 16:57:52 +00:00
Dan Gohman	47ddf76d89	Teach getExactSDiv to evaluate x/1 to x up front, as it's a common enough special case, and it theoretically allows more folding because it works even when x is unanalyzable. llvm-svn: 106763	2010-06-24 16:51:25 +00:00
Dan Gohman	ab5422200b	Fix copy+pasto issues in isMulSExtable. llvm-svn: 106759	2010-06-24 16:45:11 +00:00
Gabor Greif	91f9589057	use ArgOperand API; introduce downcasted pointers into scope to facilitate this llvm-svn: 106734	2010-06-24 12:03:56 +00:00
Gabor Greif	e2f482ca0b	use ArgOperand API llvm-svn: 106731	2010-06-24 10:42:46 +00:00
Gabor Greif	2d958d4db5	use ArgOperand API llvm-svn: 106730	2010-06-24 10:17:17 +00:00
Gabor Greif	5bcaa55761	use callsite to obtain all arguments llvm-svn: 106729	2010-06-24 10:04:07 +00:00
Gabor Greif	0f60709f0e	use getNumArgOperands llvm-svn: 106709	2010-06-24 00:48:48 +00:00
Gabor Greif	4a39b84a9d	use ArgOperand API llvm-svn: 106707	2010-06-24 00:44:01 +00:00
Devang Patel	0dc3c2d37e	Use ValueMap instead of DenseMap. The ValueMapper used by various cloning utility maps MDNodes also. llvm-svn: 106706	2010-06-24 00:33:28 +00:00
Dan Gohman	1081f1a0f5	Fix OptimizeMax to handle an odd case where one of the max operands is another max which folds. This fixes PR7454. llvm-svn: 106594	2010-06-22 23:07:13 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Dan Gohman	dd41bba517	Use A.append(...) instead of A.insert(A.end(), ...) when A is a SmallVector, and other SmallVector simplifications. llvm-svn: 106452	2010-06-21 19:47:52 +00:00
Dan Gohman	32655906e4	Add a TODO comment. llvm-svn: 106397	2010-06-19 21:30:18 +00:00
Dan Gohman	51d00092b6	Include the use kind along with the expression in the key of the use sharing map. The reconcileNewOffset logic already forces a separate use if the kinds differ, so incorporating the kind in the key means we can track more sharing opportunities. More sharing means fewer total uses to track, which means smaller problem sizes, which means the conservative throttles don't kick in as often. llvm-svn: 106396	2010-06-19 21:29:59 +00:00
Dan Gohman	297fb8b9fc	Don't include things in anonymous namespaces that don't need it. llvm-svn: 106395	2010-06-19 21:21:39 +00:00
Dan Gohman	f3aea7aecf	Disable indvars on loops when LoopSimplify form is not available. This fixes PR7333. llvm-svn: 106267	2010-06-18 01:35:11 +00:00
Rafael Espindola	a20e2dfe86	Make sure that simplify libcalls does not replace a call with one calling convention with a new call with a different calling convention. llvm-svn: 106134	2010-06-16 19:34:01 +00:00
Benjamin Kramer	a13bd20396	simplify-libcalls: fold strncmp(x, y, 1) -> memcmp(x, y, 1) The memcmp will be optimized further and even the pathological case 'strstr(x, "x") == x' generates optimal code now. llvm-svn: 106097	2010-06-16 10:30:29 +00:00
Benjamin Kramer	1118860e3a	simplify-libcalls: fold strstr(a, b) == a -> strncmp(a, b, strlen(b)) == 0 llvm-svn: 106047	2010-06-15 21:34:25 +00:00
Chris Lattner	329ea064ed	jump threading can't split a critical edge from an indirectbr. This fixes PR7356. llvm-svn: 105950	2010-06-14 19:45:43 +00:00
Benjamin Kramer	b82de426de	SimplifyCFG: don't turn volatile stores to null/undef into unreachable. Fixes PR7369. llvm-svn: 105914	2010-06-13 14:35:54 +00:00
Kenneth Uildriks	9b21208bfb	Pulled CodeMetrics out of InlineCost.h and made it a bit more general, so it can be reused from PartialSpecializationCost llvm-svn: 105725	2010-06-09 15:11:37 +00:00
Dan Gohman	67b4403101	Don't track users of undef values; they aren't interesting for register pressure. llvm-svn: 105501	2010-06-04 23:16:05 +00:00
Dan Gohman	826bdf8c10	Move FindAvailableLoadedValue isSafeToLoadUnconditionally out of lib/Transforms/Utils and into lib/Analysis so that Analysis passes can use them. llvm-svn: 104949	2010-05-28 16:19:17 +00:00
Benjamin Kramer	6877119ef3	Kill unneeded SExt. llvm-svn: 104692	2010-05-26 09:45:04 +00:00
Benjamin Kramer	9439084cea	Properly promote operands when optimizing a single-character memcmp. llvm-svn: 104648	2010-05-25 22:53:43 +00:00
Dan Gohman	9b48b856ea	DominatorTree.getNode can return null for unreachable blocks. llvm-svn: 104290	2010-05-20 22:46:54 +00:00
Dan Gohman	86110fa2bb	Minor code cleanups. llvm-svn: 104287	2010-05-20 22:25:20 +00:00
Dan Gohman	6295f2ebb8	Make Solve check its own post-condition, to reduce clutter in the top-level LSRInstance logic. llvm-svn: 104278	2010-05-20 20:59:23 +00:00
Dan Gohman	a4ca28a3ae	Add comments. llvm-svn: 104276	2010-05-20 20:52:00 +00:00
Dan Gohman	927bcaadda	More code cleanups. Use iterators instead of indices when indices aren't needed. llvm-svn: 104273	2010-05-20 20:33:18 +00:00
Dan Gohman	4c4043cf34	Fix OptimizeShadowIV to set Changed. Change OptimizeLoopTermCond to set Changed directly instead of using a return value. Rename FilterOutUndesirableDedicatedRegisters's Changed variable to distinguish it from LSRInstance's Changed member. llvm-svn: 104269	2010-05-20 20:05:31 +00:00
Dan Gohman	8ec018cedf	Add some comments. llvm-svn: 104268	2010-05-20 20:00:41 +00:00
Dan Gohman	8ce95cc3c5	Simplify this code. Don't do a DomTreeNode lookup for each visited block. llvm-svn: 104267	2010-05-20 20:00:25 +00:00
Dan Gohman	ab5fb7f559	Minor code cleanups. llvm-svn: 104263	2010-05-20 19:44:23 +00:00
Dan Gohman	ee2fea3cd7	When canonicalizing icmp operand order to put the loop invariant operand on the left, the interesting operand is on the right. This fixes a bug where LSR was failing to recognize ICmpZero uses, which led it to be unable to reverse the induction variable in the attached testcase. Delete test/CodeGen/X86/stack-color-with-reg-2.ll, because its test is extremely fragile and hard to meaningfully update. llvm-svn: 104262	2010-05-20 19:26:52 +00:00
Dan Gohman	fdf9874ba7	Set Changed to true when canonicalizing ICmp operand order; even though it isn't a very interesting change, it's a change nonetheless. llvm-svn: 104260	2010-05-20 19:16:03 +00:00
Dan Gohman	981563d0ba	Rename a variable to avoid shadowing. llvm-svn: 104234	2010-05-20 16:41:11 +00:00
Dan Gohman	6b733fc189	Minor code simplification. llvm-svn: 104232	2010-05-20 16:23:28 +00:00
Dan Gohman	80a9608442	Move the code for deleting BaseRegs and LSRUses into helper functions, and fix a bug that valgrind noticed where the code would std::swap an element with itself. llvm-svn: 104225	2010-05-20 15:17:54 +00:00
Dan Gohman	20fab456da	Teach LSR how to cope better with unrolled loops on targets where the addressing modes don't make this trivially easy. This allows it to avoid falling into the less precise heuristics in more cases. llvm-svn: 104186	2010-05-19 23:43:12 +00:00
Dan Gohman	beebef4137	Add a comment. llvm-svn: 104089	2010-05-18 23:55:57 +00:00
Dan Gohman	50f8f2c23d	Fix the predicate which checks for non-sensical formulae which have constants in registers which partially cancel out their immediate fields. llvm-svn: 104088	2010-05-18 23:48:08 +00:00
Dan Gohman	4cf99b5303	Factor out the code for recomputing an LSRUse's Regs set after some of its formulae have been removed into a helper function, and also teach it how to update the RegUseTracker. llvm-svn: 104087	2010-05-18 23:42:37 +00:00
Dan Gohman	a4eca05174	Factor out code for estimating search space complexity into a helper function. llvm-svn: 104082	2010-05-18 22:51:59 +00:00
Dan Gohman	63e9015248	Add some more debug output. llvm-svn: 104080	2010-05-18 22:41:32 +00:00
Dan Gohman	f1c7b1b42f	Factor out the code for deleting a formula from an LSRUse into a helper function. llvm-svn: 104079	2010-05-18 22:39:15 +00:00
Dan Gohman	8aca7ef903	Make some debug output more informative. llvm-svn: 104078	2010-05-18 22:37:37 +00:00
Dan Gohman	06ab08f795	Print an error message in Formula::print if the HasBaseReg flag is inconsistent with the BaseRegs field. It's not print's job to assert on an invalid condition, but it can make one more obvious. llvm-svn: 104077	2010-05-18 22:35:55 +00:00
Dan Gohman	248c41d108	Rename RegUseTracker's RegUses member to RegUsesMap to avoid confusion with LSRInstance's RegUses member. llvm-svn: 104076	2010-05-18 22:33:00 +00:00
Douglas Gregor	6739a89117	Fixes for Microsoft Visual Studio 2010, from Steven Watanabe! llvm-svn: 103457	2010-05-11 06:17:44 +00:00
Chris Lattner	84d4618659	make simplifycfg insert an llvm.trap before the 'unreachable' it introduces when it detects undefined behavior. llvm.trap generally codegens into some thing really small (e.g. a 2 byte ud2 instruction on x86) and debugging this sort of thing is "nontrivial". For example, we now compile: void foo() { (int)0 = 42; } into: _foo: pushl %ebp movl %esp, %ebp ud2 Some may even claim that this is a security hole, though that seems dubious to me. This addresses rdar://7958343 - Optimizing away null dereference potentially allows arbitrary code execution llvm-svn: 103356	2010-05-08 22:15:59 +00:00
Chris Lattner	5a62d6e578	Fix PR7052, patch by Jakub Staszak! llvm-svn: 103347	2010-05-08 20:01:44 +00:00
Dan Gohman	d0800241d2	When pruning candidate formulae out of an LSRUse, update the LSRUse's Regs set after all pruning is done, rather than trying to do it on the fly, which can produce an incomplete result. This fixes a case where heuristic pruning was stripping all formulae from a use, which led the solver to enter an infinite loop. Also, add a few asserts to diagnose this kind of situation. llvm-svn: 103328	2010-05-07 23:36:59 +00:00
Ted Kremenek	d90773ebe0	Update CMake build. llvm-svn: 103266	2010-05-07 17:13:20 +00:00
Dan Gohman	5d5b8b1b8c	Add an LLVM IR version of code sinking. This uses the same simple algorithm as MachineSink, but it isn't constrained by MachineInstr-level details. llvm-svn: 103257	2010-05-07 15:40:13 +00:00
Bob Wilson	0c8b29bcdb	Use the right version of "append" to combine two SmallVectors. This fixes the compile-time regressions seen in last night's tests. llvm-svn: 103118	2010-05-05 20:44:15 +00:00
Bob Wilson	a2fda8b648	Defer adding critical edges to the "toSplit" list until after checking for indirect branches in all the predecessors. This avoids unnecessarily splitting edges in cases where load PRE is not possible anyway. Thanks to Jakub Staszak for pointing this out. llvm-svn: 103034	2010-05-04 20:03:21 +00:00
Dan Gohman	1d2ded75e2	Use getConstant instead of getIntegerSCEV. The two are basically the same, now that getConstant has overloads consistent with ConstantInt::get. llvm-svn: 102965	2010-05-03 22:09:21 +00:00
Devang Patel	9f5200a122	Check for side effects before splitting loop. Patch by Jakub Staszak! llvm-svn: 102928	2010-05-03 18:06:58 +00:00
Chris Lattner	87aa2243e2	fix PR6940: sitofp(undef) folds to 0.0, not undef. llvm-svn: 102358	2010-04-26 18:21:23 +00:00
Dan Gohman	534ba376f6	Generalize LSR's OptimizeMax to handle the new kinds of max expressions that indvars may use, now that indvars is recognizing le and ge loops. llvm-svn: 102235	2010-04-24 03:13:44 +00:00
Dan Gohman	997bbc54d6	Fix LSR to tolerate cases where ScalarEvolution initially misses an opportunity to fold add operands, but folds them after LSR has separated them out. This fixes rdar://7886751. llvm-svn: 102157	2010-04-23 01:55:05 +00:00
Chris Lattner	4ba01ec869	refactor the interface to InlineFunction so that most of the in/out arguments are handled with a new InlineFunctionInfo class. This makes it easier to extend InlineFunction to return more info in the future. llvm-svn: 102137	2010-04-22 23:07:58 +00:00
Gabor Greif	27b3d55194	use abstract accessors to CallInst llvm-svn: 101899	2010-04-20 13:13:04 +00:00
Chris Lattner	66e809acc0	remove a bunch of ad-hoc code to simplify instructions from loop unswitch, and use inst simplify instead. It is more powerful and less duplication. llvm-svn: 101874	2010-04-20 05:33:18 +00:00
Chris Lattner	5814d9d9da	RewriteLoopBodyWithConditionConstant can end up rewriting the condition we're unswitching on. In this case, don't try to simplify the second copy of the loop which may be dead or not, but is probably a constant now. This fixes PR6879 llvm-svn: 101870	2010-04-20 05:09:16 +00:00
Dan Gohman	e637ff5e9a	Remove the Expr member from IVUsers. Instead of remembering the expression, just ask ScalarEvolution for it on demand. This helps IVUsers be more robust in the case of expressions changing underneath it. This fixes PR6862. llvm-svn: 101819	2010-04-19 21:48:58 +00:00
Eric Christopher	7258dcd77f	Revert 101465, it broke internal OpenGL testing. Probably the best way to know that all getOperand() calls have been handled is to replace that API instead of updating. llvm-svn: 101579	2010-04-16 23:37:20 +00:00
Dan Gohman	99e5327bfd	Refine the detection of seemingly infinitely recursive calls where the callee is expected to be expanded to something else by codegen, so that normal infinitely recursive calls are still transformed. llvm-svn: 101468	2010-04-16 15:57:50 +00:00
Gabor Greif	f375520f7b	reapply r101434 with a fix for self-hosting rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101465	2010-04-16 15:33:14 +00:00
Chris Lattner	bd2d9430d6	fix comment noticed by Bob llvm-svn: 101437	2010-04-16 02:32:17 +00:00
Gabor Greif	403e9694f9	back out r101423 and r101397, they break llvm-gcc self-host on darwin10 llvm-svn: 101434	2010-04-16 01:16:20 +00:00
Chris Lattner	1146d326a7	fix PR6832: we were using the alignment of a pointer when we wanted the alignment of the pointee. llvm-svn: 101432	2010-04-16 01:05:38 +00:00
Chris Lattner	b73552908e	improve comments. llvm-svn: 101429	2010-04-16 00:38:19 +00:00
Chris Lattner	78d7dbbc30	pull all the ConvertToScalarInfo code together into one place. llvm-svn: 101427	2010-04-16 00:24:57 +00:00
Chris Lattner	d69c3ee958	more refactoring: suck some stuff out of SRoA into ConvertToScalarInfo. llvm-svn: 101425	2010-04-16 00:20:00 +00:00
Gabor Greif	6af0ad846e	shift intrinsic operand llvm-svn: 101423	2010-04-16 00:06:45 +00:00
Chris Lattner	9ef4eae6e6	introduce a new ConvertToScalarInfo struct to simplify CanConvertToScalar/MergeInType. Eliminate a pointless LLVMContext argument to MergeInType. llvm-svn: 101422	2010-04-15 23:50:26 +00:00
Chris Lattner	9c1172d848	tidy interface to isOnlyCopiedFromConstantGlobal llvm-svn: 101405	2010-04-15 21:59:20 +00:00
Gabor Greif	33ae80bff7	reapply r101364, which has been backed out in r101368 with a fix rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101397	2010-04-15 20:51:13 +00:00
Dan Gohman	b29cda9b3c	Fix a bunch of namespace polution. llvm-svn: 101376	2010-04-15 17:08:50 +00:00
Gabor Greif	9fd00c7d25	back out r101364, as it trips the linux nightlybot on some clang C++ tests llvm-svn: 101368	2010-04-15 12:46:56 +00:00
Gabor Greif	aafd209632	rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101364	2010-04-15 10:49:53 +00:00
Gabor Greif	c08e5df836	performance: cache the dereferenced use_iterator llvm-svn: 101253	2010-04-14 16:48:56 +00:00
Gabor Greif	a49686fa3e	performance: cache the dereferenced use_iterator llvm-svn: 101250	2010-04-14 16:13:56 +00:00
Owen Anderson	b516f1c6cc	Remove SCCVN from the CMake build system. llvm-svn: 101125	2010-04-13 08:33:09 +00:00
Owen Anderson	9ed6abfe0b	SCCVN, we hardly knew ye! llvm-svn: 101117	2010-04-13 05:24:08 +00:00
Dan Gohman	5867a56db8	Teach IndVarSimplify how to eliminate remainder operators where the numerator is an induction variable. For example, with code like this: for (i=0;i<n;++i) x[i%n] = 0; IndVarSimplify will now recognize that i is always less than n inside the loop, and eliminate the remainder. llvm-svn: 101113	2010-04-13 01:46:36 +00:00
Dan Gohman	4a645b88ef	Suppress LinearFunctionTestReplace when the computed backedge-taken expression is a UDiv and it doesn't appear that the UDiv came from the user's source. ScalarEvolution has recently figured out how to compute a tripcount expression for the inner loop in SingleSource/Benchmarks/Shootout/sieve.c, using a udiv. Emitting a udiv instruction dramatically slows down the enclosing loop. llvm-svn: 101068	2010-04-12 21:13:43 +00:00
Dan Gohman	27c8e79839	Delete this code, which is no longer needed. llvm-svn: 101033	2010-04-12 08:00:22 +00:00
Dan Gohman	07f6563e81	Move the EliminateIVUsers call back out to its original location. Now that a ScalarEvolution bug with overflow handling is fixed, the normal analysis code will automatically decline to operate on the icmp instructions which are responsible for the loop exit. llvm-svn: 101032	2010-04-12 07:56:56 +00:00
Dan Gohman	15f90c294c	Use RecursivelyDeleteTriviallyDeadInstructions in EliminateIVComparisons, instead of deleting just the user. This makes it more consistent with other code in IndVarSimplify, and theoretically can eliminate more users earlier. llvm-svn: 101027	2010-04-12 07:29:15 +00:00
Dan Gohman	fa5ad797e3	Re-apply r101000, with a fix: Don't eliminate an icmp which is part of the loop exit test. This usually doesn't come up for a variety of reasons, but it isn't impossible, so make IndVarSimplify handle it conservatively. llvm-svn: 101008	2010-04-12 02:21:50 +00:00
Dan Gohman	c0f1efaf8d	Revert 101000, which is breaking self-host builds. llvm-svn: 101002	2010-04-12 00:17:10 +00:00
Dan Gohman	af4ab1b681	Teach IndVarSimplify how to eliminate comparisons involving induction variables. For example, with code like this: for (i=0;i<n;++i) if (i<n) x[i] = 0; IndVarSimplify will now recognize that i is always less than n inside the loop, and eliminate the if. llvm-svn: 101000	2010-04-11 23:10:12 +00:00
Dan Gohman	b50349a979	Rename isLoopGuardedByCond to isLoopEntryGuardedByCond, to emphasise that it's only testing for the entry condition, not full loop-invariant conditions. llvm-svn: 100979	2010-04-11 19:27:13 +00:00
Chris Lattner	9ae28b141f	fix PR6743, a case where we'd delete an instruction before using it in some cases. llvm-svn: 100937	2010-04-10 18:26:57 +00:00
Dan Gohman	607e02b33a	When determining a canonical insert position, don't climb deeper into adjacent loops. Also, ensure that the insert position is dominated by the loop latch of any loop in the post-inc set which has a latch. llvm-svn: 100906	2010-04-09 22:07:05 +00:00
Dan Gohman	42ec4eb351	When looking for loop-invariant users, look through no-op instructions, so that an unfortunately placed bitcast doesn't pin a value in a register. llvm-svn: 100883	2010-04-09 19:12:34 +00:00
Gabor Greif	ce6dd889ec	const-ize a predicate llvm-svn: 100856	2010-04-09 10:57:00 +00:00
Dan Gohman	d2df643ddb	Refactor the code for computing the insertion point for an expression into a separate function. llvm-svn: 100845	2010-04-09 02:00:38 +00:00
Chris Lattner	c6c153be45	fix a SCCP miscompilation that could happen when a forced constant is changed to a constant, we would end up adding the instruction to the wrong worklist, preventing it from being properly revisited. This fixes rdar://7832370 llvm-svn: 100837	2010-04-09 01:14:31 +00:00
Dan Gohman	9b5d0bb774	Avoid allocating a value of zero in a register if the initial formula inputs happen to negate each other. llvm-svn: 100828	2010-04-08 23:36:27 +00:00
Dan Gohman	4ce1fb1448	Add variants of ult, ule, etc. which take a uint64_t RHS, for convenience. llvm-svn: 100824	2010-04-08 23:03:40 +00:00
Dan Gohman	4506539d84	When expanding expressions which are using post-inc mode for multiple loops, ensure that the expansion is dominated by the increments of those loops. llvm-svn: 100748	2010-04-08 05:57:57 +00:00
Dan Gohman	d006ab90dd	Generalize IVUsers to track arbitrary expressions rather than expressions explicitly split into stride-and-offset pairs. Also, add the ability to track multiple post-increment loops on the same expression. This refines the concept of "normalizing" SCEV expressions used for to post-increment uses, and introduces a dedicated utility routine for normalizing and denormalizing expressions. This fixes the expansion of expressions which are post-increment users of more than one loop at a time. More broadly, this takes LSR another step closer to being able to reason about more than one loop at a time. llvm-svn: 100699	2010-04-07 22:27:08 +00:00
Gabor Greif	df323a51f5	performance: get rid of repeated dereferencing of use_iterator by caching its result llvm-svn: 100550	2010-04-06 19:32:30 +00:00
Chris Lattner	adca608281	fix a really nasty bug that Evan was tracking in SCCP. When resolving undefs in branches/switches, we have two cases: a branch on a literal undef or a branch on a symbolic value which is undef. If we have a literal undef, the code was correct: forcing it to a constant is the right thing to do. If we have a branch on a symbolic value that is undef, we should force the symbolic value to a constant, which then makes the successor block live. Forcing the condition of the branch to being a constant isn't safe if later paths become live and the value becomes overdefined. This is the case that 'forcedconstant' is designed to handle, so just use it. This fixes rdar://7765019 but there is no good testcase for this, the one I have is too insane to be useful in the future. llvm-svn: 100478	2010-04-05 22:14:48 +00:00
Chris Lattner	c832c1bf69	some code cleanups, use SwitchInst::findCaseValue, reduce indentation llvm-svn: 100468	2010-04-05 21:18:32 +00:00
Evan Cheng	ba930449a9	Code clean up. llvm-svn: 100467	2010-04-05 21:16:25 +00:00
Mon P Wang	c576ee9040	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Chris Lattner	ecb536313f	require that the branch being controlled by the IV exits the loop. With this information we can guarantee the iteration count of the loop is bounded by the compare. I think this xforms is finally safe now. llvm-svn: 100285	2010-04-03 07:21:39 +00:00
Chris Lattner	40060d33f6	add integer overflow check for the fp induction variable checker. Amusingly, we already had tests that we should have rejects because they would be miscompiled in the testsuite. The remaining issue with this is that we don't check that the branch causes us to exit the loop if it fails, so we don't actually know if we remain in bounds. llvm-svn: 100284	2010-04-03 07:18:48 +00:00
Chris Lattner	69913466cb	add a comment and fix some consistency issues, converting to a signed vs unsigned value depending on the sign of the constant fp means that we can't distinguish between a truly negative number and a positive number so large the 32nd bit is set. So, do don't this! llvm-svn: 100283	2010-04-03 06:41:49 +00:00
Chris Lattner	40ea690f39	fix PR6761, a miscompilation due to the fp->int IV conversion stuff. More bugs remain though. llvm-svn: 100282	2010-04-03 06:30:03 +00:00
Chris Lattner	42202868c3	just eliminate the uitofp checks. This code isn't doing the required validity checks in the first place, and supporting a condition large enough to require the 32'nd bit isn't worth it. llvm-svn: 100280	2010-04-03 06:25:21 +00:00
Chris Lattner	ca25b60f4e	rename PH -> PN to be consistent with WeakPN and the rest of llvm. llvm-svn: 100276	2010-04-03 06:17:08 +00:00
Chris Lattner	774858fc38	improve comment and drop a dead check. If PH had no uses, it would have been deleted by RecursivelyDeleteTriviallyDeadInstructions llvm-svn: 100275	2010-04-03 06:16:22 +00:00
Chris Lattner	915322bc4a	strength reduce a ridiculous use of APInt. llvm-svn: 100274	2010-04-03 06:13:12 +00:00
Chris Lattner	0b941347f9	rename stuff improve comment grammar. llvm-svn: 100273	2010-04-03 06:11:07 +00:00
Chris Lattner	d77bde5f94	simplify some code and resolve a fixme. llvm-svn: 100272	2010-04-03 06:06:59 +00:00
Chris Lattner	2ff33f91d5	There is no guarantee that the increment and the branch are in the same block. Insert the new increment in the correct location. Also, more cleanups. llvm-svn: 100271	2010-04-03 06:05:10 +00:00
Chris Lattner	c558b49f14	first half of a pass through IndVarSimplify::HandleFloatingPointIV, this cleans up a bunch of code and also fixes several crashes and miscompiles. More to come unfortunately, this optimization is quite broken. llvm-svn: 100270	2010-04-03 05:54:59 +00:00
Evan Cheng	ed66db3f9b	Code refactoring. llvm-svn: 100262	2010-04-03 02:23:43 +00:00
Mon P Wang	999c1b927b	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a972ab8564	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Dan Gohman	f7239102fe	Manually notify ScalarEvolution before making an operand replacement, since it can't currently observe such changes automatically. llvm-svn: 100186	2010-04-02 14:48:31 +00:00
Gabor Greif	5d5db5342b	Introduce ImmutableCallSite, useful for contexts where no mutation is necessary. Inherits from new templated baseclass CallSiteBase<> which is highly customizable. Base CallSite on it too, in a configuration that allows full mutation. Adapt some call sites in analyses to employ ImmutableCallSite. llvm-svn: 100100	2010-04-01 08:21:08 +00:00
Dale Johannesen	b67a6e6620	Fix a nasty dangling-pointer heisenbug that could generate wrong code pretty much anywhere AFAICT. A case that hits the bug reproducibly is impossible, but the situation was like this: Addr = ... Store -> Addr Addr2 = GEP , 0, 0 Store -> Addr2 Handling the first store, the code changed replaced Addr with a sunkaddr and deleted Addr, but not its table entry. Code in OptimizedBlock replaced Addr2 with a bitcast; if that happened to reuse the memory of Addr, the old table entry was erroneously found when handling the second store. llvm-svn: 100044	2010-03-31 20:37:15 +00:00
Bob Wilson	6f7fd28824	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	7460571381	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Jeffrey Yasskin	12fd516e51	Remove another memory leak from ABCD by using Edges by value instead of pointer. There was also a SmallPtrSet whose settiness wasn't being used, so I changed it to a SmallVector. llvm-svn: 99713	2010-03-27 09:09:17 +00:00
Jeffrey Yasskin	97e613b6da	In ABCD, change the non-null Bound*s to Bound&s. llvm-svn: 99711	2010-03-27 08:15:46 +00:00
Jeffrey Yasskin	33bc7e4cb5	Fix a memory leak in ABCD by giving ownership of Bound objects to the MemoizedResultChart. llvm-svn: 99710	2010-03-27 08:09:24 +00:00
Dan Gohman	d42e09d91e	Ignore debug intrinsics in yet more places. llvm-svn: 99580	2010-03-26 00:33:27 +00:00
Gabor Greif	c78d720f02	rename use_const_iterator to const_use_iterator for consistency's sake llvm-svn: 99564	2010-03-25 23:06:16 +00:00
Chris Lattner	0563804982	fix PR6642, GVN forwarding from memset to load of the base of the memset. llvm-svn: 99488	2010-03-25 05:58:19 +00:00
Evan Cheng	c12c2d9bb4	Move OptChkCall off LibCallOptimization into StrCpyOpt. llvm-svn: 99418	2010-03-24 20:19:04 +00:00
Gabor Greif	a2fbc0ae1b	Finally land the InvokeInst operand reordering. I have audited all getOperandNo calls now, fixing hidden assumptions. CallSite related uglyness will be eliminated successively. Note this patch has a long and griveous history, for all the back-and-forths have a look at CallSite.h's log. llvm-svn: 99399	2010-03-24 13:21:49 +00:00
Gabor Greif	9027ffb918	increase const goodness and remove pointless getUser() calls llvm-svn: 99395	2010-03-24 10:29:52 +00:00
Bill Wendling	04803e8ef6	Skip debugging intrinsics when sinking unused invariants. llvm-svn: 99324	2010-03-23 21:15:59 +00:00
Evan Cheng	d9e822345c	Teach simplify libcall to transform __strcpy_chk to __memcpy_chk to enable optimizations down stream. llvm-svn: 99282	2010-03-23 15:48:04 +00:00
Gabor Greif	e1517a084f	backing out r99170 because it still fails on clang-x86_64-darwin10-fnt llvm-svn: 99171	2010-03-22 09:11:00 +00:00
Gabor Greif	7a743e15e3	Now that hopefully all direct accesses to InvokeInst operands are fixed we can reapply the InvokeInst operand reordering patch. (see r98957). llvm-svn: 99170	2010-03-22 08:28:00 +00:00
Dan Gohman	1a2abe5580	Clear the SCEVExpander's insertion point after making deletions, so that the SCEVExpander doesn't retain a dangling pointer as its insert position. The dangling pointer in this case wasn't ever used to insert new instructions, but it was causing trouble with SCEVExpander's code for automatically advancing its insert position past debug intrinsics. This fixes use-after-free errors that valgrind noticed in test/Transforms/IndVarSimplify/2007-06-06-DeleteDanglesPtr.ll and test/Transforms/IndVarSimplify/exit_value_tests.ll. llvm-svn: 99036	2010-03-20 03:53:53 +00:00
Gabor Greif	6c56ed847e	back out r98957, it broke http://smooshlab.apple.com:8010/builders/clang-x86_64-darwin10-fnt/builds/703 in the nightly test suite llvm-svn: 98958	2010-03-19 13:50:02 +00:00
Gabor Greif	8335f9c0bf	Recommit r80858 again (which has been backed out in r80871). This time I did a self-hosted bootstrap on Linux x86-64, with no problems. Let's see how darwin 64-bit self-hosting goes. At the first sign of failure I'll back this out. Maybe the valgrind bots give me a hint of what may be wrong (it at all). llvm-svn: 98957	2010-03-19 11:55:53 +00:00
Benjamin Kramer	f2e4b5dd7f	str[r]chr returns its pointer argument so we cannot mark it as nocapture. Thanks to Duncan for spotting my mistake. llvm-svn: 98671	2010-03-16 20:33:15 +00:00
Benjamin Kramer	5cf5fd2ffa	Mark str[r]chr readonly. llvm-svn: 98663	2010-03-16 19:36:43 +00:00
Devang Patel	45c1505bf6	Skip debug info intrinsics. llvm-svn: 98584	2010-03-15 22:23:03 +00:00
Devang Patel	d3f41e8939	In "empty" bb, the return instruction may not be first instruction, if dbg value intrinsics are present in this bb. Use terminator to find return instructions. llvm-svn: 98565	2010-03-15 19:05:46 +00:00
Bill Wendling	55e69d179b	Skip over debug info when trying to merge two return BBs. llvm-svn: 98491	2010-03-14 10:40:55 +00:00
Benjamin Kramer	7b88a49f3e	Factor checked library call optimization into a common helper class and use it to unify the almost identical code in CodeGenPrepare and InstCombineCalls. llvm-svn: 98338	2010-03-12 09:27:41 +00:00
Nate Begeman	2e41605d4f	Whoops this already existed. llvm-svn: 98297	2010-03-11 23:21:19 +00:00
Nate Begeman	5daa235c91	Add a handful of additional useful pass manager things to the C API llvm-svn: 98296	2010-03-11 23:06:07 +00:00
Benjamin Kramer	2fc395659c	stpcpy is so similar to strcpy, it doesn't deserve a complete copy of the __strcpy_chk -> strcpy code. llvm-svn: 98284	2010-03-11 20:45:13 +00:00
Eric Christopher	607de1de53	Lower stpcpy_chk when possible. llvm-svn: 98274	2010-03-11 19:24:34 +00:00
Eric Christopher	4b7948e09e	Do some final lowering in CodeGenPrepare of _chk calls similar to that in InstCombineCalls. More call lowering needed. llvm-svn: 98228	2010-03-11 02:41:03 +00:00
Dan Gohman	2734ebd37f	Add a DominatorTree argument to isLCSSA so that it doesn't have to compute a set of reachable blocks for itself each time it is called, which is fairly frequently. llvm-svn: 98179	2010-03-10 19:38:49 +00:00
Eric Christopher	a7fb58f5f5	Migrate _chk call lowering from SimplifyLibCalls to InstCombine. Stub out the remainder of the calls that we should lower in some way and move the tests to the new correct directory. Fix up tests that are now optimized more than they were before by -instcombine. llvm-svn: 97875	2010-03-06 10:50:38 +00:00
Eric Christopher	87abfc506f	Move SimplifyLibCalls's LibCall builders to a separate file so they can be used in more places. Add an argument for the TargetData that most of them need. Update for the getInt8PtrTy() change. Should be no functionality change. llvm-svn: 97844	2010-03-05 22:25:30 +00:00
Evan Cheng	d214ed0e75	Safely turn memset_chk etc. to non-chk variant if the known object size is >= memset / memcpy / memmove size. llvm-svn: 97828	2010-03-05 20:59:47 +00:00
Chris Lattner	c6c1523f59	fix a nice subtle reassociate bug which would only occur in a very specific use pattern embodied in the carefully reduced testcase. llvm-svn: 97794	2010-03-05 07:18:54 +00:00
Eric Christopher	4899cbc77d	Move GetStringLength and helper from SimplifyLibCalls to ValueTracking. No functionality change. llvm-svn: 97793	2010-03-05 06:58:57 +00:00
Dan Gohman	29707de4fe	Make SCEVExpander and LSR more aggressive about hoisting expressions out of loops. llvm-svn: 97642	2010-03-03 05:29:13 +00:00
Dan Gohman	52f5563973	Non-affine post-inc SCEV expansions have more code which must be emitted after the increment. Make sure the insert position reflects this. This fixes PR6453. llvm-svn: 97537	2010-03-02 01:59:21 +00:00
Bob Wilson	0fd415820b	Don't attempt load PRE when there is no real redundancy (i.e., the load is in a loop and is itself the only dependency). llvm-svn: 97526	2010-03-02 00:09:29 +00:00
Bob Wilson	892432b7ef	When GVN needs to split critical edges for load PRE, check all of the predecessors before returning. Otherwise, if multiple predecessor edges need splitting, we only get one of them per iteration. This makes a small but measurable compile time improvement with -enable-full-load-pre. llvm-svn: 97521	2010-03-01 23:37:32 +00:00
Evan Cheng	7263cf8431	MemoryDepAnalysis is not used if redundant load processing is disabled. llvm-svn: 97512	2010-03-01 22:23:12 +00:00
Dan Gohman	8b0a419eb1	Spelling fixes. llvm-svn: 97453	2010-03-01 17:49:51 +00:00
Bob Wilson	1136166ee9	Revert r97245 which seems to be causing performance problems. llvm-svn: 97366	2010-02-28 05:34:05 +00:00
Chris Lattner	2af7e3dceb	fix grammaro's pointed out by daniel llvm-svn: 97313	2010-02-27 07:50:40 +00:00
Chris Lattner	d887f1da73	fix PR6414, a nondeterminism issue in IPSCCP which was because of a subtle interation in a loop operating in densemap order. llvm-svn: 97288	2010-02-27 00:07:42 +00:00
Bob Wilson	ed1b0c31a7	Move the EnableFullLoadPRE flag from a separate command-line option to an argument of createGVNPass and set it automatically for -O3. llvm-svn: 97245	2010-02-26 19:09:47 +00:00
Bob Wilson	d4655991c3	Remove unused "NoPRE" parameter in GVN and createGVNPass(). llvm-svn: 97235	2010-02-26 18:35:19 +00:00
Dan Gohman	a9c205cc88	Make LoopSimplify change conditional branches in loop exiting blocks which branch on undef to branch on a boolean constant for the edge exiting the loop. This helps ScalarEvolution compute trip counts for loops. Teach ScalarEvolution to recognize single-value PHIs, when safe, and ForgetSymbolicName to forget such single-value PHI nodes as apprpriate in ForgetSymbolicName. llvm-svn: 97126	2010-02-25 06:57:05 +00:00
Daniel Dunbar	693ea89214	Reapply r97010, the speculative revert failed. llvm-svn: 97036	2010-02-24 08:48:04 +00:00
Daniel Dunbar	0a2031e5b6	Speculatively revert r97010, "Add an argument to PHITranslateValue to specify the DominatorTree. ...", in hopes of restoring poor old PPC bootstrap. llvm-svn: 97027	2010-02-24 06:55:22 +00:00
Bob Wilson	66e58ac742	Add an argument to PHITranslateValue to specify the DominatorTree. If this argument is non-null, pass it along to PHITranslateSubExpr so that it can prefer using existing values that dominate the PredBB, instead of just blindly picking the first equivalent value that it finds on a uselist. Also when the DominatorTree is specified, have PHITranslateValue filter out any result that does not dominate the PredBB. This is basically just refactoring the check that used to be in GetAvailablePHITranslatedSubExpr and also in GVN. Despite my initial expectations, this change does not affect the results of GVN for any testcases that I could find, but it should help compile time. Before this change, if PHITranslateSubExpr picked a value that does not dominate, PHITranslateWithInsertion would then insert a new value, which GVN would later determine to be redundant and would replace. By picking a good value to begin with, we save GVN the extra work of inserting and then replacing a new value. llvm-svn: 97010	2010-02-24 01:39:00 +00:00
Bob Wilson	923261bbe9	Update memdep when load PRE inserts a new load, and add some debug output. I don't have a small testcase for this. llvm-svn: 96890	2010-02-23 05:55:00 +00:00
Bob Wilson	1da9041913	Erase deleted instructions from GVN's ValueTable. This fixes assertion failures from ValueTable::verifyRemoved() when using -debug. llvm-svn: 96805	2010-02-22 21:39:41 +00:00
Dan Gohman	8c16b38262	Remove unused variables and parameters. llvm-svn: 96780	2010-02-22 04:11:59 +00:00
Dan Gohman	4506fcb3c2	When emitting an instruction which depends on both a post-incremented induction variable value and a loop-variant value, don't force the insert position to be at the post-increment position, because it may not be dominated by the loop-variant value. This fixes a use-before-def problem noticed on PPC. llvm-svn: 96774	2010-02-22 03:59:54 +00:00
Dan Gohman	740909be2d	This cast<Instruction> is unnecessary. llvm-svn: 96771	2010-02-22 02:07:36 +00:00
Dan Gohman	4eebb94094	Rename getSDiv to getExactSDiv to reflect its behavior in cases where the division would have a remainder. llvm-svn: 96693	2010-02-19 19:35:48 +00:00
Dan Gohman	85af256779	Check for overflow when scaling up an add or an addrec for scaled reuse. llvm-svn: 96692	2010-02-19 19:32:49 +00:00
Dale Johannesen	1d6827adef	recommit 96626, evidence that it broke things appears to be spurious llvm-svn: 96662	2010-02-19 07:14:22 +00:00
Dale Johannesen	1f790c28d0	Revert 96626, which causes build failure on ppc Darwin. llvm-svn: 96653	2010-02-19 01:54:37 +00:00
Dan Gohman	2446f57503	When determining the set of interesting reuse factors, consider strides in foreign loops. This helps locate reuse opportunities with existing induction variables in foreign loops and reduces the need for inserting new ones. This fixes rdar://7657764. llvm-svn: 96629	2010-02-19 00:05:23 +00:00
Dan Gohman	60b3326435	Indvars needs to explicitly notify ScalarEvolution when it is replacing a loop exit value, so that if a loop gets deleted, ScalarEvolution isn't stick holding on to dangling SCEVAddRecExprs for that loop. This fixes PR6339. llvm-svn: 96626	2010-02-18 23:26:33 +00:00
Dan Gohman	c43d264cc0	Hoist this loop-invariant logic out of the loop. llvm-svn: 96614	2010-02-18 21:34:02 +00:00
Dan Gohman	13ac3b2139	Delete some unneeded casts. llvm-svn: 96429	2010-02-17 00:42:19 +00:00
Dan Gohman	5f10d6c52c	Don't attempt to divide INT_MIN by -1; consider such cases to have overflowed. llvm-svn: 96428	2010-02-17 00:41:53 +00:00
Bob Wilson	aff96b2132	Rename SuccessorNumber to GetSuccessorNumber. llvm-svn: 96387	2010-02-16 21:06:42 +00:00
Dan Gohman	6deab96c81	Refactor rewriting for PHI nodes into a separate function. llvm-svn: 96382	2010-02-16 20:25:07 +00:00
Bob Wilson	92cdb6eec5	Split critical edges as needed for load PRE. llvm-svn: 96378	2010-02-16 19:51:59 +00:00
Bob Wilson	3de492ec35	Refactor to share code to find the position of a basic block successor in the terminator's list of successors. llvm-svn: 96377	2010-02-16 19:49:17 +00:00
Dan Gohman	0849ed5e26	Fix whitespace. llvm-svn: 96372	2010-02-16 19:42:34 +00:00
Duncan Sands	19d0b47b1f	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
Dan Gohman	521efe68ab	Split the main for-each-use loop again, this time for GenerateTruncates, as it also peeks at which registers are being used by other uses. This makes LSR less sensitive to use-list order. llvm-svn: 96308	2010-02-16 01:42:53 +00:00
Duncan Sands	9dff9bec31	Uniformize the names of type predicates: rather than having isFloatTy and isInteger, we now have isFloatTy and isIntegerTy. Requested by Chris! llvm-svn: 96223	2010-02-15 16:12:20 +00:00
Dan Gohman	e4e51a63da	Fix whitespace. llvm-svn: 96179	2010-02-14 18:51:39 +00:00
Dan Gohman	e7f74bb16c	Fix a comment. llvm-svn: 96178	2010-02-14 18:51:20 +00:00
Dan Gohman	bb7d52213c	When complicated expressions are broken down into subexpressions with multiplication by constants distributed through, occasionally those subexpressions can include both x and -x. For now, if this condition is discovered within LSR, just prune such cases away, as they won't be profitable. This fixes a "zero allocated in a base register" assertion failure. llvm-svn: 96177	2010-02-14 18:50:49 +00:00
Dan Gohman	2d0f96d49a	Actually, this code doesn't have to be quite so conservative in the no-TLI case. But it should still default to declining the transformation. llvm-svn: 96152	2010-02-14 03:21:49 +00:00
Dan Gohman	cb76a806f0	Don't attempt aggressive post-inc uses if TargetLowering is not available, because profitability can't be sufficiently approximated. llvm-svn: 96148	2010-02-14 02:45:21 +00:00
John McCall	0daaf13b97	Make LSR not crash if invoked without target lowering info, e.g. if invoked from opt. llvm-svn: 96135	2010-02-13 23:40:16 +00:00
Chris Lattner	b8639bc2d1	remove dead code. llvm-svn: 96109	2010-02-13 19:07:06 +00:00
Chris Lattner	42c66b7270	Split some code out to a helper function (FindReusablePredBB) and add a doxygen comment. Cache the phi entry to avoid doing tons of PHINode::getBasicBlockIndex calls in the common case. On my insane testcase from re2c, this speeds up CGP from 617.4s to 7.9s (78x). llvm-svn: 96083	2010-02-13 05:35:08 +00:00
Chris Lattner	96b8826542	speed up CGP a bit by scanning predecessors through phi operands instead of with pred_begin/end. llvm-svn: 96078	2010-02-13 04:04:42 +00:00
Dan Gohman	5b18f039eb	Fix a pruning heuristic which implicitly assumed that SmallPtrSet is deterministically sorted. llvm-svn: 96071	2010-02-13 02:06:02 +00:00
Dan Gohman	2b75de97c0	Reapply 95979, a compile-time speedup, now that the bug it exposed is fixed. llvm-svn: 96005	2010-02-12 19:35:25 +00:00
Dan Gohman	363f847ec6	Fix this code to avoid dereferencing an end() iterator in offset distributions it doesn't expect. llvm-svn: 96002	2010-02-12 19:20:37 +00:00
Daniel Dunbar	e0b2c69d3c	Revert "Reverse the order for collecting the parts of an addrec. The order", it is breaking llvm-gcc bootstrap. llvm-svn: 95988	2010-02-12 17:27:08 +00:00
Dan Gohman	0194f58047	Reverse the order for collecting the parts of an addrec. The order doesn't matter, except that ScalarEvolution tends to need less time to fold the results this way. llvm-svn: 95979	2010-02-12 11:08:26 +00:00
Dan Gohman	45774ce0ad	Reapply the new LoopStrengthReduction code, with compile time and bug fixes, and with improved heuristics for analyzing foreign-loop addrecs. This change also flattens IVUsers, eliminating the stride-oriented groupings, which makes it easier to work with. llvm-svn: 95975	2010-02-12 10:34:29 +00:00
Chris Lattner	c053cbbc4d	Make DSE only scan blocks that are reachable from the entry block. Other blocks may have pointer cycles that will crash basicaa and other alias analyses. In any case, there is no point wasting cycles optimizing dead blocks. This fixes rdar://7635088 llvm-svn: 95852	2010-02-11 05:11:54 +00:00
Chris Lattner	d924f63692	Make jump threading honor x\|undef -> true and x&undef -> false, instead of considering x\|undef -> x, which may not be true. llvm-svn: 95850	2010-02-11 04:40:44 +00:00
Devang Patel	03936a1880	Ignore dbg info intrinsics. llvm-svn: 95828	2010-02-11 00:20:49 +00:00
Dan Gohman	4a618827de	Fix "the the" and similar typos. llvm-svn: 95781	2010-02-10 16:03:48 +00:00
Eric Christopher	ad1aa86276	Pull these back out, they're a little too aggressive and time consuming for a simple optimization. llvm-svn: 95671	2010-02-09 17:29:18 +00:00
Eric Christopher	be2f0b2b7b	Add file in here too. llvm-svn: 95641	2010-02-09 01:11:03 +00:00
Eric Christopher	9f85e7eb16	Add a new pass to do llvm.objsize lowering using SCEV. Initial skeleton and SCEVUnknown lowering implemented, the rest should come relatively quickly. Move testcase to new directory. Move pass to right before SimplifyLibCalls - which is moved down a bit so we can take advantage of a few opts. llvm-svn: 95628	2010-02-09 00:35:38 +00:00
Jakob Stoklund Olesen	5f9ead2714	Don't unroll loops containing function calls. llvm-svn: 95454	2010-02-05 23:21:31 +00:00
Jakob Stoklund Olesen	916f48a054	Teach SimplifyCFG about magic pointer constants. Weird code sometimes uses pointer constants other than null. This patch teaches SimplifyCFG to build switch instructions in those cases. Code like this: void f(const char x) { if (!x) puts("null"); else if ((uintptr_t)x == 1) puts("one"); else if (x == (char)2 \|\| x == (char)3) puts("two"); else if ((intptr_t)x == 4) puts("four"); else puts(x); } Now becomes a switch: define void @f(i8 %x) nounwind ssp { entry: %magicptr23 = ptrtoint i8* %x to i64 ; <i64> [#uses=1] switch i64 %magicptr23, label %if.else16 [ i64 0, label %if.then i64 1, label %if.then2 i64 2, label %if.then9 i64 3, label %if.then9 i64 4, label %if.then14 ] Note that LLVM's own DenseMap uses magic pointers. llvm-svn: 95439	2010-02-05 22:03:18 +00:00
Dan Gohman	4739e41ce9	Implement releaseMemory in CodeGenPrepare and free the BackEdges container data. This prevents it from holding onto dangling pointers and potentially behaving unpredictably. llvm-svn: 95409	2010-02-05 19:24:11 +00:00
Bob Wilson	27dfb1e1a4	Do not reassociate expressions with i1 type. SimplifyCFG converts some short-circuited conditions to AND/OR expressions, and those expressions are often converted back to a short-circuited form in code gen. The original source order may have been optimized to take advantage of the expected values, and if we reassociate them, we change the order and subvert that optimization. Radar 7497329. llvm-svn: 95333	2010-02-04 23:32:37 +00:00
Bob Wilson	04365c5f72	Adjust the heuristics used to decide when SROA is likely to be profitable. The SRThreshold value makes perfect sense for checking if an entire aggregate should be promoted to a scalar integer, but it is not so good for splitting an aggregate into its separate elements. A struct may contain a large embedded array along with some scalar fields that would benefit from being split apart by SROA. Even if the total aggregate size is large, it may still be good to perform SROA. Thus, the most important piece of this patch is simply moving the aggregate size comparison vs. SRThreshold so that it guards only the aggregate promotion. We have also been checking the number of elements to decide if an aggregate should be split up. The limit of "SRThreshold/4" seemed rather arbitrary, and I don't think it's very useful to derive this limit from SRThreshold anyway. I've collected some data showing that the current default limit of 32 (since SRThreshold defaults to 128) is a reasonable cutoff for struct types. One thing suggested by the data is that distinguishing between structs and arrays might be useful. There are (obviously) a lot more large arrays than large structs (as measured by the number of elements and not the total size -- a large array inside a struct still counts as a single element given the way we do SROA right now). Out of 8377 arrays where we successfully performed SROA while compiling a large set of benchmarks, only 16 of them had more than 8 elements. And, for those 16 arrays, it's not at all clear that SROA was actually beneficial. So, to offset the compile time cost of investigating more large structs for SROA, the patch lowers the limit on array elements to 8. This fixes Apple Radar 7563690. llvm-svn: 95224	2010-02-03 17:23:56 +00:00
Evan Cheng	27a41d5473	Revert 94937 and move the noreturn check to codegen. llvm-svn: 95198	2010-02-03 03:55:59 +00:00
Bob Wilson	76e8c59509	Fix some comment typos. llvm-svn: 95170	2010-02-03 00:33:21 +00:00
Eric Christopher	d86233c118	Recommit this, looks like it wasn't the cause. llvm-svn: 95165	2010-02-03 00:21:58 +00:00
Eric Christopher	e67d01a9a8	Hopefully temporarily revert this. llvm-svn: 95154	2010-02-02 23:01:31 +00:00
Eric Christopher	4264e7e46f	Re-add strcmp and known size object size checking optimization. Passed bootstrap and nightly test run here. llvm-svn: 95145	2010-02-02 22:10:43 +00:00
Chris Lattner	302240d73e	fix a crash in loop unswitch on a loop invariant vector condition. llvm-svn: 95055	2010-02-02 02:26:54 +00:00
Eric Christopher	14dfc3f6df	Don't need to check the last argument since it'll always be bool. We also don't use TargetData here. llvm-svn: 95040	2010-02-02 00:51:45 +00:00
Eric Christopher	9afa973203	More indentation/tabification fixes. llvm-svn: 95036	2010-02-02 00:13:06 +00:00
Eric Christopher	1408234753	Untabify previous commit. llvm-svn: 95035	2010-02-02 00:06:55 +00:00
Eric Christopher	56e4182c49	Formatting. llvm-svn: 95027	2010-02-01 23:25:03 +00:00
Bob Wilson	d517b52012	Add an option to GVN to remove all partially redundant loads. This is currently disabled by default. This divides the existing load PRE code into 2 phases: first it checks that it is safe to move the load to each of the predecessors where it is unavailable, and then if it is safe, the code is changed to move the load. Radar 7571861. llvm-svn: 95007	2010-02-01 21:17:14 +00:00
Evan Cheng	d86d3fe0c3	Do not mark no-return calls tail calls. It'll screw up special calls like longjmp and it doesn't make much sense for performance reason. If my logic is faulty, please let me know. llvm-svn: 94937	2010-01-31 00:59:31 +00:00
Bob Wilson	56600a15ad	Check alignment of loads when deciding whether it is safe to execute them unconditionally. Besides checking the offset, also check that the underlying object is aligned as much as the load itself. llvm-svn: 94875	2010-01-30 04:42:39 +00:00
Eric Christopher	5a0e174863	Revert my last couple of patches. They appear to have broken bison. llvm-svn: 94841	2010-01-29 21:16:24 +00:00
Bob Wilson	7c42b9d51e	Improve isSafeToLoadUnconditionally to recognize that GEPs with constant indices are safe if the result is known to be within the bounds of the underlying object. llvm-svn: 94829	2010-01-29 19:19:08 +00:00
Eric Christopher	9b3c02b7da	Make strcpy_chk lower to strcpy if we have a safe size. llvm-svn: 94783	2010-01-29 01:37:11 +00:00
Bill Wendling	48816a0b3f	Generic reformatting and comment fixing. No functionality change. llvm-svn: 94771	2010-01-29 00:52:43 +00:00
Bill Wendling	8277838cf8	Add newline to debugging output, and fix some grammar-os in comment. llvm-svn: 94765	2010-01-29 00:27:39 +00:00
Benjamin Kramer	40582a891c	Use the less expensive getName function instead of getNameStr. llvm-svn: 94683	2010-01-27 19:46:52 +00:00
Bob Wilson	70c8fe5e4e	Remove check for an impossible condition: the condition of the while loop has already checked that TmpBB->getSinglePredecessor() is non-null. llvm-svn: 94451	2010-01-25 21:28:05 +00:00
Bob Wilson	fc060e4337	Change Value::getUnderlyingObject to have the MaxLookup value specified as a parameter with a default value, instead of just hardcoding it in the implementation. The limit of MaxLookup = 6 was introduced in r69151 to fix a performance problem with O(n^2) behavior in instcombine, but the scalarrepl pass is relying on getUnderlyingObject to go all the way back to an AllocaInst. Making the limit part of the method signature makes it clear that by default the result is limited and should help avoid similar problems in the future. This fixes pr6126. llvm-svn: 94433	2010-01-25 18:26:54 +00:00
Chris Lattner	823aed16f9	make -fno-rtti the default unless a directory builds with REQUIRES_RTTI. llvm-svn: 94378	2010-01-24 20:43:08 +00:00
Chris Lattner	29b15c5cfd	third bug from PR6119: the xor dupe extension allows for arbitrary terminators in predecessors, don't assume it is a conditional or uncond branch. The testcase shows an example where they can happen with switches. llvm-svn: 94323	2010-01-23 19:21:31 +00:00
Chris Lattner	ba2d0b89ff	add an early out to ProcessBranchOnXOR to speed it up, handle the case when we can infer an input to the xor from all inputs that agree, instead of going into an infinite loop. Another part of PR6199 llvm-svn: 94321	2010-01-23 19:16:25 +00:00
Chris Lattner	de5ab4860f	fix a crash in jump threading, PR6119 llvm-svn: 94319	2010-01-23 18:56:07 +00:00
Eric Christopher	ba7cd4c393	Reapply 94059 while fixing the calling convention setup for strcpy. llvm-svn: 94287	2010-01-23 05:29:06 +00:00
Bob Wilson	6c0c8d41b4	Revert 94059. It is breaking the MultiSource/Benchmarks/Prolangs-C/bison test on ARM. llvm-svn: 94198	2010-01-22 19:16:40 +00:00
Chris Lattner	7ba0661f27	Stop building RTTI information for most llvm libraries. Notable missing ones are libsupport, libsystem and libvmcore. libvmcore is currently blocked on bugpoint, which uses EH. Once it stops using EH, we can switch it off. This #if 0's out 3 unit tests, because gtest requires RTTI information. Suggestions welcome on how to fix this. llvm-svn: 94164	2010-01-22 06:49:46 +00:00
Dan Gohman	045f81981a	Revert LoopStrengthReduce.cpp to pre-r94061 for now. llvm-svn: 94123	2010-01-22 00:46:49 +00:00
Victor Hernandez	1df65186d1	DbgInfoIntrinsics no longer appear in an instruction's use list; so clean up looking for them in use iterations and remove OnlyUsedByDbgInfoIntrinsics() llvm-svn: 94111	2010-01-21 23:05:53 +00:00
Dan Gohman	b1ee154b6b	When inserting expressions for post-increment users which contain loop-variant components, adds must be inserted after the increment. Keep track of the increment position for this case, and insert these adds in the correct location. llvm-svn: 94110	2010-01-21 23:01:22 +00:00
Dan Gohman	cb8d577eb2	Include IVUsers information in LSR's debug output. llvm-svn: 94108	2010-01-21 22:46:32 +00:00
Dan Gohman	29916e023d	Prune the search for candidate formulae if the number of register operands exceeds the number of registers used in the initial solution, as that wouldn't lead to a profitable solution anyway. llvm-svn: 94107	2010-01-21 22:42:49 +00:00
Dan Gohman	c903499ff8	Add a comment. llvm-svn: 94104	2010-01-21 21:31:09 +00:00
Dan Gohman	51ad99d2c5	Re-implement the main strength-reduction portion of LoopStrengthReduction. This new version is much more aggressive about doing "full" reduction in cases where it reduces register pressure, and also more aggressive about rewriting induction variables to count down (or up) to zero when doing so reduces register pressure. It currently uses fairly simplistic algorithms for finding reuse opportunities, but it introduces a new framework allows it to combine multiple strategies at once to form hybrid solutions, instead of doing all full-reduction or all base+index. llvm-svn: 94061	2010-01-21 02:09:26 +00:00
Eric Christopher	fa863258d0	Add strcpy_chk -> strcpy support for "don't know" object size answers. This will update as object size checking gets better information. llvm-svn: 94059	2010-01-21 01:04:38 +00:00
Dan Gohman	ca19445d08	When doing address-mode sinking, expand the base register first, rather than the scaled register. This makes it more likely that subsequent AddrModeMatcher queries will match the new address the same way as the old, instead of accidentally matching what had been the base register as the new scaled register, and then failing to match the scaled register. This fixes some problems with address-mode sinking multiple muls into a block, which will be a lot more common with some upcoming LoopStrengthReduction changes. llvm-svn: 93935	2010-01-19 22:45:06 +00:00
Bob Wilson	58d59fe394	Fix a crash in scalarrepl for memcpy/memmove where the source and destination are the same. I had already fixed a similar problem where the source and destination were different bitcasts derived from the same alloca, but the previous fix still did not handle the case where both operands are exactly the same value. Radar 7552893. llvm-svn: 93848	2010-01-19 04:32:48 +00:00
Owen Anderson	cdea3572fa	Convert some of the dynamic opcode lookups into static ones. llvm-svn: 93693	2010-01-17 19:33:27 +00:00
Chris Lattner	573da8ac90	1) Use the new SimplifyInstructionsInBlock routine instead of the copy in JT. 2) When cloning blocks for PHI or xor conditions, use instsimplify to simplify the code as we go. This allows us to squish common cases early in JT which opens up opportunities for subsequent iterations, and allows it to completely simplify the testcase. llvm-svn: 93253	2010-01-12 20:41:47 +00:00
Chris Lattner	af7855d571	tidy up llvm-svn: 93222	2010-01-12 02:07:50 +00:00
Chris Lattner	eb73bdb2e1	Teach jump threading to duplicate small blocks when the branch condition is a xor with a phi node. This eliminates nonsense like this from 176.gcc in several places: LBB166_84: testl %eax, %eax - setne %al - xorb %cl, %al - notb %al - testb $1, %al - je LBB166_85 + je LBB166_69 + jmp LBB166_85 This is rdar://7391699 llvm-svn: 93221	2010-01-12 02:07:17 +00:00
Chris Lattner	6a19ed0b86	some cleanup, and make it obvious that ProcessJumpOnPHI only works on branches by renaming it and checking for a branch at the call site. llvm-svn: 93208	2010-01-11 23:41:09 +00:00
Chris Lattner	ab7087ad66	only factor from expressions whose uses are empty and whose base is the right expression type. This fixes PR5981. llvm-svn: 93045	2010-01-09 06:01:36 +00:00
Duncan Sands	4a8b15dc74	Suppress an unused variable warning when assertions are off; remove some trailing whitespace while there. llvm-svn: 93008	2010-01-08 17:51:48 +00:00
Benjamin Kramer	76e2766442	Use a do-while loop instead of while + boolean. llvm-svn: 92912	2010-01-07 13:50:07 +00:00
Eric Christopher	2cdb806fd8	Move the object size intrinsic optimization to inst-combine and make it work for any integer size return type. llvm-svn: 92853	2010-01-06 20:04:44 +00:00
Mikhail Glushenkov	40d2429b28	Formatting. llvm-svn: 92831	2010-01-06 09:20:39 +00:00
Benjamin Kramer	d2564e3afb	Move remaining stuff to the isInteger predicate. llvm-svn: 92771	2010-01-05 21:05:54 +00:00
Benjamin Kramer	a81a6dff0d	Convert a ton of simple integer type equality tests to the new predicate. llvm-svn: 92760	2010-01-05 20:07:06 +00:00
Dan Gohman	b5358003fb	Set Changed properly after calling DeleteDeadPHIs. llvm-svn: 92735	2010-01-05 16:31:45 +00:00
Dan Gohman	28943873e6	Use do+while instead of while for loops which obviously have a non-zero trip count. Use SmallVector's pop_back_val(). llvm-svn: 92734	2010-01-05 16:27:25 +00:00
Chris Lattner	f741d72b84	fix an infinite loop in reassociate building emacs. llvm-svn: 92679	2010-01-05 04:55:35 +00:00
David Greene	241992382e	Change errs() to dbgs(). llvm-svn: 92624	2010-01-05 01:27:47 +00:00
David Greene	e0b9789593	Change errs() to dbgs(). llvm-svn: 92623	2010-01-05 01:27:44 +00:00
David Greene	6bc0776343	Change errs() to dbgs(). llvm-svn: 92622	2010-01-05 01:27:39 +00:00
David Greene	3a79df0993	Change errs() to dbgs(). llvm-svn: 92620	2010-01-05 01:27:33 +00:00
David Greene	0fd862254e	Change errs() to dbgs(). llvm-svn: 92619	2010-01-05 01:27:30 +00:00
David Greene	d17c3916d0	Change errs() to dbgs(). llvm-svn: 92617	2010-01-05 01:27:24 +00:00
David Greene	9ddc6e2e12	Change errs() to dbgs(). llvm-svn: 92615	2010-01-05 01:27:21 +00:00
David Greene	1efdb45562	Change errs() to dbgs(). llvm-svn: 92614	2010-01-05 01:27:19 +00:00
David Greene	2e6efc441f	Change errs() to dbgs(). llvm-svn: 92613	2010-01-05 01:27:17 +00:00
David Greene	389fc3b9f6	Change errs() to dbgs(). llvm-svn: 92612	2010-01-05 01:27:15 +00:00
David Greene	74e2d4917d	Change errs() to dbgs(). llvm-svn: 92611	2010-01-05 01:27:11 +00:00
David Greene	48c86bedbd	Change errs() to dbgs(). llvm-svn: 92610	2010-01-05 01:27:09 +00:00
David Greene	0dd384cfd0	Change errs() to dbgs(). llvm-svn: 92609	2010-01-05 01:27:06 +00:00
David Greene	d9c355d590	Change errs() to dbgs(). llvm-svn: 92608	2010-01-05 01:27:04 +00:00
Devang Patel	be94f23992	Remove dead debug info intrinsics. Intrinsic::dbg_stoppoint Intrinsic::dbg_region_start Intrinsic::dbg_region_end Intrinsic::dbg_func_start AutoUpgrade simply ignores these intrinsics now. llvm-svn: 92557	2010-01-05 01:10:40 +00:00
Mikhail Glushenkov	6a8ac8ce8f	80-col violations, trailing whitespace. llvm-svn: 92470	2010-01-04 07:55:25 +00:00
Chris Lattner	c0e6640d3a	move instcombine to its own library, it's past time. llvm-svn: 92459	2010-01-04 06:23:24 +00:00
Chris Lattner	2d91231d82	implement an instcombine xform needed by clang's codegen on the example in PR4216. This doesn't trigger in the testsuite, so I'd really appreciate someone scrutinizing the logic for correctness. llvm-svn: 92458	2010-01-04 06:03:59 +00:00
Chris Lattner	48218e42cd	pull my debug hooks out, I'm done with this xform for now. llvm-svn: 92446	2010-01-03 06:58:48 +00:00
Nick Lewycky	475d3d1215	Small cleanups, refactor some duplicated code into a single method. No functionality change. llvm-svn: 92445	2010-01-03 04:39:07 +00:00
Chris Lattner	fca0c8f93a	generalize the previous transformation to handle indexing into arrays of structs and other arrays, so long as all the subsequent indexes are constants. This triggers frequently for stuff like: @divisions = internal constant [29 x [2 x i32]] [[2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 2], [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2]], align 32 ; <[29 x [2 x i32]]> [#uses=50] %623 = getelementptr inbounds [29 x [2 x i32]] @divisions, i64 0, i64 %619, i64 0 ; <i32*> [#uses=1] %684 = icmp eq i32 %683, 999 also for the "my_defs" table in 'gs', etc. llvm-svn: 92444	2010-01-03 03:03:27 +00:00
Nick Lewycky	ff9cd7ace7	Cleanup. llvm-svn: 92436	2010-01-03 00:55:31 +00:00
Chris Lattner	98ad2b56cc	teach instcombine to optimize idioms like A[i]&42 == 0. This occurs in 403.gcc in mode_mask_array, in safe-ctype.c (which is copied in multiple apps) in _sch_istable, etc. llvm-svn: 92427	2010-01-02 22:08:28 +00:00
Chris Lattner	b56bef45f8	Teach the table lookup optimization to generate range compares when a consequtive sequence of elements all satisfies the predicate. Like the double compare case, this generates better code than the magic constant case and generalizes to more than 32/64 element array lookups. Here are some examples where it triggers. From 403.gcc, most accesses to the rtx_class array are handled, e.g.: @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=547] %142 = icmp eq i8 %141, 105 @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=543] %165 = icmp eq i8 %164, 60 Also, most of the 59-element arrays (mode_class/rid_to_yy, etc) optimized before are actually range compares. This lets 32-bit machines optimize them. 400.perlbmk has stuff like this: 400.perlbmk: PL_regkind, even for 32-bit: @PL_regkind = constant [62 x i8] c"\00\00\02\02\02\06\06\06\06\09\09\0B\0B\0D\0E\0E\0E\11\12\12\14\14\16\16\18\18\1A\1A\1C\1C\1E\1F !!!$$&'((((,-.///88886789:;8$", align 32 ; <[62 x i8]> [#uses=4] %811 = icmp ne i8 %810, 33 @PL_utf8skip = constant [256 x i8] c"\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\04\04\04\04\04\04\04\04\05\05\05\05\06\06\07\0D", align 32 ; <[256 x i8]> [#uses=94] %12 = icmp ult i8 %10, 2 etc. llvm-svn: 92426	2010-01-02 21:50:18 +00:00
Chris Lattner	e199d2df80	theoretically the negate we find could be in a different function, check for this case. llvm-svn: 92425	2010-01-02 21:46:33 +00:00
Chris Lattner	2fa4ec70fc	use enums for the over/underdefined markers for clarity. Switch to using -2/-3 instead of -1/-2 for a future xform. llvm-svn: 92423	2010-01-02 20:20:33 +00:00
Chris Lattner	351e22aa36	remove the random sampling framework, which is not maintained anymore. If there is interest, it can be resurrected from SVN. PR4912. llvm-svn: 92422	2010-01-02 20:07:03 +00:00
Nick Lewycky	a67519be12	Fix logic error in previous commit. The != case needs to become an or, not an and. llvm-svn: 92419	2010-01-02 16:14:56 +00:00
Nick Lewycky	357d41b3c1	Optimize pointer comparison into the typesafe form, now that the backends will handle them efficiently. This is the opposite direction of the transformation we used to have here. llvm-svn: 92418	2010-01-02 15:25:44 +00:00
Chris Lattner	cfda435c73	Generalize the previous xform to handle cases where exactly two elements match or don't match with two comparisons. For example, the testcase compiles into: define i1 @test5(i32 %X) { %1 = icmp eq i32 %X, 2 ; <i1> [#uses=1] %2 = icmp eq i32 %X, 7 ; <i1> [#uses=1] %R = or i1 %1, %2 ; <i1> [#uses=1] ret i1 %R } This generalizes the previous xforms when the array is larger than 64 elements (and this case matches) and generates better code for cases where it overlaps with the magic bitshift case. This generalizes more cases than you might expect. For example, 400.perlbmk has: @PL_utf8skip = constant [256 x i8] c"\01\01\01\... %15 = icmp ult i8 %7, 7 403.gcc has: @rid_to_yy = internal constant [114 x i16] [i16 259, i16 260, ... %18 = icmp eq i16 %16, 295 and xalancbmk has a bunch of examples, such as _ZN11xercesc_2_5L15gCombiningCharsE and _ZN11xercesc_2_5L10gBaseCharsE. llvm-svn: 92417	2010-01-02 09:35:17 +00:00
Chris Lattner	c6ac078423	fix a miscompilation I introduced of cdecl with a late change. llvm-svn: 92416	2010-01-02 09:22:13 +00:00
Chris Lattner	935a4a606a	enhance the compare/load/index optimization to work on any load from a global with 32/64 elements or less (depending on whether i64 is native on the target), generating a bitshift idiom to determine the result. For example, on test4 we produce: define i1 @test4(i32 %X) { %1 = lshr i32 933, %X ; <i32> [#uses=1] %2 = and i32 %1, 1 ; <i32> [#uses=1] %R = icmp ne i32 %2, 0 ; <i1> [#uses=1] ret i1 %R } This triggers in a number of interesting cases, for example, here's an fp case: @A.3255 = internal constant [4 x double] [double 4.100000e+00, double -3.900000e+00, double -1.000000e+00, double 1.000000e+00], align 32 ; <[4 x double]> [#uses=7] ... %7 = fcmp olt double %3, 0.000000e+00 In this case we make the slen2_tab global dead, which is nice: @slen2_tab = internal constant [16 x i32] [i32 0, i32 1, i32 2, i32 3, i32 0, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 2, i32 3], align 32 ; <[16 x i32]> [#uses=1] ... %204 = icmp eq i32 %46, 0 Perl has a bunch of these, also on the 'Perl_regkind' array: @Perl_yygindex = internal constant [51 x i16] [i16 0, i16 0, i16 0, i16 0, i16 374, i16 351, i16 0, i16 -12, i16 0, i16 946, i16 413, i16 -83, i16 0, i16 0, i16 0, i16 -311, i16 -13, i16 4007, i16 2893, i16 0, i16 0, i16 0, i16 0, i16 0, i16 372, i16 -8, i16 0, i16 0, i16 246, i16 -131, i16 43, i16 86, i16 208, i16 -45, i16 -169, i16 987, i16 0, i16 0, i16 0, i16 0, i16 308, i16 0, i16 -271, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0], align 32 ; <[51 x i16]> [#uses=1] ... %1364 = icmp eq i16 %1361, 0 186.crafty really likes this on 64-bit machines, because it triggers on a bunch of globals like this: @white_outpost = internal constant [64 x i8] c"\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\02\02\00\00\00\00\00\04\05\05\04\00\00\00\00\03\06\06\03\00\00\00\00\00\01\01\00\00\00\00\00\00\00\00\00\00\00", align 32 ; <[64 x i8]> [#uses=2] However the big winner is 403.gcc, which triggers hundreds of times, eliminating all the accesses to the 57-element arrays 'mode_class', mode_unit_size, mode_bitsize, regclass_map, etc. go 64-bit machines :) llvm-svn: 92415	2010-01-02 08:56:52 +00:00
Chris Lattner	b1567bd584	enhance the previous optimization to work with fcmp in addition to icmp. llvm-svn: 92412	2010-01-02 08:20:51 +00:00
Chris Lattner	a061859ccc	Teach instcombine to fold compares of loads from constant arrays with variable indices into a comparison of the index with a constant. The most common occurrence of this that I see by far is stuff like: if ("foobar"[i] == '\0') ... which we compile into: if (i == 6), saving a load and materialization of the global address. This also exposes loop trip count information to later passes in many cases. This triggers hundreds of times in xalancbmk, which is where I first noticed it, but it also triggers in many other apps. Here are a few interesting ones from various apps: @must_be_connected_without = internal constant [8 x i8] [i8 getelementptr inbounds ([3 x i8]* @.str64320, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str27283, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str71327, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str72328, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str18274, i64 0, i64 0), i8* getelementptr inbounds ([6 x i8]* @.str11267, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str32288, i64 0, i64 0), i8* null], align 32 ; <[8 x i8]> [#uses=2] %scevgep.i = getelementptr [8 x i8] @must_be_connected_without, i64 0, i64 %indvar.i ; <i8*> [#uses=1] %17 = load ... %18 = icmp eq i8 %17, null ; <i1> [#uses=1] -> icmp eq i64 %indvar.i, 7 @yytable1095 = internal constant [84 x i8] c"\12\01(\05\06\07\08\09\0A\0B\0C\0D\0E1\0F\10\11266\1D: \10\11,-,0\03'\10\11B6\04\17&\18\1945\05\06\07\08\09\0A\0B\0C\0D\0E\1E\0F\10\11\1A\1B\1C$3+>#%;<IJ=ADFEGH9KL\00\00\00C", align 32 ; <[84 x i8]> [#uses=2] %57 = getelementptr inbounds [84 x i8]* @yytable1095, i64 0, i64 %56 ; <i8> [#uses=1] %mode.0.in = getelementptr inbounds [9 x i32] @mb_mode_table, i64 0, i64 %.pn ; <i32> [#uses=1] load ... %64 = icmp eq i8 %58, 4 ; <i1> [#uses=1] -> icmp eq i64 %.pn, 35 ; <i1> [#uses=0] @gsm_DLB = internal constant [4 x i16] [i16 6554, i16 16384, i16 26214, i16 32767] %scevgep.i = getelementptr [4 x i16] @gsm_DLB, i64 0, i64 %indvar.i ; <i16*> [#uses=1] %425 = load %scevgep.i %426 = icmp eq i16 %425, -32768 ; <i1> [#uses=0] -> false llvm-svn: 92411	2010-01-02 08:12:04 +00:00
Chris Lattner	2e4be2c340	remove the instcombine transformations that are inserting nasty pointer to int casts that confuse later optimizations. See PR3351 for details. This improves but doesn't complete fix 483.xalancbmk because llvm-gcc does this xform in GCC's "fold" routine as well. Clang++ will do better I guess. llvm-svn: 92408	2010-01-02 00:31:05 +00:00
Chris Lattner	faf1337acb	add a simple instcombine xform, simplify another one to use hasAllZeroIndices() instead of hand rolling a loop. llvm-svn: 92403	2010-01-01 23:09:08 +00:00
Chris Lattner	30c0a2833d	generalize the pointer difference optimization to handle a constantexpr gep on the 'base' side of the expression. This completes comment #4 in PR3351, which comes from 483.xalancbmk. llvm-svn: 92402	2010-01-01 22:42:29 +00:00
Chris Lattner	4394f71752	teach instcombine to optimize pointer difference idioms involving constant expressions. This is a step towards comment #4 in PR3351. llvm-svn: 92401	2010-01-01 22:29:12 +00:00
Chris Lattner	9d4c5414bb	use 'match' to simplify some code. llvm-svn: 92400	2010-01-01 22:12:03 +00:00
Chris Lattner	25c87e9cf9	implement the transform requested in PR5284 llvm-svn: 92398	2010-01-01 18:34:40 +00:00
Chris Lattner	ee1f861d81	add missing line. llvm-svn: 92384	2010-01-01 01:54:08 +00:00
Chris Lattner	8330daf733	add a few trivial instcombines for llvm.powi. llvm-svn: 92383	2010-01-01 01:52:15 +00:00
Chris Lattner	0c59ac3f41	When factoring multiply expressions across adds, factor both positive and negative forms of constants together. This allows us to compile: int foo(int x, int y) { return (x-y) + (x-y) + (x-y); } into: _foo: ## @foo subl %esi, %edi leal (%rdi,%rdi,2), %eax ret instead of (where the 3 and -3 were not factored): _foo: imull $-3, 8(%esp), %ecx imull $3, 4(%esp), %eax addl %ecx, %eax ret this started out as: movl 12(%ebp), %ecx imull $3, 8(%ebp), %eax subl %ecx, %eax subl %ecx, %eax subl %ecx, %eax ret This comes from PR5359. llvm-svn: 92381	2010-01-01 01:13:15 +00:00
Chris Lattner	a552683fd4	clean up some comments. llvm-svn: 92377	2010-01-01 00:04:26 +00:00
Chris Lattner	17229a7cb8	switch from std::map to DenseMap for rank data structures. llvm-svn: 92375	2010-01-01 00:01:34 +00:00
Chris Lattner	fed3397654	reuse negates where possible instead of always creating them from scratch. This allows us to optimize test12 into: define i32 @test12(i32 %X) { %factor = mul i32 %X, -3 ; <i32> [#uses=1] %Z = add i32 %factor, 6 ; <i32> [#uses=1] ret i32 %Z } instead of: define i32 @test12(i32 %X) { %Y = sub i32 6, %X ; <i32> [#uses=1] %C = sub i32 %Y, %X ; <i32> [#uses=1] %Z = sub i32 %C, %X ; <i32> [#uses=1] ret i32 %Z } llvm-svn: 92373	2009-12-31 20:34:32 +00:00
Chris Lattner	60c2ca743d	we don't need a smallptrset to detect duplicates, the values are sorted, so we can just do a linear scan. llvm-svn: 92372	2009-12-31 19:49:01 +00:00
Chris Lattner	1d8979422a	make reassociate more careful about not leaving around dead mul's llvm-svn: 92370	2009-12-31 19:34:45 +00:00
Chris Lattner	ed18917665	remove debug llvm-svn: 92369	2009-12-31 19:25:19 +00:00
Chris Lattner	60b71b5c4d	teach reassociate to factor x+x+x -> x*3. While I'm at it, fix RemoveDeadBinaryOp to actually do something. llvm-svn: 92368	2009-12-31 19:24:52 +00:00
Chris Lattner	38abecbad0	change reassociate to use SmallVector for its key datastructures instead of std::vector. llvm-svn: 92366	2009-12-31 18:40:32 +00:00
Chris Lattner	ac61550504	change an if to an assert, fix comment. llvm-svn: 92364	2009-12-31 18:18:46 +00:00
Chris Lattner	177140ad12	move the rest of the add optimization code out to OptimizeAdd, improve some comments, simplify a bit of code. llvm-svn: 92363	2009-12-31 18:17:13 +00:00
Chris Lattner	ba1f36aa99	factor statistic updating better. llvm-svn: 92362	2009-12-31 17:51:05 +00:00
Chris Lattner	4e3a5678af	simple fix for an incorrect factoring which causes a miscompilation, PR5458. llvm-svn: 92354	2009-12-31 08:33:49 +00:00
Chris Lattner	5f8a005d38	factor code out into helper functions. llvm-svn: 92347	2009-12-31 07:59:34 +00:00
Chris Lattner	f5c2b8b8d7	switch some std::vector's to smallvector. Reduce nesting. llvm-svn: 92346	2009-12-31 07:48:51 +00:00
Chris Lattner	9039ff8912	use more modern datastructures. llvm-svn: 92344	2009-12-31 07:33:14 +00:00
Chris Lattner	bc1512c8d1	clean up -debug output. llvm-svn: 92343	2009-12-31 07:17:37 +00:00
Chris Lattner	17079fc0fa	split code that doesn't need to be templated out of IRBuilder into a new non-templated IRBuilderBase class. Move that large CreateGlobalString out of line, eliminating the need to #include GlobalVariable.h in IRBuilder.h llvm-svn: 92227	2009-12-28 21:28:46 +00:00
Chris Lattner	f8d22fc77d	Metadata.h doesn't need to include ValueHandle.h anymore. llvm-svn: 92211	2009-12-28 08:20:46 +00:00
Chris Lattner	1a32ede6fd	move an optimization for memcmp out of simplifylibcalls and into SDISel. This optimization was causing simplifylibcalls to introduce type-unsafe nastiness. This is the first step, I'll be expanding the memcmp optimizations shortly, covering things that we really really wouldn't want simplifylibcalls to do. llvm-svn: 92098	2009-12-24 00:37:38 +00:00
Chris Lattner	efebb234b7	reorder to follow a normal fall-through style, no functionality change. llvm-svn: 92084	2009-12-23 23:24:51 +00:00
David Greene	2330f78075	Remove dump routine and the associated Debug.h from a header. Patch up other files to compensate. llvm-svn: 92075	2009-12-23 22:58:38 +00:00
Eric Christopher	fdb33458fc	Update objectsize intrinsic and associated dependencies. Fix lowering code and update testcases. llvm-svn: 91979	2009-12-23 02:51:48 +00:00
Chris Lattner	c0f6402a94	Fix the Convert to scalar to not insert dead loads in the store case. The load is needed when we have a small store into a large alloca (at which point we get a load/insert/store sequence), but when you do a full-sized store, this load ends up being dead. This dead load is bad in really large nasty testcases where the load ends up causing mem2reg to insert large chains of dependent phi nodes which only ADCE can delete. Instead of doing this, just don't insert the dead load. This fixes rdar://6864035 llvm-svn: 91917	2009-12-22 19:33:28 +00:00
Chris Lattner	fda3b559e6	fix some fixme's by using twines llvm-svn: 91916	2009-12-22 19:23:33 +00:00
Bob Wilson	62a84ea8e3	Generalize SROA to allow the first index of a GEP to be non-zero. Add a missing check that an array reference doesn't go past the end of the array, and remove some redundant checks for in-bound array and vector references that are no longer needed. llvm-svn: 91897	2009-12-22 06:57:14 +00:00
Chris Lattner	f21a220bcd	Implement PR5795 by merging duplicated return blocks. This could go further by merging all returns in a function into a single one, but simplifycfg currently likes to duplicate the return (an unfortunate choice!) llvm-svn: 91890	2009-12-22 06:07:30 +00:00
Chris Lattner	9b7d99eb76	The phi translated pointer can be computed when returning a partially cached result instead of stored. This reduces memdep memory usage, and also eliminates a bunch of weakvh's. This speeds up gvn on gcc.c-torture/20001226-1.c from 23.9s to 8.45s (2.8x) on a different machine than earlier. llvm-svn: 91885	2009-12-22 04:25:02 +00:00
Eric Christopher	ab6a0d60d5	Whitespace fixes. llvm-svn: 91875	2009-12-22 01:23:51 +00:00
Daniel Dunbar	c661a2d4d8	Add suggested parentheses. llvm-svn: 91853	2009-12-21 23:27:57 +00:00
Chris Lattner	bf20018423	Add a fastpath to Load GVN to special case when we have exactly one dominating load to avoid even messing around with SSAUpdate at all. In this case (which is very common, we can just use the input value directly). This speeds up GVN time on gcc.c-torture/20001226-1.c from 36.4s to 16.3s, which still isn't great, but substantially better and this is a simple speedup that applies to lots of different cases. llvm-svn: 91851	2009-12-21 23:15:48 +00:00
Chris Lattner	927b0ac4b2	refactor some code out to a new helper method. llvm-svn: 91849	2009-12-21 23:04:33 +00:00
Bob Wilson	88a0598fe8	Remove special-case SROA optimization of variable indexes to one-element and two-element arrays. After restructuring the SROA code, it was not safe to do this without adding more checking. It is not clear that this special-case has really been useful, and removing this simplifies the code quite a bit. llvm-svn: 91828	2009-12-21 18:39:47 +00:00
Chris Lattner	d4fb4296df	give instcombine some helper functions for matching MIN and MAX, and implement some optimizations for MIN(MIN()) and MAX(MAX()) and MIN(MAX()) etc. This substantially improves the code in PR5822 but doesn't kick in much elsewhere. 2 max's were optimized in pairlocalalign and one in smg2000. llvm-svn: 91814	2009-12-21 06:03:05 +00:00
Chris Lattner	ffbd02829c	enhance x-(-A) -> x+A to preserve NUW/NSW. Use the presence of NSW/NUW to fold "icmp (x+cst), x" to a constant in cases where it would otherwise be undefined behavior. Surprisingly (to me at least), this triggers hundreds of the times in a few benchmarks: lencode, ldecode, and 466.h264ref seem to really like this. llvm-svn: 91812	2009-12-21 04:04:05 +00:00
Chris Lattner	900ce231f9	Optimize all cases of "icmp (X+Cst), X" to something simpler. This triggers a bunch in lencode, ldecod, spass, 176.gcc, 252.eon, among others. It is also the first part of PR5822 llvm-svn: 91811	2009-12-21 03:19:28 +00:00
Chris Lattner	4ad5eba568	fix PR5827 by disabling the phi slicing transformation in a case where instcombine would have to split a critical edge due to a phi node of an invoke. Since instcombine can't change the CFG, it has to bail out from doing the transformation. llvm-svn: 91763	2009-12-19 07:01:15 +00:00
Bob Wilson	c16811b575	Update my SROA changes in response to review. * change FindElementAndOffset to return a uint64_t instead of unsigned, and to identify the type to be used for that result in a GEP instruction. * move "isa<ConstantInt>" to be first in conditional. * replace some dyn_casts with casts. * add a comment about handling mem intrinsics. llvm-svn: 91762	2009-12-19 06:53:17 +00:00
Bob Wilson	532cd232fb	Reapply 91459 with a simple fix for the problem that broke the x86_64-darwin bootstrap. This also replaces the WeakVH references that Chris objected to with normal Value references. llvm-svn: 91711	2009-12-18 20:14:40 +00:00
Eli Friedman	86b9d75dc8	Optimize icmp of null and select of two constants even if the select has multiple uses. (The construct in question was found in gcc.) llvm-svn: 91675	2009-12-18 08:22:35 +00:00
Dan Gohman	57e808628c	Eliminte unnecessary uses of <cstdio>. llvm-svn: 91666	2009-12-18 03:25:51 +00:00
Dan Gohman	18fa5686f6	Add Loop contains utility methods for testing whether a loop contains another loop, or an instruction. The loop form is substantially more efficient on large loops than the typical code it replaces. llvm-svn: 91654	2009-12-18 01:24:09 +00:00
Dan Gohman	fd7231f1fe	Minor code simplification. llvm-svn: 91653	2009-12-18 01:20:44 +00:00
Dan Gohman	b1924e8a0f	Don't pass const pointers by reference. llvm-svn: 91647	2009-12-18 00:38:08 +00:00
Dan Gohman	92c3696524	Reapply LoopStrengthReduce and IVUsers cleanups, excluding the part of 91296 that caused trouble -- the Processed list needs to be preserved for the livetime of the pass, as AddUsersIfInteresting is called from other passes. llvm-svn: 91641	2009-12-18 00:06:20 +00:00
Eli Friedman	250b119d98	Allow instcombine to combine "sext(a) >u const" to "a >u trunc(const)". llvm-svn: 91631	2009-12-17 22:42:29 +00:00
Eli Friedman	7cc86b4cc6	Make the ptrtoint comparison simplification work if one side is a global. llvm-svn: 91624	2009-12-17 21:27:47 +00:00
Eli Friedman	5842c9968a	Slightly generalize transformation of memmove(a,a,n) so that it also applies to memcpy. (Such a memcpy is technically illegal, but in practice is safe and is generated by struct self-assignment in C code.) llvm-svn: 91621	2009-12-17 21:07:31 +00:00
Bob Wilson	f3927b7994	Re-revert 91459. It's breaking the x86_64 darwin bootstrap. llvm-svn: 91607	2009-12-17 18:34:24 +00:00
Evan Cheng	090ac0865a	Revert 91280-91283, 91286-91289, 91291, 91293, 91295-91296. It apparently introduced a non-deterministic behavior in the optimizer somewhere. llvm-svn: 91598	2009-12-17 09:39:49 +00:00
Daniel Dunbar	ab42d42390	Reapply r91459, it was only unmasking the bug, and since TOT is still broken having it reverted does no good. llvm-svn: 91559	2009-12-16 20:09:53 +00:00
Daniel Dunbar	133efc317e	Revert "Reapply 91184 with fixes and an addition to the testcase to cover the problem", this broke llvm-gcc bootstrap for release builds on x86_64-apple-darwin10. This reverts commit db22309800b224a9f5f51baf76071d7a93ce59c9. llvm-svn: 91534	2009-12-16 10:56:17 +00:00
Chris Lattner	f278addbdc	reapply my strstr optimization. I have reproduced the x86-64 bootstrap miscompile (i386.o miscompares) but it happens both with and without this patch. llvm-svn: 91532	2009-12-16 09:32:05 +00:00
Chris Lattner	177be32334	revert my strstr optimization, I'm told it breaks x86-64 bootstrap. Will reapply with a fix when I get a chance. llvm-svn: 91486	2009-12-16 00:46:02 +00:00
Bob Wilson	e44756d7c2	Reapply 91184 with fixes and an addition to the testcase to cover the problem found last time. Instead of trying to modify the IR while iterating over it, I've change it to keep a list of WeakVH references to dead instructions, and then delete those instructions later. I also added some special case code to detect and handle the situation when both operands of a memcpy intrinsic are referencing the same alloca. llvm-svn: 91459	2009-12-15 22:00:51 +00:00
Chris Lattner	26ab363361	optimize strstr, PR5783 llvm-svn: 91438	2009-12-15 19:14:40 +00:00
Dan Gohman	265ce318b8	Delete an unused function. llvm-svn: 91432	2009-12-15 16:30:09 +00:00
Chris Lattner	24aba42d04	add some other xforms that should be done as part of PR5783 llvm-svn: 91428	2009-12-15 09:05:13 +00:00
Chris Lattner	45d040bd85	Remove isPod() from DenseMapInfo, splitting it out to its own isPodLike type trait. This is a generally useful type trait for more than just DenseMap, and we really care about whether something acts like a pod, not whether it really is a pod. llvm-svn: 91421	2009-12-15 07:26:43 +00:00
Dan Gohman	fbeec7270c	Fix a thinko; isNotAlreadyContainedIn had a built-in negative, so the condition was inverted when the code was converted to contains(). llvm-svn: 91295	2009-12-14 17:31:01 +00:00
Dan Gohman	416d5b7361	Remove unnecessary #includes. llvm-svn: 91293	2009-12-14 17:19:06 +00:00
Dan Gohman	163fb26927	Instead of having a ScalarEvolution pointer member in BasedUser, just pass the ScalarEvolution pointer into the functions which need it. llvm-svn: 91289	2009-12-14 17:12:51 +00:00
Dan Gohman	8dbd4e3d16	Don't bother cleaning up if there's nothing to clean up. llvm-svn: 91288	2009-12-14 17:10:44 +00:00
Dan Gohman	88c7e61c5b	Delete an unused variable. llvm-svn: 91287	2009-12-14 17:08:09 +00:00
Dan Gohman	838f604543	LSR itself doesn't need LoopInfo. llvm-svn: 91283	2009-12-14 17:02:34 +00:00
Dan Gohman	273e692952	LSR itself doesn't need DominatorTree. llvm-svn: 91282	2009-12-14 16:57:08 +00:00
Dan Gohman	c3513095cf	Remove the code in LSR that manually hoists expansions out of loops; SCEVExpander does this automatically. llvm-svn: 91281	2009-12-14 16:52:55 +00:00
Dan Gohman	ec2a7c58e8	Minor code cleanups. llvm-svn: 91280	2009-12-14 16:37:29 +00:00
Chris Lattner	aaa6ac10a6	revert r91184, because it causes a crash on a .bc file I just sent to Bob. llvm-svn: 91268	2009-12-14 05:11:02 +00:00
Bob Wilson	895f364ae6	Revise scalar replacement to be more flexible about handle bitcasts and GEPs. While scanning through the uses of an alloca, keep track of the current offset relative to the start of the alloca, and check memory references to see if the offset & size correspond to a component within the alloca. This has the nice benefit of unifying much of the code from isSafeUseOfAllocation, isSafeElementUse, and isSafeUseOfBitCastedAllocation. The code to rewrite the uses of a promoted alloca, after it is determined to be safe, is reorganized in the same way. Also, when rewriting GEP instructions, mark them as "in-bounds" since all the indices are known to be safe. llvm-svn: 91184	2009-12-11 23:47:40 +00:00
Eric Christopher	22889c049d	Make sure the immediate dominator isn't NULL through iterations of the loop. We could get to this condition via indirect branches. llvm-svn: 91009	2009-12-10 00:25:41 +00:00
Chris Lattner	9ccc879006	Fix PR5744, a case where we were getting the pointer size instead of the value size. This only manifested when memdep inprecisely returns clobber, which is do to a caching issue in the PR5744 testcase. We can 'efficiently emulate' this by using '-no-aa' llvm-svn: 91004	2009-12-10 00:11:45 +00:00
Chris Lattner	3ddf804f78	allow this to build when the #if 0's are enabled. No functionality change. llvm-svn: 90999	2009-12-10 00:04:46 +00:00
Dan Gohman	72c367fb52	Dereference loopHeader after checking for null rather than before. llvm-svn: 90990	2009-12-09 22:55:01 +00:00
Chris Lattner	ca5f9cb18b	fix hte last remaining known (by me) phi translation bug. When we reanalyze clobbers to forward pieces of large stores to small loads, we need to consider the properly phi translated pointer in the store block. llvm-svn: 90978	2009-12-09 18:21:46 +00:00
Chris Lattner	f8ba1253f1	change GetStoreValueForLoad to use IRBuilder, which is cleaner and implicitly constant folds. llvm-svn: 90977	2009-12-09 18:13:28 +00:00
Bob Wilson	1c5a6fb299	Fix a comment. llvm-svn: 90975	2009-12-09 18:05:27 +00:00
Chris Lattner	07df9efb35	change AnalyzeLoadFromClobberingMemInst/AnalyzeLoadFromClobberingStore to require the load ty/ptr to be passed in, no functionality change. llvm-svn: 90960	2009-12-09 07:37:07 +00:00
Chris Lattner	0def861ee9	change AnalyzeLoadFromClobberingWrite and clients to pass in type and pointer instead of the load. No functionality change. llvm-svn: 90959	2009-12-09 07:34:10 +00:00
Chris Lattner	0c31547168	change NonLocalDepEntry from being a typedef for an std::pair to be its own small class. No functionality change. llvm-svn: 90956	2009-12-09 07:08:01 +00:00
Chris Lattner	946b58dd90	add some aborts to #if 0's. llvm-svn: 90929	2009-12-09 02:41:54 +00:00
Chris Lattner	972e6d8d00	Switch GVN and memdep to use PHITransAddr, which correctly handles phi translation of complex expressions like &A[i+1]. This has the following benefits: 1. The phi translation logic is all contained in its own class with a strong interface and verification that it is self consistent. 2. The logic is more correct than before. Previously, if intermediate expressions got PHI translated, we'd miss the update and scan for the wrong pointers in predecessor blocks. @phi_trans2 is a testcase for this. 3. We have a lot less code in memdep. We can handle phi translation across blocks of things like @phi_trans3, which is pretty insane :). This patch should fix the miscompiles of 255.vortex, and I tested it with a bootstrap of llvm-gcc, llvm-test and dejagnu of course. llvm-svn: 90926	2009-12-09 01:59:31 +00:00
Bob Wilson	c5d082fd5d	Some superficial cleanups. llvm-svn: 90866	2009-12-08 18:27:03 +00:00
Bob Wilson	2029ea04f9	Clean up dead operands left around after SROA replaces a mem intrinsic. I'm not aware that this does anything significant on its own, but it's needed for another patch that I'm working on. llvm-svn: 90864	2009-12-08 18:22:03 +00:00
Nick Lewycky	8bca014d7f	Remove unnecessary #include "llvm/LLVMContext.h". llvm-svn: 90836	2009-12-08 05:45:41 +00:00
Chris Lattner	6d6f10fe91	fix PR5698 llvm-svn: 90708	2009-12-06 17:17:23 +00:00
Chris Lattner	778cb92235	constant fold loads from memcpy's from global constants. This is important because clang lowers nontrivial automatic struct/array inits to memcpy from a global array. llvm-svn: 90698	2009-12-06 05:29:56 +00:00
Chris Lattner	93236ba327	add support for forwarding mem intrinsic values to non-local loads. llvm-svn: 90697	2009-12-06 04:54:31 +00:00
Chris Lattner	42376066eb	Handle forwarding local memsets to loads. For example, we optimize this: short x(short A) { memset(A, 1, sizeof(A)*100); return A[42]; } to 'return 257' instead of doing the load. llvm-svn: 90695	2009-12-06 01:57:02 +00:00
Nick Lewycky	a0e9d700dc	Generalize this optimization to work on equality comparisons between any two integers that are constant except for a single bit (the same n-th bit in each). llvm-svn: 90646	2009-12-05 05:00:00 +00:00
Bob Wilson	050b812fe7	Fix up some comments. llvm-svn: 90603	2009-12-04 21:57:37 +00:00
Bob Wilson	5ca37b274c	Fix 80-column violations. llvm-svn: 90601	2009-12-04 21:51:35 +00:00
Bob Wilson	53bdae3802	Fix a comment typo. llvm-svn: 90487	2009-12-03 21:47:07 +00:00
Owen Anderson	0b6e260066	Fix this crasher, and add a FIXME for a missed optimization. llvm-svn: 90408	2009-12-03 03:43:29 +00:00
Chris Lattner	a48f44d9ee	improve portability to avoid conflicting with std::next in c++'0x. Patch by Howard Hinnant! llvm-svn: 90365	2009-12-03 00:50:42 +00:00
Owen Anderson	b9878ee6b6	Cleanup/remove some parts of the lifetime region handling code in memdep and GVN, per Chris' comments. Adjust testcases to match. llvm-svn: 90304	2009-12-02 07:35:19 +00:00
Chris Lattner	c468025ac9	factor some code better. llvm-svn: 90299	2009-12-02 06:44:58 +00:00
Chris Lattner	2764b4dc55	formatting cleanups. llvm-svn: 90298	2009-12-02 06:35:55 +00:00
Chris Lattner	eea42c7b51	tidy up, remove dependence on order of evaluation of function args from EmitMemCpy. llvm-svn: 90297	2009-12-02 06:05:42 +00:00
Chris Lattner	3c9aca9079	fix PR5640 by tracking whether a block is the header of a loop more precisely, which prevents us from infinitely peeling the loop. llvm-svn: 90211	2009-12-01 06:04:43 +00:00
Benjamin Kramer	3efc050ac4	Revert r90089 for now, it's breaking selfhost. llvm-svn: 90097	2009-11-29 21:17:48 +00:00
Benjamin Kramer	bfa993ab20	Fix two FIXMEs. llvm-svn: 90089	2009-11-29 20:29:30 +00:00
Chris Lattner	1cc4cca193	add testcases for the foo_with_overflow op xforms added recently and fix bugs exposed by the tests. Testcases from Alastair Lynn! llvm-svn: 90056	2009-11-29 02:57:29 +00:00
Chris Lattner	cd261c9c26	Implement PR5634. llvm-svn: 90046	2009-11-29 00:51:17 +00:00
Chris Lattner	32140312ca	reenable load address insertion in load pre. This allows us to handle cases like this: void test(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } where G[1] isn't live into the loop. llvm-svn: 90041	2009-11-28 16:08:18 +00:00
Chris Lattner	44da5bd837	Enhance InsertPHITranslatedPointer to be able to return a list of newly inserted instructions. No functionality change until someone starts using it. llvm-svn: 90039	2009-11-28 15:39:14 +00:00
Chris Lattner	cf0b198827	disable value insertion for now, I need to figure out how to inform GVN about the newly inserted values. This fixes PR5631. llvm-svn: 90022	2009-11-27 22:50:07 +00:00
Chris Lattner	2be52e72ae	Rework InsertPHITranslatedPointer to handle the recursive case, this fixes PR5630 and sets the stage for the next phase of goodness (testcase pending). llvm-svn: 90019	2009-11-27 22:05:15 +00:00
Chris Lattner	3d9823b9cf	factor some logic out of instcombine into a new SimplifyAddInst method. llvm-svn: 90011	2009-11-27 17:42:22 +00:00
Chris Lattner	2226db66ab	fix PR5436 by making the 'simple' case of SRoA not promote out of range array indexes. The "complex" case of SRoA still handles them, and correctly. This fixes a weirdness where we'd correctly avoid transforming A[0][42] if the 42 was too large, but we'd only do it if it was one gep, not two separate ones. llvm-svn: 90007	2009-11-27 16:37:41 +00:00
Chris Lattner	25be93dfed	teach GVN's load PRE to insert computations of the address in predecessors where it is not available. It's unclear how to get this inserted computation into GVN's scalar availability sets, Owen, help? :) llvm-svn: 89997	2009-11-27 08:25:10 +00:00
Chris Lattner	a9a76ccf56	Fix phi translation in load PRE to agree with the phi translation done by memdep, and reenable gep translation again. llvm-svn: 89992	2009-11-27 06:31:14 +00:00
Chris Lattner	8574aba4ea	factor some instcombine simplifications for getelementptr out to a new SimplifyGEPInst method in InstructionSimplify.h. No functionality change. llvm-svn: 89980	2009-11-27 00:29:05 +00:00
Chris Lattner	a5bc618a91	fix crash on Transforms/InstCombine/intrinsics.ll introduced by r89970 llvm-svn: 89972	2009-11-26 22:08:06 +00:00
Chris Lattner	a73ecf0b00	Fix PR5471 by removing an instcombine xform. Some pieces of the code generates store to undef and some generates store to null as the idiom for undefined behavior. Since simplifycfg zaps both, don't remove the undefined behavior in instcombine. llvm-svn: 89971	2009-11-26 22:04:42 +00:00
Chris Lattner	5b83ba215d	implement a bunch of xforms for overflow intrinsics, based on a patch by Alastair Lynn. llvm-svn: 89970	2009-11-26 21:42:47 +00:00
Edward O'Callaghan	2b8fed15e0	Reverting patch in revision 89758, initial attempt at fixing PR5373 has proven to be bogus. llvm-svn: 89844	2009-11-25 05:38:41 +00:00
Edward O'Callaghan	5fd452d596	Fix for PR5373, Credit to Jakub Staszak. llvm-svn: 89758	2009-11-24 11:51:52 +00:00
Dan Gohman	1f522d98f8	Fix a use of an invalidated iterator in the case where there are multiple adjacent uses of a dead basic block from the same user. This fixes PR5596. llvm-svn: 89658	2009-11-23 16:13:39 +00:00
Nick Lewycky	15a1287c1f	Pull LLVMContext out of PromoteMemToReg. llvm-svn: 89645	2009-11-23 03:50:44 +00:00
Nick Lewycky	621fe5614e	Remove LLVMContext and its include. llvm-svn: 89644	2009-11-23 03:34:29 +00:00
Nick Lewycky	922d4ab574	Reapply r88830 with a bugfix: this transform only applies to icmp eq/ne. This fixes part of PR5438. llvm-svn: 89639	2009-11-23 03:17:33 +00:00
Eric Christopher	0c7bd96de2	Add more optimizations for object size checking, enable handling of object size intrinsic and verify return type is correct. Collect various code in one place. llvm-svn: 89523	2009-11-21 01:01:30 +00:00
Dan Gohman	d15302afa0	Fix IPSCCP's code for deleting dead blocks to tolerate outstanding blockaddress users. This fixes PR5569. llvm-svn: 89483	2009-11-20 20:19:14 +00:00
Daniel Dunbar	f87c75706f	Revert "Add some rough optimizations for checking routines.", it buildeth not. llvm-svn: 89482	2009-11-20 20:17:30 +00:00
Eric Christopher	cf97d01dff	Add some rough optimizations for checking routines. llvm-svn: 89479	2009-11-20 19:57:37 +00:00
Duncan Sands	9e26aac773	Fix PR5563, an expensive checks failure when running on tests/Transforms/InstCombine/shufflemask-undef.ll. If anyone cares, the use of 2*e here (and the equivalent all over the place in instcombine) seems wrong, though harmless: it should really be twice the length of the input vector. I think shufflevector used to require that the mask have the same length as the input, but I don't think that's true any more. I don't care enough about vectors to do anything about this... llvm-svn: 89456	2009-11-20 13:19:51 +00:00
Dan Gohman	cbc6ebb6fd	Enable hoisting of loads from constant memory by default. In cases where they are lowered to instruction sequences more complex than a simple load, such that CodeGen cannot rematerialize them, a reload from a spill slot is likely to be cheaper than the complex sequence. llvm-svn: 89374	2009-11-19 19:00:10 +00:00
Jim Grosbach	6bf5305f5d	grammar llvm-svn: 89145	2009-11-17 21:37:04 +00:00
Jim Grosbach	e4e018ae67	80-column violations llvm-svn: 89123	2009-11-17 19:05:35 +00:00
Evan Cheng	ba4e5da727	Generalize OptimizeLoopTermCond to optimize more loop terminating icmp to use postinc iv. llvm-svn: 89116	2009-11-17 18:10:11 +00:00
Jim Grosbach	60f4854c76	Remove trailing whitespace llvm-svn: 89110	2009-11-17 17:53:56 +00:00
David Greene	a3ce7828b2	Fix an expensive-checks error. The Mask and LHSMask may not be of the same size, so don't do the transformation if they're different. llvm-svn: 88972	2009-11-16 21:52:23 +00:00
Duncan Sands	e5de4a9ad6	CreateIntCast takes an "isSigned" parameter. Pass "true" for it, rather than a name. llvm-svn: 88908	2009-11-16 12:32:28 +00:00
Chris Lattner	9d9812a636	make PRE of loads preserve the alignment of the moved load instruction. llvm-svn: 88865	2009-11-15 19:58:31 +00:00
Chris Lattner	5f037b6439	fix a bug handling 'not x' when x is undef. llvm-svn: 88864	2009-11-15 19:57:43 +00:00
Nick Lewycky	95148689c9	Revert r88830 and r88831 which appear to have caused a selfhost buildbot some grief. I suspect this patch merely exposed a bug else. llvm-svn: 88841	2009-11-15 07:47:32 +00:00
Nick Lewycky	e29fa4c7a1	Teach instcombine to look for booleans in wider integers when it encounters a zext(icmp). It may be able to optimize that away. This fixes one of the cases in PR5438. llvm-svn: 88830	2009-11-15 05:55:17 +00:00
Nick Lewycky	7935bcb0fe	Remove LLVMContext from reassociate. It was threaded through every function but ultimately never used. llvm-svn: 88763	2009-11-14 07:25:54 +00:00
Dan Gohman	81132465d3	Add an option for running GVN with redundant load processing disabled. llvm-svn: 88742	2009-11-14 02:27:51 +00:00
Owen Anderson	e96b2111b1	Re-enable this code, since redundant PHIs are now being better nuked. llvm-svn: 87042	2009-11-12 23:22:41 +00:00
Evan Cheng	85a9f430e9	- Teach LSR to avoid changing cmp iv stride if it will create an immediate that cannot be folded into target cmp instruction. - Avoid a phase ordering issue where early cmp optimization would prevent the later count-to-zero optimization. - Add missing checks which could cause LSR to reuse stride that does not have users. - Fix a bug in count-to-zero optimization code which failed to find the pre-inc iv's phi node. - Remove, tighten, loosen some incorrect checks disable valid transformations. - Quite a bit of code clean up. llvm-svn: 86969	2009-11-12 07:35:05 +00:00
Chris Lattner	5f6b8b2bcb	use getPredicateOnEdge to fold comparisons through PHI nodes, which implements GCC PR18046. This also gets us 360 more jump threads on 176.gcc. llvm-svn: 86953	2009-11-12 05:24:05 +00:00
Chris Lattner	22db4b5e0c	various fixes to the lattice transfer functions. llvm-svn: 86952	2009-11-12 04:57:13 +00:00
Chris Lattner	c893c4ed10	switch jump threading to use getPredicateOnEdge in one place making the new LVI stuff smart enough to subsume some special cases in the old code. Disable them when LVI is around, the testcase still passes. llvm-svn: 86951	2009-11-12 04:37:50 +00:00
Chris Lattner	ba45616958	with the new code we can thread non-instruction values. This allows us to handle the test10 testcase. llvm-svn: 86924	2009-11-12 01:41:34 +00:00
Chris Lattner	3f80d85191	this argument can be an arbitrary value, it doesn't need to be an instruction. llvm-svn: 86923	2009-11-12 01:37:43 +00:00
Chris Lattner	d5e25436a1	expose edge information and switch j-t to use it. llvm-svn: 86920	2009-11-12 01:29:10 +00:00
Chris Lattner	67146695b6	pass TD into a SimplifyCmpInst call. Add another case that uses LVI info when -enable-jump-threading-lvi is passed. llvm-svn: 86886	2009-11-11 22:31:38 +00:00
Chris Lattner	852f2653c4	remove the now dead condprop pass, PR3906. llvm-svn: 86810	2009-11-11 05:56:35 +00:00
Chris Lattner	fde1f8d0d8	stub out some LazyValueInfo interfaces, and have JumpThreading start using them in a trivial way when -enable-jump-threading-lvi is passed. enable-jump-threading-lvi will be my playground for awhile. llvm-svn: 86789	2009-11-11 02:08:33 +00:00
Chris Lattner	3a2ae908fe	add a fixme llvm-svn: 86766	2009-11-11 00:21:58 +00:00
Evan Cheng	12f146d8f7	Block terminator may be a switch. llvm-svn: 86761	2009-11-11 00:00:21 +00:00
Chris Lattner	9518fbb54e	implement a TODO by teaching jump threading about "xor x, 1". llvm-svn: 86739	2009-11-10 22:39:16 +00:00
Chris Lattner	852d6d64ff	move some generally useful functions out of jump threading into libanalysis and transformutils. llvm-svn: 86735	2009-11-10 22:26:15 +00:00
Chris Lattner	02e2cee7dc	fix a crash in SCCP handling extractvalue of an array, pointed out and tracked down by Stephan Reiter! llvm-svn: 86726	2009-11-10 22:02:09 +00:00
Chris Lattner	40b15f220d	improve comment. llvm-svn: 86723	2009-11-10 21:45:09 +00:00
Chris Lattner	80e7e5a429	Make jump threading eliminate blocks that just contain phi nodes, debug intrinsics, and an unconditional branch when possible. This reuses the TryToSimplifyUncondBranchFromEmptyBlock function split out of simplifycfg. llvm-svn: 86722	2009-11-10 21:40:01 +00:00
Evan Cheng	87fe40b32d	Generalize lsr code that optimize loop to count down towards zero. llvm-svn: 86715	2009-11-10 21:14:05 +00:00
Duncan Sands	23344095de	Add defensive break. llvm-svn: 86705	2009-11-10 19:36:40 +00:00
Duncan Sands	8d4cde2b55	Fix obvious typo. llvm-svn: 86694	2009-11-10 18:21:37 +00:00
Chris Lattner	b8f79ba10e	clarify logic. llvm-svn: 86689	2009-11-10 17:00:47 +00:00
Duncan Sands	1925d3a1d1	Teach DSE to eliminate useless trampolines. llvm-svn: 86683	2009-11-10 13:49:50 +00:00
Duncan Sands	04e0c95248	Add brackets to make gcc-4.4 happy. llvm-svn: 86681	2009-11-10 09:32:10 +00:00
Chris Lattner	1559bedcc7	unify the code that determines whether it is a good idea to change the type of a computation. This fixes some infinite loops when dealing with TD that has no native types. llvm-svn: 86670	2009-11-10 07:23:37 +00:00
Nick Lewycky	5b3def9b86	Simplify. llvm-svn: 86668	2009-11-10 07:00:43 +00:00
Nick Lewycky	9027147fb1	Reapply r86359, "Teach dead store elimination that certain intrinsics write to memory just like a store" with bug fixed (partial-overwrite.ll is the regression test). llvm-svn: 86667	2009-11-10 06:46:40 +00:00
Chris Lattner	38c44ea6b0	make jump threading recursively simplify expressions instead of doing it just one level deep. On the testcase we go from getting this: F1: ; preds = %T2 %F = and i1 true, %cond ; <i1> [#uses=1] br i1 %F, label %X, label %Y to a fully threaded: F1: ; preds = %T2 br label %Y This changes gets us to the point where we're forming (too many) switch instructions on doug's strswitch testcase. llvm-svn: 86646	2009-11-10 01:57:31 +00:00
Chris Lattner	be11db6894	don't invalidate PN, rewrite of this code is in progress anyway. llvm-svn: 86639	2009-11-10 01:19:06 +00:00
Chris Lattner	fb7f87d5a3	add a new SimplifyInstruction API, which is like ConstantFoldInstruction, except that the result may not be a constant. Switch jump threading to use it so that it gets things like (X & 0) -> 0, which occur when phi preds are deleted and the remaining phi pred was a zero. llvm-svn: 86637	2009-11-10 01:08:51 +00:00
Jeffrey Yasskin	b40d3f76a0	Fix DenseMap iterator constness. This patch forbids implicit conversion of DenseMap::const_iterator to DenseMap::iterator which was possible because DenseMapIterator inherited (publicly) from DenseMapConstIterator. Conversion the other way around is now allowed as one may expect. The template DenseMapConstIterator is removed and the template parameter IsConst which specifies whether the iterator is constant is added to DenseMapIterator. Actually IsConst parameter is not necessary since the constness can be determined from KeyT but this is not relevant to the fix and can be addressed later. Patch by Victor Zverovich! llvm-svn: 86636	2009-11-10 01:02:17 +00:00
Chris Lattner	a71e9d61be	factor simplification logic for AND and OR out to InstSimplify from instcombine. llvm-svn: 86635	2009-11-10 00:55:12 +00:00
Chris Lattner	ccfdceb22c	pull a bunch of logic out of instcombine into instsimplify for compare simplification, this handles the foldable fcmp x,x cases among many others. llvm-svn: 86627	2009-11-09 23:55:12 +00:00
Chris Lattner	beadc6e8c7	inline a simple function. llvm-svn: 86625	2009-11-09 23:31:49 +00:00
Chris Lattner	c1f19071f8	rename SimplifyCompare -> SimplifyCmpInst and split it into Simplify[IF]Cmp pieces. Add some predicates to CmpInst to determine whether a predicate is fp or int. llvm-svn: 86624	2009-11-09 23:28:39 +00:00
Chris Lattner	800aad3dda	use instructionsimplify instead of a weak clone of ad-hoc folding stuff. llvm-svn: 86616	2009-11-09 23:00:14 +00:00
Chris Lattner	2978ca7b79	stub out a new form of BasicBlock::RemovePredecessorAndSimplify which simplifies instruction users of PHIs when the phi is eliminated. This will be moved to transforms/utils after some other refactoring. llvm-svn: 86603	2009-11-09 22:32:36 +00:00
Chris Lattner	39c07b2eef	if a 'with overflow' intrinsic just has the normal result used, simplify it to a normal binop. Patch by Alastair Lynn, testcase by me. llvm-svn: 86524	2009-11-09 07:07:56 +00:00
Chris Lattner	feeabde753	fix PR5104: when printing a single character, return the result of putchar in case there is an error. llvm-svn: 86515	2009-11-09 04:57:04 +00:00
Chris Lattner	0685be3441	enhance PHI slicing to handle the case when a slicable PHI is begin used by a chain of other PHIs. llvm-svn: 86503	2009-11-09 01:38:00 +00:00
Owen Anderson	939ea35244	Small cleanups. llvm-svn: 86499	2009-11-09 00:48:15 +00:00
Owen Anderson	73fc616838	Revert my previous patch to ABCD and fix things the right way. There are two problems addressed here: 1) We need to avoid processing sigma nodes as phi nodes for constraint generation. 2) We need to generate constraints for comparisons against constants properly. This includes our first working ABCD test! llvm-svn: 86498	2009-11-09 00:44:44 +00:00
Chris Lattner	ea465e221e	comment typos pointed out by Duncan llvm-svn: 86497	2009-11-09 00:41:49 +00:00
Owen Anderson	058088f219	Fix an issue where the ordering of blocks within a function could lead to different constraint graphs being produced. The cause was that we were incorrectly marking sigma instructions as processed after handling the sigma-specific constraints for them, potentially neglecting to process them as normal instructions as well. Unfortunately, the testcase that inspired this still doesn't work because of a bug in the solver, which is next on the list to debug. llvm-svn: 86486	2009-11-08 22:36:55 +00:00
Chris Lattner	2299d4b6d8	Teach an instcombine to not pull trunc instructions through PHI nodes when both the source and dest are illegal types, since it would cause the phi to grow (for example, we shouldn't transform test14b's phi to a phi on i320). This fixes an infinite loop on i686 bootstrap with phi slicing turned on, so turn it back on. llvm-svn: 86483	2009-11-08 21:20:06 +00:00
Chris Lattner	a837e4db6b	reapply r8644[3-5] with only the scary part (SliceUpIllegalIntegerPHI) disabled. llvm-svn: 86480	2009-11-08 19:23:30 +00:00
Daniel Dunbar	4c41373c56	Speculatively revert r8644[3-5], they seem to be leading to infinite loops in llvm-gcc bootstrap. llvm-svn: 86478	2009-11-08 17:52:47 +00:00
Chris Lattner	c7a450b5b2	teach a couple of instcombine transformations involving PHIs to not turn a PHI in a legal type into a PHI of an illegal type, and add a new optimization that breaks up insane integer PHI nodes into small pieces (PR3451). llvm-svn: 86443	2009-11-08 08:21:13 +00:00
Nick Lewycky	b9397262b7	Improve tail call elimination to handle the switch statement. llvm-svn: 86403	2009-11-07 21:10:15 +00:00
Chris Lattner	c77d24b792	make instcombine only rewrite a chain of computation (eliminating some extends) if the new type of the computation is legal or if both the source and dest are illegal. This prevents instcombine from changing big chains of computation into i64 on 32-bit targets for example. llvm-svn: 86398	2009-11-07 19:11:46 +00:00
Chris Lattner	431000da21	Revert r86359, it is breaking the self host on the llvm-gcc-i386-darwin9 build bot. llvm-svn: 86391	2009-11-07 17:59:32 +00:00
Nick Lewycky	b6a3dd48f4	Teach dead store elimination that certain intrinsics write to memory just like a store. llvm-svn: 86359	2009-11-07 08:34:40 +00:00
Chris Lattner	5ff7f5672e	reapply 86289, 86278, 86270, 86267, 86266 & 86264 plus a fix (making pred factoring only happen if threading is guaranteed to be successful). This now survives an X86-64 bootstrap of llvm-gcc. llvm-svn: 86355	2009-11-07 08:05:03 +00:00
Nick Lewycky	9b669b3c4f	Oops, FunctionContainsEscapingAllocas is really used to mean two different things. Back out part of r86349 for a moment. llvm-svn: 86353	2009-11-07 07:42:38 +00:00
Nick Lewycky	5091272fdf	Dust off tail recursion elimination. Fix a fixme by applying CaptureTracking and add a .ll to demo the new capability. llvm-svn: 86349	2009-11-07 07:10:01 +00:00
Devang Patel	3a42e7ac65	Revert following patches to fix llvmgcc bootstrap. 86289, 86278, 86270, 86267, 86266 & 86264 Chris, please take a look. llvm-svn: 86321	2009-11-07 01:32:59 +00:00
Jeffrey Yasskin	8f77e948e5	Avoid "ambiguous 'else'" warning from gcc. llvm-svn: 86314	2009-11-07 00:26:47 +00:00
Chris Lattner	eb690feaef	Fix a bug where we'd call SplitBlockPredecessors with a pred in the set only once even if it has multiple edges to BB. llvm-svn: 86299	2009-11-06 23:19:58 +00:00
Eli Friedman	a70917b2f4	Remove function left over from other jump threading cleanup. llvm-svn: 86289	2009-11-06 21:24:57 +00:00
Chris Lattner	a8b9ce3f07	Fix a problem discovered on self host. llvm-svn: 86278	2009-11-06 19:21:48 +00:00
Chris Lattner	d91a7960bf	remove more code subsumed by r86264 llvm-svn: 86270	2009-11-06 18:24:32 +00:00
Chris Lattner	899ef22acb	eliminate some more code subsumed by r86264 llvm-svn: 86267	2009-11-06 18:22:54 +00:00
Chris Lattner	2f6184f6aa	remove now redundant code, r86264 handles this case. llvm-svn: 86266	2009-11-06 18:20:58 +00:00
Chris Lattner	68d2417e05	Extend jump threading to support much more general threading predicates. This allows us to jump thread things like: _ZN12StringSwitchI5ColorE4CaseILj7EEERS1_RAT__KcRKS0_.exit119: %tmp1.i24166 = phi i8 [ 1, %bb5.i117 ], [ %tmp1.i24165, %_Z....exit ], [ %tmp1.i24165, %bb4.i114 ] %toBoolnot.i87 = icmp eq i8 %tmp1.i24166, 0 ; <i1> [#uses=1] %tmp4.i90 = icmp eq i32 %tmp2.i, 6 ; <i1> [#uses=1] %or.cond173 = and i1 %toBoolnot.i87, %tmp4.i90 ; <i1> [#uses=1] br i1 %or.cond173, label %bb4.i96, label %_ZN12... Where it is "obvious" that when coming from %bb5.i117 that the 'and' is always false. This triggers a surprisingly high number of times in the testsuite, and gets us closer to generating good code for doug's strswitch testcase. This also make a bunch of other code in jump threading redundant, I'll rip out in the next patch. This survived an enable-checking llvm-gcc bootstrap. llvm-svn: 86264	2009-11-06 18:15:14 +00:00
Chris Lattner	8c12bb8cd7	remove some more Context arguments. llvm-svn: 86235	2009-11-06 05:59:53 +00:00
Chris Lattner	46b5c642b9	remove a bunch of extraneous LLVMContext arguments from various APIs, addressing PR5325. llvm-svn: 86231	2009-11-06 04:27:31 +00:00
Dan Gohman	a1bf0c0acc	Teach LSR to avoid calling SplitCriticalEdge on edges with indirectbr. llvm-svn: 86193	2009-11-05 23:34:59 +00:00
Dan Gohman	dca7ac335b	LoopDeletion depends on loops having dedicated exits. llvm-svn: 86180	2009-11-05 21:47:04 +00:00
Dan Gohman	a83ac2d9e7	Update various Loop optimization passes to cope with the possibility that LoopSimplify form may not be available. llvm-svn: 86175	2009-11-05 21:11:53 +00:00
Dan Gohman	d9fa1c9c1e	Call getAnalysis<LoopInfo> the normal way, instead of asking passed-in LoopPassManager for it. llvm-svn: 86163	2009-11-05 19:43:25 +00:00
Benjamin Kramer	b971445ab7	Teach SimplifyLibCalls to fold memcmp calls with constant arguments. llvm-svn: 86141	2009-11-05 17:44:22 +00:00
Benjamin Kramer	3fcbb82151	Do map insert+find in one step. TODO -= 2. llvm-svn: 86133	2009-11-05 14:33:27 +00:00
Chris Lattner	a09062758b	improve DSE when TargetData is not around, based on work by Hans Wennborg! llvm-svn: 86067	2009-11-04 23:20:12 +00:00
Chris Lattner	762b56fa8c	Fix an iterator invalidation bug that happens when a hashtable resizes in IPSCCP. This fixes PR5394. llvm-svn: 86036	2009-11-04 18:57:42 +00:00
Chris Lattner	cb3c64ee3c	move two functions up higher in the file. Delete a useless argument to EmitGEPOffset. Implement some new transforms for optimizing subtracts of two pointer to ints into the same vector. This happens for C++ iterator idioms for example, stringmap takes a const char* that points to the start and end of a string. Once inlined, we want the pointer difference to turn back into a length. This is rdar://7362831. llvm-svn: 86021	2009-11-04 08:05:20 +00:00
Chris Lattner	156b8c7109	reimplement multiple return value handling in IPSCCP, making it more aggressive an correct. This survives building llvm in 64-bit mode with optimizations and the built llvm passes make check. llvm-svn: 85973	2009-11-03 23:40:48 +00:00
Chris Lattner	2c427233d4	finish half thunk thought llvm-svn: 85937	2009-11-03 20:52:57 +00:00
Chris Lattner	cde8de519d	fix an IPSCCP bug I introduced when I changed IPSCCP to start working on functions that don't have local linkage. Basically, we need to be more careful about propagating argument information to functions whose results we aren't tracking. This fixes a miscompilation of LLVMCConfigurationEmitter.cpp when built with an llvm-gcc that has ipsccp enabled. llvm-svn: 85923	2009-11-03 19:24:51 +00:00
Chris Lattner	e1d5cd9f48	fix a subtle bug I introduced when refactoring SCCP. Testcase to follow. llvm-svn: 85903	2009-11-03 16:50:11 +00:00
Chris Lattner	fb14181b18	turn IPSCCP back on now that the iterator invalidation bug is fixed. llvm-svn: 85858	2009-11-03 03:42:51 +00:00
Chris Lattner	b70ef3c8c7	fix a nasty iterator invalidation bug from my conversion from std::map to DenseMap, exposed on release llvm-gcc bootstrap. llvm-svn: 85840	2009-11-02 23:25:39 +00:00
Chris Lattner	a15cc59dcb	revert r8579[56], which are causing unhappiness in buildbot land. llvm-svn: 85818	2009-11-02 19:31:10 +00:00
Chris Lattner	a3d794ebbb	disable IPSCCP support for multiple return values, it is buggy, so just disable it until I can fix it. llvm-svn: 85810	2009-11-02 18:22:51 +00:00
Chris Lattner	9d49f0c858	improve IPSCCP to be able to propagate the result of "!mayBeOverridden" function to calls of that function, regardless of whether it has local linkage or has its address taken. Not escaping should only affect whether we make an aggressive assumption about the arguments to a function, not whether we can track the result of it. llvm-svn: 85795	2009-11-02 07:33:59 +00:00
Chris Lattner	47837c5182	don't mark the arguments of prototype overdefined, they will never be queried. llvm-svn: 85793	2009-11-02 06:34:04 +00:00
Chris Lattner	5503328332	restore some code I removed in r85788, refactor it into a shared place instead of duplicating it 4 times. llvm-svn: 85792	2009-11-02 06:28:16 +00:00
Chris Lattner	4910b656b2	remove some confused code that dates from when we had "multiple return values" but not "first class aggregates" llvm-svn: 85791	2009-11-02 06:17:06 +00:00
Chris Lattner	809aee2f40	avoid redundant lookups in BBExecutable, and make it a SmallPtrSet. llvm-svn: 85790	2009-11-02 06:11:23 +00:00
Chris Lattner	e77c9aa04a	Use the libanalysis 'ConstantFoldLoadFromConstPtr' function instead of reinventing SCCP-specific logic. This gives us new powers. llvm-svn: 85789	2009-11-02 06:06:14 +00:00
Chris Lattner	f548403989	switch the main 'ValueState' map from being an std::map to being a DenseMap. Doing this required being aware of subtle iterator invalidation issues, but it provides a big speedup. In a release-asserts build, this sped up optimizing 403.gcc from 1.34s -> 0.79s (IPSCCP) and 1.11s -> 0.44s (SCCP). This commit also conflates in a bunch of general cleanups, sorry. llvm-svn: 85788	2009-11-02 05:55:40 +00:00
Chris Lattner	e82b087ae6	only IPSCCP incoming arguments if the function is executable, this fixes an assertion on the buildbot. llvm-svn: 85784	2009-11-02 03:25:55 +00:00
Chris Lattner	9e97fbe114	add a new ValueState::getConstantInt() helper, use it to simplify some code. llvm-svn: 85783	2009-11-02 03:21:36 +00:00
Chris Lattner	7ccf1a6df6	tidy up some more: remove some extraneous inline specifiers, return harder. llvm-svn: 85780	2009-11-02 03:03:42 +00:00
Chris Lattner	b5a13d4c90	eliminate the SCCPSolver::getValueMapping method. llvm-svn: 85778	2009-11-02 02:54:24 +00:00
Chris Lattner	c49ae9912a	fix failures introduced in r85774 llvm-svn: 85777	2009-11-02 02:48:17 +00:00
Chris Lattner	e405ed9651	factor duplicated code into a new DeleteInstructionInBlock function, eliminate temporary (and pointless) smallvector. llvm-svn: 85776	2009-11-02 02:47:51 +00:00
Chris Lattner	a3c39d394d	Chris used to use '...' instead of proper grammar. llvm-svn: 85775	2009-11-02 02:33:50 +00:00
Chris Lattner	6df5cec72f	remove some extraneous llvmcontext stuff. llvm-svn: 85774	2009-11-02 02:30:06 +00:00
Chris Lattner	efdd2bbce6	change LatticeVal to use PointerIntPair to save some space. llvm-svn: 85773	2009-11-02 02:20:32 +00:00
Chris Lattner	3cd6a61b27	fix instcombine to only do store sinking when the alignments of the two loads agree. Propagate that onto the new store. llvm-svn: 85772	2009-11-02 02:06:37 +00:00
Chris Lattner	328ef89bd1	when merging two loads, make sure to take the min of their alignment, not the max. This didn't matter until the previous patch because instcombine would refuse to sink loads with differenting alignments. llvm-svn: 85738	2009-11-01 20:07:07 +00:00
Chris Lattner	2a249e267a	split load sinking out to its own function, like gep sinking. llvm-svn: 85737	2009-11-01 20:04:24 +00:00
Chris Lattner	0b40a8bc0e	fix a bug noticed by inspection: when instcombine sinks loads through phis, it didn't preserve the alignment of the load. This is a missed optimization of the alignment is high and a miscompilation when the alignment is low. llvm-svn: 85736	2009-11-01 19:50:13 +00:00
Chris Lattner	37536b90e1	remove a bunch of locking from LLVMContextImpl. Since only one thread can be banging on a context at a time, this isn't needed. Owen, please review. llvm-svn: 85728	2009-11-01 18:42:03 +00:00
Chris Lattner	1a8b80ed5a	teach ipsccp and ipconstprop that a blockaddress doesn't 'take the address' of a function in a way that should prevent ip constprop. This allows clang/test/CodeGen/indirect-goto.c to pass with the new indirect goto lowering. llvm-svn: 85709	2009-11-01 06:11:53 +00:00
Chris Lattner	746139b736	strengthen an assumption: RevectorBlockTo knows that PredBB ended in an uncond branch because the pass requires BreakCriticalEdges. However, BCE doesn't eliminate critical adges from indbrs. llvm-svn: 85707	2009-11-01 04:23:20 +00:00
Chris Lattner	7a8db3a41a	if CostMetrics says to never duplicate some code, don't unswitch a loop. This prevents unswitching from duplicating indbr's. llvm-svn: 85705	2009-11-01 03:42:55 +00:00
Chris Lattner	a546dcf418	Make sure PRE doesn't split crit edges from indirectbr. llvm-svn: 85692	2009-10-31 22:11:15 +00:00
Chris Lattner	c872b09676	llvm::SplitEdge should refuse to split an edge from an indirectbr. Fix CodeGenPrepare to not try to split edges from indirectbr. llvm-svn: 85690	2009-10-31 22:04:43 +00:00
Chris Lattner	a742b8f94f	add a comment. llvm-svn: 85671	2009-10-31 17:48:31 +00:00
Dan Gohman	880c92ac1c	Rename forgetLoopBackedgeTakenCount to forgetLoop, because it clears out more information than just the stored backedge taken count. llvm-svn: 85664	2009-10-31 15:04:55 +00:00
Dan Gohman	969e83a4ff	Replace LoopUnrollPass.cpp's custom code-size estimation code using the new common CodeMetrics code. llvm-svn: 85663	2009-10-31 14:54:17 +00:00
Dan Gohman	af94015c18	Remove an unnecessary #include. llvm-svn: 85661	2009-10-31 14:39:43 +00:00
Dan Gohman	f35b6640f6	Update CMakeLists for recent renames. llvm-svn: 85660	2009-10-31 14:38:25 +00:00
Dan Gohman	f70e76c435	Rename UnrollLoop.cpp to LoopUnroll.cpp, and LoopUnroll.cpp to LoopUnrollPass.cpp, for consistency with other passes which are similarly split. llvm-svn: 85659	2009-10-31 14:37:31 +00:00
Dan Gohman	fb7f0e57b6	Remove CodeGenLICM. It's largely obsoleted by MachineLICM's new ability to unfold loop-invariant loads. llvm-svn: 85657	2009-10-31 14:35:41 +00:00
Dan Gohman	930aa9d3d2	Reapply r85634, with the bug fixed. llvm-svn: 85655	2009-10-31 14:22:52 +00:00
Evan Cheng	c16d8f2054	Revert 85634. It's breaking consumer-typeset (and others). llvm-svn: 85641	2009-10-31 01:28:06 +00:00
Dan Gohman	5bec30ca5d	Optimize around the fact that pred_iterator is slow: instead of sorting PHI operands by the predecessor order, sort them by the order used by the first PHI in the block. This is still suffucient to expose duplicates. llvm-svn: 85634	2009-10-30 23:15:21 +00:00
Dan Gohman	13e41edc71	Sort the incoming values in PHI nodes to match the predecessor order. This helps expose duplicate PHIs, which will make it easier for them to be eliminated. llvm-svn: 85623	2009-10-30 22:22:22 +00:00
Evan Cheng	5a6b9c40d6	Add option to createGVNPass to disable PRE. llvm-svn: 85609	2009-10-30 20:12:24 +00:00
Nick Lewycky	b43a43a8fd	Apply some cleanups. No functionality changes. llvm-svn: 85498	2009-10-29 07:35:15 +00:00
Chris Lattner	ee8b951e73	teach various passes about blockaddress. We no longer crash on any clang tests. llvm-svn: 85465	2009-10-29 01:21:20 +00:00
Edward O'Callaghan	1042ca112f	No newline at end of file. llvm-svn: 85390	2009-10-28 15:04:53 +00:00
Benjamin Kramer	ecc60b80b0	Update CMake file. llvm-svn: 85389	2009-10-28 13:29:18 +00:00
Owen Anderson	2b2bd28973	Treat lifetime begin/end markers as allocations/frees respectively for the purposes for GVN/DSE. llvm-svn: 85383	2009-10-28 07:05:35 +00:00
Nick Lewycky	175308c43e	Add ABCD, a generalized implementation of the Elimination of Array Bounds Checks on Demand algorithm which looks at arbitrary branches instead of loop iterations. This is GSoC work by Andre Tavares with only editorial changes applied! llvm-svn: 85382	2009-10-28 07:03:15 +00:00
Devang Patel	11cf3f4a27	Factor out redundancy from clone() implementations. llvm-svn: 85327	2009-10-27 22:16:29 +00:00
Victor Hernandez	f390e04a47	Rename MallocFreeHelper as MemoryBuiltins llvm-svn: 85286	2009-10-27 20:05:49 +00:00
Mike Stump	2b0a49a682	VS build fix, patch by Marius Wachtler. llvm-svn: 85197	2009-10-27 02:14:13 +00:00
Eric Christopher	7a50b280c1	Add objectsize intrinsic and hook it up through codegen. Doesn't do anything than return "I don't know" at the moment. llvm-svn: 85189	2009-10-27 00:52:25 +00:00
Dan Gohman	f808106bbe	Add braces to avoid ambiguous else. llvm-svn: 85185	2009-10-27 00:11:02 +00:00
Victor Hernandez	762195bd01	Rename MallocHelper as MallocFreeHelper, since it now also identifies calls to free() llvm-svn: 85181	2009-10-26 23:58:56 +00:00
Owen Anderson	03b5de67b0	Add a straight-forward implementation of SCCVN for aggressively eliminating scalar redundancies. llvm-svn: 85179	2009-10-26 23:55:47 +00:00
Victor Hernandez	de5ad42aa1	Remove FreeInst. Remove LowerAllocations pass. Update some more passes to treate free calls just like they were treating FreeInst. llvm-svn: 85176	2009-10-26 23:43:48 +00:00
Dan Gohman	34e38afa96	Simplify this code. LoopDeletion doesn't need to explicit check that the loop exiting block dominates the latch block; if ScalarEvolution can prove that the trip-count is finite, that's sufficient. llvm-svn: 85165	2009-10-26 22:18:58 +00:00
Dan Gohman	672927f393	Code that checks WillNotOverflowSignedAdd before creating an Add can safely use the NSW bit on the Add. llvm-svn: 85164	2009-10-26 22:14:22 +00:00
Ted Kremenek	ce8f626f82	Update CMake files. llvm-svn: 85161	2009-10-26 22:06:01 +00:00
Dan Gohman	6a1d9eace9	Check in the experimental GEP splitter pass. This pass splits complex GEPs (more than one non-zero index) into simple GEPs (at most one non-zero index). In some simple experiments using this it's not uncommon to see 3% overall code size wins, because it exposes redundancies that can be eliminated, however it's tricky to use because instcombine aggressively undoes the work that this pass does. llvm-svn: 85144	2009-10-26 19:12:14 +00:00
Dan Gohman	6a10d5ebd3	Fix a typo in a comment. llvm-svn: 85120	2009-10-26 15:55:24 +00:00
Chris Lattner	683eed3286	reapply r85085 with a bugfix to avoid infinite looping. All of the 'demorgan' related xforms need to use dyn_castNotVal, not m_Not. llvm-svn: 85119	2009-10-26 15:40:07 +00:00
Dan Gohman	d632f89596	Make LSR's OptimizeShadowIV ignore induction variables with negative strides for now, because it doesn't handle them correctly. This fixes a miscompile of SingleSource/Benchmarks/Misc-C++/ray. This problem was usually hidden because indvars transforms such induction variables into negations of canonical induction variables. llvm-svn: 85118	2009-10-26 15:32:57 +00:00
Evan Cheng	8014a728b9	Revert 85085. It causes infinite looping during llvm-gcc build. llvm-svn: 85090	2009-10-26 03:51:32 +00:00
Chris Lattner	2e6564d6ff	Implement PR3266 & PR5276, folding: not (or (icmp, icmp)) -> and(icmp, icmp) llvm-svn: 85085	2009-10-26 01:06:31 +00:00
Nick Lewycky	54d7179a25	Remove ICmpInst::isSignedPredicate which was a reimplementation CmpInst::isSigned. llvm-svn: 85037	2009-10-25 05:20:17 +00:00
Dan Gohman	8f4078ba39	Rename isLoopExit to isLoopExiting, for consistency with the wording used elsewhere - an exit block is a block outside the loop branched to from within the loop. An exiting block is a block inside the loop that branches out. llvm-svn: 85019	2009-10-24 23:34:26 +00:00
Dan Gohman	b979794e4b	Rewrite LoopRotation's SSA updating code using SSAUpdater. llvm-svn: 85016	2009-10-24 23:19:52 +00:00
Victor Hernandez	e297149e26	Auto-upgrade free instructions to calls to the builtin free function. Update all analysis passes and transforms to treat free calls just like FreeInst. Remove RaiseAllocations and all its tests since FreeInst no longer needs to be raised. llvm-svn: 84987	2009-10-24 04:23:03 +00:00
Victor Hernandez	8acf2956b8	Remove AllocationInst. Since MallocInst went away, AllocaInst is the only subclass of AllocationInst, so it no longer is necessary. llvm-svn: 84969	2009-10-23 21:09:37 +00:00
Dan Gohman	41d00ac45b	Make LoopDeletion check the maximum backedge taken count, rather than the exact backedge taken count, when checking for infinite loops. This allows it to delete loops with multiple exit conditions. llvm-svn: 84952	2009-10-23 17:10:01 +00:00
Chris Lattner	cf7e8947e9	move another load optimization from instcombine -> libanalysis. llvm-svn: 84841	2009-10-22 06:44:07 +00:00
Chris Lattner	51d2f70e32	move 'loading i32 from string' optimization from instcombine to libanalysis. Instcombine shrinking... does this even make sense??? llvm-svn: 84840	2009-10-22 06:38:35 +00:00
Chris Lattner	1664a4fd86	Move some constant folding logic for loads out of instcombine into Analysis/ConstantFolding.cpp. This doesn't change the behavior of instcombine but makes other clients of ConstantFoldInstruction able to handle loads. This was partially extracted from Eli's patch in PR3152. llvm-svn: 84836	2009-10-22 06:25:11 +00:00
Chris Lattner	c7a962d3b3	fix PR5262. llvm-svn: 84810	2009-10-22 00:17:26 +00:00
Chris Lattner	966526cbfb	revert r84754, it isn't the right approach. Edwin, please propose patches for fixes like this instead of committing them directly. llvm-svn: 84799	2009-10-21 23:41:58 +00:00
Victor Hernandez	be9e179104	Make changes to rev 84292 as requested by Chris Lattner. Most changes are cleanup, but there is 1 correctness fix: I fixed InstCombine so that the icmp is removed only if the malloc call is removed (which requires explicit removal because the Worklist won't DCE any calls since they can have side-effects). llvm-svn: 84772	2009-10-21 19:11:40 +00:00
Torok Edwin	1539a352a6	Fix PR5262: when folding select into PHI, make sure all operands are available in the PHI's Basic Block. This uses a conservative approach, because we don't have dominator info in instcombine. llvm-svn: 84754	2009-10-21 10:49:00 +00:00
Chris Lattner	8ed7bef409	make GVN work better when TD is not around: "In the existing code, if the load and the value to replace it with are of different types and target data is available, it tries to use the target data to coerce the replacement value to the type of the load. Otherwise, it skips all effort to handle the type mismatch and just feeds the wrongly-typed replacement value to replaceAllUsesWith, which triggers an assertion. The patch replaces it with an outer if checking for type mismatch, and an inner if-else that checks whether target data is available and, if not, returns false rather than trying to replace the load." Patch by Kenneth Uildriks! llvm-svn: 84739	2009-10-21 04:11:19 +00:00
Dan Gohman	b6b8ec769c	Restore LoopUnswitch's block-oriented threshold. LoopUnswitch now checks both the estimated code size and the number of blocks when deciding whether to do a non-trivial unswitch. This protects it from some very undesirable worst-case behavior on large numbers of loop-unswitchable conditions, such as in the testcase in PR5259. llvm-svn: 84661	2009-10-20 20:06:09 +00:00
Torok Edwin	729d92bd74	Fix PR4313: IPSCCP was not setting the lattice value for the invoke instruction when the invoke had multiple return values: it set the lattice value only on the extractvalue. This caused the invoke's lattice value to remain the default (undefined), and later propagated to extractvalue's operand, which incorrectly introduces undefined behavior. llvm-svn: 84637	2009-10-20 15:15:09 +00:00
Owen Anderson	168ad6985e	Refactor lookup_or_add to contain _MUCH_ less duplicated code. Add support for numbering first class aggregate instructions while we're at it. llvm-svn: 84547	2009-10-19 22:14:22 +00:00
Owen Anderson	1059b5b32d	Simplify some code. llvm-svn: 84533	2009-10-19 21:14:57 +00:00
Victor Hernandez	a3aaf85e23	Remove MallocInst from LLVM Instructions. llvm-svn: 84299	2009-10-17 01:18:07 +00:00
Victor Hernandez	c7d6a8327c	Autoupgrade malloc insts to malloc calls. Update testcases that rely on malloc insts being present. Also prematurely remove MallocInst handling from IndMemRemoval and RaiseAllocations to help pass tests in this incremental step. llvm-svn: 84292	2009-10-17 00:00:19 +00:00
Dan Gohman	99429a00ff	Move zext and sext casts fed by loads into the same block as the load, to help SelectionDAG fold them into the loads, unless conditions are unfavorable. llvm-svn: 84271	2009-10-16 20:59:35 +00:00
Chris Lattner	c855b45b78	only try to fold constantexpr operands when the worklist is first populated, don't bother every time going around the main worklist. This speeds up a release-asserts opt -std-compile-opts on 403.gcc by about 4% (1.5s). It seems to speed up the most expensive instances of instcombine by ~10%. llvm-svn: 84171	2009-10-15 04:59:28 +00:00
Chris Lattner	dd1f68a10c	don't bother calling ConstantFoldInstruction unless there is a use of the instruction (which disqualifies stores, unreachable, etc) and at least the first operand is a constant. This filters out a lot of obvious cases that can't be folded. Also, switch the IRBuilder to a TargetFolder, which tries harder. llvm-svn: 84170	2009-10-15 04:13:44 +00:00
Devang Patel	92f8619923	Use isVoidTy() llvm-svn: 84118	2009-10-14 17:29:00 +00:00
Chris Lattner	6b9044db01	make instcombine's instruction sinking more aggressive in the presence of PHI nodes. llvm-svn: 84103	2009-10-14 15:21:58 +00:00
Devang Patel	a677136900	Check void type before using RAUWd. llvm-svn: 84049	2009-10-13 22:56:32 +00:00
Devang Patel	115741ba79	Do not check use_empty() before replaceAllUsesWith(). This gives ValueHandles a chance to get properly updated. llvm-svn: 84033	2009-10-13 21:41:20 +00:00
Dan Gohman	2dc6f8de03	Use the new CodeMetrics class to compute code size instead of manually counting instructions. llvm-svn: 84016	2009-10-13 20:12:23 +00:00
Dan Gohman	71ca652475	Make LoopUnswitch's cost estimation count Instructions, rather than BasicBlocks, so that it doesn't blindly procede in the presence of large individual BasicBlocks. This addresses a class of code-size expansion problems. llvm-svn: 83992	2009-10-13 17:50:43 +00:00
Evan Cheng	f815861591	Make licm debug message readable. llvm-svn: 83908	2009-10-12 22:25:23 +00:00
Dale Johannesen	4c9f0e8f53	Fix warning. llvm-svn: 83870	2009-10-12 18:45:32 +00:00
Chris Lattner	8abd572dae	populate instcombine's initial worklist more carefully, causing it to visit instructions from the start of the function to the end of the function in the first path. This greatly speeds up some pathological cases (e.g. PR5150). Try #3, this time with some unneeded debug info stuff removed which was causing dead pointers to be added to the worklist. llvm-svn: 83818	2009-10-12 03:58:40 +00:00
Chris Lattner	8ce6b36c86	revert r83814 for now, it is making the llvm-gcc bootstrap unhappy. llvm-svn: 83817	2009-10-11 23:56:08 +00:00
Chris Lattner	78d6310429	populate instcombine's initial worklist more carefully, causing it to visit instructions from the start of the function to the end of the function in the first path. This greatly speeds up some pathological cases (e.g. PR5150). llvm-svn: 83814	2009-10-11 23:17:43 +00:00
Chris Lattner	2c2deae5ac	remove some harmful code that would turn an insertelement on an undef into a shuffle even if it was used by another insertelement. If the visitation order of instcombine was wrong, this would turn a chain of insertelements into a chain of shufflevectors, which was quite painful. Since CollectShuffleElements handles these cases, the code can just be nuked. llvm-svn: 83810	2009-10-11 23:02:46 +00:00
Chris Lattner	c6cdbfbfdd	teach instcombine to simplify xor's harder, catching the new testcase. llvm-svn: 83799	2009-10-11 22:22:13 +00:00
Chris Lattner	6e6ac47125	cleanups llvm-svn: 83797	2009-10-11 22:00:32 +00:00
Chris Lattner	1639234775	cleanup, no functionality change. llvm-svn: 83795	2009-10-11 21:36:10 +00:00
Chris Lattner	fd27f8a5b3	generalize a transformation even more: we don't care whether the input the the mul is a zext from bool, just that it is all zeros other than the low bit. This fixes some phase ordering issues that would cause us to miss some xforms in mul.ll when the worklist is visited differently. llvm-svn: 83794	2009-10-11 21:29:45 +00:00
Chris Lattner	406cb75c6b	simplify a transformation by making it more general. llvm-svn: 83792	2009-10-11 21:22:21 +00:00
Chris Lattner	f39f4f928a	temporarily revert previous patch llvm-svn: 83791	2009-10-11 21:05:34 +00:00
Chris Lattner	bb058d3a23	populate instcombine's initial worklist more carefully, causing it to visit instructions from the start of the function to the end of the function in the first path. This greatly speeds up some pathological cases (e.g. PR5150). llvm-svn: 83790	2009-10-11 21:04:37 +00:00
Torok Edwin	8b3081350e	Remove CleanupDbgInfo, instcombine does this and its not worth duplicating it here. llvm-svn: 83789	2009-10-11 19:58:35 +00:00
Torok Edwin	907ec36943	LICM shouldn't sink/delete debug information. Fix this and add a testcase. For now the metadata of sinked/hoisted instructions is still wrong, but that'll be fixed when instructions will have debug metadata directly attached. llvm-svn: 83786	2009-10-11 19:15:54 +00:00
Chris Lattner	85c85c5e04	when folding duplicate conditions, delete the now-probably-dead instruction tree feeding it. llvm-svn: 83778	2009-10-11 18:39:58 +00:00
Chris Lattner	e374382b8f	implement rdar://7293527, a trivial instcombine that llvm-gcc gets but clang doesn't, because it is implemented in GCC's fold routine. llvm-svn: 83761	2009-10-11 07:53:15 +00:00
Chris Lattner	97b1405207	implement a transformation in jump threading that is currently done by condprop, but do it in a much more general form. The basic idea is that we can do a limited form of tail duplication in the case when we have a branch on a phi. Moving the branch up in to the predecessor block makes instruction selection much easier and encourages chained jump threadings. llvm-svn: 83759	2009-10-11 07:24:57 +00:00
Chris Lattner	6ce85e85f5	restructure some code, no functionality change. llvm-svn: 83756	2009-10-11 04:40:21 +00:00
Chris Lattner	f466bc84c9	factor some code better and move a function, no functionality change. llvm-svn: 83755	2009-10-11 04:33:43 +00:00
Chris Lattner	f99a74e24b	make jump threading on a phi with undef inputs happen. llvm-svn: 83754	2009-10-11 04:18:15 +00:00
Chris Lattner	b6c65faa64	switch GVN to use SSAUpdater. Besides removing a lot of complexity from GVN, this also speeds it up, inserts fewer PHI nodes (see the testcase) and allows it to remove more loads (due to fewer PHI nodes standing in the way). llvm-svn: 83746	2009-10-10 23:50:30 +00:00
Chris Lattner	89d2a5c4f3	remove dead code llvm-svn: 83742	2009-10-10 23:04:12 +00:00
Chris Lattner	84095071ea	Change jump threading to use the new SSAUpdater class instead of DemoteRegToStack. This makes it more efficient (because it isn't creating a ton of load/stores that are eventually removed by a later mem2reg), and more slightly more effective (because those load/stores don't get in the way of threading). llvm-svn: 83706	2009-10-10 09:05:58 +00:00
Chris Lattner	f30a2b0c86	random tidying llvm-svn: 83701	2009-10-10 06:22:45 +00:00
Dan Gohman	09984279fd	Add a form of addPreserved which takes a string argument, to allow passes to declare that they preserve other passes without needing to pull in additional header file or library dependencies. Convert MachineFunctionPass and CodeGenLICM to make use of this. llvm-svn: 83555	2009-10-08 17:00:02 +00:00
Jeffrey Yasskin	dafd08ea7e	In instcombine's debug output, avoid printing ADD for instructions that are already on the worklist, and print Visited when an instruction is about to be visited. Net, on one input, this reduced the output size by at least 9x. llvm-svn: 83510	2009-10-08 00:12:24 +00:00
Eric Christopher	5b741f3d14	80-column and whitespace fixes. llvm-svn: 83489	2009-10-07 21:14:25 +00:00
Ted Kremenek	2275a7dfef	Update CMake file. llvm-svn: 83404	2009-10-06 19:45:38 +00:00
Chris Lattner	a893f5bdf5	remove predicate simplifier, it never got the last bugs beaten out of it, and jump threading, condprop and gvn are now getting most of the benefit. This was approved by Nicholas and Nicolas. llvm-svn: 83390	2009-10-06 16:59:46 +00:00
Duncan Sands	9ed7b16bf3	Introduce and use convenience methods for getting pointer types where the element is of a basic builtin type. For example, to get an i8* use getInt8PtrTy. llvm-svn: 83379	2009-10-06 15:40:36 +00:00
Dan Gohman	e525d9ddc0	Remove an unnnecessary LLVMContext argument in ConstantFoldLoadThroughGEPConstantExpr. llvm-svn: 83311	2009-10-05 16:36:26 +00:00
Dan Gohman	238cf49812	Use Use::operator= instead of Use::set, for consistency. llvm-svn: 83310	2009-10-05 16:31:55 +00:00
Chris Lattner	fdd8790718	strength reduce a ton of type equality tests to check the typeid (Through the new predicates I added) instead of going through a context and doing a pointer comparison. Besides being cheaper, this allows a smart compiler to turn the if sequence into a switch. llvm-svn: 83297	2009-10-05 05:54:46 +00:00
Chris Lattner	463716d559	instcombine shouldn't delete all null checks for mallocs. This fixes PR5130. llvm-svn: 83290	2009-10-05 02:47:47 +00:00
Douglas Gregor	d846fbf20d	Remove GVNPRE.cpp from the CMake makefile llvm-svn: 83194	2009-10-01 05:30:05 +00:00
Chris Lattner	5f3cc06cd2	remove the GVNPRE pass. It has been subsumed by the GVN pass. Ok'd by Owen. llvm-svn: 83193	2009-10-01 02:18:36 +00:00
Chris Lattner	0261b5d2d2	The select instruction is not neccesarily in the same block as the phi nodes. Make sure to phi translate from the right block. This fixes a llvm-building-llvm failure on GVN-PRE.cpp llvm-svn: 82970	2009-09-28 06:49:44 +00:00
Chris Lattner	4425660b1f	simplify some code. llvm-svn: 82936	2009-09-27 21:46:50 +00:00
Chris Lattner	b2e88cd01c	The bitcast case is not needed here: instcombine turns icmp(bitcast(x), null) -> icmp(x, null) already. llvm-svn: 82935	2009-09-27 21:42:46 +00:00
Chris Lattner	8b4d3dfbbf	calls are already unmovable, malloc doesn't need a special case. llvm-svn: 82933	2009-09-27 21:36:19 +00:00
Chris Lattner	f9e0c7f84b	calls to external functions are already marked overdefined, special casing malloc isn't needed. llvm-svn: 82932	2009-09-27 21:35:11 +00:00
Chris Lattner	466d57f6c1	calls are rejected above, no need to special case malloc here. llvm-svn: 82929	2009-09-27 21:31:39 +00:00
Chris Lattner	b391e87263	allow pushing icmps through phis with multiple uses and across critical edges. These are important to push up to encourage jump threading. This shrinks 176.gcc a bit. llvm-svn: 82923	2009-09-27 20:46:36 +00:00
Chris Lattner	ae289632ef	Enhance the previous fix for PR4895 to allow more values than just simple constants for the true/false value of the select. We now do phi translation etc. This really fixes PR4895 :) llvm-svn: 82917	2009-09-27 20:18:49 +00:00
Chris Lattner	facb867af3	implement PR4895, by making FoldOpIntoPhi handle select conditions that are phi nodes. Also tighten up FoldOpIntoPhi to treat constantexpr operands to phis just like other variables, avoiding moving constantexpr computations around. Patch by Daniel Dunbar. llvm-svn: 82913	2009-09-27 19:57:57 +00:00
Dan Gohman	0e70af36c0	Grab an LLVM Context from an instruction that exists rather than one that is deleted in some situations. This fixes a use-after-free. llvm-svn: 82903	2009-09-27 16:10:30 +00:00
Dan Gohman	fc20b67e80	Tell ScalarEvolution to forget everything it knows about a loop before rotating the loop, since loop rotation is a very significant change. llvm-svn: 82901	2009-09-27 15:37:03 +00:00
Nick Lewycky	42fb7452df	Instruction::clone does not need to take an LLVMContext&. Remove that and update all the callers. llvm-svn: 82889	2009-09-27 07:38:41 +00:00
Dan Gohman	62995c71a2	Fix SimplifyLibCalls to transfer attributes from callees rather than calls, since direct calls don't always reflect the attributes of their callees. llvm-svn: 82867	2009-09-26 18:10:13 +00:00
Dan Gohman	394468dc8e	Rename ConstantFP's getInf to getInfinity. llvm-svn: 82823	2009-09-25 23:40:21 +00:00
Dan Gohman	5ffd53892d	Transform pow(x, 0.5) to (x == -inf ? inf : fabs(sqrt(x))), which is typically faster then doing a general pow. llvm-svn: 82819	2009-09-25 23:10:17 +00:00
Torok Edwin	21bd8c9fc5	Constant propagating byval pointer is safe if function is readonly. llvm-svn: 82700	2009-09-24 18:33:42 +00:00
Torok Edwin	f95a450ef9	Don't constant propagate byval pointers, since they are not really pointers, but rather structs passed by value. This fixes PR5038. llvm-svn: 82689	2009-09-24 09:47:18 +00:00
Chris Lattner	247053867e	big endian systems shift by bits too, hopefully this will fix the ppc bootstrap problems. llvm-svn: 82464	2009-09-21 17:55:47 +00:00
Dan Gohman	43d6830ea0	Nick pointed out that DominanceFrontier and DominanceTree are preserved by setPreservesCFG(). llvm-svn: 82463	2009-09-21 17:54:42 +00:00
Dan Gohman	af57ae3da4	Remove the special-case for constants in PHI nodes; it's not really helpful, and it didn't correctly handle the case of constants input to PHIs for backedges. llvm-svn: 82462	2009-09-21 17:53:35 +00:00
Chris Lattner	9045f235d2	fix PR5016, a crash I introduced in GVN handing first class arrays and structs, which cannot be bitcast to integers. llvm-svn: 82460	2009-09-21 17:24:04 +00:00
Chris Lattner	4d8af2f1ae	enable non-local analysis and PRE of large store -> little load. This doesn't kick in too much because of phi translation issues, but this can be resolved in the future. llvm-svn: 82447	2009-09-21 06:48:08 +00:00
Chris Lattner	0cdc17eb50	convert an std::pair to an explicit struct. llvm-svn: 82446	2009-09-21 06:30:24 +00:00
Chris Lattner	d28f90897a	move some functions, add a comment. llvm-svn: 82444	2009-09-21 06:24:16 +00:00
Chris Lattner	9d7fb29522	split HandleLoadFromClobberingStore in two pieces: one that does the analysis, one that does the xform. llvm-svn: 82443	2009-09-21 06:22:46 +00:00
Chris Lattner	0a9616d906	Improve GVN to be able to forward substitute a small load from a piece of a large store when both are in the same block. This allows clang to compile the testcase in PR4216 to this code: _test_bitfield: movl 4(%esp), %eax movl %eax, %ecx andl $-65536, %ecx orl $32962, %eax andl $40186, %eax orl %ecx, %eax ret This is not ideal, but is a whole lot better than the code produced by llvm-gcc: _test_bitfield: movw $-32574, %ax orw 4(%esp), %ax andw $-25350, %ax movw %ax, 4(%esp) movw 7(%esp), %cx shlw $8, %cx movzbl 6(%esp), %edx orw %cx, %dx movzwl %dx, %ecx shll $16, %ecx movzwl %ax, %eax orl %ecx, %eax ret and dramatically better than that produced by gcc 4.2: _test_bitfield: pushl %ebx call L3 "L00000000001$pb": L3: popl %ebx movl 8(%esp), %eax leal 0(,%eax,4), %edx sarb $7, %dl movl %eax, %ecx andl $7168, %ecx andl $-7201, %ebx movzbl %dl, %edx andl $1, %edx sall $5, %edx orl %ecx, %ebx orl %edx, %ebx andl $24, %eax andl $-58336, %ebx orl %eax, %ebx orl $32962, %ebx movl %ebx, %eax popl %ebx ret llvm-svn: 82439	2009-09-21 05:57:11 +00:00
Chris Lattner	1eefa9c427	formatting cleanups, no functionality change. llvm-svn: 82426	2009-09-21 02:42:51 +00:00
Chris Lattner	a0aa8fb6a6	Move CoerceAvailableValueToLoadType earlier in GVN.cpp. Hook it up so that nonlocal and partially redundant loads can use it as well. The testcase shows examples of craziness this can handle. This triggers many times in 176.gcc. llvm-svn: 82403	2009-09-20 20:09:34 +00:00
Chris Lattner	7c62d8a1a8	change the interface to CoerceAvailableValueToLoadType to be more generic. llvm-svn: 82402	2009-09-20 19:31:14 +00:00
Chris Lattner	1dd48c34e5	enhance GVN to forward substitute a stored value to a load (and load -> load) when the base pointers must alias but when they are different types. This occurs very very frequently in 176.gcc and other code that uses bitfields a lot. llvm-svn: 82399	2009-09-20 19:03:47 +00:00
Daniel Dunbar	7d6781b0fe	Tabs -> spaces, and remove trailing whitespace. llvm-svn: 82355	2009-09-20 02:20:51 +00:00
Victor Hernandez	5d034499ad	Enhance transform passes so that they apply the same tranforms to malloc calls as to MallocInst. Reviewed by Dan Gohman. llvm-svn: 82300	2009-09-18 22:35:49 +00:00
Daniel Dunbar	487d1c8138	Update CMake. llvm-svn: 82097	2009-09-17 00:06:48 +00:00
Dan Gohman	0f64d71d99	Add a new pass for doing late hoisting of floating-point and vector constants out of loops. These aren't covered by the regular LICM pass, because in LLVM IR constants don't require separate instructions. They're not always covered by the MachineLICM pass either, because it doesn't know how to unfold folded constant-pool loads. This is somewhat experimental at this point, and off by default. llvm-svn: 82076	2009-09-16 20:25:11 +00:00
Dan Gohman	bd0050810c	Change FoldPHIArgBinOpIntoPHI to decline folding if it would introduce two phis, similar to the FoldPHIArgGEPIntoPHI change. Also, delete some comments that don't reflect the code. llvm-svn: 82053	2009-09-16 16:50:24 +00:00
Andreas Neustifter	f8cb758ba8	Preserve ProfileInfo during CodeGenPrepare. llvm-svn: 82034	2009-09-16 09:26:52 +00:00
Dan Gohman	3b7ce109ec	Don't sink gep operators through phi nodes if the result would require more than one phi, since that leads to higher register pressure on entry to the phi. This is especially problematic when the phi is in a loop header, as it increases register pressure throughout the loop. llvm-svn: 81993	2009-09-16 02:01:52 +00:00
Nick Lewycky	7465cd769c	Add more newlines to make up for the ones removed from the end of instructions. llvm-svn: 81851	2009-09-15 07:08:25 +00:00
Chris Lattner	e9a4992399	add newline to debug dump llvm-svn: 81840	2009-09-15 05:14:57 +00:00
Dan Gohman	f9eafce3af	When extending a memset range past the front, set the alignment of the memset region to the alignment of the new start address. llvm-svn: 81810	2009-09-14 23:39:10 +00:00
Dan Gohman	ec4557f324	Fix SplitCriticalEdge to properly update LCSSA form when splitting a loop exit edge -- new PHIs may be needed not only for the additional splits that are made to preserve LoopSimplify form, but also for the original split. Factor out the code that inserts new PHIs so that it can be used for both. Remove LoopRotation.cpp's code for manually updating LCSSA form, as it is now redundant. This fixes PR4934. llvm-svn: 81363	2009-09-09 18:18:18 +00:00
Mike Stump	deaf572ca8	Reflow comment. llvm-svn: 81361	2009-09-09 17:57:16 +00:00
Dan Gohman	c56af25c01	Fix an 80-column violation. llvm-svn: 81354	2009-09-09 17:17:19 +00:00
Chris Lattner	9ce1781ef4	remove an extremely dubious instcombine transformation of extractelement(load). llvm-svn: 81239	2009-09-08 18:48:01 +00:00
Dan Gohman	3ddbc242fb	Re-apply r80926, with fixes: keep the domtree informed of new blocks that get created during loop unswitching, and fix SplitBlockPredecessors' LCSSA updating code to create new PHIs instead of trying to just move existing ones. Also, optimize Loop::verifyLoop, since it gets called a lot. Use searches on a sorted list of blocks instead of calling the "contains" function, as is done in other places in the Loop class, since "contains" does a linear search. Also, don't call verifyLoop from LoopSimplify or LCSSA, as the PassManager is already calling verifyLoop as part of LoopInfo's verifyAnalysis. llvm-svn: 81221	2009-09-08 15:45:00 +00:00
Chris Lattner	d1b21c6092	remove a turd llvm-svn: 81186	2009-09-08 03:47:41 +00:00
Chris Lattner	d3210e1a20	instcombine transforms vector loads that are only used by extractelement operations into a bitcast of the pointer, then a gep, then a scalar load. Disable this when the vector only has one element, because it leads to infinite loops in instcombine (PR4908). This transformation seems like a really bad idea to me, as it will likely disable CSE of vector load/stores etc and can be better done in the code generator when profitable. This goes all the way back to the first days of packed types, r25299 specifically. I'll let those people who care about the performance of vector code decide what to do with this. llvm-svn: 81185	2009-09-08 03:44:51 +00:00
Chris Lattner	f2ab40a46f	Fix PR4882, by making MemCpyOpt not dereference removed stores to get the context for the newly created operations. Patch by Jakub Staszak! llvm-svn: 81175	2009-09-08 00:27:14 +00:00
Dan Gohman	1b84908f92	Reappy r80998, now that the GlobalOpt bug that it exposed on MiniSAT is fixed. llvm-svn: 81172	2009-09-07 23:54:19 +00:00
Duncan Sands	89720bbd11	Remove some not-really-used variables, as warned about by icc (#593, partial). Patch by Erick Tryzelaar. llvm-svn: 81115	2009-09-06 12:41:19 +00:00
Daniel Dunbar	86c6a6ef0f	Fix a possible crash call setIsInBounds. - I think there are more instances of this, but I think they are fixed in Dan's incoming patch. This one was preventing me from doing a bugpoint reduction though. llvm-svn: 81103	2009-09-06 02:31:36 +00:00
Evan Cheng	904199547b	Revert r80926. It causes loop unswitch assertion and slow down some JIT tests significantly. llvm-svn: 81101	2009-09-06 02:26:10 +00:00
Daniel Dunbar	10ea8bb8e0	Revert "Include optional subclass flags, such as inbounds, nsw, etc., ...", this breaks MiniSAT on x86_64. llvm-svn: 81098	2009-09-06 00:11:24 +00:00
Dan Gohman	0c2477c26b	Include optional subclass flags, such as inbounds, nsw, etc., in the Constant uniquing tables. This allows distinct ConstantExpr objects with the same operation and different flags. Even though a ConstantExpr "a + b" is either always overflowing or never overflowing (due to being a ConstantExpr), it's still necessary to be able to represent it both with and without overflow flags at the same time within the IR, because the safety of the flag may depend on the context of the use. If the constant really does overflow, it wouldn't ever be safe to use with the flag set, however the use may be in code that is never actually executed. This also makes it possible to merge all the flags tests into a single test. llvm-svn: 80998	2009-09-04 12:08:11 +00:00
Dan Gohman	4c1bdcf5d7	Add a verifyAnalysis to LoopInfo, LoopSimplify, and LCSSA form that verify that these passes are properly preserved. Fix several transformation passes that claimed to preserve LoopSimplify form but weren't. llvm-svn: 80926	2009-09-03 16:31:42 +00:00
Dan Gohman	22571485b3	Change PHINode::hasConstantValue to have a DominatorTree argument instead of a bool argument, and to do the dominator check itself. This makes it eaiser to use when DominatorTree information is available. llvm-svn: 80920	2009-09-03 15:34:35 +00:00
Duncan Sands	0edc7100ba	Keep track of how many memmove calls were turned into memcpy calls. llvm-svn: 80915	2009-09-03 13:37:16 +00:00
Chris Lattner	27266f164f	In C++, code is not allowed to call main. In C it is, this simplifylibcalls optimization is thus valid for C++ but not C. It's not important enough to worry about for C++ apps, so just remove it. rdar://7191924 llvm-svn: 80887	2009-09-03 05:19:59 +00:00
Gabor Greif	2d60e1ec0c	back out my recent commit (r80858), it seems to break self-hosting buildbot's stage 2 configure llvm-svn: 80871	2009-09-03 02:02:59 +00:00
Gabor Greif	14dfba6d66	re-commit r66920 (which has been backed out in r66953) I may have more luck this time. I'll back out if needed... llvm-svn: 80858	2009-09-03 00:18:58 +00:00
Chris Lattner	4916267c97	fix PR4815: some cases where DeleteDeadInstruction can delete the instruction BBI points to. llvm-svn: 80768	2009-09-02 06:31:02 +00:00
Chris Lattner	09a79dcfdf	clean up this code a bit. llvm-svn: 80767	2009-09-02 06:15:37 +00:00
Chris Lattner	2dd09dbdf7	eliminate VISIBILITY_HIDDEN from Transforms/Scalar. PR4861 llvm-svn: 80766	2009-09-02 06:11:42 +00:00
Chris Lattner	64b5842986	fix PR4837, some bugs folding vector compares. These return a vector of i1, not i1 itself. llvm-svn: 80761	2009-09-02 05:12:37 +00:00
Chris Lattner	1145e33bc6	enhance memcpy opt to turn memmoves into memcpy when the src/dest don't alias. Remove an old and poorly reduced testcase that fails with this transform for reasons unrelated to the original test. llvm-svn: 80693	2009-09-01 17:56:32 +00:00
Chris Lattner	b5557a7b42	random code cleanups, no functionality change. llvm-svn: 80682	2009-09-01 17:09:55 +00:00
Chris Lattner	ff5f1e4d70	fix some cases where instcombine would change hte IR but not return true from runOnFunction llvm-svn: 80562	2009-08-31 06:57:37 +00:00
Chris Lattner	19dd315e67	improve -debug output, so that -debug is more likely to print when instcombine is changing stuff. llvm-svn: 80538	2009-08-31 05:17:58 +00:00
Chris Lattner	4e3e930743	fix a bug I introduced with my 'instcombine builder' refactoring changes: SimplifyDemandedBits can't use the builder yet because it has the wrong insertion point. This fixes a crash building MultiSource/Benchmarks/PAQ8p llvm-svn: 80537	2009-08-31 04:36:22 +00:00
Chris Lattner	73913f4cd3	Fix PR4748: don't fold gep(bitcast(x)) into bitcast(gep) when x is itself a bitcast. Since we have gep(bitcast(bitcast(y))) in this case, just wait for the two bitcasts to get zapped. This prevents instcombine from confusing some aliasing stuff, and allows it to directly eliminate the load in the testcase. llvm-svn: 80508	2009-08-30 20:38:21 +00:00
Chris Lattner	c2f2cf896e	misc cleanup llvm-svn: 80507	2009-08-30 20:36:46 +00:00
Chris Lattner	a3e620caba	add getPointerAddressSpace() to GEP instruction, use the method in a few scalar xforms to simplify things. llvm-svn: 80506	2009-08-30 20:06:40 +00:00
Chris Lattner	c856539edf	eliminate InsertCastBefore, use the builder instead. llvm-svn: 80505	2009-08-30 20:01:10 +00:00
Chris Lattner	606da5fed8	eliminate InsertBitCastBefore, just use the builder instead. llvm-svn: 80504	2009-08-30 19:47:22 +00:00
Chris Lattner	5966341a2e	convert a bunch more calls to InsertNewInstBefore to use the new Instcombine builder. llvm-svn: 80501	2009-08-30 18:50:58 +00:00
Chris Lattner	8326d529da	fix typo llvm-svn: 80500	2009-08-30 17:53:59 +00:00
Chris Lattner	022a582de2	give instcombine a custom IRBuilder that adds new instructions to the workslist and is set to insert new instructions before the current one. Convert a bunch of stuff that used to call InsertNewInstBefore over to use it, greatly simplifying code and making it more natural. There is still a lot more to go, but this is a good start. llvm-svn: 80492	2009-08-30 07:44:24 +00:00
Chris Lattner	a0c89ee1da	add a new InstCombineWorklist::AddValue method that works even if the operand is not an instruction. Simplify most uses of AddOperandsToWorkList to use AddValue and inline it into the one remaining callsite. llvm-svn: 80488	2009-08-30 06:27:41 +00:00
Chris Lattner	bacd05c2eb	move AddUsersToWorkList to the worklist processing class, make the argument stronger typed. llvm-svn: 80487	2009-08-30 06:22:51 +00:00
Chris Lattner	795bfdbb55	rename AddUsesToWorkList -> AddOperandsToWorkList. The former looks too much like AddUsersToWorkList and keeps confusing me. Remove AddSoonDeadInstToWorklist and change its two callers to do the same thing in a simpler way. llvm-svn: 80486	2009-08-30 06:20:05 +00:00
Chris Lattner	905976b1db	inline the trivial AddToWorkList/RemoveFromWorkList methods into their callers. simplify ReplaceInstUsesWith. Make EraseInstFromFunction only add operands to the worklist if there aren't too many of them (this was a scalability win for crazy programs that was only infrequently enforced). Switch more code to using EraseInstFromFunction instead of duplicating it inline. Change some fcmp/icmp optimizations to modify fcmp/icmp in place instead of creating a new one and deleting the old one just to change the predicate. llvm-svn: 80483	2009-08-30 06:13:40 +00:00
Chris Lattner	93ad6170fd	fix a bug I introduced in r80478 found by the build bot. llvm-svn: 80482	2009-08-30 05:56:44 +00:00
Chris Lattner	97fd3599e1	refactor instcombine's worklist processing stuff out to its own class. llvm-svn: 80481	2009-08-30 05:55:36 +00:00
Chris Lattner	b2995e1eb1	more cleanups: remove some redundant code, and simplify some other places. llvm-svn: 80478	2009-08-30 05:30:55 +00:00
Chris Lattner	06c687b59e	eliminate the temporary SrcGEPOperands smallvector. llvm-svn: 80477	2009-08-30 05:08:50 +00:00
Chris Lattner	e26bf17423	simplify/detangle some control flow. llvm-svn: 80476	2009-08-30 05:00:50 +00:00
Chris Lattner	d7b6e913fe	simplify and cleanup some code, remove some code that just does constant folding of gep's: this is already handled in a more general way. No functionality change. llvm-svn: 80475	2009-08-30 04:49:01 +00:00
Dan Gohman	0dfe73ac9e	Remove an unnecessary Context argument. llvm-svn: 80454	2009-08-29 23:39:38 +00:00
Chris Lattner	bda82c20f3	Fix PR3913, patch by Jakub Staszak! llvm-svn: 80327	2009-08-28 00:43:14 +00:00
Owen Anderson	109ca5a14a	Make this into a static method. llvm-svn: 80170	2009-08-26 22:55:11 +00:00
Dan Gohman	3b1938dda4	Remove unused variables. llvm-svn: 80058	2009-08-26 00:13:22 +00:00
Dan Gohman	ad1f0a1101	Eliminate the unused Context argument on one of the ICmpInst and FCmpInst constructors. llvm-svn: 80049	2009-08-25 23:17:54 +00:00
Dan Gohman	c8a27f2a5c	Rename Instruction::isIdenticalTo to Instruction::isIdenticalToWhenDefined, and introduce a new Instruction::isIdenticalTo which tests for full identity, including the SubclassOptionalData flags. Also, fix the Instruction::clone implementations to preserve the SubclassOptionalData flags. Finally, teach several optimizations how to handle SubclassOptionalData correctly, given these changes. This fixes the counterintuitive behavior of isIdenticalTo not comparing the full value, and clone not returning an identical clone, as well as some subtle bugs that could be caused by these. Thanks to Nick Lewycky for reporting this, and for an initial patch! llvm-svn: 80038	2009-08-25 22:11:20 +00:00
Dan Gohman	337d56110e	Special-case static allocas in IndVarSimplify's loop invariant sinking code, since they are special. If the loop preheader happens to be the entry block of a function, don't sink static allocas out of it. This fixes PR4775. llvm-svn: 80010	2009-08-25 17:42:10 +00:00
Benjamin Kramer	1a25d733f9	Kill off more cerr/cout uses and prune includes a bit. llvm-svn: 79852	2009-08-23 11:37:21 +00:00
Chris Lattner	317dbbcfb1	eliminate uses of cerr() llvm-svn: 79834	2009-08-23 07:05:07 +00:00
Chris Lattner	4dc3edde9f	remove a few DOUTs here and there. llvm-svn: 79832	2009-08-23 06:35:02 +00:00
Chris Lattner	b1d782bec9	eliminate the std::ostream form of WriteAsOperand and update clients. This also updates dominator related stuff. llvm-svn: 79825	2009-08-23 05:17:37 +00:00
Chris Lattner	3924bb5792	remove the std::ostream version of module and type printing. llvm-svn: 79823	2009-08-23 04:52:46 +00:00
Chris Lattner	b25de3ff60	eliminate the "Value" printing methods that print to a std::ostream. This required converting a bunch of stuff off DOUT and other cleanups. llvm-svn: 79819	2009-08-23 04:37:46 +00:00
Dan Gohman	16f5415f5b	Rename hasNoUnsignedOverflow and hasNoSignedOverflow to hasNoUnsignedWrap and hasNoSignedWrap, for consistency with the nuw and nsw properties. llvm-svn: 79539	2009-08-20 17:11:38 +00:00
Dan Gohman	7167f42769	Fix a few places to check if TargetData is available before using it. llvm-svn: 79493	2009-08-19 23:38:22 +00:00
Dan Gohman	915302c605	Make SROA and PredicateSimplifier cope if TargetData is not available. This is very conservative for now. llvm-svn: 79442	2009-08-19 18:22:18 +00:00
Dan Gohman	dea2358c68	Fix SimplifyLibcalls and ValueTracking to check mayBeOverridden before performing optimizations based on constant string values. llvm-svn: 79384	2009-08-19 00:11:12 +00:00
Dan Gohman	10f1471e2f	Make TargetData optional in MemCpyOptimizer. llvm-svn: 79306	2009-08-18 01:17:52 +00:00
Dan Gohman	9f2b3db428	Make TargetData optional in SimplifyLibCalls. llvm-svn: 79298	2009-08-18 00:48:13 +00:00
Dan Gohman	8dd69f88ea	Fix debug output to include a newline after printing a Value, now that Value's operator<< doesn't include one. llvm-svn: 79240	2009-08-17 15:25:05 +00:00
Nick Lewycky	aa464002f0	Don't crash trying to promote VLAs. llvm-svn: 79226	2009-08-17 05:37:31 +00:00
Benjamin Kramer	693a9c57a6	Don't try to get the context from an erased Instruction. llvm-svn: 79134	2009-08-15 21:07:49 +00:00
Owen Anderson	55f1c09e31	Push LLVMContexts through the IntegerType APIs. llvm-svn: 78948	2009-08-13 21:58:54 +00:00
Mon P Wang	a95379d165	When InstCombine simplifies a load -> extract element to gep -> load, place the new load by the old load instead of by the extract element because a store could have occurred between the load and extract element. llvm-svn: 78891	2009-08-13 05:12:13 +00:00
Andreas Bolka	5c2764b3e9	Simplify conditional. llvm-svn: 78889	2009-08-13 03:05:20 +00:00
Andreas Bolka	aef432505b	Simplify and reduce indentation using early exits. No intended functionality change. llvm-svn: 78888	2009-08-13 03:00:57 +00:00
Andreas Bolka	438ba80afa	DEBUGify some DOUTs. llvm-svn: 78887	2009-08-13 02:45:03 +00:00
Andreas Bolka	177a2f5313	Prune trailing whitespace. llvm-svn: 78886	2009-08-13 02:40:50 +00:00
Dan Gohman	4ac2f639cd	Transform -X/C to X/-C, implementing a README.txt entry. llvm-svn: 78812	2009-08-12 16:37:02 +00:00
Dan Gohman	908da3d97e	Optimize (x/C)*C to x if the division is exact. llvm-svn: 78811	2009-08-12 16:33:09 +00:00
Dan Gohman	43103abef0	Update instcombine's debug output to account for Value*'s operator<< not appending its own newline. llvm-svn: 78810	2009-08-12 16:28:31 +00:00
Dan Gohman	5476cfdb15	Remove a bunch more now-unnecessary Context arguments. llvm-svn: 78809	2009-08-12 16:23:25 +00:00
Dan Gohman	6b490ce4c7	Eliminate a bunch of now unnecessary explicit Context variables. llvm-svn: 78808	2009-08-12 16:04:34 +00:00
Owen Anderson	117c9e8497	Add contexts to some of the MVT APIs. No functionality change yet, just the infrastructure work needed to get the contexts to where they need to be first. llvm-svn: 78759	2009-08-12 00:36:31 +00:00
Dan Gohman	dbae4db67a	Optimize exact sdiv by a constant power of 2 to ashr. llvm-svn: 78714	2009-08-11 20:47:47 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Daniel Dunbar	3b5008e23a	More ProfileInfo improvements. - Part of optimal static profiling patch sequence by Andreas Neustifter. - Store edge, block, and function information separately for each functions (instead of in one giant map). - Return frequencies as double instead of int, and use a sentinel value for missing information. llvm-svn: 78477	2009-08-08 17:43:09 +00:00
Devang Patel	b1106fbdbc	Fix dom frontier update. This fixes PR4667. Patch by Jakub Staszak. llvm-svn: 78388	2009-08-07 17:16:44 +00:00
Dan Gohman	298bce2aa9	Check for !isa<Constant> instead of isa<Instruction>. This matches what the comment says, and it avoids spurious BitCast instructions for Argument values. llvm-svn: 78121	2009-08-04 23:23:56 +00:00
Dan Gohman	f011f5a8a2	Add a new Constant::getIntegerValue helper function, and convert a few places in InstCombine to use it, to fix problems handling pointer types. This fixes the recent llvm-gcc bootstrap error. llvm-svn: 78005	2009-08-03 22:07:33 +00:00
Eli Friedman	cfd3bbe643	Make SimplifyDemandedUseBits generate vector constants where appropriate. Patch per report on llvmdev. No testcase because the original report didn't come with a testcase, and I can't come up with a case that actually fails. llvm-svn: 77986	2009-08-03 19:15:42 +00:00
Owen Anderson	5a1acd9912	Move a few more APIs back to 2.5 forms. The only remaining ones left to change back are metadata related, which I'm waiting on to avoid conflicting with Devang. llvm-svn: 77721	2009-07-31 20:28:14 +00:00
Dan Gohman	ef3ef7f645	Fix GVN's debug output, now that operator<< on Value* doesn't print a trailing newline. llvm-svn: 77719	2009-07-31 20:24:18 +00:00
Eli Friedman	ca9a4f1045	PR4662: Fix a crash introduced by the recent LLVMContext changes. llvm-svn: 77716	2009-07-31 19:36:47 +00:00
Owen Anderson	23a204d91b	Move getTrue() and getFalse() to 2.5-like APIs. llvm-svn: 77685	2009-07-31 17:39:07 +00:00
Owen Anderson	b292b8ce70	Move more code back to 2.5 APIs. llvm-svn: 77635	2009-07-30 23:03:37 +00:00
Daniel Dunbar	132f78395a	Twines: Don't allow implicit conversion from integers, this is too tricky. llvm-svn: 77605	2009-07-30 17:37:43 +00:00
Daniel Dunbar	6afdc5e694	Switch obvious clients to Twine instead of utostr (when they were already using a Twine, e.g., for names). - I am a little ambivalent about this; we don't want the string conversion of utostr, but using overload '+' mixed with string and integer arguments is sketchy. On the other hand, this particular usage is something of an idiom. llvm-svn: 77579	2009-07-30 04:20:37 +00:00
Douglas Gregor	47d02732e0	Eliminate a few unused-variable warnings llvm-svn: 77519	2009-07-29 22:41:10 +00:00
Owen Anderson	4056ca9568	Move types back to the 2.5 API. llvm-svn: 77516	2009-07-29 22:17:13 +00:00
Daniel Dunbar	98ddd164d8	Fix PR4645 which was fallout from the fix for PR4641. - Call RAUW to delete all instructions (this is a patch from Nick Lewycky). llvm-svn: 77512	2009-07-29 22:00:43 +00:00
Owen Anderson	487375e9a2	Move ConstantExpr to 2.5 API. llvm-svn: 77494	2009-07-29 18:55:55 +00:00
Nick Lewycky	f82326b984	Bulk erasing instructions without RAUWing them is unsafe. Instead, break them into a new BB that has no predecessors. llvm-svn: 77433	2009-07-29 05:17:50 +00:00
Owen Anderson	4aa3295a65	Return ConstantVector to 2.5 API. llvm-svn: 77366	2009-07-28 21:19:26 +00:00
Owen Anderson	c2c7932c64	Change ConstantArray to 2.5 API. llvm-svn: 77347	2009-07-28 18:32:17 +00:00
Dan Gohman	31a9b9880b	Teach instcombine to respect and preserve inbounds. Add inbounds to a few tests where it is required for the expected transformation. llvm-svn: 77290	2009-07-28 01:40:03 +00:00
Dan Gohman	9ba43abc70	Replace dyn_castGetElementPtr with dyn_cast<GEPOperator>. llvm-svn: 77286	2009-07-28 00:37:50 +00:00
Dan Gohman	a3dcff5900	Grab the LLVMContext and parent Module of SI ahead of the point where SI can get deleted. This fixes a use of free'd memory. This fixes Externals/Povray. llvm-svn: 77285	2009-07-28 00:37:06 +00:00
Mike Stump	d934cc06c6	Avoid build warnings. llvm-svn: 77271	2009-07-27 23:14:11 +00:00
Owen Anderson	69c464dec4	Move ConstantFP construction back to the 2.5-ish API. llvm-svn: 77247	2009-07-27 20:59:43 +00:00
Daniel Dunbar	6115b39ffd	Remove Value::getName{Start,End}, the last of the old Name APIs. llvm-svn: 77152	2009-07-26 09:48:23 +00:00
Daniel Dunbar	ca414c7cae	Remove Value::getNameLen llvm-svn: 77148	2009-07-26 08:34:35 +00:00
Daniel Dunbar	9813b0b025	Eliminate some uses of DOUT, cerr, and getNameStart(). llvm-svn: 77145	2009-07-26 07:49:05 +00:00
Daniel Dunbar	e03eecb75f	Remove Value::{isName, getNameRef}. Also, change MDString to use a StringRef. llvm-svn: 77098	2009-07-25 23:55:21 +00:00
Daniel Dunbar	4975db6276	Initial update to VMCore to use Twines for string arguments. - The only meat here is in Value.{h,cpp} the rest is essential 'const std::string &' -> 'const Twine &'. llvm-svn: 77048	2009-07-25 04:41:11 +00:00
Eric Christopher	53e1cd7254	Fix 80-col violations. llvm-svn: 77045	2009-07-25 02:45:27 +00:00
Eric Christopher	c974225976	Move ExtractElementInst to ::Create instead of new. Update all uses. llvm-svn: 77044	2009-07-25 02:28:41 +00:00
Dan Gohman	1ddf98ad8e	Convert a few more things to use raw_ostream. llvm-svn: 77039	2009-07-25 01:43:01 +00:00
Dan Gohman	29f2baf3b3	Convert a few more uses of llvm/Support/Streams.h to raw_ostream. llvm-svn: 77033	2009-07-25 01:13:51 +00:00
Dan Gohman	43d19d61d4	Make AliasAnalysis and related classes use getAnalysisIfAvailable<TargetData>(). llvm-svn: 77028	2009-07-25 00:48:42 +00:00
Daniel Dunbar	0dd5e1ed39	More migration to raw_ostream, the water has dried up around the iostream hole. - Some clients which used DOUT have moved to DEBUG. We are deprecating the "magic" DOUT behavior which avoided calling printing functions when the statement was disabled. In addition to being unnecessary magic, it had the downside of leaving code in -Asserts builds, and of hiding potentially unnecessary computations. llvm-svn: 77019	2009-07-25 00:23:56 +00:00
Owen Anderson	edb4a70325	Revert the ConstantInt constructors back to their 2.5 forms where possible, thanks to contexts-on-types. More to come. llvm-svn: 77011	2009-07-24 23:12:02 +00:00
Dan Gohman	0b5be94c79	Fix this condition I accidentally inverted. llvm-svn: 76988	2009-07-24 18:31:07 +00:00
Dan Gohman	67243a4bec	Convert several more passes to use getAnalysisIfAvailable<TargetData>() instead of getAnalysis<TargetData>(). llvm-svn: 76982	2009-07-24 18:13:53 +00:00
Daniel Dunbar	5bf72e20eb	Convert StringMap to using StringRef for its APIs. - Yay for '-'s and simplifications! - I kept StringMap::GetOrCreateValue for compatibility purposes, this can eventually go away. Likewise the StringMapEntry Create functions still follow the old style. - NIFC. llvm-svn: 76888	2009-07-23 18:17:34 +00:00
Chris Lattner	88ab854873	refactor a blob of code out to a new 'FoldOrOfFCmps' function and simplify it. llvm-svn: 76866	2009-07-23 05:46:22 +00:00
Chris Lattner	7d55541e56	Make some existing optimizations that would only trigger on scalars also apply to vectors. This allows us to compile this: #include <emmintrin.h> __m128i a(__m128 a, __m128 b) { return a==a & b==b; } __m128i b(__m128 a, __m128 b) { return a!=a \| b!=b; } to: _a: cmpordps %xmm1, %xmm0 ret _b: cmpunordps %xmm1, %xmm0 ret with clang instead of to a ton of horrible code. llvm-svn: 76863	2009-07-23 05:32:17 +00:00
Chris Lattner	9085438e4b	refactor a bunch of code out into a helper function, no functionality change. llvm-svn: 76859	2009-07-23 05:14:02 +00:00
Owen Anderson	47db941fd3	Get rid of the Pass+Context magic. llvm-svn: 76702	2009-07-22 00:24:57 +00:00
Dan Gohman	3666c34db8	Convert instcombine from using using getAnalysis<TargetData> to getAnalysisIfAvailable<TargetData>. llvm-svn: 76676	2009-07-21 23:21:54 +00:00
Owen Anderson	c37bc69e91	Rename getConstantInt{True\|False} to get{True\|False} at Chris' behest. llvm-svn: 76598	2009-07-21 18:03:38 +00:00
Owen Anderson	2ad52176f9	Move a bit more state over to the LLVMContext. llvm-svn: 76533	2009-07-21 02:47:59 +00:00
Chris Lattner	470a8da807	use ExpandInlineAsm on TargetLowering instead of TargetAsmInfo. llvm-svn: 76442	2009-07-20 17:52:52 +00:00
Dan Gohman	33a3fd0b9c	Revert the addition of hasNoPointerOverflow to GEPOperator. Getelementptrs that are defined to wrap are virtually useless to optimization, and getelementptrs that are undefined on any kind of overflow are too restrictive -- it's difficult to ensure that all intermediate addresses are within bounds. I'm going to take a different approach. Remove a few optimizations that depended on this flag. llvm-svn: 76437	2009-07-20 17:43:30 +00:00
Eli Friedman	048e78fc5b	Canonicalize bitcasts between types like <1 x i64> and i64 to insertelement/extractelement. I'm not entirely sure this is precisely what we want to do: should we prefer bitcast(insertelement) or insertelement(bitcast)? Similarly. should we prefer extractelement(bitcast) or bitcast(extractelement)? llvm-svn: 76345	2009-07-18 23:06:53 +00:00
Eli Friedman	eb6bcf3462	Back out 76300; apparently the preference is to canonicalize the other way (bitcast -> insert/extractelement). llvm-svn: 76325	2009-07-18 19:04:16 +00:00
Eli Friedman	52dbfc21c5	Add combine: X sdiv (1 << Y) -> X udiv (1 << Y) when X doesn't have the sign bit set. llvm-svn: 76304	2009-07-18 09:53:21 +00:00
Eli Friedman	992d0e0b74	Remove no-op check. llvm-svn: 76302	2009-07-18 09:21:25 +00:00
Eli Friedman	44e9836b17	Remove dead check. llvm-svn: 76301	2009-07-18 09:12:15 +00:00
Eli Friedman	a807aae226	Canonicalize insert/extractelement from single-element vectors into bitcasts. It would also be possible to canonicalize the other way; does anyone have a preference? llvm-svn: 76300	2009-07-18 09:07:47 +00:00
Eli Friedman	ff9bf97ceb	Fix simplifylibcalls memset recognition to work on 64-bit platforms where int is 32 bits. llvm-svn: 76293	2009-07-18 08:34:51 +00:00
Nick Lewycky	0d13903563	Replace intersectWith with maximalIntersectWith. The latter guarantees that all values belonging to the intersection will belong to the resulting range. The former was inconsistent about that point (either way is fine, just pick one.) This is part of PR4545. llvm-svn: 76289	2009-07-18 06:34:42 +00:00
Dan Gohman	e1019db658	Convert more code to use Operator instead of explicitly handling both ConstantExpr and Instruction. This involves duplicating some code between GetElementPtrInst and GEPOperator, but it's not a lot. llvm-svn: 76265	2009-07-17 23:55:56 +00:00
Dan Gohman	1d548d851a	Make BasicAliasAnalysis and Value::getUnderlyingObject use GEPOperator's hasNoPointer0verflow(), and make a few places in instcombine that create GEPs that may overflow clear the NoOverflow value. Among other things, this partially addresses PR2831. llvm-svn: 76252	2009-07-17 22:25:10 +00:00
Dan Gohman	a565d4f937	Fix some typos in a comment. llvm-svn: 76249	2009-07-17 22:16:21 +00:00
Dan Gohman	80ca01c466	Add a new Operator class, for handling Instructions and ConstantExprs in a convenient manner, factoring out some common code from InstructionCombining and ValueTracking. Move the contents of BinaryOperators.h into Operator.h and use Operator to generalize them to support ConstantExprs as well as Instructions. llvm-svn: 76232	2009-07-17 20:47:02 +00:00
Eli Friedman	b8f6a4fc8e	Replace isTrapping with a new, similar method called isSafeToSpeculativelyExecute. The new method is a bit closer to what the callers actually care about in that it rejects more things callers don't want. It also adds more precise handling for integer division, and unifies code for analyzing the legality of a speculative load. llvm-svn: 76150	2009-07-17 04:28:42 +00:00
Owen Anderson	20b34ac794	Move the ConstantInt uniquing table into LLVMContextImpl. This exposed a number of issues in our current context-passing stuff, which is also fixed here llvm-svn: 76089	2009-07-16 18:04:31 +00:00
Owen Anderson	4fdeba9706	Revert yesterday's change by removing the LLVMContext parameter to AllocaInst and MallocInst. llvm-svn: 75863	2009-07-15 23:53:25 +00:00
Eli Friedman	662da55c5f	Switch invars away from using isTrapping when it really shouldn't be using it. llvm-svn: 75852	2009-07-15 22:48:29 +00:00
Eli Friedman	ebe66ab13b	Don't restrict the set of instructions where we try to constant-fold the operands; it's possible to end up with a constant-foldable operand to most instructions, even those which can't trap. llvm-svn: 75845	2009-07-15 22:13:34 +00:00
Dan Gohman	b0f8e9960d	Fix indentation. llvm-svn: 75723	2009-07-15 01:26:32 +00:00
Dan Gohman	c43e47938a	Make makeLoopInvariant report whether it made any changes or not, and use this to simplify more code. llvm-svn: 75722	2009-07-15 01:25:43 +00:00
Owen Anderson	b6b2530000	Move EVER MORE stuff over to LLVMContext. llvm-svn: 75703	2009-07-14 23:09:55 +00:00
Dale Johannesen	3be62697df	Revert 75571; I'm convinced this isn't the right thing to do. llvm-svn: 75642	2009-07-14 17:48:25 +00:00
Torok Edwin	fbcc663cbf	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Dan Gohman	e141364e5c	Require IVUsers after LCSSA, since LCSSA does not preserve IVUsers. This results in the pass manager running IVUsers only once for indvars, instead of twice. llvm-svn: 75633	2009-07-14 14:26:23 +00:00
Eli Friedman	14379df4e6	Fix trivial todo in instcombine. llvm-svn: 75586	2009-07-14 02:01:53 +00:00
Dan Gohman	4d6149f356	Update LoopSimplify and LoopUnswitch to use the new makeLoopInvariant function. llvm-svn: 75584	2009-07-14 01:37:59 +00:00
Dan Gohman	03d5d0f451	Fix indvars to not assume that a loop with a single unique exit block has a single unique exiting block. llvm-svn: 75579	2009-07-14 01:09:02 +00:00
Dale Johannesen	85ae7480d9	Don't delete asm's just because their inputs are undefined; xor R, R is a common and valid idiom for zeroing a register, for example. llvm-svn: 75571	2009-07-14 00:45:38 +00:00
Eli Friedman	4b95026194	PR4548: optimize zext+udiv+trunc to udiv. llvm-svn: 75539	2009-07-13 22:46:01 +00:00
Eli Friedman	7e1716dc9d	Canonicalize boolean +/- a constant to a select. (I think it's reasonably clear that we want to have a canonical form for constructs like this; if anyone thinks that a select is not the best canonical form, please tell me.) llvm-svn: 75531	2009-07-13 22:27:52 +00:00
Owen Anderson	bb2501bbbe	These don't really need contexts either. llvm-svn: 75528	2009-07-13 22:18:28 +00:00
Dan Gohman	cc85ae132c	Make Loop and MachineLoop be subclasses of LoopBase, rather than typedefs, using the Curiously Recurring Template Pattern with LoopBase. This will help further refactoring, and future functionality for Loop. Also, Headers can now foward-declare Loop, instead of pulling in LoopInfo.h or doing tricks. llvm-svn: 75519	2009-07-13 21:51:15 +00:00
Eli Friedman	42170b0a9e	Misc simplifications to InstCombiner::commonIntCastTransforms. Most of the changes are allowed by not calling this function for bitcasts. The Instruction::AShr case is dead because SimplifyDemandedInstructionBits handles that case. llvm-svn: 75514	2009-07-13 21:45:57 +00:00
Eli Friedman	7f3a529ae9	Fix comment. llvm-svn: 75499	2009-07-13 20:58:59 +00:00
Owen Anderson	542619e6d5	Move more functionality over to LLVMContext. llvm-svn: 75497	2009-07-13 20:58:05 +00:00
Eli Friedman	f13aa44d4f	Don't bother to call commonIntCastTransforms for bitcasts; int->int bitcasts will always be eliminated anyway. llvm-svn: 75495	2009-07-13 20:53:00 +00:00
Owen Anderson	53a52215b5	Begin the painful process of tearing apart the rat'ss nest that is Constants.cpp and ConstantFold.cpp. This involves temporarily hard wiring some parts to use the global context. This isn't ideal, but it's the only way I could figure out to make this process vaguely incremental. llvm-svn: 75445	2009-07-13 04:09:18 +00:00
Eli Friedman	575db66e1b	Remove check which is duplicated in InstCombiner::visitSelectInstWithICmp. llvm-svn: 75409	2009-07-12 02:00:05 +00:00
Torok Edwin	56d0659726	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Torok Edwin	ccb29cd290	Convert more assert(0)+abort() -> LLVM_UNREACHABLE, and abort()/exit() -> llvm_report_error(). llvm-svn: 75363	2009-07-11 13:10:19 +00:00
Nick Lewycky	dcfdce9067	Move a method that creates constant ranges relative to another constant range per icmp predicate out of predsimplify and into ConstantRange. Add another utility method that determines whether one range is a subset of another. Combine with the former to determine whether icmp pred range, range is known to be true or not. llvm-svn: 75357	2009-07-11 06:15:39 +00:00
Owen Anderson	16e7674f4b	Push LLVMContext through the PatternMatch API. llvm-svn: 75255	2009-07-10 17:35:01 +00:00
Owen Anderson	1e5f00e7a7	This started as a small change, I swear. Unfortunately, lots of things call the [I\|F]CmpInst constructors. Who knew!? llvm-svn: 75200	2009-07-09 23:48:35 +00:00
Owen Anderson	29fd313e9e	A little bit more LLVMContextification. llvm-svn: 75159	2009-07-09 18:36:20 +00:00
Owen Anderson	a771459bb1	Push LLVMContext _back_ through IRBuilder. llvm-svn: 75040	2009-07-08 20:50:47 +00:00
Dan Gohman	7bb3173ff7	Tell ScalarEvolution to forget a loop before starting to delete it. This way ScalarEvolution can examine the loop to determine what state it needs to update, if it chooses. llvm-svn: 75029	2009-07-08 19:14:29 +00:00
Owen Anderson	b17f32945f	Switch GlobalVariable ctors to a sane API, where either a context or a module is required. llvm-svn: 75025	2009-07-08 19:03:57 +00:00
Nick Lewycky	a21d3daadc	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Owen Anderson	5948fdf68b	Push LLVMContext through GlobalVariables and IRBuilder. llvm-svn: 74985	2009-07-08 01:26:06 +00:00
Dan Gohman	af75234955	Change all SCEV* to SCEV *. llvm-svn: 74918	2009-07-07 17:06:11 +00:00
Owen Anderson	38264b1554	"LLVMContext* " --> "LLVMContext *" llvm-svn: 74878	2009-07-06 23:00:19 +00:00
Owen Anderson	f1f1743b2e	Finish LLVMContext-ing lib/Analysis. This required pushing LLVMContext's through the ValueTracking API. llvm-svn: 74873	2009-07-06 22:37:39 +00:00
Owen Anderson	39f00cc1d4	Thread LLVMContext through the constant folding APIs, which touches a lot of files. llvm-svn: 74844	2009-07-06 18:42:36 +00:00
Owen Anderson	e70b637033	More LLVMContext-ification. llvm-svn: 74807	2009-07-05 22:41:43 +00:00
Owen Anderson	340288c621	Even more passes being LLVMContext'd. llvm-svn: 74781	2009-07-03 19:42:02 +00:00
Owen Anderson	80baed63b4	Second batch of passes using LLVMContext. llvm-svn: 74753	2009-07-03 00:54:20 +00:00
Owen Anderson	b5618da226	Convert the first batch of passes to use LLVMContext. llvm-svn: 74748	2009-07-03 00:17:18 +00:00
Chris Lattner	f3f6aaa2c3	fix inverted logic pointed out by John McCall, noticed by inspection. This was considering vector intrinsics to have cost 2, but non-vector intrinsics to have cost 1, which is backward. llvm-svn: 74698	2009-07-02 15:39:39 +00:00
Dan Gohman	43f33dd550	Fix a bunch of other places that used operator[] to test whether a key is present in a std::map or DenseMap to use find instead. llvm-svn: 74676	2009-07-02 00:17:47 +00:00
Dan Gohman	cf092389a9	Request LCSSA after LoopSimplify. This fixes a problem in which the PassManager was scheduling LCSSA before LoopSimplify, which does not preserve LCSSA. llvm-svn: 74661	2009-07-01 23:21:38 +00:00
Dan Gohman	83348f80b6	Fix an instcombine abort on a scalar-to-vector bitcast. This fixes PR4487. llvm-svn: 74646	2009-07-01 21:38:46 +00:00
Dan Gohman	317f054531	Don't try to split a loop when the controlling icmp instruction doesn't have an IV-based operand. This fixes PR4471. llvm-svn: 74399	2009-06-27 22:58:27 +00:00
Dan Gohman	8918b481bf	More minor code simplifications. llvm-svn: 74395	2009-06-27 21:23:40 +00:00
Dan Gohman	fe174b6952	When a value is used multiple times within a single PHI, instructions inserted to replace that value must dominate all of of the basic blocks associated with the uses of the value in the PHI, not just one of them. llvm-svn: 74376	2009-06-27 05:16:57 +00:00
Dan Gohman	daafbe6168	Incorporate the insertion point into the key of SCEVExpander's CSE map. This helps it avoid reusing an instruction that doesn't dominate all of the users, in cases where the original instruction was inserted before all of the users were known. This may result in redundant expansions of sub-expressions that depend on loop-unpredictable values in some cases, however this isn't very common, and it primarily impacts IndVarSimplify, so GVN can be expected to clean these up. This eliminates the need for IndVarSimplify's FixUsesBeforeDefs, which fixes several bugs. llvm-svn: 74352	2009-06-26 22:53:46 +00:00
Owen Anderson	01ad6605c0	Constify this value. llvm-svn: 74330	2009-06-26 21:39:56 +00:00
Douglas Gregor	6d94e6a5f3	Fix linking of llvm-ld and lli with CMake, from Xerxes Rånby llvm-svn: 74285	2009-06-26 15:37:00 +00:00
Dan Gohman	ac3b5382b8	Change this code to a form about which VC++ reportedly isn't unhappy. llvm-svn: 74243	2009-06-26 00:35:12 +00:00
Dan Gohman	31167c61d5	Minor code simplification. llvm-svn: 74240	2009-06-26 00:26:03 +00:00
Dan Gohman	091e440568	Reword a few comments. llvm-svn: 74146	2009-06-25 00:22:44 +00:00
Dan Gohman	929fa7b0f4	When inserting code into a loop preheader, insert it before the terminator, instead of after the last phi. This fixes a bug exposed by ScalarEvolution analyzing more kinds of loops. This fixes PR4436. llvm-svn: 74072	2009-06-24 14:31:06 +00:00
Dan Gohman	f19aeec3f5	Extend ScalarEvolution's multiple-exit support to compute exact trip counts in more cases. Generalize ScalarEvolution's isLoopGuardedByCond code to recognize And and Or conditions, splitting the code out into an isNecessaryCond helper function so that it can evaluate Ands and Ors recursively, and make SCEVExpander be much more aggressive about hoisting instructions out of loops. test/CodeGen/X86/pr3495.ll has an additional instruction now, but it appears to be due to an arbitrary register allocation difference. llvm-svn: 74048	2009-06-24 01:18:18 +00:00
Dan Gohman	f522a4e034	Don't emit a redundant BitCastInst if the value to be defined in the preheader is already an instruction. llvm-svn: 74031	2009-06-24 00:28:59 +00:00
Dan Gohman	fd76113e28	Fix a few minor issues that were exposed by the removal of SCEVHandle. llvm-svn: 73910	2009-06-22 22:08:45 +00:00
Owen Anderson	65b6056e37	SCEVHandle is no more! llvm-svn: 73906	2009-06-22 21:39:50 +00:00
Dan Gohman	78ea89e161	Fix this code to correctly handle loops with multiple exits. Until now, this hasn't mattered, because ScalarEvolution hasn't been able to compute trip counts for loops with multiple exits. But it will soon. llvm-svn: 73864	2009-06-22 00:15:15 +00:00
Dan Gohman	860379bcc2	Rename a variable for consistency with the ExitBlock vs ExitingBlock terminology that LoopInfo uses. llvm-svn: 73863	2009-06-21 23:48:38 +00:00
Dan Gohman	724f825f96	Fix a typo in a comment that Frits von Bommel noticed. llvm-svn: 73796	2009-06-19 23:41:37 +00:00
Dan Gohman	cc31110b95	Re-apply r73718, now that the fix in r73787 is in, and add a hand-crafted testcase which demonstrates the bug that was exposed in 254.gap. llvm-svn: 73793	2009-06-19 23:23:27 +00:00
Dan Gohman	55e3dd9174	Fix LSR's OptimizeSMax to ignore max operators with more than 2 operands, which it isn't prepared to handle. llvm-svn: 73787	2009-06-19 23:03:46 +00:00
Evan Cheng	86076c9e30	Revert 73718. It's breaking 254.gap. llvm-svn: 73783	2009-06-19 21:15:06 +00:00
Chris Lattner	d0a363e03b	make jump threading handle lexically identical compare instructions as if they were multiple uses of the same instruction. This interacts well with the existing loadpre that j-t does to open up many new jump threads earlier. llvm-svn: 73768	2009-06-19 16:27:56 +00:00
Nick Lewycky	77585a24ac	Teach jump threading to look at comparisons between phi nodes and non-constants. llvm-svn: 73755	2009-06-19 04:56:29 +00:00
Chris Lattner	5ca4197829	Improve tail call elim to move loads above readonly calls when it allows forming a tail call. Patch by Frits van Bommel. This implements PR4323. llvm-svn: 73752	2009-06-19 04:22:16 +00:00
Chris Lattner	87a222c5c8	part of PR4405: disable a contentious optimization for strcmp -> memcmp when the lengths of the strings are unknown. Patch by Nick Lewycky! llvm-svn: 73751	2009-06-19 04:17:36 +00:00
Dan Gohman	8c9ac59455	Generalize LSR's OptimizeSMax to handle unsigned max tests as well as signed max tests. Along with r73717, this helps CodeGen avoid emitting code for a maximum operation for this class of loop. llvm-svn: 73718	2009-06-18 20:23:18 +00:00
Anton Korobeynikov	6ee547bb1b	Revert IRBuilder CC propagation. Fix SimplifyLibCalls instead. llvm-svn: 73715	2009-06-18 20:05:31 +00:00
Dan Gohman	a0348809b6	Remove the code from IVUsers that attempted to handle casted induction variables in cases where the cast isn't foldable. It ended up being a pessimization in many cases. This could be fixed, but it would require a bunch of complicated code in IVUsers' clients. The advantages of this approach aren't visible enough to justify it at this time. llvm-svn: 73706	2009-06-18 16:54:06 +00:00
Dan Gohman	56bd02c55c	Generalize the zext(trunc(t) & C) instcombine to work even with C is not a low-bits mask, and add a similar instcombine for zext((trunc(t) & C) ^ C). llvm-svn: 73705	2009-06-18 16:30:21 +00:00

... 19 20 21 22 23 ...

5368 Commits