to have a single return block (at least getting there) for optimizations. This
is general goodness but it would prevent some tailcall optimizations.
One specific case is code like this:
int f1(void);
int f2(void);
int f3(void);
int f4(void);
int f5(void);
int f6(void);
int foo(int x) {
switch(x) {
case 1: return f1();
case 2: return f2();
case 3: return f3();
case 4: return f4();
case 5: return f5();
case 6: return f6();
}
}
=>
LBB0_2: ## %sw.bb
callq _f1
popq %rbp
ret
LBB0_3: ## %sw.bb1
callq _f2
popq %rbp
ret
LBB0_4: ## %sw.bb3
callq _f3
popq %rbp
ret
This patch teaches codegenprep to duplicate returns when the return value
is a phi whose operands are produced by tail calls followed by
an unconditional branch:
sw.bb7: ; preds = %entry
%call8 = tail call i32 @f5() nounwind
br label %return
sw.bb9: ; preds = %entry
%call10 = tail call i32 @f6() nounwind
br label %return
return:
%retval.0 = phi i32 [ %call10, %sw.bb9 ], [ %call8, %sw.bb7 ], ... [ 0, %entry ]
ret i32 %retval.0
This allows codegen to generate better code like this:
LBB0_2: ## %sw.bb
jmp _f1 ## TAILCALL
LBB0_3: ## %sw.bb1
jmp _f2 ## TAILCALL
LBB0_4: ## %sw.bb3
jmp _f3 ## TAILCALL
rdar://9147433
llvm-svn: 127953
Optimize trivial branches in CodeGenPrepare, which often get created from the
lowering of objectsize intrinsics. Unfortunately, a number of tests were relying
on llc not optimizing trivial branches, so I had to add an option to allow them
to continue to test what they originally tested.
This fixes <rdar://problem/8785296> and <rdar://problem/9112893>.
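For context, a minimal sketch (my own illustration, not the commit's testcase) of how
such a trivial branch can arise: when __builtin_object_size cannot determine a size it
folds to (size_t)-1, so the guarding comparison becomes a constant and the branch is
trivially foldable.
#include <string.h>
/* Hedged illustration: with an unknown object size, __builtin_object_size
   folds to (size_t)-1, the comparison below becomes always-true, and the
   conditional branch degenerates into an unconditional one. */
void checked_copy(char *dst, const char *src, size_t n) {
  if (__builtin_object_size(dst, 0) >= n)
    memcpy(dst, src, n);
  else
    __builtin_trap();
}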
llvm-svn: 127498
lowering of objectsize intrinsics. Unfortunately, a number of tests were relying
on llc not optimizing trivial branches, so I had to add an option to allow them
to continue to test what they originally tested.
This fixes <rdar://problem/8785296> and <rdar://problem/9112893>.
llvm-svn: 127459
addressing code. On 403.gcc this almost halves CodeGenPrepare time and reduces
total llc time by 9.5%. Unfortunately, getNumUses() is still the hottest function
in llc.
llvm-svn: 126782
The basic issue is that isel (very reasonably!) expects conditional branches
to be folded, so CGP leaving around a bunch of dead computation feeding
conditional branches isn't such a good idea. Just fold branches on constants
into unconditional branches.
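As a rough sketch of the kind of cleanup involved (using the existing
ConstantFoldTerminator utility for illustration; the commit's actual code may differ):
#include "llvm/IR/BasicBlock.h"
#include "llvm/Transforms/Utils/Local.h"
using namespace llvm;
// A conditional branch whose condition is a constant (br i1 true/false, ...)
// is rewritten into an unconditional branch so isel never sees it.
static bool foldTrivialBranch(BasicBlock *BB) {
  return ConstantFoldTerminator(BB);
}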
llvm-svn: 123526
have objectsize folding recursively simplify away their result when it
folds. It is important to catch this here, because otherwise we won't
eliminate the cross-block values at isel and other times.
llvm-svn: 123524
they already do). This removes two dominator recomputations prior to isel,
which is a 1% improvement in total llc time for 403.gcc.
The only potentially suspect thing is making GCStrategy recompute dominators if
it used a custom lowering strategy.
llvm-svn: 123064
capacity on the Visited SmallPtrSet. On 403.gcc, this is about a 4.5% speedup of
CodeGenPrepare time (which itself is 10% of time spent in the backend).
This is progress towards PR8889.
llvm-svn: 122741
pipeline to be caught by instcombine, and it's not feasible to catch them in SimplifyCFG because the
use-lists are in an inconsistent state at the point where it could know that it needs to simplify them.
Instead, have CodeGenPrepare look for trivially redundant PHIs as part of its general cleanup effort.
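A minimal sketch of that cleanup (assuming today's PHINode helpers; not necessarily the
exact code this commit added):
#include "llvm/IR/Instructions.h"
using namespace llvm;
// A PHI whose incoming values all collapse to a single value, e.g.
//   %p = phi i32 [ %x, %a ], [ %x, %b ]
// is trivially redundant and can be replaced by that value.
static bool zapRedundantPHI(PHINode *PN) {
  if (Value *V = PN->hasConstantValue())
    if (V != PN) {            // guard against degenerate self-referential PHIs
      PN->replaceAllUsesWith(V);
      PN->eraseFromParent();
      return true;
    }
  return false;
}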
llvm-svn: 122516
which have trapping constant exprs in them due to PHI nodes.
Eliminating them can cause the constant expr to be evaluated
on new paths if the input edges are critical.
llvm-svn: 122164
by my recent GVN improvement. Rather than looking through a single layer of
PHI nodes when attempting to sink GEPs, we need to iteratively
look through arbitrary PHI nests.
llvm-svn: 120202
must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize
the pass's dependencies.
Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the
CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h
before parsing commandline arguments.
I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems
with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass
registration/creation, please send the testcase to me directly.
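As a hedged sketch of what such a client might do (header and function names here follow
the current tree and are assumptions, not quotes from this commit):
#include "llvm/InitializePasses.h"
#include "llvm/PassRegistry.h"
#include "llvm/Support/CommandLine.h"
// Initialize the pass group (and, transitively, its static dependencies)
// before parsing command-line options so the per-pass flags are registered.
int main(int argc, char **argv) {
  llvm::PassRegistry &Registry = *llvm::PassRegistry::getPassRegistry();
  llvm::initializeCodeGen(Registry);
  llvm::cl::ParseCommandLineOptions(argc, argv, "example tool\n");
  return 0;
}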
llvm-svn: 116820
This reverts revision 114633. It was breaking llvm-gcc-i386-linux-selfhost.
It seems there is a downstream bug that is exposed by
-cgp-critical-edge-splitting=0. When that bug is fixed, this patch can go back
in.
Note that the changes to tailcallfp2.ll are not reverted. They were good and are
required.
llvm-svn: 114859
truncates are free only in the case where the extended type is legal but the
load type is not. If both types are illegal, such as when they are too big,
the load may not be legalized into an extended load.
llvm-svn: 114568
load when the type of the load is not legal, even if truncates are not free.
The load is going to be legalized to an extending load anyway.
llvm-svn: 114488
walking the asm arguments once and stashing their Values. This is
wrong because the same memory location can be in the list twice, and
if the first one has a sunkaddr substituted, the stashed value for the
second one will be wrong (use-after-free). PR 8154.
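A minimal illustration of the problematic shape (my own sketch, not the PR's testcase):
the same memory location shows up twice in the constraint list, so a Value stashed for
the second operand goes stale once the first operand's address is rewritten.
/* Both operands name the same memory location; per-operand processing must
   re-read the call's arguments rather than trust values cached up front. */
void touch_twice(int *p) {
  asm volatile("" : : "m"(*p), "m"(*p));
}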
llvm-svn: 114104
for an "i" constraint should get lowered; PR 6309. While
this argument was passed around a lot, this is the only
place it was used, so it goes away from a lot of other
places.
llvm-svn: 106893
with a fix for self-hosting
rotate CallInst operands, i.e. move callee to the back
of the operand array
the motivation for this patch is laid out in my mail to llvm-commits:
more efficient access to operands and callee, faster callgraph-construction,
smaller compiler binary
llvm-svn: 101465
generate wrong code pretty much anywhere AFAICT.
A testcase that hits the bug reproducibly proved impossible to construct,
but the situation was like this:
Addr = ...
Store -> Addr
Addr2 = GEP , 0, 0
Store -> Addr2
Handling the first store, the code replaced Addr
with a sunkaddr and deleted Addr, but not its table
entry. Code in OptimizedBlock replaced Addr2 with a
bitcast; if that happened to reuse the memory of Addr,
the old table entry was erroneously found when handling
the second store.
llvm-svn: 100044
and add a doxygen comment.
Cache the phi entry to avoid doing tons of
PHINode::getBasicBlockIndex calls in the common case.
On my insane testcase from re2c, this speeds up CGP from
617.4s to 7.9s (78x).
llvm-svn: 96083
than the scaled register. This makes it more likely that subsequent
AddrModeMatcher queries will match the new address the same way as the
old, instead of accidentally matching what had been the base register
as the new scaled register, and then failing to match the scaled register.
This fixes some problems with address-mode sinking multiple muls into a
block, which will be a lot more common with some upcoming
LoopStrengthReduction changes.
llvm-svn: 93935
in the Ada testcase. Reverting this only covers up
the real problem, which is a nasty conceptual difficulty
in the phi elimination pass: when eliminating phi nodes
in landing pads, the register copies need to come before
the invoke, not at the end of the basic block which is
too late... See PR3784.
llvm-svn: 66826
There is now a direct way from value-use-iterator to incoming block in PHINode's API.
This way we avoid the iterator->index->iterator trip, and especially the costly
getOperandNo() invocation. Additionally there is now an assertion that the iterator
really refers to one of the PHI's Uses.
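In today's spelling that API looks roughly like this (a sketch, assuming the modern
PHINode::getIncomingBlock(const Use &) overload is the one being described):
#include "llvm/IR/Instructions.h"
using namespace llvm;
// Map a Use of a PHI directly to its incoming block, with no
// iterator -> operand index -> iterator round trip.
static BasicBlock *incomingBlockFor(const Use &U) {
  auto *PN = cast<PHINode>(U.getUser());
  return PN->getIncomingBlock(U);
}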
llvm-svn: 62869
- Use SplitBlockPredecessors to factor out common predecessors of the critical edge destination. This is disabled for now due to some regressions.
llvm-svn: 61248
wrappers around the interesting code and use an obscure iterator
abstraction that dates back many many years.
Move EraseDeadInstructions to Transforms/Utils and name it
RecursivelyDeleteTriviallyDeadInstructions.
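A small usage sketch of the renamed helper (assuming its present-day home in
llvm/Transforms/Utils/Local.h):
#include "llvm/Transforms/Utils/Local.h"
using namespace llvm;
// Delete I if nothing uses it and it has no side effects, then recursively
// clean up any operands that become trivially dead as a result.
static void dropDeadTree(Instruction *I) {
  if (isInstructionTriviallyDead(I))
    RecursivelyDeleteTriviallyDeadInstructions(I);
}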
llvm-svn: 60191
performance in most cases on the Grawp tester, but does speed some
things up (like shootout/hash by 15%). This also doesn't impact
compile time in a noticeable way on the Grawp tester.
It also, of course, gets the testcase it was designed for right :)
llvm-svn: 60120
heuristic: the value is already live at the new memory operation if
it is used by some other instruction in the memop's block. This is
cheap and simple to compute (more so than full liveness).
This improves the new heuristic even more. For example, it cuts two
out of three new instructions out of 255.vortex:DbmFileInGrpHdr,
which is one of the functions that the heuristic regressed. This
overall eliminates another 40 instructions from 403.gcc and visibly
reduces register pressure in 255.vortex (though this only actually
ends up saving the 2 instructions from the whole program).
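Stated as code, the cheap test is roughly this (a sketch of the heuristic as described,
not the pass's actual implementation):
#include "llvm/IR/Instructions.h"
using namespace llvm;
// The value is considered "already live" at the memory operation if some
// other instruction in the memop's block uses it, so sinking the address
// computation there cannot add register pressure on its account.
static bool isAlreadyLiveAtMemOp(Value *V, Instruction *MemOp) {
  for (User *U : V->users())
    if (auto *I = dyn_cast<Instruction>(U))
      if (I != MemOp && I->getParent() == MemOp->getParent())
        return true;
  return false;
}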
llvm-svn: 60084
phrased in terms of liveness instead of as a horrible hack. :)
In practice, this doesn't change the generated code for either
255.vortex or 403.gcc, but it could cause minor code changes in
theory. This is framework for coming changes.
llvm-svn: 60082
-enable-smarter-addr-folding to llc) that gives CGP a better
cost model for when to sink computations into addressing modes.
The basic observation is that sinking increases register
pressure when part of the addr computation has to be available
for other reasons, such as having a use that is a non-memory
operation. In cases where it works, it can substantially reduce
register pressure.
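A small source-level illustration of that situation (my own example, not from the
commit): the address computation below has a non-memory use, so it stays live whether
or not it is also folded into the load's addressing mode.
int f(int *p, long i, int *q) {
  int *a = p + i;   /* address computation */
  if (a == q)       /* non-memory use keeps 'a' live anyway */
    return 0;
  return *a;        /* candidate for addr-mode sinking */
}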
This code is currently an overall win on 403.gcc and 255.vortex
(the two things I've been looking at), but there are several
things I want to do before enabling it by default:
1. This isn't doing any caching of results, so it is much slower
than it could be. It currently slows down release-asserts llc
by 1.7% on 176.gcc: 27.12s -> 27.60s.
2. This doesn't think about inline asm memory operands yet.
3. The cost model botches the case when the needed value is live
across the computation for other reasons.
I'll continue poking at this, and eventually turn it on as llcbeta.
llvm-svn: 60074
optimize addressing modes. This allows us to optimize things like isel-sink2.ll
into:
movl 4(%esp), %eax
cmpb $0, 4(%eax)
jne LBB1_2 ## F
LBB1_1: ## TB
movl $4, %eax
ret
LBB1_2: ## F
movzbl 7(%eax), %eax
ret
instead of:
_test:
movl 4(%esp), %eax
cmpb $0, 4(%eax)
leal 4(%eax), %eax
jne LBB1_2 ## F
LBB1_1: ## TB
movl $4, %eax
ret
LBB1_2: ## F
movzbl 3(%eax), %eax
ret
This shrinks (e.g.) 403.gcc from 1133510 to 1128345 lines of .s.
Note that the 2008-10-16-SpillerBug.ll testcase is dubious at best, I doubt
it is really testing what it thinks it is.
llvm-svn: 60068
can recursively match things) and scales by 0 by ignoring them.
This triggers once in 403.gcc, saving 1 (!!!!) instruction in the
whole huge app.
llvm-svn: 60013
into a new AddressingModeMatcher class. This makes it easier
to reason about and reduces passing around of stuff, but has
no functionality change.
llvm-svn: 60012
of apint codegen failure is the DAG combiner doing
the wrong thing because it was comparing MVT's using
< rather than comparing the number of bits. Removing
the < method makes this mistake impossible to commit.
Instead, add helper methods for comparing bits and use
them.
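In today's API the bit-based helpers look like this (a sketch; names assumed from the
current MVT/EVT interface rather than quoted from the commit):
#include "llvm/CodeGen/ValueTypes.h"
using namespace llvm;
// Compare value types by how many bits they occupy, never by enum order.
static bool strictlyNarrower(EVT A, EVT B) {
  return A.bitsLT(B);   // equivalent to comparing getSizeInBits()
}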
llvm-svn: 52098
and better control the abstraction. Rename the type
to MVT. To update out-of-tree patches, the main
thing to do is to rename MVT::ValueType to MVT, and
rewrite expressions like MVT::getSizeInBits(VT) in
the form VT.getSizeInBits(). Use VT.getSimpleVT()
to extract a MVT::SimpleValueType for use in switch
statements (you will get an assert failure if VT is
an extended value type - these shouldn't exist after
type legalization).
This results in a small speedup of codegen and no
new testsuite failures (x86-64 linux).
llvm-svn: 52044
When choosing between constraints with multiple options,
like "ir", test to see if we can use the 'i' constraint and
go with that if possible. This produces more optimal ASM in
all cases (sparing a register and an instruction to load it),
and fixes inline asm like this:
void test () {
asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14));
}
Previously we would dump "42" into a memory location (which
is ok for the 'm' constraint) which would cause a problem
because the 'c' modifier is not valid on memory operands.
Isn't it great how inline asm turns 'missed optimization'
into 'compile failed'??
Incidentally, this was the todo in
PowerPC/2007-04-24-InlineAsm-I-Modifier.ll
Please do NOT pull this into Tak.
llvm-svn: 50315
to the block that defines their operands. This doesn't work in the
case that the operand is an invoke, because invoke is a terminator
and must be the last instruction in a block.
Replace it with support in SelectionDAGISel for copying struct values
into sequences of virtual registers.
llvm-svn: 50279
it is only a partial fix. This change is noise for most programs, but
speeds up Shootout-C++/matrix by 20%, Ptrdist/ks by 24%, smg2000 by 8%,
hexxagon by 9%, bzip2 by 9% (not sure I trust this), ackerman by 13%, etc.
OTOH, it slows down Shootout/fib2 by 40% (I'll update PR1877 with this info).
llvm-svn: 45354