llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	e82b087ae6	only IPSCCP incoming arguments if the function is executable, this fixes an assertion on the buildbot. llvm-svn: 85784	2009-11-02 03:25:55 +00:00
Chris Lattner	9e97fbe114	add a new ValueState::getConstantInt() helper, use it to simplify some code. llvm-svn: 85783	2009-11-02 03:21:36 +00:00
Chris Lattner	7ccf1a6df6	tidy up some more: remove some extraneous inline specifiers, return harder. llvm-svn: 85780	2009-11-02 03:03:42 +00:00
Chris Lattner	b5a13d4c90	eliminate the SCCPSolver::getValueMapping method. llvm-svn: 85778	2009-11-02 02:54:24 +00:00
Chris Lattner	c49ae9912a	fix failures introduced in r85774 llvm-svn: 85777	2009-11-02 02:48:17 +00:00
Chris Lattner	e405ed9651	factor duplicated code into a new DeleteInstructionInBlock function, eliminate temporary (and pointless) smallvector. llvm-svn: 85776	2009-11-02 02:47:51 +00:00
Chris Lattner	a3c39d394d	Chris used to use '...' instead of proper grammar. llvm-svn: 85775	2009-11-02 02:33:50 +00:00
Chris Lattner	6df5cec72f	remove some extraneous llvmcontext stuff. llvm-svn: 85774	2009-11-02 02:30:06 +00:00
Chris Lattner	efdd2bbce6	change LatticeVal to use PointerIntPair to save some space. llvm-svn: 85773	2009-11-02 02:20:32 +00:00
Chris Lattner	3cd6a61b27	fix instcombine to only do store sinking when the alignments of the two loads agree. Propagate that onto the new store. llvm-svn: 85772	2009-11-02 02:06:37 +00:00
Chris Lattner	328ef89bd1	when merging two loads, make sure to take the min of their alignment, not the max. This didn't matter until the previous patch because instcombine would refuse to sink loads with differenting alignments. llvm-svn: 85738	2009-11-01 20:07:07 +00:00
Chris Lattner	2a249e267a	split load sinking out to its own function, like gep sinking. llvm-svn: 85737	2009-11-01 20:04:24 +00:00
Chris Lattner	0b40a8bc0e	fix a bug noticed by inspection: when instcombine sinks loads through phis, it didn't preserve the alignment of the load. This is a missed optimization of the alignment is high and a miscompilation when the alignment is low. llvm-svn: 85736	2009-11-01 19:50:13 +00:00
Chris Lattner	b5d9c8c708	cleanups, switch GlobalDCE to SmallPtrSet instead of std::set llvm-svn: 85730	2009-11-01 19:03:42 +00:00
Chris Lattner	37536b90e1	remove a bunch of locking from LLVMContextImpl. Since only one thread can be banging on a context at a time, this isn't needed. Owen, please review. llvm-svn: 85728	2009-11-01 18:42:03 +00:00
Chris Lattner	249f96e339	improve comment. llvm-svn: 85725	2009-11-01 18:17:37 +00:00
Douglas Gregor	291f6145b8	Reverting 85714, 85715, 85716, which are breaking the build llvm-svn: 85717	2009-11-01 16:42:53 +00:00
Dan Gohman	576ac96367	Remove the #include of Pass.h from PassManager.h. This breaks a significant #include dependency, as frontends commonly pull in PassManager.h. llvm-svn: 85714	2009-11-01 15:20:19 +00:00
Chris Lattner	1a8b80ed5a	teach ipsccp and ipconstprop that a blockaddress doesn't 'take the address' of a function in a way that should prevent ip constprop. This allows clang/test/CodeGen/indirect-goto.c to pass with the new indirect goto lowering. llvm-svn: 85709	2009-11-01 06:11:53 +00:00
Chris Lattner	a1dc101f66	change llvm::MergeBlockIntoPredecessor to not merge two blocks BB1->BB2 when BB2 has its address taken. Since it ends up doing BB2->rauw(BB1), this can cause the address of the entry block to be taken. Since it is generally undesirable to nuke blocks whose address is taken, even when we can, just unconditionally stop this xform. llvm-svn: 85708	2009-11-01 04:57:33 +00:00
Chris Lattner	746139b736	strengthen an assumption: RevectorBlockTo knows that PredBB ended in an uncond branch because the pass requires BreakCriticalEdges. However, BCE doesn't eliminate critical adges from indbrs. llvm-svn: 85707	2009-11-01 04:23:20 +00:00
Chris Lattner	7a8db3a41a	if CostMetrics says to never duplicate some code, don't unswitch a loop. This prevents unswitching from duplicating indbr's. llvm-svn: 85705	2009-11-01 03:42:55 +00:00
Chris Lattner	54a4b84012	constant fold indirectbr(blockaddress(%bb)) -> br label %bb. llvm-svn: 85704	2009-11-01 03:40:38 +00:00
Chris Lattner	aa99c94e2a	Revert 85678/85680. The decision is to stay with the current form of indirectbr, thus we don't need "blockaddr(@func, null)". Eliminate it for simplicity. llvm-svn: 85699	2009-11-01 01:27:45 +00:00
Chris Lattner	a546dcf418	Make sure PRE doesn't split crit edges from indirectbr. llvm-svn: 85692	2009-10-31 22:11:15 +00:00
Chris Lattner	c872b09676	llvm::SplitEdge should refuse to split an edge from an indirectbr. Fix CodeGenPrepare to not try to split edges from indirectbr. llvm-svn: 85690	2009-10-31 22:04:43 +00:00
Chris Lattner	ba364b0a9a	update the comment above llvm::SplitCriticalEdge, and make it abort on IndirectBrInst as describe in the comment. llvm-svn: 85688	2009-10-31 21:51:10 +00:00
Chris Lattner	3c89c53f35	adjust a couple xforms to work with null bb's in BlockAddress. llvm-svn: 85680	2009-10-31 20:13:24 +00:00
Chris Lattner	a742b8f94f	add a comment. llvm-svn: 85671	2009-10-31 17:48:31 +00:00
Dan Gohman	2d02ff8cbb	Revert r85667. LoopUnroll currently can't call utility functions which auto-update the DominatorTree because it doesn't keep the DominatorTree current while it works. llvm-svn: 85670	2009-10-31 17:33:01 +00:00
Dan Gohman	144694bcb7	Remove redundant code. llvm-svn: 85668	2009-10-31 16:16:41 +00:00
Dan Gohman	041e2dbad1	Merge the enhancements from LoopUnroll's FoldBlockIntoPredecessor into MergeBlockIntoPredecessor. This makes SimplifyCFG slightly more aggressive, and makes it unnecessary for LoopUnroll to have its own copy of this code. llvm-svn: 85667	2009-10-31 16:08:00 +00:00
Dan Gohman	880c92ac1c	Rename forgetLoopBackedgeTakenCount to forgetLoop, because it clears out more information than just the stored backedge taken count. llvm-svn: 85664	2009-10-31 15:04:55 +00:00
Dan Gohman	969e83a4ff	Replace LoopUnrollPass.cpp's custom code-size estimation code using the new common CodeMetrics code. llvm-svn: 85663	2009-10-31 14:54:17 +00:00
Dan Gohman	fa8969f70e	Simplify this code. llvm-svn: 85662	2009-10-31 14:46:50 +00:00
Dan Gohman	af94015c18	Remove an unnecessary #include. llvm-svn: 85661	2009-10-31 14:39:43 +00:00
Dan Gohman	f35b6640f6	Update CMakeLists for recent renames. llvm-svn: 85660	2009-10-31 14:38:25 +00:00
Dan Gohman	f70e76c435	Rename UnrollLoop.cpp to LoopUnroll.cpp, and LoopUnroll.cpp to LoopUnrollPass.cpp, for consistency with other passes which are similarly split. llvm-svn: 85659	2009-10-31 14:37:31 +00:00
Dan Gohman	fb7f0e57b6	Remove CodeGenLICM. It's largely obsoleted by MachineLICM's new ability to unfold loop-invariant loads. llvm-svn: 85657	2009-10-31 14:35:41 +00:00
Dan Gohman	930aa9d3d2	Reapply r85634, with the bug fixed. llvm-svn: 85655	2009-10-31 14:22:52 +00:00
Evan Cheng	c16d8f2054	Revert 85634. It's breaking consumer-typeset (and others). llvm-svn: 85641	2009-10-31 01:28:06 +00:00
Dan Gohman	7f7d97eb73	Add a comment about a missed opportunity. llvm-svn: 85635	2009-10-30 23:15:43 +00:00
Dan Gohman	5bec30ca5d	Optimize around the fact that pred_iterator is slow: instead of sorting PHI operands by the predecessor order, sort them by the order used by the first PHI in the block. This is still suffucient to expose duplicates. llvm-svn: 85634	2009-10-30 23:15:21 +00:00
Dan Gohman	1a95106602	Teach SimplifyCFG how to eliminate duplicate PHI nodes within a block. This reduces codesize on a variety of codes by 1-2% on x86-64. It also helps clean up after SSAUpdater. llvm-svn: 85626	2009-10-30 22:39:04 +00:00
Dan Gohman	13e41edc71	Sort the incoming values in PHI nodes to match the predecessor order. This helps expose duplicate PHIs, which will make it easier for them to be eliminated. llvm-svn: 85623	2009-10-30 22:22:22 +00:00
Evan Cheng	5a6b9c40d6	Add option to createGVNPass to disable PRE. llvm-svn: 85609	2009-10-30 20:12:24 +00:00
Nick Lewycky	b43a43a8fd	Apply some cleanups. No functionality changes. llvm-svn: 85498	2009-10-29 07:35:15 +00:00
Chris Lattner	312748848f	just for the hell of it, allow globalopt to statically evaluate static constructors with indirect gotos :) llvm-svn: 85495	2009-10-29 05:51:50 +00:00
Chris Lattner	ee8b951e73	teach various passes about blockaddress. We no longer crash on any clang tests. llvm-svn: 85465	2009-10-29 01:21:20 +00:00
Chris Lattner	be060382e9	teach ValueMapper about BlockAddress', making bugpoint a lot more useful. llvm-svn: 85458	2009-10-29 00:31:02 +00:00
Chris Lattner	cf5a47d63d	unindent massive blocks, no functionality change. llvm-svn: 85457	2009-10-29 00:28:30 +00:00
Victor Hernandez	0d025421cd	Extend getMallocArraySize() to determine the array size if the malloc argument is: ArraySize * ElementSize ElementSize * ArraySize ArraySize << log2(ElementSize) ElementSize << log2(ArraySize) Refactor isArrayMallocHelper and delete isSafeToGetMallocArraySize, so that there is only 1 copy of the malloc array determining logic. Update users of getMallocArraySize() to not bother calling isArrayMalloc() as well. llvm-svn: 85421	2009-10-28 20:18:55 +00:00
Devang Patel	ffd561bc2d	llvm.dbg.global_variables do not exist anymore. llvm-svn: 85402	2009-10-28 16:51:52 +00:00
Edward O'Callaghan	1042ca112f	No newline at end of file. llvm-svn: 85390	2009-10-28 15:04:53 +00:00
Benjamin Kramer	ecc60b80b0	Update CMake file. llvm-svn: 85389	2009-10-28 13:29:18 +00:00
Owen Anderson	2b2bd28973	Treat lifetime begin/end markers as allocations/frees respectively for the purposes for GVN/DSE. llvm-svn: 85383	2009-10-28 07:05:35 +00:00
Nick Lewycky	175308c43e	Add ABCD, a generalized implementation of the Elimination of Array Bounds Checks on Demand algorithm which looks at arbitrary branches instead of loop iterations. This is GSoC work by Andre Tavares with only editorial changes applied! llvm-svn: 85382	2009-10-28 07:03:15 +00:00
Chris Lattner	a91a563530	Previously, all operands to Constant were themselves constant. In the new world order, BlockAddress can have a BasicBlock operand. This doesn't permute much, because if you have a ConstantExpr (or anything more specific than Constant) we still know the operand has to be a Constant. llvm-svn: 85375	2009-10-28 05:14:34 +00:00
Devang Patel	11cf3f4a27	Factor out redundancy from clone() implementations. llvm-svn: 85327	2009-10-27 22:16:29 +00:00
Victor Hernandez	f390e04a47	Rename MallocFreeHelper as MemoryBuiltins llvm-svn: 85286	2009-10-27 20:05:49 +00:00
Chris Lattner	c6b3b25f94	Fix a pretty serious misfeature of the inliner: if it inlines a function with multiple return values it inserts a PHI to merge them all together. However, if the return values are all the same, it ends up with a pointless PHI and this pointless PHI happens to really block SRoA from happening in at least a silly C++ example written by Doug, but probably others. This fixes rdar://7339069. llvm-svn: 85206	2009-10-27 05:39:41 +00:00
Mike Stump	2b0a49a682	VS build fix, patch by Marius Wachtler. llvm-svn: 85197	2009-10-27 02:14:13 +00:00
Eric Christopher	7a50b280c1	Add objectsize intrinsic and hook it up through codegen. Doesn't do anything than return "I don't know" at the moment. llvm-svn: 85189	2009-10-27 00:52:25 +00:00
Dan Gohman	f808106bbe	Add braces to avoid ambiguous else. llvm-svn: 85185	2009-10-27 00:11:02 +00:00
Victor Hernandez	762195bd01	Rename MallocHelper as MallocFreeHelper, since it now also identifies calls to free() llvm-svn: 85181	2009-10-26 23:58:56 +00:00
Owen Anderson	03b5de67b0	Add a straight-forward implementation of SCCVN for aggressively eliminating scalar redundancies. llvm-svn: 85179	2009-10-26 23:55:47 +00:00
Victor Hernandez	de5ad42aa1	Remove FreeInst. Remove LowerAllocations pass. Update some more passes to treate free calls just like they were treating FreeInst. llvm-svn: 85176	2009-10-26 23:43:48 +00:00
Dan Gohman	34e38afa96	Simplify this code. LoopDeletion doesn't need to explicit check that the loop exiting block dominates the latch block; if ScalarEvolution can prove that the trip-count is finite, that's sufficient. llvm-svn: 85165	2009-10-26 22:18:58 +00:00
Dan Gohman	672927f393	Code that checks WillNotOverflowSignedAdd before creating an Add can safely use the NSW bit on the Add. llvm-svn: 85164	2009-10-26 22:14:22 +00:00
Ted Kremenek	ce8f626f82	Update CMake files. llvm-svn: 85161	2009-10-26 22:06:01 +00:00
Dan Gohman	6a1d9eace9	Check in the experimental GEP splitter pass. This pass splits complex GEPs (more than one non-zero index) into simple GEPs (at most one non-zero index). In some simple experiments using this it's not uncommon to see 3% overall code size wins, because it exposes redundancies that can be eliminated, however it's tricky to use because instcombine aggressively undoes the work that this pass does. llvm-svn: 85144	2009-10-26 19:12:14 +00:00
Dan Gohman	6a10d5ebd3	Fix a typo in a comment. llvm-svn: 85120	2009-10-26 15:55:24 +00:00
Chris Lattner	683eed3286	reapply r85085 with a bugfix to avoid infinite looping. All of the 'demorgan' related xforms need to use dyn_castNotVal, not m_Not. llvm-svn: 85119	2009-10-26 15:40:07 +00:00
Dan Gohman	d632f89596	Make LSR's OptimizeShadowIV ignore induction variables with negative strides for now, because it doesn't handle them correctly. This fixes a miscompile of SingleSource/Benchmarks/Misc-C++/ray. This problem was usually hidden because indvars transforms such induction variables into negations of canonical induction variables. llvm-svn: 85118	2009-10-26 15:32:57 +00:00
Evan Cheng	8014a728b9	Revert 85085. It causes infinite looping during llvm-gcc build. llvm-svn: 85090	2009-10-26 03:51:32 +00:00
Chris Lattner	2e6564d6ff	Implement PR3266 & PR5276, folding: not (or (icmp, icmp)) -> and(icmp, icmp) llvm-svn: 85085	2009-10-26 01:06:31 +00:00
Nick Lewycky	974e12b2d3	Remove includes of Support/Compiler.h that are no longer needed after the VISIBILITY_HIDDEN removal. llvm-svn: 85043	2009-10-25 06:57:41 +00:00
Nick Lewycky	02d5f77d26	Remove VISIBILITY_HIDDEN from class/struct found inside anonymous namespaces. Chris claims we should never have visibility_hidden inside any .cpp file but that's still not true even after this commit. llvm-svn: 85042	2009-10-25 06:33:48 +00:00
Nick Lewycky	54d7179a25	Remove ICmpInst::isSignedPredicate which was a reimplementation CmpInst::isSigned. llvm-svn: 85037	2009-10-25 05:20:17 +00:00
Dan Gohman	ef41a1ce3c	MapValue doesn't needs its LLVMContext argument. llvm-svn: 85020	2009-10-24 23:37:16 +00:00
Dan Gohman	8f4078ba39	Rename isLoopExit to isLoopExiting, for consistency with the wording used elsewhere - an exit block is a block outside the loop branched to from within the loop. An exiting block is a block inside the loop that branches out. llvm-svn: 85019	2009-10-24 23:34:26 +00:00
Dan Gohman	b979794e4b	Rewrite LoopRotation's SSA updating code using SSAUpdater. llvm-svn: 85016	2009-10-24 23:19:52 +00:00
Victor Hernandez	e297149e26	Auto-upgrade free instructions to calls to the builtin free function. Update all analysis passes and transforms to treat free calls just like FreeInst. Remove RaiseAllocations and all its tests since FreeInst no longer needs to be raised. llvm-svn: 84987	2009-10-24 04:23:03 +00:00
Victor Hernandez	8acf2956b8	Remove AllocationInst. Since MallocInst went away, AllocaInst is the only subclass of AllocationInst, so it no longer is necessary. llvm-svn: 84969	2009-10-23 21:09:37 +00:00
Dan Gohman	41d00ac45b	Make LoopDeletion check the maximum backedge taken count, rather than the exact backedge taken count, when checking for infinite loops. This allows it to delete loops with multiple exit conditions. llvm-svn: 84952	2009-10-23 17:10:01 +00:00
Chris Lattner	cf7e8947e9	move another load optimization from instcombine -> libanalysis. llvm-svn: 84841	2009-10-22 06:44:07 +00:00
Chris Lattner	51d2f70e32	move 'loading i32 from string' optimization from instcombine to libanalysis. Instcombine shrinking... does this even make sense??? llvm-svn: 84840	2009-10-22 06:38:35 +00:00
Chris Lattner	1664a4fd86	Move some constant folding logic for loads out of instcombine into Analysis/ConstantFolding.cpp. This doesn't change the behavior of instcombine but makes other clients of ConstantFoldInstruction able to handle loads. This was partially extracted from Eli's patch in PR3152. llvm-svn: 84836	2009-10-22 06:25:11 +00:00
Chris Lattner	c7a962d3b3	fix PR5262. llvm-svn: 84810	2009-10-22 00:17:26 +00:00
Devang Patel	27e0be274e	Derive metadata hierarchy from Value instead of User. llvm-svn: 84801	2009-10-21 23:57:35 +00:00
Chris Lattner	966526cbfb	revert r84754, it isn't the right approach. Edwin, please propose patches for fixes like this instead of committing them directly. llvm-svn: 84799	2009-10-21 23:41:58 +00:00
Victor Hernandez	be9e179104	Make changes to rev 84292 as requested by Chris Lattner. Most changes are cleanup, but there is 1 correctness fix: I fixed InstCombine so that the icmp is removed only if the malloc call is removed (which requires explicit removal because the Worklist won't DCE any calls since they can have side-effects). llvm-svn: 84772	2009-10-21 19:11:40 +00:00
Torok Edwin	1539a352a6	Fix PR5262: when folding select into PHI, make sure all operands are available in the PHI's Basic Block. This uses a conservative approach, because we don't have dominator info in instcombine. llvm-svn: 84754	2009-10-21 10:49:00 +00:00
Chris Lattner	8ed7bef409	make GVN work better when TD is not around: "In the existing code, if the load and the value to replace it with are of different types and target data is available, it tries to use the target data to coerce the replacement value to the type of the load. Otherwise, it skips all effort to handle the type mismatch and just feeds the wrongly-typed replacement value to replaceAllUsesWith, which triggers an assertion. The patch replaces it with an outer if checking for type mismatch, and an inner if-else that checks whether target data is available and, if not, returns false rather than trying to replace the load." Patch by Kenneth Uildriks! llvm-svn: 84739	2009-10-21 04:11:19 +00:00
Devang Patel	1d7f7d21dc	Do not remove dead metadata for now. llvm-svn: 84731	2009-10-21 02:21:34 +00:00
Chris Lattner	7f903681ac	alternate fix for PR5258 which avoids worklist problems, with reduced testcase. llvm-svn: 84667	2009-10-20 20:27:49 +00:00
Dan Gohman	b6b8ec769c	Restore LoopUnswitch's block-oriented threshold. LoopUnswitch now checks both the estimated code size and the number of blocks when deciding whether to do a non-trivial unswitch. This protects it from some very undesirable worst-case behavior on large numbers of loop-unswitchable conditions, such as in the testcase in PR5259. llvm-svn: 84661	2009-10-20 20:06:09 +00:00
Torok Edwin	cf10ec951d	Fix PR5258, jump-threading creating invalid PHIs. When an incoming value for a PHI is updated, we must also updated all other incoming values for the same BB to match, otherwise we create invalid PHIs. llvm-svn: 84638	2009-10-20 15:42:00 +00:00
Torok Edwin	729d92bd74	Fix PR4313: IPSCCP was not setting the lattice value for the invoke instruction when the invoke had multiple return values: it set the lattice value only on the extractvalue. This caused the invoke's lattice value to remain the default (undefined), and later propagated to extractvalue's operand, which incorrectly introduces undefined behavior. llvm-svn: 84637	2009-10-20 15:15:09 +00:00
Owen Anderson	168ad6985e	Refactor lookup_or_add to contain _MUCH_ less duplicated code. Add support for numbering first class aggregate instructions while we're at it. llvm-svn: 84547	2009-10-19 22:14:22 +00:00
Victor Hernandez	5c704d505c	Malloc calls are marked NoAlias, so the code below the isMalloc() check makes it redundant. Removing the isMalloc() check. llvm-svn: 84541	2009-10-19 21:47:22 +00:00
Owen Anderson	1059b5b32d	Simplify some code. llvm-svn: 84533	2009-10-19 21:14:57 +00:00
Dan Gohman	8f986672a1	Fix SplitBlockPredecessors' LoopInfo updating code to handle the case where a loop's header is being split and it has predecessors which are not contained by the most-nested loop which contains the loop. This fixes PR5235. llvm-svn: 84505	2009-10-19 16:04:50 +00:00
Dan Gohman	511d2e26dd	Change instnamer to name arguments "arg" instead of "tmp" for clarity, and to name basic blocks "bb" instead of "BB", for consistency. llvm-svn: 84502	2009-10-19 14:47:32 +00:00
Chris Lattner	1fa98f0d74	remove the IndMemRemPass, which only made sense for when malloc/free were intrinsic instructions. llvm-svn: 84404	2009-10-18 05:02:09 +00:00
Daniel Dunbar	8eff29d805	Use raw_ostream::write_escaped instead of EscapeString. llvm-svn: 84356	2009-10-17 20:43:19 +00:00
Chris Lattner	88b36f1140	Simplify some code (first hunk) and fix PR5208 (second hunk) by updating the callgraph when introducing a call. llvm-svn: 84310	2009-10-17 05:39:39 +00:00
Victor Hernandez	a3aaf85e23	Remove MallocInst from LLVM Instructions. llvm-svn: 84299	2009-10-17 01:18:07 +00:00
Victor Hernandez	c7d6a8327c	Autoupgrade malloc insts to malloc calls. Update testcases that rely on malloc insts being present. Also prematurely remove MallocInst handling from IndMemRemoval and RaiseAllocations to help pass tests in this incremental step. llvm-svn: 84292	2009-10-17 00:00:19 +00:00
Victor Hernandez	264da3274e	HeapAllocSRoA also needs to check if malloc array size can be computed. llvm-svn: 84288	2009-10-16 23:12:25 +00:00
Dan Gohman	99429a00ff	Move zext and sext casts fed by loads into the same block as the load, to help SelectionDAG fold them into the loads, unless conditions are unfavorable. llvm-svn: 84271	2009-10-16 20:59:35 +00:00
Duncan Sands	0058c7bcb0	Strip trailing white space. llvm-svn: 84256	2009-10-16 15:20:13 +00:00
Victor Hernandez	13020b1faf	Fix bug where array malloc with unexpected computation of the size argument resulted in MallocHelper identifying the malloc as a non-array malloc. This broke GlobalOpt's optimization of stores of mallocs to global variables. The fix is to classify malloc's into 3 categories: 1. non-array mallocs 2. array mallocs whose array size can be determined 3. mallocs that cannot be determined to be of type 1 or 2 and cannot be optimized getMallocArraySize() returns NULL for category 3, and all users of this function must avoid their malloc optimization if this function returns NULL. Eventually, currently unexpected codegen for computing the malloc's size argument will be supported in isArrayMalloc() and getMallocArraySize(), extending malloc optimizations to those examples. llvm-svn: 84199	2009-10-15 20:14:52 +00:00
Chris Lattner	c855b45b78	only try to fold constantexpr operands when the worklist is first populated, don't bother every time going around the main worklist. This speeds up a release-asserts opt -std-compile-opts on 403.gcc by about 4% (1.5s). It seems to speed up the most expensive instances of instcombine by ~10%. llvm-svn: 84171	2009-10-15 04:59:28 +00:00
Chris Lattner	dd1f68a10c	don't bother calling ConstantFoldInstruction unless there is a use of the instruction (which disqualifies stores, unreachable, etc) and at least the first operand is a constant. This filters out a lot of obvious cases that can't be folded. Also, switch the IRBuilder to a TargetFolder, which tries harder. llvm-svn: 84170	2009-10-15 04:13:44 +00:00
Devang Patel	92f8619923	Use isVoidTy() llvm-svn: 84118	2009-10-14 17:29:00 +00:00
Chris Lattner	6b9044db01	make instcombine's instruction sinking more aggressive in the presence of PHI nodes. llvm-svn: 84103	2009-10-14 15:21:58 +00:00
Devang Patel	a677136900	Check void type before using RAUWd. llvm-svn: 84049	2009-10-13 22:56:32 +00:00
Devang Patel	115741ba79	Do not check use_empty() before replaceAllUsesWith(). This gives ValueHandles a chance to get properly updated. llvm-svn: 84033	2009-10-13 21:41:20 +00:00
Dan Gohman	2dc6f8de03	Use the new CodeMetrics class to compute code size instead of manually counting instructions. llvm-svn: 84016	2009-10-13 20:12:23 +00:00
Ted Kremenek	113d959f1b	Update CMake file. llvm-svn: 84001	2009-10-13 18:48:07 +00:00
Dan Gohman	54463e837a	Commit the removal of this file, which is now moved to lib/Analysis. llvm-svn: 83999	2009-10-13 18:37:20 +00:00
Dan Gohman	4552e3cd73	Move the InlineCost code from Transforms/Utils to Analysis. llvm-svn: 83998	2009-10-13 18:30:07 +00:00
Dan Gohman	5b3e05bcaa	Start refactoring the inline cost estimation code so that it can be used for purposes other than inlining. llvm-svn: 83997	2009-10-13 18:24:11 +00:00
Chris Lattner	19788ca686	change simplifycfg to not duplicate 'unwind' instructions. Hopefully this will increase the likelihood of common code getting sunk towards the unwind. llvm-svn: 83996	2009-10-13 18:13:05 +00:00
Dan Gohman	71ca652475	Make LoopUnswitch's cost estimation count Instructions, rather than BasicBlocks, so that it doesn't blindly procede in the presence of large individual BasicBlocks. This addresses a class of code-size expansion problems. llvm-svn: 83992	2009-10-13 17:50:43 +00:00
Evan Cheng	f815861591	Make licm debug message readable. llvm-svn: 83908	2009-10-12 22:25:23 +00:00
Dale Johannesen	4c9f0e8f53	Fix warning. llvm-svn: 83870	2009-10-12 18:45:32 +00:00
Chris Lattner	8abd572dae	populate instcombine's initial worklist more carefully, causing it to visit instructions from the start of the function to the end of the function in the first path. This greatly speeds up some pathological cases (e.g. PR5150). Try #3, this time with some unneeded debug info stuff removed which was causing dead pointers to be added to the worklist. llvm-svn: 83818	2009-10-12 03:58:40 +00:00
Chris Lattner	8ce6b36c86	revert r83814 for now, it is making the llvm-gcc bootstrap unhappy. llvm-svn: 83817	2009-10-11 23:56:08 +00:00
Chris Lattner	78d6310429	populate instcombine's initial worklist more carefully, causing it to visit instructions from the start of the function to the end of the function in the first path. This greatly speeds up some pathological cases (e.g. PR5150). llvm-svn: 83814	2009-10-11 23:17:43 +00:00
Chris Lattner	2c2deae5ac	remove some harmful code that would turn an insertelement on an undef into a shuffle even if it was used by another insertelement. If the visitation order of instcombine was wrong, this would turn a chain of insertelements into a chain of shufflevectors, which was quite painful. Since CollectShuffleElements handles these cases, the code can just be nuked. llvm-svn: 83810	2009-10-11 23:02:46 +00:00
Chris Lattner	c6cdbfbfdd	teach instcombine to simplify xor's harder, catching the new testcase. llvm-svn: 83799	2009-10-11 22:22:13 +00:00
Chris Lattner	6e6ac47125	cleanups llvm-svn: 83797	2009-10-11 22:00:32 +00:00
Chris Lattner	1639234775	cleanup, no functionality change. llvm-svn: 83795	2009-10-11 21:36:10 +00:00
Chris Lattner	fd27f8a5b3	generalize a transformation even more: we don't care whether the input the the mul is a zext from bool, just that it is all zeros other than the low bit. This fixes some phase ordering issues that would cause us to miss some xforms in mul.ll when the worklist is visited differently. llvm-svn: 83794	2009-10-11 21:29:45 +00:00
Chris Lattner	406cb75c6b	simplify a transformation by making it more general. llvm-svn: 83792	2009-10-11 21:22:21 +00:00
Chris Lattner	f39f4f928a	temporarily revert previous patch llvm-svn: 83791	2009-10-11 21:05:34 +00:00
Chris Lattner	bb058d3a23	populate instcombine's initial worklist more carefully, causing it to visit instructions from the start of the function to the end of the function in the first path. This greatly speeds up some pathological cases (e.g. PR5150). llvm-svn: 83790	2009-10-11 21:04:37 +00:00
Torok Edwin	8b3081350e	Remove CleanupDbgInfo, instcombine does this and its not worth duplicating it here. llvm-svn: 83789	2009-10-11 19:58:35 +00:00
Torok Edwin	907ec36943	LICM shouldn't sink/delete debug information. Fix this and add a testcase. For now the metadata of sinked/hoisted instructions is still wrong, but that'll be fixed when instructions will have debug metadata directly attached. llvm-svn: 83786	2009-10-11 19:15:54 +00:00
Chris Lattner	85c85c5e04	when folding duplicate conditions, delete the now-probably-dead instruction tree feeding it. llvm-svn: 83778	2009-10-11 18:39:58 +00:00
Chris Lattner	e374382b8f	implement rdar://7293527, a trivial instcombine that llvm-gcc gets but clang doesn't, because it is implemented in GCC's fold routine. llvm-svn: 83761	2009-10-11 07:53:15 +00:00
Chris Lattner	97b1405207	implement a transformation in jump threading that is currently done by condprop, but do it in a much more general form. The basic idea is that we can do a limited form of tail duplication in the case when we have a branch on a phi. Moving the branch up in to the predecessor block makes instruction selection much easier and encourages chained jump threadings. llvm-svn: 83759	2009-10-11 07:24:57 +00:00
Chris Lattner	6ce85e85f5	restructure some code, no functionality change. llvm-svn: 83756	2009-10-11 04:40:21 +00:00
Chris Lattner	f466bc84c9	factor some code better and move a function, no functionality change. llvm-svn: 83755	2009-10-11 04:33:43 +00:00
Chris Lattner	f99a74e24b	make jump threading on a phi with undef inputs happen. llvm-svn: 83754	2009-10-11 04:18:15 +00:00
Chris Lattner	71d353dd48	rewrite LCSSA to use SSAUpdate, to only return true if it modifies the IR, and to implement the FIXME'd optimization. llvm-svn: 83748	2009-10-11 02:53:37 +00:00
Chris Lattner	101dde30ed	clean up and simplify some code. Don't use setvector when things will be inserted only once, just use vector. Don't compute ExitBlocks unless we need it, change std::sort to array_pod_sort. llvm-svn: 83747	2009-10-11 01:07:15 +00:00
Chris Lattner	b6c65faa64	switch GVN to use SSAUpdater. Besides removing a lot of complexity from GVN, this also speeds it up, inserts fewer PHI nodes (see the testcase) and allows it to remove more loads (due to fewer PHI nodes standing in the way). llvm-svn: 83746	2009-10-10 23:50:30 +00:00
Chris Lattner	9c382cebc5	add a simple helper method. llvm-svn: 83745	2009-10-10 23:41:48 +00:00
Chris Lattner	249265de06	add ability for clients of SSAUpdater to find out about the PHI nodes inserted. llvm-svn: 83744	2009-10-10 23:15:24 +00:00
Chris Lattner	89d2a5c4f3	remove dead code llvm-svn: 83742	2009-10-10 23:04:12 +00:00
Chris Lattner	67cdd8b567	add the ability to get a rewritten value from the middle of a block, not just at the end. Add a big comment explaining when this could be useful (which never happens for jump threading). llvm-svn: 83741	2009-10-10 23:00:11 +00:00
Chris Lattner	e474a8d3a7	rename GetValueInBlock -> GetValueAtEndOfBlock to better reflect what it does. llvm-svn: 83740	2009-10-10 22:41:58 +00:00
Chris Lattner	65e69a77e1	use a typedef instead of spelling out an insane type. Yay for auto someday. llvm-svn: 83707	2009-10-10 09:09:20 +00:00
Chris Lattner	84095071ea	Change jump threading to use the new SSAUpdater class instead of DemoteRegToStack. This makes it more efficient (because it isn't creating a ton of load/stores that are eventually removed by a later mem2reg), and more slightly more effective (because those load/stores don't get in the way of threading). llvm-svn: 83706	2009-10-10 09:05:58 +00:00
Chris Lattner	60d4e69c81	Implement an efficient and fully general SSA update mechanism that works on unstructured CFGs. This implements PR217, our oldest open PR. llvm-svn: 83705	2009-10-10 09:04:27 +00:00
Chris Lattner	f30a2b0c86	random tidying llvm-svn: 83701	2009-10-10 06:22:45 +00:00
Dale Johannesen	96a5b87ae2	Use names instead of numbers for some of the magic constants used in inlining heuristics (especially those used in more than one file). No functional change. llvm-svn: 83675	2009-10-09 21:42:02 +00:00
Dale Johannesen	3059924bdd	When considering whether to inline Callee into Caller, and that will make Caller too big to inline, see if it might be better to inline Caller into its callers instead. This situation is described in PR 2973, although I haven't tried the specific case in SPASS. llvm-svn: 83602	2009-10-09 00:11:32 +00:00
Dan Gohman	09984279fd	Add a form of addPreserved which takes a string argument, to allow passes to declare that they preserve other passes without needing to pull in additional header file or library dependencies. Convert MachineFunctionPass and CodeGenLICM to make use of this. llvm-svn: 83555	2009-10-08 17:00:02 +00:00
Jeffrey Yasskin	dafd08ea7e	In instcombine's debug output, avoid printing ADD for instructions that are already on the worklist, and print Visited when an instruction is about to be visited. Net, on one input, this reduced the output size by at least 9x. llvm-svn: 83510	2009-10-08 00:12:24 +00:00
Eric Christopher	5b741f3d14	80-column and whitespace fixes. llvm-svn: 83489	2009-10-07 21:14:25 +00:00
Eric Christopher	e666bc9f64	Add FreeInst to the "is a call" check for Insts that are calls, but not intrinsics. llvm-svn: 83441	2009-10-07 00:54:08 +00:00
Eric Christopher	6ba26317ce	While we still have a MallocInst treat it as a call like any other for inlining. When MallocInst goes away this code will be subsumed as part of calls and work just fine... llvm-svn: 83434	2009-10-07 00:02:18 +00:00
Ted Kremenek	2275a7dfef	Update CMake file. llvm-svn: 83404	2009-10-06 19:45:38 +00:00
Chris Lattner	a893f5bdf5	remove predicate simplifier, it never got the last bugs beaten out of it, and jump threading, condprop and gvn are now getting most of the benefit. This was approved by Nicholas and Nicolas. llvm-svn: 83390	2009-10-06 16:59:46 +00:00
Duncan Sands	9ed7b16bf3	Introduce and use convenience methods for getting pointer types where the element is of a basic builtin type. For example, to get an i8* use getInt8PtrTy. llvm-svn: 83379	2009-10-06 15:40:36 +00:00
Dan Gohman	e525d9ddc0	Remove an unnnecessary LLVMContext argument in ConstantFoldLoadThroughGEPConstantExpr. llvm-svn: 83311	2009-10-05 16:36:26 +00:00
Dan Gohman	238cf49812	Use Use::operator= instead of Use::set, for consistency. llvm-svn: 83310	2009-10-05 16:31:55 +00:00
Chris Lattner	fdd8790718	strength reduce a ton of type equality tests to check the typeid (Through the new predicates I added) instead of going through a context and doing a pointer comparison. Besides being cheaper, this allows a smart compiler to turn the if sequence into a switch. llvm-svn: 83297	2009-10-05 05:54:46 +00:00
Chris Lattner	463716d559	instcombine shouldn't delete all null checks for mallocs. This fixes PR5130. llvm-svn: 83290	2009-10-05 02:47:47 +00:00
Owen Anderson	b5049bebb3	Do away with the strange use of BitVectors in SSI, and just use normal sets. This makes the code much more C++/LLVM-ish. llvm-svn: 83286	2009-10-04 18:49:55 +00:00
Owen Anderson	286feb16a9	Fix a typo in the comment. llvm-svn: 83283	2009-10-04 17:52:13 +00:00
Owen Anderson	a62bf10651	SSI needs to require DT and DF transitively, since it uses them outside of its runOnFunction. Similarly, it can be marked setPreservesAll, since it does no work in its runOnFunction. llvm-svn: 83282	2009-10-04 17:47:39 +00:00
Evan Cheng	bb4ed2394b	Allow -inline-threshold override default threshold even if compiling to optimize for size. llvm-svn: 83274	2009-10-04 06:13:54 +00:00
Douglas Gregor	d846fbf20d	Remove GVNPRE.cpp from the CMake makefile llvm-svn: 83194	2009-10-01 05:30:05 +00:00
Chris Lattner	5f3cc06cd2	remove the GVNPRE pass. It has been subsumed by the GVN pass. Ok'd by Owen. llvm-svn: 83193	2009-10-01 02:18:36 +00:00
Dan Gohman	ea0bb8f555	Fix this code so that it doesn't try to iterate through a std::vector while calling changeImmediateDominator, which removes elements from the vector. This fixes PR5097. llvm-svn: 83166	2009-09-30 20:54:16 +00:00
Dan Gohman	7d3b0be05b	Remove a redundant #ifndef and add an assertion string. llvm-svn: 82991	2009-09-28 14:38:19 +00:00
Dan Gohman	9a7320c711	Convert LoopSimplify and LoopExtractor from FunctionPass to LoopPass. llvm-svn: 82990	2009-09-28 14:37:51 +00:00
Chris Lattner	0261b5d2d2	The select instruction is not neccesarily in the same block as the phi nodes. Make sure to phi translate from the right block. This fixes a llvm-building-llvm failure on GVN-PRE.cpp llvm-svn: 82970	2009-09-28 06:49:44 +00:00
Chris Lattner	4425660b1f	simplify some code. llvm-svn: 82936	2009-09-27 21:46:50 +00:00
Chris Lattner	b2e88cd01c	The bitcast case is not needed here: instcombine turns icmp(bitcast(x), null) -> icmp(x, null) already. llvm-svn: 82935	2009-09-27 21:42:46 +00:00
Chris Lattner	8b4d3dfbbf	calls are already unmovable, malloc doesn't need a special case. llvm-svn: 82933	2009-09-27 21:36:19 +00:00
Chris Lattner	f9e0c7f84b	calls to external functions are already marked overdefined, special casing malloc isn't needed. llvm-svn: 82932	2009-09-27 21:35:11 +00:00
Chris Lattner	5abb1e4cd2	calls are already handled, malloc doesn't need a special case. llvm-svn: 82931	2009-09-27 21:33:46 +00:00
Chris Lattner	466d57f6c1	calls are rejected above, no need to special case malloc here. llvm-svn: 82929	2009-09-27 21:31:39 +00:00
Chris Lattner	43d0db70ac	remove special handling of bitcast(malloc), it will be handled when the loop inspects the bitcast operand. llvm-svn: 82928	2009-09-27 21:29:28 +00:00
Chris Lattner	a8627272c1	unlike the malloc instruction, "malloc" calls do not claim to be readonly, just nounwind. llvm-svn: 82927	2009-09-27 21:23:38 +00:00
Chris Lattner	b391e87263	allow pushing icmps through phis with multiple uses and across critical edges. These are important to push up to encourage jump threading. This shrinks 176.gcc a bit. llvm-svn: 82923	2009-09-27 20:46:36 +00:00
Chris Lattner	ae289632ef	Enhance the previous fix for PR4895 to allow more values than just simple constants for the true/false value of the select. We now do phi translation etc. This really fixes PR4895 :) llvm-svn: 82917	2009-09-27 20:18:49 +00:00
Chris Lattner	facb867af3	implement PR4895, by making FoldOpIntoPhi handle select conditions that are phi nodes. Also tighten up FoldOpIntoPhi to treat constantexpr operands to phis just like other variables, avoiding moving constantexpr computations around. Patch by Daniel Dunbar. llvm-svn: 82913	2009-09-27 19:57:57 +00:00
Dan Gohman	0e70af36c0	Grab an LLVM Context from an instruction that exists rather than one that is deleted in some situations. This fixes a use-after-free. llvm-svn: 82903	2009-09-27 16:10:30 +00:00
Dan Gohman	fc20b67e80	Tell ScalarEvolution to forget everything it knows about a loop before rotating the loop, since loop rotation is a very significant change. llvm-svn: 82901	2009-09-27 15:37:03 +00:00
Nick Lewycky	42fb7452df	Instruction::clone does not need to take an LLVMContext&. Remove that and update all the callers. llvm-svn: 82889	2009-09-27 07:38:41 +00:00
Dan Gohman	62995c71a2	Fix SimplifyLibCalls to transfer attributes from callees rather than calls, since direct calls don't always reflect the attributes of their callees. llvm-svn: 82867	2009-09-26 18:10:13 +00:00
Dan Gohman	394468dc8e	Rename ConstantFP's getInf to getInfinity. llvm-svn: 82823	2009-09-25 23:40:21 +00:00
Dan Gohman	5ffd53892d	Transform pow(x, 0.5) to (x == -inf ? inf : fabs(sqrt(x))), which is typically faster then doing a general pow. llvm-svn: 82819	2009-09-25 23:10:17 +00:00
Torok Edwin	21bd8c9fc5	Constant propagating byval pointer is safe if function is readonly. llvm-svn: 82700	2009-09-24 18:33:42 +00:00
Torok Edwin	f95a450ef9	Don't constant propagate byval pointers, since they are not really pointers, but rather structs passed by value. This fixes PR5038. llvm-svn: 82689	2009-09-24 09:47:18 +00:00
Dale Johannesen	fb1b55bc9c	A minor improvment in accuracy to inline cost computation, and some cosmetics. llvm-svn: 82660	2009-09-23 22:05:24 +00:00
Chris Lattner	e3ce1e2a37	tidy up llvm-svn: 82488	2009-09-21 22:26:02 +00:00
Chris Lattner	247053867e	big endian systems shift by bits too, hopefully this will fix the ppc bootstrap problems. llvm-svn: 82464	2009-09-21 17:55:47 +00:00
Dan Gohman	43d6830ea0	Nick pointed out that DominanceFrontier and DominanceTree are preserved by setPreservesCFG(). llvm-svn: 82463	2009-09-21 17:54:42 +00:00
Dan Gohman	af57ae3da4	Remove the special-case for constants in PHI nodes; it's not really helpful, and it didn't correctly handle the case of constants input to PHIs for backedges. llvm-svn: 82462	2009-09-21 17:53:35 +00:00
Chris Lattner	9045f235d2	fix PR5016, a crash I introduced in GVN handing first class arrays and structs, which cannot be bitcast to integers. llvm-svn: 82460	2009-09-21 17:24:04 +00:00
Chris Lattner	4d8af2f1ae	enable non-local analysis and PRE of large store -> little load. This doesn't kick in too much because of phi translation issues, but this can be resolved in the future. llvm-svn: 82447	2009-09-21 06:48:08 +00:00
Chris Lattner	0cdc17eb50	convert an std::pair to an explicit struct. llvm-svn: 82446	2009-09-21 06:30:24 +00:00
Chris Lattner	d28f90897a	move some functions, add a comment. llvm-svn: 82444	2009-09-21 06:24:16 +00:00
Chris Lattner	9d7fb29522	split HandleLoadFromClobberingStore in two pieces: one that does the analysis, one that does the xform. llvm-svn: 82443	2009-09-21 06:22:46 +00:00
Chris Lattner	0a9616d906	Improve GVN to be able to forward substitute a small load from a piece of a large store when both are in the same block. This allows clang to compile the testcase in PR4216 to this code: _test_bitfield: movl 4(%esp), %eax movl %eax, %ecx andl $-65536, %ecx orl $32962, %eax andl $40186, %eax orl %ecx, %eax ret This is not ideal, but is a whole lot better than the code produced by llvm-gcc: _test_bitfield: movw $-32574, %ax orw 4(%esp), %ax andw $-25350, %ax movw %ax, 4(%esp) movw 7(%esp), %cx shlw $8, %cx movzbl 6(%esp), %edx orw %cx, %dx movzwl %dx, %ecx shll $16, %ecx movzwl %ax, %eax orl %ecx, %eax ret and dramatically better than that produced by gcc 4.2: _test_bitfield: pushl %ebx call L3 "L00000000001$pb": L3: popl %ebx movl 8(%esp), %eax leal 0(,%eax,4), %edx sarb $7, %dl movl %eax, %ecx andl $7168, %ecx andl $-7201, %ebx movzbl %dl, %edx andl $1, %edx sall $5, %edx orl %ecx, %ebx orl %edx, %ebx andl $24, %eax andl $-58336, %ebx orl %eax, %ebx orl $32962, %ebx movl %ebx, %eax popl %ebx ret llvm-svn: 82439	2009-09-21 05:57:11 +00:00
Chris Lattner	1eefa9c427	formatting cleanups, no functionality change. llvm-svn: 82426	2009-09-21 02:42:51 +00:00
Chris Lattner	a0aa8fb6a6	Move CoerceAvailableValueToLoadType earlier in GVN.cpp. Hook it up so that nonlocal and partially redundant loads can use it as well. The testcase shows examples of craziness this can handle. This triggers many times in 176.gcc. llvm-svn: 82403	2009-09-20 20:09:34 +00:00
Chris Lattner	7c62d8a1a8	change the interface to CoerceAvailableValueToLoadType to be more generic. llvm-svn: 82402	2009-09-20 19:31:14 +00:00
Chris Lattner	1dd48c34e5	enhance GVN to forward substitute a stored value to a load (and load -> load) when the base pointers must alias but when they are different types. This occurs very very frequently in 176.gcc and other code that uses bitfields a lot. llvm-svn: 82399	2009-09-20 19:03:47 +00:00
Daniel Dunbar	7d6781b0fe	Tabs -> spaces, and remove trailing whitespace. llvm-svn: 82355	2009-09-20 02:20:51 +00:00
Nick Lewycky	1303c0ab86	Remove the default value for ConstantStruct::get's isPacked parameter and update the code which was broken by this. llvm-svn: 82327	2009-09-19 20:30:26 +00:00
Victor Hernandez	5d034499ad	Enhance transform passes so that they apply the same tranforms to malloc calls as to MallocInst. Reviewed by Dan Gohman. llvm-svn: 82300	2009-09-18 22:35:49 +00:00
Victor Hernandez	788eaabd18	Update malloc call creation code (AllocType is now the element type of the malloc, not the resulting type). In getMallocArraySize(), fix bug in the case that array size is the product of 2 constants. Extend isArrayMalloc() and getMallocArraySize() to handle case where malloc is used as char array. Ensure that ArraySize in LowerAllocations::runOnBasicBlock() is correct type. Extend Instruction::isSafeToSpeculativelyExecute() to handle malloc calls. Add verification for malloc calls. Reviewed by Dan Gohman. llvm-svn: 82257	2009-09-18 19:20:02 +00:00
Daniel Dunbar	487d1c8138	Update CMake. llvm-svn: 82097	2009-09-17 00:06:48 +00:00
Dan Gohman	0f64d71d99	Add a new pass for doing late hoisting of floating-point and vector constants out of loops. These aren't covered by the regular LICM pass, because in LLVM IR constants don't require separate instructions. They're not always covered by the MachineLICM pass either, because it doesn't know how to unfold folded constant-pool loads. This is somewhat experimental at this point, and off by default. llvm-svn: 82076	2009-09-16 20:25:11 +00:00
Dan Gohman	bd0050810c	Change FoldPHIArgBinOpIntoPHI to decline folding if it would introduce two phis, similar to the FoldPHIArgGEPIntoPHI change. Also, delete some comments that don't reflect the code. llvm-svn: 82053	2009-09-16 16:50:24 +00:00
Andreas Neustifter	41c1103273	Reapplied r81355 with the problems fixed. (See http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20090907/086737.html and http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20090907/086746.html) llvm-svn: 82039	2009-09-16 11:35:50 +00:00
Andreas Neustifter	f8cb758ba8	Preserve ProfileInfo during CodeGenPrepare. llvm-svn: 82034	2009-09-16 09:26:52 +00:00
Dan Gohman	3b7ce109ec	Don't sink gep operators through phi nodes if the result would require more than one phi, since that leads to higher register pressure on entry to the phi. This is especially problematic when the phi is in a loop header, as it increases register pressure throughout the loop. llvm-svn: 81993	2009-09-16 02:01:52 +00:00
Nick Lewycky	7465cd769c	Add more newlines to make up for the ones removed from the end of instructions. llvm-svn: 81851	2009-09-15 07:08:25 +00:00
Chris Lattner	e0987215f0	add a new CallGraphNode::replaceCallEdge method and use it from argpromote to avoid invalidating an iterator. This fixes PR4977. All clang tests now pass with expensive checking (on my system at least). llvm-svn: 81843	2009-09-15 05:40:35 +00:00
Chris Lattner	e9a4992399	add newline to debug dump llvm-svn: 81840	2009-09-15 05:14:57 +00:00
Dan Gohman	f9eafce3af	When extending a memset range past the front, set the alignment of the memset region to the alignment of the new start address. llvm-svn: 81810	2009-09-14 23:39:10 +00:00
Dan Gohman	7190d48075	Factor out the code for checking that all indices in a getelementptr are within the notional bounds of the static type of the getelementptr (which is not the same as "inbounds") from GlobalOpt into a utility routine, and use it in ConstantFold.cpp to check whether there are any mis-behaved indices. llvm-svn: 81478	2009-09-10 23:37:55 +00:00
Nick Lewycky	dddf5dcdaf	Correctly handle the case where a comparison is created in one BasicBlock and used by a terminator in another. llvm-svn: 81437	2009-09-10 07:02:09 +00:00
Evan Cheng	1d9d4bdc99	Add malloc call utility functions. Patch by Victor Hernandez. llvm-svn: 81426	2009-09-10 04:36:43 +00:00
Dan Gohman	ec4557f324	Fix SplitCriticalEdge to properly update LCSSA form when splitting a loop exit edge -- new PHIs may be needed not only for the additional splits that are made to preserve LoopSimplify form, but also for the original split. Factor out the code that inserts new PHIs so that it can be used for both. Remove LoopRotation.cpp's code for manually updating LCSSA form, as it is now redundant. This fixes PR4934. llvm-svn: 81363	2009-09-09 18:18:18 +00:00
Mike Stump	deaf572ca8	Reflow comment. llvm-svn: 81361	2009-09-09 17:57:16 +00:00
Andreas Neustifter	4c0b2847ef	Preserve ProfileInfo. llvm-svn: 81360	2009-09-09 17:53:39 +00:00
Dan Gohman	c56af25c01	Fix an 80-column violation. llvm-svn: 81354	2009-09-09 17:17:19 +00:00
Chris Lattner	9ded9ac8af	revert r81335, which breaks the build. llvm-svn: 81347	2009-09-09 16:00:57 +00:00
Andreas Neustifter	0bd472dc33	Updated ProfileInfo to have clean seperation between different sentinels. llvm-svn: 81335	2009-09-09 12:48:26 +00:00
Owen Anderson	f0081db7e8	Fix PR4909, patch by Jakub Staszak. llvm-svn: 81250	2009-09-08 19:53:15 +00:00
Chris Lattner	9ce1781ef4	remove an extremely dubious instcombine transformation of extractelement(load). llvm-svn: 81239	2009-09-08 18:48:01 +00:00
Dan Gohman	3ddbc242fb	Re-apply r80926, with fixes: keep the domtree informed of new blocks that get created during loop unswitching, and fix SplitBlockPredecessors' LCSSA updating code to create new PHIs instead of trying to just move existing ones. Also, optimize Loop::verifyLoop, since it gets called a lot. Use searches on a sorted list of blocks instead of calling the "contains" function, as is done in other places in the Loop class, since "contains" does a linear search. Also, don't call verifyLoop from LoopSimplify or LCSSA, as the PassManager is already calling verifyLoop as part of LoopInfo's verifyAnalysis. llvm-svn: 81221	2009-09-08 15:45:00 +00:00
Chris Lattner	d1b21c6092	remove a turd llvm-svn: 81186	2009-09-08 03:47:41 +00:00
Chris Lattner	d3210e1a20	instcombine transforms vector loads that are only used by extractelement operations into a bitcast of the pointer, then a gep, then a scalar load. Disable this when the vector only has one element, because it leads to infinite loops in instcombine (PR4908). This transformation seems like a really bad idea to me, as it will likely disable CSE of vector load/stores etc and can be better done in the code generator when profitable. This goes all the way back to the first days of packed types, r25299 specifically. I'll let those people who care about the performance of vector code decide what to do with this. llvm-svn: 81185	2009-09-08 03:44:51 +00:00
Chris Lattner	f2ab40a46f	Fix PR4882, by making MemCpyOpt not dereference removed stores to get the context for the newly created operations. Patch by Jakub Staszak! llvm-svn: 81175	2009-09-08 00:27:14 +00:00
Dan Gohman	1b84908f92	Reappy r80998, now that the GlobalOpt bug that it exposed on MiniSAT is fixed. llvm-svn: 81172	2009-09-07 23:54:19 +00:00
Dan Gohman	161429fe7e	Don't commit stores with addresses that have indices that are not compile-time constant integers or that are out of bounds for their corresponding static array types. These can cause aliasing that GlobalOpt assumes won't happen. llvm-svn: 81165	2009-09-07 22:44:55 +00:00
Dan Gohman	82e747580f	Don't commit addresses of aggregate values. This avoids problems with an aggregate store overlapping a different aggregate store, despite the stores having distinct addresses. llvm-svn: 81164	2009-09-07 22:42:05 +00:00
Dan Gohman	beee35a277	Fix GlobalOpt to avoid committing a store if the address getelementptr is missing the inbounds flag. This is slightly conservative, but it avoids problems with two constants pointing to the same address but getting distinct entries in the Memory DenseMap. llvm-svn: 81163	2009-09-07 22:40:13 +00:00
Dan Gohman	19244eaa4a	Preserve the InBounds flag when evaluating a getelementptr instruction into a getelementptr ConstantExpr. llvm-svn: 81162	2009-09-07 22:34:43 +00:00
Dan Gohman	f7f3fb1133	Simplify this code by using hasDefinitiveInitializer(). llvm-svn: 81161	2009-09-07 22:31:26 +00:00
Eric Christopher	66d8555f7e	Fix comment. llvm-svn: 81138	2009-09-06 22:20:54 +00:00
Duncan Sands	89720bbd11	Remove some not-really-used variables, as warned about by icc (#593, partial). Patch by Erick Tryzelaar. llvm-svn: 81115	2009-09-06 12:41:19 +00:00
Daniel Dunbar	86c6a6ef0f	Fix a possible crash call setIsInBounds. - I think there are more instances of this, but I think they are fixed in Dan's incoming patch. This one was preventing me from doing a bugpoint reduction though. llvm-svn: 81103	2009-09-06 02:31:36 +00:00
Evan Cheng	904199547b	Revert r80926. It causes loop unswitch assertion and slow down some JIT tests significantly. llvm-svn: 81101	2009-09-06 02:26:10 +00:00
Daniel Dunbar	10ea8bb8e0	Revert "Include optional subclass flags, such as inbounds, nsw, etc., ...", this breaks MiniSAT on x86_64. llvm-svn: 81098	2009-09-06 00:11:24 +00:00
Andreas Neustifter	18156bd75c	Converted MaximumSpanningTree algorithm to a generic template, this could go into llvm/ADT. llvm-svn: 81001	2009-09-04 12:34:44 +00:00
Dan Gohman	0c2477c26b	Include optional subclass flags, such as inbounds, nsw, etc., in the Constant uniquing tables. This allows distinct ConstantExpr objects with the same operation and different flags. Even though a ConstantExpr "a + b" is either always overflowing or never overflowing (due to being a ConstantExpr), it's still necessary to be able to represent it both with and without overflow flags at the same time within the IR, because the safety of the flag may depend on the context of the use. If the constant really does overflow, it wouldn't ever be safe to use with the flag set, however the use may be in code that is never actually executed. This also makes it possible to merge all the flags tests into a single test. llvm-svn: 80998	2009-09-04 12:08:11 +00:00
Dan Gohman	4c1bdcf5d7	Add a verifyAnalysis to LoopInfo, LoopSimplify, and LCSSA form that verify that these passes are properly preserved. Fix several transformation passes that claimed to preserve LoopSimplify form but weren't. llvm-svn: 80926	2009-09-03 16:31:42 +00:00
Dan Gohman	22571485b3	Change PHINode::hasConstantValue to have a DominatorTree argument instead of a bool argument, and to do the dominator check itself. This makes it eaiser to use when DominatorTree information is available. llvm-svn: 80920	2009-09-03 15:34:35 +00:00
Duncan Sands	0edc7100ba	Keep track of how many memmove calls were turned into memcpy calls. llvm-svn: 80915	2009-09-03 13:37:16 +00:00
Andreas Neustifter	7e86c3856b	Code Cleanup. Removed inverted flag form MaximumSpanningTree, also do not handle so much information to MaximumSpanningTree. llvm-svn: 80911	2009-09-03 08:52:52 +00:00
Nick Lewycky	88214fbd12	Remove VISIBILITY_HIDDEN from this file. llvm-svn: 80903	2009-09-03 06:43:15 +00:00
Chris Lattner	27266f164f	In C++, code is not allowed to call main. In C it is, this simplifylibcalls optimization is thus valid for C++ but not C. It's not important enough to worry about for C++ apps, so just remove it. rdar://7191924 llvm-svn: 80887	2009-09-03 05:19:59 +00:00
Gabor Greif	2d60e1ec0c	back out my recent commit (r80858), it seems to break self-hosting buildbot's stage 2 configure llvm-svn: 80871	2009-09-03 02:02:59 +00:00
Gabor Greif	14dfba6d66	re-commit r66920 (which has been backed out in r66953) I may have more luck this time. I'll back out if needed... llvm-svn: 80858	2009-09-03 00:18:58 +00:00
Andreas Neustifter	ae866b0c66	Sort edges in MaximumSpanningTree more stable in case of equal weight. (See http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20090824/085890.html) llvm-svn: 80789	2009-09-02 14:03:11 +00:00
Andreas Neustifter	964fa2bdac	Changed set of BlocksToInstrument to set of InsertedBlocks that do not have to be instrumented. llvm-svn: 80788	2009-09-02 13:59:05 +00:00
Andreas Neustifter	4469c164d0	Code cleanups and added comments. llvm-svn: 80781	2009-09-02 12:38:39 +00:00
Chris Lattner	4916267c97	fix PR4815: some cases where DeleteDeadInstruction can delete the instruction BBI points to. llvm-svn: 80768	2009-09-02 06:31:02 +00:00
Chris Lattner	09a79dcfdf	clean up this code a bit. llvm-svn: 80767	2009-09-02 06:15:37 +00:00
Chris Lattner	2dd09dbdf7	eliminate VISIBILITY_HIDDEN from Transforms/Scalar. PR4861 llvm-svn: 80766	2009-09-02 06:11:42 +00:00
Chris Lattner	64b5842986	fix PR4837, some bugs folding vector compares. These return a vector of i1, not i1 itself. llvm-svn: 80761	2009-09-02 05:12:37 +00:00
Andreas Neustifter	759094e323	OptimalEdgeProfiling: Creation of profiles. This adds the instrumentation and runtime part of OptimalEdgeProfiling. llvm-svn: 80712	2009-09-01 19:03:44 +00:00
Chris Lattner	9b463729d7	remove CallGraphNode::replaceCallSite, it is redundant with other APIs. llvm-svn: 80708	2009-09-01 18:52:39 +00:00
Chris Lattner	f61b0fb5d0	cleanup/simplify llvm-svn: 80706	2009-09-01 18:50:55 +00:00
Chris Lattner	8900f3ec57	remove a bunch of explicit code previously needed to update the callgraph. This is now dead because RAUW does the job. llvm-svn: 80703	2009-09-01 18:44:06 +00:00
Chris Lattner	1145e33bc6	enhance memcpy opt to turn memmoves into memcpy when the src/dest don't alias. Remove an old and poorly reduced testcase that fails with this transform for reasons unrelated to the original test. llvm-svn: 80693	2009-09-01 17:56:32 +00:00
Chris Lattner	b5557a7b42	random code cleanups, no functionality change. llvm-svn: 80682	2009-09-01 17:09:55 +00:00
Ted Kremenek	1543d133db	Update CMake files. llvm-svn: 80680	2009-09-01 17:01:02 +00:00
Andreas Neustifter	eb5a9d34d6	Preparation for Optimal Edge Profiling: Add statistics for regular edge profiling, this enables the comparation of the number of edges inserted by regular and optimal edge profiling. llvm-svn: 80668	2009-09-01 10:08:39 +00:00
Chris Lattner	063d06527e	Change CallGraphNode to maintain it's Function as an AssertingVH for sanity. This didn't turn up any bugs. Change CallGraphNode to maintain its "callsite" information in the call edges list as a WeakVH instead of as an instruction*. This fixes a broad class of dangling pointer bugs, and makes CallGraph have a number of useful invariants again. This fixes the class of problem indicated by PR4029 and PR3601. llvm-svn: 80663	2009-09-01 06:31:31 +00:00
Chris Lattner	ff5f1e4d70	fix some cases where instcombine would change hte IR but not return true from runOnFunction llvm-svn: 80562	2009-08-31 06:57:37 +00:00
Chris Lattner	9e50747958	comment and simplify some code. llvm-svn: 80540	2009-08-31 05:34:32 +00:00
Chris Lattner	70ebbc59f3	add -debug output llvm-svn: 80539	2009-08-31 05:22:48 +00:00
Chris Lattner	19dd315e67	improve -debug output, so that -debug is more likely to print when instcombine is changing stuff. llvm-svn: 80538	2009-08-31 05:17:58 +00:00
Chris Lattner	4e3e930743	fix a bug I introduced with my 'instcombine builder' refactoring changes: SimplifyDemandedBits can't use the builder yet because it has the wrong insertion point. This fixes a crash building MultiSource/Benchmarks/PAQ8p llvm-svn: 80537	2009-08-31 04:36:22 +00:00
Chris Lattner	2f2110affa	simplify some code by making the SCCNodes set contain Function's instead of CallGraphNode's. This also papers over a callgraph problem where a pass (in this case, MemCpyOpt) introduces a new function into the module (llvm.memset.i64) but doesn't add it to the call graph (nor should it, since it is a function pass). While it might be a good idea for MemCpyOpt to not synthesize functions in a runOnFunction(), there is no need for FunctionAttrs to be boneheaded, so fix it there. This fixes an assertion building 176.gcc. llvm-svn: 80535	2009-08-31 04:09:04 +00:00
Chris Lattner	081375bb08	Fix PR4834, a tricky case where the inliner would resolve an indirect function pointer, inline it, then go to delete the body. The problem is that the callgraph had other references to the function, though the inliner had no way to know it, so we got a dangling pointer and an invalid iterator out of the deal. The fix to this is pretty simple: stop the inliner from deleting the function by knowing that there are references to it. Do this by making CallGraphNodes contain a refcount. This requires moving deletion of available_externally functions to the module-level cleanup sweep where it belongs. llvm-svn: 80533	2009-08-31 03:15:49 +00:00
Chris Lattner	305b115a87	Fix some nasty callgraph dangling pointer problems in argpromotion and structretpromote. Basically, when replacing a function, they used the 'changeFunction' api which changes the entry in the function map (and steals/reuses the callgraph node). This has some interesting effects: first, the problem is that it doesn't update the "callee" edges in any callees of the function in the call graph. Second, this covers for a major problem in all the CGSCC pass stuff, which is that it is completely broken when functions are deleted if they don't reuse a CGN. (there is a cute little fixme about this though :). This patch changes the protocol that CGSCC passes must obey: now the CGSCC pass manager copies the SCC and preincrements its iterator to avoid passes invalidating it. This allows CGSCC passes to mutate the current SCC. However multiple passes may be run on that SCC, so if passes do this, they are now required to update the SCC to be current when they return. Other less interesting parts of this patch are that it makes passes update the CG more directly, eliminates changeFunction, and requires clients of replaceCallSite to specify the new callee CGN if they are changing it. llvm-svn: 80527	2009-08-31 00:19:58 +00:00
Chris Lattner	73913f4cd3	Fix PR4748: don't fold gep(bitcast(x)) into bitcast(gep) when x is itself a bitcast. Since we have gep(bitcast(bitcast(y))) in this case, just wait for the two bitcasts to get zapped. This prevents instcombine from confusing some aliasing stuff, and allows it to directly eliminate the load in the testcase. llvm-svn: 80508	2009-08-30 20:38:21 +00:00
Chris Lattner	c2f2cf896e	misc cleanup llvm-svn: 80507	2009-08-30 20:36:46 +00:00
Chris Lattner	a3e620caba	add getPointerAddressSpace() to GEP instruction, use the method in a few scalar xforms to simplify things. llvm-svn: 80506	2009-08-30 20:06:40 +00:00
Chris Lattner	c856539edf	eliminate InsertCastBefore, use the builder instead. llvm-svn: 80505	2009-08-30 20:01:10 +00:00
Chris Lattner	606da5fed8	eliminate InsertBitCastBefore, just use the builder instead. llvm-svn: 80504	2009-08-30 19:47:22 +00:00
Chris Lattner	5966341a2e	convert a bunch more calls to InsertNewInstBefore to use the new Instcombine builder. llvm-svn: 80501	2009-08-30 18:50:58 +00:00
Chris Lattner	8326d529da	fix typo llvm-svn: 80500	2009-08-30 17:53:59 +00:00
Chris Lattner	022a582de2	give instcombine a custom IRBuilder that adds new instructions to the workslist and is set to insert new instructions before the current one. Convert a bunch of stuff that used to call InsertNewInstBefore over to use it, greatly simplifying code and making it more natural. There is still a lot more to go, but this is a good start. llvm-svn: 80492	2009-08-30 07:44:24 +00:00
Chris Lattner	a0c89ee1da	add a new InstCombineWorklist::AddValue method that works even if the operand is not an instruction. Simplify most uses of AddOperandsToWorkList to use AddValue and inline it into the one remaining callsite. llvm-svn: 80488	2009-08-30 06:27:41 +00:00

... 4 5 6 7 8 ...

6100 Commits