llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	783b7198b7	Undo a previous restriction on the inline cost calculation which Nick introduced. Specifically, there are cost reductions for all constant-operand icmp instructions against an alloca, regardless of whether the alloca will in fact be eligible for SROA. That means we don't want to abort the icmp reduction computation when we abort the SROA reduction computation. That in turn frees us from the need to keep a separate worklist and defer the ICmp calculations. Use this new-found freedom and some judicious function boundaries to factor the innards of computing the cost factor of any given instruction out of the loop over the instructions and into static helper functions. This greatly simplifies the code, and hopefully makes it more clear what is happening here. Reviewed by Eric Christopher. There is some concern that we'd like to ensure this doesn't get out of hand, and I plan to benchmark the effects of this change over the next few days along with some further fixes to the inline cost. llvm-svn: 152368	2012-03-09 02:49:36 +00:00
Stepan Dyatkovskiy	5b648afb4d	Took into account Duncan's comments on r149481 from 2nd Feb 2012: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120130/136146.html Implemented CaseIterator, which solves almost all of the described issues: we don't need to mix operand/case/successor indexing anymore. The base iterator class is implemented as a template since it may be initialized either from "const SwitchInst" or from "SwitchInst". ConstCaseIt is just a read-only iterator. CaseIt is a read-write iterator; it allows changing the case successor and case value. Using the iterator allows us to remove the resolveXXXX methods entirely. All indexing conversions are done automatically inside the iterator's getters. The main usage of the iterator looks like this: SwitchInst *SI = ... // initialize it somehow for (SwitchInst::CaseIt i = SI->caseBegin(), e = SI->caseEnd(); i != e; ++i) { BasicBlock *BB = i.getCaseSuccessor(); ConstantInt *V = i.getCaseValue(); // Do something. } If you want to convert a case number to a TerminatorInst successor index, just use the iterator's getSuccessorIndex method. If you want to initialize an iterator from a TerminatorInst successor index, use the CaseIt::fromSuccessorIndex(...) method. There are also related changes in llvm-clients: klee and clang. llvm-svn: 152297	2012-03-08 07:06:20 +00:00
Chandler Carruth	dd1637c393	Rotate two of the functions used to count bonuses for the inline cost analysis to be methods on the cost analysis's function info object instead of the code metrics object. These really are just users of the code metrics, they're building the information for the function's analysis. This is the first step of growing the amount of information we collect about a function in order to cope with pair-wise simplifications due to allocas. llvm-svn: 152283	2012-03-08 02:04:19 +00:00
Nick Lewycky	1d57ee341a	No functionality change. Type::isSized() can be expensive, so avoid calling it until after other inexpensive tests. llvm-svn: 152195	2012-03-07 02:27:53 +00:00
Eli Friedman	af3c6fe51e	A few more cases of missing masking in ComputeMaskedBits; found by inspection. llvm-svn: 152070	2012-03-05 23:22:40 +00:00
Eli Friedman	a8b75ac798	Make sure we don't return bits outside the mask in ComputeMaskedBits. PR12189. llvm-svn: 152066	2012-03-05 23:09:40 +00:00
Benjamin Kramer	d9d80b1dde	LVI: Recognize the form instcombine canonicalizes range checks into when forming constant ranges. This could probably be made a lot smarter, but this is a common case and doesn't require LVI to scan a lot of code. With this change CVP can optimize away the "shift == 0" case in Hashing.h that only gets hit when "shift" is in a range not containing 0. llvm-svn: 151919	2012-03-02 15:34:43 +00:00
Eli Friedman	0774902a00	Duncan pointed out that if the alignment isn't explicitly specified, it defaults to the ABI alignment. Given that, make this code a bit more aggressive in such cases. llvm-svn: 151584	2012-02-27 23:16:46 +00:00
Eli Friedman	8bc169c3c5	Teach BasicAA about the LLVM IR rules that allow reading past the end of an object given sufficient alignment. Fixes PR12098. llvm-svn: 151553	2012-02-27 20:46:07 +00:00
Rafael Espindola	09a4201d3c	Fix this assert. IP can point to an instruction with strange dominance properties (invoke). Just assert that the instruction we return dominates the insertion point. llvm-svn: 151511	2012-02-27 02:13:03 +00:00
Rafael Espindola	b660977c67	Don't call dominates on unreachable instructions. Should fix the dragonegg build. Testcase is still reducing. llvm-svn: 151474	2012-02-26 05:30:08 +00:00
Rafael Espindola	ae725715ef	And update the comment... llvm-svn: 151472	2012-02-26 02:36:56 +00:00
Rafael Espindola	fa75542078	Enable the assert that got all this dominator work started. llvm-svn: 151471	2012-02-26 02:29:18 +00:00
Rafael Espindola	94df267db3	Change the implementation of dominates(inst, inst) to one based on what the verifier does. This correctly handles invoke. Thanks to Duncan, Andrew and Chris for the comments. Thanks to Joerg for the early testing. llvm-svn: 151469	2012-02-26 02:19:19 +00:00
Nick Lewycky	3db143ea8c	Reinstate the optimization from r151449 with a fix to not turn 'gep %x' into 'gep null' when the icmp predicate is unsigned (or is signed without inbounds). llvm-svn: 151467	2012-02-26 02:09:49 +00:00
Rafael Espindola	c8c2b06a90	Don't call dominates on unreachable instructions. llvm-svn: 151466	2012-02-26 01:50:14 +00:00
Nick Lewycky	7bbd72da46	Roll these back to r151448 until I figure out how they're breaking MultiSource/Applications/lua. llvm-svn: 151463	2012-02-25 23:01:19 +00:00
Nick Lewycky	eeeffbb497	An argument and a local identified object (eg. a noalias call) could turn out equal if both are null. In the test, scope type %t and global @y by adding a 'gep' prefix to them. llvm-svn: 151452	2012-02-25 20:19:07 +00:00
Nick Lewycky	7b99bada0b	Fix five-letter typo in comment. llvm-svn: 151450	2012-02-25 19:12:58 +00:00
Nick Lewycky	51f2be8bff	Teach instsimplify to be more aggressive when analyzing comparisons of pointers by using llvm::isIdentifiedObject. Also teach it to handle GEPs that have the same base pointer and constant operands. Fixes PR11238! llvm-svn: 151449	2012-02-25 19:07:42 +00:00
Nick Lewycky	3f885b65a2	Move isKnownNonNull from private implementation detail of BasicAA to a public function that others can use, next to llvm::isIdentifiedObject. llvm-svn: 151446	2012-02-25 10:56:28 +00:00
Chris Lattner	01990f0e1c	fix PR12075, a regression in a recent transform I added. In unreachable code, gep chains can be infinite. Just like "stripPointerCasts", use a set to keep track of visited instructions so we don't recurse infinitely. llvm-svn: 151383	2012-02-24 19:01:58 +00:00
Rafael Espindola	f35c789031	Fix typo. llvm-svn: 151238	2012-02-23 05:38:51 +00:00
Chad Rosier	5dfe6dab25	Remove extra semi-colons. llvm-svn: 151169	2012-02-22 17:25:00 +00:00
Rafael Espindola	337cfaf757	Improve comment. Thanks to Andrew for the suggestion. llvm-svn: 151127	2012-02-22 03:44:46 +00:00
Rafael Espindola	cd06b482d2	Semantically revert 151015. Add a comment on why we should be able to assert the dominance once the dominates method is fixed and why we can use the builder's insertion point. Fixes pr12048. llvm-svn: 151125	2012-02-22 03:21:39 +00:00
Rafael Espindola	b41b407f3d	s/the the/the/ llvm-svn: 151079	2012-02-21 19:27:16 +00:00
Rafael Espindola	729e3aae92	Use more idiomatic assert. llvm-svn: 151026	2012-02-21 03:51:14 +00:00
Rafael Espindola	b2defca267	Avoid warning on non assert builds. llvm-svn: 151025	2012-02-21 03:48:30 +00:00
Rafael Espindola	7d445e92c3	It turns out that with the current scev organization ReuseOrCreateCast cannot know where users will be added. Because of this, it cannot use Builder.GetInsertPoint at all. This patch * removes the FIXME about adding the assert. * adds a comment explaining why we don't have one. * removes broken logic that only works for some callers and is not needed since r150884. * adds an assert to the caller that would have caught the bug fixed by r150884. llvm-svn: 151015	2012-02-21 01:19:51 +00:00
Eric Christopher	4826c8fbe8	Make this a bit prettier and more obvious when a derived type isn't derived from anything. llvm-svn: 150975	2012-02-20 18:04:39 +00:00
Eric Christopher	300871076e	If a derived type is also a composite type, print that information too. llvm-svn: 150974	2012-02-20 18:04:35 +00:00
Eric Christopher	8979712685	Add support for runtime languages on our forward declarations. llvm-svn: 150973	2012-02-20 18:04:14 +00:00
Chris Lattner	445d8c6b50	fold comparisons of gep'd alloca points with null to false, implementing PR12013. We now compile the testcase to: __Z4testv: ## @_Z4testv ## BB#0: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit pushq %rbx subq $64, %rsp leaq 32(%rsp), %rbx movq %rbx, (%rsp) leaq 64(%rsp), %rax movq %rax, 16(%rsp) movl $1, 32(%rsp) leaq 36(%rsp), %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_2 ## BB#1: callq _free LBB0_2: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret instead of: __Z4testv: ## @_Z4testv ## BB#0: pushq %rbx subq $64, %rsp xorl %eax, %eax leaq (%rsp), %rbx addq $32, %rbx movq %rbx, (%rsp) movq %rbx, 8(%rsp) leaq 64(%rsp), %rcx movq %rcx, 16(%rsp) je LBB0_2 ## BB#1: movl $1, 32(%rsp) movq %rbx, %rax LBB0_2: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit addq $4, %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_4 ## BB#3: callq _free LBB0_4: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret This doesn't shrink clang noticeably though. llvm-svn: 150944	2012-02-20 00:42:49 +00:00
Rafael Espindola	991356e89b	Temporarily disable this assert. Looks like it found a similar issue when building bullet. llvm-svn: 150885	2012-02-18 17:51:43 +00:00
Rafael Espindola	82d957593e	Don't skip debug instructions when looking for the insertion point of the cast. If we do, we can end up with inst1 --------------- < Insertion point dbg inst new inst instead of the desired inst1 new inst --------------- < Insertion point dbg inst Another option would be for InsertNoopCastOfTo (or its callers) to move the insertion point and we would end up with inst1 dbg inst new inst --------------- < Insertion point but that complicates the callers. This fixes PR12018 (and firefox's build). llvm-svn: 150884	2012-02-18 17:22:58 +00:00
Eli Friedman	952d1f9f40	Fix a rather nasty regression from r150690: LHS != RHS does not imply LHS->stripPointerCasts() != RHS->stripPointerCasts(). llvm-svn: 150863	2012-02-18 03:29:25 +00:00
Dan Gohman	9017b846d4	Remove a comment about an alternative approach that wouldn't actually work, at least as described. LLVM Metadata is not intended to suppress LLVM IR rules, as it can be stripped at any time. llvm-svn: 150821	2012-02-17 18:33:38 +00:00
Eric Christopher	b23b32e43b	Typo in variable name. llvm-svn: 150796	2012-02-17 07:08:46 +00:00
Benjamin Kramer	08f18b1b74	Revert "InstSimplify: Strip pointer casts early." Turns out this isn't safe, because the code below depends on LHS and RHS having the same type. llvm-svn: 150695	2012-02-16 15:19:59 +00:00
Benjamin Kramer	3d27f71f2d	InstSimplify: Strip pointer casts early. llvm-svn: 150694	2012-02-16 15:03:04 +00:00
Benjamin Kramer	ea51f62e4b	InstSimplify: Ignore pointer casts when constant folding compares between pointers. llvm-svn: 150690	2012-02-16 13:49:39 +00:00
Hal Finkel	56f6b0f219	Have AliasSet::aliasesUnknownInst use pointer TBAA info when available llvm-svn: 150249	2012-02-10 15:52:39 +00:00
Duncan Sands	26641d7c02	Fix PR11948: the result type of an icmp may be a vector of boolean - don't assume it is a boolean. llvm-svn: 150247	2012-02-10 14:31:24 +00:00
Eric Christopher	ae56eecf5f	Add support for a temporary forward decl type. We want this so we can rauw forward declarations if we decide to emit the full type. Part of rdar://10809898 llvm-svn: 150024	2012-02-08 00:22:26 +00:00
Devang Patel	a93cc25b79	Remove tabs. llvm-svn: 150022	2012-02-08 00:17:07 +00:00
Craig Topper	a2886c21d9	Convert assert(0) to llvm_unreachable llvm-svn: 149967	2012-02-07 05:05:23 +00:00
Kostya Serebryany	9e0d377400	The patch resolves the conflict between AddressSanitizer and load widening (GVN). The problem was initially reported by Mozilla folks (http://code.google.com/p/address-sanitizer/issues/detail?id=20), but it also prevents us from enabling LLVM bootstrap with AddressSanitizer. llvm-svn: 149925	2012-02-06 22:48:56 +00:00
Chris Lattner	8213c8af29	Remove some dead code and tidy things up now that vectors use ConstantDataVector instead of always using ConstantVector. llvm-svn: 149912	2012-02-06 21:56:39 +00:00
Bill Wendling	0aef16afd5	[unwind removal] Remove all of the code for the dead 'unwind' instruction. There were no 'unwind' instructions being generated before this, so this is in effect a no-op. llvm-svn: 149906	2012-02-06 21:44:22 +00:00
Bill Wendling	d5d95b0b51	[unwind removal] We no longer have 'unwind' instructions being generated, so remove the code that handles them. llvm-svn: 149901	2012-02-06 21:16:41 +00:00
Devang Patel	4488217f73	DebugInfo: Provide a new hook to encode relationship between a property and an ivar. llvm-svn: 149874	2012-02-06 17:49:43 +00:00
Duncan Sands	ae22c60f90	Persuade GCC that there is nothing worth warning about here (there isn't). llvm-svn: 149834	2012-02-05 14:20:11 +00:00
Chris Lattner	cf9e8f6968	reapply the patches reverted in r149470 that reenable ConstantDataArray, but with a critical fix to the SelectionDAG code that optimizes copies from strings into immediate stores: the previous code was stopping reading string data at the first nul. Address this by adding a new argument to llvm::getConstantStringInfo, preserving the behavior before the patch. llvm-svn: 149800	2012-02-05 02:29:43 +00:00
Qirun Zhang	e788fac623	remove the blank line from previous ci. llvm-svn: 149758	2012-02-04 03:18:47 +00:00
Qirun Zhang	dabce3f4e9	test commit. add a blank line. llvm-svn: 149757	2012-02-04 03:15:26 +00:00
Devang Patel	cc481596e4	Introduce DIObjCProperty. This will be used to encode objective-c property. llvm-svn: 149732	2012-02-04 00:59:25 +00:00
Stepan Dyatkovskiy	513aaa5691	SwitchInst refactoring. The purpose of the refactoring is to hide operand roles from the SwitchInst user (programmer). If you want to work with operands directly, you will probably need lower level methods than the SwitchInst ones (TerminatorInst or maybe User). After this patch we can reorganize SwitchInst operands and successors as we want. What was done: 1. Changed the semantics of the index inside the getCaseValue method: getCaseValue(0) means "get first case", not a condition. Use getCondition() if you want to resolve the condition. I propose not mixing SwitchInst case indexing with low level indexing (TI successors indexing, User's operands indexing), since it may be dangerous. 2. For the same reason findCaseValue(ConstantInt*) returns the actual number of the case value. 0 means first case, not default. If there is no case with the given value, ErrorIndex will be returned. 3. Added getCaseSuccessor method. I propose avoiding TerminatorInst::getSuccessor if you want to resolve the case successor BB. Use getCaseSuccessor instead, since the internal SwitchInst organization of operands/successors is hidden and may change at any moment. 4. Added resolveSuccessorIndex and resolveCaseIndex. The main purpose of these methods is to see how case successors are really mapped in TerminatorInst. 4.1 "resolveSuccessorIndex" is for when you need to level down from SwitchInst to TerminatorInst. It returns TerminatorInst's successor index for the given case successor. 4.2 "resolveCaseIndex" converts a low level successor index to the case index that corresponds to the given successor. Note: There are also related compatibility fix patches for dragonegg, klee, llvm-gcc-4.0, llvm-gcc-4.2, safecode, clang. llvm-svn: 149481	2012-02-01 07:49:51 +00:00
Argyrios Kyrtzidis	17c981a45b	Revert Chris' commits up to r149348 that started causing VMCoreTests unit test to fail. These are: r149348 r149351 r149352 r149354 r149356 r149357 r149361 r149362 r149364 r149365 llvm-svn: 149470	2012-02-01 04:51:17 +00:00
Chris Lattner	997348e9fe	remove the last vestiges of llvm::GetConstantStringInfo, in CodeGen. llvm-svn: 149356	2012-01-31 05:09:17 +00:00
Chris Lattner	108423a94a	Change ConstantArray::get to form a ConstantDataArray when possible, kicking in the big win of ConstantDataArray. As part of this, change the implementation of GetConstantStringInfo in ValueTracking to work with ConstantDataArray (and not ConstantArray) making it dramatically, amazingly, more efficient in the process and renaming it to getConstantStringInfo. This keeps around a GetConstantStringInfo entrypoint that (grossly) forwards to getConstantStringInfo and constructs the std::string required, but existing clients should move over to getConstantStringInfo instead. llvm-svn: 149351	2012-01-31 04:42:22 +00:00
Rafael Espindola	bb893fea6b	Add r149110 back with a fix for when the vector and the int have the same width. llvm-svn: 149151	2012-01-27 23:33:07 +00:00
Rafael Espindola	a4062624d1	Revert r149110 and add a testcase that was crashing since that revision. Unfortunately I also had to disable constant-pool-sharing.ll the code it tests has been updated to use the IL logic. llvm-svn: 149148	2012-01-27 22:42:48 +00:00
Chris Lattner	111d6ee655	enhance constant folding to be able to constant fold bitcast of ConstantVector's to integer type. llvm-svn: 149110	2012-01-27 01:44:03 +00:00
Chris Lattner	61a1d6cb81	progress making the world safe to ConstantDataVector. While we're at it, allow PatternMatch's "neg" pattern to match integer vector negations, and enhance ComputeNumSignBits to handle shl of vectors. llvm-svn: 149082	2012-01-26 21:37:55 +00:00
Nick Lewycky	0e496cddf0	Use precomputed BB size instead of BB->size(). llvm-svn: 148964	2012-01-25 18:54:13 +00:00
Nick Lewycky	70d50ee8fb	Support pointer comparisons against constants, when looking at the inline-cost savings from a pointer argument becoming an alloca. Sometimes callees will even compare a pointer to null and then branch to an otherwise unreachable block! Detect these cases and compute the number of saved instructions, instead of bailing out and reporting no savings. llvm-svn: 148941	2012-01-25 08:27:40 +00:00
Chris Lattner	6705883ad8	use Constant::getAggregateElement to simplify a bunch of code. llvm-svn: 148934	2012-01-25 06:48:06 +00:00
Chris Lattner	9be59599b3	Use the right method to get the # elements in a CDS. llvm-svn: 148897	2012-01-25 01:27:20 +00:00
Chris Lattner	f7eb543380	teach valuetracking about ConstantDataSequential llvm-svn: 148790	2012-01-24 07:54:10 +00:00
Chris Lattner	e166a8548f	switch SCEV to use the new ConstantFoldLoadThroughGEPIndices function instead of its own hard coded thing, allowing it to handle ConstantDataSequential and fixing some obscure bugs (e.g. it would previously crash on a CAZ of vector type). llvm-svn: 148788	2012-01-24 05:49:24 +00:00
Chris Lattner	f488b35826	Split the interesting bits of ConstantFoldLoadThroughGEPConstantExpr out into a new ConstantFoldLoadThroughGEPIndices (more useful) function and rewrite it to be simpler, more efficient, and to handle the new ConstantDataSequential type. Enhance ConstantFoldLoadFromConstPtr to handle ConstantDataSequential. llvm-svn: 148786	2012-01-24 05:43:50 +00:00
David Blaikie	46a9f016c5	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
Benjamin Kramer	fe4848b55d	Remove obviously invalid early exit that prevented analyzing ConstantAggregateZeros. Found by the clang static analyzer. llvm-svn: 148540	2012-01-20 14:42:25 +00:00
Nick Lewycky	e8415fea4b	Fix CountCodeReductionForAlloca to more accurately represent what SROA can and can't handle. Also don't produce non-zero results for things which won't be transformed by SROA at all just because we saw the loads/stores before we saw the use of the address. llvm-svn: 148536	2012-01-20 08:35:20 +00:00
Andrew Trick	c908b43d9f	SCEVExpander fixes. Affects LSR and indvars. LSR has gradually been improved to more aggressively reuse existing code, particularly existing phi cycles. This exposed problems with the SCEVExpander's sloppy treatment of its insertion point. I applied some rigor to the insertion point problem that will hopefully avoid an endless bug cycle in this area. Changes: - Always used properlyDominates to check safe code hoisting. - The insertion point provided to SCEV is now considered a lower bound. This is usually a block terminator or the use itself. Under no circumstance may SCEVExpander insert below this point. - LSR is responsible for finding a "canonical" insertion point across expansion of different expressions. - Robust logic to determine whether IV increments are in "expanded" form and/or can be safely hoisted above some insertion point. Fixes PR11783: SCEVExpander assert. llvm-svn: 148535	2012-01-20 07:41:13 +00:00
Bill Wendling	75afc7afe8	Remove dead code. llvm-svn: 148384	2012-01-18 10:10:28 +00:00
Jakub Staszak	173bce3d2b	Move includes to the .cpp file. llvm-svn: 148342	2012-01-17 22:16:31 +00:00
Andrew Trick	23ef0d6c40	Fix a corner case hit by redundant phi elimination running after LSR. Fixes PR11761: bad IR w/ redundant Phi elim llvm-svn: 148177	2012-01-14 03:17:23 +00:00
Bill Wendling	58c7569854	A DenseMap of a std::map isn't a very good idea because the "grow()" method will need to make a deep copy of each of the std::maps. Use a std::map of the std::map instead. This improves the compile time of sqlite3 by ~2%. llvm-svn: 148003	2012-01-12 01:41:03 +00:00
Bill Wendling	4ec081a4d2	Revert r147978. A DenseMap's iterators may become invalidated here. llvm-svn: 147980	2012-01-11 23:43:34 +00:00
Bill Wendling	f0275df9e3	Use a DenseMap. This appears to improve sqlite3's compile time by ~2%. llvm-svn: 147978	2012-01-11 22:57:32 +00:00
Andrew Trick	e81211f45c	Clarified the SCEV getSmallConstantTripCount interface with in-your-face comments. This interface is misleading and dangerous, but it is actually what we need for unrolling. llvm-svn: 147926	2012-01-11 06:52:55 +00:00
Eric Christopher	43a1182975	Don't avoid recursing for pointer types, just reference types. Expand on the comment. Fixes constvars.exp on the gdb test builder. llvm-svn: 147897	2012-01-11 00:01:29 +00:00
Chandler Carruth	4c0ee749bb	Cleanup these asserts to follow common LLVM style and coding conventions. Also, clarify the grouping of one of the asserts to silence -Wparentheses. llvm-svn: 147863	2012-01-10 18:18:52 +00:00
David Blaikie	edbb58c577	Remove unnecessary default cases in switches that cover all enum values. llvm-svn: 147855	2012-01-10 16:47:17 +00:00
Andrew Trick	d5d2db9af9	Enable LSR IV Chains with sufficient heuristics. These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target specific optimization following LSR. The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet. As a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address plus stride addition in the addressing mode. GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826	2012-01-10 01:45:08 +00:00
Devang Patel	fa8df4837a	Update language check. Do not ignore DW_LANG_Python. Patch by Joe Groff! llvm-svn: 147781	2012-01-09 17:49:47 +00:00
Andrew Trick	f730f39f3f	Cleanup comments and argument types related to my previous replaceCongruentPhis checkin. llvm-svn: 147709	2012-01-07 01:29:21 +00:00
Andrew Trick	5adedf5d47	Extended replaceCongruentPhis to handle mixed phi types. llvm-svn: 147707	2012-01-07 01:12:09 +00:00
Andrew Trick	881a776875	Expose isNonConstantNegative to users of ScalarEvolution. llvm-svn: 147700	2012-01-07 00:27:31 +00:00
Andrew Trick	9a5b242d3c	Put all IVUsers in the processed set. Allow querying IVUsers with isIVUserOrOperand. llvm-svn: 147686	2012-01-06 21:41:55 +00:00
Andrew Trick	b8045cbcb1	SCEVExpander: hoistStep should check strict dominance. llvm-svn: 147683	2012-01-06 21:23:43 +00:00
Dan Gohman	7ac046a261	Generalize isSafeToSpeculativelyExecute to work on arbitrary Values, rather than just Instructions, since it's interesting for ConstantExprs too. llvm-svn: 147560	2012-01-04 23:01:09 +00:00
Andrew Trick	cbcc98fb50	Fix SCEVExpander to handle loops with no preheader when LSR gives it a "phony" insertion point. Fixes rdar://10619599: "SelectionDAGBuilder shouldn't visit PHI nodes!" assert llvm-svn: 147439	2012-01-02 21:25:10 +00:00
Benjamin Kramer	9442cd01f6	PatternMatch: Introduce a matcher for instructions with the "exact" bit. Use it to simplify a few matchers. llvm-svn: 147403	2012-01-01 17:55:30 +00:00
Nick Lewycky	4c378a4453	Change CaptureTracking to pass a Use* instead of a Value* when a value is captured. This allows the tracker to look at the specific use, which may be especially interesting for function calls. Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does not iterate until a fixpoint and does not guarantee that it produces the same result regardless of iteration order. The new implementation builds up a graph of how arguments are passed from function to function, and uses a bottom-up walk on the argument-SCCs to assign nocapture. This gets us nocapture more often, and does so rather efficiently and independent of iteration order. llvm-svn: 147327	2011-12-28 23:24:21 +00:00
Benjamin Kramer	4ee5747fdd	ComputeMaskedBits: Make knownzero computation more aggressive for ctlz with undef zero. unsigned foo(unsigned x) { return 31 - __builtin_clz(x); } now compiles into a single "bsrl" instruction on x86. llvm-svn: 147255	2011-12-24 17:31:46 +00:00
Chandler Carruth	b024aa021d	Make the unreachable probability much much heavier. The previous probability wouldn't be considered "hot" in some weird loop structures or other compounding probability patterns. This makes it much harder to confuse, but isn't really a principled fix. I'd actually like it if we could model a zero probability, as it would make this much easier to reason about. Suggestions for how to do this better are welcome. llvm-svn: 147142	2011-12-22 09:26:37 +00:00
Nick Lewycky	c186d07bbe	Continue counting intrinsics as instructions (except when they aren't, such as debug info) and for being vector operations. Fixes regression from r147037. llvm-svn: 147093	2011-12-21 20:26:03 +00:00
Nick Lewycky	281e2747e0	Fix typo and spacing, no functionality change. llvm-svn: 147092	2011-12-21 20:21:55 +00:00
Nick Lewycky	da22fc6a1d	A call to a function marked 'noinline' is not an inline candidate. The sole call site of an intrinsic is also not an inline candidate. While here, make it more obvious that this code ignores all intrinsics. Noticed by inspection! llvm-svn: 147037	2011-12-21 06:06:30 +00:00
Nick Lewycky	b4039f633c	Make some intrinsics safe to speculatively execute. llvm-svn: 147036	2011-12-21 05:52:02 +00:00
Jakub Staszak	96f8c551e3	Add some constantness to BranchProbabilityInfo and BlockFrequencyInfo. llvm-svn: 146986	2011-12-20 20:03:10 +00:00
David Blaikie	a379b18173	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146960	2011-12-20 02:50:00 +00:00
Andrew Trick	b9aa26f8ea	LSR: Fix another corner case in expansion of postinc users. Fixes PR11571: Instruction does not dominate all uses llvm-svn: 146950	2011-12-20 01:42:24 +00:00
Joerg Sonnenberger	d6cb7649d8	Allow inlining of functions with returns_twice calls, if they have the attribute themselves. llvm-svn: 146851	2011-12-18 20:35:43 +00:00
Eric Christopher	27886c6c1e	When recursing for the original size of a type, stop if we are at a pointer or a reference type - we actually just want the size of the pointer then for that. Fixes rdar://10335756 llvm-svn: 146785	2011-12-16 23:42:45 +00:00
Devang Patel	78847f0bbe	In DICompositeType, referenced to derived type is either metadata or null. llvm-svn: 146744	2011-12-16 17:51:31 +00:00
Devang Patel	cdd833eb28	Virtual table holder field is either metadata or null. llvm-svn: 146665	2011-12-15 17:55:56 +00:00
Dan Gohman	75d7d5e988	Move Instruction::isSafeToSpeculativelyExecute out of VMCore and into Analysis as a standalone function, since there's no need for it to be in VMCore. Also, update it to use isKnownNonZero and other goodies available in Analysis, making it more precise, enabling more aggressive optimization. llvm-svn: 146610	2011-12-14 23:49:11 +00:00
Andrew Trick	e0ced62119	LSR: Fold redundant bitcasts on-the-fly. llvm-svn: 146597	2011-12-14 22:07:19 +00:00
Eli Friedman	fdeaf25827	Fix a stupid typo in MemDepPrinter. llvm-svn: 146549	2011-12-14 02:54:39 +00:00
Daniel Dunbar	8889bb08b8	LLVMBuild: Introduce a common section which currently has a list of the subdirectories to traverse into. - Originally I wanted to avoid this and just autoscan, but this has one key flaw in that new subdirectories can not automatically trigger a rerun of the llvm-build tool. This is particularly a pain when switching back and forth between trees where one has added a subdirectory, as the dependencies will tend to be wrong. This also implicitly eliminates a FIXME. llvm-svn: 146436	2011-12-12 22:45:54 +00:00
Daniel Dunbar	27a7489a03	LLVMBuild: Remove trailing newline, which irked me. llvm-svn: 146409	2011-12-12 19:48:00 +00:00
Chandler Carruth	58a71ed339	Switch llvm.cttz and llvm.ctlz to accept a second i1 parameter which indicates whether the intrinsic has a defined result for a first argument equal to zero. This will eventually allow these intrinsics to accurately model the semantics of GCC's __builtin_ctz and __builtin_clz and the X86 instructions (prior to AVX) which implement them. This patch merely sets the stage by extending the signature of these intrinsics and establishing auto-upgrade logic so that the old spelling still works both in IR and in bitcode. The upgrade logic preserves the existing (inefficient) semantics. This patch should not change any behavior. CodeGen isn't updated because it can use the existing semantics regardless of the flag's value. Note that this will be followed by API updates to Clang and DragonEgg. Reviewed by Nick Lewycky! llvm-svn: 146357	2011-12-12 04:26:04 +00:00
Chad Rosier	8abf65a130	Probably not a good idea to convert a single vector load into a memcpy. We don't do this now, but add a test case to prevent this from happening in the future. Additional test for rdar://9892684 llvm-svn: 145879	2011-12-06 00:19:08 +00:00
Nadav Rotem	3924cb0267	Add support for vectors of pointers. llvm-svn: 145801	2011-12-05 06:29:09 +00:00
Benjamin Kramer	bbf3c60786	Clear the new cache. llvm-svn: 145771	2011-12-03 15:19:55 +00:00
Benjamin Kramer	3664708378	Add a "seen blocks" cache to LVI to avoid a linear scan over the whole cache just to remove no blocks from the maps. -15% on ARMDisassembler.cpp (Release build). It's not that great to add another layer of caching to the caching-heavy LVI but I don't see a better way. llvm-svn: 145770	2011-12-03 15:16:45 +00:00
Chad Rosier	0155a63513	Add support for constant folding the pow intrinsic. rdar://10514247 llvm-svn: 145730	2011-12-03 00:00:03 +00:00
Chad Rosier	43a33066b4	Fix a few more places where TargetData/TargetLibraryInfo is not being passed. Add FIXMEs to places that are non-trivial to fix. llvm-svn: 145661	2011-12-02 01:26:24 +00:00
Chad Rosier	576c0f8e54	Abuse of mass replace isn't warranted even when the build is failing. Thanks for the suggestion, Eric. llvm-svn: 145643	2011-12-01 23:16:03 +00:00
Chad Rosier	54a506dcb1	Fix build by not assuming TLI is guaranteed. Will have to track down cases where TLI isn't being passed to ensure we don't miss opportunities to fold calls. llvm-svn: 145641	2011-12-01 22:38:31 +00:00
Chad Rosier	3367123b12	Prevent library calls from being folded if -fno-builtin has been specified. rdar://10500969 llvm-svn: 145639	2011-12-01 22:14:50 +00:00
Chad Rosier	e6de63dfc5	Last bit of TargetLibraryInfo propagation. Also fixed a case for TargetData where it appeared beneficial to pass. More of rdar://10500969 llvm-svn: 145630	2011-12-01 21:29:16 +00:00
Chad Rosier	c24b86ffbe	Propagate TargetLibraryInfo throughout ConstantFolding.cpp and InstructionSimplify.cpp. Other fixups as needed. Part of rdar://10500969 llvm-svn: 145559	2011-12-01 03:08:23 +00:00
Nick Lewycky	e659b8459e	Make use of "getScalarType()". No functionality change. llvm-svn: 145556	2011-12-01 02:39:36 +00:00
Andrew Trick	ceafa2c746	LSR: handle the expansion of phi operands that use postinc forms of the IV. Fixes PR11431: SCEVExpander::expandAddRecExprLiterally(const llvm::SCEVAddRecExpr*): Assertion `(!isa<Instruction>(Result) || SE.DT->dominates(cast<Instruction>(Result), Builder.GetInsertPoint())) && "postinc expansion does not dominate use"' failed. llvm-svn: 145482	2011-11-30 06:07:54 +00:00
Daniel Dunbar	539d0a8a09	build/CMake: Finish removal of add_llvm_library_dependencies. llvm-svn: 145420	2011-11-29 19:25:30 +00:00
Duncan Sands	ca6f8ddbf8	Fix a theoretical problem (not seen in the wild): if different instances of a weak variable are compiled by different compilers, such as GCC and LLVM, while LLVM may increase the alignment to the preferred alignment there is no reason to think that GCC will use anything more than the ABI alignment. Since it is the GCC version that might end up in the final program (as the linkage is weak), it is wrong to increase the alignment of loads from the global up to the preferred alignment as the alignment might only be the ABI alignment. Increasing alignment up to the ABI alignment might be OK, but I'm not totally convinced that it is. It seems better to just leave the alignment of weak globals alone. llvm-svn: 145413	2011-11-29 18:26:38 +00:00
Andrew Trick	d25089f8e0	SCEV fix. In general, Add/Mul expressions should not inherit NSW/NUW. This reverts r139450, fixes r139453, and adds much needed comments and a unit test. llvm-svn: 145367	2011-11-29 02:16:38 +00:00
Andrew Trick	d912a5b2e3	Make SCEV print <nsw><nuw> for Add/MulExpr. llvm-svn: 145364	2011-11-29 02:06:35 +00:00
Eli Friedman	e7ab1a2f0f	Make SelectionDAG::InferPtrAlignment use llvm::ComputeMaskedBits instead of duplicating the logic for globals. Make llvm::ComputeMaskedBits handle GlobalVariables slightly more aggressively, to match what InferPtrAlignment knew how to do. llvm-svn: 145304	2011-11-28 22:48:22 +00:00
Andrew Trick	a8bdb7cbf1	Remove the temporary flag -disable-unroll-scev and dead code. SCEV should now be used for trip count analysis, not LoopInfo. llvm-svn: 145262	2011-11-28 19:22:09 +00:00
Benjamin Kramer	7ba71be392	Move code into anonymous namespaces. llvm-svn: 145154	2011-11-26 23:01:57 +00:00
Benjamin Kramer	6e013bf96c	Validate the return type when checking if a function is malloc. Fixes PR11426. Not sure if a test case with a "wrong" malloc would be useful. llvm-svn: 145106	2011-11-23 17:58:47 +00:00
Duncan Sands	81a2af12d6	Fix a crash in which a multiplication was being reported as being both negative and positive: positive, because it could be directly computed to be positive; negative, because the nsw flags means it is either negative or undefined (the multiplication always overflowed). llvm-svn: 145104	2011-11-23 16:26:47 +00:00
Nick Lewycky	063ae5897c	Fix crasher in GVN due to my recent capture tracking changes. llvm-svn: 145047	2011-11-21 19:42:56 +00:00
Nick Lewycky	aa2a00db35	Add virtual destructor. Whoops! llvm-svn: 145044	2011-11-21 18:32:21 +00:00
Nick Lewycky	6ae03c3378	Less template, more virtual! Refactoring suggested by Chris in code review. llvm-svn: 145014	2011-11-20 19:37:06 +00:00
Nick Lewycky	612d70b19d	Refactor code to use new attribute getters on CallSite for NoCapture and ByVal. Suggested in code review by Eli. That code in InstCombine looks kinda suspicious. llvm-svn: 145013	2011-11-20 19:09:04 +00:00
Benjamin Kramer	b5ba2eef2d	SCEV: Actually set overflow flags on add expressions. setFlags doesn't modify its arguments. llvm-svn: 145007	2011-11-20 10:24:36 +00:00
Andrew Trick	6b4d578f54	Fix a corner case in updating LoopInfo after fully unrolling an outer loop. The loop tree's inclusive block lists are painful and expensive to update. (I have no idea why they're inclusive). The design was supposed to handle this case but the implementation missed it and my unit tests weren't thorough enough. Fixes PR11335: loop unroll update. llvm-svn: 144970	2011-11-18 03:42:41 +00:00
Andrew Trick	90c7a108ca	Fix SCEV overly optimistic back edge taken count for multi-exit loops. Fixes PR11375: Different results for 'clang++ huh.cpp'... llvm-svn: 144746	2011-11-16 00:52:40 +00:00
Benjamin Kramer	184e3ceea0	Missed some users of Value::getNameStr. llvm-svn: 144656	2011-11-15 18:30:06 +00:00
Benjamin Kramer	1f97a5a671	Remove all remaining uses of Value::getNameStr(). llvm-svn: 144648	2011-11-15 16:27:03 +00:00
Benjamin Kramer	4c93d15f09	Twinify GraphWriter a little bit. llvm-svn: 144647	2011-11-15 16:26:38 +00:00
Nick Lewycky	7013a19e8a	Refactor capture tracking (which already had a couple flags for whether returns and stores capture) to permit the caller to see each capture point and decide whether to continue looking. Use this inside memdep to do an analysis that basicaa won't do. This lets us solve another devirtualization case, fixing PR8908! llvm-svn: 144580	2011-11-14 22:49:42 +00:00
Nick Lewycky	d48ab84556	Don't try to loop on iterators that are potentially invalidated inside the loop. Fixes PR11361! llvm-svn: 144454	2011-11-12 03:09:12 +00:00
Nick Lewycky	47eebcfd66	Fix typo in comment. llvm-svn: 144236	2011-11-09 22:45:04 +00:00
Nick Lewycky	0485d51a76	Don't forget to check FlagNW when determining whether an AddRecExpr will wrap or not. Patch by Brendon Cahoon! llvm-svn: 144173	2011-11-09 07:11:37 +00:00
Eli Friedman	0bae8b2cfb	Fix code to match comment. Fixes PR11340, a regression from r143209. llvm-svn: 144121	2011-11-08 21:08:02 +00:00
Dan Gohman	85977e6ab4	Teach instsimplify to simplify calls to undef. llvm-svn: 143719	2011-11-04 18:32:42 +00:00
Daniel Dunbar	bf9bba47a1	build: Add initial cut at LLVMBuild.txt files. llvm-svn: 143634	2011-11-03 18:53:17 +00:00
Duncan Sands	3d5692a475	Reapply commit 143214 with a fix: m_ICmp doesn't match conditions with the given predicate, it matches any condition and returns the predicate - d'oh! Original commit message: The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143318	2011-10-30 19:56:36 +00:00
Eli Friedman	3af3c046a9	Revert r143214; it's breaking a bunch of stuff. llvm-svn: 143265	2011-10-29 00:56:07 +00:00
Duncan Sands	280bc553b3	The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143214	2011-10-28 19:01:20 +00:00
Duncan Sands	985ba6386d	A shift of a power of two is a power of two or zero. For completeness - not spotted in the wild. llvm-svn: 143211	2011-10-28 18:30:05 +00:00
Duncan Sands	92af0a8a7f	Fold icmp ugt (udiv X, Y), X to false. Spotted by my super-optimizer in 186.crafty. llvm-svn: 143209	2011-10-28 18:17:44 +00:00
Duncan Sands	7cb61e5a0e	Reapply commit 143028 with a fix: the problem was casting a ConstantExpr Mul using BinaryOperator (which only works for instructions) when it should have been a cast to OverflowingBinaryOperator (which also works for constants). While there, correct a few other dubious looking uses of BinaryOperator. Thanks to Chad Rosier for the testcase. Original commit message: My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y | 1). This occurs in 464.h264ref. llvm-svn: 143125	2011-10-27 19:16:21 +00:00
Bob Wilson	1455ce27e4	Revert Duncan's r143028 expression folding which appears to be the culprit behind a compile failure on 483.xalancbmk. llvm-svn: 143102	2011-10-27 15:47:25 +00:00
Duncan Sands	ba286d7c73	The maximum power of 2 dividing a power of 2 is itself. This occurs in 403.gcc and was spotted by my super-optimizer. llvm-svn: 143054	2011-10-26 20:55:21 +00:00
Duncan Sands	1d2bb9882d	My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y | 1). This occurs in 464.h264ref. llvm-svn: 143028	2011-10-26 15:31:51 +00:00
Duncan Sands	a370f3e34e	Restore commits 142790 and 142843 - they weren't breaking the build bots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode *next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode *n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142919	2011-10-25 12:28:52 +00:00
Chandler Carruth	32f46e7c07	Fix the API usage in loop probability heuristics. It was incorrectly classifying many edges as exiting which were in fact not. These mainly formed edges into sub-loops. It was also not correctly classifying all returning edges out of loops as leaving the loop. With this match most of the loop heuristics are more rational. Several serious regressions on loop-intensive benchmarks like perlbench's loop tests when built with -enable-block-placement are fixed by these updated heuristics. Unfortunately they in turn uncover some other regressions. There are still several improvements that should be made to loop heuristics including trip-count, and early back-edge management. llvm-svn: 142917	2011-10-25 09:47:41 +00:00
Duncan Sands	805c5b92c8	Speculatively revert commits 142790 and 142843 to see if it fixes the dragonegg and llvm-gcc self-host buildbots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode *next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode *n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142916	2011-10-25 09:26:43 +00:00
Nick Lewycky	a58fb48a55	Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142843	2011-10-24 21:02:38 +00:00
Chandler Carruth	7111f4564c	Remove return heuristics from the static branch probabilities, and introduce no-return or unreachable heuristics. The return heuristics from the Ball and Larus paper don't work well in practice as they pessimize early return paths. The only good hitrate return heuristics are those for: - NULL return - Constant return - negative integer return Only the last of these three can possibly require significant code for the returning block, and even the last is fairly rare and usually also a constant. As a consequence, even for the cold return paths, there is little code on that return path, and so little code density to be gained by sinking it. The places where sinking these blocks is valuable (inner loops) will already be weighted appropriately as the edge is a loop-exit branch. All of this aside, early returns are nearly as common as all three of these return categories, and should actually be predicted as taken! Rather than muddy the waters of the static predictions, just remain silent on returns and let the CFG itself dictate any layout or other issues. However, the return heuristic was flagging one very important case: unreachable. Unfortunately it still gave a 1/4 chance of the branch-to-unreachable occurring. It also didn't do a rigorous job of finding those blocks which post-dominate an unreachable block. This patch builds a more powerful analysis that should flag all branches to blocks known to then reach unreachable. It also has better worst-case runtime complexity by not looping through successors for each block. The previous code would perform an N^2 walk in the event of a single entry block branching to N successors with a switch where each successor falls through to the next and they finally fall through to a return. Test case added for noreturn heuristics. Also doxygen comments improved along the way. llvm-svn: 142793	2011-10-24 12:01:08 +00:00
Nick Lewycky	9be7f277e4	Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode *next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode *n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142790	2011-10-24 06:57:05 +00:00
Nick Lewycky	8e904dee82	PHI nodes not in the loop header aren't part of the loop iteration initial state. Furthermore, they might not have two operands. This fixes the underlying issue behind the crashes introduced in r142781. llvm-svn: 142788	2011-10-24 05:51:01 +00:00
Nick Lewycky	9d28c26d77	Speculatively revert r142781. Bots are showing Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed. coming out of indvars. llvm-svn: 142786	2011-10-24 04:00:25 +00:00
Chandler Carruth	7a0094a673	Simplify the design of BranchProbabilityInfo by collapsing it into a single class. Previously it was split between two classes, one internal and one external. The concern seemed to center around exposing the weights used, but those can remain confined to the implementation file. Having a single class to maintain the state and analyses in use will also simplify several of the enhancements I want to make to our static heuristics. llvm-svn: 142783	2011-10-24 01:40:45 +00:00
Nick Lewycky	1700007ecc	Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode *next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode *n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142781	2011-10-23 23:43:14 +00:00
Chandler Carruth	24cee10fb1	Tidy up a loop to be more idiomatic for LLVM's codebase, and remove some extraneous whitespace. Trying to clean-up this pass as much as I can before I start making functional changes. llvm-svn: 142780	2011-10-23 22:40:13 +00:00
Chandler Carruth	1c8ace0e89	Teach the BranchProbabilityInfo pass to print its results, and use that to bring it under direct test instead of merely indirectly testing it in the BlockFrequencyInfo pass. The next step is to start adding tests for the various heuristics employed, and to start fixing those heuristics once they're under test. llvm-svn: 142778	2011-10-23 21:21:50 +00:00
Benjamin Kramer	929f53f65c	Add compare operators to BranchProbability and use it to determine if an edge is hot. llvm-svn: 142751	2011-10-23 11:19:14 +00:00
Nick Lewycky	a6674c7fc9	Make SCEV's brute force analysis stronger in two ways. Firstly, we should be able to constant fold load instructions where the argument is a constant. Second, we should be able to watch multiple PHI nodes through the loop; this patch only supports PHIs in loop headers, more can be done here. With this patch, we now constant evaluate: static const int arr[] = {1, 2, 3, 4, 5}; int test() { int sum = 0; for (int i = 0; i < 5; ++i) sum += arr[i]; return sum; } llvm-svn: 142731	2011-10-22 19:58:20 +00:00
Benjamin Kramer	606a50a9f8	Extend the floating point heuristic to consider NaN checks unlikely. llvm-svn: 142687	2011-10-21 21:13:47 +00:00
Benjamin Kramer	1e731a10d0	BranchProbabilityInfo: floating point equality is unlikely. This is from the same paper from Ball and Larus as the rest of the currently implemented heuristics. llvm-svn: 142677	2011-10-21 20:12:47 +00:00
Eli Friedman	68db4c2699	A FIXME about block addresses and indirectbr. llvm-svn: 142569	2011-10-20 04:05:33 +00:00
Eli Friedman	f0bb0c2934	Simplify; no intended functional change. llvm-svn: 142567	2011-10-20 03:23:14 +00:00
Nick Lewycky	462098824f	"@string = constant i8 0" is a value i8* string of length zero. Analyze that correctly in GetStringLength, fixing PR11181! llvm-svn: 142558	2011-10-20 00:34:35 +00:00
Chandler Carruth	deac50cba9	Generalize the reading of probability metadata to work for both branches and switches, with arbitrary numbers of successors. Still optimized for the common case of 2 successors for a conditional branch. Add a test case for switch metadata showing up in the BlockFrequencyInfo pass. llvm-svn: 142493	2011-10-19 10:32:19 +00:00
Chandler Carruth	d27a7a947b	Teach the BranchProbabilityInfo analysis pass to read any metadata encoding of probabilities. In the absence of metadata, it continues to fall back on static heuristics. This allows __builtin_expect, after lowering through llvm.expect a branch instruction's metadata, to actually enter the branch probability model. This is one component of resolving PR2577. llvm-svn: 142492	2011-10-19 10:30:30 +00:00
Chandler Carruth	343fad44ea	Add pass printing support to BlockFrequencyInfo pass. The implementation layer already had support for printing the results of this analysis, but the wiring was missing. Now that printing the analysis works, actually bring some of this analysis, and the BranchProbabilityInfo analysis that it wraps, under test! I'm planning on fixing some bugs and doing other work here, so having a nice place to add regression tests and a way to observe the results is really useful. llvm-svn: 142491	2011-10-19 10:12:41 +00:00
Devang Patel	7973e78800	Update DebugInfoFinder to match recent debug info encoding changes. llvm-svn: 142295	2011-10-17 22:30:34 +00:00
Bill Wendling	63a4ea1859	Correct over-zealous removal of hack. Some code want to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	2a83a71c2a	Now that we have the ReturnsTwice function attribute, this method is obsolete. Check the attribute instead. <rdar://problem/8031714> llvm-svn: 142212	2011-10-17 18:22:52 +00:00
Chandler Carruth	91f4faf877	Delete a dead member. Dunno if this was ever used, but the current code directly manipulates the weights inside of the BranchProbabilityInfo that is passed in. llvm-svn: 142163	2011-10-16 22:27:54 +00:00
Andrew Trick	fd4ca0f4ac	Fix SCEVExpander assert during LSR: "argument of incompatible type". Just because we're dealing with a GEP doesn't mean we can assert the SCEV has a pointer type. The fix is simply to ignore the SCEV pointer type, which we really didn't need. Fixes PR11138 webkit crash. llvm-svn: 142058	2011-10-15 06:19:55 +00:00
Nick Lewycky	a447e0f38f	An instruction's operands aren't necessarily instructions or constants. They could be arguments, for example. No testcase because this is a bug-fix broken out of a larger optimization patch. llvm-svn: 141951	2011-10-14 09:38:46 +00:00
Eli Friedman	c1702c8f22	Enhance the memdep interface so that users can tell the difference between a dependency which cannot be calculated and a path reaching the entry point of the function. This patch introduces isNonFuncLocal, which replaces isUnknown in some cases. Patch by Xiaoyi Guo. llvm-svn: 141896	2011-10-13 22:14:57 +00:00
Andrew Trick	870c1a3f15	Reapply r141870, SCEV expansion of post-inc. Speculatively reapply to see if this test case still crashes on linux. I may have fixed it in my last checkin. llvm-svn: 141895	2011-10-13 21:55:29 +00:00
Andrew Trick	7e442569dc	Fix memory corruption I introduced a few checkins ago. Self-review easily caught this obvious bug. llvm-svn: 141880	2011-10-13 18:49:23 +00:00
Andrew Trick	41c253c35c	Revert r141870. The test case crashes on linux with data corruption. A deeper issue was exposed. llvm-svn: 141873	2011-10-13 17:58:24 +00:00
Andrew Trick	e15d6e14e3	LSR: Reuse the post-inc expansion of expressions. This avoids unnecessary expansion of expressions and allows the SCEV expander to work on expression DAGs, not just trees. Fixes PR11090. llvm-svn: 141870	2011-10-13 17:31:47 +00:00
Andrew Trick	1393ec29af	SCEV: Rewrite TransformForPostIncUse to handle expression DAGs, not just expression trees. Partially fixes PR11090. Test case will be with the full fix. llvm-svn: 141868	2011-10-13 17:21:09 +00:00
Andrew Trick	adfe72b33c	Slightly more useful tracing. llvm-svn: 141867	2011-10-13 17:06:38 +00:00
Eric Christopher	6647b83087	Add a new wrapper node for a DILexicalBlock that encapsulates it and a file. Since it should only be used when necessary propagate it through the backend code generation and tweak testcases accordingly. This helps with code like in clang's test/CodeGen/debug-info-line.c where we have multiple #line directives within a single lexical block and want to generate only a single block that contains each file change. Part of rdar://10246360 llvm-svn: 141729	2011-10-11 22:59:11 +00:00
Andrew Trick	f9201c572e	Move replaceCongruentIVs into SCEVExpander and bias toward "expanded" IVs. Indvars previously chose randomly between congruent IVs. Now it will bias the decision toward IVs that SCEVExpander likes to create. This was not done to fix any problem, it's just a welcome side effect of factoring code. llvm-svn: 141633	2011-10-11 02:28:51 +00:00
Andrew Trick	eef7308df6	Add an extra safety check in front of the optimization in r141442. llvm-svn: 141470	2011-10-08 02:16:39 +00:00
Andrew Trick	7fb669ab48	LSR should only reuse phis that match its formula. Fixes rdar://problem/5064068 llvm-svn: 141442	2011-10-07 23:46:21 +00:00
Eli Friedman	1456cd20b4	Remove the old atomic intrinsics. autoupgrade functionality is included with this patch. llvm-svn: 141333	2011-10-06 23:20:49 +00:00
Andrew Trick	3e8a576da1	Fixes PR11070 - assert in SCEV getConstantEvolvingPHIOperands. llvm-svn: 141219	2011-10-05 22:06:53 +00:00
Andrew Trick	ed39bb8efd	Typo. Thanks Bob. llvm-svn: 141188	2011-10-05 16:52:28 +00:00
Chandler Carruth	f6567a131d	Fix a broken assert found by -Wparentheses. llvm-svn: 141168	2011-10-05 07:02:23 +00:00
Andrew Trick	e9162f1ff8	Fix disabled SCEV analysis caused by r141161 and add unit test. I noticed during self-review that my previous checkin disabled some analysis. Even with the reenabled analysis the test case runs in about 5ms. Without the fix, it will take several minutes at least. llvm-svn: 141164	2011-10-05 05:58:49 +00:00
Andrew Trick	3a86ba767c	Avoid exponential recursion in SCEV getConstantEvolvingPHI and EvaluateExpression. Note to compiler writers: never recurse on multiple instruction operands without memoization. Fixes rdar://10187945. Was taking 45s, now taking 5ms. llvm-svn: 141161	2011-10-05 03:25:31 +00:00
Nick Lewycky	287682ead1	The product of two chrec's can always be represented as a chrec. llvm-svn: 141066	2011-10-04 06:51:26 +00:00
Nick Lewycky	3155552461	Reapply r140979 with fix! We never did get a testcase, but careful review of the logic by David Meyer revealed this bug. llvm-svn: 140992	2011-10-03 07:10:45 +00:00
Nick Lewycky	b1dbce1406	Revert r140979 due to reports of bootstrap failure. llvm-svn: 140980	2011-10-03 05:14:59 +00:00
Nick Lewycky	3c624b8d0d	Add one more case we compute a max trip count. llvm-svn: 140979	2011-10-03 01:03:57 +00:00
Andrew Trick	ef8e4efff8	Inlining and unrolling heuristics should be aware of free truncs. We want heuristics to be based on accurate data, but more importantly we don't want llvm to behave randomly. A benign trunc inserted by an upstream pass should not cause wild swings in optimization level. See PR11034. It's a general problem with threshold-based heuristics, but we can make it less bad. llvm-svn: 140919	2011-10-01 01:39:05 +00:00
Andrew Trick	caa500bf93	whitespace llvm-svn: 140916	2011-10-01 01:27:56 +00:00
Andrew Trick	ef8e4efff8	indvars: generalize SCEV getPreStartForSignExtend. Handle general Add expressions to avoid leaving around redundant 32-bit IVs. llvm-svn: 140701	2011-09-28 17:02:54 +00:00
Eli Friedman	5f476dc3ef	PR10628: Fix getModRefInfo so it queries the underlying alias() implementation correctly while checking nocapture calls. llvm-svn: 140666	2011-09-28 00:34:27 +00:00
Benjamin Kramer	547b6c5ecd	Stop emitting instructions with the name "tmp" they eat up memory and have to be uniqued, without any benefit. If someone prefers %tmp42 to %42, run instnamer. llvm-svn: 140634	2011-09-27 20:39:19 +00:00
Eli Friedman	5c91891cf3	Enhance alias analysis for atomic instructions a bit. Upgrade a couple alias-analysis tests to the new atomic instructions. llvm-svn: 140557	2011-09-26 20:15:28 +00:00
Galina Kistanova	ef65f002df	Fix for DbgInfoPrinter.cpp:174:12: warning: ‘LineNo’ may be used uninitialized in this function. llvm-svn: 140281	2011-09-21 23:34:23 +00:00
Devang Patel	04d6d47865	Add support to emit debug info for C++0x nullptr type. llvm-svn: 139751	2011-09-14 23:13:28 +00:00
Eric Christopher	777c928369	Fix typo. llvm-svn: 139530	2011-09-12 19:58:22 +00:00
Devang Patel	1ad1abe165	Add asserts to keep front-ends honest while encoding debug info into LLVM IR using DIBuilder. llvm-svn: 139515	2011-09-12 18:26:08 +00:00
Andrew Trick	a51d74fc35	Set NSW/NUW flags on SCEVAddExpr when the operation is flagged as such. I'm doing this now for completeness because I can't think of/remember any reason that it was left out. I'm not sure it will help anything, but if we don't do it we need to explain why in comments. llvm-svn: 139450	2011-09-10 01:09:50 +00:00
Eli Friedman	b78ac543c7	A couple minor corrections to r139276. llvm-svn: 139277	2011-09-08 02:37:07 +00:00
Eli Friedman	3d1b307672	Fix the logic in BasicAliasAnalysis::aliasGEP for comparing GEP's with variable differences so that it actually does something sane. Fixes PR10881. llvm-svn: 139276	2011-09-08 02:23:31 +00:00
Owen Anderson	f4f09f8c26	memset_pattern16 uses a 16 BYTE pattern, not a 16 BIT pattern. Add comments to that effect. llvm-svn: 139205	2011-09-06 23:43:26 +00:00
Owen Anderson	653cb03191	Teach BasicAA about the aliasing properties of memset_pattern16. Fixes PR10872 and <rdar://problem/10065079>. llvm-svn: 139204	2011-09-06 23:33:25 +00:00
Nick Lewycky	e0aa54bb98	This transform only handles two-operand AddRec's. Prevent it from trying to handle anything more complex. Fixes PR10383 again! llvm-svn: 139186	2011-09-06 21:42:18 +00:00
Devang Patel	5ea5d7965b	Now, named mdnode llvm.dbg.cu keeps track of all compile units in a module. Update DebugInfoFinder to collect compile units from llvm.dbg.cu. llvm-svn: 139147	2011-09-06 17:40:08 +00:00
Nick Lewycky	78664db054	Fix typo in comment again. llvm-svn: 139139	2011-09-06 07:02:40 +00:00
Nick Lewycky	237878b7ac	Apparently we compile the code, not the comments. Thanks Eli! llvm-svn: 139138	2011-09-06 06:56:00 +00:00
Nick Lewycky	0af94cc50b	Fix typo in comment. llvm-svn: 139137	2011-09-06 06:46:01 +00:00
Nick Lewycky	702cf1eccc	Nope! I had it right the first time. Revert the operative part of r139135 and add more showing of my work. llvm-svn: 139136	2011-09-06 06:39:54 +00:00
Nick Lewycky	6f86e001d6	Fix flipped sign. While there, show my math. llvm-svn: 139135	2011-09-06 05:33:18 +00:00
Nick Lewycky	db66b82dd5	No no no, fix typo properly! llvm-svn: 139134	2011-09-06 05:08:09 +00:00
Nick Lewycky	658bdb5133	The logic inside getMulExpr to simplify {a,+,b}*{c,+,d} was wrong, which was visible given a=b=c=d=1, on iteration #1 (the second iteration). Replace it with correct math. Fixes PR10383! llvm-svn: 139133	2011-09-06 05:05:14 +00:00
Nick Lewycky	b1438c763a	Revert r139126 due to selfhost failures reported by buildbots. llvm-svn: 139130	2011-09-06 02:43:13 +00:00
Nick Lewycky	c4c43fbb07	Teach SCEV to report a max backedge count in one interesting case in HowFarToZero; the case for a canonical loop. llvm-svn: 139126	2011-09-05 23:25:16 +00:00
Benjamin Kramer	4b79c21ef2	InstSimplify: Don't try to replace an extractvalue/insertvalue pair with the original value if types don't match. Fixes clang selfhost. llvm-svn: 139120	2011-09-05 18:16:19 +00:00
Duncan Sands	fd26a954a8	Add some simple insertvalue simplifications, for the purpose of cleaning up do-nothing exception handling code produced by dragonegg. llvm-svn: 139113	2011-09-05 06:52:48 +00:00
Benjamin Kramer	0ca1ad0783	Use canonical forms for the branch probability zero heuristic. - Drop support for X >u 0, it's equivalent to X != 0 and should be canonicalized into the latter. - Add X < 1 -> unlikely, which is what instcombine canonicalizes X <= 0 into. - Add X > -1 -> likely, which is what instcombine canonicalizes X >= 0 into. llvm-svn: 139110	2011-09-04 23:53:04 +00:00
Andrew Trick	bbb226a827	Comment and clarifying assert. llvm-svn: 139036	2011-09-02 21:20:46 +00:00
Devang Patel	df060bc3c2	After r138010, subroutine type does not have context info. Update type verifier accordingly. This fixes ptype.exp gdb testsuite regressions. llvm-svn: 138869	2011-08-31 18:04:31 +00:00
Nadav Rotem	5fc81ffbac	Fixes following the CR by Chris and Duncan: Optimize chained bitcasts of the form A->B->A. Undo r138722 and change isEliminableCastPair to allow this case. llvm-svn: 138756	2011-08-29 19:58:36 +00:00
Andrew Trick	0896621a50	Reapply r138695. Fix PassManager stack depths. Patch by Xiaoyi Guo! llvm-svn: 138737	2011-08-29 17:07:00 +00:00
Nadav Rotem	52600ee8c3	Bitcasts are transitive. Bitcast-Bitcast-X becomes Bitcast-X. llvm-svn: 138722	2011-08-28 11:51:08 +00:00
Andrew Trick	5c29ebae8e	Reverting r138695 to see if it fixes clang self host. llvm-svn: 138701	2011-08-27 06:10:16 +00:00
Andrew Trick	b0cd1e65de	Fix PassManager stack depths. Patch by Xiaoyi Guo! llvm-svn: 138695	2011-08-27 02:11:03 +00:00
Eric Christopher	3cc90fe5a5	Whitespace and 80-col. llvm-svn: 138654	2011-08-26 21:02:40 +00:00
Andrew Trick	147d9cde78	LoopInfo::updateUnloop fix, and verify Block->Loop maps. Fixes an oversight, and adds verification to catch it in the unloop.ll tests. llvm-svn: 138622	2011-08-26 03:06:34 +00:00
Bill Wendling	86c5cbe613	Skip the landingpad instruction when determining the insertion point. llvm-svn: 138481	2011-08-24 21:06:46 +00:00
Nadav Rotem	365af6f17b	Implement Constant::isAllOnesValue(). Fix ConstantFolding to use the new api. llvm-svn: 138469	2011-08-24 20:18:38 +00:00
Eric Christopher	7bc78f692c	Revert "Address Duncan's CR request:" This reverts commit 20a05be15ea5271ab6185b83200fa88263362400. (svn rev 138340) Conflicts: test/Transforms/InstCombine/bitcast.ll llvm-svn: 138366	2011-08-23 20:11:10 +00:00
Nadav Rotem	c78e6607b5	Address Duncan's CR request: 1. Cleanup the tests in ConstantFolding.cpp 2. Implement isAllOnes for Constant, ConstantFP, ConstantVector llvm-svn: 138340	2011-08-23 17:48:43 +00:00
Nadav Rotem	ad4a70ad3e	Add constant folding support for bitcasts of splat vectors to integers. llvm-svn: 138206	2011-08-20 14:02:29 +00:00
Devang Patel	59e27c5f12	Do not use named md nodes to track variables that are completely optimized. This does not scale while doing LTO with debug info. New approach is to include list of variables in the subprogram info directly. llvm-svn: 138145	2011-08-19 23:28:12 +00:00
Benjamin Kramer	4938edb02c	Make a bunch of symbols private. llvm-svn: 138025	2011-08-19 01:42:18 +00:00
Benjamin Kramer	5a656883b1	C API functions must be able to see their extern "C" definitions, or it will be impossible to call them from C. llvm-svn: 138022	2011-08-19 01:36:54 +00:00
Devang Patel	425b4dcc30	There is no need to add file as context for subroutine type. The subroutine type does not need any context. llvm-svn: 138010	2011-08-18 23:50:57 +00:00
Bill Wendling	a9ee09f4be	Revert r137655. There is some question about whether the 'landingpad' instruction should be marked as potentially reading and/or writing memory. llvm-svn: 137863	2011-08-17 20:36:44 +00:00
Eli Friedman	ad3cfe7933	Revert r137781; I agree with Duncan's comment that the situation in question is clearly impossible given the current structure of the code. llvm-svn: 137853	2011-08-17 19:31:49 +00:00
Eli Friedman	55919a9ed7	Extend the undef ^ undef idiom once more. No testcase: I can't figure out how to actually trigger the codepath in question at the moment, but it might get exposed in the future. llvm-svn: 137781	2011-08-16 22:38:34 +00:00
Devang Patel	eb1bb4e419	Until now all debug info MDNodes referred to a root MDNode, a compile unit. This simplified handling of these needs in dwarf writer. However, one side effect of this is that during link time optimization all these MDNodes are _not_ uniqued. In other words there will be N number of MDNodes describing "int", "char" and all other types, which would suddenly grow when each object file starts using libraries like STL. MDNodes graph structure such that compiler unit keeps track of important MDNodes and update dwarf writer to process mdnodes top-down instead of bottom up. llvm-svn: 137778	2011-08-16 22:09:43 +00:00
Bill Wendling	8ddfc09e7a	Use the getFirstInsertionPt() method instead of getFirstNonPHI + an 'isa<>' check for a LandingPadInst. llvm-svn: 137745	2011-08-16 20:45:24 +00:00
Bill Wendling	be33e8d58d	A few places where we want to skip the landingpad instruction for insertion. llvm-svn: 137712	2011-08-16 04:52:55 +00:00
Devang Patel	2b8acaf4f3	Add a finalize() hook, that'll let DIBuilder construct compile unit lazily. llvm-svn: 137673	2011-08-15 23:00:00 +00:00
Eli Friedman	4419cd2464	Add some comments here because the lack of a check for volatile/atomic here is a bit unusual. llvm-svn: 137662	2011-08-15 21:56:39 +00:00
Bill Wendling	e86965ee19	Duncan pointed out that the LandingPadInst might read memory. (It might also write to memory.) Marking it as such makes some checks for immobility go away. llvm-svn: 137655	2011-08-15 21:14:31 +00:00
Eli Friedman	5494adac67	Misc analysis passes that need to be aware of atomic load/store. llvm-svn: 137650	2011-08-15 20:54:19 +00:00
Eli Friedman	91386c7be4	Atomic load/store support in LICM. llvm-svn: 137648	2011-08-15 20:52:09 +00:00
Bill Wendling	9af5b22b76	The landingpad instruction isn't loop-invariant. llvm-svn: 137628	2011-08-15 18:22:49 +00:00
Devang Patel	dfd6ec3ce1	Refactor. Global variables are part of compile unit so let CompileUnit create new global variable. llvm-svn: 137621	2011-08-15 17:57:41 +00:00
Duncan Sands	a41634e307	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. llvm-svn: 137460	2011-08-12 14:54:45 +00:00
Andrew Trick	2b6860f0a1	Allow loop unrolling to get known trip counts from ScalarEvolution. SCEV unrolling can unroll loops with arbitrary induction variables. It is a prerequisite for -disable-iv-rewrite performance. It also easily handles loops of arbitrary structure including multiple exits and is generally more robust. This is under a temporary option to avoid affecting default behavior for the next couple of weeks. It is needed so that I can check in unit tests for updateUnloop. llvm-svn: 137384	2011-08-11 23:36:16 +00:00
Andrew Trick	c12c30a670	Fix for LoopInfo::updateUnloop. Remove subloop blocks from former ancestor loops. I have a unit test that depends on scev-unroll, which unfortunately isn't checked in. But I will check it in when I can. llvm-svn: 137341	2011-08-11 20:27:32 +00:00
Andrew Trick	266ab10012	Cleanup. Another thorough review by Nick! llvm-svn: 137317	2011-08-11 17:54:58 +00:00
Andrew Trick	d3530b9117	Reapplying r136844. An algorithm for incrementally updating LoopInfo within a LoopPassManager. The incremental update should be extremely cheap in most cases and can be used in places where it's not feasible to regenerate the entire loop forest. - "Unloop" is a node in the loop tree whose last backedge has been removed. - Perform reverse dataflow on the block inside Unloop to propagate the nearest loop from the block's successors. - For reducible CFG, each block in unloop is visited exactly once. This is because unloop no longer has a backedge and blocks within subloops don't change parents. - Immediate subloops are summarized by the nearest loop reachable from their exits or exits within nested subloops. - At completion the unloop blocks each have a new parent loop, and each immediate subloop has a new parent. llvm-svn: 137276	2011-08-10 23:22:57 +00:00
Devang Patel	bb23a4a9a5	Distinguish between two copies of one inlined variable. Take 2. llvm-svn: 137253	2011-08-10 21:50:54 +00:00
Andrew Trick	78b40c3f3a	Cleanup. Added LoopBlocksDFS::perform for simple clients. llvm-svn: 137195	2011-08-10 01:59:05 +00:00
Devang Patel	3d6e38942d	Provide method to print variable's extended name which includes inline location. llvm-svn: 137095	2011-08-09 01:03:14 +00:00
Andrew Trick	6d45a01b67	Made SCEV's UDiv expressions more canonical. When dividing a recurrence, the initial value's low bits can sometimes be ignored. To take advantage of this, added FoldIVUser to IndVarSimplify to fold an IV operand into a udiv/lshr if the operator doesn't affect the result. -indvars -disable-iv-rewrite now transforms i = phi i4 i1 = i0 + 1 idx = i1 >> (2 or more) i4 = i + 4 into i = phi i4 idx = i0 >> ... i4 = i + 4 llvm-svn: 137013	2011-08-06 07:00:37 +00:00
Chandler Carruth	81b7e11c89	Temporarily revert r135528 which distinguishes between two copies of one inlined variable, based on the discussion in PR10542. This explodes the runtime of several passes down the pipeline due to a large number of "copies" remaining live across a large function. This only shows up with both debug and opt, but when it does it creates a many-minute compile when self-hosting LLVM+Clang. There are several other cases that show these types of regressions. All of this is tracked in PR10542, and progress is being made on fixing the issue. Once it's addressed, this can be re-instated, but until then this restores the performance for self-hosting and other opt+debug builds. Devang, let me know if this causes any trouble, or impedes fixing it in any way, and thanks for working on this! llvm-svn: 136953	2011-08-05 00:51:31 +00:00
Duncan Sands	020c1947b7	Fix what seems an obvious typo. Patch by Ivan Krasin. Problem reported at http://habrahabr.ru/blogs/compilers/125626/. llvm-svn: 136865	2011-08-04 10:02:21 +00:00
Andrew Trick	bc673fb5f2	Reverting r136844 updateUnloop, which crashed a linux builder. llvm-svn: 136857	2011-08-04 01:04:37 +00:00
Andrew Trick	468eadbbb2	An algorithm for incrementally updating LoopInfo within a LoopPassManager. The incremental update should be extremely cheap in most cases and can be used in places where it's not feasible to regenerate the entire loop forest. - "Unloop" is a node in the loop tree whose last backedge has been removed. - Perform reverse dataflow on the block inside Unloop to propagate the nearest loop from the block's successors. - For reducible CFG, each block in unloop is visited exactly once. This is because unloop no longer has a backedge and blocks within subloops don't change parents. - Immediate subloops are summarized by the nearest loop reachable from their exits or exits within nested subloops. - At completion the unloop blocks each have a new parent loop, and each immediate subloop has a new parent. llvm-svn: 136844	2011-08-03 23:50:25 +00:00
Andrew Trick	f898cbde5e	whitespace llvm-svn: 136843	2011-08-03 23:45:50 +00:00
Jakub Staszak	a60d130f26	Add more constantness in BlockFrequencyInfo. llvm-svn: 136816	2011-08-03 21:30:57 +00:00
Bill Wendling	035ea32870	Add this back in for now. There are still a few passes which create unwind instructions at the moment. llvm-svn: 136756	2011-08-03 01:07:57 +00:00
Bill Wendling	ae3380faff	Replace the 'UnwindInst' check with a check for 'ResumeInst', which also exits the function, because the UnwindInst is going away. llvm-svn: 136751	2011-08-03 00:30:19 +00:00
Andrew Trick	77c55428fa	Use consistent terminology for loop exit/exiting blocks. Name change only. llvm-svn: 136677	2011-08-02 04:23:35 +00:00
Jakub Staszak	8b13b59f60	Change SmallVector to SmallPtrSet in BranchProbabilityInfo. Handle cases where more than one successor goes to the same block. llvm-svn: 136638	2011-08-01 19:16:26 +00:00
Jakub Staszak	6651b33671	Do not handle cases with >= and <= predicates. llvm-svn: 136588	2011-07-31 05:54:04 +00:00
Jakub Staszak	e348afb612	Remove untrue comment. llvm-svn: 136587	2011-07-31 04:51:14 +00:00
Jakub Staszak	bfb1ae223b	Do not handle case where LHS is equal to zero, because InstCombiner always moves it to RHS anyway. llvm-svn: 136586	2011-07-31 04:47:20 +00:00
Jakub Staszak	17af66a62f	Add Zero Heuristics to BranchProbabilityInfo. If we compare value to zero we decide whether condition is likely to be true this way: x == 0 -> false x < 0 -> false x <= 0 -> false x != 0 -> true x > 0 -> true x >= 0 -> true llvm-svn: 136583	2011-07-31 03:27:24 +00:00
Jakub Staszak	efd94c8fea	Add more constantness in BranchProbabilityInfo. llvm-svn: 136502	2011-07-29 19:30:00 +00:00
Jakub Staszak	0978426843	Remove incEdgeWeight and decEdgeWeight. Set edge weight directly to avoid rounding errors. llvm-svn: 136456	2011-07-29 02:36:53 +00:00
Chandler Carruth	9d7feab3e0	Rewrite the CMake build to use explicit dependencies between libraries, specified in the same file that the library itself is created. This is more idiomatic for CMake builds, and also allows us to correctly specify dependencies that are missed due to bugs in the GenLibDeps perl script, or change from compiler to compiler. On Linux, this returns CMake to a place where it can reliably rebuild several targets of LLVM. I have tried not to change the dependencies from the ones in the current auto-generated file. The only places I've really diverged are in places where I was seeing link failures, and added a dependency. The goal of this patch is not to start changing the dependencies, merely to move them into the correct location, and an explicit form that we can control and change when necessary. This also removes a serialization point in the build because we don't have to scan all the libraries before we begin building various tools. We no longer have a step of the build that regenerates a file inside the source tree. A few other associated cleanups fall out of this. This isn't really finished yet though. After talking to dgregor he urged switching to a single CMake macro to construct libraries with both sources and dependencies in the arguments. Migrating from the two macros to that style will be a follow-up patch. Also, llvm-config is still generated with GenLibDeps.pl, which means it still has slightly buggy dependencies. The internal CMake 'llvm-config-like' macro uses the correct explicitly specified dependencies however. A future patch will switch llvm-config generation (when using CMake) to be based on these deps as well. This may well break Windows. I'm getting a machine set up now to dig into any failures there. If anyone can chime in with problems they see or ideas of how to solve them for Windows, much appreciated. llvm-svn: 136433	2011-07-29 00:14:25 +00:00
Jakub Staszak	eec01ccbf9	Change LBH_TAKEN_WEIGHT to 124 (from 128). Right now, sum of LBH_TAKEN_WEIGHT + LBH_NONTAKEN_WEIGHT = 128 which in _most_ cases reduces the number of rounding errors. llvm-svn: 136428	2011-07-28 23:42:08 +00:00
Jakub Staszak	d07b2e159a	Heuristics are in descending priority now. If we use one of them, skip the rest. llvm-svn: 136402	2011-07-28 21:45:07 +00:00
Jakub Staszak	bcb3c65bb4	Add InEdges (edges from header to the loop) in Loop Branch Heuristics, so there is no frequency difference whether condition is in the header or in the latch. llvm-svn: 136398	2011-07-28 21:33:46 +00:00
Jakub Staszak	da3df4302a	Use BlockFrequency instead of uint32_t in BlockFrequencyInfo. llvm-svn: 136278	2011-07-27 22:05:51 +00:00
Jeffrey Yasskin	6381c0100b	Explicitly cast narrowing conversions inside {}s that will become errors in C++0x. llvm-svn: 136211	2011-07-27 06:22:51 +00:00
Eli Friedman	8b5277c6cf	Minor simplification. llvm-svn: 136202	2011-07-27 01:02:25 +00:00
Eli Friedman	ae8161e774	Fix AliasSetTracker so that it doesn't make any assumptions about instructions it doesn't know about (like the atomic instructions I'm adding). llvm-svn: 136198	2011-07-27 00:46:46 +00:00
Andrew Trick	3ca3f98c2c	SCEV: Added a data structure for storing not-taken info per loop exit. Added interfaces for querying either the loop's exact/max backedge taken count or a specific loop exit's not-taken count. llvm-svn: 136100	2011-07-26 17:19:55 +00:00
Duncan Sands	c1c92719a4	Add helper function for getting true/false constants in a uniform way for i1 and vector of i1 types. Use these to make some code more self-documenting. llvm-svn: 136079	2011-07-26 15:03:53 +00:00
Jakub Staszak	875ebd5f5d	Rename BlockFrequency to BlockFrequencyInfo and MachineBlockFrequency to MachineBlockFrequencyInfo. llvm-svn: 135937	2011-07-25 19:25:40 +00:00
Frits van Bommel	ede0dc6dda	Shorten some expressions by using ArrayRef::slice(). llvm-svn: 135910	2011-07-25 15:13:01 +00:00
Jay Foad	d1b7849d49	Convert GetElementPtrInst to use ArrayRef. llvm-svn: 135904	2011-07-25 09:48:08 +00:00
Jay Foad	040dd82f44	Convert IRBuilder::CreateGEP and IRBuilder::CreateInBoundsGEP to use ArrayRef. llvm-svn: 135761	2011-07-22 08:16:57 +00:00
Jakub Staszak	b82bbf40bb	Allow getBlockFreq to return 0. llvm-svn: 135742	2011-07-22 02:24:57 +00:00
Jay Foad	ed8db7d9df	Convert ConstantExpr::getGetElementPtr and ConstantExpr::getInBoundsGetElementPtr to use ArrayRef. llvm-svn: 135673	2011-07-21 14:31:17 +00:00
Devang Patel	8fb9fd6769	There are two ways to map a variable to its lexical scope. Lexical scope information is embedded in MDNode describing the variable. It is also available as a part of DebugLoc attached with DBG_VALUE instruction. DebugLoc attached with an instruction is less reliable in optimized code so use information embedded in the MDNode. llvm-svn: 135629	2011-07-20 22:18:50 +00:00
Devang Patel	a59b24b090	Distinguish between two copies of one inlined variable. llvm-svn: 135528	2011-07-19 22:31:15 +00:00
Devang Patel	cfa82a378d	Reapply r135457. This needs llvm-gcc change, that I forgot to check-in yesterday. llvm-svn: 135504	2011-07-19 19:41:54 +00:00
Bob Wilson	da30cf84c3	Revert "Make a provision to encode inline location in a variable. This will enable dwarf writer to easily distinguish between two instances of an inlined variable in one basic block." This reverts commit 9fec5e346efdf744b151ae6604f912908315fa7a. llvm-svn: 135486	2011-07-19 16:32:50 +00:00
Jay Foad	b992a635fb	Convert SimplifyGEPInst to use ArrayRef. llvm-svn: 135482	2011-07-19 15:07:52 +00:00
Jay Foad	bf904773bb	Convert TargetData::getIndexedOffset to use ArrayRef. llvm-svn: 135478	2011-07-19 14:01:37 +00:00
Jay Foad	f4b14a2b0d	Use ArrayRef in ConstantFoldInstOperands and ConstantFoldCall. llvm-svn: 135477	2011-07-19 13:32:40 +00:00
Devang Patel	ac532dedf1	Make a provision to encode inline location in a variable. This will enable dwarf writer to easily distinguish between two instances of an inlined variable in one basic block. llvm-svn: 135457	2011-07-19 01:03:32 +00:00
Frits van Bommel	717d7edd3e	Migrate LLVM and Clang to use the new makeArrayRef(...) functions where previously explicit non-default constructors were used. Mostly mechanical with some manual reformatting. llvm-svn: 135390	2011-07-18 12:00:32 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Benjamin Kramer	a7606b993c	Silence compiler warnings. llvm-svn: 135358	2011-07-16 22:26:27 +00:00
Jakub Staszak	623e1971ce	Remove "LoopInfo.h" include from BranchProbabilityInfo.h. llvm-svn: 135353	2011-07-16 20:31:15 +00:00
Andrew Trick	244e2c3e82	Fix SCEVExpander to handle arbitrary phi expansion. Includes two related bug fixes and corresponding assertions for uninitialized data and missing NULL check. Test cases will be included with the new LFTR. llvm-svn: 135333	2011-07-16 00:59:39 +00:00
Jakub Staszak	abb236fe9b	Fix pointer heuristic. Check whether predicator is ICMP_NE instead of if it is not isEquality(). llvm-svn: 135296	2011-07-15 20:51:06 +00:00
Jay Foad	5bd375a6cc	Convert CallInst and InvokeInst APIs to use ArrayRef. llvm-svn: 135265	2011-07-15 08:37:34 +00:00
Jay Foad	57aa636794	Convert InsertValueInst and ExtractValueInst APIs to use ArrayRef. llvm-svn: 135040	2011-07-13 10:26:04 +00:00
Chris Lattner	13879a7091	stop using WriteTypeSymbolic. llvm-svn: 134833	2011-07-09 18:02:13 +00:00
Devang Patel	c3239d3965	Preserve debug loc. llvm-svn: 134441	2011-07-05 21:48:22 +00:00
Dan Gohman	a293f24a0d	Teach IVUsers to stop at non-affine expressions unless they are both outside the loop and reducible. This more completely hides them from LSR, which isn't usually able to do anything meaningful with non-affine expressions anyway, and this consequently hides them from SCEVExpander, which is acutely unprepared for non-affine expressions. Replace test/CodeGen/X86/lsr-nonaffine.ll with a new test that tests the new behavior. This works around the bug in PR10117 / rdar://problem/9633149, and is generally an improvement besides. llvm-svn: 134268	2011-07-01 22:05:19 +00:00
Dan Gohman	54664ed714	Improve constant folding of undef for cmp and select operators. llvm-svn: 134223	2011-07-01 01:03:43 +00:00
Andrew Trick	154d78a661	Cleanup. Fix a stupid variable name. llvm-svn: 133995	2011-06-28 05:41:52 +00:00
Andrew Trick	411daa5e81	SCEVExpander: give new insts a name that identifies the responsible pass. llvm-svn: 133992	2011-06-28 05:07:32 +00:00
Andrew Trick	56b315a9cf	indvars --disable-iv-rewrite: sever ties with IVUsers. llvm-svn: 133988	2011-06-28 03:01:46 +00:00
Nick Lewycky	3e334a42d7	Move onlyUsedByLifetimeMarkers to ValueTracking so that it can be used by other passes as well. llvm-svn: 133904	2011-06-27 04:20:45 +00:00
Devang Patel	503c3998f3	Fix struct member's scope. Patch by Xi Wang. llvm-svn: 133828	2011-06-24 22:00:39 +00:00
Jakub Staszak	1aae619933	Calculate backedge probability correctly. llvm-svn: 133776	2011-06-23 23:52:11 +00:00
Jakub Staszak	668c6fae76	Missing files for the BlockFrequency analysis added. llvm-svn: 133767	2011-06-23 21:56:59 +00:00
Jakub Staszak	be52acc98a	Introduce BlockFrequency analysis for BasicBlocks. llvm-svn: 133766	2011-06-23 21:45:20 +00:00
Rafael Espindola	e2456536b5	Revert "revert 133714" This reverts commit e8e00f5efb4a22238f2407bf813de4606f30c5aa. The cmake build on OS X is still broken. llvm-svn: 133718	2011-06-23 14:19:39 +00:00
Dylan Noblesmith	8a4f22d017	revert 133714 It broke the build worse. llvm-svn: 133716	2011-06-23 13:56:01 +00:00
Rafael Espindola	250360d4bd	133713 broke the build, revert it. llvm-svn: 133714	2011-06-23 13:37:38 +00:00
Dylan Noblesmith	3595357772	Support: make floating-exception header private It has only one user. This eliminates the last include of config.h from the public headers -- ideally, config.h shouldn't even be installed by `make install` anymore. llvm-svn: 133713	2011-06-23 12:45:54 +00:00
Devang Patel	ccf8dbf885	New binops need debug loc. llvm-svn: 133642	2011-06-22 20:56:56 +00:00
Andrew Trick	fc4ccb20c6	IVUsers no longer needs to record the phis. llvm-svn: 133518	2011-06-21 15:43:52 +00:00
Chris Lattner	cc19efaa97	Revamp the "ConstantStruct::get" methods. Previously, these were scattered all over the place in different styles and variants. Standardize on two preferred entrypoints: one that takes a StructType and ArrayRef, and one that takes StructType and varargs. In cases where there isn't a struct type convenient, we now add a ConstantStruct::getAnon method (whose name will make more sense after a few more patches land). It would be "really really nice" if the ConstantStruct::get and ConstantVector::get methods didn't make temporary std::vectors. llvm-svn: 133412	2011-06-20 04:01:31 +00:00
Chris Lattner	67733f6557	simplify some code. llvm-svn: 133362	2011-06-18 21:46:23 +00:00
Benjamin Kramer	9319e9c5d8	Simplify code. No functionality change. llvm-svn: 133351	2011-06-18 14:42:42 +00:00
Jakub Staszak	12a43bdde5	Introduce MachineBranchProbabilityInfo class, which has similar API to BranchProbabilityInfo (except setEdgeWeight which is not available here). Branch Weights are kept in MachineBasicBlocks. To turn off this analysis set -use-mbpi=false. llvm-svn: 133184	2011-06-16 20:22:37 +00:00
Eli Friedman	8b098b0d57	Add a limit to the number of instructions memdep will scan in a single block. This prevents (at least in some cases) O(N^2) runtime in passes like DSE. The limit in this patch is probably too high, but it is enough to stop DSE from going completely insane on a testcase I have (which has a single block with around 50,000 non-aliasing stores in it). rdar://9471075 llvm-svn: 133111	2011-06-15 23:59:25 +00:00
Eli Friedman	7d58bc7bc0	Add "unknown" results for memdep, which mean "I don't know whether a dependence for the given instruction exists in the given block". This cleans up all the existing hacks in memdep which represent this concept by returning clobber with various unrelated instructions. llvm-svn: 133031	2011-06-15 00:47:34 +00:00
Benjamin Kramer	558d09d87e	Move class into an anonymous namespace. llvm-svn: 132925	2011-06-13 18:38:56 +00:00
Andrew Trick	3d4e64b082	Branch profiling: floating-point avoidance. Patch by: Jakub Staszak! Introduces BranchProbability. Changes unsigned to uint32_t all over and uint64_t only when overflow is expected. llvm-svn: 132867	2011-06-11 01:05:22 +00:00
Dan Gohman	cc59548793	Initialize BasicAA's AliasCache to set it to use fewer buckets by default, since it usually has very few elements. This speeds up alias queries in many cases, because AliasCache.clear() doesn't have to visit as many buckets. llvm-svn: 132862	2011-06-10 22:30:30 +00:00
John McCall	729c35b680	Teach the CallGraph to ignore calls to intrinsics. llvm-svn: 132797	2011-06-09 19:46:27 +00:00
Dan Gohman	adf80ae9e4	Reapply r131781, now that the GVN bug with partially-aliasing loads is disabled. llvm-svn: 132632	2011-06-04 06:50:18 +00:00
Dan Gohman	a471751c24	Disable the main feature of 130180, the elimination of loads that are redundant with partially-aliasing loads. When computing what portion of a clobbering load value is needed, it doesn't consider phi-translation which may have occurred between the clobbering load and the redundant load. llvm-svn: 132631	2011-06-04 06:48:50 +00:00
Dan Gohman	87fdceaf73	Revert r131781 again. Apparently there is more going on here. llvm-svn: 132625	2011-06-04 05:11:22 +00:00
Nick Lewycky	75b2053863	Fold assert-only-used variable into the assert. llvm-svn: 132620	2011-06-04 02:07:10 +00:00
Andrew Trick	c73aa1ee81	Missing include of climits in the new BranchProbability pass. llvm-svn: 132616	2011-06-04 01:30:52 +00:00
Andrew Trick	49371f3f33	New BranchProbabilityInfo analysis. Patch by Jakub Staszak! BranchProbabilityInfo provides an interface for IR passes to query the likelihood that control follows a CFG edge. This patch provides an initial implementation of static branch prediction that will populate BranchProbabilityInfo for branches with no external profile information using very simple heuristics. It currently isn't hooked up to any external profile data, so static prediction does all the work. llvm-svn: 132613	2011-06-04 01:16:30 +00:00
Dan Gohman	27b82f2f91	Reapply r131781 (revert r131809), now that some BasicAA shortcomings it exposed are fixed. llvm-svn: 132611	2011-06-04 00:46:31 +00:00
Dan Gohman	fb02cec44e	Fix BasicAA's recursion detection so that it doesn't pessimize queries in the case of a DAG, where a query reaches a node visited earlier, but it's not on a cycle. This avoids MayAlias results in cases where BasicAA is expected to return MustAlias or PartialAlias in order to protect TBAA. llvm-svn: 132609	2011-06-04 00:31:50 +00:00
Dan Gohman	4e7e7958d7	When merging MustAlias and PartialAlias, choose PartialAlias instead of conservatively choosing MayAlias. llvm-svn: 132579	2011-06-03 20:17:36 +00:00
Hans Wennborg	060b994a29	Test commit. llvm-svn: 132558	2011-06-03 17:15:37 +00:00
Devang Patel	1d40024322	A typedef's context is not the same as type's context. It is the context of typedef decl itself. Use extra parameter to communicate this to DIBuilder. llvm-svn: 132556	2011-06-03 17:04:51 +00:00
Eli Friedman	b576b1675c	When marking a block as being unanalyzable, use "Clobber" on the terminator instead of the first instruction in the block. This is a bit of a hack; "Clobber" isn't really the right marking in the first place. memdep doesn't really have any way of properly expressing "unanalyzable" at the moment. Using it on the terminator is much less ambiguous than using it on an arbitrary instruction, though. In the given testcase, the "Clobber" was pointing to a load, and GVN was incorrectly assuming that meant that the "Clobber" load overlapped the load being analyzed (when they are actually unrelated). The included testcase tests both this commit and r132434. Part two of rdar://9429882. (r132434 was mislabeled.) llvm-svn: 132442	2011-06-02 00:08:52 +00:00
Eli Friedman	4b6eeb9ca2	In MemoryDependenceAnalysis::getNonLocalPointerDepFromBB, if a given block is deemed unanalyzable (and we execute one of the "goto PredTranslationFailure" statements), make sure we don't put information about the predecessors of that block into the returned data structures; this can lead to, among other things, extraneous results (which will confuse passes using memdep). Fixes an assert in GVN compiling ruby. Part of rdar://problem/9521954 . Testcase coming up soon. llvm-svn: 132434	2011-06-01 23:16:53 +00:00
Andrew Trick	8ef3ad049d	SCEV: missing null check fix for r132360, dragonegg crash. llvm-svn: 132416	2011-06-01 19:14:56 +00:00
Andrew Trick	812276eed4	scev: Better sign-extend removal. Normalize postincrement recurrences so that their sign extended forms are congruent when no overflow occurs. llvm-svn: 132360	2011-05-31 21:17:47 +00:00
Eli Friedman	7a5fc693f9	llvm.memcpy.* has two distinct associated address spaces; the source address space, and the destination address space. Fix up the interface on MemIntrinsic and MemTransferInst to make this clear, and fix InstructionDereferencesPointer in LazyValueInfo.cpp to use the interface properly. llvm-svn: 132356	2011-05-31 20:40:16 +00:00
Dan Gohman	c6f2ddfc04	Update this comment. llvm-svn: 132202	2011-05-27 18:42:33 +00:00
Chad Rosier	b362884ca9	Renamed llvm.x86.sse42.crc32 intrinsics; crc64 doesn't exist. crc32.[8|16|32] have been renamed to .crc32.32.[8|16|32] and crc64.[8|16|32] have been renamed to .crc32.64.[8|64]. llvm-svn: 132163	2011-05-26 23:13:19 +00:00
Eli Friedman	bacb17906a	Change condition for determining whether a function is small for inlining metrics so that very long functions with few basic blocks are not re-analyzed. llvm-svn: 131994	2011-05-24 20:22:24 +00:00
Dan Gohman	0573b55c2b	Make DecomposeGEPExpression check SimplifyInstruction only after checking for a GEP, so that it matches what GetUnderlyingObject does. This fixes an obscure bug turned up by bugpoint in the testcase for PR9931. llvm-svn: 131971	2011-05-24 18:24:08 +00:00
Chris Lattner	026f5e61f0	fix a really nasty basicaa mod/ref calculation bug that was causing miscompilation of UnitTests/ObjC/messages-2.m with the recent optimizer improvements. llvm-svn: 131897	2011-05-23 05:15:43 +00:00
Chris Lattner	83791ced7b	Teach valuetracking that byval arguments with a specified alignment are aligned. Teach memcpyopt to not give up all hope when confronted with an underaligned memcpy feeding an overaligned byval. If the source of the memcpy can be determined to be adequately aligned, or if it can be forced to be, we can eliminate the memcpy. This addresses PR9794. We now compile the example into: define i32 @f(%struct.p* nocapture byval align 8 %q) nounwind ssp { entry: %call = call i32 @g(%struct.p* byval align 8 %q) nounwind ret i32 %call } in both x86-64 and x86-32 mode. We still don't get a tailcall though, because tailcalls apparently can't handle byval. llvm-svn: 131884	2011-05-23 00:03:39 +00:00
Chris Lattner	713d52364f	implement PR9315, constant folding exp2 in terms of pow (since hosts without C99 runtimes don't have exp2). llvm-svn: 131872	2011-05-22 22:22:35 +00:00
Evan Cheng	2a746bfe36	Teach ValueTracking about x86 crc32 intrinsics. llvm-svn: 131861	2011-05-22 18:25:30 +00:00
Duncan Sands	5ec65765e6	Revert commit 131781, to see if it fixes the x86-64 dragonegg buildbot. Original log message: When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131809	2011-05-21 20:54:46 +00:00
Dan Gohman	8b20187c82	When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131781	2011-05-21 01:05:08 +00:00
Andrew Trick	f44aadf0fd	indvars: Prototyping Sign/ZeroExtend elimination without canonical IVs. No functionality enabled by default. Use -disable-iv-rewrite. Extended IVUsers to keep track of the phi that represents the users' IV. Added the WidenIV transform to replace a narrow IV with a wide IV by doing a one-for-one replacement of IV users instead of expanding the SCEV expressions. [sz]exts are removed and truncs are inserted. llvm-svn: 131744	2011-05-20 18:25:42 +00:00
Owen Anderson	97f0cf32ea	@llvm.lifetime.begin acts as a load, not @llvm.lifetime.end. llvm-svn: 131437	2011-05-17 00:05:49 +00:00
Rafael Espindola	71f8b08a80	Extra refactoring noticed by Eli Friedman. llvm-svn: 131405	2011-05-16 15:48:45 +00:00
Julien Lerouge	7e11f9e26d	Fix a source of non determinism in FindUsedTypes, use a SetVector instead of a set. rdar://9423996 llvm-svn: 131283	2011-05-13 05:20:42 +00:00
Dan Gohman	0daf687e1d	Change a few std::maps to DenseMaps. llvm-svn: 131088	2011-05-09 18:44:09 +00:00
Duncan Sands	af32728a57	The comparison "max(x,y)==x" is equivalent to "x>=y". Since the max is often expressed as "x >= y ? x : y", there is a good chance we can extract the existing "x >= y" from it and use that as a replacement for "max(x,y)==x". llvm-svn: 131049	2011-05-07 16:56:49 +00:00
Eli Friedman	8a20e66926	PR9838: Fix transform introduced in r127064 to not trigger when only one side of the icmp is an exact shift. llvm-svn: 130954	2011-05-05 21:59:18 +00:00
Hongbin Zheng	cd5afc5feb	Minor change: Fix the typo in RegionPass.h and RegionPass.cpp. llvm-svn: 130920	2011-05-05 13:59:38 +00:00
Duncan Sands	a228785526	Add variations on: max(x,y) >= min(x,z) folds to true. This isn't that common, but according to my super-optimizer there are only two missed simplifications of -instsimplify kind when compiling bzip2, and this is one of them. It amuses me to have bzip2 be perfectly optimized as far as instsimplify goes! llvm-svn: 130840	2011-05-04 16:05:05 +00:00
Andrew Trick	1abe296cfd	indvars: Added DisableIVRewrite and WidenIVs. This adds functionality to remove sign/zero extension during indvars without generating a canonical IV and rewriting all IV users. It's disabled by default so should have no effect on codegen. Work in progress. llvm-svn: 130829	2011-05-04 02:10:13 +00:00
Duncan Sands	0a9c1246d7	Implement some basic simplifications involving min/max, for example max(a,b) >= a -> true. According to my super-optimizer, these are by far the most common simplifications (of the -instsimplify kind) that occur in the testsuite and aren't caught by -std-compile-opts. llvm-svn: 130780	2011-05-03 19:53:10 +00:00
Devang Patel	09fa69e151	Use llvm.dbg.cu named metadata to collect compile units. llvm-svn: 130756	2011-05-03 16:18:28 +00:00
Duncan Sands	f91c5ab341	Fix PR9579: when simplifying a compare to "true" or "false", and it was a vector compare, generate a vector result rather than i1 (and crashing). llvm-svn: 130706	2011-05-02 18:51:41 +00:00
Duncan Sands	a3e3699c88	Move some rem transforms out of instcombine and into instsimplify. This automagically provides a transform noticed by my super-optimizer as occurring quite often: "rem x, (select cond, x, 1)" -> 0. llvm-svn: 130694	2011-05-02 16:27:02 +00:00
Chris Lattner	827a270a2a	teach GVN to widen integer loads when they are overaligned, when doing a wider load would allow elimination of subsequent loads, and when the wider load is still a native integer type. This eliminates a ton of loads on various benchmarks involving struct fields, though it is somewhat hobbled by clang not being very aggressive about field alignment. This is yet another step along the way towards resolving PR6627. llvm-svn: 130390	2011-04-28 07:29:08 +00:00
Dan Gohman	5394c70d1e	Teach BasicAA about arm.neon.vld1 and vst1. llvm-svn: 130327	2011-04-27 20:44:28 +00:00
Dan Gohman	39b3a1ef7f	When analyzing functions known to only access argument pointees, only check arguments with pointer types. Update the documentation of IntrReadArgMem to reflect this. While here, add support for TBAA tags on intrinsic calls. llvm-svn: 130317	2011-04-27 18:39:03 +00:00
Andrew Trick	7d1eea86d9	Corrects an old, old typo in a case that doesn't seem to be reached in practice. llvm-svn: 130316	2011-04-27 18:17:36 +00:00
Andrew Trick	01eff820ae	Test case and comment for PR9633. llvm-svn: 130294	2011-04-27 05:42:17 +00:00
Andrew Trick	759ba0802d	Fix for PR9633 [indvars] Assertion `isa<X>(Val) && "cast<Ty>() argument of incompatible type!"' failed. Added a type check in ScalarEvolution::computeSCEVAtScope to handle the case in which operands of an AddRecExpr in the current scope are folded. llvm-svn: 130271	2011-04-27 01:21:25 +00:00
Chris Lattner	7aab2799ae	Enhance memdep to return clobber relation between noalias loads when an earlier load could be widened to encompass a later load. For example, if we see: X = load i8* P, align 4 Y = load i8* (P+3), align 1 and we have a 32-bit native integer type, we can widen the former load to i32 which then makes the second load redundant. GVN can't actually do anything with this load/load relation yet, so this isn't testable, but it is the next step to resolving PR6627, and a fairly general class of "merge neighboring loads" missed optimizations. llvm-svn: 130250	2011-04-26 22:42:01 +00:00
Chris Lattner	32dc9bd1bb	use AA::isMustAlias to simplify some calls. llvm-svn: 130248	2011-04-26 21:53:34 +00:00
Chris Lattner	6b96621a8a	remove support for llvm.invariant.end from memdep. It is a work-in-progress that is not progressing, and it has issues. llvm-svn: 130247	2011-04-26 21:50:51 +00:00
Devang Patel	b5ea255fb4	Fix an off by one error while accessing complex address element of a DIVariable. This worked until now because stars are aligned (i.e. num of complex address elements are always 0 or 2+ and when it is 2+ at least two elements are accessed together) llvm-svn: 130225	2011-04-26 18:24:39 +00:00
Chris Lattner	6f83d06ffa	Enhance MemDep: When alias analysis returns a partial alias result, return it as a clobber. This allows GVN to do smart things. Enhance GVN to be smart about the case when a small load is clobbered by a larger overlapping load. In this case, forward the value. This allows us to compile stuff like this: int test(void *P) { int tmp = *(unsigned int*)P; return tmp+*((unsigned char*)P+1); } into: _test: ## @test movl (%rdi), %ecx movzbl %ch, %eax addl %ecx, %eax ret which has one load. We already handled the case where the smaller load was from a must-aliased base pointer. llvm-svn: 130180	2011-04-26 01:21:15 +00:00
Dan Gohman	6acd95b3c1	Fix an iterator invalidation bug. llvm-svn: 130166	2011-04-25 22:48:29 +00:00
Jay Foad	dbf81d8ddf	PR9214: Convert the DIBuilder API to use ArrayRef. llvm-svn: 130086	2011-04-24 10:11:03 +00:00
Jay Foad	1a180156b6	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Devang Patel	1d6bbd41aa	Let front-end tie subprogram declaration with subprogram definition directly. llvm-svn: 130028	2011-04-22 23:10:17 +00:00
Jay Foad	5514afe6b2	PR9214: Convert Metadata API to use ArrayRef. llvm-svn: 129932	2011-04-21 19:59:31 +00:00
Devang Patel	0c7732499b	Use ArrayRef variants. llvm-svn: 129735	2011-04-18 23:51:03 +00:00
Chandler Carruth	2b1ba48f8d	Mark some functions as used which are used within debug-only code. This silences Clang's -Wunused-function when building in release mode. llvm-svn: 129709	2011-04-18 18:49:44 +00:00
Devang Patel	514b4006c2	Introduce support to encode Objective-C property information in debugging information generated for an interface. llvm-svn: 129624	2011-04-16 00:11:51 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Jay Foad	0091fe8ca1	PR9214: Convert ConstantExpr::getIndices() to return an ArrayRef, plus related tweaks to ExprMapKeyType. llvm-svn: 129443	2011-04-13 15:22:40 +00:00
Jay Foad	7c14a558fe	Don't include Operator.h from InstrTypes.h. llvm-svn: 129271	2011-04-11 09:35:34 +00:00
Eli Friedman	17822fcde9	PR9604; try to deal with RAUW updates correctly in the AST. I'm not convinced it's completely safe to cache the AST across LICM runs even with this fix, but this fix can't hurt. llvm-svn: 129198	2011-04-09 06:55:46 +00:00
Devang Patel	9f738849ab	Add support to encode function's template parameters. llvm-svn: 128947	2011-04-05 22:52:06 +00:00
Chris Lattner	57ee5a5db7	remove postdom frontiers, because it is dead. Forward dom frontiers are still used by RegionInfo :( llvm-svn: 128943	2011-04-05 21:57:17 +00:00
Tobias Grosser	8b304ff9ac	Region: Allow the user to control the printing style of the print function. Contributed by: etherzhhb@gmail.com llvm-svn: 128808	2011-04-04 07:19:18 +00:00
Eli Friedman	8baa2c7ad9	Don't assume something which might be a constant expression is an instruction. Based on PR9429, but no testcase because I can't figure out how to trigger it anymore given other changes to the relevant code. llvm-svn: 128781	2011-04-02 22:11:56 +00:00
Jay Foad	52131344a2	Remove PHINode::reserveOperandSpace(). Instead, add a parameter to PHINode::Create() giving the (known or expected) number of operands. llvm-svn: 128537	2011-03-30 11:28:46 +00:00
Jay Foad	e0938d8a87	(Almost) always call reserveOperandSpace() on newly created PHINodes. llvm-svn: 128535	2011-03-30 11:19:20 +00:00
Frits van Bommel	0bb2ad2cf7	Constant folding support for calls to umul.with.overflow(), basically identical to the smul.with.overflow() code. llvm-svn: 128379	2011-03-27 14:26:13 +00:00
Anders Carlsson	c4f0ab397c	Revert r128140 for now. llvm-svn: 128149	2011-03-23 15:51:12 +00:00
Anders Carlsson	9ed8d93f55	A global variable with internal linkage where all uses are in one function and whose address is never taken is a non-escaping local object and can't alias anything else. llvm-svn: 128140	2011-03-23 02:19:48 +00:00
Nick Lewycky	f0469af63e	Fix INT_MIN gotcha pointed out by Eli Friedman. llvm-svn: 128028	2011-03-21 21:40:32 +00:00
Andrew Trick	1c4b42d00f	Avoid creating canonical induction variables for non-native types. For example, on 32-bit architecture, don't promote all uses of the IV to 64-bits just because one use is a 64-bit cast. Alternate implementation of the patch by Arnaud de Grandmaison. llvm-svn: 127884	2011-03-18 16:50:32 +00:00
Andrew Trick	87716c93c2	Added isValidRewrite() to check the result of ScalarEvolutionExpander. SCEV may generate expressions composed of multiple pointers, which can lead to invalid GEP expansion. Until we can teach SCEV to follow strict pointer rules, make sure no bad GEPs creep into IR. Fixes rdar://problem/9038671. llvm-svn: 127839	2011-03-17 23:51:11 +00:00
Nick Lewycky	b4d763b37d	Add comments for the demanglings. Correct mangled form of operator delete! llvm-svn: 127801	2011-03-17 05:20:12 +00:00
Nick Lewycky	c1f8658368	Add C++ global operator {new,new[],delete,delete[]}(unsigned {int,long}) to the memory builtins as equivalent to malloc/free. This is different from any attribute we have. For example, you can delete the allocators when their result is unused, but you can't collapse two calls to the same function, even if no global/memory state has changed in between. The noalias return states that the result does not alias any other pointer, but instcombine optimizes malloc() as though the result is non-null for the purpose of eliminating unused pointers. llvm-svn: 127673	2011-03-15 07:31:32 +00:00
Andrew Trick	a34f1b1f10	Remove getMinusSCEVForExitTest(). This function performed acrobatics to prove no-self-wrap, which we now have for free. llvm-svn: 127643	2011-03-15 01:16:14 +00:00
Andrew Trick	f6b01ff422	Propagate SCEV no-wrap flags whenever possible. This needs review. llvm-svn: 127638	2011-03-15 00:37:00 +00:00
Andrew Trick	e92dcceab7	Negating a recurrence preserves no-self-wrap. llvm-svn: 127593	2011-03-14 17:38:54 +00:00
Andrew Trick	f1781db622	HowFarToZero can compute a trip count as long as the recurrence has no-self-wrap. llvm-svn: 127591	2011-03-14 17:28:02 +00:00
Andrew Trick	8b55b736b1	Added SCEV::NoWrapFlags to manage unsigned, signed, and self wrap properties. Added the self-wrap flag for SCEV::AddRecExpr. A slew of temporary FIXMEs indicate the intention of the no-self-wrap flag without changing behavior in this revision. llvm-svn: 127590	2011-03-14 16:50:06 +00:00
Benjamin Kramer	5acc751b6f	Teach ComputeMaskedBits about sub nsw. llvm-svn: 127548	2011-03-12 17:18:11 +00:00
Benjamin Kramer	391a946fa9	ComputeMaskedBits: sub falls through to add, and sub doesn't have the same overflow semantics as add. Should fix the selfhost failures that started with r127463. llvm-svn: 127465	2011-03-11 14:46:49 +00:00
Nick Lewycky	cc79973856	Teach ComputeMaskedBits about nsw on add. I don't think there's anything we can do with nuw here, but sub and mul should be given similar treatment. Fixes PR9343 #15! llvm-svn: 127463	2011-03-11 09:00:19 +00:00
Devang Patel	fa31d38aad	Introduce DebugInfoProbe. This is used to monitor how llvm optimizer is treating debugging information. It generates output that looks like 8 times line number info lost by Scalar Replacement of Aggregates (SSAUp) 1 times line number info lost by Simplify well-known library calls 12 times variable info lost by Jump Threading llvm-svn: 127381	2011-03-10 00:21:25 +00:00
Andrew Trick	2afa325811	When SCEV can determine the loop test is X < X, set ExactBECount=0. When ExactBECount is a constant, use it for MaxBECount. When MaxBECount cannot be computed, replace it with ExactBECount. Fixes PR9424. llvm-svn: 127342	2011-03-09 17:29:58 +00:00
Andrew Trick	2a3b71684a	whitespace llvm-svn: 127340	2011-03-09 17:23:39 +00:00
Nick Lewycky	774647d974	Fix two cases I forgot to update when doing a mental "getSwappedPredicate". Thanks Duncan Sands! llvm-svn: 127323	2011-03-09 08:20:06 +00:00
Nick Lewycky	980104d1d6	Add another micro-optimization. Apologies for the lack of refactoring, but I gave up when I realized I couldn't come up with a good name for what the refactored function would be, to describe what it does. This is PR9343 test12, which is test3 with arguments reordered. Whoops! llvm-svn: 127318	2011-03-09 06:26:03 +00:00
Duncan Sands	7dc3d47c34	Fix PR9331. Simplified version of a patch by Jakub Staszak. llvm-svn: 127243	2011-03-08 12:39:03 +00:00
Nick Lewycky	e467979d0a	Add more analysis of the sign bit of an srem instruction. If the LHS is negative then the result could go either way. If it's provably positive then so is the srem. Fixes PR9343 #7! llvm-svn: 127146	2011-03-07 01:50:10 +00:00
Nick Lewycky	9719a719c7	Thread comparisons over udiv/sdiv/ashr/lshr exact and lshr nuw/nsw whenever possible. This goes into instcombine and instsimplify because instsimplify doesn't need to check hasOneUse since it returns (almost exclusively) constants. This fixes PR9343 #4 #5 and #8! llvm-svn: 127064	2011-03-05 05:19:11 +00:00
Dan Gohman	aa036eedb8	When declining to reuse existing expressions that involve casts, ignore bitcasts, which are really no-ops here. This fixes slowdowns on MultiSource/Applications/aha and others. llvm-svn: 127031	2011-03-04 20:46:46 +00:00
Nick Lewycky	41c529bd09	Revert broken srem logic from r126991. llvm-svn: 127021	2011-03-04 19:26:08 +00:00
Nick Lewycky	8e3a79da9f	Fold "icmp pred (srem X, Y), Y" like we do for urem. Handle signed comparisons in the urem case, though not the other way around. This is enough to get #3 from PR9343! llvm-svn: 126991	2011-03-04 10:06:52 +00:00
Nick Lewycky	3cec6f5563	Teach instruction simplify to use constant ranges to solve problems of the form "icmp pred %X, CI" and a number of examples where "%X = binop %Y, CI2". Some of these cases (div and rem) used to make it through opt -O2, but the others are probably now making code elsewhere redundant (probably instcombine). llvm-svn: 126988	2011-03-04 07:00:57 +00:00
Duncan Sands	bf577d6a86	Remove DIFactory. Patch by Devang. llvm-svn: 126871	2011-03-02 20:30:37 +00:00
Dan Gohman	7290868a1b	Don't re-use existing addrec expansions if they contain casts. This fixes PR9259. llvm-svn: 126812	2011-03-02 01:34:10 +00:00
Devang Patel	40eee1e970	Today, the language front ends produce llvm.dbg.* intrinsics, used to encode arguments' debug info, in order anyway, most of the time. However, if a front end mix-n-matches llvm.dbg.declare and llvm.dbg.value intrinsics to encode debug info for arguments, then the code generator needs a way to find the argument order. Use 8 bits from the line number field to keep track of argument ordering while encoding debug info for an argument. That leaves 24 bits for the line number; DebugLoc also allocates 24 bits for line numbers. If a function has more than 255 arguments, then the rest of the arguments will be ordered by the llvm.dbg.* intrinsics' ordering in IR. llvm-svn: 126793	2011-03-01 22:58:13 +00:00
Nick Lewycky	c9d20067cd	Optimize "icmp pred (urem X, Y), Y" --> true/false depending on pred. There's more work to do here, "icmp ult (urem X, 10), 11" doesn't optimize away yet. Fixes example 3 from PR9343! llvm-svn: 126741	2011-03-01 08:15:50 +00:00
Ted Kremenek	49d15b959e	Unbreak CMake build. llvm-svn: 126717	2011-03-01 00:02:51 +00:00
Dan Gohman	161058838c	Delete the LiveValues pass. I won't get back to the project it was started for in the foreseeable future. llvm-svn: 126668	2011-02-28 19:37:59 +00:00
Nick Lewycky	afe4a3062d	Fix comment. llvm-svn: 126645	2011-02-28 09:18:11 +00:00
Nick Lewycky	66f4f22f7b	srem doesn't actually have the same resulting sign as its numerator, you could also have a zero when numerator = denominator. Reverts parts of r126635 and r126637. llvm-svn: 126644	2011-02-28 09:17:39 +00:00
Nick Lewycky	c9aab8567b	Teach value tracking to make use of flags in more situations. llvm-svn: 126642	2011-02-28 08:02:21 +00:00
Nick Lewycky	29dbbd12c1	Teach ValueTracking to look at the dividend when determining the sign bit of an srem instruction. llvm-svn: 126637	2011-02-28 06:52:12 +00:00
Tobias Grosser	98eecaf0a9	RegionPrinter: Ignore back edges when laying out the graph llvm-svn: 126564	2011-02-27 04:11:07 +00:00
Devang Patel	9b4127349c	Follow LLVM coding style. clang uses DBuilder, so it requires a corresponding change. llvm-svn: 126231	2011-02-22 18:56:12 +00:00
Benjamin Kramer	5b7a4e0195	Move "A | ~(A & ?) -> -1" from InstCombine to InstructionSimplify. llvm-svn: 126082	2011-02-20 15:20:01 +00:00
Chris Lattner	acf6b0776a	Stores of null pointers should turn into memset, we weren't recognizing them as splat values. llvm-svn: 126041	2011-02-19 19:35:49 +00:00
Oscar Fuentes	5ed962656c	Move library stuff out of the toplevel CMakeLists.txt file. llvm-svn: 125968	2011-02-18 22:06:14 +00:00
Devang Patel	4ab0852080	Move DbgInfoPrinter specific utilities inside DbgInfoPrinter.cpp llvm-svn: 125571	2011-02-15 17:36:11 +00:00
Devang Patel	27924da676	Print function info. Patch by Minjang Kim. llvm-svn: 125567	2011-02-15 17:24:56 +00:00
Chris Lattner	69229316aa	convert ConstantVector::get to use ArrayRef. llvm-svn: 125537	2011-02-15 00:14:00 +00:00
Chris Lattner	34442e6ebf	revert my ConstantVector patch, it seems to have made the llvm-gcc builders unhappy. llvm-svn: 125504	2011-02-14 18:15:46 +00:00
Chris Lattner	d9f5b88548	Switch ConstantVector::get to use ArrayRef instead of a pointer+size idiom. Change various clients to simplify their code. llvm-svn: 125487	2011-02-14 07:55:32 +00:00
Duncan Sands	b86070933f	Remove pointless blank line. llvm-svn: 125463	2011-02-13 18:11:05 +00:00
Duncan Sands	d114ab331c	Teach instsimplify that X+Y>=X+Z is the same as Y>=Z if neither side overflows, plus some variations of this. According to my auto-simplifier this occurs a lot but usually in combination with max/min idioms. Because max/min aren't handled yet this unfortunately doesn't have much effect in the testsuite. llvm-svn: 125462	2011-02-13 17:15:40 +00:00
Chris Lattner	4f23f2be15	teach SCEV that the scale and addition of an inbounds gep don't NSW. This fixes a FIXME in scev-aa.ll (allowing a new no-alias result) and generally makes things more precise. llvm-svn: 125449	2011-02-13 03:14:49 +00:00
Chris Lattner	7936a8a488	Per discussion with Dan G, inbounds geps certainly can have unsigned overflow (e.g. "gep P, -1"), and while they can have signed wrap in theoretical situations, modelling an AddRec as not having signed wrap is good enough for any case we can think of today. In the future if this isn't enough, we can revisit this. Modeling them as having NUW isn't causing any known problems either FWIW. llvm-svn: 125410	2011-02-11 21:43:33 +00:00
Nick Lewycky	ac0b62c277	Tolerate degenerate phi nodes that can occur in the middle of optimization passes. Fixes PR9112. Patch by Jakub Staszak! llvm-svn: 125319	2011-02-10 23:54:10 +00:00
Duncan Sands	8b4e283bfb	Formatting and comment tweaks. llvm-svn: 125200	2011-02-09 17:45:03 +00:00
Chris Lattner	9e4aa0259f	Teach instsimplify some tricks about exact/nuw/nsw shifts. improve interfaces to instsimplify to take this info. llvm-svn: 125196	2011-02-09 17:15:04 +00:00
Chris Lattner	b940091388	Rework InstrTypes.h so as to reduce the repetition around the NSW/NUW/Exact versions of creation functions. Eventually, the "insertion point" versions of these should just be removed, we do have IRBuilder after all. Do a massive rewrite of much of pattern match. It is now shorter and less redundant and has several other widgets I will be using in other patches. Among other changes, m_Div is renamed to m_IDiv (since it only matches integer divides) and m_Shift is gone (it used to match all binops!!) and we now have m_LogicalShift for the one client to use. Enhance IRBuilder to have "isExact" arguments to things like CreateUDiv and reduce redundancy within IRBuilder by having these methods chain to each other more instead of duplicating code. llvm-svn: 125194	2011-02-09 17:00:45 +00:00
Duncan Sands	867cb633b4	Add an m_Div pattern for matching either a udiv or an sdiv and use it to simplify the "(X/Y)*Y->X when the division is exact" transform. llvm-svn: 125004	2011-02-07 09:36:32 +00:00
Chris Lattner	6e57b15228	teach instsimplify to transform (X / Y) * Y to X when the div is an exact udiv. llvm-svn: 124994	2011-02-06 22:05:31 +00:00
Eric Christopher	b54605b8e2	Remove premature optimization that avoided calculating argument weights if we weren't going to inline the function. The rest of the code using this was removed. Fixes PR9154. llvm-svn: 124991	2011-02-06 21:27:46 +00:00
Anders Carlsson	ecf8e159e3	Simplify test, as suggested by Chris. llvm-svn: 124990	2011-02-06 20:22:49 +00:00
Anders Carlsson	d21b06a0db	When loading from a constant, fold inttoptr if the integer type and the resulting pointer type both have the same size. llvm-svn: 124987	2011-02-06 20:11:56 +00:00
Anders Carlsson	36c6d23074	Fix another warning. llvm-svn: 124961	2011-02-05 18:33:43 +00:00
Eric Christopher	ceb4671ddd	Fix cut and paste error spotted by Jakob. llvm-svn: 124930	2011-02-05 02:48:47 +00:00
Eric Christopher	2dfbd7e0c1	Rewrite how the indirect call bonus is handled. This now works by: a) Making it a per call site bonus for functions that we can move from indirect to direct calls. b) Reduces the bonus from 500 to 100 per call site. c) Subtracts the size of the possible newly inlineable call from the bonus to only add a bonus if we can inline a small function to devirtualize it. Also changes the bonus from a positive that's subtracted to a negative that's added. Fixes the remainder of rdar://8546196 by reducing the object file size after inlining by 84%. llvm-svn: 124916	2011-02-05 00:49:15 +00:00
Duncan Sands	06504025d2	Improve threading of comparisons over select instructions (spotted by my auto-simplifier). This has a big impact on Ada code, but not much else. Unfortunately the impact is mostly negative! This is due to PR9004 (aka SCCP failing to resolve conditional branch conditions in the destination blocks of the branch), in which simple correlated expressions are not resolved but complicated ones are, so simplifying has a bad effect! llvm-svn: 124788	2011-02-03 09:37:39 +00:00
Devang Patel	df0dd7dc69	Fix typo in comment. llvm-svn: 124759	2011-02-03 00:13:47 +00:00
Devang Patel	be933b470a	Add support to describe template value parameter in debug info. llvm-svn: 124755	2011-02-02 22:35:53 +00:00
Devang Patel	3a9e65efb6	Add support to describe template parameter type in debug info. llvm-svn: 124752	2011-02-02 21:38:25 +00:00
Duncan Sands	5747abab10	Reenable the transform "(X*Y)/Y->X" when the multiplication is known not to overflow (nsw flag), which was disabled because it breaks 254.gap. I have informed the GAP authors of the mistake in their code, and arranged for the testsuite to use -fwrapv when compiling this benchmark. llvm-svn: 124746	2011-02-02 20:52:00 +00:00
Duncan Sands	a29ea9aa4c	Add a m_Undef pattern for convenience. This is so that code that uses pattern matching can also pattern match undef, creating a more uniform style. llvm-svn: 124657	2011-02-01 09:06:20 +00:00
Duncan Sands	4b397fcdc2	Add a m_SignBit pattern for convenience. llvm-svn: 124656	2011-02-01 08:50:33 +00:00
Duncan Sands	cf0ff030a8	Have m_One also match constant vectors for which every element is 1. llvm-svn: 124655	2011-02-01 08:39:12 +00:00
Eric Christopher	46308e666a	Reapply 124275 since the Dragonegg failure was unreproducible. llvm-svn: 124641	2011-02-01 01:16:32 +00:00
Duncan Sands	2e5a58da8f	Commit 124487 broke 254.gap. See if disabling the part that might be triggered by PR9088 fixes things. llvm-svn: 124561	2011-01-30 18:24:20 +00:00
Duncan Sands	b67edc6a29	Transform (X/Y)*Y into X if the division is exact. Instcombine already knows how to do this and more, but would only do it if X/Y had only one use. Spotted as the most common missed simplification in SPEC by my auto-simplifier, now that it knows about nuw/nsw/exact flags. This removes a bunch of multiplications from 447.dealII and 483.xalancbmk. It also removes a lot from tramp3d-v4, which results in much more inlining. llvm-svn: 124560	2011-01-30 18:03:50 +00:00
Nick Lewycky	b89d9a4412	Fix comment. llvm-svn: 124544	2011-01-29 19:55:23 +00:00
Frits van Bommel	c2549661af	Move InstCombine's knowledge of fdiv to SimplifyInstruction(). llvm-svn: 124534	2011-01-29 15:26:31 +00:00
Duncan Sands	2e9e4f1be3	Fix typo: should have been testing that X was odd, not V. llvm-svn: 124533	2011-01-29 13:27:00 +00:00
Andrew Trick	24f5ff0f23	Implementation of path profiling. Modified patch by Adam Preuss. This builds on the existing framework for block tracing, edge profiling and optimal edge profiling. See -help-hidden for new flags. For documentation, see the technical report "Implementation of Path Profiling..." in llvm.org/pubs. llvm-svn: 124515	2011-01-29 01:09:53 +00:00
Duncan Sands	e4b4d0c16d	This dyn_cast should be a cast. Pointed out by Frits van Bommel. llvm-svn: 124497	2011-01-28 18:53:08 +00:00
Duncan Sands	65995fa2a0	Thread divisions over selects and phis. This doesn't fire much and has basically zero effect on the testsuite (it improves two Ada testcases). llvm-svn: 124496	2011-01-28 18:50:50 +00:00
Duncan Sands	771e82a863	My auto-simplifier noticed that ((X/Y)*Y)/Y occurs several times in SPEC benchmarks, and that it can be simplified to X/Y. (In general you can only simplify (Z*Y)/Y to Z if the multiplication did not overflow; if Z has the form "X/Y" then this is the case). This patch implements that transform and moves some Div logic out of instcombine and into InstructionSimplify. Unfortunately instcombine gets in the way somewhat, since it likes to change (X/Y)*Y into X-(X rem Y), so I had to teach instcombine about this too. Finally, thanks to the NSW/NUW flags, sometimes we know directly that "Z*Y" does not overflow, because the flag says so, so I added that logic too. This eliminates a bunch of divisions and subtractions in 447.dealII, and has good effects on some other benchmarks too. It seems to have quite an effect on tramp3d-v4 but it's hard to say if it's good or bad because inlining decisions changed, resulting in massive changes all over. llvm-svn: 124487	2011-01-28 16:51:11 +00:00
Eric Christopher	cd55a46c31	Temporarily revert 124275 to see if it brings the dragonegg buildbot back. llvm-svn: 124312	2011-01-26 19:40:31 +00:00
Duncan Sands	8a33733228	APInt has a method for determining whether a number is a power of 2 which is more efficient than countPopulation - use it. llvm-svn: 124283	2011-01-26 08:44:16 +00:00
Nick Lewycky	d9e6b4a8ff	Fix memory corruption. If one of the SCEV creation functions calls another but doesn't return immediately after then the insert position in UniqueSCEVs will be out of date. No test because this is a memory corruption issue. Fixes PR9051! llvm-svn: 124282	2011-01-26 08:40:22 +00:00
Eric Christopher	078159e310	Separate out the constant bonus from the size reduction metrics. Rework a few loops accordingly. Should be no functional change. This is a step for more accurate cost/benefit analysis of devirt/inlining bonuses. llvm-svn: 124275	2011-01-26 02:58:39 +00:00
Eric Christopher	58f157a677	Coding style formatting changes. llvm-svn: 124260	2011-01-26 01:09:59 +00:00
Duncan Sands	9e9d5b25e2	In which I discover that zero+zero is zero, d'oh! llvm-svn: 124188	2011-01-25 15:14:15 +00:00
Duncan Sands	fced7620f5	See if this fixes llvm-gcc bootstrap. llvm-svn: 124184	2011-01-25 12:15:09 +00:00
Duncan Sands	d395108394	According to my auto-simplifier the most common missed simplifications in optimized code are: (non-negative number)+(power-of-two) != 0 -> true and (x | 1) != 0 -> true Instcombine knows about the second one of course, but only does it if X|1 has only one use. These fire thousands of times in the testsuite. llvm-svn: 124183	2011-01-25 09:38:29 +00:00
Eric Christopher	cd087f2512	Reorganize this so that the early exit and special cases come early rather than interspersed. No functional change. llvm-svn: 124168	2011-01-25 01:34:31 +00:00
Dan Gohman	0f124e1987	Give GetUnderlyingObject a TargetData, to keep it in sync with BasicAA's DecomposeGEPExpression, which recently began using a TargetData. This fixes PR8968, though the testcase is awkward to reduce. Also, update several of GetUnderlyingObject's users which happen to have a TargetData handy to pass it in. llvm-svn: 124134	2011-01-24 18:53:32 +00:00
Chris Lattner	f277b5d434	fix PR8928 by clearing a stale map, patch by Jakub Staszak! llvm-svn: 124132	2011-01-24 18:36:51 +00:00
Dan Gohman	3ac8cd614f	Add a comment. llvm-svn: 124126	2011-01-24 17:54:18 +00:00
Nick Lewycky	d4192f71b5	Simplify some code with no functionality change. Make the test a lot more robust against smarter optimizations, using the power of FileCheck. llvm-svn: 124081	2011-01-23 20:06:05 +00:00
Ted Kremenek	3c4408ceb6	Null initialize a few variables flagged by clang's -Wuninitialized-experimental warning. While these don't look like real bugs, clang's -Wuninitialized-experimental analysis is stricter than GCC's, and these fixes have the benefit of being general nice cleanups. llvm-svn: 124073	2011-01-23 17:05:06 +00:00
Nick Lewycky	bc98f5b78e	Use value ranges to fold ext(trunc) in SCEV when possible. llvm-svn: 124062	2011-01-23 06:20:19 +00:00
Nick Lewycky	b32c8943e6	Have SCEV turn sext(x) into zext(x) when x is s>= 0. This applies many times in "make check" alone. llvm-svn: 124046	2011-01-22 22:06:21 +00:00
Eric Christopher	c70e037b73	Add a FIXME explaining the move to a single indirect call bonus per function that we can change from indirect to direct. llvm-svn: 124045	2011-01-22 21:56:53 +00:00
Eric Christopher	08e8b3b629	Only apply the devirtualization bonus once instead of per-call site in the target function. Fixes part of rdar://8546196 llvm-svn: 124044	2011-01-22 21:17:33 +00:00
Duncan Sands	8fb2c3827c	At -O123 the early-cse pass is run before instcombine has run. According to my auto-simplifier the transform most missed by early-cse is (zext X) != 0 -> X != 0. This patch adds this transform and some related logic to InstructionSimplify and removes some of the logic from instcombine (unfortunately not all because there are several situations in which instcombine can improve things by making new instructions, whereas instsimplify is not allowed to do this). At -O2 this often results in more than 15% more simplifications by early-cse, and results in hundreds of lines of bitcode being eliminated from the testsuite. I did see some small negative effects in the testsuite, for example a few additional instructions in three programs. One program, 483.xalancbmk, got an additional 35 instructions, which seems to be due to a function getting an additional instruction and then being inlined all over the place. llvm-svn: 123911	2011-01-20 13:21:55 +00:00
Nick Lewycky	5c901f3489	Similarly, analyze truncate through multiply. llvm-svn: 123842	2011-01-19 18:56:00 +00:00
Nick Lewycky	5143f0f09b	Add a missed SCEV fold that is required to continue analyzing the IR produced by indvars through the scev expander. trunc(add x, y) --> add(trunc x, y). Currently SCEV largely folds the other way which is probably wrong, but preserved to minimize churn. Instcombine doesn't do this fold either, demonstrating a missed optz'n opportunity on code doing add+trunc+add. llvm-svn: 123838	2011-01-19 16:59:46 +00:00
Nick Lewycky	e9ea75e3fc	Add a missing SCEV simplification sext(zext x) --> zext x. llvm-svn: 123832	2011-01-19 15:56:12 +00:00
Dan Gohman	44da55b7be	Teach BasicAA to return PartialAlias in cases where both pointers are pointing to the same object, one pointer is accessing the entire object, and the other access has a non-zero size. This prevents TBAA from kicking in and saying NoAlias in such cases. llvm-svn: 123775	2011-01-18 21:16:06 +00:00
Duncan Sands	99589d07e9	For completeness, generalize the (X + Y) - Y -> X transform and add X - (X + 1) -> -1. These were not recommended by my auto-simplifier since they don't fire often enough. However they do fire from time to time, for example they remove one subtraction from the final bitcode for 483.xalancbmk. llvm-svn: 123755	2011-01-18 11:50:19 +00:00
Duncan Sands	9b8e2bd8ef	Simplify (X<<1)-X into X. According to my auto-simplifier this is the most common missed simplification in fully optimized code. It occurs sporadically in the testsuite, and many times in 403.gcc: the final bitcode has 131 fewer subtractions after this change. The reason that the multiplies are not eliminated is the same reason that instcombine did not catch this: they are used by other instructions (instcombine catches this with a more general transform which in general is only profitable if the operands have only one use). llvm-svn: 123754	2011-01-18 09:24:58 +00:00
Cameron Zwarich	6b0c4c9b6c	Move DominanceFrontier from VMCore to Analysis. llvm-svn: 123747	2011-01-18 06:06:27 +00:00
Chris Lattner	08f43456c9	fix PR8983, a broken assertion. llvm-svn: 123562	2011-01-16 03:43:53 +00:00
Nick Lewycky	367f98f000	Teach LazyValueInfo that allocas aren't NULL. Over all of llvm-test, this saves half a million non-local queries, each of which would otherwise have triggered a linear scan over a basic block. Also fix a fixme for memory intrinsics which dereference pointers. With this, we prove that a pointer is non-null because it was dereferenced by an intrinsic 112 times in llvm-test. llvm-svn: 123533	2011-01-15 09:16:12 +00:00
Duncan Sands	d6f1a9584d	Turn X-(X-Y) into Y. According to my auto-simplifier this is the most common simplification present in fully optimized code (I think instcombine fails to transform some of these when "X-Y" has more than one use). Fires here and there all over the test-suite, for example it eliminates 8 subtractions in the final IR for 445.gobmk, 2 subs in 447.dealII, 2 in paq8p etc. llvm-svn: 123442	2011-01-14 15:26:10 +00:00
Duncan Sands	571fd9a606	Factorize common code out of the InstructionSimplify shift logic. Add in threading of shifts over selects and phis while there. This fires here and there in the testsuite, to not much effect. For example when compiling spirit it fires 5 times, during early-cse, resulting in 6 more cse simplifications, and 3 more terminators being folded by jump threading, but the final bitcode doesn't change in any interesting way: other optimizations would have caught the opportunity anyway, only later. llvm-svn: 123441	2011-01-14 14:44:12 +00:00
Duncan Sands	7f60dc1eb0	Move some shift transforms out of instcombine and into InstructionSimplify. While there, I noticed that the transform "undef >>a X -> undef" was wrong. For example if X is 2 then the top two bits must be equal, so the result can not be anything. I fixed this in the constant folder as well. Also, I made the transform for "X << undef" stronger: it now folds to undef always, even though X might be zero. This is in accordance with the LangRef, but I must admit that it is fairly aggressive. Also, I added "i32 X << 32 -> undef" following the LangRef and the constant folder, likewise fairly aggressive. llvm-svn: 123417	2011-01-14 00:37:45 +00:00
Tobias Grosser	b1d11c19da	Add single entry / single exit accessors. Add methods for accessing the (single) entry / exit edge of a region. If no such edge exists, null is returned. Both accessors return the start block of the corresponding edge. The edge can finally be formed by utilizing Region::getEntry() or Region::getExit(); Contributed by: Andreas Simbuerger <simbuerg@fim.uni-passau.de> llvm-svn: 123410	2011-01-13 23:18:04 +00:00
Duncan Sands	ad000d8f16	Remove some wrong code which fortunately was never executed (as explained in the comment I added): an extern weak global may have a null address. llvm-svn: 123373	2011-01-13 10:43:08 +00:00
Duncan Sands	8d25a7c3a0	The most common simplification missed by instsimplify in unoptimized bitcode is "X != 0 -> X" when X is a boolean. This occurs a lot because of the way llvm-gcc converts gcc's conditional expressions. Add this, and a few other similar transforms for completeness. llvm-svn: 123372	2011-01-13 08:56:29 +00:00
Chris Lattner	d30de95520	some comment improvements. llvm-svn: 123243	2011-01-11 17:11:59 +00:00
Eric Christopher	23bf3bafb7	Temporarily revert 123133, it's causing some regressions and I'm trying to get a testcase. llvm-svn: 123225	2011-01-11 09:02:09 +00:00
Chris Lattner	23109cb319	the GEP faq says that only inbounds geps are guaranteed to not overflow. llvm-svn: 123218	2011-01-11 06:44:41 +00:00
Jakob Stoklund Olesen	087f207009	Revert r123207: "Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare." It didn't. llvm-svn: 123215	2011-01-11 04:05:39 +00:00
Jakob Stoklund Olesen	9b6853efd6	Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare. llvm-svn: 123207	2011-01-11 01:18:03 +00:00
Chandler Carruth	b1e7f557b7	Teach constant folding to perform conversions from constant floating point values to their integer representation through the SSE intrinsic calls. This is the last part of a README.txt entry for which I have real world examples. llvm-svn: 123206	2011-01-11 01:07:24 +00:00
Chandler Carruth	352d9b14b3	Cleanup some of the constant folding code to consistently test intrinsic IDs when available rather than using a mixture of IDs and textual name comparisons. llvm-svn: 123165	2011-01-10 09:02:58 +00:00
Chris Lattner	67f82314af	add a fixme: ir isn't expressive enough. llvm-svn: 123139	2011-01-09 23:02:10 +00:00
Chris Lattner	28f140a33e	Step #4 in improving trip count analysis: HowFarToZero can analyze NUW AddRec's much more aggressively. We now get a trip count for @test2 in nsw.ll llvm-svn: 123138	2011-01-09 22:58:47 +00:00
Chris Lattner	dff679f4b6	rearrange some code, no functionality change. llvm-svn: 123136	2011-01-09 22:39:48 +00:00
Chris Lattner	a44274cb4f	Step #3 to improving trip count analysis: If we fold a + {b,+,stride} into {a+b,+,stride} (because a is LIV), then the resultant AddRec is NUW/NSW if the client says it is. llvm-svn: 123133	2011-01-09 22:31:26 +00:00
Chris Lattner	fc87752d55	Step #2 to improve trip count analysis for loops like this: void f(int* begin, int* end) { std::fill(begin, end, 0); } which turns into a != exit expression where one pointer is strided and (thanks to step #1) known to not overflow, and the other is loop invariant. The observation here is that, though the IV is strided by 4 in this case, the IV has to become equal to the end value. It cannot "miss" the end value by stepping over it, because if it did, the strided IV expression would eventually wrap around. Handle this by turning A != B into "A-B != 0" where the A-B part is known to be NUW. llvm-svn: 123131	2011-01-09 22:26:35 +00:00
Chris Lattner	10223a3fbf	teach SCEV analysis of PHI nodes that PHI recurrences formed with GEP instructions are always NUW, because PHIs cannot wrap the end of the address space. llvm-svn: 123105	2011-01-09 02:28:48 +00:00
Chris Lattner	a337f5ec5c	reduce indentation. Print <nuw> and <nsw> when dumping SCEV AddRec's that have the bit set. llvm-svn: 123104	2011-01-09 02:16:18 +00:00
Chris Lattner	171608e738	use isNullValue() to simplify code, add an assert. llvm-svn: 122977	2011-01-06 22:24:29 +00:00
Chris Lattner	5858e091a6	implement constant folding support for an exotic constant expr: ret i64 ptrtoint (i8* getelementptr ([1000 x i8]* @X, i64 1, i64 sub (i64 0, i64 ptrtoint ([1000 x i8]* @X to i64))) to i64) to "ret i64 1000". This allows us to correctly compute the trip count on a loop in PR8883, which occurs with std::fill on a char array. This allows us to transform it into a memset with a constant size. llvm-svn: 122950	2011-01-06 06:19:46 +00:00
Owen Anderson	6f060afbbd	Reorder, rename, and document some members to make this easier to follow. llvm-svn: 122929	2011-01-05 23:26:22 +00:00
Owen Anderson	e86dacf449	When computing the value on an edge, in certain cases LVI would fail to compute the value range in the predecessor block, leading to an incorrect conclusion for the edge value. Found by inspection. llvm-svn: 122908	2011-01-05 21:37:18 +00:00
Owen Anderson	118ac80c81	Re-convert several of LazyValueInfo's internal maps to Dense{Map|Set}, and fix the issue in hasBlockValue() that was causing iterator invalidations. Many thanks to Dimitry Andric for tracking down those invalidations! llvm-svn: 122906	2011-01-05 21:15:29 +00:00
Chris Lattner	c86e67e110	fix an off-by-one bug that caused a crash analyzing ashr's with huge shift amounts, PR8896 llvm-svn: 122814	2011-01-04 18:19:15 +00:00
Owen Anderson	d62d37225a	Use the new addEscapingValue callback to update GlobalsModRef when GVN adds PHIs of GEPs. For the moment, have GlobalsModRef handle this conservatively by simply removing the value from its maps. llvm-svn: 122787	2011-01-03 23:51:43 +00:00
Owen Anderson	b6e4ff0d85	Stub out a new updating interface to AliasAnalysis, allowing stateful analyses to be informed when a pointer value has potentially become escaping. Implementations can choose to either fall back to conservative responses for that value, or may recompute their analysis to accommodate the change. llvm-svn: 122777	2011-01-03 21:38:41 +00:00
Chris Lattner	16e42128c2	fix rdar://8813415 - a miscompilation of 164.gzip that loop-idiom exposed. It turns out to be a latent bug in basicaa, scary. llvm-svn: 122772	2011-01-03 21:03:33 +00:00
Nick Lewycky	0f87ca7733	Add spliceFunction to the CallGraph interface. This allows users to efficiently update a callGraph when performing the common operation of splicing the body to a new function and updating all callers (such as via RAUW). No users yet, though this is intended for DeadArgumentElimination as part of PR8887. llvm-svn: 122728	2011-01-03 03:19:35 +00:00
Chris Lattner	bf0aa927cc	split dom frontier handling stuff out to its own DominanceFrontier header, so that Dominators.h is just domtree. Also prune #includes a bit. llvm-svn: 122714	2011-01-02 22:09:33 +00:00
Duncan Sands	772749aea1	Revert commit 122654 at the request of Chris, who reckons that instsimplify is the wrong hammer for this nail, and is probably right. llvm-svn: 122661	2011-01-01 20:08:02 +00:00
Duncan Sands	e3c539581c	Fix a README item by having InstructionSimplify do a mild form of value numbering, in which it considers (for example) "%a = add i32 %x, %y" and "%b = add i32 %x, %y" to be equal because the operands are equal and the result of the instructions only depends on the values of the operands. This has almost no effect (it removes 4 instructions from gcc-as-one-file), and perhaps slows down compilation: I measured a 0.4% slowdown on the large gcc-as-one-file testcase, but it wasn't statistically significant. llvm-svn: 122654	2011-01-01 16:12:09 +00:00
Benjamin Kramer	b6d52b8b64	Cast away "comparison between signed and unsigned integer" warnings. llvm-svn: 122598	2010-12-28 13:52:52 +00:00
Chris Lattner	9cb1035f94	move isBytewiseValue out to ValueTracking.h/cpp llvm-svn: 122565	2010-12-26 20:15:01 +00:00
Jeffrey Yasskin	9b43f33620	Change all self assignments X=X to (void)X, so that we can turn on a new gcc warning that complains on self-assignments and self-initializations. llvm-svn: 122458	2010-12-23 00:58:24 +00:00
Duncan Sands	a45cfbd405	When determining whether the new instruction was already present in the original instruction, half the cases were missed (making it not wrong but suboptimal). Also correct a typo (A <-> B) in the second chunk. llvm-svn: 122414	2010-12-22 17:15:25 +00:00
Duncan Sands	3547d2ebd8	Add some statistics, good for understanding how much more powerful instcombine is compared to instsimplify. llvm-svn: 122397	2010-12-22 09:40:51 +00:00
Duncan Sands	fecc642224	While I don't think any later transforms can fire, it seems cleaner to not assume this (for example in case more transforms get added below it). Suggested by Frits van Bommel. llvm-svn: 122332	2010-12-21 15:03:43 +00:00
Duncan Sands	5def0d6791	Fix inverted condition noticed by Frits van Bommel. llvm-svn: 122331	2010-12-21 14:48:48 +00:00
Duncan Sands	d0eb6d39f8	Pull a few more simplifications out of instcombine (there are still plenty left though!), in particular for multiplication. llvm-svn: 122330	2010-12-21 14:00:22 +00:00
Duncan Sands	ee3ec6eb94	Teach InstructionSimplify about distributive laws. These transforms fire quite often, but don't make much difference in practice presumably because instcombine also knows them and more. llvm-svn: 122328	2010-12-21 13:32:22 +00:00
Duncan Sands	f64e690c4f	Move checking of the recursion limit into the various Thread methods. No functionality change. llvm-svn: 122327	2010-12-21 09:09:15 +00:00
Duncan Sands	6c7a52cf80	Add generic simplification of associative operations, generalizing a couple of existing transforms. This fires surprisingly often, for example when compiling gcc "(X+(-1))+1->X" fires quite a lot as well as various "and" simplifications (usually with a phi node operand). Most of the time this doesn't make a real difference since the same thing would have been done elsewhere anyway, eg: by instcombine, but there are a few places where this results in simplifications that we were not doing before. llvm-svn: 122326	2010-12-21 08:49:00 +00:00
Owen Anderson	c6beda80ff	Speculatively revert the use of DenseMap in LazyValueInfo, which may be causing Linux self-host failures. llvm-svn: 122291	2010-12-20 23:53:19 +00:00
Owen Anderson	9be3ec6264	Attempt to appease the DragonEgg buildbots. llvm-svn: 122288	2010-12-20 23:23:18 +00:00
Owen Anderson	813a2c45a8	Convert one of LVI's primary maps to a DenseMap, now that we are more assured of iterator stability. llvm-svn: 122273	2010-12-20 21:30:54 +00:00
Owen Anderson	d83f98a51e	More LVI cleanups, including trying to simplify the process of maintaining the OverDefinedCache. llvm-svn: 122256	2010-12-20 19:33:41 +00:00
Owen Anderson	64c2c5798a	Reuse the reference into the LVI cache throughout the solver subsystem. This is much easier to verify as being safe thanks to its recent de-recursivization. llvm-svn: 122254	2010-12-20 18:18:16 +00:00
Duncan Sands	ed6d6c33dd	Have SimplifyBinOp dispatch Xor, Add and Sub to the corresponding methods (they had just been forgotten before). Adding Xor causes "main" in the existing testcase 2010-11-01-lshr-mask.ll to be hugely more simplified. llvm-svn: 122245	2010-12-20 14:47:04 +00:00
Nick Lewycky	55a700b0cf	Make LazyValueInfo non-recursive. llvm-svn: 122120	2010-12-18 01:00:40 +00:00
Nate Begeman	7aa18bf46a	Add vector versions of some existing scalar transforms to aid codegen in matching psign & pblend operations to the IR produced by clang/gcc for their C idioms. llvm-svn: 122105	2010-12-17 23:12:19 +00:00
Dan Gohman	91ab4ffd96	Update a comment. llvm-svn: 121946	2010-12-16 02:55:10 +00:00
Dan Gohman	e1a17a3473	Make memcpyopt TBAA-aware. llvm-svn: 121944	2010-12-16 02:51:19 +00:00
Dan Gohman	2c9d342f04	Enable TBAA by default. llvm-svn: 121923	2010-12-15 23:58:44 +00:00
Dan Gohman	05b18f143f	Reapply r121886, and also update DecomposeGEPExpression to keep it in sync. llvm-svn: 121895	2010-12-15 20:49:55 +00:00
Dan Gohman	d02b65982e	Revert r121886. DecomposeGEPExpression needs to be kept in sync. llvm-svn: 121892	2010-12-15 20:39:25 +00:00
Dan Gohman	949ab7889c	Strengthen GetUnderlyingObject using InstructionSimplify. While LLVM's main design is that analysis code shouldn't go out of its way to understand code which hasn't been InstCombined, analysis utility routines like this can find themselves being called in the middle of transform passes when instcombine hasn't had a chance to run. llvm-svn: 121886	2010-12-15 20:10:26 +00:00
Dan Gohman	a4fcd2418d	Move Value::getUnderlyingObject to be a standalone function so that it can live in Analysis instead of VMCore. llvm-svn: 121885	2010-12-15 20:02:24 +00:00
Nick Lewycky	11678bd299	Clean up some of LVI: * mergeIn now uses constant folding for constants that are provably not-equal. * sink some sanity checks from the get() methods into the mark() methods, to ensure that we never have a constant/notconstant ConstantInt * some textual cleanups, whitespace changes, removing "else" after return, that sort of thing. llvm-svn: 121877	2010-12-15 18:57:18 +00:00
Duncan Sands	0a2c416894	Move Sub simplifications and additional Add simplifications out of instcombine and into InstructionSimplify. llvm-svn: 121861	2010-12-15 14:07:39 +00:00
Duncan Sands	019a418808	If we detect that the instruction we are simplifying is unreachable, arrange for it to be replaced by undef rather than not replaced at all, the idea being that this may reduce the amount of work done by whoever called InstructionSimplify. llvm-svn: 121860	2010-12-15 11:02:22 +00:00
Dan Gohman	3cb55a1d23	Update a comment. llvm-svn: 121727	2010-12-13 22:53:18 +00:00
Dan Gohman	c4bf5cac9f	Reapply r121520, PartialAlias implementation for BasicAA, now that memdep is updated to handle it. llvm-svn: 121725	2010-12-13 22:50:24 +00:00
Dan Gohman	ba5d0abe39	Update memdep to handle PartialAlias as MayAlias. llvm-svn: 121723	2010-12-13 22:47:57 +00:00
Tobias Grosser	f3e1ada522	Remove useless dynamic_cast<>(). Thanks Peter for pointing me to something that should have never been committed to the llvm code base. llvm-svn: 121648	2010-12-12 21:58:28 +00:00
Dan Gohman	39de62348f	Revert r121520, which may have introduced miscompilations. llvm-svn: 121573	2010-12-10 21:48:28 +00:00
Dan Gohman	041f74e762	Implement PartialAlias checking in BasicAA. llvm-svn: 121520	2010-12-10 20:47:03 +00:00
Dan Gohman	704e7c2332	Minimally update this code to handle PartialAlias. llvm-svn: 121518	2010-12-10 20:14:49 +00:00
Dan Gohman	201acdb6db	Use PartialAlias to do better noalias lint checking. llvm-svn: 121514	2010-12-10 20:04:06 +00:00
Dan Gohman	4431e31df0	Teach AliasAnalysisCounter about PartialAlias. llvm-svn: 121513	2010-12-10 19:53:05 +00:00
Dan Gohman	105d60a5ef	Teach AliasAnalysisEvaluator about PartialAlias. llvm-svn: 121512	2010-12-10 19:52:40 +00:00
Dan Gohman	fb0a3754f5	Update this code to handle PartialAlias as MayAlias. llvm-svn: 121508	2010-12-10 19:40:47 +00:00
Owen Anderson	c7ed4dc932	Take the first step towards making LVI non-recursive: get rid of the LVIQuery abstraction. llvm-svn: 121357	2010-12-09 06:14:58 +00:00
Devang Patel	8817135cb9	Use type's file info while describing inheritance relationship. llvm-svn: 121289	2010-12-08 21:46:37 +00:00
Devang Patel	b68c6231e9	Add support to create debug info for functions and methods. llvm-svn: 121281	2010-12-08 20:42:44 +00:00
Devang Patel	81c3c87717	Add support to create class type. llvm-svn: 121279	2010-12-08 20:18:20 +00:00
Devang Patel	89ea4f27a8	Add support to create vector, array, enums etc... llvm-svn: 121224	2010-12-08 01:50:15 +00:00
Devang Patel	dd261afdd9	Global variable does not need linkage name. llvm-svn: 121212	2010-12-08 00:06:22 +00:00
Devang Patel	63f83cd861	Add support to create local variable's debug info. llvm-svn: 121211	2010-12-07 23:58:00 +00:00
Devang Patel	746660fc7b	Add support to create variables, structs etc.. using DIBuilder. This is still work in progress. llvm-svn: 121205	2010-12-07 23:25:47 +00:00
Jay Foad	583abbc4df	PR5207: Change APInt methods trunc(), sext(), zext(), sextOrTrunc() and zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method trunc(), to be const and to return a new value instead of modifying the object in place. llvm-svn: 121120	2010-12-07 08:25:19 +00:00
Jakob Stoklund Olesen	8bdfb0c166	Also ignore '()' while creating mdnode name from ObjC symbol name. llvm-svn: 120856	2010-12-03 23:40:45 +00:00
Devang Patel	f0227ccf3f	Ignore '+' while creating mdnode name from ObjC symbol name. llvm-svn: 120853	2010-12-03 23:29:30 +00:00
Jay Foad	25a5e4ca1f	PR5207: Rename overloaded APInt methods set(), clear(), flip() to setAllBits(), setBit(unsigned), etc. llvm-svn: 120564	2010-12-01 08:53:58 +00:00
Chris Lattner	e28618de59	move GetPointerBaseWithConstantOffset out of GVN into ValueTracking.h llvm-svn: 120476	2010-11-30 22:25:26 +00:00
Jay Foad	15084f085d	PR5207: Make APInt::set(), APInt::clear() and APInt::flip() return void. llvm-svn: 120413	2010-11-30 09:02:01 +00:00
Chris Lattner	d540a5d842	strength reduce this. llvm-svn: 120381	2010-11-30 01:56:13 +00:00
Chris Lattner	afbc0c2b8c	getLocationForDest should work for memset as well. llvm-svn: 120380	2010-11-30 01:48:20 +00:00
Chris Lattner	90c4947df7	enhance basicaa to return "Mod" for a memcpy call when the queried location doesn't overlap the source, and add a testcase. llvm-svn: 120370	2010-11-30 00:43:16 +00:00
Chris Lattner	9a146372b5	Teach basicaa that memset's modref set is at worst "mod" and never contains "ref". Enhance DSE to use a modref query instead of a store-specific hack to generalize the "ignore may-alias stores" optimization to handle memset and memcpy. llvm-svn: 120368	2010-11-30 00:28:45 +00:00
Frits van Bommel	a98214de10	Teach ConstantFoldInstruction() how to fold insertvalue and extractvalue. llvm-svn: 120316	2010-11-29 20:36:52 +00:00
Michael J. Spencer	447762da85	Merge System into Support. llvm-svn: 120298	2010-11-29 18:16:10 +00:00
Chandler Carruth	abcab28f9b	Add some dead stores to pacify my least favorite GCC warning: may be uninitialized. The warning is terrible, has incorrect source locations, and has a huge false positive rate such as all of these. If anyone has a better solution, please let me know. Alternatively, I'll happily add -Wno-uninitialized to the -Werror build mode. Maybe I can even do it only when building with GCC instead of Clang. llvm-svn: 120281	2010-11-29 01:41:13 +00:00
Duncan Sands	a021988d64	Expand a little on the description of what InstructionSimplify does. llvm-svn: 120016	2010-11-23 10:50:08 +00:00
Duncan Sands	763dec0ab8	Clarify that constant folding of instructions applies when all operands are constant. There was in fact one exception to this (phi nodes) - so remove that exception (InstructionSimplify handles this so there should be no loss). llvm-svn: 120015	2010-11-23 10:16:18 +00:00
Duncan Sands	c133c54426	If a GEP index simply advances by multiples of a type of zero size, then replace the index with zero. llvm-svn: 119974	2010-11-22 16:32:50 +00:00
Duncan Sands	8a0f486e36	Move the "gep undef" -> "undef" transform from instcombine to InstructionSimplify. llvm-svn: 119970	2010-11-22 13:42:49 +00:00
Benjamin Kramer	585dfa2b3d	Initialize MemDep's TD member so buildbots don't trip over an uninitialized pointer (TD is passed to PHITransAddr). I wonder why this didn't explode earlier. llvm-svn: 119944	2010-11-21 15:21:46 +00:00
Duncan Sands	cf4bceba49	Add a rather pointless InstructionSimplify transform, inspired by recent constant folding improvements: if P points to a type of size zero, turn "gep P, N" into "P". More generally, if a gep index type has size zero, instcombine could replace the index with zero, but that is not done here. llvm-svn: 119942	2010-11-21 13:53:09 +00:00
Duncan Sands	1f86be9164	Fix spelling. llvm-svn: 119941	2010-11-21 12:43:13 +00:00
Chris Lattner	6ce038082b	apply Dan's fix for PR8268 which allows constant folding to handle indexes over zero sized elements. This allows us to compile: #include <string> void foo() { std::string s; } into an empty function. llvm-svn: 119933	2010-11-21 08:39:01 +00:00
Chris Lattner	663ba91cc6	add "getLocation" method to AliasAnalysis for getting the source and destination location of a memcpy/memmove. I'm not clear about whether TBAA works on these, so I'm leaving it out for now. Dan, please revisit this when convenient. llvm-svn: 119928	2010-11-21 07:51:27 +00:00
Chris Lattner	e48c31ce33	implement PR8576, deleting dead stores with intervening may-alias stores. llvm-svn: 119927	2010-11-21 07:34:32 +00:00
Benjamin Kramer	ddd1b7b801	Simplify code. No change in functionality. llvm-svn: 119908	2010-11-20 18:43:35 +00:00
Benjamin Kramer	c77ebcc9a5	Silence warning about an uninitialized variable. llvm-svn: 119800	2010-11-19 11:37:26 +00:00
Duncan Sands	b238de0415	Remove threading of Xor over selects and phis, with an explanation of why such threading is pointless. llvm-svn: 119798	2010-11-19 09:20:39 +00:00
Duncan Sands	aef146b890	Factor code for testing whether replacing one value with another preserves LCSSA form out of ScalarEvolution and into the LoopInfo class. Use it to check that SimplifyInstruction simplifications are not breaking LCSSA form. Fixes PR8622. llvm-svn: 119727	2010-11-18 19:59:41 +00:00
Dan Gohman	f1ebfc1544	Strip trailing whitespace. llvm-svn: 119706	2010-11-18 17:06:31 +00:00
Dan Gohman	0ab28b62b1	Use llvm_unreachable for "impossible" situations. llvm-svn: 119705	2010-11-18 17:05:57 +00:00
Dan Gohman	2e1fc849b2	Add support for PHI-translating sext, zext, and trunc instructions, enabling more PRE. PR8586. llvm-svn: 119704	2010-11-18 17:05:13 +00:00
Dan Gohman	8ea83d81e0	Introduce memoization for ScalarEvolution dominates and properlyDominates queries, and SCEVExpander getRelevantLoop queries. llvm-svn: 119595	2010-11-18 00:34:22 +00:00
Dan Gohman	7e6b393e66	Factor out the code for purging a SCEV from all the various memoization maps. Some of these maps may merge in the future, but for now it's convenient to have a utility function for them. llvm-svn: 119587	2010-11-17 23:28:48 +00:00
Dan Gohman	7ee1bbb76c	Merge the implementations of isLoopInvariant and hasComputableLoopEvolution, and memoize the results. This improves compile time in code with highly complex expressions which get queried many times. llvm-svn: 119584	2010-11-17 23:21:44 +00:00
Dan Gohman	534749bf70	Make SCEV::getType() and SCEV::print non-virtual. Move SCEV::hasOperand to ScalarEvolution. Delete SCEV::~SCEV. SCEV is no longer virtual. llvm-svn: 119578	2010-11-17 22:27:42 +00:00
Dan Gohman	20d9ce21ef	Move SCEV::dominates and properlyDominates to ScalarEvolution. llvm-svn: 119570	2010-11-17 21:41:58 +00:00
Dan Gohman	afd6db9932	Move SCEV::isLoopInvariant and hasComputableLoopEvolution to be member functions of ScalarEvolution, in preparation for memoization and other optimizations. llvm-svn: 119562	2010-11-17 21:23:15 +00:00
Duncan Sands	39d77131a1	Before replacing a phi node with a different value, it needs to be checked that this won't break LCSSA form. Change the existing checking method to a more direct one: rather than seeing if all predecessors belong to the loop, check that the replacing value is either not in any loop or is in a loop that contains the phi node. llvm-svn: 119556	2010-11-17 20:49:12 +00:00
Dan Gohman	d3a32ae4c8	Verify SCEVAddRecExpr's invariant in ScalarEvolution::getAddRecExpr instead of in SCEVAddRecExpr's constructor, in preparation for an upcoming change. llvm-svn: 119554	2010-11-17 20:48:38 +00:00
Dan Gohman	ed75631743	Fix ScalarEvolution's range memoization to avoid using a default ctor with ConstantRange. llvm-svn: 119550	2010-11-17 20:23:08 +00:00
Duncan Sands	c89ac07e7a	Move some of those Xor simplifications which don't require creating new instructions out of InstCombine and into InstructionSimplify. While there, introduce an m_AllOnes pattern to simplify matching with integers and vectors with all bits equal to one. llvm-svn: 119536	2010-11-17 18:52:15 +00:00
Duncan Sands	ec7a6ecb92	Now that hasConstantValue has been made simpler, it may return the phi node itself if it occurs in an unreachable basic block. Protect against this. Hopefully this will fix some more buildbots. llvm-svn: 119493	2010-11-17 10:23:23 +00:00
Duncan Sands	64e41cf865	Previously SimplifyInstruction could report that an instruction simplified to itself (this can only happen in unreachable blocks). Change it to return null instead. Hopefully this will fix some buildbot failures. llvm-svn: 119490	2010-11-17 08:35:29 +00:00
Duncan Sands	7412f6e53d	Fix a layering violation: hasConstantValue, which is part of the PHINode class, uses DominatorTree which is an analysis. This change moves all of the tricky hasConstantValue logic to SimplifyInstruction, and replaces it with a very simple literal implementation. I already taught users of hasConstantValue that need tricky stuff to use SimplifyInstruction instead. I didn't update InlineFunction because the IR looks like it might be in a funky state at the point it calls hasConstantValue, which makes calling SimplifyInstruction dangerous since it can in theory do a lot of tricky reasoning. This may be a pessimization, for example in the case where all phi node operands are either undef or a fixed constant. llvm-svn: 119459	2010-11-17 04:30:22 +00:00
Duncan Sands	d06f50e2db	Have ScalarEvolution use SimplifyInstruction rather than hasConstantValue. While there, add a note about an inefficiency I noticed. llvm-svn: 119458	2010-11-17 04:18:45 +00:00
Dan Gohman	761065e3b7	Memoize results from ScalarEvolution's getUnsignedRange and getSignedRange. This fixes some extreme compile times on unrolled sha512 code. llvm-svn: 119455	2010-11-17 02:44:44 +00:00
Duncan Sands	5ffc298bc7	In which I discover the existence of loops. Threading an operation over a phi node by applying it to each operand may be wrong if the operation and the phi node are mutually interdependent (the testcase has a simple example of this). So only do this transform if it would be correct to perform the operation in each predecessor of the block containing the phi, i.e. if the other operands all dominate the phi. This should fix the FFMPEG snow.c regression reported by İsmail Dönmez. llvm-svn: 119347	2010-11-16 12:16:38 +00:00
Duncan Sands	f12ba1dfe1	Teach InstructionSimplify the trick of skipping incoming phi values that are equal to the phi itself. llvm-svn: 119161	2010-11-15 17:52:45 +00:00
Duncan Sands	b99f39b9f6	If dom tree information is available, make it possible to pass it to get better phi node simplification. llvm-svn: 119055	2010-11-14 18:36:10 +00:00
Duncan Sands	4581ddc123	Teach InstructionSimplify about phi nodes. I chose to have it simply offload the work to hasConstantValue rather than do something more complicated (such as handling mutually recursive phis) because (1) it is not clear it is worth it; and (2) if it is worth it, maybe such logic would be better placed in hasConstantValue. Adjust some GVN tests which are now cleaned up much further (eg: all phi nodes are removed). llvm-svn: 119043	2010-11-14 13:30:18 +00:00
Duncan Sands	1d27f01210	Boost the power of phi node constant folding slightly: if all operands are the phi node itself or undef, then return undef. This logic already existed at a higher level so in practice it shouldn't make the slightest difference. Note that this code could be replaced by a call to PN->hasConstantValue(). However since we bail out the moment we see a non-constant operand, it is more efficient to have a specialized version of that logic. llvm-svn: 119041	2010-11-14 12:53:18 +00:00
Duncan Sands	7e800d6f9c	Strip trailing whitespace. llvm-svn: 119038	2010-11-14 11:23:23 +00:00
Duncan Sands	e5ac78e16e	Fix typo pointed out by Trevor Harmon. llvm-svn: 119001	2010-11-13 12:16:27 +00:00
Dan Gohman	970afd926f	Re-disable TBAA for now; it broke MultiSource/Applications/JM/lencod, at least. llvm-svn: 118890	2010-11-12 11:21:08 +00:00
Dan Gohman	ea18d8ec2d	Enable TBAA. llvm-svn: 118884	2010-11-12 06:20:01 +00:00
Dan Gohman	65316d6749	Add helper functions for computing the Location of load, store, and vaarg instructions. llvm-svn: 118845	2010-11-11 21:50:19 +00:00
Dan Gohman	468638826e	Don't forget the TBAA info, if available. llvm-svn: 118842	2010-11-11 21:27:26 +00:00
Dan Gohman	7dacf8f3f3	Avoid calling alias on non-pointer values. llvm-svn: 118822	2010-11-11 19:23:51 +00:00
Dan Gohman	c87c843db7	It's not necessary to clear out the Size and TBAATag at each of these points. llvm-svn: 118752	2010-11-11 00:42:22 +00:00
Dan Gohman	8bf3d832e5	Set NonLocalDepInfo's Size field to UnknownSize when invalidating it, so that it doesn't appear to be a known size. llvm-svn: 118748	2010-11-11 00:20:27 +00:00
Dan Gohman	6791936848	When clearing a non-local pointer dependency cache entry, clear the reverse map too. This fixes selfhost build errors. llvm-svn: 118729	2010-11-10 22:35:02 +00:00
Devang Patel	364bf04267	Take care of special characters while creating named MDNode name to hold function specific local variable's info. This fixes radar 8653152. I am checking in testcase as a separate check-in. llvm-svn: 118726	2010-11-10 22:19:21 +00:00
Dan Gohman	1d760ce8b3	Factor out the code for computing an AliasAnalysis::Location for a given instruction into a helper function. llvm-svn: 118723	2010-11-10 21:51:35 +00:00
Dan Gohman	2e8ca44b81	Fully invalidate cached results when a prior query's size or type is insufficient for, or incompatible with, the current query. llvm-svn: 118721	2010-11-10 21:45:11 +00:00
Duncan Sands	8f7220e9fd	Reduce the maximum recursion depth, 5 seems pointlessly too much. Probably it should just be 1, but compromise with 3. llvm-svn: 118718	2010-11-10 20:53:24 +00:00
Dan Gohman	0a6021a54d	Enhance GVN to do more precise alias queries for non-local memory references. For example, this allows gvn to eliminate the load in this example: void foo(int n, int* p, int q) { p[0] = 0; p[1] = 1; if (n) { q = p[0]; } } llvm-svn: 118714	2010-11-10 20:37:15 +00:00
Duncan Sands	f3b1bf1606	Teach InstructionSimplify how to look through PHI nodes. Since PHI nodes can be used in loops, this could result in infinite looping if there is no recursion limit, so add such a limit. It is also used for the SelectInst case because in theory there could be an infinite loop there too if the basic block is unreachable. llvm-svn: 118694	2010-11-10 18:23:01 +00:00
Dan Gohman	066c1bb1e9	Add a doesAccessArgPointees helper function, and update code to use it, and to be consistent. llvm-svn: 118692	2010-11-10 18:17:28 +00:00
Duncan Sands	b0579e9d3f	Simplify binary operations where one operand is a select instruction. The simplifications performed here never create new instructions, they only return existing instructions (or a constant), and so are always a win. In theory they should transform (for example) %z = and i32 %x, %y %s = select i1 %cond, i32 %y, i32 %z %r = and i32 %x, %s into %r = and i32 %x, %y but in practice they get into a fight with instcombine, and lose. Unfortunately instcombine does a poor job in this case. Nonetheless I'm committing this transform to make it easier to discuss what to do to make peace with instcombine. llvm-svn: 118679	2010-11-10 13:00:08 +00:00
Dan Gohman	2694e14087	Make ModRefBehavior a lattice. Use this to clean up AliasAnalysis chaining and simplify FunctionAttrs' GetModRefBehavior logic. llvm-svn: 118660	2010-11-10 01:02:18 +00:00
Dan Gohman	88ff1ece63	VAArg doesn't capture its operand. llvm-svn: 118623	2010-11-09 20:09:35 +00:00
Dan Gohman	5d06f892ef	Teach AliasAnalysis about AccessesArgumentsReadonly. llvm-svn: 118621	2010-11-09 20:06:55 +00:00
Dan Gohman	0f17507478	Teach LICM and AliasSetTracker about AccessesArgumentsReadonly. llvm-svn: 118618	2010-11-09 19:58:21 +00:00
Duncan Sands	fc5ad3f0f9	Factorize code, no functionality change. llvm-svn: 118516	2010-11-09 17:25:51 +00:00
Dan Gohman	142ff82a18	Re-introduce the MaxLookup limit to BasicAliasAnalysis' pointsToConstantMemory code to guard against possible compile time slowdowns. llvm-svn: 118440	2010-11-08 20:26:19 +00:00
Dan Gohman	601c94b309	Implement getModRefBehavior for TypeBasedAliasAnalysis. llvm-svn: 118416	2010-11-08 17:10:22 +00:00
Dan Gohman	9130bad71f	Extend the AliasAnalysis::pointsToConstantMemory interface to allow it to optionally look for constant or local (alloca) memory. Teach BasicAliasAnalysis::pointsToConstantMemory to look through Select and Phi nodes, and to support looking for local memory. Remove FunctionAttrs' PointsToLocalOrConstantMemory function, now that AliasAnalysis knows all the tricks that it knew. llvm-svn: 118412	2010-11-08 16:45:26 +00:00
Dan Gohman	0b56778d65	Delete getIntrinsicModRefBehavior. Clients can just use the normal getModRefBehavior now, since it now understands intrinsics as well as normal functions. llvm-svn: 118411	2010-11-08 16:11:19 +00:00
Dan Gohman	e461d7d135	Teach BasicAliasAnalysis::getModRefBehavior(const Function *F) to analyze intrinsic functions. llvm-svn: 118409	2010-11-08 16:08:43 +00:00
Duncan Sands	a620bd1fa3	Add simplification of floating point comparisons with the result of a select instruction, the same as already exists for integer comparisons. llvm-svn: 118379	2010-11-07 16:46:25 +00:00
Duncan Sands	f532d31198	Fix a README item: when doing a comparison with the result of a select instruction, see if doing the compare with the true and false values of the select gives the same result. If so, that can be used as the value of the comparison. llvm-svn: 118378	2010-11-07 16:12:23 +00:00
Benjamin Kramer	ed8b7bf9ed	Use arrays instead of constant-sized SmallVectors. llvm-svn: 118257	2010-11-04 18:45:27 +00:00
Devang Patel	57c5a20364	Introduce DIBuilder. It is intended to be a front-end friendly interface to emit debugging information entries in LLVM IR. To create debugging information for a pointer, using DIBuilder front-end just needs DBuilder.CreatePointerType(Ty, Size); instead of DebugFactory.CreateDerivedType(llvm::dwarf::DW_TAG_pointer_type, TheCU, "", getOrCreateMainFile(), 0, Size, 0, 0, 0, OCTy); llvm-svn: 118248	2010-11-04 15:01:38 +00:00
Devang Patel	415c551459	Fix DIType verifier. The element 3 is DIFile now. llvm-svn: 118054	2010-11-02 20:41:13 +00:00
Dan Gohman	dcb354b234	Make ScalarEvolution::forgetLoop forget all contained loops too, because they may have ValuesAtScopes map entries referencing their outer loops. This fixes a use-after-free reported in PR8471. llvm-svn: 117698	2010-10-29 20:16:10 +00:00
Dan Gohman	15a43965ac	Teach memdep to use pointsToConstantMemory to determine that loads from constant memory don't alias any stores. llvm-svn: 117636	2010-10-29 01:14:04 +00:00
Dan Gohman	c6096263e2	Support TBAA attachments on calls. This is somewhat experimental. llvm-svn: 117317	2010-10-25 21:38:20 +00:00
Dan Gohman	82b2e0da9c	Fix chaining in TBAA's pointsToConstantMemory. llvm-svn: 117314	2010-10-25 21:24:55 +00:00
Dan Gohman	e6715d0755	Only read one bit for testing for a readonly type, leaving the other bits open for future uses. llvm-svn: 117301	2010-10-25 20:22:29 +00:00
Dan Gohman	fd864a1d31	Add a comment. llvm-svn: 117288	2010-10-25 19:47:25 +00:00
Dan Gohman	abaf2d8d3b	Update comments; BasicAA is no longer necessarily the end of the chain. llvm-svn: 117268	2010-10-25 16:29:52 +00:00
Dan Gohman	1033ce669b	Reintroduce these asserts, now that BasicAA is a normal AliasAnalysis pass. llvm-svn: 117266	2010-10-25 16:28:57 +00:00
Benjamin Kramer	9192e7ab12	Make some symbols static, move classes into anonymous namespaces. llvm-svn: 117111	2010-10-22 17:35:07 +00:00
Dan Gohman	8512270dbc	Add some more documentation. llvm-svn: 117070	2010-10-21 21:55:35 +00:00
Dan Gohman	12c9e0cf1c	Explain what "constant" means here. llvm-svn: 117053	2010-10-21 19:45:09 +00:00
Dan Gohman	104f1812ce	Update comments. llvm-svn: 117048	2010-10-21 19:01:22 +00:00
Dan Gohman	1b85604130	Memdep says that an instruction clobbers itself when it means there is no specific clobber instruction. llvm-svn: 116960	2010-10-20 22:37:41 +00:00
Dan Gohman	a2ab75bc8d	Factor out the main aliasing check into a separate function. llvm-svn: 116958	2010-10-20 22:11:14 +00:00
Dan Gohman	2549d0cf64	Fix comments; the type graph is currently a tree, not a DAG. llvm-svn: 116954	2010-10-20 22:02:58 +00:00
Tobias Grosser	23c8341c3d	Add RegionPass support. A RegionPass is executed like a LoopPass but on the regions detected by the RegionInfo pass instead of the loops detected by the LoopInfo pass. llvm-svn: 116905	2010-10-20 01:54:44 +00:00
Douglas Gregor	48b4568718	Fix CMake build llvm-svn: 116903	2010-10-20 01:36:56 +00:00
Dan Gohman	da85ed8541	Move NoAA out of BasicAliasAnalysis.cpp into its own file, now that it doesn't have a special relationship with BasicAliasAnalysis anymore. llvm-svn: 116876	2010-10-19 23:09:08 +00:00
Dan Gohman	f372cf869b	Reapply r116831 and r116839, converting AliasAnalysis to use uint64_t, plus fixes for places I missed before. llvm-svn: 116875	2010-10-19 22:54:46 +00:00
Dan Gohman	b4aa503501	Revert r116831 and r116839, which are breaking selfhost builds. llvm-svn: 116858	2010-10-19 21:06:16 +00:00
Dan Gohman	f4c5fe73be	Change AliasAnalysis and its clients to use uint64_t instead of unsigned for representing object sizes, for consistency with other parts of LLVM. llvm-svn: 116831	2010-10-19 18:00:02 +00:00
Owen Anderson	6c18d1aac0	Get rid of static constructors for pass registration. Instead, every pass exposes an initializeMyPassFunction(), which must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize the pass's dependencies. Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h before parsing commandline arguments. I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass registration/creation, please send the testcase to me directly. llvm-svn: 116820	2010-10-19 17:21:58 +00:00
Dan Gohman	14fe8cf238	Consistently use AliasAnalysis::UnknownSize instead of hardcoding ~0u. llvm-svn: 116815	2010-10-19 17:06:23 +00:00
Dan Gohman	e4a82e2f21	Make the representation of AliasSets explicitly differentiate between "not known yet" and "known no tbaa info" so that it can merge them properly. llvm-svn: 116767	2010-10-18 23:31:47 +00:00
Dan Gohman	408beac597	Don't pass the raw invalid pointer used to represent conflicting TBAA information to AliasAnalysis. llvm-svn: 116751	2010-10-18 21:28:00 +00:00
Dan Gohman	71af9db0e8	Make AliasSetTracker TBAA-aware, enabling TBAA-enabled LICM. llvm-svn: 116743	2010-10-18 20:44:50 +00:00
Dan Gohman	f3702452c8	Fix BasicAA to pass TBAAInfo through to the chained analysis. llvm-svn: 116730	2010-10-18 18:45:11 +00:00
Dan Gohman	33fcde9b9c	Make TypeBasedAliasAnalysis default to doing nothing, with a command-line option to enable it. llvm-svn: 116722	2010-10-18 18:17:47 +00:00
Dan Gohman	f0a3bed6d6	Use chaining in TypeBasedAliasAnalysis::pointsToConstantMemory. llvm-svn: 116721	2010-10-18 18:10:31 +00:00
Dan Gohman	02538ac4d3	Make BasicAliasAnalysis a normal AliasAnalysis implementation which does normal initialization and normal chaining. Change the default AliasAnalysis implementation to NoAlias. Update StandardCompileOpts.h and friends to explicitly request BasicAliasAnalysis. Update tests to explicitly request -basicaa. llvm-svn: 116720	2010-10-18 18:04:47 +00:00
Benjamin Kramer	1dc34b48dd	Eliminate some calls to Value::getNameStr. llvm-svn: 116670	2010-10-16 11:28:23 +00:00
Dan Gohman	31a01ee3cb	Tolerate a null parent pointer. llvm-svn: 116533	2010-10-14 22:55:57 +00:00
Chris Lattner	698661c741	add uadd_ov/usub_ov to apint, consolidate constant folding logic to use the new APInt methods. Among other things this implements rdar://8501501 - llvm.smul.with.overflow.i32 should constant fold which comes from "clang -ftrapv", originally brought to my attention from PR8221. llvm-svn: 116457	2010-10-14 00:05:07 +00:00
Owen Anderson	c266a36625	Analysis groups need to initialize their default implementations. llvm-svn: 116441	2010-10-13 21:49:58 +00:00
Tobias Grosser	4b0986b6c1	Add Region::isTopLevelRegion(). llvm-svn: 116402	2010-10-13 11:02:44 +00:00
Tobias Grosser	4c71c117d1	RegionInfo: Fix trivial error that slipped in last minute. llvm-svn: 116400	2010-10-13 08:00:53 +00:00
Tobias Grosser	fe92a9384e	RegionInfo: Update RegionInfo after a BB was split. llvm-svn: 116398	2010-10-13 05:54:13 +00:00
Tobias Grosser	a8677226ab	RegionInfo: Add getExpandedRegion(). getExpandedRegion() enables us to create non-canonical regions. Those regions can be used to define the largest region that fulfills a certain property. llvm-svn: 116397	2010-10-13 05:54:11 +00:00
Tobias Grosser	648594c920	RegionInfo: Allow to update exit and entry of a region. llvm-svn: 116396	2010-10-13 05:54:10 +00:00
Tobias Grosser	bf984fd78e	RegionInfo: Enhance addSubregion. llvm-svn: 116395	2010-10-13 05:54:09 +00:00
Tobias Grosser	8352ce5f8d	RegionInfo: Allow to set the parent region of a basic block. llvm-svn: 116394	2010-10-13 05:54:07 +00:00
Tobias Grosser	e910b9d9cd	RegionInfo: Free the RegionNodes in cache. Contributed by: ether llvm-svn: 116380	2010-10-13 00:07:59 +00:00
Owen Anderson	8ac477ffb5	Begin adding static dependence information to passes, which will allow us to perform initialization without static constructors AND without explicit initialization by the client. For the moment, passes are required to initialize both their (potential) dependencies and any passes they preserve. I hope to be able to relax the latter requirement in the future. llvm-svn: 116334	2010-10-12 19:48:12 +00:00
Dan Gohman	a8d3a7f93d	Support AA chaining. llvm-svn: 116264	2010-10-11 23:39:34 +00:00
Kenneth Uildriks	b8d7efe785	Now using a variant of the existing inlining heuristics to decide whether to create a given specialization of a function in PartialSpecialization. If the total performance bonus across all callsites passing the same constant exceeds the specialization cost, we create the specialization. llvm-svn: 116158	2010-10-09 22:06:36 +00:00
Kenneth Uildriks	99463ca8cf	Start separating out code metrics into code size metrics and code performance metrics. Partial Specialization will apply the former to function specializations, and the latter to all callsites that can use a specialization, in order to decide whether to create a specialization llvm-svn: 116057	2010-10-08 13:57:31 +00:00
Owen Anderson	df7a4f2515	Now with fewer extraneous semicolons! llvm-svn: 115996	2010-10-07 22:25:06 +00:00
Owen Anderson	98eb3ec6c5	Add an implementation of the initialization routine for IPA. llvm-svn: 115947	2010-10-07 18:31:27 +00:00
Owen Anderson	6875c2ea26	Add initialization routines for Analysis and IPA. llvm-svn: 115946	2010-10-07 18:31:00 +00:00
Owen Anderson	82d38df40c	Fix a warning when building with clang++. llvm-svn: 115924	2010-10-07 17:04:18 +00:00
Owen Anderson	5e19bfcde3	Move the pass initialization helper functions into the llvm namespace, and add a header declaring them all. This is also where we will declare per-library pass-set initializer functions down the road. llvm-svn: 115900	2010-10-07 04:13:08 +00:00
Owen Anderson	af08ad4350	Appease the clang self-host buildbot by providing a correct instantiation. llvm-svn: 115857	2010-10-06 22:23:20 +00:00
Owen Anderson	ad8134f03b	Hide analysis group registration behind a macro, just like pass registration. llvm-svn: 115835	2010-10-06 21:02:27 +00:00
Devang Patel	9a33ec24eb	Add support for DW_TAG_unspecified_parameters. llvm-svn: 115833	2010-10-06 20:50:40 +00:00
Dan Gohman	8e4c19ac44	Don't add the operand count to SCEV uniquing data; FoldingSetNodeID already knows its own length, so this is redundant. llvm-svn: 115521	2010-10-04 17:24:08 +00:00
Devang Patel	bea08d1c85	Let FE mark a variable as artificial variable. llvm-svn: 115102	2010-09-29 23:07:21 +00:00
Devang Patel	95ae73c394	Generalize DISubprogram element to encode various flags instead of just one boolean for isArtificial. This is a backward compatible change. llvm-svn: 115084	2010-09-29 21:04:46 +00:00
Benjamin Kramer	923a8cf356	Remove PointerTracking from cmakelists … llvm-svn: 115076	2010-09-29 19:39:50 +00:00
Chris Lattner	af995f0ee5	remove PointerTracking from mainline, Edwin is going to move it out to ClamAV for LLVM 2.9 llvm-svn: 115062	2010-09-29 18:43:27 +00:00
Oscar Fuentes	b4b12535e8	Removed a bunch of unnecessary target_link_libraries. llvm-svn: 114999	2010-09-28 22:39:14 +00:00
Devang Patel	7a55481fa4	Provide an interface to let FEs anchor debug info for types. llvm-svn: 114969	2010-09-28 18:08:20 +00:00
Jakob Stoklund Olesen	1083573796	Don't try to constant fold libm functions with non-finite arguments. Usually we wouldn't do this anyway because llvm_fenv_testexcept would return an exception, but we have seen some cases where neither errno nor fenv detect an exception on arm-linux. llvm-svn: 114893	2010-09-27 21:29:20 +00:00
Dan Gohman	2348393cf5	Teach memdep about TBAA tags. llvm-svn: 114588	2010-09-22 21:41:02 +00:00
Benjamin Kramer	4b57204e80	Simplify code. llvm-svn: 114444	2010-09-21 16:41:29 +00:00
Benjamin Kramer	4021d906f1	Make CreateComplexVariable independent of SmallVector. llvm-svn: 114439	2010-09-21 16:00:03 +00:00
Jakob Stoklund Olesen	4a253e5ac8	Don't include <fenv.h> now that we have llvm/System/FEnv.h. llvm-svn: 114219	2010-09-17 21:47:03 +00:00
Dan Gohman	b48f904602	Attempt to support platforms which don't have fenv.h. llvm-svn: 114196	2010-09-17 20:06:27 +00:00
Dan Gohman	18fa17cf3d	Fix the folding of floating-point math library calls, like sin(infinity), so that it detects errors on platforms where libm doesn't set errno. It's still subject to host libm details though. llvm-svn: 114148	2010-09-17 01:38:06 +00:00
Dan Gohman	2fa59799d9	Add an #include of raw_ostream.h. Previously, this only compiled because it was using Twine.h's declaration of operator<<(const Twine &). llvm-svn: 114141	2010-09-17 00:33:43 +00:00
Benjamin Kramer	d61e3833a3	Update CMake build. llvm-svn: 114128	2010-09-16 23:06:18 +00:00
Dan Gohman	f4925061af	Rename a variable to avoid a declaration conflict. llvm-svn: 114126	2010-09-16 22:50:09 +00:00
Dan Gohman	ee74402fe6	Add a pass which prints out all the memdep dependencies. llvm-svn: 114121	2010-09-16 22:08:32 +00:00
Owen Anderson	c33cdcfd80	Revert r114097, adding back in the assertion against replacing an Instruction by itself. Now that CorrelatedValuePropagation is more careful not to call SimplifyInstructionsInBlock() on an unreachable block, the issue has been fixed at a higher level. Add a big warning to SimplifyInstructionsInBlock() to hopefully prevent this in the future. llvm-svn: 114117	2010-09-16 20:51:41 +00:00
Owen Anderson	140296f5c0	It is possible, under specific circumstances involving ptrtoint ConstantExpr's, for LVI to end up trying to merge a Constant into a ConstantRange. Handle this conservatively for now, rather than asserting. The testcase is more complex than I would like, but the manifestation of the problem is sensitive to iteration orders and the state of the LVI cache, and I have not been able to reproduce it with manually constructed or simplified cases. Fixes PR8162. llvm-svn: 114103	2010-09-16 18:28:33 +00:00
Owen Anderson	94532cb297	Fix PR8161, in which an unreachable loop causes recursive instruction simplification to try to replace an instruction with itself. Add a predicate to the simplifier to prevent this case. llvm-svn: 114097	2010-09-16 17:42:36 +00:00
Eli Friedman	ab3a128582	PR7959: Handle negative scales in GEPs correctly in BasicAA for non-64-bit targets. llvm-svn: 114015	2010-09-15 20:08:03 +00:00
Dan Gohman	e0386dbef1	Convert TBAA to use the new TBAATag field of AliasAnalysis::Location. llvm-svn: 113892	2010-09-14 23:28:12 +00:00
Dan Gohman	41f14cf3e9	Remove the experimental AliasAnalysis::getDependency interface, which isn't a good level of abstraction for memdep. Instead, generalize AliasAnalysis::alias and related interfaces with a new Location class for describing a memory location. For now, this is the same Pointer and Size as before, plus an additional field for a TBAA tag. Also, introduce a fixed MD_tbaa metadata tag kind. llvm-svn: 113858	2010-09-14 21:25:10 +00:00
Michael J. Spencer	93c9b2ea93	Revert "CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally." This reverts commit r113632 Conflicts: cmake/modules/AddLLVM.cmake llvm-svn: 113819	2010-09-13 23:59:48 +00:00
Benjamin Kramer	8c35fb0739	Teach InstructionSimplify to fold (A & B) & A -> A & B and (A | B) | A -> A | B. Reassociate does this but it doesn't catch all cases (e.g. if the operands are i1). llvm-svn: 113651	2010-09-10 22:39:55 +00:00
Michael J. Spencer	dc38d36ccb	CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally. llvm-svn: 113632	2010-09-10 21:14:25 +00:00
Owen Anderson	04cf3fd761	What the loop unroller cares about, rather than just not unrolling loops with calls, is not unrolling loops that contain calls that would be better off getting inlined. This mostly comes up when an interleaved devirtualization pass has devirtualized a call which the inliner will inline on a future pass. Thus, rather than blocking all loops containing calls, add a metric for "inline candidate calls" and block loops containing those instead. llvm-svn: 113535	2010-09-09 20:32:23 +00:00
Dan Gohman	1c5be00ec7	Extend the getDependence query with support for PHI translation. llvm-svn: 113521	2010-09-09 18:37:31 +00:00
Owen Anderson	a08318acb2	Refactor code-size reduction estimation methods out of InlineCostAnalyzer and into CodeMetrics. They don't use any InlineCostAnalyzer state, and are useful for other clients who don't necessarily want to use all of InlineCostAnalyzer's logic, some of which is fairly inlining-specific. No intended functionality change. llvm-svn: 113499	2010-09-09 16:56:42 +00:00
Dan Gohman	64d842ec72	Add a new experimental generalized dependence query interface to AliasAnalysis, and some code for implementing the new query on top of existing implementations by making standard alias and getModRefInfo queries. llvm-svn: 113329	2010-09-08 01:32:20 +00:00
Owen Anderson	a74fa15f32	Clean up some of the PassRegistry implementation, and pImpl-ize it to reduce #include clutter and exposing internal details. llvm-svn: 113252	2010-09-07 19:16:25 +00:00
Nick Lewycky	ad48e01eef	Add completely hokey binary-and and binary-or operations to ConstantRange and teach LazyValueInfo to use them. llvm-svn: 113196	2010-09-07 05:39:02 +00:00
Chris Lattner	a58edd1df3	cleanup some of the lifetime/invariant marker stuff, add a big fixme. llvm-svn: 113144	2010-09-06 03:58:04 +00:00
Chris Lattner	e34c835bde	speed up -gvn 3.4% on the testcase in PR7023 llvm-svn: 113135	2010-09-06 01:26:29 +00:00
Chris Lattner	da24b9a49a	pull a simple method out of LICM into a new Loop::hasLoopInvariantOperands method. Remove a useless and confusing Loop::isLoopInvariant(Instruction) method, which didn't do what you thought it did. No functionality change. llvm-svn: 113133	2010-09-06 01:05:37 +00:00
Chris Lattner	72d283c826	fix PR8063, a crash in globalopt in the malloc analysis code. llvm-svn: 113109	2010-09-05 17:20:46 +00:00
Chris Lattner	0963048185	dead method. llvm-svn: 113077	2010-09-04 18:19:16 +00:00
Chris Lattner	65b48b5dfc	zap dead code. llvm-svn: 113073	2010-09-04 18:12:00 +00:00
Dan Gohman	47bec3cb57	Disable the asserts that check that normalization is perfectly invertible. ScalarEvolution's folding routines don't always succeed in canonicalizing equal expressions to a single canonical form, and this can cause these asserts to fail, even though there's no actual correctness problem. This fixes PR8066. llvm-svn: 113021	2010-09-03 22:12:56 +00:00
Owen Anderson	c725462245	Add support for simplifying a load from a computed value to a load from a global when it is provable that they're equivalent. This fixes PR4855. llvm-svn: 112994	2010-09-03 19:08:37 +00:00
Chris Lattner	19199cce55	stop forcing a noop AssemblyAnnotationWriter to silence #uses comments, these don't happen anymore. llvm-svn: 112901	2010-09-02 23:03:10 +00:00
Owen Anderson	2912df072d	Remove incorrect and poorly tested code for trying to reason about values on default edges of switches. Just return the conservatively correct answer. llvm-svn: 112876	2010-09-02 22:16:52 +00:00
Owen Anderson	a8c896b704	Fix a bug in LazyValueInfo that CorrelatedValuePropagation exposed: In the LVI lattice, undef and the full set ConstantRange should not be treated as equivalent. llvm-svn: 112843	2010-09-02 18:23:58 +00:00
Dan Gohman	110ed64fbb	Revert 112442 and 112440 until the compile time problems introduced by 112440 are resolved. llvm-svn: 112692	2010-09-01 01:45:53 +00:00
Dan Gohman	47308d5da3	Reapply r112432, now that the real problem is addressed. llvm-svn: 112667	2010-08-31 22:53:17 +00:00
Dan Gohman	f01a5eed1e	Reapply r112433, now that the real problem is addressed. llvm-svn: 112666	2010-08-31 22:52:12 +00:00
Dan Gohman	aabfc52790	Revert r110916. This patch is buggy because the code inside the inner loop doesn't update all the variables in the outer loop. llvm-svn: 112665	2010-08-31 22:50:31 +00:00
Dan Gohman	90f29bcd90	Revert r112432. It appears to be exposing a problem in the emacs build. llvm-svn: 112638	2010-08-31 20:58:44 +00:00
Dan Gohman	444c24a9f0	Speculatively revert r112433. llvm-svn: 112608	2010-08-31 17:56:47 +00:00
Owen Anderson	9517943d11	It is possible to try to merge a not-constant with a constantrange, when dealing with ptrtoint ConstantExpr's. Unfortunately, the only testcase I have for this is huge and doesn't reduce well because the error is sensitive to iteration-order issues, since the problem only occurs when merging values in a particular order. llvm-svn: 112489	2010-08-30 17:03:45 +00:00
Benjamin Kramer	8548c892a8	Don't print two "0x" prefixes. Use a raw_ostream overload instead of llvm::format. llvm-svn: 112479	2010-08-30 14:46:53 +00:00
Chris Lattner	f58382ed87	two changes: 1) make AliasSet hold the list of call sites with an assertingvh so we get a violent explosion if the pointer dangles. 2) Fix AliasSetTracker::deleteValue to remove call sites with by-pointer comparisons instead of by-alias queries. Using findAliasSetForCallSite can cause alias sets to get merged when they shouldn't, and can also miss alias sets when the call is readonly. #2 fixes PR6889, which only repros with a .c file :( llvm-svn: 112452	2010-08-29 18:42:23 +00:00
Dan Gohman	3a08ed7904	Make IVUsers iterative instead of recursive. This has the side effect of reversing the order of most of IVUser's results. llvm-svn: 112442	2010-08-29 16:40:03 +00:00
Dan Gohman	d1da5cdfee	Restructure the {A,+,B}<L> * {C,+,D}<L> folding so that it folds all applicable addrecs before recursing on getMulExpr, instead of recursing on getMulExpr for each one. llvm-svn: 112433	2010-08-29 15:16:58 +00:00
Dan Gohman	3e6fc18943	Batch up subtracts along with adds, when analyzing long chains of operations. llvm-svn: 112432	2010-08-29 15:10:06 +00:00
Dan Gohman	7712d2900d	Micro-optimize GroupByComplexity. llvm-svn: 112431	2010-08-29 15:07:13 +00:00
Dan Gohman	0f2de01355	Hold AddRec->getLoop() in a variable, to make the Mul code more consistent with the Add code. llvm-svn: 112430	2010-08-29 14:55:19 +00:00
Dan Gohman	028c18158a	Rename a variable, for consistency. llvm-svn: 112429	2010-08-29 14:53:34 +00:00
Dan Gohman	28a84d4ba1	Use iterators instead of indices. llvm-svn: 112428	2010-08-29 14:52:02 +00:00
Chris Lattner	dc8070ed6d	when merging two alias sets, the result set is volatile if either of the sets is volatile. We were dropping the volatile bit of the merged in set, leading (luckily) to assertions in cases like PR7535. I cannot produce a testcase that repros with opt, but this is obviously correct. llvm-svn: 112402	2010-08-29 04:14:47 +00:00
Chris Lattner	eef6b19dcb	more cleanup llvm-svn: 112401	2010-08-29 04:13:43 +00:00
Chris Lattner	afb7074f18	clean this up llvm-svn: 112400	2010-08-29 04:06:55 +00:00
Dan Gohman	fe22f1d3cc	Fix an index calculation thinko. llvm-svn: 112337	2010-08-28 00:39:27 +00:00
Owen Anderson	38f6b7fe3b	Improve the precision of getConstant(). llvm-svn: 112323	2010-08-27 23:29:38 +00:00
Dan Gohman	15871f23e3	When merging adjacent operands, scan ahead and merge all equal adjacent operands at once, instead of just two at a time. llvm-svn: 112299	2010-08-27 21:39:59 +00:00
Dan Gohman	c866bf4fec	Make the {A,+,B}<L> + {C,+,D}<L> --> Other + {A+C,+,B+D}<L> transformation collect all the addrecs with the same loop and combine them at once rather than starting everything over at the first chance. llvm-svn: 112290	2010-08-27 20:45:56 +00:00
Dan Gohman	9bad2fb378	Switch ScalarEvolution's main Value->SCEV map from std::map to DenseMap. llvm-svn: 112281	2010-08-27 18:55:03 +00:00
Owen Anderson	6ebbd92380	Use LVI to eliminate conditional branches where we've tested a related condition previously. Update tests for this change. This fixes PR5652. llvm-svn: 112270	2010-08-27 17:12:29 +00:00
Dan Gohman	2706567c5c	Optimize SCEVComplexityCompare. Use a 3-way return instead of a 2-way return to avoid needing two calls to test for equivalence, and sort addrecs by their degree before examining their operands. llvm-svn: 112267	2010-08-27 15:26:01 +00:00
Owen Anderson	4afea9e3c6	In the default address space, any GEP off of null results in a trap value if you try to load it. Thus, any load in the default address space that completes implies that the base value that it GEP'd from was not null. llvm-svn: 112015	2010-08-25 01:16:47 +00:00
Owen Anderson	a10000006e	NULL loads are only invalid in the default address space. llvm-svn: 111972	2010-08-24 22:00:55 +00:00
Owen Anderson	b695c83de9	Add support for inferring values for the default cases of switches. llvm-svn: 111971	2010-08-24 21:59:42 +00:00
Owen Anderson	da34de1599	Add support for inferring that a load from a pointer implies that it is not null. llvm-svn: 111959	2010-08-24 20:47:29 +00:00
Owen Anderson	c62f704576	Don't assume that all constants with integer types are ConstantInts. llvm-svn: 111906	2010-08-24 07:55:44 +00:00
Devang Patel	dd719f701d	Let FE use derived types for DW_TAG_friend. Patch by Alexander Herz! llvm-svn: 111861	2010-08-23 23:16:25 +00:00
Devang Patel	a8652674e0	Handle qualified constants that are directly folded by FE. PR 7920. llvm-svn: 111820	2010-08-23 18:25:56 +00:00
Owen Anderson	d31d82d75c	Now that PassInfo and Pass::ID have been separated, move the rest of the passes over to the new registration API. llvm-svn: 111815	2010-08-23 17:52:01 +00:00
Dan Gohman	5fc55dc3cf	CreateTemporaryType doesn't need its Context argument. llvm-svn: 111687	2010-08-20 22:39:47 +00:00
Dan Gohman	16a5d98c3a	Introduce a new temporary MDNode concept. Temporary MDNodes are not part of the IR, are not uniqued, and may be safely RAUW'd. This replaces a variety of alternate mechanisms for achieving the same effect. llvm-svn: 111681	2010-08-20 22:02:26 +00:00
Dan Gohman	a931605647	Convert DbgInfoPrinter to use errs() instead of outs(). llvm-svn: 111659	2010-08-20 18:03:05 +00:00
Dan Gohman	f71c521fb7	Revert r111199; it breaks -debug-pass=Structure output. llvm-svn: 111500	2010-08-19 01:29:07 +00:00
Chris Lattner	3decde9305	refix PR1143 by making basicaa analyze zexts of indices aggressively, which I broke with a recent patch. llvm-svn: 111452	2010-08-18 23:09:49 +00:00
Chris Lattner	26403acef7	GetLinearExpression is only called when TD is non-null, pass as a reference instead of pointer. llvm-svn: 111445	2010-08-18 22:52:09 +00:00
Chris Lattner	1b9c38796e	rework GEP decomposition to make a new VariableGEPIndex struct instead of using a pair. This tidies up the code a bit. While setting things up, add a (currently unused) field to keep track of how the value is extended. llvm-svn: 111444	2010-08-18 22:47:56 +00:00
Chris Lattner	9f7500f57b	move gep decomposition out of ValueTracking into BasicAA. The form of decomposition that it is doing is very basicaa specific and is only used by basicaa. Now with less tree breakingness. llvm-svn: 111433	2010-08-18 22:07:29 +00:00
Owen Anderson	80d19f0905	Use ConstantRange to propagate information through value definitions. llvm-svn: 111425	2010-08-18 21:11:37 +00:00
Daniel Dunbar	fbeeb130d8	Revert r111375, "move gep decomposition out of ValueTracking into BasicAA. The form of", it doesn't pass tests. llvm-svn: 111385	2010-08-18 18:43:08 +00:00
Owen Anderson	208636fa33	Inform LazyValueInfo whenever a block is deleted, to avoid dangling pointer issues. llvm-svn: 111382	2010-08-18 18:39:01 +00:00
Chris Lattner	54fe883203	move gep decomposition out of ValueTracking into BasicAA. The form of decomposition that it is doing is very basicaa specific and is only used by basicaa. llvm-svn: 111375	2010-08-18 18:22:17 +00:00
Chris Lattner	a33edcb56c	fix PR7589: In brief: gep P, (zext x) != gep P, (sext x) DecomposeGEPExpression was getting this wrong, confusing basicaa. llvm-svn: 111352	2010-08-18 04:28:19 +00:00
Dan Gohman	ed2b005842	Tweak IVUsers' concept of "interesting" to exclude add recurrences where the step value is an induction variable from an outer loop, to avoid trouble trying to re-expand such expressions. This effectively hides such expressions from indvars and lsr, which prevents them from getting into trouble. llvm-svn: 111317	2010-08-17 22:50:37 +00:00
Owen Anderson	fa7d44687f	Fix another iterator invalidation that caused a really nasty miscompilation in 403.gcc. llvm-svn: 111210	2010-08-16 23:42:33 +00:00
Dan Gohman	55cd6aadc9	Make dumpPassStructure be a PMDataManager abstraction, rather than a Pass abstraction, since that's the level it's actually used at. Rename Pass' dumpPassStructure to dumpPass. This eliminates an awkward use of getAsPass() to convert a PMDataManager* into a Pass* just to permit a dumpPassStructure call. llvm-svn: 111199	2010-08-16 22:45:12 +00:00
Dan Gohman	797a1dbb1c	To create a copy of a SmallVector with an element removed from the middle, copy the elements in two groups, rather than copying all the elements and then doing an erase on the middle of the result. These are SmallVectors, so we shouldn't expect to hit dynamic allocation in the common case. llvm-svn: 111151	2010-08-16 16:57:24 +00:00
Dan Gohman	0d0cc18af5	Tidy whitespace. llvm-svn: 111147	2010-08-16 16:34:09 +00:00
Dan Gohman	c29eeaecec	Add a comment. llvm-svn: 111145	2010-08-16 16:31:39 +00:00
Dan Gohman	7eac4961d7	Use const_iterator in a few places. llvm-svn: 111144	2010-08-16 16:30:01 +00:00
Dan Gohman	74c61503b1	Use iterators instead of indices in a few more places. llvm-svn: 111143	2010-08-16 16:27:53 +00:00
Dan Gohman	f29618236e	Micro-optimize SCEVConstant comparison. llvm-svn: 111142	2010-08-16 16:25:35 +00:00
Dan Gohman	3688ea5c7d	Move SCEVNAryExpr's virtual member functions out of line, and convert them to iterators. llvm-svn: 111140	2010-08-16 16:21:27 +00:00
Dan Gohman	d6925bbe0d	Use iterators instead of indices in simple cases. llvm-svn: 111138	2010-08-16 16:16:11 +00:00
Dan Gohman	b6c773ec2e	Avoid gratuitous inefficiency in ifndef NDEBUG code. llvm-svn: 111137	2010-08-16 16:13:54 +00:00
Dan Gohman	e5fb1036e6	Make one getAddExpr call when analyzing a+b+c+d+e+... instead of one for each add instruction. Ditto for Mul. llvm-svn: 111136	2010-08-16 16:03:49 +00:00
Dan Gohman	b094b39111	Delete an unused function. llvm-svn: 111135	2010-08-16 15:57:14 +00:00
Dan Gohman	fb83b043eb	Revert r111058, the lint check for indirectbr successors that aren't address-taken. This can occur normally, if the code which took the address got DCEd. llvm-svn: 111121	2010-08-16 14:39:19 +00:00
Argyrios Kyrtzidis	d0fcc9a818	Revert r111082. No warnings for this common pattern. llvm-svn: 111102	2010-08-15 10:27:23 +00:00
Argyrios Kyrtzidis	7c09ddf0ae	Add ATTRIBUTE_UNUSED to methods that are not supposed to be used. llvm-svn: 111082	2010-08-14 21:35:10 +00:00
Dan Gohman	21e6dc6aa3	Add a lint check for an indirectbr destination which has not had its address taken. llvm-svn: 111058	2010-08-13 23:56:28 +00:00
Dan Gohman	0c436ab356	Various optimizations. Don't compare two loops' depths when they are the same loop. Don't compare two instructions' loop depths when they are in the same block. llvm-svn: 111045	2010-08-13 21:24:58 +00:00
Dan Gohman	63c020a210	When testing whether one loop contains another, test this directly rather than testing whether the loop contains the other's header. llvm-svn: 111039	2010-08-13 20:23:25 +00:00
Dan Gohman	3324b9ec67	Add a const. llvm-svn: 111038	2010-08-13 20:17:27 +00:00
Dan Gohman	cf32f2bde1	When creating a symmetric SCEV with a constant operand, put the constant operand on the left, as that's where ScalarEvolution will end up canonicalizing to. llvm-svn: 111037	2010-08-13 20:17:14 +00:00
Dan Gohman	ec0120a123	An add recurrence is loop-invariant in any loop inside of its associated loop. This avoids potentially expensive traversals of the add recurrence's operands. llvm-svn: 111034	2010-08-13 20:11:39 +00:00
Dan Gohman	2de47777f4	Optimize ScalarEvolution::getAddExpr's operand factoring code by having it finish processing all of the multiply operands before starting the whole getAddExpr process over again, instead of immediately after the first simplification. llvm-svn: 110916	2010-08-12 15:00:23 +00:00
Dan Gohman	157847f5d1	Hoist some loop-invariant code out of a hot loop. llvm-svn: 110915	2010-08-12 14:52:55 +00:00
Dan Gohman	e67b287451	Optimize ScalarEvolution::getAddExpr's duplicate operand detection by having it finish processing the whole operand list before starting the whole getAddExpr process over again, instead of immediately after the first duplicate is found. llvm-svn: 110914	2010-08-12 14:46:54 +00:00
Devang Patel	4d597e8268	Even if a variable has constant value all the time, it is still a variable in gdb's eyes. Tested by scope.exp in gdb testsuite. llvm-svn: 110876	2010-08-11 23:17:54 +00:00
Owen Anderson	7b974a45db	Fix a subtle use-after-free issue. llvm-svn: 110863	2010-08-11 22:36:04 +00:00
Dan Gohman	a97e78b4ac	Make LoopPass::getContainedPass return a LoopPass* instead of a Pass* and remove casts from all its callers. llvm-svn: 110848	2010-08-11 20:34:43 +00:00
Owen Anderson	0bd61240e9	Improve indentation. llvm-svn: 110778	2010-08-11 04:24:25 +00:00
Dan Gohman	f7495f286a	When analyzing loop exit conditions combined with and and or, don't make any assumptions about when the two conditions will agree on when to permit the loop to exit. This fixes PR7845. llvm-svn: 110758	2010-08-11 00:12:36 +00:00
Dan Gohman	e18c2d6f99	Rename and reorder the arguments to isImpliedCond, for consistency and clarity. llvm-svn: 110750	2010-08-10 23:46:30 +00:00
Owen Anderson	5f1dd0967d	Now that we're using ConstantRange to represent potential values, make use of that representation to create constraints from comparisons other than eq/neq. llvm-svn: 110742	2010-08-10 23:20:01 +00:00
Devang Patel	3e4d04230b	Add missing argument. CreateCompositeTypeEx() users, please verify. llvm-svn: 110717	2010-08-10 20:22:49 +00:00
Owen Anderson	185fe00633	Switch over to using ConstantRange to track integral values. llvm-svn: 110714	2010-08-10 20:03:09 +00:00
Devang Patel	8e06a5eb47	Do not forget debug info for enums. Use named mdnode to keep track of these types. llvm-svn: 110712	2010-08-10 20:01:20 +00:00
Tobias Grosser	7fbe6cb429	RegionInfo: Do not assert if a BB is not part of the dominance tree. llvm-svn: 110665	2010-08-10 09:54:35 +00:00
Devang Patel	b219746c80	Handle TAG_constant for integers. llvm-svn: 110656	2010-08-10 07:11:13 +00:00
Devang Patel	c7cf14f5f6	Refactor. llvm-svn: 110607	2010-08-09 21:39:24 +00:00
Owen Anderson	8afac043fb	Add ConstantRange information to the debugging output. llvm-svn: 110598	2010-08-09 20:50:46 +00:00
Owen Anderson	a7aed18624	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Dan Gohman	e68958fcdf	Implement a proper getModRefInfo for va_arg. llvm-svn: 110458	2010-08-06 18:24:38 +00:00
Dan Gohman	6b4671b208	Be more conservative in the face of volatile. llvm-svn: 110456	2010-08-06 18:11:28 +00:00
Dan Gohman	23976df6f2	Fix a comment. llvm-svn: 110455	2010-08-06 18:10:45 +00:00
Dan Gohman	5f1702e4fe	Move all the logic for function attributes and call attributes out of the AliasAnalysis base class and into BasicAliasAnalysis. This avoids confusion about where such logic is happening when there are other AliasAnalysis implementations present. Move the logic for translating two-callsite getModRefInfo queries into other AliasAnalysis queries out of BasicAliasAnalysis and into the AliasAnalysis base class, as it is useful for other AliasAnalysis implementations. llvm-svn: 110421	2010-08-06 01:25:49 +00:00
Owen Anderson	c2107d2eaa	Fix botched revert. llvm-svn: 110416	2010-08-06 00:36:20 +00:00
Owen Anderson	bda59bd247	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Dan Gohman	e0d5c458ec	Fix 80-column violations. llvm-svn: 110401	2010-08-05 23:48:14 +00:00
Owen Anderson	755aceb5d0	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Dan Gohman	884dd752c3	Implement AccessesArguments checking in the two-callsite form of BasicAA::getModRefInfo. This allows BasicAA to say that two memset calls to non-aliasing memory locations don't interfere. llvm-svn: 110393	2010-08-05 23:34:50 +00:00
Dan Gohman	e2a67168bf	Yes, we can do better, but this is not the place for it. llvm-svn: 110391	2010-08-05 23:23:32 +00:00
Owen Anderson	0f306a45ad	Add the beginnings of infrastructure for range tracking. llvm-svn: 110388	2010-08-05 22:59:19 +00:00
Owen Anderson	c3a1413ea1	Split the tag and value members of LVILatticeVal in preparation for expanding the lattice to something that won't fit in two bits. llvm-svn: 110383	2010-08-05 22:10:46 +00:00
Dan Gohman	26ef7c7ab7	Fix memdep's code for reasoning about dependences between two calls. A Ref response from getModRefInfo is not useful here. Instead, check for identical calls only in the NoModRef case. Reapply r110270, and strengthen it to compensate for the memdep changes. When both calls are readonly, there is no dependence between them. llvm-svn: 110382	2010-08-05 22:09:15 +00:00
Dan Gohman	554b012f67	Revert r110270 for now. It appears to uncover a memdep bug. llvm-svn: 110293	2010-08-05 00:43:10 +00:00
Dan Gohman	109561845b	The trouble with testing for "ModRef" and "NoModRef" is that one is a suffix of the other, and FileCheck accepts superstrings. Adjust the output to avoid this problem. llvm-svn: 110280	2010-08-04 23:37:55 +00:00
Dan Gohman	bd33dab633	The two-callsite form of AliasAnalysis::getModRefInfo is documented to return Ref if the left callsite only reads memory read or written by the right callsite; fix BasicAliasAnalysis to implement this. Add AliasAnalysisEvaluator support for testing the two-callsite form of getModRefInfo. llvm-svn: 110270	2010-08-04 22:56:29 +00:00
Dan Gohman	db764c6e3b	Fix a minor bug which resulted in intermediate calculations using wider types than are necessary. llvm-svn: 110241	2010-08-04 19:52:50 +00:00
Torok Edwin	bfc17d0157	Add a missing function. llvm-svn: 110195	2010-08-04 11:42:45 +00:00
Dan Gohman	fc419ef6a0	Remove PointerAccessInfo, which nothing was using. llvm-svn: 110167	2010-08-03 23:08:10 +00:00
Dan Gohman	5442c71f2e	Thread const correctness through a bunch of AliasAnalysis interfaces and eliminate several const_casts. Make CallSite implicitly convertible to ImmutableCallSite. Rename the getModRefBehavior for intrinsic IDs to getIntrinsicModRefBehavior to avoid overload ambiguity with CallSite, which happens to be implicitly convertible to bool. llvm-svn: 110155	2010-08-03 21:48:53 +00:00
Dan Gohman	ad867b0aed	The singular of "indices" is "index". llvm-svn: 110135	2010-08-03 20:23:52 +00:00
Dan Gohman	852d6fc50c	Delete an unused function. llvm-svn: 110134	2010-08-03 20:20:56 +00:00
Dan Gohman	52f9d7d617	Make AliasAnalysis::getModRefInfo conservative in the face of volatility. llvm-svn: 110120	2010-08-03 17:27:43 +00:00
Dan Gohman	081627ceb8	Fix a typo Devang noticed. llvm-svn: 110115	2010-08-03 16:48:31 +00:00
Michael J. Spencer	2ce6994211	Fix CMake build llvm-svn: 110097	2010-08-03 02:38:20 +00:00
Dan Gohman	2a190081f6	Introduce a symbolic constant for ~0u for use with AliasAnalysis. llvm-svn: 110091	2010-08-03 01:03:11 +00:00
Dan Gohman	da7182e116	Add a convenient form of AliasAnalysis::alias for the case where the sizes are unknown. llvm-svn: 110090	2010-08-03 00:56:30 +00:00
Dan Gohman	7cac95778f	Make SCEVUnknown a CallbackVH, so that it can be notified directly of Value deletions and RAUWs, instead of relying on ScalarEvolution's Scalars map being notified, as that's complicated at best, and insufficient in general. This means SCEVUnknown needs a non-trivial destructor, so introduce a mechanism to allow ScalarEvolution to locate all the SCEVUnknowns. llvm-svn: 110086	2010-08-02 23:49:30 +00:00
Dan Gohman	272980b3f6	Sketch up a preliminary Type-Based Alias Analysis implementation. llvm-svn: 110077	2010-08-02 23:11:01 +00:00
Dan Gohman	d8968da2c5	Add a lint check for indirectbr with no successors. llvm-svn: 110074	2010-08-02 23:06:43 +00:00
Devang Patel	33a2cdf3f9	Add explicit constructors. Patch by Renato Golin. llvm-svn: 110072	2010-08-02 22:51:46 +00:00
Dan Gohman	abfafadfc7	Fix namespace pollution. llvm-svn: 110056	2010-08-02 18:50:06 +00:00
Oscar Fuentes	40b31ad3ee	Prefix `next' iterator operation with `llvm::'. Fixes potential ambiguity problems on VS 2010. Patch by nobled! llvm-svn: 110029	2010-08-02 06:00:15 +00:00
Owen Anderson	c1561b8400	Add an initial implementation of PHI translation for LazyValueInfo. This involves rolling back some of my earlier data structure improvements until I can ensure that there are no iterator invalidation problems. llvm-svn: 109935	2010-07-30 23:59:40 +00:00
Owen Anderson	e4a0ab69d2	Revert my last two patches to LVI, which recent changes have exposed a miscompilation in. llvm-svn: 109889	2010-07-30 20:56:07 +00:00
Eric Christopher	ef6d5933a6	Speculatively revert r109705 since it seems to be causing some build bot angst. llvm-svn: 109718	2010-07-29 01:25:38 +00:00
Dan Gohman	3d6ac44d96	Factor out some of the code for updating old SCEVUnknown values, and extend it to handle the case where multiple RAUWs affect a single SCEVUnknown. Add a ScalarEvolution unittest to test for this situation. llvm-svn: 109705	2010-07-29 00:17:55 +00:00
Owen Anderson	a44f49f189	Pass the queried value by argument rather than in a member, in preparation for supporting PHI translation. llvm-svn: 109701	2010-07-28 23:50:08 +00:00
Owen Anderson	6982dd4e1f	Get rid of LVIQuery as a distinct data structure, so that we don't have to initialize a new set of maps on every query. llvm-svn: 109679	2010-07-28 22:07:25 +00:00
Daniel Dunbar	18e39cec7a	RegionInfo: Make sure to free cached nodes; Tobias, please check! llvm-svn: 109650	2010-07-28 20:28:50 +00:00
Gabor Greif	e497e5ef46	simplify llvm-svn: 109585	2010-07-28 15:31:37 +00:00
Gabor Greif	5bf74d648d	use Value* constructor of CallSite to create potentially improper site, and test that llvm-svn: 109580	2010-07-28 12:35:54 +00:00
Gabor Greif	67a970bff2	use Value* constructor of CallSite to create potentially improper site llvm-svn: 109579	2010-07-28 12:19:46 +00:00
Gabor Greif	7cf6056484	simplify llvm-svn: 109578	2010-07-28 10:57:28 +00:00
Gabor Greif	2e2503cd8d	simplify llvm-svn: 109577	2010-07-28 10:46:09 +00:00
Dan Gohman	7a066723d0	Make SCEVCallbackVH::allUsesReplacedWith update the old SCEVUnknown object, as it may still be referenced by SCEVs not cleaned up by the use list traversal. Also, in ScalarEvolution::forgetValue, only check for a SCEVUnknown object for the original value, not for any value in the use list, because other SCEVUnknown values aren't necessarily obsolete at that point. llvm-svn: 109570	2010-07-28 01:09:07 +00:00
Dan Gohman	8aeb0fb5ca	Make SCEVCallbackVH::allUsesReplacedWith unconditionally delete the old value. llvm-svn: 109567	2010-07-28 00:28:25 +00:00
Owen Anderson	aac5a72139	Rearrange several datastructures in LazyValueInfo to improve compile time. This is still not perfect, but better than it was before. llvm-svn: 109563	2010-07-27 23:58:11 +00:00
Gabor Greif	0630a71742	reintroduce original (asserting) semantics of CallSite(Instruction II) add instead a CallSite(Value V) constructor that is consistent with ImmutableCallSite and use that one in client code llvm-svn: 109553	2010-07-27 22:53:28 +00:00
Gabor Greif	ef1ca24b91	recommit simplification (originally r109504, backed out in r109508) now that problem in CallSiteBase is fixed llvm-svn: 109547	2010-07-27 22:02:00 +00:00
Gabor Greif	ed1d92cb9a	back out r109504, breaks the bots llvm-svn: 109508	2010-07-27 15:18:11 +00:00
Gabor Greif	195a609c37	simplify llvm-svn: 109504	2010-07-27 14:38:38 +00:00
Gabor Greif	d59498bc97	use ImmutableCallSite for const-corrgoodness llvm-svn: 109503	2010-07-27 14:15:29 +00:00
Tobias Grosser	fc763867d5	RegionInfo: Add getMaxRegionExit() getMaxRegionExit returns the exit of the maximal refined region starting at a specific basic block. llvm-svn: 109496	2010-07-27 08:39:43 +00:00
Tobias Grosser	1bec81a888	Add function to query RegionInfo about loops. * contains(Loop), * getOutermostLoop() * Improve getNameStr() to return a sensible name, if basic blocks are not named. llvm-svn: 109490	2010-07-27 04:17:13 +00:00
Owen Anderson	aa7f66ba67	Add an initial implementation of LazyValueInfo updating for JumpThreading. Disabled for now. llvm-svn: 109424	2010-07-26 18:48:03 +00:00
Dan Gohman	cd83870faf	Fix SCEVExpander::visitAddRecExpr so that it remembers the induction variable it inserted rather than using LoopInfo::getCanonicalInductionVariable to rediscover it, since that doesn't work on non-canonical loops. This fixes infinite recursion on such loops; PR7562. llvm-svn: 109419	2010-07-26 18:28:14 +00:00
Dan Gohman	b3aa6c7110	Use DominatorTree::properlyDominates instead of dominates with an explicit inequality check. llvm-svn: 109398	2010-07-26 17:34:05 +00:00
Dan Gohman	7038bd5c1a	Eliminate getCanonicalInductionVariableIncrement's last user and eliminate it. llvm-svn: 109270	2010-07-23 21:34:51 +00:00
Dan Gohman	acafc61023	Simplify this code; it can use the regular CFG utilities rather than the BlockTraits abstractions. llvm-svn: 109268	2010-07-23 21:25:16 +00:00
Dan Gohman	5ae3102459	Micro-optimize SCEVComplexityCompare. llvm-svn: 109267	2010-07-23 21:20:52 +00:00
Dan Gohman	992db006d0	Add a const qualifier. llvm-svn: 109266	2010-07-23 21:18:55 +00:00
Gabor Greif	1a2da423c9	use cascading operator-> feature llvm-svn: 109104	2010-07-22 13:49:27 +00:00
Gabor Greif	dde79d8f1a	mass elimination of reliance on automatic iterator dereferencing llvm-svn: 109103	2010-07-22 13:36:47 +00:00
Gabor Greif	d9f48ecb2e	use -> instead of (*). llvm-svn: 109094	2010-07-22 11:12:32 +00:00
Gabor Greif	07c8ad54da	cache dereferenced iterator llvm-svn: 109093	2010-07-22 11:07:46 +00:00
Tobias Grosser	336734aca6	Add new RegionInfo pass. The RegionInfo pass detects single entry single exit regions in a function, where a region is defined as any subgraph that is connected to the remaining graph at only two spots. Furthermore an hierarchical region tree is built. Use it by calling "opt -regions analyze" or "opt -view-regions". llvm-svn: 109089	2010-07-22 07:46:31 +00:00
Dan Gohman	2637cc1a38	Make NamedMDNode not be a subclass of Value, and simplify the interface for creating and populating NamedMDNodes. llvm-svn: 109061	2010-07-21 23:38:33 +00:00
Owen Anderson	ac4a1ede17	Add INSTANTIATE_AG_PASS, which combines RegisterPass<> with RegisterAnalysisGroup<> for pass registration. llvm-svn: 109058	2010-07-21 23:07:00 +00:00
Owen Anderson	a57b97e7e7	Fix batch of converting RegisterPass<> to INITIALIZE_PASS(). llvm-svn: 109045	2010-07-21 22:09:45 +00:00
Jim Grosbach	6cd0deb997	tidy up. llvm-svn: 109038	2010-07-21 21:36:25 +00:00
Dan Gohman	093cb79d4b	Disallow null as a named metadata operand. Make MDNode::destroy private. Fix the one thing that used MDNode::destroy, outside of MDNode itself. One should never delete or destroy an MDNode explicitly. MDNodes implicitly go away when there are no references to them (implementation details aside). llvm-svn: 109028	2010-07-21 18:54:18 +00:00
Dan Gohman	625fd2292d	Fix SCEV denormalization of expressions where the exit value from one loop is involved in the increment of an addrec for another loop. This fixes rdar://8168938. llvm-svn: 108863	2010-07-20 17:06:20 +00:00
Dan Gohman	46f00a25f9	Add a fast path for x - x. llvm-svn: 108855	2010-07-20 16:53:00 +00:00
Dan Gohman	31158756e4	Simplify this code; LoopInfo::getCanonicalInductionVariable will only find integer induction variables. llvm-svn: 108853	2010-07-20 16:46:58 +00:00
Dan Gohman	4fd92434f1	Make getOrInsertCanonicalInductionVariable guarantee that its result is a PHINode*. llvm-svn: 108852	2010-07-20 16:44:52 +00:00
Dan Gohman	191f2e4dbd	Change an argument from an Instruction* to a Value*, which is all that is needed here. llvm-svn: 108850	2010-07-20 16:34:50 +00:00
Dan Gohman	d1488fd8bc	Minor code cleanups. llvm-svn: 108848	2010-07-20 16:32:11 +00:00
Owen Anderson	81781220d2	Speculatively revert r108813, in an attempt to get the self-host buildbots working again. I don't see why this patch would cause them to fail the way they are, but none of the other intervening patches seem likely either. llvm-svn: 108818	2010-07-20 08:26:15 +00:00
Owen Anderson	8dc129325f	Reapply r108794, a fix for the failing test from last time. llvm-svn: 108813	2010-07-20 06:52:42 +00:00
Daniel Dunbar	4a35d6f8cd	Revert r108794, "Separate PassInfo into two classes: a constructor-free superclass (StaticPassInfo) and a constructor-ful subclass (PassInfo).", it is breaking teh everything. llvm-svn: 108805	2010-07-20 03:06:07 +00:00
Owen Anderson	e7c5fe586a	Separate PassInfo into two classes: a constructor-free superclass (StaticPassInfo) and a constructor-ful subclass (PassInfo). llvm-svn: 108794	2010-07-20 01:19:58 +00:00
Dan Gohman	3ff13affda	Minor code simplification. llvm-svn: 108793	2010-07-20 00:57:18 +00:00
Stuart Hastings	61475c5c3c	Correct line info for declarations/definitions. Radar 8063111. llvm-svn: 108784	2010-07-19 23:56:30 +00:00
Gabor Greif	6d673953e3	eliminate CallInst::ArgOffset llvm-svn: 108522	2010-07-16 09:38:02 +00:00
Dan Gohman	fbbdfcaea7	Fix the order that SCEVExpander considers add operands in so that it doesn't miss an opportunity to form a GEP, regardless of the relative loop depths of the operands. This fixes rdar://8197217. llvm-svn: 108475	2010-07-15 23:38:13 +00:00
Dan Gohman	64b1e82a7c	Teach ScalarEvolution how to fold trunc(undef) and anyext(undef) to undef. This helps LSR behave more consistently on bugpoint-reduced testcases. llvm-svn: 108451	2010-07-15 20:02:11 +00:00
Gabor Greif	26ec65ac3c	cache another dereferenced iterator llvm-svn: 108421	2010-07-15 10:19:23 +00:00
Chris Lattner	19eff2a9f6	Fix PR7647, handling the case when 'To' ends up being mutated by recursive simplification. This also enhances ReplaceAndSimplifyAllUses to actually do a real RAUW at the end of it, which updates any value handles pointing to "From" to start pointing to "To". This seems useful for debug info and random other VH users. llvm-svn: 108415	2010-07-15 06:36:08 +00:00
Eli Friedman	8b3a17e613	Revert r108401; it breaks bootstrap :( llvm-svn: 108407	2010-07-15 05:09:31 +00:00
Eli Friedman	fd473a746c	Add AssertingVH which makes PR7647 break consistently. llvm-svn: 108401	2010-07-15 04:46:14 +00:00
Dan Gohman	c128e70ff2	Add a lint check for mismatched return types, inspired by PR6944. llvm-svn: 108162	2010-07-12 18:02:04 +00:00
Duncan Sands	41b4a6b36a	Convert some tab stops into spaces. llvm-svn: 108130	2010-07-12 08:16:59 +00:00
Chandler Carruth	57041d81df	Add parentheses around an || to correct the logic. Also silences a GCC warning that was actually useful here. Chris, please double check that this is the correct interpretation. I was pretty sure, and ran it by Nick as well. llvm-svn: 108129	2010-07-12 06:47:05 +00:00
Chris Lattner	fd4a09fc0a	fix PR7429, a crash turning a load from a string into a float. llvm-svn: 108113	2010-07-12 00:22:51 +00:00
Gabor Greif	8e66a42784	remove useless cast and fix typos in comment llvm-svn: 107989	2010-07-09 16:42:04 +00:00
Gabor Greif	3b740e9085	cache result of operator* llvm-svn: 107988	2010-07-09 16:39:02 +00:00
Gabor Greif	aa389f5085	cache result of operator* llvm-svn: 107982	2010-07-09 16:22:36 +00:00
Gabor Greif	070b9a2cc4	cache result of operator* llvm-svn: 107978	2010-07-09 15:53:42 +00:00
Gabor Greif	d9a0e80213	cache result of operator* llvm-svn: 107977	2010-07-09 15:52:36 +00:00
Gabor Greif	e82532a1c5	cache result of operator* llvm-svn: 107976	2010-07-09 15:40:10 +00:00
Gabor Greif	2732561be9	cache result of operator* llvm-svn: 107967	2010-07-09 14:28:41 +00:00
Gabor Greif	1d20021d82	do not repeatedly dereference use_iterator llvm-svn: 107963	2010-07-09 13:17:13 +00:00
Stuart Hastings	d08fb75aaa	Reverting r107918 and r107919. Radar 8063111. llvm-svn: 107930	2010-07-08 23:25:39 +00:00
Stuart Hastings	43d226deea	Fix decl/def debug info for template functions. Radar 8063111. llvm-svn: 107919	2010-07-08 22:28:59 +00:00
Dan Gohman	5b0a8a863f	Minor code simplification. llvm-svn: 107777	2010-07-07 14:30:04 +00:00
Dan Gohman	00ef93258a	Remove interprocedural-basic-aa and associated code. The AliasAnalysis interface needs implementations to be consistent, so any code which wants to support different semantics must use a different interface. It's not currently worthwhile to add a new interface for this new concept. Document that AliasAnalysis doesn't support cross-function queries. llvm-svn: 107776	2010-07-07 14:27:09 +00:00
Gabor Greif	a22e8148d4	conditionalize by CallInst::ArgOffset llvm-svn: 107767	2010-07-07 10:34:03 +00:00
Dan Gohman	1e33b18e28	Add some more TODO comments. llvm-svn: 107657	2010-07-06 15:23:00 +00:00
Dan Gohman	f855b39edd	Add a comment. llvm-svn: 107656	2010-07-06 15:21:57 +00:00
Dan Gohman	84f90a387d	Remove context sensitivity concerns from interprocedural-basic-aa, and make it more aggressive in cases where both pointers are known to live in the same function. llvm-svn: 107420	2010-07-01 20:08:40 +00:00
Dan Gohman	f638f4ff84	In ScalarEvolution::forgetValue, eliminate any SCEVUnknown entries associated with the value being erased in the folding set map. These entries used to be harmless, because a SCEVUnknown doesn't store any information about its Value*, so having a new Value allocated at the old Value's address wasn't a problem. But now that ScalarEvolution is storing more information about values, this is no longer safe. llvm-svn: 107316	2010-06-30 20:21:12 +00:00
Dan Gohman	c0cca7fdda	Revert the part of r107257 which introduced new logic for using nsw and nuw flags from IR Instructions. On further consideration, this isn't valid. llvm-svn: 107298	2010-06-30 17:27:11 +00:00
Dan Gohman	16206132b6	Improve ScalarEvolution's nsw and nuw preservation. llvm-svn: 107257	2010-06-30 07:16:37 +00:00
Dan Gohman	9396b42ca4	When computing a new ConservativeResult, intersect it with the old one instead of replacing it, to be more precise. llvm-svn: 107256	2010-06-30 06:58:35 +00:00
Dan Gohman	0865966440	Rework scev-aa's basic computation so that it doesn't depend on ScalarEvolution successfully folding and preserving range information for both A-B and B-A. Now, if it gets either one, it's sufficient. llvm-svn: 107249	2010-06-30 06:12:16 +00:00
Dan Gohman	37f145c55b	Simplify. llvm-svn: 107248	2010-06-30 06:09:46 +00:00
Dan Gohman	ae36b1ed42	Fix ScalarEvolution's tripcount computation for chains of loops where each loop's induction variable's start value is the exit value of a preceding loop. llvm-svn: 107224	2010-06-29 23:43:06 +00:00
Dan Gohman	1be9e7c0b6	Fix whitespace style. llvm-svn: 107175	2010-06-29 18:12:34 +00:00
Duncan Sands	67aa21d7b5	Remove a pointless variable. llvm-svn: 107128	2010-06-29 11:39:45 +00:00
Benjamin Kramer	80b7bc042a	Use a more obvious way to avoid compiling functions which are only used when XDEBUG is enabled. llvm-svn: 107125	2010-06-29 10:03:11 +00:00
Chandler Carruth	b1adb88d05	Jump through some silly hoops to make GCC accept that a function may not always be called. llvm-svn: 107124	2010-06-29 06:46:00 +00:00
Dan Gohman	90db61d638	Just as it's not safe to blindly transfer the nsw bit from an add instruction to an add scev, it's not safe to blindly transfer the inbounds flag from a gep instruction to an nsw on the scev for the gep. llvm-svn: 107117	2010-06-29 01:41:41 +00:00
Dan Gohman	0824affeff	Add an Intraprocedural form of BasicAliasAnalysis, which aims to properly handle instructions and arguments defined in different functions, or across recursive function iterations. llvm-svn: 107109	2010-06-29 00:50:39 +00:00
Dan Gohman	7c34ece501	Fix Value::stripPointerCasts and BasicAA to avoid trouble on code in unreachable blocks, which have use-def cycles. This fixes PR7514. llvm-svn: 107071	2010-06-28 21:16:52 +00:00
Dan Gohman	875a296011	Generalize AAEval so that it can be used both per-function and interprocedurally. Note that as of this writing, existing alias analysis passes are not prepared to be used interprocedurally. llvm-svn: 107013	2010-06-28 16:01:37 +00:00
Devang Patel	f7869a4b81	Use named MDNode, llvm.dbg.sp, to collect subprogram info. This will be used to emit local variable's debug info of deleted functions. llvm-svn: 106989	2010-06-28 05:53:08 +00:00
Devang Patel	81170d23de	Do not forget last element, function, while creating Subprogram definition MDNode from subprogram declare MDNode. llvm-svn: 106985	2010-06-27 21:04:31 +00:00
Dan Gohman	89dd42af31	Eliminate a redundant FoldingSet lookup. llvm-svn: 106872	2010-06-25 18:47:08 +00:00
Dan Gohman	5235cc2c25	Don't try to preserve pointer types in SCEVConstants; the old code was over-complicated. llvm-svn: 106760	2010-06-24 16:47:03 +00:00
Dan Gohman	3ace9f4e3d	Make the trunc code consistent with the zext and sext code in its handling of pointer types. llvm-svn: 106757	2010-06-24 16:33:38 +00:00
Gabor Greif	1abbde3103	use ArgOperand accessors llvm-svn: 106697	2010-06-23 23:38:07 +00:00
Gabor Greif	253c6bf366	use the new isFreeCall API and ArgOperand accessors llvm-svn: 106692	2010-06-23 22:48:06 +00:00
Gabor Greif	5f5a864539	minor enhancement to llvm::isFreeCall API: return CallInst; no functional change llvm-svn: 106686	2010-06-23 21:51:12 +00:00
Gabor Greif	ad7884ad98	use ArgOperand getters llvm-svn: 106685	2010-06-23 21:41:47 +00:00
Dan Gohman	75c6b0bb1f	Replace ScalarEvolution's private copy of getLoopPredecessor with LoopInfo's public copy. llvm-svn: 106603	2010-06-22 23:43:28 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Dan Gohman	f820bd327d	Allow "exhaustive" trip count evaluation on phi nodes with all constant operands. llvm-svn: 106537	2010-06-22 13:15:46 +00:00
Devang Patel	b6e058da18	Use single interface, using twine, to get named metadata. getNamedMetadata(). llvm-svn: 106518	2010-06-22 01:19:38 +00:00
Devang Patel	ad51735794	Do not rely on Twine temporaries to survive. llvm-svn: 106515	2010-06-22 01:01:58 +00:00
Dan Gohman	dd41bba517	Use A.append(...) instead of A.insert(A.end(), ...) when A is a SmallVector, and other SmallVector simplifications. llvm-svn: 106452	2010-06-21 19:47:52 +00:00
Devang Patel	e80de80270	Do not directly use function names to construct new name for named metadata. "llvm.dbg.lv.~A" is not a valid name. llvm-svn: 106438	2010-06-21 18:36:58 +00:00
Dan Gohman	c515ab1eb2	Restore a call to rememberInstruction which was accidentally dropped in refactoring. llvm-svn: 106398	2010-06-19 22:50:35 +00:00
Dan Gohman	866971ed3d	Fix ScalarEvolution's "exhaustive" trip count evaluation code to avoid assuming that loops are in canonical form, as ScalarEvolution doesn't depend on LoopSimplify itself. Also, with indirectbr not all loops can be simplified. This fixes PR7416. llvm-svn: 106389	2010-06-19 14:17:24 +00:00
Dan Gohman	d277246137	Factor out duplicated code for reusing and inserting casts into a helper function. llvm-svn: 106388	2010-06-19 13:25:23 +00:00
Dan Gohman	24ceda8eb0	Revert r106304 (105548 and friends), which are the SCEVComplexityCompare optimizations. There is still some nondeterminism remaining. llvm-svn: 106306	2010-06-18 19:54:20 +00:00
Dan Gohman	4c807fca97	Reapply 105540, 105542, and 105548, and revert r105732. llvm-svn: 106304	2010-06-18 19:26:04 +00:00
Dan Gohman	45073042eb	Reapply 105546. llvm-svn: 106302	2010-06-18 19:12:32 +00:00
Dan Gohman	9136d9fbf8	Reapply 105544. llvm-svn: 106301	2010-06-18 19:09:27 +00:00
Dan Gohman	3d8a9d7490	Remove getIntegerSCEV; it's redundant with getConstant, and getConstant is more consistent with the ConstantInt API. llvm-svn: 106281	2010-06-18 14:33:50 +00:00
Dan Gohman	f1d8304fe3	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Dan Gohman	8ba26b48bb	Fix a typo in a comment. llvm-svn: 106260	2010-06-18 00:53:08 +00:00
Dan Gohman	8f5954f42c	Simplify this code. llvm-svn: 106254	2010-06-17 23:34:09 +00:00
Jim Grosbach	fd3b4e7390	A few more places where SCEVExpander bits need to skip over debug intrinsics when iterating through instructions. Yet more work for rdar://7797940 llvm-svn: 106149	2010-06-16 21:13:38 +00:00
Devang Patel	d119da54de	Check function pointer first, before comparing function names. llvm-svn: 106088	2010-06-16 06:42:02 +00:00
Devang Patel	a6d20f446f	Use separate named MDNode to hold each function's local variable info. This speeds up local variable handling in DwarfDebug. llvm-svn: 106075	2010-06-16 00:53:55 +00:00
Stuart Hastings	afe54f1625	Support for nested functions/classes in debug output. (Again.) Radar 7424645. llvm-svn: 105828	2010-06-11 20:08:44 +00:00
Stuart Hastings	6111abf8ad	Delete duplicate function. llvm-svn: 105827	2010-06-11 20:05:01 +00:00
Evan Cheng	ae83e1f5cb	Revert 105540, 105542, 105544, 105546, and 105548 to unbreak bootstrapping. llvm-svn: 105740	2010-06-09 18:59:43 +00:00
Kenneth Uildriks	9b21208bfb	Pulled CodeMetrics out of InlineCost.h and made it a bit more general, so it can be reused from PartialSpecializationCost llvm-svn: 105725	2010-06-09 15:11:37 +00:00
Dan Gohman	ebf2e977cf	The FoldingSet hash data includes pointer values, so it isn't deterministic. Instead, give SCEV objects an arbitrary sequence number. llvm-svn: 105548	2010-06-07 19:36:14 +00:00
Dan Gohman	3553feed79	Optimize this code somewhat by taking advantage of the fact that the operands are sorted. llvm-svn: 105546	2010-06-07 19:20:57 +00:00
Dan Gohman	a2effb6452	Micro-optimize this, to speed up this hotspot in debug builds a little. llvm-svn: 105544	2010-06-07 19:16:37 +00:00
Dan Gohman	18a4b46404	Micro-optimize this. llvm-svn: 105542	2010-06-07 19:12:54 +00:00
Dan Gohman	70910a6ab6	Optimize ScalarEvolution's SCEVComplexityCompare predicate: don't go scrounging through SCEVUnknown contents and SCEVNAryExpr operands; instead just do a simple deterministic comparison of the precomputed hash data. Also, since this is more precise, it eliminates the need for the slow N^2 duplicate detection code. llvm-svn: 105540	2010-06-07 19:06:13 +00:00
Bill Wendling	a3bba3371a	Create new accessors to get arguments for call/invoke instructions. It breaks encapsulation to force the users of these classes to know about the internal data structure of the Operands structure. It also can lead to errors, like in the MSIL writer. llvm-svn: 105539	2010-06-07 19:05:06 +00:00
Stuart Hastings	3ca391027f	Revert 105492 & 105493 due to a testcase regression. Radar 7424645. llvm-svn: 105511	2010-06-05 00:39:29 +00:00
Dan Gohman	bbfb6aca92	LSR needs to remember inserted instructions even in postinc mode, because there could be multiple subexpressions within a single expansion which require insert point adjustment. This fixes PR7306. llvm-svn: 105510	2010-06-05 00:33:07 +00:00
Stuart Hastings	7c015988fe	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 105492	2010-06-04 22:36:03 +00:00
Dan Gohman	538b413ccb	Fix normalization and de-normalization of non-affine SCEVs. llvm-svn: 105480	2010-06-04 19:16:34 +00:00
Dan Gohman	49a372cebc	Fix the noalias checking so that it doesn't worry about an argument aliasing itself. Thanks Duncan! llvm-svn: 105288	2010-06-01 20:51:40 +00:00
Dan Gohman	34709d06c0	Fix AliasDebugger to be aware of operand values too. llvm-svn: 105012	2010-05-28 22:31:51 +00:00
Dan Gohman	0fa67e479a	Add lint checks for function attributes. llvm-svn: 105009	2010-05-28 21:43:57 +00:00
Dan Gohman	c575ec61ea	Fix lint's memcpy and memmove checks, and its basic block traversal. llvm-svn: 104970	2010-05-28 17:44:00 +00:00
Dan Gohman	862f034188	Detect self-referential values. llvm-svn: 104957	2010-05-28 16:45:33 +00:00
Stuart Hastings	c1e216583f	Revert 104841, 104842, 104876 due to buildbot failures. Radar 7424645. llvm-svn: 104953	2010-05-28 16:41:07 +00:00
Dan Gohman	cef9fc37f4	Eli pointed out that va_arg instruction result values don't reference the stack. llvm-svn: 104951	2010-05-28 16:34:49 +00:00
Dan Gohman	54d7aaa819	Teach lint how to look through simple store+load pairs and other effective no-op constructs, to make it more effective on unoptimized IR. llvm-svn: 104950	2010-05-28 16:21:24 +00:00
Dan Gohman	826bdf8c10	Move FindAvailableLoadedValue isSafeToLoadUnconditionally out of lib/Transforms/Utils and into lib/Analysis so that Analysis passes can use them. llvm-svn: 104949	2010-05-28 16:19:17 +00:00
Dan Gohman	a3b6c4b529	ConstantFoldConstantExpression can theoretically return null. llvm-svn: 104948	2010-05-28 16:12:08 +00:00
Dan Gohman	ddba4b725a	Add a lint check for returning the address of stack memory. llvm-svn: 104936	2010-05-28 04:33:42 +00:00
Stuart Hastings	8e99e50d08	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 104841	2010-05-27 16:16:54 +00:00
Jakob Stoklund Olesen	d67defdfe2	Avoid counting InlineAsm as a call - it prevents loop unrolling. PR7026 Patch by Pekka Jääskeläinen! llvm-svn: 104780	2010-05-26 22:40:28 +00:00
Dan Gohman	084bcb1322	Fix Lint printing warnings multiple times. Remove the ErrorStr option from lintModule, which was an artifact from being based on Verifier code. llvm-svn: 104765	2010-05-26 22:28:53 +00:00
Dan Gohman	a20a5cd24f	Reinstate checking of stackrestore, with checking for both Read and Write, and add a comment explaining this. llvm-svn: 104756	2010-05-26 22:21:25 +00:00
Dan Gohman	996bc42a26	Stackrestore is not a load. llvm-svn: 104752	2010-05-26 22:00:10 +00:00
Dan Gohman	c96c6db59d	Remove a TODO which isn't practical. llvm-svn: 104748	2010-05-26 21:50:41 +00:00
Dan Gohman	1249adf160	Implement checking of the tail keyword. llvm-svn: 104744	2010-05-26 21:46:36 +00:00
Devang Patel	0adee9b362	Rename variable. add comment. llvm-svn: 104274	2010-05-20 20:35:24 +00:00
Devang Patel	e0a94bfe9f	Add support to preserve type info for the variables that are removed by the optimizer. llvm-svn: 103798	2010-05-14 21:01:35 +00:00
Nick Lewycky	c63aa1e8ab	Clear CachedFunctionInfo upon Pass::releaseMemory. Because ValueMap will abort on RAUW of functions, this is a correctness issue instead of a mere memory usage problem. No testcase until the new MergeFunctions can land. llvm-svn: 103653	2010-05-12 21:48:15 +00:00
Dan Gohman	bf2fb95b7c	Fix whitespace in debug output to be consistent. llvm-svn: 103422	2010-05-10 20:07:44 +00:00
Devang Patel	cbe7a8508a	Remove DIGlobal. llvm-svn: 103325	2010-05-07 23:19:07 +00:00
Devang Patel	54c59312b1	Add DINameSpace::Verify(). llvm-svn: 103318	2010-05-07 23:04:32 +00:00
Devang Patel	2ae3397536	Verify variable directly. llvm-svn: 103305	2010-05-07 22:04:20 +00:00
Devang Patel	2c4d69d7ad	Verify compile unit also. llvm-svn: 103300	2010-05-07 21:42:24 +00:00
Devang Patel	32cc43c242	Wrap const MDNode * inside DIDescriptor. llvm-svn: 103295	2010-05-07 20:54:48 +00:00
Devang Patel	4423abd734	Use overloaded operators instead of DIDescriptor::getNode() llvm-svn: 103276	2010-05-07 18:19:32 +00:00
Devang Patel	cfa8e9d45f	Avoid DIDescriptor::getNode(). Use overloaded operators instead. llvm-svn: 103272	2010-05-07 18:11:54 +00:00
Dan Gohman	50689f0bb9	Add some words to this output to indicate what the numbers mean. llvm-svn: 103264	2010-05-07 16:39:27 +00:00
Dan Gohman	fb64b5dff4	Add a simple module-level debug info printer. It just sets up a DebugInfoFinder and iterates over all the contents calling print. llvm-svn: 103262	2010-05-07 16:22:32 +00:00
Dan Gohman	6c30e879f8	Fix the new print functions to call print instead of dump. llvm-svn: 103261	2010-05-07 16:17:22 +00:00
Dan Gohman	4bbcf644da	Convert the DebugInfo classes dump() methods into print(raw_ostream &) methods, and add dump functions implemented in terms of the print. llvm-svn: 103254	2010-05-07 15:30:29 +00:00
Dan Gohman	70a3b12193	Use the SCEVAddRecExpr::getPostIncExpr utility function instead of doing the same thing manually. llvm-svn: 102997	2010-05-04 01:12:27 +00:00
Dan Gohman	5f18c547da	Fix a copy+pasto. llvm-svn: 102996	2010-05-04 01:11:15 +00:00
Devang Patel	801b8ea42a	Do not ignore debug loc attached with llvm.dbg.declare while collecting debug info used by a module. llvm-svn: 102995	2010-05-04 01:05:02 +00:00
Dan Gohman	1d2ded75e2	Use getConstant instead of getIntegerSCEV. The two are basically the same, now that getConstant has overloads consistent with ConstantInt::get. llvm-svn: 102965	2010-05-03 22:09:21 +00:00
Dan Gohman	267700c5aa	Silence warnings about -1 being converted to an unsigned value. Also, pass true for isSigned even when creating constants for unsigned comparisons, because the point is to create an all-ones constant, rather than UINT64_MAX, even for integers wider than 64 bits. llvm-svn: 102946	2010-05-03 20:23:47 +00:00
Dan Gohman	b5025c72eb	Use isTrueWhenEqual and isFalseWhenEqual instead of assuming that SimplifyICmpOperands will simplify such cases to EQ or NE. This makes the correctness of the code independent of SimplifyICmpOperands doing certain simplifications. llvm-svn: 102927	2010-05-03 18:00:24 +00:00
Dan Gohman	d18dc2c876	In ScalarEvolution::print, don't bother printing out the SCEVs for comparison instructions, since they aren't interesting, despite having integer result types. llvm-svn: 102925	2010-05-03 17:03:23 +00:00
Dan Gohman	df564cacaf	In SimplifyICmpOperands, avoid needlessly swapping the operands in the case where both are addrecs in unrelated loops. llvm-svn: 102924	2010-05-03 17:00:11 +00:00
Dan Gohman	81585c18e1	Factor out the new <= and >= analysis code into SimplifyICmpOperands. llvm-svn: 102922	2010-05-03 16:35:17 +00:00
David Chisnall	f4b87f191b	Added a variant of InlineCostAnalyzer::getInlineCost() that takes the called function as an explicit argument, for use when inlining function pointers. llvm-svn: 102841	2010-05-01 15:47:41 +00:00
Chris Lattner	532112b98a	fix PR5009 by making CGSCCPM realize that a call was devirtualized if an indirect call site was removed and a direct one was added, not just if an indirect call site was modified to be direct. llvm-svn: 102830	2010-05-01 06:38:43 +00:00
Chris Lattner	fc8d9ee6c3	Implement rdar://6295824 and PR6724 with two tiny changes that can have a big effect :). The first is to enable the iterative SCC passmanager juice that kicks in when the scc passmgr detects that a function pass has devirtualized a call. In this case, it will rerun all the passes it manages on the SCC, up to the iteration count limit (4). This is useful because a function pass may devirtualize a call, and we want the inliner to inline it, or pruneeh to infer stuff about it, etc. The second patch is to add all call sites to the DevirtualizedCalls list the inliner uses. This list is about to get renamed, but the gist of this is that the inliner now reconsiders all inlined call sites as candidates for further inlining. The intuition is that in cases like this: f() { g(1); } g(int x) { h(x); } We analyze this bottom up, and may decide that it isn't profitable to inline H into G. Next step, we decide that it is profitable to inline G into F, and do so, which means that F now calls H. Even though the call from G -> H may not have been profitable to inline, the call from F -> H may be (in this case because a constant allows folding etc). In my spot checks, this doesn't have a big impact on code. For example, the LLC output for 252.eon grew by 0.02% (from 317252 to 317308) and 176.gcc actually shrunk by .3% (from 1525612 to 1520964 bytes). 252.eon never iterated in the SCC Passmgr, 176.gcc iterated at most 1 time. llvm-svn: 102823	2010-05-01 01:15:56 +00:00
Chris Lattner	a9bac86d16	Dan recently disabled recursive inlining within a function, but we were still inlining self-recursive functions into other functions. Inlining a recursive function into itself has the potential to reduce recursion depth by a factor of 2, inlining a recursive function into something else reduces recursion depth by exactly 1. Since inlining a recursive function into something else is a weird form of loop peeling, turn this off. The deleted testcase was added by Dale in r62107, since then we're leaning towards not inlining recursive stuff ever. In any case, if we like inlining recursive stuff, it should be done within the recursive function itself to get the algorithm recursion depth win. llvm-svn: 102798	2010-04-30 22:37:22 +00:00
Devang Patel	b4e3b9025c	Attach AT_APPLE_optimized attribute to optimized function's debug info. llvm-svn: 102743	2010-04-30 19:38:23 +00:00
Dan Gohman	a0a8a7fe40	Set isSigned to true when creating an all-ones integer constant, even for unsigned purposes, so >64-bit integer values get a full all-ones value. llvm-svn: 102739	2010-04-30 19:22:39 +00:00
Dan Gohman	1c07852e17	Silence compiler warnings. llvm-svn: 102734	2010-04-30 19:21:13 +00:00
Dan Gohman	299e7b93ac	Add lint checks for invalid uses of memory. llvm-svn: 102733	2010-04-30 19:05:00 +00:00
Devang Patel	0395553e35	Refactor. llvm-svn: 102661	2010-04-29 20:40:36 +00:00
Dan Gohman	58b0470592	When checking whether the special handling for an addrec increment which doesn't dominate the header is needed, don't check whether the increment expression has computable loop evolution. While the operands of an addrec are required to be loop-invariant, they're not required to dominate any part of the loop. This fixes PR6914. llvm-svn: 102389	2010-04-26 21:46:36 +00:00
Dan Gohman	f33bac3afe	ScalarEvolution support for <= and >= loops. Also, generalize ScalarEvolution's min and max recognition to handle some new forms of min and max that this change makes more common. llvm-svn: 102234	2010-04-24 03:09:42 +00:00
Dan Gohman	36cce7e0dd	Use SimplifyICmpOperands in isKnownPredicate too. llvm-svn: 102233	2010-04-24 01:38:36 +00:00
Dan Gohman	3673aa1a51	Update isImpliedCond to use the new SimplifyICmpOperands utility. llvm-svn: 102232	2010-04-24 01:34:53 +00:00
Dan Gohman	48ff3cf63b	Add a new utility function SimplifyICmpOperands. Much of this code is refactored out of ScalarEvolution::isImpliedCond, which will be updated to use this new utility routine soon. llvm-svn: 102229	2010-04-24 01:28:42 +00:00
Chris Lattner	8c56254096	fix callgraph dump to not print 0x0x1234 for nodes. Add the instruction pointer value for debuggability. We now get dump output that looks like this: Call graph node for function: 'f1'<<0x1017086b0>> #uses=1 CS<0x1017046f8> calls external node Call graph node for function: '_ZNSt6vectorIdSaIdEEC1EmRKdRKS0_'<<0x1017086f0>> #uses=1 CS<0x0> calls external node Call graph node for function: 'f4'<<0x1017087a0>> #uses=1 CS<0x101708c88> calls function 'f3' llvm-svn: 102194	2010-04-23 18:23:40 +00:00
Dan Gohman	997bbc54d6	Fix LSR to tolerate cases where ScalarEvolution initially misses an opportunity to fold add operands, but folds them after LSR has separated them out. This fixes rdar://7886751. llvm-svn: 102157	2010-04-23 01:55:05 +00:00
Dan Gohman	ff3174e97f	When it doesn't matter whether zero or sign extension is used, use ScalarEvolution's "any" extend function. llvm-svn: 102156	2010-04-23 01:51:29 +00:00
Chris Lattner	055cf267db	add a DEBUG call so that -debug lists when CGSCCPM iterates. Fix RefreshCallGraph to use CGN->replaceCallEdge instead of hand rolling its own loop. replaceCallEdge properly maintains the reference counts of the nodes, fixing a crash exposed by the iterative callgraph stuff. llvm-svn: 102120	2010-04-22 20:42:33 +00:00
Dan Gohman	acd700a24b	Don't attempt to analyze values which are obviously undef. This fixes some assertion failures in extreme cases. llvm-svn: 102042	2010-04-22 01:35:11 +00:00
Dan Gohman	c951e6e414	Tidy a comment. llvm-svn: 102041	2010-04-22 01:30:05 +00:00
Dan Gohman	a029cbe93f	Make ScalarEvolution::getConstant support pointer types, for consistency with ScalarEvolution's overall approach to pointer types. llvm-svn: 102003	2010-04-21 16:04:04 +00:00
Chris Lattner	6fbe704932	Implement (but don't enable) PR6724 and rdar://6295824. In short, we have RefreshCallGraph detect when a function pass devirtualizes a call, and have CGSCCPassMgr iterate (up to a count) when this happens. This allows (in the example) GVN to devirtualize the call in foo, then the inliner to inline it away. This is not currently enabled because I haven't done any analysis on the (potentially substantial) code size or performance impact of doing this, and guess what, it exposes callgraph updating bugs in various passes. This is progress though, and you can play with it by passing -max-cg-scc-iterations=5 to opt. llvm-svn: 101973	2010-04-21 00:47:40 +00:00
Dan Gohman	4398308fa7	Revert r101471. For tight recursive functions which have multiple recursive callsites, inlining can reduce the number of calls by exponential factors, as it does in MultiSource/Benchmarks/Olden/treeadd. More involved heuristics will be needed. llvm-svn: 101969	2010-04-21 00:43:30 +00:00
Benjamin Kramer	395857705f	PR6880: Don't dereference CallsExternalNode if it's NULL. llvm-svn: 101897	2010-04-20 12:16:50 +00:00
Chris Lattner	c707fa9651	move some select simplifications out of instcombine into inst simplify. No functionality change. llvm-svn: 101873	2010-04-20 05:32:14 +00:00
Chris Lattner	aedb8a3535	make CallGraphNode dtor abort if a node is deleted when there are still references to it. llvm-svn: 101847	2010-04-20 00:47:34 +00:00
Dan Gohman	e637ff5e9a	Remove the Expr member from IVUsers. Instead of remembering the expression, just ask ScalarEvolution for it on demand. This helps IVUsers be more robust in the case of expressions changing underneath it. This fixes PR6862. llvm-svn: 101819	2010-04-19 21:48:58 +00:00
Chris Lattner	67e70971cc	fix PR6858: a dangling pointer use bug which was caused by switching CachedFunctionInfo from a std::map to a ValueMap (which is implemented in terms of a DenseMap). DenseMap has different iterator invalidation semantics than std::map. This should hopefully fix the dragonegg builder. llvm-svn: 101658	2010-04-17 17:57:56 +00:00
Chris Lattner	cea19a475b	a bunch of cleanups and tweaks, no functionality changes. llvm-svn: 101657	2010-04-17 17:55:00 +00:00
Chris Lattner	7c4f14bf90	reenable r101565, removing a problematic assertion. CGSCC can delete nodes in regions of the callgraph that have already been visited. If new CG nodes are allocated to the same pointer, we shouldn't abort, just handle it correctly by assigning a new number. This should restore stability by removing invalidated pointers that will be reused from the densemap in the iterator. llvm-svn: 101628	2010-04-17 07:17:19 +00:00
Chris Lattner	dddbcba270	disable r101565: an assert is getting triggered. More lurking badness no doubt. llvm-svn: 101583	2010-04-17 00:05:36 +00:00
Eric Christopher	7258dcd77f	Revert 101465, it broke internal OpenGL testing. Probably the best way to know that all getOperand() calls have been handled is to replace that API instead of updating. llvm-svn: 101579	2010-04-16 23:37:20 +00:00
Chris Lattner	de023a3c1d	building on the new CallGraphSCC abstraction, teach CallGraphSCCPassManager to keep the node entries in scc_iterator up to date instead of dangling as the SCC mutates. This is a really terrible problem which was causing -g to affect codegen because it would permute the memory image of the compiler process. Thanks to Dale for expertly hunting it down. llvm-svn: 101565	2010-04-16 23:04:30 +00:00
Chris Lattner	5518b81a98	move ReplaceNode out of line, rename scc_iterator::fini -> isAtEnd(). No functionality change. llvm-svn: 101562	2010-04-16 22:59:24 +00:00
Chris Lattner	4422d31b84	introduce a new CallGraphSCC class, and pass it around to CallGraphSCCPass's instead of passing around a std::vector<CallGraphNode*>. No functionality change, but now we have a much tidier interface. llvm-svn: 101558	2010-04-16 22:42:17 +00:00
Chris Lattner	6d1208fd2b	move PrintCallGraphPass out of the middle of CGPassManager. llvm-svn: 101543	2010-04-16 21:43:55 +00:00
Dan Gohman	f13f69f296	Disable inlining of recursive calls. It can complicate tailcallelim and dependent analyses, and increase code size, so doing it profitably would require more complex heuristics. llvm-svn: 101471	2010-04-16 16:01:18 +00:00
Gabor Greif	f375520f7b	reapply r101434 with a fix for self-hosting. Rotate CallInst operands, i.e. move callee to the back of the operand array. The motivation for this patch is laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101465	2010-04-16 15:33:14 +00:00
Dan Gohman	b3862ecd48	Make callIsSmall accessible as a utility function. llvm-svn: 101463	2010-04-16 15:14:50 +00:00
Dan Gohman	12293815de	Fix SCEVCommutativeExpr::print to be robust in the case of improper expression canonicalization. Its job is to print what's there, not to make judgements about it. llvm-svn: 101461	2010-04-16 15:03:25 +00:00
Gabor Greif	403e9694f9	back out r101423 and r101397, they break llvm-gcc self-host on darwin10 llvm-svn: 101434	2010-04-16 01:16:20 +00:00
Gabor Greif	33ae80bff7	reapply r101364, which has been backed out in r101368, with a fix. Rotate CallInst operands, i.e. move callee to the back of the operand array. The motivation for this patch is laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101397	2010-04-15 20:51:13 +00:00
Dan Gohman	b29cda9b3c	Fix a bunch of namespace pollution. llvm-svn: 101376	2010-04-15 17:08:50 +00:00
Dan Gohman	4e3c1139a2	Make getPredecessorWithUniqueSuccessorForBB return the unique successor in addition to the predecessor. llvm-svn: 101374	2010-04-15 16:19:08 +00:00
Gabor Greif	9fd00c7d25	back out r101364, as it trips the linux nightlybot on some clang C++ tests llvm-svn: 101368	2010-04-15 12:46:56 +00:00
Gabor Greif	aafd209632	rotate CallInst operands, i.e. move callee to the back of the operand array. The motivation for this patch is laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101364	2010-04-15 10:49:53 +00:00
Dan Gohman	0b4df0425f	Constify GetConstantStringInfo. llvm-svn: 101298	2010-04-14 22:20:45 +00:00
Gabor Greif	fefdd42644	performance: cache the dereferenced use_iterator llvm-svn: 101265	2010-04-14 18:13:29 +00:00
Dan Gohman	65de3d140d	Add a comment. llvm-svn: 101248	2010-04-14 16:08:56 +00:00
Dan Gohman	7ef0dc2163	Teach ScalarEvolution to simplify smax and umax when it can prove that one operand is always greater than another. llvm-svn: 101142	2010-04-13 16:51:03 +00:00
Dan Gohman	fe4b29180b	Minor code micro-optimizations. llvm-svn: 101141	2010-04-13 16:49:23 +00:00
Dan Gohman	ebbd05f8ce	Micro-optimize a few hot spots. llvm-svn: 101086	2010-04-12 23:08:18 +00:00
Dan Gohman	11862a6ed3	Add fast paths to ScalarEvolution::getSizeOf and getOffsetOf, as they're used a lot by getNodeForGEP, which can be called a lot. This speeds up -iv-users by around 15% on several testcases. llvm-svn: 101083	2010-04-12 23:03:26 +00:00
Tobias Grosser	4885db6f52	Remove unneeded debug in PostDominator runOnFunction(). The information is already available with "opt -analyze". The DominatorTree also does not have this in its runOnFunction, so the two now behave more consistently. llvm-svn: 101038	2010-04-12 15:32:55 +00:00
Tobias Grosser	6a5eef4067	Remove dead code in the dotty dominance tree printer. This template is not needed anymore as it was replaced by the DOTGraphTraitsViewer. llvm-svn: 101036	2010-04-12 15:02:19 +00:00
Dan Gohman	6635bb26a6	Generalize ScalarEvolution's PHI analysis to handle loops that don't have preheaders or dedicated exit blocks, as clients may not otherwise need to run LoopSimplify. llvm-svn: 101030	2010-04-12 07:49:36 +00:00
Dan Gohman	f76210ead8	Rewrite the overflow checking in the get{Signed,Unsigned}Range code for AddRecs so that it checks for overflow in the computation that it is performing, rather than just checking hasNo{Signed,Unsigned}Wrap, since those flags are for a different computation. This fixes a bug that impacts an upcoming change. llvm-svn: 101028	2010-04-12 07:39:33 +00:00
Dan Gohman	f1e40e60d3	Minor code simplification. llvm-svn: 101009	2010-04-12 02:22:30 +00:00
Dan Gohman	068b793614	Fix indentation. llvm-svn: 101001	2010-04-11 23:44:58 +00:00
Dan Gohman	07591698ce	Enhance ScalarEvolution::isKnownPredicate with support for loop conditions which are invariants. llvm-svn: 100995	2010-04-11 22:16:48 +00:00
Dan Gohman	f7f28511a9	Minor code simplification. llvm-svn: 100994	2010-04-11 22:13:11 +00:00
Dan Gohman	ae4a4148ba	When creating a ConstantRange for [n,UINT_MAX], special case n == 0, because ConstantRange(0, 0) creates an empty range rather than a full one. llvm-svn: 100993	2010-04-11 22:12:18 +00:00
Dan Gohman	008a38b1d6	Add a cast to void to show that the return value is being intentionally ignored. llvm-svn: 100984	2010-04-11 19:30:19 +00:00
Dan Gohman	7841a6ecd2	Delete a dead check. llvm-svn: 100983	2010-04-11 19:29:41 +00:00
Dan Gohman	2532856704	Delete dead code. llvm-svn: 100981	2010-04-11 19:28:47 +00:00
Dan Gohman	b50349a979	Rename isLoopGuardedByCond to isLoopEntryGuardedByCond, to emphasise that it's only testing for the entry condition, not full loop-invariant conditions. llvm-svn: 100979	2010-04-11 19:27:13 +00:00
Dan Gohman	3295a6e5bc	When emitting code for an add, don't force a SCEVUnknown wrapper around a hoisted intermediate result if the intermediate result isn't an Instruction. llvm-svn: 100884	2010-04-09 19:14:31 +00:00
Dan Gohman	394b624215	Add a comment. llvm-svn: 100874	2010-04-09 18:20:03 +00:00
Dan Gohman	9ba08a4631	Add several more lint checks. llvm-svn: 100841	2010-04-09 01:39:53 +00:00
Dan Gohman	ee6451dca1	Fix a bug in IVUsers which was permitting non-affine addrecs to be sent to LSR, which it isn't prepared to handle. llvm-svn: 100839	2010-04-09 01:22:56 +00:00
Dan Gohman	7808d490d3	Add a few more lint checks. llvm-svn: 100825	2010-04-08 23:05:57 +00:00
Dan Gohman	4ce1fb1448	Add variants of ult, ule, etc. which take a uint64_t RHS, for convenience. llvm-svn: 100824	2010-04-08 23:03:40 +00:00
Ted Kremenek	7ffb294c5b	Update CMake build. llvm-svn: 100802	2010-04-08 18:52:18 +00:00
Dan Gohman	98bc4371c7	Add a -lint pass which checks for common sources of undefined or likely unintended behavior. llvm-svn: 100798	2010-04-08 18:47:09 +00:00
Dan Gohman	cb45bd9cb3	Pointers to zero-sized objects don't point to overlapping objects. llvm-svn: 100789	2010-04-08 18:11:50 +00:00
Gabor Greif	64d8d1a022	clean up algorithm and remove operand order assumptions llvm-svn: 100780	2010-04-08 16:46:24 +00:00
Dan Gohman	883105485b	Revert this change from a while ago; ScalarEvolution shouldn't analyze undef as 0, since it can't force other analyses to interpret the undef in the same way. llvm-svn: 100749	2010-04-08 05:58:24 +00:00
Benjamin Kramer	33f6413c58	Update cmake build. llvm-svn: 100713	2010-04-07 23:01:37 +00:00
Dan Gohman	d006ab90dd	Generalize IVUsers to track arbitrary expressions rather than expressions explicitly split into stride-and-offset pairs. Also, add the ability to track multiple post-increment loops on the same expression. This refines the concept of "normalizing" SCEV expressions used for post-increment uses, and introduces a dedicated utility routine for normalizing and denormalizing expressions. This fixes the expansion of expressions which are post-increment users of more than one loop at a time. More broadly, this takes LSR another step closer to being able to reason about more than one loop at a time. llvm-svn: 100699	2010-04-07 22:27:08 +00:00
Dan Gohman	91ce8e9a5c	Add a const qualifier. llvm-svn: 100515	2010-04-06 01:31:12 +00:00
David Greene	9b063df40b	Ok, third time's the charm. No changes from last time except the CMake source addition. Apparently the buildbots were wrong about failures. --- Add some switches helpful for debugging: -print-before=<Pass Name> Dump IR before running pass <Pass Name>. -print-before-all Dump IR before running each pass. -print-after-all Dump IR after running each pass. These are helpful when tracking down a miscompilation. It is easy to get IR dumps and do diffs on them, etc. To make this work well, add a new getPrinterPass API to Pass so that each kind of pass (ModulePass, FunctionPass, etc.) can create a Pass suitable for dumping out the kind of object the Pass works on. llvm-svn: 100249	2010-04-02 23:17:14 +00:00
Chris Lattner	44714c9898	DebugInfoFinder::processModule was foiling my plot by materializing an MDNode for every debugloc. don't do that! :) "clang -g -S t.c" really no longer makes mdnodes for location tuples now. llvm-svn: 100224	2010-04-02 20:44:29 +00:00
Chris Lattner	915c5f9862	Switch the code generator (except the JIT) onto the new DebugLoc representation. This eliminates the 'DILocation' MDNodes for file/line/col tuples from -O0 -g codegen. This removes the old DebugLoc class, making it a typedef for DebugLoc, I'll rename NewDebugLoc next. I didn't update the JIT to use the new apis, so it will continue to work, but be as slow as before. Someone should eventually do this or, better yet, rip out the JIT debug info stuff and build the JIT on top of MC. llvm-svn: 100209	2010-04-02 19:42:39 +00:00
Evan Cheng	389525bdea	Revert 100204. It broke a bunch of tests and apparently changed what passes are run during codegen. llvm-svn: 100207	2010-04-02 19:29:15 +00:00
David Greene	8f32cb9fce	Let's try this again. Re-apply 100143 including an apparent missing <string> include. For some reason the buildbot choked on this while my builds did not. It's probably due to a difference in system headers. --- Add some switches helpful for debugging: -print-before=<Pass Name> Dump IR before running pass <Pass Name>. -print-before-all Dump IR before running each pass. -print-after-all Dump IR after running each pass. These are helpful when tracking down a miscompilation. It is easy to get IR dumps and do diffs on them, etc. To make this work well, add a new getPrinterPass API to Pass so that each kind of pass (ModulePass, FunctionPass, etc.) can create a Pass suitable for dumping out the kind of object the Pass works on. llvm-svn: 100204	2010-04-02 18:46:26 +00:00
Eric Christopher	5342ddaadf	Revert r100143. llvm-svn: 100146	2010-04-01 22:54:42 +00:00
David Greene	6789e21094	Add some switches helpful for debugging: -print-before=<Pass Name> Dump IR before running pass <Pass Name>. -print-before-all Dump IR before running each pass. -print-after-all Dump IR after running each pass. These are helpful when tracking down a miscompilation. It is easy to get IR dumps and do diffs on them, etc. To make this work well, add a new getPrinterPass API to Pass so that each kind of pass (ModulePass, FunctionPass, etc.) can create a Pass suitable for dumping out the kind of object the Pass works on. llvm-svn: 100143	2010-04-01 22:43:57 +00:00
Benjamin Kramer	f4512a3a9d	s/getNameStr/getName/ llvm-svn: 100011	2010-03-31 16:06:22 +00:00
Chris Lattner	743bdca344	microoptimize this hot method, also making it more consistent with other similar ones. llvm-svn: 99997	2010-03-31 05:53:47 +00:00
Chris Lattner	707431cf26	reapply my timer rewrite with a change for PassManager to store timers by pointer instead of by-value. llvm-svn: 99871	2010-03-30 04:03:22 +00:00
Chris Lattner	ec8ef9b643	revert r99862 which is causing FNT failures. llvm-svn: 99870	2010-03-30 03:57:00 +00:00
Chris Lattner	57a0542397	fairly major rewrite of various timing related stuff. llvm-svn: 99862	2010-03-30 02:38:19 +00:00
Gabor Greif	6c6b2fd2b2	rename pred_const_iterator to const_pred_iterator for consistency's sake llvm-svn: 99567	2010-03-25 23:25:28 +00:00
Gabor Greif	c78d720f02	rename use_const_iterator to const_use_iterator for consistency's sake llvm-svn: 99564	2010-03-25 23:06:16 +00:00
Eric Christopher	b1a382d8b9	Reapply r99451 with a fix to move the NoInline check to the cost functions instead of InlineFunction. llvm-svn: 99483	2010-03-25 04:49:10 +00:00
Gabor Greif	a2fbc0ae1b	Finally land the InvokeInst operand reordering. I have audited all getOperandNo calls now, fixing hidden assumptions. CallSite related ugliness will be eliminated successively. Note this patch has a long and grievous history; for all the back-and-forths, have a look at CallSite.h's log. llvm-svn: 99399	2010-03-24 13:21:49 +00:00
Dan Gohman	dcddd5701c	Don't back past debug info intrinsics; SCEVExpander's strategy for ignoring debug info intrinsics everywhere else is to advance past them, and it needs to be consistent. llvm-svn: 99332	2010-03-23 21:53:22 +00:00
Gabor Greif	e1517a084f	backing out r99170 because it still fails on clang-x86_64-darwin10-fnt llvm-svn: 99171	2010-03-22 09:11:00 +00:00
Gabor Greif	7a743e15e3	Now that hopefully all direct accesses to InvokeInst operands are fixed we can reapply the InvokeInst operand reordering patch. (see r98957). llvm-svn: 99170	2010-03-22 08:28:00 +00:00
Dan Gohman	89d4e3c3fd	Fix more places to more thoroughly ignore debug intrinsics. This fixes use-before-def errors in SCEVExpander-produced code in sqlite3 when debug info with optimization is enabled, though the testcases for this are dependent on use-list order. llvm-svn: 99001	2010-03-19 21:51:03 +00:00
Gabor Greif	6c56ed847e	back out r98957, it broke http://smooshlab.apple.com:8010/builders/clang-x86_64-darwin10-fnt/builds/703 in the nightly test suite llvm-svn: 98958	2010-03-19 13:50:02 +00:00
Gabor Greif	8335f9c0bf	Recommit r80858 again (which has been backed out in r80871). This time I did a self-hosted bootstrap on Linux x86-64, with no problems. Let's see how darwin 64-bit self-hosting goes. At the first sign of failure I'll back this out. Maybe the valgrind bots give me a hint of what may be wrong (if at all). llvm-svn: 98957	2010-03-19 11:55:53 +00:00
Anton Korobeynikov	065232fcd1	FP16 constfolding llvm-svn: 98911	2010-03-19 00:36:35 +00:00
Dan Gohman	a5ca578384	Simplify this code. llvm-svn: 98853	2010-03-18 19:34:33 +00:00
Dan Gohman	01c65a2622	Define placement new wrappers for BumpPtrAllocator and RecyclingAllocator to allow client code to be simpler, and simplify several clients. llvm-svn: 98847	2010-03-18 18:49:47 +00:00
Dan Gohman	6556c8962c	Add the ability to "intern" FoldingSetNodeID data into a BumpPtrAllocator-allocated region to allow it to be stored in a more compact form and to avoid the need for a non-trivial destructor call. Use this new mechanism in ScalarEvolution instead of FastFoldingSetNode to avoid leaking memory in the case where a FoldingSetNodeID uses heap storage, and to reduce overall memory usage. llvm-svn: 98829	2010-03-18 16:16:38 +00:00
Dan Gohman	0052449e1a	Reapply r98755 with a thinko which miscompiled gengtype fixed. llvm-svn: 98793	2010-03-18 01:17:13 +00:00
Dan Gohman	d2abecaeea	Revert 98755, which may be causing trouble. llvm-svn: 98762	2010-03-17 19:54:53 +00:00
Dan Gohman	5c9b0e1a6e	Change SCEVNAryExpr's operand array from a SmallVector to a plain pointer and length, and allocate the arrays in ScalarEvolution's BumpPtrAllocator, so that they get released when their owning SCEV gets released. SCEVs are immutable, so they don't need to worry about operand array resizing. This fixes a memory leak reported in PR6637. llvm-svn: 98755	2010-03-17 18:51:01 +00:00
Duncan Sands	145584e037	Treat copysignl like the other copysign functions. llvm-svn: 98542	2010-03-15 14:01:44 +00:00
Evan Cheng	2a65429671	Fix a typo in ValueTracking that's causing instcombine to delete needed shift instructions. llvm-svn: 98416	2010-03-13 02:20:29 +00:00
Devang Patel	93142469ac	Do not ignore arg_size() impact while counting bb instructions. llvm-svn: 98408	2010-03-13 01:05:02 +00:00
Devang Patel	877d0355bd	Remove extra parameter. llvm-svn: 98403	2010-03-13 00:45:31 +00:00
Devang Patel	ad591dc6af	Do not overestimate code size reduction in presence of debug info. Use CodeMetrics.analyzeBasicBlock() to estimate BB size. llvm-svn: 98401	2010-03-13 00:10:20 +00:00
Duncan Sands	8c35506fbd	When constant folding GEP of GEP, do not crash if an index of the inner GEP is not a ConstantInt. llvm-svn: 98359	2010-03-12 17:55:20 +00:00
Dan Gohman	2734ebd37f	Add a DominatorTree argument to isLCSSA so that it doesn't have to compute a set of reachable blocks for itself each time it is called, which is fairly frequently. llvm-svn: 98179	2010-03-10 19:38:49 +00:00
Dan Gohman	474e488c06	Constant-fold GEP-of-GEP into a single GEP. llvm-svn: 98178	2010-03-10 19:31:51 +00:00
Dan Gohman	69451a0950	Avoid analyzing instructions in blocks not reachable from the entry block. They are lots of trouble, and they don't matter. This fixes PR6559. llvm-svn: 98103	2010-03-09 23:46:50 +00:00
Jakob Stoklund Olesen	b495cad7ca	Try to keep the cached inliner costs around for a bit longer for big functions. The Caller cost info would be reset every time a callee was inlined. If the caller has lots of calls and there is some mutual recursion going on, the caller cost info could be calculated many times. This patch reduces inliner runtime from 240s to 0.5s for a function with 20000 small function calls. This is a more conservative version of r98089 that doesn't break the clang test CodeGenCXX/temp-order.cpp. That test relies on rather extreme inlining for constant folding. llvm-svn: 98099	2010-03-09 23:02:17 +00:00
Jakob Stoklund Olesen	4497475905	Revert r98089, it was breaking a clang test. llvm-svn: 98094	2010-03-09 22:43:37 +00:00
Jakob Stoklund Olesen	741dec43e4	Try to keep the cached inliner costs around for a bit longer for big functions. The Caller cost info would be reset every time a callee was inlined. If the caller has lots of calls and there is some mutual recursion going on, the caller cost info could be calculated many times. This patch reduces inliner runtime from 240s to 0.5s for a function with 20000 small function calls. llvm-svn: 98089	2010-03-09 22:17:11 +00:00
Jakob Stoklund Olesen	5fba36cc1b	Permit inlining into huge functions. This heuristic is ancient, and inlining can sometimes help reduce function size. llvm-svn: 98088	2010-03-09 22:17:06 +00:00
Dan Gohman	93452cebda	Make isLCSSA ignore uses in blocks not reachable from the entry block, as LCSSA no longer transforms such uses. llvm-svn: 98033	2010-03-09 01:53:33 +00:00
Dale Johannesen	ace75dff75	Another place where debug info affected codegen. llvm-svn: 98026	2010-03-09 01:08:11 +00:00
Devang Patel	59445dbf78	Start using DIFile. See updated SourceLevelDebugging.html for more information. This patch updates LLVMDebugVersion to 8. Debug info descriptors encoded using LLVMDebugVersion 7 are supported. Corresponding llvmgcc and clang FE commits are required. llvm-svn: 98020	2010-03-09 00:44:10 +00:00
Devang Patel	2e520f6378	Introduce DIFile. This will be used to represent header files and source file(s) in debug info. llvm-svn: 97994	2010-03-08 22:27:22 +00:00
Devang Patel	8119fe87d8	Derive DIType from DIScope. This simplifies getContext() where for members the context is a type. This also eliminates need of CompileUnitMaps maintained by dwarf writer. llvm-svn: 97990	2010-03-08 22:02:50 +00:00
Devang Patel	4bd5f8ceca	Remove DbgNode checks in constructor. Debug descriptors are intended to be light weight wrappers. llvm-svn: 97988	2010-03-08 21:32:10 +00:00
Devang Patel	3b548aa8e2	Avoid using DIDescriptor.isNull(). This is a first step towards eliminating checks in Descriptor constructors. llvm-svn: 97975	2010-03-08 20:52:55 +00:00
Devang Patel	bc97f6b757	Revert r97947. llvm-svn: 97963	2010-03-08 19:20:38 +00:00
Devang Patel	fe28599f6f	Avoid using DIDescriptor.isNull(). This is a first step towards eliminating unnecessary constructor checks in light weight DIDescriptor wrappers. llvm-svn: 97947	2010-03-08 18:25:48 +00:00
Dale Johannesen	066b8ea590	Fix another case where LSR was affected by debug info. llvm-svn: 97865	2010-03-06 02:45:26 +00:00
Dale Johannesen	f5cc1cdc65	Fix a case where LSR is sensitive to debug info. llvm-svn: 97830	2010-03-05 21:12:40 +00:00
Eric Christopher	4899cbc77d	Move GetStringLength and helper from SimplifyLibCalls to ValueTracking. No functionality change. llvm-svn: 97793	2010-03-05 06:58:57 +00:00
Chris Lattner	3afc0721c7	fix incorrect folding of icmp with undef, PR6481. llvm-svn: 97659	2010-03-03 19:46:03 +00:00
Dan Gohman	29707de4fe	Make SCEVExpander and LSR more aggressive about hoisting expressions out of loops. llvm-svn: 97642	2010-03-03 05:29:13 +00:00
Dan Gohman	2850b41412	Revert r97580; that's not the right way to fix this. llvm-svn: 97639	2010-03-03 04:36:42 +00:00
Dan Gohman	d55f574589	When expanding an expression such as (A + B + C + D), sort the operands by loop depth and emit loop-invariant subexpressions outside of loops. This speeds up MultiSource/Applications/viterbi and others. llvm-svn: 97580	2010-03-02 19:32:21 +00:00
Dan Gohman	52f5563973	Non-affine post-inc SCEV expansions have more code which must be emitted after the increment. Make sure the insert position reflects this. This fixes PR6453. llvm-svn: 97537	2010-03-02 01:59:21 +00:00
Ted Kremenek	5c74a4b00b	Update CMake build. llvm-svn: 97488	2010-03-01 19:42:47 +00:00
Chris Lattner	5ea3e65929	remove anders-aa from mainline, it isn't maintained and is tantalizing enough that people keep trying to use it. llvm-svn: 97483	2010-03-01 19:24:17 +00:00
Dan Gohman	904d34c90f	Add a comment. llvm-svn: 97459	2010-03-01 17:56:04 +00:00
Dan Gohman	8b0a419eb1	Spelling fixes. llvm-svn: 97453	2010-03-01 17:49:51 +00:00
Dan Gohman	96d45008a6	Fix a missing newline in debug output. llvm-svn: 97449	2010-03-01 17:42:55 +00:00
Dan Gohman	a9c205cc88	Make LoopSimplify change conditional branches in loop exiting blocks which branch on undef to branch on a boolean constant for the edge exiting the loop. This helps ScalarEvolution compute trip counts for loops. Teach ScalarEvolution to recognize single-value PHIs, when safe, and ForgetSymbolicName to forget such single-value PHI nodes as appropriate. llvm-svn: 97126	2010-02-25 06:57:05 +00:00
Dan Gohman	4aad750333	ConstantFoldInstOperands can theoretically return null if it didn't fold anything. llvm-svn: 97049	2010-02-24 19:31:47 +00:00
Dan Gohman	007f5041a2	Simplify this code; these casts aren't necessary. llvm-svn: 97048	2010-02-24 19:31:06 +00:00
Dan Gohman	ba820344e3	Convert a few more backedge-taken count functions to use BackedgeTakenInfo. llvm-svn: 97042	2010-02-24 17:31:30 +00:00
Daniel Dunbar	693ea89214	Reapply r97010, the speculative revert failed. llvm-svn: 97036	2010-02-24 08:48:04 +00:00
Daniel Dunbar	0a2031e5b6	Speculatively revert r97010, "Add an argument to PHITranslateValue to specify the DominatorTree. ...", in hopes of restoring poor old PPC bootstrap. llvm-svn: 97027	2010-02-24 06:55:22 +00:00
Bob Wilson	66e58ac742	Add an argument to PHITranslateValue to specify the DominatorTree. If this argument is non-null, pass it along to PHITranslateSubExpr so that it can prefer using existing values that dominate the PredBB, instead of just blindly picking the first equivalent value that it finds on a uselist. Also when the DominatorTree is specified, have PHITranslateValue filter out any result that does not dominate the PredBB. This is basically just refactoring the check that used to be in GetAvailablePHITranslatedSubExpr and also in GVN. Despite my initial expectations, this change does not affect the results of GVN for any testcases that I could find, but it should help compile time. Before this change, if PHITranslateSubExpr picked a value that does not dominate, PHITranslateWithInsertion would then insert a new value, which GVN would later determine to be redundant and would replace. By picking a good value to begin with, we save GVN the extra work of inserting and then replacing a new value. llvm-svn: 97010	2010-02-24 01:39:00 +00:00
Dan Gohman	8a0eb36d23	Remove the code which constant-folded ptrtoint(inttoptr(x)+c) to getelementptr. Despite only doing so in the case where x is a known array object and c can be converted to an index within range, this could still be invalid if c is actually the address of an object allocated outside of LLVM. Also, SCEVExpander, the original motivation for this code, has since been improved to avoid inttoptr+ptrtoint in more cases. llvm-svn: 96950	2010-02-23 16:35:41 +00:00
Dan Gohman	6c5ac6de5c	Canonicalize ConstantInts to the right operand of commutative operators. The test difference is just due to the multiplication operands being commuted (and thus requiring a more elaborate match). In optimized code, that expression would be folded. llvm-svn: 96816	2010-02-22 22:43:23 +00:00
Dan Gohman	ebf57b06ea	Minor formatting cleanup. llvm-svn: 96808	2010-02-22 22:07:27 +00:00
Dan Gohman	8c16b38262	Remove unused variables and parameters. llvm-svn: 96780	2010-02-22 04:11:59 +00:00
Dan Gohman	754e4a9801	Constant-fold certain comparisons with infinity and negative infinity. llvm-svn: 96777	2010-02-22 04:06:03 +00:00
Dan Gohman	cf9c64e6e3	Add a comment. llvm-svn: 96688	2010-02-19 18:49:22 +00:00
Dan Gohman	6b1e2a829d	Teach ScalarEvolution how to compute a tripcount for a loop with true or false as its exit condition. These are usually eliminated by SimplifyCFG, but they may be left around during a pass which wishes to preserve the CFG. llvm-svn: 96683	2010-02-19 18:12:07 +00:00
Dale Johannesen	1d6827adef	recommit 96626, evidence that it broke things appears to be spurious llvm-svn: 96662	2010-02-19 07:14:22 +00:00
Dale Johannesen	1f790c28d0	Revert 96626, which causes build failure on ppc Darwin. llvm-svn: 96653	2010-02-19 01:54:37 +00:00
Dan Gohman	60b3326435	Indvars needs to explicitly notify ScalarEvolution when it is replacing a loop exit value, so that if a loop gets deleted, ScalarEvolution isn't stuck holding on to dangling SCEVAddRecExprs for that loop. This fixes PR6339. llvm-svn: 96626	2010-02-18 23:26:33 +00:00
Dan Gohman	c70e994364	Fix SCEVExpander's existing PHI reuse checking to recognize the case where there are loop-invariant instructions somehow left inside the loop, and in a position where they won't dominate the IV increment position. llvm-svn: 96448	2010-02-17 02:39:31 +00:00
Dan Gohman	cf39be32bf	Fold bswap(undef) to undef. llvm-svn: 96432	2010-02-17 00:54:58 +00:00
Devang Patel	7c7cfbbc38	Use line and column number to distinguish two lexical blocks at the same level. llvm-svn: 96395	2010-02-16 21:39:34 +00:00
Bob Wilson	92cdb6eec5	Split critical edges as needed for load PRE. llvm-svn: 96378	2010-02-16 19:51:59 +00:00
Duncan Sands	19d0b47b1f	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
Dan Gohman	148a972b67	When reusing an existing PHI node in a loop, be even more strict about the requirements. llvm-svn: 96301	2010-02-16 00:20:08 +00:00
Duncan Sands	9dff9bec31	Uniformize the names of type predicates: rather than having isFloatTy and isInteger, we now have isFloatTy and isIntegerTy. Requested by Chris! llvm-svn: 96223	2010-02-15 16:12:20 +00:00
Dan Gohman	fefbff9cd8	When testing whether a given SCEV depends on a temporary symbolic name, test whether the SCEV itself is that temporary symbolic name, in addition to checking whether the symbolic name appears as a possibly-indirect operand. llvm-svn: 96216	2010-02-15 10:28:37 +00:00
Dan Gohman	4d8feb11dd	When restoring a saved insert location, check to see if the saved insert location has become an "inserted" instruction since the time it was saved. If so, advance to the first non-"inserted" instruction. llvm-svn: 96203	2010-02-15 00:21:43 +00:00
Dan Gohman	6b7517342e	In rememberInstruction, if the value being remembered is the current insertion point, advance the current insertion point. This avoids a use-before-def situation in a testcase extracted from clang which is difficult to reduce to a reasonable-sized regression test. llvm-svn: 96151	2010-02-14 03:12:47 +00:00
Dan Gohman	f446713fd0	Simplify this code; no need for a custom subclass if it doesn't need to override anything from the parent class. llvm-svn: 96150	2010-02-14 02:48:58 +00:00
Dan Gohman	fe873e7c10	Override dominates and properlyDominates for SCEVAddRecExpr, as a SCEVAddRecExpr doesn't necessarily dominate blocks merely dominated by all of its operands. This fixes an abort compiling 403.gcc. llvm-svn: 96056	2010-02-13 00:19:39 +00:00
Dan Gohman	1a8674e60b	Fix a case of mismatched types in an Add that turned up in 447.dealII. llvm-svn: 96007	2010-02-12 20:39:25 +00:00
Dan Gohman	45774ce0ad	Reapply the new LoopStrengthReduction code, with compile time and bug fixes, and with improved heuristics for analyzing foreign-loop addrecs. This change also flattens IVUsers, eliminating the stride-oriented groupings, which makes it easier to work with. llvm-svn: 95975	2010-02-12 10:34:29 +00:00
Dan Gohman	c42c5243a1	Use an AssemblyAnnotatorWriter to clean up IVUsers' debug output. The "uses=" comments are just clutter in this context. llvm-svn: 95799	2010-02-10 20:42:37 +00:00
Dan Gohman	4a618827de	Fix "the the" and similar typos. llvm-svn: 95781	2010-02-10 16:03:48 +00:00
Dan Gohman	6f9646e1c5	Add const qualifiers. llvm-svn: 95582	2010-02-08 22:00:06 +00:00
Devang Patel	6efc8e5120	Set DW_AT_artificial only if argument is marked as artificial. llvm-svn: 95461	2010-02-06 01:02:37 +00:00
Jakob Stoklund Olesen	b0b2297066	Update CodeMetrics to count 'big' function calls explicitly. llvm-svn: 95453	2010-02-05 23:21:18 +00:00
Dan Gohman	9946b5109c	Change the argument to getIntegerSCEV to be an int64_t, rather than int. This will make it more convenient for LSR, which does a lot of things with int64_t offsets. llvm-svn: 95281	2010-02-04 02:43:51 +00:00
Devang Patel	999b499024	Provide interface to identify artificial methods. llvm-svn: 95240	2010-02-03 19:57:19 +00:00
Dan Gohman	7e5f1b2773	Various code simplifications. llvm-svn: 95044	2010-02-02 01:38:49 +00:00
Bill Wendling	c5829c4a50	Add "dump" method to IVUsersOneStride. llvm-svn: 95022	2010-02-01 22:51:23 +00:00
Dan Gohman	e5e1b7b05a	Generalize target-independent folding rules for sizeof to handle more cases, and implement target-independent folding rules for alignof and offsetof. Also, reassociate reassociative operators when it leads to more folding. Generalize ScalarEvolution's isOffsetOf to recognize offsetof on arrays. Rename getAllocSizeExpr to getSizeOfExpr, and getFieldOffsetExpr to getOffsetOfExpr, for consistency with analogous ConstantExpr routines. Make the target-dependent folder promote GEP array indices to pointer-sized integers, to make implicit casting explicit and exposed to subsequent folding. And add a bunch of testcases for this new functionality, and a bunch of related existing functionality. llvm-svn: 94987	2010-02-01 18:27:38 +00:00
Devang Patel	7f8be9ba95	Before inserting llvm.dbg.declare intrinsic at the end of a basic block, check whether the basic block has a terminator or not. This API is used by clang and the test case is test/CodeGen/debug-info-crash.c in clang module. llvm-svn: 94820	2010-01-29 18:30:57 +00:00
Duncan Sands	26cd6bd0b0	It looks like the changes to the SRem logic of SimplifyDemandedUseBits (fix for PR6165) are needed here too. llvm-svn: 94801	2010-01-29 06:18:37 +00:00
Dan Gohman	9f4ea22c88	Check Type::isSized before calling ScalarEvolution::getAllocSizeExpr, rather than after. llvm-svn: 94742	2010-01-28 06:32:46 +00:00
Dan Gohman	cf9138307d	Remove SCEVAllocSizeExpr and SCEVFieldOffsetExpr, and in their place use plain SCEVUnknowns with ConstantExpr::getSizeOf and ConstantExpr::getOffsetOf constants. This eliminates a bunch of special-case code. Also add code for pattern-matching these expressions, for clients that want to recognize them. Move ScalarEvolution's logic for expanding array and vector sizeof expressions into an element count times the element size, to expose the multiplication to subsequent folding, into the regular constant folder. llvm-svn: 94737	2010-01-28 02:15:55 +00:00
Jakob Stoklund Olesen	0234628284	Fix inline cost predictions with SCIENCE. After running a batch of measurements, it is clear that the inliner metrics need some adjustments: Own argument bonus: 20 -> 5 Outgoing argument penalty: 0 -> 5 Alloca bonus: 10 -> 5 Constant instr bonus: 7 -> 5 Dead successor bonus: 40 -> 5*(avg instrs/block) The new cost metrics are generally 25 points higher than before, so we may need to move thresholds. With this change, InlineConstants::CallPenalty becomes a political correction: if (!isa<IntrinsicInst>(II) && !callIsSmall(CS.getCalledFunction())) NumInsts += InlineConstants::CallPenalty + CS.arg_size(); The code size is accurately modelled by CS.arg_size(). CallPenalty is added because calls tend to take a long time, so it may not be worth it to inline a function with lots of calls. All of the political corrections are in the InlineConstants namespace: IndirectCallBonus, CallPenalty, LastCallToStaticBonus, ColdccPenalty, NoreturnPenalty. llvm-svn: 94615	2010-01-26 23:21:56 +00:00
Jakob Stoklund Olesen	87256d8fe1	Revert test polarity to match comment and desired outcome. Remove undeserved bonus. A GEP with all constant indices is already considered free by analyzeBasicBlock(), so don't give it an extra bonus in CountCodeReductionForAlloca(). This patch should remove a small positive bias toward inlining functions with variable-index GEPs, and remove a smaller negative bias from functions with all-constant index GEPs. llvm-svn: 94591	2010-01-26 21:31:35 +00:00
Jakob Stoklund Olesen	832e79ca32	Remove dead code. Functions containing indirectbr are marked NeverInline by analyzeBasicBlock(), so there is no point in giving indirectbr special treatment in CountCodeReductionForConstant. It is never called. No functional change intended. llvm-svn: 94590	2010-01-26 21:31:30 +00:00
Jakob Stoklund Olesen	cab470b17a	Skip calculation of ArgumentWeights if it will never be used. Save a few bytes by allocating the correct size vector. No functional change intended. llvm-svn: 94589	2010-01-26 21:31:24 +00:00
Devang Patel	f4b25d6d7b	Add extra element to composite type. This new element will be used to record c++ class that holds current class's vtable. llvm-svn: 94586	2010-01-26 21:14:59 +00:00
Dan Gohman	85be4333ad	Make the unsigned-range code more consistent with the signed-range code, and clean up some loose ends. llvm-svn: 94572	2010-01-26 19:19:05 +00:00
Dan Gohman	a01418d75a	Fix a typo in a comment that Duncan noticed. llvm-svn: 94562	2010-01-26 18:32:54 +00:00
Dan Gohman	fdb744b203	Rename ItCount to BECount, since it holds a backedge-taken count rather than an iteration count. llvm-svn: 94549	2010-01-26 16:46:18 +00:00
Dan Gohman	51aaf02821	Fix the ceiling-division used in computing the MaxBECount so that it doesn't have trouble with an intermediate add overflowing. Also, be more conservative about the case where the induction variable in an SLT loop exit can step past the RHS of the SLT and overflow in a single step. Make getSignedRange more aggressive, to recover for some common cases which the above fixes pessimized. This addresses rdar://7561161. llvm-svn: 94512	2010-01-26 04:40:18 +00:00
Victor Hernandez	907bdbb6be	Assert when debug intrinsic insert functions are passed empty arguments llvm-svn: 94491	2010-01-26 02:07:38 +00:00
Chris Lattner	823aed16f9	make -fno-rtti the default unless a directory builds with REQUIRES_RTTI. llvm-svn: 94378	2010-01-24 20:43:08 +00:00
Devang Patel	0e44c51a3e	Avoid using "Type" as the variable name. llvm-svn: 94262	2010-01-23 00:26:28 +00:00
Victor Hernandez	73b0f99f17	Make sure ValueFn starts off empty llvm-svn: 94256	2010-01-23 00:03:28 +00:00
Chris Lattner	7ba0661f27	Stop building RTTI information for most llvm libraries. Notable missing ones are libsupport, libsystem and libvmcore. libvmcore is currently blocked on bugpoint, which uses EH. Once it stops using EH, we can switch it off. This #if 0's out 3 unit tests, because gtest requires RTTI information. Suggestions welcome on how to fix this. llvm-svn: 94164	2010-01-22 06:49:46 +00:00
Chris Lattner	e0701f987f	drop the pass name from the output. llvm-svn: 94158	2010-01-22 05:52:51 +00:00
Chris Lattner	0b1c7235aa	eliminate dynamic_cast from this file. llvm-svn: 94157	2010-01-22 05:46:59 +00:00
Chris Lattner	9efd4fcceb	eliminate a bunch more unneeded dynamic_cast's. llvm-svn: 94156	2010-01-22 05:37:10 +00:00
Chris Lattner	2fa26e5fd0	eliminate a bunch of dynamic_cast's. llvm-svn: 94155	2010-01-22 05:24:46 +00:00
Dan Gohman	60a9bf414e	When re-using an existing cast for a user, it's still necessary to call rememberInstruction so that future users of that user will be inserted in the correct position. This fixes the Darwin selfhost. llvm-svn: 94070	2010-01-21 10:08:42 +00:00
Dan Gohman	51ad99d2c5	Re-implement the main strength-reduction portion of LoopStrengthReduction. This new version is much more aggressive about doing "full" reduction in cases where it reduces register pressure, and also more aggressive about rewriting induction variables to count down (or up) to zero when doing so reduces register pressure. It currently uses fairly simplistic algorithms for finding reuse opportunities, but it introduces a new framework that allows it to combine multiple strategies at once to form hybrid solutions, instead of doing all full-reduction or all base+index. llvm-svn: 94061	2010-01-21 02:09:26 +00:00
Chris Lattner	da363d9af8	adopt getAdjustedAnalysisPointer in a few more passes. llvm-svn: 94018	2010-01-20 20:09:02 +00:00
Chris Lattner	3b03327c14	adopt getAdjustedAnalysisPointer in two more passes. llvm-svn: 94017	2010-01-20 19:53:32 +00:00
Chris Lattner	397af34e6f	adopt getAdjustedAnalysisPointer in BasicCallGraph. llvm-svn: 94015	2010-01-20 19:51:46 +00:00
Chris Lattner	af362f014d	add some new methods to adjust this pointers. Not used yet. llvm-svn: 94013	2010-01-20 19:26:14 +00:00
Victor Hernandez	20425abea4	Avoid unnecessary Elts array llvm-svn: 93978	2010-01-20 05:44:11 +00:00
Dan Gohman	8d67d2f5f8	Add a comment and tidy up some whitespace. llvm-svn: 93932	2010-01-19 22:27:22 +00:00
Dan Gohman	510bffca45	Fix a typo and an 80-column violation in comments. llvm-svn: 93931	2010-01-19 22:26:02 +00:00
Dan Gohman	f86d904b7d	Give ScalarEvolution access to the DominatorTree. It'll need this to make more intelligent AddRec folding decisions. llvm-svn: 93930	2010-01-19 22:21:27 +00:00
Dan Gohman	d693472821	Add a new helper function to IVUsers for returning the "canonical" form of an expression. This is the expression without the post-increment adjustment made, which is useful in determining which registers will be used by the expansion. llvm-svn: 93921	2010-01-19 21:55:32 +00:00
Victor Hernandez	870913f707	Make findDbgDeclare/findDbgGlobalDeclare local static functions; avoid Elts array llvm-svn: 93764	2010-01-18 20:42:09 +00:00
Tobias Grosser	53da3f8da8	Create Generic DOTGraphTraits Printer/Viewer Move the DOTGraphTraits dotty printer/viewer templates, that were developed for the dominance tree into their own header file. This will allow reuse in future passes. llvm-svn: 93632	2010-01-16 10:56:41 +00:00
Devang Patel	c0e17df3ce	Replace DebugLocTuple with DILocation. llvm-svn: 93630	2010-01-16 06:09:35 +00:00
Victor Hernandez	b324e66f4c	Improve llvm.dbg.declare intrinsic by referring directly to the storage in its first argument, via function-local metadata (instead of via a bitcast). This patch also cleans up code that expects there to be a bitcast in the first argument and testcases that call llvm.dbg.declare. It also strips old llvm.dbg.declare intrinsics that did not pass metadata as the first argument. llvm-svn: 93531	2010-01-15 19:04:09 +00:00
Victor Hernandez	8d4904b639	Revert r93504 because older uses of llvm.dbg.declare intrinsics need to be auto-upgraded llvm-svn: 93515	2010-01-15 17:36:47 +00:00
Victor Hernandez	5d6551816b	Improve llvm.dbg.declare intrinsic by referring directly to the storage in its first argument, via function-local metadata (instead of via a bitcast). This patch also cleans up code that expects there to be a bitcast in the first argument and testcases that call llvm.dbg.declare. llvm-svn: 93504	2010-01-15 03:37:48 +00:00
Eric Christopher	f567e1b426	Pad my commit stats by reducing indentation in this now separate commit. llvm-svn: 93473	2010-01-14 23:00:10 +00:00
Eric Christopher	35dd9e8e1d	Few minor changes that were requested. No functional change. llvm-svn: 93462	2010-01-14 21:48:00 +00:00
Evan Cheng	8e670ee381	Small tweak to inline cost computation. Ext of i/fcmp results are mostly optimized away in codegen. llvm-svn: 93453	2010-01-14 21:04:31 +00:00
Eric Christopher	f3ac066418	Reduce the inlining cost of functions that contain calls to easily, and frequently optimized functions. llvm-svn: 93448	2010-01-14 20:12:34 +00:00
Victor Hernandez	9ce5b5134d	Respond to Chris' review: Make InsertDbgValueIntrinsic() and get Offset take and receive a uint64_t. Get constness correct for getVariable() and getValue(). llvm-svn: 93149	2010-01-11 07:45:19 +00:00
Chris Lattner	25963c6113	"In order to ease automatic bindings generation, it would be helpful if boolean values were distinguishable from integers. The attached patch introduces "typedef int LLVMBool;", and uses LLVMBool instead of int throughout the C API, wherever a boolean value is called for." Patch by James Y Knight! llvm-svn: 93079	2010-01-09 22:27:07 +00:00
Dan Gohman	bc694918cc	Use WriteAsOperand instead of getName() to print loop header names, so that unnamed blocks are handled. llvm-svn: 93059	2010-01-09 18:17:45 +00:00
Chris Lattner	a69f89c17a	fix PR5978 by peeling the loop so that we avoid shifting the result int by 8 for the first byte. While normally harmless, if the result is smaller than a byte, this shift is invalid. llvm-svn: 93018	2010-01-08 19:02:23 +00:00
Chris Lattner	35d3b9dcd0	teach ComputeNumSignBits to look through PHI nodes. llvm-svn: 92964	2010-01-07 23:44:37 +00:00
Duncan Sands	78376ad7e1	Partially address a README by having functionattrs consider calls to memcpy, memset and other intrinsics that only access their arguments to be readnone if the intrinsic's arguments all point to local memory. This improves the testcase in the README to readonly, but it could in theory be made readnone, however this would involve more sophisticated analysis that looks through the memcpy. llvm-svn: 92829	2010-01-06 08:45:52 +00:00
Dan Gohman	c3f2137c06	Restore dump() methods to Loop and MachineLoop. llvm-svn: 92772	2010-01-05 21:08:02 +00:00
Benjamin Kramer	d2564e3afb	Move remaining stuff to the isInteger predicate. llvm-svn: 92771	2010-01-05 21:05:54 +00:00
Benjamin Kramer	a81a6dff0d	Convert a ton of simple integer type equality tests to the new predicate. llvm-svn: 92760	2010-01-05 20:07:06 +00:00
Devang Patel	be94f23992	Remove dead debug info intrinsics. Intrinsic::dbg_stoppoint Intrinsic::dbg_region_start Intrinsic::dbg_region_end Intrinsic::dbg_func_start AutoUpgrade simply ignores these intrinsics now. llvm-svn: 92557	2010-01-05 01:10:40 +00:00
Chris Lattner	8fb74c6ee2	constant fold nasty constant expressions formed by llvm-gcc, wrapping up PR3351. llvm-svn: 92410	2010-01-02 01:22:23 +00:00
Chris Lattner	6a0ca6aa90	fix Analysis/DebugInfo.h to not include Metadata.h. Do this by moving one method out of line and eliminating redundant checks from other methods. llvm-svn: 92337	2009-12-31 03:02:08 +00:00
Chris Lattner	9b493028df	rename "elements" of metadata to "operands". "Elements" are things that occur in types. "operands" are things that occur in values. llvm-svn: 92322	2009-12-31 01:22:29 +00:00
Chris Lattner	8cb6c3476d	Optimize MDNode to coallocate the operand list immediately after the MDNode in memory. This eliminates the operands pointer and saves a new[] per node. Note that the code in DIDerivedType::replaceAllUsesWith is wrong and quite scary. A MDNode should not be RAUW'd with something else: this changes all uses of the mdnode, which may not be debug info related! Debug info should use something non-mdnode for declarations. llvm-svn: 92321	2009-12-31 01:05:46 +00:00
Chris Lattner	8e805be369	remove a bunch of unneeded functions. llvm-svn: 92263	2009-12-29 09:32:19 +00:00
Chris Lattner	047fd2fd97	major cleanups, much of this file was incorrectly indented. llvm-svn: 92262	2009-12-29 09:22:47 +00:00
Chris Lattner	573f294c00	one pass of cleanup over DebugInfo.h. Much more is still needed. llvm-svn: 92261	2009-12-29 09:15:46 +00:00
Chris Lattner	a0566979b7	Final step in the metadata API restructuring: move the getMDKindID/getMDKindNames methods to LLVMContext (and add convenience methods to Module), eliminating MetadataContext. Move the state that it maintains out to LLVMContext. llvm-svn: 92259	2009-12-29 09:01:33 +00:00
Chris Lattner	2f2aa2b067	This is a major cleanup of the instruction metadata interfaces that I asked Devang to do back on Sep 27. Instead of going through the MetadataContext class with methods like getMD() and getMDs(), just ask the instruction directly for its metadata with getMetadata() and getAllMetadata(). This includes a variety of other fixes and improvements: previously all Value*'s were bloated because the HasMetadata bit was thrown into value, adding a 9th bit to a byte. Now this is properly sunk down to the Instruction class (the only place where it makes sense) and it will be folded away somewhere soon. This also fixes some confusion in getMDs and its clients about whether the returned list is indexed by the MDID or densely packed. This is now returned sorted and densely packed and the comments make this clear. This introduces a number of fixme's which I'll follow up on. llvm-svn: 92235	2009-12-28 23:41:32 +00:00
Chris Lattner	7093946ab1	rename getMDKind -> getMDKindID, make it autoinsert if an MD Kind doesn't exist already, eliminate registerMDKind. Tidy up a bunch of random stuff. llvm-svn: 92225	2009-12-28 20:45:51 +00:00
David Greene	1495b5f887	Change dbgs() back to errs() as Chris requested. llvm-svn: 92086	2009-12-23 23:29:28 +00:00
David Greene	f790593e6e	Change dbgs() back to errs() as Chris requested. llvm-svn: 92085	2009-12-23 23:27:15 +00:00
David Greene	91b6bcefc6	Change dbgs() back to errs() for assert messages as Chris requested. llvm-svn: 92081	2009-12-23 23:14:41 +00:00
David Greene	3d64631fef	Change dbgs() back to errs() for assert messages as Chris requested. llvm-svn: 92080	2009-12-23 23:09:39 +00:00
David Greene	3fe18e72b3	Change dbgs() back to errs() for assert messages as Chris requested. llvm-svn: 92077	2009-12-23 23:00:50 +00:00
David Greene	d79102d6f2	Change dbgs() back to errs() for assert messages as Chris requested. llvm-svn: 92076	2009-12-23 22:59:29 +00:00
David Greene	2330f78075	Remove dump routine and the associated Debug.h from a header. Patch up other files to compensate. llvm-svn: 92075	2009-12-23 22:58:38 +00:00
David Greene	0295ecfea0	Change dbgs() back to errs() as Chris requested. llvm-svn: 92073	2009-12-23 22:49:57 +00:00
David Greene	452fc61a26	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92071	2009-12-23 22:35:10 +00:00
David Greene	faa00b7a7f	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92068	2009-12-23 22:28:01 +00:00
David Greene	df1c497c2f	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92067	2009-12-23 22:18:14 +00:00
David Greene	2e23db1156	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92066	2009-12-23 22:10:20 +00:00
David Greene	8135870f23	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92063	2009-12-23 21:58:29 +00:00
David Greene	cf1884c246	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92060	2009-12-23 21:48:18 +00:00
David Greene	1e4ea201d5	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92050	2009-12-23 21:27:29 +00:00
David Greene	047ac4aa79	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92048	2009-12-23 21:16:54 +00:00
David Greene	04e7ae6a57	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92046	2009-12-23 21:06:14 +00:00
David Greene	9507879bca	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92042	2009-12-23 20:52:41 +00:00
David Greene	37e9809294	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92040	2009-12-23 20:43:58 +00:00
David Greene	a7b92ee147	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92039	2009-12-23 20:34:27 +00:00
David Greene	069857ea31	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92037	2009-12-23 20:20:46 +00:00
David Greene	f8ed991e5a	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92035	2009-12-23 20:10:59 +00:00
David Greene	ba44b3ed59	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92034	2009-12-23 20:03:58 +00:00
David Greene	23e8c74d69	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92033	2009-12-23 19:51:44 +00:00
David Greene	83d478145d	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92032	2009-12-23 19:45:49 +00:00
David Greene	2ec90035e8	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92029	2009-12-23 19:27:59 +00:00
David Greene	2281998095	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92026	2009-12-23 19:21:19 +00:00
David Greene	a4375f1ffd	Convert debug messages to use dbgs(). Generally this means s/errs/dbgs/g except for certain special cases. llvm-svn: 92024	2009-12-23 19:15:13 +00:00
Chris Lattner	9b7d99eb76	The phi translated pointer can be computed when returning a partially cached result instead of stored. This reduces memdep memory usage, and also eliminates a bunch of weakvh's. This speeds up gvn on gcc.c-torture/20001226-1.c from 23.9s to 8.45s (2.8x) on a different machine than earlier. llvm-svn: 91885	2009-12-22 04:25:02 +00:00
Chris Lattner	2ee6787c1b	avoid calling extractMallocCall when it's obvious we don't have a call. This speeds up memdep ~1.5% llvm-svn: 91869	2009-12-22 01:00:32 +00:00
Chris Lattner	25bf6f8946	fix an overly conservative caching issue that caused memdep to cache a pointer as being unavailable due to phi trans in the wrong place. This would cause later queries to fail even when they didn't involve phi trans. llvm-svn: 91787	2009-12-19 21:29:22 +00:00
Dan Gohman	876f45d7d2	Fix a spello in a comment that Nick spotted. llvm-svn: 91742	2009-12-19 01:46:34 +00:00
Dan Gohman	f902c8c1b5	Eliminate unnecessary LLVMContexts. llvm-svn: 91729	2009-12-18 23:42:08 +00:00
Dan Gohman	7db230f5c9	Make this comment more precise. llvm-svn: 91722	2009-12-18 23:18:03 +00:00
Dan Gohman	51f13056bd	Revert this use of NUW/NSW also. Overflow-undefined multiplication isn't associative either. llvm-svn: 91701	2009-12-18 18:45:31 +00:00
Dan Gohman	7a2dab8826	Revert this use of NSW; this one isn't actually safe. NSW addition is not reassociative. llvm-svn: 91667	2009-12-18 03:57:04 +00:00
Dan Gohman	916fec41fb	Delete an unused variable. llvm-svn: 91659	2009-12-18 02:14:37 +00:00
Dan Gohman	b256ccfbe5	Preserve NSW information in more places. llvm-svn: 91656	2009-12-18 02:09:29 +00:00
Dan Gohman	18fa5686f6	Add Loop contains utility methods for testing whether a loop contains another loop, or an instruction. The loop form is substantially more efficient on large loops than the typical code it replaces. llvm-svn: 91654	2009-12-18 01:24:09 +00:00
Dan Gohman	cb0efecd33	Whitespace cleanups. llvm-svn: 91651	2009-12-18 01:14:11 +00:00
Dan Gohman	92c3696524	Reapply LoopStrengthReduce and IVUsers cleanups, excluding the part of 91296 that caused trouble -- the Processed list needs to be preserved for the lifetime of the pass, as AddUsersIfInteresting is called from other passes. llvm-svn: 91641	2009-12-18 00:06:20 +00:00
Evan Cheng	090ac0865a	Revert 91280-91283, 91286-91289, 91291, 91293, 91295-91296. It apparently introduced a non-deterministic behavior in the optimizer somewhere. llvm-svn: 91598	2009-12-17 09:39:49 +00:00
Chris Lattner	a3aef788ec	Fix GetConstantStringInfo to not look into MDString (it works on real data, not metadata) and fix DbgInfoPrinter to not abuse GetConstantStringInfo. llvm-svn: 91444	2009-12-15 19:34:20 +00:00
Devang Patel	1f4690c624	Add support to emit debug info for C++ namespaces. llvm-svn: 91440	2009-12-15 19:16:48 +00:00
Chris Lattner	45d040bd85	Remove isPod() from DenseMapInfo, splitting it out to its own isPodLike type trait. This is a generally useful type trait for more than just DenseMap, and we really care about whether something acts like a pod, not whether it really is a pod. llvm-svn: 91421	2009-12-15 07:26:43 +00:00
John McCall	4ea24f19f5	You can't use typedefs to declare template member specializations, and clang enforces it. llvm-svn: 91397	2009-12-15 02:35:24 +00:00
Dan Gohman	2a07fd94f1	Clear the Processed set when it is no longer used, and clear the IVUses list in releaseMemory(). llvm-svn: 91296	2009-12-14 17:35:17 +00:00
Dan Gohman	fbeec7270c	Fix a thinko; isNotAlreadyContainedIn had a built-in negative, so the condition was inverted when the code was converted to contains(). llvm-svn: 91295	2009-12-14 17:31:01 +00:00
Dan Gohman	57eb6cda7a	Drop Loop::isNotAlreadyContainedIn in favor of Loop::contains. The former was just exposing a LoopInfoBase implementation detail. llvm-svn: 91286	2009-12-14 17:06:50 +00:00
Dan Gohman	84ba039cf2	Make getUniqueExitBlocks's precondition assert more precise, to avoid spurious failures. This fixes PR5758. llvm-svn: 91147	2009-12-11 20:05:23 +00:00
Dan Gohman	220b196c94	Reuse the Threshold value to size these containers because it's currently somewhat convenient for them to have the same value. llvm-svn: 90980	2009-12-09 18:48:53 +00:00
Chris Lattner	9f9010ef47	Add a minor optimization: if we haven't changed the operands of an add, there is no need to scan the world to find the same add again. This invalidates the previous testcase, which wasn't wonderful anyway, because it needed a run of instcombine to permute the use-lists in just the right way before GVN was run (so it was really fragile). Not a big loss. llvm-svn: 90973	2009-12-09 17:27:45 +00:00
Chris Lattner	fa2e536831	fix PR5733, a case where we'd replace an add with a lexically identical binary operator that wasn't an add. In this case, a xor. Whoops. llvm-svn: 90971	2009-12-09 17:18:49 +00:00

5537 Commits