llvm-project

Commit Graph

Author	SHA1	Message	Date
Arnold Schwaighofer	76dca58c66	BasicAA: GEPs of NoAlias'ing base ptr with equivalent indices are NoAlias If we can show that the base pointers of two GEPs don't alias each other using precise analysis and the indices and base offset are equal then the two GEPs also don't alias each other. This is primarily needed for the follow up patch that analyses NoAlias'ing PHI nodes. Part 1/2 of fix for PR13564. llvm-svn: 163317	2012-09-06 14:31:51 +00:00
Manman Ren	f3fedb6935	JumpThreading: when default destination is the destination of some cases in a switch, make sure we include the value for the cases when calculating edge value from switch to the default destination. rdar://12241132 llvm-svn: 163270	2012-09-05 23:45:58 +00:00
Roman Divacky	ad06cee239	Stop casting away const qualifier needlessly. llvm-svn: 163258	2012-09-05 22:26:57 +00:00
Benjamin Kramer	6c2649ca4e	Switch BasicAliasAnalysis' cache to SmallDenseMap. It relies on clear() being fast and the cache rarely has more than 1 or 2 elements, so give it an inline capacity and always shrink it back down in case it grows. DenseMap will grow to 64 buckets which makes clear() a lot slower. llvm-svn: 163215	2012-09-05 16:49:37 +00:00
Bob Wilson	01cfbfe9d0	Be conservative about allocations that may alias the accessed pointer. If an allocation has a must-alias relation to the access pointer, we treat it as a Def. Otherwise, without this check, the code here was just skipping over the allocation call and ignoring it. I noticed this by inspection and don't have a specific testcase that it breaks, but it seems like we need to treat a may-alias allocation as a Clobber. llvm-svn: 163127	2012-09-04 03:30:13 +00:00
Bob Wilson	dcc54decd5	Fix more fallout from r158919, similar to PR13547. This code used to only handle malloc-like calls, which do not read memory. r158919 changed it to check isNoAliasFn(), which includes strdup-like and realloc-like calls, but it was not checking for dependencies on the memory read by those calls. llvm-svn: 163106	2012-09-03 05:15:15 +00:00
Benjamin Kramer	e7e5235726	Clean up ProfileDataLoader a bit. - Overloading operator<< for raw_ostream and pointers is dangerous, it alters the behavior of code that includes the header. - Remove unused ID. - Use LLVM's byte swapping helpers instead of a hand-coded. - Make ReadProfilingData work directly on a pointer. No functionality change. llvm-svn: 162992	2012-08-31 12:43:07 +00:00
Bill Wendling	5aed004cf1	Cleanups due to feedback. No functionality change. Patch by Alistair. llvm-svn: 162979	2012-08-31 05:18:31 +00:00
Benjamin Kramer	8bcc971174	Make MemoryBuiltins aware of TargetLibraryInfo. This disables malloc-specific optimization when -fno-builtin (or -ffreestanding) is specified. This has been a problem for a long time but became more severe with the recent memory builtin improvements. Since the memory builtin functions are used everywhere, this required passing TLI in many places. This means that functions that now have an optional TLI argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead mallocs anymore if the TLI argument is missing. I've updated most passes to do the right thing. Fixes PR13694 and probably others. llvm-svn: 162841	2012-08-29 15:32:21 +00:00
Manman Ren	abbb01abea	Profile: set branch weight metadata with data generated from profiling. This patch implements ProfileDataLoader which loads profile data generated by -insert-edge-profiling and updates branch weight metadata accordingly. Patch by Alastair Murray. llvm-svn: 162799	2012-08-28 22:21:25 +00:00
Hongbin Zheng	14c05c409a	Remove the the block_node_iterator of Region, replace it by the block_iterator. llvm-svn: 162672	2012-08-27 13:49:24 +00:00
Richard Smith	228e6d4cf3	Fix integer undefined behavior due to signed left shift overflow in LLVM. Reviewed offline by chandlerc. llvm-svn: 162623	2012-08-24 23:29:28 +00:00
Manman Ren	cf10446ffa	BranchProb: modify the definition of an edge in BranchProbabilityInfo to handle the case of multiple edges from one block to another. A simple example is a switch statement with multiple values to the same destination. The definition of an edge is modified from a pair of blocks to a pair of PredBlock and an index into the successors. Also set the weight correctly when building SelectionDAG from LLVM IR, especially when converting a Switch. IntegersSubsetMapping is updated to calculate the weight for each cluster. llvm-svn: 162572	2012-08-24 18:14:27 +00:00
Richard Smith	c621af1f60	Fix floating-point divide by zero, in a case where the value was not going to be used anyway. llvm-svn: 162518	2012-08-24 00:31:45 +00:00
Benjamin Kramer	f29db275b2	Reduce duplicated hash map lookups. llvm-svn: 162362	2012-08-22 15:37:57 +00:00
Benjamin Kramer	34764fe2e4	MemoryBuiltins: Properly guard ObjectSizeOffsetVisitor against cycles in the IR. The previous fix only checked for simple cycles, use a set to catch longer cycles too. Drop the broken check from the ObjectSizeOffsetEvaluator. The BoundsChecking pass doesn't have to deal with invalid IR like InstCombine does. llvm-svn: 162120	2012-08-17 19:26:41 +00:00
Benjamin Kramer	4901f0d2a2	Guard MemoryBuiltins against self-looping GEPs, which can occur in unreachable code due to constant propagation. Fixes PR13621. llvm-svn: 162098	2012-08-17 14:16:37 +00:00
Bill Wendling	e1c54262f4	Set the branch probability of branching to the 'normal' destination of an invoke instruction to something absurdly high, while setting the probability of branching to the 'unwind' destination to the bare minimum. This should set cause the normal destination's invoke blocks to be moved closer to the invoke. PR13612 llvm-svn: 161944	2012-08-15 12:22:35 +00:00
Nadav Rotem	5d4e205874	MemoryDependenceAnalysis attempts to find the first memory dependency for function calls. Currently, if GetLocation reports that it did not find a valid pointer (this is the case for volatile load/stores), we ignore the result. This patch adds code to handle the cases where we did not obtain a valid pointer. rdar://11872864 PR12899 llvm-svn: 161802	2012-08-13 23:03:43 +00:00
Benjamin Kramer	c99d0e9186	PR13095: Give an inline cost bonus to functions using byval arguments. We give a bonus for every argument because the argument setup is not needed anymore when the function is inlined. With this patch we interpret byval arguments as a compact representation of many arguments. The byval argument setup is implemented in the backend as an inline memcpy, so to model the cost as accurately as possible we take the number of pointer-sized elements in the byval argument and give a bonus of 2 instructions for every one of those. The bonus is capped at 8 elements, which is the number of stores at which the x86 backend switches from an expanded inline memcpy to a real memcpy. It would be better to use the real memcpy threshold from the backend, but it's not available via TargetData. This change brings the performance of c-ray in line with gcc 4.7. The included test case tries to reproduce the c-ray problem to catch regressions for this benchmark early, its performance is dominated by the inline decision of a specific call. This only has a small impact on most code, more on x86 and arm than on x86_64 due to the way the ABI works. When building LLVM for x86 it gives a small inline cost boost to virtually any function using StringRef or STL allocators, but only a 0.01% increase in overall binary size. The size of gcc compiled by clang actually shrunk by a couple bytes with this patch applied, but not significantly. llvm-svn: 161413	2012-08-07 11:13:19 +00:00
Chandler Carruth	2f6cf4884c	Fix PR13412, a nasty miscompile due to the interleaved instsimplify+inline strategy. The crux of the problem is that instsimplify was reasonably relying on an invariant that is true within any single function, but is no longer true mid-inline the way we use it. This invariant is that an argument pointer != a local (alloca) pointer. The fix is really light weight though, and allows instsimplify to be resiliant to these situations: when checking the relation ships to function arguments, ensure that the argumets come from the same function. If they come from different functions, then none of these assumptions hold. All credit to Benjamin Kramer for coming up with this clever solution to the problem. llvm-svn: 161410	2012-08-07 10:59:59 +00:00
Hongbin Zheng	bb1d209210	Implement the block_iterator of Region based on df_iterator. llvm-svn: 161177	2012-08-02 14:20:02 +00:00
Nick Lewycky	fb78083b1c	Stay rational; don't assert trying to take the square root of a negative value. If it's negative, the loop is already proven to be infinite. Fixes PR13489! llvm-svn: 161107	2012-08-01 09:14:36 +00:00
Nadav Rotem	77f1b9c477	When constant folding GEP expressions, keep the address space information of pointers. Together with Ran Chachick <ran.chachick@intel.com> llvm-svn: 160954	2012-07-30 07:25:20 +00:00
Nuno Lopes	85591f899d	fix PR13390: do not loop forever with self-referencing self instructions llvm-svn: 160876	2012-07-27 18:21:15 +00:00
Nuno Lopes	f0626f2205	revert r160742: it's breaking CMake build original commit msg: MemoryBuiltins: add support to determine the size of strdup'ed non-constant strings llvm-svn: 160751	2012-07-25 18:49:28 +00:00
Nuno Lopes	f0441e04bd	MemoryBuiltins: add support to determine the size of strdup'ed non-constant strings llvm-svn: 160742	2012-07-25 17:29:22 +00:00
Duncan Sands	0b875a0c29	When folding a load from a global constant, if the load started in the middle of an array element (rather than at the beginning of the element) and extended into the next element, then the load from the second element was being handled wrong due to incorrect updating of the notion of which byte to load next. This fixes PR13442. Thanks to Chris Smowton for reporting the problem, analyzing it and providing a fix. llvm-svn: 160711	2012-07-25 09:14:54 +00:00
Nuno Lopes	2a4b09c9de	teach objectsize about strdup() and strndup() llvm-svn: 160676	2012-07-24 16:28:13 +00:00
Sylvestre Ledru	35521e2310	Fix a typo (the the => the) llvm-svn: 160621	2012-07-23 08:51:15 +00:00
Nuno Lopes	705141d4df	baby steps toward fixing some problems with inbound GEPs that overflow, as discussed 2 months ago or so. Make sure we do not emit index computations with NSW flags so that we dont get an undef value if the GEP overflows llvm-svn: 160589	2012-07-20 23:07:40 +00:00
Benjamin Kramer	5be8f60126	Remove unused private member variables uncovered by the recent changes to clang's -Wunused-private-field. llvm-svn: 160583	2012-07-20 22:05:57 +00:00
Chandler Carruth	36e2ecf528	Move llvm/Support/TypeBuilder.h -> llvm/TypeBuilder.h. This completes the move of *Builder classes into the Core library. No uses of this builder in Clang or DragonEgg I could find. If there is a desire to have an IR-building-support library that contains all of these builders, that can be easily added, but currently it seems likely that these add no real overhead to VMCore. llvm-svn: 160243	2012-07-15 23:45:24 +00:00
Andrew Trick	653513b8dd	LSR Fix: check SCEV expression safety before expansion. All SCEV expressions used by LSR formulae must be safe to expand. i.e. they may not contain UDiv unless we can prove nonzero denominator. Fixes PR11356: LSR hoists UDiv. llvm-svn: 160205	2012-07-13 23:33:10 +00:00
Andrew Trick	ee76065b7a	IVUsers should only generate SCEV's for values that are safe to speculate. This allows SCEVExpander to run on the IV expressions. This codifies an assumption made by LSR to complete the fix for PR11356, but I haven't been able to generate a separate unit test for this part. I'm adding it as an extra safety check. llvm-svn: 160204	2012-07-13 23:33:05 +00:00
Andrew Trick	365e31c36c	Factor SCEV traversal code so I can use it elsewhere. No functionality. llvm-svn: 160203	2012-07-13 23:33:03 +00:00
Dan Gohman	3d1512384f	Delete code for folding undefs in ScalarEvolution. It's invalid in obscure ways, and it isn't actually important in the real world. llvm-svn: 159969	2012-07-09 23:51:20 +00:00
Nuno Lopes	0d44a50426	PHINode::hasConstantValue(): return undef if the PHI is fully recursive. Thanks Duncan for the idea llvm-svn: 159687	2012-07-03 21:15:40 +00:00
Nuno Lopes	9291ff4078	fold PHI nodes in SizeOffsetEvaluator whenever possible. Unfortunately this change requires the cache map to hold WeakVHs instead llvm-svn: 159667	2012-07-03 17:13:25 +00:00
Benjamin Kramer	e2ef47c145	Reduce use list thrashing by using DenseMap's find_as for maps with ValueHandle keys. No functionality change. llvm-svn: 159497	2012-06-30 22:37:15 +00:00
Nuno Lopes	674acc12d0	RefreshCallGraph: ignore 'invoke intrinsic'. IntrinsicInst doesnt not recognize invoke, and shouldnt at this point, since the rest of LLVM codebase doesnt expect invoke of intrinsics llvm-svn: 159441	2012-06-29 17:49:32 +00:00
Bill Wendling	098d906dbb	Update the CMake files. llvm-svn: 159417	2012-06-29 09:01:47 +00:00
Bill Wendling	f799efdedc	The DIBuilder class is just a wrapper around debug info creation (a.k.a. MDNodes). The module doesn't belong in Analysis. Move it to the VMCore instead. llvm-svn: 159414	2012-06-29 08:32:07 +00:00
Nick Lewycky	474112d82c	If the step value is a constant zero, the loop isn't going to terminate. Fixes the assert reported in PR13228! llvm-svn: 159393	2012-06-28 23:44:57 +00:00
Nuno Lopes	181d67ecb1	MemoryBuiltins: - recognize C++ new(std::nothrow) friends - ignore ExtractElement and ExtractValue instructions in size/offset analysis (all easy cases are probably folded away before we get here) - also recognize realloc as noalias llvm-svn: 159356	2012-06-28 16:34:03 +00:00
Nuno Lopes	8650fb8e0e	make LazyValueInfo analyze the default case of switch statements (we know that in the default branch the value cannot be any of the switch cases) llvm-svn: 159353	2012-06-28 16:13:37 +00:00
Nuno Lopes	e6e049020b	make LVI::getEdgeValue() always intersect the constraints of the edge with the range of the block. Previously it was only performing the intersection for a few cases, thus losing precision llvm-svn: 159320	2012-06-28 01:16:18 +00:00
Bill Wendling	3b2ab9eaaa	Fix cmake failure from moving files around. llvm-svn: 159314	2012-06-28 00:18:12 +00:00
Bill Wendling	e38859dc8e	Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and include/llvm/Analysis/DebugInfo.h to include/llvm/DebugInfo.h. The reasoning is because the DebugInfo module is simply an interface to the debug info MDNodes and has nothing to do with analysis. llvm-svn: 159312	2012-06-28 00:05:13 +00:00
Bill Wendling	3b70d784a2	Reduce indentation in function. Rearrange some methods. No functionality change. llvm-svn: 159239	2012-06-26 23:22:18 +00:00
Bill Wendling	e02a1f8cf2	Revamp how debugging information is emitted for debug info objects. It's not necessary for each DI class to have its own copy of `print' and `dump'. Instead, just give DIDescriptor those methods and have it call the appropriate debugging printing routine based on the type of the debug information. llvm-svn: 159237	2012-06-26 22:57:33 +00:00
Andrew Trick	fb2ba3e1cb	Enable the new LoopInfo algorithm by default. The primary advantage is that loop optimizations will be applied in a stable order. This helps debugging and unit test creation. It is also a better overall implementation without pathologically bad performance on deep functions. On large functions (llvm-stress --size=200000 \| opt -loops) Before: 0.1263s After: 0.0225s On deep functions (after tweaking llvm-stress, thanks Nadav): Before: 0.2281s After: 0.0227s See r158790 for more comments. The loop tree is now consistently generated in forward order, but loop passes are applied in reverse order over the program. If we have a loop optimization that prefers forward order, that can easily be achieved by adding a different type of LoopPassManager. llvm-svn: 159183	2012-06-26 04:11:38 +00:00
Andrew Trick	fecf937938	Remove unnecessary FIXME llvm-svn: 159182	2012-06-26 04:11:34 +00:00
Nuno Lopes	9ecc8761bc	check for the NoAlias attribute through CallSite llvm-svn: 159145	2012-06-25 16:17:54 +00:00
NAKAMURA Takumi	704de074b8	llvm/lib: [CMake] Add explicit dependency to intrinsics_gen. llvm-svn: 159112	2012-06-24 13:32:01 +00:00
Nuno Lopes	15dbcb4537	simplify code from previous commits (Thanks Duncan) llvm-svn: 158999	2012-06-22 15:50:53 +00:00
Nuno Lopes	9792d68381	remove extractMallocCallFromBitCast, since it was tailor maded for its sole user. Update GlobalOpt accordingly. llvm-svn: 158952	2012-06-22 00:25:01 +00:00
Nuno Lopes	dc6085e52d	Add support for invoke to the MemoryBuiltin analysid. Update comments accordingly. Make instcombine remove useless invokes to C++'s 'new' allocation function (test attached). llvm-svn: 158937	2012-06-21 21:25:05 +00:00
Nuno Lopes	f06b731fed	fix build in C++11 mode. Thanks to Chandler for pointing out the problem. llvm-svn: 158928	2012-06-21 18:38:26 +00:00
Nuno Lopes	a6aa3d3b5f	hopefully fix the buildbots: some tests have wrong definitions of malloc and were crashing this code on 64 bits machines llvm-svn: 158923	2012-06-21 16:47:58 +00:00
Nuno Lopes	55fff83422	refactor the MemoryBuiltin analysis: - provide more extensive set of functions to detect library allocation functions (e.g., malloc, calloc, strdup, etc) - provide an API to compute the size and offset of an object pointed by Move a few clients (GVN, AA, instcombine, ...) to the new API. This implementation is a lot more aggressive than each of the custom implementations being replaced. Patch reviewed by Nick Lewycky and Chandler Carruth, thanks. llvm-svn: 158919	2012-06-21 15:45:28 +00:00
Andrew Trick	ff2ed7b687	A new algorithm for computing LoopInfo. Temporarily disabled. -stable-loops enables a new algorithm for generating the Loop forest. It differs from the original algorithm in a few respects: - Not determined by use-list order. - Initially guarantees RPO order of block and subloops. - Linear in the number of CFG edges. - Nonrecursive. I didn't want to change the LoopInfo API yet, so the block lists are still inclusive. This seems strange to me, and it means that building LoopInfo is not strictly linear, but it may not be a problem in practice. At least the block lists start out in RPO order now. In the future we may add an attribute or wrapper analysis that allows other passes to assume RPO order. The primary motivation of this work was not to optimize LoopInfo, but to allow reproducing performance issues by decomposing the compilation stages. I'm often unable to do this with the current LoopInfo, because the loop tree order determines Loop pass order. Serializing the IR tends to invert the order, which reverses the optimization order. This makes it nearly impossible to debug interdependent loop optimizations such as LSR. I also believe this will provide more stable performance results across time. llvm-svn: 158790	2012-06-20 05:23:33 +00:00
Andrew Trick	cda51d430d	Move the implementation of LoopInfo into LoopInfoImpl.h. The implementation only needs inclusion from LoopInfo.cpp and MachineLoopInfo.cpp. Clients of the interface should only include the interface. This makes the interface readable and speeds up rebuilds after modifying the implementation. llvm-svn: 158787	2012-06-20 03:42:09 +00:00
Benjamin Kramer	009b1c1cf1	Round 2 of dead private variable removal. LLVM is now -Wunused-private-field clean except for - lib/MC/MCDisassembler/Disassembler.h. Not sure why it keeps all those unaccessible fields. - gtest. llvm-svn: 158096	2012-06-06 19:47:08 +00:00
Benjamin Kramer	bde9176663	Fix typos found by http://github.com/lyda/misspell-check llvm-svn: 157885	2012-06-02 10:20:22 +00:00
Eric Christopher	1cf3338bb4	Add support for enum forward declarations. Part of rdar://11570854 llvm-svn: 157786	2012-06-01 00:22:32 +00:00
Benjamin Kramer	406a2db1f6	Make sure that we're dealing with a binary SCEVExpr when simplifying. llvm-svn: 157704	2012-05-30 18:42:43 +00:00
Benjamin Kramer	50b26ebb2b	Teach SCEV's icmp simplification logic that a-b == 0 is equivalent to a == b. This also required making recursive simplifications until nothing changes or a hard limit (currently 3) is hit. With the simplification in place indvars can canonicalize loops of the form for (unsigned i = 0; i < a-b; ++i) into for (unsigned i = 0; i != a-b; ++i) which used to fail because SCEV created a weird umax expr for the backedge taken count. llvm-svn: 157701	2012-05-30 18:32:23 +00:00
Andrew Trick	a3f9043196	SCEV: Handle a corner case reducing AddRecExpr * AddRecExpr If integer overflow causes one of the terms to reach zero, that can force the entire expression to zero. Fixes PR12929: cast<Ty>() argument of incompatible type llvm-svn: 157673	2012-05-30 03:35:20 +00:00
Andrew Trick	946f76bf33	Reformat the loop that does AddRecExpr * AddRecExpr reduction. No functionality. llvm-svn: 157672	2012-05-30 03:35:17 +00:00
Craig Topper	9520719b9b	Mark some static arrays as const. llvm-svn: 157377	2012-05-24 06:35:32 +00:00
Eric Christopher	c49643586b	Add support for C++11 enum classes in llvm. Part of rdar://11496790 llvm-svn: 157303	2012-05-23 00:09:20 +00:00
Andrew Trick	a7a3de1bcf	LSR fix: add a missing phi check during IV hoisting. Fixes PR12898: SCEVExpander crash. llvm-svn: 157263	2012-05-22 17:39:59 +00:00
Eric Christopher	b5cf66cda2	Actually support DW_TAG_rvalue_reference_type that we were trying to generate out of the front end. rdar://11479676 llvm-svn: 157094	2012-05-19 01:36:37 +00:00
Andrew Trick	7fa4e0fea6	SCEV: Add MarkPendingLoopPredicates to avoid recursive isImpliedCond. getUDivExpr attempts to simplify by checking for overflow. isLoopEntryGuardedByCond then evaluates the loop predicate which may lead to the same getUDivExpr causing endless recursion. Fixes PR12868: clang 3.2 segmentation fault. llvm-svn: 157092	2012-05-19 00:48:25 +00:00
Nuno Lopes	ac59380dfd	allow LazyValueInfo::getEdgeValue() to reason about multiple edges from the same switch instruction by doing union of ranges (which may still be conservative, but it's more aggressive than before) llvm-svn: 157071	2012-05-18 21:02:10 +00:00
Eric Christopher	5d5338fb81	Clarify comment. llvm-svn: 157033	2012-05-18 00:16:22 +00:00
Nuno Lopes	097e37da0e	minor simplification in the call to ConstantRange constructor llvm-svn: 157024	2012-05-17 23:04:08 +00:00
Bill Wendling	e065dc8d8d	Remove extraneous ';'. llvm-svn: 157011	2012-05-17 20:27:58 +00:00
Nuno Lopes	c2a170e26e	reuse the result of some expensive computations in getSignExtendExpr() and getZeroExtendExpr() this gives a speedup of > 80 in a debug build in the test case of PR12825 (php_sha512_crypt_r) llvm-svn: 156849	2012-05-15 20:20:14 +00:00
Nuno Lopes	ab5c924006	minor simplification to code: Ty is already a SCEV type; don't need to run getEffectiveSCEVType() twice llvm-svn: 156823	2012-05-15 15:44:38 +00:00
Chad Rosier	a968caf8e0	Move the capture analysis from MemoryDependencyAnalysis to a more general place so that it can be reused in MemCpyOptimizer. This analysis is needed to remove an unnecessary memcpy when returning a struct into a local variable. rdar://11341081 PR12686 llvm-svn: 156776	2012-05-14 20:35:04 +00:00
Chad Rosier	10702d5f22	Hoist simpler checks above llvm::PointerMayBeCaptured. No functional change intended. llvm-svn: 156687	2012-05-12 00:43:40 +00:00
Chad Rosier	8244b1dc7e	Fix intendation. llvm-svn: 156589	2012-05-10 23:38:07 +00:00
Dan Gohman	ed7c24e2d9	Teach DeadStoreElimination to eliminate exit-block stores with phi addresses. llvm-svn: 156558	2012-05-10 18:57:38 +00:00
Dan Gohman	0291246ce7	Rewrite ScalarEvolution::hasOperand to use an explicit worklist instead of recursion, to avoid excessive stack usage on deep expressions. llvm-svn: 156554	2012-05-10 17:21:30 +00:00
Chandler Carruth	8880325a92	Rename the Region::block_iterator to Region::block_node_iterator, and add a new Region::block_iterator which actually iterates over the basic blocks of the region. The old iterator, now call 'block_node_iterator' iterates over RegionNodes which contain a single basic block. This works well with the GraphTraits-based iterator design, however most users actually want an iterator over the BasicBlocks inside these RegionNodes. Now the 'block_iterator' is a wrapper which exposes exactly this interface. Internally it uses the block_node_iterator to walk all nodes which are single basic blocks, but transparently unwraps the basic block to make user code simpler. While this patch is a bit of a wash, most of the updates are to internal users, not external users of the RegionInfo. I have an accompanying patch to Polly that is a strict simplification of every user of this interface, and I'm working on a pass that also wants the same simplified interface. This patch alone should have no functional impact. llvm-svn: 156202	2012-05-04 20:55:23 +00:00
Chandler Carruth	da7513a834	A pile of long over-due refactorings here. There are some very, very minor behavior changes with this, but nothing I have seen evidence of in the wild or expect to be meaningful. The real goal is unifying our logic and simplifying the interfaces. A summary of the changes follows: - Make 'callIsSmall' actually accept a callsite so it can handle intrinsics, and simplify callers appropriately. - Nuke a completely bogus declaration of 'callIsSmall' that was still lurking in InlineCost.h... No idea how this got missed. - Teach the 'isInstructionFree' about the various more intelligent 'free' heuristics that got added to the inline cost analysis during review and testing. This mostly surrounds int->ptr and ptr->int casts. - Switch most of the interesting parts of the inline cost analysis that were essentially computing 'is this instruction free?' to use the code metrics routine instead. This way we won't keep duplicating logic. All of this is motivated by the desire to allow other passes to compute a roughly equivalent 'cost' metric for a particular basic block as the inline cost analysis. Sadly, re-using the same analysis for both is really messy because only the actual inline cost analysis is ever going to go to the contortions required for simplification, SROA analysis, etc. llvm-svn: 156140	2012-05-04 00:58:03 +00:00
Nuno Lopes	d4cf35d775	remove calls to calloc if the allocated memory is not used (it was already being done for malloc) fix a few typos found by Chad in my previous commit llvm-svn: 156110	2012-05-03 22:08:19 +00:00
Nuno Lopes	d2b71e7fa9	add support for calloc to objectsize lowering llvm-svn: 156102	2012-05-03 21:19:58 +00:00
Duncan Sands	34c4869cf6	Just mark the sign bit as known zero, rather than any other irrelevant bits known zero in the LHS. Fixes PR12541. llvm-svn: 155818	2012-04-30 11:56:58 +00:00
Dan Gohman	1ccecdb2fd	Reapply r155682, making constant folding more consistent, with a fix to work properly with how the code handles all-undef PHI nodes. llvm-svn: 155721	2012-04-27 17:50:22 +00:00
NAKAMURA Takumi	6008dfdb70	Revert r155682, "Use ConstantExpr::getExtractElement when constant-folding vectors" It broke stage2 build. stage1/clang sometimes crashed. llvm-svn: 155699	2012-04-27 07:59:20 +00:00
Dan Gohman	90f3798f26	Use ConstantExpr::getExtractElement when constant-folding vectors instead of getAggregateElement. This has the advantage of being more consistent and allowing higher-level constant folding to procede even if an inner extract element cannot be folded. Make ConstantFoldInstruction call ConstantFoldConstantExpression on the instruction's operands, making it more consistent with ConstantFoldConstantExpression itself. This makes sure that ConstantExprs get TargetData-aware folding before being handed off as operands for further folding. This causes more expressions to be folded, but due to a known shortcoming in constant folding, this currently has the side effect of stripping a few more nuw and inbounds flags in the non-targetdata side of constant-fold-gep.ll. This is mostly harmless. This fixes rdar://11324230. llvm-svn: 155682	2012-04-27 00:54:36 +00:00
Chandler Carruth	aacb8a5809	Fix a crash on valid (if UB) bitcode that is produced for some global constants in C++11 mode. I have no idea why it required such particular circumstances to get here, the code seems clearly to rely upon unchecked assumptions. Specifically, when we decide to form an index into a struct type, we may have gone through (at least one) zero-length array indexing round, which would have left the offset un-adjusted, and thus not necessarily valid for use when indexing the struct type. This is just an canonicalization step, so the correct thing is to refuse to canonicalize nonsensical GEPs of this form. Implemented, and test case added. Fixes PR12642. Pair debugged and coded with Richard Smith. =] I credit him with most of the debugging, and preventing me from writing the wrong code. llvm-svn: 155466	2012-04-24 18:42:47 +00:00
Eric Christopher	27deb265f9	Allow forward declarations to take a context. This helps the debugger find forward declarations in the context that the actual definition will occur. rdar://11291658 llvm-svn: 155380	2012-04-23 19:00:11 +00:00
Benjamin Kramer	e364d195e9	Revert "SCEV: When expanding a GEP the final addition to the base pointer has NUW but not NSW." This isn't right either, reverting for now. llvm-svn: 154910	2012-04-17 06:33:57 +00:00
Chandler Carruth	7ae90d4d2d	Add two statistics to help track how we are computing the inline cost. Yea, 'NumCallerCallersAnalyzed' isn't a great name, suggestions welcome. llvm-svn: 154492	2012-04-11 10:15:10 +00:00
Andrew Trick	4442bfe559	Fix 12513: Loop unrolling breaks with indirect branches. Take this opportunity to generalize the indirectbr bailout logic for loop transformations. CFG transformations will never get indirectbr right, and there's no point trying. llvm-svn: 154386	2012-04-10 05:14:42 +00:00
Chandler Carruth	28192c9398	Fix ValueTracking to conclude that debug intrinsics are safe to speculate. Without this, loop rotate (among many other places) would suddenly stop working in the presence of debug info. I found this looking at loop rotate, and have augmented its tests with a reduction out of a very hot loop in yacr2 where failing to do this rotation costs sometimes more than 10% in runtime performance, perturbing numerous downstream optimizations. This should have no impact on performance without debug info, but the change in performance when debug info is enabled can be extreme. As a consequence (and this how I got to this yak) any profiling of performance problems should be treated with deep suspicion -- they may have been wildly innacurate of debug info was enabled for profiling. =/ Just a heads up. llvm-svn: 154263	2012-04-07 19:22:18 +00:00
Benjamin Kramer	e1f4ca1b0f	SCEV: When expanding a GEP the final addition to the base pointer has NUW but not NSW. Found by inspection. llvm-svn: 154262	2012-04-07 17:19:26 +00:00
David Chisnall	c1c9cdab23	Reintroduce InlineCostAnalyzer::getInlineCost() variant with explicit callee parameter until we have a more sensible API for doing the same thing. Reviewed by Chandler. llvm-svn: 154180	2012-04-06 17:27:41 +00:00
Rafael Espindola	ba0a6cabb8	Always compute all the bits in ComputeMaskedBits. This allows us to keep passing reduced masks to SimplifyDemandedBits, but know about all the bits if SimplifyDemandedBits fails. This allows instcombine to simplify cases like the one in the included testcase. llvm-svn: 154011	2012-04-04 12:51:34 +00:00
Eric Christopher	34164196af	Add a line number for the scope of the function (starting at the first brace) so that we get more accurate line number information about the declaration of a given function and the line where the function first starts. Part of rdar://11026482 llvm-svn: 153916	2012-04-03 00:43:49 +00:00
Rafael Espindola	80c540e656	Teach CodeGen's version of computeMaskedBits to understand the range metadata. This is the CodeGen equivalent of r153747. I tested that there is not noticeable performance difference with any combination of -O0/-O2 /-g when compiling gcc as a single compilation unit. llvm-svn: 153817	2012-03-31 18:14:00 +00:00
Chandler Carruth	1a4cc6cc9f	Fix a typo reported in IRC by someone reviewing this code. llvm-svn: 153815	2012-03-31 13:18:09 +00:00
Chandler Carruth	edd2826f3e	Remove a bunch of empty, dead, and no-op methods from all of these interfaces. These methods were used in the old inline cost system where there was a persistent cache that had to be updated, invalidated, and cleared. We're now doing more direct computations that don't require this intricate dance. Even if we resume some level of caching, it would almost certainly have a simpler and more narrow interface than this. llvm-svn: 153813	2012-03-31 12:48:08 +00:00
Chandler Carruth	0539c071ea	Initial commit for the rewrite of the inline cost analysis to operate on a per-callsite walk of the called function's instructions, in breadth-first order over the potentially reachable set of basic blocks. This is a major shift in how inline cost analysis works to improve the accuracy and rationality of inlining decisions. A brief outline of the algorithm this moves to: - Build a simplification mapping based on the callsite arguments to the function arguments. - Push the entry block onto a worklist of potentially-live basic blocks. - Pop the first block off of the front of the worklist (for breadth-first ordering) and walk its instructions using a custom InstVisitor. - For each instruction's operands, re-map them based on the simplification mappings available for the given callsite. - Compute any simplification possible of the instruction after re-mapping, and store that back int othe simplification mapping. - Compute any bonuses, costs, or other impacts of the instruction on the cost metric. - When the terminator is reached, replace any conditional value in the terminator with any simplifications from the mapping we have, and add any successors which are not proven to be dead from these simplifications to the worklist. - Pop the next block off of the front of the worklist, and repeat. - As soon as the cost of inlining exceeds the threshold for the callsite, stop analyzing the function in order to bound cost. The primary goal of this algorithm is to perfectly handle dead code paths. We do not want any code in trivially dead code paths to impact inlining decisions. The previous metric was extremely flawed here, and would always subtract the average cost of two successors of a conditional branch when it was proven to become an unconditional branch at the callsite. There was no handling of wildly different costs between the two successors, which would cause inlining when the path actually taken was too large, and no inlining when the path actually taken was trivially simple. There was also no handling of the code path, only the immediate successors. These problems vanish completely now. See the added regression tests for the shiny new features -- we skip recursive function calls, SROA-killing instructions, and high cost complex CFG structures when dead at the callsite being analyzed. Switching to this algorithm required refactoring the inline cost interface to accept the actual threshold rather than simply returning a single cost. The resulting interface is pretty bad, and I'm planning to do lots of interface cleanup after this patch. Several other refactorings fell out of this, but I've tried to minimize them for this patch. =/ There is still more cleanup that can be done here. Please point out anything that you see in review. I've worked really hard to try to mirror at least the spirit of all of the previous heuristics in the new model. It's not clear that they are all correct any more, but I wanted to minimize the change in this single patch, it's already a bit ridiculous. One heuristic that is not yet mirrored is to allow inlining of functions with a dynamic alloca if the caller has a dynamic alloca. I will add this back, but I think the most reasonable way requires changes to the inliner itself rather than just the cost metric, and so I've deferred this for a subsequent patch. The test case is XFAIL-ed until then. As mentioned in the review mail, this seems to make Clang run about 1% to 2% faster in -O0, but makes its binary size grow by just under 4%. I've looked into the 4% growth, and it can be fixed, but requires changes to other parts of the inliner. llvm-svn: 153812	2012-03-31 12:42:41 +00:00
Rafael Espindola	53190539db	Add computeMaskedBitsLoad back, as it was the change to instsimplify that caused the slowdown last time. llvm-svn: 153747	2012-03-30 15:52:11 +00:00
Eric Christopher	c13fd6d1e1	Lowercase the tag name to match the rest of dwarf. llvm-svn: 153691	2012-03-29 21:35:05 +00:00
Eric Christopher	70e1bd8872	Add support for objc property decls according to the page at: http://llvm.org/docs/SourceLevelDebugging.html#objcproperty including type and DECL. Expand the metadata needed accordingly. rdar://11144023 llvm-svn: 153639	2012-03-29 08:42:56 +00:00
Rafael Espindola	5054ee82cc	Handle intrinsics in GlobalsModRef. Fixes pr12351. llvm-svn: 153604	2012-03-28 21:31:24 +00:00
Chad Rosier	e27081d348	Revert r153521 as it's causing large regressions on the nightly testers. Original commit message for r153521 (aka r153423): Use the new range metadata in computeMaskedBits and add a new optimization to instruction simplify that lets us remove an and when loding a boolean value. llvm-svn: 153587	2012-03-28 18:42:50 +00:00
Chad Rosier	8e6dbccd03	Reapply r153423; the original commit was fine. The failing test, distray, had undefined behavior, which Rafael was kind enough to fix. Original commit message for r153423: Use the new range metadata in computeMaskedBits and add a new optimization to instruction simplify that lets us remove an and when loding a boolean value. llvm-svn: 153521	2012-03-27 17:44:52 +00:00
Andrew Trick	7004e4b95e	SCEV fix: Handle loop invariant loads. Fixes PR11882: NULL dereference in ComputeLoadConstantCompareExitLimit. llvm-svn: 153480	2012-03-26 22:33:59 +00:00
Chad Rosier	08e57e5ccf	Revert r153423 as this is causing failures on our internal nightly testers. Original commit message: Use the new range metadata in computeMaskedBits and add a new optimization to instruction simplify that lets us remove an and when loading a boolean value. llvm-svn: 153452	2012-03-26 18:07:14 +00:00
Rafael Espindola	df9b4adb82	Use the new range metadata in computeMaskedBits and add a new optimization to instruction simplify that lets us remove an and when loding a boolean value. llvm-svn: 153423	2012-03-26 01:44:11 +00:00
Chandler Carruth	8059c84af1	Teach instsimplify how to simplify comparisons of pointers which are constant-offsets of a common base using the generic GEP-walking logic I added for computing pointer differences in the same situation. llvm-svn: 153419	2012-03-25 21:28:14 +00:00
Chandler Carruth	2741aae80b	Switch the pointer-difference simplification logic to only work with inbounds GEPs. This isn't really necessary for simplifying pointer differences, but I'm planning to re-use the same code to simplify pointer comparisons where it is necessary. Since real code almost exclusively uses inbounds GEPs, it doesn't seem worth it to support the extra complexity of turning it on and off. If anyone would like that back, feel free to shout. Note that instcombine will still catch any of these patterns. llvm-svn: 153418	2012-03-25 20:43:07 +00:00
Chandler Carruth	77e8bfbb5e	Try to harden the recursive simplification still further. This is again spotted by inspection, and I've crafted no test case that triggers it on my machine, but some of the windows builders are hitting what looks like memory corruption, so something is amiss here. This patch takes a more generalized approach to eliminating double-visits. Imagine code such as: %x = ... %y = add %x, 1 %z = add %x, %y You can imagine that if we simplify %x, we would add %y and %z to the list. If the use-chain order happens to cause us to add them in reverse order, we could pull %y off first, and simplify it, adding %z to the list. We now have %z on the list twice, and will reference it after it is deleted. Currently, all my test cases happen to not trigger this, likely due to the use-chain ordering, but there seems no guarantee that such a situation could not occur, so we should handle it correctly. Again, if anyone knows how to craft a testcase that actually triggers this, please let me know. llvm-svn: 153397	2012-03-24 22:34:26 +00:00
Chandler Carruth	e41fc73f08	Don't add the instruction about to be RAUW'ed and erased to the worklist. This can happen in theory when an instruction uses itself, such as a PHI node. This was spotted by inspection, and unfortunately I've not been able to come up with a test case that would trigger it. If anyone has ideas, let me know... llvm-svn: 153396	2012-03-24 22:34:23 +00:00
Chandler Carruth	cf1b585f60	Refactor the interface to recursively simplifying instructions to be tad bit simpler by handling a common case explicitly. Also, refactor the implementation to use a worklist based walk of the recursive users, rather than trying to use value handles to detect and recover from RAUWs during the recursive descent. This fixes a very subtle bug in the previous implementation where degenerate control flow structures could cause mutually recursive instructions (PHI nodes) to collapse in just such a way that From became equal to To after some amount of recursion. At that point, we hit the inf-loop that the assert at the top attempted to guard against. This problem is defined away when not using value handles in this manner. There are lots of comments claiming that the WeakVH will protect against just this sort of error, but they're not accurate about the actual implementation of WeakVHs, which do still track RAUWs. I don't have any test case for the bug this fixes because it requires running the recursive simplification on unreachable phi nodes. I've no way to either run this or easily write an input that triggers it. It was found when using instruction simplification inside the inliner when running over the nightly test-suite. llvm-svn: 153393	2012-03-24 21:11:24 +00:00
Eric Christopher	3c0d51661f	Take out the debug info probe stuff. It's making some changes to the PassManager annoying and should be reimplemented as a decorator on top of existing passes (as should the timing data). llvm-svn: 153305	2012-03-23 03:54:05 +00:00
Andrew Trick	6d1bbb8755	Cleanup IVUsers::addUsersIfInteresting. Keep the public interface clean, even though LLVM proper does not currently use it. llvm-svn: 153263	2012-03-22 17:47:33 +00:00
Chandler Carruth	3ffccb3802	Teach instsimplify to gracefully degrade in the presence of instructions not attched to a basic block or function. There are conservatively correct answers in these cases, and this makes the analysis more useful in contexts where we have a partially formed bit of IR. I don't have any way to test this directly... suggestions welcome here, but I'm not seeing anything sadly. I only found this using a subsequent patch to the inliner which runs instsimplify on partially inlined instructions, and even then only on a quite large program. I never got a reasonable testcase out of it, and anything I do get is likely to be quite fragile due to requiring an interaction of two different passes, and the only result being a segfault if it goes wrong. llvm-svn: 153176	2012-03-21 10:58:47 +00:00
Andrew Trick	9c45706baf	LSR: teach isSimplifiedLoopNest to handle PHI IVUsers. llvm-svn: 153132	2012-03-20 21:24:44 +00:00
Andrew Trick	3660735e18	LSR: fix IVUsers isSimplifiedLoopNest to perform a full domtree walk instead of skipping the current loop. My prior fix was incomplete because of an overzealous compile-time optimization: Better fix for: <rdar://problem/11049788> Segmentation fault: 11 in LoopStrengthReduce llvm-svn: 153131	2012-03-20 21:24:40 +00:00
Nick Lewycky	fa30607eca	Factor out the multiply analysis code in ComputeMaskedBits and apply it to the overflow checking multiply intrinsic as well. Add a test for this, updating the test from grep to FileCheck. llvm-svn: 153028	2012-03-18 23:28:48 +00:00
Chandler Carruth	d7a5f2adb0	Start removing the use of an ad-hoc 'never inline' set and instead directly query the function information which this set was representing. This simplifies the interface of the inline cost analysis, and makes the always-inline pass significantly more efficient. Previously, always-inline would first make a single set of every function in the module except those marked with the always-inline attribute. It would then query this set at every call site to see if the function was a member of the set, and if so, refuse to inline it. This is quite wasteful. Instead, simply check the function attribute directly when looking at the callsite. The normal inliner also had similar redundancy. It added every function in the module with the noinline attribute to its set to ignore, even though inside the cost analysis function we already tested the noinline attribute and produced the same result. The only tricky part of removing this is that we have to be able to correctly remove only the functions inlined by the always-inline pass when finalizing, which requires a bit of a hack. Still, much less of a hack than the set of all non-always-inline functions was. While I was touching this function, I switched a heavy-weight set to a vector with sort+unique. The algorithm already had a two-phase insert and removal pattern, we were just needlessly paying the uniquing cost on every insert. This probably speeds up some compiles by a small amount (-O0 compiles with lots of always-inline, so potentially heavy libc++ users), but I've not tried to measure it. I believe there is no functional change here, but yell if you spot one. None are intended. Finally, the direction this is going in is to greatly simplify the inline cost query interface so that we can replace its implementation with a much more clever one. Along the way, all the APIs get simplified, so it seems incrementally good. llvm-svn: 152903	2012-03-16 06:10:13 +00:00
Chandler Carruth	3c256fbf2d	Pull the implementation of the code metrics out of the inline cost analysis implementation. The header was already separated. Also cleanup all the comments in the header to follow a nice modern doxygen form. There is still plenty of cruft here, but some of that will fall out in subsequent refactorings and this was an easy step in the right direction. No functionality changed here. llvm-svn: 152898	2012-03-16 05:51:52 +00:00
Andrew Trick	070e540a3e	LSR fix: Add isSimplifiedLoopNest to IVUsers analysis. Only record IVUsers that are dominated by simplified loop headers. Otherwise SCEVExpander will crash while looking for a preheader. I previously tried to work around this in LSR itself, but that was insufficient. This way, LSR can continue to run if some uses are not in simple loops, as long as we don't attempt to analyze those users. Fixes <rdar://problem/11049788> Segmentation fault: 11 in LoopStrengthReduce llvm-svn: 152892	2012-03-16 03:16:56 +00:00
Eric Christopher	a4a0cf8394	Do the right thing on NULL uint64 fields. Patch by Clemens Hammacher! Fixes PR12243 llvm-svn: 152880	2012-03-16 00:21:54 +00:00
Duncan Sands	bd415dec4e	Type sizes and fields offsets inside structs are unsigned. This is a highly theoretical fix since it only matters for types with >= 2^63 bits (!) and also only matters if pointers have more than 64 bits, which is not supported anyway. llvm-svn: 152831	2012-03-15 20:14:42 +00:00
Chandler Carruth	6d64bd4639	Make the swap code here a bit more obvious what its doing... We're essentially sorting the pair's arguments. I'd love to actually call sort here, but I'm just not that crazy. ;] llvm-svn: 152764	2012-03-15 00:55:51 +00:00
Chandler Carruth	899e439aea	Don't assume that the arguments are processed in some particular order. This appears to not be the case with dragonegg at least in some contexts. Hopefully will fix the bootstrap assert failure there. llvm-svn: 152763	2012-03-15 00:50:21 +00:00
Chandler Carruth	5b6ca5ca37	Remove all remnants of partial specialization in the cost computation side of things. This is all dead code. llvm-svn: 152759	2012-03-15 00:29:08 +00:00
Chandler Carruth	4d1d34fbfc	Extend the inline cost calculation to account for bonuses due to correlated pairs of pointer arguments at the callsite. This is designed to recognize the common C++ idiom of begin/end pointer pairs when the end pointer is a constant offset from the begin pointer. With the C-based idiom of a pointer and size, the inline cost saw the constant size calculation, and this provides the same level of information for begin/end pairs. In order to propagate this information we have to search for candidate operations on a pair of pointer function arguments (or derived from them) which would be simplified if the pointers had a known constant offset. Then the callsite analysis looks for such pointer pairs in the argument list, and applies the appropriate bonus. This helps LLVM detect that half of bounds-checked STL algorithms (such as hash_combine_range, and some hybrid sort implementations) disappear when inlined with a constant size input. However, it's not a complete fix due the inaccuracy of our cost metric for constants in general. I'm looking into that next. Benchmarks showed no significant code size change, and very minor performance changes. However, specific code such as hashing is showing significantly cleaner inlining decisions. llvm-svn: 152752	2012-03-14 23:19:53 +00:00
Chandler Carruth	a308955993	Refactor the inline cost bonus calculation for constants to use a worklist rather than a recursive call. No functionality changed. llvm-svn: 152706	2012-03-14 07:32:53 +00:00
Chris Lattner	87fa77bd8a	enhance jump threading to preserve TBAA information when PRE'ing loads, fixing rdar://11039258, an issue that came up when inspecting clang's bootstrapped codegen. llvm-svn: 152635	2012-03-13 18:07:41 +00:00
Duncan Sands	395ac42dd2	Generalize the "trunc(ptrtoint(x)) - trunc(ptrtoint(y)) -> trunc(ptrtoint(x-y))" optimization introduced by Chandler. llvm-svn: 152626	2012-03-13 14:07:05 +00:00
Duncan Sands	b8cee00841	Uniformize the InstructionSimplify interface by ensuring that all routines take a TargetLibraryInfo parameter. Internally, rather than passing TD, TLI and DT parameters around all over the place, introduce a struct for holding them. llvm-svn: 152623	2012-03-13 11:42:19 +00:00
Eli Friedman	c8cbd06947	Fix regression from r151466: an we can't replace uses of an instruction reachable from the entry block with uses of an instruction not reachable from the entry block. PR12231. llvm-svn: 152595	2012-03-13 01:06:07 +00:00
Chandler Carruth	e45781e673	Address some review comments from Duncan. This moves the iterative offset accumulation to use a boring APInt instead of ConstantExprs. I didn't go all the way to an 'int64_t' because I wanted APInt to handle any magic required to properly wrap the arithmetic when the pointer width is <64 bits. If there is a significant penalty from using APInt here, first off WTF, and secondly let me know and I'll do the math by hand. I've left one layer still operating w/ ConstantExpr because it makes the interface quite a bit simpler, and that one isn't iterative so has much lower cost. I suppose this may potentially speed up some strang compilation situations, but I don't really expect much. It should have no functional impact either way. llvm-svn: 152590	2012-03-13 00:06:15 +00:00
Chandler Carruth	a0796555e2	Teach instsimplify how to constant fold pointer differences. Typically instcombine has handled this, but pointer differences show up in several contexts where we would like to get constant folding, and cannot afford to run instcombine. Specifically, I'm working on improving the constant folding of arguments used in inline cost analysis with instsimplify. Doing this in instsimplify implies some algorithm changes. We have to handle multiple layers of all-constant GEPs because instsimplify cannot fold them into a single GEP the way instcombine can. Also, we're only interested in all-constant GEPs. The result is that this doesn't really replace the instcombine logic, it's just complimentary and focused on constant folding. Reviewed on IRC by Benjamin Kramer. llvm-svn: 152555	2012-03-12 11:19:31 +00:00
Stepan Dyatkovskiy	97b02fc1b3	llvm::SwitchInst Renamed methods caseBegin, caseEnd and caseDefault with case_begin, case_end, and case_default. Added some notes relative to case iterators. llvm-svn: 152532	2012-03-11 06:09:17 +00:00
Benjamin Kramer	71ff880ff9	Make helper static, so it can be inlined into its sole caller. llvm-svn: 152515	2012-03-10 22:41:06 +00:00
Bill Wendling	2bbb7945e7	As Duncan pointed out, pointers tend not to be in floating point format...for now. llvm-svn: 152499	2012-03-10 18:20:55 +00:00
Bill Wendling	0624d2a1ec	Make this transformation slightly less agressive and more correct. The 'CmpInst::isFalseWhenEqual' function returns 'false' for values other than simply equality. For instance, it returns 'false' for <= or >=. This isn't the correct behavior for this transformation, which is checking for strict equality and non-equality. It was causing the gcc.c-torture/execute/frame-address.c test to fail because it would completely (and incorrectly) optimize a whole function into a 'ret i32 0'. llvm-svn: 152497	2012-03-10 17:56:03 +00:00
Chandler Carruth	97f6f03c42	Refactor some methods to look through bitcasts and GEPs on pointers into a common collection of methods on Value, and share their implementation. We had two variations in two different places already, and I need the third variation for inline cost estimation. Reviewed by Duncan Sands on IRC, but further comments here welcome. llvm-svn: 152490	2012-03-10 08:39:09 +00:00
Nick Lewycky	fea3e00e09	Factor out the analysis of addition and subtraction in ComputeMaskedBits. Reuse it to analyze extractvalue(llvm.[us](add\|sub).with.overflow.*) intrinsics! llvm-svn: 152398	2012-03-09 09:23:50 +00:00
Chandler Carruth	783b7198b7	Undo a previous restriction on the inline cost calculation which Nick introduced. Specifically, there are cost reductions for all constant-operand icmp instructions against an alloca, regardless of whether the alloca will in fact be elligible for SROA. That means we don't want to abort the icmp reduction computation when we abort the SROA reduction computation. That in turn frees us from the need to keep a separate worklist and defer the ICmp calculations. Use this new-found freedom and some judicious function boundaries to factor the innards of computing the cost factor of any given instruction out of the loop over the instructions and into static helper functions. This greatly simplifies the code, and hopefully makes it more clear what is happening here. Reviewed by Eric Christopher. There is some concern that we'd like to ensure this doesn't get out of hand, and I plan to benchmark the effects of this change over the next few days along with some further fixes to the inline cost. llvm-svn: 152368	2012-03-09 02:49:36 +00:00
Stepan Dyatkovskiy	5b648afb4d	Taken into account Duncan's comments for r149481 dated by 2nd Feb 2012: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120130/136146.html Implemented CaseIterator and it solves almost all described issues: we don't need to mix operand/case/successor indexing anymore. Base iterator class is implemented as a template since it may be initialized either from "const SwitchInst" or from "SwitchInst". ConstCaseIt is just a read-only iterator. CaseIt is read-write iterator; it allows to change case successor and case value. Usage of iterator allows totally remove resolveXXXX methods. All indexing convertions done automatically inside the iterator's getters. Main way of iterator usage looks like this: SwitchInst SI = ... // intialize it somehow for (SwitchInst::CaseIt i = SI->caseBegin(), e = SI->caseEnd(); i != e; ++i) { BasicBlock BB = i.getCaseSuccessor(); ConstantInt *V = i.getCaseValue(); // Do something. } If you want to convert case number to TerminatorInst successor index, just use getSuccessorIndex iterator's method. If you want initialize iterator from TerminatorInst successor index, use CaseIt::fromSuccessorIndex(...) method. There are also related changes in llvm-clients: klee and clang. llvm-svn: 152297	2012-03-08 07:06:20 +00:00
Chandler Carruth	dd1637c393	Rotate two of the functions used to count bonuses for the inline cost analysis to be methods on the cost analysis's function info object instead of the code metrics object. These really are just users of the code metrics, they're building the information for the function's analysis. This is the first step of growing the amount of information we collect about a function in order to cope with pair-wise simplifications due to allocas. llvm-svn: 152283	2012-03-08 02:04:19 +00:00
Nick Lewycky	1d57ee341a	No functionality change. Type::isSized() can be expensive, so avoid calling it until after other inexpensive tests. llvm-svn: 152195	2012-03-07 02:27:53 +00:00
Eli Friedman	af3c6fe51e	A few more cases of missing masking in ComputeMaskedBits; found by inspection. llvm-svn: 152070	2012-03-05 23:22:40 +00:00
Eli Friedman	a8b75ac798	Make sure we don't return bits outside the mask in ComputeMaskedBits. PR12189. llvm-svn: 152066	2012-03-05 23:09:40 +00:00
Benjamin Kramer	d9d80b1dde	LVI: Recognize the form instcombine canonicalizes range checks into when forming constant ranges. This could probably be made a lot smarter, but this is a common case and doesn't require LVI to scan a lot of code. With this change CVP can optimize away the "shift == 0" case in Hashing.h that only gets hit when "shift" is in a range not containing 0. llvm-svn: 151919	2012-03-02 15:34:43 +00:00
Eli Friedman	0774902a00	Duncan pointed out that if the alignment isn't explicitly specified, it defaults to the ABI alignment. Given that, make this code a bit more aggressive in such cases. llvm-svn: 151584	2012-02-27 23:16:46 +00:00
Eli Friedman	8bc169c3c5	Teach BasicAA about the LLVM IR rules that allow reading past the end of an object given sufficient alignment. Fixes PR12098. llvm-svn: 151553	2012-02-27 20:46:07 +00:00
Rafael Espindola	09a4201d3c	Fix this assert. IP can point to an instruction with strange dominance properties (invoke). Just assert that the instruction we return dominates the insertion point. llvm-svn: 151511	2012-02-27 02:13:03 +00:00
Rafael Espindola	b660977c67	Don't call dominates on unreachable instructions. Should fix the dragonegg build. Testcase is still reducing. llvm-svn: 151474	2012-02-26 05:30:08 +00:00
Rafael Espindola	ae725715ef	And update the comment... llvm-svn: 151472	2012-02-26 02:36:56 +00:00
Rafael Espindola	fa75542078	Enable the assert that got all this dominator work started. llvm-svn: 151471	2012-02-26 02:29:18 +00:00
Rafael Espindola	94df267db3	Change the implementation of dominates(inst, inst) to one based on what the verifier does. This correctly handles invoke. Thanks to Duncan, Andrew and Chris for the comments. Thanks to Joerg for the early testing. llvm-svn: 151469	2012-02-26 02:19:19 +00:00
Nick Lewycky	3db143ea8c	Reinstate the optimization from r151449 with a fix to not turn 'gep %x' into 'gep null' when the icmp predicate is unsigned (or is signed without inbounds). llvm-svn: 151467	2012-02-26 02:09:49 +00:00
Rafael Espindola	c8c2b06a90	Don't call dominates on unreachable instructions. llvm-svn: 151466	2012-02-26 01:50:14 +00:00
Nick Lewycky	7bbd72da46	Roll these back to r151448 until I figure out how they're breaking MultiSource/Applications/lua. llvm-svn: 151463	2012-02-25 23:01:19 +00:00
Nick Lewycky	eeeffbb497	An argument and a local identified object (eg. a noalias call) could turn out equal if both are null. In the test, scope type %t and global @y by adding a 'gep' prefix to them. llvm-svn: 151452	2012-02-25 20:19:07 +00:00
Nick Lewycky	7b99bada0b	Fix five-letter typo in comment. llvm-svn: 151450	2012-02-25 19:12:58 +00:00
Nick Lewycky	51f2be8bff	Teach instsimplify to be more aggressive when analyzing comparisons of pointers by using llvm::isIdentifiedObject. Also teach it to handle GEPs that have the same base pointer and constant operands. Fixes PR11238! llvm-svn: 151449	2012-02-25 19:07:42 +00:00
Nick Lewycky	3f885b65a2	Move isKnownNonNull from private implementation detail of BasicAA to a public function that others can use, next to llvm::isIdentifiedObject. llvm-svn: 151446	2012-02-25 10:56:28 +00:00
Chris Lattner	01990f0e1c	fix PR12075, a regression in a recent transform I added. In unreachable code, gep chains can be infinite. Just like "stripPointerCasts", use a set to keep track of visited instructions so we don't recurse infinitely. llvm-svn: 151383	2012-02-24 19:01:58 +00:00
Rafael Espindola	f35c789031	Fix typo. llvm-svn: 151238	2012-02-23 05:38:51 +00:00
Chad Rosier	5dfe6dab25	Remove extra semi-colons. llvm-svn: 151169	2012-02-22 17:25:00 +00:00
Rafael Espindola	337cfaf757	Improve comment. Thanks for Andrew for the suggestion. llvm-svn: 151127	2012-02-22 03:44:46 +00:00
Rafael Espindola	cd06b482d2	Semantically revert 151015. Add a comment on why we should be able to assert the dominance once the dominates method is fixed and why we can use the builder's insertion point. Fixes pr12048. llvm-svn: 151125	2012-02-22 03:21:39 +00:00
Rafael Espindola	b41b407f3d	s/the the/the/ llvm-svn: 151079	2012-02-21 19:27:16 +00:00
Rafael Espindola	729e3aae92	Use more idiomatic assert. llvm-svn: 151026	2012-02-21 03:51:14 +00:00
Rafael Espindola	b2defca267	Avoid warning on non assert builds. llvm-svn: 151025	2012-02-21 03:48:30 +00:00
Rafael Espindola	7d445e92c3	It turns out that with the current scev organization ReuseOrCreateCast cannot know where users will be added. Because of this, it cannot use Builder.GetInsertPoint at all. This patch * removes the FIXME about adding the assert. * adds a comment explaining hy we don't have one. * removes a broken logic that only works for some callers and is not needed since r150884. * adds an assert to caller that would have caught the bug fixed by r150884. llvm-svn: 151015	2012-02-21 01:19:51 +00:00
Eric Christopher	4826c8fbe8	Make this a bit prettier and more obvious when a derived type isn't derived from anything. llvm-svn: 150975	2012-02-20 18:04:39 +00:00
Eric Christopher	300871076e	If a derived type is also a composite type, print that information too. llvm-svn: 150974	2012-02-20 18:04:35 +00:00
Eric Christopher	8979712685	Add support for runtime languages on our forward declarations. llvm-svn: 150973	2012-02-20 18:04:14 +00:00
Chris Lattner	445d8c6b50	fold comparisons of gep'd alloca points with null to false, implementing PR12013. We now compile the testcase to: __Z4testv: ## @_Z4testv ## BB#0: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit pushq %rbx subq $64, %rsp leaq 32(%rsp), %rbx movq %rbx, (%rsp) leaq 64(%rsp), %rax movq %rax, 16(%rsp) movl $1, 32(%rsp) leaq 36(%rsp), %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_2 ## BB#1: callq _free LBB0_2: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret instead of: __Z4testv: ## @_Z4testv ## BB#0: pushq %rbx subq $64, %rsp xorl %eax, %eax leaq (%rsp), %rbx addq $32, %rbx movq %rbx, (%rsp) movq %rbx, 8(%rsp) leaq 64(%rsp), %rcx movq %rcx, 16(%rsp) je LBB0_2 ## BB#1: movl $1, 32(%rsp) movq %rbx, %rax LBB0_2: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit addq $4, %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_4 ## BB#3: callq _free LBB0_4: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret This doesn't shrink clang noticably though. llvm-svn: 150944	2012-02-20 00:42:49 +00:00
Rafael Espindola	991356e89b	Temporarily disable this assert. Looks like it found a similar issue when building bullet. llvm-svn: 150885	2012-02-18 17:51:43 +00:00
Rafael Espindola	82d957593e	Don't skip debug instructions when looking for the insertion point of the cast. If we do, we can end up with inst1 --------------- < Insertion point dbg inst new inst instead of the desired inst1 new inst --------------- < Insertion point dbg inst Another option would be for InsertNoopCastOfTo (or its callers) to move the insertion point and we would end up with inst1 dbg inst new inst --------------- < Insertion point but that complicates the callers. This fixes PR12018 (and firefox's build). llvm-svn: 150884	2012-02-18 17:22:58 +00:00
Eli Friedman	952d1f9f40	Fix a rather nasty regression from r150690: LHS != RHS does not imply LHS->stripPointerCasts() != RHS->stripPointerCasts(). llvm-svn: 150863	2012-02-18 03:29:25 +00:00
Dan Gohman	9017b846d4	Remove a comment about an alternative approach that wouldn't actually work, at least as described. LLVM Metadata is not intended to suppress LLVM IR rules, as it can be stripped at any time. llvm-svn: 150821	2012-02-17 18:33:38 +00:00
Eric Christopher	b23b32e43b	Typo in variable name. llvm-svn: 150796	2012-02-17 07:08:46 +00:00
Benjamin Kramer	08f18b1b74	Revert "InstSimplify: Strip pointer casts early." Turns out this isn't safe, because the code below depends on LHS and RHS having the same type. llvm-svn: 150695	2012-02-16 15:19:59 +00:00
Benjamin Kramer	3d27f71f2d	InstSimplify: Strip pointer casts early. llvm-svn: 150694	2012-02-16 15:03:04 +00:00
Benjamin Kramer	ea51f62e4b	InstSimplify: Ignore pointer casts when constant folding compares between pointers. llvm-svn: 150690	2012-02-16 13:49:39 +00:00
Hal Finkel	56f6b0f219	Have AliasSet::aliasesUnknownInst use pointer TBAA info when available llvm-svn: 150249	2012-02-10 15:52:39 +00:00
Duncan Sands	26641d7c02	Fix PR11948: the result type of an icmp may be a vector of boolean - don't assume it is a boolean. llvm-svn: 150247	2012-02-10 14:31:24 +00:00
Eric Christopher	ae56eecf5f	Add support for a temporary forward decl type. We want this so we can rauw forward declarations if we decide to emit the full type. Part of rdar://10809898 llvm-svn: 150024	2012-02-08 00:22:26 +00:00
Devang Patel	a93cc25b79	Remove tabs. llvm-svn: 150022	2012-02-08 00:17:07 +00:00
Craig Topper	a2886c21d9	Convert assert(0) to llvm_unreachable llvm-svn: 149967	2012-02-07 05:05:23 +00:00
Kostya Serebryany	9e0d377400	The patch resolves the conflict between AddressSanitizer and load widening (GVN). The problem initially reported by Mozilla folks (http://code.google.com/p/address-sanitizer/issues/detail?id=20), but it also prevents us from enabling LLVM bootstrap with AddressSanitizer. llvm-svn: 149925	2012-02-06 22:48:56 +00:00
Chris Lattner	8213c8af29	Remove some dead code and tidy things up now that vectors use ConstantDataVector instead of always using ConstantVector. llvm-svn: 149912	2012-02-06 21:56:39 +00:00
Bill Wendling	0aef16afd5	[unwind removal] Remove all of the code for the dead 'unwind' instruction. There were no 'unwind' instructions being generated before this, so this is in effect a no-op. llvm-svn: 149906	2012-02-06 21:44:22 +00:00
Bill Wendling	d5d95b0b51	[unwind removal] We no longer have 'unwind' instructions being generated, so remove the code that handles them. llvm-svn: 149901	2012-02-06 21:16:41 +00:00
Devang Patel	4488217f73	DebugInfo: Provide a new hook to encode relationship between a property and an ivar. llvm-svn: 149874	2012-02-06 17:49:43 +00:00
Duncan Sands	ae22c60f90	Persuade GCC that there is nothing worth warning about here (there isn't). llvm-svn: 149834	2012-02-05 14:20:11 +00:00
Chris Lattner	cf9e8f6968	reapply the patches reverted in r149470 that reenable ConstantDataArray, but with a critical fix to the SelectionDAG code that optimizes copies from strings into immediate stores: the previous code was stopping reading string data at the first nul. Address this by adding a new argument to llvm::getConstantStringInfo, preserving the behavior before the patch. llvm-svn: 149800	2012-02-05 02:29:43 +00:00
Qirun Zhang	e788fac623	remove the blank line from previous ci. llvm-svn: 149758	2012-02-04 03:18:47 +00:00
Qirun Zhang	dabce3f4e9	test commit. add a blank line. llvm-svn: 149757	2012-02-04 03:15:26 +00:00
Devang Patel	cc481596e4	Introduce DIObjCProperty. This will be used to encode objective-c property. llvm-svn: 149732	2012-02-04 00:59:25 +00:00
Stepan Dyatkovskiy	513aaa5691	SwitchInst refactoring. The purpose of refactoring is to hide operand roles from SwitchInst user (programmer). If you want to play with operands directly, probably you will need lower level methods than SwitchInst ones (TerminatorInst or may be User). After this patch we can reorganize SwitchInst operands and successors as we want. What was done: 1. Changed semantics of index inside the getCaseValue method: getCaseValue(0) means "get first case", not a condition. Use getCondition() if you want to resolve the condition. I propose don't mix SwitchInst case indexing with low level indexing (TI successors indexing, User's operands indexing), since it may be dangerous. 2. By the same reason findCaseValue(ConstantInt*) returns actual number of case value. 0 means first case, not default. If there is no case with given value, ErrorIndex will returned. 3. Added getCaseSuccessor method. I propose to avoid usage of TerminatorInst::getSuccessor if you want to resolve case successor BB. Use getCaseSuccessor instead, since internal SwitchInst organization of operands/successors is hidden and may be changed in any moment. 4. Added resolveSuccessorIndex and resolveCaseIndex. The main purpose of these methods is to see how case successors are really mapped in TerminatorInst. 4.1 "resolveSuccessorIndex" was created if you need to level down from SwitchInst to TerminatorInst. It returns TerminatorInst's successor index for given case successor. 4.2 "resolveCaseIndex" converts low level successors index to case index that curresponds to the given successor. Note: There are also related compatability fix patches for dragonegg, klee, llvm-gcc-4.0, llvm-gcc-4.2, safecode, clang. llvm-svn: 149481	2012-02-01 07:49:51 +00:00
Argyrios Kyrtzidis	17c981a45b	Revert Chris' commits up to r149348 that started causing VMCoreTests unit test to fail. These are: r149348 r149351 r149352 r149354 r149356 r149357 r149361 r149362 r149364 r149365 llvm-svn: 149470	2012-02-01 04:51:17 +00:00
Chris Lattner	997348e9fe	remove the last vestiges of llvm::GetConstantStringInfo, in CodeGen. llvm-svn: 149356	2012-01-31 05:09:17 +00:00
Chris Lattner	108423a94a	Change ConstantArray::get to form a ConstantDataArray when possible, kicking in the big win of ConstantDataArray. As part of this, change the implementation of GetConstantStringInfo in ValueTracking to work with ConstantDataArray (and not ConstantArray) making it dramatically, amazingly, more efficient in the process and renaming it to getConstantStringInfo. This keeps around a GetConstantStringInfo entrypoint that (grossly) forwards to getConstantStringInfo and constructs the std::string required, but existing clients should move over to getConstantStringInfo instead. llvm-svn: 149351	2012-01-31 04:42:22 +00:00
Rafael Espindola	bb893fea6b	Add r149110 back with a fix for when the vector and the int have the same width. llvm-svn: 149151	2012-01-27 23:33:07 +00:00
Rafael Espindola	a4062624d1	Revert r149110 and add a testcase that was crashing since that revision. Unfortunately I also had to disable constant-pool-sharing.ll the code it tests has been updated to use the IL logic. llvm-svn: 149148	2012-01-27 22:42:48 +00:00
Chris Lattner	111d6ee655	enhance constant folding to be able to constant fold bitcast of ConstantVector's to integer type. llvm-svn: 149110	2012-01-27 01:44:03 +00:00
Chris Lattner	61a1d6cb81	progress making the world safe to ConstantDataVector. While we're at it, allow PatternMatch's "neg" pattern to match integer vector negations, and enhance ComputeNumSigned bits to handle shl of vectors. llvm-svn: 149082	2012-01-26 21:37:55 +00:00
Nick Lewycky	0e496cddf0	Use precomputed BB size instead of BB->size(). llvm-svn: 148964	2012-01-25 18:54:13 +00:00
Nick Lewycky	70d50ee8fb	Support pointer comparisons against constants, when looking at the inline-cost savings from a pointer argument becoming an alloca. Sometimes callees will even compare a pointer to null and then branch to an otherwise unreachable block! Detect these cases and compute the number of saved instructions, instead of bailing out and reporting no savings. llvm-svn: 148941	2012-01-25 08:27:40 +00:00
Chris Lattner	6705883ad8	use Constant::getAggregateElement to simplify a bunch of code. llvm-svn: 148934	2012-01-25 06:48:06 +00:00
Chris Lattner	9be59599b3	Use the right method to get the # elements in a CDS. llvm-svn: 148897	2012-01-25 01:27:20 +00:00
Chris Lattner	f7eb543380	teach valuetracking about ConstantDataSequential llvm-svn: 148790	2012-01-24 07:54:10 +00:00
Chris Lattner	e166a8548f	switch SCEV to use the new ConstantFoldLoadThroughGEPIndices function instead of its own hard coded thing, allowing it to handle ConstantDataSequential and fixing some obscure bugs (e.g. it would previously crash on a CAZ of vector type). llvm-svn: 148788	2012-01-24 05:49:24 +00:00
Chris Lattner	f488b35826	Split the interesting bits of ConstantFoldLoadThroughGEPConstantExpr out into a new ConstantFoldLoadThroughGEPIndices (more useful) function and rewrite it to be simpler, more efficient, and to handle the new ConstantDataSequential type. Enhance ConstantFoldLoadFromConstPtr to handle ConstantDataSequential. llvm-svn: 148786	2012-01-24 05:43:50 +00:00
David Blaikie	46a9f016c5	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
Benjamin Kramer	fe4848b55d	Remove obviously invalid early exit that prevented analyzing ConstantAggregateZeros. Found by the clang static analyzer. llvm-svn: 148540	2012-01-20 14:42:25 +00:00
Nick Lewycky	e8415fea4b	Fix CountCodeReductionForAlloca to more accurately represent what SROA can and can't handle. Also don't produce non-zero results for things which won't be transformed by SROA at all just because we saw the loads/stores before we saw the use of the address. llvm-svn: 148536	2012-01-20 08:35:20 +00:00
Andrew Trick	c908b43d9f	SCEVExpander fixes. Affects LSR and indvars. LSR has gradually been improved to more aggressively reuse existing code, particularly existing phi cycles. This exposed problems with the SCEVExpander's sloppy treatment of its insertion point. I applied some rigor to the insertion point problem that will hopefully avoid an endless bug cycle in this area. Changes: - Always used properlyDominates to check safe code hoisting. - The insertion point provided to SCEV is now considered a lower bound. This is usually a block terminator or the use itself. Under no cirumstance may SCEVExpander insert below this point. - LSR is reponsible for finding a "canonical" insertion point across expansion of different expressions. - Robust logic to determine whether IV increments are in "expanded" form and/or can be safely hoisted above some insertion point. Fixes PR11783: SCEVExpander assert. llvm-svn: 148535	2012-01-20 07:41:13 +00:00
Bill Wendling	75afc7afe8	Remove dead code. llvm-svn: 148384	2012-01-18 10:10:28 +00:00
Jakub Staszak	173bce3d2b	Move includes to the .cpp file. llvm-svn: 148342	2012-01-17 22:16:31 +00:00
Andrew Trick	23ef0d6c40	Fix a corner case hit by redundant phi elimination running after LSR. Fixes PR11761: bad IR w/ redundant Phi elim llvm-svn: 148177	2012-01-14 03:17:23 +00:00
Bill Wendling	58c7569854	A DenseMap of a std::map isn't a very good idea because the "grow()" method will need to make a deep copy of each of the std::maps. Use a std::map of the std::map instead. This improves the compile time of sqlite3 by ~2%. llvm-svn: 148003	2012-01-12 01:41:03 +00:00
Bill Wendling	4ec081a4d2	Revert r147978. A DenseMap's iterators may become invalidated here. llvm-svn: 147980	2012-01-11 23:43:34 +00:00
Bill Wendling	f0275df9e3	Use a DenseMap. This appears to improve sqlite3's compile time by ~2%. llvm-svn: 147978	2012-01-11 22:57:32 +00:00
Andrew Trick	e81211f45c	Clarified the SCEV getSmallConstantTripCount interface with in-your-face comments. This interface is misleading and dangerous, but it is actually what we need for unrolling. llvm-svn: 147926	2012-01-11 06:52:55 +00:00
Eric Christopher	43a1182975	Don't avoid recursing for pointer types, just reference types. Expand on the comment. Fixes constvars.exp on the gdb test builder. llvm-svn: 147897	2012-01-11 00:01:29 +00:00
Chandler Carruth	4c0ee749bb	Cleanup these asserts to follow common LLVM style and coding conventions. Also, clarify the grouping of one of the asserts to silence -Wparentheses. llvm-svn: 147863	2012-01-10 18:18:52 +00:00
David Blaikie	edbb58c577	Remove unnecessary default cases in switches that cover all enum values. llvm-svn: 147855	2012-01-10 16:47:17 +00:00
Andrew Trick	d5d2db9af9	Enable LSR IV Chains with sufficient heuristics. These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target specific optimization following LSR. The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet. As a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address plus stride addition in the addressing mode. GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826	2012-01-10 01:45:08 +00:00
Devang Patel	fa8df4837a	Update language check. Do not ignore DW_LANG_Python. Patch by Joe Groff! llvm-svn: 147781	2012-01-09 17:49:47 +00:00
Andrew Trick	f730f39f3f	Cleanup comments and argument types related to my previous replaceCongruentPhis checkin. llvm-svn: 147709	2012-01-07 01:29:21 +00:00
Andrew Trick	5adedf5d47	Extended replaceCongruentPhis to handle mixed phi types. llvm-svn: 147707	2012-01-07 01:12:09 +00:00
Andrew Trick	881a776875	Expose isNonConstantNegative to users of ScalarEvolution. llvm-svn: 147700	2012-01-07 00:27:31 +00:00
Andrew Trick	9a5b242d3c	Put all IVUsers in the processed set. Allow querying IVUsers with isIVUserOrOperand. llvm-svn: 147686	2012-01-06 21:41:55 +00:00
Andrew Trick	b8045cbcb1	SCEVExpander: hoistStep should check strict dominance. llvm-svn: 147683	2012-01-06 21:23:43 +00:00
Dan Gohman	7ac046a261	Generalize isSafeToSpeculativelyExecute to work on arbitrary Values, rather than just Instructions, since it's interesting for ConstantExprs too. llvm-svn: 147560	2012-01-04 23:01:09 +00:00
Andrew Trick	cbcc98fb50	Fix SCEVExpander to handle loops with no preheader when LSR gives it a "phony" insertion point. Fixes rdar://10619599: "SelectionDAGBuilder shouldn't visit PHI nodes!" assert llvm-svn: 147439	2012-01-02 21:25:10 +00:00
Benjamin Kramer	9442cd01f6	PatternMatch: Introduce a matcher for instructions with the "exact" bit. Use it to simplify a few matchers. llvm-svn: 147403	2012-01-01 17:55:30 +00:00
Nick Lewycky	4c378a4453	Change CaptureTracking to pass a Use* instead of a Value* when a value is captured. This allows the tracker to look at the specific use, which may be especially interesting for function calls. Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does not iterate until a fixpoint and does not guarantee that it produces the same result regardless of iteration order. The new implementation builds up a graph of how arguments are passed from function to function, and uses a bottom-up walk on the argument-SCCs to assign nocapture. This gets us nocapture more often, and does so rather efficiently and independent of iteration order. llvm-svn: 147327	2011-12-28 23:24:21 +00:00
Benjamin Kramer	4ee5747fdd	ComputeMaskedBits: Make knownzero computation more aggressive for ctlz with undef zero. unsigned foo(unsigned x) { return 31 - __builtin_clz(x); } now compiles into a single "bsrl" instruction on x86. llvm-svn: 147255	2011-12-24 17:31:46 +00:00
Chandler Carruth	b024aa021d	Make the unreachable probability much much heavier. The previous probability wouldn't be considered "hot" in some weird loop structures or other compounding probability patterns. This makes it much harder to confuse, but isn't really a principled fix. I'd actually like it if we could model a zero probability, as it would make this much easier to reason about. Suggestions for how to do this better are welcome. llvm-svn: 147142	2011-12-22 09:26:37 +00:00
Nick Lewycky	c186d07bbe	Continue counting intrinsics as instructions (except when they aren't, such as debug info) and for being vector operations. Fixes regression from r147037. llvm-svn: 147093	2011-12-21 20:26:03 +00:00
Nick Lewycky	281e2747e0	Fix typo and spacing, no functionality change. llvm-svn: 147092	2011-12-21 20:21:55 +00:00
Nick Lewycky	da22fc6a1d	A call to a function marked 'noinline' is not an inline candidate. The sole call site of an intrinsic is also not an inline candidate. While here, make it more obvious that this code ignores all intrinsics. Noticed by inspection! llvm-svn: 147037	2011-12-21 06:06:30 +00:00
Nick Lewycky	b4039f633c	Make some intrinsics safe to speculatively execute. llvm-svn: 147036	2011-12-21 05:52:02 +00:00
Jakub Staszak	96f8c551e3	Add some constantness to BranchProbabilityInfo and BlockFrequnencyInfo. llvm-svn: 146986	2011-12-20 20:03:10 +00:00
David Blaikie	a379b18173	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146960	2011-12-20 02:50:00 +00:00
Andrew Trick	b9aa26f8ea	LSR: Fix another corner case in expansion of postinc users. Fixes PR11571: Instruction does not dominate all uses llvm-svn: 146950	2011-12-20 01:42:24 +00:00
Joerg Sonnenberger	d6cb7649d8	Allow inlining of functions with returns_twice calls, if they have the attribute themselve. llvm-svn: 146851	2011-12-18 20:35:43 +00:00
Eric Christopher	27886c6c1e	When recursing for the original size of a type, stop if we are at a pointer or a reference type - we actually just want the size of the pointer then for that. Fixes rdar://10335756 llvm-svn: 146785	2011-12-16 23:42:45 +00:00
Devang Patel	78847f0bbe	In DICompositeType, referenced to derived type is either metadata or null. llvm-svn: 146744	2011-12-16 17:51:31 +00:00
Devang Patel	cdd833eb28	Virtual table holder field is either metadata or null. llvm-svn: 146665	2011-12-15 17:55:56 +00:00
Dan Gohman	75d7d5e988	Move Instruction::isSafeToSpeculativelyExecute out of VMCore and into Analysis as a standalone function, since there's no need for it to be in VMCore. Also, update it to use isKnownNonZero and other goodies available in Analysis, making it more precise, enabling more aggressive optimization. llvm-svn: 146610	2011-12-14 23:49:11 +00:00
Andrew Trick	e0ced62119	LSR: Fold redundant bitcasts on-the-fly. llvm-svn: 146597	2011-12-14 22:07:19 +00:00
Eli Friedman	fdeaf25827	Fix a stupid typo in MemDepPrinter. llvm-svn: 146549	2011-12-14 02:54:39 +00:00
Daniel Dunbar	8889bb08b8	LLVMBuild: Introduce a common section which currently has a list of the subdirectories to traverse into. - Originally I wanted to avoid this and just autoscan, but this has one key flaw in that new subdirectories can not automatically trigger a rerun of the llvm-build tool. This is particularly a pain when switching back and forth between trees where one has added a subdirectory, as the dependencies will tend to be wrong. This will also eliminates FIXME implicitly. llvm-svn: 146436	2011-12-12 22:45:54 +00:00
Daniel Dunbar	27a7489a03	LLVMBuild: Remove trailing newline, which irked me. llvm-svn: 146409	2011-12-12 19:48:00 +00:00
Chandler Carruth	58a71ed339	Switch llvm.cttz and llvm.ctlz to accept a second i1 parameter which indicates whether the intrinsic has a defined result for a first argument equal to zero. This will eventually allow these intrinsics to accurately model the semantics of GCC's __builtin_ctz and __builtin_clz and the X86 instructions (prior to AVX) which implement them. This patch merely sets the stage by extending the signature of these intrinsics and establishing auto-upgrade logic so that the old spelling still works both in IR and in bitcode. The upgrade logic preserves the existing (inefficient) semantics. This patch should not change any behavior. CodeGen isn't updated because it can use the existing semantics regardless of the flag's value. Note that this will be followed by API updates to Clang and DragonEgg. Reviewed by Nick Lewycky! llvm-svn: 146357	2011-12-12 04:26:04 +00:00
Chad Rosier	8abf65a130	Probably not a good idea to convert a single vector load into a memcpy. We don't do this now, but add a test case to prevent this from happening in the future. Additional test for rdar://9892684 llvm-svn: 145879	2011-12-06 00:19:08 +00:00
Nadav Rotem	3924cb0267	Add support for vectors of pointers. llvm-svn: 145801	2011-12-05 06:29:09 +00:00
Benjamin Kramer	bbf3c60786	Clear the new cache. llvm-svn: 145771	2011-12-03 15:19:55 +00:00
Benjamin Kramer	3664708378	Add a "seen blocks" cache to LVI to avoid a linear scan over the whole cache just to remove no blocks from the maps. -15% on ARMDisassembler.cpp (Release build). It's not that great to add another layer of caching to the caching-heavy LVI but I don't see a better way. llvm-svn: 145770	2011-12-03 15:16:45 +00:00
Chad Rosier	0155a63513	Add support for constant folding the pow intrinsic. rdar://10514247 llvm-svn: 145730	2011-12-03 00:00:03 +00:00
Chad Rosier	43a33066b4	Fix a few more places where TargetData/TargetLibraryInfo is not being passed. Add FIXMEs to places that are non-trivial to fix. llvm-svn: 145661	2011-12-02 01:26:24 +00:00
Chad Rosier	576c0f8e54	Abuse of mass replace isn't warranted even when the build is failing. Thanks for the suggestion, Eric. llvm-svn: 145643	2011-12-01 23:16:03 +00:00
Chad Rosier	54a506dcb1	Fix build by not assuming TLI is guaranteed. Will have to track down cases where TLI isn't being passed to ensure we don't miss opportunities to fold calls. llvm-svn: 145641	2011-12-01 22:38:31 +00:00
Chad Rosier	3367123b12	Prevent library calls from being folded if -fno-builtin has been specified. rdar://10500969 llvm-svn: 145639	2011-12-01 22:14:50 +00:00
Chad Rosier	e6de63dfc5	Last bit of TargetLibraryInfo propagation. Also fixed a case for TargetData where it appeared beneficial to pass. More of rdar://10500969 llvm-svn: 145630	2011-12-01 21:29:16 +00:00
Chad Rosier	c24b86ffbe	Propagate TargetLibraryInfo throughout ConstantFolding.cpp and InstructionSimplify.cpp. Other fixups as needed. Part of rdar://10500969 llvm-svn: 145559	2011-12-01 03:08:23 +00:00
Nick Lewycky	e659b8459e	Make use of "getScalarType()". No functionality change. llvm-svn: 145556	2011-12-01 02:39:36 +00:00
Andrew Trick	ceafa2c746	LSR: handle the expansion of phi operands that use postinc forms of the IV. Fixes PR11431: SCEVExpander::expandAddRecExprLiterally(const llvm::SCEVAddRecExpr*): Assertion `(!isa<Instruction>(Result) \|\| SE.DT->dominates(cast<Instruction>(Result), Builder.GetInsertPoint())) && "postinc expansion does not dominate use"' failed. llvm-svn: 145482	2011-11-30 06:07:54 +00:00
Daniel Dunbar	539d0a8a09	build/CMake: Finish removal of add_llvm_library_dependencies. llvm-svn: 145420	2011-11-29 19:25:30 +00:00
Duncan Sands	ca6f8ddbf8	Fix a theoretical problem (not seen in the wild): if different instances of a weak variable are compiled by different compilers, such as GCC and LLVM, while LLVM may increase the alignment to the preferred alignment there is no reason to think that GCC will use anything more than the ABI alignment. Since it is the GCC version that might end up in the final program (as the linkage is weak), it is wrong to increase the alignment of loads from the global up to the preferred alignment as the alignment might only be the ABI alignment. Increasing alignment up to the ABI alignment might be OK, but I'm not totally convinced that it is. It seems better to just leave the alignment of weak globals alone. llvm-svn: 145413	2011-11-29 18:26:38 +00:00
Andrew Trick	d25089f8e0	SCEV fix. In general, Add/Mul expressions should not inherit NSW/NUW. This reverts r139450, fixes r139453, and adds much needed comments and a unit test. llvm-svn: 145367	2011-11-29 02:16:38 +00:00
Andrew Trick	d912a5b2e3	Make SCEV print <nsw><nuw> for Add/MulExpr. llvm-svn: 145364	2011-11-29 02:06:35 +00:00
Eli Friedman	e7ab1a2f0f	Make SelectionDAG::InferPtrAlignment use llvm::ComputeMaskedBits instead of duplicating the logic for globals. Make llvm::ComputeMaskedBits handle GlobalVariables slightly more aggressively, to match what InferPtrAlignment knew how to do. llvm-svn: 145304	2011-11-28 22:48:22 +00:00
Andrew Trick	a8bdb7cbf1	Remove the temporary flag -disable-unroll-scev and dead code. SCEV should now be used for trip count analysis, not LoopInfo. llvm-svn: 145262	2011-11-28 19:22:09 +00:00
Benjamin Kramer	7ba71be392	Move code into anonymous namespaces. llvm-svn: 145154	2011-11-26 23:01:57 +00:00
Benjamin Kramer	6e013bf96c	Validate the return type when checking if a function is malloc. Fixes PR11426. Not sure if a test case with a "wrong" malloc would be useful. llvm-svn: 145106	2011-11-23 17:58:47 +00:00
Duncan Sands	81a2af12d6	Fix a crash in which a multiplication was being reported as being both negative and positive: positive, because it could be directly computed to be positive; negative, because the nsw flags means it is either negative or undefined (the multiplication always overflowed). llvm-svn: 145104	2011-11-23 16:26:47 +00:00
Nick Lewycky	063ae5897c	Fix crasher in GVN due to my recent capture tracking changes. llvm-svn: 145047	2011-11-21 19:42:56 +00:00
Nick Lewycky	aa2a00db35	Add virtual destructor. Whoops! llvm-svn: 145044	2011-11-21 18:32:21 +00:00
Nick Lewycky	6ae03c3378	Less template, more virtual! Refactoring suggested by Chris in code review. llvm-svn: 145014	2011-11-20 19:37:06 +00:00
Nick Lewycky	612d70b19d	Refactor code to use new attribute getters on CallSite for NoCapture and ByVal. Suggested in code review by Eli. That code in InstCombine looks kinda suspicious. llvm-svn: 145013	2011-11-20 19:09:04 +00:00
Benjamin Kramer	b5ba2eef2d	SCEV: Actually set overflow flags on add expressions. setFlags doesn't modify its arguments. llvm-svn: 145007	2011-11-20 10:24:36 +00:00
Andrew Trick	6b4d578f54	Fix a corner case in updating LoopInfo after fully unrolling an outer loop. The loop tree's inclusive block lists are painful and expensive to update. (I have no idea why they're inclusive). The design was supposed to handle this case but the implementation missed it and my unit tests weren't thorough enough. Fixes PR11335: loop unroll update. llvm-svn: 144970	2011-11-18 03:42:41 +00:00
Andrew Trick	90c7a108ca	Fix SCEV overly optimistic back edge taken count for multi-exit loops. Fixes PR11375: Different results for 'clang++ huh.cpp'... llvm-svn: 144746	2011-11-16 00:52:40 +00:00
Benjamin Kramer	184e3ceea0	Missed some users of Value::getNameStr. llvm-svn: 144656	2011-11-15 18:30:06 +00:00
Benjamin Kramer	1f97a5a671	Remove all remaining uses of Value::getNameStr(). llvm-svn: 144648	2011-11-15 16:27:03 +00:00
Benjamin Kramer	4c93d15f09	Twinify GraphWriter a little bit. llvm-svn: 144647	2011-11-15 16:26:38 +00:00
Nick Lewycky	7013a19e8a	Refactor capture tracking (which already had a couple flags for whether returns and stores capture) to permit the caller to see each capture point and decide whether to continue looking. Use this inside memdep to do an analysis that basicaa won't do. This lets us solve another devirtualization case, fixing PR8908! llvm-svn: 144580	2011-11-14 22:49:42 +00:00
Nick Lewycky	d48ab84556	Don't try to loop on iterators that are potentially invalidated inside the loop. Fixes PR11361! llvm-svn: 144454	2011-11-12 03:09:12 +00:00
Nick Lewycky	47eebcfd66	Fix typo in comment. llvm-svn: 144236	2011-11-09 22:45:04 +00:00
Nick Lewycky	0485d51a76	Don't forget to check FlagNW when determining whether an AddRecExpr will wrap or not. Patch by Brendon Cahoon! llvm-svn: 144173	2011-11-09 07:11:37 +00:00
Eli Friedman	0bae8b2cfb	Fix code to match comment. Fixes PR11340, a regression from r143209. llvm-svn: 144121	2011-11-08 21:08:02 +00:00
Dan Gohman	85977e6ab4	Teach instsimplify to simplify calls to undef. llvm-svn: 143719	2011-11-04 18:32:42 +00:00
Daniel Dunbar	bf9bba47a1	build: Add initial cut at LLVMBuild.txt files. llvm-svn: 143634	2011-11-03 18:53:17 +00:00
Duncan Sands	3d5692a475	Reapply commit 143214 with a fix: m_ICmp doesn't match conditions with the given predicate, it matches any condition and returns the predicate - d'oh! Original commit message: The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143318	2011-10-30 19:56:36 +00:00
Eli Friedman	3af3c046a9	Revert r143214; it's breaking a bunch of stuff. llvm-svn: 143265	2011-10-29 00:56:07 +00:00
Duncan Sands	280bc553b3	The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143214	2011-10-28 19:01:20 +00:00
Duncan Sands	985ba6386d	A shift of a power of two is a power of two or zero. For completeness - not spotted in the wild. llvm-svn: 143211	2011-10-28 18:30:05 +00:00
Duncan Sands	92af0a8a7f	Fold icmp ugt (udiv X, Y), X to false. Spotted by my super-optimizer in 186.crafty. llvm-svn: 143209	2011-10-28 18:17:44 +00:00
Duncan Sands	7cb61e5a0e	Reapply commit 143028 with a fix: the problem was casting a ConstantExpr Mul using BinaryOperator (which only works for instructions) when it should have been a cast to OverflowingBinaryOperator (which also works for constants). While there, correct a few other dubious looking uses of BinaryOperator. Thanks to Chad Rosier for the testcase. Original commit message: My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y \| 1). This occurs in 464.h264ref. llvm-svn: 143125	2011-10-27 19:16:21 +00:00
Bob Wilson	1455ce27e4	Revert Duncan's r143028 expression folding which appears to be the culprit behind a compile failure on 483.xalancbmk. llvm-svn: 143102	2011-10-27 15:47:25 +00:00
Duncan Sands	ba286d7c73	The maximum power of 2 dividing a power of 2 is itself. This occurs in 403.gcc and was spotted by my super-optimizer. llvm-svn: 143054	2011-10-26 20:55:21 +00:00
Duncan Sands	1d2bb9882d	My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y \| 1). This occurs in 464.h264ref. llvm-svn: 143028	2011-10-26 15:31:51 +00:00
Duncan Sands	a370f3e34e	Restore commits 142790 and 142843 - they weren't breaking the build bots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142919	2011-10-25 12:28:52 +00:00
Chandler Carruth	32f46e7c07	Fix the API usage in loop probability heuristics. It was incorrectly classifying many edges as exiting which were in fact not. These mainly formed edges into sub-loops. It was also not correctly classifying all returning edges out of loops as leaving the loop. With this match most of the loop heuristics are more rational. Several serious regressions on loop-intesive benchmarks like perlbench's loop tests when built with -enable-block-placement are fixed by these updated heuristics. Unfortunately they in turn uncover some other regressions. There are still several improvemenst that should be made to loop heuristics including trip-count, and early back-edge management. llvm-svn: 142917	2011-10-25 09:47:41 +00:00
Duncan Sands	805c5b92c8	Speculatively revert commits 142790 and 142843 to see if it fixes the dragonegg and llvm-gcc self-host buildbots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142916	2011-10-25 09:26:43 +00:00
Nick Lewycky	a58fb48a55	Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142843	2011-10-24 21:02:38 +00:00
Chandler Carruth	7111f4564c	Remove return heuristics from the static branch probabilities, and introduce no-return or unreachable heuristics. The return heuristics from the Ball and Larus paper don't work well in practice as they pessimize early return paths. The only good hitrate return heuristics are those for: - NULL return - Constant return - negative integer return Only the last of these three can possibly require significant code for the returning block, and even the last is fairly rare and usually also a constant. As a consequence, even for the cold return paths, there is little code on that return path, and so little code density to be gained by sinking it. The places where sinking these blocks is valuable (inner loops) will already be weighted appropriately as the edge is a loop-exit branch. All of this aside, early returns are nearly as common as all three of these return categories, and should actually be predicted as taken! Rather than muddy the waters of the static predictions, just remain silent on returns and let the CFG itself dictate any layout or other issues. However, the return heuristic was flagging one very important case: unreachable. Unfortunately it still gave a 1/4 chance of the branch-to-unreachable occuring. It also didn't do a rigorous job of finding those blocks which post-dominate an unreachable block. This patch builds a more powerful analysis that should flag all branches to blocks known to then reach unreachable. It also has better worst-case runtime complexity by not looping through successors for each block. The previous code would perform an N^2 walk in the event of a single entry block branching to N successors with a switch where each successor falls through to the next and they finally fall through to a return. Test case added for noreturn heuristics. Also doxygen comments improved along the way. llvm-svn: 142793	2011-10-24 12:01:08 +00:00
Nick Lewycky	9be7f277e4	Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142790	2011-10-24 06:57:05 +00:00
Nick Lewycky	8e904dee82	PHI nodes not in the loop header aren't part of the loop iteration initial state. Furthermore, they might not have two operands. This fixes the underlying issue behind the crashes introduced in r142781. llvm-svn: 142788	2011-10-24 05:51:01 +00:00
Nick Lewycky	9d28c26d77	Speculatively revert r142781. Bots are showing Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed. coming out of indvars. llvm-svn: 142786	2011-10-24 04:00:25 +00:00
Chandler Carruth	7a0094a673	Simplify the design of BranchProbabilityInfo by collapsing it into a single class. Previously it was split between two classes, one internal and one external. The concern seemed to center around exposing the weights used, but those can remain confined to the implementation file. Having a single class to maintain the state and analyses in use will also simplify several of the enhancements I want to make to our static heuristics. llvm-svn: 142783	2011-10-24 01:40:45 +00:00
Nick Lewycky	1700007ecc	Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142781	2011-10-23 23:43:14 +00:00
Chandler Carruth	24cee10fb1	Tidy up a loop to be more idiomatic for LLVM's codebase, and remove some extraneous whitespace. Trying to clean-up this pass as much as I can before I start making functional changes. llvm-svn: 142780	2011-10-23 22:40:13 +00:00
Chandler Carruth	1c8ace0e89	Teach the BranchProbabilityInfo pass to print its results, and use that to bring it under direct test instead of merely indirectly testing it in the BlockFrequencyInfo pass. The next step is to start adding tests for the various heuristics employed, and to start fixing those heuristics once they're under test. llvm-svn: 142778	2011-10-23 21:21:50 +00:00
Benjamin Kramer	929f53f65c	Add compare operators to BranchProbability and use it to determine if an edge is hot. llvm-svn: 142751	2011-10-23 11:19:14 +00:00
Nick Lewycky	a6674c7fc9	Make SCEV's brute force analysis stronger in two ways. Firstly, we should be able to constant fold load instructions where the argument is a constant. Second, we should be able to watch multiple PHI nodes through the loop; this patch only supports PHIs in loop headers, more can be done here. With this patch, we now constant evaluate: static const int arr[] = {1, 2, 3, 4, 5}; int test() { int sum = 0; for (int i = 0; i < 5; ++i) sum += arr[i]; return sum; } llvm-svn: 142731	2011-10-22 19:58:20 +00:00
Benjamin Kramer	606a50a9f8	Extend the floating point heuristic to consider NaN checks unlikely. llvm-svn: 142687	2011-10-21 21:13:47 +00:00
Benjamin Kramer	1e731a10d0	BranchProbabilityInfo: floating point equality is unlikely. This is from the same paper from Ball and Larus as the rest of the currently implemented heuristics. llvm-svn: 142677	2011-10-21 20:12:47 +00:00
Eli Friedman	68db4c2699	A FIXME about block addresses and indirectbr. llvm-svn: 142569	2011-10-20 04:05:33 +00:00
Eli Friedman	f0bb0c2934	Simplify; no intended functional change. llvm-svn: 142567	2011-10-20 03:23:14 +00:00
Nick Lewycky	462098824f	"@string = constant i8 0" is a value i8* string of length zero. Analyze that correctly in GetStringLength, fixing PR11181! llvm-svn: 142558	2011-10-20 00:34:35 +00:00
Chandler Carruth	deac50cba9	Generalize the reading of probability metadata to work for both branches and switches, with arbitrary numbers of successors. Still optimized for the common case of 2 successors for a conditional branch. Add a test case for switch metadata showing up in the BlockFrequencyInfo pass. llvm-svn: 142493	2011-10-19 10:32:19 +00:00
Chandler Carruth	d27a7a947b	Teach the BranchProbabilityInfo analysis pass to read any metadata encoding of probabilities. In the absense of metadata, it continues to fall back on static heuristics. This allows __builtin_expect, after lowering through llvm.expect a branch instruction's metadata, to actually enter the branch probability model. This is one component of resolving PR2577. llvm-svn: 142492	2011-10-19 10:30:30 +00:00
Chandler Carruth	343fad44ea	Add pass printing support to BlockFrequencyInfo pass. The implementation layer already had support for printing the results of this analysis, but the wiring was missing. Now that printing the analysis works, actually bring some of this analysis, and the BranchProbabilityInfo analysis that it wraps, under test! I'm planning on fixing some bugs and doing other work here, so having a nice place to add regression tests and a way to observe the results is really useful. llvm-svn: 142491	2011-10-19 10:12:41 +00:00
Devang Patel	7973e78800	Update DebugInfoFinder to match recent debug info encoding changes. llvm-svn: 142295	2011-10-17 22:30:34 +00:00
Bill Wendling	63a4ea1859	Correct over-zealous removal of hack. Some code want to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	2a83a71c2a	Now that we have the ReturnsTwice function attribute, this method is obsolete. Check the attribute instead. <rdar://problem/8031714> llvm-svn: 142212	2011-10-17 18:22:52 +00:00
Chandler Carruth	91f4faf877	Delete a dead member. Dunno if this was ever used, but the current code directly manipulates the weights inside of the BranchProbabilityInfo that is passed in. llvm-svn: 142163	2011-10-16 22:27:54 +00:00
Andrew Trick	fd4ca0f4ac	Fix SCEVExpander assert during LSR: "argument of incompatible type". Just because we're dealing with a GEP doesn't mean we can assert the SCEV has a pointer type. The fix is simply to ignore the SCEV pointer type, which we really didn't need. Fixes PR11138 webkit crash. llvm-svn: 142058	2011-10-15 06:19:55 +00:00
Nick Lewycky	a447e0f38f	An instruction's operands aren't necessarily instructions or constants. They could be arguments, for example. No testcase because this is a bug-fix broken out of a larger optimization patch. llvm-svn: 141951	2011-10-14 09:38:46 +00:00
Eli Friedman	c1702c8f22	Enhance the memdep interface so that users can tell the difference between a dependency which cannot be calculated and a path reaching the entry point of the function. This patch introduces isNonFuncLocal, which replaces isUnknown in some cases. Patch by Xiaoyi Guo. llvm-svn: 141896	2011-10-13 22:14:57 +00:00
Andrew Trick	870c1a3f15	Reapply r141870, SCEV expansion of post-inc. Speculatively reapply to see if this test case still crashes on linux. I may have fixed it in my last checkin. llvm-svn: 141895	2011-10-13 21:55:29 +00:00
Andrew Trick	7e442569dc	Fix memory corruption I introduced a few checkins ago. Self-review easily caught this obvious bug. llvm-svn: 141880	2011-10-13 18:49:23 +00:00
Andrew Trick	41c253c35c	Revert r141870. The test case crashes on linux with data corruption. A deeper issue was exposed. llvm-svn: 141873	2011-10-13 17:58:24 +00:00
Andrew Trick	e15d6e14e3	LSR: Reuse the post-inc expansion of expressions. This avoids unnecessary expansion of expressions and allows the SCEV expander to work on expression DAGs, not just trees. Fixes PR11090. llvm-svn: 141870	2011-10-13 17:31:47 +00:00
Andrew Trick	1393ec29af	SCEV: Rewrite TrandformForPostIncUse to handle expression DAGs, not just expression trees. Partially fixes PR11090. Test case will be with the full fix. llvm-svn: 141868	2011-10-13 17:21:09 +00:00
Andrew Trick	adfe72b33c	Slightly more useful tracing. llvm-svn: 141867	2011-10-13 17:06:38 +00:00
Eric Christopher	6647b83087	Add a new wrapper node for a DILexicalBlock that encapsulates it and a file. Since it should only be used when necessary propagate it through the backend code generation and tweak testcases accordingly. This helps with code like in clang's test/CodeGen/debug-info-line.c where we have multiple #line directives within a single lexical block and want to generate only a single block that contains each file change. Part of rdar://10246360 llvm-svn: 141729	2011-10-11 22:59:11 +00:00
Andrew Trick	f9201c572e	Move replaceCongruentIVs into SCEVExapander and bias toward "expanded" IVs. Indvars previously chose randomly between congruent IVs. Now it will bias the decision toward IVs that SCEVExpander likes to create. This was not done to fix any problem, it's just a welcome side effect of factoring code. llvm-svn: 141633	2011-10-11 02:28:51 +00:00
Andrew Trick	eef7308df6	Add an extra safety check in front of the optimization in r141442. llvm-svn: 141470	2011-10-08 02:16:39 +00:00
Andrew Trick	7fb669ab48	LSR should only reuse phis that match its formula. Fixes rdar://problem/5064068 llvm-svn: 141442	2011-10-07 23:46:21 +00:00
Eli Friedman	1456cd20b4	Remove the old atomic instrinsics. autoupgrade functionality is included with this patch. llvm-svn: 141333	2011-10-06 23:20:49 +00:00
Andrew Trick	3e8a576da1	Fixes PR11070 - assert in SCEV getConstantEvolvingPHIOperands. llvm-svn: 141219	2011-10-05 22:06:53 +00:00
Andrew Trick	ed39bb8efd	Typo. Thanks Bob. llvm-svn: 141188	2011-10-05 16:52:28 +00:00
Chandler Carruth	f6567a131d	Fix a broken assert found by -Wparentheses. llvm-svn: 141168	2011-10-05 07:02:23 +00:00
Andrew Trick	e9162f1ff8	Fix disabled SCEV analysis caused r141161 and add unit test. I noticed during self-review that my previous checkin disabled some analysis. Even with the reenabled analysis the test case runs in about 5ms. Without the fix, it will take several minutes at least. llvm-svn: 141164	2011-10-05 05:58:49 +00:00
Andrew Trick	3a86ba767c	Avoid exponential recursion in SCEV getConstantEvolvingPHI and EvaluateExpression. Note to compiler writers: never recurse on multiple instruction operands without memoization. Fixes rdar://10187945. Was taking 45s, now taking 5ms. llvm-svn: 141161	2011-10-05 03:25:31 +00:00
Nick Lewycky	287682ead1	The product of two chrec's can always be represented as a chrec. llvm-svn: 141066	2011-10-04 06:51:26 +00:00
Nick Lewycky	3155552461	Reapply r140979 with fix! We never did get a testcase, but careful review of the logic by David Meyer revealed this bug. llvm-svn: 140992	2011-10-03 07:10:45 +00:00
Nick Lewycky	b1dbce1406	Revert r140979 due to reports of bootstrap failure. llvm-svn: 140980	2011-10-03 05:14:59 +00:00
Nick Lewycky	3c624b8d0d	Add one more case we compute a max trip count. llvm-svn: 140979	2011-10-03 01:03:57 +00:00
Andrew Trick	f7656015fc	Inlining and unrolling heuristics should be aware of free truncs. We want heuristics to be based on accurate data, but more importantly we don't want llvm to behave randomly. A benign trunc inserted by an upstream pass should not cause a wild swings in optimization level. See PR11034. It's a general problem with threshold-based heuristics, but we can make it less bad. llvm-svn: 140919	2011-10-01 01:39:05 +00:00
Andrew Trick	caa500bf93	whitespace llvm-svn: 140916	2011-10-01 01:27:56 +00:00
Andrew Trick	ef8e4efff8	indvars: generalize SCEV getPreStartForSignExtend. Handle general Add expressions to avoid leaving around redundant 32-bit IVs. llvm-svn: 140701	2011-09-28 17:02:54 +00:00
Eli Friedman	5f476dc3ef	PR10628: Fix getModRefInfo so it queries the underlying alias() implementation correctly while checking nocapture calls. llvm-svn: 140666	2011-09-28 00:34:27 +00:00
Benjamin Kramer	547b6c5ecd	Stop emitting instructions with the name "tmp" they eat up memory and have to be uniqued, without any benefit. If someone prefers %tmp42 to %42, run instnamer. llvm-svn: 140634	2011-09-27 20:39:19 +00:00
Eli Friedman	5c91891cf3	Enhance alias analysis for atomic instructions a bit. Upgrade a couple alias-analysis tests to the new atomic instructions. llvm-svn: 140557	2011-09-26 20:15:28 +00:00
Galina Kistanova	ef65f002df	Fix for DbgInfoPrinter.cpp:174:12: warning: ‘LineNo’ may be used uninitialized in this function. llvm-svn: 140281	2011-09-21 23:34:23 +00:00
Devang Patel	04d6d47865	Add support to emit debug info for C++0x nullptr type. llvm-svn: 139751	2011-09-14 23:13:28 +00:00
Eric Christopher	777c928369	Fix typo. llvm-svn: 139530	2011-09-12 19:58:22 +00:00
Devang Patel	1ad1abe165	Add asserts to keep front-ends honest while encoding debug info into LLVM IR using DIBuilder. llvm-svn: 139515	2011-09-12 18:26:08 +00:00
Andrew Trick	a51d74fc35	Set NSW/NUW flags on SCEVAddExpr when the operation is flagged as such. I'm doing this now for completeness because I can't think of/remember any reason that it was left out. I'm not sure it will help anything, but if we don't do it we need to explain why in comments. llvm-svn: 139450	2011-09-10 01:09:50 +00:00
Eli Friedman	b78ac543c7	A couple minor corrections to r139276. llvm-svn: 139277	2011-09-08 02:37:07 +00:00
Eli Friedman	3d1b307672	Fix the logic in BasicAliasAnalysis::aliasGEP for comparing GEP's with variable differences so that it actually does something sane. Fixes PR10881. llvm-svn: 139276	2011-09-08 02:23:31 +00:00
Owen Anderson	f4f09f8c26	memset_pattern16 uses a 16 BYTE pattern, not a 16 BIT pattern. Add comments to that effect. llvm-svn: 139205	2011-09-06 23:43:26 +00:00
Owen Anderson	653cb03191	Teach BasicAA about the aliasing properties of memset_pattern16. Fixes PR10872 and <rdar://problem/10065079>. llvm-svn: 139204	2011-09-06 23:33:25 +00:00
Nick Lewycky	e0aa54bb98	This transform only handles two-operand AddRec's. Prevent it from trying to handle anything more complex. Fixes PR10383 again! llvm-svn: 139186	2011-09-06 21:42:18 +00:00
Devang Patel	5ea5d7965b	Now, named mdnode llvm.dbg.cu keeps track of all compile units in a module. Update DebugInfoFinder to collect compile units from llvm.dbg.cu. llvm-svn: 139147	2011-09-06 17:40:08 +00:00
Nick Lewycky	78664db054	Fix typo in comment again. llvm-svn: 139139	2011-09-06 07:02:40 +00:00
Nick Lewycky	237878b7ac	Apparently we compile the code, not the comments. Thanks Eli! llvm-svn: 139138	2011-09-06 06:56:00 +00:00
Nick Lewycky	0af94cc50b	Fix typo in comment. llvm-svn: 139137	2011-09-06 06:46:01 +00:00
Nick Lewycky	702cf1eccc	Nope! I had it right the first time. Revert the operative part of r139135 and add more showing of my work. llvm-svn: 139136	2011-09-06 06:39:54 +00:00
Nick Lewycky	6f86e001d6	Fix flipped sign. While there, show my math. llvm-svn: 139135	2011-09-06 05:33:18 +00:00
Nick Lewycky	db66b82dd5	No no no, fix typo properly! llvm-svn: 139134	2011-09-06 05:08:09 +00:00
Nick Lewycky	658bdb5133	The logic inside getMulExpr to simplify {a,+,b}*{c,+,d} was wrong, which was visible given a=b=c=d=1, on iteration #1 (the second iteration). Replace it with correct math. Fixes PR10383! llvm-svn: 139133	2011-09-06 05:05:14 +00:00
Nick Lewycky	b1438c763a	Revert r139126 due to selfhost failures reported by buildbots. llvm-svn: 139130	2011-09-06 02:43:13 +00:00
Nick Lewycky	c4c43fbb07	Teach SCEV to report a max backedge count in one interesting case in HowFarToZero; the case for a canonical loop. llvm-svn: 139126	2011-09-05 23:25:16 +00:00
Benjamin Kramer	4b79c21ef2	InstSimplify: Don't try to replace an extractvalue/insertvalue pair with the original value if types don't match. Fixes clang selfhost. llvm-svn: 139120	2011-09-05 18:16:19 +00:00
Duncan Sands	fd26a954a8	Add some simple insertvalue simplifications, for the purpose of cleaning up do-nothing exception handling code produced by dragonegg. llvm-svn: 139113	2011-09-05 06:52:48 +00:00
Benjamin Kramer	0ca1ad0783	Use canonical forms for the branch probability zero heutistic. - Drop support for X >u 0, it's equivalent to X != 0 and should be canonicalized into the latter. - Add X < 1 -> unlikely, which is what instcombine canonicalizes X <= 0 into. - Add X > -1 -> likely, which is what instcombine canonicalizes X >= 0 into. llvm-svn: 139110	2011-09-04 23:53:04 +00:00
Andrew Trick	bbb226a827	Comment and clarifying assert. llvm-svn: 139036	2011-09-02 21:20:46 +00:00
Devang Patel	df060bc3c2	After r138010, subroutine type does not have context info. Update type verifier accordingly. This fixes ptype.exp gdb testsuite regressions. llvm-svn: 138869	2011-08-31 18:04:31 +00:00
Nadav Rotem	5fc81ffbac	Fixes following the CR by Chris and Duncan: Optimize chained bitcasts of the form A->B->A. Undo r138722 and change isEliminableCastPair to allow this case. llvm-svn: 138756	2011-08-29 19:58:36 +00:00
Andrew Trick	0896621a50	Reapply r138695. Fix PassManager stack depths. Patch by Xiaoyi Guo! llvm-svn: 138737	2011-08-29 17:07:00 +00:00
Nadav Rotem	52600ee8c3	Bitcasts are transitive. Bitcast-Bitcast-X becomes Bitcast-X. llvm-svn: 138722	2011-08-28 11:51:08 +00:00
Andrew Trick	5c29ebae8e	Reverting r138695 to see if it fixes clang self host. llvm-svn: 138701	2011-08-27 06:10:16 +00:00
Andrew Trick	b0cd1e65de	Fix PassManager stack depths. Patch by Xiaoyi Guo! llvm-svn: 138695	2011-08-27 02:11:03 +00:00
Eric Christopher	3cc90fe5a5	Whitespace and 80-col. llvm-svn: 138654	2011-08-26 21:02:40 +00:00
Andrew Trick	147d9cde78	LoopInfo::updateUnloop fix, and verify Block->Loop maps. Fixes an oversight, and adds verification to catch it in the unloop.ll tests. llvm-svn: 138622	2011-08-26 03:06:34 +00:00
Bill Wendling	86c5cbe613	Skip the landingpad instruction when determining the insertion point. llvm-svn: 138481	2011-08-24 21:06:46 +00:00
Nadav Rotem	365af6f17b	Implement Constant::isAllOnesValue(). Fix ConstantFolding to use the new api. llvm-svn: 138469	2011-08-24 20:18:38 +00:00
Eric Christopher	7bc78f692c	Revert "Address Duncan's CR request:" This reverts commit 20a05be15ea5271ab6185b83200fa88263362400. (svn rev 138340) Conflicts: test/Transforms/InstCombine/bitcast.ll llvm-svn: 138366	2011-08-23 20:11:10 +00:00
Nadav Rotem	c78e6607b5	Address Duncan's CR request: 1. Cleanup the tests in ConstantFolding.cpp 2. Implement isAllOnes for Constant, ConstantFP, ConstantVector llvm-svn: 138340	2011-08-23 17:48:43 +00:00
Nadav Rotem	ad4a70ad3e	Add constant folding support for bitcasts of splat vectors to integers. llvm-svn: 138206	2011-08-20 14:02:29 +00:00
Devang Patel	59e27c5f12	Do not use named md nodes to track variables that are completely optimized. This does not scale while doing LTO with debug info. New approach is to include list of variables in the subprogram info directly. llvm-svn: 138145	2011-08-19 23:28:12 +00:00
Benjamin Kramer	4938edb02c	Make a bunch of symbols private. llvm-svn: 138025	2011-08-19 01:42:18 +00:00
Benjamin Kramer	5a656883b1	C API functions must be able to see their extern "C" definitions, or it will be impossible to call them from C. llvm-svn: 138022	2011-08-19 01:36:54 +00:00
Devang Patel	425b4dcc30	There is no need to add file as context for subroutine type. The subroutine type does not need any context. llvm-svn: 138010	2011-08-18 23:50:57 +00:00
Bill Wendling	a9ee09f4be	Revert r137655. There is some question about whether the 'landingpad' instruction should be marked as potentially reading and/or writing memory. llvm-svn: 137863	2011-08-17 20:36:44 +00:00
Eli Friedman	ad3cfe7933	Revert r137781; I agree with Duncan's comment that the situation in question is clearly impossible given the current structure of the code. llvm-svn: 137853	2011-08-17 19:31:49 +00:00
Eli Friedman	55919a9ed7	Extend the undef ^ undef idiom once more. No testcase: I can't figure out how to actually trigger the codepath in question at the moment, but it might get exposed in the future. llvm-svn: 137781	2011-08-16 22:38:34 +00:00
Devang Patel	eb1bb4e419	Until now all debug info MDNodes referred to a root MDNode, a compile unit. This simplified handling of these needs in dwarf writer. However, one side effect of this is that during link time optimization all these MDNodes are _not_ uniqued. In other words there will be N number of MDNodes describing "int", "char" and all other types, which would suddenly grow when each object file starts using libraries like STL. MDNodes graph structure such that compiler unit keeps track of important MDNodes and update dwarf writer to process mdnodes top-down instead of bottom up. llvm-svn: 137778	2011-08-16 22:09:43 +00:00
Bill Wendling	8ddfc09e7a	Use the getFirstInsertionPt() method instead of getFirstNonPHI + an 'isa<>' check for a LandingPadInst. llvm-svn: 137745	2011-08-16 20:45:24 +00:00
Bill Wendling	be33e8d58d	A few places where we want to skip the landingpad instruction for insertion. llvm-svn: 137712	2011-08-16 04:52:55 +00:00
Devang Patel	2b8acaf4f3	Add a finalize() hook, that'll let DIBuilder construct compile unit lazily. llvm-svn: 137673	2011-08-15 23:00:00 +00:00
Eli Friedman	4419cd2464	Add some comments here because the lack of a check for volatile/atomic here is a bit unusual. llvm-svn: 137662	2011-08-15 21:56:39 +00:00
Bill Wendling	e86965ee19	Duncan pointed out that the LandingPadInst might read memory. (It might also write to memory.) Marking it as such makes some checks for immobility go away. llvm-svn: 137655	2011-08-15 21:14:31 +00:00
Eli Friedman	5494adac67	Misc analysis passes that need to be aware of atomic load/store. llvm-svn: 137650	2011-08-15 20:54:19 +00:00
Eli Friedman	91386c7be4	Atomic load/store support in LICM. llvm-svn: 137648	2011-08-15 20:52:09 +00:00
Bill Wendling	9af5b22b76	The landingpad instruction isn't loop-invariant. llvm-svn: 137628	2011-08-15 18:22:49 +00:00
Devang Patel	dfd6ec3ce1	Refactor. Global variables are part of compile unit so let CompileUnit create new global variable. llvm-svn: 137621	2011-08-15 17:57:41 +00:00
Duncan Sands	a41634e307	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. llvm-svn: 137460	2011-08-12 14:54:45 +00:00
Andrew Trick	2b6860f0a1	Allow loop unrolling to get known trip counts from ScalarEvolution. SCEV unrolling can unroll loops with arbitrary induction variables. It is a prerequisite for -disable-iv-rewrite performance. It is also easily handles loops of arbitrary structure including multiple exits and is generally more robust. This is under a temporary option to avoid affecting default behavior for the next couple of weeks. It is needed so that I can checkin unit tests for updateUnloop. llvm-svn: 137384	2011-08-11 23:36:16 +00:00
Andrew Trick	c12c30a670	Fix for LoopInfo::updateUnloop. Remove subloop blocks from former ancestor loops. I have a unit test that depends on scev-unroll, which unfortunately isn't checked in. But I will check it in when I can. llvm-svn: 137341	2011-08-11 20:27:32 +00:00
Andrew Trick	266ab10012	Cleanup. Another thorough review by Nick! llvm-svn: 137317	2011-08-11 17:54:58 +00:00
Andrew Trick	d3530b9117	Reapplying r136844. An algorithm for incrementally updating LoopInfo within a LoopPassManager. The incremental update should be extremely cheap in most cases and can be used in places where it's not feasible to regenerate the entire loop forest. - "Unloop" is a node in the loop tree whose last backedge has been removed. - Perform reverse dataflow on the block inside Unloop to propagate the nearest loop from the block's successors. - For reducible CFG, each block in unloop is visited exactly once. This is because unloop no longer has a backedge and blocks within subloops don't change parents. - Immediate subloops are summarized by the nearest loop reachable from their exits or exits within nested subloops. - At completion the unloop blocks each have a new parent loop, and each immediate subloop has a new parent. llvm-svn: 137276	2011-08-10 23:22:57 +00:00
Devang Patel	bb23a4a9a5	Distinguish between two copies of one inlined variable. Take 2. llvm-svn: 137253	2011-08-10 21:50:54 +00:00
Andrew Trick	78b40c3f3a	Cleanup. Added LoopBlocksDFS::perform for simple clients. llvm-svn: 137195	2011-08-10 01:59:05 +00:00
Devang Patel	3d6e38942d	Provide method to print variable's extended name which includes inline location. llvm-svn: 137095	2011-08-09 01:03:14 +00:00
Andrew Trick	6d45a01b67	Made SCEV's UDiv expressions more canonical. When dividing a recurrence, the initial values low bits can sometimes be ignored. To take advantage of this, added FoldIVUser to IndVarSimplify to fold an IV operand into a udiv/lshr if the operator doesn't affect the result. -indvars -disable-iv-rewrite now transforms i = phi i4 i1 = i0 + 1 idx = i1 >> (2 or more) i4 = i + 4 into i = phi i4 idx = i0 >> ... i4 = i + 4 llvm-svn: 137013	2011-08-06 07:00:37 +00:00
Chandler Carruth	81b7e11c89	Temporarily revert r135528 which distinguishes between two copies of one inlined variable, based on the discussion in PR10542. This explodes the runtime of several passes down the pipeline due to a large number of "copies" remaining live across a large function. This only shows up with both debug and opt, but when it does it creates a many-minute compile when self-hosting LLVM+Clang. There are several other cases that show these types of regressions. All of this is tracked in PR10542, and progress is being made on fixing the issue. Once its addressed, the re-instated, but until then this restores the performance for self-hosting and other opt+debug builds. Devang, let me know if this causes any trouble, or impedes fixing it in any way, and thanks for working on this! llvm-svn: 136953	2011-08-05 00:51:31 +00:00
Duncan Sands	020c1947b7	Fix what seems an obvious typo. Patch by Ivan Krasin. Problem reported at http://habrahabr.ru/blogs/compilers/125626/. llvm-svn: 136865	2011-08-04 10:02:21 +00:00
Andrew Trick	bc673fb5f2	Reverting r136884 updateUnloop, which crashed a linux builder. llvm-svn: 136857	2011-08-04 01:04:37 +00:00
Andrew Trick	468eadbbb2	An algorithm for incrementally updating LoopInfo within a LoopPassManager. The incremental update should be extremely cheap in most cases and can be used in places where it's not feasible to regenerate the entire loop forest. - "Unloop" is a node in the loop tree whose last backedge has been removed. - Perform reverse dataflow on the block inside Unloop to propagate the nearest loop from the block's successors. - For reducible CFG, each block in unloop is visited exactly once. This is because unloop no longer has a backedge and blocks within subloops don't change parents. - Immediate subloops are summarized by the nearest loop reachable from their exits or exits within nested subloops. - At completion the unloop blocks each have a new parent loop, and each immediate subloop has a new parent. llvm-svn: 136844	2011-08-03 23:50:25 +00:00
Andrew Trick	f898cbde5e	whitespace llvm-svn: 136843	2011-08-03 23:45:50 +00:00
Jakub Staszak	a60d130f26	Add more constantness in BlockFrequencyInfo. llvm-svn: 136816	2011-08-03 21:30:57 +00:00
Bill Wendling	035ea32870	Add this back in for now. There are still a few passes which create unwind instructions at the moment. llvm-svn: 136756	2011-08-03 01:07:57 +00:00
Bill Wendling	ae3380faff	Replace the 'UnwindInst' check with a check for 'ResumeInst', which also exits the function, because the UnwindInst is going away. llvm-svn: 136751	2011-08-03 00:30:19 +00:00
Andrew Trick	77c55428fa	Use consistent terminology for loop exit/exiting blocks. Name change only. llvm-svn: 136677	2011-08-02 04:23:35 +00:00
Jakub Staszak	8b13b59f60	Change SmallVector to SmallPtrSet in BranchProbabilityInfo. Handle cases where one than one successor goes to the same block. llvm-svn: 136638	2011-08-01 19:16:26 +00:00
Jakub Staszak	6651b33671	Do not handle cases with >= and <= predicates. llvm-svn: 136588	2011-07-31 05:54:04 +00:00
Jakub Staszak	e348afb612	Remove untrue comment. llvm-svn: 136587	2011-07-31 04:51:14 +00:00
Jakub Staszak	bfb1ae223b	Do not handle case where LHS is equal to zero, because InstCombiner always moves it to RHS anyway. llvm-svn: 136586	2011-07-31 04:47:20 +00:00
Jakub Staszak	17af66a62f	Add Zero Heurestics to BranchProbabilityInfo. If we compare value to zero we decide whether condition is likely to be true this way: x == 0 -> false x < 0 -> false x <= 0 -> false x != 0 -> true x > 0 -> true x >= 0 -> true llvm-svn: 136583	2011-07-31 03:27:24 +00:00
Jakub Staszak	efd94c8fea	Add more constantness in BranchProbabilityInfo. llvm-svn: 136502	2011-07-29 19:30:00 +00:00
Jakub Staszak	0978426843	Remove incEdgeWeight and decEdgeWeight. Set edge weight directly to avoid rounding errors. llvm-svn: 136456	2011-07-29 02:36:53 +00:00
Chandler Carruth	9d7feab3e0	Rewrite the CMake build to use explicit dependencies between libraries, specified in the same file that the library itself is created. This is more idiomatic for CMake builds, and also allows us to correctly specify dependencies that are missed due to bugs in the GenLibDeps perl script, or change from compiler to compiler. On Linux, this returns CMake to a place where it can relably rebuild several targets of LLVM. I have tried not to change the dependencies from the ones in the current auto-generated file. The only places I've really diverged are in places where I was seeing link failures, and added a dependency. The goal of this patch is not to start changing the dependencies, merely to move them into the correct location, and an explicit form that we can control and change when necessary. This also removes a serialization point in the build because we don't have to scan all the libraries before we begin building various tools. We no longer have a step of the build that regenerates a file inside the source tree. A few other associated cleanups fall out of this. This isn't really finished yet though. After talking to dgregor he urged switching to a single CMake macro to construct libraries with both sources and dependencies in the arguments. Migrating from the two macros to that style will be a follow-up patch. Also, llvm-config is still generated with GenLibDeps.pl, which means it still has slightly buggy dependencies. The internal CMake 'llvm-config-like' macro uses the correct explicitly specified dependencies however. A future patch will switch llvm-config generation (when using CMake) to be based on these deps as well. This may well break Windows. I'm getting a machine set up now to dig into any failures there. If anyone can chime in with problems they see or ideas of how to solve them for Windows, much appreciated. llvm-svn: 136433	2011-07-29 00:14:25 +00:00
Jakub Staszak	eec01ccbf9	Change LBH_TAKEN_WEIGHT to 124 (from 128). Right now, sum of LBH_TAKEN_WEIGHT + LBH_NONTAKEN_WEIGHT = 128 which in _most_ cases reduce number of rounding errors. llvm-svn: 136428	2011-07-28 23:42:08 +00:00
Jakub Staszak	d07b2e159a	Heuristics are in descending priority now. If we use one of them, skip the rest. llvm-svn: 136402	2011-07-28 21:45:07 +00:00
Jakub Staszak	bcb3c65bb4	Add InEdges (edges from header to the loop) in Loop Branch Heuristics, so there is no frequency difference whether condition is in the header or in the latch. llvm-svn: 136398	2011-07-28 21:33:46 +00:00
Jakub Staszak	da3df4302a	Use BlockFrequency instead of uint32_t in BlockFrequencyInfo. llvm-svn: 136278	2011-07-27 22:05:51 +00:00
Jeffrey Yasskin	6381c0100b	Explicitly cast narrowing conversions inside {}s that will become errors in C++0x. llvm-svn: 136211	2011-07-27 06:22:51 +00:00
Eli Friedman	8b5277c6cf	Minor simplification. llvm-svn: 136202	2011-07-27 01:02:25 +00:00
Eli Friedman	ae8161e774	Fix AliasSetTracker so that it doesn't make any assumptions about instructions it doesn't know about (like the atomic instructions I'm adding). llvm-svn: 136198	2011-07-27 00:46:46 +00:00
Andrew Trick	3ca3f98c2c	SCEV: Added a data structure for storing not-taken info per loop exit. Added an interfaces for querying either the loop's exact/max backedge taken count or a specific loop exit's not-taken count. llvm-svn: 136100	2011-07-26 17:19:55 +00:00
Duncan Sands	c1c92719a4	Add helper function for getting true/false constants in a uniform way for i1 and vector of i1 types. Use these to make some code more self-documenting. llvm-svn: 136079	2011-07-26 15:03:53 +00:00
Jakub Staszak	875ebd5f5d	Rename BlockFrequency to BlockFrequencyInfo and MachineBlockFrequency to MachineBlockFrequencyInfo. llvm-svn: 135937	2011-07-25 19:25:40 +00:00
Frits van Bommel	ede0dc6dda	Shorten some expressions by using ArrayRef::slice(). llvm-svn: 135910	2011-07-25 15:13:01 +00:00
Jay Foad	d1b7849d49	Convert GetElementPtrInst to use ArrayRef. llvm-svn: 135904	2011-07-25 09:48:08 +00:00
Jay Foad	040dd82f44	Convert IRBuilder::CreateGEP and IRBuilder::CreateInBoundsGEP to use ArrayRef. llvm-svn: 135761	2011-07-22 08:16:57 +00:00
Jakub Staszak	b82bbf40bb	Allow getBlockFreq to return 0. llvm-svn: 135742	2011-07-22 02:24:57 +00:00
Jay Foad	ed8db7d9df	Convert ConstantExpr::getGetElementPtr and ConstantExpr::getInBoundsGetElementPtr to use ArrayRef. llvm-svn: 135673	2011-07-21 14:31:17 +00:00
Devang Patel	8fb9fd6769	There are two ways to map a variable to its lexical scope. Lexical scope information is embedded in MDNode describing the variable. It is also available as a part of DebugLoc attached with DBG_VALUE instruction. DebugLoc attached with an instruction is less reliable in optimized code so use information embedded in the MDNode. llvm-svn: 135629	2011-07-20 22:18:50 +00:00
Devang Patel	a59b24b090	Distinguish between two copies of one inlined variable. llvm-svn: 135528	2011-07-19 22:31:15 +00:00
Devang Patel	cfa82a378d	Reapply r135457. This needs llvm-gcc change, that I forgot to check-in yesterday. llvm-svn: 135504	2011-07-19 19:41:54 +00:00
Bob Wilson	da30cf84c3	Revert "Make a provision to encode inline location in a variable. This will enable dwarf writer to easily distinguish between two instances of a inlined variable in one basic block." This reverts commit 9fec5e346efdf744b151ae6604f912908315fa7a. llvm-svn: 135486	2011-07-19 16:32:50 +00:00
Jay Foad	b992a635fb	Convert SimplifyGEPInst to use ArrayRef. llvm-svn: 135482	2011-07-19 15:07:52 +00:00
Jay Foad	bf904773bb	Convert TargetData::getIndexedOffset to use ArrayRef. llvm-svn: 135478	2011-07-19 14:01:37 +00:00
Jay Foad	f4b14a2b0d	Use ArrayRef in ConstantFoldInstOperands and ConstantFoldCall. llvm-svn: 135477	2011-07-19 13:32:40 +00:00
Devang Patel	ac532dedf1	Make a provision to encode inline location in a variable. This will enable dwarf writer to easily distinguish between two instances of a inlined variable in one basic block. llvm-svn: 135457	2011-07-19 01:03:32 +00:00
Frits van Bommel	717d7edd3e	Migrate LLVM and Clang to use the new makeArrayRef(...) functions where previously explicit non-default constructors were used. Mostly mechanical with some manual reformatting. llvm-svn: 135390	2011-07-18 12:00:32 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Benjamin Kramer	a7606b993c	Silence compiler warnings. llvm-svn: 135358	2011-07-16 22:26:27 +00:00
Jakub Staszak	623e1971ce	Remove "LoopInfo.h" include from BranchProbabilityInfo.h. llvm-svn: 135353	2011-07-16 20:31:15 +00:00
Andrew Trick	244e2c3e82	Fix SCEVEXpander to handle arbitrary phi expansion. Includes two related bug fixes and corresponding assertions for uninitialized data and missing NULL check. Test cases will be included with the new LFTR. llvm-svn: 135333	2011-07-16 00:59:39 +00:00
Jakub Staszak	abb236fe9b	Fix pointer heuristic. Check whether predicator is ICMP_NE instead of if it is not isEquality(). llvm-svn: 135296	2011-07-15 20:51:06 +00:00
Jay Foad	5bd375a6cc	Convert CallInst and InvokeInst APIs to use ArrayRef. llvm-svn: 135265	2011-07-15 08:37:34 +00:00
Jay Foad	57aa636794	Convert InsertValueInst and ExtractValueInst APIs to use ArrayRef. llvm-svn: 135040	2011-07-13 10:26:04 +00:00
Chris Lattner	13879a7091	stop using WriteTypeSymbolic. llvm-svn: 134833	2011-07-09 18:02:13 +00:00
Devang Patel	c3239d3965	Preserve debug loc. llvm-svn: 134441	2011-07-05 21:48:22 +00:00
Dan Gohman	a293f24a0d	Teach IVUsers to stop at non-affine expressions unless they are both outside the loop and reducible. This more completely hides them from LSR, which isn't usually able to do anything meaningful with non-affine expressions anyway, and this consequently hides them from SCEVExpander, which is acutely unprepared for non-affine expressions. Replace test/CodeGen/X86/lsr-nonaffine.ll with a new test that tests the new behavior. This works around the bug in PR10117 / rdar://problem/9633149, and is generally an improvement besides. llvm-svn: 134268	2011-07-01 22:05:19 +00:00
Dan Gohman	54664ed714	Improve constant folding of undef for cmp and select operators. llvm-svn: 134223	2011-07-01 01:03:43 +00:00
Andrew Trick	154d78a661	Cleanup. Fix a stupid variable name. llvm-svn: 133995	2011-06-28 05:41:52 +00:00
Andrew Trick	411daa5e81	SCEVExpander: give new insts a name that identifies the reponsible pass. llvm-svn: 133992	2011-06-28 05:07:32 +00:00
Andrew Trick	56b315a9cf	indvars --disable-iv-rewrite: sever ties with IVUsers. llvm-svn: 133988	2011-06-28 03:01:46 +00:00
Nick Lewycky	3e334a42d7	Move onlyUsedByLifetimeMarkers to ValueTracking so that it can be used by other passes as well. llvm-svn: 133904	2011-06-27 04:20:45 +00:00
Devang Patel	503c3998f3	Fix struct member's scope. Patch by Xi Wang. llvm-svn: 133828	2011-06-24 22:00:39 +00:00
Jakub Staszak	1aae619933	Calculate backedge probability correctly. llvm-svn: 133776	2011-06-23 23:52:11 +00:00
Jakub Staszak	668c6fae76	Missing files for the BlockFrequency analysis added. llvm-svn: 133767	2011-06-23 21:56:59 +00:00
Jakub Staszak	be52acc98a	Introduce BlockFrequency analysis for BasicBlocks. llvm-svn: 133766	2011-06-23 21:45:20 +00:00
Rafael Espindola	e2456536b5	Revert "revert 133714" This reverts commit e8e00f5efb4a22238f2407bf813de4606f30c5aa. The cmake build on OS X is still broken. llvm-svn: 133718	2011-06-23 14:19:39 +00:00
Dylan Noblesmith	8a4f22d017	revert 133714 It broke the build worse. llvm-svn: 133716	2011-06-23 13:56:01 +00:00
Rafael Espindola	250360d4bd	133713 broke the build, revert it. llvm-svn: 133714	2011-06-23 13:37:38 +00:00
Dylan Noblesmith	3595357772	Support: make floating-exception header private It has only one user. This eliminates the last include of config.h from the public headers -- ideally, config.h shouldn't even be installed by `make install` anymore. llvm-svn: 133713	2011-06-23 12:45:54 +00:00
Devang Patel	ccf8dbf885	New binops need debug loc. llvm-svn: 133642	2011-06-22 20:56:56 +00:00
Andrew Trick	fc4ccb20c6	IVUsers no longer needs to record the phis. llvm-svn: 133518	2011-06-21 15:43:52 +00:00
Chris Lattner	cc19efaa97	Revamp the "ConstantStruct::get" methods. Previously, these were scattered all over the place in different styles and variants. Standardize on two preferred entrypoints: one that takes a StructType and ArrayRef, and one that takes StructType and varargs. In cases where there isn't a struct type convenient, we now add a ConstantStruct::getAnon method (whose name will make more sense after a few more patches land). It would be "really really nice" if the ConstantStruct::get and ConstantVector::get methods didn't make temporary std::vectors. llvm-svn: 133412	2011-06-20 04:01:31 +00:00
Chris Lattner	67733f6557	simplify some code. llvm-svn: 133362	2011-06-18 21:46:23 +00:00
Benjamin Kramer	9319e9c5d8	Simplify code. No functionality change. llvm-svn: 133351	2011-06-18 14:42:42 +00:00
Jakub Staszak	12a43bdde5	Introduce MachineBranchProbabilityInfo class, which has similar API to BranchProbabilityInfo (expect setEdgeWeight which is not available here). Branch Weights are kept in MachineBasicBlocks. To turn off this analysis set -use-mbpi=false. llvm-svn: 133184	2011-06-16 20:22:37 +00:00
Eli Friedman	8b098b0d57	Add a limit to the number of instructions memdep will scan in a single block. This prevents (at least in some cases) O(N^2) runtime in passes like DSE. The limit in this patch is probably too high, but it is enough to stop DSE from going completely insane on a testcase I have (which has a single block with around 50,000 non-aliasing stores in it). rdar://9471075 llvm-svn: 133111	2011-06-15 23:59:25 +00:00
Eli Friedman	7d58bc7bc0	Add "unknown" results for memdep, which mean "I don't know whether a dependence for the given instruction exists in the given block". This cleans up all the existing hacks in memdep which represent this concept by returning clobber with various unrelated instructions. llvm-svn: 133031	2011-06-15 00:47:34 +00:00
Benjamin Kramer	558d09d87e	Move class into an anonymous namespace. llvm-svn: 132925	2011-06-13 18:38:56 +00:00
Andrew Trick	3d4e64b082	Branch profiling: floating-point avoidance. Patch by: Jakub Staszak! Introduces BranchProbability. Changes unsigned to uint32_t all over and uint64_t only when overflow is expected. llvm-svn: 132867	2011-06-11 01:05:22 +00:00
Dan Gohman	cc59548793	Initialize BasicAA's AliasCache to set it to use fewer buckets by default, since it usually has very few elements. This speeds up alias queries in many cases, because AliasCache.clear() doesn't have to visit as many buckets. llvm-svn: 132862	2011-06-10 22:30:30 +00:00
John McCall	729c35b680	Teach the CallGraph to ignore calls to intrinsics. llvm-svn: 132797	2011-06-09 19:46:27 +00:00
Dan Gohman	adf80ae9e4	Reapply r131781, now that the GVN bug with partially-aliasing loads is disabled. llvm-svn: 132632	2011-06-04 06:50:18 +00:00
Dan Gohman	a471751c24	Disable the main feature of 130180, the elimination of loads that are redundant with partially-aliasing loads. When computing what portion of a clobbering load value is needed, it doesn't consider phi-translation which may have occurred between the clobbing load and the redundant load. llvm-svn: 132631	2011-06-04 06:48:50 +00:00
Dan Gohman	87fdceaf73	Revert r131781 again. Apparently there is more going on here. llvm-svn: 132625	2011-06-04 05:11:22 +00:00
Nick Lewycky	75b2053863	Fold assert-only-used variable into the assert. llvm-svn: 132620	2011-06-04 02:07:10 +00:00
Andrew Trick	c73aa1ee81	Missing include of climits in the new BranchProbability pass. llvm-svn: 132616	2011-06-04 01:30:52 +00:00
Andrew Trick	49371f3f33	New BranchProbabilityInfo analysis. Patch by Jakub Staszak! BranchProbabilityInfo provides an interface for IR passes to query the likelihood that control follows a CFG edge. This patch provides an initial implementation of static branch predication that will populate BranchProbabilityInfo for branches with no external profile information using very simple heuristics. It currently isn't hooked up to any external profile data, so static prediction does all the work. llvm-svn: 132613	2011-06-04 01:16:30 +00:00
Dan Gohman	27b82f2f91	Reapply r131781 (revert r131809), now that some BasicAA shortcomings it exposed are fixed. llvm-svn: 132611	2011-06-04 00:46:31 +00:00
Dan Gohman	fb02cec44e	Fix BasicAA's recursion detection so that it doesn't pessimize queries in the case of a DAG, where a query reaches a node visited earlier, but it's not on a cycle. This avoids MayAlias results in cases where BasicAA is expected to return MustAlias or PartialAlias in order to protect TBAA. llvm-svn: 132609	2011-06-04 00:31:50 +00:00
Dan Gohman	4e7e7958d7	When merging MustAlias and PartialAlias, chose PartialAlias instead of conservatively choosing MayAlias. llvm-svn: 132579	2011-06-03 20:17:36 +00:00
Hans Wennborg	060b994a29	Test commit. llvm-svn: 132558	2011-06-03 17:15:37 +00:00
Devang Patel	1d40024322	A typedef's context is not the same as type's context. It is the context of typedef decl itself. Use extra parameter to communicate this to DIBuilder. llvm-svn: 132556	2011-06-03 17:04:51 +00:00
Eli Friedman	b576b1675c	When marking a block as being unanalyzable, use "Clobber" on the terminator instead of the first instruction in the block. This is a bit of a hack; "Clobber" isn't really the right marking in the first place. memdep doesn't really have any way of properly expressing "unanalyzable" at the moment. Using it on the terminator is much less ambiguous than using it on an arbitrary instruction, though. In the given testcase, the "Clobber" was pointing to a load, and GVN was incorrectly assuming that meant that the "Clobber" load overlapped the load being analyzed (when they are actually unrelated). The included testcase tests both this commit and r132434. Part two of rdar://9429882. (r132434 was mislabeled.) llvm-svn: 132442	2011-06-02 00:08:52 +00:00
Eli Friedman	4b6eeb9ca2	In MemoryDependenceAnalysis::getNonLocalPointerDepFromBB, if a given block is is deemed unanalyzable (and we execute one of the "goto PredTranslationFailure" statements), make sure we don't put information about the predecessors of that block into the returned data structures; this can lead to, among other things, extraneous results (which will confuse passes using memdep). Fixes an assert in GVN compiling ruby. Part of rdar://problem/9521954 . Testcase coming up soon. llvm-svn: 132434	2011-06-01 23:16:53 +00:00
Andrew Trick	8ef3ad049d	SCEV: missing null check fix for r132360, dragonegg crash. llvm-svn: 132416	2011-06-01 19:14:56 +00:00
Andrew Trick	812276eed4	scev: Better sign-extend removal. Normalize postincrement recurrences so that their sign extended forms are congruent when no overflow occurs. llvm-svn: 132360	2011-05-31 21:17:47 +00:00
Eli Friedman	7a5fc693f9	llvm.memcpy.* has two distinct associated address spaces; the source address space, and the destination address space. Fix up the interface on MemIntrinsic and MemTransferInst to make this clear, and fix InstructionDereferencesPointer in LazyValueInfo.cpp to use the interface properly. llvm-svn: 132356	2011-05-31 20:40:16 +00:00
Dan Gohman	c6f2ddfc04	Update this comment. llvm-svn: 132202	2011-05-27 18:42:33 +00:00
Chad Rosier	b362884ca9	Renamed llvm.x86.sse42.crc32 intrinsics; crc64 doesn't exist. crc32.[8\|16\|32] have been renamed to .crc32.32.[8\|16\|32] and crc64.[8\|16\|32] have been renamed to .crc32.64.[8\|64]. llvm-svn: 132163	2011-05-26 23:13:19 +00:00
Eli Friedman	bacb17906a	Change condition for determining whether a function is small for inlining metrics so that very long functions with few basic blocks are not re-analyzed. llvm-svn: 131994	2011-05-24 20:22:24 +00:00
Dan Gohman	0573b55c2b	Make DecomposeGEPExpression check SimplifyInstruction only after checking for a GEP, so that it matches what GetUnderlyingObject does. This fixes an obscure bug turned up by bugpoint in the testcase for PR9931. llvm-svn: 131971	2011-05-24 18:24:08 +00:00
Chris Lattner	026f5e61f0	fix a really nasty basicaa mod/ref calculation bug that was causing miscompilation of UnitTests/ObjC/messages-2.m with the recent optimizer improvements. llvm-svn: 131897	2011-05-23 05:15:43 +00:00
Chris Lattner	83791ced7b	Teach valuetracking that byval arguments with a specified alignment are aligned. Teach memcpyopt to not give up all hope when confonted with an underaligned memcpy feeding an overaligned byval. If the source of the memcpy can be determined to be adequeately aligned, or if it can be forced to be, we can eliminate the memcpy. This addresses PR9794. We now compile the example into: define i32 @f(%struct.p* nocapture byval align 8 %q) nounwind ssp { entry: %call = call i32 @g(%struct.p* byval align 8 %q) nounwind ret i32 %call } in both x86-64 and x86-32 mode. We still don't get a tailcall though, because tailcalls apparently can't handle byval. llvm-svn: 131884	2011-05-23 00:03:39 +00:00
Chris Lattner	713d52364f	implement PR9315, constant folding exp2 in terms of pow (since hosts without C99 runtimes don't have exp2). llvm-svn: 131872	2011-05-22 22:22:35 +00:00
Evan Cheng	2a746bfe36	Teach ValueTracking about x86 crc32 intrinsics. llvm-svn: 131861	2011-05-22 18:25:30 +00:00
Duncan Sands	5ec65765e6	Revert commit 131781, to see if it fixes the x86-64 dragonegg buildbot. Original log message: When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131809	2011-05-21 20:54:46 +00:00
Dan Gohman	8b20187c82	When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131781	2011-05-21 01:05:08 +00:00
Andrew Trick	f44aadf0fd	indvars: Prototyping Sign/ZeroExtend elimination without canonical IVs. No functionality enabled by default. Use -disable-iv-rewrite. Extended IVUsers to keep track of the phi that represents the users' IV. Added the WidenIV transform to replace a narrow IV with a wide IV by doing a one-for-one replacement of IV users instead of expanding the SCEV expressions. [sz]exts are removed and truncs are inserted. llvm-svn: 131744	2011-05-20 18:25:42 +00:00
Owen Anderson	97f0cf32ea	@llvm.lifetime.begin acts as a load, not @llvm.lifetime.end. llvm-svn: 131437	2011-05-17 00:05:49 +00:00
Rafael Espindola	71f8b08a80	Extra refactoring noticed by Eli Friedman. llvm-svn: 131405	2011-05-16 15:48:45 +00:00
Julien Lerouge	7e11f9e26d	Fix a source of non determinism in FindUsedTypes, use a SetVector instead of a set. rdar://9423996 llvm-svn: 131283	2011-05-13 05:20:42 +00:00
Dan Gohman	0daf687e1d	Change a few std::maps to DenseMaps. llvm-svn: 131088	2011-05-09 18:44:09 +00:00
Duncan Sands	af32728a57	The comparision "max(x,y)==x" is equivalent to "x>=y". Since the max is often expressed as "x >= y ? x : y", there is a good chance we can extract the existing "x >= y" from it and use that as a replacement for "max(x,y)==x". llvm-svn: 131049	2011-05-07 16:56:49 +00:00
Eli Friedman	8a20e66926	PR9838: Fix transform introduced in r127064 to not trigger when only one side of the icmp is an exact shift. llvm-svn: 130954	2011-05-05 21:59:18 +00:00
Hongbin Zheng	cd5afc5feb	Minor change: Fix the typo in RegionPass.h and RegionPass.cpp. llvm-svn: 130920	2011-05-05 13:59:38 +00:00
Duncan Sands	a228785526	Add variations on: max(x,y) >= min(x,z) folds to true. This isn't that common, but according to my super-optimizer there are only two missed simplifications of -instsimplify kind when compiling bzip2, and this is one of them. It amuses me to have bzip2 be perfectly optimized as far as instsimplify goes! llvm-svn: 130840	2011-05-04 16:05:05 +00:00
Andrew Trick	1abe296cfd	indvars: Added DisableIVRewrite and WidenIVs. This adds functionality to remove size/zero extension during indvars without generating a canonical IV and rewriting all IV users. It's disabled by default so should have no effect on codegen. Work in progress. llvm-svn: 130829	2011-05-04 02:10:13 +00:00
Duncan Sands	0a9c1246d7	Implement some basic simplifications involving min/max, for example max(a,b) >= a -> true. According to my super-optimizer, these are by far the most common simplifications (of the -instsimplify kind) that occur in the testsuite and aren't caught by -std-compile-opts. llvm-svn: 130780	2011-05-03 19:53:10 +00:00
Devang Patel	09fa69e151	Use llvm.dbg.cu named metadata to collect compile units. llvm-svn: 130756	2011-05-03 16:18:28 +00:00
Duncan Sands	f91c5ab341	Fix PR9579: when simplifying a compare to "true" or "false", and it was a vector compare, generate a vector result rather than i1 (and crashing). llvm-svn: 130706	2011-05-02 18:51:41 +00:00
Duncan Sands	a3e3699c88	Move some rem transforms out of instcombine and into instsimplify. This automagically provides a transform noticed by my super-optimizer as occurring quite often: "rem x, (select cond, x, 1)" -> 0. llvm-svn: 130694	2011-05-02 16:27:02 +00:00
Chris Lattner	827a270a2a	teach GVN to widen integer loads when they are overaligned, when doing an wider load would allow elimination of subsequent loads, and when the wider load is still a native integer type. This eliminates a ton of loads on various benchmarks involving struct fields, though it is somewhat hobbled by clang not being very aggressive about field alignment. This is yet another step along the way towards resolving PR6627. llvm-svn: 130390	2011-04-28 07:29:08 +00:00
Dan Gohman	5394c70d1e	Teach BasicAA about arm.neon.vld1 and vst1. llvm-svn: 130327	2011-04-27 20:44:28 +00:00
Dan Gohman	39b3a1ef7f	When analyzing functions known to only access argument pointees, only check arguments with pointer types. Update the documentation of IntrReadArgMem reflect this. While here, add support for TBAA tags on intrinsic calls. llvm-svn: 130317	2011-04-27 18:39:03 +00:00
Andrew Trick	7d1eea86d9	Corrects an old, old typo in a case that doesn't seem to be reached in practice. llvm-svn: 130316	2011-04-27 18:17:36 +00:00
Andrew Trick	01eff820ae	Test case and comment for PR9633. llvm-svn: 130294	2011-04-27 05:42:17 +00:00
Andrew Trick	759ba0802d	Fix for PR9633 [indvars] Assertion `isa<X>(Val) && "cast<Ty>() argument of incompatible type!"' failed. Added a type check in ScalarEvolution::computeSCEVAtScope to handle the case in which operands of an AddRecExpr in the current scope are folded. llvm-svn: 130271	2011-04-27 01:21:25 +00:00
Chris Lattner	7aab2799ae	Enhance memdep to return clobber relation between noalias loads when an earlier load could be widened to encompass a later load. For example, if we see: X = load i8* P, align 4 Y = load i8* (P+3), align 1 and we have a 32-bit native integer type, we can widen the former load to i32 which then makes the second load redundant. GVN can't actually do anything with this load/load relation yet, so this isn't testable, but it is the next step to resolving PR6627, and a fairly general class of "merge neighboring loads" missed optimizations. llvm-svn: 130250	2011-04-26 22:42:01 +00:00
Chris Lattner	32dc9bd1bb	use AA::isMustAlias to simplify some calls. llvm-svn: 130248	2011-04-26 21:53:34 +00:00
Chris Lattner	6b96621a8a	remove support for llvm.invariant.end from memdep. It is a work-in-progress that is not progressing, and it has issues. llvm-svn: 130247	2011-04-26 21:50:51 +00:00
Devang Patel	b5ea255fb4	Fix an off by one error while accessing complex address element of a DIVariable. This worked untill now because stars are aligned (i.e. num of complex address elments are always 0 or 2+ and when it is 2+ at least two elements are access together) llvm-svn: 130225	2011-04-26 18:24:39 +00:00
Chris Lattner	6f83d06ffa	Enhance MemDep: When alias analysis returns a partial alias result, return it as a clobber. This allows GVN to do smart things. Enhance GVN to be smart about the case when a small load is clobbered by a larger overlapping load. In this case, forward the value. This allows us to compile stuff like this: int test(void P) { int tmp = (unsigned int)P; return tmp+((unsigned char*)P+1); } into: _test: ## @test movl (%rdi), %ecx movzbl %ch, %eax addl %ecx, %eax ret which has one load. We already handled the case where the smaller load was from a must-aliased base pointer. llvm-svn: 130180	2011-04-26 01:21:15 +00:00
Dan Gohman	6acd95b3c1	Fix an iterator invalidation bug. llvm-svn: 130166	2011-04-25 22:48:29 +00:00
Jay Foad	dbf81d8ddf	PR9214: Convert the DIBuilder API to use ArrayRef. llvm-svn: 130086	2011-04-24 10:11:03 +00:00
Jay Foad	1a180156b6	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Devang Patel	1d6bbd41aa	Let front-end tie subprogram declaration with subprogram definition directly. llvm-svn: 130028	2011-04-22 23:10:17 +00:00
Jay Foad	5514afe6b2	PR9214: Convert Metadata API to use ArrayRef. llvm-svn: 129932	2011-04-21 19:59:31 +00:00
Devang Patel	0c7732499b	Use ArrayRef variants. llvm-svn: 129735	2011-04-18 23:51:03 +00:00
Chandler Carruth	2b1ba48f8d	Mark some functions as used which are used within debug-only code. This silences Clang's -Wunused-function when building in release mode. llvm-svn: 129709	2011-04-18 18:49:44 +00:00
Devang Patel	514b4006c2	Introduce support to encode Objective-C property information in debugging information generated for an interface. llvm-svn: 129624	2011-04-16 00:11:51 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Jay Foad	0091fe8ca1	PR9214: Convert ConstantExpr::getIndices() to return an ArrayRef, plus related tweaks to ExprMapKeyType. llvm-svn: 129443	2011-04-13 15:22:40 +00:00
Jay Foad	7c14a558fe	Don't include Operator.h from InstrTypes.h. llvm-svn: 129271	2011-04-11 09:35:34 +00:00
Eli Friedman	17822fcde9	PR9604; try to deal with RAUW updates correctly in the AST. I'm not convinced it's completely safe to cache the AST across LICM runs even with this fix, but this fix can't hurt. llvm-svn: 129198	2011-04-09 06:55:46 +00:00
Devang Patel	9f738849ab	Add support to encode function's template parameters. llvm-svn: 128947	2011-04-05 22:52:06 +00:00
Chris Lattner	57ee5a5db7	remove postdom frontiers, because it is dead. Forward dom frontiers are still used by RegionInfo :( llvm-svn: 128943	2011-04-05 21:57:17 +00:00
Tobias Grosser	8b304ff9ac	Region: Allow user control the printing style of the print function. Contributed by: etherzhhb@gmail.com llvm-svn: 128808	2011-04-04 07:19:18 +00:00
Eli Friedman	8baa2c7ad9	Don't assume something which might be a constant expression is an instruction. Based on PR9429, but no testcase because I can't figure out how to trigger it anymore given other changes to the relevant code. llvm-svn: 128781	2011-04-02 22:11:56 +00:00
Jay Foad	52131344a2	Remove PHINode::reserveOperandSpace(). Instead, add a parameter to PHINode::Create() giving the (known or expected) number of operands. llvm-svn: 128537	2011-03-30 11:28:46 +00:00
Jay Foad	e0938d8a87	(Almost) always call reserveOperandSpace() on newly created PHINodes. llvm-svn: 128535	2011-03-30 11:19:20 +00:00
Frits van Bommel	0bb2ad2cf7	Constant folding support for calls to umul.with.overflow(), basically identical to the smul.with.overflow() code. llvm-svn: 128379	2011-03-27 14:26:13 +00:00
Anders Carlsson	c4f0ab397c	Revert r128140 for now. llvm-svn: 128149	2011-03-23 15:51:12 +00:00
Anders Carlsson	9ed8d93f55	A global variable with internal linkage where all uses are in one function and whose address is never taken is a non-escaping local object and can't alias anything else. llvm-svn: 128140	2011-03-23 02:19:48 +00:00
Nick Lewycky	f0469af63e	Fix INT_MIN gotcha pointed out by Eli Friedman. llvm-svn: 128028	2011-03-21 21:40:32 +00:00
Andrew Trick	1c4b42d00f	Avoid creating canonical induction variables for non-native types. For example, on 32-bit architecture, don't promote all uses of the IV to 64-bits just because one use is a 64-bit cast. Alternate implementation of the patch by Arnaud de Grandmaison. llvm-svn: 127884	2011-03-18 16:50:32 +00:00
Andrew Trick	87716c93c2	Added isValidRewrite() to check the result of ScalarEvolutionExpander. SCEV may generate expressions composed of multiple pointers, which can lead to invalid GEP expansion. Until we can teach SCEV to follow strict pointer rules, make sure no bad GEPs creep into IR. Fixes rdar://problem/9038671. llvm-svn: 127839	2011-03-17 23:51:11 +00:00
Nick Lewycky	b4d763b37d	Add comments for the demanglings. Correct mangled form of operator delete! llvm-svn: 127801	2011-03-17 05:20:12 +00:00
Nick Lewycky	c1f8658368	Add C++ global operator {new,new[],delete,delete[]}(unsigned {int,long}) to the memory builtins as equivalent to malloc/free. This is different from any attribute we have. For example, you can delete the allocators when their result is unused, but you can't collapse two calls to the same function, even if no global/memory state has changed in between. The noalias return states that the result does not alias any other pointer, but instcombine optimizes malloc() as though the result is non-null for the purpose of eliminating unused pointers. llvm-svn: 127673	2011-03-15 07:31:32 +00:00
Andrew Trick	a34f1b1f10	Remove getMinusSCEVForExitTest(). This function performed acrobatics to prove no-self-wrap, which we now have for free. llvm-svn: 127643	2011-03-15 01:16:14 +00:00
Andrew Trick	f6b01ff422	Propagate SCEV no-wrap flags whenever possible. This needs review. llvm-svn: 127638	2011-03-15 00:37:00 +00:00
Andrew Trick	e92dcceab7	Negating a recurrence preserves no-self-wrap. llvm-svn: 127593	2011-03-14 17:38:54 +00:00
Andrew Trick	f1781db622	HowFarToZero can compute a trip count as long as the recurrence has no-self-wrap. llvm-svn: 127591	2011-03-14 17:28:02 +00:00
Andrew Trick	8b55b736b1	Added SCEV::NoWrapFlags to manage unsigned, signed, and self wrap properties. Added the self-wrap flag for SCEV::AddRecExpr. A slew of temporary FIXMEs indicate the intention of the no-self-wrap flag without changing behavior in this revision. llvm-svn: 127590	2011-03-14 16:50:06 +00:00
Benjamin Kramer	5acc751b6f	Teach ComputeMaskedBits about sub nsw. llvm-svn: 127548	2011-03-12 17:18:11 +00:00
Benjamin Kramer	391a946fa9	ComputeMaskedBits: sub falls through to add, and sub doesn't have the same overflow semantics as add. Should fix the selfhost failures that started with r127463. llvm-svn: 127465	2011-03-11 14:46:49 +00:00
Nick Lewycky	cc79973856	Teach ComputeMaskedBits about nsw on add. I don't think there's anything we can do with nuw here, but sub and mul should be given similar treatment. Fixes PR9343 #15! llvm-svn: 127463	2011-03-11 09:00:19 +00:00
Devang Patel	fa31d38aad	Introduce DebugInfoProbe. This is used to monitor how llvm optimizer is treating debugging information. It generates output that lools like 8 times line number info lost by Scalar Replacement of Aggregates (SSAUp) 1 times line number info lost by Simplify well-known library calls 12 times variable info lost by Jump Threading llvm-svn: 127381	2011-03-10 00:21:25 +00:00
Andrew Trick	2afa325811	When SCEV can determine the loop test is X < X, set ExactBECount=0. When ExactBECount is a constant, use it for MaxBECount. When MaxBECount cannot be computed, replace it with ExactBECount. Fixes PR9424. llvm-svn: 127342	2011-03-09 17:29:58 +00:00
Andrew Trick	2a3b71684a	whitespace llvm-svn: 127340	2011-03-09 17:23:39 +00:00
Nick Lewycky	774647d974	Fix two cases I forgot to update when doing a mental "getSwappedPredicate". Thanks Duncan Sands! llvm-svn: 127323	2011-03-09 08:20:06 +00:00
Nick Lewycky	980104d1d6	Add another micro-optimization. Apologies for the lack of refactoring, but I gave up when I realized I couldn't come up with a good name for what the refactored function would be, to describe what it does. This is PR9343 test12, which is test3 with arguments reordered. Whoops! llvm-svn: 127318	2011-03-09 06:26:03 +00:00
Duncan Sands	7dc3d47c34	Fix PR9331. Simplified version of a patch by Jakub Staszak. llvm-svn: 127243	2011-03-08 12:39:03 +00:00
Nick Lewycky	e467979d0a	Add more analysis of the sign bit of an srem instruction. If the LHS is negative then the result could go either way. If it's provably positive then so is the srem. Fixes PR9343 #7! llvm-svn: 127146	2011-03-07 01:50:10 +00:00
Nick Lewycky	9719a719c7	Thread comparisons over udiv/sdiv/ashr/lshr exact and lshr nuw/nsw whenever possible. This goes into instcombine and instsimplify because instsimplify doesn't need to check hasOneUse since it returns (almost exclusively) constants. This fixes PR9343 #4 #5 and #8! llvm-svn: 127064	2011-03-05 05:19:11 +00:00
Dan Gohman	aa036eedb8	When decling to reuse existing expressions that involve casts, ignore bitcasts, which are really no-ops here. This fixes slowdowns on MultiSource/Applications/aha and others. llvm-svn: 127031	2011-03-04 20:46:46 +00:00
Nick Lewycky	41c529bd09	Revert broken srem logic from r126991. llvm-svn: 127021	2011-03-04 19:26:08 +00:00
Nick Lewycky	8e3a79da9f	Fold "icmp pred (srem X, Y), Y" like we do for urem. Handle signed comparisons in the urem case, though not the other way around. This is enough to get #3 from PR9343! llvm-svn: 126991	2011-03-04 10:06:52 +00:00
Nick Lewycky	3cec6f5563	Teach instruction simplify to use constant ranges to solve problems of the form "icmp pred %X, CI" and a number of examples where "%X = binop %Y, CI2". Some of these cases (div and rem) used to make it through opt -O2, but the others are probably now making code elsewhere redundant (probably instcombine). llvm-svn: 126988	2011-03-04 07:00:57 +00:00
Duncan Sands	bf577d6a86	Remove DIFactory. Patch by Devang. llvm-svn: 126871	2011-03-02 20:30:37 +00:00
Dan Gohman	7290868a1b	Don't re-use existing addrec expansions if they contain casts. This fixes PR9259. llvm-svn: 126812	2011-03-02 01:34:10 +00:00
Devang Patel	40eee1e970	Today, the language front ends produces llvm.dbg.* intrinsics, used to encode arguments' debug info, in order any way, most of the times. However, if a front end mix-n-matches llvm.dbg.declare and llvm.dbg.value intrinsics to encode debug info for arguments then code generator needs a way to find argument order. Use 8 bits from line number field to keep track of argument ordering while encoding debug info for an argument. That leaves 24 bit for line no, DebugLoc also allocates 24 bit for line numbers. If a function has more than 255 arguments then rest of the arguments will be ordered by llvm.dbg.* intrinsics' ordering in IR. llvm-svn: 126793	2011-03-01 22:58:13 +00:00
Nick Lewycky	c9d20067cd	Optimize "icmp pred (urem X, Y), Y" --> true/false depending on pred. There's more work to do here, "icmp ult (urem X, 10), 11" doesn't optimize away yet. Fixes example 3 from PR9343! llvm-svn: 126741	2011-03-01 08:15:50 +00:00
Ted Kremenek	49d15b959e	Unbreak CMake build. llvm-svn: 126717	2011-03-01 00:02:51 +00:00
Dan Gohman	161058838c	Delete the LiveValues pass. I won't get get back to the project it was started for in the foreseeable future. llvm-svn: 126668	2011-02-28 19:37:59 +00:00
Nick Lewycky	afe4a3062d	Fix comment. llvm-svn: 126645	2011-02-28 09:18:11 +00:00
Nick Lewycky	66f4f22f7b	srem doesn't actually have the same resulting sign as its numerator, you could also have a zero when numerator = denominator. Reverts parts of r126635 and r126637. llvm-svn: 126644	2011-02-28 09:17:39 +00:00
Nick Lewycky	c9aab8567b	Teach value tracking to make use of flags in more situations. llvm-svn: 126642	2011-02-28 08:02:21 +00:00
Nick Lewycky	29dbbd12c1	Teach ValueTracking to look at the dividend when determining the sign bit of an srem instruction. llvm-svn: 126637	2011-02-28 06:52:12 +00:00
Tobias Grosser	98eecaf0a9	RegionPrinter: Ignore back edges when layouting the graph llvm-svn: 126564	2011-02-27 04:11:07 +00:00
Devang Patel	9b4127349c	Follow LLVM coding style. clang uses DBuilder, so it requries corresponding change. llvm-svn: 126231	2011-02-22 18:56:12 +00:00
Benjamin Kramer	5b7a4e0195	Move "A \| ~(A & ?) -> -1" from InstCombine to InstructionSimplify. llvm-svn: 126082	2011-02-20 15:20:01 +00:00
Chris Lattner	acf6b0776a	Stores of null pointers should turn into memset, we weren't recognizing them as splat values. llvm-svn: 126041	2011-02-19 19:35:49 +00:00
Oscar Fuentes	5ed962656c	Move library stuff out of the toplevel CMakeLists.txt file. llvm-svn: 125968	2011-02-18 22:06:14 +00:00
Devang Patel	4ab0852080	Move DbgInfoPrinter specific utlities inside DbgInfoPrinter.cpp llvm-svn: 125571	2011-02-15 17:36:11 +00:00
Devang Patel	27924da676	Print function info. Patch by Minjang Kim. llvm-svn: 125567	2011-02-15 17:24:56 +00:00
Chris Lattner	69229316aa	convert ConstantVector::get to use ArrayRef. llvm-svn: 125537	2011-02-15 00:14:00 +00:00
Chris Lattner	34442e6ebf	revert my ConstantVector patch, it seems to have made the llvm-gcc builders unhappy. llvm-svn: 125504	2011-02-14 18:15:46 +00:00
Chris Lattner	d9f5b88548	Switch ConstantVector::get to use ArrayRef instead of a pointer+size idiom. Change various clients to simplify their code. llvm-svn: 125487	2011-02-14 07:55:32 +00:00
Duncan Sands	b86070933f	Remove pointless blank line. llvm-svn: 125463	2011-02-13 18:11:05 +00:00
Duncan Sands	d114ab331c	Teach instsimplify that X+Y>=X+Z is the same as Y>=Z if neither side overflows, plus some variations of this. According to my auto-simplifier this occurs a lot but usually in combination with max/min idioms. Because max/min aren't handled yet this unfortunately doesn't have much effect in the testsuite. llvm-svn: 125462	2011-02-13 17:15:40 +00:00
Chris Lattner	4f23f2be15	teach SCEV that the scale and addition of an inbounds gep don't NSW. This fixes a FIXME in scev-aa.ll (allowing a new no-alias result) and generally makes things more precise. llvm-svn: 125449	2011-02-13 03:14:49 +00:00
Chris Lattner	7936a8a488	Per discussion with Dan G, inbounds geps certainly can have unsigned overflow (e.g. "gep P, -1"), and while they can have signed wrap in theoretical situations, modelling an AddRec as not having signed wrap is going enough for any case we can think of today. In the future if this isn't enough, we can revisit this. Modeling them as having NUW isn't causing any known problems either FWIW. llvm-svn: 125410	2011-02-11 21:43:33 +00:00
Nick Lewycky	ac0b62c277	Tolerate degenerate phi nodes that can occur in the middle of optimization passes. Fixes PR9112. Patch by Jakub Staszak! llvm-svn: 125319	2011-02-10 23:54:10 +00:00
Duncan Sands	8b4e283bfb	Formatting and comment tweaks. llvm-svn: 125200	2011-02-09 17:45:03 +00:00
Chris Lattner	9e4aa0259f	Teach instsimplify some tricks about exact/nuw/nsw shifts. improve interfaces to instsimplify to take this info. llvm-svn: 125196	2011-02-09 17:15:04 +00:00
Chris Lattner	b940091388	Rework InstrTypes.h so to reduce the repetition around the NSW/NUW/Exact versions of creation functions. Eventually, the "insertion point" versions of these should just be removed, we do have IRBuilder afterall. Do a massive rewrite of much of pattern match. It is now shorter and less redundant and has several other widgets I will be using in other patches. Among other changes, m_Div is renamed to m_IDiv (since it only matches integer divides) and m_Shift is gone (it used to match all binops!!) and we now have m_LogicalShift for the one client to use. Enhance IRBuilder to have "isExact" arguments to things like CreateUDiv and reduce redundancy within IRbuilder by having these methods chain to each other more instead of duplicating code. llvm-svn: 125194	2011-02-09 17:00:45 +00:00
Duncan Sands	867cb633b4	Add an m_Div pattern for matching either a udiv or an sdiv and use it to simplify the "(X/Y)*Y->X when the division is exact" transform. llvm-svn: 125004	2011-02-07 09:36:32 +00:00
Chris Lattner	6e57b15228	teach instsimplify to transform (X / Y) * Y to X when the div is an exact udiv. llvm-svn: 124994	2011-02-06 22:05:31 +00:00
Eric Christopher	b54605b8e2	Remove premature optimization that avoided calculating argument weights if we weren't going to inline the function. The rest of the code using this was removed. Fixes PR9154. llvm-svn: 124991	2011-02-06 21:27:46 +00:00
Anders Carlsson	ecf8e159e3	Simplify test, as suggested by Chris. llvm-svn: 124990	2011-02-06 20:22:49 +00:00
Anders Carlsson	d21b06a0db	When loading from a constant, fold inttoptr if the integer type and the resulting pointer type both have the same size. llvm-svn: 124987	2011-02-06 20:11:56 +00:00
Anders Carlsson	36c6d23074	Fix another warning. llvm-svn: 124961	2011-02-05 18:33:43 +00:00
Eric Christopher	ceb4671ddd	Fix cut and paste error spotted by Jakob. llvm-svn: 124930	2011-02-05 02:48:47 +00:00
Eric Christopher	2dfbd7e0c1	Rewrite how the indirect call bonus is handled. This now works by: a) Making it a per call site bonus for functions that we can move from indirect to direct calls. b) Reduces the bonus from 500 to 100 per call site. c) Subtracts the size of the possible newly inlineable call from the bonus to only add a bonus if we can inline a small function to devirtualize it. Also changes the bonus from a positive that's subtracted to a negative that's added. Fixes the remainder of rdar://8546196 by reducing the object file size after inlining by 84%. llvm-svn: 124916	2011-02-05 00:49:15 +00:00
Duncan Sands	06504025d2	Improve threading of comparisons over select instructions (spotted by my auto-simplifier). This has a big impact on Ada code, but not much else. Unfortunately the impact is mostly negative! This is due to PR9004 (aka SCCP failing to resolve conditional branch conditions in the destination blocks of the branch), in which simple correlated expressions are not resolved but complicated ones are, so simplifying has a bad effect! llvm-svn: 124788	2011-02-03 09:37:39 +00:00
Devang Patel	df0dd7dc69	Fix typo in comment. llvm-svn: 124759	2011-02-03 00:13:47 +00:00
Devang Patel	be933b470a	Add support to describe template value parameter in debug info. llvm-svn: 124755	2011-02-02 22:35:53 +00:00
Devang Patel	3a9e65efb6	Add support to describe template parameter type in debug info. llvm-svn: 124752	2011-02-02 21:38:25 +00:00
Duncan Sands	5747abab10	Reenable the transform "(X*Y)/Y->X" when the multiplication is known not to overflow (nsw flag), which was disabled because it breaks 254.gap. I have informed the GAP authors of the mistake in their code, and arranged for the testsuite to use -fwrapv when compiling this benchmark. llvm-svn: 124746	2011-02-02 20:52:00 +00:00
Duncan Sands	a29ea9aa4c	Add a m_Undef pattern for convenience. This is so that code that uses pattern matching can also pattern match undef, creating a more uniform style. llvm-svn: 124657	2011-02-01 09:06:20 +00:00
Duncan Sands	4b397fcdc2	Add a m_SignBit pattern for convenience. llvm-svn: 124656	2011-02-01 08:50:33 +00:00
Duncan Sands	cf0ff030a8	Have m_One also match constant vectors for which every element is 1. llvm-svn: 124655	2011-02-01 08:39:12 +00:00
Eric Christopher	46308e666a	Reapply 124275 since the Dragonegg failure was unreproducible. llvm-svn: 124641	2011-02-01 01:16:32 +00:00
Duncan Sands	2e5a58da8f	Commit 124487 broke 254.gap. See if disabling the part that might be triggered by PR9088 fixes things. llvm-svn: 124561	2011-01-30 18:24:20 +00:00
Duncan Sands	b67edc6a29	Transform (X/Y)*Y into X if the division is exact. Instcombine already knows how to do this and more, but would only do it if X/Y had only one use. Spotted as the most common missed simplification in SPEC by my auto-simplifier, now that it knows about nuw/nsw/exact flags. This removes a bunch of multiplications from 447.dealII and 483.xalancbmk. It also removes a lot from tramp3d-v4, which results in much more inlining. llvm-svn: 124560	2011-01-30 18:03:50 +00:00
Nick Lewycky	b89d9a4412	Fix comment. llvm-svn: 124544	2011-01-29 19:55:23 +00:00
Frits van Bommel	c2549661af	Move InstCombine's knowledge of fdiv to SimplifyInstruction(). llvm-svn: 124534	2011-01-29 15:26:31 +00:00
Duncan Sands	2e9e4f1be3	Fix typo: should have been testing that X was odd, not V. llvm-svn: 124533	2011-01-29 13:27:00 +00:00
Andrew Trick	24f5ff0f23	Implementation of path profiling. Modified patch by Adam Preuss. This builds on the existing framework for block tracing, edge profiling and optimal edge profiling. See -help-hidden for new flags. For documentation, see the technical report "Implementation of Path Profiling..." in llvm.org/pubs. llvm-svn: 124515	2011-01-29 01:09:53 +00:00
Duncan Sands	e4b4d0c16d	This dyn_cast should be a cast. Pointed out by Frits van Bommel. llvm-svn: 124497	2011-01-28 18:53:08 +00:00
Duncan Sands	65995fa2a0	Thread divisions over selects and phis. This doesn't fire much and has basically zero effect on the testsuite (it improves two Ada testcases). llvm-svn: 124496	2011-01-28 18:50:50 +00:00
Duncan Sands	771e82a863	My auto-simplifier noticed that ((X/Y)Y)/Y occurs several times in SPEC benchmarks, and that it can be simplified to X/Y. (In general you can only simplify (ZY)/Y to Z if the multiplication did not overflow; if Z has the form "X/Y" then this is the case). This patch implements that transform and moves some Div logic out of instcombine and into InstructionSimplify. Unfortunately instcombine gets in the way somewhat, since it likes to change (X/Y)Y into X-(X rem Y), so I had to teach instcombine about this too. Finally, thanks to the NSW/NUW flags, sometimes we know directly that "ZY" does not overflow, because the flag says so, so I added that logic too. This eliminates a bunch of divisions and subtractions in 447.dealII, and has good effects on some other benchmarks too. It seems to have quite an effect on tramp3d-v4 but it's hard to say if it's good or bad because inlining decisions changed, resulting in massive changes all over. llvm-svn: 124487	2011-01-28 16:51:11 +00:00
Eric Christopher	cd55a46c31	Temporarily revert 124275 to see if it brings the dragonegg buildbot back. llvm-svn: 124312	2011-01-26 19:40:31 +00:00
Duncan Sands	8a33733228	APInt has a method for determining whether a number is a power of 2 which is more efficient than countPopulation - use it. llvm-svn: 124283	2011-01-26 08:44:16 +00:00
Nick Lewycky	d9e6b4a8ff	Fix memory corruption. If one of the SCEV creation functions calls another but doesn't return immediately after then the insert position in UniqueSCEVs will be out of date. No test because this is a memory corruption issue. Fixes PR9051! llvm-svn: 124282	2011-01-26 08:40:22 +00:00
Eric Christopher	078159e310	Separate out the constant bonus from the size reduction metrics. Rework a few loops accordingly. Should be no functional change. This is a step for more accurate cost/benefit analysis of devirt/inlining bonuses. llvm-svn: 124275	2011-01-26 02:58:39 +00:00
Eric Christopher	58f157a677	Coding style formatting changes. llvm-svn: 124260	2011-01-26 01:09:59 +00:00
Duncan Sands	9e9d5b25e2	In which I discover that zero+zero is zero, d'oh! llvm-svn: 124188	2011-01-25 15:14:15 +00:00
Duncan Sands	fced7620f5	See if this fixes llvm-gcc bootstrap. llvm-svn: 124184	2011-01-25 12:15:09 +00:00
Duncan Sands	d395108394	According to my auto-simplifier the most common missed simplifications in optimized code are: (non-negative number)+(power-of-two) != 0 -> true and (x \| 1) != 0 -> true Instcombine knows about the second one of course, but only does it if X\|1 has only one use. These fire thousands of times in the testsuite. llvm-svn: 124183	2011-01-25 09:38:29 +00:00
Eric Christopher	cd087f2512	Reorganize this so that the early exit and special cases come early rather than interspersed. No functional change. llvm-svn: 124168	2011-01-25 01:34:31 +00:00
Dan Gohman	0f124e1987	Give GetUnderlyingObject a TargetData, to keep it in sync with BasicAA's DecomposeGEPExpression, which recently began using a TargetData. This fixes PR8968, though the testcase is awkward to reduce. Also, update several off GetUnderlyingObject's users which happen to have a TargetData handy to pass it in. llvm-svn: 124134	2011-01-24 18:53:32 +00:00
Chris Lattner	f277b5d434	fix PR8928 by clearing a stale map, patch by Jakub Staszak! llvm-svn: 124132	2011-01-24 18:36:51 +00:00
Dan Gohman	3ac8cd614f	Add a comment. llvm-svn: 124126	2011-01-24 17:54:18 +00:00
Nick Lewycky	d4192f71b5	Simplify some code with no functionality change. Make the test a lot more robust against smarter optimizations, using the power of FileCheck. llvm-svn: 124081	2011-01-23 20:06:05 +00:00
Ted Kremenek	3c4408ceb6	Null initialize a few variables flagged by clang's -Wuninitialized-experimental warning. While these don't look like real bugs, clang's -Wuninitialized-experimental analysis is stricter than GCC's, and these fixes have the benefit of being general nice cleanups. llvm-svn: 124073	2011-01-23 17:05:06 +00:00
Nick Lewycky	bc98f5b78e	Use value ranges to fold ext(trunc) in SCEV when possible. llvm-svn: 124062	2011-01-23 06:20:19 +00:00
Nick Lewycky	b32c8943e6	Have SCEV turn sext(x) into zext(x) when x is s>= 0. This applies many times in "make check" alone. llvm-svn: 124046	2011-01-22 22:06:21 +00:00
Eric Christopher	c70e037b73	Add a FIXME explaining the move to a single indirect call bonus per function that we can change from indirect to direct. llvm-svn: 124045	2011-01-22 21:56:53 +00:00
Eric Christopher	08e8b3b629	Only apply the devirtualization bonus once instead of per-call site in the target function. Fixes part of rdar://8546196 llvm-svn: 124044	2011-01-22 21:17:33 +00:00
Duncan Sands	8fb2c3827c	At -O123 the early-cse pass is run before instcombine has run. According to my auto-simplier the transform most missed by early-cse is (zext X) != 0 -> X != 0. This patch adds this transform and some related logic to InstructionSimplify and removes some of the logic from instcombine (unfortunately not all because there are several situations in which instcombine can improve things by making new instructions, whereas instsimplify is not allowed to do this). At -O2 this often results in more than 15% more simplifications by early-cse, and results in hundreds of lines of bitcode being eliminated from the testsuite. I did see some small negative effects in the testsuite, for example a few additional instructions in three programs. One program, 483.xalancbmk, got an additional 35 instructions, which seems to be due to a function getting an additional instruction and then being inlined all over the place. llvm-svn: 123911	2011-01-20 13:21:55 +00:00
Nick Lewycky	5c901f3489	Similarly, analyze truncate through multiply. llvm-svn: 123842	2011-01-19 18:56:00 +00:00
Nick Lewycky	5143f0f09b	Add a missed SCEV fold that is required to continue analyzing the IR produced by indvars through the scev expander. trunc(add x, y) --> add(trunc x, y). Currently SCEV largely folds the other way which is probably wrong, but preserved to minimize churn. Instcombine doesn't do this fold either, demonstrating a missed optz'n opportunity on code doing add+trunc+add. llvm-svn: 123838	2011-01-19 16:59:46 +00:00
Nick Lewycky	e9ea75e3fc	Add a missing SCEV simplification sext(zext x) --> zext x. llvm-svn: 123832	2011-01-19 15:56:12 +00:00
Dan Gohman	44da55b7be	Teach BasicAA to return PartialAlias in cases where both pointers are pointing to the same object, one pointer is accessing the entire object, and the other is access has a non-zero size. This prevents TBAA from kicking in and saying NoAlias in such cases. llvm-svn: 123775	2011-01-18 21:16:06 +00:00
Duncan Sands	99589d07e9	For completeness, generalize the (X + Y) - Y -> X transform and add X - (X + 1) -> -1. These were not recommended by my auto-simplifier since they don't fire often enough. However they do fire from time to time, for example they remove one subtraction from the final bitcode for 483.xalancbmk. llvm-svn: 123755	2011-01-18 11:50:19 +00:00
Duncan Sands	9b8e2bd8ef	Simplify (X<<1)-X into X. According to my auto-simplier this is the most common missed simplification in fully optimized code. It occurs sporadically in the testsuite, and many times in 403.gcc: the final bitcode has 131 fewer subtractions after this change. The reason that the multiplies are not eliminated is the same reason that instcombine did not catch this: they are used by other instructions (instcombine catches this with a more general transform which in general is only profitable if the operands have only one use). llvm-svn: 123754	2011-01-18 09:24:58 +00:00
Cameron Zwarich	6b0c4c9b6c	Move DominanceFrontier from VMCore to Analysis. llvm-svn: 123747	2011-01-18 06:06:27 +00:00
Chris Lattner	08f43456c9	fix PR8983, a broken assertion. llvm-svn: 123562	2011-01-16 03:43:53 +00:00
Nick Lewycky	367f98f000	Teach LazyValueInfo that allocas aren't NULL. Over all of llvm-test, this saves half a million non-local queries, each of which would otherwise have triggered a linear scan over a basic block. Also fix a fixme for memory intrinsics which dereference pointers. With this, we prove that a pointer is non-null because it was dereferenced by an intrinsic 112 times in llvm-test. llvm-svn: 123533	2011-01-15 09:16:12 +00:00
Duncan Sands	d6f1a9584d	Turn X-(X-Y) into Y. According to my auto-simplifier this is the most common simplification present in fully optimized code (I think instcombine fails to transform some of these when "X-Y" has more than one use). Fires here and there all over the test-suite, for example it eliminates 8 subtractions in the final IR for 445.gobmk, 2 subs in 447.dealII, 2 in paq8p etc. llvm-svn: 123442	2011-01-14 15:26:10 +00:00
Duncan Sands	571fd9a606	Factorize common code out of the InstructionSimplify shift logic. Add in threading of shifts over selects and phis while there. This fires here and there in the testsuite, to not much effect. For example when compiling spirit it fires 5 times, during early-cse, resulting in 6 more cse simplifications, and 3 more terminators being folded by jump threading, but the final bitcode doesn't change in any interesting way: other optimizations would have caught the opportunity anyway, only later. llvm-svn: 123441	2011-01-14 14:44:12 +00:00
Duncan Sands	7f60dc1eb0	Move some shift transforms out of instcombine and into InstructionSimplify. While there, I noticed that the transform "undef >>a X -> undef" was wrong. For example if X is 2 then the top two bits must be equal, so the result can not be anything. I fixed this in the constant folder as well. Also, I made the transform for "X << undef" stronger: it now folds to undef always, even though X might be zero. This is in accordance with the LangRef, but I must admit that it is fairly aggressive. Also, I added "i32 X << 32 -> undef" following the LangRef and the constant folder, likewise fairly aggressive. llvm-svn: 123417	2011-01-14 00:37:45 +00:00
Tobias Grosser	b1d11c19da	Add single entry / single exit accessors. Add methods for accessing the (single) entry / exit edge of a region. If no such edge exists, null is returned. Both accessors return the start block of the corresponding edge. The edge can finally be formed by utilizing Region::getEntry() or Region::getExit(); Contributed by: Andreas Simbuerger <simbuerg@fim.uni-passau.de> llvm-svn: 123410	2011-01-13 23:18:04 +00:00
Duncan Sands	ad000d8f16	Remove some wrong code which fortunately was never executed (as explained in the comment I added): an extern weak global may have a null address. llvm-svn: 123373	2011-01-13 10:43:08 +00:00
Duncan Sands	8d25a7c3a0	The most common simplification missed by instsimplify in unoptimized bitcode is "X != 0 -> X" when X is a boolean. This occurs a lot because of the way llvm-gcc converts gcc's conditional expressions. Add this, and a few other similar transforms for completeness. llvm-svn: 123372	2011-01-13 08:56:29 +00:00
Chris Lattner	d30de95520	some comment improvements. llvm-svn: 123243	2011-01-11 17:11:59 +00:00
Eric Christopher	23bf3bafb7	Temporarily revert 123133, it's causing some regressions and I'm trying to get a testcase. llvm-svn: 123225	2011-01-11 09:02:09 +00:00
Chris Lattner	23109cb319	the GEP faq says that only inbounds geps are guaranteed to not overflow. llvm-svn: 123218	2011-01-11 06:44:41 +00:00
Jakob Stoklund Olesen	087f207009	Revert r123207: "Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare." It didn't. llvm-svn: 123215	2011-01-11 04:05:39 +00:00
Jakob Stoklund Olesen	9b6853efd6	Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare. llvm-svn: 123207	2011-01-11 01:18:03 +00:00
Chandler Carruth	b1e7f557b7	Teach constant folding to perform conversions from constant floating point values to their integer representation through the SSE intrinsic calls. This is the last part of a README.txt entry for which I have real world examples. llvm-svn: 123206	2011-01-11 01:07:24 +00:00
Chandler Carruth	352d9b14b3	Cleanup some of the constant folding code to consistently test intrinsic IDs when available rather than using a mixture of IDs and textual name comparisons. llvm-svn: 123165	2011-01-10 09:02:58 +00:00
Chris Lattner	67f82314af	add a fixme: ir isn't expressive enough. llvm-svn: 123139	2011-01-09 23:02:10 +00:00
Chris Lattner	28f140a33e	Step #4 in improving trip count analysis: HowFarToZero can analyze NUW AddRec's much more aggressively. We now get a trip count for @test2 in nsw.ll llvm-svn: 123138	2011-01-09 22:58:47 +00:00
Chris Lattner	dff679f4b6	rearrange some code, no functionality change. llvm-svn: 123136	2011-01-09 22:39:48 +00:00
Chris Lattner	a44274cb4f	Step #3 to improving trip count analysis: If we fold a + {b,+,stride} into {a+b,+,stride} (because a is LIV), then the resultant AddRec is NUW/NSW if the client says it is. llvm-svn: 123133	2011-01-09 22:31:26 +00:00
Chris Lattner	fc87752d55	Step #2 to improve trip count analysis for loops like this: void f(int* begin, int* end) { std::fill(begin, end, 0); } which turns into a != exit expression where one pointer is strided and (thanks to step #1) known to not overflow, and the other is loop invariant. The observation here is that, though the IV is strided by 4 in this case, that the IV has to become equal to the end value. It cannot "miss" the end value by stepping over it, because if it did, the strided IV expression would eventually wrap around. Handle this by turning A != B into "A-B != 0" where the A-B part is known to be NUW. llvm-svn: 123131	2011-01-09 22:26:35 +00:00
Chris Lattner	10223a3fbf	teach SCEV analysis of PHI nodes that PHI recurences formed with GEP instructions are always NUW, because PHIs cannot wrap the end of the address space. llvm-svn: 123105	2011-01-09 02:28:48 +00:00
Chris Lattner	a337f5ec5c	reduce indentation. Print <nuw> and <nsw> when dumping SCEV AddRec's that have the bit set. llvm-svn: 123104	2011-01-09 02:16:18 +00:00
Chris Lattner	171608e738	use isNullValue() to simplify code, add an assert. llvm-svn: 122977	2011-01-06 22:24:29 +00:00
Chris Lattner	5858e091a6	implement constant folding support for an exotic constant expr: ret i64 ptrtoint (i8* getelementptr ([1000 x i8]* @X, i64 1, i64 sub (i64 0, i64 ptrtoint ([1000 x i8]* @X to i64))) to i64) to "ret i64 1000". This allows us to correctly compute the trip count on a loop in PR8883, which occurs with std::fill on a char array. This allows us to transform it into a memset with a constant size. llvm-svn: 122950	2011-01-06 06:19:46 +00:00
Owen Anderson	6f060afbbd	Reorder, rename, and document some members to make this easier to follow. llvm-svn: 122929	2011-01-05 23:26:22 +00:00
Owen Anderson	e86dacf449	When computing the value on an edge, in certain cases LVI would fail to compute the value range in the predecessor block, leading to an incorrect conclusion for the edge value. Found by inspection. llvm-svn: 122908	2011-01-05 21:37:18 +00:00
Owen Anderson	118ac80c81	Re-convert several of LazyValueInfo's internal maps to Dense{Map\|Set}, and fix the issue in hasBlockValue() that was causing iterator invalidations. Many thanks to Dimitry Andric for tracking down those invalidations! llvm-svn: 122906	2011-01-05 21:15:29 +00:00
Chris Lattner	c86e67e110	fix an off-by-one bug that caused a crash analyzing ashr's with huge shift amounts, PR8896 llvm-svn: 122814	2011-01-04 18:19:15 +00:00
Owen Anderson	d62d37225a	Use the new addEscapingValue callback to update GlobalsModRef when GVN adds PHIs of GEPs. For the moment, have GlobalsModRef handle this conservatively by simply removing the value from its maps. llvm-svn: 122787	2011-01-03 23:51:43 +00:00
Owen Anderson	b6e4ff0d85	Stub out a new updating interface to AliasAnalysis, allowing stateful analyses to be informed when a pointer value has potentially become escaping. Implementations can choose to either fall back to conservative responses for that value, or may recompute their analysis to accomodate the change. llvm-svn: 122777	2011-01-03 21:38:41 +00:00
Chris Lattner	16e42128c2	fix rdar://8813415 - a miscompilation of 164.gzip that loop-idiom exposed. It turns out to be a latent bug in basicaa, scary. llvm-svn: 122772	2011-01-03 21:03:33 +00:00
Nick Lewycky	0f87ca7733	Add spliceFunction to the CallGraph interface. This allows users to efficiently update a callGraph when performing the common operation of splicing the body to a new function and updating all callers (such as via RAUW). No users yet, though this is intended for DeadArgumentElimination as part of PR8887. llvm-svn: 122728	2011-01-03 03:19:35 +00:00
Chris Lattner	bf0aa927cc	split dom frontier handling stuff out to its own DominanceFrontier header, so that Dominators.h is just domtree. Also prune #includes a bit. llvm-svn: 122714	2011-01-02 22:09:33 +00:00
Duncan Sands	772749aea1	Revert commit 122654 at the request of Chris, who reckons that instsimplify is the wrong hammer for this nail, and is probably right. llvm-svn: 122661	2011-01-01 20:08:02 +00:00
Duncan Sands	e3c539581c	Fix a README item by having InstructionSimplify do a mild form of value numbering, in which it considers (for example) "%a = add i32 %x, %y" and "%b = add i32 %x, %y" to be equal because the operands are equal and the result of the instructions only depends on the values of the operands. This has almost no effect (it removes 4 instructions from gcc-as-one-file), and perhaps slows down compilation: I measured a 0.4% slowdown on the large gcc-as-one-file testcase, but it wasn't statistically significant. llvm-svn: 122654	2011-01-01 16:12:09 +00:00
Benjamin Kramer	b6d52b8b64	Cast away "comparison between signed and unsigned integer" warnings. llvm-svn: 122598	2010-12-28 13:52:52 +00:00
Chris Lattner	9cb1035f94	move isBytewiseValue out to ValueTracking.h/cpp llvm-svn: 122565	2010-12-26 20:15:01 +00:00
Jeffrey Yasskin	9b43f33620	Change all self assignments X=X to (void)X, so that we can turn on a new gcc warning that complains on self-assignments and self-initializations. llvm-svn: 122458	2010-12-23 00:58:24 +00:00
Duncan Sands	a45cfbd405	When determining whether the new instruction was already present in the original instruction, half the cases were missed (making it not wrong but suboptimal). Also correct a typo (A <-> B) in the second chunk. llvm-svn: 122414	2010-12-22 17:15:25 +00:00
Duncan Sands	3547d2ebd8	Add some statistics, good for understanding how much more powerful instcombine is compared to instsimplify. llvm-svn: 122397	2010-12-22 09:40:51 +00:00
Duncan Sands	fecc642224	While I don't think any later transforms can fire, it seems cleaner to not assume this (for example in case more transforms get added below it). Suggested by Frits van Bommel. llvm-svn: 122332	2010-12-21 15:03:43 +00:00
Duncan Sands	5def0d6791	Fix inverted condition noticed by Frits van Bommel. llvm-svn: 122331	2010-12-21 14:48:48 +00:00
Duncan Sands	d0eb6d39f8	Pull a few more simplifications out of instcombine (there are still plenty left though!), in particular for multiplication. llvm-svn: 122330	2010-12-21 14:00:22 +00:00
Duncan Sands	ee3ec6eb94	Teach InstructionSimplify about distributive laws. These transforms fire quite often, but don't make much difference in practice presumably because instcombine also knows them and more. llvm-svn: 122328	2010-12-21 13:32:22 +00:00
Duncan Sands	f64e690c4f	Move checking of the recursion limit into the various Thread methods. No functionality change. llvm-svn: 122327	2010-12-21 09:09:15 +00:00
Duncan Sands	6c7a52cf80	Add generic simplification of associative operations, generalizing a couple of existing transforms. This fires surprisingly often, for example when compiling gcc "(X+(-1))+1->X" fires quite a lot as well as various "and" simplifications (usually with a phi node operand). Most of the time this doesn't make a real difference since the same thing would have been done elsewhere anyway, eg: by instcombine, but there are a few places where this results in simplifications that we were not doing before. llvm-svn: 122326	2010-12-21 08:49:00 +00:00
Owen Anderson	c6beda80ff	Speculatively revert the use of DenseMap in LazyValueInfo, which may be causing Linux self-host failures. llvm-svn: 122291	2010-12-20 23:53:19 +00:00
Owen Anderson	9be3ec6264	Attempt to appease the DragonEgg buildbots. llvm-svn: 122288	2010-12-20 23:23:18 +00:00
Owen Anderson	813a2c45a8	Convert one of LVI's primary maps to a DenseMap, now that we know are more assured of iterator stability. llvm-svn: 122273	2010-12-20 21:30:54 +00:00
Owen Anderson	d83f98a51e	More LVI cleanups, including trying to simplify the process of maintaining the OverDefinedCache. llvm-svn: 122256	2010-12-20 19:33:41 +00:00
Owen Anderson	64c2c5798a	Reuse the reference into the LVI cache throughout the solver subsystem. This is much easier to verify as being safe thanks its recent de-recursivization. llvm-svn: 122254	2010-12-20 18:18:16 +00:00
Duncan Sands	ed6d6c33dd	Have SimplifyBinOp dispatch Xor, Add and Sub to the corresponding methods (they had just been forgotten before). Adding Xor causes "main" in the existing testcase 2010-11-01-lshr-mask.ll to be hugely more simplified. llvm-svn: 122245	2010-12-20 14:47:04 +00:00
Nick Lewycky	55a700b0cf	Make LazyValueInfo non-recursive. llvm-svn: 122120	2010-12-18 01:00:40 +00:00
Nate Begeman	7aa18bf46a	Add vector versions of some existing scalar transforms to aid codegen in matching psign & pblend operations to the IR produced by clang/gcc for their C idioms. llvm-svn: 122105	2010-12-17 23:12:19 +00:00
Dan Gohman	91ab4ffd96	Update a comment. llvm-svn: 121946	2010-12-16 02:55:10 +00:00
Dan Gohman	e1a17a3473	Make memcpyopt TBAA-aware. llvm-svn: 121944	2010-12-16 02:51:19 +00:00
Dan Gohman	2c9d342f04	Enable TBAA by default. llvm-svn: 121923	2010-12-15 23:58:44 +00:00
Dan Gohman	05b18f143f	Reapply r121886, and also update DecomposeGEPExpression to keep it in sync. llvm-svn: 121895	2010-12-15 20:49:55 +00:00
Dan Gohman	d02b65982e	Revert r121886. DecomposeGEPExpression needs to be kept in sync. llvm-svn: 121892	2010-12-15 20:39:25 +00:00
Dan Gohman	949ab7889c	Strengthen GetUnderlyingObject using InstructionSimplify. While LLVM's main design is that analysis code shouldn't go out of its way to understand code which hasn't been InstCombined, analysis utility routines like this can find themselves being called in the middle of transform passes when instcombine hasn't had a chance to run. llvm-svn: 121886	2010-12-15 20:10:26 +00:00
Dan Gohman	a4fcd2418d	Move Value::getUnderlyingObject to be a standalone function so that it can live in Analysis instead of VMCore. llvm-svn: 121885	2010-12-15 20:02:24 +00:00
Nick Lewycky	11678bd299	Clean up some of LVI: * mergeIn now uses constant folding for constants that are provably not-equal. * sink some sanity checks from the get() methods into the mark() methods, to ensure that we never have a constant/notconstant ConstantInt * some textual cleanups, whitespace changes, removing "else" after return, that sort of thing. llvm-svn: 121877	2010-12-15 18:57:18 +00:00
Duncan Sands	0a2c416894	Move Sub simplifications and additional Add simplifications out of instcombine and into InstructionSimplify. llvm-svn: 121861	2010-12-15 14:07:39 +00:00
Duncan Sands	019a418808	If we detect that the instruction we are simplifying is unreachable, arrange for it to be replaced by undef rather than not replaced at all, the idea being that this may reduce the amount of work done by whoever called InstructionSimplify. llvm-svn: 121860	2010-12-15 11:02:22 +00:00
Dan Gohman	3cb55a1d23	Update a comment. llvm-svn: 121727	2010-12-13 22:53:18 +00:00
Dan Gohman	c4bf5cac9f	Reapply r121520, PartialAlias implementation for BasicAA, now that memdep is updated to handle it. llvm-svn: 121725	2010-12-13 22:50:24 +00:00
Dan Gohman	ba5d0abe39	Update memdep to handle PartialAlias as MayAlias. llvm-svn: 121723	2010-12-13 22:47:57 +00:00
Tobias Grosser	f3e1ada522	Remove useless dynamic_cast<>(). Thanks Peter for pointing me to something that should have never been committed to the llvm code base. llvm-svn: 121648	2010-12-12 21:58:28 +00:00
Dan Gohman	39de62348f	Revert r121520, which may have introduced miscompilations. llvm-svn: 121573	2010-12-10 21:48:28 +00:00
Dan Gohman	041f74e762	Implement PartialAlias checking in BasicAA. llvm-svn: 121520	2010-12-10 20:47:03 +00:00
Dan Gohman	704e7c2332	Minimally update this code to handle PartialAlias. llvm-svn: 121518	2010-12-10 20:14:49 +00:00
Dan Gohman	201acdb6db	Use PartialAlias to do better noalias lint checking. llvm-svn: 121514	2010-12-10 20:04:06 +00:00
Dan Gohman	4431e31df0	Teach AliasAnalysisCounter about PartialAlias. llvm-svn: 121513	2010-12-10 19:53:05 +00:00
Dan Gohman	105d60a5ef	Teach AliasAnalysisEvaluator about PartialAlias. llvm-svn: 121512	2010-12-10 19:52:40 +00:00
Dan Gohman	fb0a3754f5	Update this code to handle PartialAlias as MayAlias. llvm-svn: 121508	2010-12-10 19:40:47 +00:00
Owen Anderson	c7ed4dc932	Take the first step towards making LVI non-recursive: get rid of the LVIQuery abstraction. llvm-svn: 121357	2010-12-09 06:14:58 +00:00
Devang Patel	8817135cb9	Use type's file info while describing inheritance relationship. llvm-svn: 121289	2010-12-08 21:46:37 +00:00
Devang Patel	b68c6231e9	Add support to create debug info for functions and methods. llvm-svn: 121281	2010-12-08 20:42:44 +00:00
Devang Patel	81c3c87717	Add support to create class type. llvm-svn: 121279	2010-12-08 20:18:20 +00:00
Devang Patel	89ea4f27a8	Add support to create vector, array, enums etc... llvm-svn: 121224	2010-12-08 01:50:15 +00:00
Devang Patel	dd261afdd9	Global variable does not need linkage name. llvm-svn: 121212	2010-12-08 00:06:22 +00:00
Devang Patel	63f83cd861	Add support to create local variable's debug info. llvm-svn: 121211	2010-12-07 23:58:00 +00:00
Devang Patel	746660fc7b	Add support to create variables, structs etc.. using DIBuilder. This is still work in progress. llvm-svn: 121205	2010-12-07 23:25:47 +00:00
Jay Foad	583abbc4df	PR5207: Change APInt methods trunc(), sext(), zext(), sextOrTrunc() and zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method trunc(), to be const and to return a new value instead of modifying the object in place. llvm-svn: 121120	2010-12-07 08:25:19 +00:00
Jakob Stoklund Olesen	8bdfb0c166	Also inore '()' while creating mdnode name from ObjC symbol name. llvm-svn: 120856	2010-12-03 23:40:45 +00:00
Devang Patel	f0227ccf3f	Ignore '+' while creating mdnode name from ObjC symbol name. llvm-svn: 120853	2010-12-03 23:29:30 +00:00
Jay Foad	25a5e4ca1f	PR5207: Rename overloaded APInt methods set(), clear(), flip() to setAllBits(), setBit(unsigned), etc. llvm-svn: 120564	2010-12-01 08:53:58 +00:00
Chris Lattner	e28618de59	move GetPointerBaseWithConstantOffset out of GVN into ValueTracking.h llvm-svn: 120476	2010-11-30 22:25:26 +00:00
Jay Foad	15084f085d	PR5207: Make APInt::set(), APInt::clear() and APInt::flip() return void. llvm-svn: 120413	2010-11-30 09:02:01 +00:00
Chris Lattner	d540a5d842	strength reduce this. llvm-svn: 120381	2010-11-30 01:56:13 +00:00
Chris Lattner	afbc0c2b8c	getLocationForDest should work for memset as well. llvm-svn: 120380	2010-11-30 01:48:20 +00:00
Chris Lattner	90c4947df7	enhance basicaa to return "Mod" for a memcpy call when the queried location doesn't overlap the source, and add a testcase. llvm-svn: 120370	2010-11-30 00:43:16 +00:00
Chris Lattner	9a146372b5	Teach basicaa that memset's modref set is at worst "mod" and never contains "ref". Enhance DSE to use a modref query instead of a store-specific hack to generalize the "ignore may-alias stores" optimization to handle memset and memcpy. llvm-svn: 120368	2010-11-30 00:28:45 +00:00
Frits van Bommel	a98214de10	Teach ConstantFoldInstruction() how to fold insertvalue and extractvalue. llvm-svn: 120316	2010-11-29 20:36:52 +00:00
Michael J. Spencer	447762da85	Merge System into Support. llvm-svn: 120298	2010-11-29 18:16:10 +00:00
Chandler Carruth	abcab28f9b	Add some dead stores to pacify my least favorite GCC warning: may be uninitialized. The warning is terrible, has incorrect source locations, and has a huge false positive rate such as all of these. If anyone has a better solution, please let me know. Alternatively, I'll happily add -Wno-uninitialized to the -Werror build mode. Maybe I can even do it only when building with GCC instead of Clang. llvm-svn: 120281	2010-11-29 01:41:13 +00:00
Duncan Sands	a021988d64	Expand a little on the description of what InstructionSimplify does. llvm-svn: 120016	2010-11-23 10:50:08 +00:00
Duncan Sands	763dec0ab8	Clarify that constant folding of instructions applies when all operands are constant. There was in fact one exception to this (phi nodes) - so remove that exception (InstructionSimplify handles this so there should be no loss). llvm-svn: 120015	2010-11-23 10:16:18 +00:00
Duncan Sands	c133c54426	If a GEP index simply advances by multiples of a type of zero size, then replace the index with zero. llvm-svn: 119974	2010-11-22 16:32:50 +00:00
Duncan Sands	8a0f486e36	Move the "gep undef" -> "undef" transform from instcombine to InstructionSimplify. llvm-svn: 119970	2010-11-22 13:42:49 +00:00
Benjamin Kramer	585dfa2b3d	Initialize MemDep's TD member so buildbots don't trip over an uninitialized pointer (TD is passed to PHITransAddr). I wonder why this didn't explode earlier. llvm-svn: 119944	2010-11-21 15:21:46 +00:00
Duncan Sands	cf4bceba49	Add a rather pointless InstructionSimplify transform, inspired by recent constant folding improvements: if P points to a type of size zero, turn "gep P, N" into "P". More generally, if a gep index type has size zero, instcombine could replace the index with zero, but that is not done here. llvm-svn: 119942	2010-11-21 13:53:09 +00:00
Duncan Sands	1f86be9164	Fix spelling. llvm-svn: 119941	2010-11-21 12:43:13 +00:00
Chris Lattner	6ce038082b	apply Dan's fix for PR8268 which allows constant folding to handle indexes over zero sized elements. This allows us to compile: #include <string> void foo() { std::string s; } into an empty function. llvm-svn: 119933	2010-11-21 08:39:01 +00:00
Chris Lattner	663ba91cc6	add "getLocation" method to AliasAnalysis for getting the source and destination location of a memcpy/memmove. I'm not clear about whether TBAA works on these, so I'm leaving it out for now. Dan, please revisit this when convenient. llvm-svn: 119928	2010-11-21 07:51:27 +00:00
Chris Lattner	e48c31ce33	implement PR8576, deleting dead stores with intervening may-alias stores. llvm-svn: 119927	2010-11-21 07:34:32 +00:00
Benjamin Kramer	ddd1b7b801	Simplify code. No change in functionality. llvm-svn: 119908	2010-11-20 18:43:35 +00:00
Benjamin Kramer	c77ebcc9a5	Silence warning about an uninitialized variable. llvm-svn: 119800	2010-11-19 11:37:26 +00:00
Duncan Sands	b238de0415	Remove threading of Xor over selects and phis, with an explanation of why such threading is pointless. llvm-svn: 119798	2010-11-19 09:20:39 +00:00
Duncan Sands	aef146b890	Factor code for testing whether replacing one value with another preserves LCSSA form out of ScalarEvolution and into the LoopInfo class. Use it to check that SimplifyInstruction simplifications are not breaking LCSSA form. Fixes PR8622. llvm-svn: 119727	2010-11-18 19:59:41 +00:00
Dan Gohman	f1ebfc1544	Strip trailing whitespace. llvm-svn: 119706	2010-11-18 17:06:31 +00:00
Dan Gohman	0ab28b62b1	Use llvm_unreachable for "impossible" situations. llvm-svn: 119705	2010-11-18 17:05:57 +00:00
Dan Gohman	2e1fc849b2	Add support for PHI-translating sext, zext, and trunc instructions, enabling more PRE. PR8586. llvm-svn: 119704	2010-11-18 17:05:13 +00:00
Dan Gohman	8ea83d81e0	Introduce memoization for ScalarEvolution dominates and properlyDominates queries, and SCEVExpander getRelevantLoop queries. llvm-svn: 119595	2010-11-18 00:34:22 +00:00
Dan Gohman	7e6b393e66	Factor out the code for purging a SCEV from all the various memoization maps. Some of these maps may merge in the future, but for now it's convenient to have a utility function for them. llvm-svn: 119587	2010-11-17 23:28:48 +00:00
Dan Gohman	7ee1bbb76c	Merge the implementations of isLoopInvariant and hasComputableLoopEvolution, and memoize the results. This improves compile time in code which highly complex expressions which get queried many times. llvm-svn: 119584	2010-11-17 23:21:44 +00:00
Dan Gohman	534749bf70	Make SCEV::getType() and SCEV::print non-virtual. Move SCEV::hasOperand to ScalarEvolution. Delete SCEV::~SCEV. SCEV is no longer virtual. llvm-svn: 119578	2010-11-17 22:27:42 +00:00
Dan Gohman	20d9ce21ef	Move SCEV::dominates and properlyDominates to ScalarEvolution. llvm-svn: 119570	2010-11-17 21:41:58 +00:00
Dan Gohman	afd6db9932	Move SCEV::isLoopInvariant and hasComputableLoopEvolution to be member functions of ScalarEvolution, in preparation for memoization and other optimizations. llvm-svn: 119562	2010-11-17 21:23:15 +00:00
Duncan Sands	39d77131a1	Before replacing a phi node with a different value, it needs to be checked that this won't break LCSSA form. Change the existing checking method to a more direct one: rather than seeing if all predecessors belong to the loop, check that the replacing value is either not in any loop or is in a loop that contains the phi node. llvm-svn: 119556	2010-11-17 20:49:12 +00:00
Dan Gohman	d3a32ae4c8	Verify SCEVAddRecExpr's invariant in ScalarEvolution::getAddRecExpr instead of in SCEVAddRecExpr's constructor, in preparation for an upcoming change. llvm-svn: 119554	2010-11-17 20:48:38 +00:00
Dan Gohman	ed75631743	Fix ScalarEvolution's range memoization to avoid using a default ctor with ConstantRange. llvm-svn: 119550	2010-11-17 20:23:08 +00:00
Duncan Sands	c89ac07e7a	Move some those Xor simplifications which don't require creating new instructions out of InstCombine and into InstructionSimplify. While there, introduce an m_AllOnes pattern to simplify matching with integers and vectors with all bits equal to one. llvm-svn: 119536	2010-11-17 18:52:15 +00:00
Duncan Sands	ec7a6ecb92	Now that hasConstantValue has been made simpler, it may return the phi node itself if it occurs in an unreachable basic block. Protect against this. Hopefully this will fix some more buildbots. llvm-svn: 119493	2010-11-17 10:23:23 +00:00
Duncan Sands	64e41cf865	Previously SimplifyInstruction could report that an instruction simplified to itself (this can only happen in unreachable blocks). Change it to return null instead. Hopefully this will fix some buildbot failures. llvm-svn: 119490	2010-11-17 08:35:29 +00:00
Duncan Sands	7412f6e53d	Fix a layering violation: hasConstantValue, which is part of the PHINode class, uses DominatorTree which is an analysis. This change moves all of the tricky hasConstantValue logic to SimplifyInstruction, and replaces it with a very simple literal implementation. I already taught users of hasConstantValue that need tricky stuff to use SimplifyInstruction instead. I didn't update InlineFunction because the IR looks like it might be in a funky state at the point it calls hasConstantValue, which makes calling SimplifyInstruction dangerous since it can in theory do a lot of tricky reasoning. This may be a pessimization, for example in the case where all phi node operands are either undef or a fixed constant. llvm-svn: 119459	2010-11-17 04:30:22 +00:00
Duncan Sands	d06f50e2db	Have ScalarEvolution use SimplifyInstruction rather than hasConstantValue. While there, add a note about an inefficiency I noticed. llvm-svn: 119458	2010-11-17 04:18:45 +00:00
Dan Gohman	761065e3b7	Memoize results from ScalarEvolution's getUnsignedRange and getSignedRange. This fixes some extreme compile times on unrolled sha512 code. llvm-svn: 119455	2010-11-17 02:44:44 +00:00
Duncan Sands	5ffc298bc7	In which I discover the existence of loops. Threading an operation over a phi node by applying it to each operand may be wrong if the operation and the phi node are mutually interdependent (the testcase has a simple example of this). So only do this transform if it would be correct to perform the operation in each predecessor of the block containing the phi, i.e. if the other operands all dominate the phi. This should fix the FFMPEG snow.c regression reported by İsmail Dönmez. llvm-svn: 119347	2010-11-16 12:16:38 +00:00
Duncan Sands	f12ba1dfe1	Teach InstructionSimplify the trick of skipping incoming phi values that are equal to the phi itself. llvm-svn: 119161	2010-11-15 17:52:45 +00:00
Duncan Sands	b99f39b9f6	If dom tree information is available, make it possible to pass it to get better phi node simplification. llvm-svn: 119055	2010-11-14 18:36:10 +00:00
Duncan Sands	4581ddc123	Teach InstructionSimplify about phi nodes. I chose to have it simply offload the work to hasConstantValue rather than do something more complicated (such handling mutually recursive phis) because (1) it is not clear it is worth it; and (2) if it is worth it, maybe such logic would be better placed in hasConstantValue. Adjust some GVN tests which are now cleaned up much further (eg: all phi nodes are removed). llvm-svn: 119043	2010-11-14 13:30:18 +00:00
Duncan Sands	1d27f01210	Boost the power of phi node constant folding slightly: if all operands are the phi node itself or undef, then return undef. This logic already existed at a higher level so in practice it shouldn't make the slightest difference. Note that this code could be replaced by a call to PN->hasConstantValue(). However since we bail out the moment we see a non-constant operand, it is more efficient to have a specialized version of that logic. llvm-svn: 119041	2010-11-14 12:53:18 +00:00
Duncan Sands	7e800d6f9c	Strip trailing whitespace. llvm-svn: 119038	2010-11-14 11:23:23 +00:00
Duncan Sands	e5ac78e16e	Fix typo pointed out by Trevor Harmon. llvm-svn: 119001	2010-11-13 12:16:27 +00:00
Dan Gohman	970afd926f	Re-disable TBAA for now; it broke MultiSource/Applications/JM/lencod, at least. llvm-svn: 118890	2010-11-12 11:21:08 +00:00
Dan Gohman	ea18d8ec2d	Enable TBAA. llvm-svn: 118884	2010-11-12 06:20:01 +00:00
Dan Gohman	65316d6749	Add helper functions for computing the Location of load, store, and vaarg instructions. llvm-svn: 118845	2010-11-11 21:50:19 +00:00
Dan Gohman	468638826e	Don't forget the TBAA info, if available. llvm-svn: 118842	2010-11-11 21:27:26 +00:00
Dan Gohman	7dacf8f3f3	Avoid calling alias on non-pointer values. llvm-svn: 118822	2010-11-11 19:23:51 +00:00
Dan Gohman	c87c843db7	It's not necessary to clear out the Size and TBAATag at each of these points. llvm-svn: 118752	2010-11-11 00:42:22 +00:00
Dan Gohman	8bf3d832e5	Set NonLocalDepInfo's Size field to UnknownSize when invalidating it, so that it doesn't appear to be a known size. llvm-svn: 118748	2010-11-11 00:20:27 +00:00
Dan Gohman	6791936848	When clearing a non-local pointer dependency cache entry, clear the reverse map too. This fixes seflhost build errors. llvm-svn: 118729	2010-11-10 22:35:02 +00:00
Devang Patel	364bf04267	Take care of special characters while creating named MDNode name to hold function specific local variable's info. This fixes radar 8653152. I am checking in testcase as a separate check-in. llvm-svn: 118726	2010-11-10 22:19:21 +00:00
Dan Gohman	1d760ce8b3	Factor out the code for computing an AliasAnalysis::Location for a given instruction into a helper function. llvm-svn: 118723	2010-11-10 21:51:35 +00:00
Dan Gohman	2e8ca44b81	Fully invalidate cached results when a prior query's size or type is insufficient for, or incompatible with, the current query. llvm-svn: 118721	2010-11-10 21:45:11 +00:00
Duncan Sands	8f7220e9fd	Reduce the maximum recursion depth, 5 seems pointlessly too much. Probably it should just be 1, but compromise with 3. llvm-svn: 118718	2010-11-10 20:53:24 +00:00
Dan Gohman	0a6021a54d	Enhance GVN to do more precise alias queries for non-local memory references. For example, this allows gvn to eliminate the load in this example: void foo(int n, int* p, int q) { p[0] = 0; p[1] = 1; if (n) { q = p[0]; } } llvm-svn: 118714	2010-11-10 20:37:15 +00:00
Duncan Sands	f3b1bf1606	Teach InstructionSimplify how to look through PHI nodes. Since PHI nodes can be used in loops, this could result in infinite looping if there is no recursion limit, so add such a limit. It is also used for the SelectInst case because in theory there could be an infinite loop there too if the basic block is unreachable. llvm-svn: 118694	2010-11-10 18:23:01 +00:00
Dan Gohman	066c1bb1e9	Add a doesAccessArgPointees helper function, and update code to use it, and to be consistent. llvm-svn: 118692	2010-11-10 18:17:28 +00:00
Duncan Sands	b0579e9d3f	Simplify binary operations where one operand is a select instruction. The simplifications performed here never create new instructions, they only return existing instructions (or a constant), and so are always a win. In theory they should transform (for example) %z = and i32 %x, %y %s = select i1 %cond, i32 %y, i32 %z %r = and i32 %x, %s into %r = and i32 %x, y but in practice they get into a fight with instcombine, and lose. Unfortunately instcombine does a poor job in this case. Nonetheless I'm committing this transform to make it easier to discuss what to do to make peace with instcombine. llvm-svn: 118679	2010-11-10 13:00:08 +00:00
Dan Gohman	2694e14087	Make ModRefBehavior a lattice. Use this to clean up AliasAnalysis chaining and simplify FunctionAttrs' GetModRefBehavior logic. llvm-svn: 118660	2010-11-10 01:02:18 +00:00
Dan Gohman	88ff1ece63	VAArg doesn't capture its operand. llvm-svn: 118623	2010-11-09 20:09:35 +00:00
Dan Gohman	5d06f892ef	Teach AliasAnalysis about AccessesArgumentsReadonly. llvm-svn: 118621	2010-11-09 20:06:55 +00:00
Dan Gohman	0f17507478	Teach LICM and AliasSetTracker about AccessesArgumentsReadonly. llvm-svn: 118618	2010-11-09 19:58:21 +00:00
Duncan Sands	fc5ad3f0f9	Factorize code, no functionality change. llvm-svn: 118516	2010-11-09 17:25:51 +00:00
Dan Gohman	142ff82a18	Re-introduce the MaxLookup limit to BasicAliasAnalysis' pointsToConstantMemory code to guard against possible compile time slowdowns. llvm-svn: 118440	2010-11-08 20:26:19 +00:00
Dan Gohman	601c94b309	Implement getModRefBehavior for TypeBasedAliasAnalysis. llvm-svn: 118416	2010-11-08 17:10:22 +00:00
Dan Gohman	9130bad71f	Extend the AliasAnalysis::pointsToConstantMemory interface to allow it to optionally look for constant or local (alloca) memory. Teach BasicAliasAnalysis::pointsToConstantMemory to look through Select and Phi nodes, and to support looking for local memory. Remove FunctionAttrs' PointsToLocalOrConstantMemory function, now that AliasAnalysis knows all the tricks that it knew. llvm-svn: 118412	2010-11-08 16:45:26 +00:00
Dan Gohman	0b56778d65	Delete getIntrinsicModRefBehavior. Clients can just use the normal getModRefBehavior now, since it now understands intrinsics as well as normal functions. llvm-svn: 118411	2010-11-08 16:11:19 +00:00
Dan Gohman	e461d7d135	Teach BasicAliasAnalysis::getModRefBehavior(const Function *F) to analyze intrinsic functions. llvm-svn: 118409	2010-11-08 16:08:43 +00:00
Duncan Sands	a620bd1fa3	Add simplification of floating point comparisons with the result of a select instruction, the same as already exists for integer comparisons. llvm-svn: 118379	2010-11-07 16:46:25 +00:00
Duncan Sands	f532d31198	Fix a README item: when doing a comparison with the result of a select instruction, see if doing the compare with the true and false values of the select gives the same result. If so, that can be used as the value of the comparison. llvm-svn: 118378	2010-11-07 16:12:23 +00:00
Benjamin Kramer	ed8b7bf9ed	Use arrays instead of constant-sized SmallVectors. llvm-svn: 118257	2010-11-04 18:45:27 +00:00
Devang Patel	57c5a20364	Introduce DIBuilder. It is intended to be a front-end friendly interface to emit debuggging information entries in LLVM IR. To create debugging information for a pointer, using DIBUilder front-end just needs DBuilder.CreatePointerType(Ty, Size); instead of DebugFactory.CreateDerivedType(llvm::dwarf::DW_TAG_pointer_type, TheCU, "", getOrCreateMainFile(), 0, Size, 0, 0, 0, OCTy); llvm-svn: 118248	2010-11-04 15:01:38 +00:00
Devang Patel	415c551459	Fix DIType verifier. The element 3 is DIFile now. llvm-svn: 118054	2010-11-02 20:41:13 +00:00
Dan Gohman	dcb354b234	Make ScalarEvolution::forgetLoop forget all contained loops too, because they may have ValuesAtScopes map entries referencing their outer loops. This fixes a user-after-free reported in PR8471. llvm-svn: 117698	2010-10-29 20:16:10 +00:00
Dan Gohman	15a43965ac	Teach memdep to use pointsToConstantMemory to determine that loads from constant memory don't alias any stores. llvm-svn: 117636	2010-10-29 01:14:04 +00:00
Dan Gohman	c6096263e2	Support TBAA attachments on calls. This is somewhat experimental. llvm-svn: 117317	2010-10-25 21:38:20 +00:00
Dan Gohman	82b2e0da9c	Fix chaining in TBAA's pointsToConstantMemory. llvm-svn: 117314	2010-10-25 21:24:55 +00:00
Dan Gohman	e6715d0755	Only read one bit for testing for a readonly type, leaving the other bits open for future uses. llvm-svn: 117301	2010-10-25 20:22:29 +00:00
Dan Gohman	fd864a1d31	Add a comment. llvm-svn: 117288	2010-10-25 19:47:25 +00:00
Dan Gohman	abaf2d8d3b	Update comments; BasicAA is no longer necessarily the end of the chain. llvm-svn: 117268	2010-10-25 16:29:52 +00:00
Dan Gohman	1033ce669b	Reintroduce these asserts, now that BasicAA is a normal AliasAnalysis pass. llvm-svn: 117266	2010-10-25 16:28:57 +00:00
Benjamin Kramer	9192e7ab12	Make some symbols static, move classes into anonymous namespaces. llvm-svn: 117111	2010-10-22 17:35:07 +00:00
Dan Gohman	8512270dbc	Add some more documentation. llvm-svn: 117070	2010-10-21 21:55:35 +00:00
Dan Gohman	12c9e0cf1c	Explain what "constant" means here. llvm-svn: 117053	2010-10-21 19:45:09 +00:00
Dan Gohman	104f1812ce	Update comments. llvm-svn: 117048	2010-10-21 19:01:22 +00:00
Dan Gohman	1b85604130	Memdep says that an instruction clobbers itself when it means there is no specific clobber instruction. llvm-svn: 116960	2010-10-20 22:37:41 +00:00
Dan Gohman	a2ab75bc8d	Factor out the main aliasing check into a separate function. llvm-svn: 116958	2010-10-20 22:11:14 +00:00
Dan Gohman	2549d0cf64	Fix comments; the type graph is currently a tree, not a DAG. llvm-svn: 116954	2010-10-20 22:02:58 +00:00
Tobias Grosser	23c8341c3d	Add RegionPass support. A RegionPass is executed like a LoopPass but on the regions detected by the RegionInfo pass instead of the loops detected by the LoopInfo pass. llvm-svn: 116905	2010-10-20 01:54:44 +00:00
Douglas Gregor	48b4568718	Fix CMake build llvm-svn: 116903	2010-10-20 01:36:56 +00:00
Dan Gohman	da85ed8541	Move NoAA out of BasicAliasAnalysis.cpp into its own file, now that it doesn't have a special relationship with BasicAliasAnalysis anymore. llvm-svn: 116876	2010-10-19 23:09:08 +00:00
Dan Gohman	f372cf869b	Reapply r116831 and r116839, converting AliasAnalysis to use uint64_t, plus fixes for places I missed before. llvm-svn: 116875	2010-10-19 22:54:46 +00:00
Dan Gohman	b4aa503501	Revert r116831 and r116839, which are breaking selfhost builds. llvm-svn: 116858	2010-10-19 21:06:16 +00:00
Dan Gohman	f4c5fe73be	Change AliasAnalysis and its clients to use uint64_t instead of unsigned for representing object sizes, for consistency with other parts of LLVM. llvm-svn: 116831	2010-10-19 18:00:02 +00:00
Owen Anderson	6c18d1aac0	Get rid of static constructors for pass registration. Instead, every pass exposes an initializeMyPassFunction(), which must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize the pass's dependencies. Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h before parsing commandline arguments. I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass registration/creation, please send the testcase to me directly. llvm-svn: 116820	2010-10-19 17:21:58 +00:00
Dan Gohman	14fe8cf238	Consistently use AliasAnalysis::UnknownSize instead of hardcoding ~0u. llvm-svn: 116815	2010-10-19 17:06:23 +00:00
Dan Gohman	e4a82e2f21	Make the representation of AliasSets explicitly differentiate between "not known yet" and "known no tbaa info" so that it can merge them properly. llvm-svn: 116767	2010-10-18 23:31:47 +00:00
Dan Gohman	408beac597	Don't pass the raw invalid pointer used to represent conflicting TBAA information to AliasAnalysis. llvm-svn: 116751	2010-10-18 21:28:00 +00:00
Dan Gohman	71af9db0e8	Make AliasSetTracker TBAA-aware, enabling TBAA-enabled LICM. llvm-svn: 116743	2010-10-18 20:44:50 +00:00
Dan Gohman	f3702452c8	Fix BasicAA to pass TBAAInfo through to the chained analysis. llvm-svn: 116730	2010-10-18 18:45:11 +00:00
Dan Gohman	33fcde9b9c	Make TypeBasedAliasAnalysis default to doing nothing, with a command-line option to enable it. llvm-svn: 116722	2010-10-18 18:17:47 +00:00
Dan Gohman	f0a3bed6d6	Use chaining in TypeBasedAliasAnalysis::pointsToConstantMemory. llvm-svn: 116721	2010-10-18 18:10:31 +00:00
Dan Gohman	02538ac4d3	Make BasicAliasAnalysis a normal AliasAnalysis implementation which does normal initialization and normal chaining. Change the default AliasAnalysis implementation to NoAlias. Update StandardCompileOpts.h and friends to explicitly request BasicAliasAnalysis. Update tests to explicitly request -basicaa. llvm-svn: 116720	2010-10-18 18:04:47 +00:00
Benjamin Kramer	1dc34b48dd	Eliminate some calls to Value::getNameStr. llvm-svn: 116670	2010-10-16 11:28:23 +00:00
Dan Gohman	31a01ee3cb	Tolerate a null parent pointer. llvm-svn: 116533	2010-10-14 22:55:57 +00:00
Chris Lattner	698661c741	add uadd_ov/usub_ov to apint, consolidate constant folding logic to use the new APInt methods. Among other things this implements rdar://8501501 - llvm.smul.with.overflow.i32 should constant fold which comes from "clang -ftrapv", originally brought to my attention from PR8221. llvm-svn: 116457	2010-10-14 00:05:07 +00:00
Owen Anderson	c266a36625	Analysis groups need to initialize their default implementations. llvm-svn: 116441	2010-10-13 21:49:58 +00:00
Tobias Grosser	4b0986b6c1	Add Region::isTopLevelRegion(). llvm-svn: 116402	2010-10-13 11:02:44 +00:00
Tobias Grosser	4c71c117d1	RegionInfo: Fix trivial error that slipped in last minute. llvm-svn: 116400	2010-10-13 08:00:53 +00:00
Tobias Grosser	fe92a9384e	RegionInfo: Update RegionInfo after a BB was split. llvm-svn: 116398	2010-10-13 05:54:13 +00:00
Tobias Grosser	a8677226ab	RegioInfo: Add getExpandedRegion(). getExpandedRegion() enables us to create non canonical regions. Those regions can be used to define the largerst region, that fullfills a certain property. llvm-svn: 116397	2010-10-13 05:54:11 +00:00
Tobias Grosser	648594c920	RegionInfo: Allow to update exit and entry of a region. llvm-svn: 116396	2010-10-13 05:54:10 +00:00
Tobias Grosser	bf984fd78e	RegionInfo: Enhance addSubregion. llvm-svn: 116395	2010-10-13 05:54:09 +00:00
Tobias Grosser	8352ce5f8d	RegionInfo: Allow to set the parent region of a basic block. llvm-svn: 116394	2010-10-13 05:54:07 +00:00
Tobias Grosser	e910b9d9cd	RegionInfo: Free the RegionNodes in cache. Contributed by: ether llvm-svn: 116380	2010-10-13 00:07:59 +00:00
Owen Anderson	8ac477ffb5	Begin adding static dependence information to passes, which will allow us to perform initialization without static constructors AND without explicit initialization by the client. For the moment, passes are required to initialize both their (potential) dependencies and any passes they preserve. I hope to be able to relax the latter requirement in the future. llvm-svn: 116334	2010-10-12 19:48:12 +00:00
Dan Gohman	a8d3a7f93d	Support AA chaining. llvm-svn: 116264	2010-10-11 23:39:34 +00:00
Kenneth Uildriks	b8d7efe785	Now using a variant of the existing inlining heuristics to decide whether to create a given specialization of a function in PartialSpecialization. If the total performance bonus across all callsites passing the same constant exceeds the specialization cost, we create the specialization. llvm-svn: 116158	2010-10-09 22:06:36 +00:00
Kenneth Uildriks	99463ca8cf	Start separating out code metrics into code size metrics and code performance metrics. Partial Specialization will apply the former to function specializations, and the latter to all callsites that can use a specialization, in order to decide whether to create a specialization llvm-svn: 116057	2010-10-08 13:57:31 +00:00
Owen Anderson	df7a4f2515	Now with fewer extraneous semicolons! llvm-svn: 115996	2010-10-07 22:25:06 +00:00
Owen Anderson	98eb3ec6c5	Add an implementation of the initialization routine for IPA. llvm-svn: 115947	2010-10-07 18:31:27 +00:00
Owen Anderson	6875c2ea26	Add initialization routines for Analysis and IPA. llvm-svn: 115946	2010-10-07 18:31:00 +00:00
Owen Anderson	82d38df40c	Fix a warning when building with clang++. llvm-svn: 115924	2010-10-07 17:04:18 +00:00
Owen Anderson	5e19bfcde3	Move the pass initialization helper functions into the llvm namespace, and add a header declaring them all. This is also where we will declare per-library pass-set initializer functions down the road. llvm-svn: 115900	2010-10-07 04:13:08 +00:00
Owen Anderson	af08ad4350	Appease the clang self-host buildbot by providing a correct instantiation. llvm-svn: 115857	2010-10-06 22:23:20 +00:00
Owen Anderson	ad8134f03b	Hide analysis group registration behind a macro, just like pass registration. llvm-svn: 115835	2010-10-06 21:02:27 +00:00
Devang Patel	9a33ec24eb	Add support for DW_TAG_unspecified_parameters. llvm-svn: 115833	2010-10-06 20:50:40 +00:00
Dan Gohman	8e4c19ac44	Don't add the operand count to SCEV uniquing data; FoldingSetNodeID already knows its own length, so this is redundant. llvm-svn: 115521	2010-10-04 17:24:08 +00:00
Devang Patel	bea08d1c85	Let FE mark a variable as artificial variable. llvm-svn: 115102	2010-09-29 23:07:21 +00:00
Devang Patel	95ae73c394	Generalize DISubprogram element to encode various flags instead of just one boolean for isArtificial. This is a backword compatible change. llvm-svn: 115084	2010-09-29 21:04:46 +00:00
Benjamin Kramer	923a8cf356	Remove PointerTracking from cmakelists … llvm-svn: 115076	2010-09-29 19:39:50 +00:00
Chris Lattner	af995f0ee5	remove PointerTracking from mainline, Edwin is going to move it out to ClamAV for LLVM 2.9 llvm-svn: 115062	2010-09-29 18:43:27 +00:00
Oscar Fuentes	b4b12535e8	Removed a bunch of unnecessary target_link_libraries. llvm-svn: 114999	2010-09-28 22:39:14 +00:00
Devang Patel	7a55481fa4	Provide an interface to let FEs anchor debug info for types. llvm-svn: 114969	2010-09-28 18:08:20 +00:00
Jakob Stoklund Olesen	1083573796	Don't try to constant fold libm functions with non-finite arguments. Usually we wouldn't do this anyway because llvm_fenv_testexcept would return an exception, but we have seen some cases where neither errno nor fenv detect an exception on arm-linux. llvm-svn: 114893	2010-09-27 21:29:20 +00:00
Dan Gohman	2348393cf5	Teach memdep about TBAA tags. llvm-svn: 114588	2010-09-22 21:41:02 +00:00
Benjamin Kramer	4b57204e80	Simplify code. llvm-svn: 114444	2010-09-21 16:41:29 +00:00
Benjamin Kramer	4021d906f1	Make CreateComplexVariable independent of SmallVector. llvm-svn: 114439	2010-09-21 16:00:03 +00:00
Jakob Stoklund Olesen	4a253e5ac8	Don't include <fenv.h> now that we have llvm/System/FEnv.h. llvm-svn: 114219	2010-09-17 21:47:03 +00:00
Dan Gohman	b48f904602	Attempt to support platforms which don't have fenv.h. llvm-svn: 114196	2010-09-17 20:06:27 +00:00
Dan Gohman	18fa17cf3d	Fix the folding of floating-point math library calls, like sin(infinity), so that it detects errors on platforms where libm doesn't set errno. It's still subject to host libm details though. llvm-svn: 114148	2010-09-17 01:38:06 +00:00
Dan Gohman	2fa59799d9	Add an #include of raw_ostream.h. Previously, this only compiled because it was using Twine.h's declaration of operator<<(const Twine &). llvm-svn: 114141	2010-09-17 00:33:43 +00:00
Benjamin Kramer	d61e3833a3	Update CMake build. llvm-svn: 114128	2010-09-16 23:06:18 +00:00
Dan Gohman	f4925061af	Rename a variable to avoid a declaration conflict. llvm-svn: 114126	2010-09-16 22:50:09 +00:00
Dan Gohman	ee74402fe6	Add a pass which prints out all the memdep dependencies. llvm-svn: 114121	2010-09-16 22:08:32 +00:00
Owen Anderson	c33cdcfd80	Revert r114097, adding back in the assertion against replacing an Instruction by itself. Now that CorrelatedValuePropagation is more careful not to call SimplifyInstructionsInBlock() on an unreachable block, the issue has been fixed at a higher level. Add a big warning to SimplifyInstructionsInBlock() to hopefully prevent this in the future. llvm-svn: 114117	2010-09-16 20:51:41 +00:00
Owen Anderson	140296f5c0	It is possible, under specific circumstances involving ptrtoint ConstantExpr's, for LVI to end up trying to merge a Constant into a ConstantRange. Handle this conservatively for now, rather than asserting. The testcase is more complex that I would like, but the manifestation of the problem is sensitive to iteration orders and the state of the LVI cache, and I have not been able to reproduce it with manually constructed or simplified cases. Fixes PR8162. llvm-svn: 114103	2010-09-16 18:28:33 +00:00
Owen Anderson	94532cb297	Fix PR8161, in which an unreachable loop causes recursive instruction simplification to try to replace an instruction with itself. Add a predicate to the simplifier to prevent this case. llvm-svn: 114097	2010-09-16 17:42:36 +00:00
Eli Friedman	ab3a128582	PR7959: Handle negative scales in GEPs correctly in BasicAA for non-64-bit targets. llvm-svn: 114015	2010-09-15 20:08:03 +00:00
Dan Gohman	e0386dbef1	Convert TBAA to use the new TBAATag field of AliasAnalysis::Location. llvm-svn: 113892	2010-09-14 23:28:12 +00:00
Dan Gohman	41f14cf3e9	Remove the experimental AliasAnalysis::getDependency interface, which isn't a good level of abstraction for memdep. Instead, generalize AliasAnalysis::alias and related interfaces with a new Location class for describing a memory location. For now, this is the same Pointer and Size as before, plus an additional field for a TBAA tag. Also, introduce a fixed MD_tbaa metadata tag kind. llvm-svn: 113858	2010-09-14 21:25:10 +00:00
Michael J. Spencer	93c9b2ea93	Revert "CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally." This reverts commit r113632 Conflicts: cmake/modules/AddLLVM.cmake llvm-svn: 113819	2010-09-13 23:59:48 +00:00
Benjamin Kramer	8c35fb0739	Teach InstructionSimplify to fold (A & B) & A -> A & B and (A \| B) \| A -> A \| B. Reassociate does this but it doesn't catch all cases (e.g. if the operands are i1). llvm-svn: 113651	2010-09-10 22:39:55 +00:00
Michael J. Spencer	dc38d36ccb	CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally. llvm-svn: 113632	2010-09-10 21:14:25 +00:00
Owen Anderson	04cf3fd761	What the loop unroller cares about, rather than just not unrolling loops with calls, is not unrolling loops that contain calls that would be better off getting inlined. This mostly comes up when an interleaved devirtualization pass has devirtualized a call which the inliner will inline on a future pass. Thus, rather than blocking all loops containing calls, add a metric for "inline candidate calls" and block loops containing those instead. llvm-svn: 113535	2010-09-09 20:32:23 +00:00
Dan Gohman	1c5be00ec7	Extend the getDependence query with support for PHI translation. llvm-svn: 113521	2010-09-09 18:37:31 +00:00
Owen Anderson	a08318acb2	Refactor code-size reduction estimation methods out of InlineCostAnalyzer and into CodeMetrics. They don't use any InlineCostAnalyzer state, and are useful for other clients who don't necessarily want to use all of InlineCostAnalyzer's logic, some of which is fairly inlining-specific. No intended functionality change. llvm-svn: 113499	2010-09-09 16:56:42 +00:00
Dan Gohman	64d842ec72	Add a new experimental generalized dependence query interface to AliasAnalysis, and some code for implementing the new query on top of existing implementations by making standard alias and getModRefInfo queries. llvm-svn: 113329	2010-09-08 01:32:20 +00:00
Owen Anderson	a74fa15f32	Clean up some of the PassRegistry implementation, and pImpl-ize it to reduce #include clutter and exposing internal details. llvm-svn: 113252	2010-09-07 19:16:25 +00:00
Nick Lewycky	ad48e01eef	Add completely hokey binary-and and binary-or operations to ConstantRange and teach LazyValueInfo to use them. llvm-svn: 113196	2010-09-07 05:39:02 +00:00
Chris Lattner	a58edd1df3	cleanup some of the lifetime/invariant marker stuff, add a big fixme. llvm-svn: 113144	2010-09-06 03:58:04 +00:00
Chris Lattner	e34c835bde	speed up -gvn 3.4% on the testcase in PR7023 llvm-svn: 113135	2010-09-06 01:26:29 +00:00
Chris Lattner	da24b9a49a	pull a simple method out of LICM into a new Loop::hasLoopInvariantOperands method. Remove a useless and confusing Loop::isLoopInvariant(Instruction) method, which didn't do what you thought it did. No functionality change. llvm-svn: 113133	2010-09-06 01:05:37 +00:00
Chris Lattner	72d283c826	fix PR8063, a crash in globalopt in the malloc analysis code. llvm-svn: 113109	2010-09-05 17:20:46 +00:00
Chris Lattner	0963048185	dead method. llvm-svn: 113077	2010-09-04 18:19:16 +00:00
Chris Lattner	65b48b5dfc	zap dead code. llvm-svn: 113073	2010-09-04 18:12:00 +00:00
Dan Gohman	47bec3cb57	Disable the asserts that check that normalization is perfectly invertible. ScalarEvolution's folding routines don't always succeed in canonicalizing equal expressions to a single canonical form, and this can cause these asserts to fail, even though there's no actual correctness problem. This fixes PR8066. llvm-svn: 113021	2010-09-03 22:12:56 +00:00
Owen Anderson	c725462245	Add support for simplifying a load from a computed value to a load from a global when it is provable that they're equivalent. This fixes PR4855. llvm-svn: 112994	2010-09-03 19:08:37 +00:00
Chris Lattner	19199cce55	stop forcing a noop AssemblyAnnotationWriter to silence #uses comments, these don't happen anymore. llvm-svn: 112901	2010-09-02 23:03:10 +00:00
Owen Anderson	2912df072d	Remove incorrect and poorly tested code for trying to reason about values on default edges of switches. Just return the conservatively correct answer. llvm-svn: 112876	2010-09-02 22:16:52 +00:00
Owen Anderson	a8c896b704	Fix a bug in LazyValueInfo that CorrelatedValuePropagation exposed: In the LVI lattice, undef and the full set ConstantRange should not be treated as equivalent. llvm-svn: 112843	2010-09-02 18:23:58 +00:00
Dan Gohman	110ed64fbb	Revert 112442 and 112440 until the compile time problems introduced by 112440 are resolved. llvm-svn: 112692	2010-09-01 01:45:53 +00:00
Dan Gohman	47308d5da3	Reapply r112432, now that the real problem is addressed. llvm-svn: 112667	2010-08-31 22:53:17 +00:00
Dan Gohman	f01a5eed1e	Reapply r112433, now that the real problem is addressed. llvm-svn: 112666	2010-08-31 22:52:12 +00:00
Dan Gohman	aabfc52790	Revert r110916. This patch is buggy because the code inside the inner loop doesn't update all the variables in the outer loop. llvm-svn: 112665	2010-08-31 22:50:31 +00:00
Dan Gohman	90f29bcd90	Revert r112432. It appears to be exposing a problem in the emacs build. llvm-svn: 112638	2010-08-31 20:58:44 +00:00
Dan Gohman	444c24a9f0	Speculatively revert r112433. llvm-svn: 112608	2010-08-31 17:56:47 +00:00
Owen Anderson	9517943d11	It is possible to try to merge a not-constant with a constantrage, when dealing with ptrtoint ConstantExpr's. Unfortunately, the only testcase I have for this is huge and doesn't reduce well because the error is sensitive to iteration-order issues, since the problem only occurs when merging values in a particular order. llvm-svn: 112489	2010-08-30 17:03:45 +00:00
Benjamin Kramer	8548c892a8	Don't print two "0x" prefixes. Use a raw_ostream overload instead of llvm::format. llvm-svn: 112479	2010-08-30 14:46:53 +00:00
Chris Lattner	f58382ed87	two changes: 1) make AliasSet hold the list of call sites with an assertingvh so we get a violent explosion if the pointer dangles. 2) Fix AliasSetTracker::deleteValue to remove call sites with by-pointer comparisons instead of by-alias queries. Using findAliasSetForCallSite can cause alias sets to get merged when they shouldn't, and can also miss alias sets when the call is readonly. #2 fixes PR6889, which only repros with a .c file :( llvm-svn: 112452	2010-08-29 18:42:23 +00:00
Dan Gohman	3a08ed7904	Make IVUsers iterative instead of recursive. This has the side effect of reversing the order of most of IVUser's results. llvm-svn: 112442	2010-08-29 16:40:03 +00:00
Dan Gohman	d1da5cdfee	Restructure the {A,+,B}<L> * {C,+,D}<L> folding so that it folds all applicable addrecs before recursing on getMulExpr, instead of recursing on getMulExpr for each one. llvm-svn: 112433	2010-08-29 15:16:58 +00:00
Dan Gohman	3e6fc18943	Batch up subtracts along with adds, when analyzing long chains of operations. llvm-svn: 112432	2010-08-29 15:10:06 +00:00
Dan Gohman	7712d2900d	Micro-optimize GroupByComplexity. llvm-svn: 112431	2010-08-29 15:07:13 +00:00
Dan Gohman	0f2de01355	Hold AddRec->getLoop() in a variable, to make the Mul code more consistent with the Add code. llvm-svn: 112430	2010-08-29 14:55:19 +00:00
Dan Gohman	028c18158a	Rename a variable, for consistency. llvm-svn: 112429	2010-08-29 14:53:34 +00:00
Dan Gohman	28a84d4ba1	Use iterators instead of indices. llvm-svn: 112428	2010-08-29 14:52:02 +00:00
Chris Lattner	dc8070ed6d	when merging two alias sets, the result set is volatile if either of the sets is volatile. We were dropping the volatile bit of the merged in set, leading (luckily) to assertions in cases like PR7535. I cannot produce a testcase that repros with opt, but this is obviously correct. llvm-svn: 112402	2010-08-29 04:14:47 +00:00
Chris Lattner	eef6b19dcb	more cleanup llvm-svn: 112401	2010-08-29 04:13:43 +00:00
Chris Lattner	afb7074f18	clean this up llvm-svn: 112400	2010-08-29 04:06:55 +00:00
Dan Gohman	fe22f1d3cc	Fix an index calculation thinko. llvm-svn: 112337	2010-08-28 00:39:27 +00:00
Owen Anderson	38f6b7fe3b	Improve the precision of getConstant(). llvm-svn: 112323	2010-08-27 23:29:38 +00:00
Dan Gohman	15871f23e3	When merging adjacent operands, scan ahead and merge all equal adjacent operands at once, instead of just two at a time. llvm-svn: 112299	2010-08-27 21:39:59 +00:00
Dan Gohman	c866bf4fec	Make the {A,+,B}<L> + {C,+,D}<L> --> Other + {A+C,+,B+D}<L> transformation collect all the addrecs with the same loop add combine them at once rather than starting everything over at the first chance. llvm-svn: 112290	2010-08-27 20:45:56 +00:00
Dan Gohman	9bad2fb378	Switch ScalarEvolution's main Value->SCEV map from std::map to DenseMap. llvm-svn: 112281	2010-08-27 18:55:03 +00:00
Owen Anderson	6ebbd92380	Use LVI to eliminate conditional branches where we've tested a related condition previously. Update tests for this change. This fixes PR5652. llvm-svn: 112270	2010-08-27 17:12:29 +00:00
Dan Gohman	2706567c5c	Optimize SCEVComplexityCompare. Use a 3-way return instead of a 2-way return to avoid needing two calls to test for equivalence, and sort addrecs by their degree before examining their operands. llvm-svn: 112267	2010-08-27 15:26:01 +00:00
Owen Anderson	4afea9e3c6	In the default address space, any GEP off of null results in a trap value if you try to load it. Thus, any load in the default address space that completes implies that the base value that it GEP'd from was not null. llvm-svn: 112015	2010-08-25 01:16:47 +00:00
Owen Anderson	a10000006e	NULL loads are only invalid in the default address space. llvm-svn: 111972	2010-08-24 22:00:55 +00:00
Owen Anderson	b695c83de9	Add support for inferring values for the default cases of switches. llvm-svn: 111971	2010-08-24 21:59:42 +00:00
Owen Anderson	da34de1599	Add support for inferring that a load from a pointer implies that it is not null. llvm-svn: 111959	2010-08-24 20:47:29 +00:00
Owen Anderson	c62f704576	Don't assume that all constants with integer types are ConstantInts. llvm-svn: 111906	2010-08-24 07:55:44 +00:00
Devang Patel	dd719f701d	Let FE use derived types for DW_TAG_friend. Patch by Alexander Herz! llvm-svn: 111861	2010-08-23 23:16:25 +00:00
Devang Patel	a8652674e0	Handle qualified constants that are directly folded by FE. PR 7920. llvm-svn: 111820	2010-08-23 18:25:56 +00:00
Owen Anderson	d31d82d75c	Now that PassInfo and Pass::ID have been separated, move the rest of the passes over to the new registration API. llvm-svn: 111815	2010-08-23 17:52:01 +00:00
Dan Gohman	5fc55dc3cf	CreateTemporaryType doesn't needs its Context argument. llvm-svn: 111687	2010-08-20 22:39:47 +00:00
Dan Gohman	16a5d98c3a	Introduce a new temporary MDNode concept. Temporary MDNodes are not part of the IR, are not uniqued, and may be safely RAUW'd. This replaces a variety of alternate mechanisms for achieving the same effect. llvm-svn: 111681	2010-08-20 22:02:26 +00:00
Dan Gohman	a931605647	Convert DbgInfoPrinter to use errs() instead of outs(). llvm-svn: 111659	2010-08-20 18:03:05 +00:00
Dan Gohman	f71c521fb7	Revert r111199; it breaks -debug-pass=Structure output. llvm-svn: 111500	2010-08-19 01:29:07 +00:00
Chris Lattner	3decde9305	refix PR1143 by making basicaa analyze zexts of indices aggresively, which I broke with a recent patch. llvm-svn: 111452	2010-08-18 23:09:49 +00:00
Chris Lattner	26403acef7	GetLinearExpression is only called when TD is non-null, pass as a reference instead of pointer. llvm-svn: 111445	2010-08-18 22:52:09 +00:00
Chris Lattner	1b9c38796e	rework GEP decomposition to make a new VariableGEPIndex struct instead of using a pair. This tidies up the code a bit. While setting things up, add a (currently unused) field to keep track of how the value is extended. llvm-svn: 111444	2010-08-18 22:47:56 +00:00
Chris Lattner	9f7500f57b	move gep decomposition out of ValueTracking into BasicAA. The form of decomposition that it is doing is very basicaa specific and is only used by basicaa. Now with less tree breakingness. llvm-svn: 111433	2010-08-18 22:07:29 +00:00
Owen Anderson	80d19f0905	Use ConstantRange to propagate information through value definitions. llvm-svn: 111425	2010-08-18 21:11:37 +00:00
Daniel Dunbar	fbeeb130d8	Revert r111375, "move gep decomposition out of ValueTracking into BasicAA. The form of", it doesn't pass tests. llvm-svn: 111385	2010-08-18 18:43:08 +00:00
Owen Anderson	208636fa33	Inform LazyValueInfo whenever a block is deleted, to avoid dangling pointer issues. llvm-svn: 111382	2010-08-18 18:39:01 +00:00
Chris Lattner	54fe883203	move gep decomposition out of ValueTracking into BasicAA. The form of decomposition that it is doing is very basicaa specific and is only used by basicaa. llvm-svn: 111375	2010-08-18 18:22:17 +00:00
Chris Lattner	a33edcb56c	fix PR7589: In brief: gep P, (zext x) != gep P, (sext x) DecomposeGEPExpression was getting this wrong, confusing basicaa. llvm-svn: 111352	2010-08-18 04:28:19 +00:00
Dan Gohman	ed2b005842	Tweak IVUsers' concept of "interesting" to exclude add recurrences where the step value is an induction variable from an outer loop, to avoid trouble trying to re-expand such expressions. This effectively hides such expressions from indvars and lsr, which prevents them from getting into trouble. llvm-svn: 111317	2010-08-17 22:50:37 +00:00
Owen Anderson	fa7d44687f	Fix another iterator invalidation that caused a really nasty miscompilation in 403.gcc. llvm-svn: 111210	2010-08-16 23:42:33 +00:00
Dan Gohman	55cd6aadc9	Make dumpPassStructure be a PMDataManager abstraction, rather than a Pass abstraction, since that's the level it's actually used at. Rename Pass' dumpPassStructure to dumpPass. This eliminates an awkward use of getAsPass() to convert a PMDataManager* into a Pass* just to permit a dumpPassStructure call. llvm-svn: 111199	2010-08-16 22:45:12 +00:00
Dan Gohman	797a1dbb1c	To create a copy of a SmallVector with an element removed from the middle, copy the elements in two groups, rather than copying all the elements and then doing an erase on the middle of the result. These are SmallVectors, so we shouldn't expect to hit dynamic allocation in the common case. llvm-svn: 111151	2010-08-16 16:57:24 +00:00
Dan Gohman	0d0cc18af5	Tidy whitespace. llvm-svn: 111147	2010-08-16 16:34:09 +00:00
Dan Gohman	c29eeaecec	Add a comment. llvm-svn: 111145	2010-08-16 16:31:39 +00:00
Dan Gohman	7eac4961d7	Use const_iterator in a few places. llvm-svn: 111144	2010-08-16 16:30:01 +00:00
Dan Gohman	74c61503b1	Use iterators instead of indices in a few more places. llvm-svn: 111143	2010-08-16 16:27:53 +00:00
Dan Gohman	f29618236e	Micro-optimize SCEVConstant comparison. llvm-svn: 111142	2010-08-16 16:25:35 +00:00
Dan Gohman	3688ea5c7d	Move SCEVNAryExpr's virtual member functions out of line, and convert them to iterators. llvm-svn: 111140	2010-08-16 16:21:27 +00:00
Dan Gohman	d6925bbe0d	Use iterators instead of indices in simple cases. llvm-svn: 111138	2010-08-16 16:16:11 +00:00
Dan Gohman	b6c773ec2e	Avoid gratuitous inefficiency in ifndef NDEBUG code. llvm-svn: 111137	2010-08-16 16:13:54 +00:00
Dan Gohman	e5fb1036e6	Make one getAddExpr call when analyzing a+b+c+d+e+... instead of one for each add instruction. Ditto for Mul. llvm-svn: 111136	2010-08-16 16:03:49 +00:00
Dan Gohman	b094b39111	Delete an unused function. llvm-svn: 111135	2010-08-16 15:57:14 +00:00
Dan Gohman	fb83b043eb	Revert r111058, the lint check for indirectbr successors that aren't address-taken. This can occur normally, if the code which took the address got DCEd. llvm-svn: 111121	2010-08-16 14:39:19 +00:00
Argyrios Kyrtzidis	d0fcc9a818	Revert r111082. No warnings for this common pattern. llvm-svn: 111102	2010-08-15 10:27:23 +00:00
Argyrios Kyrtzidis	7c09ddf0ae	Add ATTRIBUTE_UNUSED to methods that are not supposed to be used. llvm-svn: 111082	2010-08-14 21:35:10 +00:00
Dan Gohman	21e6dc6aa3	Add a lint check for an indirectbr destination which has not had its address taken. llvm-svn: 111058	2010-08-13 23:56:28 +00:00
Dan Gohman	0c436ab356	Various optimizations. Don't compare two loops' depths when they are the same loop. Don't compare two instructions' loop depths when they are in the same block. llvm-svn: 111045	2010-08-13 21:24:58 +00:00
Dan Gohman	63c020a210	When testing whether one loop contains another, test this directly rather than testing whether the loop contains the other's header. llvm-svn: 111039	2010-08-13 20:23:25 +00:00
Dan Gohman	3324b9ec67	Add a const. llvm-svn: 111038	2010-08-13 20:17:27 +00:00
Dan Gohman	cf32f2bde1	When creating a symmetric SCEV with a constant operand, put the constant operand on the left, as that's where ScalarEvolution will end up canonicalizing to. llvm-svn: 111037	2010-08-13 20:17:14 +00:00
Dan Gohman	ec0120a123	An add recurrence is loop-invariant in any loop inside of its associated loop. This avoids potentially expensive traversals of the add recurrence's operands. llvm-svn: 111034	2010-08-13 20:11:39 +00:00
Dan Gohman	2de47777f4	Optimize ScalarEvolution::getAddExpr's operand factoring code by having it finish processing all of the muliply operands before starting the whole getAddExpr process over again, instead of immediately after the first simplification. llvm-svn: 110916	2010-08-12 15:00:23 +00:00
Dan Gohman	157847f5d1	Hoist some loop-invariant code out of a hot loop. llvm-svn: 110915	2010-08-12 14:52:55 +00:00
Dan Gohman	e67b287451	Optimize ScalarEvolution::getAddExpr's duplicate operand detection by having it finish processing the whole operand list before starting the whole getAddExpr process over again, instead of immediately after the first duplicate is found. llvm-svn: 110914	2010-08-12 14:46:54 +00:00
Devang Patel	4d597e8268	Even if a variable has constant value all the time, it is still a variable in gdb's eyes. Tested by scope.exp in gdb testsuite. llvm-svn: 110876	2010-08-11 23:17:54 +00:00
Owen Anderson	7b974a45db	Fix a subtle use-after-free issue. llvm-svn: 110863	2010-08-11 22:36:04 +00:00
Dan Gohman	a97e78b4ac	Make LoopPass::getContainedPass return a LoopPass* instead of a Pass* and remove casts from all its callers. llvm-svn: 110848	2010-08-11 20:34:43 +00:00
Owen Anderson	0bd61240e9	Improve indentation. llvm-svn: 110778	2010-08-11 04:24:25 +00:00
Dan Gohman	f7495f286a	When analyzing loop exit conditions combined with and and or, don't make any assumptions about when the two conditions will agree on when to permit the loop to exit. This fixes PR7845. llvm-svn: 110758	2010-08-11 00:12:36 +00:00
Dan Gohman	e18c2d6f99	Rename and reorder the arguments to isImpliedCond, for consistency and clarity. llvm-svn: 110750	2010-08-10 23:46:30 +00:00
Owen Anderson	5f1dd0967d	Now that we're using ConstantRange to represent potential values, make use of that represenation to create constraints from comparisons other than eq/neq. llvm-svn: 110742	2010-08-10 23:20:01 +00:00
Devang Patel	3e4d04230b	Add missing argument. CreateCompositeTypeEx() users, please verify. llvm-svn: 110717	2010-08-10 20:22:49 +00:00
Owen Anderson	185fe00633	Switch over to using ConstantRange to track integral values. llvm-svn: 110714	2010-08-10 20:03:09 +00:00
Devang Patel	8e06a5eb47	Do not forget debug info for enums. Use named mdnode to keep track of these types. llvm-svn: 110712	2010-08-10 20:01:20 +00:00
Tobias Grosser	7fbe6cb429	RegionInfo: Do not assert if a BB is not part of the dominance tree. llvm-svn: 110665	2010-08-10 09:54:35 +00:00
Devang Patel	b219746c80	Handle TAG_constant for integers. llvm-svn: 110656	2010-08-10 07:11:13 +00:00
Devang Patel	c7cf14f5f6	Refactor. llvm-svn: 110607	2010-08-09 21:39:24 +00:00
Owen Anderson	8afac043fb	Add ConstantRange information to the debugging output. llvm-svn: 110598	2010-08-09 20:50:46 +00:00
Owen Anderson	a7aed18624	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Dan Gohman	e68958fcdf	Implement a proper getModRefInfo for va_arg. llvm-svn: 110458	2010-08-06 18:24:38 +00:00
Dan Gohman	6b4671b208	Be more conservative in the face of volatile. llvm-svn: 110456	2010-08-06 18:11:28 +00:00
Dan Gohman	23976df6f2	Fix a comment. llvm-svn: 110455	2010-08-06 18:10:45 +00:00
Dan Gohman	5f1702e4fe	Move all the logic for function attributes and call attributes out of the AliasAnalysis base class and into BasicAliasAnalyais. This avoids confusion about where such logic is happening when there are other AliasAnalysis implementations present. Move the logic for translating two-callsite getModRefInfo queries into other AliasAnalysis queries out of BasicAliasAnalysis and into the AliasAnalysis base class, as it is useful for other AliasAnalysis implementations. llvm-svn: 110421	2010-08-06 01:25:49 +00:00
Owen Anderson	c2107d2eaa	Fix botched revert. llvm-svn: 110416	2010-08-06 00:36:20 +00:00
Owen Anderson	bda59bd247	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Dan Gohman	e0d5c458ec	Fix 80-column violations. llvm-svn: 110401	2010-08-05 23:48:14 +00:00
Owen Anderson	755aceb5d0	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Dan Gohman	884dd752c3	Implement AccessesArguments checking in the two-callsite form of BasicAA::getModRefInfo. This allows BasicAA to say that two memset calls to non-aliasing memory locations don't interfere. llvm-svn: 110393	2010-08-05 23:34:50 +00:00
Dan Gohman	e2a67168bf	Yes, we can do better, but this is not the place for it. llvm-svn: 110391	2010-08-05 23:23:32 +00:00
Owen Anderson	0f306a45ad	Add the beginnings of infrastructure for range tracking. llvm-svn: 110388	2010-08-05 22:59:19 +00:00
Owen Anderson	c3a1413ea1	Split the tag and value members of LVILatticeVal in preparation for expanding the lattice to something that won't fit in two bits. llvm-svn: 110383	2010-08-05 22:10:46 +00:00
Dan Gohman	26ef7c7ab7	Fix memdep's code for reasoning about dependences between two calls. A Ref response from getModRefInfo is not useful here. Instead, check for identical calls only in the NoModRef case. Reapply r110270, and strengthen it to compensate for the memdep changes. When both calls are readonly, there is no dependence between them. llvm-svn: 110382	2010-08-05 22:09:15 +00:00
Dan Gohman	554b012f67	Revert r110270 for now. It appears to uncover a memdep bug. llvm-svn: 110293	2010-08-05 00:43:10 +00:00
Dan Gohman	109561845b	The trouble with testing for "ModRef" and "NoModRef" is that one is a suffix of the other, and FileCheck accepts superstrings. Adjust the output to avoid this problem. llvm-svn: 110280	2010-08-04 23:37:55 +00:00
Dan Gohman	bd33dab633	The two-callsite form of AliasAnalysis::getModRefInfo is documented to return Ref if the left callsite only reads memory read or written by the right callsite; fix BasicAliasAnalysis to implement this. Add AliasAnalysisEvaluator support for testing the two-callsite form of getModRefInfo. llvm-svn: 110270	2010-08-04 22:56:29 +00:00
Dan Gohman	db764c6e3b	Fix a minor bug which resulted in intermediate calculations using wider types than are necessary. llvm-svn: 110241	2010-08-04 19:52:50 +00:00
Torok Edwin	bfc17d0157	Add a missing function. llvm-svn: 110195	2010-08-04 11:42:45 +00:00
Dan Gohman	fc419ef6a0	Remove PointerAccessInfo, which nothing was using. llvm-svn: 110167	2010-08-03 23:08:10 +00:00
Dan Gohman	5442c71f2e	Thread const correctness through a bunch of AliasAnalysis interfaces and eliminate several const_casts. Make CallSite implicitly convertible to ImmutableCallSite. Rename the getModRefBehavior for intrinsic IDs to getIntrinsicModRefBehavior to avoid overload ambiguity with CallSite, which happens to be implicitly convertible to bool. llvm-svn: 110155	2010-08-03 21:48:53 +00:00
Dan Gohman	ad867b0aed	The singular of "indices" is "index". llvm-svn: 110135	2010-08-03 20:23:52 +00:00
Dan Gohman	852d6fc50c	Delete an unused function. llvm-svn: 110134	2010-08-03 20:20:56 +00:00
Dan Gohman	52f9d7d617	Make AliasAnalysis::getModRefInfo conservative in the face of volatility. llvm-svn: 110120	2010-08-03 17:27:43 +00:00
Dan Gohman	081627ceb8	Fix a typo Devang noticed. llvm-svn: 110115	2010-08-03 16:48:31 +00:00
Michael J. Spencer	2ce6994211	Fix CMake build llvm-svn: 110097	2010-08-03 02:38:20 +00:00
Dan Gohman	2a190081f6	Introduce a symbolic constant for ~0u for use with AliasAnalysis. llvm-svn: 110091	2010-08-03 01:03:11 +00:00
Dan Gohman	da7182e116	Add a convenient form of AliasAnalysis::alias for the case where the sizes are unknown. llvm-svn: 110090	2010-08-03 00:56:30 +00:00
Dan Gohman	7cac95778f	Make SCEVUnknown a CallbackVH, so that it can be notified directly of Value deletions and RAUWs, instead of relying on ScalarEvolution's Scalars map being notified, as that's complicated at best, and insufficient in general. This means SCEVUnknown needs a non-trivial destructor, so introduce a mechanism to allow ScalarEvolution to locate all the SCEVUnknowns. llvm-svn: 110086	2010-08-02 23:49:30 +00:00
Dan Gohman	272980b3f6	Sketch up a preliminary Type-Based Alias Analysis implementation. llvm-svn: 110077	2010-08-02 23:11:01 +00:00
Dan Gohman	d8968da2c5	Add a lint check for indirectbr with no successors. llvm-svn: 110074	2010-08-02 23:06:43 +00:00
Devang Patel	33a2cdf3f9	Add explicit constructors. Patch by Renato Golin. llvm-svn: 110072	2010-08-02 22:51:46 +00:00
Dan Gohman	abfafadfc7	Fix namespace polution. llvm-svn: 110056	2010-08-02 18:50:06 +00:00
Oscar Fuentes	40b31ad3ee	Prefix `next' iterator operation with `llvm::'. Fixes potential ambiguity problems on VS 2010. Patch by nobled! llvm-svn: 110029	2010-08-02 06:00:15 +00:00
Owen Anderson	c1561b8400	Add an initial implementation of PHI translation for LazyValueInfo. This involves rolling back some of my earlier data structure improvements until I can ensure that there are no iterator invalidation problems. llvm-svn: 109935	2010-07-30 23:59:40 +00:00
Owen Anderson	e4a0ab69d2	Revert my last two patches to LVI, which recent changes have exposed a miscompilation in. llvm-svn: 109889	2010-07-30 20:56:07 +00:00
Eric Christopher	ef6d5933a6	Speculatively revert r109705 since it seems to be causing some build bot angst. llvm-svn: 109718	2010-07-29 01:25:38 +00:00
Dan Gohman	3d6ac44d96	Factor out some of the code for updating old SCEVUnknown values, and extend it to handle the case where multiple RAUWs affect a single SCEVUnknown. Add a ScalarEvolution unittest to test for this situation. llvm-svn: 109705	2010-07-29 00:17:55 +00:00
Owen Anderson	a44f49f189	Pass the queried value by argument rather than in a member, in preparation for supporting PHI translation. llvm-svn: 109701	2010-07-28 23:50:08 +00:00
Owen Anderson	6982dd4e1f	Get rid of LVIQuery as a distinct data structure, so that we don't have to initialize a new set of maps on every query. llvm-svn: 109679	2010-07-28 22:07:25 +00:00
Daniel Dunbar	18e39cec7a	RegionInfo: Make sure to free cached nodes; Tobias, please check! llvm-svn: 109650	2010-07-28 20:28:50 +00:00
Gabor Greif	e497e5ef46	simplify llvm-svn: 109585	2010-07-28 15:31:37 +00:00
Gabor Greif	5bf74d648d	use Value* constructor of CallSite to create potentially improper site, and test that llvm-svn: 109580	2010-07-28 12:35:54 +00:00
Gabor Greif	67a970bff2	use Value* constructor of CallSite to create potentially improper site llvm-svn: 109579	2010-07-28 12:19:46 +00:00
Gabor Greif	7cf6056484	simplify llvm-svn: 109578	2010-07-28 10:57:28 +00:00
Gabor Greif	2e2503cd8d	simplify llvm-svn: 109577	2010-07-28 10:46:09 +00:00
Dan Gohman	7a066723d0	Make SCEVCallbackVH::allUsesReplacedWith update the old SCEVUnknown object, as it may still be referenced by SCEVs not cleaned up by the use list traversal. Also, in ScalarEvolution::forgetValue, only check for a SCEVUnknown object for the original value, not for any value in the use list, because other SCEVUnknown values aren't necessary obsolete at that point. llvm-svn: 109570	2010-07-28 01:09:07 +00:00
Dan Gohman	8aeb0fb5ca	Make SCEVCallbackVH::allUsesReplacedWith unconditionally delete the old value. llvm-svn: 109567	2010-07-28 00:28:25 +00:00
Owen Anderson	aac5a72139	Rearrange several datastructures in LazyValueInfo to improve compile time. This is still not perfect, but better than it was before. llvm-svn: 109563	2010-07-27 23:58:11 +00:00
Gabor Greif	0630a71742	reintroduce original (asserting) semantics of CallSite(Instruction II) add instead a CallSite(Value V) constructor that is consistent with ImmutableCallSize and use that one in client code llvm-svn: 109553	2010-07-27 22:53:28 +00:00
Gabor Greif	ef1ca24b91	recommit simplification (originally r109504, backed out in r109508) now that problem in CallSiteBase is fixed llvm-svn: 109547	2010-07-27 22:02:00 +00:00
Gabor Greif	ed1d92cb9a	back out r109504, breaks the bots llvm-svn: 109508	2010-07-27 15:18:11 +00:00
Gabor Greif	195a609c37	simplify llvm-svn: 109504	2010-07-27 14:38:38 +00:00
Gabor Greif	d59498bc97	use ImmutableCallSite for const-corrgoodness llvm-svn: 109503	2010-07-27 14:15:29 +00:00
Tobias Grosser	fc763867d5	RegionInfo: Add getMaxRegionExit() getMaxRegionExit returns the exit of the maximal refined region starting at a specific basic block. llvm-svn: 109496	2010-07-27 08:39:43 +00:00
Tobias Grosser	1bec81a888	Add function to query RegionInfo about loops. * contains(Loop), * getOutermostLoop() * Improve getNameStr() to return a sensible name, if basic blocks are not named. llvm-svn: 109490	2010-07-27 04:17:13 +00:00
Owen Anderson	aa7f66ba67	Add an initial implementation of LazyValueInfo updating for JumpThreading. Disabled for now. llvm-svn: 109424	2010-07-26 18:48:03 +00:00
Dan Gohman	cd83870faf	Fix SCEVExpander::visitAddRecExpr so that it remembers the induction variable it inserted rather than using LoopInfo::getCanonicalInductionVariable to rediscover it, since that doesn't work on non-canonical loops. This fixes infinite recurrsion on such loops; PR7562. llvm-svn: 109419	2010-07-26 18:28:14 +00:00
Dan Gohman	b3aa6c7110	Use DominatorTree::properlyDominates instead of dominates with an explicit inequality check. llvm-svn: 109398	2010-07-26 17:34:05 +00:00
Dan Gohman	7038bd5c1a	Eliminate getCanonicalInductionVariableIncrement's last user and eliminate it. llvm-svn: 109270	2010-07-23 21:34:51 +00:00
Dan Gohman	acafc61023	Simplify this code; it can use the regular CFG utlities rather than the BlockTraits abstractions. llvm-svn: 109268	2010-07-23 21:25:16 +00:00
Dan Gohman	5ae3102459	Micro-optimize SCEVComplexityCompare. llvm-svn: 109267	2010-07-23 21:20:52 +00:00
Dan Gohman	992db006d0	Add a const qualifier. llvm-svn: 109266	2010-07-23 21:18:55 +00:00
Gabor Greif	1a2da423c9	use cascading operator-> feature llvm-svn: 109104	2010-07-22 13:49:27 +00:00
Gabor Greif	dde79d8f1a	mass elimination of reliance on automatic iterator dereferencing llvm-svn: 109103	2010-07-22 13:36:47 +00:00
Gabor Greif	d9f48ecb2e	use -> instead of (*). llvm-svn: 109094	2010-07-22 11:12:32 +00:00
Gabor Greif	07c8ad54da	cache dereferenced iterator llvm-svn: 109093	2010-07-22 11:07:46 +00:00
Tobias Grosser	336734aca6	Add new RegionInfo pass. The RegionInfo pass detects single entry single exit regions in a function, where a region is defined as any subgraph that is connected to the remaining graph at only two spots. Furthermore an hierarchical region tree is built. Use it by calling "opt -regions analyze" or "opt -view-regions". llvm-svn: 109089	2010-07-22 07:46:31 +00:00
Dan Gohman	2637cc1a38	Make NamedMDNode not be a subclass of Value, and simplify the interface for creating and populating NamedMDNodes. llvm-svn: 109061	2010-07-21 23:38:33 +00:00
Owen Anderson	ac4a1ede17	Add INSTANTIATE_AG_PASS, which combines RegisterPass<> with RegisterAnalysisGroup<> for pass registration. llvm-svn: 109058	2010-07-21 23:07:00 +00:00
Owen Anderson	a57b97e7e7	Fix batch of converting RegisterPass<> to INTIALIZE_PASS(). llvm-svn: 109045	2010-07-21 22:09:45 +00:00
Jim Grosbach	6cd0deb997	tidy up. llvm-svn: 109038	2010-07-21 21:36:25 +00:00
Dan Gohman	093cb79d4b	Disallow null as a named metadata operand. Make MDNode::destroy private. Fix the one thing that used MDNode::destroy, outside of MDNode itself. One should never delete or destroy an MDNode explicitly. MDNodes implicitly go away when there are no references to them (implementation details aside). llvm-svn: 109028	2010-07-21 18:54:18 +00:00
Dan Gohman	625fd2292d	Fix SCEV denormalization of expressions where the exit value from one loop is involved in the increment of an addrec for another loop. This fixes rdar://8168938. llvm-svn: 108863	2010-07-20 17:06:20 +00:00
Dan Gohman	46f00a25f9	Add a fast path for x - x. llvm-svn: 108855	2010-07-20 16:53:00 +00:00
Dan Gohman	31158756e4	Simplify this code; LoopInfo::getCanonicalInductionVariable will only find integer induction variables. llvm-svn: 108853	2010-07-20 16:46:58 +00:00
Dan Gohman	4fd92434f1	Make getOrInsertCanonicalInductionVariable guarantee that its result is a PHINode*. llvm-svn: 108852	2010-07-20 16:44:52 +00:00
Dan Gohman	191f2e4dbd	Change an argument from an Instruction* to a Value*, which is all that is needed here. llvm-svn: 108850	2010-07-20 16:34:50 +00:00
Dan Gohman	d1488fd8bc	Minor code cleanups. llvm-svn: 108848	2010-07-20 16:32:11 +00:00
Owen Anderson	81781220d2	Speculatively revert r108813, in an attempt to get the self-host buildbots working again. I don't see why this patch would cause them to fail the way they are, but none of the other intervening patches seem likely either. llvm-svn: 108818	2010-07-20 08:26:15 +00:00
Owen Anderson	8dc129325f	Reapply r108794, a fix for the failing test from last time. llvm-svn: 108813	2010-07-20 06:52:42 +00:00
Daniel Dunbar	4a35d6f8cd	Revert r108794, "Separate PassInfo into two classes: a constructor-free superclass (StaticPassInfo) and a constructor-ful subclass (PassInfo).", it is breaking teh everything. llvm-svn: 108805	2010-07-20 03:06:07 +00:00
Owen Anderson	e7c5fe586a	Separate PassInfo into two classes: a constructor-free superclass (StaticPassInfo) and a constructor-ful subclass (PassInfo). llvm-svn: 108794	2010-07-20 01:19:58 +00:00
Dan Gohman	3ff13affda	Minor code simplification. llvm-svn: 108793	2010-07-20 00:57:18 +00:00
Stuart Hastings	61475c5c3c	Correct line info for declarations/definitions. Radar 8063111. llvm-svn: 108784	2010-07-19 23:56:30 +00:00
Gabor Greif	6d673953e3	eliminate CallInst::ArgOffset llvm-svn: 108522	2010-07-16 09:38:02 +00:00
Dan Gohman	fbbdfcaea7	Fix the order that SCEVExpander considers add operands in so that it doesn't miss an opportunity to form a GEP, regardless of the relative loop depths of the operands. This fixes rdar://8197217. llvm-svn: 108475	2010-07-15 23:38:13 +00:00
Dan Gohman	64b1e82a7c	Teach ScalarEvolution how to fold trunc(undef) and anyext(undef) to undef. This helps LSR behave more consistently on bugpoint-reduced testcases. llvm-svn: 108451	2010-07-15 20:02:11 +00:00
Gabor Greif	26ec65ac3c	cache another dereferenced iterator llvm-svn: 108421	2010-07-15 10:19:23 +00:00
Chris Lattner	19eff2a9f6	Fix PR7647, handling the case when 'To' ends up being mutated by recursive simplification. This also enhances ReplaceAndSimplifyAllUses to actually do a real RAUW at the end of it, which updates any value handles pointing to "From" to start pointing to "To". This seems useful for debug info and random other VH users. llvm-svn: 108415	2010-07-15 06:36:08 +00:00
Eli Friedman	8b3a17e613	Revert r108401; it breaks bootstrap :( llvm-svn: 108407	2010-07-15 05:09:31 +00:00
Eli Friedman	fd473a746c	Add AssertingVH which makes PR7647 break consistently. llvm-svn: 108401	2010-07-15 04:46:14 +00:00
Dan Gohman	c128e70ff2	Add a lint check for mismatched return types, inspired by PR6944. llvm-svn: 108162	2010-07-12 18:02:04 +00:00
Duncan Sands	41b4a6b36a	Convert some tab stops into spaces. llvm-svn: 108130	2010-07-12 08:16:59 +00:00
Chandler Carruth	57041d81df	Add parentheses around an \|\| to correct the logic. Also silences a GCC warning that was actually useful here. Chris, please double check that this is the correct interpretation. I was pretty sure, and ran it by Nick as well. llvm-svn: 108129	2010-07-12 06:47:05 +00:00
Chris Lattner	fd4a09fc0a	fix PR7429, a crash turning a load from a string into a float. llvm-svn: 108113	2010-07-12 00:22:51 +00:00
Gabor Greif	8e66a42784	remove useless cast and fix typos in comment llvm-svn: 107989	2010-07-09 16:42:04 +00:00
Gabor Greif	3b740e9085	cache result of operator* llvm-svn: 107988	2010-07-09 16:39:02 +00:00
Gabor Greif	aa389f5085	cache result of operator* llvm-svn: 107982	2010-07-09 16:22:36 +00:00
Gabor Greif	070b9a2cc4	cache result of operator* llvm-svn: 107978	2010-07-09 15:53:42 +00:00
Gabor Greif	d9a0e80213	cache result of operator* llvm-svn: 107977	2010-07-09 15:52:36 +00:00
Gabor Greif	e82532a1c5	cache result of operator* llvm-svn: 107976	2010-07-09 15:40:10 +00:00
Gabor Greif	2732561be9	cache result of operator* llvm-svn: 107967	2010-07-09 14:28:41 +00:00
Gabor Greif	1d20021d82	do not repeatedly dereference use_iterator llvm-svn: 107963	2010-07-09 13:17:13 +00:00
Stuart Hastings	d08fb75aaa	Reverting r107918 and r107919. Radar 8063111. llvm-svn: 107930	2010-07-08 23:25:39 +00:00
Stuart Hastings	43d226deea	Fix decl/def debug info for template functions. Radar 8063111. llvm-svn: 107919	2010-07-08 22:28:59 +00:00
Dan Gohman	5b0a8a863f	Minore code simplification. llvm-svn: 107777	2010-07-07 14:30:04 +00:00
Dan Gohman	00ef93258a	Remove interprocedural-basic-aa and associated code. The AliasAnalysis interface needs implementations to be consistent, so any code which wants to support different semantics must use a different interface. It's not currently worthwhile to add a new interface for this new concept. Document that AliasAnalysis doesn't support cross-function queries. llvm-svn: 107776	2010-07-07 14:27:09 +00:00
Gabor Greif	a22e8148d4	conditionalize by CallInst::ArgOffset llvm-svn: 107767	2010-07-07 10:34:03 +00:00
Dan Gohman	1e33b18e28	Add some more TODO comments. llvm-svn: 107657	2010-07-06 15:23:00 +00:00
Dan Gohman	f855b39edd	Add a comment. llvm-svn: 107656	2010-07-06 15:21:57 +00:00
Dan Gohman	84f90a387d	Remove context sensitivity concerns from interprocedural-basic-aa, and make it more aggressive in cases where both pointers are known to live in the same function. llvm-svn: 107420	2010-07-01 20:08:40 +00:00
Dan Gohman	f638f4ff84	In ScalarEvolution::forgetValue, eliminate any SCEVUnknown entries associated with the value being erased in the folding set map. These entries used to be harmless, because a SCEVUnknown doesn't store any information about its Value*, so having a new Value allocated at the old Value's address wasn't a problem. But now that ScalarEvolution is storing more information about values, this is no longer safe. llvm-svn: 107316	2010-06-30 20:21:12 +00:00
Dan Gohman	c0cca7fdda	Revert the part of r107257 which introduced new logic for using nsw and nuw flags from IR Instructions. On further consideration, this isn't valid. llvm-svn: 107298	2010-06-30 17:27:11 +00:00
Dan Gohman	16206132b6	Improve ScalarEvolution's nsw and nuw preservation. llvm-svn: 107257	2010-06-30 07:16:37 +00:00
Dan Gohman	9396b42ca4	When computing a new ConservativeResult, intersect it with the old one instead of replacing it, to be more precise. llvm-svn: 107256	2010-06-30 06:58:35 +00:00
Dan Gohman	0865966440	Rework scev-aa's basic computation so that it doesn't depend on ScalarEvolution successfully folding and preserving range information for both A-B and B-A. Now, if it gets either one, it's sufficient. llvm-svn: 107249	2010-06-30 06:12:16 +00:00
Dan Gohman	37f145c55b	Simplify. llvm-svn: 107248	2010-06-30 06:09:46 +00:00
Dan Gohman	ae36b1ed42	Fix ScalarEvolution's tripcount computation for chains of loops where each loop's induction variable's start value is the exit value of a preceding loop. llvm-svn: 107224	2010-06-29 23:43:06 +00:00
Dan Gohman	1be9e7c0b6	Fix whitespace style. llvm-svn: 107175	2010-06-29 18:12:34 +00:00
Duncan Sands	67aa21d7b5	Remove a pointless variable. llvm-svn: 107128	2010-06-29 11:39:45 +00:00
Benjamin Kramer	80b7bc042a	Use a more obvious way to avoid compiling functions which are only used when XDEBUG is enabled. llvm-svn: 107125	2010-06-29 10:03:11 +00:00
Chandler Carruth	b1adb88d05	Jump through some silly hoops to make GCC accept that a function may not always be called. llvm-svn: 107124	2010-06-29 06:46:00 +00:00
Dan Gohman	90db61d638	Just as its not safe to blindly transfer the nsw bit from an add instruction to an add scev, it's not safe to blindly transfer the inbounds flag from a gep instruction to an nsw on the scev for the gep. llvm-svn: 107117	2010-06-29 01:41:41 +00:00
Dan Gohman	0824affeff	Add an Intraprocedural form of BasicAliasAnalysis, which aims to properly handles instructions and arguments defined in different functions, or across recursive function iterations. llvm-svn: 107109	2010-06-29 00:50:39 +00:00
Dan Gohman	7c34ece501	Fix Value::stripPointerCasts and BasicAA to avoid trouble on code in unreachable blocks, which have have use-def cycles. This fixes PR7514. llvm-svn: 107071	2010-06-28 21:16:52 +00:00
Dan Gohman	875a296011	Generalize AAEval so that it can be used both per-function and interprocedurally. Note that as of this writing, existing alias analysis passes are not prepared to be used interprocedurally. llvm-svn: 107013	2010-06-28 16:01:37 +00:00
Devang Patel	f7869a4b81	Use named MDNode, llvm.dbg.sp, to collect subprogram info. This will be used to emit local variable's debug info of deleted functions. llvm-svn: 106989	2010-06-28 05:53:08 +00:00
Devang Patel	81170d23de	Do not forget last element, function, while creating Subprogram definition MDNode from subprogram declare MDNode. llvm-svn: 106985	2010-06-27 21:04:31 +00:00
Dan Gohman	89dd42af31	Eliminate a redundant FoldingSet lookup. llvm-svn: 106872	2010-06-25 18:47:08 +00:00
Dan Gohman	5235cc2c25	Don't try to preserve pointer types in SCEVConstants; the old code was over-complicated. llvm-svn: 106760	2010-06-24 16:47:03 +00:00
Dan Gohman	3ace9f4e3d	Make the trunc code consistent with the zext and sext code in its handling of pointer types. llvm-svn: 106757	2010-06-24 16:33:38 +00:00
Gabor Greif	1abbde3103	use ArgOperand accessors llvm-svn: 106697	2010-06-23 23:38:07 +00:00
Gabor Greif	253c6bf366	use the new isFreeCall API and ArgOperand accessors llvm-svn: 106692	2010-06-23 22:48:06 +00:00
Gabor Greif	5f5a864539	minor enhancement to llvm::isFreeCall API: return CallInst; no functional change llvm-svn: 106686	2010-06-23 21:51:12 +00:00
Gabor Greif	ad7884ad98	use ArgOperand getters llvm-svn: 106685	2010-06-23 21:41:47 +00:00
Dan Gohman	75c6b0bb1f	Replace ScalarEvolution's private copy of getLoopPredecessor with LoopInfo's public copy. llvm-svn: 106603	2010-06-22 23:43:28 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Dan Gohman	f820bd327d	Allow "exhaustive" trip count evaluation on phi nodes with all constant operands. llvm-svn: 106537	2010-06-22 13:15:46 +00:00
Devang Patel	b6e058da18	Use single interface, using twine, to get named metadata. getNamedMetadata(). llvm-svn: 106518	2010-06-22 01:19:38 +00:00
Devang Patel	ad51735794	Do not rely on Twine temporaries to survive. llvm-svn: 106515	2010-06-22 01:01:58 +00:00
Dan Gohman	dd41bba517	Use A.append(...) instead of A.insert(A.end(), ...) when A is a SmallVector, and other SmallVector simplifications. llvm-svn: 106452	2010-06-21 19:47:52 +00:00
Devang Patel	e80de80270	Do not directly use function names to construct new name for named metadata. "llvm.dbg.lv.~A" is not a valid name. llvm-svn: 106438	2010-06-21 18:36:58 +00:00
Dan Gohman	c515ab1eb2	Restore a call to rememberInstruction which was accidentally dropped in refactoring. llvm-svn: 106398	2010-06-19 22:50:35 +00:00
Dan Gohman	866971ed3d	Fix ScalarEvolution's "exhaustive" trip count evaluation code to avoid assuming that loops are in canonical form, as ScalarEvolution doesn't depend on LoopSimplify itself. Also, with indirectbr not all loops can be simplified. This fixes PR7416. llvm-svn: 106389	2010-06-19 14:17:24 +00:00
Dan Gohman	d277246137	Factor out duplicated code for reusing and inserting casts into a helper function. llvm-svn: 106388	2010-06-19 13:25:23 +00:00
Dan Gohman	24ceda8eb0	Revert r106304 (105548 and friends), which are the SCEVComplexityCompare optimizations. There is still some nondeterminism remaining. llvm-svn: 106306	2010-06-18 19:54:20 +00:00
Dan Gohman	4c807fca97	Reapply 105540, 105542, and 105548, and revert r105732. llvm-svn: 106304	2010-06-18 19:26:04 +00:00
Dan Gohman	45073042eb	Reapply 105546. llvm-svn: 106302	2010-06-18 19:12:32 +00:00
Dan Gohman	9136d9fbf8	Reapply 105544. llvm-svn: 106301	2010-06-18 19:09:27 +00:00
Dan Gohman	3d8a9d7490	Remove getIntegerSCEV; it's redundant with getConstant, and getConstant is more consistent with the ConstantInt API. llvm-svn: 106281	2010-06-18 14:33:50 +00:00
Dan Gohman	f1d8304fe3	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Dan Gohman	8ba26b48bb	Fix a typo in a comment. llvm-svn: 106260	2010-06-18 00:53:08 +00:00
Dan Gohman	8f5954f42c	Simplify this code. llvm-svn: 106254	2010-06-17 23:34:09 +00:00
Jim Grosbach	fd3b4e7390	A few more places where SCEVExpander bits need to skip over debug intrinsics when iterating through instructions. Yet more work for rdar://7797940 llvm-svn: 106149	2010-06-16 21:13:38 +00:00
Devang Patel	d119da54de	Check function pointer first, before comparing function names. llvm-svn: 106088	2010-06-16 06:42:02 +00:00
Devang Patel	a6d20f446f	Use separate named MDNode to hold each function's local variable info. This speeds up local variable handling in DwarfDebug. llvm-svn: 106075	2010-06-16 00:53:55 +00:00
Stuart Hastings	afe54f1625	Support for nested functions/classes in debug output. (Again.) Radar 7424645. llvm-svn: 105828	2010-06-11 20:08:44 +00:00
Stuart Hastings	6111abf8ad	Delete duplicate function. llvm-svn: 105827	2010-06-11 20:05:01 +00:00
Evan Cheng	ae83e1f5cb	Revert 105540, 105542, 105544, 105546, and 105548 to unbreak bootstrapping. llvm-svn: 105740	2010-06-09 18:59:43 +00:00
Kenneth Uildriks	9b21208bfb	Pulled CodeMetrics out of InlineCost.h and made it a bit more general, so it can be reused from PartialSpecializationCost llvm-svn: 105725	2010-06-09 15:11:37 +00:00
Dan Gohman	ebf2e977cf	The FoldingSet hash data includes pointer values, so it isn't determinstic. Instead, give SCEV objects an arbitrary sequence number. llvm-svn: 105548	2010-06-07 19:36:14 +00:00
Dan Gohman	3553feed79	Optimize this code somewhat by taking advantage of the fact that the operands are sorted. llvm-svn: 105546	2010-06-07 19:20:57 +00:00
Dan Gohman	a2effb6452	Micro-optimize this, to speed up this hotspot in debug builds a little. llvm-svn: 105544	2010-06-07 19:16:37 +00:00
Dan Gohman	18a4b46404	Micro-optimize this. llvm-svn: 105542	2010-06-07 19:12:54 +00:00
Dan Gohman	70910a6ab6	Optimize ScalarEvolution's SCEVComplexityCompare predicate: don't go scrounging through SCEVUnknown contents and SCEVNAryExpr operands; instead just do a simple deterministic comparison of the precomputed hash data. Also, since this is more precise, it eliminates the need for the slow N^2 duplicate detection code. llvm-svn: 105540	2010-06-07 19:06:13 +00:00
Bill Wendling	a3bba3371a	Create new accessors to get arguments for call/invoke instructions. It breaks encapsulation to force the users of these classes to know about the internal data structure of the Operands structure. It also can lead to errors, like in the MSIL writer. llvm-svn: 105539	2010-06-07 19:05:06 +00:00
Stuart Hastings	3ca391027f	Revert 105492 & 105493 due to a testcase regression. Radar 7424645. llvm-svn: 105511	2010-06-05 00:39:29 +00:00
Dan Gohman	bbfb6aca92	LSR needs to remember inserted instructions even in postinc mode, because there could be multiple subexpressions within a single expansion which require insert point adjustment. This fixes PR7306. llvm-svn: 105510	2010-06-05 00:33:07 +00:00
Stuart Hastings	7c015988fe	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 105492	2010-06-04 22:36:03 +00:00
Dan Gohman	538b413ccb	Fix normalization and de-normalization of non-affine SCEVs. llvm-svn: 105480	2010-06-04 19:16:34 +00:00
Dan Gohman	49a372cebc	Fix the noalias checking so that it doesn't worry about an argument aliasing itself. Thanks Duncan! llvm-svn: 105288	2010-06-01 20:51:40 +00:00
Dan Gohman	34709d06c0	Fix AliasDebugger to be aware of operand values too. llvm-svn: 105012	2010-05-28 22:31:51 +00:00
Dan Gohman	0fa67e479a	Add lint checks for function attributes. llvm-svn: 105009	2010-05-28 21:43:57 +00:00
Dan Gohman	c575ec61ea	Fix lint's memcpy and memmove checks, and its basic block traversal. llvm-svn: 104970	2010-05-28 17:44:00 +00:00
Dan Gohman	862f034188	Detect self-referential values. llvm-svn: 104957	2010-05-28 16:45:33 +00:00
Stuart Hastings	c1e216583f	Revert 104841, 104842, 104876 due to buildbot failures. Radar 7424645. llvm-svn: 104953	2010-05-28 16:41:07 +00:00
Dan Gohman	cef9fc37f4	Eli pointed out that va_arg instruction result values don't reference the stack. llvm-svn: 104951	2010-05-28 16:34:49 +00:00
Dan Gohman	54d7aaa819	Teach lint how to look through simple store+load pairs and other effective no-op constructs, to make it more effective on unoptimized IR. llvm-svn: 104950	2010-05-28 16:21:24 +00:00
Dan Gohman	826bdf8c10	Move FindAvailableLoadedValue isSafeToLoadUnconditionally out of lib/Transforms/Utils and into lib/Analysis so that Analysis passes can use them. llvm-svn: 104949	2010-05-28 16:19:17 +00:00
Dan Gohman	a3b6c4b529	ConstantFoldConstantExpression can theoretically return null. llvm-svn: 104948	2010-05-28 16:12:08 +00:00
Dan Gohman	ddba4b725a	Add a lint check for returning the address of stack memory. llvm-svn: 104936	2010-05-28 04:33:42 +00:00
Stuart Hastings	8e99e50d08	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 104841	2010-05-27 16:16:54 +00:00
Jakob Stoklund Olesen	d67defdfe2	Avoid counting InlineAsm as a call - it prevents loop unrolling. PR7026 Patch by Pekka Jääskeläinen! llvm-svn: 104780	2010-05-26 22:40:28 +00:00
Dan Gohman	084bcb1322	Fix Lint printing warnings multiple times. Remove the ErrorStr option from lintModule, which was an artifact from being based on Verifier code. llvm-svn: 104765	2010-05-26 22:28:53 +00:00
Dan Gohman	a20a5cd24f	Reinstate checking of stackrestore, with checking for both Read and Write, and add a comment explaining this. llvm-svn: 104756	2010-05-26 22:21:25 +00:00
Dan Gohman	996bc42a26	Stackrestore is not a load. llvm-svn: 104752	2010-05-26 22:00:10 +00:00
Dan Gohman	c96c6db59d	Remove a TODO which isn't practical. llvm-svn: 104748	2010-05-26 21:50:41 +00:00
Dan Gohman	1249adf160	Implement checking of the tail keyword. llvm-svn: 104744	2010-05-26 21:46:36 +00:00
Devang Patel	0adee9b362	Rename variable. add comment. llvm-svn: 104274	2010-05-20 20:35:24 +00:00
Devang Patel	e0a94bfe9f	Add support to preserve type info for the variables that are removed by the optimizer. llvm-svn: 103798	2010-05-14 21:01:35 +00:00
Nick Lewycky	c63aa1e8ab	Clear CachedFunctionInfo upon Pass::releaseMemory. Because ValueMap will abort on RAUW of functions, this is a correctness issue instead of a mere memory usage problem. No testcase until the new MergeFunctions can land. llvm-svn: 103653	2010-05-12 21:48:15 +00:00
Dan Gohman	bf2fb95b7c	Fix whitespace in debug output to be consistent. llvm-svn: 103422	2010-05-10 20:07:44 +00:00
Devang Patel	cbe7a8508a	Remove DIGlobal. llvm-svn: 103325	2010-05-07 23:19:07 +00:00
Devang Patel	54c59312b1	Add DINameSpace::Verify(). llvm-svn: 103318	2010-05-07 23:04:32 +00:00
Devang Patel	2ae3397536	Verify variable directly. llvm-svn: 103305	2010-05-07 22:04:20 +00:00
Devang Patel	2c4d69d7ad	Verify compile unit also. llvm-svn: 103300	2010-05-07 21:42:24 +00:00
Devang Patel	32cc43c242	Wrap const MDNode * inside DIDescriptor. llvm-svn: 103295	2010-05-07 20:54:48 +00:00
Devang Patel	4423abd734	Use overloaded operators instead of DIDescriptor::getNode() llvm-svn: 103276	2010-05-07 18:19:32 +00:00
Devang Patel	cfa8e9d45f	Avoid DIDescriptor::getNode(). Use overloaded operators instead. llvm-svn: 103272	2010-05-07 18:11:54 +00:00
Dan Gohman	50689f0bb9	Add some words to this output to indicate what the numbers mean. llvm-svn: 103264	2010-05-07 16:39:27 +00:00
Dan Gohman	fb64b5dff4	Add a simple module-level debug info printer. It just sets up a DebugInfoFinder and iterates over all the contents calling print. llvm-svn: 103262	2010-05-07 16:22:32 +00:00
Dan Gohman	6c30e879f8	Fix the new print functions to call print instead of dump. llvm-svn: 103261	2010-05-07 16:17:22 +00:00
Dan Gohman	4bbcf644da	Convert the DebugInfo classes dump() methods into print(raw_ostream &) methods, and add dump functions implemented in terms of the print. llvm-svn: 103254	2010-05-07 15:30:29 +00:00
Dan Gohman	70a3b12193	Use the SCEVAddRecExpr::getPostIncExpr utility function instead of doing the same thing manually. llvm-svn: 102997	2010-05-04 01:12:27 +00:00
Dan Gohman	5f18c547da	Fix a copy+pasto. llvm-svn: 102996	2010-05-04 01:11:15 +00:00
Devang Patel	801b8ea42a	Do not ignore debug loc attached with llvm.dbg.declare while collecting debug info used by a module. llvm-svn: 102995	2010-05-04 01:05:02 +00:00
Dan Gohman	1d2ded75e2	Use getConstant instead of getIntegerSCEV. The two are basically the same, now that getConstant has overloads consistent with ConstantInt::get. llvm-svn: 102965	2010-05-03 22:09:21 +00:00
Dan Gohman	267700c5aa	Silence warnings about -1 being converted to an unsigned value. Also, pass true for isSigned even when creating constants for unsigned comparisons, because the point is to create an all-ones constant, rather than UINT64_MAX, even for integers wider than 64 bits. llvm-svn: 102946	2010-05-03 20:23:47 +00:00
Dan Gohman	b5025c72eb	Use isTrueWhenEqual and isFalseWhenEqual instead of assuming that SimplifyICmpOperands will simplify such cases to EQ or NE. This makes the correcntess of the code independent on SimplifyICmpOperands doing certain simplifications. llvm-svn: 102927	2010-05-03 18:00:24 +00:00
Dan Gohman	d18dc2c876	In ScalarEvolution::print, don't bother printing out the SCEVs for comparison instructions, since they aren't interesting, despite having integer result types. llvm-svn: 102925	2010-05-03 17:03:23 +00:00
Dan Gohman	df564cacaf	In SimplifyICmpOperands, avoid needlessly swapping the operands in the case where both are addrecs in unrelated loops. llvm-svn: 102924	2010-05-03 17:00:11 +00:00
Dan Gohman	81585c18e1	Factor out the new <= and >= analysis code into SimplifyICmpOperands. llvm-svn: 102922	2010-05-03 16:35:17 +00:00
David Chisnall	f4b87f191b	Added a variant of InlineCostAnalyzer::getInlineCost() that takes the called function as an explicit argument, for use when inlining function pointers. llvm-svn: 102841	2010-05-01 15:47:41 +00:00

... 23 24 25 26 27 ...

5537 Commits