llvm-project

Commit Graph

Author	SHA1	Message	Date
Yunzhong Gao	5cbcf56a7e	Fixing a heisenbug where the memory dependence analysis behaves differently with and without -g. Adding a test case to make sure that the threshold used in the memory dependence analysis is respected. The test case also checks that debug intrinsics are not counted towards this threshold. Differential Revision: http://llvm-reviews.chandlerc.com/D2141 llvm-svn: 194646	2013-11-14 01:10:52 +00:00
Nick Lewycky	7ed1dbfff4	Fix xemacs mode line, don't put them in .cpp files (just header files). No functionality change. llvm-svn: 183709	2013-06-10 23:10:59 +00:00
David Blaikie	041f1aa3e2	Use only explicit bool conversion operators BitVector/SmallBitVector::reference::operator bool remain implicit since they model more exactly a bool, rather than something else that can be boolean tested. The most common (non-buggy) case are where such objects are used as return expressions in bool-returning functions or as boolean function arguments. In those cases I've used (& added if necessary) a named function to provide the equivalent (or sometimes negative, depending on convenient wording) test. One behavior change (YAMLParser) was made, though no test case is included as I'm not sure how to reach that code path. Essentially any comparison of llvm::yaml::document_iterators would be invalid if neither iterator was at the end. This helped uncover a couple of bugs in Clang - test cases provided for those in a separate commit along with similar changes to `operator bool` instances in Clang. llvm-svn: 181868	2013-05-15 07:36:59 +00:00
Matt Arsenault	c23753a53e	Fix unchecked uses of DominatorTree in MemoryDependenceAnalysis. Use unknown results for places where it would be needed llvm-svn: 181176	2013-05-06 02:07:24 +00:00
Bill Wendling	9ca12c137f	A limit of 500 was still a bit too high for some tests. PR15000 has a testcase where the time to compile was bordering on 30s. When I dropped the limit value to 100, it became a much more managable 6s. The compile time seems to increase in a roughly linear fashion based on increasing the limit value. (See the runtimes below.) So, let's lower the limit to 100 so that they can get a more reasonable compile time. Limit Value Time ----------- ---- 10 0.9744s 20 1.8035s 30 2.3618s 40 2.9814s 50 3.6988s 60 4.5486s 70 4.9314s 80 5.8012s 90 6.4246s 100 7.0852s 110 7.6634s 120 8.3553s 130 9.0552s 140 9.6820s 150 9.8804s 160 10.8901s 170 10.9855s 180 12.0114s 190 12.6816s 200 13.2754s 210 13.9942s 220 13.8097s 230 14.3272s 240 15.7753s 250 15.6673s 260 16.0541s 270 16.7625s 280 17.3823s 290 18.8213s 300 18.6120s 310 20.0333s 320 19.5165s 330 20.2505s 340 20.7068s 350 21.1833s 360 22.9216s 370 22.2152s 380 23.9390s 390 23.4609s 400 24.0426s 410 24.6410s 420 26.5208s 430 27.7155s 440 26.4142s 450 28.5646s 460 27.3494s 470 29.7255s 480 29.4646s 490 30.5001s llvm-svn: 179713	2013-04-17 20:02:32 +00:00
Matt Arsenault	2080ecd107	Fix loop style llvm-svn: 178355	2013-03-29 18:48:42 +00:00
Jakub Staszak	fa41def6ce	Remove 'else' after 'return'. llvm-svn: 177607	2013-03-20 23:53:45 +00:00
Jakub Staszak	b0a7eed958	Remove trailing spaces. llvm-svn: 177584	2013-03-20 21:47:51 +00:00
Shuxin Yang	408bdad5b4	Memory Dependence Analysis (not mem-dep test) take advantage of "invariant.load" metadata. The "invariant.load" metadata indicates the memory unit being accessed is immutable. A load annotated with this metadata can be moved across any store. As I am not sure if it is legal to move such loads across barrier/fence, this change dose not allow such transformation. rdar://11311484 Thank Arnold for code review. llvm-svn: 176562	2013-03-06 17:48:48 +00:00
Kostya Serebryany	cf880b9443	Unify clang/llvm attributes for asan/tsan/msan (LLVM part) These are two related changes (one in llvm, one in clang). LLVM: - rename address_safety => sanitize_address (the enum value is the same, so we preserve binary compatibility with old bitcode) - rename thread_safety => sanitize_thread - rename no_uninitialized_checks -> sanitize_memory CLANG: - add __attribute__((no_sanitize_address)) as a synonym for __attribute__((no_address_safety_analysis)) - add __attribute__((no_sanitize_thread)) - add __attribute__((no_sanitize_memory)) for S in address thread memory If -fsanitize=S is present and __attribute__((no_sanitize_S)) is not set llvm attribute sanitize_S llvm-svn: 176075	2013-02-26 06:58:09 +00:00
Kostya Serebryany	3838f27905	[tsan] disable load widening in ThreadSanitizer mode llvm-svn: 175034	2013-02-13 05:59:45 +00:00
Dan Gohman	20a2ae9df5	Change GetPointerBaseWithConstantOffset's DataLayout argument from a reference to a pointer, so that it can handle the case where DataLayout is not available and behave conservatively. llvm-svn: 174024	2013-01-31 02:00:45 +00:00
Chandler Carruth	9fb823bbd4	Move all of the header files which are involved in modelling the LLVM IR into their new header subdirectory: include/llvm/IR. This matches the directory structure of lib, and begins to correct a long standing point of file layout clutter in LLVM. There are still more header files to move here, but I wanted to handle them in separate commits to make tracking what files make sense at each layer easier. The only really questionable files here are the target intrinsic tablegen files. But that's a battle I'd rather not fight today. I've updated both CMake and Makefile build systems (I think, and my tests think, but I may have missed something). I've also re-sorted the includes throughout the project. I'll be committing updates to Clang, DragonEgg, and Polly momentarily. llvm-svn: 171366	2013-01-02 11:36:10 +00:00
Bill Wendling	698e84fc4f	Remove the Function::getFnAttributes method in favor of using the AttributeSet directly. This is in preparation for removing the use of the 'Attribute' class as a collection of attributes. That will shift to the AttributeSet class instead. llvm-svn: 171253	2012-12-30 10:32:01 +00:00
Bill Wendling	3d7b0b8ac7	Rename the 'Attributes' class to 'Attribute'. It's going to represent a single attribute in the future. llvm-svn: 170502	2012-12-19 07:18:57 +00:00
Chandler Carruth	ed0881b2a6	Use the new script to sort the includes of every file under lib. Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to a header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131	2012-12-03 16:50:05 +00:00
Bill Wendling	5858b56ce3	Ignore unreachable blocks when doing memory dependence analysis on non-local loads. It's not really profitable and may result in GVN going into an infinite loop when it hits constructs like this: %x = gep %some.type %x, ... Found via an LTO build of LLVM. llvm-svn: 166490	2012-10-23 18:37:11 +00:00
Bill Wendling	c9b22d735a	Create enums for the different attributes. We use the enums to query whether an Attributes object has that attribute. The opaque layer is responsible for knowing where that specific attribute is stored. llvm-svn: 165488	2012-10-09 07:45:08 +00:00
Micah Villmow	cdfe20b97f	Move TargetData to DataLayout. llvm-svn: 165402	2012-10-08 16:38:25 +00:00
Bill Wendling	863bab689a	Remove the `hasFnAttr' method from Function. The hasFnAttr method has been replaced by querying the Attributes explicitly. No intended functionality change. llvm-svn: 164725	2012-09-26 21:48:26 +00:00
Bob Wilson	01cfbfe9d0	Be conservative about allocations that may alias the accessed pointer. If an allocation has a must-alias relation to the access pointer, we treat it as a Def. Otherwise, without this check, the code here was just skipping over the allocation call and ignoring it. I noticed this by inspection and don't have a specific testcase that it breaks, but it seems like we need to treat a may-alias allocation as a Clobber. llvm-svn: 163127	2012-09-04 03:30:13 +00:00
Bob Wilson	dcc54decd5	Fix more fallout from r158919, similar to PR13547. This code used to only handle malloc-like calls, which do not read memory. r158919 changed it to check isNoAliasFn(), which includes strdup-like and realloc-like calls, but it was not checking for dependencies on the memory read by those calls. llvm-svn: 163106	2012-09-03 05:15:15 +00:00
Benjamin Kramer	8bcc971174	Make MemoryBuiltins aware of TargetLibraryInfo. This disables malloc-specific optimization when -fno-builtin (or -ffreestanding) is specified. This has been a problem for a long time but became more severe with the recent memory builtin improvements. Since the memory builtin functions are used everywhere, this required passing TLI in many places. This means that functions that now have an optional TLI argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead mallocs anymore if the TLI argument is missing. I've updated most passes to do the right thing. Fixes PR13694 and probably others. llvm-svn: 162841	2012-08-29 15:32:21 +00:00
Nadav Rotem	5d4e205874	MemoryDependenceAnalysis attempts to find the first memory dependency for function calls. Currently, if GetLocation reports that it did not find a valid pointer (this is the case for volatile load/stores), we ignore the result. This patch adds code to handle the cases where we did not obtain a valid pointer. rdar://11872864 PR12899 llvm-svn: 161802	2012-08-13 23:03:43 +00:00
Nuno Lopes	55fff83422	refactor the MemoryBuiltin analysis: - provide more extensive set of functions to detect library allocation functions (e.g., malloc, calloc, strdup, etc) - provide an API to compute the size and offset of an object pointed by Move a few clients (GVN, AA, instcombine, ...) to the new API. This implementation is a lot more aggressive than each of the custom implementations being replaced. Patch reviewed by Nick Lewycky and Chandler Carruth, thanks. llvm-svn: 158919	2012-06-21 15:45:28 +00:00
Benjamin Kramer	bde9176663	Fix typos found by http://github.com/lyda/misspell-check llvm-svn: 157885	2012-06-02 10:20:22 +00:00
Chad Rosier	a968caf8e0	Move the capture analysis from MemoryDependencyAnalysis to a more general place so that it can be reused in MemCpyOptimizer. This analysis is needed to remove an unnecessary memcpy when returning a struct into a local variable. rdar://11341081 PR12686 llvm-svn: 156776	2012-05-14 20:35:04 +00:00
Chad Rosier	10702d5f22	Hoist simpler checks above llvm::PointerMayBeCaptured. No functional change intended. llvm-svn: 156687	2012-05-12 00:43:40 +00:00
Rafael Espindola	b660977c67	Don't call dominates on unreachable instructions. Should fix the dragonegg build. Testcase is still reducing. llvm-svn: 151474	2012-02-26 05:30:08 +00:00
Kostya Serebryany	9e0d377400	The patch resolves the conflict between AddressSanitizer and load widening (GVN). The problem initially reported by Mozilla folks (http://code.google.com/p/address-sanitizer/issues/detail?id=20), but it also prevents us from enabling LLVM bootstrap with AddressSanitizer. llvm-svn: 149925	2012-02-06 22:48:56 +00:00
David Blaikie	46a9f016c5	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
Nick Lewycky	4c378a4453	Change CaptureTracking to pass a Use* instead of a Value* when a value is captured. This allows the tracker to look at the specific use, which may be especially interesting for function calls. Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does not iterate until a fixpoint and does not guarantee that it produces the same result regardless of iteration order. The new implementation builds up a graph of how arguments are passed from function to function, and uses a bottom-up walk on the argument-SCCs to assign nocapture. This gets us nocapture more often, and does so rather efficiently and independent of iteration order. llvm-svn: 147327	2011-12-28 23:24:21 +00:00
Nick Lewycky	063ae5897c	Fix crasher in GVN due to my recent capture tracking changes. llvm-svn: 145047	2011-11-21 19:42:56 +00:00
Nick Lewycky	6ae03c3378	Less template, more virtual! Refactoring suggested by Chris in code review. llvm-svn: 145014	2011-11-20 19:37:06 +00:00
Nick Lewycky	612d70b19d	Refactor code to use new attribute getters on CallSite for NoCapture and ByVal. Suggested in code review by Eli. That code in InstCombine looks kinda suspicious. llvm-svn: 145013	2011-11-20 19:09:04 +00:00
Nick Lewycky	7013a19e8a	Refactor capture tracking (which already had a couple flags for whether returns and stores capture) to permit the caller to see each capture point and decide whether to continue looking. Use this inside memdep to do an analysis that basicaa won't do. This lets us solve another devirtualization case, fixing PR8908! llvm-svn: 144580	2011-11-14 22:49:42 +00:00
Eli Friedman	c1702c8f22	Enhance the memdep interface so that users can tell the difference between a dependency which cannot be calculated and a path reaching the entry point of the function. This patch introduces isNonFuncLocal, which replaces isUnknown in some cases. Patch by Xiaoyi Guo. llvm-svn: 141896	2011-10-13 22:14:57 +00:00
Eli Friedman	5494adac67	Misc analysis passes that need to be aware of atomic load/store. llvm-svn: 137650	2011-08-15 20:54:19 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Eli Friedman	8b098b0d57	Add a limit to the number of instructions memdep will scan in a single block. This prevents (at least in some cases) O(N^2) runtime in passes like DSE. The limit in this patch is probably too high, but it is enough to stop DSE from going completely insane on a testcase I have (which has a single block with around 50,000 non-aliasing stores in it). rdar://9471075 llvm-svn: 133111	2011-06-15 23:59:25 +00:00
Eli Friedman	7d58bc7bc0	Add "unknown" results for memdep, which mean "I don't know whether a dependence for the given instruction exists in the given block". This cleans up all the existing hacks in memdep which represent this concept by returning clobber with various unrelated instructions. llvm-svn: 133031	2011-06-15 00:47:34 +00:00
Dan Gohman	a471751c24	Disable the main feature of 130180, the elimination of loads that are redundant with partially-aliasing loads. When computing what portion of a clobbering load value is needed, it doesn't consider phi-translation which may have occurred between the clobbing load and the redundant load. llvm-svn: 132631	2011-06-04 06:48:50 +00:00
Eli Friedman	b576b1675c	When marking a block as being unanalyzable, use "Clobber" on the terminator instead of the first instruction in the block. This is a bit of a hack; "Clobber" isn't really the right marking in the first place. memdep doesn't really have any way of properly expressing "unanalyzable" at the moment. Using it on the terminator is much less ambiguous than using it on an arbitrary instruction, though. In the given testcase, the "Clobber" was pointing to a load, and GVN was incorrectly assuming that meant that the "Clobber" load overlapped the load being analyzed (when they are actually unrelated). The included testcase tests both this commit and r132434. Part two of rdar://9429882. (r132434 was mislabeled.) llvm-svn: 132442	2011-06-02 00:08:52 +00:00
Eli Friedman	4b6eeb9ca2	In MemoryDependenceAnalysis::getNonLocalPointerDepFromBB, if a given block is is deemed unanalyzable (and we execute one of the "goto PredTranslationFailure" statements), make sure we don't put information about the predecessors of that block into the returned data structures; this can lead to, among other things, extraneous results (which will confuse passes using memdep). Fixes an assert in GVN compiling ruby. Part of rdar://problem/9521954 . Testcase coming up soon. llvm-svn: 132434	2011-06-01 23:16:53 +00:00
Owen Anderson	97f0cf32ea	@llvm.lifetime.begin acts as a load, not @llvm.lifetime.end. llvm-svn: 131437	2011-05-17 00:05:49 +00:00
Chris Lattner	827a270a2a	teach GVN to widen integer loads when they are overaligned, when doing an wider load would allow elimination of subsequent loads, and when the wider load is still a native integer type. This eliminates a ton of loads on various benchmarks involving struct fields, though it is somewhat hobbled by clang not being very aggressive about field alignment. This is yet another step along the way towards resolving PR6627. llvm-svn: 130390	2011-04-28 07:29:08 +00:00
Chris Lattner	7aab2799ae	Enhance memdep to return clobber relation between noalias loads when an earlier load could be widened to encompass a later load. For example, if we see: X = load i8* P, align 4 Y = load i8* (P+3), align 1 and we have a 32-bit native integer type, we can widen the former load to i32 which then makes the second load redundant. GVN can't actually do anything with this load/load relation yet, so this isn't testable, but it is the next step to resolving PR6627, and a fairly general class of "merge neighboring loads" missed optimizations. llvm-svn: 130250	2011-04-26 22:42:01 +00:00
Chris Lattner	32dc9bd1bb	use AA::isMustAlias to simplify some calls. llvm-svn: 130248	2011-04-26 21:53:34 +00:00
Chris Lattner	6b96621a8a	remove support for llvm.invariant.end from memdep. It is a work-in-progress that is not progressing, and it has issues. llvm-svn: 130247	2011-04-26 21:50:51 +00:00
Chris Lattner	6f83d06ffa	Enhance MemDep: When alias analysis returns a partial alias result, return it as a clobber. This allows GVN to do smart things. Enhance GVN to be smart about the case when a small load is clobbered by a larger overlapping load. In this case, forward the value. This allows us to compile stuff like this: int test(void P) { int tmp = (unsigned int)P; return tmp+((unsigned char*)P+1); } into: _test: ## @test movl (%rdi), %ecx movzbl %ch, %eax addl %ecx, %eax ret which has one load. We already handled the case where the smaller load was from a must-aliased base pointer. llvm-svn: 130180	2011-04-26 01:21:15 +00:00
Dan Gohman	0f124e1987	Give GetUnderlyingObject a TargetData, to keep it in sync with BasicAA's DecomposeGEPExpression, which recently began using a TargetData. This fixes PR8968, though the testcase is awkward to reduce. Also, update several off GetUnderlyingObject's users which happen to have a TargetData handy to pass it in. llvm-svn: 124134	2011-01-24 18:53:32 +00:00
Jakob Stoklund Olesen	087f207009	Revert r123207: "Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare." It didn't. llvm-svn: 123215	2011-01-11 04:05:39 +00:00
Jakob Stoklund Olesen	9b6853efd6	Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare. llvm-svn: 123207	2011-01-11 01:18:03 +00:00
Jeffrey Yasskin	9b43f33620	Change all self assignments X=X to (void)X, so that we can turn on a new gcc warning that complains on self-assignments and self-initializations. llvm-svn: 122458	2010-12-23 00:58:24 +00:00
Dan Gohman	a4fcd2418d	Move Value::getUnderlyingObject to be a standalone function so that it can live in Analysis instead of VMCore. llvm-svn: 121885	2010-12-15 20:02:24 +00:00
Dan Gohman	ba5d0abe39	Update memdep to handle PartialAlias as MayAlias. llvm-svn: 121723	2010-12-13 22:47:57 +00:00
Chris Lattner	d540a5d842	strength reduce this. llvm-svn: 120381	2010-11-30 01:56:13 +00:00
Benjamin Kramer	585dfa2b3d	Initialize MemDep's TD member so buildbots don't trip over an uninitialized pointer (TD is passed to PHITransAddr). I wonder why this didn't explode earlier. llvm-svn: 119944	2010-11-21 15:21:46 +00:00
Chris Lattner	e48c31ce33	implement PR8576, deleting dead stores with intervening may-alias stores. llvm-svn: 119927	2010-11-21 07:34:32 +00:00
Dan Gohman	65316d6749	Add helper functions for computing the Location of load, store, and vaarg instructions. llvm-svn: 118845	2010-11-11 21:50:19 +00:00
Dan Gohman	c87c843db7	It's not necessary to clear out the Size and TBAATag at each of these points. llvm-svn: 118752	2010-11-11 00:42:22 +00:00
Dan Gohman	8bf3d832e5	Set NonLocalDepInfo's Size field to UnknownSize when invalidating it, so that it doesn't appear to be a known size. llvm-svn: 118748	2010-11-11 00:20:27 +00:00
Dan Gohman	6791936848	When clearing a non-local pointer dependency cache entry, clear the reverse map too. This fixes seflhost build errors. llvm-svn: 118729	2010-11-10 22:35:02 +00:00
Dan Gohman	1d760ce8b3	Factor out the code for computing an AliasAnalysis::Location for a given instruction into a helper function. llvm-svn: 118723	2010-11-10 21:51:35 +00:00
Dan Gohman	2e8ca44b81	Fully invalidate cached results when a prior query's size or type is insufficient for, or incompatible with, the current query. llvm-svn: 118721	2010-11-10 21:45:11 +00:00
Dan Gohman	0a6021a54d	Enhance GVN to do more precise alias queries for non-local memory references. For example, this allows gvn to eliminate the load in this example: void foo(int n, int* p, int q) { p[0] = 0; p[1] = 1; if (n) { q = p[0]; } } llvm-svn: 118714	2010-11-10 20:37:15 +00:00
Dan Gohman	15a43965ac	Teach memdep to use pointsToConstantMemory to determine that loads from constant memory don't alias any stores. llvm-svn: 117636	2010-10-29 01:14:04 +00:00
Owen Anderson	6c18d1aac0	Get rid of static constructors for pass registration. Instead, every pass exposes an initializeMyPassFunction(), which must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize the pass's dependencies. Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h before parsing commandline arguments. I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass registration/creation, please send the testcase to me directly. llvm-svn: 116820	2010-10-19 17:21:58 +00:00
Owen Anderson	8ac477ffb5	Begin adding static dependence information to passes, which will allow us to perform initialization without static constructors AND without explicit initialization by the client. For the moment, passes are required to initialize both their (potential) dependencies and any passes they preserve. I hope to be able to relax the latter requirement in the future. llvm-svn: 116334	2010-10-12 19:48:12 +00:00
Owen Anderson	df7a4f2515	Now with fewer extraneous semicolons! llvm-svn: 115996	2010-10-07 22:25:06 +00:00
Dan Gohman	2348393cf5	Teach memdep about TBAA tags. llvm-svn: 114588	2010-09-22 21:41:02 +00:00
Chris Lattner	a58edd1df3	cleanup some of the lifetime/invariant marker stuff, add a big fixme. llvm-svn: 113144	2010-09-06 03:58:04 +00:00
Chris Lattner	e34c835bde	speed up -gvn 3.4% on the testcase in PR7023 llvm-svn: 113135	2010-09-06 01:26:29 +00:00
Owen Anderson	a7aed18624	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Owen Anderson	bda59bd247	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Owen Anderson	755aceb5d0	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Dan Gohman	26ef7c7ab7	Fix memdep's code for reasoning about dependences between two calls. A Ref response from getModRefInfo is not useful here. Instead, check for identical calls only in the NoModRef case. Reapply r110270, and strengthen it to compensate for the memdep changes. When both calls are readonly, there is no dependence between them. llvm-svn: 110382	2010-08-05 22:09:15 +00:00
Dan Gohman	da7182e116	Add a convenient form of AliasAnalysis::alias for the case where the sizes are unknown. llvm-svn: 110090	2010-08-03 00:56:30 +00:00
Gabor Greif	0630a71742	reintroduce original (asserting) semantics of CallSite(Instruction II) add instead a CallSite(Value V) constructor that is consistent with ImmutableCallSize and use that one in client code llvm-svn: 109553	2010-07-27 22:53:28 +00:00
Gabor Greif	ef1ca24b91	recommit simplification (originally r109504, backed out in r109508) now that problem in CallSiteBase is fixed llvm-svn: 109547	2010-07-27 22:02:00 +00:00
Gabor Greif	ed1d92cb9a	back out r109504, breaks the bots llvm-svn: 109508	2010-07-27 15:18:11 +00:00
Gabor Greif	195a609c37	simplify llvm-svn: 109504	2010-07-27 14:38:38 +00:00
Owen Anderson	a57b97e7e7	Fix batch of converting RegisterPass<> to INTIALIZE_PASS(). llvm-svn: 109045	2010-07-21 22:09:45 +00:00
Gabor Greif	253c6bf366	use the new isFreeCall API and ArgOperand accessors llvm-svn: 106692	2010-06-23 22:48:06 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Eric Christopher	7258dcd77f	Revert 101465, it broke internal OpenGL testing. Probably the best way to know that all getOperand() calls have been handled is to replace that API instead of updating. llvm-svn: 101579	2010-04-16 23:37:20 +00:00
Gabor Greif	f375520f7b	reapply r101434 with a fix for self-hosting rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101465	2010-04-16 15:33:14 +00:00
Gabor Greif	403e9694f9	back out r101423 and r101397, they break llvm-gcc self-host on darwin10 llvm-svn: 101434	2010-04-16 01:16:20 +00:00
Gabor Greif	33ae80bff7	reapply r101364, which has been backed out in r101368 with a fix rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101397	2010-04-15 20:51:13 +00:00
Gabor Greif	9fd00c7d25	back out r101364, as it trips the linux nightlybot on some clang C++ tests llvm-svn: 101368	2010-04-15 12:46:56 +00:00
Gabor Greif	aafd209632	rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101364	2010-04-15 10:49:53 +00:00
Daniel Dunbar	693ea89214	Reapply r97010, the speculative revert failed. llvm-svn: 97036	2010-02-24 08:48:04 +00:00
Daniel Dunbar	0a2031e5b6	Speculatively revert r97010, "Add an argument to PHITranslateValue to specify the DominatorTree. ...", in hopes of restoring poor old PPC bootstrap. llvm-svn: 97027	2010-02-24 06:55:22 +00:00
Bob Wilson	66e58ac742	Add an argument to PHITranslateValue to specify the DominatorTree. If this argument is non-null, pass it along to PHITranslateSubExpr so that it can prefer using existing values that dominate the PredBB, instead of just blindly picking the first equivalent value that it finds on a uselist. Also when the DominatorTree is specified, have PHITranslateValue filter out any result that does not dominate the PredBB. This is basically just refactoring the check that used to be in GetAvailablePHITranslatedSubExpr and also in GVN. Despite my initial expectations, this change does not affect the results of GVN for any testcases that I could find, but it should help compile time. Before this change, if PHITranslateSubExpr picked a value that does not dominate, PHITranslateWithInsertion would then insert a new value, which GVN would later determine to be redundant and would replace. By picking a good value to begin with, we save GVN the extra work of inserting and then replacing a new value. llvm-svn: 97010	2010-02-24 01:39:00 +00:00
Bob Wilson	92cdb6eec5	Split critical edges as needed for load PRE. llvm-svn: 96378	2010-02-16 19:51:59 +00:00
Duncan Sands	19d0b47b1f	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
Chris Lattner	9b7d99eb76	The phi translated pointer can be computed when returning a partially cached result instead of stored. This reduces memdep memory usage, and also eliminates a bunch of weakvh's. This speeds up gvn on gcc.c-torture/20001226-1.c from 23.9s to 8.45s (2.8x) on a different machine than earlier. llvm-svn: 91885	2009-12-22 04:25:02 +00:00
Chris Lattner	2ee6787c1b	avoid calling extractMallocCall when it's obvious we don't have a call. This speeds up memdep ~1.5% llvm-svn: 91869	2009-12-22 01:00:32 +00:00
Chris Lattner	25bf6f8946	fix an overly conservative caching issue that caused memdep to cache a pointer as being unavailable due to phi trans in the wrong place. This would cause later queries to fail even when they didn't involve phi trans. llvm-svn: 91787	2009-12-19 21:29:22 +00:00
Chris Lattner	eea0f58393	enhance NonLocalDepEntry to keep the per-block phi translated address of the query. llvm-svn: 90958	2009-12-09 07:31:04 +00:00
Chris Lattner	0c31547168	change NonLocalDepEntry from being a typedef for an std::pair to be its own small class. No functionality change. llvm-svn: 90956	2009-12-09 07:08:01 +00:00
Chris Lattner	972e6d8d00	Switch GVN and memdep to use PHITransAddr, which correctly handles phi translation of complex expressions like &A[i+1]. This has the following benefits: 1. The phi translation logic is all contained in its own class with a strong interface and verification that it is self consistent. 2. The logic is more correct than before. Previously, if intermediate expressions got PHI translated, we'd miss the update and scan for the wrong pointers in predecessor blocks. @phi_trans2 is a testcase for this. 3. We have a lot less code in memdep. We can handle phi translation across blocks of things like @phi_trans3, which is pretty insane :). This patch should fix the miscompiles of 255.vortex, and I tested it with a bootstrap of llvm-gcc, llvm-test and dejagnu of course. llvm-svn: 90926	2009-12-09 01:59:31 +00:00
Nick Lewycky	e91765fdbb	Fix indentation in switch statement. llvm-svn: 90650	2009-12-05 06:37:24 +00:00
Benjamin Kramer	eee88bc5d2	Silence compiler warnings. llvm-svn: 90319	2009-12-02 15:33:44 +00:00
Owen Anderson	b9878ee6b6	Cleanup/remove some parts of the lifetime region handling code in memdep and GVN, per Chris' comments. Adjust testcases to match. llvm-svn: 90304	2009-12-02 07:35:19 +00:00
Chris Lattner	e914c0eaa0	rename some variables. llvm-svn: 90258	2009-12-01 21:16:01 +00:00
Chris Lattner	506b858c45	tidy llvm-svn: 90257	2009-12-01 21:15:15 +00:00
Chris Lattner	9c2053b242	fix 255.vortex again, third time's the charm. llvm-svn: 90217	2009-12-01 07:33:32 +00:00
Nick Lewycky	2d32947099	Revert r90107, fixing test/Transforms/GVN/2009-11-29-ReverseMap.ll and the llvm-gcc build. llvm-svn: 90113	2009-11-30 07:05:51 +00:00
Chris Lattner	4d252d20e8	reapply r90093 with an addition of keeping the forward and reverse nonlocal memdep maps in synch, this should fix 255.vortex. llvm-svn: 90107	2009-11-30 02:26:29 +00:00
Chris Lattner	0311ade94c	revert this patch for now, it causes failures of: LLVM::Transforms/GVN/2009-02-17-LoadPRECrash.ll LLVM::Transforms/GVN/2009-06-17-InvalidPRE.ll llvm-svn: 90096	2009-11-29 21:14:59 +00:00
Chris Lattner	52e7715b0b	Fix a really nasty caching bug I introduced in memdep. An entry was being added to the Result vector, but not being put in the cache. This means that if the cache was reused wholesale for a later query that it would be missing this entry and we'd do an incorrect load elimination. Unfortunately, it's not really possible to write a useful testcase for this, but this unbreaks 255.vortex. llvm-svn: 90093	2009-11-29 21:09:36 +00:00
Nick Lewycky	0a1f25b927	Detabify. llvm-svn: 90085	2009-11-29 18:10:39 +00:00
Nick Lewycky	218a3393f4	Teach memdep to look for memory use intrinsics during dependency queries. Fixes PR5574. llvm-svn: 90045	2009-11-28 21:27:49 +00:00
Chris Lattner	44da5bd837	Enhance InsertPHITranslatedPointer to be able to return a list of newly inserted instructions. No functionality change until someone starts using it. llvm-svn: 90039	2009-11-28 15:39:14 +00:00
Chris Lattner	d5bd369a0f	enable code to handle un-phi-translatable cases more aggressively: if we don't have an address expression available in a predecessor, then model this as the value being clobbered at the end of the pred block instead of being modeled as a complete phi translation failure. This is important for PRE of loads because we want to see that the load is available in all but this predecessor, and complete phi translation failure results in not getting any information about predecessors. This doesn't do anything until I renable code insertion since PRE now sees that it is available in all but one predecessors, but can't insert the addressing in the predecessor that is missing it to eliminate the redundancy. llvm-svn: 90037	2009-11-28 14:54:10 +00:00
Chris Lattner	2be52e72ae	Rework InsertPHITranslatedPointer to handle the recursive case, this fixes PR5630 and sets the stage for the next phase of goodness (testcase pending). llvm-svn: 90019	2009-11-27 22:05:15 +00:00
Chris Lattner	4ee17e1482	recursively phi translate bitcast operands too, for consistency. llvm-svn: 90016	2009-11-27 20:25:30 +00:00
Chris Lattner	2f0354ecf0	add support for recursive phi translation and phi translation of add with immediate. This allows us to optimize this function: void test(int N, double* G) { long j; G[1] = 1; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } to only do one load every iteration of the loop. llvm-svn: 90013	2009-11-27 19:11:31 +00:00
Chris Lattner	6d294de548	add comment. llvm-svn: 90002	2009-11-27 08:40:14 +00:00
Chris Lattner	ac323297e0	reduce nesting, no functionality change. llvm-svn: 90001	2009-11-27 08:37:22 +00:00
Chris Lattner	25be93dfed	teach GVN's load PRE to insert computations of the address in predecessors where it is not available. It's unclear how to get this inserted computation into GVN's scalar availability sets, Owen, help? :) llvm-svn: 89997	2009-11-27 08:25:10 +00:00
Chris Lattner	a9a76ccf56	Fix phi translation in load PRE to agree with the phi translation done by memdep, and reenable gep translation again. llvm-svn: 89992	2009-11-27 06:31:14 +00:00
Chris Lattner	b018bda665	redisable this, my bootstrap worked because it wasn't an optimized build, whoops. llvm-svn: 89991	2009-11-27 05:53:01 +00:00
Chris Lattner	fb8a718fc3	try again. llvm-svn: 89990	2009-11-27 05:19:56 +00:00
Chris Lattner	14444f5c1a	this is causing buildbot failures, disable for now. llvm-svn: 89985	2009-11-27 01:52:22 +00:00
Chris Lattner	5030c6ab21	teach phi translation of GEPs to simplify geps like 'gep x, 0'. This allows us to compile the example from PR5313 into: LBB1_2: ## %bb incl %ecx movb %al, (%rsi) movslq %ecx, %rax movb (%rdi,%rax), %al testb %al, %al jne LBB1_2 instead of: LBB1_2: ## %bb movslq %eax, %rcx incl %eax movb (%rdi,%rcx), %cl movb %cl, (%rsi) movslq %eax, %rcx cmpb $0, (%rdi,%rcx) jne LBB1_2 llvm-svn: 89981	2009-11-27 00:34:38 +00:00
Chris Lattner	4c88e814b8	teach memdep to do trivial PHI translation of GEPs. More to come. llvm-svn: 89979	2009-11-27 00:07:37 +00:00
Chris Lattner	9bd2136ca3	Teach memdep to phi translate bitcasts. This allows us to compile the example in GCC PR16799 to: LBB1_2: ## %bb1 movl %eax, %eax subq %rax, %rdi movq %rdi, (%rcx) movl (%rdi), %eax testl %eax, %eax je LBB1_2 instead of: LBB1_2: ## %bb1 movl (%rdi), %ecx subq %rcx, %rdi movq %rdi, (%rax) cmpl $0, (%rdi) je LBB1_2 llvm-svn: 89978	2009-11-26 23:41:07 +00:00
Chris Lattner	c49f5ac7d8	factor some code out into some helper functions. llvm-svn: 89975	2009-11-26 23:18:49 +00:00
Nick Lewycky	663e0a06b0	Remove dead code. While there, also turn a few 'T* ' into 'T *' to match the rest of the file. llvm-svn: 89577	2009-11-22 02:38:11 +00:00
Owen Anderson	2b2bd28973	Treat lifetime begin/end markers as allocations/frees respectively for the purposes for GVN/DSE. llvm-svn: 85383	2009-10-28 07:05:35 +00:00
Owen Anderson	fc16e5a98f	Be more careful about invariance reasoning on "store" queries. Stores still need to depend on Ref and ModRef calls within the invariant region. llvm-svn: 85380	2009-10-28 06:30:52 +00:00
Owen Anderson	d0e86d57c1	Add trivial support for the invariance intrinsics to memdep. This logic is purely local for now. llvm-svn: 85378	2009-10-28 06:18:42 +00:00
Victor Hernandez	f390e04a47	Rename MallocFreeHelper as MemoryBuiltins llvm-svn: 85286	2009-10-27 20:05:49 +00:00
Victor Hernandez	762195bd01	Rename MallocHelper as MallocFreeHelper, since it now also identifies calls to free() llvm-svn: 85181	2009-10-26 23:58:56 +00:00
Victor Hernandez	de5ad42aa1	Remove FreeInst. Remove LowerAllocations pass. Update some more passes to treate free calls just like they were treating FreeInst. llvm-svn: 85176	2009-10-26 23:43:48 +00:00
Victor Hernandez	e297149e26	Auto-upgrade free instructions to calls to the builtin free function. Update all analysis passes and transforms to treat free calls just like FreeInst. Remove RaiseAllocations and all its tests since FreeInst no longer needs to be raised. llvm-svn: 84987	2009-10-24 04:23:03 +00:00
Victor Hernandez	8acf2956b8	Remove AllocationInst. Since MallocInst went away, AllocaInst is the only subclass of AllocationInst, so it no longer is necessary. llvm-svn: 84969	2009-10-23 21:09:37 +00:00
Victor Hernandez	70e8505eb1	Memory dependence analysis was incorrectly stopping to scan for stores to a pointer at bitcast uses of a malloc call. It should continue scanning until the malloc call, and this patch fixes that. llvm-svn: 83931	2009-10-13 01:42:53 +00:00
Chris Lattner	7e6d56ebc5	Revert r82404, it is causing a bootstrap miscompile. This is very very scary, as it indicates a lurking bug. yay. llvm-svn: 82411	2009-09-20 22:44:26 +00:00
Chris Lattner	eea16a168a	improve memdep to eliminate bitcasts (and aliases, and noop geps) early for the stated reasons: this allows it to find more equivalences and depend less on code layout. llvm-svn: 82404	2009-09-20 21:00:18 +00:00
Victor Hernandez	537d8d99be	Enhance analysis passes so that they apply the same analysis to malloc calls as to MallocInst. Reviewed by Eli Friedman. llvm-svn: 82281	2009-09-18 21:34:51 +00:00
Dan Gohman	1ee6057b21	Make TargetData optional in MemoryDependenceAnalysis. llvm-svn: 77727	2009-07-31 20:53:12 +00:00
Dan Gohman	f3ee7eaac3	Remove an unnecessary header. llvm-svn: 77725	2009-07-31 20:47:45 +00:00
Chris Lattner	370aadabfc	factor the 'optimized sort' code out into a static helper function and use it from one more place. Patch by Jakub Staszak! llvm-svn: 75478	2009-07-13 17:20:05 +00:00
Chris Lattner	2f0c1c44d5	Move the re-sort of invalidated NonLocalPointerDeps cache earlier so that all code paths get it. PR4256 was about a case where the phi translation loop would find all preds in the Visited cache, so it could get by without re-sorting the NonLocalPointerDeps cache. Fix this by resorting it earlier, there is no reason not to do this. This patch inspired by Jakub Staszak's patch. llvm-svn: 75476	2009-07-13 17:14:23 +00:00
Chris Lattner	02274a7171	make memdep use the getModRefInfo method for stores instead of the low-level alias() method, allowing it to reason more aggressively about pointers into constant memory. PR4189 llvm-svn: 72403	2009-05-25 21:28:56 +00:00
Chris Lattner	8eda11bd9d	now that you can put a PointerIntPair in a SmallPtrSet, remove some hackish workarounds from memdep llvm-svn: 67971	2009-03-29 00:24:04 +00:00
Dale Johannesen	f61c8e81bd	Debug intriniscs should be skipped when looking for a dependency, not terminate the search. llvm-svn: 66709	2009-03-11 21:13:01 +00:00
Owen Anderson	f9a9cf96a1	Ignore debug intrinsics when computing dependences. llvm-svn: 66399	2009-03-09 05:12:38 +00:00
Zhou Sheng	c8e5085cd3	Remove this as dbginfo intrinsics has been defined as IntrNoMem. llvm-svn: 66256	2009-03-06 06:05:01 +00:00
Zhou Sheng	abe4192442	Ignore the debug info intrinsics when looking for dependency through basic block. llvm-svn: 66119	2009-03-05 01:45:43 +00:00
Chris Lattner	3f4591c89f	fix two more cases where we could let the NLPDI cache get unsorted. With this, sqlite3 now passes. llvm-svn: 62839	2009-01-23 07:12:16 +00:00
Chris Lattner	e3ea48c71e	Unconditionally reset 'cache' to zero, even if we don't need to resort it. This avoids using a dangling pointer. Reset NumSortedEntries after restoring Cache to avoid extraneous sorts. This fixes the reduced sqlite3 testcase, but apparently not the whole app. llvm-svn: 62838	2009-01-23 06:48:41 +00:00
Chris Lattner	706d40e662	a minor tweak to my previous patch, handle the invalidation case when there are multiple iterations of the loop. This fixes PR3375. llvm-svn: 62822	2009-01-23 00:27:03 +00:00
Chris Lattner	f09619d533	Fix PR3358, a really nasty bug where recursive phi translated analyses could be run without the caches properly sorted. This can fix all sorts of weirdness. Many thanks to Bill for coming up with the 'issorted' verification idea. llvm-svn: 62757	2009-01-22 07:04:01 +00:00
Chris Lattner	8b4be37275	fix PR3217: fully cached queries need to be verified against the visited set before they are used. If used, their blocks need to be added to the visited set so that subsequent queries don't use conflicting pointer values in the cache result blocks. llvm-svn: 61080	2008-12-16 07:10:09 +00:00
Chris Lattner	7ed5ccc517	if we have a phi translation failure of the start block, return just a clobber of the start block, not other random stuff as well. llvm-svn: 61026	2008-12-15 04:58:29 +00:00
Chris Lattner	ff9f3dba12	Implement initial support for PHI translation in memdep. This means that memdep keeps track of how PHIs affect the pointer in dep queries, which allows it to eliminate the load in cases like rle-phi-translate.ll, which basically end up being: BB1: X = load P br BB3 BB2: Y = load Q br BB3 BB3: R = phi [P] [Q] load R turning "load R" into a phi of X/Y. In addition to additional exposed opportunities, this makes memdep safe in many cases that it wasn't before (which is required for load PRE) and also makes it substantially more efficient. For example, consider: bb1: // has many predecessors. P = some_operator() load P In this example, previously memdep would scan all the predecessors of BB1 to see if they had something that would mustalias P. In some cases (e.g. test/Transforms/GVN/rle-must-alias.ll) it would actually find them and end up eliminating something. In many other cases though, it would scan and not find anything useful. MemDep now stops at a block if the pointer is defined in that block and cannot be phi translated to predecessors. This causes it to miss the (rare) cases like rle-must-alias.ll, but makes it faster by not scanning tons of stuff that is unlikely to be useful. For example, this speeds up GVN as a whole from 3.928s to 2.448s (60%)!. IMO, scalar GVN should be enhanced to simplify the rle-must-alias pointer base anyway, which would allow the loads to be eliminated. In the future, this should be enhanced to phi translate through geps and bitcasts as well (as indicated by FIXMEs) making memdep even more powerful. llvm-svn: 61022	2008-12-15 03:35:32 +00:00
Duncan Sands	c52b616ccf	Don't dereference the end() iterator. This was causing a bunch of failures when running "make ENABLE_EXPENSIVE_CHECKS=1 check". llvm-svn: 60832	2008-12-10 09:38:36 +00:00
Chris Lattner	0318b56f0e	loosen up an assertion that isn't valid when called from invalidateCachedPointerInfo. Thanks to Bill for sending me a testcase. llvm-svn: 60805	2008-12-09 22:45:32 +00:00
Chris Lattner	fa9f99aa12	Teach GVN to invalidate some memdep information when it does an RAUW of a pointer. This allows is to catch more equivalencies. For example, the type_lists_compatible_p function used to require two iterations of the gvn pass (!) to delete its 18 redundant loads because the first pass would CSE all the addressing computation cruft, which would unblock the second memdep/gvn passes from recognizing them. This change allows memdep/gvn to catch all 18 when run just once on the function (as is typical :) instead of just 3. On all of 403.gcc, this bumps up the # reundandancies found from: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted to: 63 gvn - Number of instructions PRE'd 154137 gvn - Number of instructions deleted 50185 gvn - Number of loads deleted +120 loads deleted isn't bad. llvm-svn: 60799	2008-12-09 22:06:23 +00:00
Chris Lattner	702e46ed54	Teach BasicAA::getModRefInfo(CallSite, CallSite) some tricks based on readnone/readonly functions. Teach memdep to look past readonly calls when analyzing deps for a readonly call. This allows elimination of a few more calls from 403.gcc: before: 63 gvn - Number of instructions PRE'd 153986 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted after: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted 5 calls isn't much, but this adds plumbing for the next change. llvm-svn: 60794	2008-12-09 21:19:42 +00:00
Chris Lattner	41efb68c44	Fix a fixme: allow memdep to see past read-only calls when doing load dependence queries. This allows GVN to eliminate a few more instructions on 403.gcc: 152598 gvn - Number of instructions deleted 49240 gvn - Number of loads deleted after: 153986 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted llvm-svn: 60786	2008-12-09 19:47:40 +00:00
Chris Lattner	254314e6bc	rename getNonLocalDependency -> getNonLocalCallDependency, and remove pointer stuff from it, simplifying the code a bit. llvm-svn: 60783	2008-12-09 19:38:05 +00:00
Chris Lattner	4f10733cf3	fix typos gabor noticed llvm-svn: 60754	2008-12-09 08:38:36 +00:00
Chris Lattner	75510d8d5c	restructure the top level non-local ptr dep query to handle the first block of a query specially. This makes the "complete query caching" subsystem more effective, avoiding predecessor queries. This speeds up GVN another 4%. llvm-svn: 60752	2008-12-09 07:52:59 +00:00
Chris Lattner	f903fe1df0	rename getNonLocalPointerDepInternal -> getNonLocalPointerDepFromBB and split its inner loop out into a new GetNonLocalInfoForBlock function. No functionality change. llvm-svn: 60751	2008-12-09 07:47:11 +00:00
Chris Lattner	aeaec0838b	if we have two elements, insert both, don't use std::sort. This speeds up the new GVN by another 3% llvm-svn: 60747	2008-12-09 07:05:45 +00:00
Chris Lattner	4d1281cdf2	If we're only adding one new element to 'Cache', insert it into its known position instead of using a full sort. This speeds up GVN by ~4% with the new memdep stuff. llvm-svn: 60746	2008-12-09 06:58:04 +00:00
Chris Lattner	e8113a70fa	convert a couple other places that use pred_iterator to use the caching pred iterator. llvm-svn: 60745	2008-12-09 06:44:17 +00:00
Chris Lattner	768e5bcafc	use hte new pred cache to speed up the new non-local memdep queries. This speeds up GVN using the new queries (not yet checked in) by just over 10%. llvm-svn: 60743	2008-12-09 06:28:49 +00:00
Chris Lattner	5ed409edfa	add another level of caching for non-local pointer queries, keeping track of whether the CachedNonLocalPointerInfo for a block is specific to a block. If so, just return it without any pred scanning. This is good for a 6% speedup on GVN (when it uses this lookup method, which it doesn't right now). llvm-svn: 60695	2008-12-08 07:31:50 +00:00
Chris Lattner	fdb8843133	add an assert. the cast<> below would catch this but a message is more useful. llvm-svn: 60674	2008-12-07 18:45:15 +00:00
Chris Lattner	82b7034753	factor some code better. llvm-svn: 60673	2008-12-07 18:42:51 +00:00
Chris Lattner	de4440c24b	factor some code, fixing some fixme's. llvm-svn: 60672	2008-12-07 18:39:13 +00:00
Chris Lattner	a28355de14	add support for caching pointer dependence queries. Nothing uses this yet so it "can't" break anything. That said, it does appear to work. llvm-svn: 60654	2008-12-07 08:50:20 +00:00
Chris Lattner	7564a3b81b	Some internal refactoring to make it easier to cache results. llvm-svn: 60650	2008-12-07 02:56:57 +00:00
Chris Lattner	2faa2c724a	Introduce a new MemDep::getNonLocalPointerDependency method. This will eventually take over load/store dep queries from getNonLocalDependency. For now it works fine, but is incredibly slow because it does no caching. Lets not switch GVN to use it until that is fixed :) llvm-svn: 60649	2008-12-07 02:15:47 +00:00
Chris Lattner	5a78604e39	push the "pointer case" up the analysis stack a bit. This causes duplication of logic (in 2 places) to determine what pointer a load/store touches. This will be addressed in a future commit. llvm-svn: 60648	2008-12-07 01:50:16 +00:00
Chris Lattner	ed494f791e	make clients have to know how to call getCallSiteDependencyFrom instead of making getDependencyFrom do it. llvm-svn: 60647	2008-12-07 01:21:14 +00:00
Chris Lattner	ccb9c3370a	rename some variables for consistency llvm-svn: 60644	2008-12-07 00:39:19 +00:00
Chris Lattner	e2069a6949	I love how using out of scope variables is not an error with GCC, no really I do. llvm-svn: 60643	2008-12-07 00:38:27 +00:00
Chris Lattner	056c090c67	Rename getCallSiteDependency -> getCallSiteDependencyFrom to emphasize the scanning and make it more similar to getDependencyFrom llvm-svn: 60642	2008-12-07 00:35:51 +00:00
Chris Lattner	d4d9588abc	a memdep query on a volatile load/store will always return clobber with the current implementation. Instead of returning a "precise clobber" just return a fuzzy one. This doesn't matter to any clients anyway and should speed up analysis time very very slightly. llvm-svn: 60641	2008-12-07 00:28:02 +00:00
Chris Lattner	f5891941b4	remove the ability to get memdep info for vaarg. I don't think the original impl was correct and noone actually makes the query anyway. llvm-svn: 60639	2008-12-07 00:21:18 +00:00
Chris Lattner	0e3d6337c6	Make a few major changes to memdep and its clients: 1. Merge the 'None' result into 'Normal', making loads and stores return their dependencies on allocations as Normal. 2. Split the 'Normal' result into 'Clobber' and 'Def' to distinguish between the cases when memdep knows the value is produced from when we just know if may be changed. 3. Move some of the logic for determining whether readonly calls are CSEs into memdep instead of it being in GVN. This still leaves verification that the arguments are hte same to GVN to let it know about value equivalences in different contexts. 4. Change memdep's call/call dependency analysis to use getModRefInfo(CallSite,CallSite) instead of doing something very weak. This only really matters for things like DSA, but someday maybe we'll have some other decent context sensitive analyses :) 5. This reimplements the guts of memdep to handle the new results. 6. This simplifies GVN significantly: a) readonly call CSE is slightly simpler b) I eliminated the "getDependencyFrom" chaining for load elimination and load CSE doesn't have to worry about volatile (they are always clobbers) anymore. c) GVN no longer does any 'lastLoad' caching, leaving it to memdep. 7. The logic in DSE is simplified a bit and sped up. A potentially unsafe case was eliminated. llvm-svn: 60607	2008-12-05 21:04:20 +00:00
Chris Lattner	eda6432beb	Make it illegal to call getDependency* on non-memory instructions like binary operators. llvm-svn: 60600	2008-12-05 18:46:19 +00:00
Chris Lattner	7e61dafc95	Reimplement the non-local dependency data structure in terms of a sorted vector instead of a densemap. This shrinks the memory usage of this thing substantially (the high water mark) as well as making operations like scanning it faster. This speeds up memdep slightly, gvn goes from 3.9376 to 3.9118s on 403.gcc This also splits out the statistics for the cached non-local case to differentiate between the dirty and clean cached case. Here's the stats for 403.gcc: 6153 memdep - Number of dirty cached non-local responses 169336 memdep - Number of fully cached non-local responses 162428 memdep - Number of uncached non-local responses yay for caching :) llvm-svn: 60313	2008-12-01 01:15:42 +00:00
Chris Lattner	47e81d0e90	Eliminate the DepResultTy abstraction. It is now completely redundant with MemDepResult, and MemDepResult has a nicer interface. llvm-svn: 60308	2008-11-30 23:17:19 +00:00
Chris Lattner	13cae612b9	Cache TargetData/AliasAnalysis in the pass instead of calling getAnalysis<>. getAnalysis<> is apparently extremely expensive. Doing this speeds up GVN on 403.gcc by 16%! llvm-svn: 60304	2008-11-30 19:24:31 +00:00
Chris Lattner	441042796d	Two changes: Make getDependency remove QueryInst for a dirty record's ReverseLocalDeps when we update it. This fixes a regression test failure from my last commit. Second, for each non-local cached information structure, keep a bit that indicates whether it is dirty or not. This saves us a scan over the whole thing in the common case when it isn't dirty. llvm-svn: 60274	2008-11-30 02:52:26 +00:00
Chris Lattner	fc678e2af5	introduce a typedef, no functionality change. llvm-svn: 60272	2008-11-30 02:30:50 +00:00
Chris Lattner	1b810bd5e6	Change NonLocalDeps to be a densemap of pointers to densemap instead of containing them by value. This increases the density (!) of NonLocalDeps as well as making the reallocation case faster. This speeds up gvn on 403.gcc by 2% and makes room for future improvements. I'm not super thrilled with having to explicitly manage the new/delete of the map, but it is necesary for the next change. llvm-svn: 60271	2008-11-30 02:28:25 +00:00
Chris Lattner	ff862c4e88	calls never depend on allocations. llvm-svn: 60268	2008-11-30 01:44:00 +00:00
Chris Lattner	3ff6d01586	Fix a fixme by making memdep's handling of allocations more logical. If we see that a load depends on the allocation of its memory with no intervening stores, we now return a 'None' depedency instead of "Normal". This tweaks GVN to do its optimization with the new result. llvm-svn: 60267	2008-11-30 01:39:32 +00:00
Chris Lattner	60444f8aa5	implement a fixme by introducing a new getDependencyFromInternal method that returns its result as a DepResultTy instead of as a MemDepResult. This reduces conversion back and forth. llvm-svn: 60266	2008-11-30 01:26:32 +00:00
Chris Lattner	2059753e66	Move the getNonLocalDependency method to a more logical place in the file, no functionality change. llvm-svn: 60265	2008-11-30 01:18:27 +00:00
Chris Lattner	3d5d5f2c6d	REmove an old fixme, resolve another fixme by adding liberal comments about what this class does. llvm-svn: 60264	2008-11-30 01:17:08 +00:00

... 2 3 4 5 6 ...

427 Commits