llvm-project

Commit Graph

Author	SHA1	Message	Date
Jakub Staszak	338863a546	Reverse order of checking SSE level when calculating compare cost, so we check AVX2 before AVX. llvm-svn: 170464	2012-12-18 22:57:56 +00:00
Arnold Schwaighofer	edd62b14e5	Optimistically analyse Phi cycles Analyse Phis under the starting assumption that they are NoAlias. Recursively look at their inputs. If they MayAlias/MustAlias there must be an input that makes them so. Addresses bug 14351. llvm-svn: 169788	2012-12-10 23:02:41 +00:00
Nadav Rotem	0a471ea66c	Cost Model: change the default cost of control flow instructions (br / ret / ...) to zero. llvm-svn: 169423	2012-12-05 21:21:26 +00:00
Preston Briggs	fd0b5c898a	Modified dump() to provide a little more information for dependences between instructions that don't share a common loop. Updated the test results appropriately. llvm-svn: 168965	2012-11-30 00:44:47 +00:00
Preston Briggs	5cb8cfae1e	Modified depends() to recognize that when all levels are "=" and there's no possible loo-independent dependence, then there's no dependence. Updated all test result appropriately. llvm-svn: 168719	2012-11-27 19:12:26 +00:00
Preston Briggs	1084fa2ef2	Modify depends(Src, Dst, PossiblyLoopIndependent). If the Src and Dst are the same instruction, no loop-independent dependence is possible, so we force the PossiblyLoopIndependent flag to false. The test case results are updated appropriately. llvm-svn: 168678	2012-11-27 06:41:46 +00:00
Preston Briggs	3ad394931d	Corrects a problem where we reply exclusively of GEPs to drive analysis. Better is to look for cases with useful GEPs and use them when possible. When a pair of useful GEPs is not available, use the raw SCEVs directly. This approach supports better analysis of pointer dereferencing. In parallel, all the test cases are updated appropriately. Cases where we have a store to *B++ can now be analyzed! llvm-svn: 168474	2012-11-21 23:50:04 +00:00
Hal Finkel	a6f86fc6fa	Phi speculation improvement for BasicAA This is a partial solution to PR14351. It removes some of the special significance of the first incoming phi value in the phi aliasing checking logic in BasicAA. In the context of a loop, the old logic assumes that the first incoming value is the interesting one (meaning that it is the one that comes from outside the loop), but this is often not the case. With this change, we now test first the incoming value that comes from a block other than the parent of the phi being tested. llvm-svn: 168245	2012-11-17 02:33:15 +00:00
Benjamin Kramer	3eb156306a	DependenceAnalysis: Print all dependency pairs when dumping. Update all testcases. Part of a patch by Preston Briggs. llvm-svn: 167827	2012-11-13 12:12:02 +00:00
Nadav Rotem	f036ca466e	CostModel: add another known vector trunc optimization. llvm-svn: 167488	2012-11-06 21:17:17 +00:00
Nadav Rotem	0914f0b262	Cost Model: add tables for some avx type-conversion hacks. llvm-svn: 167480	2012-11-06 19:33:53 +00:00
Nadav Rotem	c378a8067d	CostModel: Add tables for the common x86 compares. llvm-svn: 167421	2012-11-05 23:48:20 +00:00
Nadav Rotem	ae79765676	Code Model: Improve the accuracy of the zext/sext/trunc vector cost estimation. llvm-svn: 167412	2012-11-05 22:20:53 +00:00
Nadav Rotem	856ffa6677	Cost Model: Normalize the insert/extract index when splitting types llvm-svn: 167402	2012-11-05 21:12:13 +00:00
Nadav Rotem	020be9dc29	Cost Model: teach the cost model about expanding integers. llvm-svn: 167401	2012-11-05 21:11:10 +00:00
Nadav Rotem	7411623fd8	Implement the cost of abnormal x86 instruction lowering as a table. llvm-svn: 167395	2012-11-05 19:32:46 +00:00
Richard Osborne	a1fffcf73a	Don't infer whether a value is captured in the current function from the 'nocapture' attribute. The nocapture attribute only specifies that no copies are made that outlive the function. This isn't the same as there being no copies at all. This fixes PR14045. llvm-svn: 167381	2012-11-05 10:48:24 +00:00
Nadav Rotem	c2345cbe73	X86 CostModel: Add support for a some of the common arithmetic instructions for SSE4, AVX and AVX2. llvm-svn: 167347	2012-11-03 00:39:56 +00:00
Nadav Rotem	23848f8f1d	Add a stub for the x86 cost model impl. Implement a basic cost rule for inserting/extracting from XMM registers. llvm-svn: 167333	2012-11-02 23:27:16 +00:00
Nadav Rotem	13da94734c	CostModel: add support for Vector Insert and Extract. llvm-svn: 167329	2012-11-02 22:31:56 +00:00
Nadav Rotem	a6b91ac307	Add a cost model analysis that allows us to estimate the cost of IR-level instructions. llvm-svn: 167324	2012-11-02 21:48:17 +00:00
Benjamin Kramer	6dc1e2f287	Remove LoopDependenceAnalysis. It was unmaintained and not much more than a stub. The new DependenceAnalysis pass is both more general and complete. llvm-svn: 166810	2012-10-26 20:25:01 +00:00
Sebastian Pop	59b61b9e2c	dependence analysis Patch from Preston Briggs <preston.briggs@gmail.com>. This is an updated version of the dependence-analysis patch, including an MIV test based on Banerjee's inequalities. It's a fairly complete implementation of the paper Practical Dependence Testing Gina Goff, Ken Kennedy, and Chau-Wen Tseng PLDI 1991 It cannot yet propagate constraints between coupled RDIV subscripts (discussed in Section 5.3.2 of the paper). It's organized as a FunctionPass with a single entry point that supports testing for dependence between two instructions in a function. If there's no dependence, it returns null. If there's a dependence, it returns a pointer to a Dependence which can be queried about details (what kind of dependence, is it loop independent, direction and distance vector entries, etc). I haven't included every imaginable feature, but there's a good selection that should be adequate for supporting many loop transformations. Of course, it can be extended as necessary. Included in the patch file are many test cases, commented with C code showing the loops and array references. llvm-svn: 165708	2012-10-11 07:32:34 +00:00
James Molloy	77639e2c72	Add default JIT LIT variable. Patch by David Tweed! llvm-svn: 164996	2012-10-02 10:57:08 +00:00
Duncan Sands	8598a0ec80	Now that invoke of an intrinsic is possible (for the llvm.do.nothing intrinsic) teach the callgraph logic to not create callgraph edges to intrinsics for invoke instructions; it already skips this for call instructions. Fixes PR13903. llvm-svn: 164707	2012-09-26 17:16:01 +00:00
Arnold Schwaighofer	8dc34cfb99	BasicAA: Recognize cyclic NoAlias phis Enhances basic alias analysis to recognize phis whose first incoming values are NoAlias and whose other incoming values are just the phi node itself through some amount of recursion. Example: With this change basicaa reports that ptr_phi and ptr_phi2 do not alias each other. bb: ptr = ptr2 + 1 loop: ptr_phi = phi [bb, ptr], [loop, ptr_plus_one] ptr2_phi = phi [bb, ptr2], [loop, ptr2_plus_one] ... ptr_plus_one = gep ptr_phi, 1 ptr2_plus_one = gep ptr2_phi, 1 This enables the elimination of one load in code like the following: extern int foo; int test_noalias(int ptr, int num, int coeff) { int ptr2 = ptr; int result = (ptr++) * (coeff--); while (num--) { ptr2++ = ptr; result += (coeff--) * (ptr++); } ptr = foo; return result; } Part 2/2 of fix for PR13564. llvm-svn: 163319	2012-09-06 14:41:53 +00:00
Arnold Schwaighofer	76dca58c66	BasicAA: GEPs of NoAlias'ing base ptr with equivalent indices are NoAlias If we can show that the base pointers of two GEPs don't alias each other using precise analysis and the indices and base offset are equal then the two GEPs also don't alias each other. This is primarily needed for the follow up patch that analyses NoAlias'ing PHI nodes. Part 1/2 of fix for PR13564. llvm-svn: 163317	2012-09-06 14:31:51 +00:00
NAKAMURA Takumi	87abb0ee34	llvm/test/Analysis/Profiling: Mark 3 of them as REQUIRES: loadable_module. FIXME: profile_rt.dll could be built on win32. llvm-svn: 162811	2012-08-29 00:37:46 +00:00
Manman Ren	abbb01abea	Profile: set branch weight metadata with data generated from profiling. This patch implements ProfileDataLoader which loads profile data generated by -insert-edge-profiling and updates branch weight metadata accordingly. Patch by Alastair Murray. llvm-svn: 162799	2012-08-28 22:21:25 +00:00
Manman Ren	cf10446ffa	BranchProb: modify the definition of an edge in BranchProbabilityInfo to handle the case of multiple edges from one block to another. A simple example is a switch statement with multiple values to the same destination. The definition of an edge is modified from a pair of blocks to a pair of PredBlock and an index into the successors. Also set the weight correctly when building SelectionDAG from LLVM IR, especially when converting a Switch. IntegersSubsetMapping is updated to calculate the weight for each cluster. llvm-svn: 162572	2012-08-24 18:14:27 +00:00
Benjamin Kramer	2f47a3fb07	Fix broken check lines. I really need to find a way to automate this, but I can't come up with a regex that has no false positives while handling tricky cases like custom check prefixes. llvm-svn: 162097	2012-08-17 12:28:26 +00:00
Nadav Rotem	5d4e205874	MemoryDependenceAnalysis attempts to find the first memory dependency for function calls. Currently, if GetLocation reports that it did not find a valid pointer (this is the case for volatile load/stores), we ignore the result. This patch adds code to handle the cases where we did not obtain a valid pointer. rdar://11872864 PR12899 llvm-svn: 161802	2012-08-13 23:03:43 +00:00
Nick Lewycky	fb78083b1c	Stay rational; don't assert trying to take the square root of a negative value. If it's negative, the loop is already proven to be infinite. Fixes PR13489! llvm-svn: 161107	2012-08-01 09:14:36 +00:00
Chandler Carruth	ff123d5c63	Fix the remaining TCL-style quotes found in the testsuite. This is another mechanical change accomplished though the power of terrible Perl scripts. I have manually switched some "s to 's to make escaping simpler. While I started this to fix tests that aren't run in all configurations, the massive number of tests is due to a really frustrating fragility of our testing infrastructure: things like 'grep -v', 'not grep', and 'expected failures' can mask broken tests all too easily. Essentially, I'm deeply disturbed that I can change the testsuite so radically without causing any change in results for most platforms. =/ llvm-svn: 159547	2012-07-02 19:09:46 +00:00
Chandler Carruth	5da53436d5	Convert the uses of '\|&' to use '2>&1 \|' instead, which works on old versions of Bash. In addition, I can back out the change to the lit built-in shell test runner to support this. This should fix the majority of fallout on Darwin, but I suspect there will be a few straggling issues. llvm-svn: 159544	2012-07-02 18:37:59 +00:00
Chandler Carruth	a5a29f970e	Convert all tests using TCL-style quoting to use shell-style quoting. This was done through the aid of a terrible Perl creation. I will not paste any of the horrors here. Suffice to say, it require multiple staged rounds of replacements, state carried between, and a few nested-construct-parsing hacks that I'm not proud of. It happens, by luck, to be able to deal with all the TCL-quoting patterns in evidence in the LLVM test suite. If anyone is maintaining large out-of-tree test trees, feel free to poke me and I'll send you the steps I used to convert things, as well as answer any painful questions etc. IRC works best for this type of thing I find. Once converted, switch the LLVM lit config to use ShTests the same as Clang. In addition to being able to delete large amounts of Python code from 'lit', this will also simplify the entire test suite and some of lit's architecture. Finally, the test suite runs 33% faster on Linux now. ;] For my 16-hardware-thread (2x 4-core xeon e5520): 36s -> 24s llvm-svn: 159525	2012-07-02 12:47:22 +00:00
Nick Lewycky	474112d82c	If the step value is a constant zero, the loop isn't going to terminate. Fixes the assert reported in PR13228! llvm-svn: 159393	2012-06-28 23:44:57 +00:00
Andrew Trick	a3f9043196	SCEV: Handle a corner case reducing AddRecExpr * AddRecExpr If integer overflow causes one of the terms to reach zero, that can force the entire expression to zero. Fixes PR12929: cast<Ty>() argument of incompatible type llvm-svn: 157673	2012-05-30 03:35:20 +00:00
Andrew Trick	7fa4e0fea6	SCEV: Add MarkPendingLoopPredicates to avoid recursive isImpliedCond. getUDivExpr attempts to simplify by checking for overflow. isLoopEntryGuardedByCond then evaluates the loop predicate which may lead to the same getUDivExpr causing endless recursion. Fixes PR12868: clang 3.2 segmentation fault. llvm-svn: 157092	2012-05-19 00:48:25 +00:00
Bill Wendling	1981c0e533	FileCheck-ize tests. llvm-svn: 155434	2012-04-24 10:45:44 +00:00
Bill Wendling	4cf911c0cd	FileCheck-ize these tests. llvm-svn: 155433	2012-04-24 10:36:42 +00:00
Bill Wendling	cd6df16cb4	FileCheck-ize these tests. Harden some of them. llvm-svn: 155432	2012-04-24 09:15:38 +00:00
Benjamin Kramer	e364d195e9	Revert "SCEV: When expanding a GEP the final addition to the base pointer has NUW but not NSW." This isn't right either, reverting for now. llvm-svn: 154910	2012-04-17 06:33:57 +00:00
Benjamin Kramer	e1f4ca1b0f	SCEV: When expanding a GEP the final addition to the base pointer has NUW but not NSW. Found by inspection. llvm-svn: 154262	2012-04-07 17:19:26 +00:00
Rafael Espindola	5054ee82cc	Handle intrinsics in GlobalsModRef. Fixes pr12351. llvm-svn: 153604	2012-03-28 21:31:24 +00:00
Andrew Trick	7004e4b95e	SCEV fix: Handle loop invariant loads. Fixes PR11882: NULL dereference in ComputeLoadConstantCompareExitLimit. llvm-svn: 153480	2012-03-26 22:33:59 +00:00
Andrew Trick	bd11257df7	Test scalar evolution directly instead of testing the result of canonical indvars. llvm-svn: 153256	2012-03-22 17:09:31 +00:00
Eli Friedman	0774902a00	Duncan pointed out that if the alignment isn't explicitly specified, it defaults to the ABI alignment. Given that, make this code a bit more aggressive in such cases. llvm-svn: 151584	2012-02-27 23:16:46 +00:00
Eli Friedman	8bc169c3c5	Teach BasicAA about the LLVM IR rules that allow reading past the end of an object given sufficient alignment. Fixes PR12098. llvm-svn: 151553	2012-02-27 20:46:07 +00:00
Rafael Espindola	94df267db3	Change the implementation of dominates(inst, inst) to one based on what the verifier does. This correctly handles invoke. Thanks to Duncan, Andrew and Chris for the comments. Thanks to Joerg for the early testing. llvm-svn: 151469	2012-02-26 02:19:19 +00:00
Eli Bendersky	924f9a671d	Replace all instances of dg.exp file with lit.local.cfg, since all tests are run with LIT now and now Dejagnu. dg.exp is no longer needed. Patch reviewed by Daniel Dunbar. It will be followed by additional cleanup patches. llvm-svn: 150664	2012-02-16 06:28:33 +00:00
Nick Lewycky	4c378a4453	Change CaptureTracking to pass a Use* instead of a Value* when a value is captured. This allows the tracker to look at the specific use, which may be especially interesting for function calls. Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does not iterate until a fixpoint and does not guarantee that it produces the same result regardless of iteration order. The new implementation builds up a graph of how arguments are passed from function to function, and uses a bottom-up walk on the argument-SCCs to assign nocapture. This gets us nocapture more often, and does so rather efficiently and independent of iteration order. llvm-svn: 147327	2011-12-28 23:24:21 +00:00
Chandler Carruth	b024aa021d	Make the unreachable probability much much heavier. The previous probability wouldn't be considered "hot" in some weird loop structures or other compounding probability patterns. This makes it much harder to confuse, but isn't really a principled fix. I'd actually like it if we could model a zero probability, as it would make this much easier to reason about. Suggestions for how to do this better are welcome. llvm-svn: 147142	2011-12-22 09:26:37 +00:00
Chandler Carruth	6b0e34c445	Manually upgrade the test suite to specify the flag to cttz and ctlz. I followed three heuristics for deciding whether to set 'true' or 'false': - Everything target independent got 'true' as that is the expected common output of the GCC builtins. - If the target arch only has one way of implementing this operation, set the flag in the way that exercises the most of codegen. For most architectures this is also the likely path from a GCC builtin, with 'true' being set. It will (eventually) require lowering away that difference, and then lowering to the architecture's operation. - Otherwise, set the flag differently dependending on which target operation should be tested. Let me know if anyone has any issue with this pattern or would like specific tests of another form. This should allow the x86 codegen to just iteratively improve as I teach the backend how to differentiate between the two forms, and everything else should remain exactly the same. llvm-svn: 146370	2011-12-12 11:59:10 +00:00
Andrew Trick	d25089f8e0	SCEV fix. In general, Add/Mul expressions should not inherit NSW/NUW. This reverts r139450, fixes r139453, and adds much needed comments and a unit test. llvm-svn: 145367	2011-11-29 02:16:38 +00:00
Andrew Trick	5ec136c57e	Filecheckize. llvm-svn: 145363	2011-11-29 02:05:23 +00:00
Chris Lattner	6a144a2227	Upgrade syntax of tests using volatile instructions to use 'load volatile' instead of 'volatile load', which is archaic. llvm-svn: 145171	2011-11-27 06:54:59 +00:00
Nick Lewycky	0485d51a76	Don't forget to check FlagNW when determining whether an AddRecExpr will wrap or not. Patch by Brendon Cahoon! llvm-svn: 144173	2011-11-09 07:11:37 +00:00
Benjamin Kramer	652f576a70	2>&1 doesn't work here, it just creates an empty file called "&1" llvm-svn: 143117	2011-10-27 18:27:45 +00:00
Duncan Sands	a370f3e34e	Restore commits 142790 and 142843 - they weren't breaking the build bots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142919	2011-10-25 12:28:52 +00:00
Chandler Carruth	32f46e7c07	Fix the API usage in loop probability heuristics. It was incorrectly classifying many edges as exiting which were in fact not. These mainly formed edges into sub-loops. It was also not correctly classifying all returning edges out of loops as leaving the loop. With this match most of the loop heuristics are more rational. Several serious regressions on loop-intesive benchmarks like perlbench's loop tests when built with -enable-block-placement are fixed by these updated heuristics. Unfortunately they in turn uncover some other regressions. There are still several improvemenst that should be made to loop heuristics including trip-count, and early back-edge management. llvm-svn: 142917	2011-10-25 09:47:41 +00:00
Duncan Sands	805c5b92c8	Speculatively revert commits 142790 and 142843 to see if it fixes the dragonegg and llvm-gcc self-host buildbots. Original commit messages: - Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } - Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142916	2011-10-25 09:26:43 +00:00
Nick Lewycky	a58fb48a55	Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142843	2011-10-24 21:02:38 +00:00
Chandler Carruth	7111f4564c	Remove return heuristics from the static branch probabilities, and introduce no-return or unreachable heuristics. The return heuristics from the Ball and Larus paper don't work well in practice as they pessimize early return paths. The only good hitrate return heuristics are those for: - NULL return - Constant return - negative integer return Only the last of these three can possibly require significant code for the returning block, and even the last is fairly rare and usually also a constant. As a consequence, even for the cold return paths, there is little code on that return path, and so little code density to be gained by sinking it. The places where sinking these blocks is valuable (inner loops) will already be weighted appropriately as the edge is a loop-exit branch. All of this aside, early returns are nearly as common as all three of these return categories, and should actually be predicted as taken! Rather than muddy the waters of the static predictions, just remain silent on returns and let the CFG itself dictate any layout or other issues. However, the return heuristic was flagging one very important case: unreachable. Unfortunately it still gave a 1/4 chance of the branch-to-unreachable occuring. It also didn't do a rigorous job of finding those blocks which post-dominate an unreachable block. This patch builds a more powerful analysis that should flag all branches to blocks known to then reach unreachable. It also has better worst-case runtime complexity by not looping through successors for each block. The previous code would perform an N^2 walk in the event of a single entry block branching to N successors with a switch where each successor falls through to the next and they finally fall through to a return. Test case added for noreturn heuristics. Also doxygen comments improved along the way. llvm-svn: 142793	2011-10-24 12:01:08 +00:00
Nick Lewycky	9be7f277e4	Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142790	2011-10-24 06:57:05 +00:00
Nick Lewycky	9d28c26d77	Speculatively revert r142781. Bots are showing Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed. coming out of indvars. llvm-svn: 142786	2011-10-24 04:00:25 +00:00
Nick Lewycky	1700007ecc	Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142781	2011-10-23 23:43:14 +00:00
Chandler Carruth	1c8ace0e89	Teach the BranchProbabilityInfo pass to print its results, and use that to bring it under direct test instead of merely indirectly testing it in the BlockFrequencyInfo pass. The next step is to start adding tests for the various heuristics employed, and to start fixing those heuristics once they're under test. llvm-svn: 142778	2011-10-23 21:21:50 +00:00
Nick Lewycky	a6674c7fc9	Make SCEV's brute force analysis stronger in two ways. Firstly, we should be able to constant fold load instructions where the argument is a constant. Second, we should be able to watch multiple PHI nodes through the loop; this patch only supports PHIs in loop headers, more can be done here. With this patch, we now constant evaluate: static const int arr[] = {1, 2, 3, 4, 5}; int test() { int sum = 0; for (int i = 0; i < 5; ++i) sum += arr[i]; return sum; } llvm-svn: 142731	2011-10-22 19:58:20 +00:00
Chandler Carruth	deac50cba9	Generalize the reading of probability metadata to work for both branches and switches, with arbitrary numbers of successors. Still optimized for the common case of 2 successors for a conditional branch. Add a test case for switch metadata showing up in the BlockFrequencyInfo pass. llvm-svn: 142493	2011-10-19 10:32:19 +00:00
Chandler Carruth	d27a7a947b	Teach the BranchProbabilityInfo analysis pass to read any metadata encoding of probabilities. In the absense of metadata, it continues to fall back on static heuristics. This allows __builtin_expect, after lowering through llvm.expect a branch instruction's metadata, to actually enter the branch probability model. This is one component of resolving PR2577. llvm-svn: 142492	2011-10-19 10:30:30 +00:00
Chandler Carruth	343fad44ea	Add pass printing support to BlockFrequencyInfo pass. The implementation layer already had support for printing the results of this analysis, but the wiring was missing. Now that printing the analysis works, actually bring some of this analysis, and the BranchProbabilityInfo analysis that it wraps, under test! I'm planning on fixing some bugs and doing other work here, so having a nice place to add regression tests and a way to observe the results is really useful. llvm-svn: 142491	2011-10-19 10:12:41 +00:00
Andrew Trick	887a111e31	Missing test case for r141164. llvm-svn: 141166	2011-10-05 06:23:32 +00:00
Nick Lewycky	3155552461	Reapply r140979 with fix! We never did get a testcase, but careful review of the logic by David Meyer revealed this bug. llvm-svn: 140992	2011-10-03 07:10:45 +00:00
Nick Lewycky	b1dbce1406	Revert r140979 due to reports of bootstrap failure. llvm-svn: 140980	2011-10-03 05:14:59 +00:00
Nick Lewycky	3c624b8d0d	Add one more case we compute a max trip count. llvm-svn: 140979	2011-10-03 01:03:57 +00:00
Eli Friedman	5f476dc3ef	PR10628: Fix getModRefInfo so it queries the underlying alias() implementation correctly while checking nocapture calls. llvm-svn: 140666	2011-09-28 00:34:27 +00:00
Eli Friedman	5c91891cf3	Enhance alias analysis for atomic instructions a bit. Upgrade a couple alias-analysis tests to the new atomic instructions. llvm-svn: 140557	2011-09-26 20:15:28 +00:00
Andrew Trick	57d8afde93	This test only makes sense with -enable-iv-rewrite. llvm-svn: 139576	2011-09-13 02:45:26 +00:00
Eli Friedman	3d1b307672	Fix the logic in BasicAliasAnalysis::aliasGEP for comparing GEP's with variable differences so that it actually does something sane. Fixes PR10881. llvm-svn: 139276	2011-09-08 02:23:31 +00:00
Owen Anderson	653cb03191	Teach BasicAA about the aliasing properties of memset_pattern16. Fixes PR10872 and <rdar://problem/10065079>. llvm-svn: 139204	2011-09-06 23:33:25 +00:00
Nick Lewycky	e0aa54bb98	This transform only handles two-operand AddRec's. Prevent it from trying to handle anything more complex. Fixes PR10383 again! llvm-svn: 139186	2011-09-06 21:42:18 +00:00
Nick Lewycky	658bdb5133	The logic inside getMulExpr to simplify {a,+,b}*{c,+,d} was wrong, which was visible given a=b=c=d=1, on iteration #1 (the second iteration). Replace it with correct math. Fixes PR10383! llvm-svn: 139133	2011-09-06 05:05:14 +00:00
Nick Lewycky	b1438c763a	Revert r139126 due to selfhost failures reported by buildbots. llvm-svn: 139130	2011-09-06 02:43:13 +00:00
Nick Lewycky	c4c43fbb07	Teach SCEV to report a max backedge count in one interesting case in HowFarToZero; the case for a canonical loop. llvm-svn: 139126	2011-09-05 23:25:16 +00:00
Rafael Espindola	7161661863	Move the loads after the calls so that the fix for PR10292 doesn't show that the loads don't alias the allocas. llvm-svn: 134852	2011-07-09 23:53:58 +00:00
Rafael Espindola	7fbab4dcc7	Use CHECK-NEXT. llvm-svn: 134850	2011-07-09 22:56:50 +00:00
Chris Lattner	8936d2bfbc	Remove support for parsing the "type i32" syntax for defining a numbered top level type without a specified number. This syntax isn't documented and blocks forward progress. llvm-svn: 133371	2011-06-19 00:03:46 +00:00
Chris Lattner	80ed9dc9e5	rip out a ton of intrinsic modernization logic from AutoUpgrade.cpp, which is for pre-2.9 bitcode files. We keep x86 unaligned loads, movnt, crc32, and the target indep prefetch change. As usual, updating the testsuite is a PITA. llvm-svn: 133337	2011-06-18 06:05:24 +00:00
Chris Lattner	5756c16cdf	make the asmparser reject function and type redefinitions. 'Merging' hasn't been needed since llvm-gcc 3.4 days. llvm-svn: 133248	2011-06-17 07:06:44 +00:00
Chris Lattner	def1949c00	Remove support for using "foo" as symbols instead of %"foo". This is ancient syntax and has been long obsolete. As usual, updating the tests is the nasty part of this. llvm-svn: 133242	2011-06-17 06:36:20 +00:00
Chris Lattner	b90ed2233c	manually upgrade a bunch of tests to modern syntax, and remove some that are either unreduced or only test old syntax. llvm-svn: 133228	2011-06-17 03:14:27 +00:00
John McCall	51fbfc928c	Test case for r132797. llvm-svn: 132962	2011-06-14 03:02:05 +00:00
Dan Gohman	adf80ae9e4	Reapply r131781, now that the GVN bug with partially-aliasing loads is disabled. llvm-svn: 132632	2011-06-04 06:50:18 +00:00
Dan Gohman	43efe1c8bd	Remove this test, which should have been reverted along with r131781. llvm-svn: 132628	2011-06-04 06:21:23 +00:00
Dan Gohman	87fdceaf73	Revert r131781 again. Apparently there is more going on here. llvm-svn: 132625	2011-06-04 05:11:22 +00:00
Dan Gohman	27b82f2f91	Reapply r131781 (revert r131809), now that some BasicAA shortcomings it exposed are fixed. llvm-svn: 132611	2011-06-04 00:46:31 +00:00
Dan Gohman	fb02cec44e	Fix BasicAA's recursion detection so that it doesn't pessimize queries in the case of a DAG, where a query reaches a node visited earlier, but it's not on a cycle. This avoids MayAlias results in cases where BasicAA is expected to return MustAlias or PartialAlias in order to protect TBAA. llvm-svn: 132609	2011-06-04 00:31:50 +00:00
Dan Gohman	4e7e7958d7	When merging MustAlias and PartialAlias, chose PartialAlias instead of conservatively choosing MayAlias. llvm-svn: 132579	2011-06-03 20:17:36 +00:00
Dan Gohman	0573b55c2b	Make DecomposeGEPExpression check SimplifyInstruction only after checking for a GEP, so that it matches what GetUnderlyingObject does. This fixes an obscure bug turned up by bugpoint in the testcase for PR9931. llvm-svn: 131971	2011-05-24 18:24:08 +00:00
Chris Lattner	408cfef6f0	I missed a checking with my GVN change. llvm-svn: 131851	2011-05-22 07:20:02 +00:00
Duncan Sands	5ec65765e6	Revert commit 131781, to see if it fixes the x86-64 dragonegg buildbot. Original log message: When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131809	2011-05-21 20:54:46 +00:00
Dan Gohman	8b20187c82	When BasicAA can determine that two pointers have the same base but differ by a dynamic offset, return PartialAlias instead of MayAlias. See the comment in the code for details. This fixes PR9971. llvm-svn: 131781	2011-05-21 01:05:08 +00:00
Dan Gohman	5394c70d1e	Teach BasicAA about arm.neon.vld1 and vst1. llvm-svn: 130327	2011-04-27 20:44:28 +00:00
Dan Gohman	39b3a1ef7f	When analyzing functions known to only access argument pointees, only check arguments with pointer types. Update the documentation of IntrReadArgMem reflect this. While here, add support for TBAA tags on intrinsic calls. llvm-svn: 130317	2011-04-27 18:39:03 +00:00
Andrew Trick	01eff820ae	Test case and comment for PR9633. llvm-svn: 130294	2011-04-27 05:42:17 +00:00
Benjamin Kramer	ba446cc12a	Make tests more useful. lit needs a linter ... llvm-svn: 130126	2011-04-25 10:12:01 +00:00
Eli Friedman	c5f22a7815	PR9634: Don't unconditionally tell the AliasSetTracker that the PreheaderLoad is equivalent to any other relevant value; it isn't true in general. If it is equivalent, the LoopPromoter will tell the AST the equivalence. Also, delete the PreheaderLoad if it is unused. Chris, since you were the last one to make major changes here, can you check that this is sane? llvm-svn: 129049	2011-04-07 01:35:06 +00:00
Chris Lattner	57ee5a5db7	remove postdom frontiers, because it is dead. Forward dom frontiers are still used by RegionInfo :( llvm-svn: 128943	2011-04-05 21:57:17 +00:00
Anders Carlsson	c4f0ab397c	Revert r128140 for now. llvm-svn: 128149	2011-03-23 15:51:12 +00:00
Anders Carlsson	9ed8d93f55	A global variable with internal linkage where all uses are in one function and whose address is never taken is a non-escaping local object and can't alias anything else. llvm-svn: 128140	2011-03-23 02:19:48 +00:00
Andrew Trick	f6b01ff422	Propagate SCEV no-wrap flags whenever possible. This needs review. llvm-svn: 127638	2011-03-15 00:37:00 +00:00
Andrew Trick	2afa325811	When SCEV can determine the loop test is X < X, set ExactBECount=0. When ExactBECount is a constant, use it for MaxBECount. When MaxBECount cannot be computed, replace it with ExactBECount. Fixes PR9424. llvm-svn: 127342	2011-03-09 17:29:58 +00:00
Chris Lattner	4f23f2be15	teach SCEV that the scale and addition of an inbounds gep don't NSW. This fixes a FIXME in scev-aa.ll (allowing a new no-alias result) and generally makes things more precise. llvm-svn: 125449	2011-02-13 03:14:49 +00:00
Chris Lattner	7936a8a488	Per discussion with Dan G, inbounds geps certainly can have unsigned overflow (e.g. "gep P, -1"), and while they can have signed wrap in theoretical situations, modelling an AddRec as not having signed wrap is going enough for any case we can think of today. In the future if this isn't enough, we can revisit this. Modeling them as having NUW isn't causing any known problems either FWIW. llvm-svn: 125410	2011-02-11 21:43:33 +00:00
Dan Gohman	4deda530c2	Add another rdar number. llvm-svn: 124125	2011-01-24 17:54:01 +00:00
Nick Lewycky	d4192f71b5	Simplify some code with no functionality change. Make the test a lot more robust against smarter optimizations, using the power of FileCheck. llvm-svn: 124081	2011-01-23 20:06:05 +00:00
Nick Lewycky	bc98f5b78e	Use value ranges to fold ext(trunc) in SCEV when possible. llvm-svn: 124062	2011-01-23 06:20:19 +00:00
Tobias Grosser	f07426b40d	Implement requiredTransitive The PassManager did not implement the transitivity of requiredTransitive. This was unnoticed since 2006. llvm-svn: 123942	2011-01-20 21:03:22 +00:00
Nick Lewycky	5c901f3489	Similarly, analyze truncate through multiply. llvm-svn: 123842	2011-01-19 18:56:00 +00:00
Nick Lewycky	5143f0f09b	Add a missed SCEV fold that is required to continue analyzing the IR produced by indvars through the scev expander. trunc(add x, y) --> add(trunc x, y). Currently SCEV largely folds the other way which is probably wrong, but preserved to minimize churn. Instcombine doesn't do this fold either, demonstrating a missed optz'n opportunity on code doing add+trunc+add. llvm-svn: 123838	2011-01-19 16:59:46 +00:00
Nick Lewycky	e9ea75e3fc	Add a missing SCEV simplification sext(zext x) --> zext x. llvm-svn: 123832	2011-01-19 15:56:12 +00:00
Dan Gohman	44da55b7be	Teach BasicAA to return PartialAlias in cases where both pointers are pointing to the same object, one pointer is accessing the entire object, and the other is access has a non-zero size. This prevents TBAA from kicking in and saying NoAlias in such cases. llvm-svn: 123775	2011-01-18 21:16:06 +00:00
Eric Christopher	31bb4c5811	Revert the testcase from the previous reverted commit. llvm-svn: 123227	2011-01-11 09:20:44 +00:00
Chris Lattner	1032965cbe	add a testcase I missed in previous commit. llvm-svn: 123143	2011-01-09 23:52:31 +00:00
Chris Lattner	10223a3fbf	teach SCEV analysis of PHI nodes that PHI recurences formed with GEP instructions are always NUW, because PHIs cannot wrap the end of the address space. llvm-svn: 123105	2011-01-09 02:28:48 +00:00
Chris Lattner	a337f5ec5c	reduce indentation. Print <nuw> and <nsw> when dumping SCEV AddRec's that have the bit set. llvm-svn: 123104	2011-01-09 02:16:18 +00:00
Chris Lattner	16e42128c2	fix rdar://8813415 - a miscompilation of 164.gzip that loop-idiom exposed. It turns out to be a latent bug in basicaa, scary. llvm-svn: 122772	2011-01-03 21:03:33 +00:00
Chris Lattner	12fa3c6a94	filecheckize llvm-svn: 122771	2011-01-03 21:01:26 +00:00
Dan Gohman	189508c4c5	-enable-tbaa is on by default now. llvm-svn: 121945	2010-12-16 02:53:48 +00:00
Dan Gohman	e1a17a3473	Make memcpyopt TBAA-aware. llvm-svn: 121944	2010-12-16 02:51:19 +00:00
Duncan Sands	0a2c416894	Move Sub simplifications and additional Add simplifications out of instcombine and into InstructionSimplify. llvm-svn: 121861	2010-12-15 14:07:39 +00:00
Dan Gohman	c4bf5cac9f	Reapply r121520, PartialAlias implementation for BasicAA, now that memdep is updated to handle it. llvm-svn: 121725	2010-12-13 22:50:24 +00:00
Dan Gohman	39de62348f	Revert r121520, which may have introduced miscompilations. llvm-svn: 121573	2010-12-10 21:48:28 +00:00
Dan Gohman	041f74e762	Implement PartialAlias checking in BasicAA. llvm-svn: 121520	2010-12-10 20:47:03 +00:00
Chris Lattner	d513faf41f	remove fixme comment too. llvm-svn: 120493	2010-11-30 23:25:01 +00:00
Chris Lattner	370797a1fb	check in all files. This is now handled by my previous DSE commit. llvm-svn: 120492	2010-11-30 23:23:59 +00:00
NAKAMURA Takumi	6ea8a947e8	test: Check the feature 'loadable_module' with load modules in %llvmshlibdir. %llvmshlibdir should be 'bin' on Cygming. llvm-svn: 120282	2010-11-29 07:58:32 +00:00
Dan Gohman	22e0e1cecb	Delete unneeded ssp attributes. llvm-svn: 118836	2010-11-11 21:08:46 +00:00
Dan Gohman	dcdfd8dd24	TBAA-enable ArgumentPromotion. llvm-svn: 118804	2010-11-11 18:09:32 +00:00
Dan Gohman	0cc4c7516e	Make Sink tbaa-aware. llvm-svn: 118788	2010-11-11 16:21:47 +00:00
Dan Gohman	3cb92d809b	Add a testcase which demonstrates alias analysis pass precedence. llvm-svn: 118755	2010-11-11 01:03:30 +00:00
Dan Gohman	2e8ca44b81	Fully invalidate cached results when a prior query's size or type is insufficient for, or incompatible with, the current query. llvm-svn: 118721	2010-11-10 21:45:11 +00:00
Dan Gohman	e3467a7687	Teach FunctionAttrs about the VAArg instruction. llvm-svn: 118627	2010-11-09 20:17:38 +00:00
Dan Gohman	2a9221793a	Add a testcase for a call which BasicAA says only accesses memory through its arguments and which TBAA says doesn't write to memory. llvm-svn: 118439	2010-11-08 20:20:11 +00:00
Dan Gohman	2cd1fd4a82	Make FunctionAttrs TBAA-aware. llvm-svn: 118417	2010-11-08 17:12:04 +00:00
Dan Gohman	15a43965ac	Teach memdep to use pointsToConstantMemory to determine that loads from constant memory don't alias any stores. llvm-svn: 117636	2010-10-29 01:14:04 +00:00
Dan Gohman	c16d9afe04	Add a basic testcase for TBAA-aware DSE. llvm-svn: 117632	2010-10-29 00:54:02 +00:00
Dan Gohman	55a028680c	Add some comments. llvm-svn: 116957	2010-10-20 22:04:02 +00:00
Dan Gohman	408beac597	Don't pass the raw invalid pointer used to represent conflicting TBAA information to AliasAnalysis. llvm-svn: 116751	2010-10-18 21:28:00 +00:00
Dan Gohman	fe8abf88a0	Add a basic testcase for TBAA-aware LICM. llvm-svn: 116745	2010-10-18 21:00:09 +00:00
Dan Gohman	f7a5e20372	Run tbaa before basicaa, since that's how it's expected to be used. llvm-svn: 116731	2010-10-18 18:45:59 +00:00
Dan Gohman	33fcde9b9c	Make TypeBasedAliasAnalysis default to doing nothing, with a command-line option to enable it. llvm-svn: 116722	2010-10-18 18:17:47 +00:00
Dan Gohman	02538ac4d3	Make BasicAliasAnalysis a normal AliasAnalysis implementation which does normal initialization and normal chaining. Change the default AliasAnalysis implementation to NoAlias. Update StandardCompileOpts.h and friends to explicitly request BasicAliasAnalysis. Update tests to explicitly request -basicaa. llvm-svn: 116720	2010-10-18 18:04:47 +00:00
Dan Gohman	65eb03ed6b	Add a simple testcase for tbaa. llvm-svn: 116272	2010-10-11 23:54:13 +00:00
Benjamin Kramer	0aea752f6d	Remove PointerTracking tests. llvm-svn: 115072	2010-09-29 19:20:35 +00:00
Eli Friedman	ab3a128582	PR7959: Handle negative scales in GEPs correctly in BasicAA for non-64-bit targets. llvm-svn: 114015	2010-09-15 20:08:03 +00:00
Chris Lattner	bb451461ec	remove some noise from tests. llvm-svn: 112889	2010-09-02 22:35:33 +00:00
Michael J. Spencer	7983340465	Fix constant-over-index.ll test on windows. llvm-svn: 112483	2010-08-30 15:08:02 +00:00
Chris Lattner	3decde9305	refix PR1143 by making basicaa analyze zexts of indices aggresively, which I broke with a recent patch. llvm-svn: 111452	2010-08-18 23:09:49 +00:00
Chris Lattner	a25c05ed15	fix a buggy test llvm-svn: 111354	2010-08-18 04:55:12 +00:00
Chris Lattner	a33edcb56c	fix PR7589: In brief: gep P, (zext x) != gep P, (sext x) DecomposeGEPExpression was getting this wrong, confusing basicaa. llvm-svn: 111352	2010-08-18 04:28:19 +00:00
Chris Lattner	c8e38eb60b	filecheckize and detrivialize. llvm-svn: 111350	2010-08-18 04:25:43 +00:00
Dan Gohman	f7495f286a	When analyzing loop exit conditions combined with and and or, don't make any assumptions about when the two conditions will agree on when to permit the loop to exit. This fixes PR7845. llvm-svn: 110758	2010-08-11 00:12:36 +00:00
Tobias Grosser	7fbe6cb429	RegionInfo: Do not assert if a BB is not part of the dominance tree. llvm-svn: 110665	2010-08-10 09:54:35 +00:00
Dan Gohman	e68958fcdf	Implement a proper getModRefInfo for va_arg. llvm-svn: 110458	2010-08-06 18:24:38 +00:00
Dan Gohman	884dd752c3	Implement AccessesArguments checking in the two-callsite form of BasicAA::getModRefInfo. This allows BasicAA to say that two memset calls to non-aliasing memory locations don't interfere. llvm-svn: 110393	2010-08-05 23:34:50 +00:00
Dan Gohman	26ef7c7ab7	Fix memdep's code for reasoning about dependences between two calls. A Ref response from getModRefInfo is not useful here. Instead, check for identical calls only in the NoModRef case. Reapply r110270, and strengthen it to compensate for the memdep changes. When both calls are readonly, there is no dependence between them. llvm-svn: 110382	2010-08-05 22:09:15 +00:00
Dan Gohman	554b012f67	Revert r110270 for now. It appears to uncover a memdep bug. llvm-svn: 110293	2010-08-05 00:43:10 +00:00
Dan Gohman	109561845b	The trouble with testing for "ModRef" and "NoModRef" is that one is a suffix of the other, and FileCheck accepts superstrings. Adjust the output to avoid this problem. llvm-svn: 110280	2010-08-04 23:37:55 +00:00
Dan Gohman	bd33dab633	The two-callsite form of AliasAnalysis::getModRefInfo is documented to return Ref if the left callsite only reads memory read or written by the right callsite; fix BasicAliasAnalysis to implement this. Add AliasAnalysisEvaluator support for testing the two-callsite form of getModRefInfo. llvm-svn: 110270	2010-08-04 22:56:29 +00:00
Tobias Grosser	336734aca6	Add new RegionInfo pass. The RegionInfo pass detects single entry single exit regions in a function, where a region is defined as any subgraph that is connected to the remaining graph at only two spots. Furthermore an hierarchical region tree is built. Use it by calling "opt -regions analyze" or "opt -view-regions". llvm-svn: 109089	2010-07-22 07:46:31 +00:00
Dan Gohman	00ef93258a	Remove interprocedural-basic-aa and associated code. The AliasAnalysis interface needs implementations to be consistent, so any code which wants to support different semantics must use a different interface. It's not currently worthwhile to add a new interface for this new concept. Document that AliasAnalysis doesn't support cross-function queries. llvm-svn: 107776	2010-07-07 14:27:09 +00:00
Dan Gohman	84f90a387d	Remove context sensitivity concerns from interprocedural-basic-aa, and make it more aggressive in cases where both pointers are known to live in the same function. llvm-svn: 107420	2010-07-01 20:08:40 +00:00
Dan Gohman	c0cca7fdda	Revert the part of r107257 which introduced new logic for using nsw and nuw flags from IR Instructions. On further consideration, this isn't valid. llvm-svn: 107298	2010-06-30 17:27:11 +00:00
Dan Gohman	725ed0364b	Add a testcase for scev-aa's new capability. llvm-svn: 107258	2010-06-30 07:17:47 +00:00
Dan Gohman	9bbd007f15	Add a few more interesting testcases. llvm-svn: 107177	2010-06-29 18:17:11 +00:00
Dan Gohman	0824affeff	Add an Intraprocedural form of BasicAliasAnalysis, which aims to properly handles instructions and arguments defined in different functions, or across recursive function iterations. llvm-svn: 107109	2010-06-29 00:50:39 +00:00
Dan Gohman	7c34ece501	Fix Value::stripPointerCasts and BasicAA to avoid trouble on code in unreachable blocks, which have have use-def cycles. This fixes PR7514. llvm-svn: 107071	2010-06-28 21:16:52 +00:00
Dan Gohman	f820bd327d	Allow "exhaustive" trip count evaluation on phi nodes with all constant operands. llvm-svn: 106537	2010-06-22 13:15:46 +00:00
Dan Gohman	866971ed3d	Fix ScalarEvolution's "exhaustive" trip count evaluation code to avoid assuming that loops are in canonical form, as ScalarEvolution doesn't depend on LoopSimplify itself. Also, with indirectbr not all loops can be simplified. This fixes PR7416. llvm-svn: 106389	2010-06-19 14:17:24 +00:00
Dan Gohman	24ceda8eb0	Revert r106304 (105548 and friends), which are the SCEVComplexityCompare optimizations. There is still some nondeterminism remaining. llvm-svn: 106306	2010-06-18 19:54:20 +00:00
Dan Gohman	4c807fca97	Reapply 105540, 105542, and 105548, and revert r105732. llvm-svn: 106304	2010-06-18 19:26:04 +00:00
Daniel Dunbar	e16d569932	Workaround SCEV non-determinism on this test, for now, to get buildbots back to green. Dan, please revert this once the real problem is fixed. llvm-svn: 105732	2010-06-09 17:54:40 +00:00
Dan Gohman	70910a6ab6	Optimize ScalarEvolution's SCEVComplexityCompare predicate: don't go scrounging through SCEVUnknown contents and SCEVNAryExpr operands; instead just do a simple deterministic comparison of the precomputed hash data. Also, since this is more precise, it eliminates the need for the slow N^2 duplicate detection code. llvm-svn: 105540	2010-06-07 19:06:13 +00:00
Dan Gohman	d07d2f9774	Add a comment to this test. llvm-svn: 102387	2010-04-26 21:37:43 +00:00
Dan Gohman	f33bac3afe	ScalarEvolution support for <= and >= loops. Also, generalize ScalarEvolutions's min and max recognition to handle some new forms of min and max that this change makes more common. llvm-svn: 102234	2010-04-24 03:09:42 +00:00
Chris Lattner	126a58e084	fix some failures my callgraph dump format change broke. llvm-svn: 102197	2010-04-23 18:38:40 +00:00
Dan Gohman	acd700a24b	Don't attempt to analyze values which are obviously undef. This fixes some assertion failures in extreme cases. llvm-svn: 102042	2010-04-22 01:35:11 +00:00
Dan Gohman	6635bb26a6	Generalize ScalarEvolution's PHI analysis to handle loops that don't have preheaders or dedicated exit blocks, as clients may not otherwise need to run LoopSimplify. llvm-svn: 101030	2010-04-12 07:49:36 +00:00
Dan Gohman	cb45bd9cb3	Pointers to zero-sized objects don't point to overlapping objects. llvm-svn: 100789	2010-04-08 18:11:50 +00:00
Chris Lattner	3ae2dd2ba5	add newlines at the end of files. llvm-svn: 100705	2010-04-07 22:53:17 +00:00
Mon P Wang	c576ee9040	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Mon P Wang	999c1b927b	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a972ab8564	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Bob Wilson	6f7fd28824	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	7460571381	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Dan Gohman	69451a0950	Avoid analyzing instructions in blocks not reachable from the entry block. They are lots of trouble, and they don't matter. This fixes PR6559. llvm-svn: 98103	2010-03-09 23:46:50 +00:00
Chris Lattner	7d2c1592f3	remove andersen's tests. llvm-svn: 97490	2010-03-01 20:23:15 +00:00
Dan Gohman	6b1e2a829d	Teach ScalarEvolution how to compute a tripcount for a loop with true or false as its exit condition. These are usually eliminated by SimplifyCFG, but the may be left around during a pass which wishes to preserve the CFG. llvm-svn: 96683	2010-02-19 18:12:07 +00:00
Dan Gohman	80386c10d4	-disable-output is no longer needed with -analyze. llvm-svn: 94574	2010-01-26 19:25:59 +00:00
Dan Gohman	51aaf02821	Fix the the ceiling-division used in computing the MaxBECount so that it doesn't have trouble with an intermediate add overflowing. Also, be more conservative about the case where the induction variable in an SLT loop exit can step past the RHS of the SLT and overflow in a single step. Make getSignedRange more aggressive, to recover for some common cases which the above fixes pessimized. This addresses rdar://7561161. llvm-svn: 94512	2010-01-26 04:40:18 +00:00
Tobias Grosser	b478d3e0fc	Fix PR6047 Nodes that had children outside of the post dominator tree (infinite loops) where removed from the post dominator tree. This seems to be wrong. Leave them in the tree. llvm-svn: 93633	2010-01-16 13:38:07 +00:00
Dan Gohman	bc694918cc	Use WriteAsOperand instead of getName() to print loop header names, so that unnamed blocks are handled. llvm-svn: 93059	2010-01-09 18:17:45 +00:00
Dan Gohman	fb4193625a	Delete useless trailing semicolons. llvm-svn: 92740	2010-01-05 17:55:26 +00:00
Chris Lattner	850a3cd905	gvn is optimizing this better now. llvm-svn: 90696	2009-12-06 04:16:05 +00:00
Dan Gohman	03f90ab0a9	Add a comment about A[i+(j+1)]. llvm-svn: 90185	2009-12-01 01:38:10 +00:00
Chris Lattner	5fe97e7aca	@test9 is a testcase for r89958. Before 89958, we misanalyzed the first expression as P+4+4i which we considered to possibly alias P+4j. Now we correctly analyze the former one as P+1+4i. @test10 is a sanity test that verfies that we know that P+4+4i != P+4*i. llvm-svn: 89960	2009-11-26 19:25:46 +00:00
Chris Lattner	1bf7ff704a	Implement PR1143 (at -m64) by making basicaa look through extensions. We previously already handled it at -m32 because there were no i32->i64 extensions for addressing. llvm-svn: 89959	2009-11-26 18:53:33 +00:00
Chris Lattner	631c5b2cb9	teach GetLinearExpression to be a bit more aggressive. llvm-svn: 89955	2009-11-26 17:00:01 +00:00
Chris Lattner	ba0014a44c	update status of this. basicaa is much improved now, only missing the one form (in this testcase). Dan, do you consider this example to be important? llvm-svn: 89953	2009-11-26 16:42:00 +00:00
Chris Lattner	29bc8a91d3	Teach basicaa that x\|c == x+c when the c bits of x are clear. This allows us to compile the example in readme.txt into: LBB1_1: ## %bb movl 4(%rdx,%rax), %ecx movl %ecx, %esi imull (%rdx,%rax), %esi imull %esi, %ecx movl %esi, 8(%rdx,%rax) imull %ecx, %esi movl %ecx, 12(%rdx,%rax) movl %esi, 16(%rdx,%rax) imull %ecx, %esi movl %esi, 20(%rdx,%rax) addq $16, %rax cmpq $4000, %rax jne LBB1_1 instead of: LBB1_1: movl (%rdx,%rax), %ecx imull 4(%rdx,%rax), %ecx movl %ecx, 8(%rdx,%rax) imull 4(%rdx,%rax), %ecx movl %ecx, 12(%rdx,%rax) imull 8(%rdx,%rax), %ecx movl %ecx, 16(%rdx,%rax) imull 12(%rdx,%rax), %ecx movl %ecx, 20(%rdx,%rax) addq $16, %rax cmpq $4000, %rax jne LBB1_1 GCC (4.2) doesn't seem to be able to eliminate the loads in this testcase either, it generates: L2: movl (%rdx), %eax imull 4(%rdx), %eax movl %eax, 8(%rdx) imull 4(%rdx), %eax movl %eax, 12(%rdx) imull 8(%rdx), %eax movl %eax, 16(%rdx) imull 12(%rdx), %eax movl %eax, 20(%rdx) addl $4, %ecx addq $16, %rdx cmpl $1002, %ecx jne L2 llvm-svn: 89952	2009-11-26 16:26:43 +00:00
Chris Lattner	12dacdd359	teach basicaa that A[i] != A[i+1]. llvm-svn: 89951	2009-11-26 16:18:10 +00:00
Chris Lattner	453751031a	rename test llvm-svn: 89950	2009-11-26 16:08:41 +00:00
Chris Lattner	7a5b56aca9	Change the other half of aliasGEP (which handles GEP differencing) to use DecomposeGEPExpression. This dramatically simplifies and shrinks the code by eliminating the horrible CheckGEPInstructions method, fixes a miscompilation (@test3 ) and makes the code more aggressive. In particular, we now handle the @test4 case, which is reduced from the SmallPtrSet constructor. Missing this caused us to emit a variable length memset instead of a fixed size one. llvm-svn: 89922	2009-11-26 02:17:34 +00:00
Chris Lattner	0d23076adf	add a new random feature test llvm-svn: 89921	2009-11-26 02:16:28 +00:00
Chris Lattner	db1e9f1290	remove a silly condition that doesn't make a lot of sense anymore. llvm-svn: 89601	2009-11-22 16:15:59 +00:00
Victor Hernandez	fcc77b1c02	Update computeArraySize() to use ComputeMultiple() to determine the array size associated with a malloc; also extend PerformHeapAllocSRoA() to check if the optimized malloc's arg had its highest bit set, so that it is safe for ComputeMultiple() to look through sext instructions while determining the optimized malloc's array size llvm-svn: 86676	2009-11-10 08:32:25 +00:00
Victor Hernandez	f3db915294	Re-commit r86077 now that r86290 fixes the 179.art and 175.vpr ARM regressions. Here is the original commit message: This commit updates malloc optimizations to operate on malloc calls that have constant int size arguments. Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86311	2009-11-07 00:16:28 +00:00
Victor Hernandez	b9f5899779	Revert r86077 because it caused crashes in 179.art and 175.vpr on ARM llvm-svn: 86213	2009-11-06 01:33:24 +00:00
Victor Hernandez	492ed30a32	Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86077	2009-11-05 00:03:03 +00:00
Kenneth Uildriks	90fedc6ef9	Make opt default to not adding a target data string and update tests that depend on target data to supply it within the test llvm-svn: 85900	2009-11-03 15:29:06 +00:00
Edward O'Callaghan	f96ce30236	Convert Analysis tests to FileCheck in regards to PR5307. llvm-svn: 85241	2009-10-27 14:54:46 +00:00
Dan Gohman	3b7ba5f35b	Teach BasicAA how to analyze Select instructions, and make it more aggressive on PHI instructions. llvm-svn: 85158	2009-10-26 21:55:43 +00:00
Dan Gohman	7f2413f18b	Update these tests to match what Loop::print now prints. llvm-svn: 85021	2009-10-24 23:52:07 +00:00
Chris Lattner	1353518b6c	fix test llvm-svn: 84405	2009-10-18 05:03:00 +00:00
Chris Lattner	d2b3a4f7b8	tighten up test3, add test3a for the converse transform, which isn't happening yet. llvm-svn: 84402	2009-10-18 04:55:26 +00:00
Chris Lattner	457ecd5dab	tighten test2, add a test that it doesn't get transformed in the invalid edge case. llvm-svn: 84401	2009-10-18 04:50:18 +00:00
Nick Lewycky	ecb832fd93	Merge tests into modref.ll. Also add a test for r84174 at Chris' behest! llvm-svn: 84400	2009-10-18 04:41:36 +00:00
Nick Lewycky	91ea404e98	Add a couple new testcases. llvm-svn: 84385	2009-10-18 00:42:07 +00:00
Chris Lattner	ec411e9199	replace a useless test with a useful one llvm-svn: 84383	2009-10-17 23:59:51 +00:00
Nick Lewycky	f01ba005a7	Make use of the result of the loads even though that means adding -instcombine. llvm-svn: 84125	2009-10-14 19:02:13 +00:00
Evan Cheng	c1eed9d120	Another BasicAA fix. If a value does not alias a GEP's base pointer, then it cannot alias the GEP. GEP pointer alias rule states this clearly: A pointer value formed from a getelementptr instruction is associated with the addresses associated with the first operand of the getelementptr. llvm-svn: 84079	2009-10-14 06:41:49 +00:00
Evan Cheng	c745bf2d87	Replace test with a simpler hand crafted one. llvm-svn: 84069	2009-10-14 01:45:10 +00:00
Evan Cheng	c10e88db22	Teach basic AA about PHI nodes. If all operands of a phi NoAlias another value than it's safe to declare the PHI NoAlias the value. Ditto for MustAlias. llvm-svn: 84038	2009-10-13 22:02:20 +00:00
Chris Lattner	faa0320f27	don't use dead loads as tests. llvm-svn: 83985	2009-10-13 17:39:29 +00:00
Nick Lewycky	e2782c7614	Teach BasicAA a little something about the atomic intrinsics: they can only modify through the pointer they're given. llvm-svn: 83959	2009-10-13 07:48:38 +00:00
Victor Hernandez	e6ff7662b6	Revert 82694 "Auto-upgrade malloc instructions to malloc calls." because it causes regressions in the nightly tests. llvm-svn: 82784	2009-09-25 18:11:52 +00:00
Victor Hernandez	46cd467310	Auto-upgrade malloc instructions to malloc calls. Reviewed by Devang Patel. llvm-svn: 82694	2009-09-24 17:47:49 +00:00
Dan Gohman	36bad00bef	Teach ScalarEvolution how to reason about no-wrap flags on loops where the induction variable has a non-unit stride, such as {0,+,2}, and there are expressions such as {1,+,2} inside the loop formed with or or add nsw operators. llvm-svn: 82151	2009-09-17 18:05:20 +00:00
Dan Gohman	0f3ef7be50	Eliminate more redundant llvm-as calls. llvm-svn: 81540	2009-09-11 18:17:12 +00:00
Dan Gohman	1880092722	Change tests from "opt %s" to "opt < %s" so that opt doesn't see the input filename so that opt doesn't print the input filename in the output so that grep lines in the tests don't unintentionally match strings in the input filename. llvm-svn: 81537	2009-09-11 18:01:28 +00:00
Dan Gohman	c8054d90fb	Eliminate more uses of llvm-as and llvm-dis. llvm-svn: 81293	2009-09-09 00:09:15 +00:00
Dan Gohman	4f2527cd6d	Convert a few more opt \| llvm-dis to opt -S. llvm-svn: 81261	2009-09-08 22:41:33 +00:00
Dan Gohman	72a13d2476	Use opt -S instead of piping bitcode output through llvm-dis. llvm-svn: 81257	2009-09-08 22:34:10 +00:00
Dan Gohman	9737a63ed8	Change these tests to feed the assembly files to opt directly, instead of using llvm-as, now that opt supports this. llvm-svn: 81226	2009-09-08 16:50:01 +00:00
Andreas Neustifter	5673c0aace	Updated tests to use ProfileVerifer to test ProfileLoader and ProfileEstimator. (Keep disabled test disabled until selfhosted build issue is resolved.) llvm-svn: 81008	2009-09-04 17:21:59 +00:00
Daniel Dunbar	a48a2f6055	Revert "--- Reverse-merging r80908 into '.':", I already "fixed" this. llvm-svn: 80970	2009-09-03 23:40:10 +00:00
Bill Wendling	92291f6ad0	--- Reverse-merging r80908 into '.': D test/Analysis/Profiling --- Reverse-merging r80907 into '.': U lib/Analysis/ProfileInfoLoaderPass.cpp Attempt to remove failure in the self-hosting build bot. llvm-svn: 80966	2009-09-03 23:13:46 +00:00
Daniel Dunbar	d8df76eaeb	Disable some parts of the profiling-tool-chain test, which is currently failing on a self-hosted build (although it seems to work on non-self hosted). I'll work with Andreas to figure this out. llvm-svn: 80947	2009-09-03 21:09:53 +00:00
Daniel Dunbar	f563ac919b	Reapply profiling tests. llvm-svn: 80908	2009-09-03 07:38:00 +00:00
Andreas Neustifter	69e2afe030	Removed temporarily because of breaking Darwin builds. (See http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20090831/086214.html) llvm-svn: 80799	2009-09-02 16:47:24 +00:00
Andreas Neustifter	da6b0fa8ed	Changed profiling-tool-chain.ll test to use optimal-edge-profiling instead of edge-profiling, this is more useful since the loading of the optimal-edge-profiling is more complicated. The edge-profiling is tested in edge-profiling.ll where only the instrumentation is tested. llvm-svn: 80791	2009-09-02 14:24:08 +00:00
Daniel Dunbar	0935339f81	Don't force the triple or data layout in this test. We just have to get them from the host and hope that works. llvm-svn: 80751	2009-09-02 02:43:11 +00:00
Chris Lattner	2d3e0a35bd	rename test so that name reflects what it is testing for. llvm-svn: 80519	2009-08-30 21:36:39 +00:00
Chris Lattner	c7d5796af8	convert to filecheck format. llvm-svn: 80518	2009-08-30 21:36:06 +00:00
Torok Edwin	e8a5863c3f	rm needs -f llvm-svn: 80363	2009-08-28 14:05:07 +00:00
Torok Edwin	483691a185	Remove the llvmprof.out from the test output, otherwise running make check in a non-clean directory causes it to fail (for example when running make check twice), since execution counts will differ. llvm-svn: 80362	2009-08-28 13:35:44 +00:00
Andreas Neustifter	ed3a6bedca	Remove profiling output file because two consecutive runs of make check give error. llvm-svn: 80357	2009-08-28 10:38:26 +00:00
Andreas Neustifter	d8e04ab883	Removed unnecessary file creation during test. llvm-svn: 80356	2009-08-28 10:07:41 +00:00
Andreas Neustifter	52afe22566	Pulled all tests into one test. Removed some redundant tests. Rename. llvm-svn: 80355	2009-08-28 10:00:28 +00:00
Andreas Neustifter	d7f3569555	Readded test from r79615, this tests the complete profiling tool chain. Furhter tests can test only parts of this system. llvm-svn: 80348	2009-08-28 06:41:00 +00:00
Dan Gohman	d926b985df	Create a ScalarEvolution-based AliasAnalysis implementation. This is a simple AliasAnalysis implementation which works by making ScalarEvolution queries. ScalarEvolution has a more complete understanding of arithmetic than BasicAA's collection of ad-hoc checks, so it handles some cases that BasicAA misses, for example p[i] and p[i+1] within the same iteration of a loop. This is currently experimental. It may be that the main use for this pass will be to help find cases where BasicAA can be profitably extended, or to help in the development of the overall AliasAnalysis infrastructure, however it's also possible that it could grow up to become a directly useful pass. llvm-svn: 80098	2009-08-26 14:53:06 +00:00
Andreas Neustifter	3fac9a4fc5	Removed profiling test, lli not available on all platforms. llvm-svn: 79633	2009-08-21 15:27:35 +00:00
Andreas Neustifter	f715778de9	Added tests for Profiling Infrastructure. llvm-svn: 79615	2009-08-21 09:36:28 +00:00
Dan Gohman	463d3407e2	Loosen up the regex for this test so that it doesn't implicitly depend on TargetData information. llvm-svn: 79491	2009-08-19 23:19:36 +00:00
Dan Gohman	e274526d78	Make LLVM Assembly dramatically easier to read by aligning the comments, using formatted_raw_ostream's PadToColumn. Before: bb1: ; preds = %bb %2 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %3 = getelementptr double* %p, i64 %2 ; <double> [#uses=1] %4 = load double %3, align 8 ; <double> [#uses=1] %5 = fmul double %4, 1.100000e+00 ; <double> [#uses=1] %6 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %7 = getelementptr double* %p, i64 %6 ; <double> [#uses=1] After: bb1: ; preds = %bb %2 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %3 = getelementptr double %p, i64 %2 ; <double> [#uses=1] %4 = load double %3, align 8 ; <double> [#uses=1] %5 = fmul double %4, 1.100000e+00 ; <double> [#uses=1] %6 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %7 = getelementptr double* %p, i64 %6 ; <double*> [#uses=1] Several tests required whitespace adjustments. llvm-svn: 78816	2009-08-12 17:23:50 +00:00
Andreas Bolka	3becda83ef	Add another Strong-SIV testcase. llvm-svn: 78446	2009-08-08 00:21:49 +00:00
Andreas Bolka	787591a594	Fix Strong-SIV testcase. llvm-svn: 78384	2009-08-07 15:42:32 +00:00
Andreas Bolka	13b860992a	ZIV tester for LDA. llvm-svn: 78157	2009-08-05 04:26:05 +00:00
Andreas Bolka	2979eb8f35	Fix LDA testcases. llvm-svn: 78153	2009-08-05 04:03:29 +00:00
Andreas Bolka	71fc19e991	Expand LDA testcases. llvm-svn: 77926	2009-08-02 23:28:14 +00:00
Andreas Bolka	2f84b5ab46	Slightly reformat LDA tests to ease grepping. llvm-svn: 77398	2009-07-28 23:40:40 +00:00
Dan Gohman	9c7f808201	Change the assembly syntax for nsw, nuw, and exact, putting them after their associated opcodes rather than before. This makes them a little easier to read. llvm-svn: 77194	2009-07-27 16:11:46 +00:00
Dan Gohman	534d66a426	When attempting to sign-extend an addrec by interpreting the step value as unsigned, the start value and the addrec itself still need to be treated as signed. llvm-svn: 77078	2009-07-25 16:03:30 +00:00
Dan Gohman	62ef6a7f1c	Teach ScalarEvolution to make use of no-overflow flags when analyzing add recurrences. llvm-svn: 77034	2009-07-25 01:22:26 +00:00
Andreas Bolka	dcb9f483bf	FileCheck'ize and expand LDA testcases. llvm-svn: 76880	2009-07-23 15:56:53 +00:00
Dan Gohman	430f0cc544	Replace the original ad-hoc code for determining whether (v pred w) implies (x pred y) with more thorough code that does more complete canonicalization before resorting to range checks. This helps it find more cases where the canonicalized expressions match. llvm-svn: 76671	2009-07-21 23:03:19 +00:00
Dan Gohman	52e14d2272	Add a testcase for PR4569, which is now fixed. llvm-svn: 76526	2009-07-21 00:50:52 +00:00
Torok Edwin	8f2906a2e8	Introduce a pointertracking pass. For now this only computes the allocated size of the memory pointed to by a pointer, and offset a pointer from allocated pointer. The actual checkLimits part will come later, after another round of review. llvm-svn: 75657	2009-07-14 18:44:28 +00:00
Dan Gohman	054d2a7837	Add testcases for PR4538, PR4537, and PR4534. llvm-svn: 75533	2009-07-13 22:30:31 +00:00
Nick Lewycky	3292908132	When comparing constants, consider a less wide constant to be "less complex" than a wider one, before trying to compare their contents which will crash if their sizes are different. llvm-svn: 74792	2009-07-04 17:24:52 +00:00
Andreas Bolka	9541801105	Array accesses are independent if the underlying arrays differ. llvm-svn: 74499	2009-06-30 02:12:10 +00:00
Andreas Bolka	9d09e20142	Print pairwise dependence results, add testcases. llvm-svn: 74402	2009-06-28 00:35:22 +00:00
Dan Gohman	5f71a2886a	Add a testcase demoing some of ScalarEvolution's new trip count logic. llvm-svn: 74049	2009-06-24 01:22:30 +00:00
Dan Gohman	53efeb0e45	Fix a bug in the trip-count computation with And/Or. If either of the sides is CouldNotCompute, the resulting exact count must be CouldNotCompute. llvm-svn: 73920	2009-06-22 23:28:56 +00:00
Dan Gohman	2636693a3c	Fix llvm::ComputeNumSignBits to handle pointer types conservatively correctly, instead of aborting. llvm-svn: 73908	2009-06-22 22:02:32 +00:00
Dan Gohman	96212b661c	Teach ScalarEvolution how to analyze loops with multiple exit blocks, and also exit blocks with multiple conditions (combined with (bitwise) ands and ors). It's often infeasible to compute an exact trip count in such cases, but a useful upper bound can often be found. llvm-svn: 73866	2009-06-22 00:31:57 +00:00
Dan Gohman	0104842ee3	Fix ScalarEvolution's backedge-taken count computations to check for overflow when computing a integer division to round up. Thanks to Nick Lewycky for noticing this! llvm-svn: 73862	2009-06-21 23:46:38 +00:00
Dan Gohman	eddf77123a	Teach ScalarEvolution how to recognize another xor(and(x, C), C) case. If C is a single bit and the and gets analyzed as a truncate and zero-extend, the xor can be represnted as an add. llvm-svn: 73664	2009-06-18 00:00:20 +00:00
Dan Gohman	432af7ace0	Add -disable-output to a bunch of tests that don't care about the output. llvm-svn: 73633	2009-06-17 20:56:26 +00:00
Dan Gohman	b50f5a46e0	Fix ScalarEvolution's Xor handling to not assume that an And that gets recognized with a SCEVZeroExtendExpr must be an And with a low-bits mask. With r73540, this is no longer the case. llvm-svn: 73594	2009-06-17 01:22:39 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Dan Gohman	776e4c8d35	Teach BasicAliasAnalysis to understand constant gep indices that fall beyond their associated static array type. I believe that this fixes a legitimate bug, because BasicAliasAnalysis already has code to check for this condition that works for non-constant indices, however it was missing the case of constant indices. With this change, it checks for both. This fixes PR4267, and miscompiles of SPEC 188.ammp and 464.h264.href. llvm-svn: 72451	2009-05-27 01:48:27 +00:00
Dan Gohman	6350296efc	Teach ScalarEvolution to recognize x^-1 in the case where non-demanded bits have been stripped out by instcombine. llvm-svn: 72010	2009-05-18 16:29:04 +00:00
Dan Gohman	8c77f1a275	Make ScalarEvolution::isLoopGuardedByCond work even when the edge entering a loop is a non-split critical edge. llvm-svn: 72004	2009-05-18 15:36:09 +00:00
Dan Gohman	b81dd48fd2	Add nounwind to a few tests. llvm-svn: 72002	2009-05-18 15:16:49 +00:00
Eli Friedman	ebf98b0212	Allow scalar evolution to compute iteration counts for loops with a pointer-based condition. This fixes PR3171. llvm-svn: 71354	2009-05-09 12:32:42 +00:00
Dan Gohman	35dc9b65ed	Fix bogus overflow checks by replacing them with actual overflow checks. llvm-svn: 71284	2009-05-08 23:11:16 +00:00
Dan Gohman	2e55cc5a4a	Fold trunc casts into add-recurrence expressions, allowing the add-recurrence to be exposed. Add a new SCEV folding rule to help simplify expressions in the presence of these extra truncs. llvm-svn: 71264	2009-05-08 21:03:19 +00:00
Dan Gohman	7227bc88f0	When printing a SCEVUnknown with pointer type, don't print an artificial "ptrtoint", as it tends to clutter up complicated expressions. The cast operators now print both source and destination types, which is usually sufficient. llvm-svn: 70554	2009-05-01 17:02:22 +00:00
Dan Gohman	2b8da35f9d	Extend ScalarEvolution's getBackedgeTakenCount to be able to compute an upper-bound value for the trip count, in addition to the actual trip count. Use this to allow getZeroExtendExpr and getSignExtendExpr to fold casts in more cases. This may eventually morph into a more general value-range analysis capability; there are certainly plenty of places where more complete value-range information would allow more folding. llvm-svn: 70509	2009-04-30 20:47:05 +00:00
Dan Gohman	494dac3f84	Generalize the cast-of-addrec folding to handle folding of SCEVs like (sext i8 {-128,+,1} to i64) to i64 {-128,+,1}, where the iteration crosses from negative to positive, but is still safe if the trip count is within range. llvm-svn: 70421	2009-04-29 22:28:28 +00:00
Dan Gohman	d9775a3be1	Fix this test to match the new output from scalar-evolution. llvm-svn: 70410	2009-04-29 21:06:20 +00:00
Dan Gohman	d9b11b2ef4	Include the source type in SCEV cast expression debug output, and print sext, zext, and trunc, instead of signextend, zeroextend, and truncate, respectively, for consistency with the main IR. llvm-svn: 70405	2009-04-29 20:27:52 +00:00
Dan Gohman	807dff7486	Fix a grammaro in a comment. llvm-svn: 70331	2009-04-28 21:54:23 +00:00
Nick Lewycky	b4d9f7a9b3	Simplify trunc(extend(x)) in SCEVs, just for completeness. Also fix some odd whitespace in the same file. llvm-svn: 69870	2009-04-23 05:15:08 +00:00
Owen Anderson	7d82244be7	Testcase for PR3909. llvm-svn: 69868	2009-04-23 04:33:42 +00:00
Dan Gohman	e14efcc9f4	When turning (ashr(shl(x, n), n)) into sext(trunc(x)), the width of the type to truncate to should be the number of bits of the value that are preserved, not the number that are clobbered with sign-extension. This fixes regressions in ldecod. llvm-svn: 69704	2009-04-21 20:18:36 +00:00
Chris Lattner	d35d43dde8	change this to test for an alias result more directly. llvm-svn: 67046	2009-03-16 18:28:27 +00:00
Nick Lewycky	8e0f9ac051	Add a replacement for 2009-02-12-GEPNoalias.ll that works without -debug. llvm-svn: 67011	2009-03-14 19:40:09 +00:00
Chris Lattner	a18c768e6d	remove a buggy test, it is not ok to use -debug in RUN line. llvm-svn: 66918	2009-03-13 18:19:34 +00:00
Dan Gohman	b4e896baed	Update this test for the LoopInfo::print changes. llvm-svn: 65597	2009-02-27 00:17:49 +00:00
Dan Gohman	0bddac16a8	Rename ScalarEvolution's getIterationCount to getBackedgeTakenCount, to more accurately describe what it does. Expand its doxygen comment to describe what the backedge-taken count is and how it differs from the actual iteration count of the loop. Adjust names and comments in associated code accordingly. llvm-svn: 65382	2009-02-24 18:55:53 +00:00
Nick Lewycky	c60bd012bc	BasicAA was making the assumption that a local allocation which hadn't escaped couldn't ever be the return of call instruction. However, it's quite possible that said local allocation is itself the return of a function call. That's what malloc and calloc are for, actually. llvm-svn: 64442	2009-02-13 07:06:27 +00:00
Owen Anderson	1caf7fef8e	Finish making AliasAnalysis aware of the fact that most atomic intrinsics only dereference their arguments, and enhance BasicAA to make use of this fact when computing ModRef info. llvm-svn: 63718	2009-02-04 05:16:46 +00:00
Nick Lewycky	52348300a4	Wind SCEV back in time, to Nov 18th. This 'fixes' PR3275, PR3294, PR3295, PR3296 and PR3302. llvm-svn: 62160	2009-01-13 09:18:58 +00:00
Nick Lewycky	380292a51a	Don't try to analyze this "backward" case. This is overly conservative pending a correct solution. llvm-svn: 61589	2009-01-02 18:54:17 +00:00
Nick Lewycky	d80ff135b5	Check that the function prototypes are correct before assuming that the parameters are pointers. llvm-svn: 61451	2008-12-27 16:20:53 +00:00
Nick Lewycky	2abb108f1b	Resubmit support for the 'nocapture' attribute. The problematic part of this patch is that we were out of attribute bits, requiring some fancy bit hacking to make it fit (by shrinking alignment) without breaking existing users or the file format. This change will require users to rebuild llvm-gcc to match llvm. llvm-svn: 61239	2008-12-19 06:39:12 +00:00
Bill Wendling	e38c7400c9	Remove empty test. llvm-svn: 61095	2008-12-16 19:07:17 +00:00
Bill Wendling	a397baea88	Temporarily revert r61019, r61030, and r61040. These were breaking LLVM Release builds. llvm-svn: 61094	2008-12-16 19:06:48 +00:00
Nick Lewycky	69c9aa4ce5	Generalize support for analyzing loops to include SLE/SGE loop exit conditions and support for non-unit strides with signed exit conditions. llvm-svn: 61082	2008-12-16 08:30:01 +00:00
Chris Lattner	e3401db1f3	Teach basicaa to use the nocapture attribute when possible. When the intrinsics are properly marked nocapture, the fixme should be addressed. llvm-svn: 61040	2008-12-15 18:59:22 +00:00
Nick Lewycky	729bf137a8	Revert my re-instated reverted commit, fixes the bootstrap build on x86-64 linux. llvm-svn: 60951	2008-12-12 17:09:07 +00:00
Nick Lewycky	6a344e097c	Sneaky, sneaky: move the -1 to the outside of the SMax. Reinstate the optimization of SGE/SLE with unit stride, now that it works properly. llvm-svn: 60881	2008-12-11 17:40:14 +00:00
Chris Lattner	2e84a548d6	Allow basicaa to walk through geps with identical indices in parallel, allowing it to decide that P/Q must alias if A/B must alias in things like: P = gep A, 0, i, 1 Q = gep B, 0, i, 1 This allows GVN to delete 62 more instructions out of 403.gcc. llvm-svn: 60820	2008-12-10 01:04:47 +00:00
Evan Cheng	058522f1da	xfail this for now. llvm-svn: 60777	2008-12-09 18:43:00 +00:00
Nick Lewycky	f545749f2b	It's easy to handle SLE/SGE when the loop has a unit stride. llvm-svn: 60748	2008-12-09 07:25:04 +00:00
Nick Lewycky	f5ffcbcd0b	Extend the 'noalias' attribute to function return values. This is intended to indicate functions that allocate, such as operator new, or list::insert. The actual definition is slightly less strict (for now). No changes to the bitcode reader/writer, asm printer or verifier were needed. llvm-svn: 59934	2008-11-24 03:41:24 +00:00
Nick Lewycky	1c451ae43e	Add a utility function that detects whether a loop is guaranteed to be finite. Use it to safely handle less-than-or-equals-to exit conditions in loops. These also occur when the loop exit branch is exit on true because SCEV inverses the icmp predicate. Use it again to handle non-zero strides, but only with an unsigned comparison in the exit condition. llvm-svn: 59528	2008-11-18 15:10:54 +00:00
Nick Lewycky	625c6f79b2	Don't brute-force analyze cubic or higher polynomials. If this patch causes a performance regression for anyone, please let me know, and it can be fixed in a different way with much more effort. llvm-svn: 59384	2008-11-16 04:14:25 +00:00
Nick Lewycky	7b14e20a5e	Don't crash analyzing certain quadratics (addrec of {X,+,Y,+,1}). We're still waiting on code that actually analyzes them properly. llvm-svn: 58592	2008-11-03 02:43:49 +00:00
Duncan Sands	9c40c28926	Rationalize the names of passes that print information: -callgraph => print-callgraph -callscc => print-callgraph-sccs -cfgscc => print-cfg-sccs -externalfnconstants => print-externalfnconstants -print => print-function -print-alias-sets (no change) -print-callgraph => dot-callgraph -print-cfg => dot-cfg -print-cfg-only => dot-cfg-only -print-dom-info (no change) -printm => print-module -printusedtypes => print-used-types llvm-svn: 56487	2008-09-23 12:47:39 +00:00
Duncan Sands	310077034a	Remove the MarkModRef pass (use AddReadAttrs instead). Unfortunately this means removing one regression test of GlobalsModRef because I couldn't work out how to perform it without MarkModRef. llvm-svn: 56342	2008-09-19 08:23:44 +00:00
Duncan Sands	af25ee7ffc	Add a new pass AddReadAttrs which works out which functions can get the readnone/readonly attributes, and gives them it. The plan is to remove markmodref (which did the same thing by querying GlobalsModRef) and delete the analogous functionality from GlobalsModRef. llvm-svn: 56341	2008-09-19 08:17:05 +00:00
Duncan Sands	938e8f60d6	Teach -callgraph to always print the callgraph (as the description says it does), not just when -analyze is used as well. This means printing to stderr, so adjust some tests. llvm-svn: 56337	2008-09-19 07:57:09 +00:00
Dan Gohman	dc5f5cbe59	Finally re-apply r46959. This is made feasible by the combination of r56230, r56232, and r56246. llvm-svn: 56247	2008-09-16 18:52:57 +00:00
Dan Gohman	162568842e	Fix spacing in the grep line for this test, following the recent SCEV-whitespace changes. llvm-svn: 56234	2008-09-16 01:37:08 +00:00
Dan Gohman	f9081a2cd5	Teach ScalarEvolution to consider loop preheaders in the search for an if statement that guards a loop, to allow indvars to avoid smax operations in more situations. llvm-svn: 56232	2008-09-15 22:18:04 +00:00
Dan Gohman	81313fd8d1	Fix WriteAsOperand to not emit a leading space character. Adjust its callers to emit a space character before calling it when a space is needed. This fixes several spurious whitespace issues in ScalarEvolution's debug dumps. See the test changes for examples. This also fixes odd space-after-tab indentation in the output for switch statements, and changes calls from being printed like this: call void @foo( i32 %x ) to this: call void @foo(i32 %x) llvm-svn: 56196	2008-09-14 17:21:12 +00:00
Duncan Sands	9ddb3145ae	Fix PR2792: treat volatile loads as writing memory somewhere. Treat stores as reading memory, just to play safe. llvm-svn: 56188	2008-09-13 12:45:50 +00:00
Duncan Sands	c189e79440	Correct callgraph construction. It has two problems: (1) code left over from the days of ConstantPointerRef: if a use of a function is a GlobalValue then that is not considered a reason to add an edge from the external node, even though the use may be as an initializer for an externally visible global! There might be some point to this behaviour when the use is by an alias (though the code predated aliases by some centuries), but I think PR2782 is a better way of handling that. (2) If function F calls function G, and also G is a parameter to the call, then an F->G edge is not added to the callgraph. While this doesn't seem to matter much, adding such an edge makes the callgraph more regular. In addition, the new code should be faster as well as simpler. llvm-svn: 55987	2008-09-09 12:40:47 +00:00
Duncan Sands	b86a788862	Testcase for commits 55700 and 55714. llvm-svn: 55715	2008-09-03 19:38:41 +00:00
Duncan Sands	0eca0571f8	Since onlyReadsMemory returns true if in fact doesNotAccessMemory, check doesNotAccessMemory first, since otherwise functions may be marked readonly rather than readnone. llvm-svn: 55697	2008-09-03 15:31:24 +00:00
Duncan Sands	42c644ef03	Cleanup GlobalsModRef a bit. When analysing the callgraph, when one member of a SCC calls another then the analysis would drop to mod-ref because there is (usually) no function info for the callee yet; fix this. Teach the analysis about function attributes, in particular the readonly attribute (which requires being careful about globals). llvm-svn: 55696	2008-09-03 12:55:42 +00:00
Owen Anderson	2a6adfa4f0	Remove GCSE and LoadVN from the testsuite. llvm-svn: 54832	2008-08-16 00:00:54 +00:00
Dan Gohman	2a62fd96a6	Extend ScalarEvolution's executesAtLeastOnce logic to be able to continue past the first conditional branch when looking for a relevant test. This helps it avoid using MAX expressions in loop trip counts in more cases. llvm-svn: 54697	2008-08-12 20:17:31 +00:00
Eli Friedman	61f67624c3	PR2621: Improvements to the SCEV AddRec binomial expansion. This version uses a new algorithm for evaluating the binomial coefficients which is significantly more efficient for AddRecs of more than 2 terms (see the comments in the code for details on how the algorithm works). It also fixes some bugs: it removes the arbitrary length restriction for AddRecs, it fixes the silent generation of incorrect code for AddRecs which require a wide calculation width, and it fixes an issue where we were incorrectly truncating the iteration count too far when evaluating an AddRec expression narrower than the induction variable. There are still a few related issues I know of: I think there's still an issue with the SCEVExpander expansion of AddRec in terms of the width of the induction variable used. The hack to avoid generating too-wide integers shouldn't be necessary; instead, the callers should be considering the cost of the expansion before expanding it (in addition to not expanding too-wide integers, we might not want to expand expressions that are really expensive, especially when optimizing for size; calculating an length-17 32-bit AddRec currently generates about 250 instructions of straight-line code on X86). Also, for long 32-bit AddRecs on X86, CodeGen really sucks at scheduling the code. I'm planning on filing follow-up PRs for these issues. llvm-svn: 54332	2008-08-04 23:49:06 +00:00
Eli Friedman	4736916aa6	Another SCEV issue from PR2607; essentially the same issue, but this time applying to the implicit comparison in smin expressions. The correct way to transform an inequality into the opposite inequality, either signed or unsigned, is with a not expression. I looked through the SCEV code, and I don't think there are any more occurrences of this issue. llvm-svn: 54194	2008-07-30 04:36:32 +00:00
Eli Friedman	5ae90441c4	Fix for PR2607: SCEV miscomputing the loop count for loops with an SGT exit condition. Essentially, the correct way to flip an inequality in 2's complement is the not operator, not the negation operator. That said, the difference only affects cases involving INT_MIN. Also, enhance the pre-test search logic to be a bit smarter about inequalities flipped with a not operator, so it can eliminate the smax from the iteration count for simple loops. llvm-svn: 54184	2008-07-30 00:04:08 +00:00
Wojciech Matyjewicz	f0d21cdd19	Fix PR2088. Use modulo linear equation solver to compute loop iteration count. llvm-svn: 53810	2008-07-20 15:55:14 +00:00
Nick Lewycky	82510bf0e3	XFAIL this test. llvm-svn: 53793	2008-07-19 15:52:06 +00:00
Wojciech Matyjewicz	a78669c2cf	While testing particular algorithms to compute loop iteration count the brute force evaluation (ComputeIterationCountExhaustively) should be turned off. It doesn't apply to trip-count2.ll because this file tests the brute force evaluation. The test for PR2364 (2008-05-25-NegativeStepToZero.ll) currently fails showing that the patch for this bug doesn't work. I'll fix it in a few hours with a patch for PR2088. llvm-svn: 53792	2008-07-19 13:26:15 +00:00
Nick Lewycky	b5688ccf57	Stop creating extraneous smax/umax in SCEV. This removes a regression where we started complicating many loops ('for' loops, in fact). llvm-svn: 53508	2008-07-12 07:41:32 +00:00
Chris Lattner	b35d9b5e07	If we are checking to see if the result of a call aliases a pointer derived from a local allocation, if the local allocation never escapes, the pointers can't alias. This implements PR2436 llvm-svn: 52301	2008-06-16 06:19:11 +00:00
Nick Lewycky	ed169d531d	Crash less. The i64 restriction in BinomialCoefficient caused some problems with code that was expecting different bit widths for different values. Make getTruncateOrZeroExtend a method on ScalarEvolution, and use it. llvm-svn: 52248	2008-06-13 04:38:55 +00:00
Matthijs Kooijman	6436685110	Remove trailing whitespace after line continuations in test cases to them work. This fixes two test cases that were not being run properly before. llvm-svn: 52179	2008-06-10 15:07:07 +00:00
Matthijs Kooijman	d66e18aaf6	Suppress the (stderr) output of -aa-eval, this fixes 5 tests. llvm-svn: 52173	2008-06-10 12:39:15 +00:00
Wojciech Matyjewicz	416867a81b	Fixes PR2395. Looking for a constant in a GEP tail (when the first GEP is longer than the second one) should stop after finding one. Added break instruction guarantees it. It also changes difference between offsets to absolute value of this difference in the condition. llvm-svn: 51875	2008-06-02 17:26:12 +00:00
Owen Anderson	50d602cda2	Move these tests into the proper directory. llvm-svn: 51685	2008-05-29 16:30:29 +00:00
Nick Lewycky	a61cc6ece0	Whoops -- forgot PR reference on this test. llvm-svn: 51569	2008-05-26 20:23:33 +00:00
Nick Lewycky	be993358a7	Use {} instead of "" in RUN lines. llvm-svn: 51561	2008-05-26 01:27:08 +00:00
Nick Lewycky	3195b393d6	Don't treat values as signed when looking at loop steppings in HowForToNonZero. llvm-svn: 51560	2008-05-25 23:43:32 +00:00
Dan Gohman	5e7863de1b	Remove lingering references to .llx and .tr in the tests. llvm-svn: 51500	2008-05-23 21:15:35 +00:00
Gabor Greif	1e427c3264	sabre brings to my attention that the 'tr' suffix is also obsolete llvm-svn: 51349	2008-05-20 21:00:03 +00:00
Gabor Greif	f45ff35bfe	Rename the last test with .llx extension to .ll, resolve duplicate test by renaming to isnan2. Now that no test has llx ending there is no need to search for them from dg.exp too. llvm-svn: 51328	2008-05-20 19:52:04 +00:00
Owen Anderson	a74d72d01f	Fix this test. It was testing broken behavior in that it required ADCE to eliminate a potentially infinite loop, which is undesirable. Instead, test the LICM behavior that we're really interested in. llvm-svn: 51177	2008-05-16 04:25:09 +00:00
Owen Anderson	9d990dd218	Fix PR1098 by correcting the postdominators analysis. Patch by Florian Brandner. llvm-svn: 50628	2008-05-04 21:07:35 +00:00
Chris Lattner	b839c05a05	rename .llx -> .ll, last batch. llvm-svn: 49971	2008-04-19 22:32:52 +00:00
Owen Anderson	f9ae76d89c	Make GVN able to remove unnecessary calls to read-only functions again. llvm-svn: 49842	2008-04-17 05:36:50 +00:00
Dale Johannesen	8fc8a272e0	Don't assume a tail call can't reference a byval argument to the outer function, this isn't correct. llvm-svn: 49731	2008-04-15 17:41:34 +00:00
Owen Anderson	b1e8bf2cad	The functionality being tested was removed because it was horribly unsafe. llvm-svn: 49610	2008-04-13 09:51:06 +00:00
Duncan Sands	d6481955db	Testcase for pr2169. llvm-svn: 49344	2008-04-07 17:03:16 +00:00
Duncan Sands	e37b9c0d34	Testcase for PR2160. llvm-svn: 48655	2008-03-21 20:22:11 +00:00
Daniel Berlin	5fef9aea12	Fix PR 2160 by making sure arguments to external functions get marked as pointing to anything llvm-svn: 48509	2008-03-18 22:22:53 +00:00
Gabor Greif	f77e6977a0	Fix http://llvm.org/bugs/show_bug.cgi?id=2104 by ordering lexicographically what gets printed. Be const-correct in PrintResults and uninline it too llvm-svn: 47712	2008-02-28 08:38:45 +00:00
Evan Cheng	01d6257e81	Temporarily reverting 46959. llvm-svn: 47542	2008-02-25 03:57:32 +00:00
Nick Lewycky	1c44ebcf86	Add 'umax' similar to 'smax' SCEV. Closes PR2003. Parse reversed smax and umax as smin and umin and express them with negative or binary-not SCEVs (which are really just subtract under the hood). Parse 'xor %x, -1' as (-1 - %x). Remove dead code (ConstantInt::get always returns a ConstantInt). Don't use getIntegerSCEV(-1, Ty). The first value is an int, then it gets passed into a uint64_t. Instead, create the -1 directly from ConstantInt::getAllOnesValue(). llvm-svn: 47360	2008-02-20 06:48:22 +00:00
Tanya Lattner	f865dcd009	Remove llvm-upgrade. llvm-svn: 47110	2008-02-14 06:56:27 +00:00
Wojciech Matyjewicz	ddb265b905	Now that ScalarEvolution::print writes to the correct stream, there is no need to redirect stderr into stdout. llvm-svn: 47009	2008-02-12 15:12:40 +00:00
Wojciech Matyjewicz	995624f44d	Change negative grep into positive one in my yesterday's testcase. llvm-svn: 47008	2008-02-12 15:10:35 +00:00
Wojciech Matyjewicz	1d2c27b23e	Fix PR2002. Suppose n is the initial value for the induction variable (with step 1) and m is its final value. Then, the correct trip count is SMAX(m,n)-n. Previously, we used SMAX(0,m-n), but m-n may overflow and can't in general be interpreted as signed. Patch by Nick Lewycky. llvm-svn: 47007	2008-02-12 15:09:36 +00:00
Wojciech Matyjewicz	adae053b53	If the LHS of the comparison is a loop-invariant we also want to move it to the RHS. This simple change allows to compute loop iteration count for loops with condition similar to the one in the testcase (which seems to be quite common). llvm-svn: 46959	2008-02-11 18:37:34 +00:00
Wojciech Matyjewicz	d2d9764cc8	Fix PR1798 - an error in the evaluation of SCEVAddRecExpr at an arbitrary iteration. The patch: 1) changes SCEVSDivExpr into SCEVUDivExpr, 2) replaces PartialFact() function with BinomialCoefficient(); the computations (essentially, the division) in BinomialCoefficient() are performed with the apprioprate bitwidth necessary to avoid overflow; unsigned division is used instead of the signed one. Computations in BinomialCoefficient() require support from the code generator for APInts. Currently, we use a hack rounding up the neccessary bitwidth to the nearest power of 2. The hack is easy to turn off in future. One remaining issue: we assume the divisor of the binomial coefficient formula can be computed accurately using 16 bits. It means we can handle AddRecs of length up to 9. In future, we should use APInts to evaluate the divisor. Thanks to Nicholas for cooperation! llvm-svn: 46955	2008-02-11 11:03:14 +00:00
Chris Lattner	9104d71269	Teach basicaa that 'byval' arguments define a new memory location that can't be aliased to other known objects. This allows us to know that byval pointer args don't alias globals, etc. llvm-svn: 46315	2008-01-24 18:00:32 +00:00
Nick Lewycky	0e519bb555	Accept both %y, %x and %x, %y as valid answers. llvm-svn: 45649	2008-01-06 03:12:44 +00:00
Chris Lattner	3f42d12072	Fix PR1782, patch by Wojtek Matyjewicz! llvm-svn: 44733	2007-12-09 07:35:13 +00:00
Tanya Lattner	8f342f8ef3	Fix bug in regression tests that ignored stderr output in RUN lines. Updated tests and fixed broken run lines. XFAILed 3 arm regressions (will file bugs) llvm-svn: 44389	2007-11-28 04:57:00 +00:00
Dan Gohman	2dba0788a5	Change grep '' to grep {}. Change 2>&1 \| to \|&. llvm-svn: 44344	2007-11-27 00:10:35 +00:00
Owen Anderson	4f833c7610	Allow GVN to eliminate read-only function calls when it can detect that they are redundant. llvm-svn: 44323	2007-11-26 02:26:36 +00:00
Nick Lewycky	cdb7e54ca7	Add new SCEV, SCEVSMax. This allows LLVM to analyze do-while loops. llvm-svn: 44319	2007-11-25 22:41:31 +00:00
Duncan Sands	8a3e9d2bee	Ding dong, the DoesntAccessMemoryFns and OnlyReadsMemoryFns tables are dead! We get more, and more accurate, information from gcc via the readnone and readonly function attributes. llvm-svn: 44288	2007-11-23 19:30:27 +00:00
Duncan Sands	38a5e82ef4	Teach alias analysis about readnone/readonly functions. Based on a patch by Török Edwin. llvm-svn: 44279	2007-11-22 21:43:27 +00:00
Nick Lewycky	016547d226	Create nodes for inline asm so that we don't crash looking for the node later. llvm-svn: 44267	2007-11-22 03:07:37 +00:00
Nick Lewycky	5b18bd3368	Be more careful when transforming \| to +. Patch from Wojciech Matyjewicz. llvm-svn: 44248	2007-11-20 08:24:44 +00:00
Anton Korobeynikov	6a7ddfdb8f	Reverted r44163 per request llvm-svn: 44177	2007-11-15 18:33:16 +00:00
Nick Lewycky	fbb24817cc	Fix handling of overflow in loop calculation by adding new UDiv SCEV. This SCEV is disabled in the sense that it will refuse to create one from a UDiv instruction, until the code is better tested. llvm-svn: 44163	2007-11-15 06:30:50 +00:00
Chris Lattner	0fc613b85d	Fix PR1774 and BasicAA/2007-11-05-SizeCrash.ll llvm-svn: 43756	2007-11-06 05:58:42 +00:00
Owen Anderson	7827a3f366	Fix for PR1741. llvm-svn: 43326	2007-10-25 02:36:18 +00:00

... 6 7 8 9 10 ...

783 Commits