llvm-project

Commit Graph

Author	SHA1	Message	Date
Mon P Wang	3537a62704	Fixed optimization of combining two shuffles where the first shuffle inputs has a different number of elements than the output. llvm-svn: 62998	2009-01-26 04:39:00 +00:00
Chris Lattner	9449991c4f	Handle single-entry phi nodes gracefully in condprop. llvm-svn: 62985	2009-01-26 02:18:20 +00:00
Chris Lattner	7b6647c178	Fix PR3408 by making a non-obvious assumption very obvious, and handling the flaw inherent in that assumption. :) llvm-svn: 62984	2009-01-26 02:11:30 +00:00
Nick Lewycky	cb7a10ab63	Actually run the test in this directory. llvm-svn: 62957	2009-01-25 08:05:07 +00:00
Nick Lewycky	5647c5d1a4	The function that does nothing but call malloc is noalias return. llvm-svn: 62956	2009-01-25 07:59:57 +00:00
Torok Edwin	f4395ea97a	testcase for PR3381. Also it was an empty struct, not a void after all. llvm-svn: 62920	2009-01-24 17:16:04 +00:00
Chris Lattner	72cd68fe64	Make InstCombineStoreToCast handle aggregates more aggressively, handling the case in Transforms/InstCombine/cast-store-gep.ll, which is a heavily reduced testcase from Clang on x86-64. llvm-svn: 62904	2009-01-24 01:00:13 +00:00
Chris Lattner	3f4591c89f	fix two more cases where we could let the NLPDI cache get unsorted. With this, sqlite3 now passes. llvm-svn: 62839	2009-01-23 07:12:16 +00:00
Chris Lattner	bed6be62e4	fix a testcase. llvm-svn: 62758	2009-01-22 07:08:58 +00:00
Chris Lattner	f09619d533	Fix PR3358, a really nasty bug where recursive phi translated analyses could be run without the caches properly sorted. This can fix all sorts of weirdness. Many thanks to Bill for coming up with the 'issorted' verification idea. llvm-svn: 62757	2009-01-22 07:04:01 +00:00
Dale Johannesen	1f86498f93	Do not use host floating point types when emitting ASCII IR; loading and storing these can change the bits of NaNs on some hosts. Remove or add warnings at a few other places using host floating point; this is a bad thing to do in general. llvm-svn: 62712	2009-01-21 20:32:55 +00:00
Dale Johannesen	287b4bc44e	Disable on x86_64 until I figure out what's wrong. llvm-svn: 62660	2009-01-21 02:08:30 +00:00
Dale Johannesen	b5721632ee	Make special cases (0 inf nan) work for frem. Besides APFloat, this involved removing code from two places that thought they knew the result of frem(0., x) but were wrong. llvm-svn: 62645	2009-01-21 00:35:19 +00:00
Dale Johannesen	e75fdb0510	Calls to fmod, it turns out, are constant-folded by invoking the host fmod, not by lowering to frem and constant-folding that. Fix this so it tests what I want to test. llvm-svn: 62622	2009-01-20 21:58:13 +00:00
Bill Wendling	a908b60fb2	Temporarily XFAIL until this can be looked at. r62557 is what caused it to start failing. llvm-svn: 62578	2009-01-20 10:28:39 +00:00
Chris Lattner	c59945b4bd	another fix for PR3354 llvm-svn: 62561	2009-01-20 01:15:41 +00:00
Chris Lattner	ea9f1d3c47	Fix a problem exposed by PR3354: simplifycfg was making a potentially trapping instruction be executed unconditionally. llvm-svn: 62541	2009-01-19 23:03:13 +00:00
Dale Johannesen	d067ecd1c7	Move & restructure test per review. llvm-svn: 62538	2009-01-19 22:33:12 +00:00
Chris Lattner	7eeb1cc605	convert this to an unfoldable potentially trapping constant expr. llvm-svn: 62536	2009-01-19 22:12:33 +00:00
Chris Lattner	6f34e317e9	Fix PR3353, infinitely jump threading an infinite loop make from switches. llvm-svn: 62529	2009-01-19 21:20:34 +00:00
Bill Wendling	534d2e0bae	Temporarily revert r62487. It's causing this error during a release bootstrap of llvm-gcc. Most likely, it's miscompiling one of the "gen*" programs: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.6.0/bin/ -c -g -O2 -mdynamic-no-pic -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -mdynamic-no-pic -DHAVE_CONFIG_H -DGENERATOR_FILE -I. -Ibuild -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/build -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -o build/gencondmd.o build/gencondmd.c ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: excess elements in struct initializer ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: (near initialization for 'insn_conditions[4]') ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected ',' or ';' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:927: error: expected identifier or '(' before ',' token ../../llvm-gcc.src/gcc/config/i386/sse.md:3458: error: expected identifier or '(' before ',' token ... llvm-svn: 62506	2009-01-19 08:46:20 +00:00
Chris Lattner	f2bb4ea39c	Fix PR3016, a bug which can occur do to an invalid assumption: we assumed a CFG structure that would be valid when all code in the function is reachable, but not all code is necessarily reachable. Do a simple, but horrible, CFG walk to check for this case. llvm-svn: 62487	2009-01-19 02:46:28 +00:00
Nick Lewycky	e5be1cd635	Forgot this in the previous checkin: fopen now has nocapture, realloc is supposed to take two arguments. llvm-svn: 62457	2009-01-18 04:46:10 +00:00
Chris Lattner	db2d9613d2	Fix PR3335 by not turning a store to one address space into a store to another. llvm-svn: 62351	2009-01-16 20:12:52 +00:00
Evan Cheng	beac6f8b0c	Clean up previous cast optimization a bit. Also make zext elimination a bit more aggressive: if it's not necessary to emit an AND (i.e. high bits are already zero), it's profitable to evaluate the operand at a different type. llvm-svn: 62297	2009-01-16 02:11:43 +00:00
Evan Cheng	60e19a46f2	- Teach CanEvaluateInDifferentType of this xform: sext (zext ty1), ty2 -> zext ty2 - Looking at the number of sign bits of the a sext instruction to determine whether new trunc + sext pair should be added when its source is being evaluated in a different type. llvm-svn: 62263	2009-01-15 17:01:23 +00:00
Chris Lattner	8fb9480ed2	Fix PR3325, a miscompilation of invokes by IPSCCP. Patch by Jay Foad! llvm-svn: 62244	2009-01-14 21:01:16 +00:00
Dale Johannesen	1f0e0e7c9c	Fix the time regression I introduced in 464.h264ref with my earlier patch to this file. The issue there was that all uses of an IV inside a loop are actually references to Base[IV2], and there was one use outside that was the same but LSR didn't see the base or the scaling because it didn't recurse into uses outside the loop; thus, it used base+IVscale mode inside the loop instead of pulling base out of the loop. This was extra bad because register pressure later forced both base and IV into memory. Doing that recursion, at least enough to figure out addressing modes, is a good idea in general; the change in AddUsersIfInteresting does this. However, there were side effects.... It is also possible for recursing outside the loop to introduce another IV where there was only 1 before (if the refs inside are not scaled and the ref outside is). I don't think this is a common case, but it's in the testsuite. It is right to be very aggressive about getting rid of such introduced IVs (CheckForIVReuse and the handling of nonzero RewriteFactor in StrengthReduceStridedIVUsers). In the testcase in question the new IV produced this way has both a nonconstant stride and a nonzero base, neither of which was handled before. And when inserting new code that feeds into a PHI, it's right to put such code at the original location rather than in the PHI's immediate predecessor(s) when the original location is outside the loop (a case that couldn't happen before) (RewriteInstructionToUseNewBase); better to avoid making multiple copies of it in this case. Also, the mechanism for keeping SCEV's corresponding to GEP's no longer works, as the GEP might change after its SCEV is remembered, invalidating the SCEV, and we might get a bad SCEV value when looking up the GEP again for a later loop. This also couldn't happen before, as we weren't recursing into GEP's outside the loop. Also, when we build an expression that involves a (possibly non-affine) IV from a different loop as well as an IV from the one we're interested in (containsAddRecFromDifferentLoop), don't recurse into that. We can't do much with it and will get in trouble if we try to create new non-affine IVs or something. More testcases are coming. llvm-svn: 62212	2009-01-14 02:35:31 +00:00
Chris Lattner	2538eb664c	rewrite OptimizeAwayTrappingUsesOfLoads to 1) avoid a temporary vector and extraneous loop over it, 2) not delete globals used by phis/selects etc which could actually be useful. This fixes PR3321. Many thanks to Duncan for narrowing this down. llvm-svn: 62201	2009-01-14 00:12:58 +00:00
Dale Johannesen	0aeabdff57	Fix testsuite regressions from recursive inlining. llvm-svn: 62189	2009-01-13 22:43:37 +00:00
Dan Gohman	59af77376c	Make instcombine ensure that all allocas are explicitly aligned at at least their preferred alignment. llvm-svn: 62176	2009-01-13 20:18:38 +00:00
Dale Johannesen	433a9086c0	Enable recursive inlining. Reduce inlining threshold back to 200; 400 seems to be too high, loses more than it gains. llvm-svn: 62107	2009-01-12 22:11:50 +00:00
Chris Lattner	ae0e857b98	Fix PR3304 llvm-svn: 61995	2009-01-09 18:18:43 +00:00
Chris Lattner	f50aa6ae5c	Implement rdar://6480391, extending of equality icmp's to avoid a truncation. I noticed this in the code compiled for a routine using std::map, which produced this code: %25 = tail call i32 @memcmp(i8* %24, i8* %23, i32 6) nounwind readonly %.lobit.i = lshr i32 %25, 31 ; <i32> [#uses=1] %tmp.i = trunc i32 %.lobit.i to i8 ; <i8> [#uses=1] %toBool = icmp eq i8 %tmp.i, 0 ; <i1> [#uses=1] br i1 %toBool, label %bb3, label %bb4 which compiled to: call L_memcmp$stub shrl $31, %eax testb %al, %al jne LBB1_11 ## with this change, we compile it to: call L_memcmp$stub testl %eax, %eax js LBB1_11 This triggers all the time in common code, with patters like this: %169 = and i32 %ply, 1 ; <i32> [#uses=1] %170 = trunc i32 %169 to i8 ; <i8> [#uses=1] %toBool = icmp ne i8 %170, 0 ; <i1> [#uses=1] %7 = lshr i32 %6, 24 ; <i32> [#uses=1] %9 = trunc i32 %7 to i8 ; <i8> [#uses=1] %10 = icmp ne i8 %9, 0 ; <i1> [#uses=1] etc llvm-svn: 61985	2009-01-09 07:47:06 +00:00
Chris Lattner	482eb70a10	Fix PR3298, a crash in Jump Threading. Apparently even jump threading can have bugs, who knew? ;-) llvm-svn: 61983	2009-01-09 06:08:12 +00:00
Chris Lattner	fef138b140	Fix part 3/2 of PR3290, making instcombine zap (gep(bitcast)) when possible. llvm-svn: 61980	2009-01-09 05:44:56 +00:00
Dale Johannesen	b48fc71fc6	Do not inline functions with (dynamic) alloca into functions that don't already have a (dynamic) alloca. Dynamic allocas cause inefficient codegen and we shouldn't propagate this (behavior follows gcc). Two existing tests assumed such inlining would be done; they are hacked by adding an alloca in the caller, preserving the point of the tests. llvm-svn: 61946	2009-01-08 21:45:23 +00:00
Chris Lattner	f3e696bc5a	ValueTracker can't assume that an alloca with no specified alignment will get its preferred alignment. It has to be careful and cautiously assume it will just get the ABI alignment. This prevents instcombine from rounding up the alignment of a load/store without adjusting the alignment of the alloca. llvm-svn: 61934	2009-01-08 19:28:38 +00:00
Chris Lattner	c518dfd11b	This implements the second half of the fix for PR3290, handling loads from allocas that cover the entire aggregate. This handles some memcpy/byval cases that are produced by llvm-gcc. This triggers a few times in kc++ (with std::pair<std::_Rb_tree_const_iterator <kc::impl_abstract_phylum*>,bool>) and once in 176.gcc (with %struct..0anon). llvm-svn: 61915	2009-01-08 05:42:05 +00:00
Duncan Sands	289f59f233	Remove alloca tracking from nocapture analysis. Not only was it not very helpful, it was also wrong! The problem is shown in the testcase: the alloca might be passed to a nocapture callee which dereferences it and returns the original pointer. But because it was a nocapture call we think we don't need to track its uses, but we do. llvm-svn: 61876	2009-01-07 19:39:06 +00:00
Chris Lattner	f2b8c82ad1	Implement the first half of PR3290: if there is a store of an integer to a (transitive) bitcast the alloca and if that integer has the full size of the alloca, then it clobbers the whole thing. Handle this by extracting pieces out of the stored integer and filing them away in the SROA'd elements. This triggers fairly frequently because the CFE uses integers to pass small structs by value and the inliner exposes these. For example, in kimwitu++, I see a bunch of these with i64 stores to "%struct.std::pair<std::_Rb_tree_const_iterator<kc::impl_abstract_phylum*>,bool>" In 176.gcc I see a few i32 stores to "%struct..0anon". In the testcase, this is a difference between compiling test1 to: _test1: subl $12, %esp movl 20(%esp), %eax movl %eax, 4(%esp) movl 16(%esp), %eax movl %eax, (%esp) movl (%esp), %eax addl 4(%esp), %eax addl $12, %esp ret vs: _test1: movl 8(%esp), %eax addl 4(%esp), %eax ret The second half of this will be to handle loads of the same form. llvm-svn: 61853	2009-01-07 08:11:13 +00:00
Chris Lattner	4e735eb157	make m_ConstantInt(int64_t) safely match ConstantInt's that are larger than i64. This fixes an instcombine crash on PR3235. llvm-svn: 61775	2009-01-05 23:45:50 +00:00
Duncan Sands	582c53d147	Teach the internalize pass to also internalize global aliases. llvm-svn: 61754	2009-01-05 21:24:45 +00:00
Duncan Sands	f5dbbae4f4	Delete unused global aliases with internal linkage. In fact this also deletes those with linkonce linkage, however this is currently dead because for the moment aliases aren't allowed to have this linkage type. llvm-svn: 61742	2009-01-05 20:37:33 +00:00
Nick Lewycky	959af7ba30	Run a post-pass that marks known function declarations by name. llvm-svn: 61632	2009-01-04 20:27:34 +00:00
Bill Wendling	0a09d2de13	XFAIL this test. The xform was removed. llvm-svn: 61624	2009-01-04 06:32:28 +00:00
Duncan Sands	b193a37cd3	When calculating 'nocapture' argument attributes, allow the argument to be stored to an alloca by tracking uses of the alloca. This occurs 4 times (out of 7121, 0.05%) in MultiSource/Applications, so may not be worth it. On the other hand, it is easy to do and fairly cheap. The functions it helps are: W_addcom and W_addlit in spiff; process_args (argv) in d (make_dparser); ercPixConcealIMB in JM/ldecod. llvm-svn: 61570	2009-01-02 11:54:37 +00:00
Chris Lattner	ac161bff07	Reimplement the old and horrible bison parser for .ll files with a nice and clean recursive descent parser. This change has a couple of ramifications: 1. The parser code is about 400 lines shorter (in what we maintain, not including what is autogenerated). 2. The code should be significantly faster than the old code because we don't have to work around bison's poor handling of datatypes with ctors/dtors. This also makes the code much more resistant to memory leaks. 3. We now get caret diagnostics from the .ll parser, woo. 4. The actual diagnostics emited from the parser are completely different so a bunch of testcases had to be updated. 5. I now disallow "%ty = type opaque %ty = type i32". There was no good reason to support this, it was just an accident of the old implementation. I have no reason to think that anyone is actually using this. 6. The syntax for sticking a global variable has changed to make it unambiguous. I don't think anyone is depending on this since only clang supports this and it is not solid yet, so I'm not worried about anything breaking. 7. This gets rid of the last use of bison, and along with it the .cvs files. I'll prune this from the makefiles as a subsequent commit. There are a few minor cleanups that can be done after this commit (suggestions welcome!) but this passes dejagnu testing and is ready for its time in the limelight. llvm-svn: 61558	2009-01-02 07:01:27 +00:00
Nick Lewycky	0cfba9c6bf	Remove the cyclic part of this test, it was passing for the wrong reason. Two functions which mutually require each other to be nocapture are not currently supported. llvm-svn: 61553	2009-01-02 03:52:27 +00:00
Nick Lewycky	7e82055e88	Make adding nocapture a bit stronger. FreeInst is nocapture. Also, functions that don't write can't leak a pointer except through the return value, so a void readonly function is implicitly nocapture. Test these, and add a test that verifies that f1 calling f2 with an otherwise dead pointer gets both of them marked nocapture. llvm-svn: 61552	2009-01-02 03:46:56 +00:00
Duncan Sands	8c03a123ce	Add tests for two types of traps that escape analysis might one day fall into. llvm-svn: 61549	2009-01-02 00:55:51 +00:00
Bill Wendling	aedb54a947	Add transformation: xor (or (icmp, icmp), true) -> and(icmp, icmp) This is possible because of De Morgan's law. llvm-svn: 61537	2009-01-01 01:18:23 +00:00
Duncan Sands	163848021b	Look through phi nodes and select instructions when calculating nocapture attributes. llvm-svn: 61535	2008-12-31 20:21:34 +00:00
Duncan Sands	44c8cd97a5	Rename AddReadAttrs to FunctionAttrs, and teach it how to work out (in a very simplistic way) which function arguments (pointer arguments only) are only dereferenced and so do not escape. Mark such arguments 'nocapture'. llvm-svn: 61525	2008-12-31 16:14:43 +00:00
Duncan Sands	c125d6a3d3	Allow readnone functions to read (and write!) global constants, since doing so is irrelevant for aliasing purposes. While this doesn't increase the total number of functions marked readonly or readnone in MultiSource/ Applications (3089), it does result in 12 functions being marked readnone rather than readonly. Before: readnone: 820 readonly: 2269 After: readnone: 832 readonly: 2257 llvm-svn: 61469	2008-12-29 11:34:09 +00:00
Nick Lewycky	10eb8e533f	Turn strcmp into memcmp, such as strcmp(P, "x") --> memcmp(P, "x", 2). llvm-svn: 61297	2008-12-21 00:19:21 +00:00
Nick Lewycky	0f0e63fe73	Make all the vector elements positive in an srem of constant vector. llvm-svn: 61195	2008-12-18 06:31:11 +00:00
Chris Lattner	222ef4c489	Enhance heap sra to be substantially more aggressive w.r.t PHI nodes. This allows it to do fairly general phi insertion if a load from a pointer global wants to be SRAd but the load is used by (recursive) phi nodes. This fixes a pessimization on ppc introduced by Load PRE. llvm-svn: 61123	2008-12-17 05:28:49 +00:00
Chris Lattner	56b55387fc	Fix another crash found by inspection. If we have a PHI node merging the load multiple times, make sure the check the uses of the PHI to ensure they are transformable. llvm-svn: 61102	2008-12-16 21:24:51 +00:00
Chris Lattner	06a456b3f4	fix a crash found by inspection. llvm-svn: 61101	2008-12-16 21:04:51 +00:00
Eli Friedman	cb61afb546	Add a helper to remove a branch and DCE the condition, and use it consistently for deleting branches. In addition to being slightly more readable, this makes SimplifyCFG a bit better about cleaning up after itself when it makes conditions unused. llvm-svn: 61100	2008-12-16 20:54:32 +00:00
Chris Lattner	8b4be37275	fix PR3217: fully cached queries need to be verified against the visited set before they are used. If used, their blocks need to be added to the visited set so that subsequent queries don't use conflicting pointer values in the cache result blocks. llvm-svn: 61080	2008-12-16 07:10:09 +00:00
Chris Lattner	590b10dba2	add testcase for r61051 llvm-svn: 61052	2008-12-15 21:46:23 +00:00
Chris Lattner	3cdf0a8a2e	add a basic test for heap-sra llvm-svn: 61041	2008-12-15 19:42:05 +00:00
Chris Lattner	81ee731852	Add a testcase for GCC PR 23455, which lpre handles now. Add some comments about why we're not getting other cases. llvm-svn: 61032	2008-12-15 07:49:24 +00:00
Chris Lattner	3c2c36b590	gvn now hoists this load out of the hot non-call path. llvm-svn: 61028	2008-12-15 06:34:48 +00:00
Chris Lattner	b2429e2d69	Adjust testcase to make it more stable across visitation order changes, unbreaking it after r61024. llvm-svn: 61025	2008-12-15 04:42:00 +00:00
Chris Lattner	69131fd872	make GVN try to rename inputs to the resultant replaced values, which cleans up the generated code a bit. This should have the added benefit of not randomly renaming functions/globals like my previous patch did. :) llvm-svn: 61023	2008-12-15 03:46:38 +00:00
Chris Lattner	ff9f3dba12	Implement initial support for PHI translation in memdep. This means that memdep keeps track of how PHIs affect the pointer in dep queries, which allows it to eliminate the load in cases like rle-phi-translate.ll, which basically end up being: BB1: X = load P br BB3 BB2: Y = load Q br BB3 BB3: R = phi [P] [Q] load R turning "load R" into a phi of X/Y. In addition to additional exposed opportunities, this makes memdep safe in many cases that it wasn't before (which is required for load PRE) and also makes it substantially more efficient. For example, consider: bb1: // has many predecessors. P = some_operator() load P In this example, previously memdep would scan all the predecessors of BB1 to see if they had something that would mustalias P. In some cases (e.g. test/Transforms/GVN/rle-must-alias.ll) it would actually find them and end up eliminating something. In many other cases though, it would scan and not find anything useful. MemDep now stops at a block if the pointer is defined in that block and cannot be phi translated to predecessors. This causes it to miss the (rare) cases like rle-must-alias.ll, but makes it faster by not scanning tons of stuff that is unlikely to be useful. For example, this speeds up GVN as a whole from 3.928s to 2.448s (60%)!. IMO, scalar GVN should be enhanced to simplify the rle-must-alias pointer base anyway, which would allow the loads to be eliminated. In the future, this should be enhanced to phi translate through geps and bitcasts as well (as indicated by FIXMEs) making memdep even more powerful. llvm-svn: 61022	2008-12-15 03:35:32 +00:00
Chris Lattner	a236dc44d6	another random testcase that shouldn't crash gvn and is good for coverage with future changes. llvm-svn: 61011	2008-12-14 21:20:46 +00:00
Chris Lattner	9b9a145694	RLE isn't smart enough to eliminate this safely yet. llvm-svn: 60994	2008-12-13 21:04:20 +00:00
Chris Lattner	d923519cc5	rename some tests to be more uniform in naming convention. llvm-svn: 60988	2008-12-13 18:47:40 +00:00
Chris Lattner	9e24267120	gvn should never crash on this. llvm-svn: 60987	2008-12-13 18:39:44 +00:00
Bill Wendling	293b9181e5	Temporarily revert r60973. It's inexplicably causing a failure when self-hosting LLVM: llvm[2]: Linking Release executable opt (without symbols) ... Undefined symbols: "llvm::APFloat::IEEEsingle", referenced from: __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) "llvm::APFloat::IEEEdouble", referenced from: __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) ld: symbol(s) not found This is in release mode. To replicate, compile llvm and llvm-gcc in optimized mode. Then build llvm, in optimized mode, with the newly created compiler. llvm-svn: 60977	2008-12-13 09:28:44 +00:00
Chris Lattner	1e29f7c97d	make RLE preserve the name of the load that it replaces. This is just a pretification of the IR. llvm-svn: 60973	2008-12-13 07:22:47 +00:00
Chris Lattner	0318b56f0e	loosen up an assertion that isn't valid when called from invalidateCachedPointerInfo. Thanks to Bill for sending me a testcase. llvm-svn: 60805	2008-12-09 22:45:32 +00:00
Chris Lattner	702e46ed54	Teach BasicAA::getModRefInfo(CallSite, CallSite) some tricks based on readnone/readonly functions. Teach memdep to look past readonly calls when analyzing deps for a readonly call. This allows elimination of a few more calls from 403.gcc: before: 63 gvn - Number of instructions PRE'd 153986 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted after: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted 5 calls isn't much, but this adds plumbing for the next change. llvm-svn: 60794	2008-12-09 21:19:42 +00:00
Devang Patel	5f769e2d40	Actually test something. Use PR3170 test case. llvm-svn: 60727	2008-12-08 23:44:46 +00:00
Devang Patel	1c469d36b0	Undo previous patch. llvm-svn: 60701	2008-12-08 17:02:37 +00:00
Chris Lattner	f50d7f76c6	fix a bug I introduced in simplifycfg handling single entry phi nodes. FoldSingleEntryPHINodes deletes the PHI, so there is no need to delete it afterward. llvm-svn: 60653	2008-12-07 07:22:45 +00:00
Chris Lattner	57e91eaf61	Reimplement the inner loop of DSE. It now uniformly uses getDependence(), doesn't do its own local caching, and is slightly more aggressive about free/store dse (see testcase). This eliminates the last external client of MemDep::getDependenceFrom(). llvm-svn: 60619	2008-12-06 00:53:22 +00:00
Chris Lattner	c100828026	Fix test/Transforms/GVN/pre-load.ll llvm-svn: 60594	2008-12-05 17:04:12 +00:00
Devang Patel	c56423b500	Rewrite code that 1) filters loops and 2) calculates new loop bounds. This fixes many bugs. I will add more test cases in a separate check-in. Some day, the code that manipulates CFG and updates dom. info could use refactoring help. llvm-svn: 60554	2008-12-04 21:38:42 +00:00
Chris Lattner	350fc5721d	testcase for br undef folding. llvm-svn: 60471	2008-12-03 07:48:27 +00:00
Chris Lattner	595c7279bd	Teach jump threading some more simple tricks: 1) have it fold "br undef", which does occur with surprising frequency as jump threading iterates. 2) teach j-t to delete dead blocks. This removes the successor edges, reducing the in-edges of other blocks, allowing recursive simplification. 3) Fold things like: br COND, BBX, BBY BBX: br COND, BBZ, BBW which also happens because jump threading iterates. llvm-svn: 60470	2008-12-03 07:48:08 +00:00
Chris Lattner	50532410d1	don't spew tons of stuff to the output. This testcase is not for loop deletion (it is for a ton of passes), which is very bad. llvm-svn: 60465	2008-12-03 06:41:50 +00:00
Chris Lattner	1db9bbe802	Implement PRE of loads in the GVN pass with a pretty cheap and straight-forward implementation. This does not require any extra alias analysis queries beyond what we already do for non-local loads. Some programs really really like load PRE. For example, SPASS triggers this ~1000 times, ~300 times in 255.vortex, and ~1500 times on 403.gcc. The biggest limitation to the implementation is that it does not split critical edges. This is a huge killer on many programs and should be addressed after the initial patch is enabled by default. The implementation of this should incidentally speed up rejection of non-local loads because it avoids creating the repl densemap in cases when it won't be used for fully redundant loads. This is currently disabled by default. Before I turn this on, I need to fix a couple of miscompilations in the testsuite, look at compile time performance numbers, and look at perf impact. This is pretty close to ready though. llvm-svn: 60408	2008-12-02 08:16:11 +00:00
Owen Anderson	35bd70c07a	Add a test for my previous PRE fix. llvm-svn: 60394	2008-12-02 04:25:42 +00:00
Bill Wendling	582fe6b0ca	Use m_Specific() instead of double matching. llvm-svn: 60341	2008-12-01 08:09:47 +00:00
Chris Lattner	9e6b243428	simplify these patterns using m_Specific. No need to grep for xor in testcase (or is a substring). llvm-svn: 60328	2008-12-01 05:16:26 +00:00
Chris Lattner	9d02a70a7d	Teach inst combine to merge GEPs through PHIs. This is really important because it is sinking the loads using the GEPs, but not the GEPs themselves. This triggers 647 times on 403.gcc and makes the .s file much much nicer. For example before: je LBB1_87 ## bb78 LBB1_62: ## bb77 leal 84(%esi), %eax LBB1_63: ## bb79 movl (%eax), %eax ... LBB1_87: ## bb78 movl $0, 4(%esp) movl %esi, (%esp) call L_make_decl_rtl$stub jmp LBB1_62 ## bb77 after: jne LBB1_63 ## bb79 LBB1_62: ## bb78 movl $0, 4(%esp) movl %esi, (%esp) call L_make_decl_rtl$stub LBB1_63: ## bb79 movl 84(%esi), %eax The input code was (and the GEPs are merged and the PHI is now eliminated by instcombine): br i1 %tmp233, label %bb78, label %bb77 bb77: %tmp234 = getelementptr %struct.tree_node* %t_addr.3, i32 0, i32 0, i32 22 br label %bb79 bb78: call void @make_decl_rtl(%struct.tree_node* %t_addr.3, i8* null) nounwind %tmp235 = getelementptr %struct.tree_node* %t_addr.3, i32 0, i32 0, i32 22 br label %bb79 bb79: %iftmp.12.0.in = phi %struct.rtx_def [ %tmp235, %bb78 ], [ %tmp234, %bb77 ] %iftmp.12.0 = load %struct.rtx_def %iftmp.12.0.in llvm-svn: 60322	2008-12-01 02:34:36 +00:00
Chris Lattner	8facc59e72	testcase for my previous commit. llvm-svn: 60315	2008-12-01 01:42:03 +00:00
Bill Wendling	5b902c5b1e	Implement ((A\|B)&1)\|(B&-2) -> (A&1) \| B transformation. This also takes care of permutations of this pattern. llvm-svn: 60312	2008-12-01 01:07:11 +00:00
Bill Wendling	de89bc275c	Add instruction combining for ((A&~B)\|(~A&B)) -> A^B and all permutations. llvm-svn: 60291	2008-11-30 13:52:49 +00:00
Bill Wendling	9eef421e12	Implement (A&((~A)\|B)) -> A&B transformation in the instruction combiner. This takes care of all permutations of this pattern. llvm-svn: 60290	2008-11-30 13:08:13 +00:00
Bill Wendling	2d2e7861b5	getSExtValue() doesn't work for ConstantInts with bitwidth > 64 bits. Use all APInt calls instead. This fixes PR3144. llvm-svn: 60288	2008-11-30 12:38:24 +00:00
Eli Friedman	09bc610945	Optimize memmove and memset into the LLVM builtins. Note that these only show up in code from front-ends besides llvm-gcc, like clang. llvm-svn: 60287	2008-11-30 08:32:11 +00:00
Bill Wendling	361c0e5f9c	Strengthen check for div inst-combining. llvm-svn: 60276	2008-11-30 04:33:53 +00:00
Bill Wendling	70635adea3	Instcombine was illegally transforming -X/C into X/-C when either X or C overflowed on negation. This commit checks to make sure that neithe C nor X overflows. This requires that the RHS of X (a subtract instruction) be a constant integer. llvm-svn: 60275	2008-11-30 03:42:12 +00:00
Chris Lattner	c40039c736	don't require GVN to work on dead values, just make the test return the loaded value. llvm-svn: 60252	2008-11-29 21:21:48 +00:00
Chris Lattner	8c5ff516c6	Fix a thinko that manifested as a crash on clamav last night. llvm-svn: 60251	2008-11-29 20:29:04 +00:00
Chris Lattner	d3d9111ede	Fix PR3141 by ensuring that MemoryDependenceAnalysis::removeInstruction properly updates the reverse dependency map when it installs updated dependencies for instructions that depend on the removed instruction. llvm-svn: 60222	2008-11-28 22:51:08 +00:00
Chris Lattner	8a172daa55	don't call MergeBasicBlockIntoOnlyPred on a block whose only predecessor is itself. This doesn't make sense, and this is a dead infinite loop anyway. llvm-svn: 60210	2008-11-28 19:54:49 +00:00
Nick Lewycky	4ab50b93c8	Chris prefers icmp/select over udiv! llvm-svn: 60187	2008-11-27 22:41:10 +00:00
Nick Lewycky	69941fd0a0	Add a couple of missed optimizations on integer vectors. Multiply and divide by 1, as well as multiply by -1. llvm-svn: 60182	2008-11-27 20:21:08 +00:00
Chris Lattner	5dfbfcd80d	Fix PR3138: if we merge the entry block into another block, make sure to move the other block back up into the entry position! llvm-svn: 60179	2008-11-27 19:25:19 +00:00
Chris Lattner	98d89d1b1b	Make jump threading substantially more powerful, in the following ways: 1. Make it fold blocks separated by an unconditional branch. This enables jump threading to see a broader scope. 2. Make jump threading able to eliminate locally redundant loads when they feed the branch condition of a block. This frequently occurs due to reg2mem running. 3. Make jump threading able to eliminate partially redundant loads when they feed the branch condition of a block. This is common in code with lots of loads and stores like C++ code and 255.vortex. This implements thread-loads.ll and rdar://6402033. Per the fixme's, several pieces of this should be moved into Transforms/Utils. llvm-svn: 60148	2008-11-27 05:07:53 +00:00
Evan Cheng	2e5aeff676	convertToSignExtendedInteger should return opInvalidOp instead of asserting if sematics of float does not allow arithmetics. llvm-svn: 60042	2008-11-25 19:00:29 +00:00
Chris Lattner	18065ce9fc	reenable test llvm-svn: 59986	2008-11-24 21:27:20 +00:00
Bill Wendling	e6fe59df6d	Temporarily XFAIL this test. r59976 and r59972 broke it. llvm-svn: 59981	2008-11-24 20:43:33 +00:00
Chris Lattner	53d6a07869	Fix 3113: If we have a dead cyclic PHI, replace the whole thing with an undef. llvm-svn: 59972	2008-11-24 19:25:36 +00:00
Nick Lewycky	07d726ec4d	Optimize (x/y)*y into x-(x%y) in general. Div and rem are about the same, and a subtract is cheaper than a multiply. This generalizes an existing transform. llvm-svn: 59800	2008-11-21 07:33:58 +00:00
Devang Patel	f1e9329209	Give SIToFPInst preference over UIToFPInst because it is faster on platforms that are widely used. llvm-svn: 59476	2008-11-18 00:40:02 +00:00
Devang Patel	180afd2c55	While handling floating point IVs lift restrictions on initial value and increment value. llvm-svn: 59471	2008-11-17 23:27:13 +00:00
Chris Lattner	c3f3b059d0	Handle the case where there is no "not". It is possible it got folded into the select. llvm-svn: 59389	2008-11-16 04:25:26 +00:00
Chris Lattner	645f60141b	make this actually test what it is trying to. llvm-svn: 59386	2008-11-16 04:21:51 +00:00
Devang Patel	d0ce981372	If the sign of exit condition and split condition does not match then do not split loop index. llvm-svn: 58995	2008-11-10 19:48:34 +00:00
Bill Wendling	3f547be28f	If the LHS of the FCMP is coming from a UIToFP instruction, then we don't want to generate signed ICMP instructions to replace the FCMP. This would violate the following: define i1 @test1(i32 %val) { %1 = uitofp i32 %val to double %2 = fcmp ole double %1, 0.000000e+00 ret i1 %2 } would be transformed into: define i1 @test1(i32 %val) { %1 = icmp slt i33 %val, 1 ret i1 %1 } which is obviously wrong. This patch modifes InstCombiner::FoldFCmp_IntToFP_Cst to handle when the LHS comes from UIToFP. llvm-svn: 58929	2008-11-09 04:26:50 +00:00
Devang Patel	f53d7ff870	Add PR number. llvm-svn: 58765	2008-11-05 18:41:15 +00:00
Devang Patel	f04bfc6989	New test case. llvm-svn: 58745	2008-11-05 01:40:30 +00:00
Dan Gohman	8cdea717a3	Add a new pass to simplify specific half_powr function calls. This is a specialized pass that it not likely to be generally useful. llvm-svn: 58732	2008-11-04 23:41:45 +00:00
Anton Korobeynikov	d6948315b7	Fix tests not to emit IR output llvm-svn: 58729	2008-11-04 23:02:39 +00:00
Devang Patel	fe57d109b6	Ignore conditions that are outside the loop. llvm-svn: 58631	2008-11-03 19:38:07 +00:00
Devang Patel	c1631db93b	Turn floating point IVs into integer IVs where possible. This allows SCEV users to effectively calculate trip count. LSR later on transforms back integer IVs to floating point IVs later on to avoid int-to-float casts inside the loop. llvm-svn: 58625	2008-11-03 18:32:19 +00:00
Nick Lewycky	3c6d34a7f0	Changes from Duncan's review: * merge two weak functions by making them both alias a third non-weak fn * don't reimplement CallSite::hasArgument * whitelist the safe linkage types llvm-svn: 58568	2008-11-02 16:46:26 +00:00
Nick Lewycky	d01d42e76c	Add a new MergeFunctions pass. It finds identical functions and merges them. This triggers only 60 times in llvm-test (look at .llvm.bc, not .linked.rbc) and so it probably wont be turned on by default. Also, may of those are likely to go away when PR2973 is fixed. llvm-svn: 58557	2008-11-02 05:52:50 +00:00
Nick Lewycky	8d8acf327b	Fix demanded bits analysis with srem by negative number. Based on a patch by Richard Osborne. llvm-svn: 58555	2008-11-02 02:41:50 +00:00
Dan Gohman	83eea0b17f	Fix this recently moved code to use the correct type. CI is now a ConstantInt, and SI is the original cast instruction. This fixes PR2996. llvm-svn: 58549	2008-11-02 00:17:33 +00:00
Dan Gohman	13cbcf1c18	Canonicalize sext(i1) to i1?-1:0, and update various instcombine optimizations accordingly. llvm-svn: 58457	2008-10-30 20:40:10 +00:00
Daniel Dunbar	3933e66a89	Add InlineCost class for represent the estimated cost of inlining a function. - This explicitly models the costs for functions which should "always" or "never" be inlined. This fixes bugs where such costs were not previously respected. llvm-svn: 58450	2008-10-30 19:26:59 +00:00
Chris Lattner	0934c0f35b	Fix PR2967 by not deleting volatile load/stores that occur before unreachable. I don't really see this as being needed, but there is little harm from doing it. llvm-svn: 58385	2008-10-29 17:46:26 +00:00
Dan Gohman	2c34c130bf	(A & sext(C)) \| (B & ~sext(C) -> C ? A : B llvm-svn: 58351	2008-10-28 22:38:57 +00:00
Chris Lattner	a9642ff459	no need to print output llvm-svn: 58228	2008-10-27 06:56:35 +00:00
Nick Lewycky	730d5dade6	Don't try to create a mask when we don't need one. Fixes a crash. llvm-svn: 58075	2008-10-24 06:14:27 +00:00
Chris Lattner	1baace07c4	apply Eli's patch for PR2165 and provide a testcase. llvm-svn: 57625	2008-10-16 05:26:51 +00:00
Dan Gohman	bc0278400c	Teach instcombine's visitLoad to scan back several instructions to find opportunities for store-to-load forwarding or load CSE, in the same way that visitStore scans back to do DSE. Also, define a new helper function for testing whether the addresses of two memory accesses are known to have the same value, and use it in both visitStore and visitLoad. These two changes allow instcombine to eliminate loads in code produced by front-ends that frequently emit obviously redundant addressing for memory references. llvm-svn: 57608	2008-10-15 23:19:35 +00:00
Evan Cheng	d885f6e139	Combine (fcmp cc0 x, y) \| (fcmp cc1 x, y) into a single fcmp when possible. llvm-svn: 57515	2008-10-14 18:44:08 +00:00
Evan Cheng	ce70752b11	- Somehow I forgot about one / une. - Renumber fcmp predicates to match their icmp counterparts. - Try swapping operands to expose more optimization opportunities. llvm-svn: 57513	2008-10-14 18:13:38 +00:00
Evan Cheng	67786cce66	Optimize anding of two fcmp into a single fcmp if the operands are the same. e.g. uno && ueq -> ueq ord && olt -> olt ord && ueq -> oeq llvm-svn: 57507	2008-10-14 17:15:11 +00:00
Chris Lattner	da435910e8	Fix PR2697 by rewriting the '(X / pos) op neg' logic. This also changes a couple other cases for clarity, but shouldn't affect correctness. Patch by Eli Friedman! llvm-svn: 57387	2008-10-11 22:55:00 +00:00
Devang Patel	647a1e532b	Check loop exit predicate properly while eliminating one iteration loop. This patch fixes PR 2869 llvm-svn: 57369	2008-10-10 22:02:57 +00:00
Devang Patel	40aafce00d	Fix typo, fix PR 2865. llvm-svn: 57221	2008-10-06 23:22:54 +00:00
Matthijs Kooijman	cbe5e16eb5	Allow scalarrepl to treat an all-zero GEP just as bitcast. This includes not marking a GEP involving a vector as unsafe, but only when it has all zero indices. This allows scalarrepl to work in a few more cases. llvm-svn: 57177	2008-10-06 16:23:31 +00:00
Chris Lattner	917a6c1343	rewrite bswap matching to be more general, allowing arbitrary shifting and masking inside a bswap expr. This allows it to handle the cases from PR2842, which involve the intermediate 'or' expressions being shifted, not just the input value. llvm-svn: 57095	2008-10-05 02:13:19 +00:00
Duncan Sands	1d35e9aebe	Ignore loads from and stores to local memory (i.e. allocas) when deciding whether to mark a function readnone/readonly. Since the pass is currently run before SROA, this may be quite helpful. Requested by Chris on IRC. llvm-svn: 57050	2008-10-04 13:24:24 +00:00
Nick Lewycky	fc9bc3cf23	Allow the construction of SCEVs with SCEVCouldNotCompute operands, by implementing folding. Fixes PR2857. llvm-svn: 57049	2008-10-04 11:19:07 +00:00
Devang Patel	f963403b58	Nick Lewycky's patch. While hosting instruction check PHI node. llvm-svn: 57025	2008-10-03 18:57:37 +00:00
Nick Lewycky	e8ced3ec19	Fix misoptimization of: xor i1 (icmp eq (X, C1), icmp s[lg]t (X, C2)) llvm-svn: 56834	2008-09-30 06:08:34 +00:00
Devang Patel	221fe42006	Support inreg, zext and sext as return value attributes. llvm-svn: 56801	2008-09-29 20:49:50 +00:00
Matthijs Kooijman	be4fc9b9fb	Add a testcase showing that scalarrepl supports first class structs. I originally made this script to show that scalarrepl didn't support them, but it turned out it does. Better to still add the testcase then. llvm-svn: 56781	2008-09-29 10:42:13 +00:00
Devang Patel	9eb525d4f9	Implement function notes as function attributes. llvm-svn: 56716	2008-09-26 23:51:19 +00:00
Duncan Sands	9c40c28926	Rationalize the names of passes that print information: -callgraph => print-callgraph -callscc => print-callgraph-sccs -cfgscc => print-cfg-sccs -externalfnconstants => print-externalfnconstants -print => print-function -print-alias-sets (no change) -print-callgraph => dot-callgraph -print-cfg => dot-cfg -print-cfg-only => dot-cfg-only -print-dom-info (no change) -printm => print-module -printusedtypes => print-used-types llvm-svn: 56487	2008-09-23 12:47:39 +00:00
Duncan Sands	5a896a9858	Add test for improvement of readonly to readnone, and non-demotion of readnone to readonly. llvm-svn: 56344	2008-09-19 09:20:05 +00:00
Duncan Sands	310dbffdc0	Turn on these tests! llvm-svn: 56343	2008-09-19 09:16:32 +00:00
Duncan Sands	af25ee7ffc	Add a new pass AddReadAttrs which works out which functions can get the readnone/readonly attributes, and gives them it. The plan is to remove markmodref (which did the same thing by querying GlobalsModRef) and delete the analogous functionality from GlobalsModRef. llvm-svn: 56341	2008-09-19 08:17:05 +00:00
Duncan Sands	a75815ebb0	Test the callgraph directly for the missing edge. llvm-svn: 56338	2008-09-19 08:01:57 +00:00
Devang Patel	c25be3b2de	splitLoop does not handle split condition EQ. Fixes PR 2805 llvm-svn: 56321	2008-09-18 23:45:14 +00:00
Devang Patel	7f9671ba37	Do not hoist instruction above branch condition. The instruction may use branch condition. llvm-svn: 56286	2008-09-17 18:21:49 +00:00
Devang Patel	dca8d3b183	Do not ignore iv uses outside the loop. This one slipped through cracks very well. llvm-svn: 56284	2008-09-17 17:53:47 +00:00
Dan Gohman	dafa9c6e85	Improve instcombine's handling of integer min and max in two ways: - Recognize expressions like "x > -1 ? x : 0" as min/max and turn them into expressions like "x < 0 ? 0 : x", which is easily recognizable as a min/max operation. - Refrain from folding expression like "y/2 < 1" to "y < 2" when the comparison is being used as part of a min or max idiom, like "y/2 < 1 ? 1 : y/2". In that case, the division has another use, so folding doesn't eliminate it, and obfuscates the min/max, making it harder to recognize as a min/max operation. These benefit ScalarEvolution, CodeGen, and anything else that wants to recognize integer min and max. llvm-svn: 56246	2008-09-16 18:46:06 +00:00
Dan Gohman	eff71f2953	On 64-bit targets, change 32-bit getelementptr indices to be 64-bit getelementptr indices, inserting an explicit cast if necessary. This helps expose the sign-extension operation to other optimizations. llvm-svn: 56133	2008-09-11 23:06:38 +00:00
Dan Gohman	7d01c0654c	Fix a vectorshuffle instcombine bug introduced by r55995. Patch by Nicolas Capens! llvm-svn: 56129	2008-09-11 22:47:57 +00:00
Dan Gohman	c1ae01688f	Fix an icmp+sdiv optimization to check for and handle an overflow condition. This fixes PR2740. llvm-svn: 56076	2008-09-10 23:30:57 +00:00
Devang Patel	b4061e8ce4	Remove. llvm-svn: 56018	2008-09-09 21:41:34 +00:00
Devang Patel	92b032f3e6	if loop induction variable is always sign or zero extended then extend the type of induction variable. llvm-svn: 56017	2008-09-09 21:41:07 +00:00
Devang Patel	92c5367705	fix overflow check. llvm-svn: 56011	2008-09-09 20:54:34 +00:00
Anton Korobeynikov	a9b60ee0fc	Resolve aliases, when possible llvm-svn: 56001	2008-09-09 19:04:59 +00:00
Dan Gohman	86fb5b48de	Make SimplifyDemandedVectorElts simplify vectors with multiple users, and teach it about shufflevector instructions. Also, fix a subtle bug in SimplifyDemandedVectorElts' insertelement code. This is a patch that was originally written by Eli Friedman, with some fixes and cleanup by me. llvm-svn: 55995	2008-09-09 18:11:14 +00:00
Devang Patel	0f7a3507cf	Fix simplifycfg crash in handing block merge. llvm-svn: 55971	2008-09-09 01:06:56 +00:00
Devang Patel	d92a8216ec	xfail llvm-svn: 55914	2008-09-08 16:24:30 +00:00
Duncan Sands	3cf7d86556	Update the callgraph correctly in ArgumentPromotion. llvm-svn: 55895	2008-09-08 11:07:35 +00:00
Duncan Sands	95c2a7848a	When PruneEH turned an invoke into an ordinary call (thus changing the call site) it didn't inform the callgraph about this. But the call site does matter - as shown by the testcase, the callgraph become invalid after the inliner ran (with an edge between two functions simply missing), resulting in wrong deductions by GlobalsModRef. llvm-svn: 55872	2008-09-06 17:19:29 +00:00
Nick Lewycky	f023db6444	Don't crash when trying to constant fold a vector with some elements that can't be folded. Instead, fail to fold the entire vector. We could also return a vector with some elements folded and some not. If anyone thinks that's a better approach, please speak up! llvm-svn: 55689	2008-09-03 05:54:33 +00:00
Devang Patel	b530f08122	Check iteration count. llvm-svn: 55680	2008-09-03 00:10:56 +00:00
Devang Patel	43c5a52e07	If all IV uses are extending integer IV then change the type of IV itself, if possible. llvm-svn: 55674	2008-09-02 22:18:08 +00:00
Devang Patel	bfa535af9f	respect inline=never and inline=always notes. llvm-svn: 55673	2008-09-02 22:16:13 +00:00
Devang Patel	4310d39844	If IV is used in a int-to-float cast inside the loop then try to eliminate the cast operation. llvm-svn: 55374	2008-08-26 17:57:54 +00:00
Chris Lattner	3f972c9150	Fix PR2423 by checking all indices for out of range access, not only indices that start with an array subscript. x->field[10000] is just as bad as (*X)[14][10000]. llvm-svn: 55226	2008-08-23 05:21:06 +00:00
Nick Lewycky	99f4558117	Revert r54876 r54877 r54906 and r54907. Evan found that these caused a 20% slowdown in bzip2. llvm-svn: 55113	2008-08-21 05:56:10 +00:00
Bill Wendling	fe18a8d9f1	XFAIL this test for now. llvm-svn: 54929	2008-08-18 18:29:54 +00:00
Nick Lewycky	53b44029d6	Consider the case where xor by -1 and xor by 128 have been combined already to produce an xor by 127. llvm-svn: 54906	2008-08-17 19:58:24 +00:00
Evan Cheng	8ec334f45e	Didn't mean to change this. llvm-svn: 54904	2008-08-17 19:25:28 +00:00
Evan Cheng	ab35bfdf18	Fix a (u)comiss intrinsic lowering bug. It was using anyext which can return junk in higher bits. Patch by Nate Begeman. llvm-svn: 54903	2008-08-17 19:22:34 +00:00
Nick Lewycky	18f50b2637	Xor'ing both sides of icmp by sign-bit is equivalent to swapping signedness of the predicate. Also, make this optz'n apply in more cases where it's safe to do so. llvm-svn: 54876	2008-08-17 07:34:14 +00:00
Owen Anderson	2a6adfa4f0	Remove GCSE and LoadVN from the testsuite. llvm-svn: 54832	2008-08-16 00:00:54 +00:00
Devang Patel	f2a03d5a4b	Reapply 54786. Add overflow and number of mantissa bits checks. llvm-svn: 54821	2008-08-15 21:21:34 +00:00
Evan Cheng	86834d29f3	Revert 54786. It's not checking for overflows, etc. llvm-svn: 54813	2008-08-15 08:12:11 +00:00
Devang Patel	054a833dd4	If IV is used in a int-to-float cast inside the loop then try to eliminate the cast opeation. llvm-svn: 54786	2008-08-14 20:58:31 +00:00
Dan Gohman	6134fbccef	Fix a bogus srem rule - a negative value srem'd by a power-of-2 can have a non-negative result; for example, -16%16 is 0. Also, clarify the related comments. This fixes PR2670. llvm-svn: 54767	2008-08-13 23:12:35 +00:00
Dan Gohman	8ded5d5884	Fix SCCP's handling of struct value loads and stores. SCCP doesn't track individual leaf values in such cases, so it needs to treat struct values as normal values in this case. llvm-svn: 54760	2008-08-13 21:22:48 +00:00
Devang Patel	97387e6615	Check sign to detect overflow before changing compare stride. llvm-svn: 54710	2008-08-13 02:05:14 +00:00
Chris Lattner	2aa0ff27aa	Implement support for simplifying vector comparisons by 0.0 and 1.0 like we do for scalars. Patch contributed by Nicolas Capens This also generalizes the previous xforms to work on long double, now that isExactlyValue works for long double. llvm-svn: 54653	2008-08-11 22:06:05 +00:00
Matthijs Kooijman	d705b2be1f	Add a basic test for the SRETPromotion pass. llvm-svn: 54466	2008-08-07 15:55:18 +00:00
Matthijs Kooijman	0620096c18	Move two tests from SRETPromotion to Inline, since they only call opt -inline. llvm-svn: 54465	2008-08-07 15:36:46 +00:00
Dan Gohman	ac22cfcae9	Fix a shufflevector instcombine that was emitting invalid masks indices when it meant to be emitting undef indices. llvm-svn: 54417	2008-08-06 18:17:32 +00:00
Evan Cheng	2bd97afb99	PR2535, not PR2355. llvm-svn: 54416	2008-08-06 18:06:48 +00:00
Evan Cheng	907dc2bc37	Fix PR2355: bug in ChangeCompareStride. When the loop termination compare is the only use of its iv stride, the stride can be eliminated by moving it to another stride. If the scale is negative, swap the predicate instead of using a inverse predicate. llvm-svn: 54415	2008-08-06 18:04:43 +00:00
Chris Lattner	f5b353c1fd	optimize a common idiom generated by clang for bitfield access, PR2638. llvm-svn: 54408	2008-08-06 07:35:52 +00:00
Chris Lattner	7bdaecb7f4	Zap sitofp/fptoui pairs. In all cases when the sign difference matters, the result is undefined anyway. llvm-svn: 54396	2008-08-06 05:13:06 +00:00
Nick Lewycky	bf42893567	Reinstate this optimization, but without the miscompile. Thanks to Bill for tracking down that this was breaking llvm-gcc bootstrap on Linux. llvm-svn: 54394	2008-08-06 04:54:03 +00:00
Bill Wendling	0e966d3e2c	Just grep for through the LL code instead of the ASM code llvm-svn: 54389	2008-08-06 00:10:32 +00:00
Bill Wendling	bc6786e7ee	Add default architecture. llvm-svn: 54384	2008-08-05 23:36:00 +00:00
Bill Wendling	3dfa168d22	Testcase for PR2629. llvm-svn: 54377	2008-08-05 22:23:59 +00:00
Bill Wendling	ee12a7aeff	Revert r53282. This was causing a miscompile on Linux. Also, the transformation looks bogus. Please see PR2629 for details on why this is breaking things. llvm-svn: 54372	2008-08-05 21:23:45 +00:00
Matthijs Kooijman	98b5c16e3b	Add -unroll-allow-partial command line option that enabled the loop unroller to partially unroll a loop when fully unrolling would not fit under the threshold. Patch by Mikael Lepistö. llvm-svn: 54160	2008-07-29 13:21:23 +00:00
Matthijs Kooijman	fd3070459b	Restructure ArgumentPromotion a bit. Instead of just having a single boolean that says "unconditional loads from this argument are safe", we now keep track of the safety per set of indices from which loads happen. This prevents ArgPromotion from promoting loads that aren't really valid. As an added effect, this will now disregard the the type of the indices passed to a GEP, so "load GEP %A, i32 1" and "load GEP %A, i64 1" will result in a single argument, not two. This fixes PR2598, for which a testcase has been added as well. llvm-svn: 54159	2008-07-29 10:00:13 +00:00
Owen Anderson	3f3389745d	Add support for eliminating stores that store the same value that was just loaded. This fixes PR2599. llvm-svn: 54133	2008-07-28 16:14:26 +00:00
Dan Gohman	5f36a32e7b	Put the LICM of constant GlobalVariables, introduced in r53945, under a command-line option, and disable it by default. It introduced performance regressions because CodeGen is currently not able to remat such loads. llvm-svn: 53997	2008-07-24 23:57:25 +00:00
Chris Lattner	8a8fb908dc	"Allow LICM to sink or lift loads from constant memory. Also add a test case for this. This allows instructions like loads from global variables declared to be constant to be moved out of loops." Patch by Stefanus Du Toit! llvm-svn: 53945	2008-07-23 05:06:28 +00:00
Dan Gohman	fa1211f69b	Enable first-class aggregates support. Remove the GetResultInst instruction. It is still accepted in LLVM assembly and bitcode, where it is now auto-upgraded to ExtractValueInst. Also, remove support for return instructions with multiple values. These are auto-upgraded to use InsertValueInst instructions. The IRBuilder still accepts multiple-value returns, and auto-upgrades them to InsertValueInst instructions. llvm-svn: 53941	2008-07-23 00:34:11 +00:00
Dan Gohman	60bae3faaf	Add the PR number to the test. llvm-svn: 53880	2008-07-21 21:50:25 +00:00
Dan Gohman	7ad3cd8c9d	Fix a bug in LSR's dead-PHI cleanup. If a PHI has a def-use chain that leads into a cycle involving a different PHI, LSR got stuck running around that cycle looking for the original PHI. To avoid this, keep track of visited PHIs and stop searching if we see one more than once. This fixes PR2570. llvm-svn: 53879	2008-07-21 21:45:02 +00:00
Matthijs Kooijman	8b69d77a7a	Make GlobalOpt preserve address spaces when scalar replacing aggregate globals. llvm-svn: 53716	2008-07-17 11:59:53 +00:00
Chris Lattner	c600c53d1f	Fix PR2553 llvm-svn: 53715	2008-07-17 06:07:20 +00:00
Matthijs Kooijman	f22f34f0cc	Add a few cases to instcombine's extractvalue testcase. llvm-svn: 53675	2008-07-16 12:57:25 +00:00
Matthijs Kooijman	c9d4e06f3a	Un-XFAIL multdeadretval, since instcombine now properly handles the mess deadargelim leaves behind :-) llvm-svn: 53674	2008-07-16 12:56:52 +00:00
Evan Cheng	c97094552c	Fix PR2296. Do not transform x86_sse2_storel_dq into a full-width store. llvm-svn: 53666	2008-07-16 07:28:14 +00:00
Matthijs Kooijman	af3594cca1	XFAIL the multdeadretval test for now, I will be fixing instcombine to make it work again tomorrow. llvm-svn: 53614	2008-07-15 16:05:09 +00:00
Matthijs Kooijman	121a206292	Remove a few tests which no longer hold for deadargelim (since it is now allowed to canonicalize return values). Add a test that checks if return value and function attributes are not removed. llvm-svn: 53612	2008-07-15 14:57:01 +00:00
Matthijs Kooijman	d881fb0a5b	Add a testcase for the canonicalizations now performed by deadargelim. llvm-svn: 53611	2008-07-15 14:42:58 +00:00
Matthijs Kooijman	c1da874478	Make deadargelim a bit less smart, so it doesn't choke on nested structs as return values that are still (partially) live. Instead of updating all uses of a call instruction after removing some elements, it now just rebuilds the original struct (With undef gaps where the unused values were) and leaves it to instcombine to clean this up. The added testcase still fails currently, but this is due to instcombine which isn't good enough yet. I will fix that part next. llvm-svn: 53608	2008-07-15 14:03:10 +00:00
Matthijs Kooijman	d337446b8e	Fix typo. llvm-svn: 53605	2008-07-15 13:15:10 +00:00
Chris Lattner	16395e51f4	Fix PR2506 by being a bit more careful about reverse fact propagation when disproving a condition. This actually compiles the existing testcase (udiv_select_to_select_shift) to: define i64 @test(i64 %X, i1 %Cond) { entry: %divisor1.t = lshr i64 %X, 3 ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %divisor1.t, %quotient2 ; <i64> [#uses=1] ret i64 %sum } instead of: define i64 @test(i64 %X, i1 %Cond) { entry: %quotient1.v = select i1 %Cond, i64 3, i64 4 ; <i64> [#uses=1] %quotient1 = lshr i64 %X, %quotient1.v ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %quotient1, %quotient2 ; <i64> [#uses=1] ret i64 %sum } llvm-svn: 53534	2008-07-14 00:15:52 +00:00
Chris Lattner	80b03a1b49	Fix mishandling of the infinite loop case when merging two blocks. This fixes PR2540. llvm-svn: 53533	2008-07-13 22:23:11 +00:00
Nick Lewycky	f76aa23b54	Enhance analysis of srem. Remove dead code analyzing urem. 'urem' of power-of-2 is canonicalized to an 'and' instruction. llvm-svn: 53506	2008-07-12 05:04:38 +00:00
Nick Lewycky	f95b64acaa	Add another optimization from PR2330. Also catch some missing cases that are similar. llvm-svn: 53451	2008-07-11 07:20:53 +00:00
Chris Lattner	6af608b8ce	Fix folding of icmp's of i1 where the comparison is signed. The code was using the algorithm for folding unsigned comparisons which is completely wrong. This has been broken since the signless types change. llvm-svn: 53444	2008-07-11 04:20:58 +00:00
Chris Lattner	4fa8bb3430	Fix a bogus optimization: folding (slt (zext i1 A to i32), 1) -> (slt i1 A, true) This cause a regression in InstCombine/JavaCompare, which was doing the right thing on accident. To handle the missed case, generalize the comparisons based on masked bits a little bit to handle comparisons against the max value. For example, we can now xform (slt i32 (and X, 4), 4) -> (setne i32 (and X, 4), 4) llvm-svn: 53443	2008-07-11 04:09:09 +00:00
Chris Lattner	1be09d9e21	make this condition more precise. llvm-svn: 53442	2008-07-11 03:54:57 +00:00
Matthijs Kooijman	e0f3ab82c4	Restructure dead argument elimination, try #3 :-) Rewrite the DeadArgumentElimination pass, to use a more explicit tracking of dependencies between return values and/or arguments. Also make the handling of arguments and return values the same. The pass now looks properly inside returned structs, but only at the first level (ie, not inside nested structs). This version fixed a few more bugs and was cleaned up a bit. It now passes all of LLVM's testing, and should still pass SPEC2006. There is still a minor bug with regard to returning nested structs. Since there is currently nothing that emits such IR, I will fix that in a seperate commit (partly because it requires a non-trivial fix). llvm-svn: 53400	2008-07-10 10:24:08 +00:00
Nick Lewycky	6193a564ab	Fix overzealous optimization. Thanks to Duncan Sands for pointing out my error! llvm-svn: 53393	2008-07-10 05:51:40 +00:00
Chris Lattner	67136cff7b	Fix a case where vector comparison constant folding would cause an infinite recursion. part of PR2529 llvm-svn: 53383	2008-07-10 00:29:28 +00:00
Chris Lattner	b69689e18b	elementwise comparison of vector constants was completely wrong. Fix it for PR2529 llvm-svn: 53380	2008-07-10 00:08:17 +00:00
Nick Lewycky	f9c27c343a	Fold (a < 8) && (b < 8) into (a\|b) < 8 for unsigned less or greater than. llvm-svn: 53282	2008-07-09 07:29:11 +00:00
Nick Lewycky	364661c43e	Fold ((1 << a) & 1) to (a == 0). llvm-svn: 53276	2008-07-09 05:20:13 +00:00
Chris Lattner	7212e2014c	Fix a broken test. Neither load is eliminable without changing the CFG. llvm-svn: 53273	2008-07-09 05:01:02 +00:00
Nick Lewycky	0d3645e673	Reduce x - y to -y when we know the 'x' part will get masked off anyways. llvm-svn: 53271	2008-07-09 04:32:37 +00:00
Devang Patel	51cbf928ab	If loop induction variable's start value is less then its exit value then do not split the loop. llvm-svn: 53265	2008-07-09 00:12:01 +00:00
Chris Lattner	4f1a9ffc09	'Optimize' test llvm-svn: 53242	2008-07-08 18:33:33 +00:00
Chris Lattner	855d2c38ec	new testcase for PR2496 llvm-svn: 53239	2008-07-08 17:18:05 +00:00
Chris Lattner	d137a08b0d	Fix three bugs: 1) evaluate [v]fcmp true/false with undefs to true or false instead of undef. 2) fix vector comparisons with undef to return a vector result instead of i1 3) fix vector comparisons with evaluatable results to return vector true/false instead of i1 true/false (PR2529) llvm-svn: 53220	2008-07-08 05:46:34 +00:00
Nick Lewycky	9f1a4dc672	Fix missed optimization opportunity when analyzing cast of mul and select. llvm-svn: 53151	2008-07-05 21:19:34 +00:00
Owen Anderson	d57cdc3c60	Remove the ability for ADCE to remove unreachable blocks in loop nests, because, as Eli pointed out, SimplifyCFG already does this. llvm-svn: 53104	2008-07-03 17:21:41 +00:00
Owen Anderson	323b5755a6	Add support to ADCE for pruning unreachable blocks. This addresses the final part of PR2509. llvm-svn: 53038	2008-07-02 18:05:19 +00:00
Owen Anderson	b22a640fe4	A better fix for PR2503 that doesn't pessimize GVN in the presence of unreachable blocks. llvm-svn: 53032	2008-07-02 17:20:16 +00:00
Evan Cheng	f79a4a1f01	XFAIL for now. llvm-svn: 52795	2008-06-26 22:09:29 +00:00
Owen Anderson	1fb47ad928	Use the -enable-pre flag so this test doesn't fail. llvm-svn: 52784	2008-06-26 17:03:28 +00:00
Chris Lattner	c9c81fb0df	Fix PR2488, a case where we deleted stack restores too aggressively. llvm-svn: 52702	2008-06-25 05:59:28 +00:00
Dan Gohman	04c8bd7e11	Revert 52645, the loop unroller changes. It caused a regression in 252.eon. llvm-svn: 52688	2008-06-24 20:44:42 +00:00
Matthijs Kooijman	c702e1d32f	Commit the new DeadArgElim pass again, this time with the gcc bootstrap failures fixed. Also add a testcase to reproduce the gcc bootstrap failure in very much reduced form. llvm-svn: 52677	2008-06-24 16:30:26 +00:00
Dan Gohman	48c5c7e860	Revamp the loop unroller, extending it to correctly update PHI nodes in the presence of out-of-loop users of in-loop values and the trip count is not a known multiple of the unroll count, and to be a bit simpler overall. This fixes PR2253. llvm-svn: 52645	2008-06-23 21:29:41 +00:00
Dan Gohman	5ca5e02480	Improve LSR's dead-phi detection to handle use-def cycles with more than two nodes. llvm-svn: 52617	2008-06-22 20:44:02 +00:00
Chris Lattner	6ff85681e4	Fix PR2369 by making scalarrepl more careful about promoting structures. Its default threshold is to promote things that are smaller than 128 bytes, which is sane. However, it is not sane to do this for things that turn into 128 registers. Add a cap on the number of registers introduced, defaulting to 128/4=32. llvm-svn: 52611	2008-06-22 17:46:21 +00:00
Eli Friedman	d3449df326	Fix for PR2479: correctly optimize expressions like (a > 13) & (a == 15). See also PR1800, which is about the signed case. llvm-svn: 52608	2008-06-21 23:36:13 +00:00
Duncan Sands	c7ef3cb43f	This file is empty. llvm-svn: 52596	2008-06-21 20:26:50 +00:00
Evan Cheng	33067210d1	Back out Matthijs' DAE patches. It's miscompiling gcc driver. llvm-svn: 52570	2008-06-21 00:31:44 +00:00
Matthijs Kooijman	48b282f03b	Add testcase that checks that DeadArgElim doesn't touch stuff it shouldn't touch. llvm-svn: 52540	2008-06-20 15:38:22 +00:00
Matthijs Kooijman	8d32dee428	Recommit r52459, rewriting of the dead argument elimination pass. This is a fixed version that no longer uses multimap::equal_range, which resulted in a pointer invalidation problem. Also, DAE::InspectedFunctions was not really necessary, so it got removed. Lastly, this version no longer applies the extra arg hack on functions who did not have any arguments to start with. llvm-svn: 52532	2008-06-20 09:36:16 +00:00
Chris Lattner	f3ecd2d290	Fix PR2471, which is a bug involving an invalid promotion from a conditional load. llvm-svn: 52525	2008-06-20 05:12:56 +00:00
Matthijs Kooijman	a087b73dba	Modify some ipconstprop tests to also test with invokes. llvm-svn: 52491	2008-06-19 09:27:44 +00:00
Owen Anderson	83c6a9d7c1	Remove this test until the corresponding patch is reapplied because it's causing make check to crash for some people. llvm-svn: 52473	2008-06-18 22:37:31 +00:00
Owen Anderson	6a903bc601	Add local PRE to GVN. This only operates in cases where it would not increase code size, namely when the instantiated expression would only need to be created in one predecessor. llvm-svn: 52471	2008-06-18 21:41:49 +00:00
Matthijs Kooijman	964557fdf5	Rewrite the DeadArgumentElimination pass, to use a more explicit tracking of dependencies between return values and/or arguments. Also make the handling of arguments and return values the same. The pass now looks properly inside returned structs, but only at the first level (ie, not inside nested structs). Also add a testcase for testing various variations of (multiple) dead rerturn values. llvm-svn: 52459	2008-06-18 11:12:53 +00:00
Matthijs Kooijman	fd17357643	Reapply r52397 (make IPConstProp promote returned arguments), but fixed this time. Sorry for the trouble! This time, also add a testcase, which I should have done in the first place... llvm-svn: 52455	2008-06-18 08:30:37 +00:00
Matthijs Kooijman	97034598b1	Reapply r52396, it was unrelated to the breakage (that was caused by r52397, my commit after this). llvm-svn: 52453	2008-06-18 08:09:27 +00:00
Chris Lattner	ef36dcd10b	implement some simple bswap optimizations, rdar://5992453 llvm-svn: 52442	2008-06-18 04:33:20 +00:00
Chris Lattner	466e1f8dd2	temporarily revert this testcase since its patch was reverted. llvm-svn: 52441	2008-06-18 04:03:23 +00:00
Chris Lattner	b5ee8b3e89	make truncate/sext elimination capable of changing phi's. This implements rdar://6013816 and the testcase in Transforms/InstCombine/sext-misc.ll. llvm-svn: 52440	2008-06-18 04:00:49 +00:00
Devang Patel	cd6b697945	Preserve dominance frontier while trivially unswitching loop. llvm-svn: 52438	2008-06-18 02:16:38 +00:00
Matthijs Kooijman	f03c1ae407	Learn IPConstProp to look at individual return values and propagate them individually. Also learn IPConstProp how returning first class aggregates work, in addition to old style multiple return instructions. Modify the return-constants testscase to confirm this behaviour. llvm-svn: 52396	2008-06-17 12:02:52 +00:00
Dan Gohman	ab0dccba6b	Refine the change in r52258 for avoiding use-before-def conditions when changing the stride of a comparison so that it's slightly more precise, by having it scan the instruction list to determine if there is a use of the condition after the point where the condition will be inserted. llvm-svn: 52371	2008-06-16 22:34:15 +00:00
Matthijs Kooijman	ac5bc8a3dd	Make testcase check for extractvalue instead of extractelement. llvm-svn: 52317	2008-06-16 13:03:44 +00:00
Matthijs Kooijman	bcd4047dd4	Store the result of multiple identical run lines in a temporary file. llvm-svn: 52314	2008-06-16 12:21:25 +00:00
Wojciech Matyjewicz	ae9753b29d	Fix PR2434. When scanning for exising binary operator to reuse don't take into account the instrucion pointed by InsertPt. Thanks to it, returning the new value of InsertPt to the InsertBinop() caller can be avoided. The bug was, actually, in visitAddRecExpr() method which wasn't correctly handling changes of InsertPt. There shouldn't be any performance regression, as -gvn pass (run after -indvars) removes any redundant binops. llvm-svn: 52291	2008-06-15 19:07:39 +00:00
Eli Friedman	2c580f0323	Remove unnecessary target lines. llvm-svn: 52261	2008-06-13 22:12:16 +00:00
Eli Friedman	795749845d	Remove unnecessary target lines. llvm-svn: 52260	2008-06-13 22:10:32 +00:00
Eli Friedman	5de0a77a9b	Don't skip over instructions other than loads that might read memory when trying to sink stores. llvm-svn: 52259	2008-06-13 22:02:12 +00:00
Dan Gohman	9ad8c54aab	Protect ChangeCompareStride from situations in which it is possible for it to generate use-before-def IR, such as in this testcase. llvm-svn: 52258	2008-06-13 21:43:41 +00:00
Eli Friedman	9833a1b407	Make sure SimplifyStoreAtEndOfBlock doesn't mess with loops; the structure checks are incorrect if the blocks aren't distinct. Fixes PR2435. llvm-svn: 52257	2008-06-13 21:17:49 +00:00
Evan Cheng	2d788ce3fb	Fix some tests. llvm-svn: 52245	2008-06-12 21:23:38 +00:00
Evan Cheng	70fe16353a	Revert 52223. llvm-svn: 52243	2008-06-12 20:55:39 +00:00
Matthijs Kooijman	653be276b5	Add line continuation character so the avoid dup loop header test actually runs. llvm-svn: 52228	2008-06-12 08:49:04 +00:00
Evan Cheng	f3c2902ead	Avoid duplicating loop header which leads to unnatural loops (and just seem like general badness to me, likely to cause code explosion). Patch by Florian Brandner. llvm-svn: 52223	2008-06-11 19:07:54 +00:00
Matthijs Kooijman	b2fc72bfbf	Teach instruction combining about the extractvalue. It can succesfully fold useless insert-extract chains, similar to how it folds them for vectors. Add a testcase for this. llvm-svn: 52217	2008-06-11 14:05:05 +00:00
Matthijs Kooijman	6c0890a169	Ignore stderr for some more tests that expect warnings there. This fixes 2 testcases. llvm-svn: 52184	2008-06-10 16:13:38 +00:00
Matthijs Kooijman	a2f743eaff	Fix some escaping and quoting in RUN lines, mainly involving { and <. In two cases quoting of <{ didn't work out, so I changed the grep to check for }> instead. This fixes 7 testcases that were not properly running before. llvm-svn: 52182	2008-06-10 16:04:47 +00:00
Matthijs Kooijman	d1057f49b4	Let some more tests ignore expected output on stderr. Also, use > %t instead of -o %t for output in one test since that also works when %t already exists. This fixes 6 testcases. llvm-svn: 52178	2008-06-10 15:04:14 +00:00
Dan Gohman	632a55e2cc	Fix two more not-grep tests that were missing llvm-dis. llvm-svn: 52159	2008-06-09 22:36:45 +00:00
Duncan Sands	d0cd426131	Test that prune-eh doesn't make deductions based on bodies of functions with weak linkage. llvm-svn: 52141	2008-06-09 11:28:41 +00:00
Chris Lattner	9c9f531a47	lower calls to abs to inline code, PR2337 llvm-svn: 52138	2008-06-09 08:26:51 +00:00
Chris Lattner	dbd595f22d	Fix PR2411, where ip constant prop would propagate the result of a weak function. llvm-svn: 52137	2008-06-09 07:58:07 +00:00
Chris Lattner	b4866ef30c	Limit the icmp+phi merging optimization to the cases where it is profitable: don't make i1 phis when it won't be possible to eliminate them. llvm-svn: 52097	2008-06-08 20:52:11 +00:00
Evan Cheng	89200c9177	Speculatively execute a block when the the block is the then part of a triangle shape and it contains a single, side effect free, cheap instruction. The branch is eliminated by adding a select instruction. i.e. Turn BB: %t1 = icmp br i1 %t1, label %BB1, label %BB2 BB1: %t3 = add %t2, c br label BB2 BB2: => BB: %t1 = icmp %t4 = add %t2, c %t3 = select i1 %t1, %t2, %t3 llvm-svn: 52073	2008-06-07 08:52:29 +00:00
Evan Cheng	003b4b0cd2	Fix run line. llvm-svn: 52072	2008-06-07 08:40:16 +00:00
Zhou Sheng	c775e462a8	Add a test case for opt -instcombine bug fix in revision 52003. llvm-svn: 52004	2008-06-05 14:25:11 +00:00
Matthijs Kooijman	812989b147	Learn ScalarReplAggregrates how stores and loads of first class aggregrates work and how to replace them into individual values. Also, when trying to replace an aggregrate that is used by load or store with a single (large) integer, don't crash (but don't replace the aggregrate either). Also adds a testcase for both structs and arrays. llvm-svn: 51997	2008-06-05 12:51:53 +00:00
Matthijs Kooijman	e0c5adc158	Let StructRetPromotion check if all if its users are really calls or invokesn, not other instructions. This fixes a crash with the added testcase. llvm-svn: 51992	2008-06-05 08:57:20 +00:00
Matthijs Kooijman	463f86639d	Let StructRetPromotion check if it's users are really calling it and not passing its pointer. Fixes test with added testcase. llvm-svn: 51991	2008-06-05 08:48:32 +00:00
Owen Anderson	6c871c0866	Testcase for LoopIndexSplit and DomFrontier. llvm-svn: 51916	2008-06-03 18:32:27 +00:00
Devang Patel	7314d0ee3c	Update dom tree. Fix PR 2372. llvm-svn: 51887	2008-06-02 22:52:56 +00:00
Owen Anderson	38099c1b6e	Fix two issues that Eli Friedman pointed out, where would misoptimized code like: char a[200]; init(a, a+200); OR int a[200]; char* b = (char)a; char c = (char*)a; foo(b, c); llvm-svn: 51850	2008-06-01 22:26:26 +00:00
Owen Anderson	e625a3b83f	Test for PR2401 llvm-svn: 51849	2008-06-01 21:55:55 +00:00
Duncan Sands	0397cd2ec4	When simplifying a call to a bitcast function, tighten up the conditions for performing the transform when only the function declaration is available: no longer allow turning i32 into i64 for example. Only allow changing between pointer types, and between pointer types and integers of the same size. For return values ptr -> intptr was already allowed; I added ptr -> ptr and intptr -> ptr while there. As shown by a recent objc testcase, changing the way parameters/return values are passed can be fatal when calling code written in assembler that directly manipulates call arguments and return values unless the transform has no impact on the way they are passed at the codegen level. While it is possible to imagine an ABI that treats integers of pointer size differently to pointers, I don't think LLVM supports any so the transform should now be safe while still being useful. llvm-svn: 51834	2008-06-01 07:38:42 +00:00
Nick Lewycky	035fe6f716	Peer through sext/zext when looking for not(cmp). llvm-svn: 51819	2008-05-31 19:01:33 +00:00
Nick Lewycky	26b8cd84b3	Add more i1 optimizations. add, sub, mul, s/udiv on i1 are now simplified away. llvm-svn: 51817	2008-05-31 17:59:52 +00:00
Nick Lewycky	df9242a833	Adding i1 is always Xor. llvm-svn: 51816	2008-05-31 17:10:28 +00:00
Owen Anderson	7686b555e2	Replace the old ADCE implementation with a new one that more simply solves the one case that ADCE catches that normal DCE doesn't: non-induction variable loop computations. This implementation handles this problem without using postdominators. llvm-svn: 51668	2008-05-29 08:45:13 +00:00
Chris Lattner	ecdefb5df7	Implement PR2370: memmove(x,x,size) -> noop. llvm-svn: 51636	2008-05-28 05:30:41 +00:00
Nick Lewycky	f6ccd2580c	"ret (constexpr)" can't be folded into a Constant. Add a method to Analysis/ConstantFolding to fold ConstantExpr's, then make instcombine use it to try to use targetdata to fold constant expressions on void instructions. Also extend the icmp(inttoptr, inttoptr) folding to handle the case where int size != ptr size. llvm-svn: 51559	2008-05-25 20:56:15 +00:00
Chris Lattner	87a099a057	Fix a serious brain-o. Obviously no-one reviewed my patch :( This fixes PR2359 llvm-svn: 51536	2008-05-24 04:06:28 +00:00
Chris Lattner	5c207c83c6	Fix PR2358 by resolving calls with undef arguments to overdefined. llvm-svn: 51535	2008-05-24 03:59:33 +00:00
Dan Gohman	5e7863de1b	Remove lingering references to .llx and .tr in the tests. llvm-svn: 51500	2008-05-23 21:15:35 +00:00
Matthijs Kooijman	aef2b8198b	Restucture a part of the SimplifyCFG pass and include a testcase. The SimplifyCFG pass looks at basic blocks that contain only phi nodes, followed by an unconditional branch. In a lot of cases, such a block (BB) can be merged into their successor (Succ). This merging is performed by TryToSimplifyUncondBranchFromEmptyBlock. It does this by taking all phi nodes in the succesor block Succ and expanding them to include the predecessors of BB. Furthermore, any phi nodes in BB are moved to Succ and expanded to include the predecessors of Succ as well. Before attempting this merge, CanPropagatePredecessorsForPHIs checks to see if all phi nodes can be properly merged. All functional changes are made to this function, only comments were updated in TryToSimplifyUncondBranchFromEmptyBlock. In the original code, CanPropagatePredecessorsForPHIs looks quite convoluted and more like stack of checks added to handle different kinds of situations than a comprehensive check. In particular the first check in the function did some value checking for the case that BB and Succ have a common predecessor, while the last check in the function simply rejected all cases where BB and Succ have a common predecessor. The first check was still useful in the case that BB did not contain any phi nodes at all, though, so it was not completely useless. Now, CanPropagatePredecessorsForPHIs is restructured to to look a lot more similar to the code that actually performs the merge. Both functions now look at the same phi nodes in about the same order. Any conflicts (phi nodes with different values for the same source) that could arise from merging or moving phi nodes are detected. If no conflicts are found, the merge can happen. Apart from only restructuring the checks, two main changes in functionality happened. Firstly, the old code rejected blocks with common predecessors in most cases. The new code performs some extra checks so common predecessors can be handled in a lot of cases. Wherever common predecessors still pose problems, the blocks are left untouched. Secondly, the old code rejected the merge when values (phi nodes) from BB were used in any other place than Succ. However, it does not seem that there is any situation that would require this check. Even more, this can be proven. Consider that BB is a block containing of a single phi node "%a" and a branch to Succ. Now, since the definition of %a will dominate all of its uses, BB will dominate all blocks that use %a. Furthermore, since the branch from BB to Succ is unconditional, Succ will also dominate all uses of %a. Now, assume that one predecessor of Succ is not dominated by BB (and thus not dominated by Succ). Since at least one use of %a (but in reality all of them) is reachable from Succ, you could end up at a use of %a without passing through it's definition in BB (by coming from X through Succ). This is a contradiction, meaning that our original assumption is wrong. Thus, all predecessors of Succ must also be dominated by BB (and thus also by Succ). This means that moving the phi node %a from BB to Succ does not pose any problems when the two blocks are merged, and any use checks are not needed. llvm-svn: 51478	2008-05-23 09:09:41 +00:00
Nick Lewycky	3bf5512d87	Constant integer vectors may also be negated. llvm-svn: 51476	2008-05-23 04:54:45 +00:00
Nick Lewycky	4f3d878507	Revert X + X --> X * 2 optz'n which pessimizes heavily on x86. llvm-svn: 51474	2008-05-23 04:34:58 +00:00
Nick Lewycky	452fb32927	Implement X + X for vectors. llvm-svn: 51472	2008-05-23 04:14:51 +00:00
Nick Lewycky	2ec9a01173	Fix a recently added optimization to not crash on vectors. llvm-svn: 51471	2008-05-23 03:26:47 +00:00
Dan Gohman	6d5f120c5c	Generalize the new code in instcombine's ComputeNumSignBits for handling and/or to handle more cases (such as this add-sitofp.ll testcase), and port it to selectiondag's ComputeNumSignBits. llvm-svn: 51469	2008-05-23 02:28:01 +00:00
Gabor Greif	d01c562e48	Eliminate questionable syntax for stdin redirection. This probably also speeds things up a bit. llvm-svn: 51357	2008-05-20 22:07:21 +00:00
Chris Lattner	b76ad168dc	Fix PR2346 by marking vaarg as volatile so that licm doesn't try to hoist them. llvm-svn: 51356	2008-05-20 22:05:28 +00:00
Dan Gohman	0843435b36	Oops, commit the version of this test that actually works. llvm-svn: 51351	2008-05-20 21:19:36 +00:00
Dan Gohman	81ab753b14	Port SelectionDAG's ComputeNumSignBits-using code to instcombine, now that instcombine also has ComputeNumSignBits. llvm-svn: 51350	2008-05-20 21:01:12 +00:00
Gabor Greif	1e427c3264	sabre brings to my attention that the 'tr' suffix is also obsolete llvm-svn: 51349	2008-05-20 21:00:03 +00:00
Gabor Greif	f45ff35bfe	Rename the last test with .llx extension to .ll, resolve duplicate test by renaming to isnan2. Now that no test has llx ending there is no need to search for them from dg.exp too. llvm-svn: 51328	2008-05-20 19:52:04 +00:00
Chris Lattner	7ac943fffd	Teach instcombine 4 new xforms: (add (sext x), cst) --> (sext (add x, cst')) (add (sext x), (sext y)) --> (sext (add int x, y)) (add double (sitofp x), fpcst) --> (sitofp (add int x, intcst)) (add double (sitofp x), (sitofp y)) --> (sitofp (add int x, y)) This generally reduces conversions. For example MiBench/telecomm-gsm gets these simplifications: HACK2: %tmp67.i142.i.i = sext i16 %tmp6.i141.i.i to i32 ; <i32> [#uses=1] %tmp23.i139.i.i = sext i16 %tmp2.i138.i.i to i32 ; <i32> [#uses=1] %tmp8.i143.i.i = add i32 %tmp67.i142.i.i, %tmp23.i139.i.i ; <i32> [#uses=3] HACK2: %tmp67.i121.i.i = sext i16 %tmp6.i120.i.i to i32 ; <i32> [#uses=1] %tmp23.i118.i.i = sext i16 %tmp2.i117.i.i to i32 ; <i32> [#uses=1] %tmp8.i122.i.i = add i32 %tmp67.i121.i.i, %tmp23.i118.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i190.i = sext i16 %tmp6.i.i189.i to i32 ; <i32> [#uses=1] %tmp23.i.i187.i = sext i16 %tmp2.i.i186.i to i32 ; <i32> [#uses=1] %tmp8.i.i191.i = add i32 %tmp67.i.i190.i, %tmp23.i.i187.i ; <i32> [#uses=3] HACK2: %tmp67.i173.i.i.i = sext i16 %tmp6.i172.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i170.i.i.i = sext i16 %tmp2.i169.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i174.i.i.i = add i32 %tmp67.i173.i.i.i, %tmp23.i170.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i152.i.i.i = sext i16 %tmp6.i151.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i149.i.i.i = sext i16 %tmp2.i148.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i153.i.i.i = add i32 %tmp67.i152.i.i.i, %tmp23.i149.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i.i.i = sext i16 %tmp6.i.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i.i5.i.i = sext i16 %tmp2.i.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i.i7.i.i = add i32 %tmp67.i.i.i.i, %tmp23.i.i5.i.i ; <i32> [#uses=3] This also fixes a bug in ComputeNumSignBits handling select and makes it more aggressive with and/or. llvm-svn: 51302	2008-05-20 05:46:13 +00:00
Devang Patel	ee7bf41c06	Do not erase induction variable increment if it is used outside the loop. llvm-svn: 51280	2008-05-19 22:23:55 +00:00
Chris Lattner	e35fe0f1c6	convert fptosi(sitofp x) -> x if the fp value has enough bits in its mantissa to accurately represent the integer. This triggers 9 times in 471.omnetpp, though 8 of those seem to be inlined from the same place. llvm-svn: 51271	2008-05-19 20:25:04 +00:00
Chris Lattner	5920a78034	Fold FP comparisons where one operand is converted from an integer type and the other operand is a constant into integer comparisons. This happens surprisingly frequently (e.g. 10 times in 471.omnetpp), which are things like this: %tmp8283 = sitofp i32 %tmp82 to double %tmp1013 = fcmp ult double %tmp8283, 0.0 Clearly comparing tmp82 against i32 0 is cheaper here. this also triggers 8 times in gobmk, including this one: %tmp375376 = sitofp i32 %tmp375 to double %tmp377 = fcmp ogt double %tmp375376, 8.150000e+01 which is comparing an integer against 81.5 :). llvm-svn: 51268	2008-05-19 20:18:56 +00:00
Chris Lattner	fc365b60dc	be more aggressive about transforming add -> or when the operands have no intersecting bits. This triggers all over the place, for example in lencode, with adds of stuff like: %tmp580 = mul i32 %tmp579, 2 %tmp582 = and i32 %b8, 1 and %tmp28 = shl i32 %abs.i, 1 %sign.0 = select i1 %tmp23, i32 1, i32 0 and %tmp344 = shl i32 %tmp343, 2 %tmp346 = and i32 %tmp96, 3 etc. llvm-svn: 51263	2008-05-19 20:01:56 +00:00
Duncan Sands	eec7a3c071	Fix PR2341 - when the length is 4 use an i32 not an i16! Cleaned up trailing whitespace while there. llvm-svn: 51240	2008-05-19 09:27:24 +00:00
Chris Lattner	4b2a724fb8	Fix PR2339 llvm-svn: 51226	2008-05-18 04:11:26 +00:00
Chris Lattner	14b3604dcf	remove empty file? llvm-svn: 51225	2008-05-18 04:10:18 +00:00
Nick Lewycky	eb185ca5e9	Revert constant-folding change that will miscompile in some cases. llvm-svn: 51223	2008-05-17 19:00:05 +00:00
Nick Lewycky	1ba90bb69b	Constant fold inttoptr and ptrtoint. llvm-svn: 51216	2008-05-17 09:03:26 +00:00
Evan Cheng	a9a0c1570d	Fix test. llvm-svn: 51191	2008-05-16 17:08:51 +00:00
Owen Anderson	bdcf408d67	Move this test from ADCE to loop deletion, where it is more appropriate. llvm-svn: 51181	2008-05-16 04:34:19 +00:00
Owen Anderson	ecc12fe81d	Use loop deletion instead of ADCE in these tests. llvm-svn: 51180	2008-05-16 04:33:37 +00:00
Owen Anderson	00c7d82391	Use loop deletion instead of ADCE for removing loops. llvm-svn: 51178	2008-05-16 04:27:38 +00:00
Chris Lattner	5c953b7d27	implement PR2328. llvm-svn: 51176	2008-05-16 02:59:42 +00:00
Bill Wendling	3716952f10	Situations can arise when you have a function called that returns a 'void', but is bitcast to return a floating point value. The result of the instruction may not be used by the program afterwards, and LLVM will happily remove all instructions except the call. But, on some platforms, if a value is returned as a floating point, it may need to be removed from the stack (like x87). Thus, we can't get rid of the bitcast even if there isn't a use of the value. llvm-svn: 51134	2008-05-14 22:45:20 +00:00
Devang Patel	f2763e233e	Simplify internalize pass. Add test case. Patch by Matthijs Kooijman! llvm-svn: 51114	2008-05-14 20:01:01 +00:00
Dale Johannesen	e695ab227c	Fix for PR 2323, infinite loop in tail dup. llvm-svn: 51063	2008-05-13 20:06:43 +00:00
Owen Anderson	525aa89356	Add a testcase for non-local CSE of read-only calls. llvm-svn: 51025	2008-05-13 08:17:44 +00:00
Duncan Sands	8111b67ca8	Testcase for PR2303. llvm-svn: 50951	2008-05-10 16:43:10 +00:00
Chris Lattner	aaba10e843	Implement PR2298. This transforms: ~x < ~y --> y < x -x == -y --> x == y llvm-svn: 50882	2008-05-09 05:19:28 +00:00
Chris Lattner	49a594e6ab	More than just loads can read from memory: readonly calls like strlen also need to be checked for memory modifying instructions before we can sink them. THis fixes the second half of PR2297. llvm-svn: 50860	2008-05-08 17:37:37 +00:00
Chris Lattner	4fa09669d8	Make instcombine's DSE respect loads as well as stores. It is not safe to delete the first store in: store x -> p load p store y -> p This is for PR2297. llvm-svn: 50859	2008-05-08 17:20:30 +00:00
Dan Gohman	5a3eecdfd8	Fix a bug in the ComputeMaskedBits logic for multiply. llvm-svn: 50793	2008-05-07 00:35:55 +00:00
Owen Anderson	ec9d8558d5	Testcase for r50770. llvm-svn: 50771	2008-05-06 21:01:34 +00:00
Dan Gohman	cf0e3acf16	Correct the value of LowBits in srem and urem handling in ComputeMaskedBits. llvm-svn: 50692	2008-05-06 00:51:48 +00:00
Chris Lattner	8ed8e3d0e6	Fix a crash when threading a block that includes a MRV call result. DemoteRegToStack doesn't work with MRVs yet, because it relies on the ability to load/store things. This fixes PR2285. llvm-svn: 50667	2008-05-05 20:21:22 +00:00
Dan Gohman	1962c2be6a	Fix a mistake in the computation of leading zeros for udiv. llvm-svn: 50591	2008-05-02 21:30:02 +00:00
Chris Lattner	5f0563ceb6	strength reduce exp2 into ldexp, rdar://5852514 llvm-svn: 50586	2008-05-02 18:43:35 +00:00
Dan Gohman	2cdcf2bd5f	Update old-style syntax in some "not grep" tests. llvm-svn: 50560	2008-05-01 23:50:07 +00:00
Dale Johannesen	6e91480c7c	New test for bug fixed in 50545. llvm-svn: 50548	2008-05-01 22:50:14 +00:00
Dan Gohman	4be6ae4e6c	Fix an overaggressive SimplifyDemandedBits optimization on urem. This fixes the 254.gap regression on x86 and the 403.gcc regression on x86-64. llvm-svn: 50537	2008-05-01 19:13:24 +00:00
Chris Lattner	0ceb8807e2	fix typo llvm-svn: 50519	2008-05-01 06:16:48 +00:00
Chris Lattner	9ad872201b	instcombine does memset optzns. llvm-svn: 50518	2008-05-01 06:16:38 +00:00
Chris Lattner	4add60753b	simplifylibcalls doesn't optimize llvm.memmove, instcombine does. llvm-svn: 50517	2008-05-01 06:14:24 +00:00
Chris Lattner	adf28cb71c	move some tests from libcall optimizer suite. llvm-svn: 50516	2008-05-01 06:13:48 +00:00
Owen Anderson	0255998dcf	Move this test to LoopDeletion, where it now passes. llvm-svn: 50474	2008-04-30 07:17:22 +00:00
Chris Lattner	2dc4426675	move lowering of llvm.memset -> store from simplify libcalls to instcombine. llvm-svn: 50472	2008-04-30 06:39:11 +00:00
Chris Lattner	f5f944aeaa	no reason for simplifylibcalls to simplify intrinsics, instcombine does a fine job. llvm-svn: 50470	2008-04-30 06:12:15 +00:00
Chris Lattner	4b20032b08	remove redundant check. llvm-svn: 50469	2008-04-30 06:06:37 +00:00
Owen Anderson	ff7d7b18e5	Fix a bug in memcpyopt where the memcpy-memcpy transform was never being applied because we were checking for it in the wrong order. This caused a miscompilation because the return slot optimization assumes that the call it is dealing with is NOT a memcpy. llvm-svn: 50444	2008-04-29 21:26:06 +00:00
Chris Lattner	d9e3b5c5bd	don't eliminate load from volatile value on paths where the load is dead. This fixes the second half of PR2262 llvm-svn: 50430	2008-04-29 17:28:22 +00:00
Chris Lattner	53bcf3609a	make this test reduced and valid llvm-svn: 50429	2008-04-29 17:25:32 +00:00
Chris Lattner	9233c124c9	fix a subtle volatile handling bug. llvm-svn: 50428	2008-04-29 17:13:43 +00:00
Chris Lattner	e331a65c79	don't delete the last store to an alloca if the store is volatile. llvm-svn: 50390	2008-04-29 04:58:38 +00:00
Dan Gohman	8cb19d967f	Fix DSE to not eliminate volatile loads with no uses. llvm-svn: 50370	2008-04-28 19:51:27 +00:00
Dan Gohman	72ec3f4562	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Chris Lattner	8be72700b8	Fix PR2256, yet another miscompilation in simplifycfg of i multiple return values. Bill, please pull this into Tak. llvm-svn: 50332	2008-04-28 00:19:07 +00:00
Chris Lattner	67ca6f6347	When SRoA'ing a global variable, make sure the new globals get the appropriate alignment. This fixes a miscompilation of 252.eon on x86-64 (rdar://5891920). Bill, please pull this into Tak. llvm-svn: 50308	2008-04-26 07:40:11 +00:00
Nick Lewycky	4d43d3c72c	Remove 'unwinds to' support from mainline. This patch undoes r47802 r47989 r48047 r48084 r48085 r48086 r48088 r48096 r48099 r48109 and r48123. llvm-svn: 50265	2008-04-25 16:53:59 +00:00
Chris Lattner	f7de528463	Don't infininitely thread branches when a threaded edge goes back to the block, e.g.: Threading edge through bool from 'bb37.us.thread3829' to 'bb37.us' with cost: 1, across block: bb37.us: ; preds = %bb37.us.thread3829, %bb37.us, %bb33 %D1361.1.us = phi i32 [ %tmp36, %bb33 ], [ %D1361.1.us, %bb37.us ], [ 0, %bb37.us.thread3829 ] ; <i32> [#uses=2] %tmp39.us = icmp eq i32 %D1361.1.us, 0 ; <i1> [#uses=1] br i1 %tmp39.us, label %bb37.us, label %bb42.us llvm-svn: 50251	2008-04-25 04:12:29 +00:00
Chris Lattner	86bbf338e5	Split some code out of the main SimplifyCFG loop into its own function. Fix said code to handle merging return instructions together correctly when handling multiple return values. llvm-svn: 50199	2008-04-24 00:01:19 +00:00
Chris Lattner	5a58a4dc6d	Rewrite multiple return value handling in SCCP. Before, the -sccp pass would turn every getresult instruction into undef. This helps with rdar://5778210 llvm-svn: 50140	2008-04-23 05:38:20 +00:00
Chris Lattner	14f41bfc49	remove this testcase. It isn't testing loop rotate, it is testing all of -std-compile-opts and is now failing because other passes are generating IR that looks different to input of loop rotate. Devang, please introduce a testcase that only runs loop rotate. llvm-svn: 50136	2008-04-23 05:36:04 +00:00
Chris Lattner	3376d6d824	make this test more interesting. llvm-svn: 50128	2008-04-23 03:49:32 +00:00
Chris Lattner	2161d6c075	distill down the essense of this test. llvm-svn: 50125	2008-04-23 03:03:42 +00:00
Dale Johannesen	c4d3c1cbe0	new test llvm-svn: 50123	2008-04-23 01:22:22 +00:00
Evan Cheng	1c89ca7295	Don't do: "(X & 4) >> 1 == 2 --> (X & 4) == 4" if there are more than one uses of the shift result. llvm-svn: 50118	2008-04-23 00:38:06 +00:00
Chris Lattner	37e9c187b0	Start doing the significantly useful part of jump threading: handle cases where a comparison has a phi input and that phi is a constant. For example, stuff like: Threading edge through bool from 'bb2149' to 'bb2231' with cost: 1, across block: bb2237: ; preds = %bb2231, %bb2149 %tmp2328.rle = phi i32 [ %tmp2232, %bb2231 ], [ %tmp2232439, %bb2149 ] ; <i32> [#uses=2] %done.0 = phi i32 [ %done.2, %bb2231 ], [ 0, %bb2149 ] ; <i32> [#uses=1] %tmp2239 = icmp eq i32 %done.0, 0 ; <i1> [#uses=1] br i1 %tmp2239, label %bb2231, label %bb2327 or bb38.i298: ; preds = %bb33.i295, %bb1693 %tmp39.i296.rle = phi %struct.ibox* [ null, %bb1693 ], [ %tmp39.i296.rle1109, %bb33.i295 ] ; <%struct.ibox> [#uses=2] %minspan.1.i291.reg2mem.1 = phi i32 [ 32000, %bb1693 ], [ %minspan.0.i288, %bb33.i295 ] ; <i32> [#uses=1] %tmp40.i297 = icmp eq %struct.ibox %tmp39.i296.rle, null ; <i1> [#uses=1] br i1 %tmp40.i297, label %implfeeds.exit311, label %bb43.i301 This triggers thousands of times in spec. llvm-svn: 50110	2008-04-22 21:40:39 +00:00
Chris Lattner	d5425e8f8d	Dig through multiple levels of AND to thread jumps if needed. llvm-svn: 50106	2008-04-22 20:46:09 +00:00
Chris Lattner	3df4c15dc7	Teach jump threading to thread through blocks like: br (and X, phi(Y, Z, false)), label L1, label L2 This triggers once on 252.eon and 6 times on 176.gcc. Blocks in question often look like this: bb262: ; preds = %bb261, %bb248 %iftmp.251.0 = phi i1 [ true, %bb261 ], [ false, %bb248 ] ; <i1> [#uses=4] %tmp270 = icmp eq %struct.rtx_def* %tmp.0.i, null ; <i1> [#uses=1] %bothcond = or i1 %iftmp.251.0, %tmp270 ; <i1> [#uses=1] br i1 %bothcond, label %bb288, label %bb273 In this case, it is clear that it doesn't matter if tmp.0.i is null when coming from bb261. When coming from bb248, it is all that matters. Another random example: check_asm_operands.exit: ; preds = %check_asm_operands.exit.thr_comm, %bb30.i, %bb12.i, %bb6.i413 %tmp.0.i420 = phi i1 [ true, %bb6.i413 ], [ true, %bb12.i ], [ true, %bb30.i ], [ false, %check_asm_operands.exit.thr_comm ; <i1> [#uses=1] call void @llvm.stackrestore( i8* %savedstack ) nounwind %tmp4389 = icmp eq i32 %added_sets_1.0, 0 ; <i1> [#uses=1] %tmp4394 = icmp eq i32 %added_sets_2.0, 0 ; <i1> [#uses=1] %bothcond80 = and i1 %tmp4389, %tmp4394 ; <i1> [#uses=1] %bothcond81 = and i1 %bothcond80, %tmp.0.i420 ; <i1> [#uses=1] br i1 %bothcond81, label %bb4398, label %bb4397 Here is the case from 252.eon: bb290.i.i: ; preds = %bb23.i57.i.i, %bb8.i39.i.i, %bb100.i.i, %bb100.i.i, %bb85.i.i110 %myEOF.1.i.i = phi i1 [ true, %bb100.i.i ], [ true, %bb100.i.i ], [ true, %bb85.i.i110 ], [ true, %bb8.i39.i.i ], [ false, %bb23.i57.i.i ] ; <i1> [#uses=2] %i.4.i.i = phi i32 [ %i.1.i.i, %bb85.i.i110 ], [ %i.0.i.i, %bb100.i.i ], [ %i.0.i.i, %bb100.i.i ], [ %i.3.i.i, %bb8.i39.i.i ], [ %i.3.i.i, %bb23.i57.i.i ] ; <i32> [#uses=3] %tmp292.i.i = load i8* %tmp16.i.i100, align 1 ; <i8> [#uses=1] %tmp293.not.i.i = icmp ne i8 %tmp292.i.i, 0 ; <i1> [#uses=1] %bothcond.i.i = and i1 %tmp293.not.i.i, %myEOF.1.i.i ; <i1> [#uses=1] br i1 %bothcond.i.i, label %bb202.i.i, label %bb301.i.i Factoring out 3 common predecessors. On the path from any blocks other than bb23.i57.i.i, the load and compare are dead. llvm-svn: 50096	2008-04-22 07:05:46 +00:00
Chris Lattner	3cc28ce1ed	add a basic testcase. llvm-svn: 50093	2008-04-22 06:35:14 +00:00
Chris Lattner	c3a439351c	optimize "p != gep p, ..." better. This allows us to compile getelementptr-seteq.ll into: define i1 @test(i64 %X, %S* %P) { %C = icmp eq i64 %X, -1 ; <i1> [#uses=1] ret i1 %C } instead of: define i1 @test(i64 %X, %S* %P) { %A.idx.mask = and i64 %X, 4611686018427387903 ; <i64> [#uses=1] %C = icmp eq i64 %A.idx.mask, 4611686018427387903 ; <i1> [#uses=1] ret i1 %C } And fixes the second half of PR2235. This speeds up the insertion sort case by 45%, from 1.12s to 0.77s. In practice, this will significantly speed up for loops structured like: for (double *P = Base + N; P != Base; --P) ... Which happens frequently for C++ iterators. llvm-svn: 50079	2008-04-22 02:53:33 +00:00
Owen Anderson	6a7355caa2	Refactor memcpyopt based on Chris' suggestions. Consolidate several functions and simplify code that was fallout from the separation of memcpyopt and gvn. llvm-svn: 50034	2008-04-21 07:45:10 +00:00
Chris Lattner	b839c05a05	rename .llx -> .ll, last batch. llvm-svn: 49971	2008-04-19 22:32:52 +00:00
Owen Anderson	81f7584c4e	XFAIL this test for the moment. The real solution is to prevent ADCE from transforming loops and adding a separate loop pass for removing loops with know trip counts. Until that happens, ADCE is miscompiling this code. llvm-svn: 49769	2008-04-16 04:25:42 +00:00
Owen Anderson	90bde997b3	Add testcase for PR2213. llvm-svn: 49517	2008-04-11 05:13:32 +00:00
Dan Gohman	99b7b3f03b	Teach InstCombine's ComputeMaskedBits to handle pointer expressions in addition to integer expressions. Rewrite GetOrEnforceKnownAlignment as a ComputeMaskedBits problem, moving all of its special alignment knowledge to ComputeMaskedBits as low-zero-bits knowledge. Also, teach ComputeMaskedBits a few basic things about Mul and PHI instructions. This improves ComputeMaskedBits-based simplifications in a few cases, but more noticeably it significantly improves instcombine's alignment detection for loads, stores, and memory intrinsics. llvm-svn: 49492	2008-04-10 18:43:06 +00:00
Chris Lattner	802134fc02	Generalize getUnaryFloatFunction to handle any FP unary function, automatically figuring out the suffix to use. implement pow(2,x) -> exp2(x). llvm-svn: 49437	2008-04-09 17:48:11 +00:00
Chris Lattner	091afc7714	remove capital letter from test name. llvm-svn: 49436	2008-04-09 17:46:36 +00:00
Owen Anderson	ef9a6fd5c2	Factor a bunch of functionality related to memcpy and memset transforms out of GVN and into its own pass. llvm-svn: 49419	2008-04-09 08:23:16 +00:00
Chris Lattner	b859fb49ed	many cleanups to the pow optimizer. Allow it to handle powf, add support for pow(x, 2.0) -> x*x. llvm-svn: 49411	2008-04-09 00:07:45 +00:00
Gabor Greif	00fcdeddd3	merge r48768 from branches/ggreif/parallelized-test llvm-svn: 49382	2008-04-08 15:22:41 +00:00
Chris Lattner	28e7b57605	add a testcase for forming memset from noncontiguous stores. llvm-svn: 48938	2008-03-29 04:51:35 +00:00
Evan Cheng	2b72c05992	Handle a special case xor undef, undef -> 0. Technically this should be transformed to undef. But this is such a common idiom (misuse) we are going to handle it. llvm-svn: 48791	2008-03-25 20:07:13 +00:00
Tanya Lattner	8bf97c2324	Byebye llvm-upgrade! llvm-svn: 48762	2008-03-25 04:26:08 +00:00
Devang Patel	a38f58aa5c	Add incoming value from header only if phi node has any use inside the loop. llvm-svn: 48738	2008-03-24 20:16:14 +00:00
Chris Lattner	c2c0c8303c	apparently tclsh doesn't lex like bash. Weird. llvm-svn: 48732	2008-03-24 17:41:57 +00:00
Chris Lattner	9ca6bb4f16	pass the option so this test tests the right thing. llvm-svn: 48731	2008-03-24 17:36:38 +00:00
Evan Cheng	c3cf9f872a	Transform (zext (or (icmp), (icmp))) to (or (zext (cimp), (zext icmp))) if at least one of the (zext icmp) can be transformed to eliminate an icmp. llvm-svn: 48715	2008-03-24 00:21:34 +00:00
Owen Anderson	e3605ac108	Use normal naming convention for test. llvm-svn: 48693	2008-03-22 21:08:33 +00:00
Chris Lattner	53ccb62712	implement an initial hack at a straight-line store -> memset optimization. This fires dozens of times across spec and multisource, but I don't know if it actually speeds stuff up. Hopefully the testers will show something nice :) llvm-svn: 48680	2008-03-22 05:37:16 +00:00
Chris Lattner	c44160ce6e	Teach masked value is zero about add and sub, and use MVIZ to simplify things like (X & 4) >> 1 == 2 --> (X & 4) == 4. since it is obvious that the shift doesn't remove any bits. llvm-svn: 48631	2008-03-21 05:19:58 +00:00
Tanya Lattner	ab7872c06c	Upgrade tests. llvm-svn: 48538	2008-03-19 07:28:33 +00:00
Tanya Lattner	f9d25185d5	Upgrade tests. llvm-svn: 48536	2008-03-19 05:39:35 +00:00
Tanya Lattner	0ea4c8d706	Upgrade tests to not use llvm-upgrade. llvm-svn: 48530	2008-03-19 04:36:04 +00:00
Tanya Lattner	1d526b90aa	Upgrade tests to not use llvm-upgrade. llvm-svn: 48529	2008-03-19 04:14:49 +00:00
Tanya Lattner	f73582b17c	Remove llvm-upgrade and update tests. llvm-svn: 48527	2008-03-19 03:47:13 +00:00
Tanya Lattner	4e59897d3d	Upgrade tests to not use llvm-upgrade. llvm-svn: 48484	2008-03-18 04:14:37 +00:00
Tanya Lattner	baa370b37a	Upgrade tests to not use llvm-upgrade. llvm-svn: 48483	2008-03-18 03:45:45 +00:00
Bill Wendling	68a930b33e	The inst combining of inttoptr into GEP with one index was using the bit size of the type instead of the byte size. This was causing troublesome mis-compilations. True to form, this took 2 days to find and is a one-line fix. :-P llvm-svn: 48354	2008-03-14 05:12:19 +00:00
Owen Anderson	7a69e3aef3	Fix a bug in GVN that Duncan noticed, where we potentially need to insert a pointer bitcast when performing return slot optimization. llvm-svn: 48343	2008-03-13 22:07:10 +00:00
Owen Anderson	6ff0b822b4	Improve the return slot optimization to be both more aggressive (not limited to sret parameters), and safer (when the passed pointer might be invalid). Thanks to Duncan and Chris for the idea behind this, and extra thanks to Duncan for helping me work out the trap-safety. llvm-svn: 48280	2008-03-12 07:37:44 +00:00
Devang Patel	fa8667a2dd	Fix attribute handling. llvm-svn: 48262	2008-03-12 00:07:03 +00:00
Devang Patel	7358165c99	Handle multiple ret values. llvm-svn: 48254	2008-03-11 22:24:29 +00:00
Dan Gohman	20af5a0fe7	Check to see if a two-entry PHI block can be simplified before trying to merge the block into its predecessors. This allows two-entry-phi-return.ll to be simplified into a single basic block. llvm-svn: 48252	2008-03-11 21:53:06 +00:00
Dan Gohman	8e9ae96a4a	Make this test more challenging to help it avoid being optimized away before it tests what it is intended to test. llvm-svn: 48251	2008-03-11 21:47:57 +00:00
Devang Patel	a7a2075ab8	Initial multiple return values support. llvm-svn: 48210	2008-03-11 05:46:42 +00:00
Dan Gohman	319234d67c	Upgrade this test. llvm-svn: 48207	2008-03-11 02:19:59 +00:00
Devang Patel	741f491d90	Simplify llvm-svn: 48163	2008-03-10 18:38:30 +00:00
Tanya Lattner	5f4b355f20	Remove llvm-upgrade and update tests. llvm-svn: 48137	2008-03-10 07:21:50 +00:00
Nick Lewycky	fb2c1a999a	Turn unwind_to into "unwinds to". llvm-svn: 48123	2008-03-10 02:20:00 +00:00
Tanya Lattner	aa6f5c9ddd	Remove llvm-upgrade and update tests. llvm-svn: 48103	2008-03-09 08:16:40 +00:00
Nick Lewycky	42445be0df	Firstly, having a BranchInst isn't exclusive with having an unwind_to. Secondly, we have to check whether the branch is actually pointing to the block with the unwind in it. We could have gotten here because of the unwind_to alone. llvm-svn: 48099	2008-03-09 07:50:37 +00:00
Nick Lewycky	f3d637fa14	A BB that unwind_to an "unwind" inst is that same as one that doesn't unwind_to at all. llvm-svn: 48096	2008-03-09 07:36:38 +00:00
Nick Lewycky	5ce9b521d7	Update the inliner and simplifycfg to handle unwind_to. llvm-svn: 48086	2008-03-09 05:10:13 +00:00
Nick Lewycky	4d0ed842b1	Prune the unwind_to labels on BBs that don't need them. Another step in the removal of invoke, PR1269. llvm-svn: 48084	2008-03-09 04:55:16 +00:00
Devang Patel	780b3ca64b	Update inliner to handle functions that return multiple values. llvm-svn: 48020	2008-03-07 20:06:16 +00:00
Devang Patel	47d774b2c8	Place for sret promotion tests. llvm-svn: 48016	2008-03-07 20:00:15 +00:00
Nick Lewycky	3e2d7c9f85	Commit the testcase too. llvm-svn: 47988	2008-03-06 06:50:03 +00:00
Nick Lewycky	d0b62a1552	Don't try to simplify urem and srem using arithmetic rules that don't work under modulo (overflow). Fixes PR1933. llvm-svn: 47987	2008-03-06 06:48:30 +00:00
Devang Patel	941ab37ea8	Use cast instead of dyn_cast. Update test to use multiple return value directly, instead of relying on -sretpromotion. llvm-svn: 47907	2008-03-04 21:45:28 +00:00
Devang Patel	841322b32a	Handle multiple return values. llvm-svn: 47904	2008-03-04 21:15:15 +00:00
Tanya Lattner	5640bd186a	Remove llvm-upgrade and update test cases. llvm-svn: 47793	2008-03-01 09:15:35 +00:00
Chris Lattner	c966cebe93	fix a bug Anders ran into where scalarrepl would crash when promoting a union containing a vector and an array whose elements were smaller than the vector elements. this means we need to compile the load of the array elements into an extract element plus a truncate. llvm-svn: 47752	2008-02-29 07:12:06 +00:00
Chris Lattner	c612571555	Folding or(fcmp,fcmp) only works if the operands of the fcmps are the same fp type. llvm-svn: 47750	2008-02-29 06:09:11 +00:00
Owen Anderson	e41c19c987	Add PR number to testcase. llvm-svn: 47640	2008-02-26 23:16:11 +00:00
Owen Anderson	d29ed0b122	Fix an issue where GVN had the sizes of the two memcpy's reverse, resulting in an invalid transformation. llvm-svn: 47639	2008-02-26 23:06:17 +00:00
Chris Lattner	a39cff3aaa	fix this test so that the fn name doesn't match the regex llvm-svn: 47608	2008-02-26 18:13:51 +00:00
Gabor Greif	3d9755f6ca	Really feed llvm-as with the testcase, do not let it read from stdin. This fixes the hangs seen on solaris10. llvm-svn: 47604	2008-02-26 13:37:13 +00:00
Owen Anderson	df1d2b02f9	Fix an issue where GVN was performing the return slot optimization when it was not safe. This is fixed by more aggressively checking that the return slot is not used elsewhere in the function. llvm-svn: 47544	2008-02-25 04:08:09 +00:00
Owen Anderson	40dca46ddb	Fix an issue where GVN would try to use an instruction before its definition when performing return slot optimization. llvm-svn: 47541	2008-02-25 00:40:41 +00:00
Zhou Sheng	aae582ba99	Testcase for Revision 47478. llvm-svn: 47531	2008-02-23 10:59:51 +00:00
Nick Lewycky	fefd0202c9	Correctly fold divide-by-constant, even when faced with overflow. llvm-svn: 47287	2008-02-18 22:48:05 +00:00
Chris Lattner	23fe6630e3	make this just a bit more strict. llvm-svn: 47274	2008-02-18 17:33:10 +00:00
Owen Anderson	3549553262	Add support to GVN for performing sret return slot optimization. This means that, if an sret function tail calls another sret function, it should pass its own sret parameter to the tail callee, allowing it to fill in the correct return value. llvm-gcc does not emit this by default. Instead, it allocates space in the caller for the sret of the tail call and then uses memcpy to copy the result into the caller's sret parameter. This optimization detects and optimizes that case. llvm-svn: 47265	2008-02-18 09:24:53 +00:00
Chris Lattner	024f8c8f09	optimize away stackrestore calls that have no intervening alloca or call. llvm-svn: 47258	2008-02-18 06:12:38 +00:00
Chris Lattner	c8ec470b52	upgrade this test. llvm-svn: 47257	2008-02-18 06:11:00 +00:00
Chris Lattner	cc22601bc3	Fold (-x + -y) -> -(x+y) which promotes better association, fixing the second half of PR2047 llvm-svn: 47244	2008-02-17 21:03:36 +00:00
Chris Lattner	a70d138457	Split up subtracts into add+negate if they have a reassociable use or operand that is also a subtract. This implements PR2047 and Transforms/Reassociate/subtest2.ll llvm-svn: 47241	2008-02-17 20:51:26 +00:00
Chris Lattner	2de8c2d41f	upgrade and simplify this test. llvm-svn: 47240	2008-02-17 20:48:43 +00:00
Duncan Sands	573b3f89e4	Remove any 'nest' parameter attributes if the function is not passed as an argument to a trampoline intrinsic. llvm-svn: 47220	2008-02-16 20:56:04 +00:00
Devang Patel	2e622e4c2b	If loop header is also loop exiting block then OrigPN is incoming value for B loop header. Fixes PR 2030. llvm-svn: 47141	2008-02-14 23:18:47 +00:00
Chris Lattner	70e294660a	Fix PR2029 llvm-svn: 47129	2008-02-14 19:18:13 +00:00
Nick Lewycky	9592bb0390	Testcase for PR2032. llvm-svn: 47113	2008-02-14 07:15:11 +00:00
Devang Patel	0ecb76d820	A loop latch phi node may have uses inside loop, not just in loop header. llvm-svn: 47093	2008-02-13 22:23:07 +00:00
Devang Patel	22c3caab6e	While moving exit condition, do not drop loop latch on the floor. llvm-svn: 47089	2008-02-13 22:06:36 +00:00
Devang Patel	c281d8031b	Keep track of exit value operand number when operands are swapped. llvm-svn: 47082	2008-02-13 19:48:48 +00:00
Eli Friedman	460648abde	Add a note pointing to PR1996. llvm-svn: 47055	2008-02-13 07:56:04 +00:00
Eli Friedman	03ec63f29d	Add test for PR1996. (This is my first time adding a test for a transform, so please review.) llvm-svn: 47050	2008-02-13 06:55:57 +00:00
Owen Anderson	00dba4f734	Re-apply the patch to improve the optimizations of memcpy's, with several bugs fixed. This now passes PPC bootstrap. llvm-svn: 47026	2008-02-12 21:15:18 +00:00
Devang Patel	26f75e2576	Fix PR 1995. llvm-svn: 46898	2008-02-08 22:49:13 +00:00
Bill Wendling	c676a0329c	Temporarily reverting: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20080128/057882.html This is causing a miscompilation on PPC G5 and just now seeing it on iMac x86-64. llvm-svn: 46822	2008-02-06 20:03:07 +00:00
Chris Lattner	682a7dc653	Fix a bug compiling PR1978 (perhaps not the only one though) which was incorrectly simplifying "x == (gep x, 1, i)" into false, even though i could be negative. As it turns out, all the code to handle this already existed, we just need to disable the incorrect optimization case and let the general case handle it. llvm-svn: 46739	2008-02-05 04:45:32 +00:00
Owen Anderson	1a78ae76e4	Make this test more aggressive, to cover recent improvements. llvm-svn: 46695	2008-02-04 04:55:24 +00:00
Owen Anderson	c4a7c41869	Allow GVN to hack on memcpy's, making them open to further optimization. llvm-svn: 46693	2008-02-04 02:59:58 +00:00
Nick Lewycky	56178bc6ad	Tag this test with the PR reference. llvm-svn: 46688	2008-02-03 16:35:19 +00:00
Nick Lewycky	3b59214320	There are some cases where icmp(add) can be folded into a new icmp. Handle them. llvm-svn: 46687	2008-02-03 16:33:09 +00:00
Duncan Sands	9aa789fda3	Don't drop function/call return attributes like 'nounwind'. llvm-svn: 46645	2008-02-01 20:37:16 +00:00
Owen Anderson	4e4b116750	Make DSE much more aggressive by performing DCE earlier. Update a testcase to reflect this increased aggressiveness. llvm-svn: 46542	2008-01-30 01:24:47 +00:00
Chris Lattner	b9e5b8fb9e	Fix a bug where scalarrepl would discard offset if type would match. In practice this can only happen on code with already undefined behavior, but this is still a good thing to handle correctly. llvm-svn: 46539	2008-01-30 00:39:15 +00:00
Chris Lattner	ade0abb498	Don't let globalopt hack on volatile loads or stores. llvm-svn: 46523	2008-01-29 19:01:37 +00:00
Chris Lattner	17819d971e	eliminate additions of 0.0 when they are obviously dead. This has to be careful to avoid turning -0.0 + 0.0 -> -0.0 which is incorrect. llvm-svn: 46499	2008-01-29 06:52:45 +00:00
Owen Anderson	95bf1d4d7b	Add a testcase for eliminating memcpy's at the end of functions. Forgot to commit this with my last commit. llvm-svn: 46497	2008-01-29 06:40:32 +00:00
Devang Patel	67fa0521b6	Filter loops that subtract induction variables. These loops are not yet handled. Fix PR 1912. llvm-svn: 46484	2008-01-29 02:20:41 +00:00
Chris Lattner	a116071547	this test is now compiled into the right thing. llvm-svn: 46454	2008-01-28 17:38:46 +00:00
Nick Lewycky	8ea81e8ba4	Handle some more combinations of extend and icmp. Fixes PR1940. llvm-svn: 46431	2008-01-28 03:48:02 +00:00
Chris Lattner	710b441174	Fix PR1932 by disabling an xform invalid for fdiv. llvm-svn: 46429	2008-01-28 00:58:18 +00:00
Chris Lattner	1b706dd680	Fix PR1938 by forcing the code that uses an undefined value to branch one way or the other. Rewriting the code itself prevents subsequent analysis passes from making contradictory conclusions about the code that could cause an infeasible path to be made feasible. llvm-svn: 46427	2008-01-28 00:32:30 +00:00
Nick Lewycky	efb16f7057	Be more careful modifying the use_list while also iterating through it. llvm-svn: 46417	2008-01-27 18:35:00 +00:00
Duncan Sands	053c9871cd	Revert r46393: readonly/readnone functions are no longer allowed to write through byval arguments. llvm-svn: 46416	2008-01-27 18:12:58 +00:00
Bill Wendling	8c491162d2	The CorrelatedExpressions pass is now no more. llvm-svn: 46409	2008-01-27 06:13:32 +00:00
Chris Lattner	fa1e7eef30	Fold fptrunc(add (fpextend x), (fpextend y)) -> add(x,y), as GCC does. llvm-svn: 46406	2008-01-27 05:29:54 +00:00
Duncan Sands	dc157a4f0a	Invert this test, because it is wrong if we allow readonly functions to use byval parameters as local storage (how much do we want this?). llvm-svn: 46399	2008-01-26 12:33:01 +00:00
Owen Anderson	6af19fd1e2	DeadStoreElimination can treat byval parameters as if there were alloca's for the purpose of removing end-of-function stores. llvm-svn: 46351	2008-01-25 10:10:33 +00:00
Nick Lewycky	78712e5b59	Multiply can be evaluated in a different type, so long as the target type has a smaller bitwidth. llvm-svn: 46244	2008-01-22 05:08:48 +00:00
Evan Cheng	9a93dc9565	Test case for varargs parameter attribute issue I just fixed. llvm-svn: 46127	2008-01-17 07:26:31 +00:00
Chris Lattner	5630c4f217	Fix arg promotion to propagate the correct attrs on the calls to promoted functions. This is important for varargs calls in particular. Thanks to duncan for providing a great testcase. llvm-svn: 46108	2008-01-17 01:17:03 +00:00
Devang Patel	b3696e4f14	Do not strip llvm.used values. llvm-svn: 46045	2008-01-16 03:33:05 +00:00
Chris Lattner	f3e1155c41	add a test to ensure that argpromote of one argument doesn't break the byval attr on some other argument. llvm-svn: 46025	2008-01-15 22:38:12 +00:00
Duncan Sands	b5ca2e9fcb	I noticed that the trampoline straightening transformation could drop attributes on varargs call arguments. Also, it could generate invalid IR if the transformed call already had the 'nest' attribute somewhere (this can never happen for code coming from llvm-gcc, but it's a theoretical possibility). Fix both problems. llvm-svn: 45973	2008-01-14 19:52:09 +00:00
Chris Lattner	26fe7ebc03	Fix the miscompilation of MiBench/consumer-lame that was exposed by Evan's byval work. This miscompilation is due to the program indexing an array out of range and us doing a transformation that broke this. llvm-svn: 45949	2008-01-14 02:09:12 +00:00
Chris Lattner	92bd785323	Turn a memcpy from a double* into a load/store of double instead of a load/store of i64. The later prevents promotion/scalarrepl of the source and dest in many cases. This fixes the 300% performance regression of the byval stuff on stepanov_v1p2. llvm-svn: 45945	2008-01-14 00:28:35 +00:00
Chris Lattner	5bc253c8f2	Fix PR1907, a nasty miscompilation because instcombine didn't realize that ne & sgt was a signed comparison (it was only looking at whether the left compare was signed). llvm-svn: 45937	2008-01-13 20:59:02 +00:00
Duncan Sands	781f6549db	When turning a call to a bitcast function into a direct call, if this becomes a varargs call then deal correctly with any parameter attributes on the newly vararg call arguments. llvm-svn: 45931	2008-01-13 08:02:44 +00:00
Chris Lattner	4f6c81ac68	we don't have to make an explicit copy of a byval argument when inlining a function if we know that the function does not write to any memory. This implements test/Transforms/Inline/byval2.ll llvm-svn: 45912	2008-01-12 18:54:29 +00:00
Duncan Sands	5b721fc21d	When DAE drops the varargs part of a function, ensure any attributes on the vararg call arguments are also dropped. llvm-svn: 45892	2008-01-11 23:13:45 +00:00
Chris Lattner	b5bd924e83	Teach argpromote to ruthlessly hack small byval structs when it can get away with it, which exposes opportunities to eliminate the memory objects entirely. For example, we now compile byval.ll to: define internal void @f1(i32 %b.0, i64 %b.1) { entry: %tmp2 = add i32 %b.0, 1 ; <i32> [#uses=0] ret void } define i32 @main() nounwind { entry: call void @f1( i32 1, i64 2 ) ret i32 0 } This seems like it would trigger a lot for code that passes around small structs (e.g. SDOperand's or _Complex)... llvm-svn: 45886	2008-01-11 22:31:41 +00:00
Chris Lattner	908117bf69	When inlining a functino with a byval argument, make an explicit copy of it in case the callee modifies the struct. llvm-svn: 45853	2008-01-11 06:09:30 +00:00
Chris Lattner	2940c5c56d	Implement PR1795, an instcombine hack for forming GEPs with integer pointer arithmetic. llvm-svn: 45745	2008-01-08 07:23:51 +00:00
Duncan Sands	404eb05247	The transform that tries to turn calls to bitcast functions into direct calls bails out unless caller and callee have essentially equivalent parameter attributes. This is illogical - the callee's attributes should be of no relevance here. Rework the logic, which incidentally fixes a crash when removed arguments have attributes. llvm-svn: 45658	2008-01-06 18:27:01 +00:00
Duncan Sands	55e5090fe8	When transforming a call to a bitcast function into a direct call with cast parameters and cast return value (if any), instcombine was prepared to cast any non-void return value into any other, whether castable or not. Add a new predicate for testing whether casting is valid, and check it both for the return value and (as a cleanup) for the parameters. llvm-svn: 45657	2008-01-06 10:12:28 +00:00
Chris Lattner	e666bc272d	remove a couple more unsafe xforms in the face of overflow. llvm-svn: 45613	2008-01-05 01:22:42 +00:00
Chris Lattner	bdd6acfb59	Fix PR1896 llvm-svn: 45568	2008-01-04 05:04:53 +00:00
Chris Lattner	f391883670	don't hoist FP additions into unconditional adds + selects. This could theoretically introduce a trap, but is also a performance issue. This speeds up ptrdist/ks by 8%. llvm-svn: 45533	2008-01-03 07:25:26 +00:00
Bill Wendling	6f8c9a8372	Update this testcase. The output needs to be disabled to pass. llvm-svn: 45478	2008-01-01 01:34:36 +00:00
Chris Lattner	e96658392d	dead calls to llvm.stacksave can be deleted, even though they have potential side-effects. llvm-svn: 45392	2007-12-29 00:59:12 +00:00
Chris Lattner	bc03f70a07	upgrade this test llvm-svn: 45391	2007-12-29 00:57:06 +00:00
Devang Patel	b57ff068cd	Test -simplifycfg only. llvm-svn: 45389	2007-12-28 22:59:48 +00:00
Owen Anderson	3de3f9981e	Add a testcase for my recent InstCombine fix, written by Nicholas. llvm-svn: 45386	2007-12-28 21:08:43 +00:00
Chris Lattner	74b2ab59fd	implement InstCombine/shift-trunc-shift.ll. This allows us to compile: #include <math.h> int t1(double d) { return signbit(d); } into: _t1: movd %xmm0, %rax shrq $63, %rax ret instead of: _t1: movd %xmm0, %rax shrq $32, %rax shrl $31, %eax ret on x86-64. llvm-svn: 45311	2007-12-22 09:07:47 +00:00
Devang Patel	7a2c66b11e	If succ has succ itself as one of the predecessors then do not merge current bb and succ even if bb's terminator is unconditional branch to succ. llvm-svn: 45305	2007-12-22 01:32:53 +00:00
Duncan Sands	6a7703ed63	Make DAE not wipe out attributes on calls, and not drop return attributes on the floor. In the case of a call to a varargs function where the varargs arguments are being removed, any call attributes on those arguments need to be dropped. I didn't do this because I plan to make it illegal to have such attributes (see next patch). With this change, compiling the gcc filter2 eh test at -O0 and then running opt -std-compile-opts on it results in a correctly working program (compiling at -O1 or higher results in the test failing due to a problem with how we output eh info into the IR). llvm-svn: 45285	2007-12-21 19:16:16 +00:00
Christopher Lamb	7d82bc46b8	Implement review feedback, including additional transforms (icmp slt (sub A B) 1) -> (icmp sle A B) icmp sgt (sub A B) -1) -> (icmp sge A B) and add testcase. llvm-svn: 45256	2007-12-20 07:21:11 +00:00
Duncan Sands	aa31b92508	When inlining through an 'nounwind' call, mark inlined calls 'nounwind'. It is important for correct C++ exception handling that nounwind markings do not get lost, so this transformation is actually needed for correctness. llvm-svn: 45218	2007-12-19 21:13:37 +00:00
Christopher Lamb	74dbad9216	Remove an orthogonal transformation of the selection condition from my most recent submission. llvm-svn: 45169	2007-12-18 20:30:28 +00:00
Christopher Lamb	30291f4a30	Fix typos. llvm-svn: 45159	2007-12-18 09:45:40 +00:00
Christopher Lamb	8b09a464b4	Fold certain additions through selects (and their compares) so as to eliminate subtractions. This code is often produced by the SMAX expansion in SCEV. This implements test/Transforms/InstCombine/2007-12-18-AddSelCmpSub.ll llvm-svn: 45158	2007-12-18 09:34:41 +00:00
Duncan Sands	b5a79d0eaa	Make invokes of inline asm legal. Teach codegen how to lower them (with no attempt made to be efficient, since they should only occur for unoptimized code). llvm-svn: 45108	2007-12-17 18:08:19 +00:00
Duncan Sands	8e4847ee95	Make instcombine promote inline asm calls to 'nounwind' calls. Remove special casing of inline asm from the inliner. There is a potential problem: the verifier rejects invokes of inline asm (not sure why). If an asm call is not marked "nounwind" in some .ll, and instcombine is not run, but the inliner is run, then an illegal module will be created. This is bad but I'm not sure what the best approach is. I'm tempted to remove the check in the verifier... llvm-svn: 45073	2007-12-16 15:51:49 +00:00
Wojciech Matyjewicz	309e5a723b	1. "Upgrage" comments. 2. Using zero-extended value of Scale and unsigned division is safe provided that Scale doesn't have the sign bit set. Previously these 2 instructions: %p = bitcast [100 x {i8,i8,i8}]* %x to i8* %q = getelementptr i8* %p, i32 -4 were combined into: %q = getelementptr [100 x { i8, i8, i8 }]* %x, i32 0, i32 1431655764, i32 0 what was incorrect. llvm-svn: 44936	2007-12-12 15:21:32 +00:00
Chris Lattner	6a6b3fb62b	Implement constant folding if vector<->vector bitcasts where the number of source/dest elements changes. This implements test/Transforms/InstCombine/bitcast-vector-fold.ll llvm-svn: 44855	2007-12-11 07:29:44 +00:00
Chris Lattner	d2265b45ae	Fix PR1850 by removing an unsafe transformation from VMCore/ConstantFold.cpp. Reimplement the xform in Analysis/ConstantFolding.cpp where we can use targetdata to validate that it is safe. While I'm in there, fix some const correctness issues and generalize the interface to the "operand folder". llvm-svn: 44817	2007-12-10 22:53:04 +00:00
Duncan Sands	9f76be61d1	Make PruneEH update the nounwind/noreturn attributes on functions as it calculates them. llvm-svn: 44802	2007-12-10 19:09:40 +00:00
Devang Patel	bd75910fa7	If ExitValue operand is also defined in Loop header then insert new ExitValue after this operand definition. This fixes PR1828. llvm-svn: 44539	2007-12-03 19:17:21 +00:00
Duncan Sands	5208d1ab4a	Add some convenience methods for querying attributes, and use them. llvm-svn: 44403	2007-11-28 17:07:01 +00:00
Duncan Sands	ad0ea2d430	Fix PR1146: parameter attributes are longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Nick Lewycky	cdb7e54ca7	Add new SCEV, SCEVSMax. This allows LLVM to analyze do-while loops. llvm-svn: 44319	2007-11-25 22:41:31 +00:00
Chris Lattner	c00e8adfe0	Implement PR1822 llvm-svn: 44318	2007-11-25 21:27:53 +00:00
Duncan Sands	185eeac0f8	Fix PR1816. If a bitcast of a function only exists because of a trivial difference in function attributes, allow calls to it to be converted to direct calls. Based on a patch by Török Edwin. While there, move the various lists of mutually incompatible parameters etc out of the verifier and into ParameterAttributes.h. llvm-svn: 44315	2007-11-25 14:10:56 +00:00
Chris Lattner	893fe3bbd1	Fix PR1816, by correcting the broken definition of APInt::countTrailingZeros. llvm-svn: 44296	2007-11-23 22:42:31 +00:00
Duncan Sands	8a3e9d2bee	Ding dong, the DoesntAccessMemoryFns and OnlyReadsMemoryFns tables are dead! We get more, and more accurate, information from gcc via the readnone and readonly function attributes. llvm-svn: 44288	2007-11-23 19:30:27 +00:00
Chris Lattner	a8fbde3f78	Fix a bug where we'd try to find a scev value for a bitcast operand, even though the bitcast operand did not have integer type. This fixes PR1814. llvm-svn: 44286	2007-11-23 08:46:22 +00:00
Chris Lattner	1985d96dc9	Fix PR1817. llvm-svn: 44284	2007-11-22 23:47:13 +00:00
Duncan Sands	a915b538d3	Turn invokes of nounwind functions into ordinary calls. llvm-svn: 44280	2007-11-22 22:24:59 +00:00
Duncan Sands	1c97d752df	Readonly/readnone functions are allowed to throw exceptions, so don't turn invokes of them into calls. llvm-svn: 44278	2007-11-22 21:40:06 +00:00
Chris Lattner	c53b18362a	Fix PR1800 by correcting mistaken logic. llvm-svn: 44188	2007-11-16 06:04:17 +00:00
Chris Lattner	a77e74edba	Implement PR1796 and Transforms/SimplifyCFG/noreturn-call.ll by inserting unreachable after no-return calls. llvm-svn: 44099	2007-11-14 06:19:25 +00:00
Chris Lattner	f150ace6cb	upgrade test llvm-svn: 44067	2007-11-13 21:42:48 +00:00
Chris Lattner	61ce4dff7a	Implement PR1786 by iterating between dead cycle elimination and simplifycfg in the rare cases when it is needed. llvm-svn: 44044	2007-11-13 07:32:38 +00:00
Chris Lattner	f9c0fd7488	Tighten up a check for folding away loads from (newly constant) globals. This fixes a crash on Transforms/GlobalOpt/2007-11-09-GEP-GEP-Crash.ll and rdar://5585488. llvm-svn: 43949	2007-11-09 17:33:02 +00:00
Andrew Lenharth	19ca5c7021	Better check llvm-svn: 43897	2007-11-08 18:45:15 +00:00
Andrew Lenharth	8cf11aa330	Fix PR1780 llvm-svn: 43893	2007-11-08 17:39:28 +00:00
Chris Lattner	d8515f8e80	Implement PR1777 by detecting dependent phis that all compute the same value. llvm-svn: 43777	2007-11-06 21:52:06 +00:00
Dan Gohman	4decbc5002	Fix an abort in instcombine when folding creates a vector rem instruction. llvm-svn: 43743	2007-11-05 23:16:33 +00:00
Devang Patel	b98d2050a2	If a value is incoming from outside the loop then the value does not need remapping and the value is never tracked through LastValueMap. llvm-svn: 43728	2007-11-05 19:32:30 +00:00
Duncan Sands	399d97987b	Change uses of getTypeSize to getABITypeSize, getTypeStoreSize or getTypeSizeInBits as appropriate in ScalarReplAggregates. The right change to make was not always obvious, so it would be good to have an sroa guru review this. While there I noticed some bugs, and fixed them: (1) arrays of x86 long double have holes due to alignment padding, but this wasn't being spotted by HasStructPadding (renamed to HasPadding). The same goes for arrays of oddly sized ints. Vectors also suffer from this, in fact the problem for vectors is much worse because basic vector assumptions seem to be broken by vectors of type with alignment padding. I didn't try to fix any of these vector problems. (2) The code for extracting smaller integers from larger ones (in the "int union" case) was wrong on big-endian machines for integers with size not a multiple of 8, like i1. Probably this is impossible to hit via llvm-gcc, but I fixed it anyway while there and added a testcase. I also got rid of some trailing whitespace and changed a function name which had an obvious typo in it. llvm-svn: 43672	2007-11-04 14:43:57 +00:00
Owen Anderson	2ed651ace7	Fix test/Transforms/DeadStoreElimination/PartialStore.ll, which had been silently failing because of an incorrect run line for some time. llvm-svn: 43605	2007-11-01 05:29:16 +00:00
Chris Lattner	6ab19ed78d	Fix InstCombine/2007-10-31-StringCrash.ll by removing an obvious (in hindsight) infinite recursion. Simplify the code. llvm-svn: 43597	2007-11-01 02:30:35 +00:00
Chris Lattner	74709473ed	Fix InstCombine/2007-10-31-RangeCrash.ll llvm-svn: 43596	2007-11-01 02:18:41 +00:00
Dan Gohman	9f39660c20	Add support for folding binary operators with vector zero operands. llvm-svn: 43510	2007-10-30 19:00:49 +00:00
Chris Lattner	00860d7574	update testcase llvm-svn: 43452	2007-10-29 17:06:35 +00:00
Chris Lattner	c541c3ee15	Model stacksave and stackrestore as both writing memory, since we don't model their dependences on allocas correctly. This fixes PR1745. llvm-svn: 43442	2007-10-29 05:47:52 +00:00
Chris Lattner	9a641510bd	Fix PR1749 and InstCombine/2007-10-28-EmptyField.ll by handling zero-length fields better. llvm-svn: 43427	2007-10-29 02:40:02 +00:00
Chris Lattner	4a15e04aee	Fix PR1752 and LoopSimplify/2007-10-28-InvokeCrash.ll: terminators can have uses too. Wouldn't it be nice if invoke didn't exist? :) llvm-svn: 43426	2007-10-29 02:30:37 +00:00
Chris Lattner	c62877e9da	Implement a couple of foldings for ordered and unordered comparisons, implementing cases related to PR1738. llvm-svn: 43289	2007-10-24 05:38:08 +00:00
Bill Wendling	ac5c93040f	Don't branch fold inline asm statements. llvm-svn: 43191	2007-10-19 21:09:55 +00:00
Devang Patel	c0ced49a14	This test now passes. llvm-svn: 43183	2007-10-19 17:11:01 +00:00
Chris Lattner	9715d9fb59	Fix PR1735 and Transforms/DeadArgElim/2007-10-18-VarargsReturn.ll by fixing some obviously broken code :( llvm-svn: 43141	2007-10-18 18:49:29 +00:00
Devang Patel	9497767458	XFAIL for now. llvm-svn: 43111	2007-10-18 00:48:43 +00:00
Devang Patel	b3dac3f5d9	Do not raise free() call that is called through invoke instruction. llvm-svn: 43083	2007-10-17 20:12:58 +00:00
Devang Patel	91ff13edcc	Apply "Instead of loading small c string constant, use integer constant directly" transformation while processing load instruction. llvm-svn: 43070	2007-10-17 07:24:40 +00:00
Chris Lattner	ad618f66e6	Fix a bug in my patch last night that broke InstCombine/2007-10-12-Crash.ll llvm-svn: 42920	2007-10-12 18:05:47 +00:00
Chris Lattner	3e99eb25ee	testcase for PR1728 llvm-svn: 42890	2007-10-12 05:29:53 +00:00
Devang Patel	899cc56612	Lower memcpy if it makes sense. llvm-svn: 42864	2007-10-11 17:21:57 +00:00
Devang Patel	a69f987b66	Fix bug in updating dominance frontier after loop unswitch when frontier includes basic blocks that are not inside loop. llvm-svn: 42654	2007-10-05 22:29:34 +00:00
Devang Patel	2a60ff1aeb	Relax unsafe use check. If there is one unconditional use inside the loop then it is safe to promote value even if there is another conditional use inside the loop. llvm-svn: 42493	2007-10-01 18:12:58 +00:00
Devang Patel	7bba386f72	Handle multiple induction variables. This fixes PR714. llvm-svn: 42309	2007-09-25 18:24:48 +00:00
Devang Patel	87d7e8ebcb	Add transformation to update loop interation space. Now, for (i=A; i<N; i++) { if (i < X && i > Y) do_something(); } is transformed into U=min(N,X); L=max(A,Y); for (i=L;i<U;i++) do_somethihg(); llvm-svn: 42299	2007-09-25 17:31:19 +00:00
Devang Patel	9e30e1a3be	Do not promote null values because it may be unsafe to do so. llvm-svn: 42270	2007-09-24 20:02:42 +00:00
Devang Patel	361e52f39c	Fix PR1692 llvm-svn: 42209	2007-09-21 21:18:19 +00:00
Duncan Sands	416b9f0410	Testcase for PR1678. llvm-svn: 42171	2007-09-20 18:56:24 +00:00
Nick Lewycky	e7be16a053	Excuse me. llvm-svn: 42158	2007-09-20 00:57:00 +00:00
Nick Lewycky	eae7e7d00b	Fix optimization. %x = sub %x, %y does not imply that %y is zero. llvm-svn: 42157	2007-09-20 00:48:36 +00:00
Devang Patel	464276f831	Avoid unsafe promotion. llvm-svn: 42149	2007-09-19 20:18:51 +00:00
Gabor Greif	49122edc98	rename test, it is obviously misspelled llvm-svn: 42108	2007-09-18 21:42:39 +00:00
Devang Patel	fcda998ab2	Fix PR1657 llvm-svn: 42075	2007-09-18 01:54:42 +00:00
Dan Gohman	2ac2652779	Instcombine x-((x/y)*y) into a remainder operator. llvm-svn: 42035	2007-09-17 17:31:57 +00:00
Chris Lattner	dd76f2f4ab	remove obsolete tests. llvm-svn: 41984	2007-09-15 17:38:04 +00:00
Duncan Sands	94580c7522	Test that a call to a trampoline is turned into a call to the underlying nested function. llvm-svn: 41846	2007-09-11 15:07:50 +00:00
Chris Lattner	6cf04f4952	remove obsolete testcase llvm-svn: 41820	2007-09-10 23:51:41 +00:00
Chris Lattner	52fe869374	Fix a buggy constant folding transformation when handling aliases. llvm-svn: 41818	2007-09-10 23:42:42 +00:00
Dale Johannesen	62a48cea56	Add missing llvm-dis. llvm-svn: 41813	2007-09-10 22:47:59 +00:00
Chris Lattner	c75cbe6473	Prevent tailcallelim from breaking "recursive" calls to builtins. llvm-svn: 41804	2007-09-10 20:58:55 +00:00
Devang Patel	f8ab0a9acc	Filter exit conditions which are not yet handled. llvm-svn: 41800	2007-09-10 18:33:42 +00:00
Chris Lattner	85a51e0060	Don't zap back to back volatile load/stores llvm-svn: 41759	2007-09-07 05:33:03 +00:00
Nick Lewycky	b0b066eaaa	When the two operands of an icmp are equal, there are five possible predicates that would make the icmp true. Fixes PR1637. llvm-svn: 41740	2007-09-06 01:10:22 +00:00
Dale Johannesen	6480cc6f8c	Change all floating constants that are not exactly representable to use hex format. llvm-svn: 41722	2007-09-05 17:50:36 +00:00
Anton Korobeynikov	24fb6b2f8c	Don't promote volatile loads/stores. This is needed (for example) to handle setjmp/longjmp properly. This fixes PR1520. llvm-svn: 41461	2007-08-26 21:43:30 +00:00
Devang Patel	c1ef32ef3d	Constant split values needs upper bound and lower bound check, just like any other split value. llvm-svn: 41389	2007-08-25 01:09:14 +00:00
Devang Patel	4e63e1f5b5	While calculating upper loop bound for first loop and lower loop bound for second loop, take care of edge cases. llvm-svn: 41387	2007-08-25 00:56:38 +00:00
Devang Patel	c2e2d15f45	Do not split loops rejected by processOneIterationLoop(). llvm-svn: 41194	2007-08-20 20:24:15 +00:00
Devang Patel	cec2ad95f4	Add loop index split tests. llvm-svn: 41146	2007-08-17 22:02:15 +00:00
Dan Gohman	ada7205b76	Convert tests using "grep -c ... \| grep ..." to use the count script. llvm-svn: 41100	2007-08-15 13:49:33 +00:00
Dan Gohman	f9dd170e36	Convert tests using "\| wc -l \| grep ..." to use the count script. llvm-svn: 41097	2007-08-15 13:36:28 +00:00
Chris Lattner	1399f64e3b	oops, forgot to commit this. llvm-svn: 41034	2007-08-12 16:55:14 +00:00
Chris Lattner	99c8ee2977	Transform a load from an undef/zero global into an undef/global even if we have complex pointer manipulation going on. This allows us to compile stuff like this: __m128i foo(__m128i x){ static const unsigned int c_0[4] = { 0, 0, 0, 0 }; __m128i v_Zero = _mm_loadu_si128((__m128i*)c_0); x = _mm_unpacklo_epi8(x, v_Zero); return x; } into: _foo: xorps %xmm1, %xmm1 punpcklbw %xmm1, %xmm0 ret llvm-svn: 41022	2007-08-11 18:48:48 +00:00
Chris Lattner	a8e4b4bc7b	when we see a unaligned load from an insufficiently aligned global or alloca, increase the alignment of the load, turning it into an aligned load. This allows us to compile: #include <xmmintrin.h> __m128i foo(__m128i x){ static const unsigned int c_0[4] = { 0, 0, 0, 0 }; __m128i v_Zero = _mm_loadu_si128((__m128i*)c_0); x = _mm_unpacklo_epi8(x, v_Zero); return x; } into: _foo: punpcklbw _c_0.5944, %xmm0 ret .data .lcomm _c_0.5944,16,4 # c_0.5944 instead of: _foo: movdqu _c_0.5944, %xmm1 punpcklbw %xmm1, %xmm0 ret .data .lcomm _c_0.5944,16,2 # c_0.5944 llvm-svn: 40971	2007-08-09 19:05:49 +00:00
Nick Lewycky	8052019a20	It's safe to fold not of fcmp. llvm-svn: 40870	2007-08-06 20:04:16 +00:00
Chandler Carruth	7132e00de7	This is the patch to provide clean intrinsic function overloading support in LLVM. It cleans up the intrinsic definitions and generally smooths the process for more complicated intrinsic writing. It will be used by the upcoming atomic intrinsics as well as vector and float intrinsics in the future. This also changes the syntax for llvm.bswap, llvm.part.set, llvm.part.select, and llvm.ct* intrinsics. They are automatically upgraded by both the LLVM ASM reader and the bitcode reader. The test cases have been updated, with special tests added to ensure the automatic upgrading is supported. llvm-svn: 40807	2007-08-04 01:51:18 +00:00
Chris Lattner	9ea0287e25	I don't have time to restore this functionality right now. llvm-svn: 40743	2007-08-02 17:43:39 +00:00
Chris Lattner	498137dbfc	Reduced testcase for PR1594 llvm-svn: 40740	2007-08-02 17:11:24 +00:00
Devang Patel	a882328e61	Update dominator info for the middle blocks created while spliting exit edge to preserve LCSSA. Fix dominance frontier update during loop unswitch. This fixes PR 1589, again llvm-svn: 40737	2007-08-02 15:25:57 +00:00
Chris Lattner	b0418fc607	Enhance instcombine to be more aggressive about folding casts of operations of casts. This implements InstCombine/zext-fold.ll llvm-svn: 40726	2007-08-02 06:11:14 +00:00
Chris Lattner	d7cb625a9e	Fix PR1575 and test/Transforms/CondProp/2007-08-01-InvalidRead.ll llvm-svn: 40720	2007-08-02 04:47:05 +00:00
Devang Patel	561b0c29a3	Update dominator info for the middle blocks created while spliting exit edge to preserve LCSSA. Fix dominance frontier update during loop unswitch. This fixes PR 1589. llvm-svn: 40695	2007-08-01 22:23:50 +00:00
Owen Anderson	4d34e40c6d	Forgot to update these files for the FastDSE changes. llvm-svn: 40674	2007-08-01 16:53:51 +00:00
Owen Anderson	10e52eddb3	Rename FastDSE to just DSE. llvm-svn: 40668	2007-08-01 06:36:51 +00:00
Owen Anderson	2464f4f048	Fix a failure I accidentally caused in my last commit by mishandling the removal of redundant phis. llvm-svn: 40650	2007-07-31 20:18:28 +00:00
Lauro Ramos Venancio	549e775e67	Fix a bug in GetKnownAlignment of packed structs. llvm-svn: 40649	2007-07-31 20:13:21 +00:00
Dan Gohman	54ec4bfa5f	Change the x86 assembly output to use tab characters to separate the mnemonics from their operands instead of single spaces. This makes the assembly output a little more consistent with various other compilers (f.e. GCC), and slightly easier to read. Also, update the regression tests accordingly. llvm-svn: 40648	2007-07-31 20:11:57 +00:00
Owen Anderson	d58fa6b09f	Fix a misoptimization in aha. llvm-svn: 40642	2007-07-31 17:43:14 +00:00
Devang Patel	dd34d91e1a	Bunch of tests to check loop passes. llvm-svn: 40629	2007-07-31 08:04:17 +00:00
Owen Anderson	d66e285b2e	Fix a bug caused by indiscriminantly asking for the dominators of a predecessor. llvm-svn: 40595	2007-07-30 16:57:08 +00:00
Owen Anderson	0f692f27a3	Fix a bug introduced in my last commit. llvm-svn: 40542	2007-07-26 18:57:04 +00:00
Owen Anderson	dbf23ccaa0	Fix a couple more bugs in the phi construction by pulling in code that does almost the same things from LCSSA. llvm-svn: 40540	2007-07-26 18:26:51 +00:00
Owen Anderson	3b8cc30a61	Fix what is _hopefully_ the last corner case for loops. llvm-svn: 40503	2007-07-25 23:54:42 +00:00
Owen Anderson	8707412593	My last commit was not correct for nested loops. Fix it, and add a testcase for it. llvm-svn: 40498	2007-07-25 22:19:40 +00:00
Owen Anderson	3c67004d47	Fix an infinite loop on 300.twolf. llvm-svn: 40497	2007-07-25 22:03:06 +00:00
Owen Anderson	9b796348bd	Fix a bug in non-local memdep that was causing an infinite loop on 175.vpr. llvm-svn: 40495	2007-07-25 21:26:36 +00:00
Owen Anderson	7bf26ee444	Fix a bug that was causing GVN to crash on 252.eon. llvm-svn: 40494	2007-07-25 21:13:41 +00:00
Owen Anderson	5e5599b7ce	Add basic support for performing whole-function RLE. Note: This has not yet been thoroughly tested. Use at your own risk. llvm-svn: 40489	2007-07-25 19:57:03 +00:00
Owen Anderson	ab6ec2eac2	Add a GVN pass, using the value numbering code I developed for GVNPRE and the load elimination code from RedundantLoadElimination. llvm-svn: 40469	2007-07-24 17:55:58 +00:00
Devang Patel	13b25df0e9	Unreachable block is not a root node in post dominator tree. llvm-svn: 40458	2007-07-24 01:02:25 +00:00
Owen Anderson	9baaaa52e6	Rename a lot of things to change FastDLE to RedundantLoadElimination. llvm-svn: 40457	2007-07-24 00:17:04 +00:00
Owen Anderson	0a75315d35	Add testcases for FastDLE. llvm-svn: 40449	2007-07-23 22:18:05 +00:00
Owen Anderson	59a6840d47	Move these tests to use FastDSE instead of old DSE. llvm-svn: 40444	2007-07-23 20:49:13 +00:00
Chris Lattner	7649abce46	This xform isn't safe, removing it. llvm-svn: 40378	2007-07-21 21:27:27 +00:00
Dan Gohman	e31a61eeca	Optimize alignment of loads and stores. llvm-svn: 40102	2007-07-20 16:34:21 +00:00
Reid Spencer	314e1cb7ee	For PR1553: Change the keywords for the zext and sext parameter attributes to be zeroext and signext so they don't conflict with the keywords for the instructions of the same name. This gets around the ambiguity. llvm-svn: 40069	2007-07-19 23:13:04 +00:00
Devang Patel	86dff8f8be	New test. llvm-svn: 40023	2007-07-18 23:47:02 +00:00
Chris Lattner	d8bdf53335	rename function to avoid llvm-upgrade warning llvm-svn: 39895	2007-07-16 04:09:00 +00:00
Chris Lattner	d4fef8dbca	Implement shift-simplify.ll:test[45]. First teach instcombine that sign bit checks only demand the sign bit, this allows simplify demanded bits to hack on expressions better. Second, teach instcombine that ashr is useless if only the sign bit is demanded. llvm-svn: 39880	2007-07-15 20:54:51 +00:00
Chris Lattner	06205d5567	Implement shift-simplify.ll:test3, turning: (X << 31) <s 0 --> (X&1) != 0 This happens dozens of times in the CFE. llvm-svn: 39879	2007-07-15 20:42:37 +00:00
Devang Patel	52c2dd4699	New test. llvm-svn: 39768	2007-07-11 23:54:25 +00:00
Owen Anderson	8b99e0ab20	Fix an error where ANTIC_OUT was ending up with more than one expression of the same value number. This fixes an infinite loop on 444.namd. llvm-svn: 37967	2007-07-07 20:13:57 +00:00
Owen Anderson	02e9698293	Fix a bunch of issues found in a testcase from 400.perlbench. llvm-svn: 37929	2007-07-05 23:11:26 +00:00
Owen Anderson	ca1a184fd8	Fix another bug, this time in PREing select instructions. llvm-svn: 37878	2007-07-04 22:33:23 +00:00
Owen Anderson	cd94fc982a	Fix a typo that was killing GVNPRE of select instructions. llvm-svn: 37871	2007-07-04 18:26:18 +00:00
Owen Anderson	664e260a9c	Fix an error in phi translation of GEPs that was causing failures. llvm-svn: 37868	2007-07-04 04:51:16 +00:00
Owen Anderson	2e4b6feac2	Add support for performing GVNPRE on GEP instructions. llvm-svn: 37862	2007-07-03 23:51:19 +00:00
Owen Anderson	59bd053fc5	Add support for performing GVNPRE on cast instructions, and add a testcase for this. llvm-svn: 37856	2007-07-03 18:37:08 +00:00
Zhou Sheng	bf61346b1c	Test case for recent patch for IndVarSimplify.cpp llvm-svn: 37838	2007-07-02 08:02:14 +00:00
John Criswell	2660cef6d7	Convert .cvsignore files llvm-svn: 37801	2007-06-29 16:35:07 +00:00
Owen Anderson	d630147c44	Add a test for performing GVNPRE on select instructions. llvm-svn: 37782	2007-06-28 23:50:31 +00:00
Owen Anderson	43790570ad	Add tests for performing GVNPRE on the three vector-specific instructions. llvm-svn: 37744	2007-06-27 04:06:32 +00:00
Chris Lattner	b2a9048dc4	new testcase, the inliner shouldn't inline this. llvm-svn: 37722	2007-06-25 21:49:53 +00:00
Owen Anderson	bfd1673366	Rename variables to expose the fact that this test is failing. llvm-svn: 37711	2007-06-24 08:17:41 +00:00
Chris Lattner	181ebd6f88	new testcase miscompiled by instcombine, reduced from perl llvm-svn: 37691	2007-06-21 18:09:25 +00:00
Owen Anderson	0b7c12be82	Testcase for instances where a constant only occurs as an operand to a phi node. llvm-svn: 37653	2007-06-19 05:55:01 +00:00
Owen Anderson	3552c9e845	Add a new testcase for memory corruption issues. llvm-svn: 37648	2007-06-19 05:41:22 +00:00
Owen Anderson	d2028a549f	Testcase where GVNPRE was getting confused by invoke instructions. llvm-svn: 37609	2007-06-16 00:25:10 +00:00
Owen Anderson	ad9743225e	Add a testcase where GVNPRE what getting confused by a loop. llvm-svn: 37594	2007-06-15 17:54:05 +00:00
Chris Lattner	9923af42cf	add vector versions of this test llvm-svn: 37588	2007-06-15 06:22:32 +00:00
Chris Lattner	a8de4cccd9	testcase for PR1510 llvm-svn: 37583	2007-06-15 05:57:20 +00:00
Owen Anderson	b1c82db828	Add a test where phi translation was producing a null result. llvm-svn: 37563	2007-06-12 22:42:35 +00:00
Owen Anderson	6d1df658c0	Testcase where GVNPRE crashes on functions with no exit nodes. llvm-svn: 37555	2007-06-12 16:56:00 +00:00
Owen Anderson	3c0c1376ea	Make the run line for this test correct. Thanks to Chris for spotting it. llvm-svn: 37552	2007-06-12 04:40:48 +00:00
Owen Anderson	ee35eab57f	Add a GVN-PRE basic regression test. llvm-svn: 37549	2007-06-12 00:49:33 +00:00
Tanya Lattner	cb90f1d881	Instruct the inliner to obey the noinline attribute. Add test case. llvm-svn: 37481	2007-06-06 21:59:26 +00:00
Lauro Ramos Venancio	be59acbfcc	Add a test for PR1499. llvm-svn: 37473	2007-06-06 17:10:02 +00:00
Nick Lewycky	3b70bb2778	new testcase for PR1487 llvm-svn: 37458	2007-06-06 04:11:21 +00:00
Chris Lattner	084c20fcc0	new testcase for PR1491 llvm-svn: 37422	2007-06-04 22:23:17 +00:00
Chris Lattner	ed24b3b2fa	Testcase for PR1421 llvm-svn: 37357	2007-05-30 06:10:46 +00:00
Chris Lattner	49a34fcca7	testcase for PR1446 llvm-svn: 37325	2007-05-24 18:42:47 +00:00
Chris Lattner	e44b6a6aaf	new testcase for PR1435 llvm-svn: 37304	2007-05-23 06:35:52 +00:00
Chris Lattner	faa31904e4	new testcase llvm-svn: 37255	2007-05-19 06:50:37 +00:00
Dan Gohman	6164a1b279	Add a testcase for unrolling loops with unknown tripcounts. llvm-svn: 37238	2007-05-18 19:59:23 +00:00
Devang Patel	f9ed308aba	New test. llvm-svn: 37184	2007-05-17 22:05:20 +00:00
Chris Lattner	120548e508	New testcase that crashes instcombine llvm-svn: 37056	2007-05-15 00:15:49 +00:00
Chris Lattner	3b4ae44057	this crashes globalopt llvm-svn: 37021	2007-05-13 21:28:25 +00:00
Chris Lattner	b6d85ad1e1	new testcase that crashes instcombine llvm-svn: 36983	2007-05-11 05:55:38 +00:00
Devang Patel	d385e3970f	Drop ModuleID from comment. llvm-svn: 36982	2007-05-11 00:45:58 +00:00
Devang Patel	00a08d129e	New test. llvm-svn: 36954	2007-05-09 08:19:24 +00:00
Devang Patel	519f2997cc	New test. llvm-svn: 36953	2007-05-09 08:08:46 +00:00
Chris Lattner	75d56499c6	add the & back. I'm not sure why bill removed it. llvm-svn: 36945	2007-05-08 20:08:06 +00:00
Bill Wendling	0c976b8762	Spare '&' in the RUN line. llvm-svn: 36933	2007-05-08 07:49:07 +00:00
Chris Lattner	e8f14fb4cd	this test is now in Target/README.txt llvm-svn: 36812	2007-05-05 22:44:27 +00:00
Chris Lattner	2601579ec9	remove an old xfailed test llvm-svn: 36810	2007-05-05 22:42:02 +00:00
Chris Lattner	3dde023021	un-xfail this. llvm-svn: 36808	2007-05-05 22:41:13 +00:00
Chris Lattner	df37169381	move these xfailed tests to lib/Target/README.txt llvm-svn: 36805	2007-05-05 22:28:33 +00:00
Chris Lattner	4dc9a76ad8	Move Mem2Reg/DifferingTypes.ll -> ScalarRepl/DifferingTypes.ll. -scalarrepl implements this xform. llvm-svn: 36804	2007-05-05 22:22:03 +00:00
Chris Lattner	c103b49f52	remvoe two tests that cee has never gotten right llvm-svn: 36803	2007-05-05 22:19:49 +00:00
Chris Lattner	580d824e5d	new testcase for PR1385 llvm-svn: 36783	2007-05-05 18:48:52 +00:00
Chris Lattner	8b332d32be	new testacse for PR1384 llvm-svn: 36774	2007-05-05 01:59:05 +00:00
Chris Lattner	193d2f09f0	update syntax llvm-svn: 36531	2007-04-28 06:03:12 +00:00
Chris Lattner	1df6c1c5b0	new testcase llvm-svn: 36520	2007-04-28 00:54:45 +00:00
Chris Lattner	7ebda6ba37	new testcase, should be able to eliminate the alloca and memcpy llvm-svn: 36428	2007-04-25 06:29:34 +00:00
Devang Patel	cbb4994f6b	New test. llvm-svn: 36379	2007-04-23 22:39:53 +00:00
Reid Spencer	4388f0b4fa	For PR1146: Make ParamAttrsList objects unique. You can no longer directly create or destroy them but instead must go through the ParamAttrsList::get() interface. llvm-svn: 36327	2007-04-22 05:46:44 +00:00
Reid Spencer	2bb29e778a	Add a .cvsignore file. llvm-svn: 36323	2007-04-21 21:53:04 +00:00
Devang Patel	74ede29a27	Add PR number for reference. llvm-svn: 36184	2007-04-16 23:52:37 +00:00
Devang Patel	369bec184b	New test case. llvm-svn: 36181	2007-04-16 23:02:22 +00:00
Reid Spencer	8c756a9ded	Fix this test from Duncan's experiment. llvm-svn: 36176	2007-04-16 21:57:14 +00:00
Chris Lattner	6d9b520091	fix incorrectly upgraded test llvm-svn: 36169	2007-04-16 21:24:14 +00:00
Reid Spencer	6e87ec4351	For PR1319: Remove && from the end of the lines to prevent tests from throwing run lines into the background. Also, clean up places where the same command is run multiple times by using a temporary file. llvm-svn: 36142	2007-04-16 17:36:08 +00:00
Reid Spencer	4dcf8bff4b	For PR1319: Fix syntax of tests to ensure grep pattern is properly quoted. llvm-svn: 36134	2007-04-16 15:31:49 +00:00
Reid Spencer	86f337eeda	For PR1319: Fix test syntax per new rules. llvm-svn: 36133	2007-04-16 15:15:52 +00:00
Reid Spencer	caaf8a1597	For PR1336: Un-XFAIL this since it now passes with fix to llvm-upgrade. llvm-svn: 36104	2007-04-16 02:57:47 +00:00
Reid Spencer	2875d2eadc	For PR1336: Correct this test case. It was passing a uint where a ubyte was expected. llvm-svn: 36101	2007-04-16 02:09:24 +00:00
Reid Spencer	f0cb944fd3	For PR1336: Un-XFAIL this now that its working. llvm-svn: 36100	2007-04-16 01:49:16 +00:00
Reid Spencer	6584cf60f2	For PR1336: XFAIL tests covered by the PR. These will be un-XFAILed as they are fixed. llvm-svn: 36093	2007-04-15 23:00:46 +00:00
Chris Lattner	d3fd9ecb2d	testcase for PR1335 llvm-svn: 36089	2007-04-15 21:37:53 +00:00
Reid Spencer	a551c041f9	For PR1319: Upgrade to use new Tcl exec based test harness. This exposes 3 bugs that were previously not being reported: test/Transforms/GlobalDCE/2002-08-17-FunctionDGE.ll test/Transforms/GlobalOpt/memset.ll test/Transforms/IndVarsSimplify/exit_value_tests.llx llvm-svn: 36065	2007-04-15 09:21:47 +00:00
Reid Spencer	951d8dc29f	For PR1319: Upgrade to use new Tcl exec based test harness. llvm-svn: 36062	2007-04-15 08:30:33 +00:00
Reid Spencer	8eba4c3d12	For PR1319: Convert to use the new Tcl expr based test harness. llvm-svn: 36061	2007-04-15 08:01:33 +00:00
Reid Spencer	ede8c3b92c	For PR1319: Make use of the END. facility on all files > 1K so that we aren't wasting CPU cycles searching for RUN: lines that we'll never find. llvm-svn: 36059	2007-04-15 07:38:21 +00:00
Reid Spencer	1114b48736	Fix this test in a slightly more obvious way. llvm-svn: 36058	2007-04-15 07:37:04 +00:00
Reid Spencer	8d9056c56c	For PR1319: Upgrade to use new Tcl exec based test harness llvm-svn: 36055	2007-04-15 06:53:51 +00:00
Reid Spencer	7b78d371a6	Use %prcontext, $prcontext is not resolving for some reason. llvm-svn: 36054	2007-04-15 06:52:45 +00:00
Reid Spencer	63fc5df7ba	PR1319: Upgrade tests to new Tcl exec based test harness requirements. llvm-svn: 36053	2007-04-15 06:51:14 +00:00
Zhou Sheng	8e58ad31ae	This test case is incorrect. Remove it. llvm-svn: 36048	2007-04-15 05:59:49 +00:00
Reid Spencer	ae81c8880a	For PR1319: Convert to new test system. This exposes IsDigit.ll as failing. llvm-svn: 36046	2007-04-15 05:16:38 +00:00
Reid Spencer	5374aeae70	For PR1319: Conver to new test system. llvm-svn: 36045	2007-04-15 05:03:58 +00:00
Reid Spencer	d025c07cce	Keep lines a reasonable length. llvm-svn: 36043	2007-04-15 04:54:53 +00:00
Jeff Cohen	610e1240ed	Patch supplied by gabor. llvm-svn: 36042	2007-04-15 03:09:23 +00:00
Chris Lattner	efcb4f6ed4	new testcase llvm-svn: 36039	2007-04-15 01:00:37 +00:00
Owen Anderson	0f6ccef96c	XFAIL this for now. llvm-svn: 36036	2007-04-14 23:57:41 +00:00
Reid Spencer	f70b8e5d79	Oops. A little aggressive on the name changes there. llvm-svn: 36029	2007-04-14 23:17:58 +00:00
Reid Spencer	bc7533c140	For PR1913: Convert to new test system. This exposes test/Transforms/ConstProp/calls.ll llvm-svn: 36027	2007-04-14 23:04:54 +00:00
Chris Lattner	45ab14d084	new testcase llvm-svn: 36024	2007-04-14 23:00:51 +00:00
Reid Spencer	cfe6e77e7b	For PR1319: Convert to new test system. llvm-svn: 36023	2007-04-14 22:54:01 +00:00
Reid Spencer	3fc53d6c53	Changes to fix problems with "make check". Apparently you can redefine functions and Tcl's just tickled with that. The fix is to give the "new" test system a different interface function name. llvm-svn: 36022	2007-04-14 22:51:29 +00:00
Chris Lattner	8a08819105	manually upgrade test. Add a new test2. I have no way to see if this works because of the tclification. :( llvm-svn: 36019	2007-04-14 22:27:33 +00:00
Reid Spencer	af1a99f3a7	This test should have been updated with llvm 1.7! llvm-svn: 36014	2007-04-14 20:21:37 +00:00
Reid Spencer	91948d4cad	For PR1319: Upgrade tests to work with new llvm.exp version of llvm_runtest. llvm-svn: 36013	2007-04-14 20:13:02 +00:00
Reid Spencer	be88bc4cb4	This test needs to use egrep. llvm-svn: 36012	2007-04-14 20:02:51 +00:00
Reid Spencer	2441c0ae3e	Fix a test test llvm.exp found. llvm-svn: 36006	2007-04-14 18:33:31 +00:00
Reid Spencer	7e4bde71c5	bool -> i1 (found by llvm.exp) llvm-svn: 36005	2007-04-14 18:30:06 +00:00
Reid Spencer	26f762270f	Fix a syntax error that llvm.exp found. llvm-svn: 36004	2007-04-14 18:28:16 +00:00
Reid Spencer	0c0fe0afa7	Fix an "already-upgraded" test that llvm.exp found. llvm-svn: 36003	2007-04-14 18:26:02 +00:00
Chris Lattner	ebf3cfccdb	new testcase llvm-svn: 35983	2007-04-14 01:17:38 +00:00
Chris Lattner	a930c3d4e4	testcase for PR1201 llvm-svn: 35980	2007-04-14 00:19:36 +00:00
Reid Spencer	9f9fe70b11	Add the SCCP regression tests for APInt expressions. These test cases turned up some regressions that have since been fixed. We don't want to loose the regression tests. Test cases by Guoling Han. llvm-svn: 35974	2007-04-13 22:33:10 +00:00
Reid Spencer	8e25101211	The "implementation" keyword is no more! llvm-svn: 35919	2007-04-11 20:06:03 +00:00
Reid Spencer	d029c7e666	Make the llvm-runtest function much more amenable by eliminating all the global variables that needed to be passed in. This makes it possible to add new global variables with only a couple changes (Makefile and llvm-dg.exp) instead of touching every single dg.exp file. llvm-svn: 35918	2007-04-11 19:56:59 +00:00
Chris Lattner	45ae13bb41	adjust test llvm-svn: 35907	2007-04-11 16:04:04 +00:00
Chris Lattner	81f14c63da	sext of compares. llvm-svn: 35892	2007-04-11 06:57:54 +00:00
Chris Lattner	764ec15b3f	new testcase llvm-svn: 35889	2007-04-11 06:52:24 +00:00
Devang Patel	d284fd1145	Add test case for PR 1154. llvm-svn: 35865	2007-04-10 16:57:08 +00:00
Chris Lattner	ec0020433b	new testcase llvm-svn: 35851	2007-04-09 23:51:49 +00:00
Devang Patel	4ccfdd7465	Add check for opt crash. llvm-svn: 35849	2007-04-09 23:40:15 +00:00
Devang Patel	5391501f05	Add Loop Rotate test cases. llvm-svn: 35838	2007-04-09 22:22:42 +00:00
Chris Lattner	e04c652f5d	new testcase for PR1304 llvm-svn: 35791	2007-04-09 01:37:35 +00:00
Chris Lattner	418bf4eb1c	new testcase for PR1286 llvm-svn: 35787	2007-04-09 01:10:13 +00:00
Chris Lattner	659ff4ca8d	this xform is correct, not an xfail llvm-svn: 35766	2007-04-08 08:02:39 +00:00
Chris Lattner	b79728b1ae	tweak this to test the right thing. llvm-svn: 35762	2007-04-08 07:52:40 +00:00
Chris Lattner	8ca3d48984	new testcase, should simplify down to a xor/and/xor sequence. llvm-svn: 35759	2007-04-08 07:45:36 +00:00
Chris Lattner	3dc477d5e3	testcase for PR1307 llvm-svn: 35705	2007-04-06 23:36:59 +00:00
Chris Lattner	992b451e33	new testcase, update old one. llvm-svn: 35699	2007-04-06 18:56:54 +00:00
Chris Lattner	cf1f986099	new testcase that crashes globalopt llvm-svn: 35688	2007-04-05 21:09:29 +00:00
Jeff Cohen	9da1cde86c	Any add is wrong, regardless of type. llvm-svn: 35671	2007-04-04 20:40:44 +00:00
Jeff Cohen	62c300a415	Get it right... llvm-svn: 35670	2007-04-04 20:35:31 +00:00
Dale Johannesen	9234629e60	Test for transformConstExprCastCall fix. llvm-svn: 35669	2007-04-04 19:18:16 +00:00
Jeff Cohen	6f98cd3710	Add new test. llvm-svn: 35664	2007-04-04 16:11:23 +00:00
Chris Lattner	d4594adf43	new testcase for PR1253 llvm-svn: 35611	2007-04-03 01:45:32 +00:00
Chris Lattner	a7152a90d1	fix this testcase so it passes llvm-svn: 35604	2007-04-02 20:46:28 +00:00
Chris Lattner	2d81c6d706	creative way to add one. llvm-svn: 35583	2007-04-02 05:35:08 +00:00
Reid Spencer	e51961b5ba	Fix illegal assembly syntax. llvm-svn: 35581	2007-04-02 03:24:47 +00:00
Reid Spencer	a3bc850712	Add a test case to make sure that constant folding of the bit counting intrinsics works. llvm-svn: 35577	2007-04-02 01:45:31 +00:00
Reid Spencer	a5f996bd27	Revert the name changes for llvm.bswap to allow (and test) llvm-upgrade of this intrinsic. llvm-svn: 35566	2007-04-02 00:51:15 +00:00
Reid Spencer	c3d87685ad	For PR1297: Update these test cases to use proper signatures for bswap which is now and overloaded intrinsic. Its name must be of the form llvm.bswap.i32.i32 since both the parameter and the result or of type "iAny". Also, the bit counting intrinsics changed to always return i32. llvm-svn: 35548	2007-04-01 07:36:28 +00:00
Chris Lattner	9729bdd8e3	New testcase llvm-svn: 35535	2007-04-01 05:34:53 +00:00
Reid Spencer	44259a29c0	Remove use of implementation keyword. llvm-svn: 35412	2007-03-28 02:38:26 +00:00
Chris Lattner	4ba1036a34	don't use 'not' when we can use a positive test llvm-svn: 35402	2007-03-28 01:43:43 +00:00
Reid Spencer	90bb12c2e7	new test case for PR1280 llvm-svn: 35401	2007-03-28 01:43:35 +00:00
Reid Spencer	94a8cb4b4e	For PR1280: Remove test cases for and/xor/add -> trunc/sext that use bit widths that the targets cannot code gen. llvm-svn: 35399	2007-03-28 01:35:28 +00:00
Chris Lattner	4776ebc195	new testcase llvm-svn: 35397	2007-03-28 01:31:33 +00:00
Reid Spencer	e01d0e8c39	Another test case for PR1271 where bad shift masks were generated. llvm-svn: 35372	2007-03-26 23:48:52 +00:00
Reid Spencer	d9fe01c7a4	Fix this test case to match output after a bug was fixed. llvm-svn: 35359	2007-03-26 18:04:38 +00:00
Duncan Sands	820ae03fda	Fix testsuite hang. llvm-svn: 35355	2007-03-26 10:59:13 +00:00
Reid Spencer	0bfa19eb13	Test case for PR1271 involving construction of a bad mask to replace a shift instruction. llvm-svn: 35349	2007-03-26 05:32:16 +00:00
Reid Spencer	726b0a7fa4	Add a test case for PR1271 (necessary, but not sufficient). llvm-svn: 35343	2007-03-25 21:30:41 +00:00
Chris Lattner	f323838c4c	new testcase llvm-svn: 35340	2007-03-25 20:42:40 +00:00
Reid Spencer	e3d00119e6	Remove the last vestiges of this directory. llvm-svn: 35309	2007-03-24 23:07:49 +00:00
Reid Spencer	562b715dd1	Add more test cases for APIntified InstCombine. llvm-svn: 35288	2007-03-23 21:57:47 +00:00
Reid Spencer	ea8b07ee6b	Add test case for testing InstCombine with arbitrary precision integer types. These tests mimic the integer test cases in the normal InstCombine test suite but use "strange" integer bit widths. Most tests written by Zhou Sheng, a few by me. llvm-svn: 35284	2007-03-23 20:48:34 +00:00
Reid Spencer	02b0b57101	Make this test actually match the generated code. llvm-svn: 35263	2007-03-22 02:53:05 +00:00
Reid Spencer	fa9925e263	Test case for PR1248 llvm-svn: 35251	2007-03-22 00:49:40 +00:00
Reid Spencer	09f4eb1098	Make this test a little simpler/faster. llvm-svn: 35193	2007-03-19 23:36:19 +00:00
Reid Spencer	eb0a221186	Add test case for PR1261, currently XFAILed. llvm-svn: 35192	2007-03-19 23:28:16 +00:00
Reid Spencer	7953b683fc	For PR1258: Revise numeric value references to accommodate collapsed type planes. llvm-svn: 35170	2007-03-19 18:27:35 +00:00
Chris Lattner	50fce05a21	add a testcase the resent patches fail on. llvm-svn: 35167	2007-03-19 18:25:48 +00:00
Chris Lattner	23dd31a3af	add PR# llvm-svn: 35151	2007-03-19 00:17:19 +00:00
Chris Lattner	dcd44dbbb0	add pr# llvm-svn: 35149	2007-03-19 00:15:43 +00:00
Chris Lattner	ee3c5d1b78	new testcase llvm-svn: 35148	2007-03-19 00:11:30 +00:00
Chris Lattner	2c0f36bc39	testcase for SROA with memset etc llvm-svn: 35147	2007-03-19 00:09:00 +00:00
Chris Lattner	1ada0693ab	new testcase llvm-svn: 35144	2007-03-18 22:50:57 +00:00
Nick Lewycky	17d20fd41e	Propagate ValueRanges across equality. Add some more micro-optimizations: x * 0 = 0, a - x = a --> x = 0. llvm-svn: 35138	2007-03-18 01:09:32 +00:00
Chris Lattner	091e75bbde	testcase for PR1244 llvm-svn: 35081	2007-03-13 14:25:35 +00:00
Anton Korobeynikov	8a6dc102d3	Use range tests in LowerSwitch, where possible llvm-svn: 35057	2007-03-10 16:46:28 +00:00
Reid Spencer	509acc186e	Add a test case for a particular udiv/select transform. llvm-svn: 34935	2007-03-05 22:51:08 +00:00
Chris Lattner	3a8b0c7607	new testcase llvm-svn: 34918	2007-03-05 00:01:38 +00:00
Chris Lattner	dd3f9f8c3b	New testcases for PR1179/PR1232. llvm-svn: 34895	2007-03-04 00:54:06 +00:00
Chris Lattner	05e93d7f05	new testcase: instcombine should remove all the casts. llvm-svn: 34869	2007-03-03 05:24:06 +00:00
Chris Lattner	c1991789c5	instcombine doesn't do CSE, simplify unrelated detail llvm-svn: 34867	2007-03-03 02:27:02 +00:00
Chris Lattner	49c505c6e6	new testcase. @foo should be marked fastcc by globalopt llvm-svn: 34607	2007-02-25 21:04:39 +00:00
Chris Lattner	3b7c905437	testcase for pr1215 llvm-svn: 34547	2007-02-24 01:16:39 +00:00
Chris Lattner	3fec2056a4	testcase for pr1217 llvm-svn: 34545	2007-02-24 01:03:11 +00:00
Chris Lattner	83908e664f	fix this testcase llvm-svn: 34530	2007-02-23 19:39:24 +00:00
Andrew Lenharth	f7a5332b53	missed cast elimination llvm-svn: 34490	2007-02-22 15:17:45 +00:00
Reid Spencer	d84d35ba70	For PR1195: Rename PackedType -> VectorType, ConstantPacked -> ConstantVector, and PackedTyID -> VectorTyID. No functional changes. llvm-svn: 34293	2007-02-15 02:26:10 +00:00
Chris Lattner	682918f99b	update to new t-d strings. llvm-svn: 34290	2007-02-15 00:54:16 +00:00
Andrew Lenharth	15a3af28d7	This really only affects pointers in high memory, and only llvm 1.9, but make a regression for it anyway llvm-svn: 34014	2007-02-07 22:23:47 +00:00
Chris Lattner	dd602bcd08	Testcase for a bug responsible for GCC bootstrap failure, fallout from PR411. llvm-svn: 34004	2007-02-07 19:28:52 +00:00
Chris Lattner	ecb38495af	Testcase for miscompilation llvm-svn: 33947	2007-02-06 02:22:37 +00:00
Reid Spencer	3aaaa0b2bd	For PR411: This patch replaces the SymbolTable class with ValueSymbolTable which does not support types planes. This means that all symbol names in LLVM must now be unique. The patch addresses the necessary changes to deal with this and removes code no longer needed as a result. This completes the bulk of the changes for this PR. Some cleanup patches will follow. llvm-svn: 33918	2007-02-05 20:47:22 +00:00
Reid Spencer	cb4d3f2902	Prepare for PR411 llvm-svn: 33865	2007-02-04 02:11:13 +00:00
Reid Spencer	8de97bba5a	For PR1072: Removing -raise has neglible positive or negative side effects so we are opting to remove it. See the PR for comparison details. llvm-svn: 33844	2007-02-03 23:15:56 +00:00
Reid Spencer	2341c22ec7	Changes to support making the shift instructions be true BinaryOperators. This feature is needed in order to support shifts of more than 255 bits on large integer types. This changes the syntax for llvm assembly to make shl, ashr and lshr instructions look like a binary operator: shl i32 %X, 1 instead of shl i32 %X, i8 1 Additionally, this should help a few passes perform additional optimizations. llvm-svn: 33776	2007-02-02 02:16:23 +00:00
Chris Lattner	9a677c585c	new testcase for serious code pessimization llvm-svn: 33770	2007-02-01 22:29:26 +00:00
Reid Spencer	af6a408117	For PR411: Update these tests to not use the same name even though the type of the value differs. After PR411 hits, type planes will be gone and it will be illegal for a name to be used twice, regardless of type. llvm-svn: 33660	2007-01-30 16:16:01 +00:00
Chris Lattner	d50698107e	Testcase for an instcombine miscompilation reduced by Anton. llvm-svn: 33590	2007-01-27 23:07:12 +00:00
Reid Spencer	ce380568b5	For PR761: Remove "target endian/pointersize" or add "target datalayout" to make the test parse properly or set the datalayout because defaults changes. For PR645: Make global names use the @ prefix. For llvm-upgrade changes: Fix test cases or completely remove use of llvm-upgrade for test cases that cannot survive the new renaming or upgrade capabilities. llvm-svn: 33533	2007-01-26 08:25:06 +00:00
Chris Lattner	c8dc67c2da	remove an execution test from llvm/test llvm-svn: 33344	2007-01-18 22:24:04 +00:00
Chris Lattner	bb4e2a547f	new testcase that causes instcombine to infinitely loop llvm-svn: 33342	2007-01-18 22:16:03 +00:00
Reid Spencer	83b3d82672	Regression is gone, don't try to find it on clean target. llvm-svn: 33296	2007-01-17 07:59:14 +00:00

... 166 167 168 169 170 ...

9171 Commits