llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	9449991c4f	Handle single-entry phi nodes gracefully in condprop. llvm-svn: 62985	2009-01-26 02:18:20 +00:00
Chris Lattner	7b6647c178	Fix PR3408 by making a non-obvious assumption very obvious, and handling the flaw inherent in that assumption. :) llvm-svn: 62984	2009-01-26 02:11:30 +00:00
Chris Lattner	57cb472b56	More cleanups and simplifications, no functionality change. llvm-svn: 62983	2009-01-26 01:57:01 +00:00
Chris Lattner	d67aaa6560	tidy asserts llvm-svn: 62982	2009-01-26 01:38:24 +00:00
Nick Lewycky	5647c5d1a4	The function that does nothing but call malloc is noalias return. llvm-svn: 62956	2009-01-25 07:59:57 +00:00
Dale Johannesen	2b3389a626	Revert previous change; even this mild and clearly more accurate change loses more than it gains on benchmarks. llvm-svn: 62938	2009-01-24 21:49:34 +00:00
Torok Edwin	f4395ea97a	testcase for PR3381. Also it was an empty struct, not a void after all. llvm-svn: 62920	2009-01-24 17:16:04 +00:00
Torok Edwin	73ff92272f	void* is represented as pointer to empty struct {}. Thus we need to check whether the struct is empty before trying to index into it. This fixes PR3381. llvm-svn: 62918	2009-01-24 11:30:49 +00:00
Dale Johannesen	899ecdbbba	Improve the inlining cost function a bit. Little practical effect. llvm-svn: 62908	2009-01-24 01:27:33 +00:00
Chris Lattner	72cd68fe64	Make InstCombineStoreToCast handle aggregates more aggressively, handling the case in Transforms/InstCombine/cast-store-gep.ll, which is a heavily reduced testcase from Clang on x86-64. llvm-svn: 62904	2009-01-24 01:00:13 +00:00
Gabor Greif	59c431347f	use CallSite::isCalle instead of slow getOperandNo llvm-svn: 62877	2009-01-23 21:17:04 +00:00
Gabor Greif	eb61fcf2a1	Simplify the logic of getting hold of a PHI predecessor block. There is now a direct way from value-use-iterator to incoming block in PHINode's API. This way we avoid the iterator->index->iterator trip, and especially the costly getOperandNo() invocation. Additionally there is now an assertion that the iterator really refers to one of the PHI's Uses. llvm-svn: 62869	2009-01-23 19:40:15 +00:00
Gabor Greif	f4013373cd	introduce a useful abstraction to find out if a Use is in the call position of an instruction llvm-svn: 62788	2009-01-22 21:35:57 +00:00
Chris Lattner	77527f5812	Remove uses of uint32_t in favor of 'unsigned' for better compatibility with cygwin. Patch by Jay Foad! llvm-svn: 62695	2009-01-21 18:09:24 +00:00
Dale Johannesen	b5721632ee	Make special cases (0 inf nan) work for frem. Besides APFloat, this involved removing code from two places that thought they knew the result of frem(0., x) but were wrong. llvm-svn: 62645	2009-01-21 00:35:19 +00:00
Chris Lattner	c59945b4bd	another fix for PR3354 llvm-svn: 62561	2009-01-20 01:15:41 +00:00
Bill Wendling	caf1d22243	Doxygen-ify comments. llvm-svn: 62546	2009-01-19 23:43:56 +00:00
Chris Lattner	ea9f1d3c47	Fix a problem exposed by PR3354: simplifycfg was making a potentially trapping instruction be executed unconditionally. llvm-svn: 62541	2009-01-19 23:03:13 +00:00
Chris Lattner	73d7fe5a34	improve compatibility with cygwin, patch by Jay Foad! llvm-svn: 62535	2009-01-19 22:00:18 +00:00
Chris Lattner	6f34e317e9	Fix PR3353, infinitely jump threading an infinite loop make from switches. llvm-svn: 62529	2009-01-19 21:20:34 +00:00
Bill Wendling	534d2e0bae	Temporarily revert r62487. It's causing this error during a release bootstrap of llvm-gcc. Most likely, it's miscompiling one of the "gen*" programs: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.6.0/bin/ -c -g -O2 -mdynamic-no-pic -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -mdynamic-no-pic -DHAVE_CONFIG_H -DGENERATOR_FILE -I. -Ibuild -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/build -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -o build/gencondmd.o build/gencondmd.c ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: excess elements in struct initializer ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: (near initialization for 'insn_conditions[4]') ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected ',' or ';' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:927: error: expected identifier or '(' before ',' token ../../llvm-gcc.src/gcc/config/i386/sse.md:3458: error: expected identifier or '(' before ',' token ... llvm-svn: 62506	2009-01-19 08:46:20 +00:00
Chris Lattner	f2bb4ea39c	Fix PR3016, a bug which can occur do to an invalid assumption: we assumed a CFG structure that would be valid when all code in the function is reachable, but not all code is necessarily reachable. Do a simple, but horrible, CFG walk to check for this case. llvm-svn: 62487	2009-01-19 02:46:28 +00:00
Chris Lattner	e381d7026f	reduce indentation by using 'continue', no functionality change. llvm-svn: 62477	2009-01-19 02:07:32 +00:00
Chris Lattner	54f0c61d71	Fix some problems in SpeculativelyExecuteBB. Basically, because of dead code, a phi could use the speculated instruction that was not in "BB2". Make this check explicit and tighten up some other corners. This fixes PR3292. No testcase becauase this depends entirely on visitation order of blocks and requires a sequence of 8 passes to repro. llvm-svn: 62476	2009-01-19 00:36:37 +00:00
Chris Lattner	e1c01e4e2b	Make this a bit more explicit about which cases need the check. No functionality change. llvm-svn: 62474	2009-01-18 23:22:07 +00:00
Chris Lattner	64b7bd7f9e	Fix rdar://6505632, an llc crash on 483.xalancbmk llvm-svn: 62470	2009-01-18 20:35:00 +00:00
Duncan Sands	e0aa0d677d	BasicAliasAnalysis and FunctionAttrs were both doing very similar pointer capture analysis. Factor out the common logic. The new version is from FunctionAttrs since it does a better job than the version in BasicAliasAnalysis llvm-svn: 62461	2009-01-18 12:19:30 +00:00
Nick Lewycky	3ced0dfa69	Fix copy and pasted typos that prevented strtok_r, realloc, getenv, ungetc, putc, puts, perror, vscanf and vsscanf from getting annotations. Add annotations for eight printf functions, memalign, pread and pwrite. On Linux, llvm-gcc sometimes renames strdup, getc, putc, strtok_r, scanf and sscanf. Match the alternate function names. Fix a crash annotating opendir. Don't mark fsetpos's second parameter as nocapture. It's supposed to be captured. Do mark fopen's path and mode strings as nocapture. Mark ferror as readonly, but not fileno which may set errno. llvm-svn: 62456	2009-01-18 04:34:36 +00:00
Gabor Greif	f1abfdccdc	introduce typedef for complicated vector, and use it too llvm-svn: 62384	2009-01-17 00:09:08 +00:00
Gabor Greif	8c573f7e49	typo llvm-svn: 62377	2009-01-16 23:08:50 +00:00
Chris Lattner	db2d9613d2	Fix PR3335 by not turning a store to one address space into a store to another. llvm-svn: 62351	2009-01-16 20:12:52 +00:00
Chris Lattner	733256fe31	reduce indentation by using early exits, no functionality change. llvm-svn: 62350	2009-01-16 20:08:59 +00:00
Evan Cheng	beac6f8b0c	Clean up previous cast optimization a bit. Also make zext elimination a bit more aggressive: if it's not necessary to emit an AND (i.e. high bits are already zero), it's profitable to evaluate the operand at a different type. llvm-svn: 62297	2009-01-16 02:11:43 +00:00
Rafael Espindola	6de96a1b5d	Add the private linkage. llvm-svn: 62279	2009-01-15 20:18:42 +00:00
Gabor Greif	5aa1922614	avoid using iterators when they get invalidated potentially this fixes PR3332 llvm-svn: 62271	2009-01-15 18:40:09 +00:00
Evan Cheng	ff716cb342	Eliminate a redundant check. llvm-svn: 62264	2009-01-15 17:09:07 +00:00
Evan Cheng	60e19a46f2	- Teach CanEvaluateInDifferentType of this xform: sext (zext ty1), ty2 -> zext ty2 - Looking at the number of sign bits of the a sext instruction to determine whether new trunc + sext pair should be added when its source is being evaluated in a different type. llvm-svn: 62263	2009-01-15 17:01:23 +00:00
Chris Lattner	8fb9480ed2	Fix PR3325, a miscompilation of invokes by IPSCCP. Patch by Jay Foad! llvm-svn: 62244	2009-01-14 21:01:16 +00:00
Dale Johannesen	1f0e0e7c9c	Fix the time regression I introduced in 464.h264ref with my earlier patch to this file. The issue there was that all uses of an IV inside a loop are actually references to Base[IV2], and there was one use outside that was the same but LSR didn't see the base or the scaling because it didn't recurse into uses outside the loop; thus, it used base+IVscale mode inside the loop instead of pulling base out of the loop. This was extra bad because register pressure later forced both base and IV into memory. Doing that recursion, at least enough to figure out addressing modes, is a good idea in general; the change in AddUsersIfInteresting does this. However, there were side effects.... It is also possible for recursing outside the loop to introduce another IV where there was only 1 before (if the refs inside are not scaled and the ref outside is). I don't think this is a common case, but it's in the testsuite. It is right to be very aggressive about getting rid of such introduced IVs (CheckForIVReuse and the handling of nonzero RewriteFactor in StrengthReduceStridedIVUsers). In the testcase in question the new IV produced this way has both a nonconstant stride and a nonzero base, neither of which was handled before. And when inserting new code that feeds into a PHI, it's right to put such code at the original location rather than in the PHI's immediate predecessor(s) when the original location is outside the loop (a case that couldn't happen before) (RewriteInstructionToUseNewBase); better to avoid making multiple copies of it in this case. Also, the mechanism for keeping SCEV's corresponding to GEP's no longer works, as the GEP might change after its SCEV is remembered, invalidating the SCEV, and we might get a bad SCEV value when looking up the GEP again for a later loop. This also couldn't happen before, as we weren't recursing into GEP's outside the loop. Also, when we build an expression that involves a (possibly non-affine) IV from a different loop as well as an IV from the one we're interested in (containsAddRecFromDifferentLoop), don't recurse into that. We can't do much with it and will get in trouble if we try to create new non-affine IVs or something. More testcases are coming. llvm-svn: 62212	2009-01-14 02:35:31 +00:00
Chris Lattner	2538eb664c	rewrite OptimizeAwayTrappingUsesOfLoads to 1) avoid a temporary vector and extraneous loop over it, 2) not delete globals used by phis/selects etc which could actually be useful. This fixes PR3321. Many thanks to Duncan for narrowing this down. llvm-svn: 62201	2009-01-14 00:12:58 +00:00
Dale Johannesen	0aeabdff57	Fix testsuite regressions from recursive inlining. llvm-svn: 62189	2009-01-13 22:43:37 +00:00
Dan Gohman	59af77376c	Make instcombine ensure that all allocas are explicitly aligned at at least their preferred alignment. llvm-svn: 62176	2009-01-13 20:18:38 +00:00
Duncan Sands	944ccc5d6a	Correct a comment. llvm-svn: 62165	2009-01-13 13:48:44 +00:00
Dale Johannesen	433a9086c0	Enable recursive inlining. Reduce inlining threshold back to 200; 400 seems to be too high, loses more than it gains. llvm-svn: 62107	2009-01-12 22:11:50 +00:00
Duncan Sands	dc020f9c3c	Rename getABITypeSize to getTypePaddedSize, as suggested by Chris. llvm-svn: 62099	2009-01-12 20:38:59 +00:00
Dale Johannesen	f84685290a	Increase default inlining aggressiveness in partial compensation for turning off gcc's inliner. This gets us closer to the amount of inlining we were getting before. It is not a win on everything, of course, but seems to gain overall. llvm-svn: 62058	2009-01-11 23:11:00 +00:00
Chris Lattner	bd3c7c8b52	Duncan is nervous about undefinedness of % with negatives. I'm not thrilled about 64-bit % in general, so rewrite to use * instead. llvm-svn: 62047	2009-01-11 20:41:36 +00:00
Chris Lattner	b19151686f	do not generated GEPs into vectors where they don't already exist. We should treat vectors as atomic types, not like arrays. llvm-svn: 62046	2009-01-11 20:23:52 +00:00
Chris Lattner	171d2d474f	Make a couple of cleanups to the instcombine bitcast/gep canonicalization transform based on duncan's comments: 1) improve the comment about %. 2) within our index loop make sure the offset stays within the type size, instead of within the abi size. This allows us to reason explicitly about landing in tail padding and means that issues like non-zero offsets into [0 x foo] types don't occur anymore. llvm-svn: 62045	2009-01-11 20:15:20 +00:00
Chris Lattner	5f54d50917	fix typo Duncan noticed. llvm-svn: 61997	2009-01-09 18:31:39 +00:00
Chris Lattner	ae0e857b98	Fix PR3304 llvm-svn: 61995	2009-01-09 18:18:43 +00:00
Misha Brukman	5cbf223916	Removed trailing whitespace from Makefiles. llvm-svn: 61991	2009-01-09 16:44:42 +00:00
Chris Lattner	f50aa6ae5c	Implement rdar://6480391, extending of equality icmp's to avoid a truncation. I noticed this in the code compiled for a routine using std::map, which produced this code: %25 = tail call i32 @memcmp(i8* %24, i8* %23, i32 6) nounwind readonly %.lobit.i = lshr i32 %25, 31 ; <i32> [#uses=1] %tmp.i = trunc i32 %.lobit.i to i8 ; <i8> [#uses=1] %toBool = icmp eq i8 %tmp.i, 0 ; <i1> [#uses=1] br i1 %toBool, label %bb3, label %bb4 which compiled to: call L_memcmp$stub shrl $31, %eax testb %al, %al jne LBB1_11 ## with this change, we compile it to: call L_memcmp$stub testl %eax, %eax js LBB1_11 This triggers all the time in common code, with patters like this: %169 = and i32 %ply, 1 ; <i32> [#uses=1] %170 = trunc i32 %169 to i8 ; <i8> [#uses=1] %toBool = icmp ne i8 %170, 0 ; <i1> [#uses=1] %7 = lshr i32 %6, 24 ; <i32> [#uses=1] %9 = trunc i32 %7 to i8 ; <i8> [#uses=1] %10 = icmp ne i8 %9, 0 ; <i1> [#uses=1] etc llvm-svn: 61985	2009-01-09 07:47:06 +00:00
Chris Lattner	0f7cf1d7e1	Remove some old code that looks like a remanant from signed-types days. llvm-svn: 61984	2009-01-09 07:10:58 +00:00
Chris Lattner	482eb70a10	Fix PR3298, a crash in Jump Threading. Apparently even jump threading can have bugs, who knew? ;-) llvm-svn: 61983	2009-01-09 06:08:12 +00:00
Chris Lattner	fef138b140	Fix part 3/2 of PR3290, making instcombine zap (gep(bitcast)) when possible. llvm-svn: 61980	2009-01-09 05:44:56 +00:00
Chris Lattner	a784a2ce01	move some code, check to see if the input to the GEP is a bitcast (which is constant time and cheap) before checking hasAllZeroIndices. llvm-svn: 61976	2009-01-09 04:53:57 +00:00
Dale Johannesen	4755d9df78	Adjustments to last patch based on review. llvm-svn: 61969	2009-01-09 01:30:11 +00:00
Dale Johannesen	b48fc71fc6	Do not inline functions with (dynamic) alloca into functions that don't already have a (dynamic) alloca. Dynamic allocas cause inefficient codegen and we shouldn't propagate this (behavior follows gcc). Two existing tests assumed such inlining would be done; they are hacked by adding an alloca in the caller, preserving the point of the tests. llvm-svn: 61946	2009-01-08 21:45:23 +00:00
Chris Lattner	c518dfd11b	This implements the second half of the fix for PR3290, handling loads from allocas that cover the entire aggregate. This handles some memcpy/byval cases that are produced by llvm-gcc. This triggers a few times in kc++ (with std::pair<std::_Rb_tree_const_iterator <kc::impl_abstract_phylum*>,bool>) and once in 176.gcc (with %struct..0anon). llvm-svn: 61915	2009-01-08 05:42:05 +00:00
Duncan Sands	0bcf085845	Whitespace - correct formatting. llvm-svn: 61879	2009-01-07 20:01:06 +00:00
Duncan Sands	289f59f233	Remove alloca tracking from nocapture analysis. Not only was it not very helpful, it was also wrong! The problem is shown in the testcase: the alloca might be passed to a nocapture callee which dereferences it and returns the original pointer. But because it was a nocapture call we think we don't need to track its uses, but we do. llvm-svn: 61876	2009-01-07 19:39:06 +00:00
Duncan Sands	94bcbbab74	Reorder these. llvm-svn: 61873	2009-01-07 19:17:02 +00:00
Duncan Sands	02599850b4	Use a switch rather than a sequence of "isa" tests. llvm-svn: 61872	2009-01-07 19:10:21 +00:00
Duncan Sands	187c5716b6	The verifier checks that the aliasee is not null. llvm-svn: 61870	2009-01-07 18:45:53 +00:00
Chris Lattner	f2b8c82ad1	Implement the first half of PR3290: if there is a store of an integer to a (transitive) bitcast the alloca and if that integer has the full size of the alloca, then it clobbers the whole thing. Handle this by extracting pieces out of the stored integer and filing them away in the SROA'd elements. This triggers fairly frequently because the CFE uses integers to pass small structs by value and the inliner exposes these. For example, in kimwitu++, I see a bunch of these with i64 stores to "%struct.std::pair<std::_Rb_tree_const_iterator<kc::impl_abstract_phylum*>,bool>" In 176.gcc I see a few i32 stores to "%struct..0anon". In the testcase, this is a difference between compiling test1 to: _test1: subl $12, %esp movl 20(%esp), %eax movl %eax, 4(%esp) movl 16(%esp), %eax movl %eax, (%esp) movl (%esp), %eax addl 4(%esp), %eax addl $12, %esp ret vs: _test1: movl 8(%esp), %eax addl 4(%esp), %eax ret The second half of this will be to handle loads of the same form. llvm-svn: 61853	2009-01-07 08:11:13 +00:00
Chris Lattner	9a2de65fd6	Factor a bunch of code out into a helper method. llvm-svn: 61852	2009-01-07 07:18:45 +00:00
Chris Lattner	db561146aa	use continue to simplify code and reduce nesting, no functionality change. llvm-svn: 61851	2009-01-07 06:39:58 +00:00
Chris Lattner	938b54f383	Get TargetData once up front and cache as an ivar instead of requerying it all over the place. llvm-svn: 61850	2009-01-07 06:34:28 +00:00
Chris Lattner	a63dba9e6c	Use the hasAllZeroIndices predicate to simplify some code, no functionality change. llvm-svn: 61849	2009-01-07 06:25:07 +00:00
Chris Lattner	2fdcc59bb6	Change m_ConstantInt and m_SelectCst to take their constant integers as template arguments instead of as instance variables, exposing more optimization opportunities to the compiler earlier. llvm-svn: 61776	2009-01-05 23:53:12 +00:00
Duncan Sands	582c53d147	Teach the internalize pass to also internalize global aliases. llvm-svn: 61754	2009-01-05 21:24:45 +00:00
Evan Cheng	8804293fe9	Find loop back edges only after empty blocks are eliminated. llvm-svn: 61752	2009-01-05 21:17:27 +00:00
Duncan Sands	52e5deece5	Not having an aliasee is a theoretical possibility. llvm-svn: 61745	2009-01-05 20:47:56 +00:00
Duncan Sands	821d13cf78	Format more neatly. llvm-svn: 61744	2009-01-05 20:39:50 +00:00
Duncan Sands	d24b93f339	Remove trailing spaces. llvm-svn: 61743	2009-01-05 20:38:27 +00:00
Duncan Sands	f5dbbae4f4	Delete unused global aliases with internal linkage. In fact this also deletes those with linkonce linkage, however this is currently dead because for the moment aliases aren't allowed to have this linkage type. llvm-svn: 61742	2009-01-05 20:37:33 +00:00
Dan Gohman	906152a20f	Tidy up #includes, deleting a bunch of unnecessary #includes. llvm-svn: 61715	2009-01-05 17:59:02 +00:00
Nick Lewycky	e4e5532e05	Move the libcall annotating part from doFinalization to doInitialization. Finalization occurs after all the FunctionPasses in the group have run, which is clearly not what we want. This also means that we have to make sure that we apply the right param attributes when creating a new function. Also, add a missed optimization: strdup and strndup. NoCapture and NoAlias return! llvm-svn: 61658	2009-01-05 00:07:50 +00:00
Nick Lewycky	959af7ba30	Run a post-pass that marks known function declarations by name. llvm-svn: 61632	2009-01-04 20:27:34 +00:00
Bill Wendling	0c04f9fdc3	Revert this transform. It was causing some dramatic slowdowns in a few tests. See PR3266. llvm-svn: 61623	2009-01-04 06:19:11 +00:00
Nick Lewycky	1d805c62c4	Any void readonly functions are provably dead, don't waste time adding nocapture attributes to them. llvm-svn: 61610	2009-01-03 17:05:32 +00:00
Duncan Sands	c7affb0a8f	Load tracking means that the value analyzed may not have pointer type. In particular, it may be the condition argument for a select or a GEP index. While I was unable to construct a testcase for which some bits of the original pointer are captured due to one of these, it's very very close to being possible - so play safe and exclude these possibilities. llvm-svn: 61580	2009-01-02 15:16:38 +00:00
Duncan Sands	b193a37cd3	When calculating 'nocapture' argument attributes, allow the argument to be stored to an alloca by tracking uses of the alloca. This occurs 4 times (out of 7121, 0.05%) in MultiSource/Applications, so may not be worth it. On the other hand, it is easy to do and fairly cheap. The functions it helps are: W_addcom and W_addlit in spiff; process_args (argv) in d (make_dparser); ercPixConcealIMB in JM/ldecod. llvm-svn: 61570	2009-01-02 11:54:37 +00:00
Duncan Sands	cefc8604aa	Improve comments and reorganize a bit - no functionality change. llvm-svn: 61569	2009-01-02 11:46:24 +00:00
Nick Lewycky	7e82055e88	Make adding nocapture a bit stronger. FreeInst is nocapture. Also, functions that don't write can't leak a pointer except through the return value, so a void readonly function is implicitly nocapture. Test these, and add a test that verifies that f1 calling f2 with an otherwise dead pointer gets both of them marked nocapture. llvm-svn: 61552	2009-01-02 03:46:56 +00:00
Duncan Sands	1f11d2bbc1	Mention that this pass does escape analysis in the leading comments. llvm-svn: 61548	2009-01-01 20:45:19 +00:00
Bill Wendling	0fcff2c203	Fix comment. llvm-svn: 61538	2009-01-01 01:19:59 +00:00
Bill Wendling	aedb54a947	Add transformation: xor (or (icmp, icmp), true) -> and(icmp, icmp) This is possible because of De Morgan's law. llvm-svn: 61537	2009-01-01 01:18:23 +00:00
Duncan Sands	163848021b	Look through phi nodes and select instructions when calculating nocapture attributes. llvm-svn: 61535	2008-12-31 20:21:34 +00:00
Duncan Sands	df128eb477	Don't analyze arguments already marked 'nocapture'. llvm-svn: 61532	2008-12-31 18:08:59 +00:00
Duncan Sands	44c8cd97a5	Rename AddReadAttrs to FunctionAttrs, and teach it how to work out (in a very simplistic way) which function arguments (pointer arguments only) are only dereferenced and so do not escape. Mark such arguments 'nocapture'. llvm-svn: 61525	2008-12-31 16:14:43 +00:00
Duncan Sands	f6069577fa	Experiments show that looking through phi nodes and select instructions doesn't buy anything here except extra complexity: the only difference in the entire testsuite was that a readonly function became readnone in MiBench/consumer-typeset. Add a comment about this. llvm-svn: 61478	2008-12-29 20:51:17 +00:00
Duncan Sands	c125d6a3d3	Allow readnone functions to read (and write!) global constants, since doing so is irrelevant for aliasing purposes. While this doesn't increase the total number of functions marked readonly or readnone in MultiSource/ Applications (3089), it does result in 12 functions being marked readnone rather than readonly. Before: readnone: 820 readonly: 2269 After: readnone: 832 readonly: 2257 llvm-svn: 61469	2008-12-29 11:34:09 +00:00
Dale Johannesen	656237beca	Revert 61362 and 61402 until SPEC breakage is fixed. llvm-svn: 61403	2008-12-23 23:21:35 +00:00
Dale Johannesen	f8b161bcd1	This fixes the bug in 175.vpr. It doesn't fix the other SPEC breakage. I'll be reverting all recent changes shortly, this checking is mostly so this change doesn't get lost. llvm-svn: 61402	2008-12-23 23:05:26 +00:00
Dale Johannesen	93b9aa8799	Fix the time regression I introduced in 464.h264ref with my last patch to this file. The issue there was that all uses of an IV inside a loop are actually references to Base[IV2], and there was one use outside that was the same but LSR didn't see the base or the scaling because it didn't recurse into uses outside the loop; thus, it used base+IVscale mode inside the loop instead of pulling base out of the loop. This was extra bad because register pressure later forced both base and IV into memory. Doing that recursion, at least enough to figure out addressing modes, is a good idea in general; the change in AddUsersIfInteresting does this. However, there were side effects.... It is also possible for recursing outside the loop to introduce another IV where there was only 1 before (if the refs inside are not scaled and the ref outside is). I don't think this is a common case, but it's in the testsuite. It is right to be very aggressive about getting rid of such introduced IVs (CheckForIVReuse and the handling of nonzero RewriteFactor in StrengthReduceStridedIVUsers). In the testcase in question the new IV produced this way has both a nonconstant stride and a nonzero base, neither of which was handled before. And when inserting new code that feeds into a PHI, it's right to put such code at the original location rather than in the PHI's immediate predecessor(s) when the original location is outside the loop (a case that couldn't happen before) (RewriteInstructionToUseNewBase); better to avoid making multiple copies of it in this case. Also, the mechanism for keeping SCEV's corresponding to GEP's no longer works, as the GEP might change after its SCEV is remembered, invalidating the SCEV, and we might get a bad SCEV value when looking up the GEP again for a later loop. This also couldn't happen before, as we weren't recursing into GEP's outside the loop. I owe some testcases for this, want to get it in for nightly runs. llvm-svn: 61362	2008-12-23 02:12:52 +00:00
Owen Anderson	164274eeb1	Don't forget to remove phi nodes from the value numbering table after we collapse them. llvm-svn: 61358	2008-12-23 00:49:51 +00:00
Bill Wendling	456e885382	Comment clean-ups. No functionality change. llvm-svn: 61354	2008-12-22 22:32:22 +00:00
Bill Wendling	e7f08e7250	Check that the instruction isn't in the value numbering scope. llvm-svn: 61353	2008-12-22 22:28:56 +00:00
Bill Wendling	86f01cb9f6	Simplification: Negate the operator== method instead of implementing a full operator!= method. llvm-svn: 61352	2008-12-22 22:16:31 +00:00
Bill Wendling	3c793441cb	Add verification that deleted instruction isn't hiding in the PHI map. llvm-svn: 61350	2008-12-22 22:14:07 +00:00
Bill Wendling	ebb6a543fa	Verify removed in a few more places. llvm-svn: 61349	2008-12-22 21:57:30 +00:00
Bill Wendling	6b18a3994b	Add verification functions to GVN which check to see that an instruction was truely deleted. These will be expanded with further checks of all of the data structures. llvm-svn: 61347	2008-12-22 21:36:08 +00:00
Nick Lewycky	10eb8e533f	Turn strcmp into memcmp, such as strcmp(P, "x") --> memcmp(P, "x", 2). llvm-svn: 61297	2008-12-21 00:19:21 +00:00
Nick Lewycky	4bc10c9e77	Remove redundant test for vector-nature. Scan the vector first to see whether our optz'n will apply to it, then build the replacement vector only if needed. llvm-svn: 61279	2008-12-20 16:48:00 +00:00
Evan Cheng	3b3de7c228	- CodeGenPrepare does not split loop back edges but it only knows about back edges of single block loops. It now does a DFS walk to find loop back edges. - Use SplitBlockPredecessors to factor out common predecessors of the critical edge destination. This is disabled for now due to some regressions. llvm-svn: 61248	2008-12-19 18:03:11 +00:00
Bill Wendling	070de29fcf	Didn't mean to commit this. llvm-svn: 61222	2008-12-18 22:19:50 +00:00
Bill Wendling	4c13e77d49	Re-XFAIL this test until debug stuff settles down. llvm-svn: 61219	2008-12-18 22:13:31 +00:00
Nick Lewycky	c3a70ade66	Oops! Left out a line. Simplifying the sdiv might allow further simplifications for our users. llvm-svn: 61196	2008-12-18 06:42:28 +00:00
Nick Lewycky	0f0e63fe73	Make all the vector elements positive in an srem of constant vector. llvm-svn: 61195	2008-12-18 06:31:11 +00:00
Chris Lattner	4caf5eb70c	Fix PR2929 by making bugpoint/code extract propagate the nothrow bit from the original function to the cloned one. llvm-svn: 61194	2008-12-18 05:52:56 +00:00
Dale Johannesen	3e5843b992	Revert previous patch, appears to break bootstrap. llvm-svn: 61181	2008-12-18 01:23:41 +00:00
Dale Johannesen	12d031b716	Fix the time regression I introduced in 464.h264ref with my last patch to this file. The issue there was that all uses of an IV inside a loop are actually references to Base[IV2], and there was one use outside that was the same but LSR didn't see the base or the scaling because it didn't recurse into uses outside the loop; thus, it used base+IVscale mode inside the loop instead of pulling base out of the loop. This was extra bad because register pressure later forced both base and IV into memory. Doing that recursion, at least enough to figure out addressing modes, is a good idea in general; the change in AddUsersIfInteresting does this. However, there were side effects.... It is also possible for recursing outside the loop to introduce another IV where there was only 1 before (if the refs inside are not scaled and the ref outside is). I don't think this is a common case, but it's in the testsuite. It is right to be very aggressive about getting rid of such introduced IVs (CheckForIVReuse and the handling of nonzero RewriteFactor in StrengthReduceStridedIVUsers). In the testcase in question the new IV produced this way has both a nonconstant stride and a nonzero base, neither of which was handled before. (This patch does not handle all the cases where this can happen.) And when inserting new code that feeds into a PHI, it's right to put such code at the original location rather than in the PHI's immediate predecessor(s) when the original location is outside the loop (a case that couldn't happen before) (RewriteInstructionToUseNewBase); better to avoid making multiple copies of it in this case. Everything above is exercised in CodeGen/X86/lsr-negative-stride.ll (and ifcvt4 in ARM which is the same IR). llvm-svn: 61178	2008-12-18 00:57:22 +00:00
Chris Lattner	b6372933b5	reapply this hunk from Bill's reversion in r61169, it is conservative and safe and orthogonal from turning off load pre. llvm-svn: 61177	2008-12-18 00:51:32 +00:00
Chris Lattner	c1c6404bba	make instnamer name unnamed blocks as well as instructions and args. llvm-svn: 61175	2008-12-18 00:33:11 +00:00
Bill Wendling	be4fb8a25f	Temporarily revert r61027. It was causing a bootstrap failure in "release" mode with everyone's favorite error messages: Comparing stages 2 and 3 warning: ./cc1-checksum.o differs warning: ./cc1plus-checksum.o differs Bootstrap comparison failure! ./c-decl.o differs ./cp/decl.o differs ./df-core.o differs ./gcc.o differs ./i386.o differs ./stor-layout.o differs ./tree-pretty-print.o differs ./tree.o differs make[2]: * [compare] Error 1 make[1]: * [stage3-bubble] Error 2 See PR3227. llvm-svn: 61169	2008-12-17 23:31:20 +00:00
Chris Lattner	0cdf52310a	insert some sequence points and preincrement an iterator to avoid iterator invalidation problems. llvm-svn: 61124	2008-12-17 05:42:08 +00:00
Chris Lattner	222ef4c489	Enhance heap sra to be substantially more aggressive w.r.t PHI nodes. This allows it to do fairly general phi insertion if a load from a pointer global wants to be SRAd but the load is used by (recursive) phi nodes. This fixes a pessimization on ppc introduced by Load PRE. llvm-svn: 61123	2008-12-17 05:28:49 +00:00
Dale Johannesen	904ce8120d	Clarify that the scale factor from CheckForIVReuse can be negative. Keep track of whether all uses of an IV are outside the loop. Some cosmetics; no functional change. llvm-svn: 61109	2008-12-16 22:16:28 +00:00
Chris Lattner	56b55387fc	Fix another crash found by inspection. If we have a PHI node merging the load multiple times, make sure the check the uses of the PHI to ensure they are transformable. llvm-svn: 61102	2008-12-16 21:24:51 +00:00
Chris Lattner	06a456b3f4	fix a crash found by inspection. llvm-svn: 61101	2008-12-16 21:04:51 +00:00
Eli Friedman	cb61afb546	Add a helper to remove a branch and DCE the condition, and use it consistently for deleting branches. In addition to being slightly more readable, this makes SimplifyCFG a bit better about cleaning up after itself when it makes conditions unused. llvm-svn: 61100	2008-12-16 20:54:32 +00:00
Chris Lattner	6ddde53783	switch some std::set/std::map to SmallPtrSet/DenseMap. llvm-svn: 61081	2008-12-16 07:34:30 +00:00
Chris Lattner	49e3bdc165	enhance heap-sra to apply to fixed sized array allocations, not just variable sized array allocations. llvm-svn: 61051	2008-12-15 21:44:34 +00:00
Chris Lattner	1c731fa86f	Use stripPointerCasts. llvm-svn: 61047	2008-12-15 21:20:32 +00:00
Chris Lattner	f0eb568021	minor tweaks for formatting, allow bitcast in ValueIsOnlyUsedLocallyOrStoredToOneGlobal. llvm-svn: 61046	2008-12-15 21:08:54 +00:00
Chris Lattner	c4274a71d5	refactor some code into a new TryToOptimizeStoreOfMallocToGlobal function. Use GetElementPtrInst::hasAllZeroIndices where possible. llvm-svn: 61045	2008-12-15 21:02:25 +00:00
Chris Lattner	0c68ae0603	Enable Load PRE. This teaches GVN to push partially redundant loads up the CFG when there is exactly one predecessor where the load is not available. This is designed to not increase code size but still eliminate partially redundant loads. This fires 1765 times on 403.gcc even though it doesn't do critical edge splitting yet (the most common reason for it to fail). llvm-svn: 61027	2008-12-15 05:28:29 +00:00
Owen Anderson	03aacbae90	Ifdef out some code that I didn't mean to enable by default yet. llvm-svn: 61024	2008-12-15 03:52:17 +00:00
Chris Lattner	69131fd872	make GVN try to rename inputs to the resultant replaced values, which cleans up the generated code a bit. This should have the added benefit of not randomly renaming functions/globals like my previous patch did. :) llvm-svn: 61023	2008-12-15 03:46:38 +00:00
Owen Anderson	bfe133e4ac	Add support for slow-path GVN with full phi construction for scalars. This is disabled for now, as it actually pessimizes code in the abscence of phi translation for load elimination. This slow down GVN a bit, by about 2% on 403.gcc. llvm-svn: 61021	2008-12-15 02:03:00 +00:00
Chris Lattner	f5eef9f6db	eliminate warning when asserts disabled. llvm-svn: 61012	2008-12-14 21:36:23 +00:00
Owen Anderson	e34c2399de	Generalize GVN's phi construciton routine to work for things other than loads. llvm-svn: 61009	2008-12-14 19:10:35 +00:00
Bill Wendling	293b9181e5	Temporarily revert r60973. It's inexplicably causing a failure when self-hosting LLVM: llvm[2]: Linking Release executable opt (without symbols) ... Undefined symbols: "llvm::APFloat::IEEEsingle", referenced from: __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) "llvm::APFloat::IEEEdouble", referenced from: __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) ld: symbol(s) not found This is in release mode. To replicate, compile llvm and llvm-gcc in optimized mode. Then build llvm, in optimized mode, with the newly created compiler. llvm-svn: 60977	2008-12-13 09:28:44 +00:00
Chris Lattner	1e29f7c97d	make RLE preserve the name of the load that it replaces. This is just a pretification of the IR. llvm-svn: 60973	2008-12-13 07:22:47 +00:00
Misha Brukman	234b44add2	Fix spelling. llvm-svn: 60971	2008-12-13 05:21:37 +00:00
Chris Lattner	fa9f99aa12	Teach GVN to invalidate some memdep information when it does an RAUW of a pointer. This allows is to catch more equivalencies. For example, the type_lists_compatible_p function used to require two iterations of the gvn pass (!) to delete its 18 redundant loads because the first pass would CSE all the addressing computation cruft, which would unblock the second memdep/gvn passes from recognizing them. This change allows memdep/gvn to catch all 18 when run just once on the function (as is typical :) instead of just 3. On all of 403.gcc, this bumps up the # reundandancies found from: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted to: 63 gvn - Number of instructions PRE'd 154137 gvn - Number of instructions deleted 50185 gvn - Number of loads deleted +120 loads deleted isn't bad. llvm-svn: 60799	2008-12-09 22:06:23 +00:00
Chris Lattner	254314e6bc	rename getNonLocalDependency -> getNonLocalCallDependency, and remove pointer stuff from it, simplifying the code a bit. llvm-svn: 60783	2008-12-09 19:38:05 +00:00
Chris Lattner	b6fc4b8d92	Switch GVN::processNonLocalLoad to using the new MemDep::getNonLocalPointerDependency method. There are some open issues with this (missed optimizations) and plenty of future work, but this does allow GVN to eliminate slightly more loads (49246 vs 49033). Switching over now allows simplification of the other code path in memdep. llvm-svn: 60780	2008-12-09 19:25:07 +00:00
Chris Lattner	0a5a8d54a9	random cleanups, no functionality change. llvm-svn: 60779	2008-12-09 19:21:47 +00:00
Chris Lattner	56b20ffc5f	Fix a really subtle off-by-one bug that Duncan noticed with valgrind on test/CodeGen/Generic/2007-06-06-CriticalEdgeLandingPad. llvm-svn: 60739	2008-12-09 04:47:21 +00:00
Chris Lattner	e598370ae9	remove DebugIterations option. Despite the accusations, jump threading has been shown to only expose problems not have bugs itself. I'm sure it's completely bug free! ;-) llvm-svn: 60725	2008-12-08 22:44:07 +00:00
Devang Patel	2bb8a2f80f	Fix spelling. Thanks Duncan! llvm-svn: 60702	2008-12-08 17:07:24 +00:00
Devang Patel	1c469d36b0	Undo previous patch. llvm-svn: 60701	2008-12-08 17:02:37 +00:00
Chris Lattner	f50d7f76c6	fix a bug I introduced in simplifycfg handling single entry phi nodes. FoldSingleEntryPHINodes deletes the PHI, so there is no need to delete it afterward. llvm-svn: 60653	2008-12-07 07:22:45 +00:00
Chris Lattner	5df5b4cc2e	don't bother touching volatile stores, they will just return clobber on everything interesting anyway. llvm-svn: 60640	2008-12-07 00:25:15 +00:00
Chris Lattner	57e91eaf61	Reimplement the inner loop of DSE. It now uniformly uses getDependence(), doesn't do its own local caching, and is slightly more aggressive about free/store dse (see testcase). This eliminates the last external client of MemDep::getDependenceFrom(). llvm-svn: 60619	2008-12-06 00:53:22 +00:00
Dale Johannesen	9efd2ce55b	Make LoopStrengthReduce smarter about hoisting things out of loops when they can be subsumed into addressing modes. Change X86 addressing mode check to realize that some PIC references need an extra register. (I believe this is correct for Linux, if not, I'm sure someone will tell me.) llvm-svn: 60608	2008-12-05 21:47:27 +00:00
Chris Lattner	0e3d6337c6	Make a few major changes to memdep and its clients: 1. Merge the 'None' result into 'Normal', making loads and stores return their dependencies on allocations as Normal. 2. Split the 'Normal' result into 'Clobber' and 'Def' to distinguish between the cases when memdep knows the value is produced from when we just know if may be changed. 3. Move some of the logic for determining whether readonly calls are CSEs into memdep instead of it being in GVN. This still leaves verification that the arguments are hte same to GVN to let it know about value equivalences in different contexts. 4. Change memdep's call/call dependency analysis to use getModRefInfo(CallSite,CallSite) instead of doing something very weak. This only really matters for things like DSA, but someday maybe we'll have some other decent context sensitive analyses :) 5. This reimplements the guts of memdep to handle the new results. 6. This simplifies GVN significantly: a) readonly call CSE is slightly simpler b) I eliminated the "getDependencyFrom" chaining for load elimination and load CSE doesn't have to worry about volatile (they are always clobbers) anymore. c) GVN no longer does any 'lastLoad' caching, leaving it to memdep. 7. The logic in DSE is simplified a bit and sped up. A potentially unsafe case was eliminated. llvm-svn: 60607	2008-12-05 21:04:20 +00:00
Anton Korobeynikov	24600bf05a	Revert invalid r60393. It causes llvm-gcc bootstrap fails in release builds. See PR3160 for details llvm-svn: 60604	2008-12-05 19:38:49 +00:00
Chris Lattner	c100828026	Fix test/Transforms/GVN/pre-load.ll llvm-svn: 60594	2008-12-05 17:04:12 +00:00
Chris Lattner	d2a653af0c	Make IsValueFullyAvailableInBlock safe. llvm-svn: 60588	2008-12-05 07:49:08 +00:00
Devang Patel	c56423b500	Rewrite code that 1) filters loops and 2) calculates new loop bounds. This fixes many bugs. I will add more test cases in a separate check-in. Some day, the code that manipulates CFG and updates dom. info could use refactoring help. llvm-svn: 60554	2008-12-04 21:38:42 +00:00
Chris Lattner	8f723670ce	Start simplifying a switch that has a successor that is a switch. llvm-svn: 60534	2008-12-04 06:31:07 +00:00
Chris Lattner	75c2661d24	add a debugging option to help track down j-t problems. llvm-svn: 60514	2008-12-04 00:07:59 +00:00
Dale Johannesen	4e9e6ea604	Remove an unused field. llvm-svn: 60508	2008-12-03 22:43:56 +00:00
Dale Johannesen	f7a588b909	Fix a misspelled function name. llvm-svn: 60506	2008-12-03 20:56:12 +00:00
Chris Lattner	dc3f6f2c12	Factor some code into a new FoldSingleEntryPHINodes method. llvm-svn: 60501	2008-12-03 19:44:02 +00:00
Dale Johannesen	d49ceff6ba	Fix a really wrong comment. llvm-svn: 60494	2008-12-03 19:25:46 +00:00
Chris Lattner	595c7279bd	Teach jump threading some more simple tricks: 1) have it fold "br undef", which does occur with surprising frequency as jump threading iterates. 2) teach j-t to delete dead blocks. This removes the successor edges, reducing the in-edges of other blocks, allowing recursive simplification. 3) Fold things like: br COND, BBX, BBY BBX: br COND, BBZ, BBW which also happens because jump threading iterates. llvm-svn: 60470	2008-12-03 07:48:08 +00:00
Chris Lattner	37e0136fef	third time is the charm. llvm-svn: 60469	2008-12-03 07:45:15 +00:00
Chris Lattner	c04a1ffa9a	fix assertion. llvm-svn: 60468	2008-12-03 07:43:05 +00:00
Chris Lattner	7eb270ed03	Rename DeleteBlockIfDead to DeleteDeadBlock and make it unconditionally delete the block. All likely clients will do the checking anyway. llvm-svn: 60464	2008-12-03 06:40:52 +00:00
Chris Lattner	bcc904a67c	Factor some code out of SimplifyCFG, forming a new DeleteBlockIfDead method. llvm-svn: 60463	2008-12-03 06:37:44 +00:00
Dale Johannesen	4d2ecb8f68	Minor rewrite per review feedback. llvm-svn: 60442	2008-12-02 21:17:11 +00:00
Dale Johannesen	70060013d2	Make the code do what the comment says it does. llvm-svn: 60431	2008-12-02 18:40:09 +00:00
Chris Lattner	1db9bbe802	Implement PRE of loads in the GVN pass with a pretty cheap and straight-forward implementation. This does not require any extra alias analysis queries beyond what we already do for non-local loads. Some programs really really like load PRE. For example, SPASS triggers this ~1000 times, ~300 times in 255.vortex, and ~1500 times on 403.gcc. The biggest limitation to the implementation is that it does not split critical edges. This is a huge killer on many programs and should be addressed after the initial patch is enabled by default. The implementation of this should incidentally speed up rejection of non-local loads because it avoids creating the repl densemap in cases when it won't be used for fully redundant loads. This is currently disabled by default. Before I turn this on, I need to fix a couple of miscompilations in the testsuite, look at compile time performance numbers, and look at perf impact. This is pretty close to ready though. llvm-svn: 60408	2008-12-02 08:16:11 +00:00
Bill Wendling	87beb9b909	Remove some errors that crept in. No functionality change. llvm-svn: 60403	2008-12-02 06:24:20 +00:00
Bill Wendling	790b4bf9a9	Merge two if-statements into one. llvm-svn: 60402	2008-12-02 06:22:04 +00:00
Bill Wendling	5635295266	More styalistic changes. No functionality change. llvm-svn: 60401	2008-12-02 06:18:11 +00:00
Bill Wendling	85de4b35ca	- Remove the buggy -X/C -> X/-C transform. This isn't valid when X isn't a constant. If X is a constant, then this is folded elsewhere. - Added a note to Target/README.txt to indicate that we'd like to implement this when we're able. llvm-svn: 60399	2008-12-02 05:12:47 +00:00
Bill Wendling	5369db5917	Improve comment. llvm-svn: 60398	2008-12-02 05:09:00 +00:00
Bill Wendling	21716dff5e	- Reduce nesting. - No need to do a swap on a canonicalized pattern. No functionality change. llvm-svn: 60397	2008-12-02 05:06:43 +00:00
Chris Lattner	ead1a61b47	some random comment improvements. llvm-svn: 60395	2008-12-02 04:52:26 +00:00
Owen Anderson	d930420ccf	Fix an issue that Chris noticed, where local PRE was not properly instantiating a new value numbering set after splitting a critical edge. This increases the number of instances of PRE on 403.gcc from ~60 to ~570. llvm-svn: 60393	2008-12-02 04:09:22 +00:00
Dale Johannesen	069a4eee55	Consider only references to an IV within the loop when figuring out the base of the IV. This produces better code in the example. (Addresses use (IV) instead of (BASE,IV) - a significant improvement on low-register machines like x86). llvm-svn: 60374	2008-12-01 22:00:01 +00:00
Bill Wendling	6f71bce4cf	Don't rebuild RHSNeg. Just use the one that's already there. llvm-svn: 60370	2008-12-01 21:06:30 +00:00
Bill Wendling	84f6f2539f	Document what this check is doing. Also, no need to cast to ConstantInt. llvm-svn: 60369	2008-12-01 21:03:43 +00:00
Bill Wendling	e6c87a4952	Use a simple comparison. Overflow on integer negation can only occur when the integer is "minint". llvm-svn: 60366	2008-12-01 19:46:27 +00:00
Bill Wendling	47f733e4ea	Generalize the FoldOrWithConstant method to fold for any two constants which don't have overlapping bits. llvm-svn: 60344	2008-12-01 08:32:40 +00:00
Bill Wendling	22e761b302	Reduce copy-and-paste code by splitting out the code into its own function. llvm-svn: 60343	2008-12-01 08:23:25 +00:00
Bill Wendling	582fe6b0ca	Use m_Specific() instead of double matching. llvm-svn: 60341	2008-12-01 08:09:47 +00:00
Bill Wendling	4eecfb655b	Move pattern check outside of the if-then statement. This prevents us from fiddling with constants unless we have to. llvm-svn: 60340	2008-12-01 07:47:02 +00:00
Chris Lattner	6f5bf6a718	Rename some variables, only increment BI once at the start of the loop instead of throughout it. llvm-svn: 60339	2008-12-01 07:35:54 +00:00
Chris Lattner	f00aae4968	pull the predMap densemap out of the inner loop of performPRE, so that it isn't reallocated all the time. This is a tiny speedup for GVN: 3.90->3.88s llvm-svn: 60338	2008-12-01 07:29:03 +00:00
Chris Lattner	2b07d3ccde	switch a couple more calls to use array_pod_sort. llvm-svn: 60337	2008-12-01 06:52:57 +00:00
Chris Lattner	2c2dd15a85	Introduce a new array_pod_sort function and switch LSR to use it instead of std::sort. This shrinks the release-asserts LSR.o file by 1100 bytes of code on my system. We should start using array_pod_sort where possible. llvm-svn: 60335	2008-12-01 06:49:59 +00:00
Chris Lattner	2aebea5735	Eliminate use of setvector for the DeadInsts set, just use a smallvector. This is a lot cheaper and conceptually simpler. llvm-svn: 60332	2008-12-01 06:27:41 +00:00
Chris Lattner	4da78e3774	DeleteTriviallyDeadInstructions is always passed the DeadInsts ivar, just use it directly. llvm-svn: 60330	2008-12-01 06:14:28 +00:00
Chris Lattner	a68a5a4784	simplify DeleteTriviallyDeadInstructions again, unlike my previous buggy rewrite, this notifies ScalarEvolution of a pending instruction about to be removed and then erases it, instead of erasing it then notifying. llvm-svn: 60329	2008-12-01 06:11:32 +00:00
Chris Lattner	9e6b243428	simplify these patterns using m_Specific. No need to grep for xor in testcase (or is a substring). llvm-svn: 60328	2008-12-01 05:16:26 +00:00
Chris Lattner	88a1f0213d	Teach jump threading to clean up after itself, DCE and constfolding the new instructions it simplifies. Because we're threading jumps on edges with constants coming in from PHI's, we inherently are exposing a lot more constants to the new block. Folding them and deleting dead conditions allows the cost model in jump threading to be more accurate as it iterates. llvm-svn: 60327	2008-12-01 04:48:07 +00:00
Chris Lattner	084b3a47d3	Change instcombine to use FoldPHIArgGEPIntoPHI to fold two operand PHIs instead of using FoldPHIArgBinOpIntoPHI. In addition to being more obvious, this also fixes a problem where instcombine wouldn't merge two phis that had different variable indices. This prevented instcombine from factoring big chunks of code in 403.gcc. For example: insn_cuid.exit: - %tmp336 = load i32** @uid_cuid, align 4 - %tmp337 = getelementptr %struct.rtx_def* %insn_addr.0.ph.i, i32 0, i32 3 - %tmp338 = bitcast [1 x %struct.rtunion]* %tmp337 to i32* - %tmp339 = load i32* %tmp338, align 4 - %tmp340 = getelementptr i32* %tmp336, i32 %tmp339 br label %bb62 bb61: - %tmp341 = load i32** @uid_cuid, align 4 - %tmp342 = getelementptr %struct.rtx_def* %insn, i32 0, i32 3 - %tmp343 = bitcast [1 x %struct.rtunion]* %tmp342 to i32* - %tmp344 = load i32* %tmp343, align 4 - %tmp345 = getelementptr i32* %tmp341, i32 %tmp344 br label %bb62 bb62: - %iftmp.62.0.in = phi i32* [ %tmp345, %bb61 ], [ %tmp340, %insn_cuid.exit ] + %insn.pn2 = phi %struct.rtx_def* [ %insn, %bb61 ], [ %insn_addr.0.ph.i, %insn_cuid.exit ] + %tmp344.pn.in.in = getelementptr %struct.rtx_def* %insn.pn2, i32 0, i32 3 + %tmp344.pn.in = bitcast [1 x %struct.rtunion]* %tmp344.pn.in.in to i32* + %tmp341.pn = load i32** @uid_cuid + %tmp344.pn = load i32* %tmp344.pn.in + %iftmp.62.0.in = getelementptr i32* %tmp341.pn, i32 %tmp344.pn %iftmp.62.0 = load i32* %iftmp.62.0.in llvm-svn: 60325	2008-12-01 03:42:51 +00:00
Chris Lattner	9d02a70a7d	Teach inst combine to merge GEPs through PHIs. This is really important because it is sinking the loads using the GEPs, but not the GEPs themselves. This triggers 647 times on 403.gcc and makes the .s file much much nicer. For example before: je LBB1_87 ## bb78 LBB1_62: ## bb77 leal 84(%esi), %eax LBB1_63: ## bb79 movl (%eax), %eax ... LBB1_87: ## bb78 movl $0, 4(%esp) movl %esi, (%esp) call L_make_decl_rtl$stub jmp LBB1_62 ## bb77 after: jne LBB1_63 ## bb79 LBB1_62: ## bb78 movl $0, 4(%esp) movl %esi, (%esp) call L_make_decl_rtl$stub LBB1_63: ## bb79 movl 84(%esi), %eax The input code was (and the GEPs are merged and the PHI is now eliminated by instcombine): br i1 %tmp233, label %bb78, label %bb77 bb77: %tmp234 = getelementptr %struct.tree_node* %t_addr.3, i32 0, i32 0, i32 22 br label %bb79 bb78: call void @make_decl_rtl(%struct.tree_node* %t_addr.3, i8* null) nounwind %tmp235 = getelementptr %struct.tree_node* %t_addr.3, i32 0, i32 0, i32 22 br label %bb79 bb79: %iftmp.12.0.in = phi %struct.rtx_def [ %tmp235, %bb78 ], [ %tmp234, %bb77 ] %iftmp.12.0 = load %struct.rtx_def %iftmp.12.0.in llvm-svn: 60322	2008-12-01 02:34:36 +00:00
Chris Lattner	9ce8995d24	Make GVN be more intelligent about redundant load elimination: when finding dependent load/stores, realize that they are the same if aliasing claims must alias instead of relying on the pointers to be exactly equal. This makes load elimination more aggressive. For example, on 403.gcc, we had: < 68 gvn - Number of instructions PRE'd < 152718 gvn - Number of instructions deleted < 49699 gvn - Number of loads deleted < 6153 memdep - Number of dirty cached non-local responses < 169336 memdep - Number of fully cached non-local responses < 162428 memdep - Number of uncached non-local responses now we have: > 64 gvn - Number of instructions PRE'd > 153623 gvn - Number of instructions deleted > 49856 gvn - Number of loads deleted > 5022 memdep - Number of dirty cached non-local responses > 159030 memdep - Number of fully cached non-local responses > 162443 memdep - Number of uncached non-local responses That's an extra 157 loads deleted and extra 905 other instructions nuked. This slows down GVN very slightly, from 3.91 to 3.96s. llvm-svn: 60314	2008-12-01 01:31:36 +00:00
Chris Lattner	7e61dafc95	Reimplement the non-local dependency data structure in terms of a sorted vector instead of a densemap. This shrinks the memory usage of this thing substantially (the high water mark) as well as making operations like scanning it faster. This speeds up memdep slightly, gvn goes from 3.9376 to 3.9118s on 403.gcc This also splits out the statistics for the cached non-local case to differentiate between the dirty and clean cached case. Here's the stats for 403.gcc: 6153 memdep - Number of dirty cached non-local responses 169336 memdep - Number of fully cached non-local responses 162428 memdep - Number of uncached non-local responses yay for caching :) llvm-svn: 60313	2008-12-01 01:15:42 +00:00
Bill Wendling	5b902c5b1e	Implement ((A\|B)&1)\|(B&-2) -> (A&1) \| B transformation. This also takes care of permutations of this pattern. llvm-svn: 60312	2008-12-01 01:07:11 +00:00
Chris Lattner	8541edec44	Cache analyses in ivars and add some useful DEBUG output. This speeds up GVN from 4.0386s to 3.9376s. llvm-svn: 60310	2008-12-01 00:40:32 +00:00
Chris Lattner	80c7d81e81	improve indentation, do cheap checks before expensive ones, remove some fixme's. This speeds up GVN very slightly on 403.gcc (4.06->4.03s) llvm-svn: 60309	2008-11-30 23:39:23 +00:00
Eli Friedman	11c15a5de7	Minor cleanup: use getTrue and getFalse where appropriate. No functional change. llvm-svn: 60307	2008-11-30 22:48:49 +00:00
Eli Friedman	55e4becba9	Some minor cleanups to instcombine; no functionality change. Note that the FoldOpIntoPhi call is dead because it's impossible for the first operand of a subtraction to be both a ConstantInt and a PHINode. llvm-svn: 60306	2008-11-30 21:09:11 +00:00
Bill Wendling	de89bc275c	Add instruction combining for ((A&~B)\|(~A&B)) -> A^B and all permutations. llvm-svn: 60291	2008-11-30 13:52:49 +00:00
Bill Wendling	9eef421e12	Implement (A&((~A)\|B)) -> A&B transformation in the instruction combiner. This takes care of all permutations of this pattern. llvm-svn: 60290	2008-11-30 13:08:13 +00:00
Bill Wendling	2fe3229824	Forgot one remaining call to getSExtValue(). llvm-svn: 60289	2008-11-30 12:41:09 +00:00
Bill Wendling	2d2e7861b5	getSExtValue() doesn't work for ConstantInts with bitwidth > 64 bits. Use all APInt calls instead. This fixes PR3144. llvm-svn: 60288	2008-11-30 12:38:24 +00:00
Eli Friedman	09bc610945	Optimize memmove and memset into the LLVM builtins. Note that these only show up in code from front-ends besides llvm-gcc, like clang. llvm-svn: 60287	2008-11-30 08:32:11 +00:00
Bill Wendling	7abf352f44	Don't make TwoToExp signed by default. llvm-svn: 60279	2008-11-30 05:29:33 +00:00
Bill Wendling	af200e9237	From Hacker's Delight: "For signed integers, the determination of overflow of xy is not so simple. If x and y have the same sign, then overflow occurs iff xy > 231 - 1. If they have opposite signs, then overflow occurs iff xy < -2*31." In this case, x == -1. llvm-svn: 60278	2008-11-30 05:01:05 +00:00
Bill Wendling	70635adea3	Instcombine was illegally transforming -X/C into X/-C when either X or C overflowed on negation. This commit checks to make sure that neithe C nor X overflows. This requires that the RHS of X (a subtract instruction) be a constant integer. llvm-svn: 60275	2008-11-30 03:42:12 +00:00
Chris Lattner	3ff6d01586	Fix a fixme by making memdep's handling of allocations more logical. If we see that a load depends on the allocation of its memory with no intervening stores, we now return a 'None' depedency instead of "Normal". This tweaks GVN to do its optimization with the new result. llvm-svn: 60267	2008-11-30 01:39:32 +00:00
Chris Lattner	63bd586d35	Eliminate the dropInstruction method, which is not needed any more. Fix a subtle iterator invalidation bug I introduced in the last commit. llvm-svn: 60258	2008-11-29 23:30:39 +00:00
Chris Lattner	1c6b62eb4d	Change MemDep::getNonLocalDependency to return its results as a smallvector instead of a DenseMap. This speeds up GVN by 5% on 403.gcc. llvm-svn: 60255	2008-11-29 21:33:22 +00:00
Chris Lattner	f280b0c729	reimplement getNonLocalDependency with a simpler worklist formulation that is faster and doesn't require nonLazyHelper. Much less code. llvm-svn: 60253	2008-11-29 21:22:42 +00:00
Chris Lattner	8c5ff516c6	Fix a thinko that manifested as a crash on clamav last night. llvm-svn: 60251	2008-11-29 20:29:04 +00:00
Chris Lattner	51ba8d0630	Split getDependency into getDependency and getDependencyFrom, the former does caching, the later doesn't. This dramatically simplifies the logic in getDependency and getDependencyFrom. llvm-svn: 60234	2008-11-29 03:47:00 +00:00
Bill Wendling	469e3aa696	Temporarily revert r60195. It's causing an optimized bootstrap of llvm-gcc to fail. llvm-svn: 60233	2008-11-29 03:43:04 +00:00
Chris Lattner	7f9c8a0f05	Introduce and use a new MemDepResult class to hold the results of a memdep query. This makes it crystal clear what cases can escape from MemDep that the clients have to handle. This also gives the clients a nice simplified interface to it that is easy to poke at. This patch also makes DepResultTy and MemoryDependenceAnalysis::DepType private, yay. llvm-svn: 60231	2008-11-29 02:29:27 +00:00
Chris Lattner	de04e1173a	Reimplement the internal abstraction used by MemDep in terms of a pointer/int pair instead of a manually bitmangled pointer. This forces clients to think a little more about checking the appropriate pieces and will be useful for internal implementation improvements later. I'm not particularly happy with this. After going through this I don't think that the clients of memdep should be exposed to the internal type at all. I'll fix this in a subsequent commit. This has no functionality change. llvm-svn: 60230	2008-11-29 01:43:36 +00:00
Chris Lattner	f3f6a801cc	don't revisit instructions off the beginning of the block. llvm-svn: 60221	2008-11-28 22:50:08 +00:00
Chris Lattner	f2a8ba4cf0	simplify some code, remove escaped newline. llvm-svn: 60213	2008-11-28 21:29:52 +00:00
Chris Lattner	8a172daa55	don't call MergeBasicBlockIntoOnlyPred on a block whose only predecessor is itself. This doesn't make sense, and this is a dead infinite loop anyway. llvm-svn: 60210	2008-11-28 19:54:49 +00:00
Chris Lattner	e9f6c355bf	rewrite RecursivelyDeleteTriviallyDeadInstructions to use a more efficient formulation that doesn't require set lookups or scanning a set. llvm-svn: 60203	2008-11-28 01:20:46 +00:00
Chris Lattner	d4b5ba615e	remove some weirdness that came from the LSR code that has nothing to do with dead instruction elimination. No tests in dejagnu depend on this, so I don't know what it was needed for. llvm-svn: 60202	2008-11-28 00:58:15 +00:00
Chris Lattner	1adb6759ef	rewrite a big chunk of how DSE does recursive dead operand elimination to use more modern infrastructure. Also do a bunch of small cleanups. llvm-svn: 60201	2008-11-28 00:27:14 +00:00
Chris Lattner	8e84c129ce	delete ErasePossiblyDeadInstructionTree, replacing uses of it with RecursivelyDeleteTriviallyDeadInstructions. llvm-svn: 60196	2008-11-27 23:25:44 +00:00
Chris Lattner	c077a2a535	Simplify LoopStrengthReduce::DeleteTriviallyDeadInstructions by making it use RecursivelyDeleteTriviallyDeadInstructions to do the heavy lifting. llvm-svn: 60195	2008-11-27 23:23:35 +00:00
Chris Lattner	a1bbdff933	enhance RecursivelyDeleteTriviallyDeadInstructions to make PHIs dead if they are single-value. llvm-svn: 60194	2008-11-27 23:18:11 +00:00
Chris Lattner	1cb4f72706	Enhance RecursivelyDeleteTriviallyDeadInstructions to optionally return a list of deleted instructions. llvm-svn: 60193	2008-11-27 23:14:34 +00:00
Chris Lattner	96e2dbe008	use continue to reduce indentation llvm-svn: 60192	2008-11-27 23:00:20 +00:00
Chris Lattner	c6c481cdfc	remove doConstantPropagation and dceInstruction, they are just wrappers around the interesting code and use an obscure iterator abstraction that dates back many many years. Move EraseDeadInstructions to Transforms/Utils and name it RecursivelyDeleteTriviallyDeadInstructions. llvm-svn: 60191	2008-11-27 22:57:53 +00:00
Chris Lattner	5ef9ebf787	simplify code. llvm-svn: 60190	2008-11-27 22:56:14 +00:00
Chris Lattner	c92fa42ddd	simplify this logic. llvm-svn: 60189	2008-11-27 22:46:09 +00:00
Nick Lewycky	4ab50b93c8	Chris prefers icmp/select over udiv! llvm-svn: 60187	2008-11-27 22:41:10 +00:00
Nick Lewycky	69941fd0a0	Add a couple of missed optimizations on integer vectors. Multiply and divide by 1, as well as multiply by -1. llvm-svn: 60182	2008-11-27 20:21:08 +00:00
Chris Lattner	4059f43b74	defensive patch: if CGP is merging a block with the entry block, make sure it ends up being the entry block. llvm-svn: 60180	2008-11-27 19:29:14 +00:00
Chris Lattner	5dfbfcd80d	Fix PR3138: if we merge the entry block into another block, make sure to move the other block back up into the entry position! llvm-svn: 60179	2008-11-27 19:25:19 +00:00
Chris Lattner	e0d019def6	switch InstCombine::visitLoadInst to use FindAvailableLoadedValue llvm-svn: 60169	2008-11-27 08:56:30 +00:00
Chris Lattner	c6ae56d23f	enhance FindAvailableLoadedValue to make use of AliasAnalysis if it has it. llvm-svn: 60167	2008-11-27 08:18:12 +00:00
Chris Lattner	72f16e70f0	move FindAvailableLoadedValue from JumpThreading to Transforms/Utils. llvm-svn: 60166	2008-11-27 08:10:05 +00:00
Chris Lattner	d6204bed3d	simplify this code a bit. llvm-svn: 60164	2008-11-27 07:54:38 +00:00
Chris Lattner	206250284d	Use the new MergeBasicBlockIntoOnlyPred function. llvm-svn: 60163	2008-11-27 07:54:12 +00:00
Chris Lattner	99d6809ac1	move MergeBasicBlockIntoOnlyPred to Transforms/Utils. llvm-svn: 60162	2008-11-27 07:43:12 +00:00
Chris Lattner	240051aace	rename ThreadBlock to ProcessBlock, since it does other things than just simple threading. llvm-svn: 60157	2008-11-27 07:20:04 +00:00
Chris Lattner	98d89d1b1b	Make jump threading substantially more powerful, in the following ways: 1. Make it fold blocks separated by an unconditional branch. This enables jump threading to see a broader scope. 2. Make jump threading able to eliminate locally redundant loads when they feed the branch condition of a block. This frequently occurs due to reg2mem running. 3. Make jump threading able to eliminate partially redundant loads when they feed the branch condition of a block. This is common in code with lots of loads and stores like C++ code and 255.vortex. This implements thread-loads.ll and rdar://6402033. Per the fixme's, several pieces of this should be moved into Transforms/Utils. llvm-svn: 60148	2008-11-27 05:07:53 +00:00
Chris Lattner	397a11ccd8	Turn on my codegen prepare heuristic by default. It doesn't affect performance in most cases on the Grawp tester, but does speed some things up (like shootout/hash by 15%). This also doesn't impact compile time in a noticable way on the Grawp tester. It also, of course, gets the testcase it was designed for right :) llvm-svn: 60120	2008-11-26 22:16:44 +00:00
Chris Lattner	fef04acc50	teach the new heuristic how to handle inline asm. llvm-svn: 60088	2008-11-26 04:59:11 +00:00
Chris Lattner	6d71b7fb95	Improve ValueAlreadyLiveAtInst with a cheap and dirty, but effective heuristic: the value is already live at the new memory operation if it is used by some other instruction in the memop's block. This is cheap and simple to compute (moreso than full liveness). This improves the new heuristic even more. For example, it cuts two out of three new instructions out of 255.vortex:DbmFileInGrpHdr, which is one of the functions that the heuristic regressed. This overall eliminates another 40 instructions from 403.gcc and visibly reduces register pressure in 255.vortex (though this only actually ends up saving the 2 instructions from the whole program). llvm-svn: 60084	2008-11-26 03:20:37 +00:00
Chris Lattner	e34fe2c52d	Start rewroking a subpiece of the profitability heuristic to be phrased in terms of liveness instead of as a horrible hack. :) In pratice, this doesn't change the generated code for either 255.vortex or 403.gcc, but it could cause minor code changes in theory. This is framework for coming changes. llvm-svn: 60082	2008-11-26 03:02:41 +00:00
Chris Lattner	383a797f42	add a comment, make save/restore logic more obvious. llvm-svn: 60076	2008-11-26 02:11:11 +00:00
Chris Lattner	eb3e4fb6fb	This adds in some code (currently disabled unless you pass -enable-smarter-addr-folding to llc) that gives CGP a better cost model for when to sink computations into addressing modes. The basic observation is that sinking increases register pressure when part of the addr computation has to be available for other reasons, such as having a use that is a non-memory operation. In cases where it works, it can substantially reduce register pressure. This code is currently an overall win on 403.gcc and 255.vortex (the two things I've been looking at), but there are several things I want to do before enabling it by default: 1. This isn't doing any caching of results, so it is much slower than it could be. It currently slows down release-asserts llc by 1.7% on 176.gcc: 27.12s -> 27.60s. 2. This doesn't think about inline asm memory operands yet. 3. The cost model botches the case when the needed value is live across the computation for other reasons. I'll continue poking at this, and eventually turn it on as llcbeta. llvm-svn: 60074	2008-11-26 02:00:14 +00:00
Evan Cheng	496b042e20	Revert r60042. IndVarSimplify should check if APFloat is PPCDoubleDouble first before trying to convert it to an integer. llvm-svn: 60072	2008-11-26 01:11:57 +00:00
Chris Lattner	a9ab165b08	Teach CodeGenPrepare to look through Bitcast instructions when attempting to optimize addressing modes. This allows us to optimize things like isel-sink2.ll into: movl 4(%esp), %eax cmpb $0, 4(%eax) jne LBB1_2 ## F LBB1_1: ## TB movl $4, %eax ret LBB1_2: ## F movzbl 7(%eax), %eax ret instead of: _test: movl 4(%esp), %eax cmpb $0, 4(%eax) leal 4(%eax), %eax jne LBB1_2 ## F LBB1_1: ## TB movl $4, %eax ret LBB1_2: ## F movzbl 3(%eax), %eax ret This shrinks (e.g.) 403.gcc from 1133510 to 1128345 lines of .s. Note that the 2008-10-16-SpillerBug.ll testcase is dubious at best, I doubt it is really testing what it thinks it is. llvm-svn: 60068	2008-11-26 00:26:16 +00:00
Chris Lattner	f3e95505c5	Teach MatchScaledValue to handle Scales by 1 with MatchAddr (which can recursively match things) and scales by 0 by ignoring them. This triggers once in 403.gcc, saving 1 (!!!!) instruction in the whole huge app. llvm-svn: 60013	2008-11-25 07:25:26 +00:00
Chris Lattner	728f90220a	significantly refactor all the addressing mode matching logic into a new AddressingModeMatcher class. This makes it easier to reason about and reduces passing around of stuff, but has no functionality change. llvm-svn: 60012	2008-11-25 07:09:13 +00:00
Chris Lattner	58f49d2916	refactor all the constantexpr/instruction handling code out into a new FindMaximalLegalAddressingModeForOperation helper method. llvm-svn: 60011	2008-11-25 05:15:49 +00:00
Chris Lattner	a3fbff15b9	another minor tweak llvm-svn: 60010	2008-11-25 04:47:41 +00:00
Chris Lattner	d616ef5683	minor cleanups no functionality change. llvm-svn: 60009	2008-11-25 04:42:10 +00:00
Chris Lattner	6416a6b7a0	rearrange and tidy some code, no functionality change. llvm-svn: 59990	2008-11-24 22:44:16 +00:00
Chris Lattner	d917c8c8fe	minor cleanups to debug code, no functionality change. llvm-svn: 59989	2008-11-24 22:40:05 +00:00
Chris Lattner	d78894197a	reenable the right part of the code. llvm-svn: 59985	2008-11-24 21:26:21 +00:00
Chris Lattner	992a541002	revert an accidental commit, this fixes the regression on test/CodeGen/X86/isel-sink.ll llvm-svn: 59976	2008-11-24 19:40:34 +00:00
Chris Lattner	53d6a07869	Fix 3113: If we have a dead cyclic PHI, replace the whole thing with an undef. llvm-svn: 59972	2008-11-24 19:25:36 +00:00
Devang Patel	702f45df58	Fix build failure. llvm-svn: 59844	2008-11-21 21:00:20 +00:00
Devang Patel	cb181bb203	Silence unused variable warnings. llvm-svn: 59841	2008-11-21 20:00:59 +00:00
Chris Lattner	dd7083452f	reapply Sanjiv's patch to genericize memcpy/memset/memmove to take an arbitrary integer width for the count. llvm-svn: 59823	2008-11-21 16:42:48 +00:00
Bill Wendling	4bce2bff88	Revert r59802. It was breaking the build of llvm-gcc: g++ -m32 -c -g -DIN_GCC -W -Wall -Wwrite-strings -Wmissing-format-attribute -fno-common -mdynamic-no-pic -DHAVE_CONFIG_H -Wno-unused -DTARGET_NAME=\"i386-apple-darwin9.5.0\" -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include ../../llvm-gcc.src/gcc/llvm-types.cpp -o llvm-types.o ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemCpy(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemMove(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemSet(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i64' is not a member of 'llvm::Intrinsic' make[3]: [llvm-convert.o] Error 1 make[3]: * Waiting for unfinished jobs.... rm fsf-funding.pod gcov.pod gfdl.pod cpp.pod gpl.pod gcc.pod make[2]: * [all-stage1-gcc] Error 2 make[1]: * [stage1-bubble] Error 2 make: *** [all] Error 2 llvm-svn: 59809	2008-11-21 09:09:41 +00:00
Sanjiv Gupta	09a203765a	Make mem[cpy,move,set] intrinsics overloaded. llvm-svn: 59802	2008-11-21 07:49:09 +00:00
Nick Lewycky	07d726ec4d	Optimize (x/y)*y into x-(x%y) in general. Div and rem are about the same, and a subtract is cheaper than a multiply. This generalizes an existing transform. llvm-svn: 59800	2008-11-21 07:33:58 +00:00
Devang Patel	45f1ae028e	Fix unused variable warnings. llvm-svn: 59778	2008-11-21 01:52:59 +00:00
Bill Wendling	f5260d29c2	Fix error where it wasn't getting the correct caller function. llvm-svn: 59758	2008-11-21 00:09:21 +00:00
Bill Wendling	26c6a3e736	If the function being inlined has a higher stack protection level than the inlining function, then increase the stack protection level on the inlining function. llvm-svn: 59757	2008-11-21 00:06:32 +00:00
Devang Patel	38642e598e	Don't forget arguments! llvm-svn: 59745	2008-11-20 19:50:17 +00:00
Devang Patel	c8b2fe1eed	Do not forget llvm.dbg.declare's first argument while removing debugging information. llvm-svn: 59688	2008-11-20 01:20:42 +00:00
Oscar Fuentes	4fb443f81b	CMake: Removed source file. llvm-svn: 59662	2008-11-19 19:32:19 +00:00
Devang Patel	79303b2572	Do not use separate utility to walk all instructions and remove dead dbg intrinsics. Let instcombiner do this job. llvm-svn: 59659	2008-11-19 19:01:37 +00:00
Devang Patel	827bced2b1	Let instcombiner remove redundant dbg intrinsics. llvm-svn: 59658	2008-11-19 18:59:41 +00:00
Devang Patel	7ed6c5317c	If there are two consecutive llvm.dbg.stoppoint calls then it is likely that the optimizer deleted code in between these two intrinsics. Keep only the last llvm.dbg.stoppoint in this case. llvm-svn: 59657	2008-11-19 18:56:50 +00:00
Devang Patel	25662f3e4a	Remove unused variables. llvm-svn: 59570	2008-11-19 00:22:02 +00:00
Devang Patel	ebd2363339	Fix typo. llvm-svn: 59569	2008-11-19 00:19:18 +00:00
Devang Patel	b5e867acff	Add new helper pass that strips all symbol names except debugging information. This pass makes it easier to test wheter debugging info. influences optimization passes or not. llvm-svn: 59552	2008-11-18 21:34:39 +00:00
Devang Patel	3b7a2be88e	Remove even more llvm.dbg variables. Remove all dead globals from llvm.metadata. Ignore linkonce linkage for selected llvm.dbg values. llvm-svn: 59547	2008-11-18 21:13:41 +00:00
Devang Patel	a13f1f38fa	Initialize MallocFunc and FreeFunc properly. llvm-svn: 59538	2008-11-18 18:43:07 +00:00
Bill Wendling	cf194e9a27	Cast to remove warning about comparing signed and unsigned. llvm-svn: 59518	2008-11-18 10:57:27 +00:00
Devang Patel	f1e9329209	Give SIToFPInst preference over UIToFPInst because it is faster on platforms that are widely used. llvm-svn: 59476	2008-11-18 00:40:02 +00:00
Devang Patel	180afd2c55	While handling floating point IVs lift restrictions on initial value and increment value. llvm-svn: 59471	2008-11-17 23:27:13 +00:00
Devang Patel	aa3d68d301	Handle floating point ivs during doInitialization(). llvm-svn: 59466	2008-11-17 21:32:02 +00:00
Devang Patel	b63c74730c	Let AnalyzeAlloca() remove debug intrinsics. llvm-svn: 59454	2008-11-17 18:37:53 +00:00
Torok Edwin	026259faeb	If SI->size() is 0, we are not allowed to dereference ->begin(). This fixed PR3078. llvm-svn: 59416	2008-11-16 17:21:25 +00:00
Chris Lattner	7917b43a28	eliminate some std::set's. llvm-svn: 59409	2008-11-16 07:17:51 +00:00
Chris Lattner	f8f6270f14	simplify loop llvm-svn: 59406	2008-11-16 06:35:18 +00:00
Chris Lattner	44152742a0	simplify a bunch more instcombines to use m_Specific etc. llvm-svn: 59403	2008-11-16 05:38:51 +00:00
Chris Lattner	d397fef50d	factor the code for simplifying (icmp)\|(icmp) into its own function. llvm-svn: 59402	2008-11-16 05:20:07 +00:00
Chris Lattner	909b969b18	do some computation with apints instead of ConstantInts. llvm-svn: 59401	2008-11-16 05:14:43 +00:00
Chris Lattner	feaea9bdf7	merge a check into a place where it is simpler. llvm-svn: 59400	2008-11-16 05:10:52 +00:00
Chris Lattner	269cbd5770	factor a whole bunch of code out into a helper function. llvm-svn: 59398	2008-11-16 05:06:21 +00:00
Chris Lattner	b37b6e7e96	simplify the conditions on two gigantic if's, decreasing indentation a bit. Next step is to factor out into their own helper functions. llvm-svn: 59397	2008-11-16 04:55:20 +00:00
Chris Lattner	f1be285134	simplify some instcombine matches by using m_Specific llvm-svn: 59395	2008-11-16 04:46:19 +00:00
Chris Lattner	fae5e33111	Use new m_SelectCst template to eliminate macros. llvm-svn: 59392	2008-11-16 04:33:38 +00:00
Chris Lattner	569d78cbb5	simplify code. llvm-svn: 59390	2008-11-16 04:26:55 +00:00
Chris Lattner	c3f3b059d0	Handle the case where there is no "not". It is possible it got folded into the select. llvm-svn: 59389	2008-11-16 04:25:26 +00:00
Chris Lattner	5f6d9a313b	factor a bunch of copy/paste code out into a helper function. Eliminate the cases checking for cond?0:-1, since that is already handled by commutative checking. llvm-svn: 59388	2008-11-16 04:24:12 +00:00
Chris Lattner	68d2da2a19	rearrange some code, no functionality change. llvm-svn: 59381	2008-11-16 03:56:24 +00:00
Chris Lattner	e02c7c7ad2	if we're going to use a macro, use it maximally. no functionality change. llvm-svn: 59380	2008-11-16 03:54:57 +00:00
Devang Patel	8ada1d5de5	Refactor code. Strip debug information before stripping symbol names. llvm-svn: 59328	2008-11-14 22:49:37 +00:00
Devang Patel	3dd51c5c62	Really remove all debug information. llvm-svn: 59208	2008-11-13 01:28:40 +00:00
Oscar Fuentes	1b504d5372	CMake: Remove removed source file. llvm-svn: 59098	2008-11-12 00:14:12 +00:00
Devang Patel	4f02a0b740	Remove llvm-svn: 59093	2008-11-11 23:58:15 +00:00
Devang Patel	bf0835706c	Undo previous check-in. llvm-svn: 59092	2008-11-11 23:57:33 +00:00
Oscar Fuentes	2353ef3e91	CMake: Updated list of source files for lib/Transforms/Utils. llvm-svn: 59077	2008-11-11 19:51:36 +00:00
Devang Patel	6096f26bd4	Add utility pass to remove dbg info. llvm-svn: 59068	2008-11-11 19:33:39 +00:00
Devang Patel	95b18126ee	Use actual function name in comments. llvm-svn: 59063	2008-11-11 19:16:41 +00:00
Cedric Venet	8cb2e28e43	Update CMakeLists.txt llvm-svn: 59039	2008-11-11 09:55:48 +00:00
Devang Patel	53b39b5467	Cleanup debug info. assocated with deleted instructions. llvm-svn: 59012	2008-11-11 00:54:10 +00:00
Devang Patel	dc6699e82f	Add utility routines to remove dead debug info. llvm-svn: 59011	2008-11-11 00:53:02 +00:00
Devang Patel	d0ce981372	If the sign of exit condition and split condition does not match then do not split loop index. llvm-svn: 58995	2008-11-10 19:48:34 +00:00
Bill Wendling	7ef7314d1a	Third time's a charm. The previous patches didn't match correctly. Also, we need to make sure that the conditional is the same before doing the transformation. llvm-svn: 58978	2008-11-10 06:59:06 +00:00
Mon P Wang	25f0106fd9	Added support for the following definition of shufflevector <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> llvm-svn: 58964	2008-11-10 04:46:22 +00:00
Bill Wendling	4fb13c051d	Correction for the last patch. Should match the conditional in the first part of the select match, not the select instruction itself. llvm-svn: 58947	2008-11-09 23:37:53 +00:00
Bill Wendling	1579287550	The method of doing the matching with a 'select' instruction was wrong. The original code was matching like this: if (match(A, m_Not(m_Value(B)))) B was already matched as a 'select' instruction. However, this isn't matching what we think it's matching. It would match B as a 'Value', so basically anything would match to it. In this case, a Constant matched. B was replaced with a constant representation. And then the wrong value would be used in the SelectInst::Create statement, causing a crash. After thinking on this for a moment, and after Nick L. told me how the pattern matching stuff was supposed to work, the solution was to match NOT an m_Value, but an m_Select. llvm-svn: 58946	2008-11-09 23:17:42 +00:00
Nuno Lopes	2e42927e7c	fix leakage of ValueNumbering llvm-svn: 58933	2008-11-09 12:45:23 +00:00
Bill Wendling	3f547be28f	If the LHS of the FCMP is coming from a UIToFP instruction, then we don't want to generate signed ICMP instructions to replace the FCMP. This would violate the following: define i1 @test1(i32 %val) { %1 = uitofp i32 %val to double %2 = fcmp ole double %1, 0.000000e+00 ret i1 %2 } would be transformed into: define i1 @test1(i32 %val) { %1 = icmp slt i33 %val, 1 ret i1 %1 } which is obviously wrong. This patch modifes InstCombiner::FoldFCmp_IntToFP_Cst to handle when the LHS comes from UIToFP. llvm-svn: 58929	2008-11-09 04:26:50 +00:00
Daniel Dunbar	2b9dce2669	Rework r58829, allowing removal of dbg info intrinsics during alloca promotion. - Eliminate uses after free and simplify tests. Devang: Please check that this is still doing what you intended. llvm-svn: 58887	2008-11-08 04:12:17 +00:00
Bill Wendling	b9656df4ac	BCUI + 1 doesn't work. Use next instead. llvm-svn: 58830	2008-11-07 01:59:41 +00:00
Devang Patel	b8e0d59ceb	Handle (delete) dbg intrinsics while promoting alloca. llvm-svn: 58826	2008-11-07 01:30:07 +00:00
Mon P Wang	5ca2ec65bd	Fixed scalarizing an extract subvector and prevent an infinite loop when simplify a vector. llvm-svn: 58820	2008-11-06 22:52:21 +00:00
Devang Patel	5a5ab730e0	InstructionNamer preserves everything. llvm-svn: 58787	2008-11-06 01:00:16 +00:00
Devang Patel	f0ef35738c	Do now allow InlineAlways pass to remove dead functions. llvm-svn: 58744	2008-11-05 01:39:16 +00:00
Devang Patel	7a848b0ee3	Check Attribute::NoInline. llvm-svn: 58742	2008-11-05 01:37:05 +00:00
Oscar Fuentes	076e048cf7	CMake: updated list of source files. llvm-svn: 58736	2008-11-05 00:11:22 +00:00
Dan Gohman	8cdea717a3	Add a new pass to simplify specific half_powr function calls. This is a specialized pass that it not likely to be generally useful. llvm-svn: 58732	2008-11-04 23:41:45 +00:00
Dale Johannesen	0a7b4f5800	Allow SROA of vectors. Removing this caused a huge performance regression in something we care about. This may not be final fix. llvm-svn: 58718	2008-11-04 20:54:03 +00:00
Devang Patel	f33f8a8606	Fix unused variable warnings. llvm-svn: 58651	2008-11-03 23:14:09 +00:00
Devang Patel	fe57d109b6	Ignore conditions that are outside the loop. llvm-svn: 58631	2008-11-03 19:38:07 +00:00
Andrew Lenharth	348f3fa6a7	add a period at the end of the comment, ignoring the fact that the comment would be hard pressed to be considered a sentence, but if it makes Bill happy... llvm-svn: 58630	2008-11-03 19:29:29 +00:00
Devang Patel	c1631db93b	Turn floating point IVs into integer IVs where possible. This allows SCEV users to effectively calculate trip count. LSR later on transforms back integer IVs to floating point IVs later on to avoid int-to-float casts inside the loop. llvm-svn: 58625	2008-11-03 18:32:19 +00:00
Andrew Lenharth	45b86322f2	Ensure that we are checking only calls to the function we are interested in specializing llvm-svn: 58615	2008-11-03 16:05:35 +00:00
Nick Lewycky	d73806a9cc	Replace explicit loop with utility function. llvm-svn: 58593	2008-11-03 03:49:14 +00:00
Nick Lewycky	3c6d34a7f0	Changes from Duncan's review: * merge two weak functions by making them both alias a third non-weak fn * don't reimplement CallSite::hasArgument * whitelist the safe linkage types llvm-svn: 58568	2008-11-02 16:46:26 +00:00
Duncan Sands	cede1e035c	Get this building on 64 bit machines (error: cast from ‘const llvm::PointerType*’ to ‘unsigned int’ loses precision). llvm-svn: 58561	2008-11-02 09:00:33 +00:00
Oscar Fuentes	0433be6feb	CMake: added a source file. llvm-svn: 58559	2008-11-02 06:01:39 +00:00
Nick Lewycky	d01d42e76c	Add a new MergeFunctions pass. It finds identical functions and merges them. This triggers only 60 times in llvm-test (look at .llvm.bc, not .linked.rbc) and so it probably wont be turned on by default. Also, may of those are likely to go away when PR2973 is fixed. llvm-svn: 58557	2008-11-02 05:52:50 +00:00
Nick Lewycky	8d8acf327b	Fix demanded bits analysis with srem by negative number. Based on a patch by Richard Osborne. llvm-svn: 58555	2008-11-02 02:41:50 +00:00
Dan Gohman	83eea0b17f	Fix this recently moved code to use the correct type. CI is now a ConstantInt, and SI is the original cast instruction. This fixes PR2996. llvm-svn: 58549	2008-11-02 00:17:33 +00:00
Daniel Dunbar	a1c4fcfc29	Fix warning. llvm-svn: 58486	2008-10-31 01:50:01 +00:00
Dan Gohman	13cbcf1c18	Canonicalize sext(i1) to i1?-1:0, and update various instcombine optimizations accordingly. llvm-svn: 58457	2008-10-30 20:40:10 +00:00
Daniel Dunbar	3933e66a89	Add InlineCost class for represent the estimated cost of inlining a function. - This explicitly models the costs for functions which should "always" or "never" be inlined. This fixes bugs where such costs were not previously respected. llvm-svn: 58450	2008-10-30 19:26:59 +00:00
Chris Lattner	0934c0f35b	Fix PR2967 by not deleting volatile load/stores that occur before unreachable. I don't really see this as being needed, but there is little harm from doing it. llvm-svn: 58385	2008-10-29 17:46:26 +00:00
Daniel Dunbar	e7fbf9f425	Factor shouldInline method out of Inliner. - No functionality change. llvm-svn: 58355	2008-10-29 01:02:02 +00:00
Daniel Dunbar	cc20455346	Assorted comment/naming fixes, 80-col violations, and reindentation. - No functionality change. llvm-svn: 58352	2008-10-28 23:24:26 +00:00
Dan Gohman	2c34c130bf	(A & sext(C)) \| (B & ~sext(C) -> C ? A : B llvm-svn: 58351	2008-10-28 22:38:57 +00:00
Torok Edwin	ca97b42ef7	export an ID for the instructionNamer, allowing analysis/transformation passes that need it to require it by ID. llvm-svn: 58238	2008-10-27 10:16:27 +00:00
Chris Lattner	59b5691388	Rewrite all the 'PromoteLocallyUsedAlloca[s]' logic. With the power of LargeBlockInfo, we can now dramatically simplify their implementation and speed them up at the same time. Now the code has time proportional to the number of uses of the alloca, not the size of the block. This also eliminates code that tried to batch up different allocas which are used in the same blocks, and eliminates the 'retry list' logic which was baroque and no unneccesary. In addition to being a speedup for crazy cases, this is also a nice cleanup: PromoteMemoryToRegister.cpp \| 270 +++++++++++++++----------------------------- 1 file changed, 96 insertions(+), 174 deletions(-) llvm-svn: 58229	2008-10-27 07:05:53 +00:00
Chris Lattner	f594ecc453	Add a new LargeBlockInfo helper, which is just a wrapper around a trivial dense map. Use this in RewriteSingleStoreAlloca to avoid aggressively rescanning blocks over and over again. This fixes PR2925, speeding up mem2reg on the testcase in that bug from 4.56s to 0.02s in a debug build on my machine. llvm-svn: 58227	2008-10-27 06:05:26 +00:00
Nick Lewycky	f6e4dca67e	Add value range analyzing of Add and Sub. Understand that mul %x, 1 = %x. llvm-svn: 58069	2008-10-24 04:00:26 +00:00
Daniel Dunbar	7f39e2d85a	Change createPass factory functions to return Pass instead of LoopPass*. - Although less precise, this means they can be used in clients without RTTI (who would otherwise need to include LoopPass.h, which eventually includes things using dynamic_cast). This was the simplest solution that presented itself, but I am happy to use a better one if available. llvm-svn: 58010	2008-10-22 23:32:42 +00:00
Dan Gohman	72e66eedb8	Use Function::getEntryBlock() instead of Function::front(), for clarity. llvm-svn: 57870	2008-10-21 03:10:28 +00:00
Dan Gohman	fa29b67aee	Fix a bug that prevented llvm-extract -delete from working. llvm-svn: 57864	2008-10-21 01:08:07 +00:00
Dan Gohman	215742a966	Use 0 instead of false to return a null pointer. llvm-svn: 57660	2008-10-17 00:56:52 +00:00
Dan Gohman	bc0278400c	Teach instcombine's visitLoad to scan back several instructions to find opportunities for store-to-load forwarding or load CSE, in the same way that visitStore scans back to do DSE. Also, define a new helper function for testing whether the addresses of two memory accesses are known to have the same value, and use it in both visitStore and visitLoad. These two changes allow instcombine to eliminate loads in code produced by front-ends that frequently emit obviously redundant addressing for memory references. llvm-svn: 57608	2008-10-15 23:19:35 +00:00
Evan Cheng	d885f6e139	Combine (fcmp cc0 x, y) \| (fcmp cc1 x, y) into a single fcmp when possible. llvm-svn: 57515	2008-10-14 18:44:08 +00:00
Evan Cheng	ce70752b11	- Somehow I forgot about one / une. - Renumber fcmp predicates to match their icmp counterparts. - Try swapping operands to expose more optimization opportunities. llvm-svn: 57513	2008-10-14 18:13:38 +00:00
Evan Cheng	67786cce66	Optimize anding of two fcmp into a single fcmp if the operands are the same. e.g. uno && ueq -> ueq ord && olt -> olt ord && ueq -> oeq llvm-svn: 57507	2008-10-14 17:15:11 +00:00
Matthijs Kooijman	f7d3cb5435	Make InstructionCombining::getBitCastOperand() recognize GEP instructions and constant expression with all zero indices as being the same as a bitcast. llvm-svn: 57442	2008-10-13 15:17:01 +00:00
Chris Lattner	da435910e8	Fix PR2697 by rewriting the '(X / pos) op neg' logic. This also changes a couple other cases for clarity, but shouldn't affect correctness. Patch by Eli Friedman! llvm-svn: 57387	2008-10-11 22:55:00 +00:00
Devang Patel	647a1e532b	Check loop exit predicate properly while eliminating one iteration loop. This patch fixes PR 2869 llvm-svn: 57369	2008-10-10 22:02:57 +00:00
Nuno Lopes	e3127f3f80	fix memleak by cleaning the global sets on pass exit llvm-svn: 57353	2008-10-10 16:25:50 +00:00
Dale Johannesen	4f0bd68cfe	Add a "loses information" return value to APFloat::convert and APFloat::convertToInteger. Restore return value to IEEE754. Adjust all users accordingly. llvm-svn: 57329	2008-10-09 23:00:39 +00:00
Nick Lewycky	03c5fa18f1	Don't drop alignment on globals when cloning. llvm-svn: 57320	2008-10-09 06:27:14 +00:00
Nuno Lopes	06c67f88d7	dont specialize weak functions and the like llvm-svn: 57305	2008-10-08 18:45:59 +00:00
Duncan Sands	26ff6f9c54	Add <cstdio> include where needed by gcc-4.4. Patch by Samuel Tardieu. llvm-svn: 57291	2008-10-08 07:23:46 +00:00
Chris Lattner	42d5785dbd	Add parentheses to avoid warnings in GCC 4.4.0, patch by Samuel Tardieu! llvm-svn: 57288	2008-10-08 06:42:28 +00:00
Andrew Lenharth	5aa1cc4065	Correctly set attributes when removing args during cloning. Fixes PR2765 llvm-svn: 57254	2008-10-07 18:08:38 +00:00
Devang Patel	40aafce00d	Fix typo, fix PR 2865. llvm-svn: 57221	2008-10-06 23:22:54 +00:00
Matthijs Kooijman	cbe5e16eb5	Allow scalarrepl to treat an all-zero GEP just as bitcast. This includes not marking a GEP involving a vector as unsafe, but only when it has all zero indices. This allows scalarrepl to work in a few more cases. llvm-svn: 57177	2008-10-06 16:23:31 +00:00
Chris Lattner	917a6c1343	rewrite bswap matching to be more general, allowing arbitrary shifting and masking inside a bswap expr. This allows it to handle the cases from PR2842, which involve the intermediate 'or' expressions being shifted, not just the input value. llvm-svn: 57095	2008-10-05 02:13:19 +00:00
Chris Lattner	ca91f265c4	fix a bug where the bswap matcher could match a case involving ashr. It should only apply to lshr. llvm-svn: 57089	2008-10-05 00:50:57 +00:00
Duncan Sands	1d35e9aebe	Ignore loads from and stores to local memory (i.e. allocas) when deciding whether to mark a function readnone/readonly. Since the pass is currently run before SROA, this may be quite helpful. Requested by Chris on IRC. llvm-svn: 57050	2008-10-04 13:24:24 +00:00
Dan Gohman	e21903987f	Clean up some multiple-return-value code that is no longer applicable. llvm-svn: 57033	2008-10-03 22:21:24 +00:00
Devang Patel	f963403b58	Nick Lewycky's patch. While hosting instruction check PHI node. llvm-svn: 57025	2008-10-03 18:57:37 +00:00
Duncan Sands	3a813a5d3f	Teach internalize to preserve the callgraph. Why? Because it was there! llvm-svn: 56996	2008-10-03 07:36:09 +00:00
Owen Anderson	cb4f156b6b	SplitBlock should only attempt to update LoopInfo if it is actually being used. llvm-svn: 56994	2008-10-03 06:55:35 +00:00
Duncan Sands	d65a4daeea	Factorize code: remove variants of "strip off pointer bitcasts and GEP's", and centralize the logic in Value::getUnderlyingObject. The difference with stripPointerCasts is that stripPointerCasts only strips GEPs if all indices are zero, while getUnderlyingObject strips GEPs no matter what the indices are. llvm-svn: 56922	2008-10-01 15:25:41 +00:00
Nuno Lopes	96740aad86	revert the addition of Preverves(CallGraph), per Duncan's comments llvm-svn: 56917	2008-10-01 09:13:40 +00:00
Dan Gohman	67d90de2b0	Call ScalarEvolution's deleteValueFromRecords before deleting an instruction, not after. This fixes some uses of free'd memory. llvm-svn: 56908	2008-10-01 02:02:03 +00:00
Nuno Lopes	5093ab4c76	add preserversCFG() + preservers(CallGraph) llvm-svn: 56887	2008-09-30 22:04:30 +00:00
Nuno Lopes	2bd7b24f1a	add AU.setPreservesCFG() since this pass only adds and removes function attributes llvm-svn: 56868	2008-09-30 18:34:38 +00:00
Nick Lewycky	e8ced3ec19	Fix misoptimization of: xor i1 (icmp eq (X, C1), icmp s[lg]t (X, C2)) llvm-svn: 56834	2008-09-30 06:08:34 +00:00
Duncan Sands	57512a1be4	Speed up these passes when the callgraph has huge simply connected components. Suggested by Chris. llvm-svn: 56787	2008-09-29 14:59:04 +00:00
Nuno Lopes	ffc9da6772	remove redundant test (mayBeOverriden() includes hasLinkOnceLinkage) llvm-svn: 56786	2008-09-29 14:40:32 +00:00
Duncan Sands	e340e18783	Tweak some comments. llvm-svn: 56784	2008-09-29 13:35:31 +00:00
Duncan Sands	08d91178e9	Rename isWeakForLinker to mayBeOverridden. Use it instead of hasWeakLinkage in a bunch of optimization passes. llvm-svn: 56782	2008-09-29 11:25:42 +00:00
Devang Patel	9eb525d4f9	Implement function notes as function attributes. llvm-svn: 56716	2008-09-26 23:51:19 +00:00
Devang Patel	a05633e105	Now Attributes are divided in three groups - return attributes - inreg, zext and sext - parameter attributes - function attributes - nounwind, readonly, readnone, noreturn Return attributes use 0 as the index. Function attributes use ~0U as the index. This patch requires corresponding changes in llvm-gcc and clang. llvm-svn: 56704	2008-09-26 22:53:05 +00:00
Devang Patel	4c758ea3e0	Large mechanical patch. s/ParamAttr/Attribute/g s/PAList/AttrList/g s/FnAttributeWithIndex/AttributeWithIndex/g s/FnAttr/Attribute/g This sets the stage - to implement function notes as function attributes and - to distinguish between function attributes and return value attributes. This requires corresponding changes in llvm-gcc and clang. llvm-svn: 56622	2008-09-25 21:00:45 +00:00
Evan Cheng	25dd4a2daf	Commit CodeGenPrepare.cpp changes which was accidentially left out of 56526. llvm-svn: 56549	2008-09-24 06:48:55 +00:00
Eric Christopher	c1ea149dcd	Fix fallout in CodeGenPrepare from 56526. Will likely need more work. llvm-svn: 56546	2008-09-24 05:32:41 +00:00
Devang Patel	6402c7236f	s/ParamAttrsWithIndex/FnAttributeWithIndex/g llvm-svn: 56535	2008-09-24 00:55:02 +00:00
Devang Patel	e15607b7bb	Put FN_NOTE_AlwaysInline and others in FnAttr namespace. llvm-svn: 56527	2008-09-24 00:06:15 +00:00
Devang Patel	e87abd26ba	Move FN_NOTE_AlwaysInline and other out of ParamAttrs namespace. Do not check isDeclaration() in hasNote(). It is clients' responsibility. llvm-svn: 56524	2008-09-23 23:52:03 +00:00
Devang Patel	ba3fa6c6e1	s/ParameterAttributes/Attributes/g llvm-svn: 56513	2008-09-23 23:03:40 +00:00
Devang Patel	82fed6702b	Use parameter attribute store (soon to be renamed) for Function Notes also. Function notes are stored at index ~0. llvm-svn: 56511	2008-09-23 22:35:17 +00:00
Devang Patel	329fe728b5	Add hasNote() to check note associated with a function. llvm-svn: 56477	2008-09-22 22:32:29 +00:00
Oscar Fuentes	a229b3c9a7	Initial support for the CMake build system. llvm-svn: 56419	2008-09-22 01:08:49 +00:00
Duncan Sands	e1dc84be64	Implement review feedback from Devang: make use of mayReadFromMemory and mayWriteToMemory. llvm-svn: 56387	2008-09-20 16:45:58 +00:00
Duncan Sands	310077034a	Remove the MarkModRef pass (use AddReadAttrs instead). Unfortunately this means removing one regression test of GlobalsModRef because I couldn't work out how to perform it without MarkModRef. llvm-svn: 56342	2008-09-19 08:23:44 +00:00
Duncan Sands	af25ee7ffc	Add a new pass AddReadAttrs which works out which functions can get the readnone/readonly attributes, and gives them it. The plan is to remove markmodref (which did the same thing by querying GlobalsModRef) and delete the analogous functionality from GlobalsModRef. llvm-svn: 56341	2008-09-19 08:17:05 +00:00
Devang Patel	c25be3b2de	splitLoop does not handle split condition EQ. Fixes PR 2805 llvm-svn: 56321	2008-09-18 23:45:14 +00:00
Bill Wendling	a00fa322b1	Decrementing the iterator here could be wrong if the worklist is empty after the "erase". Thanks to Ji Young Park for the patch! llvm-svn: 56316	2008-09-18 23:04:18 +00:00
Devang Patel	76b22c1420	Try to place hoisted instructions befoe icmp instruction. llvm-svn: 56315	2008-09-18 22:50:42 +00:00
Devang Patel	7f9671ba37	Do not hoist instruction above branch condition. The instruction may use branch condition. llvm-svn: 56286	2008-09-17 18:21:49 +00:00
Devang Patel	dca8d3b183	Do not ignore iv uses outside the loop. This one slipped through cracks very well. llvm-svn: 56284	2008-09-17 17:53:47 +00:00
Dan Gohman	dafa9c6e85	Improve instcombine's handling of integer min and max in two ways: - Recognize expressions like "x > -1 ? x : 0" as min/max and turn them into expressions like "x < 0 ? 0 : x", which is easily recognizable as a min/max operation. - Refrain from folding expression like "y/2 < 1" to "y < 2" when the comparison is being used as part of a min or max idiom, like "y/2 < 1 ? 1 : y/2". In that case, the division has another use, so folding doesn't eliminate it, and obfuscates the min/max, making it harder to recognize as a min/max operation. These benefit ScalarEvolution, CodeGen, and anything else that wants to recognize integer min and max. llvm-svn: 56246	2008-09-16 18:46:06 +00:00
Dan Gohman	68e7735a38	Teach LSR to optimize away SMAX operations for tripcounts in common cases. See the comment above OptimizeSMax for the full story, and the testcase for an example. This cancels out a pessimization commonly attributed to indvars, and will allow us to lift some of the artificial throttles in indvars, rather than add new ones. llvm-svn: 56230	2008-09-15 21:22:06 +00:00
Dan Gohman	eff71f2953	On 64-bit targets, change 32-bit getelementptr indices to be 64-bit getelementptr indices, inserting an explicit cast if necessary. This helps expose the sign-extension operation to other optimizations. llvm-svn: 56133	2008-09-11 23:06:38 +00:00
Dan Gohman	7d01c0654c	Fix a vectorshuffle instcombine bug introduced by r55995. Patch by Nicolas Capens! llvm-svn: 56129	2008-09-11 22:47:57 +00:00
Dan Gohman	9b9d547a5c	Fix a copy+paste bug that Duncan spotted. For several cases it was still getting lucky and detecting overflow but it was clearly incorrect. llvm-svn: 56113	2008-09-11 18:53:02 +00:00
Dan Gohman	9d9a4be588	In my analysis for r56076 I missed the case where the original multiplication overflows. llvm-svn: 56082	2008-09-11 00:25:00 +00:00
Dan Gohman	c1ae01688f	Fix an icmp+sdiv optimization to check for and handle an overflow condition. This fixes PR2740. llvm-svn: 56076	2008-09-10 23:30:57 +00:00
Devang Patel	728c44ab56	fix white spaces. llvm-svn: 56056	2008-09-10 14:49:55 +00:00
Dan Gohman	97f0a0f28d	Fix a warning about comparing signed and unsigned values. llvm-svn: 56040	2008-09-10 01:09:32 +00:00
Devang Patel	92b032f3e6	if loop induction variable is always sign or zero extended then extend the type of induction variable. llvm-svn: 56017	2008-09-09 21:41:07 +00:00
Devang Patel	92c5367705	fix overflow check. llvm-svn: 56011	2008-09-09 20:54:34 +00:00
Anton Korobeynikov	1a1140429e	Make safer variant of alias resolution routine to be default llvm-svn: 56005	2008-09-09 20:05:04 +00:00
Anton Korobeynikov	a9b60ee0fc	Resolve aliases, when possible llvm-svn: 56001	2008-09-09 19:04:59 +00:00
Dan Gohman	86fb5b48de	Make SimplifyDemandedVectorElts simplify vectors with multiple users, and teach it about shufflevector instructions. Also, fix a subtle bug in SimplifyDemandedVectorElts' insertelement code. This is a patch that was originally written by Eli Friedman, with some fixes and cleanup by me. llvm-svn: 55995	2008-09-09 18:11:14 +00:00
Devang Patel	0f7a3507cf	Fix simplifycfg crash in handing block merge. llvm-svn: 55971	2008-09-09 01:06:56 +00:00
Devang Patel	3d56051f70	s/RemoveUnreachableBlocks/RemoveUnreachableBlocksFromFn/g llvm-svn: 55965	2008-09-08 22:14:17 +00:00
Devang Patel	7518f250b9	Remove unused counter. llvm-svn: 55924	2008-09-08 17:14:54 +00:00
Devang Patel	538a7f479a	Remove OptimizeIVType() llvm-svn: 55913	2008-09-08 16:13:27 +00:00
Duncan Sands	b9a6f861b4	Update the callgraph correctly. llvm-svn: 55896	2008-09-08 11:08:09 +00:00
Duncan Sands	3cf7d86556	Update the callgraph correctly in ArgumentPromotion. llvm-svn: 55895	2008-09-08 11:07:35 +00:00
Duncan Sands	46911f1271	Reapply 55859. This doesn't change anything as long as the callgraph is correct. It checks for wrong callgraphs more strictly. llvm-svn: 55894	2008-09-08 11:05:51 +00:00
Duncan Sands	1ea0d2e6db	Correct a comment and strip trailing whitespace. llvm-svn: 55883	2008-09-07 09:54:09 +00:00
Nuno Lopes	421f488cb7	fix crash when the malloc/free function is defined or is a declaration with 0 parameters. this pass doesnt seem to be used, but still it's now a little more correct llvm-svn: 55873	2008-09-06 17:44:06 +00:00
Duncan Sands	95c2a7848a	When PruneEH turned an invoke into an ordinary call (thus changing the call site) it didn't inform the callgraph about this. But the call site does matter - as shown by the testcase, the callgraph become invalid after the inliner ran (with an edge between two functions simply missing), resulting in wrong deductions by GlobalsModRef. llvm-svn: 55872	2008-09-06 17:19:29 +00:00
Owen Anderson	1dd2e40521	Revert r55859. This is breaking the build in the abscence of its companion commit. llvm-svn: 55865	2008-09-05 23:36:01 +00:00
Devang Patel	d94269f906	Remove unused map. llvm-svn: 55861	2008-09-05 21:55:33 +00:00
Duncan Sands	9e23602849	Delete the removeCallEdgeTo callgraph method, because it does not maintain a correct list of callsites. I discovered (see following commit) that the inliner will create a wrong callgraph if it is fed a callgraph with correct edges but incorrect callsites. These were created by Prune-EH, and while it wasn't done via removeCallEdgeTo, it could have been done via removeCallEdgeTo, which is an accident waiting to happen. Use removeCallEdgeFor instead. llvm-svn: 55859	2008-09-05 21:43:04 +00:00
Duncan Sands	3a52056d4d	Use removeAllCalledFunctions rather than removing edges one by one by hand. llvm-svn: 55836	2008-09-05 14:56:53 +00:00
Duncan Sands	7c8fb1ad93	Remove trailing whitespace. llvm-svn: 55835	2008-09-05 12:37:12 +00:00
Duncan Sands	6dd02b5219	Make this pass return that it made a change if it modifies a functions attributes. llvm-svn: 55831	2008-09-05 09:08:37 +00:00
Devang Patel	40519f0370	A loop may be unswitched multiple times. Reconstruct dom info. at the end. llvm-svn: 55806	2008-09-04 22:43:59 +00:00
Devang Patel	00ec74616b	Initialize loop data first. llvm-svn: 55792	2008-09-04 20:36:36 +00:00
Devang Patel	d52071540c	Do not unswitch if the function notes say we're optimizing this function for size. llvm-svn: 55786	2008-09-04 18:55:13 +00:00
Andrew Lenharth	19fb2aba50	try to seperate the mechanism into something others can use llvm-svn: 55785	2008-09-04 18:51:26 +00:00
Dale Johannesen	fe1bb7964c	Add intrinsic forms of pow and exp2. The non-intrinsic forms remain to handle older IR files, but will go away soon. llvm-svn: 55781	2008-09-04 18:30:46 +00:00
Dan Gohman	a79db30d28	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Andrew Lenharth	95d573a7f0	cleanup as per Duncan's review llvm-svn: 55766	2008-09-04 14:34:22 +00:00
Devang Patel	a26e2075b8	Update inline threshold for current function if the notes say, optimize for size. llvm-svn: 55745	2008-09-03 23:06:09 +00:00
Owen Anderson	2fbfb70530	Fix a bug that prevented PRE from applying in some cases. llvm-svn: 55744	2008-09-03 23:06:07 +00:00
Andrew Lenharth	9fed8f5b9c	Initial version of a Partial Specialization IPO pass. It triggers a couple hundred times on 176.gcc. I don't know the performance impact yet, the heuristic is quite simple still. llvm-svn: 55734	2008-09-03 21:00:28 +00:00
Devang Patel	a563d24e5d	Fix typo in a comment. llvm-svn: 55720	2008-09-03 20:25:40 +00:00
Devang Patel	a4211876e5	Add parentheses to make code more readable. llvm-svn: 55717	2008-09-03 19:57:15 +00:00
Devang Patel	50c66cdb0d	Fix comments. llvm-svn: 55716	2008-09-03 19:52:17 +00:00
Devang Patel	924d9084d8	Add custom inliner that handles only functions that are marked as always_inline. llvm-svn: 55713	2008-09-03 18:50:53 +00:00
Devang Patel	0d442ffa2b	Handle "always inline" note during inline cost analysis. llvm-svn: 55712	2008-09-03 18:47:45 +00:00
Devang Patel	79661994b1	Check noinline note and ignore other notes. llvm-svn: 55711	2008-09-03 18:46:35 +00:00
Devang Patel	62be9ad270	Handle "noinline" note inside the simple inliner. llvm-svn: 55708	2008-09-03 18:10:21 +00:00
Nick Lewycky	2fcb26cc75	Don't apply this transform to vectors. Fixes PR2756. llvm-svn: 55690	2008-09-03 06:24:21 +00:00
Devang Patel	bcd39345de	Add additional check to ensure that iv is canonicalized. llvm-svn: 55682	2008-09-03 00:29:13 +00:00
Devang Patel	b530f08122	Check iteration count. llvm-svn: 55680	2008-09-03 00:10:56 +00:00
Devang Patel	81fed043c5	While removing PHI, use basicblock to identify incoming value. llvm-svn: 55678	2008-09-03 00:02:42 +00:00
Devang Patel	7e59270272	s/FP_AlwaysInline/FN_NOTE_AlwaysInline/g llvm-svn: 55676	2008-09-02 22:43:57 +00:00
Devang Patel	43c5a52e07	If all IV uses are extending integer IV then change the type of IV itself, if possible. llvm-svn: 55674	2008-09-02 22:18:08 +00:00
Devang Patel	bfa535af9f	respect inline=never and inline=always notes. llvm-svn: 55673	2008-09-02 22:16:13 +00:00
Duncan Sands	130d9efec3	Add a small pass that sets the readnone/readonly attributes on functions, based on the result of alias analysis. It's not hardwired to use GlobalsModRef even though this is the only (AFAIK) alias analysis that results in this pass actually doing something. Enable as follows: opt ... -globalsmodref-aa -markmodref ... Advantages of this pass: (1) records the result of globalsmodref in the bitcode, meaning it is available for use by later passes (currently the pass manager isn't smart enough to magically make an advanced alias analysis available to all later passes), which may expose more optimization opportunities; (2) hopefully speeds up compilation when code is optimized twice, for example when a file is compiled to bitcode, then later LTO is done on it: marking functions readonly/readnone when producing the initial bitcode should speed up alias analysis during LTO; (3) good for discovering that globalsmodref doesn't work very well :) Not currently turned on by default. llvm-svn: 55604	2008-09-01 11:40:11 +00:00
Devang Patel	d6adbb6a0f	Do not apply the transformation if the target does not support DestTy natively. llvm-svn: 55433	2008-08-27 20:55:23 +00:00
Devang Patel	cf7ca5d0ba	Fix typos and whitespaces. Other cosmetic changes based on feedback. llvm-svn: 55424	2008-08-27 17:50:18 +00:00
Owen Anderson	b39e0decf8	Put a heuristic in place to prevent GVN from falling into bad cases with massively complicated CFGs. This speeds up a particular testcase from 12+ hours to 5 seconds with little perceptible loss of quality. llvm-svn: 55391	2008-08-26 22:07:42 +00:00
Devang Patel	4310d39844	If IV is used in a int-to-float cast inside the loop then try to eliminate the cast operation. llvm-svn: 55374	2008-08-26 17:57:54 +00:00
Chris Lattner	add44f3fb7	improve encapsulation of the BBExecutable set. llvm-svn: 55271	2008-08-23 23:39:31 +00:00
Chris Lattner	65938fc69a	Switch an assortment of maps, sets and vectors to more efficient versions, patch contributed by m-s! llvm-svn: 55270	2008-08-23 23:36:38 +00:00
Chris Lattner	0c19df4871	Switch the asmprinter (.ll) and all the stuff it requires over to use raw_ostream instead of std::ostream. Among other goodness, this speeds up llvm-dis of kc++ with a release build from 0.85s to 0.49s (88% faster). Other interesting changes: 1) This makes Value::print be non-virtual. 2) AP[S]Int and ConstantRange can no longer print to ostream directly, use raw_ostream instead. 3) This fixes a bug in raw_os_ostream where it didn't flush itself when destroyed. 4) This adds a new SDNode::print method, instead of only allowing "dump". A lot of APIs have both std::ostream and raw_ostream versions, it would be useful to go through and systematically anihilate the std::ostream versions. This passes dejagnu, but there may be minor fallout, plz let me know if so and I'll fix it. llvm-svn: 55263	2008-08-23 22:23:09 +00:00
Chris Lattner	20abc419e5	Add a new trivial -inst-namer pass which makes it possible to diff the before/after effects of a pass, crazy! llvm-svn: 55230	2008-08-23 06:07:02 +00:00
Chris Lattner	3f972c9150	Fix PR2423 by checking all indices for out of range access, not only indices that start with an array subscript. x->field[10000] is just as bad as (*X)[14][10000]. llvm-svn: 55226	2008-08-23 05:21:06 +00:00
Chris Lattner	5fc8ab6d18	consolidate DenseMapInfo implementations, and add one for std::pair. Patch contributed by m-s. llvm-svn: 55167	2008-08-22 05:08:25 +00:00
Nick Lewycky	99f4558117	Revert r54876 r54877 r54906 and r54907. Evan found that these caused a 20% slowdown in bzip2. llvm-svn: 55113	2008-08-21 05:56:10 +00:00
Evan Cheng	f5a7e51c81	Silence a compiler warning. llvm-svn: 55087	2008-08-20 23:36:48 +00:00
Mon P Wang	1b2c061b73	Fixed shuffle optimizations to handle non power of 2 vectors llvm-svn: 55035	2008-08-20 02:23:25 +00:00
Chris Lattner	57693dda1d	don't use the result of WriteAsOperand llvm-svn: 54979	2008-08-19 04:45:19 +00:00
Nick Lewycky	75d4a83f2f	Make this comment clearer. Instead of using an ambiguous ~ (not) on an icmp predicate, swap the order of the operands. llvm-svn: 54907	2008-08-17 20:02:02 +00:00
Nick Lewycky	53b44029d6	Consider the case where xor by -1 and xor by 128 have been combined already to produce an xor by 127. llvm-svn: 54906	2008-08-17 19:58:24 +00:00
Gordon Henriksen	d930f913e6	Rename some GC classes so that their roll will hopefully be clearer. In particular, Collector was confusing to implementors. Several thought that this compile-time class was the place to implement their runtime GC heap. Of course, it doesn't even exist at runtime. Specifically, the renames are: Collector -> GCStrategy CollectorMetadata -> GCFunctionInfo CollectorModuleMetadata -> GCModuleInfo CollectorRegistry -> GCRegistry Function::getCollector -> getGC (setGC, hasGC, clearGC) Several accessors and nested types have also been renamed to be consistent. These changes should be obvious. llvm-svn: 54899	2008-08-17 18:44:35 +00:00
Evan Cheng	5dabe042a6	Revert 54821. It's miscompiling 252.eon and 447.dealII llvm-svn: 54878	2008-08-17 08:07:31 +00:00
Nick Lewycky	18c6f56c76	I found a better place for this optz'n. llvm-svn: 54877	2008-08-17 07:54:14 +00:00
Nick Lewycky	18f50b2637	Xor'ing both sides of icmp by sign-bit is equivalent to swapping signedness of the predicate. Also, make this optz'n apply in more cases where it's safe to do so. llvm-svn: 54876	2008-08-17 07:34:14 +00:00
Chris Lattner	17f7165f84	Rework the routines that convert AP[S]Int into a string. Now, instead of returning an std::string by value, it fills in a SmallString/SmallVector passed in. This significantly reduces string thrashing in some cases. More specifically, this: - Adds an operator<< and a print method for APInt that allows you to directly send them to an ostream. - Reimplements APInt::toString to be much simpler and more efficient algorithmically in addition to not thrashing strings quite as much. This speeds up llvm-dis on kc++ by 7%, and may also slightly speed up the asmprinter. This also fixes a bug I introduced into the asmwriter in a previous patch w.r.t. alias printing. llvm-svn: 54873	2008-08-17 07:19:36 +00:00
Owen Anderson	affe0267f8	Remove GCSE, ValueNumbering, and LoadValueNumbering. These have been deprecated for almost a year; it's finally time for them to go away. llvm-svn: 54822	2008-08-15 21:31:02 +00:00
Devang Patel	f2a03d5a4b	Reapply 54786. Add overflow and number of mantissa bits checks. llvm-svn: 54821	2008-08-15 21:21:34 +00:00
Evan Cheng	86834d29f3	Revert 54786. It's not checking for overflows, etc. llvm-svn: 54813	2008-08-15 08:12:11 +00:00
Chris Lattner	1d23915a8f	use smallvector instead of vector for a couple worklists. This speeds up instcombine by ~10% on some testcases. llvm-svn: 54811	2008-08-15 04:03:01 +00:00
Bill Wendling	861bec78f8	Temporarily revert r54792. It's causing an ICE during bootstrapping. llvm-svn: 54804	2008-08-14 23:05:24 +00:00
Devang Patel	52dc07b01a	Use DenseMap. Patch by Pratik Solanki. llvm-svn: 54792	2008-08-14 21:31:10 +00:00
Devang Patel	054a833dd4	If IV is used in a int-to-float cast inside the loop then try to eliminate the cast opeation. llvm-svn: 54786	2008-08-14 20:58:31 +00:00
Dan Gohman	8de6d22392	Use empty() instead of begin() == end(). llvm-svn: 54780	2008-08-14 18:13:49 +00:00
Matthijs Kooijman	4801bd41cf	Replace two for loops with while(!X->use_empty()) loops. This prevents invalidating the iterator by deleting the current use. This fixes a segfault on 64 bit linux reported in PR2675. Also remove an unneeded if. llvm-svn: 54778	2008-08-14 15:03:05 +00:00
Dan Gohman	6134fbccef	Fix a bogus srem rule - a negative value srem'd by a power-of-2 can have a non-negative result; for example, -16%16 is 0. Also, clarify the related comments. This fixes PR2670. llvm-svn: 54767	2008-08-13 23:12:35 +00:00
Dan Gohman	8ded5d5884	Fix SCCP's handling of struct value loads and stores. SCCP doesn't track individual leaf values in such cases, so it needs to treat struct values as normal values in this case. llvm-svn: 54760	2008-08-13 21:22:48 +00:00
Devang Patel	6369a798ba	Rename. s/FindIVForUser/FindIVUserForCond/g llvm-svn: 54754	2008-08-13 20:31:11 +00:00
Devang Patel	97387e6615	Check sign to detect overflow before changing compare stride. llvm-svn: 54710	2008-08-13 02:05:14 +00:00
Bill Wendling	f21a38700f	Remove tabs. llvm-svn: 54707	2008-08-12 23:15:44 +00:00
Chris Lattner	2aa0ff27aa	Implement support for simplifying vector comparisons by 0.0 and 1.0 like we do for scalars. Patch contributed by Nicolas Capens This also generalizes the previous xforms to work on long double, now that isExactlyValue works for long double. llvm-svn: 54653	2008-08-11 22:06:05 +00:00
Eric Christopher	5927883970	Have IRBuilder take a template argument on whether or not to preserve names. This can save a lot of allocations if you aren't going to be looking at the output. llvm-svn: 54546	2008-08-08 19:39:37 +00:00
Matthijs Kooijman	75b4fc2c84	Let SRETPromotion properly preserve the function name instead of (implicitly) postfixing it with a number. llvm-svn: 54468	2008-08-07 16:01:23 +00:00
Matthijs Kooijman	d6c1c8a974	Fix SRETPromotion, it was generating functions without returns statements since r53941 (but this was not noticed due to the lack of a basic test for SRETPromotion). llvm-svn: 54467	2008-08-07 15:58:09 +00:00
Matthijs Kooijman	41536988dd	Add some debug output to SRETPromotion. llvm-svn: 54464	2008-08-07 15:14:04 +00:00
Dan Gohman	ac22cfcae9	Fix a shufflevector instcombine that was emitting invalid masks indices when it meant to be emitting undef indices. llvm-svn: 54417	2008-08-06 18:17:32 +00:00
Evan Cheng	907dc2bc37	Fix PR2355: bug in ChangeCompareStride. When the loop termination compare is the only use of its iv stride, the stride can be eliminated by moving it to another stride. If the scale is negative, swap the predicate instead of using a inverse predicate. llvm-svn: 54415	2008-08-06 18:04:43 +00:00
Chris Lattner	f5b353c1fd	optimize a common idiom generated by clang for bitfield access, PR2638. llvm-svn: 54408	2008-08-06 07:35:52 +00:00
Chris Lattner	7bdaecb7f4	Zap sitofp/fptoui pairs. In all cases when the sign difference matters, the result is undefined anyway. llvm-svn: 54396	2008-08-06 05:13:06 +00:00
Nick Lewycky	bf42893567	Reinstate this optimization, but without the miscompile. Thanks to Bill for tracking down that this was breaking llvm-gcc bootstrap on Linux. llvm-svn: 54394	2008-08-06 04:54:03 +00:00
Dan Gohman	1fcc804cfd	Pass the computed iteration count value to RewriteLoopExitValues instead of having it call getIterationCount again. llvm-svn: 54380	2008-08-05 22:34:21 +00:00
Bill Wendling	ee12a7aeff	Revert r53282. This was causing a miscompile on Linux. Also, the transformation looks bogus. Please see PR2629 for details on why this is breaking things. llvm-svn: 54372	2008-08-05 21:23:45 +00:00
Dan Gohman	3da016d137	Trim #includes. llvm-svn: 54350	2008-08-05 15:32:23 +00:00
Duncan Sands	c1e48b582d	Fix comment typos. llvm-svn: 54266	2008-08-01 12:23:49 +00:00
Nate Begeman	fecbc8cff1	Add vector shifts to the IR, patch by Eli Friedman. CodeGen & Clang work coming next. llvm-svn: 54161	2008-07-29 15:49:41 +00:00
Matthijs Kooijman	98b5c16e3b	Add -unroll-allow-partial command line option that enabled the loop unroller to partially unroll a loop when fully unrolling would not fit under the threshold. Patch by Mikael Lepistö. llvm-svn: 54160	2008-07-29 13:21:23 +00:00
Matthijs Kooijman	fd3070459b	Restructure ArgumentPromotion a bit. Instead of just having a single boolean that says "unconditional loads from this argument are safe", we now keep track of the safety per set of indices from which loads happen. This prevents ArgPromotion from promoting loads that aren't really valid. As an added effect, this will now disregard the the type of the indices passed to a GEP, so "load GEP %A, i32 1" and "load GEP %A, i64 1" will result in a single argument, not two. This fixes PR2598, for which a testcase has been added as well. llvm-svn: 54159	2008-07-29 10:00:13 +00:00
Owen Anderson	813bf7af7f	Don't remove volatile loads. Thanks to Duncan for noticing this one. llvm-svn: 54144	2008-07-28 20:52:42 +00:00
Owen Anderson	3f3389745d	Add support for eliminating stores that store the same value that was just loaded. This fixes PR2599. llvm-svn: 54133	2008-07-28 16:14:26 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Dan Gohman	5f36a32e7b	Put the LICM of constant GlobalVariables, introduced in r53945, under a command-line option, and disable it by default. It introduced performance regressions because CodeGen is currently not able to remat such loads. llvm-svn: 53997	2008-07-24 23:57:25 +00:00
Chris Lattner	8a8fb908dc	"Allow LICM to sink or lift loads from constant memory. Also add a test case for this. This allows instructions like loads from global variables declared to be constant to be moved out of loops." Patch by Stefanus Du Toit! llvm-svn: 53945	2008-07-23 05:06:28 +00:00
Dan Gohman	fa1211f69b	Enable first-class aggregates support. Remove the GetResultInst instruction. It is still accepted in LLVM assembly and bitcode, where it is now auto-upgraded to ExtractValueInst. Also, remove support for return instructions with multiple values. These are auto-upgraded to use InsertValueInst instructions. The IRBuilder still accepts multiple-value returns, and auto-upgrades them to InsertValueInst instructions. llvm-svn: 53941	2008-07-23 00:34:11 +00:00
Dan Gohman	7ad3cd8c9d	Fix a bug in LSR's dead-PHI cleanup. If a PHI has a def-use chain that leads into a cycle involving a different PHI, LSR got stuck running around that cycle looking for the original PHI. To avoid this, keep track of visited PHIs and stop searching if we see one more than once. This fixes PR2570. llvm-svn: 53879	2008-07-21 21:45:02 +00:00
Duncan Sands	2c741145a7	Supress a gcc-4.3 warning. llvm-svn: 53771	2008-07-18 21:06:02 +00:00
Owen Anderson	04a6e0ba8c	Make PRE actually handle critical edges (by splitting them). Confirmed that bootstrap passes with this change. llvm-svn: 53762	2008-07-18 18:03:38 +00:00
Owen Anderson	9858691f25	Reapply r53735. My last patch fixed the failures Dan observed. llvm-svn: 53761	2008-07-18 17:49:43 +00:00
Owen Anderson	1468bec06e	Add some checks that got lost in the shuffle. This fixes 464.h264ref. llvm-svn: 53760	2008-07-18 17:46:41 +00:00
Dan Gohman	29c3adaae0	Revert r53735. It broke SPEC 464.h264ref. llvm-svn: 53757	2008-07-18 16:44:49 +00:00
Owen Anderson	fd7102037d	Use MergeBlockIntoPredecessor to simplify some code. llvm-svn: 53735	2008-07-17 20:00:46 +00:00
Owen Anderson	27405efdc0	Make MergeBlockIntoPredecessor more aggressive when the same successor appears more than once. llvm-svn: 53731	2008-07-17 19:42:29 +00:00
Owen Anderson	addbe3eed1	Enable PRE. My last batch of changes fixed the miscompile. llvm-svn: 53730	2008-07-17 19:41:00 +00:00
Matthijs Kooijman	8b69d77a7a	Make GlobalOpt preserve address spaces when scalar replacing aggregate globals. llvm-svn: 53716	2008-07-17 11:59:53 +00:00
Chris Lattner	c600c53d1f	Fix PR2553 llvm-svn: 53715	2008-07-17 06:07:20 +00:00
Evan Cheng	97cd0298cc	Inliner tweak. Function calls should cost more than one instruction! llvm-svn: 53712	2008-07-17 01:31:49 +00:00
Owen Anderson	c062381c7b	Factor MergeBlockIntoPredecessor out into BasicBlockUtils. llvm-svn: 53705	2008-07-17 00:01:40 +00:00
Owen Anderson	ac31096311	There's no need to iterate block merging and PRE. In fact, iterating the latter could cause problems for memdep when it breaks critical edges. llvm-svn: 53691	2008-07-16 17:52:31 +00:00
Matthijs Kooijman	c1d7477ed2	Redo InstCombiner::visitExtractValueInst. Instead of using the (complicate) FindInsertedValue, it now performs a number of simple transformations that should result in the same effect when applied iteratively. llvm-svn: 53673	2008-07-16 12:55:45 +00:00
Evan Cheng	c97094552c	Fix PR2296. Do not transform x86_sse2_storel_dq into a full-width store. llvm-svn: 53666	2008-07-16 07:28:14 +00:00
Owen Anderson	24768e3dc4	Revert this, as it seems to still be broken. llvm-svn: 53627	2008-07-15 17:59:02 +00:00
Owen Anderson	9d1f497a28	Enable local PRE by default. llvm-svn: 53616	2008-07-15 16:28:23 +00:00
Owen Anderson	53d546e40b	Have GVN do a pre-pass over the CFG that folds away unconditional branches where possible. This allows local PRE to be more aggressive. llvm-svn: 53615	2008-07-15 16:28:06 +00:00
Matthijs Kooijman	c893bf472d	Allow deadargelim to change return types even though now values were dead. This again canonicalizes {i32} into i32 and {} into void. llvm-svn: 53610	2008-07-15 14:42:31 +00:00
Matthijs Kooijman	5e8c022e21	Revert r53606. It turns out that explicitely tracking the liveness of the return value as a whole in deadargelim is really not needed now that we simply rebuild the old return value and actually prevents some canonicalization from taking place. This revert stops deadargelim from changing {i32} into i32 for now, but I'll fix that next. llvm-svn: 53609	2008-07-15 14:39:36 +00:00
Matthijs Kooijman	c1da874478	Make deadargelim a bit less smart, so it doesn't choke on nested structs as return values that are still (partially) live. Instead of updating all uses of a call instruction after removing some elements, it now just rebuilds the original struct (With undef gaps where the unused values were) and leaves it to instcombine to clean this up. The added testcase still fails currently, but this is due to instcombine which isn't good enough yet. I will fix that part next. llvm-svn: 53608	2008-07-15 14:03:10 +00:00
Matthijs Kooijman	04d4c328ac	Don't use isa when we can reuse a previous dyn_cast. llvm-svn: 53607	2008-07-15 13:39:08 +00:00
Matthijs Kooijman	84194b6768	Make DeadArgElim keep liveness of the return value as a whole in addition to only the liveness of partial return values (for functions returning a struct). This is more explicit to prevent unwanted changes in the return value. In particular, deadargelim now canonicalizes a function returning {i32} to returning i32 and {} to void, if the struct returned is not used in its entirety, but only the single element is used. llvm-svn: 53606	2008-07-15 13:36:06 +00:00
Matthijs Kooijman	79a8eb547c	Let DAE keep a list of live functions, instead of simply marking all arguments and return values live for those functions. This doesn't change anything yet, but prepares for the coming commits. llvm-svn: 53601	2008-07-15 09:11:16 +00:00
Matthijs Kooijman	e9af814669	Split DAE::MarkLive into MarkLive and PropagateLiveness. llvm-svn: 53600	2008-07-15 09:00:17 +00:00
Matthijs Kooijman	2ce5709e31	Pass around const RetOrArg references instead of copying values. Also, mark RetOrArg::getDescription() as const. llvm-svn: 53599	2008-07-15 08:56:49 +00:00
Matthijs Kooijman	f2860b9fb3	Simplify debug code by using RetOrArg::getDescription(). llvm-svn: 53598	2008-07-15 08:53:36 +00:00
Matthijs Kooijman	90d08addb0	Fix indentation (intentionally left out of the previous commit). llvm-svn: 53592	2008-07-15 08:47:32 +00:00
Matthijs Kooijman	06642d3812	Move the deadargelim code for intrinsically alive functions into its own method, to slightly simplify control flow. llvm-svn: 53591	2008-07-15 08:45:12 +00:00
Dan Gohman	162668fa78	Fix uninitialized use of the Changed variable. llvm-svn: 53564	2008-07-14 17:55:01 +00:00
Chris Lattner	8882b1c41c	Reapply r53540, now with the matching header! llvm-svn: 53557	2008-07-14 17:32:59 +00:00
Duncan Sands	68b0383057	Revert r53540 - it does not compile. llvm-svn: 53549	2008-07-14 07:59:28 +00:00
Chris Lattner	2831ad28be	If a function calls setjmp, never inline it into other functions. This is a hack around the fact that we don't represent the CFG correctly for sj/lj. It fixes PR2486. llvm-svn: 53540	2008-07-14 00:46:56 +00:00
Chris Lattner	6f5ea6e49c	simplify some code, shuffle and insertelt always return a vector. llvm-svn: 53538	2008-07-14 00:32:20 +00:00
Chris Lattner	16395e51f4	Fix PR2506 by being a bit more careful about reverse fact propagation when disproving a condition. This actually compiles the existing testcase (udiv_select_to_select_shift) to: define i64 @test(i64 %X, i1 %Cond) { entry: %divisor1.t = lshr i64 %X, 3 ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %divisor1.t, %quotient2 ; <i64> [#uses=1] ret i64 %sum } instead of: define i64 @test(i64 %X, i1 %Cond) { entry: %quotient1.v = select i1 %Cond, i64 3, i64 4 ; <i64> [#uses=1] %quotient1 = lshr i64 %X, %quotient1.v ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %quotient1, %quotient2 ; <i64> [#uses=1] ret i64 %sum } llvm-svn: 53534	2008-07-14 00:15:52 +00:00
Chris Lattner	80b03a1b49	Fix mishandling of the infinite loop case when merging two blocks. This fixes PR2540. llvm-svn: 53533	2008-07-13 22:23:11 +00:00
Chris Lattner	834ab4ec1b	more refactoring. Use early exits instead of really complex logic. No functionality change. llvm-svn: 53532	2008-07-13 22:04:41 +00:00
Chris Lattner	5eed37224a	improve comments. llvm-svn: 53531	2008-07-13 21:55:46 +00:00
Chris Lattner	9aada1d755	factor another large hunk of code out into its own function. No functionality change. llvm-svn: 53530	2008-07-13 21:53:26 +00:00
Chris Lattner	55eaae1e0c	Final bit of simplification for FoldBranchToCommonDest. llvm-svn: 53528	2008-07-13 21:20:19 +00:00
Chris Lattner	1b317ea48a	simplify logic a bit llvm-svn: 53527	2008-07-13 21:15:11 +00:00
Chris Lattner	2e25b8f444	Refactor some code out into its own helper function, getting rid of crazy multiline conditionals and commenting the code better. No functionality change. llvm-svn: 53526	2008-07-13 21:12:01 +00:00
Nick Lewycky	f76aa23b54	Enhance analysis of srem. Remove dead code analyzing urem. 'urem' of power-of-2 is canonicalized to an 'and' instruction. llvm-svn: 53506	2008-07-12 05:04:38 +00:00
Dan Gohman	3707f1daba	Use find instead of lower_bound. llvm-svn: 53474	2008-07-11 20:58:19 +00:00
Owen Anderson	8e462e9a82	Don't call lookupNumber more than we have to. llvm-svn: 53470	2008-07-11 20:05:13 +00:00
Nick Lewycky	45e127ab20	Document 'mask' in this calculation. llvm-svn: 53454	2008-07-11 08:16:26 +00:00
Nick Lewycky	da405e1155	Remove misleading constant from comment. llvm-svn: 53452	2008-07-11 07:36:19 +00:00
Nick Lewycky	f95b64acaa	Add another optimization from PR2330. Also catch some missing cases that are similar. llvm-svn: 53451	2008-07-11 07:20:53 +00:00
Chris Lattner	3994bed1a9	a missed optimization that Eli spotted llvm-svn: 53449	2008-07-11 06:40:29 +00:00
Chris Lattner	13a6911ea2	another bug in the same line. llvm-svn: 53448	2008-07-11 06:38:16 +00:00
Chris Lattner	de89b507dd	fix a bug spotted by Eli's eagle eyes llvm-svn: 53447	2008-07-11 06:36:01 +00:00
Chris Lattner	bd25b8507c	simplify and merge a bunch of code. Instead of comparing against the min/max values for an integer type, compare against the min/max values we can prove contain the input. This might be a tighter bound, so this is general goodness. llvm-svn: 53446	2008-07-11 05:40:05 +00:00
Chris Lattner	38a50c9528	fold away (x <= cst) earlier, allowing us to not have to handle them in some code. llvm-svn: 53445	2008-07-11 05:08:55 +00:00
Chris Lattner	6af608b8ce	Fix folding of icmp's of i1 where the comparison is signed. The code was using the algorithm for folding unsigned comparisons which is completely wrong. This has been broken since the signless types change. llvm-svn: 53444	2008-07-11 04:20:58 +00:00
Chris Lattner	4fa8bb3430	Fix a bogus optimization: folding (slt (zext i1 A to i32), 1) -> (slt i1 A, true) This cause a regression in InstCombine/JavaCompare, which was doing the right thing on accident. To handle the missed case, generalize the comparisons based on masked bits a little bit to handle comparisons against the max value. For example, we can now xform (slt i32 (and X, 4), 4) -> (setne i32 (and X, 4), 4) llvm-svn: 53443	2008-07-11 04:09:09 +00:00
Matthijs Kooijman	e0f3ab82c4	Restructure dead argument elimination, try #3 :-) Rewrite the DeadArgumentElimination pass, to use a more explicit tracking of dependencies between return values and/or arguments. Also make the handling of arguments and return values the same. The pass now looks properly inside returned structs, but only at the first level (ie, not inside nested structs). This version fixed a few more bugs and was cleaned up a bit. It now passes all of LLVM's testing, and should still pass SPEC2006. There is still a minor bug with regard to returning nested structs. Since there is currently nothing that emits such IR, I will fix that in a seperate commit (partly because it requires a non-trivial fix). llvm-svn: 53400	2008-07-10 10:24:08 +00:00
Nick Lewycky	6193a564ab	Fix overzealous optimization. Thanks to Duncan Sands for pointing out my error! llvm-svn: 53393	2008-07-10 05:51:40 +00:00
Nick Lewycky	bb89c2a3f6	Simplify, suggested by Chris Lattner. llvm-svn: 53283	2008-07-09 07:35:26 +00:00
Nick Lewycky	f9c27c343a	Fold (a < 8) && (b < 8) into (a\|b) < 8 for unsigned less or greater than. llvm-svn: 53282	2008-07-09 07:29:11 +00:00
Nick Lewycky	364661c43e	Fold ((1 << a) & 1) to (a == 0). llvm-svn: 53276	2008-07-09 05:20:13 +00:00
Nick Lewycky	0d3645e673	Reduce x - y to -y when we know the 'x' part will get masked off anyways. llvm-svn: 53271	2008-07-09 04:32:37 +00:00
Devang Patel	51cbf928ab	If loop induction variable's start value is less then its exit value then do not split the loop. llvm-svn: 53265	2008-07-09 00:12:01 +00:00
Chris Lattner	501d78fdc0	Fix PR2496, a really nasty bug which involved sinking volatile loads into phis. This is actually the same bug as PR2262 / 2008-04-29-VolatileLoadDontMerge.ll, but I missed checking the first predecessor for multiple successors. Testcase here: InstCombine/2008-07-08-VolatileLoadMerge.ll llvm-svn: 53240	2008-07-08 17:18:32 +00:00
Evan Cheng	03001cb820	Fix two serious LSR bugs. 1. LSR runOnLoop is always returning false regardless if any transformation is made. 2. AddUsersIfInteresting can create new instructions that are added to DeadInsts. But there is a later early exit which prevents them from being freed. llvm-svn: 53193	2008-07-07 19:51:32 +00:00
Dan Gohman	38740a98b2	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Nick Lewycky	9f1a4dc672	Fix missed optimization opportunity when analyzing cast of mul and select. llvm-svn: 53151	2008-07-05 21:19:34 +00:00
Owen Anderson	3ea90a7d55	Use information already present in the ValueTable to fast-fail when we know there won't be a value number match. This speeds up GVN on a case where there are very few redundancies by ~25%. llvm-svn: 53108	2008-07-03 17:44:33 +00:00
Devang Patel	eb611ddeb2	Do not try to update dominator info while manipulating CFG. This code does not handle all cases and keeps invalid dom info around some cases, which misleads other passes down stream. Right now, dom info is recaluclated in the end if the loop is switched. llvm-svn: 53106	2008-07-03 17:37:52 +00:00
Owen Anderson	d57cdc3c60	Remove the ability for ADCE to remove unreachable blocks in loop nests, because, as Eli pointed out, SimplifyCFG already does this. llvm-svn: 53104	2008-07-03 17:21:41 +00:00
Bill Wendling	a96eabaab7	Remove unused function. llvm-svn: 53090	2008-07-03 07:10:03 +00:00
Devang Patel	f94b9826b5	Preserve dom info. llvm-svn: 53089	2008-07-03 07:04:22 +00:00
Devang Patel	226edd1826	Remove extra FIXME llvm-svn: 53087	2008-07-03 06:50:04 +00:00
Devang Patel	c4dcf82a16	Reconstruct dom info, if loop is unswitched. llvm-svn: 53086	2008-07-03 06:48:21 +00:00
Devang Patel	e491bb8845	LoopUnswitch does not preserve dominator info in all cases. llvm-svn: 53085	2008-07-03 05:55:03 +00:00
Devang Patel	7dcfff392a	Undo previous patch. It is not that simple to fix dom info here. llvm-svn: 53062	2008-07-03 00:08:13 +00:00
Devang Patel	5adfcb5783	Preserve dom info while simplifing loop after the unswitch. llvm-svn: 53052	2008-07-02 22:58:54 +00:00
Owen Anderson	488b89f608	Use df_ext_iterator to capture the reachable set without allocating an extra set. Also, move large sets and vectors out of instance variables and onto the stack, and give them more reasonable sizes. llvm-svn: 53044	2008-07-02 18:41:09 +00:00
Owen Anderson	6acc782dad	Avoid a redundant call. llvm-svn: 53040	2008-07-02 18:15:31 +00:00
Owen Anderson	323b5755a6	Add support to ADCE for pruning unreachable blocks. This addresses the final part of PR2509. llvm-svn: 53038	2008-07-02 18:05:19 +00:00
Owen Anderson	9edcf24da9	Use DenseSet rather than SmallPtrSet for the alive set. Using SmallPtrSet with a huge "size" parameter is actually quite inefficient. llvm-svn: 53034	2008-07-02 17:32:04 +00:00
Owen Anderson	b22a640fe4	A better fix for PR2503 that doesn't pessimize GVN in the presence of unreachable blocks. llvm-svn: 53032	2008-07-02 17:20:16 +00:00
Devang Patel	ed50fb5b61	reuse vectors. llvm-svn: 53007	2008-07-02 01:44:29 +00:00
Devang Patel	57d94d6304	Fix comment. llvm-svn: 53006	2008-07-02 01:31:19 +00:00
Devang Patel	e149d4ed4d	Preserve loop data so that it is not fetched everytime it is needed. Keep track of currentLoop. llvm-svn: 53005	2008-07-02 01:18:13 +00:00
Evan Cheng	da3db11db3	- Re-apply 52748 and friends with fix. GetConstantStringInfo() returns an empty string for ConstantAggregateZero case which surprises selectiondag. - Correctly handle memcpy from constant string which is zero-initialized. llvm-svn: 52891	2008-06-30 07:31:25 +00:00
Anton Korobeynikov	a7c583d584	Revert (52748 and friends): Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. This unbreaks llvm-gcc bootstrap. llvm-svn: 52884	2008-06-29 17:57:03 +00:00
Eric Christopher	3f1c75c4d8	Remove unused function. llvm-svn: 52749	2008-06-26 01:19:35 +00:00
Eric Christopher	d0ab9c47e6	Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. llvm-svn: 52748	2008-06-26 00:31:12 +00:00
Evan Cheng	88ca48b09d	Restore DeadArgElim back to 52570. It's breaking 447.dealII. llvm-svn: 52736	2008-06-25 18:10:09 +00:00
Duncan Sands	1b03c2ac98	Pacify gcc-4.3. llvm-svn: 52723	2008-06-25 16:31:18 +00:00
Matthijs Kooijman	2e2001d8b9	Fix a (false) warning on darwin. llvm-svn: 52705	2008-06-25 08:12:16 +00:00
Matthijs Kooijman	4e1cf1e7d7	Fix some cosmetics in comments. llvm-svn: 52704	2008-06-25 08:10:21 +00:00
Evan Cheng	5fd28b54c7	- Use O(1) check of basic block size limit. - Avoid speculatively execute vector ops. llvm-svn: 52703	2008-06-25 07:50:12 +00:00
Chris Lattner	c9c81fb0df	Fix PR2488, a case where we deleted stack restores too aggressively. llvm-svn: 52702	2008-06-25 05:59:28 +00:00
Dan Gohman	04c8bd7e11	Revert 52645, the loop unroller changes. It caused a regression in 252.eon. llvm-svn: 52688	2008-06-24 20:44:42 +00:00
Dan Gohman	4be44e62b3	Fix a typo in a comment. llvm-svn: 52687	2008-06-24 18:00:21 +00:00
Matthijs Kooijman	c702e1d32f	Commit the new DeadArgElim pass again, this time with the gcc bootstrap failures fixed. Also add a testcase to reproduce the gcc bootstrap failure in very much reduced form. llvm-svn: 52677	2008-06-24 16:30:26 +00:00
Matthijs Kooijman	19a6469e1b	Rename a few variables to be more consistent. llvm-svn: 52672	2008-06-24 09:14:10 +00:00
Dan Gohman	abd8f41c81	Use use_empty() instead of getNumUses(), avoiding a use list traversal. llvm-svn: 52651	2008-06-23 23:23:49 +00:00
Dan Gohman	ac563833ae	Fix spelling and grammar in a comment. llvm-svn: 52648	2008-06-23 22:11:52 +00:00
Dan Gohman	48c5c7e860	Revamp the loop unroller, extending it to correctly update PHI nodes in the presence of out-of-loop users of in-loop values and the trip count is not a known multiple of the unroll count, and to be a bit simpler overall. This fixes PR2253. llvm-svn: 52645	2008-06-23 21:29:41 +00:00
Evan Cheng	403e567043	Disable PRE. It's breaking bootstrapping. llvm-svn: 52643	2008-06-23 21:22:35 +00:00
Owen Anderson	54e02194a1	Tighten the conditions under which we do PRE, remove some unneeded code, and correct our preserved analyses list, since we do now change the CFG by splitting critical edges during PRE. llvm-svn: 52631	2008-06-23 17:49:45 +00:00
Chris Lattner	4d754bc97b	minor tidying of comments. llvm-svn: 52630	2008-06-23 17:11:23 +00:00
Owen Anderson	00fdbd01e5	At Chris' suggestion, move the liveness and worklist datastructures into instance variables so they can be allocated just once, and reuse the worklist as the dead list as well. llvm-svn: 52618	2008-06-23 06:13:12 +00:00
Dan Gohman	5ca5e02480	Improve LSR's dead-phi detection to handle use-def cycles with more than two nodes. llvm-svn: 52617	2008-06-22 20:44:02 +00:00
Dan Gohman	90071075e2	Use Loop::block_iterator. llvm-svn: 52616	2008-06-22 20:18:58 +00:00
Chris Lattner	6ff85681e4	Fix PR2369 by making scalarrepl more careful about promoting structures. Its default threshold is to promote things that are smaller than 128 bytes, which is sane. However, it is not sane to do this for things that turn into 128 registers. Add a cap on the number of registers introduced, defaulting to 128/4=32. llvm-svn: 52611	2008-06-22 17:46:21 +00:00
Eli Friedman	d3449df326	Fix for PR2479: correctly optimize expressions like (a > 13) & (a == 15). See also PR1800, which is about the signed case. llvm-svn: 52608	2008-06-21 23:36:13 +00:00
Dan Gohman	158ff2c4a9	Use Instruction::eraseFromParent(). llvm-svn: 52606	2008-06-21 22:08:46 +00:00
Chris Lattner	8459e0bc59	Fix warning when assertions disabled. llvm-svn: 52590	2008-06-21 19:49:01 +00:00
Evan Cheng	42bbca11cc	Enable PRE. llvm-svn: 52574	2008-06-21 07:26:53 +00:00
Evan Cheng	33067210d1	Back out Matthijs' DAE patches. It's miscompiling gcc driver. llvm-svn: 52570	2008-06-21 00:31:44 +00:00
Dan Gohman	3ada1e118b	Clean up a use of std::distance. llvm-svn: 52544	2008-06-20 17:11:32 +00:00
Dan Gohman	a5dd67f002	Tidy up some commments and use the getAggregateOperand and getInsertedValueOperand accessors. Thanks Matthijs! llvm-svn: 52543	2008-06-20 16:41:17 +00:00
Dan Gohman	b5210efb31	Fix the conditions under which SCCP should examine insertvalue instructions. Thanks to Matthijs Kooijman for pointing this out! llvm-svn: 52542	2008-06-20 16:39:44 +00:00
Matthijs Kooijman	c456f9dfc6	80 column and trailing whitespace fixes. llvm-svn: 52539	2008-06-20 15:34:07 +00:00
Matthijs Kooijman	0c50b953c5	Don't let DeadArgumentElimination attempt to update callers when the return type wasn't changed. llvm-svn: 52538	2008-06-20 15:25:43 +00:00
Matthijs Kooijman	9dc59b7666	Don't let DeadArgElimination change the return type ({} into void and {T} into T) when no return values are actually dead. llvm-svn: 52537	2008-06-20 15:16:45 +00:00
Matthijs Kooijman	013b6a9a42	Explicitely track if any arguments or return values were removed in DeadArgumentElimination and assert that the function type does not change if nothing was changed. This should catch subtle changes in function type that are not intended. llvm-svn: 52536	2008-06-20 14:28:52 +00:00
Matthijs Kooijman	e91aed6ce1	Remove debug output. llvm-svn: 52535	2008-06-20 14:03:35 +00:00
Matthijs Kooijman	8d32dee428	Recommit r52459, rewriting of the dead argument elimination pass. This is a fixed version that no longer uses multimap::equal_range, which resulted in a pointer invalidation problem. Also, DAE::InspectedFunctions was not really necessary, so it got removed. Lastly, this version no longer applies the extra arg hack on functions who did not have any arguments to start with. llvm-svn: 52532	2008-06-20 09:36:16 +00:00
Owen Anderson	78fbcafb53	Really disable PRE. llvm-svn: 52531	2008-06-20 08:59:13 +00:00
Chris Lattner	f3ecd2d290	Fix PR2471, which is a bug involving an invalid promotion from a conditional load. llvm-svn: 52525	2008-06-20 05:12:56 +00:00
Owen Anderson	1b3ea963f7	Change around the data structures used to store availability sets, resulting in a GVN+PRE that is faster that GVN alone was before. llvm-svn: 52521	2008-06-20 01:15:47 +00:00
Dan Gohman	041f9d03ff	Teach SCCP about insertvalue and extractvalue, and about propagating constants across aggregate return values when insertvalue and extractvalue are used. llvm-svn: 52520	2008-06-20 01:15:44 +00:00
Dan Gohman	3b18fd7b02	Teach InlineFunction how to differentiate between multiple-value return statements and aggregate returns so that it handles both correctly. llvm-svn: 52519	2008-06-20 01:03:44 +00:00
Evan Cheng	9598f930f3	Disable PRE for now. It seems to be breaking llvm-gcc bootstrapping. llvm-svn: 52518	2008-06-20 01:01:07 +00:00
Owen Anderson	e780d66657	Add a hidden -disable-pre flag for testing purposes. This should be removed once benchmarking is completed. llvm-svn: 52506	2008-06-19 19:57:25 +00:00
Owen Anderson	fdf9f168b5	PRE requires that critical edges be split. llvm-svn: 52505	2008-06-19 19:54:19 +00:00
Bill Wendling	cd6fb1d0a8	Remove dead code causing a warning. llvm-svn: 52502	2008-06-19 18:00:44 +00:00
Dan Gohman	d6530872f3	Use the common API for adding instructions to basic blocks instead of using BasicBlock::getInstList. llvm-svn: 52500	2008-06-19 17:53:32 +00:00
Owen Anderson	ff21db851d	Be sure to remove values from the value numbering table after we delete them. This fixes a failure on povray. llvm-svn: 52499	2008-06-19 17:53:26 +00:00
Dan Gohman	ed2250990a	Use Instruction::moveBefore instead of manipulating the instruction list directly. llvm-svn: 52498	2008-06-19 17:47:47 +00:00
Dan Gohman	9eea470fcf	Avoid using BasicBlock::getInstList directly in a few places. llvm-svn: 52497	2008-06-19 17:37:25 +00:00
Owen Anderson	45d3701fce	Revert support for insertvalue and extractvalue instructions for the moment. GVN expects that all inputs which to an instruction fall somewhere in the value hierarchy, which isn't true for these. llvm-svn: 52496	2008-06-19 17:25:39 +00:00
Dan Gohman	68f539e807	Delete dead code. llvm-svn: 52494	2008-06-19 17:18:39 +00:00
Matthijs Kooijman	0c71732497	Use a CallSite to find the nth argument of a call/invoke instruction instead of using getOperand() directly. This makes things work with invoke instructions as well. llvm-svn: 52489	2008-06-19 08:53:24 +00:00
Owen Anderson	3ea800fbad	Add support for extractvalue and insertvalue instructions in GVN. llvm-svn: 52472	2008-06-18 21:59:00 +00:00
Owen Anderson	6a903bc601	Add local PRE to GVN. This only operates in cases where it would not increase code size, namely when the instantiated expression would only need to be created in one predecessor. llvm-svn: 52471	2008-06-18 21:41:49 +00:00
Chris Lattner	78119b4742	Fix the regressions on sext-misc.ll my patch yesterday caused. llvm-svn: 52466	2008-06-18 18:11:55 +00:00
Owen Anderson	9094cc957e	Revert r52459, which was causing an infinite loop or massive slowdown on MultiSource/Applications/SPASS, and possibly others as well. Please reapply once this is fixed. llvm-svn: 52465	2008-06-18 17:32:16 +00:00
Dan Gohman	be928e3b21	Move LSR's private isZero function to a public SCEV member function, and make use of it in several places. llvm-svn: 52463	2008-06-18 16:23:07 +00:00
Matthijs Kooijman	964557fdf5	Rewrite the DeadArgumentElimination pass, to use a more explicit tracking of dependencies between return values and/or arguments. Also make the handling of arguments and return values the same. The pass now looks properly inside returned structs, but only at the first level (ie, not inside nested structs). Also add a testcase for testing various variations of (multiple) dead rerturn values. llvm-svn: 52459	2008-06-18 11:12:53 +00:00
Matthijs Kooijman	fd17357643	Reapply r52397 (make IPConstProp promote returned arguments), but fixed this time. Sorry for the trouble! This time, also add a testcase, which I should have done in the first place... llvm-svn: 52455	2008-06-18 08:30:37 +00:00
Matthijs Kooijman	97034598b1	Reapply r52396, it was unrelated to the breakage (that was caused by r52397, my commit after this). llvm-svn: 52453	2008-06-18 08:09:27 +00:00
Chris Lattner	ef36dcd10b	implement some simple bswap optimizations, rdar://5992453 llvm-svn: 52442	2008-06-18 04:33:20 +00:00
Chris Lattner	b5ee8b3e89	make truncate/sext elimination capable of changing phi's. This implements rdar://6013816 and the testcase in Transforms/InstCombine/sext-misc.ll. llvm-svn: 52440	2008-06-18 04:00:49 +00:00
Devang Patel	cd6b697945	Preserve dominance frontier while trivially unswitching loop. llvm-svn: 52438	2008-06-18 02:16:38 +00:00
Owen Anderson	75f3732b23	We don't want to find dependencies within the same block in this case. It leads to incorrect results because we're detecting something at or after the call we're querying on. llvm-svn: 52433	2008-06-17 22:27:06 +00:00
Chris Lattner	aecc3750d1	revert recent patch which is causing widespread breakage. llvm-svn: 52415	2008-06-17 17:06:43 +00:00
Duncan Sands	4b50fde2c4	Fix typo that changed the logic to something wrong. Spotted by Nick Lewycky. llvm-svn: 52411	2008-06-17 15:55:30 +00:00
Matthijs Kooijman	332836d68d	Learn IPConstProp to propagate arguments that are directly returned. Strictly speaking these are not constant values. However, when a function always returns one of its arguments, then from the point of view of each caller the return value is constant (or at least a known value) and can be replaced. llvm-svn: 52397	2008-06-17 12:20:24 +00:00
Matthijs Kooijman	f03c1ae407	Learn IPConstProp to look at individual return values and propagate them individually. Also learn IPConstProp how returning first class aggregates work, in addition to old style multiple return instructions. Modify the return-constants testscase to confirm this behaviour. llvm-svn: 52396	2008-06-17 12:02:52 +00:00
Dan Gohman	ab0dccba6b	Refine the change in r52258 for avoiding use-before-def conditions when changing the stride of a comparison so that it's slightly more precise, by having it scan the instruction list to determine if there is a use of the condition after the point where the condition will be inserted. llvm-svn: 52371	2008-06-16 22:34:15 +00:00
Evan Cheng	319e9a4f63	Switch over to SetVector to ensure same order of iterations do not vary across runs. llvm-svn: 52361	2008-06-16 21:08:17 +00:00
Evan Cheng	a72cdcd1a2	Iterating over SmallPtrSet is not deterministic. llvm-svn: 52339	2008-06-16 18:17:09 +00:00
Matthijs Kooijman	86cda9e050	Pass around Instruction* instead of Instruction& in FindInsertedValue and friends. llvm-svn: 52318	2008-06-16 13:13:08 +00:00
Matthijs Kooijman	5cb387735d	80 column fixes. llvm-svn: 52316	2008-06-16 12:57:37 +00:00
Matthijs Kooijman	e92e18be5a	Move FindScalarValue from InstructionCombining.cpp to ValueTracking.cpp. While I'm at it, rename it to FindInsertedValue. The only functional change is that newly created instructions are no longer added to instcombine's worklist, but that is not really necessary anyway (and I'll commit some improvements next that will completely remove the need). llvm-svn: 52315	2008-06-16 12:48:21 +00:00
Chris Lattner	1c9922703f	Fix the crash on SimplifyLibCalls/2005-05-20-sprintf-crash.ll llvm-svn: 52295	2008-06-16 04:10:21 +00:00
Chris Lattner	a88cd4ea2a	Fix a case where tailcallelim wouldn't set the changed bit when it made a change. llvm-svn: 52267	2008-06-14 00:49:48 +00:00
Eli Friedman	5de0a77a9b	Don't skip over instructions other than loads that might read memory when trying to sink stores. llvm-svn: 52259	2008-06-13 22:02:12 +00:00
Dan Gohman	9ad8c54aab	Protect ChangeCompareStride from situations in which it is possible for it to generate use-before-def IR, such as in this testcase. llvm-svn: 52258	2008-06-13 21:43:41 +00:00
Eli Friedman	9833a1b407	Make sure SimplifyStoreAtEndOfBlock doesn't mess with loops; the structure checks are incorrect if the blocks aren't distinct. Fixes PR2435. llvm-svn: 52257	2008-06-13 21:17:49 +00:00
Wojciech Matyjewicz	25a7f5de92	Use recently added getTruncateOrZeroExtend method to make the code shorter. llvm-svn: 52251	2008-06-13 17:02:03 +00:00
Gabor Greif	431e9560b7	fix a minor deviation from the original in my previous commit llvm-svn: 52247	2008-06-12 21:51:29 +00:00
Gabor Greif	f6d8e77027	op_iterator-ify some loops, low hanging fruit only, there is more llvm-svn: 52246	2008-06-12 21:37:33 +00:00
Evan Cheng	89553cc42e	Do not speculatively execute an instruction by hoisting it to its predecessor BB if any of its operands are defined but not used in BB. The transformation will prevent the operand from being sunk into the use block. llvm-svn: 52244	2008-06-12 21:15:59 +00:00
Evan Cheng	70fe16353a	Revert 52223. llvm-svn: 52243	2008-06-12 20:55:39 +00:00
Owen Anderson	accdca1b03	Switch GVN to use ScopedHashTable. llvm-svn: 52242	2008-06-12 19:25:32 +00:00
Gabor Greif	0babc61631	op_iterator-ify some loops, fix 80col violations llvm-svn: 52226	2008-06-11 21:38:51 +00:00
Evan Cheng	933c743042	For now, avoid generating FP select instructions in order to speculatively execute integer arithmetic instructions. FP selects are more likely to be expensive (even compared to branch on fcmp). This is not a wonderful solution but I rather err on the side of conservative. This fixes the heapsort performance regressions. llvm-svn: 52224	2008-06-11 19:18:20 +00:00
Evan Cheng	f3c2902ead	Avoid duplicating loop header which leads to unnatural loops (and just seem like general badness to me, likely to cause code explosion). Patch by Florian Brandner. llvm-svn: 52223	2008-06-11 19:07:54 +00:00
Matthijs Kooijman	b2fc72bfbf	Teach instruction combining about the extractvalue. It can succesfully fold useless insert-extract chains, similar to how it folds them for vectors. Add a testcase for this. llvm-svn: 52217	2008-06-11 14:05:05 +00:00
Matthijs Kooijman	3453c7bcb5	Clarify a comment. llvm-svn: 52212	2008-06-11 09:00:12 +00:00
Gabor Greif	945f2f7fed	op_iterator-ify loops llvm-svn: 52191	2008-06-10 22:03:26 +00:00
Chris Lattner	9c9f531a47	lower calls to abs to inline code, PR2337 llvm-svn: 52138	2008-06-09 08:26:51 +00:00
Chris Lattner	dbd595f22d	Fix PR2411, where ip constant prop would propagate the result of a weak function. llvm-svn: 52137	2008-06-09 07:58:07 +00:00
Duncan Sands	11dd424539	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Chris Lattner	b4866ef30c	Limit the icmp+phi merging optimization to the cases where it is profitable: don't make i1 phis when it won't be possible to eliminate them. llvm-svn: 52097	2008-06-08 20:52:11 +00:00
Evan Cheng	89200c9177	Speculatively execute a block when the the block is the then part of a triangle shape and it contains a single, side effect free, cheap instruction. The branch is eliminated by adding a select instruction. i.e. Turn BB: %t1 = icmp br i1 %t1, label %BB1, label %BB2 BB1: %t3 = add %t2, c br label BB2 BB2: => BB: %t1 = icmp %t4 = add %t2, c %t3 = select i1 %t1, %t2, %t3 llvm-svn: 52073	2008-06-07 08:52:29 +00:00
Devang Patel	8549e4ca07	LoopSimplify preserves AA. llvm-svn: 52053	2008-06-06 17:50:58 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Zhou Sheng	1152ca9101	As Chris suggested, handle the situation if ShAmt larger than BitWidth, otherwise, opt might crash. llvm-svn: 52041	2008-06-06 08:32:05 +00:00
Zhou Sheng	fbe1dc240c	If BitWidth equals to ShtAmt, the RHSKnownZero[BitWidth-ShiftAmt-1] will crash the opt. Just fix this. Test case in llvm/test/Transforms/InstCombine/2008-06-05-ashr-crash.ll llvm-svn: 52003	2008-06-05 14:23:44 +00:00
Matthijs Kooijman	812989b147	Learn ScalarReplAggregrates how stores and loads of first class aggregrates work and how to replace them into individual values. Also, when trying to replace an aggregrate that is used by load or store with a single (large) integer, don't crash (but don't replace the aggregrate either). Also adds a testcase for both structs and arrays. llvm-svn: 51997	2008-06-05 12:51:53 +00:00
Matthijs Kooijman	e0c5adc158	Let StructRetPromotion check if all if its users are really calls or invokesn, not other instructions. This fixes a crash with the added testcase. llvm-svn: 51992	2008-06-05 08:57:20 +00:00
Matthijs Kooijman	463f86639d	Let StructRetPromotion check if it's users are really calling it and not passing its pointer. Fixes test with added testcase. llvm-svn: 51991	2008-06-05 08:48:32 +00:00
Matthijs Kooijman	230d6fbfeb	Use use_iterator::getOperandNo instead of CallSite::hasArgument to check if a function is passed as an argument instead of called. Also do this check a bit earlier. llvm-svn: 51990	2008-06-05 08:34:25 +00:00
Matthijs Kooijman	5afc2740b7	Update comments and documentation to reflect that GCSE and ValueNumbering are deprecated by the GVN and GVNPRE passes. llvm-svn: 51983	2008-06-05 07:55:49 +00:00
Owen Anderson	61c7f2a633	Remove unneeded #include. llvm-svn: 51955	2008-06-04 18:28:10 +00:00
Matthijs Kooijman	2353f35989	Replace two manual loops with calls to CallSite::hasArguments (no functional changes). llvm-svn: 51947	2008-06-04 16:57:50 +00:00
Duncan Sands	fc3c489b52	Change packed struct layout so that field sizes are the same as in unpacked structs, only field positions differ. This only matters for structs containing x86 long double or an apint; it may cause backwards compatibility problems if someone has bitcode containing a packed struct with a field of one of those types. The issue is that only 10 bytes are needed to hold an x86 long double: the store size is 10 bytes, but the ABI size is 12 or 16 bytes (linux/ darwin) which comes from rounding the store size up by the alignment. Because it seemed silly not to pack an x86 long double into 10 bytes in a packed struct, this is what was done. I now think this was a mistake. Reserving the ABI size for an x86 long double field even in a packed struct makes things more uniform: the ABI size is now always used when reserving space for a type. This means that developers are less likely to make mistakes. It also makes life easier for the CBE which otherwise could not represent all LLVM packed structs (PR2402). Front-end people might need to adjust the way they create LLVM structs - see following change to llvm-gcc. llvm-svn: 51928	2008-06-04 08:21:45 +00:00
Owen Anderson	2df82e7cec	LoopIndexSplit can sometimes result in cases where a block in its own domfrontier. Don't crash when we encounter one of these. llvm-svn: 51915	2008-06-03 18:29:48 +00:00
Dan Gohman	2ad7e7341c	Fix whitespace in whitespace-significant pseudocode in a comment. llvm-svn: 51890	2008-06-03 00:57:21 +00:00
Devang Patel	7314d0ee3c	Update dom tree. Fix PR 2372. llvm-svn: 51887	2008-06-02 22:52:56 +00:00
Chris Lattner	a12a6de683	move CannotBeNegativeZero to ValueTracking. Simplify some signbit comparisons. llvm-svn: 51864	2008-06-02 01:29:46 +00:00
Chris Lattner	965c769b3c	move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits out of instcombine into a new file in libanalysis. This also teaches ComputeNumSignBits about the number of sign bits in a constantint. llvm-svn: 51863	2008-06-02 01:18:21 +00:00
Owen Anderson	38099c1b6e	Fix two issues that Eli Friedman pointed out, where would misoptimized code like: char a[200]; init(a, a+200); OR int a[200]; char* b = (char)a; char c = (char*)a; foo(b, c); llvm-svn: 51850	2008-06-01 22:26:26 +00:00
Owen Anderson	d071a8708e	Don't remove the memcpy when call slot substitution fails. llvm-svn: 51848	2008-06-01 21:52:16 +00:00
Duncan Sands	0397cd2ec4	When simplifying a call to a bitcast function, tighten up the conditions for performing the transform when only the function declaration is available: no longer allow turning i32 into i64 for example. Only allow changing between pointer types, and between pointer types and integers of the same size. For return values ptr -> intptr was already allowed; I added ptr -> ptr and intptr -> ptr while there. As shown by a recent objc testcase, changing the way parameters/return values are passed can be fatal when calling code written in assembler that directly manipulates call arguments and return values unless the transform has no impact on the way they are passed at the codegen level. While it is possible to imagine an ABI that treats integers of pointer size differently to pointers, I don't think LLVM supports any so the transform should now be safe while still being useful. llvm-svn: 51834	2008-06-01 07:38:42 +00:00
Nick Lewycky	035fe6f716	Peer through sext/zext when looking for not(cmp). llvm-svn: 51819	2008-05-31 19:01:33 +00:00
Nick Lewycky	26b8cd84b3	Add more i1 optimizations. add, sub, mul, s/udiv on i1 are now simplified away. llvm-svn: 51817	2008-05-31 17:59:52 +00:00
Nick Lewycky	df9242a833	Adding i1 is always Xor. llvm-svn: 51816	2008-05-31 17:10:28 +00:00
Gabor Greif	5df4326d78	rewrite operand loops to use iterators llvm-svn: 51789	2008-05-30 21:24:22 +00:00
Owen Anderson	1f59d9937f	Since LCSSA switched over to DenseMap, we have to be more careful to avoid iterator invalidation. Fixes PR2385. llvm-svn: 51777	2008-05-30 17:31:01 +00:00
Matthijs Kooijman	57da7d2308	Use eraseFromParent() instead of doing that manually in two places. llvm-svn: 51770	2008-05-30 12:35:46 +00:00
Dan Gohman	86ff8536f9	const-ify getOpcode. llvm-svn: 51698	2008-05-29 19:53:46 +00:00
Duncan Sands	9e064a2180	Add a newline at the end of this file. llvm-svn: 51680	2008-05-29 14:38:23 +00:00
Owen Anderson	7686b555e2	Replace the old ADCE implementation with a new one that more simply solves the one case that ADCE catches that normal DCE doesn't: non-induction variable loop computations. This implementation handles this problem without using postdominators. llvm-svn: 51668	2008-05-29 08:45:13 +00:00
Owen Anderson	f4aece5976	Remove debugging code. llvm-svn: 51666	2008-05-29 08:15:48 +00:00
Gabor Greif	3a9fba5a72	convert more operand loops to iterator formulation llvm-svn: 51663	2008-05-29 01:59:18 +00:00
Chris Lattner	ecdefb5df7	Implement PR2370: memmove(x,x,size) -> noop. llvm-svn: 51636	2008-05-28 05:30:41 +00:00
Duncan Sands	698348dfac	Fix some constructs that gcc-4.4 warns about. llvm-svn: 51591	2008-05-27 11:50:51 +00:00
Nick Lewycky	3ebe82b57a	InequalityGraph::node() can create new nodes, invalidating iterators across the set of nodes. Fix makeEqual to handle this by creating the new node first then iterating across them second. llvm-svn: 51573	2008-05-27 00:59:05 +00:00
Nick Lewycky	6be65d2a84	Grammaro. llvm-svn: 51572	2008-05-26 22:49:36 +00:00
Duncan Sands	dd7daee850	Factor code to copy global value attributes like the section or the visibility from one global value to another: copyAttributesFrom. This is particularly useful for duplicating functions: previously this was done by explicitly copying each attribute in turn at each place where a new function was created out of an old one, with the result that obscure attributes were regularly forgotten (like the collector or the section). Hopefully now everything is uniform and nothing is forgotten. llvm-svn: 51567	2008-05-26 19:58:59 +00:00
Owen Anderson	d3f21d165f	Use a DenseMap instead of an std::map, speeding up the testcase in PR2368 by about a third. llvm-svn: 51565	2008-05-26 10:07:43 +00:00
Nick Lewycky	f6ccd2580c	"ret (constexpr)" can't be folded into a Constant. Add a method to Analysis/ConstantFolding to fold ConstantExpr's, then make instcombine use it to try to use targetdata to fold constant expressions on void instructions. Also extend the icmp(inttoptr, inttoptr) folding to handle the case where int size != ptr size. llvm-svn: 51559	2008-05-25 20:56:15 +00:00
Chris Lattner	87a099a057	Fix a serious brain-o. Obviously no-one reviewed my patch :( This fixes PR2359 llvm-svn: 51536	2008-05-24 04:06:28 +00:00
Chris Lattner	5c207c83c6	Fix PR2358 by resolving calls with undef arguments to overdefined. llvm-svn: 51535	2008-05-24 03:59:33 +00:00
Evan Cheng	02912418f1	Remove x86.sse2.loadh.pd and x86.sse2.loadl.pd. These will be lowered into load and shuffle instructions. llvm-svn: 51521	2008-05-24 00:07:06 +00:00
Dan Gohman	f96e1371e8	Tidy up BasicBlock::getFirstNonPHI, and change a bunch of places to use it instead of duplicating its functionality. llvm-svn: 51499	2008-05-23 21:05:58 +00:00
Matthijs Kooijman	f52b23c0eb	Replace some weird usage of UserOp1 introduced in r49492 by a plain if. llvm-svn: 51482	2008-05-23 16:17:48 +00:00
Matthijs Kooijman	aef2b8198b	Restucture a part of the SimplifyCFG pass and include a testcase. The SimplifyCFG pass looks at basic blocks that contain only phi nodes, followed by an unconditional branch. In a lot of cases, such a block (BB) can be merged into their successor (Succ). This merging is performed by TryToSimplifyUncondBranchFromEmptyBlock. It does this by taking all phi nodes in the succesor block Succ and expanding them to include the predecessors of BB. Furthermore, any phi nodes in BB are moved to Succ and expanded to include the predecessors of Succ as well. Before attempting this merge, CanPropagatePredecessorsForPHIs checks to see if all phi nodes can be properly merged. All functional changes are made to this function, only comments were updated in TryToSimplifyUncondBranchFromEmptyBlock. In the original code, CanPropagatePredecessorsForPHIs looks quite convoluted and more like stack of checks added to handle different kinds of situations than a comprehensive check. In particular the first check in the function did some value checking for the case that BB and Succ have a common predecessor, while the last check in the function simply rejected all cases where BB and Succ have a common predecessor. The first check was still useful in the case that BB did not contain any phi nodes at all, though, so it was not completely useless. Now, CanPropagatePredecessorsForPHIs is restructured to to look a lot more similar to the code that actually performs the merge. Both functions now look at the same phi nodes in about the same order. Any conflicts (phi nodes with different values for the same source) that could arise from merging or moving phi nodes are detected. If no conflicts are found, the merge can happen. Apart from only restructuring the checks, two main changes in functionality happened. Firstly, the old code rejected blocks with common predecessors in most cases. The new code performs some extra checks so common predecessors can be handled in a lot of cases. Wherever common predecessors still pose problems, the blocks are left untouched. Secondly, the old code rejected the merge when values (phi nodes) from BB were used in any other place than Succ. However, it does not seem that there is any situation that would require this check. Even more, this can be proven. Consider that BB is a block containing of a single phi node "%a" and a branch to Succ. Now, since the definition of %a will dominate all of its uses, BB will dominate all blocks that use %a. Furthermore, since the branch from BB to Succ is unconditional, Succ will also dominate all uses of %a. Now, assume that one predecessor of Succ is not dominated by BB (and thus not dominated by Succ). Since at least one use of %a (but in reality all of them) is reachable from Succ, you could end up at a use of %a without passing through it's definition in BB (by coming from X through Succ). This is a contradiction, meaning that our original assumption is wrong. Thus, all predecessors of Succ must also be dominated by BB (and thus also by Succ). This means that moving the phi node %a from BB to Succ does not pose any problems when the two blocks are merged, and any use checks are not needed. llvm-svn: 51478	2008-05-23 09:09:41 +00:00
Matthijs Kooijman	f399bbf980	Indent fix. llvm-svn: 51477	2008-05-23 07:57:02 +00:00
Nick Lewycky	3bf5512d87	Constant integer vectors may also be negated. llvm-svn: 51476	2008-05-23 04:54:45 +00:00
Nick Lewycky	8f3127c5b5	Typo. llvm-svn: 51475	2008-05-23 04:39:38 +00:00
Nick Lewycky	4f3d878507	Revert X + X --> X * 2 optz'n which pessimizes heavily on x86. llvm-svn: 51474	2008-05-23 04:34:58 +00:00
Nick Lewycky	452fb32927	Implement X + X for vectors. llvm-svn: 51472	2008-05-23 04:14:51 +00:00
Nick Lewycky	2ec9a01173	Fix a recently added optimization to not crash on vectors. llvm-svn: 51471	2008-05-23 03:26:47 +00:00
Dan Gohman	6d5f120c5c	Generalize the new code in instcombine's ComputeNumSignBits for handling and/or to handle more cases (such as this add-sitofp.ll testcase), and port it to selectiondag's ComputeNumSignBits. llvm-svn: 51469	2008-05-23 02:28:01 +00:00
Dan Gohman	53b2698531	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51467	2008-05-23 01:52:21 +00:00
Dale Johannesen	fecb88249f	Allow for switch with no cases. Was causing fault in gcc.dg/pr27531-1.c. llvm-svn: 51464	2008-05-23 01:01:31 +00:00
Dan Gohman	30ab45d01e	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51459	2008-05-23 00:17:26 +00:00
Dan Gohman	7a0566b9cd	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51456	2008-05-23 00:12:03 +00:00
Chris Lattner	c5ec1e19eb	rewrite the validity checking for memory promotion to be simpler, more aggressive, and more correct. Verify that we only attempt to promote loads and stores. llvm-svn: 51406	2008-05-22 03:22:42 +00:00
Chris Lattner	f12c08dcd8	Use 'continue' to reduce nesting in this loop. No functionality change. llvm-svn: 51399	2008-05-22 00:53:38 +00:00
Dan Gohman	e62632e0bb	When LSR is replacing an instruction, call ScalarEvolution::deleteValueFromRecords on it before doing the replaceAllUsesWith, because ScalarEvolution looks at the instruction's users to find SCEV references to the instruction's SCEV object in its internal maps. Move all of LSR's loop-related state clearing after processing the loop and before cleaning up dead PHI nodes. This eliminates all of LSR's SCEV references just before the calls to ScalarEvolution::deleteValueFromRecords so that when ScalarEvolution drops its own SCEV references, the reference counts will reach zero and the SCEVs will be deleted immediately. These changes fix some compiler aborts involving ScalarEvolution holding onto and reusing SCEV objects for instructions that have been deleted. No regression test unfortunately; because the symptoms were due to dangling pointers, reduced testcases ended up being fairly arbitrary. llvm-svn: 51359	2008-05-21 00:54:12 +00:00
Dan Gohman	81ab753b14	Port SelectionDAG's ComputeNumSignBits-using code to instcombine, now that instcombine also has ComputeNumSignBits. llvm-svn: 51350	2008-05-20 21:01:12 +00:00
Matthijs Kooijman	5148a4ba66	Fix typo. llvm-svn: 51303	2008-05-20 07:26:45 +00:00
Chris Lattner	7ac943fffd	Teach instcombine 4 new xforms: (add (sext x), cst) --> (sext (add x, cst')) (add (sext x), (sext y)) --> (sext (add int x, y)) (add double (sitofp x), fpcst) --> (sitofp (add int x, intcst)) (add double (sitofp x), (sitofp y)) --> (sitofp (add int x, y)) This generally reduces conversions. For example MiBench/telecomm-gsm gets these simplifications: HACK2: %tmp67.i142.i.i = sext i16 %tmp6.i141.i.i to i32 ; <i32> [#uses=1] %tmp23.i139.i.i = sext i16 %tmp2.i138.i.i to i32 ; <i32> [#uses=1] %tmp8.i143.i.i = add i32 %tmp67.i142.i.i, %tmp23.i139.i.i ; <i32> [#uses=3] HACK2: %tmp67.i121.i.i = sext i16 %tmp6.i120.i.i to i32 ; <i32> [#uses=1] %tmp23.i118.i.i = sext i16 %tmp2.i117.i.i to i32 ; <i32> [#uses=1] %tmp8.i122.i.i = add i32 %tmp67.i121.i.i, %tmp23.i118.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i190.i = sext i16 %tmp6.i.i189.i to i32 ; <i32> [#uses=1] %tmp23.i.i187.i = sext i16 %tmp2.i.i186.i to i32 ; <i32> [#uses=1] %tmp8.i.i191.i = add i32 %tmp67.i.i190.i, %tmp23.i.i187.i ; <i32> [#uses=3] HACK2: %tmp67.i173.i.i.i = sext i16 %tmp6.i172.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i170.i.i.i = sext i16 %tmp2.i169.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i174.i.i.i = add i32 %tmp67.i173.i.i.i, %tmp23.i170.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i152.i.i.i = sext i16 %tmp6.i151.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i149.i.i.i = sext i16 %tmp2.i148.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i153.i.i.i = add i32 %tmp67.i152.i.i.i, %tmp23.i149.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i.i.i = sext i16 %tmp6.i.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i.i5.i.i = sext i16 %tmp2.i.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i.i7.i.i = add i32 %tmp67.i.i.i.i, %tmp23.i.i5.i.i ; <i32> [#uses=3] This also fixes a bug in ComputeNumSignBits handling select and makes it more aggressive with and/or. llvm-svn: 51302	2008-05-20 05:46:13 +00:00
Chris Lattner	9c27f96d04	fix two issues Neil noticed, thanks! llvm-svn: 51296	2008-05-20 03:50:52 +00:00
Dan Gohman	e5572706e8	Refine the fix in r51169 to only apply when the operand val being replaced is a PHI. This prevents it from inserting uses before defs in the case that it isn't a PHI and it depends on other instructions later in the block. This fixes the 447.dealII regression on x86-64. llvm-svn: 51292	2008-05-20 03:01:48 +00:00
Dan Gohman	d717761a2b	Make AssociativeOpt static. llvm-svn: 51290	2008-05-20 01:14:05 +00:00
Devang Patel	ee7bf41c06	Do not erase induction variable increment if it is used outside the loop. llvm-svn: 51280	2008-05-19 22:23:55 +00:00
Dan Gohman	123438cc05	Add a ComputeNumSignBits function for use by instcombine, based on the code in SelectionDAG. llvm-svn: 51279	2008-05-19 22:14:15 +00:00
Chris Lattner	b42712288e	switch to Type::getFPMantissaWidth instead of reinventing it. llvm-svn: 51275	2008-05-19 21:17:23 +00:00
Chris Lattner	ba9acbe6dc	minor cleanups, teach instcombine that sitofp/uitofp cannot produce a negative zero. llvm-svn: 51272	2008-05-19 20:27:56 +00:00
Chris Lattner	e35fe0f1c6	convert fptosi(sitofp x) -> x if the fp value has enough bits in its mantissa to accurately represent the integer. This triggers 9 times in 471.omnetpp, though 8 of those seem to be inlined from the same place. llvm-svn: 51271	2008-05-19 20:25:04 +00:00
Chris Lattner	5920a78034	Fold FP comparisons where one operand is converted from an integer type and the other operand is a constant into integer comparisons. This happens surprisingly frequently (e.g. 10 times in 471.omnetpp), which are things like this: %tmp8283 = sitofp i32 %tmp82 to double %tmp1013 = fcmp ult double %tmp8283, 0.0 Clearly comparing tmp82 against i32 0 is cheaper here. this also triggers 8 times in gobmk, including this one: %tmp375376 = sitofp i32 %tmp375 to double %tmp377 = fcmp ogt double %tmp375376, 8.150000e+01 which is comparing an integer against 81.5 :). llvm-svn: 51268	2008-05-19 20:18:56 +00:00
Chris Lattner	6e70830af9	remove debug output llvm-svn: 51264	2008-05-19 20:03:53 +00:00
Chris Lattner	fc365b60dc	be more aggressive about transforming add -> or when the operands have no intersecting bits. This triggers all over the place, for example in lencode, with adds of stuff like: %tmp580 = mul i32 %tmp579, 2 %tmp582 = and i32 %b8, 1 and %tmp28 = shl i32 %abs.i, 1 %sign.0 = select i1 %tmp23, i32 1, i32 0 and %tmp344 = shl i32 %tmp343, 2 %tmp346 = and i32 %tmp96, 3 etc. llvm-svn: 51263	2008-05-19 20:01:56 +00:00
Duncan Sands	eec7a3c071	Fix PR2341 - when the length is 4 use an i32 not an i16! Cleaned up trailing whitespace while there. llvm-svn: 51240	2008-05-19 09:27:24 +00:00
Nate Begeman	65720c968c	Teach GVN to not assert on vector comparisons llvm-svn: 51230	2008-05-18 19:49:05 +00:00
Chris Lattner	4b2a724fb8	Fix PR2339 llvm-svn: 51226	2008-05-18 04:11:26 +00:00
Nick Lewycky	79376f4e02	Move isTrueWhenEqual to ICmpInst. llvm-svn: 51215	2008-05-17 07:33:39 +00:00
Dale Johannesen	5610dabac9	Less conservative verison of previous patch, suggested by Duncan. llvm-svn: 51211	2008-05-16 23:18:52 +00:00
Dale Johannesen	e7f5bc2c3b	Weak functions not declared non-throwing might be replaced at linktime with a body that throws, even if the body in this file does not. Make PruneEH be more conservative in this case. g++.dg/eh/weak1.C llvm-svn: 51207	2008-05-16 21:31:48 +00:00
Gabor Greif	e1f6e4b21d	API change for {BinaryOperator\|CmpInst\|CastInst}::create*() --> Create. Legacy interfaces will be in place for some time. (Merge from use-diet branch.) llvm-svn: 51200	2008-05-16 19:29:10 +00:00
Duncan Sands	67933e6692	Bill pointed out that system headers should be included after local headers. llvm-svn: 51187	2008-05-16 09:30:00 +00:00
Evan Cheng	173a53f87c	Do not dup malloc, vector instructions, etc. Throttle the default theshold way down. llvm-svn: 51183	2008-05-16 07:55:50 +00:00
Owen Anderson	c7d6eceb69	Remove ADCE's ability to delete loops. This ability is now implemented in a safer manner by loop deletion. llvm-svn: 51182	2008-05-16 04:34:51 +00:00
Owen Anderson	ad5f211b48	Clean ups for loop deletion based on Chris' feedback. Also, use SCEV to determine the trip count of the loop, which is more powerful and accurate that Loop::getTripCount. llvm-svn: 51179	2008-05-16 04:32:45 +00:00
Chris Lattner	5c953b7d27	implement PR2328. llvm-svn: 51176	2008-05-16 02:59:42 +00:00
Dan Gohman	0a0fa7cf78	Fix a bug in LoopStrengthReduce that caused it to emit IR with use-before-def. The problem comes up in code with multiple PHIs where one PHI is being rewritten in terms of the other, but the other needs to be casted first. LLVM rules requre the cast instruction to be inserted after any PHI instructions, but when instructions were inserted to replace the second PHI value with a function of the first, they were ended up going before the cast instruction. Avoid this problem by remembering the location of the cast instruction, when one is needed, and inserting the expansion of the new value after it. This fixes a bug that surfaced in 255.vortex on x86-64 when instcombine was removed from the middle of the loop optimization passes. llvm-svn: 51169	2008-05-15 23:26:57 +00:00
Devang Patel	61724355af	Remove useless check. Patch by Matthijs Kooijman. llvm-svn: 51154	2008-05-15 18:04:29 +00:00
Duncan Sands	783cb2d76d	Use of UINT_MAX requires climits, at least when compiling with gcc 4.3. llvm-svn: 51145	2008-05-15 11:22:50 +00:00
Gabor Greif	697e94cc22	Fix a bunch of 80col violations that arose from the Create API change. Tweak makefile targets to find these better. llvm-svn: 51143	2008-05-15 10:04:30 +00:00
Bill Wendling	3716952f10	Situations can arise when you have a function called that returns a 'void', but is bitcast to return a floating point value. The result of the instruction may not be used by the program afterwards, and LLVM will happily remove all instructions except the call. But, on some platforms, if a value is returned as a floating point, it may need to be removed from the stack (like x87). Thus, we can't get rid of the bitcast even if there isn't a use of the value. llvm-svn: 51134	2008-05-14 22:45:20 +00:00
Chris Lattner	e15051d64b	rename SimplifyCFG.cpp -> SimplifyCFGPass.cpp llvm-svn: 51130	2008-05-14 20:38:44 +00:00
Devang Patel	f2763e233e	Simplify internalize pass. Add test case. Patch by Matthijs Kooijman! llvm-svn: 51114	2008-05-14 20:01:01 +00:00
Dan Gohman	3dc2d92ebd	Split the loop unroll mechanism logic out into a utility function. Patch by Matthijs Kooijman! llvm-svn: 51083	2008-05-14 00:24:14 +00:00
Owen Anderson	17816b321f	Fix Analysis/BasicAA/pure-const-dce.ll. This turned out to be a correctness bug as well as a missed optimization. We weren't properly checking for local dependencies before moving on to non-local ones when doing non-local read-only call CSE. llvm-svn: 51082	2008-05-13 23:18:30 +00:00
Dale Johannesen	e695ab227c	Fix for PR 2323, infinite loop in tail dup. llvm-svn: 51063	2008-05-13 20:06:43 +00:00
Owen Anderson	8c2391d00d	Make the non-local CSE safety checks slightly more thorough. llvm-svn: 51035	2008-05-13 13:41:23 +00:00
Owen Anderson	69057b80c7	Add support for non-local CSE of read-only calls. llvm-svn: 51024	2008-05-13 08:17:22 +00:00
Dan Gohman	0479aa5c0b	Change class' public PassInfo variables to by initialized with the address of the PassInfo directly instead of calling getPassInfo. This eliminates a bunch of dynamic initializations of static data. Also, fold RegisterPassBase into PassInfo, make a bunch of its data members const, and rearrange some code to initialize data members in constructors instead of using setter member functions. llvm-svn: 51022	2008-05-13 02:05:11 +00:00
Nate Begeman	53c5c62d6d	80 col / tabs fixes llvm-svn: 51021	2008-05-13 01:48:26 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Owen Anderson	f792860255	Go back to passing the analyses around as parameters. llvm-svn: 50995	2008-05-12 20:15:55 +00:00
Owen Anderson	4afb1c864a	Move the various analyses used by GVN into static variables so we don't have to keep passing them around or refetching them. llvm-svn: 50963	2008-05-12 08:15:27 +00:00
Chris Lattner	47fed61526	Fix various DOUTs to not call the extremely expensive Value::getName() method. DOUT statements are disabled when assertions are off, but the side effects of getName() are still evaluated. Just call getNameSTart, which is close enough and doesn't cause heap traffic. llvm-svn: 50958	2008-05-11 01:55:59 +00:00
Chris Lattner	82146fa267	Simplify code by using SwitchInst::findCaseValue instead of reimplementing it. llvm-svn: 50957	2008-05-10 23:56:54 +00:00
Chris Lattner	a4ee1f516f	don't sink invokes, even if they are readonly. This fixes a crash on kimwitu++. llvm-svn: 50901	2008-05-09 15:07:33 +00:00
Duncan Sands	437435dcbc	Fix a type and formatting. llvm-svn: 50900	2008-05-09 12:20:10 +00:00
Chris Lattner	aaba10e843	Implement PR2298. This transforms: ~x < ~y --> y < x -x == -y --> x == y llvm-svn: 50882	2008-05-09 05:19:28 +00:00
Chris Lattner	e7f0afe168	restore doxygen comment. llvm-svn: 50881	2008-05-09 04:43:13 +00:00
Gordon Henriksen	829046b0b4	Improve pass documentation and comments. Patch by Matthijs Kooijman! llvm-svn: 50861	2008-05-08 17:46:35 +00:00
Chris Lattner	49a594e6ab	More than just loads can read from memory: readonly calls like strlen also need to be checked for memory modifying instructions before we can sink them. THis fixes the second half of PR2297. llvm-svn: 50860	2008-05-08 17:37:37 +00:00
Chris Lattner	4fa09669d8	Make instcombine's DSE respect loads as well as stores. It is not safe to delete the first store in: store x -> p load p store y -> p This is for PR2297. llvm-svn: 50859	2008-05-08 17:20:30 +00:00
Devang Patel	4758caa926	Check linkage. llvm-svn: 50851	2008-05-08 15:08:39 +00:00
Anton Korobeynikov	fc2edad4ae	Turn StripPointerCast() into a method llvm-svn: 50836	2008-05-07 22:54:15 +00:00
Dan Gohman	5a3eecdfd8	Fix a bug in the ComputeMaskedBits logic for multiply. llvm-svn: 50793	2008-05-07 00:35:55 +00:00
Anton Korobeynikov	82c02b28f3	Make StripPointerCast a common function (should we mak it method of Value instead?) llvm-svn: 50775	2008-05-06 22:52:30 +00:00
Owen Anderson	0e1ab4a9be	We need to update PHIs containing the exiting block, not the exit block. We really should come up with better names for these. llvm-svn: 50770	2008-05-06 20:55:16 +00:00
Devang Patel	7ffc3c9a95	Fix typo. llvm-svn: 50713	2008-05-06 05:40:11 +00:00
Chris Lattner	de68fabb35	fix typo Duncan noticed llvm-svn: 50699	2008-05-06 02:31:18 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Dan Gohman	a8b7e78f54	Remove uses of llvm/System/IncludeFile.h that are no longer needed. llvm-svn: 50695	2008-05-06 01:32:53 +00:00
Dan Gohman	cf0e3acf16	Correct the value of LowBits in srem and urem handling in ComputeMaskedBits. llvm-svn: 50692	2008-05-06 00:51:48 +00:00
Bill Wendling	4ead264c08	Fix: Some classes were derived from a class in an anonymous namespace, but they themselves weren't in the anonymous namespace. llvm-svn: 50673	2008-05-05 21:37:59 +00:00
Chris Lattner	8ed8e3d0e6	Fix a crash when threading a block that includes a MRV call result. DemoteRegToStack doesn't work with MRVs yet, because it relies on the ability to load/store things. This fixes PR2285. llvm-svn: 50667	2008-05-05 20:21:22 +00:00
Torok Edwin	2d7a4d70c3	processStore may delete the instruction, avoid using dyn_cast<> on already freed memory. llvm-svn: 50618	2008-05-04 08:51:25 +00:00
Devang Patel	fa0e3c4a92	Handle multiple return values. llvm-svn: 50604	2008-05-03 01:12:15 +00:00
Devang Patel	a1ec89fbf1	Do not sink getresult. llvm-svn: 50600	2008-05-03 00:36:30 +00:00
Dan Gohman	1962c2be6a	Fix a mistake in the computation of leading zeros for udiv. llvm-svn: 50591	2008-05-02 21:30:02 +00:00
Chris Lattner	5f0563ceb6	strength reduce exp2 into ldexp, rdar://5852514 llvm-svn: 50586	2008-05-02 18:43:35 +00:00
Chris Lattner	a700b2bd0f	add a FIXME so we remember to eventually remove this code. llvm-svn: 50582	2008-05-02 17:18:31 +00:00
Bill Wendling	86ceb0db9c	Porting r50563 from Tak to mainline. llvm-svn: 50564	2008-05-02 00:43:20 +00:00
Dale Johannesen	78ffe6e939	Don't try to create PHIs of struct types. Fallout from x86-64 calling convention work. llvm-svn: 50545	2008-05-01 22:27:44 +00:00
Dan Gohman	4be6ae4e6c	Fix an overaggressive SimplifyDemandedBits optimization on urem. This fixes the 254.gap regression on x86 and the 403.gcc regression on x86-64. llvm-svn: 50537	2008-05-01 19:13:24 +00:00
Chris Lattner	bb41aab426	1) add '-debug' output 2) Return NULL instead of false in several places for tidiness. 3) fix a bug optimizing sprintf(p, "%c", x) llvm-svn: 50521	2008-05-01 06:39:12 +00:00
Chris Lattner	b9b5d6ddaa	Delete the IPO simplify-libcalls and completely reimplement it as a FunctionPass. This makes it simpler, fixes dozens of bugs, adds a couple of minor features, and shrinks is considerably: from 2214 to 1437 lines. llvm-svn: 50520	2008-05-01 06:25:24 +00:00
Owen Anderson	0ced13ccd9	This condition got inverted accidentally. llvm-svn: 50473	2008-04-30 07:16:33 +00:00
Chris Lattner	2dc4426675	move lowering of llvm.memset -> store from simplify libcalls to instcombine. llvm-svn: 50472	2008-04-30 06:39:11 +00:00
Chris Lattner	438e35c4d1	use string length computation to generalize several xforms. llvm-svn: 50464	2008-04-30 03:07:53 +00:00
Owen Anderson	ad5367f8ed	Revert r50441. The original code was correct. Add some more comments so that I don't make the same mistake in the future. llvm-svn: 50446	2008-04-29 21:51:00 +00:00
Owen Anderson	ff7d7b18e5	Fix a bug in memcpyopt where the memcpy-memcpy transform was never being applied because we were checking for it in the wrong order. This caused a miscompilation because the return slot optimization assumes that the call it is dealing with is NOT a memcpy. llvm-svn: 50444	2008-04-29 21:26:06 +00:00
Owen Anderson	f07de734cf	We should be returning true here since we've changed the function. llvm-svn: 50442	2008-04-29 21:02:46 +00:00
Owen Anderson	e674600266	A lot of cleanups and documentation improvements, as well as a few corner case fixes. Most of this was suggested by Chris. llvm-svn: 50441	2008-04-29 20:59:33 +00:00
Owen Anderson	2306a1e098	Rename DeadLoopElimination to LoopDeletion, part 2. llvm-svn: 50437	2008-04-29 20:06:54 +00:00
Owen Anderson	e9f05bd1f0	Rename DeadLoopElimination to LoopDeletion, part one. llvm-svn: 50436	2008-04-29 19:58:07 +00:00
Chris Lattner	d9e3b5c5bd	don't eliminate load from volatile value on paths where the load is dead. This fixes the second half of PR2262 llvm-svn: 50430	2008-04-29 17:28:22 +00:00
Chris Lattner	9233c124c9	fix a subtle volatile handling bug. llvm-svn: 50428	2008-04-29 17:13:43 +00:00
Chris Lattner	92f4702254	Implement more aggressive support for analyzing string length. This generalizes the previous code to handle the case when the string is not an immediate to the strlen call (for example, crazy stuff like strlen(c ? "foo" : "bart"+1) -> 3). This implements gcc.c-torture/execute/builtins/strlen-2.c. I will generalize other cases in simplifylibcalls to use the same routine later. llvm-svn: 50408	2008-04-29 06:56:02 +00:00
Owen Anderson	304ef22f6e	Clarify what we mean by a dead loop. llvm-svn: 50406	2008-04-29 06:34:55 +00:00
Chris Lattner	e331a65c79	don't delete the last store to an alloca if the store is volatile. llvm-svn: 50390	2008-04-29 04:58:38 +00:00
Owen Anderson	586216e5bb	Add some more comments. llvm-svn: 50384	2008-04-29 00:45:15 +00:00
Owen Anderson	41377175d3	Remove debugging code. llvm-svn: 50383	2008-04-29 00:39:24 +00:00
Owen Anderson	94ad702412	Add dead loop elimination, which removes dead loops for which we can compute the trip count. llvm-svn: 50382	2008-04-29 00:38:34 +00:00
Dan Gohman	8cb19d967f	Fix DSE to not eliminate volatile loads with no uses. llvm-svn: 50370	2008-04-28 19:51:27 +00:00
Dan Gohman	72ec3f4562	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Chris Lattner	8be72700b8	Fix PR2256, yet another miscompilation in simplifycfg of i multiple return values. Bill, please pull this into Tak. llvm-svn: 50332	2008-04-28 00:19:07 +00:00
Chris Lattner	2237973438	Implement a signficant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	4793515a9c	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Chris Lattner	67ca6f6347	When SRoA'ing a global variable, make sure the new globals get the appropriate alignment. This fixes a miscompilation of 252.eon on x86-64 (rdar://5891920). Bill, please pull this into Tak. llvm-svn: 50308	2008-04-26 07:40:11 +00:00
Dale Johannesen	0d1d3df564	change comments per review llvm-svn: 50300	2008-04-25 21:16:07 +00:00
Dan Gohman	ca95a5f49f	Remove the code from CodeGenPrepare that moved getresult instructions to the block that defines their operands. This doesn't work in the case that the operand is an invoke, because invoke is a terminator and must be the last instruction in a block. Replace it with support in SelectionDAGISel for copying struct values into sequences of virtual registers. llvm-svn: 50279	2008-04-25 18:27:55 +00:00
Nate Begeman	ca270ad96f	Feedback from chris llvm-svn: 50271	2008-04-25 17:45:52 +00:00
Nick Lewycky	4d43d3c72c	Remove 'unwinds to' support from mainline. This patch undoes r47802 r47989 r48047 r48084 r48085 r48086 r48088 r48096 r48099 r48109 and r48123. llvm-svn: 50265	2008-04-25 16:53:59 +00:00
Nate Begeman	6fed3b2038	Teach the PruningFunctionCloner how to look through loads with ConstantExpression GEPs pointing into constant globals. llvm-svn: 50256	2008-04-25 06:37:06 +00:00
Chris Lattner	f7de528463	Don't infininitely thread branches when a threaded edge goes back to the block, e.g.: Threading edge through bool from 'bb37.us.thread3829' to 'bb37.us' with cost: 1, across block: bb37.us: ; preds = %bb37.us.thread3829, %bb37.us, %bb33 %D1361.1.us = phi i32 [ %tmp36, %bb33 ], [ %D1361.1.us, %bb37.us ], [ 0, %bb37.us.thread3829 ] ; <i32> [#uses=2] %tmp39.us = icmp eq i32 %D1361.1.us, 0 ; <i1> [#uses=1] br i1 %tmp39.us, label %bb37.us, label %bb42.us llvm-svn: 50251	2008-04-25 04:12:29 +00:00
Evan Cheng	608eeef5ce	Adjust inline cost computation to be less aggressive. llvm-svn: 50222	2008-04-24 18:42:47 +00:00
Chris Lattner	97951ac580	code restructuring, not functionality change. llvm-svn: 50203	2008-04-24 00:21:50 +00:00
Chris Lattner	12f1e007f7	Don't replace multiple result of calls with undef, sccp tracks getresult values, not call values in this case. llvm-svn: 50202	2008-04-24 00:19:54 +00:00
Chris Lattner	769203cb03	code cleanup, no functionality change. llvm-svn: 50201	2008-04-24 00:16:28 +00:00
Chris Lattner	86bbf338e5	Split some code out of the main SimplifyCFG loop into its own function. Fix said code to handle merging return instructions together correctly when handling multiple return values. llvm-svn: 50199	2008-04-24 00:01:19 +00:00
Devang Patel	8f83081fea	Check type instead of no. of operands. llvm-svn: 50179	2008-04-23 20:18:29 +00:00
Dale Johannesen	f6e15a4774	Rewrite previous patch to suit Chris's preference. llvm-svn: 50174	2008-04-23 18:34:37 +00:00
Chris Lattner	a82d691caa	simplify code for propagation of constant arguments into callees. llvm-svn: 50142	2008-04-23 06:16:27 +00:00
Chris Lattner	5f1802cfdf	Fix a number of bugs in ipconstantprop, simplify the code, fit in 80 cols, fix read after free bug (PR2238). llvm-svn: 50141	2008-04-23 05:59:23 +00:00
Chris Lattner	5a58a4dc6d	Rewrite multiple return value handling in SCCP. Before, the -sccp pass would turn every getresult instruction into undef. This helps with rdar://5778210 llvm-svn: 50140	2008-04-23 05:38:20 +00:00
Dale Johannesen	493527d8c9	Do not change the type of a ByVal argument to a type of a different size. llvm-svn: 50121	2008-04-23 01:03:05 +00:00
Evan Cheng	1c89ca7295	Don't do: "(X & 4) >> 1 == 2 --> (X & 4) == 4" if there are more than one uses of the shift result. llvm-svn: 50118	2008-04-23 00:38:06 +00:00
Chris Lattner	37e9c187b0	Start doing the significantly useful part of jump threading: handle cases where a comparison has a phi input and that phi is a constant. For example, stuff like: Threading edge through bool from 'bb2149' to 'bb2231' with cost: 1, across block: bb2237: ; preds = %bb2231, %bb2149 %tmp2328.rle = phi i32 [ %tmp2232, %bb2231 ], [ %tmp2232439, %bb2149 ] ; <i32> [#uses=2] %done.0 = phi i32 [ %done.2, %bb2231 ], [ 0, %bb2149 ] ; <i32> [#uses=1] %tmp2239 = icmp eq i32 %done.0, 0 ; <i1> [#uses=1] br i1 %tmp2239, label %bb2231, label %bb2327 or bb38.i298: ; preds = %bb33.i295, %bb1693 %tmp39.i296.rle = phi %struct.ibox* [ null, %bb1693 ], [ %tmp39.i296.rle1109, %bb33.i295 ] ; <%struct.ibox> [#uses=2] %minspan.1.i291.reg2mem.1 = phi i32 [ 32000, %bb1693 ], [ %minspan.0.i288, %bb33.i295 ] ; <i32> [#uses=1] %tmp40.i297 = icmp eq %struct.ibox %tmp39.i296.rle, null ; <i1> [#uses=1] br i1 %tmp40.i297, label %implfeeds.exit311, label %bb43.i301 This triggers thousands of times in spec. llvm-svn: 50110	2008-04-22 21:40:39 +00:00
Chris Lattner	d5425e8f8d	Dig through multiple levels of AND to thread jumps if needed. llvm-svn: 50106	2008-04-22 20:46:09 +00:00
Chris Lattner	3df4c15dc7	Teach jump threading to thread through blocks like: br (and X, phi(Y, Z, false)), label L1, label L2 This triggers once on 252.eon and 6 times on 176.gcc. Blocks in question often look like this: bb262: ; preds = %bb261, %bb248 %iftmp.251.0 = phi i1 [ true, %bb261 ], [ false, %bb248 ] ; <i1> [#uses=4] %tmp270 = icmp eq %struct.rtx_def* %tmp.0.i, null ; <i1> [#uses=1] %bothcond = or i1 %iftmp.251.0, %tmp270 ; <i1> [#uses=1] br i1 %bothcond, label %bb288, label %bb273 In this case, it is clear that it doesn't matter if tmp.0.i is null when coming from bb261. When coming from bb248, it is all that matters. Another random example: check_asm_operands.exit: ; preds = %check_asm_operands.exit.thr_comm, %bb30.i, %bb12.i, %bb6.i413 %tmp.0.i420 = phi i1 [ true, %bb6.i413 ], [ true, %bb12.i ], [ true, %bb30.i ], [ false, %check_asm_operands.exit.thr_comm ; <i1> [#uses=1] call void @llvm.stackrestore( i8* %savedstack ) nounwind %tmp4389 = icmp eq i32 %added_sets_1.0, 0 ; <i1> [#uses=1] %tmp4394 = icmp eq i32 %added_sets_2.0, 0 ; <i1> [#uses=1] %bothcond80 = and i1 %tmp4389, %tmp4394 ; <i1> [#uses=1] %bothcond81 = and i1 %bothcond80, %tmp.0.i420 ; <i1> [#uses=1] br i1 %bothcond81, label %bb4398, label %bb4397 Here is the case from 252.eon: bb290.i.i: ; preds = %bb23.i57.i.i, %bb8.i39.i.i, %bb100.i.i, %bb100.i.i, %bb85.i.i110 %myEOF.1.i.i = phi i1 [ true, %bb100.i.i ], [ true, %bb100.i.i ], [ true, %bb85.i.i110 ], [ true, %bb8.i39.i.i ], [ false, %bb23.i57.i.i ] ; <i1> [#uses=2] %i.4.i.i = phi i32 [ %i.1.i.i, %bb85.i.i110 ], [ %i.0.i.i, %bb100.i.i ], [ %i.0.i.i, %bb100.i.i ], [ %i.3.i.i, %bb8.i39.i.i ], [ %i.3.i.i, %bb23.i57.i.i ] ; <i32> [#uses=3] %tmp292.i.i = load i8* %tmp16.i.i100, align 1 ; <i8> [#uses=1] %tmp293.not.i.i = icmp ne i8 %tmp292.i.i, 0 ; <i1> [#uses=1] %bothcond.i.i = and i1 %tmp293.not.i.i, %myEOF.1.i.i ; <i1> [#uses=1] br i1 %bothcond.i.i, label %bb202.i.i, label %bb301.i.i Factoring out 3 common predecessors. On the path from any blocks other than bb23.i57.i.i, the load and compare are dead. llvm-svn: 50096	2008-04-22 07:05:46 +00:00
Chris Lattner	e369c35a84	refactor some code, no functionality change. llvm-svn: 50094	2008-04-22 06:36:15 +00:00
Chris Lattner	8fb13cbe4e	remove dead code. llvm-svn: 50080	2008-04-22 03:21:48 +00:00
Chris Lattner	c3a439351c	optimize "p != gep p, ..." better. This allows us to compile getelementptr-seteq.ll into: define i1 @test(i64 %X, %S* %P) { %C = icmp eq i64 %X, -1 ; <i1> [#uses=1] ret i1 %C } instead of: define i1 @test(i64 %X, %S* %P) { %A.idx.mask = and i64 %X, 4611686018427387903 ; <i64> [#uses=1] %C = icmp eq i64 %A.idx.mask, 4611686018427387903 ; <i1> [#uses=1] ret i1 %C } And fixes the second half of PR2235. This speeds up the insertion sort case by 45%, from 1.12s to 0.77s. In practice, this will significantly speed up for loops structured like: for (double *P = Base + N; P != Base; --P) ... Which happens frequently for C++ iterators. llvm-svn: 50079	2008-04-22 02:53:33 +00:00
Chris Lattner	bab7bec9c8	fix grammar-o, thanks to Duncan for noticing. llvm-svn: 50047	2008-04-21 18:25:01 +00:00
Owen Anderson	a5b96ecef9	Remove unneeded #include's. llvm-svn: 50035	2008-04-21 07:47:38 +00:00
Owen Anderson	6a7355caa2	Refactor memcpyopt based on Chris' suggestions. Consolidate several functions and simplify code that was fallout from the separation of memcpyopt and gvn. llvm-svn: 50034	2008-04-21 07:45:10 +00:00
Chris Lattner	ad0d42ba15	don't assume that the argument passed to fprintf("%s" is a string. This fixes a crash in opt on 433.milc. llvm-svn: 50023	2008-04-21 03:18:33 +00:00
Chris Lattner	f6236cc2e9	Use the new SplitBlockPredecessors to implement a todo. llvm-svn: 50022	2008-04-21 02:57:57 +00:00
Chris Lattner	a5b11705b6	Move SplitBlockPredecessors out of loopsimplify into BasicBlockUtils.h as a global helper function. At the same type, switch it from taking a vector of predecessors to an arbitrary sequential input. This allows us to switch LoopSimplify to use a SmallVector for various temporary vectors that it passed into SplitBlockPredecessors. llvm-svn: 50020	2008-04-21 01:28:02 +00:00
Chris Lattner	d418b06abf	Move domtree/frontier updating earlier, allowing us to use it to update phi nodes, removing a hack. llvm-svn: 50019	2008-04-21 01:05:08 +00:00
Chris Lattner	96e9e22269	Factor dominator tree and frontier updating into SplitBlockPredecessors instead of doing it after every call. llvm-svn: 50018	2008-04-21 00:54:38 +00:00
Chris Lattner	559c867ece	fit some more code in 80 cols. llvm-svn: 50016	2008-04-21 00:25:49 +00:00
Chris Lattner	aca912d793	simplify code, fit in 80 cols. llvm-svn: 50015	2008-04-21 00:23:14 +00:00
Chris Lattner	38806c3e9c	fit in 80 cols llvm-svn: 50014	2008-04-21 00:19:16 +00:00
Chris Lattner	ff1c6e388c	finish the first cut of a jump threading pass implementation. llvm-svn: 50006	2008-04-20 22:39:42 +00:00
Chris Lattner	567166c0a8	replace a slow and verbose version of Instruction::isUsedOutsideOfBlock with a call to Instruction::isUsedOutsideOfBlock. llvm-svn: 50005	2008-04-20 22:18:22 +00:00
Chris Lattner	9c1f1a82bf	we can only thread blocks when there is a pred we can determine the succ of. llvm-svn: 50003	2008-04-20 21:18:09 +00:00
Chris Lattner	2115722ffa	improve comments, infrastructure, and add some validity checks for threading. Add a cost function. llvm-svn: 50002	2008-04-20 21:13:06 +00:00
Chris Lattner	b3b6007c8b	Add a new Jump Threading pass, which will handle cases such as those in PR2235. Right now the pass is not very effective. :) llvm-svn: 50000	2008-04-20 20:35:01 +00:00
Torok Edwin	ab20784740	g++-4.3 build-fix: CHAR_BIT requires <climits>. llvm-svn: 49989	2008-04-20 08:33:11 +00:00
Chris Lattner	3b18762f40	Switch to using Simplified ConstantFP::get API. llvm-svn: 49977	2008-04-20 00:41:09 +00:00
Chris Lattner	eb6bb803a7	Allow argpromote to promote struct arguments with a specified number of elements. Patch by Matthijs Kooijman! llvm-svn: 49962	2008-04-19 19:50:01 +00:00
Owen Anderson	f9ae76d89c	Make GVN able to remove unnecessary calls to read-only functions again. llvm-svn: 49842	2008-04-17 05:36:50 +00:00
Scott Michel	376acf4aaa	Remove unused variable llvm-svn: 49838	2008-04-17 01:30:44 +00:00
Scott Michel	f66cb3696a	Workaround for PR2207, in which pred_iterator assert gets triggered due to a wee problem in Xcode 2.[45]/gcc 4.0.1. llvm-svn: 49831	2008-04-16 23:46:39 +00:00
Chuck Rose III	c6a47e8a79	VisualStudio project files updated. #include <algorithm> added to make VisualStudio happy. Also had to undefine setjmp because of #include <csetjmp> turning setjmp into _setjmp in VisualStudio. llvm-svn: 49743	2008-04-15 21:27:11 +00:00
Dan Gohman	4fff979a43	Remove unnecessary <sstream> includes. llvm-svn: 49681	2008-04-14 20:40:47 +00:00
Dan Gohman	e36714c0b4	Minor whitespace and comment cleanups. llvm-svn: 49671	2008-04-14 18:26:16 +00:00
Owen Anderson	7629b71dd4	Revert r49614. As Dan pointed out, some of these aren't correct. llvm-svn: 49657	2008-04-14 17:38:21 +00:00
Owen Anderson	1f6fbc4bc3	Replace calls of the form V1->setName(V2->getName()) with V1->takeName(V2), which is significantly more efficient. llvm-svn: 49614	2008-04-13 19:15:17 +00:00
Owen Anderson	1e73f29a7f	Fix PR2213 by simultaneously making GVN more aggressive with the return values of calls and less aggressive with non-readnone calls. llvm-svn: 49516	2008-04-11 05:11:49 +00:00
Dan Gohman	99b7b3f03b	Teach InstCombine's ComputeMaskedBits to handle pointer expressions in addition to integer expressions. Rewrite GetOrEnforceKnownAlignment as a ComputeMaskedBits problem, moving all of its special alignment knowledge to ComputeMaskedBits as low-zero-bits knowledge. Also, teach ComputeMaskedBits a few basic things about Mul and PHI instructions. This improves ComputeMaskedBits-based simplifications in a few cases, but more noticeably it significantly improves instcombine's alignment detection for loads, stores, and memory intrinsics. llvm-svn: 49492	2008-04-10 18:43:06 +00:00
Chris Lattner	a29d2536aa	Disable an xform we've had for a long time, pow(x,0.5) -> sqrt. This is not safe for all inputs. llvm-svn: 49458	2008-04-10 02:07:51 +00:00
Chris Lattner	802134fc02	Generalize getUnaryFloatFunction to handle any FP unary function, automatically figuring out the suffix to use. implement pow(2,x) -> exp2(x). llvm-svn: 49437	2008-04-09 17:48:11 +00:00
Chris Lattner	cca74e5ab9	use the new ConstantFP::get method to make this work with long double and simplify the code. llvm-svn: 49435	2008-04-09 17:17:35 +00:00
Devang Patel	a7dfbc0366	Be conservative if getresult operand is neither call nor invoke. llvm-svn: 49430	2008-04-09 15:58:24 +00:00
Owen Anderson	ef9a6fd5c2	Factor a bunch of functionality related to memcpy and memset transforms out of GVN and into its own pass. llvm-svn: 49419	2008-04-09 08:23:16 +00:00
Owen Anderson	8ee792d1b6	Remove accidentally duplicated code. llvm-svn: 49418	2008-04-09 07:55:01 +00:00
Chris Lattner	b859fb49ed	many cleanups to the pow optimizer. Allow it to handle powf, add support for pow(x, 2.0) -> x*x. llvm-svn: 49411	2008-04-09 00:07:45 +00:00
Devang Patel	8cd2a3ae2a	Fix insert point handling for multiple return values. llvm-svn: 49367	2008-04-08 02:24:08 +00:00
Owen Anderson	ed92b41a39	Add operator= implementations to SparseBitVector, allowing it to be used in GVN. This results in both time and memory savings for GVN. For example, one testcase went from 10.5s to 6s with this patch. llvm-svn: 49345	2008-04-07 17:38:23 +00:00
Duncan Sands	813384951e	Use Intrinsic::getDeclaration in more places. llvm-svn: 49338	2008-04-07 13:45:04 +00:00
Duncan Sands	1416ebf1fe	The "stacksave is not nounwind problem" no longer needs to be fixed here - a previous commit made sure that intrinsics always get the right attributes. So remove no-longer needed code, and while there use Intrinsic::getDeclaration rather than getOrInsertFunction. llvm-svn: 49337	2008-04-07 13:43:58 +00:00
Duncan Sands	fbc6adcc59	Use Intrinsic::getDeclaration to get hold of intrinsics. Fix up the argument type (should be i8, was an array). llvm-svn: 49336	2008-04-07 13:41:19 +00:00
Owen Anderson	0c1e634cbb	Make GVN more memory efficient, particularly on code that contains a large number of allocations, which GVN can't optimize anyways. llvm-svn: 49329	2008-04-07 09:59:07 +00:00
Dale Johannesen	87e484f08b	Mark calls to llvm.stacksave, llvm.stackrestore as nounwind. When such calls are inlined into something else that is invoked, they were getting changed to invokes, which is badness. llvm-svn: 49299	2008-04-07 00:08:48 +00:00
Chris Lattner	a39cfc5c5b	silence a warning when assertions are disabled. llvm-svn: 49283	2008-04-06 21:44:08 +00:00
Gabor Greif	e9ecc68d8f	API changes for class Use size reduction, wave 1. Specifically, introduction of XXX::Create methods for Users that have a potentially variable number of Uses. llvm-svn: 49277	2008-04-06 20:25:17 +00:00
David Greene	586740f401	Iterators folloring a SmallVector erased element are invalidated so don't access cached iterators from after the erased element. Re-apply 49056 with SmallVector support. llvm-svn: 49106	2008-04-02 18:24:46 +00:00
Evan Cheng	ac38d444e2	1. Drop default inline threshold back down to 200. 2. Do not use # of basic blocks as part of the cost computation since it doesn't really figure into function size. 3. More aggressively inline function with vector code. llvm-svn: 49061	2008-04-01 23:59:29 +00:00
Tanya Lattner	052838c55d	Reverting 49056 due to the build being broken. llvm-svn: 49060	2008-04-01 23:41:44 +00:00
David Greene	7f7edc3824	Iterators folloring a SmallVector erased element are invalidated so don't access cached iterators from after the erased element. llvm-svn: 49056	2008-04-01 22:14:23 +00:00
Dale Johannesen	5e4e051c2a	Revert 49006 for the moment. llvm-svn: 49046	2008-04-01 20:00:57 +00:00
Dale Johannesen	7d02cf3c9c	Emit exception handling info for functions which are not marked nounwind, or for all functions when -enable-eh is set, provided the target supports Dwarf EH. llvm-gcc generates nounwind in the right places; other FEs will need to do so also. Given such a FE, -enable-eh should no longer be needed. llvm-svn: 49006	2008-03-31 23:40:23 +00:00
Nate Begeman	f2b0b0eb17	Don't eliminate bitcast instructions that change the type of a pointer llvm-svn: 48971	2008-03-31 00:22:16 +00:00
Chris Lattner	0f760dfe09	Fix "Control reaches the end of non-void function" warnings, patch by David Chisnall. llvm-svn: 48963	2008-03-30 18:22:13 +00:00
Chris Lattner	4311ad2dae	change iterator invalidation avoidance to just move the iterator backward when something changes, instead of moving forward. This allows us to simplify memset lowering, inserting the memset at the end of the range of stuff we're touching instead of at the start. This, in turn, allows us to make use of the addressing instructions already used in the function instead of inserting our own. For example, we now codegen: %tmp41 = getelementptr [8 x i8]* %ref_idx, i32 0, i32 0 ; <i8> [#uses=2] call void @llvm.memset.i64( i8 %tmp41, i8 -1, i64 8, i32 1 ) instead of: %tmp20 = getelementptr [8 x i8]* %ref_idx, i32 0, i32 7 ; <i8> [#uses=1] %ptroffset = getelementptr i8 %tmp20, i64 -7 ; <i8> [#uses=1] call void @llvm.memset.i64( i8 %ptroffset, i8 -1, i64 8, i32 1 ) llvm-svn: 48940	2008-03-29 05:15:47 +00:00
Chris Lattner	ac95515741	make the common case of a single store (which clearly shouldn't be turned into a memset!) faster by avoiding an allocation of an std::list node. llvm-svn: 48939	2008-03-29 04:52:12 +00:00
Chris Lattner	d528b21a65	give form-memset a significantly more sane heuristic, enable it by default. llvm-svn: 48937	2008-03-29 04:36:18 +00:00
Chris Lattner	d62964a7d8	make memset inference significantly more powerful: it can now handle memsets that initialize "structs of arrays" and other store sequences that are not sequential. This is still only enabled if you pass -form-memset-from-stores. The flag is not heavily tested and I haven't analyzed the perf regressions when -form-memset-from-stores is passed either, but this causes no make check regressions. llvm-svn: 48909	2008-03-28 06:45:13 +00:00
Devang Patel	eb1e3fcbe0	PHI->removeIncomingValue may remove PHInode. Increment iterator in advance. llvm-svn: 48890	2008-03-27 17:32:46 +00:00
Evan Cheng	2b72c05992	Handle a special case xor undef, undef -> 0. Technically this should be transformed to undef. But this is such a common idiom (misuse) we are going to handle it. llvm-svn: 48791	2008-03-25 20:07:13 +00:00
Devang Patel	a38f58aa5c	Add incoming value from header only if phi node has any use inside the loop. llvm-svn: 48738	2008-03-24 20:16:14 +00:00
Evan Cheng	3471ae8c5d	Increasing the inline limit from (overly conservative) 200 to 300. Given each BB costs 20 and each instruction costs 5, 200 means a 4 BB function + 24 instructions (actually less because caller's size also contributes to it). Furthermore, double the limit when more than 10% of the callee instructions are vector instructions. Multimedia kernels tend to love inlining. llvm-svn: 48725	2008-03-24 06:37:48 +00:00
Evan Cheng	21a8e3d260	Temporarily disabling memset forming optimization. Add an option. llvm-svn: 48720	2008-03-24 05:28:38 +00:00

... 17 18 19 20 21 ...

5782 Commits