llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	83348f80b6	Fix an instcombine abort on a scalar-to-vector bitcast. This fixes PR4487. llvm-svn: 74646	2009-07-01 21:38:46 +00:00
Dan Gohman	56bd02c55c	Generalize the zext(trunc(t) & C) instcombine to work even with C is not a low-bits mask, and add a similar instcombine for zext((trunc(t) & C) ^ C). llvm-svn: 73705	2009-06-18 16:30:21 +00:00
Dan Gohman	7f836c7c61	Instcombine zext(trunc(x) & mask) to x&mask, even if the trunc has multiple users. llvm-svn: 73656	2009-06-17 23:17:05 +00:00
Eli Friedman	a0fba5319d	PR3439: Correct a silly mistake in the SimplifyDemandedUseBits code for SRem. llvm-svn: 73598	2009-06-17 02:57:36 +00:00
Dan Gohman	0ed7756fbe	Generalize a few more instcombines to be vector/scalar-independent. llvm-svn: 73541	2009-06-16 19:55:29 +00:00
Chris Lattner	945d08d76f	Generalize instcombine's isSafeToLoadUnconditionally() function to ignore readonly calls, and factor it out of instcombine so that it can be used by other passes. Patch by Frits van Bommel! llvm-svn: 73506	2009-06-16 17:23:12 +00:00
Dan Gohman	7ccc52f131	Support vector casts in more places, fixing a variety of assertion failures. To support this, add some utility functions to Type to help support vector/scalar-independent code. Change ConstantInt::get and ConstantFP::get to support vector types, and add an overload to ConstantInt::get that uses a static IntegerType type, for convenience. Introduce a new getConstant method for ScalarEvolution, to simplify common use cases. llvm-svn: 73431	2009-06-15 22:12:54 +00:00
Dan Gohman	4fe64deb7b	Fix old-style type names in comments. llvm-svn: 73362	2009-06-14 23:30:43 +00:00
Dan Gohman	9377b086f5	Don't do (x - (y - z)) --> (x + (z - y)) on floating-point types, because it may round differently. This fixes PR4374. llvm-svn: 73243	2009-06-12 19:23:25 +00:00
Chris Lattner	61797e3291	Fix 4366: store to null in non-default addr space should not be turned into unreachable. llvm-svn: 73195	2009-06-11 17:54:56 +00:00
Eli Friedman	73a83066d5	PR4340: Run SimplifyDemandedVectorElts on insertelement instructions; sometimes it can find simplifications that won't be found otherwise. llvm-svn: 73006	2009-06-06 20:08:03 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Dan Gohman	7b6b5dd954	Don't do the X * 0.0 -> 0.0 transformation in instcombine, because instcombine doesn't know when it's safe. To partially compensate for this, introduce new code to do this transformation in dagcombine, which can use UnsafeFPMath. llvm-svn: 72872	2009-06-04 17:12:12 +00:00
Evan Cheng	a838a40bc4	Fix bug in FoldFCmp_IntToFP_Cst. If inttofp is a uintofp, use unsigned instead of signed integer constant. llvm-svn: 72300	2009-05-22 23:10:53 +00:00
Dan Gohman	bf0002e7c1	Teach ValueTracking a new way to analyze PHI nodes, and and teach Instcombine to be more aggressive about using SimplifyDemandedBits on shift nodes. This allows a shift to be simplified to zero in the included test case. llvm-svn: 72204	2009-05-21 02:28:33 +00:00
Chris Lattner	149546a6a0	calls in nothrow functions can be marked nothrow even if the callee is not known to be nothrow. This allows readnone/readonly functions to be deleted even if we don't know whether the callee can throw. llvm-svn: 71676	2009-05-13 17:39:14 +00:00
Duncan Sands	af9eaa830a	Rename PaddedSize to AllocSize, in the hope that this will make it more obvious what it represents, and stop it being confused with the StoreSize. llvm-svn: 71349	2009-05-09 07:06:46 +00:00
Nick Lewycky	702fbf94a0	This transform requires valid TargetData info. Wrap it in 'if (TD)' in preparation for the day we use null TargetData when no target is specified. llvm-svn: 71210	2009-05-08 06:47:37 +00:00
Dan Gohman	140a6f24f0	Perform constant folding on operands of instructions with non-void types, such as loads and calls. llvm-svn: 71175	2009-05-07 19:43:39 +00:00
Duncan Sands	1efabaaa2a	Allow readonly functions to unwind exceptions. Teach the optimizers about this. For example, a readonly function with no uses cannot be removed unless it is also marked nounwind. llvm-svn: 71071	2009-05-06 06:49:50 +00:00
Dan Gohman	1b5055ab7f	Return null instead of false, as appropriate. llvm-svn: 70054	2009-04-25 17:28:45 +00:00
Dan Gohman	5638e0d642	Add several more icmp simplifications. Transform signed comparisons into unsigned ones when the operands are known to have the same sign bit value. llvm-svn: 70053	2009-04-25 17:12:48 +00:00
Sanjiv Gupta	46c97e626f	Allow i16 type indices to gep. llvm-svn: 69946	2009-04-24 02:37:54 +00:00
Sanjiv Gupta	428d490332	Before trying to introduce/eliminate cast/ext/trunc to make indices type as pointer type, make sure that the pointer size is a valid sequential index type. llvm-svn: 69574	2009-04-20 06:05:54 +00:00
Chris Lattner	eb510d6b3d	Instcombine should not promote whole computation trees to "strange" integer types, unless they are already strange. This prevents it from turning the code produced by SROA into crazy libcalls and stuff that the code generator can't handle. In the attached example, the result was an i96 multiply that caused the x86 backend to assert. Note that if TargetData had an idea of what the legal types are for a target that this could be used to stop instcombine from introducing i64 muls, as Scott wanted. llvm-svn: 68598	2009-04-08 05:41:03 +00:00
Chris Lattner	321741af5f	fix rdar://6762290, a crash compiling cxx filt with clang. llvm-svn: 68500	2009-04-07 05:03:34 +00:00
Evan Cheng	826b6f0f7c	Throttle back "fold select into operand" transformation. InstCombine should not generate selects of two constants unless they are selects of 0 and 1. e.g. define i32 @t1(i32 %c, i32 %x) nounwind { %t1 = icmp eq i32 %c, 0 %t2 = lshr i32 %x, 18 %t3 = select i1 %t1, i32 %t2, i32 %x ret i32 %t3 } was turned into define i32 @t2(i32 %c, i32 %x) nounwind { %t1 = icmp eq i32 %c, 0 %t2 = select i1 %t1, i32 18, i32 0 %t3 = lshr i32 %x, %t2 ret i32 %t3 } For most targets, that means materializing two constants and then a select. e.g. On x86-64 movl %esi, %eax shrl $18, %eax testl %edi, %edi cmovne %esi, %eax ret => xorl %eax, %eax testl %edi, %edi movl $18, %ecx cmovne %eax, %ecx movl %esi, %eax shrl %cl, %eax ret Also, the optimizer and codegen can reason about shl / and / add, etc. by a constant. This optimization will hinder optimizations using ComputeMaskedBits. llvm-svn: 68142	2009-03-31 20:42:45 +00:00
Chris Lattner	c3b2111d97	Fix PR3874 by restoring a condition I removed, but making it more precise than it used to be. llvm-svn: 67662	2009-03-25 00:28:58 +00:00
Chris Lattner	9e94538005	oops, I intended to remove this, not comment it out. Thanks Duncan! llvm-svn: 67657	2009-03-24 23:48:25 +00:00
Chris Lattner	306813cbbb	canonicalize inttoptr and ptrtoint instructions which cast pointers to/from integer types that are not intptr_t to convert to intptr_t then do an integer conversion to the dest type. This exposes the cast to the optimizer. llvm-svn: 67638	2009-03-24 18:35:40 +00:00
Chris Lattner	d9eb41177a	two changes: 1. Make instcombine always canonicalize trunc x to i1 into an icmp(x&1). This exposes the AND to other instcombine xforms and is more of what the code generator expects. 2. Rewrite the remaining trunc pattern match to use 'match', which simplifies it a lot. llvm-svn: 67635	2009-03-24 18:15:30 +00:00
Duncan Sands	1f15ca7c7a	Factorize out a concept - no functionality change. llvm-svn: 67454	2009-03-21 21:27:31 +00:00
Chris Lattner	0a981d1d36	Fix instcombine to not introduce undefined shifts when merging two shifts together. This fixes PR3851. llvm-svn: 67411	2009-03-20 22:41:15 +00:00
Duncan Sands	a09e0afe74	Don't load values out of global constants with weak linkage: the value may be replaced with something different at link time. (Frontends that want to allow values to be loaded out of weak constants can give their constants weak_odr linkage). llvm-svn: 67407	2009-03-20 21:53:29 +00:00
Chris Lattner	595923ff75	Fix PR3826 - InstComb assert with vector shift, by not calling ComputeNumSignBits on a vector. llvm-svn: 67211	2009-03-18 16:32:19 +00:00
Chris Lattner	e549493a55	Remove a condition which is always true. llvm-svn: 67089	2009-03-17 17:55:15 +00:00
Dale Johannesen	c65830519e	One more place where debug info affects codegen. llvm-svn: 66930	2009-03-13 19:23:20 +00:00
Bill Wendling	fa54bc2052	Oops...I committed too much. llvm-svn: 66867	2009-03-13 04:39:26 +00:00
Bill Wendling	b02eadf660	Temporarily XFAIL this test. llvm-svn: 66866	2009-03-13 04:37:11 +00:00
Dale Johannesen	08ccba73a7	Skip interleaved debug info when fast-forwarding through allocations. Apparently the assumption is there is an instruction (terminator?) following the allocation so I am allowing the same assumption. llvm-svn: 66716	2009-03-11 22:19:43 +00:00
Dale Johannesen	703703aacb	Removing a dead debug intrinsic shouldn't trigger another instcombine pass if we weren't going to make one without debug info. llvm-svn: 66576	2009-03-10 21:19:49 +00:00
Chris Lattner	dc35e5b43a	change the MemIntrinsic get/setAlignment method to take an unsigned instead of a Constant*, which is what the clients of it really want. llvm-svn: 66364	2009-03-08 03:59:00 +00:00
Chris Lattner	334268a211	Introduce a new MemTransferInst pseudo class, which is a common parent between MemCpyInst and MemMoveInst, simplify some code to use it. llvm-svn: 66361	2009-03-08 03:37:16 +00:00
Dale Johannesen	78ab338024	Fix another case where debug info was affecting codegen. I convinced myself it was OK to skip all pointer bitcasts here too. llvm-svn: 66122	2009-03-05 02:06:48 +00:00
Dale Johannesen	ad6b47377f	Fix another case where a dbg.declare meant something had 2 uses instead of 1. llvm-svn: 66112	2009-03-05 00:39:02 +00:00
Dale Johannesen	c8b5a6ef7d	Always skip ptr-to-ptr bitcasts when counting, per Chris' suggestion. Slightly faster. llvm-svn: 65999	2009-03-04 01:53:05 +00:00
Dale Johannesen	0365d3b8b5	Make my earlier patch to skip debug intrinsics when counting work; it was only off by 1. llvm-svn: 65993	2009-03-04 01:20:34 +00:00
Dale Johannesen	09c3e8ec00	Instruction counters must skip the bitcasts that feed into llvm.dbg.declare nodes, as well as the debug directives themselves. llvm-svn: 65976	2009-03-03 22:36:47 +00:00
Dale Johannesen	77456b7ab4	When removing a store to an alloca that has only one use, check also for the case where it has two uses, the other being a llvm.dbg.declare. This is needed so debug info doesn't affect codegen. llvm-svn: 65970	2009-03-03 21:26:39 +00:00
Dan Gohman	92b551bc2b	Fix a bunch of Doxygen syntax issues. Escape special characters, and put @file directives on their own comment line. llvm-svn: 65920	2009-03-03 02:55:14 +00:00
Dale Johannesen	0192552340	Don't count DebugInfo instructions in another limit (lest they affect codegen). llvm-svn: 65915	2009-03-03 01:43:03 +00:00
Dale Johannesen	e1bb2f86f9	When sinking an insn in InstCombine bring its debug info with it. Don't count debug info insns against the scan maximum in FindAvailableLoadedValue (lest they affect codegen). llvm-svn: 65910	2009-03-03 01:09:07 +00:00
Duncan Sands	5795a6091d	Fix PR3694: add an instcombine micro-optimization that helps clean up when using variable length arrays in llvm-gcc. llvm-svn: 65832	2009-03-02 09:18:21 +00:00
Nick Lewycky	34709f84d8	Silence compiler warning about use of uninitialized variables (in reality these are always set by reference on the path that uses them.) No functional change. llvm-svn: 65621	2009-02-27 06:37:39 +00:00
Chris Lattner	af618171f4	Fix PR3667 llvm-svn: 65464	2009-02-25 18:20:01 +00:00
Dan Gohman	4f356bb9b0	Fix a ValueTracking rule: RHS means operand 1, not 0. Add a simple ashr instcombine to help expose this code. And apply the fix to SelectionDAG's copy of this code too. llvm-svn: 65364	2009-02-24 02:00:40 +00:00
Zhou Sheng	3a86bcf134	Should reset DBI_Prev if DBI_Next == 0. llvm-svn: 65314	2009-02-23 10:14:11 +00:00
Chris Lattner	d5420f0957	fix some typos that Duncan noticed llvm-svn: 65306	2009-02-23 05:56:17 +00:00
Dan Gohman	648c5e9c99	Revert the part of 64623 that attempted to align the source in a memcpy to match the alignment of the destination. It isn't necessary for making loads and stores handled like the SSE loadu/storeu intrinsics, and it was causing a performance regression in MultiSource/Applications/JM/lencod. The problem appears to have been a memcpy that copies from some highly aligned array into an alloca; the alloca was then being assigned a large alignment, which required codegen to perform dynamic stack-pointer re-alignment, which forced the enclosing function to have a frame pointer, which led to increased spilling. llvm-svn: 65289	2009-02-22 18:06:32 +00:00
Nick Lewycky	d44e80d7fc	Don't sign extend the char when expanding char -> int during load(bitcast(char[4] to i32*)) evaluation. llvm-svn: 65246	2009-02-21 20:50:42 +00:00
Chris Lattner	bef6b2098e	rename a function to indicate that it checks for profitability as well as legality. Make load sinking and gep sinking more careful: we only do it when it won't pessimize loads from the stack. This has the added benefit of not producing code that is unanalyzable to SROA. llvm-svn: 65209	2009-02-21 00:46:50 +00:00
Chris Lattner	24f31a0e59	commit a tweaked version of Daniel's patch for PR3599. We now eliminate all the extensions and all but the one required truncate from the testcase, but the or/and/shift stuff still isn't zapped. llvm-svn: 64809	2009-02-17 20:47:23 +00:00
Dan Gohman	f68d29edd5	Fix EnforceKnownAlignment so that it doesn't ever reduce the alignment of an alloca or global variable. llvm-svn: 64693	2009-02-16 23:02:21 +00:00
Dan Gohman	9cdfd44521	Change these tests to use regular loads instead of llvm.x86.sse2.loadu.dq. Enhance instcombine to use the preferred field of GetOrEnforceKnownAlignment in more cases, so that regular IR operations are optimized in the same way that the intrinsics currently are. llvm-svn: 64623	2009-02-16 00:44:23 +00:00
Nate Begeman	318aea93bf	the two non-mask arguments to a shufflevector must be the same width, but they do not have to be the same width as the result value. llvm-svn: 64335	2009-02-11 22:36:25 +00:00
Mon P Wang	21eb52a74f	Instrcombine should not change load(cast p) to cast(load p) if the cast changes the address space of the pointer. llvm-svn: 64035	2009-02-07 22:19:29 +00:00
Evan Cheng	8542caa3f7	APInt'fy SimplifyDemandedVectorElts so it can analyze vectors with more than 64 elements. llvm-svn: 63631	2009-02-03 10:05:09 +00:00
Chris Lattner	1aafe4cece	reduce indentation, (~XorCST->getValue()).isSignBit() -> isMaxSignedValue() llvm-svn: 63500	2009-02-02 07:15:30 +00:00
Nick Lewycky	f23908151a	Reinstate this optimization to fold icmp of xor when possible. Don't try to turn icmp eq a+x, b+x into icmp eq a, b if a+x or b+x has other uses. This may have been increasing register pressure leading to the bzip2 slowdown. llvm-svn: 63487	2009-01-31 21:30:05 +00:00
Chris Lattner	9e2b9f3234	Fix PR3452 (an infinite loop bootstrapping) by disabling the recent improvements to the EvaluateInDifferentType code. This code works by just inserted a bunch of new code and then seeing if it is useful. Instcombine is not allowed to do this: it can only insert new code if it is useful, and only when it is converging to a more canonical fixed point. Now that we iterate when DCE makes progress, this causes an infinite loop when the code ends up not being used. llvm-svn: 63483	2009-01-31 19:05:27 +00:00
Chris Lattner	76a63ed099	now that all the pieces are in place, teach instcombine's simplifydemandedbits to simplify instructions with multiple uses in contexts where it can get away with it. This allows it to simplify the code in multi-use-or.ll into a single 'add double'. This change is particularly interesting because it will cover up for some common codegen bugs with large integers created due to the recent SROA patch. When working on fixing those bugs, this should be disabled. llvm-svn: 63481	2009-01-31 08:40:03 +00:00
Chris Lattner	3e2cb66c56	simplify/clarify control flow and improve comments, no functionality change. llvm-svn: 63480	2009-01-31 08:24:16 +00:00
Chris Lattner	83c6a141b8	make some fairly meaty internal changes to how SimplifyDemandedBits works. Now, if it detects that "V" is the same as some other value, SimplifyDemandedBits returns the new value instead of RAUW'ing it immediately. This has two benefits: 1) simpler code in the recursive SimplifyDemandedBits routine. 2) it allows future fun stuff in instcombine where an operation has multiple uses and can be simplified in one context, but not all. #2 isn't implemented yet, this patch should have no functionality change. llvm-svn: 63479	2009-01-31 08:15:18 +00:00
Chris Lattner	585cfb2ce7	minor cleanups llvm-svn: 63477	2009-01-31 07:26:06 +00:00
Chris Lattner	94cfb281c3	make sure to set Changed=true when instcombine hacks on the code, not doing so prevents it from properly iterating and prevents it from deleting the entire body of dce-iterate.ll llvm-svn: 63476	2009-01-31 07:04:22 +00:00
Mon P Wang	3537a62704	Fixed optimization of combining two shuffles where the first shuffle inputs has a different number of elements than the output. llvm-svn: 62998	2009-01-26 04:39:00 +00:00
Torok Edwin	f4395ea97a	testcase for PR3381. Also it was an empty struct, not a void after all. llvm-svn: 62920	2009-01-24 17:16:04 +00:00
Torok Edwin	73ff92272f	void* is represented as pointer to empty struct {}. Thus we need to check whether the struct is empty before trying to index into it. This fixes PR3381. llvm-svn: 62918	2009-01-24 11:30:49 +00:00
Chris Lattner	72cd68fe64	Make InstCombineStoreToCast handle aggregates more aggressively, handling the case in Transforms/InstCombine/cast-store-gep.ll, which is a heavily reduced testcase from Clang on x86-64. llvm-svn: 62904	2009-01-24 01:00:13 +00:00
Chris Lattner	77527f5812	Remove uses of uint32_t in favor of 'unsigned' for better compatibility with cygwin. Patch by Jay Foad! llvm-svn: 62695	2009-01-21 18:09:24 +00:00
Dale Johannesen	b5721632ee	Make special cases (0 inf nan) work for frem. Besides APFloat, this involved removing code from two places that thought they knew the result of frem(0., x) but were wrong. llvm-svn: 62645	2009-01-21 00:35:19 +00:00
Chris Lattner	db2d9613d2	Fix PR3335 by not turning a store to one address space into a store to another. llvm-svn: 62351	2009-01-16 20:12:52 +00:00
Chris Lattner	733256fe31	reduce indentation by using early exits, no functionality change. llvm-svn: 62350	2009-01-16 20:08:59 +00:00
Evan Cheng	beac6f8b0c	Clean up previous cast optimization a bit. Also make zext elimination a bit more aggressive: if it's not necessary to emit an AND (i.e. high bits are already zero), it's profitable to evaluate the operand at a different type. llvm-svn: 62297	2009-01-16 02:11:43 +00:00
Evan Cheng	ff716cb342	Eliminate a redundant check. llvm-svn: 62264	2009-01-15 17:09:07 +00:00
Evan Cheng	60e19a46f2	- Teach CanEvaluateInDifferentType of this xform: sext (zext ty1), ty2 -> zext ty2 - Looking at the number of sign bits of the a sext instruction to determine whether new trunc + sext pair should be added when its source is being evaluated in a different type. llvm-svn: 62263	2009-01-15 17:01:23 +00:00
Dan Gohman	59af77376c	Make instcombine ensure that all allocas are explicitly aligned at at least their preferred alignment. llvm-svn: 62176	2009-01-13 20:18:38 +00:00
Duncan Sands	dc020f9c3c	Rename getABITypeSize to getTypePaddedSize, as suggested by Chris. llvm-svn: 62099	2009-01-12 20:38:59 +00:00
Chris Lattner	bd3c7c8b52	Duncan is nervous about undefinedness of % with negatives. I'm not thrilled about 64-bit % in general, so rewrite to use * instead. llvm-svn: 62047	2009-01-11 20:41:36 +00:00
Chris Lattner	b19151686f	do not generated GEPs into vectors where they don't already exist. We should treat vectors as atomic types, not like arrays. llvm-svn: 62046	2009-01-11 20:23:52 +00:00
Chris Lattner	171d2d474f	Make a couple of cleanups to the instcombine bitcast/gep canonicalization transform based on duncan's comments: 1) improve the comment about %. 2) within our index loop make sure the offset stays within the type size, instead of within the abi size. This allows us to reason explicitly about landing in tail padding and means that issues like non-zero offsets into [0 x foo] types don't occur anymore. llvm-svn: 62045	2009-01-11 20:15:20 +00:00
Chris Lattner	5f54d50917	fix typo Duncan noticed. llvm-svn: 61997	2009-01-09 18:31:39 +00:00
Chris Lattner	f50aa6ae5c	Implement rdar://6480391, extending of equality icmp's to avoid a truncation. I noticed this in the code compiled for a routine using std::map, which produced this code: %25 = tail call i32 @memcmp(i8* %24, i8* %23, i32 6) nounwind readonly %.lobit.i = lshr i32 %25, 31 ; <i32> [#uses=1] %tmp.i = trunc i32 %.lobit.i to i8 ; <i8> [#uses=1] %toBool = icmp eq i8 %tmp.i, 0 ; <i1> [#uses=1] br i1 %toBool, label %bb3, label %bb4 which compiled to: call L_memcmp$stub shrl $31, %eax testb %al, %al jne LBB1_11 ## with this change, we compile it to: call L_memcmp$stub testl %eax, %eax js LBB1_11 This triggers all the time in common code, with patters like this: %169 = and i32 %ply, 1 ; <i32> [#uses=1] %170 = trunc i32 %169 to i8 ; <i8> [#uses=1] %toBool = icmp ne i8 %170, 0 ; <i1> [#uses=1] %7 = lshr i32 %6, 24 ; <i32> [#uses=1] %9 = trunc i32 %7 to i8 ; <i8> [#uses=1] %10 = icmp ne i8 %9, 0 ; <i1> [#uses=1] etc llvm-svn: 61985	2009-01-09 07:47:06 +00:00
Chris Lattner	0f7cf1d7e1	Remove some old code that looks like a remanant from signed-types days. llvm-svn: 61984	2009-01-09 07:10:58 +00:00
Chris Lattner	fef138b140	Fix part 3/2 of PR3290, making instcombine zap (gep(bitcast)) when possible. llvm-svn: 61980	2009-01-09 05:44:56 +00:00
Chris Lattner	a784a2ce01	move some code, check to see if the input to the GEP is a bitcast (which is constant time and cheap) before checking hasAllZeroIndices. llvm-svn: 61976	2009-01-09 04:53:57 +00:00
Chris Lattner	2fdcc59bb6	Change m_ConstantInt and m_SelectCst to take their constant integers as template arguments instead of as instance variables, exposing more optimization opportunities to the compiler earlier. llvm-svn: 61776	2009-01-05 23:53:12 +00:00
Bill Wendling	0c04f9fdc3	Revert this transform. It was causing some dramatic slowdowns in a few tests. See PR3266. llvm-svn: 61623	2009-01-04 06:19:11 +00:00
Bill Wendling	0fcff2c203	Fix comment. llvm-svn: 61538	2009-01-01 01:19:59 +00:00
Bill Wendling	aedb54a947	Add transformation: xor (or (icmp, icmp), true) -> and(icmp, icmp) This is possible because of De Morgan's law. llvm-svn: 61537	2009-01-01 01:18:23 +00:00
Nick Lewycky	4bc10c9e77	Remove redundant test for vector-nature. Scan the vector first to see whether our optz'n will apply to it, then build the replacement vector only if needed. llvm-svn: 61279	2008-12-20 16:48:00 +00:00
Nick Lewycky	c3a70ade66	Oops! Left out a line. Simplifying the sdiv might allow further simplifications for our users. llvm-svn: 61196	2008-12-18 06:42:28 +00:00
Nick Lewycky	0f0e63fe73	Make all the vector elements positive in an srem of constant vector. llvm-svn: 61195	2008-12-18 06:31:11 +00:00
Bill Wendling	87beb9b909	Remove some errors that crept in. No functionality change. llvm-svn: 60403	2008-12-02 06:24:20 +00:00
Bill Wendling	790b4bf9a9	Merge two if-statements into one. llvm-svn: 60402	2008-12-02 06:22:04 +00:00
Bill Wendling	5635295266	More styalistic changes. No functionality change. llvm-svn: 60401	2008-12-02 06:18:11 +00:00
Bill Wendling	85de4b35ca	- Remove the buggy -X/C -> X/-C transform. This isn't valid when X isn't a constant. If X is a constant, then this is folded elsewhere. - Added a note to Target/README.txt to indicate that we'd like to implement this when we're able. llvm-svn: 60399	2008-12-02 05:12:47 +00:00
Bill Wendling	5369db5917	Improve comment. llvm-svn: 60398	2008-12-02 05:09:00 +00:00
Bill Wendling	21716dff5e	- Reduce nesting. - No need to do a swap on a canonicalized pattern. No functionality change. llvm-svn: 60397	2008-12-02 05:06:43 +00:00
Bill Wendling	6f71bce4cf	Don't rebuild RHSNeg. Just use the one that's already there. llvm-svn: 60370	2008-12-01 21:06:30 +00:00
Bill Wendling	84f6f2539f	Document what this check is doing. Also, no need to cast to ConstantInt. llvm-svn: 60369	2008-12-01 21:03:43 +00:00
Bill Wendling	e6c87a4952	Use a simple comparison. Overflow on integer negation can only occur when the integer is "minint". llvm-svn: 60366	2008-12-01 19:46:27 +00:00
Bill Wendling	47f733e4ea	Generalize the FoldOrWithConstant method to fold for any two constants which don't have overlapping bits. llvm-svn: 60344	2008-12-01 08:32:40 +00:00
Bill Wendling	22e761b302	Reduce copy-and-paste code by splitting out the code into its own function. llvm-svn: 60343	2008-12-01 08:23:25 +00:00
Bill Wendling	582fe6b0ca	Use m_Specific() instead of double matching. llvm-svn: 60341	2008-12-01 08:09:47 +00:00
Bill Wendling	4eecfb655b	Move pattern check outside of the if-then statement. This prevents us from fiddling with constants unless we have to. llvm-svn: 60340	2008-12-01 07:47:02 +00:00
Chris Lattner	9e6b243428	simplify these patterns using m_Specific. No need to grep for xor in testcase (or is a substring). llvm-svn: 60328	2008-12-01 05:16:26 +00:00
Chris Lattner	084b3a47d3	Change instcombine to use FoldPHIArgGEPIntoPHI to fold two operand PHIs instead of using FoldPHIArgBinOpIntoPHI. In addition to being more obvious, this also fixes a problem where instcombine wouldn't merge two phis that had different variable indices. This prevented instcombine from factoring big chunks of code in 403.gcc. For example: insn_cuid.exit: - %tmp336 = load i32** @uid_cuid, align 4 - %tmp337 = getelementptr %struct.rtx_def* %insn_addr.0.ph.i, i32 0, i32 3 - %tmp338 = bitcast [1 x %struct.rtunion]* %tmp337 to i32* - %tmp339 = load i32* %tmp338, align 4 - %tmp340 = getelementptr i32* %tmp336, i32 %tmp339 br label %bb62 bb61: - %tmp341 = load i32** @uid_cuid, align 4 - %tmp342 = getelementptr %struct.rtx_def* %insn, i32 0, i32 3 - %tmp343 = bitcast [1 x %struct.rtunion]* %tmp342 to i32* - %tmp344 = load i32* %tmp343, align 4 - %tmp345 = getelementptr i32* %tmp341, i32 %tmp344 br label %bb62 bb62: - %iftmp.62.0.in = phi i32* [ %tmp345, %bb61 ], [ %tmp340, %insn_cuid.exit ] + %insn.pn2 = phi %struct.rtx_def* [ %insn, %bb61 ], [ %insn_addr.0.ph.i, %insn_cuid.exit ] + %tmp344.pn.in.in = getelementptr %struct.rtx_def* %insn.pn2, i32 0, i32 3 + %tmp344.pn.in = bitcast [1 x %struct.rtunion]* %tmp344.pn.in.in to i32* + %tmp341.pn = load i32** @uid_cuid + %tmp344.pn = load i32* %tmp344.pn.in + %iftmp.62.0.in = getelementptr i32* %tmp341.pn, i32 %tmp344.pn %iftmp.62.0 = load i32* %iftmp.62.0.in llvm-svn: 60325	2008-12-01 03:42:51 +00:00
Chris Lattner	9d02a70a7d	Teach inst combine to merge GEPs through PHIs. This is really important because it is sinking the loads using the GEPs, but not the GEPs themselves. This triggers 647 times on 403.gcc and makes the .s file much much nicer. For example before: je LBB1_87 ## bb78 LBB1_62: ## bb77 leal 84(%esi), %eax LBB1_63: ## bb79 movl (%eax), %eax ... LBB1_87: ## bb78 movl $0, 4(%esp) movl %esi, (%esp) call L_make_decl_rtl$stub jmp LBB1_62 ## bb77 after: jne LBB1_63 ## bb79 LBB1_62: ## bb78 movl $0, 4(%esp) movl %esi, (%esp) call L_make_decl_rtl$stub LBB1_63: ## bb79 movl 84(%esi), %eax The input code was (and the GEPs are merged and the PHI is now eliminated by instcombine): br i1 %tmp233, label %bb78, label %bb77 bb77: %tmp234 = getelementptr %struct.tree_node* %t_addr.3, i32 0, i32 0, i32 22 br label %bb79 bb78: call void @make_decl_rtl(%struct.tree_node* %t_addr.3, i8* null) nounwind %tmp235 = getelementptr %struct.tree_node* %t_addr.3, i32 0, i32 0, i32 22 br label %bb79 bb79: %iftmp.12.0.in = phi %struct.rtx_def [ %tmp235, %bb78 ], [ %tmp234, %bb77 ] %iftmp.12.0 = load %struct.rtx_def %iftmp.12.0.in llvm-svn: 60322	2008-12-01 02:34:36 +00:00
Bill Wendling	5b902c5b1e	Implement ((A\|B)&1)\|(B&-2) -> (A&1) \| B transformation. This also takes care of permutations of this pattern. llvm-svn: 60312	2008-12-01 01:07:11 +00:00
Eli Friedman	11c15a5de7	Minor cleanup: use getTrue and getFalse where appropriate. No functional change. llvm-svn: 60307	2008-11-30 22:48:49 +00:00
Eli Friedman	55e4becba9	Some minor cleanups to instcombine; no functionality change. Note that the FoldOpIntoPhi call is dead because it's impossible for the first operand of a subtraction to be both a ConstantInt and a PHINode. llvm-svn: 60306	2008-11-30 21:09:11 +00:00
Bill Wendling	de89bc275c	Add instruction combining for ((A&~B)\|(~A&B)) -> A^B and all permutations. llvm-svn: 60291	2008-11-30 13:52:49 +00:00
Bill Wendling	9eef421e12	Implement (A&((~A)\|B)) -> A&B transformation in the instruction combiner. This takes care of all permutations of this pattern. llvm-svn: 60290	2008-11-30 13:08:13 +00:00
Bill Wendling	2fe3229824	Forgot one remaining call to getSExtValue(). llvm-svn: 60289	2008-11-30 12:41:09 +00:00
Bill Wendling	2d2e7861b5	getSExtValue() doesn't work for ConstantInts with bitwidth > 64 bits. Use all APInt calls instead. This fixes PR3144. llvm-svn: 60288	2008-11-30 12:38:24 +00:00
Bill Wendling	7abf352f44	Don't make TwoToExp signed by default. llvm-svn: 60279	2008-11-30 05:29:33 +00:00
Bill Wendling	af200e9237	From Hacker's Delight: "For signed integers, the determination of overflow of xy is not so simple. If x and y have the same sign, then overflow occurs iff xy > 231 - 1. If they have opposite signs, then overflow occurs iff xy < -2*31." In this case, x == -1. llvm-svn: 60278	2008-11-30 05:01:05 +00:00
Bill Wendling	70635adea3	Instcombine was illegally transforming -X/C into X/-C when either X or C overflowed on negation. This commit checks to make sure that neithe C nor X overflows. This requires that the RHS of X (a subtract instruction) be a constant integer. llvm-svn: 60275	2008-11-30 03:42:12 +00:00
Nick Lewycky	4ab50b93c8	Chris prefers icmp/select over udiv! llvm-svn: 60187	2008-11-27 22:41:10 +00:00
Nick Lewycky	69941fd0a0	Add a couple of missed optimizations on integer vectors. Multiply and divide by 1, as well as multiply by -1. llvm-svn: 60182	2008-11-27 20:21:08 +00:00
Chris Lattner	e0d019def6	switch InstCombine::visitLoadInst to use FindAvailableLoadedValue llvm-svn: 60169	2008-11-27 08:56:30 +00:00
Chris Lattner	dd7083452f	reapply Sanjiv's patch to genericize memcpy/memset/memmove to take an arbitrary integer width for the count. llvm-svn: 59823	2008-11-21 16:42:48 +00:00
Bill Wendling	4bce2bff88	Revert r59802. It was breaking the build of llvm-gcc: g++ -m32 -c -g -DIN_GCC -W -Wall -Wwrite-strings -Wmissing-format-attribute -fno-common -mdynamic-no-pic -DHAVE_CONFIG_H -Wno-unused -DTARGET_NAME=\"i386-apple-darwin9.5.0\" -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include ../../llvm-gcc.src/gcc/llvm-types.cpp -o llvm-types.o ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemCpy(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemMove(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemSet(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i64' is not a member of 'llvm::Intrinsic' make[3]: [llvm-convert.o] Error 1 make[3]: * Waiting for unfinished jobs.... rm fsf-funding.pod gcov.pod gfdl.pod cpp.pod gpl.pod gcc.pod make[2]: * [all-stage1-gcc] Error 2 make[1]: * [stage1-bubble] Error 2 make: *** [all] Error 2 llvm-svn: 59809	2008-11-21 09:09:41 +00:00
Sanjiv Gupta	09a203765a	Make mem[cpy,move,set] intrinsics overloaded. llvm-svn: 59802	2008-11-21 07:49:09 +00:00
Nick Lewycky	07d726ec4d	Optimize (x/y)*y into x-(x%y) in general. Div and rem are about the same, and a subtract is cheaper than a multiply. This generalizes an existing transform. llvm-svn: 59800	2008-11-21 07:33:58 +00:00
Devang Patel	7ed6c5317c	If there are two consecutive llvm.dbg.stoppoint calls then it is likely that the optimizer deleted code in between these two intrinsics. Keep only the last llvm.dbg.stoppoint in this case. llvm-svn: 59657	2008-11-19 18:56:50 +00:00
Chris Lattner	44152742a0	simplify a bunch more instcombines to use m_Specific etc. llvm-svn: 59403	2008-11-16 05:38:51 +00:00
Chris Lattner	d397fef50d	factor the code for simplifying (icmp)\|(icmp) into its own function. llvm-svn: 59402	2008-11-16 05:20:07 +00:00
Chris Lattner	909b969b18	do some computation with apints instead of ConstantInts. llvm-svn: 59401	2008-11-16 05:14:43 +00:00
Chris Lattner	feaea9bdf7	merge a check into a place where it is simpler. llvm-svn: 59400	2008-11-16 05:10:52 +00:00
Chris Lattner	269cbd5770	factor a whole bunch of code out into a helper function. llvm-svn: 59398	2008-11-16 05:06:21 +00:00
Chris Lattner	b37b6e7e96	simplify the conditions on two gigantic if's, decreasing indentation a bit. Next step is to factor out into their own helper functions. llvm-svn: 59397	2008-11-16 04:55:20 +00:00
Chris Lattner	f1be285134	simplify some instcombine matches by using m_Specific llvm-svn: 59395	2008-11-16 04:46:19 +00:00
Chris Lattner	fae5e33111	Use new m_SelectCst template to eliminate macros. llvm-svn: 59392	2008-11-16 04:33:38 +00:00
Chris Lattner	569d78cbb5	simplify code. llvm-svn: 59390	2008-11-16 04:26:55 +00:00
Chris Lattner	c3f3b059d0	Handle the case where there is no "not". It is possible it got folded into the select. llvm-svn: 59389	2008-11-16 04:25:26 +00:00
Chris Lattner	5f6d9a313b	factor a bunch of copy/paste code out into a helper function. Eliminate the cases checking for cond?0:-1, since that is already handled by commutative checking. llvm-svn: 59388	2008-11-16 04:24:12 +00:00
Chris Lattner	68d2da2a19	rearrange some code, no functionality change. llvm-svn: 59381	2008-11-16 03:56:24 +00:00
Chris Lattner	e02c7c7ad2	if we're going to use a macro, use it maximally. no functionality change. llvm-svn: 59380	2008-11-16 03:54:57 +00:00
Bill Wendling	7ef7314d1a	Third time's a charm. The previous patches didn't match correctly. Also, we need to make sure that the conditional is the same before doing the transformation. llvm-svn: 58978	2008-11-10 06:59:06 +00:00
Mon P Wang	25f0106fd9	Added support for the following definition of shufflevector <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> llvm-svn: 58964	2008-11-10 04:46:22 +00:00
Bill Wendling	4fb13c051d	Correction for the last patch. Should match the conditional in the first part of the select match, not the select instruction itself. llvm-svn: 58947	2008-11-09 23:37:53 +00:00
Bill Wendling	1579287550	The method of doing the matching with a 'select' instruction was wrong. The original code was matching like this: if (match(A, m_Not(m_Value(B)))) B was already matched as a 'select' instruction. However, this isn't matching what we think it's matching. It would match B as a 'Value', so basically anything would match to it. In this case, a Constant matched. B was replaced with a constant representation. And then the wrong value would be used in the SelectInst::Create statement, causing a crash. After thinking on this for a moment, and after Nick L. told me how the pattern matching stuff was supposed to work, the solution was to match NOT an m_Value, but an m_Select. llvm-svn: 58946	2008-11-09 23:17:42 +00:00
Bill Wendling	3f547be28f	If the LHS of the FCMP is coming from a UIToFP instruction, then we don't want to generate signed ICMP instructions to replace the FCMP. This would violate the following: define i1 @test1(i32 %val) { %1 = uitofp i32 %val to double %2 = fcmp ole double %1, 0.000000e+00 ret i1 %2 } would be transformed into: define i1 @test1(i32 %val) { %1 = icmp slt i33 %val, 1 ret i1 %1 } which is obviously wrong. This patch modifes InstCombiner::FoldFCmp_IntToFP_Cst to handle when the LHS comes from UIToFP. llvm-svn: 58929	2008-11-09 04:26:50 +00:00
Mon P Wang	5ca2ec65bd	Fixed scalarizing an extract subvector and prevent an infinite loop when simplify a vector. llvm-svn: 58820	2008-11-06 22:52:21 +00:00
Nick Lewycky	8d8acf327b	Fix demanded bits analysis with srem by negative number. Based on a patch by Richard Osborne. llvm-svn: 58555	2008-11-02 02:41:50 +00:00
Dan Gohman	83eea0b17f	Fix this recently moved code to use the correct type. CI is now a ConstantInt, and SI is the original cast instruction. This fixes PR2996. llvm-svn: 58549	2008-11-02 00:17:33 +00:00
Dan Gohman	13cbcf1c18	Canonicalize sext(i1) to i1?-1:0, and update various instcombine optimizations accordingly. llvm-svn: 58457	2008-10-30 20:40:10 +00:00
Dan Gohman	2c34c130bf	(A & sext(C)) \| (B & ~sext(C) -> C ? A : B llvm-svn: 58351	2008-10-28 22:38:57 +00:00
Dan Gohman	bc0278400c	Teach instcombine's visitLoad to scan back several instructions to find opportunities for store-to-load forwarding or load CSE, in the same way that visitStore scans back to do DSE. Also, define a new helper function for testing whether the addresses of two memory accesses are known to have the same value, and use it in both visitStore and visitLoad. These two changes allow instcombine to eliminate loads in code produced by front-ends that frequently emit obviously redundant addressing for memory references. llvm-svn: 57608	2008-10-15 23:19:35 +00:00
Evan Cheng	d885f6e139	Combine (fcmp cc0 x, y) \| (fcmp cc1 x, y) into a single fcmp when possible. llvm-svn: 57515	2008-10-14 18:44:08 +00:00
Evan Cheng	ce70752b11	- Somehow I forgot about one / une. - Renumber fcmp predicates to match their icmp counterparts. - Try swapping operands to expose more optimization opportunities. llvm-svn: 57513	2008-10-14 18:13:38 +00:00
Evan Cheng	67786cce66	Optimize anding of two fcmp into a single fcmp if the operands are the same. e.g. uno && ueq -> ueq ord && olt -> olt ord && ueq -> oeq llvm-svn: 57507	2008-10-14 17:15:11 +00:00
Matthijs Kooijman	f7d3cb5435	Make InstructionCombining::getBitCastOperand() recognize GEP instructions and constant expression with all zero indices as being the same as a bitcast. llvm-svn: 57442	2008-10-13 15:17:01 +00:00
Chris Lattner	da435910e8	Fix PR2697 by rewriting the '(X / pos) op neg' logic. This also changes a couple other cases for clarity, but shouldn't affect correctness. Patch by Eli Friedman! llvm-svn: 57387	2008-10-11 22:55:00 +00:00
Dale Johannesen	4f0bd68cfe	Add a "loses information" return value to APFloat::convert and APFloat::convertToInteger. Restore return value to IEEE754. Adjust all users accordingly. llvm-svn: 57329	2008-10-09 23:00:39 +00:00
Chris Lattner	42d5785dbd	Add parentheses to avoid warnings in GCC 4.4.0, patch by Samuel Tardieu! llvm-svn: 57288	2008-10-08 06:42:28 +00:00
Chris Lattner	917a6c1343	rewrite bswap matching to be more general, allowing arbitrary shifting and masking inside a bswap expr. This allows it to handle the cases from PR2842, which involve the intermediate 'or' expressions being shifted, not just the input value. llvm-svn: 57095	2008-10-05 02:13:19 +00:00
Chris Lattner	ca91f265c4	fix a bug where the bswap matcher could match a case involving ashr. It should only apply to lshr. llvm-svn: 57089	2008-10-05 00:50:57 +00:00
Duncan Sands	d65a4daeea	Factorize code: remove variants of "strip off pointer bitcasts and GEP's", and centralize the logic in Value::getUnderlyingObject. The difference with stripPointerCasts is that stripPointerCasts only strips GEPs if all indices are zero, while getUnderlyingObject strips GEPs no matter what the indices are. llvm-svn: 56922	2008-10-01 15:25:41 +00:00
Nick Lewycky	e8ced3ec19	Fix misoptimization of: xor i1 (icmp eq (X, C1), icmp s[lg]t (X, C2)) llvm-svn: 56834	2008-09-30 06:08:34 +00:00
Devang Patel	a05633e105	Now Attributes are divided in three groups - return attributes - inreg, zext and sext - parameter attributes - function attributes - nounwind, readonly, readnone, noreturn Return attributes use 0 as the index. Function attributes use ~0U as the index. This patch requires corresponding changes in llvm-gcc and clang. llvm-svn: 56704	2008-09-26 22:53:05 +00:00
Devang Patel	4c758ea3e0	Large mechanical patch. s/ParamAttr/Attribute/g s/PAList/AttrList/g s/FnAttributeWithIndex/AttributeWithIndex/g s/FnAttr/Attribute/g This sets the stage - to implement function notes as function attributes and - to distinguish between function attributes and return value attributes. This requires corresponding changes in llvm-gcc and clang. llvm-svn: 56622	2008-09-25 21:00:45 +00:00
Devang Patel	6402c7236f	s/ParamAttrsWithIndex/FnAttributeWithIndex/g llvm-svn: 56535	2008-09-24 00:55:02 +00:00
Devang Patel	ba3fa6c6e1	s/ParameterAttributes/Attributes/g llvm-svn: 56513	2008-09-23 23:03:40 +00:00
Dan Gohman	dafa9c6e85	Improve instcombine's handling of integer min and max in two ways: - Recognize expressions like "x > -1 ? x : 0" as min/max and turn them into expressions like "x < 0 ? 0 : x", which is easily recognizable as a min/max operation. - Refrain from folding expression like "y/2 < 1" to "y < 2" when the comparison is being used as part of a min or max idiom, like "y/2 < 1 ? 1 : y/2". In that case, the division has another use, so folding doesn't eliminate it, and obfuscates the min/max, making it harder to recognize as a min/max operation. These benefit ScalarEvolution, CodeGen, and anything else that wants to recognize integer min and max. llvm-svn: 56246	2008-09-16 18:46:06 +00:00
Dan Gohman	eff71f2953	On 64-bit targets, change 32-bit getelementptr indices to be 64-bit getelementptr indices, inserting an explicit cast if necessary. This helps expose the sign-extension operation to other optimizations. llvm-svn: 56133	2008-09-11 23:06:38 +00:00
Dan Gohman	7d01c0654c	Fix a vectorshuffle instcombine bug introduced by r55995. Patch by Nicolas Capens! llvm-svn: 56129	2008-09-11 22:47:57 +00:00
Dan Gohman	9b9d547a5c	Fix a copy+paste bug that Duncan spotted. For several cases it was still getting lucky and detecting overflow but it was clearly incorrect. llvm-svn: 56113	2008-09-11 18:53:02 +00:00
Dan Gohman	9d9a4be588	In my analysis for r56076 I missed the case where the original multiplication overflows. llvm-svn: 56082	2008-09-11 00:25:00 +00:00
Dan Gohman	c1ae01688f	Fix an icmp+sdiv optimization to check for and handle an overflow condition. This fixes PR2740. llvm-svn: 56076	2008-09-10 23:30:57 +00:00
Dan Gohman	97f0a0f28d	Fix a warning about comparing signed and unsigned values. llvm-svn: 56040	2008-09-10 01:09:32 +00:00
Dan Gohman	86fb5b48de	Make SimplifyDemandedVectorElts simplify vectors with multiple users, and teach it about shufflevector instructions. Also, fix a subtle bug in SimplifyDemandedVectorElts' insertelement code. This is a patch that was originally written by Eli Friedman, with some fixes and cleanup by me. llvm-svn: 55995	2008-09-09 18:11:14 +00:00
Dan Gohman	a79db30d28	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Nick Lewycky	2fcb26cc75	Don't apply this transform to vectors. Fixes PR2756. llvm-svn: 55690	2008-09-03 06:24:21 +00:00
Nick Lewycky	99f4558117	Revert r54876 r54877 r54906 and r54907. Evan found that these caused a 20% slowdown in bzip2. llvm-svn: 55113	2008-08-21 05:56:10 +00:00
Evan Cheng	f5a7e51c81	Silence a compiler warning. llvm-svn: 55087	2008-08-20 23:36:48 +00:00
Mon P Wang	1b2c061b73	Fixed shuffle optimizations to handle non power of 2 vectors llvm-svn: 55035	2008-08-20 02:23:25 +00:00
Nick Lewycky	75d4a83f2f	Make this comment clearer. Instead of using an ambiguous ~ (not) on an icmp predicate, swap the order of the operands. llvm-svn: 54907	2008-08-17 20:02:02 +00:00
Nick Lewycky	53b44029d6	Consider the case where xor by -1 and xor by 128 have been combined already to produce an xor by 127. llvm-svn: 54906	2008-08-17 19:58:24 +00:00
Nick Lewycky	18c6f56c76	I found a better place for this optz'n. llvm-svn: 54877	2008-08-17 07:54:14 +00:00
Nick Lewycky	18f50b2637	Xor'ing both sides of icmp by sign-bit is equivalent to swapping signedness of the predicate. Also, make this optz'n apply in more cases where it's safe to do so. llvm-svn: 54876	2008-08-17 07:34:14 +00:00
Chris Lattner	1d23915a8f	use smallvector instead of vector for a couple worklists. This speeds up instcombine by ~10% on some testcases. llvm-svn: 54811	2008-08-15 04:03:01 +00:00
Dan Gohman	6134fbccef	Fix a bogus srem rule - a negative value srem'd by a power-of-2 can have a non-negative result; for example, -16%16 is 0. Also, clarify the related comments. This fixes PR2670. llvm-svn: 54767	2008-08-13 23:12:35 +00:00
Chris Lattner	2aa0ff27aa	Implement support for simplifying vector comparisons by 0.0 and 1.0 like we do for scalars. Patch contributed by Nicolas Capens This also generalizes the previous xforms to work on long double, now that isExactlyValue works for long double. llvm-svn: 54653	2008-08-11 22:06:05 +00:00
Dan Gohman	ac22cfcae9	Fix a shufflevector instcombine that was emitting invalid masks indices when it meant to be emitting undef indices. llvm-svn: 54417	2008-08-06 18:17:32 +00:00
Chris Lattner	f5b353c1fd	optimize a common idiom generated by clang for bitfield access, PR2638. llvm-svn: 54408	2008-08-06 07:35:52 +00:00
Chris Lattner	7bdaecb7f4	Zap sitofp/fptoui pairs. In all cases when the sign difference matters, the result is undefined anyway. llvm-svn: 54396	2008-08-06 05:13:06 +00:00
Nick Lewycky	bf42893567	Reinstate this optimization, but without the miscompile. Thanks to Bill for tracking down that this was breaking llvm-gcc bootstrap on Linux. llvm-svn: 54394	2008-08-06 04:54:03 +00:00
Bill Wendling	ee12a7aeff	Revert r53282. This was causing a miscompile on Linux. Also, the transformation looks bogus. Please see PR2629 for details on why this is breaking things. llvm-svn: 54372	2008-08-05 21:23:45 +00:00
Nate Begeman	fecbc8cff1	Add vector shifts to the IR, patch by Eli Friedman. CodeGen & Clang work coming next. llvm-svn: 54161	2008-07-29 15:49:41 +00:00
Dan Gohman	fa1211f69b	Enable first-class aggregates support. Remove the GetResultInst instruction. It is still accepted in LLVM assembly and bitcode, where it is now auto-upgraded to ExtractValueInst. Also, remove support for return instructions with multiple values. These are auto-upgraded to use InsertValueInst instructions. The IRBuilder still accepts multiple-value returns, and auto-upgrades them to InsertValueInst instructions. llvm-svn: 53941	2008-07-23 00:34:11 +00:00
Chris Lattner	c600c53d1f	Fix PR2553 llvm-svn: 53715	2008-07-17 06:07:20 +00:00
Matthijs Kooijman	c1d7477ed2	Redo InstCombiner::visitExtractValueInst. Instead of using the (complicate) FindInsertedValue, it now performs a number of simple transformations that should result in the same effect when applied iteratively. llvm-svn: 53673	2008-07-16 12:55:45 +00:00
Evan Cheng	c97094552c	Fix PR2296. Do not transform x86_sse2_storel_dq into a full-width store. llvm-svn: 53666	2008-07-16 07:28:14 +00:00
Chris Lattner	16395e51f4	Fix PR2506 by being a bit more careful about reverse fact propagation when disproving a condition. This actually compiles the existing testcase (udiv_select_to_select_shift) to: define i64 @test(i64 %X, i1 %Cond) { entry: %divisor1.t = lshr i64 %X, 3 ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %divisor1.t, %quotient2 ; <i64> [#uses=1] ret i64 %sum } instead of: define i64 @test(i64 %X, i1 %Cond) { entry: %quotient1.v = select i1 %Cond, i64 3, i64 4 ; <i64> [#uses=1] %quotient1 = lshr i64 %X, %quotient1.v ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %quotient1, %quotient2 ; <i64> [#uses=1] ret i64 %sum } llvm-svn: 53534	2008-07-14 00:15:52 +00:00
Nick Lewycky	f76aa23b54	Enhance analysis of srem. Remove dead code analyzing urem. 'urem' of power-of-2 is canonicalized to an 'and' instruction. llvm-svn: 53506	2008-07-12 05:04:38 +00:00
Nick Lewycky	45e127ab20	Document 'mask' in this calculation. llvm-svn: 53454	2008-07-11 08:16:26 +00:00
Nick Lewycky	da405e1155	Remove misleading constant from comment. llvm-svn: 53452	2008-07-11 07:36:19 +00:00
Nick Lewycky	f95b64acaa	Add another optimization from PR2330. Also catch some missing cases that are similar. llvm-svn: 53451	2008-07-11 07:20:53 +00:00
Chris Lattner	3994bed1a9	a missed optimization that Eli spotted llvm-svn: 53449	2008-07-11 06:40:29 +00:00
Chris Lattner	13a6911ea2	another bug in the same line. llvm-svn: 53448	2008-07-11 06:38:16 +00:00
Chris Lattner	de89b507dd	fix a bug spotted by Eli's eagle eyes llvm-svn: 53447	2008-07-11 06:36:01 +00:00
Chris Lattner	bd25b8507c	simplify and merge a bunch of code. Instead of comparing against the min/max values for an integer type, compare against the min/max values we can prove contain the input. This might be a tighter bound, so this is general goodness. llvm-svn: 53446	2008-07-11 05:40:05 +00:00
Chris Lattner	38a50c9528	fold away (x <= cst) earlier, allowing us to not have to handle them in some code. llvm-svn: 53445	2008-07-11 05:08:55 +00:00
Chris Lattner	6af608b8ce	Fix folding of icmp's of i1 where the comparison is signed. The code was using the algorithm for folding unsigned comparisons which is completely wrong. This has been broken since the signless types change. llvm-svn: 53444	2008-07-11 04:20:58 +00:00
Chris Lattner	4fa8bb3430	Fix a bogus optimization: folding (slt (zext i1 A to i32), 1) -> (slt i1 A, true) This cause a regression in InstCombine/JavaCompare, which was doing the right thing on accident. To handle the missed case, generalize the comparisons based on masked bits a little bit to handle comparisons against the max value. For example, we can now xform (slt i32 (and X, 4), 4) -> (setne i32 (and X, 4), 4) llvm-svn: 53443	2008-07-11 04:09:09 +00:00
Nick Lewycky	6193a564ab	Fix overzealous optimization. Thanks to Duncan Sands for pointing out my error! llvm-svn: 53393	2008-07-10 05:51:40 +00:00
Nick Lewycky	bb89c2a3f6	Simplify, suggested by Chris Lattner. llvm-svn: 53283	2008-07-09 07:35:26 +00:00
Nick Lewycky	f9c27c343a	Fold (a < 8) && (b < 8) into (a\|b) < 8 for unsigned less or greater than. llvm-svn: 53282	2008-07-09 07:29:11 +00:00
Nick Lewycky	364661c43e	Fold ((1 << a) & 1) to (a == 0). llvm-svn: 53276	2008-07-09 05:20:13 +00:00
Nick Lewycky	0d3645e673	Reduce x - y to -y when we know the 'x' part will get masked off anyways. llvm-svn: 53271	2008-07-09 04:32:37 +00:00
Chris Lattner	501d78fdc0	Fix PR2496, a really nasty bug which involved sinking volatile loads into phis. This is actually the same bug as PR2262 / 2008-04-29-VolatileLoadDontMerge.ll, but I missed checking the first predecessor for multiple successors. Testcase here: InstCombine/2008-07-08-VolatileLoadMerge.ll llvm-svn: 53240	2008-07-08 17:18:32 +00:00
Dan Gohman	38740a98b2	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Nick Lewycky	9f1a4dc672	Fix missed optimization opportunity when analyzing cast of mul and select. llvm-svn: 53151	2008-07-05 21:19:34 +00:00
Evan Cheng	da3db11db3	- Re-apply 52748 and friends with fix. GetConstantStringInfo() returns an empty string for ConstantAggregateZero case which surprises selectiondag. - Correctly handle memcpy from constant string which is zero-initialized. llvm-svn: 52891	2008-06-30 07:31:25 +00:00
Anton Korobeynikov	a7c583d584	Revert (52748 and friends): Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. This unbreaks llvm-gcc bootstrap. llvm-svn: 52884	2008-06-29 17:57:03 +00:00
Eric Christopher	d0ab9c47e6	Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. llvm-svn: 52748	2008-06-26 00:31:12 +00:00
Chris Lattner	c9c81fb0df	Fix PR2488, a case where we deleted stack restores too aggressively. llvm-svn: 52702	2008-06-25 05:59:28 +00:00
Eli Friedman	d3449df326	Fix for PR2479: correctly optimize expressions like (a > 13) & (a == 15). See also PR1800, which is about the signed case. llvm-svn: 52608	2008-06-21 23:36:13 +00:00
Chris Lattner	f3ecd2d290	Fix PR2471, which is a bug involving an invalid promotion from a conditional load. llvm-svn: 52525	2008-06-20 05:12:56 +00:00
Bill Wendling	cd6fb1d0a8	Remove dead code causing a warning. llvm-svn: 52502	2008-06-19 18:00:44 +00:00
Dan Gohman	ed2250990a	Use Instruction::moveBefore instead of manipulating the instruction list directly. llvm-svn: 52498	2008-06-19 17:47:47 +00:00
Chris Lattner	78119b4742	Fix the regressions on sext-misc.ll my patch yesterday caused. llvm-svn: 52466	2008-06-18 18:11:55 +00:00
Chris Lattner	ef36dcd10b	implement some simple bswap optimizations, rdar://5992453 llvm-svn: 52442	2008-06-18 04:33:20 +00:00
Chris Lattner	b5ee8b3e89	make truncate/sext elimination capable of changing phi's. This implements rdar://6013816 and the testcase in Transforms/InstCombine/sext-misc.ll. llvm-svn: 52440	2008-06-18 04:00:49 +00:00
Duncan Sands	4b50fde2c4	Fix typo that changed the logic to something wrong. Spotted by Nick Lewycky. llvm-svn: 52411	2008-06-17 15:55:30 +00:00
Matthijs Kooijman	86cda9e050	Pass around Instruction* instead of Instruction& in FindInsertedValue and friends. llvm-svn: 52318	2008-06-16 13:13:08 +00:00
Matthijs Kooijman	5cb387735d	80 column fixes. llvm-svn: 52316	2008-06-16 12:57:37 +00:00
Matthijs Kooijman	e92e18be5a	Move FindScalarValue from InstructionCombining.cpp to ValueTracking.cpp. While I'm at it, rename it to FindInsertedValue. The only functional change is that newly created instructions are no longer added to instcombine's worklist, but that is not really necessary anyway (and I'll commit some improvements next that will completely remove the need). llvm-svn: 52315	2008-06-16 12:48:21 +00:00
Eli Friedman	5de0a77a9b	Don't skip over instructions other than loads that might read memory when trying to sink stores. llvm-svn: 52259	2008-06-13 22:02:12 +00:00
Eli Friedman	9833a1b407	Make sure SimplifyStoreAtEndOfBlock doesn't mess with loops; the structure checks are incorrect if the blocks aren't distinct. Fixes PR2435. llvm-svn: 52257	2008-06-13 21:17:49 +00:00
Gabor Greif	431e9560b7	fix a minor deviation from the original in my previous commit llvm-svn: 52247	2008-06-12 21:51:29 +00:00
Gabor Greif	f6d8e77027	op_iterator-ify some loops, low hanging fruit only, there is more llvm-svn: 52246	2008-06-12 21:37:33 +00:00
Matthijs Kooijman	b2fc72bfbf	Teach instruction combining about the extractvalue. It can succesfully fold useless insert-extract chains, similar to how it folds them for vectors. Add a testcase for this. llvm-svn: 52217	2008-06-11 14:05:05 +00:00
Matthijs Kooijman	3453c7bcb5	Clarify a comment. llvm-svn: 52212	2008-06-11 09:00:12 +00:00
Chris Lattner	b4866ef30c	Limit the icmp+phi merging optimization to the cases where it is profitable: don't make i1 phis when it won't be possible to eliminate them. llvm-svn: 52097	2008-06-08 20:52:11 +00:00
Zhou Sheng	1152ca9101	As Chris suggested, handle the situation if ShAmt larger than BitWidth, otherwise, opt might crash. llvm-svn: 52041	2008-06-06 08:32:05 +00:00
Zhou Sheng	fbe1dc240c	If BitWidth equals to ShtAmt, the RHSKnownZero[BitWidth-ShiftAmt-1] will crash the opt. Just fix this. Test case in llvm/test/Transforms/InstCombine/2008-06-05-ashr-crash.ll llvm-svn: 52003	2008-06-05 14:23:44 +00:00

... 3 4 5 6 7 ...

1418 Commits