llvm-project

Commit Graph

Author	SHA1	Message	Date
Zhou Sheng	fbe1dc240c	If BitWidth equals to ShtAmt, the RHSKnownZero[BitWidth-ShiftAmt-1] will crash the opt. Just fix this. Test case in llvm/test/Transforms/InstCombine/2008-06-05-ashr-crash.ll llvm-svn: 52003	2008-06-05 14:23:44 +00:00
Chris Lattner	a12a6de683	move CannotBeNegativeZero to ValueTracking. Simplify some signbit comparisons. llvm-svn: 51864	2008-06-02 01:29:46 +00:00
Chris Lattner	965c769b3c	move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits out of instcombine into a new file in libanalysis. This also teaches ComputeNumSignBits about the number of sign bits in a constantint. llvm-svn: 51863	2008-06-02 01:18:21 +00:00
Duncan Sands	0397cd2ec4	When simplifying a call to a bitcast function, tighten up the conditions for performing the transform when only the function declaration is available: no longer allow turning i32 into i64 for example. Only allow changing between pointer types, and between pointer types and integers of the same size. For return values ptr -> intptr was already allowed; I added ptr -> ptr and intptr -> ptr while there. As shown by a recent objc testcase, changing the way parameters/return values are passed can be fatal when calling code written in assembler that directly manipulates call arguments and return values unless the transform has no impact on the way they are passed at the codegen level. While it is possible to imagine an ABI that treats integers of pointer size differently to pointers, I don't think LLVM supports any so the transform should now be safe while still being useful. llvm-svn: 51834	2008-06-01 07:38:42 +00:00
Nick Lewycky	035fe6f716	Peer through sext/zext when looking for not(cmp). llvm-svn: 51819	2008-05-31 19:01:33 +00:00
Nick Lewycky	26b8cd84b3	Add more i1 optimizations. add, sub, mul, s/udiv on i1 are now simplified away. llvm-svn: 51817	2008-05-31 17:59:52 +00:00
Nick Lewycky	df9242a833	Adding i1 is always Xor. llvm-svn: 51816	2008-05-31 17:10:28 +00:00
Dan Gohman	86ff8536f9	const-ify getOpcode. llvm-svn: 51698	2008-05-29 19:53:46 +00:00
Chris Lattner	ecdefb5df7	Implement PR2370: memmove(x,x,size) -> noop. llvm-svn: 51636	2008-05-28 05:30:41 +00:00
Nick Lewycky	f6ccd2580c	"ret (constexpr)" can't be folded into a Constant. Add a method to Analysis/ConstantFolding to fold ConstantExpr's, then make instcombine use it to try to use targetdata to fold constant expressions on void instructions. Also extend the icmp(inttoptr, inttoptr) folding to handle the case where int size != ptr size. llvm-svn: 51559	2008-05-25 20:56:15 +00:00
Chris Lattner	87a099a057	Fix a serious brain-o. Obviously no-one reviewed my patch :( This fixes PR2359 llvm-svn: 51536	2008-05-24 04:06:28 +00:00
Dan Gohman	f96e1371e8	Tidy up BasicBlock::getFirstNonPHI, and change a bunch of places to use it instead of duplicating its functionality. llvm-svn: 51499	2008-05-23 21:05:58 +00:00
Matthijs Kooijman	f52b23c0eb	Replace some weird usage of UserOp1 introduced in r49492 by a plain if. llvm-svn: 51482	2008-05-23 16:17:48 +00:00
Nick Lewycky	3bf5512d87	Constant integer vectors may also be negated. llvm-svn: 51476	2008-05-23 04:54:45 +00:00
Nick Lewycky	8f3127c5b5	Typo. llvm-svn: 51475	2008-05-23 04:39:38 +00:00
Nick Lewycky	4f3d878507	Revert X + X --> X * 2 optz'n which pessimizes heavily on x86. llvm-svn: 51474	2008-05-23 04:34:58 +00:00
Nick Lewycky	452fb32927	Implement X + X for vectors. llvm-svn: 51472	2008-05-23 04:14:51 +00:00
Nick Lewycky	2ec9a01173	Fix a recently added optimization to not crash on vectors. llvm-svn: 51471	2008-05-23 03:26:47 +00:00
Dan Gohman	6d5f120c5c	Generalize the new code in instcombine's ComputeNumSignBits for handling and/or to handle more cases (such as this add-sitofp.ll testcase), and port it to selectiondag's ComputeNumSignBits. llvm-svn: 51469	2008-05-23 02:28:01 +00:00
Dan Gohman	53b2698531	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51467	2008-05-23 01:52:21 +00:00
Dan Gohman	81ab753b14	Port SelectionDAG's ComputeNumSignBits-using code to instcombine, now that instcombine also has ComputeNumSignBits. llvm-svn: 51350	2008-05-20 21:01:12 +00:00
Chris Lattner	7ac943fffd	Teach instcombine 4 new xforms: (add (sext x), cst) --> (sext (add x, cst')) (add (sext x), (sext y)) --> (sext (add int x, y)) (add double (sitofp x), fpcst) --> (sitofp (add int x, intcst)) (add double (sitofp x), (sitofp y)) --> (sitofp (add int x, y)) This generally reduces conversions. For example MiBench/telecomm-gsm gets these simplifications: HACK2: %tmp67.i142.i.i = sext i16 %tmp6.i141.i.i to i32 ; <i32> [#uses=1] %tmp23.i139.i.i = sext i16 %tmp2.i138.i.i to i32 ; <i32> [#uses=1] %tmp8.i143.i.i = add i32 %tmp67.i142.i.i, %tmp23.i139.i.i ; <i32> [#uses=3] HACK2: %tmp67.i121.i.i = sext i16 %tmp6.i120.i.i to i32 ; <i32> [#uses=1] %tmp23.i118.i.i = sext i16 %tmp2.i117.i.i to i32 ; <i32> [#uses=1] %tmp8.i122.i.i = add i32 %tmp67.i121.i.i, %tmp23.i118.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i190.i = sext i16 %tmp6.i.i189.i to i32 ; <i32> [#uses=1] %tmp23.i.i187.i = sext i16 %tmp2.i.i186.i to i32 ; <i32> [#uses=1] %tmp8.i.i191.i = add i32 %tmp67.i.i190.i, %tmp23.i.i187.i ; <i32> [#uses=3] HACK2: %tmp67.i173.i.i.i = sext i16 %tmp6.i172.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i170.i.i.i = sext i16 %tmp2.i169.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i174.i.i.i = add i32 %tmp67.i173.i.i.i, %tmp23.i170.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i152.i.i.i = sext i16 %tmp6.i151.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i149.i.i.i = sext i16 %tmp2.i148.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i153.i.i.i = add i32 %tmp67.i152.i.i.i, %tmp23.i149.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i.i.i = sext i16 %tmp6.i.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i.i5.i.i = sext i16 %tmp2.i.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i.i7.i.i = add i32 %tmp67.i.i.i.i, %tmp23.i.i5.i.i ; <i32> [#uses=3] This also fixes a bug in ComputeNumSignBits handling select and makes it more aggressive with and/or. llvm-svn: 51302	2008-05-20 05:46:13 +00:00
Chris Lattner	9c27f96d04	fix two issues Neil noticed, thanks! llvm-svn: 51296	2008-05-20 03:50:52 +00:00
Dan Gohman	d717761a2b	Make AssociativeOpt static. llvm-svn: 51290	2008-05-20 01:14:05 +00:00
Dan Gohman	123438cc05	Add a ComputeNumSignBits function for use by instcombine, based on the code in SelectionDAG. llvm-svn: 51279	2008-05-19 22:14:15 +00:00
Chris Lattner	b42712288e	switch to Type::getFPMantissaWidth instead of reinventing it. llvm-svn: 51275	2008-05-19 21:17:23 +00:00
Chris Lattner	ba9acbe6dc	minor cleanups, teach instcombine that sitofp/uitofp cannot produce a negative zero. llvm-svn: 51272	2008-05-19 20:27:56 +00:00
Chris Lattner	e35fe0f1c6	convert fptosi(sitofp x) -> x if the fp value has enough bits in its mantissa to accurately represent the integer. This triggers 9 times in 471.omnetpp, though 8 of those seem to be inlined from the same place. llvm-svn: 51271	2008-05-19 20:25:04 +00:00
Chris Lattner	5920a78034	Fold FP comparisons where one operand is converted from an integer type and the other operand is a constant into integer comparisons. This happens surprisingly frequently (e.g. 10 times in 471.omnetpp), which are things like this: %tmp8283 = sitofp i32 %tmp82 to double %tmp1013 = fcmp ult double %tmp8283, 0.0 Clearly comparing tmp82 against i32 0 is cheaper here. this also triggers 8 times in gobmk, including this one: %tmp375376 = sitofp i32 %tmp375 to double %tmp377 = fcmp ogt double %tmp375376, 8.150000e+01 which is comparing an integer against 81.5 :). llvm-svn: 51268	2008-05-19 20:18:56 +00:00
Chris Lattner	6e70830af9	remove debug output llvm-svn: 51264	2008-05-19 20:03:53 +00:00
Chris Lattner	fc365b60dc	be more aggressive about transforming add -> or when the operands have no intersecting bits. This triggers all over the place, for example in lencode, with adds of stuff like: %tmp580 = mul i32 %tmp579, 2 %tmp582 = and i32 %b8, 1 and %tmp28 = shl i32 %abs.i, 1 %sign.0 = select i1 %tmp23, i32 1, i32 0 and %tmp344 = shl i32 %tmp343, 2 %tmp346 = and i32 %tmp96, 3 etc. llvm-svn: 51263	2008-05-19 20:01:56 +00:00
Chris Lattner	4b2a724fb8	Fix PR2339 llvm-svn: 51226	2008-05-18 04:11:26 +00:00
Nick Lewycky	79376f4e02	Move isTrueWhenEqual to ICmpInst. llvm-svn: 51215	2008-05-17 07:33:39 +00:00
Gabor Greif	e1f6e4b21d	API change for {BinaryOperator\|CmpInst\|CastInst}::create*() --> Create. Legacy interfaces will be in place for some time. (Merge from use-diet branch.) llvm-svn: 51200	2008-05-16 19:29:10 +00:00
Chris Lattner	5c953b7d27	implement PR2328. llvm-svn: 51176	2008-05-16 02:59:42 +00:00
Gabor Greif	697e94cc22	Fix a bunch of 80col violations that arose from the Create API change. Tweak makefile targets to find these better. llvm-svn: 51143	2008-05-15 10:04:30 +00:00
Bill Wendling	3716952f10	Situations can arise when you have a function called that returns a 'void', but is bitcast to return a floating point value. The result of the instruction may not be used by the program afterwards, and LLVM will happily remove all instructions except the call. But, on some platforms, if a value is returned as a floating point, it may need to be removed from the stack (like x87). Thus, we can't get rid of the bitcast even if there isn't a use of the value. llvm-svn: 51134	2008-05-14 22:45:20 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Chris Lattner	a4ee1f516f	don't sink invokes, even if they are readonly. This fixes a crash on kimwitu++. llvm-svn: 50901	2008-05-09 15:07:33 +00:00
Chris Lattner	aaba10e843	Implement PR2298. This transforms: ~x < ~y --> y < x -x == -y --> x == y llvm-svn: 50882	2008-05-09 05:19:28 +00:00
Chris Lattner	49a594e6ab	More than just loads can read from memory: readonly calls like strlen also need to be checked for memory modifying instructions before we can sink them. THis fixes the second half of PR2297. llvm-svn: 50860	2008-05-08 17:37:37 +00:00
Chris Lattner	4fa09669d8	Make instcombine's DSE respect loads as well as stores. It is not safe to delete the first store in: store x -> p load p store y -> p This is for PR2297. llvm-svn: 50859	2008-05-08 17:20:30 +00:00
Anton Korobeynikov	fc2edad4ae	Turn StripPointerCast() into a method llvm-svn: 50836	2008-05-07 22:54:15 +00:00
Dan Gohman	5a3eecdfd8	Fix a bug in the ComputeMaskedBits logic for multiply. llvm-svn: 50793	2008-05-07 00:35:55 +00:00
Anton Korobeynikov	82c02b28f3	Make StripPointerCast a common function (should we mak it method of Value instead?) llvm-svn: 50775	2008-05-06 22:52:30 +00:00
Devang Patel	7ffc3c9a95	Fix typo. llvm-svn: 50713	2008-05-06 05:40:11 +00:00
Dan Gohman	cf0e3acf16	Correct the value of LowBits in srem and urem handling in ComputeMaskedBits. llvm-svn: 50692	2008-05-06 00:51:48 +00:00
Devang Patel	a1ec89fbf1	Do not sink getresult. llvm-svn: 50600	2008-05-03 00:36:30 +00:00
Dan Gohman	1962c2be6a	Fix a mistake in the computation of leading zeros for udiv. llvm-svn: 50591	2008-05-02 21:30:02 +00:00
Dan Gohman	4be6ae4e6c	Fix an overaggressive SimplifyDemandedBits optimization on urem. This fixes the 254.gap regression on x86 and the 403.gcc regression on x86-64. llvm-svn: 50537	2008-05-01 19:13:24 +00:00
Chris Lattner	2dc4426675	move lowering of llvm.memset -> store from simplify libcalls to instcombine. llvm-svn: 50472	2008-04-30 06:39:11 +00:00
Chris Lattner	d9e3b5c5bd	don't eliminate load from volatile value on paths where the load is dead. This fixes the second half of PR2262 llvm-svn: 50430	2008-04-29 17:28:22 +00:00
Chris Lattner	9233c124c9	fix a subtle volatile handling bug. llvm-svn: 50428	2008-04-29 17:13:43 +00:00
Chris Lattner	e331a65c79	don't delete the last store to an alloca if the store is volatile. llvm-svn: 50390	2008-04-29 04:58:38 +00:00
Dan Gohman	72ec3f4562	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Dale Johannesen	0d1d3df564	change comments per review llvm-svn: 50300	2008-04-25 21:16:07 +00:00
Nick Lewycky	4d43d3c72c	Remove 'unwinds to' support from mainline. This patch undoes r47802 r47989 r48047 r48084 r48085 r48086 r48088 r48096 r48099 r48109 and r48123. llvm-svn: 50265	2008-04-25 16:53:59 +00:00
Dale Johannesen	f6e15a4774	Rewrite previous patch to suit Chris's preference. llvm-svn: 50174	2008-04-23 18:34:37 +00:00
Dale Johannesen	493527d8c9	Do not change the type of a ByVal argument to a type of a different size. llvm-svn: 50121	2008-04-23 01:03:05 +00:00
Evan Cheng	1c89ca7295	Don't do: "(X & 4) >> 1 == 2 --> (X & 4) == 4" if there are more than one uses of the shift result. llvm-svn: 50118	2008-04-23 00:38:06 +00:00
Chris Lattner	8fb13cbe4e	remove dead code. llvm-svn: 50080	2008-04-22 03:21:48 +00:00
Chris Lattner	c3a439351c	optimize "p != gep p, ..." better. This allows us to compile getelementptr-seteq.ll into: define i1 @test(i64 %X, %S* %P) { %C = icmp eq i64 %X, -1 ; <i1> [#uses=1] ret i1 %C } instead of: define i1 @test(i64 %X, %S* %P) { %A.idx.mask = and i64 %X, 4611686018427387903 ; <i64> [#uses=1] %C = icmp eq i64 %A.idx.mask, 4611686018427387903 ; <i1> [#uses=1] ret i1 %C } And fixes the second half of PR2235. This speeds up the insertion sort case by 45%, from 1.12s to 0.77s. In practice, this will significantly speed up for loops structured like: for (double *P = Base + N; P != Base; --P) ... Which happens frequently for C++ iterators. llvm-svn: 50079	2008-04-22 02:53:33 +00:00
Torok Edwin	ab20784740	g++-4.3 build-fix: CHAR_BIT requires <climits>. llvm-svn: 49989	2008-04-20 08:33:11 +00:00
Chris Lattner	3b18762f40	Switch to using Simplified ConstantFP::get API. llvm-svn: 49977	2008-04-20 00:41:09 +00:00
Dan Gohman	99b7b3f03b	Teach InstCombine's ComputeMaskedBits to handle pointer expressions in addition to integer expressions. Rewrite GetOrEnforceKnownAlignment as a ComputeMaskedBits problem, moving all of its special alignment knowledge to ComputeMaskedBits as low-zero-bits knowledge. Also, teach ComputeMaskedBits a few basic things about Mul and PHI instructions. This improves ComputeMaskedBits-based simplifications in a few cases, but more noticeably it significantly improves instcombine's alignment detection for loads, stores, and memory intrinsics. llvm-svn: 49492	2008-04-10 18:43:06 +00:00
Gabor Greif	e9ecc68d8f	API changes for class Use size reduction, wave 1. Specifically, introduction of XXX::Create methods for Users that have a potentially variable number of Uses. llvm-svn: 49277	2008-04-06 20:25:17 +00:00
Nate Begeman	f2b0b0eb17	Don't eliminate bitcast instructions that change the type of a pointer llvm-svn: 48971	2008-03-31 00:22:16 +00:00
Evan Cheng	2b72c05992	Handle a special case xor undef, undef -> 0. Technically this should be transformed to undef. But this is such a common idiom (misuse) we are going to handle it. llvm-svn: 48791	2008-03-25 20:07:13 +00:00
Evan Cheng	c3cf9f872a	Transform (zext (or (icmp), (icmp))) to (or (zext (cimp), (zext icmp))) if at least one of the (zext icmp) can be transformed to eliminate an icmp. llvm-svn: 48715	2008-03-24 00:21:34 +00:00
Duncan Sands	c9e09a0588	Fix the build for gcc-4.2. llvm-svn: 48639	2008-03-21 08:32:17 +00:00
Chris Lattner	c44160ce6e	Teach masked value is zero about add and sub, and use MVIZ to simplify things like (X & 4) >> 1 == 2 --> (X & 4) == 4. since it is obvious that the shift doesn't remove any bits. llvm-svn: 48631	2008-03-21 05:19:58 +00:00
Bill Wendling	68a930b33e	The inst combining of inttoptr into GEP with one index was using the bit size of the type instead of the byte size. This was causing troublesome mis-compilations. True to form, this took 2 days to find and is a one-line fix. :-P llvm-svn: 48354	2008-03-14 05:12:19 +00:00
Chris Lattner	8a923e7c28	Reimplement the parameter attributes support, phase #1 . hilights: 1. There is now a "PAListPtr" class, which is a smart pointer around the underlying uniqued parameter attribute list object, and manages its refcount. It is now impossible to mess up the refcount. 2. PAListPtr is now the main interface to the underlying object, and the underlying object is now completely opaque. 3. Implementation details like SmallVector and FoldingSet are now no longer part of the interface. 4. You can create a PAListPtr with an arbitrary sequence of ParamAttrsWithIndex's, no need to make a SmallVector of a specific size (you can just use an array or scalar or vector if you wish). 5. All the client code that had to check for a null pointer before dereferencing the pointer is simplified to just access the PAListPtr directly. 6. The interfaces for adding attrs to a list and removing them is a bit simpler. Phase #2 will rename some stuff (e.g. PAListPtr) and do other less invasive changes. llvm-svn: 48289	2008-03-12 17:45:29 +00:00
Devang Patel	70c238a1d8	Skip functions that return multiple values. llvm-svn: 48233	2008-03-11 18:04:06 +00:00
Nick Lewycky	271506f29c	Don't eliminate blocks that are only reachable by unwind_to. llvm-svn: 48106	2008-03-09 08:50:23 +00:00
Nick Lewycky	d0b62a1552	Don't try to simplify urem and srem using arithmetic rules that don't work under modulo (overflow). Fixes PR1933. llvm-svn: 47987	2008-03-06 06:48:30 +00:00
Chris Lattner	c612571555	Folding or(fcmp,fcmp) only works if the operands of the fcmps are the same fp type. llvm-svn: 47750	2008-02-29 06:09:11 +00:00
Bill Wendling	d188e03715	De-tabify. llvm-svn: 47599	2008-02-26 10:53:30 +00:00
Dale Johannesen	09f410b6d7	Split ParameterAttributes.h, putting the complicated stuff into ParamAttrsList.h. Per feedback from ParamAttrs changes. llvm-svn: 47504	2008-02-22 22:17:59 +00:00
Zhou Sheng	3b8eb704fc	Fixed a typo. llvm-svn: 47478	2008-02-22 10:00:35 +00:00
Anton Korobeynikov	18991d78fa	Fix newly-introduced 4.3 warnings llvm-svn: 47375	2008-02-20 12:07:57 +00:00
Anton Korobeynikov	1bfd121321	Make Transforms to be 4.3 warnings-clean llvm-svn: 47371	2008-02-20 11:26:25 +00:00
Dale Johannesen	89268bc6e2	Expand ParameterAttributes to 32 bits (in preparation for adding alignment info, not there yet). Clean up interfaces to reference ParameterAttributes consistently. llvm-svn: 47342	2008-02-19 21:38:47 +00:00
Chris Lattner	0fe6bce9ce	fdiv/frem of undef can produce undef, because the undef operand can be a SNaN. We could be more aggressive and turn this into unreachable, but that is less nice, and not really worth it. llvm-svn: 47313	2008-02-19 06:12:18 +00:00
Nick Lewycky	fefd0202c9	Correctly fold divide-by-constant, even when faced with overflow. llvm-svn: 47287	2008-02-18 22:48:05 +00:00
Chris Lattner	1e3c501cb8	Transforming -A + -B --> -(A + B) isn't safe for FP, thanks to Dale for noticing this! llvm-svn: 47276	2008-02-18 17:50:16 +00:00
Chris Lattner	024f8c8f09	optimize away stackrestore calls that have no intervening alloca or call. llvm-svn: 47258	2008-02-18 06:12:38 +00:00
Chris Lattner	cc22601bc3	Fold (-x + -y) -> -(x+y) which promotes better association, fixing the second half of PR2047 llvm-svn: 47244	2008-02-17 21:03:36 +00:00
Dan Gohman	1ee8dc97d9	Rename APInt's isPositive to isNonNegative, to reflect what it actually does. llvm-svn: 47090	2008-02-13 22:09:18 +00:00
Chris Lattner	682a7dc653	Fix a bug compiling PR1978 (perhaps not the only one though) which was incorrectly simplifying "x == (gep x, 1, i)" into false, even though i could be negative. As it turns out, all the code to handle this already existed, we just need to disable the incorrect optimization case and let the general case handle it. llvm-svn: 46739	2008-02-05 04:45:32 +00:00
Nick Lewycky	3b59214320	There are some cases where icmp(add) can be folded into a new icmp. Handle them. llvm-svn: 46687	2008-02-03 16:33:09 +00:00
Nick Lewycky	c7a4ba044b	Hack on vectors too. llvm-svn: 46684	2008-02-03 08:19:11 +00:00
Nick Lewycky	e6e3a7f6ea	Fold away one multiply in instcombine. This would normally be caught in reassociate anyways, but they could be generated during instcombine's run. llvm-svn: 46683	2008-02-03 07:42:09 +00:00
Chris Lattner	17819d971e	eliminate additions of 0.0 when they are obviously dead. This has to be careful to avoid turning -0.0 + 0.0 -> -0.0 which is incorrect. llvm-svn: 46499	2008-01-29 06:52:45 +00:00
Nick Lewycky	8ea81e8ba4	Handle some more combinations of extend and icmp. Fixes PR1940. llvm-svn: 46431	2008-01-28 03:48:02 +00:00
Chris Lattner	710b441174	Fix PR1932 by disabling an xform invalid for fdiv. llvm-svn: 46429	2008-01-28 00:58:18 +00:00
Chris Lattner	fa1e7eef30	Fold fptrunc(add (fpextend x), (fpextend y)) -> add(x,y), as GCC does. llvm-svn: 46406	2008-01-27 05:29:54 +00:00
Nick Lewycky	f069264164	Enable the fix I just checked in, silly me. llvm-svn: 46247	2008-01-22 05:42:02 +00:00
Nick Lewycky	78712e5b59	Multiply can be evaluated in a different type, so long as the target type has a smaller bitwidth. llvm-svn: 46244	2008-01-22 05:08:48 +00:00
Duncan Sands	b5ca2e9fcb	I noticed that the trampoline straightening transformation could drop attributes on varargs call arguments. Also, it could generate invalid IR if the transformed call already had the 'nest' attribute somewhere (this can never happen for code coming from llvm-gcc, but it's a theoretical possibility). Fix both problems. llvm-svn: 45973	2008-01-14 19:52:09 +00:00
Chris Lattner	92bd785323	Turn a memcpy from a double* into a load/store of double instead of a load/store of i64. The later prevents promotion/scalarrepl of the source and dest in many cases. This fixes the 300% performance regression of the byval stuff on stepanov_v1p2. llvm-svn: 45945	2008-01-14 00:28:35 +00:00
Chris Lattner	57974c8d51	factor memcpy/memmove simplification out to its own SimplifyMemTransfer method, no functionality change. llvm-svn: 45944	2008-01-13 23:50:23 +00:00
Chris Lattner	8c5cdddfb9	simplify some code. If we can infer alignment for source and dest that are greater than memcpy alignment, and if we lower to load/store, use the best alignment info we have. llvm-svn: 45943	2008-01-13 22:30:28 +00:00
Chris Lattner	5a86612d3f	simplify some code by adding a InsertBitCastBefore method, make memmove->memcpy conversion a bit simpler. llvm-svn: 45942	2008-01-13 22:23:22 +00:00
Chris Lattner	5bc253c8f2	Fix PR1907, a nasty miscompilation because instcombine didn't realize that ne & sgt was a signed comparison (it was only looking at whether the left compare was signed). llvm-svn: 45937	2008-01-13 20:59:02 +00:00
Duncan Sands	781f6549db	When turning a call to a bitcast function into a direct call, if this becomes a varargs call then deal correctly with any parameter attributes on the newly vararg call arguments. llvm-svn: 45931	2008-01-13 08:02:44 +00:00
Chris Lattner	2940c5c56d	Implement PR1795, an instcombine hack for forming GEPs with integer pointer arithmetic. llvm-svn: 45745	2008-01-08 07:23:51 +00:00
Duncan Sands	b18c30acec	Small cleanup for handling of type/parameter attribute incompatibility. llvm-svn: 45704	2008-01-07 17:16:06 +00:00
Duncan Sands	404eb05247	The transform that tries to turn calls to bitcast functions into direct calls bails out unless caller and callee have essentially equivalent parameter attributes. This is illogical - the callee's attributes should be of no relevance here. Rework the logic, which incidentally fixes a crash when removed arguments have attributes. llvm-svn: 45658	2008-01-06 18:27:01 +00:00
Duncan Sands	55e5090fe8	When transforming a call to a bitcast function into a direct call with cast parameters and cast return value (if any), instcombine was prepared to cast any non-void return value into any other, whether castable or not. Add a new predicate for testing whether casting is valid, and check it both for the return value and (as a cleanup) for the parameters. llvm-svn: 45657	2008-01-06 10:12:28 +00:00
Chris Lattner	e666bc272d	remove a couple more unsafe xforms in the face of overflow. llvm-svn: 45613	2008-01-05 01:22:42 +00:00
Chris Lattner	db026d703b	remove the (x-y) < 0 comparison xform, it miscompiles things that are not equality comparisons, for example: (2147479553+4096)-2147479553 < 0 != (2147479553+4096) < 2147479553 llvm-svn: 45612	2008-01-05 01:18:20 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Christopher Lamb	b053b80b79	Disable null pointer folding transforms for non-generic address spaces. This should probably be a target-specific predicate based on address space. That way for targets where this isn't applicable the predicate can be optimized away. llvm-svn: 45403	2007-12-29 07:56:53 +00:00
Owen Anderson	7363914ef7	Repair a transform that Chris noticed a bug in. Thanks to Nicholas for pointing out my stupid mistakes when writing this patch. :-) llvm-svn: 45384	2007-12-28 07:42:12 +00:00
Chris Lattner	5179819beb	disable this instcombine xform, it miscompiles: define i32 @main() { entry: %z = alloca i32 ; <i32> [#uses=2] store i32 0, i32 %z %tmp = load i32* %z ; <i32> [#uses=1] %sub = sub i32 %tmp, 1 ; <i32> [#uses=1] %cmp = icmp ult i32 %sub, 0 ; <i1> [#uses=1] %retval = select i1 %cmp, i32 1, i32 0 ; <i32> [#uses=1] ret i32 %retval } into ret 1, instead of ret 0. Christopher, please investigate. llvm-svn: 45383	2007-12-28 06:24:31 +00:00
Chris Lattner	74b2ab59fd	implement InstCombine/shift-trunc-shift.ll. This allows us to compile: #include <math.h> int t1(double d) { return signbit(d); } into: _t1: movd %xmm0, %rax shrq $63, %rax ret instead of: _t1: movd %xmm0, %rax shrq $32, %rax shrl $31, %eax ret on x86-64. llvm-svn: 45311	2007-12-22 09:07:47 +00:00
Christopher Lamb	7d82bc46b8	Implement review feedback, including additional transforms (icmp slt (sub A B) 1) -> (icmp sle A B) icmp sgt (sub A B) -1) -> (icmp sge A B) and add testcase. llvm-svn: 45256	2007-12-20 07:21:11 +00:00
Chris Lattner	16a51da0e2	simplify this code with the new m_Zero() pattern. Make sure the select only has a single use, and generalize it to not require N to be a constant. llvm-svn: 45250	2007-12-20 01:56:58 +00:00
Duncan Sands	aa31b92508	When inlining through an 'nounwind' call, mark inlined calls 'nounwind'. It is important for correct C++ exception handling that nounwind markings do not get lost, so this transformation is actually needed for correctness. llvm-svn: 45218	2007-12-19 21:13:37 +00:00
Christopher Lamb	f00ac6dd93	Fold subtracts into integer compares vs. zero. This improves generate code for this case on X86 from _foo: movl $99, %ecx movl 4(%esp), %eax subl %eax, %ecx xorl %edx, %edx testl %ecx, %ecx cmovs %edx, %eax ret to _foo: xorl %ecx, %ecx movl 4(%esp), %eax cmpl $99, %eax cmovg %ecx, %eax ret llvm-svn: 45173	2007-12-18 21:32:20 +00:00
Christopher Lamb	b7016c53d1	Fix comments llvm-svn: 45170	2007-12-18 20:33:11 +00:00
Christopher Lamb	74dbad9216	Remove an orthogonal transformation of the selection condition from my most recent submission. llvm-svn: 45169	2007-12-18 20:30:28 +00:00
Duncan Sands	3353ed09ac	Rename isNoReturn to doesNotReturn, and isNoUnwind to doesNotThrow. llvm-svn: 45160	2007-12-18 09:59:50 +00:00
Christopher Lamb	30291f4a30	Fix typos. llvm-svn: 45159	2007-12-18 09:45:40 +00:00
Christopher Lamb	8b09a464b4	Fold certain additions through selects (and their compares) so as to eliminate subtractions. This code is often produced by the SMAX expansion in SCEV. This implements test/Transforms/InstCombine/2007-12-18-AddSelCmpSub.ll llvm-svn: 45158	2007-12-18 09:34:41 +00:00
Christopher Lamb	edf0788758	Change the PointerType api for creating pointer types. The old functionality of PointerType::get() has become PointerType::getUnqual(), which returns a pointer in the generic address space. The new prototype of PointerType::get() requires both a type and an address space. llvm-svn: 45082	2007-12-17 01:12:55 +00:00
Duncan Sands	8e4847ee95	Make instcombine promote inline asm calls to 'nounwind' calls. Remove special casing of inline asm from the inliner. There is a potential problem: the verifier rejects invokes of inline asm (not sure why). If an asm call is not marked "nounwind" in some .ll, and instcombine is not run, but the inliner is run, then an illegal module will be created. This is bad but I'm not sure what the best approach is. I'm tempted to remove the check in the verifier... llvm-svn: 45073	2007-12-16 15:51:49 +00:00
Wojciech Matyjewicz	309e5a723b	1. "Upgrage" comments. 2. Using zero-extended value of Scale and unsigned division is safe provided that Scale doesn't have the sign bit set. Previously these 2 instructions: %p = bitcast [100 x {i8,i8,i8}]* %x to i8* %q = getelementptr i8* %p, i32 -4 were combined into: %q = getelementptr [100 x { i8, i8, i8 }]* %x, i32 0, i32 1431655764, i32 0 what was incorrect. llvm-svn: 44936	2007-12-12 15:21:32 +00:00
Chris Lattner	d2bbbabbfb	simplify some code. llvm-svn: 44655	2007-12-06 06:25:04 +00:00
Chris Lattner	0ccb663cca	move some ashr-specific code out of commonShiftTransforms into visitAShr. llvm-svn: 44650	2007-12-06 01:59:46 +00:00
Duncan Sands	ad0ea2d430	Fix PR1146: parameter attributes are longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Chris Lattner	c00e8adfe0	Implement PR1822 llvm-svn: 44318	2007-11-25 21:27:53 +00:00
Duncan Sands	185eeac0f8	Fix PR1816. If a bitcast of a function only exists because of a trivial difference in function attributes, allow calls to it to be converted to direct calls. Based on a patch by Török Edwin. While there, move the various lists of mutually incompatible parameters etc out of the verifier and into ParameterAttributes.h. llvm-svn: 44315	2007-11-25 14:10:56 +00:00
Chris Lattner	0cf083815a	add a comment. llvm-svn: 44293	2007-11-23 22:35:18 +00:00
Chris Lattner	1985d96dc9	Fix PR1817. llvm-svn: 44284	2007-11-22 23:47:13 +00:00
Chris Lattner	c53b18362a	Fix PR1800 by correcting mistaken logic. llvm-svn: 44188	2007-11-16 06:04:17 +00:00
Andrew Lenharth	19ca5c7021	Better check llvm-svn: 43897	2007-11-08 18:45:15 +00:00
Andrew Lenharth	8cf11aa330	Fix PR1780 llvm-svn: 43893	2007-11-08 17:39:28 +00:00
Chris Lattner	d8515f8e80	Implement PR1777 by detecting dependent phis that all compute the same value. llvm-svn: 43777	2007-11-06 21:52:06 +00:00
Chris Lattner	362709dff1	wrap long lines llvm-svn: 43745	2007-11-06 01:15:27 +00:00
Dan Gohman	4decbc5002	Fix an abort in instcombine when folding creates a vector rem instruction. llvm-svn: 43743	2007-11-05 23:16:33 +00:00
Duncan Sands	44b8721de8	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Chris Lattner	74709473ed	Fix InstCombine/2007-10-31-RangeCrash.ll llvm-svn: 43596	2007-11-01 02:18:41 +00:00
Chris Lattner	55b8302dfe	simplify some code by using the new isNaN predicate llvm-svn: 43305	2007-10-24 18:54:45 +00:00
Chris Lattner	c62877e9da	Implement a couple of foldings for ordered and unordered comparisons, implementing cases related to PR1738. llvm-svn: 43289	2007-10-24 05:38:08 +00:00
Devang Patel	df49cf52e2	Try again. Instead of loading small global string from memory, use integer constant. llvm-svn: 43148	2007-10-18 19:52:32 +00:00
Evan Cheng	cdcc1d0444	Reverting r43070 for now. It's causing llc test failures. llvm-svn: 43103	2007-10-17 23:51:13 +00:00
Devang Patel	91ff13edcc	Apply "Instead of loading small c string constant, use integer constant directly" transformation while processing load instruction. llvm-svn: 43070	2007-10-17 07:24:40 +00:00
Devang Patel	8d818f5e80	Use immediate stores. llvm-svn: 43055	2007-10-16 23:44:18 +00:00
Devang Patel	bff4aea328	Achieve same result but use fewer lines of code. llvm-svn: 42985	2007-10-15 15:31:35 +00:00
Devang Patel	371e6ca690	Dest type is always i8 *. This allows some simplification. Do not filter memmove. llvm-svn: 42930	2007-10-12 20:10:21 +00:00
Chris Lattner	ad618f66e6	Fix a bug in my patch last night that broke InstCombine/2007-10-12-Crash.ll llvm-svn: 42920	2007-10-12 18:05:47 +00:00
Gabor Greif	5d8f7e0cc7	eliminate warning llvm-svn: 42892	2007-10-12 07:44:54 +00:00
Chris Lattner	d8675e4915	Fix some 80 column violations. Fix DecomposeSimpleLinearExpr to handle simple constants better. Don't nuke gep(bitcast(allocation)) if the bitcast(allocation) will fold the allocation. This fixes PR1728 and Instcombine/malloc3.ll llvm-svn: 42891	2007-10-12 05:30:59 +00:00
Devang Patel	899cc56612	Lower memcpy if it makes sense. llvm-svn: 42864	2007-10-11 17:21:57 +00:00
Dale Johannesen	9d559cfff5	Tone down an overzealous optimization. llvm-svn: 42582	2007-10-03 17:45:27 +00:00
Duncan Sands	d31649bc59	Improve comment. llvm-svn: 42132	2007-09-19 10:25:38 +00:00
Duncan Sands	56df7dec2b	A global variable with external weak linkage can be null, while an alias could alias such a global variable. llvm-svn: 42130	2007-09-19 10:10:31 +00:00
Dan Gohman	2ac2652779	Instcombine x-((x/y)*y) into a remainder operator. llvm-svn: 42035	2007-09-17 17:31:57 +00:00
Duncan Sands	6d5da71288	Factor the trampoline transformation into a subroutine. llvm-svn: 42021	2007-09-17 10:26:40 +00:00
Dale Johannesen	98d3a08d8f	Remove the assumption that FP's are either float or double from some of the many places in the optimizers it appears, and do something reasonable with x86 long double. Make APInt::dump() public, remove newline, use it to dump ConstantSDNode's. Allow APFloats in FoldingSet. Expand X86 backend handling of long doubles (conversions to/from int, mostly). llvm-svn: 41967	2007-09-14 22:26:36 +00:00
Chris Lattner	d9111b88d1	silence a bogus gcc warning. llvm-svn: 41949	2007-09-14 03:07:24 +00:00
Duncan Sands	9204663bcb	Turn calls to trampolines into calls to the underlying nested function. llvm-svn: 41844	2007-09-11 14:35:41 +00:00
Chris Lattner	e804567cd8	remove some dead code, this is handled by constant folding. llvm-svn: 41819	2007-09-10 23:46:29 +00:00
Chris Lattner	85a51e0060	Don't zap back to back volatile load/stores llvm-svn: 41759	2007-09-07 05:33:03 +00:00
Dale Johannesen	bed9dc423c	Next round of APFloat changes. Use APFloat in UpgradeParser and AsmParser. Change all references to ConstantFP to use the APFloat interface rather than double. Remove the ConstantFP double interfaces. Use APFloat functions for constant folding arithmetic and comparisons. (There are still way too many places APFloat is just a wrapper around host float/double, but we're getting there.) llvm-svn: 41747	2007-09-06 18:13:44 +00:00
Nick Lewycky	0c5c47944a	Use isTrueWhenEqual. Thanks Chris! llvm-svn: 41741	2007-09-06 02:40:25 +00:00
Nick Lewycky	b0b066eaaa	When the two operands of an icmp are equal, there are five possible predicates that would make the icmp true. Fixes PR1637. llvm-svn: 41740	2007-09-06 01:10:22 +00:00
Chuck Rose III	2320323647	Forgot to obey 80 column rule. Fixing that. llvm-svn: 41725	2007-09-05 20:36:41 +00:00
Chuck Rose III	e58572233d	Added default parameters to GetElementPtrInstr constructor call. Visual Studio 2k5 was getting confused and was unable to compile it. Suspected compiler error. llvm-svn: 41721	2007-09-05 16:54:38 +00:00
David Greene	c656cbb8c2	Update GEP constructors to use an iterator interface to fix GLIBCXX_DEBUG issues. llvm-svn: 41697	2007-09-04 15:46:09 +00:00
Chris Lattner	0e258b8518	Cut off crazy computation. This helps PR1622 slightly. llvm-svn: 41522	2007-08-28 04:23:55 +00:00
David Greene	703623d571	Update InvokeInst to work like CallInst llvm-svn: 41506	2007-08-27 19:04:21 +00:00
Chris Lattner	99c8ee2977	Transform a load from an undef/zero global into an undef/global even if we have complex pointer manipulation going on. This allows us to compile stuff like this: __m128i foo(__m128i x){ static const unsigned int c_0[4] = { 0, 0, 0, 0 }; __m128i v_Zero = _mm_loadu_si128((__m128i*)c_0); x = _mm_unpacklo_epi8(x, v_Zero); return x; } into: _foo: xorps %xmm1, %xmm1 punpcklbw %xmm1, %xmm0 ret llvm-svn: 41022	2007-08-11 18:48:48 +00:00
Chris Lattner	a8e4b4bc7b	when we see a unaligned load from an insufficiently aligned global or alloca, increase the alignment of the load, turning it into an aligned load. This allows us to compile: #include <xmmintrin.h> __m128i foo(__m128i x){ static const unsigned int c_0[4] = { 0, 0, 0, 0 }; __m128i v_Zero = _mm_loadu_si128((__m128i*)c_0); x = _mm_unpacklo_epi8(x, v_Zero); return x; } into: _foo: punpcklbw _c_0.5944, %xmm0 ret .data .lcomm _c_0.5944,16,4 # c_0.5944 instead of: _foo: movdqu _c_0.5944, %xmm1 punpcklbw %xmm1, %xmm0 ret .data .lcomm _c_0.5944,16,2 # c_0.5944 llvm-svn: 40971	2007-08-09 19:05:49 +00:00
Nick Lewycky	8052019a20	It's safe to fold not of fcmp. llvm-svn: 40870	2007-08-06 20:04:16 +00:00
Chris Lattner	f0da7975ea	at the end of instcombine, explicitly clear WorklistMap. This shrinks it down to something small. On the testcase from PR1432, this speeds up instcombine from 0.7959s to 0.5000s, (59%) llvm-svn: 40840	2007-08-05 08:47:58 +00:00
Chandler Carruth	7132e00de7	This is the patch to provide clean intrinsic function overloading support in LLVM. It cleans up the intrinsic definitions and generally smooths the process for more complicated intrinsic writing. It will be used by the upcoming atomic intrinsics as well as vector and float intrinsics in the future. This also changes the syntax for llvm.bswap, llvm.part.set, llvm.part.select, and llvm.ct* intrinsics. They are automatically upgraded by both the LLVM ASM reader and the bitcode reader. The test cases have been updated, with special tests added to ensure the automatic upgrading is supported. llvm-svn: 40807	2007-08-04 01:51:18 +00:00
Chris Lattner	dc2cf228ce	Replacing a cast with another one does not reduce the number of casts in the input. llvm-svn: 40741	2007-08-02 17:23:38 +00:00
Chris Lattner	222b214be7	Disable an xform that causes an infinite loop. This fixes PR1594 llvm-svn: 40739	2007-08-02 16:56:32 +00:00
Chris Lattner	2740694450	wrap some long lines. Major offenders that are left include gvn, gvnpre, dse, and predsimplify. To see these, use: make check-line-length llvm-svn: 40738	2007-08-02 16:53:43 +00:00
Chris Lattner	b0418fc607	Enhance instcombine to be more aggressive about folding casts of operations of casts. This implements InstCombine/zext-fold.ll llvm-svn: 40726	2007-08-02 06:11:14 +00:00
David Greene	17a5dfe6f7	New CallInst interface to address GLIBCXX_DEBUG errors caused by indexing an empty std::vector. Updates to all clients. llvm-svn: 40660	2007-08-01 03:43:44 +00:00
Lauro Ramos Venancio	549e775e67	Fix a bug in GetKnownAlignment of packed structs. llvm-svn: 40649	2007-07-31 20:13:21 +00:00
Reid Spencer	dff9d69cfb	Fix a typo/thinko. llvm-svn: 40599	2007-07-30 19:53:57 +00:00
Chris Lattner	4512cd2cab	completely remove a transformation that is unsafe in the face of undefs. llvm-svn: 40439	2007-07-23 17:10:17 +00:00
Devang Patel	5e39293e62	Apply temporary work around to fix llvm mis-compilation reported in PR 1556. llvm-svn: 40133	2007-07-21 00:34:29 +00:00
Chris Lattner	d82e4a19cc	this xform is already done by the constant folder. llvm-svn: 40124	2007-07-20 22:06:41 +00:00
Dan Gohman	e31a61eeca	Optimize alignment of loads and stores. llvm-svn: 40102	2007-07-20 16:34:21 +00:00
Dan Gohman	06c60b6032	Fix comments about vectors to use the current wording. llvm-svn: 39921	2007-07-16 14:29:03 +00:00
Chris Lattner	640fd5124d	Repair a regression in Transforms/InstCombine/mul.ll that Reid noticed. llvm-svn: 39896	2007-07-16 04:15:34 +00:00
Chris Lattner	d4fef8dbca	Implement shift-simplify.ll:test[45]. First teach instcombine that sign bit checks only demand the sign bit, this allows simplify demanded bits to hack on expressions better. Second, teach instcombine that ashr is useless if only the sign bit is demanded. llvm-svn: 39880	2007-07-15 20:54:51 +00:00
Chris Lattner	06205d5567	Implement shift-simplify.ll:test3, turning: (X << 31) <s 0 --> (X&1) != 0 This happens dozens of times in the CFE. llvm-svn: 39879	2007-07-15 20:42:37 +00:00
Chris Lattner	fb032b176b	Significantly improve the documentation of the instcombine divide/compare transformation. Also, keep track of which end of the integer interval overflows occur on. This fixes Transforms/InstCombine/2007-06-21-DivCompareMiscomp.ll and rdar://5278853, a miscompilation of perl. llvm-svn: 37692	2007-06-21 18:11:19 +00:00
Chris Lattner	3bbec59e8b	refactor a bunch of code out of visitICmpInstWithInstAndIntCst into its own routine. llvm-svn: 37679	2007-06-20 23:46:26 +00:00
Chris Lattner	09a33a4f64	silence a bogus warning Duraid ran into. llvm-svn: 37649	2007-06-19 05:43:49 +00:00
Chris Lattner	373389260f	Generalize many transforms to work on ~ of vectors in addition to ~ of integer ops. This implements Transforms/InstCombine/and-or-not.ll test3/test4, and finishes off PR1510 llvm-svn: 37589	2007-06-15 06:23:19 +00:00
Chris Lattner	481e28b1f5	Implement two xforms: 1. ~(~X \| Y) === (X & ~Y) 2. (A\|B) & ~(A&B) -> A^B This allows us to transform ~(~(a\|b) \| (a&b)) -> a^b. This implements PR1510 for scalar values. llvm-svn: 37584	2007-06-15 05:58:24 +00:00
Chris Lattner	f14e5175ed	delete some obviously dead vector operations, which deletes a few thousand operations from Duraids example. llvm-svn: 37582	2007-06-15 05:26:55 +00:00
Lauro Ramos Venancio	368e8872db	Fix PR1499. llvm-svn: 37472	2007-06-06 17:08:48 +00:00
Chris Lattner	f79577d314	fix a miscompilation when passing a float through varargs llvm-svn: 37297	2007-05-23 01:17:04 +00:00
Chris Lattner	a655a157a0	Fix Transforms/InstCombine/2007-05-18-CastFoldBug.ll, a bug that devastates objc code due to the way the FE lowers objc message sends. llvm-svn: 37256	2007-05-19 06:51:32 +00:00
Chris Lattner	234f96daa8	Fix Transforms/InstCombine/2007-05-14-Crash.ll llvm-svn: 37057	2007-05-15 00:16:00 +00:00
Dan Gohman	b5650ebd6a	Fix typos. llvm-svn: 36994	2007-05-11 21:10:54 +00:00
Chris Lattner	600db3eb96	fix regressions from my previous checking, including Transforms/InstCombine/2006-12-08-ICmp-Combining.ll llvm-svn: 36989	2007-05-11 16:58:45 +00:00
Chris Lattner	fe2b44de9f	fix Transforms/InstCombine/2007-05-10-icmp-or.ll llvm-svn: 36984	2007-05-11 05:55:56 +00:00
Nick Lewycky	e7da2d6ac3	Fix typo in comment. llvm-svn: 36873	2007-05-06 13:37:16 +00:00
Chris Lattner	9b35b3e863	Fix a bug in my previous patch llvm-svn: 36857	2007-05-06 07:24:03 +00:00
Chris Lattner	5aa73fe34c	Implement Transforms/InstCombine/cast_ptr.ll llvm-svn: 36809	2007-05-05 22:41:33 +00:00
Chris Lattner	361e981415	wrap long lines llvm-svn: 36807	2007-05-05 22:32:24 +00:00
Chris Lattner	5c827bda0d	Fix InstCombine/2007-05-04-Crash.ll and PR1384 llvm-svn: 36775	2007-05-05 01:59:31 +00:00
Devang Patel	8c78a0bff0	Drop 'const' llvm-svn: 36662	2007-05-03 01:11:54 +00:00
Devang Patel	e95c6ad802	Use 'static const char' instead of 'static const int'. Due to darwin gcc bug, one version of darwin linker coalesces static const int, which defauts PassID based pass identification. llvm-svn: 36652	2007-05-02 21:39:20 +00:00
Devang Patel	09f162ca6a	Do not use typeinfo to identify pass in pass manager. llvm-svn: 36632	2007-05-01 21:15:47 +00:00
Chris Lattner	089e35cc57	fix a bug triggered by 403.gcc llvm-svn: 36527	2007-04-28 05:27:36 +00:00
Chris Lattner	6e880871e9	Fix several latent bugs in EmitGEPOffset that didn't manifest with its previous clients. This fixes MallocBench/gs llvm-svn: 36525	2007-04-28 04:52:43 +00:00
Chris Lattner	c753800800	uhn zap cvs llvm-svn: 36523	2007-04-28 03:50:56 +00:00
Chris Lattner	acbf6a401d	Implement PR1345 and Transforms/InstCombine/bitcast-gep.ll llvm-svn: 36521	2007-04-28 00:57:34 +00:00
Chris Lattner	1db224db92	refactor some code relating to pointer cast xforms, pulling it out of the codepath for unrelated casts. llvm-svn: 36511	2007-04-27 17:44:50 +00:00
Zhou Sheng	aafe4e216e	Make use of ConstantInt::isZero instead of ConstantInt::isNullValue. llvm-svn: 36261	2007-04-19 05:39:12 +00:00
Chris Lattner	4a6e0cbd41	Extend store merging to support the 'if/then' version in addition to if/then/else. This sinks the two stores in this example into a single store in cond_next. In this case, it allows elimination of the load as well: store double 0.000000e+00, double* @s.3060 %tmp3 = fcmp ogt double %tmp1, 5.000000e-01 ; <i1> [#uses=1] br i1 %tmp3, label %cond_true, label %cond_next cond_true: ; preds = %entry store double 1.000000e+00, double* @s.3060 br label %cond_next cond_next: ; preds = %entry, %cond_true %tmp6 = load double* @s.3060 ; <double> [#uses=1] This implements Transforms/InstCombine/store-merge.ll:test2 llvm-svn: 36040	2007-04-15 01:02:18 +00:00
Chris Lattner	14a251b937	refactor some code, no functionality change. llvm-svn: 36037	2007-04-15 00:07:55 +00:00
Chris Lattner	28d921d04f	fix long lines llvm-svn: 36031	2007-04-14 23:32:02 +00:00
Chris Lattner	7bfdd0abe1	Implement Transforms/InstCombine/vec_extract_elt.ll, transforming: define i32 @test(float %f) { %tmp7 = insertelement <4 x float> undef, float %f, i32 0 %tmp17 = bitcast <4 x float> %tmp7 to <4 x i32> %tmp19 = extractelement <4 x i32> %tmp17, i32 0 ret i32 %tmp19 } into: define i32 @test(float %f) { %tmp19 = bitcast float %f to i32 ; <i32> [#uses=1] ret i32 %tmp19 } On PPC, this is the difference between: _test: mfspr r2, 256 oris r3, r2, 8192 mtspr 256, r3 stfs f1, -16(r1) addi r3, r1, -16 addi r4, r1, -32 lvx v2, 0, r3 stvx v2, 0, r4 lwz r3, -32(r1) mtspr 256, r2 blr and: _test: stfs f1, -4(r1) nop nop nop lwz r3, -4(r1) blr llvm-svn: 36025	2007-04-14 23:02:14 +00:00
Chris Lattner	b37fb6a0da	Implement InstCombine/vec_demanded_elts.ll:test2. This allows us to turn unsigned test(float f) { return _mm_cvtsi128_si32( (__m128i) _mm_set_ss( f*f )); } into: _test: movss 4(%esp), %xmm0 mulss %xmm0, %xmm0 movd %xmm0, %eax ret instead of: _test: movss 4(%esp), %xmm0 mulss %xmm0, %xmm0 xorps %xmm1, %xmm1 movss %xmm0, %xmm1 movd %xmm1, %eax ret GCC gets: _test: subl $28, %esp movss 32(%esp), %xmm0 mulss %xmm0, %xmm0 xorps %xmm1, %xmm1 movss %xmm0, %xmm1 movaps %xmm1, %xmm0 movd %xmm0, 12(%esp) movl 12(%esp), %eax addl $28, %esp ret llvm-svn: 36020	2007-04-14 22:29:23 +00:00
Chris Lattner	efb33d28c6	Implement PR1201 and test/Transforms/InstCombine/malloc-free-delete.ll llvm-svn: 35981	2007-04-14 00:20:02 +00:00
Chris Lattner	74ff60ff84	Turn stuff like: icmp slt i32 %X, 0 ; <i1>:0 [#uses=1] sext i1 %0 to i32 ; <i32>:1 [#uses=1] into: %X.lobit = ashr i32 %X, 31 ; <i32> [#uses=1] This implements InstCombine/icmp.ll:test[34] llvm-svn: 35891	2007-04-11 06:57:46 +00:00
Chris Lattner	d0f7942e23	Simplify some comparisons to arithmetic, this implements: Transforms/InstCombine/icmp.ll llvm-svn: 35890	2007-04-11 06:53:04 +00:00
Chris Lattner	20f2372a7c	canonicalize (x <u 2147483648) -> (x >s -1) and (x >u 2147483647) -> (x <s 0) llvm-svn: 35886	2007-04-11 06:12:58 +00:00
Chris Lattner	7ddbff090a	fix a miscompilation of: define i32 @test(i32 %X) { entry: %Y = and i32 %X, 4 ; <i32> [#uses=1] icmp eq i32 %Y, 0 ; <i1>:0 [#uses=1] sext i1 %0 to i32 ; <i32>:1 [#uses=1] ret i32 %1 } by moving code out of commonIntCastTransforms into visitZExt. Simplify the APInt gymnastics in it etc. llvm-svn: 35885	2007-04-11 05:45:39 +00:00
Chris Lattner	467b69cabb	Strengthen the boundary conditions of this fold, implementing InstCombine/set.ll:test25 llvm-svn: 35852	2007-04-09 23:52:13 +00:00
Chris Lattner	a87c9f6114	Fix PR1304 and Transforms/InstCombine/2007-04-08-SingleEltVectorCrash.ll llvm-svn: 35792	2007-04-09 01:37:55 +00:00
Chris Lattner	4ca9cbb170	Eliminate useless insertelement instructions. This implements Transforms/InstCombine/vec_insertelt.ll and fixes PR1286. We now compile the code from that bug into: _foo: movl 4(%esp), %eax movdqa (%eax), %xmm0 movl 8(%esp), %ecx psllw (%ecx), %xmm0 movdqa %xmm0, (%eax) ret instead of: _foo: subl $4, %esp movl %ebp, (%esp) movl %esp, %ebp movl 12(%ebp), %eax movdqa (%eax), %xmm0 #IMPLICIT_DEF %eax pinsrw $2, %eax, %xmm0 xorl %ecx, %ecx pinsrw $3, %ecx, %xmm0 pinsrw $4, %eax, %xmm0 pinsrw $5, %ecx, %xmm0 pinsrw $6, %eax, %xmm0 pinsrw $7, %ecx, %xmm0 movl 8(%ebp), %eax movdqa (%eax), %xmm1 psllw %xmm0, %xmm1 movdqa %xmm1, (%eax) movl %ebp, %esp popl %ebp ret woo :) llvm-svn: 35788	2007-04-09 01:11:16 +00:00
Chris Lattner	c8d3788f71	reenable this xform, whoops :) llvm-svn: 35765	2007-04-08 08:01:49 +00:00
Chris Lattner	7621a031d8	Fix regression on Instcombine/apint-or2.ll llvm-svn: 35763	2007-04-08 07:55:22 +00:00
Chris Lattner	1150df9cc4	Generalize the code that handles (A&B)\|(A&C) to work where B/C are not constants. Add a new xform to simplify (A&B)\|(~A&C). THis implements InstCombine/or2.ll:test1 llvm-svn: 35760	2007-04-08 07:47:01 +00:00
Chris Lattner	3dbe65f80a	implement Transforms/InstCombine/malloc2.ll and PR1313 llvm-svn: 35700	2007-04-06 18:57:34 +00:00
Dale Johannesen	7c2001d014	Prevent transformConstExprCastCall from generating conversions that assert elsewhere. llvm-svn: 35668	2007-04-04 19:16:42 +00:00
Jeff Cohen	5a1c750f31	Fix 2007-04-04-BadFoldBitcastIntoMalloc.ll llvm-svn: 35665	2007-04-04 16:58:57 +00:00
Duncan Sands	f01a47c93c	Fix comment. llvm-svn: 35655	2007-04-04 06:42:45 +00:00
Chris Lattner	e5bbb3cb1a	Fix a bug I introduced with my patch yesterday which broke Qt (I converted some constant exprs to apints). Thanks to Anton for tracking down a small testcase that triggered this! llvm-svn: 35633	2007-04-03 23:29:39 +00:00
Chris Lattner	a74deafb13	reinstate the previous two patches, with a bugfix :) ldecod now passes. llvm-svn: 35626	2007-04-03 17:43:25 +00:00
Evan Cheng	7511fa280d	Reverting back to 1.723. The last two commits broke JM (and possibily others) on ARM. llvm-svn: 35620	2007-04-03 08:11:50 +00:00
Chris Lattner	64c764cebc	Split a whole ton of code out of visitICmpInst into visitICmpInstWithInstAndIntCst. llvm-svn: 35614	2007-04-03 04:46:52 +00:00
Chris Lattner	8b2ec5f506	Fix PR1253 and xor2.ll:test[01] llvm-svn: 35612	2007-04-03 01:47:41 +00:00
Zhou Sheng	9bc8ab100d	1. Make use of APInt operation instead of using ConstantExpr::getXXX. 2. Use cheaper APInt methods. llvm-svn: 35594	2007-04-02 13:45:30 +00:00
Zhou Sheng	56cda95658	Use uint32_t for bitwidth instead of unsigned. llvm-svn: 35593	2007-04-02 08:20:41 +00:00
Chris Lattner	9d5aacee92	Wrap long line llvm-svn: 35588	2007-04-02 05:48:58 +00:00
Chris Lattner	50490d54f2	use more obvious function name. llvm-svn: 35587	2007-04-02 05:42:22 +00:00
Chris Lattner	b24acc7bee	simplify (x+c)^signbit as (x+c+signbit), pointed out by PR1288. This implements test/Transforms/InstCombine/xor.ll:test28 llvm-svn: 35584	2007-04-02 05:36:22 +00:00
Chris Lattner	c3eeb42809	simplify this code, make it work for ap ints llvm-svn: 35561	2007-04-01 20:57:36 +00:00
Zhou Sheng	150f3bbab2	Avoid unnecessary APInt construction. llvm-svn: 35555	2007-04-01 17:13:37 +00:00
Reid Spencer	6bba6c8143	For PR1297: Support overloaded intrinsics bswap, ctpop, cttz, ctlz. llvm-svn: 35547	2007-04-01 07:35:23 +00:00
Chris Lattner	0427799531	Fix InstCombine/2007-03-31-InfiniteLoop.ll llvm-svn: 35536	2007-04-01 05:36:37 +00:00
Zhou Sheng	82c42284f4	Delete dead code. llvm-svn: 35525	2007-03-31 02:50:26 +00:00
Zhou Sheng	4f16402e0d	Use APInt operators to calculate the carry bits, remove this loop. llvm-svn: 35524	2007-03-31 02:38:39 +00:00
Zhou Sheng	fd28a33031	Make sure the use of ConstantInt::getZExtValue() for shift amount safe. llvm-svn: 35510	2007-03-30 17:20:39 +00:00
Zhou Sheng	b25806fa5f	1. Make sure the use of ConstantInt::getZExtValue() for getting shift amount is safe. 2. Use new method on ConstantInt instead of (? :) operator. 3. Use new method uge() on ConstantInt to simplify codes. llvm-svn: 35505	2007-03-30 09:29:48 +00:00
Zhou Sheng	5e60a4a6b0	Use APInt operation instead of ConstantExpr::getXX. llvm-svn: 35503	2007-03-30 05:45:18 +00:00
Zhou Sheng	b3a80b1d70	1. Make more use of APInt::getHighBitsSet/getLowBitsSet. 2. Let APInt variable do the binary operation stuff instead of using ConstantExpr::getXXX. llvm-svn: 35450	2007-03-29 08:15:12 +00:00
Zhou Sheng	444af49cc0	Clean up some codes in InstCombiner::SimplifyDemandedBits(). llvm-svn: 35446	2007-03-29 04:45:55 +00:00
Zhou Sheng	a4475575c0	Clean up codes in InstCombiner::SimplifyDemandedBits(): 1. Line out nested call of APInt::zext/trunc. 2. Make more use of APInt::getHighBitsSet/getLowBitsSet. 3. Use APInt[] operator instead of expression like "APIntVal & SignBit". llvm-svn: 35444	2007-03-29 02:26:30 +00:00
Zhou Sheng	4961cf1c06	1. Make the APInt variable do the binary operation stuff if possible instead of using ConstantExpr::getXX. 2. Use constant reference to APInt if possible instead of expensive APInt copy. llvm-svn: 35443	2007-03-29 01:57:21 +00:00
Zhou Sheng	117477e28b	Avoid unnecessary APInt construction. llvm-svn: 35431	2007-03-28 17:38:21 +00:00
Zhou Sheng	23f7a1c947	1. Make more use of getLowBitsSet/getHighBitsSet. 2. Use APInt[] instead of "X & SignBit". 3. Clean up some codes. 4. Make the expression like "ShiftAmt = ShiftAmtC->getZExtValue()" safe. llvm-svn: 35424	2007-03-28 15:02:20 +00:00
Zhou Sheng	2777a31850	1. Make more use of getLowBitsSet/getHighBitsSet. 2. Make the APInt value do the zext/trunc stuff instead of using ConstantExpr::getZExt(). llvm-svn: 35422	2007-03-28 09:19:01 +00:00
Zhou Sheng	c2d3309b99	Use UnknownBIts[BitWidth-1] instead of UnknownBIts & SignBits. llvm-svn: 35418	2007-03-28 05:15:57 +00:00
Zhou Sheng	18570b1f14	Remove unused APInt variable. llvm-svn: 35414	2007-03-28 03:02:21 +00:00
Zhou Sheng	57e3f7324b	Clean up codes in ComputeMaskedBits(): 1. Line out nested use of zext/trunc. 2. Make more use of getHighBitsSet/getLowBitsSet. 3. Use APInt[] != 0 instead of "(APInt & SignBit) != 0". llvm-svn: 35408	2007-03-28 02:19:03 +00:00
Reid Spencer	a5c18bf798	For PR1280: When converting an add/xor/and triplet into a trunc/sext, only do so if the intermediate integer type is a bitwidth that the targets can handle. llvm-svn: 35400	2007-03-28 01:36:16 +00:00
Evan Cheng	a4ed8a512a	Unbreaks non-debug builds. llvm-svn: 35383	2007-03-27 16:44:48 +00:00
Reid Spencer	54d5b1b8f8	Implement some minor review feedback. llvm-svn: 35373	2007-03-26 23:58:26 +00:00
Reid Spencer	441486c172	For PR1271: Fix another incorrectly converted shift mask. llvm-svn: 35371	2007-03-26 23:45:51 +00:00
Chris Lattner	d2602d5054	eliminate use of std::set llvm-svn: 35361	2007-03-26 20:40:50 +00:00
Reid Spencer	755d0e7ffc	Get better debug output by having modified instructions print both the original and new instruction. A slight performance hit with ostringstream but it is only for debug. Also, clean up an uninitialized variable warning noticed in a release build. llvm-svn: 35358	2007-03-26 17:44:01 +00:00
Reid Spencer	769a5a8e0b	Get the number of bits to set in a mask correct for a shl/lshr transform. llvm-svn: 35357	2007-03-26 17:18:58 +00:00
Reid Spencer	50898607a9	For PR1271: Fix SingleSource/Regression/C/2003-05-21-UnionBitFields.c by changing a getHighBitsSet call to getLowBitsSet call that was incorrectly converted from the original lshr constant expression. llvm-svn: 35348	2007-03-26 05:25:00 +00:00
Reid Spencer	52830327e9	For PR1271: Remove a use of getLowBitsSet that caused the mask used for replacement of shl/lshr pairs with an AND instruction to be computed incorrectly. Its not clear exactly why this is the case. This solves the disappearing shifts problem, but it doesn't fix Regression/C/2003-05-21-UnionBitFields. It seems there is more going on. llvm-svn: 35342	2007-03-25 21:11:44 +00:00
Chris Lattner	9bf53ffaa2	implement Transforms/InstCombine/cast2.ll:test3 and PR1263 llvm-svn: 35341	2007-03-25 20:43:09 +00:00
Reid Spencer	624766f8a2	Some cleanup from review: * Don't assume shift amounts are <= 64 bits * Avoid creating an extra APInt in SubOne and AddOne by using -- and ++ * Add another use of getLowBitsSet * Convert a series of if statements to a switch llvm-svn: 35339	2007-03-25 19:55:33 +00:00
Reid Spencer	80263aadf3	Refactor several ConstantExpr::getXXX calls with ConstantInt arguments using the facilities of APInt. While this duplicates a tiny fraction of the constant folding code, it also makes the code easier to read and avoids large ConstantExpr overhead for simple, known computations. llvm-svn: 35335	2007-03-25 05:33:51 +00:00
Zhou Sheng	222d5ebfd2	1. Avoid unnecessary APInt construction if possible. 2. Use isStrictlyPositive() instead of isPositive() in two places where they need APInt value > 0 not only >=0. llvm-svn: 35333	2007-03-25 05:01:29 +00:00
Reid Spencer	cd99fbdf3b	Make more uses of getHighBitsSet and get rid of some pointless & of an APInt with its type mask. llvm-svn: 35325	2007-03-25 04:26:16 +00:00
Reid Spencer	d8aad61d4d	More APIntification: * Convert the last use of a uint64_t that should have been an APInt. * Change ComputeMaskedBits to have a const reference argument for the Mask so that recursions don't cause unneeded temporaries. This causes temps to be needed in other places (where the mask has to change) but this change optimizes for the recursion which is more frequent. * Remove two instances of &ing a Mask with getAllOnesValue. Its not needed any more because APInt is accurate in its bit computations. * Start using the getLowBitsSet and getHighBits set methods on APInt instead of shifting. This makes it more clear in the code what is going on. llvm-svn: 35321	2007-03-25 02:03:12 +00:00
Chris Lattner	3a8248f79d	fix a regression on vector or instructions. llvm-svn: 35314	2007-03-24 23:56:43 +00:00
Zhou Sheng	e9ebd3f6ba	Make some codes more efficient. llvm-svn: 35297	2007-03-24 15:34:37 +00:00
Reid Spencer	a962d18774	For PR1205: Convert some calls to ConstantInt::getZExtValue() into getValue() and use APInt facilities in the subsequent computations. llvm-svn: 35294	2007-03-24 00:42:08 +00:00
Reid Spencer	959a21d3dc	For PR1205: * APIntify visitAdd and visitSelectInst * Remove unused uint64_t versions of utility functions that have been replaced with APInt versions. This completes most of the changes for APIntification of InstCombine. This passes llvm-test and llvm/test/Transforms/InstCombine/APInt. Patch by Zhou Sheng. llvm-svn: 35287	2007-03-23 21:24:59 +00:00
Reid Spencer	6d39206bc2	For PR1205: APIntify visitDiv, visitMul and visitRem. Patch by Zhou Sheng. llvm-svn: 35283	2007-03-23 20:05:17 +00:00
Chris Lattner	12b89cc148	switch AddReachableCodeToWorklist from being recursive to being iterative. llvm-svn: 35282	2007-03-23 19:17:18 +00:00
Reid Spencer	6274c72ee1	For PR1205: APIntify several utility functions supporting logical operators and shift operators. Patch by Zhou Sheng. llvm-svn: 35281	2007-03-23 18:46:34 +00:00
Zhou Sheng	0900993ebc	Make the "KnownZero ^ TypeMask" computation just once. llvm-svn: 35276	2007-03-23 03:13:21 +00:00
Zhou Sheng	755f04b5d7	Simplify the code. llvm-svn: 35275	2007-03-23 02:39:25 +00:00
Reid Spencer	b722f2b110	For PR1205: APInt support for logical operators in visitAnd, visitOr, and visitXor. Patch by Zhou Sheng. llvm-svn: 35273	2007-03-22 22:19:58 +00:00
Reid Spencer	4154e732e6	For PR1205: * APIntify commonIntCastTransforms * APIntify visitTrunc * APIntify visitZExt Patch by Zhou Sheng. llvm-svn: 35271	2007-03-22 20:56:53 +00:00
Reid Spencer	c3e3b8a32f	For PR1205: * Re-enable the APInt version of MaskedValueIsZero. * APIntify the Comput{Un}SignedMinMaxValuesFromKnownBits functions * APIntify visitICmpInst. llvm-svn: 35270	2007-03-22 20:36:03 +00:00
Dan Gohman	dcb291faa4	Change uses of Function::front to Function::getEntryBlock for readability. llvm-svn: 35265	2007-03-22 16:38:57 +00:00
Reid Spencer	f40711637f	For PR1248: * Fix some indentation and comments in InsertRangeTest * Add an "IsSigned" parameter to AddWithOverflow and make it handle signed additions. Also, APIntify this function so it works with any bitwidth. * For the icmp pred ([us]div %X, C1), C2 transforms, exit early if the div instruction's RHS is zero. * Finally, for icmp pred (sdiv %X, C1), -C2, fix an off-by-one error. The HiBound needs to be incremented in order to get the range test correct. llvm-svn: 35247	2007-03-21 23:19:50 +00:00
Zhou Sheng	b3949340c8	Simplify isHighOnes(). llvm-svn: 35211	2007-03-20 12:49:06 +00:00
Reid Spencer	6682721316	Make isOneBitSet faster by using APInt::isPowerOf2. Thanks Chris. llvm-svn: 35194	2007-03-20 00:16:52 +00:00
Reid Spencer	cc031a43aa	APIntify the isHighOnes utility function. llvm-svn: 35190	2007-03-19 21:29:50 +00:00
Reid Spencer	ef599b0786	Implement isMaxValueMinusOne in terms of APInt instead of uint64_t. Patch by Sheng Zhou. llvm-svn: 35188	2007-03-19 21:10:28 +00:00
Reid Spencer	3b93db72b4	Implement isMinValuePlusOne using facilities of APInt instead of uint64_t Patch by Zhou Sheng. llvm-svn: 35187	2007-03-19 21:08:07 +00:00
Reid Spencer	129a86792d	Implement isOneBitSet in terms of APInt::countPopulation. llvm-svn: 35186	2007-03-19 21:04:43 +00:00
Reid Spencer	450434ed65	1. Use APInt::getSignBit to reduce clutter (patch by Sheng Zhou) 2. Replace uses of the "isPositive" utility function with APInt::isPositive llvm-svn: 35185	2007-03-19 20:58:18 +00:00
Reid Spencer	03c31d5bb0	Remove a redundant clause in an if statement. Patch by Sheng Zhou. llvm-svn: 35184	2007-03-19 20:47:50 +00:00
Chris Lattner	0741842b3b	Implement InstCombine/and-xor-merge.ll:test[12]. Rearrange some code to simplify it now that shifts are binops llvm-svn: 35145	2007-03-18 22:51:34 +00:00
Zhou Sheng	d8c645b0ba	ShiftAmt might equal to zero. Handle this situation. llvm-svn: 35094	2007-03-14 09:07:33 +00:00
Zhou Sheng	b912844554	Enable KnownZero/One.clear(). llvm-svn: 35093	2007-03-14 03:21:24 +00:00
Chris Lattner	d1bce956b4	ifdef out some dead code. Fix PR1244 and Transforms/InstCombine/2007-03-13-CompareMerge.ll llvm-svn: 35082	2007-03-13 14:27:42 +00:00
Zhou Sheng	ebe634e662	For expression like "APInt::getAllOnesValue(ShiftAmt).zextOrCopy(BitWidth)", to handle ShiftAmt == BitWidth situation, use zextOrCopy() instead of zext(). llvm-svn: 35080	2007-03-13 06:40:59 +00:00
Zhou Sheng	af4341d441	In APInt version ComputeMaskedBits(): 1. Ensure VTy, KnownOne and KnownZero have same bitwidth. 2. Make code more efficient. llvm-svn: 35078	2007-03-13 02:23:10 +00:00
Reid Spencer	1791f23803	Add an APInt version of SimplifyDemandedBits. Patch by Zhou Sheng. llvm-svn: 35064	2007-03-12 17:25:59 +00:00
Reid Spencer	d9281784be	Add an APInt version of ShrinkDemandedConstant. Patch by Zhou Sheng. llvm-svn: 35063	2007-03-12 17:15:10 +00:00
Zhou Sheng	be171ee5cd	Avoid to assert on "(KnownZero & KnownOne) == 0". llvm-svn: 35062	2007-03-12 16:54:56 +00:00
Zhou Sheng	b3e00c4656	In function ComputeMaskedBits(): 1. Replace getSignedMinValue() with getSignBit() for better code readability. 2. Replace APIntOps::shl() with operator<<= for convenience. 3. Make APInt construction more effective. llvm-svn: 35060	2007-03-12 05:44:52 +00:00
Zhou Sheng	d1eb3d593e	Fix a bug in function ComputeMaskedBits(). llvm-svn: 35027	2007-03-08 15:15:18 +00:00
Zhou Sheng	387d7b1a35	Fix a bug in APIntified ComputeMaskedBits(). llvm-svn: 35022	2007-03-08 05:42:00 +00:00
Reid Spencer	bb5741fb02	For PR1205: Provide an APIntified version of MaskedValueIsZero. This will (temporarily) cause a "defined but not used" message from the compiler. It will be used in the next patch in this series. Patch by Sheng Zhou. llvm-svn: 35019	2007-03-08 01:52:58 +00:00
Reid Spencer	aa69640b10	For PR1205: Add a new ComputeMaskedBits function that is APIntified. We'll slowly convert things over to use this version. When its all done, we'll remove the existing version. llvm-svn: 35018	2007-03-08 01:46:38 +00:00
Reid Spencer	3939b1a274	Remove an unnecessary if statement and adjust indentation. llvm-svn: 34939	2007-03-05 23:36:13 +00:00
Chris Lattner	fe53cf2459	fix a subtle bug that caused an MSVC warning. Thanks to Jeffc for pointing this out. llvm-svn: 34920	2007-03-05 00:11:19 +00:00
Chris Lattner	5fdded1d2f	Add some simplifications for demanded bits, this allows instcombine to turn: define i64 @test(i64 %A, i32 %B) { %tmp12 = zext i32 %B to i64 ; <i64> [#uses=1] %tmp3 = shl i64 %tmp12, 32 ; <i64> [#uses=1] %tmp5 = add i64 %tmp3, %A ; <i64> [#uses=1] %tmp6 = and i64 %tmp5, 123 ; <i64> [#uses=1] ret i64 %tmp6 } into: define i64 @test(i64 %A, i32 %B) { %tmp6 = and i64 %A, 123 ; <i64> [#uses=1] ret i64 %tmp6 } This implements Transforms/InstCombine/add2.ll:test1 llvm-svn: 34919	2007-03-05 00:02:29 +00:00
Jeff Cohen	b622c11f77	Unbreak VC++ build. llvm-svn: 34917	2007-03-05 00:00:42 +00:00
Chris Lattner	ab2f913b68	simplify some code llvm-svn: 34914	2007-03-04 23:16:36 +00:00
Chris Lattner	8258b44b22	Speed up -instcombine by 20% by avoiding a particularly expensive passmgr call. llvm-svn: 34902	2007-03-04 04:27:24 +00:00
Chris Lattner	da1d04a057	my recent change caused a failure in a bswap testcase, because it changed the order that instcombine processed instructions in the testcase. The end result is that instcombine finished with: define i16 @test1(i16 %a) { %tmp = zext i16 %a to i32 ; <i32> [#uses=2] %tmp21 = lshr i32 %tmp, 8 ; <i32> [#uses=1] %tmp5 = shl i32 %tmp, 8 ; <i32> [#uses=1] %tmp.upgrd.32 = or i32 %tmp21, %tmp5 ; <i32> [#uses=1] %tmp.upgrd.3 = trunc i32 %tmp.upgrd.32 to i16 ; <i16> [#uses=1] ret i16 %tmp.upgrd.3 } which can't get matched as a bswap. This patch makes instcombine more sophisticated about removing truncating casts, allowing it to turn this into: define i16 @test2(i16 %a) { %tmp211 = lshr i16 %a, 8 %tmp52 = shl i16 %a, 8 %tmp.upgrd.323 = or i16 %tmp211, %tmp52 ret i16 %tmp.upgrd.323 } which then matches as bswap. This fixes bswap.ll and implements InstCombine/cast2.ll:test[12]. This also implements cast elimination of add/sub. llvm-svn: 34870	2007-03-03 05:27:34 +00:00
Chris Lattner	960a543037	add a top-level iteration loop to instcombine. This means that it will never finish without combining something it is capable of. llvm-svn: 34865	2007-03-03 02:04:50 +00:00
Chris Lattner	b15e2b182f	Fix a significant algorithm problem with the instcombine worklist. removing a value from the worklist required scanning the entire worklist to remove all entries. We now use a combination map+vector to prevent duplicates from happening and prevent the scan. This speeds up instcombine on a large file from the llvm-gcc bootstrap from 189.7s to 4.84s in a debug build and from 5.04s to 1.37s in a release build. llvm-svn: 34848	2007-03-02 21:28:56 +00:00
Chris Lattner	51f5457ad4	minor cleanup llvm-svn: 34846	2007-03-02 19:59:19 +00:00
Reid Spencer	24f1a0e78f	The 64-bit constructor for ConstantInt changes from int64_t to uint64_t. This caused a warning for construction with -1. Avoid the warning by using -1ULL instead. llvm-svn: 34796	2007-03-01 19:33:52 +00:00
Chris Lattner	c4d8e7e614	Fix InstCombine/2007-02-23-PhiFoldInfLoop.ll and PR1217 llvm-svn: 34546	2007-02-24 01:03:45 +00:00
Chris Lattner	99c6cf60f1	convert more vectors to smallvectors, 2.8% speedup llvm-svn: 34333	2007-02-15 22:52:10 +00:00
Chris Lattner	af6094fe3f	change some vectors to smallvectors. This speeds up instcombine on 447.dealII by 5%. llvm-svn: 34332	2007-02-15 22:48:32 +00:00
Chris Lattner	7907e5fe07	switch an std::set to a SmallPtr set, this speeds up instcombine by 9.5% on 447.dealII llvm-svn: 34323	2007-02-15 19:41:52 +00:00
Reid Spencer	d84d35ba70	For PR1195: Rename PackedType -> VectorType, ConstantPacked -> ConstantVector, and PackedTyID -> VectorTyID. No functional changes. llvm-svn: 34293	2007-02-15 02:26:10 +00:00
Chris Lattner	945e437c65	Generalize TargetData strings, to support more interesting forms of data. Patch by Scott Michel. llvm-svn: 34266	2007-02-14 05:52:17 +00:00
Chris Lattner	a06a8fd2d7	Eliminate use of ctors that take vectors. llvm-svn: 34219	2007-02-13 02:10:56 +00:00
Chris Lattner	a731513406	stop using methods that take vectors. llvm-svn: 34205	2007-02-12 22:56:41 +00:00
Chris Lattner	6e0123b17f	Simplify code by using value::takename llvm-svn: 34176	2007-02-11 01:23:03 +00:00
Chris Lattner	83ac5ae9f3	Fix miscompilations of consumer-typeset, telecomm-gsm, and 176.gcc. llvm-svn: 33902	2007-02-05 05:57:49 +00:00
Chris Lattner	0a28e90f2c	fix a miscompilation of 176.gcc llvm-svn: 33900	2007-02-05 04:09:35 +00:00
Chris Lattner	3e009e8b8f	rewrite shift/shift folding, now that types are not signed. llvm-svn: 33892	2007-02-05 00:57:54 +00:00
Reid Spencer	3f4e6e84dc	For PR1163: Make the Module's dependent library use a std::vector instead of SetVector adjust #includes in .cpp files because SetVector.h is no longer included. llvm-svn: 33855	2007-02-04 00:40:42 +00:00
Chris Lattner	6c344e56b1	remove some dead code llvm-svn: 33845	2007-02-03 23:28:07 +00:00
Reid Spencer	2f34b98cbf	Remove dead code and fix indentation per Chris' review comments. llvm-svn: 33785	2007-02-02 14:41:37 +00:00
Reid Spencer	0d5f9237b6	Use short form of binary operator create functions. llvm-svn: 33783	2007-02-02 14:08:20 +00:00
Chris Lattner	d5fea61d98	bugfix for reid's shift patch. llvm-svn: 33779	2007-02-02 05:29:55 +00:00
Reid Spencer	2341c22ec7	Changes to support making the shift instructions be true BinaryOperators. This feature is needed in order to support shifts of more than 255 bits on large integer types. This changes the syntax for llvm assembly to make shl, ashr and lshr instructions look like a binary operator: shl i32 %X, 1 instead of shl i32 %X, i8 1 Additionally, this should help a few passes perform additional optimizations. llvm-svn: 33776	2007-02-02 02:16:23 +00:00
Chris Lattner	c904205d28	Fix Transforms/InstCombine/2007-02-01-LoadSinkAlloca.ll, a serious code pessimization where instcombine can sink a load (good for code size) that prevents an alloca from being promoted by mem2reg (bad for everything). llvm-svn: 33771	2007-02-01 22:30:07 +00:00
Chris Lattner	416a8939c3	remove temporary vectors. llvm-svn: 33715	2007-01-31 20:08:52 +00:00
Chris Lattner	4fc18a4cb8	Revert another incorrectly applied chunk, which fixes InstCombine/vec_insert_to_shuffle.ll llvm-svn: 33705	2007-01-31 18:09:17 +00:00
Chris Lattner	f96f4a874c	eliminate temporary vectors llvm-svn: 33693	2007-01-31 04:40:53 +00:00
Chris Lattner	aa17576933	Move symbolic constant folding code to libanalysis. llvm-svn: 33688	2007-01-31 00:53:10 +00:00
Chris Lattner	024f4ab383	Adjust #includes to match movement of constant folding code from transformutils to libanalysis. llvm-svn: 33680	2007-01-30 23:46:24 +00:00
Chris Lattner	e3eda25641	pass TD to constant folding apis llvm-svn: 33674	2007-01-30 23:16:15 +00:00
Chris Lattner	2b15f2ba9d	remove some bits that are not yet meant to land. llvm-svn: 33666	2007-01-30 22:50:32 +00:00
Chris Lattner	4284f6463a	Symbolically evaluate constant expressions like &A[123] - &A[4].f. This occurs in C++ code like: #include <iostream> #include <iterator> int a[] = { 1, 2, 3, 4, 5 }; int main() { using namespace std; copy(a, a + sizeof(a)/sizeof(a[0]), ostream_iterator<int>(cout, "\n")); return 0; } Before we would decide the loop trip count is: sdiv (i32 sub (i32 ptrtoint (i32* getelementptr ([5 x i32]* @a, i32 0, i32 5) to i32), i32 ptrtoint ([5 x i32]* @a to i32)), i32 4) Now we decide it is "5". Amazing. This code will need to be refactored, but I'm doing that as a separate commit. llvm-svn: 33665	2007-01-30 22:32:46 +00:00
Reid Spencer	5301e7c605	For PR1136: Rename GlobalVariable::isExternal as isDeclaration to avoid confusion with external linkage types. llvm-svn: 33663	2007-01-30 20:08:39 +00:00
Chris Lattner	c8fb6de78c	Fix test/Transforms/InstCombine/2007-01-27-AndICmp.ll, a miscompilation of Mozilla that Anton tracked down. llvm-svn: 33591	2007-01-27 23:08:34 +00:00
Reid Spencer	31a4ef4dc1	Cleanup checks in the load and store of casted pointer transforms. Two changes: (1) don't special case for i1 any more, (2) use the new TargetData::getTypeSizeInBits method to ensure source and dest are the same bit width. llvm-svn: 33427	2007-01-22 05:51:25 +00:00
Reid Spencer	9a4bed06dd	Revise the store V, (cast P) -> store (cast V) -> P transform. We only want to do this if the src and destination types have the same bit width. This patch uses TargetData::getTypeSizeInBits() instead of making a special case for integer types and avoiding the transform if they don't match. llvm-svn: 33414	2007-01-20 23:35:48 +00:00
Chris Lattner	50ee0e40e5	Teach TargetData to handle 'preferred' alignment for each target, and use these alignment amounts to align scalars when we can. Patch by Scott Michel! llvm-svn: 33409	2007-01-20 22:35:55 +00:00
Reid Spencer	e928a15c9e	For this transform: store V, (cast P) -> store (cast V), P don't allow the transform if V and the pointer's element type are different width integer types. llvm-svn: 33371	2007-01-19 21:20:31 +00:00
Reid Spencer	a94d394ad2	For PR1043: This is the final patch for this PR. It implements some minor cleanup in the use of IntegerType, to wit: 1. Type::getIntegerTypeMask -> IntegerType::getBitMask 2. Type::IntTy changed to IntegerType from Type* 3. ConstantInt::getType() returns IntegerType* now, not Type* This also fixes PR1120. Patch by Sheng Zhou. llvm-svn: 33370	2007-01-19 21:13:56 +00:00
Chris Lattner	120ab038eb	Fix InstCombine/2007-01-18-VectorInfLoop.ll, a case where instcombine infinitely loops. llvm-svn: 33343	2007-01-18 22:16:33 +00:00
Reid Spencer	c050af9126	Clean up some code around the store V, (cast P) -> store (cast V), P transform. Change some variable names so it is clear what is source and what is dest of the cast. Also, add an assert to ensure that the integer to integer case is asserting if the bitwidths are different. This prevents illegal casts from being formed and catches bitwidth bugs sooner. llvm-svn: 33337	2007-01-18 18:54:33 +00:00
Chris Lattner	479a9fc492	Fix a regression in my isIntegral patch that broke 471.omnetpp. This is because TargetData::getTypeSize() returns the same for i1 and i8. This fix is not right for the full generality of bitwise types, but it fixes the regression. llvm-svn: 33237	2007-01-15 17:55:20 +00:00
Chris Lattner	c8dcede292	Implement InstCombine/phi.ll:test7, deletion of trivial value loops for induction variables. llvm-svn: 33234	2007-01-15 07:30:06 +00:00
Chris Lattner	27df1db485	simplify some code now that types are signless llvm-svn: 33232	2007-01-15 07:02:54 +00:00
Chris Lattner	a4beeef76c	delete stores to allocas with one use. This is a trivial form of DSE which often kicks in for ?: expressions. llvm-svn: 33231	2007-01-15 06:51:56 +00:00
Chris Lattner	03c4953cdd	rename Type::isIntegral to Type::isInteger, eliminating the old Type::isInteger. rename Type::getIntegralTypeMask to Type::getIntegerTypeMask. This makes naming much more consistent. For example, there are now no longer any instances of IntegerType that are not considered isInteger! :) llvm-svn: 33225	2007-01-15 02:27:26 +00:00
Chris Lattner	1942249c5b	Eliminate calls to isInteger, generalizing code and tightening checks as needed. llvm-svn: 33218	2007-01-15 01:55:30 +00:00
Chris Lattner	6ee923f3bb	instcombine has always been miscompiling fcmp x, x, disregarding possible NANs. This fixes PR1111 and Transforms/InstCombine/2007-01-14-FcmpSelf.ll llvm-svn: 33208	2007-01-14 19:42:17 +00:00
Chris Lattner	387bf3f700	Fix Transforms/InstCombine/2007-01-13-ExtCompareMiscompile.ll, which is part of PR1107 llvm-svn: 33185	2007-01-13 23:11:38 +00:00
Reid Spencer	7a9c62baa6	For PR1064: Implement the arbitrary bit-width integer feature. The feature allows integers of any bitwidth (up to 64) to be defined instead of just 1, 8, 16, 32, and 64 bit integers. This change does several things: 1. Introduces a new Derived Type, IntegerType, to represent the number of bits in an integer. The Type classes SubclassData field is used to store the number of bits. This allows 2^23 bits in an integer type. 2. Removes the five integer Type::TypeID values for the 1, 8, 16, 32 and 64-bit integers. These are replaced with just IntegerType which is not a primitive any more. 3. Adjust the rest of LLVM to account for this change. Note that while this incremental change lays the foundation for arbitrary bit-width integers, LLVM has not yet been converted to actually deal with them in any significant way. Most optimization passes, for example, will still only deal with the byte-width integer types. Future increments will rectify this situation. llvm-svn: 33113	2007-01-12 07:05:14 +00:00
Reid Spencer	cddc9dfe97	Implement review feedback for the ConstantBool->ConstantInt merge. Chris recommended that getBoolValue be replaced with getZExtValue and that get(bool) be replaced by get(const Type*, uint64_t). This implements those changes. llvm-svn: 33110	2007-01-12 04:24:46 +00:00
Reid Spencer	542964f55b	Rename BoolTy as Int1Ty. Patch by Sheng Zhou. llvm-svn: 33076	2007-01-11 18:21:29 +00:00
Zhou Sheng	bd23db9968	Remove unnecessary boolean type check. llvm-svn: 33075	2007-01-11 14:38:17 +00:00
Zhou Sheng	75b871fb1e	For PR1043: Merge ConstantIntegral and ConstantBool into ConstantInt. Remove ConstantIntegral and ConstantBool from LLVM. llvm-svn: 33073	2007-01-11 12:24:14 +00:00
Jeff Cohen	223004cd12	Unbreak VC++ build. llvm-svn: 33021	2007-01-08 20:17:17 +00:00
Reid Spencer	8f166b0ef3	Comparison of primitive type sizes should now be done in bits, not bytes. This patch converts getPrimitiveSize to getPrimitiveSizeInBits where it is appropriate to do so (comparison of integer primitive types). llvm-svn: 33012	2007-01-08 16:32:00 +00:00
Chris Lattner	fbc524fe87	relax some types llvm-svn: 32980	2007-01-07 06:58:05 +00:00
Chris Lattner	7051d758de	Fix regressions in InstCombine/call-cast-target.ll and InstCombine/2003-11-13-ConstExprCastCall.ll llvm-svn: 32959	2007-01-06 19:53:32 +00:00
Chris Lattner	c343a99786	this final call to canLosslesslyBitCastTo is dead, because ValueRequiresCast is only called on integers. llvm-svn: 32949	2007-01-06 02:11:56 +00:00
Chris Lattner	400f959a0c	simplify some more code now that there are not multiple different integer types of the same size llvm-svn: 32948	2007-01-06 02:09:32 +00:00
Chris Lattner	64d87b0215	eliminate some uses of canLosslesslyBitCastTo, this actually makes the code stronger, by nuking relational pointer comparisons with casts. llvm-svn: 32947	2007-01-06 01:45:59 +00:00
Chris Lattner	d7b6ea166d	Implement InstCombine/vec_shuffle.ll:%test7, simplifying shuffles with undef operands. llvm-svn: 32899	2007-01-05 07:36:08 +00:00
Chris Lattner	17c7c030c2	fold things like a^b != c^a -> b != c. This implements InstCombine/xor.ll:test27 llvm-svn: 32893	2007-01-05 03:04:57 +00:00
Chris Lattner	23eb8ec78b	Compile X + ~X to -1. This implements Instcombine/add.ll:test34 llvm-svn: 32890	2007-01-05 02:17:46 +00:00
Reid Spencer	6ff3e73db6	Death to useless bitcast instructions! llvm-svn: 32866	2007-01-04 05:23:51 +00:00
Reid Spencer	c635f47d9a	For PR950: This patch replaces signed integer types with signless ones: 1. [US]Byte -> Int8 2. [U]Short -> Int16 3. [U]Int -> Int32 4. [U]Long -> Int64. 5. Removal of isSigned, isUnsigned, getSignedVersion, getUnsignedVersion and other methods related to signedness. In a few places this warranted identifying the signedness information from other sources. llvm-svn: 32785	2006-12-31 05:48:39 +00:00
Reid Spencer	193df25eb9	For PR1066: Fix this by ensuring that a bitcast is inserted to do sign switching. This is only temporarily needed as the merging of signed and unsigned is next on the SignlessTypes plate. llvm-svn: 32757	2006-12-24 00:40:59 +00:00
Reid Spencer	910f23f7d7	Shut up some compilers that can't accurately analyze variable usage correctly and emit "may be used uninitialized" warnings. llvm-svn: 32756	2006-12-23 19:17:57 +00:00
Reid Spencer	43c77d53ff	For PR1065: Don't allow CmpInst instances to be processed in FoldSelectOpOp because you can't easily swap their operands. llvm-svn: 32753	2006-12-23 18:58:04 +00:00
Reid Spencer	266e42b312	For PR950: This patch removes the SetCC instructions and replaces them with the ICmp and FCmp instructions. The SetCondInst instruction has been removed and been replaced with ICmpInst and FCmpInst. llvm-svn: 32751	2006-12-23 06:05:41 +00:00
Chris Lattner	79a42ac941	Switch over Transforms/Scalar to use the STATISTIC macro. For each statistic converted, we lose a static initializer. This also allows GCC to emit warnings about unused statistics. llvm-svn: 32690	2006-12-19 21:40:18 +00:00
Reid Spencer	668d90f289	Convert the last uses of CastInst::createInferredCast to a normal cast creation. These changes are still temporary but at least this pushes knowledge of signedness out closer to where it can be determined properly and allows signedness to be removed from VMCore. llvm-svn: 32654	2006-12-18 08:47:13 +00:00
Reid Spencer	74a528b427	Fix a bug in EvaluateInDifferentType. The type of operand should not be used to determine whether a ZExt or SExt cast is performed. Instead, pass an "isSigned" bool to the function and determine its value from the opcode of the cast involved. Also, clean up some cruft from previous patches. llvm-svn: 32548	2006-12-13 18:21:21 +00:00
Reid Spencer	2a499b0b6c	Implement review feedback. Most of this has to do with removing unnecessary cast instructions. A few are bug fixes. llvm-svn: 32544	2006-12-13 17:19:09 +00:00
Reid Spencer	612683b0d7	For mul transforms, when checking for a cast from bool as either operand, make sure to also check that it is a zext from bool, not any other cast operation type. llvm-svn: 32539	2006-12-13 08:33:33 +00:00
Reid Spencer	799b5bfc71	Fix and/or/xor (cast A), (cast B) --> cast (and/or/xor A, B) The cast patch introduced the possibility that the wrong cast opcode could be used and that this transform could trigger on different kinds of cast operations. This patch rectifies that. llvm-svn: 32538	2006-12-13 08:27:15 +00:00
Reid Spencer	bb65ebf9a1	Replace inferred getCast(V,Ty) calls with more strict variants. Rename getZeroExtend and getSignExtend to getZExt and getSExt to match the the casting mnemonics in the rest of LLVM. llvm-svn: 32514	2006-12-12 23:36:14 +00:00
Chris Lattner	2dc148e89d	this can be trunc or bitcast, per line 3092. llvm-svn: 32487	2006-12-12 19:11:20 +00:00
Chris Lattner	ade1f6894d	Fix regression on 400.perlbench last night. llvm-svn: 32486	2006-12-12 18:41:03 +00:00
Reid Spencer	13bc5d7b57	Fix numerous inferred casts. llvm-svn: 32479	2006-12-12 09:18:51 +00:00
Bill Wendling	f3baad3ee1	Changed llvm_ostream et all to OStream. llvm_cerr, llvm_cout, llvm_null, are now cerr, cout, and NullStream resp. llvm-svn: 32298	2006-12-07 01:30:32 +00:00
Reid Spencer	4ae56f3086	Update ConstantIntegral Max/Min tests for new interface. llvm-svn: 32288	2006-12-06 20:39:57 +00:00
Chris Lattner	700b873130	Detemplatize the Statistic class. The only type it is instantiated with is 'unsigned'. llvm-svn: 32279	2006-12-06 17:46:33 +00:00
Chris Lattner	c209b584eb	add an instcombine xform. This speeds up 462.libquantum from 9.78s to 7.48s. This regression is due to unforseen consequences of the cast patch. llvm-svn: 32209	2006-12-05 01:26:29 +00:00
Reid Spencer	14fbdd5523	Update call to CastInst::getCastOpcode for its new signature. llvm-svn: 32166	2006-12-04 02:48:01 +00:00
Chris Lattner	7a002fec1f	disable transformations that are invalid for fp vectors. This fixes Transforms/InstCombine/2006-12-01-BadFPVectorXform.ll llvm-svn: 32112	2006-12-02 00:13:08 +00:00
Reid Spencer	ad05ee9f39	Remove 4 FIXMEs to hack around cast-to-bool problems which no longer exist. llvm-svn: 32051	2006-11-30 23:13:36 +00:00
Chris Lattner	960acb008b	implement cast.ll:test35. With this, we recognize: unsigned short swp(unsigned short a) { return ((a & 0xff00) >> 8 \| (a & 0x00ff) << 8); } as an idiom for bswap. llvm-svn: 32011	2006-11-29 07:18:39 +00:00
Chris Lattner	d747f015ff	Teach instcombine to turn trunc(srl x, c) -> srl (trunc(x), c) when safe. This implements InstCombine/cast.ll:test34. It fires hundreds of times on 176.gcc. llvm-svn: 32009	2006-11-29 07:04:07 +00:00
Chris Lattner	a7942b7bbd	Implement Regression/Transforms/InstCombine/bswap-fold.ll, folding seteq (bswap(x)), c -> seteq(x,bswap(c)) llvm-svn: 32006	2006-11-29 05:02:16 +00:00
Reid Spencer	a736fdf216	Join a split line. llvm-svn: 31996	2006-11-29 01:11:01 +00:00
Reid Spencer	116ad83aa0	Undo the last patch until 253.perlbmk passes with these changes. llvm-svn: 31977	2006-11-28 20:23:51 +00:00
Reid Spencer	59fe2d89ae	Remove 4 FIXME's from the CAST patch now that the back end is correctly producing code for "trunc to bool". This passes all tests on Linux. llvm-svn: 31963	2006-11-28 07:23:01 +00:00
Chris Lattner	8e9a7b73d9	Fix PR1014 and InstCombine/2006-11-27-XorBug.ll. llvm-svn: 31941	2006-11-27 19:55:07 +00:00
Reid Spencer	6c38f0bb07	For PR950: The long awaited CAST patch. This introduces 12 new instructions into LLVM to replace the cast instruction. Corresponding changes throughout LLVM are provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the exception of 175.vpr which fails only on a slight floating point output difference. llvm-svn: 31931	2006-11-27 01:05:10 +00:00
Bill Wendling	5dbf43c983	Removed #include <iostream> and replaced with llvm_* streams. llvm-svn: 31923	2006-11-26 09:46:52 +00:00
Chris Lattner	ec45a4c88c	This xform is handled by FoldOpIntoPhi in visitCastInst in a more elegant way. llvm-svn: 31889	2006-11-21 17:05:13 +00:00
Chris Lattner	e3a63d136d	Fix a gcc 4.2 warning. llvm-svn: 31751	2006-11-15 04:53:24 +00:00
Chris Lattner	f05d69ae72	implement InstCombine/shift-simplify.ll by transforming: (X >> Z) op (Y >> Z) -> (X op Y) >> Z for all shifts and all ops={and/or/xor}. llvm-svn: 31729	2006-11-14 07:46:50 +00:00
Chris Lattner	d12a4bf799	implement InstCombine/and-compare.ll:test1. This compiles: typedef struct { unsigned prefix : 4; unsigned code : 4; unsigned unsigned_p : 4; } tree_common; int foo(tree_common a, tree_common b) { return a->code == b->code; } into: _foo: movl 4(%esp), %eax movl 8(%esp), %ecx movl (%eax), %eax xorl (%ecx), %eax # TRUNCATE movb %al, %al shrb $4, %al testb %al, %al sete %al movzbl %al, %eax ret instead of: _foo: movl 8(%esp), %eax movb (%eax), %al shrb $4, %al movl 4(%esp), %ecx movb (%ecx), %cl shrb $4, %cl cmpb %al, %cl sete %al movzbl %al, %eax ret saving one cycle by eliminating a shift. llvm-svn: 31727	2006-11-14 06:06:06 +00:00
Chris Lattner	d4dee405cb	Fix InstCombine/2006-11-10-ashr-miscompile.ll a miscompilation introduced by the shr -> [al]shr patch. This was reduced from 176.gcc. llvm-svn: 31653	2006-11-10 23:38:52 +00:00
Chris Lattner	6e2c15c158	Teach ShrinkDemandedConstant how to handle X+C. This implements: add.ll:test33, add.ll:test34, shift-sra.ll:test2 llvm-svn: 31586	2006-11-09 05:12:27 +00:00
Chris Lattner	4f218d56f5	reenable factoring of GEP expressions, being more precise about the case that it bad to do. llvm-svn: 31563	2006-11-08 19:42:28 +00:00
Chris Lattner	cd62f11227	make this code more efficient by not creating a phi node we are just going to delete in the first place. This also makes it simpler. llvm-svn: 31562	2006-11-08 19:29:23 +00:00
Chris Lattner	a3acfca920	disable this factoring optzn for GEPs for now, this severely pessimizes some loops. llvm-svn: 31560	2006-11-08 18:49:31 +00:00
Reid Spencer	fdff938a7e	For PR950: This patch converts the old SHR instruction into two instructions, AShr (Arithmetic) and LShr (Logical). The Shr instructions now are not dependent on the sign of their operands. llvm-svn: 31542	2006-11-08 06:47:33 +00:00
Andrew Lenharth	0ebb0b03e6	The wrong parameter was being tested to deturmine i32 vs i64 llvm-svn: 31431	2006-11-03 22:45:50 +00:00
Reid Spencer	de46e48420	For PR786: Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting fall out by removing unused variables. Remaining warnings have to do with unused functions (I didn't want to delete code without review) and unused variables in generated code. Maintainers should clean up the remaining issues when they see them. All changes pass DejaGnu tests and Olden. llvm-svn: 31380	2006-11-02 20:25:50 +00:00
Reid Spencer	7eb55b395f	For PR950: Replace the REM instruction with UREM, SREM and FREM. llvm-svn: 31369	2006-11-02 01:53:59 +00:00
Chris Lattner	eebea43b48	Factor gep instructions through phi nodes. llvm-svn: 31346	2006-11-01 07:43:41 +00:00
Chris Lattner	14f82c7dcd	Turn a phi of many loads into a phi of the address and a single load of the result. This can significantly shrink code and exposes identities more aggressively. llvm-svn: 31344	2006-11-01 07:13:54 +00:00
Chris Lattner	dc826fc068	Fix a bug in the previous patch llvm-svn: 31342	2006-11-01 04:55:47 +00:00
Chris Lattner	cadac0c5c3	Fold things like "phi [add (a,b), add(c,d)]" into two phi's and one add. This triggers thousands of times on multisource. llvm-svn: 31341	2006-11-01 04:51:18 +00:00
Reid Spencer	00c482b7a2	Simplify code a bit by changing instances of: InsertNewInstBefore(new CastInst(Val, ValTy, Val->GetName()), I) into: InsertCastBefore(Val, ValTy, I) llvm-svn: 31204	2006-10-26 19:19:06 +00:00
Reid Spencer	7e80b0b31e	For PR950: Make necessary changes to support DIV -> [SUF]Div. This changes llvm to have three division instructions: signed, unsigned, floating point. The bytecode and assembler are bacwards compatible, however. llvm-svn: 31195	2006-10-26 06:15:43 +00:00
Chris Lattner	5dee3b2526	Fix miscompilation of MallocBench/espresso which code review pointed out but apparently didn't make it into the final patch. llvm-svn: 31070	2006-10-20 18:20:21 +00:00
Reid Spencer	e0fc4dfc22	For PR950: This patch implements the first increment for the Signless Types feature. All changes pertain to removing the ConstantSInt and ConstantUInt classes in favor of just using ConstantInt. llvm-svn: 31063	2006-10-20 07:07:24 +00:00
Devang Patel	5d417e35bc	While creating mask, use 1ULL instead of 1. llvm-svn: 31062	2006-10-20 01:16:56 +00:00
Devang Patel	5d6df959e3	It is OK to remove extra cast if operation is EQ/NE even though source and destination sign may not match but other conditions are met. llvm-svn: 31056	2006-10-19 20:59:13 +00:00
Devang Patel	88afd00d1d	Typo Typo. llvm-svn: 31055	2006-10-19 19:21:36 +00:00
Devang Patel	472530d9fc	Typo. llvm-svn: 31054	2006-10-19 19:05:38 +00:00
Devang Patel	b42aef4925	Fix bug in PR454 resolution. Added new test case. This fixes llvmAsmParser.cpp miscompile by llvm on PowerPC Darwin. llvm-svn: 31053	2006-10-19 18:54:08 +00:00
Reid Spencer	3c514959dd	Undo Chris' last patch, it caused a regression. llvm-svn: 30991	2006-10-16 23:08:08 +00:00

... 7 8 9 10 11 ...

1369 Commits