llvm-project

Commit Graph

Author	SHA1	Message	Date
Bill Wendling	dce6011eb5	Temporarily XFAIL this test. llvm-svn: 64987	2009-02-19 00:13:55 +00:00
Chris Lattner	778c62ccb5	add proper asmwriter and asmparser support for anonymous functions. llvm-svn: 64953	2009-02-18 21:48:13 +00:00
Devang Patel	66c5a1dd50	The subprogram die may not exist while creating "default" scope. llvm-svn: 64920	2009-02-18 17:29:38 +00:00
Dan Gohman	8078b8bddc	Use a sign-extend instead of a zero-extend when promoting a trip count value when the original loop iteration condition is signed and the canonical induction variable won't undergo signed overflow. This isn't required for correctness; it just preserves more information about original loop iteration values. Add a getTruncateOrSignExtend method to ScalarEvolution, following getTruncateOrZeroExtend. llvm-svn: 64918	2009-02-18 17:22:41 +00:00
Owen Anderson	ad4254935f	Add a test for r61358, which I forgot to add way back when. llvm-svn: 64904	2009-02-18 07:50:22 +00:00
Dan Gohman	b694533a54	Change the argument type in this test to something less convoluted, since it isn't actually used. llvm-svn: 64883	2009-02-18 04:25:04 +00:00
Evan Cheng	a40d5e14ab	GV with null value initializer shouldn't go to BSS if it's meant for a mergeable strings section. Currently it only checks for Darwin. Someone else please check if it should apply to other targets as well. llvm-svn: 64877	2009-02-18 02:19:52 +00:00
Dan Gohman	8212ebb5cf	Fix a corner case in the new indvars promotion logic: if there are multiple IV's in a loop, some of them may under go signed or unsigned wrapping even if the IV that's used in the loop exit condition doesn't. Restrict sign-extension-elimination and zero-extension-elimination to only those that operate on the original loop-controlling IV. llvm-svn: 64866	2009-02-18 00:52:00 +00:00
Duncan Sands	bf3ba5a1e9	If an alias is dead and so is its aliasee, then globaldce would crash because the alias would still be using the aliasee when the aliasee was deleted. llvm-svn: 64844	2009-02-17 23:05:26 +00:00
Devang Patel	f4dad74621	And now, not so elegant, test case... llvm-svn: 64838	2009-02-17 22:48:18 +00:00
Devang Patel	528987a1e8	Emit debug info for bitfields. llvm-svn: 64815	2009-02-17 21:23:59 +00:00
Chris Lattner	24f31a0e59	commit a tweaked version of Daniel's patch for PR3599. We now eliminate all the extensions and all but the one required truncate from the testcase, but the or/and/shift stuff still isn't zapped. llvm-svn: 64809	2009-02-17 20:47:23 +00:00
Evan Cheng	f505cd5ebb	A couple of places where reused use operands should be marked kill. This is exposed by recent availability fallthrough changes. llvm-svn: 64745	2009-02-17 06:41:03 +00:00
Devang Patel	19b9ed7a30	Testcase for rev. 64704 llvm-svn: 64705	2009-02-17 00:15:08 +00:00
Evan Cheng	161861deb0	Strengthen the "non-constant stride must dominate loop preheader" check. llvm-svn: 64703	2009-02-17 00:13:06 +00:00
Dan Gohman	f68d29edd5	Fix EnforceKnownAlignment so that it doesn't ever reduce the alignment of an alloca or global variable. llvm-svn: 64693	2009-02-16 23:02:21 +00:00
Devang Patel	ab0c5ecc54	Test case for llvm-gcc rev. 64648. llvm-svn: 64649	2009-02-16 19:24:29 +00:00
Dan Gohman	73f794af6a	Rename IndVarsSimplify to IndVarSimplify, to be consistent with the name used in the code that these tests are for. llvm-svn: 64624	2009-02-16 00:56:15 +00:00
Dan Gohman	9cdfd44521	Change these tests to use regular loads instead of llvm.x86.sse2.loadu.dq. Enhance instcombine to use the preferred field of GetOrEnforceKnownAlignment in more cases, so that regular IR operations are optimized in the same way that the intrinsics currently are. llvm-svn: 64623	2009-02-16 00:44:23 +00:00
Duncan Sands	b3f27881a9	If the target of an alias has internal linkage, then the alias can be morphed into the target. Implement this transform, and fix a crash in the existing transform at the same time. llvm-svn: 64583	2009-02-15 09:56:08 +00:00
Evan Cheng	2510436e20	Fix PR3522. It's not safe to sink into landing pad BB's. llvm-svn: 64582	2009-02-15 08:36:12 +00:00
Evan Cheng	e79841adbb	Fix pr3571: If stride is a value defined by an instruction, make sure it dominates the loop preheader. When IV users are strength reduced, the stride is inserted into the preheader. It could create a use before def situation. llvm-svn: 64579	2009-02-15 06:06:15 +00:00
Dan Gohman	671f2c085f	Extend the IndVarSimplify support for promoting induction variables: - Test for signed and unsigned wrapping conditions, instead of just testing for non-negative induction ranges. - Handle loops with GT comparisons, in addition to LT comparisons. - Support more cases of induction variables that don't start at 0. llvm-svn: 64532	2009-02-14 02:31:09 +00:00
Dale Johannesen	6e1b2e36a5	Testcase for llvm-gcc 64510. llvm-svn: 64511	2009-02-14 00:19:28 +00:00
Evan Cheng	c2fde91703	Teach x86 target -soft-float. llvm-svn: 64496	2009-02-13 22:36:38 +00:00
Nick Lewycky	d234a845f9	Mark strto* as readonly when the endptr is null. llvm-svn: 64460	2009-02-13 17:08:33 +00:00
Nick Lewycky	a0e83a0952	On strtod and friends, mark 'endptr' nocapture in the function prototype, and mark the first argument nocapture if endptr=NULL for each particular call. llvm-svn: 64453	2009-02-13 15:31:46 +00:00
Nick Lewycky	cdccffe731	Reapply r64300: Make sure the SCC pass manager initializes any contained function pass managers. Without this, simplify-libcalls would add nocapture attributes when run on its own, but not when run as part of -std-compile-opts or similar. llvm-svn: 64443	2009-02-13 07:15:53 +00:00
Nick Lewycky	c60bd012bc	BasicAA was making the assumption that a local allocation which hadn't escaped couldn't ever be the return of call instruction. However, it's quite possible that said local allocation is itself the return of a function call. That's what malloc and calloc are for, actually. llvm-svn: 64442	2009-02-13 07:06:27 +00:00
Dan Gohman	f71a473720	Fix the code that checked if a SCEVAddRecExpr Start contains an addrec in a different loop to check the value being added to the accumulated Start value, not the Start value before it has the new value added to it. This prevents LSR from going crazy on the included testcase. Dale, please review. llvm-svn: 64440	2009-02-13 03:58:31 +00:00
Dan Gohman	ba83228cdb	Fix LSR's IV sorting function to explicitly sort by bitwidth after sorting by stride value. This prevents it from missing IV reuse opportunities in a host-sensitive manner. llvm-svn: 64415	2009-02-13 00:26:43 +00:00
Dan Gohman	eb6be650ce	Teach IndVarSimplify to optimize code using the C "int" type for loop induction on LP64 targets. When the induction variable is used in addressing, IndVars now is usually able to inserst a 64-bit induction variable and eliminates the sign-extending cast. This is also useful for code using C "short" types for induction variables on targets with 32-bit addressing. Inserting a wider induction variable is easy; the tricky part is determining when trunc(sext(i)) expressions are no-ops. This requires range analysis of the loop trip count. A common case is when the original loop iteration starts at 0 and exits when the induction variable is signed-less-than a fixed value; this case is now handled. This replaces IndVarSimplify's OptimizeCanonicalIVType. It was doing the same optimization, but it was limited to loops with constant trip counts, because it was running after the loop rewrite, and the information about the original induction variable is lost by that point. Rename ScalarEvolution's executesAtLeastOnce to isLoopGuardedByCond, generalize it to be able to test for ICMP_NE conditions, and move it to be a public function so that IndVars can use it. llvm-svn: 64407	2009-02-12 22:19:27 +00:00
Nate Begeman	94aa38d568	Add suppport for ConstantExprs of shufflevectors whose result type is not equal to the type of the vectors being shuffled. llvm-svn: 64401	2009-02-12 21:28:33 +00:00
Dale Johannesen	655775293f	Arrange to print constants that match "n" and "i" constraints in inline asm as signed (what gcc does). Add partial support for x86-specific "e" and "Z" constraints, with appropriate signedness for printing. llvm-svn: 64400	2009-02-12 20:58:09 +00:00
Chris Lattner	aed3a4215b	fix the X86 backend to just drop llvm.declare nodes for VLAs instead of leaving them in the DAG and then getting selection errors. This is a fix for PR3538. llvm-svn: 64382	2009-02-12 17:33:11 +00:00
Chris Lattner	cede94ddc9	add PR llvm-svn: 64377	2009-02-12 17:04:57 +00:00
Evan Cheng	cf5cd6ecfe	It's (currently) not safe to keep certain physical registers live across basic blocks, e.g. x86 fp stack registers. llvm-svn: 64374	2009-02-12 10:32:17 +00:00
Evan Cheng	3a14efacb6	Replace one of burr scheduling heuristic with something more sensible. Now calcMaxScratches simply compute the number of true data dependencies. This actually improve a couple of tests in dejagnu suite as many tests in llvm nightly test suite. llvm-svn: 64369	2009-02-12 08:59:45 +00:00
Chris Lattner	feb129e813	Fix a nasty bug (PR3550) where the inline pass could incorrectly mark calls with the tail marker when inlining them through an invoke. Patch, testcase, and perfect analysis by Jay Foad! llvm-svn: 64364	2009-02-12 07:06:42 +00:00
Chris Lattner	5297c63565	fix PR3537: if resetting bbi back to the start of a block, we need to forget about already inserted expressions. llvm-svn: 64362	2009-02-12 06:56:08 +00:00
Chris Lattner	1331d53c27	rename test to avoid messing with tab completion of dates. llvm-svn: 64361	2009-02-12 06:54:55 +00:00
Evan Cheng	eb5ec4a0db	Remove a bogus assertion. It's possible a live-in available value is used by a previous instruction. llvm-svn: 64339	2009-02-11 23:41:57 +00:00
Dan Gohman	6571ef3577	Don't use special heuristics for nodes with no data predecessors unless they actually have data successors, and likewise for nodes with no data successors unless they actually have data precessors. llvm-svn: 64327	2009-02-11 21:29:39 +00:00
Daniel Dunbar	df8bc9fc7b	Update to match space changes in .ll llvm-svn: 64322	2009-02-11 20:48:21 +00:00
Dale Johannesen	cc5fc44d02	Make a transformation added in 63266 a bit less aggressive. It was transforming (x&y)==y to (x&y)!=0 in the case where y is variable and known to have at most one bit set (e.g. z&1). This is not correct; the expressions are not equivalent when y==0. I believe this patch salvages what can be salvaged, including all the cases in bt.ll. Dan, please review. Fixes gcc.c-torture/execute/20040709-[12].c llvm-svn: 64314	2009-02-11 19:19:41 +00:00
Bill Wendling	5f14a01340	Revert r64300 and r64301. These were causing the following errors respectively: During llvm-gcc bootstrap: Undefined symbols: "llvm::FPPassManager::doFinalization(llvm::Module&)", referenced from: (anonymous namespace)::CGPassManager::doFinalization(llvm::CallGraph&, llvm::Module&) in libLLVMipa.a(CallGraphSCCPass.o) "llvm::FPPassManager::doInitialization(llvm::Module&)", referenced from: (anonymous namespace)::CGPassManager::doInitialization(llvm::CallGraph&, llvm::Module&) in libLLVMipa.a(CallGraphSCCPass.o) ld: symbol(s) not found collect2: ld returned 1 exit status make[3]: *** [/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/obj-llvm/Release/bin/opt] Error 1 During an LLVM release build: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/Release/bin/tblgen -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target/X86 -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/include -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/include -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target -gen-register-desc -o /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Target/X86/Release/X86GenRegisterInfo.inc.tmp /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target/X86/X86.td llvm[3]: Building X86.td instruction names with tblgen /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/Release/bin/tblgen -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target/X86 -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/include -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/include -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target -gen-instr-enums -o /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Target/X86/Release/X86GenInstrNames.inc.tmp /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target/X86/X86.td llvm[3]: Building X86.td instruction information with tblgen /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/Release/bin/tblgen -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target/X86 -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/include -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/include -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target -gen-instr-desc -o /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Target/X86/Release/X86GenInstrInfo.inc.tmp /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target/X86/X86.td llvm[3]: Building X86.td assembly writer with tblgen /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/Release/bin/tblgen -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target/X86 -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/include -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/include -I /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target -gen-asm-writer -o /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Target/X86/Release/X86GenAsmWriter.inc.tmp /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Target/X86/X86.td llvm[3]: Compiling InstructionCombining.cpp for Release build if /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~dst/Developer/usr/bin/llvm-g++-4.2 -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Transforms/Scalar -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Transforms/Scalar -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -O3 -fno-exceptions -Woverloaded-virtual -pedantic -Wall -W -Wwrite-strings -Wno-long-long -Wunused -Wno-unused-parameter -fstrict-aliasing -Wstrict-aliasing -c -MMD -MP -MF "/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Transforms/Scalar/Release/InstructionCombining.d.tmp" -MT "/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Transforms/Scalar/Release/InstructionCombining.lo" -MT "/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Transforms/Scalar/Release/InstructionCombining.o" -MT "/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Transforms/Scalar/Release/InstructionCombining.d" /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore/lib/Transforms/Scalar/InstructionCombining.cpp -o /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Transforms/Scalar/Release/InstructionCombining.o ; \ then /bin/mv -f "/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Transforms/Scalar/Release/InstructionCombining.d.tmp" "/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.sh.build/lib/Trans llvm-svn: 64311	2009-02-11 18:19:24 +00:00
Duncan Sands	003754f656	Make sure the SCC pass manager initializes any contained function pass managers. Without this, simplify-libcalls would add nocapture attributes when run on its own, but not when run as part of -std-compile-opts or similar. llvm-svn: 64300	2009-02-11 09:58:43 +00:00
Evan Cheng	a1968b0fc7	Implement PR3495: local spiller optimization. The local spiller can now keep availability information over BB boundaries. It visits BB's in depth first order. After visiting a BB if it find a successor which has a single predecessor it visits the successor next without clearing the availability information. This allows the successor to omit reloads or change them into copies. llvm-svn: 64298	2009-02-11 08:24:21 +00:00
Devang Patel	316705027b	If llvm.dbg.region.end is disappearing then remove corresponding llvm.dbg.func.start also. llvm-svn: 64278	2009-02-11 01:29:06 +00:00
Devang Patel	654e47f366	Ignore dbg intrinsic while folding unconditional branch. llvm-svn: 64242	2009-02-10 22:14:17 +00:00
Evan Cheng	589a539423	Handle llvm.x86.sse2.maskmov.dqu in 64-bit. llvm-svn: 64240	2009-02-10 22:06:28 +00:00
Duncan Sands	6632f12c11	This is now done using a real i33, rather than an emulated one. Adjust the check. llvm-svn: 64236	2009-02-10 20:44:15 +00:00
Evan Cheng	ce3bbe515b	Fix PR3457: Ignore control successors when looking for closest scheduled successor. A control successor doesn't read result(s) produced by the scheduling unit being evaluated. llvm-svn: 64210	2009-02-10 08:30:11 +00:00
Devang Patel	4bed3565f3	Do not clone llvm.dbg.func.start and corresponding llvm.dbg.region.end during inlining. llvm-svn: 64209	2009-02-10 07:48:18 +00:00
Devang Patel	caf4485781	Enable scalar replacement of AllocaInst whose one of the user is dbg info. llvm-svn: 64207	2009-02-10 07:00:59 +00:00
Evan Cheng	e5ade4a9a1	Implement FpSET_ST1_*. llvm-svn: 64186	2009-02-09 23:32:07 +00:00
Dale Johannesen	cd19967754	Fix PR 3471, and some cleanups. llvm-svn: 64177	2009-02-09 22:14:15 +00:00
Evan Cheng	020588cee3	Make sure constant subscript is truncated to ptr size if it may not fit. llvm-svn: 64163	2009-02-09 20:54:38 +00:00
Duncan Sands	8c469be54b	Testcase for PR2437. llvm-svn: 64131	2009-02-09 09:41:49 +00:00
Evan Cheng	f736bd9c79	Re-enable machine sinking pass now that the coalescer bugs and the AnalyzeBrnach bug are fixed. llvm-svn: 64126	2009-02-09 08:45:39 +00:00
Bill Wendling	c743d39585	Rename dg.exp to llvmc.exp. This is so I can ignore it during a make check if I want to. llvm-svn: 64103	2009-02-08 22:52:50 +00:00
Mikhail Glushenkov	cc4c8e848a	The 'false.c' test must depend on llvm-g++. Also, turn on Objective-C/C++ tests. This should fix http://llvm.org/bugs/show_bug.cgi?id=3499. llvm-svn: 64084	2009-02-08 11:44:37 +00:00
Evan Cheng	b3783639cb	Fix PR3486. Fix a bug in code that manually patch physical register live interval after its sub-register is coalesced with a virtual register. llvm-svn: 64082	2009-02-08 11:04:35 +00:00
Evan Cheng	e5e95f7717	(no commit message) llvm-svn: 64073	2009-02-08 07:48:37 +00:00
Bill Wendling	5469ec1072	Revert r63999. It was breaking self-hosting builds. llvm-svn: 64062	2009-02-08 00:58:05 +00:00
Chris Lattner	4f8542f31d	testcase for r64049 of llvm-gcc. llvm-svn: 64050	2009-02-07 23:37:03 +00:00
Mon P Wang	21eb52a74f	Instrcombine should not change load(cast p) to cast(load p) if the cast changes the address space of the pointer. llvm-svn: 64035	2009-02-07 22:19:29 +00:00
Evan Cheng	9571621665	Enable machine sinking pass in non-fast mode. llvm-svn: 63999	2009-02-07 01:57:46 +00:00
Devang Patel	7cb8df4ce7	Ignore DbgInfoIntrinsics. llvm-svn: 63923	2009-02-06 06:19:06 +00:00
Chris Lattner	bbbb74372b	fix PR3489, use bits instead of bytes. llvm-svn: 63916	2009-02-06 04:34:07 +00:00
Evan Cheng	8ad4e0bb19	Fix test. It produces unexpected code if sse4.1 is on. llvm-svn: 63906	2009-02-06 01:49:19 +00:00
Devang Patel	409b794cfe	Ignore dbg intrinsics while propagating conditional expression info. Take 2. llvm-svn: 63898	2009-02-05 23:32:52 +00:00
Evan Cheng	2599084ac5	isAsCheapAsMove instructions can have register src operands. Check if they are really re-materializable. This fixes sse.expandfft and sse.stepfft. llvm-svn: 63890	2009-02-05 22:24:17 +00:00
Devang Patel	02f58e1e8d	Revert rev. 63876. It is causing llvm-gcc bootstrap failure. llvm-svn: 63888	2009-02-05 21:46:41 +00:00
Devang Patel	58cb603d2a	Remove dead blocks in the end. llvm-svn: 63880	2009-02-05 19:59:42 +00:00
Devang Patel	5922e26d1a	Ignore dbg intrinsics while propagating conditional expression info. llvm-svn: 63876	2009-02-05 19:15:39 +00:00
Chris Lattner	a936b393e4	testcase for rdar://6551276 and llvm-gcc r63873 llvm-svn: 63874	2009-02-05 18:15:17 +00:00
Evan Cheng	409c25f78d	Turn on machine LICM in non-fast mode. llvm-svn: 63855	2009-02-05 08:46:33 +00:00
Chris Lattner	7e1d2862ca	if we have a large GEP offset on a 32-bit or other target, make sure to print the value properly sext'd to the right pointer size. This fixes PR3481. llvm-svn: 63843	2009-02-05 06:55:21 +00:00
Devang Patel	086b212277	Ignore dbg intrinsics while folding switch instruction. llvm-svn: 63802	2009-02-05 00:30:42 +00:00
Devang Patel	916fdce16d	Ignore dbg intrinsics. llvm-svn: 63781	2009-02-04 21:39:48 +00:00
Mon P Wang	4cf5f3a7e5	Add test case for r63760. llvm-svn: 63774	2009-02-04 21:10:56 +00:00
Nate Begeman	94fefbc98e	Remove now-incorrect test. llvm-svn: 63772	2009-02-04 21:07:37 +00:00
Duncan Sands	e7d5479136	Allow the inverse transform x86_fp80 -> i80 (also fires during the Ada build). llvm-svn: 63731	2009-02-04 11:17:06 +00:00
Duncan Sands	1ea1173143	Fix PR3468: a crash when constant folding a bitcast of i80 to x86 long double (this was presumably generated by sroa). llvm-svn: 63730	2009-02-04 10:17:14 +00:00
Owen Anderson	1caf7fef8e	Finish making AliasAnalysis aware of the fact that most atomic intrinsics only dereference their arguments, and enhance BasicAA to make use of this fact when computing ModRef info. llvm-svn: 63718	2009-02-04 05:16:46 +00:00
Mon P Wang	4379a795fe	Fixes a case where we generate an incorrect mask for pshfhw in the presence of undefs and incorrectly determining if we have punpckldq. llvm-svn: 63702	2009-02-04 01:16:59 +00:00
Devang Patel	fd9f635103	While folding vallue comparison terminators ignore dbg intrinsics. llvm-svn: 63700	2009-02-04 01:06:11 +00:00
Devang Patel	f10e287c65	Ignore dbg intrinsics while hoisting common code in the two blocks up into the branch block. llvm-svn: 63687	2009-02-04 00:03:08 +00:00
Devang Patel	2032cadd0f	Do not let dbg intrinsic block folding of two entry phi node. llvm-svn: 63671	2009-02-03 22:12:02 +00:00
Chris Lattner	ef37dc8511	teach "convert from scalar" to handle loads of fca's. llvm-svn: 63659	2009-02-03 21:08:45 +00:00
Chris Lattner	18f56c295c	make scalar conversion handle stores of first class aggregate values. loads are not yet handled (coming soon to an sroa near you). llvm-svn: 63649	2009-02-03 19:30:11 +00:00
Chris Lattner	73eff2e6e8	Make SROA produce a vector only when the alloca is actually accessed at least once as a vector. This prevents it from compiling the example in not-a-vector into: define double @test(double %A, double %B) { %tmp4 = insertelement <7 x double> undef, double %A, i32 0 %tmp = insertelement <7 x double> %tmp4, double %B, i32 4 %tmp2 = extractelement <7 x double> %tmp, i32 4 ret double %tmp2 } instead, producing the integer code. Producing vectors when they aren't otherwise in the program is dangerous because a lot of other code treats them carefully and doesn't want to break them down. OTOH, many things want to break down tasty i448's. llvm-svn: 63638	2009-02-03 18:15:05 +00:00
Chris Lattner	8fc6561993	this produces an undefined result, just check that the alloca is gone and that sroa doesn't crash. llvm-svn: 63637	2009-02-03 18:13:00 +00:00
Duncan Sands	a77c5f758c	Fix PR3411. When replacing values, nodes are analyzed in any old order. Since analyzing a node analyzes its operands also, this can mean that when we pop a node off the list of nodes to be analyzed, it may already have been analyzed. llvm-svn: 63632	2009-02-03 10:23:33 +00:00
Evan Cheng	8542caa3f7	APInt'fy SimplifyDemandedVectorElts so it can analyze vectors with more than 64 elements. llvm-svn: 63631	2009-02-03 10:05:09 +00:00
Chris Lattner	80810b4c2d	add another case of undefined behavior without crashing, PR3466. llvm-svn: 63620	2009-02-03 07:08:57 +00:00
Nick Lewycky	05daea5d32	Revert r63600. It didn't fix the bug, it just moved it a bit. llvm-svn: 63618	2009-02-03 06:30:37 +00:00
Nick Lewycky	12a130bd06	Update the callgraph when replacing InvokeInst with CallInst when inlining. llvm-svn: 63600	2009-02-03 04:34:40 +00:00
Chris Lattner	fa4e35aca7	fix a bitcode reader bug where it can't handle extractelement correctly: the index of the value being extracted is always an i32. This fixes PR3465 llvm-svn: 63597	2009-02-03 02:11:28 +00:00
Chris Lattner	6aa6b1f263	Teach ConvertUsesToScalar to handle memset, allowing it to handle crazy cases like: struct f { int A, B, C, D, E, F; }; short test4() { struct f A; A.A = 1; memset(&A.B, 2, 12); return A.C; } llvm-svn: 63596	2009-02-03 02:01:43 +00:00
Dan Gohman	7aa0c17cff	Delete these two tests. They are specific to x86-64, and there's no reliable way to do this with the current dejagnu infrastructure. If someone can figure out how to fix these tests so that they test what they are intended to test without spuriously failing on any popular platforms, they are invited to reinstate them. llvm-svn: 63592	2009-02-03 01:33:26 +00:00
Chris Lattner	09b65ab288	rearrange how SRoA handles promotion of allocas to vectors. With the new world order, it can handle cases where the first store into the alloca is an element of the vector, instead of requiring the first analyzed store to have the vector type itself. This allows us to un-xfail test/CodeGen/X86/vec_ins_extract.ll. llvm-svn: 63590	2009-02-03 01:30:09 +00:00
Chris Lattner	a0ce5f060d	this test produces an undefined value, we don't care what it is, but we do want the alloca promoted. llvm-svn: 63587	2009-02-03 01:13:52 +00:00
Bill Wendling	b0ad6f9a6c	It fails on Linux. XFAIL that machine. llvm-svn: 63582	2009-02-03 00:35:11 +00:00
Bill Wendling	423f3bc196	This is passing for us. Should it have been reenabled? llvm-svn: 63580	2009-02-03 00:27:09 +00:00
Dan Gohman	7948ef5f87	Add explicit -march=x86 to these tests so that they don't default to -march=x86-64 on 64-bit hosts. llvm-svn: 63579	2009-02-03 00:20:22 +00:00
Dan Gohman	f58f0cbfd5	Fix another test to not use -mcpu=yonah with 64-bit code. llvm-svn: 63572	2009-02-02 23:43:59 +00:00
Dan Gohman	e862b3dd96	Yonah does not support x86-64. Change the -mcpu value to one that does. llvm-svn: 63561	2009-02-02 22:50:08 +00:00
Devang Patel	dd5dbca59c	Run dsymutil on darwin, when it is expected, before running gdb test. llvm-svn: 63548	2009-02-02 21:09:36 +00:00
Chris Lattner	c81fdd1773	xfail this for now, will fix shortly. llvm-svn: 63533	2009-02-02 18:15:33 +00:00
Chris Lattner	64217e6a28	update test llvm-svn: 63532	2009-02-02 18:12:58 +00:00
Chris Lattner	18eba4f211	Fix a bug which caused us to miscompile a couple of Ada tests. Thanks for the beautiful reduced testcase Duncan! llvm-svn: 63529	2009-02-02 18:02:59 +00:00
Devang Patel	97ba824ad9	Do not add redundant arguments in a method definition DIE. llvm-svn: 63527	2009-02-02 17:51:41 +00:00
Devang Patel	e7a112111a	Make this test case smaller. llvm-svn: 63526	2009-02-02 17:50:43 +00:00
Duncan Sands	7e4cb0a1cf	This passes on x86-32 linux at least. llvm-svn: 63508	2009-02-02 09:10:57 +00:00
Duncan Sands	dca376ff07	Make the XFAIL line actually match x86-32 targets. llvm-svn: 63507	2009-02-02 09:07:13 +00:00
Evan Cheng	50e15bdf81	Teach LowerBRCOND to recognize (xor (setcc x), 1). The xor inverts the condition. It's normally transformed by the dag combiner, unless the condition is set by a arithmetic op with overflow. llvm-svn: 63505	2009-02-02 08:07:36 +00:00
Chris Lattner	1f386b8ec8	Fix PR3372 llvm-svn: 63501	2009-02-02 07:24:28 +00:00
Chris Lattner	c4eb63d412	reduce testcase. llvm-svn: 63499	2009-02-02 06:55:45 +00:00
Torok Edwin	c418287974	add 2 more testcases for -mattr=-sse (r63495). --This line, and those below, will be ignaored-- A test/CodeGen/X86/nosse-error1.ll A test/CodeGen/X86/nosse-error2.ll llvm-svn: 63496	2009-02-01 18:24:20 +00:00
Torok Edwin	a2d1f35e9a	Implement -mno-sse: if SSE is disabled on x86-64, don't store XMM on stack for var-args, and don't allow FP return values llvm-svn: 63495	2009-02-01 18:15:56 +00:00
Duncan Sands	3ed768868d	Fix PR3453 and probably a bunch of other potential crashes or wrong code with codegen of large integers: eliminate the legacy getIntegerVTBitMask and getIntegerVTSignBit methods, which returned their value as a uint64_t, so couldn't handle huge types. llvm-svn: 63494	2009-02-01 18:06:53 +00:00
Nick Lewycky	f23908151a	Reinstate this optimization to fold icmp of xor when possible. Don't try to turn icmp eq a+x, b+x into icmp eq a, b if a+x or b+x has other uses. This may have been increasing register pressure leading to the bzip2 slowdown. llvm-svn: 63487	2009-01-31 21:30:05 +00:00
Chris Lattner	9e2b9f3234	Fix PR3452 (an infinite loop bootstrapping) by disabling the recent improvements to the EvaluateInDifferentType code. This code works by just inserted a bunch of new code and then seeing if it is useful. Instcombine is not allowed to do this: it can only insert new code if it is useful, and only when it is converging to a more canonical fixed point. Now that we iterate when DCE makes progress, this causes an infinite loop when the code ends up not being used. llvm-svn: 63483	2009-01-31 19:05:27 +00:00
Duncan Sands	41826036b1	Fix PR3401: when using large integers, the type returned by getShiftAmountTy may be too small to hold shift values (it is an i8 on x86-32). Before and during type legalization, use a large but legal type for shift amounts: getPointerTy; afterwards use getShiftAmountTy, fixing up any shift amounts with a big type during operation legalization. Thanks to Dan for writing the original patch (which I shamelessly pillaged). llvm-svn: 63482	2009-01-31 15:50:11 +00:00
Chris Lattner	76a63ed099	now that all the pieces are in place, teach instcombine's simplifydemandedbits to simplify instructions with multiple uses in contexts where it can get away with it. This allows it to simplify the code in multi-use-or.ll into a single 'add double'. This change is particularly interesting because it will cover up for some common codegen bugs with large integers created due to the recent SROA patch. When working on fixing those bugs, this should be disabled. llvm-svn: 63481	2009-01-31 08:40:03 +00:00
Chris Lattner	94cfb281c3	make sure to set Changed=true when instcombine hacks on the code, not doing so prevents it from properly iterating and prevents it from deleting the entire body of dce-iterate.ll llvm-svn: 63476	2009-01-31 07:04:22 +00:00
Mon P Wang	b6080cf943	Used "-enable-unsafe-fp-math" to allow this transformation - (a * b -c) = c - a *b. llvm-svn: 63475	2009-01-31 06:50:54 +00:00
Mon P Wang	cf9ba82324	If unsafe FP optimization is not set, don't allow -(A-B) => B-A because when A==B, -0.0 != +0.0. llvm-svn: 63474	2009-01-31 06:07:45 +00:00
Chris Lattner	ec99c46d44	Simplify and generalize the SROA "convert to scalar" transformation to be able to handle ANY alloca that is poked by loads and stores of bitcasts and GEPs with constant offsets. Before the code had a number of annoying limitations and caused it to miss cases such as storing into holes in structs and complex casts (as in bitfield-sroa) where we had unions of bitfields etc. This also handles a number of important cases that are exposed due to the ABI lowering stuff we do to pass stuff by value. One case that is pretty great is that we compile 2006-11-07-InvalidArrayPromote.ll into: define i32 @func(<4 x float> %v0, <4 x float> %v1) nounwind { %tmp10 = call <4 x i32> @llvm.x86.sse2.cvttps2dq(<4 x float> %v1) %tmp105 = bitcast <4 x i32> %tmp10 to i128 %tmp1056 = zext i128 %tmp105 to i256 %tmp.upgrd.43 = lshr i256 %tmp1056, 96 %tmp.upgrd.44 = trunc i256 %tmp.upgrd.43 to i32 ret i32 %tmp.upgrd.44 } which turns into: _func: subl $28, %esp cvttps2dq %xmm1, %xmm0 movaps %xmm0, (%esp) movl 12(%esp), %eax addl $28, %esp ret Which is pretty good code all things considering :). One effect of this is that SROA will start generating arbitrary bitwidth integers that are a multiple of 8 bits. In the case above, we got a 256 bit integer, but the codegen guys assure me that it can handle the simple and/or/shift/zext stuff that we're doing on these operations. This addresses rdar://6532315 llvm-svn: 63469	2009-01-31 02:28:54 +00:00
Devang Patel	c094970cd2	Each input file is encoded as a separate compile unit in LLVM debugging information output. However, many target specific tool chains prefer to encode only one compile unit in an object file. In this situation, the LLVM code generator will include debugging information entities in the compile unit that is marked as main compile unit. The code generator accepts maximum one main compile unit per module. If a module does not contain any main compile unit then the code generator will emit multiple compile units in the output object file. [Part 1] Update DebugInfo APIs to accept optional boolean value while creating DICompileUnit to mark the unit as "main" unit. By defaults all units are considered non-main. Update SourceLevelDebugging.html to document "main" compile unit. Update DebugInfo APIs to not accept and encode separate source file/directory entries while creating various llvm.dbg.* entities. There was a recent, yet to be documented, change to include this additional information so no documentation changes are required here. Update DwarfDebug to handle "main" compile unit. If "main" compile unit is seen then all DIEs are inserted into "main" compile unit. All other compile units are used to find source location for llvm.dbg.* values. If there is not any "main" compile unit then create unique compile unit DIEs for each llvm.dbg.compile_unit. [Part 2] Create separate llvm.dbg.compile_unit for each input file. Mark compile unit create for main_input_filename as "main" compile unit. Use appropriate compile unit, based on source location information collected from the tree node, while creating llvm.dbg.* values using DebugInfo APIs. --- This is Part 1. llvm-svn: 63400	2009-01-30 18:20:31 +00:00
Zhou Sheng	1e36fbb5ed	This is case is to uncover the bug in IntrinsicLowering.cpp, the LowerPartSet(). It didn't handle the situation correctly when the low, high argument values are in reverse order (low > high) with 'Val' type is i32 (a corner case). llvm-svn: 63386	2009-01-30 08:59:51 +00:00
Devang Patel	acbb381cc4	Enable target tripple. llvm-svn: 63361	2009-01-30 01:40:58 +00:00
Devang Patel	a103ee4f0d	Linux and other target's encoding for DW_AT_declaration may not match. llvm-svn: 63360	2009-01-30 01:37:30 +00:00
Devang Patel	4ba91058d2	Add DW_AT_declaration for class methods. llvm-svn: 63356	2009-01-30 01:21:46 +00:00
Owen Anderson	ad89c410e6	XFAIL this test. It only worked before because of a bug in the spill point selection code. Not deleting because it should be possible to enhance the selection code to handle this in the future. llvm-svn: 63340	2009-01-29 22:27:56 +00:00
Evan Cheng	a160d4af82	Local register allocator shouldn't assume only the entry and landing pad basic blocks have live-ins. llvm-svn: 63323	2009-01-29 18:37:30 +00:00
Dan Gohman	ef04ed5477	In the case of an extractelement on an insertelement value, the element indices may be equal if either one is not a constant. llvm-svn: 63311	2009-01-29 16:10:46 +00:00
Evan Cheng	a115859df0	Add a always_inline test case. llvm-svn: 63304	2009-01-29 09:31:54 +00:00
Evan Cheng	45799abe61	Add a test case for Chris lvalue alignment fixes. llvm-svn: 63300	2009-01-29 08:59:46 +00:00
Evan Cheng	76a2736c74	Exit with nice warnings when register allocator run out of registers. llvm-svn: 63267	2009-01-29 02:20:59 +00:00
Dan Gohman	e58ab79f33	Make x86's BT instruction matching more thorough, and add some dagcombines that help it match in several more cases. Add several more cases to test/CodeGen/X86/bt.ll. This doesn't yet include matching for BT with an immediate operand, it just covers more register+register cases. llvm-svn: 63266	2009-01-29 01:59:02 +00:00
Mon P Wang	9150f735fa	Fixed lowering of v816 shuffles. llvm-svn: 63252	2009-01-28 23:11:14 +00:00
Bill Wendling	42b63bc175	Make test platform agnostic. llvm-svn: 63247	2009-01-28 22:20:56 +00:00
Dan Gohman	d21775ae0e	Give this test an explicit target, to make it host-independent. llvm-svn: 63244	2009-01-28 22:14:58 +00:00
Devang Patel	d7ecb3b661	Do not forget to derived type while constructing an array type. llvm-svn: 63233	2009-01-28 21:08:20 +00:00
Chris Lattner	df17987c19	Fix some issues with volatility, move "CanConvertToScalar" check after the others. llvm-svn: 63227	2009-01-28 20:16:43 +00:00
Chris Lattner	1498e62117	strengthen this test. llvm-svn: 63222	2009-01-28 19:29:30 +00:00
Evan Cheng	f31f288863	The memory alignment requirement on some of the mov{h\|l}p{d\|s} patterns are 16-byte. That is overly strict. These instructions read / write f64 memory locations without alignment requirement. llvm-svn: 63195	2009-01-28 08:35:02 +00:00
Mon P Wang	d880efc005	Added sse test patterns for r62979 and r63193. llvm-svn: 63194	2009-01-28 08:13:56 +00:00
Mikhail Glushenkov	2115d09a10	Add three new option properties. Adds new option properties 'multi_val', 'one_or_more' and 'zero_or_one'. llvm-svn: 63172	2009-01-28 03:47:20 +00:00
Bill Wendling	fd03bdd00c	Add testcase for r63142. llvm-svn: 63149	2009-01-27 23:00:53 +00:00
Evan Cheng	1bc8af207e	Implement multiple with overflow by 2 with an add instruction. llvm-svn: 63090	2009-01-27 03:30:42 +00:00
Evan Cheng	ce95cddd0f	Forgot this test case. llvm-svn: 63089	2009-01-27 02:59:39 +00:00
Dan Gohman	52e907a780	Add a FrontendC testcase for the x86-64 Red Zone feature, to help verify that the feature may be disabled through the -mno-red-zone option. llvm-svn: 63079	2009-01-27 00:59:55 +00:00
Devang Patel	45c899cd15	Assorted debug info fixes. - DW_AT_bit_size is only suitable for bitfields. - Encode source location info for derived types. - Source location and type size info is not useful for subroutine_type (info is included in respective DISubprogram) and array_type. llvm-svn: 63077	2009-01-27 00:45:04 +00:00
Dan Gohman	8738997c11	Add a regression test for x86-64 red zone usage. llvm-svn: 63075	2009-01-27 00:40:27 +00:00
Dale Johannesen	03490f0ce1	Testcase for 6522054. llvm-svn: 63067	2009-01-26 23:22:19 +00:00
Duncan Sands	d77e476921	Fix PR3393, which amounts to a bug in the expensive checking logic. Rather than make the checking more complicated, I've tweaked some logic to make things conform to how the checking thought things ought to be, since this results in a simpler "mental model". llvm-svn: 63048	2009-01-26 21:54:18 +00:00
Dan Gohman	ac272eaf13	At Nick Lewycky's request, rename this test with a more informative name. llvm-svn: 63042	2009-01-26 21:36:31 +00:00
Evan Cheng	6c7e85142b	Enhance logic in X86DAGToDAGISel::PreprocessForRMW which move load inside callseq_start to allow it to be folded into a call. It was not considering the cases where a token factor is between the load and the callseq_start. llvm-svn: 63022	2009-01-26 18:43:34 +00:00
Mon P Wang	3537a62704	Fixed optimization of combining two shuffles where the first shuffle inputs has a different number of elements than the output. llvm-svn: 62998	2009-01-26 04:39:00 +00:00
Scott Michel	9e3e4a9219	CellSPU: - Rename fcmp.ll test to fcmp32.ll, start adding new double tests to fcmp64.ll - Fix select_bits.ll test - Capitulate to the DAGCombiner and move i64 constant loads to instruction selection (SPUISelDAGtoDAG.cpp). <rant>DAGCombiner will insert all kinds of 64-bit optimizations after operation legalization occurs and now we have to do most of the work that instruction selection should be doing twice (once to determine if v2i64 build_vector can be handled by SelectCode(), which then runs all of the predicates a second time to select the necessary instructions.) But, CellSPU is a good citizen.</rant> llvm-svn: 62990	2009-01-26 03:31:40 +00:00
Chris Lattner	9449991c4f	Handle single-entry phi nodes gracefully in condprop. llvm-svn: 62985	2009-01-26 02:18:20 +00:00
Chris Lattner	7b6647c178	Fix PR3408 by making a non-obvious assumption very obvious, and handling the flaw inherent in that assumption. :) llvm-svn: 62984	2009-01-26 02:11:30 +00:00
Nate Begeman	5eca265519	Map address space 256 to gs; similar mappings could be supported for the other x86 segments. address space 0 is stack/default, 1-255 are reserved for client use. llvm-svn: 62980	2009-01-26 01:24:32 +00:00
Torok Edwin	97be2f5840	revert this patch for now, because Codegen does still want to generate SSE code, for example in the case of va-args. XFAIL associated tests. llvm-svn: 62972	2009-01-25 20:21:24 +00:00
Torok Edwin	3cc1940003	testcase for llvm-gcc part of PR3402. llvm-svn: 62969	2009-01-25 18:00:06 +00:00
Torok Edwin	a23c73bbdc	If user explicitly asks not to use SSE, don't force it. This fixes LLVM part of PR3402. llvm-svn: 62967	2009-01-25 17:58:56 +00:00
Nick Lewycky	cb7a10ab63	Actually run the test in this directory. llvm-svn: 62957	2009-01-25 08:05:07 +00:00
Nick Lewycky	5647c5d1a4	The function that does nothing but call malloc is noalias return. llvm-svn: 62956	2009-01-25 07:59:57 +00:00
Evan Cheng	1c7c019229	Private linkage support for PPC / Darwin. llvm-svn: 62955	2009-01-25 06:32:01 +00:00
Evan Cheng	abda665f5f	Teach 2addr pass to be do more commuting. If both uses of a two-address instruction are killed, but the first operand has a use before and after the def, commute if the second operand does not suffer from the same issue. %reg1028<def> = EXTRACT_SUBREG %reg1027<kill>, 1 %reg1029<def> = MOV8rr %reg1028 %reg1029<def> = SHR8ri %reg1029, 7, %EFLAGS<imp-def,dead> insert => %reg1030<def> = MOV8rr %reg1028 %reg1030<def> = ADD8rr %reg1028<kill>, %reg1029<kill>, %EFLAGS<imp-def,dead> In this case, it might not be possible to coalesce the second MOV8rr instruction if the first one is coalesced. So it would be profitable to commute it: %reg1028<def> = EXTRACT_SUBREG %reg1027<kill>, 1 %reg1029<def> = MOV8rr %reg1028 %reg1029<def> = SHR8ri %reg1029, 7, %EFLAGS<imp-def,dead> insert => %reg1030<def> = MOV8rr %reg1029 %reg1030<def> = ADD8rr %reg1029<kill>, %reg1028<kill>, %EFLAGS<imp-def,dead> llvm-svn: 62954	2009-01-25 03:53:59 +00:00
Dan Gohman	73fd5a5651	Add a PR comment to this test. llvm-svn: 62921	2009-01-24 17:32:54 +00:00
Torok Edwin	f4395ea97a	testcase for PR3381. Also it was an empty struct, not a void after all. llvm-svn: 62920	2009-01-24 17:16:04 +00:00
Chris Lattner	72cd68fe64	Make InstCombineStoreToCast handle aggregates more aggressively, handling the case in Transforms/InstCombine/cast-store-gep.ll, which is a heavily reduced testcase from Clang on x86-64. llvm-svn: 62904	2009-01-24 01:00:13 +00:00
Devang Patel	486d309b34	Fix test case. Use valid file name and directory in global variable's debug info entry. llvm-svn: 62883	2009-01-23 21:54:18 +00:00
Chris Lattner	3f4591c89f	fix two more cases where we could let the NLPDI cache get unsorted. With this, sqlite3 now passes. llvm-svn: 62839	2009-01-23 07:12:16 +00:00
Evan Cheng	f347c3615b	Update test to reflect command line option name change. llvm-svn: 62836	2009-01-23 05:45:31 +00:00
Dan Gohman	1f3411de47	Don't create ISD::FNEG nodes after legalize if they aren't legal. Simplify x+0 to x in unsafe-fp-math mode. This avoids a bunch of redundant work in many cases, because in unsafe-fp-math mode, ISD::FADD with a constant is considered free to negate, so the DAGCombiner often negates x+0 to -0-x thinking it's free, when in reality the end result is -x, which is more expensive than x. Also, combine x*0 to 0. This fixes PR3374. llvm-svn: 62789	2009-01-22 21:58:43 +00:00
Devang Patel	dec7fe2e71	Do not use buggy llvm-gcc to generate testcases. llvm-svn: 62770	2009-01-22 18:28:11 +00:00
Duncan Sands	e3a26635fb	Remove no-longer relevant comment. Pointed out by Gabor. llvm-svn: 62765	2009-01-22 15:37:29 +00:00
Duncan Sands	ac6f7eeb05	This passes on linux. llvm-svn: 62764	2009-01-22 15:07:15 +00:00
Chris Lattner	bed6be62e4	fix a testcase. llvm-svn: 62758	2009-01-22 07:08:58 +00:00
Chris Lattner	f09619d533	Fix PR3358, a really nasty bug where recursive phi translated analyses could be run without the caches properly sorted. This can fix all sorts of weirdness. Many thanks to Bill for coming up with the 'issorted' verification idea. llvm-svn: 62757	2009-01-22 07:04:01 +00:00
Bill Wendling	6cf1f8fd5b	Now with RUN line. llvm-svn: 62716	2009-01-21 21:28:03 +00:00
Bill Wendling	ba11cd338b	Run this through -simplifycfg and -mem2reg to test only what we need to test. llvm-svn: 62714	2009-01-21 21:02:27 +00:00
Dale Johannesen	1f86498f93	Do not use host floating point types when emitting ASCII IR; loading and storing these can change the bits of NaNs on some hosts. Remove or add warnings at a few other places using host floating point; this is a bad thing to do in general. llvm-svn: 62712	2009-01-21 20:32:55 +00:00
Dan Gohman	7e6b932f18	Simplify ReduceLoadWidth's logic: it doesn't need several different special cases after producing the new reduced-width load, because the new load already has the needed adjustments built into it. This fixes several bugs due to the special cases, including PR3317. llvm-svn: 62692	2009-01-21 15:17:51 +00:00
Dan Gohman	b43c8996f2	Fix a recent regression. ClrOpcode is not set for i8; for i8, if we want to clear %ah to zero before a division, just use a zero-extending mov to %al. This fixes PR3366. llvm-svn: 62691	2009-01-21 14:50:16 +00:00
Mikhail Glushenkov	bf9716e15d	Allow hooks with arguments. llvm-svn: 62685	2009-01-21 13:04:00 +00:00
Duncan Sands	d56cf3025f	This was causing invalid memory accesses when generating debug info in the compiler. llvm-svn: 62684	2009-01-21 11:51:17 +00:00
Duncan Sands	1de451d0d0	Let's try to have our cake and eat it to: move this test into FrontendC to ensure that llvm-gcc is available; assemble using "llvm-gcc -xassembler" rather than "as". llvm-svn: 62683	2009-01-21 11:37:31 +00:00
Duncan Sands	696f4a8598	Don't rely on grep -w working. llvm-svn: 62682	2009-01-21 09:41:42 +00:00
Scott Michel	ed7d79fce4	CellSPU: - Ensure that (operation) legalization emits proper FDIV libcall when needed. - Fix various bugs encountered during llvm-spu-gcc build, along with various cleanups. - Start supporting double precision comparisons for remaining libgcc2 build. Discovered interesting DAGCombiner feature, which is currently solved via custom lowering (64-bit constants are not legal on CellSPU, but DAGCombiner insists on inserting one anyway.) - Update README. llvm-svn: 62664	2009-01-21 04:58:48 +00:00
Evan Cheng	201501995f	Favors generating "not" over "xor -1". For example. unsigned test(unsigned a) { return ~a; } llvm used to generate: movl $4294967295, %eax xorl 4(%esp), %eax Now it generates: movl 4(%esp), %eax notl %eax It's 3 bytes shorter. llvm-svn: 62661	2009-01-21 02:09:05 +00:00
Dale Johannesen	287b4bc44e	Disable on x86_64 until I figure out what's wrong. llvm-svn: 62660	2009-01-21 02:08:30 +00:00
Dale Johannesen	b5721632ee	Make special cases (0 inf nan) work for frem. Besides APFloat, this involved removing code from two places that thought they knew the result of frem(0., x) but were wrong. llvm-svn: 62645	2009-01-21 00:35:19 +00:00
Owen Anderson	be7a29de0b	Be more aggressive about renumbering vregs after splitting them. llvm-svn: 62639	2009-01-21 00:13:28 +00:00
Devang Patel	6bbacbe372	Appropriately mark fowrad decls. llvm-svn: 62625	2009-01-20 22:27:02 +00:00
Devang Patel	6fbec1c230	Need compile unit to find location. llvm-svn: 62624	2009-01-20 22:26:11 +00:00
Dale Johannesen	e75fdb0510	Calls to fmod, it turns out, are constant-folded by invoking the host fmod, not by lowering to frem and constant-folding that. Fix this so it tests what I want to test. llvm-svn: 62622	2009-01-20 21:58:13 +00:00
Chris Lattner	f8a8c13c1e	Don't bother running the assembler, we don't know that it will be configured for whatever llc defaults to. This fixes PR3363 llvm-svn: 62619	2009-01-20 21:41:53 +00:00
Evan Cheng	f1e873a221	Fix PR3243: a LiveVariables bug. When HandlePhysRegKill is checking whether the last reference is also the last def (i.e. dead def), it should also check if last reference is the current machine instruction being processed. This can happen when it is processing a physical register use and setting the current machine instruction as sub-register's last ref. llvm-svn: 62617	2009-01-20 21:25:12 +00:00
Evan Cheng	4022b7c3f4	Add test case for PR3154. llvm-svn: 62604	2009-01-20 19:29:54 +00:00
Duncan Sands	489c5484d3	Check that the "don't barf on k8" fix is not accidentally reverted again. llvm-svn: 62587	2009-01-20 18:08:39 +00:00
Bill Wendling	a908b60fb2	Temporarily XFAIL until this can be looked at. r62557 is what caused it to start failing. llvm-svn: 62578	2009-01-20 10:28:39 +00:00
Bill Wendling	1d9c8e5522	Testcase for limited precision stuff. llvm-svn: 62572	2009-01-20 06:23:59 +00:00
Chris Lattner	c59945b4bd	another fix for PR3354 llvm-svn: 62561	2009-01-20 01:15:41 +00:00
Dan Gohman	161b7b66ac	Fix a dagcombine to not generate loads of non-round integer types, as its comment says, even in the case where it will be generating extending loads. This fixes PR3216. llvm-svn: 62557	2009-01-20 01:06:45 +00:00
Evan Cheng	8f79775a66	Make linear scan's trivial coalescer slightly more aggressive. llvm-svn: 62547	2009-01-20 00:16:18 +00:00
Chris Lattner	ea9f1d3c47	Fix a problem exposed by PR3354: simplifycfg was making a potentially trapping instruction be executed unconditionally. llvm-svn: 62541	2009-01-19 23:03:13 +00:00
Dale Johannesen	d067ecd1c7	Move & restructure test per review. llvm-svn: 62538	2009-01-19 22:33:12 +00:00
Chris Lattner	7eeb1cc605	convert this to an unfoldable potentially trapping constant expr. llvm-svn: 62536	2009-01-19 22:12:33 +00:00
Dan Gohman	cd0b1bf0a0	Fix SelectionDAG::ReplaceAllUsesWith to behave correctly when uses are added to the From node while it is processing From's use list, because of automatic local CSE. The fix is to avoid visiting any new uses. Fix a few places in the DAGCombiner that assumed that after a RAUW call, the From node has no users and may be deleted. This fixes PR3018. llvm-svn: 62533	2009-01-19 21:44:21 +00:00
Chris Lattner	6f34e317e9	Fix PR3353, infinitely jump threading an infinite loop make from switches. llvm-svn: 62529	2009-01-19 21:20:34 +00:00
Dale Johannesen	740e98704d	compile-time fmod was done incorrectly. PR 3316. llvm-svn: 62528	2009-01-19 21:17:05 +00:00
Devang Patel	8c8aa2ac29	Verify Intrinsic::dbg_declare. llvm-svn: 62526	2009-01-19 21:00:48 +00:00
Evan Cheng	44cc554311	DIVREM isel deficiency: If sign bit is known zero, zero out DX/EDX/RDX instead of sign extending the low part (in AX/EAX/RAX) into it. llvm-svn: 62519	2009-01-19 19:06:11 +00:00
Nick Lewycky	ee22611e33	Port this test from dejagnu to unit testing. The way this worked before was to test APInt by running "lli -force-interpreter=true" knowing the lli uses APInt under the hood to store its values. Now, we test APInt directly. llvm-svn: 62514	2009-01-19 18:08:33 +00:00
Bill Wendling	534d2e0bae	Temporarily revert r62487. It's causing this error during a release bootstrap of llvm-gcc. Most likely, it's miscompiling one of the "gen*" programs: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.6.0/bin/ -c -g -O2 -mdynamic-no-pic -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -mdynamic-no-pic -DHAVE_CONFIG_H -DGENERATOR_FILE -I. -Ibuild -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/build -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -o build/gencondmd.o build/gencondmd.c ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: excess elements in struct initializer ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: (near initialization for 'insn_conditions[4]') ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected ',' or ';' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:927: error: expected identifier or '(' before ',' token ../../llvm-gcc.src/gcc/config/i386/sse.md:3458: error: expected identifier or '(' before ',' token ... llvm-svn: 62506	2009-01-19 08:46:20 +00:00
Evan Cheng	7e9ef4d776	Now not UINT_TO_FP is legal (it's marked custom), dag combiner won't optimize it to a SINT_TO_FP when the sign bit is known zero. X86 isel should perform the optimization itself. llvm-svn: 62504	2009-01-19 08:08:22 +00:00
Chris Lattner	f2bb4ea39c	Fix PR3016, a bug which can occur do to an invalid assumption: we assumed a CFG structure that would be valid when all code in the function is reachable, but not all code is necessarily reachable. Do a simple, but horrible, CFG walk to check for this case. llvm-svn: 62487	2009-01-19 02:46:28 +00:00
Chris Lattner	64b7bd7f9e	Fix rdar://6505632, an llc crash on 483.xalancbmk llvm-svn: 62470	2009-01-18 20:35:00 +00:00
Nick Lewycky	e5be1cd635	Forgot this in the previous checkin: fopen now has nocapture, realloc is supposed to take two arguments. llvm-svn: 62457	2009-01-18 04:46:10 +00:00
Bill Wendling	9880a2cb2f	Testcase for last commit. llvm-svn: 62418	2009-01-17 07:42:44 +00:00
Evan Cheng	bf38a5e540	Fix MatchAddress bug that's preventing negative displacement from being folded in 64-bit mode. llvm-svn: 62413	2009-01-17 07:09:27 +00:00
Mon P Wang	ca6d6dea0b	Simplify extract element of a scalar to vector. llvm-svn: 62383	2009-01-17 00:07:25 +00:00
Evan Cheng	41e9f6a854	Fix PPC ISD::Declare isel and eliminate the need for PPCTargetLowering::LowerGlobalAddress to check if isVerifiedDebugInfoDesc() is true. Given the recent changes, it would falsely return true for a lot of GlobalAddressSDNode's. llvm-svn: 62373	2009-01-16 22:57:32 +00:00
Dan Gohman	f1002495e3	Disable the post-RA scheduler on this test, since it uses a simple %prcontext which doesn't find what it's looking for if the scheduler has rearranged the instructions. llvm-svn: 62363	2009-01-16 21:40:12 +00:00
Evan Cheng	968e2e7b3d	CreateVirtualRegisters does trivial copy coalescing. If a node def is used by a single CopyToReg, it reuses the virtual register assigned to the CopyToReg. This won't work for SDNode that is a clone or is itself cloned. Disable this optimization for those nodes or it can end up with non-SSA machine instructions. llvm-svn: 62356	2009-01-16 20:57:18 +00:00
Chris Lattner	db2d9613d2	Fix PR3335 by not turning a store to one address space into a store to another. llvm-svn: 62351	2009-01-16 20:12:52 +00:00
Bill Wendling	e04334730e	Add support for non-zero __builtin_return_address values on X86. llvm-svn: 62338	2009-01-16 19:25:27 +00:00
Evan Cheng	2d9e40ed24	This is now passing. llvm-svn: 62308	2009-01-16 06:59:14 +00:00
Evan Cheng	beac6f8b0c	Clean up previous cast optimization a bit. Also make zext elimination a bit more aggressive: if it's not necessary to emit an AND (i.e. high bits are already zero), it's profitable to evaluate the operand at a different type. llvm-svn: 62297	2009-01-16 02:11:43 +00:00
Devang Patel	fa1b408b3b	Do not stumble over forward declared struct member. llvm-svn: 62288	2009-01-16 00:50:53 +00:00
Devang Patel	76d190cf4a	Validate dbg_* intrinsics before lowering them. llvm-svn: 62286	2009-01-15 23:41:32 +00:00
Mon P Wang	e248edff1b	Added missing support to widen an operand from a bit convert. llvm-svn: 62285	2009-01-15 22:43:38 +00:00
Rafael Espindola	f2831d6cd1	Fix Alpha test and support for private linkage. llvm-svn: 62282	2009-01-15 21:51:46 +00:00
Mon P Wang	ebfafee903	Expand insert/extract of a <4 x i32> with a variable index. llvm-svn: 62281	2009-01-15 21:10:20 +00:00
Rafael Espindola	6de96a1b5d	Add the private linkage. llvm-svn: 62279	2009-01-15 20:18:42 +00:00
Devang Patel	851cdaf1fd	Use lightweight DebugInfo objects directly. llvm-svn: 62276	2009-01-15 19:26:23 +00:00
Devang Patel	8bdc698336	Use variable's context to identify respective DbgScope. Use light weight DebugInfo object directly. llvm-svn: 62269	2009-01-15 18:25:17 +00:00
Evan Cheng	60e19a46f2	- Teach CanEvaluateInDifferentType of this xform: sext (zext ty1), ty2 -> zext ty2 - Looking at the number of sign bits of the a sext instruction to determine whether new trunc + sext pair should be added when its source is being evaluated in a different type. llvm-svn: 62263	2009-01-15 17:01:23 +00:00
Richard Osborne	40119780a8	Don't fold address calculations which use negative offsets into the ADDRspii addressing mode. llvm-svn: 62258	2009-01-15 11:32:30 +00:00
Scott Michel	a292fc6d6b	- Convert remaining i64 custom lowering into custom instruction emission sequences in SPUDAGToDAGISel.cpp and SPU64InstrInfo.td, killing custom DAG node types as needed. - i64 mul is now a legal instruction, but emits an instruction sequence that stretches tblgen and the imagination, as well as violating laws of several small countries and most southern US states (just kidding, but looking at a function with 80+ parameters is really weird and just plain wrong.) - Update tests as needed. llvm-svn: 62254	2009-01-15 04:41:47 +00:00
Chris Lattner	8fb9480ed2	Fix PR3325, a miscompilation of invokes by IPSCCP. Patch by Jay Foad! llvm-svn: 62244	2009-01-14 21:01:16 +00:00
Devang Patel	08e5e62f98	xfail for now. llvm-svn: 62243	2009-01-14 20:10:24 +00:00
Richard Osborne	4359325ba8	Add pseudo instructions to the XCore for (load\|store\|load address) of a frame index. eliminateFrameIndex will replace these instructions with (LDWSP\|STWSP\|LDAWSP) or (LDW\|STW\|LDAWF) if a frame pointer is in use. This fixes PR 3324. Previously we used LDWSP, STWSP, LDAWSP before frame pointer elimination. However since they were marked as implicitly using SP they could not be rematerialised. llvm-svn: 62238	2009-01-14 18:26:46 +00:00
Dale Johannesen	1f0e0e7c9c	Fix the time regression I introduced in 464.h264ref with my earlier patch to this file. The issue there was that all uses of an IV inside a loop are actually references to Base[IV2], and there was one use outside that was the same but LSR didn't see the base or the scaling because it didn't recurse into uses outside the loop; thus, it used base+IVscale mode inside the loop instead of pulling base out of the loop. This was extra bad because register pressure later forced both base and IV into memory. Doing that recursion, at least enough to figure out addressing modes, is a good idea in general; the change in AddUsersIfInteresting does this. However, there were side effects.... It is also possible for recursing outside the loop to introduce another IV where there was only 1 before (if the refs inside are not scaled and the ref outside is). I don't think this is a common case, but it's in the testsuite. It is right to be very aggressive about getting rid of such introduced IVs (CheckForIVReuse and the handling of nonzero RewriteFactor in StrengthReduceStridedIVUsers). In the testcase in question the new IV produced this way has both a nonconstant stride and a nonzero base, neither of which was handled before. And when inserting new code that feeds into a PHI, it's right to put such code at the original location rather than in the PHI's immediate predecessor(s) when the original location is outside the loop (a case that couldn't happen before) (RewriteInstructionToUseNewBase); better to avoid making multiple copies of it in this case. Also, the mechanism for keeping SCEV's corresponding to GEP's no longer works, as the GEP might change after its SCEV is remembered, invalidating the SCEV, and we might get a bad SCEV value when looking up the GEP again for a later loop. This also couldn't happen before, as we weren't recursing into GEP's outside the loop. Also, when we build an expression that involves a (possibly non-affine) IV from a different loop as well as an IV from the one we're interested in (containsAddRecFromDifferentLoop), don't recurse into that. We can't do much with it and will get in trouble if we try to create new non-affine IVs or something. More testcases are coming. llvm-svn: 62212	2009-01-14 02:35:31 +00:00
Chris Lattner	2538eb664c	rewrite OptimizeAwayTrappingUsesOfLoads to 1) avoid a temporary vector and extraneous loop over it, 2) not delete globals used by phis/selects etc which could actually be useful. This fixes PR3321. Many thanks to Duncan for narrowing this down. llvm-svn: 62201	2009-01-14 00:12:58 +00:00
Dan Gohman	b8f5ba6781	Disable the register+memory forms of the bt instructions for now. Thanks to Eli for pointing out that these forms don't ignore the high bits of their index operands, and as such are not immediately suitable for use by isel. llvm-svn: 62194	2009-01-13 23:23:30 +00:00
Dale Johannesen	0aeabdff57	Fix testsuite regressions from recursive inlining. llvm-svn: 62189	2009-01-13 22:43:37 +00:00
Dan Gohman	1407484178	The list-td and list-tdrr schedulers don't yet support physreg scheduling dependencies. Add assertion checks to help catch this. It appears the Mips target defaults to list-td, and it has a regression test that uses a physreg dependence. Such code was liable to be miscompiled, and now evokes an assertion failure. llvm-svn: 62177	2009-01-13 20:24:13 +00:00
Dan Gohman	59af77376c	Make instcombine ensure that all allocas are explicitly aligned at at least their preferred alignment. llvm-svn: 62176	2009-01-13 20:18:38 +00:00
Duncan Sands	ffc6133318	When replacing uses and the same node is reached via two paths, process it once not twice, d'oh! Analysis, testcase and original patch thanks to Mon Ping Wang. llvm-svn: 62169	2009-01-13 15:17:14 +00:00
Duncan Sands	ab2fd9e4b9	Mark this XFAIL for the moment. llvm-svn: 62168	2009-01-13 15:15:46 +00:00
Nick Lewycky	52348300a4	Wind SCEV back in time, to Nov 18th. This 'fixes' PR3275, PR3294, PR3295, PR3296 and PR3302. llvm-svn: 62160	2009-01-13 09:18:58 +00:00
Evan Cheng	f343168f1f	FIX llvm-gcc bootstrap on x86_64 linux. If a virtual register is copied to a physical register, it's not necessarily defined by a copy. We have to watch out it doesn't clobber any sub-register that might be live during its live interval. If the live interval crosses a basic block, then it's not safe to check with the less conservative check (by scanning uses and defs) because it's possible a sub-register might be live out of the block. llvm-svn: 62144	2009-01-13 03:57:45 +00:00
Devang Patel	76007e009e	Use DebugInfo interface to lower dbg_* intrinsics. llvm-svn: 62126	2009-01-13 00:32:17 +00:00
Dale Johannesen	433a9086c0	Enable recursive inlining. Reduce inlining threshold back to 200; 400 seems to be too high, loses more than it gains. llvm-svn: 62107	2009-01-12 22:11:50 +00:00
Evan Cheng	2adb5cfb48	Second test is only valid in 32-bit mode. llvm-svn: 62084	2009-01-12 08:05:54 +00:00
Evan Cheng	0258874607	Test for r62076. llvm-svn: 62077	2009-01-12 03:46:55 +00:00
Evan Cheng	b2c42c648d	Fix PR3241: Currently EmitCopyFromReg emits a copy from the physical register to a virtual register unless it requires an expensive cross class copy. That means we are only treating "expensive to copy" register dependency as physical register dependency. Also future proof the scheduler to handle "normal" physical register dependencies. The code is not exercised yet. llvm-svn: 62074	2009-01-12 03:19:55 +00:00
Evan Cheng	8e7d88b916	This is a dup of pr2659.ll. llvm-svn: 62029	2009-01-10 19:06:32 +00:00
Evan Cheng	ed74d8ac2a	Duplicated node may produce a non-physical register def. llvm-svn: 62015	2009-01-09 22:44:02 +00:00
Evan Cheng	c1f5a659de	Add test case from PR2659. llvm-svn: 62006	2009-01-09 21:01:31 +00:00
Chris Lattner	ae0e857b98	Fix PR3304 llvm-svn: 61995	2009-01-09 18:18:43 +00:00
Dan Gohman	ea1086b7f2	PR2659 was fixed by r61847. Add the testcase as a regression test. llvm-svn: 61986	2009-01-09 08:16:12 +00:00
Chris Lattner	f50aa6ae5c	Implement rdar://6480391, extending of equality icmp's to avoid a truncation. I noticed this in the code compiled for a routine using std::map, which produced this code: %25 = tail call i32 @memcmp(i8* %24, i8* %23, i32 6) nounwind readonly %.lobit.i = lshr i32 %25, 31 ; <i32> [#uses=1] %tmp.i = trunc i32 %.lobit.i to i8 ; <i8> [#uses=1] %toBool = icmp eq i8 %tmp.i, 0 ; <i1> [#uses=1] br i1 %toBool, label %bb3, label %bb4 which compiled to: call L_memcmp$stub shrl $31, %eax testb %al, %al jne LBB1_11 ## with this change, we compile it to: call L_memcmp$stub testl %eax, %eax js LBB1_11 This triggers all the time in common code, with patters like this: %169 = and i32 %ply, 1 ; <i32> [#uses=1] %170 = trunc i32 %169 to i8 ; <i8> [#uses=1] %toBool = icmp ne i8 %170, 0 ; <i1> [#uses=1] %7 = lshr i32 %6, 24 ; <i32> [#uses=1] %9 = trunc i32 %7 to i8 ; <i8> [#uses=1] %10 = icmp ne i8 %9, 0 ; <i1> [#uses=1] etc llvm-svn: 61985	2009-01-09 07:47:06 +00:00
Chris Lattner	482eb70a10	Fix PR3298, a crash in Jump Threading. Apparently even jump threading can have bugs, who knew? ;-) llvm-svn: 61983	2009-01-09 06:08:12 +00:00
Chris Lattner	d48d1ec320	this doesn't depend on the gcc early inliner anymore. llvm-svn: 61982	2009-01-09 05:49:27 +00:00
Chris Lattner	7f88a1b512	PR3290 is now fixed. llvm-svn: 61981	2009-01-09 05:46:19 +00:00
Chris Lattner	fef138b140	Fix part 3/2 of PR3290, making instcombine zap (gep(bitcast)) when possible. llvm-svn: 61980	2009-01-09 05:44:56 +00:00
Chris Lattner	9170731cb7	this test should not run opt -std-compile-opts, it should run just llc. llvm-svn: 61979	2009-01-09 05:32:00 +00:00
Dale Johannesen	b48fc71fc6	Do not inline functions with (dynamic) alloca into functions that don't already have a (dynamic) alloca. Dynamic allocas cause inefficient codegen and we shouldn't propagate this (behavior follows gcc). Two existing tests assumed such inlining would be done; they are hacked by adding an alloca in the caller, preserving the point of the tests. llvm-svn: 61946	2009-01-08 21:45:23 +00:00
Chris Lattner	f3e696bc5a	ValueTracker can't assume that an alloca with no specified alignment will get its preferred alignment. It has to be careful and cautiously assume it will just get the ABI alignment. This prevents instcombine from rounding up the alignment of a load/store without adjusting the alignment of the alloca. llvm-svn: 61934	2009-01-08 19:28:38 +00:00
Chris Lattner	a2ed32eb4f	this testcase is huge and hasn't regressed ever, I don't think it is worth keeping. llvm-svn: 61931	2009-01-08 19:01:45 +00:00
Chris Lattner	55927bdccd	the new scalarrepl changes are optimizing away a temporary alloca in check242, which invalidates this test. This test is an x86-32 ABI test that is trying to be run in a target-independent way, which is not going to work very well. Just remove the test. llvm-svn: 61921	2009-01-08 07:58:23 +00:00
Chris Lattner	c518dfd11b	This implements the second half of the fix for PR3290, handling loads from allocas that cover the entire aggregate. This handles some memcpy/byval cases that are produced by llvm-gcc. This triggers a few times in kc++ (with std::pair<std::_Rb_tree_const_iterator <kc::impl_abstract_phylum*>,bool>) and once in 176.gcc (with %struct..0anon). llvm-svn: 61915	2009-01-08 05:42:05 +00:00
Misha Brukman	b51cdfadda	Fix off-by-one error in traversing an array; this fixes a test. The error was reported by gcc-4.3.0 during compilation. llvm-svn: 61896	2009-01-07 23:07:29 +00:00
Duncan Sands	289f59f233	Remove alloca tracking from nocapture analysis. Not only was it not very helpful, it was also wrong! The problem is shown in the testcase: the alloca might be passed to a nocapture callee which dereferences it and returns the original pointer. But because it was a nocapture call we think we don't need to track its uses, but we do. llvm-svn: 61876	2009-01-07 19:39:06 +00:00
Chris Lattner	f2b8c82ad1	Implement the first half of PR3290: if there is a store of an integer to a (transitive) bitcast the alloca and if that integer has the full size of the alloca, then it clobbers the whole thing. Handle this by extracting pieces out of the stored integer and filing them away in the SROA'd elements. This triggers fairly frequently because the CFE uses integers to pass small structs by value and the inliner exposes these. For example, in kimwitu++, I see a bunch of these with i64 stores to "%struct.std::pair<std::_Rb_tree_const_iterator<kc::impl_abstract_phylum*>,bool>" In 176.gcc I see a few i32 stores to "%struct..0anon". In the testcase, this is a difference between compiling test1 to: _test1: subl $12, %esp movl 20(%esp), %eax movl %eax, 4(%esp) movl 16(%esp), %eax movl %eax, (%esp) movl (%esp), %eax addl 4(%esp), %eax addl $12, %esp ret vs: _test1: movl 8(%esp), %eax addl 4(%esp), %eax ret The second half of this will be to handle loads of the same form. llvm-svn: 61853	2009-01-07 08:11:13 +00:00
Evan Cheng	f6768bd9cb	The coalescer does not coalesce a virtual register to a physical register if any of the physical register's sub-register live intervals overlaps with the virtual register. This is overly conservative. It prevents a extract_subreg from being coalesced away: v1024 = EDI // not killed = = EDI One possible solution is for the coalescer to examine the sub-register live intervals in the same manner as the physical register. Another possibility is to examine defs and uses (when needed) of sub-registers. Both solutions are too expensive. For now, look for "short virtual intervals" and scan instructions to look for conflict instead. This is a small win on x86-64. e.g. It shaves 403.gcc by ~80 instructions. llvm-svn: 61847	2009-01-07 02:08:57 +00:00
Chris Lattner	4687432d03	add a testcase. llvm-svn: 61845	2009-01-07 01:48:08 +00:00
Dan Gohman	8e8d1da35a	Add patterns to match conditional moves with loads folded into their left operand, rather than their right. Do this by commuting the operands and inverting the condition. llvm-svn: 61842	2009-01-07 01:00:24 +00:00
Dan Gohman	33e6fcd56f	X86_COND_C and X86_COND_NC are alternate mnemonics for X86_COND_B and X86_COND_AE, respectively. llvm-svn: 61835	2009-01-07 00:15:08 +00:00
Dan Gohman	44a3da6c4d	Now that fold-pcmpeqd-0.ll is effectively testing that scheduling helps avoid the need for spilling, add a new testcase that tests that the pcmpeqd used for V_SETALLONES is changed to a constant-pool load as needed. llvm-svn: 61831	2009-01-06 23:48:10 +00:00
Dan Gohman	beac19e299	Revert r42653 and forward-port the code that lets INC64_32r be converted to LEA64_32r in x86's convertToThreeAddress. This replaces code like this: movl %esi, %edi inc %edi with this: lea 1(%rsi), %edi which appears to be beneficial. llvm-svn: 61830	2009-01-06 23:34:46 +00:00
Dan Gohman	c7847cdb8d	Fix a bug in ComputeLinearIndex computation handling multi-level aggregate types. Don't increment the current index after reaching the end of a struct, as it will already be pointing at one-past-the end. This fixes PR3288. llvm-svn: 61828	2009-01-06 22:53:52 +00:00
Scott Michel	6887caf11c	CellSPU: - Fix bugs 3194, 3195: i128 load/stores produce correct code (although, we need to ensure that i128 is 16-byte aligned in real life), and 128 zero- extends are supported. - New td file: SPU128InstrInfo.td: this is where all new i128 support should be put in the future. - Continue to hammer on i64 operations and test cases; ensure that the only remaining problem will be i64 mul. llvm-svn: 61784	2009-01-06 03:36:14 +00:00
Dan Gohman	53c282cce8	Delete this test; it's a duplicate of 2006-07-03-schedulers.ll. llvm-svn: 61781	2009-01-06 01:36:23 +00:00
Dan Gohman	79c3516912	Use a latency value of 0 for the artificial edges inserted by AddPseudoTwoAddrDeps. This lets the scheduling infrastructure avoid recalculating node heights. In very large testcases this was a major bottleneck. Thanks to Roman Levenstein for finding this! As a side effect, fold-pcmpeqd-0.ll is now scheduled better and it no longer requires spilling on x86-32. llvm-svn: 61778	2009-01-06 01:19:04 +00:00
Chris Lattner	4e735eb157	make m_ConstantInt(int64_t) safely match ConstantInt's that are larger than i64. This fixes an instcombine crash on PR3235. llvm-svn: 61775	2009-01-05 23:45:50 +00:00
Bill Wendling	2012d84f01	Strength test. llvm-svn: 61755	2009-01-05 21:27:59 +00:00
Duncan Sands	582c53d147	Teach the internalize pass to also internalize global aliases. llvm-svn: 61754	2009-01-05 21:24:45 +00:00
Evan Cheng	8804293fe9	Find loop back edges only after empty blocks are eliminated. llvm-svn: 61752	2009-01-05 21:17:27 +00:00
Chris Lattner	84434a692b	testcase for bill's patch. llvm-svn: 61751	2009-01-05 21:07:34 +00:00
Duncan Sands	f5dbbae4f4	Delete unused global aliases with internal linkage. In fact this also deletes those with linkonce linkage, however this is currently dead because for the moment aliases aren't allowed to have this linkage type. llvm-svn: 61742	2009-01-05 20:37:33 +00:00

... 4 5 6 7 8 ...

6730 Commits