llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	f3d783a6d2	Temporarily disable some failing tests, until they can be properly investigated. llvm-svn: 110808	2010-08-11 15:09:00 +00:00
Bill Wendling	6a98131468	Consider this code snippet: float t1(int argc) { return (argc == 1123) ? 1.234f : 2.38213f; } We would generate truly awful code on ARM (those with a weak stomach should look away): _t1: movw r1, #1123 movs r2, #1 movs r3, #0 cmp r0, r1 mov.w r0, #0 it eq moveq r0, r2 movs r1, #4 cmp r0, #0 it ne movne r3, r1 adr r0, #LCPI1_0 ldr r0, [r0, r3] bx lr The problem was that legalization was creating a cascade of SELECT_CC nodes, for for the comparison of "argc == 1123" which was fed into a SELECT node for the ?: statement which was itself converted to a SELECT_CC node. This is because the ARM back-end doesn't have custom lowering for SELECT nodes, so it used the default "Expand". I added a fairly simple "LowerSELECT" to the ARM back-end. It takes care of this testcase, but can obviously be expanded to include more cases. Now we generate this, which looks optimal to me: _t1: movw r1, #1123 movs r2, #0 cmp r0, r1 adr r0, #LCPI0_0 it eq moveq r2, #4 ldr r0, [r0, r2] bx lr .align 2 LCPI0_0: .long 1075344593 @ float 2.382130e+00 .long 1067316150 @ float 1.234000e+00 llvm-svn: 110799	2010-08-11 08:43:16 +00:00
Evan Cheng	5190f09291	Report error if codegen tries to instantiate a ARM target when the cpu does support it. e.g. cortex-m* processors. llvm-svn: 110798	2010-08-11 07:17:46 +00:00
Evan Cheng	40921a4e62	Add ARM Archv6M and let it implies FeatureDB (having dmb, etc.) llvm-svn: 110795	2010-08-11 06:51:54 +00:00
Daniel Dunbar	188b47b214	MC/ARM: Add basic support for handling predication by parsing it out of the mnemonic into a separate operand form. llvm-svn: 110794	2010-08-11 06:37:20 +00:00
Evan Cheng	49e02fc414	Add Cortex-M0 support. It's a ARMv6m device (no ARM mode) with some 32-bit instructions: dmb, dsb, isb, msr, and mrs. llvm-svn: 110786	2010-08-11 06:30:38 +00:00
Evan Cheng	6e809de90c	- Add subtarget feature -mattr=+db which determine whether an ARM cpu has the memory and synchronization barrier dmb and dsb instructions. - Change instruction names to something more sensible (matching name of actual instructions). - Added tests for memory barrier codegen. llvm-svn: 110785	2010-08-11 06:22:01 +00:00
Bill Wendling	79937dfc5b	Update test to match output of optimize compares for ARM. llvm-svn: 110765	2010-08-11 01:05:02 +00:00
Dan Gohman	f7495f286a	When analyzing loop exit conditions combined with and and or, don't make any assumptions about when the two conditions will agree on when to permit the loop to exit. This fixes PR7845. llvm-svn: 110758	2010-08-11 00:12:36 +00:00
Bill Wendling	871d4e1170	The optimize comparisons pass removes the "cmp" instruction this is checking for. llvm-svn: 110739	2010-08-10 22:16:05 +00:00
Nate Begeman	3ec892c167	Add test for recent instcombine vector shuffle enhancement llvm-svn: 110737	2010-08-10 21:58:00 +00:00
Daniel Dunbar	18cc4acb00	tests: Don't error out if HOME isn't present in t the environment. llvm-svn: 110711	2010-08-10 19:36:25 +00:00
Evan Cheng	3f251fb26e	Re-apply r110655 with fixes. Epilogue must restore sp from fp if the function stack frame has a var-sized object. Also added a test case to check for the added benefit of this patch: it's optimizing away the unnecessary restore of sp from fp for some non-leaf functions. llvm-svn: 110707	2010-08-10 19:30:19 +00:00
Daniel Dunbar	0dd47bfca3	Revert r110655, "Fix ARM hasFP() semantics. It should return true whenever FP register is", it breaks a couple test-suite tests. llvm-svn: 110701	2010-08-10 18:32:02 +00:00
Daniel Dunbar	d215976208	MC/AsmParser: Fix a bug in macro argument parsing, which was dropping parentheses from argument lists. llvm-svn: 110692	2010-08-10 17:38:52 +00:00
Jakob Stoklund Olesen	5730846c2f	Fix test for more architectures. Patch by Tobias Grosser. llvm-svn: 110685	2010-08-10 16:48:24 +00:00
Tobias Grosser	7fbe6cb429	RegionInfo: Do not assert if a BB is not part of the dominance tree. llvm-svn: 110665	2010-08-10 09:54:35 +00:00
Tobias Grosser	fedeff8015	Fix failing testcase. Those look like typos to me. llvm-svn: 110664	2010-08-10 09:54:29 +00:00
Devang Patel	b219746c80	Handle TAG_constant for integers. llvm-svn: 110656	2010-08-10 07:11:13 +00:00
Evan Cheng	8d5d1c1331	Fix ARM hasFP() semantics. It should return true whenever FP register is reserved, not available for general allocation. This eliminates all the extra checks for Darwin. This change also fixes the use of FP to access frame indices in leaf functions and cleaned up some confusing code in epilogue emission. llvm-svn: 110655	2010-08-10 06:26:49 +00:00
Eli Friedman	f99e7e6643	PR7853: fix a silly mistake introduced in r101899, and add a test to make sure it doesn't regress again. llvm-svn: 110597	2010-08-09 20:49:43 +00:00
Kalle Raiskila	999da1f3a0	Have SPU handle halfvec stores aligned by 8 bytes. llvm-svn: 110576	2010-08-09 16:33:00 +00:00
Rafael Espindola	cc4a9670d3	XFAIL for mingw that has no plugins. llvm-svn: 110574	2010-08-09 15:14:06 +00:00
Nick Lewycky	7f36ac54d7	Reject unrepresentable pointer types in intrinsics. Fixes PR7316. llvm-svn: 110541	2010-08-08 06:12:09 +00:00
Rafael Espindola	8aa19b05ee	Use %shlibext instead of .so llvm-svn: 110529	2010-08-08 00:55:59 +00:00
Rafael Espindola	92a4a833f9	Move the bugpoint test passes to a plugin in preparation for having bugpoint use opt. llvm-svn: 110520	2010-08-07 21:48:09 +00:00
Dale Johannesen	a3bd31a923	Use sdmem and sse_load_f64 (etc.) for the vector form of CMPSD (etc.) Matching a 128-bit memory operand is wrong, the instruction uses only 64 bits (same as ADDSD etc.) 8193553. llvm-svn: 110491	2010-08-07 00:33:42 +00:00
Stuart Hastings	5afa738d7f	Test case for r110459. Radar 8264751. Test case by Fariborz Jahanian! llvm-svn: 110467	2010-08-06 19:02:24 +00:00
Dan Gohman	e68958fcdf	Implement a proper getModRefInfo for va_arg. llvm-svn: 110458	2010-08-06 18:24:38 +00:00
Rafael Espindola	027d5bcf89	Fix eabi calling convention when a 64 bit value shadows r3. Without this what was happening was: * R3 is not marked as "used" * ARM backend thinks it has to save it to the stack because of vaarg * Offset computation correctly ignores it * Offsets are wrong llvm-svn: 110446	2010-08-06 15:35:32 +00:00
Eric Christopher	e1fb772aa5	Add an option to always emit realignment code for a particular module. llvm-svn: 110404	2010-08-05 23:57:43 +00:00
Dan Gohman	884dd752c3	Implement AccessesArguments checking in the two-callsite form of BasicAA::getModRefInfo. This allows BasicAA to say that two memset calls to non-aliasing memory locations don't interfere. llvm-svn: 110393	2010-08-05 23:34:50 +00:00
Dan Gohman	26ef7c7ab7	Fix memdep's code for reasoning about dependences between two calls. A Ref response from getModRefInfo is not useful here. Instead, check for identical calls only in the NoModRef case. Reapply r110270, and strengthen it to compensate for the memdep changes. When both calls are readonly, there is no dependence between them. llvm-svn: 110382	2010-08-05 22:09:15 +00:00
Devang Patel	cc3f3b341d	Move x86 specific tests into test/CodeGen/X86. llvm-svn: 110372	2010-08-05 20:25:37 +00:00
Bob Wilson	72de307116	Add an ARM RSCrr instruction for disassembly only. Partial fix for PR7792. llvm-svn: 110361	2010-08-05 18:59:36 +00:00
Bob Wilson	adb93e56a3	Add an ARM RSBrr instruction for disassembly only. Partial fix for PR7792. llvm-svn: 110358	2010-08-05 18:23:43 +00:00
Dan Gohman	c53ee449a5	Move x86-specific tests out of test/Transforms/LoopStrengthReduce and into test/CodeGen/X86, so that they aren't run when the x86 target is not enabled. Fix uglygep.ll to not be x86-specific. llvm-svn: 110343	2010-08-05 17:04:15 +00:00
Daniel Dunbar	e62e664656	tests: CodeGen/X86/GC tests require X86. llvm-svn: 110338	2010-08-05 15:45:33 +00:00
Daniel Dunbar	57e3f71538	tests: Mark MC/AsmParser tests as requiring x86 for now -- almost all of them rely on using a specific x86 triple to test what they want to test. llvm-svn: 110337	2010-08-05 15:44:15 +00:00
Rafael Espindola	5bca58a290	check-lit was failing again on F13 64 bits :-( llvm-svn: 110311	2010-08-05 03:35:01 +00:00
Dan Gohman	554b012f67	Revert r110270 for now. It appears to uncover a memdep bug. llvm-svn: 110293	2010-08-05 00:43:10 +00:00
Bob Wilson	97886d59d1	ARM "rrx" shift operands do not have an immediate. PR7790. llvm-svn: 110292	2010-08-05 00:34:42 +00:00
Dan Gohman	109561845b	The trouble with testing for "ModRef" and "NoModRef" is that one is a suffix of the other, and FileCheck accepts superstrings. Adjust the output to avoid this problem. llvm-svn: 110280	2010-08-04 23:37:55 +00:00
Bill Wendling	ca1cb13646	The lower invoke pass needs to have unreachable code elimination run after it because it could create such things. This fixes a MingW buildbot test failure. llvm-svn: 110279	2010-08-04 23:36:02 +00:00
Dan Gohman	bd33dab633	The two-callsite form of AliasAnalysis::getModRefInfo is documented to return Ref if the left callsite only reads memory read or written by the right callsite; fix BasicAliasAnalysis to implement this. Add AliasAnalysisEvaluator support for testing the two-callsite form of getModRefInfo. llvm-svn: 110270	2010-08-04 22:56:29 +00:00
Eli Friedman	39d0f57cab	PR7814: Truncates cannot be ignored for signed comparisons. llvm-svn: 110268	2010-08-04 22:40:58 +00:00
Stuart Hastings	49af1ebf2e	Test case for r110250. Radar 8264670. Test case by Fariborz Jahanian! llvm-svn: 110254	2010-08-04 22:05:38 +00:00
Bill Wendling	26feb849a4	Testcase for r110248. llvm-svn: 110249	2010-08-04 21:56:30 +00:00
Devang Patel	5c1f56b78f	Test case for combination of r110234 & r110235. llvm-svn: 110238	2010-08-04 18:42:46 +00:00
Dan Gohman	6786a04d0d	These tests are no longer stored in CVS. llvm-svn: 110201	2010-08-04 15:58:01 +00:00
Stuart Hastings	cba0d06b7c	call-imm.ll test case regex fix. Patch by Dimitry Andric! llvm-svn: 110199	2010-08-04 15:31:35 +00:00
Kalle Raiskila	8b2f70125f	Make SPU backend handle insertelement and store for "half vectors" llvm-svn: 110198	2010-08-04 13:59:48 +00:00
Bob Wilson	79daf7e0ae	Combine NEON VABD (absolute difference) intrinsics with ADDs to make VABA (absolute difference with accumulate) intrinsics. Radar 8228576. llvm-svn: 110170	2010-08-04 00:12:08 +00:00
Dan Gohman	3619660529	Make instcombine set explicit alignments on load or store instructions with alignment 0, so that subsequent passes don't need to bother checking the TargetData ABI size manually. llvm-svn: 110128	2010-08-03 18:20:32 +00:00
Jakob Stoklund Olesen	011ff9bec9	OK, that's it. This test is going away now. But don't worry, I am taking it to a nice farm in the country where it can play with other tests. And bunnies. It is not clear what is being tested, and the revision history shows a bunch of random changes to the expected instruction count. Clearly, we are just fudging it to pass whenever it fails. llvm-svn: 110118	2010-08-03 17:21:14 +00:00
Peter Collingbourne	ddaaf40d24	Add an atomic lowering pass llvm-svn: 110113	2010-08-03 16:19:16 +00:00
Michael J. Spencer	54cfd42c33	MC: Fix symbol fragment offsets in COFF. Patch by Cameron Esfahani! llvm-svn: 110104	2010-08-03 05:02:46 +00:00
Michael J. Spencer	d32764c8a0	Revert "MC: Fix symbol fragment offsets in COFF." This reverts commit r110100 Wrong path caps. llvm-svn: 110103	2010-08-03 04:53:28 +00:00
Michael J. Spencer	cf3d8b4ec4	MC: Fix symbol fragment offsets in COFF. Patch by Cameron Esfahani! llvm-svn: 110100	2010-08-03 04:43:24 +00:00
Stuart Hastings	460a356bf6	Diabolical hack to make a test compatible with clang. (Thanks to Dale!) Radar 8246180. llvm-svn: 110081	2010-08-02 23:29:03 +00:00
Dan Gohman	d8968da2c5	Add a lint check for indirectbr with no successors. llvm-svn: 110074	2010-08-02 23:06:43 +00:00
Stuart Hastings	0e6e8858ff	Testcase for r110043. Radar 8246180. llvm-svn: 110070	2010-08-02 22:09:53 +00:00
Kalle Raiskila	77558b7d13	More SPU v2f32 stuff added: insertelement and shuffle. llvm-svn: 110038	2010-08-02 11:22:10 +00:00
Kalle Raiskila	68b3886678	Add preliminary v2f32 support for SPU. Like with v2i32, we just duplicate the instructions and operate on half vectors. Also reorder code in SPUInstrInfo.td for better coherency. llvm-svn: 110037	2010-08-02 10:25:47 +00:00
Owen Anderson	8f306a779b	Re-apply the infamous r108614, with a fix pointed out by Dirk Steinke. llvm-svn: 110036	2010-08-02 09:32:13 +00:00
Kalle Raiskila	622f8eb981	Add preliminary v2i32 support for SPU backend. As there are no such registers in SPU, this support boils down to "emulating" them by duplicating instructions on the general purpose registers. This adds the most basic operations on v2i32: passing parameters, addition, subtraction, multiplication and a few others. llvm-svn: 110035	2010-08-02 08:54:39 +00:00
Daniel Dunbar	1465d7cffa	Fix comment. llvm-svn: 110006	2010-08-02 01:25:20 +00:00
Daniel Dunbar	5eeae48783	tests: Kill off custom targets which were just there for TestRunner.sh. llvm-svn: 110003	2010-08-02 00:52:44 +00:00
Daniel Dunbar	4b77d23d40	tests: Deprecate TestRunner.sh, and have it just invoke 'llvm-lit' (which will need to be in your path). Please move to using 'llvm-lit' if you are still using TestRunner.sh. llvm-svn: 110002	2010-08-02 00:52:41 +00:00
Eli Friedman	7595ce05a2	PR7781: Fix incorrect shifting in PPCTargetLowering::LowerBUILD_VECTOR. llvm-svn: 109998	2010-08-02 00:18:19 +00:00
Daniel Dunbar	b1af605e58	tests: Make 'lit' the default test tool. You can still use 'make check-dg' to run the tests using DejaGNU, but not for much longer. This is a last call for DejaGNU supporters, if no one complains soon the DejaGNU support is going to die. llvm-svn: 109997	2010-08-02 00:05:18 +00:00
Eli Friedman	1b2bc1b844	PR7774: Fix undefined shifts in Alpha backend. As a bonus, this actually improves the generated code in some cases. llvm-svn: 109985	2010-08-01 21:13:28 +00:00
Bob Wilson	66161f5eb4	Revert new AVX intrinsic tests. They are breaking buildbots and Bruno is away from a computer now. --- Reverse-merging r109881 into '.': D test/CodeGen/X86/avx-intrinsics-x86.ll D test/CodeGen/X86/avx-intrinsics-x86_64.ll llvm-svn: 109959	2010-07-31 22:36:03 +00:00
Daniel Dunbar	0b636a24c7	Speculatively revert r108614, "Another attempt at getting the clang self-host to like my instcombine patch.", in an attempt to fix Clang i386 bootstrap. - Also PR7719. llvm-svn: 109953	2010-07-31 19:51:11 +00:00
Bob Wilson	cd5fc7bef1	Add support for disassembling VMVN (immediate) instructions. PR7747. llvm-svn: 109946	2010-07-31 05:57:44 +00:00
Dale Johannesen	cf0287e56d	PPC doesn't supported VLA with large alignment. This was formerly rejected by the FE, so asserted in the BE; now the FE only warns, so we treat it as a legitimate fatal error in PPC BE. This means the test for the feature won't pass, so it's xfail'd. llvm-svn: 109892	2010-07-30 21:09:48 +00:00
Bruno Cardoso Lopes	92941fdb26	A bunch of tests for AVX intrinsics llvm-svn: 109881	2010-07-30 19:57:56 +00:00
Bob Wilson	964179cb58	Attempt to fix the llvm-gcc-powerpc-darwin9 buildbot. llvm-svn: 109876	2010-07-30 18:52:47 +00:00
Eli Friedman	ffe64c06ef	Fix for bug reported by Evzen Muller on llvm-commits: make sure to correctly check the range of the constant when optimizing a comparison between a constant and a sign_extend_inreg node. llvm-svn: 109854	2010-07-30 06:44:31 +00:00
Jim Grosbach	d343166a0b	Many Thumb2 instructions can reference the full ARM register set (i.e., have 4 bits per register in the operand encoding), but have undefined behavior when the operand value is 13 or 15 (SP and PC, respectively). The trivial coalescer in linear scan sometimes will merge a copy from SP into a subsequent instruction which uses the copy, and if that instruction cannot legally reference SP, we get bad code such as: mls r0,r9,r0,sp instead of: mov r2, sp mls r0, r9, r0, r2 This patch adds a new register class for use by Thumb2 that excludes the problematic registers (SP and PC) and is used instead of GPR for those operands which cannot legally reference PC or SP. The trivial coalescer explicitly requires that the register class of the destination for the COPY instruction contain the source register for the COPY to be considered for coalescing. This prevents errant instructions like that above. PR7499 llvm-svn: 109842	2010-07-30 02:41:01 +00:00
Eric Christopher	2e276485cb	Fix this up per llvm-gcc r109819. llvm-svn: 109820	2010-07-29 23:20:29 +00:00
Benjamin Kramer	d9624e2d2e	Remove XFAIL, test doesn't leak anymore. llvm-svn: 109801	2010-07-29 20:36:36 +00:00
Dale Johannesen	2bff50546c	Implement vector constants which are splat of integers with mov + vdup. 8003375. This is currently disabled by default because LICM will not hoist a VDUP, so it pessimizes the code if the construct occurs inside a loop (8248029). llvm-svn: 109799	2010-07-29 20:10:08 +00:00
Dan Gohman	390914cbe8	Make GlobalValue alignment consistent with load, store, and alloca alignment, fixing silent truncation of alignment values. llvm-svn: 109653	2010-07-28 20:56:48 +00:00
Dan Gohman	a7e5a24093	Define a maximum supported alignment value for load, store, and alloca instructions (constrained by their internal encoding), and add error checking for it. Fix an instcombine bug which generated huge alignment values (null is infinitely aligned). This fixes undefined behavior noticed by John Regehr. llvm-svn: 109643	2010-07-28 20:12:04 +00:00
Nate Begeman	53afc8f06a	Implement a vectorized algorithm for <16 x i8> << <16 x i8> This is about 4x faster and smaller than the existing scalarization. llvm-svn: 109566	2010-07-28 00:21:48 +00:00
Stuart Hastings	a7f1d4a2ba	Testcase for r109556. Radar 8198362. llvm-svn: 109557	2010-07-27 23:15:25 +00:00
Nate Begeman	269a6da023	~40% faster vector shl <4 x i32> on SSE 4.1 Larger improvements for smaller types coming in future patches. For: define <2 x i64> @shl(<4 x i32> %r, <4 x i32> %a) nounwind readnone ssp { entry: %shl = shl <4 x i32> %r, %a ; <<4 x i32>> [#uses=1] %tmp2 = bitcast <4 x i32> %shl to <2 x i64> ; <<2 x i64>> [#uses=1] ret <2 x i64> %tmp2 } We get: _shl: ## @shl pslld $23, %xmm1 paddd LCPI0_0, %xmm1 cvttps2dq %xmm1, %xmm1 pmulld %xmm1, %xmm0 ret Instead of: _shl: ## @shl pshufd $3, %xmm0, %xmm2 movd %xmm2, %eax pshufd $3, %xmm1, %xmm2 movd %xmm2, %ecx shll %cl, %eax movd %eax, %xmm2 pshufd $1, %xmm0, %xmm3 movd %xmm3, %eax pshufd $1, %xmm1, %xmm3 movd %xmm3, %ecx shll %cl, %eax movd %eax, %xmm3 punpckldq %xmm2, %xmm3 movd %xmm0, %eax movd %xmm1, %ecx shll %cl, %eax movd %eax, %xmm2 movhlps %xmm0, %xmm0 movd %xmm0, %eax movhlps %xmm1, %xmm1 movd %xmm1, %ecx shll %cl, %eax movd %eax, %xmm0 punpckldq %xmm0, %xmm2 movdqa %xmm2, %xmm0 punpckldq %xmm3, %xmm0 ret llvm-svn: 109549	2010-07-27 22:37:06 +00:00
Devang Patel	bd32256e25	Update tests to not rely on input file's absolute path. llvm-svn: 109521	2010-07-27 18:13:53 +00:00
Nate Begeman	317b969ac5	Fix a crash in the dag combiner caused by ConstantFoldBIT_CONVERTofBUILD_VECTOR calling itself recursively and returning a SCALAR_TO_VECTOR node, but assuming the input was always a BUILD_VECTOR. llvm-svn: 109519	2010-07-27 18:02:18 +00:00
Tobias Grosser	731b079edb	Make coff-dump.py executable and add python as executable for this script. This fixes the MC/COFF/basic-coff.ll test case. llvm-svn: 109497	2010-07-27 09:01:26 +00:00
Michael J. Spencer	f8270bdb2d	Make MC use Windows COFF on Windows and add tests. llvm-svn: 109494	2010-07-27 06:46:15 +00:00
Anton Korobeynikov	6bcea068db	Currently EH lowering code expects typeinfo to be global only. This assumption is not satisfied due to global mergeing. Workaround the issue by temporary disablinge mergeing of const globals. Also, ignore LLVM "special" globals. This fixes PR7716 llvm-svn: 109423	2010-07-26 18:45:39 +00:00
Owen Anderson	bb4c4b59a4	Fix a test with malformed IR. Not sure why this didn't fail before. llvm-svn: 109422	2010-07-26 18:44:56 +00:00
Dan Gohman	cd83870faf	Fix SCEVExpander::visitAddRecExpr so that it remembers the induction variable it inserted rather than using LoopInfo::getCanonicalInductionVariable to rediscover it, since that doesn't work on non-canonical loops. This fixes infinite recurrsion on such loops; PR7562. llvm-svn: 109419	2010-07-26 18:28:14 +00:00
Dan Gohman	b0961f2443	Avoid depending on LCSSA implicitly pulling in LoopSimplify. llvm-svn: 109410	2010-07-26 18:00:43 +00:00
Bruno Cardoso Lopes	306a1f9721	Support x86 "eiz" and "riz" pseudo index registers in the assembler. llvm-svn: 109295	2010-07-24 00:06:39 +00:00
Matt Fleming	fbd7f65248	Consolidate the ELF section directive tests into a single file as suggested by Chris Lattner. llvm-svn: 109290	2010-07-23 23:40:41 +00:00
Evan Cheng	df907f4594	- Allow target to specify when is register pressure "too high". In most cases, it's too late to start backing off aggressive latency scheduling when most of the registers are in use so the threshold should be a bit tighter. - Correctly handle live out's and extract_subreg etc. - Enable register pressure aware scheduling by default for hybrid scheduler. For ARM, this is almost always a win on # of instructions. It's runtime neutral for most of the tests. But for some kernels with high register pressure it can be a huge win. e.g. 464.h264ref reduced number of spills by 54 and sped up by 20%. llvm-svn: 109279	2010-07-23 22:39:59 +00:00
Bruno Cardoso Lopes	6f38011196	Move AVX encoding tests to different files llvm-svn: 109269	2010-07-23 21:25:26 +00:00
Dan Gohman	55e244698a	Use the proper type for shift counts. This fixes a bootstrap error. llvm-svn: 109265	2010-07-23 21:08:12 +00:00
Stuart Hastings	caf8e3a2db	Test case to insure template function declaration refers to correct filename. Radar 8063111. llvm-svn: 109258	2010-07-23 20:15:49 +00:00
Bruno Cardoso Lopes	ea0e05a3ce	Add AVX version of CLMUL instructions llvm-svn: 109248	2010-07-23 18:41:12 +00:00
Dan Gohman	0818684a70	DAGCombine (shl (anyext x, c)) to (anyext (shl x, c)) if the high bits are not demanded. This often allows the anyext to be folded away. llvm-svn: 109242	2010-07-23 18:03:30 +00:00
Bruno Cardoso Lopes	acd9230b1b	Add complete assembler support for FMA3 instructions, with descriptions and encodings taken from the AVX manual llvm-svn: 109204	2010-07-23 00:54:35 +00:00
Bruno Cardoso Lopes	0710c74f29	Add remaining AVX instructions (most of them dealing with GR64 destinations. This complete the assembler support for the general AVX ISA. But we still miss instructions from FMA3 and CLMUL specific feature flags, which are now the next step llvm-svn: 109168	2010-07-22 21:18:49 +00:00
Tobias Grosser	336734aca6	Add new RegionInfo pass. The RegionInfo pass detects single entry single exit regions in a function, where a region is defined as any subgraph that is connected to the remaining graph at only two spots. Furthermore an hierarchical region tree is built. Use it by calling "opt -regions analyze" or "opt -view-regions". llvm-svn: 109089	2010-07-22 07:46:31 +00:00
Eric Christopher	9a77382685	Custom lower the memory barrier instructions and add support for lowering without sse2. Add a couple of new testcases. Fixes a few libgomp tests and latent bugs. Remove a few todos. llvm-svn: 109078	2010-07-22 02:48:34 +00:00
Evan Cheng	285903853f	More register pressure aware scheduling work. llvm-svn: 109064	2010-07-21 23:53:58 +00:00
Bruno Cardoso Lopes	e3acfd4d58	Add more 256-bit forms for a bunch of regular AVX instructions Add 64-bit (GR64) versions of some instructions (which are not described in their SSE forms, but are described in AVX) llvm-svn: 109063	2010-07-21 23:53:50 +00:00
Eric Christopher	84bdfd80df	Baby steps towards ARM fast-isel. llvm-svn: 109047	2010-07-21 22:26:11 +00:00
Bruno Cardoso Lopes	6238c1d102	Add missing AVX convert instructions. Those instructions are not described in their SSE forms (although they exist), but add the AVX forms anyway, so the assembler can benefit from it llvm-svn: 109039	2010-07-21 21:37:59 +00:00
Dan Gohman	093cb79d4b	Disallow null as a named metadata operand. Make MDNode::destroy private. Fix the one thing that used MDNode::destroy, outside of MDNode itself. One should never delete or destroy an MDNode explicitly. MDNodes implicitly go away when there are no references to them (implementation details aside). llvm-svn: 109028	2010-07-21 18:54:18 +00:00
Rafael Espindola	4277e14dc4	Fix calling convention on ARM if vfp2+ is enabled. llvm-svn: 109009	2010-07-21 11:38:30 +00:00
Bruno Cardoso Lopes	cdbec62510	Add AVX only vzeroall and vzeroupper instructions llvm-svn: 109002	2010-07-21 08:56:24 +00:00
Eric Christopher	690aa72437	Turn this test on again after the llvm-gcc change in r108986. llvm-svn: 108987	2010-07-21 04:54:06 +00:00
Eric Christopher	8d95d26eb1	Update this to use a "valid" alignment. llvm-svn: 108985	2010-07-21 04:51:24 +00:00
Bruno Cardoso Lopes	3499934da6	Add new AVX vpermilps, vpermilpd and vperm2f128 instructions llvm-svn: 108984	2010-07-21 03:07:42 +00:00
Bruno Cardoso Lopes	3ceaf7a0a2	Add new AVX vmaskmov instructions, and also fix the VEX encoding bits to support it llvm-svn: 108983	2010-07-21 02:46:58 +00:00
Bruno Cardoso Lopes	e706501975	Add new AVX vextractf128 instructions llvm-svn: 108964	2010-07-20 23:19:02 +00:00
Matt Fleming	c3eb5e3d4b	Include some tests for the recently committed ELF section directive handlers. llvm-svn: 108938	2010-07-20 21:37:30 +00:00
Eric Christopher	3f696ff489	Testcase for llvm-gcc commit r108910. llvm-svn: 108918	2010-07-20 20:32:47 +00:00
Bruno Cardoso Lopes	3b505848fd	Add new AVX instruction vinsertf128 llvm-svn: 108892	2010-07-20 19:44:51 +00:00
Dan Gohman	625fd2292d	Fix SCEV denormalization of expressions where the exit value from one loop is involved in the increment of an addrec for another loop. This fixes rdar://8168938. llvm-svn: 108863	2010-07-20 17:06:20 +00:00
Jim Grosbach	badf087e45	update tests for smarter BIC usage llvm-svn: 108846	2010-07-20 16:16:48 +00:00
Duncan Sands	2e839de377	The same problem was being tracked in PR7652. llvm-svn: 108843	2010-07-20 15:52:32 +00:00
Bruno Cardoso Lopes	160695fecb	Fix PR7174, a couple o Mips fixes: - Fix a typo for PIC check during jmp table lowering - Also fix the "first jump table basic block is not considered only reachable by fall through" problem, use this ad-hoc solution until I come up with something better. Patch by stetorvs@gmail.com llvm-svn: 108820	2010-07-20 08:37:04 +00:00
Bruno Cardoso Lopes	ea7863647b	Fix Mips PR7473. Patch by stetorvs@gmail.com llvm-svn: 108816	2010-07-20 07:58:51 +00:00
Bruno Cardoso Lopes	6c8041ea34	x86_32 tests for vbroadcast llvm-svn: 108789	2010-07-20 00:11:50 +00:00
Bruno Cardoso Lopes	14c5fd437c	Add AVX vbroadcast new instruction llvm-svn: 108788	2010-07-20 00:11:13 +00:00
Bruno Cardoso Lopes	9de0ca73d4	Add 256-bit vaddsub, vhadd, vhsub, vblend and vdpp instructions! llvm-svn: 108769	2010-07-19 23:32:44 +00:00
Dan Gohman	b5e918dc05	After a custom inserter, in a block which has constant instructions, update the current basic block in addition to the current insert position, so that they remain consistent. This fixes rdar://8204072. llvm-svn: 108765	2010-07-19 22:48:56 +00:00
Daniel Dunbar	9db7d0addd	X86: Mark JMP{32,64}[mr] as requires 32-bit/64-bit mode. They are the same instruction, we only want to allow the one for the current subtarget. - This also fixes suffix matching for jmp instructions, because it eliminates the ambiguity between 'jmpl' and 'jmpq'. llvm-svn: 108746	2010-07-19 20:44:16 +00:00
Dale Johannesen	d4e389441d	Testcase for 108732 (8195660). llvm-svn: 108733	2010-07-19 18:22:40 +00:00
Devang Patel	18efced1a2	Fix PR 7662. Do not try to insert local variable info to a DIE used for function declaration. llvm-svn: 108731	2010-07-19 17:53:55 +00:00
Owen Anderson	3ccd81864f	Testcase for r108687. llvm-svn: 108689	2010-07-19 08:14:26 +00:00
Owen Anderson	9c271e2835	Remove r108639 now that it is handled by InstCombine instead. llvm-svn: 108688	2010-07-19 08:10:24 +00:00
Daniel Dunbar	9aefb8ee4c	X86-64: Mark WINCALL and more tail call instructions as code gen only. llvm-svn: 108685	2010-07-19 07:21:07 +00:00
Daniel Dunbar	b82cd9319b	MC/X86: We now match instructions like "incl %eax" correctly for the arch we are assembling; remove crufty custom cleanup code. llvm-svn: 108681	2010-07-19 06:14:54 +00:00
Daniel Dunbar	af75e1923c	tests: Force another triple. llvm-svn: 108666	2010-07-19 00:43:58 +00:00
Daniel Dunbar	3b4621103a	tests: Force triples. llvm-svn: 108658	2010-07-18 21:16:10 +00:00
Daniel Dunbar	40a564f09f	MC/AsmParser: Fix .abort and .secure_log_unique to accept arbitrary token sequences, not just strings. llvm-svn: 108655	2010-07-18 20:15:59 +00:00
Daniel Dunbar	6fb1c3ad8a	MC/AsmParser: Add macro argument substitution support. llvm-svn: 108654	2010-07-18 19:00:10 +00:00
Daniel Dunbar	4323571efb	MC/AsmParser: Add basic support for macro instantiation. llvm-svn: 108653	2010-07-18 18:54:11 +00:00
Daniel Dunbar	c1f58ec83c	MC/AsmParser: Add basic parsing support for .macro definitions. llvm-svn: 108652	2010-07-18 18:47:21 +00:00
Chris Lattner	ede90a2a58	daniel doesn't hate me, he hates macpython 2.5, which is a very reasonable position on life! llvm-svn: 108650	2010-07-18 18:42:18 +00:00
Daniel Dunbar	828984ff4e	MC/AsmParser: Add .macros_{off,on} support, not that makes sense since we don't support macros. llvm-svn: 108649	2010-07-18 18:38:02 +00:00
Owen Anderson	41670a11a8	Add a testcase for r108639. llvm-svn: 108640	2010-07-18 08:57:19 +00:00
Owen Anderson	7d2818b073	Another attempt at getting the clang self-host to like my instcombine patch. llvm-svn: 108614	2010-07-17 06:56:35 +00:00
Jim Grosbach	b97e2bbe32	Add combiner patterns to more effectively utilize the BFI (bitfield insert) instruction for non-constant operands. This includes the case referenced in the README.txt regarding a bitfield copy. llvm-svn: 108608	2010-07-17 03:30:54 +00:00
Eli Friedman	ceb16a5ce9	Test for ELF .size directive. llvm-svn: 108607	2010-07-17 03:15:24 +00:00
Jim Grosbach	11013eda5a	Add basic support to code-gen the ARM/Thumb2 bit-field insert (BFI) instruction and a combine pattern to use it for setting a bit-field to a constant value. More to come for non-constant stores. llvm-svn: 108570	2010-07-16 23:05:05 +00:00
Bill Wendling	bf8370ff36	Consider this function: void foo() { __builtin_unreachable(); } It will output the following on Darwin X86: _func1: Leh_func_begin0: pushq %rbp Ltmp0: movq %rsp, %rbp Ltmp1: Leh_func_end0: This prolog adds a new Call Frame Information (CFI) row to the FDE with an address that is not within the address range of the code it describes -- part is equal to the end of the function -- and therefore results in an invalid EH frame. If we emit a nop in this situation, then the CFI row is now within the address range. llvm-svn: 108568	2010-07-16 22:51:10 +00:00
Jakob Stoklund Olesen	c30b4ddc58	Remove the X86::FP_REG_KILL pseudo-instruction and the X86FloatingPointRegKill pass that inserted it. It is no longer necessary to limit the live ranges of FP registers to a single basic block. llvm-svn: 108536	2010-07-16 17:41:44 +00:00
Benjamin Kramer	50729ad717	Feed the right output into FileCheck. llvm-svn: 108523	2010-07-16 10:58:02 +00:00
Nick Lewycky	375efe3157	Arrays and vectors with different numbers of elements are not equivalent. llvm-svn: 108517	2010-07-16 06:31:12 +00:00
Tobias Grosser	3d84c9c793	LoopSimplify does not update domfrontier correctly. This fixes PR7649. llvm-svn: 108513	2010-07-16 05:59:45 +00:00
Jakob Stoklund Olesen	37c42a3d02	Remove many calls to TII::isMoveInstr. Targets should be producing COPY anyway. TII::isMoveInstr is going tobe completely removed. llvm-svn: 108507	2010-07-16 04:45:42 +00:00
Jakob Stoklund Olesen	b1671271ab	Add forgotten test case. llvm-svn: 108506	2010-07-16 04:45:35 +00:00
Dan Gohman	103c4ebea5	Use the source-order scheduler instead of the "fast" scheduler at -O0, because it's more likely to keep debug line information in its original order. llvm-svn: 108496	2010-07-16 02:01:19 +00:00
Eric Christopher	15a81cddb4	Also revert 108422, it's causing some test failures. Working on testcases for Owen. llvm-svn: 108494	2010-07-16 01:36:12 +00:00
Dan Gohman	c6eefe4d4e	Fix this test. llvm-svn: 108491	2010-07-16 01:28:45 +00:00
Dale Johannesen	bfd4fd7bb7	The SelectionDAGBuilder's handling of debug info, on rare occasions, caused code to be generated in a different order. All cases I've seen involved float softening in the type legalizer, and this could be perhaps be fixed there, but it's better not to generate things differently in the first place. 7797940 (6/29/2010..7/15/2010). llvm-svn: 108484	2010-07-16 00:02:08 +00:00
Bill Wendling	4bda1c8e68	Revert. This isn't the correct way to go. llvm-svn: 108478	2010-07-15 23:42:21 +00:00
Dan Gohman	fbbdfcaea7	Fix the order that SCEVExpander considers add operands in so that it doesn't miss an opportunity to form a GEP, regardless of the relative loop depths of the operands. This fixes rdar://8197217. llvm-svn: 108475	2010-07-15 23:38:13 +00:00
Bill Wendling	973dc3b1d8	Handle code gen for the unreachable instruction if it's the only instruction in the function. We'll just turn it into a "trap" instruction instead. The problem with not handling this is that it might generate a prologue without the equivalent epilogue to go with it: $ cat t.ll define void @foo() { entry: unreachable } $ llc -o - t.ll -relocation-model=pic -disable-fp-elim -unwind-tables .section __TEXT,__text,regular,pure_instructions .globl _foo .align 4, 0x90 _foo: ## @foo Leh_func_begin0: ## BB#0: ## %entry pushq %rbp Ltmp0: movq %rsp, %rbp Ltmp1: Leh_func_end0: ... The unwind tables then have bad data in them causing all sorts of problems. Fixes <rdar://problem/8096481>. llvm-svn: 108473	2010-07-15 23:32:40 +00:00
Evan Cheng	55f0c6b9fc	Split -enable-finite-only-fp-math to two options: -enable-no-nans-fp-math and -enable-no-infs-fp-math. All of the current codegen fp math optimizations only care whether the fp arithmetics arguments and results can never be NaN. llvm-svn: 108465	2010-07-15 22:07:12 +00:00
Chris Lattner	60b131654b	fix the definitions of ConstTextCoalSection/ConstDataCoalSection to keep "Text" in sync with the "pure instructions" section attribute. Lack of this attribute was preventing the assembler from emitting multibyte noops instructions for templates (and inlines, and other coalesced stuff) and was causing the assembler to mismatch .o files. This fixes rdar://8018335 llvm-svn: 108461	2010-07-15 21:22:00 +00:00
Devang Patel	df09db62e2	Fix crash reported in PR7653. llvm-svn: 108441	2010-07-15 18:45:27 +00:00
Dan Gohman	4afd412d6b	Watch out for a constant offset cancelling out a base register, forming a zero. This situation arrises in Fortran code with induction variables that start at 1 instead of 0. This fixes PR7651. llvm-svn: 108424	2010-07-15 15:14:45 +00:00
Owen Anderson	7151dfd48a	Reapply r108378, with bugfixes, testcase, and improved comment formatting. This now passes LIT, nighty test, and llvm-gcc bootstrap on my machine. llvm-svn: 108422	2010-07-15 15:00:23 +00:00
Chris Lattner	19eff2a9f6	Fix PR7647, handling the case when 'To' ends up being mutated by recursive simplification. This also enhances ReplaceAndSimplifyAllUses to actually do a real RAUW at the end of it, which updates any value handles pointing to "From" to start pointing to "To". This seems useful for debug info and random other VH users. llvm-svn: 108415	2010-07-15 06:36:08 +00:00
Chris Lattner	e985a63bbf	see comment. llvm-svn: 108409	2010-07-15 05:17:36 +00:00
Eric Christopher	25e72a8920	Temporarily disable this test. llvm-svn: 108371	2010-07-14 23:12:58 +00:00
Devang Patel	29168baf4b	Make it a .ll test case. llvm-svn: 108370	2010-07-14 23:12:52 +00:00
Eric Christopher	e34b383e71	Add a testcase for the vla and stack realignment warning. llvm-svn: 108365	2010-07-14 22:26:35 +00:00
Dale Johannesen	6fe8c37a01	Tests for llvm-gcc commit 108360. llvm-svn: 108362	2010-07-14 21:22:35 +00:00
Jim Grosbach	a90af1ba38	Improve 64-subtraction of immediates when parts of the immediate can fit in the literal field of an instruction. E.g., long long foo(long long a) { return a - 734439407618LL; } rdar://7038284 llvm-svn: 108339	2010-07-14 17:45:16 +00:00
Dan Gohman	042523340b	Delete fast-isel's trivial load optimization; it breaks debugging because it can look past points where a debugger might modify user variables. llvm-svn: 108336	2010-07-14 17:25:37 +00:00
Bob Wilson	bb57896f8e	Fix test to appease the buildbots. llvm-svn: 108334	2010-07-14 16:43:47 +00:00
Evan Cheng	a8e8874552	Fix for PR7193 was overly conservative. The only case where sibcall callee address cannot be allocated a register is in 32-bit mode where the first three arguments are marked inreg. In that case EAX, EDX, and ECX will be used for argument passing. This fixes PR7610. llvm-svn: 108327	2010-07-14 06:44:01 +00:00
Bob Wilson	bad47f62f6	Add support for NEON VMVN immediate instructions. llvm-svn: 108324	2010-07-14 06:31:50 +00:00
Chris Lattner	ec0e7b1643	revert r108320, I see the failures now... llvm-svn: 108322	2010-07-14 06:16:35 +00:00
Chris Lattner	658680b2f5	reapply benjamin's instcombine patch, I don't see anything wrong with it and can't repro any problems with a manual self-host. llvm-svn: 108320	2010-07-14 05:59:13 +00:00
Evan Cheng	c893115312	Re-enable the test with fix. llvm-svn: 108319	2010-07-14 05:49:23 +00:00
Chris Lattner	711338fb04	temporarily disable to test to fix buildbots. llvm-svn: 108310	2010-07-14 02:21:59 +00:00
Evan Cheng	d542414945	Teach ProcessImplicitDefs to transform more COPY instructions into IMPLICIT_DEF (and subsequently eliminate them). This allows machine LICM to hoist IMPLICIT_DEF's. PR7620. llvm-svn: 108304	2010-07-14 01:22:19 +00:00
Bob Wilson	103a0dcfe1	Add an ARM-specific DAG combining to avoid redundant VDUPLANE nodes. Radar 7373643. llvm-svn: 108303	2010-07-14 01:22:12 +00:00
Bruno Cardoso Lopes	6c6c14a55c	Add AVX 256-bit compare instructions and a bunch of testcases llvm-svn: 108286	2010-07-13 22:06:38 +00:00
Bob Wilson	a3f1901531	Use a target-specific VMOVIMM DAG node instead of BUILD_VECTOR to represent NEON VMOV-immediate instructions. This simplifies some things. llvm-svn: 108275	2010-07-13 21:16:48 +00:00
Bruno Cardoso Lopes	fd8bfcd6e1	AVX 256-bit conversion instructions Add the x86 VEX_L form to handle special cases where VEX_L must be set. llvm-svn: 108274	2010-07-13 21:07:28 +00:00
Dale Johannesen	caca5488dc	In inline asm treat indirect 'X' constraint as 'm'. This may not be right in all cases, but it's better than asserting which it was doing before. PR 7528. llvm-svn: 108268	2010-07-13 20:17:05 +00:00
Dan Gohman	afd69cf5b7	Add support for empty named metadata too. This isn't particularly useful, but it is nice for consistency. llvm-svn: 108262	2010-07-13 19:42:44 +00:00
Dan Gohman	1e0213a758	Add support for empty metadata nodes: !{}. llvm-svn: 108259	2010-07-13 19:33:27 +00:00
Evan Cheng	0cc4ad983d	Extend the r107852 optimization which turns some fp compare to code sequence using only i32 operations. It now optimize some f64 compares when fp compare is exceptionally slow (e.g. cortex-a8). It also catches comparison against 0.0. llvm-svn: 108258	2010-07-13 19:27:42 +00:00
Evan Cheng	f43961007c	-enable-unsafe-fp-math should not imply -enable-finite-only-fp-math. llvm-svn: 108254	2010-07-13 18:46:14 +00:00
Dale Johannesen	f241d4626c	Fix PR number. llvm-svn: 108251	2010-07-13 18:14:47 +00:00
Duncan Sands	f88a284579	Handle the case of a tail recursion in which the tail call is followed by a return that returns a constant, while elsewhere in the function another return instruction returns a different constant. This is a special case of accumulator recursion, so just generalize the existing logic a bit. llvm-svn: 108241	2010-07-13 15:41:41 +00:00
Chris Lattner	55595fb291	my work on adding segment registers to LEA missed the disassembler. Remove some code from the disassembler to compensate, unbreaking disassembly of lea's. llvm-svn: 108226	2010-07-13 04:23:55 +00:00
Bruno Cardoso Lopes	dff283e146	Add AVX 256-bit packed logical forms llvm-svn: 108224	2010-07-13 02:38:35 +00:00
Bruno Cardoso Lopes	36b32aeaa5	Add AVX 256-bit unop arithmetic instructions llvm-svn: 108223	2010-07-13 01:53:31 +00:00
Bruno Cardoso Lopes	8e67a0482e	Add AVX 256 binary arithmetic instructions llvm-svn: 108207	2010-07-12 23:04:15 +00:00
Dan Gohman	51e6d9bbf6	Apply the SSE dependence idiom for SSE unary operations to SD instructions too, in addition to SS instructions. And add a comment about it. llvm-svn: 108191	2010-07-12 20:46:04 +00:00
Bruno Cardoso Lopes	f9bcaad76d	Add AVX 256-bit MOVMSK forms llvm-svn: 108184	2010-07-12 20:06:32 +00:00
Daniel Dunbar	d388c93f87	MC/AsmParser: Move .tbss and .zerofill parsing to Darwin specific parser. llvm-svn: 108180	2010-07-12 19:37:35 +00:00
Daniel Dunbar	63a379dd5c	MC/AsmParser: Move .desc parsing to Darwin specific parser. llvm-svn: 108179	2010-07-12 19:22:53 +00:00
Daniel Dunbar	ae9da1481a	MC/AsmParser: Move some misc. Darwin directive handling to DarwinAsmParser. llvm-svn: 108174	2010-07-12 18:49:22 +00:00
Dan Gohman	c128e70ff2	Add a lint check for mismatched return types, inspired by PR6944. llvm-svn: 108162	2010-07-12 18:02:04 +00:00
Benjamin Kramer	8f36402ac2	Nope, still breaks the release selfhost bots :( llvm-svn: 108153	2010-07-12 16:38:48 +00:00
Benjamin Kramer	07b695e052	Reapply the "or" half of r108136, which seems to be less problematic. llvm-svn: 108152	2010-07-12 16:15:48 +00:00
Benjamin Kramer	c719e8ae9e	Revert r108141 again, sigh. llvm-svn: 108148	2010-07-12 14:42:04 +00:00
Benjamin Kramer	f578c36035	Reapply 108136 with an ugly pasto fixed. llvm-svn: 108141	2010-07-12 13:44:00 +00:00
Benjamin Kramer	9675e759cf	Revert r108136 until I figure out why it broke selfhost. llvm-svn: 108139	2010-07-12 12:35:49 +00:00
Benjamin Kramer	35473faa50	instcombine: fold (x & y) \| (~x & z) and (x & y) ^ (~x & z) into ((y ^ z) & x) ^ z which is one instruction shorter. (PR6773) before: %and = and i32 %y, %x %neg = xor i32 %x, -1 %and4 = and i32 %z, %neg %xor = xor i32 %and4, %and after: %xor1 = xor i32 %z, %y %and2 = and i32 %xor1, %x %xor = xor i32 %and2, %z llvm-svn: 108136	2010-07-12 11:54:45 +00:00
Chris Lattner	25eea4db66	fix PR7311 by avoiding breaking casts when a bitcast from scalar->vector is involved. llvm-svn: 108117	2010-07-12 01:19:22 +00:00
Chris Lattner	bbc25ff5cc	if jump threading is able to infer interesting values on both the LHS and RHS of an and/or instruction, don't multiply add known predecessor values. This fixes the crash on testcase from PR7498 llvm-svn: 108114	2010-07-12 00:47:34 +00:00
Chris Lattner	fd4a09fc0a	fix PR7429, a crash turning a load from a string into a float. llvm-svn: 108113	2010-07-12 00:22:51 +00:00
Chris Lattner	f8feba368c	convert to filechecconvert to filecheckk llvm-svn: 108112	2010-07-12 00:21:10 +00:00
Chris Lattner	9338b0a1e2	merge two tests. llvm-svn: 108111	2010-07-12 00:19:47 +00:00
Jakob Stoklund Olesen	c4227f1362	Remove TargetInstrInfo::copyRegToReg entirely. Targets must now implement TargetInstrInfo::copyPhysReg instead. There is no longer a default implementation forwarding to copyRegToReg. llvm-svn: 108095	2010-07-11 17:01:17 +00:00
Rafael Espindola	a76eccf815	Fix va_arg for doubles. With this patch VAARG nodes always contain the correct alignment information, which simplifies ExpandRes_VAARG a bit. The patch introduces a new alignment information to TargetLoweringInfo. This is needed since the two natural candidates cannot be used: * The 's' in target data: If this is set to the minimal alignment of any argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for example. * The getTransientStackAlignment method. It is possible for an architecture to have argument less aligned than what we maintain the stack pointer. llvm-svn: 108072	2010-07-11 04:01:49 +00:00
Dan Gohman	79be2b9be5	Fix this test. llvm-svn: 108059	2010-07-10 22:42:12 +00:00
Jakob Stoklund Olesen	c4b3bcc051	FileCheckize inline asm FP stack tests llvm-svn: 108046	2010-07-10 16:30:25 +00:00
Dan Gohman	30933b3bdb	Add an explicit triple to make this test behave consistently. llvm-svn: 108041	2010-07-10 09:01:35 +00:00
Dan Gohman	367b65b56e	Fix this XTARGET so that this does doesn't XPASS on non-darwin hosts. llvm-svn: 108040	2010-07-10 09:01:03 +00:00
Dan Gohman	d7b5ce3312	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Bruno Cardoso Lopes	2419606bfb	Add AVX 256-bit packed MOVNT variants llvm-svn: 108021	2010-07-09 21:42:42 +00:00
Bruno Cardoso Lopes	6bc772eec7	Add AVX 256-bit unpack and interleave llvm-svn: 108017	2010-07-09 21:20:35 +00:00
Jakob Stoklund Olesen	51702ec46b	Fix a few tests llvm-svn: 108011	2010-07-09 20:43:09 +00:00
Jim Grosbach	2a5725b1a3	In the presence of variable sized objects, allocate an emergency spill slot. rdar://8131327 llvm-svn: 108008	2010-07-09 20:27:06 +00:00
Dan Gohman	ea9ae3e6ed	Add a target triple. llvm-svn: 108003	2010-07-09 19:17:36 +00:00
Dan Gohman	7929c448fc	Fix MachineLICM to actually visit inner loops. llvm-svn: 108001	2010-07-09 18:49:45 +00:00
Bruno Cardoso Lopes	792e906bef	Start the support for AVX instructions with 256-bit %ymm registers. A couple of notes: - The instructions are being added with dummy placeholder patterns using some 256 specifiers, this is not meant to work now, but since there are some multiclasses generic enough to accept them, when we go for codegen, the stuff will be already there. - Add VEX encoding bits to support YMM - Add MOVUPS and MOVAPS in the first round - Use "Y" as suffix for those Instructions: MOVUPSYrr, ... - All AVX instructions in X86InstrSSE.td will move soon to a new X86InstrAVX file. llvm-svn: 107996	2010-07-09 18:27:43 +00:00
Bob Wilson	6586e9b203	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Jakob Stoklund Olesen	a57965827f	Fix test to be less sensitive of regalloc accidents llvm-svn: 107951	2010-07-09 01:32:11 +00:00
Bob Wilson	88a4e6dc0e	Print "dregpair" NEON operands with a space between them, for readability and consistency with other instructions that have lists of register operands. llvm-svn: 107944	2010-07-09 00:47:20 +00:00
Dan Gohman	0b5aa1cdd3	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Bob Wilson	21eed476e8	Reenable DAG combining for vector shuffles. It looks like it was temporarily disabled and then never turned back on again. Adjust some tests, one because this change avoids an unnecessary instruction, and the other to make it continue testing what it was intended to test. llvm-svn: 107941	2010-07-09 00:38:12 +00:00
Bill Wendling	a992445ff2	Extension of r107506. Make sure that we don't mark a function as having a call if the inline ASM doesn't need a stack frame. llvm-svn: 107922	2010-07-08 22:38:02 +00:00
Chris Lattner	9f034c1e5d	Rework segment prefix emission code to handle segments in memory operands at the same type as hard coded segments. This fixes problems where we'd emit the segment override after the REX prefix on instructions like: mov %gs:(%rdi), %rax This fixes rdar://8127102. I have several cleanup patches coming next. llvm-svn: 107917	2010-07-08 22:28:12 +00:00
Stuart Hastings	aa246f5687	Test case for r107843. Radar 8152866. llvm-svn: 107907	2010-07-08 20:31:05 +00:00
Evan Cheng	0f54854a1d	Check for FiniteOnlyFPMath as well. llvm-svn: 107904	2010-07-08 20:12:24 +00:00
Benjamin Kramer	2321e6a4d4	Teach instcombine to transform (X >s -1) ? C1 : C2 and (X <s 0) ? C2 : C1 into ((X >>s 31) & (C2 - C1)) + C1, avoiding the conditional. This optimization could be extended to take non-const C1 and C2 but we better stay conservative to avoid code size bloat for now. for int sel(int n) { return n >= 0 ? 60 : 100; } we now generate sarl $31, %edi andl $40, %edi leal 60(%rdi), %eax instead of testl %edi, %edi movl $60, %ecx movl $100, %eax cmovnsl %ecx, %eax llvm-svn: 107866	2010-07-08 11:39:10 +00:00
Eric Christopher	e796253217	A slight reworking of the custom patterns for x86-64 tpoff codegen and correct the testcase for valid assembly. Needs more tests. llvm-svn: 107860	2010-07-08 07:36:46 +00:00
Evan Cheng	be1f7a931e	r107852 is only safe with -enable-unsafe-fp-math to account for +0.0 == -0.0. llvm-svn: 107856	2010-07-08 06:01:49 +00:00
Evan Cheng	25f9364cbd	Optimize some vfp comparisons to integer ones. This patch implements the simplest case when the following conditions are met: 1. The arguments are f32. 2. The arguments are loads and they have no uses other than the comparison. 3. The comparison code is EQ or NE. e.g. vldr.32 s0, [r1] vldr.32 s1, [r0] vcmpe.f32 s1, s0 vmrs apsr_nzcv, fpscr beq LBB0_2 => ldr r1, [r1] ldr r0, [r0] cmp r0, r1 beq LBB0_2 More complicated cases will be implemented in subsequent patches. llvm-svn: 107852	2010-07-08 02:08:50 +00:00
Dale Johannesen	e2289285ae	Changes to ARM tail calls, mostly cosmetic. Add explicit testcases for tail calls within the same module. Duplicate some code to humor those who think .w doesn't apply on ARM. Leave this disabled on Thumb1, and add some comments explaining why it's hard and won't gain much. llvm-svn: 107851	2010-07-08 01:18:23 +00:00
Dan Gohman	e75704369d	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Chris Lattner	efa3c824cc	Fix the second half of PR7437: scalarrepl wasn't preserving address spaces when SRoA'ing memcpy's. llvm-svn: 107846	2010-07-08 00:27:05 +00:00
Chris Lattner	ac5881295c	Implement the major chunk of PR7195: support for 'callw' in the integrated assembler. Still some discussion to be done. llvm-svn: 107825	2010-07-07 22:27:31 +00:00
Bruno Cardoso Lopes	6c61451011	Add more assembly opcodes for SSE compare instructions llvm-svn: 107823	2010-07-07 22:24:03 +00:00
Jakob Stoklund Olesen	ddaf0099a5	Allow copies between GR8_ABCD_L and GR8_ABCD_H. This fixes PR7540. llvm-svn: 107809	2010-07-07 20:33:27 +00:00
Dan Gohman	e7ccc51cc1	Implement bottom-up fast-isel. This has the advantage of not requiring a separate DCE pass over MachineInstrs. llvm-svn: 107804	2010-07-07 19:20:32 +00:00
Dan Gohman	2d4d01d0de	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Bruno Cardoso Lopes	fd8060335b	Add AVX AES instructions llvm-svn: 107798	2010-07-07 18:24:20 +00:00
Dan Gohman	00ef93258a	Remove interprocedural-basic-aa and associated code. The AliasAnalysis interface needs implementations to be consistent, so any code which wants to support different semantics must use a different interface. It's not currently worthwhile to add a new interface for this new concept. Document that AliasAnalysis doesn't support cross-function queries. llvm-svn: 107776	2010-07-07 14:27:09 +00:00
Bruno Cardoso Lopes	6d122aef97	Add AVX SSE4.2 instructions llvm-svn: 107752	2010-07-07 03:39:29 +00:00
Bruno Cardoso Lopes	8f5472a8e8	Add AVX SSE4.1 insertps, ptest and movntdqa instructions llvm-svn: 107747	2010-07-07 01:14:56 +00:00
Bruno Cardoso Lopes	6430c7350d	Add AVX SSE4.1 extractps and pinsr instructions llvm-svn: 107746	2010-07-07 01:01:13 +00:00
Bruno Cardoso Lopes	f3116ebe96	Add AVX SSE4.1 Extract Integer instructions llvm-svn: 107740	2010-07-07 00:07:24 +00:00
Dale Johannesen	ce65663330	Accept RIP-relative symbols with 'i' constraint, and print the (%rip) only if the 'a' modifier is present. PR 7528. llvm-svn: 107727	2010-07-06 23:27:00 +00:00
Bruno Cardoso Lopes	1f9ad516c6	Add the rest of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107723	2010-07-06 23:15:17 +00:00
Dale Johannesen	6f01541ae6	Make test not hang waiting for input. llvm-svn: 107721	2010-07-06 23:06:58 +00:00
Bruno Cardoso Lopes	35702d27c4	Add part of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107720	2010-07-06 23:01:41 +00:00
Bruno Cardoso Lopes	e2bd058d32	Add AVX vblendvpd, vblendvps and vpblendvb instructions Update VEX encoding to support those new instructions llvm-svn: 107715	2010-07-06 22:36:24 +00:00
Jakob Stoklund Olesen	a64c0a3d22	Be more forgiving when calculating alias interference for physreg coalescing. It is OK for an alias live range to overlap if there is a copy to or from the physical register. CoalescerPair can work out if the copy is coalescable independently of the alias. This means that we can join with the actual destination interval instead of using the getOrigDstReg() hack. It is no longer necessary to merge clobber ranges into subregisters. llvm-svn: 107695	2010-07-06 20:31:51 +00:00
Devang Patel	23a7593534	Fix PR7545 crash. llvm-svn: 107678	2010-07-06 18:18:32 +00:00
Rafael Espindola	7c510aa7bc	Don't create neon moves in CopyRegToReg. NEONMoveFixPass will do the conversion if profitable. llvm-svn: 107673	2010-07-06 16:24:34 +00:00
Eric Christopher	8f06b4a294	Remove mistakenly added test. llvm-svn: 107641	2010-07-06 05:20:13 +00:00
Eric Christopher	2ad0c779c3	Fix up -fstack-protector on linux to use the segment registers. Split out testcases per architecture and os now. Patch from Nelson Elhage. llvm-svn: 107640	2010-07-06 05:18:56 +00:00
Chris Lattner	60db4557cd	another v2f32 case, in this case showing poor codegen. llvm-svn: 107614	2010-07-05 05:52:56 +00:00
Chris Lattner	431e81f2fb	fix test on non-x86 hosts. llvm-svn: 107608	2010-07-05 03:56:55 +00:00
Chris Lattner	45cc4d74a3	Just rip v2f32 support completely out of the X86 backend. In the example in the testcase, we now generate: _test1: ## @test1 movss 4(%esp), %xmm0 addss 8(%esp), %xmm0 movl 12(%esp), %eax movss %xmm0, (%eax) ret instead of: _test1: ## @test1 subl $20, %esp movl 24(%esp), %eax movq %mm0, (%esp) movq %mm0, 8(%esp) movss (%esp), %xmm0 addss 12(%esp), %xmm0 movss %xmm0, (%eax) addl $20, %esp ret v2f32 support did not work reliably because most of the X86 backend didn't know it was legal. It was apparently only added to support returning source-level v2f32 values in MMX registers in x86-32 mode. If ABI compatibility is important on this GCC-extended-vector type for some reason, then the frontend should generate IR that returns v2i32 instead of v2f32. However, we generally don't try very hard to be abi compatible on gcc extended vectors. llvm-svn: 107601	2010-07-04 23:07:25 +00:00
Chris Lattner	681b926d54	fix PR7518 - terrible codegen of <2 x float>, by only marking v2f32 as legal in 32-bit mode. It is just as terrible there, but I just care about x86-64 and noone claims it is valuable in 64-bit mode. llvm-svn: 107600	2010-07-04 22:57:10 +00:00
Bruno Cardoso Lopes	ca99012ac0	Add AVX SSE4.1 blend, mpsadbw and vdp llvm-svn: 107560	2010-07-03 01:37:03 +00:00
Bruno Cardoso Lopes	bc75502f09	Add AVX SSE4.1 binop (some forms of packed max,min,mul,pack,cmp) instructions llvm-svn: 107558	2010-07-03 01:15:47 +00:00
Bruno Cardoso Lopes	fc9cdc4d61	Add AVX SSE4.1 Horizontal Minimum and Position instruction llvm-svn: 107552	2010-07-03 00:49:21 +00:00
Bruno Cardoso Lopes	621c85b038	Add AVX SSE4.1 round instructions llvm-svn: 107549	2010-07-03 00:37:44 +00:00
Bruno Cardoso Lopes	c7111fd355	- Add support for the rest of AVX SSE3 instructions - Fix VEX prefix to be emitted with 3 bytes whenever VEX_5M represents a REX equivalent two byte leading opcode llvm-svn: 107523	2010-07-02 22:06:54 +00:00
Evan Cheng	0ce84486c3	- Two-address pass should not assume unfolding is always successful. - X86 unfolding should check if the instructions being unfolded has memoperands. If there is no memoperands, then it must assume conservative alignment. If this would introduce an expensive sse unaligned load / store, then unfoldMemoryOperand etc. should not unfold the instruction. llvm-svn: 107509	2010-07-02 20:36:18 +00:00
Dale Johannesen	4d887f7ca7	Propagate the AlignStack bit in InlineAsm's to the PrologEpilog code, and use it to determine whether the asm forces stack alignment or not. gcc consistently does not do this for GCC-style asms; Apple gcc inconsistently sometimes does it for asm blocks. There is no convenient place to put a bit in either the SDNode or the MachineInstr form, so I've added an extra operand to each; unlovely, but it does allow for expansion for more bits, should we need it. PR 5125. Some existing testcases are affected. The operand lists of the SDNode and MachineInstr forms are indexed with awesome mnemonics, like "2"; I may fix this someday, but not now. I'm not making it any worse. If anyone is inspired I think you can find all the right places from this patch. llvm-svn: 107506	2010-07-02 20:16:09 +00:00
Bob Wilson	771d04b969	Fix incorrect asm-printing of some NEON immediates. Fix weak testcase so that it checks the immediate values, not just the instructions opcodes. Radar 8110263. llvm-svn: 107487	2010-07-02 17:23:44 +00:00
Dale Johannesen	744c74c444	Prevent test from hanging waiting for input. llvm-svn: 107446	2010-07-01 22:57:11 +00:00
Bob Wilson	8a99b730a9	ARM function alignments were off by a power of two. svn 83242 changed getFunctionAlignment and the corresponding use of that value in the ARM asm printer, but now we're using the standard asm printer. The result of this was that function alignments were dropped completely for Thumb functions. Radar 8143571. llvm-svn: 107435	2010-07-01 22:26:26 +00:00
Bill Wendling	03bcd6ecc8	Implement the "linker_private_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. However, this linkage isn't specific to Objective-C. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". Currently only supported on Darwin platforms. llvm-svn: 107433	2010-07-01 21:55:59 +00:00
Dan Gohman	84f90a387d	Remove context sensitivity concerns from interprocedural-basic-aa, and make it more aggressive in cases where both pointers are known to live in the same function. llvm-svn: 107420	2010-07-01 20:08:40 +00:00
Devang Patel	2b434e12cd	Debugging infomration is encoded in llvm IR using metadata. This is designed such a way that debug info for symbols preserved even if symbols are optimized away by the optimizer. Add new special pass to remove debug info for such symbols. llvm-svn: 107416	2010-07-01 19:49:20 +00:00
Bruno Cardoso Lopes	5e88700f28	Move SSE3 Move patterns to a more appropriate section Add AVX SSE3 packed horizontal and & sub instructions llvm-svn: 107405	2010-07-01 17:35:02 +00:00
Bruno Cardoso Lopes	886ee33a38	Add AVX SSE3 packed addsub instructions llvm-svn: 107404	2010-07-01 17:08:18 +00:00
Dan Gohman	d2965c10a1	Temporarily disable on-demand fast-isel. llvm-svn: 107393	2010-07-01 12:15:30 +00:00
Dan Gohman	aef3d140b7	Teach fast-isel to avoid loading a value from memory when it's already available in a register. This is pretty primitive, but it reduces the number of instructions in common testcases by 4%. llvm-svn: 107380	2010-07-01 03:49:38 +00:00
Dan Gohman	722f5fc567	Enable on-demand fast-isel. llvm-svn: 107377	2010-07-01 02:58:57 +00:00
Bruno Cardoso Lopes	a7a0c83563	Add AVX SSE3 replicate and convert instructions llvm-svn: 107375	2010-07-01 02:33:39 +00:00
Dan Gohman	7937d5606d	Teach X86FastISel to fold constant offsets and scaled indices in the same address. llvm-svn: 107373	2010-07-01 02:27:15 +00:00
Bruno Cardoso Lopes	05166740eb	- Add AVX SSE2 Move doubleword and quadword instructions. - Add encode bits for VEX_W - All 128-bit SSE 1 & SSE2 instructions that are described in the .td file now have a AVX encoded form already working. llvm-svn: 107365	2010-07-01 01:20:06 +00:00
Mikhail Glushenkov	0354891d98	Test for the -filelist fix. llvm-svn: 107363	2010-07-01 01:00:37 +00:00
Devang Patel	db735cbbab	Remove all debug info related named mdnodes. llvm-svn: 107323	2010-06-30 21:29:00 +00:00
Bruno Cardoso Lopes	cbcebe2950	Add AVX SSE2 mask creation and conditional store instructions llvm-svn: 107306	2010-06-30 18:38:10 +00:00
Dan Gohman	c0cca7fdda	Revert the part of r107257 which introduced new logic for using nsw and nuw flags from IR Instructions. On further consideration, this isn't valid. llvm-svn: 107298	2010-06-30 17:27:11 +00:00
Bruno Cardoso Lopes	d079c91683	Add AVX SSE2 packed integer extract/insert instructions llvm-svn: 107293	2010-06-30 17:03:03 +00:00
Dan Gohman	725ed0364b	Add a testcase for scev-aa's new capability. llvm-svn: 107258	2010-06-30 07:17:47 +00:00
Bruno Cardoso Lopes	e82689fea2	Add AVX SSE2 integer unpack instructions llvm-svn: 107246	2010-06-30 04:06:39 +00:00
Bruno Cardoso Lopes	ec0115c9b7	Add AVX SSE2 packed integer shuffle instructions llvm-svn: 107245	2010-06-30 03:47:56 +00:00
Bruno Cardoso Lopes	be792feb8b	Add AVX SSE2 pack with saturation integer instructions llvm-svn: 107241	2010-06-30 02:30:25 +00:00
Bruno Cardoso Lopes	2686ea4555	Add AVX SSE2 integer packed compare instructions llvm-svn: 107240	2010-06-30 02:21:09 +00:00
Bruno Cardoso Lopes	2e2caefff9	- Add AVX form of all SSE2 logical instructions - Add VEX encoding bits to x86 MRM0r-MRM7r llvm-svn: 107238	2010-06-30 01:58:37 +00:00
Devang Patel	648df7bf64	Add variables into a scope before constructing scope DIE otherwise variables won't be included DIE tree. llvm-svn: 107228	2010-06-30 00:11:08 +00:00
Bruno Cardoso Lopes	3f71ddfaad	Add several AVX integer packed binop instructions llvm-svn: 107225	2010-06-29 23:47:49 +00:00
Dan Gohman	ae36b1ed42	Fix ScalarEvolution's tripcount computation for chains of loops where each loop's induction variable's start value is the exit value of a preceding loop. llvm-svn: 107224	2010-06-29 23:43:06 +00:00
Bruno Cardoso Lopes	30689a3a7f	Add AVX ld/st XCSR register. Add VEX encoding bits for MRMXm x86 form llvm-svn: 107204	2010-06-29 20:35:48 +00:00
Jakob Stoklund Olesen	dadea5b178	Fix the handling of partial redefines in the fast register allocator. A partial redefine needs to be treated like a tied operand, and the register must be reloaded while processing use operands. This fixes a bug where partially redefined registers were processed as normal defs with a reload added. The reload could clobber another use operand if it was a kill that allowed register reuse. llvm-svn: 107193	2010-06-29 19:15:30 +00:00
Bob Wilson	d91d5bfc95	Fix a register scavenger crash when dealing with undefined subregs. The LowerSubregs pass needs to preserve implicit def operands attached to EXTRACT_SUBREG instructions when it replaces those instructions with copies. llvm-svn: 107189	2010-06-29 18:42:49 +00:00
Bruno Cardoso Lopes	a4575f5b31	Add AVX non-temporal stores llvm-svn: 107178	2010-06-29 18:22:01 +00:00
Dan Gohman	9bbd007f15	Add a few more interesting testcases. llvm-svn: 107177	2010-06-29 18:17:11 +00:00
Bruno Cardoso Lopes	21a9433e9e	Add sqrt, rsqrt and rcp AVX instructions llvm-svn: 107166	2010-06-29 17:26:30 +00:00
Rafael Espindola	38a7d7cbc3	Add a VT argument to getMinimalPhysRegClass and replace the copy related uses of getPhysicalRegisterRegClass with it. If we want to make a copy (or estimate its cost), it is better to use the smallest class as more efficient operations might be possible. llvm-svn: 107140	2010-06-29 14:02:34 +00:00
Duncan Sands	dddf876e96	Looks like this test is missing an XFAIL line. llvm-svn: 107134	2010-06-29 13:18:50 +00:00
Evan Cheng	b59dd8f10a	PR7503: uxtb16 is not available for ARMv7-M. Patch by Brian G. Lucas. llvm-svn: 107122	2010-06-29 05:38:36 +00:00
Bob Wilson	1e5da550e5	Reapply my if-conversion cleanup from svn r106939 with fixes. There are 2 changes relative to the previous version of the patch: 1) For the "simple" if-conversion case, there's no need to worry about RemoveExtraEdges not handling an unanalyzable branch. Predicated terminators are ignored in this context, so RemoveExtraEdges does the right thing. This might break someday if we ever treat indirect branches (BRIND) as predicable, but for now, I just removed this part of the patch, because in the case where we do not add an unconditional branch, we rely on keeping the fall-through edge to CvtBBI (which is empty after this transformation). The change relative to the previous patch is: @@ -1036,10 +1036,6 @@ IterIfcvt = false; } - // RemoveExtraEdges won't work if the block has an unanalyzable branch, - // which is typically the case for IfConvertSimple, so explicitly remove - // CvtBBI as a successor. - BBI.BB->removeSuccessor(CvtBBI->BB); RemoveExtraEdges(BBI); // Update block info. BB can be iteratively if-converted. 2) My patch exposed a bug in the code for merging the tail of a "diamond", which had previously never been exercised. The code was simply checking that the tail had a single predecessor, but there was a case in MultiSource/Benchmarks/VersaBench/dbms where that single predecessor was neither edge of the diamond. I added the following change to check for that: @@ -1276,7 +1276,18 @@ // tail, add a unconditional branch to it. if (TailBB) { BBInfo TailBBI = BBAnalysis[TailBB->getNumber()]; - if (TailBB->pred_size() == 1 && !TailBBI.HasFallThrough) { + bool CanMergeTail = !TailBBI.HasFallThrough; + // There may still be a fall-through edge from BBI1 or BBI2 to TailBB; + // check if there are any other predecessors besides those. + unsigned NumPreds = TailBB->pred_size(); + if (NumPreds > 1) + CanMergeTail = false; + else if (NumPreds == 1 && CanMergeTail) { + MachineBasicBlock::pred_iterator PI = TailBB->pred_begin(); + if (PI != BBI1->BB && PI != BBI2->BB) + CanMergeTail = false; + } + if (CanMergeTail) { MergeBlocks(BBI, TailBBI); TailBBI.IsDone = true; } else { With these fixes, I was able to run all the SingleSource and MultiSource tests successfully. llvm-svn: 107110	2010-06-29 00:55:23 +00:00
Dan Gohman	0824affeff	Add an Intraprocedural form of BasicAliasAnalysis, which aims to properly handles instructions and arguments defined in different functions, or across recursive function iterations. llvm-svn: 107109	2010-06-29 00:50:39 +00:00
Bruno Cardoso Lopes	d6a091a4d4	Described the missing AVX forms of SSE2 convert instructions llvm-svn: 107108	2010-06-29 00:36:02 +00:00
Devang Patel	1575e9f5ce	The comment string does not match for all targets. PowerPC uses ;. llvm-svn: 107103	2010-06-29 00:04:40 +00:00
Bob Wilson	269a89fd3a	Unlike other targets, ARM now uses BUILD_VECTORs post-legalization so they can't be changed arbitrarily by the DAGCombiner without checking if it is running after legalization. llvm-svn: 107097	2010-06-28 23:40:25 +00:00
Dale Johannesen	764b056c30	Refix XTARGET. Previous attempt matches on powerpc-apple-darwin, although I don't see why. llvm-svn: 107090	2010-06-28 22:45:33 +00:00
Dale Johannesen	65cd5ba74d	Attempt to fix XTARGET. llvm-svn: 107088	2010-06-28 22:31:52 +00:00
Devang Patel	1de21ec498	Use DW_FORM_addr for DW_AT_entry_pc. llvm-svn: 107085	2010-06-28 22:22:47 +00:00
Dale Johannesen	17feb07c53	In asm's, output operands with matching input constraints have to be registers, per gcc documentation. This affects the logic for determining what "g" should lower to. PR 7393. A couple of existing testcases are affected. llvm-svn: 107079	2010-06-28 22:09:45 +00:00
Dan Gohman	e697a6f24f	Constant fold x == undef to undef. llvm-svn: 107074	2010-06-28 21:30:07 +00:00
Dan Gohman	7c34ece501	Fix Value::stripPointerCasts and BasicAA to avoid trouble on code in unreachable blocks, which have have use-def cycles. This fixes PR7514. llvm-svn: 107071	2010-06-28 21:16:52 +00:00
Devang Patel	68c81196f9	Remove this weak test. llvm-svn: 107059	2010-06-28 20:24:35 +00:00
Dale Johannesen	0e4d964bfe	Testcase for llvm-gcc fix 107051. llvm-svn: 107052	2010-06-28 20:07:30 +00:00
Jakob Stoklund Olesen	fde9c348e9	Don't write temporary files in test directory llvm-svn: 107049	2010-06-28 20:01:15 +00:00
Jakob Stoklund Olesen	0117091c16	Add a triple so test runs on Linux as well. llvm-svn: 107045	2010-06-28 19:31:15 +00:00
Jakob Stoklund Olesen	0d94d7af78	Add more special treatment for inline asm in RegAllocFast. When an instruction has tied operands and physreg defines, we must take extra care that the tied operands conflict with neither physreg defs nor uses. The special treatment is given to inline asm and instructions with tied operands / early clobbers and physreg defines. This fixes PR7509. llvm-svn: 107043	2010-06-28 18:34:34 +00:00
Devang Patel	f3b2db68c6	Preserve deleted function's local variables' debug info. Radar 8122864. llvm-svn: 107027	2010-06-28 18:25:03 +00:00
Devang Patel	6e34f19b17	Make this test darwin specific. llvm-svn: 107025	2010-06-28 18:04:03 +00:00
Chris Lattner	93e63a0218	this test is failing nondeterministically and blaming me, just disable it for now. llvm-svn: 106960	2010-06-26 22:08:30 +00:00
Benjamin Kramer	c1ecfd86a3	Fix test weirdness. llvm-svn: 106959	2010-06-26 22:06:50 +00:00
Benjamin Kramer	3bbc52ce3e	Fix some tests that didn't test anything. llvm-svn: 106954	2010-06-26 20:05:06 +00:00
Kenneth Uildriks	7228d98b85	Partial specialization test should not depend on the order of specialization operations or the names assigned to the specialized functions llvm-svn: 106953	2010-06-26 18:47:40 +00:00
Rafael Espindola	2041abd958	When splitting a VAARG, remember its alignment. This produces terrible but correct code. llvm-svn: 106952	2010-06-26 18:22:20 +00:00
Bob Wilson	418e64a385	Revert my if-conversion cleanup since it caused a bunch of nightly test regressions. --- Reverse-merging r106939 into '.': U test/CodeGen/Thumb2/thumb2-ifcvt3.ll U lib/CodeGen/IfConversion.cpp llvm-svn: 106951	2010-06-26 17:47:06 +00:00
Duncan Sands	3a5cb69cb8	Fix PR7328: when turning a tail recursion into a loop, need to preserve the returned value after the tail call if it differs from other return values. The optimal thing to do would be to introduce a phi node for the return value, but for the moment just fix the miscompile. llvm-svn: 106947	2010-06-26 12:53:31 +00:00
Eli Friedman	b9bdc5a52d	Remove bogus test. llvm-svn: 106941	2010-06-26 04:59:56 +00:00
Bob Wilson	c72da6bb56	Clean up some problems with extra CFG edges being introduced during if-conversion. The RemoveExtraEdges function doesn't work for blocks that end with unanalyzable branches, so in those cases, the "extra" edges must be explicitly removed. The CopyAndPredicateBlock and MergeBlocks methods can also avoid copying successor edges due to branches that have already been removed. The latter case is especially helpful when MergeBlocks is called for handling "diamond" if-conversions, where otherwise you can end up with some weird intermediate states in the CFG. Unfortunately I've been unable to find cases where this cleanup actually makes a significant difference in the code. There is one test where we manage to remove an empty block at the end of a function. Radar 6911268. llvm-svn: 106939	2010-06-26 04:27:33 +00:00
Jakob Stoklund Olesen	d7d0d4e882	When creating X86 MUL8 and DIV8 instructions, make sure we don't produce CopyFromReg nodes for aliasing registers (AX and AL). This confuses the fast register allocator. Instead of CopyFromReg(AL), use ExtractSubReg(CopyFromReg(AX), sub_8bit). This fixes PR7312. llvm-svn: 106934	2010-06-26 00:39:23 +00:00
Bruno Cardoso Lopes	74d716b9cd	Add AVX convert CVTSS2SI{rr,rm} and CVTDQ2PS{rr,rm} instructions llvm-svn: 106917	2010-06-25 23:47:23 +00:00
Bruno Cardoso Lopes	83651094ad	Reapply r106896: Add several AVX MOV flavors Support VEX encoding for MRMDestReg llvm-svn: 106912	2010-06-25 23:33:42 +00:00
Daniel Dunbar	acbdf53db4	Thumb2ITBlockPass: Fix a possible dereference of an invalid iterator. This was introduced in r106343, but only showed up recently (with a particular compiler & linker combination) because of the particular check, and because we have no builtin checking for dereferencing the end of an array, which is truly unfortunate. llvm-svn: 106908	2010-06-25 23:14:54 +00:00
Bruno Cardoso Lopes	4530fed87e	revert this now, it's using avx instead of sse :) llvm-svn: 106906	2010-06-25 23:04:29 +00:00
Evan Cheng	02b184de5b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Bruno Cardoso Lopes	a34d9b6d84	Add several AVX MOV flavors Support VEX encoding for MRMDestReg llvm-svn: 106896	2010-06-25 22:27:51 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Dan Gohman	8de1fe3ccf	pcmpeqd and friends are Commutable. llvm-svn: 106886	2010-06-25 21:05:35 +00:00
Bill Wendling	e41e40f689	- Reapply r106066 now that the bzip2 build regression has been fixed. - 2010-06-25-CoalescerSubRegDefDead.ll is the testcase for r106878. llvm-svn: 106880	2010-06-25 20:48:10 +00:00
Devang Patel	27510cc623	XFAIL this test on powerpc for now. llvm-svn: 106862	2010-06-25 17:32:23 +00:00
Bruno Cardoso Lopes	cbdcce6478	Add some AVX convert instructions llvm-svn: 106815	2010-06-25 00:39:30 +00:00
Dan Gohman	600658a4ba	Don't write an output file to cwd, and put an rdar prefix on an rdar number. llvm-svn: 106810	2010-06-24 23:45:15 +00:00
Dan Gohman	9a2f0473b2	Teach EmitLiveInCopies to omit copies for unused virtual registers, and to clean up unused incoming physregs from the live-in list. llvm-svn: 106805	2010-06-24 22:23:02 +00:00
Bill Wendling	2d3c490026	It's possible that a flag is added to the SDNode that points back to the original SDNode. This is badness. Also, this function allows one SDNode to point multiple flags to another SDNode. Badness as well. llvm-svn: 106793	2010-06-24 22:00:37 +00:00
Devang Patel	c657c621b7	DBG_VALUE machine instruction pointing to undefined register for a variable justify a separate scope if the variable is inlined function's argument. Radar 8122864. llvm-svn: 106792	2010-06-24 21:51:19 +00:00
Bruno Cardoso Lopes	4398fd7b83	- Add AVX COMI{SS,SD}{rr,rm} and UCOMI{SS,SD}{rr,rm}. - Fix a small VEX encoding issue. - Move compare instructions to their appropriate place. llvm-svn: 106787	2010-06-24 20:48:23 +00:00
Dale Johannesen	5ad5226c58	Disallow matching "i" constraint to symbol addresses when address requires a register or secondary load to compute (most PIC modes). This improves "g" constraint handling. 8015842. The test from 2007 is attempting to test the fix for PR1761, but since -relocation-model=static doesn't work on Darwin x86-64, it was not testing what it was supposed to be testing and was passing erroneously. Fixed to use Linux x86-64. llvm-svn: 106779	2010-06-24 20:14:51 +00:00
Jakob Stoklund Olesen	45230239e4	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. This second attempt fixes some crashes that only occurred Linux. llvm-svn: 106769	2010-06-24 18:15:01 +00:00
Bob Wilson	279e55fb2e	PR7458: Try commuting Thumb2 instruction operands to put them into 2-address form so they can be narrowed to 16-bit instructions. llvm-svn: 106762	2010-06-24 16:50:20 +00:00
Dan Gohman	463f26b4be	Eliminate the other half of the BRCOND optimization, and update as many tests as possible. llvm-svn: 106749	2010-06-24 15:24:03 +00:00
Dan Gohman	df6b33e778	Eliminate the first have of the optimization which eliminates BRCOND when the condition is constant. This optimization shouldn't be necessary, because codegen shouldn't be able to find dead control paths that the IR-level optimizer can't find. And it's undesirable, because it encourages bugpoint to leave "br i1 false" branches in its output. And it wasn't updating the CFG. I updated all the tests I could, but some tests are too reduced and I wasn't able to meaningfully preserve them. llvm-svn: 106748	2010-06-24 15:04:11 +00:00
Dan Gohman	600f62b3ba	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Chris Lattner	8048662539	Teach the x86 mc assembler that %dr6 = %db6, this implements rdar://8013734 llvm-svn: 106725	2010-06-24 07:29:18 +00:00
Dan Gohman	0695e09b09	Optimize the "bit test" code path for switch lowering in the case where the bit mask has exactly one bit. llvm-svn: 106716	2010-06-24 02:06:24 +00:00
Jakob Stoklund Olesen	dbb58d2974	Revert "Replace a big gob of old coalescer logic with the new CoalescerPair class." Whiny buildbots. llvm-svn: 106710	2010-06-24 00:52:22 +00:00
Bruno Cardoso Lopes	191a1cd2bb	Add AVX CMP{SS,SD}{rr,rm} instructions and encoding testcases llvm-svn: 106705	2010-06-24 00:32:06 +00:00
Jakob Stoklund Olesen	f38e6720cc	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. llvm-svn: 106701	2010-06-24 00:12:39 +00:00
Bill Wendling	f470747a36	We are missing opportunites to use ldm. Take code like this: void t(int cp0, int cp1, int dp, int fmd) { int c0, c1, d0, d1, d2, d3; c0 = (cp0++ & 0xffff) \| ((cp1++ << 16) & 0xffff0000); c1 = (cp0++ & 0xffff) \| ((cp1++ << 16) & 0xffff0000); / ... */ } It code gens into something pretty bad. But with this change (analogous to the X86 back-end), it will use ldm and generate few instructions. llvm-svn: 106693	2010-06-23 23:00:16 +00:00
Bruno Cardoso Lopes	05220c9a0d	Add AVX MOVMSK{PS,PD}rr instructions llvm-svn: 106683	2010-06-23 21:30:27 +00:00
Bruno Cardoso Lopes	3183dd5692	Add tests for different AVX cmp opcodes, also teach the x86 asm parser to understand the vcmp instruction llvm-svn: 106678	2010-06-23 21:10:57 +00:00
Bruno Cardoso Lopes	360d6fe299	Add AVX SHUF{PS,PD}{rr,rm} instructions llvm-svn: 106672	2010-06-23 20:07:15 +00:00
Nico Weber	337e8db712	Add support for the x86 instructions "pusha" and "popa". llvm-svn: 106671	2010-06-23 20:00:58 +00:00
Bruno Cardoso Lopes	30a28d6588	Fix a tblgen bug. Given the pattern below as an example: list<dag> Pattern = [(set RC:$dst, (v4f32 (shufp:src3 RC:$src1, (mem_frag addr:$src2))))]; The right reference resolving should lead to: list<dag> Pattern = [(set VR128:$dst, (v4f32 (shufp:src3 VR128:$src1, (mem_frag addr:$src2))))]; But was yielding: list<dag> Pattern = [(set VR128:$dst, (v4f32 (shufp VR128:$src1, (mem_frag addr:$src2))))]; Fix this by passing the right name when creating a new DagInit node. llvm-svn: 106670	2010-06-23 19:50:39 +00:00
Dale Johannesen	fc40f0a1ab	Reinstate correct test, remove the real invalidated test. llvm-svn: 106664	2010-06-23 18:56:06 +00:00
Dale Johannesen	6effb503f5	Remove tests invalidated by previous checkin. llvm-svn: 106663	2010-06-23 18:53:12 +00:00
Bill Wendling	a136521a17	MorphNodeTo doesn't preserve the memory operands. Because we're morphing a node into the same node, but with different non-memory operands, we need to replace the memory operands after it's finished morphing. llvm-svn: 106643	2010-06-23 18:16:24 +00:00
Daniel Dunbar	f643aa86b0	tests: Tweak lit.cfg to fix breakage with out-of-dir lookup. llvm-svn: 106638	2010-06-23 18:06:16 +00:00
Daniel Dunbar	4df321b7ad	Revert r106263, "Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass,"... it was causing both 'file' (with clang) and 176.gcc (with llvm-gcc) to be miscompiled. llvm-svn: 106634	2010-06-23 17:09:26 +00:00
Daniel Dunbar	ef5a4383ad	Revert r106066, "Create a more targeted fix for not sinking instructions into a range where it"... it causes bzip2 to be miscompiled by Clang. Conflicts: lib/CodeGen/MachineSink.cpp llvm-svn: 106614	2010-06-23 00:48:25 +00:00
Stuart Hastings	c0efbd5b31	Less incorrect handling of zero-length bitfields. Radars `7992077` and 8093043. llvm-svn: 106611	2010-06-23 00:31:14 +00:00
Bruno Cardoso Lopes	1e13c17a55	Add AVX compare packed instructions llvm-svn: 106600	2010-06-22 23:37:59 +00:00
Dan Gohman	f1cf963c64	Loosen up this test so that it doesn't depend as much on register allocation details. llvm-svn: 106599	2010-06-22 23:32:47 +00:00
Dan Gohman	1081f1a0f5	Fix OptimizeMax to handle an odd case where one of the max operands is another max which folds. This fixes PR7454. llvm-svn: 106594	2010-06-22 23:07:13 +00:00
Bruno Cardoso Lopes	535aa8ea91	Reapply support for AVX unpack and interleave instructions, with testcases this time. llvm-svn: 106593	2010-06-22 23:02:38 +00:00
Bruno Cardoso Lopes	1a890f9dc0	Add AVX MOV{SS,SD}{rr,rm} instructions llvm-svn: 106588	2010-06-22 22:38:56 +00:00
Bob Wilson	c5d712232d	Thumb1 functions using @llvm.returnaddress were not saving the incoming LR. Radar 8031193. llvm-svn: 106582	2010-06-22 22:04:24 +00:00
Eric Christopher	6250bd9e3c	Move a 64-bit test to the 64-bit file. Fixes an llvm-mc assertion during test runs. llvm-svn: 106577	2010-06-22 21:11:51 +00:00
Dale Johannesen	6d4802ba6c	Add SSE so these actually pass on non-X86 hosts. llvm-svn: 106575	2010-06-22 20:54:03 +00:00
Bruno Cardoso Lopes	dc883cf45a	Fix a subtle multiclass bug: when using class inheritance on a toplevel 'defm', make sure to properly resolve references. llvm-svn: 106570	2010-06-22 20:30:50 +00:00
Bill Wendling	7e35d39fee	Corresponding test changes for r106564. llvm-svn: 106569	2010-06-22 20:30:14 +00:00
Mon P Wang	825639e849	Move v-binop-widen tests to X86 since they don't work on all platforms llvm-svn: 106562	2010-06-22 19:40:50 +00:00
Jakob Stoklund Olesen	9c47dac677	Remove the SimpleJoin optimization from SimpleRegisterCoalescing. Measurements show that it does not speed up coalescing, so there is no reason the keep the added complexity around. Also clean out some unused methods and static functions. llvm-svn: 106548	2010-06-22 16:13:57 +00:00
Dan Gohman	f820bd327d	Allow "exhaustive" trip count evaluation on phi nodes with all constant operands. llvm-svn: 106537	2010-06-22 13:15:46 +00:00
Evan Cheng	37bb617f8a	Tail merging pass shall not break up IT blocks. rdar://8115404 llvm-svn: 106517	2010-06-22 01:18:16 +00:00
Dan Gohman	3c1b3c61e9	Teach two-address lowering how to unfold a load to open up commuting opportunities. For example, this lets it emit this: movq (%rax), %rcx addq %rdx, %rcx instead of this: movq %rdx, %rcx addq (%rax), %rcx in the case where %rdx has subsequent uses. It's the same number of instructions, and usually the same encoding size on x86, but it appears faster, and in general, it may allow better scheduling for the load. llvm-svn: 106493	2010-06-21 22:17:20 +00:00
Evan Cheng	1fb4de8ec5	Fix PR7421: bug in kill transferring logic. It was ignoring loads / stores which have already been processed. llvm-svn: 106481	2010-06-21 21:21:14 +00:00
Dan Gohman	2dd1d3d182	Make this test more robust in case LLVM ever decides to align the global variable differently. llvm-svn: 106454	2010-06-21 19:56:27 +00:00
Dale Johannesen	dd471bbb10	Add missing FileCheck call. llvm-svn: 106443	2010-06-21 18:46:08 +00:00
Devang Patel	c8bceaa418	test case for r106438. llvm-svn: 106439	2010-06-21 18:37:23 +00:00
Dale Johannesen	d5c58b76ab	Fix PR 7433. Silly typo in non-Darwin ARM tail call handling, plus correct R9 handling in that mode. llvm-svn: 106434	2010-06-21 18:21:49 +00:00
Eric Christopher	bf572c7cea	Add some codegen patterns for x86_64-linux-gnu tls codegen matching. Based on a patch by Patrick Marlier! llvm-svn: 106433	2010-06-21 18:21:27 +00:00
Kalle Raiskila	df071b7e42	Add the check to the testcase of r106419. llvm-svn: 106421	2010-06-21 15:11:51 +00:00
Kalle Raiskila	0ab5a02579	Mark the SPU 'lr' instruction to never have side effects. This allows the fast regiser allocator to remove redundant register moves. Update a set of tests that depend on the register allocator to be linear scan. llvm-svn: 106420	2010-06-21 15:08:16 +00:00
Kalle Raiskila	d7f50c118a	Fix the lowering of VECTOR_SHUFFLE on SPU to handle splats. llvm-svn: 106419	2010-06-21 14:42:19 +00:00
Kalle Raiskila	6f58190f6f	Fix lowering of VECTOR_SHUFFLE on SPU. Old algorithm used to choke llc with the attached test. llvm-svn: 106411	2010-06-21 10:17:36 +00:00
Evan Cheng	884a8fe5fa	Fix a crash caused by dereference of MBB.end(). rdar://8110842 llvm-svn: 106399	2010-06-20 00:54:38 +00:00
Dan Gohman	51d00092b6	Include the use kind along with the expression in the key of the use sharing map. The reconcileNewOffset logic already forces a separate use if the kinds differ, so incorporating the kind in the key means we can track more sharing opportunities. More sharing means fewer total uses to track, which means smaller problem sizes, which means the conservative throttles don't kick in as often. llvm-svn: 106396	2010-06-19 21:29:59 +00:00
Dan Gohman	866971ed3d	Fix ScalarEvolution's "exhaustive" trip count evaluation code to avoid assuming that loops are in canonical form, as ScalarEvolution doesn't depend on LoopSimplify itself. Also, with indirectbr not all loops can be simplified. This fixes PR7416. llvm-svn: 106389	2010-06-19 14:17:24 +00:00
Bruno Cardoso Lopes	8737b7d73d	Refactor aliased packed logical instructions, also add AVX AND,OR,XOR,NAND{P}{S,D}{rr,rm} instructions. llvm-svn: 106374	2010-06-19 02:44:01 +00:00
Evan Cheng	f3c01f3ef6	Disable sibcall optimization for Thumb1 for now since Thumb1RegisterInfo::emitEpilogue is not expecting them. llvm-svn: 106368	2010-06-19 01:01:32 +00:00
Bruno Cardoso Lopes	1e205f6b1c	Shrink down code and add for free AVX {MIN,MAX}P{S,D}{rm,rr} instructions llvm-svn: 106366	2010-06-19 00:37:31 +00:00
Chris Lattner	e808a78ac1	fix rdar://7873482 by teaching the instruction encoder to emit segment prefixes. Daniel wrote most of this patch. llvm-svn: 106364	2010-06-19 00:34:00 +00:00
Evan Cheng	119824ed4d	Move ARM if-conversion before post-ra scheduling. llvm-svn: 106355	2010-06-18 23:32:07 +00:00
Evan Cheng	2d51c7c592	Allow ARM if-converter to be run after post allocation scheduling. - This fixed a number of bugs in if-converter, tail merging, and post-allocation scheduler. If-converter now runs branch folding / tail merging first to maximize if-conversion opportunities. - Also changed the t2IT instruction slightly. It now defines the ITSTATE register which is read by instructions in the IT block. - Added Thumb2 specific hazard recognizer to ensure the scheduler doesn't change the instruction ordering in the IT block (since IT mask has been finalized). It also ensures no other instructions can be scheduled between instructions in the IT block. This is not yet enabled. llvm-svn: 106344	2010-06-18 23:09:54 +00:00
Jakob Stoklund Olesen	07f4fa8198	TwoAddressInstructionPass::CoalesceExtSubRegs can insert INSERT_SUBREG instructions, but it doesn't really understand live ranges, so the first INSERT_SUBREG uses an implicitly defined register. Fix it in LiveVariableAnalysis by adding the <undef> flag. llvm-svn: 106333	2010-06-18 22:29:44 +00:00
Evan Cheng	cf9e8a987f	Fix an inverted condition. llvm-svn: 106330	2010-06-18 22:17:13 +00:00
Jakob Stoklund Olesen	22a212f97c	When using ADDri to get the address of a stack object, 255 is a conservative limit on the offset that can be materialized without using the register scavenger. llvm-svn: 106312	2010-06-18 20:59:25 +00:00
Dan Gohman	24ceda8eb0	Revert r106304 (105548 and friends), which are the SCEVComplexityCompare optimizations. There is still some nondeterminism remaining. llvm-svn: 106306	2010-06-18 19:54:20 +00:00
Bruno Cardoso Lopes	23f8321cbc	Teach tablegen how to inherit from classes in 'defm' definitions. The rule is simple: only inherit from a class list if they come in the end, after the last multiclass. llvm-svn: 106305	2010-06-18 19:53:41 +00:00
Dan Gohman	4c807fca97	Reapply 105540, 105542, and 105548, and revert r105732. llvm-svn: 106304	2010-06-18 19:26:04 +00:00
Dale Johannesen	c1570dda5c	Enable tail calls on ARM by default, with some basic tests. This has been well tested on Darwin but not elsewhere. It should work provided the linker correctly resolves B.W <label in other function> which it has not seen before, at least from llvm-based compilers. I'm leaving the arm-tail-calls switch in until I see if there's any problems because of that; it might need to be disabled for some environments. llvm-svn: 106299	2010-06-18 19:00:18 +00:00
Jakob Stoklund Olesen	b9f91667e1	Treat the ARM inline asm {cc} constraint as a physreg (%CPSR), just like X86 does for {flags}. If we create virtual registers of the CCR class, RegAllocFast may try to spill them, and we can't do that. llvm-svn: 106289	2010-06-18 16:49:33 +00:00
Dan Gohman	559020df1d	Don't write a file named "&1". llvm-svn: 106269	2010-06-18 01:49:17 +00:00
Dan Gohman	f3aea7aecf	Disable indvars on loops when LoopSimplify form is not available. This fixes PR7333. llvm-svn: 106267	2010-06-18 01:35:11 +00:00
Dan Gohman	99ba4dac59	Don't maintain a set of deleted nodes; instead, use a HandleSDNode to track a node over CSE events. This fixes PR7368. llvm-svn: 106266	2010-06-18 01:24:29 +00:00
Bruno Cardoso Lopes	2323168705	Add {mix,max}{ss,sd}{rr,rm} AVX forms. llvm-svn: 106264	2010-06-18 01:12:56 +00:00
Dan Gohman	b92156d5e4	Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass, which is faster, simpler, and less surprising. llvm-svn: 106263	2010-06-18 01:05:21 +00:00
Dan Gohman	30d7a51d6c	Make this test less fragile. llvm-svn: 106255	2010-06-18 00:06:03 +00:00
Dale Johannesen	1f8e5fbc7a	Testcase for llvm-gcc 106225. llvm-svn: 106226	2010-06-17 17:43:14 +00:00
Rafael Espindola	29dda21e96	Remove arm_apcscc from the test files. It is the default and doing this matches what llvm-gcc and clang now produce. llvm-svn: 106221	2010-06-17 15:18:27 +00:00
Bruno Cardoso Lopes	4d1d798736	For a tablegen expression such as !if(a,b,c), let 'a' be evaluated for 'bit' operators llvm-svn: 106185	2010-06-17 00:31:36 +00:00
Bruno Cardoso Lopes	77a4a56251	let the '!eq' expression support 'int' and 'bit' types llvm-svn: 106171	2010-06-16 23:24:12 +00:00
Jakob Stoklund Olesen	207cd4bbd7	Allow a register to be redefined multiple times in a basic block. LiveVariableAnalysis was a bit picky about a register only being redefined once, but that really isn't necessary. Here is an example of chained INSERT_SUBREGs that we can handle now: 68 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1028<kill>, 14 register: %reg1040 +[70,134:0) 76 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1029<kill>, 13 register: %reg1040 replace range with [70,78:1) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,134:0) 0@78-(134) 1@70-(78) 84 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1030<kill>, 12 register: %reg1040 replace range with [78,86:2) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,134:0) 0@86-(134) 1@70-(78) 2@78-(86) 92 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1031<kill>, 11 register: %reg1040 replace range with [86,94:3) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,94:3)[94,134:0) 0@94-(134) 1@70-(78) 2@78-(86) 3@86-(94) rdar://problem/8096390 llvm-svn: 106152	2010-06-16 21:29:40 +00:00
Jim Grosbach	2c8b829238	modify so the test doesn't drop an output file in the test source directory. The test should also likely have some FileCheck bits to validate the output(?). llvm-svn: 106146	2010-06-16 21:07:06 +00:00
Devang Patel	79b0da30fb	Be specific. Use FileCheck. llvm-svn: 106135	2010-06-16 19:39:45 +00:00
Rafael Espindola	a20e2dfe86	Make sure that simplify libcalls does not replace a call with one calling convention with a new call with a different calling convention. llvm-svn: 106134	2010-06-16 19:34:01 +00:00
Devang Patel	e3721dd27c	This requires more investigation. Unblock buildbots for now. llvm-svn: 106122	2010-06-16 18:19:49 +00:00
Devang Patel	37e4f98cb6	Update test to explicitly capture llc output. llvm-svn: 106121	2010-06-16 18:04:12 +00:00
Benjamin Kramer	a13bd20396	simplify-libcalls: fold strncmp(x, y, 1) -> memcmp(x, y, 1) The memcmp will be optimized further and even the pathological case 'strstr(x, "x") == x' generates optimal code now. llvm-svn: 106097	2010-06-16 10:30:29 +00:00
Evan Cheng	f128bdcb55	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Bill Wendling	8c0cf0994d	Create a more targeted fix for not sinking instructions into a range where it will conflict with another live range. The place which creates this scenerio is the code in X86 that lowers a select instruction by splitting the MBBs. This eliminates the need to check from the bottom up in an MBB for live pregs. llvm-svn: 106066	2010-06-15 23:46:31 +00:00
Rafael Espindola	1115afb092	Update test to match recent llvm-gcc change. llvm-svn: 106056	2010-06-15 22:16:40 +00:00
Jakob Stoklund Olesen	ec2e964fd6	Remove the local register allocator. Please use the fast allocator instead. llvm-svn: 106051	2010-06-15 21:58:33 +00:00
Benjamin Kramer	1118860e3a	simplify-libcalls: fold strstr(a, b) == a -> strncmp(a, b, strlen(b)) == 0 llvm-svn: 106047	2010-06-15 21:34:25 +00:00
Rafael Espindola	ae591be4e9	Set the mtriple in some tests so that they use AAPCS. llvm-svn: 106041	2010-06-15 20:42:00 +00:00
Mon P Wang	7a84689cc5	Fixed vector widening of binary instructions that can trap. Patch by Visa Putkinen! llvm-svn: 106038	2010-06-15 20:29:05 +00:00
Chris Lattner	874c92bd47	fix fastisel to handle GS and FS relative pointers. Patch by Nelson Elhage! llvm-svn: 106031	2010-06-15 19:08:40 +00:00
Rafael Espindola	5a24a56e1e	Remove the arm_aapcscc marker from the tests. It is the default for the linux targets. llvm-svn: 106029	2010-06-15 19:04:29 +00:00
Jakob Stoklund Olesen	246e9a07a2	Avoid processing early clobbers twice in RegAllocFast. Early clobbers defining a virtual register were first alocated to a physreg and then processed as a physreg EC, spilling the virtreg. This fixes PR7382. llvm-svn: 105998	2010-06-15 16:20:57 +00:00
Jakob Stoklund Olesen	82eca35b3e	Add CoalescerPair helper class. Given a copy instruction, CoalescerPair can determine which registers to coalesce in order to eliminate the copy. It deals with all the subreg fun to determine a tuple (DstReg, SrcReg, SubIdx) such that: - SrcReg is a virtual register that will disappear after coalescing. - DstReg is a virtual or physical register whose live range will be extended. - SubIdx is 0 when DstReg is a physical register. - SrcReg can be joined with DstReg:SubIdx. CoalescerPair::isCoalescable() determines if another copy instruction is compatible with the same tuple. This fixes some NEON miscompilations where shuffles are getting coalesced as if they were copies. The CoalescerPair class will replace a lot of the spaghetti logic in JoinCopy later. llvm-svn: 105997	2010-06-15 16:04:21 +00:00
Bob Wilson	a55b8877e6	Generalize the pre-coalescing of extract_subregs feeding reg_sequences, replacing the overly conservative checks that I had introduced recently to deal with correctness issues. This makes a pretty noticable difference in our testcases where reg_sequences are used. I've updated one test to check that we no longer emit the unnecessary subreg moves. llvm-svn: 105991	2010-06-15 05:56:31 +00:00
Chris Lattner	00ab615406	apparently lots of dupes. llvm-svn: 105956	2010-06-14 20:19:03 +00:00
Chris Lattner	faa7bdccbf	fix a nasty bug where we were not treating available_externally symbols as declarations in the X86 backend. This would manifest on darwin x86-32 as errors like this with -fvisibility=hidden: symbol '__ZNSbIcED1Ev' can not be undefined in a subtraction expression This fixes PR7353. llvm-svn: 105954	2010-06-14 20:11:56 +00:00
Chris Lattner	bbb798c7d1	remove old test. llvm-svn: 105953	2010-06-14 20:07:43 +00:00
Chris Lattner	b30f87b74e	rename test llvm-svn: 105952	2010-06-14 20:07:34 +00:00
Chris Lattner	329ea064ed	jump threading can't split a critical edge from an indirectbr. This fixes PR7356. llvm-svn: 105950	2010-06-14 19:45:43 +00:00
Stuart Hastings	37b827fd11	Test case for Radar 8004649. llvm-svn: 105949	2010-06-14 18:37:04 +00:00
Benjamin Kramer	6e42d53cb3	Test case for r105914. llvm-svn: 105915	2010-06-13 16:16:54 +00:00
Daniel Dunbar	250a21b79b	tests: Run macho-dump with binary unbuffered streams on Windows, I can't find a Python 2.6 way to change stdin to binary. llvm-svn: 105894	2010-06-12 17:05:28 +00:00
Daniel Dunbar	edcc628289	tests: Make macho-dump.bat actually work. llvm-svn: 105891	2010-06-12 16:21:54 +00:00
Daniel Dunbar	12225eb687	tests: Propogate LLVM_SRC_ROOT and PYTHON_EXECUTABLE environment variables to tests. llvm-svn: 105890	2010-06-12 16:21:19 +00:00
Bruno Cardoso Lopes	a714ea0f7d	More AVX: {ADD,SUB,MUL,DIV}{PD,PS}rm llvm-svn: 105870	2010-06-12 01:53:48 +00:00
Bruno Cardoso Lopes	b06f54b852	More AVX: {ADD,SUB,MUL,DIV}{PD,PS}rr Handle OpSize TSFlag for AVX llvm-svn: 105869	2010-06-12 01:23:26 +00:00
Bruno Cardoso Lopes	fd5458d4bd	More AVX instructions ({ADD,SUB,MUL,DIV}{SS,SD}rm) Introduce the VEX_X field llvm-svn: 105859	2010-06-11 23:50:47 +00:00
Daniel Dunbar	56b093f572	tests: Add wrapper script for calling macho-dump on Win32. llvm-svn: 105856	2010-06-11 23:29:48 +00:00
Bob Wilson	f07d33d8f1	Add a missing bitcast. This code used to only handle conversions between i64 and f64 types, but now it also handle Neon vector types, so the f64 result of VMOVDRR may need to be converted to a Neon type. Radar 8084742. llvm-svn: 105845	2010-06-11 22:45:25 +00:00
Stuart Hastings	afe54f1625	Support for nested functions/classes in debug output. (Again.) Radar 7424645. llvm-svn: 105828	2010-06-11 20:08:44 +00:00
Bruno Cardoso Lopes	5f2adccc1b	Teach tablegen to allow "let" expressions inside multiclasses, providing more ways to factor out commonality from the records. llvm-svn: 105776	2010-06-10 02:42:59 +00:00
Bill Wendling	d53a2cb4ac	Testcase for r105741. llvm-svn: 105750	2010-06-09 20:30:22 +00:00
Jakob Stoklund Olesen	8bc5eca331	Mark physregs defined by inline asm as implicit. This is a bit of a hack to make inline asm look more like call instructions. It would be better to produce correct dead flags during isel. llvm-svn: 105749	2010-06-09 20:05:00 +00:00
Daniel Dunbar	e16d569932	Workaround SCEV non-determinism on this test, for now, to get buildbots back to green. Dan, please revert this once the real problem is fixed. llvm-svn: 105732	2010-06-09 17:54:40 +00:00
Kalle Raiskila	5e0862f7f5	Fix SPU to cope with vector insertelement to an undef position. We default to inserting to lane 0. llvm-svn: 105722	2010-06-09 09:58:17 +00:00
Kalle Raiskila	056113a211	Handle loading from/storing to undef pointers on SPU by inserting a random load/store, rather than crashing llc. llvm-svn: 105710	2010-06-09 08:29:41 +00:00
Bruno Cardoso Lopes	c2f87b7bb2	Reapply r105521, this time appending "LLU" to 64 bit immediates to avoid breaking the build. llvm-svn: 105652	2010-06-08 22:51:23 +00:00
Rafael Espindola	efac7f5e90	Add more virtual memory to lit. The python in x86-64 fedora 13 needs it to run the llvm tests :-( It was failing with -- Testing: 5324 tests, 8 threads -- Fatal Python error: PyEval_AcquireThread: NULL new thread state llvm-svn: 105610	2010-06-08 16:17:58 +00:00
Stuart Hastings	8612940357	Tweak test for debug/metadata change, update to FileCheck. Radar 7424645. llvm-svn: 105559	2010-06-07 21:50:54 +00:00
Dan Gohman	22e1adbb11	Fix this test to work under lit. llvm-svn: 105553	2010-06-07 20:58:11 +00:00
Dan Gohman	fa9ad13002	Run dead type elimination after dead argument elimination. llvm-svn: 105552	2010-06-07 20:28:37 +00:00
Dan Gohman	fb8ed43349	Make bugpoint dead-argument-hacking actually work, and actually test it. llvm-svn: 105551	2010-06-07 20:20:33 +00:00
Dan Gohman	70910a6ab6	Optimize ScalarEvolution's SCEVComplexityCompare predicate: don't go scrounging through SCEVUnknown contents and SCEVNAryExpr operands; instead just do a simple deterministic comparison of the precomputed hash data. Also, since this is more precise, it eliminates the need for the slow N^2 duplicate detection code. llvm-svn: 105540	2010-06-07 19:06:13 +00:00
Kenneth Uildriks	1850444000	Partial specialization was not checking the callsite to make sure it was using the same constants as the specialization, leading to calls to the wrong specialization. Patch by Takumi Nakamura\! llvm-svn: 105528	2010-06-05 14:50:21 +00:00
Chris Lattner	fdd2614330	revert r105521, which is breaking the buildbots with stuff like this: In file included from X86InstrInfo.cpp:16: X86GenInstrInfo.inc:2789: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2790: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2792: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2793: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2808: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2809: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2816: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2817: error: integer constant is too large for 'long' type llvm-svn: 105524	2010-06-05 04:17:30 +00:00
Bruno Cardoso Lopes	594fa26317	Initial AVX support for some instructions. No patterns matched yet, only assembly encoding support. llvm-svn: 105521	2010-06-05 03:53:24 +00:00
Bruno Cardoso Lopes	c4f614870f	Teach tablegen to support 'defm' inside multiclasses. llvm-svn: 105519	2010-06-05 02:11:52 +00:00
Stuart Hastings	3ca391027f	Revert 105492 & 105493 due to a testcase regression. Radar 7424645. llvm-svn: 105511	2010-06-05 00:39:29 +00:00
Dan Gohman	bbfb6aca92	LSR needs to remember inserted instructions even in postinc mode, because there could be multiple subexpressions within a single expansion which require insert point adjustment. This fixes PR7306. llvm-svn: 105510	2010-06-05 00:33:07 +00:00
Devang Patel	3eed2cf587	test case for r105504. Radar 8055687. llvm-svn: 105505	2010-06-04 23:47:41 +00:00
Evan Cheng	a03e6f85fe	Re-apply 105308 with fix. llvm-svn: 105502	2010-06-04 23:28:13 +00:00
Stuart Hastings	7c015988fe	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 105492	2010-06-04 22:36:03 +00:00
Devang Patel	36da24b546	Copy location info for current function argument from dbg.declare if respective store instruction does not have any location info. llvm-svn: 105490	2010-06-04 22:27:30 +00:00
Dale Johannesen	065d6fd537	More tail call removal. llvm-svn: 105485	2010-06-04 21:14:24 +00:00
Dan Gohman	538b413ccb	Fix normalization and de-normalization of non-affine SCEVs. llvm-svn: 105480	2010-06-04 19:16:34 +00:00
Mon P Wang	622cdd2297	Fixed a bug during widening where we would avoid legalizing a node. When we replace an OpA with a widened OpB, it is possible to get new uses of OpA due to CSE when recursively updating nodes. Since OpA has been processed, the new uses are not examined again. The patch checks if this occurred and it it did, updates the new uses of OpA to use OpB. llvm-svn: 105453	2010-06-04 01:20:10 +00:00
Dale Johannesen	b3780b1103	Remove more tail calls. llvm-svn: 105450	2010-06-04 01:01:24 +00:00
Dale Johannesen	e7b392dca9	Remove a tail call, and move some CHECKs to the functions where they belong. llvm-svn: 105449	2010-06-04 01:01:04 +00:00
Dan Gohman	8fdda8a655	This test doesn't need the ssp attribute. llvm-svn: 105440	2010-06-04 00:14:48 +00:00
Dale Johannesen	e288fee959	Remove tail call. A tail call version will follow. llvm-svn: 105438	2010-06-04 00:03:37 +00:00
Dale Johannesen	9f71f7f70c	Remove tail call to preserve this test. A tail call version will follow. llvm-svn: 105422	2010-06-03 21:57:48 +00:00
Dale Johannesen	41528aeb0b	Make this test not use tail calls. A tail call version will follow. llvm-svn: 105419	2010-06-03 21:53:01 +00:00
Dan Gohman	d83e3e7750	Fix SimplifyDemandedBits' AssertZext logic to demand all the bits. It needs to demand the high bits because it's asserting that they're zero. llvm-svn: 105406	2010-06-03 20:21:33 +00:00
Bob Wilson	30093b5d8b	Revert 105308. llvm-svn: 105399	2010-06-03 18:28:31 +00:00
Bill Wendling	f82aea634c	Machine sink could potentially sink instructions into a block where the physical registers it defines then interfere with an existing preg live range. For instance, if we had something like these machine instructions: BB#0 ... = imul ... EFLAGS<imp-def,dead> test ..., EFLAGS<imp-def> jcc BB#2 EFLAGS<imp-use> BB#1 ... ; fallthrough to BB#2 BB#2 ... ; No code that defines EFLAGS jcc ... EFLAGS<imp-use> Machine sink will come along, see that imul implicitly defines EFLAGS, but because it's "dead", it assumes that it can move imul into BB#2. But when it does, imul's "dead" imp-def of EFLAGS is raised from the dead (a zombie) and messes up the condition code for the jump (and pretty much anything else which relies upon it being correct). The solution is to know which pregs are live going into a basic block. However, that information isn't calculated at this point. Nor does the LiveVariables pass take into account non-allocatable physical registers. In lieu of this, we do a very conservative pass through the basic block to determine if a preg is live coming out of it. llvm-svn: 105387	2010-06-03 07:54:20 +00:00
Eric Christopher	f67fe3b1e8	One underscore, not two. llvm-svn: 105379	2010-06-03 04:02:59 +00:00
Eli Friedman	dbbbf73c96	Implement expansion in type legalization for add/sub with overflow. The expansion is the same as that used by LegalizeDAG. The resulting code sucks in terms of performance/codesize on x86-32 for a 64-bit operation; I haven't looked into whether different expansions might be better in general. llvm-svn: 105378	2010-06-03 03:49:50 +00:00
Evan Cheng	a2da22734f	Enable machine cse of instructions which define physical registers. llvm-svn: 105308	2010-06-02 01:08:27 +00:00
Devang Patel	89f2db6b67	DwarfWrite is now smart enough to drop debug value pointing to undefined register. Update this test to avoid this. iSel not properly lowring argument into a well formed DBG_VALUE in some cases is a separate issue and not related to the test in this testcase. llvm-svn: 105295	2010-06-01 23:01:43 +00:00
Devang Patel	b0c76394a3	Keep track of incoming debug value of unused argument. Radar 7927666. llvm-svn: 105285	2010-06-01 19:59:01 +00:00
Dan Gohman	b782caa393	Fill in missing support for ISD::FEXP, ISD::FPOWI, and friends. llvm-svn: 105283	2010-06-01 18:35:14 +00:00
Kalle Raiskila	8916358f97	Fix handling of 'load' nodes. llvm-svn: 105269	2010-06-01 13:34:47 +00:00
Bill Wendling	1a764f93a0	Debreak test for non-Darwin. llvm-svn: 105257	2010-05-31 21:47:24 +00:00
Duncan Sands	4c904fa797	Fix PR7272: when inlining through a callsite with byval arguments, the newly created allocas may be used by inlined calls, so these need to have their tail call flags cleared. Fixes PR7272. llvm-svn: 105255	2010-05-31 21:00:26 +00:00
Eric Christopher	24efc63000	Add a test for the llvm-gcc commit in r90200. llvm-svn: 105253	2010-05-31 20:39:10 +00:00
Chris Lattner	14c46517b5	fix PR6623: when optimizing for size, don't inline memcpy/memsets that are too large. This causes the freebsd bootloader to be too large apparently. It's unclear if this should be an -Os or -Oz thing. Thoughts welcome. llvm-svn: 105228	2010-05-31 17:30:14 +00:00
Chris Lattner	291a189cda	upgrade and filecheckize this test. llvm-svn: 105227	2010-05-31 17:27:17 +00:00
Nick Lewycky	aee2632be3	The memcpy intrinsic only takes i8* for %src and %dst, so cast them to that first. Fixes PR7265. llvm-svn: 105206	2010-05-31 06:16:35 +00:00
Evan Cheng	707b7cc429	Remove schedule-livein-copies. It's not being used. llvm-svn: 105095	2010-05-29 02:23:39 +00:00
Evan Cheng	27c4933e02	Fix PR7193: if sibling call address can take a register, make sure there are enough registers available by counting inreg arguments. llvm-svn: 105092	2010-05-29 01:35:22 +00:00
Evan Cheng	cc2efe11db	Fix some latency computation bugs: if the use is not a machine opcode do not just return zero. llvm-svn: 105061	2010-05-28 23:26:21 +00:00
Dan Gohman	0fa67e479a	Add lint checks for function attributes. llvm-svn: 105009	2010-05-28 21:43:57 +00:00
Kevin Enderby	4c71e08ed8	MC/X86: Add alias for movzx. llvm-svn: 105005	2010-05-28 21:20:21 +00:00
Kevin Enderby	b29228905f	MC/X86: Add alias for fwait. llvm-svn: 105001	2010-05-28 20:59:10 +00:00
Kevin Enderby	76413597a9	Fix the use of x86 control and debug registers so that the assertion failure in getX86RegNum() does not happen. Patch by Shantonu Sen! llvm-svn: 104994	2010-05-28 19:01:27 +00:00
Dale Johannesen	526bd59aaf	Add missing space; works for me. llvm-svn: 104992	2010-05-28 18:45:59 +00:00
Dan Gohman	c575ec61ea	Fix lint's memcpy and memmove checks, and its basic block traversal. llvm-svn: 104970	2010-05-28 17:44:00 +00:00
Jakob Stoklund Olesen	2085089c49	Fix more tests that depended on the default register allocator choice. llvm-svn: 104961	2010-05-28 17:06:30 +00:00
Dan Gohman	862f034188	Detect self-referential values. llvm-svn: 104957	2010-05-28 16:45:33 +00:00
Dan Gohman	672393f6c7	Remove this va_arg test, which is no longer applicable. llvm-svn: 104956	2010-05-28 16:44:04 +00:00
Stuart Hastings	c1e216583f	Revert 104841, 104842, 104876 due to buildbot failures. Radar 7424645. llvm-svn: 104953	2010-05-28 16:41:07 +00:00
Dan Gohman	cef9fc37f4	Eli pointed out that va_arg instruction result values don't reference the stack. llvm-svn: 104951	2010-05-28 16:34:49 +00:00
Dan Gohman	54d7aaa819	Teach lint how to look through simple store+load pairs and other effective no-op constructs, to make it more effective on unoptimized IR. llvm-svn: 104950	2010-05-28 16:21:24 +00:00
Dan Gohman	df5d7dcef1	Teach instcombine to promote alloca array sizes. llvm-svn: 104945	2010-05-28 15:09:00 +00:00
Dan Gohman	71505aa4de	Add a testcase for getelementptr index promotion. llvm-svn: 104944	2010-05-28 15:07:59 +00:00
Dan Gohman	ddba4b725a	Add a lint check for returning the address of stack memory. llvm-svn: 104936	2010-05-28 04:33:42 +00:00
Dan Gohman	2140a74979	Eliminate the restriction that the array size in an alloca must be i32. This will help reduce the amount of casting required on 64-bit targets. llvm-svn: 104911	2010-05-28 01:14:11 +00:00
Jakob Stoklund Olesen	b613ae2c89	Add a -regalloc=default option that chooses a register allocator based on the -O optimization level. This only really affects llc for now because both the llvm-gcc and clang front ends override the default register allocator. I intend to remove that code later. llvm-svn: 104904	2010-05-27 23:57:25 +00:00
Evan Cheng	3d3ee87d4e	llvm can't correctly support 'H', 'Q' and 'R' modifiers. Just mark it an error. llvm-svn: 104891	2010-05-27 22:08:38 +00:00
Kevin Enderby	9738f64bd9	MC/X86: Add aliases for Jcc variants. llvm-svn: 104890	2010-05-27 21:33:19 +00:00
Devang Patel	7a9dedf0ab	Do not drop location info for inlined function args. llvm-svn: 104884	2010-05-27 20:25:04 +00:00
Stuart Hastings	bf132360a8	Adjust test case for lexical block pruning. Follow-on to r104842 and Radar 7424645. llvm-svn: 104876	2010-05-27 19:57:51 +00:00
Devang Patel	91ad65e8b7	Let's try one more time to match patterns. The goal is to match following 3 lines. In otherwords, a temp. label between to DEBUG_VALUE comments. ;DEBUG_VALUE: bar:x <- undef ## 2010-01-18-Inlined-Debug.c:7 Ltmp1: ;DEBUG_VALUE: foo:__x <- undef ## 2010-01-18-Inlined-Debug.c:5 llvm-svn: 104872	2010-05-27 19:46:38 +00:00
Duncan Sands	f162eace49	Teach instCombine to remove malloc+free if malloc's only uses are comparisons to null. Patch by Matti Niemenmaa. llvm-svn: 104871	2010-05-27 19:09:06 +00:00
Devang Patel	da01e5e907	Temp. labels number may not match for all configurations. llvm-svn: 104858	2010-05-27 17:51:08 +00:00
Devang Patel	5e6b71ce34	inlined function's arguments need a label to mark the start point because they are not directly attached to current function. llvm-svn: 104848	2010-05-27 16:47:30 +00:00
Stuart Hastings	8e99e50d08	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 104841	2010-05-27 16:16:54 +00:00
Gabor Greif	38303d7e3b	rename test to represent meaningful date llvm-svn: 104831	2010-05-27 09:32:38 +00:00
Bob Wilson	ebdc772457	Add a test for llvm-gcc svn r104726. llvm-svn: 104805	2010-05-27 05:30:36 +00:00
Eric Christopher	8ae57895f5	Add a quick test of relocations. llvm-svn: 104794	2010-05-27 00:53:40 +00:00
Devang Patel	6b9a9fe207	Simplify. Eliminate unneeded debug_loc entry. llvm-svn: 104785	2010-05-26 23:55:23 +00:00
Dan Gohman	a20a5cd24f	Reinstate checking of stackrestore, with checking for both Read and Write, and add a comment explaining this. llvm-svn: 104756	2010-05-26 22:21:25 +00:00
Dan Gohman	1249adf160	Implement checking of the tail keyword. llvm-svn: 104744	2010-05-26 21:46:36 +00:00
Devang Patel	1b08572a66	Update debug info when live-in reg is copied into a vreg. llvm-svn: 104732	2010-05-26 20:18:50 +00:00
Kevin Enderby	70e34983e8	Fix the x86 move to/from segment register instructions. llvm-svn: 104731	2010-05-26 20:10:45 +00:00
Dale Johannesen	053dd21c84	Testcase for 104624/104619/PR7191/8023512. Reduced from one provided by Duncan Sands, thanks! llvm-svn: 104710	2010-05-26 17:55:45 +00:00
Devang Patel	9fc11706e3	First cut at supporting .debug_loc section. This is used to track variable information. llvm-svn: 104649	2010-05-25 23:40:22 +00:00
Benjamin Kramer	9439084cea	Properly promote operands when optimizing a single-character memcmp. llvm-svn: 104648	2010-05-25 22:53:43 +00:00
Eric Christopher	19a4b843cc	Add support for initialized global data for darwin tls. Update comments and testcases accordingly. llvm-svn: 104635	2010-05-25 21:28:50 +00:00
Kevin Enderby	492d4f409a	Changed the encoding of X86 floating point stack operations where both operands are st(0). These can be encoded using an opcode for storing in st(0) or using an opcode for storing in st(i), where i can also be 0. To allow testing with the darwin assembler and get a matching binary the opcode for storing in st(0) is now used. To do this the same logical trick is use from the darwin assembler in converting things like this: fmul %st(0), %st into this: fmul %st(0) by looking for the second operand being X86::ST0 for specific floating point mnemonics then removing the second X86::ST0 operand. This also has the add benefit to allow things like: fmul %st(1), %st that llvm-mc did not assemble. llvm-svn: 104634	2010-05-25 20:52:34 +00:00
Dale Johannesen	cd4ba6caba	Removing test; Chris thinks it's better to have the bug go untested than have a testcase this large. So be it. llvm-svn: 104632	2010-05-25 20:40:10 +00:00
Daniel Dunbar	0e767d7364	MC/X86: Add a hack to allow recognizing 'cmpltps' and friends. llvm-svn: 104626	2010-05-25 19:49:32 +00:00
Dale Johannesen	60fe2cdc4f	Fix another variant of PR 7191. Also add a testcase Mon Ping provided; unfortunately bugpoint failed to reduce it, but I think it's important to have a test for this in the suite. 8023512. llvm-svn: 104624	2010-05-25 18:47:23 +00:00
Daniel Dunbar	4a5b2c597b	MC/X86: Define explicit immediate forms of cmp{ss,sd,ps,pd}. llvm-svn: 104622	2010-05-25 18:40:53 +00:00
Kevin Enderby	c798965e63	The BT64ri8 record in X86Instr64bit.td was missing a REX_W which is required for the 64-bit version of the Bit Test instruction. llvm-svn: 104621	2010-05-25 18:16:58 +00:00
Eric Christopher	f6562d35ac	Make sure aeskeygenassist uses an unsigned immediate field. Fixes rdar://8017638 llvm-svn: 104617	2010-05-25 17:33:22 +00:00
Dan Gohman	79b6a0f140	Fix an mmx movd encoding. llvm-svn: 104552	2010-05-24 20:51:08 +00:00
Kevin Enderby	dc71cc794b	MC/X86: Add aliases for CMOVcc variants. llvm-svn: 104549	2010-05-24 20:32:23 +00:00
Bob Wilson	3eb7691858	Thumb2 RSBS instructions were being printed without the 'S' suffix. Fix it by changing the T2I_rbin_s_is multiclass to handle the CPSR output and 'S' suffix in the same way as T2I_bin_s_irs. llvm-svn: 104531	2010-05-24 18:44:06 +00:00
Evan Cheng	755d45be43	LR is in GPR, not tGPR even in Thumb1 mode. llvm-svn: 104518	2010-05-24 18:00:18 +00:00
Daniel Dunbar	b52fcd6304	MC/X86: Subdivide immediates a bit more, so that we properly recognize immediates based on the width of the target instruction. For example: addw $0xFFFF, %ax should match the same as addw $-1, %ax but we used to match it to the longer encoding. llvm-svn: 104453	2010-05-22 21:02:33 +00:00
Daniel Dunbar	d459e29a0a	MC/X86: Add alias for setz, setnz, jz, jnz. llvm-svn: 104435	2010-05-22 06:37:33 +00:00
Evan Cheng	168ced94d8	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Eric Christopher	64087cd346	This test is darwin only. Make it so(tm). llvm-svn: 104418	2010-05-22 00:55:55 +00:00
Bob Wilson	91fdf68516	Recognize more BUILD_VECTORs and VECTOR_SHUFFLEs that can be implemented by copying VFP subregs. This exposed a bunch of dead code in the *spill-q.ll tests, so I tweaked those tests to keep that code from being optimized away. Radar 7872877. llvm-svn: 104415	2010-05-22 00:23:12 +00:00
Eric Christopher	6fdea1bda8	Add full bss data support for darwin tls variables. llvm-svn: 104414	2010-05-22 00:10:22 +00:00
Kevin Enderby	7e7482c80f	Added retl for 32-bit x86 and added retq for 64-bit x86. llvm-svn: 104394	2010-05-21 23:01:38 +00:00
Bob Wilson	51d9ee3ff6	Change CodeGen/ARM/2009-11-02-NegativeLane.ll to use 16-bit vector elements so that it will continue to test what it was meant to test when I commit a separate change for better support of BUILD_VECTOR and VECTOR_SHUFFLE for Neon. Fix a DAG combiner crash exposed by this test change. llvm-svn: 104380	2010-05-21 21:05:32 +00:00
Chris Lattner	0735ecfe17	now that fp reg kill insertion stuff happens as a separate pass after isel instead of being interlaced with it, we can trust that all the code for a function has been isel'd before it is run. The practical impact of this is that we can scan for machine instr phis instead of doing a fuzzy match on the LLVM BB for phi nodes. Doing the fuzzy match required knowing when isel would produce an fp reg stack phi which was gross. It was also wrong in cases where select got lowered to a branch tree because cmovs aren't available (PR6828). Just do the scan on machine phis which is simpler, faster and more correct. This fixes PR6828. llvm-svn: 104333	2010-05-21 18:17:54 +00:00
Jakob Stoklund Olesen	a648c6a757	Teach VirtRegRewriter to handle spilling in instructions that have multiple definitions of the virtual register. This happens when spilling the registers produced by REG_SEQUENCE: %reg1047:5<def>, %reg1047:6<def>, %reg1047:7<def> = VLD3d8 %reg1033, 0, pred:14, pred:%reg0 The rewriter would spill the register multiple times, dead store elimination tried to keep up, but ended up cutting the branch it was sitting on. llvm-svn: 104321	2010-05-21 16:36:13 +00:00
Dale Johannesen	b3b9c8ac48	Fix i64->f64 conversion, x86-64, -no-sse. A bit tricky since there's a 3rd 64-bit type, MMX vectors. PR 7135. llvm-svn: 104308	2010-05-21 00:52:33 +00:00
Evan Cheng	34c260458a	Change ARM scheduling default to list-hybrid if the target supports floating point instructions (and is not using soft float). llvm-svn: 104307	2010-05-21 00:43:17 +00:00
Daniel Dunbar	baf2eea6f4	MC/X86: Add movq alias for movabsq, to allow matching 64-bit immediates with movq. llvm-svn: 104275	2010-05-20 20:36:29 +00:00
Dan Gohman	ee2fea3cd7	When canonicalizing icmp operand order to put the loop invariant operand on the left, the interesting operand is on the right. This fixes a bug where LSR was failing to recognize ICmpZero uses, which led it to be unable to reverse the induction variable in the attached testcase. Delete test/CodeGen/X86/stack-color-with-reg-2.ll, because its test is extremely fragile and hard to meaningfully update. llvm-svn: 104262	2010-05-20 19:26:52 +00:00
Bob Wilson	5954994bba	Handle Neon v2f64 and v2i64 vector shuffles as register copies. This fixes the remaining issue with pr7167. llvm-svn: 104257	2010-05-20 18:39:53 +00:00
Dan Gohman	29790edb93	Fix assembly parsing and encoding of the pushf and popf family of instructions. llvm-svn: 104231	2010-05-20 16:16:00 +00:00
Dan Gohman	1e19eab963	Define the x86 pause instruction. llvm-svn: 104204	2010-05-20 01:35:50 +00:00
Dan Gohman	a3b7570a3a	Fix the sfence instruction to use MRM_F8 instead of MRM7r, since it doesn't have a register operand. Also, use I instead of PSI, for consistency with mfence and lfence. llvm-svn: 104203	2010-05-20 01:23:41 +00:00
Bill Wendling	de852faef9	Match "4" or "8" depending upon if it's 32- or 64-bit. llvm-svn: 104196	2010-05-20 00:27:10 +00:00
Eric Christopher	4b4446be7c	Once more, with feeling. llvm-svn: 104190	2010-05-20 00:07:13 +00:00
Dan Gohman	20fab456da	Teach LSR how to cope better with unrolled loops on targets where the addressing modes don't make this trivially easy. This allows it to avoid falling into the less precise heuristics in more cases. llvm-svn: 104186	2010-05-19 23:43:12 +00:00
Chris Lattner	7cbfa4462f	fix rdar://7986634 - match instruction opcodes case insensitively. llvm-svn: 104183	2010-05-19 23:34:33 +00:00
Bill Wendling	1c4687e350	Testcase for r104181. llvm-svn: 104182	2010-05-19 23:33:26 +00:00
Eric Christopher	63476ddae6	A more combo tls testcase. llvm-svn: 104163	2010-05-19 21:19:42 +00:00
Eric Christopher	b95493c495	Few more simple tls testcases. llvm-svn: 104148	2010-05-19 20:35:15 +00:00
Jakob Stoklund Olesen	e11cdf8cc8	TwoAddressInstructionPass doesn't really know how to merge live intervals when lowering REG_SEQUENCE instructions. Insert copies for REG_SEQUENCE sources not killed to avoid breaking later passes. llvm-svn: 104146	2010-05-19 20:08:00 +00:00
Eric Christopher	6304da132f	Attempt to run this test on x86 only. llvm-svn: 104143	2010-05-19 18:59:37 +00:00
Bob Wilson	f070b1b571	Testcase to go with 104141. llvm-svn: 104142	2010-05-19 18:58:37 +00:00
Evan Cheng	daeca2d156	t2LEApcrel and tLEApcrel are re-materializable. This makes it possible to hoist more loads during machine LICM. llvm-svn: 104115	2010-05-19 07:28:01 +00:00
Evan Cheng	abd0ad54a4	Intrinsics which do a vector compare (results are all zero or all ones) are modeled as icmp / fcmp + sext. This is turned into a vsetcc by dag combine (yes, not a good long term solution). The targets can then isel the vsetcc to the appropriate instruction. The trouble arises when the result of a vector cmp + sext is then and'ed with all ones. Instcombine will turn it into a vector cmp + zext, dag combiner will miss turning it into a vsetcc and hell breaks loose after that. Teach dag combine to turn a vector cpm + zest into a vsetcc + and 1. This fixes rdar://7923010. llvm-svn: 104094	2010-05-19 01:08:17 +00:00
Eric Christopher	c09d5a29d8	Add a test to make sure that we're lowering the shift amount correctly. llvm-svn: 104090	2010-05-19 00:22:04 +00:00
Jakob Stoklund Olesen	430b6e40ab	Remember to update VirtRegLastUse when spilling without killing before a call. llvm-svn: 104074	2010-05-18 22:20:09 +00:00
Dan Gohman	887dd1cd31	When converting a test to a cmp to fold a load, use the cmp that has an 8-bit immediate field rather than one with a wider immediate field. llvm-svn: 104064	2010-05-18 21:42:03 +00:00
Eric Christopher	7f173d1d27	Quick test to make sure we're emitting the tbss section correctly. llvm-svn: 104063	2010-05-18 21:40:20 +00:00
Evan Cheng	f19384d54a	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Dale Johannesen	6338d15939	Test passed on ppc, to my surprise; if it worked there it may work everywhere... llvm-svn: 104053	2010-05-18 20:47:04 +00:00
Evan Cheng	e7fc64a5c9	Fix PR7162: Use source register classes and sub-indices to determine the correct register class of the definitions of REG_SEQUENCE. llvm-svn: 104050	2010-05-18 20:03:28 +00:00
Dale Johannesen	fb7df5317a	Testcase for llvm-gcc checkin 104042. llvm-svn: 104043	2010-05-18 19:03:51 +00:00
Kevin Enderby	53e0631516	Fixed the problem with a branch to "0b" that was not parsed by llvm-mc correctly. The Lexer was incorrectly eating the newline casusing it to branch to address 0. Updated the test case to use a "0:" label and a branch to "0b". llvm-svn: 104038	2010-05-18 17:51:35 +00:00
Daniel Dunbar	d5563f420a	MC/Mach-O: Implement support for setting indirect symbol table offset in section header. Also, create symbol data for LHS of assignment, to match 'as' symbol ordering better. llvm-svn: 104033	2010-05-18 17:28:24 +00:00
Daniel Dunbar	a4820fcc78	MC/X86: Implement custom lowering to make sure we match things like X86::ADC32ri $0, %eax to X86::ADC32i32 $0 llvm-svn: 104030	2010-05-18 17:22:24 +00:00
Evan Cheng	48f0de96d6	FIX PR7158. SimplifyVBinOp was asserting when it fails to constant fold (op (build_vector), (build_vector)). llvm-svn: 104004	2010-05-18 00:03:40 +00:00
Evan Cheng	1e4f55200d	Fix PR7175. Insert copies of a REG_SEQUENCE source if it is used by other REG_SEQUENCE instructions. llvm-svn: 103994	2010-05-17 23:24:12 +00:00
Kevin Enderby	0510b48fd9	Added support in MC for Directional Local Labels. llvm-svn: 103989	2010-05-17 23:08:19 +00:00
Eric Christopher	9635b3da6b	More data/parsing support for tls directives. Add a few more testcases and cleanup comments as well. llvm-svn: 103985	2010-05-17 22:53:55 +00:00
Evan Cheng	f2c9a96f3c	Fix PR7156. If the sources of a REG_SEQUENCE are all IMPLICIT_DEF's. Replace it with an IMPLICIT_DEF rather than deleting it or else it would be left without a def. llvm-svn: 103984	2010-05-17 22:09:49 +00:00
Daniel Dunbar	bb166bed40	MC/Mach-O/x86: Optimal nop sequences should only be used for the .text sections, not all sections in the text segment. llvm-svn: 103981	2010-05-17 21:54:30 +00:00
Daniel Dunbar	b7b796cc11	MC/Mach-O: Reverse order of SymbolData scanning when emitting instructions. - This fixes a string table mismatch with 'as' when two new symbols are defined in a single instruction. llvm-svn: 103979	2010-05-17 21:19:59 +00:00
Evan Cheng	29c463862e	Careful with reg_sequence coalescing to not to overwrite sub-register indices. llvm-svn: 103971	2010-05-17 20:57:12 +00:00
Daniel Dunbar	0211a96989	MC/Mach-O: Fix some differences in symbol flag handling. - Don't clear weak reference flag, 'as' was only "trying" to do this, it wasn't actually succeeding. - Clear the "lazy bound" bit when we mark something external. This corresponds roughly to the lazy clearing of the bit that 'as' implements in symbol_table_lookup. - The exact meaning of these flags appears pretty loose, since 'as' isn't very consistent. For now we just try to match 'as', we will clean this up one day hopefully. llvm-svn: 103964	2010-05-17 20:12:31 +00:00
Evan Cheng	3d98b996ff	Turn on -neon-reg-sequence by default. Using NEON load / store multiple instructions will no longer create gobs of vmov of D registers! llvm-svn: 103960	2010-05-17 19:51:20 +00:00
Daniel Dunbar	9b4a824217	llvm-mc: Support reassignment of variables in one special case, when the variable has not yet been used in an expression. This allows us to support a few cases that show up in real code (mostly because gcc generates it for Objective-C on Darwin), without giving up a reasonable semantic model for assignment. llvm-svn: 103950	2010-05-17 17:46:23 +00:00
Jakob Stoklund Olesen	176a9c4272	Avoid allocating the same physreg to multiple virtregs in one instruction. While that approach works wonders for register pressure, it tends to break everything. This should unbreak the arm-linux builder and fix a number of miscompilations. llvm-svn: 103946	2010-05-17 17:18:59 +00:00
Jakob Stoklund Olesen	7d22a81b61	Only use clairvoyance when defining a register, and then only if it has one use. This makes allocation independent on the ordering of use-def chains. llvm-svn: 103935	2010-05-17 04:50:57 +00:00
Eric Christopher	68b1bbe66a	Assume that we'll handle mangling the symbols earlier and just put the symbol to the file as we have it. Simplifies out tbss handling. llvm-svn: 103928	2010-05-17 02:13:02 +00:00
Dale Johannesen	f92c344167	Removing as part of previous reversion. llvm-svn: 103915	2010-05-16 20:19:40 +00:00
Dale Johannesen	2ef974ee0e	Revert 103911; it broke a test that expects bitconvert <1xi64> -> i64 to work in MMX registers on hosts where -no-sse is the default (not mine). The right thing is to accept this and make i64->f64 conversions go through memory, but I don't have time right now. llvm-svn: 103914	2010-05-16 20:19:04 +00:00
Dale Johannesen	fc1492d71b	Make x86-64 64-bit bitconvert work when SSE is not available. (This worked as of about 6 months ago and I didn't track down exactly what broke it; I think this fix is appropriate.) llvm-svn: 103911	2010-05-16 18:22:38 +00:00
Anton Korobeynikov	8f35fabbc1	Add support for thiscall calling convention. Patch by Charles Davis and Steven Watanabe! llvm-svn: 103902	2010-05-16 09:08:45 +00:00
Anton Korobeynikov	1bf28a128b	Some cheap DAG combine goodness for multiplication with a particular constant. This can be extended later on to handle more "complex" constants. llvm-svn: 103881	2010-05-15 18:16:59 +00:00
Evan Cheng	4cad68eb34	Allow TargetLowering::getRegClassFor() to be called on illegal types. Also allow target to override it in order to map register classes to illegal but synthesizable types. e.g. v4i64, v8i64 for ARM / NEON. llvm-svn: 103854	2010-05-15 02:18:07 +00:00
Bill Wendling	0160e55893	SystemZ really does mean "has calls" and not just "adjusts stack." Go ahead and replace the check with the appropriate predicate. Modify the testcase to reflect the correct code. (It should be saving callee-saved registers on the stack allocated by the calling fuction.) llvm-svn: 103829	2010-05-14 22:17:42 +00:00
Devang Patel	c87e867111	Test case for r103800. llvm-svn: 103801	2010-05-14 21:04:45 +00:00
Kevin Enderby	7bc111f5a9	Fix so "int3" is correctly accepted, added "into" and fixed "int" with an argument, like "int $4", to not get an Assertion error. llvm-svn: 103791	2010-05-14 19:16:02 +00:00
Daniel Dunbar	2493ddfe42	MC/Mach-O/x86_64: Darwin's special "signed_N" relocation types should only be used to replace a normal relocation, not a reference to a GOT entry. llvm-svn: 103789	2010-05-14 18:53:40 +00:00
Jakob Stoklund Olesen	4d5c1061e3	Simplify the handling of physreg defs and uses in RegAllocFast. This adds extra security against using clobbered physregs, and it adds kill markers to physreg uses. llvm-svn: 103784	2010-05-14 18:03:25 +00:00
Daniel Dunbar	148e876ac2	XFAIL the test I added with vg_leak, apparently it is the first and only llc -filetype=obj test, and -filetype=obj leaks a few objects. Added a FIXME, we need to sort out the ownership model for the various MC objects. llvm-svn: 103769	2010-05-14 07:47:51 +00:00
Daniel Dunbar	3439ed6324	Inline Asm: Ensure buffer is newline terminated to match how the text is printed. - This is a hack, but I can't decide the best place to handle this. Chris? llvm-svn: 103765	2010-05-14 04:31:50 +00:00
Eric Christopher	9fb6bb07ca	Add AsmParser support for darwin tbss directive. Nothing uses this yet. llvm-svn: 103757	2010-05-14 01:50:28 +00:00
Nick Lewycky	23b545ca4b	Actually run the test. Thanks Daniel Dunbar! llvm-svn: 103720	2010-05-13 17:41:06 +00:00
Nick Lewycky	3230f0ac25	Add testcase for r103653. llvm-svn: 103699	2010-05-13 06:00:14 +00:00
Daniel Dunbar	e35c88d5ad	MC/Mach-O: Add another zerofill test to improve coverage. llvm-svn: 103691	2010-05-13 01:10:28 +00:00
Jakob Stoklund Olesen	0ba2e2a568	Take allocation hints from copy instructions to/from physregs. This causes way more identity copies to be generated, ripe for coalescing. llvm-svn: 103686	2010-05-13 00:19:43 +00:00
Chris Lattner	8cb4728a15	fix rdar://7965971 and a fixme: use ParseIdentifier in ParseDirectiveDarwinZerofill instead of hard coding the check for identifier. This allows quoted symbol names to be used. llvm-svn: 103682	2010-05-13 00:10:34 +00:00
Chris Lattner	9efef006cf	reapply r103668 with a fix. Never make "minor syntax changes" after testing before committing. llvm-svn: 103681	2010-05-13 00:02:47 +00:00
Chris Lattner	e354235512	revert r103668 for now, it is apparently breaking things. llvm-svn: 103677	2010-05-12 23:40:59 +00:00
Chris Lattner	a6df4650fd	moffset forms of moves are x86-32 only, make the parser lower them to the correct x86-64 instructions since we don't have a clean way to handle this in td files yet. rdar://7947184 llvm-svn: 103668	2010-05-12 23:13:36 +00:00
Chris Lattner	e132b0a92c	fix the encoding of the obscure "moffset" forms of moves, i386 part first. rdar://7947184 llvm-svn: 103660	2010-05-12 22:48:24 +00:00
Jakob Stoklund Olesen	955a0e71e9	Make sure to add kill flags to the last use of a virtreg when it is redefined. The X86 floating point stack pass and others depend on good kill flags. llvm-svn: 103635	2010-05-12 18:46:03 +00:00
Devang Patel	0bcbcbd23e	Test case for r103633. llvm-svn: 103634	2010-05-12 18:31:04 +00:00
Dale Johannesen	352117adf5	Testcase for llvm 103572 (7898991). llvm-svn: 103574	2010-05-12 05:04:20 +00:00
Daniel Dunbar	059379a9d7	MC/X86: Extend suffix matching hack to match 'q' suffix. llvm-svn: 103535	2010-05-12 00:54:20 +00:00
Daniel Dunbar	ba2f4c3884	MC/Mach-O/x86_64: Add a new hook for checking whether a particular section can be diced into atoms, and adjust getAtom() to take this into account. - This fixes relocations to symbols in fixed size literal sections, for example. llvm-svn: 103532	2010-05-12 00:38:17 +00:00
Jakob Stoklund Olesen	e6e39dc310	Enable a bunch more -regalloc=fast tests llvm-svn: 103531	2010-05-12 00:11:24 +00:00
Daniel Dunbar	53ce0e12d8	MC/Mach-O/x86_64: Fix PCrel adjustment for x86_64, which was using the fixup offset instead of the fixup address as intended. llvm-svn: 103527	2010-05-11 23:53:11 +00:00
Jakob Stoklund Olesen	132668102e	Keep track of the last place a live virtreg was used. This allows us to add accurate kill markers, something the scavenger likes. Add some more tests from ARM that needed this. llvm-svn: 103521	2010-05-11 23:24:45 +00:00
Jakob Stoklund Olesen	84c881e593	One more -regalloc=fast test llvm-svn: 103509	2010-05-11 20:51:07 +00:00
Jakob Stoklund Olesen	3f0241e0f9	Simplify the tracking of used physregs to a bulk bitor followed by a transitive closure after allocating all blocks. Add a few more test cases for -regalloc=fast. llvm-svn: 103500	2010-05-11 20:30:28 +00:00
Jakob Stoklund Olesen	f1b3029a54	Mostly rewrite RegAllocFast. Sorry for the big change. The path leading up to this patch had some TableGen changes that I didn't want to commit before I knew they were useful. They weren't, and this version does not need them. The fast register allocator now does no liveness calculations. Instead it relies on kill flags provided by isel. (Currently those kill flags are also ignored due to isel bugs). The allocation algorithm is supposed to work with any subset of valid kill flags. More kill flags simply means fewer spills inserted. Registers are allocated from a working set that contains no aliases. That means most allocations can be done directly without expensive alias checks. When the working set runs out of registers we do the full alias check to find new free registers. llvm-svn: 103488	2010-05-11 18:54:45 +00:00
Daniel Dunbar	3937e28da0	MC/Mach-O x86_64: Switch to using fragment atom symbol. - This eliminates getAtomForAddress() (which was a linear search) and simplifies getAtom(). - This also fixes some correctness problems where local labels at the same address as non-local labels could be assigned to the wrong atom. llvm-svn: 103480	2010-05-11 17:22:50 +00:00
Kalle Raiskila	9dd3ef8d01	Make SPU backend not assert on jump tables. llvm-svn: 103466	2010-05-11 11:00:02 +00:00
Evan Cheng	2fa5a7e7e4	Select @llvm.trap to the special B with 1111 condition (i.e. trap) instruction. llvm-svn: 103459	2010-05-11 07:26:32 +00:00
Daniel Dunbar	75778984f9	MC/Mach-O: Fix another mismatch with .weak_definition, we shouldn't use a scattered relocation entry with a .weak_definition. llvm-svn: 103443	2010-05-10 23:15:20 +00:00
Devang Patel	1a0df9a80e	Enable multiple Compile Units in one module. This means now 'llvm-ld a.bc b.bc' will preserve debug info appropriately. llvm-svn: 103439	2010-05-10 22:49:55 +00:00
Chris Lattner	d86a5a5e45	this really is needed. :( llvm-svn: 103434	2010-05-10 21:23:48 +00:00
Chris Lattner	ba44bf052a	just remove this, it isn't needed. llvm-svn: 103432	2010-05-10 21:01:47 +00:00
Chris Lattner	58aff8fb57	fix PR7105 by enumerating MDNodes on all @llvm.foo function calls, not just recognized intrinsics. llvm-svn: 103428	2010-05-10 20:53:17 +00:00
Chris Lattner	05b4caff3e	fix a pretty obvious typo. We test things before committing them, right? llvm-svn: 103427	2010-05-10 20:51:06 +00:00
David Greene	103d4b43e9	Fix PR6875: This includes a patch by Roman Divacky to fix the initial crash. Move the actual addition of passes from PassManager::add to PassManager::addImpl. That way, when adding printer passes we won't recurse infinitely. Finally, check to make sure that we are actually adding a FunctionPass to a FunctionPassManager before doing a print before or after it. Immutable passes are strange in this way because they aren't FunctionPasses yet they can be and are added to the FunctionPassManager. llvm-svn: 103425	2010-05-10 20:24:27 +00:00
Evan Cheng	02947a4551	Be careful with operand promotion. For a binary operation, the source operands may be the same. PR7018. rdar://7939869. llvm-svn: 103419	2010-05-10 19:03:57 +00:00
Devang Patel	fbc75d039a	Test case for 103414. llvm-svn: 103415	2010-05-10 17:49:40 +00:00
Kalle Raiskila	92ea401d8f	Fix encoding of 'sf' and 'sfh' instructions. llvm-svn: 103399	2010-05-10 08:13:49 +00:00
Chris Lattner	84d4618659	make simplifycfg insert an llvm.trap before the 'unreachable' it introduces when it detects undefined behavior. llvm.trap generally codegens into some thing really small (e.g. a 2 byte ud2 instruction on x86) and debugging this sort of thing is "nontrivial". For example, we now compile: void foo() { (int)0 = 42; } into: _foo: pushl %ebp movl %esp, %ebp ud2 Some may even claim that this is a security hole, though that seems dubious to me. This addresses rdar://7958343 - Optimizing away null dereference potentially allows arbitrary code execution llvm-svn: 103356	2010-05-08 22:15:59 +00:00
Chris Lattner	02b0df5338	Teach instcombine to transform a bitcast/(zext\|trunc)/bitcast sequence with a vector input and output into a shuffle vector. This sort of sequence happens when the input code stores with one type and reloads with another type and then SROA promotes to i96 integers, which make everyone sad. This fixes rdar://7896024 llvm-svn: 103354	2010-05-08 21:50:26 +00:00
Chris Lattner	5a62d6e578	Fix PR7052, patch by Jakub Staszak! llvm-svn: 103347	2010-05-08 20:01:44 +00:00
Bill Wendling	cd476b6760	Readd testcase. llvm-svn: 103335	2010-05-08 04:47:54 +00:00
Dan Gohman	d0800241d2	When pruning candidate formulae out of an LSRUse, update the LSRUse's Regs set after all pruning is done, rather than trying to do it on the fly, which can produce an incomplete result. This fixes a case where heuristic pruning was stripping all formulae from a use, which led the solver to enter an infinite loop. Also, add a few asserts to diagnose this kind of situation. llvm-svn: 103328	2010-05-07 23:36:59 +00:00
Bill Wendling	6b5897b4de	Remove. Don't XFAIL. llvm-svn: 103321	2010-05-07 23:09:17 +00:00
Bill Wendling	32d8981ec0	Temorarily revert r101984. llvm-svn: 103314	2010-05-07 22:45:36 +00:00
Dan Gohman	7de01ec2c9	SDDbgValues are apparently not being legalized. Fix a symptom of the problem, and not the real problem itself, by dropping debug info for i128 values. rdar://7958162. llvm-svn: 103310	2010-05-07 22:19:08 +00:00
Kevin Enderby	51bed9c870	Fix i386 relocations to Weak Definitions. The relocation entries should be external and the item to be relocated should not have the address of the symbol added in. llvm-svn: 103302	2010-05-07 21:44:23 +00:00
Dale Johannesen	51c1695a0a	Fix PR 7087, and probably other things, by extending getConstantFP to accept the two supported long double target types. This was not the original intent, but there are other places that assume this works and it's easy enough to do. llvm-svn: 103299	2010-05-07 21:35:53 +00:00
Devang Patel	be8ee1a09e	Update test to use valid debug info. llvm-svn: 103287	2010-05-07 20:34:00 +00:00
Jim Grosbach	2a41cad900	Clean up the conditional for handling of sign_extend_inreg based on whether the extract instructions are available. rdar://7956878 llvm-svn: 103277	2010-05-07 18:34:55 +00:00
Duncan Sands	ebf838274f	Correct some bogus target triples. llvm-svn: 103265	2010-05-07 17:03:48 +00:00
Dan Gohman	5d5b8b1b8c	Add an LLVM IR version of code sinking. This uses the same simple algorithm as MachineSink, but it isn't constrained by MachineInstr-level details. llvm-svn: 103257	2010-05-07 15:40:13 +00:00
Nick Lewycky	45f530db39	Revert r103133 and add testcase from PR7066. llvm-svn: 103233	2010-05-07 01:45:38 +00:00
Dale Johannesen	bbfa3067bd	Adjust tests affected by llvm-gcc 103229. All results here match gcc-4.2. llvm-svn: 103230	2010-05-07 01:11:31 +00:00
Dan Gohman	7421ae48bf	Disable the new unknown-location code for now. It causes a major increase in the debug line info section, and it's causing regressions in a gdb testsuite. llvm-svn: 103226	2010-05-07 01:08:53 +00:00
Daniel Dunbar	21aa523c28	MC/X86: X86AbsMemAsmOperand is subclass of X86NoSegMemAsmOperand. - This fixes "leal 0, %eax", for example. llvm-svn: 103205	2010-05-06 22:39:14 +00:00
Chris Lattner	348dc9b15a	fix rdar://7947167 - llvm-mc doesn't match movsq llvm-svn: 103199	2010-05-06 21:48:14 +00:00
Sean Callanan	e7e1cf9fbd	Eliminated the classification of control registers into %ecr_ and %rcr_, leaving just %cr_ which is what people expect. Updated the disassembler to support this unified register set. Added a testcase to verify that the registers continue to be decoded correctly. llvm-svn: 103196	2010-05-06 20:59:00 +00:00
Dan Gohman	779c69bbc5	Add a DebugLoc argument to TargetInstrInfo::copyRegToReg, so that it doesn't have to guess. llvm-svn: 103194	2010-05-06 20:33:48 +00:00
Dan Gohman	cb4e3e51a9	Add a testcase for r103135, explicitly representing unknown locations in debug line info. llvm-svn: 103189	2010-05-06 17:49:17 +00:00
Daniel Dunbar	b0ceb764b8	Revert r103137, fix for $ in labels. It looks like we can't actually handle this at the token level. Consider the following horrible test case: a = 1 .globl $a movl ($a), %eax movl $a, %eax movl $$a, %eax llvm-svn: 103178	2010-05-06 14:46:38 +00:00
Chris Lattner	35096e82c5	Fix PR7054 - Assertion `Symbol->isUndefined() && "Cannot define a symbol twice!"' failed. Users can write broken code that emits the same label twice with asm renaming, detect this and emit a fatal backend error instead of aborting. llvm-svn: 103140	2010-05-06 00:05:37 +00:00
Chris Lattner	482fa218d4	fix rdar://7946934 - in some limited cases, the assembler should allow $ at the start of a symbol name. llvm-svn: 103137	2010-05-05 23:51:28 +00:00
Jim Grosbach	151cd8f159	Cleanup of ARMv7M support. Move hardware divide and Thumb2 extract/pack instructions to subtarget features and update tests to reflect. PR5717. llvm-svn: 103136	2010-05-05 23:44:43 +00:00
Jakob Stoklund Olesen	1b6f698e85	Fix PR6520. An earlyclobber physreg must not be allocated to anything else. llvm-svn: 103133	2010-05-05 23:07:41 +00:00
Stuart Hastings	7e60a6bd71	Test case for pr2394 and r102979. llvm-svn: 103129	2010-05-05 22:49:33 +00:00
Jim Grosbach	245b169212	fix copy/paste oops. llvm-svn: 103122	2010-05-05 21:07:46 +00:00
Jim Grosbach	44d7f49887	Add tests for ARMV7M divide instruction use llvm-svn: 103120	2010-05-05 20:47:15 +00:00
Jim Grosbach	e36cd72e38	remove unneeded underscores. llvm-svn: 103114	2010-05-05 19:55:58 +00:00
Jim Grosbach	5ced648ba8	Convert to filecheck llvm-svn: 103113	2010-05-05 19:41:11 +00:00
Daniel Dunbar	f3a53baf00	MC/Mach-O: Mark absolute variable's appropriately, and add Mach-O support for writing them. - <rdar://problem/7885351> integrated assembler broken for i386 objc code llvm-svn: 103112	2010-05-05 19:01:05 +00:00
Daniel Dunbar	027fa5f31c	MC/Mach-O/x86_64: Relocations in debug sections should use local relocations when possible. - <rdar://problem/7934873> llvm-svn: 103092	2010-05-05 17:22:39 +00:00
Duncan Sands	687900ed83	Use llvm.foo as the intrinsic, rather than llvm.dbg.value. Since the values passed to llvm.dbg.value were not valid for the intrinsic, it might have caused trouble one day if the verifier ever started checking for valid debug info. llvm-svn: 103038	2010-05-04 20:09:25 +00:00
Chris Lattner	0185047b3f	"on the rare occasion the SPU BE produces illegal assembly - it tries to emit an add instruction of the form 'a reg, reg, imm'." Patch by Kalle Raiskila! llvm-svn: 103021	2010-05-04 17:58:46 +00:00
Daniel Dunbar	c3e0bafc6d	MC/X86: Chris pointed that 'as' isn't consistent in accepting the long form of instructions which have no direct register usage. Darwin 'as' accepts: add $0, (%rax) but rejects mov $0, (%rax) for example. Given that, only accept suffix matches which match exactly one form. We still need to emit nice diagnostics for failures... llvm-svn: 103015	2010-05-04 17:31:02 +00:00
Daniel Dunbar	9b816a1bb3	MC/X86: Add "support" for matching ATT style mnemonic prefixes. - The idea is that when a match fails, we just try to match each of +'b', +'w', +'l'. If exactly one matches, we assume this is a mnemonic prefix and accept it. If all match, we assume it is width generic, and take the 'l' form. - This would be a horrible hack, if it weren't so simple. Therefore it is an elegant solution! Chris gets the credit for this particular elegant solution. :) - Next step to making this more robust is to have the X86 matcher generate the mnemonic prefix information. Ideally we would also compute up-front exactly which mnemonic to attempt to match, but this may require more custom code in the matcher than is really worth it. llvm-svn: 103012	2010-05-04 16:12:42 +00:00
Duncan Sands	c2928c6ef5	Fix a variant of PR6112 found by thinking about it: when doing RAUW of a global variable with a local variable in function F, if function local metadata M in function G was using the global then M would become function-local to both F and G, which is not allowed. See the testcase for an example. Fixed by detecting this situation and zapping the metadata operand when it occurs. llvm-svn: 103007	2010-05-04 12:43:36 +00:00
Devang Patel	075e9b5d66	Set DW_AT_APPLE_omit_frame_ptr in endFunction() where MachineFunction is available all the time. llvm-svn: 103001	2010-05-04 06:15:30 +00:00
Devang Patel	801b8ea42a	Do not ignore debug loc attached with llvm.dbg.declare while collecting debug info used by a module. llvm-svn: 102995	2010-05-04 01:05:02 +00:00
Dale Johannesen	81bfca7bde	Implement builtin_return_address(x) and builtin_frame_address(x) on PPC for x!=0. 7624113. llvm-svn: 102972	2010-05-03 22:59:34 +00:00
Jakob Stoklund Olesen	f4e4e84115	Check that subregisters don't have independent values in RemoveCopyByCommutingDef(). This fixes PR6941. llvm-svn: 102970	2010-05-03 22:40:32 +00:00
Dan Gohman	0553acff5e	Fix tests to use fadd, fsub, and fmul, instead of add, sub, and mul, when the type is floating-point. llvm-svn: 102969	2010-05-03 22:36:46 +00:00
Bill Wendling	06bf470104	Revert r102948. llvm-svn: 102964	2010-05-03 21:51:21 +00:00
Kevin Enderby	6f2f8d0798	Changed llvm-mc to use the same suffixes with floating point compare instructions as the Mac OS X darwin assembler. Some of which like 'fcoml' assembled to different opcodes. While some of the suffixes were just different. llvm-svn: 102958	2010-05-03 21:31:40 +00:00
Kevin Enderby	e3a1726034	Fixed the encoding of two of the X86 movq instuctions. The Move quadword from mm to mm/m64 and the Move quadword from xmm2/mem64 to xmm1 had the incorrect encodings. llvm-svn: 102952	2010-05-03 21:03:31 +00:00
Kevin Enderby	1a51d4cec9	Fixed the encoding of the x86 push instructions. Using a 32-bit immediate value caused the a pushl instruction to be incorrectly encoding using only two bytes of immediate, causing the following 2 instruction bytes to be part of the 32-bit immediate value. Also fixed the one byte form of push to be used when the immediate would fit in a signed extended byte. Lastly changed the names to not include the 32 of PUSH32 since they actually push the size of the stack pointer. llvm-svn: 102951	2010-05-03 20:45:05 +00:00
Bill Wendling	88c734e8ae	Testcase for r102947. llvm-svn: 102948	2010-05-03 20:39:35 +00:00
Devang Patel	9f5200a122	Check for side effects before splitting loop. Patch by Jakub Staszak! llvm-svn: 102928	2010-05-03 18:06:58 +00:00
Dan Gohman	2ad68de4aa	Fix a bug which prevented tail merging of return instructions in beneficial cases. See the changes in test/CodeGen/X86/tail-opts.ll and test/CodeGen/ARM/ifcvt2.ll for details. The fix is to change HashEndOfMBB to hash at most one instruction, instead of trying to apply heuristics about when it will be profitable to consider more than one instruction. The regular tail-merging heuristics are already prepared to handle the same cases, and they're more precise. Also, make test/CodeGen/ARM/ifcvt5.ll and test/CodeGen/Thumb2/thumb2-branch.ll slightly more complex so that they continue to test what they're intended to test. And, this eliminates the problem in test/CodeGen/Thumb2/2009-10-15-ITBlockBranch.ll, the testcase from PR5204. Update it accordingly. llvm-svn: 102907	2010-05-03 14:35:47 +00:00
Duncan Sands	211427bda9	Remove the -enable-sjlj-eh option, which doesn't do anything. Remove the -enable-eh option which is only used by the JIT, and replace it with -jit-enable-eh. llvm-svn: 102865	2010-05-02 15:36:26 +00:00
Chris Lattner	b49a622fe9	revert r102831. We already delete dead readonly calls in other places, killing a valid transformation is not the right answer. llvm-svn: 102850	2010-05-01 17:19:38 +00:00
Anton Korobeynikov	737718d4f4	Insert ANY_EXTEND node instead of invalid truncate during DAG Combining (X & 1), when needed. This fixes PR7001 llvm-svn: 102838	2010-05-01 12:52:34 +00:00
Anton Korobeynikov	319d71f44f	Do folding for indirect branches, where possible llvm-svn: 102836	2010-05-01 12:28:21 +00:00
Anton Korobeynikov	ebbdfef2fc	Implement indirect branches on MSP430 llvm-svn: 102835	2010-05-01 12:04:32 +00:00
Owen Anderson	550986ea90	Disable the call-deletion transformation introduced in r86975. Without halting analysis, it is illegal to delete a call to a read-only function. The correct solution is almost certainly to add a "must halt" attribute and only allow deletions in its presence. XFAIL the relevant testcase for now. llvm-svn: 102831	2010-05-01 08:34:28 +00:00
Chris Lattner	532112b98a	fix PR5009 by making CGSCCPM realize that a call was devirtualized if an indirect call site was removed and a direct one was added, not just if an indirect call site was modified to be direct. llvm-svn: 102830	2010-05-01 06:38:43 +00:00
Chris Lattner	c3bc80a082	rename test llvm-svn: 102829	2010-05-01 06:34:13 +00:00
Chris Lattner	fc8d9ee6c3	Implement rdar://6295824 and PR6724 with two tiny changes that can have a big effect :). The first is to enable the iterative SCC passmanager juice that kicks in when the scc passmgr detects that a function pass has devirtualized a call. In this case, it will rerun all the passes it manages on the SCC, up to the iteration count limit (4). This is useful because a function pass may devirualize a call, and we want the inliner to inline it, or pruneeh to infer stuff about it, etc. The second patch is to add all call sites to the DevirtualizedCalls list the inliner uses. This list is about to get renamed, but the jist of this is that the inliner now reconsiders all inlined call sites as candidates for further inlining. The intuition is this that in cases like this: f() { g(1); } g(int x) { h(x); } We analyze this bottom up, and may decide that it isn't profitable to inline H into G. Next step, we decide that it is profitable to inline G into F, and do so, which means that F now calls H. Even though the call from G -> H may not have been profitable to inline, the call from F -> H may be (in this case because a constant allows folding etc). In my spot checks, this doesn't have a big impact on code. For example, the LLC output for 252.eon grew from 0.02% (from 317252 to 317308) and 176.gcc actually shrunk by .3% (from 1525612 to 1520964 bytes). 252.eon never iterated in the SCC Passmgr, 176.gcc iterated at most 1 time. llvm-svn: 102823	2010-05-01 01:15:56 +00:00
Chris Lattner	e8262675a3	The inliner has traditionally not considered call sites that appear due to inlining a callee as candidates for futher inlining, but a recent patch made it do this if those call sites were indirect and became direct. Unfortunately, in bizarre cases (see testcase) doing this can cause us to infinitely inline mutually recursive functions into callers not in the cycle. Fix this by keeping track of the inline history from which callsite inline candidates got inlined from. This shouldn't affect any "real world" code, but is required for a follow on patch that is coming up next. llvm-svn: 102822	2010-05-01 01:05:10 +00:00
Bill Wendling	02bc6787ca	Test failing too much on too many platforms. llvm-svn: 102812	2010-05-01 00:12:33 +00:00
Bill Wendling	06cacb1291	Maybe it needs sse2? llvm-svn: 102802	2010-04-30 23:19:29 +00:00
Bill Wendling	613fb7daa6	Force 64-bit. llvm-svn: 102800	2010-04-30 22:45:20 +00:00
Chris Lattner	a9bac86d16	Dan recently disabled recursive inlining within a function, but we were still inlining self-recursive functions into other functions. Inlining a recursive function into itself has the potential to reduce recursion depth by a factor of 2, inlining a recursive function into something else reduces recursion depth by exactly 1. Since inlining a recursive function into something else is a weird form of loop peeling, turn this off. The deleted testcase was added by Dale in r62107, since then we're leaning towards not inlining recursive stuff ever. In any case, if we like inlining recursive stuff, it should be done within the recursive function itself to get the algorithm recursion depth win. llvm-svn: 102798	2010-04-30 22:37:22 +00:00
Bill Wendling	de4b225093	EXTRACT_VECTOR_ELT of an INSERT_VECTOR_ELT may have the same index, but the indexes could be of a different value type. Or not even using the same SDNode for the constant (weird, I know). Compare the actual values instead of the pointers. llvm-svn: 102791	2010-04-30 22:19:17 +00:00
Jakob Stoklund Olesen	9afed0f98b	The local register allocator has to spill dirty callee saved registers before a call that might throw. The landing pad assumes that all registers are in stack slots. We used to spill those dirty CSRs after the call, and the stack slots would be wrong when arriving at the landing pad. llvm-svn: 102770	2010-04-30 21:19:29 +00:00
Devang Patel	3ca9a9b59c	Preserve debug info attached with call instruction while eliminating dead argument. Radar 7927803 llvm-svn: 102760	2010-04-30 20:23:54 +00:00
Devang Patel	cde3576e0d	New test. llvm-svn: 102746	2010-04-30 19:39:29 +00:00
Dan Gohman	299e7b93ac	Add lint checks for invalid uses of memory. llvm-svn: 102733	2010-04-30 19:05:00 +00:00
Dan Gohman	6221b85680	Add -o /dev/null to some tests which don't care about their output. llvm-svn: 102722	2010-04-30 17:42:30 +00:00
Evan Cheng	5f2314f3a3	Fix test. llvm-svn: 102694	2010-04-30 06:00:56 +00:00
Evan Cheng	5117a555e0	Another sibcall bug. If caller and callee calling conventions differ, then it's only safe to do a tail call if the results are returned in the same way. llvm-svn: 102683	2010-04-30 01:12:32 +00:00
Jakob Stoklund Olesen	8d4214578d	Reject really weird coalescer case when trying to merge identical subregisters of different register classes. e.g. %reg1048:3<def> = EXTRACT_SUBREG %RAX<kill>, 3 Where %reg1048 is a GR32 register. This is not impossible to handle, but it is pretty hard and very rare. This should unbreak the dragonegg builder. llvm-svn: 102672	2010-04-29 23:47:46 +00:00
Evan Cheng	38dfa5cf20	Load folding tail call should not use ebp / rbp after it's popped. PEI should use esp / rsp to reference frame instead. llvm-svn: 102596	2010-04-29 05:08:22 +00:00
Kevin Enderby	4822841b82	Fixed the word sized Bit Scan Forward/Reverse instructions, they needed the Operand size override prefix to be part of their records. llvm-svn: 102556	2010-04-28 23:20:40 +00:00
Chris Lattner	669064a772	fix this to work with objdir != srcdir llvm-svn: 102547	2010-04-28 22:34:35 +00:00
Dale Johannesen	2288ef6c33	Fix comment. llvm-svn: 102545	2010-04-28 22:23:46 +00:00
Dale Johannesen	8d6d94f493	Test for llvm-gcc checkin 102543. llvm-svn: 102544	2010-04-28 22:17:33 +00:00
Devang Patel	4c18a3ac80	Update tests. Now DBG_VALUE instruction is created only if alloca corresponding to llvm.dbg.declare is missing. llvm-svn: 102524	2010-04-28 20:27:48 +00:00
Chris Lattner	450e29cb4c	fix PR6112 - When globalopt (or any other pass) does RAUW(@G, %G), metadata references in non-function-local MDNodes should drop to null. llvm-svn: 102519	2010-04-28 20:16:12 +00:00
Chris Lattner	08e9e72fa9	Rework global alignment computation again. Now we do round up alignment of globals to the preferred alignment, but only when there is no section specified on the global (by far the common case). llvm-svn: 102515	2010-04-28 19:58:07 +00:00
Evan Cheng	050df1b8de	Enable i16 to i32 promotion by default. llvm-svn: 102493	2010-04-28 08:30:49 +00:00
Evan Cheng	fe420adde0	Update tests. llvm-svn: 102487	2010-04-28 01:53:13 +00:00
Devang Patel	50c9431203	Emit debug info for byval parameters. llvm-svn: 102486	2010-04-28 01:39:28 +00:00
Evan Cheng	eb828b6391	Do not count kill, implicit_def instructions as printed instructions. llvm-svn: 102453	2010-04-27 19:38:45 +00:00
Chris Lattner	64d43d80be	round zero-byte .zerofill directives up to 1 byte. This should fix some "g++.dg-struct-layout-1" failures, rdar://7886017 llvm-svn: 102421	2010-04-27 07:41:44 +00:00
Dale Johannesen	022e7b900f	Un-XFAIL this on ppc. My enabling of dbg_declare handling in ISel fixed it. llvm-svn: 102404	2010-04-27 00:01:42 +00:00
Chris Lattner	6a5e706e3c	on darwin empty functions need to codegen into something of non-zero length, otherwise labels get incorrectly merged. We handled this by emitting a ".byte 0", but this isn't correct on thumb/arm targets where the text segment needs to be a multiple of 2/4 bytes. Handle this by emitting a noop. This is more gross than it should be because arm/ppc are not fully mc'ized yet. This fixes rdar://7908505 llvm-svn: 102400	2010-04-26 23:37:21 +00:00
Bob Wilson	25f85947a3	Handle register-to-register copies within the tGPR class. Radar 7896289 llvm-svn: 102396	2010-04-26 23:20:08 +00:00
Devang Patel	bd798ce8dd	Use DW_AT_entry_pc instead of DW_AT_low_pc/DW_AT_high_pc pair. This simplifies debug range entries. llvm-svn: 102394	2010-04-26 22:54:28 +00:00
Dan Gohman	58b0470592	When checking whether the special handling for an addrec increment which doesn't dominate the header is needed, don't check whether the increment expression has computable loop evolution. While the operands of an addrec are required to be loop-invariant, they're not required to dominate any part of the loop. This fixes PR6914. llvm-svn: 102389	2010-04-26 21:46:36 +00:00
Dan Gohman	d07d2f9774	Add a comment to this test. llvm-svn: 102387	2010-04-26 21:37:43 +00:00
Chris Lattner	f740a8ceeb	fix PR6921 a different way. Intead of increasing the alignment of globals with a specified alignment, we fix common variables to obey their alignment. Add a comment explaining why this behavior is important. llvm-svn: 102365	2010-04-26 18:46:46 +00:00
Chris Lattner	e80442aa6d	Revert r102300/102301, which serious broke objc apps. llvm-svn: 102359	2010-04-26 18:30:45 +00:00
Chris Lattner	87aa2243e2	fix PR6940: sitofp(undef) folds to 0.0, not undef. llvm-svn: 102358	2010-04-26 18:21:23 +00:00
Chris Lattner	4d7b4b4d15	testcase for PR6913 llvm-svn: 102303	2010-04-25 05:51:14 +00:00
Chris Lattner	6ac247a092	this passes now. llvm-svn: 102301	2010-04-25 05:49:31 +00:00
Chris Lattner	386a220f70	Fix PR6921: globals were not getting correctly rounded up to their preferred alignment unless they were common or some other special case. llvm-svn: 102300	2010-04-25 05:30:43 +00:00
Dan Gohman	534ba376f6	Generalize LSR's OptimizeMax to handle the new kinds of max expressions that indvars may use, now that indvars is recognizing le and ge loops. llvm-svn: 102235	2010-04-24 03:13:44 +00:00
Dan Gohman	f33bac3afe	ScalarEvolution support for <= and >= loops. Also, generalize ScalarEvolutions's min and max recognition to handle some new forms of min and max that this change makes more common. llvm-svn: 102234	2010-04-24 03:09:42 +00:00
Chris Lattner	11d1df442d	no longer xfail llvm-svn: 102220	2010-04-23 22:39:33 +00:00
Stuart Hastings	c8b2fc0909	Per Chris, fuse four trivial tests using grep (r102199) into one that uses FileCheck. llvm-svn: 102216	2010-04-23 22:12:57 +00:00
Dan Gohman	e1931fa676	Change TargetData's algorithm for computing defualt vector type alignment to match what's used in clang and GCC for __alignof, rather than trying to guess what Legalize is going to be doing. llvm-svn: 102206	2010-04-23 19:41:15 +00:00
Stuart Hastings	24b63f1597	Add some missing x86 patterns for movdq2q. Fixes two (LLVM-)GCC DejaGNU testcases. Radar 6881029. llvm-svn: 102199	2010-04-23 19:03:32 +00:00
Chris Lattner	126a58e084	fix some failures my callgraph dump format change broke. llvm-svn: 102197	2010-04-23 18:38:40 +00:00
Chris Lattner	6f41ef9d31	testcase for the bug that required a patch to be reverted. llvm-svn: 102195	2010-04-23 18:31:01 +00:00
Dan Gohman	997bbc54d6	Fix LSR to tolerate cases where ScalarEvolution initially misses an opportunity to fold add operands, but folds them after LSR has separated them out. This fixes rdar://7886751. llvm-svn: 102157	2010-04-23 01:55:05 +00:00
Chris Lattner	d8d898dbd3	disable my previous inliner patch, it appears to be busting self-host. llvm-svn: 102153	2010-04-23 00:41:03 +00:00
Chris Lattner	2eee5d3467	The inliner was choosing to not consider call sites that appear in the SCC as a result of inlining as candidates for inlining. Change this so that it does consider call sites that change from being indirect to being direct as a result of inlining. This allows it to completely "devirtualize" the testcase. llvm-svn: 102146	2010-04-22 23:37:35 +00:00
Jim Grosbach	825cb299cd	Update ARM DAGtoDAG for matching UBFX instruction for unsigned bitfield extraction. This fixes PR5998. llvm-svn: 102144	2010-04-22 23:24:18 +00:00
Devang Patel	894874e7af	Remove the test for now. llvm-svn: 102135	2010-04-22 22:06:28 +00:00
Devang Patel	ea2744f4dc	Adjust debug range offsets for isWeakForLinker() functions. llvm-svn: 102127	2010-04-22 20:52:00 +00:00
Chris Lattner	055cf267db	add a DEBUG call so that -debug lists when CGSCCPM iterates. Fix RefreshCallGraph to use CGN->replaceCallEdge instead of hand rolling its own loop. replaceCallEdge properly maintains the reference counts of the nodes, fixing a crash exposed by the iterative callgraph stuff. llvm-svn: 102120	2010-04-22 20:42:33 +00:00
Dan Gohman	acd700a24b	Don't attempt to analyze values which are obviously undef. This fixes some assertion failures in extreme cases. llvm-svn: 102042	2010-04-22 01:35:11 +00:00
Evan Cheng	02e816b317	Do not try to optimize a copy that has already been marked for deletion. llvm-svn: 102027	2010-04-21 20:57:54 +00:00
Evan Cheng	4158a0ff6b	Implement -disable-non-leaf-fp-elim which disable frame pointer elimination optimization for non-leaf functions. This will be hooked up to gcc's -momit-leaf-frame-pointer option. rdar://7886181 llvm-svn: 101984	2010-04-21 03:18:23 +00:00
Johnny Chen	dd56c40591	Thumb instructions which have reglist operands at the end and predicate operands before reglist were not properly handled with respect to IT Block. Fix that by creating a new method ARMBasicMCBuilder::DoPredicateOperands() used by those instructions for disassembly. Add a test case. llvm-svn: 101974	2010-04-21 01:01:19 +00:00
Chris Lattner	6fbe704932	Implement (but don't enable) PR6724 and rdar://6295824. In short, we have RefreshCallGraph detect when a function pass devirtualizes a call, and have CGSCCPassMgr iterate (up to a count) when this happens. This allows (in the example) GVN to devirtualize the call in foo, then the inliner to inline it away. This is not currently enabled because I haven't done any analysis on the (potentially substantial) code size or performance impact of doing this, and guess what, it exposes callgraph updating bugs in various passes. This is progress though, and you can play with it by passing -max-cg-scc-iterations=5 to opt. llvm-svn: 101973	2010-04-21 00:47:40 +00:00
Evan Cheng	2034d9f2da	- Clean up some crappy code which deals with coalescing of copies which look at extract_subreg / insert_subreg, etc. - Add support for more aggressive insert_subreg coalescing. llvm-svn: 101971	2010-04-21 00:44:22 +00:00
Dan Gohman	4398308fa7	Revert r101471. For tight recursive functions which have multiple recursive callsites, inlining can reduce the number of calls by exponential factors, as it does in MultiSource/Benchmarks/Olden/treeadd. More involved heuristics will be needed. llvm-svn: 101969	2010-04-21 00:43:30 +00:00
Dan Gohman	ad33d33719	Add another variant of this test which found a place where CodeGen's ComputeMaskedBits was being over-conservative when computing bits for an ADD. llvm-svn: 101963	2010-04-21 00:19:28 +00:00
Chris Lattner	84776786a7	teach the x86 address matching stuff to handle (shl (or x,c), 3) the same as (shl (add x, c), 3) when x doesn't have any bits from c set. This finishes off PR1135. Before we compiled the block to: to: LBB0_3: ## %bb cmpb $4, %dl sete %dl addb %dl, %cl movb %cl, %dl shlb $2, %dl addb %r8b, %dl shlb $2, %dl movzbl %dl, %edx movl %esi, (%rdi,%rdx,4) leaq 2(%rdx), %r9 movl %esi, (%rdi,%r9,4) leaq 1(%rdx), %r9 movl %esi, (%rdi,%r9,4) addq $3, %rdx movl %esi, (%rdi,%rdx,4) incb %r8b decb %al movb %r8b, %dl jne LBB0_1 Now we produce: LBB0_3: ## %bb cmpb $4, %dl sete %dl addb %dl, %cl movb %cl, %dl shlb $2, %dl addb %r8b, %dl shlb $2, %dl movzbl %dl, %edx movl %esi, (%rdi,%rdx,4) movl %esi, 8(%rdi,%rdx,4) movl %esi, 4(%rdi,%rdx,4) movl %esi, 12(%rdi,%rdx,4) incb %r8b decb %al movb %r8b, %dl jne LBB0_1 llvm-svn: 101958	2010-04-20 23:18:40 +00:00
Johnny Chen	d7209d2d56	When doing Thumb disassembly, there's no need to consider t2ADDrSPi12/t2SUBrSPi12, as their generic counterparts t2ADDri12/t2SUBri12 should suffice. llvm-svn: 101929	2010-04-20 18:45:24 +00:00
Bill Wendling	a8ae1783b4	Move CodeGen/X86/2010-04-19-DAGCombineCrash.ll into CodeGen/X86/crash.ll. Also reduce. llvm-svn: 101925	2010-04-20 18:14:47 +00:00
Johnny Chen	7be315c414	For t2LDRT, t2LDRBT, t2LDRHT, t2LDRSBT, and t2LDRSHT, if Rn(Inst{19-16})=='1111', transform the Opcode to the corresponding t2LDR*pci counterpart. Ref: A8.6.86 LDRT, A8.6.65 LDRBT, A8.6.77 LDRHT, A8.6.81 LDRSBT, A8.6.85 LDRSHT llvm-svn: 101915	2010-04-20 17:28:50 +00:00
Devang Patel	db6f71b02f	Add RUN: llvm-svn: 101913	2010-04-20 17:20:10 +00:00
Chris Lattner	5100367ff3	Bill's change in r95336 broke empty aggregates embedded in other types. fix this by only bumping zero-byte globals up to a single byte if the entire global is zero size, fixing PR6340. This also fixes empty arrays etc to be handled correctly, and only does this on subsection-via-symbols targets (aka darwin) which is the only place where this matters. llvm-svn: 101879	2010-04-20 06:20:21 +00:00
Chris Lattner	38c1a1a247	teach cellspu how to return i8 and i16 from calls, patch by Kalle Raiskila! llvm-svn: 101875	2010-04-20 05:36:09 +00:00
Chris Lattner	5814d9d9da	RewriteLoopBodyWithConditionConstant can end up rewriting the condition we're unswitching on. In this case, don't try to simplify the second copy of the loop which may be dead or not, but is probably a constant now. This fixes PR6879 llvm-svn: 101870	2010-04-20 05:09:16 +00:00
Chris Lattner	c239eb79bd	reapply 'reject forward references to functions whose type don't match' now that the testsuite has been updated. llvm-svn: 101866	2010-04-20 04:49:11 +00:00
Bill Wendling	467e6c2deb	The visitXOR method can return the same SDNode. If so, we don't want to delete it as it's not dead. llvm-svn: 101855	2010-04-20 01:25:01 +00:00
Eric Christopher	64831c6a4c	Remove the palignr intrinsics now that we lower them to vector shuffles, shifts and null vectors. Autoupgrade these to what we'd lower them to. Add a testcase to exercise this. llvm-svn: 101851	2010-04-20 00:59:54 +00:00
Chris Lattner	e93846762a	Fix rdar://7879828 - crash in CallGraph, a self host issue. Arg promotion was deleting call graph nodes that still had references from the 'indirect' CGN. Like the inliner, it should only delete the function if all references are gone. llvm-svn: 101845	2010-04-20 00:46:50 +00:00
Bob Wilson	92a4685dd2	Fix tests for Neon load/store intrinsics to match the i8* types expected by the intrinsics. The reason for those i8* types is that the intrinsics are overloaded on the vector type and we don't have a way to declare an intrinsic where one argument is an overloaded vector type and another argument is a pointer to the vector element type. The bitcasts added here will match what the frontend will typically generate when these intrinsics are used. llvm-svn: 101840	2010-04-20 00:17:16 +00:00
Dan Gohman	e637ff5e9a	Remove the Expr member from IVUsers. Instead of remembering the expression, just ask ScalarEvolution for it on demand. This helps IVUsers be more robust in the case of expressions changing underneath it. This fixes PR6862. llvm-svn: 101819	2010-04-19 21:48:58 +00:00
Johnny Chen	777346e749	According to A8.6.16 B (Encoding T3) and A8.3 Conditional execution -- A8.3.1 Pseudocode details of conditional, Condition bits '111x' indicate the instruction is always executed. That is, '1111' is a leagl condition field value, which is now mapped to ARMCC::AL. Also add a test case for condition field '1111'. llvm-svn: 101817	2010-04-19 21:19:52 +00:00
Devang Patel	561fa8490e	Fix typo. add a test case. llvm-svn: 101812	2010-04-19 20:31:39 +00:00
Johnny Chen	cbe3e1a3df	ARM disassembler did not react to recent changes to the NEON instruction table. VLD1q_UPD and VST1q_UPD have the ${dst:dregpair} operand now. llvm-svn: 101784	2010-04-19 16:20:34 +00:00
Nick Lewycky	fbe8d2803d	Fix declarations in a few more tests. llvm-svn: 101676	2010-04-17 21:29:25 +00:00
Daniel Dunbar	c459a0ff81	Revert "reject forward references to functions whose type don't match", because DJG told me to! llvm-svn: 101675	2010-04-17 21:24:55 +00:00
Nick Lewycky	d4c0f86a5e	Fix intrinsic signature in this test. llvm-svn: 101674	2010-04-17 21:12:55 +00:00
Chris Lattner	5a44950aae	reject forward references to functions whose type don't match up with the definition (and fix a broken testcase). PR6491. llvm-svn: 101670	2010-04-17 20:45:56 +00:00
Chris Lattner	2b3a32f7a0	doh, didn't mean to check in my hackaround lit sucking. :) llvm-svn: 101663	2010-04-17 19:04:03 +00:00
Chris Lattner	0a8d91a816	fix PR6332, allowing an index of zero into a zero sized array even if the element of the array has no size. llvm-svn: 101662	2010-04-17 19:02:33 +00:00
Chris Lattner	b927073f2e	teach the x86 asm parser how to handle segment prefixes in memory operands. rdar://7874844 llvm-svn: 101661	2010-04-17 18:56:34 +00:00
Chris Lattner	5495c8e415	testcase for r101538, patch by Nico Schmidt! llvm-svn: 101642	2010-04-17 17:22:06 +00:00
Dan Gohman	4fee6f3bdd	Start function numbering at 0. llvm-svn: 101638	2010-04-17 16:29:15 +00:00
Chris Lattner	7f5088e6de	a bunch of ssse3 instructions are misencoded to think they have an i8 field when they really do not. This fixes rdar://7840289 llvm-svn: 101629	2010-04-17 07:38:24 +00:00
Evan Cheng	3af19e80c9	Add nounwind. llvm-svn: 101613	2010-04-17 03:43:36 +00:00
Bob Wilson	ca51425d94	Re-commit my previous SSAUpdater changes. The previous version naively tried to determine where to place PHIs by iteratively comparing reaching definitions at each block. That was just plain wrong. This version now computes the dominator tree within the subset of the CFG where PHIs may need to be placed, and then places the PHIs in the iterated dominance frontier of each definition. The rest of the patch is mostly the same, with a few more performance improvements added in. llvm-svn: 101612	2010-04-17 03:08:24 +00:00
Johnny Chen	034e0b1e68	Minor change to make the test case comply with Vd<0> == '0' when Q == '1'. llvm-svn: 101559	2010-04-16 22:48:31 +00:00
Johnny Chen	b90b6f1a35	Fixed a bug in DisassembleN1RegModImmFrm() where a break stmt was missing for a case. Also, the 0xFF hex literal involved in the shift for ESize64 should be suffixed "ul" to preserve the shift result. Implemented printHex*ImmOperand() by copying from ARMAsmPrinter.cpp and added a test case for DisassembleN1RegModImmFrm()/printHex64ImmOperand(). llvm-svn: 101557	2010-04-16 22:40:20 +00:00
Johnny Chen	2b7aba10c2	In the same spirit of r101524, which removed the assert() from printAddrMode2OffsetOperand(), this patch removes the assert() from printAddrMode3OffsetOperand() and adds a test case. llvm-svn: 101529	2010-04-16 19:57:21 +00:00
Johnny Chen	807e1748fc	Multiclass LdStCop was using pre-UAL syntax LDC<c>L for the L fragment. Changed to the UAL syntax of LDCL<c>, instead. Add a test case for this change which also tests the removal of assert() from printAddrMode2OffsetOperand(). llvm-svn: 101527	2010-04-16 19:33:23 +00:00
Dan Gohman	c1ce91603c	Revert r101455, which fails on the llvm-arm-linux buildbot. llvm-svn: 101515	2010-04-16 18:37:31 +00:00
Dan Gohman	f13f69f296	Disable inlining of recursive calls. It can complicate tailcallelim and dependent analyses, and increase code size, so doing it profitably would require more complex heuristics. llvm-svn: 101471	2010-04-16 16:01:18 +00:00
Dan Gohman	99e5327bfd	Refine the detection of seemingly infinitely recursive calls where the callee is expected to be expanded to something else by codegen, so that normal infinitely recursive calls are still transformed. llvm-svn: 101468	2010-04-16 15:57:50 +00:00
Bill Wendling	ae4541d758	Add JIT exception handling test. llvm-svn: 101455	2010-04-16 09:04:28 +00:00
Chris Lattner	393e08536d	move comment. llvm-svn: 101433	2010-04-16 01:05:52 +00:00
Chris Lattner	1146d326a7	fix PR6832: we were using the alignment of a pointer when we wanted the alignment of the pointee. llvm-svn: 101432	2010-04-16 01:05:38 +00:00
Johnny Chen	1d3ee607b3	Added another test case for am3offset operand, testing Rn, #+/-imm8. Previous checkin tested Rn, #+/-Rm. llvm-svn: 101418	2010-04-15 23:23:40 +00:00
Jakob Stoklund Olesen	dc6d42dbf8	Add test case for machine-sink on critical edges llvm-svn: 101416	2010-04-15 23:19:16 +00:00
Johnny Chen	acbc06c2a3	Fixed a bug in ARM disassembly where LDRSBT should have am3offset operand, not am2offset. Modified the instruction table entry and added a new test case. llvm-svn: 101415	2010-04-15 23:12:47 +00:00
Evan Cheng	f7f97b4bbd	Use default lowering of DYNAMIC_STACKALLOC. As far as I can tell, ARM isle is doing the right thing and codegen looks correct for both Thumb and Thumb2. llvm-svn: 101410	2010-04-15 22:20:34 +00:00
Jakob Stoklund Olesen	b642a27525	Fix PR6847. RegScavenger should ignore DebugValues. llvm-svn: 101392	2010-04-15 20:28:39 +00:00
Evan Cheng	1ba1428577	ARM SelectDYN_ALLOC should emit a copy from SP rather than referencing SP directly. In cases where there are two dyn_alloc in the same BB it would have caused the old SP value to be reused and badness ensues. rdar://7493908 llvm is generating poor code for dynamic alloca, I'll fix that later. llvm-svn: 101383	2010-04-15 18:42:28 +00:00
Chris Lattner	3245afdf05	enhance the load/store narrowing optimization to handle a tokenfactor in between the load/store. This allows us to optimize test7 into: _test7: ## @test7 ## BB#0: ## %entry movl (%rdx), %eax ## kill: SIL<def> ESI<kill> movb %sil, 5(%rdi) ret instead of: _test7: ## @test7 ## BB#0: ## %entry movl 4(%esp), %ecx movl $-65281, %eax ## imm = 0xFFFFFFFFFFFF00FF andl 4(%ecx), %eax movzbl 8(%esp), %edx shll $8, %edx addl %eax, %edx movl 12(%esp), %eax movl (%eax), %eax movl %edx, 4(%ecx) ret llvm-svn: 101355	2010-04-15 06:10:49 +00:00
Chris Lattner	6ebd8674eb	teach codegen to turn trunc(zextload) into load when possible. This doesn't occur much at all, it only seems to formed in the case when the trunc optimization kicks in due to phase ordering. In that case it is saves a few bytes on x86-32. llvm-svn: 101350	2010-04-15 05:40:59 +00:00
Chris Lattner	f9b2e3c68a	add a simple dag combine to replace trivial shl+lshr with and. This happens with the store->load narrowing stuff. llvm-svn: 101348	2010-04-15 05:28:43 +00:00
Chris Lattner	4041ab6e00	Implement rdar://7860110 (also in target/readme.txt) narrowing a load/or/and/store sequence into a narrower store when it is safe. Daniel tells me that clang will start producing this sort of thing with bitfields, and this does trigger a few dozen times on 176.gcc produced by llvm-gcc even now. This compiles code like CodeGen/X86/2009-05-28-DAGCombineCrash.ll into: movl %eax, 36(%rdi) instead of: movl $4294967295, %eax ## imm = 0xFFFFFFFF andq 32(%rdi), %rax shlq $32, %rcx addq %rax, %rcx movq %rcx, 32(%rdi) and each of the testcases into a single store. Each of them used to compile into craziness like this: _test4: movl $65535, %eax ## imm = 0xFFFF andl (%rdi), %eax shll $16, %esi addl %eax, %esi movl %esi, (%rdi) ret llvm-svn: 101343	2010-04-15 04:48:01 +00:00
Chris Lattner	60bbb8c356	further tweak this to do something useful. llvm-svn: 101341	2010-04-15 04:31:42 +00:00
Chris Lattner	9ebaf531ab	remove undef control flow. llvm-svn: 101340	2010-04-15 04:30:19 +00:00
Daniel Dunbar	5f372e2f13	tests: MC/Disassembler tests depend on ARM support being compiler in. llvm-svn: 101337	2010-04-15 03:47:20 +00:00
Jakob Stoklund Olesen	938f2ae310	Remove unneeded types from test. llvm-svn: 101286	2010-04-14 20:56:09 +00:00
Bob Wilson	c05b887c84	Don't custom lower bit converts to ARM VMOVDRRD or VMOVDRR when the operand does not have a legal type. The legalizer does not know how to handle those nodes. Radar 7854640. llvm-svn: 101282	2010-04-14 20:45:23 +00:00
Evan Cheng	9e100384cb	Trim tests and convert to FileCheck. llvm-svn: 101277	2010-04-14 20:22:17 +00:00
Nick Lewycky	ca615eb0d6	Revert r101213. llvm-svn: 101231	2010-04-14 04:51:58 +00:00
Chris Lattner	6b55cb9cd8	implement mc asmparser support for '.', which gets the current PC. rdar://7834775 We now produce an identical .o file compared to the cctools assembler for something like this: _f0: L0: jmp L1 .long . - L0 L1: jmp A .long . - L1 .zerofill __DATA,_bss,A,0 llvm-svn: 101227	2010-04-14 04:40:28 +00:00
Nick Lewycky	8408f33deb	Commit testcase for r101213. llvm-svn: 101214	2010-04-14 03:46:42 +00:00
Devang Patel	c48b976c08	XFAIL this test for powerpc. This test relies on iSel lowering dbg_declare intrinsic when CodeGen::OptLevel is None. On PPC side, CodeGen::OptLevel stays to default when -O0 is used on the command line. llvm-svn: 101190	2010-04-13 23:23:09 +00:00
Evan Cheng	6c35893aa6	Add test for post-ra machine licm. llvm-svn: 101182	2010-04-13 22:10:03 +00:00
Bob Wilson	699bdf7adf	Handle a v2f64 formal parameter that is split between registers and memory such that the entire second half is in memory. Radar 7855014. llvm-svn: 101181	2010-04-13 22:03:22 +00:00
Devang Patel	12d150ea43	Do not include types without any definition in pubtypes list. llvm-svn: 101171	2010-04-13 20:35:04 +00:00
Evan Cheng	4d89dd8353	Fix test on non-x86 hosts. llvm-svn: 101163	2010-04-13 18:54:04 +00:00
Evan Cheng	4ca4bc6f95	Re-apply 101075 and fix it properly. Just reuse the debug info of the branch instruction being optimized. There is no need to --I which can deref off start of the BB. llvm-svn: 101162	2010-04-13 18:50:27 +00:00
Eric Christopher	d67f66dc0c	Temporarily revert r101075, it's causing invalid iterator assertions in a nightly tester. llvm-svn: 101158	2010-04-13 18:37:58 +00:00
Dan Gohman	7ef0dc2163	Teach ScalarEvolution to simplify smax and umax when it can prove that one operand is always greater than another. llvm-svn: 101142	2010-04-13 16:51:03 +00:00
Dan Gohman	5867a56db8	Teach IndVarSimplify how to eliminate remainder operators where the numerator is an induction variable. For example, with code like this: for (i=0;i<n;++i) x[i%n] = 0; IndVarSimplify will now recognize that i is always less than n inside the loop, and eliminate the remainder. llvm-svn: 101113	2010-04-13 01:46:36 +00:00
Chris Lattner	5b212a31a2	add llvm codegen support for -ffunction-sections and -fdata-sections, patch by Sylvere Teissier! llvm-svn: 101106	2010-04-13 00:36:43 +00:00
Evan Cheng	d0d8e3343a	Use .set expression for x86 pic jump table reference to reduce assembly relocation. rdar://7738756 llvm-svn: 101085	2010-04-12 23:07:17 +00:00
Bill Wendling	caaf445a01	Third time's a charm... llvm-svn: 101081	2010-04-12 22:43:21 +00:00
Bill Wendling	4fc5a4d8b8	Genericize the label test. llvm-svn: 101079	2010-04-12 22:40:37 +00:00
Bill Wendling	4627917b9a	Correct test to test what I mean it to test. llvm-svn: 101077	2010-04-12 22:25:42 +00:00
Bill Wendling	b02bbe416f	Micro-optimization: If we have this situation: jCC L1 jmp L2 L1: ... L2: ... We can get a small performance boost by emitting this instead: jnCC L2 L1: ... L2: ... This testcase shows an example of this: float func(float x, float y) { double product = (double)x * y; if (product == 0.0) return product; return product - 1.0; } llvm-svn: 101075	2010-04-12 22:19:57 +00:00
Dan Gohman	4a645b88ef	Suppress LinearFunctionTestReplace when the computed backedge-taken expression is a UDiv and it doesn't appear that the UDiv came from the user's source. ScalarEvolution has recently figured out how to compute a tripcount expression for the inner loop in SingleSource/Benchmarks/Shootout/sieve.c, using a udiv. Emitting a udiv instruction dramatically slows down the enclosing loop. llvm-svn: 101068	2010-04-12 21:13:43 +00:00
Johnny Chen	fc93503c59	Fixed a crasher in arm disassembler within ARMInstPrinter.cpp after calling ARM_AM::getSoImmVal(V) with a legitimate so_imm value: #245 rotate right by 2. Introduce ARM_AM::getSOImmValOneOrNoRotate(unsigned Arg) which is called from ARMInstPrinter.cpp's printSOImm() function, replacing ARM_AM::getSOImmVal(V). [12:44:43] johnny:/Volumes/data/llvm/git/trunk (local-trunk) $ gdb Debug/bin/llvm-mc GNU gdb 6.3.50-20050815 (Apple version gdb-1346) (Fri Sep 18 20:40:51 UTC 2009) Copyright 2004 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions. Type "show copying" to see the conditions. There is absolutely no warranty for GDB. Type "show warranty" for details. This GDB was configured as "x86_64-apple-darwin"...Reading symbols for shared libraries ... done (gdb) set args -triple=arm-apple-darwin9 -debug-only=arm-disassembler --disassemble (gdb) r Starting program: /Volumes/data/llvm/git/trunk/Debug/bin/llvm-mc -triple=arm-apple-darwin9 -debug-only=arm-disassembler --disassemble Reading symbols for shared libraries ++. done 0xf5 0x71 0xf0 0x53 Opcode=201 Name=MVNi Format=ARM_FORMAT_DPFRM(4) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 0: 1: 0: 1\| 0: 0: 1: 1\| 1: 1: 1: 1\| 0: 0: 0: 0\| 0: 1: 1: 1\| 0: 0: 0: 1\| 1: 1: 1: 1\| 0: 1: 0: 1\| ------------------------------------------------------------------------------------------------- mvnpls r7, Assertion failed: (V != -1 && "Not a valid so_imm value!"), function printSOImm, file ARMInstPrinter.cpp, line 229. Program received signal SIGABRT, Aborted. 0x00007fff88c65886 in __kill () (gdb) bt #0 0x00007fff88c65886 in __kill () #1 0x00007fff88d05eae in abort () #2 0x00007fff88cf2ef0 in __assert_rtn () #3 0x000000010020e422 in printSOImm (O=@0x1010bdf80, V=-1, VerboseAsm=false, MAI=0x1020106d0) at ARMInstPrinter.cpp:229 #4 0x000000010020e5fe in llvm::ARMInstPrinter::printSOImmOperand (this=0x1020107e0, MI=0x7fff5fbfee70, OpNum=1, O=@0x1010bdf80) at ARMInstPrinter.cpp:254 #5 0x00000001001ffbc0 in llvm::ARMInstPrinter::printInstruction (this=0x1020107e0, MI=0x7fff5fbfee70, O=@0x1010bdf80) at ARMGenAsmWriter.inc:3236 #6 0x000000010020c27c in llvm::ARMInstPrinter::printInst (this=0x1020107e0, MI=0x7fff5fbfee70, O=@0x1010bdf80) at ARMInstPrinter.cpp:182 #7 0x000000010003cbff in PrintInsts (DisAsm=@0x10200f4e0, Printer=@0x1020107e0, Bytes=@0x7fff5fbff060, SM=@0x7fff5fbff078) at Disassembler.cpp:65 #8 0x000000010003c8b4 in llvm::Disassembler::disassemble (T=@0x1010c13c0, Triple=@0x1010b6798, Buffer=@0x102010690) at Disassembler.cpp:153 #9 0x000000010004095c in DisassembleInput (ProgName=0x7fff5fbff3f0 "/Volumes/data/llvm/git/trunk/Debug/bin/llvm-mc") at llvm-mc.cpp:347 #10 0x000000010003eefb in main (argc=4, argv=0x7fff5fbff298) at llvm-mc.cpp:374 (gdb) q The program is running. Exit anyway? (y or n) y [13:36:26] johnny:/Volumes/data/llvm/git/trunk (local-trunk) $ llvm-svn: 101053	2010-04-12 18:46:53 +00:00
Dan Gohman	6635bb26a6	Generalize ScalarEvolution's PHI analysis to handle loops that don't have preheaders or dedicated exit blocks, as clients may not otherwise need to run LoopSimplify. llvm-svn: 101030	2010-04-12 07:49:36 +00:00
Evan Cheng	250283916d	Enable post regalloc machine licm by default. llvm-svn: 101023	2010-04-12 06:25:28 +00:00
Eric Christopher	1f272f7fd8	Verify function prototypes before trying to optimize functions. We also need TargetData, just return false if we don't have it. Update testcases accordingly. Fixes PR6807. llvm-svn: 101011	2010-04-12 04:48:00 +00:00
Dan Gohman	fa5ad797e3	Re-apply r101000, with a fix: Don't eliminate an icmp which is part of the loop exit test. This usually doesn't come up for a variety of reasons, but it isn't impossible, so make IndVarSimplify handle it conservatively. llvm-svn: 101008	2010-04-12 02:21:50 +00:00
Dan Gohman	c0f1efaf8d	Revert 101000, which is breaking self-host builds. llvm-svn: 101002	2010-04-12 00:17:10 +00:00
Dan Gohman	af4ab1b681	Teach IndVarSimplify how to eliminate comparisons involving induction variables. For example, with code like this: for (i=0;i<n;++i) if (i<n) x[i] = 0; IndVarSimplify will now recognize that i is always less than n inside the loop, and eliminate the if. llvm-svn: 101000	2010-04-11 23:10:12 +00:00
Chris Lattner	9ae28b141f	fix PR6743, a case where we'd delete an instruction before using it in some cases. llvm-svn: 100937	2010-04-10 18:26:57 +00:00
Chris Lattner	b9801ffcb5	fix PR6760, a missing check in heap SRoA. llvm-svn: 100936	2010-04-10 18:19:22 +00:00
Dan Gohman	607e02b33a	When determining a canonical insert position, don't climb deeper into adjacent loops. Also, ensure that the insert position is dominated by the loop latch of any loop in the post-inc set which has a latch. llvm-svn: 100906	2010-04-09 22:07:05 +00:00
Dan Gohman	3295a6e5bc	When emitting code for an add, don't force a SCEVUnknown wrapper around a hoisted intermediate result if the intermediate result isn't an Instruction. llvm-svn: 100884	2010-04-09 19:14:31 +00:00
Benjamin Kramer	7e4a475929	Make sure this test tests something. llvm-svn: 100879	2010-04-09 19:03:31 +00:00
Bob Wilson	030591320d	Add a testcase for svn r100568. llvm-svn: 100876	2010-04-09 18:29:29 +00:00
Chris Lattner	1ef9826ff8	"On SPU, variables in the .bss section that are allocated with the .lcomm directive are not aligned on 16 byte boundaries. This causes misaligned loads, as the generated assembly assumes this "default" alignment. this patch disables .lcomm in favour of '.local .comm' Patch by Kalle Raisklia! llvm-svn: 100875	2010-04-09 18:27:03 +00:00
Dan Gohman	d23fa7d90d	Merge a few fast-isel tests. llvm-svn: 100860	2010-04-09 15:03:55 +00:00
Dan Gohman	9ba08a4631	Add several more lint checks. llvm-svn: 100841	2010-04-09 01:39:53 +00:00
Dan Gohman	ee6451dca1	Fix a bug in IVUsers which was permitting non-affine addrecs to be sent to LSR, which it isn't prepared to handle. llvm-svn: 100839	2010-04-09 01:22:56 +00:00
Chris Lattner	c6c153be45	fix a SCCP miscompilation that could happen when a forced constant is changed to a constant, we would end up adding the instruction to the wrong worklist, preventing it from being properly revisited. This fixes rdar://7832370 llvm-svn: 100837	2010-04-09 01:14:31 +00:00
Dan Gohman	7808d490d3	Add a few more lint checks. llvm-svn: 100825	2010-04-08 23:05:57 +00:00
Evan Cheng	b083c47c21	Coalescer should not delete copy instructions whose defs are partially dead. e.g. %RDI<def,dead> = MOV64rr %RAX<kill>, %EDI<imp-def> llvm-svn: 100804	2010-04-08 20:02:37 +00:00
Dan Gohman	98bc4371c7	Add a -lint pass which checks for common sources of undefined or likely unintended behavior. llvm-svn: 100798	2010-04-08 18:47:09 +00:00
Dan Gohman	cb45bd9cb3	Pointers to zero-sized objects don't point to overlapping objects. llvm-svn: 100789	2010-04-08 18:11:50 +00:00
Dan Gohman	386e01e879	Print empty structs as {} rather than { }. llvm-svn: 100787	2010-04-08 18:03:05 +00:00
Evan Cheng	ebe47c872f	Avoid using f64 to lower memcpy from constant string. It's cheaper to use i32 store of immediates. llvm-svn: 100751	2010-04-08 07:37:57 +00:00
Dan Gohman	4506539d84	When expanding expressions which are using post-inc mode for multiple loops, ensure that the expansion is dominated by the increments of those loops. llvm-svn: 100748	2010-04-08 05:57:57 +00:00
Chris Lattner	3ae2dd2ba5	add newlines at the end of files. llvm-svn: 100705	2010-04-07 22:53:17 +00:00
Dan Gohman	d006ab90dd	Generalize IVUsers to track arbitrary expressions rather than expressions explicitly split into stride-and-offset pairs. Also, add the ability to track multiple post-increment loops on the same expression. This refines the concept of "normalizing" SCEV expressions used for to post-increment uses, and introduces a dedicated utility routine for normalizing and denormalizing expressions. This fixes the expansion of expressions which are post-increment users of more than one loop at a time. More broadly, this takes LSR another step closer to being able to reason about more than one loop at a time. llvm-svn: 100699	2010-04-07 22:27:08 +00:00
Benjamin Kramer	f812ff6f2e	unXFAIL, arm disassembler was reenabled. llvm-svn: 100692	2010-04-07 21:19:41 +00:00
Dale Johannesen	f118f9788b	Split big test into multiple directories to cater to those who don't build all targets. llvm-svn: 100688	2010-04-07 20:43:35 +00:00
Dale Johannesen	27c786bcf9	Test that DEBUG_VALUE comments come out on a variety of targets. llvm-svn: 100682	2010-04-07 20:01:24 +00:00
Chris Lattner	2c88f8a8c4	this has a pr! llvm-svn: 100637	2010-04-07 18:04:56 +00:00
Chris Lattner	f839ee0c13	fix a latent bug my inline asm stuff exposed: MachineOperand::isIdenticalTo wasn't handling metadata operands. llvm-svn: 100636	2010-04-07 18:03:19 +00:00
Sanjiv Gupta	4948793041	Remove XFAIL for vg_leak as the leaks are fixed by 100601. llvm-svn: 100612	2010-04-07 07:06:48 +00:00
Devang Patel	019922d1b0	Do not emit specification DIE with DW_AT_specification attribute for member functions of a funcation local class. This trips gdb's partial scan of DIEs at load time. Fixes Radar 7833483. llvm-svn: 100586	2010-04-06 23:53:48 +00:00
Jakob Stoklund Olesen	20ea258a09	Let that which does not matter truly slide. This test only cares about alignment, so don't test for other cruft. An upcoming llvm-gcc patch needs this. llvm-svn: 100584	2010-04-06 23:44:44 +00:00
Stuart Hastings	4bd3dd956f	Reverting 100530 & 100531 due to regressions in the GDB test suite. llvm-svn: 100563	2010-04-06 21:38:29 +00:00
Jakob Stoklund Olesen	41051a0bfe	Don't try to collapse DomainValues onto an incompatible SSE domain. This fixes the Bullet regression on i386/nocona. llvm-svn: 100553	2010-04-06 19:48:56 +00:00
Stuart Hastings	c067196984	Revise debug info machinery to digest nested functions and classes. A certain GDB testsuite case (local.cc) has a function nested inside a class nested inside another function. GCC presents the innermost function to llvm-convert first. Heretofore, the debug info mistakenly placed the inner function at module scope. This patch walks the GCC context links and instantiates the outer class and function so the debug info is properly nested. Radar 7426545. llvm-svn: 100530	2010-04-06 17:19:32 +00:00
Evan Cheng	b7a20ee5b5	Add nounwind. llvm-svn: 100482	2010-04-05 22:30:05 +00:00
Dan Gohman	918a90a3ca	Don't do code sinking on unreachable blocks. It's unprofitable and hazardous. llvm-svn: 100455	2010-04-05 19:17:22 +00:00
Evan Cheng	876a5015af	Reverting 100265 to try to get buildbots green again. Lots of self-hosting buildbots started complaining since this commit. Also xfail ARM disassembly tests. llvm-svn: 100378	2010-04-05 01:04:27 +00:00
Chris Lattner	4e4549deea	resolve a fixme. llvm-svn: 100346	2010-04-04 19:28:59 +00:00
Mon P Wang	c576ee9040	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Chris Lattner	40060d33f6	add integer overflow check for the fp induction variable checker. Amusingly, we already had tests that we should have rejects because they would be miscompiled in the testsuite. The remaining issue with this is that we don't check that the branch causes us to exit the loop if it fails, so we don't actually know if we remain in bounds. llvm-svn: 100284	2010-04-03 07:18:48 +00:00
Chris Lattner	40ea690f39	fix PR6761, a miscompilation due to the fp->int IV conversion stuff. More bugs remain though. llvm-svn: 100282	2010-04-03 06:30:03 +00:00
Chris Lattner	2508bcf176	convert to filecheck llvm-svn: 100281	2010-04-03 06:27:56 +00:00
Chris Lattner	8ca668497f	rename feature test. llvm-svn: 100279	2010-04-03 06:24:28 +00:00
Chris Lattner	fc9e88ae76	actually just remove this, will move the real feature test here. llvm-svn: 100278	2010-04-03 06:24:03 +00:00
Chris Lattner	290f42a9c3	rename test since it is a feature test. llvm-svn: 100277	2010-04-03 06:22:52 +00:00
Chris Lattner	c558b49f14	first half of a pass through IndVarSimplify::HandleFloatingPointIV, this cleans up a bunch of code and also fixes several crashes and miscompiles. More to come unfortunately, this optimization is quite broken. llvm-svn: 100270	2010-04-03 05:54:59 +00:00
Bob Wilson	f1aa4743d9	Revert all my SSAUpdater patches. The PHI placement algorithm is not correct (what was I thinking?) and there's also a problem with LCSSA. I'll try again later with fixes. --- Reverse-merging r100263 into '.': U lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100177 into '.': G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100148 into '.': G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100147 into '.': U include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100131 into '.': G include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100130 into '.': G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100126 into '.': G include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100050 into '.': D test/Transforms/GVN/2010-03-31-RedundantPHIs.ll --- Reverse-merging r100047 into '.': G include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp llvm-svn: 100264	2010-04-03 03:50:38 +00:00
Johnny Chen	7b999ea7b7	Second try of initial ARM/Thumb disassembler check-in. It consists of a tablgen backend (ARMDecoderEmitter) which emits the decoder functions for ARM and Thumb, and the disassembler core which invokes the decoder function and builds up the MCInst based on the decoded Opcode. Reviewed by Chris Latter and Bob Wilson. llvm-svn: 100233	2010-04-02 22:27:38 +00:00
Evan Cheng	61399375a2	Correctly lower memset / memcpy of undef. It should be a nop. PR6767. llvm-svn: 100208	2010-04-02 19:36:14 +00:00
Mon P Wang	999c1b927b	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a972ab8564	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Dan Gohman	f7239102fe	Manually notify ScalarEvolution before making an operand replacement, since it can't currently observe such changes automatically. llvm-svn: 100186	2010-04-02 14:48:31 +00:00
Dan Gohman	4bd755419f	Revert the recent alignment changes. They're broken for -Os because, in particular, they end up aligning strings at 16-byte boundaries, and there's no way for GlobalOpt to check OptForSize. llvm-svn: 100172	2010-04-02 03:04:37 +00:00
Evan Cheng	604bc162da	After trivial coalescing, the MI being visited may have become a copy. Avoid adding it to CSE hash table since copies aren't being considered for CSE and they may be deleted. rdar://7819990 llvm-svn: 100170	2010-04-02 02:21:24 +00:00
Dan Gohman	8ceeeb444e	Remove this initializer so that the optimizer doesn't convert unaligned loads into aligned loads. llvm-svn: 100166	2010-04-02 01:26:13 +00:00
Dan Gohman	ffb9c71174	Update this test for the new preferred alignment heuristics. llvm-svn: 100165	2010-04-02 01:24:08 +00:00
Dan Gohman	c671347fcb	Make globalopt refine global variable alignment. llvm-svn: 100160	2010-04-02 00:14:16 +00:00
Evan Cheng	f997c31598	In 64-bit mode, use i64 to lower memcpy / memset instead of f64. llvm-svn: 100137	2010-04-01 20:27:45 +00:00
Evan Cheng	4c014c892a	- Avoid using floating point stores to implement memset unless the value is zero. - Do not try to infer GV alignment unless its type is sized. It's not possible to infer alignment if it has opaque type. llvm-svn: 100118	2010-04-01 18:19:11 +00:00
Evan Cheng	1e8ee79957	Add -mcpu to memcpy / memset tests to ensure they behave the same on all hosts / targets. llvm-svn: 100101	2010-04-01 08:25:26 +00:00
Evan Cheng	43cd9e3845	Fix sdisel memcpy, memset, memmove lowering: 1. Makes it possible to lower with floating point loads and stores. 2. Avoid unaligned loads / stores unless it's fast. 3. Fix some memcpy lowering logic bug related to when to optimize a load from constant string into a constant. 4. Adjust x86 memcpy lowering threshold to make it more sane. 5. Fix x86 target hook so it uses vector and floating point memory ops more effectively. rdar://7774704 llvm-svn: 100090	2010-04-01 06:04:33 +00:00
Chris Lattner	6159249891	change this from using '!dbg' to using '!dbgx'. The MD used here isn't valid for !dbg. llvm-svn: 100085	2010-04-01 05:13:10 +00:00
Bob Wilson	b9fb48bff7	Add a redundant PHI testcase for SSAUpdater to go with svn r100047. llvm-svn: 100050	2010-03-31 21:38:43 +00:00
Gabor Greif	6882a5eea1	testcase for r99914, provided by baldrick! llvm-svn: 100043	2010-03-31 20:37:13 +00:00
Jakob Stoklund Olesen	9986ba954c	Replace V_SET0 with variants for each SSE execution domain. llvm-svn: 99975	2010-03-31 00:40:13 +00:00
Jakob Stoklund Olesen	710c6892be	Fix typo. Thank you, valgrind. llvm-svn: 99974	2010-03-31 00:40:08 +00:00
Jakob Stoklund Olesen	19aa6f72a0	Not all platforms start symbols with _ llvm-svn: 99959	2010-03-30 23:12:48 +00:00
Jakob Stoklund Olesen	6f6ebb663c	Enable -sse-domain-fix by default. Now with tests! llvm-svn: 99954	2010-03-30 22:47:00 +00:00
Bob Wilson	6f7fd28824	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Devang Patel	57c644f926	Ignore invalid metadata. llvm-svn: 99938	2010-03-30 22:09:52 +00:00
Mon P Wang	7460571381	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Eric Christopher	6ad8167714	Remove the pmulld intrinsic and autoupdate it as a vector multiply. Rewrite the pmulld patterns, and make sure that they fold in loads of arguments into the instruction. llvm-svn: 99910	2010-03-30 18:49:01 +00:00
Benjamin Kramer	0c1dcb083e	XFAIL some PIC16 tests when running under valgrind-leaks. I don't expect these to be fixed any time soon. llvm-svn: 99888	2010-03-30 14:34:13 +00:00
Daniel Dunbar	c95156262d	MC/Mach-O/x86_64: Support @GOTPCREL on symbols, even for non-PCrel relocations! llvm-svn: 99853	2010-03-29 23:56:40 +00:00
Evan Cheng	742db6874a	Fix PR4975. Avoid referencing empty vector. llvm-svn: 99840	2010-03-29 21:27:30 +00:00
Chris Lattner	f60c556b91	From Kalle Raiskila: "the bigstack patch for SPU, with testcase. It is essentially the patch committed as 97091, and reverted as 97099, but with the following additions: -in vararg handling, registers are marked to be live, to not confuse the register scavenger -function prologue and epilogue are not emitted, if the stack size is 16. 16 means it is empty - there is only the register scavenger emergency spill slot, which is not used as there is no stack." llvm-svn: 99819	2010-03-29 17:38:47 +00:00
Chris Lattner	61f3bd6772	add support for zero initialized unions, patch by Tim Northover! llvm-svn: 99818	2010-03-29 17:36:02 +00:00
Chris Lattner	a787c9e23a	teach tblgen to allow patterns like (add (i32 (bitconvert (i32 GPR))), 4), transforming it into (add (i32 GPR), 4). This allows us to write type generic multi patterns and have tblgen automatically drop the bitconvert in the case when the types align. This allows us to fold an extra load in the changed testcase. llvm-svn: 99756	2010-03-28 08:38:32 +00:00
Chris Lattner	6bba2f3c69	add some nounwinds llvm-svn: 99752	2010-03-28 07:58:37 +00:00
Chris Lattner	108667f3ec	this takes an insane amount of time to run, disable it for now (PR6727) llvm-svn: 99751	2010-03-28 07:58:09 +00:00
Jeffrey Yasskin	b832e3276a	XFAIL a new tblgen test for memory leak checking. llvm-svn: 99707	2010-03-27 04:59:47 +00:00
Evan Cheng	3365fb1412	Do not sibcall if stack needs to be dynamically aligned. llvm-svn: 99620	2010-03-26 16:26:03 +00:00
Evan Cheng	00a620c61e	Allow trivial sibcall of vararg callee when no arguments are being passed. llvm-svn: 99598	2010-03-26 02:13:13 +00:00
Evan Cheng	7b4a1a221b	Try trivial remat before the coalescer gives up on a vr / physreg coalescing for fear of tying up a physical register. llvm-svn: 99575	2010-03-26 00:07:25 +00:00
Jim Grosbach	71fcb4fedd	switch the flag for using NEON for SP floating point to a subtarget 'feature'. Re-commit. This time complete with testsuite updates. llvm-svn: 99570	2010-03-25 23:47:34 +00:00
Evan Cheng	dbcf861a96	Add nounwind. llvm-svn: 99546	2010-03-25 20:01:07 +00:00
Bob Wilson	e543e7fcb1	Reapply Kevin's change 94440, now that Chris has fixed the limitation on opcode values fitting in one byte (svn r99494). llvm-svn: 99514	2010-03-25 16:36:14 +00:00
Jakob Stoklund Olesen	0e45762250	Fix evil TableGen bug in template parameters with defaults. If a TableGen class has an initializer expression containing an X.Y subexpression, AND X depends on template parameters, AND those template parameters have defaults, AND some parameters with defaults are beyond position 1, THEN parts of the initializer expression are evaluated prematurely with the default values when the first explicit template parameter is substituted, before the remaining explicit template parameters have been substituted. llvm-svn: 99492	2010-03-25 06:23:34 +00:00
Chris Lattner	0563804982	fix PR6642, GVN forwarding from memset to load of the base of the memset. llvm-svn: 99488	2010-03-25 05:58:19 +00:00
Chris Lattner	4690af8567	Make the NDEBUG assertion stronger and more clear what is happening. Enhance scheduling to set the DEAD flag on implicit defs more aggressively. Before, we'd set an implicit def operand to dead if it were present in the SDNode corresponding to the machineinstr but had no use. Now we do it in this case AND if the implicit def does not exist in the SDNode at all. This exposes a couple of problems: one is the FIXME, which causes a live intervals crash on CodeGen/X86/sibcall.ll. The second is that it makes machinecse and licm more aggressive (which is a good thing) but also exposes a case where licm hoists a set0 and then it doesn't get resunk. Talking to codegen folks about both these issues, but I need this patch in in the meantime. llvm-svn: 99485	2010-03-25 05:40:48 +00:00
Eric Christopher	b1a382d8b9	Reapply r99451 with a fix to move the NoInline check to the cost functions instead of InlineFunction. llvm-svn: 99483	2010-03-25 04:49:10 +00:00
Eric Christopher	5bbda5130f	Make sure this runs in 64-bit only, 32-bit won't produce the correct stores. Fariborz please review and make sure this is what you meant. llvm-svn: 99472	2010-03-25 01:46:07 +00:00
Daniel Dunbar	5caf2ff561	MC: Fix refacto in MCExpr evaluation, I mistakenly replaced a fragment address with a symbol address. - This fixes the integrated-as nightly test regressions. llvm-svn: 99466	2010-03-25 01:03:17 +00:00
Eric Christopher	1d38538fb6	Temporarily revert this, it's causing an issue with an internal project. llvm-svn: 99451	2010-03-24 23:35:21 +00:00
Bob Wilson	5b2da69f6d	Speculatively revert this to see if it fixes buildbot failures. --- Reverse-merging r99440 into '.': U test/MC/AsmParser/X86/x86_32-bit_cat.s U test/MC/AsmParser/X86/x86_32-encoding.s U include/llvm/IntrinsicsX86.td U include/llvm/CodeGen/SelectionDAGNodes.h U lib/Target/X86/X86InstrSSE.td U lib/Target/X86/X86ISelLowering.h llvm-svn: 99450	2010-03-24 23:26:29 +00:00
Kevin Enderby	f5584a7397	Added the Advanced Encryption Standard (AES) Instructions. llvm-svn: 99440	2010-03-24 22:33:33 +00:00
Kevin Enderby	b96eb68497	Fixed the SS42AI template for the SSE 4.2 instructions with TA prefix so it does not get an "Unknown immediate size" assert failure when used. All instructions of this form have an 8-bit immediate. Also added a test case of an example instruction that is of this form. llvm-svn: 99435	2010-03-24 22:28:42 +00:00
Nate Begeman	583e05d8ce	BUILD_VECTOR was missing out on some prime opportunities to use SSE 4.1 inserts. llvm-svn: 99423	2010-03-24 20:49:50 +00:00
Bob Wilson	4d87012eb3	Revert Edwin's change that is breaking MultiSource/Applications/ClamAV/clamscan. --- Reverse-merging r99400 into '.': D test/CodeGen/Generic/2010-03-24-liveintervalleak.ll U lib/CodeGen/LiveIntervalAnalysis.cpp llvm-svn: 99419	2010-03-24 20:25:25 +00:00
Devang Patel	d7a6cc5129	Do not rely on getCompileUnit() to find source file information for a subprogram. llvm-svn: 99410	2010-03-24 18:48:00 +00:00
Torok Edwin	4bbfdd41ea	Fix memory leak in liveintervals: the destructor for VNInfos must be called, otherwise the SmallVector it contains doesn't free its memory. In most cases LiveIntervalAnalysis could get away by not calling the destructor, because VNInfos are bumpptr-allocated, and smallvectors usually don't grow. However when the SmallVector does grow it always leaks. This is the valgrind shown leak from the original testcase: ==8206== 18,304 bytes in 151 blocks are definitely lost in loss record 164 of 164 ==8206== at 0x4A079C7: operator new(unsigned long) (vg_replace_malloc.c:220) ==8206== by 0x4DB7A7E: llvm::SmallVectorBase::grow_pod(unsigned long, unsigned long) (in /home/edwin/clam/git/builds/defaul t/libclamav/.libs/libclamav.so.6.1.0) ==8206== by 0x4F90382: llvm::VNInfo::addKill(llvm::SlotIndex) (in /home/edwin/clam/git/builds/default/libclamav/.libs/libcl amav.so.6.1.0) ==8206== by 0x5126B5C: llvm::LiveIntervals::handleVirtualRegisterDef(llvm::MachineBasicBlock, llvm::ilist_iterator<llvm::M achineInstr>, llvm::SlotIndex, llvm::MachineOperand&, unsigned int, llvm::LiveInterval&) (in /home/edwin/clam/git/builds/defau lt/libclamav/.libs/libclamav.so.6.1.0) ==8206== by 0x512725E: llvm::LiveIntervals::handleRegisterDef(llvm::MachineBasicBlock, llvm::ilist_iterator<llvm::MachineI nstr>, llvm::SlotIndex, llvm::MachineOperand&, unsigned int) (in /home/edwin/clam/git/builds/default/libclamav/.libs/libclamav .so.6.1.0) ==8206== by 0x51278A8: llvm::LiveIntervals::computeIntervals() (in /home/edwin/clam/git/builds/default/libclamav/.libs/libc lamav.so.6.1.0) ==8206== by 0x5127CB4: llvm::LiveIntervals::runOnMachineFunction(llvm::MachineFunction&) (in /home/edwin/clam/git/builds/de fault/libclamav/.libs/libclamav.so.6.1.0) ==8206== by 0x4DAE935: llvm::FPPassManager::runOnFunction(llvm::Function&) (in /home/edwin/clam/git/builds/default/libclama v/.libs/libclamav.so.6.1.0) ==8206== by 0x4DAEB10: llvm::FunctionPassManagerImpl::run(llvm::Function&) (in /home/edwin/clam/git/builds/default/libclama v/.libs/libclamav.so.6.1.0) ==8206== by 0x4DAED3D: llvm::FunctionPassManager::run(llvm::Function&) (in /home/edwin/clam/git/builds/default/libclamav/.l ibs/libclamav.so.6.1.0) ==8206== by 0x4D8BE8E: llvm::JIT::runJITOnFunctionUnlocked(llvm::Function, llvm::MutexGuard const&) (in /home/edwin/clam/git/builds/default/libclamav/.libs/libclamav.so.6.1.0) ==8206== by 0x4D8CA72: llvm::JIT::getPointerToFunction(llvm::Function) (in /home/edwin/clam/git/builds/default/libclamav/.libs/libclamav.so.6.1.0) llvm-svn: 99400	2010-03-24 13:50:36 +00:00
Chris Lattner	00eeac4179	add some accessors to callsite/callinst/invokeinst to check for the noinline attribute, and make the inliner refuse to inline a call site when the call site is marked noinline even if the callee isn't. This fixes PR6682. llvm-svn: 99341	2010-03-23 22:59:07 +00:00
Stuart Hastings	2b9735138e	Test case for llvm-gcc r99305. Radar 7659636. llvm-svn: 99306	2010-03-23 18:39:23 +00:00
Evan Cheng	d9e822345c	Teach simplify libcall to transform __strcpy_chk to __memcpy_chk to enable optimizations down stream. llvm-svn: 99282	2010-03-23 15:48:04 +00:00
Evan Cheng	3f7842232e	Fix an incorrect logic causing instcombine to miss some _chk -> non-chk transformations. llvm-svn: 99263	2010-03-23 06:06:09 +00:00
Chris Lattner	b1c4f62cac	Fix PR6673: updating the callback should not clear the map. llvm-svn: 99227	2010-03-22 23:15:57 +00:00
Devang Patel	d22ed622b3	Emit DW_AT_low_pc and DW_AT_high_pc attributes for TAG_compile_unit. llvm-svn: 99225	2010-03-22 23:11:36 +00:00
Jeffrey Yasskin	c91f200c17	XFAIL tests from LLVMC on valgrind or valgrind+leak-checking. We don't care about leaks from tblgen, and I assume we don't care about valgrind errors in llvm-gcc/g++. llvm-svn: 99115	2010-03-21 08:12:46 +00:00
Jeffrey Yasskin	2f87b54f1a	Add support for XFAILing valgrind runs with memory leak checking independently of runs without leak checking. We add -vg to the triple for non-checked runs, or -vg_leak for checked runs. Also use this to XFAIL the TableGen tests, since tablegen leaks like a sieve. This includes some valgrindArgs refactoring. llvm-svn: 99103	2010-03-20 23:08:45 +00:00
Daniel Dunbar	98055cc154	MC/Mach-O: Remove Darwin host specific tests, we don't need them anymore. llvm-svn: 99100	2010-03-20 22:36:32 +00:00
Daniel Dunbar	9f4f9f9cf4	MC/Mach-O: Tweak optimal_nop test to be host independent. - This also avoids us running valgrind on /usr/bin/as, which has leaks. :) llvm-svn: 99099	2010-03-20 22:36:29 +00:00
Bob Wilson	162242b63b	pr6652: Use LDM to restore PC to the return address on ARMv4. Patch by John Tytgat! llvm-svn: 99096	2010-03-20 22:20:40 +00:00
Daniel Dunbar	e848de3911	tests: Mangle '-vg' onto the end of the triple when running under valgrind, so we can use the standard XFAIL and XTARGET to conditional tests based on valgrind. llvm-svn: 99088	2010-03-20 21:12:48 +00:00
Evan Cheng	b8d1fd0553	Stupid svn. Add back to the lost sibcall tests. llvm-svn: 99033	2010-03-20 03:17:05 +00:00
Devang Patel	5002454d2a	call void @llvm.dbg.declare(metadata !{i32* null}, metadata !1 ) is valid, but not useful, when variable identified by !1 is optimized away by the optimizer. llvm-svn: 98986	2010-03-19 21:06:24 +00:00
Kevin Enderby	cf0843ed93	Fixed the encoding problems of the crc32 instructions. All had the Operand size override prefix and only the r/m16 forms should have had that. Also for variant one, the AT&T syntax, added suffixes to all forms. Also added the missing 64-bit form for 'CRC32 r64, r/m8'. Plus added test cases for all forms and tweaked one test case to add the needed suffixes. llvm-svn: 98980	2010-03-19 20:04:42 +00:00
Daniel Dunbar	1a81ad3559	MC/Mach-O/x86_64: Add relocation support. - This is "extraordinarily" Darwin 'as' compatible. See the litany of FIXMEs littered about for more information. - There are a few cases which seem to clearly be 'as' bugs which I have left unsupported, and there is one cases where we diverge but should fix if it blocks diffing .o files (Darwin 'as' ends up widening a jump unnecessarily). - 403.gcc build, runs, and diffs equivalently to the 'as' built version now (using llvm-mc). However, it builds so slowly that I wouldn't recommend trying it quite yet. :) llvm-svn: 98974	2010-03-19 18:07:55 +00:00
Daniel Dunbar	c532697372	MC/X86: Rename alternate spellings of {ADD64,CMP64} and mark as "code gen only" so they don't get selected by the asm matcher. llvm-svn: 98972	2010-03-19 18:07:48 +00:00
Daniel Dunbar	5ec4bdd1b3	MC/Mach-O: Factor out isScatteredFixupFullyResolvedSimple predicate, and fix some corner cases. llvm-svn: 98924	2010-03-19 03:18:12 +00:00
Mon P Wang	7ad43f8768	Fixed a widening bug where we were not using the correct size for the load llvm-svn: 98920	2010-03-19 01:19:52 +00:00
Daniel Dunbar	c9deca20e8	X86: Fix encoding for TEST64rr. llvm-svn: 98919	2010-03-19 01:15:03 +00:00
Jeffrey Yasskin	fbd0109ca4	Remove `ignore` from LLVMC/TestWarnings.td. This avoids https://bugs.kde.org/show_bug.cgi?id=231257 and seems not to have been needed in the first place. llvm-svn: 98917	2010-03-19 01:10:41 +00:00
Jeffrey Yasskin	1734e47d20	Revert r98892. BSD systems may not have bash installed at all. llvm-svn: 98909	2010-03-19 00:32:11 +00:00
Jeffrey Yasskin	3eb346caeb	Work around a valgrind oddity where it doesn't pass the full path of a script to the #! command by using bash instead of /bin/sh. Bash searches $PATH for its script argument, but dash, which /bin/sh resolves to on some systems, does not. https://bugs.kde.org/show_bug.cgi?id=231257 tracks the valgrind problem. llvm-svn: 98892	2010-03-18 22:56:02 +00:00
Daniel Dunbar	2ca1108254	X86MCCodeEmitter: Fix two minor issues with reloc_riprel_4byte_movq_load, we were missing it on some movq instructions and were not including the appropriate PCrel bias. llvm-svn: 98880	2010-03-18 21:53:54 +00:00
Daniel Dunbar	63ec093b6e	MC/X86/AsmMatcher: Use the new instruction cleanup routine to implement a temporary workaround for matching inc/dec on x86_64 to the correct instruction. - This hack will eventually be replaced with a robust mechanism for handling matching instructions based on the available target features. llvm-svn: 98858	2010-03-18 20:06:02 +00:00
Chris Lattner	b3f659c8c8	fix an x86-64 encoding bug Daniel found. llvm-svn: 98855	2010-03-18 20:04:36 +00:00
Chris Lattner	a3a66b28b6	add a special relocation type for movq loads for object files that produce special relocation types where the linker changes movq's into lea's. llvm-svn: 98839	2010-03-18 18:10:56 +00:00
Evan Cheng	bf724b9ee0	Turning off post-ra scheduling for x86. It isn't a consistent win. llvm-svn: 98810	2010-03-18 06:55:42 +00:00
Evan Cheng	68333f5c6e	X86 address mode matching code MatchAddressRecursively does some aggressive hack which require doing a RAUW. It may end up deleting some SDNode up stream. It should avoid referencing deleted nodes. llvm-svn: 98780	2010-03-17 23:58:35 +00:00
Johnny Chen	8f3004cff2	Added sub-formats to the NeonI/NeonXI instructions to further refine the NEONFrm instructions to help disassembly. We also changed the output of the addressing modes to omit the '+' from the assembler syntax #+/-<imm> or +/-<Rm>. See, for example, A8.6.57/58/60. And modified test cases to not expect '+' in +reg or #+num. For example, ; CHECK: ldr.w r9, [r7, #28] llvm-svn: 98745	2010-03-17 17:52:21 +00:00
Stuart Hastings	6981c258f7	Testcase for r98728. llvm-svn: 98744	2010-03-17 17:51:08 +00:00
Evan Cheng	403062313f	Fix liveintervals handling of dbg_value instructions. llvm-svn: 98686	2010-03-16 21:51:27 +00:00
Daniel Dunbar	8801b810bb	Revert r98666 too; it's checkin-without-testing day! llvm-svn: 98673	2010-03-16 20:52:59 +00:00
Chris Lattner	5aa4a42c77	temporarily xfail llvm-svn: 98666	2010-03-16 20:08:07 +00:00
Dan Gohman	5a6dc1dd09	Add an rdar number to this test. llvm-svn: 98654	2010-03-16 19:08:20 +00:00
Duncan Sands	e0fa09cb05	Chris pointed out that producing undef here is wrong in general. llvm-svn: 98649	2010-03-16 18:50:54 +00:00
Bob Wilson	1b4e8cc69c	--- Reverse-merging r98637 into '.': U test/CodeGen/ARM/tls2.ll U test/CodeGen/ARM/arm-negative-stride.ll U test/CodeGen/ARM/2009-10-30.ll U test/CodeGen/ARM/globals.ll U test/CodeGen/ARM/str_pre-2.ll U test/CodeGen/ARM/ldrd.ll U test/CodeGen/ARM/2009-10-27-double-align.ll U test/CodeGen/Thumb2/thumb2-strb.ll U test/CodeGen/Thumb2/ldr-str-imm12.ll U test/CodeGen/Thumb2/thumb2-strh.ll U test/CodeGen/Thumb2/thumb2-ldr.ll U test/CodeGen/Thumb2/thumb2-str_pre.ll U test/CodeGen/Thumb2/thumb2-str.ll U test/CodeGen/Thumb2/thumb2-ldrh.ll U utils/TableGen/TableGen.cpp U utils/TableGen/DisassemblerEmitter.cpp D utils/TableGen/RISCDisassemblerEmitter.h D utils/TableGen/RISCDisassemblerEmitter.cpp U Makefile.rules U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/Makefile U lib/Target/ARM/AsmPrinter/ARMInstPrinter.cpp U lib/Target/ARM/AsmPrinter/ARMAsmPrinter.cpp U lib/Target/ARM/AsmPrinter/ARMInstPrinter.h D lib/Target/ARM/Disassembler U lib/Target/ARM/ARMInstrFormats.td U lib/Target/ARM/ARMAddressingModes.h U lib/Target/ARM/Thumb2ITBlockPass.cpp llvm-svn: 98640	2010-03-16 16:59:47 +00:00
Johnny Chen	3d9327bd06	Initial ARM/Thumb disassembler check-in. It consists of a tablgen backend (RISCDisassemblerEmitter) which emits the decoder functions for ARM and Thumb, and the disassembler core which invokes the decoder function and builds up the MCInst based on the decoded Opcode. Added sub-formats to the NeonI/NeonXI instructions to further refine the NEONFrm instructions to help disassembly. We also changed the output of the addressing modes to omit the '+' from the assembler syntax #+/-<imm> or +/-<Rm>. See, for example, A8.6.57/58/60. And modified test cases to not expect '+' in +reg or #+num. For example, ; CHECK: ldr.w r9, [r7, #28] llvm-svn: 98637	2010-03-16 16:36:54 +00:00
Bob Wilson	298a83ecfe	Stop using the old pre-UAL syntax for LDM/STM instruction suffixes. This does not move entirely to UAL syntax, since the default "increment after" suffix is empty but we still use "IA" for that. llvm-svn: 98635	2010-03-16 16:19:07 +00:00
Duncan Sands	57f1191b0d	Check that P is not zero initialized. llvm-svn: 98627	2010-03-16 11:36:35 +00:00
Bob Wilson	5125770346	Add a testcase for the change in r98586. llvm-svn: 98610	2010-03-16 05:33:29 +00:00
Bill Wendling	31d7f0d96a	Forgot testcase for r98599. llvm-svn: 98602	2010-03-16 01:54:20 +00:00
Chris Lattner	db035a0af2	Fix the third (and last known) case of code update problems due to LLVM IR changes with addr label weirdness. In the testcase, we generate references to the two bb's when codegen'ing the first function: _test1: ## @test1 leaq Ltmp0(%rip), %rax .. leaq Ltmp1(%rip), %rax Then continue to codegen the second function where the blocks get merged. We're now smart enough to emit both labels, producing this code: _test_fun: ## @test_fun ## BB#0: ## %entry Ltmp1: ## Block address taken Ltmp0: ## BB#1: ## %ret movl $-1, %eax ret Rejoice. llvm-svn: 98595	2010-03-16 00:29:39 +00:00
Daniel Dunbar	5599256415	MC: Allow modifiers in MCSymbolRefExpr, and eliminate X86MCTargetExpr. - Although it would be nice to allow this decoupling, the assembler needs to be able to reason about MCSymbolRefExprs in too many places to make this viable. We can use a target specific encoding of the variant if this becomes an issue. - This patch also extends llvm-mc to support parsing of the modifiers, as opposed to lumping them in with the symbol. llvm-svn: 98592	2010-03-15 23:51:06 +00:00
Dan Gohman	c6ddebd6d1	Recognize code for doing vector gather/scatter index calculations with 32-bit indices. Instead of shuffling each element out of the index vector, when all indices are needed, just store the input vector to the stack and load the elements out. llvm-svn: 98588	2010-03-15 23:23:03 +00:00
Daniel Dunbar	fe8d866fc7	MC/Mach-O/x86_64: Temporary labels in cstring sections require symbols (and external relocations, but we don't have x86_64 relocations yet). llvm-svn: 98583	2010-03-15 21:56:50 +00:00
Chris Lattner	561334a81f	Implement support for the case when a reference to a addr-of-bb label is generated, but then the block is deleted. Since the value is undefined, we just emit the label right after the entry label of the function. It might matter that the label is in the same section as the function was afterall. llvm-svn: 98579	2010-03-15 20:39:00 +00:00
Chris Lattner	347a0eb85c	Fix the case when a reference to an address taken BB is emitted in one function, then the BB is RAUW'd before the definition is emitted. There are still two cases not being handled, but this should improve us back to the situation before I touched anything. llvm-svn: 98566	2010-03-15 19:09:43 +00:00
Chris Lattner	d03a956a01	filecheckize a test and mark these wiht a cpu so it passes on hosts without cmovs. llvm-svn: 98521	2010-03-14 22:31:16 +00:00
Duncan Sands	ca595495e4	Turn calls to copysignl into an FCOPYSIGN node. Handle FCOPYSIGN nodes with ppc_f128 type by having the type legalizer turn these back into a call to copysignl. llvm-svn: 98514	2010-03-14 21:08:40 +00:00
Chris Lattner	f71cb6c439	fix ShrinkDemandedOps to not leave dead nodes around, fixing PR6607 llvm-svn: 98512	2010-03-14 19:46:02 +00:00
Chris Lattner	5049f23592	don't have i386-specific tests in CodeGen/Generic, PR6601. llvm-svn: 98508	2010-03-14 18:51:18 +00:00
Chris Lattner	6feb7e3325	fix PR6605, X86ISD::CMP always returns i32 (EFLAGS), not the operand type. llvm-svn: 98507	2010-03-14 18:44:35 +00:00
Anton Korobeynikov	79a7c7823d	Fix typo llvm-svn: 98506	2010-03-14 18:42:52 +00:00
Anton Korobeynikov	846a117892	Feature test for half precision FP. llvm-svn: 98504	2010-03-14 18:42:43 +00:00
Chris Lattner	9efbbcbe45	fix AsmPrinter::GetBlockAddressSymbol to always return a unique label instead of trying to form one based on the BB name (which causes collisions if the name is empty). This fixes PR6608 llvm-svn: 98495	2010-03-14 17:53:23 +00:00
Chris Lattner	6e52e9db31	get MMI out of the label uniquing business, just go to MCContext to get unique assembler temporary labels. llvm-svn: 98489	2010-03-14 08:36:50 +00:00
Chris Lattner	54109deae3	xfail properly llvm-svn: 98479	2010-03-14 07:55:34 +00:00
Chris Lattner	1e2dc539b9	xfail these tests temporarily to get teh buildbots back to happy land. llvm-svn: 98476	2010-03-14 07:32:48 +00:00
Evan Cheng	d703df67ce	Do not force indirect tailcall through fixed registers: eax, r11. Add support to allow loads to be folded to tail call instructions. llvm-svn: 98465	2010-03-14 03:48:46 +00:00
Daniel Dunbar	d324a7c990	X86: Fix ADD64i32 encoding. llvm-svn: 98457	2010-03-13 22:49:39 +00:00
Daniel Dunbar	7c1f3d8cad	MC/X86_64: Symbol support. llvm-svn: 98456	2010-03-13 22:49:35 +00:00
Daniel Dunbar	56597588f0	MC/Mach-O: Initial x86_64 support. llvm-svn: 98454	2010-03-13 22:10:17 +00:00
Daniel Dunbar	c19fb6f96d	macho-dump: Basic Mach 64 support. llvm-svn: 98453	2010-03-13 22:10:11 +00:00
Daniel Dunbar	906a432031	MC/X86_64: Fix matching of leaq. llvm-svn: 98444	2010-03-13 19:31:44 +00:00
Daniel Dunbar	e60c883bf4	MC/X86_64: Fix matching of callq. llvm-svn: 98443	2010-03-13 19:31:38 +00:00
Daniel Dunbar	34b8e553ea	MC/Mach-O: PCrel relocations weren't using the right base address, they are relative to the fragment address, not its offset. This was masked by the text section normally being at address 0. llvm-svn: 98420	2010-03-13 02:38:00 +00:00
Evan Cheng	2a65429671	Fix a typo in ValueTracking that's causing instcombine to delete needed shift instructions. llvm-svn: 98416	2010-03-13 02:20:29 +00:00
Daniel Dunbar	18fc344290	MC/X86: Add temporary hack to match shrl $1,%eax correctly, to support testing other functionality on 403.gcc compiled at -O0. llvm-svn: 98405	2010-03-13 00:47:29 +00:00
Daniel Dunbar	b86672059e	MC/X86: Add an XFAIL test where we aren't matching the correct instruction because we don't understand how the specific instruction is doing sign extension. llvm-svn: 98404	2010-03-13 00:47:25 +00:00
Daniel Dunbar	12f1e32d59	MC/Mach-O: Implement initial support for relaxation. - The implementation is currently very brain dead and inefficient, but I have a clear plan on how to fix it. - The good news is, it works and correctly assembles 403.gcc (when built with Clang, at '-Os', '-Os -g', and '-O3'). Even better, at '-Os' and '-Os -g', the resulting binary is exactly equivalent to that when built with the system assembler. So it probably works! :) llvm-svn: 98396	2010-03-12 22:07:14 +00:00
Devang Patel	d19e302f77	Fix llc crash on invalid input. llvm-svn: 98369	2010-03-12 19:18:30 +00:00
Chris Lattner	d75813970a	simplify code to use OutContext.GetOrCreateTemporarySymbol with no arguments instead of having to come up with a unique name. This also makes the code less fragile. llvm-svn: 98364	2010-03-12 18:47:50 +00:00
Duncan Sands	8c35506fbd	When constant folding GEP of GEP, do not crash if an index of the inner GEP is not a ConstantInt. llvm-svn: 98359	2010-03-12 17:55:20 +00:00
Chris Lattner	53ebf8a7ca	fix PR6577, a bug in sdbuilder lowering select instructions whose true value was not Val#0. llvm-svn: 98336	2010-03-12 07:15:36 +00:00
Bill Wendling	00810c39da	revert r98270. llvm-svn: 98281	2010-03-11 19:50:31 +00:00
Evan Cheng	31fe835bf2	Bad bad bug. x86 force indirect tail call address into eax when it's meant to force it into a call preserved register instead. Change it to ecx for now. llvm-svn: 98270	2010-03-11 18:49:14 +00:00
Richard Osborne	4780109254	Add dag combine to simplify lmul(x, 0, a, b) llvm-svn: 98258	2010-03-11 16:26:35 +00:00
Evan Cheng	8c4df8160e	The check for coalescing a virtual register to a physical register, e.g. cl = EXTRACT_SUBREG reg1024, 1, is overly conservative. It should check for overlaps of vr's live interval with the super registers of the physical register (ECX in this case) and let JoinIntervals() handle checking the coalescing feasibility against the physical register (cl in this case). llvm-svn: 98251	2010-03-11 08:20:21 +00:00
Eric Christopher	304f13c637	Have fast-isel understand llvm.objectsize. Update testcase for slightly different codegen. llvm-svn: 98244	2010-03-11 06:20:22 +00:00
Daniel Dunbar	5c5228a8f6	MC/Mach-O: Implement "absolutizing" semantics of .set, by evaluating the assembly time value of variables. llvm-svn: 98241	2010-03-11 05:53:37 +00:00
Chris Lattner	a179e4d0a8	add support, testcases, and dox for the new GHC calling convention. Patch by David Terei! llvm-svn: 98212	2010-03-11 00:22:57 +00:00
Chris Lattner	4ec0b670d5	fix PR6533 by updating the br(xor) code to remember the case when it looked past a trunc. llvm-svn: 98203	2010-03-10 23:46:44 +00:00
Dan Gohman	474e488c06	Constant-fold GEP-of-GEP into a single GEP. llvm-svn: 98178	2010-03-10 19:31:51 +00:00
Dan Gohman	fc7a25dc36	Fix whitespace. llvm-svn: 98173	2010-03-10 19:00:54 +00:00
Tobias Grosser	ab19e1e9b5	Fix make check with cmake/lit PR6540: Set the newly introduced variables ENABLE_SHARED and SHLIBPATH_VAR in lit.site.cfg not only in the autoconf build, but also in a cmake one. llvm-svn: 98171	2010-03-10 18:41:59 +00:00
Richard Osborne	54a2c32670	Handle MVT::i64 type in DAG combine for ISD::ADD. Fold 64 bit expression add(add(mul(x,y),a),b) -> lmul(x,y,a,b) if all operands are zero extended. llvm-svn: 98168	2010-03-10 18:12:27 +00:00
Bob Wilson	dfebf1ffac	Testcase for pr6552. I changed the code to use "ip" instead of "fp" because the "fp" register name is not valid on Darwin, and the "ip" register name was broken for all ARM targets. llvm-svn: 98166	2010-03-10 17:54:11 +00:00
Richard Osborne	1a396d53ed	Fold add(add(mul(x,y),a),b) -> lmul(x,y,a,b) if the intermediate results are unused elsewhere. llvm-svn: 98157	2010-03-10 16:19:31 +00:00
Richard Osborne	f57aea3d38	Prefer LMUL to MACCU as LMUL has no tied operands. llvm-svn: 98153	2010-03-10 13:27:10 +00:00
Richard Osborne	0012bc1e41	Custom lower (S\|U)MUL_LOHI -> MACC(S\|U) llvm-svn: 98152	2010-03-10 13:20:07 +00:00
Richard Osborne	54dfa01adc	Lower add (mul a, b), c into MACCU / MACCS nodes which translate directly to the maccu / maccs instructions. We handle this in ExpandADDSUB since after type legalisation it is messy to recognise these operations. llvm-svn: 98150	2010-03-10 11:41:08 +00:00
Richard Osborne	e35eabdd69	Convert test to FileCheck. llvm-svn: 98148	2010-03-10 11:24:03 +00:00
Evan Cheng	72811e8714	Fix typo. llvm-svn: 98142	2010-03-10 07:07:55 +00:00
Evan Cheng	a3b6739749	Unbreak test on Linux. llvm-svn: 98141	2010-03-10 07:07:45 +00:00
Evan Cheng	80ad113731	Enable machine cse pass. llvm-svn: 98132	2010-03-10 03:07:41 +00:00
Daniel Dunbar	27b984ac85	MC/Mach-O: Use the SECTDIFF relocation type for (A - B + constant) where A is external. - I'm not sure why, but this is what 'as' does. llvm-svn: 98115	2010-03-10 00:58:25 +00:00
Dan Gohman	69451a0950	Avoid analyzing instructions in blocks not reachable from the entry block. They are lots of trouble, and they don't matter. This fixes PR6559. llvm-svn: 98103	2010-03-09 23:46:50 +00:00
Daniel Dunbar	b70c2f795e	MC/X86: Rename alternate spellings of ADD{8,16,32} and mark as "code gen only" so they don't get selected by the asm matcher. llvm-svn: 98098	2010-03-09 22:50:46 +00:00
Daniel Dunbar	f5b6a1118d	MC/X86: Rename alternate spellings of CMP{8,16,32} and mark as "code gen only" so they don't get selected by the asm matcher. llvm-svn: 98097	2010-03-09 22:50:40 +00:00
Daniel Dunbar	3dde457b94	MC/Mach-O: For PCrel relocations, we need to compensate for the PCrel adjustment when determining if we need a scattered relocation. llvm-svn: 98082	2010-03-09 21:27:58 +00:00
Dale Johannesen	90eab67320	The address of an indirect call must be in R12 on Darwin. Make it so. (This patch is in LowerCall_Darwin, which seems to be used by SVR4 code as well; since that doesn't belong here, I haven't worried about this case.) llvm-svn: 98077	2010-03-09 20:15:42 +00:00
Richard Osborne	c420c4cb4e	In cases where the carry / borrow unused converted ladd / lsub to an add or a sub. llvm-svn: 98059	2010-03-09 16:34:25 +00:00
Richard Osborne	f4e76cf44d	Add DAG combine for ladd / lsub. llvm-svn: 98057	2010-03-09 16:07:47 +00:00
Dan Gohman	93452cebda	Make isLCSSA ignore uses in blocks not reachable from the entry block, as LCSSA no longer transforms such uses. llvm-svn: 98033	2010-03-09 01:53:33 +00:00
Devang Patel	59445dbf78	Start using DIFile. See updated SourceLevelDebugging.html for more information. This patch updates LLVMDebugVersion to 8. Debug info descriptors encoded using LLVMDebugVersion 7 is supported. Corresponding llvmgcc and clang FE commits are required. llvm-svn: 98020	2010-03-09 00:44:10 +00:00
Chris Lattner	9889c1eb9e	move .set generation out of DwarfPrinter into AsmPrinter and MCize it. llvm-svn: 98010	2010-03-08 23:58:37 +00:00
Chris Lattner	27a9732450	simplify EmitSectionOffset to always use .set if it is available, the only thing this affects is that we produce .set in one case we didn't before, which shouldn't harm anything. Make EmitSectionOffset call EmitDifference instead of duplicating it. llvm-svn: 98005	2010-03-08 23:23:25 +00:00
Bob Wilson	0bfbd9b68c	Fix a crash compiling 254.gap for Thumb2. The Thumb2 add/sub with 12-bit immediate instructions cannot set the condition codes, so they do not have the extra cc_out operand. We hit an assertion during tail duplication because the instruction being duplicated had more operands that expected. llvm-svn: 98001	2010-03-08 22:56:15 +00:00
Evan Cheng	4f2fd2d2be	Re-commit 97860 with fix. getMallocAllocatedType may return null. llvm-svn: 98000	2010-03-08 22:54:36 +00:00
Kevin Enderby	d2030e38a6	Fix the vmxon entry in the X86InstrInfo.td so it has the correct prefix bytes for the encoding and is not the same as vmptrld. llvm-svn: 97992	2010-03-08 22:17:26 +00:00
Daniel Dunbar	3a3f472cb4	MC/Macho-O: Align the zerofill section itself to the maximum alignment. llvm-svn: 97991	2010-03-08 22:03:42 +00:00
Daniel Dunbar	6622fe7873	MC/Mach-O: Fix address compution for zero fill sections. llvm-svn: 97984	2010-03-08 21:10:42 +00:00
Daniel Dunbar	b59f7734b9	X86: Fix encoding for TEST{8,16,32}rr. llvm-svn: 97982	2010-03-08 21:10:36 +00:00
Evan Cheng	5967649780	Add documentation on sibling call optimization. Rename tailcall2.ll test to sibcall.ll. llvm-svn: 97980	2010-03-08 21:05:02 +00:00
John McCall	953838d0d5	Revert r97726 and r97728 at ddunbar's request; we want to solve this some other way when it comes to be necessary. llvm-svn: 97972	2010-03-08 20:02:05 +00:00
Wesley Peck	1fb4edc05d	Re-committing the failed r97807 commit with changes to eliminate warnings. llvm-svn: 97891	2010-03-06 23:23:12 +00:00
Anton Korobeynikov	bf16a17fc1	Initial bits of ARMv4-only support. Patch by John Tytgat! llvm-svn: 97886	2010-03-06 19:39:36 +00:00
Anton Korobeynikov	6f5523aa8b	Do not use '&' prefix for globals when register base field is non-zero, otherwise msp430-as will silently miscompile the code (TI's assembler report an error though). This fixes PR6349 llvm-svn: 97877	2010-03-06 11:41:12 +00:00
Eric Christopher	a7fb58f5f5	Migrate _chk call lowering from SimplifyLibCalls to InstCombine. Stub out the remainder of the calls that we should lower in some way and move the tests to the new correct directory. Fix up tests that are now optimized more than they were before by -instcombine. llvm-svn: 97875	2010-03-06 10:50:38 +00:00
Chris Lattner	4279a078a5	revert r97807, it introduced build warnings. llvm-svn: 97869	2010-03-06 04:32:46 +00:00
Eric Christopher	d8b43d0e59	Temporarily revert: Log: Transform @llvm.objectsize to integer if the argument is a result of malloc of known size. Modified: llvm/trunk/lib/Transforms/InstCombine/InstCombineCalls.cpp llvm/trunk/test/Transforms/InstCombine/objsize.ll It appears to be causing swb and nightly test failures. llvm-svn: 97866	2010-03-06 03:11:35 +00:00
Evan Cheng	afdc7d3aab	Transform @llvm.objectsize to integer if the argument is a result of malloc of known size. llvm-svn: 97860	2010-03-06 01:01:42 +00:00
Erick Tryzelaar	381268e629	Add a LLVMWriteBitcodeToFD that exposes the raw_fd_ostream options. llvm-svn: 97858	2010-03-06 00:30:06 +00:00
Devang Patel	0a3d71af52	Test case for r97851. llvm-svn: 97852	2010-03-05 23:35:04 +00:00
Charles Davis	8545afe0b0	Don't emit global symbols into the (__TEXT,__ustring) section on Darwin. This is a workaround for <rdar://problem/7672401/> (which I filed). This let's us build Wine on Darwin, and it gets the Qt build there a little bit further (so Doug says). llvm-svn: 97845	2010-03-05 22:28:45 +00:00
Jakob Stoklund Olesen	2664d295cb	Better handling of dead super registers in LiveVariables. We used to do this: CALL ... %RAX<imp-def> ... [not using %RAX] %EAX = ..., %RAX<imp-use, kill> RET %EAX<imp-use,kill> Now we do this: CALL ... %RAX<imp-def, dead> ... [not using %RAX] %EAX = ... RET %EAX<imp-use,kill> By not artificially keeping %RAX alive, we lower register pressure a bit. The correct number of instructions for 2008-08-05-SpillerBug.ll is obviously 55, anybody can see that. Sheesh. llvm-svn: 97838	2010-03-05 21:49:17 +00:00
Jakob Stoklund Olesen	8c5b8db5cd	We don't really care about correct register liveness information after the post-ra scheduler has run. Disable the verifier checks that late in the game. llvm-svn: 97837	2010-03-05 21:49:13 +00:00
Jakob Stoklund Olesen	b0503beff1	Avoid creating bad PHI instructions when BR is being const-folded. llvm-svn: 97836	2010-03-05 21:49:10 +00:00
Evan Cheng	d214ed0e75	Safely turn memset_chk etc. to non-chk variant if the known object size is >= memset / memcpy / memmove size. llvm-svn: 97828	2010-03-05 20:59:47 +00:00
Evan Cheng	fffdad58ac	Instcombine should turn llvm.objectsize of a alloca with static size to an integer. llvm-svn: 97827	2010-03-05 20:47:23 +00:00
Chris Lattner	f0692603d5	fix bss section printing for cell, patch by Kalle Raiskila! llvm-svn: 97814	2010-03-05 18:55:36 +00:00
Chris Lattner	f6befffbb2	fix PR6512, a case where instcombine would incorrectly merge loads from different addr spaces. llvm-svn: 97813	2010-03-05 18:53:28 +00:00
Wesley Peck	34004170c5	Reworking the stack layout that the MicroBlaze backend generates. The MicroBlaze backend was generating stack layouts that did not conform correctly to the ABI. This update generates stack layouts which are closer to what GCC does. Variable arguments support was added as well but the stack layout for varargs has not been finalized. llvm-svn: 97807	2010-03-05 15:26:02 +00:00
Chris Lattner	067459c62b	Fix PR6503. This turned into a much more interesting and nasty bug. Various parts of the cmp\|cmp and cmp&cmp folding logic wasn't prepared for vectors (unrelated to the bug but noticed while in the code) and the code was definitely not safe to use by the (cast icmp)\|(cast icmp) handling logic that I added in r95855. Fix all this up by changing the various routines to more consistently use IRBuilder and not pass in the I which had the wrong type. llvm-svn: 97801	2010-03-05 08:46:26 +00:00
Chris Lattner	fc13a0343c	make these less sensitive to temporary naming. llvm-svn: 97799	2010-03-05 08:43:33 +00:00
Chris Lattner	48b6e27e7b	remove this testcase, it isn't clear what it was testing and it is subsumed by or.ll llvm-svn: 97798	2010-03-05 08:43:06 +00:00
Evan Cheng	654ec2a663	Fix an oops in x86 sibcall optimization. If the ByVal callee argument is itself passed as a pointer, then it's obviously not safe to do a tail call. llvm-svn: 97797	2010-03-05 08:38:04 +00:00
Chris Lattner	c6c1523f59	fix a nice subtle reassociate bug which would only occur in a very specific use pattern embodied in the carefully reduced testcase. llvm-svn: 97794	2010-03-05 07:18:54 +00:00
Chris Lattner	55e81eb49f	Fix PR6497, a bug where we'd fold a load into an addc node which has a flag. That flag in turn was used by an already-selected adde which turned into an ADC32ri8 which used a selected load which was chained to the load we folded. This flag use caused us to form a cycle. Fix this by not ignoring chains in IsLegalToFold even in cases where the isel thinks it can. llvm-svn: 97791	2010-03-05 06:19:13 +00:00
Chris Lattner	bfdd17a2ea	cleanup llvm-svn: 97790	2010-03-05 06:17:43 +00:00
Evan Cheng	cf67ffa500	Rever 96389 and 96990. They are causing some miscompilation that I do not fully understand. llvm-svn: 97782	2010-03-05 03:08:23 +00:00
Bill Wendling	543ce1f64a	Revert r97766. It's deleting a tag. llvm-svn: 97768	2010-03-05 00:33:59 +00:00
Bill Wendling	6517f88f25	Micro-optimization: This code: float floatingPointComparison(float x, float y) { double product = (double)x * y; if (product == 0.0) return product; return product - 1.0; } produces this: _floatingPointComparison: 0000000000000000 cvtss2sd %xmm1,%xmm1 0000000000000004 cvtss2sd %xmm0,%xmm0 0000000000000008 mulsd %xmm1,%xmm0 000000000000000c pxor %xmm1,%xmm1 0000000000000010 ucomisd %xmm1,%xmm0 0000000000000014 jne 0x00000004 0000000000000016 jp 0x00000002 0000000000000018 jmp 0x00000008 000000000000001a addsd 0x00000006(%rip),%xmm0 0000000000000022 cvtsd2ss %xmm0,%xmm0 0000000000000026 ret The "jne/jp/jmp" sequence can be reduced to this instead: _floatingPointComparison: 0000000000000000 cvtss2sd %xmm1,%xmm1 0000000000000004 cvtss2sd %xmm0,%xmm0 0000000000000008 mulsd %xmm1,%xmm0 000000000000000c pxor %xmm1,%xmm1 0000000000000010 ucomisd %xmm1,%xmm0 0000000000000014 jp 0x00000002 0000000000000016 je 0x00000008 0000000000000018 addsd 0x00000006(%rip),%xmm0 0000000000000020 cvtsd2ss %xmm0,%xmm0 0000000000000024 ret for a savings of 2 bytes. This xform can happen when we recognize that jne and jp jump to the same "true" MBB, the unconditional jump would jump to the "false" MBB, and the "true" branch is the fall-through MBB. llvm-svn: 97766	2010-03-05 00:24:26 +00:00
Johnny Chen	ece1797542	Drop the ".w" qualifier for t2UXTB16* instructions as there is no 16-bit version of either sxtb16 or uxtb16, and the unified syntax does not specify ".w". llvm-svn: 97760	2010-03-04 22:24:41 +00:00
Bob Wilson	749ba9a7d5	pr6478: The frame pointer spill frame index is only defined when there is a frame pointer. llvm-svn: 97755	2010-03-04 21:42:36 +00:00
Bob Wilson	cf6e29a818	pr6480: Don't try producing ld/st-multiple instructions when the address is an undef value. This is only going to come up for bugpoint-reduced tests -- correct programs will not access memory at undefined addresses -- so it's not worth the effort of doing anything more aggressive. llvm-svn: 97745	2010-03-04 21:04:38 +00:00
Jakob Stoklund Olesen	af6ca23294	Fix the remaining MUL8 and DIV8 to define AX instead of AL,AH. These instructions technically define AL,AH, but a trick in X86ISelDAGToDAG reads AX in order to avoid reading AH with a REX instruction. Fix PR6489. llvm-svn: 97742	2010-03-04 20:42:07 +00:00
Dan Gohman	b8ebd408da	Fix recognition of 16-bit bswap for C front-ends which emit the clobber registers in a different order. llvm-svn: 97741	2010-03-04 19:58:08 +00:00
John McCall	d423572e86	Teach lit to honor conditional directives. The syntax is: IF(condition(value)): If the value satisfies the condition, the line is processed by lit; otherwise it is skipped. A test with no unignored directives is resolved as Unsupported. The test suite is responsible for defining conditions; conditions are unary functions over strings. I've defined two conditions in the LLVM test suite, TARGET (with values like those in TARGETS_TO_BUILD) and BINDING (with values like those in llvm_bindings). So for example you can write: IF(BINDING(ocaml)): RUN: %blah %s -o - and the RUN line will only execute if LLVM was configured with the ocaml bindings. llvm-svn: 97726	2010-03-04 09:36:50 +00:00
Nick Lewycky	1a7ed5868b	Make the 'icmp pred trunc(ext(X)), CST --> icmp pred X, ext(trunc(CST))' transformation much more careful. Truncating binary '01' to '1' sounds like it's safe until you realize that it switched from positive to negative under a signed interpretation, and that depends on the icmp predicate. Also a few miscellaneous cleanups. llvm-svn: 97721	2010-03-04 06:54:10 +00:00
Erick Tryzelaar	ab5ac37c31	Expose the rest of the llvm-c scalar opts to ocaml. llvm-svn: 97685	2010-03-03 23:51:34 +00:00
Chris Lattner	3afc0721c7	fix incorrect folding of icmp with undef, PR6481. llvm-svn: 97659	2010-03-03 19:46:03 +00:00
Dan Gohman	2850b41412	Revert r97580; that's not the right way to fix this. llvm-svn: 97639	2010-03-03 04:36:42 +00:00
Bill Wendling	af13d82945	This test case: long test(long x) { return (x & 123124) \| 3; } Currently compiles to: _test: orl $3, %edi movq %rdi, %rax andq $123127, %rax ret This is because instruction and DAG combiners canonicalize (or (and x, C), D) -> (and (or, D), (C \| D)) However, this is only profitable if (C & D) != 0. It gets in the way of the 3-addressification because the input bits are known to be zero. llvm-svn: 97616	2010-03-03 00:35:56 +00:00
Erick Tryzelaar	98b05d67e9	Remove module providers from ocaml. llvm-svn: 97609	2010-03-02 23:59:00 +00:00
Chris Lattner	dd030701bd	Fix some issues in WalkChainUsers dealing with CopyToReg/CopyFromReg/INLINEASM. These are annoying because they have the same opcode before an after isel. Fix this by setting their NodeID to -1 to indicate that they are selected, just like what automatically happens when selecting things that end up being machine nodes. With that done, give IsLegalToFold a new flag that causes it to ignore chains. This lets the HandleMergeInputChains routine be the one place that validates chains after a match is successful, enabling the new hotness in chain processing. This smarter chain processing eliminates the need for "PreprocessRMW" in the X86 and MSP430 backends and enables MSP to start matching it's multiple mem operand instructions more aggressively. I currently #if out the dead code in the X86 backend and MSP backend, I'll remove it for real in a follow-on patch. The testcase changes are: test/CodeGen/X86/sse3.ll: we generate better code test/CodeGen/X86/store_op_load_fold2.ll: PreprocessRMW was miscompiling this before, we now generate correct code Convert it to filecheck while I'm at it. test/CodeGen/MSP430/Inst16mm.ll: Add a testcase for mem/mem folding to make anton happy. :) llvm-svn: 97596	2010-03-02 22:20:06 +00:00
Chris Lattner	f61e34d120	this testcase is failing because pic16 doesn't define a reg/reg xor pattern. I have no plans to fix this XFAIL. llvm-svn: 97587	2010-03-02 20:48:24 +00:00
Erick Tryzelaar	a48e627126	Add support for use to ocaml. llvm-svn: 97586	2010-03-02 20:32:32 +00:00
Chris Lattner	7ecdcadcc9	xfail this for now. llvm-svn: 97584	2010-03-02 19:53:25 +00:00
Dan Gohman	d55f574589	When expanding an expression such as (A + B + C + D), sort the operands by loop depth and emit loop-invariant subexpressions outside of loops. This speeds up MultiSource/Applications/viterbi and others. llvm-svn: 97580	2010-03-02 19:32:21 +00:00
Chris Lattner	35ec683b78	clean up some testcases. llvm-svn: 97576	2010-03-02 18:56:03 +00:00
Chris Lattner	925ac71f26	Fix the xfail I added a couple of patches back. The issue was that we weren't properly handling the case when interior nodes of a matched pattern become dead after updating chain and flag uses. Now we handle this explicitly in UpdateChainsAndFlags. llvm-svn: 97561	2010-03-02 07:50:03 +00:00
Chris Lattner	b884fe867e	Rewrite chain handling validation and input TokenFactor handling stuff now that we don't care about emulating the old broken behavior of the old isel. This eliminates the 'CheckChainCompatible' check (along with IsChainCompatible) which did an incorrect and inefficient scan up the chain nodes which happened as the pattern was being formed and does the validation at the end in HandleMergeInputChains when it forms a structural pattern. This scans "down" the graph, which means that it is quickly bounded by nodes already selected. This also handles token factors that get "trapped" in the dag. Removing the CheckChainCompatible nodes also shrinks the generated tables by about 6K for X86 (down to 83K). There are two pieces remaining before I can nuke PreprocessRMW: 1. I xfailed a test because we're now producing worse code in a case that has nothing to do with the change: it turns out that our use of MorphNodeTo will leave dead nodes in the graph which (depending on how the graph is walked) end up causing bogus uses of chains and blocking matches. This is really bad for other reasons, so I'll fix this in a follow-up patch. 2. CheckFoldableChainNode needs to be improved to handle the TF. llvm-svn: 97539	2010-03-02 02:22:10 +00:00
Dan Gohman	4cec543952	Fix several places to handle vector operands properly. Based on a patch by Micah Villmow for PR6438. llvm-svn: 97538	2010-03-02 02:14:38 +00:00
Dan Gohman	52f5563973	Non-affine post-inc SCEV expansions have more code which must be emitted after the increment. Make sure the insert position reflects this. This fixes PR6453. llvm-svn: 97537	2010-03-02 01:59:21 +00:00
Dan Gohman	6f34abd092	Floating-point add, sub, and mul are now spelled fadd, fsub, and fmul, respectively. llvm-svn: 97531	2010-03-02 01:11:08 +00:00
Chris Lattner	d39f75ba39	Fix PR2590 by making PatternSortingPredicate actually be ordered correctly. Previously it would get in trouble when two patterns were too similar and give them nondet ordering. We force this by using the record ID order as a fallback. The testsuite diff is due to alpha patterns being ordered slightly differently, the change is a semantic noop afaict: < lda $0,-100($16) --- > subq $16,100,$0 llvm-svn: 97509	2010-03-01 22:09:11 +00:00
Devang Patel	aaecdaeb5d	Remove tests that checks @llvm.dbg.stoppoint handling. llvm-svn: 97493	2010-03-01 20:33:48 +00:00
Chris Lattner	d35a728a34	stop using anders-aa llvm-svn: 97492	2010-03-01 20:24:50 +00:00
Chris Lattner	8c1132746b	stop using anders-aa llvm-svn: 97491	2010-03-01 20:24:05 +00:00
Chris Lattner	7d2c1592f3	remove andersen's tests. llvm-svn: 97490	2010-03-01 20:23:15 +00:00
Devang Patel	2e7ddea828	@llvm.dbg.stoppoint intrinsic is not used anymore. Delete dead testcase. llvm-svn: 97489	2010-03-01 19:46:08 +00:00
Devang Patel	4e728b3823	Update to use new debug info encoding scheme. As a bonus, now the test passes! llvm-svn: 97487	2010-03-01 19:41:26 +00:00
Devang Patel	ed56bcfd91	Remove this test because it checks wheter optimizer handled @llvm.dbg.global_variable appropriately or not. LLVM does not use this scheme to encode debug info for global variables any more. llvm-svn: 97480	2010-03-01 19:14:25 +00:00
Devang Patel	d8425df136	Remove test to check bugfix in handing debug info for global variables using intrinsics. Now, debug info for global variable is encoded using metadata. The old code path is now history and there is no need to have a test to check a bug fix in old code path. llvm-svn: 97477	2010-03-01 19:09:55 +00:00
Devang Patel	9aef3e5de1	Remove dead test. llvm-svn: 97474	2010-03-01 19:04:23 +00:00
Devang Patel	dd596b1248	Replace test case that uses @llvm.dbg.* intrinsic with a test that uses metadata. llvm-svn: 97473	2010-03-01 19:02:51 +00:00
Devang Patel	890644e3a7	These two tests check whether oprimizer safely ignores @llvm.dbg.stoppoint intrinsic or not. This intrinsic is not used anymore. llvm-svn: 97468	2010-03-01 18:45:28 +00:00
Devang Patel	1392621e0f	This test checks whether LICM ignores @llvm.dbg.stoppoint intrinsics appropriately or not. Now, llvm does not use this intrinsic. Remove this test. llvm-svn: 97466	2010-03-01 18:32:27 +00:00
Devang Patel	3bf0571bb0	Rewrite test to test VLA using new debug info encoding scheme. llvm-svn: 97465	2010-03-01 18:30:58 +00:00
Devang Patel	4aefd92040	Remove this generic debug info intrinsic test. LLVM does not use this llvm.dbg.stoppoint intrinsic anymore. There are tests to check new implementation, which attaches location information directly with an instruction using metadata. llvm-svn: 97464	2010-03-01 18:30:08 +00:00
Dan Gohman	882c95605f	LLVM instruction syntax doesn't have trailing semicolons. llvm-svn: 97456	2010-03-01 17:53:15 +00:00
Erick Tryzelaar	84f5ba80df	Add support getting the operands of a User to ocaml. llvm-svn: 97414	2010-02-28 20:45:03 +00:00
Erick Tryzelaar	9190a2af9d	Add support for global aliases to ocaml. llvm-svn: 97413	2010-02-28 20:44:58 +00:00
Erick Tryzelaar	e533a41c24	Add support for inserting inline asm to ocaml. llvm-svn: 97412	2010-02-28 20:44:53 +00:00
Chris Lattner	90e7924cf0	add some random nounwinds. llvm-svn: 97411	2010-02-28 20:36:49 +00:00
Erick Tryzelaar	28db1a3e61	Add support for getting a null pointer. llvm-svn: 97380	2010-02-28 09:46:27 +00:00
Erick Tryzelaar	272d62bc5a	Add a way to look up a type by it's name in a module. llvm-svn: 97379	2010-02-28 09:46:21 +00:00
Erick Tryzelaar	06894b3824	Add support for global variables in an address space for llvm-c and ocaml. llvm-svn: 97377	2010-02-28 09:46:13 +00:00
Erick Tryzelaar	0fb26ef01f	Add indirect br support to llvm-c and ocaml. llvm-svn: 97376	2010-02-28 09:46:06 +00:00
Erick Tryzelaar	d8531faf95	Add metadata functions to llvm-c and ocaml. llvm-svn: 97375	2010-02-28 09:45:59 +00:00
Erick Tryzelaar	4c340c7f7f	Add the new builder arthmetic instructions to llvm-c and ocaml. llvm-svn: 97372	2010-02-28 05:51:43 +00:00
Erick Tryzelaar	a8053dfd27	Add the new union arthmetic instructions to llvm-c and ocaml. llvm-svn: 97371	2010-02-28 05:51:33 +00:00
Erick Tryzelaar	eaaac73c1d	Rename ocaml vmcore tests to make it easier to insert tests. llvm-svn: 97369	2010-02-28 05:51:21 +00:00
Erick Tryzelaar	4417431e0e	Remove malloc and free from the ocaml bindings. llvm-svn: 97367	2010-02-28 05:51:09 +00:00
John McCall	dcb9a7ad3d	Teach APFloat how to create both QNaNs and SNaNs and with arbitrary-width payloads. APFloat's internal folding routines always make QNaNs now, instead of sometimes making QNaNs and sometimes SNaNs depending on the type. llvm-svn: 97364	2010-02-28 02:51:25 +00:00
Dan Gohman	34021b7445	Don't try to replace physical registers when doing CSE. llvm-svn: 97360	2010-02-28 01:33:43 +00:00
Dan Gohman	45e7ffc350	Add nounwinds. llvm-svn: 97349	2010-02-27 23:53:53 +00:00
Evan Cheng	228c31f045	Re-apply 97040 with fix. This survives a ppc self-host llvm-gcc bootstrap. llvm-svn: 97310	2010-02-27 07:36:59 +00:00
Chris Lattner	d887f1da73	fix PR6414, a nondeterminism issue in IPSCCP which was because of a subtle interation in a loop operating in densemap order. llvm-svn: 97288	2010-02-27 00:07:42 +00:00
Jakob Stoklund Olesen	ddbf7a858e	Use the right floating point load/store instructions in PPCInstrInfo::foldMemoryOperandImpl(). The PowerPC floating point registers can represent both f32 and f64 via the two register classes F4RC and F8RC. F8RC is considered a subclass of F4RC to allow cross-class coalescing. This coalescing only affects whether registers are spilled as f32 or f64. Spill slots must be accessed with load/store instructions corresponding to the class of the spilled register. PPCInstrInfo::foldMemoryOperandImpl was looking at the instruction opcode which is wrong. X86 has similar floating point register classes, but doesn't try to fold memory operands, so there is no problem there. llvm-svn: 97262	2010-02-26 21:09:24 +00:00
Chris Lattner	0521c09d97	fix PR6435 another bug from the MallocInst elimination work. llvm-svn: 97231	2010-02-26 18:23:13 +00:00
Sanjiv Gupta	2bdbb3c167	Reapply things reverted back in 97220, with the fixed test case. llvm-svn: 97228	2010-02-26 17:59:28 +00:00
Richard Osborne	333300e0df	Fix XCoreTargetLowering::isLegalAddressingMode() to handle VoidTy. Previously LoopStrengthReduce would sometimes be unable to find a legal formula, causing an assertion failure. llvm-svn: 97226	2010-02-26 16:44:51 +00:00
Chris Lattner	044ada532c	this file lacks a run line! llvm-svn: 97208	2010-02-26 02:40:57 +00:00
Chris Lattner	7939f795f5	rewrite OptimizeGlobalAddressOfMalloc to fix PR6422, some bugs introduced when mallocinst was eliminated. llvm-svn: 97178	2010-02-25 22:33:52 +00:00
Daniel Dunbar	de277d48f5	tests: Propogate the HOME environment variable through to tests. I'm ambivalent about this, but it can be useful for users who use ccache, since the LLVMC tests are fond of calling gcc. llvm-svn: 97171	2010-02-25 22:09:09 +00:00
Chris Lattner	f7fc2d8b86	change the scope node to include a list of children to be checked instead of to have a chained series of scope nodes. This makes the generated table smaller, improves the efficiency of the interpreter, and make the factoring optimization much more reasonable to implement. llvm-svn: 97160	2010-02-25 19:00:39 +00:00
Kevin Enderby	7f99302dc9	This is a patch to the assembler frontend to detect when aligning a text section with TextAlignFillValue and calls EmitCodeAlignment() instead of calling EmitValueToAlignment(). This allows x86 assembly code to be aligned with optimal nops. llvm-svn: 97158	2010-02-25 18:46:04 +00:00
Dan Gohman	a2684dbff0	Teach the constant folder about union types. llvm-svn: 97142	2010-02-25 16:45:19 +00:00
Dan Gohman	9b80f86e5b	Revert r97064. Duncan pointed out that bitcasts are defined in terms of store and load, which means bitcasting between scalar integer and vector has endian-specific results, which undermines this whole approach. llvm-svn: 97137	2010-02-25 15:20:39 +00:00
Dan Gohman	a9c205cc88	Make LoopSimplify change conditional branches in loop exiting blocks which branch on undef to branch on a boolean constant for the edge exiting the loop. This helps ScalarEvolution compute trip counts for loops. Teach ScalarEvolution to recognize single-value PHIs, when safe, and ForgetSymbolicName to forget such single-value PHI nodes as apprpriate in ForgetSymbolicName. llvm-svn: 97126	2010-02-25 06:57:05 +00:00
Jeffrey Yasskin	6b718f73a5	Try r96559 for the third time. This time the shared library is only built if --enable-shared is passed to configure. llvm-svn: 97119	2010-02-25 06:34:33 +00:00
Jakob Stoklund Olesen	63af51c1c8	Create a stack frame on ARM when - Function uses all scratch registers AND - Function does not use any callee saved registers AND - Stack size is too big to address with immediate offsets. In this case a register must be scavenged to calculate the address of a stack object, and the scavenger needs a spare register or emergency spill slot. llvm-svn: 97071	2010-02-24 22:43:17 +00:00
Bob Wilson	ba8ac74fd9	Check for comparisons of +/- zero when optimizing less-than-or-equal and greater-than-or-equal SELECT_CCs to NEON vmin/vmax instructions. This is only allowed when UnsafeFPMath is set or when at least one of the operands is known to be nonzero. llvm-svn: 97065	2010-02-24 22:15:53 +00:00
Dan Gohman	4b2b48daba	Make getTypeSizeInBits work correctly for array types; it should return the number of value bits, not the number of bits of allocation for in-memory storage. Make getTypeStoreSize and getTypeAllocSize work consistently for arrays and vectors. Fix several places in CodeGen which compute offsets into in-memory vectors to use TargetData information. This fixes PR1784. llvm-svn: 97064	2010-02-24 22:05:23 +00:00
Daniel Dunbar	4811d004be	Speculatively revert r97011, "Re-apply 96540 and 96556 with fixes.", again in the hopes of fixing PPC bootstrap. llvm-svn: 97040	2010-02-24 17:05:47 +00:00
Dan Gohman	3860521406	When forming SSE min and max nodes for UGE and ULE comparisons, it's necessary to swap the operands to handle NaN and negative zero properly. Also, reintroduce logic for checking for NaN conditions when forming SSE min and max instructions, fixed to take into consideration NaNs and negative zeros. This allows forming min and max instructions in more cases. llvm-svn: 97025	2010-02-24 06:52:40 +00:00
Chris Lattner	df8a8a8c6f	Change the scheduler from adding nodes in allnodes order to adding them in a determinstic order (bottom up from the root) based on the structure of the graph itself. This updates tests for some random changes, interesting bits: CodeGen/Blackfin/promote-logic.ll no longer crashes. I have no idea why, but that's good right? CodeGen/X86/2009-07-16-LoadFoldingBug.ll also fails, but now compiles to have one fewer constant pool entry, making the expected load that was being folded disappear. Since it is an unreduced mass of gnast, I just removed it. This fixes PR6370 llvm-svn: 97023	2010-02-24 06:11:37 +00:00
Jim Grosbach	6ad4bcb0da	LowerCall() should always do getCopyFromReg() to reference the stack pointer. Machine instruction selection is much happier when operands are in virtual registers. llvm-svn: 97012	2010-02-24 01:43:03 +00:00
Evan Cheng	328a607490	Re-apply 96540 and 96556 with fixes. llvm-svn: 97011	2010-02-24 01:42:31 +00:00
Jakob Stoklund Olesen	a2d8c97b65	DIV8r must define %AX since X86DAGToDAGISel::Select() sometimes uses it instead of %AL/%AH. llvm-svn: 97006	2010-02-24 00:39:35 +00:00
Jakob Stoklund Olesen	fe0a8cd210	Remember to handle sub-registers when moving imp-defs to a rematted instruction. llvm-svn: 96995	2010-02-23 22:44:02 +00:00
Jakob Stoklund Olesen	38b76e27a7	Keep track of phi join registers explicitly in LiveVariables. Previously, LiveIntervalAnalysis would infer phi joins by looking for multiply defined registers. That doesn't work if the phi join is implicitly defined in all but one of the predecessors. llvm-svn: 96994	2010-02-23 22:43:58 +00:00
Jeffrey Yasskin	15983e57d6	Roll back r96959 again. llvm-svn: 96981	2010-02-23 20:53:37 +00:00
Devang Patel	d09b921b7d	new test case for r96974. llvm-svn: 96975	2010-02-23 19:37:40 +00:00
Wesley Peck	e4801e49c9	Adding the MicroBlaze backend. The MicroBlaze is a highly configurable 32-bit soft-microprocessor for use on Xilinx FPGAs. For more information see: http://www.xilinx.com/tools/microblaze.htm http://en.wikipedia.org/wiki/MicroBlaze The current LLVM MicroBlaze backend generates assembly which can be compiled using the an appropriate binutils assembler. llvm-svn: 96969	2010-02-23 19:15:24 +00:00
Jeffrey Yasskin	3ac46ccdff	Roll r96559 forward again, adding libLLVM-2.7svn.so to LLVM. This links 3 of the examples shared to make sure the shared library keeps working. llvm-svn: 96959	2010-02-23 18:10:07 +00:00
Dan Gohman	cd4c03e886	Don't do (X != Y) ? X : Y -> X for floating-point values; it doesn't handle NaN properly. Do (X une Y) ? X : Y -> X if one of X and Y is not zero. llvm-svn: 96955	2010-02-23 17:17:57 +00:00
Dan Gohman	8a0eb36d23	Remove the code which constant-folded ptrtoint(inttoptr(x)+c) to getelementptr. Despite only doing so in the case where x is a known array object and c can be converted to an index within range, this could still be invalid if c is actually the address of an object allocated outside of LLVM. Also, SCEVExpander, the original motivation for this code, has since been improved to avoid inttoptr+ptroint in more cases. llvm-svn: 96950	2010-02-23 16:35:41 +00:00
Richard Osborne	f578196968	Lower BR_JT on the XCore to a jump into a series of jump instructions. llvm-svn: 96942	2010-02-23 13:25:07 +00:00
Daniel Dunbar	7e4acbdf53	tests: Don't make a missing llvm-gcc dir a fatal error. llvm-svn: 96938	2010-02-23 11:34:12 +00:00
Daniel Dunbar	e615b5fe4d	Switch .bc/.ll Makefile rules to use LLVM{CC,CXX} instead of LLVMG{CC,XX} llvm-svn: 96936	2010-02-23 10:28:06 +00:00
Daniel Dunbar	5c8f47863c	Fix a thinko in the lit.cfg. llvm-svn: 96931	2010-02-23 09:28:48 +00:00
Mikhail Glushenkov	d76f096a53	Update the test suite. llvm-svn: 96921	2010-02-23 09:04:51 +00:00
Daniel Dunbar	40886109ce	Inline and eliminate LLVMG{CC,XX}WITHPATH. llvm-svn: 96913	2010-02-23 07:56:41 +00:00
Daniel Dunbar	6d914f8904	Eliminate llvmgcc_version testing variable. llvm-svn: 96908	2010-02-23 07:56:28 +00:00
Daniel Dunbar	d6a395278b	Kill unused llvmgccmajvers testing variable. llvm-svn: 96906	2010-02-23 07:56:18 +00:00
Dan Gohman	e7f6feb469	Convert this test to FileCheck and add a testcase for PR3574. llvm-svn: 96851	2010-02-23 01:28:09 +00:00
Evan Cheng	2a33390e2b	These should not have been committed. llvm-svn: 96827	2010-02-22 23:37:48 +00:00
Chris Lattner	72334622d6	no need to run llvm-as here. llvm-svn: 96826	2010-02-22 23:34:12 +00:00
Evan Cheng	3688b8fa68	Instcombine constant folding can normalize gep with negative index to index with large offset. When instcombine objsize checking transformation sees these geps where the offset seemingly point out of bound, it should just return "i don't know" rather than asserting. llvm-svn: 96825	2010-02-22 23:34:00 +00:00
Dan Gohman	d8abbf0af6	Add a test for canonicalizing ConstantExpr operands. llvm-svn: 96820	2010-02-22 23:07:52 +00:00
Dan Gohman	6c5ac6de5c	Canonicalize ConstantInts to the right operand of commutative operators. The test difference is just due to the multiplication operands being commuted (and thus requiring a more elaborate match). In optimized code, that expression would be folded. llvm-svn: 96816	2010-02-22 22:43:23 +00:00
Dan Gohman	be24c455c4	Actually enable the -enable-unsafe-fp-math tests. llvm-svn: 96796	2010-02-22 18:53:26 +00:00
Arnold Schwaighofer	30ece5b807	Mark the return address stack slot as mutable when moving the return address during a tail call. A parameter might overwrite this stack slot during the tail call. The sequence during a tail call is: 1.) load return address to temp reg 2.) move parameters (might involve storing to return address stack slot) 3.) store return address to new location from temp reg If the stack location is marked immutable CodeGen can colocate load (1) with the store (3). This fixes bug 6225. llvm-svn: 96783	2010-02-22 16:18:09 +00:00
Daniel Dunbar	f16bc6d296	LLVMC/MultiplePluginPriorities.td: Generally XFAIL this test for now, it is still failing during (one) llvm-gcc powerpc build, and is also failing on my x86_64-apple-darwin10. llvm-svn: 96781	2010-02-22 05:55:32 +00:00
Dan Gohman	754e4a9801	Constant-fold certain comparisons with infinity and negative infinity. llvm-svn: 96777	2010-02-22 04:06:03 +00:00
Dan Gohman	b87de8d30d	Remove the logic for reasoning about NaNs from the code that forms SSE min and max instructions. The real thing this code needs to be concerned about is negative zero. Update the sse-minmax.ll test accordingly, and add tests for -enable-unsafe-fp-math mode as well. llvm-svn: 96775	2010-02-22 04:03:39 +00:00
Dan Gohman	4506fcb3c2	When emitting an instruction which depends on both a post-incremented induction variable value and a loop-variant value, don't force the insert position to be at the post-increment position, because it may not be dominated by the loop-variant value. This fixes a use-before-def problem noticed on PPC. llvm-svn: 96774	2010-02-22 03:59:54 +00:00
Chris Lattner	745219ea64	add some no-unwinds, other minor cleanups. llvm-svn: 96756	2010-02-21 20:33:20 +00:00
Chris Lattner	c43c88ebce	add a triple so that this doesn't fail due to linux/ppc register printing syntax. llvm-svn: 96748	2010-02-21 19:27:38 +00:00
Chris Lattner	53485469b4	filecheckize and add nouwinds. llvm-svn: 96745	2010-02-21 18:53:28 +00:00
Anton Korobeynikov	e96503faa1	IT turns out that during jumpless setcc lowering eq and ne were swapped. This fixes PR6348 llvm-svn: 96734	2010-02-21 12:28:58 +00:00
Chris Lattner	3c29aff9ff	fix and un-xfail X86/vec_ss_load_fold.ll llvm-svn: 96720	2010-02-21 04:53:34 +00:00
Chris Lattner	7d5f4a4c03	temporarily disable this. llvm-svn: 96717	2010-02-21 03:24:41 +00:00
Dan Gohman	85af256779	Check for overflow when scaling up an add or an addrec for scaled reuse. llvm-svn: 96692	2010-02-19 19:32:49 +00:00
Charles Davis	7e47767763	Add support for the 'alignstack' attribute to the x86 backend. Fixes PR5254. Also, FileCheck'ize a test. llvm-svn: 96686	2010-02-19 18:17:13 +00:00
Dan Gohman	6b1e2a829d	Teach ScalarEvolution how to compute a tripcount for a loop with true or false as its exit condition. These are usually eliminated by SimplifyCFG, but the may be left around during a pass which wishes to preserve the CFG. llvm-svn: 96683	2010-02-19 18:12:07 +00:00
Duncan Sands	d0bf6f640f	Revert commits 96556 and 96640, because commit 96556 breaks the dragonegg self-host build. I reverted 96640 in order to revert 96556 (96640 goes on top of 96556), but it also looks like with both of them applied the breakage happens even earlier. The symptom of the 96556 miscompile is the following crash: llvm[3]: Compiling AlphaISelLowering.cpp for Release build cc1plus: /home/duncan/tmp/tmp/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp:4982: void llvm::SelectionDAG::ReplaceAllUsesWith(llvm::SDNode, llvm::SDNode, llvm::SelectionDAG::DAGUpdateListener*): Assertion `(!From->hasAnyUseOfValue(i) \|\| From->getValueType(i) == To->getValueType(i)) && "Cannot use this version of ReplaceAllUsesWith!"' failed. Stack dump: 0. Running pass 'X86 DAG->DAG Instruction Selection' on function '@_ZN4llvm19AlphaTargetLowering14LowerOperationENS_7SDValueERNS_12SelectionDAGE' g++: Internal error: Aborted (program cc1plus) This occurs when building LLVM using LLVM built by LLVM (via dragonegg). Probably LLVM has miscompiled itself, though it may have miscompiled GCC and/or dragonegg itself: at this point of the self-host build, all of GCC, LLVM and dragonegg were built using LLVM. Unfortunately this kind of thing is extremely hard to debug, and while I did rummage around a bit I didn't find any smoking guns, aka obviously miscompiled code. Found by bisection. r96556 \| evancheng \| 2010-02-18 03:13:50 +0100 (Thu, 18 Feb 2010) \| 5 lines Some dag combiner goodness: Transform br (xor (x, y)) -> br (x != y) Transform br (xor (xor (x,y), 1)) -> br (x == y) Also normalize (and (X, 1) == / != 1 -> (and (X, 1)) != / == 0 to match to "test on x86" and "tst on arm" r96640 \| evancheng \| 2010-02-19 01:34:39 +0100 (Fri, 19 Feb 2010) \| 16 lines Transform (xor (setcc), (setcc)) == / != 1 to (xor (setcc), (setcc)) != / == 1. e.g. On x86_64 %0 = icmp eq i32 %x, 0 %1 = icmp eq i32 %y, 0 %2 = xor i1 %1, %0 br i1 %2, label %bb, label %return => testl %edi, %edi sete %al testl %esi, %esi sete %cl cmpb %al, %cl je LBB1_2 llvm-svn: 96672	2010-02-19 11:30:41 +00:00
Devang Patel	1f9e9ac766	Test case for r96656. llvm-svn: 96657	2010-02-19 02:58:33 +00:00
Evan Cheng	d2d9252f35	Transform (xor (setcc), (setcc)) == / != 1 to (xor (setcc), (setcc)) != / == 1. e.g. On x86_64 %0 = icmp eq i32 %x, 0 %1 = icmp eq i32 %y, 0 %2 = xor i1 %1, %0 br i1 %2, label %bb, label %return => testl %edi, %edi sete %al testl %esi, %esi sete %cl cmpb %al, %cl je LBB1_2 llvm-svn: 96640	2010-02-19 00:34:39 +00:00
Dan Gohman	2446f57503	When determining the set of interesting reuse factors, consider strides in foreign loops. This helps locate reuse opportunities with existing induction variables in foreign loops and reduces the need for inserting new ones. This fixes rdar://7657764. llvm-svn: 96629	2010-02-19 00:05:23 +00:00
Mon P Wang	c94892513d	getSplatIndex assumes that the first element of the mask contains the splat index which is not always true if the mask contains undefs. Modified it to return the first non undef value. llvm-svn: 96621	2010-02-18 22:33:18 +00:00
Jakob Stoklund Olesen	c953acbd7f	Always normalize spill weights, also for intervals created by spilling. Moderate the weight given to very small intervals. The spill weight given to new intervals created when spilling was not normalized in the same way as the original spill weights calculated by CalcSpillWeights. That meant that restored registers would tend to hang around because they had a much higher spill weight that unspilled registers. This improves the runtime of a few tests by up to 10%, and there are no significant regressions. llvm-svn: 96613	2010-02-18 21:33:05 +00:00
Dan Gohman	5ffef745c2	Make CodePlacementOpt detect special EH control flow by checking whether AnalyzeBranch disagrees with the CFG directly, rather than looking for EH_LABEL instructions. EH_LABEL instructions aren't always at the end of the block, due to FP_REG_KILL and other things. This fixes an infinite loop compiling MultiSource/Benchmarks/Bullet. llvm-svn: 96611	2010-02-18 21:25:53 +00:00
Devang Patel	441eb781ae	Ignore target dependent value in grep search. llvm-svn: 96604	2010-02-18 19:52:12 +00:00
Chris Lattner	6a9bdade29	remove empty file llvm-svn: 96573	2010-02-18 06:29:06 +00:00
Bob Wilson	c6c13a3515	Use NEON vmin/vmax instructions for floating-point selects. Radar 7461718. llvm-svn: 96572	2010-02-18 06:05:53 +00:00
Jeffrey Yasskin	c451027db9	Roll back the shared library, r96559. It broke two darwins and arm, mysteriously. llvm-svn: 96569	2010-02-18 04:43:02 +00:00
Jeffrey Yasskin	f750fefaf8	Add a shared library for LLVM, named libLLVM2.7svn.(so\|dylib), and add an --enable-shared configure flag to have the tools linked shared. (2.7svn is just $(LLVMVersion) so it'll change to "2.7" in the release.) Always link the example programs shared to test that the shared library keeps working. On my mac laptop, Debug libLLVM2.7svn.dylib is 39MB, and opt (for example) is 16M static vs 440K shared. Two things are less than ideal here: 1) The library doesn't include any version information. Since we expect to break the ABI with every release, this shouldn't be much of a problem. If we do release a compatible 2.7.1, we may be able to hack its library to work with binaries compiled against 2.7.0, or we can just ask them to recompile. I'm hoping to get a real packaging expert to look at this for the 2.8 release. 2) llvm-config doesn't yet have an option to print link options for the shared library. I'll add this as a subsequent patch. llvm-svn: 96559	2010-02-18 02:36:02 +00:00
Evan Cheng	0ceb68a552	Some dag combiner goodness: Transform br (xor (x, y)) -> br (x != y) Transform br (xor (xor (x,y), 1)) -> br (x == y) Also normalize (and (X, 1) == / != 1 -> (and (X, 1)) != / == 0 to match to "test on x86" and "tst on arm" llvm-svn: 96556	2010-02-18 02:13:50 +00:00
Devang Patel	4956ea0a51	New test case for r96543. llvm-svn: 96544	2010-02-18 00:53:49 +00:00
Eric Christopher	624ee8da0d	Revert: r95605 \| dpatel \| 2010-02-08 15:27:46 -0800 (Mon, 08 Feb 2010) \| 2 lines test case for r95604. Which was the testcase for the patch reverted from llvm-gcc. llvm-svn: 96474	2010-02-17 08:53:27 +00:00
Devang Patel	ca55a04273	Before setting scope end marker, pay attention to scope begin marker and existing scope end marker, if any. Scope must begin before it ends and nested inlined scope do not truncate surrounding scope. llvm-svn: 96445	2010-02-17 02:20:34 +00:00
Dan Gohman	104207b4c5	Don't check for comments, which vary between subtargets. llvm-svn: 96434	2010-02-17 01:08:57 +00:00
Dan Gohman	cf39be32bf	Fold bswap(undef) to undef. llvm-svn: 96432	2010-02-17 00:54:58 +00:00
Dan Gohman	5f10d6c52c	Don't attempt to divide INT_MIN by -1; consider such cases to have overflowed. llvm-svn: 96428	2010-02-17 00:41:53 +00:00
Chris Lattner	1fc2773a33	roundss is an sse 4 thing, fix the test on non-sse41 builders like llvm-gcc-x86_64-darwin10-selfhost llvm-svn: 96417	2010-02-17 00:29:06 +00:00
Dale Johannesen	cee887425e	Make g5 target explicit; scheduling affects register choice. llvm-svn: 96413	2010-02-16 23:25:23 +00:00
Chris Lattner	afac7dad21	fix rdar://7653908, a crash on a case where we would fold a load into a roundss intrinsic, producing a cyclic dag. The root cause of this is badness handling ComplexPattern nodes in the old dagisel that I noticed through inspection. Eliminate a copy of the of the code that handled ComplexPatterns by making EmitChildMatchCode call into EmitMatchCode. llvm-svn: 96408	2010-02-16 22:35:06 +00:00
Dale Johannesen	0062f7bf59	Adjust register numbers in tests to compensate for the new lack of R2. llvm-svn: 96407	2010-02-16 22:31:31 +00:00
Chris Lattner	c98beb567c	filecheckize llvm-svn: 96404	2010-02-16 22:13:43 +00:00
Devang Patel	8b9fec4428	New testcase. llvm-svn: 96391	2010-02-16 21:16:08 +00:00
Evan Cheng	82b04130cb	Look for SSE and instructions of this form: (and x, (build_vector c1,c2,c3,c4)). If there exists a use of a build_vector that's the bitwise complement of the mask, then transform the node to (and (xor x, (build_vector -1,-1,-1,-1)), (build_vector ~c1,~c2,~c3,~c4)). Since this transformation is only useful when 1) the given build_vector will become a load from constpool, and 2) (and (xor x -1), y) matches to a single instruction, I decided this is appropriate as a x86 specific transformation. rdar://7323335 llvm-svn: 96389	2010-02-16 21:09:44 +00:00
David Greene	9641d06809	Add support for emitting non-temporal stores for DAGs marked non-temporal. Fix from r96241 for botched encoding of MOVNTDQ. Add documentation for !nontemporal metadata. Add a simpler movnt testcase. llvm-svn: 96386	2010-02-16 20:50:18 +00:00
Bob Wilson	6bfacb7393	Testcase for critical edge splitting with load PRE. llvm-svn: 96385	2010-02-16 20:48:55 +00:00
Bob Wilson	70aa8d0745	Fix pr6111: Avoid using the LR register for the target address of an indirect branch in ARM v4 code, since it gets clobbered by the return address before it is used. Instead of adding a new register class containing all the GPRs except LR, just use the existing tGPR class. llvm-svn: 96360	2010-02-16 17:24:15 +00:00
Chris Lattner	4964ef88c2	make pcrel immediate values relative to the start of the field, not the end of the field, fixing rdar://7651978 llvm-svn: 96330	2010-02-16 05:03:17 +00:00
Dan Gohman	521efe68ab	Split the main for-each-use loop again, this time for GenerateTruncates, as it also peeks at which registers are being used by other uses. This makes LSR less sensitive to use-list order. llvm-svn: 96308	2010-02-16 01:42:53 +00:00
Anton Korobeynikov	ae4ccc10da	Preliminary patch to improve dwarf EH generation - Hooks to return Personality / FDE / LSDA / TType encoding depending on target / options (e.g. code model / relocation model) - MCIzation of Dwarf EH printer to use encoding information - Stub generation for ELF target (needed for indirect references) - Some other small changes here and there llvm-svn: 96285	2010-02-15 22:35:59 +00:00
Jakob Stoklund Olesen	2988d573e5	Fix PR6300. A virtual register can be used before it is defined in the same MBB if the MBB is part of a loop. Teach the implicit-def pass about this case. llvm-svn: 96279	2010-02-15 22:03:29 +00:00
Bob Wilson	9be7200b08	Last week we were generating code with duplicate induction variables in this test, but the problem seems to have gone away today. Add a check to make sure it doesn't come back. llvm-svn: 96277	2010-02-15 21:56:40 +00:00
Chris Lattner	3818d9763d	remove empty file. llvm-svn: 96271	2010-02-15 21:14:50 +00:00
Chris Lattner	bcbaaba532	revert r96241. It breaks two regression tests, isn't documented, and the testcase needs improvement. llvm-svn: 96265	2010-02-15 20:53:01 +00:00
Chris Lattner	6fbfe5897c	fix PR6305 by handling BlockAddress in a helper function called by jump threading. llvm-svn: 96263	2010-02-15 20:47:49 +00:00
David Greene	63cedef74b	Add support for emitting non-temporal stores for DAGs marked non-temporal. llvm-svn: 96241	2010-02-15 17:02:56 +00:00
Mikhail Glushenkov	5352f07f2c	Revert r96130 ("Forward parameter options as '-option=param'"). This behaviour must be configurable. llvm-svn: 96210	2010-02-15 03:17:06 +00:00
Eric Christopher	843a4cc43c	Fix a problem where we had bitcasted operands that gave us odd offsets since the bitcasted pointer size and the offset pointer size are going to be different types for the GEP vs base object. llvm-svn: 96134	2010-02-13 23:38:01 +00:00
Mikhail Glushenkov	32fa169648	Forward parameter options as '-option=parameter'. Some tools do not like the '-option parameter' form. Should this be configurable? llvm-svn: 96130	2010-02-13 22:37:28 +00:00
Chris Lattner	f83726f6ba	add encoder support and tests for rdtscp llvm-svn: 96076	2010-02-13 03:42:24 +00:00
Jakob Stoklund Olesen	b659c76c77	Fix PR6283. When coalescing with a physreg, remember to add imp-def and imp-kill when dealing with sub-registers. Also fix a related bug in VirtRegRewriter where substitutePhysReg may reallocate the operand list on an instruction and invalidate the reg_iterator. This can happen when a register is mentioned twice on the same instruction. llvm-svn: 96072	2010-02-13 02:06:10 +00:00
Daniel Dunbar	d0c6d361fe	MC/AsmParser: Attempt to constant fold expressions up-front. This ensures we avoid fixups for obvious cases like '-(16)'. llvm-svn: 96064	2010-02-13 01:28:07 +00:00
Chris Lattner	509154e0f9	rip out the 'heinous' x86 MCCodeEmitter implementation. We still have the templated X86 JIT emitter, and the almost-copy in X86InstrInfo for getting instruction sizes. llvm-svn: 96059	2010-02-13 00:49:29 +00:00
Chris Lattner	140caa7240	remove special cases for vmlaunch, vmresume, vmxoff, and swapgs fix swapgs to be spelled right. llvm-svn: 96058	2010-02-13 00:41:14 +00:00
Bob Wilson	01abf8fc2f	Besides removing phi cycles that reduce to a single value, also remove dead phi cycles. Adjust a few tests to keep dead instructions from being optimized away. This (together with my previous change for phi cycles) fixes Apple radar 7627077. llvm-svn: 96057	2010-02-13 00:31:44 +00:00
Daniel Dunbar	224340cabe	MC/X86: Push immediate operands as immediates not expressions when possible. llvm-svn: 96055	2010-02-13 00:17:21 +00:00
Chris Lattner	34749d879d	add some disassemble testcases for weird instructions llvm-svn: 96045	2010-02-12 23:46:48 +00:00
Chris Lattner	1e827fd8ca	implement the rest of correct x86-64 encoder support for rip-relative addresses, and add a testcase. llvm-svn: 96040	2010-02-12 23:24:09 +00:00
Dale Johannesen	26062150fa	When save/restoring CR at prolog/epilog, in a large stack frame, the prolog/epilog code was using the same register for the copy of CR and the address of the save slot. Oops. This is fixed here for Darwin, sort of, by reserving R2 for this case. A better way would be to do the store before the decrement of SP, which is safe on Darwin due to the red zone. SVR4 probably has the same problem, but I don't know how to fix it; there is no red zone and R2 is already used for something else. I'm going to leave it to someone interested in that target. Better still would be to rewrite the CR-saving code completely; spilling each CR subregister individually is horrible code. llvm-svn: 96015	2010-02-12 21:35:34 +00:00
Chris Lattner	392be58cad	Add support for a union type in LLVM IR. Patch by Talin! llvm-svn: 96011	2010-02-12 20:49:41 +00:00
Chris Lattner	75879be9d8	1. modernize the constantmerge pass, using densemap/smallvector. 2. don't bother trying to merge globals in non-default sections, doing so is quite dubious at best anyway. 3. fix a bug reported by Arnaud de Grandmaison where we'd try to merge two globals in different address spaces. llvm-svn: 95995	2010-02-12 18:17:23 +00:00
Chris Lattner	554003f481	rename test llvm-svn: 95993	2010-02-12 18:05:00 +00:00
Anton Korobeynikov	b9ce3cc458	Testcases for recent stdcall / fastcall mangling improvements llvm-svn: 95982	2010-02-12 15:29:13 +00:00
Anton Korobeynikov	c9276dfe04	Cleanup stdcall / fastcall name mangling. This should fix alot of problems we saw so far, e.g. PRs 5851 & 2936 llvm-svn: 95980	2010-02-12 15:28:40 +00:00
Dan Gohman	45774ce0ad	Reapply the new LoopStrengthReduction code, with compile time and bug fixes, and with improved heuristics for analyzing foreign-loop addrecs. This change also flattens IVUsers, eliminating the stride-oriented groupings, which makes it easier to work with. llvm-svn: 95975	2010-02-12 10:34:29 +00:00
Evan Cheng	0e4df63bd5	Update test to match 95961. llvm-svn: 95971	2010-02-12 07:48:46 +00:00
Evan Cheng	993bd1b7da	Test for 95961. llvm-svn: 95962	2010-02-12 02:35:03 +00:00
Evan Cheng	67e45e1670	Test case for 95958. llvm-svn: 95959	2010-02-12 02:02:23 +00:00
Bob Wilson	0827e040e0	Add a new pass on machine instructions to optimize away PHI cycles that reduce down to a single value. InstCombine already does this transformation but DAG legalization may introduce new opportunities. This has turned out to be important for ARM where 64-bit values are split up during type legalization: InstCombine is not able to remove the PHI cycles on the 64-bit values but the separate 32-bit values can be optimized. I measured the compile time impact of this (running llc on 176.gcc) and it was not significant. llvm-svn: 95951	2010-02-12 01:30:21 +00:00
Chris Lattner	1572e760bc	fix the encodings of monitor and mwait, which were completely busted in both encoders. I'm not bothering to fix it in the old one at this point. llvm-svn: 95947	2010-02-12 01:06:22 +00:00
Charles Davis	be5557e86b	Add a new function attribute, 'alignstack'. It will indicate (when the backends implement support for it) that the stack should be forcibly realigned in the prologue (and the process reversed in the epilogue). llvm-svn: 95945	2010-02-12 00:31:15 +00:00
Jakob Stoklund Olesen	93c92225af	Reapply coalescer fix for better cross-class coalescing. This time with fixed test cases. llvm-svn: 95938	2010-02-11 23:55:29 +00:00
Eric Christopher	cccdc13662	Make sure that ConstantExpr offsets also aren't off of extern symbols. Thanks to Duncan Sands for the testcase! llvm-svn: 95877	2010-02-11 17:44:04 +00:00
Chris Lattner	4e8137d678	Rename ValueRequiresCast to ShouldOptimizeCast, to better reflect what it does. Enhance it to return false to optimizing vector sign extensions from vector comparisions, which is the idiom used to get a splatted vector for a vector comparison. Doing this breaks vector-casts.ll, add some compensating transformations to handle the important case they cover without depending on this canonicalization. This fixes rdar://7434900 a serious pessimization of vector compares. llvm-svn: 95855	2010-02-11 06:26:33 +00:00
Chris Lattner	1d4eb8fac4	convert to filecheck. llvm-svn: 95854	2010-02-11 06:24:37 +00:00
Chris Lattner	c053cbbc4d	Make DSE only scan blocks that are reachable from the entry block. Other blocks may have pointer cycles that will crash basicaa and other alias analyses. In any case, there is no point wasting cycles optimizing dead blocks. This fixes rdar://7635088 llvm-svn: 95852	2010-02-11 05:11:54 +00:00
Chris Lattner	f492ece81e	a testcase that doesn't crash GVN but could someday. llvm-svn: 95851	2010-02-11 05:08:05 +00:00
Chris Lattner	d924f63692	Make jump threading honor x\|undef -> true and x&undef -> false, instead of considering x\|undef -> x, which may not be true. llvm-svn: 95850	2010-02-11 04:40:44 +00:00
Eric Christopher	531ea566a6	Add ConstantExpr handling to Intrinsic::objectsize lowering. Update testcase accordingly now that we can optimize another section. llvm-svn: 95846	2010-02-11 01:48:54 +00:00
Devang Patel	d0d1fc0221	test case for r95842. llvm-svn: 95844	2010-02-11 01:31:01 +00:00
Kevin Enderby	37993197bf	Remove the few # TAILCALL comments that snuck in. As they may fail on linux. llvm-svn: 95827	2010-02-11 00:18:12 +00:00
Kevin Enderby	cfd0e5a15e	Update the X86 assembler matcher test case now that a few more things match with some of the recent changes that have gone into llvm-mc. llvm-svn: 95826	2010-02-11 00:13:43 +00:00
Mon P Wang	5b77f0dac1	The previous fix of widening divides that trap was too fragile as it depends on custom lowering and requires that certain types exist in ValueTypes.h. Modified widening to check if an op can trap and if so, the widening algorithm will apply only the op on the defined elements. It is safer to do this in widening because the optimizer can't guarantee removing unused ops in some cases. llvm-svn: 95823	2010-02-10 23:37:45 +00:00
Bob Wilson	0f52d0c074	Delete dead PHI machine instructions. These can be created due to type legalization even when the IR-level optimizer has removed dead phis, such as when the high half of an i64 value is unused on a 32-bit target. I had to adjust a few test cases that had dead phis. This is a partial fix for Radar 7627077. llvm-svn: 95816	2010-02-10 22:58:57 +00:00
Daniel Dunbar	3e0c9790f2	MC/X86 AsmMatcher: Fix a use after free spotted by d0k, and de-XFAIL x86_32-encoding.s in on expectation of it passing. llvm-svn: 95806	2010-02-10 21:19:28 +00:00
Daniel Dunbar	df11958895	XFAIL this on linux until I figure out what is happening. llvm-svn: 95804	2010-02-10 21:01:04 +00:00
Kevin Enderby	cc152d6159	Replace this file containing 4 tests of x86 32-bit encodings with a file containing the subset of the full auto generated test case that currently encodes correctly. Again it is useful as we bring up the the new encoder to make sure currently working stuff stays working. llvm-svn: 95791	2010-02-10 19:13:56 +00:00
Dan Gohman	183a423af9	Canonicalize sizeof and alignof on pointer types to a canonical pointer type. llvm-svn: 95769	2010-02-10 06:13:07 +00:00
Evan Cheng	29b8f554fc	Now that ShrinkDemandedOps() is separated out from DAG combine. It sometimes leave some obvious nops which dag combine used to clean up afterwards e.g. (trunk (ext n)) -> n. Look for them and squash them. llvm-svn: 95757	2010-02-10 02:17:34 +00:00
Kevin Enderby	a7c1d6cfd1	Fix the encoding of the movntdqa X86 instruction. It was missing the 0x66 prefix which is part of the opcode encoding. llvm-svn: 95729	2010-02-10 00:10:31 +00:00
Chris Lattner	0c3b66cd87	fix X86 encoder to output [disp] only addresses with no SIB byte in X86-32 mode. This is still required in x86-64 mode to avoid forming [disp+rip] encoding. Rewrite the SIB byte decision logic to be actually understandable. llvm-svn: 95693	2010-02-09 21:47:19 +00:00
Eric Christopher	7b7028fd24	Move Intrinsic::objectsize lowering back to InstCombineCalls and enable constant 0 offset lowering. llvm-svn: 95691	2010-02-09 21:24:27 +00:00
Dale Johannesen	8d3aa40bae	Re-disable for Darwin; I was mistaken to think this was fixed. llvm-svn: 95688	2010-02-09 19:54:29 +00:00
Eric Christopher	ad1aa86276	Pull these back out, they're a little too aggressive and time consuming for a simple optimization. llvm-svn: 95671	2010-02-09 17:29:18 +00:00
Chris Lattner	78360a8184	move tests that depend on the x86 backend out of codegen/generic, and remove a few old and unreduced ones. Fixes PR5624. llvm-svn: 95656	2010-02-09 06:41:03 +00:00
Chris Lattner	4660c225a2	make target independent. llvm-svn: 95655	2010-02-09 06:36:30 +00:00
Chris Lattner	933509287b	merge a target-specific add test into x86 directory. llvm-svn: 95654	2010-02-09 06:35:50 +00:00
Chris Lattner	015ecd85d4	merge another test in, drop the trivially constant folded cases. llvm-svn: 95653	2010-02-09 06:33:27 +00:00
Chris Lattner	c77b9eb31c	consolidate and filecheckize two tests. llvm-svn: 95652	2010-02-09 06:24:00 +00:00
Chris Lattner	534e1667a0	merge two tests, make target independent. llvm-svn: 95651	2010-02-09 06:19:20 +00:00
Chris Lattner	9b6a1789e5	fix PR6193, only considering sign extensions from i1 for this xform. llvm-svn: 95642	2010-02-09 01:12:41 +00:00
Chris Lattner	d00faaa9c7	Implement x86 asm parsing support for %st and %st(4) llvm-svn: 95634	2010-02-09 00:49:22 +00:00
Eric Christopher	9f85e7eb16	Add a new pass to do llvm.objsize lowering using SCEV. Initial skeleton and SCEVUnknown lowering implemented, the rest should come relatively quickly. Move testcase to new directory. Move pass to right before SimplifyLibCalls - which is moved down a bit so we can take advantage of a few opts. llvm-svn: 95628	2010-02-09 00:35:38 +00:00
Chris Lattner	ae67ca33ed	convert to filecheck. llvm-svn: 95608	2010-02-08 23:47:34 +00:00
Devang Patel	557e4248cb	test case for r95604. llvm-svn: 95605	2010-02-08 23:27:46 +00:00
Chris Lattner	b6b2164e28	add an x86 implementation of MCTargetExpr for representing @GOT and friends. Use it for personality references as a first use. llvm-svn: 95588	2010-02-08 22:09:08 +00:00
Dan Gohman	4268d6a7c3	When CodeGen'ing unoptimized code, there may be unfolded constant expressions in global initializers. Instead of aborting, attempt to fold them on the spot. If folding succeeds, emit the folded expression instead. This fixes PR6255. llvm-svn: 95583	2010-02-08 22:02:38 +00:00
Dan Gohman	bd374da130	In guaranteed tailcall mode, don't decline the tailcall optimization for blocks ending in "unreachable". llvm-svn: 95565	2010-02-08 20:34:14 +00:00
Evan Cheng	ea5c6be766	Run codegen dce pass for all targets at all optimization levels. Previously it's only run for x86 with fastisel. I've found it being very effective in eliminating some obvious dead code as result of formal parameter lowering especially when tail call optimization eliminated the need for some of the loads from fixed frame objects. It also shrinks a number of the tests. A couple of tests no longer make sense and are now eliminated. llvm-svn: 95493	2010-02-06 09:07:11 +00:00
Evan Cheng	c72f7882c0	Remove a large test case that (soon will) no longer make sense. llvm-svn: 95492	2010-02-06 09:00:30 +00:00
Rafael Espindola	4536b9a904	Fix alignment on ppc linux. This fixes the build of crtend.o llvm-svn: 95477	2010-02-06 03:32:21 +00:00
Evan Cheng	d064aefefc	Do not emit callseq instructions around sibcalls. This eliminated some unnecessary stack adjustments. llvm-svn: 95475	2010-02-06 03:28:46 +00:00
Victor Hernandez	1b08138152	Function-local metadata whose operands had been optimized to no longer refer to function-local IR were not getting written by BitcodeWriter; solution is for these metadata to be enumerated just like global metadata. llvm-svn: 95467	2010-02-06 01:21:09 +00:00
Bob Wilson	a10e65c852	Add a test for my change to disable reassociation for i1 types. llvm-svn: 95465	2010-02-06 01:16:25 +00:00
Bob Wilson	5638c36efd	Handle AddrMode6 (for NEON load/stores) in Thumb2's rewriteT2FrameIndex. Radar 7614112. llvm-svn: 95456	2010-02-06 00:24:38 +00:00
Jakob Stoklund Olesen	5f9ead2714	Don't unroll loops containing function calls. llvm-svn: 95454	2010-02-05 23:21:31 +00:00
Chris Lattner	9d624778a3	fix incorrect encoding of SBB8mi that Kevin noticed. llvm-svn: 95448	2010-02-05 22:56:11 +00:00
Chris Lattner	d91f302a05	fix a case where we'd mis-encode fisttp because of an incorrect (and redundant with a correct one) pattern that was added for the disassembler. llvm-svn: 95446	2010-02-05 22:49:06 +00:00
Chris Lattner	d2e879a012	remove fixme llvm-svn: 95444	2010-02-05 22:46:46 +00:00
Jakob Stoklund Olesen	916f48a054	Teach SimplifyCFG about magic pointer constants. Weird code sometimes uses pointer constants other than null. This patch teaches SimplifyCFG to build switch instructions in those cases. Code like this: void f(const char x) { if (!x) puts("null"); else if ((uintptr_t)x == 1) puts("one"); else if (x == (char)2 \|\| x == (char)3) puts("two"); else if ((intptr_t)x == 4) puts("four"); else puts(x); } Now becomes a switch: define void @f(i8 %x) nounwind ssp { entry: %magicptr23 = ptrtoint i8* %x to i64 ; <i64> [#uses=1] switch i64 %magicptr23, label %if.else16 [ i64 0, label %if.then i64 1, label %if.then2 i64 2, label %if.then9 i64 3, label %if.then9 i64 4, label %if.then14 ] Note that LLVM's own DenseMap uses magic pointers. llvm-svn: 95439	2010-02-05 22:03:18 +00:00
Chris Lattner	64ffd11d49	fix logical-select to invoke filecheck right, and fix hte instcombine xform it is checking to actually pass. There is no need to match m_SelectCst<0, -1> since instcombine canonicalizes that into not(sext). Add matches for sext(not(x)) in addition to not(sext(x)). llvm-svn: 95420	2010-02-05 19:53:02 +00:00
Eric Christopher	04371b4f12	Remove this code for now. I have a better idea and will rewrite with that in mind. llvm-svn: 95402	2010-02-05 19:04:06 +00:00
Bill Wendling	994da1a479	Make test more fucused eliminating extraneous bits. llvm-svn: 95384	2010-02-05 11:21:05 +00:00
Evan Cheng	c8b4db77be	Fix test. llvm-svn: 95373	2010-02-05 06:37:00 +00:00
Evan Cheng	a366c61f77	Handle tail call with byval arguments. llvm-svn: 95351	2010-02-05 02:21:12 +00:00
Evan Cheng	3b245876c0	When the scheduler unfold a load folding instruction it move some of the predecessors to the unfolded load. It decides what gets moved to the load by checking whether the new load is using the predecessor as an operand. The check neglects the cases whether the predecessor is a flagged scheduling unit. rdar://7604000 llvm-svn: 95339	2010-02-05 01:27:11 +00:00
Bill Wendling	6510dc8dc3	An empty global constant (one of size 0) may have a section immediately following it. However, the EmitGlobalConstant method wasn't emitting a body for the constant. The assembler doesn't like that. Before, we were generating this: .zerofill __DATA, __common, __cmd, 1, 3 This fix puts us back to that semantic. llvm-svn: 95336	2010-02-05 00:17:02 +00:00
Jakob Stoklund Olesen	c7c89b8325	Fix small bug in handling instructions with more than one implicitly defined operand. ProcessImplicitDefs would only mark one operand per instruction with <undef>. This fixed PR6086. llvm-svn: 95319	2010-02-04 18:46:28 +00:00
Benjamin Kramer	72d36b59be	Get the LLVMC tests working with clang++ by removing the problematic CXXFLAG in lit. llvm-svn: 95318	2010-02-04 18:40:11 +00:00
Chris Lattner	f3c6b5008a	fix a broken archive that was breaking dejagnu only (not lit) after r95292 llvm-svn: 95296	2010-02-04 07:11:08 +00:00
Evan Cheng	aeba2250a5	Re-enable x86 tail call optimization. llvm-svn: 95295	2010-02-04 06:47:24 +00:00
Eric Christopher	107a1fbf61	Temporarily revert this since it appears to have caused a build failure. llvm-svn: 95294	2010-02-04 06:41:27 +00:00
Chris Lattner	8228b11abc	add support for the sparcv9-- target triple to turn on 64-bit sparc codegen. Patch by Nathan Keynes! llvm-svn: 95293	2010-02-04 06:34:01 +00:00
Chris Lattner	21fb024cc0	From PR6228: "Attached patch removes the extra NUL bytes from the output and changes test/Archive/MacOSX.toc from a binary to a text file (removes svn:mime-type=application/octet-stream and adds svn:eol-style=native). I can't figure out how to get SVN to include the new contents of the file in the patch so I'm attaching it separately." Patch by James Abbatiello! llvm-svn: 95292	2010-02-04 06:19:43 +00:00
Eric Christopher	42fa84a880	Rework constant expr and array handling for objectsize instcombining. Fix bugs where we would compute out of bounds as in bounds, and where we couldn't know that the linker could override the size of an array. Add a few new testcases, change existing testcase to use a private global array instead of extern. llvm-svn: 95283	2010-02-04 02:55:34 +00:00
Victor Hernandez	d44ee35f30	Fix (and test) function-local metadata that occurs before the instruction that it refers to; fix is to not enumerate operands of function-local metadata until after all instructions have been enumerated llvm-svn: 95269	2010-02-04 01:13:08 +00:00
Eric Christopher	f12e18db21	If we're dealing with a zero-length array, don't lower to any particular size, we just don't know what the length is yet. llvm-svn: 95266	2010-02-03 23:56:07 +00:00
Dale Johannesen	e5177e685c	This test passes now on ppc darwin; if it doesn't pass on some other ppc say something on the list. llvm-svn: 95265	2010-02-03 22:33:17 +00:00
Dale Johannesen	c5df1559ca	This test passes now on ppc darwin, so reenable it. llvm-svn: 95264	2010-02-03 22:29:02 +00:00
Dale Johannesen	0c426100d0	Debugging is now reenabled on PPC darwin, so reenable these tests (they pass). llvm-svn: 95263	2010-02-03 22:24:49 +00:00
Evan Cheng	f4139067ee	Speculatively disable x86 automatic tail call optimization while we track down a self-hosting issue. llvm-svn: 95259	2010-02-03 21:40:40 +00:00
Evan Cheng	112a871fe2	Make test less fragile llvm-svn: 95258	2010-02-03 21:39:04 +00:00
Kevin Enderby	00f1e6c030	Added support for X86 instruction prefixes so llvm-mc can assemble them. The Lock prefix, Repeat string operation prefixes and the Segment override prefixes. Also added versions of the move string and store string instructions without the repeat prefixes to X86InstrInfo.td. And finally marked the rep versions of move/store string records in X86InstrInfo.td as isCodeGenOnly = 1 so tblgen is happy building the disassembler files. llvm-svn: 95252	2010-02-03 21:04:42 +00:00
Daniel Dunbar	c0f5f284d4	Add llvm_supports_darwin_and_target to DejaGNU as well, I'd almost forgotten it ever existed. :) llvm-svn: 95230	2010-02-03 18:43:46 +00:00
Evan Cheng	27a41d5473	Revert 94937 and move the noreturn check to codegen. llvm-svn: 95198	2010-02-03 03:55:59 +00:00
Evan Cheng	40905b4302	Allow all types of callee's to be tail called. But avoid automatic tailcall if the callee is a result of bitcast to avoid losing necessary zext / sext etc. llvm-svn: 95195	2010-02-03 03:28:02 +00:00
Dale Johannesen	a466692552	Reapply 95050 with a tweak to check the register class. llvm-svn: 95183	2010-02-03 01:40:33 +00:00
Chris Lattner	dee74e2805	make these less sensitive to asm verbose changes by disabling it for them. llvm-svn: 95175	2010-02-03 00:48:53 +00:00
Eric Christopher	d86233c118	Recommit this, looks like it wasn't the cause. llvm-svn: 95165	2010-02-03 00:21:58 +00:00
Daniel Dunbar	bdbffbedf0	AsmParser/X86: Add temporary hack to allow parsing "sal". Eventually we need some mechanism for specifying alternative syntaxes, but I'm not sure what form that should take yet. llvm-svn: 95158	2010-02-02 23:46:47 +00:00
Eric Christopher	e67d01a9a8	Hopefully temporarily revert this. llvm-svn: 95154	2010-02-02 23:01:31 +00:00
Chris Lattner	73051044fd	remove the # TAILCALL markers, which was causing the to fail. It's unclear if the matcher is nondeterminstic of what here, but I'm getting matches without TAILCALL and some other hosts are getting matches with it. llvm-svn: 95149	2010-02-02 22:36:29 +00:00
Eric Christopher	4264e7e46f	Re-add strcmp and known size object size checking optimization. Passed bootstrap and nightly test run here. llvm-svn: 95145	2010-02-02 22:10:43 +00:00
Daniel Dunbar	09d81caa12	MCAssembler/Darwin: Add a test (on Darwin) that we assemble a bunch of instructions exactly like 'as', and produce equivalent .o files. llvm-svn: 95143	2010-02-02 22:00:15 +00:00
Daniel Dunbar	255a8c8b13	MC/Mach-O: Set SOME_INSTRUCTIONS bit for sections. llvm-svn: 95135	2010-02-02 21:44:01 +00:00
Chris Lattner	de9b3ada5d	this apparently depends on the host somehow. llvm-svn: 95122	2010-02-02 20:57:28 +00:00
Bill Wendling	ed0278c37f	XFAIL for PPC Darwin. llvm-svn: 95121	2010-02-02 20:56:02 +00:00
Chris Lattner	2481509162	disable this test for now. llvm-svn: 95120	2010-02-02 20:41:39 +00:00
Kevin Enderby	db32c4567b	Added another version of the X86 assembler matcher test case. This test case is different subset of the full auto generated test case, and a larger subset that is in x86_32-bit.s (that set will encode correctly). These instructions can pass though llvm-mc as it were a logical cat(1) and then reassemble to the same instruction. It is useful as we bring up the parser and matcher so we don't break things that currently work. llvm-svn: 95107	2010-02-02 19:05:57 +00:00
Dale Johannesen	da431c76fb	Test revert 95050; there's a good chance it's causing buildbot failure. llvm-svn: 95103	2010-02-02 18:52:56 +00:00
Chris Lattner	8e2c471614	don't turn (A & (C0?-1:0)) \| (B & ~(C0?-1:0)) -> C0 ? A : B for vectors. Codegen is generating awful code or segfaulting in various cases (e.g. PR6204). llvm-svn: 95058	2010-02-02 02:43:51 +00:00
Chris Lattner	302240d73e	fix a crash in loop unswitch on a loop invariant vector condition. llvm-svn: 95055	2010-02-02 02:26:54 +00:00
Chris Lattner	29bb9272a6	remove an unreduced testcase, rename another. llvm-svn: 95054	2010-02-02 02:23:37 +00:00
Evan Cheng	55afd2564c	Perform sibcall in some cases when arguments are passes memory. Look for cases where callee's arguments are already in the caller's own caller's stack and they line up perfectly. e.g. extern int foo(int a, int b, int c); int bar(int a, int b, int c) { return foo(a, b, c); } llvm-svn: 95053	2010-02-02 02:22:50 +00:00
Dale Johannesen	c84816a62e	Make local RA smarter about reusing input register of a copy as output. Needed for (functional) correctness in inline asm, and should be generally beneficial. 7361612. llvm-svn: 95050	2010-02-02 02:08:02 +00:00
Dan Gohman	f644af8bbe	Factor out alignof expression folding into a separate function and generalize it to handle more cases. llvm-svn: 95045	2010-02-02 01:41:39 +00:00
Dale Johannesen	257d2dafbd	Testcase for 94996 (PR 6157) llvm-svn: 95021	2010-02-01 22:46:05 +00:00
Evan Cheng	a49d8e6d38	Fix PR6196. GV callee may not be a function. llvm-svn: 95017	2010-02-01 22:40:09 +00:00
Evan Cheng	4eb3d2867c	Add test case for 95013. llvm-svn: 95014	2010-02-01 22:32:42 +00:00
Chris Lattner	94eb4b285b	fix PR6195, a bug constant folding scalar -> vector compares. llvm-svn: 94997	2010-02-01 20:04:40 +00:00
Chris Lattner	3c46e14137	fix PR6197 - infinite recursion in ipsccp due to block addresses evaluateICmpRelation wasn't handling blockaddress. llvm-svn: 94993	2010-02-01 19:35:08 +00:00
Dan Gohman	36bca4e4ba	Update this test for a trivial register allocation difference. llvm-svn: 94989	2010-02-01 19:00:32 +00:00
Dan Gohman	e5e1b7b05a	Generalize target-independent folding rules for sizeof to handle more cases, and implement target-independent folding rules for alignof and offsetof. Also, reassociate reassociative operators when it leads to more folding. Generalize ScalarEvolution's isOffsetOf to recognize offsetof on arrays. Rename getAllocSizeExpr to getSizeOfExpr, and getFieldOffsetExpr to getOffsetOfExpr, for consistency with analagous ConstantExpr routines. Make the target-dependent folder promote GEP array indices to pointer-sized integers, to make implicit casting explicit and exposed to subsequent folding. And add a bunch of testcases for this new functionality, and a bunch of related existing functionality. llvm-svn: 94987	2010-02-01 18:27:38 +00:00
Chris Lattner	846a52e228	fix rdar://7590304, a miscompilation of objc apps on arm. The caller of objc message send was getting marked arm_apcscc, but the prototype isn't. This is fine at runtime because objcmsgsend is implemented in assembly. Only turn a mismatched caller and callee into 'unreachable' if the callee is a definition. llvm-svn: 94986	2010-02-01 18:11:34 +00:00
Chris Lattner	2cecedf081	fix rdar://7590304, an infinite loop in instcombine. In the invoke case, instcombine can't zap the invoke for fear of changing the CFG. However, we have to do something to prevent the next iteration of instcombine from inserting another store -> undef before the invoke thereby getting into infinite iteration between dead store elim and store insertion. Just zap the callee to null, which will prevent the next iteration from doing anything. llvm-svn: 94985	2010-02-01 18:04:58 +00:00
Evan Cheng	ed8ca56eeb	Undo r94946 now all the tests are passing again. llvm-svn: 94970	2010-02-01 02:13:39 +00:00
Evan Cheng	7f62def0f9	Avoid recursive sibcall's. llvm-svn: 94946	2010-01-31 06:44:49 +00:00
Eli Friedman	690c7f4dfd	Remove test which is no longer relevant. llvm-svn: 94944	2010-01-31 04:40:45 +00:00
Eli Friedman	a2cc2875fc	Simplify/generalize the xor+add->sign-extend instcombine. llvm-svn: 94943	2010-01-31 04:29:12 +00:00
Eli Friedman	37a8197b61	Add a small transform: transform -(X<<Y) to (-X<<Y) when the shift has a single use and X is free to negate. llvm-svn: 94941	2010-01-31 02:30:23 +00:00
Evan Cheng	d86d3fe0c3	Do not mark no-return calls tail calls. It'll screw up special calls like longjmp and it doesn't make much sense for performance reason. If my logic is faulty, please let me know. llvm-svn: 94937	2010-01-31 00:59:31 +00:00
Anton Korobeynikov	25df248382	Fix a gross typo: ARMv6+ may or may not support unaligned memory operations. Even if they are suported by the core, they can be disabled (this is just a configuration bit inside some register). Allow unaligned memops on darwin and conservatively disallow them otherwise. llvm-svn: 94889	2010-01-30 14:08:12 +00:00
Bob Wilson	56600a15ad	Check alignment of loads when deciding whether it is safe to execute them unconditionally. Besides checking the offset, also check that the underlying object is aligned as much as the load itself. llvm-svn: 94875	2010-01-30 04:42:39 +00:00
Evan Cheng	70f714fdbe	Allow more tailcall optimization: calls with inputs that are all passed in registers. llvm-svn: 94873	2010-01-30 01:22:00 +00:00
Daniel Dunbar	76e5d70c57	MC/X86 AsmParser: Handle absolute memory operands correctly. We were doing something totally broken and parsing them as immediates, but the .td file also had the wrong match class so things sortof worked. Except, that is, that we would parse movl $0, %eax as movl 0, %eax Feel free to guess how well that worked. llvm-svn: 94869	2010-01-30 01:02:48 +00:00
Bob Wilson	2e51136f80	Remove ARM-specific calling convention from this test. Target data is needed for this test, but otherwise, there's nothing ARM-specific about it and no need to specify the calling convention. llvm-svn: 94862	2010-01-30 00:40:23 +00:00
Daniel Dunbar	7f0421eebb	MC/X86: Add a nice X86 assembler matcher test case from Kevin Enderby. - This test case is auto generated, and has been verified to round-trip correctly through llvm-mc by checking the assembled .o file before and after piping through llvm-mc. It will be extended over time as the matcher grows support for more instructions. llvm-svn: 94857	2010-01-29 23:32:40 +00:00
Eric Christopher	5a0e174863	Revert my last couple of patches. They appear to have broken bison. llvm-svn: 94841	2010-01-29 21:16:24 +00:00
Bob Wilson	7c42b9d51e	Improve isSafeToLoadUnconditionally to recognize that GEPs with constant indices are safe if the result is known to be within the bounds of the underlying object. llvm-svn: 94829	2010-01-29 19:19:08 +00:00
Evan Cheng	297a494f55	Catch more trivial tail call opportunities: no inputs and output types match. llvm-svn: 94804	2010-01-29 06:45:59 +00:00
Eric Christopher	9b3c02b7da	Make strcpy_chk lower to strcpy if we have a safe size. llvm-svn: 94783	2010-01-29 01:37:11 +00:00
Eric Christopher	997f7ca8c5	Add constant support to object size handling and remove default lowering. We'll either figure it out, or not and be lowered by SelectionDAGBuild. Add test. llvm-svn: 94775	2010-01-29 01:09:57 +00:00
Dan Gohman	a424b9fbd1	Remove the folding rule getelementptr (i8* inttoptr (i64 1 to i8), i32 -1) to inttoptr (i64 0 to i8) from the VMCore constant folder. It didn't handle sign-extension properly in the case where the source integer is smaller than a pointer size. And, it relied on an assumption about sizeof(i8). The Analysis constant folder still folds these kinds of things; it has access to TargetData, so it can do them right. Add a testcase which tests that the VMCore constant folder doesn't miscompile this, and that the Analysis folder does fold it. llvm-svn: 94750	2010-01-28 18:08:26 +00:00
Duncan Sands	3a48b87c54	Fix PR6165. The bug was that LHSKnownZero was being and'd with DemandedMask when it should have been and'd with LowBits. Fix that and while there beef up the logic in the case of a negative LHS. llvm-svn: 94745	2010-01-28 17:22:42 +00:00
Chris Lattner	cc9a6f0580	convert the last 3 targets to use EmitFunctionBody() now that it has before/end body hooks. lib/Target/Alpha/AsmPrinter/AlphaAsmPrinter.cpp \| 49 ++----------- lib/Target/Mips/AsmPrinter/MipsAsmPrinter.cpp \| 87 ++++++------------------ lib/Target/XCore/AsmPrinter/XCoreAsmPrinter.cpp \| 56 +++------------ test/CodeGen/XCore/ashr.ll \| 2 4 files changed, 48 insertions(+), 146 deletions(-) llvm-svn: 94741	2010-01-28 06:22:43 +00:00
Evan Cheng	346af88396	Fix a bug introduced by r94490 where it created a X86ISD::CMP whose output type is different from its inputs. This fixes PR6146. llvm-svn: 94731	2010-01-28 01:57:22 +00:00
Chris Lattner	73de5fbfc3	Give AsmPrinter the most common expected implementation of runOnMachineFunction, and switch PPC to use EmitFunctionBody. The two ppc asmprinters now don't heave to define runOnMachineFunction. llvm-svn: 94722	2010-01-28 01:28:58 +00:00
Chris Lattner	565896b9eb	emit a 0 byte instead of a noop if a function is empty on darwin. "0" is nice and target independent. llvm-svn: 94718	2010-01-28 01:06:32 +00:00
Bob Wilson	7577e948e4	Avoid creating redundant PHIs in SSAUpdater::GetValueInMiddleOfBlock. This was already being done in SSAUpdater::GetValueAtEndOfBlock so I've just changed SSAUpdater to check for existing PHIs in both places. llvm-svn: 94690	2010-01-27 22:01:02 +00:00
Chandler Carruth	4fe7a3bc08	Quick fix to a test that is currently failing on every Linux build bot. No idea if this is the "correct" fix, but it seems a strict improvement. llvm-svn: 94675	2010-01-27 10:36:15 +00:00
Duncan Sands	1a0203e057	Revert commit 94666 (ddunbar) [Suppress clang warning about unused arguments]. It causes g++ to complain: unrecognized option '-Qunused-arguments' llvm-svn: 94670	2010-01-27 10:08:08 +00:00
Daniel Dunbar	f99134f470	Suppress clang warning about unused arguments. llvm-svn: 94666	2010-01-27 07:10:10 +00:00
Evan Cheng	85476f304c	Perform trivial tail call optimization for callees with "C" ABI. These are done even when -tailcallopt is not specified and it does not require changing ABI. First case is the most trivial one. Perform tail call optimization when both the caller and callee do not return values and when the callee does not take any input arguments. llvm-svn: 94664	2010-01-27 06:25:16 +00:00
Victor Hernandez	477d9274bb	When converting dbg.declare to dbg.value, attach promoted store's debug metadata to dbg.value llvm-svn: 94634	2010-01-27 00:44:36 +00:00
Chris Lattner	b657c4cdc3	emit jump table an alias ".set" directives through MCStreamer as assignments. .set x, a-b is the same as: x = a-b llvm-svn: 94596	2010-01-26 21:53:08 +00:00
Rafael Espindola	dcb03f0f6b	Emit .comm alignment in bytes but .align in powers of 2 for ARM ELF. Original patch by Sandeep Patel and updated by me. llvm-svn: 94582	2010-01-26 20:21:43 +00:00
Chris Lattner	3dd38a8112	eliminate MCAsmInfo::NeedsSet: we now just use .set on any platform that has it. llvm-svn: 94581	2010-01-26 20:20:43 +00:00
Dan Gohman	80386c10d4	-disable-output is no longer needed with -analyze. llvm-svn: 94574	2010-01-26 19:25:59 +00:00
Dan Gohman	51aaf02821	Fix the the ceiling-division used in computing the MaxBECount so that it doesn't have trouble with an intermediate add overflowing. Also, be more conservative about the case where the induction variable in an SLT loop exit can step past the RHS of the SLT and overflow in a single step. Make getSignedRange more aggressive, to recover for some common cases which the above fixes pessimized. This addresses rdar://7561161. llvm-svn: 94512	2010-01-26 04:40:18 +00:00
Victor Hernandez	cd94410152	In mem2reg, for all alloca/stores that get promoted where the alloca has an associated llvm.dbg.declare instrinsic, insert an llvm.dbg.var intrinsic before each store. llvm-svn: 94493	2010-01-26 02:42:15 +00:00
Evan Cheng	555f61bf58	Implement cond ? -1 : 0 with sbb. llvm-svn: 94490	2010-01-26 02:00:44 +00:00
Dale Johannesen	d5575f29f1	Generate DEBUG_VALUE comments on x86. The (limited) dbg.declare's we currently generate go through both register allocators without perturbing the results. llvm-svn: 94480	2010-01-26 00:09:58 +00:00
Dan Gohman	00f4747bad	Fix the bitcode reader to deserialize nuw/nsw/etc. bits properly in the case of a forward-reference, which doesn't use an "abbrev" encoding. llvm-svn: 94454	2010-01-25 21:55:39 +00:00
Chris Lattner	d45adf28de	wirte up .file and .file to the mc asmparser. llvm-svn: 94438	2010-01-25 19:02:58 +00:00
Victor Hernandez	8a588e1444	Revert r94260 until findDbgDeclare() is made more efficient llvm-svn: 94432	2010-01-25 17:52:13 +00:00
Rafael Espindola	4cb52db485	Update test for darwin. llvm-svn: 94421	2010-01-25 15:32:10 +00:00
Chris Lattner	9b83727cfe	we removed support for darwin8 tools. llvm-svn: 94414	2010-01-25 07:43:40 +00:00
Rafael Espindola	a1141dd6ab	Fix PR6134. We are not emitting alignments on Darwin for "bar". Not sure what is the correct way to do it. llvm-svn: 94400	2010-01-25 02:27:39 +00:00
Daniel Dunbar	75652a6f2b	Attempt to unbreak test on Linux. Chris, please check. llvm-svn: 94399	2010-01-25 00:54:13 +00:00
Chris Lattner	45dd2327cb	just remove this test, it is not reduced, is not clear what its testing for and it is dying due to fragility in the asmprinter .s comments. llvm-svn: 94372	2010-01-24 19:23:09 +00:00
Chris Lattner	de765a3f39	this test has been failing or a long time, just disable it for now to get back to green. llvm-svn: 94371	2010-01-24 19:13:39 +00:00
Chris Lattner	807a3bcbbb	fix a parsing problem on instructions like: movw $8, (_cost_table_-L97$pb)+66(%eax) After the parens, we could still have a binop. llvm-svn: 94345	2010-01-24 01:07:33 +00:00

... 27 28 29 30 31 ...

11897 Commits