llvm-project

Commit Graph

Author	SHA1	Message	Date
Reed Kotler	ec8a54904e	Remove the form field from Mips16 instruction formats and set things up so that we can apply the direct object emitter patch. This patch should be a nop right now and its test is to not break what is already there. llvm-svn: 175126	2013-02-14 03:05:25 +00:00
Rafael Espindola	3c818086f2	Don't assume the mangling of static functions. llvm-svn: 175121	2013-02-14 02:49:18 +00:00
Rafael Espindola	764993493c	Don't assume that a static function in an extern "C" block will not be mangled. Since functions with internal linkage don't have language linkage, it is valid to overload them: extern "C" { static int foo(); static int foo(int); } So we mangle them. llvm-svn: 175120	2013-02-14 01:58:08 +00:00
Weiming Zhao	090edf7e67	temporarily revert the patch due to some conflicts llvm-svn: 175107	2013-02-13 23:24:40 +00:00
Anshuman Dasgupta	e96f804eba	Hexagon: add support for predicate-GPR copies. llvm-svn: 175102	2013-02-13 22:56:34 +00:00
Tom Stellard	91da4e9199	R600: Add support for 128-bit parameters NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175096	2013-02-13 22:05:20 +00:00
Nick Lewycky	beba972659	Don't build tail calls to functions with three inreg arguments on x86-32 PIC. Fixes PR15250! llvm-svn: 175092	2013-02-13 21:59:15 +00:00
Weiming Zhao	0632a4b002	Bug fix 13622: Add paired register support for inline asm with 64-bit data on ARM llvm-svn: 175088	2013-02-13 21:43:02 +00:00
Jyotsna Verma	d92252469e	Hexagon: Use absolute addressing mode loads/stores for global+offset instead of redefining separate instructions for them. llvm-svn: 175086	2013-02-13 21:38:46 +00:00
Chad Rosier	282edd7caa	[ms-inline-asm] Add support for memory references that have non-immediate displacements. rdar://12974533 llvm-svn: 175083	2013-02-13 21:33:44 +00:00
Reed Kotler	f662cff689	For Mips 16, add the optimization where the 16 bit form of addiu sp can be used if the offset fits in 11 bits. This makes use of the fact that the abi requires sp to be 8 byte aligned so the actual offset can fit in 8 bits. It will be shifted left and sign extended before being actually used. The assembler or direct object emitter will shift right the 11 bit signed field by 3 bits. We don't need to deal with that here. llvm-svn: 175073	2013-02-13 20:28:27 +00:00
Andrew Trick	553e0fe365	MIsched: HazardRecognizers are created for each DAG. Free them. llvm-svn: 175067	2013-02-13 19:22:27 +00:00
Krzysztof Parzyszek	2680b53d90	Add registration for PPC-specific passes to allow the IR to be dumped via -print-after-all. llvm-svn: 175058	2013-02-13 17:40:07 +00:00
Benjamin Kramer	8e2637e2b0	X86: Disable generation of rep;movsl when %esi is used as a base pointer. This happens when there is both stack realignment and a dynamic alloca in the function. If we overwrite %esi (rep;movsl uses fixed registers) we'll lose the base pointer and the next register spill will write into oblivion. Fixes PR15249 and unbreaks firefox on i386/freebsd. Mozilla uses dynamic allocas and freebsd a 4 byte stack alignment. llvm-svn: 175057	2013-02-13 13:40:35 +00:00
Reed Kotler	9cb8e7b9f5	Make jumptables work for -static llvm-svn: 175044	2013-02-13 08:32:14 +00:00
Elena Demikhovsky	9e0df7cb01	Prevent insertion of "vzeroupper" before call that preserves YMM registers, since a caller uses preserved registers across the call. llvm-svn: 175043	2013-02-13 08:02:04 +00:00
Eric Christopher	389ee71b0a	Check i1 as well as i8 variables for 8 bit registers for x86 inline assembly. llvm-svn: 175036	2013-02-13 06:01:05 +00:00
David Peixotto	4299cf83a3	Test commit. Fixed typo. llvm-svn: 175020	2013-02-13 00:36:35 +00:00
Jyotsna Verma	39f7a2b7a0	Hexagon: Add support to generate predicated absolute addressing mode instructions. llvm-svn: 174973	2013-02-12 16:06:23 +00:00
Justin Holewinski	be8dc6499a	[NVPTX] Disable vector registers Vectors were being manually scalarized by the backend. Instead, let the target-independent code do all of the work. The manual scalarization was from a time before good target-independent support for scalarization in LLVM. However, this forces us to specially-handle vector loads and stores, which we can turn into PTX instructions that produce/consume multiple operands. llvm-svn: 174968	2013-02-12 14:18:49 +00:00
Michel Danzer	3bb17ebd93	R600: Fix regression with shadow array sampler on pre-SI GPUs. 'R600/SI: Use proper instructions for array/shadow samplers.' removed two cases from TEX_SHADOW. Vincent Lejeune reported on IRC that this broke some shadow array piglit tests with the r600g driver. Reinstating the removed cases should fix this, and still works with radeonsi as well. I will follow up with some lit tests which would have caught the regression. NOTE: This is a candidate for the Mesa stable branch. Tested-by: Vincent Lejeune <vljn@ovi.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174963	2013-02-12 12:11:23 +00:00
Arnold Schwaighofer	89aef93841	ARM cost model: Add vector reverse shuffle costs A reverse shuffle is lowered to a vrev and possibly a vext instruction (quad word). radar://13171406 llvm-svn: 174933	2013-02-12 02:40:39 +00:00
Arnold Schwaighofer	1f3d3ca769	ARM NEON: Handle v16i8 and v8i16 reverse shuffles Lower reverse shuffles to a vrev64 and a vext instruction instead of the default legalization of storing and loading to the stack. This is important because we generate reverse shuffles in the loop vectorizer when we reverse store to an array. uint8_t Arr[N]; for (i = 0; i < N; ++i) Arr[N - i - 1] = ... radar://13171760 llvm-svn: 174929	2013-02-12 01:58:32 +00:00
Kay Tiong Khoo	ab588efe42	Added 0x0D to 2-byte opcode extension table for prefetch* variants Fixed decode of existing 3dNow prefetchw instruction Intel is scheduled to add a compatible prefetchw (same encoding) to future CPUs llvm-svn: 174920	2013-02-12 00:19:12 +00:00
Akira Hatanaka	bf1af1acc7	[mips] Expand pseudo instructions before they are emitted in MipsCodeEmitter.cpp. JALR and NOP are expanded by function emitPseudoExpansionLowering, which is not called when the old JIT is used. This fixes the following tests which have been failing on llvm-mips-linux builder: LLVM :: ExecutionEngine__2003-01-04-LoopTest.ll LLVM :: ExecutionEngine__2003-05-06-LivenessClobber.ll LLVM :: ExecutionEngine__2003-06-04-bzip2-bug.ll LLVM :: ExecutionEngine__2005-12-02-TailCallBug.ll LLVM :: ExecutionEngine__2003-10-18-PHINode-ConstantExpr-CondCode-Failure.ll LLVM :: ExecutionEngine__hello2.ll LLVM :: ExecutionEngine__stubs.ll LLVM :: ExecutionEngine__test-branch.ll LLVM :: ExecutionEngine__test-call.ll LLVM :: ExecutionEngine__test-common-symbols.ll LLVM :: ExecutionEngine__test-loadstore.ll LLVM :: ExecutionEngine__test-loop.ll llvm-svn: 174912	2013-02-11 22:35:40 +00:00
Akira Hatanaka	3d38609fdd	[mips] Fix indentation. llvm-svn: 174907	2013-02-11 22:03:52 +00:00
Krzysztof Parzyszek	9a278f108a	Extend Hexagon hardware loop generation to handle various additional cases: - variety of compare instructions, - loops with no preheader, - arbitrary lower and upper bounds. llvm-svn: 174904	2013-02-11 21:37:55 +00:00
Krzysztof Parzyszek	cfe285e604	Implement HexagonInstrInfo::analyzeCompare. llvm-svn: 174901	2013-02-11 20:04:29 +00:00
Kay Tiong Khoo	d30b1a2ac7	fixed disassembly of some i386 system insts with intel syntax added file for test cases for i386 intel syntax llvm-svn: 174900	2013-02-11 19:46:36 +00:00
Michel Danzer	10ed47f927	R600/SI: Use V_ADD_F32 instead of V_MOV_B32 for clamp/neg/abs modifiers. The modifiers don't seem to have any effect with V_MOV_B32, supposedly it's meant to just move bits untouched. Fixes 46 piglit tests with radeonsi, though unfortunately 11 of those had just regressed because they started using the clamp modifier. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174890	2013-02-11 15:58:21 +00:00
Tim Northover	be867971cb	AArch64: fix build on some MSVC versions This does two things: It removes a call to abs() which may have "long long" parameter on Windows, which is not necessarily available in C++03. It also corrects the signedness of Amount, which was relying on implementation-defined conversions previously. Code was already tested (albeit in an implementation-defined way) so no extra tests. llvm-svn: 174885	2013-02-11 14:25:52 +00:00
Tim Northover	e206778833	AArch64: Simplify logic in deciding whether bfi is valid Previous code had a confusing comment which was mostly an implementation detail. This condition corresponds to "lsb up to register width" and "width not ridiculous". llvm-svn: 174877	2013-02-11 12:32:18 +00:00
Tim Northover	60baeb984f	Make use of DiagnosticType to provide better AArch64 diagnostics. This gives a DiagnosticType to all AsmOperands in sight. This replaces all "invalid operand" diagnostics with something more specific. The messages given should still be sufficiently vague that they're not usually actively misleading when LLVM guesses your instruction incorrectly. llvm-svn: 174871	2013-02-11 09:29:37 +00:00
Evan Cheng	615620c9e8	Currently, codegen may spend some time in SDISel passes even if an entire function is successfully handled by fast-isel. That's because function arguments are always handled by SDISel. Introduce FastLowerArguments to allow each target to provide a hook to handle formal argument lowering. As a proof-of-concept, add ARMFastIsel::FastLowerArguments to handle functions with 4 or fewer scalar integer (i8, i16, or i32) arguments. It completely eliminates the need for SDISel for trivial functions. rdar://13163905 llvm-svn: 174855	2013-02-11 01:27:15 +00:00
Joel Jones	440d8e48ae	Spelling correction llvm-svn: 174852	2013-02-10 23:56:30 +00:00
Vincent Lejeune	44bf8158c5	Test Commit - Remove some trailing whitespace in R600Instructions.td llvm-svn: 174839	2013-02-10 17:57:33 +00:00
Justin Holewinski	36a50991e7	[NVPTX] Make address space errors more explicit (llvm_unreachable -> report_fatal_error) llvm-svn: 174808	2013-02-09 13:34:15 +00:00
Tom Stellard	47d4201348	R600: Dump the function name when TargetLowering::LowerCall() fails Also output a more useful error message. NOTE: This is a candidate for the Mesa stable branch llvm-svn: 174763	2013-02-08 22:24:40 +00:00
Tom Stellard	7370ede2cd	R600: rework flow creation in the structurizer v2 This fixes a couple of bugs and incorrect assumptions, in total four more piglit tests now pass. v2: fix small bug in the dominator updating Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 174762	2013-02-08 22:24:38 +00:00
Tom Stellard	048f14fd3b	R600: fix loop analyses in the structurizer Patch by: Christian König Intersecting loop handling was wrong. Signed-off-by: Christian König <christian.koenig@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 174761	2013-02-08 22:24:37 +00:00
Tom Stellard	7ec0e4fbe3	R600: fix PHI value adding in the structurizer Otherwise we sometimes produce invalid code. Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 174760	2013-02-08 22:24:35 +00:00
Reed Kotler	b9bf8dca47	Add the 16 bit version of addiu. To the assembler, the 16 and 32 bit are the same so we put in the comment field an indicator when we think we are emitting the 16 bit version. For the direct object emitter, the difference is important as well as for other passes which need an accurate count of program size. There will be other similar putbacks to this for various instructions. llvm-svn: 174747	2013-02-08 21:42:56 +00:00
Bill Schmidt	62fe7a5b17	Refine fix to bug 15041. Thanks to help from Nadav and Hal, I have a more reasonable (and even correct!) approach. This specifically penalizes the insertelement and extractelement operations for the performance hit that will occur on PowerPC processors. llvm-svn: 174725	2013-02-08 18:19:17 +00:00
Arnold Schwaighofer	594fa2dc2b	ARM cost model: Address computation in vector mem ops not free Adds a function to target transform info to query for the cost of address computation. The cost model analysis pass now also queries this interface. The code in LoopVectorize adds the cost of address computation as part of the memory instruction cost calculation. Only there, we know whether the instruction will be scalarized or not. Increase the penalty for inserting into D registers on swift. This becomes necessary because we now always assume that address computation has a cost and three is a closer value to the architecture. radar://13097204 llvm-svn: 174713	2013-02-08 14:50:48 +00:00
Reed Kotler	66165c8f96	When Mips16 frames grow large, the immediate field may exceed the maximum allowed size for the instruction. This code uses RegScavenger to fix this. We sometimes need 2 registers for Mips16 so we must handle things differently than how register scavenger is normally used. llvm-svn: 174696	2013-02-08 03:57:41 +00:00
Akira Hatanaka	a061281556	[mips] Make Filler a class and reduce indentation. llvm-svn: 174666	2013-02-07 21:32:32 +00:00
Bill Schmidt	b3cece13cf	Constrain PowerPC autovectorization to fix bug 15041. Certain vector operations don't vectorize well with the current PowerPC implementation. Element insert/extract performs poorly without VSX support because Altivec requires going through memory. SREM, UREM, and VSELECT all produce bad scalar code. There's a lot of work to do for the cost model before autovectorization will be tuned well, and this is not an attempt to address the larger problem. llvm-svn: 174660	2013-02-07 20:33:57 +00:00
Akira Hatanaka	061d1ea5da	[mips] Add definition of JALR instruction which has two register operands. Change the original JALR instruction with one register operand to be a pseudo-instruction. llvm-svn: 174657	2013-02-07 19:48:00 +00:00
Tom Stellard	1c822a8929	R600/SI: cleanup VGPR encoding Remove all the unused code. Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174656	2013-02-07 19:39:45 +00:00
Tom Stellard	aac1889a84	R600/SI: Handle VGPR64 destination in copyPhysReg(). Allows nexuiz to run with radeonsi. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174655	2013-02-07 19:39:43 +00:00
Tom Stellard	ecacb8010d	R600/SI: Add pattern for mul. 20 more little piglits with radeonsi. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174654	2013-02-07 19:39:42 +00:00
Tom Stellard	8909380e71	R600/SI: simplify and fix SMRD encoding The _SGPR variants were wrong. Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174653	2013-02-07 19:39:40 +00:00
Tom Stellard	26075d58a2	R600/SI: add proper 64bit immediate support v2 v2: rebased on current upstream Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174652	2013-02-07 19:39:38 +00:00
Tom Stellard	4ded0c1c42	R600: Add an explicit default processor This is for the case when no processor is passed to the backend. This prevents the '' is not a recognized processor for this target (ignoring processor) warning from being generated by clang. llvm-svn: 174651	2013-02-07 19:39:34 +00:00
Tom Stellard	462516b737	R600/SI: Use proper instructions for array/shadow samplers. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174634	2013-02-07 17:02:14 +00:00
Tom Stellard	ae6c06e5de	R600/SI: Make sample intrinsic address parameter type overloaded. Handle vectors of 1 to 16 integers. Change the intrinsic names to prevent the wrong one from being selected at runtime due to the overloading. Patch By: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174633	2013-02-07 17:02:13 +00:00
Tom Stellard	538ceeb6e0	R600/SI: Add basic support for more integer vector types. v1i32, v2i32, v8i32 and v16i32. Only add VGPR register classes for integer vector types, to avoid attempts copying from VGPR to SGPR registers, which is not possible. Patch By: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174632	2013-02-07 17:02:09 +00:00
Arnold Schwaighofer	213fced704	ARM cost model: Add costs for vector selects Vector selects are cheap on NEON. They get lowered to a vbsl instruction. radar://13158753 llvm-svn: 174631	2013-02-07 16:10:15 +00:00
Michel Danzer	349cabed2f	R600/SI: Add pattern for flog2 22 more little piglits with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174615	2013-02-07 14:55:16 +00:00
Tom Stellard	9355b22180	R600: Consolidate sub register indices. Use sub0-15 everywhere. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 174610	2013-02-07 14:02:37 +00:00
Tom Stellard	e06163a9a6	R600: Add support for SET_DX10 instructions These instructions compare two floating point values and return an integer true (-1) or false (0) value. When compiling code generated by the Mesa GLSL frontend, the SET_DX10 instructions save us four instructions for most branch decisions that use floating-point comparisons. llvm-svn: 174609	2013-02-07 14:02:35 +00:00
Tom Stellard	b40ada9b85	R600: Fix assembly name for SETGT_INT llvm-svn: 174607	2013-02-07 14:02:27 +00:00
Reed Kotler	4a230ffa96	Make sure we call externals from libraries properly when -static. For example, when we are doing mips16 hard float or soft float. llvm-svn: 174583	2013-02-07 04:34:51 +00:00
Reed Kotler	ec60f7d335	Enable jumps when in -static mode. llvm-svn: 174580	2013-02-07 03:49:51 +00:00
Akira Hatanaka	556135d813	[mips] Make NOP a pseudo instruction and expand it to "sll $zero, $zero, 0". llvm-svn: 174546	2013-02-06 21:50:15 +00:00
Eli Bendersky	ef4558abd3	This is a follow-up on r174446, now taking Atom processors into account. Atoms use LEA for updating SP in prologs/epilogs, and the exact LEA opcode depends on the data model. Also reapplying the test case which was added and then reverted (because of Atom failures), this time specifying explicitly the CPU in addition to the triple. The test case now checks all variations (data mode, cpu Atom vs. Core). llvm-svn: 174542	2013-02-06 20:43:57 +00:00
Bill Schmidt	ef17c14254	PPC calling convention cleanup. Most of PPCCallingConv.td is used only by the 32-bit SVR4 ABI. Rename things to clarify this. Also delete some code that's been commented out for a long time. llvm-svn: 174526	2013-02-06 17:33:58 +00:00
Tom Stellard	f3b2a1e8b3	R600: Support for indirect addressing v4 Only implemented for R600 so far. SI is missing implementations of a few callbacks used by the Indirect Addressing pass and needs code to handle frame indices. At the moment R600 only supports array sizes of 16 dwords or less. Register packing of vector types is currently disabled, which means that a vec4 is stored in T0_X, T1_X, T2_X, T3_X, rather than T0_XYZW. In order to correctly pack registers in all cases, we will need to implement an analysis pass for R600 that determines the correct vector width for each array. v2: - Add support for i8 zext load from stack. - Coding style fixes v3: - Don't reserve registers for indirect addressing when it isn't being used. - Fix bug caused by LLVM limiting the number of SubRegIndex declarations. v4: - Fix 64-bit defines llvm-svn: 174525	2013-02-06 17:32:29 +00:00
Tim Northover	228d9d3aa2	Implement external weak (ELF) symbols on AArch64 Weakly defined symbols should evaluate to 0 if they're undefined at link-time. This is impossible to do with the usual address generation patterns, so we should use a literal pool entry to materialise the address. llvm-svn: 174518	2013-02-06 16:43:33 +00:00
Tim Northover	a80c4c1a08	Add AArch64 CRC32 instructions These instructions are a late addition to the architecture, and may yet end up behind an optional attribute, but for now they're available at all times. llvm-svn: 174496	2013-02-06 09:13:13 +00:00
Tim Northover	91a51c5a7c	Add icache prefetch operations to AArch64 This adds hints to the various "prfm" instructions so that they can affect the instruction cache as well as the data cache. llvm-svn: 174495	2013-02-06 09:04:56 +00:00
Jim Grosbach	231e7aa460	ARM: Use MCTargetAsmParser::validateTargetOperandClass(). Use the validateTargetOperandClass() hook to match literal '#0' operands in InstAlias definitions. Previously this required per-instruction C++ munging of the operand list, but now it is handled as a natural part of the matcher. Much better. No additional tests are required, as the pre-existing tests for these instructions exercise the new behaviour as being functionally equivalent to the old. llvm-svn: 174488	2013-02-06 06:00:11 +00:00
Eli Bendersky	44a40ca143	Make sure the correct opcodes are used to SUB and ADD the stack pointer in function prologs/epilogs. The opcodes should depend on the data model (LP64 vs. ILP32) rather than the architecture bit-ness. llvm-svn: 174446	2013-02-05 21:53:29 +00:00
Akira Hatanaka	dec25266d7	[mips] Do not use function CC_MipsN_VarArg unless the function being analyzed is a vararg function. The original code was examining flag OutputArg::IsFixed to determine whether CC_MipsN_VarArg or CC_MipsN should be called. This is not correct, since this flag is often set to false when the function being analyzed is a non-variadic function. llvm-svn: 174442	2013-02-05 21:18:11 +00:00
Jyotsna Verma	6031625b03	Hexagon: Use TFR_cond with cmpb.[eq,gt,gtu] to handle zext( set[ne,eq,gt,ugt] (...) ) type of dag patterns. llvm-svn: 174429	2013-02-05 19:20:45 +00:00
Jakob Stoklund Olesen	dbc8c51acb	Move MRI liveouts to AArch64 return instructions. llvm-svn: 174415	2013-02-05 18:21:49 +00:00
Jakob Stoklund Olesen	4af19d0014	Move MRI liveouts to XCore return instructions. llvm-svn: 174414	2013-02-05 18:21:46 +00:00
Jakob Stoklund Olesen	ef8bf3cd1f	Move MRI liveouts to Sparc return instructions. llvm-svn: 174413	2013-02-05 18:16:58 +00:00
Jyotsna Verma	50ca6dd8a7	Hexagon: Use multiclass for absolute addressing mode stores. llvm-svn: 174412	2013-02-05 18:15:34 +00:00
Jakob Stoklund Olesen	b52a3ec10b	Move MRI liveouts to MSP430 return instructions. llvm-svn: 174411	2013-02-05 18:12:06 +00:00
Jakob Stoklund Olesen	a206050ccb	Move MRI liveouts to Mips return instructions. llvm-svn: 174410	2013-02-05 18:12:03 +00:00
Jakob Stoklund Olesen	8660a8c0fc	Move MRI liveouts to PowerPC return instructions. llvm-svn: 174409	2013-02-05 18:12:00 +00:00
Jakob Stoklund Olesen	242546c99d	Move MRI liveouts to MBlaze return instructions. llvm-svn: 174408	2013-02-05 18:08:45 +00:00
Jakob Stoklund Olesen	0af477c3b1	Move MRI liveouts to Hexagon return instructions. llvm-svn: 174407	2013-02-05 18:08:43 +00:00
Jakob Stoklund Olesen	f90fb6e1ff	Move MRI liveouts to ARM return instructions. llvm-svn: 174406	2013-02-05 18:08:40 +00:00
Jakob Stoklund Olesen	dc69f6fbca	Move MRI liveouts to X86 return instructions. llvm-svn: 174402	2013-02-05 17:59:48 +00:00
Jakob Stoklund Olesen	fdc37670f6	Don't use MRI liveouts in R600. Something very strange is going on with the output registers in this target. Its ISelLowering code is inserting dangling CopyToReg nodes, hoping that those physregs won't get clobbered before the RETURN. This patch adds the output registers as implicit uses on RETURN instructions in the custom emission pass. I'd much prefer to have those CopyToReg nodes glued to the RETURNs, but I don't see how. llvm-svn: 174400	2013-02-05 17:53:52 +00:00
Jakob Stoklund Olesen	bf034dbd32	Avoid using MRI::liveout_iterator for computing VRSAVEs. The liveout lists are about to be removed from MRI, this is the only place they were used after register allocation. Get the live out V registers directly from the return instructions instead. llvm-svn: 174399	2013-02-05 17:40:36 +00:00
Tom Stellard	df063e617f	R600: Fold remaining CONST_COPY after expand pseudo inst Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174395	2013-02-05 17:09:16 +00:00
Tom Stellard	41afe6a6fe	R600: improve inputs/interpolation handling Use one intrinsic for all sorts of interpolation. Use two separate unexpanded instructions to represent INTERP_XY and _ZW - this will allow to eliminate one part if it's not used. Track liveness of special interpolation regs instead of reserving them - this will allow to reuse those regs, lowering reg pressure. Patch By: Vadim Girlin v2[Vincent Lejeune]: Rebased against current llvm master Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174394	2013-02-05 17:09:14 +00:00
Tom Stellard	2e5e7a5bef	R600: Emit function name in the AsmPrinter Emitting the function name allows us to check for it in the FileCheck tests so we can make sure FileCheck is checking the output of the correct function. llvm-svn: 174392	2013-02-05 17:09:11 +00:00
Tom Stellard	836cdd97fe	R600/SI: Add patterns for fcos and fsin. Fixes 37 piglit tests and allows e.g. FlightGear to run with radeonsi. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174391	2013-02-05 17:09:10 +00:00
Eli Bendersky	530a3bc5fa	Fix comments llvm-svn: 174390	2013-02-05 16:53:11 +00:00
Jyotsna Verma	6f635b5488	Hexagon: Add V4 compare instructions. Enable relationship mapping for the existing instructions. llvm-svn: 174389	2013-02-05 16:42:24 +00:00
Tim Northover	d03ef4b5a1	Fix signed-unsigned comparison warning. llvm-svn: 174387	2013-02-05 16:40:06 +00:00
Tim Northover	96e4946ac6	Fix remaining StringRef abuse. This should fix the valgrind buildbot failure. llvm-svn: 174375	2013-02-05 15:01:51 +00:00
Arnold Schwaighofer	a804bbee9b	ARM cost model: Cost for scalar integer casts and floating point conversions Also adds some costs for vector integer float conversions. llvm-svn: 174371	2013-02-05 14:05:55 +00:00
Tim Northover	bcaca87d53	Fix formatting in AArch64 backend. This should fix three purely whitespace issues: + 80 column violations. + Tab characters. + TableGen brace placement. No functional changes. llvm-svn: 174370	2013-02-05 13:24:56 +00:00
Tim Northover	969afbec64	Remove cyclic dependency in AArch64 libraries This moves the bit twiddling and string fiddling functions required by other parts of the backend into a separate library. Previously they resided in AArch64Desc, which created a circular dependency between various components. llvm-svn: 174369	2013-02-05 13:24:47 +00:00
Jack Carter	428a06cc75	This patch sets the Mips ELF header flag for MicroMips architectures. Contributor: Zoran Jovanovic llvm-svn: 174360	2013-02-05 09:30:03 +00:00
Jack Carter	9c1a027fe8	This patch sets the EmitAlias flag in td files and enables the instruction printer to print aliased instructions. Due to usage of RegisterOperands a change in common code (utils/TableGen/AsmWriterEmitter.cpp) is required to get the correct register value if it is a RegisterOperand. Contributor: Vladimir Medic llvm-svn: 174358	2013-02-05 08:32:10 +00:00
Jack Carter	10be6aef15	This patch changes a static_cast to dyn_cast for MipsELFStreamer objects. Contributor: Jack Carter llvm-svn: 174354	2013-02-05 07:47:41 +00:00
Jyotsna Verma	7ab68fbd1d	Hexagon: Add V4 combine instructions and some more Def Pats for V2. llvm-svn: 174331	2013-02-04 15:52:56 +00:00
Benjamin Kramer	c35d526489	Disable a couple more vector splat optimizations on PPC. I didn't see those because the test case used "not grep". FileCheck the test and XFAIL it, preserving the old optimization, so this can be fixed eventually. llvm-svn: 174330	2013-02-04 15:52:32 +00:00
Tim Northover	24937c12eb	Fix some abuses of StringRef We were taking a StringRef to a temporary result, which can go horribly wrong. llvm-svn: 174328	2013-02-04 15:44:38 +00:00
Benjamin Kramer	2c9da989c2	X86: Open up some opportunities for constant folding by postponing shift lowering. Fixes PR15141. llvm-svn: 174327	2013-02-04 15:19:33 +00:00
Benjamin Kramer	0611298446	X86: Simplify code. No functionality change. llvm-svn: 174326	2013-02-04 15:19:25 +00:00
Benjamin Kramer	548ffa274a	SelectionDAG: Teach FoldConstantArithmetic how to deal with vectors. This required disabling a PowerPC optimization that did the following: input: x = BUILD_VECTOR <i32 16, i32 16, i32 16, i32 16> lowered to: tmp = BUILD_VECTOR <i32 8, i32 8, i32 8, i32 8> x = ADD tmp, tmp The add now gets folded immediately and we're back at the BUILD_VECTOR we started from. I don't see a way to fix this currently so I left it disabled for now. Fix some trivially foldable X86 tests too. llvm-svn: 174325	2013-02-04 15:19:18 +00:00
Tim Northover	aefc30f2a4	Give explicit suffix to integer constant over 32-bits. llvm-svn: 174324	2013-02-04 14:14:58 +00:00
Evgeniy Stepanov	1f5a71492d	More MSan/ASan annotations. This change lets us bootstrap LLVM/Clang under ASan and MSan. It contains fixes for 2 issues: - X86JIT reads return address from stack, which MSan does not know is initialized. - bugpoint tests run binaries with RLIMIT_AS. This does not work with certain Sanitizers. We are no longer including config.h in Compiler.h with this change. llvm-svn: 174306	2013-02-04 07:03:24 +00:00
Arnold Schwaighofer	98f1012f9b	ARM cost model: Penalize insertelement into D subregisters Swift has a renaming dependency if we load into D subregisters. We don't have a way of distinguishing between insertelement operations of values from loads and other values. Therefore, we are pessimistic for now (The performance problem showed up in example 14 of gcc-loops). radar://13096933 llvm-svn: 174300	2013-02-04 02:52:05 +00:00
NAKAMURA Takumi	80159432de	PPCDarwinAsmPrinter::EmitStartOfAsmFile(): Add checking range in CPUDirectives[]. llvm-svn: 174298	2013-02-04 00:47:38 +00:00
NAKAMURA Takumi	3d591ae0b9	PPCDarwinAsmPrinter::EmitStartOfAsmFile(): Add possible elements in CPUDirectives[]. llvm-svn: 174297	2013-02-04 00:47:33 +00:00
Reed Kotler	f8933f83f0	Start static relocation implementation for mips16. This checkin makes hello world work. llvm-svn: 174264	2013-02-02 04:07:35 +00:00
Bill Schmidt	cc99a2f61d	Add notes about future PowerPC features llvm-svn: 174232	2013-02-01 23:10:09 +00:00
Bill Schmidt	52742c25ae	LLVM enablement for some older PowerPC CPUs llvm-svn: 174230	2013-02-01 22:59:51 +00:00
David Sehr	8114a7a651	Two changes relevant to LEA and x32: 1) allows the use of RIP-relative addressing in 32-bit LEA instructions under x86-64 (ILP32 and LP64) 2) separates the size of address registers in 64-bit LEA instructions from control by ILP32/LP64. llvm-svn: 174208	2013-02-01 19:28:09 +00:00
Jyotsna Verma	2ceafa6684	Replace LDriu*[bhdw]_indexed_V4 instructions with "def Pats". llvm-svn: 174193	2013-02-01 16:36:16 +00:00
Jyotsna Verma	d6eda1c227	Add appropriate TSFlags to the instructions that must be always extended. llvm-svn: 174186	2013-02-01 15:54:43 +00:00
Tim Northover	111b6cb37b	Remove currently unused register decoder from AArch64. This should fix a warning when building this backend. llvm-svn: 174177	2013-02-01 14:55:05 +00:00
Chandler Carruth	e5d8d0d64b	Switch the code added in r173885 to use the new, shiny RTTI infrastructure on MCStreamer to test for whether there is an MCELFStreamer object available. This is just a cleanup on the AsmPrinter side of things, moving ad-hoc tests of random APIs to a direct type query. But the AsmParser was completely broken. There were no tests, it just blindly cast its streamer to an MCELFStreamer and started manipulating it. I don't have a test case -- this actually failed on LLVM's own regression test suite. Unfortunately the failure only appears when the stars, compilers, and runtime align to misbehave when we read a pointer to a formatted_raw_ostream as-if it were an MCAssembler. =/ UBSan would catch this immediately. Many thanks to Matt for doing about 80% of the debugging work here in GDB, Jim for helping to explain how exactly to fix this, and others for putting up with the hair pulling that ensued during debugging it. llvm-svn: 174118	2013-01-31 23:43:14 +00:00
Chandler Carruth	de093ef8d6	Give the MCStreamer class hierarchy LLVM RTTI facilities for use with isa<> and dyn_cast<>. In several places, code is already hacking around the absence of this, and there seem to be several interfaces that might be lifted and/or devirtualized using this. This change was based on a discussion with Jim Grosbach about how best to handle testing for specific MCStreamer subclasses. He said that this was the correct end state, and everything else was too hacky so I decided to just make it so. No functionality should be changed here, this is just threading the kind through all the constructors and setting up the classof overloads. llvm-svn: 174113	2013-01-31 23:29:57 +00:00
NAKAMURA Takumi	e1137a2058	Update AMDGPURegisterInfo::eliminateFrameIndex() corresponding to r174083. llvm-svn: 174106	2013-01-31 22:55:51 +00:00
Tom Stellard	4926921bd4	R600: Fold clamp, neg, abs Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174099	2013-01-31 22:11:54 +00:00
Tom Stellard	dd04c83a4d	R600: Consider bitcast when folding const_address node. Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174098	2013-01-31 22:11:53 +00:00
Tom Stellard	af1bce7d1d	R600: Make store_dummy intrinsic more general by passing export type Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174097	2013-01-31 22:11:46 +00:00
Chad Rosier	3145deac19	Remove unused variable, which should have been removed with r174083. llvm-svn: 174094	2013-01-31 21:23:44 +00:00
Tim Northover	c6d39314b2	Update AArch64 backend to changed eliminateFrameIndex interface. llvm-svn: 174086	2013-01-31 20:46:53 +00:00
Chad Rosier	df782d2225	[PEI] Pass the frame index operand number to the eliminateFrameIndex function. Each target implementation was needlessly recomputing the index. Part of rdar://13076458 llvm-svn: 174083	2013-01-31 20:02:54 +00:00
Tim Northover	e0e3aefdd3	Add AArch64 as an experimental target. This patch adds support for AArch64 (ARM's 64-bit architecture) to LLVM in the "experimental" category. Currently, it won't be built unless requested explicitly. This initial commit should have support for: + Assembly of all scalar (i.e. non-NEON, non-Crypto) instructions (except the late addition CRC instructions). + CodeGen features required for C++03 and C99. + Compilation for the "small" memory model: code+static data < 4GB. + Absolute and position-independent code. + GNU-style (i.e. "__thread") TLS. + Debugging information. The principal omission, currently, is performance tuning. This patch excludes the NEON support also reviewed due to an outbreak of batshit insanity in our legal department. That will be committed soon bringing the changes to precisely what has been approved. Further reviews would be gratefully received. llvm-svn: 174054	2013-01-31 12:12:40 +00:00
Eric Christopher	258c867c0b	Whitespace. llvm-svn: 174009	2013-01-31 00:50:48 +00:00
Eric Christopher	4e3e94c13d	Check and allow floating point registers to select the size of the register for inline asm. This conforms to how gcc allows for effective casting of inputs into gprs (fprs is already handled). llvm-svn: 174008	2013-01-31 00:50:46 +00:00
Hal Finkel	e1df90958d	PPC QPX requires a 32-byte aligned stack On systems which support the QPX vector instructions, the stack must be 32-byte aligned. llvm-svn: 173993	2013-01-30 23:43:27 +00:00
Evan Cheng	d2ca4e2ed9	Restrict sin/cos optimization to 64-bit only for now. 32-bit is a bit messy and less critical. llvm-svn: 173987	2013-01-30 22:56:35 +00:00
Hal Finkel	b3fc509b23	Initialize hasQPX in PPCSubtarget This should have gone in with r173973. llvm-svn: 173984	2013-01-30 22:43:44 +00:00
Hal Finkel	efb305e54c	Add definitions for the PPC a2q core marked as having QPX available This is the first commit of a large series which will add support for the QPX vector instruction set to the PowerPC backend. This instruction set is used on the IBM Blue Gene/Q supercomputers. llvm-svn: 173973	2013-01-30 21:17:42 +00:00
Eli Bendersky	2e2ce49e59	Add a special ARM trap encoding for NaCl. More details in this thread: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130128/163783.html Patch by JF Bastien llvm-svn: 173943	2013-01-30 16:30:19 +00:00
Logan Chien	a436e4c7e4	Add missing header and test cases for r173939. llvm-svn: 173941	2013-01-30 15:48:50 +00:00
Logan Chien	2bcc42c730	Override virtual function for ARM EH directives. llvm-svn: 173939	2013-01-30 15:39:04 +00:00
David Blaikie	24f44ac53a	Removing initializer for the field removed in r173887 llvm-svn: 173888	2013-01-30 03:04:07 +00:00
David Blaikie	b7fa813373	Remove unused variable (introduced in r173884) to clear clang -Werror build llvm-svn: 173887	2013-01-30 02:56:02 +00:00
Jack Carter	b01a90ce40	Forgot to add new file to CMakeLists llvm-svn: 173886	2013-01-30 02:32:36 +00:00
Jack Carter	718da0b53b	This patch implements runtime ARM specific setting of ELF header e_flags. Contributor: Jack Carter llvm-svn: 173885	2013-01-30 02:24:33 +00:00
Jack Carter	7f378104b6	This patch implements runtime Mips specific setting of ELF header e_flags. Contributor: Jack Carter llvm-svn: 173884	2013-01-30 02:16:36 +00:00
Jack Carter	1bd90ff6cc	This patch reworks how llvm targets set and update ELF header e_flags. Currently gathering information such as symbol, section and data is done by collecting it in an MCAssembler object. From MCAssembler and MCAsmLayout objects ELFObjectWriter::WriteObject() forms and streams out the ELF object file. This patch just adds a few members to the MCAssembler class to store and access the e_flag settings. It allows for runtime additions to the e_flag by assembler directives. The standalone assembler can get to MCAssembler from getParser().getStreamer().getAssembler(). This patch is the generic infrastructure and will be followed by patches for ARM and Mips for their target specific use. Contributor: Jack Carter llvm-svn: 173882	2013-01-30 02:09:52 +00:00
Akira Hatanaka	c0b020690b	[mips] Lower EH_RETURN. Patch by Sasa Stankovic. llvm-svn: 173862	2013-01-30 00:26:49 +00:00
Renato Golin	5e9d55eca0	Adding simple cast cost to ARM Changing ARMBaseTargetMachine to return ARMTargetLowering instead of the generic one (similar to x86 code). Tests showing which instructions were added to cast when necessary or cost zero when not. Downcast to 16 bits are not lowered in NEON, so costs are not there yet. llvm-svn: 173849	2013-01-29 23:31:38 +00:00
Jyotsna Verma	b16a9cb132	Use multiclass for post-increment store instructions. llvm-svn: 173816	2013-01-29 18:42:41 +00:00
Jyotsna Verma	a609b1c89d	Add constant extender support for MInst type instructions. llvm-svn: 173813	2013-01-29 18:18:50 +00:00
Evan Cheng	27e41c9f70	Remove dead code. llvm-svn: 173812	2013-01-29 18:08:22 +00:00
NAKAMURA Takumi	978b5a0e02	R600/AMDILPeepholeOptimizer.cpp: Tweak std::make_pair to satisfy C++11. llvm-svn: 173807	2013-01-29 16:31:56 +00:00
Hans Wennborg	5deecd9043	Fix typo in X86BaseInfo.h that I introduced in r157818. llvm-svn: 173798	2013-01-29 14:05:57 +00:00
Tim Northover	a0edd3ee66	Fix 64-bit atomic operations in Thumb mode. The ARM and Thumb variants of LDREXD and STREXD have different constraints and take different operands. Previously the code expanding atomic operations didn't take this into account and asserted in Thumb mode. llvm-svn: 173780	2013-01-29 09:06:13 +00:00
Craig Topper	c048154b9b	Merge SSE and AVX shuffle instructions in the comment printer. llvm-svn: 173777	2013-01-29 07:54:31 +00:00
Evan Cheng	0e88c7d897	Teach SDISel to combine fsin / fcos into a fsincos node if the following conditions are met: 1. They share the same operand and are in the same BB. 2. Both outputs are used. 3. The target has a native instruction that maps to ISD::FSINCOS node or the target provides a sincos library call. Implemented the generic optimization in sdisel and enabled it for Mac OSX. Also added an additional optimization for x86_64 Mac OSX by using an alternative entry point __sincos_stret which returns the two results in xmm0 / xmm1. rdar://13087969 PR13204 llvm-svn: 173755	2013-01-29 02:32:37 +00:00
Hal Finkel	7f9e8d3eaa	Add isBGQ method to PPCSubtarget This function will be used in future commits. llvm-svn: 173729	2013-01-29 00:22:47 +00:00
Craig Topper	5c683972bc	Fix 256-bit PALIGNR comment decoding to understand that it works on independent 256-bit lanes. llvm-svn: 173674	2013-01-28 07:41:18 +00:00
Craig Topper	71d99ffe4a	Add missing break in 256-bit palignr comment printing. No test case yet because the comment itself is still wrong. llvm-svn: 173669	2013-01-28 07:19:11 +00:00
Craig Topper	8fb09f0abb	Fix inconsistent usage of PALIGN and PALIGNR when referring to the same instruction. llvm-svn: 173667	2013-01-28 06:48:25 +00:00
Craig Topper	b3ede5e3b1	Remove addToNoHelperNeeded function that was left unused after r173649. Fixes a -Wunused warning. llvm-svn: 173664	2013-01-28 06:09:24 +00:00
Reed Kotler	97f8e2fa8f	Make some code a little simpler. llvm-svn: 173649	2013-01-28 02:46:49 +00:00
Richard Osborne	038d24f90c	[XCore] Add missing l2rus instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173634	2013-01-27 22:28:30 +00:00
Richard Osborne	f2ecd40929	[XCore] Add missing l2r instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173629	2013-01-27 21:26:02 +00:00
Richard Osborne	7fe8f63544	[XCore] Add missing 1r instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173624	2013-01-27 20:46:21 +00:00
Richard Osborne	8f56317287	[XCore] Add missing 0r instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173623	2013-01-27 20:42:57 +00:00
Bill Wendling	cc1fc9465b	Convert the CPP backend to use the AttributeSet instead of AttributeWithIndex. Further removal of the introspective AttributeWithIndex thing. Also fix the #includes. llvm-svn: 173599	2013-01-27 01:22:51 +00:00
Benjamin Kramer	6a93596538	X86: Decode PALIGN operands so I don't have to do it in my head. llvm-svn: 173572	2013-01-26 13:31:37 +00:00
Benjamin Kramer	99c68dd964	X86: Do splat promotion later, so the optimizer can chew on it first. This catches many cases where we can emit a more efficient shuffle for a specific mask or when the mask contains undefs. Once the splat is lowered to unpacks we can't do that anymore. There is a possibility of moving the promotion after pshufb matching, but I'm not sure if pshufb with a mask loaded from memory is faster than 3 shuffles, so I avoided that for now. llvm-svn: 173569	2013-01-26 11:44:21 +00:00
Reed Kotler	233cee2b5b	fix use of std::std. it's ordered set. llvm-svn: 173563	2013-01-26 06:58:35 +00:00
Dmitri Gribenko	c451bdf9ff	Remove unused variables, silences -Wunused-variable llvm-svn: 173526	2013-01-25 23:17:21 +00:00
Bill Wendling	57625a4966	Remove some introspection functions. The 'getSlot' function and its ilk allow introspection into the AttributeSet class. However, that class should be opaque. Allow access through accessor methods instead. llvm-svn: 173522	2013-01-25 23:09:36 +00:00
Hal Finkel	4e5ca9e578	Initial implementation of PPCTargetTransformInfo This provides a place to add customized operation cost information and control some other target-specific IR-level transformations. The only non-trivial logic in this checkin assigns a higher cost to unaligned loads and stores (covered by the included test case). llvm-svn: 173520	2013-01-25 23:05:59 +00:00
Eli Bendersky	597fc1233a	In this patch, we teach X86_64TargetMachine that it has a ILP32 (defined by the x32 ABI) mode, in which case its pointers are 32-bits in size. This knowledge is also added to X86RegisterInfo that now returns the appropriate registers in getPointerRegClass. There are many outcomes to this change. In order to keep the patches separate and manageable, we start by focusing on some simple testable cases. The patch adds a test with passing a pointer to a function - focusing on the difference between the two data models for x86-64. Another test is added for handling of 'sret' arguments (and functionality is added in X86ISelLowering to make it work). A note on naming: the "x32 ABI" document refers to the AMD64 architecture (in LLVM it's distinguished by being is64Bits() in the x86 subtarget) with two variations: the LP64 (default) data model, and the ILP32 data model. This patch adds predicates to the subtarget which are consistent with this naming scheme. llvm-svn: 173503	2013-01-25 22:07:43 +00:00
Richard Osborne	6b86eec819	Add instruction encodings / disassembly support for l4r instructions. llvm-svn: 173501	2013-01-25 21:55:32 +00:00
Bill Wendling	8649283e75	Use the new 'getSlotIndex' method to retrieve the attribute's slot index. llvm-svn: 173499	2013-01-25 21:46:52 +00:00
Richard Osborne	a520a7dcf3	Use the correct format in the STW / SETPSC instruction names. llvm-svn: 173494	2013-01-25 21:25:12 +00:00
Richard Osborne	9a228a13c6	Fix order of operands for crc8_l4r The order in which operands appear in the encoded instruction is different to the order in which they appear in assembly. This changes the XCore backend to use the instruction encoding order. llvm-svn: 173493	2013-01-25 21:20:28 +00:00
Richard Osborne	a19fa86a70	Add instruction encodings / disassembly support for l5r instructions. llvm-svn: 173479	2013-01-25 20:20:07 +00:00
Richard Osborne	8ae02d3cef	Fix order of operands for l5r instructions. With this change the operands order matches the order in which the operands are encoded in the instruction. llvm-svn: 173477	2013-01-25 20:16:00 +00:00
Richard Osborne	ea023fcde1	Use correct mnemonic / instruction name for ldivu. llvm-svn: 173476	2013-01-25 20:11:26 +00:00
Hal Finkel	53f4ba6ce3	More cleanup of PPC register definitions. Uses the new !add TableGen operator to do more cleanup of the PPC register definitions. llvm-svn: 173446	2013-01-25 14:49:10 +00:00
Silviu Baranga	3eb45a03af	Fixed the condition codes for the atomic64 min/umin code generation on ARM. If the subtraction of the higher 32 bit parts gives a 0 result, we need to do the store operation. llvm-svn: 173437	2013-01-25 10:39:49 +00:00
Andrew Trick	e2c3f5c982	MIsched: Improve the interface to SchedDFS analysis (subtrees). Allow the strategy to select SchedDFS. Allow the results of SchedDFS to affect initialization of the scheduler state. llvm-svn: 173425	2013-01-25 06:33:57 +00:00
Jack Carter	07c818d2da	This patch implements parsing the .word directive for the Mips assembler. Contributor: Vladimir Medic llvm-svn: 173407	2013-01-25 01:31:34 +00:00
Akira Hatanaka	28aed9ca85	[mips] Set neverHasSideEffects flag on some of the floating point instructions. llvm-svn: 173401	2013-01-25 00:20:39 +00:00
Renato Golin	d4c392e6ff	Moving Cost Tables up to share with other targets llvm-svn: 173382	2013-01-24 23:01:00 +00:00
Hal Finkel	41176f43c4	Start cleanup of PPC register definitions using foreach loops. No functionality change intended. This captures the first two cases GPR32/64. For the others, we need an addition operator (if we have one, I've not yet found it). Based on a suggestion made by Tom Stellard in the AArch64 review! llvm-svn: 173366	2013-01-24 20:43:18 +00:00
NAKAMURA Takumi	bf8f207519	MipsISelLowering.cpp: Fill unreachable paths to fix warnings. [-Wsometimes-uninitialized] FIXME: Could they, unreachable(s), be removed? FIXME: I could prefer the coding standards... llvm-svn: 173325	2013-01-24 06:08:06 +00:00
NAKAMURA Takumi	f25b7c6816	MipsISelLowering.cpp: Fix a warning, take two. [-Wunused-variable] ...and fix a typo, s/#ifdef/#ifndef/ llvm-svn: 173324	2013-01-24 05:54:23 +00:00
NAKAMURA Takumi	c77d028bfb	MipsISelLowering.cpp: Fix a warning. [-Wunused-variable] llvm-svn: 173323	2013-01-24 05:47:29 +00:00
Reed Kotler	a2d76bce1f	The next phase of Mips16 hard float implementation. Allow Mips16 routines to call Mips32 routines that have abi requirements that either arguments or return values are passed in floating point registers. This handles only the pic case. We have not done non pic for Mips16 yet in any form. The libm functions are Mips32, so with this addition we have a complete Mips16 hard float implementation. We still are not able to completely mix Mips16 and Mips32 with hard float. That will be the next phase which will have several steps. For Mips32 to freely call Mips16 some stub functions must be created. llvm-svn: 173320	2013-01-24 04:24:02 +00:00
Tom Stellard	6f1b8657f9	R600: Add a llvm.R600.store.swizzle intrinsics This intrinsic is translated to ALLOC_EXPORT_WORD1_SWIZ, hence its name. It is used to store vs/fs outputs Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173297	2013-01-23 21:39:49 +00:00
Tom Stellard	d8ac91d436	R600: Simplify stream outputs intrinsic Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173296	2013-01-23 21:39:47 +00:00
Richard Osborne	54e311821f	Add instruction encodings / disassembly support for l6r instructions. llvm-svn: 173288	2013-01-23 20:08:11 +00:00
Eli Bendersky	f759526983	Fix powerpc test failure - forgot to initialize stack slot size for PPCLinuxMCAsmInfo llvm-svn: 173275	2013-01-23 17:12:15 +00:00
Eli Bendersky	32aab2216d	Clean up assignment of CalleeSaveStackSlotSize: get rid of the default and explicitly set this in every target that needs to change it from the default. llvm-svn: 173270	2013-01-23 16:22:04 +00:00
Benjamin Kramer	c4231cc9b3	NVPTX: Stop leaking memory by using a managed constant instead of a new Argument. This is still an egregious hack since we don't have a nice interface for this kind of thing but should help the valgrind leak check buildbot to become green. llvm-svn: 173267	2013-01-23 15:21:44 +00:00
Bill Wendling	d154e283f2	Add the IR attribute 'sspstrong'. SSPStrong applies a heuristic to insert stack protectors in these situations: * A Protector is required for functions which contain an array, regardless of type or length. * A Protector is required for functions which contain a structure/union which contains an array, regardless of type or length. Note, there is no limit to the depth of nesting. * A protector is required when the address of a local variable (i.e., stack based variable) is exposed. (E.g., such as through a local whose address is taken as part of the RHS of an assignment or a local whose address is taken as part of a function argument.) This patch implements the SSPStrong attribute to be equivalent to SSPRequired. This will change in a subsequent patch. llvm-svn: 173230	2013-01-23 06:41:41 +00:00
Tom Stellard	365366f9ef	R600: rework handling of the constants Remove Cxxx registers, add new special register - "ALU_CONST" and new operand for each alu src - "sel". ALU_CONST is used to designate that the new operand contains the value to override src.sel, src.kc_bank, src.chan for constants in the driver. Patch by: Vadim Girlin Vincent Lejeune: - Use pointers for constants - Fold CONST_ADDRESS when possible Tom Stellard: - Give CONSTANT_BUFFER_0 its own address space - Use integer types for constant loads Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173222	2013-01-23 02:09:06 +00:00
Tom Stellard	ff62c35da0	R600: Add a CONST_ADDRESS node to model constant buf read Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173221	2013-01-23 02:09:03 +00:00
Tom Stellard	ab28e9a30a	R600: Factorise VTX_WORD0 and VTX_WORD1 in tblgen def Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173220	2013-01-23 02:09:01 +00:00
Richard Osborne	1a06479f46	Add instruction encodings / disassembly support for u10 / lu10 instructions. llvm-svn: 173204	2013-01-22 22:55:04 +00:00
Michael Liao	3dffc5e2b7	Fix an issue of pseudo atomic instruction DAG schedule - Add list of physical registers clobbered in pseudo atomic insts Physical registers are clobbered when pseudo atomic instructions are expanded. Add them in clobber list to prevent DAG scheduler to mis-schedule them after these insns are declared side-effect free. - Add test case from Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 173200	2013-01-22 21:47:38 +00:00
Akira Hatanaka	88c0ec826c	[mips] Implement MipsRegisterInfo::getRegPressureLimit. llvm-svn: 173197	2013-01-22 21:34:25 +00:00
Akira Hatanaka	f7d16d0563	[mips] Clean up code in MipsTargetLowering::LowerCall. No functional change intended llvm-svn: 173189	2013-01-22 20:05:56 +00:00
Benjamin Kramer	fee7d21ae7	X86: Make sure we account for the FMA4 register immediate value, otherwise rip-rel relocations will be off by one byte. PR15040. llvm-svn: 173176	2013-01-22 18:05:59 +00:00
Eli Bendersky	0893e1079d	Initial patch for x32 ABI support. Add the x32 environment kind to the triple, and separate the concept of pointer size and callee save stack slot size, since they're not equal on x32. llvm-svn: 173175	2013-01-22 18:02:49 +00:00
Tim Northover	29178a348a	Make APFloat constructor require explicit semantics. Previously we tried to infer it from the bit width size, with an added IsIEEE argument for the PPC/IEEE 128-bit case, which had a default value. This default value allowed bugs to creep in, where it was inappropriate. llvm-svn: 173138	2013-01-22 09:46:31 +00:00
Richard Osborne	5d477751df	Fix some incorrectly named u10 / lu10 instructions. llvm-svn: 173090	2013-01-21 21:12:30 +00:00
Richard Osborne	38cff3ea7f	Remove unused multiclass. llvm-svn: 173087	2013-01-21 20:50:54 +00:00
Richard Osborne	9d3ec06ef8	Add instruction encodings / disassembly support for u6 / lu6 instructions. llvm-svn: 173086	2013-01-21 20:44:17 +00:00
Richard Osborne	6e58c6d86d	Add instruction encoding / disassembly support for ru6 / lru6 instructions. llvm-svn: 173085	2013-01-21 20:42:16 +00:00
Richard Osborne	0d68e21ca7	Use correct format for the LDAWCP instruction (u6). llvm-svn: 173083	2013-01-21 20:32:54 +00:00
Tom Stellard	c9b903138d	R600/SI: Use unnormalized coordinates for sampling with the RECT target. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 173053	2013-01-21 15:40:48 +00:00
Tom Stellard	14421a793f	R600/SI: Take target parameter for sample intrinsics. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 173052	2013-01-21 15:40:47 +00:00
Tom Stellard	74dda0da31	R600/SI: Derive all sample intrinsics from a single class. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 173051	2013-01-21 15:40:46 +00:00
NAKAMURA Takumi	c96fb1bd36	R600/SILowerControlFlow.cpp: Fix a warning. [-Wunused-variable] llvm-svn: 173040	2013-01-21 14:06:48 +00:00
Craig Topper	66163a35ee	Use <0 checks in place of ==-1 because it results in simpler code. llvm-svn: 173010	2013-01-21 07:25:16 +00:00
Craig Topper	9b29486f42	Use MVT instead of EVT in LowerVECTOR_SHUFFLEtoBlend. llvm-svn: 173009	2013-01-21 07:19:54 +00:00
Craig Topper	32c5406dcf	Remove trailing whitespace. llvm-svn: 173008	2013-01-21 06:57:59 +00:00
Craig Topper	5c84c25bf4	Fix some 80 column violations. llvm-svn: 173006	2013-01-21 06:21:54 +00:00
Craig Topper	2cd375896a	Make helper method static. llvm-svn: 173005	2013-01-21 06:13:28 +00:00
Craig Topper	cf93977920	Convert more EVT's to MVT's in the lowering methods. llvm-svn: 172995	2013-01-20 21:50:27 +00:00
Craig Topper	e65a08be64	Capitalize lowerTRUNCATE so that it matches the other lower functions in this file despite it not matching coding standards. llvm-svn: 172994	2013-01-20 21:34:37 +00:00
Renato Golin	e1fb059327	Revert CostTable algorithm, will re-write llvm-svn: 172992	2013-01-20 20:57:20 +00:00
Richard Osborne	4e69724869	Add instruction encodings / disassembly support for l2rus instructions. llvm-svn: 172987	2013-01-20 18:51:15 +00:00
Richard Osborne	9fbf57b26c	Add instruction encodings / disassembly support for l3r instructions. llvm-svn: 172986	2013-01-20 18:37:49 +00:00
Richard Osborne	f063fcee7a	Add instruction encodings / disassembler support for 2rus instructions. llvm-svn: 172985	2013-01-20 17:22:43 +00:00
Richard Osborne	3fb7395233	Add instruction encodings / disassembly support for 3r instructions. It is not possible to distinguish 3r instructions from 2r / rus instructions using only the fixed bits. Therefore if an instruction doesn't match the 2r / rus format try to decode it as a 3r instruction before returning Fail. llvm-svn: 172984	2013-01-20 17:18:47 +00:00
Craig Topper	ce61fdf0a3	Make LowerVSETCC a static function and use MVT instead of EVT. llvm-svn: 172969	2013-01-20 09:02:22 +00:00
Nadav Rotem	9450fcfff1	Revert 172708. The optimization handles esoteric cases but adds a lot of complexity both to the X86 backend and to other backends. This optimization disables an important canonicalization of chains of SEXT nodes and makes SEXT and ZEXT asymmetrical. Disabling the canonicalization of consecutive SEXT nodes into a single node disables other DAG optimizations that assume that there is only one SEXT node. The AVX mask optimizations is one example. Additionally this optimization does not update the cost model. llvm-svn: 172968	2013-01-20 08:35:56 +00:00
Craig Topper	9976974cc6	Make some helper methods static. llvm-svn: 172936	2013-01-20 00:50:58 +00:00
Craig Topper	4ac87da529	Remove DebugLoc argument from static function. It can easily be obtained from the SVOp passed in. llvm-svn: 172935	2013-01-20 00:43:42 +00:00
Craig Topper	3da6507c41	Use MVT instead of EVT in more instruction lowering code. llvm-svn: 172933	2013-01-20 00:38:18 +00:00
Craig Topper	53c7fbabbf	Use MVT instead of EVT in more of the shuffle lowering code. llvm-svn: 172930	2013-01-19 23:36:09 +00:00
Craig Topper	bb772d27a7	Capitalize LowerVectorIntExtend to be consistent with all the other lower functions in this file. llvm-svn: 172927	2013-01-19 23:14:09 +00:00
Nadav Rotem	7b3120b9ae	On Sandybridge split unaligned 256bit stores into two xmm-sized stores. llvm-svn: 172894	2013-01-19 08:38:41 +00:00
Craig Topper	84b01120bc	Use MVT instead of EVT when computing shuffle immediates since they can only be for legal types. Keeps compiler from generating unneeded checks and handling for extended types. llvm-svn: 172893	2013-01-19 08:27:45 +00:00
Chandler Carruth	1fe21fc0b5	Sort all of the includes. Several files got checked in with mis-sorted includes. llvm-svn: 172891	2013-01-19 08:03:47 +00:00
Jack Carter	7ab15fafe3	This is a resubmittal. For some reason it broke the bots yesterday but I cannot reproduce the problem and have scrubbed my sources and even tested with llvm-lit -v --vg. Formatting fixes. Mostly long lines and blank spaces at end of lines. Contributor: Jack Carter llvm-svn: 172882	2013-01-19 02:00:40 +00:00
Nadav Rotem	7431211214	On Sandybridge loading unaligned 256bits using two XMM loads (vmovups and vinsertf128) is faster than using a single vmovups instruction. llvm-svn: 172868	2013-01-18 23:10:30 +00:00
Jack Carter	c1b17ed2e1	This is a resubmittal. For some reason it broke the bots yesterday but I cannot reproduce the problem and have scrubbed my sources and even tested with llvm-lit -v --vg. Support for Mips register information sections. Mips ELF object files have a section that is dedicated to register use info. Some of this information such as the assumed Global Pointer value is used by the linker in relocation resolution. The register info file is .reginfo in o32 and .MIPS.options in 64 and n32 abi files. This patch contains the changes needed to create the sections, but leaves the actual register accounting for a future patch. Contributor: Jack Carter llvm-svn: 172847	2013-01-18 21:20:38 +00:00
Tom Stellard	c4cabef782	R600: Proper insert S_WAITCNT instructions Some instructions like memory reads/writes are executed asynchronously, so we need to insert S_WAITCNT instructions to block before accessing their results. Previously we have just inserted S_WAITCNT instructions after each async instruction, this patch fixes this and adds a proper insertion pass. Patch by: Christian König Tested-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 172846	2013-01-18 21:15:53 +00:00
Tom Stellard	be8ebeebf7	R600: Optimize and cleanup KILL on SI We shouldn't insert KILL optimization if we don't have a kill instruction at all. Patch by: Christian König Tested-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 172845	2013-01-18 21:15:50 +00:00
Jack Carter	86c2c564ff	This is a resubmittal. For some reason it broke the bots yesterday but I cannot reproduce the problem and have scrubbed my sources and even tested with llvm-lit -v --vg. Removal of redundant code and formatting fixes. Contributors: Jack Carter/Vladimir Medic llvm-svn: 172842	2013-01-18 20:15:06 +00:00
Craig Topper	1cb8aa581b	Calculate vector element size more directly for VINSERTF128/VEXTRACTF128 immediate handling. Also use MVT since this only called on legal types during pattern matching. llvm-svn: 172797	2013-01-18 08:41:28 +00:00
Craig Topper	e938138daf	Minor formatting fix. No functional change. llvm-svn: 172795	2013-01-18 07:27:20 +00:00
Craig Topper	908f7d14b5	Spelling fix: extened->extended. Trailing whitespace in same function. llvm-svn: 172793	2013-01-18 06:50:59 +00:00
Craig Topper	01fcf2e2f2	Make more use of is128BitVector/is256BitVector in place of getSizeInBits() == 128/256. llvm-svn: 172792	2013-01-18 06:44:29 +00:00
Chad Rosier	1e8f053bd1	[ms-inline asm] Make the error message more generic now that we support the 'SIZE' and 'LENGTH' operators. llvm-svn: 172773	2013-01-18 00:50:59 +00:00
Bill Schmidt	dee1ef8f53	This patch fixes PR13626 by providing i128 support in the return calling convention. 128-bit integers are now properly returned in GPR3 and GPR4 on PowerPC. llvm-svn: 172745	2013-01-17 19:34:57 +00:00
Chad Rosier	d0ed73acb4	[ms-inline asm] Add support for the 'SIZE' and 'LENGTH' operators. Part of rdar://12576868 llvm-svn: 172743	2013-01-17 19:21:48 +00:00
Jyotsna Verma	9b60c1d171	Add indexed load/store instructions for offset validation check. This patch fixes bug 14902 - http://llvm.org/bugs/show_bug.cgi?id=14902 llvm-svn: 172737	2013-01-17 18:42:37 +00:00
Bill Schmidt	6b2940b01e	This patch fixes the PPC calling convention to handle returns of _Complex float and _Complex long double, by simply increasing the number of floating point registers available for return values. The test case verifies that the correct registers are loaded. llvm-svn: 172733	2013-01-17 17:45:19 +00:00
Elena Demikhovsky	f6a30e05d5	Optimization for the following SIGN_EXTEND pairs: v8i8 -> v8i64, v8i8 -> v8i32, v4i8 -> v4i64, v4i16 -> v4i64 for AVX and AVX2. Bug 14865. llvm-svn: 172708	2013-01-17 09:59:53 +00:00
Craig Topper	c7e6feee42	Combine AVX and SSE forms of MOVSS and MOVSD into the same multiclasses so they get instantiated together. llvm-svn: 172704	2013-01-17 06:59:42 +00:00
Jakob Stoklund Olesen	213a2f8b3f	Provide a place for targets to insert ILP optimization passes. Move the early if-conversion pass into this group. ILP optimizations usually need to find the right balance between register pressure and ILP using the MachineTraceMetrics analysis to identify critical paths and estimate other costs. Such passes should run together so they can share dominator tree and loop info analyses. Besides if-conversion, future passes to run here could include expression height reduction and ARM's MLxExpansion pass. llvm-svn: 172687	2013-01-17 00:58:38 +00:00
Jack Carter	2a74a87b71	This is a resubmittal. For some reason it broke the bots yesterday but I cannot reproduce the problem and have scrubbed my sources and even tested with llvm-lit -v --vg. The Mips RDHWR (Read Hardware Register) instruction was not tested for assembler or disassembler consumption. This patch adds that functionality. Contributor: Vladimir Medic llvm-svn: 172685	2013-01-17 00:28:20 +00:00
Renato Golin	f104c4c4ca	Change CostTable model to be global to all targets Moving the X86CostTable to a common place, so that other back-ends can share the code. Also simplifying it a bit and commoning up tables with one and two types on operations. llvm-svn: 172658	2013-01-16 21:29:55 +00:00
Jack Carter	5619f91bf7	reverting 172579 llvm-svn: 172594	2013-01-16 01:29:10 +00:00
Jack Carter	e0c1e1a47e	Akira, Hope you are feeling better. The Mips RDHWR (Read Hardware Register) instruction was not tested for assembler or disassembler consumption. This patch adds that functionality. Contributor: Vladimir Medic llvm-svn: 172579	2013-01-16 00:07:45 +00:00
Jack Carter	f238510c43	This patch fixes a Mips specific bug where we need to generate a N64 compound relocation R_MIPS_GPREL_32/R_MIPS_64/R_MIPS_NONE. The bug was exposed by the SingleSource test case DuffsDevice.c. Contributor: Jack Carter llvm-svn: 172496	2013-01-15 01:08:02 +00:00
Chad Rosier	5c118fd2ec	[ms-inline asm] Extend support for parsing Intel bracketed memory operands that have an arbitrary ordering of the base register, index register and displacement. rdar://12527141 llvm-svn: 172484	2013-01-14 22:31:35 +00:00
Dmitri Gribenko	f24e57f227	Improve r172468: const_cast is not needed here llvm-svn: 172483	2013-01-14 22:18:18 +00:00
Dmitri Gribenko	2e1df0e354	Improve r172471: avoid all those extra casts on the lines nearby llvm-svn: 172481	2013-01-14 22:08:37 +00:00
Quentin Colombet	77ca8b83a9	Follow up of commit r172472. Refactor the big if/else sequence into one string switch for ARM subtype selection. llvm-svn: 172475	2013-01-14 21:34:09 +00:00
Quentin Colombet	1a71168624	Complete the existing support of ARM v6m, v7m, and v7em, i.e., respectively cortex-m0, cortex-m3, and cortex-m4 on the backend side. Adds new subtype values for the MachO format and uses them when the related triples are set. llvm-svn: 172472	2013-01-14 21:07:43 +00:00
David Greene	cf7ae6c2fd	Fix Casting Fix a casting-away-const compiler warning. llvm-svn: 172471	2013-01-14 21:04:47 +00:00
David Greene	c311561708	Fix Another Cast Properly cast code to eliminate cast-away-const errors. llvm-svn: 172468	2013-01-14 21:04:42 +00:00
Craig Topper	0d2c29e807	Simplify nested strconcats in X86 td files since strconcat can take more than 2 arguments. llvm-svn: 172379	2013-01-14 07:46:34 +00:00
Craig Topper	4c69a05d2d	Create a single multiclass for SSE and AVX version of MOVL/MOVH. Prevents needing to specify everything twice. No functional change intended llvm-svn: 172378	2013-01-14 07:26:58 +00:00
Nick Lewycky	f41a80efd0	Fix typo in comment. llvm-svn: 172364	2013-01-13 19:03:55 +00:00
Dmitri Gribenko	226fea5bd6	Remove redundant 'llvm::' qualifications llvm-svn: 172358	2013-01-13 16:01:15 +00:00
Benjamin Kramer	bcd14a0f26	X86: Add patterns for X86ISD::VSEXT in registers. Those can occur when something between the sextload and the store is on the same chain and blocks isel. Fixes PR14887. llvm-svn: 172353	2013-01-13 11:37:04 +00:00
NAKAMURA Takumi	de45c3a485	MipsDisassembler.cpp: Prune DecodeHWRegs64RegisterClass() to suppress a warning. [-Wunused-function] llvm-svn: 172319	2013-01-12 15:37:00 +00:00
NAKAMURA Takumi	956c123ab6	MipsAsmParser: Try to unbreak tests to add extra check. llvm-svn: 172315	2013-01-12 15:19:10 +00:00
Jack Carter	873c724b4a	This patch tackles the problem of parsing Mips register names in the standalone assembler llvm-mc. Registers such as $A1 can represent either a 32 or 64 bit register based on the instruction using it. In addition, based on the abi, $T0 can represent different 32 bit registers. The problem is resolved by the Mips specific AsmParser td definitions changing to work together. Many cases of RegisterClass parameters are now RegisterOperand. Contributor: Vladimir Medic llvm-svn: 172284	2013-01-12 01:03:14 +00:00
Preston Gurd	99c6990457	Update patch for the pad short functions pass for Intel Atom (only). Adds a check for -Oz, changes the code to not re-visit BBs, and skips over DBG_VALUE instrs. Patch by Andy Zhang. llvm-svn: 172258	2013-01-11 22:06:56 +00:00
NAKAMURA Takumi	7f25427686	X86AsmParser.cpp: Fix up r172148, to add initializer in another CreateMem(). llvm-svn: 172157	2013-01-11 01:13:54 +00:00
Jakub Staszak	ab3d878f35	Remove heavy and unused #includes from X86TargetObjectFile.cpp. llvm-svn: 172151	2013-01-10 23:43:56 +00:00
Chad Rosier	8c2a9c744e	[ms-inline asm] Make sure we set a default value for AddressOf. Follow on to r172121. llvm-svn: 172148	2013-01-10 23:39:07 +00:00
Chad Rosier	a4bc9437a2	[ms-inline asm] Add support for calling functions from inline assembly. Part of rdar://12991541 llvm-svn: 172121	2013-01-10 22:10:27 +00:00
Joel Jones	5459754d33	Fix description of ARMOperand llvm-svn: 172011	2013-01-09 22:34:16 +00:00
Nadav Rotem	b1791a75cd	ARM Cost model: Use the size of vector registers and widest vectorizable instruction to determine the max vectorization factor. llvm-svn: 172010	2013-01-09 22:29:00 +00:00
Adhemerval Zanella	1ae2248e14	PowerPC: EH adjustments This patch adjusts r171506 to make all DWARF encoding pc-relative for PPC64. It also adds the R_PPC64_REL32 relocation handling in MCJIT (since the eh_frame will not generate PIC-relative relocation) and also adds the emission of stubs created by the TTypeEncoding. llvm-svn: 171979	2013-01-09 17:08:15 +00:00
Nadav Rotem	977e0be4a0	Efficient lowering of vector sdiv when the divisor is a splatted power of two constant. PR 14848. The lowered sequence is based on the existing sequence the target-independent DAG Combiner creates for the scalar case. Patch by Zvi Rackover. llvm-svn: 171953	2013-01-09 05:14:33 +00:00
Eric Christopher	bf7bc4966c	Last in the series of removing unnecessary '0' arguments for address space. Reordered the EmitULEB128IntValue arguments to make this easier. llvm-svn: 171949	2013-01-09 03:52:05 +00:00
Andrew Trick	9f0b95f260	MIsched: add an ILP window property to machine model. This was an experimental option, but needs to be defined per-target. e.g. PPC A2 needs to aggressively hide latency. I converted some in-order scheduling tests to A2. Hal is working on more test cases. llvm-svn: 171946	2013-01-09 03:36:49 +00:00
Eric Christopher	e3ab3d0e2c	These functions have default arguments of 0 for the last arg. Use them. llvm-svn: 171933	2013-01-09 01:57:54 +00:00
Nadav Rotem	b696c36fcd	Cost Model: Move the 'max unroll factor' variable to the TTI and add initial Cost Model support on ARM. llvm-svn: 171928	2013-01-09 01:15:42 +00:00
Jack Carter	c3dd91c4d7	This patch produces the correct addend value for an R_MIPS_GPREL16 relocation. Contributor: Jack Carter llvm-svn: 171882	2013-01-08 19:01:28 +00:00
Jack Carter	9e28cd3fad	This patch produces the correct pointer size value in the 64 bit .eh_frame section. It doesn't however allow exception handling to work yet since it depends on the correct relocation model being set in the ELF header flags. Contributor: Jack Carter llvm-svn: 171881	2013-01-08 18:53:20 +00:00
Preston Gurd	a01daace88	Pad Short Functions for Intel Atom The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. This patch has been updated to address Nadav's review comments - Optimize only at >= O1 and don't do optimization if -Os is set - Stores MachineBasicBlock* instead of BBNum - Uses DenseMap instead of std::map - Fixes placement of braces Patch by Andy Zhang. llvm-svn: 171879	2013-01-08 18:27:24 +00:00
Eli Bendersky	4d9ada036c	Renamed MCInstFragment to MCRelaxableFragment and added some comments. No change in functionality. llvm-svn: 171822	2013-01-08 00:22:56 +00:00
Jim Grosbach	9dbf3ee9d0	ARM: Copy-paste error. llvm-svn: 171790	2013-01-07 21:24:35 +00:00
Jim Grosbach	553eb75663	ARM: Fix a few copy-paste errors. s/X86/ARM/ llvm-svn: 171789	2013-01-07 21:12:13 +00:00
Bill Schmidt	9b1e3e25dc	This patch addresses bug 14678 by fixing two problems in medium code model code generation. Variables addressed through a GlobalAlias were not being handled, and variables with available_externally linkage were treated incorrectly. The patch contains two new tests to verify the correct code generation for these cases. llvm-svn: 171778	2013-01-07 19:29:18 +00:00
Jordan Rose	e8f1eaea8a	Change SMRange to be half-open (exclusive end) instead of closed (inclusive) This is necessary not only for representing empty ranges, but for handling multibyte characters in the input. (If the end pointer in a range refers to a multibyte character, should it point to the beginning or the end of the character in a char array?) Some of the code in the asm parsers was already assuming this anyway. llvm-svn: 171765	2013-01-07 19:00:49 +00:00
NAKAMURA Takumi	458a8277cc	R600/SIISelLowering.cpp: Suppress a warning. [-Wunused-variable] llvm-svn: 171728	2013-01-07 11:14:44 +00:00
Tim Northover	2883da3b51	Add LICENSE.TXT covering contributions made by ARM. Absent a Contributor's License Agreement (CLA) with an LLVM legal entity and as reviewed and agreed with Chris Lattner, add a patent license covering future contributions from ARM until there is a CLA. This is to make explicit ARM's grant of patent rights to recipients of LLVM containing ARM-contributed material. llvm-svn: 171721	2013-01-07 10:04:49 +00:00
Craig Topper	ae65212a4b	Remove more unnecessary # operators with nothing to paste preceding them. llvm-svn: 171702	2013-01-07 06:14:20 +00:00
Craig Topper	a8c5ec09c7	Remove # from the beginning and end of def names. The # is a paste operator and should only be used with something to paste on either side. llvm-svn: 171697	2013-01-07 05:45:56 +00:00
Craig Topper	25cdf92b34	Remove # from the beginning and end of def names. llvm-svn: 171696	2013-01-07 05:26:58 +00:00
Craig Topper	bd62d64cbf	Remove unnecessary # tokens at the beginning and end of defm names. llvm-svn: 171694	2013-01-07 05:04:39 +00:00
Chandler Carruth	2109f47d97	Fix the enumerator names for ShuffleKind to match the coding standards, and make its comments doxygen comments. llvm-svn: 171688	2013-01-07 03:20:02 +00:00
Chandler Carruth	50a36cd148	Make the popcnt support enums and methods have clearer names and follow the coding conventions regarding enumerating a set of "kinds" of things. llvm-svn: 171687	2013-01-07 03:16:03 +00:00
Chandler Carruth	d3e73556d6	Move TargetTransformInfo to live under the Analysis library. This no longer would violate any dependency layering and it is in fact an analysis. =] llvm-svn: 171686	2013-01-07 03:08:10 +00:00
Chandler Carruth	664e354de7	Switch TargetTransformInfo from an immutable analysis pass that requires a TargetMachine to construct (and thus isn't always available), to an analysis group that supports layered implementations much like AliasAnalysis does. This is a pretty massive change, with a few parts that I was unable to easily separate (sorry), so I'll walk through it. The first step of this conversion was to make TargetTransformInfo an analysis group, and to sink the nonce implementations in ScalarTargetTransformInfo and VectorTargetTransformInfo into a NoTargetTransformInfo pass. This allows other passes to add a hard requirement on TTI, and assume they will always get at least one implementation. The TargetTransformInfo analysis group leverages the delegation chaining trick that AliasAnalysis uses, where the base class for the analysis group delegates to the previous analysis pass, allowing all but the NoFoo analysis passes to only implement the parts of the interfaces they support. It also introduces a new trick where each pass in the group retains a pointer to the top-most pass that has been initialized. This allows passes to implement one API in terms of another API and benefit when some other pass above them in the stack has more precise results for the second API. The second step of this conversion is to create a pass that implements the TargetTransformInfo analysis using the target-independent abstractions in the code generator. This replaces the ScalarTargetTransformImpl and VectorTargetTransformImpl classes in lib/Target with a single pass in lib/CodeGen called BasicTargetTransformInfo. This class actually provides most of the TTI functionality, basing it upon the TargetLowering abstraction and other information in the target independent code generator. The third step of the conversion adds support to all TargetMachines to register custom analysis passes. 
This allows building those passes with access to TargetLowering or other target-specific classes, and it also allows each target to customize the set of analysis passes desired in the pass manager. The baseline LLVMTargetMachine implements this interface to add the BasicTTI pass to the pass manager, and all of the tools that want to support target-aware TTI passes call this routine on whatever target machine they end up with to add the appropriate passes. The fourth step of the conversion created target-specific TTI analysis passes for the X86 and ARM backends. These passes contain the custom logic that was previously in their extensions of the ScalarTargetTransformInfo and VectorTargetTransformInfo interfaces. I separated them into their own file, as now all of the interface bits are private and they just expose a function to create the pass itself. Then I extended these target machines to set up a custom set of analysis passes, first adding BasicTTI as a fallback, and then adding their customized TTI implementations. The fourth step required logic that was shared between the target independent layer and the specific targets to move to a different interface, as they no longer derive from each other. As a consequence, helper functions were added to TargetLowering representing the common logic needed both in the target implementation and the codegen implementation of the TTI pass. While technically this is the only change that could have been committed separately, it would have been a nightmare to extract. The final step of the conversion was just to delete all the old boilerplate. This got rid of the ScalarTargetTransformInfo and VectorTargetTransformInfo classes, all of the support in all of the targets for producing instances of them, and all of the support in the tools for manually constructing a pass based around them. Now that TTI is a relatively normal analysis group, two things become straightforward. 
First, we can sink it into lib/Analysis which is a more natural layer for it to live. Second, clients of this interface can depend on it always being available which will simplify their code and behavior. These (and other) simplifications will follow in subsequent commits, this one is clearly big enough. Finally, I'm very aware that much of the comments and documentation needs to be updated. As soon as I had this working, and plausibly well commented, I wanted to get it committed and in front of the build bots. I'll be doing a few passes over documentation later if it sticks. Commits to update DragonEgg and Clang will be made presently. llvm-svn: 171681	2013-01-07 01:37:14 +00:00
Craig Topper	4f1c7256f9	Fix suffix handling for parsing and printing of cvtsi2ss, cvtsi2sd, cvtss2si, cvttss2si, cvtsd2si, and cvttsd2si to match gas behavior. cvtsi2* should parse with an 'l' or 'q' suffix or no suffix at all. No suffix should be treated the same as 'l' suffix. Printing should always print a suffix. Previously we didn't parse or print an 'l' suffix. cvtt2si/cvt2si should parse with an 'l' or 'q' suffix or no suffix at all. No suffix should use the destination register size to choose encoding. Printing should not print a suffix. Original 'l' suffix issue with cvtsi2* pointed out by Michael Kuperstein. llvm-svn: 171668	2013-01-06 20:39:29 +00:00
Evan Cheng	3fb03e23a4	Fix for PR14739. It's not safe to fold a load into a call across a store. Thanks to Nick Lewycky for the initial patch. llvm-svn: 171665	2013-01-06 19:00:15 +00:00
Chandler Carruth	539edf4ee0	Convert the TargetTransformInfo from an immutable pass with dynamic interfaces which could be extracted from it, and must be provided on construction, to a chained analysis group. The end goal here is that TTI works much like AA -- there is a baseline "no-op" and target independent pass which is in the group, and each target can expose a target-specific pass in the group. These passes will naturally chain allowing each target-specific pass to delegate to the generic pass as needed. In particular, this will allow a much simpler interface for passes that would like to use TTI -- they can have a hard dependency on TTI and it will just be satisfied by the stub implementation when that is all that is available. This patch is a WIP however. In particular, the "stub" pass is actually the one and only pass, and everything there is implemented by delegating to the target-provided interfaces. As a consequence the tools still have to explicitly construct the pass. Switching targets to provide custom passes and sinking the stub behavior into the NoTTI pass is the next step. llvm-svn: 171621	2013-01-05 11:43:11 +00:00
Craig Topper	92a70b1e65	Recommit r171461 which was incorrectly reverted. Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks. llvm-svn: 171608	2013-01-05 07:39:25 +00:00
Nadav Rotem	478b6a47ec	Revert revision 171524. Original message: URL: http://llvm.org/viewvc/llvm-project?rev=171524&view=rev Log: The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. Patch by Andy Zhang. llvm-svn: 171603	2013-01-05 05:42:48 +00:00
Chandler Carruth	4a7c311008	Refactor the ScalarTargetTransformInfo API for querying about the legality of an address mode to not use a struct of four values and instead to accept them as parameters. I'd love to have named parameters here as most callers only care about one or two of these, but the defaults aren't terribly scary to write out. That said, there is no real impact of this as the passes aren't yet using STTI for this and are still relying upon TargetLowering. llvm-svn: 171595	2013-01-05 03:36:17 +00:00
Akira Hatanaka	d35a263076	[mips] Fix data layout string. Add 64 to the list of native integer widths and add stack alignment information. llvm-svn: 171587	2013-01-05 02:00:56 +00:00
Jakub Staszak	43fafaf496	Move 'break' to the right place to prevent fallthru. There is no test-case because conditions in the next case prevented it from doing anything nasty. llvm-svn: 171549	2013-01-04 23:01:26 +00:00
Preston Gurd	e36b685a94	The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. Patch by Andy Zhang. llvm-svn: 171524	2013-01-04 20:54:54 +00:00
Akira Hatanaka	b13b33359b	[mips] MipsTargetLowering::getSetCCResultType should return a vector type if vectors are being compared. llvm-svn: 171517	2013-01-04 20:06:01 +00:00
Akira Hatanaka	e067e5a13f	[mips] 80 columns. llvm-svn: 171515	2013-01-04 19:38:05 +00:00
Akira Hatanaka	f412e7501a	[mips] Reorder template parameters. Remove class shift_rotate_imm32 and shift_rotate_imm64. llvm-svn: 171513	2013-01-04 19:25:46 +00:00
Akira Hatanaka	a7a9fa1c16	[mips] Refactor conditional move instructions. llvm-svn: 171511	2013-01-04 19:16:38 +00:00
Akira Hatanaka	e36e2f6876	[mips] Refactor instructions which move data from or to coprocessors. llvm-svn: 171510	2013-01-04 19:13:49 +00:00
Adhemerval Zanella	9b0b781395	PowerPC: Fix eh_frame relocation for PIC This patch fixes the PPC eh_frame definitions for the personality and frame unwinding for PIC objects. It makes PIC builds correctly create relative relocations in the '.rela.eh_frame' segments, thus avoiding a text relocation that generates a DT_TEXTREL segment in the link phase. llvm-svn: 171506	2013-01-04 19:08:13 +00:00
Nadav Rotem	e6bb35435d	Change the default number of registers to prevent unrolling on targets that don't have this hook. llvm-svn: 171489	2013-01-04 18:40:39 +00:00
Nadav Rotem	e1d5c4b8b9	LoopVectorizer: 1. Add code to estimate register pressure. 2. Add code to select the unroll factor based on register pressure. 3. Add bits to TargetTransformInfo to provide the number of registers. llvm-svn: 171469	2013-01-04 17:48:25 +00:00
Nadav Rotem	c616a5408a	Revert revision: 171467. This transformation is incorrect and makes some tests fail. Original message: Simplified TRUNCATE operation that comes after SETCC. It is possible since SETCC result is 0 or -1. Added a test. llvm-svn: 171468	2013-01-04 17:35:21 +00:00
Elena Demikhovsky	5f2f06d2d9	Simplified TRUNCATE operation that comes after SETCC. It is possible since SETCC result is 0 or -1. Added a test. llvm-svn: 171467	2013-01-03 08:48:33 +00:00
Michael Gottesman	820aac1c78	Revert "Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks." This reverts commit r171461 since it breaks the following tests: Clang :: Analysis/outofbound-notwork.c Clang :: Analysis/string-fail.c Clang :: CXX/basic/basic.lookup/basic.lookup.qual/p6-0x.cpp Clang :: CXX/basic/basic.lookup/basic.lookup.unqual/p15.cpp Clang :: CXX/dcl.dcl/dcl.spec/dcl.fct.spec/p4.cpp Clang :: CXX/dcl.dcl/dcl.spec/dcl.stc/p10.cpp Clang :: CXX/temp/temp.param/p14.cpp Clang :: CXX/temp/temp.res/temp.dep.res/temp.point/p1.cpp Clang :: CodeGen/2009-02-13-zerosize-union-field-ppc.c Clang :: CodeGen/blocks-2.c Clang :: CodeGen/libcalls-d.c Clang :: CodeGen/libcalls-ld.c Clang :: CodeGenCXX/conversion-function.cpp Clang :: CodeGenCXX/debug-info-limit-type.cpp Clang :: CodeGenCXX/inheriting-constructor.cpp Clang :: FixIt/fixit-errors.c Clang :: FixIt/fixit-pmem.cpp Clang :: Modules/namespaces.cpp Clang :: PCH/changed-files.c Clang :: PCH/pr4489.c Clang :: PCH/source-manager-stack.c Clang :: Parser/cxx-ambig-decl-expr-xfail.cpp Clang :: SemaCXX/switch-implicit-fallthrough-cxx98.cpp Clang :: SemaTemplate/instantiate-function-1.mm llvm-svn: 171466	2013-01-03 08:18:30 +00:00
Craig Topper	7c27cc9fd0	Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks. llvm-svn: 171461	2013-01-03 06:40:20 +00:00
Hal Finkel	95de3f3018	Add a subtype parameter to VTTI::getShuffleCost In order to cost subvector insertion and extraction, we need to know the type of the subvector being extracted. No functionality change. llvm-svn: 171453	2013-01-03 02:34:09 +00:00
Kevin Enderby	726e0ea6eb	Adds missing aliases for fcom and fcomp instructions without arguments. Patch by Michael M Kuperstein! llvm-svn: 171414	2013-01-02 21:20:15 +00:00
Nadav Rotem	761937a757	AVX: Fix a bug in WidenMaskArithmetic. llvm-svn: 171398	2013-01-02 17:41:03 +00:00
Chandler Carruth	9fb823bbd4	Move all of the header files which are involved in modelling the LLVM IR into their new header subdirectory: include/llvm/IR. This matches the directory structure of lib, and begins to correct a long standing point of file layout clutter in LLVM. There are still more header files to move here, but I wanted to handle them in separate commits to make tracking what files make sense at each layer easier. The only really questionable files here are the target intrinsic tablegen files. But that's a battle I'd rather not fight today. I've updated both CMake and Makefile build systems (I think, and my tests think, but I may have missed something). I've also re-sorted the includes throughout the project. I'll be committing updates to Clang, DragonEgg, and Polly momentarily. llvm-svn: 171366	2013-01-02 11:36:10 +00:00
Chandler Carruth	be81023d74	Re-sort the #include lines in include/... and lib/... with the utils/sort_includes.py script. Most of these are updating the new R600 target and fixing up a few regressions that have crept in since the last time I sorted the includes. llvm-svn: 171362	2013-01-02 10:22:59 +00:00
Craig Topper	9791afb182	Merge SSE and AVX instruction definitions for scalar forms of SQRT, RSQRT, and RCP. llvm-svn: 171356	2013-01-02 08:00:39 +00:00
Craig Topper	4bc5c4e152	Merge SSE and AVX instruction definitions for PSHUFD/PSHUFHW/PSHUFLW. llvm-svn: 171355	2013-01-02 07:27:49 +00:00
Rafael Espindola	db1a84c84a	Revert 171351. It broke MC/X86/x86-32-avx.s. llvm-svn: 171352	2013-01-02 01:35:11 +00:00
Craig Topper	86d0cdb82f	Merge SSE and AVX instruction definitions for scalar forms of SQRT, RSQRT, and RCP. llvm-svn: 171351	2013-01-01 20:53:20 +00:00
Craig Topper	12ed9cd6ae	Remove unused argument from a multiclass. llvm-svn: 171340	2013-01-01 03:42:44 +00:00
Craig Topper	2edafc059d	Merge intrinsic instruction definitions for SSE and AVX versions of RCPPS and RSQRTPS. llvm-svn: 171339	2013-01-01 03:30:21 +00:00
Craig Topper	d04dbec6c9	Remove 2 unused multiclasses. llvm-svn: 171338	2013-01-01 02:02:45 +00:00
Craig Topper	7cc4f322cf	Merge AVX/SSE instruction definitions for SQRTPS/PD, RSQRTPS, RCPPS. No functional change intended. llvm-svn: 171337	2013-01-01 00:11:07 +00:00
Craig Topper	c2521cd309	Use packed instead of scalar itineraries for SSE1/2 SQRTPS/PD, RCPPS, and RSQRTPS. VEX-encoded forms already use packed. llvm-svn: 171336	2012-12-31 23:49:05 +00:00
Bill Wendling	6e95ae803a	Remove the getAttributesAtIndex and getNumAttrs methods in favor of using the getAttrSomewhere predicate. This prevents the uses of 'Attribute' as a collection of attributes. llvm-svn: 171271	2012-12-31 00:49:59 +00:00
Nuno Lopes	b6ad98224a	Convert a bunch of callers from DataLayout::getIndexedOffset() to GEP::accumulateConstantOffset(). The latter API is nicer than the former, and is correct regarding wrap-around offsets (if anyone cares). There are a few more places left with duplicated code, which I'll remove soon. llvm-svn: 171259	2012-12-30 16:25:48 +00:00
Bill Wendling	749a43d874	Use the predicate methods off of AttributeSet instead of Attribute. llvm-svn: 171257	2012-12-30 13:50:49 +00:00
Bill Wendling	74dba875e2	Remove the Function::getRetAttributes method in favor of using the AttributeSet accessor method. llvm-svn: 171256	2012-12-30 13:01:51 +00:00
Bill Wendling	94dcaf8e2b	Remove Function::getParamAttributes and use the AttributeSet accessor methods instead. llvm-svn: 171255	2012-12-30 12:45:13 +00:00
Bill Wendling	698e84fc4f	Remove the Function::getFnAttributes method in favor of using the AttributeSet directly. This is in preparation for removing the use of the 'Attribute' class as a collection of attributes. That will shift to the AttributeSet class instead. llvm-svn: 171253	2012-12-30 10:32:01 +00:00
Bill Wendling	6190254e0f	s/hasAttribute/contains/g to be more consistent with other method names. llvm-svn: 171252	2012-12-30 09:17:46 +00:00
Craig Topper	fe82eb6bcd	Remove intrinsic specific instructions for (V)SQRTPS/PD. Instead lower to target-independent ISD nodes and use the existing patterns for those. llvm-svn: 171237	2012-12-29 18:18:20 +00:00
Craig Topper	f4a9c6e21b	Merge similar functionality using a nested switch. llvm-svn: 171229	2012-12-29 17:19:06 +00:00
Craig Topper	6b27251a76	Remove intrinsic specific instructions for SSE/SSE2/AVX floating point max/min instructions. Lower them to target specific nodes and use those patterns instead. This also allows them to be commuted if UnsafeFPMath is enabled. llvm-svn: 171227	2012-12-29 16:44:25 +00:00
Jakub Staszak	215f94143c	Simplify code, no functionality change. llvm-svn: 171226	2012-12-29 15:57:26 +00:00
Jakub Staszak	afe8109fce	Delete executable bit on ./lib/Target/Hexagon/HexagonAsmPrinter.h. llvm-svn: 171225	2012-12-29 15:23:06 +00:00
Nadav Rotem	9785f519b4	CostModel: initial checkin for code that estimates the cost of special shuffles. llvm-svn: 171180	2012-12-28 08:19:03 +00:00
Nadav Rotem	c982a2dc25	wrap 80-col lines. llvm-svn: 171179	2012-12-28 07:28:43 +00:00
Nadav Rotem	3da9ac72fa	AVX: Move the ZEXT/ANYEXT DAGCo optimizations to the lowering of these optimizations. The old test cases still cover all of these lowering/optimizations. The single change that we have is that now anyext does not need to zero a register, because it does not use the exact code path as the zero_extend. llvm-svn: 171178	2012-12-28 05:45:24 +00:00
Nadav Rotem	68441914a5	Reverse the 'if' condition and reduce the indentation. llvm-svn: 171172	2012-12-27 23:08:05 +00:00
Craig Topper	ab2e6842cc	Merge basic_sse12_fp_binop_p_int and basic_sse12_fp_binop_p_y_int multiclasses. llvm-svn: 171171	2012-12-27 22:53:47 +00:00
Nadav Rotem	3b34190100	AVX/AVX2: Move the SEXT lowering code from a target specific DAGco to a lowering function. llvm-svn: 171170	2012-12-27 22:47:16 +00:00
Craig Topper	e2eec3c52b	Merge basic_sse12_fp_binop_p and basic_sse12_fp_binop_p_y multiclasses. llvm-svn: 171166	2012-12-27 18:51:50 +00:00
Nadav Rotem	2a054b4475	On AVX/AVX2 the type v8i1 is legalized to v8i16, which is an XMM sized register. In most cases we actually compare or select YMM-sized registers and mixing the two types creates horrible code. This commit optimizes some of the transition sequences. PR14657. llvm-svn: 171148	2012-12-27 08:15:45 +00:00
Nadav Rotem	8e5d80eba3	AVX/AVX2: Move the code that lowers vector-trunc from a DAGCo-hook to custom lowering hook. The vector truncs were scalarized during LegalizeVectorOps, later vectorized again by some DAGCombine optimization and finally, lowered by a DAGCombine optimization. Now, they are properly lowered during LegalizeVectorOps. No new testcase because the original testcases still work. llvm-svn: 171146	2012-12-27 07:45:10 +00:00
Craig Topper	757f3fc394	Add hasSideEffects=0 to some forms of ROUND, RCP, and RSQRT. llvm-svn: 171143	2012-12-27 07:16:08 +00:00
Craig Topper	09ce4b9efe	Move single letter 'P' prefix out of multiclass now that tablegen allows defm to start with #NAME. This makes instruction names more searchable again. llvm-svn: 171141	2012-12-27 06:34:54 +00:00
Craig Topper	396cb795bc	Add hasSideEffects=0 to some shift and rotate instructions. None of which are currently used by code generation. llvm-svn: 171137	2012-12-27 03:35:44 +00:00
Craig Topper	c7910828e4	Mark the divide instructions as hasSideEffects=0. llvm-svn: 171136	2012-12-27 03:01:18 +00:00
Craig Topper	5b807aaa38	Add hasSideEffects=0 to CMP*rr_REV. llvm-svn: 171130	2012-12-27 02:08:46 +00:00
Craig Topper	89e8607755	Add mayLoad, mayStore, and hasSideEffects tags to BT/BTS/BTR/BTC instructions. Shouldn't change any functionality since they don't have patterns to select them. llvm-svn: 171128	2012-12-27 02:01:33 +00:00
Craig Topper	c557343956	Fix operands and encoding form for ARPL instruction. Register form had and reversed. Memory form writes memory, but was marked as MRMSrcMem. llvm-svn: 171123	2012-12-26 23:27:57 +00:00
Craig Topper	d47a70de9f	Add hasSideEffects=0 to some atomic instructions. llvm-svn: 171122	2012-12-26 23:08:12 +00:00
Craig Topper	af2372087b	Mark the AL/AX/EAX forms of the basic arithmetic operations as never having side effects. llvm-svn: 171121	2012-12-26 22:19:23 +00:00
Craig Topper	1b8c0750ee	Mark all the _REV instructions as not having side effects. They aren't really emitted by the backend, but it reduces the number of instructions in the output files with unmodelled side effects to make auditing easier. llvm-svn: 171118	2012-12-26 21:30:22 +00:00
Craig Topper	18f2675e9b	Remove a special conditional setting of neverHasSideEffects if the instruction didn't have a pattern. This was leftover from when tablegen used to complain if things were already inferred from patterns. llvm-svn: 171117	2012-12-26 21:04:30 +00:00
Craig Topper	24f316e4db	Merge still more SSE/AVX instruction definitions. llvm-svn: 171103	2012-12-26 07:54:43 +00:00
Craig Topper	af629e2700	Merge more SSE/AVX instruction definitions. llvm-svn: 171102	2012-12-26 07:20:35 +00:00
Craig Topper	65fe30450d	Fix 80 column violation. llvm-svn: 171097	2012-12-26 06:15:53 +00:00
Craig Topper	f4d0fe8fcd	Fix class name in comment. llvm-svn: 171096	2012-12-26 06:15:09 +00:00
Craig Topper	59747c4dbd	Merge SSE/AVX PCMPEQ/PCMPGT instruction definitions. llvm-svn: 171095	2012-12-26 06:14:15 +00:00
Craig Topper	8a48677586	Remove 'v' from mnemonic to fix asm matching failures. llvm-svn: 171093	2012-12-26 06:02:15 +00:00
Craig Topper	b4ef0fa3a1	Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction definitions for a bunch of SSE2 integer arithmetic instructions. llvm-svn: 171092	2012-12-26 05:49:15 +00:00
Nadav Rotem	5267bb71b8	Reformat the docs. llvm-svn: 171091	2012-12-26 04:59:20 +00:00
Craig Topper	a2594dd5f0	Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction definitions for PAND/POR/PXOR/PANDN llvm-svn: 171087	2012-12-26 04:36:03 +00:00
Craig Topper	97730a0d6a	Merge an AVX/SSE 256-bit and 128-bit multiclass. llvm-svn: 171086	2012-12-26 03:56:47 +00:00
Craig Topper	8b59746390	Mark VANDNPD/VANDNPDS as not commutable. llvm-svn: 171085	2012-12-26 03:48:10 +00:00
Craig Topper	81d1e596bb	Remove alignment from a bunch more VEX encoded operations in the folding tables. llvm-svn: 171082	2012-12-26 02:44:47 +00:00
Craig Topper	b2922164f0	Remove alignment from folding table for VMOVUPD as an unaligned instruction it shouldn't require alignment... llvm-svn: 171081	2012-12-26 02:14:19 +00:00
Craig Topper	d09a9af9b6	Remove alignment requirements from (V)EXTRACTPS. This instruction does 32-bit stores which aren't required to be aligned on SSE or AVX. llvm-svn: 171080	2012-12-26 01:47:12 +00:00
Craig Topper	caef1c5d86	Remove alignment requirement from VCVTSS2SD in folding tables. Reverting r171049. This instruction doesn't require alignment. llvm-svn: 171078	2012-12-26 00:35:47 +00:00
Hal Finkel	1b5ff08d43	Expand PPC64 atomic load and store Use of store or load with the atomic specifier on 64-bit types would cause instruction-selection failures. As with the 32-bit case, these can use the default expansion in terms of cmp-and-swap. llvm-svn: 171072	2012-12-25 17:22:53 +00:00
Benjamin Kramer	81b5a8fd2e	X86: Shave off one shuffle from the pcmpeqq sequence for SSE2 by making use of and commutativity. llvm-svn: 171064	2012-12-25 13:09:08 +00:00
Benjamin Kramer	df4af41b9b	X86: Custom lower <2 x i64> eq and ne when SSE41 is not available. pcmpeqd, pshufd, pshufd, pand is cheaper than unpack + cmpq, sbbq, cmpq, sbbq + pack. Small speedup on loop-vectorized viterbi (-march=core2). llvm-svn: 171063	2012-12-25 12:54:19 +00:00
Nadav Rotem	00410ae625	VCVTSS2SD requires a strict alignment. Thanks Elena. llvm-svn: 171049	2012-12-25 03:29:18 +00:00
Nick Lewycky	521e0d59f3	Quiet gcc's -Wparentheses warning. No functionality change. llvm-svn: 171044	2012-12-24 19:58:45 +00:00
Benjamin Kramer	9d46110ff1	Use a std::string rather than a dynamically allocated char* buffer. This affords us to use std::string's allocation routines and use the destructor for the memory management. Switching to that also means that we can use operator==(const std::string&, const char *) to perform the string comparison rather than resorting to libc functionality (i.e. strcmp). Patch by Saleem Abdulrasool! Differential Revision: http://llvm-reviews.chandlerc.com/D230 llvm-svn: 171042	2012-12-24 19:23:30 +00:00
Nadav Rotem	3ee6b10dd4	CostModel: We have API for checking the costs of known shuffles. This patch adds support for the insert-subvector and extract-subvector kinds. llvm-svn: 171027	2012-12-24 10:04:03 +00:00
Nadav Rotem	dc0ad92b64	Some x86 instructions can load/store one of the operands to memory. On SSE, this memory needs to be aligned. When these instructions are encoded in VEX (on AVX) there is no such requirement. This changes the folding tables and removes the alignment restrictions from VEX-encoded instructions. llvm-svn: 171024	2012-12-24 09:40:33 +00:00
Nadav Rotem	7e1599e100	Change the codegen Cost Model API for shuffles. This patch removes the API for broadcast and adds a more general API that accepts an enum of known shuffles. llvm-svn: 171022	2012-12-24 08:57:47 +00:00
Nadav Rotem	cf9999d9d5	CostModel: Change the default target-independent implementation for finding the cost of arithmetic functions. We now assume that the cost of arithmetic operations that are marked as Legal or Promote is low, but ops that are marked as custom are higher. llvm-svn: 171002	2012-12-23 17:31:23 +00:00
Nadav Rotem	b15c69a725	whitespace llvm-svn: 170997	2012-12-23 07:33:44 +00:00
Nadav Rotem	1bef5a0509	Rename a function. llvm-svn: 170996	2012-12-23 07:30:09 +00:00
Nadav Rotem	2cade68025	Loop Vectorizer: Update the cost model of scatter/gather operations and make them more expensive. llvm-svn: 170995	2012-12-23 07:23:55 +00:00
Benjamin Kramer	76268ac682	X86: Turn mul of <4 x i32> into pmuludq when no SSE4.1 is available. pmuludq is slow, but it turns out that all the unpacking and packing of the scalarized mul is even slower. 10% speedup on loop-vectorized paq8p. llvm-svn: 170985	2012-12-22 16:07:56 +00:00
Benjamin Kramer	b2f0a2bd4b	X86: Emit vector sext as shuffle + sra if vpmovsx is not available. Also loosen the SSSE3 dependency a bit, expanded pshufb + psra is still better than scalarized loads. Fixes PR14590. llvm-svn: 170984	2012-12-22 11:34:28 +00:00
Nadav Rotem	d5aae980cb	In some cases, due to scheduling constraints we copy the EFLAGS. The only way to read the eflags is using push and pop. If we don't adjust the stack then we run over the first frame index. This is not something that we want to do, so we have to make sure that our machine function does not copy the flags. If it does then we have to emit the prolog that adjusts the stack. rdar://12896831 llvm-svn: 170961	2012-12-21 23:48:49 +00:00
Akira Hatanaka	6ac2fc4976	[mips] Refactor subword-swap, EXT/INS, load-effective-address and read-hardware instructions. llvm-svn: 170956	2012-12-21 23:21:32 +00:00
Akira Hatanaka	beea8a34c3	[mips] Refactor SYNC and multiply/divide instructions. llvm-svn: 170955	2012-12-21 23:17:36 +00:00
Akira Hatanaka	31ddec5887	[mips] Refactor BAL instructions. llvm-svn: 170954	2012-12-21 23:15:59 +00:00
Akira Hatanaka	d6b694f036	[mips] Fix encoding of BAL instruction. Also, fix assembler test case which was not catching the error. llvm-svn: 170953	2012-12-21 23:13:59 +00:00
Akira Hatanaka	a158042a56	[mips] Refactor jump, jump register, jump-and-link and nop instructions. llvm-svn: 170952	2012-12-21 23:03:50 +00:00
Akira Hatanaka	e1826d7464	[mips] Refactor load/store left/right and load-link and store-conditional instructions. llvm-svn: 170950	2012-12-21 23:01:24 +00:00
Akira Hatanaka	d9bf8424e5	[mips] Refactor load/store instructions. llvm-svn: 170948	2012-12-21 22:58:55 +00:00
Akira Hatanaka	b59b047fbe	[mips] Remove unnecessary isPseudo parameter. llvm-svn: 170947	2012-12-21 22:57:26 +00:00
Akira Hatanaka	e738efc95b	[mips] Refactor LUI instruction. llvm-svn: 170944	2012-12-21 22:46:07 +00:00
Akira Hatanaka	895e1cb2aa	[mips] Refactor count leading zero or one instructions. llvm-svn: 170942	2012-12-21 22:43:58 +00:00
Akira Hatanaka	4f4c4aa05e	[mips] Refactor sign-extension-in-register instructions. llvm-svn: 170940	2012-12-21 22:41:52 +00:00
Akira Hatanaka	b14c6e4e5f	[mips] Refactor instructions which copy from and to HI/LO registers. llvm-svn: 170939	2012-12-21 22:39:17 +00:00
Akira Hatanaka	9e89195dce	[mips] Refactor logical NOR instructions. llvm-svn: 170937	2012-12-21 22:35:47 +00:00
Akira Hatanaka	ac10697207	[mips] Move instruction definitions in MipsInstrInfo.td. llvm-svn: 170936	2012-12-21 22:33:43 +00:00
Tom Stellard	09ef8425e9	R600: Coding style - remove empty spaces from the beginning of functions No functionality change. llvm-svn: 170923	2012-12-21 20:12:02 +00:00
Tom Stellard	41398026e7	R600: Fix MAX_UINT definition Patch by: Vadim Girlin Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170922	2012-12-21 20:12:01 +00:00
Tom Stellard	4fa7ac29f1	R600: Add SHADOWCUBE to TEX_SHADOW pattern Patch by: Vadim Girlin Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170921	2012-12-21 20:11:59 +00:00
Benjamin Kramer	5521b94b07	Cleanup compiler warnings on discarding type qualifiers in casts. Switch to C++ style casts. Patch by Saleem Abdulrasool! Differential Revision: http://llvm-reviews.chandlerc.com/D204 llvm-svn: 170917	2012-12-21 19:09:53 +00:00
Benjamin Kramer	82d1c371e2	X86: Match pmin/pmax as a target specific dag combine. This occurs during vectorization. Part of PR14667. llvm-svn: 170908	2012-12-21 17:46:58 +00:00
Roman Divacky	a229186a82	Remove duplicate includes. llvm-svn: 170902	2012-12-21 17:06:44 +00:00
Tom Stellard	a8b0351720	R600: Expand vec4 INT <-> FP conversions llvm-svn: 170901	2012-12-21 16:33:24 +00:00
Benjamin Kramer	4669d18893	X86: Match the SSE/AVX min/max vector ops using a custom node instead of intrinsics This is very mechanical, no functionality change. Preparation for PR14667. llvm-svn: 170898	2012-12-21 14:04:55 +00:00
Nadav Rotem	eacbb731d3	Add a missing "virtual" keyword. llvm-svn: 170842	2012-12-21 05:02:12 +00:00
Quentin Colombet	b1b66e7a25	Add ARM cortex-r5 subtarget. llvm-svn: 170840	2012-12-21 04:35:05 +00:00
Nadav Rotem	6d4fdd6d2c	Improve the X86 cost model for loads and stores. llvm-svn: 170830	2012-12-21 01:33:59 +00:00
Nadav Rotem	a4b53f20a3	BB-Vectorizer: Check the cost of the store pointer type and not the return type, which is void. A number of test cases fail after adding the assertion in TTImpl. llvm-svn: 170828	2012-12-21 01:24:36 +00:00
Reed Kotler	9bff1ead0e	Call llvm_unreachable instead of assert. llvm-svn: 170822	2012-12-21 00:44:59 +00:00
Jakob Stoklund Olesen	33f5d1492d	Add an MF argument to MI::copyImplicitOps(). This function is often used to decorate dangling instructions, so a context reference is required to allocate memory for the operands. Also add a corresponding MachineInstrBuilder method. llvm-svn: 170797	2012-12-20 22:54:02 +00:00
Jakob Stoklund Olesen	2ea203694d	MachineInstrBuilderize ARM. llvm-svn: 170795	2012-12-20 22:53:55 +00:00
Jakob Stoklund Olesen	4255c96aed	MachineInstrBuilderize NVPTX. llvm-svn: 170794	2012-12-20 22:53:53 +00:00
Bob Wilson	7bba4f8957	Revert "Adding support for llvm.arm.neon.vaddl[su].* and" This reverts r170694. The operations can be represented in IR without adding any new intrinsics. llvm-svn: 170765	2012-12-20 21:09:38 +00:00
Evan Cheng	ddc0cb6dc5	On some ARM cpus, flags setting movs with shifter operand, i.e. lsl, lsr, asr, are more expensive than the non-flag setting variant. Teach thumb2 size reduction pass to avoid generating them unless we are optimizing for size. rdar://12892707 llvm-svn: 170728	2012-12-20 19:59:30 +00:00
Roman Divacky	ff95a1dc12	Remove MCTargetAsmLexer and its derived classes now that edis, its only user, is gone. llvm-svn: 170699	2012-12-20 14:43:30 +00:00
Renato Golin	6b2ea4a48f	Adding support for llvm.arm.neon.vaddl[su].* and llvm.arm.neon.vsub[su].* intrinsics. Patch by Pete Couperus <pjcoup@gmail.com> llvm-svn: 170694	2012-12-20 13:52:11 +00:00
Reed Kotler	d11acc7dc0	Implement cfi_def_cfa_offset. "Make check" test case for this coming in the next few days but it's already tested a lot from test-suite and works fine. This patch completes almost 100% pass of test-suite for mips 16. llvm-svn: 170674	2012-12-20 06:59:37 +00:00
Reed Kotler	8965d24a2a	There is one more patch to finish large frames. Make sure we assert on code that has large frames which will not yet compile correctly. llvm-svn: 170673	2012-12-20 06:57:00 +00:00
Jyotsna Verma	56605448f2	Add constant extender support to GP-relative load/store instructions. llvm-svn: 170672	2012-12-20 06:52:46 +00:00
Jyotsna Verma	bf75aaf53e	Add TSFlags to ALU32 type instructions for constant-extender/Relationship maps. llvm-svn: 170671	2012-12-20 06:45:39 +00:00
Reed Kotler	7bff8f1d7a	set register class properly for mips16 here llvm-svn: 170669	2012-12-20 06:06:35 +00:00
Rafael Espindola	fb8ac2df09	Undefine PPC harder. This was causing a build failure while trying to build on ppc ubuntu 12.10 with cmake. llvm-svn: 170668	2012-12-20 05:13:09 +00:00
Reed Kotler	92fc33bc97	This assert is overly restrictive and does not work for mips16. llvm-svn: 170667	2012-12-20 05:09:15 +00:00
Reed Kotler	fd633229f7	Turn on register scavenger for Mips 16 We use an unused Mips 32 register for the emergency slot instead of using the stack. llvm-svn: 170665	2012-12-20 04:44:58 +00:00
Akira Hatanaka	e7f1acc7c0	[mips] Refactor SLT (set on less than) instructions. Separate encoding information from the rest. llvm-svn: 170664	2012-12-20 04:27:52 +00:00
Akira Hatanaka	bbd197e9c4	[mips] Refactor unconditional branch instruction. Separate encoding information from the rest. llvm-svn: 170663	2012-12-20 04:22:39 +00:00


23654 Commits