llvm-project

Commit Graph

Author	SHA1	Message	Date
Eric Christopher	24e3aa011a	Make more explicit and add some currently disabled error messages for stack realignment on ARM. Also check for function attributes as we do on X86 as well as make explicit that we're checking can as well as needs in this function. llvm-svn: 108582	2010-07-17 00:27:24 +00:00
Eric Christopher	c0be37287c	Make comment a bit more clear as well as return statement since needsStackRealignment is currently checking the can conditions as well. llvm-svn: 108581	2010-07-17 00:25:41 +00:00
Jim Grosbach	11013eda5a	Add basic support to code-gen the ARM/Thumb2 bit-field insert (BFI) instruction and a combine pattern to use it for setting a bit-field to a constant value. More to come for non-constant stores. llvm-svn: 108570	2010-07-16 23:05:05 +00:00
Jakob Stoklund Olesen	8289f78569	Remove the isMoveInstr() hook. llvm-svn: 108567	2010-07-16 22:35:46 +00:00
Jakob Stoklund Olesen	8e767bde16	Avoid isMoveInstr when printing XCore pseudo-moves. llvm-svn: 108566	2010-07-16 22:35:37 +00:00
Jakob Stoklund Olesen	2c130b8ead	Use MI.isCopy. llvm-svn: 108565	2010-07-16 22:35:34 +00:00
Jakob Stoklund Olesen	54bcf5049e	Use a small local function for a single remaining late isMoveInstr call in Thumb2ITBlockPass. llvm-svn: 108564	2010-07-16 22:35:32 +00:00
Bill Wendling	499f797cdd	Rename DBG_LABEL PROLOG_LABEL, because it's only used during prolog emission and thus is a much more meaningful name. llvm-svn: 108563	2010-07-16 22:20:36 +00:00
Jakob Stoklund Olesen	8d51149102	Keep valgrind quiet. The isLive() method can read uninitialized memory, but it still gives correct results. llvm-svn: 108561	2010-07-16 22:00:33 +00:00
Jakob Stoklund Olesen	44629eb81b	Emit COPY instead of FMR/FMSD instructions for floating point conversion on PowerPC. llvm-svn: 108555	2010-07-16 21:03:52 +00:00
Eli Friedman	ba9b25a6ba	Add missing attributes to cpp backend. llvm-svn: 108547	2010-07-16 18:47:20 +00:00
Dale Johannesen	da3e05db70	Accept registers with P modifier. PR 5314. llvm-svn: 108545	2010-07-16 18:35:46 +00:00
Jakob Stoklund Olesen	6353e534ec	Teach PPCInstrInfo::storeRegToStackSlot and loadRegFromStackSlot to add memory operands. Hopefully this fixes the llvm-gcc-powerpc-darwin9 buildbot. It really shouldn't since missing memoperands should not affect correctness. llvm-svn: 108540	2010-07-16 18:22:00 +00:00
Jakob Stoklund Olesen	c30b4ddc58	Remove the X86::FP_REG_KILL pseudo-instruction and the X86FloatingPointRegKill pass that inserted it. It is no longer necessary to limit the live ranges of FP registers to a single basic block. llvm-svn: 108536	2010-07-16 17:41:44 +00:00
Jakob Stoklund Olesen	f0af236874	Search for a free FP register instead of just assuming FP7 is not in use. llvm-svn: 108535	2010-07-16 17:41:40 +00:00
Jakob Stoklund Olesen	0e5fb020a0	Allow x87 FP registers to be alive globally in a function. FP_REG_KILL instructions are still inserted, but can be disabled by passing -live-x87 to llc. The X87FPRegKillInserterPass is going to be removed shortly. CFG edges are partioned into bundles where the x87 stack must be allocated identically. Code is insertad at the end of each basic block that shuffles the live FP registers to match the outgoing bundles expectations. This fix is in preparation for some upcoming register allocator improvements that may extend the live range of registers beyond a basic block, similar to LICM. It also provides a nice runtime speedup if you are building with -mfpmath=387. llvm-svn: 108529	2010-07-16 16:38:12 +00:00
Evan Cheng	55f0c6b9fc	Split -enable-finite-only-fp-math to two options: -enable-no-nans-fp-math and -enable-no-infs-fp-math. All of the current codegen fp math optimizations only care whether the fp arithmetics arguments and results can never be NaN. llvm-svn: 108465	2010-07-15 22:07:12 +00:00
Chris Lattner	620693806a	fix the encoding of MMX_MOVFR642Qrr, it starts with 0xF2 not 0xF3, this fixes rdar://8192860. Unfortunately it can only be triggered with llc because llvm-mc matches another (correctly encoded) version of this, so no testcase. llvm-svn: 108454	2010-07-15 20:13:34 +00:00
Eli Friedman	e4be4308a9	Random note about bswap. llvm-svn: 108396	2010-07-15 02:20:38 +00:00
Jakob Stoklund Olesen	8b1bb8cfbd	Last COPY conversion. llvm-svn: 108387	2010-07-14 23:58:21 +00:00
Bob Wilson	0b9aafddc5	Remove restriction on NEON alignment values. Some of the NEON ld/st instructions use different values (e.g., 2-byte or 4-byte alignment). Also fix ARMInstPrinter to print these alignments as bits instead of bytes. llvm-svn: 108386	2010-07-14 23:54:43 +00:00
Jakob Stoklund Olesen	9b449d5a92	Use TargetOpcode::COPY instead of X86-native register copy instructions when lowering atomics. This will allow those copies to still be coalesced after TII::isMoveInstr is removed. llvm-svn: 108385	2010-07-14 23:50:27 +00:00
Chris Lattner	769aedd523	fix indentation llvm-svn: 108368	2010-07-14 23:04:59 +00:00
Benjamin Kramer	92d8998348	Don't pass StringRef by reference. llvm-svn: 108366	2010-07-14 22:38:02 +00:00
Chris Lattner	254858031a	Merge lib/Target/X86/X86COFF.h into include/llvm/Support/COFF.h, patch by Michael Spencer! llvm-svn: 108342	2010-07-14 18:14:33 +00:00
Jim Grosbach	a90af1ba38	Improve 64-subtraction of immediates when parts of the immediate can fit in the literal field of an instruction. E.g., long long foo(long long a) { return a - 734439407618LL; } rdar://7038284 llvm-svn: 108339	2010-07-14 17:45:16 +00:00
Bob Wilson	1aef53403f	Add missing address register update to t2LDM_RET instruction. Patch by Brian Lucas. PR7636. llvm-svn: 108332	2010-07-14 16:02:13 +00:00
Eli Friedman	c4d70125ee	A couple potential optimizations inspired by comment 4 in PR6773. llvm-svn: 108328	2010-07-14 06:58:26 +00:00
Evan Cheng	a8e8874552	Fix for PR7193 was overly conservative. The only case where sibcall callee address cannot be allocated a register is in 32-bit mode where the first three arguments are marked inreg. In that case EAX, EDX, and ECX will be used for argument passing. This fixes PR7610. llvm-svn: 108327	2010-07-14 06:44:01 +00:00
Bob Wilson	bad47f62f6	Add support for NEON VMVN immediate instructions. llvm-svn: 108324	2010-07-14 06:31:50 +00:00
Bob Wilson	bd54a53628	The bits in the cmode field of 32-bit VMOV immediate instructions all depend of the value of the immediate. llvm-svn: 108323	2010-07-14 06:30:44 +00:00
Chris Lattner	164a0775bb	fix a bug found by a warning I added to clang this morning. llvm-svn: 108309	2010-07-14 01:57:17 +00:00
Bob Wilson	103a0dcfe1	Add an ARM-specific DAG combining to avoid redundant VDUPLANE nodes. Radar 7373643. llvm-svn: 108303	2010-07-14 01:22:12 +00:00
Dan Gohman	1f471435f8	Don't propagate debug locations to instructions for materializing constants, since they may not be emited near the other instructions which get the same line, and this confuses debug info. llvm-svn: 108302	2010-07-14 01:07:44 +00:00
Bruno Cardoso Lopes	6c6c14a55c	Add AVX 256-bit compare instructions and a bunch of testcases llvm-svn: 108286	2010-07-13 22:06:38 +00:00
Bob Wilson	a3f1901531	Use a target-specific VMOVIMM DAG node instead of BUILD_VECTOR to represent NEON VMOV-immediate instructions. This simplifies some things. llvm-svn: 108275	2010-07-13 21:16:48 +00:00
Bruno Cardoso Lopes	fd8bfcd6e1	AVX 256-bit conversion instructions Add the x86 VEX_L form to handle special cases where VEX_L must be set. llvm-svn: 108274	2010-07-13 21:07:28 +00:00
Kevin Enderby	76a6b663a3	Added a check that pusha cannot be encoded in 64-bit mode. llvm-svn: 108265	2010-07-13 20:05:41 +00:00
Evan Cheng	0cc4ad983d	Extend the r107852 optimization which turns some fp compare to code sequence using only i32 operations. It now optimize some f64 compares when fp compare is exceptionally slow (e.g. cortex-a8). It also catches comparison against 0.0. llvm-svn: 108258	2010-07-13 19:27:42 +00:00
Evan Cheng	58066e337d	Add an ARM "feature". Cortex-a8 fp comparison is very slow (> 20 cycles). llvm-svn: 108256	2010-07-13 19:21:50 +00:00
Evan Cheng	f43961007c	-enable-unsafe-fp-math should not imply -enable-finite-only-fp-math. llvm-svn: 108254	2010-07-13 18:46:14 +00:00
Gabor Greif	03e7e68caa	rotate CallInst operands with this commit the callee moves to the end of the operand array (from the start) and the call arguments now start at index 0 (formerly 1) this ordering is now consistent with InvokeInst this commit only flips the switch, functionally it is equivalent to r101465 I intend to commit several cleanups after a few days of soak period llvm-svn: 108240	2010-07-13 15:31:36 +00:00
Bob Wilson	c1c6f4796e	Move NEON "modified immediate" encode/decode into ARMAddressingModes.h to avoid replicated code. llvm-svn: 108227	2010-07-13 04:44:34 +00:00
Chris Lattner	55595fb291	my work on adding segment registers to LEA missed the disassembler. Remove some code from the disassembler to compensate, unbreaking disassembly of lea's. llvm-svn: 108226	2010-07-13 04:23:55 +00:00
Bruno Cardoso Lopes	dff283e146	Add AVX 256-bit packed logical forms llvm-svn: 108224	2010-07-13 02:38:35 +00:00
Bruno Cardoso Lopes	36b32aeaa5	Add AVX 256-bit unop arithmetic instructions llvm-svn: 108223	2010-07-13 01:53:31 +00:00
Bruno Cardoso Lopes	77a3c4462f	Since AVX is a superset of all SSE versions, only use HasAVX for AVX instructions llvm-svn: 108222	2010-07-13 00:38:47 +00:00
David Greene	03264efe30	Move some SIMD fragment code into X86InstrFragmentsSIMD so that the utility classes can be used from multiple files. This will aid transitioning to a new refactored x86 SIMD specification. llvm-svn: 108213	2010-07-12 23:41:28 +00:00
Bruno Cardoso Lopes	8e67a0482e	Add AVX 256 binary arithmetic instructions llvm-svn: 108207	2010-07-12 23:04:15 +00:00
Bruno Cardoso Lopes	91806311c9	More refactoring of basic SSE arith instructions. Open room for 256-bit instructions llvm-svn: 108204	2010-07-12 22:41:32 +00:00
Dan Gohman	51e6d9bbf6	Apply the SSE dependence idiom for SSE unary operations to SD instructions too, in addition to SS instructions. And add a comment about it. llvm-svn: 108191	2010-07-12 20:46:04 +00:00
Bob Wilson	8a2bdc8231	Remove some code that doesn't appear to do anything. All the ARM call instructions already have implicit defs of LR. The comment suggests that this is intended to fix something like pr6111, but it doesn't really do that either. llvm-svn: 108186	2010-07-12 20:22:45 +00:00
Bruno Cardoso Lopes	f9bcaad76d	Add AVX 256-bit MOVMSK forms llvm-svn: 108184	2010-07-12 20:06:32 +00:00
Dan Gohman	425b35681f	Check begin!=end, rather than !begin. llvm-svn: 108167	2010-07-12 18:12:35 +00:00
Dan Gohman	68d7424a65	Don't fast-isel an x87 comparison opcode, as fast-isel doesn't support branching on x87 comparisons yet. This fixes PR7624. llvm-svn: 108149	2010-07-12 15:46:30 +00:00
Duncan Sands	41b4a6b36a	Convert some tab stops into spaces. llvm-svn: 108130	2010-07-12 08:16:59 +00:00
Rafael Espindola	6635f9838e	Convert getLoadStoreRegOpcode to use a switch. llvm-svn: 108123	2010-07-12 03:43:04 +00:00
Rafael Espindola	871c724773	Convert the last use of getPhysicalRegisterRegClass and remove it. AggressiveAntiDepBreaker should not be using getPhysicalRegisterRegClass. An instruction might be using a register that can only be replaced with one from a subclass of getPhysicalRegisterRegClass. With this patch we use getMinimalPhysRegClass. This is correct, but conservative. We should check the uses of the register and select the largest register class that can be used in all of them. llvm-svn: 108122	2010-07-12 02:55:34 +00:00
Jakob Stoklund Olesen	de7201545e	A basic block that only uses RFP registers still needs the FP_REG_KILL marker. This fixes PR7375. llvm-svn: 108120	2010-07-12 02:12:47 +00:00
Rafael Espindola	e35d70fafa	Convert the last getPhysicalRegisterRegClass in VirtRegRewriter.cpp to getMinimalPhysRegClass. It was used to produce spills, and it is better to use the most specific class if possible. Update getLoadStoreRegOpcode to handle GR32_AD. llvm-svn: 108115	2010-07-12 00:52:33 +00:00
Jakob Stoklund Olesen	0961c55161	RISC architectures get their memory operand folding for free. The only folding these load/store architectures can do is converting COPY into a load or store, and the target independent part of foldMemoryOperand already knows how to do that. llvm-svn: 108099	2010-07-11 19:19:13 +00:00
Jakob Stoklund Olesen	f6c7d7fb3f	Use target independent COPY instructions for the fake fextend and fround operations in x87 code. llvm-svn: 108098	2010-07-11 18:19:39 +00:00
Jakob Stoklund Olesen	7c1392a765	Remove redundant branch. Thanks, Anton! llvm-svn: 108097	2010-07-11 17:17:35 +00:00
Jakob Stoklund Olesen	98ee37d878	Remove obsolete README_SSE note. We are generating movaps for all XMM register copies, including scalar floating point values. This is known to be at least as good as movss and movsd for all known architectures up to and including Nehalem because it avoids a partial register stall. The SSEDomainFix pass will switch movaps to movdqa when appropriate (i.e., when operands come from the integer unit). We don't now that switching movaps to movapd has any benefit. The same applies to andps -> pand. llvm-svn: 108096	2010-07-11 17:13:42 +00:00
Rafael Espindola	1da1cfccb1	Make getPhysicalRegisterRegClass non-virtual. Should be able to remove it soon. llvm-svn: 108094	2010-07-11 16:49:10 +00:00
Jakob Stoklund Olesen	74e5bf85f7	Replace copyRegToReg with copyPhysReg for SystemZ. llvm-svn: 108092	2010-07-11 16:40:46 +00:00
Jakob Stoklund Olesen	4806848799	Avoid SSE instructions in FastIsel when it is not available. llvm-svn: 108091	2010-07-11 16:22:13 +00:00
Chandler Carruth	34e0d14ff4	Remove two other uses of ATTRIBUTE_UNUSED for variables only used within assert()s, switching to void-casts. Removed an unneeded Compiler.h include as a result. There are two other uses in LLVM, but they're not due to assert()s, so I've left them alone. llvm-svn: 108088	2010-07-11 08:18:12 +00:00
Jakob Stoklund Olesen	928b593486	Replace copyRegToReg with copyPhysReg for XCore. llvm-svn: 108087	2010-07-11 07:56:13 +00:00
Jakob Stoklund Olesen	976b7b61fc	Replace copyRegToReg with copyPhysReg for Sparc. llvm-svn: 108086	2010-07-11 07:56:09 +00:00
Jakob Stoklund Olesen	1dba6814c9	Replace copyRegToReg with copyPhysReg for CellSPU. llvm-svn: 108084	2010-07-11 07:31:03 +00:00
Jakob Stoklund Olesen	0d611979a8	Replace copyRegToReg with copyPhysReg for PowerPC. llvm-svn: 108083	2010-07-11 07:31:00 +00:00
Jakob Stoklund Olesen	f889e280b8	Fix PIC16 comments referencing copyRegToReg. llvm-svn: 108082	2010-07-11 07:30:57 +00:00
Jakob Stoklund Olesen	e494d0ff3e	Replace copyRegToReg with copyPhysReg for PIC16. llvm-svn: 108081	2010-07-11 06:53:33 +00:00
Jakob Stoklund Olesen	65306369ae	Replace copyRegToReg with copyPhysReg for MSP430. llvm-svn: 108080	2010-07-11 06:53:30 +00:00
Jakob Stoklund Olesen	37a38f4b28	Replace copyRegToReg with copyPhysReg for MBlaze. llvm-svn: 108079	2010-07-11 06:53:27 +00:00
Jakob Stoklund Olesen	d7b33002dd	Replace copyRegToReg with copyPhysReg for ARM. llvm-svn: 108078	2010-07-11 06:33:54 +00:00
Jakob Stoklund Olesen	52984e1aef	Replace copyRegToReg with copyPhysReg for Blackfin. llvm-svn: 108077	2010-07-11 05:44:34 +00:00
Jakob Stoklund Olesen	e46f3eb0c4	X86InstrInfo::copyRegToReg is dead. Long live copyPhysReg! llvm-svn: 108076	2010-07-11 05:44:30 +00:00
Jakob Stoklund Olesen	8969657f0c	Use COPY in X86FastISel::X86SelectRet. Don't try a cross-class copy. That is very unlikely anywy since return value registers are usually register class friendly. (%EAX, %XMM0, etc). llvm-svn: 108074	2010-07-11 05:17:02 +00:00
Rafael Espindola	a76eccf815	Fix va_arg for doubles. With this patch VAARG nodes always contain the correct alignment information, which simplifies ExpandRes_VAARG a bit. The patch introduces a new alignment information to TargetLoweringInfo. This is needed since the two natural candidates cannot be used: * The 's' in target data: If this is set to the minimal alignment of any argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for example. * The getTransientStackAlignment method. It is possible for an architecture to have argument less aligned than what we maintain the stack pointer. llvm-svn: 108072	2010-07-11 04:01:49 +00:00
Jakob Stoklund Olesen	3bb1267431	Use COPY in FastISel everywhere it is safe and trivial. The remaining copyRegToReg calls actually check the return value (shock!), so we cannot trivially replace them with COPY instructions. llvm-svn: 108069	2010-07-11 03:31:00 +00:00
Jakob Stoklund Olesen	7002c31480	Replace copyRegToReg with copyPhysReg for Mips. llvm-svn: 108066	2010-07-11 01:08:31 +00:00
Jakob Stoklund Olesen	7198d32fc6	Replace copyRegToReg with copyPhysReg for Alpha. llvm-svn: 108065	2010-07-11 01:08:23 +00:00
Jakob Stoklund Olesen	60af0681cb	Use COPY in targets llvm-svn: 108063	2010-07-10 22:43:03 +00:00
Jakob Stoklund Olesen	de457896b6	Don't emit st(0)/st(1) copies as FpMOV instructions. Use FpSET_ST? instead. Based on a patch by Rafael Espíndola. Attempt to make the FpSET_ST1 hack more robust, but we are still relying on FpSET_ST0 preceeding it. This is only for supporting really weird x87 inline asm. We support: FpSET_ST0 INLINEASM FpSET_ST0 FpSET_ST1 INLINEASM with and without kills on the arguments. We don't support: FpSET_ST1 FpSET_ST0 INLINEASM nor FpSET_ST1 INLINEASM Just Don't Do It! llvm-svn: 108047	2010-07-10 17:42:34 +00:00
Chandler Carruth	d162d85688	Add parentheses yet again to satisfy GCC's warnings. llvm-svn: 108043	2010-07-10 12:06:22 +00:00
Dan Gohman	d7b5ce3312	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Jakob Stoklund Olesen	be8d9b0bb8	An x86 function returns a floating point value in st(0), and we must make sure it is popped, even if it is ununsed. A CopyFromReg node is too weak to represent the required sideeffect, so insert an FpGET_ST0 instruction directly instead. This will matter when CopyFromReg gets lowered to a generic COPY instruction. llvm-svn: 108037	2010-07-10 04:04:25 +00:00
Bruno Cardoso Lopes	5e6c2155a3	Declare YMM subregisters in the right way! Thanks Jakob llvm-svn: 108022	2010-07-09 21:46:19 +00:00
Bruno Cardoso Lopes	2419606bfb	Add AVX 256-bit packed MOVNT variants llvm-svn: 108021	2010-07-09 21:42:42 +00:00
Jakob Stoklund Olesen	e2614a9979	Remember the *_TC opcodes for load/store llvm-svn: 108020	2010-07-09 21:27:55 +00:00
Bruno Cardoso Lopes	6bc772eec7	Add AVX 256-bit unpack and interleave llvm-svn: 108017	2010-07-09 21:20:35 +00:00
Jakob Stoklund Olesen	7a7b55eb67	Automatically fold COPY instructions into stack load/store. llvm-svn: 108012	2010-07-09 20:43:13 +00:00
Jakob Stoklund Olesen	51702ec46b	Fix a few tests llvm-svn: 108011	2010-07-09 20:43:09 +00:00
Jim Grosbach	2a5725b1a3	In the presence of variable sized objects, allocate an emergency spill slot. rdar://8131327 llvm-svn: 108008	2010-07-09 20:27:06 +00:00
Bruno Cardoso Lopes	792e906bef	Start the support for AVX instructions with 256-bit %ymm registers. A couple of notes: - The instructions are being added with dummy placeholder patterns using some 256 specifiers, this is not meant to work now, but since there are some multiclasses generic enough to accept them, when we go for codegen, the stuff will be already there. - Add VEX encoding bits to support YMM - Add MOVUPS and MOVAPS in the first round - Use "Y" as suffix for those Instructions: MOVUPSYrr, ... - All AVX instructions in X86InstrSSE.td will move soon to a new X86InstrAVX file. llvm-svn: 107996	2010-07-09 18:27:43 +00:00
Bob Wilson	6586e9b203	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Bruno Cardoso Lopes	992d25da71	Merge VEX enums with other x86 enum forms. Also fix all checks of which VEX fields to use. llvm-svn: 107952	2010-07-09 01:56:45 +00:00
Dan Gohman	0a7d155d67	Fix the memoperand offsets in code generated for va_start. llvm-svn: 107948	2010-07-09 01:06:48 +00:00
Chris Lattner	88c185617c	have the mc lowering process handle a few tail call forms, lowering them to jumps where possible and turning the TAILCALL marker in the instruction asm string into a proper comment. This eliminates a FIXME and is on the path to finishing: rdar://7639610 - eliminate encoding and asm info for TAILJMPd TAILJMPr TAILJMPn, etc. However, I can't eliminate the encodings for these instructions because the JIT still exists and has its own copy of the encoder, sigh. llvm-svn: 107946	2010-07-09 00:49:41 +00:00
Bob Wilson	88a4e6dc0e	Print "dregpair" NEON operands with a space between them, for readability and consistency with other instructions that have lists of register operands. llvm-svn: 107944	2010-07-09 00:47:20 +00:00
Dan Gohman	0b5aa1cdd3	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Bruno Cardoso Lopes	e6cc0d33bb	Factor out x86 segment override prefix encoding, and also use it for VEX llvm-svn: 107942	2010-07-09 00:38:14 +00:00
Chris Lattner	061d70ad2c	reject pseudo instructions early in the encoder. llvm-svn: 107939	2010-07-09 00:17:50 +00:00
Bruno Cardoso Lopes	b652c1a145	Remove trailing whitespaces from file llvm-svn: 107937	2010-07-09 00:07:19 +00:00
Chris Lattner	f469307c77	Change LEA to have 5 operands for its memory operand, just like all other instructions, even though a segment is not allowed. This resolves a bunch of gross hacks in the encoder and makes LEA more consistent with the rest of the instruction set. No functionality change. llvm-svn: 107934	2010-07-08 23:46:44 +00:00
Chris Lattner	ec536276f0	add some long-overdue enums to refer to the parts of the 5-operand X86 memory operand. llvm-svn: 107925	2010-07-08 22:41:28 +00:00
Jakob Stoklund Olesen	ec58a43d81	Remember the VR64 register class llvm-svn: 107920	2010-07-08 22:30:35 +00:00
Chris Lattner	9f034c1e5d	Rework segment prefix emission code to handle segments in memory operands at the same type as hard coded segments. This fixes problems where we'd emit the segment override after the REX prefix on instructions like: mov %gs:(%rdi), %rax This fixes rdar://8127102. I have several cleanup patches coming next. llvm-svn: 107917	2010-07-08 22:28:12 +00:00
Chris Lattner	1dd82c7dc2	introduce a new X86II::getMemoryOperandNo method, which returns the start of the memory operand for an instruction. Introduce a new "X86AddrSegment" enum to reduce # magic numbers referring to X86 memory operand layout. llvm-svn: 107916	2010-07-08 22:27:06 +00:00
Kalle Raiskila	d799ea52cd	Switch SPU calling convention (function arguments) to a Tablegen implementation. llvm-svn: 107913	2010-07-08 21:15:22 +00:00
Evan Cheng	0f54854a1d	Check for FiniteOnlyFPMath as well. llvm-svn: 107904	2010-07-08 20:12:24 +00:00
Jakob Stoklund Olesen	63a622b768	Teach the x86 floating point stackifier to handle COPY instructions. This pass runs before COPY instructions are passed to copyPhysReg, so we simply translate COPY to the proper pseudo instruction. Note that copyPhysReg does not handle floating point stack copies. Once COPY is used everywhere, this can be cleaned up a bit, and most of the pseudo instructions can be removed. llvm-svn: 107899	2010-07-08 19:46:30 +00:00
Jakob Stoklund Olesen	930f8082c3	Implement X86InstrInfo::copyPhysReg llvm-svn: 107898	2010-07-08 19:46:25 +00:00
Bob Wilson	181e5af248	The NEONPreAllocPass should never have to assign fixed registers anymore. This pass can go away entirely soon. llvm-svn: 107892	2010-07-08 17:45:26 +00:00
Bob Wilson	1eade1a327	For big-endian systems, VLD2/VST2 with 32-bit vector elements will swap the words within the 64-bit D registers. Use VLD1/VST1 with 64-bit elements instead. llvm-svn: 107890	2010-07-08 17:44:00 +00:00
Bob Wilson	6c25043493	Clean up a comment. llvm-svn: 107882	2010-07-08 16:54:45 +00:00
Jakob Stoklund Olesen	00264624a9	Convert EXTRACT_SUBREG to COPY when emitting machine instrs. EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead. Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg(). The isMoveInstr hook will be removed later. llvm-svn: 107879	2010-07-08 16:40:22 +00:00
Jakob Stoklund Olesen	a1e883dcf6	Remove references to INSERT_SUBREG after de-SSA. Fix X86InstrInfo::convertToThreeAddressWithLEA to generate COPY instead of INSERT_SUBREG. llvm-svn: 107878	2010-07-08 16:40:15 +00:00
Benjamin Kramer	2321e6a4d4	Teach instcombine to transform (X >s -1) ? C1 : C2 and (X <s 0) ? C2 : C1 into ((X >>s 31) & (C2 - C1)) + C1, avoiding the conditional. This optimization could be extended to take non-const C1 and C2 but we better stay conservative to avoid code size bloat for now. for int sel(int n) { return n >= 0 ? 60 : 100; } we now generate sarl $31, %edi andl $40, %edi leal 60(%rdi), %eax instead of testl %edi, %edi movl $60, %ecx movl $100, %eax cmovnsl %ecx, %eax llvm-svn: 107866	2010-07-08 11:39:10 +00:00
Eric Christopher	e796253217	A slight reworking of the custom patterns for x86-64 tpoff codegen and correct the testcase for valid assembly. Needs more tests. llvm-svn: 107860	2010-07-08 07:36:46 +00:00
Evan Cheng	be1f7a931e	r107852 is only safe with -enable-unsafe-fp-math to account for +0.0 == -0.0. llvm-svn: 107856	2010-07-08 06:01:49 +00:00
Evan Cheng	25f9364cbd	Optimize some vfp comparisons to integer ones. This patch implements the simplest case when the following conditions are met: 1. The arguments are f32. 2. The arguments are loads and they have no uses other than the comparison. 3. The comparison code is EQ or NE. e.g. vldr.32 s0, [r1] vldr.32 s1, [r0] vcmpe.f32 s1, s0 vmrs apsr_nzcv, fpscr beq LBB0_2 => ldr r1, [r1] ldr r0, [r0] cmp r0, r1 beq LBB0_2 More complicated cases will be implemented in subsequent patches. llvm-svn: 107852	2010-07-08 02:08:50 +00:00
Dale Johannesen	e2289285ae	Changes to ARM tail calls, mostly cosmetic. Add explicit testcases for tail calls within the same module. Duplicate some code to humor those who think .w doesn't apply on ARM. Leave this disabled on Thumb1, and add some comments explaining why it's hard and won't gain much. llvm-svn: 107851	2010-07-08 01:18:23 +00:00
Dan Gohman	e75704369d	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Jakob Stoklund Olesen	6213ab789f	fix copies to/from GR8_ABCD_H even more llvm-svn: 107832	2010-07-07 23:04:56 +00:00
Jim Grosbach	73ef80f76f	grammar llvm-svn: 107831	2010-07-07 22:53:35 +00:00
Jim Grosbach	40eda1076a	Handle cases where the post-RA scheduler may move instructions between the address calculation instructions leading up to a jump table when we're trying to convert them into a TB[H] instruction in Thumb2. This realistically shouldn't happen much, if at all, for well formed inputs, but it's more correct to handle it. rdar://7387682 llvm-svn: 107830	2010-07-07 22:51:22 +00:00
Chris Lattner	05ea2a4791	finish up support for callw: PR7195 llvm-svn: 107826	2010-07-07 22:35:13 +00:00
Chris Lattner	ac5881295c	Implement the major chunk of PR7195: support for 'callw' in the integrated assembler. Still some discussion to be done. llvm-svn: 107825	2010-07-07 22:27:31 +00:00
Bruno Cardoso Lopes	6c61451011	Add more assembly opcodes for SSE compare instructions llvm-svn: 107823	2010-07-07 22:24:03 +00:00
Evan Cheng	1c349f18f8	Move getExtLoad() and (some) getLoad() DebugLoc argument after EVT argument for consistency sake. llvm-svn: 107820	2010-07-07 22:15:37 +00:00
Devang Patel	32a600b494	Print undefined/unknown debug value as "undef". llvm-svn: 107818	2010-07-07 21:52:21 +00:00
Jim Grosbach	e4ba2aa0c4	grammar and trailing whitespace llvm-svn: 107811	2010-07-07 21:06:51 +00:00
Jakob Stoklund Olesen	ddaf0099a5	Allow copies between GR8_ABCD_L and GR8_ABCD_H. This fixes PR7540. llvm-svn: 107809	2010-07-07 20:33:27 +00:00
Dan Gohman	e7ccc51cc1	Implement bottom-up fast-isel. This has the advantage of not requiring a separate DCE pass over MachineInstrs. llvm-svn: 107804	2010-07-07 19:20:32 +00:00
Dan Gohman	2d4d01d0de	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Bruno Cardoso Lopes	fd8060335b	Add AVX AES instructions llvm-svn: 107798	2010-07-07 18:24:20 +00:00
Dan Gohman	ffe64b1ee5	Give FunctionLoweringInfo an MBB member, avoiding the need to pass it around everywhere, and also give it an InsertPt member, to enable isel to operate at an arbitrary position within a block, rather than just appending to a block. llvm-svn: 107791	2010-07-07 16:47:08 +00:00
Dan Gohman	87fb4e8fcd	Simplify FastISel's constructor by giving it a FunctionLoweringInfo instance, rather than pointers to all of FunctionLoweringInfo's members. This eliminates an NDEBUG ABI sensitivity. llvm-svn: 107789	2010-07-07 16:29:44 +00:00
Dan Gohman	fe7532a308	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Bruno Cardoso Lopes	6d122aef97	Add AVX SSE4.2 instructions llvm-svn: 107752	2010-07-07 03:39:29 +00:00
Bruno Cardoso Lopes	3df55b2d6f	Use only one multiclass to pinsrq instructions llvm-svn: 107750	2010-07-07 01:43:01 +00:00
Bruno Cardoso Lopes	fd6c808154	Now that almost all SSE4.1 AVX instructions are added, move code around to more appropriate sections. No functionality changes llvm-svn: 107749	2010-07-07 01:33:38 +00:00
Bruno Cardoso Lopes	8f5472a8e8	Add AVX SSE4.1 insertps, ptest and movntdqa instructions llvm-svn: 107747	2010-07-07 01:14:56 +00:00
Bruno Cardoso Lopes	6430c7350d	Add AVX SSE4.1 extractps and pinsr instructions llvm-svn: 107746	2010-07-07 01:01:13 +00:00
Bob Wilson	5bc8a79e7f	Also use REG_SEQUENCE for VTBX instructions. llvm-svn: 107743	2010-07-07 00:08:54 +00:00
Jim Grosbach	3198483851	Mark eh.sjlj.set/longjmp custom lowerings as Darwin-only since that's where they've been tested to work. llvm-svn: 107742	2010-07-07 00:07:57 +00:00
Bruno Cardoso Lopes	f3116ebe96	Add AVX SSE4.1 Extract Integer instructions llvm-svn: 107740	2010-07-07 00:07:24 +00:00
Jim Grosbach	dc0a0659be	By default, the eh.sjlj.setjmp/longjmp intrinsics should just do nothing rather than assuming a target will custom lower them. Targets which do so should exlicitly mark them as having custom lowerings. PR7454. llvm-svn: 107734	2010-07-06 23:44:52 +00:00
Bob Wilson	3ed511bc6b	Use REG_SEQUENCE nodes to make the table registers for VTBL instructions be allocated to consecutive registers. llvm-svn: 107730	2010-07-06 23:36:25 +00:00
Dale Johannesen	ce65663330	Accept RIP-relative symbols with 'i' constraint, and print the (%rip) only if the 'a' modifier is present. PR 7528. llvm-svn: 107727	2010-07-06 23:27:00 +00:00
Jakob Stoklund Olesen	48deb12593	Track defs for all aliases in NEONMoveFix. This means that an instruction defining an S register will affect the domain of the parent D register. llvm-svn: 107725	2010-07-06 23:26:23 +00:00
Bruno Cardoso Lopes	1f9ad516c6	Add the rest of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107723	2010-07-06 23:15:17 +00:00
Bruno Cardoso Lopes	35702d27c4	Add part of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107720	2010-07-06 23:01:41 +00:00
Bruno Cardoso Lopes	13f0260e76	Fix comment from previous patch llvm-svn: 107717	2010-07-06 22:38:32 +00:00
Bruno Cardoso Lopes	e2bd058d32	Add AVX vblendvpd, vblendvps and vpblendvb instructions Update VEX encoding to support those new instructions llvm-svn: 107715	2010-07-06 22:36:24 +00:00
Dan Gohman	ee0cb70381	CanLowerReturn doesn't need a SelectionDAG; it just needs an LLVMContext. SelectBasicBlock doesn't needs its BasicBlock argument. llvm-svn: 107712	2010-07-06 22:19:37 +00:00
Devang Patel	a3ca21b228	Propagate debug loc. llvm-svn: 107710	2010-07-06 22:08:15 +00:00
Bob Wilson	4c1ca29039	Represent NEON load/store alignments in bytes, not bits. llvm-svn: 107701	2010-07-06 21:26:18 +00:00
Dan Gohman	3439629239	Reapply r107655 with fixes; insert the pseudo instruction into the block before calling the expansion hook. And don't put EFLAGS in a mbb's live-in list twice. llvm-svn: 107691	2010-07-06 20:24:04 +00:00
Devang Patel	23a7593534	Fix PR7545 crash. llvm-svn: 107678	2010-07-06 18:18:32 +00:00
Rafael Espindola	7c510aa7bc	Don't create neon moves in CopyRegToReg. NEONMoveFixPass will do the conversion if profitable. llvm-svn: 107673	2010-07-06 16:24:34 +00:00
Dan Gohman	f4f04107ef	Revert r107655. llvm-svn: 107668	2010-07-06 15:49:48 +00:00
Dan Gohman	2b2a1c3c86	Make getMinimalPhysRegClass' comment mention what makes it different from getPhysicalRegisterRegClass. llvm-svn: 107660	2010-07-06 15:31:55 +00:00
Dan Gohman	12205645a6	Fix a bunch of custom-inserter functions to handle the case where the pseudo instruction is not at the end of the block. llvm-svn: 107655	2010-07-06 15:18:19 +00:00
Eric Christopher	2ad0c779c3	Fix up -fstack-protector on linux to use the segment registers. Split out testcases per architecture and os now. Patch from Nelson Elhage. llvm-svn: 107640	2010-07-06 05:18:56 +00:00
Eric Christopher	d429846eca	Have the X86 backend use Triple instead of a string and some enums. llvm-svn: 107625	2010-07-05 19:26:33 +00:00
Kalle Raiskila	d5ac287140	Remove some unused/redundant code. llvm-svn: 107622	2010-07-05 18:40:09 +00:00
Chris Lattner	c4a7073db3	more tidying. llvm-svn: 107615	2010-07-05 05:53:14 +00:00
Chris Lattner	7b909ac785	some notes about suboptimal insertps's llvm-svn: 107613	2010-07-05 05:48:41 +00:00
Chris Lattner	6d60a14251	rip out even more sporadic v2f32 support. llvm-svn: 107610	2010-07-05 04:38:33 +00:00
Chris Lattner	feb2467bf4	rip out the various v2f32 "mmx" handling logic, now that v2f32 is illegal on x86. llvm-svn: 107609	2010-07-05 04:36:27 +00:00
Chris Lattner	45cc4d74a3	Just rip v2f32 support completely out of the X86 backend. In the example in the testcase, we now generate: _test1: ## @test1 movss 4(%esp), %xmm0 addss 8(%esp), %xmm0 movl 12(%esp), %eax movss %xmm0, (%eax) ret instead of: _test1: ## @test1 subl $20, %esp movl 24(%esp), %eax movq %mm0, (%esp) movq %mm0, 8(%esp) movss (%esp), %xmm0 addss 12(%esp), %xmm0 movss %xmm0, (%eax) addl $20, %esp ret v2f32 support did not work reliably because most of the X86 backend didn't know it was legal. It was apparently only added to support returning source-level v2f32 values in MMX registers in x86-32 mode. If ABI compatibility is important on this GCC-extended-vector type for some reason, then the frontend should generate IR that returns v2i32 instead of v2f32. However, we generally don't try very hard to be abi compatible on gcc extended vectors. llvm-svn: 107601	2010-07-04 23:07:25 +00:00
Chris Lattner	681b926d54	fix PR7518 - terrible codegen of <2 x float>, by only marking v2f32 as legal in 32-bit mode. It is just as terrible there, but I just care about x86-64 and noone claims it is valuable in 64-bit mode. llvm-svn: 107600	2010-07-04 22:57:10 +00:00
Chris Lattner	cb948d3329	indentation llvm-svn: 107599	2010-07-04 22:56:10 +00:00
Bill Wendling	199cacf179	Revert r107583. I no longer think that this is the way to solve the problem. llvm-svn: 107585	2010-07-04 09:16:57 +00:00
Bill Wendling	701aa053b9	Mark sse_load_f32 and sse_load_f64 as having memory operands (SDNPMemOperand). This way when they're morphed the memory operands will be copied as well. llvm-svn: 107583	2010-07-04 08:59:55 +00:00
Eli Friedman	c8f595212f	Minor amendment to switch-lowering improvement. llvm-svn: 107569	2010-07-03 08:43:32 +00:00
Eli Friedman	836fdbc85b	Note switch-lowering inefficiency. llvm-svn: 107565	2010-07-03 07:38:12 +00:00
Bruno Cardoso Lopes	ca99012ac0	Add AVX SSE4.1 blend, mpsadbw and vdp llvm-svn: 107560	2010-07-03 01:37:03 +00:00
Bruno Cardoso Lopes	bc75502f09	Add AVX SSE4.1 binop (some forms of packed max,min,mul,pack,cmp) instructions llvm-svn: 107558	2010-07-03 01:15:47 +00:00
Bruno Cardoso Lopes	fc9cdc4d61	Add AVX SSE4.1 Horizontal Minimum and Position instruction llvm-svn: 107552	2010-07-03 00:49:21 +00:00
Evan Cheng	0664a67fe1	Remove isSS argument from CreateFixedObject. Fixed objects cannot be spill slots so it's always false. llvm-svn: 107550	2010-07-03 00:40:23 +00:00
Bruno Cardoso Lopes	621c85b038	Add AVX SSE4.1 round instructions llvm-svn: 107549	2010-07-03 00:37:44 +00:00
Bruno Cardoso Lopes	5b59c1bf1f	Simple refactoring of SSE4.1 instructions, making room for the AVX forms llvm-svn: 107540	2010-07-02 23:27:59 +00:00
Bruno Cardoso Lopes	c7111fd355	- Add support for the rest of AVX SSE3 instructions - Fix VEX prefix to be emitted with 3 bytes whenever VEX_5M represents a REX equivalent two byte leading opcode llvm-svn: 107523	2010-07-02 22:06:54 +00:00
Evan Cheng	c3525dc0fd	Remove early IT block formation. It's not used. llvm-svn: 107513	2010-07-02 21:07:09 +00:00
Evan Cheng	0ce84486c3	- Two-address pass should not assume unfolding is always successful. - X86 unfolding should check if the instructions being unfolded has memoperands. If there is no memoperands, then it must assume conservative alignment. If this would introduce an expensive sse unaligned load / store, then unfoldMemoryOperand etc. should not unfold the instruction. llvm-svn: 107509	2010-07-02 20:36:18 +00:00
Gabor Greif	9da02a83e9	beautify output llvm-svn: 107500	2010-07-02 19:26:28 +00:00
Gabor Greif	e537ddbdb4	use ArgOperand API llvm-svn: 107498	2010-07-02 19:08:46 +00:00
Bob Wilson	771d04b969	Fix incorrect asm-printing of some NEON immediates. Fix weak testcase so that it checks the immediate values, not just the instructions opcodes. Radar 8110263. llvm-svn: 107487	2010-07-02 17:23:44 +00:00
Gabor Greif	56de4675b6	use ArgOperand API (found by my previous commit) llvm-svn: 107482	2010-07-02 13:37:16 +00:00
Bruno Cardoso Lopes	4ca8ddaceb	Shrink down SSE3 code by more multiclass refactoring llvm-svn: 107448	2010-07-01 23:10:49 +00:00
Bruno Cardoso Lopes	0a17241a0d	Shrink down SSE3 code by some multiclass refactoring - 1st part llvm-svn: 107438	2010-07-01 22:33:18 +00:00
Bob Wilson	8a99b730a9	ARM function alignments were off by a power of two. svn 83242 changed getFunctionAlignment and the corresponding use of that value in the ARM asm printer, but now we're using the standard asm printer. The result of this was that function alignments were dropped completely for Thumb functions. Radar 8143571. llvm-svn: 107435	2010-07-01 22:26:26 +00:00
Bill Wendling	03bcd6ecc8	Implement the "linker_private_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. However, this linkage isn't specific to Objective-C. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". Currently only supported on Darwin platforms. llvm-svn: 107433	2010-07-01 21:55:59 +00:00
Bruno Cardoso Lopes	5e88700f28	Move SSE3 Move patterns to a more appropriate section Add AVX SSE3 packed horizontal and & sub instructions llvm-svn: 107405	2010-07-01 17:35:02 +00:00
Bruno Cardoso Lopes	886ee33a38	Add AVX SSE3 packed addsub instructions llvm-svn: 107404	2010-07-01 17:08:18 +00:00
Dan Gohman	722f5fc567	Enable on-demand fast-isel. llvm-svn: 107377	2010-07-01 02:58:57 +00:00
Dan Gohman	207624edb0	Fix X86FastISel's add folding to actually work, and not fall back to SelectionDAG. llvm-svn: 107376	2010-07-01 02:58:21 +00:00
Bruno Cardoso Lopes	a7a0c83563	Add AVX SSE3 replicate and convert instructions llvm-svn: 107375	2010-07-01 02:33:39 +00:00
Dan Gohman	7937d5606d	Teach X86FastISel to fold constant offsets and scaled indices in the same address. llvm-svn: 107373	2010-07-01 02:27:15 +00:00
Bruno Cardoso Lopes	05166740eb	- Add AVX SSE2 Move doubleword and quadword instructions. - Add encode bits for VEX_W - All 128-bit SSE 1 & SSE2 instructions that are described in the .td file now have a AVX encoded form already working. llvm-svn: 107365	2010-07-01 01:20:06 +00:00
Bruno Cardoso Lopes	d0eacf715f	Move MOVD/MODQ code around, creating sections for each of them llvm-svn: 107308	2010-06-30 18:49:10 +00:00
Bruno Cardoso Lopes	cbcebe2950	Add AVX SSE2 mask creation and conditional store instructions llvm-svn: 107306	2010-06-30 18:38:10 +00:00
Bruno Cardoso Lopes	5c768e4915	Fix a bug introduced in r107211 where instructions with memory operands are declared as commutable llvm-svn: 107300	2010-06-30 18:06:01 +00:00
Bruno Cardoso Lopes	d079c91683	Add AVX SSE2 packed integer extract/insert instructions llvm-svn: 107293	2010-06-30 17:03:03 +00:00
Gabor Greif	12ca3d9fac	use ArgOperand API llvm-svn: 107280	2010-06-30 13:03:37 +00:00
Bruno Cardoso Lopes	e82689fea2	Add AVX SSE2 integer unpack instructions llvm-svn: 107246	2010-06-30 04:06:39 +00:00
Bruno Cardoso Lopes	ec0115c9b7	Add AVX SSE2 packed integer shuffle instructions llvm-svn: 107245	2010-06-30 03:47:56 +00:00
Bruno Cardoso Lopes	51ceead19c	Small refactoring of SSE2 packed integer shuffle instructions llvm-svn: 107243	2010-06-30 03:29:36 +00:00
Bruno Cardoso Lopes	be792feb8b	Add AVX SSE2 pack with saturation integer instructions llvm-svn: 107241	2010-06-30 02:30:25 +00:00
Bruno Cardoso Lopes	2686ea4555	Add AVX SSE2 integer packed compare instructions llvm-svn: 107240	2010-06-30 02:21:09 +00:00
Bruno Cardoso Lopes	2e2caefff9	- Add AVX form of all SSE2 logical instructions - Add VEX encoding bits to x86 MRM0r-MRM7r llvm-svn: 107238	2010-06-30 01:58:37 +00:00
Bruno Cardoso Lopes	3f71ddfaad	Add several AVX integer packed binop instructions llvm-svn: 107225	2010-06-29 23:47:49 +00:00
Bill Wendling	3632171750	Revert r107205 and r107207. llvm-svn: 107215	2010-06-29 22:34:52 +00:00
Eric Christopher	e34471bb31	Add another bswap idiom that isn't matched. llvm-svn: 107213	2010-06-29 22:22:22 +00:00
Bruno Cardoso Lopes	7fee95a38e	Move SSE2 Packed Integer instructions around, and create specific sections for each of them llvm-svn: 107211	2010-06-29 22:12:16 +00:00
Bruno Cardoso Lopes	ba21eb8054	Add AVX Move Aligned/Unaligned packed integers llvm-svn: 107206	2010-06-29 21:25:12 +00:00
Bill Wendling	1767723dbe	Introducing the "linker_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". llvm-svn: 107205	2010-06-29 21:24:00 +00:00
Bruno Cardoso Lopes	30689a3a7f	Add AVX ld/st XCSR register. Add VEX encoding bits for MRMXm x86 form llvm-svn: 107204	2010-06-29 20:35:48 +00:00
Bob Wilson	be157b0ea8	Add support for encoding VDUP (ARM core register) instructions. llvm-svn: 107201	2010-06-29 20:13:29 +00:00
Bruno Cardoso Lopes	a4575f5b31	Add AVX non-temporal stores llvm-svn: 107178	2010-06-29 18:22:01 +00:00
Bruno Cardoso Lopes	049f4ffab1	Move non-temporal movs to their own section llvm-svn: 107168	2010-06-29 17:42:37 +00:00
Bob Wilson	ab0819e10d	Add support for encoding NEON VMOV (from core register to scalar) instructions. The encoding is the same as VMOV (from scalar to core register) except that the operands are in different places. llvm-svn: 107167	2010-06-29 17:34:07 +00:00
Bruno Cardoso Lopes	21a9433e9e	Add sqrt, rsqrt and rcp AVX instructions llvm-svn: 107166	2010-06-29 17:26:30 +00:00
Jim Grosbach	5bee07ec68	skip dbg_value instructions llvm-svn: 107154	2010-06-29 16:55:24 +00:00
Bob Wilson	83b993a977	The t2MOVi16 and t2MOVTi16 instructions do not set CPSR. Trying to add a CPSR operand to them causes an assertion failure, so apparently these instructions haven't been getting a lot of use. llvm-svn: 107147	2010-06-29 16:25:11 +00:00
Rafael Espindola	38a7d7cbc3	Add a VT argument to getMinimalPhysRegClass and replace the copy related uses of getPhysicalRegisterRegClass with it. If we want to make a copy (or estimate its cost), it is better to use the smallest class as more efficient operations might be possible. llvm-svn: 107140	2010-06-29 14:02:34 +00:00
Duncan Sands	193bb1ee6a	Remove pointless variable LastDef. llvm-svn: 107135	2010-06-29 13:23:22 +00:00
Duncan Sands	257eba4df7	Remove unused variable Loc and pointless variables unified_syntax and thumb_mode. llvm-svn: 107133	2010-06-29 13:04:35 +00:00
Duncan Sands	78ad27ca2b	Remove an unused and a pointless variable. llvm-svn: 107131	2010-06-29 13:00:29 +00:00
Duncan Sands	67bfa9d109	Remove pointless and unused variables. llvm-svn: 107130	2010-06-29 12:48:49 +00:00
Duncan Sands	6d28e73acc	Remove initialized but otherwise unused variables. llvm-svn: 107127	2010-06-29 11:22:26 +00:00
Evan Cheng	b59dd8f10a	PR7503: uxtb16 is not available for ARMv7-M. Patch by Brian G. Lucas. llvm-svn: 107122	2010-06-29 05:38:36 +00:00
Evan Cheng	0c30739cbb	Change if-cvt options to something that actually as useable. llvm-svn: 107121	2010-06-29 05:37:59 +00:00
Bruno Cardoso Lopes	de736a6494	Refactoring of arithmetic instruction classes with unary operator llvm-svn: 107116	2010-06-29 01:33:09 +00:00
Jakob Stoklund Olesen	c1eccbc468	When no memoperands are present, assume unaligned, volatile. llvm-svn: 107114	2010-06-29 01:13:07 +00:00
Bruno Cardoso Lopes	d6a091a4d4	Described the missing AVX forms of SSE2 convert instructions llvm-svn: 107108	2010-06-29 00:36:02 +00:00
Bob Wilson	3d12ff797b	Fix Thumb encoding of VMOV (scalar to ARM core register). The encoding is the same as ARM except that the condition code field is always set to ARMCC::AL. llvm-svn: 107107	2010-06-29 00:26:13 +00:00
Bob Wilson	4469a892b4	Make the ARMCodeEmitter identify Thumb functions via ARMFunctionInfo instead of the Subtarget. llvm-svn: 107086	2010-06-28 22:23:17 +00:00
Jim Grosbach	f31c004666	tidy up style. no functional change. llvm-svn: 107073	2010-06-28 21:29:17 +00:00
Bob Wilson	544317dfda	Refactor encoding function for NEON 1-register with modified immediate format. llvm-svn: 107070	2010-06-28 21:16:30 +00:00
Bob Wilson	584387d5e3	Support Thumb mode encoding of NEON instructions. llvm-svn: 107068	2010-06-28 21:12:19 +00:00
Bill Wendling	0a5bb081cc	Reduce indentation via early exit. NFC. llvm-svn: 107067	2010-06-28 21:08:32 +00:00
Jim Grosbach	7ea5fc0794	minor housekeeping cleanup: 80-column, trailing whitespace, spelling, etc.. No functional change. llvm-svn: 106988	2010-06-28 04:27:01 +00:00
Gabor Greif	7d4038dd88	use ArgOperand API llvm-svn: 106946	2010-06-26 12:17:21 +00:00
Gabor Greif	c2ac8c4261	use ArgOperand API llvm-svn: 106945	2010-06-26 12:09:10 +00:00
Gabor Greif	83205af3fa	use ArgOperand API llvm-svn: 106944	2010-06-26 11:51:52 +00:00
Eli Friedman	8cfa7713e9	Followup to r106770: actually generate SXTB and SXTH for sign-extensions. llvm-svn: 106940	2010-06-26 04:36:50 +00:00
Bob Wilson	0248da9db4	Add support for encoding NEON VMOV (from scalar to core register) instructions. llvm-svn: 106938	2010-06-26 04:07:15 +00:00
Evan Cheng	b71233f34d	It's now possible to run code placement pass for ARM. llvm-svn: 106935	2010-06-26 01:52:05 +00:00
Jakob Stoklund Olesen	d7d0d4e882	When creating X86 MUL8 and DIV8 instructions, make sure we don't produce CopyFromReg nodes for aliasing registers (AX and AL). This confuses the fast register allocator. Instead of CopyFromReg(AL), use ExtractSubReg(CopyFromReg(AX), sub_8bit). This fixes PR7312. llvm-svn: 106934	2010-06-26 00:39:23 +00:00
Bob Wilson	b4d39841e4	Renumber NEON instruction formats to be consecutive. llvm-svn: 106927	2010-06-26 00:05:09 +00:00
Bob Wilson	cc386fb125	Rename ARM instruction formats NEONGetLnFrm, NEONSetLnFrm and NEONDupFrm to "N..." instead of "NEON..." for consistency with the other NEON format names. llvm-svn: 106921	2010-06-25 23:56:05 +00:00
Bruno Cardoso Lopes	74d716b9cd	Add AVX convert CVTSS2SI{rr,rm} and CVTDQ2PS{rr,rm} instructions llvm-svn: 106917	2010-06-25 23:47:23 +00:00
Bob Wilson	d66f66a5cf	Remove unused NEONFrm and ThumbMiscFrm ARM instruction formats. Renumber MiscFrm to 25. llvm-svn: 106916	2010-06-25 23:45:37 +00:00
Bruno Cardoso Lopes	83651094ad	Reapply r106896: Add several AVX MOV flavors Support VEX encoding for MRMDestReg llvm-svn: 106912	2010-06-25 23:33:42 +00:00
Daniel Dunbar	acbdf53db4	Thumb2ITBlockPass: Fix a possible dereference of an invalid iterator. This was introduced in r106343, but only showed up recently (with a particular compiler & linker combination) because of the particular check, and because we have no builtin checking for dereferencing the end of an array, which is truly unfortunate. llvm-svn: 106908	2010-06-25 23:14:54 +00:00
Bruno Cardoso Lopes	4530fed87e	revert this now, it's using avx instead of sse :) llvm-svn: 106906	2010-06-25 23:04:29 +00:00
Evan Cheng	02b184de5b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Bob Wilson	2530ca0647	Add support for encoding 3-register NEON instructions, and fix emitNEON2RegInstruction's handling of 2-address operands. llvm-svn: 106900	2010-06-25 22:40:46 +00:00
Bruno Cardoso Lopes	a34d9b6d84	Add several AVX MOV flavors Support VEX encoding for MRMDestReg llvm-svn: 106896	2010-06-25 22:27:51 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Bob Wilson	e70c8b150b	Add support for encoding 2-register NEON instructions. llvm-svn: 106891	2010-06-25 21:17:19 +00:00
Dan Gohman	8de1fe3ccf	pcmpeqd and friends are Commutable. llvm-svn: 106886	2010-06-25 21:05:35 +00:00
Bob Wilson	574f68f815	Fix indentation. llvm-svn: 106881	2010-06-25 20:54:44 +00:00
Bill Wendling	e41e40f689	- Reapply r106066 now that the bzip2 build regression has been fixed. - 2010-06-25-CoalescerSubRegDefDead.ll is the testcase for r106878. llvm-svn: 106880	2010-06-25 20:48:10 +00:00
Bruno Cardoso Lopes	553fafc6ce	Move the last piece of SSE2 convert instructions to the Convert Instructions section llvm-svn: 106877	2010-06-25 20:29:27 +00:00
Bruno Cardoso Lopes	62d1403a03	More SSE refactoring, this time with different types of MOVs llvm-svn: 106876	2010-06-25 20:22:12 +00:00
Jim Grosbach	ba3ece6f27	IT instructions are considered to be scheduling hazards, but are scheduled with the following instructions. This is done via trickery by considering the instruction preceding the IT to be the hazard. Care must be taken to ensure it's the first non-debug instruction, or the presence of debug info will affect codegen. Part of the continuing work for rdar://7797940, making ARM code-gen unaffected by the presence of debug information. llvm-svn: 106871	2010-06-25 18:43:14 +00:00
Bruno Cardoso Lopes	e76c0b13b9	Refactoring of more SSE conversion instructions. Also add some AVX instrinsics Int_V... placeholders llvm-svn: 106867	2010-06-25 18:06:22 +00:00
Bob Wilson	07aead2f8d	Add missing ARM and Thumb data layout info for vector types. Radar 8128745. llvm-svn: 106820	2010-06-25 04:41:08 +00:00
Bob Wilson	eadbf9732f	Reduce indentation. llvm-svn: 106819	2010-06-25 04:12:31 +00:00
Bruno Cardoso Lopes	cbdcce6478	Add some AVX convert instructions llvm-svn: 106815	2010-06-25 00:39:30 +00:00
Bruno Cardoso Lopes	447735aa98	Refactoring of SSE convert intrinsics llvm-svn: 106808	2010-06-24 23:37:07 +00:00
Bruno Cardoso Lopes	78827d1952	Refactoring of SSE conversion instructions llvm-svn: 106804	2010-06-24 22:22:21 +00:00
Bruno Cardoso Lopes	6b6b605917	Refactor SSE cmp intrinsics and declare the same for AVX llvm-svn: 106796	2010-06-24 22:04:40 +00:00
Bruno Cardoso Lopes	4398fd7b83	- Add AVX COMI{SS,SD}{rr,rm} and UCOMI{SS,SD}{rr,rm}. - Fix a small VEX encoding issue. - Move compare instructions to their appropriate place. llvm-svn: 106787	2010-06-24 20:48:23 +00:00
Dale Johannesen	5ad5226c58	Disallow matching "i" constraint to symbol addresses when address requires a register or secondary load to compute (most PIC modes). This improves "g" constraint handling. 8015842. The test from 2007 is attempting to test the fix for PR1761, but since -relocation-model=static doesn't work on Darwin x86-64, it was not testing what it was supposed to be testing and was passing erroneously. Fixed to use Linux x86-64. llvm-svn: 106779	2010-06-24 20:14:51 +00:00
Evan Cheng	c26e2f4b70	Oops. IT block formation pass needs to be run at any optimization level. llvm-svn: 106775	2010-06-24 19:10:14 +00:00
Eli Friedman	246c41d93e	Always allow Thumb-2 SXTB, SXTH, UXTB, and UXTH. Fixes PR7324. llvm-svn: 106770	2010-06-24 18:20:04 +00:00
Bob Wilson	279e55fb2e	PR7458: Try commuting Thumb2 instruction operands to put them into 2-address form so they can be narrowed to 16-bit instructions. llvm-svn: 106762	2010-06-24 16:50:20 +00:00
Dan Gohman	600f62b3ba	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Chris Lattner	8048662539	Teach the x86 mc assembler that %dr6 = %db6, this implements rdar://8013734 llvm-svn: 106725	2010-06-24 07:29:18 +00:00
Chris Lattner	c4e84309c4	more cleanups llvm-svn: 106724	2010-06-24 07:18:14 +00:00
Chris Lattner	056fd06c5f	reduce indentation llvm-svn: 106723	2010-06-24 07:16:25 +00:00
Chris Lattner	cfed96a410	fix breakage from r98938 by correctly marking msp430 calls as variadic. Patch by Ben Ransford! llvm-svn: 106722	2010-06-24 06:46:50 +00:00
Dan Gohman	c3e291c560	Fix a bug in the code which determines when it's safe to use the bt instruction, which was exposed by r106263. llvm-svn: 106718	2010-06-24 02:07:59 +00:00
Eric Christopher	fa6ce139a9	Add a couple more quick comments. llvm-svn: 106717	2010-06-24 02:07:57 +00:00
Devang Patel	0dc3c2d37e	Use ValueMap instead of DenseMap. The ValueMapper used by various cloning utility maps MDNodes also. llvm-svn: 106706	2010-06-24 00:33:28 +00:00
Bruno Cardoso Lopes	191a1cd2bb	Add AVX CMP{SS,SD}{rr,rm} instructions and encoding testcases llvm-svn: 106705	2010-06-24 00:32:06 +00:00
Bruno Cardoso Lopes	6af02a6f69	Move SSE and AVX shuffle, unpack and compare code to more appropriate places llvm-svn: 106702	2010-06-24 00:15:50 +00:00
Bill Wendling	f470747a36	We are missing opportunites to use ldm. Take code like this: void t(int cp0, int cp1, int dp, int fmd) { int c0, c1, d0, d1, d2, d3; c0 = (cp0++ & 0xffff) \| ((cp1++ << 16) & 0xffff0000); c1 = (cp0++ & 0xffff) \| ((cp1++ << 16) & 0xffff0000); / ... */ } It code gens into something pretty bad. But with this change (analogous to the X86 back-end), it will use ldm and generate few instructions. llvm-svn: 106693	2010-06-23 23:00:16 +00:00
Bruno Cardoso Lopes	05220c9a0d	Add AVX MOVMSK{PS,PD}rr instructions llvm-svn: 106683	2010-06-23 21:30:27 +00:00
Bruno Cardoso Lopes	3183dd5692	Add tests for different AVX cmp opcodes, also teach the x86 asm parser to understand the vcmp instruction llvm-svn: 106678	2010-06-23 21:10:57 +00:00
Eric Christopher	5fed9b7c6c	Update according to feedback. llvm-svn: 106677	2010-06-23 20:49:35 +00:00
Bruno Cardoso Lopes	360d6fe299	Add AVX SHUF{PS,PD}{rr,rm} instructions llvm-svn: 106672	2010-06-23 20:07:15 +00:00

... 4 5 6 7 8 ...

14933 Commits