llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrew Trick	8460a3bfa1	whitespace llvm-svn: 192633	2013-10-14 22:18:56 +00:00
Eric Christopher	740025745b	Revert part of a fix from 2010, changes since then: a) x86-64 TLS has been documented b) the code path should use movq for the correct relocation to be generated. I've also added a fixme for the test case that we should improve the code generated, it should look something like is documented in the tls abi document. llvm-svn: 192631	2013-10-14 21:52:26 +00:00
Eric Christopher	755711e510	Reformat this routine slightly. llvm-svn: 192630	2013-10-14 21:52:23 +00:00
Eric Christopher	584d71c6cb	Remove some extraneous whitespace. llvm-svn: 192629	2013-10-14 21:52:18 +00:00
Elena Demikhovsky	82a46ebe0a	Fixed a bug in dynamic allocation memory on stack. The alignment of allocated space was wrong, see Bugzila 17345. Done by Zvi Rackover <zvi.rackover@intel.com>. llvm-svn: 192573	2013-10-14 07:26:51 +00:00
Craig Topper	d7abdb6f12	Create classes to reduce the size of the tablegen entries for the CRC32 instructions. llvm-svn: 192568	2013-10-14 05:19:58 +00:00
Craig Topper	a422b09ae3	Allow pinsrw/pinsrb/pextrb/pextrw/movmskps/movmskpd/pmovmskb/extractps instructions to parse either GR32 or GR64 without resorting to duplicating instructions. llvm-svn: 192567	2013-10-14 04:55:01 +00:00
Craig Topper	4432208884	Add disassembler support for SSE4.1 register/register form of PEXTRW. There is a shorter encoding that was part of SSE2, but a memory form was added in SSE4.1. This is the register form of that encoding. llvm-svn: 192566	2013-10-14 01:42:32 +00:00
Craig Topper	7158745e55	Mark MOVMSKPS/MOVMSKPD/VPINSRWrr64i as AsmParserOnly to remove them from the disassembler tables. Add PINSRWrr64i to complement the AVX version. llvm-svn: 192565	2013-10-14 01:21:22 +00:00
Craig Topper	c4a5a3f65d	Don't use 64-bit versions of MOVMSKPD in CodeGen. The instructions only produce a 1-bit result so we can just use SUBREG_TO_REG to extend the 32-bit versions. llvm-svn: 192562	2013-10-14 00:24:33 +00:00
Craig Topper	88adf2a49c	Remove more filters from the disassembler. Mark some AVX512 instructions as CodeGenOnly. llvm-svn: 192525	2013-10-12 05:41:08 +00:00
Craig Topper	aab53e7785	Mark some more instructions as CodeGenOnly. Remove filters from the disassembler. llvm-svn: 192522	2013-10-12 04:46:18 +00:00
Craig Topper	5fb5bd3373	Allow non-AVX form of pmovmskb to take a GR64 operand. llvm-svn: 192341	2013-10-10 05:33:31 +00:00
Craig Topper	3ada6deaac	Remove duplicate instructions. llvm-svn: 192340	2013-10-10 05:01:22 +00:00
Elena Demikhovsky	a3a714082b	AVX-512: Added VRCP28 and VRSQRT28 instructions and intrinsics. llvm-svn: 192283	2013-10-09 08:16:14 +00:00
Andrew Trick	15a4774345	Add missing HasAVX512 predicate. This was only working because AVX had cheaper rules in all cases. I'm sure there are other places in this file where predicates are missing. llvm-svn: 192276	2013-10-09 05:11:10 +00:00
Craig Topper	a5f628cecf	Replace a couple instructions with patterns referring to other instructions with same encoding and operands. Mark a couple other instructions as CodeGenOnly since we have FR and VR instructions and only one of them is needed by the assembler/disassembler. llvm-svn: 192274	2013-10-09 04:54:21 +00:00
Craig Topper	a328ee42cf	Use AVX512PIi8 for the alt forms of vcmp instructions. This adds the TB prefix and keeps the mnemonic from starting with an extra 'v' llvm-svn: 192272	2013-10-09 04:24:38 +00:00
Craig Topper	49d3319857	Mark some instructions as CodeGenOnly since they aren't needed by the assembler or disassembler. Disassembler already filtered them, but asm parser still had them in its tables. llvm-svn: 192271	2013-10-09 03:56:16 +00:00
Craig Topper	bc749db947	Add in64BitMode/in32BitMode to the MMX/SSE2/AVX maskmovq/dq instructions. This way the asm parser will pick the right one based on the mode. Instruction selection already did the right thing based on the pointer size. llvm-svn: 192266	2013-10-09 02:18:34 +00:00
Rafael Espindola	a17151ad5a	Add a MCTargetStreamer interface. This patch fixes an old FIXME by creating a MCTargetStreamer interface and moving the target specific functions for ARM, Mips and PPC to it. The ARM streamer is still declared in a common place because it is used from lib/CodeGen/ARMException.cpp, but the Mips and PPC are completely hidden in the corresponding Target directories. I will send an email to llvmdev with instructions on how to use this. llvm-svn: 192181	2013-10-08 13:08:17 +00:00
Craig Topper	a984729f8a	Remove unneeded MMX instruction definition by moving pattern to an equivalent instruction definition and removing the filtering from the disassembler table building. llvm-svn: 192175	2013-10-08 06:30:39 +00:00
Craig Topper	72c8cd7bc3	Remove some instructions that existed to provide aliases to the assembler. Can be done with InstAlias instead. Unfortunately, this was causing printer to use 'vmovq' or 'vmovd' based on what was parsed. To cleanup the inconsistencies convert all 'vmovd' with 64-bit registers to 'vmovq', but provide an alias so that 'vmovd' will still parse. llvm-svn: 192171	2013-10-08 05:53:50 +00:00
Benjamin Kramer	7b5e159450	X86: Fix type check. Just because an integer type is illegal doesn't mean it's i64. Fixes PR17495, where an i24 triggered this code. It's intended to optimize i64 loads on 32 bit x86. llvm-svn: 192123	2013-10-07 19:11:35 +00:00
Rafael Espindola	e90fd9c5e0	Remove getEHExceptionRegister and getEHHandlerRegister. They haven't been used for a long time. Patch by MathOnNapkins. llvm-svn: 192099	2013-10-07 13:39:22 +00:00
Craig Topper	07ad1b23bb	Remove some instructions that seem to only exist to trick the filtering checks in the disassembler table creation. Just fix up the filter to let the real instruction through instead. llvm-svn: 192090	2013-10-07 07:19:47 +00:00
Craig Topper	68d2546ec6	Remove FsMOVAPSrr and friends. They have no patterns and are no longer selected anywhere. llvm-svn: 192089	2013-10-07 06:10:45 +00:00
Craig Topper	a0e0735e6a	Teach X86 asm parser that VMOVAPSrr and other VEX-encoded register to register moves should be switched from using the MRMSrcReg form to the MRMDestReg form if the source register is a 64-bit extended register and the destination register is not. This allows the instruction to be encoded using the 2-byte VEX form instead of the 3-byte VEX form. The GNU assembler has similar behavior and instruction selection already does this. llvm-svn: 192088	2013-10-07 05:42:48 +00:00
Craig Topper	2658d89728	Add disassembler support for long encodings for INC/DEC in 32-bit mode. llvm-svn: 192086	2013-10-07 04:28:06 +00:00
Benjamin Kramer	858a3880d6	X86: Don't fold spills into SSE operations if the stack is unaligned. Regalloc can emit unaligned spills nowadays, but we can't fold the spills into SSE ops if we can't guarantee alignment. PR12250. llvm-svn: 192064	2013-10-06 13:48:22 +00:00
Elena Demikhovsky	2e408aefe0	AVX-512: added scalar convert instructions and intrinsics. Fixed load folding in VPERM2I instruction. llvm-svn: 192063	2013-10-06 13:11:09 +00:00
Elena Demikhovsky	462a2d235b	AVX-512: fixed shuffle lowering in case of BLEND and added VSHUFPS patterns. llvm-svn: 192055	2013-10-06 06:11:18 +00:00
Craig Topper	c81e29435a	Add TBM instructions to loading folding tables. llvm-svn: 192046	2013-10-05 20:20:51 +00:00
Nick Lewycky	3be42b8f06	Rename this feature to "cx16" to match gcc's flag name. Apparently these strings are directly tied to the flag names in clang with no remapping in between? llvm-svn: 192044	2013-10-05 20:11:44 +00:00
Craig Topper	9a9468ee02	Remove underscores from TBM instruction names for consistency with other instruction naming. llvm-svn: 192040	2013-10-05 19:27:26 +00:00
Craig Topper	52196640a2	Remove unneeded TBM intrinsics. The arithmetic/logical operation patterns are sufficient. llvm-svn: 192039	2013-10-05 19:22:59 +00:00
Craig Topper	80bd135e7a	Add an additional pattern for BLCI since opt can turn (not (add x, 1)) into (sub -2, x). llvm-svn: 192037	2013-10-05 17:17:53 +00:00
Elena Demikhovsky	85aeffaf5c	AVX-512: Fixed encoding of VMOVQ instruction. llvm-svn: 191889	2013-10-03 12:03:26 +00:00
Craig Topper	9eb8837ffa	Replace C++ style comment with a C style comment to satisfy some of the build bots. llvm-svn: 191880	2013-10-03 06:29:59 +00:00
Craig Topper	42e8a63e4f	Remove comma from the end of an enum. llvm-svn: 191877	2013-10-03 06:18:26 +00:00
Craig Topper	9e3e38ae3f	Add XOP disassembler support. Fixes PR13933. llvm-svn: 191874	2013-10-03 05:17:48 +00:00
Craig Topper	b01cd1aa74	Add patterns for selecting TBM instructions from logical operations. Patch from Yunzhong Gao. llvm-svn: 191871	2013-10-03 04:16:45 +00:00
Elena Demikhovsky	34586e7d41	AVX-512: fixed a bug in getLoadStoreRegOpcode() for AVX-512 target llvm-svn: 191818	2013-10-02 12:20:42 +00:00
Elena Demikhovsky	b30371cb6b	AVX-512: Added TB prefix to all instructions without prefixes, otherwise encoding fails after the last change in X86MCCodeEmitter.cpp. llvm-svn: 191812	2013-10-02 06:39:07 +00:00
Rafael Espindola	44fee4e0eb	Remove several unused variables. Patch by Alp Toker. llvm-svn: 191757	2013-10-01 13:32:03 +00:00
Elena Demikhovsky	3b75f5d282	AVX-512: Added X86vzmovl patterns llvm-svn: 191733	2013-10-01 08:38:02 +00:00
Craig Topper	766c934814	Remove 0 as a valid encoding for the m-mmmm field. llvm-svn: 191732	2013-10-01 07:10:28 +00:00
Craig Topper	8b278c5dc4	Remove unneeded fields from disassembler internal instruction format. llvm-svn: 191731	2013-10-01 06:56:57 +00:00
Craig Topper	3bf0317fec	BEXTR should be defined to take same type for bother operands. llvm-svn: 191728	2013-10-01 03:48:26 +00:00
Preston Gurd	f03a6e7fba	Forgot to add a break statement. llvm-svn: 191715	2013-09-30 23:51:22 +00:00
Preston Gurd	f0b6288cbf	The X86FixupLEAs pass for Intel Atom must not call convertToThreeAddress on ADD16rr opcodes, if src1 != src, since that would cause convertToThreeAddress to try to create a virtual register. This is not permitted after register allocation, which is when the X86FixupLEAs pass runs. This patch fixes PR16785. llvm-svn: 191711	2013-09-30 23:18:42 +00:00
Craig Topper	ed59dd34fd	Various x86 disassembler fixes. Add VEX_LIG to scalar FMA4 instructions. Use VEX_LIG in some of the inheriting checks in disassembler table generator. Make use of VEX_L_W, VEX_L_W_XS, VEX_L_W_XD contexts. Don't let VEX_L_W, VEX_L_W_XS, VEX_L_W_XD, VEX_L_W_OPSIZE inherit from their non-L forms unless VEX_LIG is set. Let VEX_L_W, VEX_L_W_XS, VEX_L_W_XD, VEX_L_W_OPSIZE inherit from all of their non-L or non-W cases. Increase ranking on VEX_L_W, VEX_L_W_XS, VEX_L_W_XD, VEX_L_W_OPSIZE so they get chosen over non-L/non-W forms. llvm-svn: 191649	2013-09-30 02:46:36 +00:00
Craig Topper	3aef88b1c7	Change type of XOP flag in code emitters to a bool. Remove a some unneeded cases from switch. llvm-svn: 191632	2013-09-29 08:33:34 +00:00
Craig Topper	e75666f47a	Add comments for XOPA map introduced with TBM instructions.a llvm-svn: 191630	2013-09-29 06:31:18 +00:00
Robert Wilhelm	f0cfb83bb4	Fix spelling intruction -> instruction. llvm-svn: 191610	2013-09-28 11:46:15 +00:00
Yunzhong Gao	b8bbcbfcc8	Adding intrinsics to the llvm backend for TBM instruction set. Phabricator code review is located here: http://llvm-reviews.chandlerc.com/D1750 llvm-svn: 191539	2013-09-27 18:38:42 +00:00
Craig Topper	dbe8b7d236	Put HasAVX512 predicate on some patterns to properly disable them when AVX512 isn't enabled. Currently it works simply because the SSE and AVX version of the same patterns are checked first in the DAG isel table. llvm-svn: 191490	2013-09-27 07:20:47 +00:00
Craig Topper	8f14de8f32	Switch HasAVX to UseAVX in one spot to ensure that AVX512 form of VINSERTPS is used in AVX512 mode. llvm-svn: 191489	2013-09-27 07:16:24 +00:00
Craig Topper	c6a1aac735	Removal some duplicate patterns. llvm-svn: 191488	2013-09-27 07:11:17 +00:00
Yunzhong Gao	4467f33e3c	Fixing Intel format of the vshufpd instruction. Phabricator code review is located at: http://llvm-reviews.chandlerc.com/D1759 llvm-svn: 191481	2013-09-27 01:44:23 +00:00
Andrew Trick	b6854d80e3	Mark the x86 machine model as incomplete. PR17367. Ideally, the machinel model is added at the time the instructions are defined. But many instructions in X86InstrSSE.td still need a model. Without this workaround the scheduler asserts because x86 already has itinerary classes for these instructions, indicating they should be modeled by the scheduler. Since we use the new machine model for other instructions, it expects a new machine model for these too. llvm-svn: 191391	2013-09-25 18:14:12 +00:00
David Majnemer	1ccd2f2aee	MC: Remove vestigial PCSymbol field from AsmInfo llvm-svn: 191362	2013-09-25 09:36:11 +00:00
Yunzhong Gao	dd36e9387b	Adding a feature flag to the llvm backend for x86 TBM instruction set. Adding TBM feature to bdver2 processor; piledriver supports this instruction set according to the following document: http://developer.amd.com/wordpress/media/2012/10/New-Bulldozer-and-Piledriver-Instructions.pdf Phabricator code review is located here: http://llvm-reviews.chandlerc.com/D1692 llvm-svn: 191324	2013-09-24 18:21:52 +00:00
Bill Wendling	c63c30c9a2	Followup to r191252. Make sure that the code that handles the constant addresses is run for the GEPs. This just refactors that code and then calls it for the GEPs that are collected during the iteration. <rdar://problem/12445434> llvm-svn: 191281	2013-09-24 07:19:30 +00:00
Bill Wendling	585a901a12	Selecting the address from a very long chain of GEPs can blow the stack. The recursive nature of the address selection code can cause the stack to explode if there is a long chain of GEPs. Convert the recursive bit into a iterative method to avoid this. <rdar://problem/12445434> llvm-svn: 191252	2013-09-24 00:13:08 +00:00
Tim Northover	31d093c705	ISelDAG: spot chain cycles involving MachineNodes Previously, the DAGISel function WalkChainUsers was spotting that it had entered already-selected territory by whether a node was a MachineNode (amongst other things). Since it's fairly common practice to insert MachineNodes during ISelLowering, this was not the correct check. Looking around, it seems that other nodes get their NodeId set to -1 upon selection, so this makes sure the same thing happens to all MachineNodes and uses that characteristic to determine whether we should stop looking for a loop during selection. This should fix PR15840. llvm-svn: 191165	2013-09-22 08:21:56 +00:00
David Majnemer	7b1cdb980b	X86: Use R_X86_64_TPOFF64 for FK_Data_8 Summary: LLVM would crash when trying to come up with a relocation type for assembly like: movabsq $V@TPOFF, %rax Instead, we say the relocation type is R_X86_64_TPOFF64. Fixes PR17274. Reviewers: dblaikie, nrieck, rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1717 llvm-svn: 191163	2013-09-22 05:30:16 +00:00
Juergen Ributzka	f043a65327	Revert "SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too." This reverts commit r191130. llvm-svn: 191138	2013-09-21 15:09:46 +00:00
Craig Topper	58f6e64e07	Remove alignment restrictions from FMA load folding. llvm-svn: 191136	2013-09-21 05:58:59 +00:00
Juergen Ributzka	c2551eb4ff	Fix the buildbot llvm-svn: 191133	2013-09-21 05:15:01 +00:00
Juergen Ributzka	ab930591c7	[X86] Emulate AVX 256bit MIN/MAX support by splitting the vector. In AVX 256bit vectors are valid vectors and therefore the Type Legalizer doesn't split the VSELECT and SETCC nodes. AVX only supports MIN/MAX on 128bit vectors and this fix enables vector splitting for this special case in the X86 DAG Combiner. This fix is related to PR16695, PR17002, and <rdar://problem/14594431>. llvm-svn: 191131	2013-09-21 04:55:22 +00:00
Juergen Ributzka	e9a80fc912	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. The Type Legalizer recognizes that VSELECT needs to be split, because the type is to wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask for the given target. This mask has usually te same size as the VSELECT return type (except for Intel KNL). Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. llvm-svn: 191130	2013-09-21 04:55:18 +00:00
Craig Topper	9a3915a74f	Lift alignment restrictions on load/store folding of VEXTRACTI128/VINSERTI128. llvm-svn: 191073	2013-09-20 05:37:49 +00:00
Yi Jiang	5c343de8d3	X86 horizontal vector reduction cost model llvm-svn: 191021	2013-09-19 17:48:48 +00:00
Tim Northover	97347a81bc	X86: FrameIndex addressing modes do have a base register. When selecting the DAG (add (WrapperRIP ...), (FrameIndex ...)), X86 code had spotted the FrameIndex possibility and was working out whether it could fold the WrapperRIP into this. The test for forming a %rip version is notionally whether we already have a base or index register (%rip precludes both), but we were forgetting to account for the register that would be inserted later to access the frame. rdar://problem/15024520 llvm-svn: 190995	2013-09-19 11:33:53 +00:00
Craig Topper	358c7989b1	Prevent extra calls to ToggleFeature for Feature64Bit and FeatureCMOV if they've already been enabled. The extra call ends up clearing the bit in FeatureBits since its a 'toggle'. Can't prove that anything was broken because of this since I don't think the FeatureBits for these are used. llvm-svn: 190920	2013-09-18 06:01:53 +00:00
Craig Topper	a8442344ed	Fix X86 subtarget to not overwrite the autodetected features by calling InitMCProcessorInfo right after detecting them. Instead add a new function that only updates the scheduling model and call that. llvm-svn: 190919	2013-09-18 05:54:09 +00:00
Craig Topper	98064b9f4d	Lift alignment restrictions for load/store folding on VINSERTF128/VEXTRACTF128. Fixes PR17268. llvm-svn: 190916	2013-09-18 03:55:53 +00:00
Reid Kleckner	c1e7621e01	COFF: Ensure that objects produced by LLVM link with /safeseh Summary: We indicate that the object files are safe by emitting a @feat.00 absolute address symbol. The address is presumably interpreted as a bitfield of features that the compiler would like to enable. Bit 0 is documented in the PE COFF spec to opt in to "registered SEH", which is what /safeseh enables. LLVM's object files are safe by default because LLVM doesn't know how to produce SEH handlers. Reviewers: Bigcheese CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1691 llvm-svn: 190898	2013-09-17 23:18:05 +00:00
Preston Gurd	ba6f9d1b7d	Remove unused code, which had been commented out. llvm-svn: 190869	2013-09-17 16:53:36 +00:00
Ben Langmuir	de39520f79	Add llvm.x86.* intrinsics for Intel SHA Extensions Add llvm.x86.* intrinsics for all of the Intel SHA Extensions instructions, as well as tests. Also remove mayLoad and hasSideEffects, which can be inferred from the instruction patterns. llvm-svn: 190864	2013-09-17 13:44:39 +00:00
Elena Demikhovsky	ac3e8eb9f0	AVX-512: Converted to Unix style llvm-svn: 190851	2013-09-17 07:34:34 +00:00
Craig Topper	514f02cc07	Add AES and SHA instructions to the load folding tables. llvm-svn: 190850	2013-09-17 06:50:11 +00:00
Craig Topper	684abc8236	Fix column alignment. No functional change. llvm-svn: 190849	2013-09-17 06:05:17 +00:00
Craig Topper	a6d204ec68	Make F16C feature flag imply AVX rather than just checking both at the patterns. llvm-svn: 190775	2013-09-16 04:29:58 +00:00
Ben Langmuir	8eb45a4ef6	Add the remaining Intel SHA instructions Also assembly/disassembly tests, and for sha256rnds2, aliases with an explicit xmm0 dependency. llvm-svn: 190754	2013-09-14 15:03:21 +00:00
Preston Gurd	3fe264d625	Adds support for Atom Silvermont (SLM) - -march=slm Implements Instruction scheduler latencies for Silvermont, using latencies from the Intel Silvermont Optimization Guide. Auto detects SLM. Turns on post RA scheduler when generating code for SLM. llvm-svn: 190717	2013-09-13 19:23:28 +00:00
Craig Topper	21a916b6db	Move operator to end of previous line to match coding standards. llvm-svn: 190659	2013-09-13 04:41:06 +00:00
Ben Langmuir	1650175de6	Partial support for Intel SHA Extensions (sha1rnds4) Add basic assembly/disassembly support for the first Intel SHA instruction 'sha1rnds4'. Also includes feature flag, and test cases. Support for the remaining instructions will follow in a separate patch. llvm-svn: 190611	2013-09-12 15:51:31 +00:00
Joey Gouly	0e76fa7df5	Add an instruction deprecation feature to TableGen. The 'Deprecated' class allows you to specify a SubtargetFeature that the instruction is deprecated on. The 'ComplexDeprecationPredicate' class allows you to define a custom predicate that is called to check for deprecation. For example: ComplexDeprecationPredicate<"MCR"> would mean you would have to define the following function: bool getMCRDeprecationInfo(MCInst &MI, MCSubtargetInfo &STI, std::string &Info) Which returns 'false' for not deprecated, and 'true' for deprecated and store the warning message in 'Info'. The MCTargetAsmParser constructor was chaned to take an extra argument of the MCInstrInfo class, so out-of-tree targets will need to be changed. llvm-svn: 190598	2013-09-12 10:28:05 +00:00
Elena Demikhovsky	8952974e29	AVX-512: implemented extractelement with variable index. Added parsing of mask register and "zeroing" semantic, like {%k1} {z}. llvm-svn: 190595	2013-09-12 08:55:00 +00:00
Bill Wendling	7b650a751f	Use the appropriate return type for the compact unwind encoding. llvm-svn: 190551	2013-09-11 21:47:57 +00:00
Bill Wendling	184d5d31bc	Move into an anonymous namespace and closer to where it's used. llvm-svn: 190547	2013-09-11 20:38:09 +00:00
Bill Wendling	f27e331510	Revert r190366. It was breaking build bots. llvm-svn: 190373	2013-09-10 00:20:27 +00:00
Bill Wendling	b07305fcd4	Use a default value for the prologue's debug location. llvm-svn: 190366	2013-09-09 23:28:15 +00:00
Bill Wendling	58e2d3d856	Generate compact unwind encoding from CFI directives. We used to generate the compact unwind encoding from the machine instructions. However, this had the problem that if the user used `-save-temps' or compiled their hand-written `.s' file (with CFI directives), we wouldn't generate the compact unwind encoding. Move the algorithm that generates the compact unwind encoding into the MCAsmBackend. This way we can generate the encoding whether the code is from a `.ll' or `.s' file. <rdar://problem/13623355> llvm-svn: 190290	2013-09-09 02:37:14 +00:00
Craig Topper	adbb9a121f	Add neverHasSideEffects=1 on a couple move instructions. llvm-svn: 190259	2013-09-08 00:50:45 +00:00
Craig Topper	0a63e1da92	Using popcount should check the popcount feature flag not the SSE41 feature flag. llvm-svn: 190258	2013-09-08 00:47:31 +00:00
Juergen Ributzka	53d0b492f5	[X86] Perform VSELECT DAG combines also before DAG type legalization. If the DAG already has only legal types, then the second round of DAG combines is skipped. In this case VSELECT+SETCC patterns that match a more efficient instruction (e.g. min/max) are never recognized. This fix allows VSELECT+SETCC combines if the types are already legal before DAG type legalization. Reviewer: Nadav llvm-svn: 190105	2013-09-05 23:02:56 +00:00
Kevin Enderby	09cdb4385f	Fixed a crash in the integrated assembler for Mach-O when a symbol difference expression uses an assembler temporary symbol from an assignment. In this case the symbol does not have a fragment so the use of getFragment() would be NULL and caused a crash. In the case of an assembler temporary symbol we want to use the AliasedSymbol (if any) which will create a local relocation entry, but if it is not an assembler temporary symbol then let it use that symbol with an external relocation entry. rdar://9356266 llvm-svn: 190096	2013-09-05 20:25:06 +00:00
Jim Grosbach	6c6b425b30	X86: Mark non-crashing report_fatal_errors() as such. Previously, the clang crash handling code would kick in and give a crash report for these, even though they're not that sort of error. rdar://14882264 llvm-svn: 189878	2013-09-03 23:02:00 +00:00
Bill Wendling	c656b8e402	WIP: Refactor some code so that it can be called by more than just one method. No functionality change. llvm-svn: 189849	2013-09-03 20:59:07 +00:00
Craig Topper	8a1028f75e	Add hadSideEffects=0 to some instructions. llvm-svn: 189779	2013-09-03 03:56:17 +00:00
Craig Topper	b25f0f5538	Create BEXTR instructions for (and ((sra or srl) x, imm), (2**size - 1)). Fixes PR17028. llvm-svn: 189742	2013-09-02 07:53:17 +00:00
Elena Demikhovsky	402ee64f13	AVX-512: updated the list of high-latency instructions. llvm-svn: 189740	2013-09-02 07:41:01 +00:00
Elena Demikhovsky	534015e550	AVX-512: gather-scatter tests; added foldable instructions; Specify GATHER/SCATTER as heavy instructions. llvm-svn: 189736	2013-09-02 07:12:29 +00:00
Elena Demikhovsky	4def4b088f	AVX-512: Added GATHER and SCATTER instructions. llvm-svn: 189729	2013-09-01 14:24:41 +00:00
Charles Davis	8bdfafd505	Move everything depending on Object/MachOFormat.h over to Support/MachO.h. llvm-svn: 189728	2013-09-01 04:28:48 +00:00
Richard Mitton	79917a913e	Build fix llvm-svn: 189699	2013-08-30 21:32:42 +00:00
Richard Mitton	576ee003d0	Fixed a bug where diassembling an instruction that had a prefix would cause LLVM to identify a 1-byte instruction, but then upon querying it for that 1-byte instruction would cause an undefined opcode. llvm-svn: 189698	2013-08-30 21:19:48 +00:00
Andrey Churbanov	3535e04483	Checking commit access; removed one space added in previous test checkin by Jim llvm-svn: 189673	2013-08-30 14:40:24 +00:00
Benjamin Kramer	8f429384b5	X86: Add a description of the Intel Atom Silvermont CPU. Currently this is just the atom model with SSE4.2 enabled. llvm-svn: 189669	2013-08-30 14:05:32 +00:00
Craig Topper	f78c19c3bb	Fixup BZHI selection to remove an unneeded zero extension. llvm-svn: 189656	2013-08-30 07:16:16 +00:00
Craig Topper	48a5d69ee1	Remove unused X86andn_flag node. llvm-svn: 189654	2013-08-30 07:06:26 +00:00
Craig Topper	0bccad2d43	Teach X86 backend to create BMI2 BZHI instructions from (and X, (add (shl 1, Y), -1)). Fixes PR17038. llvm-svn: 189653	2013-08-30 06:52:21 +00:00
Cameron Esfahani	943908b78d	Clean up some usage of Triple. The base class has methods for determining if the target is iOS and Linux. llvm-svn: 189604	2013-08-29 20:23:14 +00:00
Elena Demikhovsky	980c6b08b1	AVX-512: added extend and truncate instructions. llvm-svn: 189580	2013-08-29 11:56:53 +00:00
Kevin Enderby	74946758a0	The darwin integrated assembler for X86 in 64-bit mode is not rejecting 32-bit absolute addressing in instructions likei this: mov $_f, %rsi which is not supported in 64-bit mode. rdar://8827134 llvm-svn: 189543	2013-08-29 00:19:03 +00:00
Elena Demikhovsky	9a5ed9c3bd	AVX-512: added SQRT, VRSQRT14, VCOMISS, VUCOMISS, VRCP14, VPABS llvm-svn: 189472	2013-08-28 11:21:58 +00:00
Eric Christopher	62caa709fe	Remove support for the .debug_inlined section. No known software in use supports it. llvm-svn: 189439	2013-08-28 04:04:28 +00:00
NAKAMURA Takumi	19675898da	X86JITInfo.cpp: Apply x64 version of X86CompilationCallback() to Cygwin64. For now, (defined(X86_64_JIT) && defined(__CYGWIN__)) satisfies Cygwin64. llvm-svn: 189437	2013-08-28 03:04:09 +00:00
NAKAMURA Takumi	9ea7c6d463	X86Subtarget.h: Recognize x86_64-cygwin. In the LLVM side, x86_64-cygwin is almost as same as x86_64-mingw32. llvm-svn: 189436	2013-08-28 03:04:02 +00:00
David Majnemer	aa34d79ab5	[ms-inline asm] Support offsets after segment registers Summary: MASM let's you do stuff like 'MOV FS:20, EAX' and 'MOV EAX, FS:20' Reviewers: craig.topper, rnk Reviewed By: rnk CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1470 llvm-svn: 189407	2013-08-27 21:56:17 +00:00
Elena Demikhovsky	93eeb47d49	AVX-512: added conversion instructions. llvm-svn: 189349	2013-08-27 13:54:04 +00:00
Elena Demikhovsky	12f24673e0	AVX-512: Added FMA instructions. llvm-svn: 189326	2013-08-27 08:39:25 +00:00
Charles Davis	1827bd8a6c	Revert "Fix the build broken by r189315." and "Move everything depending on Object/MachOFormat.h over to Support/MachO.h." This reverts commits r189319 and r189315. r189315 broke some tests on what I believe are big-endian platforms. llvm-svn: 189321	2013-08-27 05:38:30 +00:00
Charles Davis	0c6f71b40d	Move everything depending on Object/MachOFormat.h over to Support/MachO.h. llvm-svn: 189315	2013-08-27 05:00:43 +00:00
Elena Demikhovsky	0a2b6290f1	AVX-512: Added shuffle instructions - VPSHUFD, VPERMILPS, VMOVDDUP, VMOVLHPS, VMOVHLPS, VSHUFPS, VALIGN single and double forms. llvm-svn: 189215	2013-08-26 12:45:35 +00:00
Craig Topper	6269f49505	Make sure x86 instructions using ssmem/sdmem operand types are only able to parse memory operands of the proper size in Intel syntax. Primarily affects some of sse cvt instructions. llvm-svn: 189206	2013-08-26 00:39:04 +00:00
Craig Topper	5500d83793	Remove some unnecessary PredicateMethod overrides. Add RenderMethod overrides to remove forwarding in the X86AsmParser code itself. No functional change. llvm-svn: 189205	2013-08-26 00:13:09 +00:00
Craig Topper	8c26c424c6	Put some of the AVX-512 parsing stuff in a more consistent place with the existing functions. llvm-svn: 189204	2013-08-25 23:18:05 +00:00
Craig Topper	1885417372	First round of fixes for the x86 fixes for the x86 move accumulator from/to memory offset instructions. -Assembly parser now properly check the size of the memory operation specified in intel syntax. So 'mov word ptr [5], al' is no longer accepted. -x86-32 disassembly of these instructions no longer sign extends the 32-bit address immediate based on size. -Intel syntax printing prints the ptr size and places brackets around the address immediate. Known remaining issues with these instructions: -Segment override prefix is not supported. PR16962 and PR16961. -Immediate size should be changed by address size prefix. llvm-svn: 189201	2013-08-25 22:23:38 +00:00
Elena Demikhovsky	f8f478b19d	AVX-512: added UNPACK instructions and tests for all-zero/all-ones vectors llvm-svn: 189189	2013-08-25 12:54:30 +00:00
Craig Topper	0714d51cba	Add hasSideEffects/mayLoad/mayStore flags to the X86 moffs8/moffs16/moffs32/moffs64 versions of move. llvm-svn: 189182	2013-08-24 20:31:14 +00:00
Craig Topper	092e2fe426	Remove trailing whitespace. llvm-svn: 189178	2013-08-24 19:50:11 +00:00
Rafael Espindola	94a2c5642d	Rename features to match what gcc and clang use. There is no advantage in being different and using the same names simplifies clang a bit. llvm-svn: 189141	2013-08-23 20:21:34 +00:00
Jim Cownie	b09bb1ce19	Checking commit access; added one space llvm-svn: 189111	2013-08-23 15:51:37 +00:00
Elena Demikhovsky	c35219e3ee	AVX-512: Added masked SHIFT commands, more encoding tests llvm-svn: 189005	2013-08-22 12:18:28 +00:00
Elena Demikhovsky	33d447a2d6	AVX-512: Added SHIFT instructions. llvm-svn: 188899	2013-08-21 09:36:02 +00:00
Craig Topper	77df9cdd0b	Synchronize VEX JIT encoding code with the MCJIT version. Fix a bug in the MCJIT code where CurOp was being incremented even if the operand it was pointing at wasn't used. Maybe only matters if there are any EVEX_K instructions that aren't VEX_4V. llvm-svn: 188868	2013-08-21 05:57:45 +00:00
Nadav Rotem	7efc04cb40	In LLVM FMA3 operands are dst, src1, src2, src3, however dst is not encoded as it is always src1. This was causing the encoding of the operands to be off by one. Patch by Chris Bieneman. llvm-svn: 188866	2013-08-21 05:03:10 +00:00
Craig Topper	5c94bb8551	Rename mattr names for AVX-512 to from avx-512 -> avx512f, avx-512-pfi -> av512pf, avx-512-cdi -> avx512cd, avx-512-eri->avx512er. This matches better with official docs and what gcc patches appearto be using. I didn't touch the has* functions or the feature flag names to avoid change the td and lowering file while commits are still happening. llvm-svn: 188859	2013-08-21 03:57:57 +00:00
NAKAMURA Takumi	de8880a23d	X86TargetMachine.cpp: Clarify to emit GOT in i686-{cygming\|win32}-elf for mcjit. I suppose all "lli -use-mcjit i686-*" should require GOT, (and to fail.) llvm-svn: 188856	2013-08-21 02:37:25 +00:00
Elena Demikhovsky	540d582594	AVX-512: Added more patterns for VMOVSS, VMOVSD, VMOVD, VMOVQ llvm-svn: 188786	2013-08-20 11:00:29 +00:00
Craig Topper	7a8cf01090	Fix formatting. No functional change. llvm-svn: 188746	2013-08-20 05:23:59 +00:00
Craig Topper	e13a066c94	Add AVX-512 and related features to the CPUID detection code. llvm-svn: 188745	2013-08-20 05:22:42 +00:00
Craig Topper	fd2b389263	Move AVX and non-AVX replication inside a couple multiclasses to avoid repeating each instruction for both individually. llvm-svn: 188743	2013-08-20 04:24:14 +00:00
Elena Demikhovsky	1490c5eb5b	AVX-512: added arithmetic and logical operations. ADD, SUB, MUL integer and FP types. OR, AND, XOR. Added embeded broadcast form for these instructions. llvm-svn: 188673	2013-08-19 13:26:14 +00:00
Elena Demikhovsky	3ce8dbbac2	AVX-512: Added VMOVD, VMOVQ, VMOVSS, VMOVSD instructions. llvm-svn: 188637	2013-08-18 13:08:57 +00:00
Craig Topper	e6861c9ce5	Make more of the lowering helpers static. Also use MVT instead of EVT in a couple places. llvm-svn: 188629	2013-08-18 08:53:01 +00:00
Craig Topper	8c929627d9	Don't use v16i32 for load pattern matching. All 512-bit loads are cated to v8i64. llvm-svn: 188534	2013-08-16 06:07:34 +00:00
Bill Wendling	2851907cdb	Constify the function parameters. llvm-svn: 188469	2013-08-15 18:46:14 +00:00
Craig Topper	8dbc7e9d35	Revert r188449 as it turns out we're just missing the instructions that need the v16i32/v16f32 matching. llvm-svn: 188454	2013-08-15 08:38:25 +00:00
Craig Topper	2ffd06528d	Don't let isPermImmMask handle v16i32 since VPERMI doesn't match on that type. Remove 128-bit vector handling from isPermImmMask too, it's covered by isPSHUFDMask. llvm-svn: 188449	2013-08-15 07:30:51 +00:00
Craig Topper	83e042a21b	Use MVT instead of EVT in X86ISelDAGToDAG since all the types should be legal. llvm-svn: 188446	2013-08-15 05:57:07 +00:00
Craig Topper	6f4dd2dacf	Use MVT in place of EVT in more X86 operation lowering functions. llvm-svn: 188445	2013-08-15 05:33:45 +00:00
Craig Topper	5671010cbb	Replace getValueType().getSimpleVT() with getSimpleValueType(). Also remove one weird cast from MVT->EVT just to call getSimpleVT(). llvm-svn: 188441	2013-08-15 02:33:50 +00:00
Craig Topper	d03748cf5e	Make more helper methods into static functions. llvm-svn: 188366	2013-08-14 07:53:41 +00:00
Craig Topper	7b7b159574	Remove tab characters. llvm-svn: 188365	2013-08-14 07:35:18 +00:00
Craig Topper	d905fded68	Make some helper methods static. llvm-svn: 188364	2013-08-14 07:34:43 +00:00
Craig Topper	60769e050d	Use MVT in more lowering code. llvm-svn: 188363	2013-08-14 07:04:42 +00:00
Craig Topper	52b00359b1	Replace EVT with MVT in isVectorShift. Keeps compiler from generating unneeded checks and handling for extended types. llvm-svn: 188362	2013-08-14 06:21:10 +00:00
Craig Topper	67476d7485	Replace EVT with MVT in many of the shuffle lowering functions. Keeps compiler from generating unneeded checks and handling for extended types. llvm-svn: 188361	2013-08-14 05:58:39 +00:00
Evgeniy Stepanov	7dee697faa	Fix compiler warnings. ../lib/Target/X86/X86ISelLowering.cpp:9715:7: error: unused variable 'OpVT' [-Werror,-Wunused-variable] EVT OpVT = Op0.getValueType(); ^ ../lib/Target/X86/X86ISelLowering.cpp:9763:14: error: unused variable 'NumElems' [-Werror,-Wunused-variable] unsigned NumElems = VT.getVectorNumElements(); llvm-svn: 188269	2013-08-13 14:04:20 +00:00
Elena Demikhovsky	60b1f289f2	AVX-512: Added CMP and BLEND instructions. Lowering for SETCC. llvm-svn: 188265	2013-08-13 13:24:07 +00:00
Kevin Enderby	b03f3fe4e8	Fix a crash with X86 Mach-O and a subtraction expression where both symbols are undefined and produce an error message instead as this is a non-relocatable expression with X86 Mach-O. rdar://8920876 llvm-svn: 188218	2013-08-12 22:45:44 +00:00
Elena Demikhovsky	5fed3b95db	AVX-512: Added more tests for BROADCAST llvm-svn: 188148	2013-08-11 12:29:16 +00:00
Elena Demikhovsky	cf5b1458e6	AVX-512: Added VPERM* instructons and MOV* zmm-to-zmm instructions. Added a test for shuffles using VPERM. llvm-svn: 188147	2013-08-11 07:55:09 +00:00
Benjamin Kramer	21585fd9c1	Add a overload to CostTable which allows it to infer the size of the table. Use it to avoid repeating ourselves too often. Also store MVT::SimpleValueType in the TTI tables so they can be statically initialized, MVT's constructors create bloated initialization code otherwise. llvm-svn: 188095	2013-08-09 19:33:32 +00:00
Michael J. Spencer	126973ba93	[Object] Split the ELF interface into 3 parts. * ELFTypes.h contains template magic for defining types based on endianess, size, and alignment. * ELFFile.h defines the ELFFile class which provides low level ELF specific access. * ELFObjectFile.h contains ELFObjectFile which uses ELFFile to implement the ObjectFile interface. llvm-svn: 188022	2013-08-08 22:27:13 +00:00
Jakub Staszak	9c34922ff2	Use pop_back() instead of pop_back_val() when the returned value is not used. llvm-svn: 187986	2013-08-08 15:48:46 +00:00
Jakub Staszak	b5ab81d5d0	Fix the comment. llvm-svn: 187984	2013-08-08 15:19:25 +00:00
Elena Demikhovsky	45c54ad8dc	AVX-512 set: Added BROADCAST instructions with lowering logic and a test. llvm-svn: 187884	2013-08-07 12:34:55 +00:00
Craig Topper	c5b0ad27ab	Simplify code. No functional change intended. llvm-svn: 187870	2013-08-07 08:16:07 +00:00
Tim Northover	a4415854db	Refactor isInTailCallPosition handling This change came about primarily because of two issues in the existing code. Niether of: define i64 @test1(i64 %val) { %in = trunc i64 %val to i32 tail call i32 @ret32(i32 returned %in) ret i64 %val } define i64 @test2(i64 %val) { tail call i32 @ret32(i32 returned undef) ret i32 42 } should be tail calls, and the function sameNoopInput is responsible. The main problem is that it is completely symmetric in the "tail call" and "ret" value, but in reality different things are allowed on each side. For these cases: 1. Any truncation should lead to a larger value being generated by "tail call" than needed by "ret". 2. Undef should only be allowed as a source for ret, not as a result of the call. Along the way I noticed that a mismatch between what this function treats as a valid truncation and what the backends see can lead to invalid calls as well (see x86-32 test case). This patch refactors the code so that instead of being based primarily on values which it recurses into when necessary, it starts by inspecting the type and considers each fundamental slot that the backend will see in turn. For example, given a pathological function that returned {{}, {{}, i32, {}}, i32} we would consider each "real" i32 in turn, and ask if it passes through unchanged. This is much closer to what the backend sees as a result of ComputeValueVTs. Aside from the bug fixes, this eliminates the recursion that's going on and, I believe, makes the bulk of the code significantly easier to understand. The trade-off is the nasty iterators needed to find the real types inside a returned value. llvm-svn: 187787	2013-08-06 09:12:35 +00:00
Craig Topper	cf969eadaf	Simplify vector lane handling math a bit. No functional change intended. llvm-svn: 187783	2013-08-06 07:23:12 +00:00
Craig Topper	7418ff460c	Simplify math a little bit. llvm-svn: 187781	2013-08-06 06:54:25 +00:00
NAKAMURA Takumi	aaf66c7357	Target//CMakeLists.txt: Add the dependency to CommonTableGen explicitly for each corresponding CodeGen. Without explicit dependencies, both per-file action and in-CommonTableGen action could run in parallel. It races to emit .inc files simultaneously. llvm-svn: 187780	2013-08-06 06:38:37 +00:00
Craig Topper	9bc00b65b6	Replace EVT with MVT in isHorizontalBinOp as it is only called with legal types. llvm-svn: 187779	2013-08-06 06:05:05 +00:00
Craig Topper	47d7c5c8fe	Simplify code slightly. No functional change. llvm-svn: 187771	2013-08-06 04:12:40 +00:00
Aaron Ballman	5b4634576e	Silencing an MSVC11 type conversion warning. llvm-svn: 187727	2013-08-05 13:47:03 +00:00
Elena Demikhovsky	40864b690b	AVX-512 set: added mask operations, lowering BUILD_VECTOR for i1 vector types. Added intrinsics and tests. llvm-svn: 187717	2013-08-05 08:52:21 +00:00
Benjamin Kramer	5bc180c14f	X86: Turn fp selects into mask operations. double test(double a, double b, double c, double d) { return a<b ? c : d; } before: _test: ucomisd %xmm0, %xmm1 ja LBB0_2 movaps %xmm3, %xmm2 LBB0_2: movaps %xmm2, %xmm0 after: _test: cmpltsd %xmm1, %xmm0 andpd %xmm0, %xmm2 andnpd %xmm3, %xmm0 orpd %xmm2, %xmm0 Small speedup on Benchmarks/SmallPT llvm-svn: 187706	2013-08-04 12:05:16 +00:00
Elena Demikhovsky	cd46691728	AVX-512 set: added VEXTRACTPS instruction llvm-svn: 187705	2013-08-04 10:46:07 +00:00
Tim Northover	ecc018c7b7	X86: correct tail return address calculation Due to the weird and wondeful usual arithmetic conversions, some calculations involving negative values were getting performed in uint32_t and then promoted to int64_t, which is really not a good idea. Patch by Katsuhiro Ueno. llvm-svn: 187703	2013-08-04 09:35:57 +00:00
Bill Wendling	a5c536e1ee	Use function attributes to indicate that we don't want to realign the stack. Function attributes are the future! So just query whether we want to realign the stack directly from the function instead of through a random target options structure. llvm-svn: 187618	2013-08-01 21:42:05 +00:00
Daniel Malea	a3d4245a72	Fixed the Intel-syntax X86 disassembler to respect the (existing) option for hexadecimal immediates, to match AT&T syntax. This also brings a new option for C-vs-MASM-style hex. Patch by Richard Mitton Reviewed: http://llvm-reviews.chandlerc.com/D1243 llvm-svn: 187614	2013-08-01 21:18:16 +00:00
Elena Demikhovsky	b1266b5447	EVEX and compressed displacement encoding for AVX512 llvm-svn: 187576	2013-08-01 13:34:06 +00:00
Elena Demikhovsky	b0a75431ad	Fixed assertion in Extract128BitVector() llvm-svn: 187493	2013-07-31 12:03:08 +00:00
Elena Demikhovsky	67b05fc0b3	Added INSERT and EXTRACT intructions from AVX-512 ISA. All insertf/extractf functions replaced with insert/extract since we have insertf and inserti forms. Added lowering for INSERT_VECTOR_ELT / EXTRACT_VECTOR_ELT for 512-bit vectors. Added lowering for EXTRACT/INSERT subvector for 512-bit vectors. Added a test. llvm-svn: 187491	2013-07-31 11:35:14 +00:00
Craig Topper	efd67d4612	Changed register names (and pointer keywords) to be lower case when using Intel X86 assembler syntax. Patch by Richard Mitton. llvm-svn: 187476	2013-07-31 02:47:52 +00:00
Craig Topper	75a5ba7ed0	Remove trailing whitespace and some tab characters. llvm-svn: 187472	2013-07-31 02:00:15 +00:00
Craig Topper	6e8cd80def	Fixed incorrect disassembly for MOV16o16a when using Intel syntax. Patch by Richard Mitton. llvm-svn: 187471	2013-07-31 01:50:26 +00:00
Nico Rieck	06d17c80cc	Proper va_arg/va_copy lowering on win64 Win64 uses CharPtrBuiltinVaList instead of X86_64ABIBuiltinVaList like other 64-bit targets. llvm-svn: 187355	2013-07-29 13:07:06 +00:00
Elena Demikhovsky	003e7d73b9	Added encoding prefixes for KNL instructions (EVEX). Added 512-bit operands printing. Added instruction formats for KNL instructions. llvm-svn: 187324	2013-07-28 08:28:38 +00:00
Justin Holewinski	d3f2035a3c	Add a target legalize hook for SplitVectorOperand (again) CustomLowerNode was not being called during SplitVectorOperand, meaning custom legalization could not be used by targets. This also adds a test case for NVPTX that depends on this custom legalization. Differential Revision: http://llvm-reviews.chandlerc.com/D1195 Attempt to fix the buildbots by making the X86 test I just added platform independent llvm-svn: 187202	2013-07-26 13:28:29 +00:00
Rafael Espindola	1d812728cc	Revert "Add a target legalize hook for SplitVectorOperand" This reverts commit 187198. It broke the bots. The soft float test probably needs a -triple because of name differences. On the hard float test I am getting a "roundss $1, %xmm0, %xmm0", instead of "vroundss $1, %xmm0, %xmm0, %xmm0". llvm-svn: 187201	2013-07-26 13:18:16 +00:00
Justin Holewinski	f848a24e50	Add a target legalize hook for SplitVectorOperand CustomLowerNode was not being called during SplitVectorOperand, meaning custom legalization could not be used by targets. This also adds a test case for NVPTX that depends on this custom legalization. Differential Revision: http://llvm-reviews.chandlerc.com/D1195 llvm-svn: 187198	2013-07-26 12:46:39 +00:00
Craig Topper	8ae6fb2932	Fix more Intel syntax issues with FP instruction aliases. Test cases coming in a subsequent patch. llvm-svn: 187187	2013-07-26 05:37:46 +00:00
Craig Topper	0f7807375a	Take advantage of the register enums being in order to remove a couple static tables. llvm-svn: 187182	2013-07-26 02:02:47 +00:00
Elena Demikhovsky	8cfb43f73b	I'm starting to commit KNL backend. I'll push patches one-by-one. This patch includes support for the extended register set XMM16-31, YMM16-31, ZMM0-31. The full ISA you can see here: http://software.intel.com/en-us/intel-isa-extensions llvm-svn: 187030	2013-07-24 11:02:47 +00:00
Craig Topper	690d8ea181	Split generated asm mnemonic matching table into a separate table for each asm variant. This removes the need to store the asm variant in each row of the single table that existed before. Shaves ~16K off the size of X86AsmParser.o. llvm-svn: 187026	2013-07-24 07:33:14 +00:00
Craig Topper	593b76de44	Fix aliases for shrd/shld to handle Intel syntax properly. Also suppress them from being used by the asm printer. llvm-svn: 187020	2013-07-24 04:38:13 +00:00
Craig Topper	6030a65039	Remove some errant space charcters in mnemonic strings. llvm-svn: 186932	2013-07-23 06:45:34 +00:00
Craig Topper	db90f65bbe	Don't let x86 asm printer use the no operand movsd alias. It should use the normal movsl instead. llvm-svn: 186924	2013-07-23 01:50:47 +00:00
Craig Topper	bf547cea0e	Revert r186907 to fix bots. llvm-svn: 186910	2013-07-23 01:29:37 +00:00
Craig Topper	80c310056c	Don't let x86 asm printer use the no operand movsd alias. It should use the normal movsl instead. llvm-svn: 186907	2013-07-23 01:21:36 +00:00
Craig Topper	c15a9e4fa4	Add aliases to map 'imm, mem' form of x86 bts/btr/btc without a size suffix to their 32-bit forms. This makes them consistent with 'bt' which already had this handling. gas has the same behavior. There have been discussions on the mailing list about determining size based on the immediate, but my goal here was just to remove the inconsistency. llvm-svn: 186904	2013-07-23 00:56:15 +00:00
Craig Topper	001582833a	Explicitly don't let the asm printer use the clrb/w/l aliases for xor %reg, %reg. It only didn't use it before because it seems InstAlias handling in the asm printer fails to count tied operands so it tried to find an xor with 2 operands instead of the 3 it wfails to count tied. llvm-svn: 186900	2013-07-23 00:15:19 +00:00
Craig Topper	c638f0cd4f	Suppress argumentless aliases for some x86 FP operations from being used by the asm writer. Prefer to use the explicit %st(1) form. llvm-svn: 186897	2013-07-23 00:03:33 +00:00
Kevin Enderby	285da02094	Fix the move to/from accumulator register instructions that use a full 64-bit absolute address encoded in the instruction. rdar://8612627 and rdar://14299221 llvm-svn: 186878	2013-07-22 21:25:31 +00:00
Craig Topper	998bcf9534	Recommit r186813: More Intel syntax alias fixes. With the addition of suppressing some of the aliases from being emitted by the asm printer. llvm-svn: 186869	2013-07-22 20:46:37 +00:00
Tim Northover	1a9dafcd6f	Revert "More Intel syntax alias fixes." This reverts commit r186813, which broke the bots. llvm-svn: 186818	2013-07-22 11:02:32 +00:00
Craig Topper	b095c09330	Fix typo. Change %cl to CL in Intel pattern. llvm-svn: 186815	2013-07-22 10:07:26 +00:00
Craig Topper	61da939a17	More Intel syntax alias fixes. llvm-svn: 186814	2013-07-22 09:58:07 +00:00
Craig Topper	1b9e4e7e9d	More Intel syntax alias fixes. llvm-svn: 186813	2013-07-22 09:42:31 +00:00
Craig Topper	03db790dc6	Change %xmm0 to XMM0 in Intel side of asm strings for PBLENDVB. llvm-svn: 186812	2013-07-22 09:22:49 +00:00
Craig Topper	8f9402a989	Add Intel variants to aliases for some FP instructions. llvm-svn: 186811	2013-07-22 09:18:43 +00:00
Craig Topper	d0ed3a417e	Reverse operands for Intel syntax form of 'bt' alias. llvm-svn: 186809	2013-07-22 07:47:51 +00:00
Craig Topper	8956fe0dbc	Mark that the _ftol2 function used by windows on x86 to handle fptoui modifies ECX. llvm-svn: 186787	2013-07-21 07:28:13 +00:00
Craig Topper	ad1fff9be7	Fix copy and paste bug from r186491 to make v2f64 use MOVAPD/MOVUPD as it should. llvm-svn: 186566	2013-07-18 07:16:44 +00:00
Craig Topper	55475d448b	Teach x86 fast-isel to use AVX opcodes for vector stores when AVX is enabled. llvm-svn: 186496	2013-07-17 06:58:23 +00:00
Craig Topper	4f55b0efd2	Make x86 fast-isel correctly choose between aligned and unaligned operations for vector stores. Fixes PR16640. llvm-svn: 186491	2013-07-17 05:57:45 +00:00
Juergen Ributzka	3d527d80b8	[X86] Use min/max to optimze unsigend vector comparison on X86 Use PMIN/PMAX for UGE/ULE vector comparions to reduce the number of required instructions. This trick also works for UGT/ULT, but there is no advantage in doing so. It wouldn't reduce the number of instructions and it would actually reduce performance. Reviewer: Ben radar:5972691 llvm-svn: 186432	2013-07-16 18:20:45 +00:00
Craig Topper	202fbc2c9b	Add 'static' keyword to some const arrays for consistency. llvm-svn: 186308	2013-07-15 06:54:12 +00:00
Craig Topper	b94011fd28	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186274	2013-07-14 04:42:23 +00:00
Arnold Schwaighofer	6042a261b8	X86 cost model: Add cost for vectorized gather/scather radar://14351991 llvm-svn: 186189	2013-07-12 19:16:07 +00:00
Benjamin Kramer	068a2253e9	X86: Shrink certain forms of movsx. In particular: movsbw %al, %ax --> cbtw movswl %ax, %eax --> cwtl movslq %eax, %rax --> cltq According to Intel's manual those have the same performance characteristics but come with a smaller encoding. llvm-svn: 186174	2013-07-12 18:06:44 +00:00
Stephen Lin	fda967fdea	X86: fold SSE2/AVX2 logical shift by immediate amount into zero vector when possible Patch by Andrea Di Biagio llvm-svn: 186165	2013-07-12 15:31:36 +00:00
Charles Davis	e8f297ca94	Target/X86: Add explicit Win64 and System V/x86-64 calling conventions. Summary: This patch adds explicit calling convention types for the Win64 and System V/x86-64 ABIs. This allows code to override the default, and use the Win64 convention on a target that wants to use SysV (and vice-versa). This is needed to implement the `ms_abi` and `sysv_abi` GNU attributes. Reviewers: CC: llvm-svn: 186144	2013-07-12 06:02:35 +00:00
Stephen Lin	73de7bf5de	AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in order to resolve the following issues with fmuladd (i.e. optional FMA) intrinsics: 1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd intrinsics even if the subtarget does not support FMA instructions, leading to laughably bad code generation in some situations. 2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128, resulting in a call to a software fp128 FMA implementation. 3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize, etc. to types that support hardware FMAs. The function has also been slightly renamed for consistency and to force a merge/build conflict for any out-of-tree target implementing it. To resolve, see comments and fixed in-tree examples. llvm-svn: 185956	2013-07-09 18:16:56 +00:00
Jim Grosbach	340b6da4f2	X86: Add comment. llvm-svn: 185900	2013-07-09 02:07:28 +00:00
Jim Grosbach	c35388f103	X86 fast-isel: Avoid explicit AH subreg reference for [SU]Rem. Explicit references to %AH for an i8 remainder instruction can lead to references to %AH in a REX prefixed instruction, which causes things to blow up. Do the same thing in FastISel as we do for DAG isel and instead shift %AX right by 8 bits and then extract the 8-bit subreg from that result. rdar://14203849 http://llvm.org/bugs/show_bug.cgi?id=16105 llvm-svn: 185899	2013-07-09 02:07:25 +00:00
Nico Rieck	51969be724	Reuse %rax after calling __chkstk on win64 Reapply this as I reverted the wrong commit. llvm-svn: 185807	2013-07-08 11:20:11 +00:00
Nico Rieck	4801303ce1	Revert "Proper va_arg/va_copy lowering on win64" This reverts commit 2b52880592a525cfe04d8f9008a35da8c2ea94c3. Needs review. llvm-svn: 185806	2013-07-08 11:19:44 +00:00
Nico Rieck	43b51056d6	Revert "Reuse %rax after calling __chkstk on win64" This reverts commit 01f8d579f7672872324208ac5bc4ac311e81b22e. llvm-svn: 185781	2013-07-08 01:30:57 +00:00
Nico Rieck	7adf6111a8	Reuse %rax after calling __chkstk on win64 llvm-svn: 185778	2013-07-07 16:48:39 +00:00
Nico Rieck	99ef2890c0	Proper va_arg/va_copy lowering on win64 llvm-svn: 185763	2013-07-06 18:08:19 +00:00
Jakob Stoklund Olesen	db429d9483	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185625	2013-07-04 13:54:20 +00:00
Jakob Stoklund Olesen	a1f5b901a5	Revert r185595-185596 which broke buildbots. Revert "Simplify landing pad lowering." Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes." llvm-svn: 185600	2013-07-04 00:26:30 +00:00
Jakob Stoklund Olesen	f33ec531fa	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185596	2013-07-03 23:56:31 +00:00
Craig Topper	31ee5866de	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185540	2013-07-03 15:07:05 +00:00
Ulrich Weigand	2b6fc8d613	[DebugInfo] Allow getDebugThreadLocalSymbol to return MCExpr This allows getDebugThreadLocalSymbol to return a generic MCExpr instead of just a MCSymbolRefExpr. This is in preparation for supporting debug info for TLS variables on PowerPC, where we need to describe the variable location using a more complex expression than just MCSymbolRefExpr. llvm-svn: 185460	2013-07-02 18:47:09 +00:00
David Blaikie	1b01ae8648	PR16493: DebugInfo with TLS on PPC crashing due to invalid relocation Restrict the current TLS support to X86 ELF for now. Test that we don't produce it on PPC & we can flesh that test case out with the right thing once someone implements it. llvm-svn: 185389	2013-07-01 21:45:25 +00:00
Ahmed Bougacha	8347352e11	X86: POP*rmm: move address operand to (ins) from (outs). llvm-svn: 185292	2013-06-30 20:44:50 +00:00
Chad Rosier	ee740c4d88	Fix an off-by-one error. Also make the code a little more explicit in what it is trying to do. llvm-svn: 185191	2013-06-28 18:57:01 +00:00
David Blaikie	c3ccdbe2bf	Integrate Assembler: Support X86_64_DTPOFF64 relocations llvm-svn: 185131	2013-06-28 04:24:32 +00:00
Nadav Rotem	02dd93ec1a	Get rid of the unused class member. llvm-svn: 185086	2013-06-27 17:54:10 +00:00
Nadav Rotem	f9ecbcb835	CostModel: improve the cost model for load/store of non power-of-two types such as <3 x float>, which are popular in graphics. llvm-svn: 185085	2013-06-27 17:52:04 +00:00
Benjamin Kramer	02ff1cd015	Don't cast away constness. llvm-svn: 185071	2013-06-27 11:07:42 +00:00
Elena Demikhovsky	6769c50d9e	Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector. llvm-svn: 184931	2013-06-26 10:55:03 +00:00
Arnold Schwaighofer	a04b9ef1e8	X86 cost model: Vectorizing integer division is a bad idea radar://14057959 llvm-svn: 184872	2013-06-25 19:14:09 +00:00
Andrew Trick	121124acf8	Revert "Temporarily enable MI-Sched on X86." This reverts commit 98a9b72e8c56dc13a2617de84503a3d78352789c. llvm-svn: 184823	2013-06-25 02:48:58 +00:00
Andrew Trick	5a1e0af838	Temporarily enable MI-Sched on X86. Sorry for the unit test churn. I'll try to make the change permanently next time. llvm-svn: 184705	2013-06-24 09:13:20 +00:00
Andrew Trick	47740deb26	Add MI-Sched support for x86 macro fusion. This is an awful implementation of the target hook. But we don't have abstractions yet for common machine ops, and I don't see any quick way to make it table-driven. llvm-svn: 184664	2013-06-23 09:00:28 +00:00
Chad Rosier	295bd43adb	The getRegForInlineAsmConstraint function should only accept MVT value types. llvm-svn: 184642	2013-06-22 18:37:38 +00:00
David Blaikie	97c6c5bd98	DebugInfo: Don't lose unreferenced non-trivial by-value parameters A FastISel optimization was causing us to emit no information for such parameters & when they go missing we end up emitting a different function type. By avoiding that shortcut we not only get types correct (very important) but also location information (handy) - even if it's only live at the start of a function & may be clobbered later. Reviewed/discussion by Evan Cheng & Dan Gohman. llvm-svn: 184604	2013-06-21 22:56:30 +00:00
Andrew Trick	7201f4f7ec	Fix IMULX machine model. Multiple def operands require multiple SchedWrites. llvm-svn: 184566	2013-06-21 18:33:04 +00:00
Kevin Enderby	35fd79237f	Update the X86 disassembler to use xacquire and xrelease when appropriate. This is a bit tricky as the xacquire and xrelease hints use the same bytes, 0xf2 and 0xf3, as the repne and rep prefixes. Fortunately llvm has different llvm MCInst Opcode enums for rep/xrelease and repne/xacquire. So to make this work a boolean was added the InternalInstruction struct as part of the Prefix state which is set with the added logic in readPrefixes() when decoding an instruction to determine if these prefix bytes are to be disassembled as xacquire or xrelease. Then we let the matcher pick the normal prefix instructionID and we change the Opcode after that when it is set into the MCInst being created. rdar://11019859 llvm-svn: 184490	2013-06-20 22:32:18 +00:00
Bill Wendling	a3cd350249	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184360	2013-06-19 21:36:55 +00:00
Bill Wendling	afc1036f3e	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184349	2013-06-19 20:51:24 +00:00
Nadav Rotem	7d6c625235	Fix 80 col violation. llvm-svn: 184228	2013-06-18 20:41:52 +00:00
Stefanus Du Toit	8811ad4f81	Add support for encoding the HLE XACQUIRE and XRELEASE prefixes. For decoding, keep the current behavior of always decoding these as their REP versions. In the future, this could be improved to recognize the cases where these behave as XACQUIRE and XRELEASE and decode them as such. llvm-svn: 184207	2013-06-18 17:08:10 +00:00
Bill Wendling	bc07a8900c	Use pointers to the MCAsmInfo and MCRegInfo. Someone may want to do something crazy, like replace these objects if they change or something. No functionality change intended. llvm-svn: 184175	2013-06-18 07:20:20 +00:00
David Blaikie	b735b4d6db	DebugInfo: remove target-specific Frame Index handling for DBG_VALUE MachineInstrs Frame index handling is now target-agnostic, so delete the target hooks for creation & asm printing of target-specific addressing in DBG_VALUEs and any related functions. llvm-svn: 184067	2013-06-16 20:34:27 +00:00
Andrew Trick	40c4f38071	Support BufferSize on ProcResGroup for unified MOp schedulers. And add Sandybridge/Haswell resource buffers. llvm-svn: 184034	2013-06-15 04:50:06 +00:00
Andrew Trick	18dc3da855	Update machine models. Specify buffer sizes for OOO processors. llvm-svn: 184033	2013-06-15 04:50:02 +00:00
Andrew Trick	de2109eb4c	Machine Model: Add MicroOpBufferSize and resource BufferSize. Replace the ill-defined MinLatency and ILPWindow properties with with straightforward buffer sizes: MCSchedMode::MicroOpBufferSize MCProcResourceDesc::BufferSize These can be used to more precisely model instruction execution if desired. Disabled some misched tests temporarily. They'll be reenabled in a few commits. llvm-svn: 184032	2013-06-15 04:49:57 +00:00
Benjamin Kramer	b289319fb8	X86: cvtpi2ps is just an SSE instruction with MMX operands. It has no AVX equivalent. Give it the right register format so we can also emit it when AVX is enabled. llvm-svn: 183971	2013-06-14 09:31:41 +00:00
Benjamin Kramer	af6c3b7002	X86: Make the cmov aliases work with intel syntax too. llvm-svn: 183907	2013-06-13 15:45:24 +00:00
Patrik Hagglund	0cc888b786	Fix gcc -flto build, by adding LLVM_ATTRIBUTE_USED to LLVMX86CompilationCallback2, called from inline asm. llvm-svn: 183824	2013-06-12 08:45:39 +00:00
Eric Christopher	5331f0eeb8	Correct the def registers for the 8bit x86 divide instructions to match the comments and what the instruction actually does. Noticed on inspection. llvm-svn: 183809	2013-06-11 23:41:44 +00:00
Eric Christopher	8f6a083be7	Use the Copy we defined above here. llvm-svn: 183808	2013-06-11 23:41:41 +00:00
Tim Northover	6833e3fd75	X86: Stop LEA64_32r doing unspeakable things to its arguments. Previously LEA64_32r went through virtually the entire backend thinking it was using 32-bit registers until its blissful illusions were cruelly snatched away by MCInstLower and 64-bit equivalents were substituted at the last minute. This patch makes it behave normally, and take 64-bit registers as sources all the way through. Previous uses (for 32-bit arithmetic) are accommodated via SUBREG_TO_REG instructions which make the types and classes agree properly. llvm-svn: 183693	2013-06-10 20:43:49 +00:00
Elena Demikhovsky	89703c06f2	Removed PackedDouble domain from scalar instructions. Added more formats for the scalar stuff. llvm-svn: 183626	2013-06-09 07:37:10 +00:00
Bill Wendling	8f26840c5a	Don't cache the instruction and register info from the TargetMachine, because the internals of TargetMachine could change. No functionality change intended. llvm-svn: 183571	2013-06-07 21:00:34 +00:00
Bill Wendling	52ca447fdd	Remove unused c'tor. llvm-svn: 183570	2013-06-07 20:59:31 +00:00
Bill Wendling	f77190855d	Cache the TargetLowering info object as a pointer. Caching it as a pointer allows us to reset it if the TargetMachine object changes. llvm-svn: 183361	2013-06-06 00:43:09 +00:00
Michael Liao	00b20cc924	[PATCH] Fix VGATHER* operand constraints Add earlyclobber constaints to prevent input register being allocated as the output register because, according to Intel spec [1], "If any pair of the index, mask, or destination registers are the same, this instruction results a UD fault." --- [1] http://software.intel.com/sites/default/files/319433-014.pdf llvm-svn: 183327	2013-06-05 18:12:26 +00:00
Rafael Espindola	806f006490	Handle relocations that don't point to symbols. In ELF (as in MachO), not all relocations point to symbols. Represent this properly by using a symbol_iterator instead of a SymbolRef. Update llvm-readobj ELF's dumper to handle relocatios without symbols. llvm-svn: 183284	2013-06-05 01:33:53 +00:00
Ahmed Bougacha	05d53a018a	X86: sub_xmm registers are 128 bits wide. llvm-svn: 183103	2013-06-03 14:42:40 +00:00
Tim Northover	339bf154cc	Revert r183069: "TMP: LEA64_32r fixing" Very sorry, it was committed from the wrong branch by mistake. llvm-svn: 183070	2013-06-01 10:23:46 +00:00
Tim Northover	57954f04b3	TMP: LEA64_32r fixing llvm-svn: 183069	2013-06-01 10:21:54 +00:00
Tim Northover	3a1fd4c0ac	X86: change MOV64ri64i32 into MOV32ri64 The MOV64ri64i32 instruction required hacky MCInst lowering because it was allocated as setting a GR64, but the eventual instruction ("movl") only set a GR32. This converts it into a so-called "MOV32ri64" which still accepts a (appropriate) 64-bit immediate but defines a GR32. This is then converted to the full GR64 by a SUBREG_TO_REG operation, thus keeping everyone happy. This fixes a typo in the opcode field of the original patch, which should make the legact JIT work again (& adds test for that problem). llvm-svn: 183068	2013-06-01 09:55:14 +00:00
Eric Christopher	e1e57e5ebd	Temporarily Revert "X86: change MOV64ri64i32 into MOV32ri64" as it seems to have caused PR16192 and other JIT related failures. llvm-svn: 183059	2013-05-31 23:30:45 +00:00
Ahmed Bougacha	f1ed334d55	Add a way to define the bit range covered by a SubRegIndex. NOTE: If this broke your out-of-tree backend, in *RegisterInfo.td, change the instances of SubRegIndex that have a comps template arg to use the ComposedSubRegIndex class instead. In TableGen land, this adds Size and Offset attributes to SubRegIndex, and the ComposedSubRegIndex class, for which the Size and Offset are computed by TableGen. This also adds an accessor in MCRegisterInfo, and Size/Offsets for the X86 and ARM subreg indices. llvm-svn: 183020	2013-05-31 17:08:36 +00:00
Tim Northover	d4736d67f4	X86: change MOV64ri64i32 into MOV32ri64 The MOV64ri64i32 instruction required hacky MCInst lowering because it was allocated as setting a GR64, but the eventual instruction ("movl") only set a GR32. This converts it into a so-called "MOV32ri64" which still accepts a (appropriate) 64-bit immediate but defines a GR32. This is then converted to the full GR64 by a SUBREG_TO_REG operation, thus keeping everyone happy. llvm-svn: 182991	2013-05-31 09:57:13 +00:00
Tim Northover	64ec0ff433	X86: use sub-register sequences for MOV*r0 operations Instead of having a bunch of separate MOV8r0, MOV16r0, ... pseudo-instructions, it's better to use a single MOV32r0 (which will expand to "xorl %reg, %reg") and obtain other sizes with EXTRACT_SUBREG and SUBREG_TO_REG. The encoding is smaller and partial register updates can sometimes be avoided. Until recently, this sequence was a barrier to rematerialization though. That should now be fixed so it's an appropriate time to make the change. llvm-svn: 182928	2013-05-30 13:19:42 +00:00
Tim Northover	04eb4234fc	X86: change zext moves to use sub-register infrastructure. 32-bit writes on amd64 zero out the high bits of the corresponding 64-bit register. LLVM makes use of this for zero-extension, but until now relied on custom MCLowering and other code to fixup instructions. Now we have proper handling of sub-registers, this can be done by creating SUBREG_TO_REG instructions at selection-time. Should be no change in functionality. llvm-svn: 182921	2013-05-30 10:43:18 +00:00
Andrew Trick	ad6d08ac6f	Order CALLSEQ_START and CALLSEQ_END nodes. Fixes PR16146: gdb.base__call-ar-st.exp fails after pre-RA-sched=source fixes. Patch by Xiaoyi Guo! This also fixes an unsupported dbg.value test case. Codegen was previously incorrect but the test was passing by luck. llvm-svn: 182885	2013-05-29 22:03:55 +00:00
Ahmed Bougacha	00e08db393	X86: Fix Defs/Uses for insts that imp-def/imp-use both an A-register and EFLAGS. This corrects a problem where x86 instructions that implicitly define/use both an A-register (RAX, EAX, ..) and EFLAGS were declared as only defining/using EFLAGS, because the outer "let Defs/Uses = [EFLAGS]" in the various multiclasses overrides the "let Defs/Uses = [areg]" in BinOpAI. The instructions deriving from BinOpAI were moved out of the "let Defs", and a BinOpAI_FF class was created, for instructions that implicitly define and use EFLAGS and the A-register (SBC, ADC). llvm-svn: 182883	2013-05-29 21:13:57 +00:00
Chad Rosier	33b736626e	Don't assume the registers will be enumerated sequentially. llvm-svn: 182879	2013-05-29 20:42:21 +00:00
Andrew Trick	e2431c64bc	Track IR ordering of SelectionDAG nodes 3/4. Remove the old IR ordering mechanism and switch to new one. Fix unit test failures. llvm-svn: 182704	2013-05-25 03:08:10 +00:00
Andrew Trick	ef9de2a739	Track IR ordering of SelectionDAG nodes 2/4. Change SelectionDAG::getXXXNode() interfaces as well as call sites of these functions to pass in SDLoc instead of DebugLoc. llvm-svn: 182703	2013-05-25 02:42:55 +00:00
Quentin Colombet	f482805c28	Follow up of the introduction of MCSymbolizer. - Ressurect old MCDisassemble API to soften transition. - Extend MCTargetDesc to set target specific symbolizer. llvm-svn: 182688	2013-05-24 22:51:52 +00:00
Michael J. Spencer	df1ecbd734	Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. llvm-svn: 182680	2013-05-24 22:23:49 +00:00
Ahmed Bougacha	ad1084de84	Add MCSymbolizer for symbolic/annotated disassembly. This is a basic first step towards symbolization of disassembled instructions. This used to be done using externally provided (C API) callbacks. This patch introduces: - the MCSymbolizer class, that mimics the same functions that were used in the X86 and ARM disassemblers to symbolize immediate operands and to annotate loads based off PC (for things like c string literals). - the MCExternalSymbolizer class, which implements the old C API. - the MCRelocationInfo class, which provides a way for targets to translate relocations (either object::RelocationRef, or disassembler C API VariantKinds) to MCExprs. - the MCObjectSymbolizer class, which does symbolization using what it finds in an object::ObjectFile. This makes simple symbolization (with no fancy relocation stuff) work for all object formats! - x86-64 Mach-O and ELF MCRelocationInfos. - A basic ARM Mach-O MCRelocationInfo, that provides just enough to support the C API VariantKinds. Most of what works in otool (the only user of the old symbolization API that I know of) for x86-64 symbolic disassembly (-tvV) works, namely: - symbol references: call _foo; jmp 15 <_foo+50> - relocations: call _foo-_bar; call _foo-4 - __cf?string: leaq 193(%rip), %rax ## literal pool for "hello" Stub support is the main missing part (because libObject doesn't know, among other things, about mach-o indirect symbols). As for the MCSymbolizer API, instead of relying on the disassemblers to call the tryAdding* methods, maybe this could be done automagically using InstrInfo? For instance, even though PC-relative LEAs are used to get the address of string literals in a typical Mach-O file, a MOV would be used in an ELF file. And right now, the explicit symbolization only recognizes PC-relative LEAs. InstrInfo should have already have most of what is needed to know what to symbolize, so this can definitely be improved. I'd also like to remove object::RelocationRef::getValueString (it seems only used by relocation printing in objdump), as simply printing the created MCExpr is definitely enough (and cleaner than string concats). llvm-svn: 182625	2013-05-24 00:39:57 +00:00
Chad Rosier	abdb1d69ab	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182531	2013-05-22 23:17:36 +00:00
Nadav Rotem	7b66c47051	X86: Fix a bug in EltsFromConsecutiveLoads. We can't generate new loads without chains. llvm-svn: 182507	2013-05-22 19:28:41 +00:00
Benjamin Kramer	d76cc186fc	X86: When expanding PCMPGTQ to PCMPGTD we always want to compare the lower halves as unsigned. Take #2 on fixing PR15977. llvm-svn: 182486	2013-05-22 17:01:12 +00:00
David Majnemer	7ea2a52a0c	X86: Remove test instructions proceeding shift by immediate instructions Allow LLVM to take advantage of shift instructions that set the ZF flag, making instructions that test the destination superfluous. llvm-svn: 182454	2013-05-22 08:13:02 +00:00
Elena Demikhovsky	0dd4025ae9	removed commented lines llvm-svn: 182377	2013-05-21 13:27:44 +00:00
Elena Demikhovsky	fad029202f	Removed SSEPacked domain from all forms (AVX, SSE, signed, unsigned) scalar compare instructions, like COMISS, COMISD. No functional changes. llvm-svn: 182371	2013-05-21 12:04:22 +00:00
Benjamin Kramer	18ef6b22b9	X86: When emulating unsigned PCMPGTQ with PCMPGTD, fix the sign bit for the smaller type. Otherwise we'll get a mix of signed and unsigned compares. Fixes PR15977. llvm-svn: 182364	2013-05-21 09:58:54 +00:00
David Majnemer	5ba473afb0	X86: Bad peephole interaction between adc, MOV32r0 The peephole tries to reorder MOV32r0 instructions such that they are before the instruction that modifies EFLAGS. The problem is that the peephole does not consider the case where the instruction that modifies EFLAGS also depends on the previous state of EFLAGS. Instead, walk backwards until we find an instruction that has a def for EFLAGS but does not have a use. If we find such an instruction, insert the MOV32r0 before it. If it cannot find such an instruction, skip the optimization. llvm-svn: 182184	2013-05-18 01:02:03 +00:00
Matt Arsenault	75865923c9	Add LLVMContext argument to getSetCCResultType llvm-svn: 182180	2013-05-18 00:21:46 +00:00
Benjamin Kramer	fc33e1d99b	X86: Make shuffle -> shift conversion more aggressive about undefs. Shuffles that only move an element into position 0 of the vector are common in the output of the loop vectorizer and often generate suboptimal code when SSSE3 is not available. Lower them to vector shifts if possible. We still prefer palignr over psrldq because it has higher throughput on sandybridge. llvm-svn: 182102	2013-05-17 14:48:34 +00:00
Rafael Espindola	b08d2c2db0	Remove addFrameMove. Now that we have good testing, remove addFrameMove and create cfi instructions directly. llvm-svn: 182052	2013-05-16 21:02:15 +00:00
Rafael Espindola	7242186b10	Delete dead code. llvm-svn: 181982	2013-05-16 04:59:17 +00:00
Rafael Espindola	84ee6c40a8	Delete dead code. llvm-svn: 181941	2013-05-15 22:27:35 +00:00
David Majnemer	8f16974273	X86: Remove redundant test instructions Increase the number of instructions LLVM recognizes as setting the ZF flag. This allows us to remove test instructions that redundantly recalculate the flag. llvm-svn: 181937	2013-05-15 22:03:08 +00:00
Eric Christopher	b27cd8bea6	Reapply "Subtract isn't commutative, fix this for MMX psub." with a somewhat randomly chosen cpu that will minimize cpu specific differences on bots. llvm-svn: 181814	2013-05-14 18:33:40 +00:00
Eric Christopher	3eee7454cf	Temporarily revert "Subtract isn't commutative, fix this for MMX psub." It's causing failures on the atom bot. llvm-svn: 181812	2013-05-14 18:20:42 +00:00
Eric Christopher	0344f495f9	Subtract isn't commutative, fix this for MMX psub. Patch by Andrea DiBiagio. llvm-svn: 181809	2013-05-14 17:52:05 +00:00
Duncan Sands	0480b9b54e	Suppress GCC compiler warnings in release builds about variables that are only read in asserts. llvm-svn: 181689	2013-05-13 07:50:47 +00:00
Rafael Espindola	227144c23c	Remove the MachineMove class. It was just a less powerful and more confusing version of MCCFIInstruction. A side effect is that, since MCCFIInstruction uses dwarf register numbers, calls to getDwarfRegNum are pushed out, which should allow further simplifications. I left the MachineModuleInfo::addFrameMove interface unchanged since this patch was already fairly big. llvm-svn: 181680	2013-05-13 01:16:13 +00:00
Rafael Espindola	1b09836bc3	Change getFrameMoves to return a const reference. To add a frame now there is a dedicated addFrameMove which also takes care of constructing the move itself. llvm-svn: 181657	2013-05-11 02:38:11 +00:00
Chad Rosier	c8569cba93	[ms-inline asm] Fix a crasher when we fail on a direct match. The issue was that the MatchingInlineAsm and VariantID args to the MatchInstructionImpl function weren't being set properly. Specifically, when parsing intel syntax, the parser thought it was parsing inline assembly in the at&t dialect; that will never be the case. The crash was caused when the emitter tried to emit the instruction, but the operands weren't set. When parsing inline assembly we only set the opcode, not the operands, which is used to lookup the instruction descriptor. rdar://13854391 and PR15945 Also, this commit reverts r176036. Now that we're correctly parsing the intel syntax the pushad/popad don't match properly. I've reimplemented that fix using a MnemonicAlias. llvm-svn: 181620	2013-05-10 18:24:17 +00:00
Rafael Espindola	140a837acd	Remove unused argument. llvm-svn: 181618	2013-05-10 18:16:59 +00:00
Chad Rosier	edb1dc8498	[x86AsmParser] It's valid to stop parsing an operand at an immediate. rdar://13854369 and PR15944 llvm-svn: 181564	2013-05-09 23:48:53 +00:00
Bill Wendling	07fe235e2b	Generate a compact unwind encoding in the face of a stack alignment push. We generate a `push' of a random register (%rax) if the stack needs to be aligned by the size of that register. However, this could mess up compact unwind generation. In particular, we want to still generate compact unwind in the presence of this monstrosity. Check if the push of of the %rax/%eax register. If it is and it's marked with the `FrameSetup' flag, then we can generate a compact unwind encoding for the function only if the push is the last FrameSetup instruction. llvm-svn: 181540	2013-05-09 20:10:38 +00:00
Bill Wendling	98d5c52d2e	Simplify the code a bit. The compact unwind registers were defined in two different places. It's better just to place them in the function that uses them and specify that this is a 64-bit or 32-bit machine. No functionality change. llvm-svn: 181529	2013-05-09 18:21:45 +00:00
Preston Gurd	9264c95400	Corrected Atom latencies for SSE SQRT instructions. llvm-svn: 181346	2013-05-07 19:57:34 +00:00
Michael Kuperstein	1a0c91f73b	Re-enable AVX detection on x64 platforms. llvm-svn: 181313	2013-05-07 14:05:33 +00:00
David Majnemer	66fb70de38	Remove a recently redundant transform from X86ISelLowering. X86ISelLowering has support to treat: (icmp ne (and (xor %flags, -1), (shl 1, flag)), 0) as if it were actually: (icmp eq (and %flags, (shl 1, flag)), 0) However, r179386 has code at the InstCombine level to handle this. llvm-svn: 181145	2013-05-05 02:00:10 +00:00
Dmitri Gribenko	3238fb7595	Add ArrayRef constructor from None, and do the cleanups that this constructor enables Patch by Robert Wilhelm. llvm-svn: 181138	2013-05-05 00:40:33 +00:00
Nadav Rotem	d61dcfc4fd	whitespace llvm-svn: 181137	2013-05-04 23:27:32 +00:00
Nadav Rotem	42932bdcd0	Fix an odd comment. llvm-svn: 181136	2013-05-04 23:24:56 +00:00
Benjamin Kramer	b44c4275d5	X86: Add target description for btver2; make autodetection logic aware of AVX. llvm-svn: 181005	2013-05-03 10:20:08 +00:00
Aaron Ballman	cc958f0050	Unbreaking the non-x86 build bots by protecting the AVX test code properly. llvm-svn: 180992	2013-05-03 02:52:21 +00:00
Aaron Ballman	63fe014888	Correctly testing for AVX support in x86 based off code from Hosts.cpp. llvm-svn: 180991	2013-05-03 02:39:21 +00:00
John McCall	f73981b213	In MC asm parsing, account for the possibility of whitespace within the "identifier" parsed by the frontend callback by skipping forward until we've consumed a token that ends at the point dictated by the callback. In addition, inform the callback when it's parsing an unevaluated operand (e.g. mov eax, LENGTH A::x) as opposed to an evaluated one (e.g. mov eax, [A::x]). This commit depends on a clang commit. llvm-svn: 180978	2013-05-03 00:15:41 +00:00
Michael Liao	06badde1ac	80-col fixup. llvm-svn: 180915	2013-05-02 09:22:04 +00:00
Michael Liao	afafa98fa8	Avoid duplicating logic on frame register selecting when lowering eh_return No functionality change llvm-svn: 180914	2013-05-02 09:18:38 +00:00
Michael Liao	31d39a4a47	Avoid duplicating logic on frame register selecting when lowering frameaddr No functionality change llvm-svn: 180912	2013-05-02 08:21:56 +00:00
Rafael Espindola	817c1d92b4	Put VMOVPQIto64rr in the VRPDI class. Patch by Joshua Magee. llvm-svn: 180842	2013-05-01 13:00:16 +00:00
Benjamin Kramer	aec90531f9	X86: Now that we have a canonical form for vector integer abs, match it into pabs. llvm-svn: 180600	2013-04-26 12:05:21 +00:00
Preston Gurd	128920d9fa	Make function documentation conform to llvm standards. Expunge all remaining traces and use of live variable information. llvm-svn: 180577	2013-04-25 21:31:33 +00:00
Preston Gurd	8b7ab4ba2b	This patch adds the X86FixupLEAs pass, which will reduce instruction latency for certain models of the Intel Atom family, by converting instructions into their equivalent LEA instructions, when it is both useful and possible to do so. llvm-svn: 180573	2013-04-25 20:29:37 +00:00
Rafael Espindola	b770f897ee	Fix section relocation for SECTIONREL32 with immediate offset. Patch by Kai Nacke. This matches the gnu as output. llvm-svn: 180568	2013-04-25 19:27:05 +00:00
Chad Rosier	65dd0399c6	[ms-inline asm] Removed this unnecessary check. In the current implementation, Disp will always be one of MCSymbolRefExpr or MCConstantExpr, and never NULL. llvm-svn: 180059	2013-04-22 22:38:35 +00:00
Chad Rosier	732b837a41	[ms-inline asm] Add the OpDecl to the InlineAsmIdentifierInfo struct and in turn the MCParsedAsmOperand. Part of rdar://13663589 llvm-svn: 180054	2013-04-22 22:04:25 +00:00
Chad Rosier	eeb0034918	Fix unused variable warning. llvm-svn: 180044	2013-04-22 20:42:32 +00:00
Chad Rosier	cb78f0d05e	[ms-inline asm] Remove the identifier parsing logic from the AsmParser. This is now taken care of by the frontend, which allows us to parse arbitrary C/C++ variables. Part of rdar://13663589 llvm-svn: 180037	2013-04-22 19:42:15 +00:00
Chad Rosier	f6675c3d3e	[ms-inline asm] Refactor/clean up the SemaLookup interface. No functional change indended. Part of rdar://13663589 llvm-svn: 180028	2013-04-22 17:01:46 +00:00
Tim Northover	16aba17024	Remove unused ShouldFoldAtomicFences flag. I think it's almost impossible to fold atomic fences profitably under LLVM/C++11 semantics. As a result, this is now unused and just cluttering up the target interface. llvm-svn: 179940	2013-04-20 12:32:43 +00:00
Tim Northover	a2b533906a	Remove unused MEMBARRIER DAG node; it's been replaced by ATOMIC_FENCE. llvm-svn: 179939	2013-04-20 12:32:17 +00:00
Eli Bendersky	90dd3e7dfd	Move TryToFoldFastISelLoad to FastISel, where it belongs. In general, I'm trying to move as much FastISel logic as possible out of the main path in SelectionDAGISel - intermixing them just adds confusion. llvm-svn: 179902	2013-04-19 22:29:18 +00:00
Michael Liao	b53d8963ce	ArrayRefize getMachineNode(). No functionality change. llvm-svn: 179901	2013-04-19 22:22:57 +00:00
Chad Rosier	e8f9bfdb71	[ms-inline asm] Make code layout more canonical with iniline asm handled last. llvm-svn: 179875	2013-04-19 19:29:50 +00:00

... 5 6 7 8 9 ...

9857 Commits