llvm-project

Commit Graph

Author	SHA1	Message	Date
Akira Hatanaka	c0b020690b	[mips] Lower EH_RETURN. Patch by Sasa Stankovic. llvm-svn: 173862	2013-01-30 00:26:49 +00:00
Renato Golin	5e9d55eca0	Adding simple cast cost to ARM Changing ARMBaseTargetMachine to return ARMTargetLowering intead of the generic one (similar to x86 code). Tests showing which instructions were added to cast when necessary or cost zero when not. Downcast to 16 bits are not lowered in NEON, so costs are not there yet. llvm-svn: 173849	2013-01-29 23:31:38 +00:00
Jyotsna Verma	b16a9cb132	Use multiclass for post-increment store instructions. llvm-svn: 173816	2013-01-29 18:42:41 +00:00
Jyotsna Verma	a609b1c89d	Add constant extender support for MInst type instructions. llvm-svn: 173813	2013-01-29 18:18:50 +00:00
Evan Cheng	27e41c9f70	Remove dead code. llvm-svn: 173812	2013-01-29 18:08:22 +00:00
NAKAMURA Takumi	978b5a0e02	R600/AMDILPeepholeOptimizer.cpp: Tweak std::make_pair to satisfy C++11. llvm-svn: 173807	2013-01-29 16:31:56 +00:00
Hans Wennborg	5deecd9043	Fix typo in X86BaseInfo.h that I introduced in r157818. llvm-svn: 173798	2013-01-29 14:05:57 +00:00
Tim Northover	a0edd3ee66	Fix 64-bit atomic operations in Thumb mode. The ARM and Thumb variants of LDREXD and STREXD have different constraints and take different operands. Previously the code expanding atomic operations didn't take this into account and asserted in Thumb mode. llvm-svn: 173780	2013-01-29 09:06:13 +00:00
Craig Topper	c048154b9b	Merge SSE and AVX shuffle instructions in the comment printer. llvm-svn: 173777	2013-01-29 07:54:31 +00:00
Evan Cheng	0e88c7d897	Teach SDISel to combine fsin / fcos into a fsincos node if the following conditions are met: 1. They share the same operand and are in the same BB. 2. Both outputs are used. 3. The target has a native instruction that maps to ISD::FSINCOS node or the target provides a sincos library call. Implemented the generic optimization in sdisel and enabled it for Mac OSX. Also added an additional optimization for x86_64 Mac OSX by using an alternative entry point __sincos_stret which returns the two results in xmm0 / xmm1. rdar://13087969 PR13204 llvm-svn: 173755	2013-01-29 02:32:37 +00:00
Hal Finkel	7f9e8d3eaa	Add isBGQ method to PPCSubtarget This function will be used in future commits. llvm-svn: 173729	2013-01-29 00:22:47 +00:00
Craig Topper	5c683972bc	Fix 256-bit PALIGNR comment decoding to understand that it works on independent 256-bit lanes. llvm-svn: 173674	2013-01-28 07:41:18 +00:00
Craig Topper	71d99ffe4a	Add missing break in 256-bit palignr comment printing. No test case yet because the comment itself is still wrong. llvm-svn: 173669	2013-01-28 07:19:11 +00:00
Craig Topper	8fb09f0abb	Fix inconsistent usage of PALIGN and PALIGNR when referring to the same instruction. llvm-svn: 173667	2013-01-28 06:48:25 +00:00
Craig Topper	b3ede5e3b1	Remove addToNoHelperNeeded function that was left unused after r173649. Fixes a -Wunused warning. llvm-svn: 173664	2013-01-28 06:09:24 +00:00
Reed Kotler	97f8e2fa8f	Make some code a little simpler. llvm-svn: 173649	2013-01-28 02:46:49 +00:00
Richard Osborne	038d24f90c	[XCore] Add missing l2rus instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173634	2013-01-27 22:28:30 +00:00
Richard Osborne	f2ecd40929	[XCore] Add missing l2r instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173629	2013-01-27 21:26:02 +00:00
Richard Osborne	7fe8f63544	[XCore] Add missing 1r instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173624	2013-01-27 20:46:21 +00:00
Richard Osborne	8f56317287	[XCore] Add missing 0r instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 173623	2013-01-27 20:42:57 +00:00
Bill Wendling	cc1fc9465b	Convert the CPP backend to use the AttributeSet instead of AttributeWithIndex. Further removal of the introspective AttributeWithIndex thing. Also fix the #includes. llvm-svn: 173599	2013-01-27 01:22:51 +00:00
Benjamin Kramer	6a93596538	X86: Decode PALIGN operands so I don't have to do it in my head. llvm-svn: 173572	2013-01-26 13:31:37 +00:00
Benjamin Kramer	99c68dd964	X86: Do splat promotion later, so the optimizer can chew on it first. This catches many cases where we can emit a more efficient shuffle for a specific mask or when the mask contains undefs. Once the splat is lowered to unpacks we can't do that anymore. There is a possibility of moving the promotion after pshufb matching, but I'm not sure if pshufb with a mask loaded from memory is faster than 3 shuffles, so I avoided that for now. llvm-svn: 173569	2013-01-26 11:44:21 +00:00
Reed Kotler	233cee2b5b	fix use of std::std. it's ordered set. llvm-svn: 173563	2013-01-26 06:58:35 +00:00
Dmitri Gribenko	c451bdf9ff	Remove unused variables, silences -Wunused-variable llvm-svn: 173526	2013-01-25 23:17:21 +00:00
Bill Wendling	57625a4966	Remove some introspection functions. The 'getSlot' function and its ilk allow introspection into the AttributeSet class. However, that class should be opaque. Allow access through accessor methods instead. llvm-svn: 173522	2013-01-25 23:09:36 +00:00
Hal Finkel	4e5ca9e578	Initial implementation of PPCTargetTransformInfo This provides a place to add customized operation cost information and control some other target-specific IR-level transformations. The only non-trivial logic in this checkin assigns a higher cost to unaligned loads and stores (covered by the included test case). llvm-svn: 173520	2013-01-25 23:05:59 +00:00
Eli Bendersky	597fc1233a	In this patch, we teach X86_64TargetMachine that it has a ILP32 (defined by the x32 ABI) mode, in which case its pointers are 32-bits in size. This knowledge is also added to X86RegisterInfo that now returns the appropriate registers in getPointerRegClass. There are many outcomes to this change. In order to keep the patches separate and manageable, we start by focusing on some simple testable cases. The patch adds a test with passing a pointer to a function - focusing on the difference between the two data models for x86-64. Another test is added for handling of 'sret' arguments (and functionality is added in X86ISelLowering to make it work). A note on naming: the "x32 ABI" document refers to the AMD64 architecture (in LLVM it's distinguished by being is64Bits() in the x86 subtarget) with two variations: the LP64 (default) data model, and the ILP32 data model. This patch adds predicates to the subtarget which are consistent with this naming scheme. llvm-svn: 173503	2013-01-25 22:07:43 +00:00
Richard Osborne	6b86eec819	Add instruction encodings / disassembly support for l4r instructions. llvm-svn: 173501	2013-01-25 21:55:32 +00:00
Bill Wendling	8649283e75	Use the new 'getSlotIndex' method to retrieve the attribute's slot index. llvm-svn: 173499	2013-01-25 21:46:52 +00:00
Richard Osborne	a520a7dcf3	Use the correct format in the STW / SETPSC instruction names. llvm-svn: 173494	2013-01-25 21:25:12 +00:00
Richard Osborne	9a228a13c6	Fix order of operands for crc8_l4r The order in which operands appear in the encoded instruction is different to order in which they appear in assembly. This changes the XCore backend to use the instruction encoding order. llvm-svn: 173493	2013-01-25 21:20:28 +00:00
Richard Osborne	a19fa86a70	Add instruction encodings / disassembly support for l5r instructions. llvm-svn: 173479	2013-01-25 20:20:07 +00:00
Richard Osborne	8ae02d3cef	Fix order of operands for l5r instructions. With this change the operands order matches the order in which the operands are encoded in the instruction. llvm-svn: 173477	2013-01-25 20:16:00 +00:00
Richard Osborne	ea023fcde1	Use correct mnemonic / instruction name for ldivu. llvm-svn: 173476	2013-01-25 20:11:26 +00:00
Hal Finkel	53f4ba6ce3	More cleanup of PPC register definitions. Uses the new !add TableGen operator to do more cleanup of the PPC register definitions. llvm-svn: 173446	2013-01-25 14:49:10 +00:00
Silviu Baranga	3eb45a03af	Fixed the condition codes for the atomic64 min/umin code generation on ARM. If the sutraction of the higher 32 bit parts gives a 0 result, we need to do the store operation. llvm-svn: 173437	2013-01-25 10:39:49 +00:00
Andrew Trick	e2c3f5c982	MIsched: Improve the interface to SchedDFS analysis (subtrees). Allow the strategy to select SchedDFS. Allow the results of SchedDFS to affect initialization of the scheduler state. llvm-svn: 173425	2013-01-25 06:33:57 +00:00
Jack Carter	07c818d2da	This patch implements parsing the .word directive for the Mips assembler. Contributer: Vladimir Medic llvm-svn: 173407	2013-01-25 01:31:34 +00:00
Akira Hatanaka	28aed9ca85	[mips] Set flag neverHasSideEffects flag on some of the floating point instructions. llvm-svn: 173401	2013-01-25 00:20:39 +00:00
Renato Golin	d4c392e6ff	Moving Cost Tables up to share with other targets llvm-svn: 173382	2013-01-24 23:01:00 +00:00
Hal Finkel	41176f43c4	Start cleanup of PPC register definitions using foreach loops. No functionality change intended. This captures the first two cases GPR32/64. For the others, we need an addition operator (if we have one, I've not yet found it). Based on a suggestion made by Tom Stellard in the AArch64 review! llvm-svn: 173366	2013-01-24 20:43:18 +00:00
NAKAMURA Takumi	bf8f207519	MipsISelLowering.cpp: Fill unreachable paths to fix warnings. [-Wsometimes-uninitialized] FIXME: Could they, unreachable(s), be removed? FIXME: I could prefer the coding standards... llvm-svn: 173325	2013-01-24 06:08:06 +00:00
NAKAMURA Takumi	f25b7c6816	MipsISelLowering.cpp: Fix a warning, take two. [-Wunused-variable] ...and fix a typo, s/#ifdef/#ifndef/ llvm-svn: 173324	2013-01-24 05:54:23 +00:00
NAKAMURA Takumi	c77d028bfb	MipsISelLowering.cpp: Fix a warning. [-Wunused-variable] llvm-svn: 173323	2013-01-24 05:47:29 +00:00
Reed Kotler	a2d76bce1f	The next phase of Mips16 hard float implementation. Allow Mips16 routines to call Mips32 routines that have abi requirements that either arguments or return values are passed in floating point registers. This handles only the pic case. We have not done non pic for Mips16 yet in any form. The libm functions are Mips32, so with this addition we have a complete Mips16 hard float implementation. We still are not able to complete mix Mip16 and Mips32 with hard float. That will be the next phase which will have several steps. For Mips32 to freely call Mips16 some stub functions must be created. llvm-svn: 173320	2013-01-24 04:24:02 +00:00
Tom Stellard	6f1b8657f9	R600: Add a llvm.R600.store.swizzle intrinsics This intrinsic is translated to ALLOC_EXPORT_WORD1_SWIZ, hence its name. It is used to store vs/fs outputs Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173297	2013-01-23 21:39:49 +00:00
Tom Stellard	d8ac91d436	R600: Simplify stream outputs intrinsic Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173296	2013-01-23 21:39:47 +00:00
Richard Osborne	54e311821f	Add instruction encodings / disassembly support for l6r instructions. llvm-svn: 173288	2013-01-23 20:08:11 +00:00
Eli Bendersky	f759526983	Fix powerpc test failure - forgot to initialize stack slot size for PPCLinuxMCAsmInfo llvm-svn: 173275	2013-01-23 17:12:15 +00:00
Eli Bendersky	32aab2216d	Clean up assignment of CalleeSaveStackSlotSize: get rid of the default and explicitly set this in every target that needs to change it from the default. llvm-svn: 173270	2013-01-23 16:22:04 +00:00
Benjamin Kramer	c4231cc9b3	NVPTX: Stop leaking memory by using a managed constant instead of a new Argument. This is still an egregious hack since we don't have a nice interface for this kind of thing but should help the valgrind leak check buildbot to become green. llvm-svn: 173267	2013-01-23 15:21:44 +00:00
Bill Wendling	d154e283f2	Add the IR attribute 'sspstrong'. SSPStrong applies a heuristic to insert stack protectors in these situations: * A Protector is required for functions which contain an array, regardless of type or length. * A Protector is required for functions which contain a structure/union which contains an array, regardless of type or length. Note, there is no limit to the depth of nesting. * A protector is required when the address of a local variable (i.e., stack based variable) is exposed. (E.g., such as through a local whose address is taken as part of the RHS of an assignment or a local whose address is taken as part of a function argument.) This patch implements the SSPString attribute to be equivalent to SSPRequired. This will change in a subsequent patch. llvm-svn: 173230	2013-01-23 06:41:41 +00:00
Tom Stellard	365366f9ef	R600: rework handling of the constants Remove Cxxx registers, add new special register - "ALU_CONST" and new operand for each alu src - "sel". ALU_CONST is used to designate that the new operand contains the value to override src.sel, src.kc_bank, src.chan for constants in the driver. Patch by: Vadim Girlin Vincent Lejeune: - Use pointers for constants - Fold CONST_ADDRESS when possible Tom Stellard: - Give CONSTANT_BUFFER_0 its own address space - Use integer types for constant loads Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173222	2013-01-23 02:09:06 +00:00
Tom Stellard	ff62c35da0	R600: Add a CONST_ADDRESS node to model constant buf read Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173221	2013-01-23 02:09:03 +00:00
Tom Stellard	ab28e9a30a	R600: Factorise VTX_WORD0 and VTX_WORD1 in tblgen def Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173220	2013-01-23 02:09:01 +00:00
Richard Osborne	1a06479f46	Add instruction encodings / disassembly support for u10 / lu10 instructions. llvm-svn: 173204	2013-01-22 22:55:04 +00:00
Michael Liao	3dffc5e2b7	Fix an issue of pseudo atomic instruction DAG schedule - Add list of physical registers clobbered in pseudo atomic insts Physical registers are clobbered when pseudo atomic instructions are expanded. Add them in clobber list to prevent DAG scheduler to mis-schedule them after these insns are declared side-effect free. - Add test case from Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 173200	2013-01-22 21:47:38 +00:00
Akira Hatanaka	88c0ec826c	[mips] Implement MipsRegisterInfo::getRegPressureLimit. llvm-svn: 173197	2013-01-22 21:34:25 +00:00
Akira Hatanaka	f7d16d0563	[mips] Clean up code in MipsTargetLowering::LowerCall. No functional change intended llvm-svn: 173189	2013-01-22 20:05:56 +00:00
Benjamin Kramer	fee7d21ae7	X86: Make sure we account for the FMA4 register immediate value, otherwise rip-rel relocations will be off by one byte. PR15040. llvm-svn: 173176	2013-01-22 18:05:59 +00:00
Eli Bendersky	0893e1079d	Initial patch for x32 ABI support. Add the x32 environment kind to the triple, and separate the concept of pointer size and callee save stack slot size, since they're not equal on x32. llvm-svn: 173175	2013-01-22 18:02:49 +00:00
Tim Northover	29178a348a	Make APFloat constructor require explicit semantics. Previously we tried to infer it from the bit width size, with an added IsIEEE argument for the PPC/IEEE 128-bit case, which had a default value. This default value allowed bugs to creep in, where it was inappropriate. llvm-svn: 173138	2013-01-22 09:46:31 +00:00
Richard Osborne	5d477751df	Fix some incorrectly named u10 / lu10 instructions. llvm-svn: 173090	2013-01-21 21:12:30 +00:00
Richard Osborne	38cff3ea7f	Remove unused multiclass. llvm-svn: 173087	2013-01-21 20:50:54 +00:00
Richard Osborne	9d3ec06ef8	Add instruction encodings / disassembly support for u6 / lu6 instructions. llvm-svn: 173086	2013-01-21 20:44:17 +00:00
Richard Osborne	6e58c6d86d	Add instruction encoding / disassembly support for ru6 / lru6 instructions. llvm-svn: 173085	2013-01-21 20:42:16 +00:00
Richard Osborne	0d68e21ca7	Use correct format for the LDAWCP instruction (u6). llvm-svn: 173083	2013-01-21 20:32:54 +00:00
Tom Stellard	c9b903138d	R600/SI: Use unnormalized coordinates for sampling with the RECT target. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 173053	2013-01-21 15:40:48 +00:00
Tom Stellard	14421a793f	R600/SI: Take target parameter for sample intrinsics. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 173052	2013-01-21 15:40:47 +00:00
Tom Stellard	74dda0da31	R600/SI: Derive all sample intrinsics from a single class. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 173051	2013-01-21 15:40:46 +00:00
NAKAMURA Takumi	c96fb1bd36	R600/SILowerControlFlow.cpp: Fix a warning. [-Wunused-variable] llvm-svn: 173040	2013-01-21 14:06:48 +00:00
Craig Topper	66163a35ee	Use <0 checks in place of ==-1 because it results in simpler code. llvm-svn: 173010	2013-01-21 07:25:16 +00:00
Craig Topper	9b29486f42	Use MVT instead of EVT in LowerVECTOR_SHUFFLEtoBlend. llvm-svn: 173009	2013-01-21 07:19:54 +00:00
Craig Topper	32c5406dcf	Remove trailing whitespace. llvm-svn: 173008	2013-01-21 06:57:59 +00:00
Craig Topper	5c84c25bf4	Fix some 80 column violations. llvm-svn: 173006	2013-01-21 06:21:54 +00:00
Craig Topper	2cd375896a	Make helper method static. llvm-svn: 173005	2013-01-21 06:13:28 +00:00
Craig Topper	cf93977920	Convert more EVT's to MVT's in the lowering methods. llvm-svn: 172995	2013-01-20 21:50:27 +00:00
Craig Topper	e65a08be64	Capitalize lowerTRUNCATE so that it matches the other lower functions in this file despite it not matching coding standards. llvm-svn: 172994	2013-01-20 21:34:37 +00:00
Renato Golin	e1fb059327	Revert CostTable algorithm, will re-write llvm-svn: 172992	2013-01-20 20:57:20 +00:00
Richard Osborne	4e69724869	Add instruction encodings / disassembly support for l2rus instructions. llvm-svn: 172987	2013-01-20 18:51:15 +00:00
Richard Osborne	9fbf57b26c	Add instruction encodings / disassembly support for l3r instructions. llvm-svn: 172986	2013-01-20 18:37:49 +00:00
Richard Osborne	f063fcee7a	Add instruction encodings / disassembler support for 2rus instructions. llvm-svn: 172985	2013-01-20 17:22:43 +00:00
Richard Osborne	3fb7395233	Add instruction encodings / disassembly support 3r instructions. It is not possible to distinguish 3r instructions from 2r / rus instructions using only the fixed bits. Therefore if an instruction doesn't match the 2r / rus format try to decode it as a 3r instruction before returning Fail. llvm-svn: 172984	2013-01-20 17:18:47 +00:00
Craig Topper	ce61fdf0a3	Make LowerVSETCC a static function and use MVT instead of EVT. llvm-svn: 172969	2013-01-20 09:02:22 +00:00
Nadav Rotem	9450fcfff1	Revert 172708. The optimization handles esoteric cases but adds a lot of complexity both to the X86 backend and to other backends. This optimization disables an important canonicalization of chains of SEXT nodes and makes SEXT and ZEXT asymmetrical. Disabling the canonicalization of consecutive SEXT nodes into a single node disables other DAG optimizations that assume that there is only one SEXT node. The AVX mask optimizations is one example. Additionally this optimization does not update the cost model. llvm-svn: 172968	2013-01-20 08:35:56 +00:00
Craig Topper	9976974cc6	Make some helper methods static. llvm-svn: 172936	2013-01-20 00:50:58 +00:00
Craig Topper	4ac87da529	Remove DebugLoc argument from static function. It can easily be obtained from the SVOp passed in. llvm-svn: 172935	2013-01-20 00:43:42 +00:00
Craig Topper	3da6507c41	Use MVT instead of EVT in more instruction lowering code. llvm-svn: 172933	2013-01-20 00:38:18 +00:00
Craig Topper	53c7fbabbf	Use MVT instead of EVT in more of the shuffle lowering code. llvm-svn: 172930	2013-01-19 23:36:09 +00:00
Craig Topper	bb772d27a7	Capitalize LowerVectorIntExtend to be consistent with all the other lower functions in this file. llvm-svn: 172927	2013-01-19 23:14:09 +00:00
Nadav Rotem	7b3120b9ae	On Sandybridge split unaligned 256bit stores into two xmm-sized stores. llvm-svn: 172894	2013-01-19 08:38:41 +00:00
Craig Topper	84b01120bc	Use MVT instead of EVT when computing shuffle immediates since they can only be for legal types. Keeps compiler from generating unneeded checks and handling for extended types. llvm-svn: 172893	2013-01-19 08:27:45 +00:00
Chandler Carruth	1fe21fc0b5	Sort all of the includes. Several files got checked in with mis-sorted includes. llvm-svn: 172891	2013-01-19 08:03:47 +00:00
Jack Carter	7ab15fafe3	This is a resubmittal. For some reason it broke the bots yesterday but I cannot reproduce the problem and have scrubed my sources and even tested with llvm-lit -v --vg. Formatting fixes. Mostly long lines and blank spaces at end of lines. Contributer: Jack Carter llvm-svn: 172882	2013-01-19 02:00:40 +00:00
Nadav Rotem	7431211214	On Sandybridge loading unaligned 256bits using two XMM loads (vmovups and vinsertf128) is faster than using a single vmovups instruction. llvm-svn: 172868	2013-01-18 23:10:30 +00:00
Jack Carter	c1b17ed2e1	This is a resubmittal. For some reason it broke the bots yesterday but I cannot reproduce the problem and have scrubed my sources and even tested with llvm-lit -v --vg. Support for Mips register information sections. Mips ELF object files have a section that is dedicated to register use info. Some of this information such as the assumed Global Pointer value is used by the linker in relocation resolution. The register info file is .reginfo in o32 and .MIPS.options in 64 and n32 abi files. This patch contains the changes needed to create the sections, but leaves the actual register accounting for a future patch. Contributer: Jack Carter llvm-svn: 172847	2013-01-18 21:20:38 +00:00
Tom Stellard	c4cabef782	R600: Proper insert S_WAITCNT instructions Some instructions like memory reads/writes are executed asynchronously, so we need to insert S_WAITCNT instructions to block before accessing their results. Previously we have just inserted S_WAITCNT instructions after each async instruction, this patch fixes this and adds a prober insertion pass. Patch by: Christian König Tested-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 172846	2013-01-18 21:15:53 +00:00
Tom Stellard	be8ebeebf7	R600: Optimize and cleanup KILL on SI We shouldn't insert KILL optimization if we don't have a kill instruction at all. Patch by: Christian König Tested-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 172845	2013-01-18 21:15:50 +00:00
Jack Carter	86c2c564ff	This is a resubmittal. For some reason it broke the bots yesterday but I cannot reproduce the problem and have scrubed my sources and even tested with llvm-lit -v --vg. Removal of redundant code and formatting fixes. Contributers: Jack Carter/Vladimir Medic llvm-svn: 172842	2013-01-18 20:15:06 +00:00
Craig Topper	1cb8aa581b	Calculate vector element size more directly for VINSERTF128/VEXTRACTF128 immediate handling. Also use MVT since this only called on legal types during pattern matching. llvm-svn: 172797	2013-01-18 08:41:28 +00:00
Craig Topper	e938138daf	Minor formatting fix. No functional change. llvm-svn: 172795	2013-01-18 07:27:20 +00:00
Craig Topper	908f7d14b5	Spelling fix: extened->extended. Trailing whitespace in same function. llvm-svn: 172793	2013-01-18 06:50:59 +00:00
Craig Topper	01fcf2e2f2	Make more use of is128BitVector/is256BitVector in place of getSizeInBits() == 128/256. llvm-svn: 172792	2013-01-18 06:44:29 +00:00
Chad Rosier	1e8f053bd1	[ms-inline asm] Make the error message more generic now that we support the 'SIZE' and 'LENGTH' operators. llvm-svn: 172773	2013-01-18 00:50:59 +00:00
Bill Schmidt	dee1ef8f53	This patch fixes PR13626 by providing i128 support in the return calling convention. 128-bit integers are now properly returned in GPR3 and GPR4 on PowerPC. llvm-svn: 172745	2013-01-17 19:34:57 +00:00
Chad Rosier	d0ed73acb4	[ms-inline asm] Add support for the 'SIZE' and 'LENGTH' operators. Part of rdar://12576868 llvm-svn: 172743	2013-01-17 19:21:48 +00:00
Jyotsna Verma	9b60c1d171	Add indexed load/store instructions for offset validation check. This patch fixes bug 14902 - http://llvm.org/bugs/show_bug.cgi?id=14902 llvm-svn: 172737	2013-01-17 18:42:37 +00:00
Bill Schmidt	6b2940b01e	This patch fixes the PPC calling convention to handle returns of _Complex float and _Complex long double, by simply increasing the number of floating point registers available for return values. The test case verifies that the correct registers are loaded. llvm-svn: 172733	2013-01-17 17:45:19 +00:00
Elena Demikhovsky	f6a30e05d5	Optimization for the following SIGN_EXTEND pairs: v8i8 -> v8i64, v8i8 -> v8i32, v4i8 -> v4i64, v4i16 -> v4i64 for AVX and AVX2. Bug 14865. llvm-svn: 172708	2013-01-17 09:59:53 +00:00
Craig Topper	c7e6feee42	Combine AVX and SSE forms of MOVSS and MOVSD into the same multiclasses so they get instantiated together. llvm-svn: 172704	2013-01-17 06:59:42 +00:00
Jakob Stoklund Olesen	213a2f8b3f	Provide a place for targets to insert ILP optimization passes. Move the early if-conversion pass into this group. ILP optimizations usually need to find the right balance between register pressure and ILP using the MachineTraceMetrics analysis to identify critical paths and estimate other costs. Such passes should run together so they can share dominator tree and loop info analyses. Besides if-conversion, future passes to run here here could include expression height reduction and ARM's MLxExpansion pass. llvm-svn: 172687	2013-01-17 00:58:38 +00:00
Jack Carter	2a74a87b71	This is a resubmittal. For some reason it broke the bots yesterday but I cannot reproduce the problem and have scrubed my sources and even tested with llvm-lit -v --vg. The Mips RDHWR (Read Hardware Register) instruction was not tested for assembler or dissassembler consumption. This patch adds that functionality. Contributer: Vladimir Medic llvm-svn: 172685	2013-01-17 00:28:20 +00:00
Renato Golin	f104c4c4ca	Change CostTable model to be global to all targets Moving the X86CostTable to a common place, so that other back-ends can share the code. Also simplifying it a bit and commoning up tables with one and two types on operations. llvm-svn: 172658	2013-01-16 21:29:55 +00:00
Jack Carter	5619f91bf7	reverting 172579 llvm-svn: 172594	2013-01-16 01:29:10 +00:00
Jack Carter	e0c1e1a47e	Akira, Hope you are feeling better. The Mips RDHWR (Read Hardware Register) instruction was not tested for assembler or dissassembler consumption. This patch adds that functionality. Contributer: Vladimir Medic llvm-svn: 172579	2013-01-16 00:07:45 +00:00
Jack Carter	f238510c43	This patch fixes a Mips specific bug where we need to generate a N64 compound relocation R_MIPS_GPREL_32/R_MIPS_64/R_MIPS_NONE. The bug was exposed by the SingleSourcetest case DuffsDevice.c. Contributer: Jack Carter llvm-svn: 172496	2013-01-15 01:08:02 +00:00
Chad Rosier	5c118fd2ec	[ms-inline asm] Extend support for parsing Intel bracketed memory operands that have an arbitrary ordering of the base register, index register and displacement. rdar://12527141 llvm-svn: 172484	2013-01-14 22:31:35 +00:00
Dmitri Gribenko	f24e57f227	Improve r172468: const_cast is not needed here llvm-svn: 172483	2013-01-14 22:18:18 +00:00
Dmitri Gribenko	2e1df0e354	Improve r172471: avoid all those extra casts on the lines nearby llvm-svn: 172481	2013-01-14 22:08:37 +00:00
Quentin Colombet	77ca8b83a9	Follow up of commit r172472. Refactor the big if/else sequence into one string switch for ARM subtype selection. llvm-svn: 172475	2013-01-14 21:34:09 +00:00
Quentin Colombet	1a71168624	Complete the existing support of ARM v6m, v7m, and v7em, i.e., respectively cortex-m0, cortex-m3, and cortex-m4 on the backend side. Adds new subtype values for the MachO format and use them when the related triple are set. llvm-svn: 172472	2013-01-14 21:07:43 +00:00
David Greene	cf7ae6c2fd	Fix Casting Fix a casting-away-const compiler warning. llvm-svn: 172471	2013-01-14 21:04:47 +00:00
David Greene	c311561708	Fix Another Cast Properly cast code to eliminate cast-away-const errors. llvm-svn: 172468	2013-01-14 21:04:42 +00:00
Craig Topper	0d2c29e807	Simplify nested strconcats in X86 td files since strconcat can take more than 2 arguments. llvm-svn: 172379	2013-01-14 07:46:34 +00:00
Craig Topper	4c69a05d2d	Create a single multiclass for SSE and AVX version of MOVL/MOVH. Prevents needing to specify everything twice. No functional change intended llvm-svn: 172378	2013-01-14 07:26:58 +00:00
Nick Lewycky	f41a80efd0	Fix typo in comment. llvm-svn: 172364	2013-01-13 19:03:55 +00:00
Dmitri Gribenko	226fea5bd6	Remove redundant 'llvm::' qualifications llvm-svn: 172358	2013-01-13 16:01:15 +00:00
Benjamin Kramer	bcd14a0f26	X86: Add patterns for X86ISD::VSEXT in registers. Those can occur when something between the sextload and the store is on the same chain and blocks isel. Fixes PR14887. llvm-svn: 172353	2013-01-13 11:37:04 +00:00
NAKAMURA Takumi	de45c3a485	MipsDisassembler.cpp: Prune DecodeHWRegs64RegisterClass() to suppress a warning. [-Wunused-function] llvm-svn: 172319	2013-01-12 15:37:00 +00:00
NAKAMURA Takumi	956c123ab6	MipsAsmParser: Try to unbreak tests to add extra check. llvm-svn: 172315	2013-01-12 15:19:10 +00:00
Jack Carter	873c724b4a	This patch tackles the problem of parsing Mips register names in the standalone assembler llvm-mc. Registers such as $A1 can represent either a 32 or 64 bit register based on the instruction using it. In addition, based on the abi, $T0 can represent different 32 bit registers. The problem is resolved by the Mips specific AsmParser td definitions changing to work together. Many cases of RegisterClass parameters are now RegisterOperand. Contributer: Vladimir Medic llvm-svn: 172284	2013-01-12 01:03:14 +00:00
Preston Gurd	99c6990457	Update patch for the pad short functions pass for Intel Atom (only). Adds a check for -Oz, changes the code to not re-visit BBs, and skips over DBG_VALUE instrs. Patch by Andy Zhang. llvm-svn: 172258	2013-01-11 22:06:56 +00:00
NAKAMURA Takumi	7f25427686	X86AsmParser.cpp: Fix up r172148, to add initializer in another CreateMem(). llvm-svn: 172157	2013-01-11 01:13:54 +00:00
Jakub Staszak	ab3d878f35	Remove heavy and unused #inclues from X86TargetObjectFile.cpp. llvm-svn: 172151	2013-01-10 23:43:56 +00:00
Chad Rosier	8c2a9c744e	[ms-inline asm] Make sure we set a default value for AddressOf. Follow on to r172121. llvm-svn: 172148	2013-01-10 23:39:07 +00:00
Chad Rosier	a4bc9437a2	[ms-inline asm] Add support for calling functions from inline assembly. Part of rdar://12991541 llvm-svn: 172121	2013-01-10 22:10:27 +00:00
Joel Jones	5459754d33	Fix description of ARMOperand llvm-svn: 172011	2013-01-09 22:34:16 +00:00
Nadav Rotem	b1791a75cd	ARM Cost model: Use the size of vector registers and widest vectorizable instruction to determine the max vectorization factor. llvm-svn: 172010	2013-01-09 22:29:00 +00:00
Adhemerval Zanella	1ae2248e14	PowerPC: EH adjustments This patch adjust the r171506 to make all DWARF enconding pc-relative for PPC64. It also adds the R_PPC64_REL32 relocation handling in MCJIT (since the eh_frame will not generate PIC-relative relocation) and also adds the emission of stubs created by the TTypeEncoding. llvm-svn: 171979	2013-01-09 17:08:15 +00:00
Nadav Rotem	977e0be4a0	Efficient lowering of vector sdiv when the divisor is a splatted power of two constant. PR 14848. The lowered sequence is based on the existing sequence the target-independent DAG Combiner creates for the scalar case. Patch by Zvi Rackover. llvm-svn: 171953	2013-01-09 05:14:33 +00:00
Eric Christopher	bf7bc4966c	Last in the series of removing unnecessary '0' arguments for address space. Reordered the EmitULEB128IntValue arguments to make this easier. llvm-svn: 171949	2013-01-09 03:52:05 +00:00
Andrew Trick	9f0b95f260	MIsched: add an ILP window property to machine model. This was an experimental option, but needs to be defined per-target. e.g. PPC A2 needs to aggressively hide latency. I converted some in-order scheduling tests to A2. Hal is working on more test cases. llvm-svn: 171946	2013-01-09 03:36:49 +00:00
Eric Christopher	e3ab3d0e2c	These functions have default arguments of 0 for the last arg. Use them. llvm-svn: 171933	2013-01-09 01:57:54 +00:00
Nadav Rotem	b696c36fcd	Cost Model: Move the 'max unroll factor' variable to the TTI and add initial Cost Model support on ARM. llvm-svn: 171928	2013-01-09 01:15:42 +00:00
Jack Carter	c3dd91c4d7	This patch produces the correct addend value for an R_MIPS_GPREL16 relocation. Contributer: Jack Carter llvm-svn: 171882	2013-01-08 19:01:28 +00:00
Jack Carter	9e28cd3fad	This patch produces the correct pointer size value in the 64 bit .eh_frame section. It doesn't however allow exception handling to work yet since it depends on the correct relocation model being set in the ELF header flags. Contributer: Jack Carter llvm-svn: 171881	2013-01-08 18:53:20 +00:00
Preston Gurd	a01daace88	Pad Short Functions for Intel Atom The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. This patch has been updated to address Nadav's review comments - Optimize only at >= O1 and don't do optimization if -Os is set - Stores MachineBasicBlock* instead of BBNum - Uses DenseMap instead of std::map - Fixes placement of braces Patch by Andy Zhang. llvm-svn: 171879	2013-01-08 18:27:24 +00:00
Eli Bendersky	4d9ada036c	Renamed MCInstFragment to MCRelaxableFragment and added some comments. No change in functionality. llvm-svn: 171822	2013-01-08 00:22:56 +00:00
Jim Grosbach	9dbf3ee9d0	ARM: Copy-paste error. llvm-svn: 171790	2013-01-07 21:24:35 +00:00

1 2 3 4 5 ...

23209 Commits