llvm-project

Commit Graph

Author	SHA1	Message	Date
Silviu Baranga	b47bb94f93	This patch introduces A15 as a target in LLVM. llvm-svn: 163803	2012-09-13 15:05:10 +00:00
Craig Topper	3e41a5bb31	Set operation action for FFLOOR to Expand for all vector types for X86. Set FFLOOR of v4f32 to Expand for ARM. v2f64 was already correct. llvm-svn: 163458	2012-09-08 04:58:43 +00:00
Jakob Stoklund Olesen	e45e22b20f	Custom DAGCombine for and/or/xor are for all ARMs. The 'select' transformations apply to all ARM architectures and don't require hasV6T2Ops. llvm-svn: 163396	2012-09-07 17:34:15 +00:00
James Molloy	9d30dc2432	Fix self-host; ensure signedness is consistent. llvm-svn: 163306	2012-09-06 10:32:08 +00:00
James Molloy	49bdbce8e1	Improve codegen for BUILD_VECTORs on ARM. If we have a BUILD_VECTOR that is mostly a constant splat, it is often better to splat that constant then insertelement the non-constant lanes instead of insertelementing every lane from an undef base. llvm-svn: 163304	2012-09-06 09:55:02 +00:00
Arnold Schwaighofer	f00fb1c581	Patch to implement UMLAL/SMLAL instructions for the ARM architecture This patch corrects the definition of umlal/smlal instructions and adds support for matching them to the ARM dag combiner. Bug 12213 Patch by Yin Ma! llvm-svn: 163136	2012-09-04 14:37:49 +00:00
Jakob Stoklund Olesen	d3bda3c5b9	Fix a couple of typos in EmitAtomic. Thumb2 instructions are mostly constrained to rGPR, not tGPR which is for Thumb1. rdar://problem/12203728 llvm-svn: 162968	2012-08-31 02:08:34 +00:00
Jakob Stoklund Olesen	710093e360	Use a SmallPtrSet to dedup successors in EmitSjLjDispatchBlock. The test case ARM/2011-05-04-MultipleLandingPadSuccs.ll was creating duplicate successor list entries. llvm-svn: 162222	2012-08-20 20:52:03 +00:00
Jakob Stoklund Olesen	e1014e7b98	Remove the CAND/COR/CXOR custom ISD nodes and their select code. These nodes are no longer needed because the peephole pass can fold CMOV+AND into ANDCC etc. llvm-svn: 162179	2012-08-18 21:49:50 +00:00
Jakob Stoklund Olesen	dded061f85	Also combine zext/sext into selects for ARM. This turns common i1 patterns into predicated instructions: (add (zext cc), x) -> (select cc (add x, 1), x) (add (sext cc), x) -> (select cc (add x, -1), x) For a function like: unsigned f(unsigned s, int x) { return s + (x>0); } We now produce: cmp r1, #0 it gt addgt.w r0, r0, #1 Instead of: movs r2, #0 cmp r1, #0 it gt movgt r2, #1 add r0, r2 llvm-svn: 162177	2012-08-18 21:25:22 +00:00
Jakob Stoklund Olesen	aab43dbfbb	Also pass logical ops to combineSelectAndUse. Add these transformations to the existing add/sub ones: (and (select cc, -1, c), x) -> (select cc, x, (and, x, c)) (or (select cc, 0, c), x) -> (select cc, x, (or, x, c)) (xor (select cc, 0, c), x) -> (select cc, x, (xor, x, c)) The selects can then be transformed to a single predicated instruction by peephole. This transformation will make it possible to eliminate the ISD::CAND, COR, and CXOR custom DAG nodes. llvm-svn: 162176	2012-08-18 21:25:16 +00:00
Jakob Stoklund Olesen	c1dee482c8	Add comment, clean up code. No functional change. llvm-svn: 162107	2012-08-17 16:59:09 +00:00
Jakob Stoklund Olesen	c19bf0282d	Handle ARM MOVCC optimization in PeepholeOptimizer. Use the target independent select analysis hooks. llvm-svn: 162060	2012-08-16 23:14:20 +00:00
Jakob Stoklund Olesen	6cb96120f1	Fold predicable instructions into MOVCC / t2MOVCC. The ARM select instructions are just predicated moves. If the select is the only use of an operand, the instruction defining the operand can be predicated instead, saving one instruction and decreasing register pressure. This implementation can turn AND/ORR/EOR instructions into their corresponding ANDCC/ORRCC/EORCC variants. Ideally, we should be able to predicate any instruction, but we don't yet support predicated instructions in SSA form. llvm-svn: 161994	2012-08-15 22:16:39 +00:00
Evan Cheng	eec6bc6270	Use vld1/vst1 to load/store f64 if alignment is < 4 and the target allows unaligned access. rdar://12091029 llvm-svn: 161962	2012-08-15 17:44:53 +00:00
Nadav Rotem	3a94c545cf	Do not optimize (or (and X,Y), Z) into BFI and other sequences if the AND ISDNode has more than one user. rdar://11876519 llvm-svn: 161775	2012-08-13 18:52:44 +00:00
Arnold Schwaighofer	b73da9453c	Revert 161581: Patch to implement UMLAL/SMLAL instructions for the ARM architecture It broke MultiSource/Applications/JM/ldecod/ldecod on armv7 thumb O0 g and armv7 thumb O3. llvm-svn: 161736	2012-08-12 05:11:56 +00:00
Craig Topper	4fa625fda7	Change addTypeForNeon to use MVT instead of EVT so all the calls to getSimpleVT can be removed. llvm-svn: 161735	2012-08-12 03:16:37 +00:00
Arnold Schwaighofer	81b2eec1ab	Patch to implement UMLAL/SMLAL instructions for the ARM architecture This patch corrects the definition of umlal/smlal instructions and adds support for matching them to the ARM dag combiner. Bug 12213 Patch by Yin Ma! llvm-svn: 161581	2012-08-09 15:25:52 +00:00
Bob Wilson	3e6fa462f3	Fall back to selection DAG isel for calls to builtin functions. Fast isel doesn't currently have support for translating builtin function calls to target instructions. For embedded environments where the library functions are not available, this is a matter of correctness and not just optimization. Most of this patch is just arranging to make the TargetLibraryInfo available in fast isel. <rdar://problem/12008746> llvm-svn: 161232	2012-08-03 04:06:28 +00:00
Eric Christopher	b3322364e4	Add support for the ARM GHC calling convention, this patch was in 3.0, but somehow managed to be dropped later. Patch by Karel Gardas. llvm-svn: 161226	2012-08-03 00:05:53 +00:00
Jim Grosbach	6df755cc4e	ARM: Don't assume an SDNode is a constant. Before accessing a node as a ConstandSDNode, make sure it actually is one. No testcase of non-trivial size. rdar://11948669 llvm-svn: 160735	2012-07-25 17:02:47 +00:00
Andrew Trick	a22cdb713b	Fix ARMTargetLowering::isLegalAddImmediate to consider thumb encodings. Based on Evan's suggestion without a commitable test. llvm-svn: 160441	2012-07-18 18:34:27 +00:00
Andrew Trick	bc325168c3	whitespace llvm-svn: 160440	2012-07-18 18:34:24 +00:00
Manman Ren	6e1fd46fdf	ARM: use NOEN loads and stores if possible when handling struct byval. This change is to be enabled in clang. rdar://9877866 llvm-svn: 158684	2012-06-18 22:23:48 +00:00
Manman Ren	e0763c7472	ARM: optimization for sub+abs. This patch will optimize abs(x-y) FROM sub, movs, rsbmi TO subs, rsbmi For abs, we will use cmp instead of movs. This is necessary because we already have an existing peephole pass which optimizes away cmp following sub. rdar: 11633193 llvm-svn: 158551	2012-06-15 21:32:12 +00:00
Bill Wendling	4b79647a6e	Re-enable the CMN instruction. We turned off the CMN instruction because it had semantics which we weren't getting correct. If we are comparing with an immediate, then it's okay to use the CMN instruction. <rdar://problem/7569620> llvm-svn: 158302	2012-06-11 08:07:26 +00:00
Manman Ren	e873552091	ARM: properly handle alignment for struct byval. Factor out the expansion code into a function. This change is to be enabled in clang. rdar://9877866 llvm-svn: 157830	2012-06-01 19:33:18 +00:00
Manman Ren	9f9111651e	ARM: support struct byval in llvm We handle struct byval by inserting a pseudo op, which will be expanded to a loop at ExpandISelPseudos. A separate patch for clang will be submitted to enable struct byval. rdar://9877866 llvm-svn: 157793	2012-06-01 02:44:42 +00:00
Justin Holewinski	aa58397b3c	Change interface for TargetLowering::LowerCallTo and TargetLowering::LowerCall to pass around a struct instead of a large set of individual values. This cleans up the interface and allows more information to be added to the struct for future targets without requiring changes to each and every target. NV_CONTRIB llvm-svn: 157479	2012-05-25 16:35:28 +00:00
Jakob Stoklund Olesen	691ae3388f	Use the right register class for LDRrs. llvm-svn: 157152	2012-05-20 06:38:47 +00:00
Benjamin Kramer	e31f31e5c0	Add a new target hook "predictableSelectIsExpensive". This will be used to determine whether it's profitable to turn a select into a branch when the branch is likely to be predicted. Currently enabled for everything but Atom on X86 and Cortex-A9 devices on ARM. I'm not entirely happy with the name of this flag, suggestions welcome ;) llvm-svn: 156233	2012-05-05 12:49:14 +00:00
Matt Beaumont-Gay	e82ab6baa7	Pacify GCC's -Wreturn-type llvm-svn: 156189	2012-05-04 18:34:27 +00:00
Hans Wennborg	aea412008e	Make ARM and Mips use TargetMachine::getTLSModel() This moves the logic for selecting a TLS model to a single place, instead of the previous three (ARM, Mips, and X86 which already uses this function). llvm-svn: 156162	2012-05-04 09:40:39 +00:00
Bob Wilson	9245c93656	Don't introduce illegal types when creating vmull operations. <rdar://11324364> ARM BUILD_VECTORs created after type legalization cannot use i8 or i16 operands, since those types are not legal. Instead use i32 operands, which will be implicitly truncated by the BUILD_VECTOR to match the element type. llvm-svn: 155824	2012-04-30 16:53:34 +00:00
Craig Topper	c7242e054d	Convert more uses of XXXRegisterClass to &XXXRegClass. No functional change since they are equivalent. llvm-svn: 155188	2012-04-20 07:30:17 +00:00
Evan Cheng	d0007f3c83	Handle llvm.fma.* intrinsics. rdar://10914096 llvm-svn: 154439	2012-04-10 21:40:28 +00:00
Evan Cheng	f8bad08001	Fix a long standing tail call optimization bug. When a libcall is emitted legalizer always use the DAG entry node. This is wrong when the libcall is emitted as a tail call since it effectively folds the return node. If the return node's input chain is not the entry (i.e. call, load, or store) use that as the tail call input chain. PR12419 rdar://9770785 rdar://11195178 llvm-svn: 154370	2012-04-10 01:51:00 +00:00
Chad Rosier	e0e38f61a5	When performing a truncating store, it's possible to rearrange the data in-register, such that we can use a single vector store rather then a series of scalar stores. For func_4_8 the generated code vldr d16, LCPI0_0 vmov d17, r0, r1 vadd.i16 d16, d17, d16 vmov.u16 r0, d16[3] strb r0, [r2, #3] vmov.u16 r0, d16[2] strb r0, [r2, #2] vmov.u16 r0, d16[1] strb r0, [r2, #1] vmov.u16 r0, d16[0] strb r0, [r2] bx lr becomes vldr d16, LCPI0_0 vmov d17, r0, r1 vadd.i16 d16, d17, d16 vuzp.8 d16, d17 vst1.32 {d16[0]}, [r2, :32] bx lr I'm not fond of how this combine pessimizes 2012-03-13-DAGCombineBug.ll, but I couldn't think of a way to judiciously apply this combine. This ldrh r0, [r0, #4] strh r0, [r1] becomes vldr d16, [r0] vmov.u16 r0, d16[2] vmov.32 d16[0], r0 vuzp.16 d16, d17 vst1.32 {d16[0]}, [r1, :32] PR11158 rdar://10703339 llvm-svn: 154340	2012-04-09 20:32:02 +00:00
Chad Rosier	99cbde9e82	Update comments and remove unnecessary isVolatile() check. llvm-svn: 154336	2012-04-09 19:38:15 +00:00
Jim Grosbach	0c509fa6bf	Tidy up. 80 columns. llvm-svn: 154226	2012-04-06 23:43:50 +00:00
Chandler Carruth	8a102c21e3	There is no portable std::abs overload for int64_t, use the llvm::abs64 which exists for this purpose. llvm-svn: 154199	2012-04-06 20:10:52 +00:00
Jakob Stoklund Olesen	967b86a0a2	Allow negative immediates in ARM and Thumb2 compares. ARM and Thumb2 mode can use cmn instructions to compare against negative immediates. Thumb1 mode can't. llvm-svn: 154183	2012-04-06 17:45:04 +00:00
Rafael Espindola	ba0a6cabb8	Always compute all the bits in ComputeMaskedBits. This allows us to keep passing reduced masks to SimplifyDemandedBits, but know about all the bits if SimplifyDemandedBits fails. This allows instcombine to simplify cases like the one in the included testcase. llvm-svn: 154011	2012-04-04 12:51:34 +00:00
Evan Cheng	a40d40602c	ARM target should allow codegenprep to duplicate ret instructions to enable tailcall opt. rdar://11140249 llvm-svn: 153717	2012-03-30 01:24:39 +00:00
Lang Hames	591cdaf2ee	Try using vmov.i32 to materialize FP32 constants that can't be materialized by vmov.f32. llvm-svn: 153696	2012-03-29 21:56:11 +00:00
Craig Topper	f6e7e12f75	Remove unnecessary llvm:: qualifications llvm-svn: 153500	2012-03-27 07:21:54 +00:00
Craig Topper	5fa0caafc0	Prune includes and replace uses of ARMRegisterInfo.h with ARMBaeRegisterInfo.h llvm-svn: 153422	2012-03-26 00:45:15 +00:00
Craig Topper	07720d8dcd	Replace uses of ARMBaseInstrInfo and ARMTargetMachine with the Base versions. llvm-svn: 153421	2012-03-25 23:49:58 +00:00
Anton Korobeynikov	3edd854d64	Perform mul combine when multiplying wiht negative constants. Patch by Weiming Zhao! This fixes PR12212 llvm-svn: 153049	2012-03-19 19:19:50 +00:00
Craig Topper	188ed9d56e	Reorder includes to match coding standards. Fix an issue or two exposed by that. llvm-svn: 152978	2012-03-17 07:33:42 +00:00
Lang Hames	c35ee8b54a	Use vmov.f32 to materialize f32 consts on ARM. This relaxes constraints on register allocation by allowing all 32 D-registers to be used. Patch by Cameron Zwarich. llvm-svn: 152824	2012-03-15 18:49:02 +00:00
Craig Topper	bef78fc2ee	Convert more static tables of registers used by calling convention to uint16_t to reduce space. llvm-svn: 152538	2012-03-11 07:57:25 +00:00
Craig Topper	420525ce3b	Use uint16_t to store registers in callee saved register tables to reduce size of static data. llvm-svn: 151996	2012-03-04 03:33:22 +00:00
Evan Cheng	d12af5dc69	Neuter the optimization I implemented with r107852 and r108258 which turn some floating point equality comparisons into integer ones with -ffast-math. The issue is the optimization causes +0.0 != -0.0. Now the optimization is only done when one side is known to be 0.0. The other side's sign bit is masked off for the comparison. rdar://10964603 llvm-svn: 151861	2012-03-01 23:27:13 +00:00
Evan Cheng	65f9d19c4f	Re-commit r151623 with fix. Only issue special no-return calls if it's a direct call. llvm-svn: 151645	2012-02-28 18:51:51 +00:00
Daniel Dunbar	ee7b899343	Revert r151623 "Some ARM implementaions, e.g. A-series, does return stack prediction. ...", it is breaking the Clang build during the Compiler-RT part. llvm-svn: 151630	2012-02-28 15:36:07 +00:00
Evan Cheng	87c7b09d8d	Some ARM implementaions, e.g. A-series, does return stack prediction. That is, the processor keeps a return addresses stack (RAS) which stores the address and the instruction execution state of the instruction after a function-call type branch instruction. Calling a "noreturn" function with normal call instructions (e.g. bl) can corrupt RAS and causes 100% return misprediction so LLVM should use a unconditional branch instead. i.e. mov lr, pc b _foo The "mov lr, pc" is issued in order to get proper backtrace. rdar://8979299 llvm-svn: 151623	2012-02-28 06:42:03 +00:00
Jakob Stoklund Olesen	fa7a53746c	Switch ARM target to register masks. I'll let the buildbots determine the compile time improvements from this change, but 464.h264ref has 5% faster codegen at -O2. This patch does cause some assembly changes. Branch folding can make different decisions about calls with dead return values. CriticalAntiDepBreaker may choose different registers because its liveness tracking is affected. MachineCopyPropagation may sometimes leave a dead copy behind. llvm-svn: 151331	2012-02-24 01:19:29 +00:00
Dan Gohman	d4a77c4682	When emitting a cmp with 0 for a lowered select, mask out the high bits of the value carying the boolean condition, as their contents are undefined. This fixes rdar://10887484. llvm-svn: 151310	2012-02-24 00:09:36 +00:00
Evan Cheng	f258a15bdf	Canonicalize (srl (bswap x), 16) to (rotr (bswap x), 16) if the high 16 bits of x are zero. This optimizes rev + lsr 16 to rev16. rdar://10750814 llvm-svn: 151230	2012-02-23 02:58:19 +00:00
Evan Cheng	e87681cf34	Optimize a couple of common patterns involving conditional moves where the false value is zero. Instead of a cmov + op, issue an conditional op instead. e.g. cmp r9, r4 mov r4, #0 moveq r4, #1 orr lr, lr, r4 should be: cmp r9, r4 orreq lr, lr, #1 That is, optimize (or x, (cmov 0, y, cond)) to (or.cond x, y). Similarly extend this to xor as well as (and x, (cmov -1, y, cond)) => (and.cond x, y). It's possible to extend this to ADD and SUB but I don't think they are common. rdar://8659097 llvm-svn: 151224	2012-02-23 01:19:06 +00:00
Craig Topper	760b134ffa	Make all pointers to TargetRegisterClass const since they are all pointers to static data that should not be modified. llvm-svn: 151134	2012-02-22 05:59:10 +00:00
Evan Cheng	0460ae8d80	Proper support for a bastardized darwin-eabi hybird ABI. llvm-svn: 151083	2012-02-21 20:46:00 +00:00
James Molloy	547d4c0662	Improve generated code for extending loads and some trunc stores on ARM. Teach TargetSelectionDAG about lengthening loads for vector types and set v4i8 as legal. Allow FP_TO_UINT for v4i16 from v4i32. llvm-svn: 150956	2012-02-20 09:24:05 +00:00
Bill Wendling	05d6f2ff1e	Don't reserve the R0 and R1 registers here. We don't use these registers, and marking them as "live-in" into a BB ruins some invariants that the back-end tries to maintain. llvm-svn: 150437	2012-02-13 23:47:16 +00:00
Jason W Kim	c7f4841769	Make valgrind happy. llvm-svn: 150251	2012-02-10 16:07:59 +00:00
Craig Topper	e55c556a24	Convert assert(0) to llvm_unreachable llvm-svn: 149961	2012-02-07 02:50:20 +00:00
Anton Korobeynikov	d0c46550fe	Cleanups for EABI standard functions llvm-svn: 149195	2012-01-29 09:11:50 +00:00
Anton Korobeynikov	1b42e64280	Use base AAPCS for varargs functions even for AAPCS-VFP CC llvm-svn: 149194	2012-01-29 09:06:09 +00:00
David Blaikie	46a9f016c5	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
David Blaikie	5d8e42755c	Refactor variables unused under non-assert builds (& remove two entirely unused variables). llvm-svn: 148230	2012-01-16 05:17:39 +00:00
Benjamin Kramer	339ced4e34	Return an ArrayRef from ShuffleVectorSDNode::getMask and push it through CodeGen. llvm-svn: 148218	2012-01-15 13:16:05 +00:00
Jakob Stoklund Olesen	083dbdca7f	Match SelectionDAG logic for enabling movt. Darwin doesn't do static, and ELF targets only support static. llvm-svn: 147740	2012-01-07 20:49:15 +00:00
Benjamin Kramer	6898db6269	Remove VectorExtras. This unused helper was written for a type of API that is discouraged now. llvm-svn: 147738	2012-01-07 19:42:13 +00:00
Bob Wilson	1a74de9504	Add variants of the dispatchsetup pseudo for Thumb and !VFP. <rdar://10620138> My change r146949 added register clobbers to the eh_sjlj_dispatchsetup pseudo instruction, but on Thumb1 some of those registers cannot be used. This caused massive failures on the testsuite when compiling for Thumb1. While fixing that, I noticed that the eh_sjlj_setjmp instruction has a "nofp" variant, and I realized that dispatchsetup needs the same thing, so I have added that as well. llvm-svn: 147204	2011-12-22 23:39:48 +00:00
Eli Friedman	c9bf1b1bff	Make check a bit more strict so we don't call ARM_AM::getFP32Imm with a value that isn't a 32-bit value. (This is just to be safe; I don't think this actually causes any issues in practice.) llvm-svn: 146700	2011-12-15 22:56:53 +00:00
Chandler Carruth	637cc6a8aa	Initial CodeGen support for CTTZ/CTLZ where a zero input produces an undefined result. This adds new ISD nodes for the new semantics, selecting them when the LLVM intrinsic indicates that the undef behavior is desired. The new nodes expand trivially to the old nodes, so targets don't actually need to do anything to support these new nodes besides indicating that they should be expanded. I've done this for all the operand types that I could figure out for all the targets. Owners of various targets, please review and let me know if any of these are incorrect. Note that the expand behavior is conservatively correct, and exactly matches LLVM's current behavior with these operations. Ideally this patch will not change behavior in any way. For example the regtest suite finds the exact same instruction sequences coming out of the code generator. That's why there are no new tests here -- all of this is being exercised by the existing test suite. Thanks to Duncan Sands for reviewing the various bits of this patch and helping me get the wrinkles ironed out with expanding for each target. Also thanks to Chris for clarifying through all the discussions that this is indeed the approach he was looking for. That said, there are likely still rough spots. Further review much appreciated. llvm-svn: 146466	2011-12-13 01:56:10 +00:00
Stepan Dyatkovskiy	4683740967	Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Third attempt: simplified checks in test for armv7-apple-darwin11. llvm-svn: 146341	2011-12-11 14:35:48 +00:00
Chad Rosier	6641294e3b	Revert r146322 to appease buildbots. Original commit message: Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Second attempt. llvm-svn: 146328	2011-12-10 19:55:03 +00:00
Stepan Dyatkovskiy	df0b779e9f	Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Second attempt. llvm-svn: 146322	2011-12-10 08:42:24 +00:00
Eli Friedman	4e36a934dc	Splats can contain undef's; make sure to handle them correctly. PR11526. llvm-svn: 146299	2011-12-09 23:54:42 +00:00
Daniel Dunbar	c09e4593b2	Revert r146143, "Fix bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2).", it is failing tests. llvm-svn: 146157	2011-12-08 17:32:18 +00:00
Stepan Dyatkovskiy	a4bcf27dae	Fix bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). llvm-svn: 146143	2011-12-08 07:55:03 +00:00
Evan Cheng	7f8e563a69	Add bundle aware API for querying instruction properties and switch the code generator to it. For non-bundle instructions, these behave exactly the same as the MC layer API. For properties like mayLoad / mayStore, look into the bundle and if any of the bundled instructions has the property it would return true. For properties like isPredicable, only return true if all of the bundled instructions have the property. For properties like canFoldAsLoad, isCompare, conservatively return false for bundles. llvm-svn: 146026	2011-12-07 07:15:52 +00:00
Nick Lewycky	50f02cb21b	Move global variables in TargetMachine into new TargetOptions class. As an API change, now you need a TargetOptions object to create a TargetMachine. Clang patch to follow. One small functionality change in PTX. PTX had commented out the machine verifier parts in their copy of printAndVerify. That now calls the version in LLVMTargetMachine. Users of PTX who need verification disabled should rely on not passing the command-line flag to enable it. llvm-svn: 145714	2011-12-02 22:16:29 +00:00
Benjamin Kramer	7ba71be392	Move code into anonymous namespaces. llvm-svn: 145154	2011-11-26 23:01:57 +00:00
Bob Wilson	f6d1728d8f	Fix ARM SjLj-EH dispatch setup code. <rdar://problem/10444602> The EmitBasePointerRecalculation function has 2 problems, one minor and one fatal. The minor problem is that it inserts the code at the setjmp instead of in the dispatch block. The fatal problem is that at the point where this code runs, we don't know whether there will be a base pointer, so the entire function is a no-op. The base pointer recalculation needs to be handled as it was before, by inserting a pseudo instruction that gets expanded late. Most of the support for the old approach is still here, but it no longer has any connection to the eh_sjlj_dispatchsetup intrinsic. Clean up the parts related to the intrinsic and just generate the pseudo instruction directly. llvm-svn: 144781	2011-11-16 07:11:57 +00:00
Jay Foad	0745e645e0	Remove some unnecessary includes of PseudoSourceValue.h. llvm-svn: 144631	2011-11-15 07:24:32 +00:00
Evan Cheng	7ca4b6eb5c	Add vmov.f32 to materialize f32 immediate splats which cannot be handled by integer variants. rdar://10437054 llvm-svn: 144608	2011-11-15 02:12:34 +00:00
Eli Friedman	c4a001478c	Make sure to expand SIGN_EXTEND_INREG for NEON vectors. PR11319, round 3. llvm-svn: 144361	2011-11-11 03:16:38 +00:00
Eli Friedman	2d4055b683	Make sure we correctly unroll conversions between v2f64 and v2i32 on ARM. llvm-svn: 144241	2011-11-09 23:36:02 +00:00
Lang Hames	b85fcd07df	Lower mem-ops to unaligned i32/i16 load/stores on ARM where supported. Add support for trimming constants to GetDemandedBits. This fixes some funky constant generation that occurs when stores are expanded for targets that don't support unaligned stores natively. llvm-svn: 144102	2011-11-08 18:56:23 +00:00
Pete Cooper	82cd9e81fc	Added invariant field to the DAG.getLoad method and changed all calls. When this field is true it means that the load is from constant (runt-time or compile-time) and so can be hoisted from loops or moved around other memory accesses llvm-svn: 144100	2011-11-08 18:42:53 +00:00
Eli Friedman	6f84fed675	Make sure to mark vector extload's as expand on ARM. Fixes PR11319. llvm-svn: 144057	2011-11-08 01:43:53 +00:00
Dan Gohman	198b7ffc11	Reapply r143206, with fixes. Disallow physical register lifetimes across calls, and only check for nested dependences on the special call-sequence-resource register. llvm-svn: 143660	2011-11-03 21:49:52 +00:00
Lang Hames	1f4603d498	Fixed parameter name. llvm-svn: 143594	2011-11-02 23:37:04 +00:00
Lang Hames	9929c423a1	Try to lower memset/memcpy/memmove to vector instructions on ARM where the alignment permits. llvm-svn: 143582	2011-11-02 22:52:45 +00:00
Dan Gohman	9b9c970148	Revert r143206, as there are still some failing tests. llvm-svn: 143262	2011-10-29 00:41:52 +00:00
Dan Gohman	73057ad24f	Reapply r143177 and r143179 (reverting r143188), with scheduler fixes: Use a separate register, instead of SP, as the calling-convention resource, to avoid spurious conflicts with actual uses of SP. Also, fix unscheduling of calling sequences, which can be triggered by pseudo-two-address dependencies. llvm-svn: 143206	2011-10-28 17:55:38 +00:00
Duncan Sands	225a7037d6	Speculatively disable Dan's commits 143177 and 143179 to see if it fixes the dragonegg self-host (it looks like gcc is miscompiled). Original commit messages: Eliminate LegalizeOps' LegalizedNodes map and have it just call RAUW on every node as it legalizes them. This makes it easier to use hasOneUse() heuristics, since unneeded nodes can be removed from the DAG earlier. Make LegalizeOps visit the DAG in an operands-last order. It previously used operands-first, because LegalizeTypes has to go operands-first, and LegalizeTypes used to be part of LegalizeOps, but they're now split. The operands-last order is more natural for several legalization tasks. For example, it allows lowering code for nodes with floating-point or vector constants to see those constants directly instead of seeing the lowered form (often constant-pool loads). This makes some things somewhat more complicated today, though it ought to allow things to be simpler in the future. It also fixes some bugs exposed by Legalizing using RAUW aggressively. Remove the part of LegalizeOps that attempted to patch up invalid chain operands on libcalls generated by LegalizeTypes, since it doesn't work with the new LegalizeOps traversal order. Instead, define what LegalizeTypes is doing to be correct, and transfer the responsibility of keeping calls from having overlapping calling sequences into the scheduler. Teach the scheduler to model callseq_begin/end pairs as having a physical register definition/use to prevent calls from having overlapping calling sequences. This is also somewhat complicated, though there are ways it might be simplified in the future. This addresses rdar://9816668, rdar://10043614, rdar://8434668, and others. Please direct high-level questions about this patch to management. Delete #if 0 code accidentally left in. llvm-svn: 143188	2011-10-28 09:55:57 +00:00
Dan Gohman	4db3f7dd83	Eliminate LegalizeOps' LegalizedNodes map and have it just call RAUW on every node as it legalizes them. This makes it easier to use hasOneUse() heuristics, since unneeded nodes can be removed from the DAG earlier. Make LegalizeOps visit the DAG in an operands-last order. It previously used operands-first, because LegalizeTypes has to go operands-first, and LegalizeTypes used to be part of LegalizeOps, but they're now split. The operands-last order is more natural for several legalization tasks. For example, it allows lowering code for nodes with floating-point or vector constants to see those constants directly instead of seeing the lowered form (often constant-pool loads). This makes some things somewhat more complicated today, though it ought to allow things to be simpler in the future. It also fixes some bugs exposed by Legalizing using RAUW aggressively. Remove the part of LegalizeOps that attempted to patch up invalid chain operands on libcalls generated by LegalizeTypes, since it doesn't work with the new LegalizeOps traversal order. Instead, define what LegalizeTypes is doing to be correct, and transfer the responsibility of keeping calls from having overlapping calling sequences into the scheduler. Teach the scheduler to model callseq_begin/end pairs as having a physical register definition/use to prevent calls from having overlapping calling sequences. This is also somewhat complicated, though there are ways it might be simplified in the future. This addresses rdar://9816668, rdar://10043614, rdar://8434668, and others. Please direct high-level questions about this patch to management. llvm-svn: 143177	2011-10-28 01:29:32 +00:00
Lang Hames	c47e283430	Make sure short memsets on ARM lower to stores, even when optimizing for size. llvm-svn: 143055	2011-10-26 20:56:52 +00:00
James Molloy	dd9137aa56	Revert r142530 at least temporarily while a discussion is had on llvm-commits regarding exactly how much optsize should optimize for size over performance. llvm-svn: 143023	2011-10-26 08:53:19 +00:00
Bill Wendling	1414bc5a14	Use a worklist to prevent the iterator from becoming invalidated because of the 'removeSuccessor' call. Noticed in a Release+Asserts+Check buildbot. llvm-svn: 143018	2011-10-26 07:16:18 +00:00
Evan Cheng	043c9d3f7a	Revert part of r142530. The patch potentially hurts performance especially on Darwin platforms where -Os means optimize for size without hurting performance. llvm-svn: 143002	2011-10-26 01:17:44 +00:00
Eli Friedman	a5e244c08d	Don't crash on variable insertelement on ARM. PR10258. llvm-svn: 142871	2011-10-24 23:08:52 +00:00
Dan Gohman	4ed1afa51d	Change this overloaded use of Sched::Latency to be an overloaded use of Sched::ILP instead, as Sched::Latency is going away. llvm-svn: 142813	2011-10-24 17:55:11 +00:00
Bill Wendling	94e6643fce	The different flavors of ARM have different valid subsets of registers. Check that the set of callee-saved registers is correct for the specific platform. <rdar://problem/10313708> & ctor_dtor_count & ctor_dtor_count-2 llvm-svn: 142706	2011-10-22 00:29:28 +00:00
Bill Wendling	cf7bdf4438	Add missing operand. <rdar://problem/10313323> llvm-svn: 142615	2011-10-20 20:37:11 +00:00
James Molloy	2d768fd379	Use literal pool loads instead of MOVW/MOVT for materializing global addresses when optimizing for size. On spec/gcc, this caused a codesize improvement of ~1.9% for ARM mode and ~4.9% for Thumb(2) mode. This is codesize including literal pools. The pools themselves doubled in size for ARM mode and quintupled for Thumb mode, leaving suggestion that there is still perhaps redundancy in LLVM's use of constant pools that could be decreased by sharing entries. Fixes PR11087. llvm-svn: 142530	2011-10-19 14:11:07 +00:00
Bill Wendling	2977a15ab1	Make sure we emit the 'movw' and 'movt' only if it's supported. Otherwise, use a constant pool. llvm-svn: 142485	2011-10-19 09:24:02 +00:00
Bill Wendling	7c1634556d	Remove some dead code. llvm-svn: 142484	2011-10-19 09:04:11 +00:00
Bill Wendling	94f60018e0	Emit the MOVT instruction only if the # LPads is > 64K. llvm-svn: 142460	2011-10-18 23:19:55 +00:00
Bill Wendling	64e6bfc16c	For Thumb mode, we need to use a constant pool if the value is too large to be used with the CMP instruction. llvm-svn: 142458	2011-10-18 23:11:05 +00:00
Bill Wendling	4969dcdef9	Use the integer compare when the value is small enough. Use the "move into a register and then compare against that" method when it's too large. We have to move the value into the register in the "movw, movt" pair of instructions. llvm-svn: 142440	2011-10-18 22:52:20 +00:00
Bill Wendling	85833f71c6	Use the integer compare when the value is small enough. Use the "move into a register and then compare against that" method when it's too large. We have to move the value into the register in the "movw, movt" pair of instructions. llvm-svn: 142437	2011-10-18 22:49:07 +00:00
Bill Wendling	973c817cde	The value we're comparing against may be too large for the ARM CMP instruction. Move the value into a register and then use that for the CMP. <rdar://problem/10305266> llvm-svn: 142431	2011-10-18 22:11:18 +00:00
Bill Wendling	b2a703d352	The immediate may be too large for the CMP instruction. Move it into a register and use that in the CMP. <rdar://problem/10305266> llvm-svn: 142429	2011-10-18 21:55:58 +00:00
Andrew Trick	88b2450adc	Use ARM/t2PseudoInst class from ARM/Thumb2 special adds/subs patterns. Clean up the patterns, fix comments, and avoid confusing both tools and coders. Note that the special adds/subs SelectionDAG nodes no longer have the dummy cc_out operand. llvm-svn: 142397	2011-10-18 19:18:52 +00:00
Bob Wilson	93b0f7b319	Use isIntN and isUIntN to check for valid signed/unsigned numbers. llvm-svn: 142395	2011-10-18 18:46:49 +00:00
Andrew Trick	3f07c429b5	whitespace llvm-svn: 142394	2011-10-18 18:40:53 +00:00
Bill Wendling	617075fcf6	A landing pad could have more than one predecessor. In that case, we want that predecessor to remove the jump to it as well. Delay clearing the 'landing pad' flag until after the jumps have been removed. (There is an implicit assumption in several modules that an MBB which jumps to a landing pad has only two successors.) <rdar://problem/10304224> llvm-svn: 142390	2011-10-18 18:30:49 +00:00
Bob Wilson	9258b76d8d	Fix incorrect check for sign-extended constant BUILD_VECTOR. <rdar://problem/10298332> llvm-svn: 142371	2011-10-18 17:34:51 +00:00
Duncan Sands	d278d35b13	Fix a bunch of unused variable warnings when doing a release build with gcc-4.6. llvm-svn: 142350	2011-10-18 12:44:00 +00:00
Bill Wendling	510fbcd440	Don't renumber the blocks here. This could cause problems later on if another pass renumbers the blocks again. llvm-svn: 142258	2011-10-17 21:32:56 +00:00
Bill Wendling	f7f223f69e	Add a call to EmitSjLjDispatchBlock. Once the intrinsics are marked as having a custom inserter, it will call this method to emit the dispatch table into the machine function. llvm-svn: 142245	2011-10-17 20:37:20 +00:00
Bill Wendling	26d2780d07	Add comment explaining that the order of processing doesn't matter here. llvm-svn: 142176	2011-10-17 05:25:09 +00:00
Nadav Rotem	097106b77a	ARM cannot select a pattern for trunc-store v4i8; /ARM/vrev.ll fails when promoting elements. llvm-svn: 142080	2011-10-15 20:03:12 +00:00
Bill Wendling	9c1019c6c7	Mark registers as DEAD because they're really just clobbers. llvm-svn: 142027	2011-10-15 00:27:44 +00:00
Eli Friedman	74d1da5a05	Add missing correctness check to ARMTargetLowering::ReconstructShuffle. Fixes PR11129. llvm-svn: 142022	2011-10-14 23:58:49 +00:00
Bill Wendling	9e0cd1ee17	Make sure that the register is in the register class before adding it as a machine op. llvm-svn: 142021	2011-10-14 23:55:44 +00:00
Bill Wendling	6f3f9a391e	Mark the invoke call instruction as implicitly defining the callee-saved registers. The callee-saved registers cannot be live across an invoke call because the control flow may continue along the exceptional edge. When this happens, all of the callee-saved registers are no longer valid. llvm-svn: 142018	2011-10-14 23:34:37 +00:00
Eli Friedman	aa6ec39056	Simplify and avoid undefined shift. Based on patch by Ahmed Charles. llvm-svn: 141903	2011-10-13 22:40:23 +00:00
Bill Wendling	a7d697e4a6	Reapply r141365 now that PR11107 is fixed. llvm-svn: 141591	2011-10-10 22:59:55 +00:00
Bill Wendling	47aac51043	Revert r141365. It was causing MultiSource/Benchmarks/MiBench/consumer-lame to hang, and possibly SPEC/CINT2006/464_h264ref. llvm-svn: 141560	2011-10-10 18:27:30 +00:00
Bill Wendling	883ec97115	Take all of the invoke basic blocks and make the dispatch basic block their new successor. Remove the old landing pad from their successor list, because it's now the successor of the dispatch block. Now that the landing pad blocks are no longer the destination of invokes, we can mark them as normal basic blocks instead of landing pads. This more closely resembles what the CFG is actually doing. llvm-svn: 141436	2011-10-07 23:18:02 +00:00
Bill Wendling	f9f5e455d4	Take the code that was emitted for the llvm.eh.dispatch.setup intrinsic and emit it with the new SjLj emitter stuff. This way there's no need to emit that kind-of-hacky intrinsic. llvm-svn: 141419	2011-10-07 22:08:37 +00:00
Bill Wendling	7ecfbd90ef	Thread the chain through the eh.sjlj.setjmp intrinsic, like it's documented to do. This will be useful later on with the new SJLJ stuff. llvm-svn: 141416	2011-10-07 21:25:38 +00:00
Bob Wilson	8decdc472f	Reenable tail calls for iOS 5.0 and later. llvm-svn: 141370	2011-10-07 17:17:49 +00:00
Bob Wilson	bc1589945d	Reenable use of divmod compiler_rt functions for iOS 5.0 and later. llvm-svn: 141368	2011-10-07 16:59:21 +00:00
Anton Korobeynikov	318d6bae80	Peephole optimization for ABS on ARM. Patch by Ana Pazos! llvm-svn: 141365	2011-10-07 16:15:08 +00:00
Bill Wendling	8d50ea0f82	Use the correct vreg here. llvm-svn: 141342	2011-10-06 23:41:14 +00:00
Bill Wendling	b3d4678877	Generate the dispatch code for a 'thumb' function. This is very similar to the others. They take the call site value. Determine if it's a proper value. And then jumps to the correct call site via a jump table. llvm-svn: 141341	2011-10-06 23:37:36 +00:00
Bill Wendling	5626c66a89	Generate the dispatch table for ARM mode. llvm-svn: 141327	2011-10-06 22:53:00 +00:00
Bill Wendling	030b58e5c9	Refactor some of the code that sets up the entry block for SjLj EH. No functionality change. llvm-svn: 141323	2011-10-06 22:18:16 +00:00
Bill Wendling	31d973cde6	Use a thumb ORR instead of thumb2 ORR when in thumb-only mode. (Picky! Picky!) Place the immediate to OR into a register so that it works. llvm-svn: 141319	2011-10-06 21:51:21 +00:00
Bill Wendling	362c1b01cc	* Set the low bit of the return address when we are in thumb mode. * Some code cleanup. llvm-svn: 141317	2011-10-06 21:29:56 +00:00
Bill Wendling	6134655f08	Add the MBBs before inserting the instructions. Doing it afterwards could lead to an infinite loop because of the def-use chains. Also use a frame load instead of store for the LD instruction. llvm-svn: 141263	2011-10-06 00:53:33 +00:00
Bill Wendling	f793e7ed5c	Get the proper call site numbers for the landing pads. Also remove a magic number (18) for the proper addressing mode. llvm-svn: 141245	2011-10-05 23:28:57 +00:00
Bill Wendling	324be98a3c	Look at the number of entries in the jump table and jump to a 'trap' block if the value exceeds that number. llvm-svn: 141143	2011-10-05 00:39:32 +00:00
Bill Wendling	202803e39c	Checkpoint for SJLJ EH code. This is a first pass at generating the jump table for the sjlj dispatch. It currently generates something plausible, but hasn't been tested thoroughly. llvm-svn: 141140	2011-10-05 00:02:33 +00:00
Bill Wendling	1eab54f8ba	Use the PC label ID rather than '1'. Add support for thumb-2, because I heard that some people use it. llvm-svn: 141042	2011-10-03 22:44:15 +00:00
Bill Wendling	374ee194f2	Check-pointing the new SjLj EH lowering. This code will replace the version in ARMAsmPrinter.cpp. It creates a new machine basic block, which is the dispatch for the return from a longjmp call. It then shoves the address of that machine basic block into the correct place in the function context so that the EH runtime will jump to it directly instead of having to go through a compare-and-jump-to-the-dispatch bit. This should be more efficient in the common case. llvm-svn: 141031	2011-10-03 21:25:38 +00:00
Bill Wendling	c214cb055d	Use the new ARMConstantPoolSymbol class to handle external symbols. llvm-svn: 140939	2011-10-01 08:58:29 +00:00
Bill Wendling	7753d66468	Switch over to using ARMConstantPoolConstant for global variables, functions, and block addresses. llvm-svn: 140936	2011-10-01 08:00:54 +00:00
Jim Grosbach	efc761a1eb	ARM fix encoding of VMOV.f32 and VMOV.f64 immediates. Encode the immediate into its 8-bit form as part of isel rather than later, which simplifies things for mapping the encoding bits, allows the removal of the custom disassembler decoding hook, makes the operand printer trivial, and prepares things more cleanly for handling these in the asm parser. rdar://10211428 llvm-svn: 140834	2011-09-30 00:50:06 +00:00
Evan Cheng	8156376aa9	Tighten a ARM dag combine condition to avoid an identity transformation, which ends up introducing a cycle in the DAG. rdar://10196296 llvm-svn: 140733	2011-09-28 23:16:31 +00:00
David Meyer	b1fbf9ff26	PR11004: Inline memcpy to avoid generating nested call sequence. Un-XFAIL 2011-06-09-TailCallByVal and 2010-11-04-BigByval llvm-svn: 140516	2011-09-26 06:13:20 +00:00
Andrew Trick	924123acb3	Lower ARM adds/subs to add/sub after adding optional CPSR operand. This is still a hack until we can teach tblgen to generate the optional CPSR operand rather than an implicit CPSR def. But the strangeness is now limited to the selection DAG. ADD/SUB MI's no longer have implicit CPSR defs, nor do we allow flag setting variants of these opcodes in machine code. There are several corner cases to consider, and getting one wrong would previously lead to nasty miscompilation. It's not the first time I've debugged one, so this time I added enough verification to ensure it won't happen again. llvm-svn: 140228	2011-09-21 02:20:46 +00:00
Andrew Trick	8586e62d91	ARM isel bug fix for adds/subs operands. Modified ARMISelLowering::AdjustInstrPostInstrSelection to handle the full gamut of CPSR defs/uses including instructins whose "optional" cc_out operand is not really optional. This allowed removal of the hasPostISelHook to simplify the .td files and make the implementation more robust. Fixes rdar://10137436: sqlite3 miscompile llvm-svn: 140134	2011-09-20 03:17:40 +00:00
Andrew Trick	53df4b6dfa	whitespace llvm-svn: 140133	2011-09-20 03:06:13 +00:00
Jim Grosbach	9c0b86a76d	Thumb2 assembly parsing and encoding for STR. More addressing mode encoding bits. Handle pre increment for STR/STRB/STRH and STR(register). llvm-svn: 139949	2011-09-16 21:55:56 +00:00
Eli Friedman	10f9ce2b7d	Minor cleanup. llvm-svn: 139869	2011-09-15 22:26:18 +00:00
Eli Friedman	ba912e06c2	Use a more efficient lowering for Unordered/Monotonic atomic load/store on Thumb1. llvm-svn: 139865	2011-09-15 22:18:49 +00:00
Jim Grosbach	e7e2aca322	Tidy up a few 80 column violations. llvm-svn: 139636	2011-09-13 20:30:37 +00:00
Owen Anderson	29cfe6c368	Thumb unconditional branches are allowed in IT blocks, and therefore should have a predicate operand, unlike conditional branches. llvm-svn: 139415	2011-09-09 21:48:23 +00:00
Jim Grosbach	a05627ebaf	Thumb2 assembly parsing and encoding for LDREX/LDREXB/LDREXD/LDREXH. llvm-svn: 139381	2011-09-09 18:37:27 +00:00
Duncan Sands	f2641e1bc1	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. llvm-svn: 139159	2011-09-06 19:07:46 +00:00
Evan Cheng	0b758ed6ba	Fix fall outs from my recent change on how carry bit is modeled during isel. Now the 'S' instructions, e.g. ADDS, treat S bit as optional operand as well. Also fix isel hook to correctly set the optional operand. rdar://10073745 llvm-svn: 139157	2011-09-06 18:52:20 +00:00
Eli Friedman	d7776ed030	Null-initialize to shut up -Wuninitialized warnings. llvm-svn: 138974	2011-09-01 22:27:41 +00:00
Eli Friedman	1ccecbb9d3	64-bit atomic cmpxchg for ARM. llvm-svn: 138868	2011-08-31 17:52:22 +00:00
Eli Friedman	c3f9c4a852	Some 64-bit atomic operations on ARM. 64-bit cmpxchg coming next. llvm-svn: 138845	2011-08-31 00:31:29 +00:00
Evan Cheng	e6fba77971	Follow up to r138791. Add a instruction flag: hasPostISelHook which tells the pre-RA scheduler to call a target hook to adjust the instruction. For ARM, this is used to adjust instructions which may be setting the 's' flag. ADC, SBC, RSB, and RSC instructions have implicit def of CPSR (required since it now uses CPSR physical register dependency rather than "glue"). If the carry flag is used, then the target hook will fill in the optional operand with CPSR. Otherwise, the hook will remove the CPSR implicit def from the MachineInstr. llvm-svn: 138810	2011-08-30 19:09:48 +00:00
Evan Cheng	e891654a58	Change ARM / Thumb2 addc / adde and subc / sube modeling to use physical register dependency (rather than glue them together). This is general goodness as it gives scheduler more freedom. However it is motivated by a nasty bug in isel. When a i64 sub is expanded to subc + sube. libcall #1 \ \ subc \ / \ \ / \ \ / libcall #2 sube If the libcalls are not serialized (i.e. both have chains which are dag entry), legalizer can serialize them in arbitrary orders. If it's unlucky, it can force libcall #2 before libcall #1 in the above case. subc \| libcall #2 \| libcall #1 \| sube However since subc and sube are "glued" together, this ends up being a cycle when the scheduler combine subc and sube as a single scheduling unit. The right solution is to fix LegalizeType too chains the libcalls together. However, LegalizeType is not processing nodes in order so that's harder than it should be. For now, the move to physical register dependency will do. rdar://10019576 llvm-svn: 138791	2011-08-30 01:34:54 +00:00
Eli Friedman	7dfa791f4f	Expand ATOMIC_LOAD and ATOMIC_STORE for architectures I don't know well enough to fix properly. llvm-svn: 138751	2011-08-29 18:23:02 +00:00
Benjamin Kramer	61a1ff543c	Silence GCC warnings and make an array const. llvm-svn: 138706	2011-08-27 17:36:14 +00:00
Eli Friedman	452aae6202	Atomic load/store on ARM/Thumb. I don't really like the patterns, but I'm having trouble coming up with a better way to handle them. I plan on making other targets use the same legalization ARM-without-memory-barriers is using... it's not especially efficient, but if anyone cares, it's not that hard to fix for a given target if there's some better lowering. llvm-svn: 138621	2011-08-26 02:59:24 +00:00
Jim Grosbach	f402f694e2	ARM expansion of pre-indexed store pseudos should maintain memoperands. Partial fix for rdar://9945172. llvm-svn: 137513	2011-08-12 21:02:34 +00:00
Duncan Sands	a41634e307	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. llvm-svn: 137460	2011-08-12 14:54:45 +00:00
Jim Grosbach	d886f8cd8d	ARM STRH assembly parsing and encoding. llvm-svn: 137353	2011-08-11 21:17:22 +00:00
Jim Grosbach	5e80abbb5d	ARM fix typo in pre-indexed store lowering. rdar://9915869 llvm-svn: 137148	2011-08-09 21:22:41 +00:00
Jim Grosbach	f0c95cadc7	ARM refactor indexed store instructions. Refactor STR[B] pre and post indexed instructions to use addressing modes for memory operands, which is necessary for assembly parsing and is more consistent with the rest of the memory instruction definitions. Make some incremental progress on refactoring away the mega-operand addrmode2 along the way, which is nice. llvm-svn: 136978	2011-08-05 20:35:44 +00:00
Eli Friedman	30a49e93e3	New approach to r136737: insert the necessary fences for atomic ops in platform-independent code, since a bunch of platforms (ARM, Mips, PPC, Alpha are the relevant targets here) need to do essentially the same thing. I think this completes the basic CodeGen for atomicrmw and cmpxchg. llvm-svn: 136813	2011-08-03 21:06:02 +00:00
Eli Friedman	5c863aeefd	ARM backend support for atomicrmw and cmpxchg with non-monotonic ordering. Not especially pretty, but seems to work well enough. If this looks okay, I'll put together similar patches for Mips, PPC, and Alpha. llvm-svn: 136737	2011-08-02 22:44:16 +00:00
Eric Christopher	aa5030066f	Add support for the 'Q' constraint. Fixes rdar://9866494 llvm-svn: 136523	2011-07-29 21:18:58 +00:00
Eli Friedman	26a484852e	Code generation for 'fence' instruction. llvm-svn: 136283	2011-07-27 22:21:52 +00:00
Jim Grosbach	8b31ef50c0	ARM extend instructions simplification. Refactor the SXTB, SXTH, SXTB16, UXTB, UXTH, and UXTB16 instructions to not have an 'r' and an 'r_rot' version, but just a single version with a rotate that can be zero. Use plain Pat<>'s for the ISel of the non-rotated version. llvm-svn: 136225	2011-07-27 16:47:19 +00:00
Owen Anderson	b595ed0085	Split up the ARM so_reg ComplexPattern into so_reg_reg and so_reg_imm, allowing us to distinguish the encodings that use shifted registers from those that use shifted immediates. This is necessary to allow the fixed-length decoder to distinguish things like BICS vs LDRH. llvm-svn: 135693	2011-07-21 18:54:16 +00:00
Evan Cheng	a20cde31e7	Sink ARMMCExpr and ARMAddressingModes into MC layer. First step to separate ARM MC code from target. llvm-svn: 135636	2011-07-20 23:34:39 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Evan Cheng	f863e3fb73	Improve codegen for select's: if (x != 0) x = 1 if (x == 1) x = 1 Previous codegen looks like this: mov r1, r0 cmp r1, #1 mov r0, #0 moveq r0, #1 The naive lowering select between two different values. It should recognize the test is equality test so it's more a conditional move rather than a select: cmp r0, #1 movne r0, #0 rdar://9758317 llvm-svn: 135017	2011-07-13 00:42:17 +00:00
Cameron Zwarich	f03fa189ca	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Jim Grosbach	3840c90f73	Add more info to FIXME. llvm-svn: 134729	2011-07-08 20:18:11 +00:00
Jim Grosbach	cf1464d943	ARMv7M vs. ARMv7E-M support. The DSP instructions in the Thumb2 instruction set are an optional extension in the Cortex-M* archtitecture. When present, the implementation is considered an "ARMv7E-M implementation," and when not, an "ARMv7-M implementation." Add a subtarget feature hook for the v7e-m instructions and hook it up. The cortex-m3 cpu is an example of a v7m implementation, while the cortex-m4 is a v7e-m implementation. rdar://9572992 llvm-svn: 134261	2011-07-01 21:12:19 +00:00
Eric Christopher	29f1db85dd	Add support for the 'j' immediate constraint. This is conditionalized on supporting the instruction that the constraint is for 'movw'. Part of rdar://9119939 llvm-svn: 134222	2011-07-01 01:00:07 +00:00
Eric Christopher	c011d31543	Add support for the ARM 't' register constraint. And another testcase for the 'x' register constraint. Part of rdar://9119939 llvm-svn: 134220	2011-07-01 00:30:46 +00:00
Eric Christopher	f09b0f1043	We'll return a null RC by default if we can't match. Part of rdar://9119939 llvm-svn: 134217	2011-07-01 00:19:27 +00:00
Eric Christopher	f1c74595aa	Add support for the 'x' constraint. Part of rdar://9307836 and rdar://9119939 llvm-svn: 134215	2011-07-01 00:14:47 +00:00
Eric Christopher	1f054f27af	Capitalize the unsigned part of the initializer. llvm-svn: 134211	2011-06-30 23:59:16 +00:00

... 2 3 4 5 6 ...

986 Commits