llvm-project

Commit Graph

Author	SHA1	Message	Date
Jakob Stoklund Olesen	00264624a9	Convert EXTRACT_SUBREG to COPY when emitting machine instrs. EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead. Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg(). The isMoveInstr hook will be removed later. llvm-svn: 107879	2010-07-08 16:40:22 +00:00
Evan Cheng	be1f7a931e	r107852 is only safe with -enable-unsafe-fp-math to account for +0.0 == -0.0. llvm-svn: 107856	2010-07-08 06:01:49 +00:00
Evan Cheng	25f9364cbd	Optimize some vfp comparisons to integer ones. This patch implements the simplest case when the following conditions are met: 1. The arguments are f32. 2. The arguments are loads and they have no uses other than the comparison. 3. The comparison code is EQ or NE. e.g. vldr.32 s0, [r1] vldr.32 s1, [r0] vcmpe.f32 s1, s0 vmrs apsr_nzcv, fpscr beq LBB0_2 => ldr r1, [r1] ldr r0, [r0] cmp r0, r1 beq LBB0_2 More complicated cases will be implemented in subsequent patches. llvm-svn: 107852	2010-07-08 02:08:50 +00:00
Dale Johannesen	e2289285ae	Changes to ARM tail calls, mostly cosmetic. Add explicit testcases for tail calls within the same module. Duplicate some code to humor those who think .w doesn't apply on ARM. Leave this disabled on Thumb1, and add some comments explaining why it's hard and won't gain much. llvm-svn: 107851	2010-07-08 01:18:23 +00:00
Jim Grosbach	73ef80f76f	grammar llvm-svn: 107831	2010-07-07 22:53:35 +00:00
Jim Grosbach	40eda1076a	Handle cases where the post-RA scheduler may move instructions between the address calculation instructions leading up to a jump table when we're trying to convert them into a TB[H] instruction in Thumb2. This realistically shouldn't happen much, if at all, for well formed inputs, but it's more correct to handle it. rdar://7387682 llvm-svn: 107830	2010-07-07 22:51:22 +00:00
Jim Grosbach	e4ba2aa0c4	grammar and trailing whitespace llvm-svn: 107811	2010-07-07 21:06:51 +00:00
Dan Gohman	fe7532a308	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Bob Wilson	5bc8a79e7f	Also use REG_SEQUENCE for VTBX instructions. llvm-svn: 107743	2010-07-07 00:08:54 +00:00
Jim Grosbach	3198483851	Mark eh.sjlj.set/longjmp custom lowerings as Darwin-only since that's where they've been tested to work. llvm-svn: 107742	2010-07-07 00:07:57 +00:00
Jim Grosbach	dc0a0659be	By default, the eh.sjlj.setjmp/longjmp intrinsics should just do nothing rather than assuming a target will custom lower them. Targets which do so should exlicitly mark them as having custom lowerings. PR7454. llvm-svn: 107734	2010-07-06 23:44:52 +00:00
Bob Wilson	3ed511bc6b	Use REG_SEQUENCE nodes to make the table registers for VTBL instructions be allocated to consecutive registers. llvm-svn: 107730	2010-07-06 23:36:25 +00:00
Jakob Stoklund Olesen	48deb12593	Track defs for all aliases in NEONMoveFix. This means that an instruction defining an S register will affect the domain of the parent D register. llvm-svn: 107725	2010-07-06 23:26:23 +00:00
Devang Patel	a3ca21b228	Propagate debug loc. llvm-svn: 107710	2010-07-06 22:08:15 +00:00
Bob Wilson	4c1ca29039	Represent NEON load/store alignments in bytes, not bits. llvm-svn: 107701	2010-07-06 21:26:18 +00:00
Dan Gohman	3439629239	Reapply r107655 with fixes; insert the pseudo instruction into the block before calling the expansion hook. And don't put EFLAGS in a mbb's live-in list twice. llvm-svn: 107691	2010-07-06 20:24:04 +00:00
Rafael Espindola	7c510aa7bc	Don't create neon moves in CopyRegToReg. NEONMoveFixPass will do the conversion if profitable. llvm-svn: 107673	2010-07-06 16:24:34 +00:00
Dan Gohman	f4f04107ef	Revert r107655. llvm-svn: 107668	2010-07-06 15:49:48 +00:00
Dan Gohman	12205645a6	Fix a bunch of custom-inserter functions to handle the case where the pseudo instruction is not at the end of the block. llvm-svn: 107655	2010-07-06 15:18:19 +00:00
Evan Cheng	0664a67fe1	Remove isSS argument from CreateFixedObject. Fixed objects cannot be spill slots so it's always false. llvm-svn: 107550	2010-07-03 00:40:23 +00:00
Evan Cheng	c3525dc0fd	Remove early IT block formation. It's not used. llvm-svn: 107513	2010-07-02 21:07:09 +00:00
Bob Wilson	771d04b969	Fix incorrect asm-printing of some NEON immediates. Fix weak testcase so that it checks the immediate values, not just the instructions opcodes. Radar 8110263. llvm-svn: 107487	2010-07-02 17:23:44 +00:00
Bob Wilson	8a99b730a9	ARM function alignments were off by a power of two. svn 83242 changed getFunctionAlignment and the corresponding use of that value in the ARM asm printer, but now we're using the standard asm printer. The result of this was that function alignments were dropped completely for Thumb functions. Radar 8143571. llvm-svn: 107435	2010-07-01 22:26:26 +00:00
Bob Wilson	be157b0ea8	Add support for encoding VDUP (ARM core register) instructions. llvm-svn: 107201	2010-06-29 20:13:29 +00:00
Bob Wilson	ab0819e10d	Add support for encoding NEON VMOV (from core register to scalar) instructions. The encoding is the same as VMOV (from scalar to core register) except that the operands are in different places. llvm-svn: 107167	2010-06-29 17:34:07 +00:00
Jim Grosbach	5bee07ec68	skip dbg_value instructions llvm-svn: 107154	2010-06-29 16:55:24 +00:00
Bob Wilson	83b993a977	The t2MOVi16 and t2MOVTi16 instructions do not set CPSR. Trying to add a CPSR operand to them causes an assertion failure, so apparently these instructions haven't been getting a lot of use. llvm-svn: 107147	2010-06-29 16:25:11 +00:00
Rafael Espindola	38a7d7cbc3	Add a VT argument to getMinimalPhysRegClass and replace the copy related uses of getPhysicalRegisterRegClass with it. If we want to make a copy (or estimate its cost), it is better to use the smallest class as more efficient operations might be possible. llvm-svn: 107140	2010-06-29 14:02:34 +00:00
Duncan Sands	193bb1ee6a	Remove pointless variable LastDef. llvm-svn: 107135	2010-06-29 13:23:22 +00:00
Duncan Sands	257eba4df7	Remove unused variable Loc and pointless variables unified_syntax and thumb_mode. llvm-svn: 107133	2010-06-29 13:04:35 +00:00
Duncan Sands	78ad27ca2b	Remove an unused and a pointless variable. llvm-svn: 107131	2010-06-29 13:00:29 +00:00
Duncan Sands	6d28e73acc	Remove initialized but otherwise unused variables. llvm-svn: 107127	2010-06-29 11:22:26 +00:00
Evan Cheng	b59dd8f10a	PR7503: uxtb16 is not available for ARMv7-M. Patch by Brian G. Lucas. llvm-svn: 107122	2010-06-29 05:38:36 +00:00
Evan Cheng	0c30739cbb	Change if-cvt options to something that actually as useable. llvm-svn: 107121	2010-06-29 05:37:59 +00:00
Jakob Stoklund Olesen	c1eccbc468	When no memoperands are present, assume unaligned, volatile. llvm-svn: 107114	2010-06-29 01:13:07 +00:00
Bob Wilson	3d12ff797b	Fix Thumb encoding of VMOV (scalar to ARM core register). The encoding is the same as ARM except that the condition code field is always set to ARMCC::AL. llvm-svn: 107107	2010-06-29 00:26:13 +00:00
Bob Wilson	4469a892b4	Make the ARMCodeEmitter identify Thumb functions via ARMFunctionInfo instead of the Subtarget. llvm-svn: 107086	2010-06-28 22:23:17 +00:00
Jim Grosbach	f31c004666	tidy up style. no functional change. llvm-svn: 107073	2010-06-28 21:29:17 +00:00
Bob Wilson	544317dfda	Refactor encoding function for NEON 1-register with modified immediate format. llvm-svn: 107070	2010-06-28 21:16:30 +00:00
Bob Wilson	584387d5e3	Support Thumb mode encoding of NEON instructions. llvm-svn: 107068	2010-06-28 21:12:19 +00:00
Jim Grosbach	7ea5fc0794	minor housekeeping cleanup: 80-column, trailing whitespace, spelling, etc.. No functional change. llvm-svn: 106988	2010-06-28 04:27:01 +00:00
Eli Friedman	8cfa7713e9	Followup to r106770: actually generate SXTB and SXTH for sign-extensions. llvm-svn: 106940	2010-06-26 04:36:50 +00:00
Bob Wilson	0248da9db4	Add support for encoding NEON VMOV (from scalar to core register) instructions. llvm-svn: 106938	2010-06-26 04:07:15 +00:00
Evan Cheng	b71233f34d	It's now possible to run code placement pass for ARM. llvm-svn: 106935	2010-06-26 01:52:05 +00:00
Bob Wilson	b4d39841e4	Renumber NEON instruction formats to be consecutive. llvm-svn: 106927	2010-06-26 00:05:09 +00:00
Bob Wilson	cc386fb125	Rename ARM instruction formats NEONGetLnFrm, NEONSetLnFrm and NEONDupFrm to "N..." instead of "NEON..." for consistency with the other NEON format names. llvm-svn: 106921	2010-06-25 23:56:05 +00:00
Bob Wilson	d66f66a5cf	Remove unused NEONFrm and ThumbMiscFrm ARM instruction formats. Renumber MiscFrm to 25. llvm-svn: 106916	2010-06-25 23:45:37 +00:00
Daniel Dunbar	acbdf53db4	Thumb2ITBlockPass: Fix a possible dereference of an invalid iterator. This was introduced in r106343, but only showed up recently (with a particular compiler & linker combination) because of the particular check, and because we have no builtin checking for dereferencing the end of an array, which is truly unfortunate. llvm-svn: 106908	2010-06-25 23:14:54 +00:00
Evan Cheng	02b184de5b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Bob Wilson	2530ca0647	Add support for encoding 3-register NEON instructions, and fix emitNEON2RegInstruction's handling of 2-address operands. llvm-svn: 106900	2010-06-25 22:40:46 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Bob Wilson	e70c8b150b	Add support for encoding 2-register NEON instructions. llvm-svn: 106891	2010-06-25 21:17:19 +00:00
Bob Wilson	574f68f815	Fix indentation. llvm-svn: 106881	2010-06-25 20:54:44 +00:00
Jim Grosbach	ba3ece6f27	IT instructions are considered to be scheduling hazards, but are scheduled with the following instructions. This is done via trickery by considering the instruction preceding the IT to be the hazard. Care must be taken to ensure it's the first non-debug instruction, or the presence of debug info will affect codegen. Part of the continuing work for rdar://7797940, making ARM code-gen unaffected by the presence of debug information. llvm-svn: 106871	2010-06-25 18:43:14 +00:00
Bob Wilson	07aead2f8d	Add missing ARM and Thumb data layout info for vector types. Radar 8128745. llvm-svn: 106820	2010-06-25 04:41:08 +00:00
Bob Wilson	eadbf9732f	Reduce indentation. llvm-svn: 106819	2010-06-25 04:12:31 +00:00
Evan Cheng	c26e2f4b70	Oops. IT block formation pass needs to be run at any optimization level. llvm-svn: 106775	2010-06-24 19:10:14 +00:00
Eli Friedman	246c41d93e	Always allow Thumb-2 SXTB, SXTH, UXTB, and UXTH. Fixes PR7324. llvm-svn: 106770	2010-06-24 18:20:04 +00:00
Bob Wilson	279e55fb2e	PR7458: Try commuting Thumb2 instruction operands to put them into 2-address form so they can be narrowed to 16-bit instructions. llvm-svn: 106762	2010-06-24 16:50:20 +00:00
Bill Wendling	f470747a36	We are missing opportunites to use ldm. Take code like this: void t(int cp0, int cp1, int dp, int fmd) { int c0, c1, d0, d1, d2, d3; c0 = (cp0++ & 0xffff) \| ((cp1++ << 16) & 0xffff0000); c1 = (cp0++ & 0xffff) \| ((cp1++ << 16) & 0xffff0000); / ... */ } It code gens into something pretty bad. But with this change (analogous to the X86 back-end), it will use ldm and generate few instructions. llvm-svn: 106693	2010-06-23 23:00:16 +00:00
Dale Johannesen	d24c66b4a3	Do not do tail calls to external symbols. If the branch turns out to be ARM-to-Thumb or vice versa the linker cannot resolve this. 8120438. If this optimization is going to be useful we probably need a compiler flag "assume callees are same architecture" or something like that. llvm-svn: 106662	2010-06-23 18:52:34 +00:00
Jim Grosbach	a8ea498171	When using libcall expansions for the atomic intrinsics, the explicit MEMBARRIER fences aren't necessary for ARM. Tell the combiner to fold them away. llvm-svn: 106631	2010-06-23 16:08:49 +00:00
Bob Wilson	c5d712232d	Thumb1 functions using @llvm.returnaddress were not saving the incoming LR. Radar 8031193. llvm-svn: 106582	2010-06-22 22:04:24 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Evan Cheng	37bb617f8a	Tail merging pass shall not break up IT blocks. rdar://8115404 llvm-svn: 106517	2010-06-22 01:18:16 +00:00
Bob Wilson	72df24037e	sign_extend_inreg needs to be expanded for pre-v6 Thumb as well as ARM. Radar 8104310. llvm-svn: 106484	2010-06-21 21:27:34 +00:00
Jim Grosbach	523e554afa	LEApcrelJT shouldn't be marked as neverHasSideEffects, as we don't want it being moved around away from the jump table it references. rdar://8104340 llvm-svn: 106483	2010-06-21 21:27:27 +00:00
Evan Cheng	1fb4de8ec5	Fix PR7421: bug in kill transferring logic. It was ignoring loads / stores which have already been processed. llvm-svn: 106481	2010-06-21 21:21:14 +00:00
Dale Johannesen	d5c58b76ab	Fix PR 7433. Silly typo in non-Darwin ARM tail call handling, plus correct R9 handling in that mode. llvm-svn: 106434	2010-06-21 18:21:49 +00:00
Jim Grosbach	97c8a6a928	early exit for dbg_value instructions llvm-svn: 106430	2010-06-21 17:49:23 +00:00
Evan Cheng	884a8fe5fa	Fix a crash caused by dereference of MBB.end(). rdar://8110842 llvm-svn: 106399	2010-06-20 00:54:38 +00:00
Bob Wilson	6d12973143	Remove a fixme comment that is no longer relevant. llvm-svn: 106382	2010-06-19 05:32:41 +00:00
Bob Wilson	0ae08935f6	Fix error message to match function name. llvm-svn: 106381	2010-06-19 05:32:09 +00:00
Evan Cheng	7079bf815d	Ignore dbg_value's. llvm-svn: 106373	2010-06-19 02:36:21 +00:00
Evan Cheng	f3c01f3ef6	Disable sibcall optimization for Thumb1 for now since Thumb1RegisterInfo::emitEpilogue is not expecting them. llvm-svn: 106368	2010-06-19 01:01:32 +00:00
Evan Cheng	e5fcd333da	Indentation and remove dead code. llvm-svn: 106362	2010-06-19 00:11:54 +00:00
Dan Gohman	5fc43eb186	Silence compiler warnings. llvm-svn: 106360	2010-06-19 00:02:06 +00:00
Evan Cheng	119824ed4d	Move ARM if-conversion before post-ra scheduling. llvm-svn: 106355	2010-06-18 23:32:07 +00:00
Evan Cheng	4f0781c9b3	Update cmake list. llvm-svn: 106348	2010-06-18 23:12:10 +00:00
Evan Cheng	285935939d	Thumb2 hazard recognizer. llvm-svn: 106347	2010-06-18 23:11:35 +00:00
Evan Cheng	2d51c7c592	Allow ARM if-converter to be run after post allocation scheduling. - This fixed a number of bugs in if-converter, tail merging, and post-allocation scheduler. If-converter now runs branch folding / tail merging first to maximize if-conversion opportunities. - Also changed the t2IT instruction slightly. It now defines the ITSTATE register which is read by instructions in the IT block. - Added Thumb2 specific hazard recognizer to ensure the scheduler doesn't change the instruction ordering in the IT block (since IT mask has been finalized). It also ensures no other instructions can be scheduled between instructions in the IT block. This is not yet enabled. llvm-svn: 106344	2010-06-18 23:09:54 +00:00
Jim Grosbach	a57c2885cf	back-end libcall handling for ATOMIC_SWAP (__sync_lock_test_and_set) llvm-svn: 106342	2010-06-18 23:03:10 +00:00
Jim Grosbach	6860bb7796	Enable Expand handling of atomics for subtargets that can't do them inline. llvm-svn: 106336	2010-06-18 22:35:32 +00:00
Bob Wilson	a92e41a50a	Rewrite chained if's as switches and replace assertions with llvm_unreachable (as suggested in radar 8104405). llvm-svn: 106318	2010-06-18 21:32:42 +00:00
Dale Johannesen	589ffb4902	Fix ARM/Thumb reversal in previous attempt. llvm-svn: 106314	2010-06-18 21:07:47 +00:00
Jakob Stoklund Olesen	22a212f97c	When using ADDri to get the address of a stack object, 255 is a conservative limit on the offset that can be materialized without using the register scavenger. llvm-svn: 106312	2010-06-18 20:59:25 +00:00
Dale Johannesen	a06c2f79fc	An attempt to fix the problem Anton reported with ARM tail calls. Don't know if it works, but it doesn't break Darwin. llvm-svn: 106309	2010-06-18 20:44:28 +00:00
Dale Johannesen	c1570dda5c	Enable tail calls on ARM by default, with some basic tests. This has been well tested on Darwin but not elsewhere. It should work provided the linker correctly resolves B.W <label in other function> which it has not seen before, at least from llvm-based compilers. I'm leaving the arm-tail-calls switch in until I see if there's any problems because of that; it might need to be disabled for some environments. llvm-svn: 106299	2010-06-18 19:00:18 +00:00
Dan Gohman	882bb2984e	Start TargetRegisterClass indices at 0 instead of 1, so that MachineRegisterInfo doesn't have to confusingly allocate an extra entry. llvm-svn: 106296	2010-06-18 18:13:55 +00:00
Dale Johannesen	3ac52b3e43	Last round of changes for ARM tail calls. Not turning them on yet. llvm-svn: 106295	2010-06-18 18:13:11 +00:00
Jakob Stoklund Olesen	b9f91667e1	Treat the ARM inline asm {cc} constraint as a physreg (%CPSR), just like X86 does for {flags}. If we create virtual registers of the CCR class, RegAllocFast may try to spill them, and we can't do that. llvm-svn: 106289	2010-06-18 16:49:33 +00:00
Dan Gohman	f1d8304fe3	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Stuart Hastings	0125b6410a	Add a DebugLoc parameter to TargetInstrInfo::InsertBranch(). This addresses a longstanding deficiency noted in many FIXMEs scattered across all the targets. This effectively moves the problem up one level, replacing eleven FIXMEs in the targets with eight FIXMEs in CodeGen, plus one path through FastISel where we actually supply a DebugLoc, fixing Radar 7421831. llvm-svn: 106243	2010-06-17 22:43:56 +00:00
Jim Grosbach	5712c77c89	Thumb1 and any pre-v6 ARM target should use the libcall expansion of ISD::MEMBARRIER. v7 and v7 ARM mode continue to use the custom lowering. llvm-svn: 106204	2010-06-17 02:02:03 +00:00
Jim Grosbach	6e758c97fd	simplify code a bit and add a more explanatory assert for cases that previously would result in 'cannot yet select' errors. llvm-svn: 106199	2010-06-17 01:37:00 +00:00
Jim Grosbach	e3864cc15e	format and 80-column cleanup llvm-svn: 106173	2010-06-16 23:45:49 +00:00
Jakob Stoklund Olesen	2334144e6e	Don't attempt preserving conservative kill flags. We were doing it wrong. This is before LiveVariables anyway, where these kill flags are recalculated. llvm-svn: 106157	2010-06-16 22:11:08 +00:00
Bob Wilson	01ac8f9fc0	Remove the hidden "neon-reg-sequence" option. The reg sequences are working now, so there's no need to disable them. llvm-svn: 106155	2010-06-16 21:34:01 +00:00
Evan Cheng	f128bdcb55	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Dale Johannesen	438c35b5d1	Add file missing from previous commit. llvm-svn: 106058	2010-06-15 22:24:08 +00:00
Dale Johannesen	44f9dfc9cf	Next round of tail call changes. Register used in a tail call must not be callee-saved; following x86, add a new regclass to represent this. Also fixes a couple of bugs. Still disabled by default; Thumb doesn't work yet. llvm-svn: 106053	2010-06-15 22:08:33 +00:00
Bob Wilson	f3f7a770b7	Add basic support for NEON modified immediates besides VMOV. llvm-svn: 106030	2010-06-15 19:05:35 +00:00
Daniel Dunbar	0904134252	Add <cstddef> include to get ptrdiff_t, for gcc-4.6; patch by Dimitry Andric. llvm-svn: 105994	2010-06-15 14:50:42 +00:00
Bob Wilson	1478142485	VMOVQQ and VMOVQQQQ are pseudo instructions and not predicable. llvm-svn: 105990	2010-06-15 05:51:27 +00:00
Jim Grosbach	f14e08b01b	Make sure to skip dbg_value instructions when finding an insertion point for the combined load/store instruction. rdar://7797940 llvm-svn: 105982	2010-06-15 00:41:09 +00:00
Bob Wilson	5b2b504038	Rename functions referring to VMOV immediates to refer to NEON "modified immediate" operands. These functions have so far only been used for VMOV but they also apply to other NEON instructions with modified immediate operands. No functional changes. llvm-svn: 105969	2010-06-14 22:19:57 +00:00
Bob Wilson	f07d33d8f1	Add a missing bitcast. This code used to only handle conversions between i64 and f64 types, but now it also handle Neon vector types, so the f64 result of VMOVDRR may need to be converted to a Neon type. Radar 8084742. llvm-svn: 105845	2010-06-11 22:45:25 +00:00
Bob Wilson	6eae520de9	Add instruction encoding for the Neon VMOV immediate instruction. This changes the machine instruction representation of the immediate value to be encoded into an integer with similar fields as the actual VMOV instruction. This makes things easier for the disassembler, since it can just stuff the bits into the immediate operand, but harder for the asm printer since it has to decode the value to be printed. Testcase for the encoding will follow later when MC has more support for ARM. llvm-svn: 105836	2010-06-11 21:34:50 +00:00
Evan Cheng	2901371c32	Delete code that's not safe. llvm-svn: 105774	2010-06-10 02:08:20 +00:00
Jim Grosbach	5fa0158ecd	be slightly more subtle about skipping dbg_value instructions; otherwise, if a dbg_value immediately follows a sequence of ldr/str instructions that should be combined into an ldm/stm and is the last instruction in the block, then combine may end up being skipped. llvm-svn: 105758	2010-06-09 22:21:24 +00:00
Evan Cheng	a0746bd50a	Allow target to place 2-address pass inserted copies in better spots. Thumb2 will use this to try to avoid breaking up IT blocks. llvm-svn: 105745	2010-06-09 19:26:01 +00:00
Evan Cheng	83c64ee8de	Typo. llvm-svn: 105677	2010-06-09 03:49:12 +00:00
Evan Cheng	47cd593023	Thumb2 IT blocks are fairly expensive. When there are multiple selects using the same condition, it's important to make sure they are scheduled together to avoid forming multiple IT blocks. I'm adding a pre-regalloc pass that forms IT blocks early (by re-scheduling instructions and split basic blocks) to attempt to fix this. This is not turned on by default since I am not sure this is the right fix. Another issue is llvm selects are modeled as two-address conditional moves. This can be very bad when the copies before the conditional moves are not coalesced away. Teach IT formation pass to move the copies above the IT block (when legal) to avoid breaking the IT block. llvm-svn: 105669	2010-06-09 01:46:50 +00:00
Jim Grosbach	8fe3cc8055	fix copy/paste/modify think-o llvm-svn: 105653	2010-06-08 22:53:32 +00:00
Bruno Cardoso Lopes	c2f87b7bb2	Reapply r105521, this time appending "LLU" to 64 bit immediates to avoid breaking the build. llvm-svn: 105652	2010-06-08 22:51:23 +00:00
Jim Grosbach	57c6fd452e	fix typo llvm-svn: 105634	2010-06-08 20:06:55 +00:00
Bob Wilson	0271c5928e	Fix up a comment. llvm-svn: 105591	2010-06-08 00:42:08 +00:00
Bob Wilson	846bd7992c	Further changes for Neon vector shuffles: - change isShuffleMaskLegal to show that all shuffles with 32-bit and 64-bit elements are legal - the Neon shuffle instructions do not support 64-bit elements, but we were not checking for that before lowering shuffles to use them - remove some 64-bit element vduplane patterns that are no longer needed llvm-svn: 105586	2010-06-07 23:53:38 +00:00
Jim Grosbach	723d242a95	Handle dbg_value instructions (i.e., skip them) when generating IT blocks. rdar://7797940 llvm-svn: 105557	2010-06-07 21:48:47 +00:00
Chris Lattner	fdd2614330	revert r105521, which is breaking the buildbots with stuff like this: In file included from X86InstrInfo.cpp:16: X86GenInstrInfo.inc:2789: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2790: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2792: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2793: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2808: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2809: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2816: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2817: error: integer constant is too large for 'long' type llvm-svn: 105524	2010-06-05 04:17:30 +00:00
Bruno Cardoso Lopes	594fa26317	Initial AVX support for some instructions. No patterns matched yet, only assembly encoding support. llvm-svn: 105521	2010-06-05 03:53:24 +00:00
Dale Johannesen	81ef35b3ca	Improvements to tail call code. No functional effect unless using -arm-tail-calls. llvm-svn: 105515	2010-06-05 00:51:39 +00:00
Dale Johannesen	d1b9311afa	More thoroughly disable tails calls by default. 8060143, although this doesn't fix the real problem with tail call. llvm-svn: 105472	2010-06-04 18:04:24 +00:00
Jim Grosbach	3548803f62	Another fix to prevent debug info from affecting codegen. rdar://7797940 llvm-svn: 105470	2010-06-04 17:57:34 +00:00
Jim Grosbach	4e5e6a8973	more dbg_value adjustments so debug info doesn't affect codegen llvm-svn: 105454	2010-06-04 01:23:30 +00:00
Jim Grosbach	1bcdf32d22	fix typo llvm-svn: 105441	2010-06-04 00:15:00 +00:00
Bob Wilson	d8a9a04739	For NEON vectors with 32- or 64-bit elements, select BUILD_VECTORs and VECTOR_SHUFFLEs to REG_SEQUENCE instructions. The standard ISD::BUILD_VECTOR node corresponds closely to REG_SEQUENCE but I couldn't use it here because its operands do not get legalized. That is pretty awful, but I guess it makes sense for other targets. Instead, I have added an ARM-specific version of BUILD_VECTOR that will have its operands properly legalized. This fixes the rest of Radar 7872877. llvm-svn: 105439	2010-06-04 00:04:02 +00:00
Jim Grosbach	b30b81edb6	Teach the ARM load-store optimizer to deal with dbg_value instructions. llvm-svn: 105427	2010-06-03 22:41:15 +00:00
Dale Johannesen	d679ff7330	Early implementation of tail call for ARM. A temporary flag -arm-tail-calls defaults to off, so there is no functional change by default. Intrepid users may try this; simple cases work but there are bugs. llvm-svn: 105413	2010-06-03 21:09:53 +00:00
Jakob Stoklund Olesen	a8ad97743d	Slightly change the meaning of the reMaterialize target hook when the original instruction defines subregisters. Any existing subreg indices on the original instruction are preserved or composed with the new subreg index. Also substitute multiple operands mentioning the original register by using the new MachineInstr::substituteRegister() function. This is necessary because there will soon be <imp-def> operands added to non read-modify-write partial definitions. This instruction: %reg1234:foo = FLAP %reg1234<imp-def> will reMaterialize(%reg3333, bar) like this: %reg3333:bar-foo = FLAP %reg333:bar<imp-def> Finally, replace the TargetRegisterInfo pointer argument with a reference to indicate that it cannot be NULL. llvm-svn: 105358	2010-06-02 22:47:25 +00:00
Jim Grosbach	84511e1526	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Rafael Espindola	f2dffcef82	Remove the TargetRegisterClass member from CalleeSavedInfo llvm-svn: 105344	2010-06-02 20:02:30 +00:00
Bob Wilson	2d35a9e810	Rename canCombinedSubRegIndex method to something more grammatically correct and tidy up the comment describing it. llvm-svn: 105339	2010-06-02 18:54:47 +00:00
Rafael Espindola	94801a47f8	Replace ARM's getCalleeSavedRegClasses with a simpler solution llvm-svn: 105335	2010-06-02 17:54:50 +00:00
Anton Korobeynikov	a09d95412e	Some A9 load/store cleanups llvm-svn: 105109	2010-05-29 19:25:39 +00:00
Anton Korobeynikov	2a21aef8f2	Some rough approximations for load/stores on A9 llvm-svn: 105108	2010-05-29 19:25:34 +00:00
Anton Korobeynikov	d4c7cceb70	NEON/VFP stuff can be issued only via Pipe1 on A9 llvm-svn: 105107	2010-05-29 19:25:29 +00:00
Anton Korobeynikov	94d7fd88fd	Add some integer instruction itineraries for A9 llvm-svn: 105106	2010-05-29 19:25:17 +00:00
Evan Cheng	bf91499f1a	Schedule high latency instructions for latency reduction even if they are not vfp / NEON instructions. llvm-svn: 105060	2010-05-28 23:25:23 +00:00
Jim Grosbach	b342e09b5e	correct retattr llvm-svn: 104980	2010-05-28 18:03:48 +00:00
Jim Grosbach	0b20fdaff0	Cosmetic cleanup. No functional change. llvm-svn: 104974	2010-05-28 17:51:20 +00:00
Jim Grosbach	37eb2c24b9	make sure accesses to set up the jmpbuf don't get moved after it by the scheduler. Add a missing \n. llvm-svn: 104967	2010-05-28 17:37:40 +00:00
Bob Wilson	b6112e8706	Add the cc_out operand for t2RSBrs instructions. I missed this when I changed the instruction class for t2RSB to add that operand in svn r104582. Radar 8033757. llvm-svn: 104907	2010-05-28 00:27:15 +00:00
Jim Grosbach	faa3abbe39	Update the saved stack pointer in the sjlj function context following either an alloca() or an llvm.stackrestore(). rdar://8031573 llvm-svn: 104900	2010-05-27 23:49:24 +00:00
Evan Cheng	c2ebe0334a	Use report_fatal_error, not llvm_unreachable. llvm-svn: 104899	2010-05-27 23:45:31 +00:00
Jim Grosbach	c9f532dddc	back out 104862/104869. Can reuse stacksave after all. Very cool. llvm-svn: 104897	2010-05-27 23:11:57 +00:00
Evan Cheng	3d3ee87d4e	llvm can't correctly support 'H', 'Q' and 'R' modifiers. Just mark it an error. llvm-svn: 104891	2010-05-27 22:08:38 +00:00
Bob Wilson	40e62dfdc0	Fix some bad fall-throughs in a switch statement. Both the 'Q' and 'R' cases should fall through to the 'H' case, but instead 'Q' was falling through to 'R' so that it would do the wrong thing for a big-endian ARM target. llvm-svn: 104883	2010-05-27 20:23:42 +00:00
Jim Grosbach	5cde219fb1	add ISD::STACKADDR to get the current stack pointer. Will be used by sjlj EH to update the jmpbuf in the presence of VLAs. llvm-svn: 104862	2010-05-27 18:23:48 +00:00
Jakob Stoklund Olesen	4f6da9e3a8	Give SubRegIndex names to all ARM subregisters. This will be required by TableGen shortly. llvm-svn: 104754	2010-05-26 22:15:03 +00:00
Jim Grosbach	c98892fdaa	Adjust eh.sjlj.setjmp to properly have a chain and to have an opcode entry in ISD::. No functional change. llvm-svn: 104734	2010-05-26 20:22:18 +00:00
Jakob Stoklund Olesen	7de379467e	Replace the SubRegSet tablegen class with a less error-prone mechanism. A Register with subregisters must also provide SubRegIndices for adressing the subregisters. TableGen automatically inherits indices for sub-subregisters to minimize typing. CompositeIndices may be specified for the weirder cases such as the XMM sub_sd index that returns the same register, and ARM NEON Q registers where both D subregs have ssub_0 and ssub_1 sub-subregs. It is now required that all subregisters are named by an index, and a future patch will also require inherited subregisters to be named. This is necessary to allow composite subregister indices to be reduced to a single index. llvm-svn: 104704	2010-05-26 17:27:12 +00:00
Shih-wei Liao	c4376b9b1b	Coding style change (Adding 1 missing space.) llvm-svn: 104670	2010-05-26 04:46:50 +00:00
Shih-wei Liao	0568ca0ddc	Adding the missing implementation for ARM::SBFX and ARM::UBFX. Fixing http://llvm.org/bugs/show_bug.cgi?id=7225. llvm-svn: 104667	2010-05-26 03:21:39 +00:00
Jim Grosbach	a6897ecbb5	fix off by 1 (insn) error in eh.sjlj.setjmp thumb code sequence. llvm-svn: 104661	2010-05-26 01:22:21 +00:00
Jakob Stoklund Olesen	50eec620f4	Revert "Replace the SubRegSet tablegen class with a less error-prone mechanism." This reverts commit 104654. llvm-svn: 104660	2010-05-26 01:21:14 +00:00
Jakob Stoklund Olesen	0b0274524c	Replace the SubRegSet tablegen class with a less error-prone mechanism. A Register with subregisters must also provide SubRegIndices for adressing the subregisters. TableGen automatically inherits indices for sub-subregisters to minimize typing. CompositeIndices may be specified for the weirder cases such as the XMM sub_sd index that returns the same register, and ARM NEON Q registers where both D subregs have ssub_0 and ssub_1 sub-subregs. It is now required that all subregisters are named by an index, and a future patch will also require inherited subregisters to be named. This is necessary to allow composite subregister indices to be reduced to a single index. llvm-svn: 104654	2010-05-26 00:28:19 +00:00
Shih-wei Liao	b6e0bc9457	Adding the missing implementation of Bitfield's "clear" and "insert". Fixing http://llvm.org/bugs/show_bug.cgi?id=7222. llvm-svn: 104653	2010-05-26 00:25:05 +00:00
Shih-wei Liao	e22abfa823	To handle s* registers in emitVFPLoadStoreMultipleInstruction(). Fixing http://llvm.org/bugs/show_bug.cgi?id=7221. llvm-svn: 104652	2010-05-26 00:02:28 +00:00
Jakob Stoklund Olesen	673e7e0f37	Remove NumberHack entirely. SubRegIndex instances are now numbered uniquely the same way Register instances are - in lexicographical order by name. llvm-svn: 104627	2010-05-25 19:49:33 +00:00
Zonr Chang	a6714e8a43	Add missing implementation to the materialization of VFP misc. instructions (vmrs, vmsr and vmov (immediate)) llvm-svn: 104588	2010-05-25 10:23:52 +00:00
Zonr Chang	2da5aa1b60	Add support to MOVimm32 using movt/movw for ARM JIT llvm-svn: 104587	2010-05-25 08:42:45 +00:00
Bob Wilson	4f48499d2c	Allow t2MOVsrl_flag and t2MOVsra_flag instructions to be predicated. I don't know of any particular reason why that would be important, but neither can I see any reason to disallow it. llvm-svn: 104583	2010-05-25 04:51:47 +00:00
Bob Wilson	debbbe3fd9	Fix up instruction classes for Thumb2 RSB instructions to be consistent with Thumb2 ADD and SUB instructions: allow RSB instructions be changed to set the condition codes, and allow RSBS instructions to be predicated. llvm-svn: 104582	2010-05-25 04:43:08 +00:00
Bob Wilson	26fdebcae9	Clean up indentation. llvm-svn: 104580	2010-05-25 03:36:52 +00:00
Jakob Stoklund Olesen	70affbd988	Use enums instead of literals in the ARM backend. llvm-svn: 104573	2010-05-25 00:15:15 +00:00
Jakob Stoklund Olesen	fdb25de17e	Switch SubRegSet to using symbolic SubRegIndices llvm-svn: 104571	2010-05-24 23:03:18 +00:00
Bob Wilson	91b2b8540c	Allow Thumb2 MVN instructions to set condition codes. The immediate operand version of t2MVN already allowed that, but not the register versions. llvm-svn: 104570	2010-05-24 22:41:19 +00:00
Jakob Stoklund Olesen	1181a19318	Lose the dummies llvm-svn: 104564	2010-05-24 21:47:01 +00:00
Jakob Stoklund Olesen	edab242488	Replace the tablegen RegisterClass field SubRegClassList with an alist-like data structure that represents a mapping without any dependencies on SubRegIndex numbering. This brings us closer to being able to remove the explicit SubRegIndex numbering, and it is now possible to specify any mapping without inventing *_INVALID register classes. llvm-svn: 104563	2010-05-24 21:46:58 +00:00
Bob Wilson	722bff2c7d	Clean up some extra whitespace. llvm-svn: 104544	2010-05-24 20:08:34 +00:00
Bob Wilson	3eb7691858	Thumb2 RSBS instructions were being printed without the 'S' suffix. Fix it by changing the T2I_rbin_s_is multiclass to handle the CPSR output and 'S' suffix in the same way as T2I_bin_s_irs. llvm-svn: 104531	2010-05-24 18:44:06 +00:00
Evan Cheng	755d45be43	LR is in GPR, not tGPR even in Thumb1 mode. llvm-svn: 104518	2010-05-24 18:00:18 +00:00
Jakob Stoklund Olesen	8d042c0269	Fix a few places that depended on the numeric value of subreg indices. Add assertions in places that depend on consecutive indices. llvm-svn: 104510	2010-05-24 17:13:28 +00:00
Jakob Stoklund Olesen	6c47d6423c	Switch ARMRegisterInfo.td to use SubRegIndex and eliminate the parallel enums from ARMRegisterInfo.h llvm-svn: 104508	2010-05-24 16:54:32 +00:00
Bob Wilson	49f40e8c32	VDUP doesn't support vectors with 64-bit elements. llvm-svn: 104455	2010-05-23 05:42:31 +00:00
Evan Cheng	168ced94d8	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Jim Grosbach	bd9485db63	Implement eh.sjlj.longjmp for ARM. Clean up the intrinsic a bit. Followups: docs patch for the builtin and eh.sjlj.setjmp cleanup to match longjmp. llvm-svn: 104419	2010-05-22 01:06:18 +00:00
Bob Wilson	91fdf68516	Recognize more BUILD_VECTORs and VECTOR_SHUFFLEs that can be implemented by copying VFP subregs. This exposed a bunch of dead code in the *spill-q.ll tests, so I tweaked those tests to keep that code from being optimized away. Radar 7872877. llvm-svn: 104415	2010-05-22 00:23:12 +00:00
Evan Cheng	34c260458a	Change ARM scheduling default to list-hybrid if the target supports floating point instructions (and is not using soft float). llvm-svn: 104307	2010-05-21 00:43:17 +00:00
Evan Cheng	4401f8873c	Allow targets more controls on what nodes are scheduled by reg pressure, what for latency in hybrid mode. llvm-svn: 104293	2010-05-20 23:26:43 +00:00
Bob Wilson	5954994bba	Handle Neon v2f64 and v2i64 vector shuffles as register copies. This fixes the remaining issue with pr7167. llvm-svn: 104257	2010-05-20 18:39:53 +00:00
Evan Cheng	738e920edf	Code refactoring: pull SchedPreference enum from TargetLowering.h to TargetMachine.h and put it in its own namespace. llvm-svn: 104147	2010-05-19 20:19:50 +00:00
Evan Cheng	daeca2d156	t2LEApcrel and tLEApcrel are re-materializable. This makes it possible to hoist more loads during machine LICM. llvm-svn: 104115	2010-05-19 07:28:01 +00:00
Evan Cheng	b7704fee4c	Use 'adr' for LEApcrel and LEApcrel. Mark LEApcrel re-materializable. llvm-svn: 104114	2010-05-19 07:26:50 +00:00
Evan Cheng	dd7f566597	Mark pattern-less mayLoad / mayStore instructions neverHasSideEffects. These do not have other un-modeled side effects. llvm-svn: 104111	2010-05-19 06:07:03 +00:00
Evan Cheng	e89f5ae9d4	Target instruction selection should copy memoperands. llvm-svn: 104110	2010-05-19 06:06:09 +00:00
Evan Cheng	2c452fcd14	Mark a few more pattern-less instructions with neverHasSideEffects. This is especially important on instructions like t2LEApcreal which are prime candidate for machine LICM. llvm-svn: 104102	2010-05-19 01:52:25 +00:00
Evan Cheng	f19384d54a	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Jakob Stoklund Olesen	93d8844699	ARMBaseRegisterInfo::estimateRSStackSizeLimit() could return prematurely with a too large limit. The function would return immediately when finding an addrmode 3/5 instruction. It needs to keep scanning in case there is an addrmode 6 instruction which drops the limit to 0. A test case is very difficult to produce because it will only fail when the scavenger is used. rdar://problem/7894847 llvm-svn: 103995	2010-05-17 23:29:23 +00:00
Evan Cheng	cd04ed3533	vmov of immediates are trivially re-materializable. llvm-svn: 103982	2010-05-17 21:54:50 +00:00
Bob Wilson	c601801a7e	Fix a regression in 464.h264 for thumb1 and thumb2 nightly tests. Obvious in retrospect but not fun to debug. llvm-svn: 103969	2010-05-17 20:31:13 +00:00
Evan Cheng	3d98b996ff	Turn on -neon-reg-sequence by default. Using NEON load / store multiple instructions will no longer create gobs of vmov of D registers! llvm-svn: 103960	2010-05-17 19:51:20 +00:00
Evan Cheng	5a2809cbd8	No reason not to run the NEON domain croassing fix up pass in thumb2 mode. llvm-svn: 103917	2010-05-17 01:11:46 +00:00
Anton Korobeynikov	497d831966	Chris said that the comment char should be escaped. Fix all the occurences of "@" in *.td llvm-svn: 103903	2010-05-16 09:15:36 +00:00
Anton Korobeynikov	4c719c4515	Generalize the ARM DAG combiner of mul with constants to all power-of-two cases. llvm-svn: 103901	2010-05-16 08:54:20 +00:00
Evan Cheng	298e6b82eb	Model vst lane instructions with REG_SEQUENCE. llvm-svn: 103898	2010-05-16 03:27:48 +00:00
Anton Korobeynikov	1bf28a128b	Some cheap DAG combine goodness for multiplication with a particular constant. This can be extended later on to handle more "complex" constants. llvm-svn: 103881	2010-05-15 18:16:59 +00:00
Anton Korobeynikov	2b7aace2e0	"trap" pseudo-op turned out to be apple-local. Temporary emit it as raw bytes until it will be added to binutils as well. llvm-svn: 103878	2010-05-15 17:19:20 +00:00
Evan Cheng	9e688cbcc9	Model 128-bit vld lane with REG_SEQUENCE. llvm-svn: 103868	2010-05-15 07:53:37 +00:00

... 2 3 4 5 6 ...

2875 Commits