llvm-project

Commit Graph

Author	SHA1	Message	Date
Eric Christopher	29b58afdf1	Hack to let the move lowering handle dynamic-no-pic absolute moves of TLVP: movl _a@TLVP, %eax Daniel: Please review if you get a chance. llvm-svn: 106194	2010-06-17 00:51:48 +00:00
Jim Grosbach	e3864cc15e	format and 80-column cleanup llvm-svn: 106173	2010-06-16 23:45:49 +00:00
Jakob Stoklund Olesen	2334144e6e	Don't attempt preserving conservative kill flags. We were doing it wrong. This is before LiveVariables anyway, where these kill flags are recalculated. llvm-svn: 106157	2010-06-16 22:11:08 +00:00
Bob Wilson	01ac8f9fc0	Remove the hidden "neon-reg-sequence" option. The reg sequences are working now, so there's no need to disable them. llvm-svn: 106155	2010-06-16 21:34:01 +00:00
Benjamin Kramer	41476410c9	TODO-- llvm-svn: 106102	2010-06-16 15:47:00 +00:00
Evan Cheng	f128bdcb55	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Bill Wendling	8c0cf0994d	Create a more targeted fix for not sinking instructions into a range where it will conflict with another live range. The place which creates this scenario is the code in X86 that lowers a select instruction by splitting the MBBs. This eliminates the need to check from the bottom up in an MBB for live pregs. llvm-svn: 106066	2010-06-15 23:46:31 +00:00
Eric Christopher	6c4d63e1a5	For 32-bit non-pic tlv mach-o addressing we don't need a pic base or a relative address. llvm-svn: 106064	2010-06-15 23:08:42 +00:00
Dale Johannesen	438c35b5d1	Add file missing from previous commit. llvm-svn: 106058	2010-06-15 22:24:08 +00:00
Dale Johannesen	44f9dfc9cf	Next round of tail call changes. Register used in a tail call must not be callee-saved; following x86, add a new regclass to represent this. Also fixes a couple of bugs. Still disabled by default; Thumb doesn't work yet. llvm-svn: 106053	2010-06-15 22:08:33 +00:00
Dale Johannesen	89456b2612	Reapply 105986 with fix for bug pointed out by Jakob: flag argument to addReg is not the same format as flags attached to MachineOperand, although both have the same info. I don't think this actually mattered; the bootstrap failure did not reproduce on the next run anyway. llvm-svn: 106049	2010-06-15 21:36:43 +00:00
Chris Lattner	874c92bd47	fix fastisel to handle GS and FS relative pointers. Patch by Nelson Elhage! llvm-svn: 106031	2010-06-15 19:08:40 +00:00
Bob Wilson	f3f7a770b7	Add basic support for NEON modified immediates besides VMOV. llvm-svn: 106030	2010-06-15 19:05:35 +00:00
Daniel Dunbar	0904134252	Add <cstddef> include to get ptrdiff_t, for gcc-4.6; patch by Dimitry Andric. llvm-svn: 105994	2010-06-15 14:50:42 +00:00
Bob Wilson	1478142485	VMOVQQ and VMOVQQQQ are pseudo instructions and not predicable. llvm-svn: 105990	2010-06-15 05:51:27 +00:00
Dale Johannesen	3f253d2353	Revert 105986; looks like I'd better try bootstrapping. llvm-svn: 105988	2010-06-15 04:55:06 +00:00
Dale Johannesen	c338ef2b65	The form of BuildMI used for TAILJMPr was changing the register containing the target address, an input, into an output. I don't think this actually broke anything on x86 (it does on ARM), but it's wrong. llvm-svn: 105986	2010-06-15 03:13:49 +00:00
Jim Grosbach	f14e08b01b	Make sure to skip dbg_value instructions when finding an insertion point for the combined load/store instruction. rdar://7797940 llvm-svn: 105982	2010-06-15 00:41:09 +00:00
Bob Wilson	5b2b504038	Rename functions referring to VMOV immediates to refer to NEON "modified immediate" operands. These functions have so far only been used for VMOV but they also apply to other NEON instructions with modified immediate operands. No functional changes. llvm-svn: 105969	2010-06-14 22:19:57 +00:00
Chris Lattner	faa7bdccbf	fix a nasty bug where we were not treating available_externally symbols as declarations in the X86 backend. This would manifest on darwin x86-32 as errors like this with -fvisibility=hidden: symbol '__ZNSbIcED1Ev' can not be undefined in a subtraction expression This fixes PR7353. llvm-svn: 105954	2010-06-14 20:11:56 +00:00
Chris Lattner	58c09b2859	fix a -Wbool-conversions warning from clang. llvm-svn: 105943	2010-06-14 18:28:57 +00:00
Eli Friedman	ba1f1fcae5	Add back some possible optimizations for va_arg, with wording that makes it more clear what exactly is missing. llvm-svn: 105934	2010-06-14 07:03:30 +00:00
Rafael Espindola	e302f833e1	Merge getStoreRegOpcode and getLoadRegOpcode. llvm-svn: 105900	2010-06-12 20:13:29 +00:00
Eli Friedman	e17e4aea2a	Add README entry; based on testcase from Bill Hart. llvm-svn: 105878	2010-06-12 05:54:27 +00:00
Bruno Cardoso Lopes	ada854f8b6	make the avx intrinsics 3 address llvm-svn: 105876	2010-06-12 03:12:14 +00:00
Bruno Cardoso Lopes	f203703467	Add some basic fp intrinsics for AVX llvm-svn: 105873	2010-06-12 02:38:32 +00:00
Bruno Cardoso Lopes	a714ea0f7d	More AVX: {ADD,SUB,MUL,DIV}{PD,PS}rm llvm-svn: 105870	2010-06-12 01:53:48 +00:00
Bruno Cardoso Lopes	b06f54b852	More AVX: {ADD,SUB,MUL,DIV}{PD,PS}rr Handle OpSize TSFlag for AVX llvm-svn: 105869	2010-06-12 01:23:26 +00:00
Bruno Cardoso Lopes	8947c32493	Add some comments about REX fields llvm-svn: 105860	2010-06-12 00:03:52 +00:00
Bruno Cardoso Lopes	fd5458d4bd	More AVX instructions ({ADD,SUB,MUL,DIV}{SS,SD}rm) Introduce the VEX_X field llvm-svn: 105859	2010-06-11 23:50:47 +00:00
Bob Wilson	f07d33d8f1	Add a missing bitcast. This code used to only handle conversions between i64 and f64 types, but now it also handles Neon vector types, so the f64 result of VMOVDRR may need to be converted to a Neon type. Radar 8084742. llvm-svn: 105845	2010-06-11 22:45:25 +00:00
Bob Wilson	6eae520de9	Add instruction encoding for the Neon VMOV immediate instruction. This changes the machine instruction representation of the immediate value to be encoded into an integer with similar fields as the actual VMOV instruction. This makes things easier for the disassembler, since it can just stuff the bits into the immediate operand, but harder for the asm printer since it has to decode the value to be printed. Testcase for the encoding will follow later when MC has more support for ARM. llvm-svn: 105836	2010-06-11 21:34:50 +00:00
Evan Cheng	2901371c32	Delete code that's not safe. llvm-svn: 105774	2010-06-10 02:08:20 +00:00
Jim Grosbach	5fa0158ecd	be slightly more subtle about skipping dbg_value instructions; otherwise, if a dbg_value immediately follows a sequence of ldr/str instructions that should be combined into an ldm/stm and is the last instruction in the block, then combine may end up being skipped. llvm-svn: 105758	2010-06-09 22:21:24 +00:00
Evan Cheng	a0746bd50a	Allow target to place 2-address pass inserted copies in better spots. Thumb2 will use this to try to avoid breaking up IT blocks. llvm-svn: 105745	2010-06-09 19:26:01 +00:00
Kalle Raiskila	5e0862f7f5	Fix SPU to cope with vector insertelement to an undef position. We default to inserting to lane 0. llvm-svn: 105722	2010-06-09 09:58:17 +00:00
Kalle Raiskila	056113a211	Handle loading from/storing to undef pointers on SPU by inserting a random load/store, rather than crashing llc. llvm-svn: 105710	2010-06-09 08:29:41 +00:00
Evan Cheng	83c64ee8de	Typo. llvm-svn: 105677	2010-06-09 03:49:12 +00:00
Eli Friedman	ab44d1281a	A few new x86-64 specific README entries. llvm-svn: 105674	2010-06-09 02:43:17 +00:00
Evan Cheng	47cd593023	Thumb2 IT blocks are fairly expensive. When there are multiple selects using the same condition, it's important to make sure they are scheduled together to avoid forming multiple IT blocks. I'm adding a pre-regalloc pass that forms IT blocks early (by re-scheduling instructions and splitting basic blocks) to attempt to fix this. This is not turned on by default since I am not sure this is the right fix. Another issue is llvm selects are modeled as two-address conditional moves. This can be very bad when the copies before the conditional moves are not coalesced away. Teach IT formation pass to move the copies above the IT block (when legal) to avoid breaking the IT block. llvm-svn: 105669	2010-06-09 01:46:50 +00:00
Kevin Enderby	0de0f3fc02	Incremental improvement to the handling of the x86 "Jump if rCX Zero" instruction. Added the 64-bit version "jrcxz" so it is recognized and also added the checks for incorrect uses of "jcxz" in 64-bit mode and "jrcxz" in 32-bit mode. Still to do is to correctly handle the encoding of the instruction adding the Address-size override prefix byte, 0x67, when the width of the count register is not the same as the mode the machine is running in. Which for example means the encoding of "jecxz" depends if you are assembling as a 32-bit target or a 64-bit target. llvm-svn: 105661	2010-06-08 23:48:44 +00:00
Eric Christopher	6ab55c5683	Split out these asserts so it's more apparent why we're not assembling that rip-relative address when executing in 32-bit mode. llvm-svn: 105656	2010-06-08 22:57:33 +00:00
Jim Grosbach	8fe3cc8055	fix copy/paste/modify think-o llvm-svn: 105653	2010-06-08 22:53:32 +00:00
Bruno Cardoso Lopes	c2f87b7bb2	Reapply r105521, this time appending "LLU" to 64 bit immediates to avoid breaking the build. llvm-svn: 105652	2010-06-08 22:51:23 +00:00
Eric Christopher	89d103a8ce	Ensure that mov and not lea are used to stick the address into the register. While we're at it, make sure it's in the right one. llvm-svn: 105645	2010-06-08 22:04:25 +00:00
Jim Grosbach	57c6fd452e	fix typo llvm-svn: 105634	2010-06-08 20:06:55 +00:00
Kalle Raiskila	6c40caf729	Flag SPU's function call sequence together. Discussed here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2010-June/032107.html llvm-svn: 105601	2010-06-08 07:55:16 +00:00
Bob Wilson	0271c5928e	Fix up a comment. llvm-svn: 105591	2010-06-08 00:42:08 +00:00
Bob Wilson	846bd7992c	Further changes for Neon vector shuffles: - change isShuffleMaskLegal to show that all shuffles with 32-bit and 64-bit elements are legal - the Neon shuffle instructions do not support 64-bit elements, but we were not checking for that before lowering shuffles to use them - remove some 64-bit element vduplane patterns that are no longer needed llvm-svn: 105586	2010-06-07 23:53:38 +00:00
Jim Grosbach	723d242a95	Handle dbg_value instructions (i.e., skip them) when generating IT blocks. rdar://7797940 llvm-svn: 105557	2010-06-07 21:48:47 +00:00
Bill Wendling	a3bba3371a	Create new accessors to get arguments for call/invoke instructions. It breaks encapsulation to force the users of these classes to know about the internal data structure of the Operands structure. It also can lead to errors, like in the MSIL writer. llvm-svn: 105539	2010-06-07 19:05:06 +00:00
Duncan Sands	e4f45cc88f	This bug is also present in MSVC10. Requested by Elrood on IRC. llvm-svn: 105527	2010-06-05 12:40:43 +00:00
Chris Lattner	fdd2614330	revert r105521, which is breaking the buildbots with stuff like this: In file included from X86InstrInfo.cpp:16: X86GenInstrInfo.inc:2789: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2790: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2792: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2793: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2808: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2809: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2816: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2817: error: integer constant is too large for 'long' type llvm-svn: 105524	2010-06-05 04:17:30 +00:00
Bruno Cardoso Lopes	594fa26317	Initial AVX support for some instructions. No patterns matched yet, only assembly encoding support. llvm-svn: 105521	2010-06-05 03:53:24 +00:00
Dale Johannesen	81ef35b3ca	Improvements to tail call code. No functional effect unless using -arm-tail-calls. llvm-svn: 105515	2010-06-05 00:51:39 +00:00
Dale Johannesen	df1a7f83bf	Fix some liveout handling related to tail calls, see comments. I don't think this ever resulted in problems on x86, but it would on ARM. llvm-svn: 105509	2010-06-05 00:30:45 +00:00
Dale Johannesen	d1b9311afa	More thoroughly disable tail calls by default. 8060143, although this doesn't fix the real problem with tail call. llvm-svn: 105472	2010-06-04 18:04:24 +00:00
Jim Grosbach	3548803f62	Another fix to prevent debug info from affecting codegen. rdar://7797940 llvm-svn: 105470	2010-06-04 17:57:34 +00:00
Jim Grosbach	4e5e6a8973	more dbg_value adjustments so debug info doesn't affect codegen llvm-svn: 105454	2010-06-04 01:23:30 +00:00
Jim Grosbach	1bcdf32d22	fix typo llvm-svn: 105441	2010-06-04 00:15:00 +00:00
Bob Wilson	d8a9a04739	For NEON vectors with 32- or 64-bit elements, select BUILD_VECTORs and VECTOR_SHUFFLEs to REG_SEQUENCE instructions. The standard ISD::BUILD_VECTOR node corresponds closely to REG_SEQUENCE but I couldn't use it here because its operands do not get legalized. That is pretty awful, but I guess it makes sense for other targets. Instead, I have added an ARM-specific version of BUILD_VECTOR that will have its operands properly legalized. This fixes the rest of Radar 7872877. llvm-svn: 105439	2010-06-04 00:04:02 +00:00
Jim Grosbach	b30b81edb6	Teach the ARM load-store optimizer to deal with dbg_value instructions. llvm-svn: 105427	2010-06-03 22:41:15 +00:00
Dale Johannesen	d679ff7330	Early implementation of tail call for ARM. A temporary flag -arm-tail-calls defaults to off, so there is no functional change by default. Intrepid users may try this; simple cases work but there are bugs. llvm-svn: 105413	2010-06-03 21:09:53 +00:00
Eric Christopher	b0e1a458ce	Add first pass at darwin tls compiler support. llvm-svn: 105381	2010-06-03 04:07:48 +00:00
Eli Friedman	ceb13f2af3	Remove some already-fixed README entries. llvm-svn: 105377	2010-06-03 01:47:31 +00:00
Eli Friedman	a59b7a72b9	Remove README entry which no longer compiles to something sane. llvm-svn: 105376	2010-06-03 01:16:51 +00:00
Eli Friedman	1f41303260	Remove a fixed item, update a couple partially-fixed items. llvm-svn: 105375	2010-06-03 01:01:48 +00:00
Jakob Stoklund Olesen	a8ad97743d	Slightly change the meaning of the reMaterialize target hook when the original instruction defines subregisters. Any existing subreg indices on the original instruction are preserved or composed with the new subreg index. Also substitute multiple operands mentioning the original register by using the new MachineInstr::substituteRegister() function. This is necessary because there will soon be <imp-def> operands added to non read-modify-write partial definitions. This instruction: %reg1234:foo = FLAP %reg1234<imp-def> will reMaterialize(%reg3333, bar) like this: %reg3333:bar-foo = FLAP %reg3333:bar<imp-def> Finally, replace the TargetRegisterInfo pointer argument with a reference to indicate that it cannot be NULL. llvm-svn: 105358	2010-06-02 22:47:25 +00:00
Jim Grosbach	84511e1526	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Rafael Espindola	f2dffcef82	Remove the TargetRegisterClass member from CalleeSavedInfo llvm-svn: 105344	2010-06-02 20:02:30 +00:00
Eli Friedman	6e3d5af945	Fix comment so it doesn't include comments which are irrelevant to the x86 backend. Add a FIXME noting what can be fixed here. llvm-svn: 105342	2010-06-02 19:35:46 +00:00
Dan Gohman	a690618c58	Use comments to document non-obvious code rather than mailing list archives. llvm-svn: 105341	2010-06-02 19:13:40 +00:00
Bob Wilson	2d35a9e810	Rename canCombinedSubRegIndex method to something more grammatically correct and tidy up the comment describing it. llvm-svn: 105339	2010-06-02 18:54:47 +00:00
Rafael Espindola	94801a47f8	Replace ARM's getCalleeSavedRegClasses with a simpler solution llvm-svn: 105335	2010-06-02 17:54:50 +00:00
Rafael Espindola	7881a64a50	Remove unused function. llvm-svn: 105325	2010-06-02 15:44:20 +00:00
Rafael Espindola	ef2b6ce00a	cleanup llvm-svn: 105322	2010-06-02 13:53:17 +00:00
Rafael Espindola	c08ecba597	Remove uses of getCalleeSavedRegClasses from outside the backends and removes the virtual declaration. With that out of the way I should be able to cleanup one backend at a time. llvm-svn: 105321	2010-06-02 12:39:06 +00:00
Eli Friedman	526e6d045f	Don't try to custom-lower 64-bit add-with-overflow and friends on x86-32; the x86 backend currently doesn't know how to handle them. This doesn't really fix anything because LegalizeTypes doesn't know how to handle them either. We do get a better error message, though. llvm-svn: 105305	2010-06-02 00:27:18 +00:00
Eli Friedman	6382c9c681	Remove outdated README entries. llvm-svn: 105303	2010-06-02 00:10:36 +00:00
Dan Gohman	47a0724425	Fix the allocation of shadow space for the Win64 calling convention in X86FastISel. Patch by Jan Sjodin. llvm-svn: 105290	2010-06-01 21:09:47 +00:00
Bruno Cardoso Lopes	d44677ba69	Refactor some SSE 2 unpack instructions llvm-svn: 105276	2010-06-01 17:02:50 +00:00
Kalle Raiskila	8916358f97	Fix handling of 'load' nodes. llvm-svn: 105269	2010-06-01 13:34:47 +00:00
Anton Korobeynikov	a09d95412e	Some A9 load/store cleanups llvm-svn: 105109	2010-05-29 19:25:39 +00:00
Anton Korobeynikov	2a21aef8f2	Some rough approximations for load/stores on A9 llvm-svn: 105108	2010-05-29 19:25:34 +00:00
Anton Korobeynikov	d4c7cceb70	NEON/VFP stuff can be issued only via Pipe1 on A9 llvm-svn: 105107	2010-05-29 19:25:29 +00:00
Anton Korobeynikov	94d7fd88fd	Add some integer instruction itineraries for A9 llvm-svn: 105106	2010-05-29 19:25:17 +00:00
Evan Cheng	27c4933e02	Fix PR7193: if sibling call address can take a register, make sure there are enough registers available by counting inreg arguments. llvm-svn: 105092	2010-05-29 01:35:22 +00:00
Jakob Stoklund Olesen	e02996ca8f	Merge the SystemZ subreg_even32 SubRegIndex into subreg_32bit. The SubRegIndices were overspecified when inheriting sub-subregisters, for instance: R0Q:subreg_even32 = R0Q:subreg_32bit = R0Q:subreg_even:subreg_32bit. This meant that composeSubRegIndices(subreg_even, subreg_32bit) was ambiguous. llvm-svn: 105063	2010-05-28 23:48:29 +00:00
Evan Cheng	bf91499f1a	Schedule high latency instructions for latency reduction even if they are not vfp / NEON instructions. llvm-svn: 105060	2010-05-28 23:25:23 +00:00
Dale Johannesen	e8be73f3e7	Fix comment typos. llvm-svn: 105059	2010-05-28 23:24:28 +00:00
Bruno Cardoso Lopes	1f79289806	More SSE 1 & 2 merge, this time with logical instructions llvm-svn: 105014	2010-05-28 22:47:03 +00:00
Kevin Enderby	4c71e08ed8	MC/X86: Add alias for movzx. llvm-svn: 105005	2010-05-28 21:20:21 +00:00
Kevin Enderby	b29228905f	MC/X86: Add alias for fwait. llvm-svn: 105001	2010-05-28 20:59:10 +00:00
Kevin Enderby	76413597a9	Fix the use of x86 control and debug registers so that the assertion failure in getX86RegNum() does not happen. Patch by Shantonu Sen! llvm-svn: 104994	2010-05-28 19:01:27 +00:00
Jim Grosbach	b342e09b5e	correct retattr llvm-svn: 104980	2010-05-28 18:03:48 +00:00
Jim Grosbach	0b20fdaff0	Cosmetic cleanup. No functional change. llvm-svn: 104974	2010-05-28 17:51:20 +00:00
Jim Grosbach	37eb2c24b9	make sure accesses to set up the jmpbuf don't get moved after it by the scheduler. Add a missing \n. llvm-svn: 104967	2010-05-28 17:37:40 +00:00
Bob Wilson	b6112e8706	Add the cc_out operand for t2RSBrs instructions. I missed this when I changed the instruction class for t2RSB to add that operand in svn r104582. Radar 8033757. llvm-svn: 104907	2010-05-28 00:27:15 +00:00
Jim Grosbach	faa3abbe39	Update the saved stack pointer in the sjlj function context following either an alloca() or an llvm.stackrestore(). rdar://8031573 llvm-svn: 104900	2010-05-27 23:49:24 +00:00
Evan Cheng	c2ebe0334a	Use report_fatal_error, not llvm_unreachable. llvm-svn: 104899	2010-05-27 23:45:31 +00:00
Jim Grosbach	c9f532dddc	back out 104862/104869. Can reuse stacksave after all. Very cool. llvm-svn: 104897	2010-05-27 23:11:57 +00:00
Evan Cheng	3d3ee87d4e	llvm can't correctly support 'H', 'Q' and 'R' modifiers. Just mark it an error. llvm-svn: 104891	2010-05-27 22:08:38 +00:00
Kevin Enderby	9738f64bd9	MC/X86: Add aliases for Jcc variants. llvm-svn: 104890	2010-05-27 21:33:19 +00:00
Bob Wilson	40e62dfdc0	Fix some bad fall-throughs in a switch statement. Both the 'Q' and 'R' cases should fall through to the 'H' case, but instead 'Q' was falling through to 'R' so that it would do the wrong thing for a big-endian ARM target. llvm-svn: 104883	2010-05-27 20:23:42 +00:00
Dale Johannesen	9e43c07bc5	Mark some math lib intrinsic nodes Legal on SSE4.1. No functional effect as these nodes are not generated yet. llvm-svn: 104879	2010-05-27 20:12:41 +00:00
Dan Gohman	dc53f1cb5c	FastISel doesn't yet handle callee-pop functions. To support this, move IsCalleePop from X86ISelLowering to X86Subtarget. llvm-svn: 104866	2010-05-27 18:43:40 +00:00
Jim Grosbach	5cde219fb1	add ISD::STACKADDR to get the current stack pointer. Will be used by sjlj EH to update the jmpbuf in the presence of VLAs. llvm-svn: 104862	2010-05-27 18:23:48 +00:00
Bruno Cardoso Lopes	54b07ad2cd	Merge basic binops SSE 1 & 2 instruction classes. This is a step towards refactoring common code between SSE versions. llvm-svn: 104860	2010-05-27 18:17:40 +00:00
Daniel Dunbar	c0b69020cd	AsmMatcher/X86: Mark _REV instructions as "code gen only", they aren't expected to be matched. llvm-svn: 104757	2010-05-26 22:21:28 +00:00
Jakob Stoklund Olesen	4f6da9e3a8	Give SubRegIndex names to all ARM subregisters. This will be required by TableGen shortly. llvm-svn: 104754	2010-05-26 22:15:03 +00:00
Daniel Dunbar	b33dfbcba4	MC: Add TargetMachine support for setting the value of MCRelaxAll with -filetype=obj. llvm-svn: 104747	2010-05-26 21:48:55 +00:00
Jakob Stoklund Olesen	d1d7ed63ff	Add StringRef::compare_numeric and use it to sort TableGen register records. This means that our Registers are now ordered R7, R8, R9, R10, R12, ... Not R1, R10, R11, R12, R2, R3, ... llvm-svn: 104745	2010-05-26 21:47:28 +00:00
Jim Grosbach	c98892fdaa	Adjust eh.sjlj.setjmp to properly have a chain and to have an opcode entry in ISD::. No functional change. llvm-svn: 104734	2010-05-26 20:22:18 +00:00
Kevin Enderby	70e34983e8	Fix the x86 move to/from segment register instructions. llvm-svn: 104731	2010-05-26 20:10:45 +00:00
Daniel Dunbar	7c8bd0fc98	MC: Change RelaxInstruction to only take the input and output instructions. llvm-svn: 104713	2010-05-26 18:15:06 +00:00
Dan Gohman	338674a323	Fix a typo in a comment that Gabor noticed. llvm-svn: 104711	2010-05-26 18:03:53 +00:00
Daniel Dunbar	a19838e107	MC: Simplify MayNeedRelaxation to not provide the fixups, so we can query it before encoding. llvm-svn: 104707	2010-05-26 17:45:29 +00:00
Jakob Stoklund Olesen	7de379467e	Replace the SubRegSet tablegen class with a less error-prone mechanism. A Register with subregisters must also provide SubRegIndices for addressing the subregisters. TableGen automatically inherits indices for sub-subregisters to minimize typing. CompositeIndices may be specified for the weirder cases such as the XMM sub_sd index that returns the same register, and ARM NEON Q registers where both D subregs have ssub_0 and ssub_1 sub-subregs. It is now required that all subregisters are named by an index, and a future patch will also require inherited subregisters to be named. This is necessary to allow composite subregister indices to be reduced to a single index. llvm-svn: 104704	2010-05-26 17:27:12 +00:00
Daniel Dunbar	b34440a6a8	MC: Eliminate MCAsmFixup, replace with MCFixup. llvm-svn: 104699	2010-05-26 15:18:56 +00:00
Daniel Dunbar	353a91ff76	MC: Use accessors for access to MCAsmFixup. llvm-svn: 104697	2010-05-26 15:18:31 +00:00
Daniel Dunbar	3627af5da4	MC: Change MCInst::dump_pretty to not include a trailing newline. llvm-svn: 104696	2010-05-26 15:18:13 +00:00
Zhongxing Xu	730a977e02	SRetReturnReg was set in LowerFormalArguments(). So only assert it here. llvm-svn: 104691	2010-05-26 08:10:02 +00:00
Shih-wei Liao	c4376b9b1b	Coding style change (Adding 1 missing space.) llvm-svn: 104670	2010-05-26 04:46:50 +00:00
Shih-wei Liao	0568ca0ddc	Adding the missing implementation for ARM::SBFX and ARM::UBFX. Fixing http://llvm.org/bugs/show_bug.cgi?id=7225. llvm-svn: 104667	2010-05-26 03:21:39 +00:00
Jim Grosbach	a6897ecbb5	fix off by 1 (insn) error in eh.sjlj.setjmp thumb code sequence. llvm-svn: 104661	2010-05-26 01:22:21 +00:00
Jakob Stoklund Olesen	50eec620f4	Revert "Replace the SubRegSet tablegen class with a less error-prone mechanism." This reverts commit 104654. llvm-svn: 104660	2010-05-26 01:21:14 +00:00
Jakob Stoklund Olesen	0b0274524c	Replace the SubRegSet tablegen class with a less error-prone mechanism. A Register with subregisters must also provide SubRegIndices for addressing the subregisters. TableGen automatically inherits indices for sub-subregisters to minimize typing. CompositeIndices may be specified for the weirder cases such as the XMM sub_sd index that returns the same register, and ARM NEON Q registers where both D subregs have ssub_0 and ssub_1 sub-subregs. It is now required that all subregisters are named by an index, and a future patch will also require inherited subregisters to be named. This is necessary to allow composite subregister indices to be reduced to a single index. llvm-svn: 104654	2010-05-26 00:28:19 +00:00
Shih-wei Liao	b6e0bc9457	Adding the missing implementation of Bitfield's "clear" and "insert". Fixing http://llvm.org/bugs/show_bug.cgi?id=7222. llvm-svn: 104653	2010-05-26 00:25:05 +00:00
Shih-wei Liao	e22abfa823	To handle s* registers in emitVFPLoadStoreMultipleInstruction(). Fixing http://llvm.org/bugs/show_bug.cgi?id=7221. llvm-svn: 104652	2010-05-26 00:02:28 +00:00
Jakob Stoklund Olesen	66c939a2ca	Drop the SuperregHashTable. It is essentially the same as SubregHashTable. llvm-svn: 104650	2010-05-25 23:43:18 +00:00
Kevin Enderby	492d4f409a	Changed the encoding of X86 floating point stack operations where both operands are st(0). These can be encoded using an opcode for storing in st(0) or using an opcode for storing in st(i), where i can also be 0. To allow testing with the darwin assembler and get a matching binary the opcode for storing in st(0) is now used. To do this the same logical trick is used from the darwin assembler in converting things like this: fmul %st(0), %st into this: fmul %st(0) by looking for the second operand being X86::ST0 for specific floating point mnemonics then removing the second X86::ST0 operand. This also has the added benefit of allowing things like: fmul %st(1), %st that llvm-mc did not assemble. llvm-svn: 104634	2010-05-25 20:52:34 +00:00
Jakob Stoklund Olesen	3311eb50d7	Separate unrelated cases that once shared a numeric value llvm-svn: 104629	2010-05-25 19:49:40 +00:00
Jakob Stoklund Olesen	1ad0d5e25b	Print symbolic SubRegIndex names on machine operands. llvm-svn: 104628	2010-05-25 19:49:38 +00:00
Jakob Stoklund Olesen	673e7e0f37	Remove NumberHack entirely. SubRegIndex instances are now numbered uniquely the same way Register instances are - in lexicographical order by name. llvm-svn: 104627	2010-05-25 19:49:33 +00:00
Daniel Dunbar	0e767d7364	MC/X86: Add a hack to allow recognizing 'cmpltps' and friends. llvm-svn: 104626	2010-05-25 19:49:32 +00:00
Daniel Dunbar	4a5b2c597b	MC/X86: Define explicit immediate forms of cmp{ss,sd,ps,pd}. llvm-svn: 104622	2010-05-25 18:40:53 +00:00
Kevin Enderby	c798965e63	The BT64ri8 record in X86Instr64bit.td was missing a REX_W which is required for the 64-bit version of the Bit Test instruction. llvm-svn: 104621	2010-05-25 18:16:58 +00:00
Eric Christopher	f6562d35ac	Make sure aeskeygenassist uses an unsigned immediate field. Fixes rdar://8017638 llvm-svn: 104617	2010-05-25 17:33:22 +00:00
Jakob Stoklund Olesen	3b59e0601e	Ignore NumberHack and give each SubRegIndex instance a unique enum value instead. This passes lit tests, but I'll give it a go through the buildbots to smoke out any remaining places that depend on the old SubRegIndex numbering. Then I'll remove NumberHack entirely. llvm-svn: 104615	2010-05-25 17:21:04 +00:00
Jakob Stoklund Olesen	36caaf1c59	Use enums instead of literals for SystemZ subregisters llvm-svn: 104612	2010-05-25 17:04:18 +00:00
Jakob Stoklund Olesen	396c8802b2	Use enums instead of literals for X86 subregisters. The cases in getMatchingSuperRegClass cannot be broken up until the enums have unique values. llvm-svn: 104611	2010-05-25 17:04:16 +00:00
Zonr Chang	a6714e8a43	Add missing implementation to the materialization of VFP misc. instructions (vmrs, vmsr and vmov (immediate)) llvm-svn: 104588	2010-05-25 10:23:52 +00:00
Zonr Chang	2da5aa1b60	Add support to MOVimm32 using movt/movw for ARM JIT llvm-svn: 104587	2010-05-25 08:42:45 +00:00
Bob Wilson	4f48499d2c	Allow t2MOVsrl_flag and t2MOVsra_flag instructions to be predicated. I don't know of any particular reason why that would be important, but neither can I see any reason to disallow it. llvm-svn: 104583	2010-05-25 04:51:47 +00:00
Bob Wilson	debbbe3fd9	Fix up instruction classes for Thumb2 RSB instructions to be consistent with Thumb2 ADD and SUB instructions: allow RSB instructions to be changed to set the condition codes, and allow RSBS instructions to be predicated. llvm-svn: 104582	2010-05-25 04:43:08 +00:00
Bob Wilson	26fdebcae9	Clean up indentation. llvm-svn: 104580	2010-05-25 03:36:52 +00:00
Jakob Stoklund Olesen	70affbd988	Use enums instead of literals in the ARM backend. llvm-svn: 104573	2010-05-25 00:15:15 +00:00
Jakob Stoklund Olesen	fdb25de17e	Switch SubRegSet to using symbolic SubRegIndices llvm-svn: 104571	2010-05-24 23:03:18 +00:00
Bob Wilson	91b2b8540c	Allow Thumb2 MVN instructions to set condition codes. The immediate operand version of t2MVN already allowed that, but not the register versions. llvm-svn: 104570	2010-05-24 22:41:19 +00:00
Jakob Stoklund Olesen	1181a19318	Lose the dummies llvm-svn: 104564	2010-05-24 21:47:01 +00:00
Jakob Stoklund Olesen	edab242488	Replace the tablegen RegisterClass field SubRegClassList with an alist-like data structure that represents a mapping without any dependencies on SubRegIndex numbering. This brings us closer to being able to remove the explicit SubRegIndex numbering, and it is now possible to specify any mapping without inventing *_INVALID register classes. llvm-svn: 104563	2010-05-24 21:46:58 +00:00
Dan Gohman	79b6a0f140	Fix an mmx movd encoding. llvm-svn: 104552	2010-05-24 20:51:08 +00:00
Kevin Enderby	dc71cc794b	MC/X86: Add aliases for CMOVcc variants. llvm-svn: 104549	2010-05-24 20:32:23 +00:00
Bob Wilson	722bff2c7d	Clean up some extra whitespace. llvm-svn: 104544	2010-05-24 20:08:34 +00:00
Bob Wilson	3eb7691858	Thumb2 RSBS instructions were being printed without the 'S' suffix. Fix it by changing the T2I_rbin_s_is multiclass to handle the CPSR output and 'S' suffix in the same way as T2I_bin_s_irs. llvm-svn: 104531	2010-05-24 18:44:06 +00:00
Evan Cheng	755d45be43	LR is in GPR, not tGPR even in Thumb1 mode. llvm-svn: 104518	2010-05-24 18:00:18 +00:00
Jakob Stoklund Olesen	ff2d118733	Add SubRegIndex defs to PowerPC. It looks like the CR subregister indices are never used. llvm-svn: 104517	2010-05-24 17:55:38 +00:00
Jakob Stoklund Olesen	8a57aeca2a	Use SubRegIndex in SystemZ. Anton, please review the change to SystemZAsmPrinter.cpp. It could be a bug. llvm-svn: 104515	2010-05-24 17:43:01 +00:00
Jakob Stoklund Olesen	5d56769fb6	SubRegIndex'ize Mips llvm-svn: 104514	2010-05-24 17:42:58 +00:00
Jakob Stoklund Olesen	fd6f16fab9	SubRegIndex'ize MSP430 llvm-svn: 104513	2010-05-24 17:42:55 +00:00
Jakob Stoklund Olesen	8d042c0269	Fix a few places that depended on the numeric value of subreg indices. Add assertions in places that depend on consecutive indices. llvm-svn: 104510	2010-05-24 17:13:28 +00:00
Jakob Stoklund Olesen	6c47d6423c	Switch ARMRegisterInfo.td to use SubRegIndex and eliminate the parallel enums from ARMRegisterInfo.h llvm-svn: 104508	2010-05-24 16:54:32 +00:00
Jakob Stoklund Olesen	9340ea59e1	Rename X86 subregister indices to something shorter. Use the tablegen-produced enums. llvm-svn: 104493	2010-05-24 14:48:17 +00:00
Jakob Stoklund Olesen	1c69646e99	Add the SubRegIndex TableGen class. This is the beginning of purely symbolic subregister indices, but we need a bit of jiggling before the explicit numeric indices can be completely removed. llvm-svn: 104492	2010-05-24 14:48:12 +00:00
Bob Wilson	49f40e8c32	VDUP doesn't support vectors with 64-bit elements. llvm-svn: 104455	2010-05-23 05:42:31 +00:00
Daniel Dunbar	b52fcd6304	MC/X86: Subdivide immediates a bit more, so that we properly recognize immediates based on the width of the target instruction. For example: addw $0xFFFF, %ax should match the same as addw $-1, %ax but we used to match it to the longer encoding. llvm-svn: 104453	2010-05-22 21:02:33 +00:00
Daniel Dunbar	346782c12c	tblgen/AsmMatcher: Change AsmOperandClass to allow a list of superclasses instead of just one. llvm-svn: 104452	2010-05-22 21:02:29 +00:00
Daniel Dunbar	d459e29a0a	MC/X86: Add alias for setz, setnz, jz, jnz. llvm-svn: 104435	2010-05-22 06:37:33 +00:00
Evan Cheng	168ced94d8	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Jim Grosbach	bd9485db63	Implement eh.sjlj.longjmp for ARM. Clean up the intrinsic a bit. Followups: docs patch for the builtin and eh.sjlj.setjmp cleanup to match longjmp. llvm-svn: 104419	2010-05-22 01:06:18 +00:00
Bob Wilson	91fdf68516	Recognize more BUILD_VECTORs and VECTOR_SHUFFLEs that can be implemented by copying VFP subregs. This exposed a bunch of dead code in the *spill-q.ll tests, so I tweaked those tests to keep that code from being optimized away. Radar 7872877. llvm-svn: 104415	2010-05-22 00:23:12 +00:00
Chris Lattner	4dc833c607	add a note llvm-svn: 104404	2010-05-21 23:16:21 +00:00
Kevin Enderby	7e7482c80f	Added retl for 32-bit x86 and added retq for 64-bit x86. llvm-svn: 104394	2010-05-21 23:01:38 +00:00
Evan Cheng	3858451e09	- Change MachineInstr::findRegisterDefOperandIdx so it can also look for defs that are aliases of the specified register. - Rename modifiesRegister to definesRegister since it's looking for a def of the specific register or one of its super-registers. It's not looking for a def of a sub-register or alias that could change the specified register. - Added modifiesRegister to look for defs of aliases. llvm-svn: 104377	2010-05-21 20:53:24 +00:00
Dale Johannesen	2b78565842	Previous commit message should refer to 104308. llvm-svn: 104337	2010-05-21 18:44:47 +00:00
Dale Johannesen	6361e3e8a2	Fix two bugs in 104348: Case where MMX is disabled wasn't handled right. MMX->MMX bitconverts are Legal. llvm-svn: 104336	2010-05-21 18:40:15 +00:00
Chris Lattner	0735ecfe17	now that fp reg kill insertion stuff happens as a separate pass after isel instead of being interlaced with it, we can trust that all the code for a function has been isel'd before it is run. The practical impact of this is that we can scan for machine instr phis instead of doing a fuzzy match on the LLVM BB for phi nodes. Doing the fuzzy match required knowing when isel would produce an fp reg stack phi which was gross. It was also wrong in cases where select got lowered to a branch tree because cmovs aren't available (PR6828). Just do the scan on machine phis which is simpler, faster and more correct. This fixes PR6828. llvm-svn: 104333	2010-05-21 18:17:54 +00:00
Chris Lattner	058a207436	Use less evil form of switch stmt. llvm-svn: 104331	2010-05-21 18:02:42 +00:00
Chris Lattner	39a8a43bd8	use continue to reduce nesting. llvm-svn: 104330	2010-05-21 18:01:24 +00:00
Chris Lattner	b7d68a2256	pull a nested loop of this pass out to its own function, eliminating the gymnastics around the ContainsFPCode var. llvm-svn: 104328	2010-05-21 17:57:03 +00:00
Chris Lattner	fb41aaefeb	modernize this pass a bit, fit in 80 columns. llvm-svn: 104326	2010-05-21 17:49:07 +00:00
Matt Fleming	638cdb2db1	Currently, createMachOStreamer() is invoked directly in llvm-mc which isn't ideal if we want to be able to use another object file format. Add a createObjectStreamer() factory method so that the correct object file streamer can be instantiated for a given target triple. llvm-svn: 104318	2010-05-21 12:54:43 +00:00
Matt Fleming	5abb6dd61e	Split out the x86_32 and x86_64 ELF backends as they handle ELF differently. This will make adding ELF support easier in the long run. llvm-svn: 104317	2010-05-21 11:39:07 +00:00
Dale Johannesen	b3b9c8ac48	Fix i64->f64 conversion, x86-64, -no-sse. A bit tricky since there's a 3rd 64-bit type, MMX vectors. PR 7135. llvm-svn: 104308	2010-05-21 00:52:33 +00:00
Evan Cheng	34c260458a	Change ARM scheduling default to list-hybrid if the target supports floating point instructions (and is not using soft float). llvm-svn: 104307	2010-05-21 00:43:17 +00:00
Evan Cheng	4401f8873c	Allow targets more controls on what nodes are scheduled by reg pressure, what for latency in hybrid mode. llvm-svn: 104293	2010-05-20 23:26:43 +00:00
Daniel Dunbar	baf2eea6f4	MC/X86: Add movq alias for movabsq, to allow matching 64-bit immediates with movq. llvm-svn: 104275	2010-05-20 20:36:29 +00:00
Daniel Dunbar	61655aa2bb	X86: Model i64i32imm properly, as a subclass of all immediates. llvm-svn: 104272	2010-05-20 20:20:39 +00:00
Daniel Dunbar	6d4c66dc1d	X86: Fix immediate type of FOO64i32 operations. llvm-svn: 104271	2010-05-20 20:20:35 +00:00
Bob Wilson	5954994bba	Handle Neon v2f64 and v2i64 vector shuffles as register copies. This fixes the remaining issue with pr7167. llvm-svn: 104257	2010-05-20 18:39:53 +00:00
Dan Gohman	098a47931c	Delete MMX_MOVQ64gmr. It was the same as MMX_MOVQ64mr, but it didn't have a pattern and it had an invalid encoding. llvm-svn: 104244	2010-05-20 18:05:01 +00:00
Dale Johannesen	d7d6638e3e	The PPC MFCR instruction implicitly uses all 8 of the CR registers. Currently it is not so marked, which leads to VCMPEQ instructions that feed into it getting deleted. If it is so marked, local RA complains about this sequence: vreg = MCRF CR0 MFCR <kill of whatever preg got assigned to vreg> All current uses of this instruction are only interested in one of the 8 CR registers, so redefine MFCR to be a normal unary instruction with a CR input (which is emitted only as a comment). That avoids all problems. 7739628. llvm-svn: 104238	2010-05-20 17:48:26 +00:00
Dan Gohman	29790edb93	Fix assembly parsing and encoding of the pushf and popf family of instructions. llvm-svn: 104231	2010-05-20 16:16:00 +00:00
Dan Gohman	5238275478	Set neverHasSideEffects on 64-bit pushf and popf, for consistency with 16-bit and 32-bit pushf and popf. llvm-svn: 104228	2010-05-20 15:42:55 +00:00
Dan Gohman	1e19eab963	Define the x86 pause instruction. llvm-svn: 104204	2010-05-20 01:35:50 +00:00
Dan Gohman	a3b7570a3a	Fix the sfence instruction to use MRM_F8 instead of MRM7r, since it doesn't have a register operand. Also, use I instead of PSI, for consistency with mfence and lfence. llvm-svn: 104203	2010-05-20 01:23:41 +00:00
Evan Cheng	738e920edf	Code refactoring: pull SchedPreference enum from TargetLowering.h to TargetMachine.h and put it in its own namespace. llvm-svn: 104147	2010-05-19 20:19:50 +00:00
Daniel Dunbar	52e37becf6	MC/X86: Add missing entry for TAILJMP_1 to getRelaxedOpcode(). llvm-svn: 104122	2010-05-19 17:20:58 +00:00
Daniel Dunbar	d2f78e755f	MC/X86: Lower TAILCALLd[64] to JMP_1, to allow relaxation and to avoid same prefix byte problem as in r104062. - As a total hack to keep the TAILCALL markers in the output, which some tests depend on, this invents a new TAILJMP_1 instruction. llvm-svn: 104120	2010-05-19 15:26:43 +00:00
Daniel Dunbar	b243dfb085	MC/X86: Strip spurious operands from TAILJMPr64 as we do for CALL64r and CALL64pcrel32, for the same reason. llvm-svn: 104116	2010-05-19 08:07:12 +00:00
Evan Cheng	daeca2d156	t2LEApcrel and tLEApcrel are re-materializable. This makes it possible to hoist more loads during machine LICM. llvm-svn: 104115	2010-05-19 07:28:01 +00:00
Evan Cheng	b7704fee4c	Use 'adr' for LEApcrel and LEApcrel. Mark LEApcrel re-materializable. llvm-svn: 104114	2010-05-19 07:26:50 +00:00
Daniel Dunbar	4f6c7c6d94	MC/X86: Lower MOV{8,16,32,64}{rm,mr} to fixed-register forms, as appropriate. llvm-svn: 104112	2010-05-19 06:20:44 +00:00
Evan Cheng	dd7f566597	Mark pattern-less mayLoad / mayStore instructions neverHasSideEffects. These do not have other un-modeled side effects. llvm-svn: 104111	2010-05-19 06:07:03 +00:00
Evan Cheng	e89f5ae9d4	Target instruction selection should copy memoperands. llvm-svn: 104110	2010-05-19 06:06:09 +00:00
Daniel Dunbar	45ace40959	MC/X86: Strip spurious operands from CALL64r as we do for CALL64pcrel32, to avoid same prefix byte problem as in r104062. llvm-svn: 104108	2010-05-19 04:31:36 +00:00
Evan Cheng	2c452fcd14	Mark a few more pattern-less instructions with neverHasSideEffects. This is especially important on instructions like t2LEApcrel which are prime candidates for machine LICM. llvm-svn: 104102	2010-05-19 01:52:25 +00:00
Dan Gohman	58c6f21453	Factor out the code for picking integer arithmetic with immediate opcodes into a helper function. This fixes a few places in the code which were not properly selecting the 8-bit-immediate opcodes. llvm-svn: 104091	2010-05-19 00:53:19 +00:00
Dan Gohman	f8bf663873	Teach mode load folding and unfolding code about CMP32ri8 and friends. llvm-svn: 104068	2010-05-18 21:54:15 +00:00
Bill Wendling	4ed63f8687	Don't eliminate frame pointers from leaf functions if "--disable-fp-elim" is specified. llvm-svn: 104066	2010-05-18 21:47:08 +00:00
Dan Gohman	887dd1cd31	When converting a test to a cmp to fold a load, use the cmp that has an 8-bit immediate field rather than one with a wider immediate field. llvm-svn: 104064	2010-05-18 21:42:03 +00:00
Chris Lattner	9f46539e07	make mcinstlower remove all but the first operand to CALL64pcrel32. The register use operands (e.g. the first argument is passed in a register) is currently being modeled as a normal register use, instead of correctly being an implicit use. This causes the operand to get propagated onto the mcinst, which was causing the encoder to emit a rex prefix byte, which generates an invalid call. This fixes rdar://7998435 llvm-svn: 104062	2010-05-18 21:40:18 +00:00
Evan Cheng	f19384d54a	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Daniel Dunbar	a4820fcc78	MC/X86: Implement custom lowering to make sure we match things like X86::ADC32ri $0, %eax to X86::ADC32i32 $0 llvm-svn: 104030	2010-05-18 17:22:24 +00:00
Jakob Stoklund Olesen	93d8844699	ARMBaseRegisterInfo::estimateRSStackSizeLimit() could return prematurely with a too large limit. The function would return immediately when finding an addrmode 3/5 instruction. It needs to keep scanning in case there is an addrmode 6 instruction which drops the limit to 0. A test case is very difficult to produce because it will only fail when the scavenger is used. rdar://problem/7894847 llvm-svn: 103995	2010-05-17 23:29:23 +00:00
Bill Wendling	02d3368831	- Set the "HasCalls" flag after instruction selection is finished. - Change the logic DisableFramePointerElim() to check for the -disable-non-leaf-fp-elim before -disable-fp-elim. llvm-svn: 103990	2010-05-17 23:09:50 +00:00
Evan Cheng	cd04ed3533	vmov of immediates are trivially re-materializable. llvm-svn: 103982	2010-05-17 21:54:50 +00:00
Daniel Dunbar	ce5e1bb326	MC: Add dyn_cast support to MCSection. - Of questionable utility, since in general anything which wants to do this should probably be within a target specific hook, which can rely on the sections being of the appropriate type. However, it can be useful for short term hacks. llvm-svn: 103980	2010-05-17 21:54:26 +00:00
Eric Christopher	bf79238599	Add some section and constant support for darwin TLS. llvm-svn: 103974	2010-05-17 21:02:07 +00:00
Bob Wilson	c601801a7e	Fix a regression in 464.h264 for thumb1 and thumb2 nightly tests. Obvious in retrospect but not fun to debug. llvm-svn: 103969	2010-05-17 20:31:13 +00:00
Evan Cheng	3d98b996ff	Turn on -neon-reg-sequence by default. Using NEON load / store multiple instructions will no longer create gobs of vmov of D registers! llvm-svn: 103960	2010-05-17 19:51:20 +00:00
Evan Cheng	5a2809cbd8	No reason not to run the NEON domain crossing fix up pass in thumb2 mode. llvm-svn: 103917	2010-05-17 01:11:46 +00:00
Dale Johannesen	2ef974ee0e	Revert 103911; it broke a test that expects bitconvert <1xi64> -> i64 to work in MMX registers on hosts where -no-sse is the default (not mine). The right thing is to accept this and make i64->f64 conversions go through memory, but I don't have time right now. llvm-svn: 103914	2010-05-16 20:19:04 +00:00
Dale Johannesen	fc1492d71b	Make x86-64 64-bit bitconvert work when SSE is not available. (This worked as of about 6 months ago and I didn't track down exactly what broke it; I think this fix is appropriate.) llvm-svn: 103911	2010-05-16 18:22:38 +00:00
Anton Korobeynikov	497d831966	Chris said that the comment char should be escaped. Fix all the occurrences of "@" in *.td llvm-svn: 103903	2010-05-16 09:15:36 +00:00
Anton Korobeynikov	8f35fabbc1	Add support for thiscall calling convention. Patch by Charles Davis and Steven Watanabe! llvm-svn: 103902	2010-05-16 09:08:45 +00:00
Anton Korobeynikov	4c719c4515	Generalize the ARM DAG combiner of mul with constants to all power-of-two cases. llvm-svn: 103901	2010-05-16 08:54:20 +00:00
Evan Cheng	298e6b82eb	Model vst lane instructions with REG_SEQUENCE. llvm-svn: 103898	2010-05-16 03:27:48 +00:00
Dale Johannesen	3a366a88f2	Fix uint64->{float, double} conversion to do rounding correctly in 32-bit. The implementation in LegalizeIntegerTypes to handle this as sint64->float + appropriate power of 2 is subject to double rounding, considered incorrect by numerics people. Use this implementation only when it is safe. This leads to using library calls in some cases that produced inline code before, but it's correct now. (EVTToAPFloatSemantics belongs somewhere else, any suggestions?) Add a correctly rounding (though not particularly fast) conversion that uses X87 80-bit computations for x86-32. 7885399, 5901940. This shows up in gcc.c-torture/execute/ieee/rbug.c in the gcc testsuite on some platforms. llvm-svn: 103883	2010-05-15 18:51:12 +00:00
Anton Korobeynikov	1bf28a128b	Some cheap DAG combine goodness for multiplication with a particular constant. This can be extended later on to handle more "complex" constants. llvm-svn: 103881	2010-05-15 18:16:59 +00:00
Anton Korobeynikov	2b7aace2e0	"trap" pseudo-op turned out to be apple-local. Temporarily emit it as raw bytes until it is added to binutils as well. llvm-svn: 103878	2010-05-15 17:19:20 +00:00
Evan Cheng	9e688cbcc9	Model 128-bit vld lane with REG_SEQUENCE. llvm-svn: 103868	2010-05-15 07:53:37 +00:00
Evan Cheng	3d214cdfaf	v4i64 and v8i64 are only synthesizable when NEON is available. llvm-svn: 103855	2010-05-15 02:20:21 +00:00
Evan Cheng	4cad68eb34	Allow TargetLowering::getRegClassFor() to be called on illegal types. Also allow target to override it in order to map register classes to illegal but synthesizable types. e.g. v4i64, v8i64 for ARM / NEON. llvm-svn: 103854	2010-05-15 02:18:07 +00:00
Evan Cheng	0cbd11dfb2	Model 64-bit lane vld with REG_SEQUENCE. llvm-svn: 103851	2010-05-15 01:36:29 +00:00
Evan Cheng	8c2d062ea6	Teach two-address pass to do some coalescing while eliminating REG_SEQUENCE instructions. e.g. %reg1026<def> = VLDMQ %reg1025<kill>, 260, pred:14, pred:%reg0 %reg1027<def> = EXTRACT_SUBREG %reg1026, 6 %reg1028<def> = EXTRACT_SUBREG %reg1026<kill>, 5 ... %reg1029<def> = REG_SEQUENCE %reg1028<kill>, 5, %reg1027<kill>, 6, %reg1028, 7, %reg1027, 8, %reg1028, 9, %reg1027, 10, %reg1030<kill>, 11, %reg1032<kill>, 12 After REG_SEQUENCE is eliminated, we are left with: %reg1026<def> = VLDMQ %reg1025<kill>, 260, pred:14, pred:%reg0 %reg1029:6<def> = EXTRACT_SUBREG %reg1026, 6 %reg1029:5<def> = EXTRACT_SUBREG %reg1026<kill>, 5 The regular coalescer will not be able to coalesce reg1026 and reg1029 because it doesn't know how to combine sub-register indices 5 and 6. Now 2-address pass will consult the target whether sub-registers 5 and 6 of reg1026 can be combined into a larger sub-register (or combined to be reg1026 itself as is the case here). If it is possible, it will be able to replace references of reg1026 with reg1029 + the larger sub-register index. llvm-svn: 103835	2010-05-14 23:21:14 +00:00
Evan Cheng	cb78e5558b	Model VST_UPD and VSToddUPD pair with REG_SEQUENCE. llvm-svn: 103833	2010-05-14 22:54:52 +00:00
Bill Wendling	0160e55893	SystemZ really does mean "has calls" and not just "adjusts stack." Go ahead and replace the check with the appropriate predicate. Modify the testcase to reflect the correct code. (It should be saving callee-saved registers on the stack allocated by the calling function.) llvm-svn: 103829	2010-05-14 22:17:42 +00:00
Dan Gohman	062a97f0f6	BR is a barrier. llvm-svn: 103826	2010-05-14 22:00:27 +00:00
Bill Wendling	1713d95874	Several tail call tests apparently rely upon this being "adjusts stack" instead of "has calls". That's probably wrong, but it needs further investigation. Revert to the original behavior until this is settled. llvm-svn: 103824	2010-05-14 21:58:35 +00:00
Bill Wendling	e9ac7ad68c	This should happen if there are no calls, not if it just doesn't adjust the stack. llvm-svn: 103813	2010-05-14 21:38:44 +00:00
Bill Wendling	db4bc54c4f	Revert r103804. The comment is correct. llvm-svn: 103808	2010-05-14 21:28:24 +00:00
Bill Wendling	7bf4bae5dc	Fix comment. llvm-svn: 103804	2010-05-14 21:17:29 +00:00
Bill Wendling	95f6ebcb37	Rename "HasCalls" in MachineFrameInfo to "AdjustsStack" to better describe what the variable actually tracks. N.B., several back-ends are using "HasCalls" as being synonymous for something that adjusts the stack. This isn't 100% correct and should be looked into. llvm-svn: 103802	2010-05-14 21:14:32 +00:00
Dan Gohman	35dd005d22	Lowering of atomic instructions can result in operands being used more than once. If ISel had put a kill flag on one of them, it's not valid to transfer the kill flag to each new instance. llvm-svn: 103799	2010-05-14 21:01:44 +00:00
Kevin Enderby	7bc111f5a9	Fix so "int3" is correctly accepted, added "into" and fixed "int" with an argument, like "int $4", to not get an Assertion error. llvm-svn: 103791	2010-05-14 19:16:02 +00:00
Evan Cheng	cfa7d02d6e	Model VLD_UPD and VLDodd_UPD pair with REG_SEQUENCE. llvm-svn: 103790	2010-05-14 18:54:59 +00:00
Dan Gohman	30e3db2ba3	Set isTerminator on TRAP instructions. llvm-svn: 103778	2010-05-14 16:46:02 +00:00
Dan Gohman	c56ca22616	Don't use isBarrier for the PowerPC sync instruction. isBarrier is for control barriers, not memory ordering barriers. llvm-svn: 103777	2010-05-14 16:42:16 +00:00
Dan Gohman	02d9947e60	Add mayLoad and mayStore flags to instructions which missed them. llvm-svn: 103776	2010-05-14 16:34:55 +00:00
Evan Cheng	cd67c21407	Added a QQQQ register file to model 4-consecutive Q registers. llvm-svn: 103760	2010-05-14 02:13:41 +00:00
Evan Cheng	ca21cc8b13	Fix comments. llvm-svn: 103749	2010-05-14 00:21:45 +00:00
Evan Cheng	2ca1bd119e	Add comment about the pseudo registers QQ, each of which is a pair of Q registers. llvm-svn: 103731	2010-05-13 20:02:08 +00:00
Bob Wilson	208dc08c2b	Fix pr7110: For non-Darwin targets UnspilledCS1GPRs may include high registers. Do not use those for Thumb1 functions. llvm-svn: 103730	2010-05-13 19:58:24 +00:00
Oscar Fuentes	82135e5a2f	CMake: fixes 64 bit Visual Studio IDE build. Fixes bug 4936. Patch by Dimitry Andric! llvm-svn: 103727	2010-05-13 19:34:06 +00:00
Anton Korobeynikov	449df9698f	Properly set thread-local flag on globals during cpp emission llvm-svn: 103702	2010-05-13 07:41:57 +00:00
Daniel Dunbar	a58dc0ffdb	Fix -Asserts warning. llvm-svn: 103694	2010-05-13 03:19:36 +00:00
Evan Cheng	9de7cfe3f4	Bring back VLD1q and VST1q and use them for reloading / spilling Q registers. This allows folding loads and stores into VMOVQ. llvm-svn: 103692	2010-05-13 01:12:06 +00:00
Evan Cheng	2f736c9577	Expand VMOVQQ into a pair of VMOVQ. llvm-svn: 103684	2010-05-13 00:17:02 +00:00
Evan Cheng	79efd71962	Mark some pattern-less instructions as neverHasSideEffects. llvm-svn: 103683	2010-05-13 00:16:46 +00:00
Chris Lattner	9efef006cf	reapply r103668 with a fix. Never make "minor syntax changes" after testing before committing. llvm-svn: 103681	2010-05-13 00:02:47 +00:00
Chris Lattner	e354235512	revert r103668 for now, it is apparently breaking things. llvm-svn: 103677	2010-05-12 23:40:59 +00:00
Chris Lattner	a6df4650fd	moffset forms of moves are x86-32 only, make the parser lower them to the correct x86-64 instructions since we don't have a clean way to handle this in td files yet. rdar://7947184 llvm-svn: 103668	2010-05-12 23:13:36 +00:00
Evan Cheng	7c1f56f29a	Fix some potential issues in the pseudo instruction expansion phase: copy implicit operands and memoperands. Also, expand instructions even if their defs are "dead" since they may have implicit kill operands. llvm-svn: 103667	2010-05-12 23:13:12 +00:00
Chris Lattner	e132b0a92c	fix the encoding of the obscure "moffset" forms of moves, i386 part first. rdar://7947184 llvm-svn: 103660	2010-05-12 22:48:24 +00:00
Evan Cheng	5aa20d6c26	Remove a dead fixme. llvm-svn: 103642	2010-05-12 20:20:22 +00:00
Rafael Espindola	b69c7b76f1	Add support for movi32 of global values to the new (MC) asm printer. llvm-svn: 103576	2010-05-12 05:16:34 +00:00
Evan Cheng	a2ff4fc96a	vst instructions are modeled as this: v1024 = REG_SEQUENCE ... v1025 = EXTRACT_SUBREG v1024, 5 v1026 = EXTRACT_SUBREG v1024, 6 = VSTxx <addr>, v1025, v1026 The REG_SEQUENCE ensures the sources that feed into the VST instruction are getting the right register allocation so they form a large super- register. The extract_subreg will be coalesced away and all would just work: v1024 = REG_SEQUENCE ... = VSTxx <addr>, v1024:5, v1024:6 The problem is if the coalescer isn't run, the extract_subreg instructions would stick around and there is no assurance v1025 and v1026 will get the right registers. As a short term workaround, teach the NEON pre-allocation pass to transfer the sub-register indices over. An alternative would be do it 2addr pass when reg_sequence's are eliminated. But that seems wrong and requires updating liveness information. Another alternative is to do this in the scheduler when the instructions are created. But that would mean somehow the scheduler this has to be done for correctness reason. That's yucky as well. So for now, we are leaving this in the target specific pass. llvm-svn: 103540	2010-05-12 01:42:50 +00:00
Daniel Dunbar	059379a9d7	MC/X86: Extend suffix matching hack to match 'q' suffix. llvm-svn: 103535	2010-05-12 00:54:20 +00:00
Daniel Dunbar	ba2f4c3884	MC/Mach-O/x86_64: Add a new hook for checking whether a particular section can be diced into atoms, and adjust getAtom() to take this into account. - This fixes relocations to symbols in fixed size literal sections, for example. llvm-svn: 103532	2010-05-12 00:38:17 +00:00
Dan Gohman	1a1b51ff59	Add initial kill flag support to FastISel. llvm-svn: 103529	2010-05-11 23:54:07 +00:00
Evan Cheng	44e865f487	Avoid breaking vstd when reg_sequence is not used. llvm-svn: 103513	2010-05-11 21:07:36 +00:00
Bill Wendling	508f661fbe	Simplify this logic of creating a default Features object. llvm-svn: 103507	2010-05-11 20:46:04 +00:00
Duncan Sands	6c5e4355bb	I got tired of VISIBILITY_HIDDEN colliding with the gcc enum. Rename it to LLVM_LIBRARY_VISIBILITY and introduce LLVM_GLOBAL_VISIBILITY, which is the opposite, for future use by dragonegg. llvm-svn: 103495	2010-05-11 20:16:09 +00:00
Dan Gohman	4cfccb801c	Remove the "WantsWholeFile" concept, as it's no longer needed. CBE and the others use the regular addPassesToEmitFile hook now, and llc no longer needs a bunch of redundant code to handle the whole-file case. llvm-svn: 103492	2010-05-11 19:57:55 +00:00
Dan Gohman	bb919dfb6b	Implement a bunch more TargetSelectionDAGInfo infrastructure. Move EmitTargetCodeForMemcpy, EmitTargetCodeForMemset, and EmitTargetCodeForMemmove out of TargetLowering and into SelectionDAGInfo to exercise this. llvm-svn: 103481	2010-05-11 17:31:57 +00:00
Dan Gohman	4df9d9ce11	Remove the TargetLowering::getSubtarget() virtual function, which was unused. TargetMachine::getSubtarget() is used instead. llvm-svn: 103474	2010-05-11 16:21:03 +00:00
Kalle Raiskila	9dd3ef8d01	Make SPU backend not assert on jump tables. llvm-svn: 103466	2010-05-11 11:00:02 +00:00
Evan Cheng	2fa5a7e7e4	Select @llvm.trap to the special B with 1111 condition (i.e. trap) instruction. llvm-svn: 103459	2010-05-11 07:26:32 +00:00
Bill Wendling	ea31737fef	Don't create a StringRef with a NULL value. llvm-svn: 103455	2010-05-11 01:33:39 +00:00
Evan Cheng	e276c18385	Model some vst3 and vst4 with reg_sequence. llvm-svn: 103453	2010-05-11 01:19:40 +00:00
Bill Wendling	a12c1ff25a	The getDefaultSubtargetFeatures method of SubtargetFeature did actually return a string of features for that target. However LTO was using that string to pass into the "create target machine" stuff. That stuff needed the feature string to be in a particular form. In particular, it needed the CPU specified first and then the attributes. If there isn't a CPU specified, it required it to be blank -- e.g., ",+altivec". Yuck. Modify the getDefaultSubtargetFeatures method to be a non-static member function. For all attributes for a specific subtarget, it will add them in like normal. It will also take a CPU string so that it can satisfy this horrible syntax. llvm-svn: 103451	2010-05-11 00:30:02 +00:00
Evan Cheng	630063aa0d	Model some vld3 instructions with REG_SEQUENCE. llvm-svn: 103437	2010-05-10 21:26:24 +00:00
Evan Cheng	c2ae5f546f	Model vld2 / vst2 with reg_sequence. llvm-svn: 103411	2010-05-10 17:34:18 +00:00
Kalle Raiskila	92ea401d8f	Fix encoding of 'sf' and 'sfh' instructions. llvm-svn: 103399	2010-05-10 08:13:49 +00:00
Nathan Jeffords	b19c598843	updated handling dllexport in X86AsmPrinter changed dllexport code to use EmitBytes instead of EmitRawText, and changed the export option to use /EXPORT: instead of -export: on the windows platform llvm-svn: 103377	2010-05-09 08:40:06 +00:00
Nathan Jeffords	2760216c89	made COFF target dllexport logic apply to all subtargets llvm-svn: 103373	2010-05-09 05:52:28 +00:00
Chris Lattner	72afa956db	break coff symbol definition stuff out into proper MCStreamer callbacks, patch by Nathan Jeffords! llvm-svn: 103346	2010-05-08 19:54:22 +00:00
Jim Grosbach	2a41cad900	Clean up the conditional for handling of sign_extend_inreg based on whether the extract instructions are available. rdar://7956878 llvm-svn: 103277	2010-05-07 18:34:55 +00:00
Devang Patel	4423abd734	Use overloaded operators instead of DIDescriptor::getNode() llvm-svn: 103276	2010-05-07 18:19:32 +00:00
Kalle Raiskila	b3c5c4611c	Testing svn access with a note added to documentation. llvm-svn: 103271	2010-05-07 18:06:28 +00:00
Chris Lattner	87cffa9498	switch MCSectionCOFF from a syntactic to semantic representation, patch by Peter Housel! llvm-svn: 103267	2010-05-07 17:17:41 +00:00
Evan Cheng	86eb22976f	Use VLD2q32 / VST2q32 to reload / spill QQ (pair of Q) registers when stack slot is sufficiently aligned. Use VLDMD / VSTMD otherwise. llvm-svn: 103235	2010-05-07 02:04:02 +00:00
Evan Cheng	04d47e8efa	Use VSTMD / VLDMD for spills and reloads of Q registers instead of VSTMQ / VLDQ. The latter are aliases which ought to be eliminated but we can't because they are used for storing and loading v2f64 values. llvm-svn: 103234	2010-05-07 01:54:08 +00:00
Dan Gohman	90c600d6d2	When rematerializing, use the debug location of the original instruction, rather than a location near where the new instruction is being inserted. llvm-svn: 103232	2010-05-07 01:28:10 +00:00
Evan Cheng	ddc93c7e04	Remove VLD1q and VST1q for reloading and spilling Q registers. Just use VLD1q64 / VST1q64 and reference sub-registers. llvm-svn: 103218	2010-05-07 00:24:52 +00:00
Daniel Dunbar	21aa523c28	MC/X86: X86AbsMemAsmOperand is subclass of X86NoSegMemAsmOperand. - This fixes "leal 0, %eax", for example. llvm-svn: 103205	2010-05-06 22:39:14 +00:00
Chris Lattner	348dc9b15a	fix rdar://7947167 - llvm-mc doesn't match movsq llvm-svn: 103199	2010-05-06 21:48:14 +00:00
Sean Callanan	e7e1cf9fbd	Eliminated the classification of control registers into %ecr_ and %rcr_, leaving just %cr_ which is what people expect. Updated the disassembler to support this unified register set. Added a testcase to verify that the registers continue to be decoded correctly. llvm-svn: 103196	2010-05-06 20:59:00 +00:00
Daniel Dunbar	358b29c855	MC/X86: Error out if we see a non-constant FK_Data_1 or FK_Data_2 fixup, since we don't currently support relaxing them. llvm-svn: 103195	2010-05-06 20:34:01 +00:00
Dan Gohman	779c69bbc5	Add a DebugLoc argument to TargetInstrInfo::copyRegToReg, so that it doesn't have to guess. llvm-svn: 103194	2010-05-06 20:33:48 +00:00
Evan Cheng	efb126a665	Add argument TargetRegisterInfo to loadRegFromStackSlot and storeRegToStackSlot. llvm-svn: 103193	2010-05-06 19:06:44 +00:00
Bob Wilson	f765e1f34a	Add a missing break statement to fix unintentional fall-through (replacing the previous patch for the same issue). llvm-svn: 103183	2010-05-06 16:05:26 +00:00
Jim Grosbach	5e3cccb1e4	Fix unintentional fallthrough. Patch by Edmund Grimley-Evans <Edmund.Grimley-Evans@arm.com> llvm-svn: 103181	2010-05-06 15:32:49 +00:00
Shantonu Sen	94231eec1f	Fix "warning: extra ';' inside a struct or union" when building llvm with clang llvm-svn: 103179	2010-05-06 14:57:47 +00:00
Evan Cheng	31cdcd46d6	Re-apply 103156 and 103157. 103156 didn't break anything. 103157 exposed a coalescer bug that's fixed by 103170. llvm-svn: 103172	2010-05-06 06:36:08 +00:00
Dan Gohman	77c71811f5	Revert r103157, which broke test/CodeGen/ARM/2009-11-30-LiveVariablesBug.ll. llvm-svn: 103163	2010-05-06 05:08:57 +00:00
Eric Christopher	9feb1bb117	Revert r103156 since it was breaking the build bots. Reverse-merging r103156 into '.': U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/ARMRegisterInfo.h U lib/Target/ARM/ARMBaseRegisterInfo.cpp U lib/Target/ARM/ARMBaseInstrInfo.cpp U lib/Target/ARM/ARMRegisterInfo.td llvm-svn: 103159	2010-05-06 02:29:06 +00:00
Evan Cheng	8fd7b510d6	Fix an obvious bug in isMoveInstr. It needs to return sub-register indices. llvm-svn: 103157	2010-05-06 01:54:03 +00:00
Evan Cheng	8f99a1c6b4	Adding pseudo 256-bit registers QQ0 . . . QQ7 to represent pairs of Q registers. These will be used to model VLD2 / VST2 instructions in order to get substantially better codegen for them. llvm-svn: 103156	2010-05-06 01:52:03 +00:00
Evan Cheng	9d768f4445	Cosmetic changes. llvm-svn: 103155	2010-05-06 01:34:11 +00:00
Evan Cheng	718ff448df	storeRegToStackSlot has forgotten about QPR_8 register class. llvm-svn: 103154	2010-05-06 01:32:54 +00:00
Jim Grosbach	151cd8f159	Cleanup of ARMv7M support. Move hardware divide and Thumb2 extract/pack instructions to subtarget features and update tests to reflect. PR5717. llvm-svn: 103136	2010-05-05 23:44:43 +00:00
Sean Callanan	4cd930f417	Fixed a sign-extension bug in the X86 disassembler that was causing PC-relative branch targets to be evaluated incorrectly. Also added support for checking operand values to the llvm-mc tester. llvm-svn: 103128	2010-05-05 22:47:27 +00:00
Evan Cheng	61908f6b6c	Do not pre-allocate references of D registers pairs if they are extracted from the same Q register and are in the right order. llvm-svn: 103124	2010-05-05 22:15:40 +00:00
Dan Gohman	f62cd20b62	No-ops emitted for scheduling don't correspond with anything in the user's source, so don't arbitrarily assign them a debug location. llvm-svn: 103121	2010-05-05 20:58:01 +00:00
Jim Grosbach	92d999001c	Add initial support for ARMv7M subtarget and cortex-m3 cpu. Patch by Jordy <snhjordy@gmail.com>. Followup patches will add some tests and adjust to use Subtarget features for the instructions. llvm-svn: 103119	2010-05-05 20:44:35 +00:00
Evan Cheng	d85631e700	Model CONCAT_VECTORS of two 64-bit values as a REG_SEQUENCE. llvm-svn: 103104	2010-05-05 18:28:36 +00:00
Evan Cheng	5ab29366b5	Trim include. llvm-svn: 103103	2010-05-05 18:27:57 +00:00
Eric Christopher	4e7e4e6b60	Revert 102941, we're going to do this via attr and can just hack the code to turn it off when debugging. llvm-svn: 103083	2010-05-05 07:35:59 +00:00
Eric Christopher	a00830df31	Update comment. llvm-svn: 103057	2010-05-04 22:13:03 +00:00
Evan Cheng	8e6b40a881	With -neon-reg-sequence, models forming a Q register from a pair of consecutive D registers as a REG_SEQUENCE. llvm-svn: 103047	2010-05-04 20:39:49 +00:00
Evan Cheng	a3a7b0099c	Do not pre-allocate for registers which form a REG_SEQUENCE. llvm-svn: 103041	2010-05-04 20:38:12 +00:00
Chris Lattner	0185047b3f	"on the rare occasion the SPU BE produces illegal assembly - it tries to emit an add instruction of the form 'a reg, reg, imm'." Patch by Kalle Raiskila! llvm-svn: 103021	2010-05-04 17:58:46 +00:00
Daniel Dunbar	c3e0bafc6d	MC/X86: Chris pointed that 'as' isn't consistent in accepting the long form of instructions which have no direct register usage. Darwin 'as' accepts: add $0, (%rax) but rejects mov $0, (%rax) for example. Given that, only accept suffix matches which match exactly one form. We still need to emit nice diagnostics for failures... llvm-svn: 103015	2010-05-04 17:31:02 +00:00
Daniel Dunbar	9b816a1bb3	MC/X86: Add "support" for matching ATT style mnemonic prefixes. - The idea is that when a match fails, we just try to match each of +'b', +'w', +'l'. If exactly one matches, we assume this is a mnemonic prefix and accept it. If all match, we assume it is width generic, and take the 'l' form. - This would be a horrible hack, if it weren't so simple. Therefore it is an elegant solution! Chris gets the credit for this particular elegant solution. :) - Next step to making this more robust is to have the X86 matcher generate the mnemonic prefix information. Ideally we would also compute up-front exactly which mnemonic to attempt to match, but this may require more custom code in the matcher than is really worth it. llvm-svn: 103012	2010-05-04 16:12:42 +00:00
Gabor Greif	4c0f838637	fix operand indexes when outputting InvokeInsts llvm-svn: 103003	2010-05-04 09:23:54 +00:00
Kevin Enderby	8f0037097f	Fix to r102952. The MOV64toSDrm record in X86Instr64bit.td needed the opcode changed to 0x7E from 0x6E as well as the previous change of RPDI to S3SI. llvm-svn: 102991	2010-05-04 00:42:46 +00:00
Jim Grosbach	30e637c9bc	rdar://7937137 - dbg values not being handled in thumb1 version of eliminateFrameIndex(), leading to llvm_unreachable() assertion failure. llvm-svn: 102980	2010-05-04 00:11:37 +00:00
Dale Johannesen	81bfca7bde	Implement builtin_return_address(x) and builtin_frame_address(x) on PPC for x!=0. 7624113. llvm-svn: 102972	2010-05-03 22:59:34 +00:00
Kevin Enderby	6f2f8d0798	Changed llvm-mc to use the same suffixes with floating point compare instructions as the Mac OS X darwin assembler. Some of which like 'fcoml' assembled to different opcodes. While some of the suffixes were just different. llvm-svn: 102958	2010-05-03 21:31:40 +00:00
Kevin Enderby	e3a1726034	Fixed the encoding of two of the X86 movq instructions. The Move quadword from mm to mm/m64 and the Move quadword from xmm2/mem64 to xmm1 had the incorrect encodings. llvm-svn: 102952	2010-05-03 21:03:31 +00:00
Kevin Enderby	1a51d4cec9	Fixed the encoding of the x86 push instructions. Using a 32-bit immediate value caused a pushl instruction to be incorrectly encoded using only two bytes of immediate, causing the following 2 instruction bytes to be part of the 32-bit immediate value. Also fixed the one byte form of push to be used when the immediate would fit in a sign-extended byte. Lastly changed the names to not include the 32 of PUSH32 since they actually push the size of the stack pointer. llvm-svn: 102951	2010-05-03 20:45:05 +00:00
Eric Christopher	937a5b75f9	Add an option, defaulting to off, to disable the sse domain crossing opts. llvm-svn: 102941	2010-05-03 19:54:02 +00:00
Dan Gohman	73c8145505	Add a README entry. llvm-svn: 102906	2010-05-03 14:31:00 +00:00
Duncan Sands	211427bda9	Remove the -enable-sjlj-eh option, which doesn't do anything. Remove the -enable-eh option which is only used by the JIT, and replace it with -jit-enable-eh. llvm-svn: 102865	2010-05-02 15:36:26 +00:00
Chris Lattner	2094488d81	fix some inconsistent line endings, patch by Jakub Staszak! llvm-svn: 102852	2010-05-01 17:36:49 +00:00
Anton Korobeynikov	319d71f44f	Do folding for indirect branches, where possible llvm-svn: 102836	2010-05-01 12:28:21 +00:00
Anton Korobeynikov	ebbdfef2fc	Implement indirect branches on MSP430 llvm-svn: 102835	2010-05-01 12:04:32 +00:00
Anton Korobeynikov	6fbff44893	Long branch target operands are not pc-rel. This should fix PR6603. llvm-svn: 102834	2010-05-01 12:04:22 +00:00
Dan Gohman	25c1653700	Get rid of the EdgeMapping map. Instead, just check for BasicBlock changes before doing phi lowering for switches. llvm-svn: 102809	2010-05-01 00:01:06 +00:00
Dan Gohman	0cb06d64e8	Fix a typo. llvm-svn: 102799	2010-04-30 22:38:11 +00:00
Dan Gohman	2e2cc87081	Make this code less confusing. Instead of reassigning BB, just operate on the original variables, so it's easier to see what is being done to which blocks. llvm-svn: 102759	2010-04-30 20:14:26 +00:00
Dan Gohman	57bb73c80b	Remove the -disable-16bit command-line option, which is now obsolete. llvm-svn: 102730	2010-04-30 18:30:26 +00:00
Evan Cheng	5117a555e0	Another sibcall bug. If caller and callee calling conventions differ, then it's only safe to do a tail call if the results are returned in the same way. llvm-svn: 102683	2010-04-30 01:12:32 +00:00
Dan Gohman	0fd54fbbcf	Don't leave Base.FrameIndex uninitialized, so that it doesn't print randomly in debug output. llvm-svn: 102668	2010-04-29 23:30:41 +00:00
Dale Johannesen	6feac8a39b	Make naked functions work on PPC. llvm-svn: 102657	2010-04-29 19:32:19 +00:00
Devang Patel	080e4fb2f0	Print variable scope name in DEBUG_VALUE comment. Useful in some cases. e.g. ##DEBUG_VALUE: runOnMachineFunction:this <- RDI+0 ##DEBUG_VALUE: runOnMachineFunction:fn <- RSI+0 ##DEBUG_VALUE: DeadDefs <- undef ## SimpleRegisterCoalescing.cpp:2706 ##DEBUG_VALUE: getRegInfo:this <- [%rsp+$56]+$0 ##DEBUG_VALUE: getTarget:this <- [%rsp+$56]+$0 llvm-svn: 102655	2010-04-29 18:52:10 +00:00
Evan Cheng	38dfa5cf20	Load folding tail call should not use ebp / rbp after it's popped. PEI should use esp / rsp to reference frame instead. llvm-svn: 102596	2010-04-29 05:08:22 +00:00
Mon P Wang	b0a0a26df1	Add support for assemblers that don't support periods in a name llvm-svn: 102594	2010-04-29 04:00:56 +00:00
Evan Cheng	250e917e9d	Frame index can be negative. llvm-svn: 102577	2010-04-29 01:13:30 +00:00
Kevin Enderby	4822841b82	Fixed the word sized Bit Scan Forward/Reverse instructions, they needed the Operand size override prefix to be part of their records. llvm-svn: 102556	2010-04-28 23:20:40 +00:00
Jim Grosbach	04cbcca319	Add sizes non-floating point versions for the eh sjlj intrinsic expansions. rdar://7895451 llvm-svn: 102526	2010-04-28 20:33:09 +00:00
Jakob Stoklund Olesen	96fad31694	Teach X86FloatingPoint that a register can be killed multiple times by the same instruction. This instruction would crash the pass: INLINEASM <es:foo $0 $1>, 9, %FP0<kill>, 9, %FP0<kill>, 14, %EFLAGS<earlyclobber,def,dead> Now it doesn't. llvm-svn: 102509	2010-04-28 18:28:37 +00:00
Evan Cheng	050df1b8de	Enable i16 to i32 promotion by default. llvm-svn: 102493	2010-04-28 08:30:49 +00:00
Evan Cheng	d21f564543	Unbreak the build. Only form shld / shrd after legalization. llvm-svn: 102488	2010-04-28 02:25:18 +00:00
Devang Patel	50c9431203	Emit debug info for byval parameters. llvm-svn: 102486	2010-04-28 01:39:28 +00:00
Evan Cheng	347e3b8f15	Rather than having a ton of patterns for double shift instructions, e.g. SHLD16rrCL, just perform custom dag combine to form x86 specific dag so they match to the same pattern. This also makes sure later dag combine do not cause isel to miss them (e.g. promoting i16 to i32). llvm-svn: 102485	2010-04-28 01:18:01 +00:00
Chris Lattner	a3facc5cb5	further simplify EmitAlignment by eliminating the ForcedAlignBits argument, tweaking the single client of it. llvm-svn: 102484	2010-04-28 01:08:40 +00:00
Stuart Hastings	c0458f1a40	Tweak x86 INC/DEC generation to look for CopyToReg or SETCC. Radar 7866163. llvm-svn: 102477	2010-04-28 00:35:10 +00:00
Devang Patel	12f6855f85	Use MachineOperand::is* predicates. llvm-svn: 102472	2010-04-27 22:24:37 +00:00
Evan Cheng	9e3a4ef089	Fix obvious typos. llvm-svn: 102467	2010-04-27 21:46:03 +00:00
Evan Cheng	3b928af28f	SRA promotion is also not free. llvm-svn: 102456	2010-04-27 19:48:31 +00:00
Chris Lattner	6a5e706e3c	on darwin empty functions need to codegen into something of non-zero length, otherwise labels get incorrectly merged. We handled this by emitting a ".byte 0", but this isn't correct on thumb/arm targets where the text segment needs to be a multiple of 2/4 bytes. Handle this by emitting a noop. This is more gross than it should be because arm/ppc are not fully mc'ized yet. This fixes rdar://7908505 llvm-svn: 102400	2010-04-26 23:37:21 +00:00
Bob Wilson	25f85947a3	Handle register-to-register copies within the tGPR class. Radar 7896289 llvm-svn: 102396	2010-04-26 23:20:08 +00:00
Dale Johannesen	91358585d7	Handle target-specific form of DBG_VALUE in AsmPrinter. llvm-svn: 102373	2010-04-26 20:07:31 +00:00
Dale Johannesen	bc41cfa78f	Add PPC AsmPrinter handling for target-specific form of DBG_VALUE, and a cautionary comment. llvm-svn: 102371	2010-04-26 20:05:01 +00:00
Evan Cheng	6e45f1d1ff	Promoting 16-bit cmp / test aren't free. Don't do it. llvm-svn: 102366	2010-04-26 19:06:11 +00:00
Evan Cheng	1ff9d1b63e	Remove a redundant comment. llvm-svn: 102326	2010-04-26 08:16:57 +00:00
Evan Cheng	f19bd4ebba	Add PPC specific emitFrameIndexDebugValue. llvm-svn: 102325	2010-04-26 07:39:36 +00:00
Evan Cheng	bcb99ecc18	Add ARM specific emitFrameIndexDebugValue. llvm-svn: 102324	2010-04-26 07:39:25 +00:00
Evan Cheng	ed69b382ea	- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and rename it to emitFrameIndexDebugValue. - Teach spiller to modify DBG_VALUE instructions to reference spill slots. llvm-svn: 102323	2010-04-26 07:38:55 +00:00
Dale Johannesen	582565e991	Stop abusing EmitInstrWithCustomInserter for target-dependent form of DEBUG_VALUE, as it doesn't have reasonable default behavior for unsupported targets. Add a new hook instead. No functional change. llvm-svn: 102320	2010-04-25 21:33:54 +00:00
Evan Cheng	a02d0e7d6b	Avoid promoting a i16 node if it would eliminate a (store (op (load))) opportunity. llvm-svn: 102237	2010-04-24 04:44:57 +00:00
Dan Gohman	e1931fa676	Change TargetData's algorithm for computing default vector type alignment to match what's used in clang and GCC for __alignof, rather than trying to guess what Legalize is going to be doing. llvm-svn: 102206	2010-04-23 19:41:15 +00:00
Stuart Hastings	24b63f1597	Add some missing x86 patterns for movdq2q. Fixes two (LLVM-)GCC DejaGNU testcases. Radar 6881029. llvm-svn: 102199	2010-04-23 19:03:32 +00:00
Evan Cheng	0367559786	Fix X86ISD::CMP i16 to i32 promotion. llvm-svn: 102192	2010-04-23 18:21:16 +00:00
Jim Grosbach	825cb299cd	Update ARM DAGtoDAG for matching UBFX instruction for unsigned bitfield extraction. This fixes PR5998. llvm-svn: 102144	2010-04-22 23:24:18 +00:00
Dan Gohman	c594eab10f	Move HandlePHINodesInSuccessorBlocks functions out of SelectionDAGISel and into SelectionDAGBuilder and FastISel. llvm-svn: 102123	2010-04-22 20:46:50 +00:00
Evan Cheng	f1223bdec0	- It's not safe to promote rotates (at least not trivially). - Some code refactoring. llvm-svn: 102111	2010-04-22 20:19:46 +00:00
Johnny Chen	d85afee134	Modified some assert() msg strings; no other functionality change. llvm-svn: 102008	2010-04-21 18:37:48 +00:00
Evan Cheng	4158a0ff6b	Implement -disable-non-leaf-fp-elim which disables frame pointer elimination optimization for non-leaf functions. This will be hooked up to gcc's -momit-leaf-frame-pointer option. rdar://7886181 llvm-svn: 101984	2010-04-21 03:18:23 +00:00
Evan Cheng	9c8cd8c061	isel (i32 anyext i16) as insert_subreg when 16-bit ops are being promoted. llvm-svn: 101979	2010-04-21 01:47:12 +00:00
Evan Cheng	873310f635	Trim include. llvm-svn: 101978	2010-04-21 01:39:06 +00:00
Dan Gohman	57c732b032	Add more const qualifiers on TargetMachine and friends. llvm-svn: 101977	2010-04-21 01:34:56 +00:00
Johnny Chen	dd56c40591	Thumb instructions which have reglist operands at the end and predicate operands before reglist were not properly handled with respect to IT Block. Fix that by creating a new method ARMBasicMCBuilder::DoPredicateOperands() used by those instructions for disassembly. Add a test case. llvm-svn: 101974	2010-04-21 01:01:19 +00:00
Bill Wendling	11740305f9	Handle a displacement location in 64-bit as an RIP-relative displacement. It fixes a bug (<rdar://problem/7880900>) in the JIT. This code wouldn't work: target triple = "x86_64-apple-darwin" define double @func(double %a) { %tmp1 = fmul double %a, 5.000000e-01 ; <double> [#uses=1] ret double %tmp1 } define i32 @main() nounwind { %1 = call double @func(double 4.770000e-04) ; <i64> [#uses=0] ret i32 0 } llvm-svn: 101965	2010-04-21 00:34:04 +00:00
Chris Lattner	84776786a7	teach the x86 address matching stuff to handle (shl (or x,c), 3) the same as (shl (add x, c), 3) when x doesn't have any bits from c set. This finishes off PR1135. Before we compiled the block to: LBB0_3: ## %bb cmpb $4, %dl sete %dl addb %dl, %cl movb %cl, %dl shlb $2, %dl addb %r8b, %dl shlb $2, %dl movzbl %dl, %edx movl %esi, (%rdi,%rdx,4) leaq 2(%rdx), %r9 movl %esi, (%rdi,%r9,4) leaq 1(%rdx), %r9 movl %esi, (%rdi,%r9,4) addq $3, %rdx movl %esi, (%rdi,%rdx,4) incb %r8b decb %al movb %r8b, %dl jne LBB0_1 Now we produce: LBB0_3: ## %bb cmpb $4, %dl sete %dl addb %dl, %cl movb %cl, %dl shlb $2, %dl addb %r8b, %dl shlb $2, %dl movzbl %dl, %edx movl %esi, (%rdi,%rdx,4) movl %esi, 8(%rdi,%rdx,4) movl %esi, 4(%rdi,%rdx,4) movl %esi, 12(%rdi,%rdx,4) incb %r8b decb %al movb %r8b, %dl jne LBB0_1 llvm-svn: 101958	2010-04-20 23:18:40 +00:00
Dale Johannesen	0522b90cdb	Because of the EMMS problem, right now we have to support user-defined operations that use MMX register types, but the compiler shouldn't generate them on its own. This adds a Synthesizable abstraction to represent this, and changes the vector widening computation so it won't produce MMX types. (The motivation is to remove noise from the ABI compatibility part of the gcc test suite, which has some breakage right now.) llvm-svn: 101951	2010-04-20 22:34:09 +00:00
Johnny Chen	8bcc00b43e	Better error-handling of getBitFieldInvMask() where msb < lsb (encoding error), instead of just asserting. llvm-svn: 101942	2010-04-20 21:29:28 +00:00
Johnny Chen	7be315c414	For t2LDRT, t2LDRBT, t2LDRHT, t2LDRSBT, and t2LDRSHT, if Rn(Inst{19-16})=='1111', transform the Opcode to the corresponding t2LDR*pci counterpart. Ref: A8.6.86 LDRT, A8.6.65 LDRBT, A8.6.77 LDRHT, A8.6.81 LDRSBT, A8.6.85 LDRSHT llvm-svn: 101915	2010-04-20 17:28:50 +00:00
Chris Lattner	38c1a1a247	teach cellspu how to return i8 and i16 from calls, patch by Kalle Raiskila! llvm-svn: 101875	2010-04-20 05:36:09 +00:00
Chris Lattner	4025306a91	disable optimizations in this directory for MSVC9. This avoids an optimizer infinite loop on the file, PR6866. llvm-svn: 101854	2010-04-20 01:11:32 +00:00
Johnny Chen	2161e9f03b	Better error-handling for DisassembleThumb2DPModImm() with 2-reg operands where d==15 is considered illegal. Return false instead of assert(). llvm-svn: 101852	2010-04-20 01:01:57 +00:00
Eric Christopher	64831c6a4c	Remove the palignr intrinsics now that we lower them to vector shuffles, shifts and null vectors. Autoupgrade these to what we'd lower them to. Add a testcase to exercise this. llvm-svn: 101851	2010-04-20 00:59:54 +00:00
Johnny Chen	f3dd8b9487	More IT instruction error-handling improvements from fuzzing. llvm-svn: 101839	2010-04-20 00:15:41 +00:00
Johnny Chen	e62b680965	Better error handling of invalid IT mask '0000', instead of just asserting. llvm-svn: 101827	2010-04-19 23:02:58 +00:00
Dan Gohman	5ccd0b3686	Delete an unnecessary reference to SelectionDAGISel::BB. llvm-svn: 101824	2010-04-19 22:48:45 +00:00
Johnny Chen	777346e749	According to A8.6.16 B (Encoding T3) and A8.3 Conditional execution -- A8.3.1 Pseudocode details of conditional, Condition bits '111x' indicate the instruction is always executed. That is, '1111' is a legal condition field value, which is now mapped to ARMCC::AL. Also add a test case for condition field '1111'. llvm-svn: 101817	2010-04-19 21:19:52 +00:00
Evan Cheng	e19aa5cc52	More progress on promoting i16 operations to i32 for x86. Work in progress. llvm-svn: 101808	2010-04-19 19:29:22 +00:00
Johnny Chen	25df2a75bd	Better error-handling for DisassembleThumb2DPSoReg() where the 3-reg operand instructions should have Rd (Inst{11-8}) != 0b1111. Ref: A6.3 32-bit Thumb instruction encoding A6.3.11 Data-processing (shifted register) llvm-svn: 101788	2010-04-19 17:16:40 +00:00
Johnny Chen	cbe3e1a3df	ARM disassembler did not react to recent changes to the NEON instruction table. VLD1q_UPD and VST1q_UPD have the ${dst:dregpair} operand now. llvm-svn: 101784	2010-04-19 16:20:34 +00:00
Anton Korobeynikov	7b056bfed0	Add missed part of prev. commit llvm-svn: 101755	2010-04-18 20:41:42 +00:00
Anton Korobeynikov	7d62e33291	Make processor FUs unique for given itinerary. This extends the limit of 32 FU per CPU arch to 32 per itinerary allowing precise modelling of quite complex pipelines in the future. llvm-svn: 101754	2010-04-18 20:31:01 +00:00
Chris Lattner	0a8d91a816	fix PR6332, allowing an index of zero into a zero sized array even if the element of the array has no size. llvm-svn: 101662	2010-04-17 19:02:33 +00:00
Chris Lattner	b927073f2e	teach the x86 asm parser how to handle segment prefixes in memory operands. rdar://7874844 llvm-svn: 101661	2010-04-17 18:56:34 +00:00
Dan Gohman	1f0f2142cc	Fix -Wcast-qual warnings. llvm-svn: 101655	2010-04-17 17:42:52 +00:00
Chris Lattner	05f34394d9	remove a dead variable, PR6856 llvm-svn: 101648	2010-04-17 17:28:00 +00:00
Dan Gohman	53d4a08d2b	Add const qualifiers to TargetLoweringObjectFile usage. llvm-svn: 101640	2010-04-17 16:44:48 +00:00
Dan Gohman	88f7f6aeda	Use const_cast instead of a C-style cast to cast away const. llvm-svn: 101639	2010-04-17 16:43:55 +00:00
Dan Gohman	8422e57baa	Delete now-unnecessary const_casts. llvm-svn: 101637	2010-04-17 15:32:28 +00:00
Dan Gohman	20e094c711	Use cast instead of dyn_cast when assuming success. llvm-svn: 101636	2010-04-17 15:31:16 +00:00
Dan Gohman	21cea8ac2e	Use const qualifiers with TargetLowering. This eliminates several const_casts, and it reinforces the design of the Target classes being immutable. SelectionDAGISel::IsLegalToFold is now a static member function, because PIC16 uses it in an unconventional way. There is more room for API cleanup here. And PIC16's AsmPrinter no longer uses TargetLowering. llvm-svn: 101635	2010-04-17 15:26:15 +00:00
Dan Gohman	31ae586c74	Move per-function state out of TargetLowering subclasses and into MachineFunctionInfo subclasses. llvm-svn: 101634	2010-04-17 14:41:14 +00:00
Chandler Carruth	ca0a53ac52	Name these stub files consistently with the SPU and PPC targets' conventions. Also rename the classes appropriately. The CMake build already used these names. llvm-svn: 101631	2010-04-17 08:50:29 +00:00
Chris Lattner	7f5088e6de	a bunch of ssse3 instructions are misencoded to think they have an i8 field when they really do not. This fixes rdar://7840289 llvm-svn: 101629	2010-04-17 07:38:24 +00:00
Evan Cheng	f1bd5fcdb4	More work to allow dag combiner to promote 16-bit ops to 32-bit. llvm-svn: 101621	2010-04-17 06:13:15 +00:00
Bob Wilson	59b70eacad	Revise my previous change to ExpandBIT_CONVERT. I hadn't realized that this may be called when either the source or destination type is i64, and my change also hadn't fixed the most obvious problem -- assuming that i64 will only be bitconverted to f64, ignoring the various vector types. Radar 7873160. llvm-svn: 101615	2010-04-17 05:30:19 +00:00
Chris Lattner	cfc921cd2a	add a note llvm-svn: 101581	2010-04-16 23:52:30 +00:00
Eric Christopher	7258dcd77f	Revert 101465, it broke internal OpenGL testing. Probably the best way to know that all getOperand() calls have been handled is to replace that API instead of updating. llvm-svn: 101579	2010-04-16 23:37:20 +00:00
Johnny Chen	c275414575	Cast to (uint64_t) instead of relying on the "ul" suffix. llvm-svn: 101573	2010-04-16 23:30:28 +00:00
Dan Gohman	9becdddc49	Add skeleton target-specific SelectionDAGInfo files. llvm-svn: 101564	2010-04-16 23:04:22 +00:00
Johnny Chen	ed9bee150b	Fixed logic error. Should check Builder for validity before calling SetSession on it. llvm-svn: 101563	2010-04-16 23:02:25 +00:00
Johnny Chen	b90b6f1a35	Fixed a bug in DisassembleN1RegModImmFrm() where a break stmt was missing for a case. Also, the 0xFF hex literal involved in the shift for ESize64 should be suffixed "ul" to preserve the shift result. Implemented printHex*ImmOperand() by copying from ARMAsmPrinter.cpp and added a test case for DisassembleN1RegModImmFrm()/printHex64ImmOperand(). llvm-svn: 101557	2010-04-16 22:40:20 +00:00
Chris Lattner	d4758fc393	add a missing break back, patch by Nico Schmidt! llvm-svn: 101538	2010-04-16 21:15:15 +00:00
Dan Gohman	148c69a3f6	Eliminate an unnecessary SelectionDAG dependency in getOptimalMemOpType. llvm-svn: 101531	2010-04-16 20:11:05 +00:00
Johnny Chen	2b7aba10c2	In the same spirit of r101524, which removed the assert() from printAddrMode2OffsetOperand(), this patch removes the assert() from printAddrMode3OffsetOperand() and adds a test case. llvm-svn: 101529	2010-04-16 19:57:21 +00:00
Johnny Chen	807e1748fc	Multiclass LdStCop was using pre-UAL syntax LDC<c>L for the L fragment. Changed to the UAL syntax of LDCL<c>, instead. Add a test case for this change which also tests the removal of assert() from printAddrMode2OffsetOperand(). llvm-svn: 101527	2010-04-16 19:33:23 +00:00
Johnny Chen	88599a42bb	Remove the assert() from printAddrMode2OffsetOperand(). "#0 and #-0" are considered legal instructions. Refs: A8.6.51 LDC, LDC2 (immediate) -- page A8-107, A8.6.58 LDR (immediate, ARM) -- page A8-121, and A8.6.194 STR (immediate, ARM) -- page A8-395. llvm-svn: 101524	2010-04-16 19:10:52 +00:00
Gabor Greif	f375520f7b	reapply r101434 with a fix for self-hosting rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101465	2010-04-16 15:33:14 +00:00
Evan Cheng	af56facacd	Adding support for dag combiner to promote operations for profit. This requires target specific queries. For example, x86 should promote i16 to i32 when it does not impact load folding. x86 support is off by default. It can be enabled with -promote-16bit. Work in progress. llvm-svn: 101448	2010-04-16 06:14:10 +00:00
Evan Cheng	3da64f7672	Use getAL() rather than a major constant. llvm-svn: 101446	2010-04-16 05:46:06 +00:00
Gabor Greif	403e9694f9	back out r101423 and r101397, they break llvm-gcc self-host on darwin10 llvm-svn: 101434	2010-04-16 01:16:20 +00:00
Johnny Chen	acbc06c2a3	Fixed a bug in ARM disassembly where LDRSBT should have am3offset operand, not am2offset. Modified the instruction table entry and added a new test case. llvm-svn: 101415	2010-04-15 23:12:47 +00:00
Evan Cheng	f7f97b4bbd	Use default lowering of DYNAMIC_STACKALLOC. As far as I can tell, ARM isel is doing the right thing and codegen looks correct for both Thumb and Thumb2. llvm-svn: 101410	2010-04-15 22:20:34 +00:00
Gabor Greif	33ae80bff7	reapply r101364, which has been backed out in r101368 with a fix rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101397	2010-04-15 20:51:13 +00:00
Evan Cheng	1ba1428577	ARM SelectDYN_ALLOC should emit a copy from SP rather than referencing SP directly. In cases where there are two dyn_alloc in the same BB it would have caused the old SP value to be reused and badness ensues. rdar://7493908 llvm is generating poor code for dynamic alloca, I'll fix that later. llvm-svn: 101383	2010-04-15 18:42:28 +00:00
Johnny Chen	4230e35879	DEBUG() print out "Unknown format" msg. llvm-svn: 101382	2010-04-15 18:13:51 +00:00
Dan Gohman	48a189280e	ReuseFrameIndexVals is used in multiple files, so it can't be static. llvm-svn: 101379	2010-04-15 17:34:58 +00:00
Dan Gohman	2085719a98	EnablePPC64RS and EnablePPC32RS are used in multiple files, so they can't be static. llvm-svn: 101377	2010-04-15 17:20:57 +00:00
Dan Gohman	b29cda9b3c	Fix a bunch of namespace pollution. llvm-svn: 101376	2010-04-15 17:08:50 +00:00
Gabor Greif	9fd00c7d25	back out r101364, as it trips the linux nightlybot on some clang C++ tests llvm-svn: 101368	2010-04-15 12:46:56 +00:00
Gabor Greif	aafd209632	rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101364	2010-04-15 10:49:53 +00:00
Chris Lattner	4041ab6e00	Implement rdar://7860110 (also in target/readme.txt) narrowing a load/or/and/store sequence into a narrower store when it is safe. Daniel tells me that clang will start producing this sort of thing with bitfields, and this does trigger a few dozen times on 176.gcc produced by llvm-gcc even now. This compiles code like CodeGen/X86/2009-05-28-DAGCombineCrash.ll into: movl %eax, 36(%rdi) instead of: movl $4294967295, %eax ## imm = 0xFFFFFFFF andq 32(%rdi), %rax shlq $32, %rcx addq %rax, %rcx movq %rcx, 32(%rdi) and each of the testcases into a single store. Each of them used to compile into craziness like this: _test4: movl $65535, %eax ## imm = 0xFFFF andl (%rdi), %eax shll $16, %esi addl %eax, %esi movl %esi, (%rdi) ret llvm-svn: 101343	2010-04-15 04:48:01 +00:00
Dan Gohman	913c998703	Add more const qualifiers for LLVM IR pointers in CodeGen. llvm-svn: 101342	2010-04-15 04:33:49 +00:00
Anders Carlsson	47bccf7f28	Fix build. llvm-svn: 101335	2010-04-15 03:11:28 +00:00
Dan Gohman	bcaf681cde	Add const qualifiers to CodeGen's use of LLVM IR constructs. llvm-svn: 101334	2010-04-15 01:51:59 +00:00
Eric Christopher	eabc9623da	Allow lowering for palignr instructions for mmx sized vectors. Add patterns to handle the lowering. llvm-svn: 101331	2010-04-15 01:40:20 +00:00
Johnny Chen	0175ec1263	Wrap the error msgs in DEBUG() macro so that they won't appear in NDEBUG build. llvm-svn: 101329	2010-04-15 01:20:56 +00:00
Johnny Chen	82c50b11fa	Fixed another assert exposed by fuzzing. Now, the DisassembleVFPLdStMulFrm() function checks whether we have a valid submode for VLDM/VSTM (must be either "ia" or "db") before calling ARM_AM::getAM5Opc(AMSubMode, unsigned char). llvm-svn: 101306	2010-04-14 22:37:17 +00:00
Jim Grosbach	32bb362655	Add -arm-long-calls option to force calls to be indirect. This makes the kernel linker happier when dealing with kexts. Radar 7805069 llvm-svn: 101303	2010-04-14 22:28:31 +00:00
Johnny Chen	9aaaf4d5fa	For t2BFI disassembly, apply the same error checking as in r101205. Change the error msg to read "Encoding error: msb < lsb". llvm-svn: 101293	2010-04-14 22:04:45 +00:00
Johnny Chen	7637827064	Fixed another assert exposed by fuzzing. The utility function getRegisterEnum() was asserting because the (RegClass, RegNum) combination doesn't make sense from an encoding point of view. Since getRegisterEnum() is used all over the place, to change the code to check for encoding error after each call would not only bloat the code, but also make it less readable. An Err flag is added to the ARMBasicMCBuilder where a client can set a non-zero value to indicate some kind of error condition while building up the MCInst. ARMBasicMCBuilder::BuildIt() checks this flag and returns false if a non-zero value is detected. llvm-svn: 101290	2010-04-14 21:03:13 +00:00
Bob Wilson	c05b887c84	Don't custom lower bit converts to ARM VMOVDRRD or VMOVDRR when the operand does not have a legal type. The legalizer does not know how to handle those nodes. Radar 7854640. llvm-svn: 101282	2010-04-14 20:45:23 +00:00
Dan Gohman	c87b74d913	Delete unneeded arguments. llvm-svn: 101276	2010-04-14 20:17:22 +00:00
Dan Gohman	7deb447781	Factor out EH landing pad code into a separate function, and constify a bunch of stuff to support it. llvm-svn: 101273	2010-04-14 19:53:31 +00:00
Johnny Chen	48bbf4910e	Fixed another assert exposed by fuzzing. Now, when an encoding error occurs involving getBFCInvMask() where lsb <= msb does not hold true, the disassembler just returns false, instead of asserting, to indicate disassembly error. llvm-svn: 101205	2010-04-14 02:05:29 +00:00
Johnny Chen	82c3cadad6	Fixed an assert() exposed by fuzzing. Now, instead of assert when an invalid instruction encoding is encountered, we just return a NULL ARMBasicMCBuilder instance and the client just returns false to indicate disassembly error. llvm-svn: 101201	2010-04-14 01:17:37 +00:00
Douglas Gregor	9078f954bf	Unbreak CMake build by improving the EnhancedDisassembly makefile a bit (we're not trying to build a shared library yet) and generating the X86GenEDInfo.inc and ARMGenEDInfo.inc files as necessary. llvm-svn: 101188	2010-04-13 22:47:43 +00:00
Bob Wilson	699bdf7adf	Handle a v2f64 formal parameter that is split between registers and memory such that the entire second half is in memory. Radar 7855014. llvm-svn: 101181	2010-04-13 22:03:22 +00:00
Sean Callanan	814e69b171	Fixed a nasty layering violation in the edis source code. It used to #include the enhanced disassembly information for the targets it supported straight out of lib/Target/{X86,ARM,...} but now it uses a new interface provided by MCDisassembler, and (so far) implemented by X86 and ARM. Also removed hacky #define-controlled initialization of targets in edis. If clients only want edis to initialize a limited set of targets, they can set --enable-targets on the configure command line. llvm-svn: 101179	2010-04-13 21:21:57 +00:00
Johnny Chen	44d7d183fa	Changed getSOImmValRotate()'s hunt retry logic to ignore the low order 6 bits, instead of 7, because we are only looking for even rotate amount. llvm-svn: 101172	2010-04-13 20:35:16 +00:00
Evan Cheng	4ca4bc6f95	Re-apply 101075 and fix it properly. Just reuse the debug info of the branch instruction being optimized. There is no need to --I which can deref off start of the BB. llvm-svn: 101162	2010-04-13 18:50:27 +00:00
Eric Christopher	d67f66dc0c	Temporarily revert r101075, it's causing invalid iterator assertions in a nightly tester. llvm-svn: 101158	2010-04-13 18:37:58 +00:00
Dan Gohman	9d2d053e11	Eliminate MachineBasicBlock::const_livein_iterator and make MachineBasicBlock::livein_iterator a const_iterator, because clients shouldn't ever be using the iterator interface to mutate the livein set. llvm-svn: 101147	2010-04-13 16:57:55 +00:00
Dan Gohman	a1cf9fef70	Use MachineBasicBlock::isLiveIn. llvm-svn: 101144	2010-04-13 16:53:51 +00:00
Bob Wilson	af7674cbd4	Replace r101053 with a fix for getSOImmValRotate() so that it will correctly recognize all the valid rotated immediates. This fixes the disassembler issue and will also help codegen for some unusual constant values. llvm-svn: 101114	2010-04-13 02:11:48 +00:00
Chris Lattner	5b212a31a2	add llvm codegen support for -ffunction-sections and -fdata-sections, patch by Sylvere Teissier! llvm-svn: 101106	2010-04-13 00:36:43 +00:00
Evan Cheng	d0d8e3343a	Use .set expression for x86 pic jump table reference to reduce assembly relocation. rdar://7738756 llvm-svn: 101085	2010-04-12 23:07:17 +00:00
Bill Wendling	b02bbe416f	Micro-optimization: If we have this situation: jCC L1 jmp L2 L1: ... L2: ... We can get a small performance boost by emitting this instead: jnCC L2 L1: ... L2: ... This testcase shows an example of this: float func(float x, float y) { double product = (double)x * y; if (product == 0.0) return product; return product - 1.0; } llvm-svn: 101075	2010-04-12 22:19:57 +00:00
Johnny Chen	fc93503c59	Fixed a crasher in arm disassembler within ARMInstPrinter.cpp after calling ARM_AM::getSoImmVal(V) with a legitimate so_imm value: #245 rotate right by 2. Introduce ARM_AM::getSOImmValOneOrNoRotate(unsigned Arg) which is called from ARMInstPrinter.cpp's printSOImm() function, replacing ARM_AM::getSOImmVal(V). [12:44:43] johnny:/Volumes/data/llvm/git/trunk (local-trunk) $ gdb Debug/bin/llvm-mc GNU gdb 6.3.50-20050815 (Apple version gdb-1346) (Fri Sep 18 20:40:51 UTC 2009) Copyright 2004 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions. Type "show copying" to see the conditions. There is absolutely no warranty for GDB. Type "show warranty" for details. This GDB was configured as "x86_64-apple-darwin"...Reading symbols for shared libraries ... done (gdb) set args -triple=arm-apple-darwin9 -debug-only=arm-disassembler --disassemble (gdb) r Starting program: /Volumes/data/llvm/git/trunk/Debug/bin/llvm-mc -triple=arm-apple-darwin9 -debug-only=arm-disassembler --disassemble Reading symbols for shared libraries ++. done 0xf5 0x71 0xf0 0x53 Opcode=201 Name=MVNi Format=ARM_FORMAT_DPFRM(4) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 0: 1: 0: 1\| 0: 0: 1: 1\| 1: 1: 1: 1\| 0: 0: 0: 0\| 0: 1: 1: 1\| 0: 0: 0: 1\| 1: 1: 1: 1\| 0: 1: 0: 1\| ------------------------------------------------------------------------------------------------- mvnpls r7, Assertion failed: (V != -1 && "Not a valid so_imm value!"), function printSOImm, file ARMInstPrinter.cpp, line 229. Program received signal SIGABRT, Aborted. 
0x00007fff88c65886 in __kill () (gdb) bt #0 0x00007fff88c65886 in __kill () #1 0x00007fff88d05eae in abort () #2 0x00007fff88cf2ef0 in __assert_rtn () #3 0x000000010020e422 in printSOImm (O=@0x1010bdf80, V=-1, VerboseAsm=false, MAI=0x1020106d0) at ARMInstPrinter.cpp:229 #4 0x000000010020e5fe in llvm::ARMInstPrinter::printSOImmOperand (this=0x1020107e0, MI=0x7fff5fbfee70, OpNum=1, O=@0x1010bdf80) at ARMInstPrinter.cpp:254 #5 0x00000001001ffbc0 in llvm::ARMInstPrinter::printInstruction (this=0x1020107e0, MI=0x7fff5fbfee70, O=@0x1010bdf80) at ARMGenAsmWriter.inc:3236 #6 0x000000010020c27c in llvm::ARMInstPrinter::printInst (this=0x1020107e0, MI=0x7fff5fbfee70, O=@0x1010bdf80) at ARMInstPrinter.cpp:182 #7 0x000000010003cbff in PrintInsts (DisAsm=@0x10200f4e0, Printer=@0x1020107e0, Bytes=@0x7fff5fbff060, SM=@0x7fff5fbff078) at Disassembler.cpp:65 #8 0x000000010003c8b4 in llvm::Disassembler::disassemble (T=@0x1010c13c0, Triple=@0x1010b6798, Buffer=@0x102010690) at Disassembler.cpp:153 #9 0x000000010004095c in DisassembleInput (ProgName=0x7fff5fbff3f0 "/Volumes/data/llvm/git/trunk/Debug/bin/llvm-mc") at llvm-mc.cpp:347 #10 0x000000010003eefb in main (argc=4, argv=0x7fff5fbff298) at llvm-mc.cpp:374 (gdb) q The program is running. Exit anyway? (y or n) y [13:36:26] johnny:/Volumes/data/llvm/git/trunk (local-trunk) $ llvm-svn: 101053	2010-04-12 18:46:53 +00:00
Chris Lattner	4568ed7893	Implement support for varargs functions without any fixed parameters in the CBE by implicitly adding a fixed argument. This allows eliminating a work-around from DAE. Patch by Sylvere Teissier! llvm-svn: 100944	2010-04-10 19:12:44 +00:00
Bob Wilson	0106063556	Provide versions of the ARM eh_sjlj_setjmp instructions for non-VFP subtargets such that the non-VFP versions have no implicit defs of VFP registers. If any callee-saved VFP registers are marked as having been defined, the prologue/epilogue code will try to save and restore them. Radar 7770432. llvm-svn: 100892	2010-04-09 20:41:18 +00:00
Chris Lattner	1ef9826ff8	"On SPU, variables in the .bss section that are allocated with the .lcomm directive are not aligned on 16 byte boundaries. This causes misaligned loads, as the generated assembly assumes this "default" alignment. this patch disables .lcomm in favour of '.local .comm'" Patch by Kalle Raiskila! llvm-svn: 100875	2010-04-09 18:27:03 +00:00
Chris Lattner	80c345927e	delete a forwarding function. llvm-svn: 100815	2010-04-08 21:34:17 +00:00
Chris Lattner	5418dd5fda	move elf section uniquing to MCContext. Along the way merge XCore's section into MCSectionELF llvm-svn: 100812	2010-04-08 21:26:26 +00:00
Chris Lattner	433d40695b	remove the TargetLoweringObjectFileMachO::getMachoSection api and update clients to use MCContext instead. llvm-svn: 100808	2010-04-08 20:40:11 +00:00
Gabor Greif	c6a6d39289	use abstract interface in two more places llvm-svn: 100762	2010-04-08 13:50:42 +00:00
Gabor Greif	1c73242012	fix compile llvm-svn: 100760	2010-04-08 13:08:11 +00:00
Gabor Greif	11e7b32e4e	use abstract interface llvm-svn: 100758	2010-04-08 12:52:19 +00:00
Benjamin Kramer	a6769269f3	Use twines to simplify calls to report_fatal_error. For code size and readability. llvm-svn: 100756	2010-04-08 10:44:28 +00:00
Evan Cheng	ebe47c872f	Avoid using f64 to lower memcpy from constant string. It's cheaper to use i32 store of immediates. llvm-svn: 100751	2010-04-08 07:37:57 +00:00
Eric Christopher	c0f63cf7a9	mpsadbw is not commutative. Fixes PR3440. llvm-svn: 100736	2010-04-08 00:52:02 +00:00
Sean Callanan	03549ee5af	Added support for ARM disassembly to edis. I also added a rule to the ARM target's Makefile to build the ARM-specific instruction information table for the enhanced disassembler. I will add the test harness for all this stuff in a separate commit. llvm-svn: 100735	2010-04-08 00:48:21 +00:00
Ted Kremenek	4b1b4205ed	Update CMake build. llvm-svn: 100714	2010-04-07 23:05:23 +00:00
Chris Lattner	2104b8d36e	rename llvm::llvm_report_error -> llvm::report_fatal_error llvm-svn: 100709	2010-04-07 22:58:41 +00:00
Chris Lattner	5109d3e55d	add newlines at end of files. llvm-svn: 100706	2010-04-07 22:54:55 +00:00
Johnny Chen	85ce9f4f30	Missed this one line for the previous checkin to fix build warnings. llvm-svn: 100697	2010-04-07 22:21:03 +00:00
Johnny Chen	8b04b550df	Fixed warnings pointed out by clang. llvm-svn: 100696	2010-04-07 22:03:27 +00:00
Johnny Chen	80f8c3d533	Fixed warnings pointed out by clang. Next to work on is ARMDisassemblerCore.cpp. llvm-svn: 100695	2010-04-07 21:52:48 +00:00
Sean Callanan	1efe661b46	Fixed a bug where the disassembler would allow an immediate argument that had to be between 0 and 7 to have any value, firing an assert later in the AsmPrinter. Now, the disassembler rejects instructions with out-of-range values for that immediate. llvm-svn: 100694	2010-04-07 21:42:19 +00:00
Johnny Chen	3f253e2cb1	Fixed 3 warnings pointed out by clang. llvm-svn: 100693	2010-04-07 21:23:48 +00:00
Johnny Chen	4e2f8722c4	Re-enable ARM/Thumb disassembler and add a workaround for a memcpy() call in ARMDecoderEmitter.cpp, with FIXME comment. llvm-svn: 100690	2010-04-07 20:53:12 +00:00
Sean Callanan	643a55708f	Added an AsmLexer for the ARM target, which uses a simple mapping of register names to IDs to identify register tokens. llvm-svn: 100685	2010-04-07 20:29:34 +00:00
Dale Johannesen	60b289709e	Educate GetInstrSizeInBytes implementations that DBG_VALUE does not generate code. llvm-svn: 100681	2010-04-07 19:51:44 +00:00
Anton Korobeynikov	6e01726eae	Remove late ARM codegen optimization pass committed by accident. It is not ready for public yet. llvm-svn: 100673	2010-04-07 18:23:27 +00:00
Anton Korobeynikov	090323aee5	Split A8/A9 itins - they already were too big. llvm-svn: 100672	2010-04-07 18:22:11 +00:00
Anton Korobeynikov	32457d6c5e	Add some crude itin approximation for VFP load / stores on A9 llvm-svn: 100671	2010-04-07 18:22:03 +00:00
Anton Korobeynikov	d351104f19	Add some crude approximation for neon load/store instructions llvm-svn: 100670	2010-04-07 18:21:58 +00:00
Anton Korobeynikov	4acfad7c1b	Add some A8-based approximation for instructions with unknown cycle times llvm-svn: 100669	2010-04-07 18:21:52 +00:00
Anton Korobeynikov	4fb6a66c8f	Move NEON-VFP domain fixer upper, so post-RA scheduler would benefit from it. llvm-svn: 100668	2010-04-07 18:21:46 +00:00
Anton Korobeynikov	982f0ceaf8	Since tblgen bug was fixed (thanks Jakob!) we don't need InstrStage2 hack anymore. llvm-svn: 100667	2010-04-07 18:21:41 +00:00
Anton Korobeynikov	4050d69dcf	Fix A8 FP NEON MAC itins llvm-svn: 100666	2010-04-07 18:21:33 +00:00
Anton Korobeynikov	9ff2f8f7a5	A9 NEON FP itins llvm-svn: 100665	2010-04-07 18:21:27 +00:00
Anton Korobeynikov	03b317a286	Some permute goodness for A9 llvm-svn: 100664	2010-04-07 18:21:22 +00:00
Anton Korobeynikov	7ab31047a7	More shift itins for A9 llvm-svn: 100663	2010-04-07 18:21:16 +00:00
Anton Korobeynikov	4d36f8890f	More fixes for itins llvm-svn: 100662	2010-04-07 18:21:10 +00:00
Anton Korobeynikov	ceb54d5ab0	Fix invalid itins for 32-bit variants of VMLAL and friends llvm-svn: 100661	2010-04-07 18:21:04 +00:00
Anton Korobeynikov	f64c7ca5c3	Add MAC stuff for A9 llvm-svn: 100660	2010-04-07 18:20:58 +00:00
Anton Korobeynikov	2ef0a12fa1	Fix invalid NEON MAC itins on A8 llvm-svn: 100659	2010-04-07 18:20:53 +00:00
Anton Korobeynikov	5e208dc21b	Fix itins for VPAL llvm-svn: 100658	2010-04-07 18:20:47 +00:00
Anton Korobeynikov	a248becd6c	Fix itins for VABA llvm-svn: 100657	2010-04-07 18:20:42 +00:00
Anton Korobeynikov	a3e4989ad8	Correct VMVN itinerary: operand is read in the second cycle, not in the first. llvm-svn: 100656	2010-04-07 18:20:36 +00:00
Anton Korobeynikov	140a65ce0b	More A9 itineraries llvm-svn: 100655	2010-04-07 18:20:29 +00:00
Anton Korobeynikov	1a1af5a830	Correct itinerary class for VPADD llvm-svn: 100654	2010-04-07 18:20:24 +00:00
Anton Korobeynikov	4650fd5fc6	VP{MAX, MIN} are of IIC_VSUBi4D itin class as well. llvm-svn: 100653	2010-04-07 18:20:18 +00:00
Anton Korobeynikov	7d4fad5942	VHADD differs from VHSUB at least on A9 - the former reads both operands in the second cycle, while the latter reads second operand in first cycle. Introduce new itin classes to catch this behavior. Whether this is true for A8 as well is WIP. llvm-svn: 100652	2010-04-07 18:20:13 +00:00
Anton Korobeynikov	2cba05bbe1	Some easy NEON scheduling goodness for A9 llvm-svn: 100651	2010-04-07 18:20:07 +00:00
Anton Korobeynikov	2063705d91	Define new itin classes for ARM <-> VFP reg moves to distinguish from NEON ops. Define proper scheduling itinerary for them on A9. A8 TRM does not specify latency for them at all :( llvm-svn: 100650	2010-04-07 18:20:02 +00:00
Anton Korobeynikov	c1e7a6feac	FCONST{S,D} behaves the same way as FP unary instructions. This is true for both A8 and A9. llvm-svn: 100649	2010-04-07 18:19:56 +00:00
Anton Korobeynikov	dad973334b	Proper cycle times for locks, since wbck latency can be larger than fwd latency. llvm-svn: 100648	2010-04-07 18:19:51 +00:00
Anton Korobeynikov	4c1da0f82a	Add new itin classes for FP16 <-> FP32 conversions and make use of them for A9. llvm-svn: 100647	2010-04-07 18:19:46 +00:00
Anton Korobeynikov	baeb210be7	Make use of new reserved/required scheduling stuff: introduce VFP and NEON locks to model domain cross stalls precisely. llvm-svn: 100646	2010-04-07 18:19:40 +00:00
Anton Korobeynikov	15ccae2a46	Some bits of A9 scheduling: VFP llvm-svn: 100643	2010-04-07 18:19:18 +00:00
Anton Korobeynikov	10fc6e9650	Separate const from non-const stuff during merging llvm-svn: 100642	2010-04-07 18:19:13 +00:00
Anton Korobeynikov	0453de0133	Some initial version of global merger llvm-svn: 100641	2010-04-07 18:19:07 +00:00
Sanjiv Gupta	dd1c82141c	Fix memory leaks for external symbol name strings. llvm-svn: 100601	2010-04-07 03:36:01 +00:00
John McCall	6ac5cc973c	Clean up some signedness oddities in this code noticed by clang. llvm-svn: 100599	2010-04-07 01:49:15 +00:00
Dale Johannesen	5d7f0a0fdd	Move printing of target-independent DEBUG_VALUE comments into AsmPrinter. Target-dependent form is still generated by FastISel and still handled in X86 code. llvm-svn: 100596	2010-04-07 01:15:14 +00:00
John McCall	796583eec0	Fix a number of clang -Wsign-compare warnings that didn't have an obvious solution. The only reason these don't fire with gcc-4.2 is that gcc turns off part of -Wsign-compare in C++ on accident. llvm-svn: 100581	2010-04-06 23:35:53 +00:00
Dale Johannesen	b36c70913b	Revert 100573, it's causing some testsuite problems. llvm-svn: 100578	2010-04-06 22:45:26 +00:00
Dale Johannesen	85b35b6214	Move printing of DEBUG_VALUE comments to target-independent place. There is probably a more elegant way to do this. llvm-svn: 100573	2010-04-06 22:21:07 +00:00
Bob Wilson	5202269dc4	Expand SELECT and SELECT_CC for NEON vector types. Radar 7770501. llvm-svn: 100568	2010-04-06 22:02:24 +00:00
Jim Grosbach	4dac890600	Fix PR6696 and PR6663 When a frame pointer is not otherwise required, and dynamic stack alignment is necessary solely due to the spilling of a register with larger alignment requirements than the default stack alignment, the frame pointer can be both used as a general purpose register and a frame pointer. That goes poorly, for obvious reasons. This patch brings back a bit of old logic for identifying the use of such registers and conservatively reserves the frame pointer during register allocation in such cases. For now, implement for X86 only since it's 32-bit linux which is hitting this, and we want a targeted fix for 2.7. As a follow-on, this will be expanded to handle other targets, as theoretically the problem could arise elsewhere as well. llvm-svn: 100559	2010-04-06 20:26:37 +00:00
Jakob Stoklund Olesen	41051a0bfe	Don't try to collapse DomainValues onto an incompatible SSE domain. This fixes the Bullet regression on i386/nocona. llvm-svn: 100553	2010-04-06 19:48:56 +00:00
Jakob Stoklund Olesen	1a9b3f3484	Properly enable load clustering. Operand 2 on a load instruction does not have to be a RegisterSDNode for this to work. llvm-svn: 100497	2010-04-05 23:48:02 +00:00
Evan Cheng	23d16d5b86	Fix ADD32rr_alt instruction encoding bug. Patch by Marius Wachtler. llvm-svn: 100480	2010-04-05 22:21:09 +00:00
Eric Christopher	1290fa0f72	Remove FIXME. llvm-svn: 100466	2010-04-05 21:14:32 +00:00
Chris Lattner	6e39c4a097	don't use emitlabel in the arm asm printer yet, the order isn't well specified. ARM really needs to have its instprinter finished at some point. llvm-svn: 100439	2010-04-05 17:52:31 +00:00
Chris Lattner	6a0e89aefb	fix a couple problems I introduced handling symbols with spaces in them. Sym->getName() != OS << *Sym llvm-svn: 100434	2010-04-05 16:32:14 +00:00
Benjamin Kramer	0151d7b025	Disambiguate else. llvm-svn: 100423	2010-04-05 10:17:15 +00:00
Chris Lattner	305f2efb63	unthread MMI from FastISel llvm-svn: 100416	2010-04-05 06:05:26 +00:00
Chris Lattner	82ff9af068	remove the MMI pointer from MachineFrameInfo. llvm-svn: 100415	2010-04-05 05:57:52 +00:00
Chris Lattner	50b1bf63a7	simplify code. llvm-svn: 100412	2010-04-05 05:48:36 +00:00
Johnny Chen	dacfd2c6d4	Get rid of trailing whitespace. No functionality change. llvm-svn: 100404	2010-04-05 04:51:50 +00:00
Johnny Chen	dba13e7922	The disassembler impl. of MCDisassembler::getInstruction() was using the pattern uint32_t insn; MemoryObject.readBytes(Address, 4, (uint8_t*)&insn, NULL) to read 4 bytes of memory contents into a 32-bit uint variable. This leaves the interpretation of byte order up to the host machine and causes PPC test cases of arm-tests, neon-tests, and thumb-tests to fail. Fixed to use a byte array for reading the memory contents and shift the bytes into place for the 32-bit uint variable in the ARM case and 16-bit halfword in the Thumb case. llvm-svn: 100403	2010-04-05 04:46:17 +00:00
Chris Lattner	f0ef4e4019	implement EmitFunctionEntryLabel to emit the .cc_top directive, allowing xcore to use the normal runOnMachineFunction implementation. llvm-svn: 100402	2010-04-05 04:44:02 +00:00
Chris Lattner	a49ac8ace0	prune some #includes. llvm-svn: 100399	2010-04-05 04:04:10 +00:00
Jakob Stoklund Olesen	b93331f3be	Replace TSFlagsFields and TSFlagsShifts with a simpler TSFlags field. When a target instruction wants to set target-specific flags, it should simply set bits in the TSFlags bit vector defined in the Instruction TableGen class. This works well because TableGen resolves member references late: class I : Instruction { AddrMode AM = AddrModeNone; let TSFlags{3-0} = AM.Value; } let AM = AddrMode4 in def ADD : I; TSFlags gets the expected bits from AddrMode4 in this example. llvm-svn: 100384	2010-04-05 03:10:20 +00:00
Chris Lattner	7cfa70e9b3	fastisel doesn't need DwarfWriter, remove some tendrils. llvm-svn: 100381	2010-04-05 02:19:28 +00:00
Evan Cheng	0b8adb0652	Temporarily remove to disable building of ARM disassembler. llvm-svn: 100380	2010-04-05 01:57:50 +00:00
Evan Cheng	492a82e426	Re-apply 100265 but instead disable building of ARM disassembly for now. llvm-svn: 100379	2010-04-05 01:34:00 +00:00
Evan Cheng	876a5015af	Reverting 100265 to try to get buildbots green again. Lots of self-hosting buildbots started complaining since this commit. Also xfail ARM disassembly tests. llvm-svn: 100378	2010-04-05 01:04:27 +00:00
Chris Lattner	626cb66fdb	just have all targets create the DwarfWriter. llvm-svn: 100377	2010-04-05 00:42:55 +00:00
Chris Lattner	8b30492da3	simplify various getAnalysisUsage implementations. llvm-svn: 100376	2010-04-05 00:38:44 +00:00
Chris Lattner	324c86600d	eliminate the magic AbsoluteDebugSectionOffsets MAI hook, which is really a property of the section being referenced. Add a predicate to MCSection to replace it. Yay for reduction in magic. llvm-svn: 100367	2010-04-04 23:22:29 +00:00
Chris Lattner	8964b838e4	revert my patch, need to reconsider this and figure out what is really going on. llvm-svn: 100358	2010-04-04 21:49:31 +00:00
Chris Lattner	407f848835	fix pasto, this is the wrong setting for arm elf. llvm-svn: 100357	2010-04-04 21:37:20 +00:00
Jakob Stoklund Olesen	d03ac95d5d	Clean up SSEDomainFix pass. Restrict bit mask operations to the DomainValue class. Rename methods for clarity. llvm-svn: 100353	2010-04-04 21:27:26 +00:00
Chris Lattner	4af7c5a650	don't reset the default. llvm-svn: 100352	2010-04-04 21:06:50 +00:00
Chris Lattner	7bde8c07a7	clean up the asmprinter header and privatize some stuff. llvm-svn: 100342	2010-04-04 18:52:31 +00:00
Chris Lattner	21dc46e256	remove TargetMachine.h #include, also, TRI isn't used frequently enough to warrant caching in AsmPrinter, so remove it. llvm-svn: 100336	2010-04-04 18:06:11 +00:00
Jakob Stoklund Olesen	42caaa4f5b	Switch SSEDomainFix to SpecificBumpPtrAllocator. llvm-svn: 100332	2010-04-04 18:00:21 +00:00
Chris Lattner	d20699bc87	Momentous day: remove the "O" member from AsmPrinter. Now all "asm printering" happens through MCStreamer. This also Streamerizes PIC16 debug info, which escaped my attention. This removes a leak from LLVMTargetMachine of the 'legacy' output stream. llvm-svn: 100327	2010-04-04 08:18:47 +00:00
Chris Lattner	3cb9086c26	mc'ize the remaining uses of O. llvm-svn: 100322	2010-04-04 07:23:00 +00:00
Chris Lattner	4f63f7ee8e	finish eliminating uses of O. llvm-svn: 100321	2010-04-04 07:17:25 +00:00
Chris Lattner	794b2f1b37	mcize more of ppc stub printing. llvm-svn: 100320	2010-04-04 07:12:28 +00:00
Chris Lattner	5e5961864f	mcize a bunch more stuff, using EmitRawText for things we don't have mcstreamer support for yet. llvm-svn: 100319	2010-04-04 07:05:53 +00:00
Chris Lattner	3d86cd6710	convert the non-MCInstPrinter'ized EmitInstruction implementations to use EmitRawText instead of writing directly to "O". llvm-svn: 100318	2010-04-04 06:12:20 +00:00
Chris Lattner	83a093183c	streamerize the rest of PIC16 asm printer. llvm-svn: 100317	2010-04-04 05:53:03 +00:00
Chris Lattner	d479317d65	streamerize printing of dbg_value, the x86 backend is now fully streamerized for everything. llvm-svn: 100316	2010-04-04 05:40:34 +00:00
Chris Lattner	bf43d4b6e9	split DEBUG_VALUE printing stuff out to its own method. llvm-svn: 100315	2010-04-04 05:38:19 +00:00
Chris Lattner	9b13639f45	mc'ize elf stub printing, convert cygwin stuff to EmitRawText, which will abort in .o file writing mode. llvm-svn: 100314	2010-04-04 05:35:04 +00:00
Chris Lattner	3bb09768cb	fix PrintAsmOperand and PrintAsmMemoryOperand to pass down raw_ostream to print to. llvm-svn: 100313	2010-04-04 05:29:35 +00:00
Chris Lattner	787253819a	use predicates in DBG_VALUE printing code to simplify it. llvm-svn: 100312	2010-04-04 05:21:31 +00:00
Chris Lattner	562e02e4e1	remove more implicit uses of "O". llvm-svn: 100311	2010-04-04 05:19:20 +00:00
Chris Lattner	7012916275	fix an ugly wart in the MCInstPrinter api where the raw_ostream to print an instruction to had to be specified at MCInstPrinter construction time instead of being able to pick at each call to printInstruction. llvm-svn: 100307	2010-04-04 05:04:31 +00:00
Chris Lattner	76c564b1bb	change a ton of code to not implicitly use the "O" raw_ostream member of AsmPrinter. Instead, pass it in explicitly. llvm-svn: 100306	2010-04-04 04:47:45 +00:00
Mon P Wang	c576ee9040	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Chris Lattner	f33c7fcc28	asmstreamerize the .size directive for function bodies, force clients of printOffset to pass in a stream to print to. llvm-svn: 100296	2010-04-03 22:28:33 +00:00
Johnny Chen	fc8c3b7547	Get rid of the middleman (ARMAlgorithm), which causes more trouble than the abstraction it brings. And also get rid of the atexit() handler, it does not belong in the lib directory. :-) llvm-svn: 100265	2010-04-03 04:10:56 +00:00
Johnny Chen	884e66a545	Fix comment. llvm-svn: 100259	2010-04-03 01:17:30 +00:00
Johnny Chen	58e83eb50d	Register ARMAlgorithm::DoCleanup() to be called on exit to free the memory occupied by the cached ARMAlgorithm objects. llvm-svn: 100258	2010-04-03 01:09:47 +00:00
Eric Christopher	000e502eb1	Rewrite aesimc handling. It only takes a single input and has a single dest. llvm-svn: 100252	2010-04-02 23:48:33 +00:00
Johnny Chen	a0d74064fe	Fix another build warning. llvm-svn: 100251	2010-04-02 23:43:38 +00:00
Johnny Chen	7b999ea7b7	Second try of initial ARM/Thumb disassembler check-in. It consists of a tablegen backend (ARMDecoderEmitter) which emits the decoder functions for ARM and Thumb, and the disassembler core which invokes the decoder function and builds up the MCInst based on the decoded Opcode. Reviewed by Chris Lattner and Bob Wilson. llvm-svn: 100233	2010-04-02 22:27:38 +00:00
Sean Callanan	7ad0ad0b9a	Added support for reporting operand token ranges to the ARM AsmParser. llvm-svn: 100232	2010-04-02 22:27:05 +00:00
Eric Christopher	2ef63183a5	Separate out the AES-NI instructions from the SSE4.2 instructions. Add a new subtarget option for AES and check for the support. Add "westmere" line of processors and add AES-NI support to the core i7. Add a couple of TODOs for information I couldn't verify. llvm-svn: 100231	2010-04-02 21:54:27 +00:00
Sean Callanan	010b373cf3	Fixes to the X86 disassembler. The disassembler will now return an error status in all failure cases, printing messages to debugs() only when debugging is enabled. llvm-svn: 100229	2010-04-02 21:23:51 +00:00
Chris Lattner	593916d732	rename NewDebugLoc -> DebugLoc, prune #includes in DebugLoc.h. This keeps around temporary typedef for clang/llvm-gcc so the build won't break when I commit this :) llvm-svn: 100218	2010-04-02 20:21:22 +00:00
Chris Lattner	6f306d7d30	use DebugLoc default ctor instead of DebugLoc::getUnknownLoc() llvm-svn: 100214	2010-04-02 20:16:16 +00:00
Chris Lattner	915c5f9862	Switch the code generator (except the JIT) onto the new DebugLoc representation. This eliminates the 'DILocation' MDNodes for file/line/col tuples from -O0 -g codegen. This removes the old DebugLoc class, making it a typedef for DebugLoc, I'll rename NewDebugLoc next. I didn't update the JIT to use the new apis, so it will continue to work, but be as slow as before. Someone should eventually do this or, better yet, rip out the JIT debug info stuff and build the JIT on top of MC. llvm-svn: 100209	2010-04-02 19:42:39 +00:00
Evan Cheng	61399375a2	Correctly lower memset / memcpy of undef. It should be a nop. PR6767. llvm-svn: 100208	2010-04-02 19:36:14 +00:00
Mon P Wang	999c1b927b	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a972ab8564	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Eric Christopher	06a1639b98	Remove FIXME - if there's a better way to do this it isn't here. llvm-svn: 100176	2010-04-02 04:32:37 +00:00
Dan Gohman	4bd755419f	Revert the recent alignment changes. They're broken for -Os because, in particular, they end up aligning strings at 16-byte boundaries, and there's no way for GlobalOpt to check OptForSize. llvm-svn: 100172	2010-04-02 03:04:37 +00:00
Dale Johannesen	4244d12769	Teach AnalyzeBranch, RemoveBranch and the branch folder to be tolerant of debug info following the branch(es) at the end of a block. llvm-svn: 100168	2010-04-02 01:38:09 +00:00
Chandler Carruth	8d6d0d4c58	Disambiguate conditional expression for newer GCCs. llvm-svn: 100167	2010-04-02 01:31:24 +00:00
Dan Gohman	0e3218f6af	Change variables which are exactly 16 bytes to be 16-byte-aligned too. This fixes test/Transforms/GlobalOpt/gv-align.ll. llvm-svn: 100161	2010-04-02 00:46:07 +00:00
Eric Christopher	5342ddaadf	Revert r100143. llvm-svn: 100146	2010-04-01 22:54:42 +00:00
Evan Cheng	f997c31598	In 64-bit mode, use i64 to lower memcpy / memset instead of f64. llvm-svn: 100137	2010-04-01 20:27:45 +00:00
Evan Cheng	d9929f03cf	Add comments about DstAlign and SrcAlign. llvm-svn: 100132	2010-04-01 20:10:42 +00:00
Evan Cheng	4c014c892a	- Avoid using floating point stores to implement memset unless the value is zero. - Do not try to infer GV alignment unless its type is sized. It's not possible to infer alignment if it has opaque type. llvm-svn: 100118	2010-04-01 18:19:11 +00:00
Evan Cheng	43cd9e3845	Fix sdisel memcpy, memset, memmove lowering: 1. Makes it possible to lower with floating point loads and stores. 2. Avoid unaligned loads / stores unless it's fast. 3. Fix some memcpy lowering logic bug related to when to optimize a load from constant string into a constant. 4. Adjust x86 memcpy lowering threshold to make it more sane. 5. Fix x86 target hook so it uses vector and floating point memory ops more effectively. rdar://7774704 llvm-svn: 100090	2010-04-01 06:04:33 +00:00
Evan Cheng	738b0f9ec7	Nehalem unaligned memory access is fast. llvm-svn: 100089	2010-04-01 05:58:17 +00:00
Eric Christopher	9002ac5d93	Add aeskeygenassist intrinsic and rename all of the aes intrinsics to aes instead of sse4.2. Add a brief todo for a subtarget flag and rework the aeskeygenassist instruction to more closely match the docs. llvm-svn: 100078	2010-04-01 03:05:45 +00:00
Jim Grosbach	7c90d22f4c	vml[as] are slow on 1136jf-s also. llvm-svn: 100066	2010-04-01 00:13:43 +00:00
Chris Lattner	503a0ef6f4	reduce indentation, minor cleanups. llvm-svn: 100042	2010-03-31 20:32:51 +00:00
Jakob Stoklund Olesen	58ca0a649c	Use spaces, not tabs llvm-svn: 100037	2010-03-31 20:05:12 +00:00
Bill Wendling	d749aefbd5	Comment the changes for r98218 and friends inside the source code. llvm-svn: 100033	2010-03-31 18:48:58 +00:00
Bill Wendling	866f5764a7	Comment the changes for r98218 and friends inside the source code. llvm-svn: 100031	2010-03-31 18:47:10 +00:00
Jakob Stoklund Olesen	4cd5866f8e	Fix PR6750. Don't try to merge a DomainValue with itself. llvm-svn: 100016	2010-03-31 17:13:16 +00:00
Jakob Stoklund Olesen	9986ba954c	Replace V_SET0 with variants for each SSE execution domain. llvm-svn: 99975	2010-03-31 00:40:13 +00:00
Jakob Stoklund Olesen	710c6892be	Fix typo. Thank you, valgrind. llvm-svn: 99974	2010-03-31 00:40:08 +00:00
Jakob Stoklund Olesen	6f6ebb663c	Enable -sse-domain-fix by default. Now with tests! llvm-svn: 99954	2010-03-30 22:47:00 +00:00
Jakob Stoklund Olesen	3493398f13	V_SETALLONES is an integer instruction. Since it is just a pxor in disguise, we should probably expand it to a full polymorphic triple. llvm-svn: 99953	2010-03-30 22:46:55 +00:00
Jakob Stoklund Olesen	dbff4e8103	Renumber SSE execution domains for better code size. SSEDomainFix will collapse to the domain with the lower number when it has a choice. The SSEPackedSingle domain often has smaller instructions, so prefer that. llvm-svn: 99952	2010-03-30 22:46:53 +00:00
Bob Wilson	6f7fd28824	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Jakob Stoklund Olesen	cf35648ebe	Revert "Enable -sse-domain-fix by default. What could possibly go wrong?" Not running 'make check-all' before committing is a bad idea. llvm-svn: 99933	2010-03-30 21:36:32 +00:00
Jakob Stoklund Olesen	a654df84e6	Enable -sse-domain-fix by default. What could possibly go wrong? llvm-svn: 99931	2010-03-30 21:09:31 +00:00
Mon P Wang	7460571381	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) An update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Jakob Stoklund Olesen	3b9af40938	Add cross-block inference to SSEDomainFix. llvm-svn: 99916	2010-03-30 20:04:01 +00:00
Eric Christopher	6ad8167714	Remove the pmulld intrinsic and autoupdate it as a vector multiply. Rewrite the pmulld patterns, and make sure that they fold in loads of arguments into the instruction. llvm-svn: 99910	2010-03-30 18:49:01 +00:00
Chris Lattner	9897043928	Rip out the 'is temporary' nonsense from the MCContext interface to create symbols. It is extremely error prone and a source of a lot of the remaining integrated assembler bugs on x86-64. This fixes rdar://7807601. llvm-svn: 99902	2010-03-30 18:10:53 +00:00
Benjamin Kramer	0c1dcb083e	XFAIL some PIC16 tests when running under valgrind-leaks. I don't expect these to be fixed any time soon. llvm-svn: 99888	2010-03-30 14:34:13 +00:00
Benjamin Kramer	7cc264bf19	PIC16: Plug a leak in PIC16Section by allocating name & address strings in the MCContext. There is still one leak left in PIC16Section (the Items vector). llvm-svn: 99887	2010-03-30 13:28:42 +00:00
Eric Christopher	c1ddaaf5b1	Add FIXME for operand promotion. llvm-svn: 99859	2010-03-30 01:04:59 +00:00
Jakob Stoklund Olesen	486aa2eadc	Be gentle to MSVC. C++ is hard, after all. llvm-svn: 99855	2010-03-30 00:09:32 +00:00
Jakob Stoklund Olesen	b551aa4da5	Basic implementation of SSEDomainFix pass. Cross-block inference is primitive and wrong, but the pass is working otherwise. llvm-svn: 99848	2010-03-29 23:24:21 +00:00
Benjamin Kramer	2788f797ca	Make isInt?? and isUint?? template specializations of the generic versions. This makes calls a little bit more consistent and allows easy removal of the specializations in the future. Convert all callers to the templated functions. llvm-svn: 99838	2010-03-29 21:13:41 +00:00
Eric Christopher	9bdadf0d99	We'll never match these as instructions, just as intrinsics so remove the SDNodes. llvm-svn: 99835	2010-03-29 20:41:51 +00:00
Benjamin Kramer	f633ba8213	Remove a bunch of integer width predicate functions in favor of MathExtras. Most of these were unused, some of them were wrong and unused (isS16Constant<short>, isS10Constant<short>). llvm-svn: 99827	2010-03-29 19:07:58 +00:00
Chris Lattner	f60c556b91	From Kalle Raiskila: "the bigstack patch for SPU, with testcase. It is essentially the patch committed as 97091, and reverted as 97099, but with the following additions: -in vararg handling, registers are marked to be live, to not confuse the register scavenger -function prologue and epilogue are not emitted, if the stack size is 16. 16 means it is empty - there is only the register scavenger emergency spill slot, which is not used as there is no stack." llvm-svn: 99819	2010-03-29 17:38:47 +00:00
Chris Lattner	9bc1ed9962	add a note. llvm-svn: 99815	2010-03-29 17:02:02 +00:00
Johnny Chen	c86256fa5d	Add NVTBLFrm to represent A8.6.406 VTBL, VTBX Vector Table Lookup Instructions. These instructions use byte index in a control vector (M:Vm) to lookup byte values in a table and generate a new vector (D:Vd). The table is specified via a list of vectors, which can be: {Dn} {Dn D<n+1>} {Dn D<n+1> D<n+2>} {Dn D<n+1> D<n+2> D<n+3>} llvm-svn: 99789	2010-03-29 01:14:22 +00:00
Chris Lattner	11f85ccf7d	zap an extra line that Eli noticed! llvm-svn: 99770	2010-03-28 18:52:28 +00:00
Chris Lattner	b7c48433df	fix a type contradiction: XCoreISD::RETSP has one argument, not zero. llvm-svn: 99760	2010-03-28 08:47:39 +00:00
Chris Lattner	505849d277	remove a pattern with no testcase that doesn't appear to be matchable: it seems like it would always constant fold. llvm-svn: 99758	2010-03-28 08:40:48 +00:00
Chris Lattner	3dad5fbeb9	fix integer negates to use the proper type for the zero vectors, this also depends on the new "bitconvert dropping" behavior just added to tblgen. llvm-svn: 99757	2010-03-28 08:39:10 +00:00
Chris Lattner	240154e633	fix a typo, bitconvert from node to itself isn't valid. llvm-svn: 99755	2010-03-28 08:36:45 +00:00
Chris Lattner	6c223ee0e9	fix vnot matching to explicitly specify the type of the input to be v8i8 or v16i8, which buildvectors get canonicalized to. This allows the patterns that were previously using a bare 'vnot' to match, before they couldn't. llvm-svn: 99754	2010-03-28 08:08:07 +00:00
Chris Lattner	1c85e3476d	fix up vnot matching, eliminating a dead pattern, correcting a couple of patterns that would never match because of bitcast, and eliminating use of vnot_conv. llvm-svn: 99753	2010-03-28 08:00:23 +00:00
Chris Lattner	e549d9b1f2	stop using vnot_conv llvm-svn: 99750	2010-03-28 07:48:17 +00:00
Chris Lattner	227a83d6ed	revert r99743, this is saying that the repmovs instructions have an input of other type, which is the VT. llvm-svn: 99749	2010-03-28 07:38:39 +00:00
Chris Lattner	be980f2df7	remove a bunch of dead patterns. llvm-svn: 99748	2010-03-28 07:38:00 +00:00
Chris Lattner	cba70c8162	claiming to return other is pointless. llvm-svn: 99743	2010-03-28 05:57:36 +00:00
Chris Lattner	a520b166dc	Improve systemz to model cmp and ucmp nodes as returning their flags correctly. llvm-svn: 99738	2010-03-28 05:21:52 +00:00
Chris Lattner	e83591c616	the FPCmp node returns an i32. llvm-svn: 99737	2010-03-28 05:12:57 +00:00
Chris Lattner	ec5fe65838	fix some modelling problems exposed by a patch I'm working on. bsr/bsf/ptest nodes all have an EFLAGS result when made by isel lowering. llvm-svn: 99736	2010-03-28 05:07:17 +00:00
Bob Wilson	0f8a02830a	Fix indentation. llvm-svn: 99705	2010-03-27 04:01:23 +00:00
Bob Wilson	cf603fb1c5	Add a format argument to the N3V and N3VX classes, removing the N3Vf class. llvm-svn: 99704	2010-03-27 03:56:52 +00:00
Chris Lattner	07943af506	eliminate the last of the parallel's! llvm-svn: 99700	2010-03-27 02:47:14 +00:00
Johnny Chen	6094cdab9f	Add NVMulSLFrm to represent "3-register multiply with scalar" operations and set it as the format for the appropriate N3VSL<> classes. These instructions require special handling of the M:Vm field which encodes the restricted Dm and the lane index within Dm. Examples are A8.6.325 VMLA, VMLAL, VMLS, VMLSL (by scalar): vmlal.s32 q3, d2, d10[0] llvm-svn: 99690	2010-03-27 01:03:13 +00:00
Chris Lattner	c5e20d9031	eliminate almost all the rest of the x86-32 parallels. llvm-svn: 99686	2010-03-27 00:45:04 +00:00
Jim Grosbach	44313db557	Thumb2 storeFrom/LoadToStackSlot() need to handle tGPR regs directly, not pass through to the generic version. The generic functions use STR/LDR, but T2 needs the t2STR/t2LDR instead so we get the addressing mode correct. llvm-svn: 99678	2010-03-27 00:09:12 +00:00
Johnny Chen	93acfbf441	Remove the duplicate multiclass N3VSh_QHSD and use N3VInt_QHSD which is modified to now take a format argument. N3VDInt<> and N3VQInt<> are modified to take a format argument as well. llvm-svn: 99676	2010-03-26 23:49:07 +00:00
Johnny Chen	0b57de3c4c	Add NVExtFrm to represent NEON Vector Extract Instructions, that uses Inst{11-8} to encode the byte location of the extracted result in the concatenation of the operands, from the least significant end. Modify VEXTd and VEXTq classes to use the format. llvm-svn: 99659	2010-03-26 22:28:56 +00:00
Johnny Chen	2cf04957c2	Add N3RegVShFrm to represent 3-Register Vector Shift Instructions, which do not follow the N3RegFrm's operand order of D:Vd N:Vn M:Vm. The operand order of N3RegVShFrm is D:Vd M:Vm N:Vn (notice that M:Vm is the first src operand). Add a parent class N3Vf which requires passing a Format argument and which the N3V class is modified to inherit from. N3V class represents the "normal" 3-Register NEON Instructions with N3RegFrm. Also add a multiclass N3VSh_QHSD to represent clusters of NEON 3-Register Shift Instructions and replace 8 invocations with it. llvm-svn: 99655	2010-03-26 21:26:28 +00:00
Jim Grosbach	bf59859b2b	vldm/vstm can only do up to 16 double-word registers at a time. Radar 7797856 llvm-svn: 99630	2010-03-26 18:41:09 +00:00
Johnny Chen	8fc94d6362	Add N3RegFrm to represent "NEON 3 vector register format" instructions. Examples are VABA (Vector Absolute Difference and Accumulate), VABAL (Vector Absolute Difference and Accumulate Long), and VABD (Vector Absolute Difference). llvm-svn: 99628	2010-03-26 18:32:20 +00:00
Evan Cheng	3365fb1412	Do not sibcall if stack needs to be dynamically aligned. llvm-svn: 99620	2010-03-26 16:26:03 +00:00
Evan Cheng	00a620c61e	Allow trivial sibcall of vararg callee when no arguments are being passed. llvm-svn: 99598	2010-03-26 02:13:13 +00:00
Johnny Chen	5d4e917d9f	Add N2RegVShLFrm and N2RegVShRFrm formats so that the disassembler can easily dispatch to the appropriate routines to handle the different interpretations of the shift amount encoded in the imm6 field. The Vd, Vm fields are interpreted the same between the two, though. See, for example, A8.6.367 VQSHL, VQSHLU (immediate) for N2RegVShLFrm format and A8.6.368 VQSHRN, VQSHRUN for N2RegVShRFrm format. llvm-svn: 99590	2010-03-26 01:07:59 +00:00
Jim Grosbach	71fcb4fedd	switch the flag for using NEON for SP floating point to a subtarget 'feature'. Re-commit. This time complete with testsuite updates. llvm-svn: 99570	2010-03-25 23:47:34 +00:00
Jim Grosbach	42bb89c7d9	need to fix 'make check' tests first. revert for a moment. llvm-svn: 99569	2010-03-25 23:34:05 +00:00
Jim Grosbach	7fce4e39aa	switch the flag for using NEON for SP floating point to a subtarget 'feature' llvm-svn: 99568	2010-03-25 23:32:19 +00:00
Johnny Chen	a3617ec88a	Removed instruction class NI from ARMInstrFormats.td. It doesn't seem to be used anywhere. llvm-svn: 99566	2010-03-25 23:11:56 +00:00
Jim Grosbach	a43386ba8f	switch the use-vml[as] instructions flag to a subtarget 'feature' llvm-svn: 99565	2010-03-25 23:11:16 +00:00
Johnny Chen	91d2774416	Add NVDupLnFrm and change NVDupLane class to use that format. llvm-svn: 99557	2010-03-25 21:49:12 +00:00
Jim Grosbach	4b3b2ef65c	ARM cortex-a8 doesn't do vmla/vmls well. disable them by default for that cpu llvm-svn: 99549	2010-03-25 20:48:50 +00:00
Johnny Chen	d82f9002e4	Add NVCVTFrm (NEON Convert with fractional bits immediate) and modify N2VImm to expect a Format arg. N2VCvtD/N2VCvtQ are modified to use the NVCVTFrm format. llvm-svn: 99548	2010-03-25 20:39:04 +00:00
Daniel Dunbar	d919276bc0	Fix -Asserts warning, again. llvm-svn: 99542	2010-03-25 19:35:53 +00:00
Jakob Stoklund Olesen	3758ff917e	Tag SSE2 integer instructions as SSEPackedInt. llvm-svn: 99540	2010-03-25 18:52:04 +00:00
Jakob Stoklund Olesen	f8d7eda663	Teach TableGen to understand X.Y notation in the TSFlagsFields strings. Remove much horribleness from X86InstrFormats as a result. Similar simplifications are probably possible for other targets. llvm-svn: 99539	2010-03-25 18:52:01 +00:00
Jakob Stoklund Olesen	49e121d5e4	Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register in a different domain than where it was defined. Some instructions have equivalents for different domains, like por/orps/orpd. The SSEDomainFix pass tries to minimize the number of domain crossings by changing between equivalent opcodes where possible. This is a work in progress, in particular the pass doesn't do anything yet. SSE instructions are tagged with their execution domain in TableGen using the last two bits of TSFlags. Note that not all instructions are tagged correctly. Life just isn't that simple. The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline issue handled by NEONMoveFixPass. This pass may become target independent to handle both. llvm-svn: 99524	2010-03-25 17:25:00 +00:00
Johnny Chen	45ab3f3ccf	Added a new instruction class NVDupLane to be inherited by VDUPLND and VDUPLNQ, instead of the current N2V. Format of NVDupLane instances are set to NEONFrm currently. llvm-svn: 99518	2010-03-25 17:01:27 +00:00
Bob Wilson	e543e7fcb1	Reapply Kevin's change 94440, now that Chris has fixed the limitation on opcode values fitting in one byte (svn r99494). llvm-svn: 99514	2010-03-25 16:36:14 +00:00
Chris Lattner	23bf99a97c	eliminate a bunch more parallels now that scheduling handles dead implicit results more aggressively. More to come, I think this is now just a data entry problem. llvm-svn: 99486	2010-03-25 05:44:01 +00:00
Evan Cheng	b07a29ecd4	Disable folding loads into tail call in 32-bit PIC mode. It can introduce illegal code like this: addl $12, %esp popl %esi popl %edi popl %ebx popl %ebp jmpl __Block_deallocator-L1$pb(%esi) # TAILCALL The problem is the global base register is assigned GR32 register class. TCRETURNmi needs the registers making up the address mode to have the GR32_TC register class. The proper* fix is for X86DAGToDAGISel::getGlobalBaseReg() to return a copy from the global base register of the machine function rather than returning the register itself. But that has the potential of causing it to be coalesced to a more restrictive register class: GR32_TC. It can introduce additional copies and spills. For something as important as the PIC base, it's not worth it especially since this is not an issue on 64-bit. llvm-svn: 99455	2010-03-25 00:10:31 +00:00
Bob Wilson	5b2da69f6d	Speculatively revert this to see if it fixes buildbot failures. --- Reverse-merging r99440 into '.': U test/MC/AsmParser/X86/x86_32-bit_cat.s U test/MC/AsmParser/X86/x86_32-encoding.s U include/llvm/IntrinsicsX86.td U include/llvm/CodeGen/SelectionDAGNodes.h U lib/Target/X86/X86InstrSSE.td U lib/Target/X86/X86ISelLowering.h llvm-svn: 99450	2010-03-24 23:26:29 +00:00
Kevin Enderby	f5584a7397	Added the Advanced Encryption Standard (AES) Instructions. llvm-svn: 99440	2010-03-24 22:33:33 +00:00
Jim Grosbach	34de7768bf	Make the use of the vmla and vmls VFP instructions controllable via cmd line. Preliminary testing shows significant performance wins by not using these instructions. llvm-svn: 99436	2010-03-24 22:31:46 +00:00
Kevin Enderby	b96eb68497	Fixed the SS42AI template for the SSE 4.2 instructions with TA prefix so it does not get an "Unknown immediate size" assert failure when used. All instructions of this form have an 8-bit immediate. Also added a test case of an example instruction that is of this form. llvm-svn: 99435	2010-03-24 22:28:42 +00:00
Nate Begeman	2ceb288416	Per chris's request, add some comments. llvm-svn: 99434	2010-03-24 22:19:06 +00:00
Johnny Chen	bff23ca690	Trivial formatting change. llvm-svn: 99428	2010-03-24 21:25:07 +00:00
Nate Begeman	583e05d8ce	BUILD_VECTOR was missing out on some prime opportunities to use SSE 4.1 inserts. llvm-svn: 99423	2010-03-24 20:49:50 +00:00
Johnny Chen	e99953ce9c	Reverted r99326 which added NVdVmVCVTFrm, and later renamed to NVCVTFrm. NVCVTFrm will later be used to describe "vcvt with fractional bits". llvm-svn: 99415	2010-03-24 19:47:14 +00:00
Johnny Chen	da44d5977f	Reverted r99376. The disassembler will deal with the 2-reg format of these two N3VX instructions using special case code. llvm-svn: 99409	2010-03-24 18:46:34 +00:00
Jim Grosbach	07607382d8	tweak the arm if conversion heuristic llvm-svn: 99402	2010-03-24 16:15:14 +00:00
Johnny Chen	aa9b1c81a7	Mark VMOVDneon and VMOVQ as having the N2RegFrm form to help the disassembler. llvm-svn: 99376	2010-03-24 01:29:25 +00:00
Chris Lattner	9096bcdeda	Switch INC8r to defining its pattern in terms of X86inc_flag and defining the add pattern with Pat<>, eliminating a use of parallel. llvm-svn: 99375	2010-03-24 01:02:12 +00:00
Johnny Chen	9b1f60adec	Renamed NVdVmImmFrm and NVdVmVCVTFrm to the more proper N2RegFrm and NVCVTFrm, respectively, and add some more comment. llvm-svn: 99373	2010-03-24 00:57:50 +00:00
Chris Lattner	f9c8bec6c5	switch SDTBinaryArithWithFlags to be a multiple-result node as well. llvm-svn: 99370	2010-03-24 00:49:29 +00:00
Chris Lattner	db1ac3cf3e	Switch SDTUnaryArithWithFlags to being modeled as a two-result ISD node. The only change in the generated isel code are comments like: < // Src: (X86dec_flag:i16 GR16:i16:$src) --- > // Src: (X86dec_flag:i16:i32 GR16:i16:$src) because now it knows that X86dec_flag returns both an i16 (for the result) and an i32 (for EFLAGS) in this case. Wewt. llvm-svn: 99369	2010-03-24 00:47:47 +00:00
Chris Lattner	cca83a7aa4	remove 64-bit or_is_add parallels. llvm-svn: 99360	2010-03-24 00:16:52 +00:00
Chris Lattner	f5e5004327	remove useless or_is_add parallel's. llvm-svn: 99359	2010-03-24 00:15:23 +00:00
Chris Lattner	237d38e748	reduce nesting. llvm-svn: 99358	2010-03-24 00:12:57 +00:00
Jim Grosbach	e0874fa02f	try being more permissive for if-conversion on ARM V7. see what the nightly test run performance numbers say as to whether it helps. llvm-svn: 99355	2010-03-24 00:03:13 +00:00
Jakob Stoklund Olesen	a86ccbfe88	Revert "Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings." This reverts commit 99345. It was breaking buildbots. llvm-svn: 99352	2010-03-23 23:48:51 +00:00

14933 Commits