llvm-project

Commit Graph

Author	SHA1	Message	Date
Eli Friedman	26a484852e	Code generation for 'fence' instruction. llvm-svn: 136283	2011-07-27 22:21:52 +00:00
Jim Grosbach	8b31ef50c0	ARM extend instructions simplification. Refactor the SXTB, SXTH, SXTB16, UXTB, UXTH, and UXTB16 instructions to not have an 'r' and an 'r_rot' version, but just a single version with a rotate that can be zero. Use plain Pat<>'s for the ISel of the non-rotated version. llvm-svn: 136225	2011-07-27 16:47:19 +00:00
Owen Anderson	b595ed0085	Split up the ARM so_reg ComplexPattern into so_reg_reg and so_reg_imm, allowing us to distinguish the encodings that use shifted registers from those that use shifted immediates. This is necessary to allow the fixed-length decoder to distinguish things like BICS vs LDRH. llvm-svn: 135693	2011-07-21 18:54:16 +00:00
Evan Cheng	a20cde31e7	Sink ARMMCExpr and ARMAddressingModes into MC layer. First step to separate ARM MC code from target. llvm-svn: 135636	2011-07-20 23:34:39 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Evan Cheng	f863e3fb73	Improve codegen for select's: if (x != 0) x = 1 if (x == 1) x = 1 Previous codegen looks like this: mov r1, r0 cmp r1, #1 mov r0, #0 moveq r0, #1 The naive lowering select between two different values. It should recognize the test is equality test so it's more a conditional move rather than a select: cmp r0, #1 movne r0, #0 rdar://9758317 llvm-svn: 135017	2011-07-13 00:42:17 +00:00
Cameron Zwarich	f03fa189ca	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Jim Grosbach	3840c90f73	Add more info to FIXME. llvm-svn: 134729	2011-07-08 20:18:11 +00:00
Jim Grosbach	cf1464d943	ARMv7M vs. ARMv7E-M support. The DSP instructions in the Thumb2 instruction set are an optional extension in the Cortex-M* archtitecture. When present, the implementation is considered an "ARMv7E-M implementation," and when not, an "ARMv7-M implementation." Add a subtarget feature hook for the v7e-m instructions and hook it up. The cortex-m3 cpu is an example of a v7m implementation, while the cortex-m4 is a v7e-m implementation. rdar://9572992 llvm-svn: 134261	2011-07-01 21:12:19 +00:00
Eric Christopher	29f1db85dd	Add support for the 'j' immediate constraint. This is conditionalized on supporting the instruction that the constraint is for 'movw'. Part of rdar://9119939 llvm-svn: 134222	2011-07-01 01:00:07 +00:00
Eric Christopher	c011d31543	Add support for the ARM 't' register constraint. And another testcase for the 'x' register constraint. Part of rdar://9119939 llvm-svn: 134220	2011-07-01 00:30:46 +00:00
Eric Christopher	f09b0f1043	We'll return a null RC by default if we can't match. Part of rdar://9119939 llvm-svn: 134217	2011-07-01 00:19:27 +00:00
Eric Christopher	f1c74595aa	Add support for the 'x' constraint. Part of rdar://9307836 and rdar://9119939 llvm-svn: 134215	2011-07-01 00:14:47 +00:00
Eric Christopher	1f054f27af	Capitalize the unsigned part of the initializer. llvm-svn: 134211	2011-06-30 23:59:16 +00:00
Eric Christopher	cf2007ca78	Rename Pair to RCPair lacking any better naming ideas. llvm-svn: 134210	2011-06-30 23:50:52 +00:00
Eric Christopher	f45daac30f	Add support for the 'h' constraint. Part of rdar://9119939 llvm-svn: 134203	2011-06-30 23:23:01 +00:00
Eric Christopher	c486b47b15	Add a convenience typedef for std::pair<unsigned, const TargetRegisterClass*>. No functional change. Part of rdar://9119939 llvm-svn: 134198	2011-06-30 22:17:01 +00:00
Eric Christopher	1b8b9419ba	Remove getRegClassForInlineAsmConstraint from the ARM port. Part of rdar://9643582 llvm-svn: 134095	2011-06-29 21:10:36 +00:00
Evan Cheng	6cc775f905	- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and sink them into MC layer. - Added MCInstrInfo, which captures the tablegen generated static data. Chang TargetInstrInfo so it's based off MCInstrInfo. llvm-svn: 134021	2011-06-28 19:10:37 +00:00
Chad Rosier	6b610b387d	Remove warning: 'c0' may be used uninitialized in this function. llvm-svn: 134014	2011-06-28 17:26:57 +00:00
Chad Rosier	fa8d89327f	The Neon VCVT (between floating-point and fixed-point, Advanced SIMD) instructions can be used to match combinations of multiply/divide and VCVT (between floating-point and integer, Advanced SIMD). Basically the VCVT immediate operand that specifies the number of fraction bits corresponds to a floating-point multiply or divide by the corresponding power of 2. For example, VCVT (floating-point to fixed-point, Advanced SIMD) can replace a combination of VMUL and VCVT (floating-point to integer) as follows: Example (assume d17 = <float 8.000000e+00, float 8.000000e+00>): vmul.f32 d16, d17, d16 vcvt.s32.f32 d16, d16 becomes: vcvt.s32.f32 d16, d16, #3 Similarly, VCVT (fixed-point to floating-point, Advanced SIMD) can replace a combinations of VCVT (integer to floating-point) and VDIV as follows: Example (assume d17 = <float 8.000000e+00, float 8.000000e+00>): vcvt.f32.s32 d16, d16 vdiv.f32 d16, d17, d16 becomes: vcvt.f32.s32 d16, d16, #3 llvm-svn: 133813	2011-06-24 19:23:04 +00:00
Eric Christopher	e256cd0565	Handle the memory-ness of all U+ ARM constraints. Noticed on inspection. llvm-svn: 133553	2011-06-21 22:10:57 +00:00
Benjamin Kramer	25e17b0f89	Remove unused but set variables. llvm-svn: 133347	2011-06-18 11:09:41 +00:00
Bruno Cardoso Lopes	d66ab9ead1	Mark ldrexd/strexd w/ volatile memory by default llvm-svn: 133175	2011-06-16 18:11:32 +00:00
Chad Rosier	2730162bee	Revision r128665 added an optimization to make use of NEON multiplier accumulator forwarding. Specifically (from SVN log entry): Distribute (A + B) * C to (A * C) + (B * C) to make use of NEON multiplier accumulator forwarding: vadd d3, d0, d1 vmul d3, d3, d2 => vmul d3, d0, d2 vmla d3, d1, d2 Make sure it catches cases where operand 1 is add/fadd/sub/fsub, which was intended in the original revision. llvm-svn: 133127	2011-06-16 01:21:54 +00:00
Bob Wilson	4b12a11f30	A minor simplification: no functional change. llvm-svn: 133047	2011-06-15 06:04:34 +00:00
Evan Cheng	6d02d9044b	PerformBFICombine - (bfi A, (and B, Mask1), Mask2) -> (bfi A, B, Mask2) iff the bits being cleared by the AND are not demanded by the BFI. The previous BFI dag combine rule was actually incorrect (or used to be correct until BFI representation changed). rdar://9609030 llvm-svn: 133034	2011-06-15 01:12:31 +00:00
Tanya Lattner	e9e6705cf9	Add an optimization that looks for a specific pair-wise add pattern and generates a vpaddl instruction instead of scalarizing the add. Includes a test case. llvm-svn: 133027	2011-06-14 23:48:48 +00:00
Bruno Cardoso Lopes	dc9ff3a4b1	Add one more argument to the prefetch intrinsic to indicate whether it's a data or instruction cache access. Update the targets to match it and also teach autoupgrade. llvm-svn: 132976	2011-06-14 04:58:37 +00:00
Cameron Zwarich	890197859b	Provide an ARMCCState subclass of CCState so that ARM clients will always set CallOrPrologue correctly and eliminate the existing setter. llvm-svn: 132856	2011-06-10 20:59:24 +00:00
Cameron Zwarich	361548d4b4	A CCState was being created without setting whether it is in the Call or Prologue state, causing an assertion failure downstream. This fixes <rdar://problem/9562908>. This really seems like it should always be set at CCState creation time, so mistakes like this can never happen. I'll take a look at doing that. llvm-svn: 132811	2011-06-09 22:30:07 +00:00
Eric Christopher	0713a9d8fc	Add a parameter to CCState so that it can access the MachineFunction. No functional change. Part of PR6965 llvm-svn: 132763	2011-06-08 23:55:35 +00:00
Eric Christopher	354b2a25f3	Make the Uv constraint a memory operand. This doesn't solve the addressing mode problem mentioned in r132559. Backend part of rdar://9037836 and part of rdar://9119939 llvm-svn: 132561	2011-06-03 17:24:37 +00:00
Eric Christopher	de9399bf76	Have LowerOperandForConstraint handle multiple character constraints. Part of rdar://9119939 llvm-svn: 132510	2011-06-02 23:16:42 +00:00
John McCall	7d84ece09b	On Darwin ARM, set the UNWIND_RESUME libcall to _Unwind_SjLj_Resume. This is important for the correct lowering of unwind instructions (which doesn't matter at all) and llvm.eh.resume calls (which does). Take 2, now with more basic competence. llvm-svn: 132295	2011-05-29 19:50:32 +00:00
John McCall	e64371b932	I didn't mean to commit these residues of a personal project. llvm-svn: 132293	2011-05-29 19:41:56 +00:00
John McCall	085d891d80	On Darwin ARM, set the UNWIND_RESUME libcall to _Unwind_SjLj_Resume. This is important for the correct lowering of unwind instructions (which doesn't matter at all) and llvm.eh.resume calls (which does). llvm-svn: 132291	2011-05-29 19:39:04 +00:00
Bruno Cardoso Lopes	325110f30d	Add support for ARM ldrexd/strexd intrinsics. They both use i32 register pairs to load/store i64 values. Since there's no current support to explicitly declare such restrictions, implement it by using specific hardcoded register pairs during isel. llvm-svn: 132248	2011-05-28 04:07:29 +00:00
Cameron Zwarich	1d553a2cc4	Fix the remaining atomic intrinsics to use the right register classes on Thumb2, and add some basic tests for them. llvm-svn: 132235	2011-05-27 23:54:00 +00:00
Evan Cheng	518bcd0ef4	Don't use movw / movt for iOS static codegen for now to workaround some tools issues. rdar://9514789 llvm-svn: 132211	2011-05-27 20:11:27 +00:00
Renato Golin	4cd5187f5b	RTABI chapter 4.3.4 specifies __eabi_mem* calls. Specifically, __eabi_memset accepts parameters (ptr, size, value) in a different order than GNU's memset (ptr, value, size), therefore the special lowering in AAPCS mode. Implementation by Evzen Muller. llvm-svn: 131868	2011-05-22 21:41:23 +00:00
Evan Cheng	4fcd8250ae	Revert accidental commit. llvm-svn: 131739	2011-05-20 17:38:48 +00:00
Evan Cheng	e8d2e9eb35	Revert r131664 and fix it in instcombine instead. rdar://9467055 llvm-svn: 131708	2011-05-20 00:54:37 +00:00
Mon P Wang	6d9e1c7c2e	Fixed sdiv and udiv for <4 x i16>. The test from r125402 still applies for this change. llvm-svn: 131630	2011-05-19 04:15:07 +00:00
Tanya Lattner	1d11720ae4	Handle perfect shuffle case that generates a vrev for vectors of floats. Add test case. llvm-svn: 131582	2011-05-18 21:44:54 +00:00
Evan Cheng	522fbfea3b	Revise r131553. Just use the type of the input node and forgo the bitcast. rdar://9449159. llvm-svn: 131555	2011-05-18 18:59:17 +00:00
Evan Cheng	80632c91b0	Fix an ARMTargetLowering::LowerSELECT bug: legalized result must have same type as input. Sorry test cases only trigger when dag combine is disabled. rdar://9449178 llvm-svn: 131553	2011-05-18 18:47:27 +00:00
Tanya Lattner	48b182c3a4	In r131488 I misunderstood how VREV works. It splits the vector in half and splits each half. Therefore, the real problem was that we were using a VREV64 for a 4xi16, when we should have been using a VREV32. Updated test case and reverted change to the PerfectShuffle Table. llvm-svn: 131529	2011-05-18 06:42:21 +00:00
Cameron Zwarich	f9839e4257	Fix typo. llvm-svn: 131519	2011-05-18 02:29:50 +00:00
Cameron Zwarich	d7c55fe2ef	Fix more of PR8825 by correctly using rGPR registers when lowering atomic compare-and-swap intrinsics. llvm-svn: 131518	2011-05-18 02:20:07 +00:00

1 2 3 4 5 ...

650 Commits