llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Wilson	8a99b730a9	ARM function alignments were off by a power of two. svn 83242 changed getFunctionAlignment and the corresponding use of that value in the ARM asm printer, but now we're using the standard asm printer. The result of this was that function alignments were dropped completely for Thumb functions. Radar 8143571. llvm-svn: 107435	2010-07-01 22:26:26 +00:00
Duncan Sands	6d28e73acc	Remove initialized but otherwise unused variables. llvm-svn: 107127	2010-06-29 11:22:26 +00:00
Eli Friedman	8cfa7713e9	Followup to r106770: actually generate SXTB and SXTH for sign-extensions. llvm-svn: 106940	2010-06-26 04:36:50 +00:00
Evan Cheng	b71233f34d	It's now possible to run code placement pass for ARM. llvm-svn: 106935	2010-06-26 01:52:05 +00:00
Evan Cheng	02b184de5b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Bob Wilson	eadbf9732f	Reduce indentation. llvm-svn: 106819	2010-06-25 04:12:31 +00:00
Dale Johannesen	d24c66b4a3	Do not do tail calls to external symbols. If the branch turns out to be ARM-to-Thumb or vice versa the linker cannot resolve this. 8120438. If this optimization is going to be useful we probably need a compiler flag "assume callees are same architecture" or something like that. llvm-svn: 106662	2010-06-23 18:52:34 +00:00
Jim Grosbach	a8ea498171	When using libcall expansions for the atomic intrinsics, the explicit MEMBARRIER fences aren't necessary for ARM. Tell the combiner to fold them away. llvm-svn: 106631	2010-06-23 16:08:49 +00:00
Bob Wilson	72df24037e	sign_extend_inreg needs to be expanded for pre-v6 Thumb as well as ARM. Radar 8104310. llvm-svn: 106484	2010-06-21 21:27:34 +00:00
Bob Wilson	0ae08935f6	Fix error message to match function name. llvm-svn: 106381	2010-06-19 05:32:09 +00:00
Evan Cheng	f3c01f3ef6	Disable sibcall optimization for Thumb1 for now since Thumb1RegisterInfo::emitEpilogue is not expecting them. llvm-svn: 106368	2010-06-19 01:01:32 +00:00
Jim Grosbach	a57c2885cf	back-end libcall handling for ATOMIC_SWAP (__sync_lock_test_and_set) llvm-svn: 106342	2010-06-18 23:03:10 +00:00
Jim Grosbach	6860bb7796	Enable Expand handling of atomics for subtargets that can't do them inline. llvm-svn: 106336	2010-06-18 22:35:32 +00:00
Dale Johannesen	c1570dda5c	Enable tail calls on ARM by default, with some basic tests. This has been well tested on Darwin but not elsewhere. It should work provided the linker correctly resolves B.W <label in other function> which it has not seen before, at least from llvm-based compilers. I'm leaving the arm-tail-calls switch in until I see if there's any problems because of that; it might need to be disabled for some environments. llvm-svn: 106299	2010-06-18 19:00:18 +00:00
Dale Johannesen	3ac52b3e43	Last round of changes for ARM tail calls. Not turning them on yet. llvm-svn: 106295	2010-06-18 18:13:11 +00:00
Jakob Stoklund Olesen	b9f91667e1	Treat the ARM inline asm {cc} constraint as a physreg (%CPSR), just like X86 does for {flags}. If we create virtual registers of the CCR class, RegAllocFast may try to spill them, and we can't do that. llvm-svn: 106289	2010-06-18 16:49:33 +00:00
Jim Grosbach	5712c77c89	Thumb1 and any pre-v6 ARM target should use the libcall expansion of ISD::MEMBARRIER. v7 and v7 ARM mode continue to use the custom lowering. llvm-svn: 106204	2010-06-17 02:02:03 +00:00
Jim Grosbach	6e758c97fd	simplify code a bit and add a more explanatory assert for cases that previously would result in 'cannot yet select' errors. llvm-svn: 106199	2010-06-17 01:37:00 +00:00
Jim Grosbach	e3864cc15e	format and 80-column cleanup llvm-svn: 106173	2010-06-16 23:45:49 +00:00
Bob Wilson	01ac8f9fc0	Remove the hidden "neon-reg-sequence" option. The reg sequences are working now, so there's no need to disable them. llvm-svn: 106155	2010-06-16 21:34:01 +00:00
Evan Cheng	f128bdcb55	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Dale Johannesen	44f9dfc9cf	Next round of tail call changes. Register used in a tail call must not be callee-saved; following x86, add a new regclass to represent this. Also fixes a couple of bugs. Still disabled by default; Thumb doesn't work yet. llvm-svn: 106053	2010-06-15 22:08:33 +00:00
Bob Wilson	f3f7a770b7	Add basic support for NEON modified immediates besides VMOV. llvm-svn: 106030	2010-06-15 19:05:35 +00:00
Bob Wilson	5b2b504038	Rename functions referring to VMOV immediates to refer to NEON "modified immediate" operands. These functions have so far only been used for VMOV but they also apply to other NEON instructions with modified immediate operands. No functional changes. llvm-svn: 105969	2010-06-14 22:19:57 +00:00
Bob Wilson	f07d33d8f1	Add a missing bitcast. This code used to only handle conversions between i64 and f64 types, but now it also handle Neon vector types, so the f64 result of VMOVDRR may need to be converted to a Neon type. Radar 8084742. llvm-svn: 105845	2010-06-11 22:45:25 +00:00
Bob Wilson	6eae520de9	Add instruction encoding for the Neon VMOV immediate instruction. This changes the machine instruction representation of the immediate value to be encoded into an integer with similar fields as the actual VMOV instruction. This makes things easier for the disassembler, since it can just stuff the bits into the immediate operand, but harder for the asm printer since it has to decode the value to be printed. Testcase for the encoding will follow later when MC has more support for ARM. llvm-svn: 105836	2010-06-11 21:34:50 +00:00
Bob Wilson	846bd7992c	Further changes for Neon vector shuffles: - change isShuffleMaskLegal to show that all shuffles with 32-bit and 64-bit elements are legal - the Neon shuffle instructions do not support 64-bit elements, but we were not checking for that before lowering shuffles to use them - remove some 64-bit element vduplane patterns that are no longer needed llvm-svn: 105586	2010-06-07 23:53:38 +00:00
Dale Johannesen	81ef35b3ca	Improvements to tail call code. No functional effect unless using -arm-tail-calls. llvm-svn: 105515	2010-06-05 00:51:39 +00:00
Dale Johannesen	d1b9311afa	More thoroughly disable tails calls by default. 8060143, although this doesn't fix the real problem with tail call. llvm-svn: 105472	2010-06-04 18:04:24 +00:00
Bob Wilson	d8a9a04739	For NEON vectors with 32- or 64-bit elements, select BUILD_VECTORs and VECTOR_SHUFFLEs to REG_SEQUENCE instructions. The standard ISD::BUILD_VECTOR node corresponds closely to REG_SEQUENCE but I couldn't use it here because its operands do not get legalized. That is pretty awful, but I guess it makes sense for other targets. Instead, I have added an ARM-specific version of BUILD_VECTOR that will have its operands properly legalized. This fixes the rest of Radar 7872877. llvm-svn: 105439	2010-06-04 00:04:02 +00:00
Dale Johannesen	d679ff7330	Early implementation of tail call for ARM. A temporary flag -arm-tail-calls defaults to off, so there is no functional change by default. Intrepid users may try this; simple cases work but there are bugs. llvm-svn: 105413	2010-06-03 21:09:53 +00:00
Jim Grosbach	84511e1526	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Evan Cheng	bf91499f1a	Schedule high latency instructions for latency reduction even if they are not vfp / NEON instructions. llvm-svn: 105060	2010-05-28 23:25:23 +00:00
Jim Grosbach	faa3abbe39	Update the saved stack pointer in the sjlj function context following either an alloca() or an llvm.stackrestore(). rdar://8031573 llvm-svn: 104900	2010-05-27 23:49:24 +00:00
Jim Grosbach	c9f532dddc	back out 104862/104869. Can reuse stacksave after all. Very cool. llvm-svn: 104897	2010-05-27 23:11:57 +00:00
Jim Grosbach	5cde219fb1	add ISD::STACKADDR to get the current stack pointer. Will be used by sjlj EH to update the jmpbuf in the presence of VLAs. llvm-svn: 104862	2010-05-27 18:23:48 +00:00
Jim Grosbach	c98892fdaa	Adjust eh.sjlj.setjmp to properly have a chain and to have an opcode entry in ISD::. No functional change. llvm-svn: 104734	2010-05-26 20:22:18 +00:00
Bob Wilson	26fdebcae9	Clean up indentation. llvm-svn: 104580	2010-05-25 03:36:52 +00:00
Evan Cheng	755d45be43	LR is in GPR, not tGPR even in Thumb1 mode. llvm-svn: 104518	2010-05-24 18:00:18 +00:00
Bob Wilson	49f40e8c32	VDUP doesn't support vectors with 64-bit elements. llvm-svn: 104455	2010-05-23 05:42:31 +00:00
Evan Cheng	168ced94d8	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Jim Grosbach	bd9485db63	Implement eh.sjlj.longjmp for ARM. Clean up the intrinsic a bit. Followups: docs patch for the builtin and eh.sjlj.setjmp cleanup to match longjmp. llvm-svn: 104419	2010-05-22 01:06:18 +00:00
Bob Wilson	91fdf68516	Recognize more BUILD_VECTORs and VECTOR_SHUFFLEs that can be implemented by copying VFP subregs. This exposed a bunch of dead code in the *spill-q.ll tests, so I tweaked those tests to keep that code from being optimized away. Radar 7872877. llvm-svn: 104415	2010-05-22 00:23:12 +00:00
Evan Cheng	34c260458a	Change ARM scheduling default to list-hybrid if the target supports floating point instructions (and is not using soft float). llvm-svn: 104307	2010-05-21 00:43:17 +00:00
Evan Cheng	4401f8873c	Allow targets more controls on what nodes are scheduled by reg pressure, what for latency in hybrid mode. llvm-svn: 104293	2010-05-20 23:26:43 +00:00
Bob Wilson	5954994bba	Handle Neon v2f64 and v2i64 vector shuffles as register copies. This fixes the remaining issue with pr7167. llvm-svn: 104257	2010-05-20 18:39:53 +00:00
Evan Cheng	738e920edf	Code refactoring: pull SchedPreference enum from TargetLowering.h to TargetMachine.h and put it in its own namespace. llvm-svn: 104147	2010-05-19 20:19:50 +00:00
Evan Cheng	f19384d54a	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Anton Korobeynikov	4c719c4515	Generalize the ARM DAG combiner of mul with constants to all power-of-two cases. llvm-svn: 103901	2010-05-16 08:54:20 +00:00
Anton Korobeynikov	1bf28a128b	Some cheap DAG combine goodness for multiplication with a particular constant. This can be extended later on to handle more "complex" constants. llvm-svn: 103881	2010-05-15 18:16:59 +00:00
Evan Cheng	3d214cdfaf	v4i64 and v8i64 are only synthesizable when NEON is available. llvm-svn: 103855	2010-05-15 02:20:21 +00:00
Evan Cheng	4cad68eb34	Allow TargetLowering::getRegClassFor() to be called on illegal types. Also allow target to override it in order to map register classes to illegal but synthesizable types. e.g. v4i64, v8i64 for ARM / NEON. llvm-svn: 103854	2010-05-15 02:18:07 +00:00
Evan Cheng	cd67c21407	Added a QQQQ register file to model 4-consecutive Q registers. llvm-svn: 103760	2010-05-14 02:13:41 +00:00
Dan Gohman	bb919dfb6b	Implement a bunch more TargetSelectionDAGInfo infrastructure. Move EmitTargetCodeForMemcpy, EmitTargetCodeForMemset, and EmitTargetCodeForMemmove out of TargetLowering and into SelectionDAGInfo to exercise this. llvm-svn: 103481	2010-05-11 17:31:57 +00:00
Evan Cheng	2fa5a7e7e4	Select @llvm.trap to the special B with 1111 condition (i.e. trap) instruction. llvm-svn: 103459	2010-05-11 07:26:32 +00:00
Evan Cheng	c2ae5f546f	Model vld2 / vst2 with reg_sequence. llvm-svn: 103411	2010-05-10 17:34:18 +00:00
Jim Grosbach	2a41cad900	Clean up the conditional for handling of sign_extend_inreg based on whether the extract instructions are available. rdar://7956878 llvm-svn: 103277	2010-05-07 18:34:55 +00:00
Jim Grosbach	151cd8f159	Cleanup of ARMv7M support. Move hardware divide and Thumb2 extract/pack instructions to subtarget features and update tests to reflect. PR5717. llvm-svn: 103136	2010-05-05 23:44:43 +00:00
Jim Grosbach	92d999001c	Add initial support for ARMv7M subtarget and cortex-m3 cpu. Patch by Jordy <snhjordy@gmail.com>. Followup patches will add some tests and adjust to use Subtarget features for the instructions. llvm-svn: 103119	2010-05-05 20:44:35 +00:00
Evan Cheng	d85631e700	Model CONCAT_VECTORS of two 64-bit values as a REG_SEQUENCE. llvm-svn: 103104	2010-05-05 18:28:36 +00:00
Dan Gohman	25c1653700	Get rid of the EdgeMapping map. Instead, just check for BasicBlock changes before doing phi lowering for switches. llvm-svn: 102809	2010-05-01 00:01:06 +00:00
Dan Gohman	21cea8ac2e	Use const qualifiers with TargetLowering. This eliminates several const_casts, and it reinforces the design of the Target classes being immutable. SelectionDAGISel::IsLegalToFold is now a static member function, because PIC16 uses it in an unconventional way. There is more room for API cleanup here. And PIC16's AsmPrinter no longer uses TargetLowering. llvm-svn: 101635	2010-04-17 15:26:15 +00:00
Dan Gohman	31ae586c74	Move per-function state out of TargetLowering subclasses and into MachineFunctionInfo subclasses. llvm-svn: 101634	2010-04-17 14:41:14 +00:00
Bob Wilson	59b70eacad	Revise my previous change to ExpandBIT_CONVERT. I hadn't realized that this may be called when either the source or destination type is i64, and my change also hadn't fixed the most obvious problem -- assuming that i64 will only be bitconverted to f64, ignoring the various vector types. Radar 7873160. llvm-svn: 101615	2010-04-17 05:30:19 +00:00
Evan Cheng	f7f97b4bbd	Use default lowering of DYNAMIC_STACKALLOC. As far as I can tell, ARM isle is doing the right thing and codegen looks correct for both Thumb and Thumb2. llvm-svn: 101410	2010-04-15 22:20:34 +00:00
Anders Carlsson	47bccf7f28	Fix build. llvm-svn: 101335	2010-04-15 03:11:28 +00:00
Dan Gohman	bcaf681cde	Add const qualifiers to CodeGen's use of LLVM IR constructs. llvm-svn: 101334	2010-04-15 01:51:59 +00:00
Jim Grosbach	32bb362655	Add -arm-long-calls option to force calls to be indirect. This makes the kernel linker happier when dealing with kexts. Radar 7805069 llvm-svn: 101303	2010-04-14 22:28:31 +00:00
Bob Wilson	c05b887c84	Don't custom lower bit converts to ARM VMOVDRRD or VMOVDRR when the operand does not have a legal type. The legalizer does not know how to handle those nodes. Radar 7854640. llvm-svn: 101282	2010-04-14 20:45:23 +00:00
Bob Wilson	699bdf7adf	Handle a v2f64 formal parameter that is split between registers and memory such that the entire second half is in memory. Radar 7855014. llvm-svn: 101181	2010-04-13 22:03:22 +00:00
Bob Wilson	5202269dc4	Expand SELECT and SELECT_CC for NEON vector types. Radar 7770501. llvm-svn: 100568	2010-04-06 22:02:24 +00:00
Mon P Wang	c576ee9040	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Mon P Wang	999c1b927b	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a972ab8564	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Bob Wilson	6f7fd28824	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	7460571381	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Jim Grosbach	07607382d8	tweak the arm if conversion heuristic llvm-svn: 99402	2010-03-24 16:15:14 +00:00
Jim Grosbach	e0874fa02f	try being more permissive for if-conversion on ARM V7. see what the nightly test run permformance numbers say as to whether it helps. llvm-svn: 99355	2010-03-24 00:03:13 +00:00
Bob Wilson	e4191e719b	Revert this change, since it was causing ARM performance regressions. --- Reverse-merging r98889 into '.': U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/ARMISelLowering.h U lib/Target/ARM/ARMInstrInfo.td U lib/Target/ARM/ARMInstrVFP.td U lib/Target/ARM/ARMISelLowering.cpp U lib/Target/ARM/ARMInstrFormats.td llvm-svn: 99010	2010-03-19 22:51:32 +00:00
Anton Korobeynikov	f11aa9e7b4	Get rid of target-specific fp <-> int nodes when still I'm here. llvm-svn: 98889	2010-03-18 22:35:45 +00:00
Anton Korobeynikov	64578d5599	Get rid of target-specific nodes for fp16 <-> fp32 conversion. llvm-svn: 98888	2010-03-18 22:35:37 +00:00
Bob Wilson	3f2293bc02	Translate "cc" clobber in ARM inline assembly to ARM::CCRRegisterClass. Radar 7459078. llvm-svn: 98586	2010-03-15 23:09:18 +00:00
Bill Wendling	bbcaa40227	Now that the default for Darwin platforms is to place the LSDA into the TEXT section, remove the target-specific code that performs this. llvm-svn: 98580	2010-03-15 21:09:38 +00:00
Anton Korobeynikov	0a65a37344	Add substarget feature for FP16 llvm-svn: 98503	2010-03-14 18:42:38 +00:00
Anton Korobeynikov	d7fece38fc	Add codegen support for FP16 on ARM llvm-svn: 98502	2010-03-14 18:42:31 +00:00
Bill Wendling	9481181d40	The ARM EH experiment worked! Place the LSDA into the TEXT section for ARM platforms. This involves making the encoding indirect, pcrel, and sdata4 instead of an absolute pointer. The references to the type infos are then non-lazy pointers. Revision 98019 changed the encoding of non-lazy pointers to add the symbol to the non-lazy pointer definition if it's a local symbol (otherwise, it's external and set to '0' so that the loader can adjust it to the real value). This paved the way for this change to work on ARM. llvm-svn: 98068	2010-03-09 18:31:07 +00:00
Bill Wendling	46ffefc66c	This is part of an LLC-beta test used to test <rdar://problem/6804645>. Please bear with the awful code. It won't last in its current state beyond tonight. llvm-svn: 98040	2010-03-09 02:46:12 +00:00
Bill Wendling	78c5b7a76d	Remove dead parameter passing. llvm-svn: 97536	2010-03-02 01:55:18 +00:00
Bob Wilson	ba8ac74fd9	Check for comparisons of +/- zero when optimizing less-than-or-equal and greater-than-or-equal SELECT_CCs to NEON vmin/vmax instructions. This is only allowed when UnsafeFPMath is set or when at least one of the operands is known to be nonzero. llvm-svn: 97065	2010-02-24 22:15:53 +00:00
Jim Grosbach	6ad4bcb0da	LowerCall() should always do getCopyFromReg() to reference the stack pointer. Machine instruction selection is much happier when operands are in virtual registers. llvm-svn: 97012	2010-02-24 01:43:03 +00:00
Bob Wilson	c6c13a3515	Use NEON vmin/vmax instructions for floating-point selects. Radar 7461718. llvm-svn: 96572	2010-02-18 06:05:53 +00:00
David Greene	0d0149f5ac	Remove an assumption of default arguments. This is in anticipation of a change to SelectionDAG build APIs. llvm-svn: 96230	2010-02-15 16:55:24 +00:00
Jim Grosbach	a570d05228	tighten up eh.setjmp sequence a bit. llvm-svn: 95603	2010-02-08 23:22:00 +00:00
Evan Cheng	6f36a083ef	Revert 95130. llvm-svn: 95160	2010-02-02 23:55:14 +00:00
Evan Cheng	c1b0116ff1	Pass callsite return type to TargetLowering::LowerCall and use that to check sibcall eligibility. llvm-svn: 95130	2010-02-02 21:29:10 +00:00
Anton Korobeynikov	25df248382	Fix a gross typo: ARMv6+ may or may not support unaligned memory operations. Even if they are suported by the core, they can be disabled (this is just a configuration bit inside some register). Allow unaligned memops on darwin and conservatively disallow them otherwise. llvm-svn: 94889	2010-01-30 14:08:12 +00:00
Evan Cheng	67a69dd2ed	Eliminate target hook IsEligibleForTailCallOptimization. Target independent isel should always pass along the "tail call" property. Change target hook LowerCall's parameter "isTailCall" into a refernce. If the target decides it's impossible to honor the tail call request, it should set isTailCall to false to make target independent isel happy. llvm-svn: 94626	2010-01-27 00:07:07 +00:00
Bob Wilson	6a4491b8c7	Wrap some comments to 80 columns. llvm-svn: 93940	2010-01-19 22:56:26 +00:00
Jim Grosbach	8546ec9c14	Patch by David Conrad: "On ARMv6T2 this turns cttz into rbit, clz instead of the 4 instruction sequence it is now." llvm-svn: 93758	2010-01-18 19:58:49 +00:00

1 2 3 4 5 ...

462 Commits