Commit Graph

4485 Commits

Author SHA1 Message Date
Devang Patel cae2fbd6fc Let dwarf writer allocate extra space in the debug location expression. This space, if requested, will be used for complex addresses of the Blocks' variables.
llvm-svn: 130178
2011-04-26 00:12:46 +00:00
Sebastian Redl 5519ff9d4e Fix Target/ARM/Thumb1FrameLowering.h header guard.
llvm-svn: 130097
2011-04-24 15:47:01 +00:00
Andrew Trick 0ed5778a1e Thumb2 and ARM add/subtract with carry fixes.
Fixes Thumb2 ADCS and SBCS lowering: <rdar://problem/9275821>.
t2ADCS/t2SBCS are now pseudo instructions, consistent with ARM, so the
assembly printer correctly prints the 's' suffix.

Fixes Thumb2 adde -> SBC matching to check for live/dead carry flags.

Fixes the internal ARM machine opcode mnemonic for ADCS/SBCS.
Fixes ARM SBC lowering to check for live carry (potential bug).

llvm-svn: 130048
2011-04-23 03:55:32 +00:00
Andrew Trick 1a1f8d4640 whitespace
llvm-svn: 130046
2011-04-23 03:24:11 +00:00
Johnny Chen 57c892860e Disassembly of A8.6.59 LDR (literal) Encoding T1 (16-bit Thumb instruction) should
print out ldr, not ldr.n.

rdar://problem/9267772

llvm-svn: 130008
2011-04-22 19:12:43 +00:00
Devang Patel 3c39ec2933 Add asserts.
llvm-svn: 129995
2011-04-22 16:44:29 +00:00
Evan Cheng c0d2004e3c In Thumb2 mode, lower frame index references to:
add <rd>, sp, #<imm8>
ldr <rd>, [sp, #<imm8>]
when the offset from sp is a multiple of 4 and in the range 0-1020.
This saves code size by using 16-bit instructions.

rdar://9321541

llvm-svn: 129971
2011-04-22 01:42:52 +00:00
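The range constraint above comes from the 16-bit encodings, which hold the offset as a word count in an 8-bit field (255 * 4 = 1020 bytes). A minimal sketch of the check; the helper name is hypothetical, not the actual LLVM code:

    #include <cstdint>

    // Hypothetical helper (not the actual LLVM code): the 16-bit
    // "add <rd>, sp, #<imm8>" and "ldr <rd>, [sp, #<imm8>]" forms encode the
    // offset as imm8 words, so it must be a multiple of 4 in [0, 1020].
    static bool fitsThumb16SPOffset(int64_t Offset) {
      return Offset >= 0 && Offset <= 1020 && (Offset % 4) == 0;
    }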
Devang Patel 94ad6ac13c Fix DWARF description of Q registers.
llvm-svn: 129952
2011-04-21 23:22:35 +00:00
Devang Patel 3712c14be9 Fix DWARF description of S registers.
llvm-svn: 129947
2011-04-21 22:48:26 +00:00
Devang Patel 46bda61a81 As per ARM docs, register Dx is described as DW_OP_regx(256+x) in DWARF.
llvm-svn: 129922
2011-04-21 17:51:06 +00:00
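Per the message above, the DWARF numbers for the D registers are a simple offset; a tiny illustrative helper (name hypothetical):

    #include <cstdint>

    // Dx is described to the debugger as DW_OP_regx(256 + x),
    // so D0 -> 256 and D31 -> 287.
    static uint16_t dwarfRegNumForD(unsigned DIndex) {
      return static_cast<uint16_t>(256 + DIndex);
    }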
Evan Cheng 5f1ba4cd2d Remove -use-divmod-libcall. Let targets opt in when they are available.
llvm-svn: 129884
2011-04-20 22:20:12 +00:00
Jakob Stoklund Olesen 0e34c1dfac Prefer cheap registers for busy live ranges.
On the x86-64 and thumb2 targets, some registers are more expensive to encode
than others in the same register class.

Add a CostPerUse field to the TableGen register description, and make it
available from TRI->getCostPerUse. This represents the cost of a REX prefix or a
32-bit instruction encoding required by choosing a high register.

Teach the greedy register allocator to prefer cheap registers for busy live
ranges (as indicated by spill weight).

llvm-svn: 129864
2011-04-20 18:19:48 +00:00
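A rough sketch of the preference described above, assuming a per-register cost like the one TRI->getCostPerUse reports; illustrative only, not the actual greedy-allocator code:

    // Illustrative only: among otherwise-acceptable physical registers, pick
    // one with the lowest per-use cost (e.g. avoiding a REX prefix on x86-64
    // or a 32-bit Thumb2 encoding) when the live range is busy.
    struct CandidateReg {
      unsigned PhysReg;
      unsigned CostPerUse; // what TRI->getCostPerUse(PhysReg) would report
    };

    // Assumes N >= 1. When the live range is not busy, simply keep the first
    // candidate; the cost tie-break only matters for busy ranges.
    static unsigned pickCheapReg(const CandidateReg *Cands, unsigned N,
                                 bool LiveRangeIsBusy) {
      unsigned Best = 0;
      for (unsigned I = 1; I < N; ++I)
        if (LiveRangeIsBusy && Cands[I].CostPerUse < Cands[Best].CostPerUse)
          Best = I;
      return Cands[Best].PhysReg;
    }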
Stuart Hastings 7850af6ea0 Excise unintended hunk in 129858. <rdar://problem/7662569>
llvm-svn: 129862
2011-04-20 18:09:26 +00:00
Stuart Hastings 45fe3c38c5 ARM byval support. Will be enabled by another patch to the FE. <rdar://problem/7662569>
llvm-svn: 129858
2011-04-20 16:47:52 +00:00
Johnny Chen dc62e59776 Fix typo in the comment.
llvm-svn: 129837
2011-04-19 23:58:52 +00:00
Daniel Dunbar 2b9b0e3748 ADT/Triple: Move a variety of clients to using isOSDarwin() and isOSWindows()
predicates.

llvm-svn: 129816
2011-04-19 21:14:45 +00:00
Eric Christopher c721b0db6d Remove some duplicate op action entries and reorganize.
llvm-svn: 129781
2011-04-19 18:49:19 +00:00
Bob Wilson 0858c3aaed This patch combines several changes from Evan Cheng for rdar://8659675.
Making use of VFP / NEON floating point multiply-accumulate / subtraction is
difficult on current ARM implementations for a few reasons.
1. Even though a single vmla has a latency that is one cycle shorter than a pair
   of vmul + vadd, a RAW hazard during the first few cycles (4? on Cortex-A8)
   can cause an additional pipeline stall. So it's frequently better to simply
   codegen vmul + vadd.
2. A vmla followed by a vmul, vadd, or vsub causes the second fp instruction to
   stall for 4 cycles. We need to schedule them apart.
3. A vmla followed by a vmla is a special case. Obviously, issuing back-to-back
   RAW vmla + vmla is very bad. But this isn't ideal either:
     vmul
     vadd
     vmla
   Instead, we want to expand the second vmla:
     vmla
     vmul
     vadd
   Even with the 4 cycle vmul stall, the second sequence is still 2 cycles
   faster.

Up to now, isel simply avoided codegen'ing fp vmla / vmls. This works well
enough, but it isn't the optimal solution. This patch attempts to make it
possible to use vmla / vmls in cases where it is profitable.

A. Add missing isel predicates which cause vmla to be codegen'ed.
B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to
   compute both an fmul and an fmla.
C. Add additional isel checks for vmla, avoiding cases where vmla is feeding
   into fp instructions (except for the #3 exceptional case).
D. Add ARM hazard recognizer to model the vmla / vmls hazards.
E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the
   vmla / vmls will trigger one of the special hazards.

Enable these fp vmlx codegen changes for Cortex-A9.

llvm-svn: 129775
2011-04-19 18:11:57 +00:00
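A sketch of the trigger for point E above, the pre-regalloc expansion of the second of two back-to-back multiply-accumulates; the enum and function are hypothetical, not the actual pass:

    // Illustrative only: when a vmla/vmls would issue back-to-back with a
    // RAW-dependent vmla/vmls, expand the second one into vmul + vadd, giving
    // the "vmla; vmul; vadd" sequence that the message above says is still
    // 2 cycles faster despite the 4-cycle vmul stall.
    enum FPOpcode { VMLA, VMLS, VMUL, VADD, OtherFP };

    static bool shouldExpandSecondMLA(FPOpcode Prev, FPOpcode Cur,
                                      bool CurDependsOnPrev) {
      bool PrevIsMLA = (Prev == VMLA || Prev == VMLS);
      bool CurIsMLA = (Cur == VMLA || Cur == VMLS);
      return PrevIsMLA && CurIsMLA && CurDependsOnPrev;
    }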
Bob Wilson d04a83f8f2 Add -mcpu=cortex-a9-mp. It's cortex-a9 with MP extension. rdar://8648637.
llvm-svn: 129774
2011-04-19 18:11:52 +00:00
Bob Wilson a2881ee8a4 Avoid some 16-bit 's' instructions, which partially update CPSR
(and add a false dependency), when the instruction isn't dependent on the last
CPSR-defining instruction. rdar://8928208

llvm-svn: 129773
2011-04-19 18:11:49 +00:00
Bob Wilson df612ba006 Avoid write-after-write issue hazards for Cortex-A9.
Add an avoidWriteAfterWrite() target hook to identify register classes that
suffer from write-after-write hazards. For those register classes, try to avoid
writing the same register in two consecutive instructions.

This is currently disabled by default.  We should not spill to avoid hazards!
The command line flag -avoid-waw-hazard can be used to enable waw avoidance.

llvm-svn: 129772
2011-04-19 18:11:45 +00:00
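A minimal sketch of the policy described above, assuming a flag on the register class like the one avoidWriteAfterWrite() would report; the types here are hypothetical:

    // Illustrative only: for register classes that suffer write-after-write
    // hazards, avoid giving two consecutive instructions the same destination.
    struct RegClassInfo { bool AvoidWAWHazard; };

    static bool okToDefAgain(const RegClassInfo &RC, unsigned PrevDefReg,
                             unsigned CandidateDefReg) {
      return !RC.AvoidWAWHazard || PrevDefReg != CandidateDefReg;
    }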
Bob Wilson 3e5944d96b Some single-precision VFP instructions can execute in either the VFP or NEON
pipeline, at least on Cortex-A9.

llvm-svn: 129771
2011-04-19 18:11:38 +00:00
Bob Wilson f33715e554 Improvements for the Cortex-A9 scheduling itineraries.
llvm-svn: 129770
2011-04-19 18:11:36 +00:00
Evan Cheng 7d6cd4902e Change the A9 scheduling itineraries so that VLD* / VST* entries default to
"aligned". That is, assume addresses are 64-bit aligned (which should be the
more common case). If the address turns out not to be aligned, then
getOperandLatency() adjusts the operand latency computation by one to
compensate.
rdar://9294833

llvm-svn: 129742
2011-04-19 01:21:49 +00:00
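The compensation described above amounts to a one-cycle bump when the access is less than 64-bit aligned. A sketch assuming the alignment is known in bytes; not the actual getOperandLatency() code:

    // Illustrative only: the itinerary latency assumes a 64-bit-aligned
    // address; charge one extra cycle when the known alignment is smaller.
    static unsigned adjustVLDVSTLatency(unsigned ItinLatency,
                                        unsigned AlignInBytes) {
      return AlignInBytes >= 8 ? ItinLatency : ItinLatency + 1;
    }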
Evan Cheng 4079133796 Do not lose mem_operands while lowering VLD / VST intrinsics.
llvm-svn: 129738
2011-04-19 00:04:03 +00:00
Jim Grosbach ddac5dd269 Trim a few unneeded includes.
llvm-svn: 129723
2011-04-18 21:35:54 +00:00
Sean Callanan 5d73033e0f Small fix to the ARM AsmParser to ensure that a
superclass variable is instantiated properly.

llvm-svn: 129713
2011-04-18 20:20:44 +00:00
Stuart Hastings ebddfe60a0 Correct result when a branch condition is live across a block
boundary.  <rdar://problem/8933028>

llvm-svn: 129634
2011-04-16 03:31:26 +00:00
Johnny Chen 48592ee5af Thumb2 BFC was insufficiently encoded.
rdar://problem/9292717

llvm-svn: 129619
2011-04-15 22:52:15 +00:00
Johnny Chen 761e1e3512 A8.6.315 VLD3 (single 3-element structure to all lanes)
The a bit must be encoded as 0.

rdar://problem/9292625

llvm-svn: 129618
2011-04-15 22:49:08 +00:00
Cameron Zwarich 9c65e4d69c Add ORR and EOR to the CMP peephole optimizer. It's hard to get isel to generate
a case involving EOR, so I only added a test for ORR.

llvm-svn: 129610
2011-04-15 21:24:38 +00:00
Cameron Zwarich 0829b3065a The AND instruction leaves the V flag unmodified, so it falls victim to the same
problem as all of the other instructions we fold with CMPs.

llvm-svn: 129602
2011-04-15 20:45:00 +00:00
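The hazard is that ANDS (like the other folded instructions) sets N and Z but leaves V alone, so a following condition that reads V cannot rely on the folded flags. A hypothetical guard, illustrative only:

    // Illustrative only: ARM condition codes that read the V flag; a CMP can
    // only be folded into a flag-setting ANDS/ORRS/EORS if no user of the
    // flags needs V.
    enum ARMCond { EQ, NE, CS, CC, MI, PL, VS, VC, HI, LS, GE, LT, GT, LE, AL };

    static bool condReadsV(ARMCond Cond) {
      return Cond == VS || Cond == VC || Cond == GE || Cond == LT ||
             Cond == GT || Cond == LE;
    }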
Cameron Zwarich 93eae1571c Add missing register forms of instructions to the ARM CMP-folding code. This
fixes <rdar://problem/9287901>.

llvm-svn: 129599
2011-04-15 20:28:28 +00:00
Chris Lattner 0ab5e2cded Fix a ton of comment typos found by codespell. Patch by
Luis Felipe Strano Moraes!

llvm-svn: 129558
2011-04-15 05:18:47 +00:00
Evan Cheng 12bb05b75b Fix another fcopysign lowering bug. If src is f64 and destination is f32, don't
forget to right shift the source by 32 first. rdar://9287902

llvm-svn: 129556
2011-04-15 01:31:00 +00:00
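In scalar terms, the bug above is about where the sign bit lives: in an f64 it is bit 63, so the 64-bit pattern has to be shifted right by 32 before bit 31 of the low word holds the sign. A minimal illustration, not the DAG lowering itself:

    #include <cstdint>
    #include <cstring>

    // Illustrative only: copy the sign of an f64 onto an f32 magnitude.
    static float copySignF64ToF32(float Mag, double Sign) {
      uint64_t SignBits;
      uint32_t MagBits;
      std::memcpy(&SignBits, &Sign, sizeof SignBits);
      std::memcpy(&MagBits, &Mag, sizeof MagBits);
      // The shift the commit adds: move bit 63 down into bit 31.
      uint32_t SignWord = static_cast<uint32_t>(SignBits >> 32);
      uint32_t Bits = (MagBits & 0x7fffffffu) | (SignWord & 0x80000000u);
      float Result;
      std::memcpy(&Result, &Bits, sizeof Result);
      return Result;
    }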
Johnny Chen 681fef5986 For t2BFI, both Inst{26} and Inst{5} "should" be 0.
Ref: I.1 Instruction encoding diagrams and pseudocode
llvm-svn: 129552
2011-04-15 00:35:08 +00:00
Johnny Chen 421316178e The ARM disassembler did not handle the alignment correctly for VLD*DUP* instructions
(single element or n-element structure to all lanes).

llvm-svn: 129550
2011-04-15 00:10:45 +00:00
Evan Cheng 44887f9c7e Follow up on r127913. Fix Thumb revsh isel. rdar://9286766
llvm-svn: 129548
2011-04-14 23:27:44 +00:00
Johnny Chen 4251b151b1 Add sanity checks for the Thumb2 Load/Store Register Exclusive family of operations.
llvm-svn: 129531
2011-04-14 19:13:28 +00:00
Johnny Chen d0fb04f437 Thumb disassembler did not handle tBRIND (indirect branch) properly.
rdar://problem/9280370

llvm-svn: 129480
2011-04-13 21:59:01 +00:00
Johnny Chen b6a37bff21 Check for unallocated instruction encodings when disassembling Thumb Branch instructions (tBcc and t2Bcc).
rdar://problem/9280470

llvm-svn: 129471
2011-04-13 21:35:49 +00:00
Johnny Chen ffa6378fd6 The LDR*T/STR*T (unprivileged load/store) operations don't take SP or PC as Rt.
rdar://problem/9279440

llvm-svn: 129469
2011-04-13 21:04:32 +00:00
Cameron Zwarich 415b5e8341 Fix a typo in an ARM-specific DAG combine. This fixes <rdar://problem/9278274>.
llvm-svn: 129468
2011-04-13 21:01:19 +00:00
Johnny Chen 70591cbc60 Check the corner cases for t2LDRSHi12 correctly and mark invalid encodings as such.
rdar://problem/9276651

llvm-svn: 129462
2011-04-13 19:46:05 +00:00
Johnny Chen 0d306a7840 Fix a bug where for t2MOVCCi disassembly, the TIED_TO register operand was not properly handled.
rdar://problem/9276427

llvm-svn: 129456
2011-04-13 17:51:02 +00:00
Johnny Chen b2f9fa1fce Forgot to add this change for http://llvm.org/viewvc/llvm-project?view=rev&revision=129387.
llvm-svn: 129451
2011-04-13 16:56:08 +00:00
Cameron Zwarich 8001850ee8 Fix a typo.
llvm-svn: 129429
2011-04-13 06:39:16 +00:00
Johnny Chen 3c2f74c9f3 Add sanity check for Ld/St Dual forms of Thumb2 instructions.
rdar://problem/9273947

llvm-svn: 129411
2011-04-12 23:31:00 +00:00
Jakob Stoklund Olesen 987164043c Add @earlyclobber constraints to the writeback register of all ARM store instructions.
The ARMARM specifies these instructions as unpredictable when storing the
writeback register. This shouldn't affect code generation much since storing a
pointer to itself is quite rare.

llvm-svn: 129409
2011-04-12 23:27:48 +00:00
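The unpredictable case being ruled out is a writeback store whose stored register is also the base, e.g. "str r0, [r0], #4". A hypothetical legality check, illustrative only:

    // Illustrative only: a post-indexed/writeback store is unpredictable (per
    // the ARMARM) when the value being stored is the writeback register, e.g.
    //   str r0, [r0], #4
    static bool isPredictableWritebackStore(unsigned StoredReg,
                                            unsigned BaseWritebackReg) {
      return StoredReg != BaseWritebackReg;
    }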
Johnny Chen 960eef3db3 The Thumb2 RFE instructions need to have their second halfword fully specified.
In addition, the base register is not rGPR, but GPR, with the exception that:

    if n == 15 then UNPREDICTABLE

rdar://problem/9273836

llvm-svn: 129391
2011-04-12 21:41:51 +00:00