llvm-project

Commit Graph

Author	SHA1	Message	Date
Jakob Stoklund Olesen	3bb1267431	Use COPY in FastISel everywhere it is safe and trivial. The remaining copyRegToReg calls actually check the return value (shock!), so we cannot trivially replace them with COPY instructions. llvm-svn: 108069	2010-07-11 03:31:00 +00:00
Jakob Stoklund Olesen	7002c31480	Replace copyRegToReg with copyPhysReg for Mips. llvm-svn: 108066	2010-07-11 01:08:31 +00:00
Jakob Stoklund Olesen	7198d32fc6	Replace copyRegToReg with copyPhysReg for Alpha. llvm-svn: 108065	2010-07-11 01:08:23 +00:00
Jakob Stoklund Olesen	60af0681cb	Use COPY in targets llvm-svn: 108063	2010-07-10 22:43:03 +00:00
Jakob Stoklund Olesen	0c76d6ec21	Replace copyRegToReg with COPY everywhere in lib/CodeGen except for FastISel. llvm-svn: 108062	2010-07-10 22:42:59 +00:00
Jakob Stoklund Olesen	ad89613b65	Only collect subreg extracting copies for later coalescing. This also avoids fatal copies from physregs. llvm-svn: 108061	2010-07-10 22:42:53 +00:00
Dan Gohman	a64a323564	Fix a bug in the code which re-inserts DBG_VALUE nodes after scheduling; if a block is split (by a custom inserter), the insert point may be in a different block than it was originally. This fixes 32-bit llvm-gcc bootstrap builds, and I haven't been able to reproduce it otherwise. llvm-svn: 108060	2010-07-10 22:42:31 +00:00
Duncan Sands	82b21c086e	The accumulator tail recursion transform claims to work for any associative operation, but the way it's implemented requires the operation to also be commutative. So add a check for commutativity (and tweak the corresponding comments). This makes no difference in practice since every associative LLVM instruction is also commutative! Here's an example to show the need for commutativity: the accum_recursion.ll testcase calculates the factorial function. Before the transformation the result of a call is ((((11)2)3)...)x while afterwards it is (((1x)(x-1))...2)1 which clearly requires both associativity and commutativity of * to be equal to the original. llvm-svn: 108056	2010-07-10 20:31:42 +00:00
Jakob Stoklund Olesen	e50d30d586	Emit COPY instructions instead of using copyRegToReg in InstrEmitter, ScheduleDAGEmit, TwoAddressLowering, and PHIElimination. This switches the bulk of register copies to using COPY, but many less used copyRegToReg calls remain. llvm-svn: 108050	2010-07-10 19:08:25 +00:00
Jakob Stoklund Olesen	de457896b6	Don't emit st(0)/st(1) copies as FpMOV instructions. Use FpSET_ST? instead. Based on a patch by Rafael Espíndola. Attempt to make the FpSET_ST1 hack more robust, but we are still relying on FpSET_ST0 preceeding it. This is only for supporting really weird x87 inline asm. We support: FpSET_ST0 INLINEASM FpSET_ST0 FpSET_ST1 INLINEASM with and without kills on the arguments. We don't support: FpSET_ST1 FpSET_ST0 INLINEASM nor FpSET_ST1 INLINEASM Just Don't Do It! llvm-svn: 108047	2010-07-10 17:42:34 +00:00
Dan Gohman	fbdba81550	Insert IMPLICIT_DEF instructions at the current insert position, not at the end of the block. llvm-svn: 108045	2010-07-10 13:55:45 +00:00
Chandler Carruth	d162d85688	Add parentheses yet again to satisfy GCC's warnings. llvm-svn: 108043	2010-07-10 12:06:22 +00:00
Dan Gohman	d7b5ce3312	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Jakob Stoklund Olesen	be8d9b0bb8	An x86 function returns a floating point value in st(0), and we must make sure it is popped, even if it is ununsed. A CopyFromReg node is too weak to represent the required sideeffect, so insert an FpGET_ST0 instruction directly instead. This will matter when CopyFromReg gets lowered to a generic COPY instruction. llvm-svn: 108037	2010-07-10 04:04:25 +00:00
Devang Patel	57e72370ae	Update DBG_VALUE to refer appropriate stack slot in case of a spill. llvm-svn: 108023	2010-07-09 21:48:31 +00:00
Bruno Cardoso Lopes	5e6c2155a3	Declare YMM subregisters in the right way! Thanks Jakob llvm-svn: 108022	2010-07-09 21:46:19 +00:00
Bruno Cardoso Lopes	2419606bfb	Add AVX 256-bit packed MOVNT variants llvm-svn: 108021	2010-07-09 21:42:42 +00:00
Jakob Stoklund Olesen	e2614a9979	Remember the *_TC opcodes for load/store llvm-svn: 108020	2010-07-09 21:27:55 +00:00
Bruno Cardoso Lopes	6bc772eec7	Add AVX 256-bit unpack and interleave llvm-svn: 108017	2010-07-09 21:20:35 +00:00
Jakob Stoklund Olesen	b5c899d11b	Fix small bug in isMoveInstr -> COPY translation llvm-svn: 108013	2010-07-09 20:55:49 +00:00
Jakob Stoklund Olesen	7a7b55eb67	Automatically fold COPY instructions into stack load/store. llvm-svn: 108012	2010-07-09 20:43:13 +00:00
Jakob Stoklund Olesen	51702ec46b	Fix a few tests llvm-svn: 108011	2010-07-09 20:43:09 +00:00
Jakob Stoklund Olesen	e9fdcaa68a	Remat uncoalescable COPY instrs llvm-svn: 108010	2010-07-09 20:43:05 +00:00
Jim Grosbach	2a5725b1a3	In the presence of variable sized objects, allocate an emergency spill slot. rdar://8131327 llvm-svn: 108008	2010-07-09 20:27:06 +00:00
Bill Wendling	f831d86311	Clarify what mysterious check means. llvm-svn: 108005	2010-07-09 19:44:12 +00:00
Dan Gohman	7929c448fc	Fix MachineLICM to actually visit inner loops. llvm-svn: 108001	2010-07-09 18:49:45 +00:00
Bruno Cardoso Lopes	792e906bef	Start the support for AVX instructions with 256-bit %ymm registers. A couple of notes: - The instructions are being added with dummy placeholder patterns using some 256 specifiers, this is not meant to work now, but since there are some multiclasses generic enough to accept them, when we go for codegen, the stuff will be already there. - Add VEX encoding bits to support YMM - Add MOVUPS and MOVAPS in the first round - Use "Y" as suffix for those Instructions: MOVUPSYrr, ... - All AVX instructions in X86InstrSSE.td will move soon to a new X86InstrAVX file. llvm-svn: 107996	2010-07-09 18:27:43 +00:00
Jakob Stoklund Olesen	bd953d1805	Change TII::foldMemoryOperand API to require the machine instruction to be inserted in a MBB, and return an already inserted MI. This target API change is necessary to allow foldMemoryOperand to call storeToStackSlot and loadFromStackSlot when folding a COPY to a stack slot reference in a target independent way. The foldMemoryOperandImpl hook is going to change in the same way, but I'll wait until COPY folding is actually implemented. Most targets only fold copies and won't need to specialize this hook at all. llvm-svn: 107991	2010-07-09 17:29:08 +00:00
Gabor Greif	9d5ae03404	cache result of operator* llvm-svn: 107990	2010-07-09 16:51:20 +00:00
Gabor Greif	8e66a42784	remove useless cast and fix typos in comment llvm-svn: 107989	2010-07-09 16:42:04 +00:00
Gabor Greif	3b740e9085	cache result of operator* llvm-svn: 107988	2010-07-09 16:39:02 +00:00
Bob Wilson	6586e9b203	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Gabor Greif	fd8e7d4a0f	cache result of operator* llvm-svn: 107984	2010-07-09 16:31:08 +00:00
Gabor Greif	e7650c7c29	cache result of operator* llvm-svn: 107983	2010-07-09 16:26:41 +00:00
Gabor Greif	aa389f5085	cache result of operator* llvm-svn: 107982	2010-07-09 16:22:36 +00:00
Gabor Greif	04af1e4f65	cache result of operator* llvm-svn: 107981	2010-07-09 16:17:52 +00:00
Gabor Greif	52617fc462	cache result of operator* llvm-svn: 107980	2010-07-09 16:08:33 +00:00
Gabor Greif	2c0ab48ac2	cache result of operator* llvm-svn: 107979	2010-07-09 16:01:21 +00:00
Gabor Greif	070b9a2cc4	cache result of operator* llvm-svn: 107978	2010-07-09 15:53:42 +00:00
Gabor Greif	d9a0e80213	cache result of operator* llvm-svn: 107977	2010-07-09 15:52:36 +00:00
Gabor Greif	e82532a1c5	cache result of operator* llvm-svn: 107976	2010-07-09 15:40:10 +00:00
Gabor Greif	6d8870fc35	cache result of operator* llvm-svn: 107975	2010-07-09 15:25:42 +00:00
Gabor Greif	329c4d8ed9	cache result of operator* llvm-svn: 107974	2010-07-09 15:25:09 +00:00
Gabor Greif	0028cc6730	cache result of operator* llvm-svn: 107972	2010-07-09 15:01:36 +00:00
Gabor Greif	d323f5e161	cache result of operator* (found by inspection) llvm-svn: 107971	2010-07-09 14:48:08 +00:00
Gabor Greif	b0d56ffc85	cache result of operator* llvm-svn: 107969	2010-07-09 14:36:49 +00:00
Gabor Greif	4247949ce9	cache result of operator* llvm-svn: 107968	2010-07-09 14:29:14 +00:00
Gabor Greif	2732561be9	cache result of operator* llvm-svn: 107967	2010-07-09 14:28:41 +00:00
Gabor Greif	a02f232c1b	cache result of operator* llvm-svn: 107966	2010-07-09 14:18:23 +00:00
Gabor Greif	f0821f39ee	cache operator*'s result (in multiple functions) llvm-svn: 107965	2010-07-09 14:02:13 +00:00
Gabor Greif	1d20021d82	do not repeatedly dereference use_iterator llvm-svn: 107963	2010-07-09 13:17:13 +00:00
Gabor Greif	60a346d0f1	do not repeatedly dereference use_iterator llvm-svn: 107962	2010-07-09 12:23:50 +00:00
Jakob Stoklund Olesen	d4d9e53b20	Avoid creating %physreg:subidx operands in SimpleRegisterCoalescing::RemoveCopyByCommutingDef. This fixes PR7602. llvm-svn: 107957	2010-07-09 05:56:21 +00:00
Jakob Stoklund Olesen	cac54d6435	Deal with a few remaining spots that assume physical registers have live intervals. This fixes PR7601. llvm-svn: 107955	2010-07-09 04:35:38 +00:00
Bruno Cardoso Lopes	992d25da71	Merge VEX enums with other x86 enum forms. Also fix all checks of which VEX fields to use. llvm-svn: 107952	2010-07-09 01:56:45 +00:00
Jakob Stoklund Olesen	66b3649030	Fix broken isCopy handling in TrimLiveIntervalToLastUse. llvm-svn: 107950	2010-07-09 01:27:21 +00:00
Jakob Stoklund Olesen	5165fa1c39	Handle COPY in VirtRegRewriter. llvm-svn: 107949	2010-07-09 01:27:19 +00:00
Dan Gohman	0a7d155d67	Fix the memoperand offsets in code generated for va_start. llvm-svn: 107948	2010-07-09 01:06:48 +00:00
Chris Lattner	88c185617c	have the mc lowering process handle a few tail call forms, lowering them to jumps where possible and turning the TAILCALL marker in the instruction asm string into a proper comment. This eliminates a FIXME and is on the path to finishing: rdar://7639610 - eliminate encoding and asm info for TAILJMPd TAILJMPr TAILJMPn, etc. However, I can't eliminate the encodings for these instructions because the JIT still exists and has its own copy of the encoder, sigh. llvm-svn: 107946	2010-07-09 00:49:41 +00:00
Bob Wilson	88a4e6dc0e	Print "dregpair" NEON operands with a space between them, for readability and consistency with other instructions that have lists of register operands. llvm-svn: 107944	2010-07-09 00:47:20 +00:00
Dan Gohman	0b5aa1cdd3	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Bruno Cardoso Lopes	e6cc0d33bb	Factor out x86 segment override prefix encoding, and also use it for VEX llvm-svn: 107942	2010-07-09 00:38:14 +00:00
Bob Wilson	21eed476e8	Reenable DAG combining for vector shuffles. It looks like it was temporarily disabled and then never turned back on again. Adjust some tests, one because this change avoids an unnecessary instruction, and the other to make it continue testing what it was intended to test. llvm-svn: 107941	2010-07-09 00:38:12 +00:00
Chris Lattner	061d70ad2c	reject pseudo instructions early in the encoder. llvm-svn: 107939	2010-07-09 00:17:50 +00:00
Bruno Cardoso Lopes	b652c1a145	Remove trailing whitespaces from file llvm-svn: 107937	2010-07-09 00:07:19 +00:00
Chris Lattner	f469307c77	Change LEA to have 5 operands for its memory operand, just like all other instructions, even though a segment is not allowed. This resolves a bunch of gross hacks in the encoder and makes LEA more consistent with the rest of the instruction set. No functionality change. llvm-svn: 107934	2010-07-08 23:46:44 +00:00
Stuart Hastings	d08fb75aaa	Reverting r107918 and r107919. Radar 8063111. llvm-svn: 107930	2010-07-08 23:25:39 +00:00
Jakob Stoklund Olesen	823e90e12a	Revert "Fix broken isCopy handling in TrimLiveIntervalToLastUse" This reverts commit 107921. It broke the clang self host. llvm-svn: 107926	2010-07-08 22:52:47 +00:00
Chris Lattner	ec536276f0	add some long-overdue enums to refer to the parts of the 5-operand X86 memory operand. llvm-svn: 107925	2010-07-08 22:41:28 +00:00
Devang Patel	4c6bd6612f	Relax assertion. In optimized code, it is possible that first instruction is coming from a inlined function. This fixes PR7596 . llvm-svn: 107923	2010-07-08 22:39:20 +00:00
Bill Wendling	a992445ff2	Extension of r107506. Make sure that we don't mark a function as having a call if the inline ASM doesn't need a stack frame. llvm-svn: 107922	2010-07-08 22:38:02 +00:00
Jakob Stoklund Olesen	75c465585a	Fix broken isCopy handling in TrimLiveIntervalToLastUse llvm-svn: 107921	2010-07-08 22:30:38 +00:00
Jakob Stoklund Olesen	ec58a43d81	Remember the VR64 register class llvm-svn: 107920	2010-07-08 22:30:35 +00:00
Stuart Hastings	43d226deea	Fix decl/def debug info for template functions. Radar 8063111. llvm-svn: 107919	2010-07-08 22:28:59 +00:00
Chris Lattner	9f034c1e5d	Rework segment prefix emission code to handle segments in memory operands at the same type as hard coded segments. This fixes problems where we'd emit the segment override after the REX prefix on instructions like: mov %gs:(%rdi), %rax This fixes rdar://8127102. I have several cleanup patches coming next. llvm-svn: 107917	2010-07-08 22:28:12 +00:00
Chris Lattner	1dd82c7dc2	introduce a new X86II::getMemoryOperandNo method, which returns the start of the memory operand for an instruction. Introduce a new "X86AddrSegment" enum to reduce # magic numbers referring to X86 memory operand layout. llvm-svn: 107916	2010-07-08 22:27:06 +00:00
Kalle Raiskila	d799ea52cd	Switch SPU calling convention (function arguments) to a Tablegen implementation. llvm-svn: 107913	2010-07-08 21:15:22 +00:00
Kevin Enderby	ea9207cd7a	Revert some unneeded parts of the change in r107886 for the .weak_def_can_be_hidden directive. Chris pointed out that the MCAsmInfo.h/.cpp chunks aren't needed for this until the compiler starts generating these. And when that happens it will be more convenient for it to be a bool than a const char*. llvm-svn: 107906	2010-07-08 20:30:44 +00:00
Evan Cheng	0f54854a1d	Check for FiniteOnlyFPMath as well. llvm-svn: 107904	2010-07-08 20:12:24 +00:00
Devang Patel	9c160e1213	Reuse DIEInteger for 1. This is frequently used while emitting an attribute using dwarf::DW_FORM_flag form. llvm-svn: 107903	2010-07-08 20:10:35 +00:00
Jakob Stoklund Olesen	63a622b768	Teach the x86 floating point stackifier to handle COPY instructions. This pass runs before COPY instructions are passed to copyPhysReg, so we simply translate COPY to the proper pseudo instruction. Note that copyPhysReg does not handle floating point stack copies. Once COPY is used everywhere, this can be cleaned up a bit, and most of the pseudo instructions can be removed. llvm-svn: 107899	2010-07-08 19:46:30 +00:00
Jakob Stoklund Olesen	930f8082c3	Implement X86InstrInfo::copyPhysReg llvm-svn: 107898	2010-07-08 19:46:25 +00:00
Bob Wilson	181e5af248	The NEONPreAllocPass should never have to assign fixed registers anymore. This pass can go away entirely soon. llvm-svn: 107892	2010-07-08 17:45:26 +00:00
Bob Wilson	1eade1a327	For big-endian systems, VLD2/VST2 with 32-bit vector elements will swap the words within the 64-bit D registers. Use VLD1/VST1 with 64-bit elements instead. llvm-svn: 107890	2010-07-08 17:44:00 +00:00
Kevin Enderby	082d0fd7ad	Added the darwin .weak_def_can_be_hidden directive. llvm-svn: 107886	2010-07-08 17:22:42 +00:00
Bob Wilson	6c25043493	Clean up a comment. llvm-svn: 107882	2010-07-08 16:54:45 +00:00
Jim Grosbach	c280fc7514	Clean up scavengeRegister() a bit to prefer available regs, which allows the simplification of frame index register scavenging to not have to check for available registers directly and instead just let scavengeRegister() handle it. llvm-svn: 107880	2010-07-08 16:49:26 +00:00
Jakob Stoklund Olesen	00264624a9	Convert EXTRACT_SUBREG to COPY when emitting machine instrs. EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead. Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg(). The isMoveInstr hook will be removed later. llvm-svn: 107879	2010-07-08 16:40:22 +00:00
Jakob Stoklund Olesen	a1e883dcf6	Remove references to INSERT_SUBREG after de-SSA. Fix X86InstrInfo::convertToThreeAddressWithLEA to generate COPY instead of INSERT_SUBREG. llvm-svn: 107878	2010-07-08 16:40:15 +00:00
Benjamin Kramer	0ae3f08c0d	Merge the duplicated iabs optimization in DAGCombiner and let it detected a few more idioms. llvm-svn: 107868	2010-07-08 12:09:56 +00:00
Benjamin Kramer	2321e6a4d4	Teach instcombine to transform (X >s -1) ? C1 : C2 and (X <s 0) ? C2 : C1 into ((X >>s 31) & (C2 - C1)) + C1, avoiding the conditional. This optimization could be extended to take non-const C1 and C2 but we better stay conservative to avoid code size bloat for now. for int sel(int n) { return n >= 0 ? 60 : 100; } we now generate sarl $31, %edi andl $40, %edi leal 60(%rdi), %eax instead of testl %edi, %edi movl $60, %ecx movl $100, %eax cmovnsl %ecx, %eax llvm-svn: 107866	2010-07-08 11:39:10 +00:00
Eric Christopher	e796253217	A slight reworking of the custom patterns for x86-64 tpoff codegen and correct the testcase for valid assembly. Needs more tests. llvm-svn: 107860	2010-07-08 07:36:46 +00:00
Evan Cheng	be1f7a931e	r107852 is only safe with -enable-unsafe-fp-math to account for +0.0 == -0.0. llvm-svn: 107856	2010-07-08 06:01:49 +00:00
Jakob Stoklund Olesen	89a4e25007	Add TargetInstrInfo::copyPhysReg hook and use it from LowerSubregs. This target hook is intended to replace copyRegToReg entirely, but for now it calls copyRegToReg. Any remaining calls to copyRegToReg wil be replaced by COPY instructions. llvm-svn: 107854	2010-07-08 05:01:41 +00:00
Evan Cheng	25f9364cbd	Optimize some vfp comparisons to integer ones. This patch implements the simplest case when the following conditions are met: 1. The arguments are f32. 2. The arguments are loads and they have no uses other than the comparison. 3. The comparison code is EQ or NE. e.g. vldr.32 s0, [r1] vldr.32 s1, [r0] vcmpe.f32 s1, s0 vmrs apsr_nzcv, fpscr beq LBB0_2 => ldr r1, [r1] ldr r0, [r0] cmp r0, r1 beq LBB0_2 More complicated cases will be implemented in subsequent patches. llvm-svn: 107852	2010-07-08 02:08:50 +00:00
Dale Johannesen	e2289285ae	Changes to ARM tail calls, mostly cosmetic. Add explicit testcases for tail calls within the same module. Duplicate some code to humor those who think .w doesn't apply on ARM. Leave this disabled on Thumb1, and add some comments explaining why it's hard and won't gain much. llvm-svn: 107851	2010-07-08 01:18:23 +00:00
Dan Gohman	e75704369d	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Jim Grosbach	6533f24370	When processing frame index virtual registers, consider all available registers (if there are any) and use the one which remains available for the longest rather than just using the first one. This should help enable better re-use of the loaded frame index values. rdar://7318760 llvm-svn: 107847	2010-07-08 00:38:54 +00:00
Chris Lattner	efa3c824cc	Fix the second half of PR7437: scalarrepl wasn't preserving address spaces when SRoA'ing memcpy's. llvm-svn: 107846	2010-07-08 00:27:05 +00:00
Dan Gohman	eb9164dc50	Don't forward-declare registers for static allocas, which we'll prefer to materialize as local constants. This fixes the clang bootstrap abort. llvm-svn: 107840	2010-07-07 23:52:58 +00:00
Dan Gohman	1adc499dda	Fix -fast-isel-abort to check the right instruction. llvm-svn: 107839	2010-07-07 23:47:25 +00:00
Chris Lattner	9380b81837	use PrintEscapedString to handle attribute section with escapes in it, PR7399. The asm parser already handles this. This is of dubious utility (see the PR) but the asmprinter was clearly broken here. llvm-svn: 107834	2010-07-07 23:16:37 +00:00
Jakob Stoklund Olesen	6213ab789f	fix copies to/from GR8_ABCD_H even more llvm-svn: 107832	2010-07-07 23:04:56 +00:00
Jim Grosbach	73ef80f76f	grammar llvm-svn: 107831	2010-07-07 22:53:35 +00:00
Jim Grosbach	40eda1076a	Handle cases where the post-RA scheduler may move instructions between the address calculation instructions leading up to a jump table when we're trying to convert them into a TB[H] instruction in Thumb2. This realistically shouldn't happen much, if at all, for well formed inputs, but it's more correct to handle it. rdar://7387682 llvm-svn: 107830	2010-07-07 22:51:22 +00:00
Chris Lattner	05ea2a4791	finish up support for callw: PR7195 llvm-svn: 107826	2010-07-07 22:35:13 +00:00
Chris Lattner	ac5881295c	Implement the major chunk of PR7195: support for 'callw' in the integrated assembler. Still some discussion to be done. llvm-svn: 107825	2010-07-07 22:27:31 +00:00
Bruno Cardoso Lopes	6c61451011	Add more assembly opcodes for SSE compare instructions llvm-svn: 107823	2010-07-07 22:24:03 +00:00
Devang Patel	a37a95ea2f	One MDNode may be used to create regular DIE as well as abstract DIE. Keep track of abstract subprogram DIEs. llvm-svn: 107822	2010-07-07 22:20:57 +00:00
Evan Cheng	1c349f18f8	Move getExtLoad() and (some) getLoad() DebugLoc argument after EVT argument for consistency sake. llvm-svn: 107820	2010-07-07 22:15:37 +00:00
Devang Patel	32a600b494	Print undefined/unknown debug value as "undef". llvm-svn: 107818	2010-07-07 21:52:21 +00:00
Dan Gohman	25d5c1b4f8	Not all custom inserters create new basic blocks. If the inserter didn't create a new block, don't reset the insert position. llvm-svn: 107813	2010-07-07 21:18:22 +00:00
Jim Grosbach	e4ba2aa0c4	grammar and trailing whitespace llvm-svn: 107811	2010-07-07 21:06:51 +00:00
Devang Patel	9a0339fc1f	Rename couple of maps. llvm-svn: 107810	2010-07-07 20:49:57 +00:00
Jakob Stoklund Olesen	ddaf0099a5	Allow copies between GR8_ABCD_L and GR8_ABCD_H. This fixes PR7540. llvm-svn: 107809	2010-07-07 20:33:27 +00:00
Devang Patel	30265c4f8b	80 cols. llvm-svn: 107807	2010-07-07 20:12:52 +00:00
Dan Gohman	e7ccc51cc1	Implement bottom-up fast-isel. This has the advantage of not requiring a separate DCE pass over MachineInstrs. llvm-svn: 107804	2010-07-07 19:20:32 +00:00
Dan Gohman	2d4d01d0de	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Bruno Cardoso Lopes	fd8060335b	Add AVX AES instructions llvm-svn: 107798	2010-07-07 18:24:20 +00:00
Dan Gohman	b792f844ad	Update the insert position after scheduling, which may change the position when emitting multiple blocks when executing a custom inserter. llvm-svn: 107797	2010-07-07 18:22:13 +00:00
Devang Patel	637ee5f149	Update comment. llvm-svn: 107796	2010-07-07 18:18:18 +00:00
Dan Gohman	769201448d	Fix debugging strings. llvm-svn: 107795	2010-07-07 17:28:45 +00:00
Dan Gohman	ffe64b1ee5	Give FunctionLoweringInfo an MBB member, avoiding the need to pass it around everywhere, and also give it an InsertPt member, to enable isel to operate at an arbitrary position within a block, rather than just appending to a block. llvm-svn: 107791	2010-07-07 16:47:08 +00:00
Dan Gohman	87fb4e8fcd	Simplify FastISel's constructor by giving it a FunctionLoweringInfo instance, rather than pointers to all of FunctionLoweringInfo's members. This eliminates an NDEBUG ABI sensitivity. llvm-svn: 107789	2010-07-07 16:29:44 +00:00
Dan Gohman	e784616fbb	Move FunctionLoweringInfo.h out into include/llvm/CodeGen. This will allow target-specific fast-isel code to make use of it directly. llvm-svn: 107787	2010-07-07 16:01:37 +00:00
Dan Gohman	fe7532a308	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Chris Lattner	ca97c92eb4	add some triple for minix, patch by Kees van Reeuwijk from PR7582 llvm-svn: 107785	2010-07-07 15:52:27 +00:00
Dan Gohman	498e5f899d	Move CallingConvLower.cpp out of the SelectionDAG directory. llvm-svn: 107781	2010-07-07 15:15:27 +00:00
Jakob Stoklund Olesen	8e1338eea8	Fix more places assuming subregisters have live intervals llvm-svn: 107780	2010-07-07 14:41:22 +00:00
Dan Gohman	88c547ede9	Add a getFirstNonPHI utility function. llvm-svn: 107778	2010-07-07 14:33:51 +00:00
Dan Gohman	5b0a8a863f	Minore code simplification. llvm-svn: 107777	2010-07-07 14:30:04 +00:00
Dan Gohman	00ef93258a	Remove interprocedural-basic-aa and associated code. The AliasAnalysis interface needs implementations to be consistent, so any code which wants to support different semantics must use a different interface. It's not currently worthwhile to add a new interface for this new concept. Document that AliasAnalysis doesn't support cross-function queries. llvm-svn: 107776	2010-07-07 14:27:09 +00:00
Gabor Greif	a22e8148d4	conditionalize by CallInst::ArgOffset llvm-svn: 107767	2010-07-07 10:34:03 +00:00
Duncan Sands	408bb192de	Rename "Release" builds as "Release+Asserts"; rename "Release-Asserts" builds to "Release". The default build is unchanged (optimization on, assertions on), however it is now called Release+Asserts. The intent is that future LLVM releases released via llvm.org will be Release builds in the new sense, i.e. will have assertions disabled (currently they have assertions enabled, for a more than 20% slowdown). This will bring them in line with MacOS releases, which ship with assertions disabled. It also means that "Release" now means the same things in make and cmake builds: cmake already disables assertions for "Release" builds AFAICS. llvm-svn: 107758	2010-07-07 07:48:00 +00:00
Bruno Cardoso Lopes	6d122aef97	Add AVX SSE4.2 instructions llvm-svn: 107752	2010-07-07 03:39:29 +00:00
Bruno Cardoso Lopes	3df55b2d6f	Use only one multiclass to pinsrq instructions llvm-svn: 107750	2010-07-07 01:43:01 +00:00
Bruno Cardoso Lopes	fd6c808154	Now that almost all SSE4.1 AVX instructions are added, move code around to more appropriate sections. No functionality changes llvm-svn: 107749	2010-07-07 01:33:38 +00:00
Bruno Cardoso Lopes	8f5472a8e8	Add AVX SSE4.1 insertps, ptest and movntdqa instructions llvm-svn: 107747	2010-07-07 01:14:56 +00:00
Bruno Cardoso Lopes	6430c7350d	Add AVX SSE4.1 extractps and pinsr instructions llvm-svn: 107746	2010-07-07 01:01:13 +00:00
Jakob Stoklund Olesen	f0e551d4f4	Revert "Remove references to INSERT_SUBREG after de-SSA" r107725. Buildbot breakage. llvm-svn: 107744	2010-07-07 00:32:25 +00:00
Bob Wilson	5bc8a79e7f	Also use REG_SEQUENCE for VTBX instructions. llvm-svn: 107743	2010-07-07 00:08:54 +00:00
Jim Grosbach	3198483851	Mark eh.sjlj.set/longjmp custom lowerings as Darwin-only since that's where they've been tested to work. llvm-svn: 107742	2010-07-07 00:07:57 +00:00
Bruno Cardoso Lopes	f3116ebe96	Add AVX SSE4.1 Extract Integer instructions llvm-svn: 107740	2010-07-07 00:07:24 +00:00
Jim Grosbach	dc0a0659be	By default, the eh.sjlj.setjmp/longjmp intrinsics should just do nothing rather than assuming a target will custom lower them. Targets which do so should exlicitly mark them as having custom lowerings. PR7454. llvm-svn: 107734	2010-07-06 23:44:52 +00:00
Jakob Stoklund Olesen	e2d3067f6b	Remove references to INSERT_SUBREG after de-SSA llvm-svn: 107732	2010-07-06 23:40:35 +00:00
Bob Wilson	3ed511bc6b	Use REG_SEQUENCE nodes to make the table registers for VTBL instructions be allocated to consecutive registers. llvm-svn: 107730	2010-07-06 23:36:25 +00:00
Dale Johannesen	ce65663330	Accept RIP-relative symbols with 'i' constraint, and print the (%rip) only if the 'a' modifier is present. PR 7528. llvm-svn: 107727	2010-07-06 23:27:00 +00:00
Jakob Stoklund Olesen	70ee3ecd33	Convert INSERT_SUBREG to COPY in TwoAddressInstructionPass. INSERT_SUBREG will now only appear in SSA machine instructions. Fix the handling of partial redefs in ProcessImplicitDefs. This is now relevant since partial redef COPY instructions appear. llvm-svn: 107726	2010-07-06 23:26:25 +00:00
Jakob Stoklund Olesen	48deb12593	Track defs for all aliases in NEONMoveFix. This means that an instruction defining an S register will affect the domain of the parent D register. llvm-svn: 107725	2010-07-06 23:26:23 +00:00
Bruno Cardoso Lopes	1f9ad516c6	Add the rest of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107723	2010-07-06 23:15:17 +00:00
Bruno Cardoso Lopes	35702d27c4	Add part of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107720	2010-07-06 23:01:41 +00:00
Bruno Cardoso Lopes	13f0260e76	Fix comment from previous patch llvm-svn: 107717	2010-07-06 22:38:32 +00:00
Bruno Cardoso Lopes	e2bd058d32	Add AVX vblendvpd, vblendvps and vpblendvb instructions Update VEX encoding to support those new instructions llvm-svn: 107715	2010-07-06 22:36:24 +00:00
Dan Gohman	ee0cb70381	CanLowerReturn doesn't need a SelectionDAG; it just needs an LLVMContext. SelectBasicBlock doesn't needs its BasicBlock argument. llvm-svn: 107712	2010-07-06 22:19:37 +00:00
Devang Patel	a3ca21b228	Propagate debug loc. llvm-svn: 107710	2010-07-06 22:08:15 +00:00
Bob Wilson	4c1ca29039	Represent NEON load/store alignments in bytes, not bits. llvm-svn: 107701	2010-07-06 21:26:18 +00:00
Jakob Stoklund Olesen	15fed3bd30	One more case assuming that subregs have live ranges. llvm-svn: 107700	2010-07-06 21:13:03 +00:00
Jakob Stoklund Olesen	bcf3409107	Fix buildbot breakage where a def is missing. llvm-svn: 107698	2010-07-06 21:06:39 +00:00
Devang Patel	b36df17b08	Add fixme. llvm-svn: 107697	2010-07-06 21:05:17 +00:00
Jakob Stoklund Olesen	a64c0a3d22	Be more forgiving when calculating alias interference for physreg coalescing. It is OK for an alias live range to overlap if there is a copy to or from the physical register. CoalescerPair can work out if the copy is coalescable independently of the alias. This means that we can join with the actual destination interval instead of using the getOrigDstReg() hack. It is no longer necessary to merge clobber ranges into subregisters. llvm-svn: 107695	2010-07-06 20:31:51 +00:00
Dan Gohman	3439629239	Reapply r107655 with fixes; insert the pseudo instruction into the block before calling the expansion hook. And don't put EFLAGS in a mbb's live-in list twice. llvm-svn: 107691	2010-07-06 20:24:04 +00:00
Eric Christopher	dfc8b745a2	Fix to 80-col. llvm-svn: 107684	2010-07-06 18:35:20 +00:00
Devang Patel	23a7593534	Fix PR7545 crash. llvm-svn: 107678	2010-07-06 18:18:32 +00:00
Rafael Espindola	7c510aa7bc	Don't create neon moves in CopyRegToReg. NEONMoveFixPass will do the conversion if profitable. llvm-svn: 107673	2010-07-06 16:24:34 +00:00
Chris Lattner	dde2ba0b60	tighten up this code. llvm-svn: 107670	2010-07-06 15:59:27 +00:00
Dan Gohman	f4f04107ef	Revert r107655. llvm-svn: 107668	2010-07-06 15:49:48 +00:00
Dan Gohman	4e49b59dad	Add versions of OutputArgReg, AnalyzeReturn, and AnalyzeCallOperands which do not depend on SelectionDAG. llvm-svn: 107666	2010-07-06 15:39:54 +00:00
Dan Gohman	2b2a1c3c86	Make getMinimalPhysRegClass' comment mention what makes it different from getPhysicalRegisterRegClass. llvm-svn: 107660	2010-07-06 15:31:55 +00:00
Anton Korobeynikov	e415230477	Fix a major regression on COFF targets introduced by r103267: 'discardable' section means that it is used only during the program load and can be discarded afterwards. This way only debug sections can be discarded, but not the opposite. Seems like the copy-and-pasto from ELF code, since there it contains the reverse flag ('alloc'). llvm-svn: 107658	2010-07-06 15:24:56 +00:00
Dan Gohman	1e33b18e28	Add some more TODO comments. llvm-svn: 107657	2010-07-06 15:23:00 +00:00
Dan Gohman	f855b39edd	Add a comment. llvm-svn: 107656	2010-07-06 15:21:57 +00:00
Dan Gohman	12205645a6	Fix a bunch of custom-inserter functions to handle the case where the pseudo instruction is not at the end of the block. llvm-svn: 107655	2010-07-06 15:18:19 +00:00
Eric Christopher	2ad0c779c3	Fix up -fstack-protector on linux to use the segment registers. Split out testcases per architecture and os now. Patch from Nelson Elhage. llvm-svn: 107640	2010-07-06 05:18:56 +00:00
Nick Lewycky	dace239949	Detabify this file. llvm-svn: 107637	2010-07-06 03:53:43 +00:00
Eric Christopher	d429846eca	Have the X86 backend use Triple instead of a string and some enums. llvm-svn: 107625	2010-07-05 19:26:33 +00:00
Kalle Raiskila	d5ac287140	Remove some unused/redundant code. llvm-svn: 107622	2010-07-05 18:40:09 +00:00
Chris Lattner	c4a7073db3	more tidying. llvm-svn: 107615	2010-07-05 05:53:14 +00:00
Chris Lattner	7b909ac785	some notes about suboptimal insertps's llvm-svn: 107613	2010-07-05 05:48:41 +00:00
Chris Lattner	2c0315a0f3	random tidying llvm-svn: 107612	2010-07-05 05:36:21 +00:00
Chris Lattner	6d60a14251	rip out even more sporadic v2f32 support. llvm-svn: 107610	2010-07-05 04:38:33 +00:00
Chris Lattner	feb2467bf4	rip out the various v2f32 "mmx" handling logic, now that v2f32 is illegal on x86. llvm-svn: 107609	2010-07-05 04:36:27 +00:00
Jakob Stoklund Olesen	ac0a210789	Print symbolic subreg indices on REG_SEQUENCE and INSERT_SUBREG. llvm-svn: 107602	2010-07-04 23:24:23 +00:00
Chris Lattner	45cc4d74a3	Just rip v2f32 support completely out of the X86 backend. In the example in the testcase, we now generate: _test1: ## @test1 movss 4(%esp), %xmm0 addss 8(%esp), %xmm0 movl 12(%esp), %eax movss %xmm0, (%eax) ret instead of: _test1: ## @test1 subl $20, %esp movl 24(%esp), %eax movq %mm0, (%esp) movq %mm0, 8(%esp) movss (%esp), %xmm0 addss 12(%esp), %xmm0 movss %xmm0, (%eax) addl $20, %esp ret v2f32 support did not work reliably because most of the X86 backend didn't know it was legal. It was apparently only added to support returning source-level v2f32 values in MMX registers in x86-32 mode. If ABI compatibility is important on this GCC-extended-vector type for some reason, then the frontend should generate IR that returns v2i32 instead of v2f32. However, we generally don't try very hard to be abi compatible on gcc extended vectors. llvm-svn: 107601	2010-07-04 23:07:25 +00:00
Chris Lattner	681b926d54	fix PR7518 - terrible codegen of <2 x float>, by only marking v2f32 as legal in 32-bit mode. It is just as terrible there, but I just care about x86-64 and noone claims it is valuable in 64-bit mode. llvm-svn: 107600	2010-07-04 22:57:10 +00:00
Chris Lattner	cb948d3329	indentation llvm-svn: 107599	2010-07-04 22:56:10 +00:00
Evan Cheng	f3aeb2c22c	Infer alignments of fixed frame objects when they are constructed. This ensures remat'ed loads from fixed slots have the right alignments. llvm-svn: 107591	2010-07-04 18:52:05 +00:00
Bill Wendling	199cacf179	Revert r107583. I no longer think that this is the way to solve the problem. llvm-svn: 107585	2010-07-04 09:16:57 +00:00
Bill Wendling	701aa053b9	Mark sse_load_f32 and sse_load_f64 as having memory operands (SDNPMemOperand). This way when they're morphed the memory operands will be copied as well. llvm-svn: 107583	2010-07-04 08:59:55 +00:00
Bill Wendling	f844642350	Proper indentation. llvm-svn: 107581	2010-07-04 08:58:43 +00:00
Eli Friedman	c8f595212f	Minor amendment to switch-lowering improvement. llvm-svn: 107569	2010-07-03 08:43:32 +00:00
Eli Friedman	836fdbc85b	Note switch-lowering inefficiency. llvm-svn: 107565	2010-07-03 07:38:12 +00:00
Bruno Cardoso Lopes	ca99012ac0	Add AVX SSE4.1 blend, mpsadbw and vdp llvm-svn: 107560	2010-07-03 01:37:03 +00:00
Bruno Cardoso Lopes	bc75502f09	Add AVX SSE4.1 binop (some forms of packed max,min,mul,pack,cmp) instructions llvm-svn: 107558	2010-07-03 01:15:47 +00:00
Eric Christopher	128a0197bb	Fix typo. llvm-svn: 107556	2010-07-03 01:09:18 +00:00
Bruno Cardoso Lopes	fc9cdc4d61	Add AVX SSE4.1 Horizontal Minimum and Position instruction llvm-svn: 107552	2010-07-03 00:49:21 +00:00
Evan Cheng	0664a67fe1	Remove isSS argument from CreateFixedObject. Fixed objects cannot be spill slots so it's always false. llvm-svn: 107550	2010-07-03 00:40:23 +00:00
Bruno Cardoso Lopes	621c85b038	Add AVX SSE4.1 round instructions llvm-svn: 107549	2010-07-03 00:37:44 +00:00
Jakob Stoklund Olesen	4c82a9e7d0	Detect and handle COPY in many places. This code is transitional, it will soon be possible to eliminate isExtractSubreg, isInsertSubreg, and isMoveInstr in most places. llvm-svn: 107547	2010-07-03 00:04:37 +00:00
Bruno Cardoso Lopes	5b59c1bf1f	Simple refactoring of SSE4.1 instructions, making room for the AVX forms llvm-svn: 107540	2010-07-02 23:27:59 +00:00
Eric Christopher	5e5416056b	80-col fixup. llvm-svn: 107537	2010-07-02 23:17:38 +00:00
Jakob Stoklund Olesen	676a15bdf5	Add a new target independent COPY instruction and code to lower it. The COPY instruction is intended to replace the target specific copy instructions for virtual registers as well as the EXTRACT_SUBREG and INSERT_SUBREG instructions in MachineFunctions. It won't we used in a selection DAG. COPY is lowered to native register copies by LowerSubregs. llvm-svn: 107529	2010-07-02 22:29:50 +00:00
Bruno Cardoso Lopes	c7111fd355	- Add support for the rest of AVX SSE3 instructions - Fix VEX prefix to be emitted with 3 bytes whenever VEX_5M represents a REX equivalent two byte leading opcode llvm-svn: 107523	2010-07-02 22:06:54 +00:00
Jim Grosbach	3c43248560	Custom inserters (e.g., conditional moves in Thumb1 can introduce new basic blocks, and if used as a function argument, that can cause call frame setup / destroy pairs to be split across a basic block boundary. That prevents us from doing a simple assertion to check that the pairs match and alloc/ dealloc the same amount of space. Modify the assertion to only check the amount allocated when there are matching pairs in the same basic block. rdar://8022442 llvm-svn: 107517	2010-07-02 21:23:37 +00:00
Devang Patel	cefe3831b7	MDString is already checked earlier. llvm-svn: 107516	2010-07-02 21:13:23 +00:00
Evan Cheng	c3525dc0fd	Remove early IT block formation. It's not used. llvm-svn: 107513	2010-07-02 21:07:09 +00:00
Evan Cheng	0ce84486c3	- Two-address pass should not assume unfolding is always successful. - X86 unfolding should check if the instructions being unfolded has memoperands. If there is no memoperands, then it must assume conservative alignment. If this would introduce an expensive sse unaligned load / store, then unfoldMemoryOperand etc. should not unfold the instruction. llvm-svn: 107509	2010-07-02 20:36:18 +00:00
Dale Johannesen	4d887f7ca7	Propagate the AlignStack bit in InlineAsm's to the PrologEpilog code, and use it to determine whether the asm forces stack alignment or not. gcc consistently does not do this for GCC-style asms; Apple gcc inconsistently sometimes does it for asm blocks. There is no convenient place to put a bit in either the SDNode or the MachineInstr form, so I've added an extra operand to each; unlovely, but it does allow for expansion for more bits, should we need it. PR 5125. Some existing testcases are affected. The operand lists of the SDNode and MachineInstr forms are indexed with awesome mnemonics, like "2"; I may fix this someday, but not now. I'm not making it any worse. If anyone is inspired I think you can find all the right places from this patch. llvm-svn: 107506	2010-07-02 20:16:09 +00:00
Jakob Stoklund Olesen	df8429aeb4	Remove invalid assert llvm-svn: 107505	2010-07-02 19:54:47 +00:00
Jakob Stoklund Olesen	cf6c5c960f	Properly handle debug values during inline spilling. llvm-svn: 107503	2010-07-02 19:54:40 +00:00
Gabor Greif	9da02a83e9	beautify output llvm-svn: 107500	2010-07-02 19:26:28 +00:00
Gabor Greif	e537ddbdb4	use ArgOperand API llvm-svn: 107498	2010-07-02 19:08:46 +00:00
Dan Gohman	832282e061	Don't claim to preserve AliasAnalysis. First, this is doesn't actually have any effect, and second, deleting stores can potentially invalidate an AliasAnalysis, and there's currently no notification for this. llvm-svn: 107496	2010-07-02 18:43:05 +00:00
Jakob Stoklund Olesen	96037187e5	Rematerialize as much as possible before inserting spills and reloads. This allows us to recognize the common case where all uses could be rematerialized, and no stack slot allocation is necessary. If some values could be fully rematerialized, remove them from the live range before allocating a stack slot for the rest. llvm-svn: 107492	2010-07-02 17:44:57 +00:00
Jim Grosbach	9b7755fbc6	80-column and trailing whitespace cleanup. llvm-svn: 107490	2010-07-02 17:41:59 +00:00
Jim Grosbach	64a4f3f062	grammar tweaks llvm-svn: 107489	2010-07-02 17:38:34 +00:00
Bob Wilson	771d04b969	Fix incorrect asm-printing of some NEON immediates. Fix weak testcase so that it checks the immediate values, not just the instructions opcodes. Radar 8110263. llvm-svn: 107487	2010-07-02 17:23:44 +00:00
Gabor Greif	56de4675b6	use ArgOperand API (found by my previous commit) llvm-svn: 107482	2010-07-02 13:37:16 +00:00
Dan Gohman	ee8d80d6a3	IndirectBr is not safe to speculatively execute (!) llvm-svn: 107454	2010-07-02 00:35:34 +00:00
Dan Gohman	93f5920914	Rename CreateReg to CreateRegs, and MakeReg to CreateReg. llvm-svn: 107451	2010-07-02 00:10:16 +00:00
Bruno Cardoso Lopes	4ca8ddaceb	Shrink down SSE3 code by more multiclass refactoring llvm-svn: 107448	2010-07-01 23:10:49 +00:00
Bill Wendling	504055ce9e	Make the "linker_private" linkage type emit a non-weak symbol to the file. It will still be stripped by the linker when it generates the final image. llvm-svn: 107440	2010-07-01 22:38:24 +00:00
Bruno Cardoso Lopes	0a17241a0d	Shrink down SSE3 code by some multiclass refactoring - 1st part llvm-svn: 107438	2010-07-01 22:33:18 +00:00
Bob Wilson	8a99b730a9	ARM function alignments were off by a power of two. svn 83242 changed getFunctionAlignment and the corresponding use of that value in the ARM asm printer, but now we're using the standard asm printer. The result of this was that function alignments were dropped completely for Thumb functions. Radar 8143571. llvm-svn: 107435	2010-07-01 22:26:26 +00:00
Bill Wendling	03bcd6ecc8	Implement the "linker_private_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. However, this linkage isn't specific to Objective-C. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". Currently only supported on Darwin platforms. llvm-svn: 107433	2010-07-01 21:55:59 +00:00
Devang Patel	429397529a	Do not require line number entry for undefined local variable. This is a regression caused by r106792 and caught by gdb testsuite. llvm-svn: 107430	2010-07-01 21:38:08 +00:00
Daniel Dunbar	0e980755d3	MC: Fix some stray hunks I didn't intend to commit. llvm-svn: 107428	2010-07-01 20:48:51 +00:00
Daniel Dunbar	02877d6e85	MC: Pass the target instance to the AsmParser constructor. llvm-svn: 107426	2010-07-01 20:41:56 +00:00
Daniel Dunbar	0aa4365e47	MC: Fix an error message. llvm-svn: 107424	2010-07-01 20:20:01 +00:00
Dan Gohman	84f90a387d	Remove context sensitivity concerns from interprocedural-basic-aa, and make it more aggressive in cases where both pointers are known to live in the same function. llvm-svn: 107420	2010-07-01 20:08:40 +00:00
Daniel Dunbar	329d202362	MC: Move COFF enumeration constants to llvm/Support/COFF.h, patch by Michael Spencer! llvm-svn: 107418	2010-07-01 20:07:24 +00:00
Devang Patel	2b434e12cd	Debugging infomration is encoded in llvm IR using metadata. This is designed such a way that debug info for symbols preserved even if symbols are optimized away by the optimizer. Add new special pass to remove debug info for such symbols. llvm-svn: 107416	2010-07-01 19:49:20 +00:00
Devang Patel	b9e2e4b762	If a named mdnode is removed then mark module as changed. llvm-svn: 107412	2010-07-01 18:27:46 +00:00
Bruno Cardoso Lopes	5e88700f28	Move SSE3 Move patterns to a more appropriate section Add AVX SSE3 packed horizontal and & sub instructions llvm-svn: 107405	2010-07-01 17:35:02 +00:00
Bruno Cardoso Lopes	886ee33a38	Add AVX SSE3 packed addsub instructions llvm-svn: 107404	2010-07-01 17:08:18 +00:00
Dan Gohman	d2965c10a1	Temporarily disable on-demand fast-isel. llvm-svn: 107393	2010-07-01 12:15:30 +00:00
Gabor Greif	9dc154bcb4	reformulate CallSite::getCallee to adapt to CallInst::ArgOffset, and make it work even if CallInst::op_* are private llvm-svn: 107390	2010-07-01 10:41:37 +00:00
Dan Gohman	42b7ee15f5	Use FuncInfo's isExportedInst accessor method instead of doing the work manually. llvm-svn: 107384	2010-07-01 03:57:05 +00:00
Dan Gohman	85e02e9340	Rename CreateRegForValue to CreateReg, and change its argument from a Value to a Type, because it doesn't actually care about the Value. llvm-svn: 107383	2010-07-01 03:55:39 +00:00
Dan Gohman	4d29fd85f9	Fast isel no longer needs DeadMachineInstrElim to clean up after it. llvm-svn: 107381	2010-07-01 03:49:59 +00:00
Dan Gohman	aef3d140b7	Teach fast-isel to avoid loading a value from memory when it's already available in a register. This is pretty primitive, but it reduces the number of instructions in common testcases by 4%. llvm-svn: 107380	2010-07-01 03:49:38 +00:00
Dan Gohman	722f5fc567	Enable on-demand fast-isel. llvm-svn: 107377	2010-07-01 02:58:57 +00:00
Dan Gohman	207624edb0	Fix X86FastISel's add folding to actually work, and not fall back to SelectionDAG. llvm-svn: 107376	2010-07-01 02:58:21 +00:00
Bruno Cardoso Lopes	a7a0c83563	Add AVX SSE3 replicate and convert instructions llvm-svn: 107375	2010-07-01 02:33:39 +00:00
Dan Gohman	7937d5606d	Teach X86FastISel to fold constant offsets and scaled indices in the same address. llvm-svn: 107373	2010-07-01 02:27:15 +00:00
Dan Gohman	d432223163	Reapply r106422, splitting the code for materializing a value out of SelectionDAGBuilder::getValue into a helper function, with fixes to use DenseMaps safely. llvm-svn: 107371	2010-07-01 01:59:43 +00:00
Dan Gohman	9576645a84	Don't use operator[] here, because it's not desirable to insert a default value if the search fails. llvm-svn: 107368	2010-07-01 01:33:21 +00:00
Bruno Cardoso Lopes	05166740eb	- Add AVX SSE2 Move doubleword and quadword instructions. - Add encode bits for VEX_W - All 128-bit SSE 1 & SSE2 instructions that are described in the .td file now have a AVX encoded form already working. llvm-svn: 107365	2010-07-01 01:20:06 +00:00
Mikhail Glushenkov	22fa66cf2b	80-col violation. llvm-svn: 107361	2010-07-01 01:00:27 +00:00
Mikhail Glushenkov	4721ad855e	Trailing whitespace. llvm-svn: 107360	2010-07-01 01:00:22 +00:00
Jakob Stoklund Olesen	8656a4549a	Add memory operand folding support to InlineSpiller. llvm-svn: 107355	2010-07-01 00:13:04 +00:00
Jakob Stoklund Olesen	bde96ad23e	Add support for rematerialization to InlineSpiller. llvm-svn: 107351	2010-06-30 23:03:52 +00:00
Bill Wendling	e0dfb98ea0	Use the catch-all selectors we already found when converting them to use the correct catch-all value. This saves having to iterate through all of the selectors in the program again. llvm-svn: 107345	2010-06-30 22:49:53 +00:00
Jim Grosbach	e74c78d539	lowerinvoke needs to handle aggregate function args like sjlj eh does. llvm-svn: 107335	2010-06-30 22:22:59 +00:00
Jim Grosbach	e8c97a7cd7	Handle array and vector typed parameters in sjljehprepare like we do structs. rdar://8145832 llvm-svn: 107332	2010-06-30 22:20:38 +00:00
Devang Patel	db735cbbab	Remove all debug info related named mdnodes. llvm-svn: 107323	2010-06-30 21:29:00 +00:00
Jim Grosbach	caf9b3ab7d	grammar tweak in comment. llvm-svn: 107321	2010-06-30 21:27:56 +00:00
Dan Gohman	f638f4ff84	In ScalarEvolution::forgetValue, eliminate any SCEVUnknown entries associated with the value being erased in the folding set map. These entries used to be harmless, because a SCEVUnknown doesn't store any information about its Value*, so having a new Value allocated at the old Value's address wasn't a problem. But now that ScalarEvolution is storing more information about values, this is no longer safe. llvm-svn: 107316	2010-06-30 20:21:12 +00:00
Bruno Cardoso Lopes	d0eacf715f	Move MOVD/MODQ code around, creating sections for each of them llvm-svn: 107308	2010-06-30 18:49:10 +00:00
Jakob Stoklund Olesen	59e1cae377	Some fool committed without testing (or even building) first. llvm-svn: 107307	2010-06-30 18:41:20 +00:00
Bruno Cardoso Lopes	cbcebe2950	Add AVX SSE2 mask creation and conditional store instructions llvm-svn: 107306	2010-06-30 18:38:10 +00:00
Jakob Stoklund Olesen	c39d3497c8	Remember to track spill slot uses in VirtRegMap when inserting loads and stores. LocalRewriter::runOnMachineFunction uses this information to mark dead spill slots. This means that InlineSpiller now also works for functions that spill. llvm-svn: 107302	2010-06-30 18:19:08 +00:00
Bruno Cardoso Lopes	5c768e4915	Fix a bug introduced in r107211 where instructions with memory operands are declared as commutable llvm-svn: 107300	2010-06-30 18:06:01 +00:00
Dan Gohman	c0cca7fdda	Revert the part of r107257 which introduced new logic for using nsw and nuw flags from IR Instructions. On further consideration, this isn't valid. llvm-svn: 107298	2010-06-30 17:27:11 +00:00
Duncan Sands	945a347478	Remove an unused variable. The call to getRoot has side-effects, so this could break something (but doesn't seem to). llvm-svn: 107295	2010-06-30 17:22:28 +00:00
Bruno Cardoso Lopes	d079c91683	Add AVX SSE2 packed integer extract/insert instructions llvm-svn: 107293	2010-06-30 17:03:03 +00:00
Duncan Sands	7b90966d4a	Rather than giving SmallPtrSetImpl a member field SmallArray which is magically replaced by a bigger array in SmallPtrSet (by overridding it), instead just use a pointer to the start of the storage, and have SmallPtrSet pass in the value to use. This has the disadvantage that SmallPtrSet becomes bigger by one pointer. It has the advantage that it no longer uses tricky C++ rules, and is clearly correct while I'm not sure the previous version was. This was inspired by g++-4.6 pointing out that SmallPtrSetImpl was writing off the end of SmallArray, which it was. Since SmallArray is replaced with a bigger array in SmallPtrSet, the write was still to valid memory. But it was writing off the end of the declared array type - sounds kind of dubious to me, like it sounded dubious to g++-4.6. Maybe g++-4.6 is wrong and this construct is perfectly valid and correctly compiled by all compilers, but I think it is better to avoid the whole can of worms by avoiding this construct. llvm-svn: 107285	2010-06-30 15:02:37 +00:00
Gabor Greif	647d9c9797	use ArgOperand API llvm-svn: 107282	2010-06-30 13:45:50 +00:00
Gabor Greif	12ca3d9fac	use ArgOperand API llvm-svn: 107280	2010-06-30 13:03:37 +00:00
Gabor Greif	f69acfe133	use ArgOperand API llvm-svn: 107279	2010-06-30 12:55:46 +00:00
Gabor Greif	74470192d7	use ArgOperand API llvm-svn: 107278	2010-06-30 12:42:43 +00:00
Gabor Greif	d50572802e	use ArgOperand API llvm-svn: 107277	2010-06-30 12:40:35 +00:00
Gabor Greif	3390e746fa	use CallSite::arg_end instead of CallInst::op_end llvm-svn: 107276	2010-06-30 12:39:23 +00:00
Gabor Greif	3abd881bea	use getArgOperand (corrected by CallInst::ArgOffset) instead of getOperand llvm-svn: 107275	2010-06-30 12:38:26 +00:00
Gabor Greif	743b3fd196	use getArgOperand (corrected by CallInst::ArgOffset) instead of getOperand llvm-svn: 107273	2010-06-30 09:19:23 +00:00
Gabor Greif	f628ecd15f	use getNumArgOperands instead of getNumOperands llvm-svn: 107272	2010-06-30 09:17:53 +00:00
Gabor Greif	fe252e6fa0	use getArgOperand instead of getOperand llvm-svn: 107271	2010-06-30 09:16:16 +00:00
Gabor Greif	8ae3095286	use getArgOperand instead of getOperand llvm-svn: 107270	2010-06-30 09:15:28 +00:00
Gabor Greif	e9acc46f65	use getArgOperand instead of getOperand llvm-svn: 107269	2010-06-30 09:14:26 +00:00
Dan Gohman	16206132b6	Improve ScalarEvolution's nsw and nuw preservation. llvm-svn: 107257	2010-06-30 07:16:37 +00:00
Dan Gohman	9396b42ca4	When computing a new ConservativeResult, intersect it with the old one instead of replacing it, to be more precise. llvm-svn: 107256	2010-06-30 06:58:35 +00:00
Dan Gohman	0865966440	Rework scev-aa's basic computation so that it doesn't depend on ScalarEvolution successfully folding and preserving range information for both A-B and B-A. Now, if it gets either one, it's sufficient. llvm-svn: 107249	2010-06-30 06:12:16 +00:00
Dan Gohman	37f145c55b	Simplify. llvm-svn: 107248	2010-06-30 06:09:46 +00:00
Bruno Cardoso Lopes	e82689fea2	Add AVX SSE2 integer unpack instructions llvm-svn: 107246	2010-06-30 04:06:39 +00:00
Bruno Cardoso Lopes	ec0115c9b7	Add AVX SSE2 packed integer shuffle instructions llvm-svn: 107245	2010-06-30 03:47:56 +00:00
John Mosby	5364655e02	Remove trailing whitespace, no functionality changes. llvm-svn: 107244	2010-06-30 03:40:54 +00:00
Bruno Cardoso Lopes	51ceead19c	Small refactoring of SSE2 packed integer shuffle instructions llvm-svn: 107243	2010-06-30 03:29:36 +00:00
Bruno Cardoso Lopes	be792feb8b	Add AVX SSE2 pack with saturation integer instructions llvm-svn: 107241	2010-06-30 02:30:25 +00:00
Bruno Cardoso Lopes	2686ea4555	Add AVX SSE2 integer packed compare instructions llvm-svn: 107240	2010-06-30 02:21:09 +00:00
Bruno Cardoso Lopes	2e2caefff9	- Add AVX form of all SSE2 logical instructions - Add VEX encoding bits to x86 MRM0r-MRM7r llvm-svn: 107238	2010-06-30 01:58:37 +00:00
Devang Patel	c5b3109bec	Do not construct DIE for already processed MDNode. llvm-svn: 107237	2010-06-30 01:40:11 +00:00
Jakob Stoklund Olesen	b3b89c3bc0	Use skipInstruction() as a simpler way of iterating over instructions using SrcReg llvm-svn: 107234	2010-06-30 00:30:36 +00:00
Jakob Stoklund Olesen	08baf59da1	Use clEnumValN macro to work around keyword clash llvm-svn: 107233	2010-06-30 00:24:51 +00:00
Devang Patel	648df7bf64	Add variables into a scope before constructing scope DIE otherwise variables won't be included DIE tree. llvm-svn: 107228	2010-06-30 00:11:08 +00:00
Jakob Stoklund Olesen	f888911932	Begin implementation of an inline spiller. InlineSpiller inserts loads and spills immediately instead of deferring to VirtRegMap. This is possible now because SlotIndexes allows instructions to be inserted and renumbered. This is work in progress, and is mostly a copy of TrivialSpiller so far. It works very well for functions that don't require spilling. llvm-svn: 107227	2010-06-29 23:58:39 +00:00
Bruno Cardoso Lopes	3f71ddfaad	Add several AVX integer packed binop instructions llvm-svn: 107225	2010-06-29 23:47:49 +00:00
Dan Gohman	ae36b1ed42	Fix ScalarEvolution's tripcount computation for chains of loops where each loop's induction variable's start value is the exit value of a preceding loop. llvm-svn: 107224	2010-06-29 23:43:06 +00:00
Bill Wendling	3632171750	Revert r107205 and r107207. llvm-svn: 107215	2010-06-29 22:34:52 +00:00
Devang Patel	be30551600	Print InlinedAt location. llvm-svn: 107214	2010-06-29 22:29:15 +00:00
Eric Christopher	e34471bb31	Add another bswap idiom that isn't matched. llvm-svn: 107213	2010-06-29 22:22:22 +00:00
Bruno Cardoso Lopes	7fee95a38e	Move SSE2 Packed Integer instructions around, and create specific sections for each of them llvm-svn: 107211	2010-06-29 22:12:16 +00:00
Devang Patel	c728518bfe	Print InlinedAt location. llvm-svn: 107208	2010-06-29 21:51:32 +00:00
Bruno Cardoso Lopes	ba21eb8054	Add AVX Move Aligned/Unaligned packed integers llvm-svn: 107206	2010-06-29 21:25:12 +00:00
Bill Wendling	1767723dbe	Introducing the "linker_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". llvm-svn: 107205	2010-06-29 21:24:00 +00:00
Bruno Cardoso Lopes	30689a3a7f	Add AVX ld/st XCSR register. Add VEX encoding bits for MRMXm x86 form llvm-svn: 107204	2010-06-29 20:35:48 +00:00
Devang Patel	24bc1b5b2f	Do not hardcode DW_AT_stmt_list value. Inspired by Artur Pietrek. llvm-svn: 107202	2010-06-29 20:17:53 +00:00
Bob Wilson	be157b0ea8	Add support for encoding VDUP (ARM core register) instructions. llvm-svn: 107201	2010-06-29 20:13:29 +00:00
Jakob Stoklund Olesen	dadea5b178	Fix the handling of partial redefines in the fast register allocator. A partial redefine needs to be treated like a tied operand, and the register must be reloaded while processing use operands. This fixes a bug where partially redefined registers were processed as normal defs with a reload added. The reload could clobber another use operand if it was a kill that allowed register reuse. llvm-svn: 107193	2010-06-29 19:15:30 +00:00
Bob Wilson	d91d5bfc95	Fix a register scavenger crash when dealing with undefined subregs. The LowerSubregs pass needs to preserve implicit def operands attached to EXTRACT_SUBREG instructions when it replaces those instructions with copies. llvm-svn: 107189	2010-06-29 18:42:49 +00:00
Bruno Cardoso Lopes	a4575f5b31	Add AVX non-temporal stores llvm-svn: 107178	2010-06-29 18:22:01 +00:00
Dan Gohman	1be9e7c0b6	Fix whitespace style. llvm-svn: 107175	2010-06-29 18:12:34 +00:00
Bruno Cardoso Lopes	049f4ffab1	Move non-temporal movs to their own section llvm-svn: 107168	2010-06-29 17:42:37 +00:00
Bob Wilson	ab0819e10d	Add support for encoding NEON VMOV (from core register to scalar) instructions. The encoding is the same as VMOV (from scalar to core register) except that the operands are in different places. llvm-svn: 107167	2010-06-29 17:34:07 +00:00
Bruno Cardoso Lopes	21a9433e9e	Add sqrt, rsqrt and rcp AVX instructions llvm-svn: 107166	2010-06-29 17:26:30 +00:00
Jim Grosbach	5bee07ec68	skip dbg_value instructions llvm-svn: 107154	2010-06-29 16:55:24 +00:00
Bob Wilson	83b993a977	The t2MOVi16 and t2MOVTi16 instructions do not set CPSR. Trying to add a CPSR operand to them causes an assertion failure, so apparently these instructions haven't been getting a lot of use. llvm-svn: 107147	2010-06-29 16:25:11 +00:00
Gabor Greif	eab748d409	use ArgOperand API llvm-svn: 107145	2010-06-29 16:17:26 +00:00
Duncan Sands	17f1ca8793	Return Changed. This required setting Changed if dbg metadata is stripped off. Currently set unconditionally, since the API does not provide a way of working out if anything was actually stripped off. llvm-svn: 107142	2010-06-29 14:52:10 +00:00
Duncan Sands	83d1dd637a	It seems clear that this should return Changed. llvm-svn: 107141	2010-06-29 14:49:35 +00:00
Rafael Espindola	38a7d7cbc3	Add a VT argument to getMinimalPhysRegClass and replace the copy related uses of getPhysicalRegisterRegClass with it. If we want to make a copy (or estimate its cost), it is better to use the smallest class as more efficient operations might be possible. llvm-svn: 107140	2010-06-29 14:02:34 +00:00
Duncan Sands	d34bb4e9b0	getMachineBasicBlockAddress returns a uintptr_t - don't truncate to unsigned only to extend back to a pointer sized value on the next line. llvm-svn: 107139	2010-06-29 13:34:20 +00:00
Duncan Sands	a85a90773c	The variable ValueSize is set to 1 on both code paths, and then ignored! Remove it. llvm-svn: 107138	2010-06-29 13:30:08 +00:00
Duncan Sands	1245e4c07d	The variable "Value" is carefully set to Layout.getSymbolAddress, but then not actually used - maybe a bug? Remove the variable. llvm-svn: 107137	2010-06-29 13:26:33 +00:00
Duncan Sands	5667a08468	Remove unused calls to Lexer.getLoc and the pointless variable HasFillExpr. llvm-svn: 107136	2010-06-29 13:24:40 +00:00
Duncan Sands	193bb1ee6a	Remove pointless variable LastDef. llvm-svn: 107135	2010-06-29 13:23:22 +00:00
Duncan Sands	257eba4df7	Remove unused variable Loc and pointless variables unified_syntax and thumb_mode. llvm-svn: 107133	2010-06-29 13:04:35 +00:00
Gabor Greif	e73d64c2cf	use ArgOperand APIs llvm-svn: 107132	2010-06-29 13:03:46 +00:00
Duncan Sands	78ad27ca2b	Remove an unused and a pointless variable. llvm-svn: 107131	2010-06-29 13:00:29 +00:00
Duncan Sands	67bfa9d109	Remove pointless and unused variables. llvm-svn: 107130	2010-06-29 12:48:49 +00:00
Gabor Greif	eec74583ca	encode operand initializations (at fixed index) in terms of Op<> and ArgOffset. This works for values of {0, 1} for ArgOffset. Please note that ArgOffset will become 0 soon and will go away eventually. llvm-svn: 107129	2010-06-29 11:41:38 +00:00
Duncan Sands	67aa21d7b5	Remove a pointless variable. llvm-svn: 107128	2010-06-29 11:39:45 +00:00
Duncan Sands	6d28e73acc	Remove initialized but otherwise unused variables. llvm-svn: 107127	2010-06-29 11:22:26 +00:00
Benjamin Kramer	80b7bc042a	Use a more obvious way to avoid compiling functions which are only used when XDEBUG is enabled. llvm-svn: 107125	2010-06-29 10:03:11 +00:00
Chandler Carruth	b1adb88d05	Jump through some silly hoops to make GCC accept that a function may not always be called. llvm-svn: 107124	2010-06-29 06:46:00 +00:00
Evan Cheng	b59dd8f10a	PR7503: uxtb16 is not available for ARMv7-M. Patch by Brian G. Lucas. llvm-svn: 107122	2010-06-29 05:38:36 +00:00
Evan Cheng	0c30739cbb	Change if-cvt options to something that actually as useable. llvm-svn: 107121	2010-06-29 05:37:59 +00:00
Jim Grosbach	907673c48d	When processing loops for scheduling latencies (used for live outs on loop back-edges), make sure not to include dbg_value instructions in the count. Closing in on the end of rdar://7797940 llvm-svn: 107119	2010-06-29 04:48:13 +00:00
Dan Gohman	90db61d638	Just as its not safe to blindly transfer the nsw bit from an add instruction to an add scev, it's not safe to blindly transfer the inbounds flag from a gep instruction to an nsw on the scev for the gep. llvm-svn: 107117	2010-06-29 01:41:41 +00:00
Bruno Cardoso Lopes	de736a6494	Refactoring of arithmetic instruction classes with unary operator llvm-svn: 107116	2010-06-29 01:33:09 +00:00
Jakob Stoklund Olesen	c1eccbc468	When no memoperands are present, assume unaligned, volatile. llvm-svn: 107114	2010-06-29 01:13:07 +00:00
Bob Wilson	1e5da550e5	Reapply my if-conversion cleanup from svn r106939 with fixes. There are 2 changes relative to the previous version of the patch: 1) For the "simple" if-conversion case, there's no need to worry about RemoveExtraEdges not handling an unanalyzable branch. Predicated terminators are ignored in this context, so RemoveExtraEdges does the right thing. This might break someday if we ever treat indirect branches (BRIND) as predicable, but for now, I just removed this part of the patch, because in the case where we do not add an unconditional branch, we rely on keeping the fall-through edge to CvtBBI (which is empty after this transformation). The change relative to the previous patch is: @@ -1036,10 +1036,6 @@ IterIfcvt = false; } - // RemoveExtraEdges won't work if the block has an unanalyzable branch, - // which is typically the case for IfConvertSimple, so explicitly remove - // CvtBBI as a successor. - BBI.BB->removeSuccessor(CvtBBI->BB); RemoveExtraEdges(BBI); // Update block info. BB can be iteratively if-converted. 2) My patch exposed a bug in the code for merging the tail of a "diamond", which had previously never been exercised. The code was simply checking that the tail had a single predecessor, but there was a case in MultiSource/Benchmarks/VersaBench/dbms where that single predecessor was neither edge of the diamond. I added the following change to check for that: @@ -1276,7 +1276,18 @@ // tail, add a unconditional branch to it. if (TailBB) { BBInfo TailBBI = BBAnalysis[TailBB->getNumber()]; - if (TailBB->pred_size() == 1 && !TailBBI.HasFallThrough) { + bool CanMergeTail = !TailBBI.HasFallThrough; + // There may still be a fall-through edge from BBI1 or BBI2 to TailBB; + // check if there are any other predecessors besides those. + unsigned NumPreds = TailBB->pred_size(); + if (NumPreds > 1) + CanMergeTail = false; + else if (NumPreds == 1 && CanMergeTail) { + MachineBasicBlock::pred_iterator PI = TailBB->pred_begin(); + if (PI != BBI1->BB && PI != BBI2->BB) + CanMergeTail = false; + } + if (CanMergeTail) { MergeBlocks(BBI, TailBBI); TailBBI.IsDone = true; } else { With these fixes, I was able to run all the SingleSource and MultiSource tests successfully. llvm-svn: 107110	2010-06-29 00:55:23 +00:00
Dan Gohman	0824affeff	Add an Intraprocedural form of BasicAliasAnalysis, which aims to properly handles instructions and arguments defined in different functions, or across recursive function iterations. llvm-svn: 107109	2010-06-29 00:50:39 +00:00
Bruno Cardoso Lopes	d6a091a4d4	Described the missing AVX forms of SSE2 convert instructions llvm-svn: 107108	2010-06-29 00:36:02 +00:00
Bob Wilson	3d12ff797b	Fix Thumb encoding of VMOV (scalar to ARM core register). The encoding is the same as ARM except that the condition code field is always set to ARMCC::AL. llvm-svn: 107107	2010-06-29 00:26:13 +00:00
Bob Wilson	269a89fd3a	Unlike other targets, ARM now uses BUILD_VECTORs post-legalization so they can't be changed arbitrarily by the DAGCombiner without checking if it is running after legalization. llvm-svn: 107097	2010-06-28 23:40:25 +00:00
Bob Wilson	4469a892b4	Make the ARMCodeEmitter identify Thumb functions via ARMFunctionInfo instead of the Subtarget. llvm-svn: 107086	2010-06-28 22:23:17 +00:00
Devang Patel	1de21ec498	Use DW_FORM_addr for DW_AT_entry_pc. llvm-svn: 107085	2010-06-28 22:22:47 +00:00
Dale Johannesen	17feb07c53	In asm's, output operands with matching input constraints have to be registers, per gcc documentation. This affects the logic for determining what "g" should lower to. PR 7393. A couple of existing testcases are affected. llvm-svn: 107079	2010-06-28 22:09:45 +00:00
Kevin Enderby	e233dda2e2	Added the darwin .secure_log_unique and .secure_log_reset directives. llvm-svn: 107077	2010-06-28 21:45:58 +00:00
Dan Gohman	e697a6f24f	Constant fold x == undef to undef. llvm-svn: 107074	2010-06-28 21:30:07 +00:00
Jim Grosbach	f31c004666	tidy up style. no functional change. llvm-svn: 107073	2010-06-28 21:29:17 +00:00
Dan Gohman	7c34ece501	Fix Value::stripPointerCasts and BasicAA to avoid trouble on code in unreachable blocks, which have have use-def cycles. This fixes PR7514. llvm-svn: 107071	2010-06-28 21:16:52 +00:00
Bob Wilson	544317dfda	Refactor encoding function for NEON 1-register with modified immediate format. llvm-svn: 107070	2010-06-28 21:16:30 +00:00
Bob Wilson	584387d5e3	Support Thumb mode encoding of NEON instructions. llvm-svn: 107068	2010-06-28 21:12:19 +00:00
Bill Wendling	0a5bb081cc	Reduce indentation via early exit. NFC. llvm-svn: 107067	2010-06-28 21:08:32 +00:00
Devang Patel	d10b2af260	Include inlined function in list of processed subprograms. llvm-svn: 107065	2010-06-28 20:53:04 +00:00
Jim Grosbach	ee6e29aa72	new, no longer brain-dead, r106907 llvm-svn: 107060	2010-06-28 20:26:00 +00:00
Jakob Stoklund Olesen	ffd628ec0a	After physreg coalescing, physical registers might not have live ranges where you would expect. Don't assert on that case, just give up. This fixes PR7513. llvm-svn: 107046	2010-06-28 19:39:57 +00:00
Jakob Stoklund Olesen	0d94d7af78	Add more special treatment for inline asm in RegAllocFast. When an instruction has tied operands and physreg defines, we must take extra care that the tied operands conflict with neither physreg defs nor uses. The special treatment is given to inline asm and instructions with tied operands / early clobbers and physreg defines. This fixes PR7509. llvm-svn: 107043	2010-06-28 18:34:34 +00:00
Eric Christopher	7f103a2653	Fix thinko. llvm-svn: 107042	2010-06-28 18:33:48 +00:00
Eric Christopher	51f2908328	Pull in the libCrashReporterClient.a information with a warning comment. Remove library check and regenerate configure. llvm-svn: 107028	2010-06-28 18:25:51 +00:00
Devang Patel	f3b2db68c6	Preserve deleted function's local variables' debug info. Radar 8122864. llvm-svn: 107027	2010-06-28 18:25:03 +00:00
Gabor Greif	5b1370ee80	use ArgOperand API llvm-svn: 107017	2010-06-28 16:50:57 +00:00
Gabor Greif	e23efeef10	use ArgOperand API llvm-svn: 107016	2010-06-28 16:45:00 +00:00
Gabor Greif	18c5bae727	employ CallInst::ArgOffset (for now) llvm-svn: 107015	2010-06-28 16:43:57 +00:00
Gabor Greif	cd09869dfc	simplify: we have solid argument iterator range llvm-svn: 107014	2010-06-28 16:40:52 +00:00
Dan Gohman	875a296011	Generalize AAEval so that it can be used both per-function and interprocedurally. Note that as of this writing, existing alias analysis passes are not prepared to be used interprocedurally. llvm-svn: 107013	2010-06-28 16:01:37 +00:00
Daniel Dunbar	b8c058cbb0	Revert r106907, "make sure to handle dbg_value instructions in the middle of the block, not...", it caused a bunch of nightly test regressions. llvm-svn: 107009	2010-06-28 15:47:17 +00:00
Gabor Greif	2dd4307e45	use setArgOperand llvm-svn: 107004	2010-06-28 12:31:35 +00:00
Gabor Greif	ec60adf161	use CallInst::ArgOffset llvm-svn: 107003	2010-06-28 12:30:07 +00:00
Gabor Greif	2de43a7c5c	use ArgOperand API and CallInst::ArgOffset llvm-svn: 107002	2010-06-28 12:29:20 +00:00
Gabor Greif	4300fc77ae	use cached value llvm-svn: 107000	2010-06-28 11:20:42 +00:00
Devang Patel	fb6f22f010	Remove dead code. llvm-svn: 106990	2010-06-28 05:59:13 +00:00
Devang Patel	f7869a4b81	Use named MDNode, llvm.dbg.sp, to collect subprogram info. This will be used to emit local variable's debug info of deleted functions. llvm-svn: 106989	2010-06-28 05:53:08 +00:00
Jim Grosbach	7ea5fc0794	minor housekeeping cleanup: 80-column, trailing whitespace, spelling, etc.. No functional change. llvm-svn: 106988	2010-06-28 04:27:01 +00:00
Devang Patel	81170d23de	Do not forget last element, function, while creating Subprogram definition MDNode from subprogram declare MDNode. llvm-svn: 106985	2010-06-27 21:04:31 +00:00
Chris Lattner	25a843fcd2	minor cleanup to SROA: when lowering type unsafe accesses to large integers, the first inserted value would always create an 'or X, 0'. Even though this is trivially zapped by instcombine, don't bother creating this pointless instruction. llvm-svn: 106979	2010-06-27 07:58:26 +00:00
Rafael Espindola	2041abd958	When splitting a VAARG, remember its alignment. This produces terrible but correct code. llvm-svn: 106952	2010-06-26 18:22:20 +00:00
Bob Wilson	418e64a385	Revert my if-conversion cleanup since it caused a bunch of nightly test regressions. --- Reverse-merging r106939 into '.': U test/CodeGen/Thumb2/thumb2-ifcvt3.ll U lib/CodeGen/IfConversion.cpp llvm-svn: 106951	2010-06-26 17:47:06 +00:00
Duncan Sands	3a5cb69cb8	Fix PR7328: when turning a tail recursion into a loop, need to preserve the returned value after the tail call if it differs from other return values. The optimal thing to do would be to introduce a phi node for the return value, but for the moment just fix the miscompile. llvm-svn: 106947	2010-06-26 12:53:31 +00:00
Gabor Greif	7d4038dd88	use ArgOperand API llvm-svn: 106946	2010-06-26 12:17:21 +00:00
Gabor Greif	c2ac8c4261	use ArgOperand API llvm-svn: 106945	2010-06-26 12:09:10 +00:00
Gabor Greif	83205af3fa	use ArgOperand API llvm-svn: 106944	2010-06-26 11:51:52 +00:00
Benjamin Kramer	a000002428	VNInfos don't need to be destructed anymore. llvm-svn: 106943	2010-06-26 11:30:59 +00:00
Gabor Greif	e9afee2910	resort to ArgOperand API llvm-svn: 106942	2010-06-26 09:35:09 +00:00
Eli Friedman	8cfa7713e9	Followup to r106770: actually generate SXTB and SXTH for sign-extensions. llvm-svn: 106940	2010-06-26 04:36:50 +00:00
Bob Wilson	c72da6bb56	Clean up some problems with extra CFG edges being introduced during if-conversion. The RemoveExtraEdges function doesn't work for blocks that end with unanalyzable branches, so in those cases, the "extra" edges must be explicitly removed. The CopyAndPredicateBlock and MergeBlocks methods can also avoid copying successor edges due to branches that have already been removed. The latter case is especially helpful when MergeBlocks is called for handling "diamond" if-conversions, where otherwise you can end up with some weird intermediate states in the CFG. Unfortunately I've been unable to find cases where this cleanup actually makes a significant difference in the code. There is one test where we manage to remove an empty block at the end of a function. Radar 6911268. llvm-svn: 106939	2010-06-26 04:27:33 +00:00
Bob Wilson	0248da9db4	Add support for encoding NEON VMOV (from scalar to core register) instructions. llvm-svn: 106938	2010-06-26 04:07:15 +00:00
Evan Cheng	b71233f34d	It's now possible to run code placement pass for ARM. llvm-svn: 106935	2010-06-26 01:52:05 +00:00
Jakob Stoklund Olesen	d7d0d4e882	When creating X86 MUL8 and DIV8 instructions, make sure we don't produce CopyFromReg nodes for aliasing registers (AX and AL). This confuses the fast register allocator. Instead of CopyFromReg(AL), use ExtractSubReg(CopyFromReg(AX), sub_8bit). This fixes PR7312. llvm-svn: 106934	2010-06-26 00:39:23 +00:00
Bob Wilson	b4d39841e4	Renumber NEON instruction formats to be consecutive. llvm-svn: 106927	2010-06-26 00:05:09 +00:00
Bob Wilson	cc386fb125	Rename ARM instruction formats NEONGetLnFrm, NEONSetLnFrm and NEONDupFrm to "N..." instead of "NEON..." for consistency with the other NEON format names. llvm-svn: 106921	2010-06-25 23:56:05 +00:00
Bruno Cardoso Lopes	74d716b9cd	Add AVX convert CVTSS2SI{rr,rm} and CVTDQ2PS{rr,rm} instructions llvm-svn: 106917	2010-06-25 23:47:23 +00:00
Bob Wilson	d66f66a5cf	Remove unused NEONFrm and ThumbMiscFrm ARM instruction formats. Renumber MiscFrm to 25. llvm-svn: 106916	2010-06-25 23:45:37 +00:00
Bruno Cardoso Lopes	83651094ad	Reapply r106896: Add several AVX MOV flavors Support VEX encoding for MRMDestReg llvm-svn: 106912	2010-06-25 23:33:42 +00:00
Daniel Dunbar	acbdf53db4	Thumb2ITBlockPass: Fix a possible dereference of an invalid iterator. This was introduced in r106343, but only showed up recently (with a particular compiler & linker combination) because of the particular check, and because we have no builtin checking for dereferencing the end of an array, which is truly unfortunate. llvm-svn: 106908	2010-06-25 23:14:54 +00:00
Jim Grosbach	c34befc78f	make sure to handle dbg_value instructions in the middle of the block, not just at the head, when doing diamond if-conversion. rdar://7797940 llvm-svn: 106907	2010-06-25 23:05:46 +00:00
Bruno Cardoso Lopes	4530fed87e	revert this now, it's using avx instead of sse :) llvm-svn: 106906	2010-06-25 23:04:29 +00:00
Jakob Stoklund Olesen	55d738e2e1	Don't track kills in VNInfo. Use interval ends instead. The VNInfo.kills vector was almost unused except for all the code keeping it updated. The few places using it were easily rewritten to check for interval ends instead. The two new methods LiveInterval::killedAt and killedInRange are replacements. This brings us down to 3 independent data structures tracking kills. llvm-svn: 106905	2010-06-25 22:53:05 +00:00
Evan Cheng	02b184de5b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Bob Wilson	2530ca0647	Add support for encoding 3-register NEON instructions, and fix emitNEON2RegInstruction's handling of 2-address operands. llvm-svn: 106900	2010-06-25 22:40:46 +00:00
Dan Gohman	fb9712bdae	In GenerateReassociations, don't bother thinking about individual SCEVUnknown values which are loop-variant, as LSR can't do anything interesting with these values in any case. This fixes very slow compile times on loops which have large numbers of such values. llvm-svn: 106897	2010-06-25 22:32:18 +00:00
Bruno Cardoso Lopes	a34d9b6d84	Add several AVX MOV flavors Support VEX encoding for MRMDestReg llvm-svn: 106896	2010-06-25 22:27:51 +00:00
Devang Patel	5c0f85c7dd	Collect debug info for optimized variables of inlined functions. llvm-svn: 106895	2010-06-25 22:07:34 +00:00
Jim Grosbach	8a6deefec6	80 column and typo fix llvm-svn: 106894	2010-06-25 22:02:28 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Bob Wilson	e70c8b150b	Add support for encoding 2-register NEON instructions. llvm-svn: 106891	2010-06-25 21:17:19 +00:00
Dan Gohman	8de1fe3ccf	pcmpeqd and friends are Commutable. llvm-svn: 106886	2010-06-25 21:05:35 +00:00
Bob Wilson	574f68f815	Fix indentation. llvm-svn: 106881	2010-06-25 20:54:44 +00:00
Bill Wendling	e41e40f689	- Reapply r106066 now that the bzip2 build regression has been fixed. - 2010-06-25-CoalescerSubRegDefDead.ll is the testcase for r106878. llvm-svn: 106880	2010-06-25 20:48:10 +00:00
Bill Wendling	ef7acd9a24	We should remove the live range from the destination register only if all defs are dead, not just the def of this register. I.e., a register could be dead, but it's subreg isn't. Testcase to follow with a subsequent patch. llvm-svn: 106878	2010-06-25 20:42:55 +00:00
Bruno Cardoso Lopes	553fafc6ce	Move the last piece of SSE2 convert instructions to the Convert Instructions section llvm-svn: 106877	2010-06-25 20:29:27 +00:00
Bruno Cardoso Lopes	62d1403a03	More SSE refactoring, this time with different types of MOVs llvm-svn: 106876	2010-06-25 20:22:12 +00:00
Dan Gohman	89dd42af31	Eliminate a redundant FoldingSet lookup. llvm-svn: 106872	2010-06-25 18:47:08 +00:00
Jim Grosbach	ba3ece6f27	IT instructions are considered to be scheduling hazards, but are scheduled with the following instructions. This is done via trickery by considering the instruction preceding the IT to be the hazard. Care must be taken to ensure it's the first non-debug instruction, or the presence of debug info will affect codegen. Part of the continuing work for rdar://7797940, making ARM code-gen unaffected by the presence of debug information. llvm-svn: 106871	2010-06-25 18:43:14 +00:00
Bruno Cardoso Lopes	e76c0b13b9	Refactoring of more SSE conversion instructions. Also add some AVX instrinsics Int_V... placeholders llvm-svn: 106867	2010-06-25 18:06:22 +00:00
Dale Johannesen	2ac3b9cbd4	Cosmetic. llvm-svn: 106865	2010-06-25 17:41:07 +00:00
Benjamin Kramer	58e6c2eded	Rewrite MemoryBuffer::getSTDIN to use read(2) and a SmallVector buffer. llvm-svn: 106856	2010-06-25 16:07:18 +00:00
Duncan Sands	2dc70bea54	Remove variables which are assigned to but for which the value is not used. Spotted by gcc-4.6. llvm-svn: 106854	2010-06-25 14:48:39 +00:00
Benjamin Kramer	948dd57945	Bring back the empty vector workaround I removed in r106839. Looks like MSVC needs it. llvm-svn: 106841	2010-06-25 12:51:01 +00:00
Benjamin Kramer	ce2a92220f	Tweak MemoryBuffer to allocate the class itself, the name and possibly the buffer in the same chunk of memory. 2 less mallocs for every uninitialized MemoryBuffer and 1 less malloc for every MemoryBuffer pointing to a memory range translate into 20% less mallocs on clang -cc1 -Eonly Cocoa_h.m. llvm-svn: 106839	2010-06-25 11:50:40 +00:00
Gabor Greif	b890fc8023	use ArgOperand accessors and CallInst for getting hold of the intrinsic's arguments simplify along the way (at least for me this is much more legible now) Bill, Baldrick or Anton, please review\! llvm-svn: 106838	2010-06-25 11:25:30 +00:00
Gabor Greif	7dd3afdff3	use ArgOperand API (the simple part) llvm-svn: 106837	2010-06-25 09:44:37 +00:00
Gabor Greif	eba0be7dc9	use ArgOperand API llvm-svn: 106836	2010-06-25 09:38:13 +00:00
Gabor Greif	41b81ee2fb	use ArgOperand API llvm-svn: 106835	2010-06-25 09:36:23 +00:00
Gabor Greif	ed9ae7bf21	use ArgOperand API and CallSite to access arguments of CallInst llvm-svn: 106833	2010-06-25 09:03:52 +00:00
Gabor Greif	b5874dea6e	use ArgOperand API and CallSite to access arguments of CallInst llvm-svn: 106829	2010-06-25 08:48:19 +00:00
Gabor Greif	e4eed709d4	use ArgOperand API llvm-svn: 106828	2010-06-25 08:24:59 +00:00
Gabor Greif	f6207e0a80	prune an include llvm-svn: 106827	2010-06-25 08:16:50 +00:00
Gabor Greif	e3ba486c9f	use ArgOperand API (one more hunk I could split) llvm-svn: 106825	2010-06-25 07:58:41 +00:00
Gabor Greif	5f3e656a1b	use ArgOperand API (some hunks I could split) llvm-svn: 106824	2010-06-25 07:57:14 +00:00
Gabor Greif	07e9284c75	use ArgOperand API; tighten type of handleFreeWithNonTrivialDependency to be able to use isFreeCall whithout a cast or new overload llvm-svn: 106823	2010-06-25 07:40:32 +00:00
Bob Wilson	07aead2f8d	Add missing ARM and Thumb data layout info for vector types. Radar 8128745. llvm-svn: 106820	2010-06-25 04:41:08 +00:00
Bob Wilson	eadbf9732f	Reduce indentation. llvm-svn: 106819	2010-06-25 04:12:31 +00:00
Dale Johannesen	e9eaaa91d8	Fix a case where an earlyclobber operand of an asm is reused as an input. PR 4118. Testcase is too big, as usual with bugs in this area, but there's one in the PR. llvm-svn: 106816	2010-06-25 00:49:43 +00:00
Bruno Cardoso Lopes	cbdcce6478	Add some AVX convert instructions llvm-svn: 106815	2010-06-25 00:39:30 +00:00
Jakob Stoklund Olesen	889ab7d158	Make sure all eliminated kills are removed from VNInfo lists. This fixes PR7479 and PR7485. The test cases from those PRs are big, so not included. However, PR7485 comes from self hosting on FreeBSD, so we will surely hear about any regression. llvm-svn: 106811	2010-06-24 23:57:35 +00:00
Dan Gohman	5f0bf64c0c	Add some comments. llvm-svn: 106809	2010-06-24 23:41:59 +00:00
Bruno Cardoso Lopes	447735aa98	Refactoring of SSE convert intrinsics llvm-svn: 106808	2010-06-24 23:37:07 +00:00
Dan Gohman	9a2f0473b2	Teach EmitLiveInCopies to omit copies for unused virtual registers, and to clean up unused incoming physregs from the live-in list. llvm-svn: 106805	2010-06-24 22:23:02 +00:00
Bruno Cardoso Lopes	78827d1952	Refactoring of SSE conversion instructions llvm-svn: 106804	2010-06-24 22:22:21 +00:00
Bruno Cardoso Lopes	6b6b605917	Refactor SSE cmp intrinsics and declare the same for AVX llvm-svn: 106796	2010-06-24 22:04:40 +00:00
Bill Wendling	2d3c490026	It's possible that a flag is added to the SDNode that points back to the original SDNode. This is badness. Also, this function allows one SDNode to point multiple flags to another SDNode. Badness as well. llvm-svn: 106793	2010-06-24 22:00:37 +00:00
Devang Patel	c657c621b7	DBG_VALUE machine instruction pointing to undefined register for a variable justify a separate scope if the variable is inlined function's argument. Radar 8122864. llvm-svn: 106792	2010-06-24 21:51:19 +00:00
Jakob Stoklund Olesen	2b87d44c5d	Don't return a std::vector in the Spiller interface, but take a reference to a vector instead. This avoids needless copying and allocation. Add documentation. llvm-svn: 106788	2010-06-24 20:54:29 +00:00
Bruno Cardoso Lopes	4398fd7b83	- Add AVX COMI{SS,SD}{rr,rm} and UCOMI{SS,SD}{rr,rm}. - Fix a small VEX encoding issue. - Move compare instructions to their appropriate place. llvm-svn: 106787	2010-06-24 20:48:23 +00:00
Jakob Stoklund Olesen	9b659142a6	Remove the now unused LiveIntervals::getVNInfoSourceReg(). This method was always a bit too simplistic for the real world. It didn't really deal with subregisters and such. llvm-svn: 106781	2010-06-24 20:18:15 +00:00
Jakob Stoklund Olesen	487ed997d0	Teach AdjustCopiesBackFrom to also use CoalescerPair to identify compatible copies. llvm-svn: 106780	2010-06-24 20:16:00 +00:00
Dale Johannesen	5ad5226c58	Disallow matching "i" constraint to symbol addresses when address requires a register or secondary load to compute (most PIC modes). This improves "g" constraint handling. 8015842. The test from 2007 is attempting to test the fix for PR1761, but since -relocation-model=static doesn't work on Darwin x86-64, it was not testing what it was supposed to be testing and was passing erroneously. Fixed to use Linux x86-64. llvm-svn: 106779	2010-06-24 20:14:51 +00:00
Jakob Stoklund Olesen	7f894d8fdc	Remove the -fast-spill option. This code path has never really been used, and we are going to be handling spilling through the Spiller interface in the future. llvm-svn: 106777	2010-06-24 19:56:08 +00:00
Evan Cheng	c26e2f4b70	Oops. IT block formation pass needs to be run at any optimization level. llvm-svn: 106775	2010-06-24 19:10:14 +00:00
Bill Wendling	3f0e992af1	Loosen up the requirements in the Horrible Hack(tm) to include all selectors which don't have a catch-all associated with them not just clean-ups. This fixes the SingleSource/Benchmarks/Shootout-C++/except.cpp testcase that broke because of my change r105902. llvm-svn: 106772	2010-06-24 18:49:10 +00:00
Eli Friedman	246c41d93e	Always allow Thumb-2 SXTB, SXTH, UXTB, and UXTH. Fixes PR7324. llvm-svn: 106770	2010-06-24 18:20:04 +00:00
Jakob Stoklund Olesen	45230239e4	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. This second attempt fixes some crashes that only occurred Linux. llvm-svn: 106769	2010-06-24 18:15:01 +00:00
Dan Gohman	4143e9deeb	Add an exports file for the Hello example plugin. llvm-svn: 106768	2010-06-24 17:36:51 +00:00
Jakob Stoklund Olesen	a612d7c012	Print the LSBs of a SlotIndex symbolically using letters referring to the [L]oad, [u]se, [d]ef, or [S]tore slots. This makes it easier to see if two indices refer to the same instruction, avoiding mental mod 4 calculations. llvm-svn: 106766	2010-06-24 17:31:07 +00:00
Dan Gohman	8a84cd57ae	Simplify this code; switch lowering shouldn't produce cases which trivially fold away. llvm-svn: 106765	2010-06-24 17:08:31 +00:00
Dan Gohman	963b1c142e	A few minor micro-optimizations. llvm-svn: 106764	2010-06-24 16:57:52 +00:00
Dan Gohman	47ddf76d89	Teach getExactSDiv to evaluate x/1 to x up front, as it's a common enough special case, and it theoretically allows more folding because it works even when x is unanalyzable. llvm-svn: 106763	2010-06-24 16:51:25 +00:00
Bob Wilson	279e55fb2e	PR7458: Try commuting Thumb2 instruction operands to put them into 2-address form so they can be narrowed to 16-bit instructions. llvm-svn: 106762	2010-06-24 16:50:20 +00:00
Dan Gohman	5235cc2c25	Don't try to preserve pointer types in SCEVConstants; the old code was over-complicated. llvm-svn: 106760	2010-06-24 16:47:03 +00:00
Dan Gohman	ab5422200b	Fix copy+pasto issues in isMulSExtable. llvm-svn: 106759	2010-06-24 16:45:11 +00:00
Dan Gohman	3ace9f4e3d	Make the trunc code consistent with the zext and sext code in its handling of pointer types. llvm-svn: 106757	2010-06-24 16:33:38 +00:00
Dan Gohman	b377e2828d	Add overloads for getFile and getFileOrSTDIN which take a const char * instead of a StringRef, avoiding the need to copy the string in the common case. llvm-svn: 106754	2010-06-24 16:25:50 +00:00
Jakob Stoklund Olesen	3b2b46a700	Be more strict about subreg-to-subreg copies in CoalescerPair. Also keep track of the original DstREg before subregister adjustments. llvm-svn: 106753	2010-06-24 16:19:28 +00:00
Gabor Greif	7ccec09252	use ArgOperand API llvm-svn: 106752	2010-06-24 16:11:44 +00:00
Jakob Stoklund Olesen	53ccab7d1c	Verify that VNI kills are pointing to existing instructions. In this case it is essential that the kill is real because the spiller will decide to omit a spill if it thinks there is a later kill. llvm-svn: 106751	2010-06-24 15:56:59 +00:00
Gabor Greif	a6d75e2cf7	use (even more, still) ArgOperand API llvm-svn: 106750	2010-06-24 15:51:11 +00:00
Dan Gohman	463f26b4be	Eliminate the other half of the BRCOND optimization, and update as many tests as possible. llvm-svn: 106749	2010-06-24 15:24:03 +00:00
Dan Gohman	df6b33e778	Eliminate the first have of the optimization which eliminates BRCOND when the condition is constant. This optimization shouldn't be necessary, because codegen shouldn't be able to find dead control paths that the IR-level optimizer can't find. And it's undesirable, because it encourages bugpoint to leave "br i1 false" branches in its output. And it wasn't updating the CFG. I updated all the tests I could, but some tests are too reduced and I wasn't able to meaningfully preserve them. llvm-svn: 106748	2010-06-24 15:04:11 +00:00
Gabor Greif	218f5541b2	use ArgOperand API and CallSite for arg range; add necessary casts and perform some cosmetics llvm-svn: 106747	2010-06-24 14:42:01 +00:00
Dan Gohman	600f62b3ba	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Gabor Greif	5aafdf1e43	use ArgOperand API and CallSite for arg range llvm-svn: 106745	2010-06-24 14:13:36 +00:00
Gabor Greif	0a136c9b53	use (even more) ArgOperand API llvm-svn: 106744	2010-06-24 13:54:33 +00:00
Gabor Greif	590d95ed18	use ArgOperand API llvm-svn: 106743	2010-06-24 13:42:49 +00:00
Gabor Greif	589a0b950a	use ArgOperand API llvm-svn: 106740	2010-06-24 12:58:35 +00:00
Gabor Greif	7943017490	use ArgOperand API llvm-svn: 106737	2010-06-24 12:35:13 +00:00
Gabor Greif	75f6943c95	use ArgOperand API, also tighten the type of visitFree to make this work out smoothly llvm-svn: 106736	2010-06-24 12:21:15 +00:00
Gabor Greif	91f9589057	use ArgOperand API; introduce downcasted pointers into scope to facilitate this llvm-svn: 106734	2010-06-24 12:03:56 +00:00
Gabor Greif	e2f482ca0b	use ArgOperand API llvm-svn: 106731	2010-06-24 10:42:46 +00:00
Gabor Greif	2d958d4db5	use ArgOperand API llvm-svn: 106730	2010-06-24 10:17:17 +00:00
Gabor Greif	5bcaa55761	use callsite to obtain all arguments llvm-svn: 106729	2010-06-24 10:04:07 +00:00
Gabor Greif	42f620cc55	use callsite to obtain all arguments llvm-svn: 106728	2010-06-24 09:56:43 +00:00
Chris Lattner	8048662539	Teach the x86 mc assembler that %dr6 = %db6, this implements rdar://8013734 llvm-svn: 106725	2010-06-24 07:29:18 +00:00
Chris Lattner	c4e84309c4	more cleanups llvm-svn: 106724	2010-06-24 07:18:14 +00:00
Chris Lattner	056fd06c5f	reduce indentation llvm-svn: 106723	2010-06-24 07:16:25 +00:00
Chris Lattner	cfed96a410	fix breakage from r98938 by correctly marking msp430 calls as variadic. Patch by Ben Ransford! llvm-svn: 106722	2010-06-24 06:46:50 +00:00
Dan Gohman	c3e291c560	Fix a bug in the code which determines when it's safe to use the bt instruction, which was exposed by r106263. llvm-svn: 106718	2010-06-24 02:07:59 +00:00
Eric Christopher	fa6ce139a9	Add a couple more quick comments. llvm-svn: 106717	2010-06-24 02:07:57 +00:00
Dan Gohman	0695e09b09	Optimize the "bit test" code path for switch lowering in the case where the bit mask has exactly one bit. llvm-svn: 106716	2010-06-24 02:06:24 +00:00
Jakob Stoklund Olesen	dbb58d2974	Revert "Replace a big gob of old coalescer logic with the new CoalescerPair class." Whiny buildbots. llvm-svn: 106710	2010-06-24 00:52:22 +00:00
Gabor Greif	0f60709f0e	use getNumArgOperands llvm-svn: 106709	2010-06-24 00:48:48 +00:00
Gabor Greif	4a39b84a9d	use ArgOperand API llvm-svn: 106707	2010-06-24 00:44:01 +00:00
Devang Patel	0dc3c2d37e	Use ValueMap instead of DenseMap. The ValueMapper used by various cloning utility maps MDNodes also. llvm-svn: 106706	2010-06-24 00:33:28 +00:00
Bruno Cardoso Lopes	191a1cd2bb	Add AVX CMP{SS,SD}{rr,rm} instructions and encoding testcases llvm-svn: 106705	2010-06-24 00:32:06 +00:00
Bruno Cardoso Lopes	6af02a6f69	Move SSE and AVX shuffle, unpack and compare code to more appropriate places llvm-svn: 106702	2010-06-24 00:15:50 +00:00
Jakob Stoklund Olesen	f38e6720cc	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. llvm-svn: 106701	2010-06-24 00:12:39 +00:00
Devang Patel	d8dedee96d	Use available typedef for " DenseMap<const Value, Value>". llvm-svn: 106699	2010-06-24 00:00:42 +00:00
Devang Patel	b8f11de105	Cosmetic change. Do not use "ValueMap" as a name for a local variable or an argument. llvm-svn: 106698	2010-06-23 23:55:51 +00:00
Gabor Greif	1abbde3103	use ArgOperand accessors llvm-svn: 106697	2010-06-23 23:38:07 +00:00
Bill Wendling	f470747a36	We are missing opportunites to use ldm. Take code like this: void t(int cp0, int cp1, int dp, int fmd) { int c0, c1, d0, d1, d2, d3; c0 = (cp0++ & 0xffff) \| ((cp1++ << 16) & 0xffff0000); c1 = (cp0++ & 0xffff) \| ((cp1++ << 16) & 0xffff0000); / ... */ } It code gens into something pretty bad. But with this change (analogous to the X86 back-end), it will use ldm and generate few instructions. llvm-svn: 106693	2010-06-23 23:00:16 +00:00
Gabor Greif	253c6bf366	use the new isFreeCall API and ArgOperand accessors llvm-svn: 106692	2010-06-23 22:48:06 +00:00
Gabor Greif	5f5a864539	minor enhancement to llvm::isFreeCall API: return CallInst; no functional change llvm-svn: 106686	2010-06-23 21:51:12 +00:00
Gabor Greif	ad7884ad98	use ArgOperand getters llvm-svn: 106685	2010-06-23 21:41:47 +00:00
Bruno Cardoso Lopes	05220c9a0d	Add AVX MOVMSK{PS,PD}rr instructions llvm-svn: 106683	2010-06-23 21:30:27 +00:00
Bruno Cardoso Lopes	3183dd5692	Add tests for different AVX cmp opcodes, also teach the x86 asm parser to understand the vcmp instruction llvm-svn: 106678	2010-06-23 21:10:57 +00:00
Eric Christopher	5fed9b7c6c	Update according to feedback. llvm-svn: 106677	2010-06-23 20:49:35 +00:00
Bruno Cardoso Lopes	360d6fe299	Add AVX SHUF{PS,PD}{rr,rm} instructions llvm-svn: 106672	2010-06-23 20:07:15 +00:00
Nico Weber	337e8db712	Add support for the x86 instructions "pusha" and "popa". llvm-svn: 106671	2010-06-23 20:00:58 +00:00
Dale Johannesen	d24c66b4a3	Do not do tail calls to external symbols. If the branch turns out to be ARM-to-Thumb or vice versa the linker cannot resolve this. 8120438. If this optimization is going to be useful we probably need a compiler flag "assume callees are same architecture" or something like that. llvm-svn: 106662	2010-06-23 18:52:34 +00:00
Bill Wendling	a136521a17	MorphNodeTo doesn't preserve the memory operands. Because we're morphing a node into the same node, but with different non-memory operands, we need to replace the memory operands after it's finished morphing. llvm-svn: 106643	2010-06-23 18:16:24 +00:00
Daniel Dunbar	4df321b7ad	Revert r106263, "Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass,"... it was causing both 'file' (with clang) and 176.gcc (with llvm-gcc) to be miscompiled. llvm-svn: 106634	2010-06-23 17:09:26 +00:00
Jim Grosbach	6f71039fa4	The generic DAG combiner can now fold atomic fences when needed, so switch to using that. llvm-svn: 106633	2010-06-23 16:25:07 +00:00
Jim Grosbach	a8ea498171	When using libcall expansions for the atomic intrinsics, the explicit MEMBARRIER fences aren't necessary for ARM. Tell the combiner to fold them away. llvm-svn: 106631	2010-06-23 16:08:49 +00:00
Jim Grosbach	b58c08b0ba	Some targets don't require the fencing MEMBARRIER instructions surrounding atomic intrinsics, either because the use locking instructions for the atomics, or because they perform the locking directly. Add support in the DAG combiner to fold away the fences. llvm-svn: 106630	2010-06-23 16:07:42 +00:00
Jakob Stoklund Olesen	731ea71f59	Add a few VNInfo data structure checks. llvm-svn: 106627	2010-06-23 15:34:36 +00:00
Gabor Greif	4d18165f82	use ArgOperand accessors llvm-svn: 106626	2010-06-23 13:56:57 +00:00
Gabor Greif	c9a9251844	use ArgOperand accessors llvm-svn: 106623	2010-06-23 13:09:06 +00:00
Gabor Greif	e54065394e	use helper to neatly access arguments llvm-svn: 106622	2010-06-23 08:45:32 +00:00
Eric Christopher	3d6e2c6335	Update uses, defs, and comments for darwin tls patterns. llvm-svn: 106621	2010-06-23 08:01:49 +00:00
Daniel Dunbar	ef5a4383ad	Revert r106066, "Create a more targeted fix for not sinking instructions into a range where it"... it causes bzip2 to be miscompiled by Clang. Conflicts: lib/CodeGen/MachineSink.cpp llvm-svn: 106614	2010-06-23 00:48:25 +00:00
Eric Christopher	7f85520644	Get the addend correct for i386 pic. Thanks Daniel! llvm-svn: 106608	2010-06-22 23:51:47 +00:00
Dan Gohman	75c6b0bb1f	Replace ScalarEvolution's private copy of getLoopPredecessor with LoopInfo's public copy. llvm-svn: 106603	2010-06-22 23:43:28 +00:00
Bruno Cardoso Lopes	1e13c17a55	Add AVX compare packed instructions llvm-svn: 106600	2010-06-22 23:37:59 +00:00
Devang Patel	9ad629367d	Revert 106592 for now. It causes clang-selfhost build failure. llvm-svn: 106598	2010-06-22 23:29:55 +00:00
Dan Gohman	1081f1a0f5	Fix OptimizeMax to handle an odd case where one of the max operands is another max which folds. This fixes PR7454. llvm-svn: 106594	2010-06-22 23:07:13 +00:00
Bruno Cardoso Lopes	535aa8ea91	Reapply support for AVX unpack and interleave instructions, with testcases this time. llvm-svn: 106593	2010-06-22 23:02:38 +00:00
Devang Patel	87f75f75be	If a metadata operand is seeded in value map and the metadata should also be seeded in value map. This is not limited to function local metadata. Failure to seed metdata in such cases causes troubles when in a cloned module, metadata from a new module refers to values in old module. Usually this results in mysterious bugpoint crashes. For example, Checking to see if we can delete global inits: Unknown constant! UNREACHABLE executed at /d/g/llvm/lib/Bitcode/Writer/BitcodeWriter.cpp:904! llvm-svn: 106592	2010-06-22 22:53:21 +00:00
Devang Patel	e43c6487da	While cloning a module, clone metadata attached with instructions. llvm-svn: 106591	2010-06-22 22:50:42 +00:00
Bruno Cardoso Lopes	1a890f9dc0	Add AVX MOV{SS,SD}{rr,rm} instructions llvm-svn: 106588	2010-06-22 22:38:56 +00:00
Bill Wendling	8ce69cd95a	Fix the formatting of the switch statement and add a missing break. llvm-svn: 106586	2010-06-22 22:16:17 +00:00
Jakob Stoklund Olesen	1023f6bd98	Also convert SUBREG_TO_REG to a KILL when relevant, like the other subreg instructions. This does not affect codegen much because SUBREG_TO_REG is only used by X86 and X86 does not use the register scavenger, but it prevents verifier errors. llvm-svn: 106583	2010-06-22 22:11:07 +00:00
Bob Wilson	c5d712232d	Thumb1 functions using @llvm.returnaddress were not saving the incoming LR. Radar 8031193. llvm-svn: 106582	2010-06-22 22:04:24 +00:00
Eric Christopher	e9c1bb6cb1	Look for and use a different darwin crash reporter library. llvm-svn: 106576	2010-06-22 21:01:04 +00:00
Jim Grosbach	6c275bc5a2	fix typo llvm-svn: 106574	2010-06-22 20:52:02 +00:00
Gabor Greif	c89d2aad4c	use high-level accessors llvm-svn: 106573	2010-06-22 20:40:38 +00:00
Gabor Greif	b575cf69f4	warmup ritual: use high-level argument accessors llvm-svn: 106563	2010-06-22 19:46:37 +00:00
Devang Patel	e3fbbd19ed	Clone named metadata while cloning a module. Reapply Bob's patch. llvm-svn: 106560	2010-06-22 18:52:38 +00:00
Bruno Cardoso Lopes	3af915f84b	Reorganize logical and arithmetic SSE 1 & 2 instructions llvm-svn: 106557	2010-06-22 18:17:40 +00:00
Bruno Cardoso Lopes	b91af24d3e	Reorganize SSE instructions, making easier to see oportunities for refactoring llvm-svn: 106556	2010-06-22 18:09:32 +00:00
Dan Gohman	3570f81b1e	Move PHIElimination's SplitCriticalEdge for MachineBasicBlocks out into a utility routine, teach it how to update MachineLoopInfo, and make use of it in MachineLICM to split critical edges on demand. llvm-svn: 106555	2010-06-22 17:25:57 +00:00
Jakob Stoklund Olesen	9c47dac677	Remove the SimpleJoin optimization from SimpleRegisterCoalescing. Measurements show that it does not speed up coalescing, so there is no reason the keep the added complexity around. Also clean out some unused methods and static functions. llvm-svn: 106548	2010-06-22 16:13:57 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Dan Gohman	2ceaa71bdb	Add an explicit keyword. llvm-svn: 106538	2010-06-22 13:53:29 +00:00
Dan Gohman	f820bd327d	Allow "exhaustive" trip count evaluation on phi nodes with all constant operands. llvm-svn: 106537	2010-06-22 13:15:46 +00:00
Devang Patel	f040dec68a	Revert 106528. It is causing self host failures. llvm-svn: 106529	2010-06-22 06:14:09 +00:00
Devang Patel	b195eb4acf	Do not rely on DenseMap slot which can be easily invalidated when DenseMap grows. llvm-svn: 106528	2010-06-22 05:16:56 +00:00
Bob Wilson	6c1fc79cab	Revert my change to clone named metadata. Buildbots are complaining. --- Reverse-merging r106508 into '.': U lib/Transforms/Utils/CloneModule.cpp llvm-svn: 106521	2010-06-22 02:08:51 +00:00
Dan Gohman	2370e2fe0f	When unfolding a load, avoid assuming which instruction that kill and dead flags will end up on. llvm-svn: 106520	2010-06-22 02:07:21 +00:00
Devang Patel	b6e058da18	Use single interface, using twine, to get named metadata. getNamedMetadata(). llvm-svn: 106518	2010-06-22 01:19:38 +00:00
Evan Cheng	37bb617f8a	Tail merging pass shall not break up IT blocks. rdar://8115404 llvm-svn: 106517	2010-06-22 01:18:16 +00:00
Devang Patel	cbc6fd8493	Discard special LLVM prefix from linkage name. llvm-svn: 106516	2010-06-22 01:06:05 +00:00
Devang Patel	ad51735794	Do not rely on Twine temporaries to survive. llvm-svn: 106515	2010-06-22 01:01:58 +00:00
Chris Lattner	60bb7c42a7	make sure to initialize indent_level llvm-svn: 106513	2010-06-22 00:40:26 +00:00
Dan Gohman	851e478e6b	Fix the new load-unfolding code to update LiveVariable's dead flags, in addition to the kill flags. llvm-svn: 106512	2010-06-22 00:32:04 +00:00
Bob Wilson	5f9575c1cd	Include named metadata when cloning a module. llvm-svn: 106508	2010-06-22 00:11:03 +00:00
Chris Lattner	64960f55fe	add some support for blockaddress. This isn't really enough to be useful, but it will cover uses of blockaddress that are actually in a function. llvm-svn: 106502	2010-06-21 23:19:36 +00:00
Chris Lattner	bb45b964f8	eliminate a mutable global variable, use raw_ostream::indent instead of rolling our own. llvm-svn: 106501	2010-06-21 23:14:47 +00:00
Chris Lattner	a0b8c90870	un-indent a huge amount of code out of an anonymous namespace. llvm-svn: 106500	2010-06-21 23:12:56 +00:00
Bruno Cardoso Lopes	b7dadb0e95	revert r106482 llvm-svn: 106499	2010-06-21 22:59:03 +00:00
Dan Gohman	3c1b3c61e9	Teach two-address lowering how to unfold a load to open up commuting opportunities. For example, this lets it emit this: movq (%rax), %rcx addq %rdx, %rcx instead of this: movq %rdx, %rcx addq (%rax), %rcx in the case where %rdx has subsequent uses. It's the same number of instructions, and usually the same encoding size on x86, but it appears faster, and in general, it may allow better scheduling for the load. llvm-svn: 106493	2010-06-21 22:17:20 +00:00
Bruno Cardoso Lopes	510d9a3404	change parameter name to avoid confusion with global definition llvm-svn: 106486	2010-06-21 21:28:07 +00:00
Bob Wilson	72df24037e	sign_extend_inreg needs to be expanded for pre-v6 Thumb as well as ARM. Radar 8104310. llvm-svn: 106484	2010-06-21 21:27:34 +00:00
Jim Grosbach	523e554afa	LEApcrelJT shouldn't be marked as neverHasSideEffects, as we don't want it being moved around away from the jump table it references. rdar://8104340 llvm-svn: 106483	2010-06-21 21:27:27 +00:00
Bruno Cardoso Lopes	374b2195f6	Add unpack and interleave AVX instructions, encoding tests cooming soon llvm-svn: 106482	2010-06-21 21:21:48 +00:00
Evan Cheng	1fb4de8ec5	Fix PR7421: bug in kill transferring logic. It was ignoring loads / stores which have already been processed. llvm-svn: 106481	2010-06-21 21:21:14 +00:00
Chris Lattner	79d2075e4a	"This is just a cosmetic change in MCAsmStreamer.cpp/EmitSymbolAttribute: all attributes have now a \t before and after, as done for '.type'. This makes the output look consistent, as well as help some third party assemblers expecting the attributes to be in the second column." Patch by Arnaud de Grandmaison! llvm-svn: 106469	2010-06-21 20:35:01 +00:00
Eric Christopher	6dd51a2bb6	Remove isTwoAddress from SystemZ. llvm-svn: 106467	2010-06-21 20:25:57 +00:00
Eric Christopher	d7a7356be6	Remove isTwoAddress from Sparc. llvm-svn: 106466	2010-06-21 20:22:35 +00:00
Eric Christopher	c7927f2013	Remove isTwoAddress from Mips. llvm-svn: 106465	2010-06-21 20:19:21 +00:00
Eric Christopher	fb008dfa05	Remove isTwoAddress from Blackfin. llvm-svn: 106457	2010-06-21 20:13:37 +00:00
Eric Christopher	fa1b54d26e	Remove isTwoAddress from MSP430. llvm-svn: 106455	2010-06-21 20:07:30 +00:00
Dan Gohman	dd41bba517	Use A.append(...) instead of A.insert(A.end(), ...) when A is a SmallVector, and other SmallVector simplifications. llvm-svn: 106452	2010-06-21 19:47:52 +00:00
Eric Christopher	0ca648d758	Make 80-column. llvm-svn: 106448	2010-06-21 18:56:55 +00:00
Eric Christopher	98392f69e3	Remove isTwoAddress from PIC16. llvm-svn: 106447	2010-06-21 18:55:01 +00:00
Eric Christopher	2401271217	Remove isTwoAddress from XCore. llvm-svn: 106446	2010-06-21 18:51:38 +00:00
Eric Christopher	e159407231	Remove isTwoAddress from Alpha. llvm-svn: 106445	2010-06-21 18:48:55 +00:00
Dan Gohman	ffdee30e90	Move several non-performance-critical member functinos out of line. llvm-svn: 106444	2010-06-21 18:46:45 +00:00
Devang Patel	e80de80270	Do not directly use function names to construct new name for named metadata. "llvm.dbg.lv.~A" is not a valid name. llvm-svn: 106438	2010-06-21 18:36:58 +00:00
Bruno Cardoso Lopes	29a894dd64	Move part of SSE 1 & 2 compare, shuffle and unpack instructions closely. Preparing them for refactoring and to the addition of their AVX forms llvm-svn: 106437	2010-06-21 18:36:04 +00:00
Bruno Cardoso Lopes	20de4258f8	Add AVX regular (non-aliased ones) and,or,xor,andn packed instructions. They are already tested in the MC framework, no test needed llvm-svn: 106436	2010-06-21 18:22:54 +00:00
Dale Johannesen	d5c58b76ab	Fix PR 7433. Silly typo in non-Darwin ARM tail call handling, plus correct R9 handling in that mode. llvm-svn: 106434	2010-06-21 18:21:49 +00:00
Eric Christopher	bf572c7cea	Add some codegen patterns for x86_64-linux-gnu tls codegen matching. Based on a patch by Patrick Marlier! llvm-svn: 106433	2010-06-21 18:21:27 +00:00
Jim Grosbach	97c8a6a928	early exit for dbg_value instructions llvm-svn: 106430	2010-06-21 17:49:23 +00:00
Chris Lattner	74b5e3e0ae	remove some dead variables reported by clang++ llvm-svn: 106428	2010-06-21 17:20:18 +00:00
Dan Gohman	bbc29ea821	Revert r106422, which is breaking the non-fast-isel path. llvm-svn: 106423	2010-06-21 16:02:28 +00:00
Dan Gohman	f64fdd69d0	More changes for non-top-down fast-isel. Split the code for materializing a value out of SelectionDAGBuilder::getValue into a helper function, so that it can be used in other ways. Add a new getNonRegisterValue function which uses it, for use in code which doesn't want a CopyFromReg even when FuncMap.ValueMap already has an entry for it. llvm-svn: 106422	2010-06-21 15:13:54 +00:00
Kalle Raiskila	0ab5a02579	Mark the SPU 'lr' instruction to never have side effects. This allows the fast regiser allocator to remove redundant register moves. Update a set of tests that depend on the register allocator to be linear scan. llvm-svn: 106420	2010-06-21 15:08:16 +00:00
Kalle Raiskila	d7f50c118a	Fix the lowering of VECTOR_SHUFFLE on SPU to handle splats. llvm-svn: 106419	2010-06-21 14:42:19 +00:00
Dan Gohman	f91aff5f13	Do one lookup instead of two. llvm-svn: 106415	2010-06-21 14:21:47 +00:00
Dan Gohman	7c58cf75fa	Generalize this to look in the regular ValueMap in addition to the LocalValueMap, to make it more flexible when fast-isel isn't proceding straight top-down. llvm-svn: 106414	2010-06-21 14:17:46 +00:00
Rafael Espindola	1cae86f704	Fix an unintentional commit. I think I typed "git svn dcommit" in the wrong branch. I was trying to do some refactoring on the copyRegToReg, but this is realyl a work in progress and not generally useful yet. llvm-svn: 106413	2010-06-21 13:31:32 +00:00
Kalle Raiskila	6f58190f6f	Fix lowering of VECTOR_SHUFFLE on SPU. Old algorithm used to choke llc with the attached test. llvm-svn: 106411	2010-06-21 10:17:36 +00:00
Rafael Espindola	c596baa56d	wip llvm-svn: 106408	2010-06-21 02:17:34 +00:00
Nick Lewycky	dcc7b6dcb6	Fix warning in no-asserts build. llvm-svn: 106405	2010-06-20 20:27:42 +00:00
Evan Cheng	884a8fe5fa	Fix a crash caused by dereference of MBB.end(). rdar://8110842 llvm-svn: 106399	2010-06-20 00:54:38 +00:00
Dan Gohman	c515ab1eb2	Restore a call to rememberInstruction which was accidentally dropped in refactoring. llvm-svn: 106398	2010-06-19 22:50:35 +00:00
Dan Gohman	32655906e4	Add a TODO comment. llvm-svn: 106397	2010-06-19 21:30:18 +00:00
Dan Gohman	51d00092b6	Include the use kind along with the expression in the key of the use sharing map. The reconcileNewOffset logic already forces a separate use if the kinds differ, so incorporating the kind in the key means we can track more sharing opportunities. More sharing means fewer total uses to track, which means smaller problem sizes, which means the conservative throttles don't kick in as often. llvm-svn: 106396	2010-06-19 21:29:59 +00:00
Dan Gohman	297fb8b9fc	Don't include things in anonymous namespaces that don't need it. llvm-svn: 106395	2010-06-19 21:21:39 +00:00
Benjamin Kramer	bf5c3d42ba	Use calloc instead of new/memset, it is more efficient when the set is very large. llvm-svn: 106390	2010-06-19 17:00:31 +00:00
Dan Gohman	866971ed3d	Fix ScalarEvolution's "exhaustive" trip count evaluation code to avoid assuming that loops are in canonical form, as ScalarEvolution doesn't depend on LoopSimplify itself. Also, with indirectbr not all loops can be simplified. This fixes PR7416. llvm-svn: 106389	2010-06-19 14:17:24 +00:00
Dan Gohman	d277246137	Factor out duplicated code for reusing and inserting casts into a helper function. llvm-svn: 106388	2010-06-19 13:25:23 +00:00
Bob Wilson	4581434c27	Tidy. llvm-svn: 106383	2010-06-19 05:33:57 +00:00
Bob Wilson	6d12973143	Remove a fixme comment that is no longer relevant. llvm-svn: 106382	2010-06-19 05:32:41 +00:00
Bob Wilson	0ae08935f6	Fix error message to match function name. llvm-svn: 106381	2010-06-19 05:32:09 +00:00
Bruno Cardoso Lopes	b86a3abcc7	Refactoring of regular logical packed instructions to prepare for AVX ones. llvm-svn: 106375	2010-06-19 04:09:22 +00:00
Bruno Cardoso Lopes	8737b7d73d	Refactor aliased packed logical instructions, also add AVX AND,OR,XOR,NAND{P}{S,D}{rr,rm} instructions. llvm-svn: 106374	2010-06-19 02:44:01 +00:00
Evan Cheng	7079bf815d	Ignore dbg_value's. llvm-svn: 106373	2010-06-19 02:36:21 +00:00
Bruno Cardoso Lopes	a588049ce9	Move new sse 1 & 2 generic classes to a more appropriate place llvm-svn: 106372	2010-06-19 01:32:46 +00:00
Bruno Cardoso Lopes	2787efd961	Remove unnecessary arguments llvm-svn: 106371	2010-06-19 01:22:34 +00:00
Bruno Cardoso Lopes	00ada89f95	Add AVX packed intrinsics for MIN, MAX llvm-svn: 106370	2010-06-19 01:17:05 +00:00
Evan Cheng	f3c01f3ef6	Disable sibcall optimization for Thumb1 for now since Thumb1RegisterInfo::emitEpilogue is not expecting them. llvm-svn: 106368	2010-06-19 01:01:32 +00:00
Eric Christopher	42105b2976	Finish ripping isTwoAddress out of X86. Some mindless formatting and operand renaming to help. The giant turn the constraints on and selectively turn it off should probably be inverted at some point since it's just largely 50/50. llvm-svn: 106367	2010-06-19 00:37:40 +00:00
Bruno Cardoso Lopes	1e205f6b1c	Shrink down code and add for free AVX {MIN,MAX}P{S,D}{rm,rr} instructions llvm-svn: 106366	2010-06-19 00:37:31 +00:00
Chris Lattner	c60cecd88b	rip out dead code. llvm-svn: 106365	2010-06-19 00:34:14 +00:00
Chris Lattner	e808a78ac1	fix rdar://7873482 by teaching the instruction encoder to emit segment prefixes. Daniel wrote most of this patch. llvm-svn: 106364	2010-06-19 00:34:00 +00:00
Evan Cheng	e5fcd333da	Indentation and remove dead code. llvm-svn: 106362	2010-06-19 00:11:54 +00:00
Bruno Cardoso Lopes	1888f11887	Clean up: remove now unnecessary Constraints llvm-svn: 106361	2010-06-19 00:09:27 +00:00
Dan Gohman	5fc43eb186	Silence compiler warnings. llvm-svn: 106360	2010-06-19 00:02:06 +00:00
Bruno Cardoso Lopes	502c4fe61c	more refactoring! yay! big win over the intrinsics llvm-svn: 106359	2010-06-19 00:00:22 +00:00
Eric Christopher	6bdbdb5544	Remove isTwoAddress from here too. llvm-svn: 106358	2010-06-18 23:56:07 +00:00
Bruno Cardoso Lopes	66d2d57d9b	Fix typo, SSE1 should be used by XS, not SSE2 llvm-svn: 106357	2010-06-18 23:53:27 +00:00
Eric Christopher	3577c1b811	Remove isTwoAddress from 64-bit files. llvm-svn: 106356	2010-06-18 23:51:21 +00:00
Evan Cheng	119824ed4d	Move ARM if-conversion before post-ra scheduling. llvm-svn: 106355	2010-06-18 23:32:07 +00:00
Dan Gohman	8693650422	Teach regular and fast isel to set dead flags on unused implicit defs on calls and similar instructions. llvm-svn: 106353	2010-06-18 23:28:01 +00:00
Bruno Cardoso Lopes	2bfad417a1	Apply some refactor to packed instructions llvm-svn: 106349	2010-06-18 23:13:35 +00:00
Evan Cheng	4f0781c9b3	Update cmake list. llvm-svn: 106348	2010-06-18 23:12:10 +00:00
Evan Cheng	285935939d	Thumb2 hazard recognizer. llvm-svn: 106347	2010-06-18 23:11:35 +00:00
Jakob Stoklund Olesen	678927e0b1	Only run CoalesceExtSubRegs when we can expect LiveIntervalAnalysis to clean up the inserted INSERT_SUBREGs after us. llvm-svn: 106345	2010-06-18 23:10:20 +00:00
Evan Cheng	2d51c7c592	Allow ARM if-converter to be run after post allocation scheduling. - This fixed a number of bugs in if-converter, tail merging, and post-allocation scheduler. If-converter now runs branch folding / tail merging first to maximize if-conversion opportunities. - Also changed the t2IT instruction slightly. It now defines the ITSTATE register which is read by instructions in the IT block. - Added Thumb2 specific hazard recognizer to ensure the scheduler doesn't change the instruction ordering in the IT block (since IT mask has been finalized). It also ensures no other instructions can be scheduled between instructions in the IT block. This is not yet enabled. llvm-svn: 106344	2010-06-18 23:09:54 +00:00
Jim Grosbach	a57c2885cf	back-end libcall handling for ATOMIC_SWAP (__sync_lock_test_and_set) llvm-svn: 106342	2010-06-18 23:03:10 +00:00
Jim Grosbach	6860bb7796	Enable Expand handling of atomics for subtargets that can't do them inline. llvm-svn: 106336	2010-06-18 22:35:32 +00:00
Jakob Stoklund Olesen	07f4fa8198	TwoAddressInstructionPass::CoalesceExtSubRegs can insert INSERT_SUBREG instructions, but it doesn't really understand live ranges, so the first INSERT_SUBREG uses an implicitly defined register. Fix it in LiveVariableAnalysis by adding the <undef> flag. llvm-svn: 106333	2010-06-18 22:29:44 +00:00
Evan Cheng	cf9e8a987f	Fix an inverted condition. llvm-svn: 106330	2010-06-18 22:17:13 +00:00
Bruno Cardoso Lopes	871439abd2	Use the new 'defm' class inheritance in SSE llvm-svn: 106327	2010-06-18 22:10:11 +00:00
Evan Cheng	f5d62535a5	Fix cross initialization compilation error. llvm-svn: 106324	2010-06-18 22:01:37 +00:00
Evan Cheng	c0e0d85b18	Teach iff-converter to properly count # of dups. It was not skipping over dbg_value's which resulted in non-duplicated instructions being deleted. rdar://8104384. llvm-svn: 106323	2010-06-18 21:52:57 +00:00
Jim Grosbach	d64dfc1568	Add Expand-to-libcall support for additional atomics. This covers the usual entries used by llvm-gcc. *_[U]MIN and such can be added later if needed. This enables the front ends to simplify handling of the atomic intrinsics by removing the target-specific decision about which targets can handle the intrinsics. llvm-svn: 106321	2010-06-18 21:43:38 +00:00
Bob Wilson	a92e41a50a	Rewrite chained if's as switches and replace assertions with llvm_unreachable (as suggested in radar 8104405). llvm-svn: 106318	2010-06-18 21:32:42 +00:00
Dale Johannesen	589ffb4902	Fix ARM/Thumb reversal in previous attempt. llvm-svn: 106314	2010-06-18 21:07:47 +00:00
Jakob Stoklund Olesen	22a212f97c	When using ADDri to get the address of a stack object, 255 is a conservative limit on the offset that can be materialized without using the register scavenger. llvm-svn: 106312	2010-06-18 20:59:25 +00:00
Dan Gohman	a46d607545	Make this comment less specific. llvm-svn: 106311	2010-06-18 20:45:41 +00:00
Dan Gohman	af4903d6ee	Fix X86FastISel's address-mode folding to stay within the original basic block. This avoids trouble with examining instructions in other basic blocks which haven't been assigned registers yet. llvm-svn: 106310	2010-06-18 20:44:47 +00:00
Dale Johannesen	a06c2f79fc	An attempt to fix the problem Anton reported with ARM tail calls. Don't know if it works, but it doesn't break Darwin. llvm-svn: 106309	2010-06-18 20:44:28 +00:00
Dan Gohman	24ceda8eb0	Revert r106304 (105548 and friends), which are the SCEVComplexityCompare optimizations. There is still some nondeterminism remaining. llvm-svn: 106306	2010-06-18 19:54:20 +00:00
Dan Gohman	4c807fca97	Reapply 105540, 105542, and 105548, and revert r105732. llvm-svn: 106304	2010-06-18 19:26:04 +00:00
Dan Gohman	45073042eb	Reapply 105546. llvm-svn: 106302	2010-06-18 19:12:32 +00:00
Dan Gohman	9136d9fbf8	Reapply 105544. llvm-svn: 106301	2010-06-18 19:09:27 +00:00
Dale Johannesen	c1570dda5c	Enable tail calls on ARM by default, with some basic tests. This has been well tested on Darwin but not elsewhere. It should work provided the linker correctly resolves B.W <label in other function> which it has not seen before, at least from llvm-based compilers. I'm leaving the arm-tail-calls switch in until I see if there's any problems because of that; it might need to be disabled for some environments. llvm-svn: 106299	2010-06-18 19:00:18 +00:00
Dan Gohman	e5457c275d	Don't leak RegClass2VRegMap, which is now a new[] array instead of a std::vector. llvm-svn: 106298	2010-06-18 18:54:05 +00:00
Dan Gohman	882bb2984e	Start TargetRegisterClass indices at 0 instead of 1, so that MachineRegisterInfo doesn't have to confusingly allocate an extra entry. llvm-svn: 106296	2010-06-18 18:13:55 +00:00
Dale Johannesen	3ac52b3e43	Last round of changes for ARM tail calls. Not turning them on yet. llvm-svn: 106295	2010-06-18 18:13:11 +00:00
Bob Wilson	f82c8fcc58	Fix PR7372: Conditional branches (at least on ARM) are treated as predicated, so when IfConverter::CopyAndPredicateBlock checks to see if it should ignore an instruction because it is a branch, it should not check if the branch is predicated. This case (when IgnoreBr is true) is only relevant from IfConvertTriangle, where new branches are inserted after the block has been copied and predicated. If the original branch is not removed, we end up with multiple conditional branches (possibly conflicting) at the end of the block. Aside from any immediate errors resulting from that, this confuses the AnalyzeBranch functions so that the branches are not analyzable. That in turn causes the IfConverter to think that the "Simple" pattern can be applied, and things go downhill fast because the "Simple" pattern does _not_ apply if the block can fall through. This is pretty fragile. If there are other degenerate cases where AnalyzeBranch fails, but where the block may still fall through, the IfConverter should not perform its "Simple" if-conversion. But, I don't know how to do that with the current AnalyzeBranch interface, so for now, the best thing seems to be to avoid creating branches that AnalyzeBranch cannot handle. Evan, please review! llvm-svn: 106291	2010-06-18 17:07:23 +00:00
Jakob Stoklund Olesen	b9f91667e1	Treat the ARM inline asm {cc} constraint as a physreg (%CPSR), just like X86 does for {flags}. If we create virtual registers of the CCR class, RegAllocFast may try to spill them, and we can't do that. llvm-svn: 106289	2010-06-18 16:49:33 +00:00
Dan Gohman	9f58b3e106	Don't bother calling releaseMemory before destroying the DominatorTreeBase. llvm-svn: 106287	2010-06-18 16:09:11 +00:00
Dan Gohman	7edb39cc6b	Minor code simplifications. llvm-svn: 106286	2010-06-18 16:00:29 +00:00
Dan Gohman	6e681a5fbe	Give NamedRegionTimer an Enabled flag, allowing all its clients to switch from this: if (TimePassesIsEnabled) { NamedRegionTimer T(Name, GroupName); do_something(); } else { do_something(); // duplicate the code, this time without a timer! } to this: { NamedRegionTimer T(Name, GroupName, TimePassesIsEnabled); do_something(); } llvm-svn: 106285	2010-06-18 15:56:31 +00:00
Dan Gohman	96ca25eba5	Don't replace the old Ordering object with a new one; just clear() the old one. llvm-svn: 106284	2010-06-18 15:40:58 +00:00
Dan Gohman	a4f46b3ef8	Don't call clear() on DbgInfo when it's going to be deleted anyway. Don't replace the old DbgInfo with a new one when clear() on the old one is sufficient. llvm-svn: 106283	2010-06-18 15:36:18 +00:00
Dan Gohman	92c11acdb8	Change UpdateNodeOperands' operand and return value from SDValue to SDNode *, since it doesn't care about the ResNo value. llvm-svn: 106282	2010-06-18 15:30:29 +00:00
Dan Gohman	3d8a9d7490	Remove getIntegerSCEV; it's redundant with getConstant, and getConstant is more consistent with the ConstantInt API. llvm-svn: 106281	2010-06-18 14:33:50 +00:00
Dan Gohman	c3479f5342	Delete unused variables. llvm-svn: 106280	2010-06-18 14:32:32 +00:00
Dan Gohman	f1d8304fe3	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Dan Gohman	35b6f9a929	isValueValidForType can be a static member function. llvm-svn: 106278	2010-06-18 14:01:07 +00:00
Eric Christopher	67d25f91c5	Some assorted isTwoAddress -> Constraints cleanup. llvm-svn: 106273	2010-06-18 02:41:19 +00:00
Dan Gohman	c61056a421	Handle execution entrypoints with non-integer return types. Fix from Russel Power in PR7284. llvm-svn: 106271	2010-06-18 02:01:10 +00:00
Dan Gohman	f3aea7aecf	Disable indvars on loops when LoopSimplify form is not available. This fixes PR7333. llvm-svn: 106267	2010-06-18 01:35:11 +00:00
Dan Gohman	99ba4dac59	Don't maintain a set of deleted nodes; instead, use a HandleSDNode to track a node over CSE events. This fixes PR7368. llvm-svn: 106266	2010-06-18 01:24:29 +00:00
Bruno Cardoso Lopes	2323168705	Add {mix,max}{ss,sd}{rr,rm} AVX forms. llvm-svn: 106264	2010-06-18 01:12:56 +00:00
Dan Gohman	b92156d5e4	Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass, which is faster, simpler, and less surprising. llvm-svn: 106263	2010-06-18 01:05:21 +00:00
Dan Gohman	8ba26b48bb	Fix a typo in a comment. llvm-svn: 106260	2010-06-18 00:53:08 +00:00
Dan Gohman	0883789ec4	Handle ext(ext(x)) -> ext(x) immediately, since it's simple. llvm-svn: 106256	2010-06-18 00:08:30 +00:00
Dan Gohman	8f5954f42c	Simplify this code. llvm-svn: 106254	2010-06-17 23:34:09 +00:00
Bruno Cardoso Lopes	6b98f7129f	Use new tablegen resources in SSE tablegen code. This will be done incrementally and intermixed with the adding of more AVX instructions. This is a first step in that direction llvm-svn: 106251	2010-06-17 23:05:30 +00:00
Stuart Hastings	0125b6410a	Add a DebugLoc parameter to TargetInstrInfo::InsertBranch(). This addresses a longstanding deficiency noted in many FIXMEs scattered across all the targets. This effectively moves the problem up one level, replacing eleven FIXMEs in the targets with eight FIXMEs in CodeGen, plus one path through FastISel where we actually supply a DebugLoc, fixing Radar 7421831. llvm-svn: 106243	2010-06-17 22:43:56 +00:00
Jim Grosbach	0ed5b460dc	add missing break. inconsequential as the code shouldn't be reached, but for correctness' sake, it should be there. llvm-svn: 106229	2010-06-17 17:58:54 +00:00
Jim Grosbach	3aeae8aeeb	Add entries for Expanding atomic intrinsics to libcalls. Just a placeholder for the moment. The implementation of the libcall will follow. Currently, the llvm-gcc knows when the intrinsics can be correctly handled by the back end and only generates them in those cases, issuing libcalls directly otherwise. That's too much coupling. The intrinsics should always be generated and the back end decide how to handle them, be it with a libcall, inline code, or whatever. This patch is a step in that direction. rdar://8097623 llvm-svn: 106227	2010-06-17 17:50:54 +00:00
Jim Grosbach	5712c77c89	Thumb1 and any pre-v6 ARM target should use the libcall expansion of ISD::MEMBARRIER. v7 and v7 ARM mode continue to use the custom lowering. llvm-svn: 106204	2010-06-17 02:02:03 +00:00
Jim Grosbach	ba451e80dc	ISD::MEMBARRIER should lower to a libcall (__sync_synchronize) if the target sets the legalize action to Expand. llvm-svn: 106203	2010-06-17 02:00:53 +00:00
Jim Grosbach	6e758c97fd	simplify code a bit and add a more explanatory assert for cases that previously would result in 'cannot yet select' errors. llvm-svn: 106199	2010-06-17 01:37:00 +00:00
Jason Molenda	dd6a4cabf6	Add the entire range of DW_OP_lit[0..31], DW_OP_reg[0..31], and DW_OP_breg[0..31] to Dwarf.h. Add "DW_" prefix to the llvm::dwarf::*String methods which did not already have them in Dwarf.cpp. llvm-svn: 106197	2010-06-17 01:23:24 +00:00
Eric Christopher	29b58afdf1	Hack to let the move lowering handle dynamic-no-pic absolute moves of TLVP: movl _a@TLVP, %eax Daniel: Please review if you get a chance. llvm-svn: 106194	2010-06-17 00:51:48 +00:00
Eric Christopher	93f16372f9	Update comment. llvm-svn: 106191	2010-06-17 00:49:46 +00:00
Jim Grosbach	e3864cc15e	format and 80-column cleanup llvm-svn: 106173	2010-06-16 23:45:49 +00:00
Jim Grosbach	e94f1ded24	remove trailing whitespace llvm-svn: 106164	2010-06-16 22:41:09 +00:00
Jakob Stoklund Olesen	2334144e6e	Don't attempt preserving conservative kill flags. We were doing it wrong. This is before LiveVariables anyway, where these kill flags are recalculated. llvm-svn: 106157	2010-06-16 22:11:08 +00:00
Bob Wilson	01ac8f9fc0	Remove the hidden "neon-reg-sequence" option. The reg sequences are working now, so there's no need to disable them. llvm-svn: 106155	2010-06-16 21:34:01 +00:00
Eric Christopher	74892d4f1f	In progress on 32-bit addends. llvm-svn: 106154	2010-06-16 21:32:38 +00:00
Jakob Stoklund Olesen	207cd4bbd7	Allow a register to be redefined multiple times in a basic block. LiveVariableAnalysis was a bit picky about a register only being redefined once, but that really isn't necessary. Here is an example of chained INSERT_SUBREGs that we can handle now: 68 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1028<kill>, 14 register: %reg1040 +[70,134:0) 76 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1029<kill>, 13 register: %reg1040 replace range with [70,78:1) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,134:0) 0@78-(134) 1@70-(78) 84 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1030<kill>, 12 register: %reg1040 replace range with [78,86:2) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,134:0) 0@86-(134) 1@70-(78) 2@78-(86) 92 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1031<kill>, 11 register: %reg1040 replace range with [86,94:3) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,94:3)[94,134:0) 0@94-(134) 1@70-(78) 2@78-(86) 3@86-(94) rdar://problem/8096390 llvm-svn: 106152	2010-06-16 21:29:40 +00:00
Jim Grosbach	fd3b4e7390	A few more places where SCEVExpander bits need to skip over debug intrinsics when iterating through instructions. Yet more work for rdar://7797940 llvm-svn: 106149	2010-06-16 21:13:38 +00:00
Daniel Dunbar	ede8e6d2f0	MC/Mach-O: Rewrite atom association to be a final pass we do in Finish(), instead of tracking as part of emission. - This allows sharing more code with the MCObjectStreamer. llvm-svn: 106143	2010-06-16 20:04:32 +00:00
Daniel Dunbar	aa627c39e4	MC: Simplify MCAssembler::isSymbolLinkerVisible to only take an MCSymbol. llvm-svn: 106142	2010-06-16 20:04:29 +00:00
Daniel Dunbar	b2347fe504	MC: Lift SwitchSection() and Finish() into MCObjectStreamer. llvm-svn: 106141	2010-06-16 20:04:25 +00:00
Daniel Dunbar	8a3c9d9bc4	MC: Factor out an MCObjectStreamer class, which will be shared by the concrete object file format writers. llvm-svn: 106140	2010-06-16 20:04:22 +00:00
Rafael Espindola	a20e2dfe86	Make sure that simplify libcalls does not replace a call with one calling convention with a new call with a different calling convention. llvm-svn: 106134	2010-06-16 19:34:01 +00:00
Jim Grosbach	6c0da25129	add FIXME llvm-svn: 106126	2010-06-16 18:45:08 +00:00
Bill Wendling	d71bd63600	Improve comment to include that the use of a preg is also verboten in this situation. llvm-svn: 106119	2010-06-16 18:01:31 +00:00
Benjamin Kramer	41476410c9	TODO-- llvm-svn: 106102	2010-06-16 15:47:00 +00:00
Benjamin Kramer	a13bd20396	simplify-libcalls: fold strncmp(x, y, 1) -> memcmp(x, y, 1) The memcmp will be optimized further and even the pathological case 'strstr(x, "x") == x' generates optimal code now. llvm-svn: 106097	2010-06-16 10:30:29 +00:00
Evan Cheng	f128bdcb55	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Devang Patel	d119da54de	Check function pointer first, before comparing function names. llvm-svn: 106088	2010-06-16 06:42:02 +00:00
Devang Patel	a6d20f446f	Use separate named MDNode to hold each function's local variable info. This speeds up local variable handling in DwarfDebug. llvm-svn: 106075	2010-06-16 00:53:55 +00:00
Eric Christopher	b672ab9b53	Don't emit the linkage for initializer label for mach-o tls. llvm-svn: 106073	2010-06-16 00:27:30 +00:00
Eric Christopher	2092dc2acd	Fix indentation. llvm-svn: 106072	2010-06-16 00:26:36 +00:00
Bill Wendling	8c0cf0994d	Create a more targeted fix for not sinking instructions into a range where it will conflict with another live range. The place which creates this scenerio is the code in X86 that lowers a select instruction by splitting the MBBs. This eliminates the need to check from the bottom up in an MBB for live pregs. llvm-svn: 106066	2010-06-15 23:46:31 +00:00
Eric Christopher	6c4d63e1a5	For 32-bit non-pic tlv mach-o addressing we don't need a pic base or a relative address. llvm-svn: 106064	2010-06-15 23:08:42 +00:00
Stuart Hastings	9b5005cd4b	Added a comment. llvm-svn: 106063	2010-06-15 23:06:30 +00:00
Eric Christopher	a86c2bdd2c	Some more work on mach-o TLV relocations. llvm-svn: 106062	2010-06-15 22:59:05 +00:00
Dale Johannesen	438c35b5d1	Add file missing from previous commit. llvm-svn: 106058	2010-06-15 22:24:08 +00:00
Bob Wilson	8105144fcd	Fix 80col violations, remove trailing whitespace, and clarify a comment. llvm-svn: 106057	2010-06-15 22:18:54 +00:00
Dale Johannesen	44f9dfc9cf	Next round of tail call changes. Register used in a tail call must not be callee-saved; following x86, add a new regclass to represent this. Also fixes a couple of bugs. Still disabled by default; Thumb doesn't work yet. llvm-svn: 106053	2010-06-15 22:08:33 +00:00
Jakob Stoklund Olesen	ec2e964fd6	Remove the local register allocator. Please use the fast allocator instead. llvm-svn: 106051	2010-06-15 21:58:33 +00:00
Dale Johannesen	89456b2612	Reapply 105986 with fix for bug pointed out by Jakob: flag argument to addReg is not the same format as flags attached to MachineOperand, although both have the same info. I don't think this actually mattered; the bootstrap failure did not reproduce on the next run anyway. llvm-svn: 106049	2010-06-15 21:36:43 +00:00
Benjamin Kramer	1118860e3a	simplify-libcalls: fold strstr(a, b) == a -> strncmp(a, b, strlen(b)) == 0 llvm-svn: 106047	2010-06-15 21:34:25 +00:00
Mon P Wang	7a84689cc5	Fixed vector widening of binary instructions that can trap. Patch by Visa Putkinen! llvm-svn: 106038	2010-06-15 20:29:05 +00:00
Daniel Dunbar	e22295e8a6	fpcmp: Fix bug where fpcmp wouldn't early exit when files obviously differ and no tolerance is set. llvm-svn: 106033	2010-06-15 19:20:30 +00:00
Daniel Dunbar	b645fa13a9	fpcmp: Fix a possible infinite loop when comparing something like: 1..19 ok to 1..20 o k (yes, the odd space is necessary). llvm-svn: 106032	2010-06-15 19:20:28 +00:00
Chris Lattner	874c92bd47	fix fastisel to handle GS and FS relative pointers. Patch by Nelson Elhage! llvm-svn: 106031	2010-06-15 19:08:40 +00:00
Bob Wilson	f3f7a770b7	Add basic support for NEON modified immediates besides VMOV. llvm-svn: 106030	2010-06-15 19:05:35 +00:00
Bob Wilson	fc7d739422	IfConversion's AnalyzeBlocks method always returns false; clean it up. llvm-svn: 106027	2010-06-15 18:57:15 +00:00
Jim Grosbach	c964585ff8	fix naming llvm-svn: 106024	2010-06-15 18:53:34 +00:00
Jakob Stoklund Olesen	6e54c908e0	Fix an exotic bug that only showed up in an internal test case. SimpleRegisterCoalescing::JoinIntervals() uses CoalescerPair to determine if a copy is coalescable, and in very rare cases it can return true where LHS is not live - the coalescable copy can come from an alias of the physreg in LHS. llvm-svn: 106021	2010-06-15 18:49:14 +00:00
Bob Wilson	5947573f39	Fix a comment typo. llvm-svn: 106015	2010-06-15 18:19:27 +00:00
Bob Wilson	de94e66234	Add some missing checks for the case where the extract_subregs are combined to an insert_subreg, i.e., where the destination register is larger than the source. We need to check that the subregs can be composed for that case in a symmetrical way to the case when the destination is smaller. llvm-svn: 106004	2010-06-15 17:27:54 +00:00
Jakob Stoklund Olesen	246e9a07a2	Avoid processing early clobbers twice in RegAllocFast. Early clobbers defining a virtual register were first alocated to a physreg and then processed as a physreg EC, spilling the virtreg. This fixes PR7382. llvm-svn: 105998	2010-06-15 16:20:57 +00:00
Jakob Stoklund Olesen	82eca35b3e	Add CoalescerPair helper class. Given a copy instruction, CoalescerPair can determine which registers to coalesce in order to eliminate the copy. It deals with all the subreg fun to determine a tuple (DstReg, SrcReg, SubIdx) such that: - SrcReg is a virtual register that will disappear after coalescing. - DstReg is a virtual or physical register whose live range will be extended. - SubIdx is 0 when DstReg is a physical register. - SrcReg can be joined with DstReg:SubIdx. CoalescerPair::isCoalescable() determines if another copy instruction is compatible with the same tuple. This fixes some NEON miscompilations where shuffles are getting coalesced as if they were copies. The CoalescerPair class will replace a lot of the spaghetti logic in JoinCopy later. llvm-svn: 105997	2010-06-15 16:04:21 +00:00
Daniel Dunbar	0904134252	Add <cstddef> include to get ptrdiff_t, for gcc-4.6; patch by Dimitry Andric. llvm-svn: 105994	2010-06-15 14:50:42 +00:00
Bob Wilson	a55b8877e6	Generalize the pre-coalescing of extract_subregs feeding reg_sequences, replacing the overly conservative checks that I had introduced recently to deal with correctness issues. This makes a pretty noticable difference in our testcases where reg_sequences are used. I've updated one test to check that we no longer emit the unnecessary subreg moves. llvm-svn: 105991	2010-06-15 05:56:31 +00:00
Bob Wilson	1478142485	VMOVQQ and VMOVQQQQ are pseudo instructions and not predicable. llvm-svn: 105990	2010-06-15 05:51:27 +00:00
Dale Johannesen	3f253d2353	Revert 105986; looks like I'd better try bootstrapping. llvm-svn: 105988	2010-06-15 04:55:06 +00:00
Ted Kremenek	d52caa5244	Update CMake build. llvm-svn: 105987	2010-06-15 04:08:14 +00:00
Dale Johannesen	c338ef2b65	The form of BuildMI used for TAILJMPr was changing the register containing the target address, an input, into an output. I don't think this actually broke anything on x86 (it does on ARM), but it's wrong. llvm-svn: 105986	2010-06-15 03:13:49 +00:00
Jim Grosbach	f14e08b01b	Make sure to skip dbg_value instructions when finding an insertion point for the combined load/store instruction. rdar://7797940 llvm-svn: 105982	2010-06-15 00:41:09 +00:00
Bob Wilson	5b2b504038	Rename functions referring to VMOV immediates to refer to NEON "modified immediate" operands. These functions have so far only been used for VMOV but they also apply to other NEON instructions with modified immediate operands. No functional changes. llvm-svn: 105969	2010-06-14 22:19:57 +00:00
Jim Grosbach	412800d346	More dbg_value cleanup so the presence of debug info doesn't affect code-gen. Make sure to skip the dbg_value instructions when moving dups out of the diamond. rdar://7797940 llvm-svn: 105965	2010-06-14 21:30:32 +00:00
Evan Cheng	078f4cec21	- Do away with SimpleHazardRecognizer.h. It's not used and offers little value. - Rename ExactHazardRecognizer to PostRAHazardRecognizer and move its header to include to allow targets to extend it. llvm-svn: 105959	2010-06-14 21:06:53 +00:00
Evan Cheng	a397ada078	Avoid uncessary array copying. llvm-svn: 105955	2010-06-14 20:18:40 +00:00
Chris Lattner	faa7bdccbf	fix a nasty bug where we were not treating available_externally symbols as declarations in the X86 backend. This would manifest on darwin x86-32 as errors like this with -fvisibility=hidden: symbol '__ZNSbIcED1Ev' can not be undefined in a subtraction expression This fixes PR7353. llvm-svn: 105954	2010-06-14 20:11:56 +00:00
Chris Lattner	329ea064ed	jump threading can't split a critical edge from an indirectbr. This fixes PR7356. llvm-svn: 105950	2010-06-14 19:45:43 +00:00
Chris Lattner	58c09b2859	fix a -Wbool-conversions warning from clang. llvm-svn: 105943	2010-06-14 18:28:57 +00:00
Chris Lattner	0fc88efda3	fix a -Wbool-conversions warning from clang. llvm-svn: 105942	2010-06-14 18:28:34 +00:00
Eli Friedman	ba1f1fcae5	Add back some possible optimizations for va_arg, with wording that makes it more clear what exactly is missing. llvm-svn: 105934	2010-06-14 07:03:30 +00:00
Benjamin Kramer	b82de426de	SimplifyCFG: don't turn volatile stores to null/undef into unreachable. Fixes PR7369. llvm-svn: 105914	2010-06-13 14:35:54 +00:00
Rafael Espindola	e302f833e1	Merge getStoreRegOpcode and getLoadRegOpcode. llvm-svn: 105900	2010-06-12 20:13:29 +00:00
Chris Lattner	2ed39551a7	improve verifier error about unterminated block to include function name, patch by Yuri llvm-svn: 105887	2010-06-12 15:50:24 +00:00
Eli Friedman	e17e4aea2a	Add README entry; based on testcase from Bill Hart. llvm-svn: 105878	2010-06-12 05:54:27 +00:00
Bruno Cardoso Lopes	ada854f8b6	make the avx intrinsics 3 address llvm-svn: 105876	2010-06-12 03:12:14 +00:00
Bruno Cardoso Lopes	f203703467	Add some basic fp intrinsics for AVX llvm-svn: 105873	2010-06-12 02:38:32 +00:00
Bill Wendling	5d6103318a	When performing the Horrible Hack(tm-Duncan) on the EH code to convert a clean-up to a catch-all after inlining, take into account that there could be filter IDs as well. The presence of filters don't mean that the selector catches anything. It's just metadata information. llvm-svn: 105872	2010-06-12 02:34:29 +00:00
Bruno Cardoso Lopes	a714ea0f7d	More AVX: {ADD,SUB,MUL,DIV}{PD,PS}rm llvm-svn: 105870	2010-06-12 01:53:48 +00:00
Bruno Cardoso Lopes	b06f54b852	More AVX: {ADD,SUB,MUL,DIV}{PD,PS}rr Handle OpSize TSFlag for AVX llvm-svn: 105869	2010-06-12 01:23:26 +00:00
Evan Cheng	e60273fd70	Allow target to provide its own hazard recognizer to post-ra scheduler. llvm-svn: 105862	2010-06-12 00:12:18 +00:00
Evan Cheng	cb1fe56fd9	Code formatting. llvm-svn: 105861	2010-06-12 00:11:53 +00:00
Bruno Cardoso Lopes	8947c32493	Add some comments about REX fields llvm-svn: 105860	2010-06-12 00:03:52 +00:00
Bruno Cardoso Lopes	fd5458d4bd	More AVX instructions ({ADD,SUB,MUL,DIV}{SS,SD}rm) Introduce the VEX_X field llvm-svn: 105859	2010-06-11 23:50:47 +00:00
Bob Wilson	f07d33d8f1	Add a missing bitcast. This code used to only handle conversions between i64 and f64 types, but now it also handle Neon vector types, so the f64 result of VMOVDRR may need to be converted to a Neon type. Radar 8084742. llvm-svn: 105845	2010-06-11 22:45:25 +00:00
Bob Wilson	6eae520de9	Add instruction encoding for the Neon VMOV immediate instruction. This changes the machine instruction representation of the immediate value to be encoded into an integer with similar fields as the actual VMOV instruction. This makes things easier for the disassembler, since it can just stuff the bits into the immediate operand, but harder for the asm printer since it has to decode the value to be printed. Testcase for the encoding will follow later when MC has more support for ARM. llvm-svn: 105836	2010-06-11 21:34:50 +00:00
Stuart Hastings	afe54f1625	Support for nested functions/classes in debug output. (Again.) Radar 7424645. llvm-svn: 105828	2010-06-11 20:08:44 +00:00
Stuart Hastings	6111abf8ad	Delete duplicate function. llvm-svn: 105827	2010-06-11 20:05:01 +00:00
Duncan Sands	a349d522f7	Avoid "variable 'bits' set but not used [-Wunused-but-set-variable]" warnings with gcc-4.6, by not setting bits when the result is not used. llvm-svn: 105790	2010-06-10 16:23:15 +00:00
Evan Cheng	38f6560461	Code refactoring, no functionality changes. llvm-svn: 105775	2010-06-10 02:09:31 +00:00
Evan Cheng	2901371c32	Delete code that's not safe. llvm-svn: 105774	2010-06-10 02:08:20 +00:00
Jim Grosbach	5fa0158ecd	be slightly more subtle about skipping dbg_value instructions; otherwise, if a dbg_value immediately follows a sequence of ldr/str instructions that should be combined into an ldm/stm and is the last instruction in the block, then combine may end up being skipped. llvm-svn: 105758	2010-06-09 22:21:24 +00:00
Jakob Stoklund Olesen	8bc5eca331	Mark physregs defined by inline asm as implicit. This is a bit of a hack to make inline asm look more like call instructions. It would be better to produce correct dead flags during isel. llvm-svn: 105749	2010-06-09 20:05:00 +00:00
Evan Cheng	a0746bd50a	Allow target to place 2-address pass inserted copies in better spots. Thumb2 will use this to try to avoid breaking up IT blocks. llvm-svn: 105745	2010-06-09 19:26:01 +00:00
Bill Wendling	5ac1d23d3d	It's an error to translate this: %reg1025 = <sext> %reg1024 ... %reg1026 = SUBREG_TO_REG 0, %reg1024, 4 into this: %reg1025 = <sext> %reg1024 ... %reg1027 = EXTRACT_SUBREG %reg1025, 4 %reg1026 = SUBREG_TO_REG 0, %reg1027, 4 The problem here is that SUBREG_TO_REG is there to assert that an implicit zext occurs. It doesn't insert a zext instruction. If we allow the EXTRACT_SUBREG here, it will give us the value after the <sext>, not the original value of %reg1024 before <sext>. llvm-svn: 105741	2010-06-09 19:00:55 +00:00
Evan Cheng	ae83e1f5cb	Revert 105540, 105542, 105544, 105546, and 105548 to unbreak bootstrapping. llvm-svn: 105740	2010-06-09 18:59:43 +00:00
Kenneth Uildriks	9b21208bfb	Pulled CodeMetrics out of InlineCost.h and made it a bit more general, so it can be reused from PartialSpecializationCost llvm-svn: 105725	2010-06-09 15:11:37 +00:00
Kalle Raiskila	5e0862f7f5	Fix SPU to cope with vector insertelement to an undef position. We default to inserting to lane 0. llvm-svn: 105722	2010-06-09 09:58:17 +00:00
Kalle Raiskila	056113a211	Handle loading from/storing to undef pointers on SPU by inserting a random load/store, rather than crashing llc. llvm-svn: 105710	2010-06-09 08:29:41 +00:00
Evan Cheng	83c64ee8de	Typo. llvm-svn: 105677	2010-06-09 03:49:12 +00:00
Eli Friedman	ab44d1281a	A few new x86-64 specific README entries. llvm-svn: 105674	2010-06-09 02:43:17 +00:00
Evan Cheng	47cd593023	Thumb2 IT blocks are fairly expensive. When there are multiple selects using the same condition, it's important to make sure they are scheduled together to avoid forming multiple IT blocks. I'm adding a pre-regalloc pass that forms IT blocks early (by re-scheduling instructions and split basic blocks) to attempt to fix this. This is not turned on by default since I am not sure this is the right fix. Another issue is llvm selects are modeled as two-address conditional moves. This can be very bad when the copies before the conditional moves are not coalesced away. Teach IT formation pass to move the copies above the IT block (when legal) to avoid breaking the IT block. llvm-svn: 105669	2010-06-09 01:46:50 +00:00
Jakob Stoklund Olesen	a13b1c29b0	Add argument name comments. llvm-svn: 105665	2010-06-09 00:40:31 +00:00
Kevin Enderby	0de0f3fc02	Incremental improvement to the handling of the x86 "Jump if rCX Zero" instruction. Added the 64-bit version "jrcxz" so it is recognized and also added the checks for incorrect uses of "jcxz" in 64-bit mode and "jrcxz" in 32-bit mode. Still to do is to correctly handle the encoding of the instruction adding the Address-size override prefix byte, 0x67, when the width of the count register is not the same as the mode the machine is running in. Which for example means the encoding of "jecxz" depends if you are assembling as a 32-bit target or a 64-bit target. llvm-svn: 105661	2010-06-08 23:48:44 +00:00
Eric Christopher	6ab55c5683	Split out these asserts so it's more apparent why we're not assembling that rip-relative address when executing in 32-bit mode. llvm-svn: 105656	2010-06-08 22:57:33 +00:00
Jim Grosbach	8fe3cc8055	fix copy/paste/modify think-o llvm-svn: 105653	2010-06-08 22:53:32 +00:00
Bruno Cardoso Lopes	c2f87b7bb2	Reapply r105521, this time appending "LLU" to 64 bit immediates to avoid breaking the build. llvm-svn: 105652	2010-06-08 22:51:23 +00:00
Eric Christopher	89d103a8ce	Ensure that mov and not lea are used to stick the address into the register. While we're at it, make sure it's in the right one. llvm-svn: 105645	2010-06-08 22:04:25 +00:00
Jim Grosbach	57c6fd452e	fix typo llvm-svn: 105634	2010-06-08 20:06:55 +00:00
Daniel Dunbar	5729f51410	Use const_iterator where appropriate. llvm-svn: 105620	2010-06-08 17:21:57 +00:00
Daniel Dunbar	f2363de7ad	DeltaAlgorithm: Tweak split to split by first/second half instead of even/odd, since adjacent changes are more likely to be related. llvm-svn: 105613	2010-06-08 16:21:26 +00:00
Daniel Dunbar	579ba2ac42	ADT: Add DAGDeltaAlgorithm, which is a DAG minimization algorithm built on top of the standard 'delta debugging' algorithm. - This can give substantial speedups in the delta process for inputs we can construct dependency information for. llvm-svn: 105612	2010-06-08 16:21:22 +00:00
Benjamin Kramer	4e36e5bb4c	Use realloc instead of malloc+memcpy when growing a POD SmallVector. A smart realloc implementation can try to expand the allocated memory block in-place, avoiding the copy. llvm-svn: 105605	2010-06-08 11:44:30 +00:00
Kalle Raiskila	6c40caf729	Flag SPU's function call sequence together. Discussed here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2010-June/032107.html llvm-svn: 105601	2010-06-08 07:55:16 +00:00
Bob Wilson	0271c5928e	Fix up a comment. llvm-svn: 105591	2010-06-08 00:42:08 +00:00
Bob Wilson	846bd7992c	Further changes for Neon vector shuffles: - change isShuffleMaskLegal to show that all shuffles with 32-bit and 64-bit elements are legal - the Neon shuffle instructions do not support 64-bit elements, but we were not checking for that before lowering shuffles to use them - remove some 64-bit element vduplane patterns that are no longer needed llvm-svn: 105586	2010-06-07 23:53:38 +00:00
Bob Wilson	7149cfcda3	Fix a mistake in my previous change r105437: don't access operand 2 and assume that it is an immediate before checking that the instruction is an EXTRACT_SUBREG. llvm-svn: 105585	2010-06-07 23:48:46 +00:00
Dan Gohman	7398758719	Add some basic debug output. llvm-svn: 105561	2010-06-07 22:32:10 +00:00
Jim Grosbach	723d242a95	Handle dbg_value instructions (i.e., skip them) when generating IT blocks. rdar://7797940 llvm-svn: 105557	2010-06-07 21:48:47 +00:00
Jim Grosbach	6201b991a2	Cleanup. Process the dbg_values separately llvm-svn: 105554	2010-06-07 21:28:55 +00:00
Dan Gohman	fb8ed43349	Make bugpoint dead-argument-hacking actually work, and actually test it. llvm-svn: 105551	2010-06-07 20:20:33 +00:00
Dan Gohman	ebf2e977cf	The FoldingSet hash data includes pointer values, so it isn't determinstic. Instead, give SCEV objects an arbitrary sequence number. llvm-svn: 105548	2010-06-07 19:36:14 +00:00
Dan Gohman	3553feed79	Optimize this code somewhat by taking advantage of the fact that the operands are sorted. llvm-svn: 105546	2010-06-07 19:20:57 +00:00
Bill Wendling	cfcd0e12cf	Another place where the code wanted to access the argument list and not all of the operands. llvm-svn: 105545	2010-06-07 19:18:58 +00:00
Dan Gohman	a2effb6452	Micro-optimize this, to speed up this hotspot in debug builds a little. llvm-svn: 105544	2010-06-07 19:16:37 +00:00
Dan Gohman	18a4b46404	Micro-optimize this. llvm-svn: 105542	2010-06-07 19:12:54 +00:00
Jim Grosbach	0f445f328e	Move exit check where it really belongs. llvm-svn: 105541	2010-06-07 19:12:21 +00:00
Dan Gohman	70910a6ab6	Optimize ScalarEvolution's SCEVComplexityCompare predicate: don't go scrounging through SCEVUnknown contents and SCEVNAryExpr operands; instead just do a simple deterministic comparison of the precomputed hash data. Also, since this is more precise, it eliminates the need for the slow N^2 duplicate detection code. llvm-svn: 105540	2010-06-07 19:06:13 +00:00
Bill Wendling	a3bba3371a	Create new accessors to get arguments for call/invoke instructions. It breaks encapsulation to force the users of these classes to know about the internal data structure of the Operands structure. It also can lead to errors, like in the MSIL writer. llvm-svn: 105539	2010-06-07 19:05:06 +00:00
Kenneth Uildriks	1850444000	Partial specialization was not checking the callsite to make sure it was using the same constants as the specialization, leading to calls to the wrong specialization. Patch by Takumi Nakamura\! llvm-svn: 105528	2010-06-05 14:50:21 +00:00
Duncan Sands	e4f45cc88f	This bug is also present in MSVC10. Requested by Elrood on IRC. llvm-svn: 105527	2010-06-05 12:40:43 +00:00
Chris Lattner	fdd2614330	revert r105521, which is breaking the buildbots with stuff like this: In file included from X86InstrInfo.cpp:16: X86GenInstrInfo.inc:2789: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2790: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2792: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2793: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2808: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2809: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2816: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2817: error: integer constant is too large for 'long' type llvm-svn: 105524	2010-06-05 04:17:30 +00:00
Bruno Cardoso Lopes	594fa26317	Initial AVX support for some instructions. No patterns matched yet, only assembly encoding support. llvm-svn: 105521	2010-06-05 03:53:24 +00:00
Dale Johannesen	81ef35b3ca	Improvements to tail call code. No functional effect unless using -arm-tail-calls. llvm-svn: 105515	2010-06-05 00:51:39 +00:00
Dan Gohman	520913cf9e	getFoldedOffsetOf no longer does anything special with vector types. llvm-svn: 105514	2010-06-05 00:47:34 +00:00
Stuart Hastings	3ca391027f	Revert 105492 & 105493 due to a testcase regression. Radar 7424645. llvm-svn: 105511	2010-06-05 00:39:29 +00:00
Dan Gohman	bbfb6aca92	LSR needs to remember inserted instructions even in postinc mode, because there could be multiple subexpressions within a single expansion which require insert point adjustment. This fixes PR7306. llvm-svn: 105510	2010-06-05 00:33:07 +00:00
Dale Johannesen	df1a7f83bf	Fix some liveout handling related to tail calls, see comments. I don't think this ever resulted in problems on x86, but it would on ARM. llvm-svn: 105509	2010-06-05 00:30:45 +00:00
Evan Cheng	a03e6f85fe	Re-apply 105308 with fix. llvm-svn: 105502	2010-06-04 23:28:13 +00:00
Dan Gohman	67b4403101	Don't track users of undef values; they aren't interesting for register pressure. llvm-svn: 105501	2010-06-04 23:16:05 +00:00
Jim Grosbach	a1e08fb256	Make if-conversion ignore dbg_value instructions in its analysis. rdar://7797940 llvm-svn: 105498	2010-06-04 23:01:26 +00:00
Stuart Hastings	7c015988fe	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 105492	2010-06-04 22:36:03 +00:00
Devang Patel	36da24b546	Copy location info for current function argument from dbg.declare if respective store instruction does not have any location info. llvm-svn: 105490	2010-06-04 22:27:30 +00:00
Jim Grosbach	50d229e6b3	Skip dbg_value instructions when scanning instructions in register scavenging. llvm-svn: 105481	2010-06-04 20:18:30 +00:00
Dan Gohman	538b413ccb	Fix normalization and de-normalization of non-affine SCEVs. llvm-svn: 105480	2010-06-04 19:16:34 +00:00
Jakob Stoklund Olesen	864827afb0	Keep track of the call instructions whose clobber lists were skipped during fast register allocation. Process all of the clobber lists at the end of the function, marking the registers as used in MachineRegisterInfo. This is necessary in case the calls clobber callee-saved registers (sic). llvm-svn: 105473	2010-06-04 18:08:29 +00:00
Dale Johannesen	d1b9311afa	More thoroughly disable tails calls by default. 8060143, although this doesn't fix the real problem with tail call. llvm-svn: 105472	2010-06-04 18:04:24 +00:00
Jim Grosbach	3548803f62	Another fix to prevent debug info from affecting codegen. rdar://7797940 llvm-svn: 105470	2010-06-04 17:57:34 +00:00
Jim Grosbach	4e5e6a8973	more dbg_value adjustments so debug info doesn't affect codegen llvm-svn: 105454	2010-06-04 01:23:30 +00:00
Mon P Wang	622cdd2297	Fixed a bug during widening where we would avoid legalizing a node. When we replace an OpA with a widened OpB, it is possible to get new uses of OpA due to CSE when recursively updating nodes. Since OpA has been processed, the new uses are not examined again. The patch checks if this occurred and it it did, updates the new uses of OpA to use OpB. llvm-svn: 105453	2010-06-04 01:20:10 +00:00
Jim Grosbach	1bcdf32d22	fix typo llvm-svn: 105441	2010-06-04 00:15:00 +00:00
Bob Wilson	d8a9a04739	For NEON vectors with 32- or 64-bit elements, select BUILD_VECTORs and VECTOR_SHUFFLEs to REG_SEQUENCE instructions. The standard ISD::BUILD_VECTOR node corresponds closely to REG_SEQUENCE but I couldn't use it here because its operands do not get legalized. That is pretty awful, but I guess it makes sense for other targets. Instead, I have added an ARM-specific version of BUILD_VECTOR that will have its operands properly legalized. This fixes the rest of Radar 7872877. llvm-svn: 105439	2010-06-04 00:04:02 +00:00
Bob Wilson	a733daf18c	Add some missing checks in TwoAddressInstructionPass::CoalesceExtSubRegs. Check that all the instructions are in the same basic block, that the EXTRACT_SUBREGs write to the same subregs that are being extracted, and that the source and destination registers are in the same regclass. Some of these constraints can be relaxed with a bit more work. Jakob suggested that the loop that checks for subregs when NewSubIdx != 0 should use the "nodbg" iterator, so I made that change here, too. llvm-svn: 105437	2010-06-03 23:53:58 +00:00
Jim Grosbach	01edd68225	Cleanup 80-column and trim trailing whitespace llvm-svn: 105435	2010-06-03 23:49:57 +00:00
Jim Grosbach	b30b81edb6	Teach the ARM load-store optimizer to deal with dbg_value instructions. llvm-svn: 105427	2010-06-03 22:41:15 +00:00
Dale Johannesen	d679ff7330	Early implementation of tail call for ARM. A temporary flag -arm-tail-calls defaults to off, so there is no functional change by default. Intrepid users may try this; simple cases work but there are bugs. llvm-svn: 105413	2010-06-03 21:09:53 +00:00
Dan Gohman	d83e3e7750	Fix SimplifyDemandedBits' AssertZext logic to demand all the bits. It needs to demand the high bits because it's asserting that they're zero. llvm-svn: 105406	2010-06-03 20:21:33 +00:00
Bob Wilson	30093b5d8b	Revert 105308. llvm-svn: 105399	2010-06-03 18:28:31 +00:00
Bill Wendling	f82aea634c	Machine sink could potentially sink instructions into a block where the physical registers it defines then interfere with an existing preg live range. For instance, if we had something like these machine instructions: BB#0 ... = imul ... EFLAGS<imp-def,dead> test ..., EFLAGS<imp-def> jcc BB#2 EFLAGS<imp-use> BB#1 ... ; fallthrough to BB#2 BB#2 ... ; No code that defines EFLAGS jcc ... EFLAGS<imp-use> Machine sink will come along, see that imul implicitly defines EFLAGS, but because it's "dead", it assumes that it can move imul into BB#2. But when it does, imul's "dead" imp-def of EFLAGS is raised from the dead (a zombie) and messes up the condition code for the jump (and pretty much anything else which relies upon it being correct). The solution is to know which pregs are live going into a basic block. However, that information isn't calculated at this point. Nor does the LiveVariables pass take into account non-allocatable physical registers. In lieu of this, we do a very conservative pass through the basic block to determine if a preg is live coming out of it. llvm-svn: 105387	2010-06-03 07:54:20 +00:00
Eric Christopher	b0e1a458ce	Add first pass at darwin tls compiler support. llvm-svn: 105381	2010-06-03 04:07:48 +00:00
Eric Christopher	f67fe3b1e8	One underscore, not two. llvm-svn: 105379	2010-06-03 04:02:59 +00:00
Eli Friedman	dbbbf73c96	Implement expansion in type legalization for add/sub with overflow. The expansion is the same as that used by LegalizeDAG. The resulting code sucks in terms of performance/codesize on x86-32 for a 64-bit operation; I haven't looked into whether different expansions might be better in general. llvm-svn: 105378	2010-06-03 03:49:50 +00:00
Eli Friedman	ceb13f2af3	Remove some already-fixed README entries. llvm-svn: 105377	2010-06-03 01:47:31 +00:00
Eli Friedman	a59b7a72b9	Remove README entry which no longer compiles to something sane. llvm-svn: 105376	2010-06-03 01:16:51 +00:00
Eli Friedman	1f41303260	Remove a fixed item, update a couple partially-fixed items. llvm-svn: 105375	2010-06-03 01:01:48 +00:00
Jakob Stoklund Olesen	4029596f93	Use the fast register allocator by default for -O0 builds. This affects both llvm-gcc and clang. llvm-svn: 105372	2010-06-03 00:39:06 +00:00
Jakob Stoklund Olesen	818e4df2b4	Use readsWritesVirtualRegister instead of counting uses and defs when inserting spills and reloads. This means that a partial define of a register causes a reload so the other parts of the register are preserved. The reload can be prevented by adding an <imp-def> operand for the full register. This is already done by the coalescer and live interval analysis where relevant. llvm-svn: 105369	2010-06-03 00:07:47 +00:00
Jakob Stoklund Olesen	42c642cd24	Add full register <imp-def> operands when the coalescer is creating partial register updates. These operands tell the spiller that the other parts of the partially defined register are don't-care, and a reload is not necessary. llvm-svn: 105361	2010-06-02 23:22:11 +00:00
Devang Patel	df84e8baf7	Speedup bitcode writer. Do not walk all values for all functions to emit function local metadata. In one testcase, probably worst case scenario, the 70x speed up is seen. llvm-svn: 105360	2010-06-02 23:05:04 +00:00
Bill Wendling	7ee730eb40	Compulsive reformating. No functionalitical changes. llvm-svn: 105359	2010-06-02 23:04:26 +00:00
Jakob Stoklund Olesen	a8ad97743d	Slightly change the meaning of the reMaterialize target hook when the original instruction defines subregisters. Any existing subreg indices on the original instruction are preserved or composed with the new subreg index. Also substitute multiple operands mentioning the original register by using the new MachineInstr::substituteRegister() function. This is necessary because there will soon be <imp-def> operands added to non read-modify-write partial definitions. This instruction: %reg1234:foo = FLAP %reg1234<imp-def> will reMaterialize(%reg3333, bar) like this: %reg3333:bar-foo = FLAP %reg333:bar<imp-def> Finally, replace the TargetRegisterInfo pointer argument with a reference to indicate that it cannot be NULL. llvm-svn: 105358	2010-06-02 22:47:25 +00:00
Jim Grosbach	84511e1526	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Rafael Espindola	f2dffcef82	Remove the TargetRegisterClass member from CalleeSavedInfo llvm-svn: 105344	2010-06-02 20:02:30 +00:00
Eli Friedman	6e3d5af945	Fix comment so it doesn't include comments which are irrelevant to the x86 backend. Add a FIXME noting what can be fixed here. llvm-svn: 105342	2010-06-02 19:35:46 +00:00
Dan Gohman	a690618c58	Use comments to document non-obvious code rather than mailing list archives. llvm-svn: 105341	2010-06-02 19:13:40 +00:00
Devang Patel	c2254f6b98	Skip identical instruction while calculating DBG_VALUE range. llvm-svn: 105340	2010-06-02 19:05:13 +00:00
Bob Wilson	2d35a9e810	Rename canCombinedSubRegIndex method to something more grammatically correct and tidy up the comment describing it. llvm-svn: 105339	2010-06-02 18:54:47 +00:00
Rafael Espindola	94801a47f8	Replace ARM's getCalleeSavedRegClasses with a simpler solution llvm-svn: 105335	2010-06-02 17:54:50 +00:00
Devang Patel	21ccf05b4c	Use local small vector. llvm-svn: 105332	2010-06-02 16:42:51 +00:00
Rafael Espindola	7881a64a50	Remove unused function. llvm-svn: 105325	2010-06-02 15:44:20 +00:00
Jim Grosbach	848548300d	Not all entries in the range will have an SUnit. Check for that when looking for debug information. llvm-svn: 105324	2010-06-02 15:29:36 +00:00
Rafael Espindola	ef2b6ce00a	cleanup llvm-svn: 105322	2010-06-02 13:53:17 +00:00
Rafael Espindola	c08ecba597	Remove uses of getCalleeSavedRegClasses from outside the backends and removes the virtual declaration. With that out of the way I should be able to cleanup one backend at a time. llvm-svn: 105321	2010-06-02 12:39:06 +00:00
Evan Cheng	a2da22734f	Enable machine cse of instructions which define physical registers. llvm-svn: 105308	2010-06-02 01:08:27 +00:00
Eli Friedman	526e6d045f	Don't try to custom-lower 64-bit add-with-overflow and friends on x86-32; the x86 backend currently doesn't know how to handle them. This doesn't really fix anything because LegalizeTypes doesn't know how to handle them either. We do get a better error message, though. llvm-svn: 105305	2010-06-02 00:27:18 +00:00
Bob Wilson	f4a34b97b8	Fix an obvious mistake: don't change the operands until all of them have been checked and it is safe to proceed with the changes. llvm-svn: 105304	2010-06-02 00:16:08 +00:00
Eli Friedman	6382c9c681	Remove outdated README entries. llvm-svn: 105303	2010-06-02 00:10:36 +00:00
Jim Grosbach	12ac8f0352	Update debug information when breaking anti-dependencies. rdar://7759363 llvm-svn: 105300	2010-06-01 23:48:44 +00:00
Jakob Stoklund Olesen	7b0ac865a4	Properly compose subregister indices when coalescing. The comment about ordering of subreg indices is no longer true. This exposed a bug in the new substVirtReg method that is also fixed. llvm-svn: 105294	2010-06-01 22:39:25 +00:00
Jim Grosbach	5ba76b94f8	Remove unused code llvm-svn: 105293	2010-06-01 21:56:30 +00:00
Devang Patel	d43e0ca916	Ignore line number of debug value in undefined register. llvm-svn: 105292	2010-06-01 21:43:09 +00:00
Jim Grosbach	0e20dc5cd6	fix think-o llvm-svn: 105291	2010-06-01 21:35:50 +00:00
Dan Gohman	47a0724425	Fix the allocation of shadow space for the Win64 calling convention in X86FastISel. Patch by Jan Sjodin. llvm-svn: 105290	2010-06-01 21:09:47 +00:00
Jim Grosbach	b69c68742a	Simplify things a bit more. Fix prototype to use SmallVectorImpl and change a few SmallVectors to vanilla C arrays. llvm-svn: 105289	2010-06-01 21:06:46 +00:00
Dan Gohman	49a372cebc	Fix the noalias checking so that it doesn't worry about an argument aliasing itself. Thanks Duncan! llvm-svn: 105288	2010-06-01 20:51:40 +00:00
Devang Patel	b0c76394a3	Keep track of incoming debug value of unused argument. Radar 7927666. llvm-svn: 105285	2010-06-01 19:59:01 +00:00
Dan Gohman	b782caa393	Fill in missing support for ISD::FEXP, ISD::FPOWI, and friends. llvm-svn: 105283	2010-06-01 18:35:14 +00:00
Jim Grosbach	b24d5c6ce2	Add a FIXME llvm-svn: 105282	2010-06-01 18:06:35 +00:00
Jim Grosbach	a37af16221	mirror of r105280 changes for LowerInvoke, which uses the same basic logic here llvm-svn: 105281	2010-06-01 18:04:56 +00:00
Jim Grosbach	74d8345512	When processing function arguments when splitting live ranges across invokes, handle structs passed by value via an extract/insert pair, as a bitcast won't work on a struct. rdar://7742824 llvm-svn: 105280	2010-06-01 18:04:09 +00:00
Jim Grosbach	7352167560	Use SmallVector instead of std::vector. llvm-svn: 105279	2010-06-01 17:56:41 +00:00
Bruno Cardoso Lopes	d44677ba69	Refactor some SSE 2 unpack instructions llvm-svn: 105276	2010-06-01 17:02:50 +00:00
Dan Gohman	a76715fc88	Don't call flush() at a library level which isn't checking for errors and doesn't know where the output is going. llvm-svn: 105274	2010-06-01 16:31:34 +00:00
Kalle Raiskila	8916358f97	Fix handling of 'load' nodes. llvm-svn: 105269	2010-06-01 13:34:47 +00:00
Duncan Sands	4c904fa797	Fix PR7272: when inlining through a callsite with byval arguments, the newly created allocas may be used by inlined calls, so these need to have their tail call flags cleared. Fixes PR7272. llvm-svn: 105255	2010-05-31 21:00:26 +00:00
Chris Lattner	14c46517b5	fix PR6623: when optimizing for size, don't inline memcpy/memsets that are too large. This causes the freebsd bootloader to be too large apparently. It's unclear if this should be an -Os or -Oz thing. Thoughts welcome. llvm-svn: 105228	2010-05-31 17:30:14 +00:00
Chris Lattner	b4a773b452	the 'limit' argument to FindOptimalMemOpLowering is unsigned, not uint64_t. llvm-svn: 105226	2010-05-31 17:12:23 +00:00
Benjamin Kramer	5ac57e3440	Avoid swap when a copy suffices. llvm-svn: 105220	2010-05-31 12:50:41 +00:00
Nick Lewycky	aee2632be3	The memcpy intrinsic only takes i8* for %src and %dst, so cast them to that first. Fixes PR7265. llvm-svn: 105206	2010-05-31 06:16:35 +00:00
Oscar Fuentes	a97311f152	Use `llvm::next' instead of `next' to make VC++ 2010 happy. llvm-svn: 105168	2010-05-30 13:14:21 +00:00
Anton Korobeynikov	a09d95412e	Some A9 load/store cleanups llvm-svn: 105109	2010-05-29 19:25:39 +00:00
Anton Korobeynikov	2a21aef8f2	Some rough approximations for load/stores on A9 llvm-svn: 105108	2010-05-29 19:25:34 +00:00
Anton Korobeynikov	d4c7cceb70	NEON/VFP stuff can be issued only via Pipe1 on A9 llvm-svn: 105107	2010-05-29 19:25:29 +00:00
Anton Korobeynikov	94d7fd88fd	Add some integer instruction itineraries for A9 llvm-svn: 105106	2010-05-29 19:25:17 +00:00
Dan Gohman	4db93c9700	Reorder some code in SelectionDAGBuilder. llvm-svn: 105105	2010-05-29 17:53:24 +00:00
Dan Gohman	d16aa541af	SelectionDAG shouldn't have a FunctionLoweringInfo member. RegsForValue shouldn't have a TargetLoweringInfo member. And FunctionLoweringInfo::set doesn't needs its EnableFastISel argument. llvm-svn: 105101	2010-05-29 17:03:36 +00:00
Benjamin Kramer	c488e92f0b	Remove unused function. llvm-svn: 105100	2010-05-29 14:03:51 +00:00
Evan Cheng	707b7cc429	Remove schedule-livein-copies. It's not being used. llvm-svn: 105095	2010-05-29 02:23:39 +00:00
Evan Cheng	27c4933e02	Fix PR7193: if sibling call address can take a register, make sure there are enough registers available by counting inreg arguments. llvm-svn: 105092	2010-05-29 01:35:22 +00:00
Jakob Stoklund Olesen	ab6223949e	Handle composed subreg indices when processing REQ_SEQUENCE instructions. llvm-svn: 105066	2010-05-29 00:14:14 +00:00
Evan Cheng	032f3261a2	Doh. Machine LICM is re-initializing the CSE map over and over. Patch by Anna Zaks. rdar://8037934. llvm-svn: 105065	2010-05-29 00:06:36 +00:00
Jakob Stoklund Olesen	e02996ca8f	Merge the SystemZ subreg_even32 SubRegIndex into subreg_32bit. The SubRegIndices were overspecified when inheriting sub-subregisters, for instance: R0Q:subreg_even32 = R0Q:subreg_32bit = R0Q:subreg_even:subreg_32bit. This meant that composeSubRegIndices(subreg_even, subreg_32bit) was ambiguous. llvm-svn: 105063	2010-05-28 23:48:29 +00:00
Evan Cheng	cc2efe11db	Fix some latency computation bugs: if the use is not a machine opcode do not just return zero. llvm-svn: 105061	2010-05-28 23:26:21 +00:00
Evan Cheng	bf91499f1a	Schedule high latency instructions for latency reduction even if they are not vfp / NEON instructions. llvm-svn: 105060	2010-05-28 23:25:23 +00:00
Dale Johannesen	e8be73f3e7	Fix comment typos. llvm-svn: 105059	2010-05-28 23:24:28 +00:00
Bruno Cardoso Lopes	1f79289806	More SSE 1 & 2 merge, this time with logical instructions llvm-svn: 105014	2010-05-28 22:47:03 +00:00
Dan Gohman	34709d06c0	Fix AliasDebugger to be aware of operand values too. llvm-svn: 105012	2010-05-28 22:31:51 +00:00
Dan Gohman	0fa67e479a	Add lint checks for function attributes. llvm-svn: 105009	2010-05-28 21:43:57 +00:00
Dan Gohman	0d7f3b8195	Split the logic behind CastInst::isNoopCast into a separate static function, as is done with most other cast opcode predicates. llvm-svn: 105008	2010-05-28 21:41:37 +00:00
Kevin Enderby	4c71e08ed8	MC/X86: Add alias for movzx. llvm-svn: 105005	2010-05-28 21:20:21 +00:00
Kevin Enderby	b29228905f	MC/X86: Add alias for fwait. llvm-svn: 105001	2010-05-28 20:59:10 +00:00
Kevin Enderby	76413597a9	Fix the use of x86 control and debug registers so that the assertion failure in getX86RegNum() does not happen. Patch by Shantonu Sen! llvm-svn: 104994	2010-05-28 19:01:27 +00:00
Jakob Stoklund Olesen	64824ea99f	Add a TargetRegisterInfo::composeSubRegIndices hook with a default implementation that is correct for most targets. Tablegen will override where needed. Add MachineOperand::subst{Virt,Phys}Reg methods that correctly handle existing subreg indices when sustituting registers. llvm-svn: 104985	2010-05-28 18:18:53 +00:00
Jim Grosbach	b342e09b5e	correct retattr llvm-svn: 104980	2010-05-28 18:03:48 +00:00
Jim Grosbach	0b20fdaff0	Cosmetic cleanup. No functional change. llvm-svn: 104974	2010-05-28 17:51:20 +00:00
Dan Gohman	c575ec61ea	Fix lint's memcpy and memmove checks, and its basic block traversal. llvm-svn: 104970	2010-05-28 17:44:00 +00:00
Jim Grosbach	37eb2c24b9	make sure accesses to set up the jmpbuf don't get moved after it by the scheduler. Add a missing \n. llvm-svn: 104967	2010-05-28 17:37:40 +00:00
Dan Gohman	fb85820f9a	Minor code simplification. llvm-svn: 104959	2010-05-28 16:50:23 +00:00
Dan Gohman	feaeb36edf	Fix a redundant-return warning. llvm-svn: 104958	2010-05-28 16:50:01 +00:00
Dan Gohman	862f034188	Detect self-referential values. llvm-svn: 104957	2010-05-28 16:45:33 +00:00
Stuart Hastings	c1e216583f	Revert 104841, 104842, 104876 due to buildbot failures. Radar 7424645. llvm-svn: 104953	2010-05-28 16:41:07 +00:00
Dan Gohman	cef9fc37f4	Eli pointed out that va_arg instruction result values don't reference the stack. llvm-svn: 104951	2010-05-28 16:34:49 +00:00
Dan Gohman	54d7aaa819	Teach lint how to look through simple store+load pairs and other effective no-op constructs, to make it more effective on unoptimized IR. llvm-svn: 104950	2010-05-28 16:21:24 +00:00
Dan Gohman	826bdf8c10	Move FindAvailableLoadedValue isSafeToLoadUnconditionally out of lib/Transforms/Utils and into lib/Analysis so that Analysis passes can use them. llvm-svn: 104949	2010-05-28 16:19:17 +00:00
Dan Gohman	a3b6c4b529	ConstantFoldConstantExpression can theoretically return null. llvm-svn: 104948	2010-05-28 16:12:08 +00:00
Dan Gohman	df5d7dcef1	Teach instcombine to promote alloca array sizes. llvm-svn: 104945	2010-05-28 15:09:00 +00:00
Dan Gohman	ddba4b725a	Add a lint check for returning the address of stack memory. llvm-svn: 104936	2010-05-28 04:33:42 +00:00
Dan Gohman	05a6555acb	Fix instcombine's handling of alloca to accept non-i32 types. llvm-svn: 104935	2010-05-28 04:33:04 +00:00
Dan Gohman	9da5bb0756	Bitcode support for allocas with arbitrary array size types. llvm-svn: 104915	2010-05-28 01:38:28 +00:00
Devang Patel	3e0fbafab2	Fix typo. llvm-svn: 104914	2010-05-28 01:29:50 +00:00
Devang Patel	e2099e8088	Fix typo. llvm-svn: 104913	2010-05-28 01:17:51 +00:00
Dan Gohman	2140a74979	Eliminate the restriction that the array size in an alloca must be i32. This will help reduce the amount of casting required on 64-bit targets. llvm-svn: 104911	2010-05-28 01:14:11 +00:00
Bob Wilson	b6112e8706	Add the cc_out operand for t2RSBrs instructions. I missed this when I changed the instruction class for t2RSB to add that operand in svn r104582. Radar 8033757. llvm-svn: 104907	2010-05-28 00:27:15 +00:00
Jakob Stoklund Olesen	b613ae2c89	Add a -regalloc=default option that chooses a register allocator based on the -O optimization level. This only really affects llc for now because both the llvm-gcc and clang front ends override the default register allocator. I intend to remove that code later. llvm-svn: 104904	2010-05-27 23:57:25 +00:00
Jim Grosbach	faa3abbe39	Update the saved stack pointer in the sjlj function context following either an alloca() or an llvm.stackrestore(). rdar://8031573 llvm-svn: 104900	2010-05-27 23:49:24 +00:00
Evan Cheng	c2ebe0334a	Use report_fatal_error, not llvm_unreachable. llvm-svn: 104899	2010-05-27 23:45:31 +00:00
Jim Grosbach	c9f532dddc	back out 104862/104869. Can reuse stacksave after all. Very cool. llvm-svn: 104897	2010-05-27 23:11:57 +00:00
Dan Gohman	288999b829	Factor out the handler work from SignalHandler into a helper function, and change llvm::sys::RunInterruptHandlers to call that function directly instead of calling SignalHandler, because the rest of SignalHandler invokes side effects which aren't appropriate, including raising the signal. llvm-svn: 104896	2010-05-27 23:11:55 +00:00
Evan Cheng	3d3ee87d4e	llvm can't correctly support 'H', 'Q' and 'R' modifiers. Just mark it an error. llvm-svn: 104891	2010-05-27 22:08:38 +00:00
Kevin Enderby	9738f64bd9	MC/X86: Add aliases for Jcc variants. llvm-svn: 104890	2010-05-27 21:33:19 +00:00
Dan Gohman	6debf89587	Eliminate some unnessary Path::exists() calls. llvm-svn: 104888	2010-05-27 20:51:54 +00:00
Dan Gohman	a880546c65	Don't flush the raw_ostream in llvm::WriteBitcodeToFile; it's at the wrong level. Clients which need to leave the stream open but which still require the bitcode bits to be on disk should call flush themselves. llvm-svn: 104885	2010-05-27 20:26:51 +00:00
Devang Patel	7a9dedf0ab	Do not drop location info for inlined function args. llvm-svn: 104884	2010-05-27 20:25:04 +00:00
Bob Wilson	40e62dfdc0	Fix some bad fall-throughs in a switch statement. Both the 'Q' and 'R' cases should fall through to the 'H' case, but instead 'Q' was falling through to 'R' so that it would do the wrong thing for a big-endian ARM target. llvm-svn: 104883	2010-05-27 20:23:42 +00:00
Dale Johannesen	9e43c07bc5	Mark some math lib intrinsic nodes Legal on SSE4.1. No functional effect as these nodes are not generated yet. llvm-svn: 104879	2010-05-27 20:12:41 +00:00
Dan Gohman	d9225cee20	Don't special-case stdout in llvm::WriteBitcodeToFile; just consider it to be the caller's responsibility to provide a stream in binary mode. This fixes a layering violation and avoids an outs() call. llvm-svn: 104878	2010-05-27 20:06:51 +00:00
Duncan Sands	f162eace49	Teach instCombine to remove malloc+free if malloc's only uses are comparisons to null. Patch by Matti Niemenmaa. llvm-svn: 104871	2010-05-27 19:09:06 +00:00
Jim Grosbach	b68dfb45f5	hook ISD::STACKADDR to an intrinsic llvm-svn: 104869	2010-05-27 18:52:11 +00:00
Dan Gohman	dc53f1cb5c	FastISel doesn't yet handle callee-pop functions. To support this, move IsCalleePop from X86ISelLowering to X86Subtarget. llvm-svn: 104866	2010-05-27 18:43:40 +00:00
Jim Grosbach	5cde219fb1	add ISD::STACKADDR to get the current stack pointer. Will be used by sjlj EH to update the jmpbuf in the presence of VLAs. llvm-svn: 104862	2010-05-27 18:23:48 +00:00
Bruno Cardoso Lopes	54b07ad2cd	Merge basic binops SSE 1 & 2 instruction classes. This is a step towards refactoring common code between SSE versions. llvm-svn: 104860	2010-05-27 18:17:40 +00:00
Dan Gohman	c36b1f35f0	Add basic error checking to MemoryBuffer::getSTDIN. llvm-svn: 104855	2010-05-27 17:31:51 +00:00
Dan Gohman	ece4bf5148	Use the return value of getMagicNumber instead of using a separate canRead() call. llvm-svn: 104853	2010-05-27 17:18:38 +00:00
Dan Gohman	78d1e84521	Don't bother clearing the Magic string when the magic number can't be read, since it isn't cleared on other error paths. llvm-svn: 104852	2010-05-27 17:14:10 +00:00
Dan Gohman	16f4bd8140	Don't bother checking canRead() before calling getMagicNumber(); getMagicNumber() does its own error checking. llvm-svn: 104851	2010-05-27 17:12:23 +00:00
Devang Patel	5e6b71ce34	inlined function's arguments need a label to mark the start point because they are not directly attached to current function. llvm-svn: 104848	2010-05-27 16:47:30 +00:00
Stuart Hastings	8e99e50d08	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 104841	2010-05-27 16:16:54 +00:00
Eric Christopher	eaddfac539	Rearrange conditionals so we don't get caught with the correct type as wrong. llvm-svn: 104793	2010-05-27 00:52:31 +00:00
Devang Patel	6b9a9fe207	Simplify. Eliminate unneeded debug_loc entry. llvm-svn: 104785	2010-05-26 23:55:23 +00:00

... 17 18 19 20 21 ...

40642 Commits