Commit Graph

2624 Commits

Author SHA1 Message Date
Evan Cheng ffef8b9412 Bottom up register pressure reduction work: cleaned up some hacks and enhanced
the heuristic to further reduce spills for several test cases. (Note, it may
not necessarily translate to a runtime win!)

llvm-svn: 28076
2006-05-03 02:10:45 +00:00
Owen Anderson 20a631fde7 Refactor TargetMachine, pushing handling of TargetData into the target-specific subclasses. This has one caller-visible change: getTargetData() now returns a pointer instead of a reference.
This fixes PR 759.

llvm-svn: 28074
2006-05-03 01:29:57 +00:00
Chris Lattner d8b192ba3b Change the BasicBlockAddrs map to be a vector, indexed by MBB number.
llvm-svn: 28069
2006-05-03 00:32:55 +00:00
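
For illustration only, a minimal standalone C++ sketch (hypothetical names, not the actual LLVM code) of the data-structure change described above: once blocks are densely numbered, a vector indexed by block number replaces a pointer-keyed map.

#include <cstdint>
#include <vector>

// Hypothetical sketch: MBBs are numbered densely 0..N-1, so their emitted
// addresses can live in a vector indexed by that number instead of a
// pointer-keyed map, making each lookup a single indexed load.
struct BasicBlockAddrTable {
  std::vector<std::uint64_t> Addrs;   // slot i = address of MBB number i
  void setAddr(unsigned MBBNumber, std::uint64_t Addr) {
    if (MBBNumber >= Addrs.size())
      Addrs.resize(MBBNumber + 1, 0);
    Addrs[MBBNumber] = Addr;
  }
  std::uint64_t getAddr(unsigned MBBNumber) const { return Addrs[MBBNumber]; }
};
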
Chris Lattner b8065a9a3a Several related changes:
1. Change several methods in the MachineCodeEmitter class to be pure virtual.
2. Suck emitConstantPool/initJumpTableInfo into startFunction, removing them
   from the MachineCodeEmitter interface, and reducing the amount of target-
   specific code.
3. Change the JITEmitter so that it allocates constantpools and jump tables
   *right* next to the functions that they belong to, instead of in a separate
   pool of memory.  This makes all memory for a function be contiguous, and
   means the JITEmitter only tracks one block of memory now.

llvm-svn: 28065
2006-05-02 23:22:24 +00:00
Chris Lattner 23621fe8f4 Do not make the JIT memory manager manage the memory for globals. Instead
just have the JIT malloc them.

llvm-svn: 28062
2006-05-02 21:57:51 +00:00
Chris Lattner c9aa3715e8 Refactor the machine code emitter interface to pull the pointers for the current
code emission location into the base class, instead of being in the derived classes.

This change means that low-level methods like emitByte/emitWord now are no longer
virtual (yaay for speed), and we now have a framework to support growable code
segments.  This implements feature request #1 of PR469.

llvm-svn: 28059
2006-05-02 18:27:26 +00:00
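
As a hedged sketch of the refactoring idea above (hypothetical class names, not the real MachineCodeEmitter interface): keeping the output cursor in the base class lets the hot byte/word emitters be plain inline calls, while only buffer management stays virtual, which also makes growable code buffers straightforward.

#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical sketch: the base class owns the current emission pointer, so
// emitByte/emitWord are non-virtual; only rarely-called hooks are virtual.
class CodeEmitterSketch {
protected:
  std::uint8_t *CurPtr = nullptr;     // next byte to write
  std::uint8_t *BufferEnd = nullptr;  // one past the last writable byte
public:
  virtual ~CodeEmitterSketch() = default;
  virtual void startFunction(std::size_t EstimatedSize) = 0;
  virtual void growBuffer() = 0;

  // Non-virtual hot path.
  void emitByte(std::uint8_t B) {
    if (CurPtr == BufferEnd)
      growBuffer();
    *CurPtr++ = B;
  }
  void emitWord(std::uint32_t W) {
    for (int i = 0; i < 4; ++i)
      emitByte(std::uint8_t(W >> (8 * i)));  // little-endian for the sketch
  }
};

// One possible concrete emitter writing into a growable vector.
class VectorEmitter : public CodeEmitterSketch {
  std::vector<std::uint8_t> Buf;
  void resetPointers(std::size_t Used) {
    CurPtr = Buf.data() + Used;
    BufferEnd = Buf.data() + Buf.size();
  }
public:
  void startFunction(std::size_t EstimatedSize) override {
    Buf.assign(EstimatedSize ? EstimatedSize : 64, 0);
    resetPointers(0);
  }
  void growBuffer() override {
    std::size_t Used = CurPtr - Buf.data();
    Buf.resize(Buf.size() ? Buf.size() * 2 : 64);
    resetPointers(Used);
  }
};
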
Nate Begeman 4971ba5f41 Print function number instead of name
llvm-svn: 28057
2006-05-02 17:36:46 +00:00
Chris Lattner 67a3aa2aaa Remove dead method
llvm-svn: 28055
2006-05-02 17:20:28 +00:00
Chris Lattner bb1c345ec6 Remove the debug machine code emitter. The "FilePrinterEmitter" is more
useful for debugging.

llvm-svn: 28051
2006-05-02 16:59:24 +00:00
Nate Begeman b9d4f8324d Extend printBasicBlockLabel a bit so that it can be used to print all
basic block labels, consolidating the code to do so in one place for each
target.

llvm-svn: 28050
2006-05-02 05:37:32 +00:00
Jeff Cohen 470f431f44 De-virtualize SwitchSection.
llvm-svn: 28047
2006-05-02 03:58:45 +00:00
Jeff Cohen f34ddb1e0d De-virtualize EmitZeroes.
llvm-svn: 28046
2006-05-02 03:46:13 +00:00
Jeff Cohen 24a62a9bc1 Make Intel syntax mode friendlier to Microsoft ML assembler (still needs more work).
llvm-svn: 28044
2006-05-02 01:16:28 +00:00
Chris Lattner fd0a5478a1 Fix a latent bug that my spiller patch last week exposed: we were leaving
instructions in the virtregfolded map that were deleted.  Because they
were deleted, newly allocated instructions could end up at the same address,
magically finding themselves in the map.  The solution is to remove entries
from the map when we delete the instructions.

llvm-svn: 28041
2006-05-01 22:03:24 +00:00
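
A simplified, hypothetical illustration of the hazard described above (not the actual spiller code): when a map is keyed by instruction pointers, an entry left behind after the instruction is freed can be matched by a later allocation that reuses the same address.

#include <map>

struct InstrStub { int Opcode; };

// Hypothetical stand-in for the virtreg-folded map, keyed by instruction address.
std::map<InstrStub *, int> VirtRegFolded;

void eraseAndDelete(InstrStub *MI) {
  // The fix the commit describes: drop the map entry before freeing the
  // instruction.  Otherwise a later `new InstrStub` can land at the same
  // address and "magically" appear to already have an entry in the map.
  VirtRegFolded.erase(MI);
  delete MI;
}
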
Chris Lattner ab7dbe0cc9 When promoting a load to a reg-reg copy, where the load was a previous
instruction folded with spill code, make sure to remove the load from
the virt reg folded map.

llvm-svn: 28040
2006-05-01 21:17:10 +00:00
Chris Lattner 4dee67c2cd Remove previous patch, which wasn't quite right.
llvm-svn: 28039
2006-05-01 21:16:03 +00:00
Evan Cheng 0d084fb9ca Dis-favor stores more
llvm-svn: 28035
2006-05-01 09:20:44 +00:00
Evan Cheng 24e795496d Bottom up register-pressure reduction scheduler now pushes store operations
up the schedule. This helps code that looks like this:

loads ...
computations (first set) ...
stores (first set) ...
loads
computations (second set) ...
stores (second set) ...

Without this change, the stores and computations are more likely to
interleave:

loads ...
loads ...
computations (first set) ...
computations (second set) ...
computations (first set) ...
stores (first set) ...
computations (second set) ...
stores (second set) ...

This can increase the number of spills if we are unlucky.

llvm-svn: 28033
2006-05-01 09:14:40 +00:00
Evan Cheng 10ff7b27ce Didn't mean for ScheduleDAGList.cpp to be part of the last checkin.
llvm-svn: 28030
2006-05-01 08:56:34 +00:00
Evan Cheng a656242690 Remove temp. option -spiller-check-liveout; it didn't cause any failures or performance regressions.
llvm-svn: 28029
2006-05-01 08:54:57 +00:00
Chris Lattner 25f55ae74a Format #APP lines a bit nicer
llvm-svn: 28026
2006-05-01 04:11:03 +00:00
Evan Cheng f71f0f2e0b Local spiller kills a store if the folded restore is turned into a copy.
But this is incorrect if the spilled value's live range extends beyond the
current BB.
It is currently controlled by a temporary option -spiller-check-liveout.

llvm-svn: 28024
2006-04-30 08:41:47 +00:00
Chris Lattner 2b48a94413 Remove a bogus transformation. This fixes SingleSource/UnitTests/2006-01-23-InitializedBitField.c
with some changes I have to the new CFE.

llvm-svn: 28022
2006-04-28 23:33:20 +00:00
Evan Cheng c5e8ce8b8c Remove the temporary option: -no-isel-fold-inflight
llvm-svn: 28012
2006-04-28 18:54:11 +00:00
Evan Cheng d43c5c6046 TargetLowering::LowerArguments should return a VBIT_CONVERT of
FORMAL_ARGUMENTS SDOperand in the return result vector.

llvm-svn: 28009
2006-04-28 05:25:15 +00:00
Chris Lattner 79c50d96c9 Mapping of physregs can make it so that the designated and input physregs are
the same.  In this case, don't emit a noop copy.

llvm-svn: 28008
2006-04-28 04:43:18 +00:00
Evan Cheng 51ab4498e7 Added a temporary option -no-isel-fold-inflight to control whether an "inflight"
node can be folded.

llvm-svn: 28003
2006-04-28 02:09:19 +00:00
Chris Lattner 84e95d00b5 When we have a two-address instruction where the input cannot be clobbered
and is already available, instead of falling back to emitting a load, fall
back to emitting a reg-reg copy.  This generates significantly better code
for some SSE testcases, as SSE has lots of two-address instructions and
none of them are read/modify/write.  As one example, this change does:

        pshufd %XMM5, XMMWORD PTR [%ESP + 84], 255
        xorps %XMM2, %XMM5
        cmpltps %XMM1, %XMM0
-       movaps XMMWORD PTR [%ESP + 52], %XMM0
-       movapd %XMM6, XMMWORD PTR [%ESP + 52]
+       movaps %XMM6, %XMM0
        cmpltps %XMM6, XMMWORD PTR [%ESP + 68]
        movapd XMMWORD PTR [%ESP + 52], %XMM6
        movaps %XMM6, %XMM0
        cmpltps %XMM6, XMMWORD PTR [%ESP + 36]
        cmpltps %XMM3, %XMM0
-       movaps XMMWORD PTR [%ESP + 20], %XMM0
-       movapd %XMM7, XMMWORD PTR [%ESP + 20]
+       movaps %XMM7, %XMM0
        cmpltps %XMM7, XMMWORD PTR [%ESP + 4]
        movapd XMMWORD PTR [%ESP + 20], %XMM7
        cmpltps %XMM4, %XMM0

... which is far better than a store followed by a load!

llvm-svn: 28001
2006-04-28 01:46:50 +00:00
Evan Cheng 3784f3c57c Insert a VBIT_CONVERT between a FORMAL_ARGUMENT node and its vector uses
(VAND, VADD, etc.). Legalizer will assert otherwise.

llvm-svn: 27991
2006-04-27 08:29:42 +00:00
Chris Lattner 393d96a56c Fix Regression/CodeGen/Generic/2006-04-26-SetCCAnd.ll and
PR748.

llvm-svn: 27987
2006-04-27 05:01:07 +00:00
Evan Cheng 9618df1190 Don't forget return void.
llvm-svn: 27974
2006-04-25 23:03:35 +00:00
Nate Begeman 866b4b4d45 Fix the updating of the machine CFG when a PHI node was in a successor of
the jump table's range check block.  This re-enables 100% dense jump tables
by default on PPC & x86

llvm-svn: 27952
2006-04-23 06:26:20 +00:00
Nate Begeman 3e04bb482b Code cleanup associated with jump tables, thanks to Chris for noticing
these.

llvm-svn: 27950
2006-04-22 23:52:35 +00:00
Nate Begeman ecb1dafd3d Turn off jump tables for a bit; there are still some issues to work out with
updating the machine CFG.

llvm-svn: 27949
2006-04-22 23:51:56 +00:00
Nate Begeman 4ca2ea5b43 JumpTable support! What this represents is working asm and jit support for
x86 and ppc for 100% dense switch statements when relocations are non-PIC.
This support will be extended and enhanced in the coming days to support
PIC, and less dense forms of jump tables.

llvm-svn: 27947
2006-04-22 18:53:45 +00:00
Chris Lattner b21d3bfd1f The BFS scheduler is apparently nondeterministic (causes many llvmgcc bootstrap
miscompares).  Switch RISC targets to use the list-td scheduler, which isn't.

llvm-svn: 27933
2006-04-21 17:16:16 +00:00
Chris Lattner 662e940f73 Fix a couple more memory issues
llvm-svn: 27930
2006-04-21 15:32:26 +00:00
Chris Lattner cc47ab3305 Fix a really subtle and obnoxious memory bug that caused issues with an
llvm-gcc4 bootstrap.  Whenever a node is deleted by the dag combiner, it
*must* be returned by the visit function, or the dag combiner will not
know that the node has been processed (and will, e.g., send it to the
target dag combine xforms).

llvm-svn: 27922
2006-04-20 23:55:59 +00:00
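
To make the contract above concrete, here is a hedged, standalone model of a worklist-driven combiner (hypothetical names, not the real DAGCombiner): the visit callback must report any replacement so the driver knows the old node is dead and never hands it to later combine steps.

#include <functional>
#include <unordered_set>
#include <vector>

struct NodeStub { int Op; };

// Hypothetical combiner loop.  If Visit replaces (or deletes) a node it must
// return the replacement; a nullptr return means "unchanged".
void combineAll(std::vector<NodeStub *> Worklist,
                const std::function<NodeStub *(NodeStub *)> &Visit) {
  std::unordered_set<NodeStub *> Dead;
  while (!Worklist.empty()) {
    NodeStub *N = Worklist.back();
    Worklist.pop_back();
    if (Dead.count(N))
      continue;                        // already replaced; never touch it again
    if (NodeStub *Repl = Visit(N)) {   // non-null => N was replaced/deleted
      Dead.insert(N);                  // the driver now knows N is gone
      Worklist.push_back(Repl);        // process the replacement instead
    }
  }
}
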
Chris Lattner dcc1f995eb This field no longer exists
llvm-svn: 27899
2006-04-20 18:32:41 +00:00
Chris Lattner a38c3580bd Remove some of the obvious V9-specific cruft
llvm-svn: 27893
2006-04-20 18:08:53 +00:00
Evan Cheng a320abc494 Turn a VAND into a VECTOR_SHUFFLE if applicable.
DAG combiner can turn a VAND V, <-1, 0, -1, -1>, i.e. vector clear elements,
into a vector shuffle with a zero vector. It only does so when TLI tells it
the xform is profitable.

llvm-svn: 27874
2006-04-20 08:56:16 +00:00
Chris Lattner bc1b262725 Implement folding of a bunch of binops with undef
llvm-svn: 27863
2006-04-20 05:39:12 +00:00
Chris Lattner 73eb58e1a2 Simplify some code
llvm-svn: 27846
2006-04-19 23:17:50 +00:00
Chris Lattner 916ae0775e Fix handling of calls in functions that use vectors. This fixes a crash on
the code in GCC PR26546.

llvm-svn: 27780
2006-04-17 22:10:08 +00:00
Chris Lattner bec79b4a59 Add a MachineInstr::eraseFromParent convenience method.
llvm-svn: 27775
2006-04-17 21:35:41 +00:00
Chris Lattner 326870b40b Codegen insertelement with constant insertion points as scalar_to_vector
and a shuffle.  For this:

void %test2(<4 x float>* %F, float %f) {
        %tmp = load <4 x float>* %F             ; <<4 x float>> [#uses=2]
        %tmp3 = add <4 x float> %tmp, %tmp              ; <<4 x float>> [#uses=1]
        %tmp2 = insertelement <4 x float> %tmp3, float %f, uint 2               ; <<4 x float>> [#uses=2]
        %tmp6 = add <4 x float> %tmp2, %tmp2            ; <<4 x float>> [#uses=1]
        store <4 x float> %tmp6, <4 x float>* %F
        ret void
}

we now get this on X86 (which will get better):

_test2:
        movl 4(%esp), %eax
        movaps (%eax), %xmm0
        addps %xmm0, %xmm0
        movaps %xmm0, %xmm1
        shufps $3, %xmm1, %xmm1
        movaps %xmm0, %xmm2
        shufps $1, %xmm2, %xmm2
        unpcklps %xmm1, %xmm2
        movss 8(%esp), %xmm1
        unpcklps %xmm1, %xmm0
        unpcklps %xmm2, %xmm0
        addps %xmm0, %xmm0
        movaps %xmm0, (%eax)
        ret

instead of:

_test2:
        subl $28, %esp
        movl 32(%esp), %eax
        movaps (%eax), %xmm0
        addps %xmm0, %xmm0
        movaps %xmm0, (%esp)
        movss 36(%esp), %xmm0
        movss %xmm0, 8(%esp)
        movaps (%esp), %xmm0
        addps %xmm0, %xmm0
        movaps %xmm0, (%eax)
        addl $28, %esp
        ret

llvm-svn: 27765
2006-04-17 19:21:01 +00:00
Chris Lattner 91226e5799 Add support for promoting stores from one legal type to another, allowing us
to write one pattern for vector stores instead of 4.

llvm-svn: 27730
2006-04-16 01:36:45 +00:00
Chris Lattner 7e7ad593cc Make these predicates return true for bit_convert(buildvector)'s as well as
buildvectors.

llvm-svn: 27723
2006-04-15 23:38:00 +00:00
Chris Lattner 086e986e94 Make this assertion better
llvm-svn: 27695
2006-04-14 06:08:35 +00:00
Reid Spencer 709eaacb36 Expand some code with temporary variables to rid ourselves of the warning
about "dereferencing type-punned pointer will break strict-aliasing rules"

llvm-svn: 27671
2006-04-13 18:29:58 +00:00
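
For context, a generic example of the kind of change that silences this warning (not the actual LLVM code): copy the bytes through a temporary with memcpy instead of reinterpreting them through a cast pointer.

#include <cstdint>
#include <cstring>

// Type-punning through a pointer cast breaks strict-aliasing rules:
//   std::uint32_t Bits = *(std::uint32_t *)&F;   // warns, and is UB
// Copying through a temporary with std::memcpy is well defined and
// compiles to the same single load on common compilers.
std::uint32_t floatBits(float F) {
  std::uint32_t Bits;
  static_assert(sizeof(Bits) == sizeof(F), "size mismatch");
  std::memcpy(&Bits, &F, sizeof(Bits));
  return Bits;
}
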
Evan Cheng 119266ea92 Promote vector AND, OR, and XOR
llvm-svn: 27632
2006-04-12 21:20:24 +00:00
Evan Cheng be8a8933e6 Vector type promotion for ISD::LOAD and ISD::SELECT
llvm-svn: 27606
2006-04-12 16:33:18 +00:00
Chris Lattner d3b504ae10 Implement support for the formal_arguments node. To get this, targets should custom legalize it and remove their XXXTargetLowering::LowerArguments overload.
llvm-svn: 27604
2006-04-12 16:20:43 +00:00
Chris Lattner 417b96b6dd Don't memoize vloads in the load map! Don't memoize them anywhere here, let
getNode do it.  This fixes CodeGen/Generic/2006-04-11-vecload.ll

llvm-svn: 27602
2006-04-12 03:25:41 +00:00
Evan Cheng 7256b0ae05 Only get Tmp2 for cases where the number of operands is > 1. Fixed return void.
llvm-svn: 27586
2006-04-11 06:33:39 +00:00
Chris Lattner 6cf3bbbe17 add some todos
llvm-svn: 27580
2006-04-11 02:00:08 +00:00
Chris Lattner 2eb22eef7d Add basic support for legalizing returns of vectors
llvm-svn: 27578
2006-04-11 01:31:51 +00:00
Jim Laskey dca2655daa Use existing information.
llvm-svn: 27574
2006-04-10 23:09:19 +00:00
Evan Cheng cb73b8d419 Missing break
llvm-svn: 27559
2006-04-10 18:54:36 +00:00
Chris Lattner 02274a5265 Add code generator support for VSELECT
llvm-svn: 27542
2006-04-08 22:22:57 +00:00
Chris Lattner e1401e3610 Canonicalize vvector_shuffle(x,x) -> vvector_shuffle(x,undef) to enable patterns
to match again :)

llvm-svn: 27533
2006-04-08 05:34:25 +00:00
Chris Lattner 098c01e94e Codegen shufflevector as VVECTOR_SHUFFLE
llvm-svn: 27529
2006-04-08 04:15:24 +00:00
Chris Lattner 101ea66813 add a sanity check: LegalizeOp should return a value that is the same type
as its input.

llvm-svn: 27528
2006-04-08 04:13:17 +00:00
Evan Cheng 78e3d565af INSERT_VECTOR_ELT lowering bug:
  store vector to $esp
  store element to $esp + sizeof(VT) * index
  load  vector from $esp
The bug is that VT is the type of the vector element, not the type of the vector!

llvm-svn: 27517
2006-04-08 01:46:37 +00:00
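
A byte-level model of the expansion above, as a hedged standalone sketch (it mirrors the stack-slot trick, not the SelectionDAG code): the slot must be sized by the vector type; the bug was sizing it by the element type instead.

#include <cstring>
#include <vector>

// Hypothetical model: spill the whole vector to a stack slot, overwrite one
// element in place, then reload the vector.
void insertEltViaStack(std::vector<float> &Vec, unsigned Index, float Elt) {
  std::vector<unsigned char> Slot(Vec.size() * sizeof(float)); // sizeof(vector VT), not element VT
  std::memcpy(Slot.data(), Vec.data(), Slot.size());           // store vector to slot
  std::memcpy(Slot.data() + sizeof(float) * Index, &Elt,
              sizeof(float));                                   // store element at slot + sizeof(EltVT) * index
  std::memcpy(Vec.data(), Slot.data(), Slot.size());           // load vector back
}
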
Chris Lattner aa3185f12e Stub out shufflevector
llvm-svn: 27514
2006-04-08 01:19:25 +00:00
Jim Laskey 7d4592733c Remove the section change at function end, preventing override of the function's real
section.

llvm-svn: 27503
2006-04-08 00:35:59 +00:00
Jim Laskey c0d6518f27 Make sure that debug labels are defined within the same section and after the
entry point of a function.

llvm-svn: 27494
2006-04-07 20:44:42 +00:00
Jim Laskey 2d7298c362 Foundation for call frame information.
llvm-svn: 27491
2006-04-07 16:34:46 +00:00
Evan Cheng 613996c55e 1. If both vector operands of a vector_shuffle are undef, turn it into an undef.
2. A shuffle mask element can also be an undef.

llvm-svn: 27472
2006-04-06 23:20:43 +00:00
Chris Lattner 4a2413a590 Make a vector live across blocks have the correct Vec type. This fixes
CodeGen/X86/2006-04-04-CrossBlockCrash.ll

llvm-svn: 27436
2006-04-05 06:54:42 +00:00
Evan Cheng 9fa8959dce Expand a VECTOR_SHUFFLE to a BUILD_VECTOR if the target asks for it to be expanded
or custom lowering fails.

llvm-svn: 27432
2006-04-05 06:07:11 +00:00
Chris Lattner 4ea52cac01 Do not create ZEXTLOAD's unless we are before legalize or the operation is
legal.

llvm-svn: 27402
2006-04-04 17:39:18 +00:00
Chris Lattner 6be79823e7 * Add support for SCALAR_TO_VECTOR operations where the input needs to be
promoted/expanded (e.g. SCALAR_TO_VECTOR from i8/i16 on PPC).
* Add support for targets to request that VECTOR_SHUFFLE nodes be promoted
  to a canonical type, for example, we only want v16i8 shuffles on PPC.
* Move isShuffleLegal out of TLI into Legalize.
* Teach isShuffleLegal to allow shuffles that need to be promoted.

llvm-svn: 27399
2006-04-04 17:23:26 +00:00
Chris Lattner a9e77d14c7 Constant fold bitconvert(undef)
llvm-svn: 27391
2006-04-04 01:02:22 +00:00
Chris Lattner b710a81e54 The stack alignment is now computed dynamically, just verify it is correct.
llvm-svn: 27380
2006-04-03 21:39:57 +00:00
Chris Lattner 6bc4b9c7f8 Remove unused method
llvm-svn: 27379
2006-04-03 21:39:03 +00:00
Chris Lattner e1e3adf802 Add a missing check, this fixes UnitTests/Vector/sumarray.c
llvm-svn: 27375
2006-04-03 17:29:28 +00:00
Chris Lattner 04c00fc844 Add a missing check, which broke a bunch of vector tests.
llvm-svn: 27374
2006-04-03 17:21:50 +00:00
Andrew Lenharth 94f012f606 back this out
llvm-svn: 27367
2006-04-03 03:16:50 +00:00
Andrew Lenharth 015eaf5f33 This should be a win on every arch
llvm-svn: 27364
2006-04-02 21:42:45 +00:00
Chris Lattner 4993249a04 Add a little dag combine to compile this:
int %AreSecondAndThirdElementsBothNegative(<4 x float>* %in) {
entry:
        %tmp1 = load <4 x float>* %in           ; <<4 x float>> [#uses=1]
        %tmp = tail call int %llvm.ppc.altivec.vcmpgefp.p( int 1, <4 x float> < float 0x7FF8000000000000, float 0.000000e+00, float 0.000000e+00, float 0x7FF8000000000000 >, <4 x float> %tmp1 )           ; <int> [#uses=1]
        %tmp = seteq int %tmp, 0                ; <bool> [#uses=1]
        %tmp3 = cast bool %tmp to int           ; <int> [#uses=1]
        ret int %tmp3
}

into this:

_AreSecondAndThirdElementsBothNegative:
        mfspr r2, 256
        oris r4, r2, 49152
        mtspr 256, r4
        li r4, lo16(LCPI1_0)
        lis r5, ha16(LCPI1_0)
        lvx v0, 0, r3
        lvx v1, r5, r4
        vcmpgefp. v0, v1, v0
        mfcr r3, 2
        rlwinm r3, r3, 27, 31, 31
        mtspr 256, r2
        blr

instead of this:

_AreSecondAndThirdElementsBothNegative:
        mfspr r2, 256
        oris r4, r2, 49152
        mtspr 256, r4
        li r4, lo16(LCPI1_0)
        lis r5, ha16(LCPI1_0)
        lvx v0, 0, r3
        lvx v1, r5, r4
        vcmpgefp. v0, v1, v0
        mfcr r3, 2
        rlwinm r3, r3, 27, 31, 31
        xori r3, r3, 1
        cntlzw r3, r3
        srwi r3, r3, 5
        mtspr 256, r2
        blr

llvm-svn: 27356
2006-04-02 06:11:11 +00:00
Chris Lattner 42a5fca47e Implement promotion for EXTRACT_VECTOR_ELT, allowing v16i8 multiplies to work with PowerPC.
llvm-svn: 27349
2006-04-02 05:06:04 +00:00
Chris Lattner 87f080949b Implement the Expand action for binary vector operations to break the binop
into elements and operate on each piece.  This allows generic vector integer
multiplies to work on PPC, though the generated code is horrible.

llvm-svn: 27347
2006-04-02 03:57:31 +00:00
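
A minimal sketch of the Expand strategy described above, written as plain C++ over ordinary vectors rather than DAG nodes (hypothetical names): split the vector binop into one scalar operation per element and rebuild the result.

#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical per-element expansion: extract each pair of elements, apply the
// scalar operation, and collect the results (a BUILD_VECTOR in the real DAG).
std::vector<int> expandVectorBinOp(const std::vector<int> &A,
                                   const std::vector<int> &B,
                                   const std::function<int(int, int)> &ScalarOp) {
  std::vector<int> Result(A.size());
  for (std::size_t i = 0; i < A.size(); ++i)
    Result[i] = ScalarOp(A[i], B[i]);
  return Result;
}
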
Chris Lattner a9c59156be Intrinsics that just load from memory can be treated like loads: they don't
have to serialize against each other.  This allows us to schedule lvx's
across each other, for example.

llvm-svn: 27346
2006-04-02 03:41:14 +00:00
Chris Lattner 0442a18758 Constant fold all of the vector binops. This allows us to compile this:
"vector unsigned char mergeLowHigh = (vector unsigned char)
( 8, 9, 10, 11, 16, 17, 18, 19, 12, 13, 14, 15, 20, 21, 22, 23 );
vector unsigned char mergeHighLow = vec_xor( mergeLowHigh, vec_splat_u8(8));"

aka:

void %test2(<16 x sbyte>* %P) {
  store <16 x sbyte> cast (<4 x int> xor (<4 x int> cast (<16 x ubyte> < ubyte 8, ubyte 9, ubyte 10, ubyte 11, ubyte 16, ubyte 17, ubyte 18, ubyte 19, ubyte 12, ubyte 13, ubyte 14, ubyte 15, ubyte 20, ubyte 21, ubyte 22, ubyte 23 > to <4 x int>), <4 x int> cast (<16 x sbyte> < sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8 > to <4 x int>)) to <16 x sbyte>), <16 x sbyte> * %P
  ret void
}

into this:

_test2:
        mfspr r2, 256
        oris r4, r2, 32768
        mtspr 256, r4
        li r4, lo16(LCPI2_0)
        lis r5, ha16(LCPI2_0)
        lvx v0, r5, r4
        stvx v0, 0, r3
        mtspr 256, r2
        blr

instead of this:

_test2:
        mfspr r2, 256
        oris r4, r2, 49152
        mtspr 256, r4
        li r4, lo16(LCPI2_0)
        lis r5, ha16(LCPI2_0)
        vspltisb v0, 8
        lvx v1, r5, r4
        vxor v0, v1, v0
        stvx v0, 0, r3
        mtspr 256, r2
        blr

... which occurs here:
http://developer.apple.com/hardware/ve/calcspeed.html

llvm-svn: 27343
2006-04-02 03:25:57 +00:00
Chris Lattner ef598059f2 Add a new -view-legalize-dags command line option
llvm-svn: 27342
2006-04-02 03:07:27 +00:00
Chris Lattner e4e64b6b85 Implement constant folding of bit_convert of arbitrary constant vbuild_vector nodes.
llvm-svn: 27341
2006-04-02 02:53:43 +00:00
Chris Lattner 1c22728787 These entries already exist
llvm-svn: 27340
2006-04-02 02:51:27 +00:00
Chris Lattner 1985e1cbb8 Add some missing node names
llvm-svn: 27339
2006-04-02 02:41:18 +00:00
Chris Lattner bec582f4cd Prefer larger register classes over smaller ones when a register occurs in
multiple register classes.  This fixes PowerPC/2006-04-01-FloatDoubleExtend.ll

llvm-svn: 27334
2006-04-02 00:24:45 +00:00
Chris Lattner 39dcf1a9e2 Delete identity shuffles, implementing CodeGen/Generic/vector-identity-shuffle.ll
llvm-svn: 27317
2006-03-31 22:16:43 +00:00
Chris Lattner d9e4daabd2 Do not endian swap split vector loads. This fixes UnitTests/Vector/sumarray-dbl on PPC.
Now all UnitTests/Vector/* tests pass on PPC.

llvm-svn: 27299
2006-03-31 18:22:37 +00:00
Chris Lattner 8d90f526d7 Do not endian swap the operands to a store if the operands came from a vector.
This fixes UnitTests/Vector/simple.c with altivec.

llvm-svn: 27298
2006-03-31 18:20:46 +00:00
Chris Lattner 7e30af3887 Remove dead *extloads. This allows us to codegen vector.ll:test_extract_elt
to:

test_extract_elt:
        alloc r3 = ar.pfs,0,1,0,0
        adds r8 = 12, r32
        ;;
        ldfs f8 = [r8]
        mov ar.pfs = r3
        br.ret.sptk.many rp

instead of:

test_extract_elt:
        alloc r3 = ar.pfs,0,1,0,0
        adds r8 = 28, r32
        adds r9 = 24, r32
        adds r10 = 20, r32
        adds r11 = 16, r32
        ;;
        ldfs f6 = [r8]
        ;;
        ldfs f6 = [r9]
        adds r8 = 12, r32
        adds r9 = 8, r32
        adds r14 = 4, r32
        ;;
        ldfs f6 = [r10]
        ;;
        ldfs f6 = [r11]
        ldfs f8 = [r8]
        ;;
        ldfs f6 = [r9]
        ;;
        ldfs f6 = [r14]
        ;;
        ldfs f6 = [r32]
        mov ar.pfs = r3
        br.ret.sptk.many rp

llvm-svn: 27297
2006-03-31 18:10:41 +00:00
Chris Lattner 2d8551c85b Delete dead loads in the dag. This allows us to compile
vector.ll:test_extract_elt2 into:

_test_extract_elt2:
        lfd f1, 32(r3)
        blr

instead of:

_test_extract_elt2:
        lfd f0, 56(r3)
        lfd f0, 48(r3)
        lfd f0, 40(r3)
        lfd f1, 32(r3)
        lfd f0, 24(r3)
        lfd f0, 16(r3)
        lfd f0, 8(r3)
        lfd f0, 0(r3)
        blr

llvm-svn: 27296
2006-03-31 18:06:18 +00:00
Chris Lattner 6f42325dca Implement PromoteOp for VEXTRACT_VECTOR_ELT. This fixes
Generic/vector.ll:test_extract_elt on non-SSE X86 systems.

llvm-svn: 27294
2006-03-31 17:55:51 +00:00
Chris Lattner 8e1fcab2bc Scalarized vector stores need not be legal, e.g. if the vector element type
needs to be promoted or expanded.  Relegalize the scalar store once created.
This fixes CodeGen/Generic/vector.ll:test1 on non-SSE x86 targets.

llvm-svn: 27293
2006-03-31 17:37:22 +00:00
Chris Lattner ba38035e21 Make sure to pass enough values to phi nodes when we are dealing with
decimated vectors.  This fixes UnitTests/Vector/sumarray-dbl.c

llvm-svn: 27280
2006-03-31 02:12:18 +00:00
Chris Lattner 5fe1f54c17 Significantly improve handling of vectors that are live across basic blocks,
handling cases where the vector elements need promotion, expansion, and when
the vector type itself needs to be decimated.

llvm-svn: 27278
2006-03-31 02:06:56 +00:00
Evan Cheng 168e45b0b3 Expand INSERT_VECTOR_ELT to store vec, sp; store elt, sp+k; vec = load sp;
llvm-svn: 27274
2006-03-31 01:27:51 +00:00
Chris Lattner 67271869a8 Bug fixes: handle constantexpr insert/extract element operations
Handle constantpacked vectors with constantexpr elements.

This fixes CodeGen/Generic/vector-constantexpr.ll

llvm-svn: 27241
2006-03-29 00:11:43 +00:00
Chris Lattner 20e619fba3 When building a VVECTOR_SHUFFLE node from extract_element operations, make
sure to build it as SHUFFLE(X, undef, mask), not SHUFFLE(X, X, mask).

The latter is not canonical form, and prevents the PPC splat pattern from
matching.  For a particular splat, we go from generating this:

	li r10, lo16(LCPI1_0)
	lis r11, ha16(LCPI1_0)
	lvx v3, r11, r10
	vperm v3, v2, v2, v3

to generating:

	vspltw v3, v2, 3

llvm-svn: 27236
2006-03-28 22:19:47 +00:00
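
A hedged sketch of the mask rewrite behind this canonicalization (standalone, hypothetical names): when both shuffle inputs are the same vector, indices selecting from the second operand are folded back onto the first, and the second operand becomes undef, i.e. SHUFFLE(X, X, Mask) -> SHUFFLE(X, undef, Mask'). Fixed-mask patterns such as the PPC splat are written against that canonical form.

#include <vector>

// Hypothetical mask remapping for a self-shuffle with NumElts elements per
// input: element i of the "second" X is just element i of the first, so any
// index >= NumElts is reduced modulo the vector width.  The caller then
// replaces the second operand with undef.
std::vector<int> canonicalizeSelfShuffleMask(std::vector<int> Mask,
                                             unsigned NumElts) {
  for (int &Idx : Mask)
    if (Idx >= int(NumElts))
      Idx -= int(NumElts);
  return Mask;
}
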
Chris Lattner a46dfe80c8 Canonicalize VECTOR_SHUFFLE(X, X, Y) -> VECTOR_SHUFFLE(X,undef,Y')
llvm-svn: 27235
2006-03-28 22:11:53 +00:00
Chris Lattner c9992548fc Turn a series of extract_element's feeding a build_vector into a
vector_shuffle node.  For this:

void test(__m128 *res, __m128 *A, __m128 *B) {
  *res = _mm_unpacklo_ps(*A, *B);
}

we now produce this code:

_test:
        movl 8(%esp), %eax
        movaps (%eax), %xmm0
        movl 12(%esp), %eax
        unpcklps (%eax), %xmm0
        movl 4(%esp), %eax
        movaps %xmm0, (%eax)
        ret

instead of this:

_test:
        subl $76, %esp
        movl 88(%esp), %eax
        movaps (%eax), %xmm0
        movaps %xmm0, (%esp)
        movaps %xmm0, 32(%esp)
        movss 4(%esp), %xmm0
        movss 32(%esp), %xmm1
        unpcklps %xmm0, %xmm1
        movl 84(%esp), %eax
        movaps (%eax), %xmm0
        movaps %xmm0, 16(%esp)
        movaps %xmm0, 48(%esp)
        movss 20(%esp), %xmm0
        movss 48(%esp), %xmm2
        unpcklps %xmm0, %xmm2
        unpcklps %xmm1, %xmm2
        movl 80(%esp), %eax
        movaps %xmm2, (%eax)
        addl $76, %esp
        ret

GCC produces this (with -fomit-frame-pointer):

_test:
        subl    $12, %esp
        movl    20(%esp), %eax
        movaps  (%eax), %xmm0
        movl    24(%esp), %eax
        unpcklps        (%eax), %xmm0
        movl    16(%esp), %eax
        movaps  %xmm0, (%eax)
        addl    $12, %esp
        ret

llvm-svn: 27233
2006-03-28 20:28:38 +00:00
Chris Lattner f6f94d3bce Teach Legalize how to pack VVECTOR_SHUFFLE nodes into VECTOR_SHUFFLE nodes.
llvm-svn: 27232
2006-03-28 20:24:43 +00:00
Chris Lattner 8d57da2ffc new node
llvm-svn: 27231
2006-03-28 19:54:42 +00:00
Chris Lattner b7163598f9 Don't crash on X^X if X is a vector. Instead, produce a vector of zeros.
llvm-svn: 27229
2006-03-28 19:11:05 +00:00
Chris Lattner ffec47ebff Add an assertion
llvm-svn: 27228
2006-03-28 19:04:49 +00:00
Jim Laskey dea0348853 Refactor address attributes. Add base register to frame info.
llvm-svn: 27226
2006-03-28 14:58:32 +00:00
Jim Laskey 67a636c587 More bulletproofing of llvm.dbg.declare.
llvm-svn: 27224
2006-03-28 13:45:20 +00:00
Jim Laskey 8374e9c4eb More bulletproofing of DebugInfoDesc verify.
llvm-svn: 27203
2006-03-28 01:30:18 +00:00
Chris Lattner e55d171ccd Tblgen doesn't like multiple SDNode<> definitions that map to the same enum value. Split them into separate enums.
llvm-svn: 27201
2006-03-28 00:40:33 +00:00
Jim Laskey d387cc5cde Reactivate llvm.dbg.declare.
llvm-svn: 27192
2006-03-27 23:31:10 +00:00
Chris Lattner 5bb1d90afd Disable dbg_declare, it currently breaks the CFE build
llvm-svn: 27182
2006-03-27 21:36:03 +00:00
Chris Lattner d5f94c9574 Fix legalization of intrinsics with chain and result values
llvm-svn: 27181
2006-03-27 20:28:29 +00:00
Chris Lattner 0e84f1e532 Unbreak the build on non-apple compilers :-(
llvm-svn: 27173
2006-03-27 16:10:59 +00:00
Evan Cheng d09b05b0bc Try again
llvm-svn: 27171
2006-03-27 08:10:26 +00:00
Evan Cheng 64efb35c32 Incorrect check for FP all-ones
llvm-svn: 27169
2006-03-27 07:26:17 +00:00
Evan Cheng c70e33cd6e Change isBuildVectorAllOnesInteger to isBuildVectorAllOnes. Also check for
floating point cases.

llvm-svn: 27165
2006-03-27 06:58:47 +00:00
Chris Lattner 52fcad3a37 Instead of printing "INTRINSIC" on intrinsic node, print the intrinsic name.
llvm-svn: 27164
2006-03-27 06:45:25 +00:00
Jim Laskey 6e792f9ff8 Pass llvm/test/Regression/CodeGen/Generic/debug-info.ll.
llvm-svn: 27158
2006-03-27 01:51:47 +00:00
Nate Begeman ed728c1291 SelectionDAGISel can now natively handle Switch instructions, in the same
manner that the LowerSwitch LLVM to LLVM pass does: emitting a binary
search tree of basic blocks.  The new approach has several advantages:
it is faster, it generates significantly smaller code in many cases, and
it paves the way for implementing dense switch tables as a jump table by
handling switches directly in the instruction selector.

This functionality is currently only enabled on x86, but should be safe for
every target.  In anticipation of making it the default, the cfg is now
properly updated in the x86, ppc, and sparc select lowering code.

llvm-svn: 27156
2006-03-27 01:32:24 +00:00
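
To illustrate the control flow such a lowering produces, a standalone sketch under assumed, hypothetical names (this models the search the emitted block tree performs; in the selector each comparison lives in its own basic block):

#include <utility>
#include <vector>

// Hypothetical dispatcher over sorted (case value, destination block) pairs:
// each loop iteration corresponds to one compare-and-branch block in the
// balanced binary search tree the commit describes.
int dispatchSwitch(const std::vector<std::pair<int, int>> &SortedCases,
                   int Value, int DefaultDest) {
  int Lo = 0, Hi = int(SortedCases.size()) - 1;
  while (Lo <= Hi) {
    int Mid = Lo + (Hi - Lo) / 2;
    if (Value == SortedCases[Mid].first)
      return SortedCases[Mid].second;   // branch to the matching case block
    if (Value < SortedCases[Mid].first)
      Hi = Mid - 1;                     // descend into the left subtree
    else
      Lo = Mid + 1;                     // descend into the right subtree
  }
  return DefaultDest;                   // fell out of the tree: default block
}
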
Jim Laskey 7092888bcc Bulletproof against undefined args produced by upgrading old-style debug info.
llvm-svn: 27155
2006-03-26 22:46:27 +00:00
Jim Laskey 84c2f0a705 How to be dumb on $5/day. Need a tri-state to track valid debug descriptors.
llvm-svn: 27154
2006-03-26 22:45:20 +00:00
Evan Cheng a67899195f Add ISD::isBuildVectorAllZeros predicate
llvm-svn: 27147
2006-03-26 09:50:58 +00:00
Chris Lattner 30ee72586d Allow targets to custom lower their own intrinsics if desired.
llvm-svn: 27146
2006-03-26 09:12:51 +00:00
Chris Lattner b6e2d0027a Add some comments.
llvm-svn: 27133
2006-03-25 23:00:56 +00:00
Chris Lattner f6e3b957b8 Fix a bug in ISD::isBuildVectorAllOnesInteger that caused it to always return
false

llvm-svn: 27131
2006-03-25 22:59:28 +00:00
Chris Lattner c2d2811a07 Implement the ISD::isBuildVectorAllOnesInteger predicate
llvm-svn: 27130
2006-03-25 22:57:01 +00:00
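
As a hedged illustration of what such a predicate checks (standalone model, not the SelectionDAG walk over BUILD_VECTOR operands; names are hypothetical): every element must be a constant whose low bits are all ones at the element's bit width.

#include <cstdint>
#include <vector>

// Hypothetical all-ones check over a build vector modelled as a list of
// element constants plus a per-element bit width.
bool isBuildVectorAllOnesSketch(const std::vector<std::uint64_t> &Elts,
                                unsigned Bits) {
  if (Elts.empty() || Bits == 0 || Bits > 64)
    return false;
  std::uint64_t Mask = Bits == 64 ? ~0ULL : ((1ULL << Bits) - 1);
  for (std::uint64_t E : Elts)
    if ((E & Mask) != Mask)
      return false;   // any element that is not all ones disqualifies it
  return true;
}
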
Chris Lattner dc1eab5886 Don't call SimplifyDemandedBits on vectors
llvm-svn: 27128
2006-03-25 22:19:00 +00:00
Chris Lattner 313229c74b fix inverted conditional
llvm-svn: 27089
2006-03-24 22:49:42 +00:00
Jim Laskey 3324c7236f Hack no more.
llvm-svn: 27079
2006-03-24 21:10:36 +00:00
Evan Cheng 68d9bf26c8 Only do the vector shuffle for {x,x,y,y} cases when SCALAR_TO_VECTOR is free.
llvm-svn: 27071
2006-03-24 18:45:20 +00:00
Jim Laskey 53f1ecc560 Rename for truth in advertising.
llvm-svn: 27063
2006-03-24 09:50:27 +00:00
Chris Lattner 77e271cb4e prefer to generate constant pool loads over splats. This prevents us from
using a splat for {1.0,1.0,1.0,1.0}

llvm-svn: 27055
2006-03-24 07:29:17 +00:00
Chris Lattner 87b1dddb1c fix spello
llvm-svn: 27053
2006-03-24 07:15:07 +00:00
Chris Lattner a4f6805a86 legalize vbit_convert nodes whose result is a legal type.
Legalize intrinsic nodes.

llvm-svn: 27036
2006-03-24 02:26:29 +00:00
Chris Lattner d96b09a7b9 Lower target intrinsics into an INTRINSIC node
llvm-svn: 27035
2006-03-24 02:22:33 +00:00
Chris Lattner 6b05290922 fix some bogus assertions: noop bitconverts are legal
llvm-svn: 27032
2006-03-24 02:20:47 +00:00
Evan Cheng 1d2e995fc1 Lower BUILD_VECTOR to VECTOR_SHUFFLE if there are two distinct nodes (and if
the target can handle it). Issue two SCALAR_TO_VECTOR ops followed by a
VECTOR_SHUFFLE to select from the two vectors.

llvm-svn: 27023
2006-03-24 01:17:21 +00:00
Chris Lattner ebac9a4adf Identify the INTRINSIC node
llvm-svn: 27020
2006-03-24 01:04:30 +00:00
Jim Laskey fb39d2a7f7 Unneeded forward.
llvm-svn: 27004
2006-03-23 23:05:52 +00:00
Jim Laskey f7cfa52e7a Make sure types are allocated in the scope of their use.
llvm-svn: 27002
2006-03-23 23:02:34 +00:00
Chris Lattner d7c4e7d255 add support for splitting casts. This implements
CodeGen/Generic/vector.ll:test_cast_2.

llvm-svn: 26999
2006-03-23 21:16:34 +00:00
Jim Laskey 2b74656f25 Generate local variable and scope information and equivalent dwarf forms.
llvm-svn: 26989
2006-03-23 18:07:55 +00:00
Jim Laskey a8bdac875d Handle new forms of llvm.dbg intrinsics.
llvm-svn: 26988
2006-03-23 18:06:46 +00:00
Chris Lattner 9ea1b3f9fd simplify some code
llvm-svn: 26972
2006-03-23 05:29:04 +00:00
Chris Lattner b893d04a67 Fix a typo
llvm-svn: 26965
2006-03-22 22:20:49 +00:00
Chris Lattner 2f4119a608 Implement simple support for vector casting. This can currently only handle
casts between legal vector types.

llvm-svn: 26961
2006-03-22 20:09:35 +00:00
Chris Lattner 8fa445a89d Endianness does not affect the order of vector fields. This fixes
SingleSource/UnitTests/Vector/build.c

llvm-svn: 26936
2006-03-22 01:46:54 +00:00