llvm-project

Commit Graph

Author	SHA1	Message	Date
Oscar Fuentes	a229b3c9a7	Initial support for the CMake build system. llvm-svn: 56419	2008-09-22 01:08:49 +00:00
Bill Wendling	91ef8fcd29	Add helper function to get a 32-bit floating point constant. No functionality change. llvm-svn: 56418	2008-09-22 00:44:35 +00:00
Chris Lattner	43f5449c48	don't print GlobalAddressSDNode's with an offset of zero as "foo0". llvm-svn: 56399	2008-09-21 18:38:31 +00:00
Dan Gohman	9801ba451a	Refactor X86SelectConstAddr, folding it into X86SelectAddress. This results in better code for globals. Also, unbreak the local CSE for GlobalValue stub loads. llvm-svn: 56371	2008-09-19 22:16:54 +00:00
Dan Gohman	95be7d7b85	Add a new "fast" scheduler. This is currently basically just a copy of the BURRList scheduler, but with several parts ripped out, such as backtracking, online topological sort maintenance (needed by backtracking), the priority queue, and Sethi-Ullman number computation and maintenance (needed by the priority queue). As a result of all this, it generates somewhat lower quality code, but that's its tradeoff for running about 30% faster than list-burr in -fast mode in many cases. This is somewhat experimental. Moving forward, major pieces of this can be refactored with pieces in common with ScheduleDAGRRList.cpp. llvm-svn: 56307	2008-09-18 16:26:26 +00:00
Dale Johannesen	f8610ebebc	Add a bit to mark operands of asm's that conflict with an earlyclobber operand elsewhere. Propagate this bit and the earlyclobber bit through SDISel. Change linear-scan RA not to allocate regs in a way that conflicts with an earlyclobber. See also comments. llvm-svn: 56290	2008-09-17 21:13:11 +00:00
Dan Gohman	6ab52a8018	Don't worry about clobbering physical register defs that aren't used. llvm-svn: 56281	2008-09-17 15:25:49 +00:00
Evan Cheng	a904f466e8	When converting a CopyFromReg to a copy instruction, use the register class of its uses to determine the right destination register class of the copy. This is important for targets where a physical register may belong to multiple register classes. llvm-svn: 56258	2008-09-16 23:12:11 +00:00
Dan Gohman	64d6c6fe30	Change SelectionDAG::getConstantPool to always set the alignment of the ConstantPoolSDNode, using the target's preferred alignment for the constant type. In LegalizeDAG, when performing loads from the constant pool, the ConstantPoolSDNode's alignment is used in the calls to getLoad and getExtLoad. This change prevents SelectionDAG::getLoad/getExtLoad from incorrectly choosing the ABI alignment for constant pool loads when Alignment == 0. The incorrect alignment is only a performance issue when ABI alignment does not equal preferred alignment (i.e., on x86 it was generating MOVUPS instead of MOVAPS for v4f32 constant loads when the default ABI alignment for 128bit vectors is forced to 1 byte.) Patch by Paul Redmond! llvm-svn: 56253	2008-09-16 22:05:41 +00:00
Bill Wendling	24c79f28b1	Reverting r56249. On further investigation, this functionality isn't needed. Apologies for the thrashing. llvm-svn: 56251	2008-09-16 21:48:12 +00:00
Dan Gohman	ab26f20d44	Include the alignment value when displaying ConstantPoolSDNodes. llvm-svn: 56250	2008-09-16 21:18:22 +00:00
Bill Wendling	8bc392fb1d	- Change "ExternalSymbolSDNode" to "SymbolSDNode". - Add linkage to SymbolSDNode (default to external). - Change ISD::ExternalSymbol to ISD::Symbol. - Change ISD::TargetExternalSymbol to ISD::TargetSymbol These changes pave the way to allowing SymbolSDNodes with non-external linkage. llvm-svn: 56249	2008-09-16 21:12:30 +00:00
Dan Gohman	050d7835c6	Don't take the time to CheckDAGForTailCallsAndFixThem when tail calls are not enabled. Instead just omit the tail call flag when calls are created. llvm-svn: 56235	2008-09-16 01:42:28 +00:00
Dan Gohman	3c7b9ba547	Re-enable SelectionDAG CSE for calls. It matters in the case of libcalls, as in this testcase on ARM. llvm-svn: 56226	2008-09-15 19:46:03 +00:00
Dan Gohman	d3fe174c53	Define CallSDNode, an SDNode subclass for use with ISD::CALL. Currently it just holds the calling convention and flags for isVarArgs and isTailCall. And it has several utility methods, which eliminate magic 5+2*i and similar index computations in several places. CallSDNodes are not CSE'd. Teach UpdateNodeOperands to handle nodes that are not CSE'd gracefully. llvm-svn: 56183	2008-09-13 01:54:27 +00:00
Dan Gohman	ec270fb640	Change ConstantSDNode and ConstantFPSDNode to use ConstantInt* and ConstantFP* instead of APInt and APFloat directly. This reduces the amount of time to create ConstantSDNode and ConstantFPSDNode nodes when ConstantInt* and ConstantFP* respectively are already available, as is the case in SelectionDAGBuild.cpp. Also, it reduces the amount of time to legalize constants into constant pools, and the amount of time to add ConstantFP operands to MachineInstrs, due to eliminating ConstantInt::get and ConstantFP::get calls. It increases the amount of work needed to create new constants in cases where the client doesn't already have a ConstantInt* or ConstantFP*, such as legalize expanding 64-bit integer constants to 32-bit constants. And it adds a layer of indirection for the accessor methods. But these appear to be outweighed by the benefits in most cases. It will also make it easier to make ConstantSDNode and ConstantFPNode more consistent with ConstantInt and ConstantFP. llvm-svn: 56162	2008-09-12 18:08:03 +00:00
Dale Johannesen	1f3ab86804	Pass "earlyclobber" bit through to machine representation; coalescer and RA need to know about it. No functional change. llvm-svn: 56161	2008-09-12 17:49:03 +00:00
Dan Gohman	effb894453	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Dale Johannesen	baf6762e26	The sequence for ppcf128 compares was not IEEE safe in the presence of NaNs. llvm-svn: 56136	2008-09-12 00:30:56 +00:00
Dan Gohman	1dc9b0514f	FastISel support for i1 PHI nodes. llvm-svn: 56069	2008-09-10 21:01:31 +00:00
Dan Gohman	940bafb687	FastISel support for i1 constants. llvm-svn: 56068	2008-09-10 21:01:08 +00:00
Dan Gohman	39d82f902a	Add X86FastISel support for static allocas, and references to static allocas. As part of this change, refactor the address mode code for loads and stores. llvm-svn: 56066	2008-09-10 20:11:02 +00:00
Dan Gohman	222018da7b	Add a break statement that I accidentally deleted when I shuffled the fast-isel command-line options around. This fixes a bunch of fast-isel failures. llvm-svn: 56057	2008-09-10 15:52:34 +00:00
Bill Wendling	6987fec11c	Remove unnecessary bit-wise AND from the limited precision work. llvm-svn: 56049	2008-09-10 06:26:10 +00:00
Daniel Dunbar	999096065f	Fix 80 col violation. llvm-svn: 56048	2008-09-10 04:16:29 +00:00
Bill Wendling	eb1db169bf	Check that both operands are f32 before attempting to lower. llvm-svn: 56036	2008-09-10 00:24:59 +00:00
Bill Wendling	648930b9ba	Implement "visitPow". This is mainly used to see if we have a pow() call of this form: powf(10.0f, x); If this is the case, and also we want limited precision floating-point calculations, then lower to do the limited-precision stuff. llvm-svn: 56035	2008-09-10 00:20:20 +00:00
Evan Cheng	0fff397a13	A few more places where FPOW is being ignored. llvm-svn: 56032	2008-09-09 23:35:53 +00:00
Dan Gohman	b4c0295b8e	Change -fast-isel-no-abort to -fast-isel-abort, which is now off by default. Also, add assertion checks to check that the various fast-isel-related command-line options are only used when -fast-isel itself is enabled. llvm-svn: 56029	2008-09-09 23:05:00 +00:00
Evan Cheng	f4e5de4583	Legalizer was missing code that expand fpow to a libcall. llvm-svn: 56028	2008-09-09 23:02:14 +00:00
Bill Wendling	ab6676a46a	Adding 6-, 12-, and 18-bit limited-precision floating-point support for exp2 function. llvm-svn: 56025	2008-09-09 22:39:21 +00:00
Bill Wendling	48217d89b4	Add support for 6-, 12-, and 18-bit limited precision calculations of exp for floating-point numbers. llvm-svn: 56023	2008-09-09 22:13:54 +00:00
Dan Gohman	91491b51e2	Add a new option, -fast-isel-verbose, that can be used with -fast-isel-no-abort to get a dump of all unhandled instructions, without an abort. llvm-svn: 56021	2008-09-09 22:06:46 +00:00
Owen Anderson	4a58bd331b	Clean this up, based on Evan's suggestions. llvm-svn: 56009	2008-09-09 20:47:17 +00:00
Bill Wendling	ed3bb7888d	- Add support for 6-, 12-, and 18-bit limited precision floating-point "log" values. - Refactored some of the code. llvm-svn: 56008	2008-09-09 20:39:27 +00:00
Anton Korobeynikov	1a1140429e	Make safer variant of alias resolution routine to be default llvm-svn: 56005	2008-09-09 20:05:04 +00:00
Bill Wendling	faeb4b6755	Add limited precision floating-point conversions of log10 for 6- and 18-bit precisions. llvm-svn: 56000	2008-09-09 18:42:23 +00:00
Owen Anderson	8529085f4f	Check for type legality before materializing integer constants in fast isel. With this change, all of MultiSource/Applications passes on Darwin/X86 under FastISel. llvm-svn: 55982	2008-09-09 06:32:02 +00:00
Dan Gohman	b6aef419b4	Remove the code that protected FastISel from aborting in the case of loads, stores, and conditional branches. It can handle those now, so any that aren't handled should trigger the abort. llvm-svn: 55977	2008-09-09 02:40:04 +00:00
Evan Cheng	1e97901388	Fix a constant lowering bug. Now we can do load and store instructions with funky getelementptr embedded in the address operand. llvm-svn: 55975	2008-09-09 01:26:59 +00:00
Bill Wendling	484167851a	Add support for floating-point calculations of log2 with limited precisions of 6 and 18. llvm-svn: 55968	2008-09-09 00:28:24 +00:00
Anton Korobeynikov	45165ed1ac	Reapply 55904: Unbreak and fix indentation llvm-svn: 55958	2008-09-08 21:13:56 +00:00
Dan Gohman	a333f3ccb8	Fix a few I's that were meant to be renamed to BI's. llvm-svn: 55942	2008-09-08 20:37:59 +00:00
Dale Johannesen	67f99f1454	Redo the 3 existing low-precision expansions to use float constants. An oversight by the numerics people who supplied this. llvm-svn: 55930	2008-09-08 18:00:26 +00:00
Bill Wendling	99b83712f3	Reverting r55898 to r55909. One of these patches was causing an ICE during the full bootstrap on Darwin: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/sys-include -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DSHARED -m64 -DL_negdi2 -c ../../llvm-gcc.src/gcc/libgcc2.c -o libgcc/x86_64/_negdi2_s.o Assertion failed: (TargetRegisterInfo::isVirtualRegister(regA) && TargetRegisterInfo::isVirtualRegister(regB) && "cannot update physical register live information"), function runOnMachineFunction, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/lib/CodeGen/TwoAddressInstructionPass.cpp, line 311. 
/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/sys-include -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DSHARED -m64 -DL_lshrdi3 -c ../../llvm-gcc.src/gcc/libgcc2.c -o libgcc/x86_64/_lshrdi3_s.o ../../llvm-gcc.src/gcc/unwind-dw2.c:1527: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. {standard input}:unknown:Undefined local symbol LBB21_11 {standard input}:unknown:Undefined local symbol LBB21_12 {standard input}:unknown:Undefined local symbol LBB21_13 {standard input}:unknown:Undefined local symbol LBB21_8 llvm-svn: 55928	2008-09-08 17:59:12 +00:00
Dan Gohman	1df80f6b1c	In visitUREM, arrange for the temporary UDIV node to be revisited, consistent with the code in visitSREM. llvm-svn: 55923	2008-09-08 16:59:01 +00:00
Daniel Dunbar	ede2d7d745	Add VISIBILITY_HIDDEN on SDISelAsmOperandInfo llvm-svn: 55922	2008-09-08 16:56:08 +00:00
Dan Gohman	e19bc1844f	Fix the string for ISD::UDIVREM. llvm-svn: 55917	2008-09-08 16:30:29 +00:00
Evan Cheng	24776b554d	Avoid redefinition and unbreak windows build. llvm-svn: 55911	2008-09-08 16:01:27 +00:00
Anton Korobeynikov	6a73698a85	Unbreak and fix indentation llvm-svn: 55904	2008-09-08 14:23:34 +00:00
Evan Cheng	e775d3526c	Add fast isel physical register definition support. llvm-svn: 55892	2008-09-08 08:38:20 +00:00
Bill Wendling	5f7371d7b1	Revert my previous change -- the subtraction of two constants was a no-op before. This is taken care of in the selection DAG pass. In my opinion, this should be in one place or the other. I.e., it should probably be removed from the DAG combiner (along with the other arithmetic transformations on constants that are essentially no-ops). llvm-svn: 55889	2008-09-08 01:56:32 +00:00
Bill Wendling	df81749886	Convert // fold (sub c1, c2) -> c1-c2 from a no-op into an actual transformation. llvm-svn: 55886	2008-09-07 11:34:47 +00:00
Evan Cheng	b9a0abb129	Indentation. llvm-svn: 55880	2008-09-07 09:04:52 +00:00
Evan Cheng	615739b991	- Doh. Pass vector by value is bad. - Add a AnalyzeCallResult specialized for calls which produce a single value. This is used by fastisel. llvm-svn: 55879	2008-09-07 09:02:18 +00:00
Dale Johannesen	36d532abd6	Next limited float precision expansion (log2 12 bits) llvm-svn: 55866	2008-09-05 23:49:37 +00:00
Owen Anderson	1dd2e40521	Revert r55859. This is breaking the build in the absence of its companion commit. llvm-svn: 55865	2008-09-05 23:36:01 +00:00
Dan Gohman	f17a2f3602	Move the code that inserts copies for function livein registers out of ScheduleDAGEmit.cpp and into SelectionDAGISel.cpp. This allows it to be run exactly once per function, even if multiple SelectionDAG iterations happen in the entry block, as may happen with FastISel. llvm-svn: 55863	2008-09-05 22:59:21 +00:00
Dale Johannesen	d4dac0e9ea	Add the next limited-precision expansion. llvm-svn: 55856	2008-09-05 21:27:19 +00:00
Dan Gohman	fd634599dc	FastISel support for AND and OR with type i1. llvm-svn: 55846	2008-09-05 18:44:22 +00:00
Dale Johannesen	520143e563	Add hooks for other intrinsics to get low-precision expansions. llvm-svn: 55845	2008-09-05 18:38:42 +00:00
Dan Gohman	fcf545690c	FastISel support for ConstantExprs. llvm-svn: 55843	2008-09-05 18:18:20 +00:00
Dan Gohman	677c3afbd1	Revert r55817. It broke PIC. FastISel will need to find a different approach here. llvm-svn: 55842	2008-09-05 18:13:01 +00:00
Evan Cheng	6b8fae1777	Add a variant of AnalyzeCallOperands that can be used by fast isel. llvm-svn: 55838	2008-09-05 16:59:26 +00:00
Duncan Sands	4d50e984bb	"Fix" PR2762. The testcase now crashes codegen elsewhere due to a missing pattern for v2f64 = sint_to_fp v2i32. That is PR2687. llvm-svn: 55828	2008-09-05 08:13:35 +00:00
Dan Gohman	921ddd69ba	Fix a search+replace-o. llvm-svn: 55824	2008-09-05 01:58:21 +00:00
Dale Johannesen	f2a52bbee5	Add -flimit-float-precision to enable some faster, but less accurate (non-IEEE) code sequences for certain math library functions. Add the first of several such expansions. Don't worry, if you don't turn it on it won't affect you. llvm-svn: 55823	2008-09-05 01:48:15 +00:00
Dan Gohman	ea56bdde34	FastISel support for unreachable. llvm-svn: 55818	2008-09-05 01:08:41 +00:00
Dan Gohman	5b4a9f4a69	In FastISel mode, the scheduler may be invoked multiple times in the same block. Fix the entry-block handling to only run at the beginning of the entry block, and not any other times. llvm-svn: 55817	2008-09-05 01:07:48 +00:00
Owen Anderson	50288e3c99	Add initial support for selecting constant materializations that require constant pool loads on X86 in fast isel. This isn't actually used yet. llvm-svn: 55814	2008-09-05 00:06:23 +00:00
Dan Gohman	5eba3bcac6	Add an include of SmallSet.h. llvm-svn: 55793	2008-09-04 20:49:27 +00:00
Dan Gohman	a79db30d28	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Dan Gohman	634412fe35	Clean up uses of TargetLowering::getTargetMachine. llvm-svn: 55769	2008-09-04 15:39:15 +00:00
Dale Johannesen	da2d80688b	Add intrinsics for log, log2, log10, exp, exp2. No functional change (and no FE change to generate them). llvm-svn: 55753	2008-09-04 00:47:13 +00:00
Dan Gohman	e039d5580e	Do trivial local CSE for constants and other non-Instruction values in FastISel. llvm-svn: 55748	2008-09-03 23:32:19 +00:00
Dan Gohman	45df9951f5	Put RegsForValue in the llvm namespace to avoid warnings about classes in the llvm namespace having members with types from anonymous namespaces. llvm-svn: 55747	2008-09-03 23:18:39 +00:00
Dan Gohman	7bda51f5a4	Create HandlePHINodesInSuccessorBlocksFast, a version of HandlePHINodesInSuccessorBlocks that works FastISel-style. This allows PHI nodes to be updated correctly while using FastISel. This also involves some code reorganization; ValueMap and MBBMap are now members of the FastISel class, so they needn't be passed around explicitly anymore. Also, SelectInstructions is changed to SelectInstruction, and only does one instruction at a time. llvm-svn: 55746	2008-09-03 23:12:08 +00:00
Owen Anderson	b1b9398ea7	Oops, I accidentally broke the fallback case with my last commit. llvm-svn: 55704	2008-09-03 17:51:57 +00:00
Owen Anderson	ea666816c2	Fix an issue where we were reusing materializations of constants in blocks not dominated by the materialization. This is the simple fix, materializing the constant before every use. It might be better to either track domination of uses or to materialize all constants at the beginning of the function and let remat sort when to do materialization at uses. llvm-svn: 55703	2008-09-03 17:37:03 +00:00
Dan Gohman	575fad337c	Split the SelectionDAG-building code, including the FunctionLoweringInfo and SelectionDAGLowering classes, out of SelectionDAGISel.cpp and put it in a separate file, SelectionDAGBuild.cpp. llvm-svn: 55701	2008-09-03 16:12:24 +00:00
Dan Gohman	b10f1a5c60	Separate MachineInstr-emitting routines from actual scheduling routines and move them into a separate file, ScheduleDAGEmit.cpp. llvm-svn: 55699	2008-09-03 16:01:59 +00:00
Evan Cheng	31ddd09f4a	If TargetSelectInstruction returns true, move to next instruction. llvm-svn: 55692	2008-09-03 06:43:41 +00:00
Evan Cheng	09ff2e7372	80 col violations. llvm-svn: 55668	2008-09-02 21:59:13 +00:00
Dan Gohman	115267fdc6	Ensure that HandlePHINodesInSuccessorBlocks is run for all blocks, even in FastISel mode in the case where FastISel successfully selects all the instructions. llvm-svn: 55641	2008-09-02 20:17:56 +00:00
Gabor Greif	9c64e61176	Provide two overloads of AnalyzeNewNode. The first can update the SDNode in an SDValue while the second is called with SDNode* and returns a possibly updated SDNode*. This patch has no intended functional impact, but helps eliminating ugly temporary SDValues. llvm-svn: 55608	2008-09-01 15:10:19 +00:00
Duncan Sands	4b31a2a7ce	Even though no caller actually uses the new value (what matters is that it is added to the worklist), it seems more logical to return it. llvm-svn: 55606	2008-09-01 13:11:13 +00:00
Bill Wendling	11284ea499	Another situation where ROTR is cheaper than ROTL. llvm-svn: 55577	2008-08-31 01:13:31 +00:00
Bill Wendling	4822a7ac8a	For this pattern, ROTR is the cheaper option. llvm-svn: 55576	2008-08-31 01:04:56 +00:00
Bill Wendling	fc72416447	- Fix comment so that it describes how the code really works: // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotl x, y) // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotr x, (sub 32, y)) Example: (x == 0xDEADBEEF and y == 4) (x << 4) | (x >> 28) => 0xEADBEEF0 | 0x0000000D => 0xEADBEEFD (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => 0xEADBEEFD - Fix comment and code for second version. It wasn't using the rot* properly. // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotr x, y) // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotl x, (sub 32, y)) (x << 28) | (x >> 4) => 0xD0000000 | 0x0DEADBEE => 0xDDEADBEE (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => (0xEADBEEFD) llvm-svn: 55575	2008-08-31 00:37:27 +00:00
Gabor Greif	66ccf603a9	typo llvm-svn: 55574	2008-08-30 22:16:05 +00:00
Gabor Greif	e12264bf41	fix some 80-col violations llvm-svn: 55571	2008-08-30 19:29:20 +00:00
Evan Cheng	cfb7f3abdf	Transform (x << (y&31)) -> (x << y). This takes advantage of the fact x86 shift instructions 2nd operand (shift count) is limited to 0 to 31 (or 63 in the x86-64 case). llvm-svn: 55558	2008-08-30 02:03:58 +00:00
Owen Anderson	6f0c51d9da	Fix an issue where a use might be selected before a def, and then we didn't respect the pre-chosen vreg assignment when selecting the def. This is the naive solution to the problem: insert a copy to the pre-chosen vreg. Other solutions might be preferable, such as: 1) Passing the dest reg into FastEmit_. However, this would require the higher level code to know about reg classes, which they don't currently. 2) Selecting blocks in reverse postorder. This has some compile time cost for computing the order, and we'd need to measure its impact. llvm-svn: 55555	2008-08-30 00:38:46 +00:00
Evan Cheng	894be333f1	Fix 80 col. violations. llvm-svn: 55551	2008-08-29 23:20:46 +00:00
Evan Cheng	5e7658c2e4	Back out 55498. It broke Apple-style bootstrapping. llvm-svn: 55549	2008-08-29 22:21:44 +00:00
Dan Gohman	d58f3e36d0	Add a target callback for FastISel. llvm-svn: 55512	2008-08-28 23:21:34 +00:00
Gabor Greif	f304a7aa4d	erect abstraction boundaries for accessing SDValue members, rename Val -> Node to reflect semantics llvm-svn: 55504	2008-08-28 21:40:38 +00:00
Dan Gohman	c45733f194	Implement null and undef values for FastISel. llvm-svn: 55500	2008-08-28 21:19:07 +00:00
Dan Gohman	f27e33baa7	Optimize DAGCombiner's worklist processing. Previously it started its work by putting all nodes in the worklist, requiring a big dynamic allocation. Now, DAGCombiner just iterates over the AllNodes list and maintains a worklist for nodes that are newly created or need to be revisited. This allows the worklist to stay small in most cases, so it can be a SmallVector. This has the side effect of making DAGCombine not miss a folding opportunity in alloca-align-rounding.ll. llvm-svn: 55498	2008-08-28 21:01:56 +00:00
Dan Gohman	17da671922	Move CaseBlock, JumpTable, and BitTestBlock to be members of SelectionDAGLowering instead of being in an anonymous namespace. This fixes warnings about SelectionDAGLowering having fields using anonymous namespaces. llvm-svn: 55497	2008-08-28 20:38:18 +00:00
Dan Gohman	360c57f683	Fix a FastISel bug where the instructions from lowering the arguments were being emitted after the first instructions of the entry block. llvm-svn: 55496	2008-08-28 20:28:56 +00:00
Rafael Espindola	6c8a99a778	Reduce the size of the Parts vector. llvm-svn: 55483	2008-08-28 18:29:58 +00:00
Owen Anderson	d8a82b75e2	Hook up support for fast-isel of trunc instructions, using the newly working support for EXTRACT_SUBREG. llvm-svn: 55482	2008-08-28 18:26:01 +00:00
Owen Anderson	9cd1a5e530	FastEmitInst_extractsubreg doesn't need to be passed the register class. It can get it from MachineRegisterInfo instead. llvm-svn: 55476	2008-08-28 17:47:37 +00:00
Rafael Espindola	029c1c8460	Correctly resize the Parts array. llvm-svn: 55471	2008-08-28 14:24:45 +00:00
Dale Johannesen	41be0d4445	Split the ATOMIC NodeType's to include the size, e.g. ATOMIC_LOAD_ADD_{8,16,32,64} instead of ATOMIC_LOAD_ADD. Increased the Hardcoded Constant OpActionsCapacity to match. Large but boring; no functional change. This is to support partial-word atomics on ppc; i8 is not a valid type there, so by the time we get to lowering, the ATOMIC_LOAD nodes look the same whether the type was i8 or i32. The information can be added to the AtomicSDNode, but that is the largest SDNode; I don't fully understand the SDNode allocation, but it is sensitive to the largest node size, so increasing that must be bad. This is the alternative. llvm-svn: 55457	2008-08-28 02:44:49 +00:00
Dan Gohman	e1a9a780a5	Reorganize the lifetimes of the major objects SelectionDAGISel works with. SelectionDAG, FunctionLoweringInfo, and SelectionDAGLowering objects now get created once per SelectionDAGISel instance, and can be reused across blocks and across functions. Previously, they were created and destroyed each time they were needed. This reorganization simplifies the handling of PHI nodes, and also SwitchCases, JumpTables, and BitTestBlocks. This simplification has the side effect of fixing a bug in FastISel where successor PHI nodes weren't being updated correctly. This is also a step towards making the transition from FastISel into and out of SelectionDAG faster, and also making plain SelectionDAG faster on code with lots of little blocks. llvm-svn: 55450	2008-08-27 23:52:12 +00:00
Owen Anderson	5f57bc2247	Add a helper method that will be used to support EXTRACT_SUBREG for selecting trunc's in fast-isel. llvm-svn: 55439	2008-08-27 22:30:02 +00:00
Dan Gohman	61cfa3095d	Fix FastISel's bitcast code for the case where getRegForValue fails. llvm-svn: 55431	2008-08-27 20:41:38 +00:00
Owen Anderson	90609850b2	Use TargetLowering to get the types in fast isel, which handles pointer types correctly for our purposes. llvm-svn: 55428	2008-08-27 18:58:30 +00:00
Dan Gohman	d01789be23	Don't check TLI.getOperationAction. The FastISel way is to just try to do the action and let the tablegen-generated code determine if there is target-support for an operation. llvm-svn: 55427	2008-08-27 18:15:05 +00:00
Dan Gohman	b0b5a27438	Add a new FastISel method, getRegForValue, which takes care of the details of materializing constants and other values into registers, and make use of it in several places. llvm-svn: 55426	2008-08-27 18:10:19 +00:00
Dan Gohman	f2a6c1579f	Add a comment about the current floating-point constant code in FastISel. llvm-svn: 55425	2008-08-27 18:01:42 +00:00
Dan Gohman	3a3a52de58	Optimize ScheduleDAGRRList's topological sort to use one pass instead of two, and to not need a scratch std::vector. Also, compute the ordering immediately in the result array, instead of in another scratch std::vector that is copied to the result array. llvm-svn: 55421	2008-08-27 16:29:48 +00:00
Dan Gohman	9cbdedcbcf	Optimize ScheduleDAG's ComputeDepths and ComputeHeights to not need a scratch std::vector. llvm-svn: 55420	2008-08-27 16:27:25 +00:00
Dan Gohman	5ca269e684	Basic FastISel support for floating-point constants. llvm-svn: 55401	2008-08-27 01:09:54 +00:00
Owen Anderson	54aff7bb23	Fix handling of inttoptr and ptrtoint when unhandled operands are present. llvm-svn: 55400	2008-08-27 00:35:37 +00:00
Owen Anderson	140549256f	Add support for fast isel of inttoptr and ptrtoint in the cases where truncation is not needed. llvm-svn: 55399	2008-08-27 00:31:01 +00:00
Owen Anderson	ca1711a5b5	Factor out a large amount of the cast handling code in fast isel into helper methods. This simultaneously makes the code simpler and adds support for sext as well. llvm-svn: 55398	2008-08-26 23:46:32 +00:00
Owen Anderson	343310a715	Add support for fast isel of zext. llvm-svn: 55396	2008-08-26 23:14:49 +00:00
Gabor Greif	abfdf928d8	disallow direct access to SDValue::ResNo, provide a getter instead llvm-svn: 55394	2008-08-26 22:36:50 +00:00
Owen Anderson	655c1dc63d	Add support for fptosi of constants in fast isel. llvm-svn: 55393	2008-08-26 22:34:28 +00:00
Dan Gohman	d56f73f2f2	Optimize SelectionDAG's topological sort to use one pass instead of two, and to not need a scratch std::vector. Also, use the SelectionDAG's topological sort in LegalizeDAG instead of having a separate implementation. llvm-svn: 55389	2008-08-26 21:42:18 +00:00
Dan Gohman	6fda9208d9	Refactor the bitcast code into its own function. llvm-svn: 55387	2008-08-26 21:28:54 +00:00
Dan Gohman	b5e04bfb18	Make FastISel use the correct argument type when casting GEP indices. llvm-svn: 55384	2008-08-26 20:57:08 +00:00
Dan Gohman	3bcbbece19	Don't select binary instructions with illegal types. llvm-svn: 55383	2008-08-26 20:52:40 +00:00
Owen Anderson	3c4dc434ee	Add support for fast isel of sitofp, and remove some unnecessary and imprecise legality checks. llvm-svn: 55381	2008-08-26 20:37:00 +00:00
Owen Anderson	e0ac9765b2	Use a combination of copyRegToReg and ISD::BIT_CONVERT when doing fast isel of bitcasts, allowing it to support the full range of conversions people might ask for in a correct manner. llvm-svn: 55378	2008-08-26 18:51:24 +00:00
Owen Anderson	27fb3dcbc7	Make TargetInstrInfo::copyRegToReg return a bool indicating whether the copy requested was inserted or not. This allows bitcast in fast isel to properly handle the case where an appropriate reg-to-reg copy is not available. llvm-svn: 55375	2008-08-26 18:03:31 +00:00
Owen Anderson	bf05ebaccf	Add support for fast isel of non-constant fptosi instructions. llvm-svn: 55373	2008-08-26 17:44:42 +00:00
Chris Lattner	54ef9f5831	typo fix. llvm-svn: 55355	2008-08-26 06:07:47 +00:00
Dan Gohman	2e834906b9	Actually recycle SDNode allocations. SelectionDAG is using RecyclingAllocator, but this change is needed for the nodes to actually be recycled. This cuts SelectionDAG's memory usage high-water-mark in half in some cases. llvm-svn: 55351	2008-08-26 01:44:34 +00:00
Owen Anderson	8dd01ccdd8	Add a RetVT parameter to emitted FastISel methods, so that we will be able to pass the desired return type down. This is not currently used. llvm-svn: 55345	2008-08-25 23:58:18 +00:00
Evan Cheng	2c067325d6	Unbreak build. llvm-svn: 55342	2008-08-25 22:20:39 +00:00
Owen Anderson	126afc5cb9	Expand bitcast support in fast isel to support bitcasts of non-constant values by emitting reg-reg copies. llvm-svn: 55340	2008-08-25 21:32:34 +00:00
Owen Anderson	32635dbfb2	Add support for fast isel of (integer) immediate materialization patterns, and use them to support bitcast of constants in fast isel. llvm-svn: 55325	2008-08-25 20:20:32 +00:00
Chris Lattner	f4bd5cf3dd	make sure to flush the stream after dumping, to make sure it goes out immediately. llvm-svn: 55288	2008-08-24 18:28:30 +00:00
Chris Lattner	838aff36dd	get MachineConstantPool off std::ostream, onto raw_ostream. It would be really nice if someone converted MachineFunction::print to raw_ostream. llvm-svn: 55268	2008-08-23 22:53:13 +00:00
Chris Lattner	0c19df4871	Switch the asmprinter (.ll) and all the stuff it requires over to use raw_ostream instead of std::ostream. Among other goodness, this speeds up llvm-dis of kc++ with a release build from 0.85s to 0.49s (88% faster). Other interesting changes: 1) This makes Value::print be non-virtual. 2) AP[S]Int and ConstantRange can no longer print to ostream directly, use raw_ostream instead. 3) This fixes a bug in raw_os_ostream where it didn't flush itself when destroyed. 4) This adds a new SDNode::print method, instead of only allowing "dump". A lot of APIs have both std::ostream and raw_ostream versions, it would be useful to go through and systematically annihilate the std::ostream versions. This passes dejagnu, but there may be minor fallout, plz let me know if so and I'll fix it. llvm-svn: 55263	2008-08-23 22:23:09 +00:00
Dan Gohman	48a3623591	Make MBBMap a DenseMap instead of a std::map. llvm-svn: 55220	2008-08-23 02:44:46 +00:00
Dan Gohman	eb0cee91f6	Move the point at which FastISel taps into the SelectionDAGISel process up to a higher level. This allows FastISel to leverage more of SelectionDAGISel's infrastructure, such as updating Machine PHI nodes. Also, implement transitioning from SDISel back to FastISel in the middle of a block, so it's now possible to go back and forth. This allows FastISel to hand individual CallInsts and other complicated things off to SDISel to handle, while handling the rest of the block itself. To help support this, reorganize the SelectionDAG class so that it is allocated once and reused throughout a function, instead of being completely reallocated for each block. llvm-svn: 55219	2008-08-23 02:25:05 +00:00
Dan Gohman	95d1056831	Avoid creating shift-by-zero SDNodes in the common case of i8* getelementptr. DAGCombine eliminates these, but this is a fairly common case. llvm-svn: 55214	2008-08-23 01:06:51 +00:00
Dan Gohman	ac37f9a9be	Move SelectionDAG's constructor out of line. llvm-svn: 55212	2008-08-23 00:50:30 +00:00
Dan Gohman	2db3f8a095	Reapply r55191 and r55192. llvm-svn: 55205	2008-08-22 21:28:19 +00:00
Bill Wendling	fc4f64eed0	Reverting r55190, r55191, and r55192. They broke the build with this error message: {standard input}:17:bad register name `%sil' make[4]: *** [libgcc/./_addvsi3.o] Error 1 make[4]: *** Waiting for unfinished jobs.... {standard input}:23:bad register name `%dil' {standard input}:28:bad register name `%dil' make[4]: *** [libgcc/./_addvdi3.o] Error 1 {standard input}:18:bad register name `%sil' make[4]: *** [libgcc/./_subvsi3.o] Error 1 llvm-svn: 55200	2008-08-22 20:51:05 +00:00
Dan Gohman	04968da460	Fix the InsertBranch call. llvm-svn: 55192	2008-08-22 19:26:10 +00:00
Dan Gohman	87ff7058e7	Support non-fallthrough unconditional branches in FastISel. llvm-svn: 55191	2008-08-22 19:21:41 +00:00
Dan Gohman	a2292c0d34	Add FastISel support for PHINodes. Machine PHI nodes are not yet updated properly, but that's a separate task. llvm-svn: 55187	2008-08-22 17:37:48 +00:00
Dan Gohman	49e19e906f	Factor out the predicate check code from DAGISelEmitter.cpp and use it in FastISelEmitter.cpp, and make FastISel subtarget aware. Among other things, this lets it work properly on x86 targets that don't have SSE, where it successfully selects x87 instructions. llvm-svn: 55156	2008-08-22 00:20:26 +00:00
Dan Gohman	2af34bd309	Add libcalls for the new rounding opcodes. llvm-svn: 55133	2008-08-21 18:38:14 +00:00
Dan Gohman	c6337ac069	Add libm-oriented ISD opcodes for rounding operations. llvm-svn: 55130	2008-08-21 17:55:02 +00:00
Dan Gohman	6a7461ad9b	Have FastISel skip the multiply by 1 for getelementptr on i8*. llvm-svn: 55129	2008-08-21 17:37:05 +00:00
Dan Gohman	efb7d2d03d	MVT::getMVT uses iPTR for pointer types, while we need the actual intptr_t type in this case. FastISel can now select simple getelementptr instructions. llvm-svn: 55125	2008-08-21 17:25:26 +00:00
Dan Gohman	75ea0b83c5	Elements in DeadNodeSet are checked for use_empty() before they are actually deleted, so it's not necessary to remove re-used nodes from the set. llvm-svn: 55123	2008-08-21 16:24:54 +00:00
Dan Gohman	fe9056584b	Basic fast-isel support for instructions with constant int operands. llvm-svn: 55099	2008-08-21 01:41:07 +00:00
Evan Cheng	4b5c038cd0	Type of first GEP operand is always the same as the target pointer type. llvm-svn: 55097	2008-08-21 01:19:11 +00:00
Dan Gohman	6a0780cdd7	Fix unused variable warnings. llvm-svn: 55089	2008-08-20 23:53:10 +00:00
Evan Cheng	864fcc198d	First cut, un-optimized (and untested) fast isel lowering of GetElementPtrInst. llvm-svn: 55085	2008-08-20 22:45:34 +00:00
Dan Gohman	a4305cec93	Simplify the BuildMI calls even more. llvm-svn: 55077	2008-08-20 21:10:53 +00:00
Dan Gohman	02c84b8910	Simplify FastISel's constructor argument list, make the FastISel class hold a MachineRegisterInfo member, and make the MachineBasicBlock be passed in to SelectInstructions rather than the FastISel constructor. llvm-svn: 55076	2008-08-20 21:05:57 +00:00
Dan Gohman	43d1c7c607	Dump the instruction that foiled ISel even when -debug is not used. llvm-svn: 55075	2008-08-20 20:47:32 +00:00
Dan Gohman	07a34a5f69	Make more use of the BuildMI API. llvm-svn: 55072	2008-08-20 18:16:32 +00:00
Dan Gohman	24e8f0cfe6	Minor code reorganization. llvm-svn: 55071	2008-08-20 18:10:48 +00:00
Dan Gohman	2471f6ce0f	Minor whitespace cleanup. llvm-svn: 55070	2008-08-20 18:09:38 +00:00
Dan Gohman	39a5ffb03f	Fix 80 column violation. llvm-svn: 55069	2008-08-20 18:09:02 +00:00
Evan Cheng	7b9cd58596	Kill off SimpleBBISel, it's replaced by FastISel. llvm-svn: 55067	2008-08-20 17:50:32 +00:00
Dan Gohman	837c13a029	Disable DAGCombine's alignment inference in "fast" codegen mode. llvm-svn: 55059	2008-08-20 16:30:28 +00:00
Dan Gohman	2da2bedc72	Change the FoldingSetNodeID usage for objects which carry alignment and volatility information, such as loads and stores, to reduce the number of integer values added to the FoldingSetNodeID. llvm-svn: 55058	2008-08-20 15:58:01 +00:00
Dan Gohman	f6aa60ff71	Use BitVector instead of std::vector<unsigned char>. llvm-svn: 55054	2008-08-20 14:58:41 +00:00
Dan Gohman	c63a46ef39	Avoid an empty-if-body warning in release builds. llvm-svn: 55050	2008-08-20 14:00:56 +00:00
Dan Gohman	e8f9a00424	Fix FastISel to recognize that the last block in the function does not have a fall-through successor. llvm-svn: 55033	2008-08-20 01:17:01 +00:00
Dan Gohman	98265cae87	Fix a leak in the FastISel code that Chris pointed out. llvm-svn: 55031	2008-08-20 00:56:17 +00:00
Dan Gohman	847ebb90b8	Add support for running SelectionDAG if FastISel fails. This is under a command-line option, so that the default behavior is an abort, which is useful for exposing code that isn't supported yet. llvm-svn: 55028	2008-08-20 00:47:54 +00:00
Dan Gohman	f6884373c2	Fix FastISel to recognize unhandled operands, such as constants that aren't available as virtual registers (for now). llvm-svn: 55026	2008-08-20 00:35:17 +00:00
Dan Gohman	b16a7783c5	Add FastISel support for floating-point operations. llvm-svn: 55021	2008-08-20 00:23:20 +00:00
Dan Gohman	a3e4d5a5e1	Add FastISel support for several more binary operators. llvm-svn: 55020	2008-08-20 00:11:48 +00:00
Dan Gohman	697284fe0a	Add code to call FastISel, and a command-line option to enable it. llvm-svn: 55015	2008-08-19 22:33:34 +00:00
Dan Gohman	214343fbbe	Support unconditional fall-through branches in FastISel. llvm-svn: 55014	2008-08-19 22:31:46 +00:00
Dan Gohman	547ce65467	Use the BuildMI overload that sets up a destination register instead of the one that doesn't and then adding it manually. llvm-svn: 55006	2008-08-19 20:46:54 +00:00
Dan Gohman	c55fdcc935	Handle the case where target-specific fastisel code doesn't have a desired opcode. llvm-svn: 55005	2008-08-19 20:43:22 +00:00
Chris Lattner	5d2a9a4ae6	don't use the result of WriteTypeSymbolic or WriteAsOperand. llvm-svn: 54978	2008-08-19 04:44:30 +00:00
Gordon Henriksen	d930f913e6	Rename some GC classes so that their role will hopefully be clearer. In particular, Collector was confusing to implementors. Several thought that this compile-time class was the place to implement their runtime GC heap. Of course, it doesn't even exist at runtime. Specifically, the renames are: Collector -> GCStrategy; CollectorMetadata -> GCFunctionInfo; CollectorModuleMetadata -> GCModuleInfo; CollectorRegistry -> GCRegistry; Function::getCollector -> getGC (setGC, hasGC, clearGC). Several accessors and nested types have also been renamed to be consistent. These changes should be obvious. llvm-svn: 54899	2008-08-17 18:44:35 +00:00
Gordon Henriksen	bcef14d2e4	Factor GC metadata table assembly generation out of Collector in preparation for splitting AsmPrinter into its own library. llvm-svn: 54881	2008-08-17 12:56:54 +00:00
Chris Lattner	17f7165f84	Rework the routines that convert AP[S]Int into a string. Now, instead of returning an std::string by value, it fills in a SmallString/SmallVector passed in. This significantly reduces string thrashing in some cases. More specifically, this: - Adds an operator<< and a print method for APInt that allows you to directly send them to an ostream. - Reimplements APInt::toString to be much simpler and more efficient algorithmically in addition to not thrashing strings quite as much. This speeds up llvm-dis on kc++ by 7%, and may also slightly speed up the asmprinter. This also fixes a bug I introduced into the asmwriter in a previous patch w.r.t. alias printing. llvm-svn: 54873	2008-08-17 07:19:36 +00:00
Dan Gohman	c44423853a	Make FastISel's constructor protected, and give it a destructor. llvm-svn: 54793	2008-08-14 21:51:29 +00:00
Dan Gohman	550c9af91f	Improve support for vector casts in LLVM IR and CodeGen. llvm-svn: 54784	2008-08-14 20:04:46 +00:00
Dan Gohman	6134fbccef	Fix a bogus srem rule - a negative value srem'd by a power-of-2 can have a non-negative result; for example, -16%16 is 0. Also, clarify the related comments. This fixes PR2670. llvm-svn: 54767	2008-08-13 23:12:35 +00:00
Dan Gohman	7e3c392248	Allow SelectionDAG to create EXTRACT_VECTOR_ELT nodes with non-constant indices. Only a few of the peephole checks require a constant index. llvm-svn: 54764	2008-08-13 21:51:37 +00:00
Dan Gohman	b2226e21c3	Initial checkin of the new "fast" instruction selection support. See the comments in FastISelEmitter.cpp for details on what this is. This is currently experimental and unusable. llvm-svn: 54751	2008-08-13 20:19:35 +00:00
Dan Gohman	a7b8aed469	Rename SelectionDAGISel's FastISel to Fast, to begin to make room for the new FastISel instruction selection code. llvm-svn: 54749	2008-08-13 19:47:40 +00:00
Dan Gohman	23785a1679	Correct the filename in the top-of-file comment. llvm-svn: 54688	2008-08-12 17:42:33 +00:00
Dan Gohman	127bb03b8c	Take the FrameOffset into account when computing the alignment of stack objects. This fixes PR2656. llvm-svn: 54646	2008-08-11 18:27:03 +00:00
Evan Cheng	38aa7de6e9	Add skeleton of simple basic block instruction selector. llvm-svn: 54522	2008-08-08 07:27:28 +00:00
Bruno Cardoso Lopes	de5161fdf2	Add the remaining fp_round libcalls: FPROUND_F80_F32, FPROUND_PPCF128_F32, FPROUND_F80_F64, FPROUND_PPCF128_F64. Support for softening float fp_round operands is added; Mips needs this to round f64->f32. Also added support to soften the float FABS result; Mips doesn't support double fabs results while in 'single float only' mode. llvm-svn: 54484	2008-08-07 19:01:24 +00:00
Evan Cheng	0638115a6e	Factor code that finalizes PHI nodes, jump tables, etc. out of SelectBasicBlock. No functionality changes. llvm-svn: 54438	2008-08-07 00:43:25 +00:00
Owen Anderson	7c42ac4133	Remove the -disable-correct-folding option, which was ugly and is no longer needed. llvm-svn: 54361	2008-08-05 18:27:54 +00:00
Dan Gohman	e955c481fd	Fix several const-correctness issues, resolving some -Wcast-qual warnings. llvm-svn: 54349	2008-08-05 14:45:15 +00:00
Owen Anderson	bbeb8f0807	This option doesn't need to be a target option. It can be in SDISel instead. llvm-svn: 54336	2008-08-05 00:27:28 +00:00
Owen Anderson	a102290bdc	- Fix SelectionDAG to generate correct CFGs. - Add a basic machine-level dead block eliminator. These two have to go together, since many other parts of the code generator are unable to handle the unreachable blocks otherwise created. llvm-svn: 54333	2008-08-04 23:54:43 +00:00
Dan Gohman	90c724cadc	Fix SDISel lowering of PHI nodes to use ComputeValueVTs. This allows it to work correctly on aggregate values. This fixes PR2623. llvm-svn: 54331	2008-08-04 23:42:46 +00:00
Dan Gohman	6e023e63cd	Fix SDISel lowering of zeroinitializer and undef to use ComputeValueVTs. This allows it to work correctly on nested aggregate values. This fixes PR2625. llvm-svn: 54330	2008-08-04 23:30:41 +00:00
Dale Johannesen	c31eb205c1	Add a flag to disable jump table generation (all switches use the binary search algorithm) for environments that don't support it. PPC64 JIT is such an environment; turn the flag on for that. llvm-svn: 54248	2008-07-31 18:13:12 +00:00
Dan Gohman	345d63ccf2	Improve dagcombining for sext-loads and sext-in-reg nodes. llvm-svn: 54239	2008-07-31 00:50:31 +00:00
Dan Gohman	88e0df0c91	Move SelectionDAG::viewGraph() out of line; as an inline function it isn't always visible to gdb. llvm-svn: 54228	2008-07-30 18:48:53 +00:00
Dan Gohman	2fe4352691	Don't look for leaf values to store when lowering stores of empty structs. This fixes PR2612. llvm-svn: 54226	2008-07-30 18:36:51 +00:00
Nate Begeman	82f1925708	Fix broken CellSPU lowering, re-instate braces in Legalize llvm-svn: 54168	2008-07-29 19:07:27 +00:00
Nate Begeman	d63495ff25	Disable a fix in the previous patch, since it breaks CellSPU. The CellSPU codegen is broken, but needs to be fixed before we can put this back in. llvm-svn: 54164	2008-07-29 18:28:31 +00:00
Nate Begeman	fecbc8cff1	Add vector shifts to the IR, patch by Eli Friedman. CodeGen & Clang work coming next. llvm-svn: 54161	2008-07-29 15:49:41 +00:00
Dan Gohman	804c95df52	Fold the useful features of alist and alist_node into ilist, and a new ilist_node class, and remove them. Unlike alist_node, ilist_node doesn't attempt to manage storage itself, so it avoids the associated problems, including being opaque in gdb. Adjust the Recycler class so that it doesn't depend on alist_node. Also, change it to use explicit Size and Align parameters, allowing it to work when the largest-sized node doesn't have the greatest alignment requirement. Change MachineInstr's MachineMemOperand list from a pool-backed alist to a std::list for now. llvm-svn: 54146	2008-07-28 21:51:04 +00:00
Dan Gohman	68e45a361b	Make the ScheduleDAG's GraphRoot edge be blue and dashed too, like the SelectionDAG's. llvm-svn: 54129	2008-07-27 22:46:49 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Dan Gohman	91e5dcb680	Tidy SDNode::use_iterator, and complete the transition to have it parallel its analogue, Value::value_use_iterator. The operator* method now returns the user, rather than the use. llvm-svn: 54127	2008-07-27 20:43:25 +00:00
Dan Gohman	bb5f43ed4d	Rename isOnlyUseOf to isOnlyUserOf. llvm-svn: 54124	2008-07-27 18:06:42 +00:00
Duncan Sands	d9374421ea	Some binary operations were being treated as unary operations! Add support for softening some additional unary operations like fp_to_sint. llvm-svn: 54122	2008-07-27 12:28:43 +00:00
Mon P Wang	7334350d31	When splitting a vector shuffle, fixed which type we used for the hi part llvm-svn: 54007	2008-07-25 01:30:26 +00:00
Dan Gohman	9268601d8a	Use AliasAnalysis::pointsToConstantMemory in SDISel to avoid unnecessary dependencies with constant load nodes. This allows them to be scheduled freely. llvm-svn: 54001	2008-07-25 00:04:14 +00:00
Dan Gohman	fa1211f69b	Enable first-class aggregates support. Remove the GetResultInst instruction. It is still accepted in LLVM assembly and bitcode, where it is now auto-upgraded to ExtractValueInst. Also, remove support for return instructions with multiple values. These are auto-upgraded to use InsertValueInst instructions. The IRBuilder still accepts multiple-value returns, and auto-upgrades them to InsertValueInst instructions. llvm-svn: 53941	2008-07-23 00:34:11 +00:00
Duncan Sands	775e509525	LegalizeTypes support for VSETCC. Fixes PR2575. llvm-svn: 53938	2008-07-22 23:54:03 +00:00
Evan Cheng	b8ff223f26	Fix pr2566: incorrect assumption about bit_convert. It doesn't have to output a vector value. Patch by Nicolas Capens! llvm-svn: 53932	2008-07-22 20:42:56 +00:00
Dan Gohman	57c749294c	Make the GraphRoot edge look like a chain edge, which is more accurate, and use the right result number, in the off chance that the graph root has multiple result values. llvm-svn: 53923	2008-07-22 17:52:59 +00:00
Dan Gohman	ebeccb44cf	Fix grammaros in comments. llvm-svn: 53884	2008-07-21 22:38:59 +00:00
Dan Gohman	f1dc362547	Enhance the GraphWriter support for edge destinations, and teach the SelectionDAG graph writer to make use of them. Now, nodes with multiple values are displayed as such, with incoming edges pointing to the specific value they use. llvm-svn: 53875	2008-07-21 21:06:55 +00:00
Dan Gohman	a6191cde79	After early-lowering the FORMAL_ARGUMENTS node, delete it. llvm-svn: 53874	2008-07-21 21:04:07 +00:00
Dan Gohman	581cc87f57	Add titles to the various SelectionDAG viewGraph calls that include useful information like the name of the block being viewed and the current phase of compilation. llvm-svn: 53872	2008-07-21 20:00:07 +00:00
Duncan Sands	b0e3938651	Add VerifyNode, a place to put sanity checks on generic SDNode's (nodes with their own constructors should do sanity checking in the constructor). Add sanity checks for BUILD_VECTOR and fix all the places that were producing bogus BUILD_VECTORs, as found by "make check". My favorite is the BUILD_VECTOR with only two operands that was being used to build a vector with four elements! llvm-svn: 53850	2008-07-21 10:20:31 +00:00
Duncan Sands	6b418e750d	Softfloat support for FDIV. Patch by Richard Pennington. llvm-svn: 53773	2008-07-18 21:18:48 +00:00
Duncan Sands	694228b47d	Eliminate unused variable. llvm-svn: 53772	2008-07-18 21:07:41 +00:00
Duncan Sands	32e387c461	Revert 53729, after waking up in the middle of the night realising that it was wrong :) I think the reason the same type was being used for the shufflevec of indices as for the actual indices is so that if one of them needs splitting then so does the other. After my patch it might be that the indices need splitting but not the rest, yet there is no good way of handling that. I think the right solution is to not have the shufflevec be an operand at all: just have it be the list of numbers it actually is, stored as extra info in the node. llvm-svn: 53768	2008-07-18 20:12:05 +00:00
Dan Gohman	7168de7872	When printing MemOperand nodes, only use print() for PseudoSourceValue values, which never have names. Use getName() for all other values, because we want to print just a short summary of the value, not the entire instruction. llvm-svn: 53738	2008-07-17 21:12:16 +00:00
Duncan Sands	656b256a1a	Use a legal type for elements of the vector_shuffle mask. These are just indices into the shuffled vector so their type is unrelated to the type of the shuffled elements (which is what was being used before). This fixes vec_shuffle-11.ll when using LegalizeTypes. What seems to have happened is that Dan's recent change r53687, which corrected the result type of the shuffle, somehow caused LegalizeTypes to notice that the mask operand was a BUILD_VECTOR with a legal type but elements of an illegal type (i64). LegalizeTypes legalized this by introducing a new BUILD_VECTOR of i32 and bitcasting it to the old type. But the mask operand is not supposed to be a bitcast but a straight BUILD_VECTOR of constants, causing a crash. llvm-svn: 53729	2008-07-17 19:28:41 +00:00
Dan Gohman	1705968102	Add a new function, ReplaceAllUsesOfValuesWith, which handles bulk replacement of multiple values. This is slightly more efficient than doing multiple ReplaceAllUsesOfValueWith calls, and theoretically could be optimized even further. However, an important property of this new function is that it handles the case where the source value set and destination value set overlap. This makes it feasible for isel to use SelectNodeTo in many very common cases, which is advantageous because SelectNodeTo avoids a temporary node and it doesn't require CSEMap updates for users of values that don't change position. Revamp MorphNodeTo, which is what does all the work of SelectNodeTo, to handle operand lists more efficiently, and to correctly handle a number of corner cases to which its new wider use exposes it. This commit also includes a change to the encoding of post-isel opcodes in SDNodes; now instead of being sandwiched between the target-independent pre-isel opcodes and the target-dependent pre-isel opcodes, post-isel opcodes are now represented as negative values. This makes it possible to test if an opcode is pre-isel or post-isel without having to know the size of the current target's post-isel instruction set. These changes speed up llc overall by 3% and reduce memory usage by 10% on the InstructionCombining.cpp testcase with -fast and -regalloc=local. llvm-svn: 53728	2008-07-17 19:10:17 +00:00
Duncan Sands	7e5edf1a1f	LegalizeTypes support for what seems to be the only missing ppc long double operations: FNEG and FP_EXTEND. llvm-svn: 53723	2008-07-17 17:35:14 +00:00
Duncan Sands	d9256a7ceb	Turn LegalizeTypes back off again for the moment: it is breaking Darwin bootstrap due to missing functionality. llvm-svn: 53721	2008-07-17 17:06:03 +00:00
Duncan Sands	77a3d05f1e	Factorize some code for determining which libcall to use. llvm-svn: 53713	2008-07-17 02:36:29 +00:00
Dan Gohman	2714059079	Fix the result type of a VECTOR_SHUFFLE+BIT_CONVERT dagcombine. This was turned up by some new SelectionDAG assertion checks that I'm working on. llvm-svn: 53687	2008-07-16 16:13:58 +00:00
Duncan Sands	2d28e281e9	Add support for promoting and expanding AssertZext and AssertSext. Needed when passing huge integer parameters with the zeroext or signext attributes. llvm-svn: 53684	2008-07-16 16:03:07 +00:00
Duncan Sands	e766b4230e	Reorder methods alphabetically. No functionality change. While this is not a wonderful organizing principle, it does make it easy to find routines, and clear where to insert new ones. llvm-svn: 53672	2008-07-16 11:41:33 +00:00
Duncan Sands	c359055fa9	Turn on LegalizeTypes by default. llvm-svn: 53671	2008-07-16 11:36:51 +00:00
Dan Gohman	1e5aa12b7d	SelectionDAG::AssignNodeIds is unused. llvm-svn: 53636	2008-07-15 18:29:32 +00:00
Dan Gohman	1d093846b5	Don't sort SDNodes by their addresses in SelectionDAG::dump. Instead, just use the AllNodes order, which is at least relatively stable across runs. llvm-svn: 53632	2008-07-15 18:18:54 +00:00
Duncan Sands	0f1a1cdcf8	LegalizeTypes support for fabs on ppc long double. llvm-svn: 53613	2008-07-15 15:02:44 +00:00
Duncan Sands	6162e0377c	LegalizeTypes support for promotion of bswap. In LegalizeDAG the value is zero-extended to the new type before byte swapping. It doesn't matter how the extension is done since the new bits are shifted off anyway after the swap, so extend by any old rubbish bits. This results in the final assembler for the testcase being one line shorter. llvm-svn: 53604	2008-07-15 10:18:22 +00:00
Duncan Sands	202225cdf8	LegalizeTypes support for promotion of SIGN_EXTEND_INREG. llvm-svn: 53603	2008-07-15 10:14:24 +00:00
Duncan Sands	b9b5a671d3	Reorder the integer promotion methods alphabetically. No change in functionality. llvm-svn: 53602	2008-07-15 10:12:34 +00:00
Mon P Wang	97432f4f1b	Fixed potential bug if the source and target of a bit convert have different alignment llvm-svn: 53590	2008-07-15 05:28:34 +00:00
Dan Gohman	adec96f438	Reapply 53476 and 53480, with a fix so that it properly updates the BB member to the current basic block after emitting instructions. llvm-svn: 53567	2008-07-14 18:19:29 +00:00
Dan Gohman	e7c8387616	Improve debug output for MemOperandSDNode. PseudoSourceValue nodes don't have value names, so use print instead of getName() to get a useful string. llvm-svn: 53563	2008-07-14 17:51:24 +00:00
Duncan Sands	673cf1836b	I don't think BUILD_PAIR can have a vector result. Remove support for this. llvm-svn: 53559	2008-07-14 17:34:19 +00:00
Duncan Sands	0ca9a38f68	Tighten up some checks. Fix FPOWI splitting for non-power-of-two vectors. llvm-svn: 53558	2008-07-14 17:33:37 +00:00
Duncan Sands	a30cbd9797	An INSERT_VECTOR_ELT can insert a larger value than the vector element type. Don't forget to handle this when the insertion index is not a constant. llvm-svn: 53556	2008-07-14 17:32:02 +00:00
Duncan Sands	693185bcee	According to the docs, it is possible to have an extending load of a vector. Handle this case when splitting vector loads. I'm not completely sure what is supposed to happen, but I think it means hi should be set to undef. LegalizeDAG does not consider this case. llvm-svn: 53555	2008-07-14 17:27:46 +00:00
Duncan Sands	b766084cb0	There should be no extending loads or truncating stores of one-element vectors. Also, neaten the handling of INSERT_VECTOR_ELT when the inserted type is larger than the vector element type. llvm-svn: 53554	2008-07-14 17:22:31 +00:00
Duncan Sands	d47d2d6b12	Ignore TargetConstant with an illegal type. These are used for passing huge immediates in inline ASM from the front-end straight down to the ASM writer. Of course this is a hack, but it is simple, limited in scope, works in practice, and is what LegalizeDAG does. llvm-svn: 53553	2008-07-14 17:15:45 +00:00
Evan Cheng	ef8412c822	Back out 53476 and 53480 for now. Somehow they cause llc to miscompile 179.art. llvm-svn: 53502	2008-07-12 01:38:51 +00:00
Dan Gohman	02c7c6cb33	Include a frame index in the "fixed stack" pseudo source value instead of using the frame index for the SVOffset, which was inconsistent. llvm-svn: 53486	2008-07-11 22:44:52 +00:00
Dan Gohman	ed087a62dc	Fix an obsolete top-level comment. llvm-svn: 53481	2008-07-11 22:39:58 +00:00
Dan Gohman	f4cd404e6f	Factor out debugging code into the common base class. llvm-svn: 53480	2008-07-11 22:36:22 +00:00
Dan Gohman	36a69373dc	Add support for putting NamedRegionTimers in TimerGroups, and use a timer group for the timers in SelectionDAGISel. Also, Split scheduling out from emitting, to give each their own timer. llvm-svn: 53476	2008-07-11 21:54:34 +00:00
Dan Gohman	0597e5b697	Trim unnecessary #includes. llvm-svn: 53471	2008-07-11 20:38:31 +00:00
Duncan Sands	121641d601	Remove an apparently useless routine: there should be no need to split the result of a vector RET node, since they are always already legal. llvm-svn: 53462	2008-07-11 17:02:09 +00:00
Duncan Sands	3e7d0fa3ca	It is pointless to turn a UINT_TO_FP into an SINT_TO_FP libcall plus additional operations: it might as well be a direct UINT_TO_FP libcall. So only turn it into an SINT_TO_FP if the target has special handling for SINT_TO_FP. llvm-svn: 53461	2008-07-11 17:00:14 +00:00
Duncan Sands	37b7322b35	Add two missing SINT_TO_FP libcalls. llvm-svn: 53460	2008-07-11 16:57:02 +00:00
Duncan Sands	d9948110a6	Port a shift-by-1 optimization from LegalizeDAG: it was presumably added after the rest of the code was copied to LegalizeTypes. llvm-svn: 53459	2008-07-11 16:54:57 +00:00
Duncan Sands	927a3648d5	Add support for 128 bit shifts and 32 bit shifts on 16 bit machines. llvm-svn: 53458	2008-07-11 16:52:29 +00:00
Chris Lattner	87909d0629	Fix a bug in the soft-float handling of FCOPYSIGN that Duncan noticed when working on legalizetypes. Both legalizetypes and legalizeops now produce the same code for CodeGen/ARM/fcopysign.ll. llvm-svn: 53435	2008-07-10 23:46:13 +00:00
Chris Lattner	17b234cf9b	make legalize types be a command line option: -enable-legalize-types. llvm-svn: 53434	2008-07-10 23:37:50 +00:00
Duncan Sands	abdcac66dc	Add support for 128 bit multiplicative operations. Lack of these caused a bootstrap failure with Fortran on x86-64 with LegalizeTypes turned on. While there, be nice to 16 bit machines and support expansion of i32 too. llvm-svn: 53408	2008-07-10 15:35:05 +00:00
Duncan Sands	5e6d1402c2	Add a mysteriously missing libcall, FPTOSINT_F80_I32. Be nice to 16 bit machines by supporting FP_TO_XINT expansion for these. llvm-svn: 53407	2008-07-10 15:33:02 +00:00
Duncan Sands	303524be58	Fix a FIXME: use an apint in CTTZ legalization. llvm-svn: 53406	2008-07-10 15:30:54 +00:00
Duncan Sands	e78352a125	Remove PromoteIntRes_FP_ROUND - not sure what it was doing there: FP_ROUND returns a float, not an integer. llvm-svn: 53405	2008-07-10 15:29:55 +00:00
Duncan Sands	4ac3984fc5	Make sure the alignment of the temporary created in CreateStackStoreLoad is good enough for both the source and destination types. llvm-svn: 53404	2008-07-10 15:26:17 +00:00
Duncan Sands	d4c09df689	Make the LegalizeType method naming scheme more regular. llvm-svn: 53403	2008-07-10 15:25:04 +00:00
Duncan Sands	74f23ff45c	Don't barf when dumping a constant that contains a ginormous value (eg: i128 -1). llvm-svn: 53402	2008-07-10 11:23:14 +00:00
Dan Gohman	7d94c49db9	Simplify hasNUsesOfValue and hasAnyUsesOfValue even more. This makes their special-case checks of use_size() less beneficial, so remove them. This eliminates all but one use of use_size(), which is in AssignTopologicalOrder, which uses it only once for each node, and so can reasonably afford to recompute it, as this allows the UsesSize field of SDNode to be removed altogether. llvm-svn: 53377	2008-07-09 23:03:14 +00:00
Dan Gohman	7a510c2990	hasAnyUseOfValue can check SDUse nodes of its users directly instead of examining every operand of every user. llvm-svn: 53374	2008-07-09 22:39:01 +00:00
Dan Gohman	db4504fa57	Move MemoryVT out of LSBaseNode into MemSDNode, allowing the getMemOperand function to be moved into the base class as well and made non-virtual. llvm-svn: 53372	2008-07-09 22:08:04 +00:00
Dan Gohman	89e71d48b8	Move the IsVolatile and SVOffset fields into the MemSDNode base class, and store IsVolatile and Alignment in a more compact form. This makes AtomicSDNode slightly larger, but it shrinks LoadSDNode and StoreSDNode, which are much more common and are the largest of the SDNode subclasses. Also, this lets the isVolatile() and getAlignment() accessors be non-virtual. llvm-svn: 53361	2008-07-09 21:23:02 +00:00
Duncan Sands	37ab611e8e	Remove some unneeded includes. llvm-svn: 53289	2008-07-09 12:08:25 +00:00
Duncan Sands	5e266c914a	Redo LegalizeTypes soft float support for SINT_TO_FP and UINT_TO_FP. This now produces the same code as LegalizeDAG (the previous code was based on a mistaken idea of what LegalizeDAG did in this case). llvm-svn: 53288	2008-07-09 12:07:22 +00:00
Duncan Sands	b9e63db718	Forgot to update the chain result when softening loads. llvm-svn: 53287	2008-07-09 11:15:31 +00:00
Duncan Sands	ed811f0ec1	LegalizeTypes soft float support for FP_TO_SINT and FP_TO_UINT. llvm-svn: 53286	2008-07-09 11:13:46 +00:00
Duncan Sands	8090f8576f	LegalizeTypes support for powi soft float. llvm-svn: 53285	2008-07-09 11:11:47 +00:00
Duncan Sands	c52d3bf646	Make the role of MVT::i32 clearer here, and add a note since it is not clear whether it is correct. llvm-svn: 53284	2008-07-09 08:07:41 +00:00
Evan Cheng	7898e98026	Missed alignment argument on stores lowered from memcpy. llvm-svn: 53281	2008-07-09 06:38:06 +00:00
Dan Gohman	919936815e	const-ify SelectionDAG::getNodeValueTypes. llvm-svn: 53264	2008-07-09 00:00:42 +00:00
Dan Gohman	e8d8d2ea42	Factor out the code for computing an alignment value, and make it available to getAtomic in addition to just getLoad and getStore, to prevent MachineMemOperands with 0 alignment. llvm-svn: 53261	2008-07-08 23:46:32 +00:00
Evan Cheng	34ef1db87c	Do not CSE DEBUG_LOC, DBG_LABEL, DBG_STOPPOINT, DECLARE, and EH_LABEL SDNode's. This improves compile time slightly at -O0 -g. llvm-svn: 53246	2008-07-08 20:06:39 +00:00
Duncan Sands	0797e5bf05	Remove custom expansion from LegalizeTypes when doing soft float: experiments show that targets aren't expecting this for results or for operands. Add support for select/select_cc result soft float and correct operand soft float for these. llvm-svn: 53245	2008-07-08 20:03:24 +00:00
Duncan Sands	360d689db3	Add missing select_cc libcall line, somehow omitted in LegalizeTypes. llvm-svn: 53244	2008-07-08 20:00:05 +00:00
Duncan Sands	12525efdfc	LegalizeTypes support for FP_ROUND and FP_EXTEND soft float. llvm-svn: 53231	2008-07-08 10:50:55 +00:00
Dan Gohman	3b46030375	Pool-allocation for MachineInstrs, MachineBasicBlocks, and MachineMemOperands. The pools are owned by MachineFunctions. This drastically reduces the number of calls to malloc/free made during the "Emit" phase of scheduling, as well as later phases in CodeGen. Combined with other changes, this speeds up the "instruction selection" phase of CodeGen by 10% in some cases. llvm-svn: 53212	2008-07-07 23:14:23 +00:00
Dan Gohman	7f8b6d5f80	Pool-allocation for SDNodes. The pool is allocated once for each function, and reused across SelectionDAGs. This drastically reduces the number of calls to malloc/free made during instruction selection, and improves memory locality. llvm-svn: 53211	2008-07-07 23:02:41 +00:00
Dan Gohman	9169763955	Fix SDNode::MorphNodeTo (a function used by SelectNodeTo) to properly track dead nodes that are on the original SDNode's operand list but not the new one, and have no other uses. llvm-svn: 53201	2008-07-07 20:57:48 +00:00
Dan Gohman	768f2c9246	Remove most of the uses of SDOperandPtr, usually replacing it with a simple const SDOperand, which is what's usually needed. For AddNodeIDOperands, which is small, just duplicate the function to accept an SDUse. For SelectionDAG::getNode - Add an overload that accepts SDUse* that copies the operands into a temporary SDOperand array, but also has special-case checks for 0 through 3 operands to avoid the copy in the common cases. llvm-svn: 53183	2008-07-07 18:26:29 +00:00
Dan Gohman	56e3f63ec5	Add explicit keywords. llvm-svn: 53179	2008-07-07 18:00:37 +00:00
Dan Gohman	38740a98b2	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Evan Cheng	d8b83e1292	LegalizeSetCCOperands should legalize the result of ExpandLibCall. Patch by Richard Osborne. llvm-svn: 53169	2008-07-07 07:18:09 +00:00
Duncan Sands	2fa6cf5c2f	LegalizeTypes soft-float support for stores of a float value. llvm-svn: 53165	2008-07-07 00:08:12 +00:00
Mon P Wang	5c755ff51b	Fixed generating incorrect aligned stores that I backout of r53031 that fixed problems in EmitStackConvert where the source and target type have different alignment by creating a stack slot with the max alignment of source and target type. llvm-svn: 53150	2008-07-05 20:40:31 +00:00
Duncan Sands	93e180342a	Rather than having a different custom legalization hook for each way in which a result type can be legalized (promotion, expansion, softening etc), just use one: ReplaceNodeResults, which returns a node with exactly the same result types as the node passed to it, but presumably with a bunch of custom code behind the scenes. No change if the new LegalizeTypes infrastructure is not turned on. llvm-svn: 53137	2008-07-04 11:47:58 +00:00
Bill Wendling	2e50689435	Revert my previous check-in that split up MachineModuleInfo. It turns out to slow the compiler down at -O0 some 30% or more. Ooops. llvm-svn: 53120	2008-07-03 22:53:42 +00:00
Evan Cheng	fad8be450d	Backed out 53031. llvm-svn: 53110	2008-07-03 18:20:14 +00:00
Dan Gohman	f3c4d7f877	Avoid unnecessarily copying APInt objects. llvm-svn: 53065	2008-07-03 00:52:03 +00:00
Dan Gohman	22e9707480	Replace a few uses of SelectionDAG::getTargetNode with SelectionDAG::SelectNodeTo in the instruction selector. This updates existing nodes in place instead of creating new ones. Go back to selecting ISD::DBG_LABEL nodes into TargetInstrInfo::DBG_LABEL nodes instead of leaving them unselected, now that SelectNodeTo allows us to update them in place. llvm-svn: 53057	2008-07-02 23:23:19 +00:00
Duncan Sands	739a0548c4	Add a new getMergeValues method that does not need to be passed the list of value types, and use this where appropriate. Inappropriate places are where the value type list is already known and may be long, in which case the existing method is more efficient. llvm-svn: 53035	2008-07-02 17:40:58 +00:00
Mon P Wang	4b7c1acf26	Fixed problem in EmitStackConvert where the source and target type have different alignment by creating a stack slot with the max alignment of source and target type. llvm-svn: 53031	2008-07-02 17:07:12 +00:00
Chris Lattner	6b2c4f6143	instead of aborting on shifts of i1, just implicitly fold them. The dag combiner can produce a shift of i1 when folding icmp i1's. llvm-svn: 53030	2008-07-02 17:01:57 +00:00
Duncan Sands	d353c265ff	Fix typo compounded by a cut-and-pasto. llvm-svn: 53012	2008-07-02 10:03:53 +00:00
Duncan Sands	ed283c49d5	Let AnalyzeNewNode take care of calling ExpungeNode. This makes sure that all new nodes are expunged, not just those the top node of a new subtree. llvm-svn: 53011	2008-07-02 09:56:41 +00:00
Evan Cheng	7e4abde27c	- Use a faster priority comparison function if -fast. - Code clean up. llvm-svn: 53010	2008-07-02 09:23:51 +00:00
Owen Anderson	501f207bdf	No need to use std::distance. We can just count the number of operands much more cheaply. llvm-svn: 52990	2008-07-01 22:34:11 +00:00
Evan Cheng	4c609abd90	Eliminate a compile time warning. llvm-svn: 52982	2008-07-01 21:35:46 +00:00
Evan Cheng	33696cd9cf	Do run ComputeLiveOutVRegInfo with -fast. llvm-svn: 52975	2008-07-01 18:15:04 +00:00
Evan Cheng	2c9773155a	Do not use computationally expensive scheduling heuristics with -fast. llvm-svn: 52971	2008-07-01 18:05:03 +00:00
Evan Cheng	fb2573554c	Apply Chris' suggestion. llvm-svn: 52970	2008-07-01 17:59:20 +00:00
Dan Gohman	fb19f9402b	Split ISD::LABEL into ISD::DBG_LABEL and ISD::EH_LABEL, eliminating the need for a flavor operand, and add a new SDNode subclass, LabelSDNode, for use with them to eliminate the need for a label id operand. Change instruction selection to let these label nodes through unmodified instead of creating copies of them. Teach the MachineInstr emitter how to emit a MachineInstr directly from an ISD label node. This avoids the need for allocating SDNodes for the label id and flavor value, as well as SDNodes for each of the post-isel label, label id, and label flavor. llvm-svn: 52943	2008-07-01 00:05:16 +00:00
Evan Cheng	819b770868	Suppress compiler warning. llvm-svn: 52934	2008-06-30 22:33:56 +00:00
Dan Gohman	e09a1c88cf	Use a simpler but equivalent form of RecordSource. llvm-svn: 52931	2008-06-30 22:21:03 +00:00
Evan Cheng	0d3628946f	Add timing report for various sub-passes under SelectionDAGISel. llvm-svn: 52930	2008-06-30 22:10:09 +00:00
Dan Gohman	a76e60a77a	Use reserve. SelectionDAG::allnodes_size is linear, but that doesn't appear to outweigh the benefit of reducing heap traffic. If it does become a problem, we should teach SelectionDAG to keep a count of how many nodes are live, because there are several other places where that information would be useful as well. llvm-svn: 52926	2008-06-30 21:04:06 +00:00
Dan Gohman	5c73a886b4	Rename ISD::LOCATION to ISD::DBG_STOPPOINT to better reflect its purpose, and give it a custom SDNode subclass so that it doesn't need to have line number, column number, filename string, and directory string, all existing as individual SDNodes to be the operands. This was the only user of ISD::STRING, StringSDNode, etc., so remove those and some associated code. This makes stop-points considerably easier to read in -view-legalize-dags output, and reduces overhead (creating new nodes and copying std::strings into them) on code containing debugging information. llvm-svn: 52924	2008-06-30 20:59:49 +00:00
Evan Cheng	0711d68fa7	Split scheduling from instruction selection. llvm-svn: 52923	2008-06-30 20:45:06 +00:00
Dan Gohman	31c8123d07	Replace some std::vectors that showed up in heap profiling with SmallVectors. Change the signature of TargetLowering::LowerArguments to avoid returning a vector by value, and update the two targets which still use this directly, Sparc and IA64, accordingly. llvm-svn: 52917	2008-06-30 20:31:15 +00:00
Dan Gohman	328e26d0ac	Correct the allocation size for CCState's UsedRegs member, which only needs one bit for each register. UsedRegs is a SmallVector sized at 16, so this eliminates a heap allocation/free for every call and return processed by Legalize on most targets. llvm-svn: 52915	2008-06-30 20:25:31 +00:00
Duncan Sands	9e08148f29	ExpungeNode is only needed for new nodes! This fixes CodeGen/PowerPC/2008-06-19-LegalizerCrash.ll when using the new LegalizeTypes infrastructure. llvm-svn: 52903	2008-06-30 16:43:45 +00:00
Duncan Sands	36410f6cde	Support for VAARG. As noted in a comment, this is wrong for types like x86 long double and i1, but no worse than what is done in LegalizeDAG. llvm-svn: 52898	2008-06-30 13:55:15 +00:00
Duncan Sands	dd5354df89	Support for promoting select_cc operands. llvm-svn: 52895	2008-06-30 11:50:11 +00:00
Duncan Sands	1ae6ef83ee	Revert the SelectionDAG optimization that makes it impossible to create a MERGE_VALUES node with only one result: sometimes it is useful to be able to create a node with only one result out of one of the results of a node with more than one result, for example because the new node will eventually be used to replace a one-result node using ReplaceAllUsesWith, cf X86TargetLowering::ExpandFP_TO_SINT. On the other hand, most users of MERGE_VALUES don't need this and for them the optimization was valuable. So add a new utility method getMergeValues for creating MERGE_VALUES nodes which by default performs the optimization. Change almost everywhere to use getMergeValues (and tidy some stuff up at the same time). llvm-svn: 52893	2008-06-30 10:19:09 +00:00
Evan Cheng	da3db11db3	- Re-apply 52748 and friends with fix. GetConstantStringInfo() returns an empty string for ConstantAggregateZero case which surprises selectiondag. - Correctly handle memcpy from constant string which is zero-initialized. llvm-svn: 52891	2008-06-30 07:31:25 +00:00
Chris Lattner	9d3740ed1c	Implement split and scalarize for SELECT_CC, fixing PR2504 llvm-svn: 52887	2008-06-30 02:43:01 +00:00
Anton Korobeynikov	a7c583d584	Revert (52748 and friends): Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accommodate move of routine, these now return values instead of strings. This unbreaks llvm-gcc bootstrap. llvm-svn: 52884	2008-06-29 17:57:03 +00:00
Chris Lattner	3cffa471d9	Really fix the bootstrap failure. llvm-svn: 52854	2008-06-28 06:24:50 +00:00
Chris Lattner	1701328675	Add back the capability to include nul characters in strings with GetConstantStringInfo. This will hopefully restore llvm-gcc to happy bootstrap land. llvm-svn: 52851	2008-06-28 05:33:32 +00:00
Dan Gohman	6f7b5a6392	When folding a bitcast into a load or store, preserve the alignment information of the original load or store, which is checked to be at least as good, and possibly better. llvm-svn: 52849	2008-06-28 00:45:22 +00:00
Chris Lattner	735705bc3e	simplify this check, GetConstantStringInfo validates that a global is constant already. No functionality change. llvm-svn: 52812	2008-06-27 03:18:41 +00:00
Bill Wendling	c758698d2c	Refactor the DebugInfoDesc stuff out of the MachineModuleInfo file. Clean up some uses of std::vector, where it's returning std::vector by value. Yuck! llvm-svn: 52800	2008-06-27 00:09:40 +00:00
Chris Lattner	df1cbdd645	duncan points out that isOperationLegal includes a check for type legality. Thanks Duncan! llvm-svn: 52786	2008-06-26 17:16:00 +00:00
Eric Christopher	d0ab9c47e6	Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accommodate move of routine, these now return values instead of strings. llvm-svn: 52748	2008-06-26 00:31:12 +00:00
Chris Lattner	b1e66ce3bb	when we know the signbit of an input to uint_to_fp is zero, change it to sint_to_fp on targets where that is cheaper (and vice versa of course). This allows us to compile uint_to_fp to: _test: movl 4(%esp), %eax shrl $23, %eax cvtsi2ss %eax, %xmm0 movl 8(%esp), %eax movss %xmm0, (%eax) ret instead of: .align 3 LCPI1_0: ## double .long 0 ## double least significant word 4.5036e+15 .long 1127219200 ## double most significant word 4.5036e+15 .text .align 4,0x90 .globl _test _test: subl $12, %esp movl 16(%esp), %eax shrl $23, %eax movl %eax, (%esp) movl $1127219200, 4(%esp) movsd (%esp), %xmm0 subsd LCPI1_0, %xmm0 cvtsd2ss %xmm0, %xmm0 movl 20(%esp), %eax movss %xmm0, (%eax) addl $12, %esp ret llvm-svn: 52747	2008-06-26 00:16:49 +00:00
Evan Cheng	3fc2372d3a	- Fix a x86 vector isel bug: illegal transformation of a vector_shuffle into a shift. - Add a readme entry for a missing vector_shuffle optimization that results in awful codegen. llvm-svn: 52740	2008-06-25 20:52:59 +00:00
Duncan Sands	33ff5c8d0d	Add support for expanding PPC 128 bit floats. For this it is convenient to permit floats to be used with EXTRACT_ELEMENT, so I tweaked things to allow that. I also added libcalls for ppcf128 to i32 forms of FP_TO_XINT, since they exist in libgcc and this case can certainly occur (and does occur in the testsuite) - before the i64 libcall was being used. Also, the XINT_TO_FP result seemed to be wrong when the argument is an i128: the wrong fudge factor was added (the i32 and i64 cases were handled directly, but the i128 code fell through to some generic softening code which seemed to think it was i64 to f32!). So I fixed it by adding a fudge factor that I found in my breakfast cereal. llvm-svn: 52739	2008-06-25 20:24:48 +00:00
Duncan Sands	6920b254ad	Add/complete support for integer and float select_cc and friends. This code could be factorized a bit but I'm not sure that it's worth it. llvm-svn: 52724	2008-06-25 16:34:21 +00:00
Dan Gohman	aa01afd47c	Remove the OrigVT member from AtomicSDNode, as it is redundant with the base SDNode's VTList. llvm-svn: 52722	2008-06-25 16:07:49 +00:00
Mon P Wang	6a490371c9	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Dan Gohman	0d8a61eb60	Use the new PriorityQueue in ScheduleDAGList too, which also needs arbitrary-element removal. llvm-svn: 52654	2008-06-23 23:40:09 +00:00
Dan Gohman	fa63cc4e91	Move a DenseMap's declaration outside of a loop, and just call clear() on each iteration. This avoids allocating and deallocating all of DenseMap's memory on each iteration. llvm-svn: 52642	2008-06-23 21:15:00 +00:00
Dan Gohman	b4e2637e9b	Duncan pointed out this code could be tidied. llvm-svn: 52624	2008-06-23 15:29:14 +00:00
Duncan Sands	d803ce29a8	Port some integer multiplication fixes from LegalizeDAG. Bail out with an error if there is no libcall available for the given size of integer. llvm-svn: 52622	2008-06-23 15:15:44 +00:00
Duncan Sands	73ceffa3df	Support for expanding the result of EXTRACT_ELEMENT. llvm-svn: 52621	2008-06-23 15:08:15 +00:00
Duncan Sands	bc12dce8bb	Clean up LegalizeTypes handling of loads and stores. llvm-svn: 52620	2008-06-23 14:19:45 +00:00
Duncan Sands	5fb92e58de	Make custom lowering of ADD work correctly. This fixes PR2476; patch by Richard Osborne. The same problem exists for a bunch of other operators, but I'm ignoring this because they will be automagically fixed when the new LegalizeTypes infrastructure lands, since it already solves this problem centrally. llvm-svn: 52610	2008-06-22 09:42:16 +00:00
Dan Gohman	546505e7e1	Simplify some getNode calls. llvm-svn: 52604	2008-06-21 22:06:07 +00:00
Dan Gohman	ea0452016e	canClobberPhysRegDefs shouldn't be called without checking hasPhysRegDefs; check this with an assert. llvm-svn: 52603	2008-06-21 22:05:24 +00:00
Dan Gohman	38c19aae38	Use clear() to zero an existing APInt. llvm-svn: 52601	2008-06-21 22:02:15 +00:00
Dan Gohman	14b911d929	Remove a redundant return. llvm-svn: 52585	2008-06-21 19:34:57 +00:00
Dan Gohman	46520a25a4	Remove ScheduleDAG's SUnitMap altogether. Instead, use SDNode's NodeId field, which is otherwise unused after instruction selection, as an index into the SUnit array. llvm-svn: 52583	2008-06-21 19:18:17 +00:00
Dan Gohman	a4db3352f9	Add a priority queue class, which is a wrapper around std::priority_queue and provides fairly efficient removal of arbitrary elements. Switch ScheduleDAGRRList from std::set to this new priority queue. llvm-svn: 52582	2008-06-21 18:35:25 +00:00
Duncan Sands	3bb8999719	Support for load/store of expanded float types. I don't know if a truncating store is possible here, but added support for it anyway. llvm-svn: 52577	2008-06-21 17:00:47 +00:00
Dan Gohman	e6e1348275	Change ScheduleDAG's SUnitMap from DenseMap<SDNode, vector<SUnit> > to DenseMap<SDNode, SUnit>, and adjust the way cloned SUnit nodes are handled so that only the original node needs to be in the map. This speeds up llc on 447.dealII.llvm.bc by about 2%. llvm-svn: 52576	2008-06-21 15:52:51 +00:00
Dan Gohman	4b49be1cbe	Simplify some template parameterization. llvm-svn: 52571	2008-06-21 01:08:22 +00:00
Duncan Sands	f362183c24	Share some code that is common between integer and float expansion (and sometimes vector splitting too). llvm-svn: 52548	2008-06-20 18:40:50 +00:00
Duncan Sands	49295b48eb	Rename the operation of turning a float type into an integer of the same type. Before it was "promotion", but this is confusing because it is quite different to promotion of integers. Call it "softening" instead, inspired by "soft float". llvm-svn: 52546	2008-06-20 17:49:55 +00:00
Dan Gohman	3792c470d5	Clean up some uses of std::distance, now that we have allnodes_size. llvm-svn: 52545	2008-06-20 17:15:19 +00:00
Dan Gohman	593a010c56	Teach ReturnInst lowering about aggregate return values. llvm-svn: 52522	2008-06-20 01:29:26 +00:00
Dan Gohman	44b2c57e2b	Fix the index calculations for the extractvalue lowering code. llvm-svn: 52517	2008-06-20 00:54:19 +00:00
Dan Gohman	c7a32fc8ca	Simplify the ComputeLinearIndex logic and fix a few bugs. llvm-svn: 52516	2008-06-20 00:53:00 +00:00
Evan Cheng	be0429c558	ISD::UNDEF should be expanded recursively / iteratively. llvm-svn: 52508	2008-06-19 22:01:11 +00:00
Duncan Sands	4c69995fb2	Split type expansion into ExpandInteger and ExpandFloat rather than bundling them together. Rename FloatToInt to PromoteFloat (better, if not perfect). Reorganize files by types rather than by operations. llvm-svn: 52408	2008-06-17 14:27:01 +00:00
Chris Lattner	1b08c4a709	add a new -enable-value-prop flag for llcbeta, that enables propagation of value info (sign/zero ext info) from one MBB to another. This doesn't handle much right now because of two limitations: 1) only handles zext/sext, not random bit propagation (no assert exists for this) 2) doesn't handle phis. llvm-svn: 52383	2008-06-17 06:09:18 +00:00
Duncan Sands	0ae829e5d1	Fix spelling. llvm-svn: 52381	2008-06-17 03:24:13 +00:00
Duncan Sands	37c1f5267b	Allow these transforms for types like i256 while still excluding types like i1 (not byte sized) and i120 (loading an i120 requires loading an i64, an i32, an i16 and an i8, which is expensive). llvm-svn: 52310	2008-06-16 08:14:38 +00:00
Duncan Sands	075293ff46	The transforms in visitEXTRACT_VECTOR_ELT are not valid if the load is volatile. Hopefully all wrong DAG combiner transforms of volatile loads and stores have now been caught. llvm-svn: 52293	2008-06-15 20:12:31 +00:00
Duncan Sands	0bc21c0551	LegalizeTypes support for INSERT_VECTOR_ELT with a non-constant index. llvm-svn: 52292	2008-06-15 20:00:14 +00:00
Duncan Sands	b1bfff53fe	Remove a redundant AfterLegalize check. Turn on some code when !AfterLegalize - but since this whole code section is turned off by an "if (0)" it's not really turning anything on. llvm-svn: 52276	2008-06-14 17:48:34 +00:00
Andrew Lenharth	f88d50bfcc	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Duncan Sands	8651e9c584	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. 
However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Duncan Sands	bf17080ec2	Sometimes (rarely) nodes held in LegalizeTypes maps can be deleted. This happens when RAUW replaces a node N with another equivalent node E, deleting the first node. Solve this by adding (N, E) to ReplacedNodes, which is already used to remap nodes to replacements. This means that deleted nodes are being allowed in maps, which can be delicate: the memory may be reused for a new node which might get confused with the old deleted node pointer hanging around in the maps, so detect this and flush out maps if it occurs (ExpungeNode). The expunging operation is expensive, however it never occurs during a llvm-gcc bootstrap or anywhere in the nightly testsuite. It occurs three times in "make check": Alpha/illegal-element-type.ll, PowerPC/illegal-element-type.ll and X86/mmx-shift.ll. If expunging proves to be too expensive then there are other more complicated ways of solving the problem. In the normal case this patch adds the overhead of a few more map lookups, which is hopefully negligible. llvm-svn: 52214	2008-06-11 11:42:12 +00:00
Dan Gohman	e38cc01244	Teach isGAPlusOffset to respect a GlobalAddressSDNode's offset value, which is something that apparently isn't used much. llvm-svn: 52158	2008-06-09 22:05:52 +00:00
Dan Gohman	6001b91d8e	CodeGen support for aggregate-value function arguments. llvm-svn: 52156	2008-06-09 21:19:23 +00:00
Duncan Sands	67d0f332d5	Various tweaks related to apint codegen. No functionality change for non-funky-sized integers. llvm-svn: 52151	2008-06-09 15:48:25 +00:00
Dan Gohman	d485e4eb5c	Handle empty aggregate values. llvm-svn: 52150	2008-06-09 15:21:47 +00:00
Duncan Sands	93b6609ae2	Remove some DAG combiner assumptions about sizes of integer types. Fix the isMask APInt method to actually work (hopefully) rather than crashing because it adds apints of different bitwidths. It looks like isShiftedMask is also broken, but I'm leaving that one to the APInt people (it is not used anywhere). llvm-svn: 52142	2008-06-09 11:32:28 +00:00
Duncan Sands	11dd424539	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Dan Gohman	f6743d70ab	CodeGen support for insertvalue and extractvalue, and for loads and stores of aggregate values. llvm-svn: 52069	2008-06-07 02:02:36 +00:00
Owen Anderson	0bd08cf64c	Connect successors before creating the DAG node for the branch. This has no visible functionality change, but enables a future patch where node creation will update the CFG if it decides to create an unconditional rather than a conditional branch. llvm-svn: 52067	2008-06-07 00:00:23 +00:00
Duncan Sands	f1123e58fc	Tighten up the abstraction slightly. llvm-svn: 52045	2008-06-06 12:49:32 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Evan Cheng	976b1eee81	Fix a memcpy lowering bug. Even though the memcpy alignment is smaller than the desired alignment, the frame destination alignment may still be larger than the desired alignment. Don't change its alignment to something smaller. llvm-svn: 51970	2008-06-04 23:37:54 +00:00
Scott Michel	a7d8649f78	Fix spellnig error llvm-svn: 51917	2008-06-03 19:13:20 +00:00
Dan Gohman	057240f4f0	Fold adds and subtracts of zero immediately, instead of waiting for dagcombine to do this. llvm-svn: 51886	2008-06-02 22:27:05 +00:00
Scott Michel	d831cc49e5	Add necessary 64-bit support so that gcc frontend compiles (mostly). Current issue is operand promotion for setcc/select... but looks like the fundamental stuff is implemented for CellSPU. llvm-svn: 51884	2008-06-02 22:18:03 +00:00
Dan Gohman	9a19f33842	Remove an unused variable. llvm-svn: 51807	2008-05-31 01:44:25 +00:00
Dan Gohman	8807147ada	Remove an unused variable. llvm-svn: 51721	2008-05-30 00:56:36 +00:00
Dan Gohman	714663ab94	Expand small memmovs using inline code. Set the X86 threshold for expanding memmove to a more plausible value, now that it's actually being used. llvm-svn: 51696	2008-05-29 19:42:22 +00:00
Evan Cheng	5e28227dbd	Implement vector shift up / down and insert zero with ps{rl}lq / ps{rl}ldq. llvm-svn: 51667	2008-05-29 08:22:04 +00:00
Duncan Sands	698348dfac	Fix some constructs that gcc-4.4 warns about. llvm-svn: 51591	2008-05-27 11:50:51 +00:00
Dan Gohman	643b3a0581	Add #includes to make some dependencies explicit. llvm-svn: 51496	2008-05-23 20:40:06 +00:00
Dan Gohman	6d5f120c5c	Generalize the new code in instcombine's ComputeNumSignBits for handling and/or to handle more cases (such as this add-sitofp.ll testcase), and port it to selectiondag's ComputeNumSignBits. llvm-svn: 51469	2008-05-23 02:28:01 +00:00
Dan Gohman	396ed504f1	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51460	2008-05-23 00:34:04 +00:00
Dan Gohman	fe13618682	Port the fix for the select operator from instcombine's ComputeNumSignBits to SelectionDAG's ComputeNumSignBits. llvm-svn: 51348	2008-05-20 20:59:51 +00:00
Dan Gohman	c1a4e212a3	Code simplification. llvm-svn: 51345	2008-05-20 20:56:33 +00:00
Evan Cheng	9ac3631fa3	If the result of a BIT_CONVERT is a v1* vector, it doesn't mean its source is a v1* vector. llvm-svn: 51192	2008-05-16 17:19:05 +00:00
Duncan Sands	70424d195a	Silence the compiler warning differently. The original method caused gcc-4.2 to complain. llvm-svn: 51186	2008-05-16 09:19:16 +00:00
Nate Begeman	f79f52282c	Actually scalarize the operand to BIT_CONVERT instead of asking someone to do something with a v1 type. llvm-svn: 51160	2008-05-15 20:40:58 +00:00
Dan Gohman	12fce7751b	IR support for extractvalue and insertvalue instructions. Also, begin moving toward making structs and arrays first-class types. llvm-svn: 51157	2008-05-15 19:50:34 +00:00
Evan Cheng	ef377adca0	Make use of vector load and store operations to implement memcpy, memmove, and memset. Currently only X86 target is taking advantage of these. llvm-svn: 51140	2008-05-15 08:39:06 +00:00
Evan Cheng	4ea9d49590	Use a better idiom to silence compiler warnings. llvm-svn: 51131	2008-05-14 21:08:07 +00:00
Evan Cheng	0f7fb95e79	Really silence compiler warnings. llvm-svn: 51126	2008-05-14 20:29:30 +00:00
Evan Cheng	a5b0a8d7fe	Really silence compiler warnings. llvm-svn: 51123	2008-05-14 20:26:35 +00:00
Evan Cheng	763ec13862	Silence some compiler warnings. llvm-svn: 51115	2008-05-14 20:07:51 +00:00
Dan Gohman	3ab94df276	When bit-twiddling CondCode values for integer comparisons produces SETOEQ, as it does with (SETEQ & SETULE), map it to SETEQ. llvm-svn: 51112	2008-05-14 18:17:09 +00:00
Dan Gohman	fd3e3003f3	Whitespace cleanups. llvm-svn: 51089	2008-05-14 00:43:10 +00:00
Evan Cheng	1120279ae6	Instead of a vector load, shuffle and then extract an element. Load the element from address with an offset. pshufd $1, (%rdi), %xmm0 movd %xmm0, %eax => movl 4(%rdi), %eax llvm-svn: 51026	2008-05-13 08:35:03 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Nate Begeman	b87e63a730	Teach Legalize how to scalarize VSETCC Teach X86 a few more vsetcc patterns. Custom lowering for unsupported ones is next. llvm-svn: 51009	2008-05-12 23:09:43 +00:00
Evan Cheng	b980f6fb3d	Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008	2008-05-12 23:04:07 +00:00
Evan Cheng	2609d5e779	Refactor isConsecutiveLoad from X86 to TargetLowering so DAG combiner can make use of it. llvm-svn: 50991	2008-05-12 19:56:52 +00:00
Nate Begeman	cfcb56091b	Add support for vicmp/vfcmp codegen, more legalize support coming. This is necessary to unbreak the build. llvm-svn: 50988	2008-05-12 19:40:03 +00:00
Dan Gohman	ecb77385ab	Fix a missing break in the ISD::FLT_ROUNDS_ handling. Patch by giuma! llvm-svn: 50967	2008-05-12 16:07:15 +00:00
Anton Korobeynikov	fc2edad4ae	Turn StripPointerCast() into a method llvm-svn: 50836	2008-05-07 22:54:15 +00:00
Dan Gohman	5a3eecdfd8	Fix a bug in the ComputeMaskedBits logic for multiply. llvm-svn: 50793	2008-05-07 00:35:55 +00:00
Anton Korobeynikov	82c02b28f3	Make StripPointerCast a common function (should we make it a method of Value instead?) llvm-svn: 50775	2008-05-06 22:52:30 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Dan Gohman	38dc08f36f	Instead of enumerating each opcode that isn't handled that ComputeMaskedBits handles, just use a 'default:'. This avoids TargetLowering's list getting out of date with SelectionDAG's. llvm-svn: 50693	2008-05-06 00:53:29 +00:00
Dan Gohman	cf0e3acf16	Correct the value of LowBits in srem and urem handling in ComputeMaskedBits. llvm-svn: 50692	2008-05-06 00:51:48 +00:00
Mon P Wang	3e58393c3d	Added additional atomic intrinsics: and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Dan Gohman	1962c2be6a	Fix a mistake in the computation of leading zeros for udiv. llvm-svn: 50591	2008-05-02 21:30:02 +00:00
Dan Gohman	2f83b47863	Fix a typo in a comment. llvm-svn: 50562	2008-05-02 00:05:03 +00:00
Dan Gohman	ea6357828b	Use push_back(...) instead of resize(1, ...), per review feedback. llvm-svn: 50561	2008-05-02 00:03:54 +00:00
Dan Gohman	752ce50b2d	Fix uninitialized uses of the FPC variable. llvm-svn: 50558	2008-05-01 23:40:44 +00:00
Chris Lattner	d4b2a67cf3	don't randomly miscompile seto/setuo just because we are in ffastmath mode. This fixes rdar://5902801, a miscompilation of gcc.dg/builtins-8.c. Bill, please pull this into Tak. llvm-svn: 50523	2008-05-01 07:26:11 +00:00
Arnold Schwaighofer	be0de34ede	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Scott Michel	be940424b3	Fix custom target lowering for zero/any/sign_extend: make sure that DAG.UpdateNodeOperands() is called before (not after) the call to TLI.LowerOperation(). llvm-svn: 50461	2008-04-30 00:26:38 +00:00
Roman Levenstein	6b37114590	Use std::set instead of std::priority_queue for the RegReductionPriorityQueue. This removes the existing bottleneck related to the removal of elements from the middle of the queue. Also fixes a subtle bug in ScheduleDAGRRList::CapturePred: It was updating the state of the SUnit before removing it. As a result, the comparison operators were working incorrectly and this SUnit could not be removed from the queue properly. Reviewed by Evan and Dan. Approved by Dan. llvm-svn: 50412	2008-04-29 09:07:59 +00:00
Chris Lattner	5c88f7b1ad	make the vector conversion magic handle multiple results. We now compile test2/test3 to: _test2: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End addps %xmm1, %xmm0 ret _test3: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End paddd %xmm1, %xmm0 ret as expected. llvm-svn: 50389	2008-04-29 04:48:56 +00:00
Chris Lattner	f9a49c4322	add support for multiple return values in inline asm. This is a step towards PR2094. It now compiles the attached .ll file to: _sad16_sse2: movslq %ecx, %rax ## InlineAsm Start %ecx %rdx %rax %rax %r8d %rdx %rsi ## InlineAsm End ## InlineAsm Start set %eax ## InlineAsm End ret which is pretty decent for a 3 output, 4 input asm. llvm-svn: 50386	2008-04-29 04:29:54 +00:00
Evan Cheng	b96782ecbd	Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling units. If it's creating multiple CopyToReg nodes that are "flagged" together, it should not create a TokenFactor for its chain outputs: c1, f1 = CopyToReg c2, f2 = CopyToReg c3 = TokenFactor c1, c2 ... = user c3, ..., f2 Now the two CopyToReg's and the user are "flagged" together, so they effectively form a single scheduling unit. The TokenFactor is now both an operand and a successor of the Flagged nodes. llvm-svn: 50376	2008-04-28 22:07:13 +00:00
Dan Gohman	c968c1f592	Evan pointed out that folding sext to zext may not be correct if the zext is not legal. llvm-svn: 50368	2008-04-28 18:47:17 +00:00
Dan Gohman	77ce6da378	Delete an unused constructor. llvm-svn: 50367	2008-04-28 18:28:49 +00:00
Dan Gohman	d961d30b7f	Add a comment to CreateRegForValue that clarifies the handling of aggregate types. llvm-svn: 50366	2008-04-28 18:19:43 +00:00
Dan Gohman	80c692d439	Rewrite the comments for RegsForValue and its members, and reorder some of the members for clarity. llvm-svn: 50365	2008-04-28 18:10:39 +00:00
Dan Gohman	14a05df97b	Don't call size() on each iteration of the loop. llvm-svn: 50361	2008-04-28 17:42:03 +00:00
Dan Gohman	da44054867	Fix the SVOffset values for loads and stores produced by memcpy/memset expansion. It was a bug for the SVOffset value to be used in the actual address calculations. llvm-svn: 50359	2008-04-28 17:15:20 +00:00
Dan Gohman	72ec3f4562	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Dan Gohman	3eb10f758e	Teach DAGCombine to convert (sext x) to (zext x) when the sign-bit of x is known to be zero. llvm-svn: 50357	2008-04-28 16:58:24 +00:00
Chris Lattner	c9e280c78a	Another collection of random cleanups. No functionality change. llvm-svn: 50341	2008-04-28 07:16:35 +00:00
Chris Lattner	52504e78fb	Remove the SmallVector ctor that converts from a SmallVectorImpl. This conversion opens the door for many nasty implicit conversion issues, and can be easily solved by initializing with (V.begin(), V.end()) when needed. This patch includes many small cleanups for sdisel also. llvm-svn: 50340	2008-04-28 06:44:42 +00:00
Chris Lattner	8c7f5ad968	switch RegsForValue::Regs to be a SmallVector to avoid heap thrash on tiny (usually single-element) vectors. llvm-svn: 50335	2008-04-28 06:02:19 +00:00
Chris Lattner	d04b818a91	move static function out of anon namespace, no functionality change. llvm-svn: 50330	2008-04-27 23:48:12 +00:00
Chris Lattner	122721843b	Another step to getting multiple result inline asm to work. llvm-svn: 50329	2008-04-27 23:44:28 +00:00
Chris Lattner	58b9ece38d	typo llvm-svn: 50316	2008-04-27 01:49:46 +00:00
Chris Lattner	2237973438	Implement a significant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	a937baeb9b	isa+cast -> dyn_cast llvm-svn: 50314	2008-04-27 00:16:18 +00:00
Chris Lattner	4793515a9c	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Chris Lattner	724539c001	A few inline asm cleanups: - Make targetlowering.h fit in 80 cols. - Make LowerAsmOperandForConstraint const. - Make lowerXConstraint -> LowerXConstraint - Make LowerXConstraint return a const char* instead of taking a string byref. llvm-svn: 50312	2008-04-26 23:02:14 +00:00
Dan Gohman	ca95a5f49f	Remove the code from CodeGenPrepare that moved getresult instructions to the block that defines their operands. This doesn't work in the case that the operand is an invoke, because invoke is a terminator and must be the last instruction in a block. Replace it with support in SelectionDAGISel for copying struct values into sequences of virtual registers. llvm-svn: 50279	2008-04-25 18:27:55 +00:00
Nate Begeman	6f94f61317	Pull the code to perform an INSERT_VECTOR_ELT in memory out into its own function, and then use it to fix a bug in SplitVectorOp that expected inserts to always have constant insertion indices. llvm-svn: 50273	2008-04-25 18:07:40 +00:00
Dan Gohman	e9e3891c09	Use isa instead of dyn_cast. llvm-svn: 50181	2008-04-23 20:25:16 +00:00
Dan Gohman	b418aafabf	Add support to codegen for getresult instructions with undef operands. llvm-svn: 50180	2008-04-23 20:21:29 +00:00
Dan Gohman	dc90919d2b	Fix an out-of-bounds access in -view-sunit-dags in the case of an empty ScheduleDAG. llvm-svn: 50054	2008-04-21 20:07:30 +00:00
Dale Johannesen	aac27592f0	Check we aren't trying to convert PPC long double. This fixes the testsuite failure on ppcf128-4.ll. llvm-svn: 49994	2008-04-20 18:23:46 +00:00
Chris Lattner	3b18762f40	Switch to using Simplified ConstantFP::get API. llvm-svn: 49977	2008-04-20 00:41:09 +00:00
Duncan Sands	1ec193e90b	Implement a bit more softfloat support in LegalizeTypes. Correct the load logic so that it actually works, and also teach it to handle floating point extending loads. llvm-svn: 49923	2008-04-18 20:56:03 +00:00
Duncan Sands	a8a61562af	Add some more FIXME's for indexed loads and stores. llvm-svn: 49916	2008-04-18 20:27:12 +00:00
Duncan Sands	b4e0b24e0a	Provide an explicit list of operands to MakeLibcall, rather than having it suck them out of a node. Add a bunch of new libcalls, and remove dead softfloat code (dead, because FloatToInt is used not Expand in this case). Note that indexed stores probably aren't handled properly, likewise for loads. llvm-svn: 49915	2008-04-18 20:25:14 +00:00
Dan Gohman	75c895dbc4	Remove the implicit conversion from SDOperandPtr to SDOperand*; this may fix a build error on Visual Studio. llvm-svn: 49876	2008-04-17 23:02:12 +00:00
Dan Gohman	9752a8f3b4	Correct the SrcValue information in the Expand code for va_copy. llvm-svn: 49839	2008-04-17 02:09:26 +00:00
Roman Levenstein	a3ee1a38a3	Ongoing work on improving the instruction selection infrastructure: Rename SDOperandImpl back to SDOperand. Introduce the SDUse class that represents a use of the SDNode referred by an SDOperand. Now it is more similar to Use/Value classes. Patch is approved by Dan Gohman. llvm-svn: 49795	2008-04-16 16:15:27 +00:00
Dan Gohman	82b6673c44	Fix the new scheduler assertion checks to work when the scheduler has inserted no-ops. This fixes the 2006-07-03-schedulers.ll regression on ppc32. llvm-svn: 49747	2008-04-15 22:40:14 +00:00
Nicolas Geoffray	7000c8f1aa	Change Divided flag to Split, as suggested by Evan llvm-svn: 49715	2008-04-15 08:08:50 +00:00
Dan Gohman	4370f26750	Treat EntryToken nodes as "passive" so that they aren't added to the ScheduleDAG; they don't correspond to any actual instructions so they don't need to be scheduled. This fixes a bug where the EntryToken was being scheduled multiple times in some cases, though it ended up not causing any trouble because EntryToken doesn't expand into anything. With this fixed the schedulers reliably schedule the expected number of units, so we can check this with an assertion. This requires a tweak to test/CodeGen/X86/loop-hoist.ll because it ends up getting scheduled differently in a trivial way, though it was enough to fool the prcontext+grep that the test does. llvm-svn: 49701	2008-04-15 01:22:18 +00:00
Dan Gohman	e5f21cea3e	In -view-sunit-dags, display "special" chain dependencies as cyan instead of blue to distinguish them from regular dependencies. llvm-svn: 49696	2008-04-14 23:15:07 +00:00
Dan Gohman	5b61a288a7	Avoid creating MERGE_VALUES nodes for single values. llvm-svn: 49676	2008-04-14 18:43:25 +00:00
Dan Gohman	2505d86783	Fix const-correctness issues with the SrcValue handling in the memory intrinsic expansion code. llvm-svn: 49666	2008-04-14 17:55:48 +00:00
Nicolas Geoffray	db0ea1ff4e	Fix /test/CodeGen/PowerPC/big-endian-actual-args.ll for linux/ppc32 llvm-svn: 49652	2008-04-14 17:17:14 +00:00
Duncan Sands	6c503f9a65	Initial libcall support for LegalizeTypes. This is much simpler than in LegalizeDAG because calls are not yet expanded into call sequences: that happens after type legalization has finished. llvm-svn: 49634	2008-04-14 06:48:48 +00:00
Duncan Sands	0a8a4c4a0c	LegalizeTypes can sometimes have deleted nodes in its maps. Add some sanity checks that catch this kind of thing. Hopefully these can be removed one day (once all problems are fixed!) but for the moment it seems wise to have them in. llvm-svn: 49612	2008-04-13 16:04:03 +00:00
Nicolas Geoffray	dcc2eda5fc	Add a divided flag for the first piece of an argument divided into multiple parts. Fixes PR1643 llvm-svn: 49611	2008-04-13 13:40:22 +00:00
Duncan Sands	844d55a42a	Factor some libcall code. llvm-svn: 49583	2008-04-12 17:14:18 +00:00
Dan Gohman	544ab2c50b	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Gabor Greif	c422383e08	detabify llvm-svn: 49524	2008-04-11 09:34:57 +00:00
Dan Gohman	3bc3ddd638	Rename MemOperand to MachineMemOperand. This was suggested by review feedback from Chris quite a while ago. No functionality change. llvm-svn: 49348	2008-04-07 19:35:22 +00:00
Roman Levenstein	51f532f92d	Re-commit of the r48822, where the infinite looping problem discovered by Dan Gohman is fixed. llvm-svn: 49330	2008-04-07 10:06:32 +00:00
Torok Edwin	613d7afe64	Prefer to expand mask for xor to -1, so we have a chance to turn it into a not. If it cannot be expanded, it will keep the old behaviour and try to shrink the constant. Part of enhancement for PR2191. llvm-svn: 49280	2008-04-06 21:23:02 +00:00
Dale Johannesen	0ce4a7cc44	Make sure both PendingLoads and PendingExports are flushed before an invoke. Failure to do this causes references in the landing pad to variables that were not set. Fixes g++.dg/eh/delayslot1.C g++.dg/eh/fp-regs.C g++.old-deja/g++.brendan/eh1.C llvm-svn: 49243	2008-04-04 23:48:31 +00:00
Evan Cheng	916802a78e	Start of a series of patches related to implicit_def. There is no point in creating a long live range defined by an implicit_def. Scheduler now duplicates implicit_def instruction for each of its uses. Therefore, if an implicit_def node has multiple uses, it will become a number of very short live ranges, rather than a long one. This will make coalescer's job easier. llvm-svn: 49164	2008-04-03 16:36:07 +00:00
Evan Cheng	025cea1126	Backing out 48222 temporarily. llvm-svn: 49124	2008-04-03 03:13:16 +00:00
Dale Johannesen	fd967cf3fa	Recommitting EH patch; this should answer most of the review feedback. -enable-eh is still accepted but doesn't do anything. EH intrinsics use Dwarf EH if the target supports that, and are handled by LowerInvoke otherwise. The separation of the EH table and frame move data is, I think, logically figured out, but either one still causes full EH info to be generated (not sure how to split the metadata correctly). MachineModuleInfo::needsFrameInfo is no longer used and is removed. llvm-svn: 49064	2008-04-02 00:25:04 +00:00
Dale Johannesen	5e4e051c2a	Revert 49006 for the moment. llvm-svn: 49046	2008-04-01 20:00:57 +00:00
Evan Cheng	0bd72c5ccd	More soft fp fixes. llvm-svn: 49016	2008-04-01 02:18:22 +00:00
Evan Cheng	4cabe4b452	Pasto. llvm-svn: 49014	2008-04-01 02:00:09 +00:00
Evan Cheng	611abc03ed	Add comment. llvm-svn: 49013	2008-04-01 01:51:26 +00:00
Evan Cheng	86e476b7cb	Unbreak ARM / Thumb soft FP support. llvm-svn: 49012	2008-04-01 01:50:16 +00:00
Dale Johannesen	7d02cf3c9c	Emit exception handling info for functions which are not marked nounwind, or for all functions when -enable-eh is set, provided the target supports Dwarf EH. llvm-gcc generates nounwind in the right places; other FEs will need to do so also. Given such a FE, -enable-eh should no longer be needed. llvm-svn: 49006	2008-03-31 23:40:23 +00:00
Dan Gohman	f549b26254	Fix a DAGCombiner optimization to respect volatile qualification. llvm-svn: 48994	2008-03-31 20:32:52 +00:00
Chris Lattner	0f760dfe09	Fix "Control reaches the end of non-void function" warnings, patch by David Chisnall. llvm-svn: 48963	2008-03-30 18:22:13 +00:00
Evan Cheng	16d72072df	Cosmetic changes. llvm-svn: 48947	2008-03-29 18:34:22 +00:00
Chris Lattner	a148acdc82	ifdef out a dead function. Should this be removed? llvm-svn: 48916	2008-03-28 15:36:27 +00:00
Duncan Sands	35c7cdac07	Rename getAnyLoad to getLoad as suggested by Evan. llvm-svn: 48914	2008-03-28 09:45:24 +00:00
Duncan Sands	f740509e58	Implement LegalizeTypes support for softfloat LOAD. In order to handle indexed nodes I had to introduce a new constructor, and since I was there I factorized the code in the various load constructors. llvm-svn: 48894	2008-03-27 20:23:40 +00:00
Dan Gohman	cad51cb671	Avoid creating chain dependencies from CopyToReg nodes to load and store nodes. This doesn't currently have much impact on the generated code, but it does produce simpler-looking SelectionDAGs, and consequently simpler-looking ScheduleDAGs, because there are fewer spurious dependencies. In particular, CopyValueToVirtualRegister now uses the entry node as the input chain dependency for new CopyToReg nodes instead of calling getRoot and depending on the most recent memory reference. Also, rename UnorderedChains to PendingExports and pull it up from being a local variable in SelectionDAGISel::BuildSelectionDAG to being a member variable of SelectionDAGISel, so that it doesn't have to be passed around to all the places that need it. llvm-svn: 48893	2008-03-27 19:56:19 +00:00
Roman Levenstein	30d09518b5	Fix spelling. Thanks, Duncan! :-) llvm-svn: 48873	2008-03-27 09:44:37 +00:00
Roman Levenstein	bc674501ba	Speed up SumOfUnscheduledPredsOfSuccs by introducing a new function called LimitedSumOfUnscheduledPredsOfSuccs. It terminates the computation after a given threshold is reached. This new function is always faster, but brings real wins only on bigger test-cases. The old function SumOfUnscheduledPredsOfSuccs is left in-place for now and therefore a warning about an unused static function is produced. llvm-svn: 48872	2008-03-27 09:14:57 +00:00
Roman Levenstein	358e04a185	Use a linked data structure for the uses lists of an SDNode, just like LLVM Value/Use does and MachineRegisterInfo/MachineOperand does. This allows constant time for all uses list maintenance operations. The idea was suggested by Chris. Reviewed by Evan and Dan. Patch is tested and approved by Dan. On normal use-cases compilation speed is not affected. On very big basic blocks there are compilation speedups in the range of 15-20% or even better. llvm-svn: 48822	2008-03-26 12:39:26 +00:00
Roman Levenstein	733a4d6e85	Fixed some spelling errors. Thanks, Duncan! llvm-svn: 48819	2008-03-26 11:23:38 +00:00
Roman Levenstein	7e71b4baaf	Some improvements related to the computation of isReachable. This fixes Bugzilla #1835 (http://llvm.org/bugs/show_bug.cgi?id=1835). This patch was reviewed by Tanya and Dan. Dan tested and approved it. The reason for the bad performance of the old algorithm is that it is very naive and, in the worst case, scans all nodes of the DAG every time. This patch introduces a new algorithm based on the paper "Online algorithms for maintaining the topological order of a directed acyclic graph" by David J.Pearce and Paul H.J.Kelly. This is the MNR algorithm. It has a linear time worst-case and performs much better in most situations. The paper can be found here: http://fano.ics.uci.edu/cites/Document/Online-algorithms-for-maintaining-the-topological-order-of-a-directed-acyclic-graph.html The main idea of the new algorithm is to compute the topological ordering of the SNodes in the DAG and to maintain it even after DAG modifications. The topological ordering allows for very fast node reachability checks. Tests on very big input files with tens of thousands of instructions in a BB indicate huge speed-ups (up to 10x compilation time improvement) compared to the old version. llvm-svn: 48817	2008-03-26 09:18:09 +00:00
Dan Gohman	bdc24adaaf	A quick nm audit turned up several fixed tables and objects that were marked read-write. Use const so that they can be allocated in a read-only segment. llvm-svn: 48800	2008-03-25 21:45:14 +00:00
Evan Cheng	df1690dc7c	Handle a special case xor undef, undef -> 0. Technically this should be transformed to undef. But this is such a common idiom (misuse) we are going to handle it. llvm-svn: 48792	2008-03-25 20:08:07 +00:00
Dan Gohman	fd227e9c3a	Fix typos. llvm-svn: 48779	2008-03-25 17:10:29 +00:00
Evan Cheng	fe7610f37f	Remove an unneeded test. llvm-svn: 48755	2008-03-24 23:55:16 +00:00
Dan Gohman	d8ea040c31	APIntify SelectionDAG's EXTRACT_ELEMENT code. llvm-svn: 48726	2008-03-24 16:38:05 +00:00
Anton Korobeynikov	2fa75184f3	Another comment fix llvm-svn: 48683	2008-03-22 07:53:40 +00:00
Evan Cheng	31604a62f6	Teach DAG combiner to commute commutable binary nodes in order to achieve sdisel CSE. llvm-svn: 48673	2008-03-22 01:55:50 +00:00
Dan Gohman	30e44a4b40	Fix -view-sunit-dags to support cross-rc-copy nodes. llvm-svn: 48664	2008-03-21 22:51:06 +00:00
Duncan Sands	d97eea372a	Introduce a new node for holding call argument flags. This is needed by the new legalize types infrastructure which wants to expand the 64 bit constants previously used to hold the flags on 32 bit machines. There are two functional changes: (1) in LowerArguments, if a parameter has the zext attribute set then that is marked in the flags; before it was being ignored; (2) PPC had some bogus code for handling two word arguments when using the ELF 32 ABI, which was hard to convert because of the bogusness. As suggested by the original author (Nicolas Geoffray), I've disabled it for the moment. Tested with "make check" and the Ada ACATS testsuite. llvm-svn: 48640	2008-03-21 09:14:45 +00:00
Christopher Lamb	3e9f49716e	Check even more carefully before applying this DAGCombine transform. llvm-svn: 48580	2008-03-20 04:31:39 +00:00
Evan Cheng	7a3e750fd2	Fix this xform: (sra (shl X, m), result_size) -> (sign_extend (trunc (shl X, result_size - n - m))) llvm-svn: 48578	2008-03-20 02:18:41 +00:00
Chris Lattner	a7cca362af	detabify llvm, patch by Mike Stump! llvm-svn: 48577	2008-03-20 01:22:40 +00:00
Christopher Lamb	8fe9109469	Fix X86's isTruncateFree to not claim that truncate to i1 is free. This fixes Bill's testcase that failed for r48491. llvm-svn: 48542	2008-03-19 08:30:06 +00:00
Bill Wendling	efb4d9ef80	Temporarily revert r48491. It's breaking test/CodeGen/X86/xorl.ll. llvm-svn: 48510	2008-03-18 22:29:51 +00:00
Dale Johannesen	12c76db312	Make conversions of i8/i16 to ppcf128 work. llvm-svn: 48493	2008-03-18 17:28:38 +00:00
Christopher Lamb	3e408d4d82	Target independent DAG transform to use truncate for field extraction + sign extend on targets where this is profitable. Passes nightly on x86-64. llvm-svn: 48491	2008-03-18 16:46:39 +00:00
Christopher Lamb	d3d0ad3f58	Make insert_subreg a two-address instruction, vastly simplifying LowerSubregs pass. Add a new TII, subreg_to_reg, which is like insert_subreg except that it takes an immediate implicit value to insert into rather than a register. llvm-svn: 48412	2008-03-16 03:12:01 +00:00
Evan Cheng	0e7b00d79f	Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF. llvm-svn: 48380	2008-03-15 00:03:38 +00:00
Duncan Sands	858e6385f7	Do not generate special entries in the dwarf eh table for nounwind calls. llvm-svn: 48373	2008-03-14 21:36:24 +00:00
Duncan Sands	a06e4f3050	Simplify using getIntPtrConstant. llvm-svn: 48355	2008-03-14 05:23:57 +00:00
Nate Begeman	63eb03f800	Tabs -> spaces Use getIntPtrConstant in a couple places to shorten stuff up Handle splitting vector shuffles with undefs in the mask llvm-svn: 48351	2008-03-14 00:53:31 +00:00
Evan Cheng	db443ca377	Livein copy scheduling fixes: do not coalesce physical register copies, correctly determine the safe location to insert the copies. llvm-svn: 48348	2008-03-14 00:14:55 +00:00
Dan Gohman	b72127ac4c	More APInt-ification. llvm-svn: 48344	2008-03-13 22:13:53 +00:00
Evan Cheng	65e9d5f1a8	Experimental scheduler change to schedule / coalesce the copies added for function livein's. Take 2008-03-10-RegAllocInfLoop.ll, the schedule looks like this after these copies are inserted: entry: 0x12049d0, LLVM BB @0x1201fd0, ID#0: Live Ins: %EAX %EDX %ECX %reg1031<def> = MOVPC32r 0 %reg1032<def> = ADD32ri %reg1031, <es:_GLOBAL_OFFSET_TABLE_>, %EFLAGS<imp-def> %reg1028<def> = MOV32rr %EAX %reg1029<def> = MOV32rr %EDX %reg1030<def> = MOV32rr %ECX %reg1027<def> = MOV8rm %reg0, 1, %reg0, 0, Mem:LD(1,1) [0x1201910 + 0] %reg1025<def> = MOV32rr %reg1029 %reg1026<def> = MOV32rr %reg1030 %reg1024<def> = MOV32rr %reg1028 The copies unnecessarily increase register pressure and it will end up requiring a physical register to be spilled. With -schedule-livein-copies: entry: 0x12049d0, LLVM BB @0x1201fa0, ID#0: Live Ins: %EAX %EDX %ECX %reg1031<def> = MOVPC32r 0 %reg1032<def> = ADD32ri %reg1031, <es:_GLOBAL_OFFSET_TABLE_>, %EFLAGS<imp-def> %reg1024<def> = MOV32rr %EAX %reg1025<def> = MOV32rr %EDX %reg1026<def> = MOV32rr %ECX %reg1027<def> = MOV8rm %reg0, 1, %reg0, 0, Mem:LD(1,1) [0x12018e0 + 0] Much better! llvm-svn: 48307	2008-03-12 22:19:41 +00:00
Duncan Sands	723849a17f	Initial soft-float support for LegalizeTypes. I rewrote the fcopysign expansion from LegalizeDAG to get rid of what seems to be a bug: the use of sign extension means that when copying the sign bit from an f32 to an f64, the upper 32 bits of the f64 (now an i64) are set, not just the top bit... I also generalized it to work for any sized floating point types, and removed the bogosity: SDOperand Mask1 = (SrcVT == MVT::f64) ? DAG.getConstantFP(BitsToDouble(1ULL << 63), SrcVT) : DAG.getConstantFP(BitsToFloat(1U << 31), SrcVT); Mask1 = DAG.getNode(ISD::BIT_CONVERT, SrcNVT, Mask1); (here SrcNVT is an integer with the same size as SrcVT). As far as I can see this takes a 1 << 63, converts to a double, converts that to a floating point constant then converts that to an integer constant, ending up with... 1 << 63 as an integer constant! So I just generate this integer constant directly. llvm-svn: 48305	2008-03-12 21:27:04 +00:00
Duncan Sands	c54fe97f08	Fix typo. llvm-svn: 48295	2008-03-12 20:35:19 +00:00
Duncan Sands	87de65fc29	Don't try to extract an i32 from an f64. This getCopyToParts problem was noticed by the new LegalizeTypes infrastructure. In order to avoid this kind of thing in the future I've added a check that EXTRACT_ELEMENT is only used with integers. Once LegalizeTypes is up and running most likely BUILD_PAIR and EXTRACT_ELEMENT can be removed, in favour of using apints instead. llvm-svn: 48294	2008-03-12 20:30:08 +00:00
Evan Cheng	99ee78ef63	Clean up my own mess. X86 lowering normalize vector 0 to v4i32. However DAGCombine can fold (sub x, x) -> 0 after legalization. It can create a zero vector of a type that's not expected (e.g. v8i16). We don't want to disable the optimization since leaving a (sub x, x) is really bad. Add isel patterns for other types of vector 0 to ensure correctness. It's highly unlikely to happen other than in bugpoint reduced test cases. llvm-svn: 48279	2008-03-12 07:02:50 +00:00
Evan Cheng	0903aef2ff	Total brain cramp. llvm-svn: 48274	2008-03-12 02:05:05 +00:00
Anton Korobeynikov	e8fa50f63a	Correctly propagate thread-local flag from aliasee to alias. This fixes PR2137 llvm-svn: 48257	2008-03-11 22:38:53 +00:00
Dan Gohman	44b4c07cd1	Use the correct value for InSignBit. llvm-svn: 48245	2008-03-11 21:29:43 +00:00
Dan Gohman	1351025a91	Initial codegen support for functions and calls with multiple return values. llvm-svn: 48244	2008-03-11 21:11:25 +00:00
Christopher Lamb	aa7c2105de	Recommitting parts of r48130. These do not appear to cause the observed failures. llvm-svn: 48223	2008-03-11 10:09:17 +00:00
Evan Cheng	e88a625ecd	When the register allocator runs out of registers, spill a physical register around the def's and use's of the interval being allocated to make it possible for the interval to target a register and spill it right away and restore a register for uses. This likely generates terrible code but is better than aborting. llvm-svn: 48218	2008-03-11 07:19:34 +00:00
Duncan Sands	b29f93613d	Some LegalizeTypes code factorization and minor enhancements. llvm-svn: 48215	2008-03-11 06:41:14 +00:00
Chris Lattner	5c7bda440f	compile: double test() {} into: _test: fldz ret instead of: _test: subl $12, %esp #IMPLICIT_DEF %xmm0 movsd %xmm0, (%esp) fldl (%esp) addl $12, %esp ret llvm-svn: 48213	2008-03-11 06:21:08 +00:00
Chris Lattner	3e0ec65678	variadic instructions don't have operand info for variadic arguments. llvm-svn: 48208	2008-03-11 03:14:42 +00:00
Dan Gohman	d6819da453	Generalize ExpandIntToFP to handle the case where the operand is legal and it's the result that requires expansion. This code is a little confusing because the TargetLoweringInfo tables for [US]INT_TO_FP use the operand type (the integer type) rather than the result type. llvm-svn: 48206	2008-03-11 01:59:03 +00:00
Chris Lattner	d3090bcfc8	If a register operand comes from the variadic part of a node, don't verify the register constraint matches what the instruction expects. llvm-svn: 48205	2008-03-11 00:59:28 +00:00
Dan Gohman	10f7d850cf	More APInt-ification. llvm-svn: 48201	2008-03-11 00:11:06 +00:00
Dan Gohman	2a3aeb1f72	Correctly clone FlaggedNodes. llvm-svn: 48196	2008-03-10 23:48:14 +00:00
Dan Gohman	830d86cab8	APInt-ify this. llvm-svn: 48194	2008-03-10 23:38:17 +00:00
Dan Gohman	f4300950f1	Implement more support for fp-to-i128 and i128-to-fp conversions. llvm-svn: 48189	2008-03-10 23:03:31 +00:00
Dan Gohman	272e234477	Fix mul expansion to check the correct number of bits for zero extension when checking if an unsigned multiply is safe. llvm-svn: 48171	2008-03-10 20:42:19 +00:00
Evan Cheng	b9e4280e94	Somewhat better solution. llvm-svn: 48170	2008-03-10 19:58:22 +00:00
Evan Cheng	ae2c56d93e	Default ISD::PREFETCH to expand. llvm-svn: 48169	2008-03-10 19:38:10 +00:00
Evan Cheng	d4e1d9eeb2	Revert 48125, 48126, and 48130 for now to unbreak some x86-64 tests. llvm-svn: 48167	2008-03-10 19:31:26 +00:00
Scott Michel	a6729e8666	Give TargetLowering::getSetCCResultType() a parameter so that ISD::SETCC's return ValueType can depend on its operands' ValueType. This is a cosmetic change, no functionality impacted. llvm-svn: 48145	2008-03-10 15:42:14 +00:00
Evan Cheng	831ae49599	Doh llvm-svn: 48140	2008-03-10 07:59:01 +00:00
Evan Cheng	b5d11980d9	Avoid creating BUILD_VECTOR of all zero elements of "non-normalized" type (e.g. v8i16 on x86) after legalizer. Instruction selection does not expect to see them. In all likelihood this can only be an issue in a bugpoint reduced test case. llvm-svn: 48136	2008-03-10 07:19:13 +00:00
Christopher Lamb	4ba3f0430b	Allow insert_subreg into implicit, target-specific values. Change insert/extract subreg instructions to be able to be used in TableGen patterns. Use the above features to reimplement an x86-64 pseudo instruction as a pattern. llvm-svn: 48130	2008-03-10 06:12:08 +00:00
Dale Johannesen	4e622ec86d	Increase ISD::ParamFlags to 64 bits. Increase the ByValSize field to 32 bits, thus enabling correct handling of ByVal structs bigger than 0x1ffff. Abstract interface a bit. Fixes gcc.c-torture/execute/pr23135.c and gcc.c-torture/execute/pr28982b.c in gcc testsuite (were ICE'ing on ppc32, quietly producing wrong code on x86-32.) llvm-svn: 48122	2008-03-10 02:17:22 +00:00
Chris Lattner	4c4234b59c	remove an extraneous (and ugly) default argument, thanks Duncan. llvm-svn: 48117	2008-03-09 20:04:36 +00:00
Chris Lattner	ce5f841bb5	fp_round's produced by getCopyFromParts should always be exact, because they are produced by calls (which are known exact) and by cross block copies which are known to be produced by extends. This improves: define double @test2() { %tmp85 = call double asm sideeffect "fld0", "={st(0)}"() ret double %tmp85 } from: _test2: subl $20, %esp # InlineAsm Start fld0 # InlineAsm End fstpl 8(%esp) movsd 8(%esp), %xmm0 movsd %xmm0, (%esp) fldl (%esp) addl $20, %esp #FP_REG_KILL ret to: _test2: # InlineAsm Start fld0 # InlineAsm End #FP_REG_KILL ret by avoiding a f64 <-> f80 trip llvm-svn: 48108	2008-03-09 09:38:46 +00:00
Chris Lattner	86829f0ff7	teach X86InstrInfo::copyRegToReg how to copy into ST(0) from an RFP register class. Teach ScheduleDAG how to handle CopyToReg with different src/dst reg classes. This allows us to compile trivial inline asms that expect stuff on the top of x87-fp stack. llvm-svn: 48107	2008-03-09 09:15:31 +00:00
Chris Lattner	9e07537e8c	Add ScheduleDAG support for copytoreg where the src/dst register are in different register classes, e.g. copy of ST(0) to RFP*. This gets some really trivial inline asm working that plops things on the top of stack (PR879) llvm-svn: 48105	2008-03-09 08:49:15 +00:00
Chris Lattner	381bbdb924	fix 80 col violation llvm-svn: 48100	2008-03-09 07:51:01 +00:00
Chris Lattner	83b3473dd8	extend fp values with FP_EXTEND not FP_ROUND. llvm-svn: 48097	2008-03-09 07:47:22 +00:00
Chris Lattner	322c826c9d	Fix two problems in SelectionDAGLegalize::ExpandBUILD_VECTOR's handling of BUILD_VECTORS that only have two unique elements: 1. The previous code was nondeterministic, because it walked a map in SDOperand order, which isn't deterministic. 2. The previous code didn't handle the case when one element was undef very well. Now we ensure that the generated shuffle mask has the undef vector on the RHS (instead of potentially being on the LHS) and that any elements that refer to it are themselves undef. This allows us to compile CodeGen/X86/vec_set-9.ll into: _test3: movd %rdi, %xmm0 punpcklqdq %xmm0, %xmm0 ret instead of: _test3: movd %rdi, %xmm1 #IMPLICIT_DEF %xmm0 punpcklqdq %xmm1, %xmm0 ret ... saving a register. llvm-svn: 48060	2008-03-09 00:29:42 +00:00
Chris Lattner	a1f25b0020	Teach SD some vector identities, allowing us to compile vec_set-9 into: _test3: movd %rdi, %xmm1 #IMPLICIT_DEF %xmm0 punpcklqdq %xmm1, %xmm0 ret instead of: _test3: #IMPLICIT_DEF %rax movd %rax, %xmm0 movd %rdi, %xmm1 punpcklqdq %xmm1, %xmm0 ret This is still not ideal. There is no reason to use two xmm regs. llvm-svn: 48058	2008-03-08 23:43:36 +00:00
Evan Cheng	95cf661534	Implement x86 support for @llvm.prefetch. It corresponds to prefetcht{0\|1\|2} and prefetchnta instructions. llvm-svn: 48042	2008-03-08 00:58:38 +00:00
Evan Cheng	34173f0a43	80 col violation. llvm-svn: 47998	2008-03-06 17:42:34 +00:00
Evan Cheng	a3cb090446	Constant fold SIGN_EXTEND_INREG with ashr not lshr. llvm-svn: 47992	2008-03-06 08:20:51 +00:00
Dale Johannesen	8ee39c61f2	Clarify that CALLSEQ_START..END may not be nested, and add some protection against creating such. llvm-svn: 47957	2008-03-05 19:14:03 +00:00
Chris Lattner	78e9cab229	Generalize FP constant shrinking optimization to apply to any vt except ppc long double. This allows us to shrink constant pool entries for x86 long double constants, which in turn allows us to use flds/fldl instead of fldt. llvm-svn: 47938	2008-03-05 06:48:13 +00:00
Chris Lattner	3dc3899007	Improve comment, pass in the original VT so that we can shrink a long double constant all the way to float, not stopping at double. llvm-svn: 47937	2008-03-05 06:46:58 +00:00
Dan Gohman	da7897c4e1	Codegen support for i128 UINT_TO_FP. This just fixes a bug in r47928 (Int64Ty is the correct type for the constant pool entry here) and removes the asserts, now that the code is capable of handling i128. llvm-svn: 47932	2008-03-05 02:07:31 +00:00
Evan Cheng	0a62cb44ce	Add a target lowering hook to control whether it's worthwhile to compress fp constant. For x86, if sse2 is available, it's not a good idea since cvtss2sd is slower than a movsd load and it prevents load folding. On x87, it's important to shrink fp constant since fldt is very expensive. llvm-svn: 47931	2008-03-05 01:30:59 +00:00
Andrew Lenharth	357061a74d	64bit CAS on 32bit x86. llvm-svn: 47929	2008-03-05 01:15:49 +00:00
Dan Gohman	d9d874b0cd	Codegen support for i128 SINT_TO_FP. llvm-svn: 47928	2008-03-05 01:08:17 +00:00
Roman Levenstein	c62c2bb4d0	Some improvements related to the computation of heights, depths of SUnits. The basic idea is that all these algorithms are computing the longest paths from the root node or to the exit node. Therefore the existing implementation that uses an iterative and potentially exponential algorithm was changed to a well-known graph algorithm based on dynamic programming. It has a linear run-time. llvm-svn: 47884	2008-03-04 11:19:43 +00:00
Evan Cheng	38caf77419	Refactor ExpandConstantFP so it can optimize load from constpool of types larger than f64 into extload from smaller types. llvm-svn: 47883	2008-03-04 08:05:30 +00:00
Evan Cheng	567d2e5b57	Rename isOperand() to isOperandOf() (and other similar methods). It always confuses me. llvm-svn: 47872	2008-03-04 00:41:45 +00:00
Dan Gohman	e1c4f99549	Misc. APInt-ification in the DAGCombiner. llvm-svn: 47869	2008-03-03 23:51:38 +00:00
Dan Gohman	10f34077f1	More APInt-ification. llvm-svn: 47868	2008-03-03 23:35:36 +00:00
Dan Gohman	0e238dc813	Yet more APInt-ification. llvm-svn: 47867	2008-03-03 22:37:52 +00:00
Dan Gohman	2fa65b7997	More APInt-ification. llvm-svn: 47866	2008-03-03 22:22:56 +00:00
Dan Gohman	f2bbfa3ba0	More APInt-ification. llvm-svn: 47864	2008-03-03 22:20:46 +00:00
Andrew Lenharth	d032c33300	all but CAS working on x86 llvm-svn: 47798	2008-03-01 21:52:34 +00:00
Dale Johannesen	208cc8f1b9	Add MVT::is128BitVector and is64BitVector. Shrink unaligned load/store code using them. Per review of unaligned load/store vector patch. llvm-svn: 47782	2008-03-01 03:40:57 +00:00
Evan Cheng	73bdf043a1	Refactor / clean up code; remove td list scheduler special tie breaker (no real benefit). llvm-svn: 47779	2008-03-01 00:39:47 +00:00
Dan Gohman	bd2fa566e4	More APInt-ification. llvm-svn: 47746	2008-02-29 01:47:35 +00:00
Dan Gohman	837a6dccd7	Use the new convertFromAPInt instead of convertFromZeroExtendedInteger, which allows more of the surrounding arithmetic to be done with APInt instead of uint64_t. llvm-svn: 47745	2008-02-29 01:44:25 +00:00
Dan Gohman	ec6be4a782	Use the new APInt-enabled form of getConstant instead of converting an APInt into a uint64_t to call getConstant. llvm-svn: 47742	2008-02-29 01:41:59 +00:00
Dale Johannesen	cbde4c2206	Interface of getByValTypeAlignment differed between generic & x86 versions; change generic to follow x86 and improve comments. Add PPC version (not right for non-Darwin.) llvm-svn: 47734	2008-02-28 22:31:51 +00:00
Dale Johannesen	c4c3de2b52	Fix an assertion message. llvm-svn: 47722	2008-02-28 18:36:51 +00:00
Evan Cheng	a465bfb87c	Keep track how many commutes are performed by the scheduler. llvm-svn: 47710	2008-02-28 07:40:24 +00:00
Chris Lattner	9824ffef0c	implement expand for ISD::DECLARE by just deleting it. llvm-svn: 47708	2008-02-28 05:53:40 +00:00
Evan Cheng	c799065cc3	Add a quick and dirty "loop aligner pass". x86 uses it to align its loops to 16-byte boundaries. llvm-svn: 47703	2008-02-28 00:43:03 +00:00
Dale Johannesen	bf76a08e7c	Handle load/store of misaligned vectors that are the same size as an int type by doing a bitconvert of load/store of the int type (same algorithm as floating point). This makes them work for ppc Altivec. There was some code that purported to handle loads of (some) vectors by splitting them into two smaller vectors, but getExtLoad rejects subvector loads, so this could never have worked; the patch removes it. llvm-svn: 47696	2008-02-27 22:36:00 +00:00
Dan Gohman	e5e32ec8f7	Remove the `else', at Evan's insistence. llvm-svn: 47686	2008-02-27 19:44:57 +00:00
Duncan Sands	ef40c5b204	Add a FIXME about the VECTOR_SHUFFLE evil hack. llvm-svn: 47676	2008-02-27 17:39:13 +00:00
Duncan Sands	e158a82f26	LegalizeTypes support for EXTRACT_VECTOR_ELT. The approach taken is different to that in LegalizeDAG when it is a question of expanding or promoting the result type: for example, if extracting an i64 from a <2 x i64>, when i64 needs expanding, it bitcasts the vector to <4 x i32>, extracts the appropriate two i32's, and uses those for the Lo and Hi parts. Likewise, when extracting an i16 from a <4 x i16>, and i16 needs promoting, it bitcasts the vector to <2 x i32>, extracts the appropriate i32, twiddles the bits if necessary, and uses that as the promoted value. This puts more pressure on bitcast legalization, and I've added the appropriate cases. They needed to be added anyway since users can generate such bitcasts too if they want to. Also, when considering various cases (Legal, Promote, Expand, Scalarize, Split) it is a pain that expand can correspond to Expand, Scalarize or Split, so I've changed the LegalizeTypes enum so it lists those different cases - now Expand only means splitting a scalar in two. The code produced is the same as by LegalizeDAG for all relevant testcases, except for 2007-10-31-extractelement-i64.ll, where the code seems to have improved (see below; can an expert please tell me if it is better or not). Before < vs after >. 
< subl $92, %esp < movaps %xmm0, 64(%esp) < movaps %xmm0, (%esp) < movl 4(%esp), %eax < movl %eax, 28(%esp) < movl (%esp), %eax < movl %eax, 24(%esp) < movq 24(%esp), %mm0 < movq %mm0, 56(%esp) --- > subl $44, %esp > movaps %xmm0, 16(%esp) > pshufd $1, %xmm0, %xmm1 > movd %xmm1, 4(%esp) > movd %xmm0, (%esp) > movq (%esp), %mm0 > movq %mm0, 8(%esp) < subl $92, %esp < movaps %xmm0, 64(%esp) < movaps %xmm0, (%esp) < movl 12(%esp), %eax < movl %eax, 28(%esp) < movl 8(%esp), %eax < movl %eax, 24(%esp) < movq 24(%esp), %mm0 < movq %mm0, 56(%esp) --- > subl $44, %esp > movaps %xmm0, 16(%esp) > pshufd $3, %xmm0, %xmm1 > movd %xmm1, 4(%esp) > movhlps %xmm0, %xmm0 > movd %xmm0, (%esp) > movq (%esp), %mm0 > movq %mm0, 8(%esp) < subl $92, %esp < movaps %xmm0, 64(%esp) --- > subl $44, %esp < movl 16(%esp), %eax < movl %eax, 48(%esp) < movl 20(%esp), %eax < movl %eax, 52(%esp) < movaps %xmm0, (%esp) < movl 4(%esp), %eax < movl %eax, 60(%esp) < movl (%esp), %eax < movl %eax, 56(%esp) --- > pshufd $1, %xmm0, %xmm1 > movd %xmm1, 4(%esp) > movd %xmm0, (%esp) > movd %xmm1, 12(%esp) > movd %xmm0, 8(%esp) < subl $92, %esp < movaps %xmm0, 64(%esp) --- > subl $44, %esp < movl 24(%esp), %eax < movl %eax, 48(%esp) < movl 28(%esp), %eax < movl %eax, 52(%esp) < movaps %xmm0, (%esp) < movl 12(%esp), %eax < movl %eax, 60(%esp) < movl 8(%esp), %eax < movl %eax, 56(%esp) --- > pshufd $3, %xmm0, %xmm1 > movd %xmm1, 4(%esp) > movhlps %xmm0, %xmm0 > movd %xmm0, (%esp) > movd %xmm1, 12(%esp) > movd %xmm0, 8(%esp) llvm-svn: 47672	2008-02-27 13:34:40 +00:00
Duncan Sands	2111bd2e37	LegalizeTypes support for legalizing the mask operand of a VECTOR_SHUFFLE. The mask is a vector of constant integers. The code in LegalizeDAG doesn't bother to legalize the mask, since it's basically just storage for a bunch of constants, however LegalizeTypes is more picky. The problem is that there may not exist any legal vector-of-integers type with a legal element type, so it is impossible to create a legal mask! Unless of course you cheat by creating a BUILD_VECTOR where the operands have a different type to the element type of the vector being built... This is pretty ugly but works - all relevant tests in the testsuite pass, and produce the same assembler with and without LegalizeTypes. llvm-svn: 47670	2008-02-27 13:03:44 +00:00
Duncan Sands	5d5bc484d0	LegalizeTypes support for INSERT_VECTOR_ELT. llvm-svn: 47669	2008-02-27 10:18:23 +00:00
Duncan Sands	96658d0189	Support for legalizing MEMBARRIER. llvm-svn: 47667	2008-02-27 08:53:44 +00:00
Bill Wendling	97925ec704	Final de-tabification. llvm-svn: 47663	2008-02-27 06:33:05 +00:00
Dan Gohman	66272a545b	Teach Legalize how to expand an EXTRACT_ELEMENT. llvm-svn: 47656	2008-02-27 01:52:30 +00:00
Dan Gohman	f19609abe8	Convert the last remaining users of the non-APInt form of ComputeMaskedBits to use the APInt form, and remove the non-APInt form. llvm-svn: 47654	2008-02-27 01:23:58 +00:00
Dan Gohman	ae2b6fbb8e	Convert SimplifyDemandedMask and ShrinkDemandedConstant to use APInt. Change several cases in SimplifyDemandedMask that don't ever do any simplifying to reuse the logic in ComputeMaskedBits instead of duplicating it. llvm-svn: 47648	2008-02-27 00:25:32 +00:00
Bill Wendling	d7a258d325	Rename PrintableName to Name. llvm-svn: 47629	2008-02-26 21:47:57 +00:00
Bill Wendling	c24ea4fb41	Change "Name" to "AsmName" in the target register info. Gee, a refactoring tool would have been a Godsend here! llvm-svn: 47625	2008-02-26 21:11:01 +00:00
Dan Gohman	9db0aa86d9	Avoid aborting on invalid shift counts. llvm-svn: 47612	2008-02-26 18:50:50 +00:00
Chris Lattner	07c83cc86e	Fix PR2096, a regression introduced with my patch last night. This also fixes cfrac, flops, and 175.vpr llvm-svn: 47605	2008-02-26 17:09:59 +00:00
Duncan Sands	7cdbbfd067	Fix a nasty bug in LegalizeTypes (spotted in CodeGen/PowerPC/illegal-element-type.ll): suppose a node X is processed, and processing maps it to a node Y. Then X continues to exist in the DAG, but with no users. While processing some other node, a new node may be created that happens to be equal to X, and thus X will be reused rather than a truly new node. This can cause X to "magically reappear", and since it is in the Processed state it will not be reprocessed, so at the end of type legalization the illegal node X can still be present. The solution is to replace X with Y whenever X gets resurrected like this. llvm-svn: 47601	2008-02-26 11:21:42 +00:00
Chris Lattner	e7c14013f5	Fix isNegatibleForFree to not return true for ConstantFP nodes after legalize. Just because a constant is legal (e.g. 0.0 in SSE) doesn't mean that its negated value is legal (-0.0). We could make this stronger by checking to see if the negated constant is actually legal post negation, but it doesn't seem like a big deal. llvm-svn: 47591	2008-02-26 07:04:54 +00:00
Evan Cheng	ccc0c996a4	Refactor inline asm constraint matching code out of SDIsel into TargetLowering. llvm-svn: 47587	2008-02-26 02:33:44 +00:00
Dan Gohman	432e4a6742	Make some static variables const. llvm-svn: 47566	2008-02-25 21:39:34 +00:00
Dan Gohman	1f372edd97	Convert MaskedValueIsZero and all its users to use APInt. Also add a SignBitIsZero function to simplify a common use case. llvm-svn: 47561	2008-02-25 21:11:39 +00:00
Duncan Sands	896c519d19	In debug builds check that the key property holds: all result and operand types are legal. llvm-svn: 47546	2008-02-25 16:21:21 +00:00
Duncan Sands	ba3d7e8e7d	Add support to LegalizeTypes for building legal vectors out of illegal elements (BUILD_VECTOR). Uses and beefs up BUILD_PAIR, though it didn't really have to. Like most of LegalizeTypes, does not support soft-float. This cures all "make check" vector building failures. llvm-svn: 47537	2008-02-24 07:36:03 +00:00
Dale Johannesen	eabc5f39af	Pass alignment on ByVal parameters, from FE, all the way through. It is now used for codegen. llvm-svn: 47484	2008-02-22 17:49:45 +00:00
Dan Gohman	f3057a939d	Fix a regression in 403.gcc and 186.crafty introduced in 47383. To test that a value is >= 32, check that all of the high bits are zero, not just one or more. llvm-svn: 47467	2008-02-22 01:12:31 +00:00
Chris Lattner	3422b673d1	Make the clobber analysis a bit more smart: we only are careful about early clobbers if the clobber list contains a register not some thing like {memory}, {dirflag} etc. llvm-svn: 47457	2008-02-21 20:54:31 +00:00
Chris Lattner	bdd4c8b04d	Treat clobber operands like early clobbers: if we have any, we force sdisel to do all regalloc for an asm. This leads to gross but correct codegen. This fixes the rest of PR2078. llvm-svn: 47454	2008-02-21 19:43:13 +00:00
Andrew Lenharth	7254826c40	Better names as per Evan's request llvm-svn: 47435	2008-02-21 16:11:38 +00:00
Andrew Lenharth	95528943e9	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Chris Lattner	4da4f85090	Add support for matching mem operands. This fixes PR1133, patch by Eli Friedman. This implements CodeGen/Generic/2008-02-20-MatchingMem.ll. llvm-svn: 47428	2008-02-21 05:27:19 +00:00
Chris Lattner	83c93d5afd	Fix a (harmless) bug where vregs were added to the used reg lists for inline asms. Fix PR2078 by marking aliases of registers used when a register is marked used. This prevents EAX from being allocated when AX is listed in the clobber set for the asm. llvm-svn: 47426	2008-02-21 04:55:52 +00:00
Devang Patel	57b4eedad9	assert is more effective reminder than FIXME tag for unimplemented features. llvm-svn: 47388	2008-02-20 18:37:40 +00:00
Duncan Sands	e7b462b329	LegalizeTypes support for scalarizing a vector store and splitting extract_subvector. This fixes nine "make check" testcases, for example 2008-02-04-ExtractSubvector.ll and (partially) CodeGen/Generic/vector.ll. llvm-svn: 47384	2008-02-20 17:38:09 +00:00
Dan Gohman	34fc7dbf5b	Convert Legalize to use the APInt form of ComputeMaskedBits. llvm-svn: 47383	2008-02-20 16:57:27 +00:00
Dan Gohman	360c86aed5	Add explicit keywords. llvm-svn: 47382	2008-02-20 16:44:09 +00:00
Dan Gohman	d0ff91dac5	Convert DAGCombiner to use the APInt form of ComputeMaskedBits. llvm-svn: 47381	2008-02-20 16:33:30 +00:00
Dan Gohman	b717fdaa7b	Use APInt::intersects. llvm-svn: 47380	2008-02-20 16:30:17 +00:00
Anton Korobeynikov	035eaacd1f	Update gcc 4.3 warnings fix patch with recent head changes llvm-svn: 47368	2008-02-20 11:10:28 +00:00
Chris Lattner	2a8037b5f5	Fix an incredibly subtle bug exposed by Ted's change to APInt profiling. AddNodeIDNode does profiling for a ConstantSDNode, but so does SelectionDAG::getConstant. This profiling should be moved to a common static function in ConstantSDNode. llvm-svn: 47359	2008-02-20 06:28:01 +00:00
Devang Patel	295711f583	Add GetResultInst. First step for multiple return value support. llvm-svn: 47348	2008-02-19 22:15:16 +00:00
Evan Cheng	6200c225e0	- When DAG combiner is folding a bit convert into a BUILD_VECTOR, it should check if it's essentially a SCALAR_TO_VECTOR. Avoid turning (v8i16) <10, u, u, u> to <10, 0, u, u, u, u, u, u>. Instead, simply convert it to a SCALAR_TO_VECTOR of the proper type. - X86 now normalize SCALAR_TO_VECTOR to (BIT_CONVERT (v4i32 SCALAR_TO_VECTOR)). Get rid of X86ISD::S2VEC. llvm-svn: 47290	2008-02-18 23:04:32 +00:00
Andrew Lenharth	fedcf477b5	I cannot find a libgcc function for this builtin. Therefore expanding it to a noop (which is how it used to be treated). If someone who knows the x86 backend better than me could tell me how to get a lock prefix on an instruction, that would be nice to complete x86 support. llvm-svn: 47213	2008-02-16 14:46:26 +00:00
Duncan Sands	b289516a71	Teach LegalizeTypes how to expand the operands of br_cc. This fixes 5 "make check" failures. llvm-svn: 47212	2008-02-16 10:29:26 +00:00
Andrew Lenharth	9b254eed32	llvm.memory.barrier, and impl for x86 and alpha llvm-svn: 47204	2008-02-16 01:24:58 +00:00
Dan Gohman	27ae573900	Rename CountMemOperands to ComputeMemOperandsEnd to reflect what it actually does. Simplify CountOperands a little by reusing ComputeMemOperandsEnd. And reword some comments for both. llvm-svn: 47198	2008-02-16 00:36:48 +00:00
Dan Gohman	856c01204b	Revert 47177, which was incorrect. llvm-svn: 47196	2008-02-16 00:25:40 +00:00
Scott Michel	a3cefeaf0c	Make tblgen a little smarter about constants smaller than i32. Currently, tblgen will complain if a sign-extended constant does not fit into a data type smaller than i32, e.g., i16. This causes a problem when certain hex constants are used, such as 0xff for byte masks or immediate xor values. tblgen will try the sign-extended value first and, if the sign extended value would overflow, it tries to see if the unsigned value will fit. Consequently, a software developer can now safely incant: (XORHIr16 R16C:$rA, 0xffff) which is somewhat clearer and more informative than incanting: (XORHIr16 R16C:$rA, (i16 -1)) even if the two are bitwise equivalent. Tblgen also outputs the 64-bit unsigned constant in the generated ISel code when getTargetConstant() is invoked. llvm-svn: 47188	2008-02-15 23:05:48 +00:00
Dan Gohman	c278c4aba0	Skip over the defs and start at the uses when looking for operands with the TIED_TO attribute. llvm-svn: 47177	2008-02-15 20:59:17 +00:00
Dan Gohman	0340d1e2cd	Use the TargetInstrDescr to determine the number of operands that should be checked for the TIED_TO attribute instead of using CountOperands. llvm-svn: 47176	2008-02-15 20:50:13 +00:00
Duncan Sands	5560281c06	Teach LegalizeTypes how to promote the flags in a ret node. These are created as i32 constants but on some platforms i32 is not legal. This fixes 26 "make check" failures, for example Alpha/2005-07-12-TwoMallocCalls.ll. llvm-svn: 47172	2008-02-15 19:34:17 +00:00
Dan Gohman	a36ade5595	Use StoreSDNode::getValue instead of calling getOperand directly with a hard-coded operand number. llvm-svn: 47163	2008-02-15 18:11:59 +00:00
Chris Lattner	558a3ba17f	Fix a miscompilation from Dan's recent apintification. llvm-svn: 47128	2008-02-14 18:48:56 +00:00
Duncan Sands	4c95dbd69f	In TargetLowering::LowerCallTo, don't assert that the return value is zero-extended if it isn't sign-extended. It may also be any-extended. Also, if a floating point value was returned in a larger floating point type, pass 1 as the second operand to FP_ROUND, which tells it that all the precision is in the original type. I think this is right but I could be wrong. Finally, when doing libcalls, set isZExt on a parameter if it is "unsigned". Currently isSExt is set when signed, and nothing is set otherwise. This should be right for all calls to standard library routines. llvm-svn: 47122	2008-02-14 17:28:50 +00:00
Nate Begeman	53e1b3f9d5	Change how FP immediates are handled. 1) ConstantFP is now expand by default 2) ConstantFP is not turned into TargetConstantFP during Legalize if it is legal. This allows ConstantFP to be handled like Constant, allowing for targets that can encode FP immediates as MachineOperands. As a bonus, fix up Itanium FP constants, which now correctly match, and match more constants! Hooray. llvm-svn: 47121	2008-02-14 08:57:00 +00:00
Dan Gohman	7e22a5d8df	Allow the APInt form of ComputeMaskedBits to operate on i128 types. llvm-svn: 47101	2008-02-13 23:13:32 +00:00
Dan Gohman	95d25d39d0	Avoid setting bits that aren't demanded. llvm-svn: 47098	2008-02-13 22:43:25 +00:00

3375 Commits