llvm-project

Commit Graph

Author	SHA1	Message	Date
Dale Johannesen	8b531d2754	Initialize uninitialized variable. llvm-svn: 58057	2008-10-24 01:06:58 +00:00
Duncan Sands	62951678ee	Fix thinko - the operand number has nothing to do with the result number. llvm-svn: 58041	2008-10-23 19:34:23 +00:00
Duncan Sands	8178141378	LegalizeTypes soft-float support for fpow. llvm-svn: 57973	2008-10-22 11:49:09 +00:00
Duncan Sands	578a68a91a	Be nice to CellSPU: for this target getSetCCResultType may return i8, which can result in SELECT nodes for which the type of the condition is i8, but there are no patterns for select with i8 condition. Tweak the LegalizeTypes logic to avoid this as much as possible. This isn't a real fix because it is still perfectly possible to end up with such select nodes - CellSPU needs to be fixed IMHO. llvm-svn: 57968	2008-10-22 09:23:20 +00:00
Duncan Sands	01a1c11218	Port from LegalizeDAG the logic to only generate ADDC/ADDE/SUBC/SUBE if the target supports it. llvm-svn: 57967	2008-10-22 09:07:29 +00:00
Duncan Sands	a1a388cac3	Add some comments explaining the meaning of a boolean that is not of type MVT::i1 in SELECT and SETCC nodes. Relax the LegalizeTypes SELECT condition promotion sanity checks to allow other condition types than i1. llvm-svn: 57966	2008-10-22 09:06:24 +00:00
Duncan Sands	4b6b5fcd80	Temporarily allow the operands of a BUILD_VECTOR to have a different type to the vector element type. This should be fairly harmless because in the past guys like this were being built all over the place (and were cleaned up when I added this check). The reason for relaxing this check is that it helps LegalizeTypes legalize vector shuffles: the mask is a BUILD_VECTOR that it is not always possible to legalize while keeping it a BUILD_VECTOR (vector_shuffle requires the mask to be a BUILD_VECTOR, as opposed to a vector with the right vector type). With this check it is even harder to legalize the mask - turning the check off means that LegalizeTypes manages to legalize almost all vector shuffles encountered in practice. The correct solution is to change vector_shuffle to be a variadic node with the mask built into it as operands. While waiting for that change, this hack stops the problem with vector_shuffle from blocking the turning on of LegalizeTypes. llvm-svn: 57965	2008-10-22 09:00:33 +00:00
Dale Johannesen	28929589e7	Add an SSE2 algorithm for uint64->f64 conversion. The same one Apple gcc uses, faster. Also gets the extreme case in gcc.c-torture/execute/ieee/rbug.c correct which we weren't before; this is not sufficient to get the test to pass though, there is another bug. llvm-svn: 57926	2008-10-21 20:50:01 +00:00
Dan Gohman	8b44b88eff	Fix SelectionDAGBuild lowering of Select instructions to handle first-class aggregate values. Also, fix a bug in the Ret handling for empty aggregates. llvm-svn: 57925	2008-10-21 20:00:42 +00:00
Dan Gohman	269246b034	Don't create TargetGlobalAddress nodes with offsets that don't fit in the 32-bit signed offset field of addresses. Even though this may be intended, some linkers refuse to relocate code where the relocated address computation overflows. Also, fix the sign-extension of constant offsets to use the actual pointer size, rather than the size of the GlobalAddress node, which may be different, for example on x86-64 where MVT::i32 is used when the address is being fit into the 32-bit displacement field. llvm-svn: 57885	2008-10-21 03:38:42 +00:00
Dan Gohman	97d3f6cfe3	Make the NaN test come second, heuristically assuming that NaNs are less common. llvm-svn: 57871	2008-10-21 03:12:54 +00:00
Chris Lattner	4396e0d2c3	Fix gcc.c-torture/compile/920520-1.c by inserting bitconverts for strange asm conditions earlier. In this case, we have a double being passed in an integer reg class. Convert to like sized integer register so that we allocate the right number for the class (two i32's for the f64 in this case). llvm-svn: 57862	2008-10-21 00:45:36 +00:00
Dan Gohman	1a59b3b9b8	Fast-isel no longer an experiment. llvm-svn: 57845	2008-10-20 21:30:12 +00:00
Duncan Sands	aac74a9055	Support operations like fp_to_uint with a vector result type when the result type is legal but not the operand type. Add additional support for EXTRACT_SUBVECTOR and CONCAT_VECTORS, needed to handle such cases. llvm-svn: 57840	2008-10-20 16:31:21 +00:00
Duncan Sands	e0fb87acf6	LegalizeTypes support for atomic operation promotion. llvm-svn: 57838	2008-10-20 16:17:42 +00:00
Duncan Sands	840143fc6f	Use DAG.getIntPtrConstant rather than DAG.getConstant with TLI.getPointerTy for a small simplification. llvm-svn: 57837	2008-10-20 16:14:43 +00:00
Duncan Sands	5805334d5b	Always use either MVT::i1 or getSetCCResultType for the condition of a SELECT node. Make sure that the correct extension type (any-, sign- or zero-extend) is used. llvm-svn: 57836	2008-10-20 16:13:04 +00:00
Duncan Sands	fe9b5550de	Formatting - no functional change. llvm-svn: 57834	2008-10-20 16:06:47 +00:00
Duncan Sands	3ed8b29ace	Don't use a random type for the select condition, use an MVT::i1 and simplify the code while there. llvm-svn: 57833	2008-10-20 16:04:57 +00:00
Bill Wendling	8ec2a4a96c	Set N->OperandList to 0 after deletion. Otherwise, it's possible that it will be either deleted or referenced afterwards. llvm-svn: 57786	2008-10-19 20:51:12 +00:00
Bill Wendling	6c87bfc6fd	Fix comment. Other formatting changes. No functionality changes. llvm-svn: 57785	2008-10-19 20:34:04 +00:00
Duncan Sands	8d11adca4c	Vector shuffle mask elements may be "undef". Handle this everywhere in LegalizeTypes. llvm-svn: 57783	2008-10-19 15:00:25 +00:00
Duncan Sands	c6d12bd665	Use a legal integer type for vector shuffle mask elements. Otherwise LegalizeTypes will, reasonably enough, legalize the mask, which may result in it no longer being a BUILD_VECTOR node (LegalizeDAG simply ignores the legality or not of vector masks). llvm-svn: 57782	2008-10-19 14:58:05 +00:00
Chris Lattner	160e8abd77	Reapply r57699 with a fix to not crash on asms with multiple results. Unlike the previous patch this one actually passes make check. "Fix PR2356 on PowerPC: if we have an input and output that are tied together that have different sizes (e.g. i32 and i64) make sure to reserve registers for the bigger operand." llvm-svn: 57771	2008-10-18 18:49:30 +00:00
Dan Gohman	727a94063c	Don't truncate GlobalAddress offsets to int in debug output. llvm-svn: 57770	2008-10-18 18:22:42 +00:00
Dan Gohman	2fe6bee5b6	Teach DAGCombine to fold constant offsets into GlobalAddress nodes, and add a TargetLowering hook for it to use to determine when this is legal (i.e. not in PIC mode, etc.) This allows instruction selection to emit folded constant offsets in more cases, such as the included testcase, eliminating the need for explicit arithmetic instructions. This eliminates the need for the C++ code in X86ISelDAGToDAG.cpp that attempted to achieve the same effect, but wasn't as effective. Also, fix handling of offsets in GlobalAddressSDNodes in several places, including changing GlobalAddressSDNode's offset from int to int64_t. The Mips, Alpha, Sparc, and CellSPU targets appear to be unaware of GlobalAddress offsets currently, so set the hook to false on those targets. llvm-svn: 57748	2008-10-18 02:06:02 +00:00
Dan Gohman	6de2556205	Revert r57699. It's causing regressions in test/CodeGen/X86/2008-09-17-inline-asm-1.ll and a few others, and it breaks the llvm-gcc build. llvm-svn: 57747	2008-10-18 01:03:45 +00:00
Dan Gohman	d01ddb51ee	Factor out the code for mapping LLVM IR condition opcodes to ISD condition opcodes into helper functions. llvm-svn: 57726	2008-10-17 21:16:08 +00:00
Chris Lattner	aadf7414b2	add support for 128 bit aggregates. llvm-svn: 57715	2008-10-17 19:59:51 +00:00
Mon P Wang	85f48ade9c	Added MemIntrinsicNode which is useful to represent target intrinsics that touches memory and need an associated MemOperand llvm-svn: 57712	2008-10-17 18:22:58 +00:00
Dan Gohman	293abcc91d	Factor out the code for mapping LLVM IR condition opcodes to ISD condition opcodes into helper functions. llvm-svn: 57710	2008-10-17 18:18:45 +00:00
Chris Lattner	052092bf9c	Fix PR2356 on PowerPC: if we have an input and output that are tied together that have different sizes (e.g. i32 and i64) make sure to reserve registers for the bigger operand. llvm-svn: 57699	2008-10-17 17:52:49 +00:00
Chris Lattner	3b1833c9b4	refactor some code into a helper method, no functionality change. llvm-svn: 57690	2008-10-17 17:05:25 +00:00
Chris Lattner	860df6e84c	Keep track of which input constraint matches an output constraint. Reject asms where an output has multiple input constraints tied to it. llvm-svn: 57687	2008-10-17 16:47:46 +00:00
Chris Lattner	ef8901722e	add an assert so that PR2356 explodes instead of running off an array. Improve some minor comments, refactor some helpers in AsmOperandInfo. No functionality change for valid code. llvm-svn: 57686	2008-10-17 16:21:11 +00:00
Dan Gohman	a39b0a1f05	Define patterns for shld and shrd that match immediate shift counts, and patterns that match dynamic shift counts when the subtract is obscured by a truncate node. Add DAGCombiner support for recognizing rotate patterns when the shift counts are defined by truncate nodes. Fix and simplify the code for commuting shld and shrd instructions to work even when the given instruction doesn't have a parent, and when the caller needs a new instruction. These changes allow LLVM to use the shld, shrd, rol, and ror instructions on x86 to replace equivalent code using two shifts and an or in many more cases. llvm-svn: 57662	2008-10-17 01:23:35 +00:00
Evan Cheng	3b0f5e4d61	- Add target lowering hooks that specify which setcc conditions are illegal, i.e. conditions that cannot be checked with a single instruction. For example, SETONE and SETUEQ on x86. - Teach legalizer to implement illegal setcc as a and / or of a number of legal setcc nodes. For now, only implement FP conditions. e.g. SETONE is implemented as SETO & SETNE, SETUEQ is SETUO \| SETEQ. - Move x86 target over. llvm-svn: 57542	2008-10-15 02:05:31 +00:00
Dan Gohman	e7ced74558	FastISel support for exception-handling constructs. - Move the EH landing-pad code and adjust it so that it works with FastISel as well as with SDISel. - Add FastISel support for @llvm.eh.exception and @llvm.eh.selector. llvm-svn: 57539	2008-10-14 23:54:11 +00:00
Evan Cheng	07d53b1d33	Rename LoadX to LoadExt. llvm-svn: 57526	2008-10-14 21:26:46 +00:00
Dan Gohman	9c4b7d5c4f	Fix command-line option printing to print two spaces where needed, instead of requiring all "short description" strings to begin with two spaces. This makes these strings less mysterious, and it fixes some cases where short description strings mistakenly did not begin with two spaces. llvm-svn: 57521	2008-10-14 20:25:08 +00:00
Evan Cheng	da9b752883	FIX PR2794. Make sure SIGN_EXTEND_INREG nodes introduced by LegalizeSetCCOperands are leglized. Patch by Richard Pennington. llvm-svn: 57460	2008-10-13 18:46:18 +00:00
Matthijs Kooijman	43686a6665	* Make TargetLowering not crash when TargetMachine::getTargetAsmInfo() returns null. This assumes that any target that does not have AsmInfo, does not support "LocAndDot". llvm-svn: 57438	2008-10-13 12:41:46 +00:00
Chris Lattner	c52af45304	calls can be supported. llvm-svn: 57428	2008-10-13 01:59:13 +00:00
Chris Lattner	2753955fc0	Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as parameters instead of raw Constants. This prevents the constants from being selected by the isel pass, fixing PR2735. llvm-svn: 57385	2008-10-11 22:08:30 +00:00
Chris Lattner	fb1f4a1329	simplify comparison llvm-svn: 57371	2008-10-11 00:08:02 +00:00
Dale Johannesen	4f0bd68cfe	Add a "loses information" return value to APFloat::convert and APFloat::convertToInteger. Restore return value to IEEE754. Adjust all users accordingly. llvm-svn: 57329	2008-10-09 23:00:39 +00:00
Dale Johannesen	54306fe499	Rename APFloat::convertToAPInt to bitcastToAPInt to make it clearer what the function does. No functional change. llvm-svn: 57325	2008-10-09 18:53:47 +00:00
Dan Gohman	c1d47c56f9	Avoid emitting redundant materializations of integer constants for things like null pointers, which at this level aren't different from regular integer constants. llvm-svn: 57265	2008-10-07 22:03:27 +00:00
Andrew Lenharth	21dca9cbb1	Use Dan's supperior check llvm-svn: 57255	2008-10-07 18:27:23 +00:00
Andrew Lenharth	d69bdaef64	No need for \|= llvm-svn: 57249	2008-10-07 17:11:29 +00:00
Andrew Lenharth	6d409f08be	Use ADDC if it is valid at any smaller size. Do it right this time llvm-svn: 57248	2008-10-07 17:09:16 +00:00
Andrew Lenharth	6606f17e50	Use ADDC if it is valid at any smaller size. fixes test/Codegen/Generic/i128-addsub.ll on x86 llvm-svn: 57247	2008-10-07 17:03:15 +00:00
Andrew Lenharth	3a9be150be	Expand arith on machines without carry flags llvm-svn: 57243	2008-10-07 14:15:42 +00:00
Dan Gohman	bef9b0bef0	Correctly handle calls with no return values. This fixes 2006-01-23-UnionInit on x86-64 when inlining is not enabled. llvm-svn: 57223	2008-10-07 00:12:37 +00:00
Chris Lattner	2416896b3c	wrap some long lines and expand i32 mul's to libcalls, inspired by a patch by Mikael Lepisto! llvm-svn: 57077	2008-10-04 21:27:46 +00:00
Dan Gohman	13b048268b	Fix fast-isel's handling of atomic instructions. They may expand to multiple basic blocks, in which case fast-isel needs to informed of which block to use as it resumes inserting instructions. llvm-svn: 57040	2008-10-04 00:56:36 +00:00
Dale Johannesen	5d60c1ebb1	Pass MemOperand through for 64-bit atomics on 32-bit, incidentally making the case where the memop is a pointer deref work. Fix cmp-and-swap regression. llvm-svn: 57027	2008-10-03 19:41:08 +00:00
Dan Gohman	b62cd7ea98	Use -1ULL instead of uint64_t(-1), at Anton's suggestion. llvm-svn: 57021	2008-10-03 17:56:45 +00:00
Duncan Sands	6e42742d2d	The result of getSetCCResultType (eg: i32) may be larger than the type an i1 is promoted to (eg: i8). Account for this. Noticed by Tilmann Scheller on CellSPU; he will hopefully take care of fixing this in LegalizeDAG and adding a testcase! llvm-svn: 56997	2008-10-03 07:41:46 +00:00
Dan Gohman	4e072a75cc	Implement fast-isel support for zero-extending from i1. It turns out that this is a fairly common operation, and it's easy enough to handle. llvm-svn: 56990	2008-10-03 01:28:47 +00:00
Dan Gohman	1ab1d31f7a	Optimize conditional branches in X86FastISel. This replaces sequences like this: sete %al testb %al, %al jne LBB11_1 with this: je LBB11_1 llvm-svn: 56969	2008-10-02 22:15:21 +00:00
Dale Johannesen	867d549fce	Handle some 64-bit atomics on x86-32, some of the time. llvm-svn: 56963	2008-10-02 18:53:47 +00:00
Dan Gohman	1dd27578dd	Make some implicit conversions explicit, to avoid compiler warnings. llvm-svn: 56927	2008-10-01 19:58:59 +00:00
Dan Gohman	94798d31dd	Fold trivial two-operand tokenfactors where the operands are equal immediately. llvm-svn: 56921	2008-10-01 15:11:19 +00:00
Dan Gohman	3a293e7404	Fix typos in comments. llvm-svn: 56919	2008-10-01 15:07:49 +00:00
Bill Wendling	68f12ee567	Implement the -fno-builtin option in the front-end, not in the back-end. llvm-svn: 56900	2008-10-01 00:59:58 +00:00
Bill Wendling	e818bc159f	- Initialize "--no-builtin" to "false". - Testcase for r56885. llvm-svn: 56886	2008-09-30 21:40:30 +00:00
Bill Wendling	bd09262e97	Add the new `-no-builtin' flag. This flag is meant to mimic the GCC `-fno-builtin' flag. Currently, it's used to replace "memset" with "_bzero" instead of "__bzero" on Darwin10+. This arguably violates the meaning of this flag, but is currently sufficient. The meaning of this flag should become more specific over time. llvm-svn: 56885	2008-09-30 21:22:07 +00:00
Dan Gohman	b486350b15	Move the primary fast-isel top-level comments to FastISel.cpp, where they'll be a little more visible. Also, update and reword them a bit. llvm-svn: 56877	2008-09-30 20:48:29 +00:00
Dan Gohman	86aa16a69a	Optimize SelectionDAG's AssignTopologicalOrder even further. Completely eliminate the TopOrder std::vector. Instead, sort the AllNodes list in place. This also eliminates the need to call AllNodes.size(), a linear-time operation, before performing the sort. Also, eliminate the Sources temporary std::vector, since it essentially duplicates the sorted result as it is being built. This also changes the direction of the topological sort from bottom-up to top-down. The AllNodes list starts out in roughly top-down order, so this reduces the amount of reordering needed. Top-down is also more convenient for Legalize, and ISel needed only minor adjustments. llvm-svn: 56867	2008-09-30 18:30:35 +00:00
Dale Johannesen	f61a84ec43	Remove misuse of ReplaceNodeResults for atomics with valid types. No functional change. llvm-svn: 56808	2008-09-29 22:25:26 +00:00
Dan Gohman	4aa9095398	Fix FastISel to not initialize the PIC-base register multiple times in functions with PIC references from more than one basic block. llvm-svn: 56807	2008-09-29 21:55:50 +00:00
Bill Wendling	c966a737c5	Temporarily reverting r56683. This is causing a failure during the build of llvm-gcc: /Volumes/Gir/devel/llvm/clean/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Gir/devel/llvm/clean/llvm-gcc.obj/./gcc/ -B/Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/bin/ -B/Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/lib/ -isystem /Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/include -isystem /Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/sys-include -mmacosx-version-min=10.4 -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Gir/devel/llvm/clean/llvm.obj/include -I/Volumes/Gir/devel/llvm/clean/llvm.src/include -fexceptions -fvisibility=hidden -DHIDE_EXPORTS -c ../../llvm-gcc.src/gcc/unwind-dw2-fde-darwin.c -o libgcc/./unwind-dw2-fde-darwin.o Assertion failed: (TargetRegisterInfo::isVirtualRegister(regA) && TargetRegisterInfo::isVirtualRegister(regB) && "cannot update physical register live information"), function runOnMachineFunction, file /Volumes/Gir/devel/llvm/clean/llvm.src/lib/CodeGen/TwoAddressInstructionPass.cpp, line 311. ../../llvm-gcc.src/gcc/unwind-dw2.c:1527: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. {standard input}:3521:non-relocatable subtraction expression, "_dwarf_reg_size_table" minus "L20$pb" {standard input}:3521:symbol: "_dwarf_reg_size_table" can't be undefined in a subtraction expression {standard input}:3520:non-relocatable subtraction expression, "_dwarf_reg_size_table" minus "L20$pb" ... llvm-svn: 56703	2008-09-26 22:10:44 +00:00
Dan Gohman	6e0548336a	Rename ConstantSDNode's getSignExtended to getSExtValue, for consistancy with ConstantInt, and re-implement it in terms of ConstantInt's getSExtValue. llvm-svn: 56700	2008-09-26 21:54:37 +00:00
Evan Cheng	d77cbe8947	Fix @llvm.frameaddress codegen. FP elimination optimization should be disabled when frame address is desired. Also add support for depth > 0. llvm-svn: 56683	2008-09-26 19:48:35 +00:00
Dale Johannesen	0e32a2c935	Add "inreg" field to CallSDNode (doesn't increase its size). Adjust various lowering functions to pass this info through from CallInst. Use it to implement sseregparm returns on X86. Remove X86_ssecall calling convention. llvm-svn: 56677	2008-09-26 19:31:26 +00:00
Devang Patel	4c758ea3e0	Large mechanical patch. s/ParamAttr/Attribute/g s/PAList/AttrList/g s/FnAttributeWithIndex/AttributeWithIndex/g s/FnAttr/Attribute/g This sets the stage - to implement function notes as function attributes and - to distinguish between function attributes and return value attributes. This requires corresponding changes in llvm-gcc and clang. llvm-svn: 56622	2008-09-25 21:00:45 +00:00
Dale Johannesen	c50ada2f56	Accept 'inreg' attribute on x86 functions as meaning sse_regparm (i.e. float/double values go in XMM0 instead of ST0). Update documentation to reflect reality. llvm-svn: 56619	2008-09-25 20:47:45 +00:00
Dan Gohman	5e490a7567	Support for i1 XOR in FastISel. It is actually safe because i1 operands are assumed to already by zero-extended. llvm-svn: 56615	2008-09-25 17:22:52 +00:00
Dan Gohman	6975c36c43	Don't print fast-isel debug messages by default. Thanks Chris! llvm-svn: 56614	2008-09-25 17:21:42 +00:00
Dan Gohman	dd920bf3f0	Don't forget the newline in debug output. llvm-svn: 56613	2008-09-25 17:17:27 +00:00
Dan Gohman	32a733e2c7	FastISel support for debug info. llvm-svn: 56610	2008-09-25 17:05:24 +00:00
Richard Pennington	4b35e64504	bug 2812: Segmentation fault on a big emdiam processor. llvm-svn: 56609	2008-09-25 16:15:10 +00:00
Dan Gohman	3663f156f7	Fix a recent fast-isel coverage regression - don't bail out before giving the target a chance to materialize constants. llvm-svn: 56605	2008-09-25 01:28:51 +00:00
Dan Gohman	b8e69f1755	Enable DeadMachineInstructionElim when Fast-ISel is enabled. llvm-svn: 56604	2008-09-25 01:14:49 +00:00
Evan Cheng	2e7450716a	<rdar://problem/6234798> Assertion failed: (!OpInfo.AssignedRegs.Regs.empty() && "Couldn't allocate input reg!") llvm-svn: 56597	2008-09-25 00:14:04 +00:00
Dale Johannesen	86d421df23	Remove SelectionDag early allocation of registers for earlyclobbers. Teach Local RA about earlyclobber, and add some tests for it. llvm-svn: 56592	2008-09-24 23:13:09 +00:00
Bill Wendling	dea91308ae	Reapplying r56550 llvm-svn: 56553	2008-09-24 10:25:02 +00:00
Bill Wendling	162c26dee3	Forgot this part with my last patch. Sorry about the breakage. llvm-svn: 56552	2008-09-24 10:16:24 +00:00
Eric Christopher	4e26a81371	Temporarily revert r56550 until missing commit can be added. llvm-svn: 56551	2008-09-24 08:30:44 +00:00
Bill Wendling	7c31464a0b	Refactor the constant folding code into it's own function. And call it from both the SelectionDAG and DAGCombiner code. The only functionality change is that now the DAG combiner is performing the constant folding for these operations instead of being a no-op. This is not in response to a bug, so there isn't a testcase. llvm-svn: 56550	2008-09-24 07:11:26 +00:00
Dale Johannesen	c36660d756	Next round of earlyclobber handling. Approach the RA problem by expanding the live interval of an earlyclobber def back one slot. Remove overlap-earlyclobber throughout. Remove earlyclobber bits and their handling from live internals. llvm-svn: 56539	2008-09-24 01:07:17 +00:00
Evan Cheng	e0add20c1b	Properly handle 'm' inline asm constraints. If a GV is being selected for the addressing mode, it requires the same logic for PIC relative addressing, etc. llvm-svn: 56526	2008-09-24 00:05:32 +00:00
Devang Patel	ba3fa6c6e1	s/ParameterAttributes/Attributes/g llvm-svn: 56513	2008-09-23 23:03:40 +00:00
Dan Gohman	918fe08a56	Arrange for FastISel code to have access to the MachineModuleInfo object. This will be needed to support debug info. llvm-svn: 56508	2008-09-23 21:53:34 +00:00
Dan Gohman	c07f686665	Replace the LiveRegs SmallSet with a simple counter that keeps track of the number of live registers, which is all the set was being used for. llvm-svn: 56498	2008-09-23 18:50:48 +00:00
Dan Gohman	e2947e1e07	Fix the alignment of loads from constant pool entries when the load address has an offset from the base of the constant pool entry. llvm-svn: 56479	2008-09-22 22:40:08 +00:00
Dale Johannesen	7a74e71489	Make log, log2, log10, exp, exp2 use Expand by default. llvm-svn: 56471	2008-09-22 21:57:32 +00:00
Evan Cheng	13beeeb128	Per review feedback: Only perform (srl x, (trunc (and y, c))) -> (srl x, (and (trunc y), c)) etc. when both "trunc" and "and" have single uses. llvm-svn: 56452	2008-09-22 18:19:24 +00:00
Oscar Fuentes	a229b3c9a7	Initial support for the CMake build system. llvm-svn: 56419	2008-09-22 01:08:49 +00:00
Bill Wendling	91ef8fcd29	Add helper function to get a 32-bit floating point constant. No functionality change. llvm-svn: 56418	2008-09-22 00:44:35 +00:00
Chris Lattner	43f5449c48	don't print GlobalAddressSDNode's with an offset of zero as "foo0". llvm-svn: 56399	2008-09-21 18:38:31 +00:00
Dan Gohman	9801ba451a	Refactor X86SelectConstAddr, folding it into X86SelectAddress. This results in better code for globals. Also, unbreak the local CSE for GlobalValue stub loads. llvm-svn: 56371	2008-09-19 22:16:54 +00:00
Dan Gohman	95be7d7b85	Add a new "fast" scheduler. This is currently basically just a copy of the BURRList scheduler, but with several parts ripped out, such as backtracking, online topological sort maintenance (needed by backtracking), the priority queue, and Sethi-Ullman number computation and maintenance (needed by the priority queue). As a result of all this, it generates somewhat lower quality code, but that's its tradeoff for running about 30% faster than list-burr in -fast mode in many cases. This is somewhat experimental. Moving forward, major pieces of this can be refactored with pieces in common with ScheduleDAGRRList.cpp. llvm-svn: 56307	2008-09-18 16:26:26 +00:00
Dale Johannesen	f8610ebebc	Add a bit to mark operands of asm's that conflict with an earlyclobber operand elsewhere. Propagate this bit and the earlyclobber bit through SDISel. Change linear-scan RA not to allocate regs in a way that conflicts with an earlyclobber. See also comments. llvm-svn: 56290	2008-09-17 21:13:11 +00:00
Dan Gohman	6ab52a8018	Don't worry about clobbering physical register defs that aren't used. llvm-svn: 56281	2008-09-17 15:25:49 +00:00
Evan Cheng	a904f466e8	When converting a CopyFromReg to a copy instruction, use the register class of its uses to determine the right destination register class of the copy. This is important for targets where a physical register may belong to multiple register classes. llvm-svn: 56258	2008-09-16 23:12:11 +00:00
Dan Gohman	64d6c6fe30	Change SelectionDAG::getConstantPool to always set the alignment of the ConstantPoolSDNode, using the target's preferred alignment for the constant type. In LegalizeDAG, when performing loads from the constant pool, the ConstantPoolSDNode's alignment is used in the calls to getLoad and getExtLoad. This change prevents SelectionDAG::getLoad/getExtLoad from incorrectly choosing the ABI alignment for constant pool loads when Alignment == 0. The incorrect alignment is only a performance issue when ABI alignment does not equal preferred alignment (i.e., on x86 it was generating MOVUPS instead of MOVAPS for v4f32 constant loads when the default ABI alignment for 128bit vectors is forced to 1 byte.) Patch by Paul Redmond! llvm-svn: 56253	2008-09-16 22:05:41 +00:00
Bill Wendling	24c79f28b1	Reverting r56249. On further investigation, this functionality isn't needed. Apologies for the thrashing. llvm-svn: 56251	2008-09-16 21:48:12 +00:00
Dan Gohman	ab26f20d44	Include the alignment value when displaying ConstantPoolSDNodes. llvm-svn: 56250	2008-09-16 21:18:22 +00:00
Bill Wendling	8bc392fb1d	- Change "ExternalSymbolSDNode" to "SymbolSDNode". - Add linkage to SymbolSDNode (default to external). - Change ISD::ExternalSymbol to ISD::Symbol. - Change ISD::TargetExternalSymbol to ISD::TargetSymbol These changes pave the way to allowing SymbolSDNodes with non-external linkage. llvm-svn: 56249	2008-09-16 21:12:30 +00:00
Dan Gohman	050d7835c6	Don't take the time to CheckDAGForTailCallsAndFixThem when tail calls are not enabled. Instead just omit the tail call flag when calls are created. llvm-svn: 56235	2008-09-16 01:42:28 +00:00
Dan Gohman	3c7b9ba547	Re-enable SelectionDAG CSE for calls. It matters in the case of libcalls, as in this testcase on ARM. llvm-svn: 56226	2008-09-15 19:46:03 +00:00
Dan Gohman	d3fe174c53	Define CallSDNode, an SDNode subclass for use with ISD::CALL. Currently it just holds the calling convention and flags for isVarArgs and isTailCall. And it has several utility methods, which eliminate magic 5+2*i and similar index computations in several places. CallSDNodes are not CSE'd. Teach UpdateNodeOperands to handle nodes that are not CSE'd gracefully. llvm-svn: 56183	2008-09-13 01:54:27 +00:00
Dan Gohman	ec270fb640	Change ConstantSDNode and ConstantFPSDNode to use ConstantInt* and ConstantFP* instead of APInt and APFloat directly. This reduces the amount of time to create ConstantSDNode and ConstantFPSDNode nodes when ConstantInt* and ConstantFP* respectively are already available, as is the case in SelectionDAGBuild.cpp. Also, it reduces the amount of time to legalize constants into constant pools, and the amount of time to add ConstantFP operands to MachineInstrs, due to eliminating ConstantInt::get and ConstantFP::get calls. It increases the amount of work needed to create new constants in cases where the client doesn't already have a ConstantInt* or ConstantFP*, such as legalize expanding 64-bit integer constants to 32-bit constants. And it adds a layer of indirection for the accessor methods. But these appear to be outweight by the benefits in most cases. It will also make it easier to make ConstantSDNode and ConstantFPNode more consistent with ConstantInt and ConstantFP. llvm-svn: 56162	2008-09-12 18:08:03 +00:00
Dale Johannesen	1f3ab86804	Pass "earlyclobber" bit through to machine representation; coalescer and RA need to know about it. No functional change. llvm-svn: 56161	2008-09-12 17:49:03 +00:00
Dan Gohman	effb894453	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Dale Johannesen	baf6762e26	The sequence for ppcf128 compares was not IEEE safe in the presence of NaNs. llvm-svn: 56136	2008-09-12 00:30:56 +00:00
Dan Gohman	1dc9b0514f	FastISel support for i1 PHI nodes. llvm-svn: 56069	2008-09-10 21:01:31 +00:00
Dan Gohman	940bafb687	FastISel support for i1 constants. llvm-svn: 56068	2008-09-10 21:01:08 +00:00
Dan Gohman	39d82f902a	Add X86FastISel support for static allocas, and refences to static allocas. As part of this change, refactor the address mode code for laods and stores. llvm-svn: 56066	2008-09-10 20:11:02 +00:00
Dan Gohman	222018da7b	Add a break statement that I accidentally deleted when I shuffled the fast-isel command-line options around. This fixes a bunch of fast-isel failures. llvm-svn: 56057	2008-09-10 15:52:34 +00:00
Bill Wendling	6987fec11c	Remove unnecessary bit-wise AND from the limited precision work. llvm-svn: 56049	2008-09-10 06:26:10 +00:00
Daniel Dunbar	999096065f	Fix 80 col violation. llvm-svn: 56048	2008-09-10 04:16:29 +00:00
Bill Wendling	eb1db169bf	Check that both operands are f32 before attempting to lower. llvm-svn: 56036	2008-09-10 00:24:59 +00:00
Bill Wendling	648930b9ba	Implement "visitPow". This is mainly used to see if we have a pow() call of this form: powf(10.0f, x); If this is the case, and also we want limited precision floating-point calculations, then lower to do the limited-precision stuff. llvm-svn: 56035	2008-09-10 00:20:20 +00:00
Evan Cheng	0fff397a13	A few more places where FPOW is being ignored. llvm-svn: 56032	2008-09-09 23:35:53 +00:00
Dan Gohman	b4c0295b8e	Change -fast-isel-no-abort to -fast-isel-abort, which now defaults to being off by default. Also, add assertion checks to check that the various fast-isel-related command-line options are only used when -fast-isel itself is enabled. llvm-svn: 56029	2008-09-09 23:05:00 +00:00
Evan Cheng	f4e5de4583	Legalizer was missing code that expand fpow to a libcall. llvm-svn: 56028	2008-09-09 23:02:14 +00:00
Bill Wendling	ab6676a46a	Adding 6-, 12-, and 18-bit limited-precision floating-point support for exp2 function. llvm-svn: 56025	2008-09-09 22:39:21 +00:00
Bill Wendling	48217d89b4	Add support for 6-, 12-, and 18-bit limited precision calculations of exp for floating-point numbers. llvm-svn: 56023	2008-09-09 22:13:54 +00:00
Dan Gohman	91491b51e2	Add a new option, -fast-isel-verbose, that can be used with -fast-isel-no-abort to get a dump of all unhandled instructions, without an abort. llvm-svn: 56021	2008-09-09 22:06:46 +00:00
Owen Anderson	4a58bd331b	Clean this up, based on Evan's suggestions. llvm-svn: 56009	2008-09-09 20:47:17 +00:00
Bill Wendling	ed3bb7888d	- Add support for 6-, 12-, and 18-bit limited precision floating-point "log" values. - Refactored some of the code. llvm-svn: 56008	2008-09-09 20:39:27 +00:00
Anton Korobeynikov	1a1140429e	Make safer variant of alias resolution routine to be default llvm-svn: 56005	2008-09-09 20:05:04 +00:00
Bill Wendling	faeb4b6755	Add limited precision floating-point conversions of log10 for 6- and 18-bit precisions. llvm-svn: 56000	2008-09-09 18:42:23 +00:00
Owen Anderson	8529085f4f	Check for type legality before materializing integer constants in fast isel. With this change, all of MultiSource/Applications passes on Darwin/X86 under FastISel. llvm-svn: 55982	2008-09-09 06:32:02 +00:00
Dan Gohman	b6aef419b4	Remove the code that protected FastISel from aborting in the case of loads, stores, and conditional branches. It can handle those now, so any that aren't handled should trigger the abort. llvm-svn: 55977	2008-09-09 02:40:04 +00:00
Evan Cheng	1e97901388	Fix a constant lowering bug. Now we can do load and store instructions with funky getelementptr embedded in the address operand. llvm-svn: 55975	2008-09-09 01:26:59 +00:00
Bill Wendling	484167851a	Add support for floating-point calculations of log2 with limited precisions of 6 and 18. llvm-svn: 55968	2008-09-09 00:28:24 +00:00
Anton Korobeynikov	45165ed1ac	Reapply 55904: Unbreak and fix indentation llvm-svn: 55958	2008-09-08 21:13:56 +00:00
Dan Gohman	a333f3ccb8	Fix a few I's that were meant to be renamed to BI's. llvm-svn: 55942	2008-09-08 20:37:59 +00:00
Dale Johannesen	67f99f1454	Redo the 3 existing low-precision expansions to use float constants. An oversight by the numerics people who supplied this. llvm-svn: 55930	2008-09-08 18:00:26 +00:00
Bill Wendling	99b83712f3	Reverting r55898 to r55909. One of these patches was causing an ICE during the full bootstrap on Darwin: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/sys-include -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DSHARED -m64 -DL_negdi2 -c ../../llvm-gcc.src/gcc/libgcc2.c -o libgcc/x86_64/_negdi2_s.o Assertion failed: (TargetRegisterInfo::isVirtualRegister(regA) && TargetRegisterInfo::isVirtualRegister(regB) && "cannot update physical register live information"), function runOnMachineFunction, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/lib/CodeGen/TwoAddressInstructionPass.cpp, line 311. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/sys-include -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DSHARED -m64 -DL_lshrdi3 -c ../../llvm-gcc.src/gcc/libgcc2.c -o libgcc/x86_64/_lshrdi3_s.o ../../llvm-gcc.src/gcc/unwind-dw2.c:1527: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. {standard input}:unknown:Undefined local symbol LBB21_11 {standard input}:unknown:Undefined local symbol LBB21_12 {standard input}:unknown:Undefined local symbol LBB21_13 {standard input}:unknown:Undefined local symbol LBB21_8 llvm-svn: 55928	2008-09-08 17:59:12 +00:00
Dan Gohman	1df80f6b1c	In visitUREM, arrange for the temporary UDIV node to be revisited, consistent with the code in visitSREM. llvm-svn: 55923	2008-09-08 16:59:01 +00:00
Daniel Dunbar	ede2d7d745	Add VISIBILITY_HIDDEN on SDISelAsmOperandInfo llvm-svn: 55922	2008-09-08 16:56:08 +00:00
Dan Gohman	e19bc1844f	Fix the string for ISD::UDIVREM. llvm-svn: 55917	2008-09-08 16:30:29 +00:00
Evan Cheng	24776b554d	Avoid redefinition and nnbreak windows build. llvm-svn: 55911	2008-09-08 16:01:27 +00:00
Anton Korobeynikov	6a73698a85	Unbreak and fix indentation llvm-svn: 55904	2008-09-08 14:23:34 +00:00
Evan Cheng	e775d3526c	Add fast isel physical register definition support. llvm-svn: 55892	2008-09-08 08:38:20 +00:00
Bill Wendling	5f7371d7b1	Revert my previous change -- the subtraction of two constants was a no-op before. This is taken care of in the selection DAG pass. In my opinion, this should be in one place or the other. I.e., it should probably be removed from the DAG combiner (along with the other arithmetic transformations on constants that are essentially no-ops). llvm-svn: 55889	2008-09-08 01:56:32 +00:00
Bill Wendling	df81749886	Convert // fold (sub c1, c2) -> c1-c2 from a no-op into an actual transformation. llvm-svn: 55886	2008-09-07 11:34:47 +00:00
Evan Cheng	b9a0abb129	Indentation. llvm-svn: 55880	2008-09-07 09:04:52 +00:00
Evan Cheng	615739b991	- Doh. Pass vector by value is bad. - Add a AnalyzeCallResult specialized for calls which produce a single value. This is used by fastisel. llvm-svn: 55879	2008-09-07 09:02:18 +00:00
Dale Johannesen	36d532abd6	Next limited float precision expansion (log2 12 bits) llvm-svn: 55866	2008-09-05 23:49:37 +00:00
Owen Anderson	1dd2e40521	Revert r55859. This is breaking the build in the abscence of its companion commit. llvm-svn: 55865	2008-09-05 23:36:01 +00:00
Dan Gohman	f17a2f3602	Move the code that inserts copies for function livein registers out of ScheduleDAGEmit.cpp and into SelectionDAGISel.cpp. This allows it to be run exactly once per function, even if multiple SelectionDAG iterations happen in the entry block, as may happen with FastISel. llvm-svn: 55863	2008-09-05 22:59:21 +00:00
Dale Johannesen	d4dac0e9ea	Add the next limited-precision expansion. llvm-svn: 55856	2008-09-05 21:27:19 +00:00
Dan Gohman	fd634599dc	FastISel support for AND and OR with type i1. llvm-svn: 55846	2008-09-05 18:44:22 +00:00
Dale Johannesen	520143e563	Add hooks for other intrinsics to get low-precision expansions. llvm-svn: 55845	2008-09-05 18:38:42 +00:00
Dan Gohman	fcf545690c	FastISel support for ConstantExprs. llvm-svn: 55843	2008-09-05 18:18:20 +00:00
Dan Gohman	677c3afbd1	Revert r55817. It broke PIC. FastISel will need to find a different approach here. llvm-svn: 55842	2008-09-05 18:13:01 +00:00
Evan Cheng	6b8fae1777	Add a variant of AnalyzeCallOperands that can be used by fast isel. llvm-svn: 55838	2008-09-05 16:59:26 +00:00
Duncan Sands	4d50e984bb	"Fix" PR2762. The testcase now crashes codegen elsewhere due to a missing pattern for v2f64 = sint_to_fp v2i32. That is PR2687. llvm-svn: 55828	2008-09-05 08:13:35 +00:00
Dan Gohman	921ddd69ba	Fix a search+replace-o. llvm-svn: 55824	2008-09-05 01:58:21 +00:00
Dale Johannesen	f2a52bbee5	Add -flimit-float-precision to enable some faster, but less accurate (non-IEEE) code sequences for certain math library functions. Add the first of several such expansions. Don't worry, if you don't turn it on it won't affect you. llvm-svn: 55823	2008-09-05 01:48:15 +00:00
Dan Gohman	ea56bdde34	FastISel support for unreachable. llvm-svn: 55818	2008-09-05 01:08:41 +00:00
Dan Gohman	5b4a9f4a69	In FastISel mode, the scheduler may be invoked multiple times in the same block. Fix the entry-block handling to only run at at the beginning of the entry block, and not any other times. llvm-svn: 55817	2008-09-05 01:07:48 +00:00
Owen Anderson	50288e3c99	Add initial support for selecting constant materializations that require constant pool loads on X86 in fast isel. This isn't actually used yet. llvm-svn: 55814	2008-09-05 00:06:23 +00:00
Dan Gohman	5eba3bcac6	Add an include of SmallSet.h. llvm-svn: 55793	2008-09-04 20:49:27 +00:00
Dan Gohman	a79db30d28	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Dan Gohman	634412fe35	Clean up uses of TargetLowering::getTargetMachine. llvm-svn: 55769	2008-09-04 15:39:15 +00:00
Dale Johannesen	da2d80688b	Add intrinsics for log, log2, log10, exp, exp2. No functional change (and no FE change to generate them). llvm-svn: 55753	2008-09-04 00:47:13 +00:00
Dan Gohman	e039d5580e	Do trivial local CSE for constants and other non-Instruction values in FastISel. llvm-svn: 55748	2008-09-03 23:32:19 +00:00
Dan Gohman	45df9951f5	Put RegsForValue in the llvm namespace to avoid warnings about classes in the llvm namespace having members with types from anonymous namespaces. llvm-svn: 55747	2008-09-03 23:18:39 +00:00
Dan Gohman	7bda51f5a4	Create HandlePHINodesInSuccessorBlocksFast, a version of HandlePHINodesInSuccessorBlocks that works FastISel-style. This allows PHI nodes to be updated correctly while using FastISel. This also involves some code reorganization; ValueMap and MBBMap are now members of the FastISel class, so they needn't be passed around explicitly anymore. Also, SelectInstructions is changed to SelectInstruction, and only does one instruction at a time. llvm-svn: 55746	2008-09-03 23:12:08 +00:00
Owen Anderson	b1b9398ea7	Oops, I accidentally broke the fallback case with my last commit. llvm-svn: 55704	2008-09-03 17:51:57 +00:00
Owen Anderson	ea666816c2	Fix an issue where we were reusing materializations of constants in blocks not dominated by the materialization. This is the simple fix, materializing the constant before every use. It might be better to either track domination of uses or to materialize all constants and the beginning of the function and let remat sort when to do materialization at uses. llvm-svn: 55703	2008-09-03 17:37:03 +00:00
Dan Gohman	575fad337c	Split the SelectionDAG-building code, including the FunctionLoweringInfo and SelectionDAGLowering classes, out of SelectionDAGISel.cpp and put it in a separate file, SelectionDAGBuild.cpp. llvm-svn: 55701	2008-09-03 16:12:24 +00:00
Dan Gohman	b10f1a5c60	Separate MachineInstr-emitting routines from actual scheduling routines and move them into a separate file, ScheduleDAGEmit.cpp. llvm-svn: 55699	2008-09-03 16:01:59 +00:00
Evan Cheng	31ddd09f4a	If TargetSelectInstruction returns true, move to next instruction. llvm-svn: 55692	2008-09-03 06:43:41 +00:00
Evan Cheng	09ff2e7372	80 col violations. llvm-svn: 55668	2008-09-02 21:59:13 +00:00
Dan Gohman	115267fdc6	Ensure that HandlePHINodesInSuccessorBlocks is run for all blocks, even in FastISel mode in the case where FastISel successfully selects all the instructions. llvm-svn: 55641	2008-09-02 20:17:56 +00:00
Gabor Greif	9c64e61176	Provide two overloads of AnalyzeNewNode. The first can update the SDNode in an SDValue while the second is called with SDNode* and returns a possibly updated SDNode*. This patch has no intended functional impact, but helps eliminating ugly temporary SDValues. llvm-svn: 55608	2008-09-01 15:10:19 +00:00
Duncan Sands	4b31a2a7ce	Even though no caller actually uses the new value (what matters is that it is added to the worklist), it seems more logical to return it. llvm-svn: 55606	2008-09-01 13:11:13 +00:00
Bill Wendling	11284ea499	Another situation where ROTR is cheaper than ROTL. llvm-svn: 55577	2008-08-31 01:13:31 +00:00
Bill Wendling	4822a7ac8a	For this pattern, ROTR is the cheaper option. llvm-svn: 55576	2008-08-31 01:04:56 +00:00
Bill Wendling	fc72416447	- Fix comment so that it describes how the code really works: // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotl x, y) // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotr x, (sub 32, y)) Example: (x == 0xDEADBEEF and y == 4) (x << 4) \| (x >> 28) => 0xEADBEEF0 \| 0x0000000D => 0xEADBEEFD (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => 0xEADBEEFD - Fix comment and code for second version. It wasn't using the rot* propertly. // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotr x, y) // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotl x, (sub 32, y)) (x << 28) \| (x >> 4) => 0xD0000000 \| 0x0DEADBEE => 0xDDEADBEE (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => (0xEADBEEFD) llvm-svn: 55575	2008-08-31 00:37:27 +00:00
Gabor Greif	66ccf603a9	typo llvm-svn: 55574	2008-08-30 22:16:05 +00:00
Gabor Greif	e12264bf41	fix some 80-col violations llvm-svn: 55571	2008-08-30 19:29:20 +00:00
Evan Cheng	cfb7f3abdf	Transform (x << (y&31)) -> (x << y). This takes advantage of the fact x86 shift instructions 2nd operand (shift count) is limited to 0 to 31 (or 63 in the x86-64 case). llvm-svn: 55558	2008-08-30 02:03:58 +00:00
Owen Anderson	6f0c51d9da	Fix an issue where a use might be selected before a def, and then we didn't respect the pre-chosen vreg assignment when selecting the def. This is the naive solution to the problem: insert a copy to the pre-chosen vreg. Other solutions might be preferable, such as: 1) Passing the dest reg into FastEmit_. However, this would require the higher level code to know about reg classes, which they don't currently. 2) Selecting blocks in reverse postorder. This has some compile time cost for computing the order, and we'd need to measure its impact. llvm-svn: 55555	2008-08-30 00:38:46 +00:00
Evan Cheng	894be333f1	Fix 80 col. violations. llvm-svn: 55551	2008-08-29 23:20:46 +00:00
Evan Cheng	5e7658c2e4	Back out 55498. It broken Apple style bootstrapping. llvm-svn: 55549	2008-08-29 22:21:44 +00:00
Dan Gohman	d58f3e36d0	Add a target callback for FastISel. llvm-svn: 55512	2008-08-28 23:21:34 +00:00
Gabor Greif	f304a7aa4d	erect abstraction boundaries for accessing SDValue members, rename Val -> Node to reflect semantics llvm-svn: 55504	2008-08-28 21:40:38 +00:00
Dan Gohman	c45733f194	Implement null and undef values for FastISel. llvm-svn: 55500	2008-08-28 21:19:07 +00:00
Dan Gohman	f27e33baa7	Optimize DAGCombiner's worklist processing. Previously it started its work by putting all nodes in the worklist, requiring a big dynamic allocation. Now, DAGCombiner just iterates over the AllNodes list and maintains a worklist for nodes that are newly created or need to be revisited. This allows the worklist to stay small in most cases, so it can be a SmallVector. This has the side effect of making DAGCombine not miss a folding opportunity in alloca-align-rounding.ll. llvm-svn: 55498	2008-08-28 21:01:56 +00:00
Dan Gohman	17da671922	Move CaseBlock, JumpTable, and BitTestBlock to be members of SelectionDAGLowering instead of being in an anonymous namespace. This fixes warnings about SelectionDAGLowering having fields using anonymous namespaces. llvm-svn: 55497	2008-08-28 20:38:18 +00:00
Dan Gohman	360c57f683	Fix a FastISel bug where the instructions from lowering the arguments were being emitted after the first instructions of the entry block. llvm-svn: 55496	2008-08-28 20:28:56 +00:00
Rafael Espindola	6c8a99a778	Reduce the size of the Parts vector. llvm-svn: 55483	2008-08-28 18:29:58 +00:00
Owen Anderson	d8a82b75e2	Hook up support for fast-isel of trunc instructions, using the newly working support for EXTRACT_SUBREG. llvm-svn: 55482	2008-08-28 18:26:01 +00:00
Owen Anderson	9cd1a5e530	FastEmitInst_extractsubreg doesn't need to be passed the register class. It can get it from MachineRegisterInfo instead. llvm-svn: 55476	2008-08-28 17:47:37 +00:00
Rafael Espindola	029c1c8460	Correctly resize the Parts array. llvm-svn: 55471	2008-08-28 14:24:45 +00:00
Dale Johannesen	41be0d4445	Split the ATOMIC NodeType's to include the size, e.g. ATOMIC_LOAD_ADD_{8,16,32,64} instead of ATOMIC_LOAD_ADD. Increased the Hardcoded Constant OpActionsCapacity to match. Large but boring; no functional change. This is to support partial-word atomics on ppc; i8 is not a valid type there, so by the time we get to lowering, the ATOMIC_LOAD nodes looks the same whether the type was i8 or i32. The information can be added to the AtomicSDNode, but that is the largest SDNode; I don't fully understand the SDNode allocation, but it is sensitive to the largest node size, so increasing that must be bad. This is the alternative. llvm-svn: 55457	2008-08-28 02:44:49 +00:00
Dan Gohman	e1a9a780a5	Reorganize the lifetimes of the major objects SelectionDAGISel works with. SelectionDAG, FunctionLoweringInfo, and SelectionDAGLowering objects now get created once per SelectionDAGISel instance, and can be reused across blocks and across functions. Previously, they were created and destroyed each time they were needed. This reorganization simplifies the handling of PHI nodes, and also SwitchCases, JumpTables, and BitTestBlocks. This simplification has the side effect of fixing a bug in FastISel where successor PHI nodes weren't being updated correctly. This is also a step towards making the transition from FastISel into and out of SelectionDAG faster, and also making plain SelectionDAG faster on code with lots of little blocks. llvm-svn: 55450	2008-08-27 23:52:12 +00:00
Owen Anderson	5f57bc2247	Add a helper method that will be used to support EXTRACT_SUBREG for selecting trunc's in fast-isel. llvm-svn: 55439	2008-08-27 22:30:02 +00:00
Dan Gohman	61cfa3095d	Fix FastISel's bitcast code for the case where getRegForValue fails. llvm-svn: 55431	2008-08-27 20:41:38 +00:00
Owen Anderson	90609850b2	Use TargetLowering to get the types in fast isel, which handles pointer types correctly for our purposes. llvm-svn: 55428	2008-08-27 18:58:30 +00:00
Dan Gohman	d01789be23	Don't check TLI.getOperationAction. The FastISel way is to just try to do the action and let the tablegen-generated code determine if there is target-support for an operation. llvm-svn: 55427	2008-08-27 18:15:05 +00:00
Dan Gohman	b0b5a27438	Add a new FastISel method, getRegForValue, which takes care of the details of materializing constants and other values into registers, and make use of it in several places. llvm-svn: 55426	2008-08-27 18:10:19 +00:00
Dan Gohman	f2a6c1579f	Add a comment about the current floating-point constant code in FastISel. llvm-svn: 55425	2008-08-27 18:01:42 +00:00
Dan Gohman	3a3a52de58	Optimize ScheduleDAGRRList's topological sort to use one pass instead of two, and to not need a scratch std::vector. Also, compute the ordering immediately in the result array, instead of in another scratch std::vector that is copied to the result array. llvm-svn: 55421	2008-08-27 16:29:48 +00:00
Dan Gohman	9cbdedcbcf	Optimize ScheduleDAG's ComputeDepths and ComputeHeights to not need a scratch std::vector. llvm-svn: 55420	2008-08-27 16:27:25 +00:00
Dan Gohman	5ca269e684	Basic FastISel support for floating-point constants. llvm-svn: 55401	2008-08-27 01:09:54 +00:00
Owen Anderson	54aff7bb23	Fix handling of inttoptr and ptrtoint when unhandled operands are present. llvm-svn: 55400	2008-08-27 00:35:37 +00:00
Owen Anderson	140549256f	Add support for fast isel of inttoptr and ptrtoint in the cases where truncation is not needed. llvm-svn: 55399	2008-08-27 00:31:01 +00:00
Owen Anderson	ca1711a5b5	Factor out a large amoutn of the cast handling code in fast isel into helper methods. This simultaneously makes the code simpler and adds support for sext as well. llvm-svn: 55398	2008-08-26 23:46:32 +00:00
Owen Anderson	343310a715	Add support for fast isel of zext. llvm-svn: 55396	2008-08-26 23:14:49 +00:00
Gabor Greif	abfdf928d8	disallow direct access to SDValue::ResNo, provide a getter instead llvm-svn: 55394	2008-08-26 22:36:50 +00:00
Owen Anderson	655c1dc63d	Add support for fptosi of constants in fast isel. llvm-svn: 55393	2008-08-26 22:34:28 +00:00
Dan Gohman	d56f73f2f2	Optimize SelectionDAG's topological sort to use one pass instead of two, and to not need a scratch std::vector. Also, use the SelectionDAG's topological sort in LegalizeDAG instead of having a separate implementation. llvm-svn: 55389	2008-08-26 21:42:18 +00:00
Dan Gohman	6fda9208d9	Refactor the bitcast code into its own function. llvm-svn: 55387	2008-08-26 21:28:54 +00:00
Dan Gohman	b5e04bfb18	Make FastISel use the correct argument type when casting GEP indices. llvm-svn: 55384	2008-08-26 20:57:08 +00:00
Dan Gohman	3bcbbece19	Don't select binary instructions with illegal types. llvm-svn: 55383	2008-08-26 20:52:40 +00:00
Owen Anderson	3c4dc434ee	Add support for fast isel of sitofp, and remove some unnecessary and imprecise legality checks. llvm-svn: 55381	2008-08-26 20:37:00 +00:00
Owen Anderson	e0ac9765b2	Use a combination of copyRegToReg and ISD::BIT_CONVERT when doing fast isel of bitcasts, allowing it to support the full range of conversions people might ask for in a correct manner. llvm-svn: 55378	2008-08-26 18:51:24 +00:00
Owen Anderson	27fb3dcbc7	Make TargetInstrInfo::copyRegToReg return a bool indicating whether the copy requested was inserted or not. This allows bitcast in fast isel to properly handle the case where an appropriate reg-to-reg copy is not available. llvm-svn: 55375	2008-08-26 18:03:31 +00:00
Owen Anderson	bf05ebaccf	Add support for fast isel of non-constant fptosi instructions. llvm-svn: 55373	2008-08-26 17:44:42 +00:00
Chris Lattner	54ef9f5831	typo fix. llvm-svn: 55355	2008-08-26 06:07:47 +00:00
Dan Gohman	2e834906b9	Actually recycle SDNode allocations. SelectionDAG is using RecyclingAllocator, but this change is needed for the nodes to actually be recycled. This cuts SelectionDAG's memory usage high-water-mark in half in some cases. llvm-svn: 55351	2008-08-26 01:44:34 +00:00
Owen Anderson	8dd01ccdd8	Add a RetVT parameter to emitted FastISel methods, so that we will be able to pass the desired return type down. This is not currently used. llvm-svn: 55345	2008-08-25 23:58:18 +00:00
Evan Cheng	2c067325d6	Unbreak build. llvm-svn: 55342	2008-08-25 22:20:39 +00:00
Owen Anderson	126afc5cb9	Expand bitcast support in fast isel to support bitcasts of non-constant values by emitting reg-reg copies. llvm-svn: 55340	2008-08-25 21:32:34 +00:00
Owen Anderson	32635dbfb2	Add support for fast isel of (integer) immediate materialization pattens, and use them to support bitcast of constants in fast isel. llvm-svn: 55325	2008-08-25 20:20:32 +00:00
Chris Lattner	f4bd5cf3dd	make sure to flush the stream after dumping, to make sure it goes out immediately. llvm-svn: 55288	2008-08-24 18:28:30 +00:00
Chris Lattner	838aff36dd	get MachineConstantPool off std::ostream, onto raw_ostream. It would be really nice if someone converted MachineFunction::print to raw_ostream. llvm-svn: 55268	2008-08-23 22:53:13 +00:00
Chris Lattner	0c19df4871	Switch the asmprinter (.ll) and all the stuff it requires over to use raw_ostream instead of std::ostream. Among other goodness, this speeds up llvm-dis of kc++ with a release build from 0.85s to 0.49s (88% faster). Other interesting changes: 1) This makes Value::print be non-virtual. 2) AP[S]Int and ConstantRange can no longer print to ostream directly, use raw_ostream instead. 3) This fixes a bug in raw_os_ostream where it didn't flush itself when destroyed. 4) This adds a new SDNode::print method, instead of only allowing "dump". A lot of APIs have both std::ostream and raw_ostream versions, it would be useful to go through and systematically anihilate the std::ostream versions. This passes dejagnu, but there may be minor fallout, plz let me know if so and I'll fix it. llvm-svn: 55263	2008-08-23 22:23:09 +00:00
Dan Gohman	48a3623591	Make MBBMap a DenseMap instead of a std::map. llvm-svn: 55220	2008-08-23 02:44:46 +00:00
Dan Gohman	eb0cee91f6	Move the point at which FastISel taps into the SelectionDAGISel process up to a higher level. This allows FastISel to leverage more of SelectionDAGISel's infastructure, such as updating Machine PHI nodes. Also, implement transitioning from SDISel back to FastISel in the middle of a block, so it's now possible to go back and forth. This allows FastISel to hand individual CallInsts and other complicated things off to SDISel to handle, while handling the rest of the block itself. To help support this, reorganize the SelectionDAG class so that it is allocated once and reused throughout a function, instead of being completely reallocated for each block. llvm-svn: 55219	2008-08-23 02:25:05 +00:00
Dan Gohman	95d1056831	Avoid creating shift-by-zero SDNodes in the common case of i8* getelementptr. DAGCombine eliminates these, but this is a fairly common case. llvm-svn: 55214	2008-08-23 01:06:51 +00:00
Dan Gohman	ac37f9a9be	Move SelectionDAG's constructor out of line. llvm-svn: 55212	2008-08-23 00:50:30 +00:00
Dan Gohman	2db3f8a095	Reapply r55191 and r55192. llvm-svn: 55205	2008-08-22 21:28:19 +00:00
Bill Wendling	fc4f64eed0	Reverting r55190, r55191, and r55192. They broke the build with this error message: {standard input}:17:bad register name `%sil' make[4]: * [libgcc/./_addvsi3.o] Error 1 make[4]: * Waiting for unfinished jobs.... {standard input}:23:bad register name `%dil' {standard input}:28:bad register name `%dil' make[4]: * [libgcc/./_addvdi3.o] Error 1 {standard input}:18:bad register name `%sil' make[4]: * [libgcc/./_subvsi3.o] Error 1 llvm-svn: 55200	2008-08-22 20:51:05 +00:00
Dan Gohman	04968da460	Fix the InsertBranch call. llvm-svn: 55192	2008-08-22 19:26:10 +00:00
Dan Gohman	87ff7058e7	Support non-fallthrough unconditional branches in FastISel. llvm-svn: 55191	2008-08-22 19:21:41 +00:00
Dan Gohman	a2292c0d34	Add FastISel support for PHINodes. Machine PHI nodes are not yet updated properly, but that's a separate task. llvm-svn: 55187	2008-08-22 17:37:48 +00:00
Dan Gohman	49e19e906f	Factor out the predicate check code from DAGISelEmitter.cpp and use it in FastISelEmitter.cpp, and make FastISel subtarget aware. Among other things, this lets it work properly on x86 targets that don't have SSE, where it successfully selects x87 instructions. llvm-svn: 55156	2008-08-22 00:20:26 +00:00
Dan Gohman	2af34bd309	Add libcalls for the new rounding opcodes. llvm-svn: 55133	2008-08-21 18:38:14 +00:00
Dan Gohman	c6337ac069	Add libm-oriented ISD opcodes for rounding operations. llvm-svn: 55130	2008-08-21 17:55:02 +00:00

... 3 4 5 6 7 ...

3074 Commits