llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	0e8d1454b1	Delete #if 0 code accidentally left in. llvm-svn: 143179	2011-10-28 01:41:21 +00:00
Dan Gohman	4db3f7dd83	Eliminate LegalizeOps' LegalizedNodes map and have it just call RAUW on every node as it legalizes them. This makes it easier to use hasOneUse() heuristics, since unneeded nodes can be removed from the DAG earlier. Make LegalizeOps visit the DAG in an operands-last order. It previously used operands-first, because LegalizeTypes has to go operands-first, and LegalizeTypes used to be part of LegalizeOps, but they're now split. The operands-last order is more natural for several legalization tasks. For example, it allows lowering code for nodes with floating-point or vector constants to see those constants directly instead of seeing the lowered form (often constant-pool loads). This makes some things somewhat more complicated today, though it ought to allow things to be simpler in the future. It also fixes some bugs exposed by Legalizing using RAUW aggressively. Remove the part of LegalizeOps that attempted to patch up invalid chain operands on libcalls generated by LegalizeTypes, since it doesn't work with the new LegalizeOps traversal order. Instead, define what LegalizeTypes is doing to be correct, and transfer the responsibility of keeping calls from having overlapping calling sequences into the scheduler. Teach the scheduler to model callseq_begin/end pairs as having a physical register definition/use to prevent calls from having overlapping calling sequences. This is also somewhat complicated, though there are ways it might be simplified in the future. This addresses rdar://9816668, rdar://10043614, rdar://8434668, and others. Please direct high-level questions about this patch to management. llvm-svn: 143177	2011-10-28 01:29:32 +00:00
Eli Friedman	e9e356ad6b	Don't crash on 128-bit sdiv by constant. Found by inspection. llvm-svn: 143095	2011-10-27 02:06:39 +00:00
Lang Hames	58dba012b6	Rename NonScalarIntSafe to something more appropriate. llvm-svn: 143080	2011-10-26 23:50:43 +00:00
Duncan Sands	dce448c642	Simplify SplitVecRes_UnaryOp by removing all the code that is trying to legalize the operand types when only the result type is required to be legalized - the type legalization machinery will get round to the operands later if they need legalizing. There can be a point to legalizing operands in parallel with the result: when this saves compile time or results in better code. There was only one case in which this was true: when the operand is also split, so keep the logic for that bit. As a result of this change, additional operand legalization methods may need to be introduced to handle nodes where the result and operand types can differ, like SIGN_EXTEND, but the testsuite doesn't contain any tests where this is the case. In any case, it seems better to require such methods (and die with an assert if they doesn't exist) than to quietly produce wrong code if we forgot to special case the node in SplitVecRes_UnaryOp. llvm-svn: 143026	2011-10-26 14:11:18 +00:00
Jakob Stoklund Olesen	e8261a22f1	Don't use floating point to do an integer's job. This code makes different decisions when compiled into x87 instructions because of different rounding behavior. That caused phase 2/3 miscompares on 32-bit Linux when the phase 1 compiler was built with gcc (using x87), and the phase 2 compiler was built with clang (using SSE). This fixes PR11200. llvm-svn: 143006	2011-10-26 01:47:48 +00:00
Eli Friedman	3e9ef907e0	Remove a couple redundant checks. llvm-svn: 142959	2011-10-25 20:34:22 +00:00
Douglas Gregor	0cc574eee7	Really unbreak CMake build llvm-svn: 142822	2011-10-24 18:10:52 +00:00
Douglas Gregor	d0800fda95	Unbreak CMake build llvm-svn: 142821	2011-10-24 18:09:23 +00:00
Dan Gohman	e2ff95e327	Delete the top-down "Latency" scheduler. Top-down scheduling doesn't handle physreg dependencies, and upcoming codegen changes will require proper physreg dependence handling. llvm-svn: 142816	2011-10-24 18:01:06 +00:00
Dan Gohman	d78fc160cc	Delete the Latency scheduling preference. llvm-svn: 142815	2011-10-24 17:56:48 +00:00
Dan Gohman	4ed1afa51d	Change this overloaded use of Sched::Latency to be an overloaded use of Sched::ILP instead, as Sched::Latency is going away. llvm-svn: 142813	2011-10-24 17:55:11 +00:00
Dan Gohman	c32af340fc	Change the default scheduler from Latency to ILP, since Latency is going away. llvm-svn: 142810	2011-10-24 17:45:02 +00:00
Nadav Rotem	5e00bb5feb	Fix pr11194. When promoting and splitting integers we need to use ZExtPromotedInteger and SExtPromotedInteger based on the operation we legalize. SetCC return type needs to be legalized via PromoteTargetBoolean. llvm-svn: 142660	2011-10-21 17:35:19 +00:00
Nadav Rotem	d315157f12	1. Fix the widening of SETCC in WidenVecOp_SETCC. Use the correct return CC type. 2. Fix a typo in CONCAT_VECTORS which exposed the bug in #1. llvm-svn: 142648	2011-10-21 11:42:07 +00:00
Chandler Carruth	001153784a	Remove a now dead function, fixing -Wunused-function warnings from Clang. llvm-svn: 142631	2011-10-21 01:23:41 +00:00
Dan Gohman	90fb55237b	Delete the list-tdrr scheduler. Top-down schedulers are going away because they don't support physical register dependencies. llvm-svn: 142620	2011-10-20 21:44:34 +00:00
Chad Rosier	4236a63c3c	Revert r142579, "Fix a type in the legalization of CONCAT_VECTORS". This is causing one of the unit tests to infinitely loop, which resulted in the buildbots stalling. llvm-svn: 142604	2011-10-20 19:19:10 +00:00
Nadav Rotem	fe3969293d	Fix a type in the legalization of CONCAT_VECTORS. llvm-svn: 142579	2011-10-20 13:38:16 +00:00
Nadav Rotem	8824472a25	Improve code generation for vselect on SSE2: When checking the availability of instructions using the TLI, a 'promoted' instruction IS available. It means that the value is bitcasted to another type for which there is an operation. The correct check for the availablity of an instruction is to check if it should be expanded. llvm-svn: 142542	2011-10-19 20:43:16 +00:00
Nadav Rotem	6652e22bad	Add support for the vector-widening of vselect and vector-setcc llvm-svn: 142488	2011-10-19 09:45:11 +00:00
Nadav Rotem	75c2229f41	Fix a bug in the legalization of vector anyext-load and trunc-store. Mem Index starts with zero. llvm-svn: 142434	2011-10-18 22:32:43 +00:00
Bob Wilson	681561901d	Fix a DAG combiner assertion failure when constant folding BUILD_VECTORS. svn r139159 caused SelectionDAG::getConstant() to promote BUILD_VECTOR operands with illegal types, even before type legalization. For this testcase, that led to one BUILD_VECTOR with i16 operands and another with promoted i32 operands, which triggered the assertion. llvm-svn: 142370	2011-10-18 17:34:47 +00:00
Duncan Sands	d278d35b13	Fix a bunch of unused variable warnings when doing a release build with gcc-4.6. llvm-svn: 142350	2011-10-18 12:44:00 +00:00
Hal Finkel	bab66789d5	Fix comment to refer to correct instruction llvm-svn: 142334	2011-10-18 03:51:57 +00:00
Bill Wendling	63a4ea1859	Correct over-zealous removal of hack. Some code want to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	2a83a71c2a	Now that we have the ReturnsTwice function attribute, this method is obsolete. Check the attribute instead. <rdar://problem/8031714> llvm-svn: 142212	2011-10-17 18:22:52 +00:00
Chad Rosier	c17257c4cb	Removed set, but unused variable. Patch by Joe Abbey <jabbey@arxan.com>. llvm-svn: 142206	2011-10-17 18:01:59 +00:00
Nadav Rotem	486ff59a9f	Enable element promotion type legalization by deafault. Changed tests which assumed that vectors are legalized by widening them. llvm-svn: 142152	2011-10-16 20:31:33 +00:00
Benjamin Kramer	cc863b2bb6	Let printf do the formatting instead aligning strings ourselves. While at it, merge some format strings. llvm-svn: 142140	2011-10-16 16:30:34 +00:00
Nadav Rotem	ebe13bc3f1	Move the legalization of vector loads and stores into LegalizeVectorOps. In some cases we need the second type-legalization pass in order to support all cases. llvm-svn: 142060	2011-10-15 07:41:10 +00:00
Bill Wendling	2730a0099a	Clear out the landing pad to call site map for each function. This isn't put into the 'clear()' method because the information needs to stick around (at least for a little bit) after the selection DAG is built. llvm-svn: 142032	2011-10-15 01:00:26 +00:00
Jim Grosbach	400907cc41	Fix typo. "__sync_fetch_and-xor_4" should be "__sync_fetch_and_xor_4". Pointed out by George Russell. llvm-svn: 141956	2011-10-14 15:53:48 +00:00
Jakob Stoklund Olesen	24abd9d9b6	Encode register class constreaints in inline asm instructions. The inline asm operand constraint is initially encoded in the virtual register for the operand, but that register class may change during coalescing, and the original constraint is lost. Encode the original register class as part of the flag word for each inline asm operand. This makes it possible to recover the actual constraint required by inline asm, just like we can for normal instructions. llvm-svn: 141833	2011-10-12 23:37:29 +00:00
Eli Friedman	979009ea61	Use a utility from MathExtras to clarify a check and avoid undefined behavior. Based on patch by Ahmed Charles. llvm-svn: 141829	2011-10-12 22:46:45 +00:00
Dan Gohman	de239d2647	Fix a thinko that Nick noticed. The previous code actually worked as intended, but only by accident. llvm-svn: 141779	2011-10-12 15:56:56 +00:00
Jakob Stoklund Olesen	35163e21dc	Use an existing function. llvm-svn: 141763	2011-10-12 01:24:51 +00:00
Eric Christopher	57d1692750	Formatting. llvm-svn: 141728	2011-10-11 22:59:04 +00:00
Nadav Rotem	3283793c9a	Add support for legalization of vector SHL/SRA/SRL instructions llvm-svn: 141667	2011-10-11 14:36:35 +00:00
Nadav Rotem	198fe81571	Add support for legalization of vector trunc-store where the saved scalar type is illegal (for example, v2i16 on systems where the smallest store size is i32) llvm-svn: 141661	2011-10-11 11:25:16 +00:00
Nadav Rotem	b521b6037b	Cleanup the trunc-store legalization code and add asserts. llvm-svn: 141659	2011-10-11 10:04:25 +00:00
Bill Wendling	7ecfbd90ef	Thread the chain through the eh.sjlj.setjmp intrinsic, like it's documented to do. This will be useful later on with the new SJLJ stuff. llvm-svn: 141416	2011-10-07 21:25:38 +00:00
Eli Friedman	1456cd20b4	Remove the old atomic instrinsics. autoupgrade functionality is included with this patch. llvm-svn: 141333	2011-10-06 23:20:49 +00:00
Bill Wendling	267f323d28	Modify the mapping from landing pad to call sites to accept more than one call site. llvm-svn: 141226	2011-10-05 22:24:35 +00:00
Bill Wendling	e61c62533e	Small refactoring. Cache the FunctionInfo->MBB into a local variable. llvm-svn: 141221	2011-10-05 22:16:11 +00:00
Jakob Stoklund Olesen	f7957a9819	Simplify EXTRACT_SUBREG emission. EXTRACT_SUBREG is emitted as %dst = COPY %src:sub, so there is no need to constrain the %dst register class. RegisterCoalescer will apply the necessary constraints if it decides to eliminate the COPY. The %src register class does need to be constrained to something with the right sub-registers, though. This is currently done manually with COPY_TO_REGCLASS nodes. They can possibly be removed after this patch. llvm-svn: 141207	2011-10-05 20:26:40 +00:00
Jakob Stoklund Olesen	8ff52c4135	Simplify INSERT_SUBREG emission. The register class created by INSERT_SUBREG and SUBREG_TO_REG must be legal and support the SubIdx sub-registers. The new getSubClassWithSubReg() hook can compute that. This may create INSERT_SUBREG instructions defining a larger register class than the sub-register being inserted. That is OK, RegisterCoalescer will constrain the register class as needed when it eliminates the INSERT_SUBREG instructions. llvm-svn: 141198	2011-10-05 18:31:00 +00:00
Bill Wendling	3d11aa7e75	Create a mapping between the landing pad basic block and the call site index for later use. llvm-svn: 141125	2011-10-04 22:00:35 +00:00
Nadav Rotem	52e8ed9214	Moved type construction out of the loop and added an assert on the legality of the type. Formatted lines to the 80 char limit. llvm-svn: 140952	2011-10-01 18:39:28 +00:00
Bill Wendling	9925f197cc	When inferring the pointer alignment, if the global doesn't have an initializer and the alignment is 0 (i.e., it's defined globally in one file and declared in another file) it could get an alignment which is larger than the ABI allows for that type, resulting in aligned moves being used for unaligned loads. For instance, in file A.c: struct S s; In file B.c: struct { // something long }; extern S s; void foo() { struct S p = s; // ... } this copy is a 'memcpy' which is turned into a series of 'movaps' instructions on X86. But this is wrong, because 'struct S' has alignment of 4, not 16. llvm-svn: 140902	2011-09-30 23:19:55 +00:00
Nick Lewycky	f40df1d46c	Promote comment to doxycomment. Adjust whitespace. No functionality change. llvm-svn: 140899	2011-09-30 22:19:53 +00:00
Jakob Stoklund Olesen	1352be2bd3	Move getCommonSubClass() into TRI. It will soon need the context. llvm-svn: 140896	2011-09-30 22:18:51 +00:00
Eli Friedman	95031ed837	Clean up uses of switch instructions so they are not dependent on the operand ordering. Patch by Stepan Dyatkovskiy. llvm-svn: 140803	2011-09-29 20:21:17 +00:00
Eric Christopher	d299dccf91	Use the local we already set up. llvm-svn: 140745	2011-09-29 00:50:59 +00:00
Bill Wendling	baf3941fde	Strip off pointer casts when looking at the eh.sjlj.functioncontext's argument. llvm-svn: 140678	2011-09-28 03:52:41 +00:00
Bill Wendling	66b110f571	Create and use an llvm.eh.sjlj.functioncontext intrinsic. This intrinsic is used to pass the index of the function context to the back-end for further processing. The back-end is in charge of filling in the rest of the entries. llvm-svn: 140676	2011-09-28 03:36:43 +00:00
Jim Grosbach	af136f71ec	Rename AddSelectionDAGCSEId() to addSelectionDAGCSEId(). Naming conventions consistency. No functional change. llvm-svn: 140636	2011-09-27 20:59:33 +00:00
Nadav Rotem	38b3b83362	Cleanup PromoteIntOp_EXTRACT_VECTOR_ELT and PromoteIntRes_SETCC. Add a new method: getAnyExtOrTrunc and use it to replace the manual check. llvm-svn: 140603	2011-09-27 11:16:47 +00:00
Nadav Rotem	1b857d2762	Revert r140463; The patch assumes that <4 x i1> is saved to memory as 4 x i8, while the decision is to bit-pack small values. llvm-svn: 140601	2011-09-27 10:48:29 +00:00
Nadav Rotem	2279949129	[vector-select] Address one of the issues in pr10902. EXTRACT_VECTOR_ELEMENT SDNodes may return values which are wider than the incoming element types. In this patch we fix the integer promotion of these nodes. Fixes spill-q.ll when running -promote-elements. llvm-svn: 140471	2011-09-25 18:59:42 +00:00
Nadav Rotem	c2deabd202	Implement Duncan's suggestion to use the result of getSetCCResultType if it is legal (this is always the case for scalars), otherwise use the promoted result type. Fix test/CodeGen/X86/vsplit-and.ll when promote-elements is enabled. llvm-svn: 140464	2011-09-24 19:48:19 +00:00
Nadav Rotem	77426a754b	[Vector-Select] Address one of the problems in 10902. When generating the trunc-store of i1's, we need to use the vector type and not the scalar type. This patch fixes the assertion in CodeGen/Generic/bool-vector.ll when running with -promote-elements. llvm-svn: 140463	2011-09-24 18:32:19 +00:00
Duncan Sands	b461176cfb	Tweak the handling of MERGE_VALUES nodes: remove the need for DecomposeMERGE_VALUES to "know" that results are legalized in a particular order, by passing it the number of the result being legalized (the type legalization core provides this, it just needs to be passed on). llvm-svn: 140373	2011-09-23 13:59:22 +00:00
Nadav Rotem	57e30726ad	Vector-Select: Address one of the problems in pr10902. Add handling for the integer-promotion of CONCAT_VECTORS. Test: test/CodeGen/X86/widen_shuffle-1.ll This patch fixes the above tests (when running in with -promote-elements). llvm-svn: 140372	2011-09-23 09:33:24 +00:00
Dan Gohman	e83e1b2d2c	Fix SimplifySelectCC to add newly created nodes to the DAGCombiner worklist, as it may be possible to perform further optimization on them. llvm-svn: 140349	2011-09-22 23:01:29 +00:00
Jakob Stoklund Olesen	e92e5ee81f	Constrain register classes instead of emitting copies. Sometimes register class constraints are trivial, like GR32->GR32_NOSP, or GPR->rGPR. Teach InstrEmitter to simply constrain the virtual register instead of emitting a copy in these cases. Normally, these copies are handled by the coalescer. This saves some coalescer work. llvm-svn: 140340	2011-09-22 21:39:34 +00:00
Nadav Rotem	bc9ba30158	[VECTOR-SELECT] Address one of the bugs in pr10902. Vector SetCC result types need to be type-legalized. This code worked before because scalar result types are known to be legal. llvm-svn: 140249	2011-09-21 14:34:38 +00:00
Andrew Trick	924123acb3	Lower ARM adds/subs to add/sub after adding optional CPSR operand. This is still a hack until we can teach tblgen to generate the optional CPSR operand rather than an implicit CPSR def. But the strangeness is now limited to the selection DAG. ADD/SUB MI's no longer have implicit CPSR defs, nor do we allow flag setting variants of these opcodes in machine code. There are several corner cases to consider, and getting one wrong would previously lead to nasty miscompilation. It's not the first time I've debugged one, so this time I added enough verification to ensure it won't happen again. llvm-svn: 140228	2011-09-21 02:20:46 +00:00
Bruno Cardoso Lopes	6cb23f6e7f	Add a DAGCombine for subvector extracts to remove useless chains of subvector inserts and extracts. Initial patch by Rackover, Zvi with some tweak done by me. llvm-svn: 140204	2011-09-20 23:19:33 +00:00
Andrew Trick	52363bdbeb	Restore hasPostISelHook tblgen flag. No functionality change. The hook makes it explicit which patterns require "special" handling. i.e. it self-documents tblgen deficiencies. I plan to add verification in ExpandISelPseudos and Thumb2SizeReduce to catch any missing hasPostISelHooks. Otherwise it's too fragile. llvm-svn: 140160	2011-09-20 18:22:31 +00:00
Andrew Trick	8586e62d91	ARM isel bug fix for adds/subs operands. Modified ARMISelLowering::AdjustInstrPostInstrSelection to handle the full gamut of CPSR defs/uses including instructins whose "optional" cc_out operand is not really optional. This allowed removal of the hasPostISelHook to simplify the .td files and make the implementation more robust. Fixes rdar://10137436: sqlite3 miscompile llvm-svn: 140134	2011-09-20 03:17:40 +00:00
Andrew Trick	53df4b6dfa	whitespace llvm-svn: 140133	2011-09-20 03:06:13 +00:00
Nadav Rotem	7aaa0aa7a7	white space cleanups llvm-svn: 139994	2011-09-18 10:29:29 +00:00
Eli Friedman	ee8f14a799	Some legalization fixes for atomic load and store. llvm-svn: 139851	2011-09-15 21:20:49 +00:00
Nadav Rotem	d748dbacb0	Add integer promotion support for vselect llvm-svn: 139692	2011-09-14 14:42:15 +00:00
Eli Friedman	f78c6a83ee	Fix check for unaligned load/store so it doesn't catch over-aligned load/store. llvm-svn: 139649	2011-09-13 22:19:59 +00:00
Eli Friedman	f1518216fd	Error out on CodeGen of unaligned load/store. Fix test so it isn't accidentally testing that case. llvm-svn: 139641	2011-09-13 20:50:54 +00:00
Nadav Rotem	66dc9ae08d	Fix the assertion which checks the size of the input operand. llvm-svn: 139633	2011-09-13 20:03:38 +00:00
Nadav Rotem	52202fbf2d	Add vselect target support for targets that do not support blend but do support xor/and/or (For example SSE2). llvm-svn: 139623	2011-09-13 19:17:42 +00:00
Chris Lattner	e74e0c8020	tidy up a bit llvm-svn: 139419	2011-09-09 22:06:59 +00:00
Eli Friedman	b7910b79f5	Make the SelectionDAG verify that all the operands of BUILD_VECTOR have the same type. Teach DAGCombiner::visitINSERT_VECTOR_ELT not to make invalid BUILD_VECTORs. Fixes PR10897. llvm-svn: 139407	2011-09-09 21:04:06 +00:00
Devang Patel	9d904e1a97	Directly point debug info to the stack slot of the arugment, instead of trying to keep track of vreg in which it the arugment is copied. The LiveDebugVariable can keep track of variable's ranges. llvm-svn: 139330	2011-09-08 22:59:09 +00:00
Eli Friedman	e978d2f644	Relax the MemOperands on atomics a bit. Fixes -verify-machineinstrs failures for atomic laod/store on ARM. (The fix for the related failures on x86 is going to be nastier because we actually need Acquire memoperands attached to the atomic load instrs, etc.) llvm-svn: 139221	2011-09-07 02:23:42 +00:00
Duncan Sands	f2641e1bc1	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. llvm-svn: 139159	2011-09-06 19:07:46 +00:00
Duncan Sands	a098436b32	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. llvm-svn: 139140	2011-09-06 13:37:06 +00:00
Owen Anderson	40d756eacc	Fix a truly heinous bug in DAGCombine related to AssertZext. If we have a chain of zext -> assert_zext -> zext -> use, the first zext would get simplified away because of the later zext, and then the later zext would get simplified away because of the assert. The solution is to teach SimplifyDemandedBits that assert_zext demands all of the high bits of its input, rather than only those demanded by its users. No testcase because the only example I have manifests as llvm-gcc miscompiling LLVM, and I haven't found a smaller case that reproduces this problem. Fixes <rdar://problem/10063365>. llvm-svn: 139059	2011-09-03 00:26:49 +00:00
Dan Gohman	3767be9aee	Revert r131152, r129796, r129761. This code is currently considered to be unreliable on platforms which require memcpy calls, and it is complicating broader legalize cleanups. It is hoped that these cleanups will make memcpy byval easier to implement in the future. llvm-svn: 138977	2011-09-01 23:07:08 +00:00
Andrew Trick	832a6a1909	PreRA scheduler should avoid cloning compares. Added canClobberReachingPhysRegUse() to handle a particular pattern in which a two-address instruction could be forced to interfere with EFLAGS, causing a compare to be unnecessarilly cloned. Fixes rdar://problem/5875261 llvm-svn: 138924	2011-09-01 00:54:31 +00:00
Eli Friedman	ae1acddb95	Misc cleanup; addresses Duncan's comments on r138877. llvm-svn: 138887	2011-08-31 20:13:26 +00:00
Eli Friedman	e839ecb70b	Fill in type legalization for MERGE_VALUES in all the various cases. Patch by Micah Villmow. (No testcase because the issue only showed up in an out-of-tree backend.) llvm-svn: 138877	2011-08-31 18:36:04 +00:00
Eli Friedman	7c3bdede25	Generic expansion for atomic load/store into cmpxchg/atomicrmw xchg; implements 64-bit atomic load/store for ARM. llvm-svn: 138872	2011-08-31 18:26:09 +00:00
Evan Cheng	e6fba77971	Follow up to r138791. Add a instruction flag: hasPostISelHook which tells the pre-RA scheduler to call a target hook to adjust the instruction. For ARM, this is used to adjust instructions which may be setting the 's' flag. ADC, SBC, RSB, and RSC instructions have implicit def of CPSR (required since it now uses CPSR physical register dependency rather than "glue"). If the carry flag is used, then the target hook will fill in the optional operand with CPSR. Otherwise, the hook will remove the CPSR implicit def from the MachineInstr. llvm-svn: 138810	2011-08-30 19:09:48 +00:00
Eli Friedman	452aae6202	Atomic load/store on ARM/Thumb. I don't really like the patterns, but I'm having trouble coming up with a better way to handle them. I plan on making other targets use the same legalization ARM-without-memory-barriers is using... it's not especially efficient, but if anyone cares, it's not that hard to fix for a given target if there's some better lowering. llvm-svn: 138621	2011-08-26 02:59:24 +00:00
Eli Friedman	342e8df0e0	Basic x86 code generation for atomic load and store instructions. llvm-svn: 138478	2011-08-24 20:50:09 +00:00
Bill Wendling	4eb0433672	A landingpad instruction is neither folded nor dead. llvm-svn: 138387	2011-08-23 21:33:05 +00:00
Evan Cheng	6b477b985b	Fix 80 col violations. llvm-svn: 138356	2011-08-23 19:17:21 +00:00
Nick Lewycky	97f73cb449	Be less redundant. llvm-svn: 138252	2011-08-22 18:26:12 +00:00
Benjamin Kramer	68ed46ce9a	Roll back the rest of r126557. It's a hack that will break in some obscure cases. llvm-svn: 138130	2011-08-19 22:39:31 +00:00
Nick Lewycky	c1348074ec	Eli points out that this is what report_fatal_error() is for. llvm-svn: 138091	2011-08-19 21:45:19 +00:00
Nick Lewycky	3f73184d90	This is not actually unreachable, so don't use llvm_unreachable for it. Since the intent seems to be to terminate even in Release builds, just use abort() directly. If program flow ever reaches a __builtin_unreachable (which llvm_unreachable is #define'd to on newer GCCs) then the program is undefined. llvm-svn: 138068	2011-08-19 20:14:27 +00:00
Ivan Krasin	d7cbd4c518	FastISel: avoid function calls between the materialization of the constant and its use. llvm-svn: 137993	2011-08-18 22:06:10 +00:00
Bill Wendling	247fd3bf59	Add the support in code-gen for the landingpad instruction lowering. The landingpad instruction is lowered into the EXCEPTIONADDR and EHSELECTION SDNodes. The information from the landingpad instruction is harvested by the 'AddLandingPadInfo' function. The new EH uses the current EH scheme in the back-end. This will change once we switch over to the new scheme. (Reviewed by Jakob!) llvm-svn: 137880	2011-08-17 21:56:44 +00:00
Bill Wendling	a408e5bf31	Revert patch. Forgot a dependent commit. llvm-svn: 137875	2011-08-17 21:28:05 +00:00
Bill Wendling	2a521948f0	Add the body of 'visitLandingPad'. This generates the SDNodes for the new exception handling scheme. It takes the two values coming from the landingpad instruction and assigns them to the EXCEPTIONADDR and EHSELECTION nodes. llvm-svn: 137873	2011-08-17 21:25:14 +00:00
Nadav Rotem	b66b866f46	Revert r137562 because it caused PR10674 llvm-svn: 137719	2011-08-16 14:34:29 +00:00
Nadav Rotem	6858b344ed	Fix PR 10635. When generating integer constants, the constant element type may be illegal, even if the requested vector type is legal. Testcase is one of the disabled ARM tests in the vector-select patch. llvm-svn: 137562	2011-08-13 20:31:45 +00:00
Bill Wendling	fae1475823	Initial commit of the 'landingpad' instruction. This implements the 'landingpad' instruction. It's used to indicate that a basic block is a landing pad. There are several restrictions on its use (see LangRef.html for more detail). These restrictions allow the exception handling code to gather the information it needs in a much more sane way. This patch has the definition, implementation, C interface, parsing, and bitcode support in it. llvm-svn: 137501	2011-08-12 20:24:12 +00:00
Nadav Rotem	62da15a330	Revert r137310 because it does not optimize any code on ToT llvm-svn: 137466	2011-08-12 17:15:04 +00:00
Duncan Sands	a41634e307	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. llvm-svn: 137460	2011-08-12 14:54:45 +00:00
Nadav Rotem	61140e1028	[AVX] When joining two XMM registers into a YMM register, make sure that the lower XMM register gets in first. This will allow the SUBREG pattern to elliminate the first vector insertion. llvm-svn: 137310	2011-08-11 16:49:36 +00:00
Chris Lattner	96710b4308	fix PR10605 / rdar://9930964 by adding a pretty scary missed check. It's somewhat surprising anything works without this. Before we would compile the testcase into: test: # @test movl $4, 8(%rdi) movl 8(%rdi), %eax orl %esi, %eax cmpl $32, %edx movl %eax, -4(%rsp) # 4-byte Spill je .LBB0_2 now we produce: test: # @test movl 8(%rdi), %eax movl $4, 8(%rdi) orl %esi, %eax cmpl $32, %edx movl %eax, -4(%rsp) # 4-byte Spill je .LBB0_2 llvm-svn: 137303	2011-08-11 06:26:54 +00:00
Devang Patel	aab841cf63	Do not drop undef debug values. These are used as range termination marker by live debug variable pass. llvm-svn: 136834	2011-08-03 23:13:55 +00:00
Eli Friedman	30a49e93e3	New approach to r136737: insert the necessary fences for atomic ops in platform-independent code, since a bunch of platforms (ARM, Mips, PPC, Alpha are the relevant targets here) need to do essentially the same thing. I think this completes the basic CodeGen for atomicrmw and cmpxchg. llvm-svn: 136813	2011-08-03 21:06:02 +00:00
Eli Friedman	04c5025cd5	Don't create a ridiculous EXTRACT_ELEMENT. PR10563. The testcase looks extremely fragile, so I'm adding an assertion which should catch any cases like this. llvm-svn: 136711	2011-08-02 18:38:35 +00:00
Bill Wendling	f891bf8b30	Add the 'resume' instruction for the new EH rewrite. This adds the 'resume' instruction class, IR parsing, and bitcode reading and writing. The 'resume' instruction resumes propagation of an existing (in-flight) exception whose unwinding was interrupted with a 'landingpad' instruction (to be added later). llvm-svn: 136589	2011-07-31 06:30:59 +00:00
Bill Wendling	ad088e6724	Revert r136253, r136263, r136269, r136313, r136325, r136326, r136329, r136338, r136339, r136341, r136369, r136387, r136392, r136396, r136429, r136430, r136444, r136445, r136446, r136253 pending review. llvm-svn: 136556	2011-07-30 05:42:50 +00:00
Jakub Staszak	0480a8fbbb	Do not lose branch weights when lowering SwitchInst. llvm-svn: 136529	2011-07-29 22:25:21 +00:00
Jakub Staszak	539db98987	Remove unneeded const_cast. llvm-svn: 136506	2011-07-29 20:05:36 +00:00
Eli Friedman	adec587d5c	Misc optimizer+codegen work for 'cmpxchg' and 'atomicrmw'. They appear to be working on x86 (at least for trivial testcases); other architectures will need more work so that they actually emit the appropriate instructions for orderings stricter than 'monotonic'. (As far as I can tell, the ARM, PPC, Mips, and Alpha backends need such changes.) llvm-svn: 136457	2011-07-29 03:05:32 +00:00
Bill Wendling	7eadbeaf62	Use the pointer type size. With this, we can now compile a simple EH program. llvm-svn: 136446	2011-07-29 01:15:29 +00:00
Bill Wendling	6a8cac735a	And now something that compiles... llvm-svn: 136445	2011-07-29 01:11:33 +00:00
Bill Wendling	4b0a365beb	Make sure to sext or trunc the result from the register. llvm-svn: 136444	2011-07-29 01:11:14 +00:00
Chandler Carruth	9d7feab3e0	Rewrite the CMake build to use explicit dependencies between libraries, specified in the same file that the library itself is created. This is more idiomatic for CMake builds, and also allows us to correctly specify dependencies that are missed due to bugs in the GenLibDeps perl script, or change from compiler to compiler. On Linux, this returns CMake to a place where it can relably rebuild several targets of LLVM. I have tried not to change the dependencies from the ones in the current auto-generated file. The only places I've really diverged are in places where I was seeing link failures, and added a dependency. The goal of this patch is not to start changing the dependencies, merely to move them into the correct location, and an explicit form that we can control and change when necessary. This also removes a serialization point in the build because we don't have to scan all the libraries before we begin building various tools. We no longer have a step of the build that regenerates a file inside the source tree. A few other associated cleanups fall out of this. This isn't really finished yet though. After talking to dgregor he urged switching to a single CMake macro to construct libraries with both sources and dependencies in the arguments. Migrating from the two macros to that style will be a follow-up patch. Also, llvm-config is still generated with GenLibDeps.pl, which means it still has slightly buggy dependencies. The internal CMake 'llvm-config-like' macro uses the correct explicitly specified dependencies however. A future patch will switch llvm-config generation (when using CMake) to be based on these deps as well. This may well break Windows. I'm getting a machine set up now to dig into any failures there. If anyone can chime in with problems they see or ideas of how to solve them for Windows, much appreciated. llvm-svn: 136433	2011-07-29 00:14:25 +00:00
Bill Wendling	3cc87682e1	Visit the landingpad instruction. This generates the correct SDNodes for the landingpad instruction. It makes an assumption that the result of the landingpad instruction has at least two values. And that the first value is a pointer to the exception object and the second value is the "selector." llvm-svn: 136430	2011-07-28 23:44:58 +00:00
Bill Wendling	7fa7fe6b58	Add the AddLandingPadInfo function. AddLandingPadInfo takes a landingpad instruction and grabs all of the information from it that it needs for EH table generation. llvm-svn: 136429	2011-07-28 23:42:57 +00:00
Eli Friedman	c9a551ebed	LangRef and basic memory-representation/reading/writing for 'cmpxchg' and 'atomicrmw' instructions, which allow representing all the current atomic rmw intrinsics. The allowed operands for these instructions are heavily restricted at the moment; we can probably loosen it a bit, but supporting general first-class types (where it makes sense) might get a bit complicated, given how SelectionDAG works. As an initial cut, these operations do not support specifying an alignment, but it would be possible to add if we think it's useful. Specifying an alignment lower than the natural alignment would be essentially impossible to support on anything other than x86, but specifying a greater alignment would be possible. I can't think of any useful optimizations which would use that information, but maybe someone else has ideas. Optimizer/codegen support coming soon. llvm-svn: 136404	2011-07-28 21:48:00 +00:00
Bill Wendling	4f027233d2	The personality function should be a Function* and not just a Value*. llvm-svn: 136392	2011-07-28 21:14:13 +00:00
Nadav Rotem	9708aef2dc	CR fix: The ANY_EXTEND can be removed because the input and putput type must be identical. llvm-svn: 136355	2011-07-28 14:38:46 +00:00
Eli Friedman	26a484852e	Code generation for 'fence' instruction. llvm-svn: 136283	2011-07-27 22:21:52 +00:00
Bill Wendling	6c923bb8d9	Merge the contents from exception-handling-rewrite to the mainline. This adds the new instructions 'landingpad' and 'resume'. llvm-svn: 136253	2011-07-27 20:18:04 +00:00
Jeffrey Yasskin	6381c0100b	Explicitly cast narrowing conversions inside {}s that will become errors in C++0x. llvm-svn: 136211	2011-07-27 06:22:51 +00:00
Dan Gohman	456b1edd0d	Revert r136156, which broke several buildbots. llvm-svn: 136206	2011-07-27 01:10:27 +00:00
Dan Gohman	9eb62cd159	Delete unnecessarily cautious LastCALLSEQ code. llvm-svn: 136156	2011-07-26 22:00:59 +00:00
Eli Friedman	06b8b571b2	Add obvious missing case to switch. PR10497. llvm-svn: 136130	2011-07-26 20:38:49 +00:00
Eli Friedman	fee02c6c13	Initial implementation of 'fence' instruction, the new C++0x-style replacement for llvm.memory.barrier. This is just a LangRef entry and reading/writing/memory representation; optimizer+codegen support coming soon. llvm-svn: 136009	2011-07-25 23:16:38 +00:00
Eli Friedman	cbd3ba91b7	Make sure this DAGCombine actually returns an UNDEF of the correct type; PR10476. llvm-svn: 135993	2011-07-25 22:25:42 +00:00
Eli Friedman	6ed783228d	PR10421: Fix a straightforward bug in the widening logic for CONCAT_VECTORS. llvm-svn: 135595	2011-07-20 18:14:33 +00:00
Devang Patel	9ab3cac694	Revert r135423. llvm-svn: 135454	2011-07-19 00:28:24 +00:00
Jeffrey Yasskin	7a16288157	Add APInt(numBits, ArrayRef<uint64_t> bigVal) constructor to prevent future ambiguity errors like the one corrected by r135261. Migrate all LLVM callers of the old constructor to the new one. llvm-svn: 135431	2011-07-18 21:45:40 +00:00
Devang Patel	4dc76f2438	During bottom up fast-isel, instructions emitted to materalize registers are at top of basic block and do not have debug location. This may misguide debugger while entering the basic block and sometimes debugger provides semi useful view of current location to developer by picking up previous known location as current location. Assign a sensible location to the first instruction in a basic block, if it does not have one location derived from source file, so that debugger can provide meaningful user experience to developers in edge cases. [take 2] llvm-svn: 135423	2011-07-18 20:55:23 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Nadav Rotem	76d51c6c89	Minor code cleanups llvm-svn: 135362	2011-07-17 19:05:00 +00:00
Dan Gohman	945864d6dc	LegalizeDAG doesn't need its own copy of this enum. llvm-svn: 135320	2011-07-15 22:51:43 +00:00
Dan Gohman	e49e74261a	Delete LegalizeDAG's own version of isTypeLegal and getTypeAction and just use the ones from TargetLowering directly. llvm-svn: 135318	2011-07-15 22:39:09 +00:00
Dan Gohman	8c5ca645ce	Delete an unused variable and a redundant assert. llvm-svn: 135311	2011-07-15 22:19:02 +00:00
Dan Gohman	ad94608b1f	Modernize comments. llvm-svn: 135305	2011-07-15 21:42:20 +00:00
Eric Christopher	92464be28c	Check register class matching instead of width of type matching when determining validity of matching constraint. Allow i1 types access to the GR8 reg class for x86. Fixes PR10352 and rdar://9777108 llvm-svn: 135180	2011-07-14 20:13:52 +00:00
Nadav Rotem	771f29677f	[VECTOR-SELECT] During type legalization we often use the SIGN_EXTEND_INREG SDNode. When this SDNode is legalized during the LegalizeVector phase, it is scalarized because non-simple types are automatically marked to be expanded. In this patch we add support for lowering SIGN_EXTEND_INREG manually. This fixes CodeGen/X86/vec_sext.ll when running with the '-promote-elements' flag. llvm-svn: 135144	2011-07-14 11:11:14 +00:00
Nadav Rotem	db213c0400	Add assertion for the chain value type llvm-svn: 135143	2011-07-14 10:37:54 +00:00
Benjamin Kramer	15cd5a3f12	Don't emit a bit test if there is only one case the test can yield false. A simple SETNE is sufficient. llvm-svn: 135126	2011-07-14 01:38:42 +00:00
Eric Christopher	d6300d2956	Add a dag combine pattern for folding C2-(A+C1) -> (C2-C1)-A Fixes rdar://9761830 llvm-svn: 135123	2011-07-14 01:12:15 +00:00
Jay Foad	57aa636794	Convert InsertValueInst and ExtractValueInst APIs to use ArrayRef. llvm-svn: 135040	2011-07-13 10:26:04 +00:00
Cameron Zwarich	f03fa189ca	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Benjamin Kramer	2bb8b26aa8	Apparently we can't expect a BinaryOperator here. Should fix llvm-gcc selfhost. llvm-svn: 134699	2011-07-08 12:08:24 +00:00
Benjamin Kramer	9960a25006	Emit a more efficient magic number multiplication for exact sdivs. We have to do this in DAGBuilder instead of DAGCombiner, because the exact bit is lost after building. struct foo { char x[24]; }; long bar(struct foo a, struct foo b) { return a-b; } is now compiled into movl 4(%esp), %eax subl 8(%esp), %eax sarl $3, %eax imull $-1431655765, %eax, %eax instead of movl 4(%esp), %eax subl 8(%esp), %eax movl $715827883, %ecx imull %ecx movl %edx, %eax shrl $31, %eax sarl $2, %edx addl %eax, %edx movl %edx, %eax llvm-svn: 134695	2011-07-08 10:31:30 +00:00
Eric Christopher	6a6d8fc7fd	Remove a FIXME. All of the standard ones are in the list. llvm-svn: 134647	2011-07-07 22:29:03 +00:00
Lang Hames	5a00499e87	Add functions 'hasPredecessor' and 'hasPredecessorHelper' to SDNode. The hasPredecessorHelper function allows predecessors to be cached to speed up repeated invocations. This fixes PR10186. X.isPredecessorOf(Y) now just calls Y.hasPredecessor(X) Y.hasPredecessor(X) calls Y.hasPredecessorHelper(X, Visited, Worklist) with empty Visited and Worklist sets (i.e. no caching over invocations). Y.hasPredecessorHelper(X, Visited, Worklist) caches search state in Visited and Worklist to speed up repeated calls. The Visited set is searched for X before going to the worklist to further search the DAG if necessary. llvm-svn: 134592	2011-07-07 04:31:51 +00:00
Eric Christopher	ea336c797c	Grammar and 80-col. llvm-svn: 134555	2011-07-06 22:41:18 +00:00
Jakub Staszak	3f158fdf6e	Introduce "expect" intrinsic instructions. llvm-svn: 134516	2011-07-06 18:22:43 +00:00
Evan Cheng	0d639a28aa	Rename TargetSubtarget to TargetSubtargetInfo for consistency. llvm-svn: 134259	2011-07-01 21:01:15 +00:00
Eric Christopher	f81292ba3b	Remove getRegClassForInlineAsmConstraint and all dependencies. Fixes rdar://9643582 llvm-svn: 134123	2011-06-30 01:20:03 +00:00
Devang Patel	0eada03216	Revert r133953 for now. llvm-svn: 134116	2011-06-29 23:50:13 +00:00
Benjamin Kramer	8665f8d916	Revert a part of r126557 which could create unschedulable DAGs. llvm-svn: 134067	2011-06-29 13:47:25 +00:00
Evan Cheng	8264e272a9	Sink SubtargetFeature and TargetInstrItineraries (renamed MCInstrItineraries) into MC. llvm-svn: 134049	2011-06-29 01:14:12 +00:00
Evan Cheng	6cc775f905	- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and sink them into MC layer. - Added MCInstrInfo, which captures the tablegen generated static data. Chang TargetInstrInfo so it's based off MCInstrInfo. llvm-svn: 134021	2011-06-28 19:10:37 +00:00
Devang Patel	4dc034df1d	During bottom up fast-isel, instructions emitted to materalize registers are at top of basic block and do not have debug location. This may misguide debugger while entering the basic block and sometimes debugger provides semi useful view of current location to developer by picking up previous known location as current location. Assign a sensible location to the first instruction in a basic block, if it does not have one location derived from source file, so that debugger can provide meaningful user experience to developers in edge cases. llvm-svn: 133953	2011-06-27 22:32:04 +00:00
Evan Cheng	8d71a75777	More refactoring. Move getRegClass from TargetOperandInfo to TargetInstrInfo. llvm-svn: 133944	2011-06-27 21:26:13 +00:00
Owen Anderson	b0a5a1ee29	The index stored in the RegDefIter is one after the current index. When getting the index, decrement it so that it points to the current element. Fixes an off-by-one bug encountered when trying to make use of MVT::untyped. llvm-svn: 133923	2011-06-27 18:34:12 +00:00
Andrew Trick	31f25bc66f	pre-RA-sched: Cleanup register pressure tracking. Removed the check that peeks past EXTRA_SUBREG, which I don't think makes sense any more. Intead treat it as a normal register def. No significant affect on x86 or ARM benchmarks. llvm-svn: 133917	2011-06-27 18:01:20 +00:00
Jakob Stoklund Olesen	537a302d1a	Distinguish early clobber output operands from clobbered registers. Both become <earlyclobber> defs on the INLINEASM MachineInstr, but we now use two different asm operand kinds. The new Kind_Clobber is treated identically to the old Kind_RegDefEarlyClobber for now, but x87 floating point stack inline assembly does care about the difference. This will pop a register off the stack: asm("fstp %st" : : "t"(x) : "st"); While this will pop the input and push an output: asm("fst %st" : "=&t"(r) : "t"(x)); We need to know if ST0 was a clobber or an output operand, and we can't depend on <dead> flags for that. llvm-svn: 133902	2011-06-27 04:08:33 +00:00
Owen Anderson	99adfec0b1	The scheduler needs to be aware on the existence of untyped nodes when it performs type propagation for EXTRACT_SUBREG. llvm-svn: 133838	2011-06-24 23:02:22 +00:00
Devang Patel	f071d72c44	Handle debug info for i128 constants. llvm-svn: 133821	2011-06-24 20:46:11 +00:00
Jay Foad	83be361b8a	Replace the existing forms of ConstantArray::get() with a single form that takes an ArrayRef. llvm-svn: 133615	2011-06-22 09:24:39 +00:00
Owen Anderson	d1955e78b4	Fix some trailing issues from my introduction of MVT::untyped and its use for REGISTER_SEQUENCE. llvm-svn: 133567	2011-06-21 22:54:23 +00:00
Evan Cheng	4c0bd9629d	Teach dag combine to match halfword byteswap patterns. 1. (((x) & 0xFF00) >> 8) \| (((x) & 0x00FF) << 8) => (bswap x) >> 16 2. ((x&0xff)<<8)\|((x&0xff00)>>8)\|((x&0xff000000)>>8)\|((x&0x00ff0000)<<8)) => (rotl (bswap x) 16) This allows us to eliminate most of the def : Pat patterns for ARM rev16 revsh instructions. It catches many more cases for ARM and x86. rdar://9609108 llvm-svn: 133503	2011-06-21 06:01:08 +00:00
Nadav Rotem	d34ce4344b	Fix PromoteIntRes_TRUNCATE: Add support for cases where the source vector type is to be split while the target vector is to be promoted. (eg: <4 x i64> -> <4 x i8> ) llvm-svn: 133424	2011-06-20 07:15:58 +00:00
Nadav Rotem	94d67a02e0	Code cleanups: Remove duplicated logic in PromotInteRes_BITCAST, reserve vector space, reuse types. llvm-svn: 133389	2011-06-19 10:49:57 +00:00
Nadav Rotem	35d600d9f4	Calls to AssertZext and getZeroExtendInReg must be made using scalar types. llvm-svn: 133388	2011-06-19 10:22:39 +00:00
Nadav Rotem	36896bfd0c	When promoting the vector elements in CopyToParts, use vector trunc instead of scalarizing, and doing an element-by-element truncat. llvm-svn: 133382	2011-06-19 08:49:38 +00:00
Benjamin Kramer	e1fc29b6ac	Don't allocate empty read-only SmallVectors during SelectionDAG deallocation. llvm-svn: 133348	2011-06-18 13:13:44 +00:00
Benjamin Kramer	25e17b0f89	Remove unused but set variables. llvm-svn: 133347	2011-06-18 11:09:41 +00:00
Eric Christopher	e4a1266a9a	Fix UMULO support for 2x register width to allow the full range without a libcall to a new mulo<mode> libcall that we'd have to create. Finishes the rest of rdar://9090077 and rdar://9210061 llvm-svn: 133318	2011-06-18 00:09:57 +00:00
Eric Christopher	232431c389	Fix comment. llvm-svn: 133307	2011-06-17 22:35:59 +00:00
Eric Christopher	5bbb2bdb46	Lower multiply with overflow checking to __mulo<mode> calls if we haven't been able to lower them any other way. Fixes rdar://9090077 and rdar://9210061 llvm-svn: 133288	2011-06-17 20:41:29 +00:00
Jakob Stoklund Olesen	c826df9506	Don't use register classes larger than TLI->getRegClassFor(VT). In Thumb mode we cannot handle GPR virtual registers, even though some instructions can. When isel is lowering a CopyFromReg, it should limit itself to subclasses of getRegClassFor(VT). <rdar://problem/9624323> llvm-svn: 133210	2011-06-16 22:50:38 +00:00
Jakub Staszak	12a43bdde5	Introduce MachineBranchProbabilityInfo class, which has similar API to BranchProbabilityInfo (expect setEdgeWeight which is not available here). Branch Weights are kept in MachineBasicBlocks. To turn off this analysis set -use-mbpi=false. llvm-svn: 133184	2011-06-16 20:22:37 +00:00
Owen Anderson	5fc8b77f83	Change the REG_SEQUENCE SDNode to take an explict register class ID as its first operand. This operand is lowered away by the time we reach MachineInstrs, so the actual register-allocation handling of them doesn't need to change. This is intended to support using REG_SEQUENCE SDNode's with type MVT::untyped, and is part of the long road to eliminating some of the hacks we currently use to support register pairs and other strange constraints, particularly on ARM NEON. llvm-svn: 133178	2011-06-16 18:17:13 +00:00
Jakob Stoklund Olesen	1f641d577e	Add TargetRegisterInfo::getRawAllocationOrder(). This virtual function will replace allocation_order_begin/end as the one to override when implementing custom allocation orders. It is simpler to have one function return an ArrayRef than having two virtual functions computing different ends of the same array. Use getRawAllocationOrder() in place of allocation_order_begin() where it makes sense, but leave some clients that look like they really want the filtered allocation orders from RegisterClassInfo. llvm-svn: 133170	2011-06-16 17:42:25 +00:00
Nick Lewycky	6d677cfdd8	Add a DAGCombine for (ext (binop (load x), cst)). llvm-svn: 133124	2011-06-16 01:15:49 +00:00
Owen Anderson	96adc4a540	Add a new MVT::untyped. This will be used in future work for modelling ISA features like register pairs and lists with "interesting" constraints (such as ARM NEON contiguous register lists or even-odd paired registers). We need to be able to generate these instructions (often from intrinsics), but don't want to have to assign a legal type to them. Instead, we'll use an "untyped" edge to bypass the type-checking and simply ensure that the register classes match. llvm-svn: 133106	2011-06-15 23:35:18 +00:00
Andrew Trick	3013b6ae4a	Added -stress-sched flag in the Asserts build. Added a test case for handling physreg aliases during pre-RA-sched. llvm-svn: 133063	2011-06-15 17:16:12 +00:00
Nadav Rotem	13cb7736a7	getZeroExtendInReg needs to get a scalar type llvm-svn: 133057	2011-06-15 14:37:18 +00:00
Nadav Rotem	d2d9bdb2b0	Enable the simplification of truncating-store after fixing the usage of GetDemandBits (which must operate on the vector element type). Fix the a usage of getZeroExtendInReg which must also be done on scalar types. llvm-svn: 133052	2011-06-15 11:19:12 +00:00
Chad Rosier	818e116723	When pattern matching during instruction selection make sure shl x,1 is not converted to add x,x if x is a undef. add undef, undef does not guarantee that the resulting low order bit is zero. Fixes <rdar://problem/9453156> and <rdar://problem/9487392>. llvm-svn: 133022	2011-06-14 22:29:10 +00:00
Nadav Rotem	10193c830b	Add a testcase for checking the integer-promotion of many different vector types (with power of two types such as 8,16,32 .. 512). Fix a bug in the integer promotion of bitcast nodes. Enable integer expanding only if the target of the conversion is an integer (when the type action is scalarize). Add handling to the legalization of vector load/store in cases where the saved vector is integer-promoted. llvm-svn: 132985	2011-06-14 08:11:52 +00:00
Nadav Rotem	571ae19af7	Disable trunc-store simplification on vectors. llvm-svn: 132984	2011-06-14 07:18:26 +00:00
Bruno Cardoso Lopes	dc9ff3a4b1	Add one more argument to the prefetch intrinsic to indicate whether it's a data or instruction cache access. Update the targets to match it and also teach autoupgrade. llvm-svn: 132976	2011-06-14 04:58:37 +00:00
Nadav Rotem	573ee374a2	Fix a bug in FindMemType. When widening vector loads, use a wider memory type only if the number of packed elements is a power of two. Bug found in Duncan's testcase. llvm-svn: 132923	2011-06-13 18:13:24 +00:00
Nadav Rotem	504cf0cde2	Fix a bug in the calculation of the vectorTypeBreakdown into registers. Odd types such as i33 were rounded to i32. Originated from Duncan's testcase. llvm-svn: 132893	2011-06-12 14:56:55 +00:00
Nadav Rotem	083837e729	Improve the generated code by getCopyFromPartsVector for promoted integer types. Instead of scalarizing, and doing an element-by-element truncat, use vector truncate. Add support for scalarization of vectors: i8 -> <1 x i1> (from Duncan's testcase). llvm-svn: 132892	2011-06-12 14:49:38 +00:00
Chad Rosier	79044dbebf	Revert r132871. llvm-svn: 132872	2011-06-11 02:27:46 +00:00
Chad Rosier	5793b53027	Typo. llvm-svn: 132871	2011-06-11 02:16:36 +00:00
Eric Christopher	eb964516c3	80-col cleanups. llvm-svn: 132863	2011-06-10 23:05:08 +00:00
Eli Friedman	1877ac9937	Change this DAGCombine to build AND of SHR instead of SHR of AND; this matches the ordering we prefer in instcombine. Part of rdar://9562809. The potential DAGCombine which enforces this more generally messes up some other very fragile patterns, so I'm leaving that alone, at least for now. llvm-svn: 132809	2011-06-09 22:14:44 +00:00
Eric Christopher	0713a9d8fc	Add a parameter to CCState so that it can access the MachineFunction. No functional change. Part of PR6965 llvm-svn: 132763	2011-06-08 23:55:35 +00:00
Andrew Trick	6ed0c63559	Remove a temporary test case probe in CheckForLiveRegDef. llvm-svn: 132751	2011-06-08 15:19:49 +00:00
Andrew Trick	0af2e47310	Fix a merge bug in preRAsched for handling physreg aliases. I've been sitting on this long enough trying to find a test case. I think the fix should go in now, but I'll keep working on the test case. llvm-svn: 132701	2011-06-07 00:38:12 +00:00
Nadav Rotem	c807fa5687	Add methods to support the integer-promotion of vector types. Methods to legalize SDNodes such as BUILD_VECTOR, EXTRACT_VECTOR_ELT, etc. llvm-svn: 132689	2011-06-06 20:55:56 +00:00
Stuart Hastings	bee6fcc5aa	Avoid FGETSIGN of 80-bit types. Fixes PR10085. llvm-svn: 132681	2011-06-06 16:44:31 +00:00
Eli Friedman	bd375f1a3f	PR10077: fix fast-isel of extractvalue of aggregate constants. llvm-svn: 132676	2011-06-06 05:46:34 +00:00
Nadav Rotem	06bd6d304e	TypeLegalizer: Add support for passing of vector-promoted types in registers (copyFromParts/copyToParts). llvm-svn: 132649	2011-06-04 20:58:08 +00:00
Nadav Rotem	78d19bebe6	TypeLegalizer: Fix a bug in the promotion of elements of integer vectors. (only happens when using the -promote-elements option). The correct legalization order is to first try to promote element. Next, we try to widen vectors. llvm-svn: 132648	2011-06-04 20:32:01 +00:00
Eric Christopher	fbff0e4f26	Add a TODO about memory operands. llvm-svn: 132559	2011-06-03 17:21:23 +00:00
Eric Christopher	de9399bf76	Have LowerOperandForConstraint handle multiple character constraints. Part of rdar://9119939 llvm-svn: 132510	2011-06-02 23:16:42 +00:00
Rafael Espindola	aa318ae495	Revert 132424 to fix PR10068. llvm-svn: 132479	2011-06-02 19:57:47 +00:00
Jakob Stoklund Olesen	aff1060207	Use TRI::has{Sub,Super}ClassEq() where possible. No functional change. llvm-svn: 132455	2011-06-02 05:43:46 +00:00
Stuart Hastings	7adc95f69e	Recommit 132404 with fixes. rdar://problem/5993888 llvm-svn: 132424	2011-06-01 21:33:14 +00:00
Eric Christopher	690030c116	Allow bitcasts between valid types of the same size and vector types if the vector type is legal. Fixes rdar://9306086 llvm-svn: 132420	2011-06-01 19:55:10 +00:00
Nadav Rotem	22ad9bb7d9	Refactor LegalizeTypes: Erase LegalizeAction and make the type legalizer use the TargetLowering enum. llvm-svn: 132418	2011-06-01 19:47:10 +00:00
Stuart Hastings	3ae49c03a4	Fix double FGETSIGN to work on x86_32; followup to 132396. rdar://problem/5660695 llvm-svn: 132411	2011-06-01 18:32:25 +00:00
Stuart Hastings	fd5ecd0cec	Turn on FGETSIGN for x86. Followup to 132388. rdar://problem/5660695 llvm-svn: 132396	2011-06-01 14:04:17 +00:00
Nadav Rotem	8b24a731f2	This patch is another step in the direction of adding vector select. In this patch we add a flag to enable a new type legalization decision - to promote integer elements in vectors. Currently, the rest of the codegen does not support this kind of legalization. This flag will be removed when the transition is complete. llvm-svn: 132394	2011-06-01 12:51:46 +00:00
Nadav Rotem	d86c1c41fb	Refactor the type legalizer. Switch TargetLowering to a new enum - LegalizeTypeAction. This patch does not change the behavior of the type legalizer. The codegen produces the same code. This infrastructural change is needed in order to enable complex decisions for vector types (needed by the vector-select patch). llvm-svn: 132263	2011-05-28 17:57:14 +00:00
Nadav Rotem	a9effb13dd	Refactor getActionType and getTypeToTransformTo ; place all of the 'decision' code in one place. Re-apply 131534 and fix the multi-step promotion of integers. llvm-svn: 132217	2011-05-27 21:03:13 +00:00
Eli Friedman	c70355195c	Rewrite fast-isel integer cast handling to handle more cases, and to be simpler and more consistent. The practical effects here are that x86-64 fast-isel can now handle trunc from i8 to i1, and ARM fast-isel can handle many more constructs involving integers narrower than 32 bits (including loads, stores, and many integer casts). rdar://9437928 . llvm-svn: 132099	2011-05-25 23:49:02 +00:00
Devang Patel	84b64a3e92	Remove unused statistical counter. llvm-svn: 132087	2011-05-25 21:55:40 +00:00
Devang Patel	5de2375db8	Remove dead code. llvm-svn: 131974	2011-05-24 18:27:52 +00:00
Evan Cheng	88f9137fd7	- Teach SelectionDAG::isKnownNeverZero to return true (op x, c) when c is non-zero. - Teach X86 cmov optimization to eliminate the cmov from ctlz, cttz extension when the source of X86ISD::BSR / X86ISD::BSF is proven to be non-zero. rdar://9490949 llvm-svn: 131948	2011-05-24 01:48:22 +00:00
Devang Patel	efec7715ec	Revert 121907 (it causes llc crash) and apply original patch from PR9817. llvm-svn: 131926	2011-05-23 22:04:42 +00:00
Devang Patel	7992883811	Preserve debug info during iSel by keeping DanglingDebugInfoMap live until end of function. Patch by Micah Villmow llvm-svn: 131908	2011-05-23 17:44:13 +00:00
Devang Patel	c4d9a84159	While replacing all uses of a SDValue with another value, do not forget to transfer SDDbgValue. llvm-svn: 131907	2011-05-23 17:35:08 +00:00
Chris Lattner	68254fcbca	Eliminate some temporary variables, and don't call getByValTypeAlignment when we're just going to throw the result away. No functionality change. llvm-svn: 131880	2011-05-22 23:23:02 +00:00
Benjamin Kramer	2fd48f2730	Implement mulo x, 2 -> addo x, x in DAGCombiner. llvm-svn: 131800	2011-05-21 18:31:55 +00:00
Cameron Zwarich	2af60abad8	Fix PR9955 by only attaching load memory operands to load instructions and similarly for stores. Now "make check" passes with the MachineVerifier forced on with the VerifyCoalescing option! llvm-svn: 131705	2011-05-19 23:44:34 +00:00
Stuart Hastings	4a4e5a2b55	Update some currently-disabled code, preparing for eventual use. llvm-svn: 131663	2011-05-19 18:48:20 +00:00
Duncan Sands	3d9407f4eb	Revert commit 131534 since it seems to have broken several buildbots. Original log entry: Refactor getActionType and getTypeToTransformTo ; place all of the 'decision' code in one place. llvm-svn: 131536	2011-05-18 14:57:56 +00:00
Nadav Rotem	c5c27ede55	Refactor getActionType and getTypeToTransformTo ; place all of the 'decision' code in one place. llvm-svn: 131534	2011-05-18 12:26:38 +00:00
Eli Friedman	e9692808b7	Make fast-isel miss counting in -stats and -fast-isel-verbose take terminators into account; since there are many fewer isel misses with recent changes, misses caused by terminators are more significant. llvm-svn: 131502	2011-05-17 23:02:10 +00:00
Dan Gohman	abffc991dc	Misc. code cleanups. llvm-svn: 131497	2011-05-17 22:22:52 +00:00
Dan Gohman	4298df6d86	Misc. code cleanups. llvm-svn: 131495	2011-05-17 22:20:36 +00:00
Dan Gohman	d282f46c6b	Delete unused variables. llvm-svn: 131430	2011-05-16 22:19:54 +00:00
Dan Gohman	d4d12d14b5	Trim #includes. llvm-svn: 131429	2011-05-16 22:14:50 +00:00
Dan Gohman	ae9b1685a8	Fix whitespace and 80-column violations. llvm-svn: 131428	2011-05-16 22:09:53 +00:00
Jim Grosbach	e85c0dde7a	Track how many insns fast-isel successfully selects as well as how many it misses. llvm-svn: 131426	2011-05-16 21:51:07 +00:00
Devang Patel	8e60ff11db	Preserve debug info for unused zero extended boolean argument. Radar 9422775. llvm-svn: 131422	2011-05-16 21:24:05 +00:00
Eli Friedman	a4d4a0162d	Make fast-isel work correctly s/uadd.with.overflow intrinsics. llvm-svn: 131420	2011-05-16 21:06:17 +00:00
Eli Friedman	4c08bb450a	Fix silly typo. llvm-svn: 131419	2011-05-16 20:34:53 +00:00
Eli Friedman	9ac944774f	Basic fast-isel of extractvalue. Not too helpful on its own, given the IR clang generates for cases like this, but it should become more useful soon. llvm-svn: 131417	2011-05-16 20:27:46 +00:00
Rafael Espindola	2050af838d	Don't do tail calls in a function that call setjmp. The stack might be corrupted when setjmp returns again. llvm-svn: 131399	2011-05-16 03:05:33 +00:00
Eli Friedman	8f1e11cde9	Fix a FIXME by moving the fast-isel implementation of the objectsize intrinsic from the x86 code to the generic code. llvm-svn: 131332	2011-05-14 00:47:51 +00:00
Rafael Espindola	e53b7d1a11	Make codegen able to handle values of empty types. This is one way to fix PR9900. I will keep it open until sable is able to comment on it. llvm-svn: 131294	2011-05-13 15:18:06 +00:00
Stuart Hastings	aa02c0847d	Since I can't reproduce the failures from 131261, re-trying with a simplified version. <rdar://problem/9298790> llvm-svn: 131274	2011-05-13 00:51:54 +00:00
Stuart Hastings	8d57d8ea64	Revert 131266 and 131261 due to buildbot complaints. rdar://problem/9298790 llvm-svn: 131269	2011-05-13 00:15:17 +00:00
Stuart Hastings	89f1b47e3a	Non-fast-isel followup to 129634; correctly handle branches controlled by non-CMP expressions. The executable test case (129821) would test this as well, if we had an "-O0 -disable-arm-fast-isel" LLVM-GCC tester. Alas, the ARM assembly would be very difficult to check with FileCheck. The thumb2-cbnz.ll test is affected; it generates larger code (tst.w vs. cmp #0), but I believe the new version is correct. rdar://problem/9298790 llvm-svn: 131261	2011-05-12 23:36:41 +00:00
Nadav Rotem	8a7beb80f0	Fixes a bug in the DAGCombiner. LoadSDNodes have two values (data, chain). If there is a store after the load node, then there is a chain, which means that there is another user. Thus, asking hasOneUser would fail. Instead we ask hasNUsesOfValue on the 'data' value. llvm-svn: 131183	2011-05-11 14:40:50 +00:00
Bill Wendling	50117f8186	Give the 'eh.sjlj.dispatchsetup' intrinsic call the value coming from the setjmp intrinsic call. This prevents it from being reordered so that it appears before the setjmp intrinsic (thus making it completely useless). <rdar://problem/9409683> llvm-svn: 131174	2011-05-11 01:11:55 +00:00
Eli Friedman	768de0a0f8	Disable my little CopyToReg argument hack with fast-isel. rdar://problem/9413587 . llvm-svn: 131156	2011-05-10 21:50:58 +00:00
Stuart Hastings	999fa3bf1f	Correctly walk through nested and adjacent CALLSEQ_START nodes. No test case; I've only seen this on a release branch, and I can't get it to reproduce on trunk. rdar://problem/7662569 llvm-svn: 131152	2011-05-10 21:20:03 +00:00
Eric Christopher	4480428474	Look through struct wrapped types for inline asm statments. Patch by Evan Cheng. llvm-svn: 131093	2011-05-09 20:04:43 +00:00
Duncan Sands	6be291a2cd	Indent properly, no functionality change. llvm-svn: 131082	2011-05-09 08:03:33 +00:00
Evan Cheng	d26fc5e013	80 col violations. llvm-svn: 131015	2011-05-06 20:52:23 +00:00
Eli Friedman	2518f8376d	Make the logic for determining function alignment more explicit. No functionality change. llvm-svn: 131012	2011-05-06 20:34:06 +00:00
Eli Friedman	7a78f66145	Use array_lengthof. No functional change. llvm-svn: 131008	2011-05-06 19:50:10 +00:00
Owen Anderson	68b6b0efb0	Allow FastISel of three-register-operand instructions. llvm-svn: 130934	2011-05-05 17:59:04 +00:00
Eli Friedman	441a01a2b8	Avoid extra vreg copies for arguments passed in registers. Specifically, this can make MachineCSE more effective in some cases (especially in small functions). PR8361 / part of rdar://problem/8259436 . llvm-svn: 130928	2011-05-05 16:53:34 +00:00
Eli Friedman	fd8c6adffb	Small syntax cleanup; we don't need to #define constants in C++. No functionality change intended. llvm-svn: 130926	2011-05-05 16:25:23 +00:00
Owen Anderson	66fd073974	Other parts of the SelectionDAG framework assume that targets use their pointer type for vector indices. Make the vector unrolling code respect that. llvm-svn: 130733	2011-05-02 22:25:45 +00:00
Eli Friedman	4105ed1523	Make FastEmit_ri_ try a bit harder to succeed for supported operations; FastEmit_i can fail for non-Thumb2 ARM. Makes ARMSimplifyAddress work correctly, and reduces the number of fast-isel bailouts on non-Thumb ARM. llvm-svn: 130560	2011-04-29 23:34:52 +00:00
Eli Friedman	33c133919a	Fix a silly mistake in r130338. llvm-svn: 130360	2011-04-28 00:42:03 +00:00
Eli Friedman	406c471b69	Make the fast-isel code for literal 0.0 a bit shorter/faster, since 0.0 is common. rdar://problem/9303592 . llvm-svn: 130338	2011-04-27 22:41:55 +00:00
Eli Friedman	121d27e9e4	Remove unused function. llvm-svn: 130337	2011-04-27 22:21:02 +00:00
Evan Cheng	1355bbdd11	Be careful about scheduling nodes above previous calls. It increase usages of more callee-saved registers and introduce copies. Only allows it if scheduling a node above calls would end up lessen register pressure. Call operands also has added ABI restrictions for register allocation, so be extra careful with hoisting them above calls. rdar://9329627 llvm-svn: 130245	2011-04-26 21:31:35 +00:00
Dan Gohman	7da91aee83	Fast-isel support for simple inline asms. llvm-svn: 130205	2011-04-26 17:18:34 +00:00
Evan Cheng	2f64754031	Fix typo llvm-svn: 130190	2011-04-26 04:57:37 +00:00
Devang Patel	734f2218ac	A dbg.declare may not be in entry block, even if it is referring to an incoming argument. However, It is appropriate to emit DBG_VALUE referring to this incoming argument in entry block in MachineFunction. llvm-svn: 130129	2011-04-25 16:33:52 +00:00
Jay Foad	1a180156b6	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Owen Anderson	dd450b86cf	Teach FastISel to deal with instructions that have two immediate operands. llvm-svn: 130033	2011-04-22 23:38:06 +00:00
Chris Lattner	6d277517d1	Recommit the fix for rdar://9289512 with a couple tweaks to fix bugs exposed by the gcc dejagnu testsuite: 1. The load may actually be used by a dead instruction, which would cause an assert. 2. The load may not be used by the current chain of instructions, and we could move it past a side-effecting instruction. Change how we process uses to define the problem away. llvm-svn: 130018	2011-04-22 21:59:37 +00:00
Benjamin Kramer	341c11da3b	DAGCombine: fold "(zext x) == C" into "x == (trunc C)" if the trunc is lossless. On x86 this allows to fold a load into the cmp, greatly reducing register pressure. movzbl (%rdi), %eax cmpl $47, %eax -> cmpb $47, (%rdi) This shaves 8k off gcc.o on i386. I'll leave applying the patch in README.txt to Chris :) llvm-svn: 130005	2011-04-22 18:47:44 +00:00
Daniel Dunbar	6309828206	Revert r1296656, "Fix rdar://9289512 - not folding load into compare at -O0...", which broke a couple GCC test suite tests at -O0. llvm-svn: 129914	2011-04-21 16:14:46 +00:00
Eric Christopher	bcaedb5ce0	Rewrite the expander for umulo/smulo to remember to sign extend the input manually and pass all (now) 4 arguments to the mul libcall. Add a new ExpandLibCall for just this (copied gratuitously from type legalization). Fixes rdar://9292577 llvm-svn: 129842	2011-04-20 01:19:45 +00:00
Stuart Hastings	468086d5e1	Delete unnecessary variable. <rdar://problem/7662569> llvm-svn: 129796	2011-04-19 20:09:38 +00:00
Eli Friedman	bcd09b3a7f	SelectBasicBlock is rather slow even when it doesn't do anything; skip the unnecessary work where possible. llvm-svn: 129763	2011-04-19 17:01:08 +00:00
Stuart Hastings	0b68c1219f	Support nested CALLSEQ_BEGIN/END; necessary for ARM byval support. <rdar://problem/7662569> llvm-svn: 129761	2011-04-19 16:16:58 +00:00
Chris Lattner	91328b317b	Implement support for x86 fastisel of small fixed-sized memcpys, which are generated en-mass for C++ PODs. On my c++ test file, this cuts the fast isel rejects by 10x and shrinks the generated .s file by 5% llvm-svn: 129755	2011-04-19 05:52:03 +00:00
Chris Lattner	48f75ad678	while we're at it, handle 'sdiv exact' of a power of 2 also, this fixes a few rejects on c++ iterator loops. llvm-svn: 129694	2011-04-18 07:00:40 +00:00
Chris Lattner	562d6e82bd	fix rdar://9297011 - udiv by power of two causing fast-isel rejects llvm-svn: 129693	2011-04-18 06:55:51 +00:00
Chris Lattner	b53ccb8e36	1. merge fast-isel-shift-imm.ll into fast-isel-x86-64.ll 2. implement rdar://9289501 - fast isel should fold trivial multiplies to shifts 3. teach tblgen to handle shift immediates that are different sizes than the shifted operands, eliminating some code from the X86 fast isel backend. 4. Have FastISel::SelectBinaryOp use (the poorly named) FastEmit_ri_ function instead of FastEmit_ri to simplify code. llvm-svn: 129666	2011-04-17 20:23:29 +00:00
Chris Lattner	4832660b4d	fix an oversight which caused us to compile the testcase (and other less trivial things) into a dummy lea. Before we generated: _test: ## @test movq _G@GOTPCREL(%rip), %rax leaq (%rax), %rax ret now we produce: _test: ## @test movq _G@GOTPCREL(%rip), %rax ret This is part of rdar://9289558 llvm-svn: 129662	2011-04-17 17:12:08 +00:00
Chris Lattner	045c43855c	Fix rdar://9289512 - not folding load into compare at -O0 The basic issue here is that bottom-up isel is matching the branch and compare, and was failing to fold the load into the branch/compare combo. Fixing this (by allowing folding into any instruction of a sequence that is selected) allows us to produce things like: cmpb $0, 52(%rax) je LBB4_2 instead of: movb 52(%rax), %cl cmpb $0, %cl je LBB4_2 This makes the generated -O0 code run a bit faster, but also speeds up compile time by putting less pressure on the register allocator and generating less code. This was one of the biggest classes of missing load folding. Implementing this shrinks 176.gcc's c-decl.s (as a random example) by about 4% in (verbose-asm) line count. llvm-svn: 129656	2011-04-17 06:35:44 +00:00
Chris Lattner	d70ff0d807	split a complex predicate out to a helper function. Simplify two for loops, which don't need to check for falling off the end of a block and end of phi nodes, since terminators are never phis. llvm-svn: 129655	2011-04-17 06:03:19 +00:00
Chris Lattner	fba7ca63cc	fix rdar://9289583 - fast isel should handle non-canonical commutative binops allowing us to fold the immediate into the 'and' in this case: int test1(int i) { return 8&i; } llvm-svn: 129653	2011-04-17 01:16:47 +00:00
Eli Friedman	55b0acd624	PR9055: extend the fix to PR4050 (r70179) to apply to zext and anyext. Returning a new node makes the code try to replace the old node, which in the included testcase is killed by CSE. llvm-svn: 129650	2011-04-16 23:25:34 +00:00
Evan Cheng	b14ce09fca	Fix divmod libcall lowering. Convert to {S\|U}DIVREM first and then expand the node to a libcall. rdar://9280991 llvm-svn: 129633	2011-04-16 03:08:26 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Owen Anderson	a519284fec	Fix another instance of the DAG combiner not using the correct type for the RHS of a shift. llvm-svn: 129522	2011-04-14 17:30:49 +00:00
Andrew Trick	bfbd972b1f	In the pre-RA scheduler, maintain cmp+br proximity. This is done by pushing physical register definitions close to their use, which happens to handle flag definitions if they're not glued to the branch. This seems to be generally a good thing though, so I didn't need to add a target hook yet. The primary motivation is to generate code closer to what people expect and rule out missed opportunity from enabling macro-op fusion. As a side benefit, we get several 2-5% gains on x86 benchmarks. There is one regression: SingleSource/Benchmarks/Shootout/lists slows down be -10%. But this is an independent scheduler bug that will be tracked separately. See rdar://problem/9283108. Incidentally, pre-RA scheduling is only half the solution. Fixing the later passes is tracked by: <rdar://problem/8932804> [pre-RA-sched] on x86, attempt to schedule CMP/TEST adjacent with condition jump Fixes: <rdar://problem/9262453> Scheduler unnecessary break of cmp/jump fusion llvm-svn: 129508	2011-04-14 05:15:06 +00:00
Chris Lattner	493b3e72f2	sink a call into its only use. llvm-svn: 129503	2011-04-14 04:12:47 +00:00
Owen Anderson	9c12834eed	During post-legalization DAG combining, be careful to only create shifts where the RHS is of the legal type for the new operation. llvm-svn: 129484	2011-04-13 23:22:23 +00:00
Andrew Trick	b53a00d2cb	Recommit r129383. PreRA scheduler heuristic fixes: VRegCycle, TokenFactor latency. Additional fixes: Do something reasonable for subtargets with generic itineraries by handle node latency the same as for an empty itinerary. Now nodes default to unit latency unless an itinerary explicitly specifies a zero cycle stage or it is a TokenFactor chain. Original fixes: UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make the ndoe latency adjustments work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129421	2011-04-13 00:38:32 +00:00
Andrew Trick	1b60ad6644	Revert 129383. It causes some targets to hit a scheduler assert. llvm-svn: 129385	2011-04-12 20:14:07 +00:00
Andrew Trick	c5dd24a542	PreRA scheduler heuristic fixes: VRegCycle, TokenFactor latency. UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make these heuristic adjustments to node latency work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129383	2011-04-12 19:54:36 +00:00
Jay Foad	7c14a558fe	Don't include Operator.h from InstrTypes.h. llvm-svn: 129271	2011-04-11 09:35:34 +00:00
Chris Lattner	cfe5aa65d2	Avoid excess precision issues that lead to generating host-compiler-specific code. Switch lowering probably shouldn't be using FP for this. This resolves PR9581. llvm-svn: 129199	2011-04-09 06:57:13 +00:00
Chris Lattner	41c80e89f3	have dag combine zap "store undef", which can be formed during call lowering with undef arguments. llvm-svn: 129185	2011-04-09 02:32:02 +00:00
Evan Cheng	74d92c1924	Change -arm-trap-func= into a non-arm specific option. Now Intrinsic::trap is lowered into a call to the specified trap function at sdisel time. llvm-svn: 129152	2011-04-08 21:37:21 +00:00
Andrew Trick	2ad0b37318	Added a check in the preRA scheduler for potential interference on a induction variable. The preRA scheduler is unaware of induction vars, so we look for potential "virtual register cycles" instead. Fixes <rdar://problem/8946719> Bad scheduling prevents coalescing llvm-svn: 129100	2011-04-07 19:54:57 +00:00
Bill Wendling	dd4dcd549b	Revamp the SjLj "dispatch setup" intrinsic. It needed to be moved closer to the setjmp statement, because the code directly after the setjmp needs to know about values that are on the stack. Also, the 'bitcast' of the function context was causing a dead load. This wouldn't be too horrible, except that at -O0 it wasn't optimized out, and because it wasn't using the correct base pointer (if there is a VLA), it would try to access a value from a garbage address. <rdar://problem/9130540> llvm-svn: 128873	2011-04-05 01:37:43 +00:00
Stuart Hastings	ad68c93a2d	Revert 123704; it broke threaded LLVM. llvm-svn: 128868	2011-04-05 00:37:28 +00:00
Cameron Zwarich	8c7bbc09e2	Add a RemoveFromWorklist method to DCI. This is needed to do some complicated transformations in target-specific DAG combines without causing DAGCombiner to delete the same node twice. If you know of a better way to avoid this (see my next patch for an example), please let me know. llvm-svn: 128758	2011-04-02 02:40:26 +00:00
Evan Cheng	8b1bca1998	Add comments. llvm-svn: 128730	2011-04-01 19:57:01 +00:00
Evan Cheng	8d68ebd42a	Assign node order numbers to results of call instruction lowering. This should improve src line debug info when sdisel is used. rdar://9199118 llvm-svn: 128728	2011-04-01 19:42:22 +00:00
Evan Cheng	bd76679700	Issue libcalls __udivmodi4 / __divmodi4 for div / rem pairs. rdar://8911343 llvm-svn: 128696	2011-04-01 00:42:02 +00:00
Benjamin Kramer	355ce07425	Turn SelectionDAGBuilder::GetRegistersForValue into a local function. It couldn't be used outside of the file because SDISelAsmOperandInfo is local to SelectionDAGBuilder.cpp. Making it a static function avoids a weird linkage dance. llvm-svn: 128342	2011-03-26 16:35:10 +00:00
Andrew Trick	3bd8b7a388	Fix for -pre-RA-sched=source. Yet another case of unchecked NULL node (for physreg copy). May fix PR9509. llvm-svn: 128266	2011-03-25 06:40:55 +00:00
Eli Friedman	4c192305bf	PR9535: add support for splitting and scalarizing vector ISD::FP_ROUND. Also cleaning up some duplicated code while I'm here. llvm-svn: 128176	2011-03-23 22:18:48 +00:00
Andrew Trick	13acae040c	Ensure that def-side physreg copies are scheduled above any other uses so the scheduler can't create new interferences on the copies themselves. Prior to this fix the scheduler could get stuck in a loop creating copies. Fixes PR9509. llvm-svn: 128164	2011-03-23 20:42:39 +00:00
Andrew Trick	a8846e0540	whitespace llvm-svn: 128163	2011-03-23 20:40:18 +00:00
Andrew Trick	b1fd328581	Added block number and name to isel debug output. I'm tired of doing this manually for each checkout. If anyone knows a better way debug isel for non-trivial tests feel free to revert and let me know how to do it. llvm-svn: 128132	2011-03-23 01:38:28 +00:00
Eric Christopher	1b4b1e559a	Grammar-o. llvm-svn: 128004	2011-03-21 18:06:21 +00:00
Nadav Rotem	e7a101ccab	Add support for legalizing UINT_TO_FP of vectors on platforms which do not have native support for this operation (such as X86). The legalized code uses two vector INT_TO_FP operations and is faster than scalarizing. llvm-svn: 127951	2011-03-19 13:09:10 +00:00
Benjamin Kramer	cfcea12fe2	BuildUDIV: If the divisor is even we can simplify the fixup of the multiplied value by introducing an early shift. This allows us to compile "unsigned foo(unsigned x) { return x/28; }" into shrl $2, %edi imulq $613566757, %rdi, %rax shrq $32, %rax ret instead of movl %edi, %eax imulq $613566757, %rax, %rcx shrq $32, %rcx subl %ecx, %eax shrl %eax addl %ecx, %eax shrl $4, %eax on x86_64 llvm-svn: 127829	2011-03-17 20:39:14 +00:00
Cameron Zwarich	2ef0c69df1	Move more logic into getTypeForExtArgOrReturn. llvm-svn: 127809	2011-03-17 14:53:37 +00:00
Cameron Zwarich	34e7b3f77e	Rename getTypeForExtendedInteger() to getTypeForExtArgOrReturn(). llvm-svn: 127807	2011-03-17 14:21:56 +00:00
Cameron Zwarich	ac106273d4	The x86-64 ABI says that a bool is only guaranteed to be sign-extended to a byte rather than an int. Thankfully, this only causes LLVM to miss optimizations, not generate incorrect code. This just fixes the zext at the return. We still insert an i32 ZextAssert when reading a function's arguments, but it is followed by a truncate and another i8 ZextAssert so it is not optimized. llvm-svn: 127766	2011-03-16 22:20:18 +00:00
Cameron Zwarich	d1ad9bc277	Don't recompute something that we already have in a local variable. llvm-svn: 127764	2011-03-16 22:20:07 +00:00
Evan Cheng	c5c2cfa381	sext(undef) = 0, because the top bits will all be the same. zext(undef) = 0, because the top bits will be zero. llvm-svn: 127649	2011-03-15 02:22:10 +00:00
Evan Cheng	37139edc8c	BIT_CONVERT has been renamed to BITCAST. llvm-svn: 127600	2011-03-14 18:19:52 +00:00
Evan Cheng	d2f3b01797	Minor optimization. sign-ext/anyext of undef is still undef. llvm-svn: 127598	2011-03-14 18:15:55 +00:00
Owen Anderson	66443c034d	Teach FastISel to support register-immediate-immediate instructions. llvm-svn: 127496	2011-03-11 21:33:55 +00:00
Andrew Trick	710d5da306	Replace -dag-chain-limit flag with constant. It has survived a release cycle without being touched, so no longer needs to pollute the hidden-help text. llvm-svn: 127468	2011-03-11 17:46:59 +00:00
Evan Cheng	adb9c03e41	Avoid replacing the value of a directly stored load with the stored value if the load is indexed. rdar://9117613. llvm-svn: 127440	2011-03-11 00:48:56 +00:00
Evan Cheng	b4c6a34415	Re-commit 127368 and 127371. They are exonerated. llvm-svn: 127380	2011-03-10 00:16:32 +00:00
Evan Cheng	d4b3f8e009	Revert 127368 and 127371 for now. llvm-svn: 127376	2011-03-09 23:53:17 +00:00
Evan Cheng	ca9a936332	Change the definition of TargetRegisterInfo::getCrossCopyRegClass to be more flexible. If it returns a register class that's different from the input, then that's the register class used for cross-register class copies. If it returns a register class that's the same as the input, then no cross- register class copies are needed (normal copies would do). If it returns null, then it's not at all possible to copy registers of the specified register class. llvm-svn: 127368	2011-03-09 22:47:38 +00:00
Andrew Trick	072ed2ee0d	Improve pre-RA-sched register pressure tracking for duplicate operands. This helps cases like 2008-07-19-movups-spills.ll, but doesn't have an obvious impact on benchmarks llvm-svn: 127347	2011-03-09 19:12:43 +00:00
Benjamin Kramer	b2e4d84305	Fix typo, make helper static. llvm-svn: 127335	2011-03-09 16:19:12 +00:00
Eric Christopher	7238cba180	Fix some latent bugs if the nodes are unschedulable. We'd gotten away with this before since none of the register tracking or nightly tests had unschedulable nodes. This should probably be refixed with a special default Node that just returns some "don't touch me" values. Fixes PR9427 llvm-svn: 127263	2011-03-08 19:35:47 +00:00
Andrew Trick	52b3e38a1f	Further improvements to pre-RA-sched=list-ilp. This change uses the MaxReorderWindow for both height and depth, which tends to limit the negative effects of high register pressure. llvm-svn: 127203	2011-03-08 01:51:56 +00:00
Cameron Zwarich	df61694417	Move getRegPressureLimit() from TargetLoweringInfo to TargetRegisterInfo. llvm-svn: 127175	2011-03-07 21:56:36 +00:00
Owen Anderson	cd526fa15e	Use the correct LHS type when determining the legalization of a shift's RHS type. llvm-svn: 127163	2011-03-07 18:29:47 +00:00
Eric Christopher	9cb33deebf	Typo. llvm-svn: 127131	2011-03-06 21:13:45 +00:00
Andrew Trick	dd01732e63	Disable a couple of experimental heuristics to get the best results from the current implementation of -pre-RA-sched=list-ilp. llvm-svn: 127113	2011-03-06 00:03:32 +00:00
Andrew Trick	25cedf3fe4	Be explicit with abs(). Visual Studio workaround. llvm-svn: 127075	2011-03-05 10:29:25 +00:00
Andrew Trick	d7f4c21684	Fix for -sched-high-latency-cycles in sched=list-ilp mode. llvm-svn: 127071	2011-03-05 09:18:16 +00:00
Andrew Trick	b8390b7a25	Missing comment. llvm-svn: 127068	2011-03-05 08:04:11 +00:00
Andrew Trick	641e2d4f8c	Increased the register pressure limit on x86_64 from 8 to 12 regs. This is the only change in this checkin that may affects the default scheduler. With better register tracking and heuristics, it doesn't make sense to artificially lower the register limit so much. Added -sched-high-latency-cycles and X86InstrInfo::isHighLatencyDef to give the scheduler a way to account for div and sqrt on targets that don't have an itinerary. It is currently defaults to 10 (the actual number doesn't matter much), but only takes effect on non-default schedulers: list-hybrid and list-ilp. Added several heuristics that can be individually disabled for the non-default sched=list-ilp mode. This helps us determine how much better we can do on a given benchmark than the default scheduler. Certain compute intensive loops run much faster in this mode with the right set of heuristics, and it doesn't seem to have much negative impact elsewhere. Not all of the heuristics are needed, but we still need to experiment to decide which should be disabled by default for sched=list-ilp. llvm-svn: 127067	2011-03-05 08:00:22 +00:00
Duncan Sands	6bd1044222	Revert commit 126684 "Use the correct shift amount type". It is only the correct type after type legalization has completed. Before then it may simply not be big enough to hold the shift amount, particularly on x86 which uses a very small type for shifts (this issue broke stuff in the past which is why LegalizeTypes carefully uses a large type for shift amounts). llvm-svn: 127000	2011-03-04 14:28:59 +00:00
Andrew Trick	c88b7ecb88	Minor pre-RA-sched fixes and cleanup. Fix the PendingQueue, then disable it because it's not required for the current schedulers' heuristics. Fix the logic for the unused list-ilp scheduler. llvm-svn: 126981	2011-03-04 02:03:45 +00:00
Bill Wendling	f3658f3872	There are times when the landing pad won't have a call to 'eh.selector' in it. It's been assumed up til now that it would be in its immediate successor. However, this isn't necessarily the case. It could be in one of its successor's successors. Modify the code to more thoroughly check for an 'eh.selector' call in successors. It only looks at a successor if we get there as a result of an unconditional branch. Testcase ObjC/exceptions-4.m in r126968. llvm-svn: 126969	2011-03-03 23:14:05 +00:00
Eli Friedman	d8a555bb3b	Revert r123908; the code in question is completely untested and wrong. llvm-svn: 126964	2011-03-03 22:33:23 +00:00
Bob Wilson	24b3ba5990	Avoid exponential blow-up when printing DAGs. David Greene changed CannotYetSelect() to print the full DAG including multiple copies of operands reached through different paths in the DAG. Unfortunately this blows up exponentially in some cases. The depth limit of 100 is way too high to prevent this -- I'm seeing a message string of 150MB with a depth of only 40 in one particularly bad case, even though the DAG has less than 200 nodes. Part of the problem is that the printing code is following chain operands, so if you fail to select an operation with a chain, the printer will follow all the chained operations back to the entry node. llvm-svn: 126899	2011-03-02 23:38:06 +00:00
Stuart Hastings	6b4007dec6	Can't introduce floating-point immediate constants after legalization. Radar 9056407. llvm-svn: 126864	2011-03-02 19:36:30 +00:00
Duncan Sands	cb95eeecc6	Add a few missed unary cases when legalizing vector results. Put some cases in alphabetical order. llvm-svn: 126745	2011-03-01 15:15:43 +00:00
Jim Grosbach	621818ab1a	trailing whitespace. llvm-svn: 126733	2011-03-01 01:39:05 +00:00
Jim Grosbach	1d479dbc55	Generalize the register matching code in DAGISel a bit. llvm-svn: 126731	2011-03-01 01:37:19 +00:00
Owen Anderson	0dc63104c6	Use the correct shift amount type. llvm-svn: 126684	2011-02-28 21:10:10 +00:00
Owen Anderson	4f4df81861	Clean whitespace. llvm-svn: 126683	2011-02-28 20:57:56 +00:00
Duncan Sands	f571290d1e	Legalize support for fpextend of vector. PR9309. llvm-svn: 126574	2011-02-27 14:41:27 +00:00
Nadav Rotem	b00913028f	Fix typos in the comments. llvm-svn: 126565	2011-02-27 07:40:43 +00:00
Tobias Grosser	3ac8689fa3	Pass the graph to the DOTGraphTraits.getEdgeAttributes(). This follows the interface of getNodeAttributes. llvm-svn: 126562	2011-02-27 04:11:03 +00:00
Benjamin Kramer	26691d9660	Add some DAGCombines for (adde 0, 0, glue), which are useful to optimize legalized code for large integer arithmetic. 1. Inform users of ADDEs with two 0 operands that it never sets carry 2. Fold other ADDs or ADDCs into the ADDE if possible It would be neat if we could do the same thing for SETCC+ADD eventually, but we can't do that in target independent code. llvm-svn: 126557	2011-02-26 22:48:07 +00:00
Owen Anderson	b2c80da4ae	Allow targets to specify a the type of the RHS of a shift parameterized on the type of the LHS. llvm-svn: 126518	2011-02-25 21:41:48 +00:00
Jim Grosbach	14a07365cb	Fix formatting of debug helper string. llvm-svn: 126471	2011-02-25 03:59:03 +00:00
Cameron Zwarich	4c82cd21ed	Set NumSignBits to 1 if KnownZero/KnownOne are being zero extended. In theory it is possible to do better if the high bit is set in either KnownZero/KnownOne, but in practice NumSignBits is always 1 when we are zero extending because nothing is known about that register. llvm-svn: 126465	2011-02-25 01:11:01 +00:00
Cameron Zwarich	d2f3041c7f	We only want to zero extend the existing information if the bit width is actually larger. llvm-svn: 126464	2011-02-25 01:10:55 +00:00
Nadav Rotem	502f1b943f	Enable support for vector sext and trunc: Limit the folding of any_ext and sext into the load operation to scalars. Limit the active-bits trunc optimization to scalars. Document vector trunc and vector sext in LangRef. Similar to commit 126080 (for enabling zext). llvm-svn: 126424	2011-02-24 21:01:34 +00:00
Cameron Zwarich	a62fc89a04	Merge information about the number of zero, one, and sign bits of live-out registers at phis. This enables us to eliminate a lot of pointless zexts during the DAGCombine phase. This fixes <rdar://problem/8760114>. llvm-svn: 126380	2011-02-24 10:00:25 +00:00
Cameron Zwarich	3cf9280214	Add a getNumSignBits() method to APInt. llvm-svn: 126379	2011-02-24 10:00:20 +00:00
Cameron Zwarich	97eb52da7b	Add a mechanism for invalidating the LiveOutInfo of a PHI, and use it whenever a block is visited before all of its predecessors. llvm-svn: 126378	2011-02-24 10:00:16 +00:00
Cameron Zwarich	988faf91bd	Track blocks visited in reverse postorder. llvm-svn: 126377	2011-02-24 10:00:13 +00:00
Cameron Zwarich	6470647383	Refactor the LiveOutInfo interface into a few methods on FunctionLoweringInfo and make the actual map private. llvm-svn: 126376	2011-02-24 10:00:08 +00:00
Cameron Zwarich	b670d512e9	Have isel visit blocks in reverse postorder rather than an undefined order. This allows for the information propagated across basic blocks to be merged at phis. llvm-svn: 126375	2011-02-24 10:00:04 +00:00
Cameron Zwarich	f8b22b3483	Roll out r126169 and r126170 in an attempt to fix the selfhost bot. llvm-svn: 126185	2011-02-22 03:24:52 +00:00
Cameron Zwarich	800f85baf9	Merge information about the number of zero, one, and sign bits of live-out registers at phis. This enables us to eliminate a lot of pointless zexts during the DAGCombine phase. This fixes <rdar://problem/8760114>. llvm-svn: 126170	2011-02-22 00:46:27 +00:00
Cameron Zwarich	f248f945c8	Have isel visit blocks in reverse postorder rather than an undefined order. This allows for the information propagated across basic blocks to be merged at phis. llvm-svn: 126169	2011-02-22 00:46:22 +00:00
Devang Patel	f3292b2196	Revert r124611 - "Keep track of incoming argument's location while emitting LiveIns." In other words, do not keep track of argument's location. The debugger (gdb) is not prepared to see line table entries for arguments. For the debugger, "second" line table entry marks beginning of function body. This requires some coordination with debugger to get this working. - The debugger needs to be aware of prolog_end attribute attached with line table entries. - The compiler needs to accurately mark prolog_end in line table entries (at -O0 and at -O1+) llvm-svn: 126155	2011-02-21 23:21:26 +00:00
Nadav Rotem	25f2ac948b	Fix 9267; Add vector zext support. The DAGCombiner folds the zext into complex load instructions. This patch prevents this optimization on vectors since none of the supported targets knows how to perform load+vector_zext in one instruction. llvm-svn: 126080	2011-02-20 12:37:50 +00:00
Devang Patel	b7ae3ccb84	Do not lose debug info of an inlined function argument even if the argument is only used through GEPs. This time with a fix that avoids using invalidated DenseMap iterator. llvm-svn: 125984	2011-02-18 22:43:42 +00:00
Cameron Zwarich	0a1a36dc46	Roll out r125794 to help diagnose the llvm-gcc-i386-linux-selfhost failure. llvm-svn: 125830	2011-02-18 04:58:10 +00:00
Devang Patel	f922a431ee	Do not lose debug info of an inlined function argument even if the argument is only used through GEPs. llvm-svn: 125794	2011-02-17 23:33:27 +00:00
Duncan Sands	c6196aa481	Fix wrong logic in promotion of signed mul-with-overflow (I pointed this out at the time but presumably my email got lost). Examples where the previous logic got it wrong: (1) a signed i8 multiply of 64 by 2 overflows, but the high part is zero; (2) a signed i8 multiple of -128 by 2 overflows, but the high part is all ones. llvm-svn: 125748	2011-02-17 12:42:48 +00:00
Stuart Hastings	81c4306005	Swap VT and DebugLoc operands of getExtLoad() for consistency with other getNode() methods. Radar 9002173. llvm-svn: 125665	2011-02-16 16:23:55 +00:00
Eric Christopher	e5ca1e0506	Refactor zero folding slightly. Clean up todo. llvm-svn: 125651	2011-02-16 04:50:12 +00:00
Eric Christopher	ef72141a75	The change for PR9190 wasn't quite right. We need to avoid making the transformation if we can't legally create a build vector of the correct type. Check that we can make the transformation first, and add a TODO to refactor this code with similar cases. Fixes: PR9223 and rdar://9000350 llvm-svn: 125631	2011-02-16 01:10:03 +00:00
Chris Lattner	69229316aa	convert ConstantVector::get to use ArrayRef. llvm-svn: 125537	2011-02-15 00:14:00 +00:00
Chris Lattner	34442e6ebf	revert my ConstantVector patch, it seems to have made the llvm-gcc builders unhappy. llvm-svn: 125504	2011-02-14 18:15:46 +00:00
Chris Lattner	d9f5b88548	Switch ConstantVector::get to use ArrayRef instead of a pointer+size idiom. Change various clients to simplify their code. llvm-svn: 125487	2011-02-14 07:55:32 +00:00
Chris Lattner	eff248ca7f	fix PR9210 by implementing some type legalization logic for vector fp conversions. llvm-svn: 125482	2011-02-14 06:30:45 +00:00
Chris Lattner	eaa8341d3b	fix two comment thinkos llvm-svn: 125481	2011-02-14 06:14:42 +00:00
Chris Lattner	46c01a30f4	Enhance ComputeMaskedBits to know that aligned frameindexes have their low bits set to zero. This allows us to optimize out explicit stack alignment code like in stack-align.ll:test4 when it is redundant. Doing this causes the code generator to start turning FI+cst into FI\|cst all over the place, which is general goodness (that is the canonical form) except that various pieces of the code generator don't handle OR aggressively. Fix this by introducing a new SelectionDAG::isBaseWithConstantOffset predicate, and using it in places that are looking for ADD(X,CST). The ARM backend in particular was missing a lot of addressing mode folding opportunities around OR. llvm-svn: 125470	2011-02-13 22:25:43 +00:00
Chris Lattner	e95d195014	Revisit my fix for PR9028: the issue is that DAGCombine was generating i8 shift amounts for things like i1024 types. Add an assert in getNode to prevent this from occuring in the future, fix the buggy transformation, revert my previous patch, and document this gotcha in ISDOpcodes.h llvm-svn: 125465	2011-02-13 19:09:16 +00:00
Chris Lattner	d5f0b1148a	when legalizing extremely wide shifts, make sure that the shift amounts are in a suitably wide type so that we don't generate out of range constant shift amounts. This fixes PR9028. llvm-svn: 125458	2011-02-13 09:10:56 +00:00
Chris Lattner	2a720d933a	fix visitShift to properly zero extend the shift amount if the provided operand is narrower than the shift register. Doing an anyext provides undefined bits in the top part of the register. llvm-svn: 125457	2011-02-13 09:02:52 +00:00
Nadav Rotem	db2f54811d	A fix for 9165. The DAGCombiner created illegal BUILD_VECTOR operations. The patch added a check that either illegal operations are allowed or that the created operation is legal. llvm-svn: 125435	2011-02-12 14:40:33 +00:00
Nadav Rotem	a49a02a04f	SimplifySelectOps can only handle selects with a scalar condition. Add a check that the condition is not a vector. llvm-svn: 125398	2011-02-11 19:57:47 +00:00
Nadav Rotem	18f6a33457	Fix #9190 The bug happens when the DAGCombiner attempts to optimize one of the patterns of the SUB opcode. It tries to create a zero of type v2i64. This type is legal on 32bit machines, but the initializer of this vector (i64) is target dependent. Currently, the initializer attempts to create an i64 zero constant, which fails. Added a flag to tell the DAGCombiner to create a legal zero, if we require that the pass would generate legal types. llvm-svn: 125391	2011-02-11 19:20:37 +00:00
Devang Patel	639dd997eb	Remove comment about an argument that was removed couple of years ago. llvm-svn: 125054	2011-02-07 21:58:52 +00:00
Andrew Trick	d0548ae750	Introducing a new method of tracking register pressure. We can't precisely track pressure on a selection DAG, but we can at least keep it balanced. This design accounts for various interesting aspects of selection DAGS: register and subregister copies, glued nodes, dead nodes, unused registers, etc. Added SUnit::NumRegDefsLeft and ScheduleDAGSDNodes::RegDefIter. Note: I disabled PrescheduleNodesWithMultipleUses when register pressure is enabled, based on no evidence other than I don't think it makes sense to have both enabled. llvm-svn: 124853	2011-02-04 03:18:17 +00:00
Andrew Trick	3f924e4e87	whitespace llvm-svn: 124827	2011-02-03 23:00:17 +00:00
Evan Cheng	d42641c6b5	Given a pair of floating point load and store, if there are no other uses of the load, then it may be legal to transform the load and store to integer load and store of the same width. This is done if the target specified the transformation as profitable. e.g. On arm, this can transform: vldr.32 s0, [] vstr.32 s0, [] to ldr r12, [] str r12, [] rdar://8944252 llvm-svn: 124708	2011-02-02 01:06:55 +00:00
Matt Beaumont-Gay	29c8c8fe92	Take Bill Wendling's suggestion for structuring a couple of asserts. llvm-svn: 124688	2011-02-01 22:12:50 +00:00
Devang Patel	56cc5fdf09	Keep track of incoming argument's location while emitting LiveIns. llvm-svn: 124611	2011-01-31 21:38:14 +00:00
Richard Osborne	272e084bca	Fix bug where ReduceLoadWidth was creating illegal ZEXTLOAD instructions. llvm-svn: 124587	2011-01-31 17:41:44 +00:00
Benjamin Kramer	946e1522b6	Teach DAGCombine to fold fold (sra (trunc (sr x, c1)), c2) -> (trunc (sra x, c1+c2) when c1 equals the amount of bits that are truncated off. This happens all the time when a smul is promoted to a larger type. On x86-64 we now compile "int test(int x) { return x/10; }" into movslq %edi, %rax imulq $1717986919, %rax, %rax movq %rax, %rcx shrq $63, %rcx sarq $34, %rax <- used to be "shrq $32, %rax; sarl $2, %eax" addl %ecx, %eax This fires 96 times in gcc.c on x86-64. llvm-svn: 124559	2011-01-30 16:38:43 +00:00
Benjamin Kramer	65bb14d368	Add the missing sub identity "A-(A-B) -> B" to DAGCombine. This happens e.g. for code like "X - X%10" where we lower the modulo operation to a series of multiplies and shifts that are then subtracted from X, leading to this missed optimization. llvm-svn: 124532	2011-01-29 12:34:05 +00:00
Nick Lewycky	0af77fd45b	Fix build with stdcxx by using llvm::next. Patch by Joerg Sonnenberger! llvm-svn: 124472	2011-01-28 04:00:15 +00:00
Andrew Trick	c0ca67601a	Remove a temporary workaround for a lencod miscompile. Depends on the fix in r124442. llvm-svn: 124443	2011-01-27 21:28:51 +00:00
Devang Patel	1cec755494	Speculatively revert r124380. llvm-svn: 124397	2011-01-27 19:15:01 +00:00
Devang Patel	3b266a2780	While legalizing SDValues do not drop SDDbgValues, trasfer them to new legal nodes. Take 2. This includes fix for dragonegg crash. llvm-svn: 124380	2011-01-27 17:43:53 +00:00
Matt Beaumont-Gay	a148c59231	Try harder to not have unused variables. llvm-svn: 124350	2011-01-27 02:39:27 +00:00
Matt Beaumont-Gay	0cddbf2bdf	Opt-mode -Wunused-variable cleanup llvm-svn: 124346	2011-01-27 01:47:50 +00:00
Devang Patel	92b7077f9e	Reapply 124301 llvm-svn: 124339	2011-01-27 00:13:27 +00:00
Bill Wendling	fb4ee9bbde	Initialize variable to get rid of clang warning. llvm-svn: 124331	2011-01-26 22:21:35 +00:00
Devang Patel	b370bf329a	Revert 124301. llvm-svn: 124327	2011-01-26 21:41:22 +00:00
Devang Patel	084e0628e0	Revert r124302 llvm-svn: 124320	2011-01-26 21:12:32 +00:00
David Greene	bab5e6ed0e	[AVX] Add INSERT_SUBVECTOR and support it on x86. This provides a default implementation for x86, going through the stack in a similr fashion to how the codegen implements BUILD_VECTOR. Eventually this will get matched to VINSERTF128 if AVX is available. llvm-svn: 124307	2011-01-26 19:13:22 +00:00
Devang Patel	a11210b1b8	While legalizing SDValues do not drop SDDbgValues, trasfer them to new legal nodes. llvm-svn: 124302	2011-01-26 18:55:05 +00:00
Devang Patel	9d4eb2f480	Process valid SDDbgValues even if the node does not have any order assigned. llvm-svn: 124301	2011-01-26 18:42:32 +00:00
Devang Patel	1448e7c8b6	Refactor. llvm-svn: 124300	2011-01-26 18:20:04 +00:00
David Greene	b6f1611928	[AVX] Support EXTRACT_SUBVECTOR on x86. This provides a default implementation of EXTRACT_SUBVECTOR for x86, going through the stack in a similr fashion to how the codegen implements BUILD_VECTOR. Eventually this will get matched to VEXTRACTF128 if AVX is available. llvm-svn: 124292	2011-01-26 15:38:49 +00:00
Devang Patel	efc6b16e4b	Provide an interface to transfer SDDbgValue from one SDNode to another. llvm-svn: 124245	2011-01-25 23:27:42 +00:00
Devang Patel	70f8e5962a	Resolve DanglingDbgValue of PHI nodes where the use follows dbg.value intrinisic. llvm-svn: 124203	2011-01-25 18:09:58 +00:00
Devang Patel	04b649d48a	This assertion is too restrictive, it does not apply for dangling dbg value nodes (nodes where dbg.value intrinsic preceds use of the value). llvm-svn: 124202	2011-01-25 18:09:33 +00:00
Devang Patel	533479544b	Speculatively revert r124138. llvm-svn: 124142	2011-01-24 20:04:37 +00:00
Devang Patel	8cc5355c90	Resolve DanglingDbgValue of PHI nodes where the use follows dbg.value intrinisic. llvm-svn: 124138	2011-01-24 19:24:37 +00:00
Andrew Trick	a293c49f0d	Temporarily workaround JM/lencod miscompile (SIGSEGV). rdar://problem/8893967 llvm-svn: 124137	2011-01-24 19:08:15 +00:00
Ted Kremenek	3c4408ceb6	Null initialize a few variables flagged by clang's -Wuninitialized-experimental warning. While these don't look like real bugs, clang's -Wuninitialized-experimental analysis is stricter than GCC's, and these fixes have the benefit of being general nice cleanups. llvm-svn: 124073	2011-01-23 17:05:06 +00:00
Andrew Trick	bd428ec50f	Enable support for precise scheduling of the instruction selection DAG. Disable using "-disable-sched-cycles". For ARM, this enables a framework for modeling the cpu pipeline and counting stalls. It also activates several heuristics to drive scheduling based on the model. Scheduling is inherently imprecise at this stage, and until spilling is improved it may defeat attempts to schedule. However, this framework provides greater control over tuning codegen. Although the flag is not target-specific, it should have very little affect on the default scheduler used by x86. The only two changes that affect x86 are: - scheduling a high-latency operation bumps the current cycle so independent operations can have their latency covered. i.e. two independent 4 cycle operations can produce results in 4 cycles, not 8 cycles. - Two operations with equal register pressure impact and no latency-based stalls on their uses will be prioritized by depth before height (height is irrelevant if no stalls occur in the schedule below this point). llvm-svn: 123971	2011-01-21 06:19:05 +00:00
Andrew Trick	47ff14b091	Convert -enable-sched-cycles and -enable-sched-hazard to -disable flags. They are still not enable in this revision. Added TargetInstrInfo::isZeroCost() to fix a fundamental problem with the scheduler's model of operand latency in the selection DAG. Generalized unit tests to work with sched-cycles. llvm-svn: 123969	2011-01-21 05:51:33 +00:00
Eric Christopher	37c4a8be72	My editor's indent went crazy. Fix. llvm-svn: 123909	2011-01-20 08:56:34 +00:00
Eric Christopher	785db078b4	Expand invalid return values for umulo and smulo. Handle these similarly to add/sub by doing the normal operation and then checking for overflow afterwards. This generally relies on the DAG handling the later invalid operations as well. Fixes the 64-bit part of rdar://8622122 and rdar://8774702. llvm-svn: 123908	2011-01-20 08:54:28 +00:00
Andrew Trick	2cd1f0beb6	Selection DAG scheduler register pressure heuristic fixes. Added a check for already live regs before claiming HighRegPressure. Fixed a few cases of checking the wrong number of successors. Added some tracing until these heuristics are better understood. llvm-svn: 123892	2011-01-20 06:21:59 +00:00
Eric Christopher	b2139f655b	Use only one API at a time. llvm-svn: 123866	2011-01-20 01:29:23 +00:00
Eric Christopher	bb14f65672	If we can, lower the multiply part of a umulo/smulo call to a libcall with an invalid type then split the result and perform the overflow check normally. Fixes the 32-bit parts of rdar://8622122 and rdar://8774702. llvm-svn: 123864	2011-01-20 00:29:24 +00:00
Jeffrey Yasskin	249fcd4499	Remove unused variables found by gcc-4.6's -Wunused-but-set-variable. llvm-svn: 123707	2011-01-18 00:51:23 +00:00
Stuart Hastings	4fa832aab0	Remove checking that prevented overlapping CALLSEQ_START/CALLSEQ_END ranges, add legalizer support for nested calls. Necessary for ARM byval support. Radar 7662569. llvm-svn: 123704	2011-01-18 00:09:27 +00:00
Benjamin Kramer	45d183ccf0	Fix an off-by-one error in ctpop combining. llvm-svn: 123664	2011-01-17 18:00:28 +00:00
Benjamin Kramer	24c5184dca	Add a DAGCombine to turn (ctpop x) u< 2 into (x & x-1) == 0. This shaves off 4 popcounts from the hacked 186.crafty source. This is enabled even when a native popcount instruction is available. The combined code is one operation longer but it should be faster nevertheless. llvm-svn: 123621	2011-01-17 12:04:57 +00:00
Chris Lattner	2d186574a6	reapply my fix for PR8961 with a tweak to properly handle multi-instruction sequences like calls. Many thanks to Jakob for finding a testcase. llvm-svn: 123559	2011-01-16 02:27:38 +00:00
Benjamin Kramer	bec03ea725	Add an assert so we don't silently miscompile ctpop for bit widths > 128. llvm-svn: 123549	2011-01-15 21:19:37 +00:00
Benjamin Kramer	fff2517edc	Reimplement CTPOP legalization with the "best" algorithm from http://graphics.stanford.edu/~seander/bithacks.html#CountBitsSetParallel In a silly microbenchmark on a 65 nm core2 this is 1.5x faster than the old code in 32 bit mode and about 2x faster in 64 bit mode. It's also a lot shorter, especially when counting 64 bit population on a 32 bit target. I hope this is fast enough to replace Kernighan-style counting loops even when the input is rather sparse. llvm-svn: 123547	2011-01-15 20:30:30 +00:00
Dan Gohman	abac063b7a	Delete an assignment to ThisBB which isn't needed, and tidy up some comments. llvm-svn: 123479	2011-01-14 22:26:16 +00:00
Andrew Trick	9ccce77893	Support for precise scheduling of the instruction selection DAG, disabled in this checkin. Sorry for the large diffs due to refactoring. New functionality is all guarded by EnableSchedCycles. Scheduling the isel DAG is inherently imprecise, but we give it a best effort: - Added MayReduceRegPressure to allow stalled nodes in the queue only if there is a regpressure need. - Added BUHasStall to allow checking for either dependence stalls due to latency or resource stalls due to pipeline hazards. - Added BUCompareLatency to encapsulate and standardize the heuristics for minimizing stall cycles (vs. reducing register pressure). - Modified the bottom-up heuristic (now in BUCompareLatency) to prioritize nodes by their depth rather than height. As long as it doesn't stall, height is irrelevant. Depth represents the critical path to the DAG root. - Added hybrid_ls_rr_sort::isReady to filter stalled nodes before adding them to the available queue. Related Cleanup: most of the register reduction routines do not need to be templates. llvm-svn: 123468	2011-01-14 21:11:41 +00:00
Chris Lattner	3be81e9bd7	Set the insertion point correctly for instructions generated by load folding: they should go before the new instruction not after it. llvm-svn: 123420	2011-01-14 01:33:40 +00:00
Dan Gohman	958620dd6d	Fix r123346 to handle scalar types too. llvm-svn: 123352	2011-01-13 01:06:51 +00:00
Dan Gohman	6e017a1134	Apply the patch from PR8958, which allows llc to get slightly further on the associated testcase before aborting. llvm-svn: 123346	2011-01-12 23:56:26 +00:00
Eric Christopher	1bb2c00f65	Move ExpandAtomic into the integer expansion routines - it's only used there. llvm-svn: 123202	2011-01-11 00:36:08 +00:00
Dale Johannesen	d2b48119b0	Fix PR 8916 (qv for analysis), at least the immediate problem. There's an inherent tension in DAGCombine between assuming that things will be put in canonical form, and the Depth mechanism that disables transformations when recursion gets too deep. It would not surprise me if there's a lot of little bugs like this one waiting to be discovered. The mechanism seems fragile and I'd suggest looking at it from a design viewpoint. llvm-svn: 123191	2011-01-10 21:53:07 +00:00
Anton Korobeynikov	2f93128109	Rename TargetFrameInfo into TargetFrameLowering. Also, put couple of FIXMEs and fixes here and there. llvm-svn: 123170	2011-01-10 12:39:04 +00:00
Jakob Stoklund Olesen	2fb5b31578	Simplify a bunch of isVirtualRegister() and isPhysicalRegister() logic. These functions not longer assert when passed 0, but simply return false instead. No functional change intended. llvm-svn: 123155	2011-01-10 02:58:51 +00:00
Jakob Stoklund Olesen	1331a15b0c	Replace TargetRegisterInfo::printReg with a PrintReg class that also works without a TRI instance. Print virtual registers numbered from 0 instead of the arbitrary FirstVirtualRegister. The first virtual register is printed as %vreg0. TRI::NoRegister is printed as %noreg. llvm-svn: 123107	2011-01-09 03:05:53 +00:00
Jakob Stoklund Olesen	793d7b7626	Use an IndexedMap for LiveOutRegInfo to hide its dependence on TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 123096	2011-01-08 23:10:50 +00:00
Evan Cheng	6eb516dbea	Do not model all INLINEASM instructions as having unmodelled side effects. Instead encode llvm IR level property "HasSideEffects" in an operand (shared with IsAlignStack). Added MachineInstrs::hasUnmodeledSideEffects() to check the operand when the instruction is an INLINEASM. This allows memory instructions to be moved around INLINEASM instructions. llvm-svn: 123044	2011-01-07 23:50:32 +00:00
Bob Wilson	8265d56638	Add ARM patterns to match EXTRACT_SUBVECTOR nodes. Also fix an off-by-one in SelectionDAGBuilder that was preventing shuffle vectors from being translated to EXTRACT_SUBVECTOR. Patch by Tim Northover. The test changes are needed to keep those spill-q tests from testing aligned spills and restores. If the only aligned stack objects are spill slots, we no longer realign the stack frame. Prior to this patch, an EXTRACT_SUBVECTOR was legalized by loading from the stack, which created an aligned frame index. Now, however, there is nothing except the spill slot in the stack frame, so I added an aligned alloca. llvm-svn: 122995	2011-01-07 04:59:04 +00:00
Bob Wilson	f291cb268f	Change EXTRACT_SUBVECTOR to require a constant index. We were never generating any of these nodes with variable indices, and there was one legalizer function asserting on a non-constant index. If we ever have a need to support variable indices, we can add this back again. llvm-svn: 122993	2011-01-07 04:58:56 +00:00
Duncan Sands	61c5708b51	Fix the other problem reported in PR8582. Testcase and patch by Nadav Rotem. llvm-svn: 122983	2011-01-06 23:45:22 +00:00
Eric Christopher	e516af753b	Add some fairly duplicated code to let type legalization split illegal typed atomics. This will lower exclusively to libcalls at the moment. llvm-svn: 122979	2011-01-06 22:28:56 +00:00
Evan Cheng	3ae2b79aa3	Re-implement r122936 with proper target hooks. Now getMaxStoresPerMemcpy etc. takes an option OptSize. If OptSize is true, it would return the inline limit for functions with attribute OptSize. llvm-svn: 122952	2011-01-06 06:52:41 +00:00
Evan Cheng	c052ba7ff3	Revert r122936. I'll re-implement the change. llvm-svn: 122949	2011-01-06 06:17:53 +00:00
Evan Cheng	06536e7158	r105228 reduced the memcpy / memset inline limit to 4 with -Os to avoid blowing up freebsd bootloader. However, this doesn't make much sense for Darwin, whose -Os is meant to optimize for size only if it doesn't hurt performance. rdar://8821501 llvm-svn: 122936	2011-01-06 01:04:47 +00:00
Evan Cheng	ac730dd2d1	Avoid zero extend bit test operands to pointer type if all the masks fit in the original type of the switch statement key. rdar://8781238 llvm-svn: 122935	2011-01-06 01:02:44 +00:00
Evan Cheng	260acf32ee	Optimize: r1025 = s/zext r1024, 4 r1026 = extract_subreg r1025, 4 to: r1026 = copy r1024 llvm-svn: 122925	2011-01-05 23:06:49 +00:00
Eric Christopher	c673b21a87	80-cols. llvm-svn: 122909	2011-01-05 21:45:56 +00:00
Eric Christopher	988518109d	Remove TODO, these appear to be implemented. llvm-svn: 122849	2011-01-04 22:31:50 +00:00
Benjamin Kramer	25e6e06e42	Try to reuse the value when lowering memset. This allows us to compile: void test(char *s, int a) { __builtin_memset(s, a, 15); } into 1 mul + 3 stores instead of 3 muls + 3 stores. llvm-svn: 122710	2011-01-02 19:57:05 +00:00
Benjamin Kramer	2fdea4c8f1	Lower the i8 extension in memset to a multiply instead of a potentially long series of shifts and ors. We could implement a DAGCombine to turn x * 0x0101 back into logic operations on targets that doesn't support the multiply or it is slow (p4) if someone cares enough. Example code: void test(char *s, int a) { __builtin_memset(s, a, 4); } before: _test: ## @test movzbl 8(%esp), %eax movl %eax, %ecx shll $8, %ecx orl %eax, %ecx movl %ecx, %eax shll $16, %eax orl %ecx, %eax movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret after: _test: ## @test movzbl 8(%esp), %eax imull $16843009, %eax, %eax ## imm = 0x1010101 movl 4(%esp), %ecx movl %eax, 4(%ecx) movl %eax, (%ecx) ret llvm-svn: 122707	2011-01-02 19:44:58 +00:00
Andrew Trick	5ce945ca3a	Minor cleanup related to my latest scheduler changes. llvm-svn: 122545	2010-12-24 07:10:19 +00:00
Andrew Trick	c94056692a	Fix a few cases where the scheduler is not checking for phys reg copies. The scheduling node may have a NULL DAG node, yuck. llvm-svn: 122544	2010-12-24 06:46:50 +00:00
Andrew Trick	10ffc2b6c2	Various bits of framework needed for precise machine-level selection DAG scheduling during isel. Most new functionality is currently guarded by -enable-sched-cycles and -enable-sched-hazard. Added InstrItineraryData::IssueWidth field, currently derived from ARM itineraries, but could be initialized differently on other targets. Added ScheduleHazardRecognizer::MaxLookAhead to indicate whether it is active, and if so how many cycles of state it holds. Added SchedulingPriorityQueue::HasReadyFilter to allowing gating entry into the scheduler's available queue. ScoreboardHazardRecognizer now accesses the ScheduleDAG in order to get information about it's SUnits, provides RecedeCycle for bottom-up scheduling, correctly computes scoreboard depth, tracks IssueCount, and considers potential stall cycles when checking for hazards. ScheduleDAGRRList now models machine cycles and hazards (under flags). It tracks MinAvailableCycle, drives the hazard recognizer and priority queue's ready filter, manages a new PendingQueue, properly accounts for stall cycles, etc. llvm-svn: 122541	2010-12-24 05:03:26 +00:00
Andrew Trick	c416ba612b	whitespace llvm-svn: 122539	2010-12-24 04:28:06 +00:00
Chris Lattner	11a33811b6	flags -> glue for selectiondag llvm-svn: 122509	2010-12-23 17:24:32 +00:00
Chris Lattner	f647e95b9a	sdisel flag -> glue. llvm-svn: 122507	2010-12-23 17:13:18 +00:00
Andrew Trick	528fad91d2	Reorganize ListScheduleBottomUp in preparation for modeling machine cycles and instruction issue. llvm-svn: 122491	2010-12-23 05:42:20 +00:00
Andrew Trick	a52f325c35	Converted LiveRegCycles to LiveRegGens. It's easier to work with and allows multiple nodes per cycle. llvm-svn: 122474	2010-12-23 04:16:14 +00:00
Andrew Trick	12acde11cb	In CheckForLiveRegDef use TRI->getOverlaps. llvm-svn: 122473	2010-12-23 03:43:21 +00:00
Andrew Trick	033efdf4d7	Fixes PR8823: add-with-overflow-128.ll In the bottom-up selection DAG scheduling, handle two-address instructions that read/write unspillable registers. Treat the entire chain of two-address nodes as a single live range. llvm-svn: 122472	2010-12-23 03:15:51 +00:00
Jeffrey Yasskin	9b43f33620	Change all self assignments X=X to (void)X, so that we can turn on a new gcc warning that complains on self-assignments and self-initializations. llvm-svn: 122458	2010-12-23 00:58:24 +00:00
Benjamin Kramer	1f4dfbbcb0	DAGCombine add (sext i1), X into sub X, (zext i1) if sext from i1 is illegal. The latter usually compiles into smaller code. example code: unsigned foo(unsigned x, unsigned y) { if (x != 0) y--; return y; } before: _foo: ## @foo cmpl $1, 4(%esp) ## encoding: [0x83,0x7c,0x24,0x04,0x01] sbbl %eax, %eax ## encoding: [0x19,0xc0] notl %eax ## encoding: [0xf7,0xd0] addl 8(%esp), %eax ## encoding: [0x03,0x44,0x24,0x08] ret ## encoding: [0xc3] after: _foo: ## @foo cmpl $1, 4(%esp) ## encoding: [0x83,0x7c,0x24,0x04,0x01] movl 8(%esp), %eax ## encoding: [0x8b,0x44,0x24,0x08] adcl $-1, %eax ## encoding: [0x83,0xd0,0xff] ret ## encoding: [0xc3] llvm-svn: 122455	2010-12-22 23:17:45 +00:00
Chris Lattner	cafc1e60bb	Fix a bug in ReduceLoadWidth that wasn't handling extending loads properly. We miscompiled the testcase into: _test: ## @test movl $128, (%rdi) movzbl 1(%rdi), %eax ret Now we get a proper: _test: ## @test movl $128, (%rdi) movsbl (%rdi), %eax movzbl %ah, %eax ret This fixes PR8757. llvm-svn: 122392	2010-12-22 08:02:57 +00:00
Chris Lattner	9a499e96eb	more cleanups, move a check for "roundedness" earlier to reject unhanded cases faster and simplify code. llvm-svn: 122391	2010-12-22 08:01:44 +00:00
Chris Lattner	222374d886	reduce indentation and improve comments, no functionality change. llvm-svn: 122389	2010-12-22 07:36:50 +00:00
Andrew Trick	fbb3ed8774	In DelayForLiveRegsBottomUp, handle instructions that read and write the same physical register. Simplifies the fix from the previous checkin r122211. llvm-svn: 122370	2010-12-21 22:27:44 +00:00
Andrew Trick	2085a96513	whitespace llvm-svn: 122368	2010-12-21 22:25:04 +00:00
Dale Johannesen	a94e36bbee	Reapply 122353-122355 with fixes. 122354 was wrong; the shift type was needed one place, the shift count type another. The transform in 123555 had the same problem. llvm-svn: 122366	2010-12-21 21:55:50 +00:00
Dale Johannesen	87c47499c6	Revert 122353-122355 for the moment, they broke stuff. llvm-svn: 122360	2010-12-21 21:22:27 +00:00
Dale Johannesen	caf42aa6a4	Add a new transform to DAGCombiner. llvm-svn: 122355	2010-12-21 20:10:51 +00:00
Dale Johannesen	fa5dc82fda	Get the type of a shift from the shift, not from its shift count operand. These should be the same but apparently are not always, and this is cleaner anyway. This improves the code in an existing test. llvm-svn: 122354	2010-12-21 20:06:19 +00:00
Dale Johannesen	d64931df77	Shift by the word size is invalid IR; don't create it. llvm-svn: 122353	2010-12-21 20:00:06 +00:00
Chris Lattner	2a7ff99979	fix some typos llvm-svn: 122349	2010-12-21 18:05:22 +00:00
Stuart Hastings	83cce8e7ab	Fix indentation, add comment. llvm-svn: 122345	2010-12-21 17:16:58 +00:00
Stuart Hastings	8c5bfcaa29	Missing logic for nested CALLSEQ_START/END. llvm-svn: 122342	2010-12-21 17:07:24 +00:00
Chris Lattner	3e5fbd74ed	rename MVT::Flag to MVT::Glue. "Flag" is a terrible name for something that just glues two nodes together, even if it is sometimes used for flags. llvm-svn: 122310	2010-12-21 02:38:05 +00:00
Chris Lattner	17f906be96	improve "cannot yet select" errors a trivial amount: now they are just as useless, but at least a bit more gramatical llvm-svn: 122305	2010-12-21 02:07:03 +00:00
Dale Johannesen	0a291a36f2	Cosmetic changes. llvm-svn: 122259	2010-12-20 20:10:50 +00:00
Chris Lattner	0b3ca50ebb	implement type legalization promotion support for SMULO and UMULO, giving ARM (and other 32-bit-only) targets support for i8 and i16 overflow multiplies. The generated code isn't great, but this at least fixes CodeGen/Generic/overflow.ll when running on ARM hosts. llvm-svn: 122221	2010-12-20 02:05:39 +00:00
Chris Lattner	981afd206b	Fix a bug in the scheduler's handling of "unspillable" vregs. Imagine we see: EFLAGS = inst1 EFLAGS = inst2 FLAGS gpr = inst3 EFLAGS Previously, we would refuse to schedule inst2 because it clobbers the EFLAGS of the predecessor. However, it also uses the EFLAGS of the predecessor, so it is safe to emit. SDep edges ensure that the right order happens already anyway. This fixes 2 testsuite crashes with the X86 patch I'm going to commit next. llvm-svn: 122211	2010-12-20 00:55:43 +00:00
Chris Lattner	0cfe884874	the result of CheckForLiveRegDef is dead, remove it. llvm-svn: 122209	2010-12-20 00:51:56 +00:00
Chris Lattner	ed69c6e4b9	reduce indentation, no functionality change. llvm-svn: 122208	2010-12-20 00:50:16 +00:00
Nick Lewycky	0de20af7ba	Add missing standard headers. Patch by Joerg Sonnenberger! llvm-svn: 122193	2010-12-19 20:43:38 +00:00
Chris Lattner	440b2804ff	teach MaskedValueIsZero how to analyze ADDE. This is enough to teach it that ADDE(0,0) is known 0 except the low bit, for example. llvm-svn: 122191	2010-12-19 20:38:28 +00:00
Chris Lattner	77a8a71414	fix PR8642: if a critical edge has a PHI value that can trap, isel is required to split the edge. PHI values get evaluated on the edge, not in their predecessor block. llvm-svn: 122170	2010-12-19 04:58:57 +00:00
Bob Wilson	5408144add	Fix a DAGCombiner crash when folding binary vector operations with constant BUILD_VECTOR operands where the element type is not legal. I had previously changed this code to insert TRUNCATE operations, but that was just wrong. llvm-svn: 122102	2010-12-17 23:06:49 +00:00
Dale Johannesen	cd538afa52	Add a transform to DAG Combiner. This improves the code for the case where 32-bit divide by constant is turned into 64-bit multiply by constant. 8771012. llvm-svn: 122090	2010-12-17 21:45:49 +00:00
Bob Wilson	bfc6904fc6	Fix crash compiling a QQQQ REG_SEQUENCE for a Neon vld3_lane operation. Radar 8776599 llvm-svn: 122018	2010-12-17 01:21:12 +00:00
Chris Lattner	15090e1eb0	take care of some todos, transforming [us]mul_lohi into a wider mul if the wider mul is legal. llvm-svn: 121848	2010-12-15 06:04:19 +00:00
Chris Lattner	b86dceea1b	when transforming a MULHS into a wider MUL, there is no need to SRA the result, the top bits are truncated off anyway, just use SRL. llvm-svn: 121846	2010-12-15 05:51:39 +00:00
Chris Lattner	10bd29f1d4	Add a couple dag combines to transform mulhi/mullo into a wider multiply when the wider type is legal. This allows us to compile: define zeroext i16 @test1(i16 zeroext %x) nounwind { entry: %div = udiv i16 %x, 33 ret i16 %div } into: test1: # @test1 movzwl 4(%esp), %eax imull $63551, %eax, %eax # imm = 0xF83F shrl $21, %eax ret instead of: test1: # @test1 movw $-1985, %ax # imm = 0xFFFFFFFFFFFFF83F mulw 4(%esp) andl $65504, %edx # imm = 0xFFE0 movl %edx, %eax shrl $5, %eax ret Implementing rdar://8760399 and example #4 from: http://blog.regehr.org/archives/320 We should implement the same thing for [su]mul_hilo, but I don't have immediate plans to do this. llvm-svn: 121696	2010-12-13 08:39:01 +00:00
Chris Lattner	cb404360ca	reduce indentation by using continue, no functionality change. llvm-svn: 121662	2010-12-13 01:11:17 +00:00
Duncan Sands	d2e70b5442	Catch attempts to remove a deleted node from the CSE maps. Better to catch this here rather than later after accessing uninitialized memory etc. Fires when compiling the testcase in PR8237. llvm-svn: 121635	2010-12-12 13:22:50 +00:00
Stuart Hastings	d2ea97cbef	Initial support for nested CALLSEQ_START/CALLSEQ_END constructs in LegalizeDAG. Necessary for byval support on ARM. Radar 7662569. llvm-svn: 121412	2010-12-09 21:25:20 +00:00
Eric Christopher	d9e8eac235	80-col fixups. llvm-svn: 121356	2010-12-09 04:48:06 +00:00
Eric Christopher	1b93e7b4ed	Reword comment slightly. llvm-svn: 121293	2010-12-08 22:21:42 +00:00
Jay Foad	583abbc4df	PR5207: Change APInt methods trunc(), sext(), zext(), sextOrTrunc() and zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method trunc(), to be const and to return a new value instead of modifying the object in place. llvm-svn: 121120	2010-12-07 08:25:19 +00:00
Devang Patel	c24048a718	If dbg_declare() or dbg_value() is not lowered by isel then emit DEBUG message instead of creating DBG_VALUE for undefined value in reg0. llvm-svn: 121059	2010-12-06 22:39:26 +00:00
Benjamin Kramer	31920b0a2a	Remove unneeded zero arrays. llvm-svn: 120910	2010-12-04 15:28:22 +00:00
Jay Foad	25a5e4ca1f	PR5207: Rename overloaded APInt methods set(), clear(), flip() to setAllBits(), setBit(unsigned), etc. llvm-svn: 120564	2010-12-01 08:53:58 +00:00
Evan Cheng	d4b0873c06	Enable sibling call optimization of libcalls which are expanded during legalization time. Since at legalization time there is no mapping from SDNode back to the corresponding LLVM instruction and the return SDNode is target specific, this requires a target hook to check for eligibility. Only x86 and ARM support this form of sibcall optimization right now. rdar://8707777 llvm-svn: 120501	2010-11-30 23:55:39 +00:00
Chris Lattner	ea41dfe385	add TLI support indicating that jumps are more expensive than logical operations and use this to disable a specific optimization. Patch by Micah Villmow! llvm-svn: 120435	2010-11-30 18:12:52 +00:00
Jay Foad	15084f085d	PR5207: Make APInt::set(), APInt::clear() and APInt::flip() return void. llvm-svn: 120413	2010-11-30 09:02:01 +00:00
Michael J. Spencer	447762da85	Merge System into Support. llvm-svn: 120298	2010-11-29 18:16:10 +00:00
Bob Wilson	f9b96c474f	Fix a comment typo. llvm-svn: 120235	2010-11-28 06:51:19 +00:00
Wesley Peck	527da1b6e2	Renaming ISD::BIT_CONVERT to ISD::BITCAST to better reflect the LLVM IR concept. llvm-svn: 119990	2010-11-23 03:31:01 +00:00
Benjamin Kramer	24656c9583	Implement the "if (X == 6 \|\| X == 4)" -> "if ((X\|2) == 6)" optimization. This currently only catches the most basic case, a two-case switch, but can be extended later. llvm-svn: 119964	2010-11-22 09:45:38 +00:00
Benjamin Kramer	f6fb58a216	Silence Release build warnings about unused functions. llvm-svn: 119903	2010-11-20 15:53:24 +00:00
Duncan Sands	7c601ded34	On X86, MEMBARRIER, MFENCE, SFENCE, LFENCE are not target memory intrinsics, so don't claim they are. They are allocated using DAG.getNode, so attempts to access MemSDNode fields results in reading off the end of the allocated memory. This fixes crashes with "llc -debug" due to debug code trying to print MemSDNode fields for these barrier nodes (since the crashes are not deterministic, use valgrind to see this). Add some nasty checking to try to catch this kind of thing in the future. llvm-svn: 119901	2010-11-20 11:25:00 +00:00
Andrew Trick	cf7fefb25c	Removing the useless test that I added recently. It was meant as an example, but not complicated enough to merit another test. llvm-svn: 119898	2010-11-20 07:26:51 +00:00
Bill Wendling	54df187f25	Check for _setjmp too, because it's also used. llvm-svn: 119875	2010-11-20 00:03:09 +00:00
Mon P Wang	88ff56caa3	Make isScalarToVector to return false if the node is a scalar. This will prevent DAGCombine from making an illegal transformation of bitcast of a scalar to a vector into a scalar_to_vector. llvm-svn: 119819	2010-11-19 19:08:12 +00:00
Duncan Sands	c92331b984	Fix thinko: we must turn select(anyext, sext) into sext(select) not anyext(select). Spotted by Frits van Bommel. llvm-svn: 119739	2010-11-18 21:16:28 +00:00
Duncan Sands	12f3b3b44f	The DAGCombiner was threading select over pairs of extending loads even if the extension types were not the same. The result was that if you fed a select with sext and zext loads, as in the testcase, then it would get turned into a zext (or sext) of the select, which is wrong in the cases when it should have been an sext (resp. zext). Reported and diagnosed by Sebastien Deldon. llvm-svn: 119728	2010-11-18 20:05:18 +00:00
Dale Johannesen	ed0d840838	Do not throw away alignment when generating the DAG for memset; we may need it to decide between MOVAPS and MOVUPS later. Adjust a test that was looking for wrong code. PR 3866 / 8675131. llvm-svn: 119605	2010-11-18 01:35:23 +00:00
John Thompson	ddc7ce548c	Bug 8621 fix - pointer cast stripped from inline asm constraint argument. llvm-svn: 119590	2010-11-17 23:58:47 +00:00
Dan Gohman	8b67c720f2	Split pseudo-instruction expansion into a separate pass, to make it easier to debug, and to avoid complications when the CFG changes in the middle of the instruction selection process. llvm-svn: 119382	2010-11-16 21:02:37 +00:00
Andrew Trick	6cbf6c1db5	typo (4th checkin for one fix) llvm-svn: 118913	2010-11-12 18:36:03 +00:00
Andrew Trick	116efac780	Fixes PR8287: SD scheduling time. The fix is a failsafe that prevents catastrophic compilation time in the event of unreasonable LLVM IR. Code quality is a separate issue--someone upstream needs to do a better job of reducing to llvm.memcpy. If the situation can be reproduced with any supported frontend, then it will be a separate bug. llvm-svn: 118904	2010-11-12 17:50:46 +00:00
Chris Lattner	64634c36dd	tidy up. llvm-svn: 118896	2010-11-12 17:24:29 +00:00
Dan Gohman	6cf9bb45ad	Remove the memmove->memcpy optimization from CodeGen. MemCpyOpt does this. llvm-svn: 118789	2010-11-11 16:24:49 +00:00
Dan Gohman	5db8921422	Fix DAGCombiner to avoid folding a sext-in-reg or similar through a shl in order to fold it into a load. llvm-svn: 118471	2010-11-09 01:54:35 +00:00
Dale Johannesen	f11ea9ce61	Fix an inline asm pasto from 117667; was preventing {i64, i64} from matching i128. llvm-svn: 118465	2010-11-09 01:15:07 +00:00
Duncan Sands	6c25ca4f2b	When passing a parameter using the 'byval' mechanism, inline code needs to be used to perform the copy, which may be of lots of memory []. It would be good if the fall-back code generated something reasonable, i.e. did the copy in a loop, rather than vast numbers of loads and stores. Add a note about this. Currently target specific code seems to always kick in so this is more of a theoretical issue rather than a practical one now that X86 has been fixed. [] It's amazing how often people pass mega-byte long arrays by copy... llvm-svn: 118275	2010-11-05 15:20:29 +00:00
Eric Christopher	c6418b105a	Just return undef for invalid masks or elts, and since we're doing that, just do it earlier too. llvm-svn: 118195	2010-11-03 20:44:42 +00:00
Duncan Sands	1462777017	Simplify uses of MVT and EVT. An MVT can be compared directly with a SimpleValueType, while an EVT supports equality and inequality comparisons with SimpleValueType. llvm-svn: 118169	2010-11-03 12:17:33 +00:00
Duncan Sands	f5dda01f33	Inside the calling convention logic LocVT is always a simple value type, so there is no point in passing it around using an EVT. Use the simpler MVT everywhere. Rather than trying to propagate this information maximally in all the code that using the calling convention stuff, I chose to do a mainly low impact change instead. llvm-svn: 118167	2010-11-03 11:35:31 +00:00
Eric Christopher	fcc9e6848a	If we have an undef mask our Elt will be -1 for our access, handle this by using an undef as a pointer. Fixes rdar://8625016 llvm-svn: 118164	2010-11-03 09:36:40 +00:00
Dan Gohman	68fb004616	Fix DAGCombiner to avoid going into an infinite loop when it encounters (and:i64 (shl:i64 (load:i64), 1), 0xffffffff). This fixes rdar://8606584. llvm-svn: 118143	2010-11-03 01:47:46 +00:00
Evan Cheng	debf9c502a	Two sets of changes. Sorry they are intermingled. 1. Fix pre-ra scheduler so it doesn't try to push instructions above calls to "optimize for latency". Call instructions don't have the right latency and this is more likely to use introduce spills. 2. Fix if-converter cost function. For ARM, it should use instruction latencies, not # of micro-ops since multi-latency instructions is completely executed even when the predicate is false. Also, some instruction will be "slower" when they are predicated due to the register def becoming implicit input. rdar://8598427 llvm-svn: 118135	2010-11-03 00:45:17 +00:00
Devang Patel	bc741405a7	If value map does not have register for an argument then try to find frame index before giving up. llvm-svn: 118022	2010-11-02 17:19:03 +00:00
Devang Patel	94f2a2578c	Use frameindex, if available, as a last resort to emit debug info for a parameter. llvm-svn: 118020	2010-11-02 17:01:30 +00:00
Bob Wilson	08882be86c	Remove DAG combiner patch to fold vector splats. Instcombiner does it now. llvm-svn: 117720	2010-10-29 22:03:02 +00:00
Evan Cheng	6c1414f9c2	Avoiding overly aggressive latency scheduling. If the two nodes share an operand and one of them has a single use that is a live out copy, favor the one that is live out. Otherwise it will be difficult to eliminate the copy if the instruction is a loop induction variable update. e.g. BB: sub r1, r3, #1 str r0, [r2, r3] mov r3, r1 cmp bne BB => BB: str r0, [r2, r3] sub r3, r3, #1 cmp bne BB This fixed the recent 256.bzip2 regression. llvm-svn: 117675	2010-10-29 18:09:28 +00:00
John Thompson	e8360b7182	Inline asm multiple alternative constraints development phase 2 - improved basic logic, added initial platform support. llvm-svn: 117667	2010-10-29 17:29:13 +00:00
Bob Wilson	f63da12be9	Teach the DAG combiner to fold a splat of a splat. Radar 8597790. Also do some minor refactoring to reduce indentation. llvm-svn: 117558	2010-10-28 17:06:14 +00:00
Evan Cheng	ff310737e5	Re-commit 117518 and 117519 now that ARM MC test failures are out of the way. llvm-svn: 117531	2010-10-28 06:47:08 +00:00
Evan Cheng	e2c211c1b9	Revert 117518 and 117519 for now. They changed scheduling and cause MC tests to fail. Ugh. llvm-svn: 117520	2010-10-28 02:00:25 +00:00
Evan Cheng	523fa3a2e8	Fix a major bug in operand latency computation. The use index must be adjusted by the number of defs first for it to match the instruction itinerary. llvm-svn: 117518	2010-10-28 01:46:29 +00:00
Dale Johannesen	e660f4d072	Use a MemIntrinsicSDNode for ISD::PREFETCH, which touches memory, so a MachineMemOperand is useful (not propagated into the MachineInstr yet). No functional change except for dump output. llvm-svn: 117413	2010-10-26 23:11:10 +00:00
Devang Patel	05561e8b7b	Assign source ordering to nodes created for StoreInst. llvm-svn: 117404	2010-10-26 22:14:52 +00:00
Nick Lewycky	90b2ac2696	For statistics that are only used in functions declared in !NDEBUG, wrap the declarations in !NDEBUG to avoid -Wunused-variable warnings. Patch by Matt Beaumont-Gay! llvm-svn: 117345	2010-10-26 00:51:57 +00:00
Devang Patel	43c3f4b63c	Simplify. Do not count use of sdisel for single call instruction. llvm-svn: 117316	2010-10-25 21:31:46 +00:00
Devang Patel	3bc6d198fb	Add counters to count basic blocks and machine basic blocks with out of order line number info. Add counters to count how many basic blocks are entirely selected by fastisel. llvm-svn: 117310	2010-10-25 20:55:43 +00:00
Chandler Carruth	82058c05f8	Move the remaining attribute macros to systematic names based on the attribute name and prefixed with 'LLVM_'. llvm-svn: 117203	2010-10-23 08:40:19 +00:00
Michael J. Spencer	0e36e0340a	X86: Base _fltused on the FunctionType of the called value instead of the potentially null "CalledFunction". Thanks Duncan! This is needed for indirect calls. llvm-svn: 117061	2010-10-21 20:49:23 +00:00
Michael J. Spencer	83ce5f181f	CodeGen-Windows: Only emit _fltused if a VarArg function is called with floating point args. This should be the minimum set of functions that could possibly need it. llvm-svn: 116978	2010-10-21 00:08:21 +00:00
Dale Johannesen	320a553319	Remove Synthesizable from the Type system; as MMX vector types are no longer Legal on X86, we don't need it. No functional change. 8499854. llvm-svn: 116947	2010-10-20 21:32:10 +00:00
Dan Gohman	a94cc6dfe8	Make CodeGen TBAA-aware. llvm-svn: 116890	2010-10-20 00:31:05 +00:00
Jim Grosbach	bbdc5d2ef9	Add a pre-dispatch SjLj EH hook on the unwind edge for targets to do any setup they require. Use this for ARM/Darwin to rematerialize the base pointer from the frame pointer when required. rdar://8564268 llvm-svn: 116879	2010-10-19 23:27:08 +00:00
Owen Anderson	6c18d1aac0	Get rid of static constructors for pass registration. Instead, every pass exposes an initializeMyPassFunction(), which must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize the pass's dependencies. Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h before parsing commandline arguments. I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass registration/creation, please send the testcase to me directly. llvm-svn: 116820	2010-10-19 17:21:58 +00:00
Michael J. Spencer	5e683250ee	X86-Windows: Emit an undefined global __fltused symbol when targeting Windows if any floating point arguments are passed to an external function. llvm-svn: 116665	2010-10-16 08:25:41 +00:00
Michael J. Spencer	d3ea25e66e	Whitespace! llvm-svn: 116664	2010-10-16 08:25:21 +00:00
Chris Lattner	eb313a46fc	fix the default va_arg expansion (in the realignment case) to not implicitly truncate the stack pointer to 32-bits on a 64-bit machine. llvm-svn: 116169	2010-10-10 18:36:26 +00:00
Dan Gohman	aadc5596f1	ComputeLinearIndex doesn't need its TLI argument. llvm-svn: 115792	2010-10-06 16:18:29 +00:00
Evan Cheng	49d4c0bd18	- Add TargetInstrInfo::getOperandLatency() to compute operand latencies. This allow target to correctly compute latency for cases where static scheduling itineraries isn't sufficient. e.g. variable_ops instructions such as ARM::ldm. This also allows target without scheduling itineraries to compute operand latencies. e.g. X86 can return (approximated) latencies for high latency instructions such as division. - Compute operand latencies for those defined by load multiple instructions, e.g. ldm and those used by store multiple instructions, e.g. stm. llvm-svn: 115755	2010-10-06 06:27:31 +00:00
Owen Anderson	d8d1dcc09a	Use a more efficient lowering of uint64_t --> float that can take advantage of hardware signed integer conversion without having to do a double cast (uint64_t --> double --> float). This is based on the algorithm from compiler_rt's __floatundisf for X86-64. llvm-svn: 115634	2010-10-05 17:24:05 +00:00
Evan Cheng	c8d6cfd730	This DAG combine BRCOND transformation can look pass truncate of the operand: // %a = ... // %b = and i32 %a, 2 // %c = srl i32 %b, 1 // brcond i32 %c ... // // into // // %a = ... // %b = and i32 %a, 2 // %c = setcc eq %b, 0 // brcond %c ... Make sure it restores local variable N1, which corresponds to the condition operand if it fails to match. This apparently breaks TCE but since that backend isn't in the tree I don't have a test for it. llvm-svn: 115571	2010-10-04 22:41:01 +00:00
Devang Patel	d3fe5fa5d1	Fix code gen crash reported in PR 8235. We still lose debug info for the unused argument here. This is a known limitation recorded debuginfo-tests/trunk/dbg-declare2.ll function 'f6' test case. llvm-svn: 115323	2010-10-01 19:00:44 +00:00
Gabor Greif	47a3b8c30b	typo llvm-svn: 115310	2010-10-01 10:32:19 +00:00
Chris Lattner	f08bfdc29f	fix typo llvm-svn: 115300	2010-10-01 06:54:02 +00:00
Chris Lattner	a205055857	fix rdar://8494845 + PR8244 - a miscompile exposed by my patch in r101350 llvm-svn: 115294	2010-10-01 05:36:09 +00:00
Dale Johannesen	dd224d2333	Massive rewrite of MMX: The x86_mmx type is used for MMX intrinsics, parameters and return values where these use MMX registers, and is also supported in load, store, and bitcast. Only the above operations generate MMX instructions, and optimizations do not operate on or produce MMX intrinsics. MMX-sized vectors <2 x i32> etc. are lowered to XMM or split into smaller pieces. Optimizations may occur on these forms and the result casted back to x86_mmx, provided the result feeds into a previous existing x86_mmx operation. The point of all this is prevent optimizations from introducing MMX operations, which is unsafe due to the EMMS problem. llvm-svn: 115243	2010-09-30 23:57:10 +00:00
Jakob Stoklund Olesen	665aa6efcc	When isel is emitting instructions for an x86 target without CMOV, the CFG is edited during emission. If the basic block ends in a switch that gets lowered to a jump table, any phis at the default edge were getting updated wrong. The jump table data structure keeps a pointer to the header blocks that wasn't getting updated after the MBB is split. This bug was exposed on 32-bit Linux when disabling critical edge splitting in codegen prepare. The fix is to uipdate stale MBB pointers whenever a block is split during emission. llvm-svn: 115191	2010-09-30 19:44:31 +00:00
Evan Cheng	4a010fd1ea	Model Cortex-a9 load to SUB, RSB, ADD, ADC, SBC, RSC, CMN, MVN, or CMP pipeline forwarding path. llvm-svn: 115098	2010-09-29 22:42:35 +00:00
Oscar Fuentes	b4b12535e8	Removed a bunch of unnecessary target_link_libraries. llvm-svn: 114999	2010-09-28 22:39:14 +00:00
Dale Johannesen	117f7708c4	Don't try to make a vector of x86mmx; this won't work, and asserts. llvm-svn: 114843	2010-09-27 17:29:14 +00:00
John Thompson	8118ef8d3d	Fix for test/CodeGen/PowerPC/2008-10-17-AsmMatchingOperands.ll crash. llvm-svn: 114767	2010-09-24 22:24:05 +00:00
Michael J. Spencer	ded5f66813	Get rid of pop_macro warnings on MSVC. llvm-svn: 114750	2010-09-24 19:48:47 +00:00
Evan Cheng	6b8b2b7312	Revert 114634 for now since buildbot claim it broke Clang self-hosting. I doubt it but it's possible it's exposing another bug somewhere. llvm-svn: 114681	2010-09-23 18:32:19 +00:00
Oscar Fuentes	57214f533a	Fix VS 2010 build. Patch by Nathan Jeffords! llvm-svn: 114661	2010-09-23 16:59:36 +00:00
Evan Cheng	b6d175a39d	Follow up to r114630. Do not optimize away unconditional branch following a conditional one. llvm-svn: 114634	2010-09-23 07:18:35 +00:00
Evan Cheng	79687dda9a	SDISel should not optimize a unconditional branch following a conditional branch when the unconditional branch destination is the fallthrough block. The canonicalization makes it easier to allow optimizations on DAGs to invert conditional branches. The branch folding pass (and AnalyzeBranch) will clean up the unnecessary unconditional branches later. This is one of the patches leading up to disabling codegen prepare critical edge splitting. llvm-svn: 114630	2010-09-23 06:51:55 +00:00
Owen Anderson	3231d13ddd	A select between a constant and zero, when fed by a bit test, can be efficiently lowered using a series of shifts. Fixes <rdar://problem/8285015>. llvm-svn: 114599	2010-09-22 22:58:22 +00:00
John Thompson	c467aa2fa4	Fixed pr20314-2.c failure, added E, F, p constraint letters. llvm-svn: 114490	2010-09-21 22:04:54 +00:00
Chris Lattner	a9e57e0eff	Rework passing parent pointers into complexpatterns, I forgot that complex patterns are matched after the entire pattern has a structural match, therefore the NodeStack isn't in a useful state when the actual call to the matcher happens. llvm-svn: 114489	2010-09-21 22:00:25 +00:00
Devang Patel	99ff76212a	If only user of a vreg is an copy instruction to export copy of vreg out of current basic block then insert DBG_VALUE so that debug value of the variable is also transfered to new vreg. Testcase is in r114476. This fixes radar 8412415. llvm-svn: 114478	2010-09-21 20:56:33 +00:00
Chris Lattner	0bb8b19865	correct this logic. llvm-svn: 114474	2010-09-21 20:46:40 +00:00
Owen Anderson	5e65dfbb97	Reimplement r114460 in target-independent DAGCombine rather than target-dependent, by using the predicate to discover the number of sign bits. Enhance X86's target lowering to provide a useful response to this query. llvm-svn: 114473	2010-09-21 20:42:50 +00:00
Chris Lattner	dd83548fea	just like they can opt into getting the root of the pattern being matched, allow ComplexPatterns to opt into getting the parent node of the operand being matched. llvm-svn: 114472	2010-09-21 20:37:12 +00:00
Chris Lattner	a4f199720d	finish pushing MachinePointerInfo through selectiondags. At this point, I think I've audited all uses, so it should be dependable for address spaces, and the pointer+offset info should also be accurate when there. llvm-svn: 114464	2010-09-21 18:58:22 +00:00
Chris Lattner	676c61db0e	update a bunch of code to use the MachinePointerInfo version of getStore. llvm-svn: 114461	2010-09-21 18:41:36 +00:00
Bob Wilson	5549d496dd	Define the TargetLowering::getTgtMemIntrinsic hook for ARM so that NEON load and store intrinsics are represented with MemIntrinsicSDNodes. llvm-svn: 114454	2010-09-21 17:56:22 +00:00
Chris Lattner	6963c1f789	eliminate an old SelectionDAG::getTruncStore method, propagating MachinePointerInfo around more. llvm-svn: 114452	2010-09-21 17:42:31 +00:00
Chris Lattner	5e39ffd02f	eliminate last SelectionDAG::getLoad old entrypoint, on to stores. llvm-svn: 114450	2010-09-21 17:28:52 +00:00
Chris Lattner	ea952f05a5	fix the code that infers SV info to be correct when dealing with an indexed load/store that has an offset in the index. llvm-svn: 114449	2010-09-21 17:24:05 +00:00
Chris Lattner	3d178ed4d4	propagate MachinePointerInfo through various uses of the old SelectionDAG::getExtLoad overload, and eliminate it. llvm-svn: 114446	2010-09-21 17:04:51 +00:00
Chris Lattner	1ffcf527c7	continue MachinePointerInfo'izing, eliminating use of one of the old getLoad overloads. llvm-svn: 114443	2010-09-21 16:36:31 +00:00
Chris Lattner	f72c3c08a4	convert dagcombine off the old form of getLoad. This fixes several bugs with SVOffset computation. llvm-svn: 114442	2010-09-21 16:08:50 +00:00
Chris Lattner	e32675253f	simplify DAGCombiner::SimplifySelectOps step #2/2. llvm-svn: 114437	2010-09-21 15:58:55 +00:00
Chris Lattner	254c445e63	substantially reduce indentation and simplify DAGCombiner::SimplifySelectOps. no functionality change (step #1) llvm-svn: 114436	2010-09-21 15:46:59 +00:00
Chris Lattner	a35499e2af	a few more trivial updates. This fixes PerformInsertVectorEltInMemory to not pass a completely incorrect SrcValue, which would result in a miscompile with combiner-aa. llvm-svn: 114411	2010-09-21 07:32:19 +00:00
Chris Lattner	2510de2bea	reimplement memcpy/memmove/memset lowering to use MachinePointerInfo instead of srcvalue/offset pairs. This corrects SV info for mem operations whose size is > 32-bits. llvm-svn: 114401	2010-09-21 05:40:29 +00:00
Chris Lattner	bc419ba98f	add overloads for SelectionDAG::getLoad, getStore, getTruncStore that take a MachinePointerInfo. Among other virtues, this doesn't silently truncate the svoffset to 32-bits. llvm-svn: 114399	2010-09-21 05:10:45 +00:00
Chris Lattner	d2d58ada70	simplify interface to SelectionDAG::getMemIntrinsicNode, making it take a MachinePointerInfo llvm-svn: 114397	2010-09-21 04:57:15 +00:00
Chris Lattner	15d84c460a	chagne interface to SelectionDAG::getAtomic to take a MachinePointerInfo, eliminating some weird "infer a frame address" logic which was dead. llvm-svn: 114396	2010-09-21 04:53:42 +00:00
Chris Lattner	3b5dc0cdad	don't implicitly drop the offset of a machinememoperand when legalizing atomics. llvm-svn: 114395	2010-09-21 04:51:11 +00:00
Chris Lattner	b5f4920979	force clients of MachineFunction::getMachineMemOperand to provide a MachinePointerInfo, propagating the type out a level of API. Remove the old MachineFunction::getMachineMemOperand impl. llvm-svn: 114393	2010-09-21 04:46:39 +00:00
Owen Anderson	272ff94916	When TCO is turned on, it is possible to end up with aliasing FrameIndex's. Therefore, CombinerAA cannot assume that different FrameIndex's never alias, but can instead use MachineFrameInfo to get the actual offsets of these slots and check for actual aliasing. This fixes CodeGen/X86/2010-02-19-TailCallRetAddrBug.ll and CodeGen/X86/tailcallstack64.ll when CombinerAA is enabled, modulo a different register allocation sequence. llvm-svn: 114348	2010-09-20 20:39:59 +00:00
Owen Anderson	7b8d2ae912	Revert r114312 while I sort out some issues. llvm-svn: 114313	2010-09-19 21:01:26 +00:00
Owen Anderson	ff82f8a35b	Tentatively enabled DAGCombiner Alias Analysis by default. As far as I know, r114268 fixed the last of the blockers to enabling it. I will be monitoring for failures. llvm-svn: 114312	2010-09-19 19:51:55 +00:00
Owen Anderson	b92b13d8a0	Invert the logic of reachesChainWithoutSideEffects(). What we want to check is that there is NO path to the destination containing side effects, not that SOME path contains no side effects. In practice, this only manifests with CombinerAA enabled, because otherwise the chain has little to no branching, so "any" is effectively equivalent to "all". llvm-svn: 114268	2010-09-18 04:45:14 +00:00
Devang Patel	46b96c4ba0	Check bb to ensure that alloca is in separate basic block. This fixes funcargs.exp regression reported by gdb testsuite. llvm-svn: 113992	2010-09-15 18:13:55 +00:00
Devang Patel	da25de8096	If dbg.declare from non-entry block is using alloca from entry block then use offset available in StaticAllocaMap to emit DBG_VALUE. Right now, this has no material impact because varible info also collected using offset table maintained in machine module info. llvm-svn: 113967	2010-09-15 14:48:53 +00:00
Devang Patel	e4682fa8e2	Use frame index, if available for byval argument while lowering dbg_declare. Otherwise let getRegForValue() find register for this argument. llvm-svn: 113843	2010-09-14 20:29:31 +00:00
Michael J. Spencer	93c9b2ea93	Revert "CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally." This reverts commit r113632 Conflicts: cmake/modules/AddLLVM.cmake llvm-svn: 113819	2010-09-13 23:59:48 +00:00
Eric Christopher	79127ab3f5	Silence more warnings. Two more unused variables. llvm-svn: 113771	2010-09-13 18:30:57 +00:00
John Thompson	1094c80281	Added skeleton for inline asm multiple alternative constraint support. llvm-svn: 113766	2010-09-13 18:15:37 +00:00
Michael J. Spencer	dc38d36ccb	CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally. llvm-svn: 113632	2010-09-10 21:14:25 +00:00
Devang Patel	6095d818e5	Add DEBUG message. llvm-svn: 113614	2010-09-10 20:32:09 +00:00
Evan Cheng	bf4070756f	Teach if-converter to be more careful with predicating instructions that would take multiple cycles to decode. For the current if-converter clients (actually only ARM), the instructions that are predicated on false are not nops. They would still take machine cycles to decode. Micro-coded instructions such as LDM / STM can potentially take multiple cycles to decode. If-converter should take treat them as non-micro-coded simple instructions. llvm-svn: 113570	2010-09-10 01:29:16 +00:00
Chris Lattner	eeba0c73e5	implement rdar://6653118 - fastisel should fold loads where possible. Since mem2reg isn't run at -O0, we get a ton of reloads from the stack, for example, before, this code: int foo(int x, int y, int z) { return x+y+z; } used to compile into: _foo: ## @foo subq $12, %rsp movl %edi, 8(%rsp) movl %esi, 4(%rsp) movl %edx, (%rsp) movl 8(%rsp), %edx movl 4(%rsp), %esi addl %edx, %esi movl (%rsp), %edx addl %esi, %edx movl %edx, %eax addq $12, %rsp ret Now we produce: _foo: ## @foo subq $12, %rsp movl %edi, 8(%rsp) movl %esi, 4(%rsp) movl %edx, (%rsp) movl 8(%rsp), %edx addl 4(%rsp), %edx ## Folded load addl (%rsp), %edx ## Folded load movl %edx, %eax addq $12, %rsp ret Fewer instructions and less register use = faster compiles. llvm-svn: 113102	2010-09-05 02:18:34 +00:00
Bob Wilson	3626a8c136	Add a missing check when legalizing a vector extending load. This doesn't solve the root problem, but it corrects the bug in the code I added to support legalizing in the case where the non-extended type is also legal. llvm-svn: 112997	2010-09-03 19:20:37 +00:00
Devang Patel	3bffd52d78	Detect undef value early and save unnecessary NodeMap query. llvm-svn: 112864	2010-09-02 21:29:42 +00:00
Dan Gohman	3c9b5f394b	Don't narrow the load and store in a load+twiddle+store sequence unless there are clearly no stores between the load and the store. This fixes this miscompile reported as PR7833. This breaks the test/CodeGen/X86/narrow_op-2.ll optimization, which is safe, but awkward to prove safe. Move it to X86's README.txt. llvm-svn: 112861	2010-09-02 21:18:42 +00:00
Devang Patel	98d3edfe2a	Tidy up. llvm-svn: 112858	2010-09-02 21:02:27 +00:00
Devang Patel	86ec8b3a3f	Reapply r112623. Included additional check for unused byval argument. llvm-svn: 112659	2010-08-31 22:22:42 +00:00
Devang Patel	529f248eb4	Revert r112623. It is causing self host build failures. llvm-svn: 112631	2010-08-31 19:41:03 +00:00
Devang Patel	8559932d36	Remember byval argument's frame index during argument lowering and use this info to emit debug info. Fixes Radar 8367011. llvm-svn: 112623	2010-08-31 18:50:09 +00:00
Devang Patel	417d72823a	Offset is not always unsigned number. llvm-svn: 112584	2010-08-31 06:12:08 +00:00
Bruno Cardoso Lopes	d9ef4a1a24	zap unused method. x86 is the only user and already has a more powerfull version llvm-svn: 112571	2010-08-31 02:36:20 +00:00
Bill Wendling	f824489a1d	Revert r112461. It was failing on PPC... llvm-svn: 112463	2010-08-30 04:36:50 +00:00
Bill Wendling	938f299fa9	When adding a register, we should mark it as "def" if it can optionally define said (physical) register. llvm-svn: 112461	2010-08-30 01:36:05 +00:00
Chris Lattner	13ee795c42	remove unions from LLVM IR. They are severely buggy and not being actively maintained, improved, or extended. llvm-svn: 112356	2010-08-28 04:09:24 +00:00
Dan Gohman	e06905d1f0	Completely disable tail calls when fast-isel is enabled, as fast-isel doesn't currently support dealing with this. llvm-svn: 112341	2010-08-28 00:51:03 +00:00
Dan Gohman	1e06dbf881	Trim a #include. llvm-svn: 112340	2010-08-28 00:49:13 +00:00
Devang Patel	f2855b147f	Simplify. llvm-svn: 112305	2010-08-27 22:25:51 +00:00
Devang Patel	b12ff5999e	Revert r112213. It is not needed. llvm-svn: 112242	2010-08-26 23:35:15 +00:00
Devang Patel	ea134f56b1	If node is not available then use FuncInfo.ValueMap to emit debug info for byval parameter. llvm-svn: 112238	2010-08-26 22:53:27 +00:00
Devang Patel	42b4ac7ed3	Speculatively revert r112207. llvm-svn: 112216	2010-08-26 20:33:42 +00:00
Devang Patel	977057f481	80 col. llvm-svn: 112215	2010-08-26 20:32:32 +00:00
Devang Patel	384fa91deb	Update DanglingDebugInfo so that it can be used to track llvm.dbg.declare also. llvm-svn: 112213	2010-08-26 20:06:46 +00:00
Devang Patel	ab596a637c	Donot forget to resolve dangling debug info in a case where virtual register, used for a value, is initialized after a dbg intrinsic is seen. llvm-svn: 112207	2010-08-26 18:36:14 +00:00
Chris Lattner	af23e9a798	Add a hackaround for PR7993 which is causing failures on x86 builders that lack sse2. llvm-svn: 112175	2010-08-26 06:57:07 +00:00
Chris Lattner	eb2cc0ce0e	implement SplitVecOp_CONCAT_VECTORS, fixing the included testcase with SSE1. llvm-svn: 112171	2010-08-26 05:51:22 +00:00
Chris Lattner	f6418b804e	zap dead code. llvm-svn: 112155	2010-08-26 02:57:35 +00:00
Chris Lattner	8df99b523e	remove some llvmcontext arguments that are now dead post-refactoring. llvm-svn: 112104	2010-08-25 23:00:45 +00:00
Chris Lattner	75ff053497	Change handling of illegal vector types to widen when possible instead of expanding: e.g. <2 x float> -> <4 x float> instead of -> 2 floats. This affects two places in the code: handling cross block values and handling function return and arguments. Since vectors are already widened by legalizetypes, this gives us much better code and unblocks x86-64 abi and SPU abi work. For example, this (which is a silly example of a cross-block value): define <4 x float> @test2(<4 x float> %A) nounwind { %B = shufflevector <4 x float> %A, <4 x float> undef, <2 x i32> <i32 0, i32 1> %C = fadd <2 x float> %B, %B br label %BB BB: %D = fadd <2 x float> %C, %C %E = shufflevector <2 x float> %D, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef> ret <4 x float> %E } Now compiles into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 addps %xmm0, %xmm0 ret previously it compiled into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 pshufd $1, %xmm0, %xmm1 ## kill: XMM0<def> XMM0<kill> XMM0<def> insertps $0, %xmm0, %xmm0 insertps $16, %xmm1, %xmm0 addps %xmm0, %xmm0 ret This implements rdar://8230384 llvm-svn: 112101	2010-08-25 22:49:25 +00:00
Devang Patel	32a72ab072	Fix comment. llvm-svn: 112086	2010-08-25 20:41:24 +00:00
Devang Patel	3f53d6e56a	Remove dead argument. llvm-svn: 112085	2010-08-25 20:39:26 +00:00
Chris Lattner	05bcb488b5	split the vector case of getCopyFromParts out to its own function, no functionality change. llvm-svn: 111994	2010-08-24 23:20:40 +00:00
Chris Lattner	96a77ebd7c	split the vector case out of getCopyToParts into its own function. No functionality change. llvm-svn: 111990	2010-08-24 23:10:06 +00:00
Chris Lattner	5b8967f8a2	tidy up, reduce indentation llvm-svn: 111982	2010-08-24 22:43:11 +00:00
Chandler Carruth	191c4f73b2	Fix some GCC warnings by providing a virtual destructor in the base of a class hierarchy with virtual methods and using llvm_unreachable to properly indicate unreachable states which would otherwise leave variables uninitialized. llvm-svn: 111803	2010-08-23 08:25:07 +00:00
Bob Wilson	c56fef4eac	If the target says that an extending load is not legal, regardless of whether it involves specific floating-point types, legalize should expand an extending load to a non-extending load followed by a separate extend operation. For example, we currently expand SEXTLOAD to EXTLOAD+SIGN_EXTEND_INREG (and assert that EXTLOAD should always be supported). Now we can expand that to LOAD+SIGN_EXTEND. This is needed to allow vector SIGN_EXTEND and ZERO_EXTEND to be used for NEON. llvm-svn: 111586	2010-08-19 23:52:39 +00:00
Dale Johannesen	16f96445c3	Make fast scheduler handle asm clobbers correctly. PR 7882. Follows suggestion by Amaury Pouly, thanks. llvm-svn: 111306	2010-08-17 22:17:24 +00:00
Eric Christopher	541f8012d9	Fix typo. llvm-svn: 111223	2010-08-17 01:30:33 +00:00
Evan Cheng	23ef829096	Add missing null check reported by Amaury Pouly. llvm-svn: 110649	2010-08-10 02:39:45 +00:00
Owen Anderson	a7aed18624	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Owen Anderson	bda59bd247	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Owen Anderson	755aceb5d0	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Dan Gohman	5cae103392	Eliminate unnecessary empty string literals. llvm-svn: 110183	2010-08-04 01:39:08 +00:00
Oscar Fuentes	40b31ad3ee	Prefix `next' iterator operation with `llvm::'. Fixes potential ambiguity problems on VS 2010. Patch by nobled! llvm-svn: 110029	2010-08-02 06:00:15 +00:00
Eli Friedman	460ad41d6d	PR7586: Make sure we don't claim that unknown bits are actually known in the ISD::AND case of TargetLowering::SimplifyDemandedBits. llvm-svn: 110019	2010-08-02 04:42:25 +00:00
Eli Friedman	ffe64c06ef	Fix for bug reported by Evzen Muller on llvm-commits: make sure to correctly check the range of the constant when optimizing a comparison between a constant and a sign_extend_inreg node. llvm-svn: 109854	2010-07-30 06:44:31 +00:00
Nate Begeman	317b969ac5	Fix a crash in the dag combiner caused by ConstantFoldBIT_CONVERTofBUILD_VECTOR calling itself recursively and returning a SCALAR_TO_VECTOR node, but assuming the input was always a BUILD_VECTOR. llvm-svn: 109519	2010-07-27 18:02:18 +00:00
Bill Wendling	0ff1ef650b	It's better to have the arrays, which would trigger the creation of stack protectors, to be near the stack protectors on the stack. Accomplish this by tagging the stack object with a predicate that indicates that it would trigger this. In the prolog-epilog inserter, assign these objects to the stack after the stack protector but before the other objects. llvm-svn: 109481	2010-07-27 01:55:19 +00:00
Evan Cheng	e6d6c5dd11	The "excess register pressure" returned by HighRegPressure() is not accurate enough to factor into scheduling priority. Eliminate it and add early exits to speed up scheduling. llvm-svn: 109449	2010-07-26 21:49:07 +00:00
Dan Gohman	2810bacafb	Handle Values with no value in getCopyFromRegs. llvm-svn: 109415	2010-07-26 18:15:41 +00:00
Duncan Sands	136a6f0dbb	Pacify gcc-4.5 which wrongly thinks that RExcess (passed as the Excess parameter) may be used uninitialized in the callers of HighRegPressure. llvm-svn: 109393	2010-07-26 07:54:17 +00:00
Evan Cheng	8ae3ecad2b	Add comments. llvm-svn: 109383	2010-07-25 18:59:43 +00:00
Bob Wilson	280ce9984e	Fix crashes when scheduling a CopyToReg node -- getMachineOpcode asserts on those. Radar 8231572. llvm-svn: 109367	2010-07-25 05:34:27 +00:00
Evan Cheng	37b740c4bf	Add an ILP scheduler. This is a register pressure aware scheduler that's appropriate for targets without detailed instruction iterineries. The scheduler schedules for increased instruction level parallelism in low register pressure situation; it schedules to reduce register pressure when the register pressure becomes high. On x86_64, this is a win for all tests in CFP2000. It also sped up 256.bzip2 by 16%. llvm-svn: 109300	2010-07-24 00:39:05 +00:00
Evan Cheng	df907f4594	- Allow target to specify when is register pressure "too high". In most cases, it's too late to start backing off aggressive latency scheduling when most of the registers are in use so the threshold should be a bit tighter. - Correctly handle live out's and extract_subreg etc. - Enable register pressure aware scheduling by default for hybrid scheduler. For ARM, this is almost always a win on # of instructions. It's runtime neutral for most of the tests. But for some kernels with high register pressure it can be a huge win. e.g. 464.h264ref reduced number of spills by 54 and sped up by 20%. llvm-svn: 109279	2010-07-23 22:39:59 +00:00
Dan Gohman	55e244698a	Use the proper type for shift counts. This fixes a bootstrap error. llvm-svn: 109265	2010-07-23 21:08:12 +00:00
Dan Gohman	0818684a70	DAGCombine (shl (anyext x, c)) to (anyext (shl x, c)) if the high bits are not demanded. This often allows the anyext to be folded away. llvm-svn: 109242	2010-07-23 18:03:30 +00:00
Dan Gohman	2e00e3b12d	Make SDNode::dump() print a newline at the end. llvm-svn: 109234	2010-07-23 16:37:47 +00:00
Eric Christopher	faf5c76114	80-col. llvm-svn: 109205	2010-07-23 01:05:59 +00:00
Gabor Greif	59f9970ba5	keep in 80 cols llvm-svn: 109122	2010-07-22 17:18:03 +00:00
Gabor Greif	dde79d8f1a	mass elimination of reliance on automatic iterator dereferencing llvm-svn: 109103	2010-07-22 13:36:47 +00:00
Evan Cheng	bf32e54bac	Re-apply r109079 with fix. llvm-svn: 109083	2010-07-22 06:24:48 +00:00
Owen Anderson	6c55cccf87	Revert r109079, which broke a lot of CodeGen tests. llvm-svn: 109082	2010-07-22 06:01:28 +00:00
Evan Cheng	bd81bff672	Initialize RegLimit only when register pressure is being tracked. llvm-svn: 109079	2010-07-22 05:18:41 +00:00
Evan Cheng	285903853f	More register pressure aware scheduling work. llvm-svn: 109064	2010-07-21 23:53:58 +00:00
Evan Cheng	a77f3d3b37	Teach bottom up pre-ra scheduler to track register pressure. Work in progress. llvm-svn: 108991	2010-07-21 06:09:07 +00:00
Dan Gohman	b5e918dc05	After a custom inserter, in a block which has constant instructions, update the current basic block in addition to the current insert position, so that they remain consistent. This fixes rdar://8204072. llvm-svn: 108765	2010-07-19 22:48:56 +00:00
Evan Cheng	10f99a3490	ARM has to provide its own TargetLowering::findRepresentativeClass because its scalar floating point registers alias its vector registers. llvm-svn: 108761	2010-07-19 22:15:08 +00:00
Evan Cheng	7a135510e3	Teach computeRegisterProperties() to compute "representative" register class for legal value types. A "representative" register class is the largest legal super-reg register class for a value type. e.g. On i386, GR32 is the rep register class for i8 / i16 / i32; on x86_64 it would be GR64. This property will be used by the register pressure tracking instruction scheduler. llvm-svn: 108735	2010-07-19 18:47:01 +00:00
Owen Anderson	9c271e2835	Remove r108639 now that it is handled by InstCombine instead. llvm-svn: 108688	2010-07-19 08:10:24 +00:00
Owen Anderson	f7f9c8a2f7	Add a DAGCombine xform to fold away redundant float->double->float conversions around sqrt instructions. I am assured by people more knowledgeable than me that there are no rounding issues in eliminating this. This fixed <rdar://problem/8197504>. llvm-svn: 108639	2010-07-18 08:47:54 +00:00
Eric Christopher	0baaa9bcc1	Propagate alloca alignment information via variable size object frame information. No functional change yet. llvm-svn: 108583	2010-07-17 00:28:22 +00:00
Dan Gohman	1e936277c3	Revert r108369, sorting llvm.dbg.declare information by source position, since it doesn't work for front-ends which don't emit column information (which includes llvm-gcc in its present configuration), and doesn't work for clang for K&R style variables where the variables are declared in a different order from the parameter list. Instead, make a separate pass through the instructions to collect the llvm.dbg.declare instructions in order. This ensures that the debug information for variables is emitted in this order. llvm-svn: 108538	2010-07-16 17:54:27 +00:00
Dan Gohman	103c4ebea5	Use the source-order scheduler instead of the "fast" scheduler at -O0, because it's more likely to keep debug line information in its original order. llvm-svn: 108496	2010-07-16 02:01:19 +00:00
Dale Johannesen	bfd4fd7bb7	The SelectionDAGBuilder's handling of debug info, on rare occasions, caused code to be generated in a different order. All cases I've seen involved float softening in the type legalizer, and this could be perhaps be fixed there, but it's better not to generate things differently in the first place. 7797940 (6/29/2010..7/15/2010). llvm-svn: 108484	2010-07-16 00:02:08 +00:00
Bill Wendling	4bda1c8e68	Revert. This isn't the correct way to go. llvm-svn: 108478	2010-07-15 23:42:21 +00:00
Bill Wendling	973dc3b1d8	Handle code gen for the unreachable instruction if it's the only instruction in the function. We'll just turn it into a "trap" instruction instead. The problem with not handling this is that it might generate a prologue without the equivalent epilogue to go with it: $ cat t.ll define void @foo() { entry: unreachable } $ llc -o - t.ll -relocation-model=pic -disable-fp-elim -unwind-tables .section __TEXT,__text,regular,pure_instructions .globl _foo .align 4, 0x90 _foo: ## @foo Leh_func_begin0: ## BB#0: ## %entry pushq %rbp Ltmp0: movq %rsp, %rbp Ltmp1: Leh_func_end0: ... The unwind tables then have bad data in them causing all sorts of problems. Fixes <rdar://problem/8096481>. llvm-svn: 108473	2010-07-15 23:32:40 +00:00
Evan Cheng	55f0c6b9fc	Split -enable-finite-only-fp-math to two options: -enable-no-nans-fp-math and -enable-no-infs-fp-math. All of the current codegen fp math optimizations only care whether the fp arithmetics arguments and results can never be NaN. llvm-svn: 108465	2010-07-15 22:07:12 +00:00
Devang Patel	df09db62e2	Fix crash reported in PR7653. llvm-svn: 108441	2010-07-15 18:45:27 +00:00
Eric Christopher	474e56a2bf	80-col. llvm-svn: 108381	2010-07-14 23:41:32 +00:00
Dan Gohman	c12a6731c5	Properly restore DebugLoc after leaving the local constant area. llvm-svn: 108364	2010-07-14 22:01:31 +00:00
Dan Gohman	042523340b	Delete fast-isel's trivial load optimization; it breaks debugging because it can look past points where a debugger might modify user variables. llvm-svn: 108336	2010-07-14 17:25:37 +00:00
Dan Gohman	1f471435f8	Don't propagate debug locations to instructions for materializing constants, since they may not be emited near the other instructions which get the same line, and this confuses debug info. llvm-svn: 108302	2010-07-14 01:07:44 +00:00
Dale Johannesen	caca5488dc	In inline asm treat indirect 'X' constraint as 'm'. This may not be right in all cases, but it's better than asserting which it was doing before. PR 7528. llvm-svn: 108268	2010-07-13 20:17:05 +00:00
Rafael Espindola	a18c5a0e5e	Fix a typo and fit in 80 columns. Found by Bob Wilson. llvm-svn: 108164	2010-07-12 18:11:17 +00:00
Duncan Sands	41b4a6b36a	Convert some tab stops into spaces. llvm-svn: 108130	2010-07-12 08:16:59 +00:00
Jakob Stoklund Olesen	51642aea77	Use COPY for fast-isel bitconvert, but don't create cross-class copies. This doesn't change the behavior of SelectBitcast for X86. llvm-svn: 108073	2010-07-11 05:16:54 +00:00
Rafael Espindola	a76eccf815	Fix va_arg for doubles. With this patch VAARG nodes always contain the correct alignment information, which simplifies ExpandRes_VAARG a bit. The patch introduces a new alignment information to TargetLoweringInfo. This is needed since the two natural candidates cannot be used: * The 's' in target data: If this is set to the minimal alignment of any argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for example. * The getTransientStackAlignment method. It is possible for an architecture to have argument less aligned than what we maintain the stack pointer. llvm-svn: 108072	2010-07-11 04:01:49 +00:00
Jakob Stoklund Olesen	7147ab9e78	Use COPY for extracting ImplicitDef'ed values from fast-isel instructions. This assumes that the registers can be copied which is probably a safe assumption. llvm-svn: 108070	2010-07-11 03:31:05 +00:00
Jakob Stoklund Olesen	3bb1267431	Use COPY in FastISel everywhere it is safe and trivial. The remaining copyRegToReg calls actually check the return value (shock!), so we cannot trivially replace them with COPY instructions. llvm-svn: 108069	2010-07-11 03:31:00 +00:00
Dan Gohman	a64a323564	Fix a bug in the code which re-inserts DBG_VALUE nodes after scheduling; if a block is split (by a custom inserter), the insert point may be in a different block than it was originally. This fixes 32-bit llvm-gcc bootstrap builds, and I haven't been able to reproduce it otherwise. llvm-svn: 108060	2010-07-10 22:42:31 +00:00
Jakob Stoklund Olesen	e50d30d586	Emit COPY instructions instead of using copyRegToReg in InstrEmitter, ScheduleDAGEmit, TwoAddressLowering, and PHIElimination. This switches the bulk of register copies to using COPY, but many less used copyRegToReg calls remain. llvm-svn: 108050	2010-07-10 19:08:25 +00:00
Dan Gohman	fbdba81550	Insert IMPLICIT_DEF instructions at the current insert position, not at the end of the block. llvm-svn: 108045	2010-07-10 13:55:45 +00:00
Dan Gohman	d7b5ce3312	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Bill Wendling	f831d86311	Clarify what mysterious check means. llvm-svn: 108005	2010-07-09 19:44:12 +00:00
Bob Wilson	6586e9b203	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Gabor Greif	52617fc462	cache result of operator* llvm-svn: 107980	2010-07-09 16:08:33 +00:00
Dan Gohman	0b5aa1cdd3	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Bob Wilson	21eed476e8	Reenable DAG combining for vector shuffles. It looks like it was temporarily disabled and then never turned back on again. Adjust some tests, one because this change avoids an unnecessary instruction, and the other to make it continue testing what it was intended to test. llvm-svn: 107941	2010-07-09 00:38:12 +00:00
Bill Wendling	a992445ff2	Extension of r107506. Make sure that we don't mark a function as having a call if the inline ASM doesn't need a stack frame. llvm-svn: 107922	2010-07-08 22:38:02 +00:00
Jakob Stoklund Olesen	00264624a9	Convert EXTRACT_SUBREG to COPY when emitting machine instrs. EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead. Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg(). The isMoveInstr hook will be removed later. llvm-svn: 107879	2010-07-08 16:40:22 +00:00
Benjamin Kramer	0ae3f08c0d	Merge the duplicated iabs optimization in DAGCombiner and let it detected a few more idioms. llvm-svn: 107868	2010-07-08 12:09:56 +00:00
Dan Gohman	e75704369d	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Dan Gohman	eb9164dc50	Don't forward-declare registers for static allocas, which we'll prefer to materialize as local constants. This fixes the clang bootstrap abort. llvm-svn: 107840	2010-07-07 23:52:58 +00:00
Dan Gohman	1adc499dda	Fix -fast-isel-abort to check the right instruction. llvm-svn: 107839	2010-07-07 23:47:25 +00:00
Evan Cheng	1c349f18f8	Move getExtLoad() and (some) getLoad() DebugLoc argument after EVT argument for consistency sake. llvm-svn: 107820	2010-07-07 22:15:37 +00:00
Dan Gohman	25d5c1b4f8	Not all custom inserters create new basic blocks. If the inserter didn't create a new block, don't reset the insert position. llvm-svn: 107813	2010-07-07 21:18:22 +00:00
Dan Gohman	e7ccc51cc1	Implement bottom-up fast-isel. This has the advantage of not requiring a separate DCE pass over MachineInstrs. llvm-svn: 107804	2010-07-07 19:20:32 +00:00
Dan Gohman	2d4d01d0de	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Dan Gohman	b792f844ad	Update the insert position after scheduling, which may change the position when emitting multiple blocks when executing a custom inserter. llvm-svn: 107797	2010-07-07 18:22:13 +00:00
Devang Patel	637ee5f149	Update comment. llvm-svn: 107796	2010-07-07 18:18:18 +00:00
Dan Gohman	ffe64b1ee5	Give FunctionLoweringInfo an MBB member, avoiding the need to pass it around everywhere, and also give it an InsertPt member, to enable isel to operate at an arbitrary position within a block, rather than just appending to a block. llvm-svn: 107791	2010-07-07 16:47:08 +00:00
Dan Gohman	87fb4e8fcd	Simplify FastISel's constructor by giving it a FunctionLoweringInfo instance, rather than pointers to all of FunctionLoweringInfo's members. This eliminates an NDEBUG ABI sensitivity. llvm-svn: 107789	2010-07-07 16:29:44 +00:00
Dan Gohman	e784616fbb	Move FunctionLoweringInfo.h out into include/llvm/CodeGen. This will allow target-specific fast-isel code to make use of it directly. llvm-svn: 107787	2010-07-07 16:01:37 +00:00
Dan Gohman	fe7532a308	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Dan Gohman	498e5f899d	Move CallingConvLower.cpp out of the SelectionDAG directory. llvm-svn: 107781	2010-07-07 15:15:27 +00:00
Jim Grosbach	dc0a0659be	By default, the eh.sjlj.setjmp/longjmp intrinsics should just do nothing rather than assuming a target will custom lower them. Targets which do so should exlicitly mark them as having custom lowerings. PR7454. llvm-svn: 107734	2010-07-06 23:44:52 +00:00
Dan Gohman	ee0cb70381	CanLowerReturn doesn't need a SelectionDAG; it just needs an LLVMContext. SelectBasicBlock doesn't needs its BasicBlock argument. llvm-svn: 107712	2010-07-06 22:19:37 +00:00
Devang Patel	a3ca21b228	Propagate debug loc. llvm-svn: 107710	2010-07-06 22:08:15 +00:00
Dan Gohman	3439629239	Reapply r107655 with fixes; insert the pseudo instruction into the block before calling the expansion hook. And don't put EFLAGS in a mbb's live-in list twice. llvm-svn: 107691	2010-07-06 20:24:04 +00:00
Dan Gohman	4e49b59dad	Add versions of OutputArgReg, AnalyzeReturn, and AnalyzeCallOperands which do not depend on SelectionDAG. llvm-svn: 107666	2010-07-06 15:39:54 +00:00
Chris Lattner	c4a7073db3	more tidying. llvm-svn: 107615	2010-07-05 05:53:14 +00:00
Chris Lattner	2c0315a0f3	random tidying llvm-svn: 107612	2010-07-05 05:36:21 +00:00
Evan Cheng	f3aeb2c22c	Infer alignments of fixed frame objects when they are constructed. This ensures remat'ed loads from fixed slots have the right alignments. llvm-svn: 107591	2010-07-04 18:52:05 +00:00
Bill Wendling	f844642350	Proper indentation. llvm-svn: 107581	2010-07-04 08:58:43 +00:00
Dale Johannesen	4d887f7ca7	Propagate the AlignStack bit in InlineAsm's to the PrologEpilog code, and use it to determine whether the asm forces stack alignment or not. gcc consistently does not do this for GCC-style asms; Apple gcc inconsistently sometimes does it for asm blocks. There is no convenient place to put a bit in either the SDNode or the MachineInstr form, so I've added an extra operand to each; unlovely, but it does allow for expansion for more bits, should we need it. PR 5125. Some existing testcases are affected. The operand lists of the SDNode and MachineInstr forms are indexed with awesome mnemonics, like "2"; I may fix this someday, but not now. I'm not making it any worse. If anyone is inspired I think you can find all the right places from this patch. llvm-svn: 107506	2010-07-02 20:16:09 +00:00
Jim Grosbach	9b7755fbc6	80-column and trailing whitespace cleanup. llvm-svn: 107490	2010-07-02 17:41:59 +00:00
Jim Grosbach	64a4f3f062	grammar tweaks llvm-svn: 107489	2010-07-02 17:38:34 +00:00
Dan Gohman	93f5920914	Rename CreateReg to CreateRegs, and MakeReg to CreateReg. llvm-svn: 107451	2010-07-02 00:10:16 +00:00
Dan Gohman	d2965c10a1	Temporarily disable on-demand fast-isel. llvm-svn: 107393	2010-07-01 12:15:30 +00:00
Dan Gohman	42b7ee15f5	Use FuncInfo's isExportedInst accessor method instead of doing the work manually. llvm-svn: 107384	2010-07-01 03:57:05 +00:00
Dan Gohman	85e02e9340	Rename CreateRegForValue to CreateReg, and change its argument from a Value to a Type, because it doesn't actually care about the Value. llvm-svn: 107383	2010-07-01 03:55:39 +00:00
Dan Gohman	aef3d140b7	Teach fast-isel to avoid loading a value from memory when it's already available in a register. This is pretty primitive, but it reduces the number of instructions in common testcases by 4%. llvm-svn: 107380	2010-07-01 03:49:38 +00:00
Dan Gohman	722f5fc567	Enable on-demand fast-isel. llvm-svn: 107377	2010-07-01 02:58:57 +00:00
Dan Gohman	d432223163	Reapply r106422, splitting the code for materializing a value out of SelectionDAGBuilder::getValue into a helper function, with fixes to use DenseMaps safely. llvm-svn: 107371	2010-07-01 01:59:43 +00:00
Dan Gohman	9576645a84	Don't use operator[] here, because it's not desirable to insert a default value if the search fails. llvm-svn: 107368	2010-07-01 01:33:21 +00:00
Jim Grosbach	caf9b3ab7d	grammar tweak in comment. llvm-svn: 107321	2010-06-30 21:27:56 +00:00
Duncan Sands	945a347478	Remove an unused variable. The call to getRoot has side-effects, so this could break something (but doesn't seem to). llvm-svn: 107295	2010-06-30 17:22:28 +00:00
Gabor Greif	647d9c9797	use ArgOperand API llvm-svn: 107282	2010-06-30 13:45:50 +00:00
Gabor Greif	f69acfe133	use ArgOperand API llvm-svn: 107279	2010-06-30 12:55:46 +00:00
Rafael Espindola	38a7d7cbc3	Add a VT argument to getMinimalPhysRegClass and replace the copy related uses of getPhysicalRegisterRegClass with it. If we want to make a copy (or estimate its cost), it is better to use the smallest class as more efficient operations might be possible. llvm-svn: 107140	2010-06-29 14:02:34 +00:00
Duncan Sands	6d28e73acc	Remove initialized but otherwise unused variables. llvm-svn: 107127	2010-06-29 11:22:26 +00:00
Bob Wilson	269a89fd3a	Unlike other targets, ARM now uses BUILD_VECTORs post-legalization so they can't be changed arbitrarily by the DAGCombiner without checking if it is running after legalization. llvm-svn: 107097	2010-06-28 23:40:25 +00:00
Dale Johannesen	17feb07c53	In asm's, output operands with matching input constraints have to be registers, per gcc documentation. This affects the logic for determining what "g" should lower to. PR 7393. A couple of existing testcases are affected. llvm-svn: 107079	2010-06-28 22:09:45 +00:00
Rafael Espindola	2041abd958	When splitting a VAARG, remember its alignment. This produces terrible but correct code. llvm-svn: 106952	2010-06-26 18:22:20 +00:00
Evan Cheng	02b184de5b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Duncan Sands	2dc70bea54	Remove variables which are assigned to but for which the value is not used. Spotted by gcc-4.6. llvm-svn: 106854	2010-06-25 14:48:39 +00:00
Gabor Greif	eba0be7dc9	use ArgOperand API llvm-svn: 106836	2010-06-25 09:38:13 +00:00
Gabor Greif	e4eed709d4	use ArgOperand API llvm-svn: 106828	2010-06-25 08:24:59 +00:00
Gabor Greif	f6207e0a80	prune an include llvm-svn: 106827	2010-06-25 08:16:50 +00:00
Bill Wendling	2d3c490026	It's possible that a flag is added to the SDNode that points back to the original SDNode. This is badness. Also, this function allows one SDNode to point multiple flags to another SDNode. Badness as well. llvm-svn: 106793	2010-06-24 22:00:37 +00:00
Dan Gohman	8a84cd57ae	Simplify this code; switch lowering shouldn't produce cases which trivially fold away. llvm-svn: 106765	2010-06-24 17:08:31 +00:00
Dan Gohman	463f26b4be	Eliminate the other half of the BRCOND optimization, and update as many tests as possible. llvm-svn: 106749	2010-06-24 15:24:03 +00:00
Dan Gohman	df6b33e778	Eliminate the first have of the optimization which eliminates BRCOND when the condition is constant. This optimization shouldn't be necessary, because codegen shouldn't be able to find dead control paths that the IR-level optimizer can't find. And it's undesirable, because it encourages bugpoint to leave "br i1 false" branches in its output. And it wasn't updating the CFG. I updated all the tests I could, but some tests are too reduced and I wasn't able to meaningfully preserve them. llvm-svn: 106748	2010-06-24 15:04:11 +00:00
Dan Gohman	600f62b3ba	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Dan Gohman	0695e09b09	Optimize the "bit test" code path for switch lowering in the case where the bit mask has exactly one bit. llvm-svn: 106716	2010-06-24 02:06:24 +00:00
Bill Wendling	a136521a17	MorphNodeTo doesn't preserve the memory operands. Because we're morphing a node into the same node, but with different non-memory operands, we need to replace the memory operands after it's finished morphing. llvm-svn: 106643	2010-06-23 18:16:24 +00:00
Daniel Dunbar	4df321b7ad	Revert r106263, "Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass,"... it was causing both 'file' (with clang) and 176.gcc (with llvm-gcc) to be miscompiled. llvm-svn: 106634	2010-06-23 17:09:26 +00:00
Jim Grosbach	b58c08b0ba	Some targets don't require the fencing MEMBARRIER instructions surrounding atomic intrinsics, either because the use locking instructions for the atomics, or because they perform the locking directly. Add support in the DAG combiner to fold away the fences. llvm-svn: 106630	2010-06-23 16:07:42 +00:00
Dan Gohman	dd41bba517	Use A.append(...) instead of A.insert(A.end(), ...) when A is a SmallVector, and other SmallVector simplifications. llvm-svn: 106452	2010-06-21 19:47:52 +00:00
Dan Gohman	bbc29ea821	Revert r106422, which is breaking the non-fast-isel path. llvm-svn: 106423	2010-06-21 16:02:28 +00:00
Dan Gohman	f64fdd69d0	More changes for non-top-down fast-isel. Split the code for materializing a value out of SelectionDAGBuilder::getValue into a helper function, so that it can be used in other ways. Add a new getNonRegisterValue function which uses it, for use in code which doesn't want a CopyFromReg even when FuncMap.ValueMap already has an entry for it. llvm-svn: 106422	2010-06-21 15:13:54 +00:00
Dan Gohman	f91aff5f13	Do one lookup instead of two. llvm-svn: 106415	2010-06-21 14:21:47 +00:00
Dan Gohman	7c58cf75fa	Generalize this to look in the regular ValueMap in addition to the LocalValueMap, to make it more flexible when fast-isel isn't proceding straight top-down. llvm-svn: 106414	2010-06-21 14:17:46 +00:00
Dan Gohman	8693650422	Teach regular and fast isel to set dead flags on unused implicit defs on calls and similar instructions. llvm-svn: 106353	2010-06-18 23:28:01 +00:00
Jim Grosbach	a57c2885cf	back-end libcall handling for ATOMIC_SWAP (__sync_lock_test_and_set) llvm-svn: 106342	2010-06-18 23:03:10 +00:00
Evan Cheng	f5d62535a5	Fix cross initialization compilation error. llvm-svn: 106324	2010-06-18 22:01:37 +00:00
Jim Grosbach	d64dfc1568	Add Expand-to-libcall support for additional atomics. This covers the usual entries used by llvm-gcc. *_[U]MIN and such can be added later if needed. This enables the front ends to simplify handling of the atomic intrinsics by removing the target-specific decision about which targets can handle the intrinsics. llvm-svn: 106321	2010-06-18 21:43:38 +00:00
Dan Gohman	7edb39cc6b	Minor code simplifications. llvm-svn: 106286	2010-06-18 16:00:29 +00:00
Dan Gohman	6e681a5fbe	Give NamedRegionTimer an Enabled flag, allowing all its clients to switch from this: if (TimePassesIsEnabled) { NamedRegionTimer T(Name, GroupName); do_something(); } else { do_something(); // duplicate the code, this time without a timer! } to this: { NamedRegionTimer T(Name, GroupName, TimePassesIsEnabled); do_something(); } llvm-svn: 106285	2010-06-18 15:56:31 +00:00
Dan Gohman	96ca25eba5	Don't replace the old Ordering object with a new one; just clear() the old one. llvm-svn: 106284	2010-06-18 15:40:58 +00:00
Dan Gohman	a4f46b3ef8	Don't call clear() on DbgInfo when it's going to be deleted anyway. Don't replace the old DbgInfo with a new one when clear() on the old one is sufficient. llvm-svn: 106283	2010-06-18 15:36:18 +00:00
Dan Gohman	92c11acdb8	Change UpdateNodeOperands' operand and return value from SDValue to SDNode *, since it doesn't care about the ResNo value. llvm-svn: 106282	2010-06-18 15:30:29 +00:00
Dan Gohman	f1d8304fe3	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Dan Gohman	35b6f9a929	isValueValidForType can be a static member function. llvm-svn: 106278	2010-06-18 14:01:07 +00:00
Dan Gohman	b92156d5e4	Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass, which is faster, simpler, and less surprising. llvm-svn: 106263	2010-06-18 01:05:21 +00:00
Dan Gohman	0883789ec4	Handle ext(ext(x)) -> ext(x) immediately, since it's simple. llvm-svn: 106256	2010-06-18 00:08:30 +00:00
Stuart Hastings	0125b6410a	Add a DebugLoc parameter to TargetInstrInfo::InsertBranch(). This addresses a longstanding deficiency noted in many FIXMEs scattered across all the targets. This effectively moves the problem up one level, replacing eleven FIXMEs in the targets with eight FIXMEs in CodeGen, plus one path through FastISel where we actually supply a DebugLoc, fixing Radar 7421831. llvm-svn: 106243	2010-06-17 22:43:56 +00:00
Jim Grosbach	0ed5b460dc	add missing break. inconsequential as the code shouldn't be reached, but for correctness' sake, it should be there. llvm-svn: 106229	2010-06-17 17:58:54 +00:00
Jim Grosbach	3aeae8aeeb	Add entries for Expanding atomic intrinsics to libcalls. Just a placeholder for the moment. The implementation of the libcall will follow. Currently, the llvm-gcc knows when the intrinsics can be correctly handled by the back end and only generates them in those cases, issuing libcalls directly otherwise. That's too much coupling. The intrinsics should always be generated and the back end decide how to handle them, be it with a libcall, inline code, or whatever. This patch is a step in that direction. rdar://8097623 llvm-svn: 106227	2010-06-17 17:50:54 +00:00
Jim Grosbach	ba451e80dc	ISD::MEMBARRIER should lower to a libcall (__sync_synchronize) if the target sets the legalize action to Expand. llvm-svn: 106203	2010-06-17 02:00:53 +00:00
Mon P Wang	7a84689cc5	Fixed vector widening of binary instructions that can trap. Patch by Visa Putkinen! llvm-svn: 106038	2010-06-15 20:29:05 +00:00
Evan Cheng	38f6560461	Code refactoring, no functionality changes. llvm-svn: 105775	2010-06-10 02:09:31 +00:00
Jakob Stoklund Olesen	8bc5eca331	Mark physregs defined by inline asm as implicit. This is a bit of a hack to make inline asm look more like call instructions. It would be better to produce correct dead flags during isel. llvm-svn: 105749	2010-06-09 20:05:00 +00:00
Jakob Stoklund Olesen	a13b1c29b0	Add argument name comments. llvm-svn: 105665	2010-06-09 00:40:31 +00:00
Mon P Wang	622cdd2297	Fixed a bug during widening where we would avoid legalizing a node. When we replace an OpA with a widened OpB, it is possible to get new uses of OpA due to CSE when recursively updating nodes. Since OpA has been processed, the new uses are not examined again. The patch checks if this occurred and it it did, updates the new uses of OpA to use OpB. llvm-svn: 105453	2010-06-04 01:20:10 +00:00
Dan Gohman	d83e3e7750	Fix SimplifyDemandedBits' AssertZext logic to demand all the bits. It needs to demand the high bits because it's asserting that they're zero. llvm-svn: 105406	2010-06-03 20:21:33 +00:00
Eli Friedman	dbbbf73c96	Implement expansion in type legalization for add/sub with overflow. The expansion is the same as that used by LegalizeDAG. The resulting code sucks in terms of performance/codesize on x86-32 for a 64-bit operation; I haven't looked into whether different expansions might be better in general. llvm-svn: 105378	2010-06-03 03:49:50 +00:00
Devang Patel	b0c76394a3	Keep track of incoming debug value of unused argument. Radar 7927666. llvm-svn: 105285	2010-06-01 19:59:01 +00:00
Dan Gohman	b782caa393	Fill in missing support for ISD::FEXP, ISD::FPOWI, and friends. llvm-svn: 105283	2010-06-01 18:35:14 +00:00
Chris Lattner	14c46517b5	fix PR6623: when optimizing for size, don't inline memcpy/memsets that are too large. This causes the freebsd bootloader to be too large apparently. It's unclear if this should be an -Os or -Oz thing. Thoughts welcome. llvm-svn: 105228	2010-05-31 17:30:14 +00:00
Chris Lattner	b4a773b452	the 'limit' argument to FindOptimalMemOpLowering is unsigned, not uint64_t. llvm-svn: 105226	2010-05-31 17:12:23 +00:00
Oscar Fuentes	a97311f152	Use `llvm::next' instead of `next' to make VC++ 2010 happy. llvm-svn: 105168	2010-05-30 13:14:21 +00:00
Dan Gohman	4db93c9700	Reorder some code in SelectionDAGBuilder. llvm-svn: 105105	2010-05-29 17:53:24 +00:00
Dan Gohman	d16aa541af	SelectionDAG shouldn't have a FunctionLoweringInfo member. RegsForValue shouldn't have a TargetLoweringInfo member. And FunctionLoweringInfo::set doesn't needs its EnableFastISel argument. llvm-svn: 105101	2010-05-29 17:03:36 +00:00
Evan Cheng	cc2efe11db	Fix some latency computation bugs: if the use is not a machine opcode do not just return zero. llvm-svn: 105061	2010-05-28 23:26:21 +00:00
Dan Gohman	2140a74979	Eliminate the restriction that the array size in an alloca must be i32. This will help reduce the amount of casting required on 64-bit targets. llvm-svn: 104911	2010-05-28 01:14:11 +00:00
Jim Grosbach	faa3abbe39	Update the saved stack pointer in the sjlj function context following either an alloca() or an llvm.stackrestore(). rdar://8031573 llvm-svn: 104900	2010-05-27 23:49:24 +00:00
Jim Grosbach	c9f532dddc	back out 104862/104869. Can reuse stacksave after all. Very cool. llvm-svn: 104897	2010-05-27 23:11:57 +00:00
Jim Grosbach	b68dfb45f5	hook ISD::STACKADDR to an intrinsic llvm-svn: 104869	2010-05-27 18:52:11 +00:00
Bill Wendling	ddee3cb163	Add FIXME comment to remove this. llvm-svn: 104749	2010-05-26 21:53:50 +00:00
Bill Wendling	27311269cb	Add "setjmp_syscall", "savectx", "qsetjmp", "vfork", "getcontext" to the list of usual suspects that could "return twice". llvm-svn: 104737	2010-05-26 20:39:00 +00:00
Jim Grosbach	c98892fdaa	Adjust eh.sjlj.setjmp to properly have a chain and to have an opcode entry in ISD::. No functional change. llvm-svn: 104734	2010-05-26 20:22:18 +00:00
Devang Patel	1b08572a66	Update debug info when live-in reg is copied into a vreg. llvm-svn: 104732	2010-05-26 20:18:50 +00:00
Bill Wendling	0c3bfd3fb0	Move the check for "calls setjmp" to SelectionDAGISel so that it can be used by more than just the stack slot coloring algorithm. llvm-svn: 104722	2010-05-26 19:46:12 +00:00
Dan Gohman	52c2738324	Eliminate the use of PriorityQueue and just use a std::vector, implementing pop with a linear search for a "best" element. The priority queue was a neat idea, but in practice the comparison functions depend on dynamic information. llvm-svn: 104718	2010-05-26 18:52:00 +00:00
Dan Gohman	1e5d0b0456	Delete an unused function. llvm-svn: 104716	2010-05-26 18:34:12 +00:00
Eric Christopher	e805ea9e39	Temporarily revert r104655 as it's breaking the bots. llvm-svn: 104664	2010-05-26 01:59:55 +00:00
Dan Gohman	7c00576a62	Change push_all to a non-virtual function and implement it in the base class, since all the implementations are the same. llvm-svn: 104659	2010-05-26 01:10:55 +00:00
Dan Gohman	3701b3928e	Trim #include. llvm-svn: 104657	2010-05-26 00:55:59 +00:00
Bill Wendling	c5222d6c38	Dale and Evan suggested putting the "check for setjmp" much earlier in the machine code generation. That's a good idea, so I made it so. llvm-svn: 104655	2010-05-26 00:32:40 +00:00
Dan Gohman	ce3269b815	Do one map lookup instead of two. llvm-svn: 104645	2010-05-25 21:59:42 +00:00
Dale Johannesen	60fe2cdc4f	Fix another variant of PR 7191. Also add a testcase Mon Ping provided; unfortunately bugpoint failed to reduce it, but I think it's important to have a test for this in the suite. 8023512. llvm-svn: 104624	2010-05-25 18:47:23 +00:00
Dale Johannesen	ff384ad981	Fix PR 7191. I have been unable to create a .ll file that fails, sorry. (oye, a word which should be better known to people writing tree traversals, means grandchild.) llvm-svn: 104619	2010-05-25 17:50:03 +00:00
Jim Grosbach	bd9485db63	Implement eh.sjlj.longjmp for ARM. Clean up the intrinsic a bit. Followups: docs patch for the builtin and eh.sjlj.setjmp cleanup to match longjmp. llvm-svn: 104419	2010-05-22 01:06:18 +00:00
Bob Wilson	61438fe064	Clean up extra whitespace. llvm-svn: 104410	2010-05-21 23:53:55 +00:00
Bob Wilson	51d9ee3ff6	Change CodeGen/ARM/2009-11-02-NegativeLane.ll to use 16-bit vector elements so that it will continue to test what it was meant to test when I commit a separate change for better support of BUILD_VECTOR and VECTOR_SHUFFLE for Neon. Fix a DAG combiner crash exposed by this test change. llvm-svn: 104380	2010-05-21 21:05:32 +00:00
Evan Cheng	725211e948	Rename -pre-RA-sched=hybrid to -pre-RA-sched=list-hybrid. llvm-svn: 104306	2010-05-21 00:42:32 +00:00
Evan Cheng	4401f8873c	Allow targets more controls on what nodes are scheduled by reg pressure, what for latency in hybrid mode. llvm-svn: 104293	2010-05-20 23:26:43 +00:00
Evan Cheng	bdd062dae0	Add a hybrid bottom up scheduler that reduce register usage while avoiding pipeline stall. It's useful for targets like ARM cortex-a8. NEON has a lot of long latency instructions so a strict register pressure reduction scheduler does not work well. Early experiments show this speeds up some NEON loops by over 30%. llvm-svn: 104216	2010-05-20 06:13:19 +00:00
Bob Wilson	42603958fb	Optimize away insertelement of an undef value. This shows up in test/Codegen/ARM/reg_sequence.ll but it doesn't affect the generated code because the coalescer cleans it up. Radar 7998853. llvm-svn: 104185	2010-05-19 23:42:58 +00:00
Evan Cheng	70e506e18a	Code clean up. llvm-svn: 104173	2010-05-19 22:42:23 +00:00
Evan Cheng	738e920edf	Code refactoring: pull SchedPreference enum from TargetLowering.h to TargetMachine.h and put it in its own namespace. llvm-svn: 104147	2010-05-19 20:19:50 +00:00
Bob Wilson	6a1bfd282b	When expanding a vector_shuffle, the element type may not be legal and may need to be promoted. The BUILD_VECTOR and EXTRACT_VECTOR_ELT nodes generated here already allow the promoted type to be used without further changes, so just do the promotion. This fixes part of pr7167. llvm-svn: 104141	2010-05-19 18:48:32 +00:00
Evan Cheng	abd0ad54a4	Intrinsics which do a vector compare (results are all zero or all ones) are modeled as icmp / fcmp + sext. This is turned into a vsetcc by dag combine (yes, not a good long term solution). The targets can then isel the vsetcc to the appropriate instruction. The trouble arises when the result of a vector cmp + sext is then and'ed with all ones. Instcombine will turn it into a vector cmp + zext, dag combiner will miss turning it into a vsetcc and hell breaks loose after that. Teach dag combine to turn a vector cpm + zest into a vsetcc + and 1. This fixes rdar://7923010. llvm-svn: 104094	2010-05-19 01:08:17 +00:00
Evan Cheng	f19384d54a	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Evan Cheng	45b3f702ab	Continuously refine the register class of REG_SEQUENCE def with all the source registers and sub-register indices. llvm-svn: 104051	2010-05-18 20:07:47 +00:00
Evan Cheng	e7fc64a5c9	Fix PR7162: Use source register classes and sub-indices to determine the correct register class of the definitions of REG_SEQUENCE. llvm-svn: 104050	2010-05-18 20:03:28 +00:00
Evan Cheng	48f0de96d6	FIX PR7158. SimplifyVBinOp was asserting when it fails to constant fold (op (build_vector), (build_vector)). llvm-svn: 104004	2010-05-18 00:03:40 +00:00
Bill Wendling	02d3368831	- Set the "HasCalls" flag after instruction selection is finished. - Change the logic DisableFramePointerElim() to check for the -disable-non-leaf-fp-elim before -disable-fp-elim. llvm-svn: 103990	2010-05-17 23:09:50 +00:00
Dale Johannesen	3a366a88f2	Fix uint64->{float, double} conversion to do rounding correctly in 32-bit. The implementation in LegalizeIntegerTypes to handle this as sint64->float + appropriate power of 2 is subject to double rounding, considered incorrect by numerics people. Use this implementation only when it is safe. This leads to using library calls in some cases that produced inline code before, but it's correct now. (EVTToAPFloatSemantics belongs somewhere else, any suggestions?) Add a correctly rounding (though not particularly fast) conversion that uses X87 80-bit computations for x86-32. 7885399, 5901940. This shows up in gcc.c-torture/execute/ieee/rbug.c in the gcc testsuite on some platforms. llvm-svn: 103883	2010-05-15 18:51:12 +00:00
Dale Johannesen	bb4656c05e	Improve assertion messages. llvm-svn: 103882	2010-05-15 18:38:02 +00:00
Dan Gohman	88fb253562	Fast ISel trivially coalesces away no-op casts, so check for this when setting kill flags. llvm-svn: 103832	2010-05-14 22:53:18 +00:00
Dan Gohman	2f277c866d	Don't set kill flags for instructions which the scheduler has cloned. llvm-svn: 103827	2010-05-14 22:01:14 +00:00
Bill Wendling	95f6ebcb37	Rename "HasCalls" in MachineFrameInfo to "AdjustsStack" to better describe what the variable actually tracks. N.B., several back-ends are using "HasCalls" as being synonymous for something that adjusts the stack. This isn't 100% correct and should be looked into. llvm-svn: 103802	2010-05-14 21:14:32 +00:00
Dale Johannesen	1ae94b9394	Implement a correct ui64->f32 conversion. The old one was subject to double rounding in extreme cases. llvm-svn: 103744	2010-05-13 23:50:42 +00:00
Dan Gohman	5b510c1474	An Instruction has a trivial kill only if its use is in the same basic block. llvm-svn: 103725	2010-05-13 19:19:32 +00:00
Dan Gohman	1a1b51ff59	Add initial kill flag support to FastISel. llvm-svn: 103529	2010-05-11 23:54:07 +00:00
Dan Gohman	afd2b8bbb7	Don't set kill flags on uses of CopyFromReg nodes. InstrEmitter doesn't create separate virtual registers for CopyFromReg values, so uses of them don't necessarily kill the value. llvm-svn: 103519	2010-05-11 21:59:14 +00:00
Duncan Sands	6c5e4355bb	I got tired of VISIBILITY_HIDDEN colliding with the gcc enum. Rename it to LLVM_LIBRARY_VISIBILITY and introduce LLVM_GLOBAL_VISIBILITY, which is the opposite, for future use by dragonegg. llvm-svn: 103495	2010-05-11 20:16:09 +00:00
Dan Gohman	9132c59d43	Trim #includes and forward declarations. llvm-svn: 103489	2010-05-11 19:11:43 +00:00
Dan Gohman	bb919dfb6b	Implement a bunch more TargetSelectionDAGInfo infrastructure. Move EmitTargetCodeForMemcpy, EmitTargetCodeForMemset, and EmitTargetCodeForMemmove out of TargetLowering and into SelectionDAGInfo to exercise this. llvm-svn: 103481	2010-05-11 17:31:57 +00:00
Douglas Gregor	6739a89117	Fixes for Microsoft Visual Studio 2010, from Steven Watanabe! llvm-svn: 103457	2010-05-11 06:17:44 +00:00
Evan Cheng	ffb9f18dfe	Indentation. llvm-svn: 103441	2010-05-10 23:08:19 +00:00
Evan Cheng	02947a4551	Be careful with operand promotion. For a binary operation, the source operands may be the same. PR7018. rdar://7939869. llvm-svn: 103419	2010-05-10 19:03:57 +00:00
Duncan Sands	e4d6670f6b	Add an assertion to catch attempts to access off the end of the array. Based on a patch by Javier Martinez. llvm-svn: 103391	2010-05-10 04:54:28 +00:00
Dan Gohman	7de01ec2c9	SDDbgValues are apparently not being legalized. Fix a symptom of the problem, and not the real problem itself, by dropping debug info for i128 values. rdar://7958162. llvm-svn: 103310	2010-05-07 22:19:08 +00:00
Devang Patel	2ae3397536	Verify variable directly. llvm-svn: 103305	2010-05-07 22:04:20 +00:00
Dale Johannesen	51c1695a0a	Fix PR 7087, and probably other things, by extending getConstantFP to accept the two supported long double target types. This was not the original intent, but there are other places that assume this works and it's easy enough to do. llvm-svn: 103299	2010-05-07 21:35:53 +00:00
Dan Gohman	e6d40166a8	Transfer debug location information from PHI nodes to resulting lowered copies. llvm-svn: 103228	2010-05-07 01:10:20 +00:00
Dan Gohman	e7dff14d5d	Print debug information for SDNodes. llvm-svn: 103227	2010-05-07 01:09:21 +00:00
Dan Gohman	779c69bbc5	Add a DebugLoc argument to TargetInstrInfo::copyRegToReg, so that it doesn't have to guess. llvm-svn: 103194	2010-05-06 20:33:48 +00:00
Dan Gohman	a7c717d8d4	In bottom-up mode, defer the materialization of local constant values. llvm-svn: 103139	2010-05-06 00:02:14 +00:00
Dan Gohman	ffcb590b0f	Add an "IsBottomUp" member function to FastISel, which will be used to support a new bottom-up mode. llvm-svn: 103138	2010-05-05 23:58:35 +00:00
Devang Patel	92b21cad5d	Use getValue() for PHINodes when direct NodeMap access does not work. llvm-svn: 103126	2010-05-05 22:29:00 +00:00
Evan Cheng	55869af998	Instruction selection optimizations may have moved the def of a function argument out of the entry block. rdar://7937489 llvm-svn: 102993	2010-05-04 00:58:39 +00:00
Evan Cheng	f869d9adf2	Teach scheduler about REG_SEQUENCE. llvm-svn: 102984	2010-05-04 00:22:40 +00:00
Dan Gohman	0e79c864c3	Re-enable isel kill flags, now that the local allocator is ignoring them. llvm-svn: 102981	2010-05-04 00:12:15 +00:00
Dan Gohman	626b5d8e0c	Factor out FastISel's code for materializing constants and other values in registers into a separate function to de-couple it from the top-down-specific logic in getRegForValue. llvm-svn: 102975	2010-05-03 23:36:34 +00:00
Anton Korobeynikov	737718d4f4	Insert ANY_EXTEND node instead of invalid truncate during DAG Combining (X & 1), when needed. This fixes PR7001 llvm-svn: 102838	2010-05-01 12:52:34 +00:00
Dan Gohman	ec74444d3e	Remove the code for special-casing byval for fast-isel. SelectionDAG handles argument lowering anyway, so there's no need for special casing here. llvm-svn: 102828	2010-05-01 02:44:23 +00:00
Dan Gohman	4959cf19b2	Re-disable kill flags, as there is more trouble. llvm-svn: 102826	2010-05-01 01:57:56 +00:00
Dan Gohman	77ef6f6a17	Re-enable kill flags from SelectionDAGISel, with a fix: don't try to put a kill flag on a DBG_INFO instruction. llvm-svn: 102820	2010-05-01 00:50:53 +00:00
Dan Gohman	096619eb52	Fix whitespace. llvm-svn: 102817	2010-05-01 00:33:28 +00:00
Dan Gohman	63f31115cd	Don't pass SDValues by non-const reference unless they may be modified. llvm-svn: 102816	2010-05-01 00:33:16 +00:00
Dan Gohman	5d059718c9	Reorgnaize more switch code lowering to clean up some tricky code, and to eliminate the need for the SelectionDAGBuilder state to be live during CodeGenAndEmitDAG calls. Call SDB->clear() before CodeGenAndEmitDAG calls instead of before it, and move the CurDAG->clear() out of SelectionDAGBuilder, which doesn't own the DAG, and into CodeGenAndEmitDAG. llvm-svn: 102814	2010-05-01 00:25:44 +00:00
Dan Gohman	f0514717cd	Delete the EdgeMapping variable itself. llvm-svn: 102810	2010-05-01 00:02:20 +00:00
Dan Gohman	25c1653700	Get rid of the EdgeMapping map. Instead, just check for BasicBlock changes before doing phi lowering for switches. llvm-svn: 102809	2010-05-01 00:01:06 +00:00
Bill Wendling	de4b225093	EXTRACT_VECTOR_ELT of an INSERT_VECTOR_ELT may have the same index, but the indexes could be of a different value type. Or not even using the same SDNode for the constant (weird, I know). Compare the actual values instead of the pointers. llvm-svn: 102791	2010-04-30 22:19:17 +00:00
Dan Gohman	09452cecd8	Remove this debug output. The MachineFunction will be printed once all of instruction selection is done; it's confusing to see parts of it printed, while other parts are omitted, along the way. llvm-svn: 102771	2010-04-30 21:21:21 +00:00
Dan Gohman	8acc8f7dfd	EmitDbgValue doesn't need its EdgeMapping argument. llvm-svn: 102742	2010-04-30 19:35:33 +00:00
Dan Gohman	e82c25e878	Apply a patch from Jan Sjodin to fix a compiler abort on vector comparisons sign-extended to a different bitwidth than the comparison operands. llvm-svn: 102721	2010-04-30 17:19:19 +00:00
Dan Gohman	587e0800e5	Temporarily disable SelectionDAG kill flags, which are causing trouble. llvm-svn: 102680	2010-04-30 00:32:51 +00:00
Dan Gohman	ac55510c4e	Set register kill flags on the SelectionDAG path, at least in the easy cases. llvm-svn: 102678	2010-04-30 00:08:21 +00:00
Devang Patel	0395553e35	Refactor. llvm-svn: 102661	2010-04-29 20:40:36 +00:00
Devang Patel	a46953d281	DO not push DBG_VALUE machine instructions for inlined fuction arguments in entry block. llvm-svn: 102653	2010-04-29 18:50:36 +00:00
Evan Cheng	5c864b42b2	Add comment. llvm-svn: 102606	2010-04-29 06:58:53 +00:00
Evan Cheng	923679f929	Re-enable 102565 with fixes. llvm-svn: 102602	2010-04-29 06:33:38 +00:00
Evan Cheng	d65a1e782b	Temporarily disable my changes to unbreak the build. llvm-svn: 102590	2010-04-29 03:34:19 +00:00
Evan Cheng	5fb45a2b85	Do not generate duplicate dbg_value instructions for function arguments. llvm-svn: 102585	2010-04-29 01:40:30 +00:00
Dan Gohman	d9e7322c9a	Fix missing #include. llvm-svn: 102584	2010-04-29 01:39:13 +00:00
Evan Cheng	70a0145d7c	Avoid emitting a dbg_value machineinstr that's not going to be inserted into entry block. llvm-svn: 102581	2010-04-29 01:23:55 +00:00
Evan Cheng	f4336ebb2a	Check Reg against zero. llvm-svn: 102573	2010-04-29 00:59:34 +00:00
Devang Patel	bb728e17d3	tidy up. llvm-svn: 102558	2010-04-28 23:24:13 +00:00
Evan Cheng	6e822459ed	Replace r102368 with code that's less fragile. This creates DBG_VALUE instructions for function arguments early and insert them after instruction selection is done. llvm-svn: 102554	2010-04-28 23:08:54 +00:00
Devang Patel	888c17073a	While lowering dbg_declare, emit DBG_VALUE machine instruction if alloca matching llvm.dbg.declare intrinsic is missing. llvm-svn: 102513	2010-04-28 19:27:33 +00:00
Evan Cheng	f100557c9a	Try operation promotion only if regular dag combine and target-specific ones failed to do anything. llvm-svn: 102492	2010-04-28 07:10:39 +00:00
Devang Patel	1a0bbe25e3	Ignore DBG_VALUE instructions that points to undef values. llvm-svn: 102463	2010-04-27 20:54:45 +00:00
Evan Cheng	e813690b7a	- When legal, promote a load to zextload rather than ext load. - Catch more further dag combine opportunities as result of operand promotion, e.g. (i32 anyext (i16 trunc (i32 x))) -> (i32 x) llvm-svn: 102455	2010-04-27 19:48:13 +00:00
Dale Johannesen	eb61a7d616	Revert a small part of 102372; this fixes at least one of the dbg testsuite regressions. I don't think this is really the right fix; this change exposed an existing problem upstream somewhere. llvm-svn: 102410	2010-04-27 02:10:05 +00:00
Bob Wilson	a1e343095f	Avoid adding a null MD node operand, which crashes with "-debug" when trying to print the operand. llvm-svn: 102395	2010-04-26 22:56:56 +00:00
Dale Johannesen	59a438560c	Remove crufty comments. llvm-svn: 102380	2010-04-26 20:48:54 +00:00
Dale Johannesen	e098352ed1	Add DBG_VALUE handling for byval parameters; this produces a comment on targets that support it, but the Dwarf writer is not hooked up yet. llvm-svn: 102372	2010-04-26 20:06:49 +00:00
Evan Cheng	ed69b382ea	- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and rename it to emitFrameIndexDebugValue. - Teach spiller to modify DBG_VALUE instructions to reference spill slots. llvm-svn: 102323	2010-04-26 07:38:55 +00:00
Dale Johannesen	582565e991	Stop abusing EmitInstrWithCustomInserter for target-dependent form of DEBUG_VALUE, as it doesn't have reasonable default behavior for unsupported targets. Add a new hook instead. No functional change. llvm-svn: 102320	2010-04-25 21:33:54 +00:00
Dale Johannesen	1fc01985a3	Add comment re byval args. Doesn't actually work this way yet. xs llvm-svn: 102316	2010-04-25 21:03:54 +00:00
Evan Cheng	0abb54d631	When a load operand is promoted to an extload, replace other uses with uses of extload result truncated. llvm-svn: 102236	2010-04-24 04:43:44 +00:00
Dan Gohman	5544b0c588	Apply a fix for a vector setcc dagcombine from Jan Sjodin. No testcase yet, as the testcase now fails downstream. llvm-svn: 102228	2010-04-24 01:17:30 +00:00
Evan Cheng	b9ff130d47	Code refactoring. llvm-svn: 102202	2010-04-23 19:10:30 +00:00
Dan Gohman	6e9a8fcc28	Move FastISel's HandlePHINodesInSuccessorBlocks call down into FastISel itself too. llvm-svn: 102176	2010-04-23 15:29:50 +00:00
Dan Gohman	5b43aa0ddd	Sink SelectionDAGBuilder's HandlePHINodesInSuccessorBlocks down into SelectionDAGBuilder itself. llvm-svn: 102128	2010-04-22 20:55:53 +00:00
Dan Gohman	c594eab10f	Move HandlePHINodesInSuccessorBlocks functions out of SelectionDAGISel and into SelectionDAGBuilder and FastISel. llvm-svn: 102123	2010-04-22 20:46:50 +00:00
Evan Cheng	f1223bdec0	- It's not safe to promote rotates (at least not trivially). - Some code refactoring. llvm-svn: 102111	2010-04-22 20:19:46 +00:00
Dan Gohman	e149e9896c	Fix a comment. llvm-svn: 102110	2010-04-22 20:06:42 +00:00
Dan Gohman	fd81254190	Move PHINodesToUpdate out of SelectionDAGBuilder and into FunctionLoweringInfo, as it isn't SelectionDAG-specific. This isn't completely natural, as PHI node state is not per-function but rather per-basic-block, however there's currently no other convenient per-basic-block state to group it with. llvm-svn: 102109	2010-04-22 19:55:20 +00:00
Dan Gohman	57c732b032	Add more const qualifiers on TargetMachine and friends. llvm-svn: 101977	2010-04-21 01:34:56 +00:00
Dan Gohman	450aa64fc1	Move several SelectionDAG-independent utility functions out of the SelectionDAG directory and into a new Analysis.cpp file. llvm-svn: 101975	2010-04-21 01:22:34 +00:00
Dan Gohman	ad33d33719	Add another variant of this test which found a place where CodeGen's ComputeMaskedBits was being over-conservative when computing bits for an ADD. llvm-svn: 101963	2010-04-21 00:19:28 +00:00
Dale Johannesen	0522b90cdb	Because of the EMMS problem, right now we have to support user-defined operations that use MMX register types, but the compiler shouldn't generate them on its own. This adds a Synthesizable abstraction to represent this, and changes the vector widening computation so it won't produce MMX types. (The motivation is to remove noise from the ABI compatibility part of the gcc test suite, which has some breakage right now.) llvm-svn: 101951	2010-04-20 22:34:09 +00:00
Dan Gohman	950fe784be	Sink the CopyToExportRegsIfNeeded calls out of SelectionDAGISel into SelectionDAGBuilder. This avoids a separate pass over the instructions, and has the side effect of providing debug location information to the copy. llvm-svn: 101906	2010-04-20 15:03:56 +00:00
Dan Gohman	f41ad478ca	Don't send PHI nodes down to SelectionDAGBuilder of FastISel, since they end up doing nothing. llvm-svn: 101904	2010-04-20 15:00:41 +00:00
Dan Gohman	7c845e4ea4	Sink this use_empty() check into isUsedOutsideOfDefiningBlock. llvm-svn: 101902	2010-04-20 14:50:13 +00:00
Dan Gohman	7b7f0883fe	If a PHI node somehow has debug info, propogate it to the MachineInstr PHI. llvm-svn: 101901	2010-04-20 14:48:02 +00:00
Dan Gohman	0f055d3f56	Don't iterate through the whole block just to find the PHI nodes. llvm-svn: 101900	2010-04-20 14:46:25 +00:00
Dan Gohman	0c862a86fa	Delete a redundant return statement. llvm-svn: 101860	2010-04-20 01:58:20 +00:00
Bill Wendling	467e6c2deb	The visitXOR method can return the same SDNode. If so, we don't want to delete it as it's not dead. llvm-svn: 101855	2010-04-20 01:25:01 +00:00
Dan Gohman	eadc04badc	Remove this debug output; it isn't that useful, and it's incomplete in the case where a basic block is split. llvm-svn: 101850	2010-04-20 00:56:44 +00:00
Dan Gohman	e450d7444d	Sink DebugLoc handling out of SelectionDAGISel into FastISel and SelectionDAGBuilder, where it doesn't have to be as complicated. llvm-svn: 101848	2010-04-20 00:48:35 +00:00
Dan Gohman	3df671a81c	Remove MachineFunction's DefaultDebugLoc member, and make DwarfDebug.cpp responsible for figuring out what that's supposed to be on its own. llvm-svn: 101844	2010-04-20 00:37:27 +00:00
Dan Gohman	ca35aa1122	Reapply the removal of SelectionDAGISel's BB, with a fix for the case where multiple blocks are emitted; functions which do this need to return the new BB so that their callers can stay current. llvm-svn: 101843	2010-04-20 00:29:35 +00:00
Dan Gohman	be2e727a38	Revert 101825, which is causing trouble. llvm-svn: 101832	2010-04-19 23:34:15 +00:00
Dan Gohman	8cccc542f6	Eliminate SelectionDAGISel's "current block" member. Just pass it as an argument to things that need it. llvm-svn: 101825	2010-04-19 22:51:14 +00:00
Dan Gohman	7c0303a059	Eliminate the CurMBB member from SelectionDAGBuilder. For places that need it, just pass around the parent block of the current instruction explicitly. llvm-svn: 101822	2010-04-19 22:41:47 +00:00
Evan Cheng	e19aa5cc52	More progress on promoting i16 operations to i32 for x86. Work in progress. llvm-svn: 101808	2010-04-19 19:29:22 +00:00
Dan Gohman	1e95790fd4	Give SelectionDAG a TargetMachine too, rather than having it fetch one from the MachineFunction. llvm-svn: 101807	2010-04-19 19:22:07 +00:00
Evan Cheng	e7c21a4242	More 80 col violation. llvm-svn: 101806	2010-04-19 19:17:44 +00:00
Dan Gohman	c334960f16	Code that needs a TargetMachine should have access to one directly, rather than just getting one through a TargetLowering. llvm-svn: 101802	2010-04-19 19:05:59 +00:00
Dan Gohman	a91754da67	Move isInTailCallPosition out of SelectionDAGBuilder, as it isn't SelectionDAG-specific. llvm-svn: 101801	2010-04-19 18:41:46 +00:00
Dan Gohman	1f0f2142cc	Fix -Wcast-qual warnings. llvm-svn: 101655	2010-04-17 17:42:52 +00:00
Dan Gohman	8422e57baa	Delete now-unnecessary const_casts. llvm-svn: 101637	2010-04-17 15:32:28 +00:00
Dan Gohman	21cea8ac2e	Use const qualifiers with TargetLowering. This eliminates several const_casts, and it reinforces the design of the Target classes being immutable. SelectionDAGISel::IsLegalToFold is now a static member function, because PIC16 uses it in an unconventional way. There is more room for API cleanup here. And PIC16's AsmPrinter no longer uses TargetLowering. llvm-svn: 101635	2010-04-17 15:26:15 +00:00
Evan Cheng	f1bd5fcdb4	More work to allow dag combiner to promote 16-bit ops to 32-bit. llvm-svn: 101621	2010-04-17 06:13:15 +00:00
Evan Cheng	829c300ce0	Another 80 col violation. llvm-svn: 101620	2010-04-17 06:12:32 +00:00
Eric Christopher	7258dcd77f	Revert 101465, it broke internal OpenGL testing. Probably the best way to know that all getOperand() calls have been handled is to replace that API instead of updating. llvm-svn: 101579	2010-04-16 23:37:20 +00:00
Evan Cheng	f037f87bde	(i32 sext_in_reg (i32 aext (i16 x)), i16) -> (i32 sext x). No known test case until -promote-16bit is enabled. llvm-svn: 101551	2010-04-16 22:26:19 +00:00
Dan Gohman	c4759a5b97	Create a new TargetSelectionDAGInfo class. This will eventually acquire SelectionDAG-specific parts of TargetLowering. llvm-svn: 101537	2010-04-16 21:12:11 +00:00
Dan Gohman	4d273f4519	Commit this, which should have accompanied 101531. llvm-svn: 101532	2010-04-16 20:22:43 +00:00
Evan Cheng	954bd598dd	80 col. llvm-svn: 101501	2010-04-16 17:58:41 +00:00
Evan Cheng	d6b0a7c075	80 col. llvm-svn: 101500	2010-04-16 17:57:59 +00:00
Dan Gohman	3a7ee8eead	Avoid creating virtual registers for unused values. llvm-svn: 101480	2010-04-16 17:15:02 +00:00
Dan Gohman	5664b9f1a9	Fix an assertion string. llvm-svn: 101478	2010-04-16 16:55:18 +00:00
Dan Gohman	4572a9f479	Fix a comment. llvm-svn: 101477	2010-04-16 16:52:37 +00:00
Gabor Greif	f375520f7b	reapply r101434 with a fix for self-hosting rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101465	2010-04-16 15:33:14 +00:00
Evan Cheng	af56facacd	Adding support for dag combiner to promote operations for profit. This requires target specific queries. For example, x86 should promote i16 to i32 when it does not impact load folding. x86 support is off by default. It can be enabled with -promote-16bit. Work in progress. llvm-svn: 101448	2010-04-16 06:14:10 +00:00
Dan Gohman	5563473062	Refine further the scope where the global DebugLoc value is active. llvm-svn: 101443	2010-04-16 05:06:56 +00:00
Gabor Greif	403e9694f9	back out r101423 and r101397, they break llvm-gcc self-host on darwin10 llvm-svn: 101434	2010-04-16 01:16:20 +00:00
Gabor Greif	33ae80bff7	reapply r101364, which has been backed out in r101368 with a fix rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101397	2010-04-15 20:51:13 +00:00
Dan Gohman	b29cda9b3c	Fix a bunch of namespace polution. llvm-svn: 101376	2010-04-15 17:08:50 +00:00
Gabor Greif	9fd00c7d25	back out r101364, as it trips the linux nightlybot on some clang C++ tests llvm-svn: 101368	2010-04-15 12:46:56 +00:00
Gabor Greif	aafd209632	rotate CallInst operands, i.e. move callee to the back of the operand array the motivation for this patch are laid out in my mail to llvm-commits: more efficient access to operands and callee, faster callgraph-construction, smaller compiler binary llvm-svn: 101364	2010-04-15 10:49:53 +00:00
Chris Lattner	3245afdf05	enhance the load/store narrowing optimization to handle a tokenfactor in between the load/store. This allows us to optimize test7 into: _test7: ## @test7 ## BB#0: ## %entry movl (%rdx), %eax ## kill: SIL<def> ESI<kill> movb %sil, 5(%rdi) ret instead of: _test7: ## @test7 ## BB#0: ## %entry movl 4(%esp), %ecx movl $-65281, %eax ## imm = 0xFFFFFFFFFFFF00FF andl 4(%ecx), %eax movzbl 8(%esp), %edx shll $8, %edx addl %eax, %edx movl 12(%esp), %eax movl (%eax), %eax movl %edx, 4(%ecx) ret llvm-svn: 101355	2010-04-15 06:10:49 +00:00
Chris Lattner	6ebd8674eb	teach codegen to turn trunc(zextload) into load when possible. This doesn't occur much at all, it only seems to formed in the case when the trunc optimization kicks in due to phase ordering. In that case it is saves a few bytes on x86-32. llvm-svn: 101350	2010-04-15 05:40:59 +00:00
Chris Lattner	f9b2e3c68a	add a simple dag combine to replace trivial shl+lshr with and. This happens with the store->load narrowing stuff. llvm-svn: 101348	2010-04-15 05:28:43 +00:00
Chris Lattner	4041ab6e00	Implement rdar://7860110 (also in target/readme.txt) narrowing a load/or/and/store sequence into a narrower store when it is safe. Daniel tells me that clang will start producing this sort of thing with bitfields, and this does trigger a few dozen times on 176.gcc produced by llvm-gcc even now. This compiles code like CodeGen/X86/2009-05-28-DAGCombineCrash.ll into: movl %eax, 36(%rdi) instead of: movl $4294967295, %eax ## imm = 0xFFFFFFFF andq 32(%rdi), %rax shlq $32, %rcx addq %rax, %rcx movq %rcx, 32(%rdi) and each of the testcases into a single store. Each of them used to compile into craziness like this: _test4: movl $65535, %eax ## imm = 0xFFFF andl (%rdi), %eax shll $16, %esi addl %eax, %esi movl %esi, (%rdi) ret llvm-svn: 101343	2010-04-15 04:48:01 +00:00
Dan Gohman	913c998703	Add more const qualifiers for LLVM IR pointers in CodeGen. llvm-svn: 101342	2010-04-15 04:33:49 +00:00
Dan Gohman	bcaf681cde	Add const qualifiers to CodeGen's use of LLVM IR constructs. llvm-svn: 101334	2010-04-15 01:51:59 +00:00
Evan Cheng	87b4f7c1aa	More 80 violations. llvm-svn: 101330	2010-04-15 01:25:27 +00:00
Evan Cheng	8442ef6f89	80 col violations. llvm-svn: 101325	2010-04-15 01:01:55 +00:00
Dan Gohman	c87b74d913	Delete unneeeded arguments. llvm-svn: 101276	2010-04-14 20:17:22 +00:00
Dan Gohman	a3918ecdf5	Delete unused arguments. llvm-svn: 101275	2010-04-14 20:05:00 +00:00
Dan Gohman	7deb447781	Factor out EH landing pad code into a separate function, and constify a bunch of stuff to support it. llvm-svn: 101273	2010-04-14 19:53:31 +00:00
Dan Gohman	c2c08d19b8	Reset the debug location even if the instruction was a terminator. llvm-svn: 101272	2010-04-14 19:30:02 +00:00
Dan Gohman	cacd4f2401	Refine #includes. llvm-svn: 101269	2010-04-14 18:49:17 +00:00
Dan Gohman	8ebcbe949a	Pull utility routines with no SelectionDAG dependence out of SelectionDAGBuilder. FunctionLoweringInfo isn't an ideal place for them to live, but it's better than SelectionDAGBuilder for now. llvm-svn: 101267	2010-04-14 18:31:02 +00:00
Dan Gohman	f5cca35750	Fix typos in comments. llvm-svn: 101266	2010-04-14 18:24:06 +00:00
Dan Gohman	fea9ba18ff	Delete an obsolete comment. llvm-svn: 101264	2010-04-14 17:40:25 +00:00
Dan Gohman	3215eae4a3	Delete an unused function. llvm-svn: 101263	2010-04-14 17:22:02 +00:00
Dan Gohman	094fc7b09e	Clear the FunctionLoweringInfo object before doing other things that don't need it. llvm-svn: 101262	2010-04-14 17:13:16 +00:00
Dan Gohman	ad0b3ea3cc	Move this assert out of SelectionDAGISel into FunctionLoweringInfo, and drop the redundant #ifndef NDEBUG. llvm-svn: 101261	2010-04-14 17:11:23 +00:00
Dan Gohman	0f405c8d73	Add a comment. llvm-svn: 101260	2010-04-14 17:09:37 +00:00
Dan Gohman	2ca8fb229c	Move the code for initialing the entry block livein set out of SelectionDAGISel. llvm-svn: 101258	2010-04-14 17:05:00 +00:00
Dan Gohman	4bfb437ec9	Reorgnaize this code to be more tidy and readable. llvm-svn: 101256	2010-04-14 17:02:07 +00:00
Dan Gohman	1939b5f130	Trim #includes. llvm-svn: 101255	2010-04-14 16:54:39 +00:00
Dan Gohman	2b79ee8bc8	Move the code for emitting livein copies out of SelectionDAGISel. llvm-svn: 101254	2010-04-14 16:51:49 +00:00
Dan Gohman	69e8e322d9	Sink landing-pad marking code out of SelectionDAGISel::runOnMachineFunction into FunctionLowering. llvm-svn: 101252	2010-04-14 16:32:56 +00:00
Dan Gohman	f57117d166	It's not necessary to recompute EB here. llvm-svn: 101251	2010-04-14 16:30:40 +00:00
Dan Gohman	5f40d34958	Generalize this code to handle Instructions in addition to ConstantExprs. llvm-svn: 101210	2010-04-14 02:33:23 +00:00
Dan Gohman	9162fb07be	Reorder the methods of this class to be a little more organized. llvm-svn: 101206	2010-04-14 02:09:45 +00:00
Dan Gohman	8a2dae57e2	Add a few comments. llvm-svn: 101148	2010-04-13 17:07:06 +00:00
Dan Gohman	ecd40a34e2	Remove unnecessary parens. llvm-svn: 101010	2010-04-12 02:24:01 +00:00
Dan Gohman	4ce1fb1448	Add variants of ult, ule, etc. which take a uint64_t RHS, for convenience. llvm-svn: 100824	2010-04-08 23:03:40 +00:00
Ted Kremenek	d87bd77586	Fix -Wsign-compare warning (issued by clang++). llvm-svn: 100799	2010-04-08 18:49:30 +00:00
Benjamin Kramer	a6769269f3	Use twines to simplify calls to report_fatal_error. For code size and readability. llvm-svn: 100756	2010-04-08 10:44:28 +00:00
Evan Cheng	ebe47c872f	Avoid using f64 to lower memcpy from constant string. It's cheaper to use i32 store of immediates. llvm-svn: 100751	2010-04-08 07:37:57 +00:00
Chris Lattner	3c65a8324d	convert a report_fatal_error that I was able to trigger into a nice error so the user at least knows what inline asm is a problem. For example: error: inline asm not supported yet: don't know how to handle tied indirect register inputs pr8788-1.c:14:10: note: generated from here asm ("\n" : "+r" (stack->regs) ^ Instead of: fatal error: error in backend: Don't know how to handle tied indirect register inputs yet! llvm-svn: 100731	2010-04-08 00:09:16 +00:00
Chris Lattner	94ef52824b	minor tidying. llvm-svn: 100725	2010-04-07 23:50:38 +00:00
Chris Lattner	cd92718a0f	use assertions instead of unreachable for logic errors. llvm-svn: 100724	2010-04-07 23:47:51 +00:00
Chris Lattner	2104b8d36e	rename llvm::llvm_report_error -> llvm::report_fatal_error llvm-svn: 100709	2010-04-07 22:58:41 +00:00
Chris Lattner	6855d62768	fix 80 col violation, patch by Alastair Lynn llvm-svn: 100639	2010-04-07 18:13:33 +00:00
Chris Lattner	51065568cd	Have the inst emitter add the !srcloc mdnode to the machine instr. Have the asmprinter use the mdnode to scavenge a source location if present. Document this nonsense in langref. llvm-svn: 100607	2010-04-07 05:38:05 +00:00
Chris Lattner	3b9f02a2aa	Three changes: 1. Introduce some enums and accessors in the InlineAsm class that eliminate a ton of magic numbers when handling inline asm SDNode. 2. Add a new MDNodeSDNode selection dag node type that holds a MDNode (shocking!) 3. Add a new argument to ISD::INLINEASM nodes that hold !srcloc metadata, propagating it to the instruction emitter, which drops it. No functionality change. llvm-svn: 100605	2010-04-07 05:20:54 +00:00
Dale Johannesen	5d7f0a0fdd	Move printing of target-indepedent DEBUG_VALUE comments into AsmPrinter. Target-dependent form is still generated by FastISel and still handled in X86 code. llvm-svn: 100596	2010-04-07 01:15:14 +00:00
Dale Johannesen	d1976e35c4	Allow for the possibility that a debug-value points to a SDNode that didn't have code generated for it. llvm-svn: 100566	2010-04-06 21:59:56 +00:00
Mon P Wang	bf86224d5e	Remove assert to treat memmove and memset like memcpy llvm-svn: 100521	2010-04-06 08:27:51 +00:00
Evan Cheng	272a2f8432	Fix an obvious copy-n-paste bug. It's not known to cause any miscompilation. llvm-svn: 100494	2010-04-05 23:33:29 +00:00
Dan Gohman	f38547c83f	Add a comment. llvm-svn: 100459	2010-04-05 20:24:08 +00:00
Chris Lattner	bc217873e3	lowering a volatile llvm.memcpy to a libc memcpy is ok. PR6779 llvm-svn: 100457	2010-04-05 20:11:45 +00:00
Chris Lattner	fb964e57e5	remove the now-redundant MMI pointer in SelectionDAG. llvm-svn: 100419	2010-04-05 06:19:28 +00:00
Chris Lattner	6361414c88	remove some redundant MMI arguments. llvm-svn: 100417	2010-04-05 06:10:13 +00:00
Chris Lattner	305f2efb63	unthread MMI from FastISel llvm-svn: 100416	2010-04-05 06:05:26 +00:00
Chris Lattner	f5d0636850	trim some spurious references to DwarfWriter. SDIsel really doesn't need it anymore, so don't addRequire it. llvm-svn: 100400	2010-04-05 04:09:20 +00:00
Chris Lattner	ab5dc34351	selection dag doesn't need DwarfWriter, remove some tendrils. llvm-svn: 100382	2010-04-05 02:23:33 +00:00
Chris Lattner	7cfa70e9b3	fastisel doesn't need DwarfWriter, remove some tendricles. llvm-svn: 100381	2010-04-05 02:19:28 +00:00
Mon P Wang	c576ee9040	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Benjamin Kramer	cc034c7879	Fix anachronism. llvm-svn: 100225	2010-04-02 20:47:05 +00:00
Chris Lattner	47857a9176	fix the llvm-x86_64-linux buildbot. llvm-svn: 100223	2010-04-02 20:36:25 +00:00
Chris Lattner	bd009d6d6d	stop using DebugLoc::getUnknownLoc() llvm-svn: 100215	2010-04-02 20:17:23 +00:00
Chris Lattner	915c5f9862	Switch the code generator (except the JIT) onto the new DebugLoc representation. This eliminates the 'DILocation' MDNodes for file/line/col tuples from -O0 -g codegen. This remove the old DebugLoc class, making it a typedef for DebugLoc, I'll rename NewDebugLoc next. I didn't update the JIT to use the new apis, so it will continue to work, but be as slow as before. Someone should eventually do this or, better yet, rip out the JIT debug info stuff and build the JIT on top of MC. llvm-svn: 100209	2010-04-02 19:42:39 +00:00
Evan Cheng	61399375a2	Correctly lower memset / memcpy of undef. It should be a nop. PR6767. llvm-svn: 100208	2010-04-02 19:36:14 +00:00
Mon P Wang	999c1b927b	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a972ab8564	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Evan Cheng	8563ee4ed4	Skip checking preferred alignment of GVs defined in other translation units all together. llvm-svn: 100133	2010-04-01 20:13:28 +00:00
Evan Cheng	4c014c892a	- Avoid using floating point stores to implement memset unless the value is zero. - Do not try to infer GV alignment unless its type is sized. It's not possible to infer alignment if it has opaque type. llvm-svn: 100118	2010-04-01 18:19:11 +00:00
Evan Cheng	43cd9e3845	Fix sdisel memcpy, memset, memmove lowering: 1. Makes it possible to lower with floating point loads and stores. 2. Avoid unaligned loads / stores unless it's fast. 3. Fix some memcpy lowering logic bug related to when to optimize a load from constant string into a constant. 4. Adjust x86 memcpy lowering threshold to make it more sane. 5. Fix x86 target hook so it uses vector and floating point memory ops more effectively. rdar://7774704 llvm-svn: 100090	2010-04-01 06:04:33 +00:00
Chris Lattner	3131ae86d8	use the optimized debug info apis in sdisel. llvm-svn: 99986	2010-03-31 04:24:50 +00:00
Chris Lattner	009de335ac	add new apis for getting/setting !dbg metadata on instructions. In addition to being a convenience, they are faster than the old apis, particularly when not going from an MDKindID like people should be doing. llvm-svn: 99982	2010-03-31 03:34:40 +00:00
Bob Wilson	6f7fd28824	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	7460571381	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Evan Cheng	85eea4e031	Funky indentation. llvm-svn: 99901	2010-03-30 18:08:53 +00:00
Evan Cheng	742db6874a	Fix PR4975. Avoid referencing empty vector. llvm-svn: 99840	2010-03-29 21:27:30 +00:00
Evan Cheng	4d1aa2a1c6	Pool allocate SDDbgValue nodes. llvm-svn: 99836	2010-03-29 20:48:30 +00:00
Chris Lattner	03719af41d	add a statistic for the # times isel has to backtrack. llvm-svn: 99774	2010-03-28 19:46:56 +00:00
Chris Lattner	6642118e83	finally remove the immAllOnesV_bc/immAllZerosV_bc patterns and those derived from them. These are obnoxious because they were written as: PatLeaf<(bitconvert). Not having an argument was foiling adding better type checking for operand count matching up with what was required (in this case, bitconvert always requires an operand!) llvm-svn: 99759	2010-03-28 08:43:23 +00:00
Chris Lattner	89ad2f1d60	comply with the wishes of a fixme. llvm-svn: 99742	2010-03-28 05:55:17 +00:00
Chris Lattner	4d8786b5cc	now that (parallel) is gone and a variety of bugs in targets are cleaned up, we can remove an old fixme. llvm-svn: 99741	2010-03-28 05:54:03 +00:00
Chris Lattner	49e2773dd8	add an optimized form of OPC_EmitMergeInputChains for the 1, 0 and 1, 1 cases which are by-far the most frequent. This shrinks the X86 isel table from 77014 -> 74657 bytes. llvm-svn: 99740	2010-03-28 05:50:16 +00:00
Chris Lattner	e01f7e33ac	don't add nodes to the now-dead nodes list multiple times, this can cause a crash on crazy situations in msp430 when morph-node-to is disabled. llvm-svn: 99739	2010-03-28 05:28:31 +00:00
Chris Lattner	7aed1fb69d	don't add flag nodes with chain results to the NowDeadNodes list multiple times when MorphNodeTo can't be applied. llvm-svn: 99735	2010-03-28 04:54:33 +00:00
Chris Lattner	decc73d4bc	improve -debug-only=isel comments for cases when we don't enter a scope due to obviously false predicate. llvm-svn: 99723	2010-03-27 18:54:50 +00:00
Bill Wendling	6888e798d3	Forgot the part where we handle the ".llvm.eh.catch.all.value". llvm-svn: 99697	2010-03-27 01:24:30 +00:00
Anton Korobeynikov	2f072e95e6	Add few missed libcalls and correct names for others. llvm-svn: 99656	2010-03-26 21:32:14 +00:00
Evan Cheng	eb50ac5ccc	LiveVariables should clear kill / dead markers first. This allows us to remove a hack in the scheduler. llvm-svn: 99597	2010-03-26 02:12:24 +00:00
Chris Lattner	fc4ec25363	fix a valgrind error on copy-constructor-synthesis.cpp, which is caused when the custom insertion hook deletes the instruction, then we try to set dead flags on it. Neither the code that I added nor the code that was there before was safe. llvm-svn: 99538	2010-03-25 18:49:10 +00:00
Evan Cheng	1889440b52	Scheduler assumes SDDbgValue nodes are in source order. That's true currently. But add an assertion to verify it. llvm-svn: 99501	2010-03-25 07:16:57 +00:00
Chris Lattner	552dddc51c	Change tblgen to emit FOOISD opcode names as two bytes instead of one byte. This is important because we're running up to too many opcodes to fit in a byte and it is aggrevated by FIRST_TARGET_MEMORY_OPCODE making the numbering sparse. This just bites the bullet and bloats out the table. In practice, this increases the size of the x86 isel table from 74.5K to 76K. I think we'll cope :) This fixes rdar://7791648 llvm-svn: 99494	2010-03-25 06:33:05 +00:00
Evan Cheng	08b3364c6e	Remove a fixme that doesn't make sense any more. llvm-svn: 99489	2010-03-25 06:02:53 +00:00
Evan Cheng	7f0b16a206	Make sure SDDbgValue.Invalid is initialized to false by all the constructors. llvm-svn: 99487	2010-03-25 05:50:26 +00:00
Chris Lattner	4690af8567	Make the NDEBUG assertion stronger and more clear what is happening. Enhance scheduling to set the DEAD flag on implicit defs more aggressively. Before, we'd set an implicit def operand to dead if it were present in the SDNode corresponding to the machineinstr but had no use. Now we do it in this case AND if the implicit def does not exist in the SDNode at all. This exposes a couple of problems: one is the FIXME, which causes a live intervals crash on CodeGen/X86/sibcall.ll. The second is that it makes machinecse and licm more aggressive (which is a good thing) but also exposes a case where licm hoists a set0 and then it doesn't get resunk. Talking to codegen folks about both these issues, but I need this patch in in the meantime. llvm-svn: 99485	2010-03-25 05:40:48 +00:00
Chris Lattner	e2a504ee82	reapply 99444/99445, which I speculatively reverted in r99453. llvm-svn: 99482	2010-03-25 04:41:16 +00:00
Evan Cheng	563fe3cc12	Change how dbg_value sdnodes are converted into machine instructions. Their placement should be determined by the relative order of incoming llvm instructions. The scheduler will now use the SDNode ordering information to determine where to insert them. A dbg_value instruction is inserted after the instruction with the last highest source order and before the instruction with the next highest source order. It will optimize the placement by inserting right after the instruction that produces the value if they have consecutive order numbers. Here is a theoretical example that illustrates why the placement is important. tmp1 = store tmp1 -> x ... tmp2 = add ... ... call ... store tmp2 -> x Now mem2reg comes along: tmp1 = dbg_value (tmp1 -> x) ... tmp2 = add ... ... call ... dbg_value (tmp2 -> x) When the debugger examine the value of x after the add instruction but before the call, it should have the value of tmp1. Furthermore, for dbg_value's that reference constants, they should not be emitted at the beginning of the block (since they do not have "producers"). This patch also cleans up how SDISel manages DbgValue nodes. It allow a SDNode to be referenced by multiple SDDbgValue nodes. When a SDNode is deleted, it uses the information to find the SDDbgValues and invalidate them. They are not deleted until the corresponding SelectionDAG is destroyed. llvm-svn: 99469	2010-03-25 01:38:16 +00:00
Chris Lattner	ddca7b09f7	revert 99444/99445. This doesn't cause the failure of 2006-07-19-stwbrx-crash.ll for me, but it's the only likely patch in the blame list of several bots. Lets see if this fixes it. llvm-svn: 99453	2010-03-24 23:41:19 +00:00
Chris Lattner	e70742450f	remove dead argument. llvm-svn: 99445	2010-03-24 22:47:12 +00:00
Chris Lattner	26136636e0	split EmitNode in half to reduce indentation. llvm-svn: 99444	2010-03-24 22:45:47 +00:00
Dan Gohman	b92c8c849b	Remove the ConvertActions table and associated code, which is unused. llvm-svn: 99372	2010-03-24 00:53:38 +00:00
Dan Gohman	c53d5d6bb4	Revert 99335. getTypeToExpandTo's iterative behavior is actually needed here. llvm-svn: 99339	2010-03-23 22:44:42 +00:00
Dan Gohman	42f8ddeb11	Remove getTypeToExpandTo, since it isn't adding much value beyond just calling getTypeToTransformTo. llvm-svn: 99335	2010-03-23 22:15:31 +00:00
Mon P Wang	7ad43f8768	Fixed a widening bug where we were not using the correct size for the load llvm-svn: 98920	2010-03-19 01:19:52 +00:00
Anton Korobeynikov	64578d5599	Get rid of target-specific nodes for fp16 <-> fp32 conversion. llvm-svn: 98888	2010-03-18 22:35:37 +00:00
Dan Gohman	01c65a2622	Define placement new wrappers for BumpPtrAllocator and RecyclingAllocator to allow client code to be simpler, and simplify several clients. llvm-svn: 98847	2010-03-18 18:49:47 +00:00
Bob Wilson	3c7cde466e	Fix pr6543: svn r88806 changed MachineJumpTableInfo::getJumpTableIndex() to always create a new jump table. The intention was to avoid merging jump tables in SelectionDAGBuilder, and to wait for the branch folding pass to merge tables. Unfortunately, the same getJumpTableIndex() method is also used to merge tables in branch folding, so as a result of this change branch tables are never merged. Worse, the branch folding code is expecting getJumpTableIndex to always return the index of an existing table, but with this change, it never does so. In at least some cases, e.g., pr6543, this creates references to non-existent tables. I've fixed the problem by adding a new createJumpTableIndex function, which will always create a new table, and I've changed getJumpTableIndex to only look at existing tables. llvm-svn: 98845	2010-03-18 18:42:41 +00:00
Devang Patel	0b39f10d10	Fix comment. llvm-svn: 98830	2010-03-18 16:41:16 +00:00
Devang Patel	7976f6f03c	Debug info intrinsic does not intefer during tail call optimization. llvm-svn: 98778	2010-03-17 23:52:37 +00:00
Chris Lattner	8fce3dddfa	reapply r98656 unmodified, which exposed the asmprinter not handling constant unions. llvm-svn: 98680	2010-03-16 21:25:55 +00:00
Daniel Dunbar	3a374da973	Revert r98656, its breaking all over the place. llvm-svn: 98662	2010-03-16 19:35:34 +00:00
Chris Lattner	9ae99e0df5	improve support for uniontype and ConstantUnion, patch by Tim Northover! llvm-svn: 98656	2010-03-16 19:15:03 +00:00
Devang Patel	f2bce7cbae	Create SDDbgValue for dbg_value intrinsics and remember its connections with DAG nodes. This is a work in progress. Patch by Dale Johannesen! llvm-svn: 98568	2010-03-15 19:15:44 +00:00
Devang Patel	a3e9c9ca7b	Emit dwarf variable info communicated by code generator through DBG_VALUE machine instructions. This is a work in progress. llvm-svn: 98556	2010-03-15 18:33:46 +00:00
Chris Lattner	c73a361ac5	SIGN_EXTEND from the same type as the dest is valid. llvm-svn: 98548	2010-03-15 16:15:56 +00:00
Chris Lattner	d5df1f5b54	sink the call to VT.getSizeInBits() down into its uses, not all unary nodes necessarily have a simple result type. llvm-svn: 98547	2010-03-15 16:05:15 +00:00
Duncan Sands	ca595495e4	Turn calls to copysignl into an FCOPYSIGN node. Handle FCOPYSIGN nodes with ppc_f128 type by having the type legalizer turn these back into a call to copysignl. llvm-svn: 98514	2010-03-14 21:08:40 +00:00
Evan Cheng	00fd0b6749	Rename SDDbgValue.h to SDNodeDbgValue.h for consistency. llvm-svn: 98513	2010-03-14 19:56:39 +00:00
Chris Lattner	f71cb6c439	fix ShrinkDemandedOps to not leave dead nodes around, fixing PR6607 llvm-svn: 98512	2010-03-14 19:46:02 +00:00
Chris Lattner	468decdda2	rewrite ShrinkDemandedOps to be faster and indent less, no functionality change. llvm-svn: 98511	2010-03-14 19:43:04 +00:00
Chris Lattner	f1ed59a418	make -view-isel-dags print after the 'ShrinkDemandedOps' pass. llvm-svn: 98509	2010-03-14 19:27:55 +00:00
Anton Korobeynikov	59e96008bd	Make default expansion for FP16 <-> FP32 nodes into libcalls llvm-svn: 98501	2010-03-14 18:42:24 +00:00
Anton Korobeynikov	39ed49df71	Add DAG nodes to represent FP16 <-> FP32 intrinsics llvm-svn: 98500	2010-03-14 18:42:15 +00:00
Chris Lattner	9efbbcbe45	fix AsmPrinter::GetBlockAddressSymbol to always return a unique label instead of trying to form one based on the BB name (which causes collisions if the name is empty). This fixes PR6608 llvm-svn: 98495	2010-03-14 17:53:23 +00:00
Chris Lattner	6e52e9db31	get MMI out of the label uniquing business, just go to MCContext to get unique assembler temporary labels. llvm-svn: 98489	2010-03-14 08:36:50 +00:00
Chris Lattner	ee2fbbc978	change the LabelSDNode to be EHLabelSDNode and make it hold an MCSymbol. Make the EH_LABEL MachineInstr hold its label with an MCSymbol instead of ID. Fix a bug in MMI.cpp which would return labels named "Label4" instead of "label4". llvm-svn: 98463	2010-03-14 02:33:54 +00:00
Chris Lattner	34adc8d225	change EH related stuff (other than EH_LABEL) to use MCSymbol instead of label ID's. This cleans up and regularizes a bunch of code and makes way for future progress. Unfortunately, this pointed out to me that JITDwarfEmitter.cpp is largely copy and paste from DwarfException/MachineModuleInfo and other places. This is very sad and disturbing. :( One major change here is that TidyLandingPads moved from being called in DwarfException::BeginFunction to being called in DwarfException::EndFunction. There should not be any functionality change from doing this, but I'm not an EH expert. llvm-svn: 98459	2010-03-14 01:41:15 +00:00
Duncan Sands	03fcbcf407	Revert turning copysignl into a COPYSIGN node for the moment: ppc calls copysignl with a 128 bit ppc long double, resulting in a node that the type legalizer doesn't know how to expand. llvm-svn: 98357	2010-03-12 17:41:34 +00:00
Duncan Sands	607f1825b0	Now that it's supported, turn copysignl into a COPYSIGN node. llvm-svn: 98348	2010-03-12 12:13:59 +00:00
Duncan Sands	4c55f76936	Fix PR6522: implement copysign expansion for x86 long double (it seems that FreeBSD doesn't have copysignl). Done by removing a bunch of assumptions from the code. This may also help with sparc 128 bit floats. llvm-svn: 98346	2010-03-12 11:45:06 +00:00
Chris Lattner	53ebf8a7ca	fix PR6577, a bug in sdbuilder lowering select instructions whose true value was not Val#0. llvm-svn: 98336	2010-03-12 07:15:36 +00:00
Dan Gohman	576aec4363	Remove getWidenVectorType, which is no longer used. llvm-svn: 98289	2010-03-11 21:39:57 +00:00
Evan Cheng	180704dd1d	In case of tail call size of Ins and InVals may not match. llvm-svn: 98277	2010-03-11 19:38:18 +00:00
Daniel Dunbar	84b5ddc872	Remove dead include. llvm-svn: 98225	2010-03-11 02:28:48 +00:00
Chris Lattner	4ec0b670d5	fix PR6533 by updating the br(xor) code to remember the case when it looked past a trunc. llvm-svn: 98203	2010-03-10 23:46:44 +00:00
Dale Johannesen	29108f0b6c	Cosmetic: lengthen names and improve comments. llvm-svn: 98202	2010-03-10 23:37:24 +00:00
Dale Johannesen	49de0607a8	Progress towards shepherding debug info through SelectionDAG. No functional effect yet. This is still evolving and should not be viewed as final. llvm-svn: 98195	2010-03-10 22:13:47 +00:00
Dan Gohman	703b12d62f	Fix another bitwidth calculation to handle vector types; based on a patch by Micah Villmow for PR6572. llvm-svn: 98188	2010-03-10 21:04:53 +00:00
Dan Gohman	52cc041ee5	Attempt to make this debug output meaningful, both in the case of multibyte opcodes and in the case of multiple scopes. llvm-svn: 98036	2010-03-09 02:15:05 +00:00
Dan Gohman	f6fb1e0d93	Print the correct index in the "match failed at index" message. llvm-svn: 98013	2010-03-09 00:07:36 +00:00
Dale Johannesen	30488c636d	Add Order to SDDbgValue llvm-svn: 97939	2010-03-08 05:39:50 +00:00
Chris Lattner	28dc6c12c3	Use Other as a sentinel instead of iAny. llvm-svn: 97914	2010-03-07 07:45:08 +00:00
Dale Johannesen	10a77adede	Add some new bits of debug info handling. No functional change yet. llvm-svn: 97855	2010-03-06 00:03:23 +00:00
Dan Gohman	14e450f595	Reapply r97778 and r97779, enabled only for unsigned i64 to f64 conversions. llvm-svn: 97854	2010-03-06 00:00:55 +00:00
Jakob Stoklund Olesen	b0503beff1	Avoid creating bad PHI instructions when BR is being const-folded. llvm-svn: 97836	2010-03-05 21:49:10 +00:00
Chris Lattner	55e81eb49f	Fix PR6497, a bug where we'd fold a load into an addc node which has a flag. That flag in turn was used by an already-selected adde which turned into an ADC32ri8 which used a selected load which was chained to the load we folded. This flag use caused us to form a cycle. Fix this by not ignoring chains in IsLegalToFold even in cases where the isel thinks it can. llvm-svn: 97791	2010-03-05 06:19:13 +00:00
Chris Lattner	374a3ac744	inline a small function with one call site. llvm-svn: 97789	2010-03-05 05:49:45 +00:00
Dan Gohman	998c7c2614	Revert r97778 and r97779. They're somehow breaking llvm-gcc builds. llvm-svn: 97781	2010-03-05 02:40:23 +00:00
Dan Gohman	ba9eb0bf2e	Fix these constants to be more portable. llvm-svn: 97779	2010-03-05 02:13:10 +00:00
Dan Gohman	7fbeeebaf6	Rewrite i64-to-f64 conversion using an algorithm which handles rounding correctly. This implementation is a generalization of the x86_64 code in compiler-rt. This fixes rdar://7683708. llvm-svn: 97778	2010-03-05 02:00:46 +00:00
Chris Lattner	c1cb75eb72	add a statistic for # times fastisel fails. llvm-svn: 97738	2010-03-04 19:46:56 +00:00
Dan Gohman	9cc886b9f1	Fix a typo Duncan noticed. llvm-svn: 97735	2010-03-04 19:11:28 +00:00
Chris Lattner	0acbb71bad	change the new isel matcher to emit ComplexPattern matches as the very last thing before node emission. This should dramatically reduce the number of times we do 'MatchAddress' on X86, speeding up compile time. This also improves comments in the tables and shrinks the table a bit, now down to 80506 bytes for x86. llvm-svn: 97703	2010-03-04 01:23:08 +00:00
Dan Gohman	e14c4087a3	Fix more code to work properly with vector operands. Based on a patch my Micah Villmow for PR6465. llvm-svn: 97692	2010-03-04 00:23:16 +00:00
Chris Lattner	878b3e46fb	inline CannotYetSelectIntrinsic into CannotYetSelect and simplify. llvm-svn: 97690	2010-03-04 00:21:16 +00:00
Dan Gohman	7d099f7e89	Fix a bug in SelectionDAG's ReplaceAllUsesWith in the case where CSE and recursive RAUW calls delete a node from the use list, invalidating the use list iterator. There's currently no known way to reproduce this in an unmodified LLVM, however there's no fundamental reason why a SelectionDAG couldn't be formed which would trigger this case. llvm-svn: 97665	2010-03-03 21:33:37 +00:00
Chris Lattner	dc1b6f79da	add some of the more obscure predicate types to the Scope accelerator. llvm-svn: 97652	2010-03-03 07:46:25 +00:00
Chris Lattner	796f1da479	speed up scope node processing: if the first element of a scope entry we're about to process is obviously going to fail, don't bother pushing a scope only to have it immediately be popped. This avoids a lot of scope stack traffic in common cases. Unfortunately, this requires duplicating some of the predicate dispatch. To avoid duplicating the actual logic I pulled each predicate out to its own static function which gets used in both places. llvm-svn: 97651	2010-03-03 07:31:15 +00:00
Chris Lattner	3e1ffd06fc	introduce a new SwitchTypeMatcher node (which is analogous to SwitchOpcodeMatcher) and have DAGISelMatcherOpt form it. This speeds up selection, particularly for X86 which has lots of variants of instructions with only type differences. llvm-svn: 97645	2010-03-03 06:28:15 +00:00
Bill Wendling	c8d3add052	Use APInt instead of zext value. llvm-svn: 97631	2010-03-03 01:58:01 +00:00
Bill Wendling	af13d82945	This test case: long test(long x) { return (x & 123124) \| 3; } Currently compiles to: _test: orl $3, %edi movq %rdi, %rax andq $123127, %rax ret This is because instruction and DAG combiners canonicalize (or (and x, C), D) -> (and (or, D), (C \| D)) However, this is only profitable if (C & D) != 0. It gets in the way of the 3-addressification because the input bits are known to be zero. llvm-svn: 97616	2010-03-03 00:35:56 +00:00
Chris Lattner	dd030701bd	Fix some issues in WalkChainUsers dealing with CopyToReg/CopyFromReg/INLINEASM. These are annoying because they have the same opcode before an after isel. Fix this by setting their NodeID to -1 to indicate that they are selected, just like what automatically happens when selecting things that end up being machine nodes. With that done, give IsLegalToFold a new flag that causes it to ignore chains. This lets the HandleMergeInputChains routine be the one place that validates chains after a match is successful, enabling the new hotness in chain processing. This smarter chain processing eliminates the need for "PreprocessRMW" in the X86 and MSP430 backends and enables MSP to start matching it's multiple mem operand instructions more aggressively. I currently #if out the dead code in the X86 backend and MSP backend, I'll remove it for real in a follow-on patch. The testcase changes are: test/CodeGen/X86/sse3.ll: we generate better code test/CodeGen/X86/store_op_load_fold2.ll: PreprocessRMW was miscompiling this before, we now generate correct code Convert it to filecheck while I'm at it. test/CodeGen/MSP430/Inst16mm.ll: Add a testcase for mem/mem folding to make anton happy. :) llvm-svn: 97596	2010-03-02 22:20:06 +00:00
Chris Lattner	27a184b851	run HandleMergeInputChains even if we only have one input chain. llvm-svn: 97581	2010-03-02 19:34:59 +00:00
Chris Lattner	925ac71f26	Fix the xfail I added a couple of patches back. The issue was that we weren't properly handling the case when interior nodes of a matched pattern become dead after updating chain and flag uses. Now we handle this explicitly in UpdateChainsAndFlags. llvm-svn: 97561	2010-03-02 07:50:03 +00:00
Chris Lattner	350bb062b2	I was confused about this, it turns out that MorphNodeTo does delete ex-operands that become dead. llvm-svn: 97559	2010-03-02 07:14:49 +00:00
Chris Lattner	9732ab6d86	factor node morphing out to its own helper method. llvm-svn: 97558	2010-03-02 06:55:04 +00:00
Chris Lattner	f98f124a73	Sink InstructionSelect() out of each target into SDISel, and rename it DoInstructionSelection. Inline "SelectRoot" into it from DAGISelHeader. Sink some other stuff out of DAGISelHeader into SDISel. Eliminate the various 'Indent' stuff from various targets, which dates to when isel was recursive. 17 files changed, 114 insertions(+), 430 deletions(-) llvm-svn: 97555	2010-03-02 06:34:30 +00:00
Chris Lattner	2f846eeaca	Use the right induction variable. llvm-svn: 97541	2010-03-02 02:37:23 +00:00
Chris Lattner	b884fe867e	Rewrite chain handling validation and input TokenFactor handling stuff now that we don't care about emulating the old broken behavior of the old isel. This eliminates the 'CheckChainCompatible' check (along with IsChainCompatible) which did an incorrect and inefficient scan up the chain nodes which happened as the pattern was being formed and does the validation at the end in HandleMergeInputChains when it forms a structural pattern. This scans "down" the graph, which means that it is quickly bounded by nodes already selected. This also handles token factors that get "trapped" in the dag. Removing the CheckChainCompatible nodes also shrinks the generated tables by about 6K for X86 (down to 83K). There are two pieces remaining before I can nuke PreprocessRMW: 1. I xfailed a test because we're now producing worse code in a case that has nothing to do with the change: it turns out that our use of MorphNodeTo will leave dead nodes in the graph which (depending on how the graph is walked) end up causing bogus uses of chains and blocking matches. This is really bad for other reasons, so I'll fix this in a follow-up patch. 2. CheckFoldableChainNode needs to be improved to handle the TF. llvm-svn: 97539	2010-03-02 02:22:10 +00:00
Dan Gohman	4cec543952	Fix several places to handle vector operands properly. Based on a patch by Micah Villmow for PR6438. llvm-svn: 97538	2010-03-02 02:14:38 +00:00
Bill Wendling	78c5b7a76d	Remove dead parameter passing. llvm-svn: 97536	2010-03-02 01:55:18 +00:00
Chris Lattner	7894ab3a99	remove dead code. llvm-svn: 97529	2010-03-02 00:40:26 +00:00
Chris Lattner	c1f2e15332	refactor some code out of OPC_EmitMergeInputChains into a new helper function. llvm-svn: 97525	2010-03-02 00:00:03 +00:00
Chris Lattner	19c92aea01	remove all but one version of SelectionDAG::MorphNodeTo (the most general) the others are dead. llvm-svn: 97511	2010-03-01 22:20:05 +00:00
Chris Lattner	c1a3190870	Accelerate isel dispatch for tables that start with a top-level OPC_SwitchOpcode to use a table lookup instead of having to go through the interpreter for this. llvm-svn: 97469	2010-03-01 18:47:11 +00:00
Dan Gohman	c3c3c6829f	Fix optimization of ISD::TRUNCATE on vector operands. Based on a patch by Micah Villmow for PR6335. llvm-svn: 97461	2010-03-01 17:59:21 +00:00
Chris Lattner	e89ca7c146	some trivial microoptimizations. llvm-svn: 97441	2010-03-01 07:43:08 +00:00
Chris Lattner	053a28a397	eliminate the CheckMultiOpcodeMatcher code and have each ComplexPattern at the root be generated multiple times, once for each opcode they are part of. This encourages factoring because the opcode checks get treated just like everything else in the matcher. llvm-svn: 97439	2010-03-01 07:17:40 +00:00
Chris Lattner	f4d1775263	add a new OPC_SwitchOpcode which is semantically equivalent to a scope where every child starts with a CheckOpcode, but executes more efficiently. Enhance DAGISelMatcherOpt to form it. This also fixes a bug in CheckOpcode: apparently the SDNodeInfo objects are not pointer comparable, we have to compare the enum name. llvm-svn: 97438	2010-03-01 06:59:22 +00:00
Chris Lattner	53cf6b8444	eliminate GetInt1/2 llvm-svn: 97426	2010-02-28 22:38:43 +00:00
Chris Lattner	5ef43cec36	hoist the new isel interpreter out of DAGISelHeader.h (which gets #included into the middle of each target's DAGISel class) into a .cpp file where it is only compiled once. llvm-svn: 97425	2010-02-28 22:37:22 +00:00
Chris Lattner	af197502d6	enhance the new isel to handle the 'node already exists' case of MorphNodeTo directly. llvm-svn: 97417	2010-02-28 21:36:14 +00:00
Chris Lattner	b1af865aa6	simplify this code, return only ever has zero or one operands. llvm-svn: 97408	2010-02-28 18:53:13 +00:00
Evan Cheng	228c31f045	Re-apply 97040 with fix. This survives a ppc self-host llvm-gcc bootstrap. llvm-svn: 97310	2010-02-27 07:36:59 +00:00
Dale Johannesen	dd33104203	Move dbg_value generation to target-independent FastISel, as X86 is currently the only FastISel target. Per review. llvm-svn: 97255	2010-02-26 20:01:55 +00:00
Dan Gohman	2a8e3777b4	Fix ExpandVectorBuildThroughStack for the case where the operands are themselves vectors. Based on a patch by Micah Villmow for PR6338. llvm-svn: 97165	2010-02-25 20:30:49 +00:00
Dan Gohman	9b80f86e5b	Revert r97064. Duncan pointed out that bitcasts are defined in terms of store and load, which means bitcasting between scalar integer and vector has endian-specific results, which undermines this whole approach. llvm-svn: 97137	2010-02-25 15:20:39 +00:00
Chris Lattner	d3aa3aa0ec	clean up various VT manipulations, patch by Micah Villmow! PR6337 llvm-svn: 97072	2010-02-24 22:44:06 +00:00
Dan Gohman	4b2b48daba	Make getTypeSizeInBits work correctly for array types; it should return the number of value bits, not the number of bits of allocation for in-memory storage. Make getTypeStoreSize and getTypeAllocSize work consistently for arrays and vectors. Fix several places in CodeGen which compute offsets into in-memory vectors to use TargetData information. This fixes PR1784. llvm-svn: 97064	2010-02-24 22:05:23 +00:00
Chris Lattner	9b7cfd39b2	convert cycle checker to smallptrset, add comments and make it more elegant. llvm-svn: 97059	2010-02-24 21:34:04 +00:00
Chris Lattner	02ec121de8	revert david's patch which does not even build. llvm-svn: 97057	2010-02-24 21:25:08 +00:00
David Greene	8328341d9c	Use a SmallPtrSet as suggested by Chris. llvm-svn: 97056	2010-02-24 20:59:49 +00:00
Daniel Dunbar	4811d004be	Speculatively revert r97011, "Re-apply 96540 and 96556 with fixes.", again in the hopes of fixing PPC bootstrap. llvm-svn: 97040	2010-02-24 17:05:47 +00:00
Dan Gohman	3860521406	When forming SSE min and max nodes for UGE and ULE comparisons, it's necessary to swap the operands to handle NaN and negative zero properly. Also, reintroduce logic for checking for NaN conditions when forming SSE min and max instructions, fixed to take into consideration NaNs and negative zeros. This allows forming min and max instructions in more cases. llvm-svn: 97025	2010-02-24 06:52:40 +00:00
Chris Lattner	df8a8a8c6f	Change the scheduler from adding nodes in allnodes order to adding them in a determinstic order (bottom up from the root) based on the structure of the graph itself. This updates tests for some random changes, interesting bits: CodeGen/Blackfin/promote-logic.ll no longer crashes. I have no idea why, but that's good right? CodeGen/X86/2009-07-16-LoadFoldingBug.ll also fails, but now compiles to have one fewer constant pool entry, making the expected load that was being folded disappear. Since it is an unreduced mass of gnast, I just removed it. This fixes PR6370 llvm-svn: 97023	2010-02-24 06:11:37 +00:00
Chris Lattner	3ea9066bb4	add node #'s to debug dumps. llvm-svn: 97019	2010-02-24 04:24:44 +00:00
Evan Cheng	328a607490	Re-apply 96540 and 96556 with fixes. llvm-svn: 97011	2010-02-24 01:42:31 +00:00
Chris Lattner	625916df32	make selectnodeto set the nodeid to -1. This makes it more akin to creating a new node then replacing uses. llvm-svn: 97000	2010-02-23 23:01:35 +00:00
Chris Lattner	8585850e94	fix a bug in findNonImmUse (used by IsLegalToFold) where nodes with no id's would cause early exit allowing IsLegalToFold to return true instead of false, producing a cyclic dag. This was striking the new isel because it isn't using SelectNodeTo yet, which theoretically is just an optimization. llvm-svn: 96972	2010-02-23 19:32:27 +00:00
Chris Lattner	1738d49b74	Print node ID's in dumps and views if set. llvm-svn: 96971	2010-02-23 19:31:18 +00:00
David Greene	d8ecd5e902	Speed up cycle checking significantly by caching results. llvm-svn: 96956	2010-02-23 17:37:50 +00:00
Duncan Sands	d0bf6f640f	Revert commits 96556 and 96640, because commit 96556 breaks the dragonegg self-host build. I reverted 96640 in order to revert 96556 (96640 goes on top of 96556), but it also looks like with both of them applied the breakage happens even earlier. The symptom of the 96556 miscompile is the following crash: llvm[3]: Compiling AlphaISelLowering.cpp for Release build cc1plus: /home/duncan/tmp/tmp/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp:4982: void llvm::SelectionDAG::ReplaceAllUsesWith(llvm::SDNode, llvm::SDNode, llvm::SelectionDAG::DAGUpdateListener*): Assertion `(!From->hasAnyUseOfValue(i) \|\| From->getValueType(i) == To->getValueType(i)) && "Cannot use this version of ReplaceAllUsesWith!"' failed. Stack dump: 0. Running pass 'X86 DAG->DAG Instruction Selection' on function '@_ZN4llvm19AlphaTargetLowering14LowerOperationENS_7SDValueERNS_12SelectionDAGE' g++: Internal error: Aborted (program cc1plus) This occurs when building LLVM using LLVM built by LLVM (via dragonegg). Probably LLVM has miscompiled itself, though it may have miscompiled GCC and/or dragonegg itself: at this point of the self-host build, all of GCC, LLVM and dragonegg were built using LLVM. Unfortunately this kind of thing is extremely hard to debug, and while I did rummage around a bit I didn't find any smoking guns, aka obviously miscompiled code. Found by bisection. r96556 \| evancheng \| 2010-02-18 03:13:50 +0100 (Thu, 18 Feb 2010) \| 5 lines Some dag combiner goodness: Transform br (xor (x, y)) -> br (x != y) Transform br (xor (xor (x,y), 1)) -> br (x == y) Also normalize (and (X, 1) == / != 1 -> (and (X, 1)) != / == 0 to match to "test on x86" and "tst on arm" r96640 \| evancheng \| 2010-02-19 01:34:39 +0100 (Fri, 19 Feb 2010) \| 16 lines Transform (xor (setcc), (setcc)) == / != 1 to (xor (setcc), (setcc)) != / == 1. e.g. On x86_64 %0 = icmp eq i32 %x, 0 %1 = icmp eq i32 %y, 0 %2 = xor i1 %1, %0 br i1 %2, label %bb, label %return => testl %edi, %edi sete %al testl %esi, %esi sete %cl cmpb %al, %cl je LBB1_2 llvm-svn: 96672	2010-02-19 11:30:41 +00:00
Evan Cheng	d2d9252f35	Transform (xor (setcc), (setcc)) == / != 1 to (xor (setcc), (setcc)) != / == 1. e.g. On x86_64 %0 = icmp eq i32 %x, 0 %1 = icmp eq i32 %y, 0 %2 = xor i1 %1, %0 br i1 %2, label %bb, label %return => testl %edi, %edi sete %al testl %esi, %esi sete %cl cmpb %al, %cl je LBB1_2 llvm-svn: 96640	2010-02-19 00:34:39 +00:00
Evan Cheng	0ceb68a552	Some dag combiner goodness: Transform br (xor (x, y)) -> br (x != y) Transform br (xor (xor (x,y), 1)) -> br (x == y) Also normalize (and (X, 1) == / != 1 -> (and (X, 1)) != / == 0 to match to "test on x86" and "tst on arm" llvm-svn: 96556	2010-02-18 02:13:50 +00:00
David Greene	b7941b0703	Make the non-temporal bit "significant" in MemSDNodes so they aren't CSE'd or otherwise combined with temporal MemSDNodes. llvm-svn: 96505	2010-02-17 20:21:42 +00:00
Chris Lattner	e78bc753fe	sink special case "cannotyetselect" for intrinsics out of the tblgen splatted code into the implementation. llvm-svn: 96460	2010-02-17 06:28:22 +00:00
Duncan Sands	19d0b47b1f	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
Evan Cheng	3f08464a1a	Fix a memory leak. Patch by Nicolas Geoffray. llvm-svn: 96295	2010-02-15 23:16:53 +00:00
Evan Cheng	5e73ff2e3a	Split SelectionDAGISel::IsLegalAndProfitableToFold to IsLegalToFold and IsProfitableToFold. The generic version of the later simply checks whether the folding candidate has a single use. This allows the target isel routines more flexibility in deciding whether folding makes sense. The specific case we are interested in is folding constant pool loads with multiple uses. llvm-svn: 96255	2010-02-15 19:41:07 +00:00
David Greene	39c6d01879	Add non-temporal flags and remove an assumption of default arguments. llvm-svn: 96240	2010-02-15 17:00:31 +00:00
Duncan Sands	9dff9bec31	Uniformize the names of type predicates: rather than having isFloatTy and isInteger, we now have isFloatTy and isIntegerTy. Requested by Chris! llvm-svn: 96223	2010-02-15 16:12:20 +00:00
Jakob Stoklund Olesen	45396438c3	Use array_pod_sort instead of std::sort for improved code size. Use SmallVector instead of std::vector for better speed when indirectbr has few successors. llvm-svn: 95879	2010-02-11 18:06:56 +00:00
Jakob Stoklund Olesen	896428d630	Remove duplicate successors from indirectbr instructions before building the machine CFG. This makes early tail duplication run 60 times faster when compiling the Firefox JavaScript interpreter, see PR6186. llvm-svn: 95831	2010-02-11 00:34:18 +00:00
Mon P Wang	5b77f0dac1	The previous fix of widening divides that trap was too fragile as it depends on custom lowering and requires that certain types exist in ValueTypes.h. Modified widening to check if an op can trap and if so, the widening algorithm will apply only the op on the defined elements. It is safer to do this in widening because the optimizer can't guarantee removing unused ops in some cases. llvm-svn: 95823	2010-02-10 23:37:45 +00:00
Dan Gohman	4a618827de	Fix "the the" and similar typos. llvm-svn: 95781	2010-02-10 16:03:48 +00:00
Evan Cheng	29b8f554fc	Now that ShrinkDemandedOps() is separated out from DAG combine. It sometimes leave some obvious nops which dag combine used to clean up afterwards e.g. (trunk (ext n)) -> n. Look for them and squash them. llvm-svn: 95757	2010-02-10 02:17:34 +00:00
Evan Cheng	3ebd551aac	Emit an error for illegal inline asm constraint (which uses illegal type) rather than asserting. llvm-svn: 95746	2010-02-10 01:21:02 +00:00
Dale Johannesen	3d1f1cccbb	Fix comments to reflect renaming elsewhere. llvm-svn: 95730	2010-02-10 00:11:11 +00:00
David Greene	893047d43e	Only dump output in debug mode. llvm-svn: 95711	2010-02-09 23:03:05 +00:00
Chris Lattner	b06015aa69	move target-independent opcodes out of TargetInstrInfo into TargetOpcodes.h. #include the new TargetOpcodes.h into MachineInstr. Add new inline accessors (like isPHI()) to MachineInstr, and start using them throughout the codebase. llvm-svn: 95687	2010-02-09 19:54:29 +00:00
Dale Johannesen	120cfe23a7	Apply the 95471 fix to SelectionDAGBuilder as well; we can get in here if FastISel gives up in a block. (Actually the two copies of this need to be unified. Later.) llvm-svn: 95579	2010-02-08 21:53:27 +00:00
Dan Gohman	bd374da130	In guaranteed tailcall mode, don't decline the tailcall optimization for blocks ending in "unreachable". llvm-svn: 95565	2010-02-08 20:34:14 +00:00
Dale Johannesen	db2eb47835	After Victor's latest commits I am seeing null addresses in dbg.declare; ignore this for the moment to prevent things from breaking. llvm-svn: 95471	2010-02-06 02:26:02 +00:00
Evan Cheng	3b245876c0	When the scheduler unfold a load folding instruction it move some of the predecessors to the unfolded load. It decides what gets moved to the load by checking whether the new load is using the predecessor as an operand. The check neglects the cases whether the predecessor is a flagged scheduling unit. rdar://7604000 llvm-svn: 95339	2010-02-05 01:27:11 +00:00
Evan Cheng	0a4fa4ca93	Fix typo Duncan noticed. llvm-svn: 95322	2010-02-04 19:07:06 +00:00
Evan Cheng	01676f9ff4	It's too risky to eliminate sext / zext of call results for tail call optimization even if the caller / callee attributes completely match. The callee may have been bitcast'ed (or otherwise lied about what it's doing). llvm-svn: 95282	2010-02-04 02:45:02 +00:00
Evan Cheng	27a41d5473	Revert 94937 and move the noreturn check to codegen. llvm-svn: 95198	2010-02-03 03:55:59 +00:00
Evan Cheng	40905b4302	Allow all types of callee's to be tail called. But avoid automatic tailcall if the callee is a result of bitcast to avoid losing necessary zext / sext etc. llvm-svn: 95195	2010-02-03 03:28:02 +00:00
Evan Cheng	6f36a083ef	Revert 95130. llvm-svn: 95160	2010-02-02 23:55:14 +00:00
Evan Cheng	c1b0116ff1	Pass callsite return type to TargetLowering::LowerCall and use that to check sibcall eligibility. llvm-svn: 95130	2010-02-02 21:29:10 +00:00
Mon P Wang	d74e0023c5	Improve EXTRACT_VECTOR_ELT patch based on comments from Duncan llvm-svn: 95012	2010-02-01 22:15:09 +00:00
Chris Lattner	f5edeebd8c	eliminate a bunch of pointless LLVMContext arguments. llvm-svn: 95001	2010-02-01 20:48:08 +00:00
Dale Johannesen	0b30cfc57e	fix PR 6157. Testcase pending. llvm-svn: 94996	2010-02-01 19:54:53 +00:00
Mon P Wang	72c60c73af	Fixed a couple of optimization with EXTRACT_VECTOR_ELT that assumes the result type is the same as the element type of the vector. EXTRACT_VECTOR_ELT can be used to extended the width of an integer type. This fixes a bug for Generic/vector-casts.ll on a ppc750. llvm-svn: 94990	2010-02-01 19:03:18 +00:00
Duncan Sands	3327498095	Change the SREM case to match the logic in the IR version ComputeMaskedBits. llvm-svn: 94805	2010-01-29 09:45:26 +00:00
Bill Wendling	954cb187e0	Assign the ordering of SDNodes in a much less intrusive fashion. After the "visit*" method is called, take the newly created nodes, walk them in a DFS fashion, and if they don't have an ordering set, then give it one. llvm-svn: 94757	2010-01-28 21:51:40 +00:00
Jim Grosbach	54c0530834	Update of 94055 to track the IR level call site information via an intrinsic. This allows code gen and the exception table writer to cooperate to make sure landing pads are associated with the correct invoke locations. llvm-svn: 94726	2010-01-28 01:45:32 +00:00
Evan Cheng	67a69dd2ed	Eliminate target hook IsEligibleForTailCallOptimization. Target independent isel should always pass along the "tail call" property. Change target hook LowerCall's parameter "isTailCall" into a refernce. If the target decides it's impossible to honor the tail call request, it should set isTailCall to false to make target independent isel happy. llvm-svn: 94626	2010-01-27 00:07:07 +00:00
Evan Cheng	c35b5a123b	Allow some automatic tailcall optimization without changing ABI. llvm-svn: 94611	2010-01-26 23:13:04 +00:00
Chris Lattner	547c761dd6	eliminate the TargetLowering::UsesGlobalOffsetTable bool, which is subsumed by TargetLowering::getJumpTableEncoding(). Change uses of it to be more specific. llvm-svn: 94529	2010-01-26 06:53:37 +00:00
Chris Lattner	8a785d7a67	Move getJTISymbol from MachineJumpTableInfo to MachineFunction, which is more convenient, and change getPICJumpTableRelocBaseExpr to take a MachineFunction to match. Next, move the X86 code that create a PICBase symbol to X86TargetLowering::getPICBaseSymbol from X86MCInstLower::GetPICBaseSymbol, which was an asmprinter specific library. This eliminates a 'gross hack', and allows us to implement X86ISelLowering::getPICJumpTableRelocBaseExpr which now calls it. This in turn allows us to eliminate the X86AsmPrinter::printPICJumpTableSetLabel method, which was the only overload of printPICJumpTableSetLabel. llvm-svn: 94526	2010-01-26 06:28:43 +00:00
Chris Lattner	273735bc5a	add a new MachineJumpTableInfo::getJTISymbol method, use it to implement the default TargetLowering::getPICJumpTableRelocBaseExpr llvm-svn: 94523	2010-01-26 05:58:28 +00:00
Chris Lattner	8a6c1eaabb	stub out a new target hook, need some refactoring before I can implement it. llvm-svn: 94521	2010-01-26 05:30:30 +00:00
Evan Cheng	555f61bf58	Implement cond ? -1 : 0 with sbb. llvm-svn: 94490	2010-01-26 02:00:44 +00:00
Dale Johannesen	d5575f29f1	Generate DEBUG_VALUE comments on x86. The (limited) dbg.declare's we currently generate go through both register allocators without perturbing the results. llvm-svn: 94480	2010-01-26 00:09:58 +00:00
Chris Lattner	b6db2c6b31	Rearrange handling of jump tables. Highlights: 1. MachineJumpTableInfo is now created lazily for a function the first time it actually makes a jump table instead of for every function. 2. The encoding of jump table entries is now described by the MachineJumpTableInfo::JTEntryKind enum. This enum is determined by the TLI::getJumpTableEncoding() hook, instead of by lots of code scattered throughout the compiler that "knows" that jump table entries are always 32-bits in pic mode (for example). 3. The size and alignment of jump table entries is now calculated based on their kind, instead of at machinefunction creation time. Future work includes using the EntryKind in more places in the compiler, eliminating other logic that "knows" the layout of jump tables in various situations. llvm-svn: 94470	2010-01-25 23:26:13 +00:00
Chris Lattner	823aed16f9	make -fno-rtti the default unless a directory builds with REQUIRES_RTTI. llvm-svn: 94378	2010-01-24 20:43:08 +00:00
Mon P Wang	4f45512c23	It seems better to scalarize vectors of size 1 instead of widening them. Add support to widen SETCC. llvm-svn: 94342	2010-01-24 00:24:43 +00:00
Mon P Wang	586d997e98	Improved widening loads by adding support for wider loads if the alignment allows. Fixed a bug where we didn't use a vector load/store for PR5626. llvm-svn: 94338	2010-01-24 00:05:03 +00:00
Bill Wendling	8cbc25d945	Remove the '-disable-scheduling' flag and replace it with the 'source' option of the '-pre-RA-sched' flag. It actually makes more sense to do it this way. Also, keep track of the SDNode ordering by default. Eventually, we would like to make this ordering a way to break a "tie" in the scheduler. However, doing that now breaks the "CodeGen/X86/abi-isel.ll" test for 32-bit Linux. llvm-svn: 94308	2010-01-23 10:26:57 +00:00
Evan Cheng	c22893a3b7	Enable pre-regalloc scheduling load clustering by default. llvm-svn: 94255	2010-01-22 23:49:45 +00:00
Chris Lattner	7ba0661f27	Stop building RTTI information for most llvm libraries. Notable missing ones are libsupport, libsystem and libvmcore. libvmcore is currently blocked on bugpoint, which uses EH. Once it stops using EH, we can switch it off. This #if 0's out 3 unit tests, because gtest requires RTTI information. Suggestions welcome on how to fix this. llvm-svn: 94164	2010-01-22 06:49:46 +00:00
Evan Cheng	9d92aaabf1	Teach pre-regalloc scheduler to schedule loads from nearby addresses. It may improve cache locality. This is controlled by -cluster-loads for now. llvm-svn: 94148	2010-01-22 03:36:51 +00:00
Evan Cheng	32bfe1e837	Trim unneeded includes. llvm-svn: 94105	2010-01-21 21:44:43 +00:00
Jim Grosbach	143f7eb4c8	back this out for now. Growing Function is not good. llvm-svn: 94097	2010-01-21 20:10:22 +00:00
Jim Grosbach	e029a6a5ed	Make sure that landing pad entries in the EH call site table are in the proper order for SjLj style exception handling. llvm-svn: 94055	2010-01-21 00:43:30 +00:00
David Greene	0985160c54	When XDEBUG is enabled, check for SelectionDAG cycles at some key points. This will help us find future problems like the one described in PR6019. llvm-svn: 94019	2010-01-20 20:13:31 +00:00
David Greene	3b2a68ceb8	Add some asserts to check SelectionDAG problems earlier. llvm-svn: 93960	2010-01-20 00:59:23 +00:00
Dan Gohman	954f49014d	Fold (add x, shl(0 - y, n)) -> sub(x, shl(y, n)), to simplify some code that SCEVExpander can produce when running on behalf of LSR. llvm-svn: 93949	2010-01-19 23:30:49 +00:00
David Greene	f1c7388b29	Add some new debugging APIs to print out "raw" SelectionDAGs to make understanding CannotYTetSelect and other errors easier. llvm-svn: 93901	2010-01-19 20:37:34 +00:00
Dale Johannesen	a3db6ef9a2	Revert 93811 per request. llvm-svn: 93818	2010-01-19 00:10:52 +00:00
Dale Johannesen	0c90d43b70	Enable code to emit dbg.declare as DEBUG_VALUE comments (fast isel, X86). This doesn't seem to break any functionality, but will introduce cases where -g affects the generated code. I'll be fixing that. llvm-svn: 93811	2010-01-18 23:34:55 +00:00
Evan Cheng	88b65bc835	Canonicalize -1 - x to ~x. Instcombine does this but apparently there are situations where this pattern will escape the optimizer and / or created by isel. Here is a case that's seen in JavaScriptCore: %t1 = sub i32 0, %a %t2 = add i32 %t1, -1 The dag combiner pattern: ((c1-A)+c2) -> (c1+c2)-A will fold it to -1 - %a. llvm-svn: 93773	2010-01-18 21:38:44 +00:00
Kenneth Uildriks	dd6ddd1aeb	When checking for sret-demotion, it needs to use legal types. When using the return value of an sret-demoted call, it needs to use possibly illegal types that match the declared Type of the callee. llvm-svn: 93667	2010-01-16 23:37:33 +00:00
David Greene	554039a914	Add some debug routines to SelectionDAG to dump full DAGs. print/dumpWithDepth allows one to dump a DAG up to N levels deep. dump/printWithFullDepth prints the whole DAG, subject to a depth limit on 100 in the default case (to prevent infinite recursion). Have CannotYetSelect to a dumpWithFullDepth so it is clearer exactly what the non-matching DAG looks like. llvm-svn: 93538	2010-01-15 19:43:23 +00:00
Victor Hernandez	b324e66f4c	Improve llvm.dbg.declare intrinsic by referring directly to the storage in its first argument, via function-local metadata (instead of via a bitcast). This patch also cleans up code that expects there to be a bitcast in the first argument and testcases that call llvm.dbg.declare. It also strips old llvm.dbg.declare intrinsics that did not pass metadata as the first argument. llvm-svn: 93531	2010-01-15 19:04:09 +00:00
Victor Hernandez	8d4904b639	Revert r93504 because older uses of llvm.dbg.declare intrinsics need to be auto-upgraded llvm-svn: 93515	2010-01-15 17:36:47 +00:00
Victor Hernandez	5d6551816b	Improve llvm.dbg.declare intrinsic by referring directly to the storage in its first argument, via function-local metadata (instead of via a bitcast). This patch also cleans up code that expects there to be a bitcast in the first argument and testcases that call llvm.dbg.declare. llvm-svn: 93504	2010-01-15 03:37:48 +00:00
Jim Grosbach	4f1b0ded75	fix 80-column violations llvm-svn: 93487	2010-01-15 00:36:15 +00:00
Dan Gohman	dd5286dc63	Fix a codegen abort seen in 483.xalancbmk. llvm-svn: 93417	2010-01-14 03:08:49 +00:00
Dan Gohman	d49763d200	Update a partially obsolete comment. llvm-svn: 93228	2010-01-12 04:32:35 +00:00
Dan Gohman	f9d6d53823	Fix a typo in a comment. llvm-svn: 93227	2010-01-12 04:30:26 +00:00
Jakob Stoklund Olesen	d2a1bee2d4	Avoid adding PHI arguments for a predecessor that has gone away when a BRCOND was constant folded. This fixes PR5980. llvm-svn: 93184	2010-01-11 21:02:33 +00:00
Mon P Wang	ec57c81e64	Disable transformation of select of two loads to a select of address and then a load if the loads are not in the default address space because the transformation discards src value info. llvm-svn: 93180	2010-01-11 20:12:49 +00:00
Dan Gohman	6bd3ef82ff	Revert an earlier change to SIGN_EXTEND_INREG for vectors. The VTSDNode really does need to be a vector type, because TargetLowering::getOperationAction for SIGN_EXTEND_INREG uses that type, and it needs to be able to distinguish between vectors and scalars. Also, fix some more issues with legalization of vector casts. llvm-svn: 93043	2010-01-09 02:13:55 +00:00
Evan Cheng	0c6defd577	Dan pointed out checking whether a node is dead by comparing its opcode to ISD::DELETED_NODE is not safe. Use a DAGUpdateListener to remove dead nodes from work list instead. llvm-svn: 93031	2010-01-09 00:21:08 +00:00
Evan Cheng	58ec4fec88	ReplaceAllUsesOfValueWith may delete other nodes that the one being replaced. Do not delete dead nodes again. llvm-svn: 92988	2010-01-08 02:36:12 +00:00
Chris Lattner	dab2cd543f	Fix rdar://7517201, a regression introduced by r92849. When folding a and(any_ext(load)) both the any_ext and the load have to have only a single use. This removes the anyext-uses.ll testcase which started failing because it is unreduced and unclear what it is testing. llvm-svn: 92950	2010-01-07 21:59:23 +00:00
Chris Lattner	88de38453f	factor this code better and reduce nesting at the same time, no functionality change. llvm-svn: 92948	2010-01-07 21:53:27 +00:00
Evan Cheng	16b75ce19c	APInt'fy TargetLowering::SimplifySetCC to fix PR5963. llvm-svn: 92943	2010-01-07 20:58:44 +00:00
Benjamin Kramer	cdb3889791	Use pop_back_val instead of back()+pop_back. llvm-svn: 92918	2010-01-07 17:27:56 +00:00
Evan Cheng	746012a6c1	Comment. llvm-svn: 92850	2010-01-06 19:43:21 +00:00
Evan Cheng	166a4e6caa	Teach dag combine to fold the following transformation more aggressively: (OP (trunc x), (trunc y)) -> (trunc (OP x, y)) Unfortunately this simple change causes dag combine to infinite looping. The problem is the shrink demanded ops optimization tend to canonicalize expressions in the opposite manner. That is badness. This patch disable those optimizations in dag combine but instead it is done as a late pass in sdisel. This also exposes some deficiencies in dag combine and x86 setcc / brcond lowering. Teach them to look pass ISD::TRUNCATE in various places. llvm-svn: 92849	2010-01-06 19:38:29 +00:00
Bill Wendling	c075acbb54	The previous code could potentially cause a cycle. Allow ordering w.r.t. a 0 order. llvm-svn: 92810	2010-01-06 00:23:35 +00:00
Bill Wendling	578865ff3d	Only check the ordering if there is an ordering for each nodes. llvm-svn: 92807	2010-01-06 00:09:23 +00:00
Bill Wendling	0a7056fe52	Add a semi-primitive form of scheduling via the "SDNode ordering" to the bottom-up scheduler. We prefer the lower order number. llvm-svn: 92806	2010-01-05 23:48:12 +00:00
Bill Wendling	03f0af372c	Don't assign the shift the same type as the variable being shifted. This could result in illegal types for the SHL operator. llvm-svn: 92797	2010-01-05 22:39:10 +00:00
Dan Gohman	404a984780	Don't use the ISD::NodeType enum for SDNode opcodes, as CodeGen uses several kinds of opcode values which are not declared within that enum. This fixes PR5946. llvm-svn: 92794	2010-01-05 22:26:32 +00:00
Benjamin Kramer	ccce8bae14	Avoid going through the LLVMContext for type equality where it's safe to dereference the type pointer. llvm-svn: 92726	2010-01-05 13:12:22 +00:00
Devang Patel	33f80d2303	Delete renaming use of dead dbg intrinsics. Intrinsic::dbg_stoppoint Intrinsic::dbg_region_start Intrinsic::dbg_region_end Intrinsic::dbg_func_start llvm-svn: 92672	2010-01-05 01:47:06 +00:00
David Greene	30ed3ca034	Change errs() to dbgs(). llvm-svn: 92597	2010-01-05 01:26:11 +00:00
David Greene	4eb5bed65b	Change errs() to dbgs(). llvm-svn: 92581	2010-01-05 01:25:11 +00:00
David Greene	d65bc15c81	Change errs() to dbgs(). llvm-svn: 92580	2010-01-05 01:25:09 +00:00
David Greene	6f021a30fe	Change errs() to dbgs(). llvm-svn: 92579	2010-01-05 01:25:04 +00:00
David Greene	fe5c3524c7	Change errs() to dbgs(). llvm-svn: 92578	2010-01-05 01:25:00 +00:00
David Greene	5730f203ee	Change errs() to dbgs(). llvm-svn: 92577	2010-01-05 01:24:57 +00:00
David Greene	f34d7ac9f1	Change errs() to dbgs(). llvm-svn: 92576	2010-01-05 01:24:54 +00:00
David Greene	ae4f266b2d	Change errs() to dbgs(). llvm-svn: 92575	2010-01-05 01:24:53 +00:00
David Greene	ec5883fc0e	Change errs() to dbgs(). llvm-svn: 92574	2010-01-05 01:24:50 +00:00
David Greene	807fcf6374	Change errs() to dbgs(). llvm-svn: 92573	2010-01-05 01:24:48 +00:00
David Greene	40deefdc4f	Change errs() to dbgs(). llvm-svn: 92572	2010-01-05 01:24:45 +00:00
David Greene	63145844c8	Change errs() to dbgs(). llvm-svn: 92571	2010-01-05 01:24:43 +00:00
David Greene	4cec475ed7	Change errs() to dbgs(). llvm-svn: 92570	2010-01-05 01:24:40 +00:00
David Greene	d93137dce7	Change errs() to dbgs(). llvm-svn: 92569	2010-01-05 01:24:36 +00:00
David Greene	7562faa4cf	Change errs() to dbgs(). llvm-svn: 92568	2010-01-05 01:24:34 +00:00
Dan Gohman	ea6f91ff64	Change SelectCode's argument from SDValue to SDNode , to make it more clear what information these functions are actually using. This is also a micro-optimization, as passing a SDNode around is simpler than passing a { SDNode *, int } by value or reference. llvm-svn: 92564	2010-01-05 01:24:18 +00:00
Dan Gohman	feeced4104	Use a pointer type rather than MVT::Other for the ExternalSymbol node used in an inline asm. llvm-svn: 92512	2010-01-04 21:00:54 +00:00
Chris Lattner	1eea3b0ada	Teach codegen to handle: (X != null) \| (Y != null) --> (X\|Y) != 0 (X == null) & (Y == null) --> (X\|Y) == 0 so that instcombine can stop doing this for pointers. This is part of PR3351, which is a case where instcombine doing this for pointers (inserting ptrtoint) is pessimizing code. llvm-svn: 92406	2010-01-02 00:00:03 +00:00
Chris Lattner	24576a5cf3	whitespace cleanup llvm-svn: 92404	2010-01-01 23:37:34 +00:00
Mikhail Glushenkov	5c35d2f6a4	Fix a warning on gcc 4.4. SelectionDAGBuilder.cpp:4294: warning: suggest explicit braces to avoid ambiguous ‘else’ llvm-svn: 92395	2010-01-01 04:41:36 +00:00
Mikhail Glushenkov	2abe1b70ac	Trailing whitespace, 80-col violations. llvm-svn: 92394	2010-01-01 04:41:22 +00:00
Chris Lattner	39f18e545e	Teach codegen to lower llvm.powi to an efficient (but not optimal) multiply sequence when the power is a constant integer. Before, our codegen for std::pow(.., int) always turned into a libcall, which was really inefficient. This should also make many gfortran programs happier I'd imagine. llvm-svn: 92388	2010-01-01 03:32:16 +00:00
Chris Lattner	8e805be369	remove a bunch of unneeded functions. llvm-svn: 92263	2009-12-29 09:32:19 +00:00
Chris Lattner	a0566979b7	Final step in the metadata API restructuring: move the getMDKindID/getMDKindNames methods to LLVMContext (and add convenience methods to Module), eliminating MetadataContext. Move the state that it maintains out to LLVMContext. llvm-svn: 92259	2009-12-29 09:01:33 +00:00
Chris Lattner	2f2aa2b067	This is a major cleanup of the instruction metadata interfaces that I asked Devang to do back on Sep 27. Instead of going through the MetadataContext class with methods like getMD() and getMDs(), just ask the instruction directly for its metadata with getMetadata() and getAllMetadata(). This includes a variety of other fixes and improvements: previously all Value*'s were bloated because the HasMetadata bit was thrown into value, adding a 9th bit to a byte. Now this is properly sunk down to the Instruction class (the only place where it makes sense) and it will be folded away somewhere soon. This also fixes some confusion in getMDs and its clients about whether the returned list is indexed by the MDID or densely packed. This is now returned sorted and densely packed and the comments make this clear. This introduces a number of fixme's which I'll follow up on. llvm-svn: 92235	2009-12-28 23:41:32 +00:00
Chris Lattner	7093946ab1	rename getMDKind -> getMDKindID, make it autoinsert if an MD Kind doesn't exist already, eliminate registerMDKind. Tidy up a bunch of random stuff. llvm-svn: 92225	2009-12-28 20:45:51 +00:00
Sanjiv Gupta	0b00a1b54e	Allow targets to specify the return type of libcalls that are generated for floating point comparisons, rather than hard-coding them as i32. llvm-svn: 92199	2009-12-28 02:40:33 +00:00
Bill Wendling	42bc7ad2b1	Remove dead store. llvm-svn: 92190	2009-12-28 01:51:30 +00:00
Bill Wendling	5b8d89d0a2	Remove dead variable. llvm-svn: 92189	2009-12-28 01:48:56 +00:00
Bill Wendling	9a62b467a8	Remove dead variable. llvm-svn: 92188	2009-12-28 01:47:48 +00:00
Bill Wendling	846ca9b38b	Remove dead variable. llvm-svn: 92180	2009-12-28 01:02:21 +00:00
Bill Wendling	7da8f90d41	Remove dead variable. llvm-svn: 92178	2009-12-28 01:00:12 +00:00
Chris Lattner	f5e3ed64d5	handle equality memcmp of 8 bytes on x86-64 with two unaligned loads and a compare. On other targets we end up with a call to memcmp because we don't want 16 individual byte loads. We should be able to use movups as well, but we're failing to select the generated icmp. llvm-svn: 92107	2009-12-24 01:07:17 +00:00
Chris Lattner	1a32ede6fd	move an optimization for memcmp out of simplifylibcalls and into SDISel. This optimization was causing simplifylibcalls to introduce type-unsafe nastiness. This is the first step, I'll be expanding the memcmp optimizations shortly, covering things that we really really wouldn't want simplifylibcalls to do. llvm-svn: 92098	2009-12-24 00:37:38 +00:00
Nuno Lopes	129819de71	move a few more symbols to .rodata llvm-svn: 92011	2009-12-23 17:48:10 +00:00
Dale Johannesen	a864a67185	Use more sensible type for flags in asms. PR 5570. Patch by Sylve`re Teissier (sorry, ASCII only). llvm-svn: 91988	2009-12-23 07:32:51 +00:00
Eric Christopher	fdb33458fc	Update objectsize intrinsic and associated dependencies. Fix lowering code and update testcases. llvm-svn: 91979	2009-12-23 02:51:48 +00:00
Bill Wendling	0602f39bb1	Remove superfluous SDNode ordering. llvm-svn: 91971	2009-12-23 01:28:19 +00:00
Bill Wendling	9df5c6dfc3	Remove node ordering from inline asm nodes. It's not needed. llvm-svn: 91961	2009-12-23 00:47:20 +00:00
Bill Wendling	91313064f1	Remove node ordering from VA nodes. It's not needed. llvm-svn: 91958	2009-12-23 00:44:51 +00:00
Bill Wendling	ef408db250	Revert r91949 r91942 and r91936. llvm-svn: 91953	2009-12-23 00:28:23 +00:00
Bill Wendling	54dd5398e0	Finish up node ordering in ExpandNode. llvm-svn: 91949	2009-12-23 00:05:09 +00:00
Bill Wendling	ad1fdf0e40	Assign ordering to nodes created in ExpandNode. Only roughly 1/2 of the function is finished. llvm-svn: 91942	2009-12-22 23:44:56 +00:00
Bill Wendling	70794596a8	Assign ordering to SDNodes in PromoteNode. Also fixing a subtle bug where BSWAP was using "Tmp1" in the first getNode call instead of Node->getOperand(0). llvm-svn: 91936	2009-12-22 22:53:39 +00:00
Bill Wendling	d85498132f	Allow 0 as an order number. Don't assign an order to formal arguments. llvm-svn: 91920	2009-12-22 21:35:02 +00:00
Bob Wilson	bac37abe73	Report an error for bad inline assembly, where the value passed for an "indirect" operand is not a pointer. llvm-svn: 91913	2009-12-22 18:34:19 +00:00
Bill Wendling	919b7aab2e	Add more plumbing. This time in the LowerArguments and "get" functions which return partial registers. This affected the back-end lowering code some. Also patch up some places I missed before in the "get" functions. llvm-svn: 91880	2009-12-22 02:10:19 +00:00
Bill Wendling	ac08758b71	Add SDNode ordering to inlined asm and VA functions. llvm-svn: 91876	2009-12-22 01:25:10 +00:00
Bill Wendling	f376c40d0e	Adding more assignment of ordering to SDNodes. This time in the "call" and generic copy functions. llvm-svn: 91872	2009-12-22 01:11:43 +00:00
Bill Wendling	a4d7df7a37	Add ordering of SDNodes to LowerCallTo. llvm-svn: 91866	2009-12-22 00:50:32 +00:00
Bill Wendling	b99b2693f3	Now add ordering to SDNodes created by the massive intrinsic lowering function. llvm-svn: 91863	2009-12-22 00:40:51 +00:00
Bill Wendling	ea3e73e596	To make things interesting, I added MORE code to set the ordering of SDNodes. This time in the load/store and limited-precision code. llvm-svn: 91860	2009-12-22 00:12:37 +00:00
Bill Wendling	c6b473433b	Add more plumbing to assign ordering to SDNodes. Have the "getValue" method assign the ordering when called. Combine some of the ordering assignments to keep things simple. llvm-svn: 91857	2009-12-21 23:47:40 +00:00
Bill Wendling	e79105b591	More ordering plumbing. This time for GEP. I need to remember to assign orderings to values returned by getValue(). llvm-svn: 91850	2009-12-21 23:10:19 +00:00
Bill Wendling	fff99f066b	Another incremental check-in for assigning ordering to SDNodes. This time for shuffle and insert vector. llvm-svn: 91847	2009-12-21 22:42:14 +00:00
Bill Wendling	443d0722b0	Assign ordering to more instructions. Incremental check-in. llvm-svn: 91846	2009-12-21 22:30:11 +00:00
Bill Wendling	28727f3785	- Add a bit more plumbing assigning an order to SDNodes. - Modify the "dump" method to emit the order of an SDNode. llvm-svn: 91845	2009-12-21 21:59:52 +00:00
Bill Wendling	7f5eb53ce2	First wave of plumbing for assigning an ordering to SDNodes. This takes care of a lot of the branching instructions. llvm-svn: 91838	2009-12-21 19:59:38 +00:00
Bill Wendling	6de55a0efd	Place SDNodeOrdering.h in the directory it's used. llvm-svn: 91834	2009-12-21 19:34:59 +00:00
Anton Korobeynikov	10590171fa	Use 4-arg getVTList) variant instead of generic one, when possible llvm-svn: 91744	2009-12-19 02:04:00 +00:00
Bill Wendling	022d18fa3f	Changes from review: - Move DisableScheduling flag into TargetOption.h - Move SDNodeOrdering into its own header file. Give it a minimal interface that doesn't conflate construction with storage. - Move assigning the ordering into the SelectionDAGBuilder. This isn't used yet, so there should be no functional changes. llvm-svn: 91727	2009-12-18 23:32:53 +00:00
Evan Cheng	b175de6356	Increase opportunities to optimize (brcond (srl (and c1), c2)). llvm-svn: 91717	2009-12-18 21:31:31 +00:00
Bob Wilson	3152b0471b	Handle ARM inline asm "w" constraints with 64-bit ("d") registers. The change in SelectionDAGBuilder is needed to allow using bitcasts to convert between f64 (the default type for ARM "d" registers) and 64-bit Neon vector types. Radar 7457110. llvm-svn: 91649	2009-12-18 01:03:29 +00:00
Ken Dyck	df5561db78	Introduce EVT::getHalfSizedIntegerVT() for use in ExpandUnalignedStore() in LegalizeDAG.cpp. Unlike the code it replaces, which simply decrements the simple type by one, getHalfSizedIntegerVT() searches for the smallest simple integer type that is at least half the size of the type it is called on. This approach has the advantage that it will continue working if a new value type (such as i24) is added to MVT. Also, in preparation for new value types, remove the assertions that non-power-of-2 8-bit-mutiple types are Extended when legalizing extload and truncstore operations. llvm-svn: 91614	2009-12-17 20:09:43 +00:00
Bob Wilson	1c00b6964f	Fix a comment grammaro. llvm-svn: 91584	2009-12-17 05:07:36 +00:00
Evan Cheng	aadf060b92	Revert this dag combine change: Fold (zext (and x, cst)) -> (and (zext x), cst) DAG combiner likes to optimize expression in the other way so this would end up cause an infinite looping. llvm-svn: 91574	2009-12-17 00:40:05 +00:00
Daniel Dunbar	b827e52638	Reapply r91392, it was only unmasking the bug, and since TOT is still broken having it reverted does no good. llvm-svn: 91560	2009-12-16 20:10:05 +00:00
Daniel Dunbar	df45b70c1e	Revert "Initial work on disabling the scheduler. This is a work in progress, and this", this broke llvm-gcc bootstrap for release builds on x86_64-apple-darwin10. llvm-svn: 91533	2009-12-16 10:56:02 +00:00
Evan Cheng	852c486946	Make 91378 more conservative. 1. Only perform (zext (shl (zext x), y)) -> (shl (zext x), y) when y is a constant. This makes sure it remove at least one zest. 2. If the shift is a left shift, make sure the original shift cannot shift out bits. llvm-svn: 91399	2009-12-15 03:00:32 +00:00
Bill Wendling	07beddceb7	Initial work on disabling the scheduler. This is a work in progress, and this stuff isn't used just yet. We want to model the GCC `-fno-schedule-insns' and `-fno-schedule-insns2' flags. The hypothesis is that the people who use these flags know what they are doing, and have hand-optimized the C code to reduce latencies and other conflicts. The idea behind our scheme to turn off scheduling is to create a map "on the side" during DAG generation. It will order the nodes by how they appeared in the code. This map is then used during scheduling to get the ordering. llvm-svn: 91392	2009-12-15 01:54:51 +00:00
Evan Cheng	d1521ef40c	Fold (zext (and x, cst)) -> (and (zext x), cst). llvm-svn: 91380	2009-12-15 00:52:11 +00:00
Evan Cheng	ca7c690d3b	Propagate zest through logical shift. llvm-svn: 91378	2009-12-15 00:41:36 +00:00
Dan Gohman	cecad35728	Fix integer cast code to handle vector types. llvm-svn: 91362	2009-12-14 23:40:38 +00:00
Dan Gohman	6453a4e2ab	Fix this to properly clear the FastISel debug location. Thanks to Bill for spotting this! llvm-svn: 91355	2009-12-14 23:08:09 +00:00
Anton Korobeynikov	94b6310136	Fix weird typo which leads to unallocated memory access for nodes with 4 results. llvm-svn: 91233	2009-12-13 01:00:59 +00:00
Dan Gohman	619a78bd59	Delete an unnecessary line. The VTSDNode on a SIGN_EXTEND_REG is never a vector type. llvm-svn: 91181	2009-12-11 23:26:08 +00:00
Dan Gohman	1d459e4937	Implement vector widening, splitting, and scalarizing for SIGN_EXTEND_INREG. llvm-svn: 91158	2009-12-11 21:31:27 +00:00
Dan Gohman	6d306bb32b	Fix the result type of SELECT nodes lowered from Select instructions with aggregate return values. This fixes PR5754. llvm-svn: 91145	2009-12-11 19:50:50 +00:00
Evan Cheng	d938faff4b	Teach InferPtrAlignment to infer GV+cst alignment and use it to simplify x86 isl lowering code. llvm-svn: 90925	2009-12-09 01:53:58 +00:00
Evan Cheng	f5938d5d27	Move isConsecutiveLoad to SelectionDAG. It's not target dependent and it's primary used by selectdag passes. llvm-svn: 90922	2009-12-09 01:36:00 +00:00
Evan Cheng	2d412f0cb8	Infer alignment for non-fixed stack object. llvm-svn: 90919	2009-12-09 01:17:24 +00:00
Evan Cheng	1750009f38	Add const qualifier. llvm-svn: 90918	2009-12-09 01:10:37 +00:00
Evan Cheng	34a23ea371	Refactor InferAlignment out of DAGCombine. llvm-svn: 90917	2009-12-09 01:04:59 +00:00
Anton Korobeynikov	1bcece70bd	Truncate the arguments of llvm.frameaddress / llvm.returnaddress intrinsics from i32 to platform's largest native type llvm-svn: 90741	2009-12-07 02:28:26 +00:00
Dan Gohman	35f5646ef0	Remove old DBG_LABEL code. llvm-svn: 90669	2009-12-05 17:56:26 +00:00
Dan Gohman	6e7073b846	Remove the unused DisableLegalizeTypes option and related code. llvm-svn: 90668	2009-12-05 17:51:33 +00:00
Dan Gohman	c82272a7b6	Don't blindly set the debug location for PHI node copies. llvm-svn: 90637	2009-12-05 01:29:04 +00:00
Dan Gohman	18f94469dc	Make TargetSelectInstruction protected and called from FastISel.cpp instead of SelectionDAGISel.cpp. llvm-svn: 90636	2009-12-05 01:27:58 +00:00
Dan Gohman	02578a3805	The debug information for an LLVM Instruction applies to that Instruction and that Instruction only. Implement this by setting the "current debug position" back to Unknown after processing each instruction. llvm-svn: 90632	2009-12-05 00:27:08 +00:00
Duncan Sands	1602277b70	Add note about a subtle bug in this code. Does not effect the main architectures that LLVM targets, because they don't use this code. llvm-svn: 90564	2009-12-04 08:42:17 +00:00
Duncan Sands	bbd6b6ddf4	Fix ExpandShiftWithUnknownAmountBit, which was completely bogus. Pointed out by Javier Martinez (who also provided a patch). Since this logic is not used on (for example) x86, I guess nobody noticed. Tested by generating SHL, SRL, SRA on various choices of i64 for all possible shift amounts, and comparing with gcc. Since I did this on x86-32, I had to force the use of ExpandShiftWithUnknownAmountBit. What I'm saying here is that I don't have a testcase I can add to the repository. llvm-svn: 90482	2009-12-03 21:37:32 +00:00
Nate Begeman	9655f84662	Don't pull vector sext through both hands of a logical operation, since doing so prevents the fusion of vector sext and setcc into vsetcc. Add a testcase for the above transformation. Fix a bogus use of APInt noticed while tracking this down. llvm-svn: 90423	2009-12-03 07:11:29 +00:00
Jakob Stoklund Olesen	32042f9475	Don't call getValueType() on a null SDValue llvm-svn: 90415	2009-12-03 05:15:35 +00:00
Chris Lattner	a48f44d9ee	improve portability to avoid conflicting with std::next in c++'0x. Patch by Howard Hinnant! llvm-svn: 90365	2009-12-03 00:50:42 +00:00
Dan Gohman	b2ae02979f	Add edge source labels to SelectionDAG graphs, now that the graph printing framework omits differentiated edge sources in the case where the labels are empty strings. llvm-svn: 90254	2009-12-01 19:20:00 +00:00
Dan Gohman	8def6e3daf	Minor cleanups. llvm-svn: 90253	2009-12-01 19:16:15 +00:00
Dan Gohman	939c828604	Trim an unnecessary #include. llvm-svn: 90252	2009-12-01 19:13:27 +00:00
Tobias Grosser	9caf3801ca	Fix last DOTGraphTraits problems in CompilationGraph. llvm-svn: 90136	2009-11-30 13:34:51 +00:00
Tobias Grosser	dd7f2e797f	Remove ShortNames from getNodeLabel in DOTGraphTraits llvm-svn: 90134	2009-11-30 12:38:47 +00:00
Tobias Grosser	90d334032a	Instantiate DefaultDOTGraphTraits llvm-svn: 90133	2009-11-30 12:38:13 +00:00
Mon P Wang	32f8bb9ed4	Added support to allow clients to custom widen. For X86, custom widen vectors for divide/remainder since these operations can trap by unroll them and adding undefs for the resulting vector. llvm-svn: 90108	2009-11-30 02:42:02 +00:00
Dan Gohman	de5dea869f	Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used. Note that "hasDotLocAndDotFile"-style debug info was already broken; people wanting this functionality should implement it in the AsmPrinter/DwarfWriter code. llvm-svn: 89711	2009-11-23 23:20:51 +00:00
Dan Gohman	9d72cbf2d5	Move CopyCatchInfo into FunctionLoweringInfo.cpp too, for consistency. llvm-svn: 89683	2009-11-23 18:12:11 +00:00
Dan Gohman	1a6c47f1cb	Rename SelectionDAGLowering to SelectionDAGBuilder, and rename SelectionDAGBuild.cpp to SelectionDAGBuilder.cpp. llvm-svn: 89681	2009-11-23 18:04:58 +00:00
Dan Gohman	91aad4b834	Move RegsForValue to an anonymous namespace, since it is only used in this file. llvm-svn: 89675	2009-11-23 17:46:23 +00:00
Dan Gohman	ad97b3dbd0	Move some more code out of SelectionDAGBuild.cpp and into FunctionLoweringInfo.cpp. llvm-svn: 89674	2009-11-23 17:42:46 +00:00
Ted Kremenek	9b6515794f	Update CMake file. llvm-svn: 89671	2009-11-23 17:26:04 +00:00
Dan Gohman	a3624b6099	Move the FunctionLoweringInfo class and some related utility functions out of SelectionDAGBuild.h/cpp into its own files, to help separate general lowering logic from SelectionDAG-specific lowering logic. llvm-svn: 89667	2009-11-23 17:16:22 +00:00
Devang Patel	ed85e12da6	We are not using DBG_STOPPOINT anymore. llvm-svn: 89536	2009-11-21 02:46:55 +00:00
Dale Johannesen	b91eba382d	When generating a vector the really slow way, via loads and stores, handle the case where the element size is not a valid target type correctly (PPC). llvm-svn: 89521	2009-11-21 00:53:23 +00:00
Dan Gohman	7a6611793f	Target-independent support for TargetFlags on BlockAddress operands, and support for blockaddresses in x86-32 PIC mode. llvm-svn: 89506	2009-11-20 23:18:13 +00:00
Duncan Sands	cc0a0cb4b7	Fix PR5558, which was caused by a wrong fix for PR3393 (see commit 63048), which was an expensive checks failure due to a bug in the checking. This patch in essence reverts the original fix for PR3393, and refixes it by a tweak to the way expensive checking is done. llvm-svn: 89454	2009-11-20 10:45:10 +00:00
Dan Gohman	20c8ab655e	Fix fast-isel to avoid selecting the return instruction if a tail call has been encountered. llvm-svn: 89444	2009-11-20 02:51:26 +00:00
Dan Gohman	82e80019a5	Remove the optimizations that convert BRCOND and BR_CC into unconditional branches or fallthroghes. Instcombine/SimplifyCFG should be simplifying branches with known conditions. This fixes some problems caused by these transformations not updating the MachineBasicBlock CFG. llvm-svn: 89017	2009-11-17 00:47:23 +00:00
Dan Gohman	6b3f32e6d7	Fix a typo in a comment. llvm-svn: 88953	2009-11-16 20:35:59 +00:00
Dan Gohman	a627e26d39	Enable the tail call optimization when the caller returns undef. llvm-svn: 88737	2009-11-14 02:06:30 +00:00
Dan Gohman	f80dc08059	Don't let a noalias difference disrupt the tailcall optimization. llvm-svn: 88672	2009-11-13 18:49:38 +00:00
Dale Johannesen	5f4eecf961	Adjust isConstantSplat to allow for big-endian targets. PPC is such a target; make it work. llvm-svn: 87060	2009-11-13 01:45:18 +00:00
David Greene	1fbe054450	Add a bool flag to StackObjects telling whether they reference spill slots. The AsmPrinter will use this information to determine whether to print a spill/reload comment. Remove default argument values. It's too easy to pass a wrong argument value when multiple arguments have default values. Make everything explicit to trap bugs early. Update all targets to adhere to the new interfaces.. llvm-svn: 87022	2009-11-12 20:49:22 +00:00
Benjamin Kramer	68e4945c03	Add compare_lower and equals_lower methods to StringRef. Switch all users of StringsEqualNoCase (from StringExtras.h) to it. llvm-svn: 87020	2009-11-12 20:36:59 +00:00
Devang Patel	2904aa9f6e	"Attach debug info with llvm instructions" mode was enabled a month ago. Now make it permanent and remove old way of inserting intrinsics to encode debug info for line number and scopes. llvm-svn: 87014	2009-11-12 19:02:56 +00:00
Kenneth Uildriks	9f34406a90	x86 users can now return arbitrary sized structs. Structs too large to fit in return registers will be returned through a hidden sret parameter introduced during SelectionDAG construction. llvm-svn: 86876	2009-11-11 19:59:24 +00:00
Dale Johannesen	6f7d5b22bb	Emit correct code when making a ConstantPool entry for a vector constant whose component type is not a legal type for the target. (If the target ConstantPool cannot handle this type either, it has an opportunity to merge elements. In practice any target with 8-bit bytes must support i8 as data). 7320806 (partial). llvm-svn: 86751	2009-11-10 23:16:41 +00:00
Devang Patel	f6eeaebd76	Implement support to debug inlined functions. llvm-svn: 86748	2009-11-10 23:06:00 +00:00
Duncan Sands	dca0c28452	Codegen support for the llvm.invariant/lifetime.start/end intrinsics: just throw them away. llvm-svn: 86678	2009-11-10 09:08:09 +00:00
Dan Gohman	a951526510	Remove an unneeded #include. llvm-svn: 86601	2009-11-09 22:28:30 +00:00
Mike Stump	f04c4cdb27	Fix for 64-bit builds. llvm-svn: 86600	2009-11-09 22:28:21 +00:00
Evan Cheng	ad7c6124e7	Hide a couple of options. llvm-svn: 86522	2009-11-09 06:49:37 +00:00
Anton Korobeynikov	f93bb39b03	Add 8 bit libcalls and make use of them for msp430 llvm-svn: 86384	2009-11-07 17:14:39 +00:00
Chris Lattner	8e1d7222a7	Fix PR5421 by APInt'izing switch lowering. llvm-svn: 86354	2009-11-07 07:50:34 +00:00
Mon P Wang	fc032ced22	Fix memoizing of CvtRndSatSDNode llvm-svn: 86340	2009-11-07 04:46:25 +00:00
Kenneth Uildriks	07119737aa	Add code to check at SelectionDAGISel::LowerArguments time to see if return values can be lowered to registers. Coming soon, code to perform sret-demotion if return values cannot be lowered to registers llvm-svn: 86324	2009-11-07 02:11:54 +00:00
Dan Gohman	43bdc260d6	Avoid printing a redundant space in SDNode->dump(). llvm-svn: 86151	2009-11-05 18:49:11 +00:00
Dan Gohman	34341e69c4	Make -print-machineinstrs more readable. - Be consistent when referring to MachineBasicBlocks: BB#0. - Be consistent when referring to virtual registers: %reg1024. - Be consistent when referring to unknown physical registers: %physreg10. - Be consistent when referring to known physical registers: %RAX - Be consistent when referring to register 0: %reg0 - Be consistent when printing alignments: align=16 - Print jump table contents. - Don't print host addresses, in general. - and various other cleanups. llvm-svn: 85682	2009-10-31 20:19:03 +00:00
Dan Gohman	ba8735d25a	When discarding SrcValue information, discard all of it so that code that uses this information knows to behave conservatively. llvm-svn: 85654	2009-10-31 14:14:04 +00:00
Eric Christopher	a0ca9e944f	Fix warning with gcc-4.0 and signed/unsigned. llvm-svn: 85648	2009-10-31 09:24:35 +00:00
Dan Gohman	d814e32e57	Don't mark registers dead here when processing nodes with MVT::Flag results. This works around a problem affecting targets which rely on MVT::Flag to handle physical register defs. llvm-svn: 85638	2009-10-30 23:57:47 +00:00
Dan Gohman	6c9388011b	Initial target-independent CodeGen support for BlockAddresses. llvm-svn: 85556	2009-10-30 01:27:03 +00:00
Dan Gohman	05efd893db	Remove some unnecessary spaces in debug output. llvm-svn: 85536	2009-10-29 23:30:06 +00:00
Dan Gohman	554a75a973	Move some code from being emitted as boilerplate duplicated in every *ISelDAGToDAG.cpp to being regular code in SelectionDAGISel.cpp. llvm-svn: 85530	2009-10-29 22:30:23 +00:00
Dan Gohman	453d64c9f5	Rename usesCustomDAGSchedInserter to usesCustomInserter, and update a bunch of associated comments, because it doesn't have anything to do with DAGs or scheduling. This is another step in decoupling MachineInstr emitting from scheduling. llvm-svn: 85517	2009-10-29 18:10:34 +00:00
Eric Christopher	1fd4c577d2	Make sure we return the right sized type here. llvm-svn: 85436	2009-10-28 21:32:16 +00:00
Dan Gohman	14ca753e28	Don't call SDNode::isPredecessorOf when it isn't necessary. If the load's chains have no users, they can't be predecessors of the condition. llvm-svn: 85394	2009-10-28 15:28:02 +00:00
Dan Gohman	cd139c0373	Rewrite SelectionDAG::isPredecessorOf to be iterative instead of recursive to avoid consuming extraordinary amounts of stack space when processing tall graphs. llvm-svn: 85369	2009-10-28 03:44:30 +00:00
Evan Cheng	83896a59e1	Add a second ValueType argument to isFPImmLegal. llvm-svn: 85361	2009-10-28 01:43:28 +00:00
Dan Gohman	4b46cbfc23	Mark dead physregdefs dead immediately. This helps MachineSink and MachineLICM and other things which run before LiveVariables is run. llvm-svn: 85360	2009-10-28 01:13:53 +00:00
Chris Lattner	d04cb6d0fa	rename indbr -> indirectbr to appease the residents of #llvm. llvm-svn: 85351	2009-10-28 00:19:10 +00:00
Dan Gohman	a5e078b677	Update the MachineBasicBlock CFG for an indirect branch. llvm-svn: 85325	2009-10-27 22:10:34 +00:00
Dan Gohman	a4374e66f0	Add CodeGen support for indirect branches. llvm-svn: 85323	2009-10-27 21:56:26 +00:00
Chris Lattner	26076a8f10	don't use stdio llvm-svn: 85296	2009-10-27 20:42:54 +00:00
Evan Cheng	16993aa30b	Do away with addLegalFPImmediate. Add a target hook isFPImmLegal which returns true if the fp immediate can be natively codegened by target. llvm-svn: 85281	2009-10-27 19:56:55 +00:00
Chris Lattner	3ed871fe62	add enough support for indirect branch for the feature test to pass (assembler,asmprinter, bc reader+writer) and document it. Codegen currently aborts on it. llvm-svn: 85274	2009-10-27 19:13:16 +00:00
Chris Lattner	0997991252	pseudosourcevalue is also still using getGlobalContext(), so it isn't thread safe either. llvm-svn: 85253	2009-10-27 17:02:08 +00:00
Eric Christopher	7a50b280c1	Add objectsize intrinsic and hook it up through codegen. Doesn't do anything than return "I don't know" at the moment. llvm-svn: 85189	2009-10-27 00:52:25 +00:00
Victor Hernandez	de5ad42aa1	Remove FreeInst. Remove LowerAllocations pass. Update some more passes to treate free calls just like they were treating FreeInst. llvm-svn: 85176	2009-10-26 23:43:48 +00:00
Nick Lewycky	974e12b2d3	Remove includes of Support/Compiler.h that are no longer needed after the VISIBILITY_HIDDEN removal. llvm-svn: 85043	2009-10-25 06:57:41 +00:00
Nick Lewycky	02d5f77d26	Remove VISIBILITY_HIDDEN from class/struct found inside anonymous namespaces. Chris claims we should never have visibility_hidden inside any .cpp file but that's still not true even after this commit. llvm-svn: 85042	2009-10-25 06:33:48 +00:00
Dan Gohman	4ef112be62	APInt-ify the gep scaling code, so that it correctly handles the case where the scale overflows pointer-sized arithmetic. This fixes PR5281. llvm-svn: 84954	2009-10-23 17:57:43 +00:00
Anton Korobeynikov	8626367e38	Fix null pointer dereference. llvm-svn: 84806	2009-10-22 00:15:17 +00:00
Anton Korobeynikov	a6faf60831	Fix invalid for vector types fneg(bitconvert(x)) => bitconvert(x ^ sign) transform. llvm-svn: 84683	2009-10-20 21:37:45 +00:00
Evan Cheng	0e9d9ca855	-Revert parts of 84326 and 84411. Distinquishing between fixed and non-fixed stack slots and giving them different PseudoSourceValue's did not fix the problem of post-alloc scheduling miscompiling llvm itself. - Apply Dan's conservative workaround by assuming any non fixed stack slots can alias other memory locations. This means a load from spill slot #1 cannot move above a store of spill slot #2. - Enable post-alloc scheduling for x86 at optimization leverl Default and above. llvm-svn: 84424	2009-10-18 18:16:27 +00:00
Evan Cheng	0b8db2dab7	Only fixed stack objects and spill slots should be get FixedStack PseudoSourceValue. llvm-svn: 84411	2009-10-18 06:27:36 +00:00
Evan Cheng	8759585aba	Revert 84315 for now. Re-thinking the patch. llvm-svn: 84321	2009-10-17 07:53:04 +00:00
Evan Cheng	0818d87ed1	Rename getFixedStack to getStackObject. The stack objects represented are not necessarily fixed. Only those will negative frame indices are "fixed." llvm-svn: 84315	2009-10-17 06:22:26 +00:00
Evan Cheng	a6e4db8ff7	80 col violation. llvm-svn: 84311	2009-10-17 06:05:11 +00:00
Dan Gohman	650997fb0b	Delete an obsolete comment. llvm-svn: 84300	2009-10-17 01:37:38 +00:00
Victor Hernandez	a3aaf85e23	Remove MallocInst from LLVM Instructions. llvm-svn: 84299	2009-10-17 01:18:07 +00:00
Mon P Wang	b1baaf5ab9	Allow widening of extract subvector llvm-svn: 84279	2009-10-16 22:05:48 +00:00
Zhongxing Xu	47062ce503	Indent code. llvm-svn: 84247	2009-10-16 05:42:28 +00:00
Jakob Stoklund Olesen	e4197250cc	Report errors correctly for unselected target intrinsics. llvm-svn: 84193	2009-10-15 18:50:03 +00:00
Duncan Sands	8e6ccb65df	I don't see any point in having both eh.selector.i32 and eh.selector.i64, so get rid of eh.selector.i64 and rename eh.selector.i32 to eh.selector. Likewise for eh.typeid.for. This aligns us with gcc, which always uses a 32 bit value for the selector on all platforms. My understanding is that the register allocator used to assert if the selector intrinsic size didn't match the pointer size, and this was the reason for introducing the two variants. However my testing shows that this is no longer the case (I fixed some bugs in selector lowering yesterday, and some more today in the fastisel path; these might have caused the original problems). llvm-svn: 84106	2009-10-14 16:11:37 +00:00
Devang Patel	d7ebfe3963	s/DebugLoc.CompileUnit/DebugLoc.Scope/g s/DebugLoc.InlinedLoc/DebugLoc.InlinedAtLoc/g llvm-svn: 84054	2009-10-13 23:28:53 +00:00
Duncan Sands	18a956cb4a	Introduce new convenience methods for sign extending or truncating an SDValue (depending on whether the target type is bigger or smaller than the value's type); or zero extending or truncating it. Use it in a few places (this seems to be a popular operation, but I only modified cases of it in SelectionDAGBuild). In particular, the eh_selector lowering was doing this wrong due to a repeated rather than inverted test, fixed with this change. llvm-svn: 84027	2009-10-13 21:04:12 +00:00
Devang Patel	0af2a420cd	Set default location for a function if it is not set. llvm-svn: 83921	2009-10-12 23:10:55 +00:00
Nate Begeman	a3ed9edd40	More heuristics for Combiner-AA. Still catches all important cases, but compile time penalty on gnugo, the worst case in MultiSource, is down to about 2.5% from 30% llvm-svn: 83824	2009-10-12 05:53:58 +00:00
Dan Gohman	b8120770b4	Create a new InstrEmitter class for translating SelectionDAG nodes into MachineInstrs. This is mostly just moving the code from ScheduleDAGSDNodesEmit.cpp into a new class. This decouples MachineInstr emitting from scheduling. llvm-svn: 83699	2009-10-10 01:32:21 +00:00
Dan Gohman	a22f2d8614	Make getMachineNode return a MachineSDNode* instead of a generic SDNode* since it won't do any folding. This will help avoid some inconvenient casting. llvm-svn: 83698	2009-10-10 01:29:16 +00:00
Dan Gohman	918ec53c64	The ScheduleDAG framework now requires an AliasAnalysis argument, though it isn't needed in the ScheduleDAGSDNodes schedulers. llvm-svn: 83691	2009-10-09 23:33:48 +00:00
Devang Patel	df45c7f642	Extract scope information from the variable itself, instead of relying on alloca or llvm.dbg.declare location. While recording beginning of a function, use scope info from the first location entry instead of just relying on first location entry itself. llvm-svn: 83684	2009-10-09 22:42:28 +00:00
Bob Wilson	2a45a65511	Add a SelectionDAG getTargetInsertSubreg convenience function, similar to getTargetExtractSubreg. llvm-svn: 83564	2009-10-08 18:49:46 +00:00
Devang Patel	4598eb6214	Add support to handle debug info attached to an instruction. This is not yet enabled. llvm-svn: 83400	2009-10-06 18:37:31 +00:00
Devang Patel	bb802206d2	Set default location for the function if it is not already set. This code is not yet enabled. llvm-svn: 83349	2009-10-06 00:09:08 +00:00
Devang Patel	4dbca6dfd4	If location info is attached with an instruction then keep track of alloca slots used by a variable. This info will be used by AsmPrinter to emit debug info for variables. llvm-svn: 83189	2009-10-01 01:03:26 +00:00
Devang Patel	3256c751f5	Use MDNode * directly as an RecordSourceLine() argument. llvm-svn: 83182	2009-09-30 22:51:28 +00:00
Reid Kleckner	cea8dab1d1	Silence comparison always false warning in -Asserts mode. llvm-svn: 83164	2009-09-30 20:43:07 +00:00
Reid Kleckner	8ff5c19ebd	Fix integer overflow in instruction scheduling. This can happen if we have basic blocks that are so long that their size overflows a short. Also assert that overflow does not happen in the future, as requested by Evan. This fixes PR4401. llvm-svn: 83159	2009-09-30 20:15:38 +00:00
Devang Patel	5d58383ea9	Remove unnecessary cast. llvm-svn: 83100	2009-09-29 19:56:13 +00:00
Devang Patel	2d85eef974	s/class Metadata/class MetadataContext/g llvm-svn: 83019	2009-09-28 21:41:20 +00:00
Devang Patel	b1a4477f1f	Do not use global typedef for MDKindID. llvm-svn: 83016	2009-09-28 21:14:55 +00:00
Dan Gohman	6905f15256	Use VerifySchedule instead of doing the work manually. llvm-svn: 82995	2009-09-28 16:09:41 +00:00
Dan Gohman	832800aa6f	Convert comparisons like (x == infinity) to (x >= infinity) on targets where FCMP_OEQ is not legal and FCMP_OGE is, such as x86. llvm-svn: 82861	2009-09-26 15:24:17 +00:00
Dan Gohman	48b185d6f7	Improve MachineMemOperand handling. - Allocate MachineMemOperands and MachineMemOperand lists in MachineFunctions. This eliminates MachineInstr's std::list member and allows the data to be created by isel and live for the remainder of codegen, avoiding a lot of copying and unnecessary translation. This also shrinks MemSDNode. - Delete MemOperandSDNode. Introduce MachineSDNode which has dedicated fields for MachineMemOperands. - Change MemSDNode to have a MachineMemOperand member instead of its own fields with the same information. This introduces some redundancy, but it's more consistent with what MachineInstr will eventually want. - Ignore alignment when searching for redundant loads for CSE, but remember the greatest alignment. Target-specific code which previously used MemOperandSDNodes with generic SDNodes now use MemIntrinsicSDNodes, with opcodes in a designated range so that the SelectionDAG framework knows that MachineMemOperand information is available. llvm-svn: 82794	2009-09-25 20:36:54 +00:00
Dan Gohman	32f71d714b	Rename getTargetNode to getMachineNode, for consistency with the naming scheme used in SelectionDAG, where there are multiple kinds of "target" nodes, but "machine" nodes are nodes which represent a MachineInstr. llvm-svn: 82790	2009-09-25 18:54:59 +00:00
Dale Johannesen	a318d91a1e	Make sure sin, cos, sqrt calls are marked readonly before producing FSIN, FCOS, FSQRT. If they aren't so marked we have to assume they might set errno. llvm-svn: 82781	2009-09-25 18:00:35 +00:00
Dale Johannesen	c72134269f	Generate FSQRT from calls to the sqrt function, which allows appropriate backends to generate a sqrt instruction. On x86, this isn't done at -O0 because we go through FastISel instead. This is a behavior change from before this series of sqrt patches started. I think this is OK considering that compile speed is most important at -O0, but could be convinced otherwise. llvm-svn: 82778	2009-09-25 17:23:22 +00:00
Nate Begeman	18150d5abc	Fix combiner-aa issue with bases which are different, but can alias. Previously, it treated GV+28 GV+0 as different bases, and assumed they could not alias. llvm-svn: 82753	2009-09-25 06:05:26 +00:00
Dan Gohman	ebdfe4af62	Add a version of dumpr() that has a SelectionDAG* argument. llvm-svn: 82742	2009-09-25 00:34:34 +00:00
Dan Gohman	203d53ed79	Use getStoreSize() instead of getStoreSizeInBits()/8. llvm-svn: 82656	2009-09-23 21:07:02 +00:00
Dan Gohman	08c0a95ac6	Rename several variables from EVT to more descriptive names, now that EVT is also the name of their type, as declarations like "EVT EVT" look really odd. llvm-svn: 82654	2009-09-23 21:02:20 +00:00
Dan Gohman	c0353bfff5	Give MachineMemOperand an operator<<, factoring out code from two different places for printing MachineMemOperands. Drop the virtual from Value::dump and instead give Value a protected virtual hook that can be overridden by subclasses to implement custom printing. This lets printing be more consistent, and simplifies printing of PseudoSourceValue values. llvm-svn: 82599	2009-09-23 01:33:16 +00:00
Dan Gohman	e7c8242baa	Change MachineMemOperand's alignment value to be the alignment of the base pointer, without the offset. This matches MemSDNode's new alignment behavior, and holds more interesting information. llvm-svn: 82473	2009-09-21 19:47:04 +00:00
Chris Lattner	bb1a1bd2bd	tidy up llvm-svn: 82397	2009-09-20 17:32:21 +00:00
Daniel Dunbar	7d6781b0fe	Tabs -> spaces, and remove trailing whitespace. llvm-svn: 82355	2009-09-20 02:20:51 +00:00
Evan Cheng	9827ad39a7	Fix PR4926. When target hook EmitInstrWithCustomInserter() insert new basic blocks and update CFG, it should also inform sdisel of the changes so the phi source operands will come from the right basic blocks. llvm-svn: 82311	2009-09-19 09:51:03 +00:00
Evan Cheng	270d0f986f	Enhance EmitInstrWithCustomInserter() so target can specify CFG changes that sdisel will use to properly complete phi nodes. Not functionality change yet. llvm-svn: 82273	2009-09-18 21:02:19 +00:00
Chris Lattner	e133923abe	duncan points out the EH selector values are signed. llvm-svn: 82245	2009-09-18 18:34:29 +00:00
Evan Cheng	f4db6396e0	Revert r82214. It broke 403.gcc on x86_64 / Darwin. llvm-svn: 82215	2009-09-18 08:26:06 +00:00
Evan Cheng	6ba1931d60	Fix a bug in sdisel switch lowering code. When it updates the phi nodes in switch successor blocks, it can introduce multiple phi operands of the same value from different blocks (and may not be on the predecessor list). This can be seen on CodeGen/Generic/2006-09-06-SwitchLowering.ll. But it's not known to cause any real regression (but I have added an assertion for it now). llvm-svn: 82214	2009-09-18 08:16:04 +00:00
Chris Lattner	1bd81314e7	tolerate llvm.eh.selector.i64 on 32-bit systems and llvm.eh.selector.i32 on 64-bit systems. llvm-svn: 82180	2009-09-17 23:54:54 +00:00
Devang Patel	44b3a87f78	Fix typo. llvm-svn: 82080	2009-09-16 21:09:07 +00:00
Devang Patel	852c9b6627	At iSel time, update DebugLoc based on debug info attached with an instruction. llvm-svn: 82077	2009-09-16 20:39:11 +00:00
Nate Begeman	fbb88b180c	Do not add the SVOffset to the Node CSE ID. The same pointer argument cannot have different SVOffsets. llvm-svn: 81937	2009-09-15 22:30:11 +00:00
Nate Begeman	178135c88b	Better solution for tracking both the original alignment of the access, and the current alignment based on the source value offset. This avoids increasing the size of mem nodes. llvm-svn: 81897	2009-09-15 19:05:41 +00:00
Nate Begeman	d41f8fd2b3	Remove incorrect CSE code from r81813. llvm-svn: 81819	2009-09-15 00:38:09 +00:00
Nate Begeman	879d8f1c3e	Substantially speed up combiner-aa in the following ways: 1. Switch from an std::set to a SmallPtrSet for visited chain nodes. 2. Do not force the recursive flattening of token factor nodes, regardless of use count. 3. Immediately process newly created TokenFactor nodes. Also, improve combiner-aa by teaching it that loads to non-overlapping offsets of relatively aligned objects cannot alias. These changes result in a >5x speedup for combiner-aa on most testcases. llvm-svn: 81816	2009-09-15 00:18:30 +00:00
Nate Begeman	01c1e1152d	Teach the legalizer to propagate the original alignment of loads and store when it splits them. llvm-svn: 81815	2009-09-15 00:14:28 +00:00
Nate Begeman	02a685a914	Add an "original alignment" field to load and store nodes. This enables the DAG Combiner to disambiguate chains for loads and stores of types which are broken up by the Legalizer into smaller pieces. llvm-svn: 81813	2009-09-15 00:13:12 +00:00
Chris Lattner	0bad631cde	kill off the last use of TRI::AsmName. llvm-svn: 81727	2009-09-13 22:42:03 +00:00
Dan Gohman	9cbef32726	Make fast-isel try ISD::FNEG before resorting to bitcasts and xors. llvm-svn: 81493	2009-09-11 00:36:43 +00:00
Dan Gohman	89b090e51e	Reapply r81171 with a fix: don't try to use i64 when it isn't legal. llvm-svn: 81492	2009-09-11 00:34:46 +00:00
Bob Wilson	39f51320ca	Don't swap the operands of a subtraction when trying to create a post-decrement load/store. llvm-svn: 81464	2009-09-10 22:09:31 +00:00
Bob Wilson	59e4c84c6f	Revert r81171 which was causing pr4927. llvm-svn: 81415	2009-09-10 00:49:22 +00:00
Dan Gohman	16ad903fcf	When widening a vector load, use the correct chain. This fixes PR4891. llvm-svn: 81343	2009-09-09 14:22:57 +00:00
Chris Lattner	e819cfbc71	change selectiondag to add the sign extended versions of immediate operands to instructions instead of zero extended ones. This makes the asmprinter print signed values more consistently. This apparently only really affects the X86 backend. llvm-svn: 81265	2009-09-08 23:05:44 +00:00
Dan Gohman	f4a0f0f033	Fix an abort on a store of an empty struct member. getValue returns null in the case of an empty struct, so don't try to call getNumValues on it. llvm-svn: 81180	2009-09-08 01:44:02 +00:00
Dan Gohman	2512a42548	Fix a thinko: When lowering fneg with xor, bitcast the operands from floating-point to integer first, and bitcast the result back to floating-point. Previously, this test was passing by falling back to SelectionDAG lowering. The resulting code isn't as nice, but it's correct and CodeGen now stays on the fast path. llvm-svn: 81171	2009-09-07 23:47:14 +00:00
Duncan Sands	3ee3c174b1	Simplify. Testing shows that this is not equivalent to BBI = CR.CaseBB + 1. llvm-svn: 81124	2009-09-06 18:03:32 +00:00
Duncan Sands	89720bbd11	Remove some not-really-used variables, as warned about by icc (#593, partial). Patch by Erick Tryzelaar. llvm-svn: 81115	2009-09-06 12:41:19 +00:00
Duncan Sands	2fbeaf084f	Remove some unused variables and methods warned about by icc (#177, partial). Patch by Erick Tryzelaar. llvm-svn: 81106	2009-09-06 08:33:48 +00:00
Devang Patel	f03667e20e	Detect VLAs. Do not use DenseMap operator[] because it inserts new entry if lookup fails. Use find() to check an entry in a DenseMap first. llvm-svn: 81058	2009-09-05 00:34:14 +00:00
Dan Gohman	aa92dc1e61	LLVM currently represents floating-point negation as -0.0 - x. Fix FastISel to recognize this pattern and emit a floating-point negation using xor. llvm-svn: 80963	2009-09-03 22:53:57 +00:00
Dan Gohman	d0d5e685da	Recognize more opportunities to use SSE min and max instructions, swapping the operands if necessary. llvm-svn: 80940	2009-09-03 20:34:31 +00:00
Sandeep Patel	68c5f477fa	Retype from unsigned to CallingConv::ID accordingly. Approved by Bob Wilson. llvm-svn: 80773	2009-09-02 08:44:58 +00:00
Daniel Dunbar	f7a14aa43d	Remove Offset from ExternalSybmol MachineOperands, this is unused (and at least partly unsupported, in X86 encoding at least). llvm-svn: 80726	2009-09-01 22:06:46 +00:00
Devang Patel	80ae34974b	Reapply 79977. Use MDNodes to encode debug info in llvm IR. llvm-svn: 80406	2009-08-28 23:24:31 +00:00
Anton Korobeynikov	50509fc2cb	Add extload expansion for f128 llvm-svn: 80116	2009-08-26 17:39:40 +00:00
Devang Patel	f08e35d9dc	Revert 79977. It causes llvm-gcc bootstrap failures on some platforms. llvm-svn: 80073	2009-08-26 05:01:18 +00:00
Owen Anderson	3b1665eca5	Get rid of this horrible "benign race" by exploiting ManagedStatic to initialize the array on its first access. llvm-svn: 80040	2009-08-25 22:27:22 +00:00
Devang Patel	02aac922b4	Update DebugInfo interface to use metadata, instead of special named llvm.dbg.... global variables, to encode debugging information in llvm IR. This is mostly a mechanical change that tests metadata support very well. This change speeds up llvm-gcc by more then 6% at "-O0 -g" (measured by compiling InstructionCombining.cpp!) llvm-svn: 79977	2009-08-25 05:24:07 +00:00
Daniel Dunbar	34ee203337	Fix some refactos for iostream changes (in -Asserts mode). - The world needs better C++ refactoring tools, can I get an Amen!? llvm-svn: 79843	2009-08-23 08:50:52 +00:00
Chris Lattner	317dbbcfb1	eliminate uses of cerr() llvm-svn: 79834	2009-08-23 07:05:07 +00:00
Chris Lattner	4dc3edde9f	remove a few DOUTs here and there. llvm-svn: 79832	2009-08-23 06:35:02 +00:00
Chris Lattner	1362602eb2	Change Pass::print to take a raw ostream instead of std::ostream, update all code that this affects. llvm-svn: 79830	2009-08-23 06:03:38 +00:00
Eli Friedman	79ba8f2edc	Add check for completeness. Note that this doesn't actually have any effect with the way the current code is structured. llvm-svn: 79792	2009-08-23 00:14:19 +00:00
Chris Lattner	7b26fce23e	Rename TargetAsmInfo (and its subclasses) to MCAsmInfo. llvm-svn: 79763	2009-08-22 20:48:53 +00:00
Devang Patel	0939595711	Record variable debug info at ISel time directly. llvm-svn: 79742	2009-08-22 17:12:53 +00:00
Owen Anderson	63010bb65a	Reapply r79708 with the appropriate fix for the case that still requires locking. llvm-svn: 79731	2009-08-22 06:32:36 +00:00
Chris Lattner	56d60eaa61	revert r79708 + r79711 llvm-svn: 79720	2009-08-22 04:07:34 +00:00
Eric Christopher	677c2287da	Actually remove unused static. Previous commit removed trailing whitespace. llvm-svn: 79711	2009-08-22 00:41:47 +00:00
Eric Christopher	dfda92b76e	Remove unused static. llvm-svn: 79710	2009-08-22 00:40:45 +00:00
Owen Anderson	8e2456c254	Ease contention on this lock by noticing that all writes to the VTs array will be of (dynamically) constant values, so races on it are immaterial. We just need to ensure that at least one write has completed before return the pointer into it. With this change, parllc exhibits essentially no overhead on 403.gcc. llvm-svn: 79708	2009-08-22 00:29:12 +00:00
Bill Wendling	dff54eff8e	Fix typo. Should check both values of RangeUse for 0. Patch by Marius Wachtler. llvm-svn: 79649	2009-08-21 18:16:06 +00:00
Dan Gohman	ac33a9061d	Add an x86 peep that narrows TEST instructions to forms that use a smaller encoding. These kinds of patterns are very frequent in sqlite3, for example. llvm-svn: 79439	2009-08-19 18:16:17 +00:00
David Goodwin	9b48cd4899	Use the schedule itinerary operand use/def cycle information to adjust dependence edge latency for post-RA scheduling. llvm-svn: 79425	2009-08-19 16:08:58 +00:00
Eli Friedman	1e008c173a	PR4737: Fix a nasty bug in load narrowing with non-power-of-two types. llvm-svn: 79415	2009-08-19 08:46:10 +00:00
Dan Gohman	2fa67c9f70	Be tidy and use a break to exit from a switch block rather than just falling through the end. llvm-svn: 79383	2009-08-18 23:52:48 +00:00
Dan Gohman	4906f73a9f	Legalize the shift amount operand of SRL_PARTS, SHL_PARTS, and SRA_PARTS, as is done for SRL, SHL, and SRA. llvm-svn: 79380	2009-08-18 23:36:17 +00:00
Jim Grosbach	43bbb9de66	Remove a bit more cruft from the sjlj moving to a backend pass. llvm-svn: 79272	2009-08-17 20:25:04 +00:00
Jakob Stoklund Olesen	7f91fee62b	Be more clever about regclasses in ScheduleDAGSDNodes::EmitCopyFromReg. If two uses of a CopyFromReg want different regclasses, first try a common sub-class, then fall back on the copy emitted in AddRegisterOperand. There is no need for an assert here. The cross-class joiner usually cleans up nicely. llvm-svn: 79193	2009-08-16 17:40:59 +00:00
Evan Cheng	badf17cdc7	Needs to check whether unaligned load / store of i64 is legal here. llvm-svn: 79150	2009-08-15 23:41:42 +00:00
Benjamin Kramer	d2d5e716bd	Unbreak build. Evan, please make sure my changes are correct. llvm-svn: 79133	2009-08-15 20:46:16 +00:00
Evan Cheng	567f124305	80 col violations. llvm-svn: 79087	2009-08-15 08:38:52 +00:00
Dan Gohman	e8c913e657	Simplify this code to not depend as much on CurMBB. llvm-svn: 79068	2009-08-15 02:06:22 +00:00
Anton Korobeynikov	a6b3ce203a	Allow targets to specify their choice of calling conventions per libcall. Take advantage of this in the ARM backend to rectify broken choice of CC when hard float is in effect. PIC16 may want to see if it could be of use in MakePIC16Libcall, which works unchanged. Patch by Sandeep! llvm-svn: 79033	2009-08-14 20:10:52 +00:00
Evan Cheng	dc1869661b	Indentation change. llvm-svn: 78978	2009-08-14 01:56:37 +00:00
Owen Anderson	55f1c09e31	Push LLVMContexts through the IntegerType APIs. llvm-svn: 78948	2009-08-13 21:58:54 +00:00
David Goodwin	90e6b8b708	Add callback to allow target to adjust latency of schedule dependency edge. llvm-svn: 78910	2009-08-13 16:05:04 +00:00
Owen Anderson	117c9e8497	Add contexts to some of the MVT APIs. No functionality change yet, just the infrastructure work needed to get the contexts to where they need to be first. llvm-svn: 78759	2009-08-12 00:36:31 +00:00
Owen Anderson	c6daf8f17c	Fix warnings. llvm-svn: 78725	2009-08-11 21:59:30 +00:00
Owen Anderson	9f94459d24	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Dan Gohman	7c50c9bd63	Tidy #includes. llvm-svn: 78677	2009-08-11 16:02:12 +00:00
Jim Grosbach	693e36a3e8	SjLj based exception handling unwinding support. This patch is nasty, brutish and short. Well, it's kinda short. Definitely nasty and brutish. The front-end generates the register/unregister calls into the SjLj runtime, call-site indices and landing pad dispatch. The back end fills in the LSDA with the call-site information provided by the front end. Catch blocks are not yet implemented. Built on Darwin and verified no llvm-core "make check" regressions. llvm-svn: 78625	2009-08-11 00:09:57 +00:00
Dan Gohman	9d26c85bdc	Fix a bug in the DAGCombiner's handling of multiple linked MERGE_VALUES nodes. Replacing the result values with the operands in one MERGE_VALUES node may cause another MERGE_VALUES node be CSE'd with the first one, and bring its uses along, so that the first one isn't dead, as this code expects. Fix this by iterating until the node is really dead. This fixes PR4699. llvm-svn: 78619	2009-08-10 23:43:19 +00:00
Dan Gohman	733a64db57	Fix a bug where DAGCombine was producing an illegal ConstantFP node after legalize, and remove the workaround code from the ARM backend. llvm-svn: 78615	2009-08-10 23:15:10 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Owen Anderson	c30530d105	Start moving TargetLowering away from using full MVTs and towards SimpleValueType, which will simplify the privatization of IntegerType in the future. llvm-svn: 78584	2009-08-10 18:56:59 +00:00
Dan Gohman	b717091e69	Make this comment more closely reflect the code. llvm-svn: 78569	2009-08-10 16:50:32 +00:00
Jakob Stoklund Olesen	dc6bccbaa6	Don't build illegal ops in DAGCombiner::SimplifyBinOpWithSameOpcodeHands(). Blackfin supports and/or/xor on i32 but not on i16. Teach DAGCombiner::SimplifyBinOpWithSameOpcodeHands to not produce illegal nodes after legalize ops. llvm-svn: 78497	2009-08-08 20:42:17 +00:00
Dale Johannesen	352fa92995	Use stripPointerCasts instead of partially rewriting it. llvm-svn: 78350	2009-08-06 22:45:51 +00:00
Dan Gohman	695d811ad5	Add assertion checks after the calls to LowerFormalArguments, LowerCall, and LowerReturn, to verify that the targets' hooks have respected some of their postconditions. llvm-svn: 78312	2009-08-06 15:37:27 +00:00
Dan Gohman	ee902509a8	Remove an over-aggressive assert. Functions with empty struct return types don't have any return values, from CodeGen's perspective. This fixes PR4688. llvm-svn: 78311	2009-08-06 15:07:58 +00:00
Dan Gohman	5758e1e92a	Fix a few places in DAGCombiner that were creating all-ones-bits and high-bits values in ways that weren't correct for integer types wider than 64 bits. This fixes a miscompile in PPMacroExpansion.cpp in clang on x86-64. llvm-svn: 78295	2009-08-06 09:18:59 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Dan Gohman	15873a8ff7	Propogate the Depth argument when calling TLI.computeMaskedBitsForTargetNode from ComputeMaskedBits, since the former may call back into the latter. This fixes a major compile time problem on a testcase that happnened to hit this in a particularly bad way, PR4643. llvm-svn: 78023	2009-08-04 00:24:42 +00:00
Bob Wilson	5f6f72605b	Revert 77974. It breaks 3 of the ARM tests. llvm-svn: 77982	2009-08-03 19:06:29 +00:00
Sanjiv Gupta	9503900c60	Allow targets to custom handle softening of results or operands before trying the standard stuff. llvm-svn: 77974	2009-08-03 17:35:21 +00:00
Benjamin Kramer	c28b306423	llvm_report_error already prints "LLVM ERROR:". So stop reporting errors like "LLVM ERROR: llvm: error:" or "LLVM ERROR: ERROR:". llvm-svn: 77971	2009-08-03 13:33:33 +00:00
Dan Gohman	3f323847bc	Avoid forming a SELECT_CC in a type that the target doesn't support. This isn't immediately interesting, because Legalize ends up lowering SELECT_CC if the target doesn't support it, but this simplifies the process. Also, if the SELECT_CC would be expanded in Legalize, it can potentially end up with two copies of the condition expression. By leaving it as SELECT+SETCC, the SELECT can be expanded into two SELECTs that use a single SETCC. The two comparisons are usually CSE'd, but depending on when various expressions get legalized, the comparison expression could involve calls to library functions, such that the comparison expression may not be able to be CSE'd. This will be needed by a future patch. llvm-svn: 77896	2009-08-02 16:19:38 +00:00
Dan Gohman	3a9b9a59ea	Print the target flags as an int instead of a char, as they aren't actually characters. llvm-svn: 77794	2009-08-01 19:13:38 +00:00
Dan Gohman	859103d8e7	Delete a redundant variable. llvm-svn: 77774	2009-08-01 04:18:29 +00:00
Dan Gohman	7153692bdf	Minor code simplifications. llvm-svn: 77769	2009-08-01 03:51:09 +00:00
Dan Gohman	1987bf4561	SelectionDAGISel no longer needs to check hasAvailableExternallyLinkage, as it is now a MachineFunctionPass, and MachineFunctionPass now handles this. llvm-svn: 77760	2009-08-01 00:42:23 +00:00
Dan Gohman	10b8898ac0	SelectionDAGISel does not "preserve all", since it makes lots of changes to the MachineFunction. llvm-svn: 77753	2009-07-31 23:36:22 +00:00
Dan Gohman	dd3da92b4a	Use a range insert instead of an explicit loop. llvm-svn: 77752	2009-07-31 23:36:06 +00:00
Bob Wilson	84aa855ead	Allow target intrinsics that return multiple values, i.e., struct types, in SelectionDAGLowering::visitTargetIntrinsic. This removes a bit of special-case code for vector types. After staring at it for a while, I managed to convince myself that it is not necessary. The only case where TLI.getValueType() differs from MVT::getMVT is for iPTR, so this code could potentially make a difference for a vector of pointers. But, it looks like that is not supported. Calling TLI.getValueType() on a vector of pointers leads to the following sequence of calls: TargetLowering::getValueType MVT::getMVT MVT::getVectorVT(iPTR, num elements) MVT::getExtendedVectorVT MVT::getTypeForMVT for iPTR assertion fails "Type is not extended!" So, unless I'm really missing something, this bit of code is irrelevant to the current version of LLVM, which is consistent with the fact that I don't see this code in other similar places. llvm-svn: 77747	2009-07-31 22:41:21 +00:00
Owen Anderson	5a1acd9912	Move a few more APIs back to 2.5 forms. The only remaining ones left to change back are metadata related, which I'm waiting on to avoid conflicting with Devang. llvm-svn: 77721	2009-07-31 20:28:14 +00:00
Dan Gohman	5ea74d55ce	Reapply r77654 with a fix: MachineFunctionPass's getAnalysisUsage shouldn't do AU.setPreservesCFG(), because even though CodeGen passes don't modify the LLVM IR CFG, they may modify the MachineFunction CFG, and passes like MachineLoop are registered with isCFGOnly set to true. llvm-svn: 77691	2009-07-31 18:16:33 +00:00
Owen Anderson	23a204d91b	Move getTrue() and getFalse() to 2.5-like APIs. llvm-svn: 77685	2009-07-31 17:39:07 +00:00
Daniel Dunbar	5434756585	Revert r77654, it appears to be causing llvm-gcc bootstrap failures, and many failures when building assorted projects with clang. --- Reverse-merging r77654 into '.': U include/llvm/CodeGen/Passes.h U include/llvm/CodeGen/MachineFunctionPass.h U include/llvm/CodeGen/MachineFunction.h U include/llvm/CodeGen/LazyLiveness.h U include/llvm/CodeGen/SelectionDAGISel.h D include/llvm/CodeGen/MachineFunctionAnalysis.h U include/llvm/Function.h U lib/Target/CellSPU/SPUISelDAGToDAG.cpp U lib/Target/PowerPC/PPCISelDAGToDAG.cpp U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/MachineVerifier.cpp U lib/CodeGen/MachineFunction.cpp U lib/CodeGen/PrologEpilogInserter.cpp U lib/CodeGen/MachineLoopInfo.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp D lib/CodeGen/MachineFunctionAnalysis.cpp D lib/CodeGen/MachineFunctionPass.cpp U lib/CodeGen/LiveVariables.cpp llvm-svn: 77661	2009-07-31 03:02:41 +00:00
Dan Gohman	bcb44baa57	Manage MachineFunctions with an analysis Pass instead of the Annotable mechanism. To support this, make MachineFunctionPass a little more complete. llvm-svn: 77654	2009-07-31 01:52:50 +00:00
Owen Anderson	b292b8ce70	Move more code back to 2.5 APIs. llvm-svn: 77635	2009-07-30 23:03:37 +00:00
Sanjiv Gupta	a53e686d96	Allow targets to define libcall names for mem(cpy,set,move) intrinsics, rather than hardcoding them in DAG lowering. llvm-svn: 77586	2009-07-30 09:12:56 +00:00
Evan Cheng	e62288fdd4	Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch() and __sync_sub_and_fetch. When the return value is not used (i.e. only care about the value in the memory), x86 does not have to use add to implement these. Instead, it can use add, sub, inc, dec instructions with the "lock" prefix. This is currently implemented using a bit of instruction selection trick. The issue is the target independent pattern produces one output and a chain and we want to map it into one that just output a chain. The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. DAG combiner can then transform the node before it gets to target node selection. Problem #2 is we are adding a whole bunch of x86 atomic instructions when in fact these instructions are identical to the non-lock versions. We need a way to add target specific information to target nodes and have this information carried over to machine instructions. Asm printer (or JIT) can use this information to add the "lock" prefix. llvm-svn: 77582	2009-07-30 08:33:02 +00:00
Owen Anderson	4056ca9568	Move types back to the 2.5 API. llvm-svn: 77516	2009-07-29 22:17:13 +00:00
Chris Lattner	7667332899	inline the global 'getInstrOperandRegClass' function into its callers now that TargetOperandInfo does the heavy lifting. llvm-svn: 77508	2009-07-29 21:36:49 +00:00
Benjamin Kramer	21d75078b5	Remove now unused Context variables. llvm-svn: 77495	2009-07-29 19:14:17 +00:00
Owen Anderson	487375e9a2	Move ConstantExpr to 2.5 API. llvm-svn: 77494	2009-07-29 18:55:55 +00:00
Owen Anderson	4aa3295a65	Return ConstantVector to 2.5 API. llvm-svn: 77366	2009-07-28 21:19:26 +00:00
Owen Anderson	c2c7932c64	Change ConstantArray to 2.5 API. llvm-svn: 77347	2009-07-28 18:32:17 +00:00
Chris Lattner	5e693ed07b	Rip all of the global variable lowering logic out of TargetAsmInfo. Since it is highly specific to the object file that will be generated in the end, this introduces a new TargetLoweringObjectFile interface that is implemented for each of ELF/MachO/COFF/Alpha/PIC16 and XCore. Though still is still a brutal and ugly refactoring, this is a major step towards goodness. This patch also: 1. fixes a bunch of dangling pointer problems in the PIC16 backend. 2. disables the TargetLowering copy ctor which PIC16 was accidentally using. 3. gets us closer to xcore having its own crazy target section flags and pic16 not having to shadow sections with its own objects. 4. fixes wierdness where ELF targets would set CStringSection but not CStringSection_. Factor the code better. 5. fixes some bugs in string lowering on ELF targets. llvm-svn: 77294	2009-07-28 03:13:23 +00:00
Owen Anderson	69c464dec4	Move ConstantFP construction back to the 2.5-ish API. llvm-svn: 77247	2009-07-27 20:59:43 +00:00
Eli Friedman	65919b5058	Reorganize code a bit to reduce indentation. No visible functionality change. llvm-svn: 77171	2009-07-26 23:47:17 +00:00
Daniel Dunbar	ca414c7cae	Remove Value::getNameLen llvm-svn: 77148	2009-07-26 08:34:35 +00:00
Dan Gohman	1ddf98ad8e	Convert a few more things to use raw_ostream. llvm-svn: 77039	2009-07-25 01:43:01 +00:00
Daniel Dunbar	0dd5e1ed39	More migration to raw_ostream, the water has dried up around the iostream hole. - Some clients which used DOUT have moved to DEBUG. We are deprecating the "magic" DOUT behavior which avoided calling printing functions when the statement was disabled. In addition to being unnecessary magic, it had the downside of leaving code in -Asserts builds, and of hiding potentially unnecessary computations. llvm-svn: 77019	2009-07-25 00:23:56 +00:00
Owen Anderson	edb4a70325	Revert the ConstantInt constructors back to their 2.5 forms where possible, thanks to contexts-on-types. More to come. llvm-svn: 77011	2009-07-24 23:12:02 +00:00
Jakob Stoklund Olesen	1ae0736830	Add support for promoting SETCC operations. llvm-svn: 76987	2009-07-24 18:22:59 +00:00
Daniel Dunbar	796e43eede	Move more to raw_ostream, provide support for writing MachineBasicBlock, LiveInterval, etc to raw_ostream. llvm-svn: 76965	2009-07-24 10:36:58 +00:00
Daniel Dunbar	12368685d8	Switch to getNameStr(). llvm-svn: 76962	2009-07-24 08:24:36 +00:00
Chris Lattner	308c7896a4	"fix" PR4612, which is a crash on: %0 = malloc [3758096384 x i32] The "malloc" instruction doesn't support 64-bits correctly (see PR715), and should be removed. Victor is actively working on fixing this, in the meantime just don't crash. llvm-svn: 76899	2009-07-23 21:26:18 +00:00
Owen Anderson	47db941fd3	Get rid of the Pass+Context magic. llvm-svn: 76702	2009-07-22 00:24:57 +00:00
Eli Friedman	da9eda8ef6	Remove shift amount flavor. It isn't actually complete enough to be useful, and it's currently unused. (Some issues: it isn't actually rich enough to capture the semantics on many architectures, and semantics can vary depending on the type being shifted.) llvm-svn: 76633	2009-07-21 20:12:16 +00:00
Owen Anderson	c37bc69e91	Rename getConstantInt{True\|False} to get{True\|False} at Chris' behest. llvm-svn: 76598	2009-07-21 18:03:38 +00:00
Daniel Dunbar	5899e340f3	Simplify / normalize some uses of Value::getName. llvm-svn: 76553	2009-07-21 08:54:24 +00:00
Evan Cheng	a7bb55ebb6	Fix a dagga combiner bug: avoid creating illegal constant. Is this really a winning transformation? fold (shl (srl x, c1), c2) -> (shl (and x, (shl -1, c1)), (sub c2, c1)) or (srl (and x, (shl -1, c1)), (sub c1, c2)) llvm-svn: 76535	2009-07-21 05:40:15 +00:00
Owen Anderson	2ad52176f9	Move a bit more state over to the LLVMContext. llvm-svn: 76533	2009-07-21 02:47:59 +00:00
Dale Johannesen	ade297d496	Move stripping of bitcasts in inline asm arguments to a place where it affects everything. Occurs only on calls AFAIK. llvm-svn: 76502	2009-07-20 23:27:39 +00:00
Daniel Dunbar	ac0ca9241a	Fix some minor MSVC compiler warnings. llvm-svn: 76356	2009-07-19 01:38:38 +00:00
Eli Friedman	97f3f965eb	Make promotion in operation legalization for SETCC work correctly. llvm-svn: 76153	2009-07-17 05:16:04 +00:00
Jeffrey Yasskin	efad8e45fe	Add line numbers to OProfile. To do this, I added a processDebugLoc() call to the MachineCodeEmitter interface and made copying the start line of a function not conditional on whether we're emitting Dwarf debug information. I'll propagate the processDebugLoc() calls to the non-X86 targets in a followup patch. In the long run, it'll probably be better to gather this information through the DwarfWriter, but the DwarfWriter currently depends on the AsmPrinter and TargetAsmInfo, and fixing that would be out of the way for this patch. There's a bug in OProfile 0.9.4 that makes it ignore line numbers for addresses above 4G, and a patch fixing it at http://thread.gmane.org/gmane.linux.oprofile/7634 Sample output: $ sudo opcontrol --reset; sudo opcontrol --start-daemon; sudo opcontrol --start; `pwd`/Debug/bin/lli fib.bc; sudo opcontrol --stop Signalling daemon... done Profiler running. fib(40) == 165580141 Stopping profiling. $ opreport -g -d -l `pwd`/Debug/bin/lli\|head -60 Overflow stats not available CPU: Core 2, speed 1998 MHz (estimated) Counted CPU_CLK_UNHALTED events (Clock cycles when not halted) with a unit mask of 0x00 (Unhalted core cycles) count 100000 vma samples % linenr info image name symbol name 00007f67a30370b0 25489 61.2554 fib.c:24 10946.jo fib_left 00007f67a30370b0 1634 6.4106 fib.c:24 00007f67a30370b1 83 0.3256 fib.c:24 00007f67a30370b9 1997 7.8348 fib.c:24 00007f67a30370c6 2080 8.1604 fib.c:27 00007f67a30370c8 988 3.8762 fib.c:27 00007f67a30370cd 1315 5.1591 fib.c:27 00007f67a30370cf 251 0.9847 fib.c:27 00007f67a30370d3 1191 4.6726 fib.c:27 00007f67a30370d6 975 3.8252 fib.c:27 00007f67a30370db 1010 3.9625 fib.c:27 00007f67a30370dd 242 0.9494 fib.c:27 00007f67a30370e1 2782 10.9145 fib.c:28 00007f67a30370e5 3768 14.7828 fib.c:28 00007f67a30370eb 615 2.4128 (no location information) 00007f67a30370f3 6558 25.7287 (no location information) 00007f67a3037100 15603 37.4973 fib.c:29 10946.jo fib_right 00007f67a3037100 1646 10.5493 fib.c:29 00007f67a3037101 45 0.2884 fib.c:29 00007f67a3037109 2372 15.2022 fib.c:29 00007f67a3037116 2234 14.3178 fib.c:32 00007f67a3037118 612 3.9223 fib.c:32 00007f67a303711d 622 3.9864 fib.c:32 00007f67a303711f 385 2.4675 fib.c:32 00007f67a3037123 404 2.5892 fib.c:32 00007f67a3037126 634 4.0633 fib.c:32 00007f67a303712b 870 5.5759 fib.c:32 00007f67a303712d 62 0.3974 fib.c:32 00007f67a3037131 1848 11.8439 fib.c:33 00007f67a3037135 2840 18.2016 fib.c:33 00007f67a303713a 1 0.0064 fib.c:33 00007f67a303713b 1023 6.5564 (no location information) 00007f67a3037143 5 0.0320 (no location information) 000000000080c1e4 15 0.0360 MachineOperand.h:150 lli llvm::MachineOperand::isReg() const 000000000080c1e4 6 40.0000 MachineOperand.h:150 000000000080c1ec 2 13.3333 MachineOperand.h:150 ... llvm-svn: 76102	2009-07-16 21:07:26 +00:00
Owen Anderson	c277dc408b	Privatize the ConstantFP table. I'm on a roll! llvm-svn: 76097	2009-07-16 19:05:41 +00:00
Owen Anderson	20b34ac794	Move the ConstantInt uniquing table into LLVMContextImpl. This exposed a number of issues in our current context-passing stuff, which is also fixed here llvm-svn: 76089	2009-07-16 18:04:31 +00:00
Anton Korobeynikov	bbd751e410	Propagate return result extension type llvm-svn: 75925	2009-07-16 13:35:48 +00:00
Owen Anderson	f945a9ed07	Move a few more convenience factory functions from Constant to LLVMContext. llvm-svn: 75840	2009-07-15 21:51:10 +00:00
Ted Kremenek	39816d9157	Lexically order files in CMakeLists.txt files. llvm-svn: 75831	2009-07-15 21:08:16 +00:00
Owen Anderson	b6b2530000	Move EVER MORE stuff over to LLVMContext. llvm-svn: 75703	2009-07-14 23:09:55 +00:00
Torok Edwin	fbcc663cbf	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Owen Anderson	53a52215b5	Begin the painful process of tearing apart the rat'ss nest that is Constants.cpp and ConstantFold.cpp. This involves temporarily hard wiring some parts to use the global context. This isn't ideal, but it's the only way I could figure out to make this process vaguely incremental. llvm-svn: 75445	2009-07-13 04:09:18 +00:00
Chris Lattner	7b9d6ebb9c	remove llvm.part.set.* and llvm.part.select.*. They have never been implemented in codegen, have no frontend to generate them, and are better implemented with pattern matching (like the ppc backend does to generate rlwimi/rlwinm etc). PR4543 llvm-svn: 75430	2009-07-12 21:08:53 +00:00
Torok Edwin	08954aa4e1	Fix assert(0) conversion, as suggested by Chris. llvm-svn: 75423	2009-07-12 20:07:01 +00:00
Jakob Stoklund Olesen	ed0e1a0552	Implement support for promotion of AND/OR/XOR on integer types. The blackfin processor has a legal i16 type, but only logic operations on i32. llvm-svn: 75419	2009-07-12 18:10:18 +00:00
Jakob Stoklund Olesen	6b9f63cafa	Fix types in PromoteNode handling of CTPOP and friends. llvm-svn: 75418	2009-07-12 17:43:20 +00:00
Torok Edwin	56d0659726	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Torok Edwin	ccb29cd290	Convert more assert(0)+abort() -> LLVM_UNREACHABLE, and abort()/exit() -> llvm_report_error(). llvm-svn: 75363	2009-07-11 13:10:19 +00:00
Evan Cheng	ede2ce71aa	Fix up support for OptionalDefOperand when it defaults to an actual register def. I need this to get ready for major Thumb1 surgery. llvm-svn: 75328	2009-07-11 01:06:50 +00:00
Eli Friedman	106f2885d1	Use CreateStackStoreLoad helper in more places. llvm-svn: 75320	2009-07-11 00:11:07 +00:00
Bob Wilson	f76798769f	Fix an apparent copy-and-paste problem in an error message. llvm-svn: 75197	2009-07-09 23:42:59 +00:00
Eli Friedman	2b77eef160	Make EXTRACT_VECTOR_ELT a bit more flexible in terms of the returned value. Adjust other code to deal with that correctly. Make DAGTypeLegalizer::PromoteIntRes_EXTRACT_VECTOR_ELT take advantage of this new flexibility to simplify the code and make it deal with unusual vectors (like <4 x i1>) correctly. Fixes PR3037. llvm-svn: 75176	2009-07-09 22:01:03 +00:00
Owen Anderson	092bc51cdb	As Chris pointed out, we don't actually need to pass the context around here. llvm-svn: 75161	2009-07-09 18:44:09 +00:00
Owen Anderson	0504e0a222	Thread LLVMContext through MVT and related parts of SDISel. llvm-svn: 75153	2009-07-09 17:57:24 +00:00
Dan Gohman	6b04136756	Make SelectionDAG::getVectorShuffle work properly for VECTOR_SHUFFLE nodes with operand types that differ from the result type. (This doesn't normally happen right now, because SelectionDAGLowering::visitShuffleVector normalizes vector shuffles.) llvm-svn: 75081	2009-07-09 00:46:33 +00:00
David Goodwin	22c2fba978	Use common code for both ARM and Thumb-2 instruction and register info. llvm-svn: 75067	2009-07-08 23:10:31 +00:00
Duncan Sands	7dcc37b942	Nowadays vectors are only split if they have an even number of elements. Make some simplifications based on this (in particular SplitVecRes_SETCC). Tighten up some checking while there. llvm-svn: 75050	2009-07-08 21:34:03 +00:00
Duncan Sands	3f1e2409cc	Remove trailing whitespace. Reorder some methods and cases alphabetically. No functionality change. llvm-svn: 75001	2009-07-08 11:36:39 +00:00
Nick Lewycky	a21d3daadc	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Chris Lattner	4ac607332d	dag combine sext(setcc) -> vsetcc before legalize. To make this safe, VSETCC must define all bits, which is different than it was documented to before. Since all targets that implement VSETCC already have this behavior, and we don't optimize based on this, just change the documentation. We now get nice code for vec_compare.ll llvm-svn: 74978	2009-07-08 00:31:33 +00:00
Chris Lattner	f3989abdbf	SelectionDAG::SignBitIsZero doesn't work right for vectors, for now, conservatively return false. llvm-svn: 74969	2009-07-07 23:28:46 +00:00
Dale Johannesen	4e33115e5e	Operand of asm("call") (the callee function) is represented as "X" constraint and "P" modifier on x86. Make this work. (Change may not be sufficient to fix it for non-Darwin, but I'm pretty sure it won't break anything.) gcc.apple/asm-block-32.c gcc.apple/asm-block-33.c llvm-svn: 74967	2009-07-07 23:26:33 +00:00
Chris Lattner	fc74e8241a	add support for legalizing an icmp where the result is illegal (4xi1) but the input is legal (4 x i32) llvm-svn: 74964	2009-07-07 23:03:54 +00:00
Chris Lattner	f48f3be185	random code cleanups. llvm-svn: 74962	2009-07-07 22:49:15 +00:00
Chris Lattner	30220d8f98	implement support for spliting and scalarizing vector setcc's. This finishes off enough support for vector compares to get the icmp/fcmp version of 2008-07-23-VSetCC.ll passing. llvm-svn: 74961	2009-07-07 22:47:46 +00:00
Chris Lattner	f2af7f44e7	lower vector icmp/fcmp to ICMP/FCMP nodes with the right result (vector of bool). llvm-svn: 74960	2009-07-07 22:41:32 +00:00
Chris Lattner	119421421a	ScalarizeVecRes_ShiftOp and ScalarizeVecRes_BinOp are the same, eliminate the former. llvm-svn: 74959	2009-07-07 22:28:41 +00:00
Chris Lattner	cc1fed3111	add support for vector legalizing of *_EXTEND. llvm-svn: 74957	2009-07-07 22:27:17 +00:00
Owen Anderson	5c96ef7c4e	Have scoped mutexes take referenes instead of pointers. llvm-svn: 74931	2009-07-07 18:33:04 +00:00
Tilmann Scheller	aea6059ed4	Add NumFixedArgs attribute to CallSDNode which indicates the number of fixed arguments in a vararg call. With the SVR4 ABI on PowerPC, vector arguments for vararg calls are passed differently depending on whether they are a fixed or a variable argument. Variable vector arguments always go into memory, fixed vector arguments are put into vector registers. If there are no free vector registers available, fixed vector arguments are put on the stack. The NumFixedArgs attribute allows to decide for an argument in a vararg call whether it belongs to the fixed or variable portion of the parameter list. llvm-svn: 74764	2009-07-03 06:44:53 +00:00
Devang Patel	87127712b9	Simplify debug info intrisinc lowering. llvm-svn: 74733	2009-07-02 22:43:26 +00:00
Douglas Gregor	6141511621	CMake build fixes, from Xerxes Ranby llvm-svn: 74720	2009-07-02 18:53:52 +00:00
Devang Patel	6bab414f87	Simplify. llvm-svn: 74677	2009-07-02 00:28:03 +00:00
Devang Patel	846a5e4d3e	Simplify. No intentional functionality change. llvm-svn: 74673	2009-07-02 00:08:09 +00:00
Devang Patel	53d24bc7d6	Refactor. No functionality change. llvm-svn: 74659	2009-07-01 23:19:01 +00:00
Devang Patel	ea76e08645	llvm.dbg.declare is always used for local variable's debug info. llvm-svn: 74625	2009-07-01 18:51:07 +00:00
Evan Cheng	0dc101b897	Add a bit IsUndef to MachineOperand. This indicates the def / use register operand is defined by an implicit_def. That means it can def / use any register and passes (e.g. register scavenger) can feel free to ignore them. The register allocator, when it allocates a register to a virtual register defined by an implicit_def, can allocate any physical register without worrying about overlapping live ranges. It should mark all of operands of the said virtual register so later passes will do the right thing. This is not the best solution. But it should be a lot less fragile to having the scavenger try to track what is defined by implicit_def. llvm-svn: 74518	2009-06-30 08:49:04 +00:00
Chris Lattner	a4775f2b13	fix a typo that GCC should have caught that causes crashes with -view-*-dags llvm-svn: 74364	2009-06-27 00:57:02 +00:00
Chris Lattner	bc60c14c97	fix a really subtle bug in the cross section of aliases and TLS: the SelectionDAG::getGlobalAddress function properly looks through aliases to determine thread-localness, but then passes the GV* down to GlobalAddressSDNode::GlobalAddressSDNode which does not. Instead of passing down isTarget, just pass down the predetermined node opcode. This fixes some assertions with out of tree changes I'm working on. llvm-svn: 74325	2009-06-26 21:14:05 +00:00
Chris Lattner	7f82a19fbf	implement DOTGraphTraits<SelectionDAG*>::getNodeLabel in terms of SDNode::print_details to eliminate a ton of near-duplicate code. llvm-svn: 74311	2009-06-26 19:06:10 +00:00
Chris Lattner	68bb4e0e01	dot graph viewing is apparently not using SDNode::print_details, this is bad, but in the meantime lets print targetflags on node labels. llvm-svn: 74274	2009-06-26 05:55:43 +00:00
Chris Lattner	17dcba9da4	propagate target operand flags from dag nodes into MachineOperands. llvm-svn: 74273	2009-06-26 05:52:14 +00:00
Chris Lattner	54b8ebced6	fit in 80 cols llvm-svn: 74270	2009-06-26 05:39:02 +00:00
Chris Lattner	b3586b6e73	add targetflags to jump tables and constant pool entries. llvm-svn: 74204	2009-06-25 21:35:31 +00:00
Chris Lattner	8e34f98d72	allow setting target operand flags on TargetGlobalAddress nodes. llvm-svn: 74203	2009-06-25 21:21:14 +00:00
Chris Lattner	af5dbfc6f8	start bringing targetoperand flags into isel, first up, ExternalSymbol. llvm-svn: 74199	2009-06-25 18:45:50 +00:00
Owen Anderson	5defd5655e	Provide guards for this shared structure. I'm not sure this actually needs to be shared, but how/where to privatize it is not immediately clear to me. If any SelectionDAG experts see a better solution, please share! llvm-svn: 74180	2009-06-25 17:09:00 +00:00
David Greene	30048bdb63	This increases the maximum for MVT::LAST_VALUETYPE This change doubles the allowable value for MVT::LAST_VALUETYPE. It does this by doing several things. 1. Introduces MVT::MAX_ALLOWED_LAST_VALUETYPE which in this change has a value of 64. This value contains the current maximum for the MVT::LAST_VALUETYPE. 2. Instead of checking "MVT::LAST_VALUETYPE <= 32", all of those uses now become "MVT::LAST_VALUETYPE <= MVT::MAX_ALLOWED_LAST_VALUETYPE" 3. Changes the dimension of the ValueTypeActions from 2 elements to four elements and adds comments ahead of the declaration indicating the it is "(MVT::MAX_ALLOWED_LAST_VALUETYPE/32) * 2". This at least lets us find what is affected if and when MVT::MAX_ALLOWED_LAST_VALUETYPE gets changed. 4. Adds initializers for the new elements of ValueTypeActions. This does NOT add any types in MVT. That would be done separately. This doubles the size of ValueTypeActions from 64 bits to 128 bits and gives us the freedom to add more types for AVX. llvm-svn: 74110	2009-06-24 19:41:55 +00:00
Owen Anderson	b70adf2b92	Get rid of the global CFGOnly flag by threading a ShortNames parameters through the GraphViz rendering code. Update other uses in the codebase for this change. llvm-svn: 74084	2009-06-24 17:37:09 +00:00
Dale Johannesen	92c11e90c8	Rewrite 73900 per Duncan's suggestion. llvm-svn: 74082	2009-06-24 17:11:31 +00:00
Chris Lattner	3912036c25	remove dead makefile flags. llvm-svn: 74065	2009-06-24 05:29:56 +00:00
Dale Johannesen	315fb72d36	Fix memcpy expansion so it won't generate invalid types for the target (I think). This was breaking the PPC32 calling sequence. llvm-svn: 73900	2009-06-22 20:59:07 +00:00
Devang Patel	da10358c84	mv CodeGen/DebugLoc.h Support/DebugLoc.h llvm-svn: 73786	2009-06-19 22:08:58 +00:00
Eli Friedman	495d02f4a6	Minor cleanup; fixes review comments for a previous patch. Sorry for taking so long to get to this! llvm-svn: 73757	2009-06-19 06:01:55 +00:00
Sanjiv Gupta	bce3ca6ad9	Fixed names of libcalls checked in r73480. llvm-svn: 73483	2009-06-16 10:22:58 +00:00
Sanjiv Gupta	557ed09e0f	Added required libcalls for PIC16 (mostly floating points to integer casting operations). llvm-svn: 73480	2009-06-16 09:03:58 +00:00
Eli Friedman	abfad5d61e	Add some generic expansion logic for SMULO and UMULO. Fixes UMULO support for x86, and UMULO/SMULO for many architectures, including PPC (PR4201), ARM, and Cell. The resulting expansion isn't perfect, but it's not bad. llvm-svn: 73477	2009-06-16 06:58:29 +00:00
Dan Gohman	6e6808adaf	Change this from an assert to a cerr+exit, since it's diagnosing an unsupported inline asm construct, rather than verifying a code invariant. llvm-svn: 73435	2009-06-15 22:32:41 +00:00
Devang Patel	56e6fe1642	Gracefully handle imbalanced inline function begin and end markers. llvm-svn: 73426	2009-06-15 21:45:50 +00:00
Arnold Schwaighofer	cb9046cfc8	CheckTailCallReturnConstraints is missing a check on the incomming chain of the RETURN node. The incomming chain must be the outgoing chain of the CALL node. This causes the backend to identify tail calls that are not tail calls. This patch fixes this. llvm-svn: 73387	2009-06-15 14:43:36 +00:00
Eli Friedman	516479d6e7	Tweak the expansion code for BIT_CONVERT to generate better code converting from an MMX vector to an i64. llvm-svn: 73024	2009-06-07 09:41:57 +00:00
Eli Friedman	3234587213	Slightly generalize the code that handles shuffles of consecutive loads on x86 to handle more cases. Fix a bug in said code that would cause it to read past the end of an object. Rewrite the code in SelectionDAGLegalize::ExpandBUILD_VECTOR to be a bit more general. Remove PerformBuildVectorCombine, which is no longer necessary with these changes. In addition to simplifying the code, with this change, we can now catch a few more cases of consecutive loads. llvm-svn: 73012	2009-06-07 06:52:44 +00:00
Eli Friedman	c61e357aa6	Fix the expansion for CONCAT_VECTORS so that it doesn't create illegal types. llvm-svn: 72993	2009-06-06 07:08:26 +00:00
Eli Friedman	aee3f62b75	Factor out a couple of helpers. llvm-svn: 72992	2009-06-06 07:04:42 +00:00
Eli Friedman	aea9b65668	Make SINT_TO_FP/UINT_TO_FP vector legalization queries query on the integer type to be consistent with normal operation legalization. No visible change because nothing is actually using this at the moment. llvm-svn: 72980	2009-06-06 03:27:50 +00:00
Devang Patel	d1c7d34924	Add new function attribute - noimplicitfloat Update code generator to use this attribute and remove NoImplicitFloat target option. Update llc to set this attribute when -no-implicit-float command line option is used. llvm-svn: 72959	2009-06-05 21:57:13 +00:00
Nate Begeman	624690c6b2	Adapt the x86 build_vector dagcombine to the current state of the legalizer. build vectors with i64 elements will only appear on 32b x86 before legalize. Since vector widening occurs during legalize, and produces i64 build_vector elements, the dag combiner is never run on these before legalize splits them into 32b elements. Teach the build_vector dag combine in x86 back end to recognize consecutive loads producing the low part of the vector. Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes since that was required implicitly. Add a testcase for the transform. Old: subl $28, %esp movl 32(%esp), %eax movl 4(%eax), %ecx movl %ecx, 4(%esp) movl (%eax), %eax movl %eax, (%esp) movaps (%esp), %xmm0 pmovzxwd %xmm0, %xmm0 movl 36(%esp), %eax movaps %xmm0, (%eax) addl $28, %esp ret New: movl 4(%esp), %eax pmovzxwd (%eax), %xmm0 movl 8(%esp), %eax movaps %xmm0, (%eax) ret llvm-svn: 72957	2009-06-05 21:37:30 +00:00
Sanjiv Gupta	7925c5fd3f	Allow libcalls for i16 sdiv/udiv/rem operations. llvm-svn: 72941	2009-06-05 14:41:10 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Dale Johannesen	37bc85f89a	Fix FP_TO_UINT->i32 on ppc32 -mcpu=g5. This was using Promote which won't work because i64 isn't a legal type. It's easy enough to use Custom, but then we have the problem that when the type legalizer is promoting FP_TO_UINT->i16, it has no way of telling it should prefer FP_TO_SINT->i32 to FP_TO_UINT->i32. I have uncomfortably hacked this by making the type legalizer choose FP_TO_SINT when both are Custom. This fixes several regressions in the testsuite. llvm-svn: 72891	2009-06-04 20:53:52 +00:00
Dan Gohman	7b6b5dd954	Don't do the X * 0.0 -> 0.0 transformation in instcombine, because instcombine doesn't know when it's safe. To partially compensate for this, introduce new code to do this transformation in dagcombine, which can use UnsafeFPMath. llvm-svn: 72872	2009-06-04 17:12:12 +00:00
Dan Gohman	c2eed3b0f8	Fix comments. llvm-svn: 72870	2009-06-04 16:49:15 +00:00
Dale Johannesen	5234d3795f	Revert 72707 and 72709, for the moment. llvm-svn: 72712	2009-06-02 03:12:52 +00:00
Dale Johannesen	0b8ca79253	Make the implicit inputs and outputs of target-independent ADDC/ADDE use MVT::i1 (later, whatever it gets legalized to) instead of MVT::Flag. Remove CARRY_FALSE in favor of 0; adjust all target-independent code to use this format. Most targets will still produce a Flag-setting target-dependent version when selection is done. X86 is converted to use i32 instead, which means TableGen needs to produce different code in xxxGenDAGISel.inc. This keys off the new supportsHasI1 bit in xxxInstrInfo, currently set only for X86; in principle this is temporary and should go away when all other targets have been converted. All relevant X86 instruction patterns are modified to represent setting and using EFLAGS explicitly. The same can be done on other targets. The immediate behavior change is that an ADC/ADD pair are no longer tightly coupled in the X86 scheduler; they can be separated by instructions that don't clobber the flags (MOV). I will soon add some peephole optimizations based on using other instructions that set the flags to feed into ADC. llvm-svn: 72707	2009-06-01 23:27:20 +00:00
Duncan Sands	96e5698741	Rename CustomLowerResults to CustomLowerNode, since it is used both when a result is illegal and when an operand is illegal. llvm-svn: 72658	2009-05-31 04:15:38 +00:00
Bill Wendling	09f17a8479	Untabification. llvm-svn: 72604	2009-05-30 01:09:53 +00:00
Evan Cheng	86cdb4b345	Do not try to create a MVT type of width 0. llvm-svn: 72557	2009-05-28 23:52:18 +00:00
Eli Friedman	e1dc193f35	Re-commit r72514 and r72516 with a fixed version of BR_CC lowering. This patch removes some special cases for opcodes and does a bit of cleanup. llvm-svn: 72536	2009-05-28 20:40:34 +00:00
Evan Cheng	6673ff08fe	Incorporate patch feedbacks. llvm-svn: 72533	2009-05-28 18:41:02 +00:00
Bill Wendling	f193838d2b	Temporarily revert r72514 (and dependent patch r72516). It was causing this failure during llvm-gcc bootstrap: Assertion failed: (!Tmp2.getNode() && "Can't legalize BR_CC with legal condition!"), function ExpandNode, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp, line 2923. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/libgcc2.c:1727: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. llvm-svn: 72530	2009-05-28 18:18:59 +00:00
Eli Friedman	9b9df77260	Remove a couple of useless functions. llvm-svn: 72516	2009-05-28 04:49:34 +00:00
Eli Friedman	3aa278394e	Remove special cases for more opcodes. This is basically the end of this series of patches for LegalizeDAG; the remaining special cases can't be removed without more infrastructure work. There's a FIXME for each relevant opcode near the beginning of SelectionDAGLegalize::LegalizeOp. llvm-svn: 72514	2009-05-28 04:39:57 +00:00
Eli Friedman	5df7202d3b	Remove special case for SETCC opcode; add some comments explaining why some special cases are necessary. llvm-svn: 72511	2009-05-28 03:56:57 +00:00
Eli Friedman	e1bc3798e6	Some minor cleanups. llvm-svn: 72509	2009-05-28 03:06:16 +00:00
Evan Cheng	a9cda8abf2	Added optimization that narrow load / op / store and the 'op' is a bit twiddling instruction and its second operand is an immediate. If bits that are touched by 'op' can be done with a narrower instruction, reduce the width of the load and store as well. This happens a lot with bitfield manipulation code. e.g. orl $65536, 8(%rax) => orb $1, 10(%rax) Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization. llvm-svn: 72507	2009-05-28 00:35:15 +00:00
Eli Friedman	ed795153c7	Minor cleanups; add a better explanation for the issue with BUILD_VECTOR. llvm-svn: 72469	2009-05-27 12:42:55 +00:00
Eli Friedman	2892d82378	Remove more special cases for opcodes. llvm-svn: 72468	2009-05-27 12:20:41 +00:00
Eli Friedman	3b251705fd	Remove special cases for more opcodes. llvm-svn: 72467	2009-05-27 07:58:35 +00:00
Eli Friedman	0e49431422	Removing more special cases from LegalizeDAG. llvm-svn: 72465	2009-05-27 07:32:27 +00:00
Eli Friedman	568839681c	Eliminate more special cases for opcodes. llvm-svn: 72464	2009-05-27 07:05:37 +00:00
Eli Friedman	d6f2834496	Remove more special cases from LegalizeDAG. llvm-svn: 72456	2009-05-27 03:33:44 +00:00
Eli Friedman	b3554158c5	Remove unused argument. llvm-svn: 72455	2009-05-27 02:21:29 +00:00
Eli Friedman	a8f9a0261e	Remove more opcode special cases. llvm-svn: 72454	2009-05-27 02:16:40 +00:00
Eli Friedman	21d349b3c5	Start of refactoring LegalizeDAG so that we don't need specialized handling for every single opcode. llvm-svn: 72447	2009-05-27 01:25:56 +00:00
Eli Friedman	4a951bf2ad	Delete a bunch of dead code from LegalizeDAG. llvm-svn: 72414	2009-05-26 08:55:52 +00:00
Eli Friedman	ac149ee60a	Add a comment which should hopefully make the purpose of this method a bit clearer. llvm-svn: 72374	2009-05-24 20:32:10 +00:00
Eli Friedman	fd8b335ca4	Minor improvement to FCOPYSIGN to use BIT_CONVERT in cases where the corresponding integer type is legal. llvm-svn: 72373	2009-05-24 20:29:11 +00:00
Eli Friedman	fe87034cef	Rewrite ISD::FCOPYSIGN lowering to never use i64. Not really ideal, but it's late, and I don't have any better ideas at the moment. Fixes PR4257. llvm-svn: 72363	2009-05-24 10:21:20 +00:00
Eli Friedman	cd2e0cd297	Update for CMakeLists; untested, so tell me if there are issues. llvm-svn: 72360	2009-05-24 09:13:13 +00:00
Eli Friedman	a4e1675dac	Remove checks of getTypeAction from LegalizeOp; we already assert that all results and all operands are legal, so this change shouldn't affect behavior at all. llvm-svn: 72359	2009-05-24 08:42:01 +00:00
Eli Friedman	5e0d150689	Disable type legalization in LegalizeDAG. This leaves around 4000 lines of dead code; I'll clean that up in subsequent commits. llvm-svn: 72358	2009-05-24 02:46:31 +00:00
Eli Friedman	7badee92ad	Fix a bug in the expansion of EXTRACT_SUBVECTOR in ExpandExtractFromVectorThroughStack. llvm-svn: 72351	2009-05-23 23:03:28 +00:00
Eli Friedman	40afdb63ec	Add a proper implementation of EXTRACT_SUBVECTOR legalization that doesn't split legal vector operands. This is necessary because the type legalization (and therefore, vector splitting) code will be going away soon. llvm-svn: 72349	2009-05-23 22:37:25 +00:00
Torok Edwin	be6a9a151a	Fix PR4254. The DAGCombiner created a negative shiftamount, stored in an unsigned variable. Later the optimizer eliminated the shift entirely as being undefined. Example: (srl (shl X, 56) 48). ShiftAmt is 4294967288. Fix it by checking that the shiftamount is positive, and storing in a signed variable. llvm-svn: 72331	2009-05-23 17:29:48 +00:00
Eli Friedman	da90dd6d72	Add a new step to legalization to legalize vector math operations. This will allow simplifying LegalizeDAG to eliminate type legalization. (I have a patch to do that, but it's not quite finished; I'll commit it once it's finished and I've fixed any review comments for this patch.) See the comment at the beginning of lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp for more details on the motivation for this patch. llvm-svn: 72325	2009-05-23 12:35:30 +00:00
Duncan Sands	d6fb6501e3	Add a new codegen pass that normalizes dwarf exception handling code in preparation for code generation. The main thing it does is handle the case when eh.exception calls (and, in a future patch, eh.selector calls) are far away from landing pads. Right now in practice you only find eh.exception calls close to landing pads: either in a landing pad (the common case) or in a landing pad successor, due to loop passes shifting them about. However future exception handling improvements will result in calls far from landing pads: (1) Inlining of rewinds. Consider the following case: In function @f: ... invoke @g to label %normal unwind label %unwinds ... unwinds: %ex = call i8* @llvm.eh.exception() ... In function @g: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... "rethrow exception" Now inline @g into @f. Currently this is turned into: In function @f: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... invoke "rethrow exception" to label %normal unwind label %unwinds unwinds: %ex = call i8* @llvm.eh.exception() ... However we would like to simplify invoke of "rethrow exception" into a branch to the %unwinds label. Then %unwinds is no longer a landing pad, and the eh.exception call there is then far away from any landing pads. (2) Using the unwind instruction for cleanups. It would be nice to have codegen handle the following case: invoke @something to label %continue unwind label %run_cleanups ... handler: ... perform cleanups ... unwind This requires turning "unwind" into a library call, which necessarily takes a pointer to the exception as an argument (this patch also does this unwind lowering). But that means you are using eh.exception again far from a landing pad. (3) Bugpoint simplifications. When bugpoint is simplifying exception handling code it often generates eh.exception calls far from a landing pad, which then causes codegen to assert. Bugpoint then latches on to this assertion and loses sight of the original problem. Note that it is currently rare for this pass to actually do anything. And in fact it normally shouldn't do anything at all given the code coming out of llvm-gcc! But it does fire a few times in the testsuite. As far as I can see this is almost always due to the LoopStrengthReduce codegen pass introducing pointless loop preheader blocks which are landing pads and only contain a branch to another block. This other block contains an eh.exception call. So probably by tweaking LoopStrengthReduce a bit this can be avoided. llvm-svn: 72276	2009-05-22 20:36:31 +00:00
Jay Foad	7d0479f2c2	Use v.data() instead of &v[0] when SmallVector v might be empty. llvm-svn: 72210	2009-05-21 09:52:38 +00:00
Bill Wendling	f99bd3a82b	Temporarily revert r72191. It was causing an assert during llvm-gcc bootstrapping. llvm-svn: 72200	2009-05-21 00:04:55 +00:00
Argyrios Kyrtzidis	2b59a5fc6c	Introduce DebugScope which gets embedded into the machine instructions' DebugLoc. DebugScope refers to a debug region, function or block. llvm-svn: 72191	2009-05-20 22:57:17 +00:00
Eli Friedman	9030c35eb4	Fix for PR4235: to build a floating-point value from integer parts, build an integer and cast that to a float. This fixes a crash caused by trying to split an f32 into two f16's. This changes the behavior in test/CodeGen/XCore/fneg.ll because that testcase now triggers a DAGCombine which converts the fneg into an integer operation. If someone is interested, it's probably possible to tweak the test to generate an actual fneg. llvm-svn: 72162	2009-05-20 06:02:09 +00:00
Dan Gohman	d697a2dd8e	Remove the #ifndef NDEBUG from the FastISel debugging options. This fixes dejagnu tests that use these options. llvm-svn: 72094	2009-05-19 02:19:57 +00:00
Bill Wendling	d2dc9063d7	Revert last commit. It was wrong. llvm-svn: 72026	2009-05-18 18:21:03 +00:00
Bill Wendling	af7e400fda	Don't call RegionInlinedFnEnd if our optimization level isn't -O0. llvm-svn: 72024	2009-05-18 18:17:22 +00:00
Daniel Dunbar	a8c1658619	Silence Release-Asserts warnings. llvm-svn: 72011	2009-05-18 16:43:04 +00:00
Duncan Sands	83d008614f	Put back a bit of expensive checking logic that was overenthusiastically deleted in r70234. llvm-svn: 71926	2009-05-16 04:14:29 +00:00
Dan Gohman	d4f63052c4	Add an assert to turn a segfault on an unsupported inline asm construct into an assertion failure. llvm-svn: 71757	2009-05-14 00:30:16 +00:00
Jim Grosbach	4f915313ed	Removing the HasBuiltinSetjmp flag and associated bits. Flagging the presence of exception handling builtin sjlj targets in functions turns out not to be necessary. Marking the intrinsic implementation in the .td file as defining all registers is sufficient to get the context saved properly by the containing function. llvm-svn: 71743	2009-05-13 23:50:53 +00:00
Evan Cheng	ab0d23396a	Run code placement optimization for targets that want it (arm and x86 for now). llvm-svn: 71726	2009-05-13 21:42:09 +00:00
Jim Grosbach	aeca45dd6f	Add support for GCC compatible builtin setjmp and longjmp intrinsics. This is a supporting preliminary patch for GCC-compatible SjLJ exception handling. Note that these intrinsics are not designed to be invoked directly by the user, but rather used by the front-end as target hooks for exception handling. llvm-svn: 71610	2009-05-12 23:59:14 +00:00
Dan Gohman	9521cadff7	When scalarizing a vector BITCAST, check whether the operand has vector type, rather than assume that it does. If the operand is not vector, it shouldn't be run through ScalarizeVectorOp. This fixes one of the testcases in PR3886. llvm-svn: 71453	2009-05-11 18:30:42 +00:00
Bill Wendling	d6280534e4	--- Reverse-merging r71370 into '.': U lib/CodeGen/SelectionDAG/SelectionDAGBuild.cpp Revert r71370. llvm-svn: 71373	2009-05-10 00:10:50 +00:00
Bill Wendling	d53af35629	A debug function start was not being recorded when the optimization level wasn't None. However, we were always recording the region end. There's no longer a good reason for this code to be separated out between the different opt levels, as it was doing pretty much the same thing anyway. llvm-svn: 71370	2009-05-09 23:51:35 +00:00
Duncan Sands	af9eaa830a	Rename PaddedSize to AllocSize, in the hope that this will make it more obvious what it represents, and stop it being confused with the StoreSize. llvm-svn: 71349	2009-05-09 07:06:46 +00:00
Bill Wendling	8881780832	Mirror how Fast ISel determines if a region.end intrinsic is the end of an inlined function or the end of a function. Before, this was never executing the "inlined" version of the Record method. This will become important once the inlined Dwarf writer patch lands. llvm-svn: 71268	2009-05-08 21:14:49 +00:00
Anton Korobeynikov	65a58168cc	Factor out cycle-finder code and make it generic. llvm-svn: 71241	2009-05-08 18:51:58 +00:00
Anton Korobeynikov	c94dbf5ba0	Do not emit bit tests if target does not support natively left shift llvm-svn: 71240	2009-05-08 18:51:34 +00:00
Anton Korobeynikov	e7a9661f31	Properly expand libcalls for urem / srem. Also make code more straightforward. llvm-svn: 71238	2009-05-08 18:51:08 +00:00
Anton Korobeynikov	e2b78115d4	Typo llvm-svn: 71237	2009-05-08 18:50:54 +00:00
Dan Gohman	4bb6fa23cb	Revert 71165. It did more than just revert 71158 and it introduced several regressions. The problem due to 71158 is now fixed. llvm-svn: 71176	2009-05-07 19:46:24 +00:00
Bill Wendling	17f0f65499	Temporarily revert r71158. It was causing a failure during a full bootstrap: checking for bcopy... no checking for getc_unlocked... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decUtility.c:360: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decUtility.o] Error 1 make[4]: * Waiting for unfinished jobs.... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decNumber.c:5591: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decNumber.o] Error 1 make[3]: * [all-stage2-libdecnumber] Error 2 make[3]: *** Waiting for unfinished jobs.... llvm-svn: 71165	2009-05-07 17:26:14 +00:00
Argyrios Kyrtzidis	baf3fee885	Make DwarfWriter::RecordInlinedFnStart more like the other DwarfWriter's methods: -Have it return a label ID -Remove the unused Instruction parameter No functionality change. llvm-svn: 71132	2009-05-07 00:16:31 +00:00
Evan Cheng	cfc0513080	Do not use register as base ptr of pre- and post- inc/dec load / store nodes. llvm-svn: 71098	2009-05-06 18:25:01 +00:00
Duncan Sands	2338f6c57e	Add generic expansion of SUB when ADD and XOR are legal. Based on a patch by Micah Villmow. llvm-svn: 71078	2009-05-06 11:29:50 +00:00
Evan Cheng	1ff2727c95	Move getInstrOperandRegClass from the scheduler to TargetInstrInfo. llvm-svn: 70950	2009-05-05 00:30:09 +00:00
Chris Lattner	354b12259f	Make DBG_STOPPOINT nodes, and therefore DBG_LABEL labels, get a DebugLoc, so that it shows up in -print-machineinstrs. This doesn't appear to affect anything, but it was weird for some DBG_LABELs to have DebugLocs but not all of them. llvm-svn: 70921	2009-05-04 22:10:05 +00:00
Argyrios Kyrtzidis	9ae29b2d8f	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. (Comes with Regression-Be-Gone(tm)) llvm-svn: 70871	2009-05-04 16:23:49 +00:00
Argyrios Kyrtzidis	79be34012f	Revert r70803 for now, it causes a regression. llvm-svn: 70811	2009-05-03 23:27:19 +00:00
Argyrios Kyrtzidis	ce7196b903	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. llvm-svn: 70803	2009-05-03 22:03:35 +00:00
Anton Korobeynikov	2745bc92fa	Fix typo llvm-svn: 70770	2009-05-03 13:19:57 +00:00
Anton Korobeynikov	05b7a7c8f8	Properly handle sdiv / udiv / srem / urem libcalls llvm-svn: 70764	2009-05-03 13:18:16 +00:00
Anton Korobeynikov	399ad444fd	Proper name 16 bit libcalls llvm-svn: 70750	2009-05-03 13:14:08 +00:00
Anton Korobeynikov	f3fc92d6fc	Add libcall expansion for 16 and 128 bit muls llvm-svn: 70749	2009-05-03 13:13:51 +00:00
Argyrios Kyrtzidis	97324cec99	-Move the DwarfWriter::ValidDebugInfo check to a static DIDescriptor::ValidDebugInfo -Create DebugLocs without the need to have a DwarfWriter around llvm-svn: 70682	2009-05-03 08:50:41 +00:00
Bob Wilson	62a3124fb8	Allow CONCAT_VECTORS nodes to be legal or have custom lowering for some targets. Changes to take advantage of this will come later. llvm-svn: 70560	2009-05-01 17:55:32 +00:00
Argyrios Kyrtzidis	a5037484a4	Make DebugLoc independent of DwarfWriter. -Replace DebugLocTuple's Source ID with CompileUnit's GlobalVariable* -Remove DwarfWriter::getOrCreateSourceID -Make necessary changes for the above (fix callsites, etc.) llvm-svn: 70520	2009-04-30 23:22:31 +00:00
Jay Foad	fe0c648fee	Move helper functions for optimizing division by constant into the APInt class. llvm-svn: 70488	2009-04-30 10:15:35 +00:00
Chris Lattner	5ab42e93c4	fix a regression handling indirect results: these need to be considered memory operands otherwise the writebacks get lost when the inline asm doesn't otherwise have side effects. This fixes rdar://6839427, though clang really shouldn't generate these anymore. llvm-svn: 70455	2009-04-30 00:48:50 +00:00
Bill Wendling	026e5d7667	Instead of passing in an unsigned value for the optimization level, use an enum, which better identifies what the optimization is doing. And is more flexible for future uses. llvm-svn: 70440	2009-04-29 23:29:43 +00:00
Nate Begeman	7e6e352735	Fix infinite recursion in the C++ code which handles movddup by making it unnecessary. llvm-svn: 70425	2009-04-29 22:47:44 +00:00
Nate Begeman	39b59db245	Update comment, replace theoretically impossible check with an assert. llvm-svn: 70391	2009-04-29 18:13:31 +00:00
Nate Begeman	5f829d896d	Implement review feedback for vector shuffle work. llvm-svn: 70372	2009-04-29 05:20:52 +00:00
Sanjiv Gupta	ccd30945f9	Add a public method called getAddressSpace() to the GlobalAddressSDNode. llvm-svn: 70366	2009-04-29 04:43:24 +00:00
Chris Lattner	7d10386113	Disable the load-shrinking optimization from looking at anything larger than 64-bits, avoiding a crash. This should really be fixed to use APInts, though type legalization happens to help us out and we get good code on the attached testcase at least. This fixes rdar://6836460 llvm-svn: 70360	2009-04-29 03:45:07 +00:00
Bill Wendling	084669a1c9	Second attempt: Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'll change the JIT with a follow-up patch. llvm-svn: 70343	2009-04-29 00:15:41 +00:00
Jakob Stoklund Olesen	604248e81f	Move getSubRegisterRegClass from ScheduleDagSDNodesEmit.cpp to a TargetRegisterClass method. Also make the method non-asserting. It will return NULL when given an invalid subreg index. The method is needed by an upcoming patch. llvm-svn: 70296	2009-04-28 16:34:09 +00:00
Bill Wendling	56f2987a87	r70270 isn't ready yet. Back this out. Sorry for the noise. llvm-svn: 70275	2009-04-28 01:04:53 +00:00
Bill Wendling	d0ae15946c	Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'm not 100% sure if it's necessary to change it there... llvm-svn: 70270	2009-04-28 00:21:31 +00:00
Duncan Sands	bfa037705e	Now that PR2957 is resolved, remove a bunch of no-longer needed workarounds. llvm-svn: 70234	2009-04-27 19:33:03 +00:00
Nate Begeman	8d6d4b9289	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Dan Gohman	be36f5ccda	When transforming sext(trunc(load(x))) into sext(smaller load(x)), the trunc is directly replaced with the smaller load, so don't try to create a new sext node. This fixes PR4050. llvm-svn: 70179	2009-04-27 02:00:55 +00:00
Dan Gohman	fe9e1d5b59	Refactor the code to grab the low and high parts of a value using EXTRACT_ELEMENT into a utility function. llvm-svn: 70056	2009-04-25 17:55:53 +00:00
Dan Gohman	4539987920	Add a top-level comment about DAGCombiner's role in the compiler. llvm-svn: 70052	2009-04-25 17:09:45 +00:00
Dale Johannesen	56cb14c874	Fix PR 4057, a crash doing float->char const folding. This particular one is undefined behavior (although this isn't related to the crash), so it will no longer do it at compile time, which seems better. llvm-svn: 69990	2009-04-24 21:34:13 +00:00
Rafael Espindola	b93db668b3	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	bb881d66f4	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Dan Gohman	640a161c73	Instead of requiring TLI.LowerCallTo to return an ISD::BUILD_PAIR, use ISD::EXTRACT_ELEMENT. SelectionDAG has a special fast-path for the cast of an EXTRACT_ELEMENT with a BUILD_PAIR operand, for the common case. llvm-svn: 69948	2009-04-24 02:40:23 +00:00
Dan Gohman	9478c3f8e5	Factor out a bit of code that appears in several places into a utility function. llvm-svn: 69937	2009-04-23 23:13:24 +00:00
Dan Gohman	a290ab44e8	Handle Void types in ComputeValueVTs. This doesn't currently occur, but this change makes the code more general and easier to adapt for new purposes. llvm-svn: 69935	2009-04-23 22:50:03 +00:00
Dan Gohman	1addf64735	Make X86's copyRegToReg able to handle copies to and from subclasses. This makes the extra copyRegToReg calls in ScheduleDAGSDNodesEmit.cpp unnecessary. Derived from a patch by Jakob Stoklund Olesen. llvm-svn: 69635	2009-04-20 22:54:34 +00:00
Dan Gohman	e014b69919	Simplify this code. getConstant knows how to make broadcasted vector constants. llvm-svn: 69634	2009-04-20 22:51:43 +00:00
Bob Wilson	da188ebbbd	Revise my previous change 68996 as suggested by Duncan. llvm-svn: 69607	2009-04-20 17:27:09 +00:00
Duncan Sands	f2e7133d34	Now that BUILD_VECTOR operands are allowed to be bigger than the vector element type, turn checking of the operand type back on again, appropriately adjusted. llvm-svn: 69516	2009-04-19 06:40:30 +00:00
Chris Lattner	7b01e66443	Fix PR3898, which manifests as failures on are an Xcore, patch by Jakob Stoklund Olesen! llvm-svn: 69472	2009-04-18 20:48:07 +00:00
Duncan Sands	e4ff21ba4b	Don't try to make BUILD_VECTOR operands have the same type as the vector element type: allow them to be of a wider integer type than the element type all the way through the system, and not just as far as LegalizeDAG. This should be safe because it used to be this way (the old type legalizer would produce such nodes), so backends should be able to handle it. In fact only targets which have legal vector types with an illegal promoted element type will ever see this (eg: <4 x i16> on ppc). This fixes a regression with the new type legalizer (vec_splat.ll). Also, treat SCALAR_TO_VECTOR the same as BUILD_VECTOR. After all, it is just a special case of BUILD_VECTOR. llvm-svn: 69467	2009-04-18 20:16:54 +00:00
Dale Johannesen	ad968ee286	Inline asm's were still introducing bogus dependencies; my earlier patch to this code only fixed half of it. llvm-svn: 69408	2009-04-18 00:09:40 +00:00
Dan Gohman	eefba6bbe0	In the list-burr's pseudo two-addr dependency heuristics, don't add dependencies on nodes with exactly one successor which is a COPY_TO_REGCLASS node. In the case that the copy is coalesced away, the dependence should be on the user of the copy, rather than the copy itself. llvm-svn: 69309	2009-04-16 20:59:02 +00:00
Dan Gohman	3027bb6953	Handle SUBREG_TO_REG instructions with the same heuristics as INSERT_SUBREG instructions in the list-burr scheduler. llvm-svn: 69308	2009-04-16 20:57:10 +00:00
Devang Patel	dab01f3fd6	Do not treat beginning of inlined scope as beginning of normal function scope if the location info is missing. Insetad of doing ... if (inlined_subroutine && known_location) DW_TAG_inline_subroutine else DW_TAG_subprogram do if (inlined_subroutine) { if (known_location) DW_TAG_inline_subroutine } else { DW_TAG_subprogram } llvm-svn: 69300	2009-04-16 17:55:30 +00:00
Devang Patel	9ac4390bf4	Record line number at the beginning of a func.start. This line was accidently lost yesterday. llvm-svn: 69286	2009-04-16 15:07:09 +00:00
Devang Patel	653dee0884	In -fast mode do what FastISel does. This code could use some refactoring help! llvm-svn: 69254	2009-04-16 02:33:41 +00:00
Devang Patel	46b04e4d06	If FastISel is run and it has known DebugLoc then use it. llvm-svn: 69253	2009-04-16 01:33:10 +00:00
Devang Patel	43fc7e481b	If location where the function was inlined is not know then do not emit debug info describing inlinied region. llvm-svn: 69252	2009-04-16 01:31:54 +00:00
Devang Patel	2738d7312a	Add DISubprogram is not null check. This fixes test/CodeGen//2009-01-21-invalid-debug-info.m test case. llvm-svn: 69210	2009-04-15 20:11:08 +00:00
Dan Gohman	8aa28b9c34	Generalize one of the SelectionDAG::ReplaceAllUsesWith overloads to support replacing a node with another that has a superset of the result types. Use this instead of calling ReplaceAllUsesOfValueWith for each value. llvm-svn: 69209	2009-04-15 20:06:30 +00:00
Devang Patel	32d17a1a29	Construct and emit DW_TAG_inlined_subroutine DIEs for inlined subroutine scopes (only in FastISel mode). llvm-svn: 69116	2009-04-15 00:10:26 +00:00
Dan Gohman	e5cd1fcdb9	When the result of an EXTRACT_SUBREG, INSERT_SUBREG, or SUBREG_TO_REG operator is used by a CopyToReg to export the value to a different block, don't reuse the CopyToReg's register for the subreg operation result if the register isn't precisely the right class for the subreg operation. Also, rename the h-registers.ll test, now that there are more than one. llvm-svn: 69087	2009-04-14 22:17:14 +00:00
Dale Johannesen	83593f4167	Do not force asm's to be chained if they don't touch memory and aren't volatile. This was interfering with good scheduling. llvm-svn: 69008	2009-04-14 00:56:56 +00:00
Daniel Dunbar	097f630dad	Make these errors more noticable in build logs. llvm-svn: 68998	2009-04-13 22:26:09 +00:00
Bob Wilson	59dbbb2bb4	Change SelectionDAG type legalization to allow BUILD_VECTOR operands to be promoted to legal types without changing the type of the vector. This is following a suggestion from Duncan (http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019923.html). The transformation that used to be done during type legalization is now postponed to DAG legalization. This allows the BUILD_VECTORs to be optimized and potentially handled specially by target-specific code. It turns out that this is also consistent with an optimization done by the DAG combiner: a BUILD_VECTOR and INSERT_VECTOR_ELT may be combined by replacing one of the BUILD_VECTOR operands with the newly inserted element; but INSERT_VECTOR_ELT allows its scalar operand to be larger than the element type, with any extra high bits being implicitly truncated. The result is a BUILD_VECTOR where one of the operands has a type larger the the vector element type. Any code that operates on BUILD_VECTORs may now need to be aware of the potential type discrepancy between the vector element type and the BUILD_VECTOR operands. This patch updates all of the places that I could find to handle that case. llvm-svn: 68996	2009-04-13 22:05:19 +00:00
Dan Gohman	6c1426308c	Rename COPY_TO_SUBCLASS to COPY_TO_REGCLASS, and generalize it accordingly. Thanks to Jakob Stoklund Olesen for pointing out how this might be useful. llvm-svn: 68986	2009-04-13 21:06:25 +00:00
Bob Wilson	f6c2195383	Refactor some code in SelectionDAGLegalize::ExpandBUILD_VECTOR. llvm-svn: 68981	2009-04-13 20:20:30 +00:00
Devang Patel	0431504fb2	Right now, Debugging information to encode scopes (DW_TAG_lexical_block) relies on DBG_LABEL. Unfortunately this intefers with the quality of optimized code. This patch updates dwarf writer to encode scoping information in DWARF only in FastISel mode. llvm-svn: 68973	2009-04-13 18:13:16 +00:00
Devang Patel	80be3511ed	Reapply 68847. Now debug_inlined section is covered by TAI->doesDwarfUsesInlineInfoSection(), which is false by default. llvm-svn: 68964	2009-04-13 17:02:03 +00:00
Dan Gohman	60a446ab02	Add a new TargetInstrInfo MachineInstr opcode, COPY_TO_SUBCLASS. This will be used to replace things like X86's MOV32to32_. Enhance ScheduleDAGSDNodesEmit to be more flexible and robust in the presense of subregister superclasses and subclasses. It can now cope with the definition of a virtual register being in a subclass of a use. Re-introduce the code for recording register superreg classes and subreg classes. This is needed because when subreg extracts and inserts get coalesced away, the virtual registers are left in the correct subclass. llvm-svn: 68961	2009-04-13 15:38:05 +00:00
Chris Lattner	a101f6f8d3	make UpdateValueMap handle the possiblity that we could be copying into the right register, avoiding a copy. llvm-svn: 68889	2009-04-12 07:46:30 +00:00
Chris Lattner	ada5d6c37e	optimize FastISel::UpdateValueMap to avoid duplicate map lookups, and make it return the assigned register. llvm-svn: 68888	2009-04-12 07:45:01 +00:00
Dan Gohman	825236b116	Revert r68847. It breaks the build on non-Darwin targets, with this message from the assembler: Error: unknown pseudo-op: `.debug_inlined' llvm-svn: 68863	2009-04-11 15:57:04 +00:00
Devang Patel	790e60999e	Keep track of inlined functions and their locations. This information is collected when nested llvm.dbg.func.start intrinsics are seen. (Right now, inliner removes nested llvm.dbg.func.start intrinisics during inlining.) Create debug_inlined dwarf section using these information. This info is used by gdb, at least on Darwin, to enable better experience debugging inlined functions. See DwarfWriter.cpp for more information on structure of debug_inlined section. llvm-svn: 68847	2009-04-11 00:16:47 +00:00
Bob Wilson	f074ca7454	Clean up a bunch of whitespace issues and fix a comment typo. No functional changes. llvm-svn: 68808	2009-04-10 18:48:47 +00:00
Dan Gohman	e517ae4211	Now that register classes have names, include the name in debug output. llvm-svn: 68786	2009-04-10 15:59:38 +00:00
Dan Gohman	de912e2475	Remove the obsolete SelectionDAG::getNodeValueTypes and simplify code that uses it by using SelectionDAG::getVTList instead. llvm-svn: 68744	2009-04-09 23:54:40 +00:00
Devang Patel	a68bdef482	Silence unused variable warning. llvm-svn: 68735	2009-04-09 23:45:17 +00:00
Devang Patel	a2c2b85df4	llvm.dbg.func_start also defines beginning of function scope. llvm-svn: 68727	2009-04-09 21:42:11 +00:00
Dan Gohman	0e8d199f91	Generalize ExtendUsesToFormExtLoad to be usable for ANY_EXTEND, in addition to ZERO_EXTEND and SIGN_EXTEND. Fix a bug in the way it checked for live-out values, and simplify the way it find users by using SDNode::use_iterator's (relatively) new features. Also, make it slightly more permissive on targets with free truncates. In SelectionDAGBuild, avoid creating ANY_EXTEND nodes that are larger than necessary. If the target's SwitchAmountTy has enough bits, use it. This exposes the truncate to optimization early, enabling more optimizations. llvm-svn: 68670	2009-04-09 03:51:29 +00:00
Dan Gohman	e6db8ca5eb	Don't copy the operand of a SwitchInst into virtual registers as eagerly. This helps avoid CopyToReg nodes in some cases where they aren't needed, and also helps subsequent optimizer heuristics in cases where the extra nodes would cause the node to appear to have multiple results. This doesn't have a significant impact currently; it'll help an upcoming change. llvm-svn: 68667	2009-04-09 02:33:36 +00:00
Duncan Sands	5a82613db0	Soft float support for FREM. llvm-svn: 68614	2009-04-08 16:20:57 +00:00
Duncan Sands	fb438caac6	Soft float support for undef. Reported by Xerxes Rånby. llvm-svn: 68607	2009-04-08 13:33:37 +00:00
Dan Gohman	ad3e549a53	Implement support for using modeling implicit-zero-extension on x86-64 with SUBREG_TO_REG, teach SimpleRegisterCoalescing to coalesce SUBREG_TO_REG instructions (which are similar to INSERT_SUBREG instructions), and teach the DAGCombiner to take advantage of this on targets which support it. This eliminates many redundant zero-extension operations on x86-64. This adds a new TargetLowering hook, isZExtFree. It's similar to isTruncateFree, except it only applies to actual definitions, and not no-op truncates which may not zero the high bits. Also, this adds a new optimization to SimplifyDemandedBits: transform operations like x+y into (zext (add (trunc x), (trunc y))) on targets where all the casts are no-ops. In contexts where the high part of the add is explicitly masked off, this allows the mask operation to be eliminated. Fix the DAGCombiner to avoid undoing these transformations to eliminate casts on targets where the casts are no-ops. Also, this adds a new two-address lowering heuristic. Since two-address lowering runs before coalescing, it helps to be able to look through copies when deciding whether commuting and/or three-address conversion are profitable. Also, fix a bug in LiveInterval::MergeInClobberRanges. It didn't handle the case that a clobber range extended both before and beyond an existing live range. In that case, multiple live ranges need to be added. This was exposed by the new subreg coalescing code. Remove 2008-05-06-SpillerBug.ll. It was bugpoint-reduced, and the spiller behavior it was looking for no longer occurrs with the new instruction selection. llvm-svn: 68576	2009-04-08 00:15:30 +00:00
Devang Patel	10f7c3deb7	Revert prev. patch for now. llvm-svn: 68569	2009-04-07 23:00:04 +00:00
Devang Patel	ddafc03e41	Right now DBG_LABEL are required for llvm.dbg.region_start and llvm.dbg.region_end in non-fast mode also. llvm-svn: 68559	2009-04-07 22:27:56 +00:00
Dan Gohman	ca93aabeba	Don't attempt to handle aggregate argument values in FastISel; let SelectionDAG do those. This fixes PR3955. llvm-svn: 68546	2009-04-07 20:40:11 +00:00
Dan Gohman	8bff8a1e87	Fix a TargetLowering optimization so that it doesn't duplicate loads when an input node has multiple uses. llvm-svn: 68398	2009-04-03 20:11:30 +00:00
Dan Gohman	b425feb2aa	Delete ISD::INSERT_SUBREG and ISD::EXTRACT_SUBREG, which are unused. Note that these are distinct from TargetInstrInfo::INSERT_SUBREG and TargetInstrInfo::EXTRACT_SUBREG, which are used. llvm-svn: 68355	2009-04-03 00:25:26 +00:00
Sanjiv Gupta	cc841a3810	To convert the StopPoint insn into an assembler directive by ISel, we need to have access to the line number field. So we convert that info as an operand by custom handling DBG_STOPPOINT in legalize. llvm-svn: 68329	2009-04-02 18:03:10 +00:00
Evan Cheng	0d551591ea	Fully general expansion of integer shift of any size. llvm-svn: 68134	2009-03-31 19:39:24 +00:00
Dan Gohman	d51f196ff5	Minor top-level comment fix. llvm-svn: 68113	2009-03-31 16:51:18 +00:00
Dan Gohman	97a20b8dbf	Fix live-out reg logic to not insert over-aggressive AssertZExt instructions. This fixes lua. llvm-svn: 68083	2009-03-31 01:38:29 +00:00
Duncan Sands	d21581eaa1	Fix PR3899: add support for extracting floats from vectors when using -soft-float. Based on a patch by Jakob Stoklund Olesen. llvm-svn: 67996	2009-03-29 13:51:06 +00:00
Arnold Schwaighofer	e622cbf385	Make check in CheckTailCallReturnConstraints for ignorable instructions between a CALL and a RET node more generic. Add a test for tail calls with a void return. llvm-svn: 67943	2009-03-28 12:36:29 +00:00
Arnold Schwaighofer	83d5420d02	Enable tail call optimization for functions that return a struct (bug 3664) and for functions that return types that need extending (e.g i1). llvm-svn: 67934	2009-03-28 08:33:27 +00:00
Evan Cheng	fd81c73cde	Optimize some 64-bit multiplication by constants into two lea's or one lea + shl since imulq is slow (latency 5). e.g. x * 40 => shlq $3, %rdi leaq (%rdi,%rdi,4), %rax This has the added benefit of allowing more multiply to be folded into addressing mode. e.g. a * 24 + b => leaq (%rdi,%rdi,2), %rax leaq (%rsi,%rax,8), %rax llvm-svn: 67917	2009-03-28 05:57:29 +00:00
Dan Gohman	2785e4be37	Fix what surely must be a copy+pasto. llvm-svn: 67881	2009-03-27 23:55:04 +00:00
Dan Gohman	6d75876473	Initialize LiveOutInfo's APInt members to zero, as APInt's default constructor produces an uninitialized APInt. This fixes PR3896. llvm-svn: 67879	2009-03-27 23:51:02 +00:00
Bill Wendling	aa28be652c	Pull transform from target-dependent code into target-independent code. llvm-svn: 67742	2009-03-26 06:14:09 +00:00
Evan Cheng	2e9f42bed5	Revert 67132. This is breaking some objective-c apps. Also fixes SDISel so it does not force promote return value if the function is not marked signext / zeroext. llvm-svn: 67701	2009-03-25 20:20:11 +00:00
Dale Johannesen	eb1646d28c	When optimizing with debug info, don't keep the stoppoint nodes around until Legalize; doing this imposed an ordering on a sequence of loads that came from different lines, interfering with scheduling. llvm-svn: 67692	2009-03-25 17:36:08 +00:00
Chris Lattner	c35847e109	more tidying: name the components of PhysReg in the case when the target constraint specifies a specific physreg. llvm-svn: 67618	2009-03-24 15:27:37 +00:00
Chris Lattner	42eceb3491	Tidy a bit more. llvm-svn: 67617	2009-03-24 15:25:07 +00:00
Chris Lattner	246eda43bd	simplify this code a bit now that "allocation to a vreg class" can never fail. llvm-svn: 67616	2009-03-24 15:22:11 +00:00
Dan Gohman	f3746cbc56	Minor compile-time optimization; don't bother checking canClobberPhysRegDefs if the successor node doesn't clobber any physical registers. llvm-svn: 67587	2009-03-24 00:50:07 +00:00
Dan Gohman	9a658d72db	Add a pre-pass to the burr-list scheduler which makes adjustments to help out the register pressure reduction heuristics in the case of nodes with multiple uses. Currently this uses very conservative heuristics, so it doesn't have a broad impact, but in cases where it does help it can make a big difference. llvm-svn: 67586	2009-03-24 00:49:12 +00:00
Dan Gohman	ed0e8d44ce	When unfolding a load during scheduling, the new operator node has a data dependency on the load node, so it really needs a data-dependence edge to the load node, even if the load previously existed. And add a few comments. llvm-svn: 67554	2009-03-23 20:20:43 +00:00
Dan Gohman	f477262e69	Don't set SUnit::hasPhysRegDefs to true unless the defs are actually have uses, which reflects the way it's used. llvm-svn: 67540	2009-03-23 17:39:36 +00:00
Dan Gohman	a366da1bf7	Fix canClobberPhysRegDefs to check all SDNodes grouped together in an SUnit, instead of just the first one. This fix is needed by some upcoming scheduler changes. llvm-svn: 67531	2009-03-23 16:23:01 +00:00
Dan Gohman	52c278e54d	Add a new bit to SUnit to record whether a node has implicit physreg defs, regardless of whether they are actually used. llvm-svn: 67528	2009-03-23 16:10:52 +00:00
Dan Gohman	4f2fea1a21	Now that errs() is properly non-buffered, there's no need to explicitly flush it. llvm-svn: 67526	2009-03-23 15:57:19 +00:00
Evan Cheng	968c3b0d6e	Model inline asm constraint which ties an input to an output register as machine operand TIED_TO constraint. This eliminated the need to pre-allocate registers for these. This also allows register allocator can eliminate the unneeded copies. llvm-svn: 67512	2009-03-23 08:01:15 +00:00
Dan Gohman	3bdc4bdba6	Simplify this code; use a while instead of an if and a do-while. llvm-svn: 67400	2009-03-20 20:42:23 +00:00
Evan Cheng	2e55923fba	For inline asm output operand that matches an input. Encode the input operand index in the high bits. llvm-svn: 67387	2009-03-20 18:03:34 +00:00
Sanjiv Gupta	e9759c458c	Fixed the comment. No functionality change. llvm-svn: 67370	2009-03-20 09:38:50 +00:00
Mon P Wang	32c8074be6	Added missing support for widening when splitting an unary op (PR3683) and expanding a bit convert (PR3711). In both cases, we extract the valid part of the widen vector and then do the conversion. llvm-svn: 67175	2009-03-18 06:24:04 +00:00
Rafael Espindola	4606b12108	Don't force promotion of return arguments on the callee. Some architectures (like x86) don't require it. This fixes bug 3779. llvm-svn: 67132	2009-03-17 23:43:59 +00:00
Chris Lattner	2363d0b8b9	Fix codegen to compute the size of an allocation by multiplying the size by the array amount as an i32 value instead of promoting from i32 to i64 then doing the multiply. Not doing this broke wrap-around assumptions that the optimizers (validly) made. The ultimate real fix for this is to introduce i64 version of alloca and remove mallocinst. This fixes PR3829 llvm-svn: 67093	2009-03-17 19:36:00 +00:00
Mon P Wang	523c0852c6	Fix a problem with DAGCombine where we were building an illegal build vector shuffle mask. Forced the mask to be built using i32. Note: this will be irrelevant once vector_shuffle no longer takes a build vector for the shuffle mask. llvm-svn: 67076	2009-03-17 06:33:10 +00:00
Mon P Wang	c86715631c	Avoid doing the transformation c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4 if FPConstant is legal because if the FPConstant doesn't need to be stored in a constant pool, the transformation is unlikely to be profitable. llvm-svn: 66994	2009-03-14 00:25:19 +00:00
Dan Gohman	a62e4ab690	Improve FastISel's handling of truncates to i1, and implement ptrtoint and inttoptr in X86FastISel. These casts aren't always handled in the generic FastISel code because X86 sometimes needs custom code to do truncation and zero-extension. llvm-svn: 66988	2009-03-13 23:53:06 +00:00
Dan Gohman	c0bb959591	Fix FastISel's assumption that i1 values are always zero-extended by inserting explicit zero extensions where necessary. Included is a testcase where SelectionDAG produces a virtual register holding an i1 value which FastISel previously mistakenly assumed to be zero-extended. llvm-svn: 66941	2009-03-13 20:42:20 +00:00
Evan Cheng	1fb8aedd1e	Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues. 1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants. 2. MachineConstantPool alignment field is also a log2 value. 3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values. 4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries. 5. Asm printer uses expensive data structure multimap to track constant pool entries by sections. 6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic. Solutions: 1. ConstantPoolSDNode alignment field is changed to keep non-log2 value. 2. MachineConstantPool alignment field is also changed to keep non-log2 value. 3. Functions that create ConstantPool nodes are passing in non-log2 alignments. 4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT. 5. Asm printer uses cheaper data structure to group constant pool entries. 6. Asm printer compute entry offsets after grouping is done. 7. Change JIT code to compute entry offsets on the fly. llvm-svn: 66875	2009-03-13 07:51:59 +00:00
Bill Wendling	fa54bc2052	Oops...I committed too much. llvm-svn: 66867	2009-03-13 04:39:26 +00:00
Bill Wendling	b02eadf660	Temporarily XFAIL this test. llvm-svn: 66866	2009-03-13 04:37:11 +00:00
Dan Gohman	a19c662a83	Fix a typo in a comment. llvm-svn: 66843	2009-03-12 23:55:10 +00:00
Chris Lattner	4147f08e44	Move 3 "(add (select cc, 0, c), x) -> (select cc, x, (add, x, c))" related transformations out of target-specific dag combine into the ARM backend. These were added by Evan in r37685 with no testcases and only seems to help ARM (e.g. test/CodeGen/ARM/select_xform.ll). Add some simple X86-specific (for now) DAG combines that turn things like cond ? 8 : 0 -> (zext(cond) << 3). This happens frequently with the recently added cp constant select optimization, but is a very general xform. For example, we now compile the second example in const-select.ll to: _test: movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 seta %al movzbl %al, %eax movl 4(%esp), %ecx movsbl (%ecx,%eax,4), %eax ret instead of: _test: movl 4(%esp), %eax leal 4(%eax), %ecx movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 cmovbe %eax, %ecx movsbl (%ecx), %eax ret This passes multisource and dejagnu. llvm-svn: 66779	2009-03-12 06:52:53 +00:00
Evan Cheng	4465954638	Enable Chris' value propagation change. It make available known sign, zero, one bits information for values that are live out of basic blocks. The goal is to eliminate unnecessary sext, zext, truncate of values that are live-in to blocks. This does not handle PHI nodes yet. llvm-svn: 66777	2009-03-12 06:29:49 +00:00
Chris Lattner	43d6377f89	reapply my previous patch (r66358) with a tweak to set the alignment of the generated constant pool entry to the desired alignment of a type. If we don't do this, we end up trying to do movsd from 4-byte alignment memory. This fixes 450.soplex and 456.hmmer. llvm-svn: 66641	2009-03-11 05:08:08 +00:00
Evan Cheng	aa887653f4	Revert 66358 for now. It's breaking povray, 450.soplex, and 456.hmmer on x86 / Darwin. llvm-svn: 66574	2009-03-10 20:47:18 +00:00
Chris Lattner	4249b9a698	Fix PR3763 by using proper APInt methods instead of uint64_t's. llvm-svn: 66434	2009-03-09 20:22:18 +00:00
Bill Wendling	c6869f4695	Pass in a std::string when getting the names of debugging things. This cuts down on the number of times a std::string is created and copied. llvm-svn: 66396	2009-03-09 05:04:40 +00:00
Chris Lattner	ab5a443144	implement an optimization to codegen c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4. For 2009-03-07-FPConstSelect.ll we now produce: _f: xorl %eax, %eax testl %edi, %edi movl $4, %ecx cmovne %rax, %rcx leaq LCPI1_0(%rip), %rax movss (%rcx,%rax), %xmm0 ret previously we produced: _f: subl $4, %esp cmpl $0, 8(%esp) movss LCPI1_0, %xmm0 je LBB1_2 ## entry LBB1_1: ## entry movss LCPI1_1, %xmm0 LBB1_2: ## entry movss %xmm0, (%esp) flds (%esp) addl $4, %esp ret on PPC the code also improves to: _f: cntlzw r2, r3 srwi r2, r2, 5 li r3, lo16(LCPI1_0) slwi r2, r2, 2 addis r3, r3, ha16(LCPI1_0) lfsx f1, r3, r2 blr from: _f: li r2, lo16(LCPI1_1) cmplwi cr0, r3, 0 addis r2, r2, ha16(LCPI1_1) beq cr0, LBB1_2 ; entry LBB1_1: ; entry li r2, lo16(LCPI1_0) addis r2, r2, ha16(LCPI1_0) LBB1_2: ; entry lfs f1, 0(r2) blr This also improves the existing pic-cpool case from: foo: subl $12, %esp call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax cmpl $0, 16(%esp) movsd .LCPI1_0@GOTOFF(%eax), %xmm0 je .LBB1_2 # entry .LBB1_1: # entry movsd .LCPI1_1@GOTOFF(%eax), %xmm0 .LBB1_2: # entry movsd %xmm0, (%esp) fldl (%esp) addl $12, %esp ret to: foo: call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax xorl %ecx, %ecx cmpl $0, 4(%esp) movl $8, %edx cmovne %ecx, %edx fldl .LCPI1_0@GOTOFF(%eax,%edx) ret This triggers a few dozen times in spec FP 2000. llvm-svn: 66358	2009-03-08 01:51:30 +00:00
Chris Lattner	21cf4bf235	random cleanups. llvm-svn: 66357	2009-03-08 01:47:41 +00:00
Duncan Sands	12da8ce3d2	Introduce new linkage types linkonce_odr, weak_odr, common_odr and extern_weak_odr. These are the same as the non-odr versions, except that they indicate that the global will only be overridden by an equivalent global. In C, a function with weak linkage can be overridden by a function which behaves completely differently. This means that IP passes have to skip weak functions, since any deductions made from the function definition might be wrong, since the definition could be replaced by something completely different at link time. This is not allowed in C++, thanks to the ODR (One-Definition-Rule): if a function is replaced by another at link-time, then the new function must be the same as the original function. If a language knows that a function or other global can only be overridden by an equivalent global, it can give it the weak_odr linkage type, and the optimizers will understand that it is alright to make deductions based on the function body. The code generators on the other hand map weak and weak_odr linkage to the same thing. llvm-svn: 66339	2009-03-07 15:45:40 +00:00
Dan Gohman	15af5524a4	Fix ScheduleDAGRRList::CopyAndMoveSuccessors' handling of nodes with multiple chain operands. This can occur when the scheduler has added chain operands to a node that already has a chain operand, in order to handle physical register dependencies. This fixes an llvm-gcc bootstrap failure on x86-64 introduced in r66058. llvm-svn: 66240	2009-03-06 02:23:01 +00:00
Bob Wilson	5b15d01ff3	Fix BuildVectorSDNode::isConstantSplat to handle one-element vectors. It is an error to call APInt::zext with a size that is equal to the value's current size, so use zextOrTrunc instead. llvm-svn: 66039	2009-03-04 17:47:01 +00:00
Eli Friedman	7604d37723	PR3686: make the legalizer handle bitcast from i80 to x86 long double. llvm-svn: 66021	2009-03-04 06:23:34 +00:00
Evan Cheng	b8905c4e2c	Fix PR3701. 1. X86 target renamed eflags register to flags. This matches what llvm-gcc generates so codegen knows flags register is being clobbered by inline asm. 2. BURR scheduler should also check if inline asm nodes can clobber "live" physical registers. Previously it was only checking target nodes with implicit defs. llvm-svn: 65996	2009-03-04 01:41:49 +00:00
Bill Wendling	6d2714738f	The DAG combiner was performing a BT combine. The BT combine had a value of -1, so it changed it into a 31 via the TLO.ShrinkDemandedConstant() call. Then it would go through the DAG combiner again. This time it had a value of 31, which was turned into a -1 by TLI.SimplifyDemandedBits(). This would ping pong forever. Teach the TLO.ShrinkDemandedConstant() call not to lower a value if the demanded value is an XOR of all ones. llvm-svn: 65985	2009-03-04 00:18:06 +00:00
Bob Wilson	85cefe8567	Generalize BuildVectorSDNode::isConstantSplat to use APInts and handle arbitrary vector sizes. Add an optional MinSplatBits parameter to specify a minimum for the splat element size. Update the PPC target to use the revised interface. llvm-svn: 65899	2009-03-02 23:24:16 +00:00
Nate Begeman	a9e981225e	Fix a problem with DAGCombine on 64b targets where folding extracts + build_vector into a shuffle would fail, because the type of the new build_vector would not be legal. Try harder to create a legal build_vector type. Note: this will be totally irrelevant once vector_shuffle no longer takes a build_vector for shuffle mask. New: _foo: xorps %xmm0, %xmm0 xorps %xmm1, %xmm1 subps %xmm1, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 Old: _foo: xorps %xmm0, %xmm0 movss %xmm0, %xmm1 xorps %xmm2, %xmm2 unpcklps %xmm1, %xmm2 pshufd $80, %xmm1, %xmm1 unpcklps %xmm1, %xmm2 pslldq $16, %xmm2 pshufd $57, %xmm2, %xmm1 subps %xmm0, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 llvm-svn: 65791	2009-03-01 23:44:07 +00:00
Bob Wilson	d8ea0e144e	Combine PPC's GetConstantBuildVectorBits and isConstantSplat functions to a new method in a BuildVectorSDNode "pseudo-class". llvm-svn: 65747	2009-03-01 01:13:55 +00:00
Rafael Espindola	000421eade	Refactor TLS code and add some tests. The tests and expected results are: pic \| declaration \| linkage \| visibility \| !pic \| declaration \| external \| default \| tls1.ll tls2.ll \| local exec pic \| declaration \| external \| default \| tls1-pic.ll tls2-pic.ll \| general dynamic !pic \| !declaration \| external \| default \| tls3.ll tls4.ll \| initial exec pic \| !declaration \| external \| default \| tls3-pic.ll tls4-pic.ll \| general dynamic !pic \| declaration \| external \| hidden \| tls7.ll tls8.ll \| local exec pic \| declaration \| external \| hidden \| X \| local dynamic !pic \| !declaration \| external \| hidden \| tls9.ll tls10.ll \| local exec pic \| !declaration \| external \| hidden \| X \| local dynamic !pic \| declaration \| internal \| default \| tls5.ll tls6.ll \| local exec pic \| declaration \| internal \| default \| X \| local dynamic The ones marked with an X have not been implemented since local dynamic is not implemented. llvm-svn: 65632	2009-02-27 13:37:18 +00:00
Evan Cheng	a49de9de2e	Revert BuildVectorSDNode related patches: 65426, 65427, and 65296. llvm-svn: 65482	2009-02-25 22:49:59 +00:00
Dale Johannesen	7d12ea0f62	Fix big-endian codegen bug. We're splitting up overly long ints, e.g. i96, into pieces at PHIs and the nodes that feed into them; however big-endian reverses the order of the pieces (for some reason), and wasn't doing it the same way on both sides, so the pieces didn't match and runtime failures ensued. Fixes 188.ammp and sqlite3 on ppc32. llvm-svn: 65481	2009-02-25 22:39:13 +00:00
Evan Cheng	86673f2806	Clean up dwarf writer, part 1. This eliminated the horrible recursive getGlobalVariablesUsing and replaced it something readable. It eliminated use of slow UniqueVector and replaced it with StringMap, SmallVector, and DenseMap, etc. It also fixed some non-deterministic behavior. This is a very minor compile time win. llvm-svn: 65438	2009-02-25 07:04:34 +00:00
Scott Michel	e2fdc31759	Expand tabs to spaces (overlooked in previous commit) llvm-svn: 65427	2009-02-25 03:57:49 +00:00
Scott Michel	bb878288cb	Remove all "cached" data from BuildVectorSDNode, preferring to retrieve results via reference parameters. This patch also appears to fix Evan's reported problem supplied as a reduced bugpoint test case. llvm-svn: 65426	2009-02-25 03:12:50 +00:00
Bill Wendling	c5437ea429	Overhaul my earlier submission due to feedback. It's a large patch, but most of them are generic changes. - Use the "fast" flag that's already being passed into the asm printers instead of shoving it into the DwarfWriter. - Instead of calling "MI->getParent()->getParent()" for every MI, set the machine function when calling "runOnMachineFunction" in the asm printers. llvm-svn: 65379	2009-02-24 08:30:20 +00:00
Bill Wendling	786c5973f7	- Use the "Fast" flag instead of "OptimizeForSize" to determine whether to emit a DBG_LABEL or not. We want to fall back to the original way of emitting debug info when we're in -O0/-fast mode. - Add plumbing in to pass the "Fast" flag to places that need it. - XFAIL DebugInfo/deaddebuglabel.ll. This is finding 11 labels instead of 8. I need to investigate still. llvm-svn: 65367	2009-02-24 02:35:30 +00:00
Dan Gohman	4f356bb9b0	Fix a ValueTracking rule: RHS means operand 1, not 0. Add a simple ashr instcombine to help expose this code. And apply the fix to SelectionDAG's copy of this code too. llvm-svn: 65364	2009-02-24 02:00:40 +00:00
Scott Michel	9d31aca679	Introduce the BuildVectorSDNode class that encapsulates the ISD::BUILD_VECTOR instruction. The class also consolidates the code for detecting constant splats that's shared across PowerPC and the CellSPU backends (and might be useful for other backends.) Also introduces SelectionDAG::getBUID_VECTOR() for generating new BUILD_VECTOR nodes. llvm-svn: 65296	2009-02-22 23:36:09 +00:00
Richard Pennington	99f6d7c9fc	bug 3610: Floating point vaarg not softened. llvm-svn: 65239	2009-02-21 19:11:18 +00:00
Dan Gohman	e7fe80fcf9	Fix a bug that David Greene found in the DAGCombiner's logic that checks whether it's safe to transform a store of a bitcast value into a store of the original value. llvm-svn: 65201	2009-02-20 23:29:13 +00:00
Bill Wendling	7b9f38ad37	Temporarily revert r65065. It was causing test failures. llvm-svn: 65068	2009-02-19 21:57:07 +00:00
Bill Wendling	df78dcc0b2	Check for -fast here too. llvm-svn: 65065	2009-02-19 21:23:54 +00:00
Bill Wendling	19e0a5b3c3	Generate these labels when we're in "fast" mode, not simply when we're no in "optimize-for-size" mode. llvm-svn: 65064	2009-02-19 21:12:54 +00:00
Scott Michel	cf0da6c597	Remove trailing whitespace to reduce later commit patch noise. (Note: Eventually, commits like this will be handled via a pre-commit hook that does this automagically, as well as expand tabs to spaces and look for 80-col violations.) llvm-svn: 64827	2009-02-17 22:15:04 +00:00
Bill Wendling	3c50922ea0	--- Merging (from foreign repository) r64714 into '.': U include/llvm/CodeGen/DebugLoc.h U lib/CodeGen/SelectionDAG/LegalizeDAG.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuild.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp Enable debug location generation at -Os. This goes with the reapplication of the r63639 patch. llvm-svn: 64715	2009-02-17 01:04:54 +00:00
Dan Gohman	aaee6c9523	Don't assume that a left-shift of a value with one bit set will have one bit set, because the bit may be shifted off the end. Instead, just check for a constant 1 being shifted. This is still sufficient to handle all the cases in test/CodeGen/X86/bt.ll. This fixes PR3583. llvm-svn: 64622	2009-02-15 23:59:32 +00:00
Cedric Venet	d1e179d992	Unbreak the build on win32. Cleanup some warning. Remark: when struct/class are declared differently than they are defined, this make problem for VC++ since it seems to mangle class differently that struct. These error are very hard to understand and find. So please, try to keep your definition/declaration in sync. Only tested with VS2008. hope it does not break anything. feel free to revert. llvm-svn: 64554	2009-02-14 16:06:42 +00:00
Bill Wendling	65c0fd4c44	Revert this. It was breaking stuff. llvm-svn: 64428	2009-02-13 02:16:35 +00:00
Bill Wendling	1c21ac3066	Turn off the old way of handling debug information in the code generator. Use the new way, where all of the information is passed on SDNodes and machine instructions. llvm-svn: 64427	2009-02-13 02:01:04 +00:00
Dale Johannesen	655775293f	Arrange to print constants that match "n" and "i" constraints in inline asm as signed (what gcc does). Add partial support for x86-specific "e" and "Z" constraints, with appropriate signedness for printing. llvm-svn: 64400	2009-02-12 20:58:09 +00:00
Chris Lattner	90880e2598	make fast isel fall back to selectiondags for VLA llvm.declare intrinsics. llvm-svn: 64379	2009-02-12 17:23:20 +00:00
Evan Cheng	b570499c25	Oops. Last second clean up messed things up. llvm-svn: 64373	2009-02-12 09:52:13 +00:00
Evan Cheng	3a14efacb6	Replace one of burr scheduling heuristic with something more sensible. Now calcMaxScratches simply compute the number of true data dependencies. This actually improve a couple of tests in dejagnu suite as many tests in llvm nightly test suite. llvm-svn: 64369	2009-02-12 08:59:45 +00:00
Dan Gohman	45889d24fe	Fix a comment. llvm-svn: 64328	2009-02-11 21:32:08 +00:00
Dan Gohman	6571ef3577	Don't use special heuristics for nodes with no data predecessors unless they actually have data successors, and likewise for nodes with no data successors unless they actually have data precessors. llvm-svn: 64327	2009-02-11 21:29:39 +00:00
Dan Gohman	298a2946f1	Delete the heuristic for non-livein CopyFromReg nodes. Non-liveinness is determined by whether the node has a Flag operand. However, if the node does have a Flag operand, it will be glued to its register's def, so the heuristic would end up spuriously applying to whatever node is the def. llvm-svn: 64319	2009-02-11 20:25:59 +00:00
Dale Johannesen	cc5fc44d02	Make a transformation added in 63266 a bit less aggressive. It was transforming (x&y)==y to (x&y)!=0 in the case where y is variable and known to have at most one bit set (e.g. z&1). This is not correct; the expressions are not equivalent when y==0. I believe this patch salvages what can be salvaged, including all the cases in bt.ll. Dan, please review. Fixes gcc.c-torture/execute/20040709-[12].c llvm-svn: 64314	2009-02-11 19:19:41 +00:00
Dan Gohman	dfaf646c34	When scheduling a block in parts, keep track of the overall instruction index across each part. Instruction indices are used to make live range queries, and live ranges can extend beyond scheduling region boundaries. Refactor the ScheduleDAGSDNodes class some more so that it doesn't have to worry about this additional information. llvm-svn: 64288	2009-02-11 04:27:20 +00:00
Dan Gohman	b95434356c	Factor out more code for computing register live-range informationfor scheduling, and generalize is so that preserves state across scheduling regions. This fixes incorrect live-range information around terminators and labels, which are effective region boundaries. In place of looking for terminators to anchor inter-block dependencies, introduce special entry and exit scheduling units for this purpose. llvm-svn: 64254	2009-02-10 23:27:53 +00:00
Evan Cheng	ce3bbe515b	Fix PR3457: Ignore control successors when looking for closest scheduled successor. A control successor doesn't read result(s) produced by the scheduling unit being evaluated. llvm-svn: 64210	2009-02-10 08:30:11 +00:00
Evan Cheng	3af42a8a14	If the target cannot issue a copy for the given source and dest registers, abort instead of silently continue. llvm-svn: 64184	2009-02-09 22:47:36 +00:00
Evan Cheng	fe174df170	Simplify code. llvm-svn: 64164	2009-02-09 21:01:06 +00:00
Evan Cheng	020588cee3	Make sure constant subscript is truncated to ptr size if it may not fit. llvm-svn: 64163	2009-02-09 20:54:38 +00:00
Dale Johannesen	9c310711bb	Use getDebugLoc forwarder instead of getNode()->getDebugLoc. No functional change. llvm-svn: 64026	2009-02-07 19:59:05 +00:00
Dan Gohman	747e55bc9a	Constify TargetInstrInfo::EmitInstrWithCustomInserter, allowing ScheduleDAG's TLI member to use const. llvm-svn: 64018	2009-02-07 16:15:20 +00:00
Dale Johannesen	8ba7132128	Make SDNode constructors take a DebugLoc always. Adjust derived classes to pass UnknownLoc where a DebugLoc does not make sense. Pick one of DebugLoc and non-DebugLoc variants to survive for all such classes. llvm-svn: 64000	2009-02-07 02:15:05 +00:00
Dale Johannesen	a72d41a67b	Remove now-unused constructors. llvm-svn: 63995	2009-02-07 01:27:09 +00:00
Dale Johannesen	62fd95d6ec	Get rid of the last non-DebugLoc versions of getNode! Many targets build placeholder nodes for special operands, e.g. GlobalBaseReg on X86 and PPC for the PIC base. There's no sensible way to associate debug info with these. I've left them built with getNode calls with explicit DebugLoc::getUnknownLoc operands. I'm not too happy about this but don't see a good improvement; I considered adding a getPseudoOperand or something, but it seems to me that'll just make it harder to read. llvm-svn: 63992	2009-02-07 00:55:49 +00:00
Dale Johannesen	84935759d5	Remove more non-DebugLoc getNode variants. Use getCALLSEQ_{END,START} to permit passing no DebugLoc there. UNDEF doesn't logically have DebugLoc; add getUNDEF to encapsulate this. llvm-svn: 63978	2009-02-06 23:05:02 +00:00
Dale Johannesen	dc93bbc4b0	And one more file. llvm-svn: 63971	2009-02-06 21:55:48 +00:00
Dale Johannesen	400dc2e2e4	Remove more non-DebugLoc versions of getNode. llvm-svn: 63969	2009-02-06 21:50:26 +00:00
Bill Wendling	03c34d0d3c	Clear out the CurDebugLoc info when doing a 'clear' on the SDL object. llvm-svn: 63967	2009-02-06 21:36:23 +00:00
Dale Johannesen	ab8e4425a3	Eliminate remaining non-DebugLoc version of getTargetNode. llvm-svn: 63951	2009-02-06 19:16:40 +00:00
Dan Gohman	817a24f8e9	Rename SelectionDAGISel::Schedule to SelectionDAGISel::CreateScheduler, and make it just create the scheduler. Leave running the scheduler to the higher-level code. This makes the higher-level code a little more explicit and easier to follow, and will help enable some future refactoring. llvm-svn: 63944	2009-02-06 18:26:51 +00:00
Dan Gohman	cd2cd9f5d7	Delete an unused member function. llvm-svn: 63941	2009-02-06 18:19:52 +00:00
Evan Cheng	066757eea1	Move getPointerRegClass from TargetInstrInfo to TargetRegisterInfo. llvm-svn: 63938	2009-02-06 17:43:24 +00:00
Dan Gohman	483377c639	Move ScheduleDAGSDNodes.h to be a private header. Front-ends that previously included this header should include SchedulerRegistry.h instead. llvm-svn: 63937	2009-02-06 17:22:58 +00:00
Dale Johannesen	2c4cf2752d	get rid of some non-DebugLoc getTargetNode variants. llvm-svn: 63909	2009-02-06 02:08:06 +00:00
Dale Johannesen	9f3f72f144	Get rid of one more non-DebugLoc getNode and its corresponding getTargetNode. Lots of caller changes. llvm-svn: 63904	2009-02-06 01:31:28 +00:00
Dale Johannesen	f80493bbfd	Remove a non-DebugLoc version of getNode. llvm-svn: 63889	2009-02-05 22:07:54 +00:00
Dale Johannesen	3eb373f5ce	Remove 3 non-DebugLoc variants of getNode. llvm-svn: 63886	2009-02-05 21:20:44 +00:00
Mon P Wang	3f0e0a6dea	Fix a bug where we were not emitting a cvt rnd sat node for converting between a unsigned integer and signed integer. llvm-svn: 63831	2009-02-05 04:47:42 +00:00
Dale Johannesen	b842d529a3	Reapply 63765. Patches for clang and llvm-gcc to follow. llvm-svn: 63812	2009-02-05 01:49:45 +00:00
Dale Johannesen	12c572b6fa	Get rid of 3 non-DebugLoc getNode variants. llvm-svn: 63808	2009-02-05 01:01:16 +00:00
Dale Johannesen	7ae8c8b108	Remove non-DebugLoc versions of getMergeValues, ZeroExtendInReg. llvm-svn: 63800	2009-02-05 00:20:09 +00:00
Dale Johannesen	f08a47bb70	Remove non-DebugLoc forms of CopyToReg and CopyFromReg. Adjust callers. llvm-svn: 63789	2009-02-04 23:02:30 +00:00
Dale Johannesen	ae616c2c61	Reverting 63765. This broke the build of both clang and llvm-gcc. llvm-svn: 63786	2009-02-04 22:47:25 +00:00
Stuart Hastings	556bd92698	80 column rule. llvm-svn: 63768	2009-02-04 20:30:10 +00:00
Dale Johannesen	021052a705	Remove non-DebugLoc versions of getLoad and getStore. Adjust the many callers of those versions. llvm-svn: 63767	2009-02-04 20:06:27 +00:00
Nate Begeman	6ae3aa83d0	New feature: add support for target intrinsics being defined in the target directories themselves. This also means that VMCore no longer needs to know about every target's list of intrinsics. Future work will include converting the PowerPC target to this interface as an example implementation. llvm-svn: 63765	2009-02-04 19:47:21 +00:00
Mon P Wang	34650735d0	Avoids generating a legalization assert for the case where a vector type is legal but when legalizing the operation, we split the vector type and generate a library call whose type needs to be promoted. For example, X86 with SSE on but MMX off, a divide v2i64 will be scalarized to 2 calls to a library using i64. llvm-svn: 63760	2009-02-04 19:38:14 +00:00
Stuart Hastings	ffee3d831a	Since I'm obliged to work with a development OS that currently doesn't support GraphViz, I've been using the foo->dump() facility. This patch is a minor rewrite to the SelectionDAG dump() stuff to make it a little more helpful. The existing foo->dump() functionality does not change; this patch adds foo->dumpr(). All of this is only useful when running LLVM under a debugger. llvm-svn: 63736	2009-02-04 16:46:19 +00:00
Dale Johannesen	679073b420	Remove non-DebugLoc forms of the exotic forms of Lod and Sto; patch uses. llvm-svn: 63716	2009-02-04 02:34:38 +00:00
Dale Johannesen	f2bb6f09a3	Remove some more non-DebugLoc versions of construction functions, with callers adjusted to fit. llvm-svn: 63705	2009-02-04 01:48:28 +00:00
Dale Johannesen	efb82cfbf2	Check in file I forgot. llvm-svn: 63704	2009-02-04 01:33:20 +00:00
Dale Johannesen	85263882aa	Remove a few non-DebugLoc versions of node creation functions. llvm-svn: 63703	2009-02-04 01:17:06 +00:00
Dale Johannesen	9888edee10	Fill in more omissions in DebugLog propagation. I think that's it for this directory. llvm-svn: 63690	2009-02-04 00:13:36 +00:00
Dale Johannesen	3a09f5589d	DebugLoc propagation; adjustment to things omitted from SelectionDagBuild. llvm-svn: 63680	2009-02-03 23:04:43 +00:00
Dale Johannesen	abf66b8343	Add some DL propagation to places that didn't have it yet. More coming. llvm-svn: 63673	2009-02-03 22:26:09 +00:00
Devang Patel	70da8e8425	First initialize DAG otherwise dwarf writer is used uninitialized. Duncan spotted this. Thanks! llvm-svn: 63641	2009-02-03 18:46:32 +00:00
Duncan Sands	a77c5f758c	Fix PR3411. When replacing values, nodes are analyzed in any old order. Since analyzing a node analyzes its operands also, this can mean that when we pop a node off the list of nodes to be analyzed, it may already have been analyzed. llvm-svn: 63632	2009-02-03 10:23:33 +00:00
Bill Wendling	135227a060	Pass in something sensible for the debug location information when creating the initial PHI nodes of the machine function. llvm-svn: 63598	2009-02-03 02:20:52 +00:00
Dale Johannesen	db39362c90	Fill in some missing DL propagation in getNode()s. llvm-svn: 63595	2009-02-03 01:55:44 +00:00
Bill Wendling	143a2c3470	Use SDL->getCurDebugLoc() instead of unknown loc for landing pads. llvm-svn: 63594	2009-02-03 01:55:42 +00:00
Bill Wendling	fa50a23f8a	Explicitly pass in the "unknown" debug location. This is probably not correct. We need more infrastructure before we can get the DebugLoc info for these instructions. llvm-svn: 63593	2009-02-03 01:33:28 +00:00
Bill Wendling	9862a64419	Alphabetize includes. llvm-svn: 63591	2009-02-03 01:32:22 +00:00
Bill Wendling	17450acc3b	Propagate debug loc info during SDNode -> machine instr creation. llvm-svn: 63585	2009-02-03 01:02:39 +00:00
Bill Wendling	e3c78361d3	Create DebugLoc information in FastISel. Several temporary methods were created. Specifically, those BuildMIs which use "DebugLoc::getUnknownLoc()". I'll remove them soon. llvm-svn: 63584	2009-02-03 00:55:04 +00:00
Dale Johannesen	f1163e9a4d	Propagation in TargetLowering. Includes passing a DL into SimplifySetCC which gets called elsewhere. llvm-svn: 63583	2009-02-03 00:47:48 +00:00
Dan Gohman	76a07f59d4	Use the SubclassData field to hold ExtType, isTrunc, and MemIndexedMode information. This eliminates the need for the Flags field in MemSDNode, so this makes LoadSDNode and StoreSDNode smaller. Also, it makes FoldingSetNodeIDs for loads and stores two AddIntegers smaller. llvm-svn: 63577	2009-02-03 00:08:45 +00:00
Dale Johannesen	72ba6df1a9	Last DebugLoc propagation for this file. llvm-svn: 63574	2009-02-02 23:46:53 +00:00
Dale Johannesen	b5dd922a92	More DebugLoc propagation. This should be everything except LegalizeOp itself. llvm-svn: 63560	2009-02-02 22:49:46 +00:00
Dale Johannesen	a02e45ca19	DebugLoc propagation. ExpandOp and PromoteOp, among others. llvm-svn: 63555	2009-02-02 22:12:50 +00:00
Dale Johannesen	ae7992a333	Commit missing files. llvm-svn: 63545	2009-02-02 20:47:48 +00:00
Dale Johannesen	ad00f6e010	More DebugLoc propagation. llvm-svn: 63543	2009-02-02 20:41:04 +00:00
Duncan Sands	dab7be8774	Remove trailing spaces. llvm-svn: 63540	2009-02-02 19:46:41 +00:00
Dale Johannesen	8525d83aac	DebugLoc propagation for int<->fp conversions. llvm-svn: 63537	2009-02-02 19:03:57 +00:00
Sanjiv Gupta	8e56d1898b	Duncan's patch. Further to 64382. Takes care of illegal types for shift amount. llvm-svn: 63523	2009-02-02 17:19:39 +00:00
Mon P Wang	cc866c955c	Preserve more SourceValue information. llvm-svn: 63498	2009-02-02 06:37:55 +00:00
Duncan Sands	3ed768868d	Fix PR3453 and probably a bunch of other potential crashes or wrong code with codegen of large integers: eliminate the legacy getIntegerVTBitMask and getIntegerVTSignBit methods, which returned their value as a uint64_t, so couldn't handle huge types. llvm-svn: 63494	2009-02-01 18:06:53 +00:00
Bill Wendling	a6c75ffd73	Forgot some more DebugLoc propagations. llvm-svn: 63493	2009-02-01 11:19:36 +00:00
Dale Johannesen	dfbb6a1a9a	DebugLoc propagation. llvm-svn: 63488	2009-01-31 22:04:51 +00:00
Dale Johannesen	5f98ea28ca	DebugLoc propagation. Done with file. llvm-svn: 63486	2009-01-31 21:04:24 +00:00
Dale Johannesen	4d9fa9e71d	DebugLoc propagation. Done with file. llvm-svn: 63485	2009-01-31 20:01:02 +00:00
Duncan Sands	41826036b1	Fix PR3401: when using large integers, the type returned by getShiftAmountTy may be too small to hold shift values (it is an i8 on x86-32). Before and during type legalization, use a large but legal type for shift amounts: getPointerTy; afterwards use getShiftAmountTy, fixing up any shift amounts with a big type during operation legalization. Thanks to Dan for writing the original patch (which I shamelessly pillaged). llvm-svn: 63482	2009-01-31 15:50:11 +00:00
Mon P Wang	cf9ba82324	If unsafe FP optimization is not set, don't allow -(A-B) => B-A because when A==B, -0.0 != +0.0. llvm-svn: 63474	2009-01-31 06:07:45 +00:00
Bill Wendling	3b585af0ec	Don't use DebugLoc::getUnknownLoc(). Default to something hopefully sensible. llvm-svn: 63473	2009-01-31 03:12:48 +00:00
Dale Johannesen	db7c5f6a7b	Move CurDebugLoc into SelectionDAGLowering. llvm-svn: 63468	2009-01-31 02:22:37 +00:00
Dale Johannesen	dc0f124429	Propagate debug info in LegalizeFloatTypes. Complete (modulo bugs). llvm-svn: 63458	2009-01-31 00:43:08 +00:00
Dale Johannesen	42aa385e20	Propagate debug info. This file complete (modulo bugs) llvm-svn: 63457	2009-01-31 00:20:43 +00:00
Dale Johannesen	c910889511	Propagate debug info through MakeLibCall and a couple of things that use it. llvm-svn: 63456	2009-01-31 00:11:23 +00:00
Bill Wendling	31b50991cb	More DebugLoc propagation. llvm-svn: 63454	2009-01-30 23:59:18 +00:00
Bill Wendling	27d9dd4b57	More DebugLoc propagation. llvm-svn: 63452	2009-01-30 23:36:47 +00:00
Bill Wendling	306bfc2213	More DebugLoc propagation in LOAD etc. methods. llvm-svn: 63451	2009-01-30 23:27:35 +00:00
Bill Wendling	0bd29743e3	More DebugLoc propagation in floating-point methods. llvm-svn: 63446	2009-01-30 23:15:49 +00:00
Dale Johannesen	555a375bb6	Make LowerCallTo and LowerArguments take a DebugLoc argument. Adjust all callers and overloaded versions. llvm-svn: 63444	2009-01-30 23:10:59 +00:00
Bill Wendling	6fbf5495f8	Standardize comments about folding xforms. llvm-svn: 63443	2009-01-30 23:10:18 +00:00
Bill Wendling	8fb81f1b3d	Get rid of the non-DebugLoc-ified getNOT() method. llvm-svn: 63442	2009-01-30 23:03:19 +00:00
Bill Wendling	3dc5d2454e	Propagate debug loc info for some FP arithmetic methods. llvm-svn: 63441	2009-01-30 22:57:07 +00:00
Bill Wendling	cb9be5d174	Propagate debug loc info for some FP arithmetic methods. llvm-svn: 63440	2009-01-30 22:53:48 +00:00
Bill Wendling	4e0a61514b	Propagate debug loc info for BIT_CONVERT. llvm-svn: 63439	2009-01-30 22:44:24 +00:00
Bill Wendling	7bfa43b022	Propagate debug loc info for more *_EXTEND methods. llvm-svn: 63437	2009-01-30 22:33:24 +00:00
Bill Wendling	9b3dc8d848	Propagate debug loc info for ANY_EXTEND. llvm-svn: 63436	2009-01-30 22:27:33 +00:00
Bill Wendling	c409318562	Propagate debug loc info for some of the *_EXTEND functions. llvm-svn: 63434	2009-01-30 22:23:15 +00:00
Bill Wendling	cab9a2eef5	DebugLoc form of getNOT(). llvm-svn: 63433	2009-01-30 22:11:22 +00:00
Bill Wendling	b6b6f46fe4	- Propagate debug loc info for SELECT. - Added xform for (select X, 1, Y) and (select X, Y, 0), which was commented on, but missing. llvm-svn: 63428	2009-01-30 22:02:18 +00:00
Bill Wendling	d51e3ff540	Propagate debug loc info for Shifts. llvm-svn: 63424	2009-01-30 21:37:17 +00:00
Bill Wendling	35972a9460	Propagate debug loc info for XOR and MatchRotate. llvm-svn: 63420	2009-01-30 21:14:50 +00:00
Bill Wendling	f29b6e1318	Propagate debug loc info for OR. Also clean up some comments. llvm-svn: 63419	2009-01-30 20:59:34 +00:00
Bill Wendling	ff8acd684f	Perform obvious constant arithmetic folding. llvm-svn: 63417	2009-01-30 20:50:00 +00:00
Bill Wendling	8617191302	Propagate debug loc info for AND. Also clean up some comments. llvm-svn: 63416	2009-01-30 20:43:18 +00:00
Bill Wendling	781db7a1ad	Propagate debug loc info in SimplifyBinOpWithSameOpcodeHands. llvm-svn: 63411	2009-01-30 19:25:47 +00:00
Bill Wendling	9b3407e5bb	Propagate debug loc info in SimplifyNodeWithTwoResults. llvm-svn: 63376	2009-01-30 03:08:40 +00:00
Bill Wendling	faed065e5c	Propagate debug loc info for MULHS. llvm-svn: 63375	2009-01-30 03:00:18 +00:00
Bill Wendling	d033af09fd	Propagate debug loc info for SREM and UREM. llvm-svn: 63374	2009-01-30 02:57:00 +00:00
Bill Wendling	aff3e03765	Propagate debug loc info for UDIV. llvm-svn: 63373	2009-01-30 02:55:25 +00:00
Bill Wendling	5b663e7b53	Propagate debug loc info for SDIV. llvm-svn: 63372	2009-01-30 02:52:17 +00:00
Bill Wendling	b48dcf67e5	Forgot to propagate debug loc info here. llvm-svn: 63371	2009-01-30 02:49:26 +00:00
Bill Wendling	091f92f568	Propagate debug loc info for MUL. llvm-svn: 63369	2009-01-30 02:45:56 +00:00
Bill Wendling	48ff08ef3e	Propagate debug loc info in SUB. llvm-svn: 63368	2009-01-30 02:42:10 +00:00
Bill Wendling	6127757920	Propagate debug loc info in ADDC and ADDE. llvm-svn: 63367	2009-01-30 02:38:00 +00:00
Bill Wendling	c442348dd7	Propagate debug loc info in DAG combine's "ADD". llvm-svn: 63366	2009-01-30 02:31:17 +00:00
Bill Wendling	cdd96133bd	- Propagate debug loc info in combineSelectAndUse(). - Modify ReassociateOps so that the resulting SDValue is what the comment claims it is. llvm-svn: 63365	2009-01-30 02:23:43 +00:00
Dale Johannesen	ed255b3d8e	Propagate debug info when building SelectionDAG. llvm-svn: 63359	2009-01-30 01:34:22 +00:00
Bill Wendling	9c9a3b6665	Propagate debug location info for the token factor. llvm-svn: 63355	2009-01-30 01:13:16 +00:00
Bill Wendling	f6d0aff0bd	Add DebugLoc propagation to some of the methods in DAG combiner. llvm-svn: 63350	2009-01-30 00:45:56 +00:00
Dan Gohman	14d55f0a5c	Explicitly add PseudoSourceValue information when lowering BUILD_VECTOR and conversions to stack operations. llvm-svn: 63333	2009-01-29 21:02:43 +00:00
Dan Gohman	60d6844aa8	Make a few things const, fix some comments, and simplify some assertions. llvm-svn: 63328	2009-01-29 19:49:27 +00:00
Dan Gohman	8b437ccbbe	Fix two typos that Duncan spotted in a comment. llvm-svn: 63312	2009-01-29 16:18:12 +00:00
Dan Gohman	ef04ed5477	In the case of an extractelement on an insertelement value, the element indices may be equal if either one is not a constant. llvm-svn: 63311	2009-01-29 16:10:46 +00:00
Bill Wendling	a434d930ff	Revert r63273. This was already implemented by Dale. There's no need for my change. llvm-svn: 63301	2009-01-29 09:01:55 +00:00
Bill Wendling	50338007b9	- Add DebugLoc to getTargetNode(). - Modify TableGen to add the DebugLoc when calling getTargetNode. (The light-weight wrappers are only temporary. The non-DebugLoc version will be removed once the whole debug info stuff is finished with.) llvm-svn: 63273	2009-01-29 05:27:31 +00:00
Dan Gohman	e58ab79f33	Make x86's BT instruction matching more thorough, and add some dagcombines that help it match in several more cases. Add several more cases to test/CodeGen/X86/bt.ll. This doesn't yet include matching for BT with an immediate operand, it just covers more register+register cases. llvm-svn: 63266	2009-01-29 01:59:02 +00:00
Dale Johannesen	839acbb089	Add DebugLoc-sensitive versions of many node creation functions. Currently omitted: memcpy, memmove, memset. llvm-svn: 63259	2009-01-29 00:47:48 +00:00
Bill Wendling	1b6a3bce82	Add DebugLoc to the getNode() methods. llvm-svn: 63245	2009-01-28 22:17:52 +00:00
Dale Johannesen	666bf20441	Add DebugLoc-aware constructors for SDNode derived classes (those that reasonably have a DebugLoc associated with them). llvm-svn: 63236	2009-01-28 21:18:29 +00:00
Mon P Wang	a15ea78ea6	Fixed extract element when the result needs to be promoted and the input widened. llvm-svn: 63217	2009-01-28 18:53:39 +00:00
Dan Gohman	4aa1846215	Make isOperationLegal do what its name suggests, and introduce a new isOperationLegalOrCustom, which does what isOperationLegal previously did. Update a bunch of callers to use isOperationLegalOrCustom instead of isOperationLegal. In some case it wasn't obvious which behavior is desired; when in doubt I changed then to isOperationLegalOrCustom as that preserves their previous behavior. This is for the second half of PR3376. llvm-svn: 63212	2009-01-28 17:46:25 +00:00
Duncan Sands	ba21b7d57a	Formatting. llvm-svn: 63199	2009-01-28 14:42:54 +00:00
Duncan Sands	5a913d61e3	Rename getAnalysisToUpdate to getAnalysisIfAvailable. llvm-svn: 63198	2009-01-28 13:14:17 +00:00
Dan Gohman	b3bbde3e62	Use ValueType::bitsLT to simplify some code. llvm-svn: 63170	2009-01-28 03:10:52 +00:00
Dan Gohman	172ad92b29	Use ZERO_EXTEND instead of ANY_EXTEND when promoting shift amounts, to avoid implicitly assuming that target architectures will ignore the high bits. llvm-svn: 63169	2009-01-28 02:58:31 +00:00
Dan Gohman	fb58faf29e	Add an assertion to the form of SelectionDAG::getConstant that takes a uint64_t to verify that the value is in range for the given type, to help catch accidental overflow. Fix a few places that relied on getConstant implicitly truncating the value. llvm-svn: 63128	2009-01-27 20:39:34 +00:00
Dan Gohman	0bd9546039	Delete redundant return statements. llvm-svn: 63120	2009-01-27 19:23:22 +00:00
Duncan Sands	d77e476921	Fix PR3393, which amounts to a bug in the expensive checking logic. Rather than make the checking more complicated, I've tweaked some logic to make things conform to how the checking thought things ought to be, since this results in a simpler "mental model". llvm-svn: 63048	2009-01-26 21:54:18 +00:00
Anton Korobeynikov	4b4622454c	During bittest switch lowering emit shift in the test block, which should (theoretically) allow us to generate more efficient code. We don't do this now though :) llvm-svn: 63027	2009-01-26 19:26:01 +00:00
Dan Gohman	8e4ac9b71a	Take the next steps in making SDUse more consistent with LLVM Use, and tidy up SDUse and related code. - Replace the operator= member functions with a set method, like LLVM Use has, and variants setInitial and setNode, which take care up updating use lists, like LLVM Use's does. This simplifies code that calls these functions. - getSDValue() is renamed to get(), as in LLVM Use, though most places can either use the implicit conversion to SDValue or the convenience functions instead. - Fix some more node vs. value terminology issues. Also, eliminate the one remaining use of SDOperandPtr, and SDOperandPtr itself. llvm-svn: 62995	2009-01-26 04:35:06 +00:00
Dan Gohman	f1d38be265	Eliminate the loop that searches through each of the operands of each use in the SelectionDAG ReplaceAllUses* functions. Thanks to Chris for spotting this opportunity. Also, factor out code from all 5 of the ReplaceAllUses* functions into AddNonLeafNodeToCSEMaps, which is now renamed AddModifiedNodeToCSEMaps to more accurately reflect its purpose. llvm-svn: 62964	2009-01-25 16:29:12 +00:00
Dan Gohman	3a113ec468	Whitespace tidiments. llvm-svn: 62963	2009-01-25 16:21:38 +00:00
Dan Gohman	e7b0dde2ee	Move the N->use_empty() assert from DeleteNode to DeleteNodeNotInCSEMaps, since DeleteNode just calls DeleteNodeNotInCSEMaps. llvm-svn: 62962	2009-01-25 16:20:37 +00:00
Nate Begeman	b09b0242ca	Fix an indent and a typo. llvm-svn: 62940	2009-01-24 22:12:48 +00:00
Dan Gohman	1275e28ded	Fold x-0 to x in unsafe-fp-math mode. This comes up in the testcase from PR3376, and in fact is sufficient to completely avoid the problem in that testcase. There's an underlying problem though; TLI.isOperationLegal considers Custom to be Legal, which might be ok in some cases, but that's what DAGCombiner is using in many places to test if something is legal when LegalOperations is true. When DAGCombiner is running after legalize, this isn't sufficient. I'll address this in a separate commit. llvm-svn: 62860	2009-01-23 19:10:37 +00:00
Bob Wilson	c2dc7ee5d0	Fix a minor bug in DAGCombiner's folding of SELECT. Folding "select C, 0, 1" to "C ^ 1" is only valid when C is known to be either 0 or 1. Most of the similar foldings in this function only handle "i1" types, but this one appears intentionally written to handle larger integer types. If C has an integer type larger than "i1", this needs to check if the high bits of a boolean are known to be zero. I also changed the comment to describe this folding as "C ^ 1" instead of "~C", since that is what the code does and since the latter would only be valid for "i1" types. The good news is that most LLVM targets use TargetLowering::ZeroOrOneBooleanContent so this change will not disable the optimization; the bad news is that I've been unable to come up with a testcase to demonstrate the problem. I have also removed a "FIXME" comment for folding "select C, X, 0" to "C & X", since the code looks correct to me. It could be made more aggressive by not limiting the type to "i1", but that would then require checking for TargetLowering::ZeroOrNegativeOneBooleanContent. Similar changes could be done for the other SELECT foldings, but it was decided to be not worth the trouble and complexity (see e.g., r44663). llvm-svn: 62790	2009-01-22 22:05:48 +00:00
Dan Gohman	1f3411de47	Don't create ISD::FNEG nodes after legalize if they aren't legal. Simplify x+0 to x in unsafe-fp-math mode. This avoids a bunch of redundant work in many cases, because in unsafe-fp-math mode, ISD::FADD with a constant is considered free to negate, so the DAGCombiner often negates x+0 to -0-x thinking it's free, when in reality the end result is -x, which is more expensive than x. Also, combine x*0 to 0. This fixes PR3374. llvm-svn: 62789	2009-01-22 21:58:43 +00:00
Bob Wilson	c58900504b	Add SelectionDAG::getNOT method to construct bitwise NOT operations, corresponding to the "not" and "vnot" PatFrags. Use the new method in some places where it seems appropriate. llvm-svn: 62768	2009-01-22 17:39:32 +00:00
Evan Cheng	4a0bf66eb8	Eliminate a couple of fields from TargetRegisterClass: SubRegClasses and SuperRegClasses. These are not necessary. Also eliminate getSubRegisterRegClass and getSuperRegisterRegClass. These are slow and their results can change if register file names change. Just use TargetLowering::getRegClassFor() to get the right TargetRegisterClass instead. llvm-svn: 62762	2009-01-22 09:10:11 +00:00
Chris Lattner	e09d631d8e	fix a typo llvm-svn: 62761	2009-01-22 07:21:55 +00:00
Dan Gohman	7e6b932f18	Simplify ReduceLoadWidth's logic: it doesn't need several different special cases after producing the new reduced-width load, because the new load already has the needed adjustments built into it. This fixes several bugs due to the special cases, including PR3317. llvm-svn: 62692	2009-01-21 15:17:51 +00:00
Duncan Sands	be7e41481b	Cleanup whitespace and comments, and tweak some prototypes, in operand type legalization. No functionality change. llvm-svn: 62680	2009-01-21 09:00:29 +00:00
Scott Michel	ed7d79fce4	CellSPU: - Ensure that (operation) legalization emits proper FDIV libcall when needed. - Fix various bugs encountered during llvm-spu-gcc build, along with various cleanups. - Start supporting double precision comparisons for remaining libgcc2 build. Discovered interesting DAGCombiner feature, which is currently solved via custom lowering (64-bit constants are not legal on CellSPU, but DAGCombiner insists on inserting one anyway.) - Update README. llvm-svn: 62664	2009-01-21 04:58:48 +00:00
Sanjiv Gupta	a70798cc9a	Allow targets to legalize operations (with illegal operands) that produces multiple values. For example, a load with an illegal operand (a load produces two values, a value and chain). llvm-svn: 62663	2009-01-21 04:48:39 +00:00
Bill Wendling	2395916c87	Use "SINT_TO_FP" instead of "UINT_TO_FP" when getting the exponent. This was causing the limited precision stuff to produce the wrong result for values in the range [0, 1). llvm-svn: 62615	2009-01-20 21:17:57 +00:00
Evan Cheng	c544cb0eca	Change TargetInstrInfo::isMoveInstr to return source and destination sub-register indices as well. llvm-svn: 62600	2009-01-20 19:12:24 +00:00
Bill Wendling	786a683441	Shift types need to match. llvm-svn: 62571	2009-01-20 06:10:42 +00:00
Dan Gohman	161b7b66ac	Fix a dagcombine to not generate loads of non-round integer types, as its comment says, even in the case where it will be generating extending loads. This fixes PR3216. llvm-svn: 62557	2009-01-20 01:06:45 +00:00
Devang Patel	44afc82ebe	Verify debug info. llvm-svn: 62545	2009-01-19 23:21:49 +00:00
Dan Gohman	534c8a2d72	Remove SDNode's virtual destructor. This makes it impossible for SDNode subclasses to keep state that requires non-trivial destructors, however it was already effectively impossible, since the destructor isn't actually ever called. There currently aren't any SDNode subclasses affected by this, and in general it's desireable to keep SDNode objects light-weight. This eliminates the last virtual member function in the SDNode class, so it eliminates the need for a vtable pointer, making SDNode smaller. llvm-svn: 62539	2009-01-19 22:39:36 +00:00
Dan Gohman	cd0b1bf0a0	Fix SelectionDAG::ReplaceAllUsesWith to behave correctly when uses are added to the From node while it is processing From's use list, because of automatic local CSE. The fix is to avoid visiting any new uses. Fix a few places in the DAGCombiner that assumed that after a RAUW call, the From node has no users and may be deleted. This fixes PR3018. llvm-svn: 62533	2009-01-19 21:44:21 +00:00
Sanjiv Gupta	1d2fc787a9	Few targets like PIC16 wants libcall generation for illegal type i16. llvm-svn: 62467	2009-01-18 18:25:27 +00:00
Mon P Wang	e9e7abb6b8	Simplify extract element based on comments from Duncan Sands. llvm-svn: 62459	2009-01-18 06:43:40 +00:00
Mon P Wang	ca6d6dea0b	Simplify extract element of a scalar to vector. llvm-svn: 62383	2009-01-17 00:07:25 +00:00
Dan Gohman	5f8a2598b2	Instead of adding dependence edges between terminator instructions and every other instruction in their blocks to keep the terminator instructions at the end, teach the post-RA scheduler how to operate on ranges of instructions, and exclude terminators from the range of instructions that get scheduled. Also, exclude mid-block labels, such as EH_LABEL instructions, and schedule code before them separately from code after them. This fixes problems with the post-RA scheduler moving code past EH_LABELs. llvm-svn: 62366	2009-01-16 22:10:20 +00:00
Dan Gohman	38978ba972	Use the getNode() accessor instead of accessing the Node member directly, which is private as of r55504. llvm-svn: 62364	2009-01-16 21:47:21 +00:00
Chris Lattner	41828cdb0a	new nodes should be added to the worklist, not old nodes. llvm-svn: 62359	2009-01-16 21:15:56 +00:00
Evan Cheng	968e2e7b3d	CreateVirtualRegisters does trivial copy coalescing. If a node def is used by a single CopyToReg, it reuses the virtual register assigned to the CopyToReg. This won't work for SDNode that is a clone or is itself cloned. Disable this optimization for those nodes or it can end up with non-SSA machine instructions. llvm-svn: 62356	2009-01-16 20:57:18 +00:00
Mikhail Glushenkov	6e8d814d36	Registry.h should not depend on CommandLine.h. Split Support/Registry.h into two files so that we have less to recompile every time CommandLine.h is changed. llvm-svn: 62312	2009-01-16 07:02:28 +00:00
Mikhail Glushenkov	b2f9a73029	Delete trailing whitespace. llvm-svn: 62307	2009-01-16 06:53:46 +00:00
Dan Gohman	ceac7c34f1	Initial hazard recognizer support in post-pass scheduling. This includes a new toy hazard recognizier heuristic which attempts to direct the scheduler to avoid clumping large groups of loads or stores too densely. llvm-svn: 62291	2009-01-16 01:33:36 +00:00
Devang Patel	76d190cf4a	Validate dbg_* intrinsics before lowering them. llvm-svn: 62286	2009-01-15 23:41:32 +00:00
Mon P Wang	e248edff1b	Added missing support to widen an operand from a bit convert. llvm-svn: 62285	2009-01-15 22:43:38 +00:00
Dan Gohman	7e105f0b12	Generalize the HazardRecognizer interface so that it can be used to support MachineInstr-based scheduling in addition to SDNode-based scheduling. llvm-svn: 62284	2009-01-15 22:18:12 +00:00
Rafael Espindola	6de96a1b5d	Add the private linkage. llvm-svn: 62279	2009-01-15 20:18:42 +00:00
Dan Gohman	619ef48a52	Move a few containers out of ScheduleDAGInstrs::BuildSchedGraph and into the ScheduleDAGInstrs class, so that they don't get destructed and re-constructed for each block. This fixes a compile-time hot spot in the post-pass scheduler. To help facilitate this, tidy and do some minor reorganization in the scheduler constructor functions. llvm-svn: 62275	2009-01-15 19:20:50 +00:00
Dan Gohman	307954ac69	Make getWidenVectorType const; this file was missed in the previous commit. llvm-svn: 62266	2009-01-15 17:39:39 +00:00
Dan Gohman	91febd1330	More consts on TargetLowering references. llvm-svn: 62262	2009-01-15 16:58:17 +00:00
Dan Gohman	4bdf021e05	Use const with TargetLowering references in a few more places. llvm-svn: 62260	2009-01-15 16:43:02 +00:00
Gabor Greif	08a4c281cb	minor refactoring: use a more specific API llvm-svn: 62256	2009-01-15 11:10:44 +00:00
Devang Patel	3c82aa0209	Removoe MachineModuleInfo methods (and related DebugInfoDesc class hierarchy) that were used to handle debug info. llvm-svn: 62199	2009-01-13 23:54:55 +00:00
Devang Patel	fe9581f0cd	Undo previous checkin. llvm-svn: 62190	2009-01-13 22:54:57 +00:00
Devang Patel	ca997988c3	Use dwarf writer to decide whether the module has debug info or not. llvm-svn: 62184	2009-01-13 21:25:00 +00:00
Dan Gohman	1407484178	The list-td and list-tdrr schedulers don't yet support physreg scheduling dependencies. Add assertion checks to help catch this. It appears the Mips target defaults to list-td, and it has a regression test that uses a physreg dependence. Such code was liable to be miscompiled, and now evokes an assertion failure. llvm-svn: 62177	2009-01-13 20:24:13 +00:00
Duncan Sands	ffc6133318	When replacing uses and the same node is reached via two paths, process it once not twice, d'oh! Analysis, testcase and original patch thanks to Mon Ping Wang. llvm-svn: 62169	2009-01-13 15:17:14 +00:00
Duncan Sands	90d2a7bd72	Fix some typos. Also, the WidenedVectors map was not being cleaned by ExpungeNode. llvm-svn: 62167	2009-01-13 14:42:39 +00:00
Duncan Sands	013be76241	Correct a comment - this is not a sign extension. llvm-svn: 62166	2009-01-13 14:04:14 +00:00
Devang Patel	5c6e1e3b7d	Use DebugInfo interface to lower dbg_* intrinsics. llvm-svn: 62127	2009-01-13 00:35:13 +00:00
Duncan Sands	dc020f9c3c	Rename getABITypeSize to getTypePaddedSize, as suggested by Chris. llvm-svn: 62099	2009-01-12 20:38:59 +00:00
Evan Cheng	b2c42c648d	Fix PR3241: Currently EmitCopyFromReg emits a copy from the physical register to a virtual register unless it requires an expensive cross class copy. That means we are only treating "expensive to copy" register dependency as physical register dependency. Also future proof the scheduler to handle "normal" physical register dependencies. The code is not exercised yet. llvm-svn: 62074	2009-01-12 03:19:55 +00:00
Evan Cheng	e3108148e2	CheckForPhysRegDependency should not return copy cost. It's not used. No functionality change. llvm-svn: 62036	2009-01-11 08:53:35 +00:00
Evan Cheng	ed74d8ac2a	Duplicated node may produce a non-physical register def. llvm-svn: 62015	2009-01-09 22:44:02 +00:00
Evan Cheng	0c4fe2600a	Minor debug output tweak. llvm-svn: 62005	2009-01-09 20:42:34 +00:00
Devang Patel	235acaa131	Request DwarfWriter. This will be used to handle dbg_* intrinsics. llvm-svn: 61999	2009-01-09 19:11:50 +00:00
Misha Brukman	5cbf223916	Removed trailing whitespace from Makefiles. llvm-svn: 61991	2009-01-09 16:44:42 +00:00
Dan Gohman	261ee6be57	Remove redundant 'else's. No functionality change. llvm-svn: 61891	2009-01-07 22:30:55 +00:00
Dan Gohman	c7847cdb8d	Fix a bug in ComputeLinearIndex computation handling multi-level aggregate types. Don't increment the current index after reaching the end of a struct, as it will already be pointing at one-past-the end. This fixes PR3288. llvm-svn: 61828	2009-01-06 22:53:52 +00:00
Dan Gohman	bf8e5204d1	Update these argument lists for the isNormalMemory argument. This doesn't affect current functionality. llvm-svn: 61779	2009-01-06 01:28:56 +00:00
Dan Gohman	79c3516912	Use a latency value of 0 for the artificial edges inserted by AddPseudoTwoAddrDeps. This lets the scheduling infrastructure avoid recalculating node heights. In very large testcases this was a major bottleneck. Thanks to Roman Levenstein for finding this! As a side effect, fold-pcmpeqd-0.ll is now scheduled better and it no longer requires spilling on x86-32. llvm-svn: 61778	2009-01-06 01:19:04 +00:00
Dan Gohman	dbc6c31f62	TargetLowering.h #includes SelectionDAGNodes.h, so it doesn't need its own OpActionsCapacity magic number; it can just use ISD::BUILTIN_OP_END, as long as it takes care to round up when needed. llvm-svn: 61733	2009-01-05 19:40:39 +00:00
Dan Gohman	906152a20f	Tidy up #includes, deleting a bunch of unnecessary #includes. llvm-svn: 61715	2009-01-05 17:59:02 +00:00
Devang Patel	56a8bb670f	squash warnings. llvm-svn: 61707	2009-01-05 17:31:22 +00:00
Dan Gohman	b9fa1d24f8	Fix a DAGCombiner abort on an invalid shift count constant. This fixes PR3250. llvm-svn: 61613	2009-01-03 19:22:06 +00:00
Dan Gohman	4d41fdf4ca	CommuteNodesToReducePressure() is now removed. llvm-svn: 61612	2009-01-03 19:19:30 +00:00
Dan Gohman	1be2e9650e	Remove the code from the scheduler that commuted two-address instructions to avoid copies, because TwoAddressInstructionPass also does this optimization. The scheduler's version didn't account for live-out values, which resulted in spurious commutes and missed opportunities. Now, TwoAddressInstructionPass handles all the opportunities, instead of just those that the scheduler missed. The result is usually the same, though there are occasional trivial differences resulting from the avoidance of spurious commutes. llvm-svn: 61611	2009-01-03 18:01:46 +00:00
Duncan Sands	953c9c2fbc	Factorize (and generalize) the code promoting SELECT and BRCOND conditions. Reorder a few methods while there. llvm-svn: 61547	2009-01-01 20:36:20 +00:00
Duncan Sands	19ee60848a	Remove trailing spaces. llvm-svn: 61545	2009-01-01 19:56:02 +00:00
Duncan Sands	8feb694e8f	Fix PR3274: when promoting the condition of a BRCOND node, promote from i1 all the way up to the canonical SetCC type. In order to discover an appropriate type to use, pass MVT::Other to getSetCCResultType. In order to be able to do this, change getSetCCResultType to take a type as an argument, not a value (this is also more logical). llvm-svn: 61542	2009-01-01 15:52:00 +00:00
Scott Michel	0c9259f149	Teach LeaglizeDAG that i64 mul can be a libcall. llvm-svn: 61463	2008-12-29 03:21:37 +00:00
Dale Johannesen	ee573fcefc	Change comments so everybody can understand them, hopefully. llvm-svn: 61405	2008-12-23 23:47:22 +00:00
Dale Johannesen	acc84e5aa0	Add another permutation where we should get rid of a-a. llvm-svn: 61401	2008-12-23 23:01:27 +00:00
Anton Korobeynikov	f4a66e8dda	Restore debug printing llvm-svn: 61398	2008-12-23 22:26:18 +00:00
Anton Korobeynikov	d305d00796	Sometimes APInt syntax is really ugly... :( llvm-svn: 61397	2008-12-23 22:26:01 +00:00
Anton Korobeynikov	05149bad18	Indent stuff properly llvm-svn: 61396	2008-12-23 22:25:45 +00:00
Anton Korobeynikov	6f219132a7	Initial checkin of APInt'ififcation of switch lowering llvm-svn: 61395	2008-12-23 22:25:27 +00:00
Dan Gohman	12f2490489	Clean up the atomic opcodes in SelectionDAG. This removes all the _8, _16, _32, and _64 opcodes and replaces each group with an unsuffixed opcode. The MemoryVT field of the AtomicSDNode is now used to carry the size information. In tablegen, the size-specific opcodes are replaced by size-independent opcodes that utilize the ability to compose them with predicates. This shrinks the per-opcode tables and makes the code that handles atomics much more concise. llvm-svn: 61389	2008-12-23 21:37:04 +00:00
Dan Gohman	04543e719e	Rename BuildSchedUnits to BuildSchedGraph, and refactor the code in ScheduleDAGSDNodes' BuildSchedGraph into separate functions. llvm-svn: 61376	2008-12-23 18:36:58 +00:00
Dan Gohman	92cf280dfb	Avoid an unnecessary call to allnodes_size(), which is linear. llvm-svn: 61372	2008-12-23 17:24:50 +00:00
Dale Johannesen	d2a4685860	One more permutation of subtracting off a base value. llvm-svn: 61361	2008-12-23 01:59:54 +00:00
Mon P Wang	a501640ffa	Added support for vector widening. llvm-svn: 61209	2008-12-18 20:03:17 +00:00
Mon P Wang	015a7f57b2	Fix expansion of vsetcc to set the high bit for true instead of 1. llvm-svn: 61129	2008-12-17 08:49:47 +00:00
Dan Gohman	ce70fe2e25	Double the amount of memory reserved for SUnits. This is a temporary workaround for an obscure bug. When node cloning is used, it is possible that more SUnits will be created, and if the SUnits std::vector has to reallocate, it will invalidate all the graph edges. llvm-svn: 61122	2008-12-17 04:30:46 +00:00
Eli Friedman	6cf404f2d1	Fix for PR3225: disable a broken optimization in DAGTypeLegalizer::ExpandShiftWithKnownAmountBit. In terms of restoring the optimization, the best fix here isn't obvious... any ideas? llvm-svn: 61119	2008-12-17 03:35:17 +00:00
Dale Johannesen	f51dcef803	A new dag combine; several permutations of this are there under ADD, this one was missing. llvm-svn: 61107	2008-12-16 22:13:49 +00:00
Dan Gohman	4476ef810b	Preserve SourceValue information when lowering produces multiple loads from different offsets within the same stack slot. llvm-svn: 61093	2008-12-16 18:25:36 +00:00
Evan Cheng	c35fc49477	We have decided not to support inline asm where an output operand with a matching input operand with incompatible type (i.e. either one is a floating point and the other is an integer or the sizes of the types differ). SelectionDAGBuild will catch these and exit with an error. llvm-svn: 61092	2008-12-16 18:21:39 +00:00
Dan Gohman	405f2197a4	Remove some special-case logic in ScheduleDAGSDNodes's latency computation code that is no longer needed with the new method for handling latencies. llvm-svn: 61074	2008-12-16 03:31:11 +00:00
Dan Gohman	dddc1ac7ea	Fix some register-alias-related bugs in the post-RA scheduler liveness computation code. Also, avoid adding output-depenency edges when both defs are dead, which frequently happens with EFLAGS defs. Compute Depth and Height lazily, and always in terms of edge latency values. For the schedulers that don't care about latency, edge latencies are set to 1. Eliminate Cycle and CycleBound, and LatencyPriorityQueue's Latencies array. These are all subsumed by the Depth and Height fields. llvm-svn: 61073	2008-12-16 03:25:46 +00:00
Dan Gohman	17214e633d	Make addPred and removePred return void, since the return value is not currently used by anything. llvm-svn: 61066	2008-12-16 01:00:55 +00:00
Mon P Wang	580f2c7b61	Added support for splitting and scalarizing vector shifts. llvm-svn: 61050	2008-12-15 21:44:00 +00:00
Dan Gohman	a7e139a3e6	Fix printing of PseudoSourceValues in SDNode graphs. llvm-svn: 61036	2008-12-15 17:28:10 +00:00
Mon P Wang	ac4e120912	Added support to LegalizeType for expanding the operands of scalar to vector and insert vector element. Modified extract vector element to extend the result to match the expected promoted type. llvm-svn: 61029	2008-12-15 06:57:02 +00:00
Duncan Sands	f312dc7729	Reapply r60997, this time without forgetting that target constants are allowed to have an illegal type. llvm-svn: 61006	2008-12-14 09:43:15 +00:00
Bill Wendling	e5af6f1990	Temporarily revert r60997. It was causing this failure: Running /Users/void/llvm/llvm.src/test/CodeGen/Generic/dg.exp ... FAIL: /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll Failed with exit(1) at line 1 while running: llvm-as < /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll \| llc \| /usr/bin/grep 68719476738 Assertion failed: ((TypesNeedLegalizing \|\| getTypeAction(VT) == Legal) && "Illegal type introduced after type legalization?"), function HandleOp, file /Users/void/llvm/llvm.src/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp, line 493. 0 llc 0x0085392e char const* std::find<char const, char>(char const, char const, char const&) + 98 1 llc 0x00853e63 llvm::sys::PrintStackTraceOnErrorSignal() + 593 2 libSystem.B.dylib 0x96cac09b _sigtramp + 43 3 libSystem.B.dylib 0xffffffff _sigtramp + 1765097359 4 libSystem.B.dylib 0x96d24ec2 raise + 26 5 libSystem.B.dylib 0x96d3447f abort + 73 6 libSystem.B.dylib 0x96d26063 __assert_rtn + 101 7 llc 0x004f9018 llvm::cast_retty<llvm::SubprogramDesc, llvm::DebugInfoDesc>::ret_type llvm::cast<llvm::Sub ... llvm-svn: 61001	2008-12-13 23:53:00 +00:00
Duncan Sands	24092271cc	LegalizeDAG is not supposed to introduce illegal types into the DAG if they were not already there. Check this with an assertion. llvm-svn: 60997	2008-12-13 22:33:38 +00:00
Mon P Wang	472cd640fa	Remove assertion to allow promotion of a truncating store operand llvm-svn: 60975	2008-12-13 08:16:43 +00:00
Mon P Wang	f95bd2078d	Added basic support for expanding VSETCC llvm-svn: 60974	2008-12-13 08:15:14 +00:00
Duncan Sands	b6f09933c0	On big-endian machines it is wrong to do a full width register load followed by a truncating store for the copy, since the load will not place the value in the lower bits. Probably partial loads/stores can never happen here, but fix it anyway. llvm-svn: 60972	2008-12-13 07:18:38 +00:00
Duncan Sands	8f352fe100	When expanding unaligned loads and stores do not make use of illegal integer types: instead, use a stack slot and copying via integer registers. The existing code is still used if the bitconvert is to a legal integer type. This fires on the PPC testcases 2007-09-08-unaligned.ll and vec_misaligned.ll. It looks like equivalent code is generated with these changes, just permuted, but it's hard to tell. With these changes, nothing in LegalizeDAG produces illegal integer types anymore. This is a prerequisite for removing the LegalizeDAG type legalization code. While there I noticed that the existing code doesn't handle trunc store of f64 to f32: it turns this into an i64 store, which represents a 4 byte stack smash. I added a FIXME about this. Hopefully someone more motivated than I am will take care of it. llvm-svn: 60964	2008-12-12 21:47:02 +00:00
Evan Cheng	3270a1dec3	Fix add/sub expansion: don't create ADD / SUB with two results (seems like everyone is doing this these days :-). Patch by Daniel M Gessel! llvm-svn: 60958	2008-12-12 18:49:09 +00:00
Duncan Sands	e4bcb8e2dd	When using a 4 byte jump table on a 64 bit machine, do an extending load of the 4 bytes rather than a potentially illegal (type) i32 load followed by a sign extend. llvm-svn: 60945	2008-12-12 08:13:38 +00:00
Mon P Wang	9c2d26d208	Added support for SELECT v8i8 v4i16 for X86 (MMX) Added support for TRUNC v8i16 to v8i8 for X86 (MMX) llvm-svn: 60916	2008-12-12 01:25:51 +00:00
Bill Wendling	1a317678bc	Redo the arithmetic with overflow architecture. I was changing the semantics of ISD::ADD to emit an implicit EFLAGS. This was horribly broken. Instead, replace the intrinsic with an ISD::SADDO node. Then custom lower that into an X86ISD::ADD node with a associated SETCC that checks the correct condition code (overflow or carry). Then that gets lowered into the correct X86::ADDOvf instruction. Similar for SUB and MUL instructions. llvm-svn: 60915	2008-12-12 00:56:36 +00:00
Mon P Wang	bcdbfa854a	Avoid generating a convert_rndsat node when the src and dest type are the same. llvm-svn: 60869	2008-12-11 03:30:13 +00:00
Bill Wendling	40d2476adc	Clarify FIXME. llvm-svn: 60867	2008-12-11 01:26:44 +00:00
Mon P Wang	c68b3c4fc1	Whitespace clean up (tabs with spaces) llvm-svn: 60866	2008-12-11 00:44:22 +00:00
Mon P Wang	b5eb7205ea	Make fix for r60829 less conservative to allow the proper optimization for vec_extract-sse4.ll. llvm-svn: 60865	2008-12-11 00:26:16 +00:00
Bill Wendling	0864a75ebf	If ADD, SUB, or MUL have an overflow bit that's used, don't do transformation on them. The DAG combiner expects that nodes that are transformed have one value result. llvm-svn: 60857	2008-12-10 22:36:00 +00:00
Duncan Sands	09ed3bba2b	For amusement, implement SADDO, SSUBO, UADDO, USUBO for promoted integer types, eg: i16 on ppc-32, or i24 on any platform. Complete support for arbitrary precision integers would require handling expanded integer types, eg: i128, but I couldn't be bothered. llvm-svn: 60834	2008-12-10 12:30:42 +00:00
Mon P Wang	4637c3c698	Fixed a bug when trying to optimize a extract vector element of a bit convert that changes the number of elements of a shuffle. llvm-svn: 60829	2008-12-10 03:59:02 +00:00
Bill Wendling	f482f379ef	Whitespace changes. llvm-svn: 60826	2008-12-10 02:01:32 +00:00
Bill Wendling	4eb2dcdc45	Whitespace fixes. llvm-svn: 60818	2008-12-10 00:28:22 +00:00
Dan Gohman	2d170896ee	Rewrite the SDep class, and simplify some of the related code. The Cost field is removed. It was only being used in a very limited way, to indicate when the scheduler should attempt to protect a live register, and it isn't really needed to do that. If we ever want the scheduler to start inserting copies in non-prohibitive situations, we'll have to rethink some things anyway. A Latency field is added. Instead of giving each node a single fixed latency, each edge can have its own latency. This will eventually be used to model various micro-architecture properties more accurately. The PointerIntPair class and an internal union are now used, which reduce the overall size. llvm-svn: 60806	2008-12-09 22:54:47 +00:00
Bill Wendling	db8ec2d75a	Add sub/mul overflow intrinsics. This currently doesn't have a target-independent way of determining overflow on multiplication. It's very tricky. Patch by Zoltan Varga! llvm-svn: 60800	2008-12-09 22:08:41 +00:00
Duncan Sands	445071c44f	Fix PR3117: not all nodes being legalized. The essential problem was that the DAG can contain random unused nodes which were never analyzed. When remapping a value of a node being processed, such a node may become used and need to be analyzed; however due to operands being transformed during analysis the node may morph into a different one. Users of the morphing node need to be updated, and this wasn't happening. While there I added a bunch of documentation and sanity checks, so I (or some other poor soul) won't have to scratch their head over this stuff so long trying to remember how it was all supposed to work next time some obscure problem pops up! The extra sanity checking exposed a few places where invariants weren't being preserved, so those are fixed too. Since some of the sanity checking is expensive, I added a flag to turn it on. It is also turned on when building with ENABLE_EXPENSIVE_CHECKS=1. llvm-svn: 60797	2008-12-09 21:33:20 +00:00
Mon P Wang	8a5366332f	In LegalizeOp, don't change the result type of CONVERT_RNDSAT when promoting one of its operand. llvm-svn: 60749	2008-12-09 07:27:39 +00:00
Mon P Wang	4dd832d241	Fix getNode to allow a vector for the shift amount for shifts of vectors. Fix the shift amount when unrolling a vector shift into scalar shifts. Fix problem in getShuffleScalarElt where it assumes that the input of a bit convert must be a vector. llvm-svn: 60740	2008-12-09 05:46:39 +00:00
Dan Gohman	4c31524bec	Factor out the code for sign-extending/truncating gep indices and use it in x86 address mode folding. Also, make getRegForValue return 0 for illegal types even if it has a ValueMap for them, because Argument values are put in the ValueMap. This fixes PR3181. llvm-svn: 60696	2008-12-08 07:57:47 +00:00
Duncan Sands	471a654711	When allocating a stack temporary, use the correct number of bytes for types such as i1 which are not a multiple of 8 bits in length. llvm-svn: 60543	2008-12-04 18:08:40 +00:00
Dan Gohman	30cad9c192	Make debug output more informative. llvm-svn: 60524	2008-12-04 02:14:57 +00:00
Duncan Sands	f52e518d05	Only check that the result of the mapping was not a new node if the node was actually remapped. llvm-svn: 60482	2008-12-03 12:36:16 +00:00
Evan Cheng	e62150cae4	Remove a (what appears to be) overly strict assertion. Here is what happened: 1. ppcf128 select is expanded to f64 select's. 2. f64 select operand 0 is an i1 truncate, it's promoted to i32 zero_extend. 3. f64 select is updated. It's changed back to a "NewNode" and being re-analyzed. 4. f64 select operands are being processed. Operand 0 is a "NewNode". It's being expunged out of ReplacedValues map. 5. ExpungeNode tries to remap f64 select and notice it's a "NewNode" and assert. Duncan, please take a look. Thanks. llvm-svn: 60443	2008-12-02 21:57:09 +00:00
Scott Michel	9b0b28e021	Non-functional change: make custom lowering for truncate stylistically consistent with the way it's generally done in other places. llvm-svn: 60439	2008-12-02 19:55:08 +00:00
Dale Johannesen	54bdec238a	One more transformation. llvm-svn: 60432	2008-12-02 18:40:40 +00:00
Tilmann Scheller	318ccb0e62	make it possible to custom lower TRUNCATE (needed for the CellSPU target) llvm-svn: 60409	2008-12-02 12:12:25 +00:00
Mon P Wang	6e1c6ad127	Removed some unnecessary code in widening. llvm-svn: 60406	2008-12-02 07:35:08 +00:00
Dale Johannesen	8c76670b5a	Add a few more transformations. llvm-svn: 60391	2008-12-02 01:30:54 +00:00
Bill Wendling	2d59863d06	Expand getVTList, getNodeValueTypes, and SelectNodeTo to handle more value types. llvm-svn: 60381	2008-12-01 23:28:22 +00:00
Duncan Sands	3d960941b1	There are no longer any places that require a MERGE_VALUES node with only one operand, so get rid of special code that only existed to handle that possibility. llvm-svn: 60349	2008-12-01 11:41:29 +00:00
Duncan Sands	6ed40141f7	Change the interface to the type legalization method ReplaceNodeResults: rather than returning a node which must have the same number of results as the original node (which means mucking around with MERGE_VALUES, and which is also easy to get wrong since SelectionDAG folding may mean you don't get the node you expect), return the results in a vector. llvm-svn: 60348	2008-12-01 11:39:25 +00:00
Eli Friedman	c8228d263b	Followup to r60283: optimize arbitrary width signed divisions as well as unsigned divisions. Same caveats as before. llvm-svn: 60284	2008-11-30 06:35:39 +00:00
Eli Friedman	1b7fc154a5	Fix for PR2164: allow transforming arbitrary-width unsigned divides into multiplies. Some more cleverness would be nice, though. It would be nice if we could do this transformation on illegal types. Also, we would prefer a narrower constant when possible so that we can use a narrower multiply, which can be cheaper. llvm-svn: 60283	2008-11-30 06:02:26 +00:00
Eli Friedman	bd0f57821a	APIntify a test which is potentially unsafe otherwise, and fix the nearby FIXME. I'm not sure what the right way to fix the Cell test was; if the approach I used isn't okay, please let me know. llvm-svn: 60277	2008-11-30 04:59:26 +00:00
Sanjiv Gupta	7ae1a84465	Removing redundant semicolons. No functionality change. llvm-svn: 60149	2008-11-27 05:58:04 +00:00
Dale Johannesen	73bc0ba4c9	Add a missing case in visitADD. llvm-svn: 60137	2008-11-27 00:43:21 +00:00
Sanjiv Gupta	80810f8c6b	Allow custom lowering of ADDE/ADDC/SUBE/SUBC operations. llvm-svn: 60102	2008-11-26 11:19:00 +00:00
Bill Wendling	b4ff5322c1	A simplification for checking whether the signs of the operands and sum differ. Thanks, Duncan. llvm-svn: 60043	2008-11-25 19:40:17 +00:00
Bill Wendling	bf592fccd4	Now with the correct type for the 0. llvm-svn: 60016	2008-11-25 08:19:22 +00:00
Bill Wendling	d06c625b95	Get rid of unused variable. llvm-svn: 60015	2008-11-25 08:13:20 +00:00
Bill Wendling	4498b47677	Hacker's Delight says, "Signed integer overflow of addition occurs if and only if the operands have the same sign and the sum has sign opposite to that of the operands." llvm-svn: 60014	2008-11-25 08:12:19 +00:00
Dan Gohman	ad2134d45d	Initial support for anti-dependence breaking. Currently this code does not introduce any new spilling; it just uses unused registers. Refactor the SUnit topological sort code out of the RRList scheduler and make use of it to help with the post-pass scheduler. llvm-svn: 59999	2008-11-25 00:52:40 +00:00
Bill Wendling	66835479d7	- Make lowering of "add with overflow" customizable by back-ends. - Mark "add with overflow" as having a custom lowering for X86. Give it a null lowering representation for now. llvm-svn: 59971	2008-11-24 19:21:46 +00:00
Dan Gohman	5cc12a8e31	Check in the rest of this change. The isAntiDep flag needs to be passed to removePred because an SUnit can both data-depend and anti-depend on the same SUnit. llvm-svn: 59969	2008-11-24 17:33:52 +00:00
Duncan Sands	dc2dac181a	If the type legalizer actually legalized anything (this doesn't happen that often, since most code does not use illegal types) then follow it by a DAG combiner run that is allowed to generate illegal operations but not illegal types. I didn't modify the target combiner code to distinguish like this between illegal operations and illegal types, so it will not produce illegal operations as well as not producing illegal types. llvm-svn: 59960	2008-11-24 14:53:14 +00:00
Evan Cheng	a8fd1f2c8e	Eliminate some unused variable compile time warnings. llvm-svn: 59952	2008-11-24 07:09:49 +00:00
Bill Wendling	2278f8f5e1	Add support for llvm.uadd.with.overflow. llvm-svn: 59926	2008-11-24 01:38:29 +00:00
Duncan Sands	8d6e2e13d5	Rename SetCCResultContents to BooleanContents. In practice these booleans are mostly produced by SetCC, however the concept is more general. llvm-svn: 59911	2008-11-23 15:47:28 +00:00
Mon P Wang	2967480f54	Added check to avoid generating extract subvector beyond the end of the vector when normalizing vector shuffles. llvm-svn: 59900	2008-11-23 04:35:05 +00:00
Bill Wendling	5424e6d4ec	Cleanup of the [SU]ADDO type legalization code. Patch by Duncan! "It simplifies the type legalization part a bit, and produces better code by teaching SelectionDAG about the extra bits in an i8 SADDO/UADDO node. In essence, I spontaneously decided that on x86 this i8 boolean result would be either 0 or 1, and on other platforms 0/1 or 0/-1, depending on whether the platform likes it's boolean zero extended or sign extended." llvm-svn: 59864	2008-11-22 07:24:01 +00:00
Bill Wendling	be8e7f851c	- Move conversion of [SU]ADDO from DAG combiner into legalizer. - Add "promote integer type" stuff to the legalizer for these nodes. llvm-svn: 59847	2008-11-22 00:22:52 +00:00
Dan Gohman	8dfa51c5ef	Update comments. llvm-svn: 59834	2008-11-21 19:10:41 +00:00
Chris Lattner	dd7083452f	reapply Sanjiv's patch to genericize memcpy/memset/memmove to take an arbitrary integer width for the count. llvm-svn: 59823	2008-11-21 16:42:48 +00:00
Bill Wendling	4bce2bff88	Revert r59802. It was breaking the build of llvm-gcc: g++ -m32 -c -g -DIN_GCC -W -Wall -Wwrite-strings -Wmissing-format-attribute -fno-common -mdynamic-no-pic -DHAVE_CONFIG_H -Wno-unused -DTARGET_NAME=\"i386-apple-darwin9.5.0\" -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include ../../llvm-gcc.src/gcc/llvm-types.cpp -o llvm-types.o ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemCpy(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemMove(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemSet(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i64' is not a member of 'llvm::Intrinsic' make[3]: [llvm-convert.o] Error 1 make[3]: * Waiting for unfinished jobs.... rm fsf-funding.pod gcov.pod gfdl.pod cpp.pod gpl.pod gcc.pod make[2]: * [all-stage1-gcc] Error 2 make[1]: * [stage1-bubble] Error 2 make: *** [all] Error 2 llvm-svn: 59809	2008-11-21 09:09:41 +00:00
Sanjiv Gupta	09a203765a	Make mem[cpy,move,set] intrinsics overloaded. llvm-svn: 59802	2008-11-21 07:49:09 +00:00
Bill Wendling	0b5be6c5e0	Default to converting UADDO to the generic form that SADDO is converted to. llvm-svn: 59801	2008-11-21 07:44:30 +00:00
Mon P Wang	c311360909	Clean up normalization of shuffles llvm-svn: 59792	2008-11-21 04:25:21 +00:00
Bill Wendling	5eee74446d	Combine the two add with overflow intrinsics lowerings. They differ only in DAG node type. llvm-svn: 59788	2008-11-21 02:38:44 +00:00
Bill Wendling	87c175e629	Generate code for llvm.uadd.with.overflow intrinsic. No conversion support yet. llvm-svn: 59786	2008-11-21 02:33:36 +00:00
Dan Gohman	f00cef4491	Add a flag to SDep for tracking which edges are anti-dependence edges. llvm-svn: 59785	2008-11-21 02:27:52 +00:00
Bill Wendling	8badb674eb	Remove chains. Unnecessary. llvm-svn: 59783	2008-11-21 02:22:59 +00:00
Dan Gohman	67b35bd4d1	Rename SDep's isSpecial to isArtificial, to make this field a little less mysterious. llvm-svn: 59782	2008-11-21 02:18:56 +00:00
Bill Wendling	77538cc510	Rename "ADDO" to "SADDO" and "UADDO". The "UADDO" isn't equivalent to "ADDC" because the boolean it returns to indicate an overflow may not be treated like as a flag. It could be stored to memory, for instance. llvm-svn: 59780	2008-11-21 02:12:42 +00:00
Bill Wendling	74296c60ff	Implement the sadd_with_overflow intrinsic. This is converted into "ISD::ADDO". ISD::ADDO is lowered into a target-independent form that does the addition and then checks if the result is less than one of the operands. (If it is, then there was an overflow.) llvm-svn: 59779	2008-11-21 02:03:52 +00:00
Dan Gohman	d1f33e2397	Use ComputeLatency in the MachineInstr scheduler. llvm-svn: 59777	2008-11-21 01:44:51 +00:00
Dan Gohman	63be531e09	Remove the CycleBound computation code from the ScheduleDAGRRList schedulers. This doesn't have much immediate impact because targets that use these schedulers by default don't yet provide pipeline information. This code also didn't have the benefit of register pressure information. Also, removing it will avoid problems with list-burr suddenly starting to do latency-oriented scheduling on x86 when we start providing pipeline data, which would increase spilling. llvm-svn: 59775	2008-11-21 01:30:54 +00:00
Dan Gohman	7b7ca502fa	Implement ComputeLatency for MachineInstr ScheduleDAGs. Factor some of the latency computation logic out of the SDNode ScheduleDAG code into a TargetInstrItineraries helper method to help with this. llvm-svn: 59761	2008-11-21 00:12:10 +00:00
Bill Wendling	39acb29ff8	Add UADDO and SADDO nodes. These will be used for determining an overflow condition in an addition operation. llvm-svn: 59760	2008-11-21 00:11:16 +00:00
Dan Gohman	c602dd407c	Change these schedulers to not emit no-ops. It turns out that the RR scheduler actually does look at latency values, but it doesn't use a hazard recognizer so it has no way to know when a no-op is needed, as opposed to just stalling and incrementing the cycle count. llvm-svn: 59759	2008-11-21 00:10:42 +00:00
Duncan Sands	3fa0a5afab	Add some documentation. llvm-svn: 59727	2008-11-20 10:34:43 +00:00
Bill Wendling	165b45d385	80-column violation. llvm-svn: 59718	2008-11-20 07:24:30 +00:00
Dan Gohman	8e066a1349	Remove a remnant of list-burr's fast mode. llvm-svn: 59702	2008-11-20 03:32:45 +00:00
Dan Gohman	186f65d275	Factor out the SethiUllman numbering logic from the list-burr and list-tdrr schedulers into a common base class. llvm-svn: 59701	2008-11-20 03:30:37 +00:00
Dan Gohman	fd08af4ee7	Remove the "fast" form of the list-burr scheduler, and use the dedicated "fast" scheduler in -fast mode instead, which is faster. This speeds up llc -fast by a few percent on some testcases -- the speedup only happens for code not handled by fast-isel. llvm-svn: 59700	2008-11-20 03:11:19 +00:00
Dan Gohman	3f656dfa03	Facter AddPseudoTwoAddrDeps and associated infrasructure out of the list-burr scheduler so that it can be used by the list-tdrr scheduler too. llvm-svn: 59698	2008-11-20 02:45:51 +00:00
Dan Gohman	4ce15e12b9	Factor out the code for verifying the work of the scheduler, extend it a bit, and make use of it in all schedulers, to ensure consistent checking. llvm-svn: 59689	2008-11-20 01:26:25 +00:00
Dan Gohman	4c3034f711	Simplify this code a little. In the fast scheduler, CreateNewSUnit and CreateClone don't add any extra value. llvm-svn: 59679	2008-11-19 23:39:02 +00:00
Dan Gohman	60cb69e665	Experimental post-pass scheduling support. Post-pass scheduling is currently off by default, and can be enabled with -disable-post-RA-scheduler=false. This doesn't have a significant impact on most code yet because it doesn't yet do anything to address anti-dependencies and it doesn't attempt to disambiguate memory references. Also, several popular targets don't have pipeline descriptions yet. The majority of the changes here are splitting the SelectionDAG-specific code out of ScheduleDAG, so that ScheduleDAG can be moved to libLLVMCodeGen.a. The interface between ScheduleDAG-using code and the rest of the scheduling code is somewhat rough and will evolve. llvm-svn: 59676	2008-11-19 23:18:57 +00:00
Dan Gohman	f4d95fdce9	Move the code for printing a graph node label for an SUnit into a virtual method of SelectionDAG. llvm-svn: 59667	2008-11-19 22:09:45 +00:00
Dan Gohman	78fb6214f3	Convert SUnit's dump method into a print method and implement dump in terms of it. llvm-svn: 59665	2008-11-19 21:32:03 +00:00
Dan Gohman	82016c243b	Rearrange code to reduce the nesting level. No functionality change. llvm-svn: 59580	2008-11-19 02:00:32 +00:00
Dan Gohman	eb87975384	Fix debug printing of flagged SDNodes in SUnits so that they print in the correct order. llvm-svn: 59567	2008-11-19 00:04:44 +00:00
Dan Gohman	6e58726416	Tidy up ScheduleNodeBottomUp methods, and make them more consistent with ScheduleNodeTopDown methods. llvm-svn: 59550	2008-11-18 21:22:20 +00:00
Dan Gohman	71b632f905	Update a comment to reflect the current code. llvm-svn: 59549	2008-11-18 21:14:44 +00:00
Duncan Sands	3ca78c675e	Remove integer promotion support for FP_EXTEND and FP_ROUND. Not sure what these were doing here - probably they were sometimes (wrongly) created with integer operands somewhere that has since been fixed. llvm-svn: 59548	2008-11-18 21:13:59 +00:00
Duncan Sands	97933c3990	Simplify code using helper routines. There is not supposed to be any functionality change. llvm-svn: 59545	2008-11-18 20:56:22 +00:00
Dan Gohman	1132313e71	Whitespace cleanups. llvm-svn: 59532	2008-11-18 17:05:42 +00:00
Duncan Sands	789dbb906d	LegalizeTypes support for splitting and scalarizing SCALAR_TO_VECTOR. I didn't add the testcase, because once llc gets past scalar-to-vector it hits a SPU target lowering bug and explodes. llvm-svn: 59530	2008-11-18 16:40:48 +00:00
Bill Wendling	13020d22da	Rename stackprotector_create intrinsic to stackprotector. llvm-svn: 59519	2008-11-18 11:01:33 +00:00
Duncan Sands	1315f80ea8	Reapply r59464, this time using the correct type when softening FNEG. llvm-svn: 59513	2008-11-18 09:15:03 +00:00
Bill Wendling	7235002bd1	Remove the stackprotector_check intrinsic. Use a volatile load instead. llvm-svn: 59504	2008-11-18 07:30:57 +00:00
Dan Gohman	fe1748da07	Fix a typo in a comment. llvm-svn: 59489	2008-11-18 02:50:01 +00:00
Dan Gohman	22d07b14bc	Change SUnit's dump method to take a ScheduleDAG* instead of a SelectionDAG*. llvm-svn: 59488	2008-11-18 02:06:40 +00:00
Bill Wendling	e0d5e67c98	Revert r59464. It was causing this failure: Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/CodeGen/XCore/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/CodeGen/XCore/fneg.ll Failed with signal(SIGABRT) at line 1 while running: llvm-as < /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/CodeGen/XCore/fneg.ll \| llc -march=xcore > fneg.ll.tmp1.s Assertion failed: (VT.isFloatingPoint() && "Cannot create integer FP constant!"), function getConstantFP, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/lib/CodeGen/SelectionDAG/SelectionDAG.cpp, line 913. 0 llc 0x0092115c _ZN4llvm3sys18RemoveFileOnSignalERKNS0_4PathEPSs + 844 1 libSystem.B.dylib 0x9217809b _sigtramp + 43 2 ??? 0xffffffff 0x0 + 4294967295 3 libSystem.B.dylib 0x921f0ec2 raise + 26 4 libSystem.B.dylib 0x9220047f abort + 73 5 libSystem.B.dylib 0x921f2063 __assert_rtn + 101 6 llc 0x005a5b0a _ZN4llvm12SelectionDAG13getConmake[1]: * [check-local] Error 1 make: * [check] Error 2 llvm-svn: 59487	2008-11-18 01:49:24 +00:00
Dan Gohman	5ebdb98a6e	Avoid using a loop in ReleasePred and ReleaseSucc methods to compute the new CycleBound value. Instead, just update CycleBound on each call. Also, make ReleasePred and ReleaseSucc methods more consistent accross the various schedulers. This also happens to make ScheduleDAGRRList's CycleBound computation somewhat more interesting, though it still doesn't have any noticeable effect, because no current targets that use the register-pressure reduction scheduler provide pipeline models. llvm-svn: 59475	2008-11-18 00:38:59 +00:00
Dan Gohman	92a36d7a78	Eliminate some trivial differences between the ScheduleNodeTopDown functions in these two schedulers. llvm-svn: 59465	2008-11-17 21:31:02 +00:00
Duncan Sands	f046b50ecd	Add soft float support for a bunch more operations. Original patch by Richard Osborne, tweaked and extended by your humble servant. llvm-svn: 59464	2008-11-17 20:52:38 +00:00
Dan Gohman	4f474b092e	Don't bother doing latency calculations in the "fast" scheduler. llvm-svn: 59461	2008-11-17 19:52:36 +00:00
Dan Gohman	a687fd8339	Use SUnit's CycleBound field instead of duplicating it in a side-car datastructure llvm-svn: 59458	2008-11-17 19:45:19 +00:00
Richard Osborne	6751b4a604	Don't produce ADDC/ADDE when expanding SHL unless they are legal for the target. This fixes PR3080. llvm-svn: 59450	2008-11-17 17:34:31 +00:00
Dan Gohman	17c226b8ca	Don't use the isPending flag to mean what the isAvailable flag means. llvm-svn: 59445	2008-11-17 16:37:30 +00:00
Mon P Wang	4964368e0d	Fixed legalization of CONVERT_RNDSAT for integers. llvm-svn: 59432	2008-11-17 00:41:12 +00:00
Mon P Wang	7a82474387	Improved shuffle normalization to avoid using extract/build when we can extract using different indexes for two vectors. Added a few tests for vector shuffles. llvm-svn: 59399	2008-11-16 05:06:27 +00:00
Duncan Sands	da8d2873ed	When splitting a SHUFFLE_VECTOR, try to have the result use SHUFFLE_VECTOR instead. If not practical, fall back to the old scheme of building the split result by hand using a BUILD_VECTOR. llvm-svn: 59361	2008-11-15 09:25:38 +00:00
Mon P Wang	f414cbc1fd	Add missing widen operations, fixed widening for extracting a subvector, and when loading/storing a widen vector, make sure that they are loaded and stored in consecutive order. llvm-svn: 59357	2008-11-15 06:05:52 +00:00
Dan Gohman	68294c06fe	Correct a comment. llvm-svn: 59341	2008-11-15 00:24:23 +00:00
Dan Gohman	d2760c0473	Move ScheduleDAGList's LatencyPriorityQueue class out to a separate file. llvm-svn: 59340	2008-11-15 00:23:40 +00:00
Dan Gohman	1472955eab	Add support for building a ScheduleDAG from MachineInstrs. This is currently fairly conservative; it doesn't do alias-analysis queries and it doesn't attempt to break anti-dependencies. llvm-svn: 59324	2008-11-14 21:47:58 +00:00
Dan Gohman	db8b95a4fa	For post-regalloc scheduling, remove the instructions from the block before re-inserting them. llvm-svn: 59281	2008-11-14 00:33:17 +00:00
Dan Gohman	1a21ab6925	Check in the correct version of the patch in r59279. llvm-svn: 59280	2008-11-14 00:32:34 +00:00
Dan Gohman	8f973f157d	Debug printing for SUnits that carry MachineInstrs. llvm-svn: 59279	2008-11-14 00:28:56 +00:00
Dan Gohman	ee8273e52f	Initial support for carrying MachineInstrs in SUnits. llvm-svn: 59278	2008-11-14 00:06:09 +00:00
Dan Gohman	a2cbbaa41f	Change DOTGraphTraits<ScheduleDAG*>::getGraphName how to find the name of the current function on its own, rather than relying on the SelectionDAG. llvm-svn: 59277	2008-11-13 23:45:55 +00:00
Dan Gohman	072734ebd6	Remove the FlaggedNodes member from SUnit. Instead of requiring each SUnit to carry a SmallVector of flagged nodes, just calculate the flagged nodes dynamically when they are needed. The local-liveness change is due to a trivial scheduling change where the scheduler arbitrary decision differently. llvm-svn: 59273	2008-11-13 23:24:17 +00:00
Dan Gohman	1ddfcba5be	Make the Node member of SUnit private, and add accessors. llvm-svn: 59264	2008-11-13 21:36:12 +00:00
Dan Gohman	5a390b974c	Change ScheduleDAG's DAG member from a reference to a pointer, to prepare for the possibility of scheduling without a SelectionDAG being present. llvm-svn: 59263	2008-11-13 21:21:28 +00:00
Dan Gohman	88ba5f0b96	Move the code that inserts X87 FP_REG_KILL instructions from a special-purpose hook to a new pass. Also, add check to see if any x87 virtual registers are used, to avoid doing any work in the common case that no x87 code is needed. llvm-svn: 59190	2008-11-12 22:55:05 +00:00
Dale Johannesen	6467858be1	Fix unsigned char->ppcf128 conversion. llvm-svn: 59150	2008-11-12 18:38:44 +00:00
Duncan Sands	2907b0085c	Simplify SplitVecRes_EXTRACT_SUBVECTOR. This means that it no longer handles non-power-of-two vectors. However it previously only handled them sometimes, depending on obscure numerical relationships between the index and vector type. For example, for a vector of length 6, it would succeed if and only if the index was an even multiple of 6. I consider this more confusing than useful. llvm-svn: 59122	2008-11-12 08:37:57 +00:00
Duncan Sands	aa7060c885	Correct some thinkos in the expansion of ADD/SUB when the target does not support ADDC/SUBC. This fixes PR3044. llvm-svn: 59120	2008-11-12 08:23:26 +00:00
Dale Johannesen	ffc67df2aa	Fix the testb optimization so x86 also bootstraps. Reenable test. llvm-svn: 59101	2008-11-12 02:00:35 +00:00
Dan Gohman	e52e0897e2	In ScheduleDAGRRList::CopyAndMoveSuccessors, create the SUnit for the load before creating the SUnit for the operation that it was unfolded from. This allows each SUnit to have all of its predecessor SUnits available at the time it is created. I don't know yet if this will be absolutely required, but it is a little tidier to do it this way. llvm-svn: 59083	2008-11-11 21:34:44 +00:00
Dan Gohman	fb78ef9fd3	Avoid relying on the SelectionDAG for initializing the MachineFunction and TargetLoweringInfo variables for the scheduler. llvm-svn: 59082	2008-11-11 21:31:56 +00:00
Dan Gohman	5499e89d06	Change the scheduler accessor methods to accept an explicit TargetMachine argument instead of taking the SelectionDAG's TargetMachine. This is needed for some upcoming scheduler changes. llvm-svn: 59055	2008-11-11 17:50:47 +00:00
Bill Wendling	49a5ce863e	Fix for PR3040: The CC was changed, but wasn't checked to see if it was legal if the DAG combiner was being run after legalization. Threw in a couple of checks just to make sure that it's okay. As far as the PR is concerned, no back-end target actually exhibited this problem, so there isn't an associated testcase. llvm-svn: 59035	2008-11-11 08:25:46 +00:00
Mon P Wang	774e9ac433	Cleaned up and fix bugs in convert_rndsat node llvm-svn: 59025	2008-11-11 05:40:06 +00:00
Bill Wendling	b85755c829	Temporarily revert r58979 and related patch. It's causing a failure in X86 bootstrap: Comparing stages 2 and 3 warning: ./cc1-checksum.o differs warning: ./cc1obj-checksum.o differs warning: ./cc1objplus-checksum.o differs warning: ./cc1plus-checksum.o differs Bootstrap comparison failure! ./alias.o differs ./alloc-pool.o differs ./attribs.o differs ./bb-reorder.o differs ./bitmap.o differs ./build/errors.o differs ./build/genattrtab.o differs ./build/genautomata.o differs ./build/genemit.o differs ./build/genextract.o differs ... -bw llvm-svn: 59003	2008-11-10 21:22:06 +00:00
Mon P Wang	58fb9135e2	Added CONVERT_RNDSAT (conversion with rounding and saturation) SDNode to support targets that support these conversions. Users should avoid using this node as the current targets don't generating code for it. llvm-svn: 59001	2008-11-10 20:54:11 +00:00
Duncan Sands	ddacbb39ab	Fix PR2667: add soft float support for sint_to_fp/uint_to_fp where the argument is an apint, or smaller than the minimum size for which there is a libcall (i32). llvm-svn: 58994	2008-11-10 17:36:26 +00:00
Duncan Sands	13b2e3634b	Tweak some comments. llvm-svn: 58993	2008-11-10 17:31:56 +00:00
Duncan Sands	7da4b44dd1	Small cleanups. No functionality change intended! llvm-svn: 58992	2008-11-10 17:29:56 +00:00
Duncan Sands	d5b53e1c6c	When promoting the result of fp_to_uint/fp_to_sint, inform the optimizers that the result must be zero/ sign extended from the smaller type. For example, if a fp to unsigned i16 is promoted to fp to i32, then we are allowed to assume that the extra 16 bits are zero (because the result of fp to i16 is undefined if the result does not fit in an i16). This is quite aggressive, but should help the optimizers produce better code. This requires correcting a test which thought that fp_to_uint is some kind of truncation, which it is not: in the testcase (which does fp to i1), either the fp value converts to 0 or 1 or the result is undefined, which is quite different to truncation. llvm-svn: 58991	2008-11-10 17:28:30 +00:00
Dale Johannesen	671743369c	Really fix testb optimization on big-endian. Fixes ppc32 bootstrap. llvm-svn: 58979	2008-11-10 07:16:42 +00:00
Mon P Wang	25f0106fd9	Added support for the following definition of shufflevector <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> llvm-svn: 58964	2008-11-10 04:46:22 +00:00
Dale Johannesen	aa4d82d244	Temporarily revert 58825, which breaks PPC bootstrap. xs llvm-svn: 58930	2008-11-09 06:48:10 +00:00
Duncan Sands	0f3937115d	Try to produce better code when scalarizing VSETCC. llvm-svn: 58920	2008-11-08 18:26:48 +00:00
Dale Johannesen	bb5c9b4b68	Make testb optimization work on big-endian targets. llvm-svn: 58874	2008-11-08 00:01:16 +00:00
Dale Johannesen	160be0ffda	Make FP tests requiring two compares work on PPC (PR 642). This is Chris' patch from the PR, modified to realize that SETUGT/SETULT occur legitimately with integers, plus two fixes in LegalizeDAG to pass a valid result type into LegalizeSetCC. The argument of TLI.getSetCCResultType is ignored on PPC, but I think I'm following usage elsewhere. llvm-svn: 58871	2008-11-07 22:54:33 +00:00
Duncan Sands	2d636b5265	Sign-extend rather than zero-extend when promoting the condition for a BRCOND, according to what is returned by getSetCCResultContents. Since all targets return the same thing (ZeroOrOneSetCCResult), this should be harmless! The point is that all over the place the result of SETCC is fed directly into BRCOND. On machines for which getSetCCResultContents returns ZeroOrNegativeOneSetCCResult, this is a sign-extended boolean. So it seems dangerous to also feed BRCOND zero-extended booleans in some circumstances - for example, when promoting the condition. llvm-svn: 58861	2008-11-07 20:13:04 +00:00
Dale Johannesen	9016882d67	Fix unsigned->ppcf128 conversion. llvm-svn: 58856	2008-11-07 19:11:43 +00:00
Dale Johannesen	7aad542d35	When we're doing a compare of load-AND-constant to 0 (e.g. a bitfield test) narrow the load as much as possible. The has the potential to avoid unnecessary partial-word load-after-store conflicts, which cause stalls on several targets. Also a size win on x86 (testb vs testl). llvm-svn: 58825	2008-11-07 01:28:02 +00:00
Bill Wendling	eb4268d72f	- Modify the stack protector algorithm so that the stack slot is allocated in LLVM IR code and not in the selection DAG ISel. This is a cleaner solution. - Fix the heuristic for determining if protectors are necessary. The previous one wasn't checking the proper type size. llvm-svn: 58824	2008-11-07 01:23:58 +00:00
Mon P Wang	5ca2ec65bd	Fixed scalarizing an extract subvector and prevent an infinite loop when simplify a vector. llvm-svn: 58820	2008-11-06 22:52:21 +00:00
Devang Patel	8af0a362f1	Emit label for llvm.dbg.func.start of the inlined function. llvm-svn: 58814	2008-11-06 21:28:20 +00:00
Duncan Sands	f178f8300d	Formating/comment changes - no functionality change. llvm-svn: 58801	2008-11-06 08:51:32 +00:00
Bill Wendling	b3f7a39877	- Rename stackprotector_{prologue,epilogue} to stackprotector_{create,check}. - Get rid of "HasStackProtector" in MachineFrameInfo. - Modify intrinsics to tell which are doing what with memory. llvm-svn: 58799	2008-11-06 07:23:03 +00:00
Mon P Wang	9a8d60a7c0	Widening cleanup llvm-svn: 58796	2008-11-06 05:31:54 +00:00
Bill Wendling	d970ea3eac	Implement the stack protector stack accesses via intrinsics: - stackprotector_prologue creates a stack object and stores the guard there. - stackprotector_epilogue reads the stack guard from the stack position created by stackprotector_prologue. - The PrologEpilogInserter was changed to make sure that the stack guard is first on the stack frame. llvm-svn: 58791	2008-11-06 02:29:10 +00:00
Devang Patel	9e3e776e28	Emit label for llvm.dbg.func.start of the inlined function. llvm-svn: 58786	2008-11-06 00:30:09 +00:00
Duncan Sands	68035d4076	Fix thinko in ppcf128 expansion of truncating store. llvm-svn: 58753	2008-11-05 07:17:27 +00:00
Evan Cheng	c7b04a12bb	Type of shuffle mask has changed. llvm-svn: 58751	2008-11-05 06:04:18 +00:00
Dale Johannesen	db6b956585	80 columns llvm-svn: 58717	2008-11-04 20:52:49 +00:00
Duncan Sands	d5f935921a	Fix PR3011: LegalizeTypes support for scalarizing SELECT_CC. llvm-svn: 58706	2008-11-04 17:31:08 +00:00
Dale Johannesen	08535d2507	Fix some ppcf128 regressions: make ExpandFloatRes_LOAD work correctly, and bring over a late change to ppcf128 SetCC handling. llvm-svn: 58642	2008-11-03 20:47:45 +00:00
Duncan Sands	6692dec2a0	Make VAARG promotion work correctly with large funky sized integers like i129, and also reduce the number of assumptions made about how vaarg is implemented. This still doesn't work correctly for small integers like (eg) i1 on x86, since x86 passes each of them (essentially an i8) in a 4 byte stack slot, so the pointer needs to be advanced by 4 bytes not by 1 byte as now. But this is no longer a LegalizeTypes problem (it was also wrong in LT before): it is a bug in the operation expansion in LegalizeDAG: now LegalizeTypes turns an i1 vaarg into an i8 vaarg which would work fine if only the i8 vaarg was turned into correct code later. llvm-svn: 58635	2008-11-03 20:22:12 +00:00
Duncan Sands	0207a3f897	Make VAARG work with x86 long double (which is 10 bytes long, but is passed in 12/16 bytes). llvm-svn: 58608	2008-11-03 11:51:11 +00:00
Mon P Wang	769134be1e	Added interface to allow clients to create a MemIntrinsicNode for target intrinsics that touches memory llvm-svn: 58548	2008-11-01 20:24:53 +00:00
Dan Gohman	50c76beeb0	Remove some unused virtual function bodies. llvm-svn: 58524	2008-10-31 19:06:33 +00:00
Duncan Sands	8758851908	Add a bunch of libcalls for ppcf128 that were somehow completely forgotten about when writing LegalizeTypes. llvm-svn: 58508	2008-10-31 14:06:52 +00:00
Duncan Sands	e18295c258	Fix PR2986: do not use a potentially illegal type for the shift amount type. Add a check that shifts and rotates use the type returned by getShiftAmountTy for the amount. This exposed some problems in CellSPU and PPC, which have already been fixed. llvm-svn: 58455	2008-10-30 20:26:50 +00:00
Mon P Wang	01b8a5a967	Add missing vsetcc expansion for widening llvm-svn: 58443	2008-10-30 18:21:52 +00:00
Mon P Wang	58c3794c27	Add initial support for vector widening. Logic is set to widen for X86. One will only see an effect if legalizetype is not active. Will move support to LegalizeType soon. llvm-svn: 58426	2008-10-30 08:01:45 +00:00
Duncan Sands	ee273419f9	Uniformize capitalization of NodeId. llvm-svn: 58386	2008-10-29 17:52:12 +00:00
Duncan Sands	fbb10bbec4	Fix PR2977: LegalizeTypes support for expanding VAARG. llvm-svn: 58379	2008-10-29 14:25:28 +00:00
Duncan Sands	17e678be87	Add sanity checking for BUILD_PAIR (I noticed the other day that PPC custom lowering could create a BUILD_PAIR of two f64 with a result type of... f64! - already fixed). Fix a place that triggers the sanity check. llvm-svn: 58378	2008-10-29 14:22:20 +00:00
Duncan Sands	b964813b1f	Fix a FIXME: in ReplaceNodeWith, if the new node is morphed by AnalyzeNewNode into a previously processed node, and different result values of that node are remapped to values with different nodes, then we could end up using wrong values here [we were assuming that all results remap to values with the same underlying node]. This seems theoretically possible, but I don't have a testcase. The meat of the patch is in the changes to AnalyzeNewNode/AnalyzeNewValue and ReplaceNodeWith. While there, I changed names like RemapNode to RemapValue, since it really remaps values. To tell the truth, I would be much happier if we were only remapping nodes (it would simplify a bunch of logic, and allow for some cute speedups) but I haven't yet worked out how to do that. llvm-svn: 58372	2008-10-29 06:42:19 +00:00
Duncan Sands	914745768e	Fix 80 column violations. llvm-svn: 58371	2008-10-29 06:33:00 +00:00
Duncan Sands	d4ec020734	Fix 80 column violations. llvm-svn: 58370	2008-10-29 06:31:03 +00:00
Dan Gohman	1e3c25ac2d	Take Chris' suggestion and define EnableFastISelVerbose and EnableFastISelAbort variables for Release mode instead of using ifdefs in the code. llvm-svn: 58350	2008-10-28 20:35:31 +00:00
Dan Gohman	e750bb67ee	Protect the code for fast-isel debugging with #ifndef NDEBUG. llvm-svn: 58340	2008-10-28 19:08:46 +00:00
Duncan Sands	4068a7f31e	Fix darwin ppc llvm-gcc build breakage: intercept ppcf128 to i32 conversion and expand it into a code sequence like in LegalizeDAG. This needs custom ppc lowering of FP_ROUND_INREG, so turn that on and make it work with LegalizeTypes. Probably PPC should simply custom lower the original conversion. llvm-svn: 58329	2008-10-28 15:00:32 +00:00
Duncan Sands	f3e5850f80	Fix a testcase provided by Bill in which the node id could end up being wrong mostly because of forgetting to remap new nodes that morphed into processed nodes through CSE. llvm-svn: 58323	2008-10-28 09:38:36 +00:00
Chris Lattner	5fa1040130	Don't produce invalid comparisons after legalize. llvm-svn: 58320	2008-10-28 07:11:07 +00:00
Chris Lattner	56d016ab05	fix some whitespace stuff llvm-svn: 58319	2008-10-28 07:10:51 +00:00
Ted Kremenek	8fcff4d87a	Fix bogus comparison of "const char *" with c-string literal. Use strcmp instead. llvm-svn: 58290	2008-10-27 22:43:07 +00:00
David Greene	b04e7c36d3	Add setSubgraphColor to color an entire portion of a SelectionDAG. This will be used to support debug features in TableGen. llvm-svn: 58257	2008-10-27 18:17:03 +00:00
Duncan Sands	835bdca590	Fix UpdateNodeOperands so that it does CSE of calls (and a bunch of other node types). While there, I added a doNotCSE predicate and used it to reduce code duplication (some of the duplicated code was wrong...). This fixes ARM/cse-libcalls.ll when using LegalizeTypes. llvm-svn: 58249	2008-10-27 15:30:53 +00:00
Duncan Sands	75cf2e03ab	Fix a bug in which a node could be added to the worklist twice: UpdateNodeOperands could morph a new node into a node already on the worklist. We would then recalculate the NodeId for this existing node and add it to the worklist. The testcase is ARM/cse-libcalls.ll, the problem showing up once UpdateNodeOperands is taught to do CSE for calls. llvm-svn: 58246	2008-10-27 13:18:32 +00:00

... 44 45 46 47 48 ...

7426 Commits