llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	3ea9066bb4	add node #'s to debug dumps. llvm-svn: 97019	2010-02-24 04:24:44 +00:00
Evan Cheng	328a607490	Re-apply 96540 and 96556 with fixes. llvm-svn: 97011	2010-02-24 01:42:31 +00:00
Chris Lattner	625916df32	make selectnodeto set the nodeid to -1. This makes it more akin to creating a new node then replacing uses. llvm-svn: 97000	2010-02-23 23:01:35 +00:00
Chris Lattner	8585850e94	fix a bug in findNonImmUse (used by IsLegalToFold) where nodes with no id's would cause early exit allowing IsLegalToFold to return true instead of false, producing a cyclic dag. This was striking the new isel because it isn't using SelectNodeTo yet, which theoretically is just an optimization. llvm-svn: 96972	2010-02-23 19:32:27 +00:00
Chris Lattner	1738d49b74	Print node ID's in dumps and views if set. llvm-svn: 96971	2010-02-23 19:31:18 +00:00
David Greene	d8ecd5e902	Speed up cycle checking significantly by caching results. llvm-svn: 96956	2010-02-23 17:37:50 +00:00
Duncan Sands	d0bf6f640f	Revert commits 96556 and 96640, because commit 96556 breaks the dragonegg self-host build. I reverted 96640 in order to revert 96556 (96640 goes on top of 96556), but it also looks like with both of them applied the breakage happens even earlier. The symptom of the 96556 miscompile is the following crash: llvm[3]: Compiling AlphaISelLowering.cpp for Release build cc1plus: /home/duncan/tmp/tmp/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp:4982: void llvm::SelectionDAG::ReplaceAllUsesWith(llvm::SDNode, llvm::SDNode, llvm::SelectionDAG::DAGUpdateListener*): Assertion `(!From->hasAnyUseOfValue(i) \|\| From->getValueType(i) == To->getValueType(i)) && "Cannot use this version of ReplaceAllUsesWith!"' failed. Stack dump: 0. Running pass 'X86 DAG->DAG Instruction Selection' on function '@_ZN4llvm19AlphaTargetLowering14LowerOperationENS_7SDValueERNS_12SelectionDAGE' g++: Internal error: Aborted (program cc1plus) This occurs when building LLVM using LLVM built by LLVM (via dragonegg). Probably LLVM has miscompiled itself, though it may have miscompiled GCC and/or dragonegg itself: at this point of the self-host build, all of GCC, LLVM and dragonegg were built using LLVM. Unfortunately this kind of thing is extremely hard to debug, and while I did rummage around a bit I didn't find any smoking guns, aka obviously miscompiled code. Found by bisection. r96556 \| evancheng \| 2010-02-18 03:13:50 +0100 (Thu, 18 Feb 2010) \| 5 lines Some dag combiner goodness: Transform br (xor (x, y)) -> br (x != y) Transform br (xor (xor (x,y), 1)) -> br (x == y) Also normalize (and (X, 1) == / != 1 -> (and (X, 1)) != / == 0 to match to "test on x86" and "tst on arm" r96640 \| evancheng \| 2010-02-19 01:34:39 +0100 (Fri, 19 Feb 2010) \| 16 lines Transform (xor (setcc), (setcc)) == / != 1 to (xor (setcc), (setcc)) != / == 1. e.g. On x86_64 %0 = icmp eq i32 %x, 0 %1 = icmp eq i32 %y, 0 %2 = xor i1 %1, %0 br i1 %2, label %bb, label %return => testl %edi, %edi sete %al testl %esi, %esi sete %cl cmpb %al, %cl je LBB1_2 llvm-svn: 96672	2010-02-19 11:30:41 +00:00
Evan Cheng	d2d9252f35	Transform (xor (setcc), (setcc)) == / != 1 to (xor (setcc), (setcc)) != / == 1. e.g. On x86_64 %0 = icmp eq i32 %x, 0 %1 = icmp eq i32 %y, 0 %2 = xor i1 %1, %0 br i1 %2, label %bb, label %return => testl %edi, %edi sete %al testl %esi, %esi sete %cl cmpb %al, %cl je LBB1_2 llvm-svn: 96640	2010-02-19 00:34:39 +00:00
Evan Cheng	0ceb68a552	Some dag combiner goodness: Transform br (xor (x, y)) -> br (x != y) Transform br (xor (xor (x,y), 1)) -> br (x == y) Also normalize (and (X, 1) == / != 1 -> (and (X, 1)) != / == 0 to match to "test on x86" and "tst on arm" llvm-svn: 96556	2010-02-18 02:13:50 +00:00
David Greene	b7941b0703	Make the non-temporal bit "significant" in MemSDNodes so they aren't CSE'd or otherwise combined with temporal MemSDNodes. llvm-svn: 96505	2010-02-17 20:21:42 +00:00
Chris Lattner	e78bc753fe	sink special case "cannotyetselect" for intrinsics out of the tblgen splatted code into the implementation. llvm-svn: 96460	2010-02-17 06:28:22 +00:00
Duncan Sands	19d0b47b1f	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
Evan Cheng	3f08464a1a	Fix a memory leak. Patch by Nicolas Geoffray. llvm-svn: 96295	2010-02-15 23:16:53 +00:00
Evan Cheng	5e73ff2e3a	Split SelectionDAGISel::IsLegalAndProfitableToFold to IsLegalToFold and IsProfitableToFold. The generic version of the later simply checks whether the folding candidate has a single use. This allows the target isel routines more flexibility in deciding whether folding makes sense. The specific case we are interested in is folding constant pool loads with multiple uses. llvm-svn: 96255	2010-02-15 19:41:07 +00:00
David Greene	39c6d01879	Add non-temporal flags and remove an assumption of default arguments. llvm-svn: 96240	2010-02-15 17:00:31 +00:00
Duncan Sands	9dff9bec31	Uniformize the names of type predicates: rather than having isFloatTy and isInteger, we now have isFloatTy and isIntegerTy. Requested by Chris! llvm-svn: 96223	2010-02-15 16:12:20 +00:00
Jakob Stoklund Olesen	45396438c3	Use array_pod_sort instead of std::sort for improved code size. Use SmallVector instead of std::vector for better speed when indirectbr has few successors. llvm-svn: 95879	2010-02-11 18:06:56 +00:00
Jakob Stoklund Olesen	896428d630	Remove duplicate successors from indirectbr instructions before building the machine CFG. This makes early tail duplication run 60 times faster when compiling the Firefox JavaScript interpreter, see PR6186. llvm-svn: 95831	2010-02-11 00:34:18 +00:00
Mon P Wang	5b77f0dac1	The previous fix of widening divides that trap was too fragile as it depends on custom lowering and requires that certain types exist in ValueTypes.h. Modified widening to check if an op can trap and if so, the widening algorithm will apply only the op on the defined elements. It is safer to do this in widening because the optimizer can't guarantee removing unused ops in some cases. llvm-svn: 95823	2010-02-10 23:37:45 +00:00
Dan Gohman	4a618827de	Fix "the the" and similar typos. llvm-svn: 95781	2010-02-10 16:03:48 +00:00
Evan Cheng	29b8f554fc	Now that ShrinkDemandedOps() is separated out from DAG combine. It sometimes leave some obvious nops which dag combine used to clean up afterwards e.g. (trunk (ext n)) -> n. Look for them and squash them. llvm-svn: 95757	2010-02-10 02:17:34 +00:00
Evan Cheng	3ebd551aac	Emit an error for illegal inline asm constraint (which uses illegal type) rather than asserting. llvm-svn: 95746	2010-02-10 01:21:02 +00:00
Dale Johannesen	3d1f1cccbb	Fix comments to reflect renaming elsewhere. llvm-svn: 95730	2010-02-10 00:11:11 +00:00
David Greene	893047d43e	Only dump output in debug mode. llvm-svn: 95711	2010-02-09 23:03:05 +00:00
Chris Lattner	b06015aa69	move target-independent opcodes out of TargetInstrInfo into TargetOpcodes.h. #include the new TargetOpcodes.h into MachineInstr. Add new inline accessors (like isPHI()) to MachineInstr, and start using them throughout the codebase. llvm-svn: 95687	2010-02-09 19:54:29 +00:00
Dale Johannesen	120cfe23a7	Apply the 95471 fix to SelectionDAGBuilder as well; we can get in here if FastISel gives up in a block. (Actually the two copies of this need to be unified. Later.) llvm-svn: 95579	2010-02-08 21:53:27 +00:00
Dan Gohman	bd374da130	In guaranteed tailcall mode, don't decline the tailcall optimization for blocks ending in "unreachable". llvm-svn: 95565	2010-02-08 20:34:14 +00:00
Dale Johannesen	db2eb47835	After Victor's latest commits I am seeing null addresses in dbg.declare; ignore this for the moment to prevent things from breaking. llvm-svn: 95471	2010-02-06 02:26:02 +00:00
Evan Cheng	3b245876c0	When the scheduler unfold a load folding instruction it move some of the predecessors to the unfolded load. It decides what gets moved to the load by checking whether the new load is using the predecessor as an operand. The check neglects the cases whether the predecessor is a flagged scheduling unit. rdar://7604000 llvm-svn: 95339	2010-02-05 01:27:11 +00:00
Evan Cheng	0a4fa4ca93	Fix typo Duncan noticed. llvm-svn: 95322	2010-02-04 19:07:06 +00:00
Evan Cheng	01676f9ff4	It's too risky to eliminate sext / zext of call results for tail call optimization even if the caller / callee attributes completely match. The callee may have been bitcast'ed (or otherwise lied about what it's doing). llvm-svn: 95282	2010-02-04 02:45:02 +00:00
Evan Cheng	27a41d5473	Revert 94937 and move the noreturn check to codegen. llvm-svn: 95198	2010-02-03 03:55:59 +00:00
Evan Cheng	40905b4302	Allow all types of callee's to be tail called. But avoid automatic tailcall if the callee is a result of bitcast to avoid losing necessary zext / sext etc. llvm-svn: 95195	2010-02-03 03:28:02 +00:00
Evan Cheng	6f36a083ef	Revert 95130. llvm-svn: 95160	2010-02-02 23:55:14 +00:00
Evan Cheng	c1b0116ff1	Pass callsite return type to TargetLowering::LowerCall and use that to check sibcall eligibility. llvm-svn: 95130	2010-02-02 21:29:10 +00:00
Mon P Wang	d74e0023c5	Improve EXTRACT_VECTOR_ELT patch based on comments from Duncan llvm-svn: 95012	2010-02-01 22:15:09 +00:00
Chris Lattner	f5edeebd8c	eliminate a bunch of pointless LLVMContext arguments. llvm-svn: 95001	2010-02-01 20:48:08 +00:00
Dale Johannesen	0b30cfc57e	fix PR 6157. Testcase pending. llvm-svn: 94996	2010-02-01 19:54:53 +00:00
Mon P Wang	72c60c73af	Fixed a couple of optimization with EXTRACT_VECTOR_ELT that assumes the result type is the same as the element type of the vector. EXTRACT_VECTOR_ELT can be used to extended the width of an integer type. This fixes a bug for Generic/vector-casts.ll on a ppc750. llvm-svn: 94990	2010-02-01 19:03:18 +00:00
Duncan Sands	3327498095	Change the SREM case to match the logic in the IR version ComputeMaskedBits. llvm-svn: 94805	2010-01-29 09:45:26 +00:00
Bill Wendling	954cb187e0	Assign the ordering of SDNodes in a much less intrusive fashion. After the "visit*" method is called, take the newly created nodes, walk them in a DFS fashion, and if they don't have an ordering set, then give it one. llvm-svn: 94757	2010-01-28 21:51:40 +00:00
Jim Grosbach	54c0530834	Update of 94055 to track the IR level call site information via an intrinsic. This allows code gen and the exception table writer to cooperate to make sure landing pads are associated with the correct invoke locations. llvm-svn: 94726	2010-01-28 01:45:32 +00:00
Evan Cheng	67a69dd2ed	Eliminate target hook IsEligibleForTailCallOptimization. Target independent isel should always pass along the "tail call" property. Change target hook LowerCall's parameter "isTailCall" into a refernce. If the target decides it's impossible to honor the tail call request, it should set isTailCall to false to make target independent isel happy. llvm-svn: 94626	2010-01-27 00:07:07 +00:00
Evan Cheng	c35b5a123b	Allow some automatic tailcall optimization without changing ABI. llvm-svn: 94611	2010-01-26 23:13:04 +00:00
Chris Lattner	547c761dd6	eliminate the TargetLowering::UsesGlobalOffsetTable bool, which is subsumed by TargetLowering::getJumpTableEncoding(). Change uses of it to be more specific. llvm-svn: 94529	2010-01-26 06:53:37 +00:00
Chris Lattner	8a785d7a67	Move getJTISymbol from MachineJumpTableInfo to MachineFunction, which is more convenient, and change getPICJumpTableRelocBaseExpr to take a MachineFunction to match. Next, move the X86 code that create a PICBase symbol to X86TargetLowering::getPICBaseSymbol from X86MCInstLower::GetPICBaseSymbol, which was an asmprinter specific library. This eliminates a 'gross hack', and allows us to implement X86ISelLowering::getPICJumpTableRelocBaseExpr which now calls it. This in turn allows us to eliminate the X86AsmPrinter::printPICJumpTableSetLabel method, which was the only overload of printPICJumpTableSetLabel. llvm-svn: 94526	2010-01-26 06:28:43 +00:00
Chris Lattner	273735bc5a	add a new MachineJumpTableInfo::getJTISymbol method, use it to implement the default TargetLowering::getPICJumpTableRelocBaseExpr llvm-svn: 94523	2010-01-26 05:58:28 +00:00
Chris Lattner	8a6c1eaabb	stub out a new target hook, need some refactoring before I can implement it. llvm-svn: 94521	2010-01-26 05:30:30 +00:00
Evan Cheng	555f61bf58	Implement cond ? -1 : 0 with sbb. llvm-svn: 94490	2010-01-26 02:00:44 +00:00
Dale Johannesen	d5575f29f1	Generate DEBUG_VALUE comments on x86. The (limited) dbg.declare's we currently generate go through both register allocators without perturbing the results. llvm-svn: 94480	2010-01-26 00:09:58 +00:00
Chris Lattner	b6db2c6b31	Rearrange handling of jump tables. Highlights: 1. MachineJumpTableInfo is now created lazily for a function the first time it actually makes a jump table instead of for every function. 2. The encoding of jump table entries is now described by the MachineJumpTableInfo::JTEntryKind enum. This enum is determined by the TLI::getJumpTableEncoding() hook, instead of by lots of code scattered throughout the compiler that "knows" that jump table entries are always 32-bits in pic mode (for example). 3. The size and alignment of jump table entries is now calculated based on their kind, instead of at machinefunction creation time. Future work includes using the EntryKind in more places in the compiler, eliminating other logic that "knows" the layout of jump tables in various situations. llvm-svn: 94470	2010-01-25 23:26:13 +00:00
Chris Lattner	823aed16f9	make -fno-rtti the default unless a directory builds with REQUIRES_RTTI. llvm-svn: 94378	2010-01-24 20:43:08 +00:00
Mon P Wang	4f45512c23	It seems better to scalarize vectors of size 1 instead of widening them. Add support to widen SETCC. llvm-svn: 94342	2010-01-24 00:24:43 +00:00
Mon P Wang	586d997e98	Improved widening loads by adding support for wider loads if the alignment allows. Fixed a bug where we didn't use a vector load/store for PR5626. llvm-svn: 94338	2010-01-24 00:05:03 +00:00
Bill Wendling	8cbc25d945	Remove the '-disable-scheduling' flag and replace it with the 'source' option of the '-pre-RA-sched' flag. It actually makes more sense to do it this way. Also, keep track of the SDNode ordering by default. Eventually, we would like to make this ordering a way to break a "tie" in the scheduler. However, doing that now breaks the "CodeGen/X86/abi-isel.ll" test for 32-bit Linux. llvm-svn: 94308	2010-01-23 10:26:57 +00:00
Evan Cheng	c22893a3b7	Enable pre-regalloc scheduling load clustering by default. llvm-svn: 94255	2010-01-22 23:49:45 +00:00
Chris Lattner	7ba0661f27	Stop building RTTI information for most llvm libraries. Notable missing ones are libsupport, libsystem and libvmcore. libvmcore is currently blocked on bugpoint, which uses EH. Once it stops using EH, we can switch it off. This #if 0's out 3 unit tests, because gtest requires RTTI information. Suggestions welcome on how to fix this. llvm-svn: 94164	2010-01-22 06:49:46 +00:00
Evan Cheng	9d92aaabf1	Teach pre-regalloc scheduler to schedule loads from nearby addresses. It may improve cache locality. This is controlled by -cluster-loads for now. llvm-svn: 94148	2010-01-22 03:36:51 +00:00
Evan Cheng	32bfe1e837	Trim unneeded includes. llvm-svn: 94105	2010-01-21 21:44:43 +00:00
Jim Grosbach	143f7eb4c8	back this out for now. Growing Function is not good. llvm-svn: 94097	2010-01-21 20:10:22 +00:00
Jim Grosbach	e029a6a5ed	Make sure that landing pad entries in the EH call site table are in the proper order for SjLj style exception handling. llvm-svn: 94055	2010-01-21 00:43:30 +00:00
David Greene	0985160c54	When XDEBUG is enabled, check for SelectionDAG cycles at some key points. This will help us find future problems like the one described in PR6019. llvm-svn: 94019	2010-01-20 20:13:31 +00:00
David Greene	3b2a68ceb8	Add some asserts to check SelectionDAG problems earlier. llvm-svn: 93960	2010-01-20 00:59:23 +00:00
Dan Gohman	954f49014d	Fold (add x, shl(0 - y, n)) -> sub(x, shl(y, n)), to simplify some code that SCEVExpander can produce when running on behalf of LSR. llvm-svn: 93949	2010-01-19 23:30:49 +00:00
David Greene	f1c7388b29	Add some new debugging APIs to print out "raw" SelectionDAGs to make understanding CannotYTetSelect and other errors easier. llvm-svn: 93901	2010-01-19 20:37:34 +00:00
Dale Johannesen	a3db6ef9a2	Revert 93811 per request. llvm-svn: 93818	2010-01-19 00:10:52 +00:00
Dale Johannesen	0c90d43b70	Enable code to emit dbg.declare as DEBUG_VALUE comments (fast isel, X86). This doesn't seem to break any functionality, but will introduce cases where -g affects the generated code. I'll be fixing that. llvm-svn: 93811	2010-01-18 23:34:55 +00:00
Evan Cheng	88b65bc835	Canonicalize -1 - x to ~x. Instcombine does this but apparently there are situations where this pattern will escape the optimizer and / or created by isel. Here is a case that's seen in JavaScriptCore: %t1 = sub i32 0, %a %t2 = add i32 %t1, -1 The dag combiner pattern: ((c1-A)+c2) -> (c1+c2)-A will fold it to -1 - %a. llvm-svn: 93773	2010-01-18 21:38:44 +00:00
Kenneth Uildriks	dd6ddd1aeb	When checking for sret-demotion, it needs to use legal types. When using the return value of an sret-demoted call, it needs to use possibly illegal types that match the declared Type of the callee. llvm-svn: 93667	2010-01-16 23:37:33 +00:00
David Greene	554039a914	Add some debug routines to SelectionDAG to dump full DAGs. print/dumpWithDepth allows one to dump a DAG up to N levels deep. dump/printWithFullDepth prints the whole DAG, subject to a depth limit on 100 in the default case (to prevent infinite recursion). Have CannotYetSelect to a dumpWithFullDepth so it is clearer exactly what the non-matching DAG looks like. llvm-svn: 93538	2010-01-15 19:43:23 +00:00
Victor Hernandez	b324e66f4c	Improve llvm.dbg.declare intrinsic by referring directly to the storage in its first argument, via function-local metadata (instead of via a bitcast). This patch also cleans up code that expects there to be a bitcast in the first argument and testcases that call llvm.dbg.declare. It also strips old llvm.dbg.declare intrinsics that did not pass metadata as the first argument. llvm-svn: 93531	2010-01-15 19:04:09 +00:00
Victor Hernandez	8d4904b639	Revert r93504 because older uses of llvm.dbg.declare intrinsics need to be auto-upgraded llvm-svn: 93515	2010-01-15 17:36:47 +00:00
Victor Hernandez	5d6551816b	Improve llvm.dbg.declare intrinsic by referring directly to the storage in its first argument, via function-local metadata (instead of via a bitcast). This patch also cleans up code that expects there to be a bitcast in the first argument and testcases that call llvm.dbg.declare. llvm-svn: 93504	2010-01-15 03:37:48 +00:00
Jim Grosbach	4f1b0ded75	fix 80-column violations llvm-svn: 93487	2010-01-15 00:36:15 +00:00
Dan Gohman	dd5286dc63	Fix a codegen abort seen in 483.xalancbmk. llvm-svn: 93417	2010-01-14 03:08:49 +00:00
Dan Gohman	d49763d200	Update a partially obsolete comment. llvm-svn: 93228	2010-01-12 04:32:35 +00:00
Dan Gohman	f9d6d53823	Fix a typo in a comment. llvm-svn: 93227	2010-01-12 04:30:26 +00:00
Jakob Stoklund Olesen	d2a1bee2d4	Avoid adding PHI arguments for a predecessor that has gone away when a BRCOND was constant folded. This fixes PR5980. llvm-svn: 93184	2010-01-11 21:02:33 +00:00
Mon P Wang	ec57c81e64	Disable transformation of select of two loads to a select of address and then a load if the loads are not in the default address space because the transformation discards src value info. llvm-svn: 93180	2010-01-11 20:12:49 +00:00
Dan Gohman	6bd3ef82ff	Revert an earlier change to SIGN_EXTEND_INREG for vectors. The VTSDNode really does need to be a vector type, because TargetLowering::getOperationAction for SIGN_EXTEND_INREG uses that type, and it needs to be able to distinguish between vectors and scalars. Also, fix some more issues with legalization of vector casts. llvm-svn: 93043	2010-01-09 02:13:55 +00:00
Evan Cheng	0c6defd577	Dan pointed out checking whether a node is dead by comparing its opcode to ISD::DELETED_NODE is not safe. Use a DAGUpdateListener to remove dead nodes from work list instead. llvm-svn: 93031	2010-01-09 00:21:08 +00:00
Evan Cheng	58ec4fec88	ReplaceAllUsesOfValueWith may delete other nodes that the one being replaced. Do not delete dead nodes again. llvm-svn: 92988	2010-01-08 02:36:12 +00:00
Chris Lattner	dab2cd543f	Fix rdar://7517201, a regression introduced by r92849. When folding a and(any_ext(load)) both the any_ext and the load have to have only a single use. This removes the anyext-uses.ll testcase which started failing because it is unreduced and unclear what it is testing. llvm-svn: 92950	2010-01-07 21:59:23 +00:00
Chris Lattner	88de38453f	factor this code better and reduce nesting at the same time, no functionality change. llvm-svn: 92948	2010-01-07 21:53:27 +00:00
Evan Cheng	16b75ce19c	APInt'fy TargetLowering::SimplifySetCC to fix PR5963. llvm-svn: 92943	2010-01-07 20:58:44 +00:00
Benjamin Kramer	cdb3889791	Use pop_back_val instead of back()+pop_back. llvm-svn: 92918	2010-01-07 17:27:56 +00:00
Evan Cheng	746012a6c1	Comment. llvm-svn: 92850	2010-01-06 19:43:21 +00:00
Evan Cheng	166a4e6caa	Teach dag combine to fold the following transformation more aggressively: (OP (trunc x), (trunc y)) -> (trunc (OP x, y)) Unfortunately this simple change causes dag combine to infinite looping. The problem is the shrink demanded ops optimization tend to canonicalize expressions in the opposite manner. That is badness. This patch disable those optimizations in dag combine but instead it is done as a late pass in sdisel. This also exposes some deficiencies in dag combine and x86 setcc / brcond lowering. Teach them to look pass ISD::TRUNCATE in various places. llvm-svn: 92849	2010-01-06 19:38:29 +00:00
Bill Wendling	c075acbb54	The previous code could potentially cause a cycle. Allow ordering w.r.t. a 0 order. llvm-svn: 92810	2010-01-06 00:23:35 +00:00
Bill Wendling	578865ff3d	Only check the ordering if there is an ordering for each nodes. llvm-svn: 92807	2010-01-06 00:09:23 +00:00
Bill Wendling	0a7056fe52	Add a semi-primitive form of scheduling via the "SDNode ordering" to the bottom-up scheduler. We prefer the lower order number. llvm-svn: 92806	2010-01-05 23:48:12 +00:00
Bill Wendling	03f0af372c	Don't assign the shift the same type as the variable being shifted. This could result in illegal types for the SHL operator. llvm-svn: 92797	2010-01-05 22:39:10 +00:00
Dan Gohman	404a984780	Don't use the ISD::NodeType enum for SDNode opcodes, as CodeGen uses several kinds of opcode values which are not declared within that enum. This fixes PR5946. llvm-svn: 92794	2010-01-05 22:26:32 +00:00
Benjamin Kramer	ccce8bae14	Avoid going through the LLVMContext for type equality where it's safe to dereference the type pointer. llvm-svn: 92726	2010-01-05 13:12:22 +00:00
Devang Patel	33f80d2303	Delete renaming use of dead dbg intrinsics. Intrinsic::dbg_stoppoint Intrinsic::dbg_region_start Intrinsic::dbg_region_end Intrinsic::dbg_func_start llvm-svn: 92672	2010-01-05 01:47:06 +00:00
David Greene	30ed3ca034	Change errs() to dbgs(). llvm-svn: 92597	2010-01-05 01:26:11 +00:00
David Greene	4eb5bed65b	Change errs() to dbgs(). llvm-svn: 92581	2010-01-05 01:25:11 +00:00
David Greene	d65bc15c81	Change errs() to dbgs(). llvm-svn: 92580	2010-01-05 01:25:09 +00:00
David Greene	6f021a30fe	Change errs() to dbgs(). llvm-svn: 92579	2010-01-05 01:25:04 +00:00
David Greene	fe5c3524c7	Change errs() to dbgs(). llvm-svn: 92578	2010-01-05 01:25:00 +00:00
David Greene	5730f203ee	Change errs() to dbgs(). llvm-svn: 92577	2010-01-05 01:24:57 +00:00
David Greene	f34d7ac9f1	Change errs() to dbgs(). llvm-svn: 92576	2010-01-05 01:24:54 +00:00
David Greene	ae4f266b2d	Change errs() to dbgs(). llvm-svn: 92575	2010-01-05 01:24:53 +00:00
David Greene	ec5883fc0e	Change errs() to dbgs(). llvm-svn: 92574	2010-01-05 01:24:50 +00:00
David Greene	807fcf6374	Change errs() to dbgs(). llvm-svn: 92573	2010-01-05 01:24:48 +00:00
David Greene	40deefdc4f	Change errs() to dbgs(). llvm-svn: 92572	2010-01-05 01:24:45 +00:00
David Greene	63145844c8	Change errs() to dbgs(). llvm-svn: 92571	2010-01-05 01:24:43 +00:00
David Greene	4cec475ed7	Change errs() to dbgs(). llvm-svn: 92570	2010-01-05 01:24:40 +00:00
David Greene	d93137dce7	Change errs() to dbgs(). llvm-svn: 92569	2010-01-05 01:24:36 +00:00
David Greene	7562faa4cf	Change errs() to dbgs(). llvm-svn: 92568	2010-01-05 01:24:34 +00:00
Dan Gohman	ea6f91ff64	Change SelectCode's argument from SDValue to SDNode , to make it more clear what information these functions are actually using. This is also a micro-optimization, as passing a SDNode around is simpler than passing a { SDNode *, int } by value or reference. llvm-svn: 92564	2010-01-05 01:24:18 +00:00
Dan Gohman	feeced4104	Use a pointer type rather than MVT::Other for the ExternalSymbol node used in an inline asm. llvm-svn: 92512	2010-01-04 21:00:54 +00:00
Chris Lattner	1eea3b0ada	Teach codegen to handle: (X != null) \| (Y != null) --> (X\|Y) != 0 (X == null) & (Y == null) --> (X\|Y) == 0 so that instcombine can stop doing this for pointers. This is part of PR3351, which is a case where instcombine doing this for pointers (inserting ptrtoint) is pessimizing code. llvm-svn: 92406	2010-01-02 00:00:03 +00:00
Chris Lattner	24576a5cf3	whitespace cleanup llvm-svn: 92404	2010-01-01 23:37:34 +00:00
Mikhail Glushenkov	5c35d2f6a4	Fix a warning on gcc 4.4. SelectionDAGBuilder.cpp:4294: warning: suggest explicit braces to avoid ambiguous ‘else’ llvm-svn: 92395	2010-01-01 04:41:36 +00:00
Mikhail Glushenkov	2abe1b70ac	Trailing whitespace, 80-col violations. llvm-svn: 92394	2010-01-01 04:41:22 +00:00
Chris Lattner	39f18e545e	Teach codegen to lower llvm.powi to an efficient (but not optimal) multiply sequence when the power is a constant integer. Before, our codegen for std::pow(.., int) always turned into a libcall, which was really inefficient. This should also make many gfortran programs happier I'd imagine. llvm-svn: 92388	2010-01-01 03:32:16 +00:00
Chris Lattner	8e805be369	remove a bunch of unneeded functions. llvm-svn: 92263	2009-12-29 09:32:19 +00:00
Chris Lattner	a0566979b7	Final step in the metadata API restructuring: move the getMDKindID/getMDKindNames methods to LLVMContext (and add convenience methods to Module), eliminating MetadataContext. Move the state that it maintains out to LLVMContext. llvm-svn: 92259	2009-12-29 09:01:33 +00:00
Chris Lattner	2f2aa2b067	This is a major cleanup of the instruction metadata interfaces that I asked Devang to do back on Sep 27. Instead of going through the MetadataContext class with methods like getMD() and getMDs(), just ask the instruction directly for its metadata with getMetadata() and getAllMetadata(). This includes a variety of other fixes and improvements: previously all Value*'s were bloated because the HasMetadata bit was thrown into value, adding a 9th bit to a byte. Now this is properly sunk down to the Instruction class (the only place where it makes sense) and it will be folded away somewhere soon. This also fixes some confusion in getMDs and its clients about whether the returned list is indexed by the MDID or densely packed. This is now returned sorted and densely packed and the comments make this clear. This introduces a number of fixme's which I'll follow up on. llvm-svn: 92235	2009-12-28 23:41:32 +00:00
Chris Lattner	7093946ab1	rename getMDKind -> getMDKindID, make it autoinsert if an MD Kind doesn't exist already, eliminate registerMDKind. Tidy up a bunch of random stuff. llvm-svn: 92225	2009-12-28 20:45:51 +00:00
Sanjiv Gupta	0b00a1b54e	Allow targets to specify the return type of libcalls that are generated for floating point comparisons, rather than hard-coding them as i32. llvm-svn: 92199	2009-12-28 02:40:33 +00:00
Bill Wendling	42bc7ad2b1	Remove dead store. llvm-svn: 92190	2009-12-28 01:51:30 +00:00
Bill Wendling	5b8d89d0a2	Remove dead variable. llvm-svn: 92189	2009-12-28 01:48:56 +00:00
Bill Wendling	9a62b467a8	Remove dead variable. llvm-svn: 92188	2009-12-28 01:47:48 +00:00
Bill Wendling	846ca9b38b	Remove dead variable. llvm-svn: 92180	2009-12-28 01:02:21 +00:00
Bill Wendling	7da8f90d41	Remove dead variable. llvm-svn: 92178	2009-12-28 01:00:12 +00:00
Chris Lattner	f5e3ed64d5	handle equality memcmp of 8 bytes on x86-64 with two unaligned loads and a compare. On other targets we end up with a call to memcmp because we don't want 16 individual byte loads. We should be able to use movups as well, but we're failing to select the generated icmp. llvm-svn: 92107	2009-12-24 01:07:17 +00:00
Chris Lattner	1a32ede6fd	move an optimization for memcmp out of simplifylibcalls and into SDISel. This optimization was causing simplifylibcalls to introduce type-unsafe nastiness. This is the first step, I'll be expanding the memcmp optimizations shortly, covering things that we really really wouldn't want simplifylibcalls to do. llvm-svn: 92098	2009-12-24 00:37:38 +00:00
Nuno Lopes	129819de71	move a few more symbols to .rodata llvm-svn: 92011	2009-12-23 17:48:10 +00:00
Dale Johannesen	a864a67185	Use more sensible type for flags in asms. PR 5570. Patch by Sylve`re Teissier (sorry, ASCII only). llvm-svn: 91988	2009-12-23 07:32:51 +00:00
Eric Christopher	fdb33458fc	Update objectsize intrinsic and associated dependencies. Fix lowering code and update testcases. llvm-svn: 91979	2009-12-23 02:51:48 +00:00
Bill Wendling	0602f39bb1	Remove superfluous SDNode ordering. llvm-svn: 91971	2009-12-23 01:28:19 +00:00
Bill Wendling	9df5c6dfc3	Remove node ordering from inline asm nodes. It's not needed. llvm-svn: 91961	2009-12-23 00:47:20 +00:00
Bill Wendling	91313064f1	Remove node ordering from VA nodes. It's not needed. llvm-svn: 91958	2009-12-23 00:44:51 +00:00
Bill Wendling	ef408db250	Revert r91949 r91942 and r91936. llvm-svn: 91953	2009-12-23 00:28:23 +00:00
Bill Wendling	54dd5398e0	Finish up node ordering in ExpandNode. llvm-svn: 91949	2009-12-23 00:05:09 +00:00
Bill Wendling	ad1fdf0e40	Assign ordering to nodes created in ExpandNode. Only roughly 1/2 of the function is finished. llvm-svn: 91942	2009-12-22 23:44:56 +00:00
Bill Wendling	70794596a8	Assign ordering to SDNodes in PromoteNode. Also fixing a subtle bug where BSWAP was using "Tmp1" in the first getNode call instead of Node->getOperand(0). llvm-svn: 91936	2009-12-22 22:53:39 +00:00
Bill Wendling	d85498132f	Allow 0 as an order number. Don't assign an order to formal arguments. llvm-svn: 91920	2009-12-22 21:35:02 +00:00
Bob Wilson	bac37abe73	Report an error for bad inline assembly, where the value passed for an "indirect" operand is not a pointer. llvm-svn: 91913	2009-12-22 18:34:19 +00:00
Bill Wendling	919b7aab2e	Add more plumbing. This time in the LowerArguments and "get" functions which return partial registers. This affected the back-end lowering code some. Also patch up some places I missed before in the "get" functions. llvm-svn: 91880	2009-12-22 02:10:19 +00:00
Bill Wendling	ac08758b71	Add SDNode ordering to inlined asm and VA functions. llvm-svn: 91876	2009-12-22 01:25:10 +00:00
Bill Wendling	f376c40d0e	Adding more assignment of ordering to SDNodes. This time in the "call" and generic copy functions. llvm-svn: 91872	2009-12-22 01:11:43 +00:00
Bill Wendling	a4d7df7a37	Add ordering of SDNodes to LowerCallTo. llvm-svn: 91866	2009-12-22 00:50:32 +00:00
Bill Wendling	b99b2693f3	Now add ordering to SDNodes created by the massive intrinsic lowering function. llvm-svn: 91863	2009-12-22 00:40:51 +00:00
Bill Wendling	ea3e73e596	To make things interesting, I added MORE code to set the ordering of SDNodes. This time in the load/store and limited-precision code. llvm-svn: 91860	2009-12-22 00:12:37 +00:00
Bill Wendling	c6b473433b	Add more plumbing to assign ordering to SDNodes. Have the "getValue" method assign the ordering when called. Combine some of the ordering assignments to keep things simple. llvm-svn: 91857	2009-12-21 23:47:40 +00:00
Bill Wendling	e79105b591	More ordering plumbing. This time for GEP. I need to remember to assign orderings to values returned by getValue(). llvm-svn: 91850	2009-12-21 23:10:19 +00:00
Bill Wendling	fff99f066b	Another incremental check-in for assigning ordering to SDNodes. This time for shuffle and insert vector. llvm-svn: 91847	2009-12-21 22:42:14 +00:00
Bill Wendling	443d0722b0	Assign ordering to more instructions. Incremental check-in. llvm-svn: 91846	2009-12-21 22:30:11 +00:00
Bill Wendling	28727f3785	- Add a bit more plumbing assigning an order to SDNodes. - Modify the "dump" method to emit the order of an SDNode. llvm-svn: 91845	2009-12-21 21:59:52 +00:00
Bill Wendling	7f5eb53ce2	First wave of plumbing for assigning an ordering to SDNodes. This takes care of a lot of the branching instructions. llvm-svn: 91838	2009-12-21 19:59:38 +00:00
Bill Wendling	6de55a0efd	Place SDNodeOrdering.h in the directory it's used. llvm-svn: 91834	2009-12-21 19:34:59 +00:00
Anton Korobeynikov	10590171fa	Use 4-arg getVTList) variant instead of generic one, when possible llvm-svn: 91744	2009-12-19 02:04:00 +00:00
Bill Wendling	022d18fa3f	Changes from review: - Move DisableScheduling flag into TargetOption.h - Move SDNodeOrdering into its own header file. Give it a minimal interface that doesn't conflate construction with storage. - Move assigning the ordering into the SelectionDAGBuilder. This isn't used yet, so there should be no functional changes. llvm-svn: 91727	2009-12-18 23:32:53 +00:00
Evan Cheng	b175de6356	Increase opportunities to optimize (brcond (srl (and c1), c2)). llvm-svn: 91717	2009-12-18 21:31:31 +00:00
Bob Wilson	3152b0471b	Handle ARM inline asm "w" constraints with 64-bit ("d") registers. The change in SelectionDAGBuilder is needed to allow using bitcasts to convert between f64 (the default type for ARM "d" registers) and 64-bit Neon vector types. Radar 7457110. llvm-svn: 91649	2009-12-18 01:03:29 +00:00
Ken Dyck	df5561db78	Introduce EVT::getHalfSizedIntegerVT() for use in ExpandUnalignedStore() in LegalizeDAG.cpp. Unlike the code it replaces, which simply decrements the simple type by one, getHalfSizedIntegerVT() searches for the smallest simple integer type that is at least half the size of the type it is called on. This approach has the advantage that it will continue working if a new value type (such as i24) is added to MVT. Also, in preparation for new value types, remove the assertions that non-power-of-2 8-bit-mutiple types are Extended when legalizing extload and truncstore operations. llvm-svn: 91614	2009-12-17 20:09:43 +00:00
Bob Wilson	1c00b6964f	Fix a comment grammaro. llvm-svn: 91584	2009-12-17 05:07:36 +00:00
Evan Cheng	aadf060b92	Revert this dag combine change: Fold (zext (and x, cst)) -> (and (zext x), cst) DAG combiner likes to optimize expression in the other way so this would end up cause an infinite looping. llvm-svn: 91574	2009-12-17 00:40:05 +00:00
Daniel Dunbar	b827e52638	Reapply r91392, it was only unmasking the bug, and since TOT is still broken having it reverted does no good. llvm-svn: 91560	2009-12-16 20:10:05 +00:00
Daniel Dunbar	df45b70c1e	Revert "Initial work on disabling the scheduler. This is a work in progress, and this", this broke llvm-gcc bootstrap for release builds on x86_64-apple-darwin10. llvm-svn: 91533	2009-12-16 10:56:02 +00:00
Evan Cheng	852c486946	Make 91378 more conservative. 1. Only perform (zext (shl (zext x), y)) -> (shl (zext x), y) when y is a constant. This makes sure it remove at least one zest. 2. If the shift is a left shift, make sure the original shift cannot shift out bits. llvm-svn: 91399	2009-12-15 03:00:32 +00:00
Bill Wendling	07beddceb7	Initial work on disabling the scheduler. This is a work in progress, and this stuff isn't used just yet. We want to model the GCC `-fno-schedule-insns' and `-fno-schedule-insns2' flags. The hypothesis is that the people who use these flags know what they are doing, and have hand-optimized the C code to reduce latencies and other conflicts. The idea behind our scheme to turn off scheduling is to create a map "on the side" during DAG generation. It will order the nodes by how they appeared in the code. This map is then used during scheduling to get the ordering. llvm-svn: 91392	2009-12-15 01:54:51 +00:00
Evan Cheng	d1521ef40c	Fold (zext (and x, cst)) -> (and (zext x), cst). llvm-svn: 91380	2009-12-15 00:52:11 +00:00
Evan Cheng	ca7c690d3b	Propagate zest through logical shift. llvm-svn: 91378	2009-12-15 00:41:36 +00:00
Dan Gohman	cecad35728	Fix integer cast code to handle vector types. llvm-svn: 91362	2009-12-14 23:40:38 +00:00
Dan Gohman	6453a4e2ab	Fix this to properly clear the FastISel debug location. Thanks to Bill for spotting this! llvm-svn: 91355	2009-12-14 23:08:09 +00:00
Anton Korobeynikov	94b6310136	Fix weird typo which leads to unallocated memory access for nodes with 4 results. llvm-svn: 91233	2009-12-13 01:00:59 +00:00
Dan Gohman	619a78bd59	Delete an unnecessary line. The VTSDNode on a SIGN_EXTEND_REG is never a vector type. llvm-svn: 91181	2009-12-11 23:26:08 +00:00
Dan Gohman	1d459e4937	Implement vector widening, splitting, and scalarizing for SIGN_EXTEND_INREG. llvm-svn: 91158	2009-12-11 21:31:27 +00:00
Dan Gohman	6d306bb32b	Fix the result type of SELECT nodes lowered from Select instructions with aggregate return values. This fixes PR5754. llvm-svn: 91145	2009-12-11 19:50:50 +00:00
Evan Cheng	d938faff4b	Teach InferPtrAlignment to infer GV+cst alignment and use it to simplify x86 isl lowering code. llvm-svn: 90925	2009-12-09 01:53:58 +00:00
Evan Cheng	f5938d5d27	Move isConsecutiveLoad to SelectionDAG. It's not target dependent and it's primary used by selectdag passes. llvm-svn: 90922	2009-12-09 01:36:00 +00:00
Evan Cheng	2d412f0cb8	Infer alignment for non-fixed stack object. llvm-svn: 90919	2009-12-09 01:17:24 +00:00
Evan Cheng	1750009f38	Add const qualifier. llvm-svn: 90918	2009-12-09 01:10:37 +00:00
Evan Cheng	34a23ea371	Refactor InferAlignment out of DAGCombine. llvm-svn: 90917	2009-12-09 01:04:59 +00:00
Anton Korobeynikov	1bcece70bd	Truncate the arguments of llvm.frameaddress / llvm.returnaddress intrinsics from i32 to platform's largest native type llvm-svn: 90741	2009-12-07 02:28:26 +00:00
Dan Gohman	35f5646ef0	Remove old DBG_LABEL code. llvm-svn: 90669	2009-12-05 17:56:26 +00:00
Dan Gohman	6e7073b846	Remove the unused DisableLegalizeTypes option and related code. llvm-svn: 90668	2009-12-05 17:51:33 +00:00
Dan Gohman	c82272a7b6	Don't blindly set the debug location for PHI node copies. llvm-svn: 90637	2009-12-05 01:29:04 +00:00
Dan Gohman	18f94469dc	Make TargetSelectInstruction protected and called from FastISel.cpp instead of SelectionDAGISel.cpp. llvm-svn: 90636	2009-12-05 01:27:58 +00:00
Dan Gohman	02578a3805	The debug information for an LLVM Instruction applies to that Instruction and that Instruction only. Implement this by setting the "current debug position" back to Unknown after processing each instruction. llvm-svn: 90632	2009-12-05 00:27:08 +00:00
Duncan Sands	1602277b70	Add note about a subtle bug in this code. Does not effect the main architectures that LLVM targets, because they don't use this code. llvm-svn: 90564	2009-12-04 08:42:17 +00:00
Duncan Sands	bbd6b6ddf4	Fix ExpandShiftWithUnknownAmountBit, which was completely bogus. Pointed out by Javier Martinez (who also provided a patch). Since this logic is not used on (for example) x86, I guess nobody noticed. Tested by generating SHL, SRL, SRA on various choices of i64 for all possible shift amounts, and comparing with gcc. Since I did this on x86-32, I had to force the use of ExpandShiftWithUnknownAmountBit. What I'm saying here is that I don't have a testcase I can add to the repository. llvm-svn: 90482	2009-12-03 21:37:32 +00:00
Nate Begeman	9655f84662	Don't pull vector sext through both hands of a logical operation, since doing so prevents the fusion of vector sext and setcc into vsetcc. Add a testcase for the above transformation. Fix a bogus use of APInt noticed while tracking this down. llvm-svn: 90423	2009-12-03 07:11:29 +00:00
Jakob Stoklund Olesen	32042f9475	Don't call getValueType() on a null SDValue llvm-svn: 90415	2009-12-03 05:15:35 +00:00
Chris Lattner	a48f44d9ee	improve portability to avoid conflicting with std::next in c++'0x. Patch by Howard Hinnant! llvm-svn: 90365	2009-12-03 00:50:42 +00:00
Dan Gohman	b2ae02979f	Add edge source labels to SelectionDAG graphs, now that the graph printing framework omits differentiated edge sources in the case where the labels are empty strings. llvm-svn: 90254	2009-12-01 19:20:00 +00:00
Dan Gohman	8def6e3daf	Minor cleanups. llvm-svn: 90253	2009-12-01 19:16:15 +00:00
Dan Gohman	939c828604	Trim an unnecessary #include. llvm-svn: 90252	2009-12-01 19:13:27 +00:00
Tobias Grosser	9caf3801ca	Fix last DOTGraphTraits problems in CompilationGraph. llvm-svn: 90136	2009-11-30 13:34:51 +00:00
Tobias Grosser	dd7f2e797f	Remove ShortNames from getNodeLabel in DOTGraphTraits llvm-svn: 90134	2009-11-30 12:38:47 +00:00
Tobias Grosser	90d334032a	Instantiate DefaultDOTGraphTraits llvm-svn: 90133	2009-11-30 12:38:13 +00:00
Mon P Wang	32f8bb9ed4	Added support to allow clients to custom widen. For X86, custom widen vectors for divide/remainder since these operations can trap by unroll them and adding undefs for the resulting vector. llvm-svn: 90108	2009-11-30 02:42:02 +00:00
Dan Gohman	de5dea869f	Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used. Note that "hasDotLocAndDotFile"-style debug info was already broken; people wanting this functionality should implement it in the AsmPrinter/DwarfWriter code. llvm-svn: 89711	2009-11-23 23:20:51 +00:00
Dan Gohman	9d72cbf2d5	Move CopyCatchInfo into FunctionLoweringInfo.cpp too, for consistency. llvm-svn: 89683	2009-11-23 18:12:11 +00:00
Dan Gohman	1a6c47f1cb	Rename SelectionDAGLowering to SelectionDAGBuilder, and rename SelectionDAGBuild.cpp to SelectionDAGBuilder.cpp. llvm-svn: 89681	2009-11-23 18:04:58 +00:00
Dan Gohman	91aad4b834	Move RegsForValue to an anonymous namespace, since it is only used in this file. llvm-svn: 89675	2009-11-23 17:46:23 +00:00
Dan Gohman	ad97b3dbd0	Move some more code out of SelectionDAGBuild.cpp and into FunctionLoweringInfo.cpp. llvm-svn: 89674	2009-11-23 17:42:46 +00:00
Ted Kremenek	9b6515794f	Update CMake file. llvm-svn: 89671	2009-11-23 17:26:04 +00:00
Dan Gohman	a3624b6099	Move the FunctionLoweringInfo class and some related utility functions out of SelectionDAGBuild.h/cpp into its own files, to help separate general lowering logic from SelectionDAG-specific lowering logic. llvm-svn: 89667	2009-11-23 17:16:22 +00:00
Devang Patel	ed85e12da6	We are not using DBG_STOPPOINT anymore. llvm-svn: 89536	2009-11-21 02:46:55 +00:00
Dale Johannesen	b91eba382d	When generating a vector the really slow way, via loads and stores, handle the case where the element size is not a valid target type correctly (PPC). llvm-svn: 89521	2009-11-21 00:53:23 +00:00
Dan Gohman	7a6611793f	Target-independent support for TargetFlags on BlockAddress operands, and support for blockaddresses in x86-32 PIC mode. llvm-svn: 89506	2009-11-20 23:18:13 +00:00
Duncan Sands	cc0a0cb4b7	Fix PR5558, which was caused by a wrong fix for PR3393 (see commit 63048), which was an expensive checks failure due to a bug in the checking. This patch in essence reverts the original fix for PR3393, and refixes it by a tweak to the way expensive checking is done. llvm-svn: 89454	2009-11-20 10:45:10 +00:00
Dan Gohman	20c8ab655e	Fix fast-isel to avoid selecting the return instruction if a tail call has been encountered. llvm-svn: 89444	2009-11-20 02:51:26 +00:00
Dan Gohman	82e80019a5	Remove the optimizations that convert BRCOND and BR_CC into unconditional branches or fallthroghes. Instcombine/SimplifyCFG should be simplifying branches with known conditions. This fixes some problems caused by these transformations not updating the MachineBasicBlock CFG. llvm-svn: 89017	2009-11-17 00:47:23 +00:00
Dan Gohman	6b3f32e6d7	Fix a typo in a comment. llvm-svn: 88953	2009-11-16 20:35:59 +00:00
Dan Gohman	a627e26d39	Enable the tail call optimization when the caller returns undef. llvm-svn: 88737	2009-11-14 02:06:30 +00:00
Dan Gohman	f80dc08059	Don't let a noalias difference disrupt the tailcall optimization. llvm-svn: 88672	2009-11-13 18:49:38 +00:00
Dale Johannesen	5f4eecf961	Adjust isConstantSplat to allow for big-endian targets. PPC is such a target; make it work. llvm-svn: 87060	2009-11-13 01:45:18 +00:00
David Greene	1fbe054450	Add a bool flag to StackObjects telling whether they reference spill slots. The AsmPrinter will use this information to determine whether to print a spill/reload comment. Remove default argument values. It's too easy to pass a wrong argument value when multiple arguments have default values. Make everything explicit to trap bugs early. Update all targets to adhere to the new interfaces.. llvm-svn: 87022	2009-11-12 20:49:22 +00:00
Benjamin Kramer	68e4945c03	Add compare_lower and equals_lower methods to StringRef. Switch all users of StringsEqualNoCase (from StringExtras.h) to it. llvm-svn: 87020	2009-11-12 20:36:59 +00:00
Devang Patel	2904aa9f6e	"Attach debug info with llvm instructions" mode was enabled a month ago. Now make it permanent and remove old way of inserting intrinsics to encode debug info for line number and scopes. llvm-svn: 87014	2009-11-12 19:02:56 +00:00
Kenneth Uildriks	9f34406a90	x86 users can now return arbitrary sized structs. Structs too large to fit in return registers will be returned through a hidden sret parameter introduced during SelectionDAG construction. llvm-svn: 86876	2009-11-11 19:59:24 +00:00
Dale Johannesen	6f7d5b22bb	Emit correct code when making a ConstantPool entry for a vector constant whose component type is not a legal type for the target. (If the target ConstantPool cannot handle this type either, it has an opportunity to merge elements. In practice any target with 8-bit bytes must support i8 as data). 7320806 (partial). llvm-svn: 86751	2009-11-10 23:16:41 +00:00
Devang Patel	f6eeaebd76	Implement support to debug inlined functions. llvm-svn: 86748	2009-11-10 23:06:00 +00:00
Duncan Sands	dca0c28452	Codegen support for the llvm.invariant/lifetime.start/end intrinsics: just throw them away. llvm-svn: 86678	2009-11-10 09:08:09 +00:00
Dan Gohman	a951526510	Remove an unneeded #include. llvm-svn: 86601	2009-11-09 22:28:30 +00:00
Mike Stump	f04c4cdb27	Fix for 64-bit builds. llvm-svn: 86600	2009-11-09 22:28:21 +00:00
Evan Cheng	ad7c6124e7	Hide a couple of options. llvm-svn: 86522	2009-11-09 06:49:37 +00:00
Anton Korobeynikov	f93bb39b03	Add 8 bit libcalls and make use of them for msp430 llvm-svn: 86384	2009-11-07 17:14:39 +00:00
Chris Lattner	8e1d7222a7	Fix PR5421 by APInt'izing switch lowering. llvm-svn: 86354	2009-11-07 07:50:34 +00:00
Mon P Wang	fc032ced22	Fix memoizing of CvtRndSatSDNode llvm-svn: 86340	2009-11-07 04:46:25 +00:00
Kenneth Uildriks	07119737aa	Add code to check at SelectionDAGISel::LowerArguments time to see if return values can be lowered to registers. Coming soon, code to perform sret-demotion if return values cannot be lowered to registers llvm-svn: 86324	2009-11-07 02:11:54 +00:00
Dan Gohman	43bdc260d6	Avoid printing a redundant space in SDNode->dump(). llvm-svn: 86151	2009-11-05 18:49:11 +00:00
Dan Gohman	34341e69c4	Make -print-machineinstrs more readable. - Be consistent when referring to MachineBasicBlocks: BB#0. - Be consistent when referring to virtual registers: %reg1024. - Be consistent when referring to unknown physical registers: %physreg10. - Be consistent when referring to known physical registers: %RAX - Be consistent when referring to register 0: %reg0 - Be consistent when printing alignments: align=16 - Print jump table contents. - Don't print host addresses, in general. - and various other cleanups. llvm-svn: 85682	2009-10-31 20:19:03 +00:00
Dan Gohman	ba8735d25a	When discarding SrcValue information, discard all of it so that code that uses this information knows to behave conservatively. llvm-svn: 85654	2009-10-31 14:14:04 +00:00
Eric Christopher	a0ca9e944f	Fix warning with gcc-4.0 and signed/unsigned. llvm-svn: 85648	2009-10-31 09:24:35 +00:00
Dan Gohman	d814e32e57	Don't mark registers dead here when processing nodes with MVT::Flag results. This works around a problem affecting targets which rely on MVT::Flag to handle physical register defs. llvm-svn: 85638	2009-10-30 23:57:47 +00:00
Dan Gohman	6c9388011b	Initial target-independent CodeGen support for BlockAddresses. llvm-svn: 85556	2009-10-30 01:27:03 +00:00
Dan Gohman	05efd893db	Remove some unnecessary spaces in debug output. llvm-svn: 85536	2009-10-29 23:30:06 +00:00
Dan Gohman	554a75a973	Move some code from being emitted as boilerplate duplicated in every *ISelDAGToDAG.cpp to being regular code in SelectionDAGISel.cpp. llvm-svn: 85530	2009-10-29 22:30:23 +00:00
Dan Gohman	453d64c9f5	Rename usesCustomDAGSchedInserter to usesCustomInserter, and update a bunch of associated comments, because it doesn't have anything to do with DAGs or scheduling. This is another step in decoupling MachineInstr emitting from scheduling. llvm-svn: 85517	2009-10-29 18:10:34 +00:00
Eric Christopher	1fd4c577d2	Make sure we return the right sized type here. llvm-svn: 85436	2009-10-28 21:32:16 +00:00
Dan Gohman	14ca753e28	Don't call SDNode::isPredecessorOf when it isn't necessary. If the load's chains have no users, they can't be predecessors of the condition. llvm-svn: 85394	2009-10-28 15:28:02 +00:00
Dan Gohman	cd139c0373	Rewrite SelectionDAG::isPredecessorOf to be iterative instead of recursive to avoid consuming extraordinary amounts of stack space when processing tall graphs. llvm-svn: 85369	2009-10-28 03:44:30 +00:00
Evan Cheng	83896a59e1	Add a second ValueType argument to isFPImmLegal. llvm-svn: 85361	2009-10-28 01:43:28 +00:00
Dan Gohman	4b46cbfc23	Mark dead physregdefs dead immediately. This helps MachineSink and MachineLICM and other things which run before LiveVariables is run. llvm-svn: 85360	2009-10-28 01:13:53 +00:00
Chris Lattner	d04cb6d0fa	rename indbr -> indirectbr to appease the residents of #llvm. llvm-svn: 85351	2009-10-28 00:19:10 +00:00
Dan Gohman	a5e078b677	Update the MachineBasicBlock CFG for an indirect branch. llvm-svn: 85325	2009-10-27 22:10:34 +00:00
Dan Gohman	a4374e66f0	Add CodeGen support for indirect branches. llvm-svn: 85323	2009-10-27 21:56:26 +00:00
Chris Lattner	26076a8f10	don't use stdio llvm-svn: 85296	2009-10-27 20:42:54 +00:00
Evan Cheng	16993aa30b	Do away with addLegalFPImmediate. Add a target hook isFPImmLegal which returns true if the fp immediate can be natively codegened by target. llvm-svn: 85281	2009-10-27 19:56:55 +00:00
Chris Lattner	3ed871fe62	add enough support for indirect branch for the feature test to pass (assembler,asmprinter, bc reader+writer) and document it. Codegen currently aborts on it. llvm-svn: 85274	2009-10-27 19:13:16 +00:00
Chris Lattner	0997991252	pseudosourcevalue is also still using getGlobalContext(), so it isn't thread safe either. llvm-svn: 85253	2009-10-27 17:02:08 +00:00
Eric Christopher	7a50b280c1	Add objectsize intrinsic and hook it up through codegen. Doesn't do anything than return "I don't know" at the moment. llvm-svn: 85189	2009-10-27 00:52:25 +00:00
Victor Hernandez	de5ad42aa1	Remove FreeInst. Remove LowerAllocations pass. Update some more passes to treate free calls just like they were treating FreeInst. llvm-svn: 85176	2009-10-26 23:43:48 +00:00
Nick Lewycky	974e12b2d3	Remove includes of Support/Compiler.h that are no longer needed after the VISIBILITY_HIDDEN removal. llvm-svn: 85043	2009-10-25 06:57:41 +00:00
Nick Lewycky	02d5f77d26	Remove VISIBILITY_HIDDEN from class/struct found inside anonymous namespaces. Chris claims we should never have visibility_hidden inside any .cpp file but that's still not true even after this commit. llvm-svn: 85042	2009-10-25 06:33:48 +00:00
Dan Gohman	4ef112be62	APInt-ify the gep scaling code, so that it correctly handles the case where the scale overflows pointer-sized arithmetic. This fixes PR5281. llvm-svn: 84954	2009-10-23 17:57:43 +00:00
Anton Korobeynikov	8626367e38	Fix null pointer dereference. llvm-svn: 84806	2009-10-22 00:15:17 +00:00
Anton Korobeynikov	a6faf60831	Fix invalid for vector types fneg(bitconvert(x)) => bitconvert(x ^ sign) transform. llvm-svn: 84683	2009-10-20 21:37:45 +00:00
Evan Cheng	0e9d9ca855	-Revert parts of 84326 and 84411. Distinquishing between fixed and non-fixed stack slots and giving them different PseudoSourceValue's did not fix the problem of post-alloc scheduling miscompiling llvm itself. - Apply Dan's conservative workaround by assuming any non fixed stack slots can alias other memory locations. This means a load from spill slot #1 cannot move above a store of spill slot #2. - Enable post-alloc scheduling for x86 at optimization leverl Default and above. llvm-svn: 84424	2009-10-18 18:16:27 +00:00
Evan Cheng	0b8db2dab7	Only fixed stack objects and spill slots should be get FixedStack PseudoSourceValue. llvm-svn: 84411	2009-10-18 06:27:36 +00:00
Evan Cheng	8759585aba	Revert 84315 for now. Re-thinking the patch. llvm-svn: 84321	2009-10-17 07:53:04 +00:00
Evan Cheng	0818d87ed1	Rename getFixedStack to getStackObject. The stack objects represented are not necessarily fixed. Only those will negative frame indices are "fixed." llvm-svn: 84315	2009-10-17 06:22:26 +00:00
Evan Cheng	a6e4db8ff7	80 col violation. llvm-svn: 84311	2009-10-17 06:05:11 +00:00
Dan Gohman	650997fb0b	Delete an obsolete comment. llvm-svn: 84300	2009-10-17 01:37:38 +00:00
Victor Hernandez	a3aaf85e23	Remove MallocInst from LLVM Instructions. llvm-svn: 84299	2009-10-17 01:18:07 +00:00
Mon P Wang	b1baaf5ab9	Allow widening of extract subvector llvm-svn: 84279	2009-10-16 22:05:48 +00:00
Zhongxing Xu	47062ce503	Indent code. llvm-svn: 84247	2009-10-16 05:42:28 +00:00
Jakob Stoklund Olesen	e4197250cc	Report errors correctly for unselected target intrinsics. llvm-svn: 84193	2009-10-15 18:50:03 +00:00
Duncan Sands	8e6ccb65df	I don't see any point in having both eh.selector.i32 and eh.selector.i64, so get rid of eh.selector.i64 and rename eh.selector.i32 to eh.selector. Likewise for eh.typeid.for. This aligns us with gcc, which always uses a 32 bit value for the selector on all platforms. My understanding is that the register allocator used to assert if the selector intrinsic size didn't match the pointer size, and this was the reason for introducing the two variants. However my testing shows that this is no longer the case (I fixed some bugs in selector lowering yesterday, and some more today in the fastisel path; these might have caused the original problems). llvm-svn: 84106	2009-10-14 16:11:37 +00:00
Devang Patel	d7ebfe3963	s/DebugLoc.CompileUnit/DebugLoc.Scope/g s/DebugLoc.InlinedLoc/DebugLoc.InlinedAtLoc/g llvm-svn: 84054	2009-10-13 23:28:53 +00:00
Duncan Sands	18a956cb4a	Introduce new convenience methods for sign extending or truncating an SDValue (depending on whether the target type is bigger or smaller than the value's type); or zero extending or truncating it. Use it in a few places (this seems to be a popular operation, but I only modified cases of it in SelectionDAGBuild). In particular, the eh_selector lowering was doing this wrong due to a repeated rather than inverted test, fixed with this change. llvm-svn: 84027	2009-10-13 21:04:12 +00:00
Devang Patel	0af2a420cd	Set default location for a function if it is not set. llvm-svn: 83921	2009-10-12 23:10:55 +00:00
Nate Begeman	a3ed9edd40	More heuristics for Combiner-AA. Still catches all important cases, but compile time penalty on gnugo, the worst case in MultiSource, is down to about 2.5% from 30% llvm-svn: 83824	2009-10-12 05:53:58 +00:00
Dan Gohman	b8120770b4	Create a new InstrEmitter class for translating SelectionDAG nodes into MachineInstrs. This is mostly just moving the code from ScheduleDAGSDNodesEmit.cpp into a new class. This decouples MachineInstr emitting from scheduling. llvm-svn: 83699	2009-10-10 01:32:21 +00:00
Dan Gohman	a22f2d8614	Make getMachineNode return a MachineSDNode* instead of a generic SDNode* since it won't do any folding. This will help avoid some inconvenient casting. llvm-svn: 83698	2009-10-10 01:29:16 +00:00
Dan Gohman	918ec53c64	The ScheduleDAG framework now requires an AliasAnalysis argument, though it isn't needed in the ScheduleDAGSDNodes schedulers. llvm-svn: 83691	2009-10-09 23:33:48 +00:00
Devang Patel	df45c7f642	Extract scope information from the variable itself, instead of relying on alloca or llvm.dbg.declare location. While recording beginning of a function, use scope info from the first location entry instead of just relying on first location entry itself. llvm-svn: 83684	2009-10-09 22:42:28 +00:00
Bob Wilson	2a45a65511	Add a SelectionDAG getTargetInsertSubreg convenience function, similar to getTargetExtractSubreg. llvm-svn: 83564	2009-10-08 18:49:46 +00:00
Devang Patel	4598eb6214	Add support to handle debug info attached to an instruction. This is not yet enabled. llvm-svn: 83400	2009-10-06 18:37:31 +00:00
Devang Patel	bb802206d2	Set default location for the function if it is not already set. This code is not yet enabled. llvm-svn: 83349	2009-10-06 00:09:08 +00:00
Devang Patel	4dbca6dfd4	If location info is attached with an instruction then keep track of alloca slots used by a variable. This info will be used by AsmPrinter to emit debug info for variables. llvm-svn: 83189	2009-10-01 01:03:26 +00:00
Devang Patel	3256c751f5	Use MDNode * directly as an RecordSourceLine() argument. llvm-svn: 83182	2009-09-30 22:51:28 +00:00
Reid Kleckner	cea8dab1d1	Silence comparison always false warning in -Asserts mode. llvm-svn: 83164	2009-09-30 20:43:07 +00:00
Reid Kleckner	8ff5c19ebd	Fix integer overflow in instruction scheduling. This can happen if we have basic blocks that are so long that their size overflows a short. Also assert that overflow does not happen in the future, as requested by Evan. This fixes PR4401. llvm-svn: 83159	2009-09-30 20:15:38 +00:00
Devang Patel	5d58383ea9	Remove unnecessary cast. llvm-svn: 83100	2009-09-29 19:56:13 +00:00
Devang Patel	2d85eef974	s/class Metadata/class MetadataContext/g llvm-svn: 83019	2009-09-28 21:41:20 +00:00
Devang Patel	b1a4477f1f	Do not use global typedef for MDKindID. llvm-svn: 83016	2009-09-28 21:14:55 +00:00
Dan Gohman	6905f15256	Use VerifySchedule instead of doing the work manually. llvm-svn: 82995	2009-09-28 16:09:41 +00:00
Dan Gohman	832800aa6f	Convert comparisons like (x == infinity) to (x >= infinity) on targets where FCMP_OEQ is not legal and FCMP_OGE is, such as x86. llvm-svn: 82861	2009-09-26 15:24:17 +00:00
Dan Gohman	48b185d6f7	Improve MachineMemOperand handling. - Allocate MachineMemOperands and MachineMemOperand lists in MachineFunctions. This eliminates MachineInstr's std::list member and allows the data to be created by isel and live for the remainder of codegen, avoiding a lot of copying and unnecessary translation. This also shrinks MemSDNode. - Delete MemOperandSDNode. Introduce MachineSDNode which has dedicated fields for MachineMemOperands. - Change MemSDNode to have a MachineMemOperand member instead of its own fields with the same information. This introduces some redundancy, but it's more consistent with what MachineInstr will eventually want. - Ignore alignment when searching for redundant loads for CSE, but remember the greatest alignment. Target-specific code which previously used MemOperandSDNodes with generic SDNodes now use MemIntrinsicSDNodes, with opcodes in a designated range so that the SelectionDAG framework knows that MachineMemOperand information is available. llvm-svn: 82794	2009-09-25 20:36:54 +00:00
Dan Gohman	32f71d714b	Rename getTargetNode to getMachineNode, for consistency with the naming scheme used in SelectionDAG, where there are multiple kinds of "target" nodes, but "machine" nodes are nodes which represent a MachineInstr. llvm-svn: 82790	2009-09-25 18:54:59 +00:00
Dale Johannesen	a318d91a1e	Make sure sin, cos, sqrt calls are marked readonly before producing FSIN, FCOS, FSQRT. If they aren't so marked we have to assume they might set errno. llvm-svn: 82781	2009-09-25 18:00:35 +00:00
Dale Johannesen	c72134269f	Generate FSQRT from calls to the sqrt function, which allows appropriate backends to generate a sqrt instruction. On x86, this isn't done at -O0 because we go through FastISel instead. This is a behavior change from before this series of sqrt patches started. I think this is OK considering that compile speed is most important at -O0, but could be convinced otherwise. llvm-svn: 82778	2009-09-25 17:23:22 +00:00
Nate Begeman	18150d5abc	Fix combiner-aa issue with bases which are different, but can alias. Previously, it treated GV+28 GV+0 as different bases, and assumed they could not alias. llvm-svn: 82753	2009-09-25 06:05:26 +00:00
Dan Gohman	ebdfe4af62	Add a version of dumpr() that has a SelectionDAG* argument. llvm-svn: 82742	2009-09-25 00:34:34 +00:00
Dan Gohman	203d53ed79	Use getStoreSize() instead of getStoreSizeInBits()/8. llvm-svn: 82656	2009-09-23 21:07:02 +00:00
Dan Gohman	08c0a95ac6	Rename several variables from EVT to more descriptive names, now that EVT is also the name of their type, as declarations like "EVT EVT" look really odd. llvm-svn: 82654	2009-09-23 21:02:20 +00:00
Dan Gohman	c0353bfff5	Give MachineMemOperand an operator<<, factoring out code from two different places for printing MachineMemOperands. Drop the virtual from Value::dump and instead give Value a protected virtual hook that can be overridden by subclasses to implement custom printing. This lets printing be more consistent, and simplifies printing of PseudoSourceValue values. llvm-svn: 82599	2009-09-23 01:33:16 +00:00
Dan Gohman	e7c8242baa	Change MachineMemOperand's alignment value to be the alignment of the base pointer, without the offset. This matches MemSDNode's new alignment behavior, and holds more interesting information. llvm-svn: 82473	2009-09-21 19:47:04 +00:00
Chris Lattner	bb1a1bd2bd	tidy up llvm-svn: 82397	2009-09-20 17:32:21 +00:00
Daniel Dunbar	7d6781b0fe	Tabs -> spaces, and remove trailing whitespace. llvm-svn: 82355	2009-09-20 02:20:51 +00:00
Evan Cheng	9827ad39a7	Fix PR4926. When target hook EmitInstrWithCustomInserter() insert new basic blocks and update CFG, it should also inform sdisel of the changes so the phi source operands will come from the right basic blocks. llvm-svn: 82311	2009-09-19 09:51:03 +00:00
Evan Cheng	270d0f986f	Enhance EmitInstrWithCustomInserter() so target can specify CFG changes that sdisel will use to properly complete phi nodes. Not functionality change yet. llvm-svn: 82273	2009-09-18 21:02:19 +00:00
Chris Lattner	e133923abe	duncan points out the EH selector values are signed. llvm-svn: 82245	2009-09-18 18:34:29 +00:00
Evan Cheng	f4db6396e0	Revert r82214. It broke 403.gcc on x86_64 / Darwin. llvm-svn: 82215	2009-09-18 08:26:06 +00:00
Evan Cheng	6ba1931d60	Fix a bug in sdisel switch lowering code. When it updates the phi nodes in switch successor blocks, it can introduce multiple phi operands of the same value from different blocks (and may not be on the predecessor list). This can be seen on CodeGen/Generic/2006-09-06-SwitchLowering.ll. But it's not known to cause any real regression (but I have added an assertion for it now). llvm-svn: 82214	2009-09-18 08:16:04 +00:00
Chris Lattner	1bd81314e7	tolerate llvm.eh.selector.i64 on 32-bit systems and llvm.eh.selector.i32 on 64-bit systems. llvm-svn: 82180	2009-09-17 23:54:54 +00:00
Devang Patel	44b3a87f78	Fix typo. llvm-svn: 82080	2009-09-16 21:09:07 +00:00
Devang Patel	852c9b6627	At iSel time, update DebugLoc based on debug info attached with an instruction. llvm-svn: 82077	2009-09-16 20:39:11 +00:00
Nate Begeman	fbb88b180c	Do not add the SVOffset to the Node CSE ID. The same pointer argument cannot have different SVOffsets. llvm-svn: 81937	2009-09-15 22:30:11 +00:00
Nate Begeman	178135c88b	Better solution for tracking both the original alignment of the access, and the current alignment based on the source value offset. This avoids increasing the size of mem nodes. llvm-svn: 81897	2009-09-15 19:05:41 +00:00
Nate Begeman	d41f8fd2b3	Remove incorrect CSE code from r81813. llvm-svn: 81819	2009-09-15 00:38:09 +00:00
Nate Begeman	879d8f1c3e	Substantially speed up combiner-aa in the following ways: 1. Switch from an std::set to a SmallPtrSet for visited chain nodes. 2. Do not force the recursive flattening of token factor nodes, regardless of use count. 3. Immediately process newly created TokenFactor nodes. Also, improve combiner-aa by teaching it that loads to non-overlapping offsets of relatively aligned objects cannot alias. These changes result in a >5x speedup for combiner-aa on most testcases. llvm-svn: 81816	2009-09-15 00:18:30 +00:00
Nate Begeman	01c1e1152d	Teach the legalizer to propagate the original alignment of loads and store when it splits them. llvm-svn: 81815	2009-09-15 00:14:28 +00:00
Nate Begeman	02a685a914	Add an "original alignment" field to load and store nodes. This enables the DAG Combiner to disambiguate chains for loads and stores of types which are broken up by the Legalizer into smaller pieces. llvm-svn: 81813	2009-09-15 00:13:12 +00:00
Chris Lattner	0bad631cde	kill off the last use of TRI::AsmName. llvm-svn: 81727	2009-09-13 22:42:03 +00:00
Dan Gohman	9cbef32726	Make fast-isel try ISD::FNEG before resorting to bitcasts and xors. llvm-svn: 81493	2009-09-11 00:36:43 +00:00
Dan Gohman	89b090e51e	Reapply r81171 with a fix: don't try to use i64 when it isn't legal. llvm-svn: 81492	2009-09-11 00:34:46 +00:00
Bob Wilson	39f51320ca	Don't swap the operands of a subtraction when trying to create a post-decrement load/store. llvm-svn: 81464	2009-09-10 22:09:31 +00:00
Bob Wilson	59e4c84c6f	Revert r81171 which was causing pr4927. llvm-svn: 81415	2009-09-10 00:49:22 +00:00
Dan Gohman	16ad903fcf	When widening a vector load, use the correct chain. This fixes PR4891. llvm-svn: 81343	2009-09-09 14:22:57 +00:00
Chris Lattner	e819cfbc71	change selectiondag to add the sign extended versions of immediate operands to instructions instead of zero extended ones. This makes the asmprinter print signed values more consistently. This apparently only really affects the X86 backend. llvm-svn: 81265	2009-09-08 23:05:44 +00:00
Dan Gohman	f4a0f0f033	Fix an abort on a store of an empty struct member. getValue returns null in the case of an empty struct, so don't try to call getNumValues on it. llvm-svn: 81180	2009-09-08 01:44:02 +00:00
Dan Gohman	2512a42548	Fix a thinko: When lowering fneg with xor, bitcast the operands from floating-point to integer first, and bitcast the result back to floating-point. Previously, this test was passing by falling back to SelectionDAG lowering. The resulting code isn't as nice, but it's correct and CodeGen now stays on the fast path. llvm-svn: 81171	2009-09-07 23:47:14 +00:00
Duncan Sands	3ee3c174b1	Simplify. Testing shows that this is not equivalent to BBI = CR.CaseBB + 1. llvm-svn: 81124	2009-09-06 18:03:32 +00:00
Duncan Sands	89720bbd11	Remove some not-really-used variables, as warned about by icc (#593, partial). Patch by Erick Tryzelaar. llvm-svn: 81115	2009-09-06 12:41:19 +00:00
Duncan Sands	2fbeaf084f	Remove some unused variables and methods warned about by icc (#177, partial). Patch by Erick Tryzelaar. llvm-svn: 81106	2009-09-06 08:33:48 +00:00
Devang Patel	f03667e20e	Detect VLAs. Do not use DenseMap operator[] because it inserts new entry if lookup fails. Use find() to check an entry in a DenseMap first. llvm-svn: 81058	2009-09-05 00:34:14 +00:00
Dan Gohman	aa92dc1e61	LLVM currently represents floating-point negation as -0.0 - x. Fix FastISel to recognize this pattern and emit a floating-point negation using xor. llvm-svn: 80963	2009-09-03 22:53:57 +00:00
Dan Gohman	d0d5e685da	Recognize more opportunities to use SSE min and max instructions, swapping the operands if necessary. llvm-svn: 80940	2009-09-03 20:34:31 +00:00
Sandeep Patel	68c5f477fa	Retype from unsigned to CallingConv::ID accordingly. Approved by Bob Wilson. llvm-svn: 80773	2009-09-02 08:44:58 +00:00
Daniel Dunbar	f7a14aa43d	Remove Offset from ExternalSybmol MachineOperands, this is unused (and at least partly unsupported, in X86 encoding at least). llvm-svn: 80726	2009-09-01 22:06:46 +00:00
Devang Patel	80ae34974b	Reapply 79977. Use MDNodes to encode debug info in llvm IR. llvm-svn: 80406	2009-08-28 23:24:31 +00:00
Anton Korobeynikov	50509fc2cb	Add extload expansion for f128 llvm-svn: 80116	2009-08-26 17:39:40 +00:00
Devang Patel	f08e35d9dc	Revert 79977. It causes llvm-gcc bootstrap failures on some platforms. llvm-svn: 80073	2009-08-26 05:01:18 +00:00
Owen Anderson	3b1665eca5	Get rid of this horrible "benign race" by exploiting ManagedStatic to initialize the array on its first access. llvm-svn: 80040	2009-08-25 22:27:22 +00:00
Devang Patel	02aac922b4	Update DebugInfo interface to use metadata, instead of special named llvm.dbg.... global variables, to encode debugging information in llvm IR. This is mostly a mechanical change that tests metadata support very well. This change speeds up llvm-gcc by more then 6% at "-O0 -g" (measured by compiling InstructionCombining.cpp!) llvm-svn: 79977	2009-08-25 05:24:07 +00:00
Daniel Dunbar	34ee203337	Fix some refactos for iostream changes (in -Asserts mode). - The world needs better C++ refactoring tools, can I get an Amen!? llvm-svn: 79843	2009-08-23 08:50:52 +00:00
Chris Lattner	317dbbcfb1	eliminate uses of cerr() llvm-svn: 79834	2009-08-23 07:05:07 +00:00
Chris Lattner	4dc3edde9f	remove a few DOUTs here and there. llvm-svn: 79832	2009-08-23 06:35:02 +00:00
Chris Lattner	1362602eb2	Change Pass::print to take a raw ostream instead of std::ostream, update all code that this affects. llvm-svn: 79830	2009-08-23 06:03:38 +00:00
Eli Friedman	79ba8f2edc	Add check for completeness. Note that this doesn't actually have any effect with the way the current code is structured. llvm-svn: 79792	2009-08-23 00:14:19 +00:00
Chris Lattner	7b26fce23e	Rename TargetAsmInfo (and its subclasses) to MCAsmInfo. llvm-svn: 79763	2009-08-22 20:48:53 +00:00
Devang Patel	0939595711	Record variable debug info at ISel time directly. llvm-svn: 79742	2009-08-22 17:12:53 +00:00
Owen Anderson	63010bb65a	Reapply r79708 with the appropriate fix for the case that still requires locking. llvm-svn: 79731	2009-08-22 06:32:36 +00:00
Chris Lattner	56d60eaa61	revert r79708 + r79711 llvm-svn: 79720	2009-08-22 04:07:34 +00:00
Eric Christopher	677c2287da	Actually remove unused static. Previous commit removed trailing whitespace. llvm-svn: 79711	2009-08-22 00:41:47 +00:00
Eric Christopher	dfda92b76e	Remove unused static. llvm-svn: 79710	2009-08-22 00:40:45 +00:00
Owen Anderson	8e2456c254	Ease contention on this lock by noticing that all writes to the VTs array will be of (dynamically) constant values, so races on it are immaterial. We just need to ensure that at least one write has completed before return the pointer into it. With this change, parllc exhibits essentially no overhead on 403.gcc. llvm-svn: 79708	2009-08-22 00:29:12 +00:00
Bill Wendling	dff54eff8e	Fix typo. Should check both values of RangeUse for 0. Patch by Marius Wachtler. llvm-svn: 79649	2009-08-21 18:16:06 +00:00
Dan Gohman	ac33a9061d	Add an x86 peep that narrows TEST instructions to forms that use a smaller encoding. These kinds of patterns are very frequent in sqlite3, for example. llvm-svn: 79439	2009-08-19 18:16:17 +00:00
David Goodwin	9b48cd4899	Use the schedule itinerary operand use/def cycle information to adjust dependence edge latency for post-RA scheduling. llvm-svn: 79425	2009-08-19 16:08:58 +00:00
Eli Friedman	1e008c173a	PR4737: Fix a nasty bug in load narrowing with non-power-of-two types. llvm-svn: 79415	2009-08-19 08:46:10 +00:00
Dan Gohman	2fa67c9f70	Be tidy and use a break to exit from a switch block rather than just falling through the end. llvm-svn: 79383	2009-08-18 23:52:48 +00:00
Dan Gohman	4906f73a9f	Legalize the shift amount operand of SRL_PARTS, SHL_PARTS, and SRA_PARTS, as is done for SRL, SHL, and SRA. llvm-svn: 79380	2009-08-18 23:36:17 +00:00
Jim Grosbach	43bbb9de66	Remove a bit more cruft from the sjlj moving to a backend pass. llvm-svn: 79272	2009-08-17 20:25:04 +00:00
Jakob Stoklund Olesen	7f91fee62b	Be more clever about regclasses in ScheduleDAGSDNodes::EmitCopyFromReg. If two uses of a CopyFromReg want different regclasses, first try a common sub-class, then fall back on the copy emitted in AddRegisterOperand. There is no need for an assert here. The cross-class joiner usually cleans up nicely. llvm-svn: 79193	2009-08-16 17:40:59 +00:00
Evan Cheng	badf17cdc7	Needs to check whether unaligned load / store of i64 is legal here. llvm-svn: 79150	2009-08-15 23:41:42 +00:00
Benjamin Kramer	d2d5e716bd	Unbreak build. Evan, please make sure my changes are correct. llvm-svn: 79133	2009-08-15 20:46:16 +00:00
Evan Cheng	567f124305	80 col violations. llvm-svn: 79087	2009-08-15 08:38:52 +00:00
Dan Gohman	e8c913e657	Simplify this code to not depend as much on CurMBB. llvm-svn: 79068	2009-08-15 02:06:22 +00:00
Anton Korobeynikov	a6b3ce203a	Allow targets to specify their choice of calling conventions per libcall. Take advantage of this in the ARM backend to rectify broken choice of CC when hard float is in effect. PIC16 may want to see if it could be of use in MakePIC16Libcall, which works unchanged. Patch by Sandeep! llvm-svn: 79033	2009-08-14 20:10:52 +00:00
Evan Cheng	dc1869661b	Indentation change. llvm-svn: 78978	2009-08-14 01:56:37 +00:00
Owen Anderson	55f1c09e31	Push LLVMContexts through the IntegerType APIs. llvm-svn: 78948	2009-08-13 21:58:54 +00:00
David Goodwin	90e6b8b708	Add callback to allow target to adjust latency of schedule dependency edge. llvm-svn: 78910	2009-08-13 16:05:04 +00:00
Owen Anderson	117c9e8497	Add contexts to some of the MVT APIs. No functionality change yet, just the infrastructure work needed to get the contexts to where they need to be first. llvm-svn: 78759	2009-08-12 00:36:31 +00:00
Owen Anderson	c6daf8f17c	Fix warnings. llvm-svn: 78725	2009-08-11 21:59:30 +00:00
Owen Anderson	9f94459d24	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Dan Gohman	7c50c9bd63	Tidy #includes. llvm-svn: 78677	2009-08-11 16:02:12 +00:00
Jim Grosbach	693e36a3e8	SjLj based exception handling unwinding support. This patch is nasty, brutish and short. Well, it's kinda short. Definitely nasty and brutish. The front-end generates the register/unregister calls into the SjLj runtime, call-site indices and landing pad dispatch. The back end fills in the LSDA with the call-site information provided by the front end. Catch blocks are not yet implemented. Built on Darwin and verified no llvm-core "make check" regressions. llvm-svn: 78625	2009-08-11 00:09:57 +00:00
Dan Gohman	9d26c85bdc	Fix a bug in the DAGCombiner's handling of multiple linked MERGE_VALUES nodes. Replacing the result values with the operands in one MERGE_VALUES node may cause another MERGE_VALUES node be CSE'd with the first one, and bring its uses along, so that the first one isn't dead, as this code expects. Fix this by iterating until the node is really dead. This fixes PR4699. llvm-svn: 78619	2009-08-10 23:43:19 +00:00
Dan Gohman	733a64db57	Fix a bug where DAGCombine was producing an illegal ConstantFP node after legalize, and remove the workaround code from the ARM backend. llvm-svn: 78615	2009-08-10 23:15:10 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Owen Anderson	c30530d105	Start moving TargetLowering away from using full MVTs and towards SimpleValueType, which will simplify the privatization of IntegerType in the future. llvm-svn: 78584	2009-08-10 18:56:59 +00:00
Dan Gohman	b717091e69	Make this comment more closely reflect the code. llvm-svn: 78569	2009-08-10 16:50:32 +00:00
Jakob Stoklund Olesen	dc6bccbaa6	Don't build illegal ops in DAGCombiner::SimplifyBinOpWithSameOpcodeHands(). Blackfin supports and/or/xor on i32 but not on i16. Teach DAGCombiner::SimplifyBinOpWithSameOpcodeHands to not produce illegal nodes after legalize ops. llvm-svn: 78497	2009-08-08 20:42:17 +00:00
Dale Johannesen	352fa92995	Use stripPointerCasts instead of partially rewriting it. llvm-svn: 78350	2009-08-06 22:45:51 +00:00
Dan Gohman	695d811ad5	Add assertion checks after the calls to LowerFormalArguments, LowerCall, and LowerReturn, to verify that the targets' hooks have respected some of their postconditions. llvm-svn: 78312	2009-08-06 15:37:27 +00:00
Dan Gohman	ee902509a8	Remove an over-aggressive assert. Functions with empty struct return types don't have any return values, from CodeGen's perspective. This fixes PR4688. llvm-svn: 78311	2009-08-06 15:07:58 +00:00
Dan Gohman	5758e1e92a	Fix a few places in DAGCombiner that were creating all-ones-bits and high-bits values in ways that weren't correct for integer types wider than 64 bits. This fixes a miscompile in PPMacroExpansion.cpp in clang on x86-64. llvm-svn: 78295	2009-08-06 09:18:59 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Dan Gohman	15873a8ff7	Propogate the Depth argument when calling TLI.computeMaskedBitsForTargetNode from ComputeMaskedBits, since the former may call back into the latter. This fixes a major compile time problem on a testcase that happnened to hit this in a particularly bad way, PR4643. llvm-svn: 78023	2009-08-04 00:24:42 +00:00
Bob Wilson	5f6f72605b	Revert 77974. It breaks 3 of the ARM tests. llvm-svn: 77982	2009-08-03 19:06:29 +00:00
Sanjiv Gupta	9503900c60	Allow targets to custom handle softening of results or operands before trying the standard stuff. llvm-svn: 77974	2009-08-03 17:35:21 +00:00
Benjamin Kramer	c28b306423	llvm_report_error already prints "LLVM ERROR:". So stop reporting errors like "LLVM ERROR: llvm: error:" or "LLVM ERROR: ERROR:". llvm-svn: 77971	2009-08-03 13:33:33 +00:00
Dan Gohman	3f323847bc	Avoid forming a SELECT_CC in a type that the target doesn't support. This isn't immediately interesting, because Legalize ends up lowering SELECT_CC if the target doesn't support it, but this simplifies the process. Also, if the SELECT_CC would be expanded in Legalize, it can potentially end up with two copies of the condition expression. By leaving it as SELECT+SETCC, the SELECT can be expanded into two SELECTs that use a single SETCC. The two comparisons are usually CSE'd, but depending on when various expressions get legalized, the comparison expression could involve calls to library functions, such that the comparison expression may not be able to be CSE'd. This will be needed by a future patch. llvm-svn: 77896	2009-08-02 16:19:38 +00:00
Dan Gohman	3a9b9a59ea	Print the target flags as an int instead of a char, as they aren't actually characters. llvm-svn: 77794	2009-08-01 19:13:38 +00:00
Dan Gohman	859103d8e7	Delete a redundant variable. llvm-svn: 77774	2009-08-01 04:18:29 +00:00
Dan Gohman	7153692bdf	Minor code simplifications. llvm-svn: 77769	2009-08-01 03:51:09 +00:00
Dan Gohman	1987bf4561	SelectionDAGISel no longer needs to check hasAvailableExternallyLinkage, as it is now a MachineFunctionPass, and MachineFunctionPass now handles this. llvm-svn: 77760	2009-08-01 00:42:23 +00:00
Dan Gohman	10b8898ac0	SelectionDAGISel does not "preserve all", since it makes lots of changes to the MachineFunction. llvm-svn: 77753	2009-07-31 23:36:22 +00:00
Dan Gohman	dd3da92b4a	Use a range insert instead of an explicit loop. llvm-svn: 77752	2009-07-31 23:36:06 +00:00
Bob Wilson	84aa855ead	Allow target intrinsics that return multiple values, i.e., struct types, in SelectionDAGLowering::visitTargetIntrinsic. This removes a bit of special-case code for vector types. After staring at it for a while, I managed to convince myself that it is not necessary. The only case where TLI.getValueType() differs from MVT::getMVT is for iPTR, so this code could potentially make a difference for a vector of pointers. But, it looks like that is not supported. Calling TLI.getValueType() on a vector of pointers leads to the following sequence of calls: TargetLowering::getValueType MVT::getMVT MVT::getVectorVT(iPTR, num elements) MVT::getExtendedVectorVT MVT::getTypeForMVT for iPTR assertion fails "Type is not extended!" So, unless I'm really missing something, this bit of code is irrelevant to the current version of LLVM, which is consistent with the fact that I don't see this code in other similar places. llvm-svn: 77747	2009-07-31 22:41:21 +00:00
Owen Anderson	5a1acd9912	Move a few more APIs back to 2.5 forms. The only remaining ones left to change back are metadata related, which I'm waiting on to avoid conflicting with Devang. llvm-svn: 77721	2009-07-31 20:28:14 +00:00
Dan Gohman	5ea74d55ce	Reapply r77654 with a fix: MachineFunctionPass's getAnalysisUsage shouldn't do AU.setPreservesCFG(), because even though CodeGen passes don't modify the LLVM IR CFG, they may modify the MachineFunction CFG, and passes like MachineLoop are registered with isCFGOnly set to true. llvm-svn: 77691	2009-07-31 18:16:33 +00:00
Owen Anderson	23a204d91b	Move getTrue() and getFalse() to 2.5-like APIs. llvm-svn: 77685	2009-07-31 17:39:07 +00:00
Daniel Dunbar	5434756585	Revert r77654, it appears to be causing llvm-gcc bootstrap failures, and many failures when building assorted projects with clang. --- Reverse-merging r77654 into '.': U include/llvm/CodeGen/Passes.h U include/llvm/CodeGen/MachineFunctionPass.h U include/llvm/CodeGen/MachineFunction.h U include/llvm/CodeGen/LazyLiveness.h U include/llvm/CodeGen/SelectionDAGISel.h D include/llvm/CodeGen/MachineFunctionAnalysis.h U include/llvm/Function.h U lib/Target/CellSPU/SPUISelDAGToDAG.cpp U lib/Target/PowerPC/PPCISelDAGToDAG.cpp U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/MachineVerifier.cpp U lib/CodeGen/MachineFunction.cpp U lib/CodeGen/PrologEpilogInserter.cpp U lib/CodeGen/MachineLoopInfo.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp D lib/CodeGen/MachineFunctionAnalysis.cpp D lib/CodeGen/MachineFunctionPass.cpp U lib/CodeGen/LiveVariables.cpp llvm-svn: 77661	2009-07-31 03:02:41 +00:00
Dan Gohman	bcb44baa57	Manage MachineFunctions with an analysis Pass instead of the Annotable mechanism. To support this, make MachineFunctionPass a little more complete. llvm-svn: 77654	2009-07-31 01:52:50 +00:00
Owen Anderson	b292b8ce70	Move more code back to 2.5 APIs. llvm-svn: 77635	2009-07-30 23:03:37 +00:00
Sanjiv Gupta	a53e686d96	Allow targets to define libcall names for mem(cpy,set,move) intrinsics, rather than hardcoding them in DAG lowering. llvm-svn: 77586	2009-07-30 09:12:56 +00:00
Evan Cheng	e62288fdd4	Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch() and __sync_sub_and_fetch. When the return value is not used (i.e. only care about the value in the memory), x86 does not have to use add to implement these. Instead, it can use add, sub, inc, dec instructions with the "lock" prefix. This is currently implemented using a bit of instruction selection trick. The issue is the target independent pattern produces one output and a chain and we want to map it into one that just output a chain. The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. DAG combiner can then transform the node before it gets to target node selection. Problem #2 is we are adding a whole bunch of x86 atomic instructions when in fact these instructions are identical to the non-lock versions. We need a way to add target specific information to target nodes and have this information carried over to machine instructions. Asm printer (or JIT) can use this information to add the "lock" prefix. llvm-svn: 77582	2009-07-30 08:33:02 +00:00
Owen Anderson	4056ca9568	Move types back to the 2.5 API. llvm-svn: 77516	2009-07-29 22:17:13 +00:00
Chris Lattner	7667332899	inline the global 'getInstrOperandRegClass' function into its callers now that TargetOperandInfo does the heavy lifting. llvm-svn: 77508	2009-07-29 21:36:49 +00:00
Benjamin Kramer	21d75078b5	Remove now unused Context variables. llvm-svn: 77495	2009-07-29 19:14:17 +00:00
Owen Anderson	487375e9a2	Move ConstantExpr to 2.5 API. llvm-svn: 77494	2009-07-29 18:55:55 +00:00
Owen Anderson	4aa3295a65	Return ConstantVector to 2.5 API. llvm-svn: 77366	2009-07-28 21:19:26 +00:00
Owen Anderson	c2c7932c64	Change ConstantArray to 2.5 API. llvm-svn: 77347	2009-07-28 18:32:17 +00:00
Chris Lattner	5e693ed07b	Rip all of the global variable lowering logic out of TargetAsmInfo. Since it is highly specific to the object file that will be generated in the end, this introduces a new TargetLoweringObjectFile interface that is implemented for each of ELF/MachO/COFF/Alpha/PIC16 and XCore. Though still is still a brutal and ugly refactoring, this is a major step towards goodness. This patch also: 1. fixes a bunch of dangling pointer problems in the PIC16 backend. 2. disables the TargetLowering copy ctor which PIC16 was accidentally using. 3. gets us closer to xcore having its own crazy target section flags and pic16 not having to shadow sections with its own objects. 4. fixes wierdness where ELF targets would set CStringSection but not CStringSection_. Factor the code better. 5. fixes some bugs in string lowering on ELF targets. llvm-svn: 77294	2009-07-28 03:13:23 +00:00
Owen Anderson	69c464dec4	Move ConstantFP construction back to the 2.5-ish API. llvm-svn: 77247	2009-07-27 20:59:43 +00:00
Eli Friedman	65919b5058	Reorganize code a bit to reduce indentation. No visible functionality change. llvm-svn: 77171	2009-07-26 23:47:17 +00:00
Daniel Dunbar	ca414c7cae	Remove Value::getNameLen llvm-svn: 77148	2009-07-26 08:34:35 +00:00
Dan Gohman	1ddf98ad8e	Convert a few more things to use raw_ostream. llvm-svn: 77039	2009-07-25 01:43:01 +00:00
Daniel Dunbar	0dd5e1ed39	More migration to raw_ostream, the water has dried up around the iostream hole. - Some clients which used DOUT have moved to DEBUG. We are deprecating the "magic" DOUT behavior which avoided calling printing functions when the statement was disabled. In addition to being unnecessary magic, it had the downside of leaving code in -Asserts builds, and of hiding potentially unnecessary computations. llvm-svn: 77019	2009-07-25 00:23:56 +00:00
Owen Anderson	edb4a70325	Revert the ConstantInt constructors back to their 2.5 forms where possible, thanks to contexts-on-types. More to come. llvm-svn: 77011	2009-07-24 23:12:02 +00:00
Jakob Stoklund Olesen	1ae0736830	Add support for promoting SETCC operations. llvm-svn: 76987	2009-07-24 18:22:59 +00:00
Daniel Dunbar	796e43eede	Move more to raw_ostream, provide support for writing MachineBasicBlock, LiveInterval, etc to raw_ostream. llvm-svn: 76965	2009-07-24 10:36:58 +00:00
Daniel Dunbar	12368685d8	Switch to getNameStr(). llvm-svn: 76962	2009-07-24 08:24:36 +00:00
Chris Lattner	308c7896a4	"fix" PR4612, which is a crash on: %0 = malloc [3758096384 x i32] The "malloc" instruction doesn't support 64-bits correctly (see PR715), and should be removed. Victor is actively working on fixing this, in the meantime just don't crash. llvm-svn: 76899	2009-07-23 21:26:18 +00:00
Owen Anderson	47db941fd3	Get rid of the Pass+Context magic. llvm-svn: 76702	2009-07-22 00:24:57 +00:00
Eli Friedman	da9eda8ef6	Remove shift amount flavor. It isn't actually complete enough to be useful, and it's currently unused. (Some issues: it isn't actually rich enough to capture the semantics on many architectures, and semantics can vary depending on the type being shifted.) llvm-svn: 76633	2009-07-21 20:12:16 +00:00
Owen Anderson	c37bc69e91	Rename getConstantInt{True\|False} to get{True\|False} at Chris' behest. llvm-svn: 76598	2009-07-21 18:03:38 +00:00
Daniel Dunbar	5899e340f3	Simplify / normalize some uses of Value::getName. llvm-svn: 76553	2009-07-21 08:54:24 +00:00
Evan Cheng	a7bb55ebb6	Fix a dagga combiner bug: avoid creating illegal constant. Is this really a winning transformation? fold (shl (srl x, c1), c2) -> (shl (and x, (shl -1, c1)), (sub c2, c1)) or (srl (and x, (shl -1, c1)), (sub c1, c2)) llvm-svn: 76535	2009-07-21 05:40:15 +00:00
Owen Anderson	2ad52176f9	Move a bit more state over to the LLVMContext. llvm-svn: 76533	2009-07-21 02:47:59 +00:00
Dale Johannesen	ade297d496	Move stripping of bitcasts in inline asm arguments to a place where it affects everything. Occurs only on calls AFAIK. llvm-svn: 76502	2009-07-20 23:27:39 +00:00
Daniel Dunbar	ac0ca9241a	Fix some minor MSVC compiler warnings. llvm-svn: 76356	2009-07-19 01:38:38 +00:00
Eli Friedman	97f3f965eb	Make promotion in operation legalization for SETCC work correctly. llvm-svn: 76153	2009-07-17 05:16:04 +00:00
Jeffrey Yasskin	efad8e45fe	Add line numbers to OProfile. To do this, I added a processDebugLoc() call to the MachineCodeEmitter interface and made copying the start line of a function not conditional on whether we're emitting Dwarf debug information. I'll propagate the processDebugLoc() calls to the non-X86 targets in a followup patch. In the long run, it'll probably be better to gather this information through the DwarfWriter, but the DwarfWriter currently depends on the AsmPrinter and TargetAsmInfo, and fixing that would be out of the way for this patch. There's a bug in OProfile 0.9.4 that makes it ignore line numbers for addresses above 4G, and a patch fixing it at http://thread.gmane.org/gmane.linux.oprofile/7634 Sample output: $ sudo opcontrol --reset; sudo opcontrol --start-daemon; sudo opcontrol --start; `pwd`/Debug/bin/lli fib.bc; sudo opcontrol --stop Signalling daemon... done Profiler running. fib(40) == 165580141 Stopping profiling. $ opreport -g -d -l `pwd`/Debug/bin/lli\|head -60 Overflow stats not available CPU: Core 2, speed 1998 MHz (estimated) Counted CPU_CLK_UNHALTED events (Clock cycles when not halted) with a unit mask of 0x00 (Unhalted core cycles) count 100000 vma samples % linenr info image name symbol name 00007f67a30370b0 25489 61.2554 fib.c:24 10946.jo fib_left 00007f67a30370b0 1634 6.4106 fib.c:24 00007f67a30370b1 83 0.3256 fib.c:24 00007f67a30370b9 1997 7.8348 fib.c:24 00007f67a30370c6 2080 8.1604 fib.c:27 00007f67a30370c8 988 3.8762 fib.c:27 00007f67a30370cd 1315 5.1591 fib.c:27 00007f67a30370cf 251 0.9847 fib.c:27 00007f67a30370d3 1191 4.6726 fib.c:27 00007f67a30370d6 975 3.8252 fib.c:27 00007f67a30370db 1010 3.9625 fib.c:27 00007f67a30370dd 242 0.9494 fib.c:27 00007f67a30370e1 2782 10.9145 fib.c:28 00007f67a30370e5 3768 14.7828 fib.c:28 00007f67a30370eb 615 2.4128 (no location information) 00007f67a30370f3 6558 25.7287 (no location information) 00007f67a3037100 15603 37.4973 fib.c:29 10946.jo fib_right 00007f67a3037100 1646 10.5493 fib.c:29 00007f67a3037101 45 0.2884 fib.c:29 00007f67a3037109 2372 15.2022 fib.c:29 00007f67a3037116 2234 14.3178 fib.c:32 00007f67a3037118 612 3.9223 fib.c:32 00007f67a303711d 622 3.9864 fib.c:32 00007f67a303711f 385 2.4675 fib.c:32 00007f67a3037123 404 2.5892 fib.c:32 00007f67a3037126 634 4.0633 fib.c:32 00007f67a303712b 870 5.5759 fib.c:32 00007f67a303712d 62 0.3974 fib.c:32 00007f67a3037131 1848 11.8439 fib.c:33 00007f67a3037135 2840 18.2016 fib.c:33 00007f67a303713a 1 0.0064 fib.c:33 00007f67a303713b 1023 6.5564 (no location information) 00007f67a3037143 5 0.0320 (no location information) 000000000080c1e4 15 0.0360 MachineOperand.h:150 lli llvm::MachineOperand::isReg() const 000000000080c1e4 6 40.0000 MachineOperand.h:150 000000000080c1ec 2 13.3333 MachineOperand.h:150 ... llvm-svn: 76102	2009-07-16 21:07:26 +00:00
Owen Anderson	c277dc408b	Privatize the ConstantFP table. I'm on a roll! llvm-svn: 76097	2009-07-16 19:05:41 +00:00
Owen Anderson	20b34ac794	Move the ConstantInt uniquing table into LLVMContextImpl. This exposed a number of issues in our current context-passing stuff, which is also fixed here llvm-svn: 76089	2009-07-16 18:04:31 +00:00
Anton Korobeynikov	bbd751e410	Propagate return result extension type llvm-svn: 75925	2009-07-16 13:35:48 +00:00
Owen Anderson	f945a9ed07	Move a few more convenience factory functions from Constant to LLVMContext. llvm-svn: 75840	2009-07-15 21:51:10 +00:00
Ted Kremenek	39816d9157	Lexically order files in CMakeLists.txt files. llvm-svn: 75831	2009-07-15 21:08:16 +00:00
Owen Anderson	b6b2530000	Move EVER MORE stuff over to LLVMContext. llvm-svn: 75703	2009-07-14 23:09:55 +00:00
Torok Edwin	fbcc663cbf	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Owen Anderson	53a52215b5	Begin the painful process of tearing apart the rat'ss nest that is Constants.cpp and ConstantFold.cpp. This involves temporarily hard wiring some parts to use the global context. This isn't ideal, but it's the only way I could figure out to make this process vaguely incremental. llvm-svn: 75445	2009-07-13 04:09:18 +00:00
Chris Lattner	7b9d6ebb9c	remove llvm.part.set.* and llvm.part.select.*. They have never been implemented in codegen, have no frontend to generate them, and are better implemented with pattern matching (like the ppc backend does to generate rlwimi/rlwinm etc). PR4543 llvm-svn: 75430	2009-07-12 21:08:53 +00:00
Torok Edwin	08954aa4e1	Fix assert(0) conversion, as suggested by Chris. llvm-svn: 75423	2009-07-12 20:07:01 +00:00
Jakob Stoklund Olesen	ed0e1a0552	Implement support for promotion of AND/OR/XOR on integer types. The blackfin processor has a legal i16 type, but only logic operations on i32. llvm-svn: 75419	2009-07-12 18:10:18 +00:00
Jakob Stoklund Olesen	6b9f63cafa	Fix types in PromoteNode handling of CTPOP and friends. llvm-svn: 75418	2009-07-12 17:43:20 +00:00
Torok Edwin	56d0659726	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Torok Edwin	ccb29cd290	Convert more assert(0)+abort() -> LLVM_UNREACHABLE, and abort()/exit() -> llvm_report_error(). llvm-svn: 75363	2009-07-11 13:10:19 +00:00
Evan Cheng	ede2ce71aa	Fix up support for OptionalDefOperand when it defaults to an actual register def. I need this to get ready for major Thumb1 surgery. llvm-svn: 75328	2009-07-11 01:06:50 +00:00
Eli Friedman	106f2885d1	Use CreateStackStoreLoad helper in more places. llvm-svn: 75320	2009-07-11 00:11:07 +00:00
Bob Wilson	f76798769f	Fix an apparent copy-and-paste problem in an error message. llvm-svn: 75197	2009-07-09 23:42:59 +00:00
Eli Friedman	2b77eef160	Make EXTRACT_VECTOR_ELT a bit more flexible in terms of the returned value. Adjust other code to deal with that correctly. Make DAGTypeLegalizer::PromoteIntRes_EXTRACT_VECTOR_ELT take advantage of this new flexibility to simplify the code and make it deal with unusual vectors (like <4 x i1>) correctly. Fixes PR3037. llvm-svn: 75176	2009-07-09 22:01:03 +00:00
Owen Anderson	092bc51cdb	As Chris pointed out, we don't actually need to pass the context around here. llvm-svn: 75161	2009-07-09 18:44:09 +00:00
Owen Anderson	0504e0a222	Thread LLVMContext through MVT and related parts of SDISel. llvm-svn: 75153	2009-07-09 17:57:24 +00:00
Dan Gohman	6b04136756	Make SelectionDAG::getVectorShuffle work properly for VECTOR_SHUFFLE nodes with operand types that differ from the result type. (This doesn't normally happen right now, because SelectionDAGLowering::visitShuffleVector normalizes vector shuffles.) llvm-svn: 75081	2009-07-09 00:46:33 +00:00
David Goodwin	22c2fba978	Use common code for both ARM and Thumb-2 instruction and register info. llvm-svn: 75067	2009-07-08 23:10:31 +00:00
Duncan Sands	7dcc37b942	Nowadays vectors are only split if they have an even number of elements. Make some simplifications based on this (in particular SplitVecRes_SETCC). Tighten up some checking while there. llvm-svn: 75050	2009-07-08 21:34:03 +00:00
Duncan Sands	3f1e2409cc	Remove trailing whitespace. Reorder some methods and cases alphabetically. No functionality change. llvm-svn: 75001	2009-07-08 11:36:39 +00:00
Nick Lewycky	a21d3daadc	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Chris Lattner	4ac607332d	dag combine sext(setcc) -> vsetcc before legalize. To make this safe, VSETCC must define all bits, which is different than it was documented to before. Since all targets that implement VSETCC already have this behavior, and we don't optimize based on this, just change the documentation. We now get nice code for vec_compare.ll llvm-svn: 74978	2009-07-08 00:31:33 +00:00
Chris Lattner	f3989abdbf	SelectionDAG::SignBitIsZero doesn't work right for vectors, for now, conservatively return false. llvm-svn: 74969	2009-07-07 23:28:46 +00:00
Dale Johannesen	4e33115e5e	Operand of asm("call") (the callee function) is represented as "X" constraint and "P" modifier on x86. Make this work. (Change may not be sufficient to fix it for non-Darwin, but I'm pretty sure it won't break anything.) gcc.apple/asm-block-32.c gcc.apple/asm-block-33.c llvm-svn: 74967	2009-07-07 23:26:33 +00:00
Chris Lattner	fc74e8241a	add support for legalizing an icmp where the result is illegal (4xi1) but the input is legal (4 x i32) llvm-svn: 74964	2009-07-07 23:03:54 +00:00
Chris Lattner	f48f3be185	random code cleanups. llvm-svn: 74962	2009-07-07 22:49:15 +00:00
Chris Lattner	30220d8f98	implement support for spliting and scalarizing vector setcc's. This finishes off enough support for vector compares to get the icmp/fcmp version of 2008-07-23-VSetCC.ll passing. llvm-svn: 74961	2009-07-07 22:47:46 +00:00
Chris Lattner	f2af7f44e7	lower vector icmp/fcmp to ICMP/FCMP nodes with the right result (vector of bool). llvm-svn: 74960	2009-07-07 22:41:32 +00:00
Chris Lattner	119421421a	ScalarizeVecRes_ShiftOp and ScalarizeVecRes_BinOp are the same, eliminate the former. llvm-svn: 74959	2009-07-07 22:28:41 +00:00
Chris Lattner	cc1fed3111	add support for vector legalizing of *_EXTEND. llvm-svn: 74957	2009-07-07 22:27:17 +00:00
Owen Anderson	5c96ef7c4e	Have scoped mutexes take referenes instead of pointers. llvm-svn: 74931	2009-07-07 18:33:04 +00:00
Tilmann Scheller	aea6059ed4	Add NumFixedArgs attribute to CallSDNode which indicates the number of fixed arguments in a vararg call. With the SVR4 ABI on PowerPC, vector arguments for vararg calls are passed differently depending on whether they are a fixed or a variable argument. Variable vector arguments always go into memory, fixed vector arguments are put into vector registers. If there are no free vector registers available, fixed vector arguments are put on the stack. The NumFixedArgs attribute allows to decide for an argument in a vararg call whether it belongs to the fixed or variable portion of the parameter list. llvm-svn: 74764	2009-07-03 06:44:53 +00:00
Devang Patel	87127712b9	Simplify debug info intrisinc lowering. llvm-svn: 74733	2009-07-02 22:43:26 +00:00
Douglas Gregor	6141511621	CMake build fixes, from Xerxes Ranby llvm-svn: 74720	2009-07-02 18:53:52 +00:00
Devang Patel	6bab414f87	Simplify. llvm-svn: 74677	2009-07-02 00:28:03 +00:00
Devang Patel	846a5e4d3e	Simplify. No intentional functionality change. llvm-svn: 74673	2009-07-02 00:08:09 +00:00
Devang Patel	53d24bc7d6	Refactor. No functionality change. llvm-svn: 74659	2009-07-01 23:19:01 +00:00
Devang Patel	ea76e08645	llvm.dbg.declare is always used for local variable's debug info. llvm-svn: 74625	2009-07-01 18:51:07 +00:00
Evan Cheng	0dc101b897	Add a bit IsUndef to MachineOperand. This indicates the def / use register operand is defined by an implicit_def. That means it can def / use any register and passes (e.g. register scavenger) can feel free to ignore them. The register allocator, when it allocates a register to a virtual register defined by an implicit_def, can allocate any physical register without worrying about overlapping live ranges. It should mark all of operands of the said virtual register so later passes will do the right thing. This is not the best solution. But it should be a lot less fragile to having the scavenger try to track what is defined by implicit_def. llvm-svn: 74518	2009-06-30 08:49:04 +00:00
Chris Lattner	a4775f2b13	fix a typo that GCC should have caught that causes crashes with -view-*-dags llvm-svn: 74364	2009-06-27 00:57:02 +00:00
Chris Lattner	bc60c14c97	fix a really subtle bug in the cross section of aliases and TLS: the SelectionDAG::getGlobalAddress function properly looks through aliases to determine thread-localness, but then passes the GV* down to GlobalAddressSDNode::GlobalAddressSDNode which does not. Instead of passing down isTarget, just pass down the predetermined node opcode. This fixes some assertions with out of tree changes I'm working on. llvm-svn: 74325	2009-06-26 21:14:05 +00:00
Chris Lattner	7f82a19fbf	implement DOTGraphTraits<SelectionDAG*>::getNodeLabel in terms of SDNode::print_details to eliminate a ton of near-duplicate code. llvm-svn: 74311	2009-06-26 19:06:10 +00:00
Chris Lattner	68bb4e0e01	dot graph viewing is apparently not using SDNode::print_details, this is bad, but in the meantime lets print targetflags on node labels. llvm-svn: 74274	2009-06-26 05:55:43 +00:00
Chris Lattner	17dcba9da4	propagate target operand flags from dag nodes into MachineOperands. llvm-svn: 74273	2009-06-26 05:52:14 +00:00
Chris Lattner	54b8ebced6	fit in 80 cols llvm-svn: 74270	2009-06-26 05:39:02 +00:00
Chris Lattner	b3586b6e73	add targetflags to jump tables and constant pool entries. llvm-svn: 74204	2009-06-25 21:35:31 +00:00
Chris Lattner	8e34f98d72	allow setting target operand flags on TargetGlobalAddress nodes. llvm-svn: 74203	2009-06-25 21:21:14 +00:00
Chris Lattner	af5dbfc6f8	start bringing targetoperand flags into isel, first up, ExternalSymbol. llvm-svn: 74199	2009-06-25 18:45:50 +00:00
Owen Anderson	5defd5655e	Provide guards for this shared structure. I'm not sure this actually needs to be shared, but how/where to privatize it is not immediately clear to me. If any SelectionDAG experts see a better solution, please share! llvm-svn: 74180	2009-06-25 17:09:00 +00:00
David Greene	30048bdb63	This increases the maximum for MVT::LAST_VALUETYPE This change doubles the allowable value for MVT::LAST_VALUETYPE. It does this by doing several things. 1. Introduces MVT::MAX_ALLOWED_LAST_VALUETYPE which in this change has a value of 64. This value contains the current maximum for the MVT::LAST_VALUETYPE. 2. Instead of checking "MVT::LAST_VALUETYPE <= 32", all of those uses now become "MVT::LAST_VALUETYPE <= MVT::MAX_ALLOWED_LAST_VALUETYPE" 3. Changes the dimension of the ValueTypeActions from 2 elements to four elements and adds comments ahead of the declaration indicating the it is "(MVT::MAX_ALLOWED_LAST_VALUETYPE/32) * 2". This at least lets us find what is affected if and when MVT::MAX_ALLOWED_LAST_VALUETYPE gets changed. 4. Adds initializers for the new elements of ValueTypeActions. This does NOT add any types in MVT. That would be done separately. This doubles the size of ValueTypeActions from 64 bits to 128 bits and gives us the freedom to add more types for AVX. llvm-svn: 74110	2009-06-24 19:41:55 +00:00
Owen Anderson	b70adf2b92	Get rid of the global CFGOnly flag by threading a ShortNames parameters through the GraphViz rendering code. Update other uses in the codebase for this change. llvm-svn: 74084	2009-06-24 17:37:09 +00:00
Dale Johannesen	92c11e90c8	Rewrite 73900 per Duncan's suggestion. llvm-svn: 74082	2009-06-24 17:11:31 +00:00
Chris Lattner	3912036c25	remove dead makefile flags. llvm-svn: 74065	2009-06-24 05:29:56 +00:00
Dale Johannesen	315fb72d36	Fix memcpy expansion so it won't generate invalid types for the target (I think). This was breaking the PPC32 calling sequence. llvm-svn: 73900	2009-06-22 20:59:07 +00:00
Devang Patel	da10358c84	mv CodeGen/DebugLoc.h Support/DebugLoc.h llvm-svn: 73786	2009-06-19 22:08:58 +00:00
Eli Friedman	495d02f4a6	Minor cleanup; fixes review comments for a previous patch. Sorry for taking so long to get to this! llvm-svn: 73757	2009-06-19 06:01:55 +00:00
Sanjiv Gupta	bce3ca6ad9	Fixed names of libcalls checked in r73480. llvm-svn: 73483	2009-06-16 10:22:58 +00:00
Sanjiv Gupta	557ed09e0f	Added required libcalls for PIC16 (mostly floating points to integer casting operations). llvm-svn: 73480	2009-06-16 09:03:58 +00:00
Eli Friedman	abfad5d61e	Add some generic expansion logic for SMULO and UMULO. Fixes UMULO support for x86, and UMULO/SMULO for many architectures, including PPC (PR4201), ARM, and Cell. The resulting expansion isn't perfect, but it's not bad. llvm-svn: 73477	2009-06-16 06:58:29 +00:00
Dan Gohman	6e6808adaf	Change this from an assert to a cerr+exit, since it's diagnosing an unsupported inline asm construct, rather than verifying a code invariant. llvm-svn: 73435	2009-06-15 22:32:41 +00:00
Devang Patel	56e6fe1642	Gracefully handle imbalanced inline function begin and end markers. llvm-svn: 73426	2009-06-15 21:45:50 +00:00
Arnold Schwaighofer	cb9046cfc8	CheckTailCallReturnConstraints is missing a check on the incomming chain of the RETURN node. The incomming chain must be the outgoing chain of the CALL node. This causes the backend to identify tail calls that are not tail calls. This patch fixes this. llvm-svn: 73387	2009-06-15 14:43:36 +00:00
Eli Friedman	516479d6e7	Tweak the expansion code for BIT_CONVERT to generate better code converting from an MMX vector to an i64. llvm-svn: 73024	2009-06-07 09:41:57 +00:00
Eli Friedman	3234587213	Slightly generalize the code that handles shuffles of consecutive loads on x86 to handle more cases. Fix a bug in said code that would cause it to read past the end of an object. Rewrite the code in SelectionDAGLegalize::ExpandBUILD_VECTOR to be a bit more general. Remove PerformBuildVectorCombine, which is no longer necessary with these changes. In addition to simplifying the code, with this change, we can now catch a few more cases of consecutive loads. llvm-svn: 73012	2009-06-07 06:52:44 +00:00
Eli Friedman	c61e357aa6	Fix the expansion for CONCAT_VECTORS so that it doesn't create illegal types. llvm-svn: 72993	2009-06-06 07:08:26 +00:00
Eli Friedman	aee3f62b75	Factor out a couple of helpers. llvm-svn: 72992	2009-06-06 07:04:42 +00:00
Eli Friedman	aea9b65668	Make SINT_TO_FP/UINT_TO_FP vector legalization queries query on the integer type to be consistent with normal operation legalization. No visible change because nothing is actually using this at the moment. llvm-svn: 72980	2009-06-06 03:27:50 +00:00
Devang Patel	d1c7d34924	Add new function attribute - noimplicitfloat Update code generator to use this attribute and remove NoImplicitFloat target option. Update llc to set this attribute when -no-implicit-float command line option is used. llvm-svn: 72959	2009-06-05 21:57:13 +00:00
Nate Begeman	624690c6b2	Adapt the x86 build_vector dagcombine to the current state of the legalizer. build vectors with i64 elements will only appear on 32b x86 before legalize. Since vector widening occurs during legalize, and produces i64 build_vector elements, the dag combiner is never run on these before legalize splits them into 32b elements. Teach the build_vector dag combine in x86 back end to recognize consecutive loads producing the low part of the vector. Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes since that was required implicitly. Add a testcase for the transform. Old: subl $28, %esp movl 32(%esp), %eax movl 4(%eax), %ecx movl %ecx, 4(%esp) movl (%eax), %eax movl %eax, (%esp) movaps (%esp), %xmm0 pmovzxwd %xmm0, %xmm0 movl 36(%esp), %eax movaps %xmm0, (%eax) addl $28, %esp ret New: movl 4(%esp), %eax pmovzxwd (%eax), %xmm0 movl 8(%esp), %eax movaps %xmm0, (%eax) ret llvm-svn: 72957	2009-06-05 21:37:30 +00:00
Sanjiv Gupta	7925c5fd3f	Allow libcalls for i16 sdiv/udiv/rem operations. llvm-svn: 72941	2009-06-05 14:41:10 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Dale Johannesen	37bc85f89a	Fix FP_TO_UINT->i32 on ppc32 -mcpu=g5. This was using Promote which won't work because i64 isn't a legal type. It's easy enough to use Custom, but then we have the problem that when the type legalizer is promoting FP_TO_UINT->i16, it has no way of telling it should prefer FP_TO_SINT->i32 to FP_TO_UINT->i32. I have uncomfortably hacked this by making the type legalizer choose FP_TO_SINT when both are Custom. This fixes several regressions in the testsuite. llvm-svn: 72891	2009-06-04 20:53:52 +00:00
Dan Gohman	7b6b5dd954	Don't do the X * 0.0 -> 0.0 transformation in instcombine, because instcombine doesn't know when it's safe. To partially compensate for this, introduce new code to do this transformation in dagcombine, which can use UnsafeFPMath. llvm-svn: 72872	2009-06-04 17:12:12 +00:00
Dan Gohman	c2eed3b0f8	Fix comments. llvm-svn: 72870	2009-06-04 16:49:15 +00:00
Dale Johannesen	5234d3795f	Revert 72707 and 72709, for the moment. llvm-svn: 72712	2009-06-02 03:12:52 +00:00
Dale Johannesen	0b8ca79253	Make the implicit inputs and outputs of target-independent ADDC/ADDE use MVT::i1 (later, whatever it gets legalized to) instead of MVT::Flag. Remove CARRY_FALSE in favor of 0; adjust all target-independent code to use this format. Most targets will still produce a Flag-setting target-dependent version when selection is done. X86 is converted to use i32 instead, which means TableGen needs to produce different code in xxxGenDAGISel.inc. This keys off the new supportsHasI1 bit in xxxInstrInfo, currently set only for X86; in principle this is temporary and should go away when all other targets have been converted. All relevant X86 instruction patterns are modified to represent setting and using EFLAGS explicitly. The same can be done on other targets. The immediate behavior change is that an ADC/ADD pair are no longer tightly coupled in the X86 scheduler; they can be separated by instructions that don't clobber the flags (MOV). I will soon add some peephole optimizations based on using other instructions that set the flags to feed into ADC. llvm-svn: 72707	2009-06-01 23:27:20 +00:00
Duncan Sands	96e5698741	Rename CustomLowerResults to CustomLowerNode, since it is used both when a result is illegal and when an operand is illegal. llvm-svn: 72658	2009-05-31 04:15:38 +00:00
Bill Wendling	09f17a8479	Untabification. llvm-svn: 72604	2009-05-30 01:09:53 +00:00
Evan Cheng	86cdb4b345	Do not try to create a MVT type of width 0. llvm-svn: 72557	2009-05-28 23:52:18 +00:00
Eli Friedman	e1dc193f35	Re-commit r72514 and r72516 with a fixed version of BR_CC lowering. This patch removes some special cases for opcodes and does a bit of cleanup. llvm-svn: 72536	2009-05-28 20:40:34 +00:00
Evan Cheng	6673ff08fe	Incorporate patch feedbacks. llvm-svn: 72533	2009-05-28 18:41:02 +00:00
Bill Wendling	f193838d2b	Temporarily revert r72514 (and dependent patch r72516). It was causing this failure during llvm-gcc bootstrap: Assertion failed: (!Tmp2.getNode() && "Can't legalize BR_CC with legal condition!"), function ExpandNode, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp, line 2923. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/libgcc2.c:1727: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. llvm-svn: 72530	2009-05-28 18:18:59 +00:00
Eli Friedman	9b9df77260	Remove a couple of useless functions. llvm-svn: 72516	2009-05-28 04:49:34 +00:00
Eli Friedman	3aa278394e	Remove special cases for more opcodes. This is basically the end of this series of patches for LegalizeDAG; the remaining special cases can't be removed without more infrastructure work. There's a FIXME for each relevant opcode near the beginning of SelectionDAGLegalize::LegalizeOp. llvm-svn: 72514	2009-05-28 04:39:57 +00:00
Eli Friedman	5df7202d3b	Remove special case for SETCC opcode; add some comments explaining why some special cases are necessary. llvm-svn: 72511	2009-05-28 03:56:57 +00:00
Eli Friedman	e1bc3798e6	Some minor cleanups. llvm-svn: 72509	2009-05-28 03:06:16 +00:00
Evan Cheng	a9cda8abf2	Added optimization that narrow load / op / store and the 'op' is a bit twiddling instruction and its second operand is an immediate. If bits that are touched by 'op' can be done with a narrower instruction, reduce the width of the load and store as well. This happens a lot with bitfield manipulation code. e.g. orl $65536, 8(%rax) => orb $1, 10(%rax) Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization. llvm-svn: 72507	2009-05-28 00:35:15 +00:00
Eli Friedman	ed795153c7	Minor cleanups; add a better explanation for the issue with BUILD_VECTOR. llvm-svn: 72469	2009-05-27 12:42:55 +00:00
Eli Friedman	2892d82378	Remove more special cases for opcodes. llvm-svn: 72468	2009-05-27 12:20:41 +00:00
Eli Friedman	3b251705fd	Remove special cases for more opcodes. llvm-svn: 72467	2009-05-27 07:58:35 +00:00
Eli Friedman	0e49431422	Removing more special cases from LegalizeDAG. llvm-svn: 72465	2009-05-27 07:32:27 +00:00
Eli Friedman	568839681c	Eliminate more special cases for opcodes. llvm-svn: 72464	2009-05-27 07:05:37 +00:00
Eli Friedman	d6f2834496	Remove more special cases from LegalizeDAG. llvm-svn: 72456	2009-05-27 03:33:44 +00:00
Eli Friedman	b3554158c5	Remove unused argument. llvm-svn: 72455	2009-05-27 02:21:29 +00:00
Eli Friedman	a8f9a0261e	Remove more opcode special cases. llvm-svn: 72454	2009-05-27 02:16:40 +00:00
Eli Friedman	21d349b3c5	Start of refactoring LegalizeDAG so that we don't need specialized handling for every single opcode. llvm-svn: 72447	2009-05-27 01:25:56 +00:00
Eli Friedman	4a951bf2ad	Delete a bunch of dead code from LegalizeDAG. llvm-svn: 72414	2009-05-26 08:55:52 +00:00
Eli Friedman	ac149ee60a	Add a comment which should hopefully make the purpose of this method a bit clearer. llvm-svn: 72374	2009-05-24 20:32:10 +00:00
Eli Friedman	fd8b335ca4	Minor improvement to FCOPYSIGN to use BIT_CONVERT in cases where the corresponding integer type is legal. llvm-svn: 72373	2009-05-24 20:29:11 +00:00
Eli Friedman	fe87034cef	Rewrite ISD::FCOPYSIGN lowering to never use i64. Not really ideal, but it's late, and I don't have any better ideas at the moment. Fixes PR4257. llvm-svn: 72363	2009-05-24 10:21:20 +00:00
Eli Friedman	cd2e0cd297	Update for CMakeLists; untested, so tell me if there are issues. llvm-svn: 72360	2009-05-24 09:13:13 +00:00
Eli Friedman	a4e1675dac	Remove checks of getTypeAction from LegalizeOp; we already assert that all results and all operands are legal, so this change shouldn't affect behavior at all. llvm-svn: 72359	2009-05-24 08:42:01 +00:00
Eli Friedman	5e0d150689	Disable type legalization in LegalizeDAG. This leaves around 4000 lines of dead code; I'll clean that up in subsequent commits. llvm-svn: 72358	2009-05-24 02:46:31 +00:00
Eli Friedman	7badee92ad	Fix a bug in the expansion of EXTRACT_SUBVECTOR in ExpandExtractFromVectorThroughStack. llvm-svn: 72351	2009-05-23 23:03:28 +00:00
Eli Friedman	40afdb63ec	Add a proper implementation of EXTRACT_SUBVECTOR legalization that doesn't split legal vector operands. This is necessary because the type legalization (and therefore, vector splitting) code will be going away soon. llvm-svn: 72349	2009-05-23 22:37:25 +00:00
Torok Edwin	be6a9a151a	Fix PR4254. The DAGCombiner created a negative shiftamount, stored in an unsigned variable. Later the optimizer eliminated the shift entirely as being undefined. Example: (srl (shl X, 56) 48). ShiftAmt is 4294967288. Fix it by checking that the shiftamount is positive, and storing in a signed variable. llvm-svn: 72331	2009-05-23 17:29:48 +00:00
Eli Friedman	da90dd6d72	Add a new step to legalization to legalize vector math operations. This will allow simplifying LegalizeDAG to eliminate type legalization. (I have a patch to do that, but it's not quite finished; I'll commit it once it's finished and I've fixed any review comments for this patch.) See the comment at the beginning of lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp for more details on the motivation for this patch. llvm-svn: 72325	2009-05-23 12:35:30 +00:00
Duncan Sands	d6fb6501e3	Add a new codegen pass that normalizes dwarf exception handling code in preparation for code generation. The main thing it does is handle the case when eh.exception calls (and, in a future patch, eh.selector calls) are far away from landing pads. Right now in practice you only find eh.exception calls close to landing pads: either in a landing pad (the common case) or in a landing pad successor, due to loop passes shifting them about. However future exception handling improvements will result in calls far from landing pads: (1) Inlining of rewinds. Consider the following case: In function @f: ... invoke @g to label %normal unwind label %unwinds ... unwinds: %ex = call i8* @llvm.eh.exception() ... In function @g: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... "rethrow exception" Now inline @g into @f. Currently this is turned into: In function @f: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... invoke "rethrow exception" to label %normal unwind label %unwinds unwinds: %ex = call i8* @llvm.eh.exception() ... However we would like to simplify invoke of "rethrow exception" into a branch to the %unwinds label. Then %unwinds is no longer a landing pad, and the eh.exception call there is then far away from any landing pads. (2) Using the unwind instruction for cleanups. It would be nice to have codegen handle the following case: invoke @something to label %continue unwind label %run_cleanups ... handler: ... perform cleanups ... unwind This requires turning "unwind" into a library call, which necessarily takes a pointer to the exception as an argument (this patch also does this unwind lowering). But that means you are using eh.exception again far from a landing pad. (3) Bugpoint simplifications. When bugpoint is simplifying exception handling code it often generates eh.exception calls far from a landing pad, which then causes codegen to assert. Bugpoint then latches on to this assertion and loses sight of the original problem. Note that it is currently rare for this pass to actually do anything. And in fact it normally shouldn't do anything at all given the code coming out of llvm-gcc! But it does fire a few times in the testsuite. As far as I can see this is almost always due to the LoopStrengthReduce codegen pass introducing pointless loop preheader blocks which are landing pads and only contain a branch to another block. This other block contains an eh.exception call. So probably by tweaking LoopStrengthReduce a bit this can be avoided. llvm-svn: 72276	2009-05-22 20:36:31 +00:00
Jay Foad	7d0479f2c2	Use v.data() instead of &v[0] when SmallVector v might be empty. llvm-svn: 72210	2009-05-21 09:52:38 +00:00
Bill Wendling	f99bd3a82b	Temporarily revert r72191. It was causing an assert during llvm-gcc bootstrapping. llvm-svn: 72200	2009-05-21 00:04:55 +00:00
Argyrios Kyrtzidis	2b59a5fc6c	Introduce DebugScope which gets embedded into the machine instructions' DebugLoc. DebugScope refers to a debug region, function or block. llvm-svn: 72191	2009-05-20 22:57:17 +00:00
Eli Friedman	9030c35eb4	Fix for PR4235: to build a floating-point value from integer parts, build an integer and cast that to a float. This fixes a crash caused by trying to split an f32 into two f16's. This changes the behavior in test/CodeGen/XCore/fneg.ll because that testcase now triggers a DAGCombine which converts the fneg into an integer operation. If someone is interested, it's probably possible to tweak the test to generate an actual fneg. llvm-svn: 72162	2009-05-20 06:02:09 +00:00
Dan Gohman	d697a2dd8e	Remove the #ifndef NDEBUG from the FastISel debugging options. This fixes dejagnu tests that use these options. llvm-svn: 72094	2009-05-19 02:19:57 +00:00
Bill Wendling	d2dc9063d7	Revert last commit. It was wrong. llvm-svn: 72026	2009-05-18 18:21:03 +00:00
Bill Wendling	af7e400fda	Don't call RegionInlinedFnEnd if our optimization level isn't -O0. llvm-svn: 72024	2009-05-18 18:17:22 +00:00
Daniel Dunbar	a8c1658619	Silence Release-Asserts warnings. llvm-svn: 72011	2009-05-18 16:43:04 +00:00
Duncan Sands	83d008614f	Put back a bit of expensive checking logic that was overenthusiastically deleted in r70234. llvm-svn: 71926	2009-05-16 04:14:29 +00:00
Dan Gohman	d4f63052c4	Add an assert to turn a segfault on an unsupported inline asm construct into an assertion failure. llvm-svn: 71757	2009-05-14 00:30:16 +00:00
Jim Grosbach	4f915313ed	Removing the HasBuiltinSetjmp flag and associated bits. Flagging the presence of exception handling builtin sjlj targets in functions turns out not to be necessary. Marking the intrinsic implementation in the .td file as defining all registers is sufficient to get the context saved properly by the containing function. llvm-svn: 71743	2009-05-13 23:50:53 +00:00
Evan Cheng	ab0d23396a	Run code placement optimization for targets that want it (arm and x86 for now). llvm-svn: 71726	2009-05-13 21:42:09 +00:00
Jim Grosbach	aeca45dd6f	Add support for GCC compatible builtin setjmp and longjmp intrinsics. This is a supporting preliminary patch for GCC-compatible SjLJ exception handling. Note that these intrinsics are not designed to be invoked directly by the user, but rather used by the front-end as target hooks for exception handling. llvm-svn: 71610	2009-05-12 23:59:14 +00:00
Dan Gohman	9521cadff7	When scalarizing a vector BITCAST, check whether the operand has vector type, rather than assume that it does. If the operand is not vector, it shouldn't be run through ScalarizeVectorOp. This fixes one of the testcases in PR3886. llvm-svn: 71453	2009-05-11 18:30:42 +00:00
Bill Wendling	d6280534e4	--- Reverse-merging r71370 into '.': U lib/CodeGen/SelectionDAG/SelectionDAGBuild.cpp Revert r71370. llvm-svn: 71373	2009-05-10 00:10:50 +00:00
Bill Wendling	d53af35629	A debug function start was not being recorded when the optimization level wasn't None. However, we were always recording the region end. There's no longer a good reason for this code to be separated out between the different opt levels, as it was doing pretty much the same thing anyway. llvm-svn: 71370	2009-05-09 23:51:35 +00:00
Duncan Sands	af9eaa830a	Rename PaddedSize to AllocSize, in the hope that this will make it more obvious what it represents, and stop it being confused with the StoreSize. llvm-svn: 71349	2009-05-09 07:06:46 +00:00
Bill Wendling	8881780832	Mirror how Fast ISel determines if a region.end intrinsic is the end of an inlined function or the end of a function. Before, this was never executing the "inlined" version of the Record method. This will become important once the inlined Dwarf writer patch lands. llvm-svn: 71268	2009-05-08 21:14:49 +00:00
Anton Korobeynikov	65a58168cc	Factor out cycle-finder code and make it generic. llvm-svn: 71241	2009-05-08 18:51:58 +00:00
Anton Korobeynikov	c94dbf5ba0	Do not emit bit tests if target does not support natively left shift llvm-svn: 71240	2009-05-08 18:51:34 +00:00
Anton Korobeynikov	e7a9661f31	Properly expand libcalls for urem / srem. Also make code more straightforward. llvm-svn: 71238	2009-05-08 18:51:08 +00:00
Anton Korobeynikov	e2b78115d4	Typo llvm-svn: 71237	2009-05-08 18:50:54 +00:00
Dan Gohman	4bb6fa23cb	Revert 71165. It did more than just revert 71158 and it introduced several regressions. The problem due to 71158 is now fixed. llvm-svn: 71176	2009-05-07 19:46:24 +00:00
Bill Wendling	17f0f65499	Temporarily revert r71158. It was causing a failure during a full bootstrap: checking for bcopy... no checking for getc_unlocked... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decUtility.c:360: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decUtility.o] Error 1 make[4]: * Waiting for unfinished jobs.... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decNumber.c:5591: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decNumber.o] Error 1 make[3]: * [all-stage2-libdecnumber] Error 2 make[3]: *** Waiting for unfinished jobs.... llvm-svn: 71165	2009-05-07 17:26:14 +00:00
Argyrios Kyrtzidis	baf3fee885	Make DwarfWriter::RecordInlinedFnStart more like the other DwarfWriter's methods: -Have it return a label ID -Remove the unused Instruction parameter No functionality change. llvm-svn: 71132	2009-05-07 00:16:31 +00:00
Evan Cheng	cfc0513080	Do not use register as base ptr of pre- and post- inc/dec load / store nodes. llvm-svn: 71098	2009-05-06 18:25:01 +00:00
Duncan Sands	2338f6c57e	Add generic expansion of SUB when ADD and XOR are legal. Based on a patch by Micah Villmow. llvm-svn: 71078	2009-05-06 11:29:50 +00:00
Evan Cheng	1ff2727c95	Move getInstrOperandRegClass from the scheduler to TargetInstrInfo. llvm-svn: 70950	2009-05-05 00:30:09 +00:00
Chris Lattner	354b12259f	Make DBG_STOPPOINT nodes, and therefore DBG_LABEL labels, get a DebugLoc, so that it shows up in -print-machineinstrs. This doesn't appear to affect anything, but it was weird for some DBG_LABELs to have DebugLocs but not all of them. llvm-svn: 70921	2009-05-04 22:10:05 +00:00
Argyrios Kyrtzidis	9ae29b2d8f	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. (Comes with Regression-Be-Gone(tm)) llvm-svn: 70871	2009-05-04 16:23:49 +00:00
Argyrios Kyrtzidis	79be34012f	Revert r70803 for now, it causes a regression. llvm-svn: 70811	2009-05-03 23:27:19 +00:00
Argyrios Kyrtzidis	ce7196b903	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. llvm-svn: 70803	2009-05-03 22:03:35 +00:00
Anton Korobeynikov	2745bc92fa	Fix typo llvm-svn: 70770	2009-05-03 13:19:57 +00:00
Anton Korobeynikov	05b7a7c8f8	Properly handle sdiv / udiv / srem / urem libcalls llvm-svn: 70764	2009-05-03 13:18:16 +00:00
Anton Korobeynikov	399ad444fd	Proper name 16 bit libcalls llvm-svn: 70750	2009-05-03 13:14:08 +00:00
Anton Korobeynikov	f3fc92d6fc	Add libcall expansion for 16 and 128 bit muls llvm-svn: 70749	2009-05-03 13:13:51 +00:00
Argyrios Kyrtzidis	97324cec99	-Move the DwarfWriter::ValidDebugInfo check to a static DIDescriptor::ValidDebugInfo -Create DebugLocs without the need to have a DwarfWriter around llvm-svn: 70682	2009-05-03 08:50:41 +00:00
Bob Wilson	62a3124fb8	Allow CONCAT_VECTORS nodes to be legal or have custom lowering for some targets. Changes to take advantage of this will come later. llvm-svn: 70560	2009-05-01 17:55:32 +00:00
Argyrios Kyrtzidis	a5037484a4	Make DebugLoc independent of DwarfWriter. -Replace DebugLocTuple's Source ID with CompileUnit's GlobalVariable* -Remove DwarfWriter::getOrCreateSourceID -Make necessary changes for the above (fix callsites, etc.) llvm-svn: 70520	2009-04-30 23:22:31 +00:00
Jay Foad	fe0c648fee	Move helper functions for optimizing division by constant into the APInt class. llvm-svn: 70488	2009-04-30 10:15:35 +00:00
Chris Lattner	5ab42e93c4	fix a regression handling indirect results: these need to be considered memory operands otherwise the writebacks get lost when the inline asm doesn't otherwise have side effects. This fixes rdar://6839427, though clang really shouldn't generate these anymore. llvm-svn: 70455	2009-04-30 00:48:50 +00:00
Bill Wendling	026e5d7667	Instead of passing in an unsigned value for the optimization level, use an enum, which better identifies what the optimization is doing. And is more flexible for future uses. llvm-svn: 70440	2009-04-29 23:29:43 +00:00
Nate Begeman	7e6e352735	Fix infinite recursion in the C++ code which handles movddup by making it unnecessary. llvm-svn: 70425	2009-04-29 22:47:44 +00:00
Nate Begeman	39b59db245	Update comment, replace theoretically impossible check with an assert. llvm-svn: 70391	2009-04-29 18:13:31 +00:00
Nate Begeman	5f829d896d	Implement review feedback for vector shuffle work. llvm-svn: 70372	2009-04-29 05:20:52 +00:00
Sanjiv Gupta	ccd30945f9	Add a public method called getAddressSpace() to the GlobalAddressSDNode. llvm-svn: 70366	2009-04-29 04:43:24 +00:00
Chris Lattner	7d10386113	Disable the load-shrinking optimization from looking at anything larger than 64-bits, avoiding a crash. This should really be fixed to use APInts, though type legalization happens to help us out and we get good code on the attached testcase at least. This fixes rdar://6836460 llvm-svn: 70360	2009-04-29 03:45:07 +00:00
Bill Wendling	084669a1c9	Second attempt: Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'll change the JIT with a follow-up patch. llvm-svn: 70343	2009-04-29 00:15:41 +00:00
Jakob Stoklund Olesen	604248e81f	Move getSubRegisterRegClass from ScheduleDagSDNodesEmit.cpp to a TargetRegisterClass method. Also make the method non-asserting. It will return NULL when given an invalid subreg index. The method is needed by an upcoming patch. llvm-svn: 70296	2009-04-28 16:34:09 +00:00
Bill Wendling	56f2987a87	r70270 isn't ready yet. Back this out. Sorry for the noise. llvm-svn: 70275	2009-04-28 01:04:53 +00:00
Bill Wendling	d0ae15946c	Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'm not 100% sure if it's necessary to change it there... llvm-svn: 70270	2009-04-28 00:21:31 +00:00
Duncan Sands	bfa037705e	Now that PR2957 is resolved, remove a bunch of no-longer needed workarounds. llvm-svn: 70234	2009-04-27 19:33:03 +00:00
Nate Begeman	8d6d4b9289	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Dan Gohman	be36f5ccda	When transforming sext(trunc(load(x))) into sext(smaller load(x)), the trunc is directly replaced with the smaller load, so don't try to create a new sext node. This fixes PR4050. llvm-svn: 70179	2009-04-27 02:00:55 +00:00
Dan Gohman	fe9e1d5b59	Refactor the code to grab the low and high parts of a value using EXTRACT_ELEMENT into a utility function. llvm-svn: 70056	2009-04-25 17:55:53 +00:00
Dan Gohman	4539987920	Add a top-level comment about DAGCombiner's role in the compiler. llvm-svn: 70052	2009-04-25 17:09:45 +00:00
Dale Johannesen	56cb14c874	Fix PR 4057, a crash doing float->char const folding. This particular one is undefined behavior (although this isn't related to the crash), so it will no longer do it at compile time, which seems better. llvm-svn: 69990	2009-04-24 21:34:13 +00:00
Rafael Espindola	b93db668b3	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	bb881d66f4	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Dan Gohman	640a161c73	Instead of requiring TLI.LowerCallTo to return an ISD::BUILD_PAIR, use ISD::EXTRACT_ELEMENT. SelectionDAG has a special fast-path for the cast of an EXTRACT_ELEMENT with a BUILD_PAIR operand, for the common case. llvm-svn: 69948	2009-04-24 02:40:23 +00:00
Dan Gohman	9478c3f8e5	Factor out a bit of code that appears in several places into a utility function. llvm-svn: 69937	2009-04-23 23:13:24 +00:00
Dan Gohman	a290ab44e8	Handle Void types in ComputeValueVTs. This doesn't currently occur, but this change makes the code more general and easier to adapt for new purposes. llvm-svn: 69935	2009-04-23 22:50:03 +00:00
Dan Gohman	1addf64735	Make X86's copyRegToReg able to handle copies to and from subclasses. This makes the extra copyRegToReg calls in ScheduleDAGSDNodesEmit.cpp unnecessary. Derived from a patch by Jakob Stoklund Olesen. llvm-svn: 69635	2009-04-20 22:54:34 +00:00
Dan Gohman	e014b69919	Simplify this code. getConstant knows how to make broadcasted vector constants. llvm-svn: 69634	2009-04-20 22:51:43 +00:00
Bob Wilson	da188ebbbd	Revise my previous change 68996 as suggested by Duncan. llvm-svn: 69607	2009-04-20 17:27:09 +00:00
Duncan Sands	f2e7133d34	Now that BUILD_VECTOR operands are allowed to be bigger than the vector element type, turn checking of the operand type back on again, appropriately adjusted. llvm-svn: 69516	2009-04-19 06:40:30 +00:00
Chris Lattner	7b01e66443	Fix PR3898, which manifests as failures on are an Xcore, patch by Jakob Stoklund Olesen! llvm-svn: 69472	2009-04-18 20:48:07 +00:00
Duncan Sands	e4ff21ba4b	Don't try to make BUILD_VECTOR operands have the same type as the vector element type: allow them to be of a wider integer type than the element type all the way through the system, and not just as far as LegalizeDAG. This should be safe because it used to be this way (the old type legalizer would produce such nodes), so backends should be able to handle it. In fact only targets which have legal vector types with an illegal promoted element type will ever see this (eg: <4 x i16> on ppc). This fixes a regression with the new type legalizer (vec_splat.ll). Also, treat SCALAR_TO_VECTOR the same as BUILD_VECTOR. After all, it is just a special case of BUILD_VECTOR. llvm-svn: 69467	2009-04-18 20:16:54 +00:00
Dale Johannesen	ad968ee286	Inline asm's were still introducing bogus dependencies; my earlier patch to this code only fixed half of it. llvm-svn: 69408	2009-04-18 00:09:40 +00:00
Dan Gohman	eefba6bbe0	In the list-burr's pseudo two-addr dependency heuristics, don't add dependencies on nodes with exactly one successor which is a COPY_TO_REGCLASS node. In the case that the copy is coalesced away, the dependence should be on the user of the copy, rather than the copy itself. llvm-svn: 69309	2009-04-16 20:59:02 +00:00
Dan Gohman	3027bb6953	Handle SUBREG_TO_REG instructions with the same heuristics as INSERT_SUBREG instructions in the list-burr scheduler. llvm-svn: 69308	2009-04-16 20:57:10 +00:00
Devang Patel	dab01f3fd6	Do not treat beginning of inlined scope as beginning of normal function scope if the location info is missing. Insetad of doing ... if (inlined_subroutine && known_location) DW_TAG_inline_subroutine else DW_TAG_subprogram do if (inlined_subroutine) { if (known_location) DW_TAG_inline_subroutine } else { DW_TAG_subprogram } llvm-svn: 69300	2009-04-16 17:55:30 +00:00
Devang Patel	9ac4390bf4	Record line number at the beginning of a func.start. This line was accidently lost yesterday. llvm-svn: 69286	2009-04-16 15:07:09 +00:00
Devang Patel	653dee0884	In -fast mode do what FastISel does. This code could use some refactoring help! llvm-svn: 69254	2009-04-16 02:33:41 +00:00
Devang Patel	46b04e4d06	If FastISel is run and it has known DebugLoc then use it. llvm-svn: 69253	2009-04-16 01:33:10 +00:00
Devang Patel	43fc7e481b	If location where the function was inlined is not know then do not emit debug info describing inlinied region. llvm-svn: 69252	2009-04-16 01:31:54 +00:00
Devang Patel	2738d7312a	Add DISubprogram is not null check. This fixes test/CodeGen//2009-01-21-invalid-debug-info.m test case. llvm-svn: 69210	2009-04-15 20:11:08 +00:00
Dan Gohman	8aa28b9c34	Generalize one of the SelectionDAG::ReplaceAllUsesWith overloads to support replacing a node with another that has a superset of the result types. Use this instead of calling ReplaceAllUsesOfValueWith for each value. llvm-svn: 69209	2009-04-15 20:06:30 +00:00
Devang Patel	32d17a1a29	Construct and emit DW_TAG_inlined_subroutine DIEs for inlined subroutine scopes (only in FastISel mode). llvm-svn: 69116	2009-04-15 00:10:26 +00:00
Dan Gohman	e5cd1fcdb9	When the result of an EXTRACT_SUBREG, INSERT_SUBREG, or SUBREG_TO_REG operator is used by a CopyToReg to export the value to a different block, don't reuse the CopyToReg's register for the subreg operation result if the register isn't precisely the right class for the subreg operation. Also, rename the h-registers.ll test, now that there are more than one. llvm-svn: 69087	2009-04-14 22:17:14 +00:00
Dale Johannesen	83593f4167	Do not force asm's to be chained if they don't touch memory and aren't volatile. This was interfering with good scheduling. llvm-svn: 69008	2009-04-14 00:56:56 +00:00
Daniel Dunbar	097f630dad	Make these errors more noticable in build logs. llvm-svn: 68998	2009-04-13 22:26:09 +00:00
Bob Wilson	59dbbb2bb4	Change SelectionDAG type legalization to allow BUILD_VECTOR operands to be promoted to legal types without changing the type of the vector. This is following a suggestion from Duncan (http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019923.html). The transformation that used to be done during type legalization is now postponed to DAG legalization. This allows the BUILD_VECTORs to be optimized and potentially handled specially by target-specific code. It turns out that this is also consistent with an optimization done by the DAG combiner: a BUILD_VECTOR and INSERT_VECTOR_ELT may be combined by replacing one of the BUILD_VECTOR operands with the newly inserted element; but INSERT_VECTOR_ELT allows its scalar operand to be larger than the element type, with any extra high bits being implicitly truncated. The result is a BUILD_VECTOR where one of the operands has a type larger the the vector element type. Any code that operates on BUILD_VECTORs may now need to be aware of the potential type discrepancy between the vector element type and the BUILD_VECTOR operands. This patch updates all of the places that I could find to handle that case. llvm-svn: 68996	2009-04-13 22:05:19 +00:00
Dan Gohman	6c1426308c	Rename COPY_TO_SUBCLASS to COPY_TO_REGCLASS, and generalize it accordingly. Thanks to Jakob Stoklund Olesen for pointing out how this might be useful. llvm-svn: 68986	2009-04-13 21:06:25 +00:00
Bob Wilson	f6c2195383	Refactor some code in SelectionDAGLegalize::ExpandBUILD_VECTOR. llvm-svn: 68981	2009-04-13 20:20:30 +00:00
Devang Patel	0431504fb2	Right now, Debugging information to encode scopes (DW_TAG_lexical_block) relies on DBG_LABEL. Unfortunately this intefers with the quality of optimized code. This patch updates dwarf writer to encode scoping information in DWARF only in FastISel mode. llvm-svn: 68973	2009-04-13 18:13:16 +00:00
Devang Patel	80be3511ed	Reapply 68847. Now debug_inlined section is covered by TAI->doesDwarfUsesInlineInfoSection(), which is false by default. llvm-svn: 68964	2009-04-13 17:02:03 +00:00
Dan Gohman	60a446ab02	Add a new TargetInstrInfo MachineInstr opcode, COPY_TO_SUBCLASS. This will be used to replace things like X86's MOV32to32_. Enhance ScheduleDAGSDNodesEmit to be more flexible and robust in the presense of subregister superclasses and subclasses. It can now cope with the definition of a virtual register being in a subclass of a use. Re-introduce the code for recording register superreg classes and subreg classes. This is needed because when subreg extracts and inserts get coalesced away, the virtual registers are left in the correct subclass. llvm-svn: 68961	2009-04-13 15:38:05 +00:00
Chris Lattner	a101f6f8d3	make UpdateValueMap handle the possiblity that we could be copying into the right register, avoiding a copy. llvm-svn: 68889	2009-04-12 07:46:30 +00:00
Chris Lattner	ada5d6c37e	optimize FastISel::UpdateValueMap to avoid duplicate map lookups, and make it return the assigned register. llvm-svn: 68888	2009-04-12 07:45:01 +00:00
Dan Gohman	825236b116	Revert r68847. It breaks the build on non-Darwin targets, with this message from the assembler: Error: unknown pseudo-op: `.debug_inlined' llvm-svn: 68863	2009-04-11 15:57:04 +00:00
Devang Patel	790e60999e	Keep track of inlined functions and their locations. This information is collected when nested llvm.dbg.func.start intrinsics are seen. (Right now, inliner removes nested llvm.dbg.func.start intrinisics during inlining.) Create debug_inlined dwarf section using these information. This info is used by gdb, at least on Darwin, to enable better experience debugging inlined functions. See DwarfWriter.cpp for more information on structure of debug_inlined section. llvm-svn: 68847	2009-04-11 00:16:47 +00:00
Bob Wilson	f074ca7454	Clean up a bunch of whitespace issues and fix a comment typo. No functional changes. llvm-svn: 68808	2009-04-10 18:48:47 +00:00
Dan Gohman	e517ae4211	Now that register classes have names, include the name in debug output. llvm-svn: 68786	2009-04-10 15:59:38 +00:00
Dan Gohman	de912e2475	Remove the obsolete SelectionDAG::getNodeValueTypes and simplify code that uses it by using SelectionDAG::getVTList instead. llvm-svn: 68744	2009-04-09 23:54:40 +00:00
Devang Patel	a68bdef482	Silence unused variable warning. llvm-svn: 68735	2009-04-09 23:45:17 +00:00
Devang Patel	a2c2b85df4	llvm.dbg.func_start also defines beginning of function scope. llvm-svn: 68727	2009-04-09 21:42:11 +00:00
Dan Gohman	0e8d199f91	Generalize ExtendUsesToFormExtLoad to be usable for ANY_EXTEND, in addition to ZERO_EXTEND and SIGN_EXTEND. Fix a bug in the way it checked for live-out values, and simplify the way it find users by using SDNode::use_iterator's (relatively) new features. Also, make it slightly more permissive on targets with free truncates. In SelectionDAGBuild, avoid creating ANY_EXTEND nodes that are larger than necessary. If the target's SwitchAmountTy has enough bits, use it. This exposes the truncate to optimization early, enabling more optimizations. llvm-svn: 68670	2009-04-09 03:51:29 +00:00
Dan Gohman	e6db8ca5eb	Don't copy the operand of a SwitchInst into virtual registers as eagerly. This helps avoid CopyToReg nodes in some cases where they aren't needed, and also helps subsequent optimizer heuristics in cases where the extra nodes would cause the node to appear to have multiple results. This doesn't have a significant impact currently; it'll help an upcoming change. llvm-svn: 68667	2009-04-09 02:33:36 +00:00
Duncan Sands	5a82613db0	Soft float support for FREM. llvm-svn: 68614	2009-04-08 16:20:57 +00:00
Duncan Sands	fb438caac6	Soft float support for undef. Reported by Xerxes Rånby. llvm-svn: 68607	2009-04-08 13:33:37 +00:00
Dan Gohman	ad3e549a53	Implement support for using modeling implicit-zero-extension on x86-64 with SUBREG_TO_REG, teach SimpleRegisterCoalescing to coalesce SUBREG_TO_REG instructions (which are similar to INSERT_SUBREG instructions), and teach the DAGCombiner to take advantage of this on targets which support it. This eliminates many redundant zero-extension operations on x86-64. This adds a new TargetLowering hook, isZExtFree. It's similar to isTruncateFree, except it only applies to actual definitions, and not no-op truncates which may not zero the high bits. Also, this adds a new optimization to SimplifyDemandedBits: transform operations like x+y into (zext (add (trunc x), (trunc y))) on targets where all the casts are no-ops. In contexts where the high part of the add is explicitly masked off, this allows the mask operation to be eliminated. Fix the DAGCombiner to avoid undoing these transformations to eliminate casts on targets where the casts are no-ops. Also, this adds a new two-address lowering heuristic. Since two-address lowering runs before coalescing, it helps to be able to look through copies when deciding whether commuting and/or three-address conversion are profitable. Also, fix a bug in LiveInterval::MergeInClobberRanges. It didn't handle the case that a clobber range extended both before and beyond an existing live range. In that case, multiple live ranges need to be added. This was exposed by the new subreg coalescing code. Remove 2008-05-06-SpillerBug.ll. It was bugpoint-reduced, and the spiller behavior it was looking for no longer occurrs with the new instruction selection. llvm-svn: 68576	2009-04-08 00:15:30 +00:00
Devang Patel	10f7c3deb7	Revert prev. patch for now. llvm-svn: 68569	2009-04-07 23:00:04 +00:00
Devang Patel	ddafc03e41	Right now DBG_LABEL are required for llvm.dbg.region_start and llvm.dbg.region_end in non-fast mode also. llvm-svn: 68559	2009-04-07 22:27:56 +00:00
Dan Gohman	ca93aabeba	Don't attempt to handle aggregate argument values in FastISel; let SelectionDAG do those. This fixes PR3955. llvm-svn: 68546	2009-04-07 20:40:11 +00:00
Dan Gohman	8bff8a1e87	Fix a TargetLowering optimization so that it doesn't duplicate loads when an input node has multiple uses. llvm-svn: 68398	2009-04-03 20:11:30 +00:00
Dan Gohman	b425feb2aa	Delete ISD::INSERT_SUBREG and ISD::EXTRACT_SUBREG, which are unused. Note that these are distinct from TargetInstrInfo::INSERT_SUBREG and TargetInstrInfo::EXTRACT_SUBREG, which are used. llvm-svn: 68355	2009-04-03 00:25:26 +00:00
Sanjiv Gupta	cc841a3810	To convert the StopPoint insn into an assembler directive by ISel, we need to have access to the line number field. So we convert that info as an operand by custom handling DBG_STOPPOINT in legalize. llvm-svn: 68329	2009-04-02 18:03:10 +00:00
Evan Cheng	0d551591ea	Fully general expansion of integer shift of any size. llvm-svn: 68134	2009-03-31 19:39:24 +00:00
Dan Gohman	d51f196ff5	Minor top-level comment fix. llvm-svn: 68113	2009-03-31 16:51:18 +00:00
Dan Gohman	97a20b8dbf	Fix live-out reg logic to not insert over-aggressive AssertZExt instructions. This fixes lua. llvm-svn: 68083	2009-03-31 01:38:29 +00:00
Duncan Sands	d21581eaa1	Fix PR3899: add support for extracting floats from vectors when using -soft-float. Based on a patch by Jakob Stoklund Olesen. llvm-svn: 67996	2009-03-29 13:51:06 +00:00
Arnold Schwaighofer	e622cbf385	Make check in CheckTailCallReturnConstraints for ignorable instructions between a CALL and a RET node more generic. Add a test for tail calls with a void return. llvm-svn: 67943	2009-03-28 12:36:29 +00:00
Arnold Schwaighofer	83d5420d02	Enable tail call optimization for functions that return a struct (bug 3664) and for functions that return types that need extending (e.g i1). llvm-svn: 67934	2009-03-28 08:33:27 +00:00
Evan Cheng	fd81c73cde	Optimize some 64-bit multiplication by constants into two lea's or one lea + shl since imulq is slow (latency 5). e.g. x * 40 => shlq $3, %rdi leaq (%rdi,%rdi,4), %rax This has the added benefit of allowing more multiply to be folded into addressing mode. e.g. a * 24 + b => leaq (%rdi,%rdi,2), %rax leaq (%rsi,%rax,8), %rax llvm-svn: 67917	2009-03-28 05:57:29 +00:00
Dan Gohman	2785e4be37	Fix what surely must be a copy+pasto. llvm-svn: 67881	2009-03-27 23:55:04 +00:00
Dan Gohman	6d75876473	Initialize LiveOutInfo's APInt members to zero, as APInt's default constructor produces an uninitialized APInt. This fixes PR3896. llvm-svn: 67879	2009-03-27 23:51:02 +00:00
Bill Wendling	aa28be652c	Pull transform from target-dependent code into target-independent code. llvm-svn: 67742	2009-03-26 06:14:09 +00:00
Evan Cheng	2e9f42bed5	Revert 67132. This is breaking some objective-c apps. Also fixes SDISel so it does not force promote return value if the function is not marked signext / zeroext. llvm-svn: 67701	2009-03-25 20:20:11 +00:00
Dale Johannesen	eb1646d28c	When optimizing with debug info, don't keep the stoppoint nodes around until Legalize; doing this imposed an ordering on a sequence of loads that came from different lines, interfering with scheduling. llvm-svn: 67692	2009-03-25 17:36:08 +00:00
Chris Lattner	c35847e109	more tidying: name the components of PhysReg in the case when the target constraint specifies a specific physreg. llvm-svn: 67618	2009-03-24 15:27:37 +00:00
Chris Lattner	42eceb3491	Tidy a bit more. llvm-svn: 67617	2009-03-24 15:25:07 +00:00
Chris Lattner	246eda43bd	simplify this code a bit now that "allocation to a vreg class" can never fail. llvm-svn: 67616	2009-03-24 15:22:11 +00:00
Dan Gohman	f3746cbc56	Minor compile-time optimization; don't bother checking canClobberPhysRegDefs if the successor node doesn't clobber any physical registers. llvm-svn: 67587	2009-03-24 00:50:07 +00:00
Dan Gohman	9a658d72db	Add a pre-pass to the burr-list scheduler which makes adjustments to help out the register pressure reduction heuristics in the case of nodes with multiple uses. Currently this uses very conservative heuristics, so it doesn't have a broad impact, but in cases where it does help it can make a big difference. llvm-svn: 67586	2009-03-24 00:49:12 +00:00
Dan Gohman	ed0e8d44ce	When unfolding a load during scheduling, the new operator node has a data dependency on the load node, so it really needs a data-dependence edge to the load node, even if the load previously existed. And add a few comments. llvm-svn: 67554	2009-03-23 20:20:43 +00:00
Dan Gohman	f477262e69	Don't set SUnit::hasPhysRegDefs to true unless the defs are actually have uses, which reflects the way it's used. llvm-svn: 67540	2009-03-23 17:39:36 +00:00
Dan Gohman	a366da1bf7	Fix canClobberPhysRegDefs to check all SDNodes grouped together in an SUnit, instead of just the first one. This fix is needed by some upcoming scheduler changes. llvm-svn: 67531	2009-03-23 16:23:01 +00:00
Dan Gohman	52c278e54d	Add a new bit to SUnit to record whether a node has implicit physreg defs, regardless of whether they are actually used. llvm-svn: 67528	2009-03-23 16:10:52 +00:00
Dan Gohman	4f2fea1a21	Now that errs() is properly non-buffered, there's no need to explicitly flush it. llvm-svn: 67526	2009-03-23 15:57:19 +00:00
Evan Cheng	968c3b0d6e	Model inline asm constraint which ties an input to an output register as machine operand TIED_TO constraint. This eliminated the need to pre-allocate registers for these. This also allows register allocator can eliminate the unneeded copies. llvm-svn: 67512	2009-03-23 08:01:15 +00:00
Dan Gohman	3bdc4bdba6	Simplify this code; use a while instead of an if and a do-while. llvm-svn: 67400	2009-03-20 20:42:23 +00:00
Evan Cheng	2e55923fba	For inline asm output operand that matches an input. Encode the input operand index in the high bits. llvm-svn: 67387	2009-03-20 18:03:34 +00:00
Sanjiv Gupta	e9759c458c	Fixed the comment. No functionality change. llvm-svn: 67370	2009-03-20 09:38:50 +00:00
Mon P Wang	32c8074be6	Added missing support for widening when splitting an unary op (PR3683) and expanding a bit convert (PR3711). In both cases, we extract the valid part of the widen vector and then do the conversion. llvm-svn: 67175	2009-03-18 06:24:04 +00:00
Rafael Espindola	4606b12108	Don't force promotion of return arguments on the callee. Some architectures (like x86) don't require it. This fixes bug 3779. llvm-svn: 67132	2009-03-17 23:43:59 +00:00
Chris Lattner	2363d0b8b9	Fix codegen to compute the size of an allocation by multiplying the size by the array amount as an i32 value instead of promoting from i32 to i64 then doing the multiply. Not doing this broke wrap-around assumptions that the optimizers (validly) made. The ultimate real fix for this is to introduce i64 version of alloca and remove mallocinst. This fixes PR3829 llvm-svn: 67093	2009-03-17 19:36:00 +00:00
Mon P Wang	523c0852c6	Fix a problem with DAGCombine where we were building an illegal build vector shuffle mask. Forced the mask to be built using i32. Note: this will be irrelevant once vector_shuffle no longer takes a build vector for the shuffle mask. llvm-svn: 67076	2009-03-17 06:33:10 +00:00
Mon P Wang	c86715631c	Avoid doing the transformation c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4 if FPConstant is legal because if the FPConstant doesn't need to be stored in a constant pool, the transformation is unlikely to be profitable. llvm-svn: 66994	2009-03-14 00:25:19 +00:00
Dan Gohman	a62e4ab690	Improve FastISel's handling of truncates to i1, and implement ptrtoint and inttoptr in X86FastISel. These casts aren't always handled in the generic FastISel code because X86 sometimes needs custom code to do truncation and zero-extension. llvm-svn: 66988	2009-03-13 23:53:06 +00:00
Dan Gohman	c0bb959591	Fix FastISel's assumption that i1 values are always zero-extended by inserting explicit zero extensions where necessary. Included is a testcase where SelectionDAG produces a virtual register holding an i1 value which FastISel previously mistakenly assumed to be zero-extended. llvm-svn: 66941	2009-03-13 20:42:20 +00:00
Evan Cheng	1fb8aedd1e	Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues. 1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants. 2. MachineConstantPool alignment field is also a log2 value. 3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values. 4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries. 5. Asm printer uses expensive data structure multimap to track constant pool entries by sections. 6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic. Solutions: 1. ConstantPoolSDNode alignment field is changed to keep non-log2 value. 2. MachineConstantPool alignment field is also changed to keep non-log2 value. 3. Functions that create ConstantPool nodes are passing in non-log2 alignments. 4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT. 5. Asm printer uses cheaper data structure to group constant pool entries. 6. Asm printer compute entry offsets after grouping is done. 7. Change JIT code to compute entry offsets on the fly. llvm-svn: 66875	2009-03-13 07:51:59 +00:00
Bill Wendling	fa54bc2052	Oops...I committed too much. llvm-svn: 66867	2009-03-13 04:39:26 +00:00
Bill Wendling	b02eadf660	Temporarily XFAIL this test. llvm-svn: 66866	2009-03-13 04:37:11 +00:00
Dan Gohman	a19c662a83	Fix a typo in a comment. llvm-svn: 66843	2009-03-12 23:55:10 +00:00
Chris Lattner	4147f08e44	Move 3 "(add (select cc, 0, c), x) -> (select cc, x, (add, x, c))" related transformations out of target-specific dag combine into the ARM backend. These were added by Evan in r37685 with no testcases and only seems to help ARM (e.g. test/CodeGen/ARM/select_xform.ll). Add some simple X86-specific (for now) DAG combines that turn things like cond ? 8 : 0 -> (zext(cond) << 3). This happens frequently with the recently added cp constant select optimization, but is a very general xform. For example, we now compile the second example in const-select.ll to: _test: movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 seta %al movzbl %al, %eax movl 4(%esp), %ecx movsbl (%ecx,%eax,4), %eax ret instead of: _test: movl 4(%esp), %eax leal 4(%eax), %ecx movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 cmovbe %eax, %ecx movsbl (%ecx), %eax ret This passes multisource and dejagnu. llvm-svn: 66779	2009-03-12 06:52:53 +00:00
Evan Cheng	4465954638	Enable Chris' value propagation change. It make available known sign, zero, one bits information for values that are live out of basic blocks. The goal is to eliminate unnecessary sext, zext, truncate of values that are live-in to blocks. This does not handle PHI nodes yet. llvm-svn: 66777	2009-03-12 06:29:49 +00:00
Chris Lattner	43d6377f89	reapply my previous patch (r66358) with a tweak to set the alignment of the generated constant pool entry to the desired alignment of a type. If we don't do this, we end up trying to do movsd from 4-byte alignment memory. This fixes 450.soplex and 456.hmmer. llvm-svn: 66641	2009-03-11 05:08:08 +00:00
Evan Cheng	aa887653f4	Revert 66358 for now. It's breaking povray, 450.soplex, and 456.hmmer on x86 / Darwin. llvm-svn: 66574	2009-03-10 20:47:18 +00:00
Chris Lattner	4249b9a698	Fix PR3763 by using proper APInt methods instead of uint64_t's. llvm-svn: 66434	2009-03-09 20:22:18 +00:00
Bill Wendling	c6869f4695	Pass in a std::string when getting the names of debugging things. This cuts down on the number of times a std::string is created and copied. llvm-svn: 66396	2009-03-09 05:04:40 +00:00
Chris Lattner	ab5a443144	implement an optimization to codegen c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4. For 2009-03-07-FPConstSelect.ll we now produce: _f: xorl %eax, %eax testl %edi, %edi movl $4, %ecx cmovne %rax, %rcx leaq LCPI1_0(%rip), %rax movss (%rcx,%rax), %xmm0 ret previously we produced: _f: subl $4, %esp cmpl $0, 8(%esp) movss LCPI1_0, %xmm0 je LBB1_2 ## entry LBB1_1: ## entry movss LCPI1_1, %xmm0 LBB1_2: ## entry movss %xmm0, (%esp) flds (%esp) addl $4, %esp ret on PPC the code also improves to: _f: cntlzw r2, r3 srwi r2, r2, 5 li r3, lo16(LCPI1_0) slwi r2, r2, 2 addis r3, r3, ha16(LCPI1_0) lfsx f1, r3, r2 blr from: _f: li r2, lo16(LCPI1_1) cmplwi cr0, r3, 0 addis r2, r2, ha16(LCPI1_1) beq cr0, LBB1_2 ; entry LBB1_1: ; entry li r2, lo16(LCPI1_0) addis r2, r2, ha16(LCPI1_0) LBB1_2: ; entry lfs f1, 0(r2) blr This also improves the existing pic-cpool case from: foo: subl $12, %esp call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax cmpl $0, 16(%esp) movsd .LCPI1_0@GOTOFF(%eax), %xmm0 je .LBB1_2 # entry .LBB1_1: # entry movsd .LCPI1_1@GOTOFF(%eax), %xmm0 .LBB1_2: # entry movsd %xmm0, (%esp) fldl (%esp) addl $12, %esp ret to: foo: call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax xorl %ecx, %ecx cmpl $0, 4(%esp) movl $8, %edx cmovne %ecx, %edx fldl .LCPI1_0@GOTOFF(%eax,%edx) ret This triggers a few dozen times in spec FP 2000. llvm-svn: 66358	2009-03-08 01:51:30 +00:00
Chris Lattner	21cf4bf235	random cleanups. llvm-svn: 66357	2009-03-08 01:47:41 +00:00
Duncan Sands	12da8ce3d2	Introduce new linkage types linkonce_odr, weak_odr, common_odr and extern_weak_odr. These are the same as the non-odr versions, except that they indicate that the global will only be overridden by an equivalent global. In C, a function with weak linkage can be overridden by a function which behaves completely differently. This means that IP passes have to skip weak functions, since any deductions made from the function definition might be wrong, since the definition could be replaced by something completely different at link time. This is not allowed in C++, thanks to the ODR (One-Definition-Rule): if a function is replaced by another at link-time, then the new function must be the same as the original function. If a language knows that a function or other global can only be overridden by an equivalent global, it can give it the weak_odr linkage type, and the optimizers will understand that it is alright to make deductions based on the function body. The code generators on the other hand map weak and weak_odr linkage to the same thing. llvm-svn: 66339	2009-03-07 15:45:40 +00:00
Dan Gohman	15af5524a4	Fix ScheduleDAGRRList::CopyAndMoveSuccessors' handling of nodes with multiple chain operands. This can occur when the scheduler has added chain operands to a node that already has a chain operand, in order to handle physical register dependencies. This fixes an llvm-gcc bootstrap failure on x86-64 introduced in r66058. llvm-svn: 66240	2009-03-06 02:23:01 +00:00
Bob Wilson	5b15d01ff3	Fix BuildVectorSDNode::isConstantSplat to handle one-element vectors. It is an error to call APInt::zext with a size that is equal to the value's current size, so use zextOrTrunc instead. llvm-svn: 66039	2009-03-04 17:47:01 +00:00
Eli Friedman	7604d37723	PR3686: make the legalizer handle bitcast from i80 to x86 long double. llvm-svn: 66021	2009-03-04 06:23:34 +00:00
Evan Cheng	b8905c4e2c	Fix PR3701. 1. X86 target renamed eflags register to flags. This matches what llvm-gcc generates so codegen knows flags register is being clobbered by inline asm. 2. BURR scheduler should also check if inline asm nodes can clobber "live" physical registers. Previously it was only checking target nodes with implicit defs. llvm-svn: 65996	2009-03-04 01:41:49 +00:00
Bill Wendling	6d2714738f	The DAG combiner was performing a BT combine. The BT combine had a value of -1, so it changed it into a 31 via the TLO.ShrinkDemandedConstant() call. Then it would go through the DAG combiner again. This time it had a value of 31, which was turned into a -1 by TLI.SimplifyDemandedBits(). This would ping pong forever. Teach the TLO.ShrinkDemandedConstant() call not to lower a value if the demanded value is an XOR of all ones. llvm-svn: 65985	2009-03-04 00:18:06 +00:00
Bob Wilson	85cefe8567	Generalize BuildVectorSDNode::isConstantSplat to use APInts and handle arbitrary vector sizes. Add an optional MinSplatBits parameter to specify a minimum for the splat element size. Update the PPC target to use the revised interface. llvm-svn: 65899	2009-03-02 23:24:16 +00:00
Nate Begeman	a9e981225e	Fix a problem with DAGCombine on 64b targets where folding extracts + build_vector into a shuffle would fail, because the type of the new build_vector would not be legal. Try harder to create a legal build_vector type. Note: this will be totally irrelevant once vector_shuffle no longer takes a build_vector for shuffle mask. New: _foo: xorps %xmm0, %xmm0 xorps %xmm1, %xmm1 subps %xmm1, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 Old: _foo: xorps %xmm0, %xmm0 movss %xmm0, %xmm1 xorps %xmm2, %xmm2 unpcklps %xmm1, %xmm2 pshufd $80, %xmm1, %xmm1 unpcklps %xmm1, %xmm2 pslldq $16, %xmm2 pshufd $57, %xmm2, %xmm1 subps %xmm0, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 llvm-svn: 65791	2009-03-01 23:44:07 +00:00
Bob Wilson	d8ea0e144e	Combine PPC's GetConstantBuildVectorBits and isConstantSplat functions to a new method in a BuildVectorSDNode "pseudo-class". llvm-svn: 65747	2009-03-01 01:13:55 +00:00
Rafael Espindola	000421eade	Refactor TLS code and add some tests. The tests and expected results are: pic \| declaration \| linkage \| visibility \| !pic \| declaration \| external \| default \| tls1.ll tls2.ll \| local exec pic \| declaration \| external \| default \| tls1-pic.ll tls2-pic.ll \| general dynamic !pic \| !declaration \| external \| default \| tls3.ll tls4.ll \| initial exec pic \| !declaration \| external \| default \| tls3-pic.ll tls4-pic.ll \| general dynamic !pic \| declaration \| external \| hidden \| tls7.ll tls8.ll \| local exec pic \| declaration \| external \| hidden \| X \| local dynamic !pic \| !declaration \| external \| hidden \| tls9.ll tls10.ll \| local exec pic \| !declaration \| external \| hidden \| X \| local dynamic !pic \| declaration \| internal \| default \| tls5.ll tls6.ll \| local exec pic \| declaration \| internal \| default \| X \| local dynamic The ones marked with an X have not been implemented since local dynamic is not implemented. llvm-svn: 65632	2009-02-27 13:37:18 +00:00
Evan Cheng	a49de9de2e	Revert BuildVectorSDNode related patches: 65426, 65427, and 65296. llvm-svn: 65482	2009-02-25 22:49:59 +00:00
Dale Johannesen	7d12ea0f62	Fix big-endian codegen bug. We're splitting up overly long ints, e.g. i96, into pieces at PHIs and the nodes that feed into them; however big-endian reverses the order of the pieces (for some reason), and wasn't doing it the same way on both sides, so the pieces didn't match and runtime failures ensued. Fixes 188.ammp and sqlite3 on ppc32. llvm-svn: 65481	2009-02-25 22:39:13 +00:00
Evan Cheng	86673f2806	Clean up dwarf writer, part 1. This eliminated the horrible recursive getGlobalVariablesUsing and replaced it something readable. It eliminated use of slow UniqueVector and replaced it with StringMap, SmallVector, and DenseMap, etc. It also fixed some non-deterministic behavior. This is a very minor compile time win. llvm-svn: 65438	2009-02-25 07:04:34 +00:00
Scott Michel	e2fdc31759	Expand tabs to spaces (overlooked in previous commit) llvm-svn: 65427	2009-02-25 03:57:49 +00:00
Scott Michel	bb878288cb	Remove all "cached" data from BuildVectorSDNode, preferring to retrieve results via reference parameters. This patch also appears to fix Evan's reported problem supplied as a reduced bugpoint test case. llvm-svn: 65426	2009-02-25 03:12:50 +00:00
Bill Wendling	c5437ea429	Overhaul my earlier submission due to feedback. It's a large patch, but most of them are generic changes. - Use the "fast" flag that's already being passed into the asm printers instead of shoving it into the DwarfWriter. - Instead of calling "MI->getParent()->getParent()" for every MI, set the machine function when calling "runOnMachineFunction" in the asm printers. llvm-svn: 65379	2009-02-24 08:30:20 +00:00
Bill Wendling	786c5973f7	- Use the "Fast" flag instead of "OptimizeForSize" to determine whether to emit a DBG_LABEL or not. We want to fall back to the original way of emitting debug info when we're in -O0/-fast mode. - Add plumbing in to pass the "Fast" flag to places that need it. - XFAIL DebugInfo/deaddebuglabel.ll. This is finding 11 labels instead of 8. I need to investigate still. llvm-svn: 65367	2009-02-24 02:35:30 +00:00
Dan Gohman	4f356bb9b0	Fix a ValueTracking rule: RHS means operand 1, not 0. Add a simple ashr instcombine to help expose this code. And apply the fix to SelectionDAG's copy of this code too. llvm-svn: 65364	2009-02-24 02:00:40 +00:00
Scott Michel	9d31aca679	Introduce the BuildVectorSDNode class that encapsulates the ISD::BUILD_VECTOR instruction. The class also consolidates the code for detecting constant splats that's shared across PowerPC and the CellSPU backends (and might be useful for other backends.) Also introduces SelectionDAG::getBUID_VECTOR() for generating new BUILD_VECTOR nodes. llvm-svn: 65296	2009-02-22 23:36:09 +00:00
Richard Pennington	99f6d7c9fc	bug 3610: Floating point vaarg not softened. llvm-svn: 65239	2009-02-21 19:11:18 +00:00
Dan Gohman	e7fe80fcf9	Fix a bug that David Greene found in the DAGCombiner's logic that checks whether it's safe to transform a store of a bitcast value into a store of the original value. llvm-svn: 65201	2009-02-20 23:29:13 +00:00
Bill Wendling	7b9f38ad37	Temporarily revert r65065. It was causing test failures. llvm-svn: 65068	2009-02-19 21:57:07 +00:00
Bill Wendling	df78dcc0b2	Check for -fast here too. llvm-svn: 65065	2009-02-19 21:23:54 +00:00
Bill Wendling	19e0a5b3c3	Generate these labels when we're in "fast" mode, not simply when we're no in "optimize-for-size" mode. llvm-svn: 65064	2009-02-19 21:12:54 +00:00
Scott Michel	cf0da6c597	Remove trailing whitespace to reduce later commit patch noise. (Note: Eventually, commits like this will be handled via a pre-commit hook that does this automagically, as well as expand tabs to spaces and look for 80-col violations.) llvm-svn: 64827	2009-02-17 22:15:04 +00:00
Bill Wendling	3c50922ea0	--- Merging (from foreign repository) r64714 into '.': U include/llvm/CodeGen/DebugLoc.h U lib/CodeGen/SelectionDAG/LegalizeDAG.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuild.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp Enable debug location generation at -Os. This goes with the reapplication of the r63639 patch. llvm-svn: 64715	2009-02-17 01:04:54 +00:00
Dan Gohman	aaee6c9523	Don't assume that a left-shift of a value with one bit set will have one bit set, because the bit may be shifted off the end. Instead, just check for a constant 1 being shifted. This is still sufficient to handle all the cases in test/CodeGen/X86/bt.ll. This fixes PR3583. llvm-svn: 64622	2009-02-15 23:59:32 +00:00
Cedric Venet	d1e179d992	Unbreak the build on win32. Cleanup some warning. Remark: when struct/class are declared differently than they are defined, this make problem for VC++ since it seems to mangle class differently that struct. These error are very hard to understand and find. So please, try to keep your definition/declaration in sync. Only tested with VS2008. hope it does not break anything. feel free to revert. llvm-svn: 64554	2009-02-14 16:06:42 +00:00
Bill Wendling	65c0fd4c44	Revert this. It was breaking stuff. llvm-svn: 64428	2009-02-13 02:16:35 +00:00
Bill Wendling	1c21ac3066	Turn off the old way of handling debug information in the code generator. Use the new way, where all of the information is passed on SDNodes and machine instructions. llvm-svn: 64427	2009-02-13 02:01:04 +00:00
Dale Johannesen	655775293f	Arrange to print constants that match "n" and "i" constraints in inline asm as signed (what gcc does). Add partial support for x86-specific "e" and "Z" constraints, with appropriate signedness for printing. llvm-svn: 64400	2009-02-12 20:58:09 +00:00
Chris Lattner	90880e2598	make fast isel fall back to selectiondags for VLA llvm.declare intrinsics. llvm-svn: 64379	2009-02-12 17:23:20 +00:00
Evan Cheng	b570499c25	Oops. Last second clean up messed things up. llvm-svn: 64373	2009-02-12 09:52:13 +00:00
Evan Cheng	3a14efacb6	Replace one of burr scheduling heuristic with something more sensible. Now calcMaxScratches simply compute the number of true data dependencies. This actually improve a couple of tests in dejagnu suite as many tests in llvm nightly test suite. llvm-svn: 64369	2009-02-12 08:59:45 +00:00
Dan Gohman	45889d24fe	Fix a comment. llvm-svn: 64328	2009-02-11 21:32:08 +00:00
Dan Gohman	6571ef3577	Don't use special heuristics for nodes with no data predecessors unless they actually have data successors, and likewise for nodes with no data successors unless they actually have data precessors. llvm-svn: 64327	2009-02-11 21:29:39 +00:00
Dan Gohman	298a2946f1	Delete the heuristic for non-livein CopyFromReg nodes. Non-liveinness is determined by whether the node has a Flag operand. However, if the node does have a Flag operand, it will be glued to its register's def, so the heuristic would end up spuriously applying to whatever node is the def. llvm-svn: 64319	2009-02-11 20:25:59 +00:00
Dale Johannesen	cc5fc44d02	Make a transformation added in 63266 a bit less aggressive. It was transforming (x&y)==y to (x&y)!=0 in the case where y is variable and known to have at most one bit set (e.g. z&1). This is not correct; the expressions are not equivalent when y==0. I believe this patch salvages what can be salvaged, including all the cases in bt.ll. Dan, please review. Fixes gcc.c-torture/execute/20040709-[12].c llvm-svn: 64314	2009-02-11 19:19:41 +00:00
Dan Gohman	dfaf646c34	When scheduling a block in parts, keep track of the overall instruction index across each part. Instruction indices are used to make live range queries, and live ranges can extend beyond scheduling region boundaries. Refactor the ScheduleDAGSDNodes class some more so that it doesn't have to worry about this additional information. llvm-svn: 64288	2009-02-11 04:27:20 +00:00
Dan Gohman	b95434356c	Factor out more code for computing register live-range informationfor scheduling, and generalize is so that preserves state across scheduling regions. This fixes incorrect live-range information around terminators and labels, which are effective region boundaries. In place of looking for terminators to anchor inter-block dependencies, introduce special entry and exit scheduling units for this purpose. llvm-svn: 64254	2009-02-10 23:27:53 +00:00
Evan Cheng	ce3bbe515b	Fix PR3457: Ignore control successors when looking for closest scheduled successor. A control successor doesn't read result(s) produced by the scheduling unit being evaluated. llvm-svn: 64210	2009-02-10 08:30:11 +00:00
Evan Cheng	3af42a8a14	If the target cannot issue a copy for the given source and dest registers, abort instead of silently continue. llvm-svn: 64184	2009-02-09 22:47:36 +00:00
Evan Cheng	fe174df170	Simplify code. llvm-svn: 64164	2009-02-09 21:01:06 +00:00
Evan Cheng	020588cee3	Make sure constant subscript is truncated to ptr size if it may not fit. llvm-svn: 64163	2009-02-09 20:54:38 +00:00
Dale Johannesen	9c310711bb	Use getDebugLoc forwarder instead of getNode()->getDebugLoc. No functional change. llvm-svn: 64026	2009-02-07 19:59:05 +00:00
Dan Gohman	747e55bc9a	Constify TargetInstrInfo::EmitInstrWithCustomInserter, allowing ScheduleDAG's TLI member to use const. llvm-svn: 64018	2009-02-07 16:15:20 +00:00
Dale Johannesen	8ba7132128	Make SDNode constructors take a DebugLoc always. Adjust derived classes to pass UnknownLoc where a DebugLoc does not make sense. Pick one of DebugLoc and non-DebugLoc variants to survive for all such classes. llvm-svn: 64000	2009-02-07 02:15:05 +00:00
Dale Johannesen	a72d41a67b	Remove now-unused constructors. llvm-svn: 63995	2009-02-07 01:27:09 +00:00
Dale Johannesen	62fd95d6ec	Get rid of the last non-DebugLoc versions of getNode! Many targets build placeholder nodes for special operands, e.g. GlobalBaseReg on X86 and PPC for the PIC base. There's no sensible way to associate debug info with these. I've left them built with getNode calls with explicit DebugLoc::getUnknownLoc operands. I'm not too happy about this but don't see a good improvement; I considered adding a getPseudoOperand or something, but it seems to me that'll just make it harder to read. llvm-svn: 63992	2009-02-07 00:55:49 +00:00
Dale Johannesen	84935759d5	Remove more non-DebugLoc getNode variants. Use getCALLSEQ_{END,START} to permit passing no DebugLoc there. UNDEF doesn't logically have DebugLoc; add getUNDEF to encapsulate this. llvm-svn: 63978	2009-02-06 23:05:02 +00:00
Dale Johannesen	dc93bbc4b0	And one more file. llvm-svn: 63971	2009-02-06 21:55:48 +00:00
Dale Johannesen	400dc2e2e4	Remove more non-DebugLoc versions of getNode. llvm-svn: 63969	2009-02-06 21:50:26 +00:00
Bill Wendling	03c34d0d3c	Clear out the CurDebugLoc info when doing a 'clear' on the SDL object. llvm-svn: 63967	2009-02-06 21:36:23 +00:00
Dale Johannesen	ab8e4425a3	Eliminate remaining non-DebugLoc version of getTargetNode. llvm-svn: 63951	2009-02-06 19:16:40 +00:00
Dan Gohman	817a24f8e9	Rename SelectionDAGISel::Schedule to SelectionDAGISel::CreateScheduler, and make it just create the scheduler. Leave running the scheduler to the higher-level code. This makes the higher-level code a little more explicit and easier to follow, and will help enable some future refactoring. llvm-svn: 63944	2009-02-06 18:26:51 +00:00
Dan Gohman	cd2cd9f5d7	Delete an unused member function. llvm-svn: 63941	2009-02-06 18:19:52 +00:00
Evan Cheng	066757eea1	Move getPointerRegClass from TargetInstrInfo to TargetRegisterInfo. llvm-svn: 63938	2009-02-06 17:43:24 +00:00
Dan Gohman	483377c639	Move ScheduleDAGSDNodes.h to be a private header. Front-ends that previously included this header should include SchedulerRegistry.h instead. llvm-svn: 63937	2009-02-06 17:22:58 +00:00
Dale Johannesen	2c4cf2752d	get rid of some non-DebugLoc getTargetNode variants. llvm-svn: 63909	2009-02-06 02:08:06 +00:00
Dale Johannesen	9f3f72f144	Get rid of one more non-DebugLoc getNode and its corresponding getTargetNode. Lots of caller changes. llvm-svn: 63904	2009-02-06 01:31:28 +00:00

... 13 14 15 16 17 ...

4711 Commits