llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	2fa67c9f70	Be tidy and use a break to exit from a switch block rather than just falling through the end. llvm-svn: 79383	2009-08-18 23:52:48 +00:00
Dan Gohman	4906f73a9f	Legalize the shift amount operand of SRL_PARTS, SHL_PARTS, and SRA_PARTS, as is done for SRL, SHL, and SRA. llvm-svn: 79380	2009-08-18 23:36:17 +00:00
Jim Grosbach	43bbb9de66	Remove a bit more cruft from the sjlj moving to a backend pass. llvm-svn: 79272	2009-08-17 20:25:04 +00:00
Jakob Stoklund Olesen	7f91fee62b	Be more clever about regclasses in ScheduleDAGSDNodes::EmitCopyFromReg. If two uses of a CopyFromReg want different regclasses, first try a common sub-class, then fall back on the copy emitted in AddRegisterOperand. There is no need for an assert here. The cross-class joiner usually cleans up nicely. llvm-svn: 79193	2009-08-16 17:40:59 +00:00
Evan Cheng	badf17cdc7	Needs to check whether unaligned load / store of i64 is legal here. llvm-svn: 79150	2009-08-15 23:41:42 +00:00
Benjamin Kramer	d2d5e716bd	Unbreak build. Evan, please make sure my changes are correct. llvm-svn: 79133	2009-08-15 20:46:16 +00:00
Evan Cheng	567f124305	80 col violations. llvm-svn: 79087	2009-08-15 08:38:52 +00:00
Dan Gohman	e8c913e657	Simplify this code to not depend as much on CurMBB. llvm-svn: 79068	2009-08-15 02:06:22 +00:00
Anton Korobeynikov	a6b3ce203a	Allow targets to specify their choice of calling conventions per libcall. Take advantage of this in the ARM backend to rectify broken choice of CC when hard float is in effect. PIC16 may want to see if it could be of use in MakePIC16Libcall, which works unchanged. Patch by Sandeep! llvm-svn: 79033	2009-08-14 20:10:52 +00:00
Evan Cheng	dc1869661b	Indentation change. llvm-svn: 78978	2009-08-14 01:56:37 +00:00
Owen Anderson	55f1c09e31	Push LLVMContexts through the IntegerType APIs. llvm-svn: 78948	2009-08-13 21:58:54 +00:00
David Goodwin	90e6b8b708	Add callback to allow target to adjust latency of schedule dependency edge. llvm-svn: 78910	2009-08-13 16:05:04 +00:00
Owen Anderson	117c9e8497	Add contexts to some of the MVT APIs. No functionality change yet, just the infrastructure work needed to get the contexts to where they need to be first. llvm-svn: 78759	2009-08-12 00:36:31 +00:00
Owen Anderson	c6daf8f17c	Fix warnings. llvm-svn: 78725	2009-08-11 21:59:30 +00:00
Owen Anderson	9f94459d24	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Dan Gohman	7c50c9bd63	Tidy #includes. llvm-svn: 78677	2009-08-11 16:02:12 +00:00
Jim Grosbach	693e36a3e8	SjLj based exception handling unwinding support. This patch is nasty, brutish and short. Well, it's kinda short. Definitely nasty and brutish. The front-end generates the register/unregister calls into the SjLj runtime, call-site indices and landing pad dispatch. The back end fills in the LSDA with the call-site information provided by the front end. Catch blocks are not yet implemented. Built on Darwin and verified no llvm-core "make check" regressions. llvm-svn: 78625	2009-08-11 00:09:57 +00:00
Dan Gohman	9d26c85bdc	Fix a bug in the DAGCombiner's handling of multiple linked MERGE_VALUES nodes. Replacing the result values with the operands in one MERGE_VALUES node may cause another MERGE_VALUES node be CSE'd with the first one, and bring its uses along, so that the first one isn't dead, as this code expects. Fix this by iterating until the node is really dead. This fixes PR4699. llvm-svn: 78619	2009-08-10 23:43:19 +00:00
Dan Gohman	733a64db57	Fix a bug where DAGCombine was producing an illegal ConstantFP node after legalize, and remove the workaround code from the ARM backend. llvm-svn: 78615	2009-08-10 23:15:10 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Owen Anderson	c30530d105	Start moving TargetLowering away from using full MVTs and towards SimpleValueType, which will simplify the privatization of IntegerType in the future. llvm-svn: 78584	2009-08-10 18:56:59 +00:00
Dan Gohman	b717091e69	Make this comment more closely reflect the code. llvm-svn: 78569	2009-08-10 16:50:32 +00:00
Jakob Stoklund Olesen	dc6bccbaa6	Don't build illegal ops in DAGCombiner::SimplifyBinOpWithSameOpcodeHands(). Blackfin supports and/or/xor on i32 but not on i16. Teach DAGCombiner::SimplifyBinOpWithSameOpcodeHands to not produce illegal nodes after legalize ops. llvm-svn: 78497	2009-08-08 20:42:17 +00:00
Dale Johannesen	352fa92995	Use stripPointerCasts instead of partially rewriting it. llvm-svn: 78350	2009-08-06 22:45:51 +00:00
Dan Gohman	695d811ad5	Add assertion checks after the calls to LowerFormalArguments, LowerCall, and LowerReturn, to verify that the targets' hooks have respected some of their postconditions. llvm-svn: 78312	2009-08-06 15:37:27 +00:00
Dan Gohman	ee902509a8	Remove an over-aggressive assert. Functions with empty struct return types don't have any return values, from CodeGen's perspective. This fixes PR4688. llvm-svn: 78311	2009-08-06 15:07:58 +00:00
Dan Gohman	5758e1e92a	Fix a few places in DAGCombiner that were creating all-ones-bits and high-bits values in ways that weren't correct for integer types wider than 64 bits. This fixes a miscompile in PPMacroExpansion.cpp in clang on x86-64. llvm-svn: 78295	2009-08-06 09:18:59 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Dan Gohman	15873a8ff7	Propogate the Depth argument when calling TLI.computeMaskedBitsForTargetNode from ComputeMaskedBits, since the former may call back into the latter. This fixes a major compile time problem on a testcase that happnened to hit this in a particularly bad way, PR4643. llvm-svn: 78023	2009-08-04 00:24:42 +00:00
Bob Wilson	5f6f72605b	Revert 77974. It breaks 3 of the ARM tests. llvm-svn: 77982	2009-08-03 19:06:29 +00:00
Sanjiv Gupta	9503900c60	Allow targets to custom handle softening of results or operands before trying the standard stuff. llvm-svn: 77974	2009-08-03 17:35:21 +00:00
Benjamin Kramer	c28b306423	llvm_report_error already prints "LLVM ERROR:". So stop reporting errors like "LLVM ERROR: llvm: error:" or "LLVM ERROR: ERROR:". llvm-svn: 77971	2009-08-03 13:33:33 +00:00
Dan Gohman	3f323847bc	Avoid forming a SELECT_CC in a type that the target doesn't support. This isn't immediately interesting, because Legalize ends up lowering SELECT_CC if the target doesn't support it, but this simplifies the process. Also, if the SELECT_CC would be expanded in Legalize, it can potentially end up with two copies of the condition expression. By leaving it as SELECT+SETCC, the SELECT can be expanded into two SELECTs that use a single SETCC. The two comparisons are usually CSE'd, but depending on when various expressions get legalized, the comparison expression could involve calls to library functions, such that the comparison expression may not be able to be CSE'd. This will be needed by a future patch. llvm-svn: 77896	2009-08-02 16:19:38 +00:00
Dan Gohman	3a9b9a59ea	Print the target flags as an int instead of a char, as they aren't actually characters. llvm-svn: 77794	2009-08-01 19:13:38 +00:00
Dan Gohman	859103d8e7	Delete a redundant variable. llvm-svn: 77774	2009-08-01 04:18:29 +00:00
Dan Gohman	7153692bdf	Minor code simplifications. llvm-svn: 77769	2009-08-01 03:51:09 +00:00
Dan Gohman	1987bf4561	SelectionDAGISel no longer needs to check hasAvailableExternallyLinkage, as it is now a MachineFunctionPass, and MachineFunctionPass now handles this. llvm-svn: 77760	2009-08-01 00:42:23 +00:00
Dan Gohman	10b8898ac0	SelectionDAGISel does not "preserve all", since it makes lots of changes to the MachineFunction. llvm-svn: 77753	2009-07-31 23:36:22 +00:00
Dan Gohman	dd3da92b4a	Use a range insert instead of an explicit loop. llvm-svn: 77752	2009-07-31 23:36:06 +00:00
Bob Wilson	84aa855ead	Allow target intrinsics that return multiple values, i.e., struct types, in SelectionDAGLowering::visitTargetIntrinsic. This removes a bit of special-case code for vector types. After staring at it for a while, I managed to convince myself that it is not necessary. The only case where TLI.getValueType() differs from MVT::getMVT is for iPTR, so this code could potentially make a difference for a vector of pointers. But, it looks like that is not supported. Calling TLI.getValueType() on a vector of pointers leads to the following sequence of calls: TargetLowering::getValueType MVT::getMVT MVT::getVectorVT(iPTR, num elements) MVT::getExtendedVectorVT MVT::getTypeForMVT for iPTR assertion fails "Type is not extended!" So, unless I'm really missing something, this bit of code is irrelevant to the current version of LLVM, which is consistent with the fact that I don't see this code in other similar places. llvm-svn: 77747	2009-07-31 22:41:21 +00:00
Owen Anderson	5a1acd9912	Move a few more APIs back to 2.5 forms. The only remaining ones left to change back are metadata related, which I'm waiting on to avoid conflicting with Devang. llvm-svn: 77721	2009-07-31 20:28:14 +00:00
Dan Gohman	5ea74d55ce	Reapply r77654 with a fix: MachineFunctionPass's getAnalysisUsage shouldn't do AU.setPreservesCFG(), because even though CodeGen passes don't modify the LLVM IR CFG, they may modify the MachineFunction CFG, and passes like MachineLoop are registered with isCFGOnly set to true. llvm-svn: 77691	2009-07-31 18:16:33 +00:00
Owen Anderson	23a204d91b	Move getTrue() and getFalse() to 2.5-like APIs. llvm-svn: 77685	2009-07-31 17:39:07 +00:00
Daniel Dunbar	5434756585	Revert r77654, it appears to be causing llvm-gcc bootstrap failures, and many failures when building assorted projects with clang. --- Reverse-merging r77654 into '.': U include/llvm/CodeGen/Passes.h U include/llvm/CodeGen/MachineFunctionPass.h U include/llvm/CodeGen/MachineFunction.h U include/llvm/CodeGen/LazyLiveness.h U include/llvm/CodeGen/SelectionDAGISel.h D include/llvm/CodeGen/MachineFunctionAnalysis.h U include/llvm/Function.h U lib/Target/CellSPU/SPUISelDAGToDAG.cpp U lib/Target/PowerPC/PPCISelDAGToDAG.cpp U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/MachineVerifier.cpp U lib/CodeGen/MachineFunction.cpp U lib/CodeGen/PrologEpilogInserter.cpp U lib/CodeGen/MachineLoopInfo.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp D lib/CodeGen/MachineFunctionAnalysis.cpp D lib/CodeGen/MachineFunctionPass.cpp U lib/CodeGen/LiveVariables.cpp llvm-svn: 77661	2009-07-31 03:02:41 +00:00
Dan Gohman	bcb44baa57	Manage MachineFunctions with an analysis Pass instead of the Annotable mechanism. To support this, make MachineFunctionPass a little more complete. llvm-svn: 77654	2009-07-31 01:52:50 +00:00
Owen Anderson	b292b8ce70	Move more code back to 2.5 APIs. llvm-svn: 77635	2009-07-30 23:03:37 +00:00
Sanjiv Gupta	a53e686d96	Allow targets to define libcall names for mem(cpy,set,move) intrinsics, rather than hardcoding them in DAG lowering. llvm-svn: 77586	2009-07-30 09:12:56 +00:00
Evan Cheng	e62288fdd4	Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch() and __sync_sub_and_fetch. When the return value is not used (i.e. only care about the value in the memory), x86 does not have to use add to implement these. Instead, it can use add, sub, inc, dec instructions with the "lock" prefix. This is currently implemented using a bit of instruction selection trick. The issue is the target independent pattern produces one output and a chain and we want to map it into one that just output a chain. The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. DAG combiner can then transform the node before it gets to target node selection. Problem #2 is we are adding a whole bunch of x86 atomic instructions when in fact these instructions are identical to the non-lock versions. We need a way to add target specific information to target nodes and have this information carried over to machine instructions. Asm printer (or JIT) can use this information to add the "lock" prefix. llvm-svn: 77582	2009-07-30 08:33:02 +00:00
Owen Anderson	4056ca9568	Move types back to the 2.5 API. llvm-svn: 77516	2009-07-29 22:17:13 +00:00
Chris Lattner	7667332899	inline the global 'getInstrOperandRegClass' function into its callers now that TargetOperandInfo does the heavy lifting. llvm-svn: 77508	2009-07-29 21:36:49 +00:00
Benjamin Kramer	21d75078b5	Remove now unused Context variables. llvm-svn: 77495	2009-07-29 19:14:17 +00:00
Owen Anderson	487375e9a2	Move ConstantExpr to 2.5 API. llvm-svn: 77494	2009-07-29 18:55:55 +00:00
Owen Anderson	4aa3295a65	Return ConstantVector to 2.5 API. llvm-svn: 77366	2009-07-28 21:19:26 +00:00
Owen Anderson	c2c7932c64	Change ConstantArray to 2.5 API. llvm-svn: 77347	2009-07-28 18:32:17 +00:00
Chris Lattner	5e693ed07b	Rip all of the global variable lowering logic out of TargetAsmInfo. Since it is highly specific to the object file that will be generated in the end, this introduces a new TargetLoweringObjectFile interface that is implemented for each of ELF/MachO/COFF/Alpha/PIC16 and XCore. Though still is still a brutal and ugly refactoring, this is a major step towards goodness. This patch also: 1. fixes a bunch of dangling pointer problems in the PIC16 backend. 2. disables the TargetLowering copy ctor which PIC16 was accidentally using. 3. gets us closer to xcore having its own crazy target section flags and pic16 not having to shadow sections with its own objects. 4. fixes wierdness where ELF targets would set CStringSection but not CStringSection_. Factor the code better. 5. fixes some bugs in string lowering on ELF targets. llvm-svn: 77294	2009-07-28 03:13:23 +00:00
Owen Anderson	69c464dec4	Move ConstantFP construction back to the 2.5-ish API. llvm-svn: 77247	2009-07-27 20:59:43 +00:00
Eli Friedman	65919b5058	Reorganize code a bit to reduce indentation. No visible functionality change. llvm-svn: 77171	2009-07-26 23:47:17 +00:00
Daniel Dunbar	ca414c7cae	Remove Value::getNameLen llvm-svn: 77148	2009-07-26 08:34:35 +00:00
Dan Gohman	1ddf98ad8e	Convert a few more things to use raw_ostream. llvm-svn: 77039	2009-07-25 01:43:01 +00:00
Daniel Dunbar	0dd5e1ed39	More migration to raw_ostream, the water has dried up around the iostream hole. - Some clients which used DOUT have moved to DEBUG. We are deprecating the "magic" DOUT behavior which avoided calling printing functions when the statement was disabled. In addition to being unnecessary magic, it had the downside of leaving code in -Asserts builds, and of hiding potentially unnecessary computations. llvm-svn: 77019	2009-07-25 00:23:56 +00:00
Owen Anderson	edb4a70325	Revert the ConstantInt constructors back to their 2.5 forms where possible, thanks to contexts-on-types. More to come. llvm-svn: 77011	2009-07-24 23:12:02 +00:00
Jakob Stoklund Olesen	1ae0736830	Add support for promoting SETCC operations. llvm-svn: 76987	2009-07-24 18:22:59 +00:00
Daniel Dunbar	796e43eede	Move more to raw_ostream, provide support for writing MachineBasicBlock, LiveInterval, etc to raw_ostream. llvm-svn: 76965	2009-07-24 10:36:58 +00:00
Daniel Dunbar	12368685d8	Switch to getNameStr(). llvm-svn: 76962	2009-07-24 08:24:36 +00:00
Chris Lattner	308c7896a4	"fix" PR4612, which is a crash on: %0 = malloc [3758096384 x i32] The "malloc" instruction doesn't support 64-bits correctly (see PR715), and should be removed. Victor is actively working on fixing this, in the meantime just don't crash. llvm-svn: 76899	2009-07-23 21:26:18 +00:00
Owen Anderson	47db941fd3	Get rid of the Pass+Context magic. llvm-svn: 76702	2009-07-22 00:24:57 +00:00
Eli Friedman	da9eda8ef6	Remove shift amount flavor. It isn't actually complete enough to be useful, and it's currently unused. (Some issues: it isn't actually rich enough to capture the semantics on many architectures, and semantics can vary depending on the type being shifted.) llvm-svn: 76633	2009-07-21 20:12:16 +00:00
Owen Anderson	c37bc69e91	Rename getConstantInt{True\|False} to get{True\|False} at Chris' behest. llvm-svn: 76598	2009-07-21 18:03:38 +00:00
Daniel Dunbar	5899e340f3	Simplify / normalize some uses of Value::getName. llvm-svn: 76553	2009-07-21 08:54:24 +00:00
Evan Cheng	a7bb55ebb6	Fix a dagga combiner bug: avoid creating illegal constant. Is this really a winning transformation? fold (shl (srl x, c1), c2) -> (shl (and x, (shl -1, c1)), (sub c2, c1)) or (srl (and x, (shl -1, c1)), (sub c1, c2)) llvm-svn: 76535	2009-07-21 05:40:15 +00:00
Owen Anderson	2ad52176f9	Move a bit more state over to the LLVMContext. llvm-svn: 76533	2009-07-21 02:47:59 +00:00
Dale Johannesen	ade297d496	Move stripping of bitcasts in inline asm arguments to a place where it affects everything. Occurs only on calls AFAIK. llvm-svn: 76502	2009-07-20 23:27:39 +00:00
Daniel Dunbar	ac0ca9241a	Fix some minor MSVC compiler warnings. llvm-svn: 76356	2009-07-19 01:38:38 +00:00
Eli Friedman	97f3f965eb	Make promotion in operation legalization for SETCC work correctly. llvm-svn: 76153	2009-07-17 05:16:04 +00:00
Jeffrey Yasskin	efad8e45fe	Add line numbers to OProfile. To do this, I added a processDebugLoc() call to the MachineCodeEmitter interface and made copying the start line of a function not conditional on whether we're emitting Dwarf debug information. I'll propagate the processDebugLoc() calls to the non-X86 targets in a followup patch. In the long run, it'll probably be better to gather this information through the DwarfWriter, but the DwarfWriter currently depends on the AsmPrinter and TargetAsmInfo, and fixing that would be out of the way for this patch. There's a bug in OProfile 0.9.4 that makes it ignore line numbers for addresses above 4G, and a patch fixing it at http://thread.gmane.org/gmane.linux.oprofile/7634 Sample output: $ sudo opcontrol --reset; sudo opcontrol --start-daemon; sudo opcontrol --start; `pwd`/Debug/bin/lli fib.bc; sudo opcontrol --stop Signalling daemon... done Profiler running. fib(40) == 165580141 Stopping profiling. $ opreport -g -d -l `pwd`/Debug/bin/lli\|head -60 Overflow stats not available CPU: Core 2, speed 1998 MHz (estimated) Counted CPU_CLK_UNHALTED events (Clock cycles when not halted) with a unit mask of 0x00 (Unhalted core cycles) count 100000 vma samples % linenr info image name symbol name 00007f67a30370b0 25489 61.2554 fib.c:24 10946.jo fib_left 00007f67a30370b0 1634 6.4106 fib.c:24 00007f67a30370b1 83 0.3256 fib.c:24 00007f67a30370b9 1997 7.8348 fib.c:24 00007f67a30370c6 2080 8.1604 fib.c:27 00007f67a30370c8 988 3.8762 fib.c:27 00007f67a30370cd 1315 5.1591 fib.c:27 00007f67a30370cf 251 0.9847 fib.c:27 00007f67a30370d3 1191 4.6726 fib.c:27 00007f67a30370d6 975 3.8252 fib.c:27 00007f67a30370db 1010 3.9625 fib.c:27 00007f67a30370dd 242 0.9494 fib.c:27 00007f67a30370e1 2782 10.9145 fib.c:28 00007f67a30370e5 3768 14.7828 fib.c:28 00007f67a30370eb 615 2.4128 (no location information) 00007f67a30370f3 6558 25.7287 (no location information) 00007f67a3037100 15603 37.4973 fib.c:29 10946.jo fib_right 00007f67a3037100 1646 10.5493 fib.c:29 00007f67a3037101 45 0.2884 fib.c:29 00007f67a3037109 2372 15.2022 fib.c:29 00007f67a3037116 2234 14.3178 fib.c:32 00007f67a3037118 612 3.9223 fib.c:32 00007f67a303711d 622 3.9864 fib.c:32 00007f67a303711f 385 2.4675 fib.c:32 00007f67a3037123 404 2.5892 fib.c:32 00007f67a3037126 634 4.0633 fib.c:32 00007f67a303712b 870 5.5759 fib.c:32 00007f67a303712d 62 0.3974 fib.c:32 00007f67a3037131 1848 11.8439 fib.c:33 00007f67a3037135 2840 18.2016 fib.c:33 00007f67a303713a 1 0.0064 fib.c:33 00007f67a303713b 1023 6.5564 (no location information) 00007f67a3037143 5 0.0320 (no location information) 000000000080c1e4 15 0.0360 MachineOperand.h:150 lli llvm::MachineOperand::isReg() const 000000000080c1e4 6 40.0000 MachineOperand.h:150 000000000080c1ec 2 13.3333 MachineOperand.h:150 ... llvm-svn: 76102	2009-07-16 21:07:26 +00:00
Owen Anderson	c277dc408b	Privatize the ConstantFP table. I'm on a roll! llvm-svn: 76097	2009-07-16 19:05:41 +00:00
Owen Anderson	20b34ac794	Move the ConstantInt uniquing table into LLVMContextImpl. This exposed a number of issues in our current context-passing stuff, which is also fixed here llvm-svn: 76089	2009-07-16 18:04:31 +00:00
Anton Korobeynikov	bbd751e410	Propagate return result extension type llvm-svn: 75925	2009-07-16 13:35:48 +00:00
Owen Anderson	f945a9ed07	Move a few more convenience factory functions from Constant to LLVMContext. llvm-svn: 75840	2009-07-15 21:51:10 +00:00
Ted Kremenek	39816d9157	Lexically order files in CMakeLists.txt files. llvm-svn: 75831	2009-07-15 21:08:16 +00:00
Owen Anderson	b6b2530000	Move EVER MORE stuff over to LLVMContext. llvm-svn: 75703	2009-07-14 23:09:55 +00:00
Torok Edwin	fbcc663cbf	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Owen Anderson	53a52215b5	Begin the painful process of tearing apart the rat'ss nest that is Constants.cpp and ConstantFold.cpp. This involves temporarily hard wiring some parts to use the global context. This isn't ideal, but it's the only way I could figure out to make this process vaguely incremental. llvm-svn: 75445	2009-07-13 04:09:18 +00:00
Chris Lattner	7b9d6ebb9c	remove llvm.part.set.* and llvm.part.select.*. They have never been implemented in codegen, have no frontend to generate them, and are better implemented with pattern matching (like the ppc backend does to generate rlwimi/rlwinm etc). PR4543 llvm-svn: 75430	2009-07-12 21:08:53 +00:00
Torok Edwin	08954aa4e1	Fix assert(0) conversion, as suggested by Chris. llvm-svn: 75423	2009-07-12 20:07:01 +00:00
Jakob Stoklund Olesen	ed0e1a0552	Implement support for promotion of AND/OR/XOR on integer types. The blackfin processor has a legal i16 type, but only logic operations on i32. llvm-svn: 75419	2009-07-12 18:10:18 +00:00
Jakob Stoklund Olesen	6b9f63cafa	Fix types in PromoteNode handling of CTPOP and friends. llvm-svn: 75418	2009-07-12 17:43:20 +00:00
Torok Edwin	56d0659726	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Torok Edwin	ccb29cd290	Convert more assert(0)+abort() -> LLVM_UNREACHABLE, and abort()/exit() -> llvm_report_error(). llvm-svn: 75363	2009-07-11 13:10:19 +00:00
Evan Cheng	ede2ce71aa	Fix up support for OptionalDefOperand when it defaults to an actual register def. I need this to get ready for major Thumb1 surgery. llvm-svn: 75328	2009-07-11 01:06:50 +00:00
Eli Friedman	106f2885d1	Use CreateStackStoreLoad helper in more places. llvm-svn: 75320	2009-07-11 00:11:07 +00:00
Bob Wilson	f76798769f	Fix an apparent copy-and-paste problem in an error message. llvm-svn: 75197	2009-07-09 23:42:59 +00:00
Eli Friedman	2b77eef160	Make EXTRACT_VECTOR_ELT a bit more flexible in terms of the returned value. Adjust other code to deal with that correctly. Make DAGTypeLegalizer::PromoteIntRes_EXTRACT_VECTOR_ELT take advantage of this new flexibility to simplify the code and make it deal with unusual vectors (like <4 x i1>) correctly. Fixes PR3037. llvm-svn: 75176	2009-07-09 22:01:03 +00:00
Owen Anderson	092bc51cdb	As Chris pointed out, we don't actually need to pass the context around here. llvm-svn: 75161	2009-07-09 18:44:09 +00:00
Owen Anderson	0504e0a222	Thread LLVMContext through MVT and related parts of SDISel. llvm-svn: 75153	2009-07-09 17:57:24 +00:00
Dan Gohman	6b04136756	Make SelectionDAG::getVectorShuffle work properly for VECTOR_SHUFFLE nodes with operand types that differ from the result type. (This doesn't normally happen right now, because SelectionDAGLowering::visitShuffleVector normalizes vector shuffles.) llvm-svn: 75081	2009-07-09 00:46:33 +00:00
David Goodwin	22c2fba978	Use common code for both ARM and Thumb-2 instruction and register info. llvm-svn: 75067	2009-07-08 23:10:31 +00:00
Duncan Sands	7dcc37b942	Nowadays vectors are only split if they have an even number of elements. Make some simplifications based on this (in particular SplitVecRes_SETCC). Tighten up some checking while there. llvm-svn: 75050	2009-07-08 21:34:03 +00:00
Duncan Sands	3f1e2409cc	Remove trailing whitespace. Reorder some methods and cases alphabetically. No functionality change. llvm-svn: 75001	2009-07-08 11:36:39 +00:00
Nick Lewycky	a21d3daadc	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Chris Lattner	4ac607332d	dag combine sext(setcc) -> vsetcc before legalize. To make this safe, VSETCC must define all bits, which is different than it was documented to before. Since all targets that implement VSETCC already have this behavior, and we don't optimize based on this, just change the documentation. We now get nice code for vec_compare.ll llvm-svn: 74978	2009-07-08 00:31:33 +00:00
Chris Lattner	f3989abdbf	SelectionDAG::SignBitIsZero doesn't work right for vectors, for now, conservatively return false. llvm-svn: 74969	2009-07-07 23:28:46 +00:00
Dale Johannesen	4e33115e5e	Operand of asm("call") (the callee function) is represented as "X" constraint and "P" modifier on x86. Make this work. (Change may not be sufficient to fix it for non-Darwin, but I'm pretty sure it won't break anything.) gcc.apple/asm-block-32.c gcc.apple/asm-block-33.c llvm-svn: 74967	2009-07-07 23:26:33 +00:00
Chris Lattner	fc74e8241a	add support for legalizing an icmp where the result is illegal (4xi1) but the input is legal (4 x i32) llvm-svn: 74964	2009-07-07 23:03:54 +00:00
Chris Lattner	f48f3be185	random code cleanups. llvm-svn: 74962	2009-07-07 22:49:15 +00:00
Chris Lattner	30220d8f98	implement support for spliting and scalarizing vector setcc's. This finishes off enough support for vector compares to get the icmp/fcmp version of 2008-07-23-VSetCC.ll passing. llvm-svn: 74961	2009-07-07 22:47:46 +00:00
Chris Lattner	f2af7f44e7	lower vector icmp/fcmp to ICMP/FCMP nodes with the right result (vector of bool). llvm-svn: 74960	2009-07-07 22:41:32 +00:00
Chris Lattner	119421421a	ScalarizeVecRes_ShiftOp and ScalarizeVecRes_BinOp are the same, eliminate the former. llvm-svn: 74959	2009-07-07 22:28:41 +00:00
Chris Lattner	cc1fed3111	add support for vector legalizing of *_EXTEND. llvm-svn: 74957	2009-07-07 22:27:17 +00:00
Owen Anderson	5c96ef7c4e	Have scoped mutexes take referenes instead of pointers. llvm-svn: 74931	2009-07-07 18:33:04 +00:00
Tilmann Scheller	aea6059ed4	Add NumFixedArgs attribute to CallSDNode which indicates the number of fixed arguments in a vararg call. With the SVR4 ABI on PowerPC, vector arguments for vararg calls are passed differently depending on whether they are a fixed or a variable argument. Variable vector arguments always go into memory, fixed vector arguments are put into vector registers. If there are no free vector registers available, fixed vector arguments are put on the stack. The NumFixedArgs attribute allows to decide for an argument in a vararg call whether it belongs to the fixed or variable portion of the parameter list. llvm-svn: 74764	2009-07-03 06:44:53 +00:00
Devang Patel	87127712b9	Simplify debug info intrisinc lowering. llvm-svn: 74733	2009-07-02 22:43:26 +00:00
Douglas Gregor	6141511621	CMake build fixes, from Xerxes Ranby llvm-svn: 74720	2009-07-02 18:53:52 +00:00
Devang Patel	6bab414f87	Simplify. llvm-svn: 74677	2009-07-02 00:28:03 +00:00
Devang Patel	846a5e4d3e	Simplify. No intentional functionality change. llvm-svn: 74673	2009-07-02 00:08:09 +00:00
Devang Patel	53d24bc7d6	Refactor. No functionality change. llvm-svn: 74659	2009-07-01 23:19:01 +00:00
Devang Patel	ea76e08645	llvm.dbg.declare is always used for local variable's debug info. llvm-svn: 74625	2009-07-01 18:51:07 +00:00
Evan Cheng	0dc101b897	Add a bit IsUndef to MachineOperand. This indicates the def / use register operand is defined by an implicit_def. That means it can def / use any register and passes (e.g. register scavenger) can feel free to ignore them. The register allocator, when it allocates a register to a virtual register defined by an implicit_def, can allocate any physical register without worrying about overlapping live ranges. It should mark all of operands of the said virtual register so later passes will do the right thing. This is not the best solution. But it should be a lot less fragile to having the scavenger try to track what is defined by implicit_def. llvm-svn: 74518	2009-06-30 08:49:04 +00:00
Chris Lattner	a4775f2b13	fix a typo that GCC should have caught that causes crashes with -view-*-dags llvm-svn: 74364	2009-06-27 00:57:02 +00:00
Chris Lattner	bc60c14c97	fix a really subtle bug in the cross section of aliases and TLS: the SelectionDAG::getGlobalAddress function properly looks through aliases to determine thread-localness, but then passes the GV* down to GlobalAddressSDNode::GlobalAddressSDNode which does not. Instead of passing down isTarget, just pass down the predetermined node opcode. This fixes some assertions with out of tree changes I'm working on. llvm-svn: 74325	2009-06-26 21:14:05 +00:00
Chris Lattner	7f82a19fbf	implement DOTGraphTraits<SelectionDAG*>::getNodeLabel in terms of SDNode::print_details to eliminate a ton of near-duplicate code. llvm-svn: 74311	2009-06-26 19:06:10 +00:00
Chris Lattner	68bb4e0e01	dot graph viewing is apparently not using SDNode::print_details, this is bad, but in the meantime lets print targetflags on node labels. llvm-svn: 74274	2009-06-26 05:55:43 +00:00
Chris Lattner	17dcba9da4	propagate target operand flags from dag nodes into MachineOperands. llvm-svn: 74273	2009-06-26 05:52:14 +00:00
Chris Lattner	54b8ebced6	fit in 80 cols llvm-svn: 74270	2009-06-26 05:39:02 +00:00
Chris Lattner	b3586b6e73	add targetflags to jump tables and constant pool entries. llvm-svn: 74204	2009-06-25 21:35:31 +00:00
Chris Lattner	8e34f98d72	allow setting target operand flags on TargetGlobalAddress nodes. llvm-svn: 74203	2009-06-25 21:21:14 +00:00
Chris Lattner	af5dbfc6f8	start bringing targetoperand flags into isel, first up, ExternalSymbol. llvm-svn: 74199	2009-06-25 18:45:50 +00:00
Owen Anderson	5defd5655e	Provide guards for this shared structure. I'm not sure this actually needs to be shared, but how/where to privatize it is not immediately clear to me. If any SelectionDAG experts see a better solution, please share! llvm-svn: 74180	2009-06-25 17:09:00 +00:00
David Greene	30048bdb63	This increases the maximum for MVT::LAST_VALUETYPE This change doubles the allowable value for MVT::LAST_VALUETYPE. It does this by doing several things. 1. Introduces MVT::MAX_ALLOWED_LAST_VALUETYPE which in this change has a value of 64. This value contains the current maximum for the MVT::LAST_VALUETYPE. 2. Instead of checking "MVT::LAST_VALUETYPE <= 32", all of those uses now become "MVT::LAST_VALUETYPE <= MVT::MAX_ALLOWED_LAST_VALUETYPE" 3. Changes the dimension of the ValueTypeActions from 2 elements to four elements and adds comments ahead of the declaration indicating the it is "(MVT::MAX_ALLOWED_LAST_VALUETYPE/32) * 2". This at least lets us find what is affected if and when MVT::MAX_ALLOWED_LAST_VALUETYPE gets changed. 4. Adds initializers for the new elements of ValueTypeActions. This does NOT add any types in MVT. That would be done separately. This doubles the size of ValueTypeActions from 64 bits to 128 bits and gives us the freedom to add more types for AVX. llvm-svn: 74110	2009-06-24 19:41:55 +00:00
Owen Anderson	b70adf2b92	Get rid of the global CFGOnly flag by threading a ShortNames parameters through the GraphViz rendering code. Update other uses in the codebase for this change. llvm-svn: 74084	2009-06-24 17:37:09 +00:00
Dale Johannesen	92c11e90c8	Rewrite 73900 per Duncan's suggestion. llvm-svn: 74082	2009-06-24 17:11:31 +00:00
Chris Lattner	3912036c25	remove dead makefile flags. llvm-svn: 74065	2009-06-24 05:29:56 +00:00
Dale Johannesen	315fb72d36	Fix memcpy expansion so it won't generate invalid types for the target (I think). This was breaking the PPC32 calling sequence. llvm-svn: 73900	2009-06-22 20:59:07 +00:00
Devang Patel	da10358c84	mv CodeGen/DebugLoc.h Support/DebugLoc.h llvm-svn: 73786	2009-06-19 22:08:58 +00:00
Eli Friedman	495d02f4a6	Minor cleanup; fixes review comments for a previous patch. Sorry for taking so long to get to this! llvm-svn: 73757	2009-06-19 06:01:55 +00:00
Sanjiv Gupta	bce3ca6ad9	Fixed names of libcalls checked in r73480. llvm-svn: 73483	2009-06-16 10:22:58 +00:00
Sanjiv Gupta	557ed09e0f	Added required libcalls for PIC16 (mostly floating points to integer casting operations). llvm-svn: 73480	2009-06-16 09:03:58 +00:00
Eli Friedman	abfad5d61e	Add some generic expansion logic for SMULO and UMULO. Fixes UMULO support for x86, and UMULO/SMULO for many architectures, including PPC (PR4201), ARM, and Cell. The resulting expansion isn't perfect, but it's not bad. llvm-svn: 73477	2009-06-16 06:58:29 +00:00
Dan Gohman	6e6808adaf	Change this from an assert to a cerr+exit, since it's diagnosing an unsupported inline asm construct, rather than verifying a code invariant. llvm-svn: 73435	2009-06-15 22:32:41 +00:00
Devang Patel	56e6fe1642	Gracefully handle imbalanced inline function begin and end markers. llvm-svn: 73426	2009-06-15 21:45:50 +00:00
Arnold Schwaighofer	cb9046cfc8	CheckTailCallReturnConstraints is missing a check on the incomming chain of the RETURN node. The incomming chain must be the outgoing chain of the CALL node. This causes the backend to identify tail calls that are not tail calls. This patch fixes this. llvm-svn: 73387	2009-06-15 14:43:36 +00:00
Eli Friedman	516479d6e7	Tweak the expansion code for BIT_CONVERT to generate better code converting from an MMX vector to an i64. llvm-svn: 73024	2009-06-07 09:41:57 +00:00
Eli Friedman	3234587213	Slightly generalize the code that handles shuffles of consecutive loads on x86 to handle more cases. Fix a bug in said code that would cause it to read past the end of an object. Rewrite the code in SelectionDAGLegalize::ExpandBUILD_VECTOR to be a bit more general. Remove PerformBuildVectorCombine, which is no longer necessary with these changes. In addition to simplifying the code, with this change, we can now catch a few more cases of consecutive loads. llvm-svn: 73012	2009-06-07 06:52:44 +00:00
Eli Friedman	c61e357aa6	Fix the expansion for CONCAT_VECTORS so that it doesn't create illegal types. llvm-svn: 72993	2009-06-06 07:08:26 +00:00
Eli Friedman	aee3f62b75	Factor out a couple of helpers. llvm-svn: 72992	2009-06-06 07:04:42 +00:00
Eli Friedman	aea9b65668	Make SINT_TO_FP/UINT_TO_FP vector legalization queries query on the integer type to be consistent with normal operation legalization. No visible change because nothing is actually using this at the moment. llvm-svn: 72980	2009-06-06 03:27:50 +00:00
Devang Patel	d1c7d34924	Add new function attribute - noimplicitfloat Update code generator to use this attribute and remove NoImplicitFloat target option. Update llc to set this attribute when -no-implicit-float command line option is used. llvm-svn: 72959	2009-06-05 21:57:13 +00:00
Nate Begeman	624690c6b2	Adapt the x86 build_vector dagcombine to the current state of the legalizer. build vectors with i64 elements will only appear on 32b x86 before legalize. Since vector widening occurs during legalize, and produces i64 build_vector elements, the dag combiner is never run on these before legalize splits them into 32b elements. Teach the build_vector dag combine in x86 back end to recognize consecutive loads producing the low part of the vector. Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes since that was required implicitly. Add a testcase for the transform. Old: subl $28, %esp movl 32(%esp), %eax movl 4(%eax), %ecx movl %ecx, 4(%esp) movl (%eax), %eax movl %eax, (%esp) movaps (%esp), %xmm0 pmovzxwd %xmm0, %xmm0 movl 36(%esp), %eax movaps %xmm0, (%eax) addl $28, %esp ret New: movl 4(%esp), %eax pmovzxwd (%eax), %xmm0 movl 8(%esp), %eax movaps %xmm0, (%eax) ret llvm-svn: 72957	2009-06-05 21:37:30 +00:00
Sanjiv Gupta	7925c5fd3f	Allow libcalls for i16 sdiv/udiv/rem operations. llvm-svn: 72941	2009-06-05 14:41:10 +00:00
Dan Gohman	a5b9645c4b	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Dale Johannesen	37bc85f89a	Fix FP_TO_UINT->i32 on ppc32 -mcpu=g5. This was using Promote which won't work because i64 isn't a legal type. It's easy enough to use Custom, but then we have the problem that when the type legalizer is promoting FP_TO_UINT->i16, it has no way of telling it should prefer FP_TO_SINT->i32 to FP_TO_UINT->i32. I have uncomfortably hacked this by making the type legalizer choose FP_TO_SINT when both are Custom. This fixes several regressions in the testsuite. llvm-svn: 72891	2009-06-04 20:53:52 +00:00
Dan Gohman	7b6b5dd954	Don't do the X * 0.0 -> 0.0 transformation in instcombine, because instcombine doesn't know when it's safe. To partially compensate for this, introduce new code to do this transformation in dagcombine, which can use UnsafeFPMath. llvm-svn: 72872	2009-06-04 17:12:12 +00:00
Dan Gohman	c2eed3b0f8	Fix comments. llvm-svn: 72870	2009-06-04 16:49:15 +00:00
Dale Johannesen	5234d3795f	Revert 72707 and 72709, for the moment. llvm-svn: 72712	2009-06-02 03:12:52 +00:00
Dale Johannesen	0b8ca79253	Make the implicit inputs and outputs of target-independent ADDC/ADDE use MVT::i1 (later, whatever it gets legalized to) instead of MVT::Flag. Remove CARRY_FALSE in favor of 0; adjust all target-independent code to use this format. Most targets will still produce a Flag-setting target-dependent version when selection is done. X86 is converted to use i32 instead, which means TableGen needs to produce different code in xxxGenDAGISel.inc. This keys off the new supportsHasI1 bit in xxxInstrInfo, currently set only for X86; in principle this is temporary and should go away when all other targets have been converted. All relevant X86 instruction patterns are modified to represent setting and using EFLAGS explicitly. The same can be done on other targets. The immediate behavior change is that an ADC/ADD pair are no longer tightly coupled in the X86 scheduler; they can be separated by instructions that don't clobber the flags (MOV). I will soon add some peephole optimizations based on using other instructions that set the flags to feed into ADC. llvm-svn: 72707	2009-06-01 23:27:20 +00:00
Duncan Sands	96e5698741	Rename CustomLowerResults to CustomLowerNode, since it is used both when a result is illegal and when an operand is illegal. llvm-svn: 72658	2009-05-31 04:15:38 +00:00
Bill Wendling	09f17a8479	Untabification. llvm-svn: 72604	2009-05-30 01:09:53 +00:00
Evan Cheng	86cdb4b345	Do not try to create a MVT type of width 0. llvm-svn: 72557	2009-05-28 23:52:18 +00:00
Eli Friedman	e1dc193f35	Re-commit r72514 and r72516 with a fixed version of BR_CC lowering. This patch removes some special cases for opcodes and does a bit of cleanup. llvm-svn: 72536	2009-05-28 20:40:34 +00:00
Evan Cheng	6673ff08fe	Incorporate patch feedbacks. llvm-svn: 72533	2009-05-28 18:41:02 +00:00
Bill Wendling	f193838d2b	Temporarily revert r72514 (and dependent patch r72516). It was causing this failure during llvm-gcc bootstrap: Assertion failed: (!Tmp2.getNode() && "Can't legalize BR_CC with legal condition!"), function ExpandNode, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp, line 2923. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/libgcc2.c:1727: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. llvm-svn: 72530	2009-05-28 18:18:59 +00:00
Eli Friedman	9b9df77260	Remove a couple of useless functions. llvm-svn: 72516	2009-05-28 04:49:34 +00:00
Eli Friedman	3aa278394e	Remove special cases for more opcodes. This is basically the end of this series of patches for LegalizeDAG; the remaining special cases can't be removed without more infrastructure work. There's a FIXME for each relevant opcode near the beginning of SelectionDAGLegalize::LegalizeOp. llvm-svn: 72514	2009-05-28 04:39:57 +00:00
Eli Friedman	5df7202d3b	Remove special case for SETCC opcode; add some comments explaining why some special cases are necessary. llvm-svn: 72511	2009-05-28 03:56:57 +00:00
Eli Friedman	e1bc3798e6	Some minor cleanups. llvm-svn: 72509	2009-05-28 03:06:16 +00:00
Evan Cheng	a9cda8abf2	Added optimization that narrow load / op / store and the 'op' is a bit twiddling instruction and its second operand is an immediate. If bits that are touched by 'op' can be done with a narrower instruction, reduce the width of the load and store as well. This happens a lot with bitfield manipulation code. e.g. orl $65536, 8(%rax) => orb $1, 10(%rax) Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization. llvm-svn: 72507	2009-05-28 00:35:15 +00:00
Eli Friedman	ed795153c7	Minor cleanups; add a better explanation for the issue with BUILD_VECTOR. llvm-svn: 72469	2009-05-27 12:42:55 +00:00
Eli Friedman	2892d82378	Remove more special cases for opcodes. llvm-svn: 72468	2009-05-27 12:20:41 +00:00
Eli Friedman	3b251705fd	Remove special cases for more opcodes. llvm-svn: 72467	2009-05-27 07:58:35 +00:00
Eli Friedman	0e49431422	Removing more special cases from LegalizeDAG. llvm-svn: 72465	2009-05-27 07:32:27 +00:00
Eli Friedman	568839681c	Eliminate more special cases for opcodes. llvm-svn: 72464	2009-05-27 07:05:37 +00:00
Eli Friedman	d6f2834496	Remove more special cases from LegalizeDAG. llvm-svn: 72456	2009-05-27 03:33:44 +00:00
Eli Friedman	b3554158c5	Remove unused argument. llvm-svn: 72455	2009-05-27 02:21:29 +00:00
Eli Friedman	a8f9a0261e	Remove more opcode special cases. llvm-svn: 72454	2009-05-27 02:16:40 +00:00
Eli Friedman	21d349b3c5	Start of refactoring LegalizeDAG so that we don't need specialized handling for every single opcode. llvm-svn: 72447	2009-05-27 01:25:56 +00:00
Eli Friedman	4a951bf2ad	Delete a bunch of dead code from LegalizeDAG. llvm-svn: 72414	2009-05-26 08:55:52 +00:00
Eli Friedman	ac149ee60a	Add a comment which should hopefully make the purpose of this method a bit clearer. llvm-svn: 72374	2009-05-24 20:32:10 +00:00
Eli Friedman	fd8b335ca4	Minor improvement to FCOPYSIGN to use BIT_CONVERT in cases where the corresponding integer type is legal. llvm-svn: 72373	2009-05-24 20:29:11 +00:00
Eli Friedman	fe87034cef	Rewrite ISD::FCOPYSIGN lowering to never use i64. Not really ideal, but it's late, and I don't have any better ideas at the moment. Fixes PR4257. llvm-svn: 72363	2009-05-24 10:21:20 +00:00
Eli Friedman	cd2e0cd297	Update for CMakeLists; untested, so tell me if there are issues. llvm-svn: 72360	2009-05-24 09:13:13 +00:00
Eli Friedman	a4e1675dac	Remove checks of getTypeAction from LegalizeOp; we already assert that all results and all operands are legal, so this change shouldn't affect behavior at all. llvm-svn: 72359	2009-05-24 08:42:01 +00:00
Eli Friedman	5e0d150689	Disable type legalization in LegalizeDAG. This leaves around 4000 lines of dead code; I'll clean that up in subsequent commits. llvm-svn: 72358	2009-05-24 02:46:31 +00:00
Eli Friedman	7badee92ad	Fix a bug in the expansion of EXTRACT_SUBVECTOR in ExpandExtractFromVectorThroughStack. llvm-svn: 72351	2009-05-23 23:03:28 +00:00
Eli Friedman	40afdb63ec	Add a proper implementation of EXTRACT_SUBVECTOR legalization that doesn't split legal vector operands. This is necessary because the type legalization (and therefore, vector splitting) code will be going away soon. llvm-svn: 72349	2009-05-23 22:37:25 +00:00
Torok Edwin	be6a9a151a	Fix PR4254. The DAGCombiner created a negative shiftamount, stored in an unsigned variable. Later the optimizer eliminated the shift entirely as being undefined. Example: (srl (shl X, 56) 48). ShiftAmt is 4294967288. Fix it by checking that the shiftamount is positive, and storing in a signed variable. llvm-svn: 72331	2009-05-23 17:29:48 +00:00
Eli Friedman	da90dd6d72	Add a new step to legalization to legalize vector math operations. This will allow simplifying LegalizeDAG to eliminate type legalization. (I have a patch to do that, but it's not quite finished; I'll commit it once it's finished and I've fixed any review comments for this patch.) See the comment at the beginning of lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp for more details on the motivation for this patch. llvm-svn: 72325	2009-05-23 12:35:30 +00:00
Duncan Sands	d6fb6501e3	Add a new codegen pass that normalizes dwarf exception handling code in preparation for code generation. The main thing it does is handle the case when eh.exception calls (and, in a future patch, eh.selector calls) are far away from landing pads. Right now in practice you only find eh.exception calls close to landing pads: either in a landing pad (the common case) or in a landing pad successor, due to loop passes shifting them about. However future exception handling improvements will result in calls far from landing pads: (1) Inlining of rewinds. Consider the following case: In function @f: ... invoke @g to label %normal unwind label %unwinds ... unwinds: %ex = call i8* @llvm.eh.exception() ... In function @g: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... "rethrow exception" Now inline @g into @f. Currently this is turned into: In function @f: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... invoke "rethrow exception" to label %normal unwind label %unwinds unwinds: %ex = call i8* @llvm.eh.exception() ... However we would like to simplify invoke of "rethrow exception" into a branch to the %unwinds label. Then %unwinds is no longer a landing pad, and the eh.exception call there is then far away from any landing pads. (2) Using the unwind instruction for cleanups. It would be nice to have codegen handle the following case: invoke @something to label %continue unwind label %run_cleanups ... handler: ... perform cleanups ... unwind This requires turning "unwind" into a library call, which necessarily takes a pointer to the exception as an argument (this patch also does this unwind lowering). But that means you are using eh.exception again far from a landing pad. (3) Bugpoint simplifications. When bugpoint is simplifying exception handling code it often generates eh.exception calls far from a landing pad, which then causes codegen to assert. Bugpoint then latches on to this assertion and loses sight of the original problem. Note that it is currently rare for this pass to actually do anything. And in fact it normally shouldn't do anything at all given the code coming out of llvm-gcc! But it does fire a few times in the testsuite. As far as I can see this is almost always due to the LoopStrengthReduce codegen pass introducing pointless loop preheader blocks which are landing pads and only contain a branch to another block. This other block contains an eh.exception call. So probably by tweaking LoopStrengthReduce a bit this can be avoided. llvm-svn: 72276	2009-05-22 20:36:31 +00:00
Jay Foad	7d0479f2c2	Use v.data() instead of &v[0] when SmallVector v might be empty. llvm-svn: 72210	2009-05-21 09:52:38 +00:00
Bill Wendling	f99bd3a82b	Temporarily revert r72191. It was causing an assert during llvm-gcc bootstrapping. llvm-svn: 72200	2009-05-21 00:04:55 +00:00
Argyrios Kyrtzidis	2b59a5fc6c	Introduce DebugScope which gets embedded into the machine instructions' DebugLoc. DebugScope refers to a debug region, function or block. llvm-svn: 72191	2009-05-20 22:57:17 +00:00
Eli Friedman	9030c35eb4	Fix for PR4235: to build a floating-point value from integer parts, build an integer and cast that to a float. This fixes a crash caused by trying to split an f32 into two f16's. This changes the behavior in test/CodeGen/XCore/fneg.ll because that testcase now triggers a DAGCombine which converts the fneg into an integer operation. If someone is interested, it's probably possible to tweak the test to generate an actual fneg. llvm-svn: 72162	2009-05-20 06:02:09 +00:00
Dan Gohman	d697a2dd8e	Remove the #ifndef NDEBUG from the FastISel debugging options. This fixes dejagnu tests that use these options. llvm-svn: 72094	2009-05-19 02:19:57 +00:00
Bill Wendling	d2dc9063d7	Revert last commit. It was wrong. llvm-svn: 72026	2009-05-18 18:21:03 +00:00
Bill Wendling	af7e400fda	Don't call RegionInlinedFnEnd if our optimization level isn't -O0. llvm-svn: 72024	2009-05-18 18:17:22 +00:00
Daniel Dunbar	a8c1658619	Silence Release-Asserts warnings. llvm-svn: 72011	2009-05-18 16:43:04 +00:00
Duncan Sands	83d008614f	Put back a bit of expensive checking logic that was overenthusiastically deleted in r70234. llvm-svn: 71926	2009-05-16 04:14:29 +00:00
Dan Gohman	d4f63052c4	Add an assert to turn a segfault on an unsupported inline asm construct into an assertion failure. llvm-svn: 71757	2009-05-14 00:30:16 +00:00
Jim Grosbach	4f915313ed	Removing the HasBuiltinSetjmp flag and associated bits. Flagging the presence of exception handling builtin sjlj targets in functions turns out not to be necessary. Marking the intrinsic implementation in the .td file as defining all registers is sufficient to get the context saved properly by the containing function. llvm-svn: 71743	2009-05-13 23:50:53 +00:00
Evan Cheng	ab0d23396a	Run code placement optimization for targets that want it (arm and x86 for now). llvm-svn: 71726	2009-05-13 21:42:09 +00:00
Jim Grosbach	aeca45dd6f	Add support for GCC compatible builtin setjmp and longjmp intrinsics. This is a supporting preliminary patch for GCC-compatible SjLJ exception handling. Note that these intrinsics are not designed to be invoked directly by the user, but rather used by the front-end as target hooks for exception handling. llvm-svn: 71610	2009-05-12 23:59:14 +00:00
Dan Gohman	9521cadff7	When scalarizing a vector BITCAST, check whether the operand has vector type, rather than assume that it does. If the operand is not vector, it shouldn't be run through ScalarizeVectorOp. This fixes one of the testcases in PR3886. llvm-svn: 71453	2009-05-11 18:30:42 +00:00
Bill Wendling	d6280534e4	--- Reverse-merging r71370 into '.': U lib/CodeGen/SelectionDAG/SelectionDAGBuild.cpp Revert r71370. llvm-svn: 71373	2009-05-10 00:10:50 +00:00
Bill Wendling	d53af35629	A debug function start was not being recorded when the optimization level wasn't None. However, we were always recording the region end. There's no longer a good reason for this code to be separated out between the different opt levels, as it was doing pretty much the same thing anyway. llvm-svn: 71370	2009-05-09 23:51:35 +00:00
Duncan Sands	af9eaa830a	Rename PaddedSize to AllocSize, in the hope that this will make it more obvious what it represents, and stop it being confused with the StoreSize. llvm-svn: 71349	2009-05-09 07:06:46 +00:00
Bill Wendling	8881780832	Mirror how Fast ISel determines if a region.end intrinsic is the end of an inlined function or the end of a function. Before, this was never executing the "inlined" version of the Record method. This will become important once the inlined Dwarf writer patch lands. llvm-svn: 71268	2009-05-08 21:14:49 +00:00
Anton Korobeynikov	65a58168cc	Factor out cycle-finder code and make it generic. llvm-svn: 71241	2009-05-08 18:51:58 +00:00
Anton Korobeynikov	c94dbf5ba0	Do not emit bit tests if target does not support natively left shift llvm-svn: 71240	2009-05-08 18:51:34 +00:00
Anton Korobeynikov	e7a9661f31	Properly expand libcalls for urem / srem. Also make code more straightforward. llvm-svn: 71238	2009-05-08 18:51:08 +00:00
Anton Korobeynikov	e2b78115d4	Typo llvm-svn: 71237	2009-05-08 18:50:54 +00:00
Dan Gohman	4bb6fa23cb	Revert 71165. It did more than just revert 71158 and it introduced several regressions. The problem due to 71158 is now fixed. llvm-svn: 71176	2009-05-07 19:46:24 +00:00
Bill Wendling	17f0f65499	Temporarily revert r71158. It was causing a failure during a full bootstrap: checking for bcopy... no checking for getc_unlocked... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decUtility.c:360: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decUtility.o] Error 1 make[4]: * Waiting for unfinished jobs.... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decNumber.c:5591: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decNumber.o] Error 1 make[3]: * [all-stage2-libdecnumber] Error 2 make[3]: *** Waiting for unfinished jobs.... llvm-svn: 71165	2009-05-07 17:26:14 +00:00
Argyrios Kyrtzidis	baf3fee885	Make DwarfWriter::RecordInlinedFnStart more like the other DwarfWriter's methods: -Have it return a label ID -Remove the unused Instruction parameter No functionality change. llvm-svn: 71132	2009-05-07 00:16:31 +00:00
Evan Cheng	cfc0513080	Do not use register as base ptr of pre- and post- inc/dec load / store nodes. llvm-svn: 71098	2009-05-06 18:25:01 +00:00
Duncan Sands	2338f6c57e	Add generic expansion of SUB when ADD and XOR are legal. Based on a patch by Micah Villmow. llvm-svn: 71078	2009-05-06 11:29:50 +00:00
Evan Cheng	1ff2727c95	Move getInstrOperandRegClass from the scheduler to TargetInstrInfo. llvm-svn: 70950	2009-05-05 00:30:09 +00:00
Chris Lattner	354b12259f	Make DBG_STOPPOINT nodes, and therefore DBG_LABEL labels, get a DebugLoc, so that it shows up in -print-machineinstrs. This doesn't appear to affect anything, but it was weird for some DBG_LABELs to have DebugLocs but not all of them. llvm-svn: 70921	2009-05-04 22:10:05 +00:00
Argyrios Kyrtzidis	9ae29b2d8f	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. (Comes with Regression-Be-Gone(tm)) llvm-svn: 70871	2009-05-04 16:23:49 +00:00
Argyrios Kyrtzidis	79be34012f	Revert r70803 for now, it causes a regression. llvm-svn: 70811	2009-05-03 23:27:19 +00:00
Argyrios Kyrtzidis	ce7196b903	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. llvm-svn: 70803	2009-05-03 22:03:35 +00:00
Anton Korobeynikov	2745bc92fa	Fix typo llvm-svn: 70770	2009-05-03 13:19:57 +00:00
Anton Korobeynikov	05b7a7c8f8	Properly handle sdiv / udiv / srem / urem libcalls llvm-svn: 70764	2009-05-03 13:18:16 +00:00
Anton Korobeynikov	399ad444fd	Proper name 16 bit libcalls llvm-svn: 70750	2009-05-03 13:14:08 +00:00
Anton Korobeynikov	f3fc92d6fc	Add libcall expansion for 16 and 128 bit muls llvm-svn: 70749	2009-05-03 13:13:51 +00:00
Argyrios Kyrtzidis	97324cec99	-Move the DwarfWriter::ValidDebugInfo check to a static DIDescriptor::ValidDebugInfo -Create DebugLocs without the need to have a DwarfWriter around llvm-svn: 70682	2009-05-03 08:50:41 +00:00
Bob Wilson	62a3124fb8	Allow CONCAT_VECTORS nodes to be legal or have custom lowering for some targets. Changes to take advantage of this will come later. llvm-svn: 70560	2009-05-01 17:55:32 +00:00
Argyrios Kyrtzidis	a5037484a4	Make DebugLoc independent of DwarfWriter. -Replace DebugLocTuple's Source ID with CompileUnit's GlobalVariable* -Remove DwarfWriter::getOrCreateSourceID -Make necessary changes for the above (fix callsites, etc.) llvm-svn: 70520	2009-04-30 23:22:31 +00:00
Jay Foad	fe0c648fee	Move helper functions for optimizing division by constant into the APInt class. llvm-svn: 70488	2009-04-30 10:15:35 +00:00
Chris Lattner	5ab42e93c4	fix a regression handling indirect results: these need to be considered memory operands otherwise the writebacks get lost when the inline asm doesn't otherwise have side effects. This fixes rdar://6839427, though clang really shouldn't generate these anymore. llvm-svn: 70455	2009-04-30 00:48:50 +00:00
Bill Wendling	026e5d7667	Instead of passing in an unsigned value for the optimization level, use an enum, which better identifies what the optimization is doing. And is more flexible for future uses. llvm-svn: 70440	2009-04-29 23:29:43 +00:00
Nate Begeman	7e6e352735	Fix infinite recursion in the C++ code which handles movddup by making it unnecessary. llvm-svn: 70425	2009-04-29 22:47:44 +00:00
Nate Begeman	39b59db245	Update comment, replace theoretically impossible check with an assert. llvm-svn: 70391	2009-04-29 18:13:31 +00:00
Nate Begeman	5f829d896d	Implement review feedback for vector shuffle work. llvm-svn: 70372	2009-04-29 05:20:52 +00:00
Sanjiv Gupta	ccd30945f9	Add a public method called getAddressSpace() to the GlobalAddressSDNode. llvm-svn: 70366	2009-04-29 04:43:24 +00:00
Chris Lattner	7d10386113	Disable the load-shrinking optimization from looking at anything larger than 64-bits, avoiding a crash. This should really be fixed to use APInts, though type legalization happens to help us out and we get good code on the attached testcase at least. This fixes rdar://6836460 llvm-svn: 70360	2009-04-29 03:45:07 +00:00
Bill Wendling	084669a1c9	Second attempt: Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'll change the JIT with a follow-up patch. llvm-svn: 70343	2009-04-29 00:15:41 +00:00
Jakob Stoklund Olesen	604248e81f	Move getSubRegisterRegClass from ScheduleDagSDNodesEmit.cpp to a TargetRegisterClass method. Also make the method non-asserting. It will return NULL when given an invalid subreg index. The method is needed by an upcoming patch. llvm-svn: 70296	2009-04-28 16:34:09 +00:00
Bill Wendling	56f2987a87	r70270 isn't ready yet. Back this out. Sorry for the noise. llvm-svn: 70275	2009-04-28 01:04:53 +00:00
Bill Wendling	d0ae15946c	Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'm not 100% sure if it's necessary to change it there... llvm-svn: 70270	2009-04-28 00:21:31 +00:00
Duncan Sands	bfa037705e	Now that PR2957 is resolved, remove a bunch of no-longer needed workarounds. llvm-svn: 70234	2009-04-27 19:33:03 +00:00
Nate Begeman	8d6d4b9289	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Dan Gohman	be36f5ccda	When transforming sext(trunc(load(x))) into sext(smaller load(x)), the trunc is directly replaced with the smaller load, so don't try to create a new sext node. This fixes PR4050. llvm-svn: 70179	2009-04-27 02:00:55 +00:00
Dan Gohman	fe9e1d5b59	Refactor the code to grab the low and high parts of a value using EXTRACT_ELEMENT into a utility function. llvm-svn: 70056	2009-04-25 17:55:53 +00:00
Dan Gohman	4539987920	Add a top-level comment about DAGCombiner's role in the compiler. llvm-svn: 70052	2009-04-25 17:09:45 +00:00
Dale Johannesen	56cb14c874	Fix PR 4057, a crash doing float->char const folding. This particular one is undefined behavior (although this isn't related to the crash), so it will no longer do it at compile time, which seems better. llvm-svn: 69990	2009-04-24 21:34:13 +00:00
Rafael Espindola	b93db668b3	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	bb881d66f4	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Dan Gohman	640a161c73	Instead of requiring TLI.LowerCallTo to return an ISD::BUILD_PAIR, use ISD::EXTRACT_ELEMENT. SelectionDAG has a special fast-path for the cast of an EXTRACT_ELEMENT with a BUILD_PAIR operand, for the common case. llvm-svn: 69948	2009-04-24 02:40:23 +00:00
Dan Gohman	9478c3f8e5	Factor out a bit of code that appears in several places into a utility function. llvm-svn: 69937	2009-04-23 23:13:24 +00:00
Dan Gohman	a290ab44e8	Handle Void types in ComputeValueVTs. This doesn't currently occur, but this change makes the code more general and easier to adapt for new purposes. llvm-svn: 69935	2009-04-23 22:50:03 +00:00
Dan Gohman	1addf64735	Make X86's copyRegToReg able to handle copies to and from subclasses. This makes the extra copyRegToReg calls in ScheduleDAGSDNodesEmit.cpp unnecessary. Derived from a patch by Jakob Stoklund Olesen. llvm-svn: 69635	2009-04-20 22:54:34 +00:00
Dan Gohman	e014b69919	Simplify this code. getConstant knows how to make broadcasted vector constants. llvm-svn: 69634	2009-04-20 22:51:43 +00:00
Bob Wilson	da188ebbbd	Revise my previous change 68996 as suggested by Duncan. llvm-svn: 69607	2009-04-20 17:27:09 +00:00
Duncan Sands	f2e7133d34	Now that BUILD_VECTOR operands are allowed to be bigger than the vector element type, turn checking of the operand type back on again, appropriately adjusted. llvm-svn: 69516	2009-04-19 06:40:30 +00:00
Chris Lattner	7b01e66443	Fix PR3898, which manifests as failures on are an Xcore, patch by Jakob Stoklund Olesen! llvm-svn: 69472	2009-04-18 20:48:07 +00:00
Duncan Sands	e4ff21ba4b	Don't try to make BUILD_VECTOR operands have the same type as the vector element type: allow them to be of a wider integer type than the element type all the way through the system, and not just as far as LegalizeDAG. This should be safe because it used to be this way (the old type legalizer would produce such nodes), so backends should be able to handle it. In fact only targets which have legal vector types with an illegal promoted element type will ever see this (eg: <4 x i16> on ppc). This fixes a regression with the new type legalizer (vec_splat.ll). Also, treat SCALAR_TO_VECTOR the same as BUILD_VECTOR. After all, it is just a special case of BUILD_VECTOR. llvm-svn: 69467	2009-04-18 20:16:54 +00:00
Dale Johannesen	ad968ee286	Inline asm's were still introducing bogus dependencies; my earlier patch to this code only fixed half of it. llvm-svn: 69408	2009-04-18 00:09:40 +00:00
Dan Gohman	eefba6bbe0	In the list-burr's pseudo two-addr dependency heuristics, don't add dependencies on nodes with exactly one successor which is a COPY_TO_REGCLASS node. In the case that the copy is coalesced away, the dependence should be on the user of the copy, rather than the copy itself. llvm-svn: 69309	2009-04-16 20:59:02 +00:00
Dan Gohman	3027bb6953	Handle SUBREG_TO_REG instructions with the same heuristics as INSERT_SUBREG instructions in the list-burr scheduler. llvm-svn: 69308	2009-04-16 20:57:10 +00:00
Devang Patel	dab01f3fd6	Do not treat beginning of inlined scope as beginning of normal function scope if the location info is missing. Insetad of doing ... if (inlined_subroutine && known_location) DW_TAG_inline_subroutine else DW_TAG_subprogram do if (inlined_subroutine) { if (known_location) DW_TAG_inline_subroutine } else { DW_TAG_subprogram } llvm-svn: 69300	2009-04-16 17:55:30 +00:00
Devang Patel	9ac4390bf4	Record line number at the beginning of a func.start. This line was accidently lost yesterday. llvm-svn: 69286	2009-04-16 15:07:09 +00:00
Devang Patel	653dee0884	In -fast mode do what FastISel does. This code could use some refactoring help! llvm-svn: 69254	2009-04-16 02:33:41 +00:00
Devang Patel	46b04e4d06	If FastISel is run and it has known DebugLoc then use it. llvm-svn: 69253	2009-04-16 01:33:10 +00:00
Devang Patel	43fc7e481b	If location where the function was inlined is not know then do not emit debug info describing inlinied region. llvm-svn: 69252	2009-04-16 01:31:54 +00:00
Devang Patel	2738d7312a	Add DISubprogram is not null check. This fixes test/CodeGen//2009-01-21-invalid-debug-info.m test case. llvm-svn: 69210	2009-04-15 20:11:08 +00:00
Dan Gohman	8aa28b9c34	Generalize one of the SelectionDAG::ReplaceAllUsesWith overloads to support replacing a node with another that has a superset of the result types. Use this instead of calling ReplaceAllUsesOfValueWith for each value. llvm-svn: 69209	2009-04-15 20:06:30 +00:00
Devang Patel	32d17a1a29	Construct and emit DW_TAG_inlined_subroutine DIEs for inlined subroutine scopes (only in FastISel mode). llvm-svn: 69116	2009-04-15 00:10:26 +00:00
Dan Gohman	e5cd1fcdb9	When the result of an EXTRACT_SUBREG, INSERT_SUBREG, or SUBREG_TO_REG operator is used by a CopyToReg to export the value to a different block, don't reuse the CopyToReg's register for the subreg operation result if the register isn't precisely the right class for the subreg operation. Also, rename the h-registers.ll test, now that there are more than one. llvm-svn: 69087	2009-04-14 22:17:14 +00:00
Dale Johannesen	83593f4167	Do not force asm's to be chained if they don't touch memory and aren't volatile. This was interfering with good scheduling. llvm-svn: 69008	2009-04-14 00:56:56 +00:00
Daniel Dunbar	097f630dad	Make these errors more noticable in build logs. llvm-svn: 68998	2009-04-13 22:26:09 +00:00
Bob Wilson	59dbbb2bb4	Change SelectionDAG type legalization to allow BUILD_VECTOR operands to be promoted to legal types without changing the type of the vector. This is following a suggestion from Duncan (http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019923.html). The transformation that used to be done during type legalization is now postponed to DAG legalization. This allows the BUILD_VECTORs to be optimized and potentially handled specially by target-specific code. It turns out that this is also consistent with an optimization done by the DAG combiner: a BUILD_VECTOR and INSERT_VECTOR_ELT may be combined by replacing one of the BUILD_VECTOR operands with the newly inserted element; but INSERT_VECTOR_ELT allows its scalar operand to be larger than the element type, with any extra high bits being implicitly truncated. The result is a BUILD_VECTOR where one of the operands has a type larger the the vector element type. Any code that operates on BUILD_VECTORs may now need to be aware of the potential type discrepancy between the vector element type and the BUILD_VECTOR operands. This patch updates all of the places that I could find to handle that case. llvm-svn: 68996	2009-04-13 22:05:19 +00:00
Dan Gohman	6c1426308c	Rename COPY_TO_SUBCLASS to COPY_TO_REGCLASS, and generalize it accordingly. Thanks to Jakob Stoklund Olesen for pointing out how this might be useful. llvm-svn: 68986	2009-04-13 21:06:25 +00:00
Bob Wilson	f6c2195383	Refactor some code in SelectionDAGLegalize::ExpandBUILD_VECTOR. llvm-svn: 68981	2009-04-13 20:20:30 +00:00
Devang Patel	0431504fb2	Right now, Debugging information to encode scopes (DW_TAG_lexical_block) relies on DBG_LABEL. Unfortunately this intefers with the quality of optimized code. This patch updates dwarf writer to encode scoping information in DWARF only in FastISel mode. llvm-svn: 68973	2009-04-13 18:13:16 +00:00
Devang Patel	80be3511ed	Reapply 68847. Now debug_inlined section is covered by TAI->doesDwarfUsesInlineInfoSection(), which is false by default. llvm-svn: 68964	2009-04-13 17:02:03 +00:00
Dan Gohman	60a446ab02	Add a new TargetInstrInfo MachineInstr opcode, COPY_TO_SUBCLASS. This will be used to replace things like X86's MOV32to32_. Enhance ScheduleDAGSDNodesEmit to be more flexible and robust in the presense of subregister superclasses and subclasses. It can now cope with the definition of a virtual register being in a subclass of a use. Re-introduce the code for recording register superreg classes and subreg classes. This is needed because when subreg extracts and inserts get coalesced away, the virtual registers are left in the correct subclass. llvm-svn: 68961	2009-04-13 15:38:05 +00:00
Chris Lattner	a101f6f8d3	make UpdateValueMap handle the possiblity that we could be copying into the right register, avoiding a copy. llvm-svn: 68889	2009-04-12 07:46:30 +00:00
Chris Lattner	ada5d6c37e	optimize FastISel::UpdateValueMap to avoid duplicate map lookups, and make it return the assigned register. llvm-svn: 68888	2009-04-12 07:45:01 +00:00
Dan Gohman	825236b116	Revert r68847. It breaks the build on non-Darwin targets, with this message from the assembler: Error: unknown pseudo-op: `.debug_inlined' llvm-svn: 68863	2009-04-11 15:57:04 +00:00
Devang Patel	790e60999e	Keep track of inlined functions and their locations. This information is collected when nested llvm.dbg.func.start intrinsics are seen. (Right now, inliner removes nested llvm.dbg.func.start intrinisics during inlining.) Create debug_inlined dwarf section using these information. This info is used by gdb, at least on Darwin, to enable better experience debugging inlined functions. See DwarfWriter.cpp for more information on structure of debug_inlined section. llvm-svn: 68847	2009-04-11 00:16:47 +00:00
Bob Wilson	f074ca7454	Clean up a bunch of whitespace issues and fix a comment typo. No functional changes. llvm-svn: 68808	2009-04-10 18:48:47 +00:00
Dan Gohman	e517ae4211	Now that register classes have names, include the name in debug output. llvm-svn: 68786	2009-04-10 15:59:38 +00:00
Dan Gohman	de912e2475	Remove the obsolete SelectionDAG::getNodeValueTypes and simplify code that uses it by using SelectionDAG::getVTList instead. llvm-svn: 68744	2009-04-09 23:54:40 +00:00
Devang Patel	a68bdef482	Silence unused variable warning. llvm-svn: 68735	2009-04-09 23:45:17 +00:00
Devang Patel	a2c2b85df4	llvm.dbg.func_start also defines beginning of function scope. llvm-svn: 68727	2009-04-09 21:42:11 +00:00
Dan Gohman	0e8d199f91	Generalize ExtendUsesToFormExtLoad to be usable for ANY_EXTEND, in addition to ZERO_EXTEND and SIGN_EXTEND. Fix a bug in the way it checked for live-out values, and simplify the way it find users by using SDNode::use_iterator's (relatively) new features. Also, make it slightly more permissive on targets with free truncates. In SelectionDAGBuild, avoid creating ANY_EXTEND nodes that are larger than necessary. If the target's SwitchAmountTy has enough bits, use it. This exposes the truncate to optimization early, enabling more optimizations. llvm-svn: 68670	2009-04-09 03:51:29 +00:00
Dan Gohman	e6db8ca5eb	Don't copy the operand of a SwitchInst into virtual registers as eagerly. This helps avoid CopyToReg nodes in some cases where they aren't needed, and also helps subsequent optimizer heuristics in cases where the extra nodes would cause the node to appear to have multiple results. This doesn't have a significant impact currently; it'll help an upcoming change. llvm-svn: 68667	2009-04-09 02:33:36 +00:00
Duncan Sands	5a82613db0	Soft float support for FREM. llvm-svn: 68614	2009-04-08 16:20:57 +00:00
Duncan Sands	fb438caac6	Soft float support for undef. Reported by Xerxes Rånby. llvm-svn: 68607	2009-04-08 13:33:37 +00:00
Dan Gohman	ad3e549a53	Implement support for using modeling implicit-zero-extension on x86-64 with SUBREG_TO_REG, teach SimpleRegisterCoalescing to coalesce SUBREG_TO_REG instructions (which are similar to INSERT_SUBREG instructions), and teach the DAGCombiner to take advantage of this on targets which support it. This eliminates many redundant zero-extension operations on x86-64. This adds a new TargetLowering hook, isZExtFree. It's similar to isTruncateFree, except it only applies to actual definitions, and not no-op truncates which may not zero the high bits. Also, this adds a new optimization to SimplifyDemandedBits: transform operations like x+y into (zext (add (trunc x), (trunc y))) on targets where all the casts are no-ops. In contexts where the high part of the add is explicitly masked off, this allows the mask operation to be eliminated. Fix the DAGCombiner to avoid undoing these transformations to eliminate casts on targets where the casts are no-ops. Also, this adds a new two-address lowering heuristic. Since two-address lowering runs before coalescing, it helps to be able to look through copies when deciding whether commuting and/or three-address conversion are profitable. Also, fix a bug in LiveInterval::MergeInClobberRanges. It didn't handle the case that a clobber range extended both before and beyond an existing live range. In that case, multiple live ranges need to be added. This was exposed by the new subreg coalescing code. Remove 2008-05-06-SpillerBug.ll. It was bugpoint-reduced, and the spiller behavior it was looking for no longer occurrs with the new instruction selection. llvm-svn: 68576	2009-04-08 00:15:30 +00:00
Devang Patel	10f7c3deb7	Revert prev. patch for now. llvm-svn: 68569	2009-04-07 23:00:04 +00:00
Devang Patel	ddafc03e41	Right now DBG_LABEL are required for llvm.dbg.region_start and llvm.dbg.region_end in non-fast mode also. llvm-svn: 68559	2009-04-07 22:27:56 +00:00
Dan Gohman	ca93aabeba	Don't attempt to handle aggregate argument values in FastISel; let SelectionDAG do those. This fixes PR3955. llvm-svn: 68546	2009-04-07 20:40:11 +00:00
Dan Gohman	8bff8a1e87	Fix a TargetLowering optimization so that it doesn't duplicate loads when an input node has multiple uses. llvm-svn: 68398	2009-04-03 20:11:30 +00:00
Dan Gohman	b425feb2aa	Delete ISD::INSERT_SUBREG and ISD::EXTRACT_SUBREG, which are unused. Note that these are distinct from TargetInstrInfo::INSERT_SUBREG and TargetInstrInfo::EXTRACT_SUBREG, which are used. llvm-svn: 68355	2009-04-03 00:25:26 +00:00
Sanjiv Gupta	cc841a3810	To convert the StopPoint insn into an assembler directive by ISel, we need to have access to the line number field. So we convert that info as an operand by custom handling DBG_STOPPOINT in legalize. llvm-svn: 68329	2009-04-02 18:03:10 +00:00
Evan Cheng	0d551591ea	Fully general expansion of integer shift of any size. llvm-svn: 68134	2009-03-31 19:39:24 +00:00
Dan Gohman	d51f196ff5	Minor top-level comment fix. llvm-svn: 68113	2009-03-31 16:51:18 +00:00
Dan Gohman	97a20b8dbf	Fix live-out reg logic to not insert over-aggressive AssertZExt instructions. This fixes lua. llvm-svn: 68083	2009-03-31 01:38:29 +00:00
Duncan Sands	d21581eaa1	Fix PR3899: add support for extracting floats from vectors when using -soft-float. Based on a patch by Jakob Stoklund Olesen. llvm-svn: 67996	2009-03-29 13:51:06 +00:00
Arnold Schwaighofer	e622cbf385	Make check in CheckTailCallReturnConstraints for ignorable instructions between a CALL and a RET node more generic. Add a test for tail calls with a void return. llvm-svn: 67943	2009-03-28 12:36:29 +00:00
Arnold Schwaighofer	83d5420d02	Enable tail call optimization for functions that return a struct (bug 3664) and for functions that return types that need extending (e.g i1). llvm-svn: 67934	2009-03-28 08:33:27 +00:00
Evan Cheng	fd81c73cde	Optimize some 64-bit multiplication by constants into two lea's or one lea + shl since imulq is slow (latency 5). e.g. x * 40 => shlq $3, %rdi leaq (%rdi,%rdi,4), %rax This has the added benefit of allowing more multiply to be folded into addressing mode. e.g. a * 24 + b => leaq (%rdi,%rdi,2), %rax leaq (%rsi,%rax,8), %rax llvm-svn: 67917	2009-03-28 05:57:29 +00:00
Dan Gohman	2785e4be37	Fix what surely must be a copy+pasto. llvm-svn: 67881	2009-03-27 23:55:04 +00:00
Dan Gohman	6d75876473	Initialize LiveOutInfo's APInt members to zero, as APInt's default constructor produces an uninitialized APInt. This fixes PR3896. llvm-svn: 67879	2009-03-27 23:51:02 +00:00
Bill Wendling	aa28be652c	Pull transform from target-dependent code into target-independent code. llvm-svn: 67742	2009-03-26 06:14:09 +00:00
Evan Cheng	2e9f42bed5	Revert 67132. This is breaking some objective-c apps. Also fixes SDISel so it does not force promote return value if the function is not marked signext / zeroext. llvm-svn: 67701	2009-03-25 20:20:11 +00:00
Dale Johannesen	eb1646d28c	When optimizing with debug info, don't keep the stoppoint nodes around until Legalize; doing this imposed an ordering on a sequence of loads that came from different lines, interfering with scheduling. llvm-svn: 67692	2009-03-25 17:36:08 +00:00
Chris Lattner	c35847e109	more tidying: name the components of PhysReg in the case when the target constraint specifies a specific physreg. llvm-svn: 67618	2009-03-24 15:27:37 +00:00
Chris Lattner	42eceb3491	Tidy a bit more. llvm-svn: 67617	2009-03-24 15:25:07 +00:00
Chris Lattner	246eda43bd	simplify this code a bit now that "allocation to a vreg class" can never fail. llvm-svn: 67616	2009-03-24 15:22:11 +00:00
Dan Gohman	f3746cbc56	Minor compile-time optimization; don't bother checking canClobberPhysRegDefs if the successor node doesn't clobber any physical registers. llvm-svn: 67587	2009-03-24 00:50:07 +00:00
Dan Gohman	9a658d72db	Add a pre-pass to the burr-list scheduler which makes adjustments to help out the register pressure reduction heuristics in the case of nodes with multiple uses. Currently this uses very conservative heuristics, so it doesn't have a broad impact, but in cases where it does help it can make a big difference. llvm-svn: 67586	2009-03-24 00:49:12 +00:00
Dan Gohman	ed0e8d44ce	When unfolding a load during scheduling, the new operator node has a data dependency on the load node, so it really needs a data-dependence edge to the load node, even if the load previously existed. And add a few comments. llvm-svn: 67554	2009-03-23 20:20:43 +00:00
Dan Gohman	f477262e69	Don't set SUnit::hasPhysRegDefs to true unless the defs are actually have uses, which reflects the way it's used. llvm-svn: 67540	2009-03-23 17:39:36 +00:00
Dan Gohman	a366da1bf7	Fix canClobberPhysRegDefs to check all SDNodes grouped together in an SUnit, instead of just the first one. This fix is needed by some upcoming scheduler changes. llvm-svn: 67531	2009-03-23 16:23:01 +00:00
Dan Gohman	52c278e54d	Add a new bit to SUnit to record whether a node has implicit physreg defs, regardless of whether they are actually used. llvm-svn: 67528	2009-03-23 16:10:52 +00:00
Dan Gohman	4f2fea1a21	Now that errs() is properly non-buffered, there's no need to explicitly flush it. llvm-svn: 67526	2009-03-23 15:57:19 +00:00
Evan Cheng	968c3b0d6e	Model inline asm constraint which ties an input to an output register as machine operand TIED_TO constraint. This eliminated the need to pre-allocate registers for these. This also allows register allocator can eliminate the unneeded copies. llvm-svn: 67512	2009-03-23 08:01:15 +00:00
Dan Gohman	3bdc4bdba6	Simplify this code; use a while instead of an if and a do-while. llvm-svn: 67400	2009-03-20 20:42:23 +00:00
Evan Cheng	2e55923fba	For inline asm output operand that matches an input. Encode the input operand index in the high bits. llvm-svn: 67387	2009-03-20 18:03:34 +00:00
Sanjiv Gupta	e9759c458c	Fixed the comment. No functionality change. llvm-svn: 67370	2009-03-20 09:38:50 +00:00
Mon P Wang	32c8074be6	Added missing support for widening when splitting an unary op (PR3683) and expanding a bit convert (PR3711). In both cases, we extract the valid part of the widen vector and then do the conversion. llvm-svn: 67175	2009-03-18 06:24:04 +00:00
Rafael Espindola	4606b12108	Don't force promotion of return arguments on the callee. Some architectures (like x86) don't require it. This fixes bug 3779. llvm-svn: 67132	2009-03-17 23:43:59 +00:00
Chris Lattner	2363d0b8b9	Fix codegen to compute the size of an allocation by multiplying the size by the array amount as an i32 value instead of promoting from i32 to i64 then doing the multiply. Not doing this broke wrap-around assumptions that the optimizers (validly) made. The ultimate real fix for this is to introduce i64 version of alloca and remove mallocinst. This fixes PR3829 llvm-svn: 67093	2009-03-17 19:36:00 +00:00
Mon P Wang	523c0852c6	Fix a problem with DAGCombine where we were building an illegal build vector shuffle mask. Forced the mask to be built using i32. Note: this will be irrelevant once vector_shuffle no longer takes a build vector for the shuffle mask. llvm-svn: 67076	2009-03-17 06:33:10 +00:00
Mon P Wang	c86715631c	Avoid doing the transformation c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4 if FPConstant is legal because if the FPConstant doesn't need to be stored in a constant pool, the transformation is unlikely to be profitable. llvm-svn: 66994	2009-03-14 00:25:19 +00:00
Dan Gohman	a62e4ab690	Improve FastISel's handling of truncates to i1, and implement ptrtoint and inttoptr in X86FastISel. These casts aren't always handled in the generic FastISel code because X86 sometimes needs custom code to do truncation and zero-extension. llvm-svn: 66988	2009-03-13 23:53:06 +00:00
Dan Gohman	c0bb959591	Fix FastISel's assumption that i1 values are always zero-extended by inserting explicit zero extensions where necessary. Included is a testcase where SelectionDAG produces a virtual register holding an i1 value which FastISel previously mistakenly assumed to be zero-extended. llvm-svn: 66941	2009-03-13 20:42:20 +00:00
Evan Cheng	1fb8aedd1e	Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues. 1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants. 2. MachineConstantPool alignment field is also a log2 value. 3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values. 4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries. 5. Asm printer uses expensive data structure multimap to track constant pool entries by sections. 6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic. Solutions: 1. ConstantPoolSDNode alignment field is changed to keep non-log2 value. 2. MachineConstantPool alignment field is also changed to keep non-log2 value. 3. Functions that create ConstantPool nodes are passing in non-log2 alignments. 4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT. 5. Asm printer uses cheaper data structure to group constant pool entries. 6. Asm printer compute entry offsets after grouping is done. 7. Change JIT code to compute entry offsets on the fly. llvm-svn: 66875	2009-03-13 07:51:59 +00:00
Bill Wendling	fa54bc2052	Oops...I committed too much. llvm-svn: 66867	2009-03-13 04:39:26 +00:00
Bill Wendling	b02eadf660	Temporarily XFAIL this test. llvm-svn: 66866	2009-03-13 04:37:11 +00:00
Dan Gohman	a19c662a83	Fix a typo in a comment. llvm-svn: 66843	2009-03-12 23:55:10 +00:00
Chris Lattner	4147f08e44	Move 3 "(add (select cc, 0, c), x) -> (select cc, x, (add, x, c))" related transformations out of target-specific dag combine into the ARM backend. These were added by Evan in r37685 with no testcases and only seems to help ARM (e.g. test/CodeGen/ARM/select_xform.ll). Add some simple X86-specific (for now) DAG combines that turn things like cond ? 8 : 0 -> (zext(cond) << 3). This happens frequently with the recently added cp constant select optimization, but is a very general xform. For example, we now compile the second example in const-select.ll to: _test: movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 seta %al movzbl %al, %eax movl 4(%esp), %ecx movsbl (%ecx,%eax,4), %eax ret instead of: _test: movl 4(%esp), %eax leal 4(%eax), %ecx movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 cmovbe %eax, %ecx movsbl (%ecx), %eax ret This passes multisource and dejagnu. llvm-svn: 66779	2009-03-12 06:52:53 +00:00
Evan Cheng	4465954638	Enable Chris' value propagation change. It make available known sign, zero, one bits information for values that are live out of basic blocks. The goal is to eliminate unnecessary sext, zext, truncate of values that are live-in to blocks. This does not handle PHI nodes yet. llvm-svn: 66777	2009-03-12 06:29:49 +00:00
Chris Lattner	43d6377f89	reapply my previous patch (r66358) with a tweak to set the alignment of the generated constant pool entry to the desired alignment of a type. If we don't do this, we end up trying to do movsd from 4-byte alignment memory. This fixes 450.soplex and 456.hmmer. llvm-svn: 66641	2009-03-11 05:08:08 +00:00
Evan Cheng	aa887653f4	Revert 66358 for now. It's breaking povray, 450.soplex, and 456.hmmer on x86 / Darwin. llvm-svn: 66574	2009-03-10 20:47:18 +00:00
Chris Lattner	4249b9a698	Fix PR3763 by using proper APInt methods instead of uint64_t's. llvm-svn: 66434	2009-03-09 20:22:18 +00:00
Bill Wendling	c6869f4695	Pass in a std::string when getting the names of debugging things. This cuts down on the number of times a std::string is created and copied. llvm-svn: 66396	2009-03-09 05:04:40 +00:00
Chris Lattner	ab5a443144	implement an optimization to codegen c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4. For 2009-03-07-FPConstSelect.ll we now produce: _f: xorl %eax, %eax testl %edi, %edi movl $4, %ecx cmovne %rax, %rcx leaq LCPI1_0(%rip), %rax movss (%rcx,%rax), %xmm0 ret previously we produced: _f: subl $4, %esp cmpl $0, 8(%esp) movss LCPI1_0, %xmm0 je LBB1_2 ## entry LBB1_1: ## entry movss LCPI1_1, %xmm0 LBB1_2: ## entry movss %xmm0, (%esp) flds (%esp) addl $4, %esp ret on PPC the code also improves to: _f: cntlzw r2, r3 srwi r2, r2, 5 li r3, lo16(LCPI1_0) slwi r2, r2, 2 addis r3, r3, ha16(LCPI1_0) lfsx f1, r3, r2 blr from: _f: li r2, lo16(LCPI1_1) cmplwi cr0, r3, 0 addis r2, r2, ha16(LCPI1_1) beq cr0, LBB1_2 ; entry LBB1_1: ; entry li r2, lo16(LCPI1_0) addis r2, r2, ha16(LCPI1_0) LBB1_2: ; entry lfs f1, 0(r2) blr This also improves the existing pic-cpool case from: foo: subl $12, %esp call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax cmpl $0, 16(%esp) movsd .LCPI1_0@GOTOFF(%eax), %xmm0 je .LBB1_2 # entry .LBB1_1: # entry movsd .LCPI1_1@GOTOFF(%eax), %xmm0 .LBB1_2: # entry movsd %xmm0, (%esp) fldl (%esp) addl $12, %esp ret to: foo: call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax xorl %ecx, %ecx cmpl $0, 4(%esp) movl $8, %edx cmovne %ecx, %edx fldl .LCPI1_0@GOTOFF(%eax,%edx) ret This triggers a few dozen times in spec FP 2000. llvm-svn: 66358	2009-03-08 01:51:30 +00:00
Chris Lattner	21cf4bf235	random cleanups. llvm-svn: 66357	2009-03-08 01:47:41 +00:00
Duncan Sands	12da8ce3d2	Introduce new linkage types linkonce_odr, weak_odr, common_odr and extern_weak_odr. These are the same as the non-odr versions, except that they indicate that the global will only be overridden by an equivalent global. In C, a function with weak linkage can be overridden by a function which behaves completely differently. This means that IP passes have to skip weak functions, since any deductions made from the function definition might be wrong, since the definition could be replaced by something completely different at link time. This is not allowed in C++, thanks to the ODR (One-Definition-Rule): if a function is replaced by another at link-time, then the new function must be the same as the original function. If a language knows that a function or other global can only be overridden by an equivalent global, it can give it the weak_odr linkage type, and the optimizers will understand that it is alright to make deductions based on the function body. The code generators on the other hand map weak and weak_odr linkage to the same thing. llvm-svn: 66339	2009-03-07 15:45:40 +00:00
Dan Gohman	15af5524a4	Fix ScheduleDAGRRList::CopyAndMoveSuccessors' handling of nodes with multiple chain operands. This can occur when the scheduler has added chain operands to a node that already has a chain operand, in order to handle physical register dependencies. This fixes an llvm-gcc bootstrap failure on x86-64 introduced in r66058. llvm-svn: 66240	2009-03-06 02:23:01 +00:00
Bob Wilson	5b15d01ff3	Fix BuildVectorSDNode::isConstantSplat to handle one-element vectors. It is an error to call APInt::zext with a size that is equal to the value's current size, so use zextOrTrunc instead. llvm-svn: 66039	2009-03-04 17:47:01 +00:00
Eli Friedman	7604d37723	PR3686: make the legalizer handle bitcast from i80 to x86 long double. llvm-svn: 66021	2009-03-04 06:23:34 +00:00
Evan Cheng	b8905c4e2c	Fix PR3701. 1. X86 target renamed eflags register to flags. This matches what llvm-gcc generates so codegen knows flags register is being clobbered by inline asm. 2. BURR scheduler should also check if inline asm nodes can clobber "live" physical registers. Previously it was only checking target nodes with implicit defs. llvm-svn: 65996	2009-03-04 01:41:49 +00:00
Bill Wendling	6d2714738f	The DAG combiner was performing a BT combine. The BT combine had a value of -1, so it changed it into a 31 via the TLO.ShrinkDemandedConstant() call. Then it would go through the DAG combiner again. This time it had a value of 31, which was turned into a -1 by TLI.SimplifyDemandedBits(). This would ping pong forever. Teach the TLO.ShrinkDemandedConstant() call not to lower a value if the demanded value is an XOR of all ones. llvm-svn: 65985	2009-03-04 00:18:06 +00:00
Bob Wilson	85cefe8567	Generalize BuildVectorSDNode::isConstantSplat to use APInts and handle arbitrary vector sizes. Add an optional MinSplatBits parameter to specify a minimum for the splat element size. Update the PPC target to use the revised interface. llvm-svn: 65899	2009-03-02 23:24:16 +00:00
Nate Begeman	a9e981225e	Fix a problem with DAGCombine on 64b targets where folding extracts + build_vector into a shuffle would fail, because the type of the new build_vector would not be legal. Try harder to create a legal build_vector type. Note: this will be totally irrelevant once vector_shuffle no longer takes a build_vector for shuffle mask. New: _foo: xorps %xmm0, %xmm0 xorps %xmm1, %xmm1 subps %xmm1, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 Old: _foo: xorps %xmm0, %xmm0 movss %xmm0, %xmm1 xorps %xmm2, %xmm2 unpcklps %xmm1, %xmm2 pshufd $80, %xmm1, %xmm1 unpcklps %xmm1, %xmm2 pslldq $16, %xmm2 pshufd $57, %xmm2, %xmm1 subps %xmm0, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 llvm-svn: 65791	2009-03-01 23:44:07 +00:00
Bob Wilson	d8ea0e144e	Combine PPC's GetConstantBuildVectorBits and isConstantSplat functions to a new method in a BuildVectorSDNode "pseudo-class". llvm-svn: 65747	2009-03-01 01:13:55 +00:00
Rafael Espindola	000421eade	Refactor TLS code and add some tests. The tests and expected results are: pic \| declaration \| linkage \| visibility \| !pic \| declaration \| external \| default \| tls1.ll tls2.ll \| local exec pic \| declaration \| external \| default \| tls1-pic.ll tls2-pic.ll \| general dynamic !pic \| !declaration \| external \| default \| tls3.ll tls4.ll \| initial exec pic \| !declaration \| external \| default \| tls3-pic.ll tls4-pic.ll \| general dynamic !pic \| declaration \| external \| hidden \| tls7.ll tls8.ll \| local exec pic \| declaration \| external \| hidden \| X \| local dynamic !pic \| !declaration \| external \| hidden \| tls9.ll tls10.ll \| local exec pic \| !declaration \| external \| hidden \| X \| local dynamic !pic \| declaration \| internal \| default \| tls5.ll tls6.ll \| local exec pic \| declaration \| internal \| default \| X \| local dynamic The ones marked with an X have not been implemented since local dynamic is not implemented. llvm-svn: 65632	2009-02-27 13:37:18 +00:00
Evan Cheng	a49de9de2e	Revert BuildVectorSDNode related patches: 65426, 65427, and 65296. llvm-svn: 65482	2009-02-25 22:49:59 +00:00
Dale Johannesen	7d12ea0f62	Fix big-endian codegen bug. We're splitting up overly long ints, e.g. i96, into pieces at PHIs and the nodes that feed into them; however big-endian reverses the order of the pieces (for some reason), and wasn't doing it the same way on both sides, so the pieces didn't match and runtime failures ensued. Fixes 188.ammp and sqlite3 on ppc32. llvm-svn: 65481	2009-02-25 22:39:13 +00:00
Evan Cheng	86673f2806	Clean up dwarf writer, part 1. This eliminated the horrible recursive getGlobalVariablesUsing and replaced it something readable. It eliminated use of slow UniqueVector and replaced it with StringMap, SmallVector, and DenseMap, etc. It also fixed some non-deterministic behavior. This is a very minor compile time win. llvm-svn: 65438	2009-02-25 07:04:34 +00:00
Scott Michel	e2fdc31759	Expand tabs to spaces (overlooked in previous commit) llvm-svn: 65427	2009-02-25 03:57:49 +00:00
Scott Michel	bb878288cb	Remove all "cached" data from BuildVectorSDNode, preferring to retrieve results via reference parameters. This patch also appears to fix Evan's reported problem supplied as a reduced bugpoint test case. llvm-svn: 65426	2009-02-25 03:12:50 +00:00
Bill Wendling	c5437ea429	Overhaul my earlier submission due to feedback. It's a large patch, but most of them are generic changes. - Use the "fast" flag that's already being passed into the asm printers instead of shoving it into the DwarfWriter. - Instead of calling "MI->getParent()->getParent()" for every MI, set the machine function when calling "runOnMachineFunction" in the asm printers. llvm-svn: 65379	2009-02-24 08:30:20 +00:00
Bill Wendling	786c5973f7	- Use the "Fast" flag instead of "OptimizeForSize" to determine whether to emit a DBG_LABEL or not. We want to fall back to the original way of emitting debug info when we're in -O0/-fast mode. - Add plumbing in to pass the "Fast" flag to places that need it. - XFAIL DebugInfo/deaddebuglabel.ll. This is finding 11 labels instead of 8. I need to investigate still. llvm-svn: 65367	2009-02-24 02:35:30 +00:00
Dan Gohman	4f356bb9b0	Fix a ValueTracking rule: RHS means operand 1, not 0. Add a simple ashr instcombine to help expose this code. And apply the fix to SelectionDAG's copy of this code too. llvm-svn: 65364	2009-02-24 02:00:40 +00:00
Scott Michel	9d31aca679	Introduce the BuildVectorSDNode class that encapsulates the ISD::BUILD_VECTOR instruction. The class also consolidates the code for detecting constant splats that's shared across PowerPC and the CellSPU backends (and might be useful for other backends.) Also introduces SelectionDAG::getBUID_VECTOR() for generating new BUILD_VECTOR nodes. llvm-svn: 65296	2009-02-22 23:36:09 +00:00
Richard Pennington	99f6d7c9fc	bug 3610: Floating point vaarg not softened. llvm-svn: 65239	2009-02-21 19:11:18 +00:00
Dan Gohman	e7fe80fcf9	Fix a bug that David Greene found in the DAGCombiner's logic that checks whether it's safe to transform a store of a bitcast value into a store of the original value. llvm-svn: 65201	2009-02-20 23:29:13 +00:00
Bill Wendling	7b9f38ad37	Temporarily revert r65065. It was causing test failures. llvm-svn: 65068	2009-02-19 21:57:07 +00:00
Bill Wendling	df78dcc0b2	Check for -fast here too. llvm-svn: 65065	2009-02-19 21:23:54 +00:00
Bill Wendling	19e0a5b3c3	Generate these labels when we're in "fast" mode, not simply when we're no in "optimize-for-size" mode. llvm-svn: 65064	2009-02-19 21:12:54 +00:00
Scott Michel	cf0da6c597	Remove trailing whitespace to reduce later commit patch noise. (Note: Eventually, commits like this will be handled via a pre-commit hook that does this automagically, as well as expand tabs to spaces and look for 80-col violations.) llvm-svn: 64827	2009-02-17 22:15:04 +00:00
Bill Wendling	3c50922ea0	--- Merging (from foreign repository) r64714 into '.': U include/llvm/CodeGen/DebugLoc.h U lib/CodeGen/SelectionDAG/LegalizeDAG.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuild.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp Enable debug location generation at -Os. This goes with the reapplication of the r63639 patch. llvm-svn: 64715	2009-02-17 01:04:54 +00:00
Dan Gohman	aaee6c9523	Don't assume that a left-shift of a value with one bit set will have one bit set, because the bit may be shifted off the end. Instead, just check for a constant 1 being shifted. This is still sufficient to handle all the cases in test/CodeGen/X86/bt.ll. This fixes PR3583. llvm-svn: 64622	2009-02-15 23:59:32 +00:00
Cedric Venet	d1e179d992	Unbreak the build on win32. Cleanup some warning. Remark: when struct/class are declared differently than they are defined, this make problem for VC++ since it seems to mangle class differently that struct. These error are very hard to understand and find. So please, try to keep your definition/declaration in sync. Only tested with VS2008. hope it does not break anything. feel free to revert. llvm-svn: 64554	2009-02-14 16:06:42 +00:00
Bill Wendling	65c0fd4c44	Revert this. It was breaking stuff. llvm-svn: 64428	2009-02-13 02:16:35 +00:00
Bill Wendling	1c21ac3066	Turn off the old way of handling debug information in the code generator. Use the new way, where all of the information is passed on SDNodes and machine instructions. llvm-svn: 64427	2009-02-13 02:01:04 +00:00
Dale Johannesen	655775293f	Arrange to print constants that match "n" and "i" constraints in inline asm as signed (what gcc does). Add partial support for x86-specific "e" and "Z" constraints, with appropriate signedness for printing. llvm-svn: 64400	2009-02-12 20:58:09 +00:00
Chris Lattner	90880e2598	make fast isel fall back to selectiondags for VLA llvm.declare intrinsics. llvm-svn: 64379	2009-02-12 17:23:20 +00:00
Evan Cheng	b570499c25	Oops. Last second clean up messed things up. llvm-svn: 64373	2009-02-12 09:52:13 +00:00
Evan Cheng	3a14efacb6	Replace one of burr scheduling heuristic with something more sensible. Now calcMaxScratches simply compute the number of true data dependencies. This actually improve a couple of tests in dejagnu suite as many tests in llvm nightly test suite. llvm-svn: 64369	2009-02-12 08:59:45 +00:00
Dan Gohman	45889d24fe	Fix a comment. llvm-svn: 64328	2009-02-11 21:32:08 +00:00
Dan Gohman	6571ef3577	Don't use special heuristics for nodes with no data predecessors unless they actually have data successors, and likewise for nodes with no data successors unless they actually have data precessors. llvm-svn: 64327	2009-02-11 21:29:39 +00:00
Dan Gohman	298a2946f1	Delete the heuristic for non-livein CopyFromReg nodes. Non-liveinness is determined by whether the node has a Flag operand. However, if the node does have a Flag operand, it will be glued to its register's def, so the heuristic would end up spuriously applying to whatever node is the def. llvm-svn: 64319	2009-02-11 20:25:59 +00:00
Dale Johannesen	cc5fc44d02	Make a transformation added in 63266 a bit less aggressive. It was transforming (x&y)==y to (x&y)!=0 in the case where y is variable and known to have at most one bit set (e.g. z&1). This is not correct; the expressions are not equivalent when y==0. I believe this patch salvages what can be salvaged, including all the cases in bt.ll. Dan, please review. Fixes gcc.c-torture/execute/20040709-[12].c llvm-svn: 64314	2009-02-11 19:19:41 +00:00
Dan Gohman	dfaf646c34	When scheduling a block in parts, keep track of the overall instruction index across each part. Instruction indices are used to make live range queries, and live ranges can extend beyond scheduling region boundaries. Refactor the ScheduleDAGSDNodes class some more so that it doesn't have to worry about this additional information. llvm-svn: 64288	2009-02-11 04:27:20 +00:00
Dan Gohman	b95434356c	Factor out more code for computing register live-range informationfor scheduling, and generalize is so that preserves state across scheduling regions. This fixes incorrect live-range information around terminators and labels, which are effective region boundaries. In place of looking for terminators to anchor inter-block dependencies, introduce special entry and exit scheduling units for this purpose. llvm-svn: 64254	2009-02-10 23:27:53 +00:00
Evan Cheng	ce3bbe515b	Fix PR3457: Ignore control successors when looking for closest scheduled successor. A control successor doesn't read result(s) produced by the scheduling unit being evaluated. llvm-svn: 64210	2009-02-10 08:30:11 +00:00
Evan Cheng	3af42a8a14	If the target cannot issue a copy for the given source and dest registers, abort instead of silently continue. llvm-svn: 64184	2009-02-09 22:47:36 +00:00
Evan Cheng	fe174df170	Simplify code. llvm-svn: 64164	2009-02-09 21:01:06 +00:00
Evan Cheng	020588cee3	Make sure constant subscript is truncated to ptr size if it may not fit. llvm-svn: 64163	2009-02-09 20:54:38 +00:00
Dale Johannesen	9c310711bb	Use getDebugLoc forwarder instead of getNode()->getDebugLoc. No functional change. llvm-svn: 64026	2009-02-07 19:59:05 +00:00
Dan Gohman	747e55bc9a	Constify TargetInstrInfo::EmitInstrWithCustomInserter, allowing ScheduleDAG's TLI member to use const. llvm-svn: 64018	2009-02-07 16:15:20 +00:00
Dale Johannesen	8ba7132128	Make SDNode constructors take a DebugLoc always. Adjust derived classes to pass UnknownLoc where a DebugLoc does not make sense. Pick one of DebugLoc and non-DebugLoc variants to survive for all such classes. llvm-svn: 64000	2009-02-07 02:15:05 +00:00
Dale Johannesen	a72d41a67b	Remove now-unused constructors. llvm-svn: 63995	2009-02-07 01:27:09 +00:00
Dale Johannesen	62fd95d6ec	Get rid of the last non-DebugLoc versions of getNode! Many targets build placeholder nodes for special operands, e.g. GlobalBaseReg on X86 and PPC for the PIC base. There's no sensible way to associate debug info with these. I've left them built with getNode calls with explicit DebugLoc::getUnknownLoc operands. I'm not too happy about this but don't see a good improvement; I considered adding a getPseudoOperand or something, but it seems to me that'll just make it harder to read. llvm-svn: 63992	2009-02-07 00:55:49 +00:00
Dale Johannesen	84935759d5	Remove more non-DebugLoc getNode variants. Use getCALLSEQ_{END,START} to permit passing no DebugLoc there. UNDEF doesn't logically have DebugLoc; add getUNDEF to encapsulate this. llvm-svn: 63978	2009-02-06 23:05:02 +00:00
Dale Johannesen	dc93bbc4b0	And one more file. llvm-svn: 63971	2009-02-06 21:55:48 +00:00
Dale Johannesen	400dc2e2e4	Remove more non-DebugLoc versions of getNode. llvm-svn: 63969	2009-02-06 21:50:26 +00:00
Bill Wendling	03c34d0d3c	Clear out the CurDebugLoc info when doing a 'clear' on the SDL object. llvm-svn: 63967	2009-02-06 21:36:23 +00:00
Dale Johannesen	ab8e4425a3	Eliminate remaining non-DebugLoc version of getTargetNode. llvm-svn: 63951	2009-02-06 19:16:40 +00:00
Dan Gohman	817a24f8e9	Rename SelectionDAGISel::Schedule to SelectionDAGISel::CreateScheduler, and make it just create the scheduler. Leave running the scheduler to the higher-level code. This makes the higher-level code a little more explicit and easier to follow, and will help enable some future refactoring. llvm-svn: 63944	2009-02-06 18:26:51 +00:00
Dan Gohman	cd2cd9f5d7	Delete an unused member function. llvm-svn: 63941	2009-02-06 18:19:52 +00:00
Evan Cheng	066757eea1	Move getPointerRegClass from TargetInstrInfo to TargetRegisterInfo. llvm-svn: 63938	2009-02-06 17:43:24 +00:00
Dan Gohman	483377c639	Move ScheduleDAGSDNodes.h to be a private header. Front-ends that previously included this header should include SchedulerRegistry.h instead. llvm-svn: 63937	2009-02-06 17:22:58 +00:00
Dale Johannesen	2c4cf2752d	get rid of some non-DebugLoc getTargetNode variants. llvm-svn: 63909	2009-02-06 02:08:06 +00:00
Dale Johannesen	9f3f72f144	Get rid of one more non-DebugLoc getNode and its corresponding getTargetNode. Lots of caller changes. llvm-svn: 63904	2009-02-06 01:31:28 +00:00
Dale Johannesen	f80493bbfd	Remove a non-DebugLoc version of getNode. llvm-svn: 63889	2009-02-05 22:07:54 +00:00
Dale Johannesen	3eb373f5ce	Remove 3 non-DebugLoc variants of getNode. llvm-svn: 63886	2009-02-05 21:20:44 +00:00
Mon P Wang	3f0e0a6dea	Fix a bug where we were not emitting a cvt rnd sat node for converting between a unsigned integer and signed integer. llvm-svn: 63831	2009-02-05 04:47:42 +00:00
Dale Johannesen	b842d529a3	Reapply 63765. Patches for clang and llvm-gcc to follow. llvm-svn: 63812	2009-02-05 01:49:45 +00:00
Dale Johannesen	12c572b6fa	Get rid of 3 non-DebugLoc getNode variants. llvm-svn: 63808	2009-02-05 01:01:16 +00:00
Dale Johannesen	7ae8c8b108	Remove non-DebugLoc versions of getMergeValues, ZeroExtendInReg. llvm-svn: 63800	2009-02-05 00:20:09 +00:00
Dale Johannesen	f08a47bb70	Remove non-DebugLoc forms of CopyToReg and CopyFromReg. Adjust callers. llvm-svn: 63789	2009-02-04 23:02:30 +00:00
Dale Johannesen	ae616c2c61	Reverting 63765. This broke the build of both clang and llvm-gcc. llvm-svn: 63786	2009-02-04 22:47:25 +00:00
Stuart Hastings	556bd92698	80 column rule. llvm-svn: 63768	2009-02-04 20:30:10 +00:00
Dale Johannesen	021052a705	Remove non-DebugLoc versions of getLoad and getStore. Adjust the many callers of those versions. llvm-svn: 63767	2009-02-04 20:06:27 +00:00
Nate Begeman	6ae3aa83d0	New feature: add support for target intrinsics being defined in the target directories themselves. This also means that VMCore no longer needs to know about every target's list of intrinsics. Future work will include converting the PowerPC target to this interface as an example implementation. llvm-svn: 63765	2009-02-04 19:47:21 +00:00
Mon P Wang	34650735d0	Avoids generating a legalization assert for the case where a vector type is legal but when legalizing the operation, we split the vector type and generate a library call whose type needs to be promoted. For example, X86 with SSE on but MMX off, a divide v2i64 will be scalarized to 2 calls to a library using i64. llvm-svn: 63760	2009-02-04 19:38:14 +00:00
Stuart Hastings	ffee3d831a	Since I'm obliged to work with a development OS that currently doesn't support GraphViz, I've been using the foo->dump() facility. This patch is a minor rewrite to the SelectionDAG dump() stuff to make it a little more helpful. The existing foo->dump() functionality does not change; this patch adds foo->dumpr(). All of this is only useful when running LLVM under a debugger. llvm-svn: 63736	2009-02-04 16:46:19 +00:00
Dale Johannesen	679073b420	Remove non-DebugLoc forms of the exotic forms of Lod and Sto; patch uses. llvm-svn: 63716	2009-02-04 02:34:38 +00:00
Dale Johannesen	f2bb6f09a3	Remove some more non-DebugLoc versions of construction functions, with callers adjusted to fit. llvm-svn: 63705	2009-02-04 01:48:28 +00:00
Dale Johannesen	efb82cfbf2	Check in file I forgot. llvm-svn: 63704	2009-02-04 01:33:20 +00:00
Dale Johannesen	85263882aa	Remove a few non-DebugLoc versions of node creation functions. llvm-svn: 63703	2009-02-04 01:17:06 +00:00
Dale Johannesen	9888edee10	Fill in more omissions in DebugLog propagation. I think that's it for this directory. llvm-svn: 63690	2009-02-04 00:13:36 +00:00
Dale Johannesen	3a09f5589d	DebugLoc propagation; adjustment to things omitted from SelectionDagBuild. llvm-svn: 63680	2009-02-03 23:04:43 +00:00
Dale Johannesen	abf66b8343	Add some DL propagation to places that didn't have it yet. More coming. llvm-svn: 63673	2009-02-03 22:26:09 +00:00
Devang Patel	70da8e8425	First initialize DAG otherwise dwarf writer is used uninitialized. Duncan spotted this. Thanks! llvm-svn: 63641	2009-02-03 18:46:32 +00:00
Duncan Sands	a77c5f758c	Fix PR3411. When replacing values, nodes are analyzed in any old order. Since analyzing a node analyzes its operands also, this can mean that when we pop a node off the list of nodes to be analyzed, it may already have been analyzed. llvm-svn: 63632	2009-02-03 10:23:33 +00:00
Bill Wendling	135227a060	Pass in something sensible for the debug location information when creating the initial PHI nodes of the machine function. llvm-svn: 63598	2009-02-03 02:20:52 +00:00
Dale Johannesen	db39362c90	Fill in some missing DL propagation in getNode()s. llvm-svn: 63595	2009-02-03 01:55:44 +00:00
Bill Wendling	143a2c3470	Use SDL->getCurDebugLoc() instead of unknown loc for landing pads. llvm-svn: 63594	2009-02-03 01:55:42 +00:00
Bill Wendling	fa50a23f8a	Explicitly pass in the "unknown" debug location. This is probably not correct. We need more infrastructure before we can get the DebugLoc info for these instructions. llvm-svn: 63593	2009-02-03 01:33:28 +00:00
Bill Wendling	9862a64419	Alphabetize includes. llvm-svn: 63591	2009-02-03 01:32:22 +00:00
Bill Wendling	17450acc3b	Propagate debug loc info during SDNode -> machine instr creation. llvm-svn: 63585	2009-02-03 01:02:39 +00:00
Bill Wendling	e3c78361d3	Create DebugLoc information in FastISel. Several temporary methods were created. Specifically, those BuildMIs which use "DebugLoc::getUnknownLoc()". I'll remove them soon. llvm-svn: 63584	2009-02-03 00:55:04 +00:00
Dale Johannesen	f1163e9a4d	Propagation in TargetLowering. Includes passing a DL into SimplifySetCC which gets called elsewhere. llvm-svn: 63583	2009-02-03 00:47:48 +00:00
Dan Gohman	76a07f59d4	Use the SubclassData field to hold ExtType, isTrunc, and MemIndexedMode information. This eliminates the need for the Flags field in MemSDNode, so this makes LoadSDNode and StoreSDNode smaller. Also, it makes FoldingSetNodeIDs for loads and stores two AddIntegers smaller. llvm-svn: 63577	2009-02-03 00:08:45 +00:00
Dale Johannesen	72ba6df1a9	Last DebugLoc propagation for this file. llvm-svn: 63574	2009-02-02 23:46:53 +00:00
Dale Johannesen	b5dd922a92	More DebugLoc propagation. This should be everything except LegalizeOp itself. llvm-svn: 63560	2009-02-02 22:49:46 +00:00
Dale Johannesen	a02e45ca19	DebugLoc propagation. ExpandOp and PromoteOp, among others. llvm-svn: 63555	2009-02-02 22:12:50 +00:00
Dale Johannesen	ae7992a333	Commit missing files. llvm-svn: 63545	2009-02-02 20:47:48 +00:00
Dale Johannesen	ad00f6e010	More DebugLoc propagation. llvm-svn: 63543	2009-02-02 20:41:04 +00:00
Duncan Sands	dab7be8774	Remove trailing spaces. llvm-svn: 63540	2009-02-02 19:46:41 +00:00
Dale Johannesen	8525d83aac	DebugLoc propagation for int<->fp conversions. llvm-svn: 63537	2009-02-02 19:03:57 +00:00
Sanjiv Gupta	8e56d1898b	Duncan's patch. Further to 64382. Takes care of illegal types for shift amount. llvm-svn: 63523	2009-02-02 17:19:39 +00:00
Mon P Wang	cc866c955c	Preserve more SourceValue information. llvm-svn: 63498	2009-02-02 06:37:55 +00:00
Duncan Sands	3ed768868d	Fix PR3453 and probably a bunch of other potential crashes or wrong code with codegen of large integers: eliminate the legacy getIntegerVTBitMask and getIntegerVTSignBit methods, which returned their value as a uint64_t, so couldn't handle huge types. llvm-svn: 63494	2009-02-01 18:06:53 +00:00
Bill Wendling	a6c75ffd73	Forgot some more DebugLoc propagations. llvm-svn: 63493	2009-02-01 11:19:36 +00:00
Dale Johannesen	dfbb6a1a9a	DebugLoc propagation. llvm-svn: 63488	2009-01-31 22:04:51 +00:00
Dale Johannesen	5f98ea28ca	DebugLoc propagation. Done with file. llvm-svn: 63486	2009-01-31 21:04:24 +00:00
Dale Johannesen	4d9fa9e71d	DebugLoc propagation. Done with file. llvm-svn: 63485	2009-01-31 20:01:02 +00:00
Duncan Sands	41826036b1	Fix PR3401: when using large integers, the type returned by getShiftAmountTy may be too small to hold shift values (it is an i8 on x86-32). Before and during type legalization, use a large but legal type for shift amounts: getPointerTy; afterwards use getShiftAmountTy, fixing up any shift amounts with a big type during operation legalization. Thanks to Dan for writing the original patch (which I shamelessly pillaged). llvm-svn: 63482	2009-01-31 15:50:11 +00:00
Mon P Wang	cf9ba82324	If unsafe FP optimization is not set, don't allow -(A-B) => B-A because when A==B, -0.0 != +0.0. llvm-svn: 63474	2009-01-31 06:07:45 +00:00
Bill Wendling	3b585af0ec	Don't use DebugLoc::getUnknownLoc(). Default to something hopefully sensible. llvm-svn: 63473	2009-01-31 03:12:48 +00:00
Dale Johannesen	db7c5f6a7b	Move CurDebugLoc into SelectionDAGLowering. llvm-svn: 63468	2009-01-31 02:22:37 +00:00
Dale Johannesen	dc0f124429	Propagate debug info in LegalizeFloatTypes. Complete (modulo bugs). llvm-svn: 63458	2009-01-31 00:43:08 +00:00
Dale Johannesen	42aa385e20	Propagate debug info. This file complete (modulo bugs) llvm-svn: 63457	2009-01-31 00:20:43 +00:00
Dale Johannesen	c910889511	Propagate debug info through MakeLibCall and a couple of things that use it. llvm-svn: 63456	2009-01-31 00:11:23 +00:00
Bill Wendling	31b50991cb	More DebugLoc propagation. llvm-svn: 63454	2009-01-30 23:59:18 +00:00
Bill Wendling	27d9dd4b57	More DebugLoc propagation. llvm-svn: 63452	2009-01-30 23:36:47 +00:00
Bill Wendling	306bfc2213	More DebugLoc propagation in LOAD etc. methods. llvm-svn: 63451	2009-01-30 23:27:35 +00:00
Bill Wendling	0bd29743e3	More DebugLoc propagation in floating-point methods. llvm-svn: 63446	2009-01-30 23:15:49 +00:00
Dale Johannesen	555a375bb6	Make LowerCallTo and LowerArguments take a DebugLoc argument. Adjust all callers and overloaded versions. llvm-svn: 63444	2009-01-30 23:10:59 +00:00
Bill Wendling	6fbf5495f8	Standardize comments about folding xforms. llvm-svn: 63443	2009-01-30 23:10:18 +00:00
Bill Wendling	8fb81f1b3d	Get rid of the non-DebugLoc-ified getNOT() method. llvm-svn: 63442	2009-01-30 23:03:19 +00:00
Bill Wendling	3dc5d2454e	Propagate debug loc info for some FP arithmetic methods. llvm-svn: 63441	2009-01-30 22:57:07 +00:00
Bill Wendling	cb9be5d174	Propagate debug loc info for some FP arithmetic methods. llvm-svn: 63440	2009-01-30 22:53:48 +00:00
Bill Wendling	4e0a61514b	Propagate debug loc info for BIT_CONVERT. llvm-svn: 63439	2009-01-30 22:44:24 +00:00
Bill Wendling	7bfa43b022	Propagate debug loc info for more *_EXTEND methods. llvm-svn: 63437	2009-01-30 22:33:24 +00:00
Bill Wendling	9b3dc8d848	Propagate debug loc info for ANY_EXTEND. llvm-svn: 63436	2009-01-30 22:27:33 +00:00
Bill Wendling	c409318562	Propagate debug loc info for some of the *_EXTEND functions. llvm-svn: 63434	2009-01-30 22:23:15 +00:00
Bill Wendling	cab9a2eef5	DebugLoc form of getNOT(). llvm-svn: 63433	2009-01-30 22:11:22 +00:00
Bill Wendling	b6b6f46fe4	- Propagate debug loc info for SELECT. - Added xform for (select X, 1, Y) and (select X, Y, 0), which was commented on, but missing. llvm-svn: 63428	2009-01-30 22:02:18 +00:00
Bill Wendling	d51e3ff540	Propagate debug loc info for Shifts. llvm-svn: 63424	2009-01-30 21:37:17 +00:00
Bill Wendling	35972a9460	Propagate debug loc info for XOR and MatchRotate. llvm-svn: 63420	2009-01-30 21:14:50 +00:00
Bill Wendling	f29b6e1318	Propagate debug loc info for OR. Also clean up some comments. llvm-svn: 63419	2009-01-30 20:59:34 +00:00
Bill Wendling	ff8acd684f	Perform obvious constant arithmetic folding. llvm-svn: 63417	2009-01-30 20:50:00 +00:00
Bill Wendling	8617191302	Propagate debug loc info for AND. Also clean up some comments. llvm-svn: 63416	2009-01-30 20:43:18 +00:00
Bill Wendling	781db7a1ad	Propagate debug loc info in SimplifyBinOpWithSameOpcodeHands. llvm-svn: 63411	2009-01-30 19:25:47 +00:00
Bill Wendling	9b3407e5bb	Propagate debug loc info in SimplifyNodeWithTwoResults. llvm-svn: 63376	2009-01-30 03:08:40 +00:00
Bill Wendling	faed065e5c	Propagate debug loc info for MULHS. llvm-svn: 63375	2009-01-30 03:00:18 +00:00
Bill Wendling	d033af09fd	Propagate debug loc info for SREM and UREM. llvm-svn: 63374	2009-01-30 02:57:00 +00:00
Bill Wendling	aff3e03765	Propagate debug loc info for UDIV. llvm-svn: 63373	2009-01-30 02:55:25 +00:00
Bill Wendling	5b663e7b53	Propagate debug loc info for SDIV. llvm-svn: 63372	2009-01-30 02:52:17 +00:00
Bill Wendling	b48dcf67e5	Forgot to propagate debug loc info here. llvm-svn: 63371	2009-01-30 02:49:26 +00:00
Bill Wendling	091f92f568	Propagate debug loc info for MUL. llvm-svn: 63369	2009-01-30 02:45:56 +00:00
Bill Wendling	48ff08ef3e	Propagate debug loc info in SUB. llvm-svn: 63368	2009-01-30 02:42:10 +00:00
Bill Wendling	6127757920	Propagate debug loc info in ADDC and ADDE. llvm-svn: 63367	2009-01-30 02:38:00 +00:00
Bill Wendling	c442348dd7	Propagate debug loc info in DAG combine's "ADD". llvm-svn: 63366	2009-01-30 02:31:17 +00:00
Bill Wendling	cdd96133bd	- Propagate debug loc info in combineSelectAndUse(). - Modify ReassociateOps so that the resulting SDValue is what the comment claims it is. llvm-svn: 63365	2009-01-30 02:23:43 +00:00
Dale Johannesen	ed255b3d8e	Propagate debug info when building SelectionDAG. llvm-svn: 63359	2009-01-30 01:34:22 +00:00
Bill Wendling	9c9a3b6665	Propagate debug location info for the token factor. llvm-svn: 63355	2009-01-30 01:13:16 +00:00
Bill Wendling	f6d0aff0bd	Add DebugLoc propagation to some of the methods in DAG combiner. llvm-svn: 63350	2009-01-30 00:45:56 +00:00
Dan Gohman	14d55f0a5c	Explicitly add PseudoSourceValue information when lowering BUILD_VECTOR and conversions to stack operations. llvm-svn: 63333	2009-01-29 21:02:43 +00:00
Dan Gohman	60d6844aa8	Make a few things const, fix some comments, and simplify some assertions. llvm-svn: 63328	2009-01-29 19:49:27 +00:00
Dan Gohman	8b437ccbbe	Fix two typos that Duncan spotted in a comment. llvm-svn: 63312	2009-01-29 16:18:12 +00:00
Dan Gohman	ef04ed5477	In the case of an extractelement on an insertelement value, the element indices may be equal if either one is not a constant. llvm-svn: 63311	2009-01-29 16:10:46 +00:00
Bill Wendling	a434d930ff	Revert r63273. This was already implemented by Dale. There's no need for my change. llvm-svn: 63301	2009-01-29 09:01:55 +00:00
Bill Wendling	50338007b9	- Add DebugLoc to getTargetNode(). - Modify TableGen to add the DebugLoc when calling getTargetNode. (The light-weight wrappers are only temporary. The non-DebugLoc version will be removed once the whole debug info stuff is finished with.) llvm-svn: 63273	2009-01-29 05:27:31 +00:00
Dan Gohman	e58ab79f33	Make x86's BT instruction matching more thorough, and add some dagcombines that help it match in several more cases. Add several more cases to test/CodeGen/X86/bt.ll. This doesn't yet include matching for BT with an immediate operand, it just covers more register+register cases. llvm-svn: 63266	2009-01-29 01:59:02 +00:00
Dale Johannesen	839acbb089	Add DebugLoc-sensitive versions of many node creation functions. Currently omitted: memcpy, memmove, memset. llvm-svn: 63259	2009-01-29 00:47:48 +00:00
Bill Wendling	1b6a3bce82	Add DebugLoc to the getNode() methods. llvm-svn: 63245	2009-01-28 22:17:52 +00:00
Dale Johannesen	666bf20441	Add DebugLoc-aware constructors for SDNode derived classes (those that reasonably have a DebugLoc associated with them). llvm-svn: 63236	2009-01-28 21:18:29 +00:00
Mon P Wang	a15ea78ea6	Fixed extract element when the result needs to be promoted and the input widened. llvm-svn: 63217	2009-01-28 18:53:39 +00:00
Dan Gohman	4aa1846215	Make isOperationLegal do what its name suggests, and introduce a new isOperationLegalOrCustom, which does what isOperationLegal previously did. Update a bunch of callers to use isOperationLegalOrCustom instead of isOperationLegal. In some case it wasn't obvious which behavior is desired; when in doubt I changed then to isOperationLegalOrCustom as that preserves their previous behavior. This is for the second half of PR3376. llvm-svn: 63212	2009-01-28 17:46:25 +00:00
Duncan Sands	ba21b7d57a	Formatting. llvm-svn: 63199	2009-01-28 14:42:54 +00:00
Duncan Sands	5a913d61e3	Rename getAnalysisToUpdate to getAnalysisIfAvailable. llvm-svn: 63198	2009-01-28 13:14:17 +00:00
Dan Gohman	b3bbde3e62	Use ValueType::bitsLT to simplify some code. llvm-svn: 63170	2009-01-28 03:10:52 +00:00
Dan Gohman	172ad92b29	Use ZERO_EXTEND instead of ANY_EXTEND when promoting shift amounts, to avoid implicitly assuming that target architectures will ignore the high bits. llvm-svn: 63169	2009-01-28 02:58:31 +00:00
Dan Gohman	fb58faf29e	Add an assertion to the form of SelectionDAG::getConstant that takes a uint64_t to verify that the value is in range for the given type, to help catch accidental overflow. Fix a few places that relied on getConstant implicitly truncating the value. llvm-svn: 63128	2009-01-27 20:39:34 +00:00
Dan Gohman	0bd9546039	Delete redundant return statements. llvm-svn: 63120	2009-01-27 19:23:22 +00:00
Duncan Sands	d77e476921	Fix PR3393, which amounts to a bug in the expensive checking logic. Rather than make the checking more complicated, I've tweaked some logic to make things conform to how the checking thought things ought to be, since this results in a simpler "mental model". llvm-svn: 63048	2009-01-26 21:54:18 +00:00
Anton Korobeynikov	4b4622454c	During bittest switch lowering emit shift in the test block, which should (theoretically) allow us to generate more efficient code. We don't do this now though :) llvm-svn: 63027	2009-01-26 19:26:01 +00:00
Dan Gohman	8e4ac9b71a	Take the next steps in making SDUse more consistent with LLVM Use, and tidy up SDUse and related code. - Replace the operator= member functions with a set method, like LLVM Use has, and variants setInitial and setNode, which take care up updating use lists, like LLVM Use's does. This simplifies code that calls these functions. - getSDValue() is renamed to get(), as in LLVM Use, though most places can either use the implicit conversion to SDValue or the convenience functions instead. - Fix some more node vs. value terminology issues. Also, eliminate the one remaining use of SDOperandPtr, and SDOperandPtr itself. llvm-svn: 62995	2009-01-26 04:35:06 +00:00
Dan Gohman	f1d38be265	Eliminate the loop that searches through each of the operands of each use in the SelectionDAG ReplaceAllUses* functions. Thanks to Chris for spotting this opportunity. Also, factor out code from all 5 of the ReplaceAllUses* functions into AddNonLeafNodeToCSEMaps, which is now renamed AddModifiedNodeToCSEMaps to more accurately reflect its purpose. llvm-svn: 62964	2009-01-25 16:29:12 +00:00
Dan Gohman	3a113ec468	Whitespace tidiments. llvm-svn: 62963	2009-01-25 16:21:38 +00:00
Dan Gohman	e7b0dde2ee	Move the N->use_empty() assert from DeleteNode to DeleteNodeNotInCSEMaps, since DeleteNode just calls DeleteNodeNotInCSEMaps. llvm-svn: 62962	2009-01-25 16:20:37 +00:00
Nate Begeman	b09b0242ca	Fix an indent and a typo. llvm-svn: 62940	2009-01-24 22:12:48 +00:00
Dan Gohman	1275e28ded	Fold x-0 to x in unsafe-fp-math mode. This comes up in the testcase from PR3376, and in fact is sufficient to completely avoid the problem in that testcase. There's an underlying problem though; TLI.isOperationLegal considers Custom to be Legal, which might be ok in some cases, but that's what DAGCombiner is using in many places to test if something is legal when LegalOperations is true. When DAGCombiner is running after legalize, this isn't sufficient. I'll address this in a separate commit. llvm-svn: 62860	2009-01-23 19:10:37 +00:00
Bob Wilson	c2dc7ee5d0	Fix a minor bug in DAGCombiner's folding of SELECT. Folding "select C, 0, 1" to "C ^ 1" is only valid when C is known to be either 0 or 1. Most of the similar foldings in this function only handle "i1" types, but this one appears intentionally written to handle larger integer types. If C has an integer type larger than "i1", this needs to check if the high bits of a boolean are known to be zero. I also changed the comment to describe this folding as "C ^ 1" instead of "~C", since that is what the code does and since the latter would only be valid for "i1" types. The good news is that most LLVM targets use TargetLowering::ZeroOrOneBooleanContent so this change will not disable the optimization; the bad news is that I've been unable to come up with a testcase to demonstrate the problem. I have also removed a "FIXME" comment for folding "select C, X, 0" to "C & X", since the code looks correct to me. It could be made more aggressive by not limiting the type to "i1", but that would then require checking for TargetLowering::ZeroOrNegativeOneBooleanContent. Similar changes could be done for the other SELECT foldings, but it was decided to be not worth the trouble and complexity (see e.g., r44663). llvm-svn: 62790	2009-01-22 22:05:48 +00:00
Dan Gohman	1f3411de47	Don't create ISD::FNEG nodes after legalize if they aren't legal. Simplify x+0 to x in unsafe-fp-math mode. This avoids a bunch of redundant work in many cases, because in unsafe-fp-math mode, ISD::FADD with a constant is considered free to negate, so the DAGCombiner often negates x+0 to -0-x thinking it's free, when in reality the end result is -x, which is more expensive than x. Also, combine x*0 to 0. This fixes PR3374. llvm-svn: 62789	2009-01-22 21:58:43 +00:00
Bob Wilson	c58900504b	Add SelectionDAG::getNOT method to construct bitwise NOT operations, corresponding to the "not" and "vnot" PatFrags. Use the new method in some places where it seems appropriate. llvm-svn: 62768	2009-01-22 17:39:32 +00:00
Evan Cheng	4a0bf66eb8	Eliminate a couple of fields from TargetRegisterClass: SubRegClasses and SuperRegClasses. These are not necessary. Also eliminate getSubRegisterRegClass and getSuperRegisterRegClass. These are slow and their results can change if register file names change. Just use TargetLowering::getRegClassFor() to get the right TargetRegisterClass instead. llvm-svn: 62762	2009-01-22 09:10:11 +00:00
Chris Lattner	e09d631d8e	fix a typo llvm-svn: 62761	2009-01-22 07:21:55 +00:00
Dan Gohman	7e6b932f18	Simplify ReduceLoadWidth's logic: it doesn't need several different special cases after producing the new reduced-width load, because the new load already has the needed adjustments built into it. This fixes several bugs due to the special cases, including PR3317. llvm-svn: 62692	2009-01-21 15:17:51 +00:00
Duncan Sands	be7e41481b	Cleanup whitespace and comments, and tweak some prototypes, in operand type legalization. No functionality change. llvm-svn: 62680	2009-01-21 09:00:29 +00:00
Scott Michel	ed7d79fce4	CellSPU: - Ensure that (operation) legalization emits proper FDIV libcall when needed. - Fix various bugs encountered during llvm-spu-gcc build, along with various cleanups. - Start supporting double precision comparisons for remaining libgcc2 build. Discovered interesting DAGCombiner feature, which is currently solved via custom lowering (64-bit constants are not legal on CellSPU, but DAGCombiner insists on inserting one anyway.) - Update README. llvm-svn: 62664	2009-01-21 04:58:48 +00:00
Sanjiv Gupta	a70798cc9a	Allow targets to legalize operations (with illegal operands) that produces multiple values. For example, a load with an illegal operand (a load produces two values, a value and chain). llvm-svn: 62663	2009-01-21 04:48:39 +00:00
Bill Wendling	2395916c87	Use "SINT_TO_FP" instead of "UINT_TO_FP" when getting the exponent. This was causing the limited precision stuff to produce the wrong result for values in the range [0, 1). llvm-svn: 62615	2009-01-20 21:17:57 +00:00
Evan Cheng	c544cb0eca	Change TargetInstrInfo::isMoveInstr to return source and destination sub-register indices as well. llvm-svn: 62600	2009-01-20 19:12:24 +00:00
Bill Wendling	786a683441	Shift types need to match. llvm-svn: 62571	2009-01-20 06:10:42 +00:00
Dan Gohman	161b7b66ac	Fix a dagcombine to not generate loads of non-round integer types, as its comment says, even in the case where it will be generating extending loads. This fixes PR3216. llvm-svn: 62557	2009-01-20 01:06:45 +00:00
Devang Patel	44afc82ebe	Verify debug info. llvm-svn: 62545	2009-01-19 23:21:49 +00:00
Dan Gohman	534c8a2d72	Remove SDNode's virtual destructor. This makes it impossible for SDNode subclasses to keep state that requires non-trivial destructors, however it was already effectively impossible, since the destructor isn't actually ever called. There currently aren't any SDNode subclasses affected by this, and in general it's desireable to keep SDNode objects light-weight. This eliminates the last virtual member function in the SDNode class, so it eliminates the need for a vtable pointer, making SDNode smaller. llvm-svn: 62539	2009-01-19 22:39:36 +00:00
Dan Gohman	cd0b1bf0a0	Fix SelectionDAG::ReplaceAllUsesWith to behave correctly when uses are added to the From node while it is processing From's use list, because of automatic local CSE. The fix is to avoid visiting any new uses. Fix a few places in the DAGCombiner that assumed that after a RAUW call, the From node has no users and may be deleted. This fixes PR3018. llvm-svn: 62533	2009-01-19 21:44:21 +00:00
Sanjiv Gupta	1d2fc787a9	Few targets like PIC16 wants libcall generation for illegal type i16. llvm-svn: 62467	2009-01-18 18:25:27 +00:00
Mon P Wang	e9e7abb6b8	Simplify extract element based on comments from Duncan Sands. llvm-svn: 62459	2009-01-18 06:43:40 +00:00
Mon P Wang	ca6d6dea0b	Simplify extract element of a scalar to vector. llvm-svn: 62383	2009-01-17 00:07:25 +00:00
Dan Gohman	5f8a2598b2	Instead of adding dependence edges between terminator instructions and every other instruction in their blocks to keep the terminator instructions at the end, teach the post-RA scheduler how to operate on ranges of instructions, and exclude terminators from the range of instructions that get scheduled. Also, exclude mid-block labels, such as EH_LABEL instructions, and schedule code before them separately from code after them. This fixes problems with the post-RA scheduler moving code past EH_LABELs. llvm-svn: 62366	2009-01-16 22:10:20 +00:00
Dan Gohman	38978ba972	Use the getNode() accessor instead of accessing the Node member directly, which is private as of r55504. llvm-svn: 62364	2009-01-16 21:47:21 +00:00
Chris Lattner	41828cdb0a	new nodes should be added to the worklist, not old nodes. llvm-svn: 62359	2009-01-16 21:15:56 +00:00
Evan Cheng	968e2e7b3d	CreateVirtualRegisters does trivial copy coalescing. If a node def is used by a single CopyToReg, it reuses the virtual register assigned to the CopyToReg. This won't work for SDNode that is a clone or is itself cloned. Disable this optimization for those nodes or it can end up with non-SSA machine instructions. llvm-svn: 62356	2009-01-16 20:57:18 +00:00
Mikhail Glushenkov	6e8d814d36	Registry.h should not depend on CommandLine.h. Split Support/Registry.h into two files so that we have less to recompile every time CommandLine.h is changed. llvm-svn: 62312	2009-01-16 07:02:28 +00:00
Mikhail Glushenkov	b2f9a73029	Delete trailing whitespace. llvm-svn: 62307	2009-01-16 06:53:46 +00:00
Dan Gohman	ceac7c34f1	Initial hazard recognizer support in post-pass scheduling. This includes a new toy hazard recognizier heuristic which attempts to direct the scheduler to avoid clumping large groups of loads or stores too densely. llvm-svn: 62291	2009-01-16 01:33:36 +00:00
Devang Patel	76d190cf4a	Validate dbg_* intrinsics before lowering them. llvm-svn: 62286	2009-01-15 23:41:32 +00:00
Mon P Wang	e248edff1b	Added missing support to widen an operand from a bit convert. llvm-svn: 62285	2009-01-15 22:43:38 +00:00
Dan Gohman	7e105f0b12	Generalize the HazardRecognizer interface so that it can be used to support MachineInstr-based scheduling in addition to SDNode-based scheduling. llvm-svn: 62284	2009-01-15 22:18:12 +00:00
Rafael Espindola	6de96a1b5d	Add the private linkage. llvm-svn: 62279	2009-01-15 20:18:42 +00:00
Dan Gohman	619ef48a52	Move a few containers out of ScheduleDAGInstrs::BuildSchedGraph and into the ScheduleDAGInstrs class, so that they don't get destructed and re-constructed for each block. This fixes a compile-time hot spot in the post-pass scheduler. To help facilitate this, tidy and do some minor reorganization in the scheduler constructor functions. llvm-svn: 62275	2009-01-15 19:20:50 +00:00
Dan Gohman	307954ac69	Make getWidenVectorType const; this file was missed in the previous commit. llvm-svn: 62266	2009-01-15 17:39:39 +00:00
Dan Gohman	91febd1330	More consts on TargetLowering references. llvm-svn: 62262	2009-01-15 16:58:17 +00:00
Dan Gohman	4bdf021e05	Use const with TargetLowering references in a few more places. llvm-svn: 62260	2009-01-15 16:43:02 +00:00
Gabor Greif	08a4c281cb	minor refactoring: use a more specific API llvm-svn: 62256	2009-01-15 11:10:44 +00:00
Devang Patel	3c82aa0209	Removoe MachineModuleInfo methods (and related DebugInfoDesc class hierarchy) that were used to handle debug info. llvm-svn: 62199	2009-01-13 23:54:55 +00:00
Devang Patel	fe9581f0cd	Undo previous checkin. llvm-svn: 62190	2009-01-13 22:54:57 +00:00
Devang Patel	ca997988c3	Use dwarf writer to decide whether the module has debug info or not. llvm-svn: 62184	2009-01-13 21:25:00 +00:00
Dan Gohman	1407484178	The list-td and list-tdrr schedulers don't yet support physreg scheduling dependencies. Add assertion checks to help catch this. It appears the Mips target defaults to list-td, and it has a regression test that uses a physreg dependence. Such code was liable to be miscompiled, and now evokes an assertion failure. llvm-svn: 62177	2009-01-13 20:24:13 +00:00
Duncan Sands	ffc6133318	When replacing uses and the same node is reached via two paths, process it once not twice, d'oh! Analysis, testcase and original patch thanks to Mon Ping Wang. llvm-svn: 62169	2009-01-13 15:17:14 +00:00
Duncan Sands	90d2a7bd72	Fix some typos. Also, the WidenedVectors map was not being cleaned by ExpungeNode. llvm-svn: 62167	2009-01-13 14:42:39 +00:00
Duncan Sands	013be76241	Correct a comment - this is not a sign extension. llvm-svn: 62166	2009-01-13 14:04:14 +00:00
Devang Patel	5c6e1e3b7d	Use DebugInfo interface to lower dbg_* intrinsics. llvm-svn: 62127	2009-01-13 00:35:13 +00:00
Duncan Sands	dc020f9c3c	Rename getABITypeSize to getTypePaddedSize, as suggested by Chris. llvm-svn: 62099	2009-01-12 20:38:59 +00:00
Evan Cheng	b2c42c648d	Fix PR3241: Currently EmitCopyFromReg emits a copy from the physical register to a virtual register unless it requires an expensive cross class copy. That means we are only treating "expensive to copy" register dependency as physical register dependency. Also future proof the scheduler to handle "normal" physical register dependencies. The code is not exercised yet. llvm-svn: 62074	2009-01-12 03:19:55 +00:00
Evan Cheng	e3108148e2	CheckForPhysRegDependency should not return copy cost. It's not used. No functionality change. llvm-svn: 62036	2009-01-11 08:53:35 +00:00
Evan Cheng	ed74d8ac2a	Duplicated node may produce a non-physical register def. llvm-svn: 62015	2009-01-09 22:44:02 +00:00
Evan Cheng	0c4fe2600a	Minor debug output tweak. llvm-svn: 62005	2009-01-09 20:42:34 +00:00
Devang Patel	235acaa131	Request DwarfWriter. This will be used to handle dbg_* intrinsics. llvm-svn: 61999	2009-01-09 19:11:50 +00:00
Misha Brukman	5cbf223916	Removed trailing whitespace from Makefiles. llvm-svn: 61991	2009-01-09 16:44:42 +00:00
Dan Gohman	261ee6be57	Remove redundant 'else's. No functionality change. llvm-svn: 61891	2009-01-07 22:30:55 +00:00
Dan Gohman	c7847cdb8d	Fix a bug in ComputeLinearIndex computation handling multi-level aggregate types. Don't increment the current index after reaching the end of a struct, as it will already be pointing at one-past-the end. This fixes PR3288. llvm-svn: 61828	2009-01-06 22:53:52 +00:00
Dan Gohman	bf8e5204d1	Update these argument lists for the isNormalMemory argument. This doesn't affect current functionality. llvm-svn: 61779	2009-01-06 01:28:56 +00:00
Dan Gohman	79c3516912	Use a latency value of 0 for the artificial edges inserted by AddPseudoTwoAddrDeps. This lets the scheduling infrastructure avoid recalculating node heights. In very large testcases this was a major bottleneck. Thanks to Roman Levenstein for finding this! As a side effect, fold-pcmpeqd-0.ll is now scheduled better and it no longer requires spilling on x86-32. llvm-svn: 61778	2009-01-06 01:19:04 +00:00
Dan Gohman	dbc6c31f62	TargetLowering.h #includes SelectionDAGNodes.h, so it doesn't need its own OpActionsCapacity magic number; it can just use ISD::BUILTIN_OP_END, as long as it takes care to round up when needed. llvm-svn: 61733	2009-01-05 19:40:39 +00:00
Dan Gohman	906152a20f	Tidy up #includes, deleting a bunch of unnecessary #includes. llvm-svn: 61715	2009-01-05 17:59:02 +00:00
Devang Patel	56a8bb670f	squash warnings. llvm-svn: 61707	2009-01-05 17:31:22 +00:00
Dan Gohman	b9fa1d24f8	Fix a DAGCombiner abort on an invalid shift count constant. This fixes PR3250. llvm-svn: 61613	2009-01-03 19:22:06 +00:00
Dan Gohman	4d41fdf4ca	CommuteNodesToReducePressure() is now removed. llvm-svn: 61612	2009-01-03 19:19:30 +00:00
Dan Gohman	1be2e9650e	Remove the code from the scheduler that commuted two-address instructions to avoid copies, because TwoAddressInstructionPass also does this optimization. The scheduler's version didn't account for live-out values, which resulted in spurious commutes and missed opportunities. Now, TwoAddressInstructionPass handles all the opportunities, instead of just those that the scheduler missed. The result is usually the same, though there are occasional trivial differences resulting from the avoidance of spurious commutes. llvm-svn: 61611	2009-01-03 18:01:46 +00:00
Duncan Sands	953c9c2fbc	Factorize (and generalize) the code promoting SELECT and BRCOND conditions. Reorder a few methods while there. llvm-svn: 61547	2009-01-01 20:36:20 +00:00
Duncan Sands	19ee60848a	Remove trailing spaces. llvm-svn: 61545	2009-01-01 19:56:02 +00:00
Duncan Sands	8feb694e8f	Fix PR3274: when promoting the condition of a BRCOND node, promote from i1 all the way up to the canonical SetCC type. In order to discover an appropriate type to use, pass MVT::Other to getSetCCResultType. In order to be able to do this, change getSetCCResultType to take a type as an argument, not a value (this is also more logical). llvm-svn: 61542	2009-01-01 15:52:00 +00:00
Scott Michel	0c9259f149	Teach LeaglizeDAG that i64 mul can be a libcall. llvm-svn: 61463	2008-12-29 03:21:37 +00:00
Dale Johannesen	ee573fcefc	Change comments so everybody can understand them, hopefully. llvm-svn: 61405	2008-12-23 23:47:22 +00:00
Dale Johannesen	acc84e5aa0	Add another permutation where we should get rid of a-a. llvm-svn: 61401	2008-12-23 23:01:27 +00:00
Anton Korobeynikov	f4a66e8dda	Restore debug printing llvm-svn: 61398	2008-12-23 22:26:18 +00:00
Anton Korobeynikov	d305d00796	Sometimes APInt syntax is really ugly... :( llvm-svn: 61397	2008-12-23 22:26:01 +00:00
Anton Korobeynikov	05149bad18	Indent stuff properly llvm-svn: 61396	2008-12-23 22:25:45 +00:00
Anton Korobeynikov	6f219132a7	Initial checkin of APInt'ififcation of switch lowering llvm-svn: 61395	2008-12-23 22:25:27 +00:00
Dan Gohman	12f2490489	Clean up the atomic opcodes in SelectionDAG. This removes all the _8, _16, _32, and _64 opcodes and replaces each group with an unsuffixed opcode. The MemoryVT field of the AtomicSDNode is now used to carry the size information. In tablegen, the size-specific opcodes are replaced by size-independent opcodes that utilize the ability to compose them with predicates. This shrinks the per-opcode tables and makes the code that handles atomics much more concise. llvm-svn: 61389	2008-12-23 21:37:04 +00:00
Dan Gohman	04543e719e	Rename BuildSchedUnits to BuildSchedGraph, and refactor the code in ScheduleDAGSDNodes' BuildSchedGraph into separate functions. llvm-svn: 61376	2008-12-23 18:36:58 +00:00
Dan Gohman	92cf280dfb	Avoid an unnecessary call to allnodes_size(), which is linear. llvm-svn: 61372	2008-12-23 17:24:50 +00:00
Dale Johannesen	d2a4685860	One more permutation of subtracting off a base value. llvm-svn: 61361	2008-12-23 01:59:54 +00:00
Mon P Wang	a501640ffa	Added support for vector widening. llvm-svn: 61209	2008-12-18 20:03:17 +00:00
Mon P Wang	015a7f57b2	Fix expansion of vsetcc to set the high bit for true instead of 1. llvm-svn: 61129	2008-12-17 08:49:47 +00:00
Dan Gohman	ce70fe2e25	Double the amount of memory reserved for SUnits. This is a temporary workaround for an obscure bug. When node cloning is used, it is possible that more SUnits will be created, and if the SUnits std::vector has to reallocate, it will invalidate all the graph edges. llvm-svn: 61122	2008-12-17 04:30:46 +00:00
Eli Friedman	6cf404f2d1	Fix for PR3225: disable a broken optimization in DAGTypeLegalizer::ExpandShiftWithKnownAmountBit. In terms of restoring the optimization, the best fix here isn't obvious... any ideas? llvm-svn: 61119	2008-12-17 03:35:17 +00:00
Dale Johannesen	f51dcef803	A new dag combine; several permutations of this are there under ADD, this one was missing. llvm-svn: 61107	2008-12-16 22:13:49 +00:00
Dan Gohman	4476ef810b	Preserve SourceValue information when lowering produces multiple loads from different offsets within the same stack slot. llvm-svn: 61093	2008-12-16 18:25:36 +00:00
Evan Cheng	c35fc49477	We have decided not to support inline asm where an output operand with a matching input operand with incompatible type (i.e. either one is a floating point and the other is an integer or the sizes of the types differ). SelectionDAGBuild will catch these and exit with an error. llvm-svn: 61092	2008-12-16 18:21:39 +00:00
Dan Gohman	405f2197a4	Remove some special-case logic in ScheduleDAGSDNodes's latency computation code that is no longer needed with the new method for handling latencies. llvm-svn: 61074	2008-12-16 03:31:11 +00:00
Dan Gohman	dddc1ac7ea	Fix some register-alias-related bugs in the post-RA scheduler liveness computation code. Also, avoid adding output-depenency edges when both defs are dead, which frequently happens with EFLAGS defs. Compute Depth and Height lazily, and always in terms of edge latency values. For the schedulers that don't care about latency, edge latencies are set to 1. Eliminate Cycle and CycleBound, and LatencyPriorityQueue's Latencies array. These are all subsumed by the Depth and Height fields. llvm-svn: 61073	2008-12-16 03:25:46 +00:00
Dan Gohman	17214e633d	Make addPred and removePred return void, since the return value is not currently used by anything. llvm-svn: 61066	2008-12-16 01:00:55 +00:00
Mon P Wang	580f2c7b61	Added support for splitting and scalarizing vector shifts. llvm-svn: 61050	2008-12-15 21:44:00 +00:00
Dan Gohman	a7e139a3e6	Fix printing of PseudoSourceValues in SDNode graphs. llvm-svn: 61036	2008-12-15 17:28:10 +00:00
Mon P Wang	ac4e120912	Added support to LegalizeType for expanding the operands of scalar to vector and insert vector element. Modified extract vector element to extend the result to match the expected promoted type. llvm-svn: 61029	2008-12-15 06:57:02 +00:00
Duncan Sands	f312dc7729	Reapply r60997, this time without forgetting that target constants are allowed to have an illegal type. llvm-svn: 61006	2008-12-14 09:43:15 +00:00
Bill Wendling	e5af6f1990	Temporarily revert r60997. It was causing this failure: Running /Users/void/llvm/llvm.src/test/CodeGen/Generic/dg.exp ... FAIL: /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll Failed with exit(1) at line 1 while running: llvm-as < /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll \| llc \| /usr/bin/grep 68719476738 Assertion failed: ((TypesNeedLegalizing \|\| getTypeAction(VT) == Legal) && "Illegal type introduced after type legalization?"), function HandleOp, file /Users/void/llvm/llvm.src/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp, line 493. 0 llc 0x0085392e char const* std::find<char const, char>(char const, char const, char const&) + 98 1 llc 0x00853e63 llvm::sys::PrintStackTraceOnErrorSignal() + 593 2 libSystem.B.dylib 0x96cac09b _sigtramp + 43 3 libSystem.B.dylib 0xffffffff _sigtramp + 1765097359 4 libSystem.B.dylib 0x96d24ec2 raise + 26 5 libSystem.B.dylib 0x96d3447f abort + 73 6 libSystem.B.dylib 0x96d26063 __assert_rtn + 101 7 llc 0x004f9018 llvm::cast_retty<llvm::SubprogramDesc, llvm::DebugInfoDesc>::ret_type llvm::cast<llvm::Sub ... llvm-svn: 61001	2008-12-13 23:53:00 +00:00
Duncan Sands	24092271cc	LegalizeDAG is not supposed to introduce illegal types into the DAG if they were not already there. Check this with an assertion. llvm-svn: 60997	2008-12-13 22:33:38 +00:00
Mon P Wang	472cd640fa	Remove assertion to allow promotion of a truncating store operand llvm-svn: 60975	2008-12-13 08:16:43 +00:00
Mon P Wang	f95bd2078d	Added basic support for expanding VSETCC llvm-svn: 60974	2008-12-13 08:15:14 +00:00
Duncan Sands	b6f09933c0	On big-endian machines it is wrong to do a full width register load followed by a truncating store for the copy, since the load will not place the value in the lower bits. Probably partial loads/stores can never happen here, but fix it anyway. llvm-svn: 60972	2008-12-13 07:18:38 +00:00
Duncan Sands	8f352fe100	When expanding unaligned loads and stores do not make use of illegal integer types: instead, use a stack slot and copying via integer registers. The existing code is still used if the bitconvert is to a legal integer type. This fires on the PPC testcases 2007-09-08-unaligned.ll and vec_misaligned.ll. It looks like equivalent code is generated with these changes, just permuted, but it's hard to tell. With these changes, nothing in LegalizeDAG produces illegal integer types anymore. This is a prerequisite for removing the LegalizeDAG type legalization code. While there I noticed that the existing code doesn't handle trunc store of f64 to f32: it turns this into an i64 store, which represents a 4 byte stack smash. I added a FIXME about this. Hopefully someone more motivated than I am will take care of it. llvm-svn: 60964	2008-12-12 21:47:02 +00:00
Evan Cheng	3270a1dec3	Fix add/sub expansion: don't create ADD / SUB with two results (seems like everyone is doing this these days :-). Patch by Daniel M Gessel! llvm-svn: 60958	2008-12-12 18:49:09 +00:00
Duncan Sands	e4bcb8e2dd	When using a 4 byte jump table on a 64 bit machine, do an extending load of the 4 bytes rather than a potentially illegal (type) i32 load followed by a sign extend. llvm-svn: 60945	2008-12-12 08:13:38 +00:00
Mon P Wang	9c2d26d208	Added support for SELECT v8i8 v4i16 for X86 (MMX) Added support for TRUNC v8i16 to v8i8 for X86 (MMX) llvm-svn: 60916	2008-12-12 01:25:51 +00:00
Bill Wendling	1a317678bc	Redo the arithmetic with overflow architecture. I was changing the semantics of ISD::ADD to emit an implicit EFLAGS. This was horribly broken. Instead, replace the intrinsic with an ISD::SADDO node. Then custom lower that into an X86ISD::ADD node with a associated SETCC that checks the correct condition code (overflow or carry). Then that gets lowered into the correct X86::ADDOvf instruction. Similar for SUB and MUL instructions. llvm-svn: 60915	2008-12-12 00:56:36 +00:00
Mon P Wang	bcdbfa854a	Avoid generating a convert_rndsat node when the src and dest type are the same. llvm-svn: 60869	2008-12-11 03:30:13 +00:00
Bill Wendling	40d2476adc	Clarify FIXME. llvm-svn: 60867	2008-12-11 01:26:44 +00:00
Mon P Wang	c68b3c4fc1	Whitespace clean up (tabs with spaces) llvm-svn: 60866	2008-12-11 00:44:22 +00:00
Mon P Wang	b5eb7205ea	Make fix for r60829 less conservative to allow the proper optimization for vec_extract-sse4.ll. llvm-svn: 60865	2008-12-11 00:26:16 +00:00
Bill Wendling	0864a75ebf	If ADD, SUB, or MUL have an overflow bit that's used, don't do transformation on them. The DAG combiner expects that nodes that are transformed have one value result. llvm-svn: 60857	2008-12-10 22:36:00 +00:00
Duncan Sands	09ed3bba2b	For amusement, implement SADDO, SSUBO, UADDO, USUBO for promoted integer types, eg: i16 on ppc-32, or i24 on any platform. Complete support for arbitrary precision integers would require handling expanded integer types, eg: i128, but I couldn't be bothered. llvm-svn: 60834	2008-12-10 12:30:42 +00:00
Mon P Wang	4637c3c698	Fixed a bug when trying to optimize a extract vector element of a bit convert that changes the number of elements of a shuffle. llvm-svn: 60829	2008-12-10 03:59:02 +00:00
Bill Wendling	f482f379ef	Whitespace changes. llvm-svn: 60826	2008-12-10 02:01:32 +00:00
Bill Wendling	4eb2dcdc45	Whitespace fixes. llvm-svn: 60818	2008-12-10 00:28:22 +00:00
Dan Gohman	2d170896ee	Rewrite the SDep class, and simplify some of the related code. The Cost field is removed. It was only being used in a very limited way, to indicate when the scheduler should attempt to protect a live register, and it isn't really needed to do that. If we ever want the scheduler to start inserting copies in non-prohibitive situations, we'll have to rethink some things anyway. A Latency field is added. Instead of giving each node a single fixed latency, each edge can have its own latency. This will eventually be used to model various micro-architecture properties more accurately. The PointerIntPair class and an internal union are now used, which reduce the overall size. llvm-svn: 60806	2008-12-09 22:54:47 +00:00
Bill Wendling	db8ec2d75a	Add sub/mul overflow intrinsics. This currently doesn't have a target-independent way of determining overflow on multiplication. It's very tricky. Patch by Zoltan Varga! llvm-svn: 60800	2008-12-09 22:08:41 +00:00
Duncan Sands	445071c44f	Fix PR3117: not all nodes being legalized. The essential problem was that the DAG can contain random unused nodes which were never analyzed. When remapping a value of a node being processed, such a node may become used and need to be analyzed; however due to operands being transformed during analysis the node may morph into a different one. Users of the morphing node need to be updated, and this wasn't happening. While there I added a bunch of documentation and sanity checks, so I (or some other poor soul) won't have to scratch their head over this stuff so long trying to remember how it was all supposed to work next time some obscure problem pops up! The extra sanity checking exposed a few places where invariants weren't being preserved, so those are fixed too. Since some of the sanity checking is expensive, I added a flag to turn it on. It is also turned on when building with ENABLE_EXPENSIVE_CHECKS=1. llvm-svn: 60797	2008-12-09 21:33:20 +00:00
Mon P Wang	8a5366332f	In LegalizeOp, don't change the result type of CONVERT_RNDSAT when promoting one of its operand. llvm-svn: 60749	2008-12-09 07:27:39 +00:00
Mon P Wang	4dd832d241	Fix getNode to allow a vector for the shift amount for shifts of vectors. Fix the shift amount when unrolling a vector shift into scalar shifts. Fix problem in getShuffleScalarElt where it assumes that the input of a bit convert must be a vector. llvm-svn: 60740	2008-12-09 05:46:39 +00:00
Dan Gohman	4c31524bec	Factor out the code for sign-extending/truncating gep indices and use it in x86 address mode folding. Also, make getRegForValue return 0 for illegal types even if it has a ValueMap for them, because Argument values are put in the ValueMap. This fixes PR3181. llvm-svn: 60696	2008-12-08 07:57:47 +00:00
Duncan Sands	471a654711	When allocating a stack temporary, use the correct number of bytes for types such as i1 which are not a multiple of 8 bits in length. llvm-svn: 60543	2008-12-04 18:08:40 +00:00
Dan Gohman	30cad9c192	Make debug output more informative. llvm-svn: 60524	2008-12-04 02:14:57 +00:00
Duncan Sands	f52e518d05	Only check that the result of the mapping was not a new node if the node was actually remapped. llvm-svn: 60482	2008-12-03 12:36:16 +00:00
Evan Cheng	e62150cae4	Remove a (what appears to be) overly strict assertion. Here is what happened: 1. ppcf128 select is expanded to f64 select's. 2. f64 select operand 0 is an i1 truncate, it's promoted to i32 zero_extend. 3. f64 select is updated. It's changed back to a "NewNode" and being re-analyzed. 4. f64 select operands are being processed. Operand 0 is a "NewNode". It's being expunged out of ReplacedValues map. 5. ExpungeNode tries to remap f64 select and notice it's a "NewNode" and assert. Duncan, please take a look. Thanks. llvm-svn: 60443	2008-12-02 21:57:09 +00:00
Scott Michel	9b0b28e021	Non-functional change: make custom lowering for truncate stylistically consistent with the way it's generally done in other places. llvm-svn: 60439	2008-12-02 19:55:08 +00:00
Dale Johannesen	54bdec238a	One more transformation. llvm-svn: 60432	2008-12-02 18:40:40 +00:00
Tilmann Scheller	318ccb0e62	make it possible to custom lower TRUNCATE (needed for the CellSPU target) llvm-svn: 60409	2008-12-02 12:12:25 +00:00
Mon P Wang	6e1c6ad127	Removed some unnecessary code in widening. llvm-svn: 60406	2008-12-02 07:35:08 +00:00
Dale Johannesen	8c76670b5a	Add a few more transformations. llvm-svn: 60391	2008-12-02 01:30:54 +00:00
Bill Wendling	2d59863d06	Expand getVTList, getNodeValueTypes, and SelectNodeTo to handle more value types. llvm-svn: 60381	2008-12-01 23:28:22 +00:00
Duncan Sands	3d960941b1	There are no longer any places that require a MERGE_VALUES node with only one operand, so get rid of special code that only existed to handle that possibility. llvm-svn: 60349	2008-12-01 11:41:29 +00:00
Duncan Sands	6ed40141f7	Change the interface to the type legalization method ReplaceNodeResults: rather than returning a node which must have the same number of results as the original node (which means mucking around with MERGE_VALUES, and which is also easy to get wrong since SelectionDAG folding may mean you don't get the node you expect), return the results in a vector. llvm-svn: 60348	2008-12-01 11:39:25 +00:00
Eli Friedman	c8228d263b	Followup to r60283: optimize arbitrary width signed divisions as well as unsigned divisions. Same caveats as before. llvm-svn: 60284	2008-11-30 06:35:39 +00:00
Eli Friedman	1b7fc154a5	Fix for PR2164: allow transforming arbitrary-width unsigned divides into multiplies. Some more cleverness would be nice, though. It would be nice if we could do this transformation on illegal types. Also, we would prefer a narrower constant when possible so that we can use a narrower multiply, which can be cheaper. llvm-svn: 60283	2008-11-30 06:02:26 +00:00
Eli Friedman	bd0f57821a	APIntify a test which is potentially unsafe otherwise, and fix the nearby FIXME. I'm not sure what the right way to fix the Cell test was; if the approach I used isn't okay, please let me know. llvm-svn: 60277	2008-11-30 04:59:26 +00:00
Sanjiv Gupta	7ae1a84465	Removing redundant semicolons. No functionality change. llvm-svn: 60149	2008-11-27 05:58:04 +00:00
Dale Johannesen	73bc0ba4c9	Add a missing case in visitADD. llvm-svn: 60137	2008-11-27 00:43:21 +00:00
Sanjiv Gupta	80810f8c6b	Allow custom lowering of ADDE/ADDC/SUBE/SUBC operations. llvm-svn: 60102	2008-11-26 11:19:00 +00:00
Bill Wendling	b4ff5322c1	A simplification for checking whether the signs of the operands and sum differ. Thanks, Duncan. llvm-svn: 60043	2008-11-25 19:40:17 +00:00
Bill Wendling	bf592fccd4	Now with the correct type for the 0. llvm-svn: 60016	2008-11-25 08:19:22 +00:00
Bill Wendling	d06c625b95	Get rid of unused variable. llvm-svn: 60015	2008-11-25 08:13:20 +00:00
Bill Wendling	4498b47677	Hacker's Delight says, "Signed integer overflow of addition occurs if and only if the operands have the same sign and the sum has sign opposite to that of the operands." llvm-svn: 60014	2008-11-25 08:12:19 +00:00
Dan Gohman	ad2134d45d	Initial support for anti-dependence breaking. Currently this code does not introduce any new spilling; it just uses unused registers. Refactor the SUnit topological sort code out of the RRList scheduler and make use of it to help with the post-pass scheduler. llvm-svn: 59999	2008-11-25 00:52:40 +00:00
Bill Wendling	66835479d7	- Make lowering of "add with overflow" customizable by back-ends. - Mark "add with overflow" as having a custom lowering for X86. Give it a null lowering representation for now. llvm-svn: 59971	2008-11-24 19:21:46 +00:00
Dan Gohman	5cc12a8e31	Check in the rest of this change. The isAntiDep flag needs to be passed to removePred because an SUnit can both data-depend and anti-depend on the same SUnit. llvm-svn: 59969	2008-11-24 17:33:52 +00:00
Duncan Sands	dc2dac181a	If the type legalizer actually legalized anything (this doesn't happen that often, since most code does not use illegal types) then follow it by a DAG combiner run that is allowed to generate illegal operations but not illegal types. I didn't modify the target combiner code to distinguish like this between illegal operations and illegal types, so it will not produce illegal operations as well as not producing illegal types. llvm-svn: 59960	2008-11-24 14:53:14 +00:00
Evan Cheng	a8fd1f2c8e	Eliminate some unused variable compile time warnings. llvm-svn: 59952	2008-11-24 07:09:49 +00:00
Bill Wendling	2278f8f5e1	Add support for llvm.uadd.with.overflow. llvm-svn: 59926	2008-11-24 01:38:29 +00:00
Duncan Sands	8d6e2e13d5	Rename SetCCResultContents to BooleanContents. In practice these booleans are mostly produced by SetCC, however the concept is more general. llvm-svn: 59911	2008-11-23 15:47:28 +00:00
Mon P Wang	2967480f54	Added check to avoid generating extract subvector beyond the end of the vector when normalizing vector shuffles. llvm-svn: 59900	2008-11-23 04:35:05 +00:00
Bill Wendling	5424e6d4ec	Cleanup of the [SU]ADDO type legalization code. Patch by Duncan! "It simplifies the type legalization part a bit, and produces better code by teaching SelectionDAG about the extra bits in an i8 SADDO/UADDO node. In essence, I spontaneously decided that on x86 this i8 boolean result would be either 0 or 1, and on other platforms 0/1 or 0/-1, depending on whether the platform likes it's boolean zero extended or sign extended." llvm-svn: 59864	2008-11-22 07:24:01 +00:00
Bill Wendling	be8e7f851c	- Move conversion of [SU]ADDO from DAG combiner into legalizer. - Add "promote integer type" stuff to the legalizer for these nodes. llvm-svn: 59847	2008-11-22 00:22:52 +00:00
Dan Gohman	8dfa51c5ef	Update comments. llvm-svn: 59834	2008-11-21 19:10:41 +00:00
Chris Lattner	dd7083452f	reapply Sanjiv's patch to genericize memcpy/memset/memmove to take an arbitrary integer width for the count. llvm-svn: 59823	2008-11-21 16:42:48 +00:00
Bill Wendling	4bce2bff88	Revert r59802. It was breaking the build of llvm-gcc: g++ -m32 -c -g -DIN_GCC -W -Wall -Wwrite-strings -Wmissing-format-attribute -fno-common -mdynamic-no-pic -DHAVE_CONFIG_H -Wno-unused -DTARGET_NAME=\"i386-apple-darwin9.5.0\" -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include ../../llvm-gcc.src/gcc/llvm-types.cpp -o llvm-types.o ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemCpy(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1496: error: 'memcpy_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemMove(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1512: error: 'memmove_i64' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp: In member function 'void TreeToLLVM::EmitMemSet(llvm::Value, llvm::Value, llvm::Value, unsigned int)': ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i32' is not a member of 'llvm::Intrinsic' ../../llvm-gcc.src/gcc/llvm-convert.cpp:1528: error: 'memset_i64' is not a member of 'llvm::Intrinsic' make[3]: [llvm-convert.o] Error 1 make[3]: * Waiting for unfinished jobs.... rm fsf-funding.pod gcov.pod gfdl.pod cpp.pod gpl.pod gcc.pod make[2]: * [all-stage1-gcc] Error 2 make[1]: * [stage1-bubble] Error 2 make: *** [all] Error 2 llvm-svn: 59809	2008-11-21 09:09:41 +00:00
Sanjiv Gupta	09a203765a	Make mem[cpy,move,set] intrinsics overloaded. llvm-svn: 59802	2008-11-21 07:49:09 +00:00
Bill Wendling	0b5be6c5e0	Default to converting UADDO to the generic form that SADDO is converted to. llvm-svn: 59801	2008-11-21 07:44:30 +00:00
Mon P Wang	c311360909	Clean up normalization of shuffles llvm-svn: 59792	2008-11-21 04:25:21 +00:00
Bill Wendling	5eee74446d	Combine the two add with overflow intrinsics lowerings. They differ only in DAG node type. llvm-svn: 59788	2008-11-21 02:38:44 +00:00
Bill Wendling	87c175e629	Generate code for llvm.uadd.with.overflow intrinsic. No conversion support yet. llvm-svn: 59786	2008-11-21 02:33:36 +00:00
Dan Gohman	f00cef4491	Add a flag to SDep for tracking which edges are anti-dependence edges. llvm-svn: 59785	2008-11-21 02:27:52 +00:00
Bill Wendling	8badb674eb	Remove chains. Unnecessary. llvm-svn: 59783	2008-11-21 02:22:59 +00:00
Dan Gohman	67b35bd4d1	Rename SDep's isSpecial to isArtificial, to make this field a little less mysterious. llvm-svn: 59782	2008-11-21 02:18:56 +00:00
Bill Wendling	77538cc510	Rename "ADDO" to "SADDO" and "UADDO". The "UADDO" isn't equivalent to "ADDC" because the boolean it returns to indicate an overflow may not be treated like as a flag. It could be stored to memory, for instance. llvm-svn: 59780	2008-11-21 02:12:42 +00:00
Bill Wendling	74296c60ff	Implement the sadd_with_overflow intrinsic. This is converted into "ISD::ADDO". ISD::ADDO is lowered into a target-independent form that does the addition and then checks if the result is less than one of the operands. (If it is, then there was an overflow.) llvm-svn: 59779	2008-11-21 02:03:52 +00:00
Dan Gohman	d1f33e2397	Use ComputeLatency in the MachineInstr scheduler. llvm-svn: 59777	2008-11-21 01:44:51 +00:00
Dan Gohman	63be531e09	Remove the CycleBound computation code from the ScheduleDAGRRList schedulers. This doesn't have much immediate impact because targets that use these schedulers by default don't yet provide pipeline information. This code also didn't have the benefit of register pressure information. Also, removing it will avoid problems with list-burr suddenly starting to do latency-oriented scheduling on x86 when we start providing pipeline data, which would increase spilling. llvm-svn: 59775	2008-11-21 01:30:54 +00:00
Dan Gohman	7b7ca502fa	Implement ComputeLatency for MachineInstr ScheduleDAGs. Factor some of the latency computation logic out of the SDNode ScheduleDAG code into a TargetInstrItineraries helper method to help with this. llvm-svn: 59761	2008-11-21 00:12:10 +00:00
Bill Wendling	39acb29ff8	Add UADDO and SADDO nodes. These will be used for determining an overflow condition in an addition operation. llvm-svn: 59760	2008-11-21 00:11:16 +00:00
Dan Gohman	c602dd407c	Change these schedulers to not emit no-ops. It turns out that the RR scheduler actually does look at latency values, but it doesn't use a hazard recognizer so it has no way to know when a no-op is needed, as opposed to just stalling and incrementing the cycle count. llvm-svn: 59759	2008-11-21 00:10:42 +00:00
Duncan Sands	3fa0a5afab	Add some documentation. llvm-svn: 59727	2008-11-20 10:34:43 +00:00
Bill Wendling	165b45d385	80-column violation. llvm-svn: 59718	2008-11-20 07:24:30 +00:00
Dan Gohman	8e066a1349	Remove a remnant of list-burr's fast mode. llvm-svn: 59702	2008-11-20 03:32:45 +00:00
Dan Gohman	186f65d275	Factor out the SethiUllman numbering logic from the list-burr and list-tdrr schedulers into a common base class. llvm-svn: 59701	2008-11-20 03:30:37 +00:00
Dan Gohman	fd08af4ee7	Remove the "fast" form of the list-burr scheduler, and use the dedicated "fast" scheduler in -fast mode instead, which is faster. This speeds up llc -fast by a few percent on some testcases -- the speedup only happens for code not handled by fast-isel. llvm-svn: 59700	2008-11-20 03:11:19 +00:00
Dan Gohman	3f656dfa03	Facter AddPseudoTwoAddrDeps and associated infrasructure out of the list-burr scheduler so that it can be used by the list-tdrr scheduler too. llvm-svn: 59698	2008-11-20 02:45:51 +00:00
Dan Gohman	4ce15e12b9	Factor out the code for verifying the work of the scheduler, extend it a bit, and make use of it in all schedulers, to ensure consistent checking. llvm-svn: 59689	2008-11-20 01:26:25 +00:00
Dan Gohman	4c3034f711	Simplify this code a little. In the fast scheduler, CreateNewSUnit and CreateClone don't add any extra value. llvm-svn: 59679	2008-11-19 23:39:02 +00:00
Dan Gohman	60cb69e665	Experimental post-pass scheduling support. Post-pass scheduling is currently off by default, and can be enabled with -disable-post-RA-scheduler=false. This doesn't have a significant impact on most code yet because it doesn't yet do anything to address anti-dependencies and it doesn't attempt to disambiguate memory references. Also, several popular targets don't have pipeline descriptions yet. The majority of the changes here are splitting the SelectionDAG-specific code out of ScheduleDAG, so that ScheduleDAG can be moved to libLLVMCodeGen.a. The interface between ScheduleDAG-using code and the rest of the scheduling code is somewhat rough and will evolve. llvm-svn: 59676	2008-11-19 23:18:57 +00:00
Dan Gohman	f4d95fdce9	Move the code for printing a graph node label for an SUnit into a virtual method of SelectionDAG. llvm-svn: 59667	2008-11-19 22:09:45 +00:00
Dan Gohman	78fb6214f3	Convert SUnit's dump method into a print method and implement dump in terms of it. llvm-svn: 59665	2008-11-19 21:32:03 +00:00
Dan Gohman	82016c243b	Rearrange code to reduce the nesting level. No functionality change. llvm-svn: 59580	2008-11-19 02:00:32 +00:00
Dan Gohman	eb87975384	Fix debug printing of flagged SDNodes in SUnits so that they print in the correct order. llvm-svn: 59567	2008-11-19 00:04:44 +00:00
Dan Gohman	6e58726416	Tidy up ScheduleNodeBottomUp methods, and make them more consistent with ScheduleNodeTopDown methods. llvm-svn: 59550	2008-11-18 21:22:20 +00:00
Dan Gohman	71b632f905	Update a comment to reflect the current code. llvm-svn: 59549	2008-11-18 21:14:44 +00:00
Duncan Sands	3ca78c675e	Remove integer promotion support for FP_EXTEND and FP_ROUND. Not sure what these were doing here - probably they were sometimes (wrongly) created with integer operands somewhere that has since been fixed. llvm-svn: 59548	2008-11-18 21:13:59 +00:00
Duncan Sands	97933c3990	Simplify code using helper routines. There is not supposed to be any functionality change. llvm-svn: 59545	2008-11-18 20:56:22 +00:00
Dan Gohman	1132313e71	Whitespace cleanups. llvm-svn: 59532	2008-11-18 17:05:42 +00:00
Duncan Sands	789dbb906d	LegalizeTypes support for splitting and scalarizing SCALAR_TO_VECTOR. I didn't add the testcase, because once llc gets past scalar-to-vector it hits a SPU target lowering bug and explodes. llvm-svn: 59530	2008-11-18 16:40:48 +00:00
Bill Wendling	13020d22da	Rename stackprotector_create intrinsic to stackprotector. llvm-svn: 59519	2008-11-18 11:01:33 +00:00
Duncan Sands	1315f80ea8	Reapply r59464, this time using the correct type when softening FNEG. llvm-svn: 59513	2008-11-18 09:15:03 +00:00
Bill Wendling	7235002bd1	Remove the stackprotector_check intrinsic. Use a volatile load instead. llvm-svn: 59504	2008-11-18 07:30:57 +00:00
Dan Gohman	fe1748da07	Fix a typo in a comment. llvm-svn: 59489	2008-11-18 02:50:01 +00:00
Dan Gohman	22d07b14bc	Change SUnit's dump method to take a ScheduleDAG* instead of a SelectionDAG*. llvm-svn: 59488	2008-11-18 02:06:40 +00:00
Bill Wendling	e0d5e67c98	Revert r59464. It was causing this failure: Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/CodeGen/XCore/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/CodeGen/XCore/fneg.ll Failed with signal(SIGABRT) at line 1 while running: llvm-as < /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/CodeGen/XCore/fneg.ll \| llc -march=xcore > fneg.ll.tmp1.s Assertion failed: (VT.isFloatingPoint() && "Cannot create integer FP constant!"), function getConstantFP, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/lib/CodeGen/SelectionDAG/SelectionDAG.cpp, line 913. 0 llc 0x0092115c _ZN4llvm3sys18RemoveFileOnSignalERKNS0_4PathEPSs + 844 1 libSystem.B.dylib 0x9217809b _sigtramp + 43 2 ??? 0xffffffff 0x0 + 4294967295 3 libSystem.B.dylib 0x921f0ec2 raise + 26 4 libSystem.B.dylib 0x9220047f abort + 73 5 libSystem.B.dylib 0x921f2063 __assert_rtn + 101 6 llc 0x005a5b0a _ZN4llvm12SelectionDAG13getConmake[1]: * [check-local] Error 1 make: * [check] Error 2 llvm-svn: 59487	2008-11-18 01:49:24 +00:00
Dan Gohman	5ebdb98a6e	Avoid using a loop in ReleasePred and ReleaseSucc methods to compute the new CycleBound value. Instead, just update CycleBound on each call. Also, make ReleasePred and ReleaseSucc methods more consistent accross the various schedulers. This also happens to make ScheduleDAGRRList's CycleBound computation somewhat more interesting, though it still doesn't have any noticeable effect, because no current targets that use the register-pressure reduction scheduler provide pipeline models. llvm-svn: 59475	2008-11-18 00:38:59 +00:00
Dan Gohman	92a36d7a78	Eliminate some trivial differences between the ScheduleNodeTopDown functions in these two schedulers. llvm-svn: 59465	2008-11-17 21:31:02 +00:00
Duncan Sands	f046b50ecd	Add soft float support for a bunch more operations. Original patch by Richard Osborne, tweaked and extended by your humble servant. llvm-svn: 59464	2008-11-17 20:52:38 +00:00
Dan Gohman	4f474b092e	Don't bother doing latency calculations in the "fast" scheduler. llvm-svn: 59461	2008-11-17 19:52:36 +00:00
Dan Gohman	a687fd8339	Use SUnit's CycleBound field instead of duplicating it in a side-car datastructure llvm-svn: 59458	2008-11-17 19:45:19 +00:00
Richard Osborne	6751b4a604	Don't produce ADDC/ADDE when expanding SHL unless they are legal for the target. This fixes PR3080. llvm-svn: 59450	2008-11-17 17:34:31 +00:00
Dan Gohman	17c226b8ca	Don't use the isPending flag to mean what the isAvailable flag means. llvm-svn: 59445	2008-11-17 16:37:30 +00:00
Mon P Wang	4964368e0d	Fixed legalization of CONVERT_RNDSAT for integers. llvm-svn: 59432	2008-11-17 00:41:12 +00:00
Mon P Wang	7a82474387	Improved shuffle normalization to avoid using extract/build when we can extract using different indexes for two vectors. Added a few tests for vector shuffles. llvm-svn: 59399	2008-11-16 05:06:27 +00:00
Duncan Sands	da8d2873ed	When splitting a SHUFFLE_VECTOR, try to have the result use SHUFFLE_VECTOR instead. If not practical, fall back to the old scheme of building the split result by hand using a BUILD_VECTOR. llvm-svn: 59361	2008-11-15 09:25:38 +00:00
Mon P Wang	f414cbc1fd	Add missing widen operations, fixed widening for extracting a subvector, and when loading/storing a widen vector, make sure that they are loaded and stored in consecutive order. llvm-svn: 59357	2008-11-15 06:05:52 +00:00
Dan Gohman	68294c06fe	Correct a comment. llvm-svn: 59341	2008-11-15 00:24:23 +00:00
Dan Gohman	d2760c0473	Move ScheduleDAGList's LatencyPriorityQueue class out to a separate file. llvm-svn: 59340	2008-11-15 00:23:40 +00:00
Dan Gohman	1472955eab	Add support for building a ScheduleDAG from MachineInstrs. This is currently fairly conservative; it doesn't do alias-analysis queries and it doesn't attempt to break anti-dependencies. llvm-svn: 59324	2008-11-14 21:47:58 +00:00
Dan Gohman	db8b95a4fa	For post-regalloc scheduling, remove the instructions from the block before re-inserting them. llvm-svn: 59281	2008-11-14 00:33:17 +00:00
Dan Gohman	1a21ab6925	Check in the correct version of the patch in r59279. llvm-svn: 59280	2008-11-14 00:32:34 +00:00
Dan Gohman	8f973f157d	Debug printing for SUnits that carry MachineInstrs. llvm-svn: 59279	2008-11-14 00:28:56 +00:00
Dan Gohman	ee8273e52f	Initial support for carrying MachineInstrs in SUnits. llvm-svn: 59278	2008-11-14 00:06:09 +00:00
Dan Gohman	a2cbbaa41f	Change DOTGraphTraits<ScheduleDAG*>::getGraphName how to find the name of the current function on its own, rather than relying on the SelectionDAG. llvm-svn: 59277	2008-11-13 23:45:55 +00:00
Dan Gohman	072734ebd6	Remove the FlaggedNodes member from SUnit. Instead of requiring each SUnit to carry a SmallVector of flagged nodes, just calculate the flagged nodes dynamically when they are needed. The local-liveness change is due to a trivial scheduling change where the scheduler arbitrary decision differently. llvm-svn: 59273	2008-11-13 23:24:17 +00:00
Dan Gohman	1ddfcba5be	Make the Node member of SUnit private, and add accessors. llvm-svn: 59264	2008-11-13 21:36:12 +00:00
Dan Gohman	5a390b974c	Change ScheduleDAG's DAG member from a reference to a pointer, to prepare for the possibility of scheduling without a SelectionDAG being present. llvm-svn: 59263	2008-11-13 21:21:28 +00:00
Dan Gohman	88ba5f0b96	Move the code that inserts X87 FP_REG_KILL instructions from a special-purpose hook to a new pass. Also, add check to see if any x87 virtual registers are used, to avoid doing any work in the common case that no x87 code is needed. llvm-svn: 59190	2008-11-12 22:55:05 +00:00
Dale Johannesen	6467858be1	Fix unsigned char->ppcf128 conversion. llvm-svn: 59150	2008-11-12 18:38:44 +00:00
Duncan Sands	2907b0085c	Simplify SplitVecRes_EXTRACT_SUBVECTOR. This means that it no longer handles non-power-of-two vectors. However it previously only handled them sometimes, depending on obscure numerical relationships between the index and vector type. For example, for a vector of length 6, it would succeed if and only if the index was an even multiple of 6. I consider this more confusing than useful. llvm-svn: 59122	2008-11-12 08:37:57 +00:00
Duncan Sands	aa7060c885	Correct some thinkos in the expansion of ADD/SUB when the target does not support ADDC/SUBC. This fixes PR3044. llvm-svn: 59120	2008-11-12 08:23:26 +00:00
Dale Johannesen	ffc67df2aa	Fix the testb optimization so x86 also bootstraps. Reenable test. llvm-svn: 59101	2008-11-12 02:00:35 +00:00
Dan Gohman	e52e0897e2	In ScheduleDAGRRList::CopyAndMoveSuccessors, create the SUnit for the load before creating the SUnit for the operation that it was unfolded from. This allows each SUnit to have all of its predecessor SUnits available at the time it is created. I don't know yet if this will be absolutely required, but it is a little tidier to do it this way. llvm-svn: 59083	2008-11-11 21:34:44 +00:00
Dan Gohman	fb78ef9fd3	Avoid relying on the SelectionDAG for initializing the MachineFunction and TargetLoweringInfo variables for the scheduler. llvm-svn: 59082	2008-11-11 21:31:56 +00:00
Dan Gohman	5499e89d06	Change the scheduler accessor methods to accept an explicit TargetMachine argument instead of taking the SelectionDAG's TargetMachine. This is needed for some upcoming scheduler changes. llvm-svn: 59055	2008-11-11 17:50:47 +00:00
Bill Wendling	49a5ce863e	Fix for PR3040: The CC was changed, but wasn't checked to see if it was legal if the DAG combiner was being run after legalization. Threw in a couple of checks just to make sure that it's okay. As far as the PR is concerned, no back-end target actually exhibited this problem, so there isn't an associated testcase. llvm-svn: 59035	2008-11-11 08:25:46 +00:00
Mon P Wang	774e9ac433	Cleaned up and fix bugs in convert_rndsat node llvm-svn: 59025	2008-11-11 05:40:06 +00:00
Bill Wendling	b85755c829	Temporarily revert r58979 and related patch. It's causing a failure in X86 bootstrap: Comparing stages 2 and 3 warning: ./cc1-checksum.o differs warning: ./cc1obj-checksum.o differs warning: ./cc1objplus-checksum.o differs warning: ./cc1plus-checksum.o differs Bootstrap comparison failure! ./alias.o differs ./alloc-pool.o differs ./attribs.o differs ./bb-reorder.o differs ./bitmap.o differs ./build/errors.o differs ./build/genattrtab.o differs ./build/genautomata.o differs ./build/genemit.o differs ./build/genextract.o differs ... -bw llvm-svn: 59003	2008-11-10 21:22:06 +00:00
Mon P Wang	58fb9135e2	Added CONVERT_RNDSAT (conversion with rounding and saturation) SDNode to support targets that support these conversions. Users should avoid using this node as the current targets don't generating code for it. llvm-svn: 59001	2008-11-10 20:54:11 +00:00
Duncan Sands	ddacbb39ab	Fix PR2667: add soft float support for sint_to_fp/uint_to_fp where the argument is an apint, or smaller than the minimum size for which there is a libcall (i32). llvm-svn: 58994	2008-11-10 17:36:26 +00:00
Duncan Sands	13b2e3634b	Tweak some comments. llvm-svn: 58993	2008-11-10 17:31:56 +00:00
Duncan Sands	7da4b44dd1	Small cleanups. No functionality change intended! llvm-svn: 58992	2008-11-10 17:29:56 +00:00
Duncan Sands	d5b53e1c6c	When promoting the result of fp_to_uint/fp_to_sint, inform the optimizers that the result must be zero/ sign extended from the smaller type. For example, if a fp to unsigned i16 is promoted to fp to i32, then we are allowed to assume that the extra 16 bits are zero (because the result of fp to i16 is undefined if the result does not fit in an i16). This is quite aggressive, but should help the optimizers produce better code. This requires correcting a test which thought that fp_to_uint is some kind of truncation, which it is not: in the testcase (which does fp to i1), either the fp value converts to 0 or 1 or the result is undefined, which is quite different to truncation. llvm-svn: 58991	2008-11-10 17:28:30 +00:00
Dale Johannesen	671743369c	Really fix testb optimization on big-endian. Fixes ppc32 bootstrap. llvm-svn: 58979	2008-11-10 07:16:42 +00:00
Mon P Wang	25f0106fd9	Added support for the following definition of shufflevector <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> llvm-svn: 58964	2008-11-10 04:46:22 +00:00
Dale Johannesen	aa4d82d244	Temporarily revert 58825, which breaks PPC bootstrap. xs llvm-svn: 58930	2008-11-09 06:48:10 +00:00
Duncan Sands	0f3937115d	Try to produce better code when scalarizing VSETCC. llvm-svn: 58920	2008-11-08 18:26:48 +00:00
Dale Johannesen	bb5c9b4b68	Make testb optimization work on big-endian targets. llvm-svn: 58874	2008-11-08 00:01:16 +00:00
Dale Johannesen	160be0ffda	Make FP tests requiring two compares work on PPC (PR 642). This is Chris' patch from the PR, modified to realize that SETUGT/SETULT occur legitimately with integers, plus two fixes in LegalizeDAG to pass a valid result type into LegalizeSetCC. The argument of TLI.getSetCCResultType is ignored on PPC, but I think I'm following usage elsewhere. llvm-svn: 58871	2008-11-07 22:54:33 +00:00
Duncan Sands	2d636b5265	Sign-extend rather than zero-extend when promoting the condition for a BRCOND, according to what is returned by getSetCCResultContents. Since all targets return the same thing (ZeroOrOneSetCCResult), this should be harmless! The point is that all over the place the result of SETCC is fed directly into BRCOND. On machines for which getSetCCResultContents returns ZeroOrNegativeOneSetCCResult, this is a sign-extended boolean. So it seems dangerous to also feed BRCOND zero-extended booleans in some circumstances - for example, when promoting the condition. llvm-svn: 58861	2008-11-07 20:13:04 +00:00
Dale Johannesen	9016882d67	Fix unsigned->ppcf128 conversion. llvm-svn: 58856	2008-11-07 19:11:43 +00:00
Dale Johannesen	7aad542d35	When we're doing a compare of load-AND-constant to 0 (e.g. a bitfield test) narrow the load as much as possible. The has the potential to avoid unnecessary partial-word load-after-store conflicts, which cause stalls on several targets. Also a size win on x86 (testb vs testl). llvm-svn: 58825	2008-11-07 01:28:02 +00:00
Bill Wendling	eb4268d72f	- Modify the stack protector algorithm so that the stack slot is allocated in LLVM IR code and not in the selection DAG ISel. This is a cleaner solution. - Fix the heuristic for determining if protectors are necessary. The previous one wasn't checking the proper type size. llvm-svn: 58824	2008-11-07 01:23:58 +00:00
Mon P Wang	5ca2ec65bd	Fixed scalarizing an extract subvector and prevent an infinite loop when simplify a vector. llvm-svn: 58820	2008-11-06 22:52:21 +00:00
Devang Patel	8af0a362f1	Emit label for llvm.dbg.func.start of the inlined function. llvm-svn: 58814	2008-11-06 21:28:20 +00:00
Duncan Sands	f178f8300d	Formating/comment changes - no functionality change. llvm-svn: 58801	2008-11-06 08:51:32 +00:00
Bill Wendling	b3f7a39877	- Rename stackprotector_{prologue,epilogue} to stackprotector_{create,check}. - Get rid of "HasStackProtector" in MachineFrameInfo. - Modify intrinsics to tell which are doing what with memory. llvm-svn: 58799	2008-11-06 07:23:03 +00:00
Mon P Wang	9a8d60a7c0	Widening cleanup llvm-svn: 58796	2008-11-06 05:31:54 +00:00
Bill Wendling	d970ea3eac	Implement the stack protector stack accesses via intrinsics: - stackprotector_prologue creates a stack object and stores the guard there. - stackprotector_epilogue reads the stack guard from the stack position created by stackprotector_prologue. - The PrologEpilogInserter was changed to make sure that the stack guard is first on the stack frame. llvm-svn: 58791	2008-11-06 02:29:10 +00:00
Devang Patel	9e3e776e28	Emit label for llvm.dbg.func.start of the inlined function. llvm-svn: 58786	2008-11-06 00:30:09 +00:00
Duncan Sands	68035d4076	Fix thinko in ppcf128 expansion of truncating store. llvm-svn: 58753	2008-11-05 07:17:27 +00:00
Evan Cheng	c7b04a12bb	Type of shuffle mask has changed. llvm-svn: 58751	2008-11-05 06:04:18 +00:00
Dale Johannesen	db6b956585	80 columns llvm-svn: 58717	2008-11-04 20:52:49 +00:00
Duncan Sands	d5f935921a	Fix PR3011: LegalizeTypes support for scalarizing SELECT_CC. llvm-svn: 58706	2008-11-04 17:31:08 +00:00
Dale Johannesen	08535d2507	Fix some ppcf128 regressions: make ExpandFloatRes_LOAD work correctly, and bring over a late change to ppcf128 SetCC handling. llvm-svn: 58642	2008-11-03 20:47:45 +00:00
Duncan Sands	6692dec2a0	Make VAARG promotion work correctly with large funky sized integers like i129, and also reduce the number of assumptions made about how vaarg is implemented. This still doesn't work correctly for small integers like (eg) i1 on x86, since x86 passes each of them (essentially an i8) in a 4 byte stack slot, so the pointer needs to be advanced by 4 bytes not by 1 byte as now. But this is no longer a LegalizeTypes problem (it was also wrong in LT before): it is a bug in the operation expansion in LegalizeDAG: now LegalizeTypes turns an i1 vaarg into an i8 vaarg which would work fine if only the i8 vaarg was turned into correct code later. llvm-svn: 58635	2008-11-03 20:22:12 +00:00
Duncan Sands	0207a3f897	Make VAARG work with x86 long double (which is 10 bytes long, but is passed in 12/16 bytes). llvm-svn: 58608	2008-11-03 11:51:11 +00:00
Mon P Wang	769134be1e	Added interface to allow clients to create a MemIntrinsicNode for target intrinsics that touches memory llvm-svn: 58548	2008-11-01 20:24:53 +00:00
Dan Gohman	50c76beeb0	Remove some unused virtual function bodies. llvm-svn: 58524	2008-10-31 19:06:33 +00:00
Duncan Sands	8758851908	Add a bunch of libcalls for ppcf128 that were somehow completely forgotten about when writing LegalizeTypes. llvm-svn: 58508	2008-10-31 14:06:52 +00:00
Duncan Sands	e18295c258	Fix PR2986: do not use a potentially illegal type for the shift amount type. Add a check that shifts and rotates use the type returned by getShiftAmountTy for the amount. This exposed some problems in CellSPU and PPC, which have already been fixed. llvm-svn: 58455	2008-10-30 20:26:50 +00:00
Mon P Wang	01b8a5a967	Add missing vsetcc expansion for widening llvm-svn: 58443	2008-10-30 18:21:52 +00:00
Mon P Wang	58c3794c27	Add initial support for vector widening. Logic is set to widen for X86. One will only see an effect if legalizetype is not active. Will move support to LegalizeType soon. llvm-svn: 58426	2008-10-30 08:01:45 +00:00
Duncan Sands	ee273419f9	Uniformize capitalization of NodeId. llvm-svn: 58386	2008-10-29 17:52:12 +00:00
Duncan Sands	fbb10bbec4	Fix PR2977: LegalizeTypes support for expanding VAARG. llvm-svn: 58379	2008-10-29 14:25:28 +00:00
Duncan Sands	17e678be87	Add sanity checking for BUILD_PAIR (I noticed the other day that PPC custom lowering could create a BUILD_PAIR of two f64 with a result type of... f64! - already fixed). Fix a place that triggers the sanity check. llvm-svn: 58378	2008-10-29 14:22:20 +00:00
Duncan Sands	b964813b1f	Fix a FIXME: in ReplaceNodeWith, if the new node is morphed by AnalyzeNewNode into a previously processed node, and different result values of that node are remapped to values with different nodes, then we could end up using wrong values here [we were assuming that all results remap to values with the same underlying node]. This seems theoretically possible, but I don't have a testcase. The meat of the patch is in the changes to AnalyzeNewNode/AnalyzeNewValue and ReplaceNodeWith. While there, I changed names like RemapNode to RemapValue, since it really remaps values. To tell the truth, I would be much happier if we were only remapping nodes (it would simplify a bunch of logic, and allow for some cute speedups) but I haven't yet worked out how to do that. llvm-svn: 58372	2008-10-29 06:42:19 +00:00
Duncan Sands	914745768e	Fix 80 column violations. llvm-svn: 58371	2008-10-29 06:33:00 +00:00
Duncan Sands	d4ec020734	Fix 80 column violations. llvm-svn: 58370	2008-10-29 06:31:03 +00:00
Dan Gohman	1e3c25ac2d	Take Chris' suggestion and define EnableFastISelVerbose and EnableFastISelAbort variables for Release mode instead of using ifdefs in the code. llvm-svn: 58350	2008-10-28 20:35:31 +00:00
Dan Gohman	e750bb67ee	Protect the code for fast-isel debugging with #ifndef NDEBUG. llvm-svn: 58340	2008-10-28 19:08:46 +00:00
Duncan Sands	4068a7f31e	Fix darwin ppc llvm-gcc build breakage: intercept ppcf128 to i32 conversion and expand it into a code sequence like in LegalizeDAG. This needs custom ppc lowering of FP_ROUND_INREG, so turn that on and make it work with LegalizeTypes. Probably PPC should simply custom lower the original conversion. llvm-svn: 58329	2008-10-28 15:00:32 +00:00
Duncan Sands	f3e5850f80	Fix a testcase provided by Bill in which the node id could end up being wrong mostly because of forgetting to remap new nodes that morphed into processed nodes through CSE. llvm-svn: 58323	2008-10-28 09:38:36 +00:00
Chris Lattner	5fa1040130	Don't produce invalid comparisons after legalize. llvm-svn: 58320	2008-10-28 07:11:07 +00:00
Chris Lattner	56d016ab05	fix some whitespace stuff llvm-svn: 58319	2008-10-28 07:10:51 +00:00
Ted Kremenek	8fcff4d87a	Fix bogus comparison of "const char *" with c-string literal. Use strcmp instead. llvm-svn: 58290	2008-10-27 22:43:07 +00:00
David Greene	b04e7c36d3	Add setSubgraphColor to color an entire portion of a SelectionDAG. This will be used to support debug features in TableGen. llvm-svn: 58257	2008-10-27 18:17:03 +00:00
Duncan Sands	835bdca590	Fix UpdateNodeOperands so that it does CSE of calls (and a bunch of other node types). While there, I added a doNotCSE predicate and used it to reduce code duplication (some of the duplicated code was wrong...). This fixes ARM/cse-libcalls.ll when using LegalizeTypes. llvm-svn: 58249	2008-10-27 15:30:53 +00:00
Duncan Sands	75cf2e03ab	Fix a bug in which a node could be added to the worklist twice: UpdateNodeOperands could morph a new node into a node already on the worklist. We would then recalculate the NodeId for this existing node and add it to the worklist. The testcase is ARM/cse-libcalls.ll, the problem showing up once UpdateNodeOperands is taught to do CSE for calls. llvm-svn: 58246	2008-10-27 13:18:32 +00:00
Duncan Sands	8475d56794	Turn on LegalizeTypes, the new type legalization codegen infrastructure, by default. Please report any breakage to the mailing lists. llvm-svn: 58232	2008-10-27 08:42:46 +00:00
Dan Gohman	811eed81ab	SDNodes may have at most one Flag result. Update this comment to reflect that. llvm-svn: 58145	2008-10-25 17:51:24 +00:00
Dale Johannesen	8b531d2754	Initialize uninitialized variable. llvm-svn: 58057	2008-10-24 01:06:58 +00:00
Duncan Sands	62951678ee	Fix thinko - the operand number has nothing to do with the result number. llvm-svn: 58041	2008-10-23 19:34:23 +00:00
Duncan Sands	8178141378	LegalizeTypes soft-float support for fpow. llvm-svn: 57973	2008-10-22 11:49:09 +00:00
Duncan Sands	578a68a91a	Be nice to CellSPU: for this target getSetCCResultType may return i8, which can result in SELECT nodes for which the type of the condition is i8, but there are no patterns for select with i8 condition. Tweak the LegalizeTypes logic to avoid this as much as possible. This isn't a real fix because it is still perfectly possible to end up with such select nodes - CellSPU needs to be fixed IMHO. llvm-svn: 57968	2008-10-22 09:23:20 +00:00
Duncan Sands	01a1c11218	Port from LegalizeDAG the logic to only generate ADDC/ADDE/SUBC/SUBE if the target supports it. llvm-svn: 57967	2008-10-22 09:07:29 +00:00
Duncan Sands	a1a388cac3	Add some comments explaining the meaning of a boolean that is not of type MVT::i1 in SELECT and SETCC nodes. Relax the LegalizeTypes SELECT condition promotion sanity checks to allow other condition types than i1. llvm-svn: 57966	2008-10-22 09:06:24 +00:00
Duncan Sands	4b6b5fcd80	Temporarily allow the operands of a BUILD_VECTOR to have a different type to the vector element type. This should be fairly harmless because in the past guys like this were being built all over the place (and were cleaned up when I added this check). The reason for relaxing this check is that it helps LegalizeTypes legalize vector shuffles: the mask is a BUILD_VECTOR that it is not always possible to legalize while keeping it a BUILD_VECTOR (vector_shuffle requires the mask to be a BUILD_VECTOR, as opposed to a vector with the right vector type). With this check it is even harder to legalize the mask - turning the check off means that LegalizeTypes manages to legalize almost all vector shuffles encountered in practice. The correct solution is to change vector_shuffle to be a variadic node with the mask built into it as operands. While waiting for that change, this hack stops the problem with vector_shuffle from blocking the turning on of LegalizeTypes. llvm-svn: 57965	2008-10-22 09:00:33 +00:00
Dale Johannesen	28929589e7	Add an SSE2 algorithm for uint64->f64 conversion. The same one Apple gcc uses, faster. Also gets the extreme case in gcc.c-torture/execute/ieee/rbug.c correct which we weren't before; this is not sufficient to get the test to pass though, there is another bug. llvm-svn: 57926	2008-10-21 20:50:01 +00:00
Dan Gohman	8b44b88eff	Fix SelectionDAGBuild lowering of Select instructions to handle first-class aggregate values. Also, fix a bug in the Ret handling for empty aggregates. llvm-svn: 57925	2008-10-21 20:00:42 +00:00
Dan Gohman	269246b034	Don't create TargetGlobalAddress nodes with offsets that don't fit in the 32-bit signed offset field of addresses. Even though this may be intended, some linkers refuse to relocate code where the relocated address computation overflows. Also, fix the sign-extension of constant offsets to use the actual pointer size, rather than the size of the GlobalAddress node, which may be different, for example on x86-64 where MVT::i32 is used when the address is being fit into the 32-bit displacement field. llvm-svn: 57885	2008-10-21 03:38:42 +00:00
Dan Gohman	97d3f6cfe3	Make the NaN test come second, heuristically assuming that NaNs are less common. llvm-svn: 57871	2008-10-21 03:12:54 +00:00
Chris Lattner	4396e0d2c3	Fix gcc.c-torture/compile/920520-1.c by inserting bitconverts for strange asm conditions earlier. In this case, we have a double being passed in an integer reg class. Convert to like sized integer register so that we allocate the right number for the class (two i32's for the f64 in this case). llvm-svn: 57862	2008-10-21 00:45:36 +00:00
Dan Gohman	1a59b3b9b8	Fast-isel no longer an experiment. llvm-svn: 57845	2008-10-20 21:30:12 +00:00
Duncan Sands	aac74a9055	Support operations like fp_to_uint with a vector result type when the result type is legal but not the operand type. Add additional support for EXTRACT_SUBVECTOR and CONCAT_VECTORS, needed to handle such cases. llvm-svn: 57840	2008-10-20 16:31:21 +00:00
Duncan Sands	e0fb87acf6	LegalizeTypes support for atomic operation promotion. llvm-svn: 57838	2008-10-20 16:17:42 +00:00
Duncan Sands	840143fc6f	Use DAG.getIntPtrConstant rather than DAG.getConstant with TLI.getPointerTy for a small simplification. llvm-svn: 57837	2008-10-20 16:14:43 +00:00
Duncan Sands	5805334d5b	Always use either MVT::i1 or getSetCCResultType for the condition of a SELECT node. Make sure that the correct extension type (any-, sign- or zero-extend) is used. llvm-svn: 57836	2008-10-20 16:13:04 +00:00
Duncan Sands	fe9b5550de	Formatting - no functional change. llvm-svn: 57834	2008-10-20 16:06:47 +00:00
Duncan Sands	3ed8b29ace	Don't use a random type for the select condition, use an MVT::i1 and simplify the code while there. llvm-svn: 57833	2008-10-20 16:04:57 +00:00
Bill Wendling	8ec2a4a96c	Set N->OperandList to 0 after deletion. Otherwise, it's possible that it will be either deleted or referenced afterwards. llvm-svn: 57786	2008-10-19 20:51:12 +00:00
Bill Wendling	6c87bfc6fd	Fix comment. Other formatting changes. No functionality changes. llvm-svn: 57785	2008-10-19 20:34:04 +00:00
Duncan Sands	8d11adca4c	Vector shuffle mask elements may be "undef". Handle this everywhere in LegalizeTypes. llvm-svn: 57783	2008-10-19 15:00:25 +00:00
Duncan Sands	c6d12bd665	Use a legal integer type for vector shuffle mask elements. Otherwise LegalizeTypes will, reasonably enough, legalize the mask, which may result in it no longer being a BUILD_VECTOR node (LegalizeDAG simply ignores the legality or not of vector masks). llvm-svn: 57782	2008-10-19 14:58:05 +00:00
Chris Lattner	160e8abd77	Reapply r57699 with a fix to not crash on asms with multiple results. Unlike the previous patch this one actually passes make check. "Fix PR2356 on PowerPC: if we have an input and output that are tied together that have different sizes (e.g. i32 and i64) make sure to reserve registers for the bigger operand." llvm-svn: 57771	2008-10-18 18:49:30 +00:00
Dan Gohman	727a94063c	Don't truncate GlobalAddress offsets to int in debug output. llvm-svn: 57770	2008-10-18 18:22:42 +00:00
Dan Gohman	2fe6bee5b6	Teach DAGCombine to fold constant offsets into GlobalAddress nodes, and add a TargetLowering hook for it to use to determine when this is legal (i.e. not in PIC mode, etc.) This allows instruction selection to emit folded constant offsets in more cases, such as the included testcase, eliminating the need for explicit arithmetic instructions. This eliminates the need for the C++ code in X86ISelDAGToDAG.cpp that attempted to achieve the same effect, but wasn't as effective. Also, fix handling of offsets in GlobalAddressSDNodes in several places, including changing GlobalAddressSDNode's offset from int to int64_t. The Mips, Alpha, Sparc, and CellSPU targets appear to be unaware of GlobalAddress offsets currently, so set the hook to false on those targets. llvm-svn: 57748	2008-10-18 02:06:02 +00:00
Dan Gohman	6de2556205	Revert r57699. It's causing regressions in test/CodeGen/X86/2008-09-17-inline-asm-1.ll and a few others, and it breaks the llvm-gcc build. llvm-svn: 57747	2008-10-18 01:03:45 +00:00
Dan Gohman	d01ddb51ee	Factor out the code for mapping LLVM IR condition opcodes to ISD condition opcodes into helper functions. llvm-svn: 57726	2008-10-17 21:16:08 +00:00
Chris Lattner	aadf7414b2	add support for 128 bit aggregates. llvm-svn: 57715	2008-10-17 19:59:51 +00:00
Mon P Wang	85f48ade9c	Added MemIntrinsicNode which is useful to represent target intrinsics that touches memory and need an associated MemOperand llvm-svn: 57712	2008-10-17 18:22:58 +00:00
Dan Gohman	293abcc91d	Factor out the code for mapping LLVM IR condition opcodes to ISD condition opcodes into helper functions. llvm-svn: 57710	2008-10-17 18:18:45 +00:00
Chris Lattner	052092bf9c	Fix PR2356 on PowerPC: if we have an input and output that are tied together that have different sizes (e.g. i32 and i64) make sure to reserve registers for the bigger operand. llvm-svn: 57699	2008-10-17 17:52:49 +00:00
Chris Lattner	3b1833c9b4	refactor some code into a helper method, no functionality change. llvm-svn: 57690	2008-10-17 17:05:25 +00:00
Chris Lattner	860df6e84c	Keep track of which input constraint matches an output constraint. Reject asms where an output has multiple input constraints tied to it. llvm-svn: 57687	2008-10-17 16:47:46 +00:00
Chris Lattner	ef8901722e	add an assert so that PR2356 explodes instead of running off an array. Improve some minor comments, refactor some helpers in AsmOperandInfo. No functionality change for valid code. llvm-svn: 57686	2008-10-17 16:21:11 +00:00
Dan Gohman	a39b0a1f05	Define patterns for shld and shrd that match immediate shift counts, and patterns that match dynamic shift counts when the subtract is obscured by a truncate node. Add DAGCombiner support for recognizing rotate patterns when the shift counts are defined by truncate nodes. Fix and simplify the code for commuting shld and shrd instructions to work even when the given instruction doesn't have a parent, and when the caller needs a new instruction. These changes allow LLVM to use the shld, shrd, rol, and ror instructions on x86 to replace equivalent code using two shifts and an or in many more cases. llvm-svn: 57662	2008-10-17 01:23:35 +00:00
Evan Cheng	3b0f5e4d61	- Add target lowering hooks that specify which setcc conditions are illegal, i.e. conditions that cannot be checked with a single instruction. For example, SETONE and SETUEQ on x86. - Teach legalizer to implement illegal setcc as a and / or of a number of legal setcc nodes. For now, only implement FP conditions. e.g. SETONE is implemented as SETO & SETNE, SETUEQ is SETUO \| SETEQ. - Move x86 target over. llvm-svn: 57542	2008-10-15 02:05:31 +00:00
Dan Gohman	e7ced74558	FastISel support for exception-handling constructs. - Move the EH landing-pad code and adjust it so that it works with FastISel as well as with SDISel. - Add FastISel support for @llvm.eh.exception and @llvm.eh.selector. llvm-svn: 57539	2008-10-14 23:54:11 +00:00
Evan Cheng	07d53b1d33	Rename LoadX to LoadExt. llvm-svn: 57526	2008-10-14 21:26:46 +00:00
Dan Gohman	9c4b7d5c4f	Fix command-line option printing to print two spaces where needed, instead of requiring all "short description" strings to begin with two spaces. This makes these strings less mysterious, and it fixes some cases where short description strings mistakenly did not begin with two spaces. llvm-svn: 57521	2008-10-14 20:25:08 +00:00
Evan Cheng	da9b752883	FIX PR2794. Make sure SIGN_EXTEND_INREG nodes introduced by LegalizeSetCCOperands are leglized. Patch by Richard Pennington. llvm-svn: 57460	2008-10-13 18:46:18 +00:00
Matthijs Kooijman	43686a6665	* Make TargetLowering not crash when TargetMachine::getTargetAsmInfo() returns null. This assumes that any target that does not have AsmInfo, does not support "LocAndDot". llvm-svn: 57438	2008-10-13 12:41:46 +00:00
Chris Lattner	c52af45304	calls can be supported. llvm-svn: 57428	2008-10-13 01:59:13 +00:00
Chris Lattner	2753955fc0	Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as parameters instead of raw Constants. This prevents the constants from being selected by the isel pass, fixing PR2735. llvm-svn: 57385	2008-10-11 22:08:30 +00:00
Chris Lattner	fb1f4a1329	simplify comparison llvm-svn: 57371	2008-10-11 00:08:02 +00:00
Dale Johannesen	4f0bd68cfe	Add a "loses information" return value to APFloat::convert and APFloat::convertToInteger. Restore return value to IEEE754. Adjust all users accordingly. llvm-svn: 57329	2008-10-09 23:00:39 +00:00
Dale Johannesen	54306fe499	Rename APFloat::convertToAPInt to bitcastToAPInt to make it clearer what the function does. No functional change. llvm-svn: 57325	2008-10-09 18:53:47 +00:00
Dan Gohman	c1d47c56f9	Avoid emitting redundant materializations of integer constants for things like null pointers, which at this level aren't different from regular integer constants. llvm-svn: 57265	2008-10-07 22:03:27 +00:00
Andrew Lenharth	21dca9cbb1	Use Dan's supperior check llvm-svn: 57255	2008-10-07 18:27:23 +00:00
Andrew Lenharth	d69bdaef64	No need for \|= llvm-svn: 57249	2008-10-07 17:11:29 +00:00
Andrew Lenharth	6d409f08be	Use ADDC if it is valid at any smaller size. Do it right this time llvm-svn: 57248	2008-10-07 17:09:16 +00:00
Andrew Lenharth	6606f17e50	Use ADDC if it is valid at any smaller size. fixes test/Codegen/Generic/i128-addsub.ll on x86 llvm-svn: 57247	2008-10-07 17:03:15 +00:00
Andrew Lenharth	3a9be150be	Expand arith on machines without carry flags llvm-svn: 57243	2008-10-07 14:15:42 +00:00
Dan Gohman	bef9b0bef0	Correctly handle calls with no return values. This fixes 2006-01-23-UnionInit on x86-64 when inlining is not enabled. llvm-svn: 57223	2008-10-07 00:12:37 +00:00
Chris Lattner	2416896b3c	wrap some long lines and expand i32 mul's to libcalls, inspired by a patch by Mikael Lepisto! llvm-svn: 57077	2008-10-04 21:27:46 +00:00
Dan Gohman	13b048268b	Fix fast-isel's handling of atomic instructions. They may expand to multiple basic blocks, in which case fast-isel needs to informed of which block to use as it resumes inserting instructions. llvm-svn: 57040	2008-10-04 00:56:36 +00:00
Dale Johannesen	5d60c1ebb1	Pass MemOperand through for 64-bit atomics on 32-bit, incidentally making the case where the memop is a pointer deref work. Fix cmp-and-swap regression. llvm-svn: 57027	2008-10-03 19:41:08 +00:00
Dan Gohman	b62cd7ea98	Use -1ULL instead of uint64_t(-1), at Anton's suggestion. llvm-svn: 57021	2008-10-03 17:56:45 +00:00
Duncan Sands	6e42742d2d	The result of getSetCCResultType (eg: i32) may be larger than the type an i1 is promoted to (eg: i8). Account for this. Noticed by Tilmann Scheller on CellSPU; he will hopefully take care of fixing this in LegalizeDAG and adding a testcase! llvm-svn: 56997	2008-10-03 07:41:46 +00:00
Dan Gohman	4e072a75cc	Implement fast-isel support for zero-extending from i1. It turns out that this is a fairly common operation, and it's easy enough to handle. llvm-svn: 56990	2008-10-03 01:28:47 +00:00
Dan Gohman	1ab1d31f7a	Optimize conditional branches in X86FastISel. This replaces sequences like this: sete %al testb %al, %al jne LBB11_1 with this: je LBB11_1 llvm-svn: 56969	2008-10-02 22:15:21 +00:00
Dale Johannesen	867d549fce	Handle some 64-bit atomics on x86-32, some of the time. llvm-svn: 56963	2008-10-02 18:53:47 +00:00
Dan Gohman	1dd27578dd	Make some implicit conversions explicit, to avoid compiler warnings. llvm-svn: 56927	2008-10-01 19:58:59 +00:00
Dan Gohman	94798d31dd	Fold trivial two-operand tokenfactors where the operands are equal immediately. llvm-svn: 56921	2008-10-01 15:11:19 +00:00
Dan Gohman	3a293e7404	Fix typos in comments. llvm-svn: 56919	2008-10-01 15:07:49 +00:00
Bill Wendling	68f12ee567	Implement the -fno-builtin option in the front-end, not in the back-end. llvm-svn: 56900	2008-10-01 00:59:58 +00:00
Bill Wendling	e818bc159f	- Initialize "--no-builtin" to "false". - Testcase for r56885. llvm-svn: 56886	2008-09-30 21:40:30 +00:00
Bill Wendling	bd09262e97	Add the new `-no-builtin' flag. This flag is meant to mimic the GCC `-fno-builtin' flag. Currently, it's used to replace "memset" with "_bzero" instead of "__bzero" on Darwin10+. This arguably violates the meaning of this flag, but is currently sufficient. The meaning of this flag should become more specific over time. llvm-svn: 56885	2008-09-30 21:22:07 +00:00
Dan Gohman	b486350b15	Move the primary fast-isel top-level comments to FastISel.cpp, where they'll be a little more visible. Also, update and reword them a bit. llvm-svn: 56877	2008-09-30 20:48:29 +00:00
Dan Gohman	86aa16a69a	Optimize SelectionDAG's AssignTopologicalOrder even further. Completely eliminate the TopOrder std::vector. Instead, sort the AllNodes list in place. This also eliminates the need to call AllNodes.size(), a linear-time operation, before performing the sort. Also, eliminate the Sources temporary std::vector, since it essentially duplicates the sorted result as it is being built. This also changes the direction of the topological sort from bottom-up to top-down. The AllNodes list starts out in roughly top-down order, so this reduces the amount of reordering needed. Top-down is also more convenient for Legalize, and ISel needed only minor adjustments. llvm-svn: 56867	2008-09-30 18:30:35 +00:00
Dale Johannesen	f61a84ec43	Remove misuse of ReplaceNodeResults for atomics with valid types. No functional change. llvm-svn: 56808	2008-09-29 22:25:26 +00:00
Dan Gohman	4aa9095398	Fix FastISel to not initialize the PIC-base register multiple times in functions with PIC references from more than one basic block. llvm-svn: 56807	2008-09-29 21:55:50 +00:00
Bill Wendling	c966a737c5	Temporarily reverting r56683. This is causing a failure during the build of llvm-gcc: /Volumes/Gir/devel/llvm/clean/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Gir/devel/llvm/clean/llvm-gcc.obj/./gcc/ -B/Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/bin/ -B/Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/lib/ -isystem /Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/include -isystem /Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/sys-include -mmacosx-version-min=10.4 -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Gir/devel/llvm/clean/llvm.obj/include -I/Volumes/Gir/devel/llvm/clean/llvm.src/include -fexceptions -fvisibility=hidden -DHIDE_EXPORTS -c ../../llvm-gcc.src/gcc/unwind-dw2-fde-darwin.c -o libgcc/./unwind-dw2-fde-darwin.o Assertion failed: (TargetRegisterInfo::isVirtualRegister(regA) && TargetRegisterInfo::isVirtualRegister(regB) && "cannot update physical register live information"), function runOnMachineFunction, file /Volumes/Gir/devel/llvm/clean/llvm.src/lib/CodeGen/TwoAddressInstructionPass.cpp, line 311. ../../llvm-gcc.src/gcc/unwind-dw2.c:1527: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. {standard input}:3521:non-relocatable subtraction expression, "_dwarf_reg_size_table" minus "L20$pb" {standard input}:3521:symbol: "_dwarf_reg_size_table" can't be undefined in a subtraction expression {standard input}:3520:non-relocatable subtraction expression, "_dwarf_reg_size_table" minus "L20$pb" ... llvm-svn: 56703	2008-09-26 22:10:44 +00:00
Dan Gohman	6e0548336a	Rename ConstantSDNode's getSignExtended to getSExtValue, for consistancy with ConstantInt, and re-implement it in terms of ConstantInt's getSExtValue. llvm-svn: 56700	2008-09-26 21:54:37 +00:00
Evan Cheng	d77cbe8947	Fix @llvm.frameaddress codegen. FP elimination optimization should be disabled when frame address is desired. Also add support for depth > 0. llvm-svn: 56683	2008-09-26 19:48:35 +00:00
Dale Johannesen	0e32a2c935	Add "inreg" field to CallSDNode (doesn't increase its size). Adjust various lowering functions to pass this info through from CallInst. Use it to implement sseregparm returns on X86. Remove X86_ssecall calling convention. llvm-svn: 56677	2008-09-26 19:31:26 +00:00
Devang Patel	4c758ea3e0	Large mechanical patch. s/ParamAttr/Attribute/g s/PAList/AttrList/g s/FnAttributeWithIndex/AttributeWithIndex/g s/FnAttr/Attribute/g This sets the stage - to implement function notes as function attributes and - to distinguish between function attributes and return value attributes. This requires corresponding changes in llvm-gcc and clang. llvm-svn: 56622	2008-09-25 21:00:45 +00:00
Dale Johannesen	c50ada2f56	Accept 'inreg' attribute on x86 functions as meaning sse_regparm (i.e. float/double values go in XMM0 instead of ST0). Update documentation to reflect reality. llvm-svn: 56619	2008-09-25 20:47:45 +00:00
Dan Gohman	5e490a7567	Support for i1 XOR in FastISel. It is actually safe because i1 operands are assumed to already by zero-extended. llvm-svn: 56615	2008-09-25 17:22:52 +00:00
Dan Gohman	6975c36c43	Don't print fast-isel debug messages by default. Thanks Chris! llvm-svn: 56614	2008-09-25 17:21:42 +00:00
Dan Gohman	dd920bf3f0	Don't forget the newline in debug output. llvm-svn: 56613	2008-09-25 17:17:27 +00:00
Dan Gohman	32a733e2c7	FastISel support for debug info. llvm-svn: 56610	2008-09-25 17:05:24 +00:00
Richard Pennington	4b35e64504	bug 2812: Segmentation fault on a big emdiam processor. llvm-svn: 56609	2008-09-25 16:15:10 +00:00
Dan Gohman	3663f156f7	Fix a recent fast-isel coverage regression - don't bail out before giving the target a chance to materialize constants. llvm-svn: 56605	2008-09-25 01:28:51 +00:00
Dan Gohman	b8e69f1755	Enable DeadMachineInstructionElim when Fast-ISel is enabled. llvm-svn: 56604	2008-09-25 01:14:49 +00:00
Evan Cheng	2e7450716a	<rdar://problem/6234798> Assertion failed: (!OpInfo.AssignedRegs.Regs.empty() && "Couldn't allocate input reg!") llvm-svn: 56597	2008-09-25 00:14:04 +00:00
Dale Johannesen	86d421df23	Remove SelectionDag early allocation of registers for earlyclobbers. Teach Local RA about earlyclobber, and add some tests for it. llvm-svn: 56592	2008-09-24 23:13:09 +00:00
Bill Wendling	dea91308ae	Reapplying r56550 llvm-svn: 56553	2008-09-24 10:25:02 +00:00
Bill Wendling	162c26dee3	Forgot this part with my last patch. Sorry about the breakage. llvm-svn: 56552	2008-09-24 10:16:24 +00:00
Eric Christopher	4e26a81371	Temporarily revert r56550 until missing commit can be added. llvm-svn: 56551	2008-09-24 08:30:44 +00:00
Bill Wendling	7c31464a0b	Refactor the constant folding code into it's own function. And call it from both the SelectionDAG and DAGCombiner code. The only functionality change is that now the DAG combiner is performing the constant folding for these operations instead of being a no-op. This is not in response to a bug, so there isn't a testcase. llvm-svn: 56550	2008-09-24 07:11:26 +00:00
Dale Johannesen	c36660d756	Next round of earlyclobber handling. Approach the RA problem by expanding the live interval of an earlyclobber def back one slot. Remove overlap-earlyclobber throughout. Remove earlyclobber bits and their handling from live internals. llvm-svn: 56539	2008-09-24 01:07:17 +00:00
Evan Cheng	e0add20c1b	Properly handle 'm' inline asm constraints. If a GV is being selected for the addressing mode, it requires the same logic for PIC relative addressing, etc. llvm-svn: 56526	2008-09-24 00:05:32 +00:00
Devang Patel	ba3fa6c6e1	s/ParameterAttributes/Attributes/g llvm-svn: 56513	2008-09-23 23:03:40 +00:00
Dan Gohman	918fe08a56	Arrange for FastISel code to have access to the MachineModuleInfo object. This will be needed to support debug info. llvm-svn: 56508	2008-09-23 21:53:34 +00:00
Dan Gohman	c07f686665	Replace the LiveRegs SmallSet with a simple counter that keeps track of the number of live registers, which is all the set was being used for. llvm-svn: 56498	2008-09-23 18:50:48 +00:00
Dan Gohman	e2947e1e07	Fix the alignment of loads from constant pool entries when the load address has an offset from the base of the constant pool entry. llvm-svn: 56479	2008-09-22 22:40:08 +00:00
Dale Johannesen	7a74e71489	Make log, log2, log10, exp, exp2 use Expand by default. llvm-svn: 56471	2008-09-22 21:57:32 +00:00
Evan Cheng	13beeeb128	Per review feedback: Only perform (srl x, (trunc (and y, c))) -> (srl x, (and (trunc y), c)) etc. when both "trunc" and "and" have single uses. llvm-svn: 56452	2008-09-22 18:19:24 +00:00
Oscar Fuentes	a229b3c9a7	Initial support for the CMake build system. llvm-svn: 56419	2008-09-22 01:08:49 +00:00
Bill Wendling	91ef8fcd29	Add helper function to get a 32-bit floating point constant. No functionality change. llvm-svn: 56418	2008-09-22 00:44:35 +00:00
Chris Lattner	43f5449c48	don't print GlobalAddressSDNode's with an offset of zero as "foo0". llvm-svn: 56399	2008-09-21 18:38:31 +00:00
Dan Gohman	9801ba451a	Refactor X86SelectConstAddr, folding it into X86SelectAddress. This results in better code for globals. Also, unbreak the local CSE for GlobalValue stub loads. llvm-svn: 56371	2008-09-19 22:16:54 +00:00
Dan Gohman	95be7d7b85	Add a new "fast" scheduler. This is currently basically just a copy of the BURRList scheduler, but with several parts ripped out, such as backtracking, online topological sort maintenance (needed by backtracking), the priority queue, and Sethi-Ullman number computation and maintenance (needed by the priority queue). As a result of all this, it generates somewhat lower quality code, but that's its tradeoff for running about 30% faster than list-burr in -fast mode in many cases. This is somewhat experimental. Moving forward, major pieces of this can be refactored with pieces in common with ScheduleDAGRRList.cpp. llvm-svn: 56307	2008-09-18 16:26:26 +00:00
Dale Johannesen	f8610ebebc	Add a bit to mark operands of asm's that conflict with an earlyclobber operand elsewhere. Propagate this bit and the earlyclobber bit through SDISel. Change linear-scan RA not to allocate regs in a way that conflicts with an earlyclobber. See also comments. llvm-svn: 56290	2008-09-17 21:13:11 +00:00
Dan Gohman	6ab52a8018	Don't worry about clobbering physical register defs that aren't used. llvm-svn: 56281	2008-09-17 15:25:49 +00:00
Evan Cheng	a904f466e8	When converting a CopyFromReg to a copy instruction, use the register class of its uses to determine the right destination register class of the copy. This is important for targets where a physical register may belong to multiple register classes. llvm-svn: 56258	2008-09-16 23:12:11 +00:00
Dan Gohman	64d6c6fe30	Change SelectionDAG::getConstantPool to always set the alignment of the ConstantPoolSDNode, using the target's preferred alignment for the constant type. In LegalizeDAG, when performing loads from the constant pool, the ConstantPoolSDNode's alignment is used in the calls to getLoad and getExtLoad. This change prevents SelectionDAG::getLoad/getExtLoad from incorrectly choosing the ABI alignment for constant pool loads when Alignment == 0. The incorrect alignment is only a performance issue when ABI alignment does not equal preferred alignment (i.e., on x86 it was generating MOVUPS instead of MOVAPS for v4f32 constant loads when the default ABI alignment for 128bit vectors is forced to 1 byte.) Patch by Paul Redmond! llvm-svn: 56253	2008-09-16 22:05:41 +00:00
Bill Wendling	24c79f28b1	Reverting r56249. On further investigation, this functionality isn't needed. Apologies for the thrashing. llvm-svn: 56251	2008-09-16 21:48:12 +00:00
Dan Gohman	ab26f20d44	Include the alignment value when displaying ConstantPoolSDNodes. llvm-svn: 56250	2008-09-16 21:18:22 +00:00
Bill Wendling	8bc392fb1d	- Change "ExternalSymbolSDNode" to "SymbolSDNode". - Add linkage to SymbolSDNode (default to external). - Change ISD::ExternalSymbol to ISD::Symbol. - Change ISD::TargetExternalSymbol to ISD::TargetSymbol These changes pave the way to allowing SymbolSDNodes with non-external linkage. llvm-svn: 56249	2008-09-16 21:12:30 +00:00
Dan Gohman	050d7835c6	Don't take the time to CheckDAGForTailCallsAndFixThem when tail calls are not enabled. Instead just omit the tail call flag when calls are created. llvm-svn: 56235	2008-09-16 01:42:28 +00:00
Dan Gohman	3c7b9ba547	Re-enable SelectionDAG CSE for calls. It matters in the case of libcalls, as in this testcase on ARM. llvm-svn: 56226	2008-09-15 19:46:03 +00:00
Dan Gohman	d3fe174c53	Define CallSDNode, an SDNode subclass for use with ISD::CALL. Currently it just holds the calling convention and flags for isVarArgs and isTailCall. And it has several utility methods, which eliminate magic 5+2*i and similar index computations in several places. CallSDNodes are not CSE'd. Teach UpdateNodeOperands to handle nodes that are not CSE'd gracefully. llvm-svn: 56183	2008-09-13 01:54:27 +00:00
Dan Gohman	ec270fb640	Change ConstantSDNode and ConstantFPSDNode to use ConstantInt* and ConstantFP* instead of APInt and APFloat directly. This reduces the amount of time to create ConstantSDNode and ConstantFPSDNode nodes when ConstantInt* and ConstantFP* respectively are already available, as is the case in SelectionDAGBuild.cpp. Also, it reduces the amount of time to legalize constants into constant pools, and the amount of time to add ConstantFP operands to MachineInstrs, due to eliminating ConstantInt::get and ConstantFP::get calls. It increases the amount of work needed to create new constants in cases where the client doesn't already have a ConstantInt* or ConstantFP*, such as legalize expanding 64-bit integer constants to 32-bit constants. And it adds a layer of indirection for the accessor methods. But these appear to be outweight by the benefits in most cases. It will also make it easier to make ConstantSDNode and ConstantFPNode more consistent with ConstantInt and ConstantFP. llvm-svn: 56162	2008-09-12 18:08:03 +00:00
Dale Johannesen	1f3ab86804	Pass "earlyclobber" bit through to machine representation; coalescer and RA need to know about it. No functional change. llvm-svn: 56161	2008-09-12 17:49:03 +00:00
Dan Gohman	effb894453	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Dale Johannesen	baf6762e26	The sequence for ppcf128 compares was not IEEE safe in the presence of NaNs. llvm-svn: 56136	2008-09-12 00:30:56 +00:00
Dan Gohman	1dc9b0514f	FastISel support for i1 PHI nodes. llvm-svn: 56069	2008-09-10 21:01:31 +00:00
Dan Gohman	940bafb687	FastISel support for i1 constants. llvm-svn: 56068	2008-09-10 21:01:08 +00:00
Dan Gohman	39d82f902a	Add X86FastISel support for static allocas, and refences to static allocas. As part of this change, refactor the address mode code for laods and stores. llvm-svn: 56066	2008-09-10 20:11:02 +00:00
Dan Gohman	222018da7b	Add a break statement that I accidentally deleted when I shuffled the fast-isel command-line options around. This fixes a bunch of fast-isel failures. llvm-svn: 56057	2008-09-10 15:52:34 +00:00
Bill Wendling	6987fec11c	Remove unnecessary bit-wise AND from the limited precision work. llvm-svn: 56049	2008-09-10 06:26:10 +00:00
Daniel Dunbar	999096065f	Fix 80 col violation. llvm-svn: 56048	2008-09-10 04:16:29 +00:00
Bill Wendling	eb1db169bf	Check that both operands are f32 before attempting to lower. llvm-svn: 56036	2008-09-10 00:24:59 +00:00
Bill Wendling	648930b9ba	Implement "visitPow". This is mainly used to see if we have a pow() call of this form: powf(10.0f, x); If this is the case, and also we want limited precision floating-point calculations, then lower to do the limited-precision stuff. llvm-svn: 56035	2008-09-10 00:20:20 +00:00
Evan Cheng	0fff397a13	A few more places where FPOW is being ignored. llvm-svn: 56032	2008-09-09 23:35:53 +00:00
Dan Gohman	b4c0295b8e	Change -fast-isel-no-abort to -fast-isel-abort, which now defaults to being off by default. Also, add assertion checks to check that the various fast-isel-related command-line options are only used when -fast-isel itself is enabled. llvm-svn: 56029	2008-09-09 23:05:00 +00:00
Evan Cheng	f4e5de4583	Legalizer was missing code that expand fpow to a libcall. llvm-svn: 56028	2008-09-09 23:02:14 +00:00
Bill Wendling	ab6676a46a	Adding 6-, 12-, and 18-bit limited-precision floating-point support for exp2 function. llvm-svn: 56025	2008-09-09 22:39:21 +00:00
Bill Wendling	48217d89b4	Add support for 6-, 12-, and 18-bit limited precision calculations of exp for floating-point numbers. llvm-svn: 56023	2008-09-09 22:13:54 +00:00
Dan Gohman	91491b51e2	Add a new option, -fast-isel-verbose, that can be used with -fast-isel-no-abort to get a dump of all unhandled instructions, without an abort. llvm-svn: 56021	2008-09-09 22:06:46 +00:00
Owen Anderson	4a58bd331b	Clean this up, based on Evan's suggestions. llvm-svn: 56009	2008-09-09 20:47:17 +00:00
Bill Wendling	ed3bb7888d	- Add support for 6-, 12-, and 18-bit limited precision floating-point "log" values. - Refactored some of the code. llvm-svn: 56008	2008-09-09 20:39:27 +00:00
Anton Korobeynikov	1a1140429e	Make safer variant of alias resolution routine to be default llvm-svn: 56005	2008-09-09 20:05:04 +00:00
Bill Wendling	faeb4b6755	Add limited precision floating-point conversions of log10 for 6- and 18-bit precisions. llvm-svn: 56000	2008-09-09 18:42:23 +00:00
Owen Anderson	8529085f4f	Check for type legality before materializing integer constants in fast isel. With this change, all of MultiSource/Applications passes on Darwin/X86 under FastISel. llvm-svn: 55982	2008-09-09 06:32:02 +00:00
Dan Gohman	b6aef419b4	Remove the code that protected FastISel from aborting in the case of loads, stores, and conditional branches. It can handle those now, so any that aren't handled should trigger the abort. llvm-svn: 55977	2008-09-09 02:40:04 +00:00
Evan Cheng	1e97901388	Fix a constant lowering bug. Now we can do load and store instructions with funky getelementptr embedded in the address operand. llvm-svn: 55975	2008-09-09 01:26:59 +00:00
Bill Wendling	484167851a	Add support for floating-point calculations of log2 with limited precisions of 6 and 18. llvm-svn: 55968	2008-09-09 00:28:24 +00:00
Anton Korobeynikov	45165ed1ac	Reapply 55904: Unbreak and fix indentation llvm-svn: 55958	2008-09-08 21:13:56 +00:00
Dan Gohman	a333f3ccb8	Fix a few I's that were meant to be renamed to BI's. llvm-svn: 55942	2008-09-08 20:37:59 +00:00
Dale Johannesen	67f99f1454	Redo the 3 existing low-precision expansions to use float constants. An oversight by the numerics people who supplied this. llvm-svn: 55930	2008-09-08 18:00:26 +00:00
Bill Wendling	99b83712f3	Reverting r55898 to r55909. One of these patches was causing an ICE during the full bootstrap on Darwin: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/sys-include -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DSHARED -m64 -DL_negdi2 -c ../../llvm-gcc.src/gcc/libgcc2.c -o libgcc/x86_64/_negdi2_s.o Assertion failed: (TargetRegisterInfo::isVirtualRegister(regA) && TargetRegisterInfo::isVirtualRegister(regB) && "cannot update physical register live information"), function runOnMachineFunction, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/lib/CodeGen/TwoAddressInstructionPass.cpp, line 311. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/sys-include -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DSHARED -m64 -DL_lshrdi3 -c ../../llvm-gcc.src/gcc/libgcc2.c -o libgcc/x86_64/_lshrdi3_s.o ../../llvm-gcc.src/gcc/unwind-dw2.c:1527: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. {standard input}:unknown:Undefined local symbol LBB21_11 {standard input}:unknown:Undefined local symbol LBB21_12 {standard input}:unknown:Undefined local symbol LBB21_13 {standard input}:unknown:Undefined local symbol LBB21_8 llvm-svn: 55928	2008-09-08 17:59:12 +00:00
Dan Gohman	1df80f6b1c	In visitUREM, arrange for the temporary UDIV node to be revisited, consistent with the code in visitSREM. llvm-svn: 55923	2008-09-08 16:59:01 +00:00
Daniel Dunbar	ede2d7d745	Add VISIBILITY_HIDDEN on SDISelAsmOperandInfo llvm-svn: 55922	2008-09-08 16:56:08 +00:00
Dan Gohman	e19bc1844f	Fix the string for ISD::UDIVREM. llvm-svn: 55917	2008-09-08 16:30:29 +00:00
Evan Cheng	24776b554d	Avoid redefinition and nnbreak windows build. llvm-svn: 55911	2008-09-08 16:01:27 +00:00
Anton Korobeynikov	6a73698a85	Unbreak and fix indentation llvm-svn: 55904	2008-09-08 14:23:34 +00:00
Evan Cheng	e775d3526c	Add fast isel physical register definition support. llvm-svn: 55892	2008-09-08 08:38:20 +00:00
Bill Wendling	5f7371d7b1	Revert my previous change -- the subtraction of two constants was a no-op before. This is taken care of in the selection DAG pass. In my opinion, this should be in one place or the other. I.e., it should probably be removed from the DAG combiner (along with the other arithmetic transformations on constants that are essentially no-ops). llvm-svn: 55889	2008-09-08 01:56:32 +00:00
Bill Wendling	df81749886	Convert // fold (sub c1, c2) -> c1-c2 from a no-op into an actual transformation. llvm-svn: 55886	2008-09-07 11:34:47 +00:00
Evan Cheng	b9a0abb129	Indentation. llvm-svn: 55880	2008-09-07 09:04:52 +00:00
Evan Cheng	615739b991	- Doh. Pass vector by value is bad. - Add a AnalyzeCallResult specialized for calls which produce a single value. This is used by fastisel. llvm-svn: 55879	2008-09-07 09:02:18 +00:00
Dale Johannesen	36d532abd6	Next limited float precision expansion (log2 12 bits) llvm-svn: 55866	2008-09-05 23:49:37 +00:00
Owen Anderson	1dd2e40521	Revert r55859. This is breaking the build in the abscence of its companion commit. llvm-svn: 55865	2008-09-05 23:36:01 +00:00
Dan Gohman	f17a2f3602	Move the code that inserts copies for function livein registers out of ScheduleDAGEmit.cpp and into SelectionDAGISel.cpp. This allows it to be run exactly once per function, even if multiple SelectionDAG iterations happen in the entry block, as may happen with FastISel. llvm-svn: 55863	2008-09-05 22:59:21 +00:00
Dale Johannesen	d4dac0e9ea	Add the next limited-precision expansion. llvm-svn: 55856	2008-09-05 21:27:19 +00:00
Dan Gohman	fd634599dc	FastISel support for AND and OR with type i1. llvm-svn: 55846	2008-09-05 18:44:22 +00:00
Dale Johannesen	520143e563	Add hooks for other intrinsics to get low-precision expansions. llvm-svn: 55845	2008-09-05 18:38:42 +00:00
Dan Gohman	fcf545690c	FastISel support for ConstantExprs. llvm-svn: 55843	2008-09-05 18:18:20 +00:00
Dan Gohman	677c3afbd1	Revert r55817. It broke PIC. FastISel will need to find a different approach here. llvm-svn: 55842	2008-09-05 18:13:01 +00:00
Evan Cheng	6b8fae1777	Add a variant of AnalyzeCallOperands that can be used by fast isel. llvm-svn: 55838	2008-09-05 16:59:26 +00:00
Duncan Sands	4d50e984bb	"Fix" PR2762. The testcase now crashes codegen elsewhere due to a missing pattern for v2f64 = sint_to_fp v2i32. That is PR2687. llvm-svn: 55828	2008-09-05 08:13:35 +00:00
Dan Gohman	921ddd69ba	Fix a search+replace-o. llvm-svn: 55824	2008-09-05 01:58:21 +00:00
Dale Johannesen	f2a52bbee5	Add -flimit-float-precision to enable some faster, but less accurate (non-IEEE) code sequences for certain math library functions. Add the first of several such expansions. Don't worry, if you don't turn it on it won't affect you. llvm-svn: 55823	2008-09-05 01:48:15 +00:00
Dan Gohman	ea56bdde34	FastISel support for unreachable. llvm-svn: 55818	2008-09-05 01:08:41 +00:00
Dan Gohman	5b4a9f4a69	In FastISel mode, the scheduler may be invoked multiple times in the same block. Fix the entry-block handling to only run at at the beginning of the entry block, and not any other times. llvm-svn: 55817	2008-09-05 01:07:48 +00:00
Owen Anderson	50288e3c99	Add initial support for selecting constant materializations that require constant pool loads on X86 in fast isel. This isn't actually used yet. llvm-svn: 55814	2008-09-05 00:06:23 +00:00
Dan Gohman	5eba3bcac6	Add an include of SmallSet.h. llvm-svn: 55793	2008-09-04 20:49:27 +00:00
Dan Gohman	a79db30d28	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Dan Gohman	634412fe35	Clean up uses of TargetLowering::getTargetMachine. llvm-svn: 55769	2008-09-04 15:39:15 +00:00
Dale Johannesen	da2d80688b	Add intrinsics for log, log2, log10, exp, exp2. No functional change (and no FE change to generate them). llvm-svn: 55753	2008-09-04 00:47:13 +00:00
Dan Gohman	e039d5580e	Do trivial local CSE for constants and other non-Instruction values in FastISel. llvm-svn: 55748	2008-09-03 23:32:19 +00:00
Dan Gohman	45df9951f5	Put RegsForValue in the llvm namespace to avoid warnings about classes in the llvm namespace having members with types from anonymous namespaces. llvm-svn: 55747	2008-09-03 23:18:39 +00:00
Dan Gohman	7bda51f5a4	Create HandlePHINodesInSuccessorBlocksFast, a version of HandlePHINodesInSuccessorBlocks that works FastISel-style. This allows PHI nodes to be updated correctly while using FastISel. This also involves some code reorganization; ValueMap and MBBMap are now members of the FastISel class, so they needn't be passed around explicitly anymore. Also, SelectInstructions is changed to SelectInstruction, and only does one instruction at a time. llvm-svn: 55746	2008-09-03 23:12:08 +00:00
Owen Anderson	b1b9398ea7	Oops, I accidentally broke the fallback case with my last commit. llvm-svn: 55704	2008-09-03 17:51:57 +00:00
Owen Anderson	ea666816c2	Fix an issue where we were reusing materializations of constants in blocks not dominated by the materialization. This is the simple fix, materializing the constant before every use. It might be better to either track domination of uses or to materialize all constants and the beginning of the function and let remat sort when to do materialization at uses. llvm-svn: 55703	2008-09-03 17:37:03 +00:00
Dan Gohman	575fad337c	Split the SelectionDAG-building code, including the FunctionLoweringInfo and SelectionDAGLowering classes, out of SelectionDAGISel.cpp and put it in a separate file, SelectionDAGBuild.cpp. llvm-svn: 55701	2008-09-03 16:12:24 +00:00
Dan Gohman	b10f1a5c60	Separate MachineInstr-emitting routines from actual scheduling routines and move them into a separate file, ScheduleDAGEmit.cpp. llvm-svn: 55699	2008-09-03 16:01:59 +00:00
Evan Cheng	31ddd09f4a	If TargetSelectInstruction returns true, move to next instruction. llvm-svn: 55692	2008-09-03 06:43:41 +00:00
Evan Cheng	09ff2e7372	80 col violations. llvm-svn: 55668	2008-09-02 21:59:13 +00:00
Dan Gohman	115267fdc6	Ensure that HandlePHINodesInSuccessorBlocks is run for all blocks, even in FastISel mode in the case where FastISel successfully selects all the instructions. llvm-svn: 55641	2008-09-02 20:17:56 +00:00
Gabor Greif	9c64e61176	Provide two overloads of AnalyzeNewNode. The first can update the SDNode in an SDValue while the second is called with SDNode* and returns a possibly updated SDNode*. This patch has no intended functional impact, but helps eliminating ugly temporary SDValues. llvm-svn: 55608	2008-09-01 15:10:19 +00:00
Duncan Sands	4b31a2a7ce	Even though no caller actually uses the new value (what matters is that it is added to the worklist), it seems more logical to return it. llvm-svn: 55606	2008-09-01 13:11:13 +00:00
Bill Wendling	11284ea499	Another situation where ROTR is cheaper than ROTL. llvm-svn: 55577	2008-08-31 01:13:31 +00:00
Bill Wendling	4822a7ac8a	For this pattern, ROTR is the cheaper option. llvm-svn: 55576	2008-08-31 01:04:56 +00:00
Bill Wendling	fc72416447	- Fix comment so that it describes how the code really works: // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotl x, y) // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotr x, (sub 32, y)) Example: (x == 0xDEADBEEF and y == 4) (x << 4) \| (x >> 28) => 0xEADBEEF0 \| 0x0000000D => 0xEADBEEFD (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => 0xEADBEEFD - Fix comment and code for second version. It wasn't using the rot* propertly. // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotr x, y) // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotl x, (sub 32, y)) (x << 28) \| (x >> 4) => 0xD0000000 \| 0x0DEADBEE => 0xDDEADBEE (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => (0xEADBEEFD) llvm-svn: 55575	2008-08-31 00:37:27 +00:00
Gabor Greif	66ccf603a9	typo llvm-svn: 55574	2008-08-30 22:16:05 +00:00
Gabor Greif	e12264bf41	fix some 80-col violations llvm-svn: 55571	2008-08-30 19:29:20 +00:00
Evan Cheng	cfb7f3abdf	Transform (x << (y&31)) -> (x << y). This takes advantage of the fact x86 shift instructions 2nd operand (shift count) is limited to 0 to 31 (or 63 in the x86-64 case). llvm-svn: 55558	2008-08-30 02:03:58 +00:00
Owen Anderson	6f0c51d9da	Fix an issue where a use might be selected before a def, and then we didn't respect the pre-chosen vreg assignment when selecting the def. This is the naive solution to the problem: insert a copy to the pre-chosen vreg. Other solutions might be preferable, such as: 1) Passing the dest reg into FastEmit_. However, this would require the higher level code to know about reg classes, which they don't currently. 2) Selecting blocks in reverse postorder. This has some compile time cost for computing the order, and we'd need to measure its impact. llvm-svn: 55555	2008-08-30 00:38:46 +00:00
Evan Cheng	894be333f1	Fix 80 col. violations. llvm-svn: 55551	2008-08-29 23:20:46 +00:00
Evan Cheng	5e7658c2e4	Back out 55498. It broken Apple style bootstrapping. llvm-svn: 55549	2008-08-29 22:21:44 +00:00
Dan Gohman	d58f3e36d0	Add a target callback for FastISel. llvm-svn: 55512	2008-08-28 23:21:34 +00:00
Gabor Greif	f304a7aa4d	erect abstraction boundaries for accessing SDValue members, rename Val -> Node to reflect semantics llvm-svn: 55504	2008-08-28 21:40:38 +00:00
Dan Gohman	c45733f194	Implement null and undef values for FastISel. llvm-svn: 55500	2008-08-28 21:19:07 +00:00
Dan Gohman	f27e33baa7	Optimize DAGCombiner's worklist processing. Previously it started its work by putting all nodes in the worklist, requiring a big dynamic allocation. Now, DAGCombiner just iterates over the AllNodes list and maintains a worklist for nodes that are newly created or need to be revisited. This allows the worklist to stay small in most cases, so it can be a SmallVector. This has the side effect of making DAGCombine not miss a folding opportunity in alloca-align-rounding.ll. llvm-svn: 55498	2008-08-28 21:01:56 +00:00
Dan Gohman	17da671922	Move CaseBlock, JumpTable, and BitTestBlock to be members of SelectionDAGLowering instead of being in an anonymous namespace. This fixes warnings about SelectionDAGLowering having fields using anonymous namespaces. llvm-svn: 55497	2008-08-28 20:38:18 +00:00
Dan Gohman	360c57f683	Fix a FastISel bug where the instructions from lowering the arguments were being emitted after the first instructions of the entry block. llvm-svn: 55496	2008-08-28 20:28:56 +00:00
Rafael Espindola	6c8a99a778	Reduce the size of the Parts vector. llvm-svn: 55483	2008-08-28 18:29:58 +00:00
Owen Anderson	d8a82b75e2	Hook up support for fast-isel of trunc instructions, using the newly working support for EXTRACT_SUBREG. llvm-svn: 55482	2008-08-28 18:26:01 +00:00
Owen Anderson	9cd1a5e530	FastEmitInst_extractsubreg doesn't need to be passed the register class. It can get it from MachineRegisterInfo instead. llvm-svn: 55476	2008-08-28 17:47:37 +00:00
Rafael Espindola	029c1c8460	Correctly resize the Parts array. llvm-svn: 55471	2008-08-28 14:24:45 +00:00
Dale Johannesen	41be0d4445	Split the ATOMIC NodeType's to include the size, e.g. ATOMIC_LOAD_ADD_{8,16,32,64} instead of ATOMIC_LOAD_ADD. Increased the Hardcoded Constant OpActionsCapacity to match. Large but boring; no functional change. This is to support partial-word atomics on ppc; i8 is not a valid type there, so by the time we get to lowering, the ATOMIC_LOAD nodes looks the same whether the type was i8 or i32. The information can be added to the AtomicSDNode, but that is the largest SDNode; I don't fully understand the SDNode allocation, but it is sensitive to the largest node size, so increasing that must be bad. This is the alternative. llvm-svn: 55457	2008-08-28 02:44:49 +00:00
Dan Gohman	e1a9a780a5	Reorganize the lifetimes of the major objects SelectionDAGISel works with. SelectionDAG, FunctionLoweringInfo, and SelectionDAGLowering objects now get created once per SelectionDAGISel instance, and can be reused across blocks and across functions. Previously, they were created and destroyed each time they were needed. This reorganization simplifies the handling of PHI nodes, and also SwitchCases, JumpTables, and BitTestBlocks. This simplification has the side effect of fixing a bug in FastISel where successor PHI nodes weren't being updated correctly. This is also a step towards making the transition from FastISel into and out of SelectionDAG faster, and also making plain SelectionDAG faster on code with lots of little blocks. llvm-svn: 55450	2008-08-27 23:52:12 +00:00
Owen Anderson	5f57bc2247	Add a helper method that will be used to support EXTRACT_SUBREG for selecting trunc's in fast-isel. llvm-svn: 55439	2008-08-27 22:30:02 +00:00
Dan Gohman	61cfa3095d	Fix FastISel's bitcast code for the case where getRegForValue fails. llvm-svn: 55431	2008-08-27 20:41:38 +00:00
Owen Anderson	90609850b2	Use TargetLowering to get the types in fast isel, which handles pointer types correctly for our purposes. llvm-svn: 55428	2008-08-27 18:58:30 +00:00
Dan Gohman	d01789be23	Don't check TLI.getOperationAction. The FastISel way is to just try to do the action and let the tablegen-generated code determine if there is target-support for an operation. llvm-svn: 55427	2008-08-27 18:15:05 +00:00
Dan Gohman	b0b5a27438	Add a new FastISel method, getRegForValue, which takes care of the details of materializing constants and other values into registers, and make use of it in several places. llvm-svn: 55426	2008-08-27 18:10:19 +00:00
Dan Gohman	f2a6c1579f	Add a comment about the current floating-point constant code in FastISel. llvm-svn: 55425	2008-08-27 18:01:42 +00:00
Dan Gohman	3a3a52de58	Optimize ScheduleDAGRRList's topological sort to use one pass instead of two, and to not need a scratch std::vector. Also, compute the ordering immediately in the result array, instead of in another scratch std::vector that is copied to the result array. llvm-svn: 55421	2008-08-27 16:29:48 +00:00
Dan Gohman	9cbdedcbcf	Optimize ScheduleDAG's ComputeDepths and ComputeHeights to not need a scratch std::vector. llvm-svn: 55420	2008-08-27 16:27:25 +00:00
Dan Gohman	5ca269e684	Basic FastISel support for floating-point constants. llvm-svn: 55401	2008-08-27 01:09:54 +00:00
Owen Anderson	54aff7bb23	Fix handling of inttoptr and ptrtoint when unhandled operands are present. llvm-svn: 55400	2008-08-27 00:35:37 +00:00
Owen Anderson	140549256f	Add support for fast isel of inttoptr and ptrtoint in the cases where truncation is not needed. llvm-svn: 55399	2008-08-27 00:31:01 +00:00
Owen Anderson	ca1711a5b5	Factor out a large amoutn of the cast handling code in fast isel into helper methods. This simultaneously makes the code simpler and adds support for sext as well. llvm-svn: 55398	2008-08-26 23:46:32 +00:00
Owen Anderson	343310a715	Add support for fast isel of zext. llvm-svn: 55396	2008-08-26 23:14:49 +00:00
Gabor Greif	abfdf928d8	disallow direct access to SDValue::ResNo, provide a getter instead llvm-svn: 55394	2008-08-26 22:36:50 +00:00
Owen Anderson	655c1dc63d	Add support for fptosi of constants in fast isel. llvm-svn: 55393	2008-08-26 22:34:28 +00:00
Dan Gohman	d56f73f2f2	Optimize SelectionDAG's topological sort to use one pass instead of two, and to not need a scratch std::vector. Also, use the SelectionDAG's topological sort in LegalizeDAG instead of having a separate implementation. llvm-svn: 55389	2008-08-26 21:42:18 +00:00
Dan Gohman	6fda9208d9	Refactor the bitcast code into its own function. llvm-svn: 55387	2008-08-26 21:28:54 +00:00
Dan Gohman	b5e04bfb18	Make FastISel use the correct argument type when casting GEP indices. llvm-svn: 55384	2008-08-26 20:57:08 +00:00
Dan Gohman	3bcbbece19	Don't select binary instructions with illegal types. llvm-svn: 55383	2008-08-26 20:52:40 +00:00
Owen Anderson	3c4dc434ee	Add support for fast isel of sitofp, and remove some unnecessary and imprecise legality checks. llvm-svn: 55381	2008-08-26 20:37:00 +00:00
Owen Anderson	e0ac9765b2	Use a combination of copyRegToReg and ISD::BIT_CONVERT when doing fast isel of bitcasts, allowing it to support the full range of conversions people might ask for in a correct manner. llvm-svn: 55378	2008-08-26 18:51:24 +00:00
Owen Anderson	27fb3dcbc7	Make TargetInstrInfo::copyRegToReg return a bool indicating whether the copy requested was inserted or not. This allows bitcast in fast isel to properly handle the case where an appropriate reg-to-reg copy is not available. llvm-svn: 55375	2008-08-26 18:03:31 +00:00
Owen Anderson	bf05ebaccf	Add support for fast isel of non-constant fptosi instructions. llvm-svn: 55373	2008-08-26 17:44:42 +00:00
Chris Lattner	54ef9f5831	typo fix. llvm-svn: 55355	2008-08-26 06:07:47 +00:00
Dan Gohman	2e834906b9	Actually recycle SDNode allocations. SelectionDAG is using RecyclingAllocator, but this change is needed for the nodes to actually be recycled. This cuts SelectionDAG's memory usage high-water-mark in half in some cases. llvm-svn: 55351	2008-08-26 01:44:34 +00:00
Owen Anderson	8dd01ccdd8	Add a RetVT parameter to emitted FastISel methods, so that we will be able to pass the desired return type down. This is not currently used. llvm-svn: 55345	2008-08-25 23:58:18 +00:00
Evan Cheng	2c067325d6	Unbreak build. llvm-svn: 55342	2008-08-25 22:20:39 +00:00
Owen Anderson	126afc5cb9	Expand bitcast support in fast isel to support bitcasts of non-constant values by emitting reg-reg copies. llvm-svn: 55340	2008-08-25 21:32:34 +00:00
Owen Anderson	32635dbfb2	Add support for fast isel of (integer) immediate materialization pattens, and use them to support bitcast of constants in fast isel. llvm-svn: 55325	2008-08-25 20:20:32 +00:00
Chris Lattner	f4bd5cf3dd	make sure to flush the stream after dumping, to make sure it goes out immediately. llvm-svn: 55288	2008-08-24 18:28:30 +00:00
Chris Lattner	838aff36dd	get MachineConstantPool off std::ostream, onto raw_ostream. It would be really nice if someone converted MachineFunction::print to raw_ostream. llvm-svn: 55268	2008-08-23 22:53:13 +00:00
Chris Lattner	0c19df4871	Switch the asmprinter (.ll) and all the stuff it requires over to use raw_ostream instead of std::ostream. Among other goodness, this speeds up llvm-dis of kc++ with a release build from 0.85s to 0.49s (88% faster). Other interesting changes: 1) This makes Value::print be non-virtual. 2) AP[S]Int and ConstantRange can no longer print to ostream directly, use raw_ostream instead. 3) This fixes a bug in raw_os_ostream where it didn't flush itself when destroyed. 4) This adds a new SDNode::print method, instead of only allowing "dump". A lot of APIs have both std::ostream and raw_ostream versions, it would be useful to go through and systematically anihilate the std::ostream versions. This passes dejagnu, but there may be minor fallout, plz let me know if so and I'll fix it. llvm-svn: 55263	2008-08-23 22:23:09 +00:00
Dan Gohman	48a3623591	Make MBBMap a DenseMap instead of a std::map. llvm-svn: 55220	2008-08-23 02:44:46 +00:00
Dan Gohman	eb0cee91f6	Move the point at which FastISel taps into the SelectionDAGISel process up to a higher level. This allows FastISel to leverage more of SelectionDAGISel's infastructure, such as updating Machine PHI nodes. Also, implement transitioning from SDISel back to FastISel in the middle of a block, so it's now possible to go back and forth. This allows FastISel to hand individual CallInsts and other complicated things off to SDISel to handle, while handling the rest of the block itself. To help support this, reorganize the SelectionDAG class so that it is allocated once and reused throughout a function, instead of being completely reallocated for each block. llvm-svn: 55219	2008-08-23 02:25:05 +00:00
Dan Gohman	95d1056831	Avoid creating shift-by-zero SDNodes in the common case of i8* getelementptr. DAGCombine eliminates these, but this is a fairly common case. llvm-svn: 55214	2008-08-23 01:06:51 +00:00
Dan Gohman	ac37f9a9be	Move SelectionDAG's constructor out of line. llvm-svn: 55212	2008-08-23 00:50:30 +00:00
Dan Gohman	2db3f8a095	Reapply r55191 and r55192. llvm-svn: 55205	2008-08-22 21:28:19 +00:00
Bill Wendling	fc4f64eed0	Reverting r55190, r55191, and r55192. They broke the build with this error message: {standard input}:17:bad register name `%sil' make[4]: * [libgcc/./_addvsi3.o] Error 1 make[4]: * Waiting for unfinished jobs.... {standard input}:23:bad register name `%dil' {standard input}:28:bad register name `%dil' make[4]: * [libgcc/./_addvdi3.o] Error 1 {standard input}:18:bad register name `%sil' make[4]: * [libgcc/./_subvsi3.o] Error 1 llvm-svn: 55200	2008-08-22 20:51:05 +00:00
Dan Gohman	04968da460	Fix the InsertBranch call. llvm-svn: 55192	2008-08-22 19:26:10 +00:00
Dan Gohman	87ff7058e7	Support non-fallthrough unconditional branches in FastISel. llvm-svn: 55191	2008-08-22 19:21:41 +00:00
Dan Gohman	a2292c0d34	Add FastISel support for PHINodes. Machine PHI nodes are not yet updated properly, but that's a separate task. llvm-svn: 55187	2008-08-22 17:37:48 +00:00
Dan Gohman	49e19e906f	Factor out the predicate check code from DAGISelEmitter.cpp and use it in FastISelEmitter.cpp, and make FastISel subtarget aware. Among other things, this lets it work properly on x86 targets that don't have SSE, where it successfully selects x87 instructions. llvm-svn: 55156	2008-08-22 00:20:26 +00:00
Dan Gohman	2af34bd309	Add libcalls for the new rounding opcodes. llvm-svn: 55133	2008-08-21 18:38:14 +00:00
Dan Gohman	c6337ac069	Add libm-oriented ISD opcodes for rounding operations. llvm-svn: 55130	2008-08-21 17:55:02 +00:00
Dan Gohman	6a7461ad9b	Have FastISel skip the multiply by 1 for getelementptr on i8*. llvm-svn: 55129	2008-08-21 17:37:05 +00:00
Dan Gohman	efb7d2d03d	MVT::getMVT uses iPTR for pointer types, while we need the actual intptr_t type in this case. FastISel can now select simple getelementptr instructions. llvm-svn: 55125	2008-08-21 17:25:26 +00:00
Dan Gohman	75ea0b83c5	Elements in DeadNodeSet are checked for use_empty() before they are actually deleted, so it's not necessary to remove re-used nodes from the set. llvm-svn: 55123	2008-08-21 16:24:54 +00:00
Dan Gohman	fe9056584b	Basic fast-isel support for instructions with constant int operands. llvm-svn: 55099	2008-08-21 01:41:07 +00:00
Evan Cheng	4b5c038cd0	Type of first GEP operand is always the same as the target pointer type. llvm-svn: 55097	2008-08-21 01:19:11 +00:00
Dan Gohman	6a0780cdd7	Fix unused variable warnings. llvm-svn: 55089	2008-08-20 23:53:10 +00:00
Evan Cheng	864fcc198d	First cut, un-optimized (and untested) fast isel lowering of GetElementPtrInst. llvm-svn: 55085	2008-08-20 22:45:34 +00:00
Dan Gohman	a4305cec93	Simplify the BuildMI calls even more. llvm-svn: 55077	2008-08-20 21:10:53 +00:00
Dan Gohman	02c84b8910	Simplify FastISel's constructor argument list, make the FastISel class hold a MachineRegisterInfo member, and make the MachineBasicBlock be passed in to SelectInstructions rather than the FastISel constructor. llvm-svn: 55076	2008-08-20 21:05:57 +00:00
Dan Gohman	43d1c7c607	Dump the instruction that foiled ISel even when -debug is not used. llvm-svn: 55075	2008-08-20 20:47:32 +00:00
Dan Gohman	07a34a5f69	Make more use of the BuildMI API. llvm-svn: 55072	2008-08-20 18:16:32 +00:00
Dan Gohman	24e8f0cfe6	Minor code reorganization. llvm-svn: 55071	2008-08-20 18:10:48 +00:00
Dan Gohman	2471f6ce0f	Minor whitespace cleanup. llvm-svn: 55070	2008-08-20 18:09:38 +00:00
Dan Gohman	39a5ffb03f	Fix 80 column violation. llvm-svn: 55069	2008-08-20 18:09:02 +00:00
Evan Cheng	7b9cd58596	Kill off SimpleBBISel, it's replaced by FastISel. llvm-svn: 55067	2008-08-20 17:50:32 +00:00
Dan Gohman	837c13a029	Disable DAGCombine's alignment inference in "fast" codegen mode. llvm-svn: 55059	2008-08-20 16:30:28 +00:00
Dan Gohman	2da2bedc72	Change the FoldingSetNodeID usage for objects which carry alignment and volatility information, such as loads and stores, to reduce the number of integer values added to the FoldingSetNodeID. llvm-svn: 55058	2008-08-20 15:58:01 +00:00
Dan Gohman	f6aa60ff71	Use BitVector instead of std::vector<unsigned char>. llvm-svn: 55054	2008-08-20 14:58:41 +00:00
Dan Gohman	c63a46ef39	Avoid an empty-if-body warning in release builds. llvm-svn: 55050	2008-08-20 14:00:56 +00:00
Dan Gohman	e8f9a00424	Fix FastISel to recognize that the last block in the function does not have a fall-through successor. llvm-svn: 55033	2008-08-20 01:17:01 +00:00
Dan Gohman	98265cae87	Fix a leak in the FastISel code that Chris pointed out. llvm-svn: 55031	2008-08-20 00:56:17 +00:00
Dan Gohman	847ebb90b8	Add support for running SelectionDAG if FastISel fails. This is under a command-line option, so that the default behavior is an abort, which is useful for exposing code that isn't supported yet. llvm-svn: 55028	2008-08-20 00:47:54 +00:00
Dan Gohman	f6884373c2	Fix FastISel to recognize unhandled operands, such as constants that aren't available as virtual registers (for now). llvm-svn: 55026	2008-08-20 00:35:17 +00:00
Dan Gohman	b16a7783c5	Add FastISel support for floating-point operations. llvm-svn: 55021	2008-08-20 00:23:20 +00:00
Dan Gohman	a3e4d5a5e1	Add FastISel support for several more binary operators. llvm-svn: 55020	2008-08-20 00:11:48 +00:00
Dan Gohman	697284fe0a	Add code to call FastISel, and a command-line option to enable it. llvm-svn: 55015	2008-08-19 22:33:34 +00:00
Dan Gohman	214343fbbe	Support unconditional fall-through branches in FastISel. llvm-svn: 55014	2008-08-19 22:31:46 +00:00
Dan Gohman	547ce65467	Use the BuildMI overload that sets up a destination register instead of the one that doesn't and then adding it manually. llvm-svn: 55006	2008-08-19 20:46:54 +00:00
Dan Gohman	c55fdcc935	Handle the case where target-specific fastisel code doesn't have a desired opcode. llvm-svn: 55005	2008-08-19 20:43:22 +00:00
Chris Lattner	5d2a9a4ae6	don't use the result of WriteTypeSymbolic or WriteAsOperand. llvm-svn: 54978	2008-08-19 04:44:30 +00:00
Gordon Henriksen	d930f913e6	Rename some GC classes so that their roll will hopefully be clearer. In particular, Collector was confusing to implementors. Several thought that this compile-time class was the place to implement their runtime GC heap. Of course, it doesn't even exist at runtime. Specifically, the renames are: Collector -> GCStrategy CollectorMetadata -> GCFunctionInfo CollectorModuleMetadata -> GCModuleInfo CollectorRegistry -> GCRegistry Function::getCollector -> getGC (setGC, hasGC, clearGC) Several accessors and nested types have also been renamed to be consistent. These changes should be obvious. llvm-svn: 54899	2008-08-17 18:44:35 +00:00
Gordon Henriksen	bcef14d2e4	Factor GC metadata table assembly generation out of Collector in preparation for splitting AsmPrinter into its own library. llvm-svn: 54881	2008-08-17 12:56:54 +00:00
Chris Lattner	17f7165f84	Rework the routines that convert AP[S]Int into a string. Now, instead of returning an std::string by value, it fills in a SmallString/SmallVector passed in. This significantly reduces string thrashing in some cases. More specifically, this: - Adds an operator<< and a print method for APInt that allows you to directly send them to an ostream. - Reimplements APInt::toString to be much simpler and more efficient algorithmically in addition to not thrashing strings quite as much. This speeds up llvm-dis on kc++ by 7%, and may also slightly speed up the asmprinter. This also fixes a bug I introduced into the asmwriter in a previous patch w.r.t. alias printing. llvm-svn: 54873	2008-08-17 07:19:36 +00:00
Dan Gohman	c44423853a	Make FastISel's constructor protected, and give it a destructor. llvm-svn: 54793	2008-08-14 21:51:29 +00:00
Dan Gohman	550c9af91f	Improve support for vector casts in LLVM IR and CodeGen. llvm-svn: 54784	2008-08-14 20:04:46 +00:00
Dan Gohman	6134fbccef	Fix a bogus srem rule - a negative value srem'd by a power-of-2 can have a non-negative result; for example, -16%16 is 0. Also, clarify the related comments. This fixes PR2670. llvm-svn: 54767	2008-08-13 23:12:35 +00:00
Dan Gohman	7e3c392248	Allow SelectionDAG to create EXTRACT_VECTOR_ELT nodes with non-constant indices. Only a few of the peephole checks require a constant index. llvm-svn: 54764	2008-08-13 21:51:37 +00:00
Dan Gohman	b2226e21c3	Initial checkin of the new "fast" instruction selection support. See the comments in FastISelEmitter.cpp for details on what this is. This is currently experimental and unusable. llvm-svn: 54751	2008-08-13 20:19:35 +00:00
Dan Gohman	a7b8aed469	Rename SelectionDAGISel's FastISel to Fast, to begin to make room for the new FastISel instruction selection code. llvm-svn: 54749	2008-08-13 19:47:40 +00:00
Dan Gohman	23785a1679	Correct the filename in the top-of-file comment. llvm-svn: 54688	2008-08-12 17:42:33 +00:00
Dan Gohman	127bb03b8c	Take the FrameOffset into account when computing the alignment of stack objects. This fixes PR2656. llvm-svn: 54646	2008-08-11 18:27:03 +00:00
Evan Cheng	38aa7de6e9	Add skeleton of simple basic block instruction selector. llvm-svn: 54522	2008-08-08 07:27:28 +00:00
Bruno Cardoso Lopes	de5161fdf2	Add the remaining fp_round libcalls: FPROUND_F80_F32, FPROUND_PPCF128_F32, FPROUND_F80_F64, FPROUND_PPCF128_F64 Support for soften float fp_round operands is added, Mips needs this to round f64->f32. Also added support to soften float FABS result, Mips doesn't support double fabs results while in 'single float only' mode. llvm-svn: 54484	2008-08-07 19:01:24 +00:00
Evan Cheng	0638115a6e	Factor code that finalize PHI nodes, jump tables, etc. out of SelectBasicBlock. No functionality changes. llvm-svn: 54438	2008-08-07 00:43:25 +00:00
Owen Anderson	7c42ac4133	Remove the -disable-correct-folding option, which was ugly and is no longer needed. llvm-svn: 54361	2008-08-05 18:27:54 +00:00
Dan Gohman	e955c481fd	Fix several const-correctness issues, resolving some -Wcast-qual warnings. llvm-svn: 54349	2008-08-05 14:45:15 +00:00
Owen Anderson	bbeb8f0807	This option doesn't need to be a target option. It can be in SDISel instead. llvm-svn: 54336	2008-08-05 00:27:28 +00:00
Owen Anderson	a102290bdc	- Fix SelectionDAG to generate correct CFGs. - Add a basic machine-level dead block eliminator. These two have to go together, since many other parts of the code generator are unable to handle the unreachable blocks otherwise created. llvm-svn: 54333	2008-08-04 23:54:43 +00:00
Dan Gohman	90c724cadc	Fix SDISel lowering of PHI nodes to use ComputeValueVTs. This allows it to work correctly on aggregate values. This fixes PR2623. llvm-svn: 54331	2008-08-04 23:42:46 +00:00
Dan Gohman	6e023e63cd	Fix SDISel lowering of zeroinitializer and undef to use ComputeValueVTs. This allows it to work correctly on nested aggregate values. This fixes PR2625. llvm-svn: 54330	2008-08-04 23:30:41 +00:00
Dale Johannesen	c31eb205c1	Add a flag to disable jump table generation (all switches use the binary search algorithm) for environments that don't support it. PPC64 JIT is such an environment; turn the flag on for that. llvm-svn: 54248	2008-07-31 18:13:12 +00:00
Dan Gohman	345d63ccf2	Improve dagcombining for sext-loads and sext-in-reg nodes. llvm-svn: 54239	2008-07-31 00:50:31 +00:00
Dan Gohman	88e0df0c91	Move SelectionDAG::viewGraph() out of line; as an inline function it isn't always visible to gdb. llvm-svn: 54228	2008-07-30 18:48:53 +00:00
Dan Gohman	2fe4352691	Don't look for leaf values to store when lowering stores of empty structs. This fixes PR2612. llvm-svn: 54226	2008-07-30 18:36:51 +00:00
Nate Begeman	82f1925708	Fix broken CellSPU lowering, re-instate braces in Legalize llvm-svn: 54168	2008-07-29 19:07:27 +00:00
Nate Begeman	d63495ff25	Disable a fix in the previous patch, since it breaks CellSPU. The CellSPU codegen is broken, but needs to be fixed before we can put this back in. llvm-svn: 54164	2008-07-29 18:28:31 +00:00
Nate Begeman	fecbc8cff1	Add vector shifts to the IR, patch by Eli Friedman. CodeGen & Clang work coming next. llvm-svn: 54161	2008-07-29 15:49:41 +00:00
Dan Gohman	804c95df52	Fold the useful features of alist and alist_node into ilist, and a new ilist_node class, and remove them. Unlike alist_node, ilist_node doesn't attempt to manage storage itself, so it avoids the associated problems, including being opaque in gdb. Adjust the Recycler class so that it doesn't depend on alist_node. Also, change it to use explicit Size and Align parameters, allowing it to work when the largest-sized node doesn't have the greatest alignment requirement. Change MachineInstr's MachineMemOperand list from a pool-backed alist to a std::list for now. llvm-svn: 54146	2008-07-28 21:51:04 +00:00
Dan Gohman	68e45a361b	Make the ScheduleDAG's GraphRoot edge be blue and dashed too, like the SelectionDAG's. llvm-svn: 54129	2008-07-27 22:46:49 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Dan Gohman	91e5dcb680	Tidy SDNode::use_iterator, and complete the transition to have it parallel its analogue, Value::value_use_iterator. The operator* method now returns the user, rather than the use. llvm-svn: 54127	2008-07-27 20:43:25 +00:00
Dan Gohman	bb5f43ed4d	Rename isOnlyUseOf to isOnlyUserOf. llvm-svn: 54124	2008-07-27 18:06:42 +00:00
Duncan Sands	d9374421ea	Some binary operations were being treated as unary operations! Add support for softening some additional unary operations like fp_to_sint. llvm-svn: 54122	2008-07-27 12:28:43 +00:00

... 20 21 22 23 24 ...

4711 Commits