llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Gohman	8de6d22392	Use empty() instead of begin() == end(). llvm-svn: 54780	2008-08-14 18:13:49 +00:00
Owen Anderson	99e911fb16	Get rid of a use of std::map. llvm-svn: 54770	2008-08-13 23:36:23 +00:00
Dan Gohman	6134fbccef	Fix a bogus srem rule - a negative value srem'd by a power-of-2 can have a non-negative result; for example, -16%16 is 0. Also, clarify the related comments. This fixes PR2670. llvm-svn: 54767	2008-08-13 23:12:35 +00:00
Owen Anderson	706f6b7899	Expunge the last uses of std::map from LiveIntervals. llvm-svn: 54766	2008-08-13 22:28:50 +00:00
Owen Anderson	767b5cc7fd	Move r2iMap_ over to DenseMap from std::map. llvm-svn: 54765	2008-08-13 22:08:30 +00:00
Dan Gohman	7e3c392248	Allow SelectionDAG to create EXTRACT_VECTOR_ELT nodes with non-constant indices. Only a few of the peephole checks require a constant index. llvm-svn: 54764	2008-08-13 21:51:37 +00:00
Owen Anderson	51f689a652	Make the allocation of LiveIntervals explicit, rather than holding them in the r2iMap_ by value. This will prevent references to them from being invalidated if the map is changed. llvm-svn: 54763	2008-08-13 21:49:13 +00:00
Dan Gohman	b2226e21c3	Initial checkin of the new "fast" instruction selection support. See the comments in FastISelEmitter.cpp for details on what this is. This is currently experimental and unusable. llvm-svn: 54751	2008-08-13 20:19:35 +00:00
Dan Gohman	a7b8aed469	Rename SelectionDAGISel's FastISel to Fast, to begin to make room for the new FastISel instruction selection code. llvm-svn: 54749	2008-08-13 19:47:40 +00:00
Owen Anderson	ef96ac4f95	Get rid of unused variable. llvm-svn: 54742	2008-08-13 17:44:52 +00:00
Owen Anderson	65fce4d813	1) Merge entire live intervals instead of parts of them. 2) Conditionalize temporary insertion if we don't need it. llvm-svn: 54741	2008-08-13 17:25:42 +00:00
Dan Gohman	23785a1679	Correct the filename in the top-of-file comment. llvm-svn: 54688	2008-08-12 17:42:33 +00:00
Dan Gohman	127bb03b8c	Take the FrameOffset into account when computing the alignment of stack objects. This fixes PR2656. llvm-svn: 54646	2008-08-11 18:27:03 +00:00
Gordon Henriksen	ada201c8c1	Fix some typos. Apparently I think C needs a power-of operator. llvm-svn: 54574	2008-08-09 03:48:46 +00:00
Eric Christopher	5927883970	Have IRBuilder take a template argument on whether or not to preserve names. This can save a lot of allocations if you aren't going to be looking at the output. llvm-svn: 54546	2008-08-08 19:39:37 +00:00
Anton Korobeynikov	ed47329174	Handle visibility printing with all generality. Remove bunch of duplicate code. llvm-svn: 54540	2008-08-08 18:25:07 +00:00
Owen Anderson	dfb0b6952a	Reduce the entries in a phi before testing it for deadness, because removing the entries might make it dead. llvm-svn: 54535	2008-08-08 18:00:05 +00:00
Evan Cheng	38aa7de6e9	Add skeleton of simple basic block instruction selector. llvm-svn: 54522	2008-08-08 07:27:28 +00:00
Nick Lewycky	42a19b6933	Don't crash printing the asm for a ConstantExpr PtrToInt just because the int is narrower than the pointer. This testcase emits: .byte (((17) - 16) & 255) llvm-svn: 54517	2008-08-08 06:34:07 +00:00
Bruno Cardoso Lopes	de5161fdf2	Add the remaining fp_round libcalls: FPROUND_F80_F32, FPROUND_PPCF128_F32, FPROUND_F80_F64, FPROUND_PPCF128_F64 Support for soften float fp_round operands is added, Mips needs this to round f64->f32. Also added support to soften float FABS result, Mips doesn't support double fabs results while in 'single float only' mode. llvm-svn: 54484	2008-08-07 19:01:24 +00:00
Owen Anderson	d172c15ab0	Do a dominator walk when scheduling copies, rather than a DFS on the CFG. Also, fix a few problems when creating live intervals for temporaries created by phi elimination. llvm-svn: 54483	2008-08-07 18:28:07 +00:00
Dan Gohman	527ca7e253	Re-enable elimination of unnecessary SUBREG_TO_REG instructions in LowerSubregs, and fix an x86-64 isel bug that this exposed. SUBREG_TO_REG for x86-64 implicit zero extension is only safe for isel to generate when the source is known to always have zeros in the high 32 bits. The EXTRACT_SUBREG instruction does not clear the high 32 bits. llvm-svn: 54444	2008-08-07 02:54:50 +00:00
Evan Cheng	0638115a6e	Factor code that finalize PHI nodes, jump tables, etc. out of SelectBasicBlock. No functionality changes. llvm-svn: 54438	2008-08-07 00:43:25 +00:00
Owen Anderson	c6d527067b	SDISel's constant branch folding can fold away self-loops, which doesn't result in any dead blocks, but rather an incorrect phi input. Add code to UnreachableMachineBlockElim to get rid of these entries. llvm-svn: 54432	2008-08-06 23:16:52 +00:00
Owen Anderson	8a8d6f0a78	Correct handle cases where two phis are coalesced together, and correct break up the case where two different phis want to coalesce with the same vreg. llvm-svn: 54426	2008-08-06 22:08:58 +00:00
Owen Anderson	d184929176	Oops, didn't mean to commit this. llvm-svn: 54425	2008-08-06 20:58:38 +00:00
Owen Anderson	987b5057d3	We don't need to try to coalesce input vregs that are the same as the output vreg. llvm-svn: 54422	2008-08-06 20:29:20 +00:00
Owen Anderson	f9fca2f2dc	Only trim a live interval if the register is not used after the PHI node. llvm-svn: 54421	2008-08-06 18:36:17 +00:00
Owen Anderson	03dddbbed5	Only remap each VNInfo once when doing renumbering. llvm-svn: 54420	2008-08-06 18:35:45 +00:00
Owen Anderson	3d4c06dd54	Fix breakage on ARM/2008-04-10-ScavengerAssert.ll. llvm-svn: 54378	2008-08-05 22:24:40 +00:00
Evan Cheng	aa33b932bd	Fix PR2596: out of bound reference. llvm-svn: 54375	2008-08-05 21:51:46 +00:00
Owen Anderson	bdaed55ef3	Correctly handle replacement and removal of PHIs with one incoming register. llvm-svn: 54374	2008-08-05 21:40:45 +00:00
Owen Anderson	d9b88a85f2	Oops, we were already checking for dead phis. Handle this the proper way, then. llvm-svn: 54371	2008-08-05 21:18:51 +00:00
Owen Anderson	d4ffa4eb57	We don't need to update live intervals for dead PHIs. llvm-svn: 54369	2008-08-05 20:51:26 +00:00
Owen Anderson	7c42ac4133	Remove the -disable-correct-folding option, which was ugly and is no longer needed. llvm-svn: 54361	2008-08-05 18:27:54 +00:00
Dan Gohman	e955c481fd	Fix several const-correctness issues, resolving some -Wcast-qual warnings. llvm-svn: 54349	2008-08-05 14:45:15 +00:00
Evan Cheng	a4d6d884d6	Remove #if 0. llvm-svn: 54347	2008-08-05 07:20:57 +00:00
Evan Cheng	0ca10c9572	Fix PR2568: Fix bug that cause redudant kill marker after its live interval has been extended due to coalescing. llvm-svn: 54346	2008-08-05 07:10:38 +00:00
Owen Anderson	9f515394d3	Remove unneeded iteration. Thanks to Dan for the feedback. llvm-svn: 54337	2008-08-05 00:30:10 +00:00
Owen Anderson	bbeb8f0807	This option doesn't need to be a target option. It can be in SDISel instead. llvm-svn: 54336	2008-08-05 00:27:28 +00:00
Owen Anderson	a102290bdc	- Fix SelectionDAG to generate correct CFGs. - Add a basic machine-level dead block eliminator. These two have to go together, since many other parts of the code generator are unable to handle the unreachable blocks otherwise created. llvm-svn: 54333	2008-08-04 23:54:43 +00:00
Dan Gohman	90c724cadc	Fix SDISel lowering of PHI nodes to use ComputeValueVTs. This allows it to work correctly on aggregate values. This fixes PR2623. llvm-svn: 54331	2008-08-04 23:42:46 +00:00
Dan Gohman	6e023e63cd	Fix SDISel lowering of zeroinitializer and undef to use ComputeValueVTs. This allows it to work correctly on nested aggregate values. This fixes PR2625. llvm-svn: 54330	2008-08-04 23:30:41 +00:00
Dale Johannesen	c31eb205c1	Add a flag to disable jump table generation (all switches use the binary search algorithm) for environments that don't support it. PPC64 JIT is such an environment; turn the flag on for that. llvm-svn: 54248	2008-07-31 18:13:12 +00:00
Dan Gohman	345d63ccf2	Improve dagcombining for sext-loads and sext-in-reg nodes. llvm-svn: 54239	2008-07-31 00:50:31 +00:00
Dan Gohman	88e0df0c91	Move SelectionDAG::viewGraph() out of line; as an inline function it isn't always visible to gdb. llvm-svn: 54228	2008-07-30 18:48:53 +00:00
Dan Gohman	2fe4352691	Don't look for leaf values to store when lowering stores of empty structs. This fixes PR2612. llvm-svn: 54226	2008-07-30 18:36:51 +00:00
Owen Anderson	c818c01539	Use existing LiveInterval methods to simplify live interval merging. Thanks to Evan for pointing these out. llvm-svn: 54225	2008-07-30 18:27:35 +00:00
Owen Anderson	7b5f535590	Value numbers whose def index is a special sentinel value should not be remapped. llvm-svn: 54218	2008-07-30 17:42:47 +00:00
Owen Anderson	e9a0bae238	More fixes for corner cases when remapping live range indices. llvm-svn: 54186	2008-07-30 00:22:56 +00:00
Owen Anderson	1aebe49ae7	When merging live intervals, we also need to merge in any live ranges that are inputs to two-address instructions that themselves define a range we already care about. llvm-svn: 54185	2008-07-30 00:21:16 +00:00
Owen Anderson	6b1cc46fee	When merging a PHI operand's live interval into the PHI's live interval, we need to merge over all liveranges in the operand's interval that share the relevant value number, not just the range that immediately precedes the PHI. llvm-svn: 54174	2008-07-29 21:17:08 +00:00
Owen Anderson	2532e75933	Don't decrement the BB remap when we don't need to. llvm-svn: 54173	2008-07-29 21:15:44 +00:00
Duncan Sands	fa4120530e	Fix PR2609. If a label is deleted, then it needs to be marked invalid regardless of whether it is a debug, an exception handling or (hopefully) a GC label. llvm-svn: 54172	2008-07-29 20:56:02 +00:00
Nate Begeman	82f1925708	Fix broken CellSPU lowering, re-instate braces in Legalize llvm-svn: 54168	2008-07-29 19:07:27 +00:00
Nate Begeman	d63495ff25	Disable a fix in the previous patch, since it breaks CellSPU. The CellSPU codegen is broken, but needs to be fixed before we can put this back in. llvm-svn: 54164	2008-07-29 18:28:31 +00:00
Nate Begeman	fecbc8cff1	Add vector shifts to the IR, patch by Eli Friedman. CodeGen & Clang work coming next. llvm-svn: 54161	2008-07-29 15:49:41 +00:00
Dan Gohman	804c95df52	Fold the useful features of alist and alist_node into ilist, and a new ilist_node class, and remove them. Unlike alist_node, ilist_node doesn't attempt to manage storage itself, so it avoids the associated problems, including being opaque in gdb. Adjust the Recycler class so that it doesn't depend on alist_node. Also, change it to use explicit Size and Align parameters, allowing it to work when the largest-sized node doesn't have the greatest alignment requirement. Change MachineInstr's MachineMemOperand list from a pool-backed alist to a std::list for now. llvm-svn: 54146	2008-07-28 21:51:04 +00:00
Dan Gohman	24b3ce1db6	Fix a typo in a comment. llvm-svn: 54136	2008-07-28 18:43:51 +00:00
Dan Gohman	68e45a361b	Make the ScheduleDAG's GraphRoot edge be blue and dashed too, like the SelectionDAG's. llvm-svn: 54129	2008-07-27 22:46:49 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Dan Gohman	91e5dcb680	Tidy SDNode::use_iterator, and complete the transition to have it parallel its analogue, Value::value_use_iterator. The operator* method now returns the user, rather than the use. llvm-svn: 54127	2008-07-27 20:43:25 +00:00
Dan Gohman	bb5f43ed4d	Rename isOnlyUseOf to isOnlyUserOf. llvm-svn: 54124	2008-07-27 18:06:42 +00:00
Duncan Sands	d9374421ea	Some binary operations were being treated as unary operations! Add support for softening some additional unary operations like fp_to_sint. llvm-svn: 54122	2008-07-27 12:28:43 +00:00
Owen Anderson	54912b3e8d	Fix the issues originally addressed in r54070. After thinking about it some more, I realized that the right thing to do is to have StrongPHIElimination use its knowledge of the PHIs before they're erased to update the intervals appropriate. This is both simpler and more accurate than the alternative, which was having LIA figure it out when it renumbered things, plus it's just the right thing to do! llvm-svn: 54077	2008-07-25 23:38:08 +00:00
Owen Anderson	7a45b168ac	Revert my previous patch. In retrospect, this is completely the wrong way to fix this problem. llvm-svn: 54072	2008-07-25 23:06:59 +00:00
Owen Anderson	074f9db2fd	Special cases are needed in renumbering when dealing with renumbering after a PHI has been removed. The interval previously defined by the PHI needs to be extended to the beginning of its basic block, and the intervals that were inputs need to be trimmed to the end of their basic blocks. llvm-svn: 54070	2008-07-25 22:32:01 +00:00
Owen Anderson	0346aba5c2	In order to avoid reprocessing a register more than once, we need to add it to the handled set so it will get filtered out in future iterations. llvm-svn: 54065	2008-07-25 21:35:43 +00:00
Owen Anderson	d9c8711d70	Remove live interval entries for an interval if we're eliminating its only VN. llvm-svn: 54062	2008-07-25 21:08:41 +00:00
Owen Anderson	88499a3503	Properly remap live ranges whose end indices are the end of the function. llvm-svn: 54061	2008-07-25 21:07:13 +00:00
Owen Anderson	c7d53fd331	Make the remapping of interval indices (particularly ending indices) more robust. This is tricky business, and will probably take a few more iterations to get the last kinks out of it. llvm-svn: 54043	2008-07-25 19:50:48 +00:00
Dan Gohman	394ec3ab5a	Disable the new aggressive remat logic introduced in 54000; it causes some regressions, such as PR2595. Also, there is a significant code-quality issue in SPEC 464.h264ref and a few others. llvm-svn: 54014	2008-07-25 15:08:37 +00:00
Mon P Wang	7334350d31	When splitting a vector shuffle, fixed which type we used for the hi part llvm-svn: 54007	2008-07-25 01:30:26 +00:00
Dan Gohman	9268601d8a	Use AliasAnalysis::pointsToConstantMemory in SDISel to avoid unnecessary dependencies with constant load nodes. This allows them to be scheduled freely. llvm-svn: 54001	2008-07-25 00:04:14 +00:00
Dan Gohman	09b0448dbc	Enable rematerialization of constants using AliasAnalysis::pointsToConstantMemory, and knowledge of PseudoSourceValues. This unfortunately isn't sufficient to allow constants to be rematerialized in PIC mode -- the extra indirection is a complication. llvm-svn: 54000	2008-07-25 00:02:30 +00:00
Owen Anderson	79b66966b8	Store the predecessor MBB in the PHIUnion, rather than an index, since the indices will change after renumbering. llvm-svn: 53985	2008-07-24 17:12:16 +00:00
Owen Anderson	50d393a68d	Enable the insertion of empty indices into LiveInterals, thereby making renumbering possible. llvm-svn: 53961	2008-07-23 21:37:49 +00:00
Owen Anderson	7c800ad977	Fix a compile-time regression introduced by my heuristic-changing patch. I forgot to multiply the instruction count by a constant factor in a few places, which caused the register allocator to require many more iterations. llvm-svn: 53959	2008-07-23 19:47:27 +00:00
Dan Gohman	fa1211f69b	Enable first-class aggregates support. Remove the GetResultInst instruction. It is still accepted in LLVM assembly and bitcode, where it is now auto-upgraded to ExtractValueInst. Also, remove support for return instructions with multiple values. These are auto-upgraded to use InsertValueInst instructions. The IRBuilder still accepts multiple-value returns, and auto-upgrades them to InsertValueInst instructions. llvm-svn: 53941	2008-07-23 00:34:11 +00:00
Duncan Sands	775e509525	LegalizeTypes support for VSETCC. Fixes PR2575. llvm-svn: 53938	2008-07-22 23:54:03 +00:00
Owen Anderson	029182f3a3	Change the heuristics used in the coalescer, register allocator, and within live intervals itself to use an instruction count approximation that is not affected by inserting empty indices. llvm-svn: 53937	2008-07-22 22:46:49 +00:00
Evan Cheng	b8ff223f26	Fix pr2566: incorrect assumption about bit_convert. It doesn't not have to output a vector value. Patch by Nicolas Capens! llvm-svn: 53932	2008-07-22 20:42:56 +00:00
Dan Gohman	57c749294c	Make the GraphRoot edge look like a chain edge, which is more accurate, and use the right result number, in the off chance that the graph root has multiple result values. llvm-svn: 53923	2008-07-22 17:52:59 +00:00
Bill Wendling	9fe8b29012	Another buildbot test commit. llvm-svn: 53896	2008-07-22 00:53:37 +00:00
Bill Wendling	d07cee2e5c	Trivial check-in to test buildbot. No functionality change. llvm-svn: 53889	2008-07-22 00:28:47 +00:00
Dan Gohman	ebeccb44cf	Fix grammaros in comments. llvm-svn: 53884	2008-07-21 22:38:59 +00:00
Dan Gohman	f1dc362547	Enhance the GraphWriter support for edge destinations, and teach the SelectionDAG graph writer to make use of them. Now, nodes with multiple values are displayed as such, with incoming edges pointing to the specific value they use. llvm-svn: 53875	2008-07-21 21:06:55 +00:00
Dan Gohman	a6191cde79	After early-lowering the FORMAL_ARGUMENTS node, delete it. llvm-svn: 53874	2008-07-21 21:04:07 +00:00
Dan Gohman	581cc87f57	Add titles to the various SelectionDAG viewGraph calls that include useful information like the name of the block being viewed and the current phase of compilation. llvm-svn: 53872	2008-07-21 20:00:07 +00:00
Dan Gohman	8c08a692ee	Fix uses of underscore-capital names. llvm-svn: 53870	2008-07-21 19:48:15 +00:00
Dan Gohman	3e9ad4d8e6	Now that the MachineInstr leaks are fixed, enable leak checking in the MachineInstr clone code. llvm-svn: 53868	2008-07-21 18:47:29 +00:00
Duncan Sands	b0e3938651	Add VerifyNode, a place to put sanity checks on generic SDNode's (nodes with their own constructors should do sanity checking in the constructor). Add sanity checks for BUILD_VECTOR and fix all the places that were producing bogus BUILD_VECTORs, as found by "make check". My favorite is the BUILD_VECTOR with only two operands that was being used to build a vector with four elements! llvm-svn: 53850	2008-07-21 10:20:31 +00:00
Bill Wendling	1071bfb17d	Pull r53795 from Gaz into mainline: If .loc and .file aren't used, always emit the "debug_line" section. This requires at least one entry in the line matrix. So if there's nothing to emit into the matrix, emit an end of matrix value anyway. llvm-svn: 53803	2008-07-20 00:11:19 +00:00
Evan Cheng	a7a20c4946	Fix a memory leak in LiveIntervalAnalysis. llvm-svn: 53779	2008-07-19 00:37:25 +00:00
Duncan Sands	6b418e750d	Softfloat support for FDIV. Patch by Richard Pennington. llvm-svn: 53773	2008-07-18 21:18:48 +00:00
Duncan Sands	694228b47d	Eliminate unused variable. llvm-svn: 53772	2008-07-18 21:07:41 +00:00
Duncan Sands	32e387c461	Revert 53729, after waking up in the middle of the night realising that it was wrong :) I think the reason the same type was being used for the shufflevec of indices as for the actual indices is so that if one of them needs splitting then so does the other. After my patch it might be that the indices need splitting but not the rest, yet there is no good way of handling that. I think the right solution is to not have the shufflevec be an operand at all: just have it be the list of numbers it actually is, stored as extra info in the node. llvm-svn: 53768	2008-07-18 20:12:05 +00:00
Dan Gohman	597bd1633e	Fix a LocalSpiller leak. This fixes tramp3d-v4. llvm-svn: 53766	2008-07-18 18:28:56 +00:00
Dan Gohman	0ece943845	Re-introduce LeakDetector support for MachineInstrs and MachineBasicBlocks. Fix a leak that this turned up in LowerSubregs.cpp. And, comment a leak in LiveIntervalAnalysis.cpp. llvm-svn: 53746	2008-07-17 23:49:46 +00:00
Dan Gohman	7168de7872	When printing MemOperand nodes, only use print() for PseudoSourceValue values, which never have names. Use getName() for all other values, because we want to print just a short summary of the value, not the entire instruction. llvm-svn: 53738	2008-07-17 21:12:16 +00:00
Evan Cheng	cefd6e62fa	Subreg live interval valno may not have a corresponding def machineinstr since it's less precise. llvm-svn: 53734	2008-07-17 19:48:53 +00:00
Duncan Sands	656b256a1a	Use a legal type for elements of the vector_shuffle mask. These are just indices into the shuffled vector so their type is unrelated to the type of the shuffled elements (which is what was being used before). This fixes vec_shuffle-11.ll when using LegalizeTypes. What seems to have happened is that Dan's recent change r53687, which corrected the result type of the shuffle, somehow caused LegalizeTypes to notice that the mask operand was a BUILD_VECTOR with a legal type but elements of an illegal type (i64). LegalizeTypes legalized this by introducing a new BUILD_VECTOR of i32 and bitcasting it to the old type. But the mask operand is not supposed to be a bitcast but a straight BUILD_VECTOR of constants, causing a crash. llvm-svn: 53729	2008-07-17 19:28:41 +00:00
Dan Gohman	1705968102	Add a new function, ReplaceAllUsesOfValuesWith, which handles bulk replacement of multiple values. This is slightly more efficient than doing multiple ReplaceAllUsesOfValueWith calls, and theoretically could be optimized even further. However, an important property of this new function is that it handles the case where the source value set and destination value set overlap. This makes it feasible for isel to use SelectNodeTo in many very common cases, which is advantageous because SelectNodeTo avoids a temporary node and it doesn't require CSEMap updates for users of values that don't change position. Revamp MorphNodeTo, which is what does all the work of SelectNodeTo, to handle operand lists more efficiently, and to correctly handle a number of corner cases to which its new wider use exposes it. This commit also includes a change to the encoding of post-isel opcodes in SDNodes; now instead of being sandwiched between the target-independent pre-isel opcodes and the target-dependent pre-isel opcodes, post-isel opcodes are now represented as negative values. This makes it possible to test if an opcode is pre-isel or post-isel without having to know the size of the current target's post-isel instruction set. These changes speed up llc overall by 3% and reduce memory usage by 10% on the InstructionCombining.cpp testcase with -fast and -regalloc=local. llvm-svn: 53728	2008-07-17 19:10:17 +00:00
Duncan Sands	7e5edf1a1f	LegalizeTypes support for what seems to be the only missing ppc long double operations: FNEG and FP_EXTEND. llvm-svn: 53723	2008-07-17 17:35:14 +00:00
Duncan Sands	d9256a7ceb	Turn LegalizeTypes back off again for the moment: it is breaking Darwin bootstrap due to missing functionality. llvm-svn: 53721	2008-07-17 17:06:03 +00:00
Duncan Sands	77a3d05f1e	Factorize some code for determining which libcall to use. llvm-svn: 53713	2008-07-17 02:36:29 +00:00
Dan Gohman	2714059079	Fix the result type of a VECTOR_SHUFFLE+BIT_CONVERT dagcombine. This was turned up by some new SelectionDAG assertion checks that I'm working on. llvm-svn: 53687	2008-07-16 16:13:58 +00:00
Duncan Sands	2d28e281e9	Add support for promoting and expanding AssertZext and AssertSext. Needed when passing huge integer parameters with the zeroext or signext attributes. llvm-svn: 53684	2008-07-16 16:03:07 +00:00
Dan Gohman	26ffe2bea6	Fix a comment to say nonnegative instead of positive. llvm-svn: 53681	2008-07-16 15:57:10 +00:00
Dan Gohman	bf98f68265	Add an assert to check for empty flags for MachineMemOperand. llvm-svn: 53680	2008-07-16 15:56:42 +00:00
Duncan Sands	e766b4230e	Reorder methods alphabetically. No functionality change. While this is not a wonderful organizing principle, it does make it easy to find routines, and clear where to insert new ones. llvm-svn: 53672	2008-07-16 11:41:33 +00:00
Duncan Sands	c359055fa9	Turn on LegalizeTypes by default. llvm-svn: 53671	2008-07-16 11:36:51 +00:00
Dan Gohman	1e5aa12b7d	SelectionDAG::AssignNodeIds is unused. llvm-svn: 53636	2008-07-15 18:29:32 +00:00
Dan Gohman	1d093846b5	Don't sort SDNodes by their addresses in SelectionDAG::dump. Instead, just use the AllNodes order, which is at least relatively stable across runs. llvm-svn: 53632	2008-07-15 18:18:54 +00:00
Duncan Sands	0f1a1cdcf8	LegalizeTypes support for fabs on ppc long double. llvm-svn: 53613	2008-07-15 15:02:44 +00:00
Duncan Sands	6162e0377c	LegalizeTypes support for promotion of bswap. In LegalizeDAG the value is zero-extended to the new type before byte swapping. It doesn't matter how the extension is done since the new bits are shifted off anyway after the swap, so extend by any old rubbish bits. This results in the final assembler for the testcase being one line shorter. llvm-svn: 53604	2008-07-15 10:18:22 +00:00
Duncan Sands	202225cdf8	LegalizeTypes support for promotion of SIGN_EXTEND_INREG. llvm-svn: 53603	2008-07-15 10:14:24 +00:00
Duncan Sands	b9b5a671d3	Reorder the integer promotion methods alphabetically. No change in functionality. llvm-svn: 53602	2008-07-15 10:12:34 +00:00
Mon P Wang	97432f4f1b	Fixed potential bug if the source and target of a bit convert have different alignment llvm-svn: 53590	2008-07-15 05:28:34 +00:00
Dan Gohman	adec96f438	Reapply 53476 and 53480, with a fix so that it properly updates the BB member to the current basic block after emitting instructions. llvm-svn: 53567	2008-07-14 18:19:29 +00:00
Dan Gohman	e7c8387616	Improve debug output for MemOperandSDNode. PseudoSourceValue nodes don't have value names, so use print instead of getName() to get a useful string. llvm-svn: 53563	2008-07-14 17:51:24 +00:00
Dan Gohman	793357b115	Fix edito in the PseudoSourceValue name list. llvm-svn: 53562	2008-07-14 17:45:47 +00:00
Duncan Sands	673cf1836b	I don't think BUILD_PAIR can have a vector result. Remove support for this. llvm-svn: 53559	2008-07-14 17:34:19 +00:00
Duncan Sands	0ca9a38f68	Tighten up some checks. Fix FPOWI splitting for non-power-of-two vectors. llvm-svn: 53558	2008-07-14 17:33:37 +00:00
Duncan Sands	a30cbd9797	An INSERT_VECTOR_ELT can insert a larger value than the vector element type. Don't forget to handle this when the insertion index is not a constant. llvm-svn: 53556	2008-07-14 17:32:02 +00:00
Duncan Sands	693185bcee	According to the docs, it is possible to have an extending load of a vector. Handle this case when splitting vector loads. I'm not completely sure what is supposed to happen, but I think it means hi should be set to undef. LegalizeDAG does not consider this case. llvm-svn: 53555	2008-07-14 17:27:46 +00:00
Duncan Sands	b766084cb0	There should be no extending loads or truncating stores of one-element vectors. Also, neaten the handling of INSERT_VECTOR_ELT when the inserted type is larger than the vector element type. llvm-svn: 53554	2008-07-14 17:22:31 +00:00
Duncan Sands	d47d2d6b12	Ignore TargetConstant with an illegal type. These are used for passing huge immediates in inline ASM from the front-end straight down to the ASM writer. Of course this is a hack, but it is simple, limited in scope, works in practice, and is what LegalizeDAG does. llvm-svn: 53553	2008-07-14 17:15:45 +00:00
Evan Cheng	2b3c52d5c4	Typos. llvm-svn: 53504	2008-07-12 02:22:07 +00:00
Evan Cheng	e0a352e8e7	Fix PR2536: a nasty spiller bug. If a two-address instruction uses a register but the use portion of its live range is not part of its liveinterval, it must be defined by an implicit_def. In that case, do not spill the use. e.g. 8 %reg1024<def> = IMPLICIT_DEF 12 %reg1024<def> = INSERT_SUBREG %reg1024<kill>, %reg1025, 2 The live range [12, 14) are not part of the r1024 live interval since it's defined by an implicit def. It will not conflicts with live interval of r1025. Now suppose both registers are spilled, you can easily see a situation where both registers are reloaded before the INSERT_SUBREG and both target registers that would overlap. llvm-svn: 53503	2008-07-12 01:56:02 +00:00
Evan Cheng	ef8412c822	Back out 53476 and 53480 for now. Somehow they cause llc to miscompile 179.art. llvm-svn: 53502	2008-07-12 01:38:51 +00:00
Dan Gohman	02c7c6cb33	Include a frame index in the "fixed stack" pseudo source value instead of using the frame index for the SVOffset, which was inconsistent. llvm-svn: 53486	2008-07-11 22:44:52 +00:00
Dan Gohman	ed087a62dc	Fix an obsolete top-level comment. llvm-svn: 53481	2008-07-11 22:39:58 +00:00
Dan Gohman	f4cd404e6f	Factor out debugging code into the common base class. llvm-svn: 53480	2008-07-11 22:36:22 +00:00
Dan Gohman	36a69373dc	Add support for putting NamedRegionTimers in TimerGroups, and use a timer group for the timers in SelectionDAGISel. Also, Split scheduling out from emitting, to give each their own timer. llvm-svn: 53476	2008-07-11 21:54:34 +00:00
Dan Gohman	0597e5b697	Trim unnecessary #includes. llvm-svn: 53471	2008-07-11 20:38:31 +00:00
Duncan Sands	121641d601	Remove an apparently useless routine: there should be no need to split the result of a vector RET node, since they are always already legal. llvm-svn: 53462	2008-07-11 17:02:09 +00:00
Duncan Sands	3e7d0fa3ca	It is pointless to turn a UINT_TO_FP into an SINT_TO_FP libcall plus additional operations: it might as well be a direct UINT_TO_FP libcall. So only turn it into an SINT_TO_FP if the target has special handling for SINT_TO_FP. llvm-svn: 53461	2008-07-11 17:00:14 +00:00
Duncan Sands	37b7322b35	Add two missing SINT_TO_FP libcalls. llvm-svn: 53460	2008-07-11 16:57:02 +00:00
Duncan Sands	d9948110a6	Port a shift-by-1 optimization from LegalizeDAG: it was presumably added after the rest of the code was copied to LegalizeTypes. llvm-svn: 53459	2008-07-11 16:54:57 +00:00
Duncan Sands	927a3648d5	Add support for 128 bit shifts and 32 bit shifts on 16 bit machines. llvm-svn: 53458	2008-07-11 16:52:29 +00:00
Chris Lattner	87909d0629	Fix a bug in the soft-float handling of FCOPYSIGN that Duncan noticed when working on legalizetypes. Both legalizetypes and legalizeops now produce hte same code for CodeGen/ARM/fcopysign.ll. llvm-svn: 53435	2008-07-10 23:46:13 +00:00
Chris Lattner	17b234cf9b	make legalize types be a command line option: -enable-legalize-types. llvm-svn: 53434	2008-07-10 23:37:50 +00:00
Dan Gohman	7ce10037c4	Make stack slot coloring's debug output more consistent with other passes. llvm-svn: 53415	2008-07-10 19:49:32 +00:00
Evan Cheng	45fdeb6c3f	Change StackSlotForVirtReg (which maps vregs to frame indices) from std::map to IndexedMap. llvm-svn: 53414	2008-07-10 18:23:23 +00:00
Duncan Sands	abdcac66dc	Add support for 128 bit multiplicative operations. Lack of these caused a bootstrap failure with Fortran on x86-64 with LegalizeTypes turned on. While there, be nice to 16 bit machines and support expansion of i32 too. llvm-svn: 53408	2008-07-10 15:35:05 +00:00
Duncan Sands	5e6d1402c2	Add a mysteriously missing libcall, FPTOSINT_F80_I32. Be nice to 16 bit machines by supporting FP_TO_XINT expansion for these. llvm-svn: 53407	2008-07-10 15:33:02 +00:00
Duncan Sands	303524be58	Fix a FIXME: use an apint in CTTZ legalization. llvm-svn: 53406	2008-07-10 15:30:54 +00:00
Duncan Sands	e78352a125	Remove PromoteIntRes_FP_ROUND - not sure what it was doing there: FP_ROUND returns a float, not an integer. llvm-svn: 53405	2008-07-10 15:29:55 +00:00
Duncan Sands	4ac3984fc5	Make sure the alignment of the temporary created in CreateStackStoreLoad is good enough for both the source and destination types. llvm-svn: 53404	2008-07-10 15:26:17 +00:00
Duncan Sands	d4c09df689	Make the LegalizeType method naming scheme more regular. llvm-svn: 53403	2008-07-10 15:25:04 +00:00
Duncan Sands	74f23ff45c	Don't barf when dumping a constant that contains a ginormous value (eg: i128 -1). llvm-svn: 53402	2008-07-10 11:23:14 +00:00
Evan Cheng	e9ba28dd68	- Change the horrible N^2 isRegReDefinedByTwoAddr. Now callers must supply the operand index of def machineoperand and at most one full scan of non-implicit operands is needed. - Change local register allocator to use the new isRegReDefinedByTwoAddr instead of reinventing the wheel. llvm-svn: 53394	2008-07-10 07:35:43 +00:00
Owen Anderson	04a77c2492	Use DenseMap instead of std::map in local register allocation. This improves the time on instcombine from .31s to .22s llvm-svn: 53390	2008-07-10 01:56:35 +00:00
Owen Anderson	20f41dac8d	Fix 403.gcc. Finally got the check for two-address-ness correct. llvm-svn: 53389	2008-07-10 01:53:01 +00:00
Owen Anderson	be2e9a4447	Revert r53367, which was breaking things. llvm-svn: 53378	2008-07-09 23:09:10 +00:00
Dan Gohman	7d94c49db9	Simplify hasNUsesOfValue and hasAnyUsesOfValue even more. This makes their special-case checks of use_size() less beneficial, so remove them. This eliminates all but one use of use_size(), which is in AssignTopologicalOrder, which uses it only once for each node, and so can reasonably afford to recompute it, as this allows the UsesSize field of SDNode to be removed altogether. llvm-svn: 53377	2008-07-09 23:03:14 +00:00
Dan Gohman	7a510c2990	hasAnyUseOfValue can check SDUse nodes of its users directly instead of examining every operand of every user. llvm-svn: 53374	2008-07-09 22:39:01 +00:00
Dan Gohman	db4504fa57	Move MemoryVT out of LSBaseNode into MemSDNode, allowing the getMemOperand function to be moved into the base class as well and made non-virtual. llvm-svn: 53372	2008-07-09 22:08:04 +00:00
Evan Cheng	1787443028	Avoid creating expensive comment string if it's not going to be printed. llvm-svn: 53369	2008-07-09 21:53:02 +00:00
Owen Anderson	d3736ca1e0	Loosen our check here. Local regalloc only cares that the reg is used and def'd by the same instruction, but about the details of the relationship. llvm-svn: 53367	2008-07-09 21:34:36 +00:00
Dan Gohman	89e71d48b8	Move the IsVolatile and SVOffset fields into the MemSDNode base class, and store IsVolatile and Alignment in a more compact form. This makes AtomicSDNode slightly larger, but it shrinks LoadSDNode and StoreSDNode, which are much more common and are the largest of the SDNode subclasses. Also, this lets the isVolatile() and getAlignment() accessors be non-virtual. llvm-svn: 53361	2008-07-09 21:23:02 +00:00
Owen Anderson	b42ed21894	Don't use an expensive check for two-address-ness when we have the information sitting around to determine it much more quickly, This speeds up the local register allocator from 0.37s to 0.31s on instcombine. llvm-svn: 53359	2008-07-09 21:15:10 +00:00
Owen Anderson	a0bc522466	Factor local liveness computation out into its own function. llvm-svn: 53352	2008-07-09 20:14:53 +00:00
Dan Gohman	70aa89d215	Reuse the MO variable instead of recomputing it in RegAllocLocal. Keep RegAllocSimple in sync. llvm-svn: 53351	2008-07-09 20:12:26 +00:00
Dan Gohman	d0a33a9270	Give RegAllocSimple a TargetInstrInfo member to keep it consistent with RegAllocLocal. llvm-svn: 53347	2008-07-09 19:56:01 +00:00
Dan Gohman	8ab08642ee	RegAllocLocal has a TargetInstrInfo data member. Use it instead of having local variables duplicate it. llvm-svn: 53346	2008-07-09 19:55:19 +00:00
Dan Gohman	8a95073098	Use find with std::map, when that's what's needed, instead of lower_bound with extra checks. llvm-svn: 53344	2008-07-09 19:51:00 +00:00
Anton Korobeynikov	fe047d241c	Switch to new section name handling facility llvm-svn: 53316	2008-07-09 13:27:16 +00:00
Duncan Sands	37ab611e8e	Remove some unneeded includes. llvm-svn: 53289	2008-07-09 12:08:25 +00:00
Duncan Sands	5e266c914a	Redo LegalizeTypes soft float support for SINT_TO_FP and UINT_TO_FP. This now produces the same code as LegalizeDAG (the previous code was based on a mistaken idea of what LegalizeDAG did in this case). llvm-svn: 53288	2008-07-09 12:07:22 +00:00
Duncan Sands	b9e63db718	Forgot to update the chain result when softening loads. llvm-svn: 53287	2008-07-09 11:15:31 +00:00
Duncan Sands	ed811f0ec1	LegalizeTypes soft float support for FP_TO_SINT and FP_TO_UINT. llvm-svn: 53286	2008-07-09 11:13:46 +00:00
Duncan Sands	8090f8576f	LegalizeTypes support for powi soft float. llvm-svn: 53285	2008-07-09 11:11:47 +00:00
Duncan Sands	c52d3bf646	Make the role of MVT::i32 clearer here, and add a note since it is not clear whether it is correct. llvm-svn: 53284	2008-07-09 08:07:41 +00:00
Evan Cheng	7898e98026	Missed alignment argument on stores lowered from memcpy. llvm-svn: 53281	2008-07-09 06:38:06 +00:00
Bill Wendling	88d2506ae2	Make the DICountVisitor not a visitor. This keeps us from calling virtual functions and junk. llvm-svn: 53279	2008-07-09 06:02:33 +00:00
Dan Gohman	919936815e	const-ify SelectionDAG::getNodeValueTypes. llvm-svn: 53264	2008-07-09 00:00:42 +00:00
Dan Gohman	f188fa4499	It's no longer necessary to test if a MachineBasicBlock's parent is non-null. It now always is. llvm-svn: 53263	2008-07-08 23:59:09 +00:00
Dan Gohman	8293650d90	Verify that MachineMemOperand alignment is a non-zero power of 2. llvm-svn: 53262	2008-07-08 23:47:04 +00:00
Dan Gohman	e8d8d2ea42	Factor out the code for computing an alignment value, and make it available to getAtomic in addition to just getLoad and getStore, to prevent MachineMemOperands with 0 alignment. llvm-svn: 53261	2008-07-08 23:46:32 +00:00
Owen Anderson	27b8a21dfd	Fix the build. Apparently MachineInstr& is no longer implicitly convertable to MachineBasicBlock::iterator. llvm-svn: 53260	2008-07-08 23:36:37 +00:00
Owen Anderson	45d4475fe5	Make the local register allocator compute (purely local) liveness information for itself rather than depending on LiveVariables. This decreases compile time from: 0.5909s (LV + Regalloc) to 0.421s (just regalloc). llvm-svn: 53256	2008-07-08 22:24:50 +00:00
Dale Johannesen	45a4ec1a27	Remove some dead code. llvm-svn: 53253	2008-07-08 21:53:43 +00:00
Evan Cheng	34ef1db87c	Do not CSE DEBUG_LOC, DBG_LABEL, DBG_STOPPOINT, DECLARE, and EH_LABEL SDNode's. This improves compile time slightly at -O0 -g. llvm-svn: 53246	2008-07-08 20:06:39 +00:00
Duncan Sands	0797e5bf05	Remove custom expansion from LegalizeTypes when doing soft float: experiments show that targets aren't expecting this for results or for operands. Add support select/select_cc result soft float and correct operand soft float for these. llvm-svn: 53245	2008-07-08 20:03:24 +00:00
Duncan Sands	360d689db3	Add missing select_cc libcall line, somehow omitted in LegalizeTypes. llvm-svn: 53244	2008-07-08 20:00:05 +00:00
Evan Cheng	0a1e672dff	Unbreak C++ tests on x86 Darwin. llvm-svn: 53237	2008-07-08 16:40:43 +00:00
Duncan Sands	12525efdfc	LegalizeTypes support for FP_ROUND and FP_EXTEND soft float. llvm-svn: 53231	2008-07-08 10:50:55 +00:00
Evan Cheng	534952224c	Avoid unnecessary string construction during asm printing. llvm-svn: 53215	2008-07-08 00:55:58 +00:00
Dan Gohman	3b46030375	Pool-allocation for MachineInstrs, MachineBasicBlocks, and MachineMemOperands. The pools are owned by MachineFunctions. This drastically reduces the number of calls to malloc/free made during the "Emit" phase of scheduling, as well as later phases in CodeGen. Combined with other changes, this speeds up the "instruction selection" phase of CodeGen by 10% in some cases. llvm-svn: 53212	2008-07-07 23:14:23 +00:00
Dan Gohman	7f8b6d5f80	Pool-allocation for SDNodes. The pool is allocated once for each function, and reused across SelectionDAGs. This drastically reduces the number of calls to malloc/free made during instruction selection, and improves memory locality. llvm-svn: 53211	2008-07-07 23:02:41 +00:00
Bill Wendling	b46e5165bf	Use the canonical way to get an empty structure. llvm-svn: 53206	2008-07-07 21:41:57 +00:00
Bill Wendling	1214860a78	Use StringMap for greater justice! llvm-svn: 53202	2008-07-07 20:59:31 +00:00
Dan Gohman	9169763955	Fix SDNode::MorphNodeTo (a function used by by SelectNodeTo) to properly track dead nodes that are on the original SDNode's operand list but not the new one, and have no other uses. llvm-svn: 53201	2008-07-07 20:57:48 +00:00
Dan Gohman	aedb4a61b8	Move MachineMemOperand's constructor out of line, to avoid a #include dependency on Support/MathExtras.h in the header file. llvm-svn: 53200	2008-07-07 20:32:02 +00:00
Dan Gohman	14464bc61c	Use of operator* is redundant and confusing here. llvm-svn: 53197	2008-07-07 20:08:05 +00:00
Dan Gohman	c7fc432b19	Minor const-correctness fixes. llvm-svn: 53196	2008-07-07 20:06:06 +00:00
Dan Gohman	14ce7d1eba	Assert that all MachineInstrs update PhysRegUseDefLists in their cleanup code. llvm-svn: 53194	2008-07-07 19:55:35 +00:00
Dan Gohman	768f2c9246	Remove most of the uses of SDOperandPtr, usually replacing it with a simple const SDOperand, which is what's usually needed. For AddNodeIDOperands, which is small, just duplicate the function to accept an SDUse. For SelectionDAG::getNode - Add an overload that accepts SDUse* that copies the operands into a temporary SDOperand array, but also has special-case checks for 0 through 3 operands to avoid the copy in the common cases. llvm-svn: 53183	2008-07-07 18:26:29 +00:00
Dan Gohman	56e3f63ec5	Add explicit keywords. llvm-svn: 53179	2008-07-07 18:00:37 +00:00
Dan Gohman	38740a98b2	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Evan Cheng	d8b83e1292	LegalizeSetCCOperands should legalize the result of ExpandLibCall. Patch by Richard Osborne. llvm-svn: 53169	2008-07-07 07:18:09 +00:00
Bill Wendling	ecf34435f4	Prevent option name conflict. llvm-svn: 53166	2008-07-07 05:42:27 +00:00
Duncan Sands	2fa6cf5c2f	LegalizeTypes soft-float support for stores of a float value. llvm-svn: 53165	2008-07-07 00:08:12 +00:00
Mon P Wang	5c755ff51b	Fixed generating incorrect aligned stores that I backout of r53031 that fixed problems in EmitStackConvert where the source and target type have different alignment by creating a stack slot with the max alignment of source and target type. llvm-svn: 53150	2008-07-05 20:40:31 +00:00
Duncan Sands	93e180342a	Rather than having a different custom legalization hook for each way in which a result type can be legalized (promotion, expansion, softening etc), just use one: ReplaceNodeResults, which returns a node with exactly the same result types as the node passed to it, but presumably with a bunch of custom code behind the scenes. No change if the new LegalizeTypes infrastructure is not turned on. llvm-svn: 53137	2008-07-04 11:47:58 +00:00
Duncan Sands	04fb6bf468	Linux also does not require exception handling moves in order to get correct debug info. Since I can't imagine how any target could possibly be any different, I've just stripped out the option: now all the world's like Darwin! llvm-svn: 53134	2008-07-04 09:55:48 +00:00
Bill Wendling	4bb9089db7	Don't return std::vector by value, but pass it in by reference to be filled. llvm-svn: 53123	2008-07-03 23:13:02 +00:00
Bill Wendling	2e50689435	Revert my previous check-in that split up MachineModuleInfo. It turns out to slow the compiler down at -O0 some 30% or more. Ooops. llvm-svn: 53120	2008-07-03 22:53:42 +00:00
Evan Cheng	fad8be450d	Backed out 53031. llvm-svn: 53110	2008-07-03 18:20:14 +00:00
Evan Cheng	7d98a48f15	- Remove calls to copyKillDeadInfo which is an N^2 function. Instead, propagate kill / dead markers as new instructions are constructed in foldMemoryOperand, convertToThressAddress, etc. - Also remove LiveVariables::instructionChanged, etc. Replace all calls with cheaper calls which update VarInfo kill list. llvm-svn: 53097	2008-07-03 09:09:37 +00:00
Dan Gohman	b261292917	Reapply r52988, "Simplify addRegisterKilled and addRegisterDead." The 254.gap failure was not due to this mod. llvm-svn: 53068	2008-07-03 01:18:51 +00:00
Dan Gohman	f3c4d7f877	Avoid unnecessarily copying APInt objects. llvm-svn: 53065	2008-07-03 00:52:03 +00:00
Evan Cheng	9f8b66f3f1	Use std::replace instead of std::find and push_back. llvm-svn: 53063	2008-07-03 00:28:27 +00:00
Evan Cheng	7a265d83bf	- Add LiveVariables::replaceKillInstruction. This does a subset of instructionChanged. That is, it only update the VarInfo.kills if the new instruction is known to have the correct dead and kill markers. - CommuteInstruction copies kill / dead markers over to new instruction. So use replaceKillInstruction instead. llvm-svn: 53061	2008-07-03 00:07:19 +00:00
Owen Anderson	30cc028e4a	Make LiveVariables even more optional, by making it optional in the call to TargetInstrInfo::convertToThreeAddressInstruction Also, if LV isn't around, then TwoAddr doesn't need to be updating flags, since they won't have been set in the first place. llvm-svn: 53058	2008-07-02 23:41:07 +00:00
Dan Gohman	22e9707480	Replace a few uses of SelectionDAG::getTargetNode with SelectionDAG::SelectNodeTo in the instruction selector. This updates existing nodes in place instead of creating new ones. Go back to selecting ISD::DBG_LABEL nodes into TargetInstrInfo::DBG_LABEL nodes instead of leaving them unselected, now that SelectNodeTo allows us to update them in place. llvm-svn: 53057	2008-07-02 23:23:19 +00:00
Dan Gohman	1b46bfecfe	Revert r52988. It broke 254.gap on x86-64. llvm-svn: 53050	2008-07-02 22:12:55 +00:00
Owen Anderson	8c10c2482a	TwoAddressInstructionPass doesn't really require LiveVariables, it just needs to update it if it's already around. llvm-svn: 53049	2008-07-02 21:28:58 +00:00
Duncan Sands	739a0548c4	Add a new getMergeValues method that does not need to be passed the list of value types, and use this where appropriate. Inappropriate places are where the value type list is already known and may be long, in which case the existing method is more efficient. llvm-svn: 53035	2008-07-02 17:40:58 +00:00
Mon P Wang	4b7c1acf26	Fixed problem in EmitStackConvert where the source and target type have different alignment by creating a stack slot with the max alignment of source and target type. llvm-svn: 53031	2008-07-02 17:07:12 +00:00
Chris Lattner	6b2c4f6143	instead of aborting on shifts of i1, just implicitly fold them. The dag combiner can produce a shift of i1 when folding icmp i1's. llvm-svn: 53030	2008-07-02 17:01:57 +00:00
Duncan Sands	d353c265ff	Fix typo compounded by a cut-and-pasto. llvm-svn: 53012	2008-07-02 10:03:53 +00:00
Duncan Sands	ed283c49d5	Let AnalyzeNewNode take care of calling ExpungeNode. This makes sure that all new nodes are expunged, not just those the top node of a new subtree. llvm-svn: 53011	2008-07-02 09:56:41 +00:00
Evan Cheng	7e4abde27c	- Use a faster priority comparison function if -fast. - Code clean up. llvm-svn: 53010	2008-07-02 09:23:51 +00:00
Chris Lattner	bedd1b2427	Add a new (simple) StringMap::clear method, patch by Pratik Solanki! llvm-svn: 53008	2008-07-02 05:26:32 +00:00
Bill Wendling	536fb95321	Use the canonical form for getting an empty structure. llvm-svn: 53003	2008-07-02 00:50:02 +00:00
Bill Wendling	82a9321f56	Sorry. I couldn't sleep at night knowing I put these ugly casts into the source tree. llvm-svn: 53001	2008-07-02 00:35:47 +00:00
Bill Wendling	b7bd02be57	Darwin doesn't need exception handling information for the "move" info when debug information is being output, because it's leet! llvm-svn: 52994	2008-07-01 23:34:48 +00:00
Evan Cheng	c963f6c14b	Avoid creating expensive comment string if it's not going to be printed. llvm-svn: 52992	2008-07-01 23:18:29 +00:00
Owen Anderson	501f207bdf	No need to use std::distance. We can just count the number of operands much more cheaply. llvm-svn: 52990	2008-07-01 22:34:11 +00:00
Evan Cheng	f3202a6375	Simplify addRegisterKilled and addRegisterDead. llvm-svn: 52988	2008-07-01 22:21:21 +00:00
Bill Wendling	c8cdb883df	- Update comments. - Don't use GlobalVariable::LinkageTypes when unsigned works. llvm-svn: 52987	2008-07-01 22:08:01 +00:00
Dale Johannesen	ad6b3a6ed2	Fix longstanding thinko: don't exclude predessors of exit blocks from tail merging consideration. llvm-svn: 52985	2008-07-01 21:50:14 +00:00
Evan Cheng	4c609abd90	Eliminate a compile time warning. llvm-svn: 52982	2008-07-01 21:35:46 +00:00
Owen Anderson	1d952533c2	Add a version of AsmPrinter::EOL that takes a const char* so that we don't have to do as many implicit std::string constructions. Unfortunately, this doesn't appear to translate to a real speedup in practice. llvm-svn: 52981	2008-07-01 21:16:27 +00:00
Evan Cheng	33696cd9cf	Do run ComputeLiveOutVRegInfo with -fast. llvm-svn: 52975	2008-07-01 18:15:04 +00:00
Evan Cheng	2c9773155a	Do not use computationally expensive scheduling heuristics with -fast. llvm-svn: 52971	2008-07-01 18:05:03 +00:00
Evan Cheng	fb2573554c	Apply Chris' suggestion. llvm-svn: 52970	2008-07-01 17:59:20 +00:00
Dan Gohman	b58aff4858	Minimize duplicated code in AsmPrinter::printLabel. llvm-svn: 52944	2008-07-01 00:16:26 +00:00
Dan Gohman	fb19f9402b	Split ISD::LABEL into ISD::DBG_LABEL and ISD::EH_LABEL, eliminating the need for a flavor operand, and add a new SDNode subclass, LabelSDNode, for use with them to eliminate the need for a label id operand. Change instruction selection to let these label nodes through unmodified instead of creating copies of them. Teach the MachineInstr emitter how to emit a MachineInstr directly from an ISD label node. This avoids the need for allocating SDNodes for the label id and flavor value, as well as SDNodes for each of the post-isel label, label id, and label flavor. llvm-svn: 52943	2008-07-01 00:05:16 +00:00
Evan Cheng	819b770868	Suppress compiler warning. llvm-svn: 52934	2008-06-30 22:33:56 +00:00
Evan Cheng	6a323e16f2	Don't run stack slot coloring if -fast. llvm-svn: 52933	2008-06-30 22:33:16 +00:00
Dan Gohman	e09a1c88cf	Use a simpler but equivalent form of RecordSource. llvm-svn: 52931	2008-06-30 22:21:03 +00:00
Evan Cheng	0d3628946f	Add timing report for various sub-passes under SelectionDAGISel. llvm-svn: 52930	2008-06-30 22:10:09 +00:00
Dan Gohman	6896901e2c	std::ostream and std::string microoptimizations for asm printing. llvm-svn: 52929	2008-06-30 22:03:41 +00:00
Dan Gohman	a76e60a77a	Use reserve. SelectionDAG::allnodes_size is linear, but that doesn't appear to outweigh the benefit of reducing heap traffic. If it does become a problem, we should teach SelectionDAG to keep a count of how many nodes are live, because there are several other places where that information would be useful as well. llvm-svn: 52926	2008-06-30 21:04:06 +00:00
Dan Gohman	5c73a886b4	Rename ISD::LOCATION to ISD::DBG_STOPPOINT to better reflect its purpose, and give it a custom SDNode subclass so that it doesn't need to have line number, column number, filename string, and directory string, all existing as individual SDNodes to be the operands. This was the only user of ISD::STRING, StringSDNode, etc., so remove those and some associated code. This makes stop-points considerably easier to read in -view-legalize-dags output, and reduces overhead (creating new nodes and copying std::strings into them) on code containing debugging information. llvm-svn: 52924	2008-06-30 20:59:49 +00:00
Evan Cheng	0711d68fa7	Split scheduling from instruction selection. llvm-svn: 52923	2008-06-30 20:45:06 +00:00
Dale Johannesen	659aeb6186	No need to align the stack if there are no stack objects. Fixes a couple of tests on Linux. llvm-svn: 52921	2008-06-30 20:40:16 +00:00
Evan Cheng	d206e2ac2a	Remove unneeded include. llvm-svn: 52920	2008-06-30 20:38:22 +00:00
Dan Gohman	31c8123d07	Replace some std::vectors that showed up in heap profiling with SmallVectors. Change the signature of TargetLowering::LowerArguments to avoid returning a vector by value, and update the two targets which still use this directly, Sparc and IA64, accordingly. llvm-svn: 52917	2008-06-30 20:31:15 +00:00
Dan Gohman	328e26d0ac	Correct the allocation size for CCState's UsedRegs member, which only needs one bit for each register. UsedRegs is a SmallVector sized at 16, so this eliminates a heap allocation/free for every call and return processed by Legalize on most targets. llvm-svn: 52915	2008-06-30 20:25:31 +00:00
Duncan Sands	9e08148f29	ExpungeNode is only needed for new nodes! This fixes CodeGen/PowerPC/2008-06-19-LegalizerCrash.ll when using the new LegalizeTypes infrastructure. llvm-svn: 52903	2008-06-30 16:43:45 +00:00
Duncan Sands	36410f6cde	Support for VAARG. As noted in a comment, this is wrong for types like x86 long double and i1, but no worse than what is done in LegalizeDAG. llvm-svn: 52898	2008-06-30 13:55:15 +00:00
Duncan Sands	dd5354df89	Support for promoting select_cc operands. llvm-svn: 52895	2008-06-30 11:50:11 +00:00
Duncan Sands	1ae6ef83ee	Revert the SelectionDAG optimization that makes it impossible to create a MERGE_VALUES node with only one result: sometimes it is useful to be able to create a node with only one result out of one of the results of a node with more than one result, for example because the new node will eventually be used to replace a one-result node using ReplaceAllUsesWith, cf X86TargetLowering::ExpandFP_TO_SINT. On the other hand, most users of MERGE_VALUES don't need this and for them the optimization was valuable. So add a new utility method getMergeValues for creating MERGE_VALUES nodes which by default performs the optimization. Change almost everywhere to use getMergeValues (and tidy some stuff up at the same time). llvm-svn: 52893	2008-06-30 10:19:09 +00:00
Evan Cheng	da3db11db3	- Re-apply 52748 and friends with fix. GetConstantStringInfo() returns an empty string for ConstantAggregateZero case which surprises selectiondag. - Correctly handle memcpy from constant string which is zero-initialized. llvm-svn: 52891	2008-06-30 07:31:25 +00:00
Chris Lattner	9d3740ed1c	Implement split and scalarize for SELECT_CC, fixing PR2504 llvm-svn: 52887	2008-06-30 02:43:01 +00:00
Anton Korobeynikov	a7c583d584	Revert (52748 and friends): Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. This unbreaks llvm-gcc bootstrap. llvm-svn: 52884	2008-06-29 17:57:03 +00:00
Chris Lattner	3cffa471d9	Really fix the bootstrap failure. llvm-svn: 52854	2008-06-28 06:24:50 +00:00
Chris Lattner	1701328675	Add back the capability to include nul characters in strings with GetConstantStringInfo. This will hopefully restore llvm-gcc to happy bootstrap land. llvm-svn: 52851	2008-06-28 05:33:32 +00:00
Dan Gohman	6f7b5a6392	When folding a bitcast into a load or store, preserve the alignment information of the original load or store, which is checked to be at least as good, and possibly better. llvm-svn: 52849	2008-06-28 00:45:22 +00:00
Evan Cheng	9a357637ef	Looks like this condition is inverted. llvm-svn: 52841	2008-06-27 22:11:49 +00:00
Bill Wendling	196c78f0be	Reduce number of times .size() is called on a vector. Rename some variables to match normal naming scheme. llvm-svn: 52820	2008-06-27 07:13:44 +00:00
Owen Anderson	413f7d90db	Use a SmallSet when we can to reduce memory allocations. This speeds up a particular testcase from 0.0302s to 0.0222s in LiveVariables. llvm-svn: 52819	2008-06-27 07:05:59 +00:00
Chris Lattner	735705bc3e	simplify this check, GetConstantStringInfo validates that a global is constant already. No functionality change. llvm-svn: 52812	2008-06-27 03:18:41 +00:00
Bill Wendling	3d92cbffc6	Cruft left from patch revert...sorry. :-( llvm-svn: 52808	2008-06-27 01:32:08 +00:00
Bill Wendling	fcbb9525d0	Reverting broken patch r52803. llvm-svn: 52806	2008-06-27 01:27:56 +00:00
Owen Anderson	21498f52b2	Don't perform expensive queries checking for super and sub registers when we know that there aren't any. This speed up LiveVariables on instcombine at -O0 -g from 0.3855s to 0.3503s. Look for more improvements in this area soon! llvm-svn: 52804	2008-06-27 01:22:50 +00:00
Bill Wendling	f96d046d7c	- Remove a use of std::vector. - Make sure that we're not recalculating the size of a vector that never changes. llvm-svn: 52803	2008-06-27 00:56:36 +00:00
Bill Wendling	c758698d2c	Refactor the DebugInfoDesc stuff out of the MachineModuleInfo file. Clean up some uses of std::vector, where it's return std::vector by value. Yuck! llvm-svn: 52800	2008-06-27 00:09:40 +00:00
Chris Lattner	df1cbdd645	duncan points out that isOperationLegal includes a check for type legality. Thanks Duncan! llvm-svn: 52786	2008-06-26 17:16:00 +00:00
Owen Anderson	7df0d58535	Don't create a whole new string just to copy the elements into it. llvm-svn: 52785	2008-06-26 17:06:02 +00:00
Dale Johannesen	a2de8eab61	Fixes the last x86-64 test failure in compat.exp: <16 x float> is 64-byte aligned (for some reason), which gets us into the stack realignment code. The computation changing FP-relative offsets to SP-relative was broken, assiging a spill temp to a location also used for parameter passing. This fixes it by rounding up the stack frame to a multiple of the largest alignment (I concluded it wasn't fixable without doing this, but I'm not very sure.) llvm-svn: 52750	2008-06-26 01:51:13 +00:00
Eric Christopher	d0ab9c47e6	Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. llvm-svn: 52748	2008-06-26 00:31:12 +00:00
Chris Lattner	b1e66ce3bb	when we know the signbit of an input to uint_to_fp is zero, change it to sint_to_fp on targets where that is cheaper (and visaversa of course). This allows us to compile uint_to_fp to: _test: movl 4(%esp), %eax shrl $23, %eax cvtsi2ss %eax, %xmm0 movl 8(%esp), %eax movss %xmm0, (%eax) ret instead of: .align 3 LCPI1_0: ## double .long 0 ## double least significant word 4.5036e+15 .long 1127219200 ## double most significant word 4.5036e+15 .text .align 4,0x90 .globl _test _test: subl $12, %esp movl 16(%esp), %eax shrl $23, %eax movl %eax, (%esp) movl $1127219200, 4(%esp) movsd (%esp), %xmm0 subsd LCPI1_0, %xmm0 cvtsd2ss %xmm0, %xmm0 movl 20(%esp), %eax movss %xmm0, (%eax) addl $12, %esp ret llvm-svn: 52747	2008-06-26 00:16:49 +00:00
Owen Anderson	b55675e1db	Remember which MachineOperand we were processing, so we don't have to scan the list to find it again later. This speeds up live intervals from 0.37s to 0.30s on instcombine. llvm-svn: 52745	2008-06-25 23:39:39 +00:00
Dan Gohman	39b07db75a	Fix the text in an assert string. llvm-svn: 52744	2008-06-25 22:14:43 +00:00
Evan Cheng	3fc2372d3a	- Fix a x86 vector isel bug: illegal transformation of a vector_shuffle into a shift. - Add a readme entry for a missing vector_shuffle optimization that results in awful codegen. llvm-svn: 52740	2008-06-25 20:52:59 +00:00
Duncan Sands	33ff5c8d0d	Add support for expanding PPC 128 bit floats. For this it is convenient to permit floats to be used with EXTRACT_ELEMENT, so I tweaked things to allow that. I also added libcalls for ppcf128 to i32 forms of FP_TO_XINT, since they exist in libgcc and this case can certainly occur (and does occur in the testsuite) - before the i64 libcall was being used. Also, the XINT_TO_FP result seemed to be wrong when the argument is an i128: the wrong fudge factor was added (the i32 and i64 cases were handled directly, but the i128 code fell through to some generic softening code which seemed to think it was i64 to f32!). So I fixed it by adding a fudge factor that I found in my breakfast cereal. llvm-svn: 52739	2008-06-25 20:24:48 +00:00
Duncan Sands	6920b254ad	Add/complete support for integer and float select_cc and friends. This code could be factorized a bit but I'm not sure that it's worth it. llvm-svn: 52724	2008-06-25 16:34:21 +00:00
Dan Gohman	aa01afd47c	Remove the OrigVT member from AtomicSDNode, as it is redundant with the base SDNode's VTList. llvm-svn: 52722	2008-06-25 16:07:49 +00:00
Mon P Wang	6a490371c9	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Evan Cheng	73db52ebf8	Enable two-address remat by default. llvm-svn: 52701	2008-06-25 01:16:38 +00:00
Owen Anderson	230b3eb80c	Use SmallVector instead of std::vector for a minor compile time improvement. llvm-svn: 52689	2008-06-24 21:44:59 +00:00
Dan Gohman	9cc3f68ab1	A brief survey of priority_queue usage in the tree turned this up as a questionable case, but the code isn't actually needed. llvm-svn: 52657	2008-06-23 23:51:16 +00:00
Bill Wendling	c44659b92a	This situation can occur: ,------. \| \| \| v \| t2 = phi ... t1 ... \| \| \| v \| t1 = ... \| ... = ... t1 ... \| \| `------' where there is a use in a PHI node that's a predecessor to the defining block. We don't want to mark all predecessors as having the value "alive" in this case. Also, the assert was too restrictive and didn't handle this case. llvm-svn: 52655	2008-06-23 23:41:14 +00:00
Dan Gohman	0d8a61eb60	Use the new PriorityQueue in ScheduleDAGList too, which also needs arbitrary-element removal. llvm-svn: 52654	2008-06-23 23:40:09 +00:00
Owen Anderson	79d2fa52fa	Use getMBBEndIdx rather than assuming that the end is right after the last instruction in the block. llvm-svn: 52649	2008-06-23 22:12:23 +00:00
Evan Cheng	71013f8c78	Remove option used to debug stack coloring bugs. It's no longer needed since stack coloring is now bug free. llvm-svn: 52644	2008-06-23 21:24:32 +00:00
Dan Gohman	fa63cc4e91	Move a DenseMap's declaration outside of a loop, and just call clear() on each iteration. This avoids allocating and deallocating all of DenseMap's memory on each iteration. llvm-svn: 52642	2008-06-23 21:15:00 +00:00
Evan Cheng	c72dcd103c	Instead of adding an isSS field to LiveInterval to denote stack slot. Use top bit of 'reg' instead. If the top bit is set, than the LiveInterval represents a stack slot live interval. llvm-svn: 52639	2008-06-23 21:03:19 +00:00
Dan Gohman	b4e2637e9b	Duncan pointed out this code could be tidied. llvm-svn: 52624	2008-06-23 15:29:14 +00:00
Duncan Sands	d803ce29a8	Port some integer multiplication fixes from LegalizeDAG. Bail out with an error if there is no libcall available for the given size of integer. llvm-svn: 52622	2008-06-23 15:15:44 +00:00
Duncan Sands	73ceffa3df	Support for expanding the result of EXTRACT_ELEMENT. llvm-svn: 52621	2008-06-23 15:08:15 +00:00
Duncan Sands	bc12dce8bb	Cleanup up LegalizeTypes handling of loads and stores. llvm-svn: 52620	2008-06-23 14:19:45 +00:00
Duncan Sands	5fb92e58de	Make custom lowering of ADD work correctly. This fixes PR2476; patch by Richard Osborne. The same problem exists for a bunch of other operators, but I'm ignoring this because they will be automagically fixed when the new LegalizeTypes infrastructure lands, since it already solves this problem centrally. llvm-svn: 52610	2008-06-22 09:42:16 +00:00
Dan Gohman	546505e7e1	Simplify some getNode calls. llvm-svn: 52604	2008-06-21 22:06:07 +00:00
Dan Gohman	ea0452016e	canClobberPhysRegDefs shouldn't called without checking hasPhysRegDefs; check this with an assert. llvm-svn: 52603	2008-06-21 22:05:24 +00:00
Dan Gohman	38c19aae38	Use clear() to zero an existing APInt. llvm-svn: 52601	2008-06-21 22:02:15 +00:00
Dan Gohman	1ac5813726	Use back() instead of [size()-1]. llvm-svn: 52600	2008-06-21 22:00:54 +00:00
Dan Gohman	14b911d929	Remove a redundant return. llvm-svn: 52585	2008-06-21 19:34:57 +00:00
Dan Gohman	46520a25a4	Remove ScheduleDAG's SUnitMap altogether. Instead, use SDNode's NodeId field, which is otherwise unused after instruction selection, as an index into the SUnit array. llvm-svn: 52583	2008-06-21 19:18:17 +00:00
Dan Gohman	a4db3352f9	Add a priority queue class, which is a wrapper around std::priority_queue and provides fairly efficient removal of arbitrary elements. Switch ScheduleDAGRRList from std::set to this new priority queue. llvm-svn: 52582	2008-06-21 18:35:25 +00:00
Duncan Sands	3bb8999719	Support for load/store of expanded float types. I don't know if a truncating store is possible here, but added support for it anyway. llvm-svn: 52577	2008-06-21 17:00:47 +00:00
Dan Gohman	e6e1348275	Change ScheduleDAG's SUnitMap from DenseMap<SDNode, vector<SUnit> > to DenseMap<SDNode, SUnit>, and adjust the way cloned SUnit nodes are handled so that only the original node needs to be in the map. This speeds up llc on 447.dealII.llvm.bc by about 2%. llvm-svn: 52576	2008-06-21 15:52:51 +00:00
Evan Cheng	f593a65497	Undo spill weight tweak. Need to investigate the performance regressions. llvm-svn: 52572	2008-06-21 06:45:54 +00:00
Dan Gohman	4b49be1cbe	Simplify some template parameterization. llvm-svn: 52571	2008-06-21 01:08:22 +00:00
Evan Cheng	efc67e78d7	Enhanced heuristic to determine the best register to spill. Instead of picking the register with the lowest spill weight. Consider (up to) 2 additional registers with spill weights that are close to the lowest spill weight. The one with fewest defs and uses that conflicts with the current interval (weighted by loop depth) is the spill candidate. This is not always a win, but there are much more wins than loses and wins tend to be more noticeable. llvm-svn: 52554	2008-06-20 21:45:16 +00:00
Duncan Sands	f362183c24	Share some code that is common between integer and float expansion (and sometimes vector splitting too). llvm-svn: 52548	2008-06-20 18:40:50 +00:00
Duncan Sands	49295b48eb	Rename the operation of turning a float type into an integer of the same type. Before it was "promotion", but this is confusing because it is quite different to promotion of integers. Call it "softening" instead, inspired by "soft float". llvm-svn: 52546	2008-06-20 17:49:55 +00:00
Dan Gohman	3792c470d5	Clean up some uses of std::distance, now that we have allnodes_size. llvm-svn: 52545	2008-06-20 17:15:19 +00:00
Dan Gohman	593a010c56	Teach ReturnInst lowering about aggregate return values. llvm-svn: 52522	2008-06-20 01:29:26 +00:00
Dan Gohman	44b2c57e2b	Fix the index calculations for the extractvalue lowering code. llvm-svn: 52517	2008-06-20 00:54:19 +00:00
Dan Gohman	c7a32fc8ca	Simplify the ComputeLinearIndex logic and fix a few bugs. llvm-svn: 52516	2008-06-20 00:53:00 +00:00
Evan Cheng	be0429c558	ISD::UNDEF should be expanded recursively / iteratively. llvm-svn: 52508	2008-06-19 22:01:11 +00:00
Dan Gohman	6f880690b8	Use the transferSuccessors helper function. llvm-svn: 52495	2008-06-19 17:22:29 +00:00
Evan Cheng	849fa11f15	Missed a check. llvm-svn: 52487	2008-06-19 06:17:19 +00:00
Owen Anderson	3c4ccc830e	Revert my last patch, which was causing regression test failures. llvm-svn: 52485	2008-06-19 05:29:34 +00:00
Evan Cheng	0c8ef553f5	Coalesce copy from one register class to a sub register class. e.g. X86::MOV16to16_. llvm-svn: 52480	2008-06-19 01:39:21 +00:00
Evan Cheng	18e46d455b	Cosmetic changes. llvm-svn: 52479	2008-06-19 01:21:26 +00:00
Evan Cheng	55bc848640	Minor spiller tweak to unfavor reload into load/store instructions. llvm-svn: 52477	2008-06-19 01:16:17 +00:00
Owen Anderson	80ef880b98	Insert empty slots into the instruction numbering in live intervals, so that we can more easily add new instructions. llvm-svn: 52475	2008-06-19 00:10:49 +00:00
Argyrios Kyrtzidis	55a8524241	Fix the source line debug information for the Windows platform. According to DWARF-2 specification, the line information is provided through an offset in the .debug_line section. Replace the label reference that is used with a section offset. llvm-svn: 52468	2008-06-18 19:27:37 +00:00
Evan Cheng	c5618ebdb9	Complete support for two-address pass rematerialization. Now almost always a win. llvm-svn: 52452	2008-06-18 07:49:14 +00:00
Evan Cheng	50d59478da	Cosmetic. llvm-svn: 52450	2008-06-18 07:47:28 +00:00
Evan Cheng	f873ed1b10	Live-through live interval is [mbb start, mbb end+1]. llvm-svn: 52431	2008-06-17 20:13:36 +00:00
Evan Cheng	1eb69314fa	When extending a liveinterval by commuting, don't throw away the live ranges that are not affected. llvm-svn: 52430	2008-06-17 20:11:16 +00:00
Evan Cheng	5e4188f1ac	It's not safe to remove SUBREG_TO_REG that looks like identity copies, e.g. movl %eax, %eax on x86-64 actually does a zero-extend. llvm-svn: 52421	2008-06-17 17:59:16 +00:00
Duncan Sands	4c69995fb2	Split type expansion into ExpandInteger and ExpandFloat rather than bundling them together. Rename FloatToInt to PromoteFloat (better, if not perfect). Reorganize files by types rather than by operations. llvm-svn: 52408	2008-06-17 14:27:01 +00:00
Chris Lattner	1b08c4a709	add a new -enable-value-prop flag for llcbeta, that enables propagation of value info (sign/zero ext info) from one MBB to another. This doesn't handle much right now because of two limitations: 1) only handles zext/sext, not random bit propagation (no assert exists for this) 2) doesn't handle phis. llvm-svn: 52383	2008-06-17 06:09:18 +00:00
Duncan Sands	0ae829e5d1	Fix spelling. llvm-svn: 52381	2008-06-17 03:24:13 +00:00
Evan Cheng	1cde1f8d5e	Do not issue identity copies. llvm-svn: 52373	2008-06-16 22:52:53 +00:00
Owen Anderson	476e91ab75	Remove special case handling of empty MBBs now that we assign indices to them. llvm-svn: 52345	2008-06-16 19:32:40 +00:00
Owen Anderson	773b2d3ac3	Re-enable empty block indexing by default, since it doesn't seem to have any impact on code quality or compile time. llvm-svn: 52329	2008-06-16 16:58:24 +00:00
Duncan Sands	37c1f5267b	Allow these transforms for types like i256 while still excluding types like i1 (not byte sized) and i120 (loading an i120 requires loading an i64, an i32, an i16 and an i8, which is expensive). llvm-svn: 52310	2008-06-16 08:14:38 +00:00
Evan Cheng	51c75c0c95	Fix read after free found by valgrind. llvm-svn: 52309	2008-06-16 07:34:17 +00:00
Evan Cheng	03553bb59a	Add option to commuteInstruction() which forces it to create a new (commuted) instruction. llvm-svn: 52308	2008-06-16 07:33:11 +00:00
Owen Anderson	e546c55e59	Make indexing empty basic blocks an option for the moment. llvm-svn: 52306	2008-06-16 07:10:49 +00:00
Owen Anderson	d813091cde	Assign indices to empty basic blocks. This will be necessary for StrongPHIElimination in the near future. llvm-svn: 52300	2008-06-16 06:18:41 +00:00
Duncan Sands	075293ff46	The transforms in visitEXTRACT_VECTOR_ELT are not valid if the load is volatile. Hopefully all wrong DAG combiner transforms of volatile loads and stores have now been caught. llvm-svn: 52293	2008-06-15 20:12:31 +00:00
Duncan Sands	0bc21c0551	LegalizeTypes support for INSERT_VECTOR_ELT with a non-constant index. llvm-svn: 52292	2008-06-15 20:00:14 +00:00
Duncan Sands	b1bfff53fe	Remove a redundant AfterLegalize check. Turn on some code when !AfterLegalize - but since this whole code section is turned off by an "if (0)" it's not really turning anything on. llvm-svn: 52276	2008-06-14 17:48:34 +00:00
Andrew Lenharth	f88d50bfcc	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Evan Cheng	fb79059b85	Teach the spiller to commute instructions in order to fold a reload. This hits 410 times on 444.namd and 122 times on 252.eon. llvm-svn: 52266	2008-06-13 23:58:02 +00:00
Duncan Sands	8651e9c584	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Duncan Sands	bf17080ec2	Sometimes (rarely) nodes held in LegalizeTypes maps can be deleted. This happens when RAUW replaces a node N with another equivalent node E, deleting the first node. Solve this by adding (N, E) to ReplacedNodes, which is already used to remap nodes to replacements. This means that deleted nodes are being allowed in maps, which can be delicate: the memory may be reused for a new node which might get confused with the old deleted node pointer hanging around in the maps, so detect this and flush out maps if it occurs (ExpungeNode). The expunging operation is expensive, however it never occurs during a llvm-gcc bootstrap or anywhere in the nightly testsuite. It occurs three times in "make check": Alpha/illegal-element-type.ll, PowerPC/illegal-element-type.ll and X86/mmx-shift.ll. If expunging proves to be too expensive then there are other more complicated ways of solving the problem. In the normal case this patch adds the overhead of a few more map lookups, which is hopefully negligable. llvm-svn: 52214	2008-06-11 11:42:12 +00:00
Dan Gohman	e38cc01244	Teach isGAPlusOffset to respect a GlobalAddressSDNode's offset value, which is something that apparently isn't used much. llvm-svn: 52158	2008-06-09 22:05:52 +00:00
Dan Gohman	6001b91d8e	CodeGen support for aggregate-value function arguments. llvm-svn: 52156	2008-06-09 21:19:23 +00:00
Duncan Sands	67d0f332d5	Various tweaks related to apint codegen. No functionality change for non-funky-sized integers. llvm-svn: 52151	2008-06-09 15:48:25 +00:00
Dan Gohman	d485e4eb5c	Handle empty aggregate values. llvm-svn: 52150	2008-06-09 15:21:47 +00:00
Duncan Sands	93b6609ae2	Remove some DAG combiner assumptions about sizes of integer types. Fix the isMask APInt method to actually work (hopefully) rather than crashing because it adds apints of different bitwidths. It looks like isShiftedMask is also broken, but I'm leaving that one to the APInt people (it is not used anywhere). llvm-svn: 52142	2008-06-09 11:32:28 +00:00
Duncan Sands	11dd424539	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Dan Gohman	f6743d70ab	CodeGen support for insertvalue and extractvalue, and for loads and stores of aggregate values. llvm-svn: 52069	2008-06-07 02:02:36 +00:00
Owen Anderson	0bd08cf64c	Connect successors before creating the DAG node for the branch. This has no visible functionality change, but enables a future patch where node creation will update the CFG if it decides to create an unconditional rather than a conditional branch. llvm-svn: 52067	2008-06-07 00:00:23 +00:00
Evan Cheng	c324be32c4	Enable stack coloring by default. llvm-svn: 52057	2008-06-06 19:52:44 +00:00
Duncan Sands	f1123e58fc	Tighten up the abstraction slightly. llvm-svn: 52045	2008-06-06 12:49:32 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Evan Cheng	6d7a144453	Refine stack slot interval weight computation. llvm-svn: 52040	2008-06-06 07:54:39 +00:00
Owen Anderson	05d1cd99b3	Remove debugging code. llvm-svn: 52016	2008-06-05 18:43:34 +00:00
Owen Anderson	65dc2b2e06	Use the newly created helper on LiveIntervals. llvm-svn: 52013	2008-06-05 17:22:53 +00:00
Owen Anderson	35e2dfe1cf	Add a helper for constructing new live ranges that ended from an instruction to the end of its MBB. llvm-svn: 52012	2008-06-05 17:15:43 +00:00
Evan Cheng	976b1eee81	Fix a memcpy lowering bug. Even though the memcpy alignment is smaller than the desired alignment, the frame destination alignment may still be larger than the desired alignment. Don't change its alignment to something smaller. llvm-svn: 51970	2008-06-04 23:37:54 +00:00
Evan Cheng	9e27316337	Oops. Should not be enabled by default. llvm-svn: 51953	2008-06-04 18:09:20 +00:00
Owen Anderson	3a697779fe	Correctly construct live intervals for the copies we inserted into the predecessors of a block containing a PHI. llvm-svn: 51950	2008-06-04 17:55:58 +00:00
Evan Cheng	805cde4fbf	Revert this. llvm-svn: 51949	2008-06-04 17:21:44 +00:00
Evan Cheng	12a0222a01	Add a stack slot coloring pass. Not yet enabled. llvm-svn: 51934	2008-06-04 09:18:41 +00:00
Evan Cheng	a15e7eb29b	LowerSubregs should not clobber any analysis. llvm-svn: 51933	2008-06-04 09:17:16 +00:00
Evan Cheng	eecdf659e8	Move #include to right place. llvm-svn: 51932	2008-06-04 09:16:33 +00:00
Evan Cheng	c5b3a3bea5	Register if-converter pass for -debug-pass. llvm-svn: 51931	2008-06-04 09:15:51 +00:00
Duncan Sands	fc3c489b52	Change packed struct layout so that field sizes are the same as in unpacked structs, only field positions differ. This only matters for structs containing x86 long double or an apint; it may cause backwards compatibility problems if someone has bitcode containing a packed struct with a field of one of those types. The issue is that only 10 bytes are needed to hold an x86 long double: the store size is 10 bytes, but the ABI size is 12 or 16 bytes (linux/ darwin) which comes from rounding the store size up by the alignment. Because it seemed silly not to pack an x86 long double into 10 bytes in a packed struct, this is what was done. I now think this was a mistake. Reserving the ABI size for an x86 long double field even in a packed struct makes things more uniform: the ABI size is now always used when reserving space for a type. This means that developers are less likely to make mistakes. It also makes life easier for the CBE which otherwise could not represent all LLVM packed structs (PR2402). Front-end people might need to adjust the way they create LLVM structs - see following change to llvm-gcc. llvm-svn: 51928	2008-06-04 08:21:45 +00:00
Owen Anderson	c777d9e6fe	We need to subtract one from this index because live ranges are open at the end. llvm-svn: 51922	2008-06-04 00:38:56 +00:00
Scott Michel	a7d8649f78	Fix spellnig error llvm-svn: 51917	2008-06-03 19:13:20 +00:00
Scott Michel	c0e9ff6e52	Find a better place to output hex constants corresponding to integers. llvm-svn: 51904	2008-06-03 15:39:51 +00:00
Bruno Cardoso Lopes	2dd8fdc78a	Fixed bug in bad behavior in calculateFrameObjectOffsets, the solution commited is different from the previous patch to avoid int and unsigned comparison llvm-svn: 51899	2008-06-03 08:46:59 +00:00
Evan Cheng	097826643b	Do not run loop-aligner at -fast (e.g. -O0). llvm-svn: 51898	2008-06-03 06:56:08 +00:00
Scott Michel	499c119d7c	Revert this patch llvm-svn: 51897	2008-06-03 06:18:19 +00:00
Dan Gohman	057240f4f0	Fold adds and subtracts of zero immediately, instead of waiting for dagcombine to do this. llvm-svn: 51886	2008-06-02 22:27:05 +00:00
Scott Michel	5793cbcf44	Minor cosmetic patch so that the hex equivalent of a decimal constant shows up in the assembly language output. Helps with debugging without a HP calculator having to be handy. llvm-svn: 51885	2008-06-02 22:19:12 +00:00
Scott Michel	d831cc49e5	Add necessary 64-bit support so that gcc frontend compiles (mostly). Current issue is operand promotion for setcc/select... but looks like the fundamental stuff is implemented for CellSPU. llvm-svn: 51884	2008-06-02 22:18:03 +00:00
Owen Anderson	0908deccc2	Correctly handle removed instructions at the beginning of MBBs when renumbering. llvm-svn: 51876	2008-06-02 17:36:36 +00:00
Dan Gohman	9a19f33842	Remove an unused variable. llvm-svn: 51807	2008-05-31 01:44:25 +00:00
Evan Cheng	fe3ec48bc4	Fix indentation. llvm-svn: 51793	2008-05-30 22:39:32 +00:00
Owen Anderson	c5e21e4f38	The coalescer doesn't need LiveVariables now that we have register use iterators. llvm-svn: 51790	2008-05-30 22:37:27 +00:00
Owen Anderson	e785fb639c	Preserve the register coallescer, and update live intervals more correctly by triggering a renumbering after phi elimination. llvm-svn: 51780	2008-05-30 18:38:26 +00:00
Dan Gohman	8807147ada	Remove an unused variable. llvm-svn: 51721	2008-05-30 00:56:36 +00:00
Owen Anderson	82fc4cdafb	Make the renumbering correct in the face of deleted instructions that have been removed from the LiveIntervals maps. llvm-svn: 51714	2008-05-29 23:01:22 +00:00
Bill Wendling	bf5b228c32	Remove <iostream>. llvm-svn: 51704	2008-05-29 21:29:39 +00:00
Dan Gohman	714663ab94	Expand small memmovs using inline code. Set the X86 threshold for expanding memmove to a more plausible value, now that it's actually being used. llvm-svn: 51696	2008-05-29 19:42:22 +00:00
Owen Anderson	d95dcd12c9	Revert part of my last patch that I didn't intend to commit yet. llvm-svn: 51694	2008-05-29 18:35:21 +00:00
Owen Anderson	0178e95791	Renumbering needs to account for instruction slot offsets when performing lookups in the index maps. llvm-svn: 51691	2008-05-29 18:15:49 +00:00
Evan Cheng	5e28227dbd	Implement vector shift up / down and insert zero with ps{rl}lq / ps{rl}ldq. llvm-svn: 51667	2008-05-29 08:22:04 +00:00
Bill Wendling	7a1a8eb6e2	Implement "AsCheapAsAMove" for some obviously cheap instructions: xor and the like. llvm-svn: 51662	2008-05-29 01:02:09 +00:00
Bill Wendling	3f6bb2713e	Add a flag to indicate that an instruction is as cheap (or cheaper) than a move instruction to execute. This can be used for transformations (like two-address conversion) to remat an instruction instead of generating a "move" instruction. The idea is to decrease the live ranges and register pressure and all that jazz. llvm-svn: 51660	2008-05-28 22:54:52 +00:00
Bill Wendling	5a83b097ed	Check the "isSafeToMove" predicate, which has a series of tests to make sure that it's safe to remat an instruction. llvm-svn: 51659	2008-05-28 22:52:47 +00:00
Owen Anderson	779b4180dc	Remap VNInfo data as well when doing renumbering. llvm-svn: 51658	2008-05-28 22:40:08 +00:00
Owen Anderson	4f8e1ad32a	Factor the numbering computation into a separate method, and add the slightest attempt at some renumbering logic, which is currently unused. llvm-svn: 51652	2008-05-28 20:54:50 +00:00
Evan Cheng	68079268f5	Fix PR2289: vr defined by multiple implicit_def as result of coalescing. llvm-svn: 51648	2008-05-28 17:40:10 +00:00
Evan Cheng	427412e7c8	Teach local register allocator to deal with landing pad MBB's. llvm-svn: 51647	2008-05-28 17:22:32 +00:00
Bill Wendling	2e44ec7c4d	Incorporated feedback: Check that the implicitly defined operands aren't used before deleting the instruction. llvm-svn: 51609	2008-05-27 20:40:52 +00:00
Duncan Sands	698348dfac	Fix some constructs that gcc-4.4 warns about. llvm-svn: 51591	2008-05-27 11:50:51 +00:00
Bill Wendling	2e8c82893b	The enabling of remat in 2-address conversion breaks this test: Running /Users/void/llvm/llvm.src/test/CodeGen/X86/dg.exp ... FAIL: /Users/void/llvm/llvm.src/test/CodeGen/X86/2007-11-30-LoadFolding-Bug.ll Failed with exit(1) at line 1 while running: llvm-as < /Users/void/llvm/llvm.src/test/CodeGen/X86/2007-11-30-LoadFolding-Bug.ll \| llc -march=x86 -mattr=+sse2 -stats \|& grep {1 .*folded into instructions} child process exited abnormally Make this conditional for now. llvm-svn: 51563	2008-05-26 05:49:49 +00:00
Bill Wendling	c737e4639a	A problem that's exposed when machine LICM is enabled. Consider this code: LBB1_3: # bb ... xorl %ebp, %ebp subl (%ebx), %ebp ... incl %ecx cmpl %edi, %ecx jl LBB1_3 # bb Whe using machine LICM, LLVM converts it into: xorl %esi, %esi LBB1_3: # bb ... movl %esi, %ebp subl (%ebx), %ebp ... incl %ecx cmpl %edi, %ecx jl LBB1_3 # bb Two address conversion inserts the copy instruction. However, it's cheaper to rematerialize it, and remat helps reduce register pressure. llvm-svn: 51562	2008-05-26 05:18:34 +00:00
Evan Cheng	7c0db62a5e	Revert 51440 as it breaks a bunch of PIC tests. llvm-svn: 51513	2008-05-23 23:00:04 +00:00
Dan Gohman	643b3a0581	Add #includes to make some dependencies explicit. llvm-svn: 51496	2008-05-23 20:40:06 +00:00
Dale Johannesen	b28a17c346	Rewrite a loop to avoid using iterators pointing to elements that have been erased. Based on a patch by Nicolas Capens. llvm-svn: 51485	2008-05-23 17:19:02 +00:00
Dan Gohman	6d5f120c5c	Generalize the new code in instcombine's ComputeNumSignBits for handling and/or to handle more cases (such as this add-sitofp.ll testcase), and port it to selectiondag's ComputeNumSignBits. llvm-svn: 51469	2008-05-23 02:28:01 +00:00
Bill Wendling	6e326bf0fb	Remove warnings about comparison between signed and unsigned expressions. llvm-svn: 51465	2008-05-23 01:29:08 +00:00
Dan Gohman	396ed504f1	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51460	2008-05-23 00:34:04 +00:00
Dan Gohman	30ab45d01e	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51459	2008-05-23 00:17:26 +00:00
David Greene	830487035e	When rewriting defs and uses after spilling, don't set the weight of a live interval to infinity if the instruction being rewritten is an original remat def instruction. We were only checking against the clone of the remat def which doesn't actually appear in the IR at all. llvm-svn: 51440	2008-05-22 21:16:33 +00:00
David Greene	54b52fe13b	Don't attempt to update SpillSlotToUsesMap for stack slots that aren't generated by the spiller. llvm-svn: 51439	2008-05-22 21:12:21 +00:00
Evan Cheng	a5d27ae586	Fix PR2343. An interesting coalescer bug. BB1: vr1025 = copy vr1024 .. BB2: vr1024 = op = op vr1025 <loop eventually branch back to BB1> Even though vr1025 is copied from vr1024, it's not safe to coalesced them since live range of vr1025 intersects the def of vr1024. This happens when vr1025 is assigned the value of the previous iteration of vr1024 in the loop. llvm-svn: 51394	2008-05-21 22:34:12 +00:00
Dan Gohman	fe13618682	Port the fix for the select operator from instcombine's ComputeNumSignBits to SelectionDAG's ComputeNumSignBits. llvm-svn: 51348	2008-05-20 20:59:51 +00:00
Dan Gohman	c1a4e212a3	Code simplification. llvm-svn: 51345	2008-05-20 20:56:33 +00:00
Evan Cheng	0609ab646b	More local spiller complexity! If local spiller optimization turns some instruction into an identity copy, it will be removed. If the output register happens to be dead (and source is obviously killed), transfer the kill / dead information to last use / def in the same MBB. llvm-svn: 51306	2008-05-20 08:13:21 +00:00
Evan Cheng	c8b028daa4	Don't spill dead def. llvm-svn: 51305	2008-05-20 08:10:37 +00:00
Dale Johannesen	5bf742f2aa	Handle quoted names when constructing $stub's, $non_lazy_ptr's and $lazy_ptr's. llvm-svn: 51277	2008-05-19 21:38:18 +00:00
Gabor Greif	e1f6e4b21d	API change for {BinaryOperator\|CmpInst\|CastInst}::create*() --> Create. Legacy interfaces will be in place for some time. (Merge from use-diet branch.) llvm-svn: 51200	2008-05-16 19:29:10 +00:00
Evan Cheng	9ac3631fa3	If the result of a BIT_CONVERT is a v1* vector, it doesn't mean its source is a v1* vector. llvm-svn: 51192	2008-05-16 17:19:05 +00:00
Duncan Sands	70424d195a	Silence the compiler warning differently. The original method caused gcc-4.2 to complain. llvm-svn: 51186	2008-05-16 09:19:16 +00:00
Nate Begeman	f79f52282c	Actually scalarize the operand to BIT_CONVERT instead of asking someone to do something with a v1 type. llvm-svn: 51160	2008-05-15 20:40:58 +00:00
Dan Gohman	12fce7751b	IR support for extractvalue and insertvalue instructions. Also, begin moving toward making structs and arrays first-class types. llvm-svn: 51157	2008-05-15 19:50:34 +00:00
Evan Cheng	ef377adca0	Make use of vector load and store operations to implement memcpy, memmove, and memset. Currently only X86 target is taking advantage of these. llvm-svn: 51140	2008-05-15 08:39:06 +00:00
Evan Cheng	4ea9d49590	Use a better idiom to silence compiler warnings. llvm-svn: 51131	2008-05-14 21:08:07 +00:00
Evan Cheng	0f7fb95e79	Really silence compiler warnings. llvm-svn: 51126	2008-05-14 20:29:30 +00:00
Evan Cheng	a5b0a8d7fe	Really silence compiler warnings. llvm-svn: 51123	2008-05-14 20:26:35 +00:00
Dale Johannesen	ce4396bc92	Add CommonLinkage; currently tentative definitions are represented as "weak", but there are subtle differences in some cases on Darwin, so we need both. The intent is that "common" will behave identically to "weak" unless somebody changes their target to do something else. No functional change as yet. llvm-svn: 51118	2008-05-14 20:12:51 +00:00
Evan Cheng	763ec13862	Silence some compiler warnings. llvm-svn: 51115	2008-05-14 20:07:51 +00:00
Dan Gohman	3ab94df276	When bit-twiddling CondCode values for integer comparisons produces SETOEQ, is it does with (SETEQ & SETULE), map it to SETEQ. llvm-svn: 51112	2008-05-14 18:17:09 +00:00
Dan Gohman	fd3e3003f3	Whitespace cleanups. llvm-svn: 51089	2008-05-14 00:43:10 +00:00
Evan Cheng	1120279ae6	Instead of a vector load, shuffle and then extract an element. Load the element from address with an offset. pshufd $1, (%rdi), %xmm0 movd %xmm0, %eax => movl 4(%rdi), %eax llvm-svn: 51026	2008-05-13 08:35:03 +00:00
Dan Gohman	0479aa5c0b	Change class' public PassInfo variables to by initialized with the address of the PassInfo directly instead of calling getPassInfo. This eliminates a bunch of dynamic initializations of static data. Also, fold RegisterPassBase into PassInfo, make a bunch of its data members const, and rearrange some code to initialize data members in constructors instead of using setter member functions. llvm-svn: 51022	2008-05-13 02:05:11 +00:00
Dan Gohman	d78c400b5b	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Nate Begeman	b87e63a730	Teach Legalize how to scalarize VSETCC Teach X86 a few more vsetcc patterns. Custom lowering for unsupported ones is next. llvm-svn: 51009	2008-05-12 23:09:43 +00:00
Evan Cheng	b980f6fb3d	Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008	2008-05-12 23:04:07 +00:00
Dale Johannesen	9d29283fc7	Be more aggressive about tail-merging small blocks if those blocks consist entirely of common instructions; merging will not add an extra branch in this case. llvm-svn: 51006	2008-05-12 22:53:12 +00:00
Bill Wendling	6b8bd513d4	Constify isSourceDefinedByImplicitDef function. Otherwise, just formatting changes that don't change functionality. llvm-svn: 51004	2008-05-12 22:15:05 +00:00
Dale Johannesen	c4c4d8e1f7	Further rework of tail merge algorithm. Not quite semantically identical, but little difference in either results or execution speed; but it's much easier to read, at least IMO. llvm-svn: 50999	2008-05-12 20:33:57 +00:00
Evan Cheng	2609d5e779	Refactor isConsecutiveLoad from X86 to TargetLowering so DAG combiner can make use of it. llvm-svn: 50991	2008-05-12 19:56:52 +00:00
Bill Wendling	2930845065	Revert the previous commit. Go ahead and hoist rematerializable instructions. llvm-svn: 50990	2008-05-12 19:47:18 +00:00
Nate Begeman	cfcb56091b	Add support for vicmp/vfcmp codegen, more legalize support coming. This is necessary to unbreak the build. llvm-svn: 50988	2008-05-12 19:40:03 +00:00
Bill Wendling	70613b84da	One real change - don't hoist something that's trivially rematerializable. It's possible for it to produce worse code than before. The rest of this patch is code cleanup. llvm-svn: 50987	2008-05-12 19:38:32 +00:00
Dan Gohman	ecb77385ab	Fix a missing break in the ISD::FLT_ROUNDS_ handling. Patch by giuma! llvm-svn: 50967	2008-05-12 16:07:15 +00:00
Evan Cheng	bec201fa06	If all sources of a PHI node are defined by an implicit_def, just emit an implicit_def instead of a copy. llvm-svn: 50927	2008-05-10 00:17:50 +00:00
Bill Wendling	19e3c857b8	Cosmetic changes: - Comment fixes. - Moar whitespace. - Made ivars "private" by default. No functionality change. llvm-svn: 50926	2008-05-10 00:12:52 +00:00
Dale Johannesen	66da8b5334	Remove an evil vector bool. Cosmetic refactoring, no functional change. llvm-svn: 50921	2008-05-09 23:28:24 +00:00
Dale Johannesen	cff7df201c	Rewrite tail merging algorithm to handle the case where there are multiple blocks with a large number of common tail instructions more efficiently (compile time optimization). llvm-svn: 50916	2008-05-09 21:24:35 +00:00
Duncan Sands	9897eee483	Get exception handling working again on 64 bit Darwin. This is a hack of course, but it does at least look at the right thing: gotpcrel means that this is already an offset, so an explicit offset is not needed (and wrong). I think this is good enough for the moment: Anton is working on something better. llvm-svn: 50850	2008-05-08 12:33:11 +00:00
Anton Korobeynikov	fc2edad4ae	Turn StripPointerCast() into a method llvm-svn: 50836	2008-05-07 22:54:15 +00:00
Duncan Sands	e2b0bf43a7	Output correct exception handling and frame info on x86-64 linux. This causes no regressions on 32 bit linux and 32 bit ppc. More tests pass on 64 bit ppc with no regressions. I didn't turn on eh on 64 bit linux because the intrinsics needed to compile the eh runtime aren't done yet. But if you turn it on and link with the mainline runtime then eh seems to work fine on x86-64 linux with this patch. Thanks to Dale for testing. The main point of the patch is that if you output that some object is encoded using 4 bytes you had better not output 8 bytes for it: the patch makes everything consistent. llvm-svn: 50825	2008-05-07 19:11:09 +00:00
Evan Cheng	7ca4a67ca1	Yet another nasty spiller bug. %ecx = op store %cl<kill>, (addr) (addr) = op %al It's not safe to unfold the last operand and eliminate store even though %cl is marked kill. It's a sub-register use which means one of its super-register(s) may be used below. llvm-svn: 50794	2008-05-07 00:49:28 +00:00
Dan Gohman	5a3eecdfd8	Fix a bug in the ComputeMaskedBits logic for multiply. llvm-svn: 50793	2008-05-07 00:35:55 +00:00
Anton Korobeynikov	82c02b28f3	Make StripPointerCast a common function (should we mak it method of Value instead?) llvm-svn: 50775	2008-05-06 22:52:30 +00:00
Dan Gohman	6a2da37c0e	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Dan Gohman	a8b7e78f54	Remove uses of llvm/System/IncludeFile.h that are no longer needed. llvm-svn: 50695	2008-05-06 01:32:53 +00:00
Dan Gohman	38dc08f36f	Instead of enumerating each opcode that isn't handled that ComputeMaskedBits handles, just use a 'default:'. This avoids TargetLowering's list getting out of date with SelectionDAG's. llvm-svn: 50693	2008-05-06 00:53:29 +00:00
Dan Gohman	cf0e3acf16	Correct the value of LowBits in srem and urem handling in ComputeMaskedBits. llvm-svn: 50692	2008-05-06 00:51:48 +00:00
Dan Gohman	72a0bc148c	Fix a broken doxygen comment, and reword it for clarity. llvm-svn: 50687	2008-05-06 00:20:10 +00:00
Mon P Wang	3e58393c3d	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Dan Gohman	e3a63ba3cd	Fix a bug in the ELF writer that caused it to produce malformed ELF headers. The ELF writer still isn't generally usable though. llvm-svn: 50652	2008-05-05 16:48:32 +00:00
Dan Gohman	bcde172222	Add AsmPrinter support for emitting a directive to declare that the code being generated does not require an executable stack. Also, add target-specific code to make use of this on Linux on x86. llvm-svn: 50634	2008-05-05 00:28:39 +00:00
Dan Gohman	1962c2be6a	Fix a mistake in the computation of leading zeros for udiv. llvm-svn: 50591	2008-05-02 21:30:02 +00:00
Dan Gohman	2f83b47863	Fix a typo in a comment. llvm-svn: 50562	2008-05-02 00:05:03 +00:00
Dan Gohman	ea6357828b	Use push_back(...) instead of resize(1, ...), per review feedback. llvm-svn: 50561	2008-05-02 00:03:54 +00:00
Dan Gohman	752ce50b2d	Fix uninitialized uses of the FPC variable. llvm-svn: 50558	2008-05-01 23:40:44 +00:00
Chris Lattner	d4b2a67cf3	don't randomly miscompile seto/setuo just because we are in ffastmath mode. This fixes rdar://5902801, a miscompilation of gcc.dg/builtins-8.c. Bill, please pull this into Tak. llvm-svn: 50523	2008-05-01 07:26:11 +00:00
Arnold Schwaighofer	be0de34ede	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Dale Johannesen	c110c4a526	Add comments for previous patch as requested. llvm-svn: 50463	2008-04-30 00:43:29 +00:00
Scott Michel	be940424b3	Fix custom target lowering for zero/any/sign_extend: make sure that DAG.UpdateNodeOperands() is called before (not after) the call to TLI.LowerOperation(). llvm-svn: 50461	2008-04-30 00:26:38 +00:00
Dale Johannesen	fc3e3ad74d	Make eh_frame objects by 8-byte aligned on 64-bit targets. llvm-svn: 50451	2008-04-29 22:58:20 +00:00
Roman Levenstein	6b37114590	Use std::set instead of std::priority_queue for the RegReductionPriorityQueue. This removes the existing bottleneck related to the removal of elements from the middle of the queue. Also fixes a subtle bug in ScheduleDAGRRList::CapturePred: It was updating the state of the SUnit before removing it. As a result, the comparison operators were working incorrectly and this SUnit could not be removed from the queue properly. Reviewed by Evan and Dan. Approved by Dan. llvm-svn: 50412	2008-04-29 09:07:59 +00:00
Chris Lattner	5c88f7b1ad	make the vector conversion magic handle multiple results. We now compile test2/test3 to: _test2: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End addps %xmm1, %xmm0 ret _test3: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End paddd %xmm1, %xmm0 ret as expected. llvm-svn: 50389	2008-04-29 04:48:56 +00:00
Chris Lattner	f9a49c4322	add support for multiple return values in inline asm. This is a step towards PR2094. It now compiles the attached .ll file to: _sad16_sse2: movslq %ecx, %rax ## InlineAsm Start %ecx %rdx %rax %rax %r8d %rdx %rsi ## InlineAsm End ## InlineAsm Start set %eax ## InlineAsm End ret which is pretty decent for a 3 output, 4 input asm. llvm-svn: 50386	2008-04-29 04:29:54 +00:00
Evan Cheng	11b98b6612	Another extract_subreg coalescing bug. e.g. vr1024<2> extract_subreg vr1025, 2 If vr1024 do not have the same register class as vr1025, it's not safe to coalesce this away. For example, vr1024 might be a GPR32 while vr1025 might be a GPR64. llvm-svn: 50385	2008-04-29 01:41:44 +00:00
Evan Cheng	b96782ecbd	Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling units. If it's creating multiple CopyToReg nodes that are "flagged" together, it should not create a TokenFactor for it's chain outputs: c1, f1 = CopyToReg c2, f2 = CopyToReg c3 = TokenFactor c1, c2 ... = user c3, ..., f2 Now that the two CopyToReg's and the user are "flagged" together. They effectively forms a single scheduling unit. The TokenFactor is now both an operand and a successor of the Flagged nodes. llvm-svn: 50376	2008-04-28 22:07:13 +00:00
Dan Gohman	c968c1f592	Evan pointed out that folding sext to zext may not be correct if the zext is not legal. llvm-svn: 50368	2008-04-28 18:47:17 +00:00
Dan Gohman	77ce6da378	Delete an unused constructor. llvm-svn: 50367	2008-04-28 18:28:49 +00:00
Dan Gohman	d961d30b7f	Add a comment to CreateRegForValue that clarifies the handling of aggregate types. llvm-svn: 50366	2008-04-28 18:19:43 +00:00
Dan Gohman	80c692d439	Rewrite the comments for RegsForValue and its members, and reorder some of the members for clarity. llvm-svn: 50365	2008-04-28 18:10:39 +00:00
Dan Gohman	14a05df97b	Don't call size() on each iteration of the loop. llvm-svn: 50361	2008-04-28 17:42:03 +00:00
Dan Gohman	da44054867	Fix the SVOffset values for loads and stores produced by memcpy/memset expansion. It was a bug for the SVOffset value to be used in the actual address calculations. llvm-svn: 50359	2008-04-28 17:15:20 +00:00
Dan Gohman	72ec3f4562	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Dan Gohman	3eb10f758e	Teach DAGCombine to convert (sext x) to (zext x) when the sign-bit of x is known to be zero. llvm-svn: 50357	2008-04-28 16:58:24 +00:00
Chris Lattner	c9e280c78a	Another collection of random cleanups. No functionality change. llvm-svn: 50341	2008-04-28 07:16:35 +00:00
Chris Lattner	52504e78fb	Remove the SmallVector ctor that converts from a SmallVectorImpl. This conversion open the door for many nasty implicit conversion issues, and can be easily solved by initializing with (V.begin(), V.end()) when needed. This patch includes many small cleanups for sdisel also. llvm-svn: 50340	2008-04-28 06:44:42 +00:00
Chris Lattner	8c7f5ad968	switch RegsForValue::Regs to be a SmallVector to avoid heap thrash on tiny (usually single-element) vectors. llvm-svn: 50335	2008-04-28 06:02:19 +00:00
Chris Lattner	d04b818a91	move static function out of anon namespace, no functionality change. llvm-svn: 50330	2008-04-27 23:48:12 +00:00
Chris Lattner	122721843b	Another step to getting multiple result inline asm to work. llvm-svn: 50329	2008-04-27 23:44:28 +00:00
Chris Lattner	58b9ece38d	typo llvm-svn: 50316	2008-04-27 01:49:46 +00:00
Chris Lattner	2237973438	Implement a signficant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	a937baeb9b	isa+cast -> dyn_cast llvm-svn: 50314	2008-04-27 00:16:18 +00:00
Chris Lattner	4793515a9c	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Chris Lattner	724539c001	A few inline asm cleanups: - Make targetlowering.h fit in 80 cols. - Make LowerAsmOperandForConstraint const. - Make lowerXConstraint -> LowerXConstraint - Make LowerXConstraint return a const char* instead of taking a string byref. llvm-svn: 50312	2008-04-26 23:02:14 +00:00
Dan Gohman	ca95a5f49f	Remove the code from CodeGenPrepare that moved getresult instructions to the block that defines their operands. This doesn't work in the case that the operand is an invoke, because invoke is a terminator and must be the last instruction in a block. Replace it with support in SelectionDAGISel for copying struct values into sequences of virtual registers. llvm-svn: 50279	2008-04-25 18:27:55 +00:00
Nate Begeman	6f94f61317	Pull the code to perform an INSERT_VECTOR_ELT in memory out into its own function, and then use it to fix a bug in SplitVectorOp that expected inserts to always have constant insertion indices. llvm-svn: 50273	2008-04-25 18:07:40 +00:00
Evan Cheng	3980a7911a	- Check if a register is livein before removing it. It may have already been removed. - Do not iterate over SmallPtrSet, the order of iteration is not deterministic. llvm-svn: 50209	2008-04-24 09:06:33 +00:00
Dan Gohman	e9e3891c09	Use isa instead of dyn_cast. llvm-svn: 50181	2008-04-23 20:25:16 +00:00
Dan Gohman	b418aafabf	Add support to codegen for getresult instructions with undef operands. llvm-svn: 50180	2008-04-23 20:21:29 +00:00
Anton Korobeynikov	0516b6f2b0	Unbreak JIT llvm-svn: 50173	2008-04-23 18:26:03 +00:00
Anton Korobeynikov	7e859dd7f0	Add facility for pre-RA passes llvm-svn: 50165	2008-04-23 18:22:28 +00:00
Anton Korobeynikov	41334635cc	Use precomputed value, if any llvm-svn: 50164	2008-04-23 18:21:50 +00:00
Anton Korobeynikov	f49bc9f8ed	Cleanup llvm-svn: 50160	2008-04-23 18:19:47 +00:00
Dan Gohman	dc90919d2b	Fix an out-of-bounds access in -view-sunit-dags in the case of an empty ScheduleDAG. llvm-svn: 50054	2008-04-21 20:07:30 +00:00
Dale Johannesen	aac27592f0	Check we aren't trying to convert PPC long double. This fixes the testsuite failure on ppcf128-4.ll. llvm-svn: 49994	2008-04-20 18:23:46 +00:00
Chris Lattner	3b18762f40	Switch to using Simplified ConstantFP::get API. llvm-svn: 49977	2008-04-20 00:41:09 +00:00
Duncan Sands	1ec193e90b	Implement a bit more softfloat support in LegalizeTypes. Correct the load logic so that it actually works, and also teach it to handle floating point extending loads. llvm-svn: 49923	2008-04-18 20:56:03 +00:00
Duncan Sands	a8a61562af	Add some more FIXME's for indexed loads and stores. llvm-svn: 49916	2008-04-18 20:27:12 +00:00
Duncan Sands	b4e0b24e0a	Provide an explicit list of operands to MakeLibcall, rather than having it suck them out of a node. Add a bunch of new libcalls, and remove dead softfloat code (dead, because FloatToInt is used not Expand in this case). Note that indexed stores probably aren't handled properly, likewise for loads. llvm-svn: 49915	2008-04-18 20:25:14 +00:00
Evan Cheng	d556115e7e	Correct comment. llvm-svn: 49913	2008-04-18 19:25:26 +00:00
Evan Cheng	495a516390	Not safe to "kill" a register if its live range extends pass the end of block branch. llvm-svn: 49911	2008-04-18 19:22:23 +00:00
Dan Gohman	75c895dbc4	Remove the implicit conversion from SDOperandPtr to SDOperand*; this may fix a build error on Visual Studio. llvm-svn: 49876	2008-04-17 23:02:12 +00:00
Evan Cheng	7e4a55bc58	Be more careful with insert_subreg and extract_subreg where either source or destination operand has already been coalesced with another register that's defined by a insert_subreg or extract_subreg. llvm-svn: 49843	2008-04-17 07:58:04 +00:00
Bill Wendling	288ef83b8a	Use correct name for method in comment. llvm-svn: 49841	2008-04-17 05:20:39 +00:00
Dan Gohman	9752a8f3b4	Correct the SrcValue information in the Expand code for va_copy. llvm-svn: 49839	2008-04-17 02:09:26 +00:00
Evan Cheng	c8c3a899c0	Fix a sub-register indice propagation bug. llvm-svn: 49832	2008-04-17 00:06:42 +00:00
Nicolas Geoffray	a7557dfe71	Correlate stubs with functions in JIT: when emitting a stub, the JIT tells the memory manager which function the stub will resolve. llvm-svn: 49814	2008-04-16 20:46:05 +00:00
Evan Cheng	59aa126e48	After reading memory that's already freed. llvm-svn: 49810	2008-04-16 20:24:25 +00:00
Nicolas Geoffray	ae84bbdbed	Infrastructure for getting the machine code size of a function and an instruction. X86, PowerPC and ARM are implemented llvm-svn: 49809	2008-04-16 20:10:13 +00:00
Evan Cheng	23f12757ed	Fix PR2226. Avoid using uninitialized variables. llvm-svn: 49807	2008-04-16 18:48:43 +00:00
Evan Cheng	8dc8a8d8af	Empty basic block should have an empty range. llvm-svn: 49800	2008-04-16 18:01:08 +00:00
Roman Levenstein	a3ee1a38a3	Ongoing work on improving the instruction selection infrastructure: Rename SDOperandImpl back to SDOperand. Introduce the SDUse class that represents a use of the SDNode referred by an SDOperand. Now it is more similar to Use/Value classes. Patch is approved by Dan Gohman. llvm-svn: 49795	2008-04-16 16:15:27 +00:00
Evan Cheng	e45b8f89c5	Rewrite LiveVariable liveness computation. The new implementation is much simplified. It eliminated the nasty recursive routines and removed the partial def / use bookkeeping. There is also potential for performance improvement by replacing the conservative handling of partial physical register definitions. The code is currently disabled until live interval analysis is taught of the name scheme. This patch also fixed a couple of nasty corner cases. llvm-svn: 49784	2008-04-16 09:46:40 +00:00
Evan Cheng	6c17773ccc	Code clean up. llvm-svn: 49783	2008-04-16 09:41:59 +00:00
Evan Cheng	e29e9774a4	Avoid read after free. llvm-svn: 49760	2008-04-16 01:22:28 +00:00
Dan Gohman	82b6673c44	Fix the new scheduler assertion checks to work when the scheduler has inserted no-ops. This fixes the 2006-07-03-schedulers.ll regression on ppc32. llvm-svn: 49747	2008-04-15 22:40:14 +00:00
Nicolas Geoffray	7000c8f1aa	Change Divided flag to Split, as suggested by Evan llvm-svn: 49715	2008-04-15 08:08:50 +00:00
Dan Gohman	4370f26750	Treat EntryToken nodes as "passive" so that they aren't added to the ScheduleDAG; they don't correspond to any actual instructions so they don't need to be scheduled. This fixes a bug where the EntryToken was being scheduled multiple times in some cases, though it ended up not causing any trouble because EntryToken doesn't expand into anything. With this fixed the schedulers reliably schedule the expected number of units, so we can check this with an assertion. This requires a tweak to test/CodeGen/X86/loop-hoist.ll because it ends up getting scheduled differently in a trivial way, though it was enough to fool the prcontext+grep that the test does. llvm-svn: 49701	2008-04-15 01:22:18 +00:00
Dan Gohman	e5f21cea3e	In -view-sunit-dags, display "special" chain dependencies as cyan instead of blue to distinguish them from regular dependencies. llvm-svn: 49696	2008-04-14 23:15:07 +00:00
Dan Gohman	5b61a288a7	Avoid creating MERGE_VALUES nodes for single values. llvm-svn: 49676	2008-04-14 18:43:25 +00:00
Dan Gohman	2505d86783	Fix const-correctness issues with the SrcValue handling in the memory intrinsic expansion code. llvm-svn: 49666	2008-04-14 17:55:48 +00:00
Dale Johannesen	876224b1e8	Reverse sense of unwind-tables option. This means stack tracebacks on Darwin x86-64 won't work by default; nevertheless, everybody but me thinks this is a good idea. llvm-svn: 49663	2008-04-14 17:54:17 +00:00
Nicolas Geoffray	db0ea1ff4e	Fix /test/CodeGen/PowerPC/big-endian-actual-args.ll for linux/ppc32 llvm-svn: 49652	2008-04-14 17:17:14 +00:00
Duncan Sands	6c503f9a65	Initial libcall support for LegalizeTypes. This is much simpler than in LegalizeDAG because calls are not yet expanded into call sequences: that happens after type legalization has finished. llvm-svn: 49634	2008-04-14 06:48:48 +00:00
Duncan Sands	0a8a4c4a0c	LegalizeTypes can sometimes have deleted nodes in its maps. Add some sanity checks that catch this kind of thing. Hopefully these can be removed one day (once all problems are fixed!) but for the moment it seems wise to have them in. llvm-svn: 49612	2008-04-13 16:04:03 +00:00
Nicolas Geoffray	dcc2eda5fc	Add a divided flag for the first piece of an argument divided into mulitple parts. Fixes PR1643 llvm-svn: 49611	2008-04-13 13:40:22 +00:00
Duncan Sands	a07136ee2d	Merge LLVMBuilder and FoldingBuilder, calling the result IRBuilder. Patch by Dominic Hamon. llvm-svn: 49604	2008-04-13 06:22:09 +00:00
Duncan Sands	844d55a42a	Factor some libcall code. llvm-svn: 49583	2008-04-12 17:14:18 +00:00
Dan Gohman	544ab2c50b	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Evan Cheng	305d268ac3	Do not add empty live intervals to handled_. They should never be undone for backtracking. llvm-svn: 49544	2008-04-11 17:55:47 +00:00
Evan Cheng	33281864c1	If a PHI node has a single implicit_def source, replace it with an implicit_def instead of a copy. llvm-svn: 49543	2008-04-11 17:54:45 +00:00
Evan Cheng	499ffa9055	Use of implicit_def is not part of live interval. Create empty intervals for the uses when the live interval is being spilled. llvm-svn: 49542	2008-04-11 17:53:36 +00:00
Gabor Greif	c422383e08	detabify llvm-svn: 49524	2008-04-11 09:34:57 +00:00
Evan Cheng	c6864b6652	Remove implicit_def instructions that become dead as result of coalescing. llvm-svn: 49513	2008-04-10 23:48:35 +00:00
Evan Cheng	2cb98eb4bb	Allow registers defined by implicit_def to be clobbered. llvm-svn: 49512	2008-04-10 23:47:53 +00:00
Evan Cheng	16ea87d6ee	A copy instruction may use a register multiple times on some targets. Change them all. llvm-svn: 49491	2008-04-10 18:38:47 +00:00
Evan Cheng	5e1971c894	Add comment. llvm-svn: 49469	2008-04-10 08:03:14 +00:00
Evan Cheng	9d339849ee	Teach branch folding pass about implicit_def instructions. Unfortunately we can't just eliminate them since register scavenger expects every register use to be defined. However, we can delete them when there are no intra-block uses. Carefully removing some implicit def's which enable more blocks to be optimized away. llvm-svn: 49461	2008-04-10 02:32:10 +00:00
Evan Cheng	c8eeb752a3	- More aggressively coalescing away copies whose source is defined by an implicit_def. - Added insert_subreg coalescing support. llvm-svn: 49448	2008-04-09 20:57:25 +00:00
Evan Cheng	aa3b55f842	Missed a hasInterval check. llvm-svn: 49415	2008-04-09 01:30:15 +00:00
Dale Johannesen	344aec2952	Implement new llc flag -disable-required-unwind-tables. Corresponds to -fno-unwind-tables (usually default in gcc). llvm-svn: 49361	2008-04-08 00:10:24 +00:00
Dan Gohman	3bc3ddd638	Rename MemOperand to MachineMemOperand. This was suggested by review feedback from Chris quite a while ago. No functionality change. llvm-svn: 49348	2008-04-07 19:35:22 +00:00
Roman Levenstein	51f532f92d	Re-commit of the r48822, where the infinite looping problem discovered by Dan Gohman is fixed. llvm-svn: 49330	2008-04-07 10:06:32 +00:00
Chris Lattner	4db1f62d84	Silence warning when no assertions. llvm-svn: 49284	2008-04-06 21:46:45 +00:00
Torok Edwin	613d7afe64	Prefer to expand mask for xor to -1, so we have a chance to turn it into a not. If it cannot be expanded, it will keep the old behaviour and try to shrink the constant. Part of enhancement for PR2191. llvm-svn: 49280	2008-04-06 21:23:02 +00:00
Gabor Greif	e9ecc68d8f	API changes for class Use size reduction, wave 1. Specifically, introduction of XXX::Create methods for Users that have a potentially variable number of Uses. llvm-svn: 49277	2008-04-06 20:25:17 +00:00
Evan Cheng	b5fdc923d3	1. IMPLICIT_DEF can re-define any register. 2. Coalescer can now create an interesting situation where a register def can reaches itself without being killed. llvm-svn: 49246	2008-04-05 01:27:09 +00:00
Dale Johannesen	0ce4a7cc44	Make sure both PendingLoads and PendingExports are flushed before an invoke. Failure to do this causes references in the landing pad to variables that were not set. Fixes g++.dg/eh/delayslot1.C g++.dg/eh/fp-regs.C g++.old-deja/g++.brendan/eh1.C llvm-svn: 49243	2008-04-04 23:48:31 +00:00
Evan Cheng	14bee50e06	Undo PHI elimination copy placement patch. This causes coalescing (performace) issues. llvm-svn: 49198	2008-04-04 01:20:05 +00:00
Evan Cheng	823017fdd1	This is done. llvm-svn: 49197	2008-04-04 01:19:03 +00:00
Andrew Lenharth	bfb7246fb6	if some functions don't have debug info, we were outputing the same label at the start of each of those functions. This makes assemblers unhappy llvm-svn: 49176	2008-04-03 17:37:43 +00:00
Evan Cheng	58936a48ee	- Turn copies of implicit_def into implicit_def instructions. - Be smarter about coalescing copies from implicit_def. llvm-svn: 49168	2008-04-03 16:41:54 +00:00
Evan Cheng	6d07b625aa	Special handling of zero-sized live intervals. llvm-svn: 49167	2008-04-03 16:40:27 +00:00
Evan Cheng	20aed56504	- Treat a live range defined by an implicit_def as a zero-sized one. - Eliminate an implicit_def when it's being spilled. llvm-svn: 49166	2008-04-03 16:39:43 +00:00
Evan Cheng	aacf4f15b3	- PHI elimination also eliminates implicit_def that fits into a PHI node rather than copying it. - Be (slightly) smarter about where to place the copies. llvm-svn: 49165	2008-04-03 16:38:20 +00:00
Evan Cheng	916802a78e	Start of a series of patches related to implicit_def. There is no point in creating a long live range defined by an implicit_def. Scheduler now duplicates implicit_def instruction for each of its uses. Therefore, if an implicit_def node has multiple uses, it will become a number of very short live ranges, rather than a long one. This will make coalescer's job easier. llvm-svn: 49164	2008-04-03 16:36:07 +00:00
Evan Cheng	025cea1126	Backing out 48222 temporarily. llvm-svn: 49124	2008-04-03 03:13:16 +00:00
Dale Johannesen	491557712a	Make EH work with unnamed functions. Reenable running StripSymbols when EH is on. llvm-svn: 49110	2008-04-02 20:10:52 +00:00
Evan Cheng	d8616064d8	Now that I am told MachineRegisterInfo also tracks physical register uses / defs, I can do away with the horribleness I introduced a while back. It's impossible to detect if there is any use of a physical register below an instruction (and before any def of the register) with some cheap book keeping. llvm-svn: 49105	2008-04-02 18:04:08 +00:00
Evan Cheng	be3d44c3cb	Remove #include<map> from LiveVariables.h. Not referenced. llvm-svn: 49099	2008-04-02 17:23:50 +00:00
Dale Johannesen	8780ecbbac	Cosmetic changes per EH patch review feedback. llvm-svn: 49096	2008-04-02 17:04:45 +00:00
Owen Anderson	2412158111	In some situations, we need to check for local interferences between the PHI node and its inputs. llvm-svn: 49070	2008-04-02 03:00:13 +00:00
Owen Anderson	edfc2eb558	Correctly mark a valno that was previous defined by a PHI node as having an unknown defining inst after PHI elimination. llvm-svn: 49069	2008-04-02 02:12:45 +00:00
Dale Johannesen	fd967cf3fa	Recommitting EH patch; this should answer most of the review feedback. -enable-eh is still accepted but doesn't do anything. EH intrinsics use Dwarf EH if the target supports that, and are handled by LowerInvoke otherwise. The separation of the EH table and frame move data is, I think, logically figured out, but either one still causes full EH info to be generated (not sure how to split the metadata correctly). MachineModuleInfo::needsFrameInfo is no longer used and is removed. llvm-svn: 49064	2008-04-02 00:25:04 +00:00
Evan Cheng	985a0b51d7	Re-materialization is for uses only. llvm-svn: 49053	2008-04-01 21:37:32 +00:00
Dale Johannesen	5e4e051c2a	Revert 49006 for the moment. llvm-svn: 49046	2008-04-01 20:00:57 +00:00
Owen Anderson	49dd9f16a9	Don't dereference MBB->end(). llvm-svn: 49043	2008-04-01 18:05:08 +00:00
Evan Cheng	0bd72c5ccd	More soft fp fixes. llvm-svn: 49016	2008-04-01 02:18:22 +00:00
Evan Cheng	4cabe4b452	Pasto. llvm-svn: 49014	2008-04-01 02:00:09 +00:00
Evan Cheng	611abc03ed	Add comment. llvm-svn: 49013	2008-04-01 01:51:26 +00:00
Evan Cheng	86e476b7cb	Unbreak ARM / Thumb soft FP support. llvm-svn: 49012	2008-04-01 01:50:16 +00:00
Dale Johannesen	7d02cf3c9c	Emit exception handling info for functions which are not marked nounwind, or for all functions when -enable-eh is set, provided the target supports Dwarf EH. llvm-gcc generates nounwind in the right places; other FEs will need to do so also. Given such a FE, -enable-eh should no longer be needed. llvm-svn: 49006	2008-03-31 23:40:23 +00:00
Evan Cheng	e4f77c69ac	It's not safe to fold a load from GV stub or constantpool into a two-address use. llvm-svn: 49002	2008-03-31 23:19:51 +00:00
Evan Cheng	ed6e34fe41	Move reMaterialize() from TargetRegisterInfo to TargetInstrInfo. llvm-svn: 48995	2008-03-31 20:40:39 +00:00
Dan Gohman	f549b26254	Fix a DAGCombiner optimization to respect volatile qualification. llvm-svn: 48994	2008-03-31 20:32:52 +00:00
Evan Cheng	73d7c3bfba	The support for remat of instructions with a register operand is hackish, to say the least. Since the register operand guaranteed to be PIC base and that it is already live at all uses, we are making sure it will not be spilled after its uses are rematerialized for both performance and correctness reasons. llvm-svn: 48976	2008-03-31 07:53:30 +00:00
Owen Anderson	f28fc71c93	Fix a major bug in the DFS calculation. Thanks for Christopher Lamb for pointing this out. llvm-svn: 48973	2008-03-31 01:39:20 +00:00
Chris Lattner	0f760dfe09	Fix "Control reaches the end of non-void function" warnings, patch by David Chisnall. llvm-svn: 48963	2008-03-30 18:22:13 +00:00
Evan Cheng	16d72072df	Cosmetic changes. llvm-svn: 48947	2008-03-29 18:34:22 +00:00
Owen Anderson	8b22873bdd	Remove some unneeded code for LiveInterval joining, and fix a bug in the Phi elimination algorithm where we were accidentally reasoning about the source rather than the destination. llvm-svn: 48936	2008-03-29 01:58:47 +00:00
Chris Lattner	a148acdc82	ifdef out a dead function. Should this be removed? llvm-svn: 48916	2008-03-28 15:36:27 +00:00
Duncan Sands	35c7cdac07	Rename getAnyLoad to getLoad is suggested by Evan. llvm-svn: 48914	2008-03-28 09:45:24 +00:00
Evan Cheng	87bac50d7b	New entry. llvm-svn: 48908	2008-03-28 06:34:23 +00:00
Duncan Sands	f740509e58	Implement LegalizeTypes support for softfloat LOAD. In order to handle indexed nodes I had to introduce a new constructor, and since I was there I factorized the code in the various load constructors. llvm-svn: 48894	2008-03-27 20:23:40 +00:00
Dan Gohman	cad51cb671	Avoid creating chain dependencies from CopyToReg nodes to load and store nodes. This doesn't currently have much impact the generated code, but it does produce simpler-looking SelectionDAGs, and consequently simpler-looking ScheduleDAGs, because there are fewer spurious dependencies. In particular, CopyValueToVirtualRegister now uses the entry node as the input chain dependency for new CopyToReg nodes instead of calling getRoot and depending on the most recent memory reference. Also, rename UnorderedChains to PendingExports and pull it up from being a local variable in SelectionDAGISel::BuildSelectionDAG to being a member variable of SelectionDAGISel, so that it doesn't have to be passed around to all the places that need it. llvm-svn: 48893	2008-03-27 19:56:19 +00:00
Roman Levenstein	30d09518b5	Fix spelling. Thanks, Duncan! :-) llvm-svn: 48873	2008-03-27 09:44:37 +00:00
Roman Levenstein	bc674501ba	Speed-up the SumOfUnscheduledPredsOfSuccs by introducing a new function called LimitedSumOfUnscheduledPredsOfSuccs. It terminates the computation after a given treshold is reached. This new function is always faster, but brings real wins only on bigger test-cases. The old function SumOfUnscheduledPredsOfSuccs is left in-place for now and therefore a warning about an unused static function is produced. llvm-svn: 48872	2008-03-27 09:14:57 +00:00
Evan Cheng	5832410d77	Fix a memory bug: increment an iterator of a deleted machine instr. llvm-svn: 48853	2008-03-27 01:27:25 +00:00
Dale Johannesen	87c6ada5de	Fix a bug in Darwin EH: FDE->CIE pointer must be relocatable. Describe why .set is needed better. llvm-svn: 48848	2008-03-26 23:31:39 +00:00
Evan Cheng	db390694ff	One more coalescer fix wrt deadness propagation. llvm-svn: 48837	2008-03-26 20:15:49 +00:00
Evan Cheng	289ba4f335	Avoid commuting a def MI in order to coalesce a copy instruction away if any use of the same val# is a copy instruction that has already been coalesced. llvm-svn: 48833	2008-03-26 19:03:01 +00:00
Roman Levenstein	358e04a185	Use a linked data structure for the uses lists of an SDNode, just like LLVM Value/Use does and MachineRegisterInfo/MachineOperand does. This allows constant time for all uses list maintenance operations. The idea was suggested by Chris. Reviewed by Evan and Dan. Patch is tested and approved by Dan. On normal use-cases compilation speed is not affected. On very big basic blocks there are compilation speedups in the range of 15-20% or even better. llvm-svn: 48822	2008-03-26 12:39:26 +00:00
Roman Levenstein	733a4d6e85	Fixed some spelling errors. Thanks, Duncan! llvm-svn: 48819	2008-03-26 11:23:38 +00:00
Roman Levenstein	7e71b4baaf	Some improvements related to the computation of isReachable. This fixes Bugzilla #1835 (http://llvm.org/bugs/show_bug.cgi?id=1835). This patched is reviewed by Tanya and Dan. Dan tested and approved it. The reason for the bad performance of the old algorithm is that it is very naive and scans every time all nodes of the DAG in the worst case. This patch introduces a new algorithm based on the paper "Online algorithms for maintaining the topological order of a directed acyclic graph" by David J.Pearce and Paul H.J.Kelly. This is the MNR algorithm. It has a linear time worst-case and performs much better in most situations. The paper can be found here: http://fano.ics.uci.edu/cites/Document/Online-algorithms-for-maintaining-the-topological-order-of-a-directed-acyclic-graph.html The main idea of the new algorithm is to compute the topological ordering of the SNodes in the DAG and to maintain it even after DAG modifications. The topological ordering allows for very fast node reachability checks. Tests on very big input files with tens of thousands of instructions in a BB indicate huge speed-ups (up to 10x compilation time improvement) compared to the old version. llvm-svn: 48817	2008-03-26 09:18:09 +00:00
Owen Anderson	5d2d1776e0	Dead PHI instructions need to be handled specially. llvm-svn: 48811	2008-03-26 03:03:23 +00:00
Owen Anderson	9f129318dc	Remove some debugging code. llvm-svn: 48803	2008-03-25 22:26:43 +00:00
Owen Anderson	1d46d45e35	StrongPHIElimination doesn't support swapping live intervals like the coalescer does. llvm-svn: 48802	2008-03-25 22:25:27 +00:00
Dan Gohman	bdc24adaaf	A quick nm audit turned up several fixed tables and objects that were marked read-write. Use const so that they can be allocated in a read-only segment. llvm-svn: 48800	2008-03-25 21:45:14 +00:00
Dan Gohman	a7ba51f6ec	Avoid outputing spaces at the ends of lines. llvm-svn: 48797	2008-03-25 21:38:12 +00:00
Devang Patel	72cfe84f05	Do not align loops if optimizing for size. llvm-svn: 48794	2008-03-25 21:03:02 +00:00
Evan Cheng	df1690dc7c	Handle a special case xor undef, undef -> 0. Technically this should be transformed to undef. But this is such a common idiom (misuse) we are going to handle it. llvm-svn: 48792	2008-03-25 20:08:07 +00:00
Dan Gohman	fd227e9c3a	Fix typos. llvm-svn: 48779	2008-03-25 17:10:29 +00:00
Evan Cheng	7d564c3b4a	lastRegisterUse() should ignore identity copies. Those will be erased. llvm-svn: 48759	2008-03-25 02:02:19 +00:00
Evan Cheng	fe7610f37f	Remove an unneeded test. llvm-svn: 48755	2008-03-24 23:55:16 +00:00
Evan Cheng	69a3f9c417	If the coalescer commuted a def MI to allow coalescing, it can changed a previously coalesced copy into an non-identity copy. llvm-svn: 48752	2008-03-24 23:31:21 +00:00
Evan Cheng	6e225173c5	Add an assertion to catch register of illegal class. llvm-svn: 48751	2008-03-24 23:28:21 +00:00
Owen Anderson	e2707768a4	Remove #include<iostream>, which I was using for debugging. llvm-svn: 48739	2008-03-24 20:36:47 +00:00
Dan Gohman	d8ea040c31	APIntify SelectionDAG's EXTRACT_ELEMENT code. llvm-svn: 48726	2008-03-24 16:38:05 +00:00
Owen Anderson	200e57840e	Be sure to remove intervals after we've joined them. Also, remove some duplicated code. With this pass, StrongPHIElim can compile very simple testcases correctly. There's still a ways to go before it's ready for prime time, though. llvm-svn: 48719	2008-03-24 04:11:27 +00:00
Anton Korobeynikov	2fa75184f3	Another comments fixing llvm-svn: 48683	2008-03-22 07:53:40 +00:00
Evan Cheng	31604a62f6	Teach DAG combiner to commute commutable binary nodes in order to achieve sdisel CSE. llvm-svn: 48673	2008-03-22 01:55:50 +00:00
Dan Gohman	9988569af8	Don't include <map> in Pass.h, which doesn't need it. This requires adding <map> to many files that actually do need it. llvm-svn: 48667	2008-03-21 23:51:57 +00:00
Dan Gohman	30e44a4b40	Fix -view-sunit-dags to support cross-rc-copy nodes. llvm-svn: 48664	2008-03-21 22:51:06 +00:00
Evan Cheng	8c19af1b7e	A couple of kill marker maintainence bug. llvm-svn: 48653	2008-03-21 19:09:30 +00:00
Duncan Sands	d97eea372a	Introduce a new node for holding call argument flags. This is needed by the new legalize types infrastructure which wants to expand the 64 bit constants previously used to hold the flags on 32 bit machines. There are two functional changes: (1) in LowerArguments, if a parameter has the zext attribute set then that is marked in the flags; before it was being ignored; (2) PPC had some bogus code for handling two word arguments when using the ELF 32 ABI, which was hard to convert because of the bogusness. As suggested by the original author (Nicolas Geoffray), I've disabled it for the moment. Tested with "make check" and the Ada ACATS testsuite. llvm-svn: 48640	2008-03-21 09:14:45 +00:00
Christopher Lamb	3e9f49716e	Check even more carefully before applying this DAGCombine transform. llvm-svn: 48580	2008-03-20 04:31:39 +00:00
Evan Cheng	7a3e750fd2	Fix this xform: (sra (shl X, m), result_size) -> (sign_extend (trunc (shl X, result_size - n - m))) llvm-svn: 48578	2008-03-20 02:18:41 +00:00
Chris Lattner	a7cca362af	detabify llvm, patch by Mike Stump! llvm-svn: 48577	2008-03-20 01:22:40 +00:00
Christopher Lamb	8fe9109469	Fix X86's isTruncateFree to not claim that truncate to i1 is free. This fixes Bill's testcase that failed for r48491. llvm-svn: 48542	2008-03-19 08:30:06 +00:00
Evan Cheng	56e9e57d28	Fixed a coalescer bug caused by a typo. llvm-svn: 48526	2008-03-19 02:26:36 +00:00
Evan Cheng	44c0b4f754	Fix live variables issues: 1. If part of a register is re-defined, an implicit kill and an implicit def are added to denote read / mod / write. However, this should only be necessary if the register is actually read later. This is a performance issue. 2. If a sub-register is being defined, and it doesn't have a previous use, do not add a implicit kill to the last use of a super-register: = EAX, AX<imp-use,kill> ... AX = In this case, EAX is live but AX is killed, this is wrong and will cause the coalescer to do bad things. llvm-svn: 48521	2008-03-19 00:52:20 +00:00
Bill Wendling	efb4d9ef80	Temporarily revert r48491. It's breaking test/CodeGen/X86/xorl.ll. llvm-svn: 48510	2008-03-18 22:29:51 +00:00
Dale Johannesen	12c76db312	Make conversions of i8/i16 to ppcf128 work. llvm-svn: 48493	2008-03-18 17:28:38 +00:00
Christopher Lamb	3e408d4d82	Target independent DAG transform to use truncate for field extraction + sign extend on targets where this is profitable. Passes nightly on x86-64. llvm-svn: 48491	2008-03-18 16:46:39 +00:00
Evan Cheng	d096ec0a86	Rewrite code that propagate isDead information after a dead copy is coalesced. This remove some ugly spaghetti code and fixed a number of subtle bugs. llvm-svn: 48490	2008-03-18 08:26:47 +00:00
Owen Anderson	488e645938	A first attempt at updating live intervals, with code lifted from the coalescer. This doesn't really work, but gets us farther than before. llvm-svn: 48446	2008-03-17 06:08:26 +00:00
Christopher Lamb	d3d0ad3f58	Make insert_subreg a two-address instruction, vastly simplifying LowerSubregs pass. Add a new TII, subreg_to_reg, which is like insert_subreg except that it takes an immediate implicit value to insert into rather than a register. llvm-svn: 48412	2008-03-16 03:12:01 +00:00
Evan Cheng	ec7533b620	Remove isImplicitDef TargetInstrDesc flag. llvm-svn: 48381	2008-03-15 00:19:36 +00:00
Evan Cheng	0e7b00d79f	Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF. llvm-svn: 48380	2008-03-15 00:03:38 +00:00
Duncan Sands	858e6385f7	Do not generate special entries in the dwarf eh table for nounwind calls. llvm-svn: 48373	2008-03-14 21:36:24 +00:00
Evan Cheng	84aec09fdb	Fix PR2138. Apparently any modification to a std::multimap (including remove entries for a different key) can invalidate multimap iterators. llvm-svn: 48371	2008-03-14 20:44:01 +00:00
Duncan Sands	a06e4f3050	Simplify using getIntPtrConstant. llvm-svn: 48355	2008-03-14 05:23:57 +00:00
Nate Begeman	63eb03f800	Tabs -> spaces Use getIntPtrConstant in a couple places to shorten stuff up Handle splitting vector shuffles with undefs in the mask llvm-svn: 48351	2008-03-14 00:53:31 +00:00
Evan Cheng	db443ca377	Livein copy scheduling fixes: do not coalesce physical register copies, correctly determine the safe location to insert the copies. llvm-svn: 48348	2008-03-14 00:14:55 +00:00
Dan Gohman	b72127ac4c	More APInt-ification. llvm-svn: 48344	2008-03-13 22:13:53 +00:00
Evan Cheng	e21a68bca7	Undo tweak. It had no obvious benefit. llvm-svn: 48341	2008-03-13 17:42:48 +00:00
Evan Cheng	57bb088542	Typo. llvm-svn: 48337	2008-03-13 08:04:35 +00:00
Evan Cheng	8f8a8b28e9	Don't try to sink 3-address instruction if convertToThreeAddress created more than one instructions. llvm-svn: 48336	2008-03-13 07:56:58 +00:00
Evan Cheng	21449c76bc	Remove an unused command line option. llvm-svn: 48334	2008-03-13 06:38:28 +00:00
Evan Cheng	5c26bde55e	TwoAddressInstructionPass enhancement. After it converts a two address instruction into a 3-address one, sink it past the instruction that kills the read-mod-write register if its definition is used past the kill. This reduces the number of live register by one. llvm-svn: 48333	2008-03-13 06:37:55 +00:00
Christopher Lamb	dd55d3f1b2	Get rid of a pseudo instruction and replace it with subreg based operation on real instructions, ridding the asm printers of the hack used to do this previously. In the process, update LowerSubregs to be careful about eliminating copies that have side affects. Note: the coalescer will have to be careful about this too, when it starts coalescing insert_subreg nodes. llvm-svn: 48329	2008-03-13 05:47:01 +00:00
Evan Cheng	4f610c0de1	Remove unused options. llvm-svn: 48319	2008-03-13 02:41:34 +00:00
Evan Cheng	399e1101ba	Refactor some code out of MachineSink into a MachineInstr query. llvm-svn: 48311	2008-03-13 00:44:09 +00:00
Evan Cheng	65e9d5f1a8	Experimental scheduler change to schedule / coalesce the copies added for function livein's. Take 2008-03-10-RegAllocInfLoop.ll, the schedule looks like this after these copies are inserted: entry: 0x12049d0, LLVM BB @0x1201fd0, ID#0: Live Ins: %EAX %EDX %ECX %reg1031<def> = MOVPC32r 0 %reg1032<def> = ADD32ri %reg1031, <es:_GLOBAL_OFFSET_TABLE_>, %EFLAGS<imp-def> %reg1028<def> = MOV32rr %EAX %reg1029<def> = MOV32rr %EDX %reg1030<def> = MOV32rr %ECX %reg1027<def> = MOV8rm %reg0, 1, %reg0, 0, Mem:LD(1,1) [0x1201910 + 0] %reg1025<def> = MOV32rr %reg1029 %reg1026<def> = MOV32rr %reg1030 %reg1024<def> = MOV32rr %reg1028 The copies unnecessarily increase register pressure and it will end up requiring a physical register to be spilled. With -schedule-livein-copies: entry: 0x12049d0, LLVM BB @0x1201fa0, ID#0: Live Ins: %EAX %EDX %ECX %reg1031<def> = MOVPC32r 0 %reg1032<def> = ADD32ri %reg1031, <es:_GLOBAL_OFFSET_TABLE_>, %EFLAGS<imp-def> %reg1024<def> = MOV32rr %EAX %reg1025<def> = MOV32rr %EDX %reg1026<def> = MOV32rr %ECX %reg1027<def> = MOV8rm %reg0, 1, %reg0, 0, Mem:LD(1,1) [0x12018e0 + 0] Much better! llvm-svn: 48307	2008-03-12 22:19:41 +00:00
Duncan Sands	723849a17f	Initial soft-float support for LegalizeTypes. I rewrote the fcopysign expansion from LegalizeDAG to get rid of what seems to be a bug: the use of sign extension means that when copying the sign bit from an f32 to an f64, the upper 32 bits of the f64 (now an i64) are set, not just the top bit... I also generalized it to work for any sized floating point types, and removed the bogosity: SDOperand Mask1 = (SrcVT == MVT::f64) ? DAG.getConstantFP(BitsToDouble(1ULL << 63), SrcVT) : DAG.getConstantFP(BitsToFloat(1U << 31), SrcVT); Mask1 = DAG.getNode(ISD::BIT_CONVERT, SrcNVT, Mask1); (here SrcNVT is an integer with the same size as SrcVT). As far as I can see this takes a 1 << 63, converts to a double, converts that to a floating point constant then converts that to an integer constant, ending up with... 1 << 63 as an integer constant! So I just generate this integer constant directly. llvm-svn: 48305	2008-03-12 21:27:04 +00:00
Dan Gohman	34ae72c435	Change VirtRegMap's dump to dump to cerr, not DOUT, so that it can be called from within a debuger without having -debug specified on the command-line. llvm-svn: 48298	2008-03-12 20:52:10 +00:00
Dan Gohman	bf68f9fd8d	Fix typos in comments. llvm-svn: 48297	2008-03-12 20:50:04 +00:00
Duncan Sands	c54fe97f08	Fix typo. llvm-svn: 48295	2008-03-12 20:35:19 +00:00
Duncan Sands	87de65fc29	Don't try to extract an i32 from an f64. This getCopyToParts problem was noticed by the new LegalizeTypes infrastructure. In order to avoid this kind of thing in the future I've added a check that EXTRACT_ELEMENT is only used with integers. Once LegalizeTypes is up and running most likely BUILD_PAIR and EXTRACT_ELEMENT can be removed, in favour of using apints instead. llvm-svn: 48294	2008-03-12 20:30:08 +00:00
Evan Cheng	99ee78ef63	Clean up my own mess. X86 lowering normalize vector 0 to v4i32. However DAGCombine can fold (sub x, x) -> 0 after legalization. It can create a zero vector of a type that's not expected (e.g. v8i16). We don't want to disable the optimization since leaving a (sub x, x) is really bad. Add isel patterns for other types of vector 0 to ensure correctness. It's highly unlikely to happen other than in bugpoint reduced test cases. llvm-svn: 48279	2008-03-12 07:02:50 +00:00
Owen Anderson	944b1c76ab	We also need to collect the VN IDs for the PHI instructions for later updating. llvm-svn: 48278	2008-03-12 04:22:57 +00:00
Owen Anderson	70aaab6dc5	When we're determining what registers to coallesce, track the VNInfo IDs for the definitions that feed the PHI instructions. We'll need these IDs in order to update LiveIntervals properly. llvm-svn: 48277	2008-03-12 03:13:29 +00:00
Evan Cheng	0903aef2ff	Total brain cramp. llvm-svn: 48274	2008-03-12 02:05:05 +00:00
Evan Cheng	105cb3988b	Set NextMII after issuing a physical register spill. llvm-svn: 48263	2008-03-12 00:14:07 +00:00
Evan Cheng	b398635456	Minor debug output bug. llvm-svn: 48261	2008-03-12 00:02:46 +00:00
Anton Korobeynikov	e8fa50f63a	Correctly propagate thread-local flag from aliasee to alias. This fixes PR2137 llvm-svn: 48257	2008-03-11 22:38:53 +00:00
Dan Gohman	24570836b2	Use PassManagerBase instead of FunctionPassManager for functions that merely add passes. This allows them to be used with either FunctionPassManager or PassManager, or even with a custom new kind of pass manager. llvm-svn: 48256	2008-03-11 22:29:46 +00:00
Anton Korobeynikov	2601d7ee50	Honour aliases visibility during asm emission llvm-svn: 48249	2008-03-11 21:41:14 +00:00
Evan Cheng	a3891365b5	Transfer physical register spill info when load / store folding happens. llvm-svn: 48246	2008-03-11 21:34:46 +00:00
Dan Gohman	44b4c07cd1	Use the correct value for InSignBit. llvm-svn: 48245	2008-03-11 21:29:43 +00:00
Dan Gohman	1351025a91	Initial codegen support for functions and calls with multiple return values. llvm-svn: 48244	2008-03-11 21:11:25 +00:00
Christopher Lamb	aa7c2105de	Recommitting parts of r48130. These do not appear to cause the observed failures. llvm-svn: 48223	2008-03-11 10:09:17 +00:00
Evan Cheng	d54660aeed	Use TargetRegisterInfo::getPhysicalRegisterRegClass. Remove duplicated code. llvm-svn: 48221	2008-03-11 07:55:13 +00:00
Evan Cheng	e88a625ecd	When the register allocator runs out of registers, spill a physical register around the def's and use's of the interval being allocated to make it possible for the interval to target a register and spill it right away and restore a register for uses. This likely generates terrible code but is before than aborting. llvm-svn: 48218	2008-03-11 07:19:34 +00:00
Duncan Sands	b29f93613d	Some LegalizeTypes code factorization and minor enhancements. llvm-svn: 48215	2008-03-11 06:41:14 +00:00
Chris Lattner	5c7bda440f	compile: double test() {} into: _test: fldz ret instead of: _test: subl $12, %esp #IMPLICIT_DEF %xmm0 movsd %xmm0, (%esp) fldl (%esp) addl $12, %esp ret llvm-svn: 48213	2008-03-11 06:21:08 +00:00
Chris Lattner	3e0ec65678	variadic instructions don't have operand info for variadic arguments. llvm-svn: 48208	2008-03-11 03:14:42 +00:00
Dan Gohman	d6819da453	Generalize ExpandIntToFP to handle the case where the operand is legal and it's the result that requires expansion. This code is a little confusing because the TargetLoweringInfo tables for [US]INT_TO_FP use the operand type (the integer type) rather than the result type. llvm-svn: 48206	2008-03-11 01:59:03 +00:00
Chris Lattner	d3090bcfc8	If a register operand comes from the variadic part of a node, don't verify the register constraint matches what the instruction expects. llvm-svn: 48205	2008-03-11 00:59:28 +00:00
Evan Cheng	850e143cbf	Temporarily revert 48175. llvm-svn: 48204	2008-03-11 00:27:34 +00:00
Dan Gohman	10f7d850cf	More APInt-ification. llvm-svn: 48201	2008-03-11 00:11:06 +00:00
Dan Gohman	2a3aeb1f72	Correctly clone FlaggedNodes. llvm-svn: 48196	2008-03-10 23:48:14 +00:00
Dan Gohman	830d86cab8	APInt-ify this. llvm-svn: 48194	2008-03-10 23:38:17 +00:00
Dan Gohman	f4300950f1	Implement more support for fp-to-i128 and i128-to-fp conversions. llvm-svn: 48189	2008-03-10 23:03:31 +00:00
Evan Cheng	7abdb438a1	If the register allocator ran out of registers, just abort for now. llvm-svn: 48175	2008-03-10 21:27:20 +00:00
Dan Gohman	272e234477	Fix mul expansion to check the correct number of bits for zero extension when checking if an unsigned multiply is safe. llvm-svn: 48171	2008-03-10 20:42:19 +00:00
Evan Cheng	b9e4280e94	Somewhat better solution. llvm-svn: 48170	2008-03-10 19:58:22 +00:00
Evan Cheng	ae2c56d93e	Default ISD::PREFETCH to expand. llvm-svn: 48169	2008-03-10 19:38:10 +00:00
Evan Cheng	d4e1d9eeb2	Revert 48125, 48126, and 48130 for now to unbreak some x86-64 tests. llvm-svn: 48167	2008-03-10 19:31:26 +00:00
Scott Michel	a6729e8666	Give TargetLowering::getSetCCResultType() a parameter so that ISD::SETCC's return ValueType can depend its operands' ValueType. This is a cosmetic change, no functionality impacted. llvm-svn: 48145	2008-03-10 15:42:14 +00:00
Bill Wendling	2823eaebe8	Minor cleanup. No functionality change. llvm-svn: 48142	2008-03-10 08:13:01 +00:00
Evan Cheng	4a3c5eab34	- Fix a subtle bug in RemoveCopyByCommutingDef. ALR is the live range where the source is defined; BLR is the live range which is defined by the copy. If ALR and BLR overlaps and end of BLR extends beyond end of ALR, e.g. A = or A, B ... B = A ... C = A<kill> ... = B then do not add kills of A to the newly created B interval. - Also fix some kill info update bug. llvm-svn: 48141	2008-03-10 08:11:32 +00:00
Evan Cheng	831ae49599	Doh llvm-svn: 48140	2008-03-10 07:59:01 +00:00
Owen Anderson	75d04819a6	Move StrongPHIElimination after live interval analysis. This will make things happier down the road. llvm-svn: 48138	2008-03-10 07:22:36 +00:00
Evan Cheng	b5d11980d9	Avoid creating BUILD_VECTOR of all zero elements of "non-normalized" type (e.g. v8i16 on x86) after legalizer. Instruction selection does not expect to see them. In all likelihood this can only be an issue in a bugpoint reduced test case. llvm-svn: 48136	2008-03-10 07:19:13 +00:00
Christopher Lamb	4ba3f0430b	Allow insert_subreg into implicit, target-specific values. Change insert/extract subreg instructions to be able to be used in TableGen patterns. Use the above features to reimplement an x86-64 pseudo instruction as a pattern. llvm-svn: 48130	2008-03-10 06:12:08 +00:00
Dale Johannesen	4e622ec86d	Increase ISD::ParamFlags to 64 bits. Increase the ByValSize field to 32 bits, thus enabling correct handling of ByVal structs bigger than 0x1ffff. Abstract interface a bit. Fixes gcc.c-torture/execute/pr23135.c and gcc.c-torture/execute/pr28982b.c in gcc testsuite (were ICE'ing on ppc32, quietly producing wrong code on x86-32.) llvm-svn: 48122	2008-03-10 02:17:22 +00:00
Chris Lattner	4c4234b59c	remove an extraneous (and ugly) default argument, thanks Duncan. llvm-svn: 48117	2008-03-09 20:04:36 +00:00
Chris Lattner	ce5f841bb5	fp_round's produced by getCopyFromParts should always be exact, because they are produced by calls (which are known exact) and by cross block copies which are known to be produced by extends. This improves: define double @test2() { %tmp85 = call double asm sideeffect "fld0", "={st(0)}"() ret double %tmp85 } from: _test2: subl $20, %esp # InlineAsm Start fld0 # InlineAsm End fstpl 8(%esp) movsd 8(%esp), %xmm0 movsd %xmm0, (%esp) fldl (%esp) addl $20, %esp #FP_REG_KILL ret to: _test2: # InlineAsm Start fld0 # InlineAsm End #FP_REG_KILL ret by avoiding a f64 <-> f80 trip llvm-svn: 48108	2008-03-09 09:38:46 +00:00
Chris Lattner	86829f0ff7	teach X86InstrInfo::copyRegToReg how to copy into ST(0) from an RFP register class. Teach ScheduleDAG how to handle CopyToReg with different src/dst reg classes. This allows us to compile trivial inline asms that expect stuff on the top of x87-fp stack. llvm-svn: 48107	2008-03-09 09:15:31 +00:00
Chris Lattner	9e07537e8c	Add ScheduleDAG support for copytoreg where the src/dst register are in different register classes, e.g. copy of ST(0) to RFP*. This gets some really trivial inline asm working that plops things on the top of stack (PR879) llvm-svn: 48105	2008-03-09 08:49:15 +00:00
Chris Lattner	381bbdb924	fix 80 col violation llvm-svn: 48100	2008-03-09 07:51:01 +00:00
Chris Lattner	83b3473dd8	extend fp values with FP_EXTEND not FP_ROUND. llvm-svn: 48097	2008-03-09 07:47:22 +00:00
Chris Lattner	322c826c9d	Fix two problems in SelectionDAGLegalize::ExpandBUILD_VECTOR's handling of BUILD_VECTORS that only have two unique elements: 1. The previous code was nondeterminstic, because it walked a map in SDOperand order, which isn't determinstic. 2. The previous code didn't handle the case when one element was undef very well. Now we ensure that the generated shuffle mask has the undef vector on the RHS (instead of potentially being on the LHS) and that any elements that refer to it are themselves undef. This allows us to compile CodeGen/X86/vec_set-9.ll into: _test3: movd %rdi, %xmm0 punpcklqdq %xmm0, %xmm0 ret instead of: _test3: movd %rdi, %xmm1 #IMPLICIT_DEF %xmm0 punpcklqdq %xmm1, %xmm0 ret ... saving a register. llvm-svn: 48060	2008-03-09 00:29:42 +00:00
Chris Lattner	a1f25b0020	Teach SD some vector identities, allowing us to compile vec_set-9 into: _test3: movd %rdi, %xmm1 #IMPLICIT_DEF %xmm0 punpcklqdq %xmm1, %xmm0 ret instead of: _test3: #IMPLICIT_DEF %rax movd %rax, %xmm0 movd %rdi, %xmm1 punpcklqdq %xmm1, %xmm0 ret This is still not ideal. There is no reason to two xmm regs. llvm-svn: 48058	2008-03-08 23:43:36 +00:00
Evan Cheng	95cf661534	Implement x86 support for @llvm.prefetch. It corresponds to prefetcht{0\|1\|2} and prefetchnta instructions. llvm-svn: 48042	2008-03-08 00:58:38 +00:00
Bill Wendling	d6951455e4	Something that kills a super-register also kills the sub-register. llvm-svn: 48038	2008-03-07 23:45:15 +00:00
Evan Cheng	39a3221e27	Fixed a register scavenger bug. If a def is re-defining part of a super register, there must be an implicit def of the super-register on the MI. llvm-svn: 48024	2008-03-07 20:12:54 +00:00
Bill Wendling	55bfd8c3f7	When setting the "unused" info, take into account something like this: %r3<def> = OR %x3<kill>, %x3 We don't want to mark the %r3 as unused even though it's a sub-register of %x3. llvm-svn: 48003	2008-03-06 23:22:43 +00:00
Evan Cheng	34173f0a43	80 col violation. llvm-svn: 47998	2008-03-06 17:42:34 +00:00
Gabor Greif	636ab19205	some more spelling changes llvm-svn: 47996	2008-03-06 10:51:21 +00:00
Evan Cheng	a3cb090446	Constant fold SIGN_EXTEND_INREG with ashr not lshr. llvm-svn: 47992	2008-03-06 08:20:51 +00:00
Evan Cheng	29b502e0e0	Fix a coalescer bug wrt how dead copy interval is shortened. llvm-svn: 47966	2008-03-05 22:09:42 +00:00
Dale Johannesen	8ee39c61f2	Clarify that CALLSEQ_START..END may not be nested, and add some protection against creating such. llvm-svn: 47957	2008-03-05 19:14:03 +00:00
Chris Lattner	78e9cab229	Generalize FP constant shrinking optimization to apply to any vt except ppc long double. This allows us to shrink constant pool entries for x86 long double constants, which in turn allows us to use flds/fldl instead of fldt. llvm-svn: 47938	2008-03-05 06:48:13 +00:00
Chris Lattner	3dc3899007	Improve comment, pass in the original VT so that we can shrink a long double constant all the way to float, not stopping at double. llvm-svn: 47937	2008-03-05 06:46:58 +00:00
Dan Gohman	da7897c4e1	Codegen support for i128 UINT_TO_FP. This just fixes a bug in r47928 (Int64Ty is the correct type for the constant pool entry here) and removes the asserts, now that the code is capable of handling i128. llvm-svn: 47932	2008-03-05 02:07:31 +00:00
Evan Cheng	0a62cb44ce	Add a target lowering hook to control whether it's worthwhile to compress fp constant. For x86, if sse2 is available, it's not a good idea since cvtss2sd is slower than a movsd load and it prevents load folding. On x87, it's important to shrink fp constant since fldt is very expensive. llvm-svn: 47931	2008-03-05 01:30:59 +00:00
Andrew Lenharth	357061a74d	64bit CAS on 32bit x86. llvm-svn: 47929	2008-03-05 01:15:49 +00:00
Dan Gohman	d9d874b0cd	Codegen support for i128 SINT_TO_FP. llvm-svn: 47928	2008-03-05 01:08:17 +00:00
Evan Cheng	6325446666	Refactor code. Remove duplicated functions that basically do the same thing as findRegisterUseOperandIdx, findRegisterDefOperandIndx. Fix some naming inconsistencies. llvm-svn: 47927	2008-03-05 00:59:57 +00:00
Roman Levenstein	c62c2bb4d0	Some improvements related to the computation of heights, depths of SUnits. The basic idea is that all these algorithms are computing the longest paths from the root node or to the exit node. Therefore the existing implementation that uses and iterative and potentially exponential algorithm was changed to a well-known graph algorithm based on dynamic programming. It has a linear run-time. llvm-svn: 47884	2008-03-04 11:19:43 +00:00
Evan Cheng	38caf77419	Refactor ExpandConstantFP so it can optimize load from constpool of types larger than f64 into extload from smaller types. llvm-svn: 47883	2008-03-04 08:05:30 +00:00
Bill Wendling	2ae707888b	Did I say 'e = getNumOperands()'? I meant --e, of course. llvm-svn: 47875	2008-03-04 00:48:15 +00:00
Evan Cheng	567d2e5b57	Rename isOperand() to isOperandOf() (and other similar methods). It always confuses me. llvm-svn: 47872	2008-03-04 00:41:45 +00:00
Bill Wendling	0e541ea730	Miscellaneous clean-ups based on Evan's feedback: - Cleaned up how the prologue-epilogue inserter loops over the instructions. - Instead of restarting the processing of an instruction if we remove an implicit kill, just update the end iterator and make sure that the iterator isn't incremented. llvm-svn: 47870	2008-03-03 23:57:28 +00:00
Dan Gohman	e1c4f99549	Misc. APInt-ification in the DAGCombiner. llvm-svn: 47869	2008-03-03 23:51:38 +00:00
Dan Gohman	10f34077f1	More APInt-ification. llvm-svn: 47868	2008-03-03 23:35:36 +00:00
Dan Gohman	0e238dc813	Yet more APInt-ification. llvm-svn: 47867	2008-03-03 22:37:52 +00:00
Dan Gohman	2fa65b7997	More APInt-ification. llvm-svn: 47866	2008-03-03 22:22:56 +00:00
Dan Gohman	f2bbfa3ba0	More APInt-ification. llvm-svn: 47864	2008-03-03 22:20:46 +00:00
Bill Wendling	7921ad0d67	Go through the machine instruction's operands to make sure that we're not marking both a super- and sub-register as "killed". This removes implicit uses that are marked as "killed". llvm-svn: 47862	2008-03-03 22:14:33 +00:00
Bill Wendling	528083bc28	Make the register scavenger update the bookkeeping values for sub/super registers. llvm-svn: 47861	2008-03-03 22:12:25 +00:00
Bill Wendling	4836d58f89	Multiple instructions can be inserted when eliminating frame indexes. We need the register scavenger to process all of those new instructions instead of just the last one inserted. llvm-svn: 47860	2008-03-03 22:11:16 +00:00
Andrew Lenharth	d032c33300	all but CAS working on x86 llvm-svn: 47798	2008-03-01 21:52:34 +00:00
Dale Johannesen	208cc8f1b9	Add MVT::is128BitVector and is64BitVector. Shrink unaligned load/store code using them. Per review of unaligned load/store vector patch. llvm-svn: 47782	2008-03-01 03:40:57 +00:00
Evan Cheng	73bdf043a1	Refactor / clean up code; remove td list scheduler special tie breaker (no real benefit). llvm-svn: 47779	2008-03-01 00:39:47 +00:00
Evan Cheng	26edb59d97	Don't fill eh frames even though these are text sections. llvm-svn: 47765	2008-02-29 19:36:59 +00:00
Bill Wendling	811153a551	If we reload a virtual register that's already been assigned, we want to mark that instruction as its "last use". This fixes PR1925. llvm-svn: 47758	2008-02-29 18:52:01 +00:00
Evan Cheng	2e26dc8051	Fix PR2112: don't run loop aligner if target doesn't have a TargetLowering object. llvm-svn: 47755	2008-02-29 17:52:15 +00:00
Evan Cheng	ca7c61e79a	No need for coalescer to update kills. Only copies are coalesced and those instructions will be deleted. Doh. llvm-svn: 47749	2008-02-29 02:50:03 +00:00
Evan Cheng	88f839944d	Remove redundant #include. llvm-svn: 47748	2008-02-29 02:49:15 +00:00
Dan Gohman	bd2fa566e4	More APInt-ification. llvm-svn: 47746	2008-02-29 01:47:35 +00:00
Dan Gohman	837a6dccd7	Use the new convertFromAPInt instead of convertFromZeroExtendedInteger, which allows more of the surrounding arithmetic to be done with APInt instead of uint64_t. llvm-svn: 47745	2008-02-29 01:44:25 +00:00
Dan Gohman	ec6be4a782	Use the new APInt-enabled form of getConstant instead of converting an APInt into a uint64_t to call getConstant. llvm-svn: 47742	2008-02-29 01:41:59 +00:00
Evan Cheng	95a7be473c	Added option -align-loops=<true/false> to disable loop aligner pass. llvm-svn: 47736	2008-02-28 23:29:57 +00:00
Dale Johannesen	cbde4c2206	Interface of getByValTypeAlignment differed between generic & x86 versions; change generic to follow x86 and improve comments. Add PPC version (not right for non-Darwin.) llvm-svn: 47734	2008-02-28 22:31:51 +00:00
Dale Johannesen	c4c3de2b52	Fix an assertion message. llvm-svn: 47722	2008-02-28 18:36:51 +00:00
Evan Cheng	a465bfb87c	Keep track how many commutes are performed by the scheduler. llvm-svn: 47710	2008-02-28 07:40:24 +00:00
Chris Lattner	9824ffef0c	implement expand for ISD::DECLARE by just deleting it. llvm-svn: 47708	2008-02-28 05:53:40 +00:00
Evan Cheng	c799065cc3	Add a quick and dirty "loop aligner pass". x86 uses it to align its loops to 16-byte boundaries. llvm-svn: 47703	2008-02-28 00:43:03 +00:00
Dale Johannesen	bf76a08e7c	Handle load/store of misaligned vectors that are the same size as an int type by doing a bitconvert of load/store of the int type (same algorithm as floating point). This makes them work for ppc Altivec. There was some code that purported to handle loads of (some) vectors by splitting them into two smaller vectors, but getExtLoad rejects subvector loads, so this could never have worked; the patch removes it. llvm-svn: 47696	2008-02-27 22:36:00 +00:00
Evan Cheng	fdc732ab9a	Fix a bug in dead spill slot elimination. llvm-svn: 47687	2008-02-27 19:57:11 +00:00
Dan Gohman	e5e32ec8f7	Remove the `else', at Evan's insistence. llvm-svn: 47686	2008-02-27 19:44:57 +00:00
Duncan Sands	ef40c5b204	Add a FIXME about the VECTOR_SHUFFLE evil hack. llvm-svn: 47676	2008-02-27 17:39:13 +00:00
Duncan Sands	e158a82f26	LegalizeTypes support for EXTRACT_VECTOR_ELT. The approach taken is different to that in LegalizeDAG when it is a question of expanding or promoting the result type: for example, if extracting an i64 from a <2 x i64>, when i64 needs expanding, it bitcasts the vector to <4 x i32>, extracts the appropriate two i32's, and uses those for the Lo and Hi parts. Likewise, when extracting an i16 from a <4 x i16>, and i16 needs promoting, it bitcasts the vector to <2 x i32>, extracts the appropriate i32, twiddles the bits if necessary, and uses that as the promoted value. This puts more pressure on bitcast legalization, and I've added the appropriate cases. They needed to be added anyway since users can generate such bitcasts too if they want to. Also, when considering various cases (Legal, Promote, Expand, Scalarize, Split) it is a pain that expand can correspond to Expand, Scalarize or Split, so I've changed the LegalizeTypes enum so it lists those different cases - now Expand only means splitting a scalar in two. The code produced is the same as by LegalizeDAG for all relevant testcases, except for 2007-10-31-extractelement-i64.ll, where the code seems to have improved (see below; can an expert please tell me if it is better or not). Before < vs after >. < subl $92, %esp < movaps %xmm0, 64(%esp) < movaps %xmm0, (%esp) < movl 4(%esp), %eax < movl %eax, 28(%esp) < movl (%esp), %eax < movl %eax, 24(%esp) < movq 24(%esp), %mm0 < movq %mm0, 56(%esp) --- > subl $44, %esp > movaps %xmm0, 16(%esp) > pshufd $1, %xmm0, %xmm1 > movd %xmm1, 4(%esp) > movd %xmm0, (%esp) > movq (%esp), %mm0 > movq %mm0, 8(%esp) < subl $92, %esp < movaps %xmm0, 64(%esp) < movaps %xmm0, (%esp) < movl 12(%esp), %eax < movl %eax, 28(%esp) < movl 8(%esp), %eax < movl %eax, 24(%esp) < movq 24(%esp), %mm0 < movq %mm0, 56(%esp) --- > subl $44, %esp > movaps %xmm0, 16(%esp) > pshufd $3, %xmm0, %xmm1 > movd %xmm1, 4(%esp) > movhlps %xmm0, %xmm0 > movd %xmm0, (%esp) > movq (%esp), %mm0 > movq %mm0, 8(%esp) < subl $92, %esp < movaps %xmm0, 64(%esp) --- > subl $44, %esp < movl 16(%esp), %eax < movl %eax, 48(%esp) < movl 20(%esp), %eax < movl %eax, 52(%esp) < movaps %xmm0, (%esp) < movl 4(%esp), %eax < movl %eax, 60(%esp) < movl (%esp), %eax < movl %eax, 56(%esp) --- > pshufd $1, %xmm0, %xmm1 > movd %xmm1, 4(%esp) > movd %xmm0, (%esp) > movd %xmm1, 12(%esp) > movd %xmm0, 8(%esp) < subl $92, %esp < movaps %xmm0, 64(%esp) --- > subl $44, %esp < movl 24(%esp), %eax < movl %eax, 48(%esp) < movl 28(%esp), %eax < movl %eax, 52(%esp) < movaps %xmm0, (%esp) < movl 12(%esp), %eax < movl %eax, 60(%esp) < movl 8(%esp), %eax < movl %eax, 56(%esp) --- > pshufd $3, %xmm0, %xmm1 > movd %xmm1, 4(%esp) > movhlps %xmm0, %xmm0 > movd %xmm0, (%esp) > movd %xmm1, 12(%esp) > movd %xmm0, 8(%esp) llvm-svn: 47672	2008-02-27 13:34:40 +00:00
Duncan Sands	2111bd2e37	LegalizeTypes support for legalizing the mask operand of a VECTOR_SHUFFLE. The mask is a vector of constant integers. The code in LegalizeDAG doesn't bother to legalize the mask, since it's basically just storage for a bunch of constants, however LegalizeTypes is more picky. The problem is that there may not exist any legal vector-of-integers type with a legal element type, so it is impossible to create a legal mask! Unless of course you cheat by creating a BUILD_VECTOR where the operands have a different type to the element type of the vector being built... This is pretty ugly but works - all relevant tests in the testsuite pass, and produce the same assembler with and without LegalizeTypes. llvm-svn: 47670	2008-02-27 13:03:44 +00:00
Duncan Sands	5d5bc484d0	LegalizeTypes support for INSERT_VECTOR_ELT. llvm-svn: 47669	2008-02-27 10:18:23 +00:00
Evan Cheng	8ae8e2d50b	Don't track max alignment during stack object allocations since they can be deleted later. Let PEI compute it. llvm-svn: 47668	2008-02-27 10:04:56 +00:00
Duncan Sands	96658d0189	Support for legalizing MEMBARRIER. llvm-svn: 47667	2008-02-27 08:53:44 +00:00
Bill Wendling	97925ec704	Final de-tabification. llvm-svn: 47663	2008-02-27 06:33:05 +00:00
Evan Cheng	6d56368caf	Spiller now remove unused spill slots. llvm-svn: 47657	2008-02-27 03:04:06 +00:00
Dan Gohman	66272a545b	Teach Legalize how to expand an EXTRACT_ELEMENT. llvm-svn: 47656	2008-02-27 01:52:30 +00:00
Dan Gohman	f19609abe8	Convert the last remaining users of the non-APInt form of ComputeMaskedBits to use the APInt form, and remove the non-APInt form. llvm-svn: 47654	2008-02-27 01:23:58 +00:00
Dan Gohman	ae2b6fbb8e	Convert SimplifyDemandedMask and ShrinkDemandedConstant to use APInt. Change several cases in SimplifyDemandedMask that don't ever do any simplifying to reuse the logic in ComputeMaskedBits instead of duplicating it. llvm-svn: 47648	2008-02-27 00:25:32 +00:00
Chris Lattner	d6bd311506	Use a smallvector for inactiveCounts and initialize it lazily instead of init'ing it maximally to zeros on entry. getFreePhysReg is pretty hot and only a few elements are typically used. This speeds up linscan by 5% on 176.gcc. llvm-svn: 47631	2008-02-26 22:08:41 +00:00
Bill Wendling	d7a258d325	Rename PrintableName to Name. llvm-svn: 47629	2008-02-26 21:47:57 +00:00
Bill Wendling	c24ea4fb41	Change "Name" to "AsmName" in the target register info. Gee, a refactoring tool would have been a Godsend here! llvm-svn: 47625	2008-02-26 21:11:01 +00:00
Evan Cheng	fa6b366892	Enable -coalescer-commute-instrs by default. llvm-svn: 47623	2008-02-26 20:40:22 +00:00
Dan Gohman	9db0aa86d9	Avoid aborting on invalid shift counts. llvm-svn: 47612	2008-02-26 18:50:50 +00:00
Chris Lattner	07c83cc86e	Fix PR2096, a regression introduced with my patch last night. This also fixes cfrac, flops, and 175.vpr llvm-svn: 47605	2008-02-26 17:09:59 +00:00
Duncan Sands	7cdbbfd067	Fix a nasty bug in LegalizeTypes (spotted in CodeGen/PowerPC/illegal-element-type.ll): suppose a node X is processed, and processing maps it to a node Y. Then X continues to exist in the DAG, but with no users. While processing some other node, a new node may be created that happens to be equal to X, and thus X will be reused rather than a truly new node. This can cause X to "magically reappear", and since it is in the Processed state in will not be reprocessed, so at the end of type legalization the illegal node X can still be present. The solution is to replace X with Y whenever X gets resurrected like this. llvm-svn: 47601	2008-02-26 11:21:42 +00:00
Bill Wendling	7bb51dfbb1	De-tabify. llvm-svn: 47598	2008-02-26 10:51:52 +00:00
Evan Cheng	2ff0b0e681	This is possible: vr1 = extract_subreg vr2, 3 ... vr3 = extract_subreg vr1, 2 The end result is vr3 is equal to vr2 with subidx 2. llvm-svn: 47592	2008-02-26 08:03:41 +00:00
Chris Lattner	e7c14013f5	Fix isNegatibleForFree to not return true for ConstantFP nodes after legalize. Just because a constant is legal (e.g. 0.0 in SSE) doesn't mean that its negated value is legal (-0.0). We could make this stronger by checking to see if the negated constant is actually legal post negation, but it doesn't seem like a big deal. llvm-svn: 47591	2008-02-26 07:04:54 +00:00
Evan Cheng	ccc0c996a4	Refactor inline asm constraint matching code out of SDIsel into TargetLowering. llvm-svn: 47587	2008-02-26 02:33:44 +00:00
Dan Gohman	432e4a6742	Make some static variables const. llvm-svn: 47566	2008-02-25 21:39:34 +00:00
Dan Gohman	1f372edd97	Convert MaskedValueIsZero and all its users to use APInt. Also add a SignBitIsZero function to simplify a common use case. llvm-svn: 47561	2008-02-25 21:11:39 +00:00
Evan Cheng	548677022c	All remat'ed loads cannot be folded into two-address code. Not just argument loads. This change doesn't really have any impact on codegen. llvm-svn: 47557	2008-02-25 19:24:01 +00:00
Duncan Sands	896c519d19	In debug builds check that the key property holds: all result and operand types are legal. llvm-svn: 47546	2008-02-25 16:21:21 +00:00
Evan Cheng	589a9fb6dc	Correctly determine whether a argument load can be folded into its uses. llvm-svn: 47545	2008-02-25 08:50:41 +00:00
Duncan Sands	ba3d7e8e7d	Add support to LegalizeTypes for building legal vectors out of illegal elements (BUILD_VECTOR). Uses and beefs up BUILD_PAIR, though it didn't really have to. Like most of LegalizeTypes, does not support soft-float. This cures all "make check" vector building failures. llvm-svn: 47537	2008-02-24 07:36:03 +00:00
Bill Wendling	a7d1ed4c98	Some platforms use the same name for 32-bit and 64-bit registers (like %r3 on PPC) in their ASM files. However, it's hard for humans to read during debugging. Adding a new field to the register data that lets you specify a different name to be printed than the one that goes into the ASM file -- %x3 instead of %r3, for instance. llvm-svn: 47534	2008-02-24 00:56:13 +00:00
Evan Cheng	504c645b3e	Rematerialization logic was overly conservative when it comes to loads from fixed stack slots. llvm-svn: 47529	2008-02-23 03:38:34 +00:00
Evan Cheng	379682b0e5	If remating a machine instr with virtual register operand, make sure the vr is avaliable at all uses regardless of whether it would be folded. llvm-svn: 47526	2008-02-23 02:14:42 +00:00
Evan Cheng	e70afb021b	Recognize loads of arguments as re-materializable first. Therefore if isReallyTriviallyReMaterializable() returns true it doesn't confuse it as a "normal" re-materializable instruction. llvm-svn: 47520	2008-02-23 01:44:27 +00:00
Evan Cheng	4f5cb4cdac	Fix spill weight updating bug. llvm-svn: 47507	2008-02-23 00:33:04 +00:00
Evan Cheng	b6d981bddd	Same isPhysRegAvailable bug as local register allocator. llvm-svn: 47500	2008-02-22 20:31:32 +00:00
Evan Cheng	52c15b3e6d	Really really bad local register allocator bug. On X86, it was never using ESI, EDI, and EBP because of a bug in RALocal::isPhysRegAvailable(). For example, when it checks if ESI is available, it then looks at registers aliases to ESI. SIL is marked -2 (not allocatable) but isPhysRegAvailable() incorrectly assumes it is in use and returns false for ESI. llvm-svn: 47499	2008-02-22 20:30:53 +00:00
Evan Cheng	a1977d32f0	Add debugging printfs. llvm-svn: 47496	2008-02-22 19:57:06 +00:00
Evan Cheng	ea1ef87ea2	Make sure reload of implicit uses are issued before remat's. llvm-svn: 47492	2008-02-22 19:22:06 +00:00
Dale Johannesen	eabc5f39af	Pass alignment on ByVal parameters, from FE, all the way through. It is now used for codegen. llvm-svn: 47484	2008-02-22 17:49:45 +00:00
Evan Cheng	c373911461	Enable re-materialization of instructions which have virtual register operands if the definition of the operand also reaches its uses. llvm-svn: 47475	2008-02-22 09:24:50 +00:00
Evan Cheng	271aef2b03	Fix compiler warning. llvm-svn: 47468	2008-02-22 01:48:00 +00:00
Dan Gohman	f3057a939d	Fix a regression in 403.gcc and 186.crafty introduced in 47383. To test that a value is >= 32, check that all of the high bits are zero, not just one or more. llvm-svn: 47467	2008-02-22 01:12:31 +00:00
Chris Lattner	3422b673d1	Make the clobber analysis a bit more smart: we only are careful about early clobbers if the clobber list contains a register not some thing like {memory}, {dirflag} etc. llvm-svn: 47457	2008-02-21 20:54:31 +00:00
Chris Lattner	bdd4c8b04d	Treat clobber operands like early clobbers: if we have any, we force sdisel to do all regalloc for an asm. This leads to gross but correct codegen. This fixes the rest of PR2078. llvm-svn: 47454	2008-02-21 19:43:13 +00:00
Bill Wendling	15526b2e52	Clear PhysRegPartUse for the sub register as well. llvm-svn: 47453	2008-02-21 19:35:27 +00:00
Bill Wendling	963192f40b	Adjust the MaxAlignment for the special register scavenging spill slot. llvm-svn: 47452	2008-02-21 19:33:53 +00:00
Evan Cheng	31160f5b98	Help testing. llvm-svn: 47448	2008-02-21 19:20:21 +00:00
Andrew Lenharth	7254826c40	Better names as per Evan's request llvm-svn: 47435	2008-02-21 16:11:38 +00:00
Andrew Lenharth	95528943e9	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Chris Lattner	4da4f85090	Add support for matching mem operands. This fixes PR1133, patch by Eli Friedman. This implements CodeGen/Generic/2008-02-20-MatchingMem.ll. llvm-svn: 47428	2008-02-21 05:27:19 +00:00
Chris Lattner	83c93d5afd	Fix a (harmless) but where vregs were added to the used reg lists for inline asms. Fix PR2078 by marking aliases of registers used when a register is marked used. This prevents EAX from being allocated when AX is listed in the clobber set for the asm. llvm-svn: 47426	2008-02-21 04:55:52 +00:00
Evan Cheng	911f6bd799	Clean up some spilling code using MachineRegisterInfo. llvm-svn: 47416	2008-02-21 00:34:19 +00:00
Bill Wendling	eac9e5ef21	Remove one of the fixmes that I put in there. From Evan: No need to go up more levels. A def of a register also sets its sub-registers (so if PhysRegInfo[SuperReg] is NULL, it means SuperReg's super registers are not previously defined). llvm-svn: 47399	2008-02-20 20:56:45 +00:00
Bill Wendling	cf2d1aa485	Improve some comments explaining the "handle kills" stuff better. llvm-svn: 47395	2008-02-20 19:35:34 +00:00
Bill Wendling	0b72219681	Fix comment. llvm-svn: 47389	2008-02-20 19:09:14 +00:00
Devang Patel	57b4eedad9	assert is more effective reminder then FIXME tag for unimplemented features. llvm-svn: 47388	2008-02-20 18:37:40 +00:00
Duncan Sands	e7b462b329	LegalizeTypes support for scalarizing a vector store and splitting extract_subvector. This fixes nine "make check" testcases, for example 2008-02-04-ExtractSubvector.ll and (partially) CodeGen/Generic/vector.ll. llvm-svn: 47384	2008-02-20 17:38:09 +00:00
Dan Gohman	34fc7dbf5b	Convert Legalize to use the APInt form of ComputeMaskedBits. llvm-svn: 47383	2008-02-20 16:57:27 +00:00
Dan Gohman	360c86aed5	Add explicit keywords. llvm-svn: 47382	2008-02-20 16:44:09 +00:00
Dan Gohman	d0ff91dac5	Convert DAGCombiner to use the APInt form of ComputeMaskedBits. llvm-svn: 47381	2008-02-20 16:33:30 +00:00
Dan Gohman	b717fdaa7b	Use APInt::intersects. llvm-svn: 47380	2008-02-20 16:30:17 +00:00
Anton Korobeynikov	18991d78fa	Fix newly-introduced 4.3 warnings llvm-svn: 47375	2008-02-20 12:07:57 +00:00
Anton Korobeynikov	035eaacd1f	Update gcc 4.3 warnings fix patch with recent head changes llvm-svn: 47368	2008-02-20 11:10:28 +00:00
Anton Korobeynikov	579f07135a	Unbreak build with gcc 4.3: provide missed includes and silence most annoying warnings. llvm-svn: 47367	2008-02-20 11:08:44 +00:00
Bill Wendling	b912351ec9	Added some comments and reformatted others. No functionality change. Added two "FIXMEs" for code that looks dubious to me (but I could be wrong). llvm-svn: 47366	2008-02-20 09:15:16 +00:00
Bill Wendling	406fdbd3ad	More constification of things. More comments added. No functionality changes. (Sorry for any formatting changes that creeped in.) llvm-svn: 47362	2008-02-20 07:36:31 +00:00
Chris Lattner	2a8037b5f5	Fix an incredibly subtle bug exposed by Ted's change to APInt profiling. AddNodeIDNode does profiling for a ConstantSDNode, but so does SelectionDAG::getConstant. This profiling should be moved to a common static function in ConstantSDNode. llvm-svn: 47359	2008-02-20 06:28:01 +00:00
Bill Wendling	59cc15955f	No functionality change: - Constified some MachineOperand values. - Added/Modified some comments. llvm-svn: 47358	2008-02-20 06:10:21 +00:00
Devang Patel	295711f583	Add GetResultInst. First step for multiple return value support. llvm-svn: 47348	2008-02-19 22:15:16 +00:00
Evan Cheng	3266ff9a6f	PR1909: Tail merging pass ran wild. It makes no sense to merge blocks in order to save a single instruction since a branch will be inserted for each BB. llvm-svn: 47301	2008-02-19 02:09:37 +00:00
Evan Cheng	6200c225e0	- When DAG combiner is folding a bit convert into a BUILD_VECTOR, it should check if it's essentially a SCALAR_TO_VECTOR. Avoid turning (v8i16) <10, u, u, u> to <10, 0, u, u, u, u, u, u>. Instead, simply convert it to a SCALAR_TO_VECTOR of the proper type. - X86 now normalize SCALAR_TO_VECTOR to (BIT_CONVERT (v4i32 SCALAR_TO_VECTOR)). Get rid of X86ISD::S2VEC. llvm-svn: 47290	2008-02-18 23:04:32 +00:00
Evan Cheng	b2e4b7adde	- Remove the previous check which broke coalescer-commute3.ll - For now, conservatively ignore copy MI whose source is a physical register. Commuting its def MI can cause a physical register live interval to be live through a loop (since we know it's live coming into the def MI). llvm-svn: 47281	2008-02-18 18:56:31 +00:00
Roman Levenstein	0b2c8858df	New helper function getMBBFromIndex() that given an index in any instruction of an MBB returns a pointer the MBB. Reviewed by Evan. llvm-svn: 47267	2008-02-18 09:35:30 +00:00
Evan Cheng	8f90724a53	For now, avoid commuting def MI for copy MI's whose source is not killed. That simply trade a live interval for another and because only the non-two-address operands can be folded into loads, may end up pessimising code. llvm-svn: 47262	2008-02-18 08:40:53 +00:00
Andrew Lenharth	fedcf477b5	I cannot find a libgcc function for this builtin. Therefor expanding it to a noop (which is how it use to be treated). If someone who knows the x86 backend better than me could tell me how to get a lock prefix on an instruction, that would be nice to complete x86 support. llvm-svn: 47213	2008-02-16 14:46:26 +00:00
Duncan Sands	b289516a71	Teach LegalizeTypes how to expand the operands of br_cc. This fixes 5 "make check" failures. llvm-svn: 47212	2008-02-16 10:29:26 +00:00
Evan Cheng	652e4618e2	Refactor some code; check if commuteInstruction is able to commute the instruction. llvm-svn: 47208	2008-02-16 02:32:17 +00:00
Andrew Lenharth	9b254eed32	llvm.memory.barrier, and impl for x86 and alpha llvm-svn: 47204	2008-02-16 01:24:58 +00:00
Bill Wendling	f861fbaae8	Fix typos. llvm-svn: 47200	2008-02-16 01:09:25 +00:00
Dan Gohman	27ae573900	Rename CountMemOperands to ComputeMemOperandsEnd to reflect what it actually does. Simplify CountOperands a little by reusing ComputeMemOperandsEnd. And reword some comments for both. llvm-svn: 47198	2008-02-16 00:36:48 +00:00
Dan Gohman	856c01204b	Revert 47177, which was incorrect. llvm-svn: 47196	2008-02-16 00:25:40 +00:00
Scott Michel	a3cefeaf0c	Make tblgen a little smarter about constants smaller than i32. Currently, tblgen will complain if a sign-extended constant does not fit into a data type smaller than i32, e.g., i16. This causes a problem when certain hex constants are used, such as 0xff for byte masks or immediate xor values. tblgen will try the sign-extended value first and, if the sign extended value would overflow, it tries to see if the unsigned value will fit. Consequently, a software developer can now safely incant: (XORHIr16 R16C:$rA, 0xffff) which is somewhat clearer and more informative than incanting: (XORHIr16 R16C:$rA, (i16 -1)) even if the two are bitwise equivalent. Tblgen also outputs the 64-bit unsigned constant in the generated ISel code when getTargetConstant() is invoked. llvm-svn: 47188	2008-02-15 23:05:48 +00:00
Evan Cheng	803bb6d699	The copy instruction being coalesced will be removed, it is not a kill. llvm-svn: 47179	2008-02-15 21:36:51 +00:00
Dan Gohman	c278c4aba0	Skip over the defs and start at the uses when looking for operands with the TIED_TO attribute. llvm-svn: 47177	2008-02-15 20:59:17 +00:00
Dan Gohman	0340d1e2cd	Use the TargetInstrDescr to determine the number of operands that should be checked for the TIED_TO attribute instead of using CountOperands. llvm-svn: 47176	2008-02-15 20:50:13 +00:00
Duncan Sands	5560281c06	Teach LegalizeTypes how to promote the flags in a ret node. These are created as i32 constants but on some platforms i32 is not legal. This fixes 26 "make check" failures, for example Alpha/2005-07-12-TwoMallocCalls.ll. llvm-svn: 47172	2008-02-15 19:34:17 +00:00
Evan Cheng	2ff2da89ab	- Removing the infamous r2rMap_ and rep() method. Now the coalescer will update register defs and uses after each successful coalescing. - Also removed a number of hacks and fixed some subtle kill information bugs. llvm-svn: 47167	2008-02-15 18:24:29 +00:00
Evan Cheng	9215129f4e	Added CommuteChangesDestination(). This returns true if commuting the specified machine instr will change its definition register. llvm-svn: 47166	2008-02-15 18:21:33 +00:00
Evan Cheng	78b0edb957	Remove unnecessary #include. llvm-svn: 47164	2008-02-15 18:12:09 +00:00
Dan Gohman	a36ade5595	Use StoreSDNode::getValue instead of calling getOperand directly with a hard-coded operand number. llvm-svn: 47163	2008-02-15 18:11:59 +00:00
Chris Lattner	558a3ba17f	Fix a miscompilation from Dan's recent apintification. llvm-svn: 47128	2008-02-14 18:48:56 +00:00
Duncan Sands	4c95dbd69f	In TargetLowering::LowerCallTo, don't assert that the return value is zero-extended if it isn't sign-extended. It may also be any-extended. Also, if a floating point value was returned in a larger floating point type, pass 1 as the second operand to FP_ROUND, which tells it that all the precision is in the original type. I think this is right but I could be wrong. Finally, when doing libcalls, set isZExt on a parameter if it is "unsigned". Currently isSExt is set when signed, and nothing is set otherwise. This should be right for all calls to standard library routines. llvm-svn: 47122	2008-02-14 17:28:50 +00:00
Nate Begeman	53e1b3f9d5	Change how FP immediates are handled. 1) ConstantFP is now expand by default 2) ConstantFP is not turned into TargetConstantFP during Legalize if it is legal. This allows ConstantFP to be handled like Constant, allowing for targets that can encode FP immediates as MachineOperands. As a bonus, fix up Itanium FP constants, which now correctly match, and match more constants! Hooray. llvm-svn: 47121	2008-02-14 08:57:00 +00:00
Nate Begeman	26b76b69f4	Support a new type of MachineOperand, MO_FPImmediate, used for holding FP Immediates, crazily enough llvm-svn: 47117	2008-02-14 07:39:30 +00:00
Dan Gohman	7e22a5d8df	Allow the APInt form of ComputeMaskedBits to operate on i128 types. llvm-svn: 47101	2008-02-13 23:13:32 +00:00
Dan Gohman	95d25d39d0	Avoid setting bits that aren't demanded. llvm-svn: 47098	2008-02-13 22:43:25 +00:00
Dan Gohman	e1d9ee66ed	Simplify some logic in ComputeMaskedBits. And change ComputeMaskedBits to pass the mask APInt by value, not by reference. llvm-svn: 47096	2008-02-13 22:28:48 +00:00
Nicolas Geoffray	21ad494f67	Enable exception handling int JIT llvm-svn: 47079	2008-02-13 18:39:37 +00:00
Duncan Sands	f8d29f228d	Teach LegalizeTypes how to expand and promote CTLZ, CTTZ and CTPOP. The expansion code differs from that in LegalizeDAG in that it chooses to take the CTLZ/CTTZ count from the Hi/Lo part depending on whether the Hi/Lo value is zero, not on whether CTLZ/CTTZ of Hi/Lo returned 32 (or whatever the width of the type is) for it. I made this change because the optimizers may well know that Hi/Lo is zero and exploit it. The promotion code for CTTZ also differs from that in LegalizeDAG: it uses an "or" to get the right result when the original value is zero, rather than using a compare and select. This also means the value doesn't need to be zero extended. llvm-svn: 47075	2008-02-13 18:01:53 +00:00
Evan Cheng	587c66ed96	Some code clean up. llvm-svn: 47060	2008-02-13 09:56:03 +00:00
Evan Cheng	dc3f3841fc	Simplify. llvm-svn: 47058	2008-02-13 09:13:21 +00:00
Evan Cheng	bb4b97f90e	Fix a potential serious problem where kills belonging to the val# defined by a two-address instruction is also on the val# that defines the input. llvm-svn: 47057	2008-02-13 09:06:18 +00:00
Evan Cheng	8cc58728a8	* Cannot safely commute an instruction there are other defs which can reach its uses. * Ignore copy instructions which have already been coalesced. llvm-svn: 47056	2008-02-13 08:41:08 +00:00
Chris Lattner	a08af08a88	In SDISel, for targets that support FORMAL_ARGUMENTS nodes, lower this node as soon as we create it in SDISel. Previously we would lower it in legalize. The problem with this is that it only exposes the argument loads implied by FORMAL_ARGUMENTs after legalize, so that only dag combine 2 can hack on them. This causes us to miss some optimizations because datatype expansion also happens here. Exposing the loads early allows us to do optimizations on them. For example we now compile arg-cast.ll to: _foo: movl $2147483647, %eax andl 8(%esp), %eax ret where we previously produced: _foo: subl $12, %esp movsd 16(%esp), %xmm0 movsd %xmm0, (%esp) movl $2147483647, %eax andl 4(%esp), %eax addl $12, %esp ret It might also make sense to do this for ISD::CALL nodes, which have implicit stores on many targets. llvm-svn: 47054	2008-02-13 07:39:09 +00:00
Chris Lattner	ee322b44a4	teach dag combiner how to eliminate MERGE_VALUES nodes. llvm-svn: 47052	2008-02-13 07:25:05 +00:00
Nate Begeman	735ab3ce67	Support legalizing insert_vector_elt on targets where the element type is not legal. llvm-svn: 47048	2008-02-13 06:43:04 +00:00
Evan Cheng	1446726f3e	Initial support for copy elimination by commuting its definition MI. PR1877. A3 = op A2 B0<kill> ... B1 = A3 <- this copy ... = op A3 <- more uses ==> B2 = op B0 A2<kill> ... B1 = B2 <- now an identify copy ... = op B2 <- more uses This speeds up FreeBench/neural by 29%, Olden/bh by 12%, oopack_v1p8 by 53%. llvm-svn: 47046	2008-02-13 03:01:43 +00:00
Evan Cheng	47f462a7ec	- Added removeValNo() to remove all live ranges of a particular value#. - removeRange() can now update value# information. llvm-svn: 47044	2008-02-13 02:48:26 +00:00
Evan Cheng	244183ef0d	commuteInstr() can now commute non-ssa machine instrs. llvm-svn: 47043	2008-02-13 02:46:49 +00:00
Evan Cheng	61732d994e	Added debugging routine dumpUses. llvm-svn: 47042	2008-02-13 02:45:38 +00:00
Dan Gohman	f990faf23b	Convert SelectionDAG::ComputeMaskedBits to use APInt instead of uint64_t. Add an overload that supports the uint64_t interface for use by clients that haven't been updated yet. llvm-svn: 47039	2008-02-13 00:35:47 +00:00
Duncan Sands	f213e82bc5	Generalize getCopyFromParts and getCopyToParts to handle arbitrary precision integers and any number of parts. For example, on a 32 bit machine an i50 corresponds to two i32 parts. getCopyToParts will extend the i50 to an i64 then write half of the i64 to each part; getCopyFromParts will combine the two i32 parts into an i64 then truncate the result to i50. llvm-svn: 47024	2008-02-12 20:46:31 +00:00
Duncan Sands	a6ab6e7adb	Generalize the handling of call and return arguments, in preparation for apint support. These changes are intended to have no functional effect. llvm-svn: 46967	2008-02-11 20:58:28 +00:00
Dan Gohman	11f6212bc0	From Chris' review: use isa instead of explicitly using classof. llvm-svn: 46964	2008-02-11 19:00:34 +00:00
Dan Gohman	991056808b	From Chris' review: minor corrections in comments. llvm-svn: 46963	2008-02-11 19:00:03 +00:00
Dan Gohman	54d3b5a1f5	From Chris' review: use cast instead of dyn_cast with an assert. llvm-svn: 46962	2008-02-11 18:58:42 +00:00
Dan Gohman	7b5d916c98	From Chris' review: fix 80 column violations llvm-svn: 46961	2008-02-11 18:57:43 +00:00
Ted Kremenek	6f30a0798f	Added "Profile" method to APFloat for use with FoldingSet. Added member template "Add" to FoldingSetNodeID that allows "adding" arbitrary objects to a profile via dispatch to FoldingSetTrait<T>::Profile(). Removed FoldingSetNodeID::AddAPFloat and FoldingSetNodeID::APInt, as their functionality is now replaced using the above mentioned member template. llvm-svn: 46957	2008-02-11 17:24:50 +00:00
Duncan Sands	7377f5fbe3	Add a isBigEndian method to complement isLittleEndian. llvm-svn: 46954	2008-02-11 10:37:04 +00:00
Evan Cheng	ad4d57a2f5	Determine whether a spill kills the register it's spilling before insertion rather than trying to undo the kill marker afterwards. llvm-svn: 46953	2008-02-11 08:30:52 +00:00
Dan Gohman	3a4be0fdef	Rename MRegisterInfo to TargetRegisterInfo. llvm-svn: 46930	2008-02-10 18:45:23 +00:00
Duncan Sands	56689502c1	Add truncate and AssertZext result expansion. llvm-svn: 46926	2008-02-10 10:08:52 +00:00
Bill Wendling	9c2ce9a32d	Return "(c1 + c2)" instead of yet another ADD node (which made this a no-op). llvm-svn: 46922	2008-02-10 08:10:24 +00:00
Chris Lattner	0ededbc68e	add anote llvm-svn: 46918	2008-02-10 01:01:35 +00:00
Evan Cheng	6aabf837fe	Remove unused hidden option. llvm-svn: 46903	2008-02-09 08:36:28 +00:00
Dan Gohman	65f63eba2b	Change ConstantSDNode to store an APInt instead of a uint64_t, and begin adding some methods to use it this way. llvm-svn: 46899	2008-02-08 22:59:30 +00:00
Evan Cheng	f2bd1387b0	Forgot these files. llvm-svn: 46896	2008-02-08 22:05:27 +00:00
Evan Cheng	e460869d86	Also print alignment. llvm-svn: 46895	2008-02-08 22:05:07 +00:00
Dan Gohman	140a73efac	Avoid needlessly casting away const qualifiers. llvm-svn: 46876	2008-02-08 03:26:46 +00:00
Evan Cheng	6a80462568	Remove remnant of load folding in local register allocator. Patch by Holger Schurig. llvm-svn: 46861	2008-02-07 19:46:55 +00:00
Dan Gohman	16d4bc3dc0	Follow Chris' suggestion; change the PseudoSourceValue accessors to return pointers instead of references, since this is always what is needed. llvm-svn: 46857	2008-02-07 18:41:25 +00:00
Dan Gohman	b781c79d2c	Don't abort if a MemOperand is missing a SourceValue; just print it as <unknown>. And make some minor adjustments to the MemOperand dump format. llvm-svn: 46853	2008-02-07 16:18:00 +00:00
Nick Lewycky	7c1d787977	Don't make up new directives. (".set_foobar") llvm-svn: 46848	2008-02-07 06:36:26 +00:00
Dan Gohman	2d489b5081	Re-apply the memory operand changes, with a fix for the static initializer problem, a minor tweak to the way the DAGISelEmitter finds load/store nodes, and a renaming of the new PseudoSourceValue objects. llvm-svn: 46827	2008-02-06 22:27:42 +00:00
Evan Cheng	1ec748c784	Fix a number of local register allocator issues: PR1609. llvm-svn: 46821	2008-02-06 19:16:53 +00:00
Evan Cheng	8291ab4449	RegAllocaLocal still requires LiveVariables since it runs PHIElimination, followed by TwoAddress which requires LiveVariables. We cannot run LiveVariables on non-SSA code. llvm-svn: 46813	2008-02-06 08:00:32 +00:00
Evan Cheng	87fbd66f9f	Fix PR1975: dag isel emitter produces patterns that isel wrong flag result. llvm-svn: 46776	2008-02-05 22:50:29 +00:00
Evan Cheng	8d78b0597b	If a vr is already marked alive in a bb, then it has PHI uses that are visited earlier, then it is not killed in the def block (i.e. not dead). llvm-svn: 46763	2008-02-05 20:04:18 +00:00
Evan Cheng	ac3cd69add	Typo. llvm-svn: 46725	2008-02-04 23:10:38 +00:00
Evan Cheng	2cb9068c78	Dwarf requires variable entries to be in the source order. Right now, since we are recording variable information at isel time this means parameters would appear in the reverse order. The short term fix is to issue recordVariable() at asm printing time instead. llvm-svn: 46724	2008-02-04 23:06:48 +00:00
Duncan Sands	354e353220	I don't see how NodeUpdated can be called with a ReadyToProcess node - add an assertion to check this. Add an assertion to NodeDeleted that checks that processed/ready nodes are indeed not deleted. It is because they are never deleted that none of the maps can have a deleted node as the source of a mapping. It does however seem to be possible in theory to have a deleted value as the target of a mapping, however this has not yet been spotted in the wild. Still mulling on what to do about this. [The theoretical situation is this: a node A is expanded/promoted/whatever to a newly created node B. Thus A->B is added to a map. When the subtree rooted at B is legalized it is conceivable that B is deleted due to RAUW on a node somewhere above it]. llvm-svn: 46705	2008-02-04 09:29:17 +00:00
Chris Lattner	c41e535df1	Fix typo llvm-svn: 46682	2008-02-03 07:30:27 +00:00
Chris Lattner	62f67ea73a	handle the case where a node can become ready to process multiple times due to a RAUW. llvm-svn: 46680	2008-02-03 07:13:32 +00:00
Chris Lattner	4e9898825e	Use the new infrastructure for listening to node updates to keep the LegalizeTypes node flags up to date when doing a RAUW. This fixes a nasty bug that Duncan ran into and makes the previous (nonbuggy case) more efficent. llvm-svn: 46679	2008-02-03 07:08:51 +00:00
Chris Lattner	d2d166ea9f	the world doesn't need my debugging code. llvm-svn: 46678	2008-02-03 07:01:05 +00:00
Chris Lattner	b2b9d6f0fb	Change the 'global modification' APIs in SelectionDAG to take a new DAGUpdateListener object pointer instead of just returning a vector of deleted nodes. This makes the interfaces more efficient (no more allocating a vector [at least a malloc], filling it in, then walking it) and more clean. This also allows the client to be notified of nodes that are changed but not deleted. llvm-svn: 46677	2008-02-03 06:49:24 +00:00
Chris Lattner	7685891aa3	Generalize the SDOperand->SDOperand form of SelectionDAG::ReplaceAllUsesWith to handle replacement of an SDOperand with any sdoperand, not just one for a node with a single result. Note that this has a horrible FIXME'd hack in it to work around PR1975. This should be removed when PR1975 is fixed. llvm-svn: 46674	2008-02-03 03:35:22 +00:00
Chris Lattner	f34dfe4f28	add a -view-legalize-types-dags option, for viewing the dags going into legalize types. llvm-svn: 46672	2008-02-03 02:05:04 +00:00
Evan Cheng	32e5347eb8	Get rid of the annoying blank lines before labels. llvm-svn: 46667	2008-02-02 08:39:46 +00:00
Evan Cheng	efd142a920	SDIsel processes llvm.dbg.declare by recording the variable debug information descriptor and its corresponding stack frame index in MachineModuleInfo. This only works if the local variable is "homed" in the stack frame. It does not work for byval parameter, etc. Added ISD::DECLARE node type to represent llvm.dbg.declare intrinsic. Now the intrinsic calls are lowered into a SDNode and lives on through out the codegen passes. For now, since all the debugging information recording is done at isel time, when a ISD::DECLARE node is selected, it has the side effect of also recording the variable. This is a short term solution that should be fixed in time. llvm-svn: 46659	2008-02-02 04:07:54 +00:00
Evan Cheng	d6e44ab5ec	Remove the nasty LABEL hack with a much less evil one. Now llvm.dbg.func.start implies a stoppoint is set. SelectionDAGISel records a new source line but does not create a ISD::LABEL node for this special stoppoint. Asm printer will magically print this label. This ensures nothing is emitted before. llvm-svn: 46635	2008-02-01 09:10:45 +00:00
Evan Cheng	263070ea2b	Rename RecordLabel to RecordSourceLine because that's what it is doing. llvm-svn: 46628	2008-02-01 02:05:57 +00:00
Evan Cheng	27b32b87ed	Revert 46556 and 46585. Dan please fix the PseudoSourceValue problem and re-commit. llvm-svn: 46623	2008-01-31 21:00:00 +00:00
Evan Cheng	f4f1d44779	Add a comment for a nasty short term hack. llvm-svn: 46610	2008-01-31 10:05:13 +00:00
Evan Cheng	1c6c16ea11	Add an extra operand to LABEL nodes which distinguishes between debug, EH, or misc labels. This fixes the EH breakage. However I am not convinced this is the solution. llvm-svn: 46609	2008-01-31 09:59:15 +00:00
Christopher Lamb	58ffa8c57a	Add more thorough error checking for NULL register classes. llvm-svn: 46605	2008-01-31 07:09:08 +00:00
Evan Cheng	a41d3bcb12	MRegisterInfo::getLocation() is a really bad idea. Its function is to calculate the offset from frame pointer to a stack slot and then storing the delta in a MachineLocation object. The name is bad (it implies a getter), and MRegisterInfo doesn't need to know about MachineLocation. Replace getLocation() with getFrameIndexOffset() which returns the delta from frame pointer to stack slot. Dwarf writer can then use the information for whatever it wants. llvm-svn: 46597	2008-01-31 03:37:28 +00:00
Dan Gohman	9ba4d76816	Rename ISD::FLT_ROUNDS to ISD::FLT_ROUNDS_ to avoid conflicting with the real FLT_ROUNDS (defined in <float.h>). llvm-svn: 46587	2008-01-31 00:41:03 +00:00
Evan Cheng	4863fcc3eb	Also avoid adding callee save code before debug labels. llvm-svn: 46586	2008-01-31 00:27:49 +00:00
Dan Gohman	3646fdda67	Create a new class, MemOperand, for describing memory references in the backend. Introduce a new SDNode type, MemOperandSDNode, for holding a MemOperand in the SelectionDAG IR, and add a MemOperand list to MachineInstr, and code to manage them. Remove the offset field from SrcValueSDNode; uses of SrcValueSDNode that were using it are all all using MemOperandSDNode now. Also, begin updating some getLoad and getStore calls to use the PseudoSourceValue objects. Most of this was written by Florian Brander, some reorganization and updating to TOT by me. llvm-svn: 46585	2008-01-31 00:25:39 +00:00
Evan Cheng	b9b740119d	Fixed a bug in MergeValueInAsValue() pointed out by David Greene. Replace val# with previous liverange's. llvm-svn: 46579	2008-01-30 22:44:55 +00:00
Evan Cheng	a3395a61cc	Treat the label for the first @llvm.dbg.stoppoint the same way as the dbg_func_start label. Make sure nothing else is inserted before them. Note this solution might be somewhat fragile since ISD::LABEL may be used for other purposes. If that ends up to be an issue, we may need to introduce a different node for debug labels. llvm-svn: 46571	2008-01-30 20:08:35 +00:00
Dale Johannesen	19cf69ff9d	Adjust loop per review feedback. llvm-svn: 46569	2008-01-30 19:44:39 +00:00
Evan Cheng	a3ff8e6110	A semi-gross fix for a debug info issue. When inserting the "function start" label (i.e. first label in the entry block) take care to insert it at the beginning of the block. llvm-svn: 46568	2008-01-30 19:35:32 +00:00
Dale Johannesen	56d4903db5	Accept getelementptr starting at GV with all 0 indices as a legitimate way of representing global variable GV in debug info. llvm-svn: 46565	2008-01-30 19:00:21 +00:00
Evan Cheng	29cfb67e28	Even though InsertAtEndOfBasicBlock is an ugly hack it still deserves a proper name. Rename it to EmitInstrWithCustomInserter since it does not necessarily insert instruction at the end. llvm-svn: 46562	2008-01-30 18:18:23 +00:00
Dan Gohman	02b6792dd4	Add a new PseudoSourceValue class, which will be used to help track memory reference information in the backend. Most of this was written by Florian Brander, cleanup and updating to TOT by me. llvm-svn: 46556	2008-01-30 16:35:31 +00:00
Dan Gohman	47a7d6fafe	Factor the addressing mode and the load/store VT out of LoadSDNode and StoreSDNode into their common base class LSBaseSDNode. Member functions getLoadedVT and getStoredVT are replaced with the common getMemoryVT to simplify code that will handle both loads and stores. llvm-svn: 46538	2008-01-30 00:15:11 +00:00
Duncan Sands	032a5d2690	When expanding an operand, it's not the result value type that matters but the operand type. This fixes 2008-01-08-IllegalCMP.ll which crashed with the new legalize infrastructure because SETCC with result type i8 and operand type i64 was being custom expanded by the X86 backend. With this fix, the gcc build gets as far as the first libcall. llvm-svn: 46525	2008-01-29 19:29:08 +00:00
Dan Gohman	70de4cb1cd	Use empty() instead of comparing size() with zero. llvm-svn: 46514	2008-01-29 13:02:09 +00:00
Dan Gohman	cf8827a282	Fix a typo in a comment. llvm-svn: 46513	2008-01-29 12:43:50 +00:00
Dan Gohman	cd170a7017	Fix a typo in a comment. llvm-svn: 46508	2008-01-29 12:07:11 +00:00
Duncan Sands	05837edae7	Use getPreferredAlignmentLog or getPreferredAlignment to get the alignment of global variables, rather than using hand-made versions. llvm-svn: 46495	2008-01-29 06:23:44 +00:00
Owen Anderson	5aa1615add	RegAllocBigBlock doesn't need LiveVariables either. llvm-svn: 46488	2008-01-29 02:32:13 +00:00
Nate Begeman	ef33767efb	Properly expand extract-element for non-power-of-2 codegen llvm-svn: 46486	2008-01-29 02:24:00 +00:00
Dale Johannesen	2b3bc30420	Handle 'X' constraint in asm's better. llvm-svn: 46485	2008-01-29 02:21:21 +00:00
Chris Lattner	2ee91f4300	Fix PowerPC/./2007-10-18-PtrArithmetic.ll llvm-svn: 46424	2008-01-27 23:32:17 +00:00
Chris Lattner	d0496d0433	fix a crash on CodeGen/X86/vector-rem.ll llvm-svn: 46422	2008-01-27 23:21:58 +00:00
Owen Anderson	9a8c890c02	Reg alloc doesn't really need LiveVariables. llvm-svn: 46420	2008-01-27 22:00:00 +00:00
Chris Lattner	888560d62c	Implement some dag combines that allow doing fneg/fabs/fcopysign in integer registers if used by a bitconvert or using a bitconvert. This allows us to avoid constant pool loads and use cheaper integer instructions when the values come from or end up in integer regs anyway. For example, we now compile CodeGen/X86/fp-in-intregs.ll to: _test1: movl $2147483648, %eax xorl 4(%esp), %eax ret _test2: movl $1065353216, %eax orl 4(%esp), %eax andl $3212836864, %eax ret Instead of: _test1: movss 4(%esp), %xmm0 xorps LCPI2_0, %xmm0 movd %xmm0, %eax ret _test2: movss 4(%esp), %xmm0 andps LCPI3_0, %xmm0 movss LCPI3_1, %xmm1 andps LCPI3_2, %xmm1 orps %xmm0, %xmm1 movd %xmm1, %eax ret bitconverts can happen due to various calling conventions that require fp values to passed in integer regs in some cases, e.g. when returning a complex. llvm-svn: 46414	2008-01-27 17:42:27 +00:00
Chris Lattner	f1a6c9fe86	For long double constants, print an approximation of their value to the .s file to make it easier to read. llvm-svn: 46407	2008-01-27 06:09:28 +00:00
Chris Lattner	e30e33af4f	Infer alignment of loads and increase their alignment when we can tell they are from the stack. This allows us to compile stack-align.ll to: _test: movsd LCPI1_0, %xmm0 movapd %xmm0, %xmm1 * andpd 4(%esp), %xmm1 andpd _G, %xmm0 addsd %xmm1, %xmm0 movl 20(%esp), %eax movsd %xmm0, (%eax) ret instead of: _test: movsd LCPI1_0, %xmm0 movsd 4(%esp), %xmm1 ** andpd %xmm0, %xmm1 andpd _G, %xmm0 addsd %xmm1, %xmm0 movl 20(%esp), %eax movsd %xmm0, (%eax) ret llvm-svn: 46401	2008-01-26 19:45:50 +00:00
Chris Lattner	31e9edce1c	Fix some bugs in SimplifyNodeWithTwoResults where it would call deletenode to delete a node even if it was not dead in some cases. Instead, just add it to the worklist. Also, make sure to use the CombineTo methods, as it was doing things that were unsafe: the top level combine loop could touch dangling memory. This fixes CodeGen/Generic/2008-01-25-dag-combine-mul.ll llvm-svn: 46384	2008-01-26 01:09:19 +00:00
Chris Lattner	720d8999c7	don't bother making x&-1 only to simplify it in dag combine. This commonly occurs expanding i64 ops. llvm-svn: 46383	2008-01-26 01:05:42 +00:00
Chris Lattner	cb3cf546c3	reduce indentation llvm-svn: 46377	2008-01-25 23:34:24 +00:00
Chris Lattner	fc80996a21	fix long lines. llvm-svn: 46355	2008-01-25 17:24:52 +00:00
Chris Lattner	2d7a830ff3	Add skeletal code to increase the alignment of loads and stores when we can infer it. This will eventually help stuff, though it doesn't do much right now because all fixed FI's have an alignment of 1. llvm-svn: 46349	2008-01-25 07:20:16 +00:00
Chris Lattner	6068832dbe	move MachineFrameInfo::CreateFixedObject out of line, give MachineFrameInfo a reference to TargetFrameInfo. Rearrange order of fields in StackObject to save a word. llvm-svn: 46348	2008-01-25 07:19:06 +00:00
Chris Lattner	da52d9e093	include alignment and volatility information in -view-*-dags output llvm-svn: 46347	2008-01-25 06:40:45 +00:00
Chris Lattner	8d83271b25	Don't dump the function! llvm-svn: 46320	2008-01-24 19:28:11 +00:00
Chris Lattner	34ed27c46d	clarify a comment, thanks Duncan. llvm-svn: 46313	2008-01-24 17:10:01 +00:00
Chris Lattner	e97fa8cdf0	Fix this buggy transformation. Two observations: 1. we already know the value is dead, so don't bother replacing it with undef. 2. The very case the comment describes actually makes the load live which asserts in deletenode. If we do the replacement and the node becomes live, just treat it as new. This fixes a failure on X86/2008-01-16-InvalidDAGCombineXform.ll with some local changes in my tree. llvm-svn: 46306	2008-01-24 07:57:06 +00:00
Chris Lattner	d66eac62fd	The dag combiner is missing revisiting nodes that it really should, and thus leaving dead stuff around. This gets fed into the isel pass and causes certain foldings from happening because nodes have extraneous uses floating around. For example, if we turned foo(bar(x)) -> baz(x), we sometimes left bar(x) around. llvm-svn: 46305	2008-01-24 07:18:21 +00:00
Chris Lattner	0feb1b0f84	fold fp_round(fp_round(x)) -> fp_round(x). llvm-svn: 46304	2008-01-24 06:45:35 +00:00
Owen Anderson	2a8a485630	Move some functionality for adding flags to MachineInstr's into methods on MachineInstr rather than LiveVariables. llvm-svn: 46295	2008-01-24 01:10:07 +00:00
Evan Cheng	ec3da554e6	Forgot these. llvm-svn: 46292	2008-01-24 00:22:01 +00:00
Duncan Sands	95d46ef887	The last pieces needed for loading arbitrary precision integers. This won't actually work (and most of the code is dead) unless the new legalization machinery is turned on. While there, I rationalized the handling of i1, and removed some bogus (and unused) sextload patterns. For i1, this could result in microscopically better code for some architectures (not X86). It might also result in worse code if annotating with AssertZExt nodes turns out to be more harmful than helpful. llvm-svn: 46280	2008-01-23 20:39:46 +00:00
Owen Anderson	7fe0bb2b43	Fix an iterator invalidation issue. llvm-svn: 46263	2008-01-22 23:58:54 +00:00
Chris Lattner	1671361c5c	Simplify SelectionDAG::getNode so that a big switch stmt is not #ifndef NDEBUG. This is in response to a really nasty bug I introduced that Dale tracked down, hopefully this won't happen in the future. Many thanks Dale. llvm-svn: 46254	2008-01-22 19:09:33 +00:00
Duncan Sands	88de26cffb	The final piece needed for storing arbitrary precision integers. Handle truncstore of a legal type to an unusual number of bits. Most of this code is not reachable unless the new legalize infrastructure is turned on. llvm-svn: 46249	2008-01-22 07:17:34 +00:00
Owen Anderson	7fb6241733	Clarify a deviation from the original algorithm. llvm-svn: 46218	2008-01-21 22:03:00 +00:00
Owen Anderson	d990b4f646	Improve a few comments. llvm-svn: 46217	2008-01-21 22:01:01 +00:00
Dale Johannesen	59e0e4bf35	Move DAG-changing code out of #ifndef NDEBUG. llvm-svn: 46204	2008-01-21 01:00:34 +00:00
Dale Johannesen	949e5a2f8a	Do not generate a FP_ROUND of f64 to f64. llvm-svn: 46195	2008-01-20 01:18:38 +00:00
Chris Lattner	bc6cf9e810	remove extraneous &'s. llvm-svn: 46171	2008-01-18 19:36:20 +00:00
Chris Lattner	1ea55cf816	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	7eabed3521	code cleanups, no functionality change. llvm-svn: 46126	2008-01-17 07:20:38 +00:00
Chris Lattner	72733e573b	* Introduce a new SelectionDAG::getIntPtrConstant method and switch various codegen pieces and the X86 backend over to using it. * Add some comments to SelectionDAGNodes.h * Introduce a second argument to FP_ROUND, which indicates whether the FP_ROUND changes the value of its input. If not it is safe to xform things like fp_extend(fp_round(x)) -> x. llvm-svn: 46125	2008-01-17 07:00:52 +00:00
Evan Cheng	54c20b559e	When a live virtual register is being clobbered by an implicit def, it is spilled and the spill is its kill. However, if the local allocator has determined the register has not been modified (possible when its value was reloaded), it would not issue a restore. In that case, mark the last use of the virtual register as kill. llvm-svn: 46111	2008-01-17 02:08:17 +00:00
Evan Cheng	dc5b4c57d7	Replace std::vector<bool> with BitVector. llvm-svn: 46104	2008-01-17 00:35:26 +00:00
Evan Cheng	7be1528004	Fixes a nasty dag combiner bug that causes a bunch of tests to fail at -O0. It's not safe to use the two value CombineTo variant to combine away a dead load. e.g. v1, chain2 = load chain1, loc v2, chain3 = load chain2, loc v3 = add v2, c Now we replace use of v1 with undef, use of chain2 with chain1. ReplaceAllUsesWith() will iterate through uses of the first load and update operands: v1, chain2 = load chain1, loc v2, chain3 = load chain1, loc v3 = add v2, c Now the second load is the same as the first load, SelectionDAG cse will ensure the use of second load is replaced with the first load. v1, chain2 = load chain1, loc v3 = add v1, c Then v1 is replaced with undef and bad things happen. llvm-svn: 46099	2008-01-16 23:11:54 +00:00
Dale Johannesen	ed20366706	Do not mark EH tables no-dead-strip unless the associated function is so marked. llvm-svn: 46088	2008-01-16 19:59:28 +00:00
Chris Lattner	52188501f6	Fix a ppc long double regression I introduced yesterday due to a simplification. This fixes automotive-basicmath on PPC. llvm-svn: 46072	2008-01-16 17:59:31 +00:00
Chris Lattner	7ca4d5b1f3	merge a few pieces of code that do the store/load to stack pattern to use EmitStackConvert now. llvm-svn: 46066	2008-01-16 07:51:34 +00:00
Chris Lattner	87bc3e7ece	rename ExpandBIT_CONVERT to EmitStackConvert, generalizing it to allow it to emit different load and store kinds. llvm-svn: 46065	2008-01-16 07:45:30 +00:00
Chris Lattner	a2c7ff3386	simplify a bunch of code by using SelectionDAG::CreateStackTemporary instead of inlining its body. llvm-svn: 46062	2008-01-16 07:03:22 +00:00
Chris Lattner	91d86242f9	Change legalizeop of FP_ROUND and FP_EXTEND to not fall through into the ANY_EXTEND/ZERO_EXTEND/SIGN_EXTEND code to simplify it. Unmerge the code for FP_ROUND and FP_EXTEND from each other to make each one simpler. llvm-svn: 46061	2008-01-16 06:57:07 +00:00
Chris Lattner	2e50a6f90f	Factor the ReachesChainWithoutSideEffects out of dag combiner into a public SDOperand::reachesChainWithoutSideEffects method. No functionality change. llvm-svn: 46050	2008-01-16 05:49:24 +00:00
Dale Johannesen	59a2250b0d	Fix and enable EH for x86-64 Darwin. Adds ShortenEHDataFor64Bits as a not-very-accurate abstraction to cover all the changes in DwarfWriter. Some cosmetic changes to Darwin assembly code for gcc testsuite compatibility. llvm-svn: 46029	2008-01-15 23:24:56 +00:00
Owen Anderson	897aed9109	Move some calls to getVRegDef higher in the callgraph, so they don't get executed as frequently in performance sensitive code. llvm-svn: 46027	2008-01-15 22:58:11 +00:00
Chris Lattner	ec224888a6	The type of the 'abort' node should be pointer type (because it's a function pointer) not MVT::Other. This fixes builtin_trap lowering on ppc, alpha, ia64 llvm-svn: 46018	2008-01-15 22:09:33 +00:00
Owen Anderson	1ba66e0cec	Remove DefInst from LiveVariables::VarInfo. Use the facilities on MachineRegisterInfo instead. llvm-svn: 46016	2008-01-15 22:02:46 +00:00
Chris Lattner	ee8df1f4d3	Add support for targets that have a legal ISD::TRAP. llvm-svn: 46014	2008-01-15 21:58:08 +00:00
Evan Cheng	eb30bb7d29	Oops. Forgot to commit this. llvm-svn: 46002	2008-01-15 07:49:36 +00:00
Anton Korobeynikov	6bbbc4cbfa	For PR1839: add initial support for __builtin_trap. llvm-gcc part is missed as well as PPC codegen llvm-svn: 46001	2008-01-15 07:02:33 +00:00
Evan Cheng	5b212ea818	ByVal stack slot alignment should be at least as large as pointer ABI alignment. llvm-svn: 45995	2008-01-15 03:14:05 +00:00
Chris Lattner	994718417a	don't create the post-ra scheduler unless it is enabled. llvm-svn: 45972	2008-01-14 19:00:06 +00:00
Chris Lattner	4272c12571	remove dead #include llvm-svn: 45971	2008-01-14 18:45:28 +00:00
Duncan Sands	08c728b519	Remove the assumption that byval has been applied to a pointer to a struct. llvm-svn: 45939	2008-01-13 21:19:59 +00:00
Chris Lattner	08af5a9dad	implement support for sinking a load out the bottom of a block that has no stores between the load and the end of block. This works great and sinks hundreds of stores, but we can't turn it on because machineinstrs don't have volatility information and we don't want to sink volatile stores :( llvm-svn: 45894	2008-01-12 00:17:41 +00:00
Chris Lattner	c8226f32e9	Simplify the side effect stuff a bit more and make licm/sinking both work right according to the new flags. This removes the TII::isReallySideEffectFree predicate, and adds TII::isInvariantLoad. It removes NeverHasSideEffects+MayHaveSideEffects and adds UnmodeledSideEffects as machine instr flags. Now the clients can decide everything they need. I think isRematerializable can be implemented in terms of the flags we have now, though I will let others tackle that. llvm-svn: 45843	2008-01-10 23:08:24 +00:00
Chris Lattner	f3bd2cd37c	Clamp down on sinking of lots of instructions. llvm-svn: 45841	2008-01-10 22:35:15 +00:00
Duncan Sands	53c954fa86	Output sinl for a long double FSIN node, not sin. Likewise fix up a bunch of other libcalls. While there I remove NEG_F32 and NEG_F64 since they are not used anywhere. This fixes 9 Ada ACATS failures. llvm-svn: 45833	2008-01-10 10:28:30 +00:00
Evan Cheng	f2553ab84f	Only remat loads from immutable stack slots. llvm-svn: 45831	2008-01-10 08:24:38 +00:00
Evan Cheng	8b03bafd37	Simplify some code. llvm-svn: 45830	2008-01-10 08:22:10 +00:00
Owen Anderson	d445b8813f	Don't use LiveVariables::VarInfo::DefInst. llvm-svn: 45815	2008-01-10 03:12:54 +00:00
Dale Johannesen	7ecb3b79c7	Emit unused EH frames for weak definitions on Darwin, because assembler/linker can't cope with weak absolutes. PR 1880. llvm-svn: 45811	2008-01-10 02:03:30 +00:00
Owen Anderson	4f45cef2f9	Get rid of all uses of LiveVariables::VarInfo::DefInst in favor of the equivalent API from MachineRegisterInfo. Once all clients are switched over, the former will be going away. llvm-svn: 45805	2008-01-10 01:36:43 +00:00
Owen Anderson	51b8e20ccf	Add more comments explaining the basics of how the decision of when to rename and when to insert copies is made. llvm-svn: 45799	2008-01-10 00:47:01 +00:00
Owen Anderson	8958a78576	Get rid of the isKillInst predicate. LiveVariables already provides this information. llvm-svn: 45797	2008-01-10 00:33:11 +00:00
Owen Anderson	1c8152ba03	Copies need to be inserted before the first terminator, not at the end of the block. llvm-svn: 45791	2008-01-10 00:01:41 +00:00
Evan Cheng	0e400d4cb7	Special copy SUnit's do not have SDNode's. llvm-svn: 45787	2008-01-09 23:01:55 +00:00
Owen Anderson	436db42a3c	Clean up StrongPHIElimination a bit, and add some more comments to the internal structures. There's still more work to do on this front. llvm-svn: 45783	2008-01-09 22:40:54 +00:00
Owen Anderson	4de0c3978d	StrongPHIElim: Now with even fewer trivial bugs! llvm-svn: 45775	2008-01-09 10:41:39 +00:00
Owen Anderson	77c3fe441b	Fix an infinite recursion bug in InsertCopies. llvm-svn: 45774	2008-01-09 10:32:30 +00:00
Owen Anderson	e0fd9bd35a	Fix some simple bugs. StrongPHIElimination now does not crash on 164.gzip. llvm-svn: 45773	2008-01-09 06:19:05 +00:00
Chris Lattner	51b01bf8a5	Make load->store deletion a bit smarter. This allows us to compile this: void test(long long P) { P ^= 1; } into just: _test: movl 4(%esp), %eax xorl $1, (%eax) ret instead of code like this: _test: movl 4(%esp), %ecx xorl $1, (%ecx) movl 4(%ecx), %edx movl %edx, 4(%ecx) ret llvm-svn: 45762	2008-01-08 23:08:06 +00:00
Owen Anderson	1b0d5c747e	Rename registers that do not need copies. llvm-svn: 45759	2008-01-08 21:54:52 +00:00
Owen Anderson	812e1ea7cf	Actually insert copies now! llvm-svn: 45738	2008-01-08 05:16:15 +00:00
Owen Anderson	47299489ec	Oops, missed one. llvm-svn: 45719	2008-01-07 21:32:09 +00:00
Owen Anderson	bbc6352d1f	Make some predicates static. llvm-svn: 45718	2008-01-07 21:30:40 +00:00
Gordon Henriksen	24db8d383d	Pruning includes. llvm-svn: 45700	2008-01-07 13:30:38 +00:00
Chris Lattner	f3efadcb5b	remove #includage llvm-svn: 45697	2008-01-07 07:42:25 +00:00
Chris Lattner	03ad885039	rename TargetInstrDescriptor -> TargetInstrDesc. Make MachineInstr::getDesc return a reference instead of a pointer, since it can never be null. llvm-svn: 45695	2008-01-07 07:27:27 +00:00
Chris Lattner	fd2e338b85	simplify some code. llvm-svn: 45693	2008-01-07 06:47:00 +00:00
Chris Lattner	e99a6caee4	Rename all the M_* flags to be namespace qualified enums, and switch all clients over to using predicates instead of these flags directly. These are now private values which are only to be used to statically initialize the tables. llvm-svn: 45692	2008-01-07 06:42:05 +00:00
Chris Lattner	08a69ac2f5	add more and significantly better comments to the rest of the machineinstr flags that can be set. Add predicates for the ones lacking it, and switch some clients over to using the predicates instead of Flags directly. llvm-svn: 45690	2008-01-07 06:21:53 +00:00
Chris Lattner	769c86bf63	simplify some code using new predicates llvm-svn: 45689	2008-01-07 05:40:58 +00:00
Chris Lattner	f376c99ea0	rename hasVariableOperands() -> isVariadic(). Add some comments. Evan, please review the comments I added to getNumDefs to make sure that they are accurate, thx. llvm-svn: 45687	2008-01-07 05:19:29 +00:00
Chris Lattner	b0d06b4381	Move a bunch more accessors from TargetInstrInfo to TargetInstrDescriptor llvm-svn: 45680	2008-01-07 03:13:06 +00:00
Chris Lattner	d34c47653e	remove some uses of MachineOpCode, move getSchedClass into TargetInstrDescriptor from TargetInstrInfo. llvm-svn: 45678	2008-01-07 02:46:03 +00:00
Chris Lattner	e55e115616	Add predicates methods to TargetOperandInfo, and switch all clients over to using them, instead of diddling Flags directly. Change the various flags from const variables to enums. llvm-svn: 45677	2008-01-07 02:39:19 +00:00
Gordon Henriksen	c7e991b7c3	Setting GlobalDirective in TargetAsmInfo by default rather than providing a misleading facility. It's used once in the MIPS backend and hardcoded as "\t.globl\t" everywhere else. llvm-svn: 45676	2008-01-07 02:31:11 +00:00
Chris Lattner	a98c679de0	Rename MachineInstr::getInstrDescriptor -> getDesc(), which reflects that it is cheap and efficient to get. Move a variety of predicates from TargetInstrInfo into TargetInstrDescriptor, which makes it much easier to query a predicate when you don't have TII around. Now you can use MI->getDesc()->isBranch() instead of going through TII, and this is much more efficient anyway. Not all of the predicates have been moved over yet. Update old code that used MI->getInstrDescriptor()->Flags to use the new predicates in many places. llvm-svn: 45674	2008-01-07 01:56:04 +00:00
Owen Anderson	0ec92e9d64	Update CodeGen for MRegisterInfo --> TargetInstrInfo changes. llvm-svn: 45673	2008-01-07 01:35:56 +00:00
Gordon Henriksen	2d684b1fbf	Ammending r45669 with a missing file. llvm-svn: 45671	2008-01-07 01:33:09 +00:00
Gordon Henriksen	6047b6e140	With this patch, the LowerGC transformation becomes the ShadowStackCollector, which additionally has reduced overhead with no sacrifice in portability. Considering a function @fun with 8 loop-local roots, ShadowStackCollector introduces the following overhead (x86): ; shadowstack prologue movl L_llvm_gc_root_chain$non_lazy_ptr, %eax movl (%eax), %ecx movl $___gc_fun, 20(%esp) movl $0, 24(%esp) movl $0, 28(%esp) movl $0, 32(%esp) movl $0, 36(%esp) movl $0, 40(%esp) movl $0, 44(%esp) movl $0, 48(%esp) movl $0, 52(%esp) movl %ecx, 16(%esp) leal 16(%esp), %ecx movl %ecx, (%eax) ; shadowstack loop overhead (none) ; shadowstack epilogue movl 48(%esp), %edx movl %edx, (%ecx) ; shadowstack metadata .align 3 ___gc_fun: # __gc_fun .long 8 .space 4 In comparison to LowerGC: ; lowergc prologue movl L_llvm_gc_root_chain$non_lazy_ptr, %eax movl (%eax), %ecx movl %ecx, 48(%esp) movl $8, 52(%esp) movl $0, 60(%esp) movl $0, 56(%esp) movl $0, 68(%esp) movl $0, 64(%esp) movl $0, 76(%esp) movl $0, 72(%esp) movl $0, 84(%esp) movl $0, 80(%esp) movl $0, 92(%esp) movl $0, 88(%esp) movl $0, 100(%esp) movl $0, 96(%esp) movl $0, 108(%esp) movl $0, 104(%esp) movl $0, 116(%esp) movl $0, 112(%esp) ; lowergc loop overhead leal 44(%esp), %eax movl %eax, 56(%esp) leal 40(%esp), %eax movl %eax, 64(%esp) leal 36(%esp), %eax movl %eax, 72(%esp) leal 32(%esp), %eax movl %eax, 80(%esp) leal 28(%esp), %eax movl %eax, 88(%esp) leal 24(%esp), %eax movl %eax, 96(%esp) leal 20(%esp), %eax movl %eax, 104(%esp) leal 16(%esp), %eax movl %eax, 112(%esp) ; lowergc epilogue movl 48(%esp), %edx movl %edx, (%ecx) ; lowergc metadata (none) llvm-svn: 45670	2008-01-07 01:30:53 +00:00
Gordon Henriksen	5180e85675	Enabling the target-independent garbage collection infrastructure by hooking it up to the various compiler pipelines. This doesn't actually add support for any GC algorithms, which means it temporarily breaks a few tests. To be fixed shortly. llvm-svn: 45669	2008-01-07 01:30:38 +00:00
Chris Lattner	a4ce4f6987	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. llvm-svn: 45667	2008-01-06 23:38:27 +00:00
Chris Lattner	10324d0175	rename isStore -> mayStore to more accurately reflect what it captures. llvm-svn: 45656	2008-01-06 08:36:04 +00:00
Duncan Sands	1694a53c5d	Remove an unused variable. llvm-svn: 45655	2008-01-06 07:43:13 +00:00
Chris Lattner	7eac714b41	make this build with newer gcc's llvm-svn: 45637	2008-01-05 23:29:51 +00:00
Nate Begeman	5743da502e	If custom lowering of insert element fails, the result Val will be 0. Don't overwrite a variable used by the fallthrough code path in this case. llvm-svn: 45630	2008-01-05 20:47:37 +00:00
Chris Lattner	647e61a42b	Fix build issue on certain compilers. llvm-svn: 45629	2008-01-05 20:15:42 +00:00
Chris Lattner	ee61d14bf6	The current impl is really trivial, add some comments about how it can be made better. llvm-svn: 45625	2008-01-05 06:47:58 +00:00
Chris Lattner	276178e49f	allow sinking to be enabled for the jit llvm-svn: 45624	2008-01-05 06:14:16 +00:00
Chris Lattner	d11ca169e7	don't sink anything with side effects, this makes lots of stuff work, but sinks almost nothing. llvm-svn: 45617	2008-01-05 02:33:22 +00:00
Chris Lattner	6ec78274df	fix a common crash. llvm-svn: 45614	2008-01-05 01:39:17 +00:00
Owen Anderson	3592b2352d	I should not be allowed to commit when sleepy. llvm-svn: 45608	2008-01-05 00:48:55 +00:00
Bill Wendling	0c209430b4	Don't recalculate the loop info and loop dominators analyses if they're preserved. llvm-svn: 45596	2008-01-04 20:54:55 +00:00
Bill Wendling	118ae4cd61	80-column violations. llvm-svn: 45574	2008-01-04 08:59:18 +00:00
Bill Wendling	3bf5603ce4	Add that this preserves some analyses. llvm-svn: 45573	2008-01-04 08:48:49 +00:00
Bill Wendling	66470d02c3	Move option to enable machine LICM into LLVMTargetMachine.cpp. llvm-svn: 45572	2008-01-04 08:11:03 +00:00
Bill Wendling	d865697016	Call the parent's getAnalysisUsage. llvm-svn: 45571	2008-01-04 07:50:05 +00:00
Chris Lattner	f3edc09f9b	Add a really quick hack at a machine code sinking pass, enabled with --enable-sinking. It is missing validity checks, so it is known broken. However, it is powerful enough to compile this contrived code: void test1(int C, double A, double B, double P) { double Tmp = AA+BB; P = C ? Tmp : A; } into: _test1: movsd 8(%esp), %xmm0 cmpl $0, 4(%esp) je LBB1_2 # entry LBB1_1: # entry movsd 16(%esp), %xmm1 mulsd %xmm1, %xmm1 mulsd %xmm0, %xmm0 addsd %xmm1, %xmm0 LBB1_2: # entry movl 24(%esp), %eax movsd %xmm0, (%eax) ret instead of: _test1: movsd 16(%esp), %xmm0 mulsd %xmm0, %xmm0 movsd 8(%esp), %xmm1 movapd %xmm1, %xmm2 mulsd %xmm2, %xmm2 addsd %xmm0, %xmm2 cmpl $0, 4(%esp) je LBB1_2 # entry LBB1_1: # entry movapd %xmm2, %xmm1 LBB1_2: # entry movl 24(%esp), %eax movsd %xmm1, (%eax) ret woo. llvm-svn: 45570	2008-01-04 07:36:53 +00:00
Chris Lattner	b5c1d9b7da	remove dead #includes and reorder the rest. llvm-svn: 45569	2008-01-04 06:41:45 +00:00
Bill Wendling	0ba4184404	Use the correct MachineRegisterInfo object. llvm-svn: 45499	2008-01-02 21:10:54 +00:00
Bill Wendling	f0b37780ca	Remove dead code. llvm-svn: 45496	2008-01-02 20:47:37 +00:00
Bill Wendling	5da1945cdd	Use the new architecture to get the containing machine basic block for a machine instruction. Also, use "splice" to move the new instruction instead of remove/insert (where it was leaking memory anyway). llvm-svn: 45492	2008-01-02 19:32:43 +00:00
Owen Anderson	eee14601b1	Move some more instruction creation methods from RegisterInfo into InstrInfo. llvm-svn: 45484	2008-01-01 21:11:32 +00:00
Chris Lattner	caaf8aae4d	Make MachineRegisterInfo::getVRegDef more efficient by aiming the keep the def of the vreg at the start of the list, so the list doesn't need to be traversed. llvm-svn: 45483	2008-01-01 21:08:22 +00:00
Chris Lattner	0cb9dd7aa2	switch the register iterator to act more like hte LLVM value iterator: dereferencing it now returns the machineinstr of the use. To get the operand, use I.getOperand(). Add a new MachineRegisterInfo::replaceRegWith, which is basically like Value::replaceAllUsesWith. llvm-svn: 45482	2008-01-01 20:36:19 +00:00
Chris Lattner	39204d76c5	Add a trivial but handy function to efficiently return the machine instruction that defines the specified vreg. Crazy. llvm-svn: 45480	2008-01-01 03:07:29 +00:00
Chris Lattner	961e7427ea	Implement automatically updated def/use lists for all MachineInstr register operands. The lists are currently kept in MachineRegisterInfo, but it does not yet provide an iterator interface to them. llvm-svn: 45477	2008-01-01 01:12:31 +00:00
Chris Lattner	25568e4cef	Fix a problem where lib/Target/TargetInstrInfo.h would include and use a header file from libcodegen. This violates a layering order: codegen depends on target, not the other way around. The fix to this is to split TII into two classes, TII and TargetInstrInfoImpl, which defines stuff that depends on libcodegen. It is defined in libcodegen, where the base is not. llvm-svn: 45475	2008-01-01 01:03:04 +00:00
Duncan Sands	57a60f0466	Fix PR1833 - eh.exception and eh.selector return two values, which means doing extra legalization work. It would be easier to get this kind of thing right if there was some documentation... llvm-svn: 45472	2007-12-31 18:35:50 +00:00
Owen Anderson	7a73ae9a86	Move copyRegToReg from MRegisterInfo to TargetInstrInfo. This is part of the Machine-level API cleanup instigated by Chris. llvm-svn: 45470	2007-12-31 06:32:00 +00:00
Chris Lattner	574e7166e0	properly encapsulate the parent field of MBB and MI with get/set accessors. llvm-svn: 45469	2007-12-31 04:56:33 +00:00
Chris Lattner	21ec2b4769	update a couple of references to SSARegMap. llvm-svn: 45468	2007-12-31 04:16:08 +00:00
Chris Lattner	a10fff51d9	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LoveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	a5bb370aa4	Add new shorter predicates for testing machine operands for various types: e.g. MO.isMBB() instead of MO.isMachineBasicBlock(). I don't plan on switching everything over, so new clients should just start using the shorter names. Remove old long accessors, switching everything over to use the short accessor: getMachineBasicBlock() -> getMBB(), getConstantPoolIndex() -> getIndex(), setMachineBasicBlock -> setMBB(), etc. llvm-svn: 45464	2007-12-30 23:10:15 +00:00
Chris Lattner	6005589faf	More cleanups for MachineOperand: - Eliminate the static "print" method for operands, moving it into MachineOperand::print. - Change various set* methods for register flags to take a bool for the value to set it to. Remove unset* methods. - Group methods more logically by operand flavor in MachineOperand.h llvm-svn: 45461	2007-12-30 21:56:09 +00:00
Chris Lattner	c98c0e57eb	MachineOperand: - Add getParent() accessors. - Move SubReg out of the AuxInfo union, to make way for future changes. - Remove the getImmedValue/setImmedValue methods. - in some MachineOperand::Create* methods, stop initializing fields that are dead. MachineInstr: - Delete one copy of the MachineInstr printing code, now there is only one dump format and one copy of the code. - Make MachineOperand use the parent field to get info about preg register names if no target info is otherwise available. - Move def/use/kill/dead flag printing to the machineoperand printer, so they are always printed for an operand. llvm-svn: 45460	2007-12-30 21:31:53 +00:00
Chris Lattner	96317d2412	fix typo duncan noticed! llvm-svn: 45459	2007-12-30 21:21:10 +00:00
Chris Lattner	35fececec9	simpilfy some register printing code. llvm-svn: 45458	2007-12-30 21:08:36 +00:00
Chris Lattner	383a873a9a	eliminate a copy of the machineoperand printing stuff. Keep the copy that knows how to print offsets. llvm-svn: 45457	2007-12-30 21:03:30 +00:00
Chris Lattner	49bd29daa0	Simplify and clean up some machine operand/instr printing/dumping stuff. llvm-svn: 45456	2007-12-30 21:01:27 +00:00
Chris Lattner	0dad74d252	two register machineoperands are not identical unless their subregs match. llvm-svn: 45455	2007-12-30 20:55:08 +00:00
Chris Lattner	81798417dc	MachineOperand::getImmedValue -> MachineOperand::getImm llvm-svn: 45454	2007-12-30 20:50:28 +00:00
Chris Lattner	3c6ce5b43c	make machine operands fatter: give each one an up-pointer to the machineinstr that owns it. llvm-svn: 45449	2007-12-30 06:11:04 +00:00
Chris Lattner	20421fe936	use simplified operand addition methods. llvm-svn: 45436	2007-12-30 00:57:42 +00:00
Chris Lattner	bbbae8e1ce	use simplified operand addition methods. llvm-svn: 45435	2007-12-30 00:51:11 +00:00
Chris Lattner	e35dfb827f	Start using the simplified methods for adding operands. llvm-svn: 45432	2007-12-30 00:41:17 +00:00
Chris Lattner	c288ff1d78	simplify some code by factoring operand construction better. llvm-svn: 45428	2007-12-30 00:12:25 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Chris Lattner	a087a8d2ce	remove attribution from lib Makefiles. llvm-svn: 45415	2007-12-29 20:09:26 +00:00
Chris Lattner	3b6a82118b	Fold comparisons against a constant nan, and optimize ORD/UNORD comparisons with a constant. This allows us to compile isnan to: _foo: fcmpu cr7, f1, f1 mfcr r2 rlwinm r3, r2, 0, 31, 31 blr instead of: LCPI1_0: ; float .space 4 _foo: lis r2, ha16(LCPI1_0) lfs f0, lo16(LCPI1_0)(r2) fcmpu cr7, f1, f0 mfcr r2 rlwinm r3, r2, 0, 31, 31 blr llvm-svn: 45405	2007-12-29 08:37:08 +00:00
Chris Lattner	2de9b85297	make sure not to zap volatile stores, thanks a lot to Dale for noticing this! llvm-svn: 45402	2007-12-29 07:15:45 +00:00
Chris Lattner	5919b48fe9	don't fold fp_round(fp_extend(load)) -> fp_round(extload) llvm-svn: 45400	2007-12-29 06:55:23 +00:00
Chris Lattner	3f9c6a7260	Delete a store whose input is a load from the same pointer: x = load p store x -> p llvm-svn: 45398	2007-12-29 06:26:16 +00:00
Owen Anderson	bccb8c432d	Flesh out the Briggs implementation a little bit more, fix a few FIXMEs. llvm-svn: 45347	2007-12-24 22:12:23 +00:00
Owen Anderson	e110199916	Sketch out an implementation of Briggs' copy placement algorithm. llvm-svn: 45334	2007-12-23 15:37:26 +00:00
Chris Lattner	de272b1b63	initial code for forming an FGETSIGN node. This is disabled until legalizer support goes in. llvm-svn: 45323	2007-12-22 21:35:38 +00:00
Chris Lattner	afc8f13bf5	improve support for fgetsign llvm-svn: 45322	2007-12-22 21:26:52 +00:00
Chris Lattner	efd1cddb5a	Tell TargetLoweringOpt whether it is running before or after legalize. llvm-svn: 45321	2007-12-22 20:56:36 +00:00
Chris Lattner	843cad4df2	Add a new FGETSIGN operation, which defaults to expand on all targets. llvm-svn: 45320	2007-12-22 20:47:56 +00:00
Gordon Henriksen	41689b52ab	Use getIntrinsicID instead of looking up intrinsic prototypes. Also fixes a bug with indirect calls. (Test case will be included with ocaml collector patch.) llvm-svn: 45316	2007-12-22 17:27:01 +00:00
Owen Anderson	5a4c05d047	Note what still needs doing. llvm-svn: 45310	2007-12-22 04:59:10 +00:00
Owen Anderson	4534100765	Remove critical edge breaking. It won't be necessary as long as we are very careful when inserting copies. llvm-svn: 45309	2007-12-22 04:50:11 +00:00
Evan Cheng	f989141d30	More accurate checks for two-address constraints. llvm-svn: 45259	2007-12-20 09:25:31 +00:00
Evan Cheng	a509537e25	The physical register + virtual register joining requirement was much too strict. llvm-svn: 45253	2007-12-20 02:23:25 +00:00
Evan Cheng	61bc51ee97	Bring back a burr scheduling heuristic that's still needed. llvm-svn: 45252	2007-12-20 02:22:36 +00:00
Bill Wendling	65c001e6bc	Updated comments to reflect what "side effects" means in this situation. llvm-svn: 45245	2007-12-20 01:08:10 +00:00
Duncan Sands	e9d8861cdf	Simplify LowerCallTo by using a callsite. llvm-svn: 45198	2007-12-19 09:48:52 +00:00
Duncan Sands	030bce7b83	The C++ exception handling personality function wants to know about calls that cannot throw ('nounwind'): if such a call does throw for some reason then the personality will terminate the program. The distinction between an ordinary call and a nounwind call is that an ordinary call gets an entry in the exception table but a nounwind call does not. This patch sets up the exception table appropriately. One oddity is that I've chosen to bracket nounwind calls with labels (like invokes) - the other choice would have been to bracket ordinary calls with labels. While bracketing ordinary calls is more natural (because bracketing by labels would then correspond exactly to getting an entry in the exception table), I didn't do it because introducing labels impedes some optimizations and I'm guessing that ordinary calls occur more often than nounwind calls. This fixes the gcc filter2 eh test, at least at -O0 (the inliner needs some tweaking at higher optimization levels). llvm-svn: 45197	2007-12-19 07:36:31 +00:00
Evan Cheng	9f06e5e2df	Don't leave newly created nodes around if it turns out they are not needed. llvm-svn: 45186	2007-12-19 01:34:38 +00:00
Bill Wendling	166f746246	Add debugging info. Use the newly created "hasUnmodelledSideEffects" method. llvm-svn: 45178	2007-12-18 21:38:04 +00:00
Anton Korobeynikov	95cc3e0e66	Support more insane CEP's in AsmPrinter (Yes, PyPy folks do really use them). llvm-svn: 45172	2007-12-18 20:53:41 +00:00
Evan Cheng	483a969ece	Fix PR1872: SrcValue and SrcValueOffset should not be used to compute load / store node id. llvm-svn: 45167	2007-12-18 19:38:14 +00:00
Evan Cheng	78ced47a2f	Also print alignment and volatileness. llvm-svn: 45164	2007-12-18 19:06:30 +00:00
Evan Cheng	91e0fc9cb4	FIX for PR1799: When a load is unfolded from an instruction, check if it is a new node. If not, do not create a new SUnit. llvm-svn: 45157	2007-12-18 08:42:10 +00:00
Evan Cheng	e2dbba5828	SelectionDAG::dump() should print SrcValue of LoadSDNode and StoreSDNode. llvm-svn: 45151	2007-12-18 07:02:08 +00:00
Duncan Sands	b5a79d0eaa	Make invokes of inline asm legal. Teach codegen how to lower them (with no attempt made to be efficient, since they should only occur for unoptimized code). llvm-svn: 45108	2007-12-17 18:08:19 +00:00
Christopher Lamb	edf0788758	Change the PointerType api for creating pointer types. The old functionality of PointerType::get() has become PointerType::getUnqual(), which returns a pointer in the generic address space. The new prototype of PointerType::get() requires both a type and an address space. llvm-svn: 45082	2007-12-17 01:12:55 +00:00
Owen Anderson	7b8a741189	Break local interferences in StrongPHIElimination. One step closer... llvm-svn: 45070	2007-12-16 05:44:27 +00:00
Owen Anderson	ccb3981256	A few more comments. llvm-svn: 45069	2007-12-16 04:07:23 +00:00
Dan Gohman	8a332b235d	Add explicit keywords, and fix a minor typo that they uncovered. llvm-svn: 45034	2007-12-14 15:41:34 +00:00
Evan Cheng	0fcf56f8f5	Bug fix. Must also match ResNo when matching an operand with a user. llvm-svn: 45028	2007-12-14 08:25:15 +00:00
Owen Anderson	53b677e4e8	Add register pairs to the list to check for local interferences. llvm-svn: 44987	2007-12-13 05:53:03 +00:00
Owen Anderson	1f93edd08a	Remove ugly and horrible code. It's not necessary for correctness, and can be added back later if it causes code quality issues. llvm-svn: 44986	2007-12-13 05:43:37 +00:00
Evan Cheng	6e68381e02	Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. llvm-svn: 44960	2007-12-12 23:12:09 +00:00
Dan Gohman	7a7742c2fe	Allow vector integer constants to be created with SelectionDAG::getConstant, in the same way as vector floating-point constants. This allows the legalize expansion code for @llvm.ctpop and friends to be usable with vector types. llvm-svn: 44954	2007-12-12 22:21:26 +00:00
Owen Anderson	499e5bffcf	Forgot to remove a register from the PHI-union after I'd determined that it interfered with other registers. Seems like that might be a good thing to do. :-) llvm-svn: 44902	2007-12-12 01:25:08 +00:00
Evan Cheng	6766d2fa4f	If deleting a reload instruction due to reuse (value is available in register R and reload is targeting R), make sure to invalidate the kill information of the last kill. llvm-svn: 44894	2007-12-11 23:36:57 +00:00
Bill Wendling	38236ef6cb	Need to grow the indexed map. Added debug statements. llvm-svn: 44892	2007-12-11 23:27:51 +00:00
Bill Wendling	642e15a7cb	Simplify slightly. llvm-svn: 44881	2007-12-11 22:22:22 +00:00
Owen Anderson	f24dd1c1eb	More progress on StrongPHIElimination. Now we actually USE the DomForest! llvm-svn: 44877	2007-12-11 20:12:11 +00:00
Bill Wendling	b678ae7c38	Blark! How in the world did this work without this?! llvm-svn: 44874	2007-12-11 19:40:06 +00:00
Bill Wendling	7717a8a37d	- Update the virtual reg to machine instruction map when hoisting. - Fix subtle bug when creating initially creating this map. llvm-svn: 44873	2007-12-11 19:17:04 +00:00
Bill Wendling	5143d898c8	Checking for "zero operands" during the "CanHoistInst()" method isn't necessary because those with side effects will be caught by other checks in here. Also, simplify the check for a BB in a sub loop. llvm-svn: 44871	2007-12-11 18:45:11 +00:00
Evan Cheng	303417d242	Switch over to MachineLoopInfo. llvm-svn: 44838	2007-12-11 02:09:15 +00:00
Evan Cheng	f54030231e	Pretty print shuffle mask operand. llvm-svn: 44837	2007-12-11 02:08:35 +00:00
Gordon Henriksen	7843c16f31	CollectorMetadata and Collector are rejiggered to get along with per-function collector model. Collector is now the factory for CollectorMetadata, so the latter may be subclassed. llvm-svn: 44827	2007-12-11 00:30:17 +00:00
Owen Anderson	ba61806ef1	A little more progress on StrongPHIElimination, now that I have a better sense of how the CodeGen machinery works. llvm-svn: 44786	2007-12-10 08:07:09 +00:00
Christopher Lamb	d202e03fe5	Improve branch folding by recgonizing that explict successor relationships impact the value of fall-through choices. llvm-svn: 44785	2007-12-10 07:24:06 +00:00
Chris Lattner	64443973c0	Duncan points out that the subtraction is unneeded since hte code knows the vector is not pow2 llvm-svn: 44740	2007-12-09 17:56:34 +00:00
Chris Lattner	69d3298777	Add support for splitting the operand of a return instruction. llvm-svn: 44728	2007-12-09 00:06:19 +00:00
Bill Wendling	3f19dfe794	Reverting 44702. It wasn't correct to rename them. llvm-svn: 44727	2007-12-08 23:58:46 +00:00
Chris Lattner	e48fc80446	add many new cases to SplitResult. SplitResult now handles all the cases that LegalizeDAG does. llvm-svn: 44726	2007-12-08 23:58:27 +00:00
Chris Lattner	de9046af54	Implement splitting support for store, allowing us to compile: %f8 = type <8 x float> define void @test_f8(%f8* %P, %f8* %Q, %f8* %S) { %p = load %f8* %P ; <%f8> [#uses=1] %q = load %f8* %Q ; <%f8> [#uses=1] %R = add %f8 %p, %q ; <%f8> [#uses=1] store %f8 %R, %f8* %S ret void } into: _test_f8: movaps 16(%rdi), %xmm0 addps 16(%rsi), %xmm0 movaps (%rdi), %xmm1 addps (%rsi), %xmm1 movaps %xmm0, 16(%rdx) movaps %xmm1, (%rdx) ret llvm-svn: 44725	2007-12-08 23:24:26 +00:00
Chris Lattner	de87224cd9	implement vector splitting of load, undef, and binops. llvm-svn: 44724	2007-12-08 23:08:49 +00:00
Chris Lattner	1ef437d4e1	implement some methods. llvm-svn: 44723	2007-12-08 22:40:18 +00:00
Chris Lattner	a5e7db115e	add scaffolding for splitting of vectors. llvm-svn: 44722	2007-12-08 22:37:41 +00:00
Chris Lattner	8c8eaf6b92	reorganize header to separate into functional blocks. llvm-svn: 44719	2007-12-08 21:59:32 +00:00
Chris Lattner	4063bd6eae	split scalarization out to its own file. llvm-svn: 44718	2007-12-08 20:30:28 +00:00
Chris Lattner	5c7c46baaf	Split expansion out into its own file. llvm-svn: 44717	2007-12-08 20:27:32 +00:00
Chris Lattner	029c816460	Split promotion support out to its own file. llvm-svn: 44716	2007-12-08 20:24:38 +00:00
Chris Lattner	757d4beba9	Rename LegalizeDAGTypes.cpp -> LegalizeTypes.cpp llvm-svn: 44715	2007-12-08 20:17:13 +00:00
Chris Lattner	92288147b6	Split the class definition of DAGTypeLegalizer out into a header. Leave it visibility hidden, but not in an anon namespace. llvm-svn: 44714	2007-12-08 20:16:06 +00:00
Bill Wendling	2b07d8c5a0	Renaming: isTriviallyReMaterializable -> hasNoSideEffects isReallyTriviallyReMaterializable -> isTriviallyReMaterializable llvm-svn: 44702	2007-12-08 07:17:56 +00:00
Bill Wendling	4375173ba0	Incorporated comments from Evan and Chris: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20071203/056043.html http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20071203/056048.html llvm-svn: 44696	2007-12-08 01:47:01 +00:00
Bill Wendling	fb706bc52b	Initial commit of the machine code LICM pass. It successfully hoists this: _foo: li r2, 0 LBB1_1: ; bb li r5, 0 stw r5, 0(r3) addi r2, r2, 1 addi r3, r3, 4 cmplw cr0, r2, r4 bne cr0, LBB1_1 ; bb LBB1_2: ; return blr to: _foo: li r2, 0 li r5, 0 LBB1_1: ; bb stw r5, 0(r3) addi r2, r2, 1 addi r3, r3, 4 cmplw cr0, r2, r4 bne cr0, LBB1_1 ; bb LBB1_2: ; return blr ZOMG!! :-) Moar to come... llvm-svn: 44687	2007-12-07 21:42:31 +00:00
Evan Cheng	85cdba29b0	Add an option to control this heuristic tweak so I can test it. llvm-svn: 44671	2007-12-07 00:28:32 +00:00
Dale Johannesen	5eff4de9c8	Redo previous patch so optimization only done for i1. Simpler and safer. llvm-svn: 44663	2007-12-06 17:53:31 +00:00
Evan Cheng	8393dc7378	Turning simple splitting on. Start testing new coalescer heuristics as new llcbeta. llvm-svn: 44660	2007-12-06 08:54:31 +00:00
Chris Lattner	eedaf92fcf	third time around: instead of disabling this completely, only disable it if we don't know it will be obviously profitable. Still fixme, but less so. :) llvm-svn: 44658	2007-12-06 07:47:55 +00:00
Chris Lattner	b5fdfb9612	Actually, disable this code for now. More analysis and improvements to the X86 backend are needed before this should be enabled by default. llvm-svn: 44657	2007-12-06 07:44:31 +00:00
Chris Lattner	7c709a5d08	implement a readme entry, compiling the code into: _foo: movl $12, %eax andl 4(%esp), %eax movl _array(%eax), %eax ret instead of: _foo: movl 4(%esp), %eax shrl $2, %eax andl $3, %eax movl _array(,%eax,4), %eax ret As it turns out, this triggers all the time, in a wide variety of situations, for example, I see diffs like this in various programs: - movl 8(%eax), %eax - shll $2, %eax - andl $1020, %eax - movl (%esi,%eax), %eax + movzbl 8(%eax), %eax + movl (%esi,%eax,4), %eax - shll $2, %edx - andl $1020, %edx - movl (%edi,%edx), %edx + andl $255, %edx + movl (%edi,%edx,4), %edx Unfortunately, I also see stuff like this, which can be fixed in the X86 backend: - andl $85, %ebx - addl _bit_count(,%ebx,4), %ebp + shll $2, %ebx + andl $340, %ebx + addl _bit_count(%ebx), %ebp llvm-svn: 44656	2007-12-06 07:33:36 +00:00
Chris Lattner	42558bf664	implement the rest of the functionality from SelectionDAGLegalize::ScalarizeVectorOp llvm-svn: 44654	2007-12-06 05:53:43 +00:00
Dale Johannesen	05bbbda78a	Fix PR1842. llvm-svn: 44649	2007-12-06 01:43:46 +00:00
Evan Cheng	7fc1d98353	Fix for PR1831: if all defs of an interval are re-materializable, then it's a preferred spill candiate. llvm-svn: 44644	2007-12-06 00:01:56 +00:00
Evan Cheng	678b86d6ce	MachineInstr can change. Store indexes instead. llvm-svn: 44612	2007-12-05 10:24:35 +00:00
Evan Cheng	06353b48b5	If a split live interval is spilled again, remove the kill marker on its last use. llvm-svn: 44611	2007-12-05 09:51:10 +00:00
Evan Cheng	64b3baaaea	Clobber more bugs. llvm-svn: 44610	2007-12-05 09:05:34 +00:00
Evan Cheng	d7de56ac93	Fix kill info for split intervals. llvm-svn: 44609	2007-12-05 08:16:32 +00:00
Chris Lattner	c9693c60a5	more scalarization llvm-svn: 44608	2007-12-05 07:45:02 +00:00
Chris Lattner	1a0d49a63c	scalarize vector binops llvm-svn: 44607	2007-12-05 07:36:58 +00:00
Evan Cheng	269dbd31d0	- Mark last use of a split interval as kill instead of letting spiller track it. This allows an important optimization to be re-enabled. - If all uses / defs of a split interval can be folded, give the interval a low spill weight so it would not be picked in case spilling is needed (avoid pushing other intervals in the same BB to be spilled). llvm-svn: 44601	2007-12-05 03:22:34 +00:00
Evan Cheng	bb26301864	Add a argument to storeRegToStackSlot and storeRegToAddr to specify whether the stored register is killed. llvm-svn: 44600	2007-12-05 03:14:33 +00:00
Evan Cheng	e412a4427b	Remove a unsafe optimization. This fixes 401.bzip2. llvm-svn: 44587	2007-12-04 23:57:55 +00:00
Evan Cheng	cd8a89b3cd	Spiller unfold optimization bug: do not clobber a reusable stack slot value unless it can be modified. llvm-svn: 44575	2007-12-04 19:19:45 +00:00
Chris Lattner	b892225fb9	Implement framework for scalarizing node results. This is sufficient to codegen this: define float @test_extract_elt(<1 x float> * %P) { %p = load <1 x float>* %P %R = extractelement <1 x float> %p, i32 0 ret float %R } llvm-svn: 44570	2007-12-04 07:48:46 +00:00
Chris Lattner	681c9d6697	start providing framework for scalarizing vectors. llvm-svn: 44569	2007-12-04 07:29:51 +00:00
Evan Cheng	d1badb960e	Discard split intervals made empty due to folding. llvm-svn: 44565	2007-12-04 00:32:23 +00:00
Evan Cheng	40965448ff	Bug fixes. llvm-svn: 44549	2007-12-03 21:31:55 +00:00
Duncan Sands	38ef3a8ec7	Rather than having special rules like "intrinsics cannot throw exceptions", just mark intrinsics with the nounwind attribute. Likewise, mark intrinsics as readnone/readonly and get rid of special aliasing logic (which didn't use anything more than this anyway). llvm-svn: 44544	2007-12-03 20:06:50 +00:00
Evan Cheng	196faa9dc5	Typo llvm-svn: 44532	2007-12-03 10:00:00 +00:00
Evan Cheng	85ef9834a6	Update kill info for uses of split intervals. llvm-svn: 44531	2007-12-03 09:58:48 +00:00
Evan Cheng	f45a1d623c	Remove redundant foldMemoryOperand variants and other code clean up. llvm-svn: 44517	2007-12-02 08:30:39 +00:00
Evan Cheng	388f6f51a0	Fix a bug where splitting cause some unnecessary spilling. llvm-svn: 44482	2007-12-01 04:42:39 +00:00
Evan Cheng	69fda0a716	Allow some reloads to be folded in multi-use cases. Specifically testl r, r -> cmpl [mem], 0. llvm-svn: 44479	2007-12-01 02:07:52 +00:00
Evan Cheng	b10dc27b20	Do not fold reload into an instruction with multiple uses. It issues one extra load. llvm-svn: 44467	2007-11-30 21:23:43 +00:00
Devang Patel	cc45c338d1	Provide a way to update DescGlobals cache directly. llvm-svn: 44446	2007-11-30 00:51:33 +00:00
Evan Cheng	d35b5acae4	Do not lose rematerialization info when spilling already split live intervals. llvm-svn: 44443	2007-11-29 23:02:50 +00:00
Evan Cheng	8494ee175c	Fix a major performance issue with splitting. If there is a def (not def/use) in the middle of a split basic block, create a new live interval starting at the def. This avoid artifically extending the live interval over a number of cycles where it is dead. e.g. bb1: = vr1204 (use / kill) <= new interval starts and ends here. ... ... vr1204 = (new def) <= start a new interval here. = vr1204 (use) llvm-svn: 44436	2007-11-29 10:12:14 +00:00
Evan Cheng	f85c063ec0	Replace the odd kill# hack with something less fragile. llvm-svn: 44434	2007-11-29 09:49:23 +00:00
Evan Cheng	be255b0650	Fixed various live interval splitting bugs / compile time issues. llvm-svn: 44428	2007-11-29 01:06:25 +00:00
Evan Cheng	147f7799c5	Kill info update bug. llvm-svn: 44427	2007-11-29 01:05:47 +00:00
Duncan Sands	5208d1ab4a	Add some convenience methods for querying attributes, and use them. llvm-svn: 44403	2007-11-28 17:07:01 +00:00
Duncan Sands	45a0c3265f	Add missing newlines at EOF. llvm-svn: 44399	2007-11-28 10:13:38 +00:00
Evan Cheng	c1648b6a0d	Recover compile time regression. llvm-svn: 44386	2007-11-28 01:28:46 +00:00
Owen Anderson	30767b15e9	Add MachineLoopInfo. This is not yet tested. llvm-svn: 44384	2007-11-27 22:47:08 +00:00
Nate Begeman	6f026a654c	Support returning non-power-of-2 vectors to unblock some work llvm-svn: 44371	2007-11-27 19:28:48 +00:00
Duncan Sands	ad0ea2d430	Fix PR1146: parameter attributes are longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Chris Lattner	698b1cb28d	err, no really. llvm-svn: 44352	2007-11-27 06:14:32 +00:00
Chris Lattner	28caf2717a	don't depend on ADL. llvm-svn: 44351	2007-11-27 06:14:12 +00:00
Dan Gohman	9a69341725	Don't lower srem/urem X%C to X-X/C*C unless the division is actually optimized. This avoids creating illegal divisions when the combiner is running after legalize; this fixes PR1815. Also, it produces better code in the included testcase by avoiding the subtract and multiply when the division isn't optimized. llvm-svn: 44341	2007-11-26 23:46:11 +00:00
Chris Lattner	cab915f9cf	Implement expand support for MERGE_VALUEs that only produces one result. llvm-svn: 44304	2007-11-24 19:12:15 +00:00
Chris Lattner	6e3641897b	Implement support for custom legalization in DAGTypeLegalizer::ExpandOperand. Improve a comment. Unbreak Duncan's carefully written path compression where I didn't realize what was happening! llvm-svn: 44301	2007-11-24 18:11:42 +00:00
Chris Lattner	f81d5886c6	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Duncan Sands	b87dde7e8e	Fix a bug in which node A is replaced by node B, but later node A gets back into the DAG again because it was hiding in one of the node maps: make sure that node replacement happens in those maps too. llvm-svn: 44263	2007-11-21 16:43:19 +00:00
Dale Johannesen	763e110a9f	Fix .eh table linkage issues on Darwin. Some EH support for Darwin PPC, but it's not fully working yet. llvm-svn: 44258	2007-11-20 23:24:42 +00:00
Chris Lattner	09c0393d5e	ExpandUnalignedLoad doesn't handle vectors right at all apparently. Fix a couple of problems: 1. Don't assume the VT-1 is a VT that is half the size. 2. Treat vectors of FP in the vector path, not the FP path. This has a couple of remaining problems before it will work with the code in PR1811: the code below this change assumes that it can use extload/shift/or to construct the result, which isn't right for vectors. This also doesn't handle vectors of 1 or vectors that aren't pow-2. llvm-svn: 44243	2007-11-19 21:38:03 +00:00
Chris Lattner	6fa95ec19d	Implement vector expand support for shuffle_vector. This fixes PR1811. llvm-svn: 44242	2007-11-19 21:16:54 +00:00
Chris Lattner	67d77945e7	Implement splitting of UNDEF nodes. This is the first step towards fixing PR1811 llvm-svn: 44239	2007-11-19 20:21:32 +00:00
Dan Gohman	36347a26f9	Add support in SplitVectorOp for remainder operators. llvm-svn: 44233	2007-11-19 15:15:03 +00:00
Nate Begeman	d4d45c268c	Add support for vectors to int <-> float casts. llvm-svn: 44204	2007-11-17 03:58:34 +00:00
Evan Cheng	8e22379303	Live interval splitting: When a live interval is being spilled, rather than creating short, non-spillable intervals for every def / use, split the interval at BB boundaries. That is, for every BB where the live interval is defined or used, create a new interval that covers all the defs and uses in the BB. This is designed to eliminate one common problem: multiple reloads of the same value in a single basic block. Note, it does not decrease the number of spills since no copies are inserted so the split intervals are connected through spill and reloads (or rematerialization). The newly created intervals can be spilled again, in that case, since it does not span multiple basic blocks, it's spilled in the usual manner. However, it can reuse the same stack slot as the previously split interval. This is currently controlled by -split-intervals-at-bb. llvm-svn: 44198	2007-11-17 00:40:40 +00:00
Anton Korobeynikov	66b91e66ec	Implement necessary bits for flt_rounds gcc builtin. Codegen bits and llvm-gcc support will follow. llvm-svn: 44182	2007-11-15 23:25:33 +00:00
Nate Begeman	bd117f06ba	Basic non-power-of-2 vector support llvm-svn: 44181	2007-11-15 21:15:26 +00:00
Duncan Sands	d4494352f8	This assertion was bogus. llvm-svn: 44167	2007-11-15 09:54:37 +00:00
Evan Cheng	2c1a50455c	Fix a thinko in post-allocation coalescer. llvm-svn: 44166	2007-11-15 08:13:29 +00:00
Bill Wendling	b3712f8146	Adding debug output during coalescing. llvm-svn: 44154	2007-11-15 02:06:30 +00:00
Bill Wendling	8269925b1e	Need to increment the iterator. llvm-svn: 44153	2007-11-15 00:40:48 +00:00
Anton Korobeynikov	2c6387803e	Fix PIC jump table codegen on x86-32/linux. In fact, such thing should be applied to all targets uses GOT-relative offsets for PIC (Alpha?) llvm-svn: 44108	2007-11-14 09:18:41 +00:00
Evan Cheng	7f02cfa599	Clean up sub-register implementation by moving subReg information back to MachineOperand auxInfo. Previous clunky implementation uses an external map to track sub-register uses. That works because register allocator uses a new virtual register for each spilled use. With interval splitting (coming soon), we may have multiple uses of the same register some of which are of using different sub-registers from others. It's too fragile to constantly update the information. llvm-svn: 44104	2007-11-14 07:59:08 +00:00
Owen Anderson	d8167ab332	Run computeDomForest() on the set of registers that need to be tested for interference. llvm-svn: 44064	2007-11-13 20:13:24 +00:00
Owen Anderson	569ef71e44	Preserve LiveVariables when doing critical edge splitting. llvm-svn: 44063	2007-11-13 20:04:45 +00:00
Dale Johannesen	7a7085f6d3	Add parameter to getDwarfRegNum to permit targets to use different mappings for EH and debug info; no functional change yet. Fix warning in X86CodeEmitter. llvm-svn: 44056	2007-11-13 19:13:01 +00:00
Bill Wendling	f359fed9f9	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Owen Anderson	c520c4b325	Break critical edges coming into blocks with PHI nodes. llvm-svn: 44019	2007-11-12 17:27:27 +00:00
Evan Cheng	be51f28e2b	Refactor some code. llvm-svn: 44010	2007-11-12 06:35:08 +00:00
Owen Anderson	a1cd45213d	As Chris and Evan pointed out, BreakCriticalMachineEdges doesn't really need to be a pass of its own. Instead, move it out into a helper method. llvm-svn: 44002	2007-11-12 01:05:09 +00:00
Hartmut Kaiser	67297144ab	Fixed a strange construct. Please review. llvm-svn: 43960	2007-11-09 19:59:00 +00:00
Duncan Sands	e795efea5b	Move MinAlign to MathExtras.h. llvm-svn: 43944	2007-11-09 13:41:39 +00:00
Duncan Sands	e7a9ac929f	Fix some load/store logic that would be wrong for apints on big-endian machines if the bitwidth is not a multiple of 8. Introduce a new helper, MVT::getStoreSizeInBits, and use it. llvm-svn: 43934	2007-11-09 08:57:19 +00:00
Duncan Sands	bab9dc9433	Add terminating newline. llvm-svn: 43933	2007-11-09 08:30:21 +00:00
Evan Cheng	797d56ff17	Much improved pic jumptable codegen: Then: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry imull $4, %ecx, %ecx leal LJTI1_0-"L1$pb"(%eax), %edx addl LJTI1_0-"L1$pb"(%ecx,%eax), %edx jmpl %edx .align 2 .set L1_0_set_3,LBB1_3-LJTI1_0 .set L1_0_set_2,LBB1_2-LJTI1_0 .set L1_0_set_5,LBB1_5-LJTI1_0 .set L1_0_set_4,LBB1_4-LJTI1_0 LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 Now: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry addl LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax jmpl %eax .align 2 .set L1_0_set_3,LBB1_3-"L1$pb" .set L1_0_set_2,LBB1_2-"L1$pb" .set L1_0_set_5,LBB1_5-"L1$pb" .set L1_0_set_4,LBB1_4-"L1$pb" LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 llvm-svn: 43924	2007-11-09 01:32:10 +00:00
Evan Cheng	f14006f4d6	Didn't mean to check these in. llvm-svn: 43923	2007-11-09 01:28:33 +00:00
Evan Cheng	1bf166312b	Bug fix. Passive nodes are not in SUnitMap. llvm-svn: 43922	2007-11-09 01:27:11 +00:00
Owen Anderson	65d2fcdd2a	This preserves critical edge breaking. llvm-svn: 43911	2007-11-08 22:23:57 +00:00
Owen Anderson	3bc8124a66	Make BreakCriticalMachineEdges available as a pass that can be depended on. llvm-svn: 43910	2007-11-08 22:20:23 +00:00
Evan Cheng	ece4c68b82	If both parts of smul_lohi, etc. are used, don't simplify. If only one part is used, try simplify it. llvm-svn: 43888	2007-11-08 09:25:29 +00:00
Owen Anderson	0be8c1dafe	Add the majority of machine-level critical edge breaking pass. Most of this was written by Fernando, cleanup and updating to TOT by me. This still needs a bit of work, particularly to handle jump tables properly. llvm-svn: 43885	2007-11-08 07:55:43 +00:00
Owen Anderson	bfbc12973d	Take another stab at getting isLiveIn() and isLiveOut() right. llvm-svn: 43869	2007-11-08 01:32:45 +00:00
Owen Anderson	9d86ef12c8	Bring UsedBlocks back. StrongPHIElimination needs this information. llvm-svn: 43866	2007-11-08 01:20:48 +00:00
Evan Cheng	e742ee1dbe	Simplify my (il)logic. llvm-svn: 43819	2007-11-07 08:08:25 +00:00
Owen Anderson	c6a5387d09	Add some more of StrongPHIElim. llvm-svn: 43805	2007-11-07 05:17:15 +00:00
Dan Gohman	ccfc028283	Remainder operations must be either integer or floating-point. llvm-svn: 43781	2007-11-06 22:11:54 +00:00
Evan Cheng	dd71a5c37b	When the allocator rewrite a spill register with new virtual register, it replaces other operands of the same register. Watch out for situations where only some of the operands are sub-register uses. llvm-svn: 43776	2007-11-06 21:12:10 +00:00
Evan Cheng	d5d59ad634	First step towards moving the coalescer to priority_queue based machinery. llvm-svn: 43764	2007-11-06 08:52:21 +00:00
Evan Cheng	92d23e5204	Fix a bug where a def use operand isn't being detected as a sub-register use. llvm-svn: 43763	2007-11-06 08:50:44 +00:00
Evan Cheng	2dbffa4e76	Add pseudo dependency to force two-address instruction to be scheduled after other uses. There was a overly restricted check that prevented some obvious cases. llvm-svn: 43762	2007-11-06 08:44:59 +00:00
Owen Anderson	d378cea030	Add a few comments. llvm-svn: 43755	2007-11-06 05:26:02 +00:00
Owen Anderson	eb964eb2c8	DomForest is a forest of registers, not instructions. llvm-svn: 43754	2007-11-06 05:22:43 +00:00
Owen Anderson	a9057f0b97	StrongPHIElimination requires LiveVariables. llvm-svn: 43751	2007-11-06 04:49:43 +00:00
Dan Gohman	08143e397d	Add support for vector remainder operations. llvm-svn: 43744	2007-11-05 23:35:22 +00:00
Rafael Espindola	fa0df55bdd	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Dale Johannesen	4646aa3e33	Make labels work in asm blocks; allow labels as parameters. Rename ValueRefList to ParamList in AsmParser, since its only use is for parameters. llvm-svn: 43734	2007-11-05 21:20:28 +00:00
Duncan Sands	f7ae8bd090	Don't output ABI size padding twice. By using the store size for the field we get ABI padding automatically, so no need to put it in again when we emit the field. llvm-svn: 43720	2007-11-05 18:03:02 +00:00
Evan Cheng	8bb30184a8	Move SimpleRegisterCoalescing.h to lib/CodeGen since there is now a common register coalescer interface: RegisterCoalescing. llvm-svn: 43714	2007-11-05 17:41:38 +00:00
Evan Cheng	17b0e3e1ae	Skip over deleted val#'s. llvm-svn: 43700	2007-11-05 06:46:45 +00:00
Evan Cheng	a406b47f14	Handle cases where a register and one of its super-register are both marked as defined on the same instruction. This fixes PR1767. llvm-svn: 43699	2007-11-05 03:11:55 +00:00
Evan Cheng	a8044084ac	Fix PR1187. llvm-svn: 43692	2007-11-05 00:59:10 +00:00
Duncan Sands	283207a71c	Eliminate the remaining uses of getTypeSize. This should only effect x86 when using long double. Now 12/16 bytes are output for long double globals (the exact amount depends on the alignment). This brings globals in line with the rest of LLVM: the space reserved for an object is now always the ABI size. One tricky point is that only 10 bytes should be output for long double if it is a field in a packed struct, which is the reason for the additional argument to EmitGlobalConstant. llvm-svn: 43688	2007-11-05 00:04:43 +00:00
Owen Anderson	eea82746b3	Another step of stronger PHI elimination down. llvm-svn: 43684	2007-11-04 22:33:26 +00:00
Evan Cheng	5c1b044899	If an interval is being undone clear its preference as well since the source interval may have been undone as well. llvm-svn: 43670	2007-11-04 08:32:21 +00:00
Evan Cheng	66298e226f	There are times when the coalescer would not coalesce away a copy but the copy can be eliminated by the allocator is the destination and source targets the same register. The most common case is when the source and destination registers are in different class. For example, on x86 mov32to32_ targets GR32_ which contains a subset of the registers in GR32. The allocator can do 2 things: 1. Set the preferred allocation for the destination of a copy to that of its source. 2. After allocation is done, change the allocation of a copy destination (if legal) so the copy can be eliminated. This eliminates 443 extra moves from 403.gcc. llvm-svn: 43662	2007-11-03 07:20:12 +00:00
Dan Gohman	d7917b6248	Add std:: to sort calls. llvm-svn: 43652	2007-11-02 22:24:01 +00:00
Dan Gohman	c981d72d1a	Change illegal uses of ++ to uses of STLExtra.h's next function. llvm-svn: 43651	2007-11-02 22:22:02 +00:00
Evan Cheng	f851163c53	One more extract_subreg coalescing bug. llvm-svn: 43644	2007-11-02 17:35:08 +00:00
Duncan Sands	04059dd351	Fix a thinko. llvm-svn: 43639	2007-11-02 15:18:06 +00:00
Duncan Sands	44b8721de8	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Evan Cheng	fe1ac52836	- Coalesce extract_subreg when both intervals are relatively small. - Some code clean up. llvm-svn: 43606	2007-11-01 06:22:48 +00:00
Duncan Sands	3b4668a5d8	Promotion of sdiv/srem/udiv/urem. llvm-svn: 43551	2007-10-31 08:57:43 +00:00
Duncan Sands	21ca939683	Add a newline at the end of the file. llvm-svn: 43550	2007-10-31 08:49:24 +00:00
Owen Anderson	0b59fa0605	Add the skeleton of a better PHI elimination pass. llvm-svn: 43542	2007-10-31 03:37:57 +00:00
Owen Anderson	9b8f34f2ac	Some fixes to get MachineDomTree working better. llvm-svn: 43541	2007-10-31 03:30:14 +00:00
Dale Johannesen	b066c1f216	Make i64=expand_vector_elt(v2i64) work in 32-bit mode. llvm-svn: 43535	2007-10-31 00:32:36 +00:00
Evan Cheng	0747bc1df6	Typo. llvm-svn: 43511	2007-10-30 20:11:21 +00:00
Duncan Sands	9ad5465005	Add support for expanding trunc stores. Consider storing an i170 on a 32 bit machine. This is first promoted to a trunc-i170 store of an i256. On a little-endian machine this expands to a store of an i128 and a trunc-i42 store of an i128. The trunc-i42 store is further expanded to a trunc-i42 store of an i64, then to a store of an i32 and a trunc-i10 store of an i32. At this point the operand type is legal (i32) and expansion stops (legalization of the trunc-i10 needs to be handled in LegalizeDAG.cpp). On big-endian machines the high bits are stored first, and some bit-fiddling is needed in order to generate aligned stores. llvm-svn: 43499	2007-10-30 12:50:39 +00:00
Duncan Sands	341f093bb1	If a call to getTruncStore is for a normal store, offload to getStore rather than trying to handle both cases at once (the assertions for example assume the store really is truncating). llvm-svn: 43498	2007-10-30 12:40:58 +00:00
Dan Gohman	ae95d72a52	Fix a DAGCombiner abort on a bitcast from a scalar to a vector. llvm-svn: 43470	2007-10-29 20:44:42 +00:00
Evan Cheng	e106e2f142	Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) transformation. Previously, it's restricted by ensuring the number of load uses is one. Now the restriction is loosened up by allowing setcc uses to be "extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq). llvm-svn: 43465	2007-10-29 19:58:20 +00:00
Dan Gohman	1961c28d46	Add explicit keywords. llvm-svn: 43464	2007-10-29 19:52:04 +00:00
Duncan Sands	1826deda68	The guaranteed alignment of ptr+offset is only the minimum of of offset and the alignment of ptr if these are both powers of 2. While the ptr alignment is guaranteed to be a power of 2, there is no reason to think that offset is. For example, if offset is 12 (the size of a long double on x86-32 linux) and the alignment of ptr is 8, then the alignment of ptr+offset will in general be 4, not 8. Introduce a function MinAlign, lifted from gcc, for computing the minimum guaranteed alignment. I've tried to fix up everywhere under lib/CodeGen/SelectionDAG/. I also changed some places that weren't wrong (because both values were a power of 2), as a defensive change against people copying and pasting the code. Hopefully someone who cares about alignment will review the rest of LLVM and fix up the remaining places. Since I'm on x86 I'm not very motivated to do this myself... llvm-svn: 43421	2007-10-28 12:59:45 +00:00
Bill Wendling	6d15b32c15	- Remove the hacky code that forces a memcpy. Alignment is taken care of in the FE. - Explicitly pass in the alignment of the load & store. - XFAIL 2007-10-23-UnalignedMemcpy.ll because llc has a bug that crashes on unaligned pointers. llvm-svn: 43398	2007-10-26 20:24:42 +00:00
Bill Wendling	f73340efb9	Changed XXX to FIXME, and added comment to the README file llvm-svn: 43359	2007-10-25 19:49:32 +00:00
Bill Wendling	5f7ed00d44	Added comment explaining why we are doing this check. llvm-svn: 43353	2007-10-25 18:23:45 +00:00
Duncan Sands	d385f0759c	Small formatting changes. Add a sanity check. Use NVT rather than looking it up, since we have it to hand. llvm-svn: 43341	2007-10-25 12:35:51 +00:00
Duncan Sands	a8f4ba6eb9	Promote SETCC operands. llvm-svn: 43340	2007-10-25 12:32:31 +00:00
Duncan Sands	cf0da03312	Correctly extract the ValueType from a VTSDNode. llvm-svn: 43339	2007-10-25 12:30:51 +00:00
Dale Johannesen	a4a972e32d	Another expansion for i64 multiply, suitable for PPC. llvm-svn: 43314	2007-10-24 22:26:08 +00:00
Bill Wendling	38ccabcae9	Fix comment and use the "Size" variable that's already provided. llvm-svn: 43271	2007-10-23 23:36:57 +00:00
Bill Wendling	e3b859298a	If there's an unaligned memcpy to/from the stack, don't lower it. Just call the memcpy library function instead. llvm-svn: 43270	2007-10-23 23:32:40 +00:00
Bill Wendling	6f149c0571	This broke lots. Reverting. llvm-svn: 43264	2007-10-23 22:04:26 +00:00
Bill Wendling	8971440e56	Lowering a memcpy to the stack is killing PPC. The ARM and X86 backends already have their own custom memcpy lowering code. This code needs to be factored out into a target-independent lowering method with hooks to the backend. In the meantime, just call memcpy if we're trying to copy onto a stack. llvm-svn: 43262	2007-10-23 21:30:25 +00:00
Evan Cheng	5d7032bb08	It's possible to commute instrctions with more than 3 operands. llvm-svn: 43256	2007-10-23 20:14:40 +00:00
Evan Cheng	847d42a85c	isSubRegOf() is a dup of isSubRegister. llvm-svn: 43249	2007-10-23 06:51:50 +00:00
Evan Cheng	5163a8f53e	Add missing paratheses. llvm-svn: 43227	2007-10-22 19:42:28 +00:00
Duncan Sands	941db4da0a	Support for expanding extending loads of integers with funky bit-widths. llvm-svn: 43225	2007-10-22 19:00:05 +00:00
Duncan Sands	8fc995069b	Fix up the logic for result expanding the various extension operations so they work right for integers with funky bit-widths. For example, consider extending i48 to i64 on a 32 bit machine. The i64 result is expanded to 2 x i32. We know that the i48 operand will be promoted to i64, then also expanded to 2 x i32. If we had the expanded promoted operand to hand, then expanding the result would be trivial. Unfortunately at this stage we can only get hold of the promoted operand. So instead we kind of hand-expand, doing explicit shifting and truncating to get the top and bottom halves of the i64 operand into 2 x i32, which are then used to expand the result. This is harmless, because when the promoted operand is finally expanded all this bit fiddling turns into trivial operations which are eliminated either by the expansion code itself or the DAG combiner. llvm-svn: 43223	2007-10-22 18:26:21 +00:00
Evan Cheng	8557603781	- Only perform the unfolding optimization when the folding in question is modref. - Remove a bogus assertion. llvm-svn: 43211	2007-10-22 03:01:44 +00:00
Chris Lattner	36f06c80e6	Add promote operand support for [su]int_to_fp. llvm-svn: 43204	2007-10-20 22:57:56 +00:00
Chris Lattner	2ba4b148f3	Add result promotion of FP_TO_*INT, fixing CodeGen/X86/trunc-to-bool.ll with the new legalizer. llvm-svn: 43199	2007-10-20 04:32:38 +00:00
Chris Lattner	1c87f0c620	simplify some code. llvm-svn: 43198	2007-10-20 04:09:48 +00:00
Chris Lattner	2bcac640b7	Implement promote and expand for operands of memcpy and friends. This fixes CodeGen/X86/mem*.ll. llvm-svn: 43197	2007-10-20 04:07:07 +00:00
Evan Cheng	f12967124c	Added missing curly braces which renders the if clause useless in debug build. llvm-svn: 43196	2007-10-20 04:01:47 +00:00
Dale Johannesen	771188cf60	Fix a few places vector operations were not getting the operand's type from the right place. llvm-svn: 43195	2007-10-20 00:07:52 +00:00
Evan Cheng	35ff79370b	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Bill Wendling	ac5c93040f	Don't branch fold inline asm statements. llvm-svn: 43191	2007-10-19 21:09:55 +00:00
Duncan Sands	a87c9e4b75	Add support for a few more nodes. llvm-svn: 43190	2007-10-19 20:29:48 +00:00
Dale Johannesen	6802d0c96f	Redo "last ppc long double fix" as Chris wants. llvm-svn: 43189	2007-10-19 20:29:00 +00:00
Chris Lattner	064c31ebac	Fix a really nasty vector miscompilation bill recently introduced. llvm-svn: 43181	2007-10-19 16:47:35 +00:00
Chris Lattner	3ea519e56d	rename ExpandOperation to ExpandOperationResult, as suggested by Duncan llvm-svn: 43177	2007-10-19 15:28:47 +00:00
Duncan Sands	a9953e4d0a	Support for expanding ADDE and SUBE. llvm-svn: 43175	2007-10-19 13:06:17 +00:00
Duncan Sands	d9834b29dd	If the value types are equal then this routine asserts in later checks rather than producing the ordinary load it is supposed to. Avoid all such hassles by directly returning an ordinary load in this case. llvm-svn: 43174	2007-10-19 13:05:40 +00:00
Rafael Espindola	846c19dd70	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add a "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Chris Lattner	e5a6448533	Implement a few new operations. llvm-svn: 43171	2007-10-19 04:46:45 +00:00
Chris Lattner	e31365eecc	Implement expansion of SINT_TO_FP and UINT_TO_FP operands. llvm-svn: 43170	2007-10-19 04:32:47 +00:00
Chris Lattner	9081d08083	implement support for custom expansion of any node type, in one place. llvm-svn: 43169	2007-10-19 04:14:36 +00:00
Chris Lattner	d01b8ea4a5	Make use of TLI.ExpandOperation, remove softfloat stuff. llvm-svn: 43167	2007-10-19 03:58:25 +00:00
Chris Lattner	3c7ee41c78	add expand support for bit_convert result, even allowing custom expansion. llvm-svn: 43166	2007-10-19 03:33:14 +00:00
Chris Lattner	579db81f1c	add a new target hook. llvm-svn: 43165	2007-10-19 03:31:45 +00:00
Bill Wendling	de16ad1446	Negative indices aren't allowed here. llvm-svn: 43161	2007-10-19 01:10:49 +00:00
Dale Johannesen	10432e5a67	More ppcf128 issues (maybe the last)? llvm-svn: 43160	2007-10-19 00:59:18 +00:00
Bill Wendling	070aca5d25	Pointer arithmetic should be done with the index the same size as the pointer. llvm-svn: 43120	2007-10-18 08:32:37 +00:00
Duncan Sands	cb7aca0dcb	Support for ADDC/SUBC. llvm-svn: 43119	2007-10-18 08:22:16 +00:00
Evan Cheng	e6a41c066a	Really fix PR1734. Carefully track which register uses are sub-register uses by traversing inverse register coalescing map. llvm-svn: 43118	2007-10-18 07:49:59 +00:00
Dan Gohman	8f518b9875	Add support for ISD::SELECT in SplitVectorOp. llvm-svn: 43072	2007-10-17 14:48:28 +00:00
Duncan Sands	d42c812f4a	Return Expand from getOperationAction for all extended types. This is needed for SIGN_EXTEND_INREG at least. It is not clear if this is correct for other operations. On the other hand, for the various load/store actions it seems to correct to return the type action, as is currently done. Also, it seems that SelectionDAG::getValueType can be called for extended value types; introduce a map for holding these, since we don't really want to extend the vector to be 2^32 pointers long! Generalize DAGTypeLegalizer::PromoteResult_TRUNCATE and DAGTypeLegalizer::PromoteResult_INT_EXTEND to handle the various funky possibilities that apints introduce, for example that you can promote to a type that needs to be expanded. llvm-svn: 43071	2007-10-17 13:49:58 +00:00
Evan Cheng	0dde6e5761	Apply Chris' suggestions. llvm-svn: 43069	2007-10-17 06:53:44 +00:00
Evan Cheng	c8b5397000	One more extract_subreg coalescing bug fix. llvm-svn: 43065	2007-10-17 05:29:37 +00:00
Evan Cheng	9b0a44a2ce	Fix MergeValueInAsValue(). It allows overlapping live ranges but should replace their value numbers with the specified value number. llvm-svn: 43062	2007-10-17 02:13:29 +00:00
Evan Cheng	a6fd8bc97e	Clean up code that calculate MBB live-in's. llvm-svn: 43061	2007-10-17 02:12:22 +00:00
Evan Cheng	8b8c7c9927	Clean up code that calculate MBB live-in's. llvm-svn: 43060	2007-10-17 02:10:22 +00:00
Dale Johannesen	e5facd51cb	Disable attempts to constant fold PPC f128. Remove the assumption that this will happen from various places. llvm-svn: 43053	2007-10-16 23:38:29 +00:00
Evan Cheng	8f644cef0f	Some clean up. llvm-svn: 43043	2007-10-16 21:09:14 +00:00
Evan Cheng	fab7ca89d5	Fix PR1734. llvm-svn: 43035	2007-10-16 19:29:47 +00:00
Duncan Sands	bbbfbe95f7	Initial infrastructure for arbitrary precision integer codegen support. This should have no effect on codegen for other types. Debatable bits: (1) the use (abuse?) of a set in SDNode::getValueTypeList; (2) the length of getTypeToTransformTo, which maybe should be refactored with a non-inline part for extended value types. llvm-svn: 43030	2007-10-16 09:56:48 +00:00
Duncan Sands	052c843559	Fixes due to lack of type-safety for ValueType: (1) ValueType being passed instead of an opcode; (2) ValueType being passed for isVolatile (!) in getLoad. llvm-svn: 43028	2007-10-16 09:07:20 +00:00
Evan Cheng	ecf62cb763	Code clean up. llvm-svn: 43026	2007-10-16 08:04:24 +00:00
Chris Lattner	cece03dd89	implement promotion of select and select_cc, allowing MallocBench/gs to work with type promotion on x86. llvm-svn: 43025	2007-10-16 03:00:22 +00:00
Dan Gohman	9aa4fc5cd6	Teach IntrinsicLowering.cpp about the sin, cos, and pow intrinsics. llvm-svn: 43020	2007-10-15 22:07:31 +00:00
Evan Cheng	04c44712d3	Make CalcLatency() non-recursive. llvm-svn: 43017	2007-10-15 21:33:22 +00:00
Evan Cheng	a5abba65b6	Fix PR1729: watch out for val# with no def. llvm-svn: 42996	2007-10-15 18:33:50 +00:00
Chris Lattner	d6f7d44eae	Move CreateStackTemporary out to SelectionDAG llvm-svn: 42995	2007-10-15 17:48:57 +00:00
Chris Lattner	9eb7a829e6	add a new CreateStackTemporary helper method. llvm-svn: 42994	2007-10-15 17:47:20 +00:00
Chris Lattner	9d5b131e70	implement promotion of BR_CC operands, fixing bisort on ppc. llvm-svn: 42992	2007-10-15 17:16:12 +00:00
Chris Lattner	8555e69def	updates from duncan llvm-svn: 42991	2007-10-15 16:46:29 +00:00
Duncan Sands	f6977d9842	Fix some typos. Call getTypeToTransformTo rather than getTypeToExpandTo. The difference is that getTypeToExpandTo gives the final result of expansion (eg: i128 -> i32 on a 32 bit machine) while getTypeToTransformTo does just one step (i128 -> i64). llvm-svn: 42982	2007-10-15 13:30:18 +00:00
Chris Lattner	3cfb56d489	One mundane change: Change ReplaceAllUsesOfValueWith to optionally take a deleted nodes vector, instead of requiring it. One more significant change: Implement the start of a legalizer that just works on types. This legalizer is designed to run before the operation legalizer and ensure just that the input dag is transformed into an output dag whose operand and result types are all legal, even if the operations on those types are not. This design/impl has the following advantages: 1. When finished, this will significantly reduce the amount of code in LegalizeDAG.cpp. It will remove all the code related to promotion and expansion as well as splitting and scalarizing vectors. 2. The new code is very simple, idiomatic, and modular: unlike LegalizeDAG.cpp, it has no 3000 line long functions. :) 3. The implementation is completely iterative instead of recursive, good for hacking on large dags without blowing out your stack. 4. The implementation updates nodes in place when possible instead of deallocating and reallocating the entire graph that points to some mutated node. 5. The code nicely separates out handling of operations with invalid results from operations with invalid operands, making some cases simpler and easier to understand. 6. The new -debug-only=legalize-types option is very very handy :), allowing you to easily understand what legalize types is doing. This is not yet done. Until the ifdef added to SelectionDAGISel.cpp is enabled, this does nothing. However, this code is sufficient to legalize all of the code in 186.crafty, olden and freebench on an x86 machine. The biggest issues are: 1. Vectors aren't implemented at all yet 2. SoftFP is a mess, I need to talk to Evan about it. 3. No lowering to libcalls is implemented yet. 4. Various operations are missing etc. 5. There are FIXME's for stuff I hax0r'd out, like softfp. Hey, at least it is a step in the right direction :). If you'd like to help, just enable the #ifdef in SelectionDAGISel.cpp and compile code with it. If this explodes it will tell you what needs to be implemented. Help is certainly appreciated. Once this goes in, we can do three things: 1. Add a new pass of dag combine between the "type legalizer" and "operation legalizer" passes. This will let us catch some long-standing isel issues that we miss because operation legalization often obfuscates the dag with target-specific nodes. 2. We can rip out all of the type legalization code from LegalizeDAG.cpp, making it much smaller and simpler. When that happens we can then reimplement the core functionality left in it in a much more efficient and non-recursive way. 3. Once the whole legalizer is non-recursive, we can implement whole-function selectiondags maybe... llvm-svn: 42981	2007-10-15 06:10:22 +00:00
Chris Lattner	b193517eed	One xform performed by LegalizeDAG is transformation of "store of fp" to "store of int". Make two changes: 1) only xform "store of f32" if i32 is a legal type for the target. 2) only xform "store of f64" if either i64 or i32 are legal for the target. 3) if i64 isn't legal, manually lower to 2 stores of i32 instead of letting a later pass of legalize do it. This is ugly, but helps future changes I'm about to commit. llvm-svn: 42980	2007-10-15 05:46:06 +00:00
Chris Lattner	90e0b271df	Add a (disabled by default) way to view the ID of a node. llvm-svn: 42978	2007-10-15 05:32:43 +00:00
Chris Lattner	fbbe570994	remove misleading comment. llvm-svn: 42970	2007-10-14 20:35:12 +00:00
Chris Lattner	ebe491ea9c	If a target doesn't have HasMULHU or HasUMUL_LOHI, ExpandOp would return without lo/hi set. Fall through to making a libcall instead. llvm-svn: 42969	2007-10-14 18:35:05 +00:00
Evan Cheng	8d6da9142c	When coalescing an EXTRACT_SUBREG and the dst register is a physical register, the source register will be coalesced to the super register of the LHS. Properly merge in the live ranges of the resulting coalesced interval that were part of the original source interval to the live interval of the super-register. llvm-svn: 42961	2007-10-14 10:08:34 +00:00
Evan Cheng	cdf3609130	Revert 42908 for now. llvm-svn: 42960	2007-10-14 05:57:21 +00:00
Dale Johannesen	19db093b35	Disable some compile-time optimizations on PPC long double. llvm-svn: 42958	2007-10-14 01:56:47 +00:00
Chris Lattner	f47e30627a	Enhance the truncstore optimization code to handle shifted values and propagate demanded bits through them in simple cases. This allows this code: void foo(char *P) { strcpy(P, "abc"); } to compile to: _foo: ldrb r3, [r1] ldrb r2, [r1, #+1] ldrb r12, [r1, #+2]! ldrb r1, [r1, #+1] strb r1, [r0, #+3] strb r2, [r0, #+1] strb r12, [r0, #+2] strb r3, [r0] bx lr instead of: _foo: ldrb r3, [r1, #+3] ldrb r2, [r1, #+2] orr r3, r2, r3, lsl #8 ldrb r2, [r1, #+1] ldrb r1, [r1] orr r2, r1, r2, lsl #8 orr r3, r2, r3, lsl #16 strb r3, [r0] mov r2, r3, lsr #24 strb r2, [r0, #+3] mov r2, r3, lsr #16 strb r2, [r0, #+2] mov r3, r3, lsr #8 strb r3, [r0, #+1] bx lr testcase here: test/CodeGen/ARM/truncstore-dag-combine.ll This also helps occasionally for X86 and other cases not involving unaligned load/stores. llvm-svn: 42954	2007-10-13 06:58:48 +00:00
Chris Lattner	5e6fe054a2	Add a simple optimization to simplify the input to truncate and truncstore instructions, based on the knowledge that they don't demand the top bits. llvm-svn: 42952	2007-10-13 06:35:54 +00:00
Evan Cheng	b63076504e	Local spiller optimization: Turn this: movswl %ax, %eax movl %eax, -36(%ebp) xorl %edi, -36(%ebp) into movswl %ax, %eax xorl %edi, %eax movl %eax, -36(%ebp) by unfolding the load / store xorl into an xorl and a store when we know the value in the spill slot is available in a register. This doesn't change the number of instructions but reduce the number of times memory is accessed. Also unfold some load folding instructions and reuse the value when similar situation presents itself. llvm-svn: 42947	2007-10-13 02:50:24 +00:00
Evan Cheng	9490e0d078	Optionally create a MachineInstr without default implicit operands. llvm-svn: 42945	2007-10-13 02:23:01 +00:00
Arnold Schwaighofer	1f0da1fefb	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Dale Johannesen	61c574fc51	ppc long double. Implement fabs and fneg. llvm-svn: 42924	2007-10-12 19:02:17 +00:00
Dale Johannesen	a1a4a9ebfa	Implement i64->ppcf128 conversions. llvm-svn: 42919	2007-10-12 17:52:03 +00:00
Evan Cheng	1410b8512c	Did mean to leave this in. INSERT_SUBREG isn't being coalesced yet. llvm-svn: 42916	2007-10-12 17:16:50 +00:00
Dan Gohman	dc35bd79ca	Change the names used for internal labels to use the current function symbol name instead of a codegen-assigned function number. Thanks Evan! :-) llvm-svn: 42908	2007-10-12 14:53:36 +00:00
Dan Gohman	e3583817ac	Fix some corner cases with vectors in copyToRegs and copyFromRegs. llvm-svn: 42907	2007-10-12 14:33:11 +00:00
Dan Gohman	4f056f3c10	Add support to SplitVectorOp for powi, where the second operand is a scalar integer. llvm-svn: 42906	2007-10-12 14:13:46 +00:00
Evan Cheng	11330f7526	Restrict EXTRACT_SUBREG coalescing to avoid negative performance impact. llvm-svn: 42903	2007-10-12 09:15:53 +00:00
Evan Cheng	aa2d6ef81d	EXTRACT_SUBREG coalescing support. The coalescer now treats EXTRACT_SUBREG like (almost) a register copy. However, it always coalesced to the register of the RHS (the super-register). All uses of the result of a EXTRACT_SUBREG are sub- register uses which adds subtle complications to load folding, spiller rewrite, etc. llvm-svn: 42899	2007-10-12 08:50:34 +00:00
Evan Cheng	89d5916921	Some clean up. llvm-svn: 42898	2007-10-12 08:45:27 +00:00
Dale Johannesen	05ff9e8cda	PPC long double. Implement a couple more conversions. llvm-svn: 42888	2007-10-12 01:37:08 +00:00
Dan Gohman	be37007e64	Add intrinsics for sin, cos, and pow. These use llvm_anyfloat_ty, and so may be overloaded with vector types. And add a testcase for codegen for these. llvm-svn: 42885	2007-10-12 00:01:22 +00:00
Dan Gohman	2a7de41682	Codegen support for vector intrinsics. Factor out the code that expands the "nasty scalar code" for unrolling vectors into a separate routine, teach it how to handle mixed vector/scalar operands, as seen in powi, and use it for several operators, including sin, cos, powi, and pow. Add support in SplitVectorOp for fpow, fpowi and for several unary operators. llvm-svn: 42884	2007-10-11 23:57:53 +00:00
Dale Johannesen	6472eb63c2	Implement ppc long double->uint conversion. Make ppc long double constants print. llvm-svn: 42882	2007-10-11 23:32:15 +00:00
Dan Gohman	fd66486950	Add runtime library names for pow. llvm-svn: 42880	2007-10-11 23:09:10 +00:00
Dan Gohman	daee002438	Add an ISD::FPOW node type. llvm-svn: 42879	2007-10-11 23:06:37 +00:00
Arnold Schwaighofer	9ccea99165	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Dale Johannesen	007aa378ad	Next PPC long double bits. First cut at constants. No compile-time support for constant operations yet, just format transformations. Make readers and writers work. Split constants into 2 doubles in Legalize. llvm-svn: 42865	2007-10-11 18:07:22 +00:00
Duncan Sands	56ab90d3ad	Correct swapped arguments to getConstant. llvm-svn: 42824	2007-10-10 09:54:50 +00:00
Dale Johannesen	666323eacd	Next PPC long double bits: ppcf128->i32 conversion. Surprisingly complicated. Adds getTargetNode for 2 outputs, no inputs (missing). llvm-svn: 42822	2007-10-10 01:01:31 +00:00
Evan Cheng	a9830a04eb	Bad choice of variable name. llvm-svn: 42821	2007-10-10 00:11:40 +00:00
Evan Cheng	ad55a6079a	Fix an extremely stupid bug that prevented first round of coalescing (physical registers only) from happening. llvm-svn: 42820	2007-10-09 23:36:27 +00:00
Dan Gohman	5942e5a5fb	Call getFunctionNumber() instead of referencing FunctionNumber directly, for consistency. llvm-svn: 42769	2007-10-08 21:27:12 +00:00
Dan Gohman	a160361c85	Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to use ISD::{S,U}DIVREM and ISD::{S,U}MUL_HIO. Move the lowering code associated with these operators into target-independent in LegalizeDAG.cpp and TargetLowering.cpp. llvm-svn: 42762	2007-10-08 18:33:35 +00:00
Dan Gohman	5c6d0c3b99	DAGCombiner support for UDIVREM/SDIVREM and UMUL_LOHI/SMUL_LOHI. Check if one of the two results unneeded so see if a simpler operator could bs used. Also check to see if each of the two computations could be simplified if they were split into separate operators. Factor out the code that calls visit() so that it can be used for this purpose. llvm-svn: 42759	2007-10-08 17:57:15 +00:00
Dan Gohman	b08c8bfe41	Add convenience overloads of SelectionDAG::getNode that take a SDVTList and individual SDOperand operands. llvm-svn: 42753	2007-10-08 15:49:58 +00:00
Dan Gohman	fadf40a655	In -debug mode, dump SelectionDAGs both before and after the optimization passes. llvm-svn: 42749	2007-10-08 15:12:17 +00:00
Evan Cheng	21a58a72c5	Kill cycle of an live range is always the last use index + 1. llvm-svn: 42742	2007-10-08 06:59:30 +00:00
Neil Booth	5f00973393	convertFromInteger, as originally written, expected sign-extended input. APInt unfortunately zero-extends signed integers, so Dale modified the function to expect zero-extended input. Make this assumption explicit in the function name. llvm-svn: 42732	2007-10-07 11:45:55 +00:00
Evan Cheng	0de312dd7d	Reapply 42677. llvm-svn: 42692	2007-10-06 08:19:55 +00:00
Chris Lattner	82217bd155	revert evan's patch until the header is committed llvm-svn: 42686	2007-10-06 06:08:17 +00:00
Evan Cheng	f4b5d491df	Added DAG xforms. e.g. (vextract (v4f32 s2v (f32 load $addr)), 0) -> (f32 load $addr) (vextract (v4i32 bc (v4f32 s2v (f32 load $addr))), 0) -> (i32 load $addr) Remove x86 specific patterns. llvm-svn: 42677	2007-10-06 02:46:29 +00:00
Dale Johannesen	f864ac96d8	Next powerpc long double bits. Comparisons work, although not well, and shortening FP converts. llvm-svn: 42672	2007-10-06 01:24:11 +00:00
Dale Johannesen	c0154c06d6	First round of ppc long double. call/return and basic arithmetic works. Rename RTLIB long double functions to distinguish different flavors of long double; the lib functions have different names, alas. llvm-svn: 42644	2007-10-05 20:04:43 +00:00
Dan Gohman	12334acbfb	Legalize support for MUL_LOHI and DIVREM. llvm-svn: 42636	2007-10-05 14:17:22 +00:00
Dan Gohman	2682bb6df2	Fix a typo in a comment. llvm-svn: 42635	2007-10-05 14:11:58 +00:00
Dan Gohman	1a77dfba15	Provide names for MUL_LOHI and DIVREM operators. llvm-svn: 42634	2007-10-05 14:11:04 +00:00
Evan Cheng	84d0ebc10a	Chain producing nodes cannot be moved, not chain reading nodes. llvm-svn: 42627	2007-10-05 01:42:35 +00:00
Evan Cheng	991cf47221	Oops. Didn't mean to leave this in. llvm-svn: 42626	2007-10-05 01:39:40 +00:00
Evan Cheng	79e9713b11	If a node that defines a physical register that is expensive to copy. The scheduler will try a number of tricks in order to avoid generating the copies. This may not be possible in case the node produces a chain value that prevent movement. Try unfolding the load from the node before to allow it to be moved / cloned. llvm-svn: 42625	2007-10-05 01:39:18 +00:00
Evan Cheng	4852303bdb	Add a variant of getTargetNode() that takes a vector of MVT::ValueType. llvm-svn: 42620	2007-10-05 01:10:49 +00:00
Evan Cheng	fd11ef4665	Silence a warning. llvm-svn: 42619	2007-10-05 01:09:32 +00:00
Dan Gohman	c731c97fac	Use empty() member functions when that's what's being tested for instead of comparing begin() and end(). llvm-svn: 42585	2007-10-03 19:26:29 +00:00
Dale Johannesen	4d4e77af8e	Rewrite sqrt and powi to use anyfloat. By popular demand. llvm-svn: 42537	2007-10-02 17:43:59 +00:00
Dale Johannesen	b6c05b1f90	Fix stride computations for long double arrays. llvm-svn: 42508	2007-10-01 23:08:35 +00:00
Dan Gohman	9765cc3bbb	Move the code that emits the .file directives so that it runs after the SourceFiles list is fully filled in so that it sees all of the files. llvm-svn: 42506	2007-10-01 22:40:20 +00:00
Evan Cheng	a3a67596f6	Remove simple scheduler. llvm-svn: 42499	2007-10-01 20:44:07 +00:00
Dale Johannesen	c0855f8a88	remove dup comment llvm-svn: 42486	2007-09-30 19:08:12 +00:00
Dale Johannesen	9150652b21	Constant fold int-to-long-double conversions; use APFloat for int-to-float/double; use round-to-nearest for these (implementation-defined, seems to match gcc). llvm-svn: 42484	2007-09-30 18:19:03 +00:00
Gordon Henriksen	f5aa229ede	This is done already. llvm-svn: 42467	2007-09-29 02:23:08 +00:00
Gordon Henriksen	37ca83d4e9	Collector is the base class for garbage collection code generators. This version enhances the previous patch to add root initialization as discussed here: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20070910/053455.html Collector gives its subclasses control over generic algorithms: unsigned NeededSafePoints; //< Bitmask of required safe points. bool CustomReadBarriers; //< Default is to insert loads. bool CustomWriteBarriers; //< Default is to insert stores. bool CustomRoots; //< Default is to pass through to backend. bool InitRoots; //< If set, roots are nulled during lowering. It also has callbacks which collectors can hook: /// If any of the actions are set to Custom, this is expected to /// be overriden to create a transform to lower those actions to /// LLVM IR. virtual Pass *createCustomLoweringPass() const; /// beginAssembly/finishAssembly - Emit module metadata as /// assembly code. virtual void beginAssembly(Module &M, std::ostream &OS, AsmPrinter &AP, const TargetAsmInfo &TAI) const; virtual void finishAssembly(Module &M, CollectorModuleMetadata &CMM, std::ostream &OS, AsmPrinter &AP, const TargetAsmInfo &TAI) const; Various other independent algorithms could be implemented, but were not necessary for the initial two collectors. Some examples are listed here: http://llvm.org/docs/GarbageCollection.html#collector-algos llvm-svn: 42466	2007-09-29 02:13:43 +00:00
Dan Gohman	a90183e7d1	Teach SplitVectorOp how to split INSERT_VECTOR_ELT. llvm-svn: 42457	2007-09-28 23:53:40 +00:00
Evan Cheng	a5e595d23a	If two instructions are both two-address code, favors (schedule closer to terminator) the one that has a CopyToReg use. This fixes 2006-05-11-InstrSched.ll with -new-cc-modeling-scheme. llvm-svn: 42453	2007-09-28 22:32:30 +00:00
Evan Cheng	f72693f36e	Remove a poor scheduling heuristic. llvm-svn: 42443	2007-09-28 19:37:35 +00:00
Evan Cheng	038dcc5136	Trim some unneeded fields. llvm-svn: 42442	2007-09-28 19:24:24 +00:00
Dale Johannesen	789b5a505b	Fix long double -> uint64 conversion. llvm-svn: 42440	2007-09-28 18:44:17 +00:00
Dale Johannesen	6bf69ed3cc	minor long double related changes llvm-svn: 42439	2007-09-28 18:06:58 +00:00
Dan Gohman	25d506c41b	Make the checks for DW_FORM_data4 consistent with the others, and add more such code for DIEDwarfLabel::SizeOf and DIEObjectLabel::SizeOf. llvm-svn: 42435	2007-09-28 16:50:28 +00:00
Dan Gohman	0d23d63b9e	Use 32-bit data directives for DW_FORM_data4 format data, even on targets with 64-bit addresses. llvm-svn: 42434	2007-09-28 15:43:33 +00:00
Dale Johannesen	25a00a63eb	Add sqrt and powi intrinsics for long double. llvm-svn: 42423	2007-09-28 01:08:20 +00:00
Dan Gohman	a1d46c7d0a	TargetAsmInfo::getAddressSize() was incorrect for x86-64 and 64-bit targets other than PPC64. Instead of fixing it, just remove it and fix all the places that use it to use TargetData::getPointerSize() instead, as there aren't very many. Most of the references were in DwarfWriter.cpp. llvm-svn: 42419	2007-09-27 23:12:31 +00:00
Gordon Henriksen	613afce430	CollectorMetadata abstractly describes stack maps for a function. It includes: - location and of each safe point in machine code (identified by a label) - location of each root within the stack frame (identified by an offset), including the metadata tag provided to llvm.gcroot in the user program - size of the stack frame (for collectors which want to cheat on stack crawling :) - and eventually will include liveness It is to be populated by back-ends during code-generation. CollectorModuleMetadata aggregates this information across the entire module. llvm-svn: 42418	2007-09-27 22:18:46 +00:00

... 26 27 28 29 30 ...

6776 Commits