llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	8c0b41d656	Add a somewhat hacky heuristic to do something different from whole-loop rotation. When there is a loop backedge which is an unconditional branch, we will end up with a branch somewhere no matter what. Try placing this backedge in a fallthrough position above the loop header as that will definitely remove at least one branch from the loop iteration, where whole loop rotation may not. I haven't seen any benchmarks where this is important but loop-blocks.ll tests for it, and so this will be covered when I flip the default. llvm-svn: 154812	2012-04-16 13:33:36 +00:00
Chandler Carruth	8c74c7b1c6	Tweak the loop rotation logic to check whether the loop is naturally laid out in a form with a fallthrough into the header and a fallthrough out of the bottom. In that case, leave the loop alone because any rotation will introduce unnecessary branches. If either side looks like it will require an explicit branch, then the rotation won't add any, do it to ensure the branch occurs outside of the loop (if possible) and maximize the benefit of the fallthrough in the bottom. llvm-svn: 154806	2012-04-16 09:31:23 +00:00
Hal Finkel	e0cf6397fd	Remove dead SD nodes after the combining pass. Fixes PR12201. llvm-svn: 154786	2012-04-16 03:33:22 +00:00
Chandler Carruth	ccc7e42b1f	Rewrite how machine block placement handles loop rotation. This is a complex change that resulted from a great deal of experimentation with several different benchmarks. The one which proved the most useful is included as a test case, but I don't know that it captures all of the relevant changes, as I didn't have specific regression tests for each, they were more the result of reasoning about what the old algorithm would possibly do wrong. I'm also failing at the moment to craft more targeted regression tests for these changes, if anyone has ideas, it would be welcome. The first big thing broken with the old algorithm is the idea that we can take a basic block which has a loop-exiting successor and a looping successor and use the looping successor as the layout top in order to get that particular block to be the bottom of the loop after layout. This happens to work in many cases, but not in all. The second big thing broken was that we didn't try to select the exit which fell into the nearest enclosing loop (to which we exit at all). As a consequence, even if the rotation worked perfectly, it would result in one of two bad layouts. Either the bottom of the loop would get fallthrough, skipping across a nearer enclosing loop and thereby making it discontiguous, or it would be forced to take an explicit jump over the nearest enclosing loop to earch its successor. The point of the rotation is to get fallthrough, so we need it to fallthrough to the nearest loop it can. The fix to the first issue is to actually layout the loop from the loop header, and then rotate the loop such that the correct exiting edge can be a fallthrough edge. This is actually much easier than I anticipated because we can handle all the hard parts of finding a viable rotation before we do the layout. We just store that, and then rotate after layout is finished. No inner loops get split across the post-rotation backedge because we check for them when selecting the rotation. That fix exposed a latent problem with our exitting block selection -- we should allow the backedge to point into the middle of some inner-loop chain as there is no real penalty to it, the whole point is that it won't be a fallthrough edge. This may have blocked the rotation at all in some cases, I have no idea and no test case as I've never seen it in practice, it was just noticed by inspection. Finally, all of these fixes, and studying the loops they produce, highlighted another problem: in rotating loops like this, we sometimes fail to align the destination of these backwards jumping edges. Fix this by actually walking the backwards edges rather than relying on loopinfo. This fixes regressions on heapsort if block placement is enabled as well as lots of other cases where the previous logic would introduce an abundance of unnecessary branches into the execution. llvm-svn: 154783	2012-04-16 01:12:56 +00:00
Nadav Rotem	02ef0c3524	When emulating vselect using OR/AND/XOR make sure to bitcast the result back to the original type. llvm-svn: 154764	2012-04-15 15:08:09 +00:00
Andrew Trick	97d5b9cca6	misched: Added CanHandleTerminators. This is a special flag for targets that really want their block terminators in the DAG. The default scheduler cannot handle this correctly, so it becomes the specialized scheduler's responsibility to schedule terminators. llvm-svn: 154712	2012-04-13 23:29:54 +00:00
Benjamin Kramer	330970d658	Reduce malloc traffic in DwarfAccelTable - Don't copy offsets into HashData, the underlying vector won't change once the table is finalized. - Allocate HashData and HashDataContents in a BumpPtrAllocator. - Allocate string map entries in the same allocator. - Random cleanups. llvm-svn: 154694	2012-04-13 20:06:17 +00:00
Sirish Pande	b486144c12	HexagonPacketizer patch. llvm-svn: 154616	2012-04-12 21:06:38 +00:00
Nadav Rotem	9d376b6578	Reapply 154397. Original message: Fix a dagcombine optimization which assumes that the vsetcc result type is always of the same size as the compared values. This is ture for SSE/AVX/NEON but not for all targets. llvm-svn: 154490	2012-04-11 08:26:11 +00:00
Craig Topper	692d584910	Fix an overly indented line. Remove an 'else' after an 'if' that returns. llvm-svn: 154479	2012-04-11 04:55:51 +00:00
Craig Topper	bc680061e8	Inline implVisitAluOverflow by introducing a nested switch to convert the intrinsic to an nodetype. llvm-svn: 154478	2012-04-11 04:34:11 +00:00
Craig Topper	3ef01cdb2e	Optimize code a bit by calling push_back only once in some loops. Reduces compiled code size a bit. llvm-svn: 154473	2012-04-11 03:06:35 +00:00
Jakob Stoklund Olesen	645bdd4b69	Tweak MachineLICM heuristics for cheap instructions. Allow cheap instructions to be hoisted if they are register pressure neutral or better. This happens if the instruction is the last loop use of another virtual register. Only expensive instructions are allowed to increase loop register pressure. llvm-svn: 154455	2012-04-11 00:00:28 +00:00
Jakob Stoklund Olesen	a3e86a604a	Only check for PHI uses inside the current loop. Hoisting a value that is used by a PHI in the loop will introduce a copy because the live range is extended to cross the PHI. The same applies to PHIs in exit blocks. Also use this opportunity to make HasLoopPHIUse() non-recursive. llvm-svn: 154454	2012-04-11 00:00:26 +00:00
Owen Anderson	6f1ee1634d	Move the constant-folding support for FP_ROUND in SelectionDAG from the one-operand version of getNode() to the two-operand version, since it became a two-operand node at sound point. Zap a testcase that this allows us to completely fold away. llvm-svn: 154447	2012-04-10 22:46:53 +00:00
Duncan Sands	4f53074cca	Add a comment noting that the fdiv -> fmul conversion won't generate multiplication by a denormal, and some tests checking that. llvm-svn: 154431	2012-04-10 20:35:27 +00:00
Eric Christopher	e9abba71fe	To ensure that we have more accurate line information for a block don't elide the branch instruction if it's the only one in the block, otherwise it's ok. PR9796 and rdar://11215207 llvm-svn: 154417	2012-04-10 18:18:10 +00:00
Owen Anderson	3efc8f22bd	Revert r154397, which was causing make check failures on the buildbots. llvm-svn: 154414	2012-04-10 18:02:12 +00:00
Nadav Rotem	065564d85a	Fix a dagcombine optimization which assumes that the vsetcc result type is always of the same size as the compared values. This is ture for SSE/AVX/NEON but not for all targets. llvm-svn: 154397	2012-04-10 14:58:31 +00:00
Chandler Carruth	68062617a6	Make a somewhat subtle change in the logic of block placement. Sometimes the loop header has a non-loop predecessor which has been pre-fused into its chain due to unanalyzable branches. In this case, rotating the header into the body of the loop in order to place a loop exit at the bottom of the loop is a Very Bad Idea as it makes the loop non-contiguous. I'm working on a good test case for this, but it's a bit annoynig to craft. I should get one shortly, but I'm submitting this now so I can begin the (lengthy) performance analysis process. An initial run of LNT looks really, really good, but there is too much noise there for me to trust it much. llvm-svn: 154395	2012-04-10 13:35:57 +00:00
Anton Korobeynikov	4d1220de34	Transform div to mul with reciprocal only when fp imm is legal. This fixes PR12516 and uncovers one weird problem in legalize (workarounded) llvm-svn: 154394	2012-04-10 13:22:49 +00:00
Evan Cheng	136861d994	Make the code slightly more palatable. llvm-svn: 154378	2012-04-10 03:15:18 +00:00
Evan Cheng	f8bad08001	Fix a long standing tail call optimization bug. When a libcall is emitted legalizer always use the DAG entry node. This is wrong when the libcall is emitted as a tail call since it effectively folds the return node. If the return node's input chain is not the entry (i.e. call, load, or store) use that as the tail call input chain. PR12419 rdar://9770785 rdar://11195178 llvm-svn: 154370	2012-04-10 01:51:00 +00:00
Rafael Espindola	1d9672bdce	Don't try to zExt just to check if an integer constant is zero, it might not fit in a i64. llvm-svn: 154364	2012-04-10 00:16:22 +00:00
Akira Hatanaka	8483a6c47d	Have TargetLowering::getPICJumpTableRelocBase return a node that points to the GOT if jump table uses 64-bit gp-relative relocation. llvm-svn: 154341	2012-04-09 20:32:12 +00:00
Lang Hames	3ad11ff90f	Patch r153892 for PR11861 apparently broke an external project (see PR12493). This patch restores TwoAddressInstructionPass's pre-r153892 behaviour when rescheduling instructions in TryInstructionTransform. Hopefully this will fix PR12493. To refix PR11861, lowering of INSERT_SUBREGS is deferred until after the copy that unties the operands is emitted (this seems to be a more appropriate fix for that issue anyway). llvm-svn: 154338	2012-04-09 20:17:30 +00:00
Rafael Espindola	8f62b3248e	Pattern match a setcc of boolean value with 0 as a truncate. llvm-svn: 154322	2012-04-09 16:06:03 +00:00
Craig Topper	9c3da316ec	Remove unnecessary type check when combining and/or/xor of swizzles. Move some checks to allow better early out. llvm-svn: 154309	2012-04-09 07:19:09 +00:00
Craig Topper	e5893f64e8	Remove unnecessary 'else' on an 'if' that always returns llvm-svn: 154308	2012-04-09 05:59:53 +00:00
Craig Topper	e3ad4834ae	Optimize code slightly. No functionality change. llvm-svn: 154307	2012-04-09 05:55:33 +00:00
Craig Topper	5894fe430a	Replace some explicit checks with asserts for conditions that should never happen. llvm-svn: 154305	2012-04-09 05:16:56 +00:00
Craig Topper	6148fe65e8	Optimize code a bit. No functional change intended. llvm-svn: 154299	2012-04-08 23:15:04 +00:00
Benjamin Kramer	bb6ff08766	Silence sign-compare warning. llvm-svn: 154297	2012-04-08 19:04:45 +00:00
Duncan Sands	2f1dc3814b	Only have codegen turn fdiv by a constant into fmul by the reciprocal when -ffast-math, i.e. don't just always do it if the reciprocal can be formed exactly. There is already an IR level transform that does that, and it does it more carefully. llvm-svn: 154296	2012-04-08 18:08:12 +00:00
Craig Topper	c8e2d91a58	Simplify code that tries to do vector extracts for shuffles when the mask width and the input vector widths don't match. No need to check the min and max are in range before calculating the start index. The range check after having the start index is sufficient. Also no need to check for an extract from the beginning differently. llvm-svn: 154295	2012-04-08 17:53:33 +00:00
Chandler Carruth	16f0ebcbb5	Move the TLSModel information into the TargetMachine rather than hiding in TargetLowering. There was already a FIXME about this location being odd. The interface is simplified as a consequence. This will also make it easier to change TLS models when compiling with PIE. llvm-svn: 154292	2012-04-08 17:20:55 +00:00
Chandler Carruth	bed1abf9ca	Remove an over zealous assert. The assert was trying to catch places where a chain outside of the loop block-set ended up in the worklist for scheduling as part of the contiguous loop. However, asserting the first block in the chain is in the loop-set isn't a valid check -- we may be forced to drag a chain into the worklist due to one block in the chain being part of the loop even though the first block is not in the loop. This occurs when we have been forced to form a chain early due to un-analyzable branches. No test case here as I have no idea how to even begin reducing one, and it will be hopelessly fragile. We have to somehow end up with a loop header of an inner loop which is a successor of a basic block with an unanalyzable pair of branch instructions. Ow. Self-host triggers it so it is unlikely it will regress. This at least gets block placement back to passing selfhost and the test suite. There are still a lot of slowdown that I don't like coming out of block placement, although there are now also a lot of speedups. =[ I'm seeing swings in both directions up to 10%. I'm going to try to find time to dig into this and see if we can turn this on for 3.1 as it does a really good job of cleaning up after some loops that degraded with the inliner changes. llvm-svn: 154287	2012-04-08 14:37:02 +00:00
Chandler Carruth	49158908dc	Add a debug-only 'dump' method to the BlockChain structure to ease debugging. llvm-svn: 154286	2012-04-08 14:37:01 +00:00
Craig Topper	d024cef233	Turn avx2 vinserti128 intrinsic calls into INSERT_SUBVECTOR DAG nodes and remove patterns for selecting the intrinsic. Similar was already done for avx1. llvm-svn: 154272	2012-04-07 22:32:29 +00:00
Craig Topper	e09d1c5c48	Remove 'else' after 'if' that ends in return. llvm-svn: 154267	2012-04-07 21:23:41 +00:00
Nadav Rotem	71d07ae5cb	1. Remove the part of r153848 which optimizes shuffle-of-shuffle into a new shuffle node because it could introduce new shuffle nodes that were not supported efficiently by the target. 2. Add a more restrictive shuffle-of-shuffle optimization for cases where the second shuffle reverses the transformation of the first shuffle. llvm-svn: 154266	2012-04-07 21:19:08 +00:00
Duncan Sands	5f8397a934	Convert floating point division by a constant into multiplication by the reciprocal if converting to the reciprocal is exact. Do it even if inexact if -ffast-math. This substantially speeds up ac.f90 from the polyhedron benchmarks. llvm-svn: 154265	2012-04-07 20:04:00 +00:00
Eric Christopher	aec8a82694	Patch to set is_stmt a little better for prologue lines in a function. This enables debuggers to see what are interesting lines for a breakpoint rather than any line that starts a function. rdar://9852092 llvm-svn: 154120	2012-04-05 20:39:05 +00:00
Jakob Stoklund Olesen	37492eac8c	Don't break the IV update in TLI::SimplifySetCC(). LSR always tries to make the ICmp in the loop latch use the incremented induction variable. This allows the induction variable to be kept in a single register. When the induction variable limit is equal to the stride, SimplifySetCC() would break LSR's hard work by transforming: (icmp (add iv, stride), stride) --> (cmp iv, 0) This forced us to use lea for the IC update, preventing the simpler incl+cmp. <rdar://problem/7643606> <rdar://problem/11184260> llvm-svn: 154119	2012-04-05 20:30:20 +00:00
Owen Anderson	a6eebf6013	Treat f16 the same as f80/f128 for the purposes of generating constants during instruction selection. llvm-svn: 154113	2012-04-05 18:50:32 +00:00
Pete Cooper	d7290700e6	REG_SEQUENCE expansion to COPY instructions wasn't taking account of sub register indices on the source registers. No simple test case llvm-svn: 154051	2012-04-04 21:03:25 +00:00
Pete Cooper	8a3dc0ed8c	f16 FREM can now be legalized by promoting to f32 llvm-svn: 154039	2012-04-04 19:36:31 +00:00
Jakob Stoklund Olesen	92fd79a639	Remove spurious debug output. llvm-svn: 154032	2012-04-04 18:23:38 +00:00
Rafael Espindola	ba0a6cabb8	Always compute all the bits in ComputeMaskedBits. This allows us to keep passing reduced masks to SimplifyDemandedBits, but know about all the bits if SimplifyDemandedBits fails. This allows instcombine to simplify cases like the one in the included testcase. llvm-svn: 154011	2012-04-04 12:51:34 +00:00
Craig Topper	4c7d995029	Remove default case from switch that was already covering all cases. llvm-svn: 153996	2012-04-04 04:42:42 +00:00
Pete Cooper	e7bff68a5e	Removed useless switch for default case when switch was covering all the enum values llvm-svn: 153984	2012-04-04 00:53:04 +00:00
Pete Cooper	9511ec86f9	Add VSELECT to LegalizeVectorTypes::ScalariseVectorResult. Previously it would crash if it encountered a 1 element VSELECT. Solution is slightly more complicated than just creating a SELET as we have to mask or sign extend the vector condition if it had different boolean contents from the scalar condition. Fixes <rdar://problem/11178095> llvm-svn: 153976	2012-04-03 22:57:55 +00:00
Pete Cooper	b98934cf72	Removed one last bad continue statement meant to be removed in r153914. llvm-svn: 153975	2012-04-03 22:18:49 +00:00
Chad Rosier	2a02fe1bb2	Fix an issue in SimplifySetCC() specific to vector comparisons. When folding X == X we need to check getBooleanContents() to determine if the result is a vector of ones or a vector of negative ones. I tried creating a test case, but the problem seems to only be exposed on a much older version of clang (around r144500). rdar://10923049 llvm-svn: 153966	2012-04-03 20:11:24 +00:00
Eric Christopher	b81e2b403c	Fix thinko check for number of operands to be the one that actually might have more than 19 operands. Add a testcase to make sure I never screw that up again. Part of rdar://11026482 llvm-svn: 153961	2012-04-03 17:55:42 +00:00
Eric Christopher	34164196af	Add a line number for the scope of the function (starting at the first brace) so that we get more accurate line number information about the declaration of a given function and the line where the function first starts. Part of rdar://11026482 llvm-svn: 153916	2012-04-03 00:43:49 +00:00
Pete Cooper	4f0dbb27d9	Fixes to r153903. Added missing explanation of behaviour when the VirtRegMap is NULL. Also changed it in this case to just avoid updating the map, but live ranges or intervals will still get updated and created llvm-svn: 153914	2012-04-03 00:28:46 +00:00
Pete Cooper	3ca96f9950	Moved LiveRangeEdit.h so that it can be called from other parts of the backend, not just libCodeGen llvm-svn: 153906	2012-04-02 22:44:18 +00:00
Jakob Stoklund Olesen	291007b055	Allocate virtual registers in ascending order. This is just the fallback tie-breaker ordering, the main allocation order is still descending size. Patch by Shamil Kurmangaleev! llvm-svn: 153904	2012-04-02 22:30:39 +00:00
Pete Cooper	2bde2f42b1	Refactored the LiveRangeEdit interface so that MachineFunction, TargetInstrInfo, MachineRegisterInfo, LiveIntervals, and VirtRegMap are all passed into the constructor and stored as members instead of passed in to each method. llvm-svn: 153903	2012-04-02 22:22:53 +00:00
Owen Anderson	98f2c0c384	Add predicates for checking whether targets have free FNEG and FABS operations, and prevent the DAGCombiner from turning them into bitwise operations if they do. llvm-svn: 153901	2012-04-02 22:10:29 +00:00
Lang Hames	aaafacd07e	During two-address lowering, rescheduling an instruction does not untie operands. Make TryInstructionTransform return false to reflect this. Fixes PR11861. llvm-svn: 153892	2012-04-02 19:58:43 +00:00
Eric Christopher	ad9fe8955a	Turn on the accelerator tables for Darwin. llvm-svn: 153880	2012-04-02 17:58:52 +00:00
Nadav Rotem	702f080767	Optimizing swizzles of complex shuffles may generate additional complex shuffles. Do not try to optimize swizzles of shuffles if the source shuffle has more than a single user, except when the source shuffle is also a swizzle. llvm-svn: 153864	2012-04-02 07:11:12 +00:00
Craig Topper	54bfde79db	Make MCInstrInfo available to the MCInstPrinter. This will be used to remove getInstructionName and the static data it contains since the same tables are already in MCInstrInfo. llvm-svn: 153860	2012-04-02 06:09:36 +00:00
Nadav Rotem	b078350872	This commit contains a few changes that had to go in together. 1. Simplify xor/and/or (bitcast(A), bitcast(B)) -> bitcast(op (A,B)) (and also scalar_to_vector). 2. Xor/and/or are indifferent to the swizzle operation (shuffle of one src). Simplify xor/and/or (shuff(A), shuff(B)) -> shuff(op (A, B)) 3. Optimize swizzles of shuffles: shuff(shuff(x, y), undef) -> shuff(x, y). 4. Fix an X86ISelLowering optimization which was very bitcast-sensitive. Code which was previously compiled to this: movd (%rsi), %xmm0 movdqa .LCPI0_0(%rip), %xmm2 pshufb %xmm2, %xmm0 movd (%rdi), %xmm1 pshufb %xmm2, %xmm1 pxor %xmm0, %xmm1 pshufb .LCPI0_1(%rip), %xmm1 movd %xmm1, (%rdi) ret Now compiles to this: movl (%rsi), %eax xorl %eax, (%rdi) ret llvm-svn: 153848	2012-04-01 19:31:22 +00:00
Lang Hames	652f21274f	Fix typo. llvm-svn: 153846	2012-04-01 19:27:25 +00:00
Andrew Trick	779b32a44e	misched: Add finalizeScheduler to complete the target interface. llvm-svn: 153827	2012-04-01 07:24:23 +00:00
Rafael Espindola	80c540e656	Teach CodeGen's version of computeMaskedBits to understand the range metadata. This is the CodeGen equivalent of r153747. I tested that there is not noticeable performance difference with any combination of -O0/-O2 /-g when compiling gcc as a single compilation unit. llvm-svn: 153817	2012-03-31 18:14:00 +00:00
Bill Wendling	9f829f1cc4	If we have a VLA that has a "use" in a metadata node that's then used here but it has no other uses, then we have a problem. E.g., int foo (const int x) { char a[x]; return 0; } If we assign 'a' a vreg and fast isel later on has to use the selection DAG isel, it will want to copy the value to the vreg. However, there are no uses, which goes counter to what selection DAG isel expects. <rdar://problem/11134152> llvm-svn: 153705	2012-03-30 00:02:55 +00:00
Eric Christopher	70e1bd8872	Add support for objc property decls according to the page at: http://llvm.org/docs/SourceLevelDebugging.html#objcproperty including type and DECL. Expand the metadata needed accordingly. rdar://11144023 llvm-svn: 153639	2012-03-29 08:42:56 +00:00
Jakob Stoklund Olesen	c3e80cc885	Enable machine code verification in the entire code generator. Some targets still mess up the liveness information, but that isn't verified after MRI->invalidateLiveness(). The verifier can still check other useful things like register classes and CFG, so it should be enabled after all passes. llvm-svn: 153615	2012-03-28 23:54:28 +00:00
Jakob Stoklund Olesen	d1bd8fba13	Enable machine code verification after PreSched2 passes. The late scheduler depends on accurate liveness information if it is breaking anti-dependencies, so we should be able to verify it. Relax the terminator checking in the machine code verifier so it can handle the basic blocks created by if conversion. llvm-svn: 153614	2012-03-28 23:31:15 +00:00
Jakob Stoklund Olesen	e433c68d7c	Also verify after ExpandPostRAPseudos. llvm-svn: 153599	2012-03-28 20:49:30 +00:00
Jakob Stoklund Olesen	341e06f8d5	Enable machine code verification after the late machine optimization passes. Branch folding invalidates liveness and disables liveness verification on some targets. llvm-svn: 153597	2012-03-28 20:47:37 +00:00
Jakob Stoklund Olesen	b21df32cf5	Skip liveness verification when MRI->tracksLiveness() is false. Extract the liveness verification into its own method. This makes it possible to run the machine code verifier after liveness information is no longer required to be valid. llvm-svn: 153596	2012-03-28 20:47:35 +00:00
Jakob Stoklund Olesen	8e58c90f51	Allow removeLiveIn to be called with a register that isn't live-in. This avoids the silly double search: if (isLiveIn(Reg)) removeLiveIn(Reg); llvm-svn: 153592	2012-03-28 20:11:42 +00:00
Pete Cooper	148ebb8802	Fixed commuteInstructions bug where if its called pre-regalloc the subreg indices weren't commuted llvm-svn: 153579	2012-03-28 17:02:22 +00:00
Eric Christopher	24a6298512	More debug output. llvm-svn: 153571	2012-03-28 07:34:36 +00:00
Eric Christopher	7285c7d51d	Fix the output of the DW_TAG_friend tag to include DW_AT_friend and not the rest of the member tag. Fixes PR11695 llvm-svn: 153570	2012-03-28 07:34:31 +00:00
Lang Hames	5544bf1b8a	Use a SmallVector and linear lookup instead of a DenseSet - SourceMap values will always be tiny sets, so DenseSet is overkill (SmallSet won't work as we need iteration support). llvm-svn: 153529	2012-03-27 19:10:45 +00:00
Eric Christopher	7ed2efca6a	Use DW_AT_low_pc for a single entry point into a routine. Fixes PR10105 llvm-svn: 153524	2012-03-27 18:35:54 +00:00
Jakob Stoklund Olesen	6c08534aff	Print SSA and liveness tracking flags in MF::print(). llvm-svn: 153518	2012-03-27 17:17:16 +00:00
Jakob Stoklund Olesen	d1664a1571	Branch folding may invalidate liveness. Branch folding can use a register scavenger to update liveness information when required. Don't do that if liveness information is already invalid. llvm-svn: 153517	2012-03-27 17:06:09 +00:00
Chris Lattner	1cc25e8a40	fix what looks like a real logic bug, found by PVS-Studio (part of PR12357) llvm-svn: 153513	2012-03-27 16:27:21 +00:00
Jakob Stoklund Olesen	9c1ad5cb7d	Add an MRI::tracksLiveness() flag. Late optimization passes like branch folding and tail duplication can transform the machine code in a way that makes it expensive to keep the register liveness information up to date. There is a fuzzy line between register allocation and late scheduling where the liveness information degrades. The MRI::tracksLiveness() flag makes the line clear: While true, liveness information is accurate, and can be used for register scavenging. Once the flag is false, liveness information is not accurate, and can only be used as a hint. Late passes generally don't need the liveness information, but they will sometimes use the register scavenger to help update it. The scavenger enforces strict correctness, and we have to spend a lot of code to update register liveness that may never be used. llvm-svn: 153511	2012-03-27 15:13:58 +00:00
Evan Cheng	7fede87349	Post-ra LICM should take care not to hoist an instruction that would clobber a register that's read by the preheader terminator. rdar://11095580 llvm-svn: 153492	2012-03-27 01:50:58 +00:00
Lang Hames	551662bf5d	During MachineCopyPropagation a register may be the source operand of multiple copies being considered for removal. Make sure to track all of the copies, rather than just the most recent encountered, by holding a DenseSet instead of an unsigned in SrcMap. No test case - couldn't reduce something with a sane size. llvm-svn: 153487	2012-03-27 00:44:47 +00:00
Lang Hames	95e021faf5	Add a debug option to dump PBQP graphs during register allocation. llvm-svn: 153483	2012-03-26 23:07:23 +00:00
Eric Christopher	0925c62c74	Use the file in the inlined die rather than the compile unit for backtrace locations. Testcase forthcoming, but I wanted to get some testing here. Should fix: PR12323 PR12314 rdar://11091100 llvm-svn: 153471	2012-03-26 21:38:38 +00:00
Benjamin Kramer	3e6719c133	No need to do an expensive stable sort for a bunch of integers. llvm-svn: 153438	2012-03-26 14:17:26 +00:00
Craig Topper	6e80c28017	Prune some includes and forward declarations. llvm-svn: 153429	2012-03-26 06:58:25 +00:00
Eric Christopher	c1e2dcdb8a	Add a debug statement. llvm-svn: 153428	2012-03-26 06:10:32 +00:00
Hal Finkel	71c2ba3d2e	Add the ability to promote legal integer VAARGs. This is required for the PPC64 SVR4 ABI. llvm-svn: 153372	2012-03-24 03:53:52 +00:00
Jim Grosbach	4a2909ab0f	Pretty-printing comments for literal floating point in .s files. Dump the hex representation to the comment stream as well as the float value. llvm-svn: 153346	2012-03-23 23:06:47 +00:00
Lang Hames	45c6d21ae1	Add support for register masks to PBQP. llvm-svn: 153341	2012-03-23 17:33:42 +00:00
Evan Cheng	8ab58a21a5	Source order scheduler should not preschedule nodes with multiple uses. rdar://11096639 llvm-svn: 153270	2012-03-22 19:31:17 +00:00
Evan Cheng	79f03e915d	Assign node orders to target intrinsics which do not produce results. rdar://11096639 llvm-svn: 153269	2012-03-22 19:29:09 +00:00
Eric Christopher	12da169839	In erroneous inline assembly we could mistakenly try to access the metadata operand as an actual operand, leading to an assert. Error out in this case. rdar://11007633 llvm-svn: 153234	2012-03-22 01:33:51 +00:00
Chad Rosier	6a63a74113	[fast-isel] Fold "urem x, pow2" -> "and x, pow2-1". This should fix the 271% execution-time regression for nsieve-bits on the ARMv7 -O0 -g nightly tester. This may also improve compile-time on architectures that would otherwise generate a libcall for urem (e.g., ARM) or fall back to the DAG selector. rdar://10810716 llvm-svn: 153230	2012-03-22 00:21:17 +00:00
Jim Grosbach	e13adc38d0	Checking a build_vector for an all-ones value. Type legalization can zero-extend the elements of the build_vector node, so, for example, we may have an <8 x i8> with i32 elements of value 255. That should return 'true' for the vector being all ones. llvm-svn: 153203	2012-03-21 17:48:04 +00:00
Andrew Trick	25baeca54d	misched: fix LiveInterval update for bottom-up scheduling llvm-svn: 153162	2012-03-21 04:12:16 +00:00
Andrew Trick	adb03b91ee	misched: trace LiveIntervals after scheduling. llvm-svn: 153161	2012-03-21 04:12:12 +00:00
Andrew Trick	54f7def703	misched: obvious iterator update fixes for bottom-up. llvm-svn: 153160	2012-03-21 04:12:10 +00:00
Andrew Trick	de670c0304	misched: cleanup main loop llvm-svn: 153159	2012-03-21 04:12:07 +00:00
Andrew Trick	3bfafcba10	misched: fix LI update for bottom-up. llvm-svn: 153158	2012-03-21 04:12:01 +00:00
Bill Wendling	7315c4b9cd	It's possible to have a constant expression who's size is quite big (e.g., i128). In that case, we may not be able to print out the MCExpr as an expression. For instance, we could have an MCExpr like this: 0xBEEF0000BEEF0000 \| (0xBEEF0000BEEF0000 << 64) The MCExpr printer handles sizes up to 64-bits, but this expression would require 128-bits. In this situation, try to evaluate the constant expression and emit that as the value into 64-bit chunks. <rdar://problem/11070338> llvm-svn: 153081	2012-03-20 08:56:43 +00:00
Craig Topper	aaeae98936	When combining (vextract shuffle (load ), <1,u,u,u>), 0) -> (load ), add users of the final load to the worklist too. Needed by changes I'm preparing to make to X86 backend. llvm-svn: 153078	2012-03-20 05:28:39 +00:00
Eric Christopher	60e01c560a	Do everything up to generating code to try to get a register for a variable. The previous code would break the debug info changing code invariant. This will regress debug info for arguments where we elide the alloca created. Fixes rdar://11066468 llvm-svn: 153074	2012-03-20 01:07:58 +00:00
Eric Christopher	997aaa9237	Untabify. llvm-svn: 153073	2012-03-20 01:07:56 +00:00
Eric Christopher	e5e54c87fa	Add another debugging statement here. llvm-svn: 153072	2012-03-20 01:07:53 +00:00
Eric Christopher	1a06cc9ae6	Use lookUpRegForValue here instead of duplicating the code. llvm-svn: 153071	2012-03-20 01:07:47 +00:00
Pete Cooper	e69be6df4f	f16 FDIV can now be legalized by promoting to f32 llvm-svn: 153064	2012-03-19 23:38:12 +00:00
Lang Hames	dd98c497b9	Add an option to the MI scheduler to cut off scheduling after a fixed number of instructions have been scheduled. Handy for tracking down scheduler bugs, or bugs exposed by scheduling. llvm-svn: 153045	2012-03-19 18:38:38 +00:00
Duncan Sands	3fb2fc6edb	Fix DAG combine which creates illegal vector shuffles. Patch by Heikki Kultala. llvm-svn: 153035	2012-03-19 15:35:44 +00:00
Benjamin Kramer	5d1bca8016	CriticalAntiDepBreaker: Replace a SmallSet of regs with a much denser BitVector. llvm-svn: 152999	2012-03-17 20:22:57 +00:00
Benjamin Kramer	97f889f43b	MachineInstr: Inline the fast path (non-bundle instruction) of hasProperty. This is particularly helpful as both arguments tend to be constants. llvm-svn: 152991	2012-03-17 17:03:45 +00:00
Benjamin Kramer	411d5a2026	ScheduleDAGInstrs: When adding uses we add them into a set that's empty at the beginning, no need to maintain another set for the added regs. llvm-svn: 152934	2012-03-16 17:38:19 +00:00
Benjamin Kramer	d03878bdf2	Limit the number of memory operands in MachineInstr to 2^16 and store the number in padding. Saves one machine word on MachineInstr (88->80 bytes on x86_64, 48->44 on i386). llvm-svn: 152930	2012-03-16 16:39:27 +00:00
Benjamin Kramer	8e5af375db	CriticalAntiDepBreaker: BasicBlock::size is an expensive operation, reuse the cached value. No functionality change. llvm-svn: 152927	2012-03-16 15:46:47 +00:00
Andrew Trick	e6913c7245	misched: add DAG edges from vreg defs to ExitSU. These edges are not really necessary, but it is consistent with the way we currently create physreg edges. Scheduler heuristics that expect a DAG edge to the block terminator could benefit from this change. Although in the future I hope we have a better mechanism for modeling latency across scheduling regions. llvm-svn: 152895	2012-03-16 05:04:25 +00:00
Chad Rosier	1a9c17efad	Revert r152705, which reapplied r152486 as this appears to be causing failures on our internal nightly testers. So, basically revert r152486 again. Abbreviated original commit message: Implement a more intelligent way of spilling uses across an invoke boundary. It looks as if Chander's inlining work, r152737, exposed an issue. llvm-svn: 152887	2012-03-16 01:04:00 +00:00
NAKAMURA Takumi	a7e57ace28	Revert r152613 (and r152614), "Inline the d'tor and add an anchor instead." for workaround of g++-4.4's miscompilation. It caused MSP430DAGToDAGISel::SelectIndexedBinOp() to be miscompiled. When two ReplaceUses()'s are expanded as inline, vtable in base class is stored to latter (ISelUpdater)ISU. llvm-svn: 152877	2012-03-16 00:01:55 +00:00
Eric Christopher	7734ca2891	For types with a parent of the compile unit make sure and emit the DECL information. rdar://10855921 llvm-svn: 152876	2012-03-15 23:55:40 +00:00
Eric Christopher	3390a6e5e3	We actually handle AllocaInst via getRegForValue below just fine. Part of rdar://8905263 llvm-svn: 152845	2012-03-15 21:33:47 +00:00
Eric Christopher	142820ba8d	Add some debugging output into fast isel as well. llvm-svn: 152844	2012-03-15 21:33:44 +00:00
Eric Christopher	be7a1016fc	Add another debug statement. llvm-svn: 152843	2012-03-15 21:33:41 +00:00
Eric Christopher	6a0c679762	Tabs. llvm-svn: 152842	2012-03-15 21:33:39 +00:00
Eric Christopher	be153e6610	Typo. llvm-svn: 152841	2012-03-15 21:33:35 +00:00
Nadav Rotem	6fd1d32c63	When optimizing certain BUILD_VECTOR nodes into other BUILD_VECTOR nodes, add the new node into the work list because there is a potential for further optimizations. llvm-svn: 152784	2012-03-15 08:49:06 +00:00
Eric Christopher	7dd54fb695	Revert the removal of DW_AT_MIPS_linkage_name when we aren't putting out the DW_AT_name. Older gdbs unfortunately still use it to disambiguate member functions in templated classes (gdb.cp/templates.exp). rdar://11043421 (which is now deferred for a bit) llvm-svn: 152782	2012-03-15 08:19:33 +00:00
Bill Wendling	df170db2f6	Add a xform to the DAG combiner. Transform: (fsub x, (fadd x, y)) -> (fneg y) and (fsub x, (fadd y, x)) -> (fneg y) if 'unsafe math' is specified. <rdar://problem/7540295> llvm-svn: 152777	2012-03-15 05:12:00 +00:00
Benjamin Kramer	05e7a843aa	Silence operator precedence warnings. llvm-svn: 152711	2012-03-14 11:26:37 +00:00
Bill Wendling	d7c0aae45b	Reapply r152486 with a fix for the nightly testers. There were cases where a value could be used and it's both crossing an invoke and NOT crossing an invoke. This could happen in the landing pads. In that case, we will demote the value to the stack like we did before. <rdar://problem/10609139> llvm-svn: 152705	2012-03-14 07:28:01 +00:00
Bill Wendling	618d57310a	Insert the debugging instructions in one fell-swoop so that it doesn't call the expensive "getFirstTerminator" call. This reduces the time of compilation in PR12258 from >10 minutes to < 10 seconds. llvm-svn: 152704	2012-03-14 07:14:25 +00:00
Andrew Trick	8823decdd4	misched: implemented a framework for top-down or bottom-up scheduling. New flags: -misched-topdown, -misched-bottomup. They can be used with the default scheduler or with -misched=shuffle. Without either topdown/bottomup flag -misched=shuffle now alternates scheduling direction. LiveIntervals update is unimplemented with bottom-up scheduling, so only -misched-topdown currently works. Capped the ScheduleDAG hierarchy with a concrete ScheduleDAGMI class. ScheduleDAGMI is aware of the top and bottom of the unscheduled zone within the current region. Scheduling policy can be plugged into the ScheduleDAGMI driver by implementing MachineSchedStrategy. ConvergingScheduler is now the default scheduling algorithm. It exercises the new driver but still does no reordering. llvm-svn: 152700	2012-03-14 04:00:41 +00:00
Andrew Trick	72515bef32	misched comments llvm-svn: 152699	2012-03-14 04:00:38 +00:00
Eric Christopher	a9916d0296	Remove the DW_AT_MIPS_linkage name attribute when we don't need it output (we're emitting a specification already and the information isn't changing). Saves 1% on the debug information for a build of llvm. Fixes rdar://11043421 llvm-svn: 152697	2012-03-14 02:59:17 +00:00
Evan Cheng	d5f8e5766c	Fortify r152675 a bit. Although I'm not able to come up with a test case that would trigger the truncation case. llvm-svn: 152678	2012-03-13 22:16:11 +00:00
Evan Cheng	7bf83096df	DAG combine incorrectly optimize (i32 vextract (v4i16 load $addr), c) to (i16 load $addr+csizeof(i16)) and replace uses of (i32 vextract) with the i16 load. It should issue an extload instead: (i32 extload $addr+csizeof(i16)). rdar://11035895 llvm-svn: 152675	2012-03-13 22:00:52 +00:00
Bill Wendling	12e5adb8d3	s/SjLjEHPass/SjLjEHPrepare/ No functionality change. llvm-svn: 152658	2012-03-13 20:04:21 +00:00
Bill Wendling	ac499ab244	Add a return type. llvm-svn: 152614	2012-03-13 05:52:28 +00:00
Bill Wendling	8adb10c8a9	Inline the d'tor and add an anchor instead. llvm-svn: 152613	2012-03-13 05:51:56 +00:00
Bill Wendling	508a3e5185	Refactor the SelectionDAG's 'dump' methods into their own .cpp file. No functionality change. llvm-svn: 152611	2012-03-13 05:47:27 +00:00
Lang Hames	fdb00ea27d	Fixed typo in comment. llvm-svn: 152610	2012-03-13 05:43:30 +00:00
Bill Wendling	5ad914038b	Revert due to nightly test failures. --- Reverse-merging r152486 into '.': U lib/CodeGen/SjLjEHPrepare.cpp llvm-svn: 152571	2012-03-12 20:19:41 +00:00
Benjamin Kramer	71b197306e	DwarfDebug: Store the filename/dirname pair as a zero-separated string in a stringmap, instead of using a highly inefficient std::map of a pair of std::strings. llvm-svn: 152541	2012-03-11 14:56:26 +00:00
Stepan Dyatkovskiy	97b02fc1b3	llvm::SwitchInst Renamed methods caseBegin, caseEnd and caseDefault with case_begin, case_end, and case_default. Added some notes relative to case iterators. llvm-svn: 152532	2012-03-11 06:09:17 +00:00
Benjamin Kramer	6338e61ae9	Microoptimize getVRegDef. def_begin isn't free, don't compute it twice. llvm-svn: 152492	2012-03-10 12:50:44 +00:00
Bill Wendling	1ab79c6db3	Implement a more intelligent way of spilling uses across an invoke boundary. The old way of determine when and where to spill a value that was used inside of a landing pad resulted in spilling that value everywhere and not just at the invoke edge. This algorithm determines which values are used within a landing pad. It then spills those values before the invoke and reloads them before the uses. This should prevent excessive spilling in many cases, e.g. inside of loops. <rdar://problem/10609139> llvm-svn: 152486	2012-03-10 07:11:55 +00:00
Jakob Stoklund Olesen	99014ff206	Report the defining instruction. llvm-svn: 152460	2012-03-10 00:44:11 +00:00
Jakob Stoklund Olesen	9f3e5744ab	Add SSA verification to MachineVerifier. Somehow we never verified SSA dominance before. llvm-svn: 152458	2012-03-10 00:36:06 +00:00
Jakob Stoklund Olesen	6ea6a14458	Use SmallPtrSet instead of DenseSet. llvm-svn: 152457	2012-03-10 00:36:04 +00:00
Benjamin Kramer	e1e549d617	Give dagcombiner's worklist some inline capacity. llvm-svn: 152454	2012-03-10 00:23:58 +00:00
Jakob Stoklund Olesen	7d544f9165	Assert on SSA errors in LiveVariables. All uses of a virtual register must be dominated by its def. llvm-svn: 152449	2012-03-09 23:41:44 +00:00
Andrew Trick	af1bee7235	misched: handle scheduler that insert instructions at empty region boundaries. And add comments, since this is obviously confusing. llvm-svn: 152445	2012-03-09 22:34:56 +00:00
Andrew Trick	edfe2ec429	misched: handle scheduling region boundaries nicely. llvm-svn: 152393	2012-03-09 08:02:51 +00:00
Andrew Trick	8c207e47c1	misched interface: rename Begin/End to RegionBegin/RegionEnd since they are not private. llvm-svn: 152382	2012-03-09 04:29:02 +00:00
Andrew Trick	1c0ec45b67	misched comments llvm-svn: 152374	2012-03-09 03:46:42 +00:00
Andrew Trick	a21daf7f5b	revert 152356: verify misched changes using -misched=shuffle. llvm-svn: 152373	2012-03-09 03:46:39 +00:00
Andrew Trick	453006875c	misched: allow the default scheduler to be one chosen by the target. llvm-svn: 152360	2012-03-09 00:52:20 +00:00
Evan Cheng	bc3b4e3f12	Cache MBB->begin. It's possible the scheduler / bundler may change MBB->begin(). llvm-svn: 152356	2012-03-09 00:24:29 +00:00
Craig Topper	5a4bcc749a	Use uint16_t to store instruction implicit uses and defs. Reduces static data. llvm-svn: 152301	2012-03-08 08:22:45 +00:00
Stepan Dyatkovskiy	5b648afb4d	Taken into account Duncan's comments for r149481 dated by 2nd Feb 2012: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120130/136146.html Implemented CaseIterator and it solves almost all described issues: we don't need to mix operand/case/successor indexing anymore. Base iterator class is implemented as a template since it may be initialized either from "const SwitchInst" or from "SwitchInst". ConstCaseIt is just a read-only iterator. CaseIt is read-write iterator; it allows to change case successor and case value. Usage of iterator allows totally remove resolveXXXX methods. All indexing convertions done automatically inside the iterator's getters. Main way of iterator usage looks like this: SwitchInst SI = ... // intialize it somehow for (SwitchInst::CaseIt i = SI->caseBegin(), e = SI->caseEnd(); i != e; ++i) { BasicBlock BB = i.getCaseSuccessor(); ConstantInt *V = i.getCaseValue(); // Do something. } If you want to convert case number to TerminatorInst successor index, just use getSuccessorIndex iterator's method. If you want initialize iterator from TerminatorInst successor index, use CaseIt::fromSuccessorIndex(...) method. There are also related changes in llvm-clients: klee and clang. llvm-svn: 152297	2012-03-08 07:06:20 +00:00
Andrew Trick	02a80da331	misched interface: Expose the MachineScheduler pass. Allow targets to provide their own schedulers (subclass of ScheduleDAGInstrs) to the misched pass. Select schedulers using -misched=... llvm-svn: 152278	2012-03-08 01:41:12 +00:00
Andrew Trick	69b4204c18	Cleanup VLIWPacketizer to use the updated ScheduleDAGInstrs interface. llvm-svn: 152262	2012-03-07 23:01:09 +00:00
Andrew Trick	9a0c583954	misched prep: Expose the ScheduleDAGInstrs interface so targets may implement their own MachineScheduler. llvm-svn: 152261	2012-03-07 23:01:06 +00:00
Andrew Trick	d743f71e82	misched prep: Remove LLVM_LIBRARY_VISIBILITY from ScheduleDAGInstrs. llvm-svn: 152260	2012-03-07 23:01:02 +00:00
Andrew Trick	9b9dea5d07	misched prep: Comment the ScheduleDAGInstrs interface. llvm-svn: 152259	2012-03-07 23:00:59 +00:00
Andrew Trick	926d4736ed	misched prep: Cleanup ScheduleDAGInstrs interface. ScheduleDAGInstrs will be the main interface for MI-level schedulers. Make sure it's readable: one page of protected fields, one page of public methids. llvm-svn: 152258	2012-03-07 23:00:57 +00:00
Andrew Trick	67561b3ef2	misched prep: remove extra "protected" llvm-svn: 152257	2012-03-07 23:00:54 +00:00
Andrew Trick	a316faabec	misched prep: rename InsertPos to End. ScheduleDAGInstrs knows nothing about how instructions will be moved or inserted. llvm-svn: 152256	2012-03-07 23:00:52 +00:00
Andrew Trick	52226d409b	misched preparation: rename core scheduler methods for consistency. We had half the API with one convention, half with another. Now was a good time to clean it up. llvm-svn: 152255	2012-03-07 23:00:49 +00:00
Chandler Carruth	636ee38a88	Try to clarify this comment some. llvm-svn: 152221	2012-03-07 10:13:40 +00:00
Chandler Carruth	962152ca7a	Remove another outbreak of customized (and completely broken) hashing. This one is particularly annoying because the hashing algorithm is highly specialized, with a strange "equivalence" definition that subsets the fields involved. Still, this looks at the exact same set of data as the old code, but without bitwise or-ing over parts of it and other mixing badness. No functionality changed here. I've left a substantial fixme about the fact that there is a cleaner and more principled way to do this, but it requires making the equality definition actual stable for particular types... llvm-svn: 152218	2012-03-07 09:39:46 +00:00
Bill Wendling	7c5dcb6ccf	Where the BranchFolding pass removes a branch then adds another better branch, the DebugLoc information can be maintained throughout by grabbing the DebugLoc before the RemoveBranch and then passing the result to the InsertBranch. Patch by Andrew Stanford-Jason! llvm-svn: 152212	2012-03-07 08:49:42 +00:00
Andrew Trick	1a1b54a2da	Fix cmake llvm-svn: 152210	2012-03-07 05:46:04 +00:00
Andrew Trick	f9fa8afdaa	comment llvm-svn: 152209	2012-03-07 05:21:54 +00:00
Andrew Trick	60cf03e772	misched preparation: clarify ScheduleDAG and ScheduleDAGInstrs roles. ScheduleDAG is responsible for the DAG: SUnits and SDeps. It provides target hooks for latency computation. ScheduleDAGInstrs extends ScheduleDAG and defines the current scheduling region in terms of MachineInstr iterators. It has access to the target's scheduling itinerary data. ScheduleDAGInstrs provides the logic for building the ScheduleDAG for the sequence of MachineInstrs in the current region. Target's can implement highly custom schedulers by extending this class. ScheduleDAGPostRATDList provides the driver and diagnostics for current postRA scheduling. It maintains a current Sequence of scheduled machine instructions and logic for splicing them into the block. During scheduling, it uses the ScheduleHazardRecognizer provided by the target. Specific changes: - Removed driver code from ScheduleDAG. clearDAG is the only interface needed. - Added enterRegion/exitRegion hooks to ScheduleDAGInstrs to delimit the scope of each scheduling region and associated DAG. They should be used to setup and cleanup any region-specific state in addition to the DAG itself. This is necessary because we reuse the same ScheduleDAG object for the entire function. The target may extend these hooks to do things at regions boundaries, like bundle terminators. The hooks are called even if we decide not to schedule the region. So all instructions in a block are "covered" by these calls. - Added ScheduleDAGInstrs::begin()/end() public API. - Moved Sequence into the driver layer, which is specific to the scheduling algorithm. llvm-svn: 152208	2012-03-07 05:21:52 +00:00
Andrew Trick	42756e2eb4	ScheduleDAGInstrs comments llvm-svn: 152207	2012-03-07 05:21:47 +00:00
Andrew Trick	e932bb77b5	misched preparation: modularize schedule emission. ScheduleDAG has nothing to do with how the instructions are scheduled. llvm-svn: 152206	2012-03-07 05:21:44 +00:00
Andrew Trick	edee68ce1b	misched preparation: modularize schedule printing. ScheduleDAG will not refer to the scheduled instruction sequence. llvm-svn: 152205	2012-03-07 05:21:40 +00:00
Andrew Trick	46a58664f7	misched preparation: modularize schedule verification. ScheduleDAG will not refer to the scheduled instruction sequence. llvm-svn: 152204	2012-03-07 05:21:36 +00:00
Andrew Trick	7c6c41a56a	whitespace llvm-svn: 152203	2012-03-07 05:21:32 +00:00
Andrew Trick	a5f19560fb	Added -view-misched=dags options. llvm-svn: 152178	2012-03-07 00:18:25 +00:00
Andrew Trick	1b2324d0e8	Cleanup in preparation for misched: Move DAG visualization logic. Soon, ScheduleDAG will not refer to the BB. llvm-svn: 152177	2012-03-07 00:18:22 +00:00
Andrew Trick	320c7030db	Added MachineBasicBlock::getFullName() to standardize/factor codegen diagnostics. llvm-svn: 152176	2012-03-07 00:18:18 +00:00
Andrew Trick	5297d8df99	whitespace llvm-svn: 152175	2012-03-07 00:18:15 +00:00
Andrew Trick	0c84efe8dd	Cleanup: DAG building is specific to either SD or MI scheduling. Not part of the target interface. llvm-svn: 152174	2012-03-07 00:18:12 +00:00
Andrew Trick	3222c0985b	misched comments llvm-svn: 152173	2012-03-07 00:18:08 +00:00
Andrew Trick	3b6eb1e5ea	misched: Use the StartBlock/FinishBlock hooks llvm-svn: 152172	2012-03-07 00:18:05 +00:00
Eric Christopher	54cf8ff45e	Add the DW_AT_APPLE_runtime_class attribute to forward declarations as well as completely defined classes. This fixes rdar://10956070 llvm-svn: 152171	2012-03-07 00:15:19 +00:00
Evan Cheng	80893ce5f5	Extend r148086 to check for [r +/- reg] address mode. This fixes queens performance regression (due to increased register pressure from overly aggressive pre-inc formation). llvm-svn: 152162	2012-03-06 23:33:32 +00:00
Jakob Stoklund Olesen	936656ba2f	Hoist common code out of if statement. llvm-svn: 152153	2012-03-06 22:27:13 +00:00
Evan Cheng	217a704acc	Avoid finalizeBundles infinite looping. llvm-svn: 152089	2012-03-06 02:00:52 +00:00
Owen Anderson	2ee7c4dfc5	Make it possible for a target to mark FSUB as Expand. This requires providing a default expansion (FADD+FNEG), and teaching DAGCombine not to form FSUBs post-legalize if they are not legal. llvm-svn: 152079	2012-03-06 00:29:31 +00:00
Jim Grosbach	fd93a59557	Make MCRegisterInfo available to the the MCInstPrinter. Used to allow context sensitive printing of super-register or sub-register references. llvm-svn: 152043	2012-03-05 19:33:20 +00:00
Bill Wendling	7cf6db7e3c	Fix warnings about adding a bool to a string. Patch by Sean Silva! llvm-svn: 152042	2012-03-05 19:29:36 +00:00
Craig Topper	4b02a29eba	Convert more GenRegisterInfo tables from unsigned to uint16_t to reduce static data size. llvm-svn: 152016	2012-03-05 05:37:41 +00:00
Jakob Stoklund Olesen	59bc8c437a	Stop fixing bad machine code in LiveIntervalAnalysis. The first def of a virtual register cannot also read the register. Assert on such bad machine code instead of trying to fix it. TwoAddressInstructionPass should never create code like that. llvm-svn: 152010	2012-03-04 19:19:10 +00:00
Jakob Stoklund Olesen	6759dd078a	Stop adding <imp-def> operands when coalescing sub-registers. We are already setting <undef> flags, and that is good enough. The <imp-def> operands don't mean anything any more. llvm-svn: 152009	2012-03-04 19:19:07 +00:00
Craig Topper	1d32658877	Use uint16_t to store register overlaps to reduce static data. llvm-svn: 152001	2012-03-04 10:43:23 +00:00
Craig Topper	b35eacb0f0	Use uint16_t instead of unsigned to store registers in reg classes. Reduces static data size. llvm-svn: 151998	2012-03-04 10:16:38 +00:00
Craig Topper	420525ce3b	Use uint16_t to store registers in callee saved register tables to reduce size of static data. llvm-svn: 151996	2012-03-04 03:33:22 +00:00
Eric Christopher	1df94bfe8a	Grammar-o in function name. llvm-svn: 151875	2012-03-02 02:11:47 +00:00
Eric Christopher	e19f4cd066	Grammar. llvm-svn: 151874	2012-03-02 01:57:55 +00:00
Eric Christopher	7772531567	If the linkage name doesn't exist we're supposed to emit a reference to the string table for the function name, not the function name. llvm-svn: 151873	2012-03-02 01:57:52 +00:00
Eric Christopher	7524fe4551	Revert "Reorder the sections being output to reduce the number of assembler" The inline table needs to be constructed ahead of time so that it doesn't try to create new strings while we're emitting everything. This reverts commit a8ff9bccb399183cdd5f1c3cec2bda763664b4b0. llvm-svn: 151864	2012-03-02 00:30:24 +00:00
Eric Christopher	66b0721014	Reorder the sections being output to reduce the number of assembler fixups that are being used to determine section offsets. Reduces the total number of fixups by 50% for a non-trivial testcase. Part of rdar://10413936 llvm-svn: 151852	2012-03-01 22:50:31 +00:00
Michael J. Spencer	35145f830a	Minimal changes for LLVM to compile under VS11. llvm-svn: 151849	2012-03-01 22:42:52 +00:00
James Molloy	f6298e9281	Fix a codegen fault in which log2 or exp2 could be dead-code eliminated even though they could have sideeffects. Only allow log2/exp2 to be converted to an intrinsic if they are declared "readnone". llvm-svn: 151807	2012-03-01 14:32:18 +00:00
Jakob Stoklund Olesen	abe8c09b20	Make InlineSpiller bundle-aware. Simply treat bundles as instructions. Spill code is inserted between bundles, never inside a bundle. Rewrite all operands in a bundle at once. Don't attempt and memory operand folding inside bundles. llvm-svn: 151787	2012-03-01 01:43:25 +00:00
Jakob Stoklund Olesen	d256c21666	Move getBundleStart() into MachineInstrBundle.h. This allows the function to be inlined, and makes it suitable for use in getInstructionIndex(). Also provide a const version. C++ is great for touch typing practice. llvm-svn: 151782	2012-03-01 01:26:01 +00:00
Lang Hames	76e66c31a0	Don't redundantly copy implicit operands when rematerializing. While we're at it - don't copy vreg implicit operands while rematerializing. This fixes PR12138. llvm-svn: 151779	2012-03-01 00:41:17 +00:00
Benjamin Kramer	d05a0c6c42	LegalizeIntegerTypes: Reorder operations in the "big shift by small amount" optimization, making the lives of later passes easier. llvm-svn: 151722	2012-02-29 13:27:00 +00:00
Jakob Stoklund Olesen	9e821456a3	Add an analyzeVirtReg() function. This function does more or less the same as MI::readsWritesVirtualRegister(), but it supports bundles as well. It also determines if any constraint requires reading and writing operands to use the same register. Most clients want to know. Use the more modern MO.readsReg() instead of trying to sort out undefs and partial redefines. Stop supporting the extra full <imp-def> operand as an alternative to <def,undef> sub-register defines. llvm-svn: 151690	2012-02-29 01:40:37 +00:00
Jakob Stoklund Olesen	8017d80505	Move the operand iterator into MachineInstrBundle.h where it belongs. Extract a base class and provide four specific sub-classes for iterating over const/non-const bundles/instructions. This eliminates the mystery bool constructor argument. llvm-svn: 151684	2012-02-29 00:33:41 +00:00
Lang Hames	2fbad222e1	Kill off LiveRangeEdit::getNewVRegs and LiveRangeEdit::getUselessVRegs. These methods are no longer needed now that LinearScan has gone away. (Contains tweaks trivialSpillEverywhere to enable the removal of getNewVRegs). llvm-svn: 151658	2012-02-28 22:07:24 +00:00
Evan Cheng	65f9d19c4f	Re-commit r151623 with fix. Only issue special no-return calls if it's a direct call. llvm-svn: 151645	2012-02-28 18:51:51 +00:00
Benjamin Kramer	f2e160c665	Fix off-by one in comment. llvm-svn: 151644	2012-02-28 18:37:06 +00:00
Benjamin Kramer	0c281a7deb	LegalizeIntegerTypes: Reenable the large shift with small amount optimization. To avoid problems with zero shifts when getting the bits that move between words we use a trick: first shift the by amount-1, then do another shift by one. When amount is 0 (and size 32) we first shift by 31, then by one, instead of by 32. Also fix a latent bug that emitted the low and high words in the wrong order when shifting right. Fixes PR12113. llvm-svn: 151637	2012-02-28 17:58:00 +00:00
Daniel Dunbar	ee7b899343	Revert r151623 "Some ARM implementaions, e.g. A-series, does return stack prediction. ...", it is breaking the Clang build during the Compiler-RT part. llvm-svn: 151630	2012-02-28 15:36:07 +00:00
Nadav Rotem	1d666099be	Code cleanup following CR by Duncan. llvm-svn: 151627	2012-02-28 14:13:19 +00:00
Nadav Rotem	875e463b19	Fix a bug in the code that builds SDNodes from vector GEPs. When the GEP index is a vector of pointers, the code that calculated the size of the element started from the vector type, and not the contained pointer type. As a result, instead of looking at the data element pointed by the vector, this code used the size of the vector. This works for 32bit members (on 32bit systems), but not for other types. Added code to peel the vector type and added a test. llvm-svn: 151626	2012-02-28 11:54:05 +00:00
Evan Cheng	87c7b09d8d	Some ARM implementaions, e.g. A-series, does return stack prediction. That is, the processor keeps a return addresses stack (RAS) which stores the address and the instruction execution state of the instruction after a function-call type branch instruction. Calling a "noreturn" function with normal call instructions (e.g. bl) can corrupt RAS and causes 100% return misprediction so LLVM should use a unconditional branch instead. i.e. mov lr, pc b _foo The "mov lr, pc" is issued in order to get proper backtrace. rdar://8979299 llvm-svn: 151623	2012-02-28 06:42:03 +00:00
Jakob Stoklund Olesen	4c5ad2b812	Handle regmasks in MachineCSE. Don't attempt to extend physreg live ranges across calls. <rdar://problem/10942095> llvm-svn: 151610	2012-02-28 02:08:50 +00:00
Jakob Stoklund Olesen	16c4a972db	Handle regmasks in the machine code verifier. llvm-svn: 151607	2012-02-28 01:42:41 +00:00
Chad Rosier	248c29966c	Fix 80-column violation. llvm-svn: 151599	2012-02-28 00:23:01 +00:00
Evan Cheng	ddeb9d11fe	Fix for PR12090: clear def maps of aliases when visiting a copy. e.g. %S5<def> = COPY %S0<kill> First clear def map of Q1, etc. No small test case available. llvm-svn: 151574	2012-02-27 21:46:42 +00:00
Jakob Stoklund Olesen	5aafb56dc0	Update machine code verifier. After the SlotIndex slot names were updated, it is possible to apply stricter checks to live intervals. Also treat bundles as bags of operands when checking live intervals. llvm-svn: 151531	2012-02-27 18:24:30 +00:00
Lang Hames	d5862ce317	Make the peephole optimizer clear kill flags on a vreg if it's about to add new uses of the vreg, since the old kills may no longer be valid. This was causing -verify-machineinstrs to complain about uses after kills, and could potentially have been causing subtle register allocation issues, but I haven't come across a test case yet. llvm-svn: 151425	2012-02-25 02:01:00 +00:00
Lang Hames	31bb57bc55	Fixed typo. llvm-svn: 151417	2012-02-25 00:46:38 +00:00
Jakob Stoklund Olesen	7f99142804	Add missing static llvm-svn: 151396	2012-02-24 21:52:44 +00:00
Jakob Stoklund Olesen	0a0a9688c5	Add a -stress-regalloc=<N> option. This will limit all register classes to N registers in order to stress test register allocation. llvm-svn: 151379	2012-02-24 18:34:20 +00:00
Hal Finkel	b9a3d61894	Don't crash when a glue node contains an internal CopyToReg This is necessary to support the existing ppc lowering code for indirect calls. Fixes PR12071. llvm-svn: 151373	2012-02-24 17:53:59 +00:00
Benjamin Kramer	6fe3e3d335	SDAGBuilder: Remove register sets that were never read and prune dead code surrounding it. llvm-svn: 151364	2012-02-24 14:01:17 +00:00
Nick Lewycky	e839e2895f	ScheduleDAGInstrs.h:155: warning: suggest parentheses around `&&' within `\|\|'. llvm-svn: 151355	2012-02-24 07:59:05 +00:00
Andrew Trick	9dbbd3e553	PostRA sched: speed up physreg tracking by not abusing SparseSet. llvm-svn: 151348	2012-02-24 07:04:55 +00:00
Pete Cooper	682c76b7d4	Turn avx insert intrinsic calls into INSERT_SUBVECTOR DAG nodes and remove duplicate patterns for selecting the intrinsics llvm-svn: 151342	2012-02-24 03:51:49 +00:00
Eric Christopher	da97054114	If the Address of a variable is an argument then treat the entire variable declaration as an argument because we want that address anyhow for our debug information. This seems to fix rdar://9965111, at least we have more debug information than before and from reading the assembly it appears to be the correct location. llvm-svn: 151335	2012-02-24 01:59:08 +00:00
Eric Christopher	219d51d649	Tabs, formatting and long lines oh my! llvm-svn: 151334	2012-02-24 01:59:01 +00:00
Bill Wendling	38b31619f6	Allow an integer to be converted into an MMX type when it's used in an inline asm. <rdar://problem/10106006> llvm-svn: 151303	2012-02-23 23:25:25 +00:00
Benjamin Kramer	ef8bf39575	BitVectorize loop. llvm-svn: 151274	2012-02-23 19:29:25 +00:00
Benjamin Kramer	796fd46993	post-ra-sched: Turn the KillIndices vector into a bitvector, it only stored two meaningful states. Rename it to LiveRegs to make it more clear what's stored inside. llvm-svn: 151273	2012-02-23 19:15:40 +00:00
Benjamin Kramer	21974b1fa6	post-ra-sched: Replace a std::set of regs with a bitvector. Assuming that a single std::set node adds 3 control words, a bitvector can store (38+4)8=224 registers in the allocated memory of a single element in the std::set (x86_64). Also we don't have to call malloc for every register added. llvm-svn: 151269	2012-02-23 18:28:32 +00:00
Jakob Stoklund Olesen	a793a59fc3	Make calls scheduling boundaries post-ra. Before register allocation, instructions can be moved across calls in order to reduce register pressure. After register allocation, we don't gain a lot by moving callee-saved defs across calls. In fact, since the scheduler doesn't have a good idea how registers are used in the callee, it can't really make good scheduling decisions. This changes the schedule in two ways: 1. Latencies to call uses and defs are no longer accounted for, causing some random shuffling around calls. This isn't really a problem since those uses and defs are inaccurate proxies for what happens inside the callee. They don't represent registers used by the call instruction itself. 2. Instructions are no longer moved across calls. This didn't happen very often, and the scheduling decision was made on dubious information anyway. As with any scheduling change, benchmark numbers shift around a bit, but there is no positive or negative trend from this change. This makes the post-ra scheduler 5% faster for ARM targets. The secret motivation for this patch is the introduction of register mask operands representing call clobbers. The most efficient way of handling regmasks in ScheduleDAGInstrs is to model them as barriers for physreg live ranges, but not for virtreg live ranges. That's fine pre-ra, but post-ra it would have the same effect as this patch. llvm-svn: 151265	2012-02-23 17:54:21 +00:00
Benjamin Kramer	d53aa39f46	Strip a layer of boilerplate from the VLIWPacketizer by storing the scheduler as an opaque pointer. llvm-svn: 151252	2012-02-23 13:39:13 +00:00
Anton Korobeynikov	a22828e085	Fix to make sure that a comdat group gets generated correctly for a static member of instantiated C++ templates. Patch by Kristof Beyls! llvm-svn: 151250	2012-02-23 10:36:04 +00:00
Eric Christopher	18c6be7132	More newline cleanups. llvm-svn: 151235	2012-02-23 03:39:43 +00:00
Eric Christopher	5c45205b79	Add some handy-dandy newlines. llvm-svn: 151234	2012-02-23 03:39:39 +00:00
Andrew Trick	da6a15d90d	misched: cleanup reaching def computation Ignore undef uses completely. Use a more explicit SlotIndex API. Add more explicit comments. llvm-svn: 151233	2012-02-23 03:16:24 +00:00
Andrew Trick	d675a4cec0	PostRASched: Convert physreg def/use tracking to Jakob's SparseSet. Added array subscript to SparseSet for convenience. Slight reorg to make it easier to manage the def/use sets. llvm-svn: 151228	2012-02-23 01:52:38 +00:00
Jakob Stoklund Olesen	28d4803ade	Handle regmasks in FixupKills. llvm-svn: 151226	2012-02-23 01:22:15 +00:00
Jakob Stoklund Olesen	38ce889cb6	Handle regmasks in CriticalAntiDepBreaker. llvm-svn: 151223	2012-02-23 01:15:26 +00:00
Jakob Stoklund Olesen	e664abb837	Track reserved registers separately from RegsAvailable. The bulk masking operations from register mask operands don't account for reserved registers. llvm-svn: 151222	2012-02-23 01:13:32 +00:00
Jakob Stoklund Olesen	033b9add40	Don't compute latencies for regmask operands. llvm-svn: 151211	2012-02-22 22:52:52 +00:00
Jakob Stoklund Olesen	e21b2d0845	Handle regmasks in RegisterScavenging. llvm-svn: 151210	2012-02-22 22:50:14 +00:00
Andrew Trick	d458e2df8d	misched: Use SparseSet for VRegDegs for constant time clear(). llvm-svn: 151205	2012-02-22 21:59:00 +00:00
Hal Finkel	ad4d9f5848	Allow the use of an alternate symbol for calculating a function's size. The standard function epilog includes a .size directive, but ppc64 uses an alternate local symbol to tag the actual start of each function. Until recently, binutils accepted the .size directive as: .size test1, .Ltmp0-test1 however, using this directive with recent binutils will result in the error: .size expression for XXX does not evaluate to a constant so we must use the label which actually tags the start of the function. llvm-svn: 151200	2012-02-22 21:11:47 +00:00
Michael J. Spencer	8b98bf2d6b	Properly emit _fltused with FastISel. Refactor to share code with SDAG. Patch by Joe Groff! llvm-svn: 151183	2012-02-22 19:06:13 +00:00
Andrew Trick	64ca16e9b8	Comment from code review llvm-svn: 151178	2012-02-22 18:34:49 +00:00
Chad Rosier	5dfe6dab25	Remove extra semi-colons. llvm-svn: 151169	2012-02-22 17:25:00 +00:00
Jakob Stoklund Olesen	bd5e076201	80 col. llvm-svn: 151167	2012-02-22 16:50:46 +00:00
Eric Christopher	5cd2a9d98e	Only add DW_AT_prototyped if we're working with a C-like language. Worth another 45k (1%) off of a large C++ testcase. rdar://10909458 llvm-svn: 151144	2012-02-22 08:46:21 +00:00
Eric Christopher	3a2656b394	Add the source language into the compile unit. llvm-svn: 151143	2012-02-22 08:46:13 +00:00
Eric Christopher	ef64b465a4	Remove extra semi-colon. llvm-svn: 151142	2012-02-22 08:46:02 +00:00
Andrew Trick	db42c6faa4	misched: DAG builder should not track dependencies for SSA defs. The vast majority of virtual register definitions don't need an entry in the DAG builder's VRegDefs set. llvm-svn: 151136	2012-02-22 06:08:13 +00:00
Andrew Trick	46cc9a4aaa	Initialize SUnits before DAG building. Affect on SD scheduling and postRA scheduling: Printing the DAG will display the nodes in top-down topological order. This matches the order within the MBB and makes my life much easier in general. Affect on misched: We don't need to track virtual register uses at all. This is awesome. I also intend to rely on the SUnit ID as a topo-sort index. So if A < B then we cannot have an edge B -> A. llvm-svn: 151135	2012-02-22 06:08:11 +00:00
Craig Topper	760b134ffa	Make all pointers to TargetRegisterClass const since they are all pointers to static data that should not be modified. llvm-svn: 151134	2012-02-22 05:59:10 +00:00
Jakob Stoklund Olesen	9c4cd1bfb1	Use SparseSet for the RAFast live virtual register map. This makes RAFast 4% faster, and it gets rid of the dodgy DenseMap iteration. This also revealed that RAFast would sometimes dereference DenseMap iterators after erasing other elements from the map. That does seem to work in the current DenseMap implementation, but SparseSet doesn't allow it. llvm-svn: 151111	2012-02-22 01:02:37 +00:00
Lang Hames	d6e765c69f	Add API "handleMoveIntoBundl" for updating liveness when moving instructions into bundles. This method takes a bundle start and an MI being bundled, and makes the intervals for the MI's operands appear to start/end on the bundle start. Also fixes some minor cosmetic issues (whitespace, naming convention) in the HMEditor code. llvm-svn: 151099	2012-02-21 22:29:38 +00:00
Eric Christopher	8575790912	There's no need for a DW_AT_byte_size on a pointer type. Part of rdar://10493979 where it reduces by about .5% (10k) llvm-svn: 151097	2012-02-21 22:25:53 +00:00
Andrew Trick	da84e64683	Clear virtual registers after they are no longer referenced. Passes after RegAlloc should be able to rely on MRI->getNumVirtRegs() == 0. This makes sharing code for pre/postRA passes more robust. Now, to check if a pass is running before the RA pipeline begins, use MRI->isSSA(). To check if a pass is running after the RA pipeline ends, use !MRI->getNumVirtRegs(). PEI resets virtual regs when it's done scavenging. PTX will either have to provide its own PEI pass or assign physregs. llvm-svn: 151032	2012-02-21 04:51:23 +00:00
Andrew Trick	5c714e7985	StackSlotColoring does not use a VirtRegMap llvm-svn: 151031	2012-02-21 04:51:19 +00:00
Lang Hames	7e2ce889a0	Fix some bugs in HMEditor's moveAllOperandsInto logic. llvm-svn: 151006	2012-02-21 00:00:36 +00:00
Evan Cheng	63618f9ba6	Fix machine-cp by having it to check sub-register indicies. e.g. ecx = mov eax al = mov ch The second copy is not a nop because the sub-indices of ecx,ch is not the same of that of eax/al. Re-enabled machine-cp. PR11940 llvm-svn: 151002	2012-02-20 23:28:17 +00:00
James Molloy	862fe49c55	Teach the DAGCombiner that certain loadext nodes followed by ANDs can be converted to zeroexts. llvm-svn: 150957	2012-02-20 12:02:38 +00:00
Evan Cheng	d0c02966d2	Make post-ra tail duplication bundle safe. No test case as recent codegen flow changes have already hidden the bug. rdar://10893812 llvm-svn: 150949	2012-02-20 07:51:58 +00:00
Benjamin Kramer	c84ded88ea	Silence operator precedence warning. llvm-svn: 150921	2012-02-19 12:25:07 +00:00
Ahmed Charles	636a3d618c	Remove dead code. Improve llvm_unreachable text. Simplify some control flow. llvm-svn: 150918	2012-02-19 11:37:01 +00:00
Lang Hames	13b11527d8	Add machinery for pushing live ranges onto bundle starts while bundling. llvm-svn: 150915	2012-02-19 07:13:05 +00:00
Lang Hames	8140e84757	Simplify moveEnteringDownFrom rules. llvm-svn: 150914	2012-02-19 06:13:56 +00:00
Lang Hames	ed7f1f0b08	Skip through instructions rather than operands when looking for last use slot. llvm-svn: 150912	2012-02-19 04:38:25 +00:00
Lang Hames	da2ed648b5	Fix TODO and trailing whitespace. llvm-svn: 150910	2012-02-19 03:09:55 +00:00
Lang Hames	4645a72763	Defer sanity checks on live intervals until after all have been updated. Hold (LiveInterval, LiveRange) pairs to update, rather than vregs. llvm-svn: 150909	2012-02-19 03:00:30 +00:00
Lang Hames	59761985dd	Bring HMEditor into line with LLVM coding standards. llvm-svn: 150851	2012-02-17 23:43:40 +00:00
Eric Christopher	81e2bf2b77	Ignore the lifetime intrinsics in fast-isel. llvm-svn: 150848	2012-02-17 23:03:39 +00:00
Jakob Stoklund Olesen	a2755ea8f2	Don't print out pointer values in SUnit::dump(). llvm-svn: 150842	2012-02-17 21:44:51 +00:00
Matt Beaumont-Gay	714b99dc84	Sink variable into assert llvm-svn: 150841	2012-02-17 21:40:48 +00:00
Lang Hames	a9afc6ac4a	Add support for regmask slots to HMEditor. Also fixes a comment error. llvm-svn: 150840	2012-02-17 21:29:41 +00:00
Jakob Stoklund Olesen	a0cf42f2e1	Transfer regmasks to MRI. MRI keeps track of which physregs have been used. Make sure it gets updated with all the regmask-clobbered registers. Delete the closePhysRegsUsed() function which isn't necessary. llvm-svn: 150830	2012-02-17 19:07:56 +00:00
Lang Hames	b9057d5fae	Refactor 'handleMove' code in live intervals. Clients of LiveIntervals won't see any changes. Internally this adds a private inner class HMEditor, to LiveIntervals. HMEditor provides an API for updating live intervals when code is moved or bundled. llvm-svn: 150826	2012-02-17 18:44:18 +00:00
Jim Grosbach	905c952efa	Tidy up. llvm-svn: 150820	2012-02-17 17:35:10 +00:00
Jakob Stoklund Olesen	fd7d1b47ba	Revert r150288, "Allow Post-RA LICM to hoist reserved register reads." This caused miscompilations on out-of-tree targets, and possibly i386 as well. I'll find some other way of hoisting %rip-relative loads from loops containing calls. llvm-svn: 150816	2012-02-17 16:40:44 +00:00
David Chisnall	368d460d35	... and it's probably best to use the correct alignment, rather than just guessing that it's the same as the size. llvm-svn: 150813	2012-02-17 16:30:39 +00:00
David Chisnall	8fa1716508	It turns out that putting an 8-byte symbol in a 4-byte section makes Solaris ld sulk. GNU ld is perfectly happy with it, which is worrying for a whole other set of reasons... Thanks to Anton, Duncan and Rafael for helping me track this down. Pointy hat to Rafael for introducing the bug in the first place. llvm-svn: 150811	2012-02-17 16:05:50 +00:00
Lang Hames	3eedcce906	Reverse iterator - should be incrementing rather than decrementing. llvm-svn: 150778	2012-02-17 01:54:11 +00:00
Lang Hames	d9f2152a2e	MachineScheduler shouldn't use/preserve LiveDebugVariables. llvm-svn: 150773	2012-02-17 01:11:37 +00:00
Lang Hames	def9c61e4b	Oops - isRegLiveIntoSuccessor is used in non-assert builds now. Remove NDEBUG guards. llvm-svn: 150771	2012-02-17 00:51:32 +00:00
Lang Hames	5bade3dc6e	Re-enable 150652 and 150654 - Make FPSCR non-reserved, and make MachineCSE bail on reserved registers. This should be safe as of r150786. llvm-svn: 150769	2012-02-17 00:27:16 +00:00
Lang Hames	0d72bb49f0	Turn off assertion, conservatively compute liveness for live-in un-allocatable registers. llvm-svn: 150768	2012-02-17 00:18:18 +00:00
Benjamin Kramer	b0d75c2f4e	Disable machine copy propagation for now. It's known to be buggy (PR11940) and introduces subtle miscompiles in many places. llvm-svn: 150703	2012-02-16 17:29:50 +00:00
James Molloy	920ae8c642	Remove extraneous #include and spelling mistake introduced in r150669. llvm-svn: 150670	2012-02-16 09:48:07 +00:00
James Molloy	67b6b11b52	Modify the algorithm when traversing the DAGCombiner's worklist to be O(log N) for all operations. This fixes a horrible worst case with lots of nodes where 99% of the time was being spent in std::remove. llvm-svn: 150669	2012-02-16 09:17:04 +00:00
Lang Hames	55a2a96153	Oop - r150653 + r150654 broke one of my test cases. Backing out for now... llvm-svn: 150655	2012-02-16 02:32:10 +00:00
Lang Hames	2055493b97	MachineCSE shouldn't extend the live ranges of reserved or allocatable registers. llvm-svn: 150653	2012-02-16 02:19:35 +00:00
Jakob Stoklund Olesen	e9e30d083c	Handle register masks in branch folding. Don't attempt to move instructions with regmask operands. They are most likely calls anyway. llvm-svn: 150634	2012-02-15 23:42:54 +00:00
Andrew Trick	20349b88a6	Fix library visibility problems with VLIWPacketizer. The existing framework for postra scheduling is library local. We want to keep it that way. Soon we will have a more general MachineScheduler interface. At that time, various bits will be exposed to targets. In the meantime, the VLIWPacketizer wants to use ScheduleDAGInstrs directly, so it needs to wrapped in a PIMPL to avoid exposing it to the target interface. llvm-svn: 150633	2012-02-15 23:34:15 +00:00
Lang Hames	923d199a67	Make LiveIntervals::handleMove() bundle aware. llvm-svn: 150630	2012-02-15 23:21:33 +00:00
Bill Wendling	a0009ee85a	Use 'getDataNoRel' for the section kind. llvm-svn: 150628	2012-02-15 22:47:53 +00:00
Lang Hames	f15502f2e5	Fix assertion condition. llvm-svn: 150627	2012-02-15 22:45:51 +00:00
Bill Wendling	734909a078	Modify the code that emits the module flags to use the new module flags accessor method. This allows the target lowering code to not have to deal with MDNodes. Also, avoid leaking memory like a sieve by not creating a global variable for the image info section, but just emitting the code directly. llvm-svn: 150624	2012-02-15 22:36:15 +00:00
Andrew Trick	690a1fb045	Don't expose DefaultVLIWScheduler llvm-svn: 150619	2012-02-15 22:06:21 +00:00
Lang Hames	1b34a72f52	Remove overly conservative assert. llvm-svn: 150608	2012-02-15 19:04:53 +00:00
Andrew Trick	7a35faea5d	Generic "VLIW" packetizer based on a DFA generated from target itinerary. Patch by Sundeep! llvm-svn: 150607	2012-02-15 18:55:14 +00:00
Andrew Trick	899f46c113	Revert r150565 again. Appears to be a stage2 failure with dragonegg. I'll put MachineLICM back before PEI. All my arm/x86 benchmarks look good, but buildbots don't like it. llvm-svn: 150568	2012-02-15 07:57:03 +00:00
Andrew Trick	56d412a147	Reapply r150565 with the typo fix properly merged. llvm-svn: 150567	2012-02-15 05:43:27 +00:00
Andrew Trick	dd5beb78a7	reverting r150565. Premature push. llvm-svn: 150566	2012-02-15 05:22:12 +00:00
Andrew Trick	d83284c196	Move PostRAMachineLICM into MachineLateOptimization. It now runs after PEI! llvm-svn: 150565	2012-02-15 05:13:47 +00:00
Andrew Trick	e9a951c00b	Allow CodeGen (llc) command line options to work as expected. The llc command line options for enabling/disabling passes are local to CodeGen/Passes.cpp. This patch associates those options with standard pass IDs so they work regardless of how the target configures the passes. A target has two ways of overriding standard passes: 1) Redefine the pass pipeline (override TargetPassConfig::add%Stage) 2) Replace or suppress individiual passes with TargetPassConfig::substitutePass. In both cases, the command line options associated with the pass override the target default. For example, say a target wants to disable machine instruction scheduling by default: - The target calls disablePass(MachineSchedulerID) but otherwise does not override any TargetPassConfig methods. - Without any llc options, no scheduler is run. - With -enable-misched, the standard machine scheduler is run and honors the -misched=... flag to select the scheduler variant, which may be used for performance evaluation or testing. Sorry overridePass is ugly. I haven't thought of a better way without replacing the cl::opt framework. I hope to do that one day... I haven't figured out why CodeGen uses char& for pass IDs. AnalysisID is much easier to use and less bug prone. I'm using it wherever I can for internal implementation. Maybe later we can change the global pass ID definitions as well. llvm-svn: 150563	2012-02-15 03:21:51 +00:00
Andrew Trick	c9ce9d2315	Added TargetPassConfig::disablePass/substitutePass as a general mechanism to override specific passes. llvm-svn: 150562	2012-02-15 03:21:47 +00:00
Lang Hames	84f454ec5c	Don't emit live ranges for physregs live-ins that are dead. llvm-svn: 150553	2012-02-15 01:31:10 +00:00
Lang Hames	77d205152a	Disentangle moving a machine instr from updating LiveIntervals. llvm-svn: 150552	2012-02-15 01:23:52 +00:00
Pete Cooper	4dd0963d56	Added hook to let targets custom lower splitting of illegal vectors llvm-svn: 150550	2012-02-15 00:55:31 +00:00
Jakob Stoklund Olesen	c4cf13f791	Fix global live range splitting regmask accuracy. Pretend that regmask interference ends at the 'dead' slot, even when there is other interference ending at the 'reg' slot of the same instruction. llvm-svn: 150531	2012-02-14 23:53:23 +00:00
Jakob Stoklund Olesen	b0c0d340f8	Fix details in local live range splitting with regmasks. Perform all comparisons at instruction granularity, and make sure register masks on uses count in both gaps. llvm-svn: 150530	2012-02-14 23:51:27 +00:00
Jakob Stoklund Olesen	e7d3f441b5	Handle regmasks in findRegisterDefOperandIdx(). Only accept register masks when looking for an 'overlapping' def. When Overlap is not set, the function searches for a proper definition of Reg. This means MI->modifiesRegister() considers register masks, but MI->definesRegister() doesn't. llvm-svn: 150529	2012-02-14 23:49:37 +00:00
Jakob Stoklund Olesen	fab5201e22	Use the proper clobber check in handleLiveInRegister(). When a physreg is live in to a basic block, look for any instruction in the block that clobbers the physreg. The instruction doesn't have to properly redefine the register, any overlapping clobber is OK. This slightly changes live ranges when compiling with register masks. llvm-svn: 150528	2012-02-14 23:46:24 +00:00
Jakob Stoklund Olesen	20d25a7f40	Dump live intervals in numerical order. The old DenseMap hashed order was very confusing. llvm-svn: 150527	2012-02-14 23:46:21 +00:00
Lang Hames	e64294ef84	Don't create a new copy of reserved regs - we already have one handy. llvm-svn: 150525	2012-02-14 23:06:12 +00:00
Bill Wendling	06df7725fc	Add code to the target lowering object file module to handle module flags. The MachO back-end needs to emit the garbage collection flags specified in the module flags. This is a WIP, so the front-end hasn't been modified to emit these flags just yet. Documentation and front-end switching to occur soon. llvm-svn: 150507	2012-02-14 21:28:13 +00:00
Lang Hames	1ce837af7e	Update MachineVerifier to check the new physreg live-in rules. llvm-svn: 150496	2012-02-14 19:17:48 +00:00
Lang Hames	595111f221	Tighten physical register invariants: Allocatable physical registers can only be live in to a block if it is the function entry point or a landing pad. llvm-svn: 150494	2012-02-14 18:51:53 +00:00
Nadav Rotem	29984ba033	Fix PR12000. Some vector operations may use scalar operands with types that are greater than the vector element type. For example BUILD_VECTOR of type <1 x i1> with a constant i8 operand. This patch fixes the assertion. llvm-svn: 150477	2012-02-14 13:06:32 +00:00
Benjamin Kramer	0e3791efd1	Turn push_back loops into append/insert. llvm-svn: 150471	2012-02-14 10:29:27 +00:00
Lang Hames	29d6ed6416	Rename getExceptionAddressRegister() to getExceptionPointerRegister() for consistency with setExceptionPointerRegister(...). llvm-svn: 150460	2012-02-14 04:45:49 +00:00
Lang Hames	3365179018	Use convenience function for consistency. llvm-svn: 150457	2012-02-14 03:04:29 +00:00
Bill Wendling	05d6f2ff1e	Don't reserve the R0 and R1 registers here. We don't use these registers, and marking them as "live-in" into a BB ruins some invariants that the back-end tries to maintain. llvm-svn: 150437	2012-02-13 23:47:16 +00:00
Bill Wendling	05f7380b33	Don't recalculate the size of the vector each time through the loop. llvm-svn: 150436	2012-02-13 23:45:26 +00:00
Jakob Stoklund Olesen	2ceea93dd3	Add register mask support to ScheduleDAGRRList. The scheduler will sometimes check the implicit-def list on instructions to properly handle pre-colored DAG edges. Also check any register mask operands for physreg clobbers. llvm-svn: 150428	2012-02-13 23:25:24 +00:00
Andrew Trick	5188c0020c	LiveIntervalAnalysis does not depend on MachineLoopInfo. llvm-svn: 150411	2012-02-13 20:44:42 +00:00
Jakob Stoklund Olesen	6f8fe71216	Check regmask interference for -join-physregs. llvm-svn: 150404	2012-02-13 18:17:04 +00:00
Nadav Rotem	0c65064dbe	Fix a bug in DAGCombine for the optimization of BUILD_VECTOR. We cant generate a shuffle node from two vectors of different types. llvm-svn: 150383	2012-02-13 12:42:26 +00:00
Nadav Rotem	34ca89afa8	This patch addresses the problem of poor code generation for the zext v8i8 -> v8i32 on AVX machines. The codegen often scalarizes ANY_EXTEND nodes. The DAGCombiner has two optimizations that can mitigate the problem. First, if all of the operands of a BUILD_VECTOR node are extracted from an ZEXT/ANYEXT nodes, then it is possible to create a new simplified BUILD_VECTOR which uses UNDEFS/ZERO values to eliminate the scalar ZEXT/ANYEXT nodes. Second, another dag combine optimization lowers BUILD_VECTOR into a shuffle vector instruction. In the case of zext v8i8->v8i32 on AVX, a value in an XMM register is to be shuffled into a wide YMM register. This patch modifes the second optimization and allows the creation of shuffle vectors even when the newly generated vector and the original vector from which we extract the values are of different types. llvm-svn: 150340	2012-02-12 15:05:31 +00:00
Anton Korobeynikov	c6b4017ce2	Add support for implicit TLS model used with MS VC runtime. Patch by Kai Nacke! llvm-svn: 150307	2012-02-11 17:26:53 +00:00
Andrew Trick	ee874db886	Add TargetPassConfig hooks for scheduling/bundling. In case the MachineScheduling pass I'm working on doesn't work well for another target, they can completely override it. This also adds a hook immediately after the RegAlloc pass to cleanup immediately after vregs go away. We may want to fold it into the postRA hook later. llvm-svn: 150298	2012-02-11 07:11:32 +00:00
Jakob Stoklund Olesen	fd338e9777	Allow Post-RA LICM to hoist reserved register reads. When using register masks, registers like %rip are clobbered by the register mask. LICM should still be able to hoist instructions reading %rip from a loop containing calls. llvm-svn: 150288	2012-02-11 00:44:19 +00:00
Jakob Stoklund Olesen	17402e3d5a	Handle register masks in local live range splitting. Again the goal is to produce identical assembly with register mask operands enabled. llvm-svn: 150287	2012-02-11 00:42:18 +00:00
Jakob Stoklund Olesen	c8046c02c2	Don't read PreRegAlloc before it is initialized. llvm-svn: 150286	2012-02-11 00:40:36 +00:00
Jakob Stoklund Olesen	024d7ae110	Add a static MachineOperand::clobbersPhysReg(). It can be necessary to detach a register mask pointer from its MachineOperand. This method is convenient for checking clobbered physregs on a detached bitmask pointer. llvm-svn: 150261	2012-02-10 19:23:53 +00:00

... 5 6 7 8 9 ...

13710 Commits