llvm-project

Commit Graph

Author	SHA1	Message	Date
Jakob Stoklund Olesen	acd27c9279	Revert r161633-161634 "Partition use lists so defs always come before uses." These commits broke a number of buildbots. llvm-svn: 161640	2012-08-09 23:31:36 +00:00
Jakob Stoklund Olesen	df01e00710	Partition use lists so defs always come before uses. This makes it possible to speed up def_iterator by stopping at the first use. This makes def_empty() and getUniqueVRegDef() much faster when there are many uses. In a +Asserts build, LiveVariables is 100x faster in one case because getVRegDef() has an assertion that would scan to the end of a def_iterator chain. Spill weight calculation is significantly faster (300x in one case) because isTriviallyReMaterializable() calls MRI->isConstantPhysReg(%RIP) which calls def_empty(%RIP). llvm-svn: 161634	2012-08-09 22:49:46 +00:00
Jakob Stoklund Olesen	7d7051ca3c	Don't use pointer-pointers for the register use lists. Use a more conventional doubly linked list where the Prev pointers form a cycle. This means it is no longer necessary to adjust the Prev pointers when reallocating the VRegInfo array. The test changes are required because the register allocation hint is using the use-list order to break ties. llvm-svn: 161633	2012-08-09 22:49:42 +00:00
Jakob Stoklund Olesen	c4102d4902	Move use list management into MachineRegisterInfo. Register MachineOperands are kept in linked lists accessible via MRI's reg_iterator interfaces. The linked list management was handled partly by MachineOperand methods, partly by MRI methods. Move all of the list management into MRI, delete MO::AddRegOperandToRegInfo() and MO::RemoveRegOperandFromRegInfo(). Be more explicit about handling the cases where an MRI pointer isn't available. llvm-svn: 161632	2012-08-09 22:49:37 +00:00
Jakob Stoklund Olesen	420798ca4f	Fix a future TwoAddressInstructionPass crash. No test case, the crash only happens when the default use list order is changed. llvm-svn: 161627	2012-08-09 22:08:26 +00:00
Nadav Rotem	e0f84d31c8	Fix the legalization of ExtLoad on ARM. ExpandUnalignedLoad did not properly handle the cases where the memory value type was illegal. PR 13111. llvm-svn: 161565	2012-08-09 01:56:44 +00:00
Jakob Stoklund Olesen	f71bc7b267	Don't use getNextOperandForReg() in RAFast. That particular optimization was probably premature anyway. llvm-svn: 161541	2012-08-08 23:44:01 +00:00
Jakob Stoklund Olesen	bf1ac4bdc3	Deal with irreducible control flow when building traces. We filter out MachineLoop back-edges during the trace-building PO traversals, but it is possible to have CFG cycles that aren't natural loops, and MachineLoopInfo doesn't include such cycles. Use a standard visited set to detect such CFG cycles, and completely ignore them when picking traces. llvm-svn: 161532	2012-08-08 22:12:01 +00:00
Jakob Stoklund Olesen	fa8a26f9df	Heed -stress-early-ifcvt. llvm-svn: 161513	2012-08-08 18:24:23 +00:00
Jakob Stoklund Olesen	e71b6c6b20	Get the MispredictPenalty from MCSchedModel. Thanks, Andy! llvm-svn: 161507	2012-08-08 18:19:58 +00:00
Andrew Trick	db9b1b5e66	Minor cleanup of defaultDefLatency API llvm-svn: 161470	2012-08-08 02:44:11 +00:00
Jakob Stoklund Olesen	0556be983d	Revert "Fix a quadratic algorithm in MachineBranchProbabilityInfo." It caused an assertion failure when compiling consumer-typeset. llvm-svn: 161463	2012-08-08 01:10:31 +00:00
Manman Ren	1be131ba27	X86: enable CSE between CMP and SUB We perform the following: 1> Use SUB instead of CMP for i8,i16,i32 and i64 in ISel lowering. 2> Modify MachineCSE to correctly handle implicit defs. 3> Convert SUB back to CMP if possible at peephole. Removed pattern matching of (a>b) ? (a-b):0 and like, since they are handled by peephole now. rdar://11873276 llvm-svn: 161462	2012-08-08 00:51:41 +00:00
Jakob Stoklund Olesen	c0b61ff9c7	Fix a quadratic algorithm in MachineBranchProbabilityInfo. The getSumForBlock function was quadratic in the number of successors because getSuccWeight would perform a linear search for an already known iterator. llvm-svn: 161460	2012-08-08 00:20:37 +00:00
Jakob Stoklund Olesen	fbf45dc2bd	Skip tied operand pairs that already have the same register. llvm-svn: 161454	2012-08-07 22:47:06 +00:00
Jakob Stoklund Olesen	505715d816	Add SelectionDAG::getTargetIndex. This adds support for TargetIndex operands during isel. The meaning of these (index, offset, flags) operands is entirely defined by the target. llvm-svn: 161453	2012-08-07 22:37:05 +00:00
Bill Wendling	61396b81a4	For non-Darwin platforms, we want to generate stack protectors only for character arrays. This is in line with what GCC does. <rdar://problem/10529227> llvm-svn: 161446	2012-08-07 20:59:05 +00:00
Jakob Stoklund Olesen	84689b0d5a	Add a new kind of MachineOperand: MO_TargetIndex. A target index operand looks a lot like a constant pool reference, but it is completely target-defined. It contains the 8-bit TargetFlags, a 32-bit index, and a 64-bit offset. It is preserved by all code generator passes. TargetIndex operands can be used to carry target-specific information in cases where immediate operands won't suffice. llvm-svn: 161441	2012-08-07 18:56:39 +00:00
Jakob Stoklund Olesen	296448b293	Fix a couple of typos. llvm-svn: 161437	2012-08-07 18:32:57 +00:00
Jakob Stoklund Olesen	75d9d5159e	Add trace accessor methods, implement primitive if-conversion heuristic. Compare the critical paths of the two traces through an if-conversion candidate. If the difference is larger than the branch brediction penalty, reject the if-conversion. If would never pay. llvm-svn: 161433	2012-08-07 18:02:19 +00:00
Chandler Carruth	881d0a7966	Add a much more conservative strategy for aligning branch targets. Previously, MBP essentially aligned every branch target it could. This bloats code quite a bit, especially non-looping code which has no real reason to prefer aligned branch targets so heavily. As Andy said in review, it's still a bit odd to do this without a real cost model, but this at least has much more plausible heuristics. Fixes PR13265. llvm-svn: 161409	2012-08-07 09:45:24 +00:00
Manman Ren	cb36b8c2e6	MachineCSE: Update the heuristics for isProfitableToCSE. If the result of a common subexpression is used at all uses of the candidate expression, CSE should not increase the live range of the common subexpression. rdar://11393714 and rdar://11819721 llvm-svn: 161396	2012-08-07 06:16:46 +00:00
Jakob Stoklund Olesen	a9d0b850b3	Delete a dead variable. TwoAddressInstructionPass doesn't remat any more. llvm-svn: 161285	2012-08-04 00:04:03 +00:00
Jakob Stoklund Olesen	a0c72ecf79	TwoAddressInstructionPass refactoring: Extract another method. llvm-svn: 161284	2012-08-03 23:57:58 +00:00
Bob Wilson	874886cd66	Refactor and check "onlyReadsMemory" before optimizing builtins. This patch is mostly just refactoring a bunch of copy-and-pasted code, but it also adds a check that the call instructions are readnone or readonly. That check was already present for sin, cos, sqrt, log2, and exp2 calls, but it was missing for the rest of the builtins being handled in this code. llvm-svn: 161282	2012-08-03 23:29:17 +00:00
Jakob Stoklund Olesen	1162a1548b	TwoAddressInstructionPass refactoring: Extract a method. No functional change intended, except replacing a DenseMap with a SmallDenseMap which should behave identically. llvm-svn: 161281	2012-08-03 23:25:45 +00:00
Jakob Stoklund Olesen	24bc514c0c	Begin adding support for updating LiveIntervals in TwoAddressInstructionPass. This is far from complete, and only changes behavior when the -early-live-intervals flag is passed to llc. llvm-svn: 161273	2012-08-03 22:58:34 +00:00
Jakob Stoklund Olesen	1c46589290	Add an experimental -early-live-intervals option. This option runs LiveIntervals before TwoAddressInstructionPass which will eventually learn to exploit and update the analysis. Eventually, LiveIntervals will run before PHIElimination, and we can get rid of LiveVariables. llvm-svn: 161270	2012-08-03 22:12:54 +00:00
Jakob Stoklund Olesen	918999db95	Delete merged physreg copies in joinReservedPhysReg(). Previously, the identity copy would survive through register allocation before it was removed by the rewriter. llvm-svn: 161269	2012-08-03 22:12:51 +00:00
Bob Wilson	871701c606	Try to reduce the compile time impact of r161232. The previous change caused fast isel to not attempt handling any calls to builtin functions. That included things like "printf" and caused some noticable regressions in compile time. I wanted to avoid having fast isel keep a separate list of functions that had to be kept in sync with what the code in SelectionDAGBuilder.cpp was handling. I've resolved that here by moving the list into TargetLibraryInfo. This is somewhat redundant in SelectionDAGBuilder but it will ensure that we keep things consistent. llvm-svn: 161263	2012-08-03 21:26:24 +00:00
Bob Wilson	fa59485b94	Fix memcmp code-gen to honor -fno-builtin. I noticed that SelectionDAGBuilder::visitCall was missing a check for memcmp in TargetLibraryInfo, so that it would use custom code for memcmp calls even with -fno-builtin. I also had to add a new -disable-simplify-libcalls option to llc so that I could write a test for this. llvm-svn: 161262	2012-08-03 21:26:18 +00:00
Jakob Stoklund Olesen	daae19f785	Completely eliminate VNInfo flags. The 'unused' state of a value number can be represented as an invalid def SlotIndex. This also exposed code that shouldn't have been looking at unused value VNInfos. llvm-svn: 161258	2012-08-03 20:59:32 +00:00
Jakob Stoklund Olesen	21809385a6	Fix a couple of loops that were processing unused value numbers. Unused VNInfos should be left alone. Their def SlotIndex doesn't point to anything. llvm-svn: 161257	2012-08-03 20:59:29 +00:00
Matt Beaumont-Gay	aaba08d503	Silence unused variable warning in -asserts build llvm-svn: 161256	2012-08-03 20:54:11 +00:00
Jakob Stoklund Olesen	9f565e19c5	Eliminate the VNInfo::hasPHIKill() flag. The only real user of the flag was removeCopyByCommutingDef(), and it has been switched to LiveIntervals::hasPHIKill(). All the code changed by this patch was only concerned with computing and propagating the flag. llvm-svn: 161255	2012-08-03 20:19:44 +00:00
Jakob Stoklund Olesen	06d6a5363b	Make the hasPHIKills flag a computed property. The VNInfo::HAS_PHI_KILL is only half supported. We precompute it in LiveIntervalAnalysis, but it isn't properly updated by live range splitting and functions like shrinkToUses(). It is only used in one place: RegisterCoalescer::removeCopyByCommutingDef(). This patch changes that function to use a new LiveIntervals::hasPHIKill() function that computes the flag for a given value number. llvm-svn: 161254	2012-08-03 20:10:24 +00:00
Jakob Stoklund Olesen	19c4596629	Delete dead function. llvm-svn: 161242	2012-08-03 15:21:21 +00:00
Jakob Stoklund Olesen	47ac20d4d6	Don't delete dead code in TwoAddressInstructionPass. This functionality was added before we started running DeadMachineInstructionElim on all targets. It serves no purpose now. llvm-svn: 161241	2012-08-03 15:11:57 +00:00
Bob Wilson	3e6fa462f3	Fall back to selection DAG isel for calls to builtin functions. Fast isel doesn't currently have support for translating builtin function calls to target instructions. For embedded environments where the library functions are not available, this is a matter of correctness and not just optimization. Most of this patch is just arranging to make the TargetLibraryInfo available in fast isel. <rdar://problem/12008746> llvm-svn: 161232	2012-08-03 04:06:28 +00:00
Manman Ren	ba8122cc25	X86 Peephole: fold loads to the source register operand if possible. Add more comments and use early returns to reduce nesting in isLoadFoldable. Also disable folding for V_SET0 to avoid introducing a const pool entry and a const pool load. rdar://10554090 and rdar://11873276 llvm-svn: 161207	2012-08-02 19:37:32 +00:00
Jakob Stoklund Olesen	5d30630e22	Compute the critical path length through a trace. Whenever both instruction depths and instruction heights are known in a block, it is possible to compute the length of the critical path as max(depth+height) over the instructions in the block. The stored live-in lists make it possible to accurately compute the length of a critical path that bypasses the current (small) block. llvm-svn: 161197	2012-08-02 18:45:54 +00:00
Jakob Stoklund Olesen	637c467528	Verify regunit intervals along with virtreg intervals. Don't cause regunit intervals to be computed just to verify them. Only check the already cached intervals. llvm-svn: 161183	2012-08-02 16:36:50 +00:00
Jakob Stoklund Olesen	374071dde2	Avoid creating dangling physreg live ranges during DCE. LiveRangeEdit::eliminateDeadDefs() can delete a dead instruction that reads unreserved physregs. This would leave the corresponding regunit live interval dangling because we don't have shrinkToUses() for physical registers. Fix this problem by turning the instruction into a KILL instead of deleting it. This happens in a landing pad in test/CodeGen/X86/2012-05-19-CoalescerCrash.ll: %vreg27<def,dead> = COPY %EDX<kill>; GR32:%vreg27 becomes: KILL %EDX<kill> An upcoming fix to the machine verifier will catch problems like this by verifying regunit live intervals. This fixes PR13498. I am not including the test case from the PR since we already have one exposing the problem once the verifier is fixed. llvm-svn: 161182	2012-08-02 16:36:47 +00:00
Jakob Stoklund Olesen	bde5dc5e46	Add report() functions that take a LiveInterval argument. llvm-svn: 161178	2012-08-02 14:31:49 +00:00
Manman Ren	5759d01230	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. This patch is a rework of r160919 and was tested on clang self-host on my local machine. rdar://10554090 and rdar://11873276 llvm-svn: 161152	2012-08-02 00:56:42 +00:00
Jakob Stoklund Olesen	e736b97eff	Extract some methods from verifyLiveIntervals. No functional change. llvm-svn: 161149	2012-08-02 00:20:20 +00:00
Jakob Stoklund Olesen	a766b4746d	Also verify RegUnit intervals at uses. llvm-svn: 161147	2012-08-01 23:52:40 +00:00
Jakob Stoklund Olesen	2db6b65330	Compute instruction heights through a trace. The height on an instruction is the minimum number of cycles from the instruction is issued to the end of the trace. Heights are computed for all instructions in and below the trace center block. The method for computing heights is different from the depth computation. As we visit instructions in the trace bottom-up, heights of used instructions are pushed upwards. This way, we avoid scanning long use lists, looking for uses in the current trace. At each basic block boundary, a list of live-in registers and their minimum heights is saved in the trace block info. These live-in lists are used when restarting depth computations on a trace that converges with an already computed trace. They will also be used to accurately compute the critical path length. llvm-svn: 161138	2012-08-01 22:36:00 +00:00
Eric Christopher	b1b9451337	Temporarily revert c23b933d5f8be9b51a1d22e717c0311f65f87dcd. It's causing failures in the debug testsuite and possibly PR13486. llvm-svn: 161121	2012-08-01 18:19:01 +00:00
Jakob Stoklund Olesen	5e19d35e9a	Add DataDep constructors. Explicitly check SSA form. llvm-svn: 161115	2012-08-01 16:02:59 +00:00
Elena Demikhovsky	3cb3b0045c	Added FMA functionality to X86 target. llvm-svn: 161110	2012-08-01 12:06:00 +00:00
Manman Ren	f288d2f120	MachineSink: Sort the successors before trying to find SuccToSinkTo. Use stable_sort instead of sort. Follow-up to r161062. rdar://11980766 llvm-svn: 161075	2012-07-31 20:45:38 +00:00
Jakob Stoklund Olesen	059e647c6d	Compute instruction depths through the current trace. Assuming infinite issue width, compute the earliest each instruction in the trace can issue, when considering the latency of data dependencies. The issue cycle is record as a 'depth' from the beginning of the trace. This is half the computation required to find the length of the critical path through the trace. Heights are next. llvm-svn: 161074	2012-07-31 20:44:38 +00:00
Jakob Stoklund Olesen	1dfb101835	Rename CT -> MTM. MachineTraceMetrics is abbreviated MTM. llvm-svn: 161072	2012-07-31 20:25:13 +00:00
Manman Ren	8c549b586c	MachineSink: Sort the successors before trying to find SuccToSinkTo. One motivating example is to sink an instruction from a basic block which has two successors: one outside the loop, the other inside the loop. We should try to sink the instruction outside the loop. rdar://11980766 llvm-svn: 161062	2012-07-31 18:10:39 +00:00
Micah Villmow	b67d7a3a33	Conform to LLVM coding style. llvm-svn: 161061	2012-07-31 18:07:43 +00:00
Micah Villmow	6b12f596ef	Don't generate ordered or unordered comparison operations if it is not legal to do so. llvm-svn: 161053	2012-07-31 16:48:03 +00:00
Jakob Stoklund Olesen	0c807dfae2	Clear kill flags in removeCopyByCommutingDef(). We are extending live ranges, so kill flags are not accurate. They aren't needed until they are recomputed after RA anyway. <rdar://problem/11950722> llvm-svn: 161023	2012-07-31 02:47:24 +00:00
Manman Ren	2b6a0dfd4c	Reverse order of the two branches at end of a basic block if it is profitable. We branch to the successor with higher edge weight first. Convert from je LBB4_8 --> to outer loop jmp LBB4_14 --> to inner loop to jne LBB4_14 jmp LBB4_8 PR12750 rdar: 11393714 llvm-svn: 161018	2012-07-31 01:11:07 +00:00
Andrew Trick	79795897b3	Use the latest MachineRegisterInfo APIs. No functionality. llvm-svn: 161010	2012-07-30 23:48:17 +00:00
Andrew Trick	535a23c38b	Inline MachineRegisterInfo::hasOneUse llvm-svn: 161007	2012-07-30 23:48:12 +00:00
Jakob Stoklund Olesen	68c2cd059e	Avoid looking at stale data in verifyAnalysis(). llvm-svn: 161004	2012-07-30 23:15:12 +00:00
Jakob Stoklund Olesen	c14cf57ba9	Allow traces to enter nested loops. This lets traces include the final iteration of a nested loop above the center block, and the first iteration of a nested loop below the center block. We still don't allow traces to contain backedges, and traces are truncated where they would leave a loop, as seen from the center block. llvm-svn: 161003	2012-07-30 23:15:10 +00:00
Jakob Stoklund Olesen	984cfe8322	Clarify invalidation strategy in comment. llvm-svn: 160997	2012-07-30 21:16:22 +00:00
Jakob Stoklund Olesen	f308c128ea	Assert that all trace candidate blocks have been visited by the PO. When computing a trace, all the candidates for pred/succ must have been visited. Filter out back-edges first, though. The PO traversal ignores them. Thanks to Andy for spotting this in review. llvm-svn: 160995	2012-07-30 21:10:27 +00:00
Jakob Stoklund Olesen	a12a7d5f74	Hook into PassManager's analysis verification. By overriding Pass::verifyAnalysis(), the pass contents will be verified by the pass manager. llvm-svn: 160994	2012-07-30 20:57:50 +00:00
Pete Cooper	91244268d7	Consider address spaces for hashing and CSEing DAG nodes. Otherwise two loads from different x86 segments but the same address would get CSEd llvm-svn: 160987	2012-07-30 20:23:19 +00:00
Jakob Stoklund Olesen	7361846f32	Add MachineInstr::isTransient(). This is a cleaned up version of the isFree() function in MachineTraceMetrics.cpp. Transient instructions are very unlikely to produce any code in the final output. Either because they get eliminated by RegisterCoalescing, or because they are pseudo-instructions like labels and debug values. llvm-svn: 160977	2012-07-30 18:34:14 +00:00
Jakob Stoklund Olesen	3df6c46fdd	Add MachineTraceMetrics::verify(). This function verifies the consistency of cached data in the MachineTraceMetrics analysis. llvm-svn: 160976	2012-07-30 18:34:11 +00:00
Jakob Stoklund Olesen	eb488fe165	Verify that the CFG hasn't changed during invalidate(). The MachineTraceMetrics analysis must be invalidated before modifying the CFG. This will catch some of the violations of that rule. llvm-svn: 160969	2012-07-30 17:36:49 +00:00
Jakob Stoklund Olesen	fee94ca15b	Add MachineBasicBlock::isPredecessor(). A->isPredecessor(B) is the same as B->isSuccessor(A), but it can tolerate a B that is null or dangling. This shouldn't happen normally, but it it useful for verification code. llvm-svn: 160968	2012-07-30 17:36:47 +00:00
Manman Ren	f87dd7c01b	Revert r160920 and r160919 due to dragonegg and clang selfhost failure llvm-svn: 160927	2012-07-29 02:44:09 +00:00
Manman Ren	0fa3ab88ba	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. rdar://10554090 and rdar://11873276 llvm-svn: 160919	2012-07-28 16:48:01 +00:00
Andrew Trick	940534371b	Reenable a basic SSA DAG builder optimization. Jakob fixed ProcessImplicifDefs in r159149. llvm-svn: 160910	2012-07-28 01:48:15 +00:00
Jakob Stoklund Olesen	0563369755	Add more debug output to MachineTraceMetrics. llvm-svn: 160905	2012-07-27 23:58:38 +00:00
Jakob Stoklund Olesen	1152202cc2	Keep track of the head and tail of the trace through each block. This makes it possible to quickly detect blocks that are outside the trace. llvm-svn: 160904	2012-07-27 23:58:36 +00:00
Eric Christopher	86ca9f9e11	Add a DW_AT_high_pc for CUs that are a single address range. Update all tests accordingly. Fixes PR13351. Patch by shinichiro hamaji! llvm-svn: 160899	2012-07-27 22:00:05 +00:00
Jakob Stoklund Olesen	7dfe7abdee	Also compute register mask lists under -new-live-intervals. llvm-svn: 160898	2012-07-27 21:56:39 +00:00
Jakob Stoklund Olesen	97e14e02f1	Eliminate the IS_PHI_DEF flag and VNInfo::setIsPHIDef(). A value number is a PHI def if and only if it begins at a block boundary. This can be derived from the def slot, a separate flag is not necessary. llvm-svn: 160893	2012-07-27 21:11:14 +00:00
Jakob Stoklund Olesen	4021a7bf25	Add a -new-live-intervals experimental option. This option replaces the existing live interval computation with one based on LiveRangeCalc.cpp. The new algorithm does not depend on LiveVariables, and it can be run at any time, before or after leaving SSA form. llvm-svn: 160892	2012-07-27 20:58:46 +00:00
Jakob Stoklund Olesen	bc65e8f94e	Add <imp-def> of super-register when lowering SUBREG_TO_REG. Patch by Tyler Nowicki! llvm-svn: 160888	2012-07-27 20:19:49 +00:00
Jakob Stoklund Olesen	35400b1dda	Use an otherwise unused variable. llvm-svn: 160798	2012-07-26 19:42:56 +00:00
Jakob Stoklund Olesen	f9029fef2a	Start scaffolding for a MachineTraceMetrics analysis pass. This is still a work in progress. Out-of-order CPUs usually execute instructions from multiple basic blocks simultaneously, so it is necessary to look at longer traces when estimating the performance effects of code transformations. The MachineTraceMetrics analysis will pick a typical trace through a given basic block and provide performance metrics for the trace. Metrics will include: - Instruction count through the trace. - Issue count per functional unit. - Critical path length, and per-instruction 'slack'. These metrics can be used to determine the performance limiting factor when executing the trace, and how it will be affected by a code transformation. Initially, this will be used by the early if-conversion pass. llvm-svn: 160796	2012-07-26 18:38:11 +00:00
Dan Gohman	0b3d782933	Add a floor intrinsic. llvm-svn: 160791	2012-07-26 17:43:27 +00:00
Manman Ren	cc1dc6dc11	Disable rematerialization in TwoAddressInstructionPass. It is redundant; RegisterCoalescer will do the remat if it can't eliminate the copy. Collected instruction counts before and after this. A few extra instructions are generated due to spilling but it is normal to see these kinds of changes with almost any small codegen change, according to Jakob. This also fixed rdar://11830760 where xor is expected instead of movi0. llvm-svn: 160749	2012-07-25 18:28:13 +00:00
Jakob Stoklund Olesen	cef9a618b1	Preserve 2-addr constraints in ConnectedVNInfoEqClasses. When a live range splits into multiple connected components, we would arbitrarily assign <undef> uses to component 0. This is wrong when the use is tied to a def that gets assigned to a different component: %vreg69<def> = ADD8ri %vreg68<undef>, 1 The use and def must get the same virtual register. Fix this by assigning <undef> uses to the same component as the value defined by the instruction, if any: %vreg69<def> = ADD8ri %vreg69<undef>, 1 This fixes PR13402. The PR has a test case which I am not including because it is unlikely to keep exposing this behavior in the future. llvm-svn: 160739	2012-07-25 17:15:15 +00:00
Jakob Stoklund Olesen	c6fd3deee6	Verify two-address constraints more carefully. Include <undef> operands and virtual registers after leaving SSA form. llvm-svn: 160734	2012-07-25 16:49:11 +00:00
Craig Topper	17300940ae	Change llvm_unreachable in SplitVectorOperand to report_fatal_error. Keeps release builds from crashing if code uses an intrinsic with an illegal type. llvm-svn: 160661	2012-07-24 04:11:21 +00:00
Sylvestre Ledru	35521e2310	Fix a typo (the the => the) llvm-svn: 160621	2012-07-23 08:51:15 +00:00
Nadav Rotem	9056076cab	Fixed DAGCombine optimizations which generate select_cc for targets that do not support it (X86 does not lower select_cc). PR: 13428 Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160619	2012-07-23 07:59:50 +00:00
Craig Topper	2694c05e86	Tidy up. Fix indentation and remove trailing whitespace. llvm-svn: 160617	2012-07-23 05:38:07 +00:00
Craig Topper	b49546a3b3	Change llvm_unreachable in SplitVectorResult to report_fatal_error. Keeps release builds from crashing if code uses an intrinsic with an illegal type. For instance 256-bit AVX intrinsics without having AVX enabled. llvm-svn: 160616	2012-07-23 04:34:49 +00:00
Benjamin Kramer	5be8f60126	Remove unused private member variables uncovered by the recent changes to clang's -Wunused-private-field. llvm-svn: 160583	2012-07-20 22:05:57 +00:00
Jakob Stoklund Olesen	e2cfd0d45a	Avoid folding loads that are unsafe to move. LiveRangeEdit::foldAsLoad() can eliminate a register by folding a load into its only use. Only do that when the load is safe to move, and it won't extend any live ranges. This fixes PR13414. llvm-svn: 160575	2012-07-20 21:29:31 +00:00
Jakob Stoklund Olesen	f62c07f147	Split loop exiting edges more aggressively. PHIElimination splits critical edges when it predicts it can resolve interference and eliminate copies. It doesn't split the edge if the interference wouldn't be resolved anyway because the phi-use register is live in the critical edge anyway. Teach PHIElimination to split loop exiting edges with interference, even if it wouldn't resolve the interference. This removes the necessary copies from the loop, which is still an improvement from injecting the copies into the loop. The test case demonstrates the improvement. Before: LBB0_1: cmpb $0, (%rdx) leaq 1(%rdx), %rdx movl %esi, %eax je LBB0_1 After: LBB0_1: cmpb $0, (%rdx) leaq 1(%rdx), %rdx je LBB0_1 movl %esi, %eax llvm-svn: 160571	2012-07-20 20:49:53 +00:00
Pete Cooper	dcf94db677	Fix crash in machine verifier when trying to print the def of a register which has no def llvm-svn: 160531	2012-07-19 23:40:38 +00:00
Benjamin Kramer	f364a63c3e	Replace some explicit compare loops with std::equal. No functionality change. llvm-svn: 160501	2012-07-19 10:46:05 +00:00
Galina Kistanova	aaf9735951	Fixed few warnings. llvm-svn: 160493	2012-07-19 04:50:12 +00:00
Bill Wendling	d163405df8	Remove tabs. llvm-svn: 160475	2012-07-19 00:04:14 +00:00
Chandler Carruth	985454e0ac	Fix a somewhat nasty crasher in PR13378. This crashes inside of LiveIntervals due to the two-addr pass generating bogus MI code. The crux of the issue was a loop nesting problem. The intent of the code which attempts to transform instructions before converting them to two-addr form is to defer and reprocess any transformed instructions as the second processing is likely to have more opportunities to coalesce copies, etc. Unfortunately, there was one section of processing that was not deferred -- the INSERT_SUBREG rewriting. Due to quirks of how this rewriting proceeded, not only did it occur early, it removed the bits of information needed for the deferred processing to correctly generate the necessary two address form (specifically inserting a copy), but didn't trigger any immediate assertions and produced what appeared to be already valid two-address from code. Thus, the assertion only fired much later in the pipeline. The fix is to hoist the transformation logic up layer to where it can more firmly defer all further processing, and to teach the normal processing to handle an edge case previously handled as part of the transformation logic. This edge case (already matched tied register operands) needs to not defer any steps. As has been brought up repeatedly in the process: wow does this code need refactoring. I may squeeze in some time to at least bring sanity to this loop... but wow... =] Thanks to Jakob for helpful hints on the way here, and the review. llvm-svn: 160443	2012-07-18 18:58:22 +00:00

1 2 3 4 5 ...

13949 Commits