Also fix an off-by-one in SelectionDAGBuilder that was preventing shuffle
vectors from being translated to EXTRACT_SUBVECTOR.
Patch by Tim Northover.
The test changes are needed to keep those spill-q tests from testing aligned
spills and restores. If the only aligned stack objects are spill slots, we
no longer realign the stack frame. Prior to this patch, an EXTRACT_SUBVECTOR
was legalized by loading from the stack, which created an aligned frame index.
Now, however, there is nothing except the spill slot in the stack frame, so
I added an aligned alloca.
llvm-svn: 122995
We were never generating any of these nodes with variable indices, and there
was one legalizer function asserting on a non-constant index. If we ever have
a need to support variable indices, we can add this back again.
llvm-svn: 122993
This pass precomputes CFG block frequency information that can be used by the
register allocator to find optimal spill code placement.
Given an interference pattern, placeSpills() will compute which basic blocks
should have the current variable enter or exit in a register, and which blocks
prefer the stack.
The algorithm is ready to consume block frequencies from profiling data, but for
now it gets by with the static estimates used for spill weights.
This is a work in progress and still not hooked up to RegAllocGreedy.
llvm-svn: 122938
up the FreeBSD bootloader. However, this doesn't make much sense for Darwin, whose
-Os is meant to optimize for size only if it doesn't hurt performance.
rdar://8821501
llvm-svn: 122936
The analysis will be needed by both the greedy register allocator and the
X86FloatingPoint pass. It only needs to be computed once when the CFG doesn't
change.
This pass is very fast, usually showing up as 0.0% wall time.
llvm-svn: 122832
This allows us to compile:
void test(char *s, int a) {
__builtin_memset(s, a, 15);
}
into 1 mul + 3 stores instead of 3 muls + 3 stores.
llvm-svn: 122710
We could implement a DAGCombine to turn x * 0x0101 back into logic operations
on targets that don't support the multiply, or where it is slow (e.g. the Pentium 4),
if someone cares enough.
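If that combine is ever added, the logic-op form it would recover looks roughly like the
sketch below (C++ for illustration only; the actual DAG lowering is target dependent):

// Splat a byte into all four bytes of a 32-bit value without a multiply.
unsigned splatByte(unsigned char C) {
  unsigned V = C;
  V |= V << 8;   // 0x0000XYXY
  V |= V << 16;  // 0xXYXYXYXY, equivalent to C * 0x01010101
  return V;
}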
Example code:
void test(char *s, int a) {
__builtin_memset(s, a, 4);
}
before:
_test: ## @test
movzbl 8(%esp), %eax
movl %eax, %ecx
shll $8, %ecx
orl %eax, %ecx
movl %ecx, %eax
shll $16, %eax
orl %ecx, %eax
movl 4(%esp), %ecx
movl %eax, 4(%ecx)
movl %eax, (%ecx)
ret
after:
_test: ## @test
movzbl 8(%esp), %eax
imull $16843009, %eax, %eax ## imm = 0x1010101
movl 4(%esp), %ecx
movl %eax, 4(%ecx)
movl %eax, (%ecx)
ret
llvm-svn: 122707
when running without the verifier, and I have not yet checked them to see if
the new results are still correct. There are more verifier failures, but they
all seem to be additional occurrences of verifier failures that occur with the
existing PHIElimination pass. There are a few obvious issues with the code:
1) It doesn't properly update the register equivalence classes during copy
insertion, and instead recomputes them before merging live intervals and
renaming registers. I wanted to keep this first patch simple for debugging
purposes, but it shouldn't be very hard to do this.
2) It doesn't mix the renaming and live interval merging with the copy insertion
process, which leads to a lot of virtual register churn. Virtual registers and
live intervals are created, only to later be merged into others. The code should
be smarter and only create a new virtual register if there is no existing
register in the same congruence class.
3) In one place the code uses a DenseMap per basic block, which is unnecessary
heap allocation. There should be an inline storage version of DenseMap.
I did a quick compile-time test of running llc on 403.gcc with and without
StrongPHIElimination. It is slightly slower with StrongPHIElimination, because
the small decrease in the coalescer runtime can't beat the increase in phi
elimination runtime. Perhaps fixing the above performance issues will narrow
the gap.
I also haven't yet run any tests of the quality of the generated code.
llvm-svn: 122582
valno verification. The "Different value live out of predecessor" check is
incorrect in the case of phi-def valnos, so just skip that check for phi-def
valnos and instead check that all of the valnos for predecessors have phi-kill.
Fixes PR8863.
llvm-svn: 122581
DAG scheduling during isel. Most new functionality is currently
guarded by -enable-sched-cycles and -enable-sched-hazard.
Added InstrItineraryData::IssueWidth field, currently derived from
ARM itineraries, but could be initialized differently on other targets.
Added ScheduleHazardRecognizer::MaxLookAhead to indicate whether it is
active, and if so how many cycles of state it holds.
Added SchedulingPriorityQueue::HasReadyFilter to allow gating entry
into the scheduler's available queue.
ScoreboardHazardRecognizer now accesses the ScheduleDAG in order to
get information about its SUnits, provides RecedeCycle for bottom-up
scheduling, correctly computes scoreboard depth, tracks IssueCount, and
considers potential stall cycles when checking for hazards.
ScheduleDAGRRList now models machine cycles and hazards (under
flags). It tracks MinAvailableCycle, drives the hazard recognizer and
priority queue's ready filter, manages a new PendingQueue, properly
accounts for stall cycles, etc.
llvm-svn: 122541
In the bottom-up selection DAG scheduling, handle two-address
instructions that read/write unspillable registers. Treat
the entire chain of two-address nodes as a single live range.
llvm-svn: 122472
loads properly. We miscompiled the testcase into:
_test: ## @test
movl $128, (%rdi)
movzbl 1(%rdi), %eax
ret
Now we get a proper:
_test: ## @test
movl $128, (%rdi)
movsbl (%rdi), %eax
movzbl %ah, %eax
ret
This fixes PR8757.
llvm-svn: 122392
count operand. These should be the same but apparently are
not always, and this is cleaner anyway. This improves the
code in an existing test.
llvm-svn: 122354
of the problems with my last attempt were in the updating of LiveIntervals
rather than the coalescing itself. Therefore, I decided to get that right first
by essentially reimplementing the existing PHIElimination using LiveIntervals.
It works correctly, with only a few tests failing (which may not be legitimate
failures) and no new verifier failures (at least as far as I can tell, I didn't
count the number per file).
llvm-svn: 122321
Edge bundles are an annotation on the CFG that turns it into a bipartite directed
graph where each basic block is connected to an outgoing and an ingoing bundle.
These bundles are useful for identifying regions of the CFG for live range
splitting.
llvm-svn: 122301
ARM (and other 32-bit-only) targets support for i8 and i16 overflow
multiplies. The generated code isn't great, but this at least fixes
CodeGen/Generic/overflow.ll when running on ARM hosts.
llvm-svn: 122221
Imagine we see:
EFLAGS = inst1
EFLAGS = inst2 FLAGS
gpr = inst3 EFLAGS
Previously, we would refuse to schedule inst2 because it clobbers
the EFLAGS of the predecessor. However, it also uses the EFLAGS
of the predecessor, so it is safe to emit. SDep edges ensure that
the right order happens already anyway.
This fixes 2 testsuite crashes with the X86 patch I'm going to
commit next.
llvm-svn: 122211
alternative register allocator that does not require LiveIntervals by specifying
it on the command-line for a target that has StrongPHIElimination enabled by
default.
These checks are pretty meaningless anyway, since StrongPHIElimination and
PHIElimination are never used at the same time.
llvm-svn: 122176
use before rematerializing the load.
This allows us to produce:
addps LCPI0_1(%rip), %xmm2
Instead of:
movaps LCPI0_1(%rip), %xmm3
addps %xmm3, %xmm2
Saving a register and an instruction. The standard spiller already knows how to
do this.
llvm-svn: 122133
the loop predecessors.
The register can be live-out from a predecessor without being live-in to the
loop header if there is a critical edge from the predecessor.
llvm-svn: 122123
createMachineVerifierPass and MachineFunction::verify.
The banner is printed before the machine code dump, just like the printer pass.
llvm-svn: 122113
may be called. If the entry block is empty, the insertion point iterator will be
the "end()" value. Calling ->getParent() on it (among others) causes problems.
Modify materializeFrameBaseRegister to take the machine basic block and insert
the frame base register at the beginning of that block. (It's very similar to
what the code does already. The only difference is that it will always insert
at the beginning of the entry block instead of after a previous materialization
of the frame base register. I doubt that that matters here.)
<rdar://problem/8782198>
llvm-svn: 122104
BUILD_VECTOR operands where the element type is not legal. I had previously
changed this code to insert TRUNCATE operations, but that was just wrong.
llvm-svn: 122102
the operand uses the same register as a tied operand:
%r1 = add %r1, %r1
If add were a three-address instruction, kill flags would be required on at
least one of the uses. Since it is a two-address instruction, the tied use
operand must not have a kill flag.
This change makes the kill flag on the untied use operand optional.
llvm-svn: 122082
This is a three-way interval list intersection between a virtual register, a
live interval union, and a loop. It will be used to identify interference-free
loops for live range splitting.
llvm-svn: 122034
A MachineLoopRange contains the intervals of slot indexes covered by the blocks
in a loop. This representation of the loop blocks is more efficient to compare
against interfering registers during register coalescing.
llvm-svn: 121917
Bypass loops have the current live range live through, but contain no uses or
defs. Splitting around a bypass loop can free registers for other uses inside
the loop by spilling the split range.
llvm-svn: 121871
regB = move RCX
regA = op regB, regC
RAX = move regA
where both regB and regC are killed. If regB is constrained to non-compatible
physical registers but regC is not constrained at all, then it's better to
commute the instruction.
movl %edi, %eax
shlq $32, %rcx
leaq (%rcx,%rax), %rax
=>
movl %edi, %eax
shlq $32, %rcx
orq %rcx, %rax
rdar://8762995
llvm-svn: 121793
when the wider type is legal. This allows us to compile:
define zeroext i16 @test1(i16 zeroext %x) nounwind {
entry:
%div = udiv i16 %x, 33
ret i16 %div
}
into:
test1: # @test1
movzwl 4(%esp), %eax
imull $63551, %eax, %eax # imm = 0xF83F
shrl $21, %eax
ret
instead of:
test1: # @test1
movw $-1985, %ax # imm = 0xFFFFFFFFFFFFF83F
mulw 4(%esp)
andl $65504, %edx # imm = 0xFFE0
movl %edx, %eax
shrl $5, %eax
ret
Implementing rdar://8760399 and example #4 from:
http://blog.regehr.org/archives/320
We should implement the same thing for [su]mul_hilo, but I don't
have immediate plans to do this.
llvm-svn: 121696
for each constant pool entry. Using WriteTypeSymbolic here takes time
proportional to the size of the module, for each constant pool entry.
This speeds up -verbose-asm llc on 252.eon (a random testcase at my disposal)
from 4.4s to 2.137s. llc takes 2.11s with asm-verbose off, so this is now a
pretty reasonable cost for verbose comments.
llvm-svn: 121691
The spiller should only spill. The register allocator will drive live range
splitting, it has the needed information about register pressure and
interferences.
llvm-svn: 121590
registers for a given virtual register.
Reserved registers are filtered from the allocation order, and any valid hint is
returned as the first suggestion.
For target dependent hints, a number of arcane target hooks are invoked.
llvm-svn: 121497
references instead.
Similarly, IntervalMap::begin() is almost as expensive as find(), so use find(x)
instead of begin().advanceTo(x);
This makes RegAllocBasic run another 5% faster.
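A standalone sketch of the preferred usage, with unsigned keys for simplicity (insert,
find, begin, and advanceTo are the IntervalMap operations referred to above):

#include "llvm/ADT/IntervalMap.h"
using namespace llvm;

void example() {
  IntervalMap<unsigned, unsigned>::Allocator Alloc;
  IntervalMap<unsigned, unsigned> Map(Alloc);
  Map.insert(10, 20, 1);  // map the closed interval [10, 20] to value 1

  // Preferred: position the iterator in one step.
  IntervalMap<unsigned, unsigned>::iterator I = Map.find(15);

  // Slower equivalent: begin() does almost as much work as find().
  IntervalMap<unsigned, unsigned>::iterator J = Map.begin();
  J.advanceTo(15);
  (void)I; (void)J;
}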
llvm-svn: 121344
abstract priority queue interface in subclasses that want to override the
priority calculations.
Subclasses must provide a getPriority() implementation instead.
This approach requires less code as long as priorities are expressible as simple
floats, and it avoids the dangers of defining potentially expensive priority
comparison functions.
It also should speed up priority_queue operations since they no longer have to
chase pointers when comparing registers. This is not measurable, though.
Preferably, we shouldn't use floats to guide code generation. The use of floats
here is derived from the use of floats for spill weights. Spill weights have a
dynamic range that doesn't lend itself easily to a fixed-point implementation.
When someone invents a stable spill weight representation, it can be reused for
allocation priorities.
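A rough sketch of the pattern (the class and method names here are illustrative, not the
actual RegAllocBase interface):

#include <queue>
#include <utility>

class LiveInterval;

class PriorityAllocatorBase {
  // The queue stores (priority, interval) pairs, so comparisons only look at
  // floats and never chase pointers into the intervals.
  std::priority_queue<std::pair<float, LiveInterval*> > Queue;
public:
  virtual ~PriorityAllocatorBase() {}
  // Subclasses only say how important an interval is.
  virtual float getPriority(LiveInterval *LI) = 0;
  void enqueue(LiveInterval *LI) {
    Queue.push(std::make_pair(getPriority(LI), LI));
  }
  LiveInterval *dequeueHighest() {
    if (Queue.empty()) return 0;
    LiveInterval *LI = Queue.top().second;
    Queue.pop();
    return LI;
  }
};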
llvm-svn: 121294
both forward and backward scheduling. Rename it to
ScoreboardHazardRecognizer (Scoreboard is one word). Remove integer
division from the scoreboard's critical path.
llvm-svn: 121274
This new register allocator is initially identical to RegAllocBasic, but it will
receive all of the tricks that RegAllocBasic won't get.
RegAllocGreedy will eventually replace linear scan.
llvm-svn: 121234
zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method
trunc(), to be const and to return a new value instead of modifying the
object in place.
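For example, with this change truncation produces a new value and leaves the original
untouched (a small illustration, not code from the patch):

#include "llvm/ADT/APInt.h"
using namespace llvm;

void example() {
  APInt Wide(32, 0x12345678);
  APInt Narrow = Wide.trunc(16);       // Narrow is a new 16-bit value 0x5678
  APInt Back = Narrow.zextOrTrunc(32); // Wide itself is never modified
  (void)Back;
}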
llvm-svn: 121120
The StrongPHIElimination pass did not work, and nobody has worked on it for two
years.
A rewrite is underway, so I am leaving this shell pass instead of deleting it
completely.
llvm-svn: 120830
Scan the MachineFunction for DBG_VALUE instructions, and replace them with a
data structure similar to LiveIntervals. The live range of a DBG_VALUE is
determined by propagating it down the dominator tree until a new DBG_VALUE is
found. When a DBG_VALUE lives in a register, its live range is confined to the
live range of the register's value.
LiveDebugVariables runs before coalescing, so DBG_VALUEs are not artificially
extended when registers are joined.
The missing half will recreate DBG_VALUE instructions from the intervals when
register allocation is complete.
The pass is disabled by default. It can be enabled with the temporary command
line option -live-debug-variables.
llvm-svn: 120636
legalization time. Since at legalization time there is no mapping from
SDNode back to the corresponding LLVM instruction and the return
SDNode is target specific, this requires a target hook to check for
eligibility. Only x86 and ARM support this form of sibcall optimization
right now.
rdar://8707777
llvm-svn: 120501
in favor of the widespread llvm style. Capitalize variables and add
newlines for visual parsing. Rename variables for readability.
And other cleanup.
llvm-svn: 120490
This analysis is going to run immediately after LiveIntervals. It will stay
alive during register allocation and keep track of user variables mentioned in
DBG_VALUE instructions.
When the register allocator is moving values between registers and the stack, it
is very hard to keep track of DBG_VALUE instructions. We usually get it wrong.
This analysis maintains a data structure that makes it easy to update DBG_VALUE
instructions.
llvm-svn: 120385
so don't claim they are. They are allocated using DAG.getNode, so attempts
to access MemSDNode fields result in reading off the end of the allocated
memory. This fixes crashes with "llc -debug" due to debug code trying to
print MemSDNode fields for these barrier nodes (since the crashes are not
deterministic, use valgrind to see this). Add some nasty checking to try
to catch this kind of thing in the future.
llvm-svn: 119901
MCStreamer instead of just MCObjectStreamer. Address changes cannot
be as efficient, since we have to use DW_LNE_set_address, but at least
most of the logic is shared.
This will be used so that, with CodeGen still using EmitDwarfLocDirective,
llvm-gcc is able to produce debug_line sections without needing an
assembler that supports .loc.
llvm-svn: 119777
if the extension types were not the same. The result was that if you
fed a select with sext and zext loads, as in the testcase, then it
would get turned into a zext (or sext) of the select, which is wrong
in the cases when it should have been an sext (resp. zext). Reported
and diagnosed by Sebastien Deldon.
llvm-svn: 119728
and testing is easier. A good example is the unknown-location.ll test that
now can just look for ".loc 1 0 0". We also don't use a DW_LNE_set_address for
every address change anymore.
llvm-svn: 119613
and xor. The 32-bit move immediates can be hoisted out of loops by machine
LICM but the isel hacks were preventing them.
Instead, let the peephole optimization pass recognize registers that are defined by
immediates and the ARM target hook will fold the immediates in.
Other changes include 1) do not fold and / xor into cmp to isel TST / TEQ
instructions if there are multiple uses. This happens when the 'and' is live
out; machine sinking would have sunk the computation, and that ends up pessimizing
code. The peephole pass would recognize situations where the 'and' can be
toggled to define CPSR and eliminate the comparison anyway.
2) Move the peephole pass to after machine LICM, sinking, and CSE to avoid blocking
important optimizations.
rdar://8663787, rdar://8241368
llvm-svn: 119548
SrcMgrDiagHandler, we can improve clang diagnostics for inline asm:
instead of reporting them on a source line of the original line,
we can report it on the correct line wherever the string literal came
from. For something like this:
void foo() {
asm("push %rax\n"
".code32\n");
}
we used to get this: (note that the line in t.c isn't helpful)
t.c:4:7: error: warning: ignoring directive for now
asm("push %rax\n"
^
<inline asm>:2:1: note: instantiated into assembly here
.code32
^
now we get:
t.c:5:8: error: warning: ignoring directive for now
".code32\n"
^
<inline asm>:2:1: note: instantiated into assembly here
.code32
^
Note that we're pointing to line 5 properly now.
llvm-svn: 119488
cookie argument to the SourceMgr diagnostic stuff. This cleanly separates
LLVMContext's inlineasm handler from the sourcemgr error handling
definition, increasing type safety and cleaning things up.
llvm-svn: 119486
Always spill the full representative register at any point where any subregister
is live.
This fixes PR8620 which caused the old logic to get confused and not spill
anything at all.
The fundamental problem here is that the coalescer is too aggressive about
physical register coalescing. It sometimes makes it impossible to allocate
registers without these emergency spills.
llvm-svn: 119375
The live range of a register defined by an early clobber starts at the use slot,
not the def slot.
Except when it is an early clobber tied to a use operand. Then it starts at the
def slot like a standard def.
llvm-svn: 119305
live ranges for the spill register are also defined at the use slot instead of
the normal def slot.
This fixes PR8612 for the inline spiller. A use was being allocated to the same
register as a spilled early clobber def.
This problem exists in all the spillers. A fix for the standard spiller is
forthcoming.
llvm-svn: 119182
catastrophic compilation time in the event of unreasonable LLVM
IR. Code quality is a separate issue--someone upstream needs to do a
better job of reducing to llvm.memcpy. If the situation can be reproduced with
any supported frontend, then it will be a separate bug.
llvm-svn: 118904
it makes no sense for allocation_order iterators to visit reserved regs.
The inline spiller depends on AliasAnalysis.
Manage the Query state to avoid uninitialized or stale results.
llvm-svn: 118800
This is the first small step towards using closed intervals for liveness instead
of the half-open intervals we're using now.
We want to be able to distinguish between a SlotIndex that represents a variable
being live-out of a basic block, and an index representing a variable live-in to
its successor.
That requires two separate indexes between blocks. One for live-outs and one for
live-ins.
With this change, getMBBEndIdx(MBB).getPrevSlot() becomes stable so it stays
greater than any instructions inserted at the end of MBB.
llvm-svn: 118747
Whenever splitting wants to insert a copy, it checks if the value can be
rematerialized cheaply instead.
Missing features:
- Delete instructions when all uses have been rematerialized.
- Truncate live ranges to the remaining uses after rematerialization.
llvm-svn: 118702
benchmarks hitting an assertion.
Adds LiveIntervalUnion::collectInterferingVRegs.
Fixes "late spilling" by checking for any unspillable live vregs among
all physReg aliases.
llvm-svn: 118701
handle cases in which a register is unavailable for spill code.
Adds LiveIntervalUnion::extract. While processing interferences on a
live virtual register, reuses the same Query object for each
physical reg.
llvm-svn: 118423
to perform the copy, which may involve a lot of memory [*]. It would be good if the
fall-back code generated something reasonable, i.e. did the copy in a loop rather than
emitting vast numbers of loads and stores. Add a note about this. Currently, target
specific code seems to always kick in, so this is more of a theoretical issue than a
practical one now that X86 has been fixed.
[*] It's amazing how often people pass mega-byte long arrays by copy...
llvm-svn: 118275
This way, InlineSpiller does the same amount of splitting as the standard
spiller. Splitting should really be guided by the register allocator, and
doesn't belong in the spiller at all.
llvm-svn: 118216
value type, so there is no point in passing it around using
an EVT. Use the simpler MVT everywhere. Rather than trying
to propagate this information maximally in all the code that
uses the calling convention stuff, I chose to do a mainly
low impact change instead.
llvm-svn: 118167
1. Fix pre-ra scheduler so it doesn't try to push instructions above calls to
"optimize for latency". Call instructions don't have the right latency and
this is more likely to introduce spills.
2. Fix if-converter cost function. For ARM, it should use instruction latencies,
not # of micro-ops, since multi-latency instructions are completely executed
even when the predicate is false. Also, some instructions will be "slower"
when they are predicated due to the register def becoming implicit input.
rdar://8598427
llvm-svn: 118135
BB#1: derived from LLVM BB %bb.nph28
Live Ins: %AL
Predecessors according to CFG: BB#0
TEST8rr %reg16384<kill>, %reg16384, %EFLAGS<imp-def>; GR8:%reg16384
JNE_4 <BB#2>, %EFLAGS<imp-use,kill>
JMP_4 <BB#2>
Successors according to CFG: BB#2 BB#2
These double CFG edges only ever occur in bugpoint-generated code, so there is
no need to attempt something clever.
llvm-svn: 117992
source, and let rewrite() clean it up.
This way, kill flags on the inserted copies are fixed as well during rewrite().
We can't just assume that all the copies we insert are going to be kills since
critical edges into loop headers sometimes require both source and dest to be
live out of a block.
llvm-svn: 117980
At least X86FloatingPoint requires correct kill flags after register allocation,
and targets using register scavenging benefit. Conservative kill flags are not
enough.
llvm-svn: 117960
at more than those which define CPSR. You can have this situation:
(1) subs ...
(2) sub r6, r5, r4
(3) movge ...
(4) cmp r6, 0
(5) movge ...
We cannot convert (2) to "subs" because (3) is using the CPSR set by
(1). There's an analogous situation here:
(1) sub r1, r2, r3
(2) sub r4, r5, r6
(3) cmp r4, ...
(5) movge ...
(6) cmp r1, ...
(7) movge ...
We cannot convert (1) to "subs" because of the intervening use of CPSR.
llvm-svn: 117950
looks like is happening:
Without the peephole optimizer:
(1) sub r6, r6, #32
orr r12, r12, lr, lsl r9
orr r2, r2, r3, lsl r10
(x) cmp r6, #0
ldr r9, LCPI2_10
ldr r10, LCPI2_11
(2) sub r8, r8, #32
(a) movge r12, lr, lsr r6
(y) cmp r8, #0
LPC2_10:
ldr lr, [pc, r10]
(b) movge r2, r3, lsr r8
With the peephole optimizer:
ldr r9, LCPI2_10
ldr r10, LCPI2_11
(1*) subs r6, r6, #32
(2*) subs r8, r8, #32
(a*) movge r12, lr, lsr r6
(b*) movge r2, r3, lsr r8
(1) is used by (x) for the conditional move at (a). (2) is used by (y) for the
conditional move at (b). After the peephole optimizer, the flags resulting
from (1*) are ignored and only the flags from (2*) are considered for both
conditional moves.
llvm-svn: 117876
operand and one of them has a single use that is a live out copy, favor the
one that is live out. Otherwise it will be difficult to eliminate the copy
if the instruction is a loop induction variable update. e.g.
BB:
sub r1, r3, #1
str r0, [r2, r3]
mov r3, r1
cmp
bne BB
=>
BB:
str r0, [r2, r3]
sub r3, r3, #1
cmp
bne BB
This fixed the recent 256.bzip2 regression.
llvm-svn: 117675
We don't want unused values forming their own equivalence classes, so we lump
them all together in one class, and then merge them with the class of the last
used value.
llvm-svn: 117670
in SSAUpdaterImpl.h
Verifying live intervals revealed that the old method was completely wrong, and
we need an iterative approach to calculating PHI placement. Fortunately, we have
MachineDominators available, so we don't have to compute that over and over
like SSAUpdaterImpl.h must.
Live-out values are cached between calls to mapValue() and computed in a greedy
way, so most calls will be working with very small block sets.
Thanks to Bob for explaining how this should work.
llvm-svn: 117599
proper SSA updating.
This doesn't cause MachineDominators to be recomputed since we are already
requiring MachineLoopInfo which uses dominators as well.
llvm-svn: 117598
There are currently 100 references to COFF::IMAGE_SCN in 6 files
and 11 different functions. Section to attribute mapping really
needs to happen in one place to avoid problems like this.
llvm-svn: 117473
Critical edges going into a loop are not as bad as critical exits. We can handle
them by splitting the critical edge, or by having both inside and outside
registers live out of the predecessor.
llvm-svn: 117423
the remainder register.
Example:
bb0:
x = 1
bb1:
use(x)
...
x = 2
jump bb1
When x is isolated in bb1, the inner part breaks into two components, x1 and x2:
bb0:
x0 = 1
bb1:
x1 = x0
use(x1)
...
x2 = 2
x0 = x2
jump bb1
llvm-svn: 117408
do not double-count the duplicate instructions by counting once from the
beginning and again from the end. Keep track of where the duplicates from
the beginning ended and don't go past that point when counting duplicates
at the end. Radar 8589805.
This change causes one of the MC/ARM/simple-fp-encoding tests to produce
different (better!) code without the vmovne instruction being tested.
I changed the test to produce vmovne and vmoveq instructions but moving
between register files in the opposite direction. That's not quite the same
but predicated versions of those instructions weren't being tested before,
so at least the test coverage is not any worse, just different.
llvm-svn: 117333
instructions separately from the count of non-predicated instructions. The
instruction count is used in places to determine how many instructions to
copy, predicate, etc. and things get confused if that count includes the
extra cost for microcoded ops.
llvm-svn: 117332
2) live-outs.
Previously the post-RA schedulers completely ignored these dependencies since
returns, branches, etc. are all scheduling barriers. This patch models the
latencies between instructions being scheduled and the barriers. It also
handles calls by marking their register uses.
llvm-svn: 117193
framework. Its purpose is not to improve register allocation per se,
but to make it easier to develop powerful live range splitting. I call
it the basic allocator because it is as simple as a global allocator
can be but provides the building blocks for sophisticated register
allocation with live range splitting.
A minimal implementation is provided that trivially spills whenever it
runs out of registers. I'm checking in now to get high-level design
and style feedback. I've only done minimal testing. The next step is
implementing a "greedy" allocation algorithm that does some register
reassignment and makes better splitting decisions.
llvm-svn: 117174
When a block has exactly two uses and the register is both live-in and live-out,
don't isolate the block. We would be inserting two copies, so we haven't really
made any progress.
If the live-in and live-out values separate into disconnected components after
splitting, we would be making progress. We can't detect that for now.
llvm-svn: 117169
An exit block with a critical edge must only have predecessors in the loop, or
just before the loop. This guarantees that the inserted copies in the loop
predecessors dominate the exit block.
llvm-svn: 117144
- Initial register pressure in the loop should be all the live defs into the
loop, not just those from the loop preheader, which is often empty.
- When an instruction is hoisted, update register pressure from loop preheader
to the original BB.
- Treat the only use of a virtual register as a kill, since the code is still in SSA form.
llvm-svn: 116956
operand, also check if subregisters are killed.
Add <imp-def> operands for subregisters that remain alive after a super register
is killed.
I don't have a testcase for this that reproduces on trunk. <rdar://problem/8441758>
llvm-svn: 116940
Pull an unsigned out of the Contents union such that it has the same size as two
pointers and no padding.
Arrange members such that the Contents union and all pointers can be 8-byte
aligned without padding.
This speeds up code generation by 0.8% on a 64-bit host. 32-bit hosts should be
unaffected.
llvm-svn: 116857
must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize
the pass's dependencies.
Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the
CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h
before parsing commandline arguments.
I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems
with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass
registration/creation, please send the testcase to me directly.
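As a sketch, the new pattern for a hypothetical pass looks like this (the pass name is
illustrative; the macro and the generated initialize function follow the usual naming
scheme):

INITIALIZE_PASS(MyMachinePass, "my-machine-pass", "My Machine Pass",
                false, false)

MyMachinePass::MyMachinePass() : MachineFunctionPass(ID) {
  // Must run in the constructor; it also recursively initializes the
  // statically declared dependencies of the pass.
  initializeMyMachinePassPass(*PassRegistry::getPassRegistry());
}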
llvm-svn: 116820
"long latency" enough to hoist even if it may increase spilling. Reloading
a value from a spill slot is often cheaper than performing an expensive
computation in the loop. For X86, that means machine LICM will hoist
SQRT, DIV, etc. ARM will be somewhat aggressive with VFP and NEON
instructions.
- Enable register pressure aware machine LICM by default.
llvm-svn: 116781
does normal initialization and normal chaining. Change the default
AliasAnalysis implementation to NoAlias.
Update StandardCompileOpts.h and friends to explicitly request
BasicAliasAnalysis.
Update tests to explicitly request -basicaa.
llvm-svn: 116720
All registers created during splitting or spilling are assigned to the same
stack slot as the parent register.
When splitting or rematting, we may not spill at all. In that case the stack
slot is still assigned, but it will be dead.
llvm-svn: 116546
splitting or spilling, and to help with rematerialization.
Use LiveRangeEdit in InlineSpiller and SplitKit. This will eventually make it
possible to share remat code between InlineSpiller and SplitKit.
llvm-svn: 116543
Before we would also split around a loop if any peripheral block had multiple
uses. This could cause repeated splitting when splitting a different live range
would insert uses into the periphery.
Now -spiller=inline passes the nightly test suite again.
llvm-svn: 116494
perform initialization without static constructors AND without explicit initialization
by the client. For the moment, passes are required to initialize both their
(potential) dependencies and any passes they preserve. I hope to be able to relax
the latter requirement in the future.
llvm-svn: 116334
LocalRewriter.
This is a bit of a hack that adds an implicit use operand to model the
read-modify-write nature of a partial redef. Uses and defs are rewritten in
separate passes, and a single operand would never be processed twice.
<rdar://problem/8518892>
llvm-svn: 116210
functions: computeRemainder and rewrite.
When the remainder breaks up into multiple components, remember to rewrite those
uses as well.
llvm-svn: 116121
Such a check does not make any sense in the presence of inlining and other compiler-dependent stuff.
This should fix a bunch of warnings on mingw32.
llvm-svn: 116113
implicit. e.g.
%D6<def>, %D7<def> = VLD1q16 %R2<kill>, 0, ..., %Q3<imp-def>
%Q1<def> = VMULv8i16 %Q1<kill>, %Q3<kill>, ...
The real definition indices are 0,1.
llvm-svn: 116080
connected components. These components should be allocated different virtual
registers because there is no reason for them to be allocated together.
Add the ConnectedVNInfoEqClasses class to calculate the connected components,
and move values to new LiveIntervals.
Use it from SplitKit::rewrite by creating new virtual registers for the
components.
llvm-svn: 116006
This function is intended to be used when inserting a machine instruction that
trivially restricts the legal registers, like LEA requiring a GR32_NOSP
argument.
llvm-svn: 115875
allow targets to correctly compute latency for cases where the static scheduling
itinerary isn't sufficient, e.g. variable_ops instructions such as
ARM::ldm.
This also allows targets without scheduling itineraries to compute operand
latencies. e.g. X86 can return (approximated) latencies for high latency
instructions such as division.
- Compute operand latencies for those defined by load multiple instructions,
e.g. ldm and those used by store multiple instructions, e.g. stm.
llvm-svn: 115755
never kept after splitting.
Keeping the original interval made sense when the split region doesn't modify
the register, and the original is spilled. We can get the same effect by
detecting reloaded values when spilling around copies.
llvm-svn: 115695
Insert copy after defining instruction.
Fix LiveIntervalMap::extendTo to properly handle live segments starting before
the current basic block.
Make sure the open live range is extended to the inserted copy's use slot.
llvm-svn: 115665
having to do a double cast (uint64_t --> double --> float). This is based on the algorithm from compiler_rt's __floatundisf
for X86-64.
llvm-svn: 115634
// %a = ...
// %b = and i32 %a, 2
// %c = srl i32 %b, 1
// brcond i32 %c ...
//
// into
//
// %a = ...
// %b = and i32 %a, 2
// %c = setcc eq %b, 0
// brcond %c ...
Make sure it restores local variable N1, which corresponds to the condition operand if it fails to match.
This apparently breaks TCE but since that backend isn't in the tree I don't have a test for it.
llvm-svn: 115571
scheduling change in svn 115121. The CriticalAntiDepBreaker had bad
liveness information. It was calculating the KillIndices for one scheduling
region in a basic block, rescheduling that region so the KillIndices were
no longer valid, and then using those wrong KillIndices to make decisions
for the next scheduling region. I've not been able to reduce a small
testcase for this. Radar 8502534.
llvm-svn: 115400
LiveInterval::MergeValueNumberInto instead of trying to extend LiveRanges and
getting it wrong.
This fixed PR8249 where a valno with a multi-segment live range was defined by
an identity copy created by RemoveCopyByCommutingDef. Some of the live
segments disappeared.
llvm-svn: 115385
stick with a constant estimate of 90% (branch predictors are good!), but we might find that we want to provide
more nuanced estimates in the future.
llvm-svn: 115364
The x86_mmx type is used for MMX intrinsics, parameters and
return values where these use MMX registers, and is also
supported in load, store, and bitcast.
Only the above operations generate MMX instructions, and optimizations
do not operate on or produce MMX intrinsics.
MMX-sized vectors <2 x i32> etc. are lowered to XMM or split into
smaller pieces. Optimizations may occur on these forms and the
result cast back to x86_mmx, provided the result feeds into a
pre-existing x86_mmx operation.
The point of all this is to prevent optimizations from introducing
MMX operations, which is unsafe due to the EMMS problem.
llvm-svn: 115243
edited during emission.
If the basic block ends in a switch that gets lowered to a jump table, any
phis at the default edge were getting updated wrong. The jump table data
structure keeps a pointer to the header blocks that wasn't getting updated
after the MBB is split.
This bug was exposed on 32-bit Linux when disabling critical edge splitting in
codegen prepare.
The fix is to update stale MBB pointers whenever a block is split during
emission.
llvm-svn: 115191
Rather than having arbitrary cutoffs, actually try to cost model the conversion.
For now, the constants are tuned to more or less match our existing behavior, but these will be
changed to reflect realistic values as this work proceeds.
llvm-svn: 114973
when the unconditional branch destination is the fallthrough block. The
canonicalization makes it easier to allow optimizations on DAGs to invert
conditional branches. The branch folding pass (and AnalyzeBranch) will clean up
the unnecessary unconditional branches later.
This is one of the patches leading up to disabling codegen prepare critical edge
splitting.
llvm-svn: 114630
Allocator instances can now be created by calling createPBQPRegisterAllocator.
Tidied up use of CoalescerPair as per Jakob's suggestions.
Made the new PBQPBuilder based construction process the default. The internal construction process
remains in-place and available via -pbqp-builder=false for now. It will be removed shortly if the new
process doesn't cause any regressions.
llvm-svn: 114626
creating it before and subtracting split ranges.
This way, the SSA update code in LiveIntervalMap can properly create and use new
phi values in dupli. Now it is possible to create split regions where a value
escapes along two different CFG edges, creating phi values outside the split
region.
This is a work in progress and probably quite broken.
llvm-svn: 114492
that complex patterns are matched after the entire pattern has
a structural match, therefore the NodeStack isn't in a useful
state when the actual call to the matcher happens.
llvm-svn: 114489
I think I've audited all uses, so it should be dependable for address spaces,
and the pointer+offset info should also be accurate when there.
llvm-svn: 114464
instead of calling lower_bound or upper_bound directly.
This cleans up the search logic a bit because {lower,upper}_bound compare
LR->start by default, and it is usually simpler to search LR->end.
Funnelling all searches through one function also makes it possible to replace
the search algorithm with something faster than binary search.
llvm-svn: 114448
into OptimizeCompareInstr.
This necessitates the passing of CmpValue around,
so widen the virtual functions to accommodate.
No functionality changes.
llvm-svn: 114428
"getFixedStack" on the MachinePointerInfo class. While
this isn't the problem I'm setting out to solve, it is the
right way to eliminate PseudoSourceValue, so let's go with it.
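A hedged sketch of what describing a fixed stack slot looks like with this change (the
surrounding getMachineMemOperand call and its exact parameters are assumptions for
illustration, not part of this patch):

// FI is a fixed frame index, e.g. a spill slot.
MachineMemOperand *MMO =
  MF.getMachineMemOperand(MachinePointerInfo::getFixedStack(FI),
                          MachineMemOperand::MOLoad, Size, Alignment);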
llvm-svn: 114406
MachinePointerInfo struct, no functionality change.
This also adds an assert to MachineMemOperand::MachineMemOperand
that verifies that the Value* is either null or is an IR pointer type.
llvm-svn: 114389
CombinerAA cannot assume that different FrameIndexes never alias, but can instead use
MachineFrameInfo to get the actual offsets of these slots and check for actual aliasing.
This fixes CodeGen/X86/2010-02-19-TailCallRetAddrBug.ll and CodeGen/X86/tailcallstack64.ll
when CombinerAA is enabled, modulo a different register allocation sequence.
llvm-svn: 114348
For now the allocator still uses the old (internal) construction mechanism by default. This will be phased out soon assuming
no issues with the builder system come up.
To invoke the new construction mechanism just pass '-regalloc=pbqp -pbqp-builder' to llc. To provide custom constraints a
Target just needs to extend PBQPBuilder and pass an instance of their derived builder to the RegAllocPBQP constructor.
llvm-svn: 114272
NO path to the destination containing side effects, not that SOME path contains no side effects.
In practice, this only manifests with CombinerAA enabled, because otherwise the chain has little
to no branching, so "any" is effectively equivalent to "all".
llvm-svn: 114268
1) Do forward copy propagation. This makes it easier to estimate the cost of the
instruction being sunk.
2) Break critical edges on demand, including cases where the value is used by
PHI nodes.
Critical edge splitting is not yet enabled by default.
llvm-svn: 114227
great deal because we don't have to worry about maintaining SSA form.
Unconditionally copy back to dupli when the register is live out of the split
range, even if the live-out value was defined outside the range. Skipping the
back-copy only makes sense when the live range is going to spill outside the
split range, and we don't know that it will. Besides, this was a hack to avoid
SSA update issues.
Clear up some confusion about the end point of a half-open LiveRange. Methinks
LiveRanges need to be closed so both start and end are included in the range.
The low bits of a SlotIndex are symbolic, so a half-open range doesn't really
make sense. This would be a pervasive change, though.
llvm-svn: 114043
iterator when an optimization took place. This allows us to do more insane
things with the code than just remove an instruction or two.
llvm-svn: 113640
take multiple cycles to decode.
For the current if-converter clients (actually only ARM), the instructions that
are predicated on false are not nops. They would still take machine cycles to
decode. Micro-coded instructions such as LDM / STM can potentially take multiple
cycles to decode. The if-converter should treat them as non-micro-coded
simple instructions.
llvm-svn: 113570
LiveIntervals already adds <imp-def> operands for super-registers when a subreg
def defines the whole register. Thus, it is not necessary to do it again when
rewriting.
In fact, the super-register imp-defs caused miscompilations because the late
scheduler couldn't see that the super-register was read.
We still add super-reg <imp-use,kill> operands when rewriting virtuals to
physicals.
llvm-svn: 113299
Since mem2reg isn't run at -O0, we get a ton of reloads from the stack,
for example, before, this code:
int foo(int x, int y, int z) {
return x+y+z;
}
used to compile into:
_foo: ## @foo
subq $12, %rsp
movl %edi, 8(%rsp)
movl %esi, 4(%rsp)
movl %edx, (%rsp)
movl 8(%rsp), %edx
movl 4(%rsp), %esi
addl %edx, %esi
movl (%rsp), %edx
addl %esi, %edx
movl %edx, %eax
addq $12, %rsp
ret
Now we produce:
_foo: ## @foo
subq $12, %rsp
movl %edi, 8(%rsp)
movl %esi, 4(%rsp)
movl %edx, (%rsp)
movl 8(%rsp), %edx
addl 4(%rsp), %edx ## Folded load
addl (%rsp), %edx ## Folded load
movl %edx, %eax
addq $12, %rsp
ret
Fewer instructions and less register use = faster compiles.
llvm-svn: 113102
overload UserInInstr. Explicitly check Allocatable. The early exit in the
condition will mean the performance impact of the extra test should be
minimal.
llvm-svn: 113016
solve the root problem, but it corrects the bug in the code I added to
support legalizing in the case where the non-extended type is also legal.
llvm-svn: 112997
slot.
Teach it to also check for early clobbered aliases, and early clobber operands
following the current operand.
This fixes the miscompilation in PR8044 where EC registers eax and ecx were
being used for inputs.
llvm-svn: 112988
Original commit message:
Use the SSAUpdator to turn calls to eh.exception that are not in a
landing pad into uses of registers rather than loads from a stack
slot. Doesn't touch the 'orrible hack code - Bill needs to persuade
me harder :)
llvm-svn: 112952
there are clearly no stores between the load and the store. This fixes
this miscompile reported as PR7833.
This breaks the test/CodeGen/X86/narrow_op-2.ll optimization, which is
safe, but awkward to prove safe. Move it to X86's README.txt.
llvm-svn: 112861
landing pad into uses of registers rather than loads from a stack
slot. Doesn't touch the 'orrible hack code - Bill needs to persuade
me harder :)
llvm-svn: 112702
Reserved registers are unpredictable, and are treated as always live by machine
DCE.
Allocatable registers are never reserved, and can be used for virtual registers.
Unreserved, unallocatable registers can not be used for virtual registers, but
otherwise behave like a normal allocatable register. Most targets only have
the flag register in this set.
llvm-svn: 112649
1. Allocate them in the entry block of the function to enable function-wide
re-use. The instructions to create them should be re-materializable, so
there shouldn't be additional cost compared to creating them local
to the basic blocks where they are used.
2. Collect all of the frame index references for the function and sort them
by the local offset referenced. Iterate over the sorted list to
allocate the virtual base registers. This enables creation of base
registers optimized for positive-offset access of frame references.
(Note: This may be appropriate to later be a target hook to do the
sorting in a target appropriate manner. For now it's done here for
simplicity.)
llvm-svn: 112609
any more. I plan to reimplement alloca promotion using SSAUpdater later.
It looks like Bill's URoR logic really always needs domtree, so the pass
now always asks for domtree info.
llvm-svn: 112597
Eventually, we want to disable physreg coalescing completely, and let the
register allocator do its job using hints.
This option makes it possible to measure the impact of disabling physreg
coalescing.
llvm-svn: 112567
1) nuke ConstDataCoalSection, which is dead.
2) revise my previous patch for rdar://8018335,
which was completely wrong. Specifically, it doesn't
make sense to mark __TEXT,__const_coal as PURE_INSTRUCTIONS,
because it is for readonly data. Templates (it turns out)
go to const_coal_nt. The real fix for rdar://8018335 was
to give ConstTextCoalSection a section kind of ReadOnly
instead of Text.
llvm-svn: 112496
instead of PromoteMemToReg. This allows it to stop using DF and DT,
eliminating a computation of DT and DF from clang -O3. Clang is now
down to 2 runs of DomFrontier.
llvm-svn: 112457
expanding: e.g. <2 x float> -> <4 x float> instead of -> 2 floats. This
affects two places in the code: handling cross block values and handling
function return and arguments. Since vectors are already widened by
LegalizeTypes, this gives us much better code and unblocks x86-64 abi
and SPU abi work.
For example, this (which is a silly example of a cross-block value):
define <4 x float> @test2(<4 x float> %A) nounwind {
%B = shufflevector <4 x float> %A, <4 x float> undef, <2 x i32> <i32 0, i32 1>
%C = fadd <2 x float> %B, %B
br label %BB
BB:
%D = fadd <2 x float> %C, %C
%E = shufflevector <2 x float> %D, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>
ret <4 x float> %E
}
Now compiles into:
_test2: ## @test2
## BB#0:
addps %xmm0, %xmm0
addps %xmm0, %xmm0
ret
previously it compiled into:
_test2: ## @test2
## BB#0:
addps %xmm0, %xmm0
pshufd $1, %xmm0, %xmm1
## kill: XMM0<def> XMM0<kill> XMM0<def>
insertps $0, %xmm0, %xmm0
insertps $16, %xmm1, %xmm0
addps %xmm0, %xmm0
ret
This implements rdar://8230384
llvm-svn: 112101
hierarchy with virtual methods and using llvm_unreachable to properly indicate
unreachable states which would otherwise leave variables uninitialized.
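The usual shape of such code, for reference (a generic sketch rather than the exact code
touched here):

#include "llvm/Support/ErrorHandling.h"

enum Kind { KindA, KindB };

int classify(Kind K) {
  switch (K) {
  case KindA: return 0;
  case KindB: return 1;
  }
  // Documents the impossible path, so no dummy return value or
  // uninitialized variable is needed to satisfy the compiler.
  llvm_unreachable("Unhandled Kind");
}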
llvm-svn: 111803
It's similar to "linker_private_weak", but it's known that the address of the
object is not taken. For instance, functions that had an inline definition, but
the compiler decided not to inline it. Note, unlike linker_private and
linker_private_weak, linker_private_weak_def_auto may have only default
visibility. The symbols are removed by the linker from the final linked image
(executable or dynamic library).
llvm-svn: 111684
it involves specific floating-point types, legalize should expand an
extending load to a non-extending load followed by a separate extend operation.
For example, we currently expand SEXTLOAD to EXTLOAD+SIGN_EXTEND_INREG (and
assert that EXTLOAD should always be supported). Now we can expand that to
LOAD+SIGN_EXTEND. This is needed to allow vector SIGN_EXTEND and ZERO_EXTEND
to be used for NEON.
llvm-svn: 111586
The Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 16.00.30319.01
implements parts of C++0x based on the draft standard. An old version of
the draft had a bug that makes std::pair<T1*, T2*>(something, 0) fail to
compile. This is because the template<class U, class V> pair(U&& x, V&& y)
constructor is selected, even though it later fails to implicitly convert
U and V to first_type and second_type.
This has been fixed in n3090, but it seems that Microsoft is not going to
update msvc.
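A minimal reproduction of the failing pattern (standalone illustration, not code from the
tree):

#include <utility>

void f(int *P) {
  // On the affected MSVC, overload resolution picks the
  // template<class U, class V> pair(U&&, V&&) constructor and then fails
  // to convert the literal 0 to 'int*'.
  std::pair<int *, int *> PR(P, 0);
  (void)PR;
}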
llvm-svn: 111535
base registers were required. This will allow for slightly better packing
of the locals when alignment padding is necessary after callee saved registers.
llvm-svn: 111508
frame index reference to an object in the local block is seen, check if
it's near enough to any previously allocated base register to re-use.
rdar://8277890
llvm-svn: 111443
We must complete the DFS, otherwise we might miss needed phi-defs, and
prematurely color live ranges with a non-dominating value.
This is not a big deal since we get to color more of the CFG and the next
mapValue call will be faster.
llvm-svn: 111397
LiveIntervalMap maps values from a parent LiveInterval to a child interval that
is a strict subset. It will create phi-def values as needed to preserve the
VNInfo SSA form in the child interval.
This leads to an algorithm very similar to the one in SSAUpdaterImpl.h, but with
enough differences that the code can't be reused:
- We don't need to manipulate PHI instructions.
- LiveIntervals have kills.
- We have MachineDominatorTree.
- We can use df_iterator.
llvm-svn: 111393
Nothing fancy, just ask the target if any currently available base reg
is in range for the instruction under consideration and use the first one
that is. Placeholder ARM implementation simply returns false for now.
ongoing saga of rdar://8277890
llvm-svn: 111374
the local block. Resolve references to those indices to a new base register.
For simplification and testing purposes, a new virtual base register is
allocated for each frame index being resolved. The result is truly horrible,
but correct, code that's good for exercising the new code paths.
Next up is adding thumb1 support, which should be very simple. Following that
will be adding base register re-use and implementing a reasonable ARM
heuristic for when a virtual base register should be generated at all.
llvm-svn: 111315
whether to allocate a virtual frame base register to resolve the frame
index reference in it. Implement a simple version for ARM to aid debugging.
In LocalStackSlotAllocation, scan the function for frame index references
to local frame indices and ask the target whether to allocate virtual
frame base registers for any it encounters. Purely infrastructural for
debug output. Next step is to actually allocate base registers, then add
intelligent re-use of them.
rdar://8277890
llvm-svn: 111262
mapping. Have the local block track its alignment requirement, and then
apply that when the block itself is allocated. Previously, offsets could
get adjusted in PEI to be different, relative to one another, than the
block allocation thought they would be, which defeats the point of doing
the allocation this way. Continuing rdar://8277890
llvm-svn: 111197
experimental pass that allocates locals relative to one another before
register allocation and then assigns them to actual stack slots as a block
later in PEI. This will eventually allow targets with limited index offset
range to allocate additional base registers (not just FP and SP) to
more efficiently reference locals, as well as handle situations where
locals cannot be referenced via SP or FP at all (dynamic stack realignment
together with variable sized objects, for example). It's currently
incomplete and almost certainly buggy. Work in progress.
Disabled by default and gated via the -enable-local-stack-alloc command
line option.
rdar://8277890
llvm-svn: 111059
The earliestStart argument is entirely specific to linear scan allocation, and
can be easily calculated by RegAllocLinearScan.
Replace std::vector with SmallVector.
llvm-svn: 111055
When a live range is contained a single block, we can split it around
instruction clusters. The current approach is very primitive, splitting before
and after the largest gap between uses.
llvm-svn: 111043
numbers match. The old check could accidentally leave holes in openli.
Also let useIntv add all ranges for the phi-def value inserted by
enterIntvAtEnd. This works as long at the value mapping is established in
enterIntvAtEnd.
llvm-svn: 110995
This can happen if the original interval has been broken into two disconnected
parts. Ideally, we should be able to detect when the graph is disconnected and
create separate intervals, but that code is not implemented yet.
Example:
Two basic blocks are both branching to a loop header. Our interval is defined in
both basic blocks, and live into the loop along both edges.
We decide to split the interval around the loop. The interval is split into an
inside part and an outside part. The outside part now has two disconnected
segments, one in each basic block.
If we later decide to split the outside interval into single blocks, we get one
interval per basic block and an empty dupli for the remainder.
llvm-svn: 110976
Before spilling a live range, we split it into a separate range for each basic
block where it is used. That way we only get one reload per basic block if the
new smaller ranges can allocate to a register.
This type of splitting is already present in the standard spiller.
llvm-svn: 110934
operands. We don't currently have a hook to provide "the largest super class of
A where all registers' getSubReg(subidx) is valid and in B".
llvm-svn: 110730
The live interval may be used for a spill slot as well, and that spill slot
could be shared by split registers. We cannot shrink it, even if we know the
current register won't need the spill slot in that range.
llvm-svn: 110721
When splitting a live range, the new registers have fewer uses and the
permissible register class may be less constrained. Recompute the register class
constraint from the uses of new registers created for a split. This may let them
be allocated from a larger set, possibly avoiding a spill.
llvm-svn: 110703
register at a time. This turns out to be slightly faster than iterating over
instructions, but more importantly, it allows us to compute spill weights for
new registers created after the spill weight pass has run.
Also compute the allocation hint at the same time as the spill weight. This
allows us to use the spill weight as a cost metric for copies, and choose the
most profitable hint if there is more than one possibility.
The new hints provide a very small (< 0.1%) but universal code size improvement.
llvm-svn: 110631
If we are emitting COPY instructions for the REG_SEQUENCE, make sure the kill
flag goes on the last COPY. Otherwise we may be using a killed register.
<rdar://problem/8287792>
llvm-svn: 110589
relatively expensive comparison analyzer on each instruction. Also rename the
comparison analyzer method to something more in line with what it actually does.
This pass will eventually be folded into the Machine CSE pass.
llvm-svn: 110539
necessary.
Sometimes, live range splitting doesn't shrink the current interval, but simply
changes some instructions to use a new interval. That makes the original more
suitable for spilling. In this case, we don't need to duplicate the original.
llvm-svn: 110481
After heavy editing of a live interval, it is much easier to simply renumber the
live values instead of trying to keep track of the unused ones.
llvm-svn: 110463
When a physical register is in use, some alias of that register has a live
interval with a relevant live range. That is the sad state of intervals after
physreg coalescing of subregs, and it is good enough for correct register
allocation.
llvm-svn: 110452
This pass tries to remove comparison instructions when possible. For instance,
if you have this code:
sub r1, 1
cmp r1, 0
bz L1
and "sub" either sets the same flag as the "cmp" instruction or could be
converted to set the same flag, then we can eliminate the "cmp" instruction all
together. This is important for ARM, where the ALU instructions could set the
CPSR flag, but need a special suffix ('s') to do so.
llvm-svn: 110423
LiveVariables becomes horribly wrong while the coalescer is running, but the
analysis is not zapped until after the coalescer pass has run. This causes tons
of false reports when calling verify from the coalescer.
llvm-svn: 110402
We verify that the LiveInterval is live at uses and defs, and that all
instructions have a SlotIndex.
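The liveness part of that check boils down to the following (a standalone
sketch with made-up types, not the verifier's real data structures):

#include <cstdio>
#include <vector>

// Hypothetical sketch: an interval is a set of half-open [Start, End)
// segments over instruction indices; every use or def of the register must
// fall inside one of them.
struct Segment { int Start, End; };

static bool liveAt(const std::vector<Segment> &Interval, int Index) {
  for (const Segment &S : Interval)
    if (Index >= S.Start && Index < S.End)
      return true;
  return false;
}

void checkOperand(const std::vector<Segment> &Interval, int InstrIndex, int Reg) {
  if (!liveAt(Interval, InstrIndex))
    std::printf("*** Bad machine code: reg %d not live at index %d ***\n",
                Reg, InstrIndex);
}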
Stuff we don't check yet:
- Is the LiveInterval minimal?
- Do all defs correspond to instructions or phis?
- Do all defs dominate all their live ranges?
- Are all live ranges continually reachable from their def?
llvm-svn: 110386
be killed before being redefined.
These checks are usually disabled, and usually fail when enabled. We de facto
allow live registers to be redefined without a kill; the corresponding
assertions in RegScavenger were removed long ago.
llvm-svn: 110362
When the normalizeSpillWeights function was introduced, I forgot to remove this
normalization.
This change could affect register allocation. Hopefully for the better.
llvm-svn: 110119
multiple defs, like t2LDRSB_POST.
The first def could accidentally steal the physreg that the second, tied def was
required to be allocated to.
Now, the tied use-def is treated more like an early clobber, and the physreg is
reserved before allocating the other defs.
This would never be a problem when the tied def was the only def which is the
usual case.
This fixes MallocBench/gs for thumb2 -O0.
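A minimal sketch of the fix under hypothetical types (not the register
allocator's actual data structures): reserve the physreg required by the tied
use-def before handing out registers to the instruction's other defs:

#include <cassert>
#include <set>
#include <utility>
#include <vector>

// Hypothetical sketch: two passes over an instruction's defs. The first pass
// reserves the physregs demanded by tied use-defs (like early clobbers); the
// second pass allocates the remaining defs from what is still free.
struct DefOp { int VirtReg; bool TiedToUse; int RequiredPhysReg; };

std::vector<std::pair<int, int>>
allocateDefs(const std::vector<DefOp> &Defs, std::set<int> FreePhysRegs) {
  std::vector<std::pair<int, int>> Assignments;   // (virtreg, physreg)
  for (const DefOp &D : Defs)
    if (D.TiedToUse)
      FreePhysRegs.erase(D.RequiredPhysReg);      // reserve it up front
  for (const DefOp &D : Defs) {
    if (D.TiedToUse) {
      Assignments.push_back({D.VirtReg, D.RequiredPhysReg});
    } else {
      assert(!FreePhysRegs.empty() && "no free physreg; a real allocator spills");
      int Reg = *FreePhysRegs.begin();
      FreePhysRegs.erase(FreePhysRegs.begin());
      Assignments.push_back({D.VirtReg, Reg});
    }
  }
  return Assignments;
}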
llvm-svn: 109715
protectors, to be near the stack protectors on the stack. Accomplish this by
tagging the stack object with a predicate that indicates that it would trigger
this. In the prolog-epilog inserter, assign these objects to the stack after the
stack protector but before the other objects.
llvm-svn: 109481
appropriate for targets without detailed instruction itineraries.
The scheduler schedules for increased instruction-level parallelism in low
register pressure situations; it schedules to reduce register pressure when
the register pressure becomes high.
On x86_64, this is a win for all tests in CFP2000. It also sped up 256.bzip2
by 16%.
llvm-svn: 109300
to be of a different register class. For example, in Thumb1 if the live-in is
a high register, we want the vreg to be a low register. rdar://8224931
llvm-svn: 109291
it's too late to start backing off aggressive latency scheduling when most
of the registers are in use, so the threshold should be a bit tighter.
- Correctly handle live-outs and extract_subreg, etc.
- Enable register pressure aware scheduling by default for hybrid scheduler.
For ARM, this is almost always a win on # of instructions. It's runtime
neutral for most of the tests. But for some kernels with high register
pressure it can be a huge win. e.g. 464.h264ref reduced number of spills by
54 and sped up by 20%.
llvm-svn: 109279
Make MDNode::destroy private.
Fix the one thing that used MDNode::destroy, outside of MDNode itself.
One should never delete or destroy an MDNode explicitly. MDNodes
implicitly go away when there are no references to them (implementation
details aside).
llvm-svn: 109028
This is a work in progress. So far we have some basic loop analysis to help
determine where it is useful to split a live range around a loop.
The actual loop splitting code from Splitter.cpp is also going to move in here.
llvm-svn: 108842
loop, for the reasons in the comments. This is a
major win on 253.perlbmk on ARM Darwin. I expect it
to be a good heuristic in general, but it's possible
some things will regress; I'll be watching.
7940152.
llvm-svn: 108792
- Unfortunate, but necessary for now to handle subtarget instruction matching. Eventually we should factor out the lower level target machine information so we don't need to do this.
llvm-svn: 108664
I am assured by people more knowledgeable than me that there are no rounding issues in eliminating this.
This fixed <rdar://problem/8197504>.
llvm-svn: 108639
Still very much under development. Comments and fixes will be forthcoming.
(This commit includes some small tweaks to LiveIntervals & LoopInfo to support the splitter)
llvm-svn: 108615
any command line parameter changed the register allocation produced by
PBQP.
Turns out variety is not the spice of life.
Fixed some comparators, added others. All good now.
llvm-svn: 108613
void func1() { __builtin_unreachable(); }
It will output the following on Darwin X86:
_func1:
Leh_func_begin0:
pushq %rbp
Ltmp0:
movq %rsp, %rbp
Ltmp1:
Leh_func_end0:
This prolog adds a new Call Frame Information (CFI) row to the FDE with an
address that is not within the address range of the code it describes -- it is
equal to the end of the function -- and therefore results in an invalid EH
frame. If we emit a nop in this situation, the CFI row falls within the
function's address range.
llvm-svn: 108568
since it doesn't work for front-ends which don't emit column information
(which includes llvm-gcc in its present configuration), and doesn't
work for clang for K&R style variables where the variables are declared
in a different order from the parameter list.
Instead, make a separate pass through the instructions to collect the
llvm.dbg.declare instructions in order. This ensures that the debug
information for variables is emitted in this order.
llvm-svn: 108538
occasions, caused code to be generated in a different order.
All cases I've seen involved float softening in the type
legalizer, and this could perhaps be fixed there, but
it's better not to generate things differently in the first
place. 7797940 (6/29/2010..7/15/2010).
llvm-svn: 108484
the function. We'll just turn it into a "trap" instruction instead.
The problem with not handling this is that it might generate a prologue without
the equivalent epilogue to go with it:
$ cat t.ll
define void @foo() {
entry:
unreachable
}
$ llc -o - t.ll -relocation-model=pic -disable-fp-elim -unwind-tables
.section __TEXT,__text,regular,pure_instructions
.globl _foo
.align 4, 0x90
_foo: ## @foo
Leh_func_begin0:
## BB#0: ## %entry
pushq %rbp
Ltmp0:
movq %rsp, %rbp
Ltmp1:
Leh_func_end0:
...
The unwind tables then have bad data in them causing all sorts of problems.
Fixes <rdar://problem/8096481>.
llvm-svn: 108473
-enable-no-nans-fp-math and -enable-no-infs-fp-math. All of the current codegen
FP math optimizations only care whether the FP arithmetic's arguments and
results can never be NaN.
llvm-svn: 108465
to keep "Text" in sync with the "pure instructions" section attribute.
Lack of this attribute was preventing the assembler from emitting
multibyte noop instructions for templates (and inlines, and other
coalesced stuff) and was causing the assembler to mismatch .o files.
This fixes rdar://8018335
llvm-svn: 108461
independent of the order that isel happens to visit the dbg_declare
intrinsics. This fixes a bug in which the formal arguments were
being printed in reverse order, now that fast isel is going bottom up.
llvm-svn: 108369
LiveInterval::overlapsFrom dereferences end() if it is called on an empty
interval.
It would be reasonable to just return false - an empty interval doesn't overlap
anything, but I want to know who is doing it first.
llvm-svn: 108264
they already have one.
This fixes the himenobmtxpa miscompilation on ARM.
The PostRA scheduler got confused by the double memoperand and hoisted a stack
slot load above a store to the same slot.
llvm-svn: 108219
AggressiveAntiDepBreaker should not be using getPhysicalRegisterRegClass. An
instruction might be using a register that can only be replaced with one from
a subclass of getPhysicalRegisterRegClass.
With this patch we use getMinimalPhysRegClass. This is correct, but
conservative. We should check the uses of the register and select the
largest register class that can be used in all of them.
llvm-svn: 108122
physical register can be allocated in the class of the virtual are sufficient.
I think that the test for virtual registers is stricter than it needs to be;
it should be possible to coalesce two virtual registers when the class of one
is a subclass of the other.
llvm-svn: 108118
getMinimalPhysRegClass. It was used to produce spills, and it is better to
use the most specific class if possible.
Update getLoadStoreRegOpcode to handle GR32_AD.
llvm-svn: 108115
Targets must now implement TargetInstrInfo::copyPhysReg instead. There is no
longer a default implementation forwarding to copyRegToReg.
llvm-svn: 108095
The first one was used just to call isSafeToMoveRegClassDefs. In
general, using a more specific reg class is better; in practice, only
x86 implements that method and the results are always the same.
The second one is in FindFreeRegister and is used to check if a register
is in a register class. A much more direct call to contains is better, as
it should cover more cases and is faster.
llvm-svn: 108093
assert()s, switching to void-casts. Removed an unneeded Compiler.h include as
a result. There are two other uses in LLVM, but they're not due to assert()s,
so I've left them alone.
llvm-svn: 108088
correct alignment information, which simplifies ExpandRes_VAARG a bit.
The patch introduces a new alignment information to TargetLoweringInfo. This is
needed since the two natural candidates cannot be used:
* The 's' in target data: If this is set to the minimal alignment of any
argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for
example.
* The getTransientStackAlignment method. It is possible for an architecture to
have arguments less aligned than the alignment we maintain for the stack
pointer.
llvm-svn: 108072
The remaining copyRegToReg calls actually check the return value (shock!), so we
cannot trivially replace them with COPY instructions.
llvm-svn: 108069
if a block is split (by a custom inserter), the insert point may be in a
different block than it was originally. This fixes 32-bit llvm-gcc
bootstrap builds, and I haven't been able to reproduce it otherwise.
llvm-svn: 108060
ScheduleDAGEmit, TwoAddressLowering, and PHIElimination.
This switches the bulk of register copies to using COPY, but many less used
copyRegToReg calls remain.
llvm-svn: 108050
- Check getBytesToPopOnReturn().
- Eschew ST0 and ST1 for return values.
- Fix the PIC base register initialization so that it doesn't ever
fail to end up the top of the entry block.
llvm-svn: 108039
inserted in a MBB, and return an already inserted MI.
This target API change is necessary to allow foldMemoryOperand to call
storeToStackSlot and loadFromStackSlot when folding a COPY to a stack slot
reference in a target independent way.
The foldMemoryOperandImpl hook is going to change in the same way, but I'll wait
until COPY folding is actually implemented. Most targets only fold copies and
won't need to specialize this hook at all.
llvm-svn: 107991
U utils/TableGen/FastISelEmitter.cpp
--- Reverse-merging r107943 into '.':
U test/CodeGen/X86/fast-isel.ll
U test/CodeGen/X86/fast-isel-loads.ll
U include/llvm/Target/TargetLowering.h
U include/llvm/Support/PassNameParser.h
U include/llvm/CodeGen/FunctionLoweringInfo.h
U include/llvm/CodeGen/CallingConvLower.h
U include/llvm/CodeGen/FastISel.h
U include/llvm/CodeGen/SelectionDAGISel.h
U lib/CodeGen/LLVMTargetMachine.cpp
U lib/CodeGen/CallingConvLower.cpp
U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp
U lib/CodeGen/SelectionDAG/FastISel.cpp
U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp
U lib/CodeGen/SelectionDAG/InstrEmitter.cpp
U lib/CodeGen/SelectionDAG/TargetLowering.cpp
U lib/Target/XCore/XCoreISelLowering.cpp
U lib/Target/XCore/XCoreISelLowering.h
U lib/Target/X86/X86ISelLowering.cpp
U lib/Target/X86/X86FastISel.cpp
U lib/Target/X86/X86ISelLowering.h
llvm-svn: 107987
disabled and then never turned back on again. Adjust some tests, one because
this change avoids an unnecessary instruction, and the other to make it
continue testing what it was intended to test.
llvm-svn: 107941
the simplification of frame index register scavenging to not have to check
for available registers directly and instead just let scavengeRegister()
handle it.
llvm-svn: 107880
EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead.
Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg().
The isMoveInstr hook will be removed later.
llvm-svn: 107879
This target hook is intended to replace copyRegToReg entirely, but for now it
calls copyRegToReg.
Any remaining calls to copyRegToReg will be replaced by COPY instructions.
llvm-svn: 107854
(if there are any) and use the one which remains available for the longest
rather than just using the first one. This should help enable better re-use
of the loaded frame index values. rdar://7318760
llvm-svn: 107847
around everywhere, and also give it an InsertPt member, to enable isel
to operate at an arbitrary position within a block, rather than just
appending to a block.
llvm-svn: 107791
INSERT_SUBREG will now only appear in SSA machine instructions.
Fix the handling of partial redefs in ProcessImplicitDefs. This is now relevant
since partial redef COPY instructions appear.
llvm-svn: 107726
It is OK for an alias live range to overlap if there is a copy to or from the
physical register. CoalescerPair can work out if the copy is coalescable
independently of the alias.
This means that we can join with the actual destination interval instead of
using the getOrigDstReg() hack. It is no longer necessary to merge clobber
ranges into subregisters.
llvm-svn: 107695
This way *only* debug sections can be discarded, but not the opposite. Seems
like a copy-and-pasto from the ELF code, since there it contains the reverse
flag ('alloc').
llvm-svn: 107658
The COPY instruction is intended to replace the target specific copy
instructions for virtual registers as well as the EXTRACT_SUBREG and
INSERT_SUBREG instructions in MachineFunctions. It won't be used in a selection
DAG.
COPY is lowered to native register copies by LowerSubregs.
llvm-svn: 107529
new basic blocks, and if used as a function argument, that can cause call frame
setup / destroy pairs to be split across a basic block boundary. That prevents
us from doing a simple assertion to check that the pairs match and alloc/
dealloc the same amount of space. Modify the assertion to only check the
amount allocated when there are matching pairs in the same basic block.
rdar://8022442
llvm-svn: 107517
- X86 unfolding should check whether the instruction being unfolded has
memoperands. If there are no memoperands, it must assume conservative
alignment. If this
would introduce an expensive sse unaligned load / store, then unfoldMemoryOperand
etc. should not unfold the instruction.
llvm-svn: 107509
PrologEpilog code, and use it to determine whether
the asm forces stack alignment or not. gcc consistently
does not do this for GCC-style asms; Apple gcc inconsistently
sometimes does it for asm blocks. There is no
convenient place to put a bit in either the SDNode or
the MachineInstr form, so I've added an extra operand
to each; unlovely, but it does allow for expansion for
more bits, should we need it. PR 5125. Some
existing testcases are affected.
The operand lists of the SDNode and MachineInstr forms
are indexed with awesome mnemonics, like "2"; I may
fix this someday, but not now. I'm not making it any
worse. If anyone is inspired I think you can find all
the right places from this patch.
llvm-svn: 107506
This allows us to recognize the common case where all uses could be
rematerialized, and no stack slot allocation is necessary.
If some values could be fully rematerialized, remove them from the live range
before allocating a stack slot for the rest.
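A minimal sketch of that logic, with hypothetical types standing in for the
spiller's: keep only the uses that still need memory, and allocate the stack
slot only when that set is non-empty:

#include <vector>

// Hypothetical sketch: partition the uses of a value into those that can be
// rematerialized in place and those that still need a reload from a slot.
struct UseSite { int Index; bool CanRemat; };

std::vector<UseSite> usesStillNeedingSlot(const std::vector<UseSite> &Uses) {
  std::vector<UseSite> Remaining;
  for (const UseSite &U : Uses)
    if (!U.CanRemat)
      Remaining.push_back(U);
  return Remaining;             // empty => no stack slot is needed at all
}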
llvm-svn: 107492
Objective-C metadata types which should be marked as "weak", but which the
linker will remove upon final linkage. However, this linkage isn't specific to
Objective-C.
For example, the "objc_msgSend_fixup_alloc" symbol is defined like this:
.globl l_objc_msgSend_fixup_alloc
.weak_definition l_objc_msgSend_fixup_alloc
.section __DATA, __objc_msgrefs, coalesced
.align 3
l_objc_msgSend_fixup_alloc:
.quad _objc_msgSend_fixup
.quad L_OBJC_METH_VAR_NAME_1
This is different from the "linker_private" linkage type, because it can't have
the metadata defined with ".weak_definition".
Currently only supported on Darwin platforms.
llvm-svn: 107433
LocalRewriter::runOnMachineFunction uses this information to mark dead spill
slots.
This means that InlineSpiller now also works for functions that spill.
llvm-svn: 107302
InlineSpiller inserts loads and spills immediately instead of deferring to
VirtRegMap. This is possible now because SlotIndexes allows instructions to be
inserted and renumbered.
This is work in progress, and is mostly a copy of TrivialSpiller so far. It
works very well for functions that don't require spilling.
llvm-svn: 107227
metadata types which should be marked as "weak", but which the linker will
remove upon final linkage. For example, the "objc_msgSend_fixup_alloc" symbol is
defined like this:
.globl l_objc_msgSend_fixup_alloc
.weak_definition l_objc_msgSend_fixup_alloc
.section __DATA, __objc_msgrefs, coalesced
.align 3
l_objc_msgSend_fixup_alloc:
.quad _objc_msgSend_fixup
.quad L_OBJC_METH_VAR_NAME_1
This is different from the "linker_private" linkage type, because it can't have
the metadata defined with ".weak_definition".
llvm-svn: 107205
A partial redefine needs to be treated like a tied operand, and the register
must be reloaded while processing use operands.
This fixes a bug where partially redefined registers were processed as normal
defs with a reload added. The reload could clobber another use operand if it was
a kill that allowed register reuse.
llvm-svn: 107193
The LowerSubregs pass needs to preserve implicit def operands attached to
EXTRACT_SUBREG instructions when it replaces those instructions with copies.
llvm-svn: 107189
of getPhysicalRegisterRegClass with it.
If we want to make a copy (or estimate its cost), it is better to use the
smallest class as more efficient operations might be possible.
llvm-svn: 107140
There are 2 changes relative to the previous version of the patch:
1) For the "simple" if-conversion case, there's no need to worry about
RemoveExtraEdges not handling an unanalyzable branch. Predicated terminators
are ignored in this context, so RemoveExtraEdges does the right thing.
This might break someday if we ever treat indirect branches (BRIND) as
predicable, but for now, I just removed this part of the patch, because
in the case where we do not add an unconditional branch, we rely on keeping
the fall-through edge to CvtBBI (which is empty after this transformation).
The change relative to the previous patch is:
@@ -1036,10 +1036,6 @@
IterIfcvt = false;
}
- // RemoveExtraEdges won't work if the block has an unanalyzable branch,
- // which is typically the case for IfConvertSimple, so explicitly remove
- // CvtBBI as a successor.
- BBI.BB->removeSuccessor(CvtBBI->BB);
RemoveExtraEdges(BBI);
// Update block info. BB can be iteratively if-converted.
2) My patch exposed a bug in the code for merging the tail of a "diamond",
which had previously never been exercised. The code was simply checking that
the tail had a single predecessor, but there was a case in
MultiSource/Benchmarks/VersaBench/dbms where that single predecessor was
neither edge of the diamond. I added the following change to check for
that:
@@ -1276,7 +1276,18 @@
// tail, add a unconditional branch to it.
if (TailBB) {
BBInfo TailBBI = BBAnalysis[TailBB->getNumber()];
- if (TailBB->pred_size() == 1 && !TailBBI.HasFallThrough) {
+ bool CanMergeTail = !TailBBI.HasFallThrough;
+ // There may still be a fall-through edge from BBI1 or BBI2 to TailBB;
+ // check if there are any other predecessors besides those.
+ unsigned NumPreds = TailBB->pred_size();
+ if (NumPreds > 1)
+ CanMergeTail = false;
+ else if (NumPreds == 1 && CanMergeTail) {
+ MachineBasicBlock::pred_iterator PI = TailBB->pred_begin();
+ if (*PI != BBI1->BB && *PI != BBI2->BB)
+ CanMergeTail = false;
+ }
+ if (CanMergeTail) {
MergeBlocks(BBI, TailBBI);
TailBBI.IsDone = true;
} else {
With these fixes, I was able to run all the SingleSource and MultiSource
tests successfully.
llvm-svn: 107110
have to be registers, per gcc documentation. This affects
the logic for determining what "g" should lower to. PR 7393.
A couple of existing testcases are affected.
llvm-svn: 107079
When an instruction has tied operands and physreg defines, we must take extra
care that the tied operands conflict with neither physreg defs nor uses.
The special treatment is given to inline asm and instructions with tied operands
/ early clobbers and physreg defines.
This fixes PR7509.
llvm-svn: 107043
if-conversion. The RemoveExtraEdges function doesn't work for blocks that
end with unanalyzable branches, so in those cases, the "extra" edges must
be explicitly removed. The CopyAndPredicateBlock and MergeBlocks methods
can also avoid copying successor edges due to branches that have already
been removed. The latter case is especially helpful when MergeBlocks is
called for handling "diamond" if-conversions, where otherwise you can end
up with some weird intermediate states in the CFG. Unfortunately I've
been unable to find cases where this cleanup actually makes a significant
difference in the code. There is one test where we manage to remove an
empty block at the end of a function. Radar 6911268.
llvm-svn: 106939
The VNInfo.kills vector was almost unused except for all the code keeping it
updated. The few places using it were easily rewritten to check for interval
ends instead.
The two new methods LiveInterval::killedAt and killedInRange are replacements.
This brings us down to 3 independent data structures tracking kills.
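Roughly (hypothetical types, simplified), the replacements just inspect where
the interval's segments end, which is exactly what the kills vector cached:

#include <vector>

// Hypothetical sketch: a value is killed where one of its live segments ends,
// so a cached kill list is redundant. Simplified; the real methods handle
// more cases than this sketch.
struct Segment { int Start, End; };          // half-open [Start, End)

bool killedAt(const std::vector<Segment> &Interval, int Index) {
  for (const Segment &S : Interval)
    if (S.End == Index)
      return true;
  return false;
}

bool killedInRange(const std::vector<Segment> &Interval, int Start, int End) {
  for (const Segment &S : Interval)
    if (S.End > Start && S.End <= End)
      return true;
  return false;
}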
llvm-svn: 106905
for an "i" constraint should get lowered; PR 6309. While
this argument was passed around a lot, this is the only
place it was used, so it goes away from a lot of other
places.
llvm-svn: 106893
are dead, not just the def of this register. I.e., a register could be dead, but
its subreg isn't.
Testcase to follow with a subsequent patch.
llvm-svn: 106878
and CallInst for getting hold of the intrinsic's arguments.
Simplify along the way (at least for me this is much more legible now).
Bill, Baldrick or Anton, please review!
llvm-svn: 106838
This fixes PR7479 and PR7485. The test cases from those PRs are big, so not
included. However, PR7485 comes from self hosting on FreeBSD, so we will surely
hear about any regression.
llvm-svn: 106811
which don't have a catch-all associated with them, not just clean-ups. This fixes
the SingleSource/Benchmarks/Shootout-C++/except.cpp testcase that broke because
of my change r105902.
llvm-svn: 106772
CoalescerPair can determine if a copy can be coalesced, and which register gets
merged away. The old logic in SimpleRegisterCoalescing had evolved into
something a bit too convoluted.
This second attempt fixes some crashes that only occurred on Linux.
llvm-svn: 106769
[L]oad, [u]se, [d]ef, or [S]tore slots.
This makes it easier to see if two indices refer to the same instruction,
avoiding mental mod 4 calculations.
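For example, under a hypothetical encoding with four slots per instruction
(this is only a sketch of the idea, not the real SlotIndexes printing code),
two indices that differ only in the slot print with the same number:

#include <cstdio>
#include <string>

// Hypothetical sketch: four slots per instruction, printed as a letter suffix
// so "18u" and "18d" are obviously the same instruction, with no mental
// mod-4 arithmetic required.
enum Slot { LoadSlot, UseSlot, DefSlot, StoreSlot };

std::string printIndex(int InstrNumber, Slot S) {
  static const char Suffix[] = {'L', 'u', 'd', 'S'};
  return std::to_string(InstrNumber) + Suffix[S];
}

int main() {
  std::printf("%s %s\n", printIndex(18, UseSlot).c_str(),
              printIndex(18, DefSlot).c_str());   // prints "18u 18d"
  return 0;
}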
llvm-svn: 106766
In this case it is essential that the kill is real because the spiller will
decide to omit a spill if it thinks there is a later kill.
llvm-svn: 106751
when the condition is constant. This optimization shouldn't be
necessary, because codegen shouldn't be able to find dead control
paths that the IR-level optimizer can't find. And it's undesirable,
because it encourages bugpoint to leave "br i1 false" branches
in its output. And it wasn't updating the CFG.
I updated all the tests I could, but some tests are too reduced
and I wasn't able to meaningfully preserve them.
llvm-svn: 106748
CoalescerPair can determine if a copy can be coalesced, and which register gets
merged away. The old logic in SimpleRegisterCoalescing had evolved into
something a bit too convoluted.
llvm-svn: 106701
atomic intrinsics, either because they use locking instructions for the
atomics, or because they perform the locking directly. Add support in the
DAG combiner to fold away the fences.
llvm-svn: 106630
instructions.
This does not affect codegen much because SUBREG_TO_REG is only used by X86 and
X86 does not use the register scavenger, but it prevents verifier errors.
llvm-svn: 106583
Measurements show that it does not speed up coalescing, so there is no reason
to keep the added complexity around.
Also clean out some unused methods and static functions.
llvm-svn: 106548
opportunities. For example, this lets it emit this:
movq (%rax), %rcx
addq %rdx, %rcx
instead of this:
movq %rdx, %rcx
addq (%rax), %rcx
in the case where %rdx has subsequent uses. It's the same number
of instructions, and usually the same encoding size on x86, but
it appears faster, and in general, it may allow better scheduling
for the load.
llvm-svn: 106493
Split the code for materializing a value out of
SelectionDAGBuilder::getValue into a helper function, so that it can
be used in other ways. Add a new getNonRegisterValue function which
uses it, for use in code which doesn't want a CopyFromReg even
when FuncMap.ValueMap already has an entry for it.
llvm-svn: 106422
- This fixed a number of bugs in if-converter, tail merging, and post-allocation
scheduler. If-converter now runs branch folding / tail merging first to
maximize if-conversion opportunities.
- Also changed the t2IT instruction slightly. It now defines the ITSTATE
register which is read by instructions in the IT block.
- Added Thumb2 specific hazard recognizer to ensure the scheduler doesn't
change the instruction ordering in the IT block (since IT mask has been
finalized). It also ensures no other instructions can be scheduled between
instructions in the IT block.
This is not yet enabled.
llvm-svn: 106344
instructions, but it doesn't really understand live ranges, so the first
INSERT_SUBREG uses an implicitly defined register.
Fix it in LiveVariableAnalysis by adding the <undef> flag.
llvm-svn: 106333
entries used by llvm-gcc. *_[U]MIN and such can be added later if needed.
This enables the front ends to simplify handling of the atomic intrinsics by
removing the target-specific decision about which targets can handle the
intrinsics.
llvm-svn: 106321
so when IfConverter::CopyAndPredicateBlock checks to see if it should ignore
an instruction because it is a branch, it should not check if the branch is
predicated.
This case (when IgnoreBr is true) is only relevant from IfConvertTriangle,
where new branches are inserted after the block has been copied and predicated.
If the original branch is not removed, we end up with multiple conditional
branches (possibly conflicting) at the end of the block. Aside from any
immediate errors resulting from that, this confuses the AnalyzeBranch functions
so that the branches are not analyzable. That in turn causes the IfConverter to
think that the "Simple" pattern can be applied, and things go downhill fast
because the "Simple" pattern does _not_ apply if the block can fall through.
This is pretty fragile. If there are other degenerate cases where AnalyzeBranch
fails, but where the block may still fall through, the IfConverter should not
perform its "Simple" if-conversion. But, I don't know how to do that with the
current AnalyzeBranch interface, so for now, the best thing seems to be to
avoid creating branches that AnalyzeBranch cannot handle.
Evan, please review!
llvm-svn: 106291
switch from this:
if (TimePassesIsEnabled) {
NamedRegionTimer T(Name, GroupName);
do_something();
} else {
do_something(); // duplicate the code, this time without a timer!
}
to this:
{
NamedRegionTimer T(Name, GroupName, TimePassesIsEnabled);
do_something();
}
llvm-svn: 106285
addresses a longstanding deficiency noted in many FIXMEs scattered
across all the targets.
This effectively moves the problem up one level, replacing eleven
FIXMEs in the targets with eight FIXMEs in CodeGen, plus one path
through FastISel where we actually supply a DebugLoc, fixing Radar
7421831.
llvm-svn: 106243
for the moment. The implementation of the libcall will follow.
Currently, the llvm-gcc knows when the intrinsics can be correctly handled by
the back end and only generates them in those cases, issuing libcalls directly
otherwise. That's too much coupling. The intrinsics should always be
generated and the back end decide how to handle them, be it with a libcall,
inline code, or whatever. This patch is a step in that direction.
rdar://8097623
llvm-svn: 106227
LiveVariableAnalysis was a bit picky about a register only being redefined once,
but that really isn't necessary.
Here is an example of chained INSERT_SUBREGs that we can handle now:
68 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1028<kill>, 14
register: %reg1040 +[70,134:0)
76 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1029<kill>, 13
register: %reg1040 replace range with [70,78:1) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,134:0) 0@78-(134) 1@70-(78)
84 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1030<kill>, 12
register: %reg1040 replace range with [78,86:2) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,134:0) 0@86-(134) 1@70-(78) 2@78-(86)
92 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1031<kill>, 11
register: %reg1040 replace range with [86,94:3) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,94:3)[94,134:0) 0@94-(134) 1@70-(78) 2@78-(86) 3@86-(94)
rdar://problem/8096390
llvm-svn: 106152
will conflict with another live range. The place which creates this scenario is
the code in X86 that lowers a select instruction by splitting the MBBs. This
eliminates the need to check from the bottom up in an MBB for live pregs.
llvm-svn: 106066
SimpleRegisterCoalescing::JoinIntervals() uses CoalescerPair to determine if a
copy is coalescable, and in very rare cases it can return true where LHS is not
live - the coalescable copy can come from an alias of the physreg in LHS.
llvm-svn: 106021
combined to an insert_subreg, i.e., where the destination register is larger
than the source. We need to check that the subregs can be composed for that
case in a symmetrical way to the case when the destination is smaller.
llvm-svn: 106004
Early clobbers defining a virtual register were first allocated to a physreg and
then processed as a physreg EC, spilling the virtreg.
This fixes PR7382.
llvm-svn: 105998
Given a copy instruction, CoalescerPair can determine which registers to
coalesce in order to eliminate the copy. It deals with all the subreg fun to
determine a tuple (DstReg, SrcReg, SubIdx) such that:
- SrcReg is a virtual register that will disappear after coalescing.
- DstReg is a virtual or physical register whose live range will be extended.
- SubIdx is 0 when DstReg is a physical register.
- SrcReg can be joined with DstReg:SubIdx.
CoalescerPair::isCoalescable() determines if another copy instruction is
compatible with the same tuple. This fixes some NEON miscompilations where
shuffles are getting coalesced as if they were copies.
The CoalescerPair class will replace a lot of the spaghetti logic in JoinCopy
later.
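A very rough sketch of computing that tuple (a hypothetical helper that
ignores sub-register composition and register class checks, which the real
class handles):

#include <optional>
#include <utility>

// Hypothetical sketch: decide which side of a copy survives. SrcReg must be
// the virtual register that disappears; SubIdx is forced to 0 when DstReg is
// physical, per the rules above.
struct Pair { int DstReg, SrcReg, SubIdx; };

std::optional<Pair> makePair(int Dst, bool DstIsPhys,
                             int Src, bool SrcIsPhys, int SubIdx) {
  if (DstIsPhys && SrcIsPhys)
    return std::nullopt;               // a phys-to-phys copy cannot go away
  if (SrcIsPhys) {
    std::swap(Dst, Src);               // keep the physical register as DstReg
    std::swap(DstIsPhys, SrcIsPhys);
  }
  if (DstIsPhys)
    SubIdx = 0;
  return Pair{Dst, Src, SubIdx};       // SrcReg joins with DstReg:SubIdx
}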
llvm-svn: 105997
replacing the overly conservative checks that I had introduced recently to
deal with correctness issues. This makes a pretty noticeable difference
in our testcases where reg_sequences are used. I've updated one test to
check that we no longer emit the unnecessary subreg moves.
llvm-svn: 105991