llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	16bfe5b0f5	PHI elimination shouldn't require machineloopinfo since it's used at -O0. Move the requirement to LiveIntervalAnalysis instead. Note this does not change the number of times machineloopinfo is computed. llvm-svn: 111285	2010-08-17 21:00:37 +00:00
Evan Cheng	e0db9d01d9	Machine CSE preserves CFG. Pass manager was freeing machineloopinfo after machine cse before. llvm-svn: 111281	2010-08-17 20:57:42 +00:00
Jim Grosbach	1a58ce7646	silence warning llvm-svn: 111274	2010-08-17 20:21:30 +00:00
Jim Grosbach	c252ee2375	Add hook to examine an instruction referencing a frame index to determine whether to allocate a virtual frame base register to resolve the frame index reference in it. Implement a simple version for ARM to aid debugging. In LocalStackSlotAllocation, scan the function for frame index references to local frame indices and ask the target whether to allocate virtual frame base registers for any it encounters. Purely infrastructural for debug output. Next step is to actually allocate base registers, then add intelligent re-use of them. rdar://8277890 llvm-svn: 111262	2010-08-17 18:13:53 +00:00
Evan Cheng	647c559172	Move the decision logic whether it's a good idea to split a critical edge to clients. Also fixed an erroneous check. An edge is only a back edge when the from and to blocks are in the same loop. llvm-svn: 111256	2010-08-17 17:43:50 +00:00
Evan Cheng	a6848249ee	Fix debug message. llvm-svn: 111250	2010-08-17 17:15:14 +00:00
Eric Christopher	541f8012d9	Fix typo. llvm-svn: 111223	2010-08-17 01:30:33 +00:00
Evan Cheng	f259efde47	PHI elimination should not break back edge. It can cause some significant code placement issues. rdar://8263994 good: LBB0_2: mov r2, r0 . . . mov r1, r2 bne LBB0_2 bad: LBB0_2: mov r2, r0 . . . @ BB#3: mov r1, r2 b LBB0_2 llvm-svn: 111221	2010-08-17 01:20:36 +00:00
Jim Grosbach	a7c562d664	tidy up. remove unused local. llvm-svn: 111206	2010-08-16 23:26:09 +00:00
Jim Grosbach	36d5ec383e	Better handle alignment requirements for local objects in pre-regalloc frame mapping. Have the local block track its alignment requirement, and then apply that when the block itself is allocated. Previously, offsets could get adjusted in PEI to be different, relative to one another, than the block allocation thought they would be, which defeats the point of doing the allocation this way. Continuing rdar://8277890 llvm-svn: 111197	2010-08-16 22:30:41 +00:00
Eli Friedman	7e2f4ce439	Until uleb/sleb are MC-ized, add a hack to make them work with ELF object emission. llvm-svn: 111177	2010-08-16 20:08:40 +00:00
Jim Grosbach	8be0196afe	track local frame size in MFI, not local to the pass, since PEI needs it. llvm-svn: 111164	2010-08-16 18:06:15 +00:00
Jakob Stoklund Olesen	5f72a04ba7	Remove unused functions. llvm-svn: 111156	2010-08-16 17:18:20 +00:00
Ted Kremenek	da2eba58ed	Update CMake build. llvm-svn: 111063	2010-08-14 01:55:09 +00:00
Jim Grosbach	a030fa5297	Add a local stack object block allocation pass. This is still an experimental pass that allocates locals relative to one another before register allocation and then assigns them to actual stack slots as a block later in PEI. This will eventually allow targets with limited index offset range to allocate additional base registers (not just FP and SP) to more efficiently reference locals, as well as handle situations where locals cannot be referenced via SP or FP at all (dynamic stack realignment together with variable sized objects, for example). It's currently incomplete and almost certainly buggy. Work in progress. Disabled by default and gated via the -enable-local-stack-alloc command line option. rdar://8277890 llvm-svn: 111059	2010-08-14 00:15:52 +00:00
Jakob Stoklund Olesen	27e1f26534	Clean up the Spiller.h interface. The earliestStart argument is entirely specific to linear scan allocation, and can be easily calculated by RegAllocLinearScan. Replace std::vector with SmallVector. llvm-svn: 111055	2010-08-13 22:56:53 +00:00
Jakob Stoklund Olesen	d1191ee43c	Implement splitting inside a single block. When a live range is contained a single block, we can split it around instruction clusters. The current approach is very primitive, splitting before and after the largest gap between uses. llvm-svn: 111043	2010-08-13 21:18:48 +00:00
Jim Grosbach	d1f4465df0	tidy up whitespace a bit llvm-svn: 111019	2010-08-13 16:55:08 +00:00
Jakob Stoklund Olesen	3d1027e7a1	Let LiveInterval::addRange extend existing ranges, it will verify that value numbers match. The old check could accidentally leave holes in openli. Also let useIntv add all ranges for the phi-def value inserted by enterIntvAtEnd. This works as long at the value mapping is established in enterIntvAtEnd. llvm-svn: 110995	2010-08-13 01:05:26 +00:00
Jakob Stoklund Olesen	840b81a19e	Remember to actually update SplitAnalysis statistics now that we have a fancy function to do it. llvm-svn: 110994	2010-08-13 01:05:23 +00:00
Jakob Stoklund Olesen	991e4ee860	Handle an empty dupli. This can happen if the original interval has been broken into two disconnected parts. Ideally, we should be able to detect when the graph is disconnected and create separate intervals, but that code is not implemented yet. Example: Two basic blocks are both branching to a loop header. Our interval is defined in both basic blocks, and live into the loop along both edges. We decide to split the interval around the loop. The interval is split into an inside part and an outside part. The outside part now has two disconnected segments, one in each basic block. If we later decide to split the outside interval into single blocks, we get one interval per basic block and an empty dupli for the remainder. llvm-svn: 110976	2010-08-12 23:02:57 +00:00
Jakob Stoklund Olesen	32c181c444	Update the SplitAnalysis statistics as uses are moved from curli to the new split intervals. THis means the analysis can be used for multiple splits as long as curli doesn't shrink. llvm-svn: 110975	2010-08-12 23:02:55 +00:00
Jakob Stoklund Olesen	0910689353	Also recompute HasPHIKill flags in LiveInterval::RenumberValues. If a phi-def value were removed from the interval, the phi-kill flags are no longer valid. llvm-svn: 110949	2010-08-12 20:38:03 +00:00
Jakob Stoklund Olesen	073cd8004a	Remove trailing whitespace. llvm-svn: 110944	2010-08-12 20:01:23 +00:00
Jakob Stoklund Olesen	fa3ea11ae6	Clean up debug output. llvm-svn: 110940	2010-08-12 18:50:55 +00:00
Jakob Stoklund Olesen	622848b262	Implement single block splitting. Before spilling a live range, we split it into a separate range for each basic block where it is used. That way we only get one reload per basic block if the new smaller ranges can allocate to a register. This type of splitting is already present in the standard spiller. llvm-svn: 110934	2010-08-12 17:07:14 +00:00
Jakob Stoklund Olesen	852a2c19dd	Fix a FIXME. The SlotIndex::Slot enum should be private. llvm-svn: 110826	2010-08-11 16:50:17 +00:00
Bill Wendling	0757820f8f	Turn optimize compares back on with fix. We needed to test that a machine op was a register before checking if it was defined. llvm-svn: 110733	2010-08-10 21:38:11 +00:00
Jakob Stoklund Olesen	57f3db6e2e	Give up on register class recalculation when the register is used with subreg operands. We don't currently have a hook to provide "the largest super class of A where all registers' getSubReg(subidx) is valid and in B". llvm-svn: 110730	2010-08-10 21:16:16 +00:00
Dan Gohman	a53f4e23e4	Revert r110718; it broke clang-i386-darwin9. llvm-svn: 110726	2010-08-10 20:49:33 +00:00
Jakob Stoklund Olesen	3b870f045f	Avoid editing the current live interval during remat. The live interval may be used for a spill slot as well, and that spill slot could be shared by split registers. We cannot shrink it, even if we know the current register won't need the spill slot in that range. llvm-svn: 110721	2010-08-10 20:45:07 +00:00
Jakob Stoklund Olesen	62e721478b	More debug spew llvm-svn: 110720	2010-08-10 20:45:01 +00:00
Bill Wendling	558f822bc7	Turn optimize cmps on by default so that we can get some testing by the nightly ARM testers. llvm-svn: 110718	2010-08-10 20:23:02 +00:00
Devang Patel	8e06a5eb47	Do not forget debug info for enums. Use named mdnode to keep track of these types. llvm-svn: 110712	2010-08-10 20:01:20 +00:00
Jakob Stoklund Olesen	53c5022040	Implement register class inflation. When splitting a live range, the new registers have fewer uses and the permissible register class may be less constrained. Recompute the register class constraint from the uses of new registers created for a split. This may let them be allocated from a larger set, possibly avoiding a spill. llvm-svn: 110703	2010-08-10 18:37:40 +00:00
Jakob Stoklund Olesen	284c2dbfd7	Recalculate the spill weight and allocation hint for virtual registers created during live range splitting. llvm-svn: 110686	2010-08-10 17:07:22 +00:00
Devang Patel	b219746c80	Handle TAG_constant for integers. llvm-svn: 110656	2010-08-10 07:11:13 +00:00
Bill Wendling	884514066e	Update CMake...sorry for the breakage. llvm-svn: 110654	2010-08-10 05:16:06 +00:00
Devang Patel	18ba0b4ac3	Simplify. llvm-svn: 110653	2010-08-10 04:12:17 +00:00
Devang Patel	b1e07b3f2a	Drop "const". It does not add value here. llvm-svn: 110652	2010-08-10 04:09:06 +00:00
Evan Cheng	23ef829096	Add missing null check reported by Amaury Pouly. llvm-svn: 110649	2010-08-10 02:39:45 +00:00
Devang Patel	469c12d254	Do not include file static variable in pubnames list. Refactor and simplify code to avoid redundant checks. llvm-svn: 110642	2010-08-10 01:37:23 +00:00
Jakob Stoklund Olesen	e00c49da11	Transpose the calculation of spill weights such that we are calculating one register at a time. This turns out to be slightly faster than iterating over instructions, but more importantly, it allows us to compute spill weights for new registers created after the spill weight pass has run. Also compute the allocation hint at the same time as the spill weight. This allows us to use the spill weight as a cost metric for copies, and choose the most profitable hint if there is more than one possibility. The new hints provide a very small (< 0.1%) but universal code size improvement. llvm-svn: 110631	2010-08-10 00:02:26 +00:00
Bill Wendling	ca67835eaa	Merge the OptimizeExts and OptimizeCmps passes into one PeepholeOptimizer pass. This pass should expand with all of the small, fine-grained optimization passes to reduce compile time and increase happiment. llvm-svn: 110627	2010-08-09 23:59:04 +00:00
Devang Patel	394a69ed52	Undo accidental commit. llvm-svn: 110623	2010-08-09 23:28:52 +00:00
Devang Patel	4eda9abddb	Simplify. Avoid redundant checks. llvm-svn: 110621	2010-08-09 23:26:06 +00:00
Devang Patel	c7cf14f5f6	Refactor. llvm-svn: 110607	2010-08-09 21:39:24 +00:00
Devang Patel	6d9f9feb2b	Refactoring. Update DbgVarible to handle queries itself. llvm-svn: 110600	2010-08-09 21:01:39 +00:00
Devang Patel	b6511a36b4	It is ok, and convenient, to pass descriptors by value. llvm-svn: 110590	2010-08-09 20:20:05 +00:00
Jakob Stoklund Olesen	3fa110f227	A REG_SEQUENCE instruction may use the same register twice. If we are emitting COPY instructions for the REG_SEQUENCE, make sure the kill flag goes on the last COPY. Otherwise we may be using a killed register. <rdar://problem/8287792> llvm-svn: 110589	2010-08-09 20:19:16 +00:00
Devang Patel	406798a17d	Rename a method. llvm-svn: 110586	2010-08-09 18:51:29 +00:00
Bill Wendling	798617b1ab	Use the "isCompare" machine instruction attribute instead of calling the relatively expensive comparison analyzer on each instruction. Also rename the comparison analyzer method to something more in line with what it actually does. This pass is will eventually be folded into the Machine CSE pass. llvm-svn: 110539	2010-08-08 05:04:59 +00:00
Dan Gohman	093b42fc7c	Tidy some #includes and forward-declarations, and move the C binding code out of PassManager.cpp and into Core.cpp with the rest of the C binding code. llvm-svn: 110494	2010-08-07 00:43:20 +00:00
Jakob Stoklund Olesen	45e07c8fc5	Lazily defer duplicating the live interval we are splitting until we know it is necessary. Sometimes, live range splitting doesn't shrink the current interval, but simply changes some instructions to use a new interval. That makes the original more suitable for spilling. In this case, we don't need to duplicate the original. llvm-svn: 110481	2010-08-06 22:17:33 +00:00
Jim Grosbach	da27eb246d	Cleanup comment wording llvm-svn: 110466	2010-08-06 18:59:07 +00:00
Jakob Stoklund Olesen	1dfca4e4bb	Keep the MachiuneFunctionPass pointer around. It is useful for verification. llvm-svn: 110464	2010-08-06 18:47:06 +00:00
Jakob Stoklund Olesen	8c0f693150	Add LiveInterval::RenumberValues - Garbage collection for VNInfos. After heavy editing of a live interval, it is much easier to simply renumber the live values instead of trying to keep track of the unused ones. llvm-svn: 110463	2010-08-06 18:46:59 +00:00
Owen Anderson	a7aed18624	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Jakob Stoklund Olesen	8147d7a6b9	Add more verification of LiveIntervals. llvm-svn: 110454	2010-08-06 18:04:19 +00:00
Jakob Stoklund Olesen	7e0de5ef8e	Fix swapped COPY operands. llvm-svn: 110453	2010-08-06 18:04:17 +00:00
Jakob Stoklund Olesen	0e7752407c	Don't try to verify LiveIntervals for physical registers. When a physical register is in use, some alias of that register has a live interval with a relevant live range. That is the sad state of intervals after physreg coalescing of subregs, and it is good enough for correct register allocation. llvm-svn: 110452	2010-08-06 18:04:14 +00:00
Ted Kremenek	26177d2c24	Update CMake build. llvm-svn: 110429	2010-08-06 04:05:21 +00:00
Bill Wendling	7de9d52c13	Add the Optimize Compares pass (disabled by default). This pass tries to remove comparison instructions when possible. For instance, if you have this code: sub r1, 1 cmp r1, 0 bz L1 and "sub" either sets the same flag as the "cmp" instruction or could be converted to set the same flag, then we can eliminate the "cmp" instruction all together. This is a important for ARM where the ALU instructions could set the CPSR flag, but need a special suffix ('s') to do so. llvm-svn: 110423	2010-08-06 01:32:48 +00:00
Devang Patel	8a18aee421	While emitting DBG_VALUE for registers spilled at the end of a block do not use location of MBB->end(). If a block does not have terminator then incoming iterator points to end(). llvm-svn: 110411	2010-08-06 00:26:18 +00:00
Owen Anderson	bda59bd247	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Jakob Stoklund Olesen	01a81b01bc	Be more aggressive about removing joined physreg copies. When a joined COPY changes subreg liveness, we keep it around as a KILL, otherwise it is safe to delete. llvm-svn: 110403	2010-08-05 23:51:28 +00:00
Jakob Stoklund Olesen	b4ef4a961d	Don't verify LiveVariables if LiveIntervals is available. LiveVariables becomes horribly wrong while the coalescer is running, but the analysis is not zapped until after the coalescer pass has run. This causes tons of false reports when calling verify form the coalescer. llvm-svn: 110402	2010-08-05 23:51:26 +00:00
Owen Anderson	755aceb5d0	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Jakob Stoklund Olesen	e7709ebb64	Add basic verification of LiveIntervals. We verify that the LiveInterval is live at uses and defs, and that all instructions have a SlotIndex. Stuff we don't check yet: - Is the LiveInterval minimal? - Do all defs correspond to instructions or phis? - Do all defs dominate all their live ranges? - Are all live ranges continually reachable from their def? llvm-svn: 110386	2010-08-05 22:32:21 +00:00
Jakob Stoklund Olesen	4583355a78	Remove double-def checking from MachineVerifier, so a register does not have to be killed before being redefined. These checks are usually disabled, and usually fail when enabled. We de facto allow live registers to be redefined without a kill, the corresponding assertions in RegScavenger were removed long ago. llvm-svn: 110362	2010-08-05 18:59:59 +00:00
Jakob Stoklund Olesen	d9572619e2	Avoid using a live std::multimap iterator while editing the map. It looks like we sometimes compare singular iterators, reported by ENABLE_EXPENSIVE_CHECKS. This fixes PR7825. llvm-svn: 110355	2010-08-05 18:12:19 +00:00
Bill Wendling	ca1cb13646	The lower invoke pass needs to have unreachable code elimination run after it because it could create such things. This fixes a MingW buildbot test failure. llvm-svn: 110279	2010-08-04 23:36:02 +00:00
Jakob Stoklund Olesen	7fd4905f08	Coalesce stack slot accesses that arise when spilling both sides of a COPY. This helps avoid silly code: %R0<def = LOAD <fi#5> STORE <fi#5>, %R0<kill> llvm-svn: 110266	2010-08-04 22:35:11 +00:00
Jakob Stoklund Olesen	dc96e28d70	Checkpoint SplitKit progress. We are now at a point where we can split around simple single-entry, single-exit loops, although still with some bugs. llvm-svn: 110257	2010-08-04 22:08:39 +00:00
Devang Patel	6c378ac473	Use location entry only of the location described by DBG_VALUE is valid. llvm-svn: 110255	2010-08-04 22:07:27 +00:00
Bill Wendling	b87f3e5a7d	The EH prepare passes really want to be the last passes run before code-gen. llvm-svn: 110248	2010-08-04 21:44:13 +00:00
Devang Patel	6d21f61b3f	Fix typo in comment. llvm-svn: 110244	2010-08-04 20:32:36 +00:00
Dan Gohman	2392287306	Change this llvm_unreachable to report_fatal_error, since it can be triggered by valid, if dubious, IR. llvm-svn: 110240	2010-08-04 18:51:09 +00:00
Devang Patel	d71bc1ae4e	While spilling live registers at the end of block check whether they are used by DBG_VALUE machine instructions or not. If a spilled register is used by DBG_VALUE machine instruction then insert a new DBG_VALUE machine instruction to encode variable's new location on stack. llvm-svn: 110235	2010-08-04 18:42:02 +00:00
Devang Patel	0e60e67efb	If a variable is spilled by code generator then use DW_OP_fbreg to describe its location on stack. llvm-svn: 110234	2010-08-04 18:40:52 +00:00
Dan Gohman	5cae103392	Eliminate unnecessary empty string literals. llvm-svn: 110183	2010-08-04 01:39:08 +00:00
Jakob Stoklund Olesen	0c18757c9d	Oops. Don't normalize spill weights twice. When the normalizeSpillWeights function was introduced, I forgot to remove this normalization. This change could affect register allocation. Hopefully for the better. llvm-svn: 110119	2010-08-03 17:21:16 +00:00
Bill Wendling	44dc60ba13	Early exit and reduce indentation. No functionality change. llvm-svn: 110069	2010-08-02 22:06:08 +00:00
Devang Patel	d070128de5	Free DbgScope created for dead functions. llvm-svn: 110045	2010-08-02 17:32:15 +00:00
Oscar Fuentes	40b31ad3ee	Prefix `next' iterator operation with `llvm::'. Fixes potential ambiguity problems on VS 2010. Patch by nobled! llvm-svn: 110029	2010-08-02 06:00:15 +00:00
Eli Friedman	460ad41d6d	PR7586: Make sure we don't claim that unknown bits are actually known in the ISD::AND case of TargetLowering::SimplifyDemandedBits. llvm-svn: 110019	2010-08-02 04:42:25 +00:00
Bill Wendling	d9900542a6	Reference the personalities. Don't copy them into a new vector. llvm-svn: 109966	2010-08-01 01:34:21 +00:00
Eli Friedman	ffe64c06ef	Fix for bug reported by Evzen Muller on llvm-commits: make sure to correctly check the range of the constant when optimizing a comparison between a constant and a sign_extend_inreg node. llvm-svn: 109854	2010-07-30 06:44:31 +00:00
Benjamin Kramer	a3e0ddb564	Plug the remaining MC leaks by giving MCObjectStreamer/MCAsmStreamer ownership of the TargetAsmBackend and the MCCodeEmitter. llvm-svn: 109767	2010-07-29 17:48:06 +00:00
Dale Johannesen	329d4741a5	Comment typo. llvm-svn: 109765	2010-07-29 17:45:24 +00:00
Jakob Stoklund Olesen	36cf119049	Fix a bug in the -regalloc=fast handling of exotic two-address instruction with multiple defs, like t2LDRSB_POST. The first def could accidentally steal the physreg that the second, tied def was required to be allocated to. Now, the tied use-def is treated more like an early clobber, and the physreg is reserved before allocating the other defs. This would never be a problem when the tied def was the only def which is the usual case. This fixes MallocBench/gs for thumb2 -O0. llvm-svn: 109715	2010-07-29 00:52:19 +00:00
Jakob Stoklund Olesen	0ff2c110ad	Print out the regclass of any virtual registers used by a machine instruction. llvm-svn: 109608	2010-07-28 18:35:46 +00:00
Devang Patel	84a74779a1	It is FE's responsibility to emit proper directory name. llvm-svn: 109538	2010-07-27 20:51:15 +00:00
Jim Grosbach	7383cf06ba	Grammar llvm-svn: 109525	2010-07-27 18:36:27 +00:00
Nate Begeman	317b969ac5	Fix a crash in the dag combiner caused by ConstantFoldBIT_CONVERTofBUILD_VECTOR calling itself recursively and returning a SCALAR_TO_VECTOR node, but assuming the input was always a BUILD_VECTOR. llvm-svn: 109519	2010-07-27 18:02:18 +00:00
Jim Grosbach	2ff0e64bc3	80 column llvm-svn: 109513	2010-07-27 17:38:47 +00:00
Jim Grosbach	7639967e6c	fix typo llvm-svn: 109511	2010-07-27 17:14:29 +00:00
Bill Wendling	0ff1ef650b	It's better to have the arrays, which would trigger the creation of stack protectors, to be near the stack protectors on the stack. Accomplish this by tagging the stack object with a predicate that indicates that it would trigger this. In the prolog-epilog inserter, assign these objects to the stack after the stack protector but before the other objects. llvm-svn: 109481	2010-07-27 01:55:19 +00:00
Jakob Stoklund Olesen	c698417e52	Add SplitEditor to SplitKit. This class will be used to edit live intervals and rewrite instructions for live range splitting. Still work in progress. llvm-svn: 109469	2010-07-26 23:44:11 +00:00
Dan Gohman	c2af77f510	Fix a use-after-free. llvm-svn: 109468	2010-07-26 23:40:24 +00:00
Bill Wendling	fa60b0ee51	Using llvm.eh.catch.all.value instead of .llvm.eh.catch.all.value. llvm-svn: 109462	2010-07-26 22:36:52 +00:00
Evan Cheng	e6d6c5dd11	The "excess register pressure" returned by HighRegPressure() is not accurate enough to factor into scheduling priority. Eliminate it and add early exits to speed up scheduling. llvm-svn: 109449	2010-07-26 21:49:07 +00:00
Dan Gohman	2810bacafb	Handle Values with no value in getCopyFromRegs. llvm-svn: 109415	2010-07-26 18:15:41 +00:00
Dan Gohman	f9da3c3b88	A block dominates itself, by definition. llvm-svn: 109402	2010-07-26 17:38:15 +00:00
Duncan Sands	136a6f0dbb	Pacify gcc-4.5 which wrongly thinks that RExcess (passed as the Excess parameter) may be used uninitialized in the callers of HighRegPressure. llvm-svn: 109393	2010-07-26 07:54:17 +00:00
Lang Hames	2e3f20b9aa	Factored out a bit of common code to mark VNInfos for deletion. llvm-svn: 109388	2010-07-26 01:49:41 +00:00
Evan Cheng	8ae3ecad2b	Add comments. llvm-svn: 109383	2010-07-25 18:59:43 +00:00
Bob Wilson	280ce9984e	Fix crashes when scheduling a CopyToReg node -- getMachineOpcode asserts on those. Radar 8231572. llvm-svn: 109367	2010-07-25 05:34:27 +00:00
Anton Korobeynikov	3c8eb80d93	Add hook to insert late LLVM=>LLVM passes just before isel llvm-svn: 109354	2010-07-24 20:48:54 +00:00
Bob Wilson	56c006561c	Change ScheduleDAGInstrs::Defs and ::Uses to be variable-size vectors instead of fixed size arrays, so that increasing FirstVirtualRegister to 16K won't cause a compile time performance regression. llvm-svn: 109330	2010-07-24 06:01:53 +00:00
Devang Patel	498877d055	Use current working directory when Dirname is empty. This only happens when absolute source file path is used on compiler command line. llvm-svn: 109302	2010-07-24 00:53:22 +00:00
Evan Cheng	37b740c4bf	Add an ILP scheduler. This is a register pressure aware scheduler that's appropriate for targets without detailed instruction iterineries. The scheduler schedules for increased instruction level parallelism in low register pressure situation; it schedules to reduce register pressure when the register pressure becomes high. On x86_64, this is a win for all tests in CFP2000. It also sped up 256.bzip2 by 16%. llvm-svn: 109300	2010-07-24 00:39:05 +00:00
Jim Grosbach	ba4b1909ce	Remove too-strict assertion. We may want the vreg copy of the physical register to be of a different register class. For example, in Thumb1 if the live-in is a high register, we want the vreg to be a low register. rdar://8224931 llvm-svn: 109291	2010-07-23 23:48:02 +00:00
Devang Patel	28499f76c9	Revert r109262. llvm-svn: 109285	2010-07-23 23:04:41 +00:00
Evan Cheng	df907f4594	- Allow target to specify when is register pressure "too high". In most cases, it's too late to start backing off aggressive latency scheduling when most of the registers are in use so the threshold should be a bit tighter. - Correctly handle live out's and extract_subreg etc. - Enable register pressure aware scheduling by default for hybrid scheduler. For ARM, this is almost always a win on # of instructions. It's runtime neutral for most of the tests. But for some kernels with high register pressure it can be a huge win. e.g. 464.h264ref reduced number of spills by 54 and sped up by 20%. llvm-svn: 109279	2010-07-23 22:39:59 +00:00
Dan Gohman	55e244698a	Use the proper type for shift counts. This fixes a bootstrap error. llvm-svn: 109265	2010-07-23 21:08:12 +00:00
Devang Patel	3032354bbe	IF directory name is empty then try to extract one using absolute file name. llvm-svn: 109262	2010-07-23 20:36:13 +00:00
Dan Gohman	0818684a70	DAGCombine (shl (anyext x, c)) to (anyext (shl x, c)) if the high bits are not demanded. This often allows the anyext to be folded away. llvm-svn: 109242	2010-07-23 18:03:30 +00:00
Dan Gohman	2e00e3b12d	Make SDNode::dump() print a newline at the end. llvm-svn: 109234	2010-07-23 16:37:47 +00:00
Eric Christopher	faf5c76114	80-col. llvm-svn: 109205	2010-07-23 01:05:59 +00:00
Chris Lattner	8f3adc9057	remove the JIT "NeedsExactSize" feature and supporting logic. llvm-svn: 109167	2010-07-22 21:17:55 +00:00
Gabor Greif	59f9970ba5	keep in 80 cols llvm-svn: 109122	2010-07-22 17:18:03 +00:00
Gabor Greif	dde79d8f1a	mass elimination of reliance on automatic iterator dereferencing llvm-svn: 109103	2010-07-22 13:36:47 +00:00
Gabor Greif	3e44ea1917	undo 80 column trespassing I caused llvm-svn: 109092	2010-07-22 10:37:47 +00:00
Evan Cheng	bf32e54bac	Re-apply r109079 with fix. llvm-svn: 109083	2010-07-22 06:24:48 +00:00
Owen Anderson	6c55cccf87	Revert r109079, which broke a lot of CodeGen tests. llvm-svn: 109082	2010-07-22 06:01:28 +00:00
Reid Kleckner	d85e3c5a86	Initial modifications to MCAssembler and TargetMachine for the MCJIT. Patch by Olivier Meurant! llvm-svn: 109080	2010-07-22 05:58:53 +00:00
Evan Cheng	bd81bff672	Initialize RegLimit only when register pressure is being tracked. llvm-svn: 109079	2010-07-22 05:18:41 +00:00
Evan Cheng	285903853f	More register pressure aware scheduling work. llvm-svn: 109064	2010-07-21 23:53:58 +00:00
Jim Grosbach	965a73a28c	For ARM/Darwin, add a dwarf entry indicating whether a function is arm or thumb rdar://8202967 llvm-svn: 109057	2010-07-21 23:03:52 +00:00
Owen Anderson	a57b97e7e7	Fix batch of converting RegisterPass<> to INTIALIZE_PASS(). llvm-svn: 109045	2010-07-21 22:09:45 +00:00
Jim Grosbach	a8683bb033	80 column and trailing whitespace cleanup llvm-svn: 109037	2010-07-21 21:21:52 +00:00
Dan Gohman	093cb79d4b	Disallow null as a named metadata operand. Make MDNode::destroy private. Fix the one thing that used MDNode::destroy, outside of MDNode itself. One should never delete or destroy an MDNode explicitly. MDNodes implicitly go away when there are no references to them (implementation details aside). llvm-svn: 109028	2010-07-21 18:54:18 +00:00
Lang Hames	bdafcc633d	Changed OStream templates to functions on raw_ostream, removed the unused "renderWarnings" function. llvm-svn: 109003	2010-07-21 09:02:06 +00:00
Evan Cheng	a77f3d3b37	Teach bottom up pre-ra scheduler to track register pressure. Work in progress. llvm-svn: 108991	2010-07-21 06:09:07 +00:00
Jakob Stoklund Olesen	0fef9dda8e	Change the createSpiller interface to take a MachineFunctionPass argument. The spillers can pluck the analyses they need from the pass reference. Switch some never-null pointers to references. llvm-svn: 108969	2010-07-20 23:50:15 +00:00
Jakob Stoklund Olesen	ed4075cc3b	Implement loop splitting analysis. Determine which loop exit blocks need a 'pre-exit' block inserted. Recognize when this would be impossible. llvm-svn: 108941	2010-07-20 21:46:58 +00:00
Dale Johannesen	6e5ec6263e	Fix test for switch statements and increase threshold a bit per experimentation. llvm-svn: 108935	2010-07-20 21:29:12 +00:00
Jakob Stoklund Olesen	ff095507e3	Appease the colonials. llvm-svn: 108845	2010-07-20 16:12:37 +00:00
Jakob Stoklund Olesen	36d12c679d	Beginning SplitKit - utility classes for live range splitting. This is a work in progress. So far we have some basic loop analysis to help determine where it is useful to split a live range around a loop. The actual loop splitting code from Splitter.cpp is also going to move in here. llvm-svn: 108842	2010-07-20 15:41:07 +00:00
Lang Hames	31dfb75b52	Updated css classes for the pressure table legend. llvm-svn: 108839	2010-07-20 14:35:55 +00:00
Lang Hames	2ff2193a80	Oops - I tables render poorly in Chrome without this explicit height specification. llvm-svn: 108824	2010-07-20 10:29:46 +00:00
Lang Hames	a475ab7f02	Use run-length encoding to represent identical adjacent cells in the pressure and interval table. Reduces output HTML file sizes by ~80% in my test cases. Also fix access of private member type by << operator. llvm-svn: 108823	2010-07-20 10:18:54 +00:00
Lang Hames	716b184108	Added support for turning HTML indentation on and off (indentation off by default). Reduces output file size ~20% on my test cases. llvm-svn: 108822	2010-07-20 09:13:29 +00:00
Lang Hames	a93fe2de3c	Switched to rendering after allocation (but before rewriting) in PBQP. Updated renderer to use allocation information from VirtRegMap (if available) to render spilled intervals differently. llvm-svn: 108815	2010-07-20 07:41:44 +00:00
Dale Johannesen	08645f1991	Don't hoist things out of a large switch inside a loop, for the reasons in the comments. This is a major win on 253.perlbmk on ARM Darwin. I expect it to be a good heuristic in general, but it's possible some things will regress; I'll be watching. `7940152`. llvm-svn: 108792	2010-07-20 00:50:13 +00:00
Stuart Hastings	61475c5c3c	Correct line info for declarations/definitions. Radar 8063111. llvm-svn: 108784	2010-07-19 23:56:30 +00:00
Devang Patel	d61b735d25	Fix memory leak reported by valgrind. Do not visit operands of old instruction. Visit all operands of new instruction. llvm-svn: 108767	2010-07-19 23:25:39 +00:00
Dan Gohman	b5e918dc05	After a custom inserter, in a block which has constant instructions, update the current basic block in addition to the current insert position, so that they remain consistent. This fixes rdar://8204072. llvm-svn: 108765	2010-07-19 22:48:56 +00:00
Evan Cheng	10f99a3490	ARM has to provide its own TargetLowering::findRepresentativeClass because its scalar floating point registers alias its vector registers. llvm-svn: 108761	2010-07-19 22:15:08 +00:00
Evan Cheng	7a135510e3	Teach computeRegisterProperties() to compute "representative" register class for legal value types. A "representative" register class is the largest legal super-reg register class for a value type. e.g. On i386, GR32 is the rep register class for i8 / i16 / i32; on x86_64 it would be GR64. This property will be used by the register pressure tracking instruction scheduler. llvm-svn: 108735	2010-07-19 18:47:01 +00:00
Jakob Stoklund Olesen	a58a7e7f9e	Spillers may alter MachineLoopInfo when breaking critical edges, so make it non-const. llvm-svn: 108734	2010-07-19 18:41:20 +00:00
Devang Patel	18efced1a2	Fix PR 7662. Do not try to insert local variable info to a DIE used for function declaration. llvm-svn: 108731	2010-07-19 17:53:55 +00:00
Benjamin Kramer	58c283ee85	Update CMake build. llvm-svn: 108700	2010-07-19 15:37:03 +00:00
Lang Hames	6624efb711	Render MachineFunctions to HTML pages, with options to render register pressure estimates and liveness alongside. Still experimental. llvm-svn: 108698	2010-07-19 15:22:28 +00:00
Owen Anderson	9c271e2835	Remove r108639 now that it is handled by InstCombine instead. llvm-svn: 108688	2010-07-19 08:10:24 +00:00
Daniel Dunbar	419197cc4d	Target: Give the TargetAsmParser access to the TargetMachine. - Unfortunate, but necessary for now to handle subtarget instruction matching. Eventually we should factor out the lower level target machine information so we don't need to do this. llvm-svn: 108664	2010-07-19 00:33:49 +00:00
Daniel Dunbar	7f5bf5ae2a	MC: Move several clients to using AsmParser constructor function. llvm-svn: 108645	2010-07-18 18:31:33 +00:00
Douglas Gregor	8ff89f5c02	Fix struct/class mismatch llvm-svn: 108642	2010-07-18 11:47:56 +00:00
Owen Anderson	f7f9c8a2f7	Add a DAGCombine xform to fold away redundant float->double->float conversions around sqrt instructions. I am assured by people more knowledgeable than me that there are no rounding issues in eliminating this. This fixed <rdar://problem/8197504>. llvm-svn: 108639	2010-07-18 08:47:54 +00:00
Lang Hames	1392b8eb79	Added -pbqp-pre-coalescing flag to PBQP. If enabled this will cause PBQP to require LoopSplitter be run prior to register allocation. Entirely for testing purposes at the moment. llvm-svn: 108634	2010-07-18 00:57:59 +00:00
Bill Wendling	ac67e99d53	Use isPrologLabel() instead of checking the opcode directly. llvm-svn: 108628	2010-07-17 19:18:44 +00:00
Zhongxing Xu	b653ce648d	update CMakeLists.txt llvm-svn: 108620	2010-07-17 12:12:42 +00:00
Lang Hames	5864012cc0	Removed unused inRange variable. llvm-svn: 108618	2010-07-17 11:43:07 +00:00
Lang Hames	225977d4f9	LoopSplitter - intended to split live intervals over loop boundaries. Still very much under development. Comments and fixes will be forthcoming. (This commit includes some small tweaks to LiveIntervals & LoopInfo to support the splitter) llvm-svn: 108615	2010-07-17 07:34:01 +00:00
Lang Hames	211e7ce7e7	Iterating over sets of pointers in a heuristic was a bad idea. Switching any command line paramater changed the register allocation produced by PBQP. Turns out variety is not the spice of life. Fixed some comparators, added others. All good now. llvm-svn: 108613	2010-07-17 06:31:41 +00:00
Eric Christopher	0baaa9bcc1	Propagate alloca alignment information via variable size object frame information. No functional change yet. llvm-svn: 108583	2010-07-17 00:28:22 +00:00
Bill Wendling	bf8370ff36	Consider this function: void foo() { __builtin_unreachable(); } It will output the following on Darwin X86: _func1: Leh_func_begin0: pushq %rbp Ltmp0: movq %rsp, %rbp Ltmp1: Leh_func_end0: This prolog adds a new Call Frame Information (CFI) row to the FDE with an address that is not within the address range of the code it describes -- part is equal to the end of the function -- and therefore results in an invalid EH frame. If we emit a nop in this situation, then the CFI row is now within the address range. llvm-svn: 108568	2010-07-16 22:51:10 +00:00
Bill Wendling	499f797cdd	Rename DBG_LABEL PROLOG_LABEL, because it's only used during prolog emission and thus is a much more meaningful name. llvm-svn: 108563	2010-07-16 22:20:36 +00:00
Jakob Stoklund Olesen	b15cbd343c	Remove remaining calls to TII::isMoveInstr. llvm-svn: 108556	2010-07-16 21:03:55 +00:00
Dan Gohman	1e936277c3	Revert r108369, sorting llvm.dbg.declare information by source position, since it doesn't work for front-ends which don't emit column information (which includes llvm-gcc in its present configuration), and doesn't work for clang for K&R style variables where the variables are declared in a different order from the parameter list. Instead, make a separate pass through the instructions to collect the llvm.dbg.declare instructions in order. This ensures that the debug information for variables is emitted in this order. llvm-svn: 108538	2010-07-16 17:54:27 +00:00
Eli Friedman	17c5a23559	Get rid of a bunch of duplicated ELF enum values. llvm-svn: 108520	2010-07-16 07:53:29 +00:00
Jakob Stoklund Olesen	37c42a3d02	Remove many calls to TII::isMoveInstr. Targets should be producing COPY anyway. TII::isMoveInstr is going tobe completely removed. llvm-svn: 108507	2010-07-16 04:45:42 +00:00
Dan Gohman	103c4ebea5	Use the source-order scheduler instead of the "fast" scheduler at -O0, because it's more likely to keep debug line information in its original order. llvm-svn: 108496	2010-07-16 02:01:19 +00:00
Dale Johannesen	bfd4fd7bb7	The SelectionDAGBuilder's handling of debug info, on rare occasions, caused code to be generated in a different order. All cases I've seen involved float softening in the type legalizer, and this could be perhaps be fixed there, but it's better not to generate things differently in the first place. 7797940 (6/29/2010..7/15/2010). llvm-svn: 108484	2010-07-16 00:02:08 +00:00
Bill Wendling	4bda1c8e68	Revert. This isn't the correct way to go. llvm-svn: 108478	2010-07-15 23:42:21 +00:00
Bill Wendling	973dc3b1d8	Handle code gen for the unreachable instruction if it's the only instruction in the function. We'll just turn it into a "trap" instruction instead. The problem with not handling this is that it might generate a prologue without the equivalent epilogue to go with it: $ cat t.ll define void @foo() { entry: unreachable } $ llc -o - t.ll -relocation-model=pic -disable-fp-elim -unwind-tables .section __TEXT,__text,regular,pure_instructions .globl _foo .align 4, 0x90 _foo: ## @foo Leh_func_begin0: ## BB#0: ## %entry pushq %rbp Ltmp0: movq %rsp, %rbp Ltmp1: Leh_func_end0: ... The unwind tables then have bad data in them causing all sorts of problems. Fixes <rdar://problem/8096481>. llvm-svn: 108473	2010-07-15 23:32:40 +00:00
Evan Cheng	55f0c6b9fc	Split -enable-finite-only-fp-math to two options: -enable-no-nans-fp-math and -enable-no-infs-fp-math. All of the current codegen fp math optimizations only care whether the fp arithmetics arguments and results can never be NaN. llvm-svn: 108465	2010-07-15 22:07:12 +00:00
Chris Lattner	60b131654b	fix the definitions of ConstTextCoalSection/ConstDataCoalSection to keep "Text" in sync with the "pure instructions" section attribute. Lack of this attribute was preventing the assembler from emitting multibyte noops instructions for templates (and inlines, and other coalesced stuff) and was causing the assembler to mismatch .o files. This fixes rdar://8018335 llvm-svn: 108461	2010-07-15 21:22:00 +00:00
Bill Wendling	2da75ef315	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 108452	2010-07-15 20:04:36 +00:00
Bill Wendling	dd5e9d8faf	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 108450	2010-07-15 20:01:02 +00:00
Bill Wendling	51a9c0a1b3	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. This time make sure to allocate enough space in the std::vector. llvm-svn: 108449	2010-07-15 19:58:14 +00:00
Bill Wendling	5a8d15c553	Reserve a goodly amount of room for the vectors. llvm-svn: 108448	2010-07-15 19:41:20 +00:00
Devang Patel	df09db62e2	Fix crash reported in PR7653. llvm-svn: 108441	2010-07-15 18:45:27 +00:00
Bill Wendling	030b0286ec	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 108440	2010-07-15 18:43:09 +00:00
Bill Wendling	57681404b0	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 108438	2010-07-15 18:40:50 +00:00
Chris Lattner	c48adb60ca	revert bill's patches in an attempt to fix the buildbot. llvm-svn: 108419	2010-07-15 06:51:46 +00:00
Bill Wendling	1f7071a3e4	Fix headers. llvm-svn: 108413	2010-07-15 06:05:18 +00:00
Bill Wendling	e7e6ca5c57	Use std::vector instead of a hard-coded array. The length of that array could get very large, but we only need it to be the size of the number of pregs. llvm-svn: 108412	2010-07-15 06:04:38 +00:00
Bill Wendling	d5b390189d	Use std::vector instead of a hard-coded array. The length of that array could get very large, but we only need it to be the size of thenumber of pregs. llvm-svn: 108411	2010-07-15 05:56:32 +00:00
Chris Lattner	28fd6785bc	a more graceful fix for test/Other/inline-asm-newline-terminator.ll, follow on to r103765 llvm-svn: 108390	2010-07-15 00:37:34 +00:00
Eric Christopher	474e56a2bf	80-col. llvm-svn: 108381	2010-07-14 23:41:32 +00:00
Dan Gohman	f10cd5c6cb	Make the order in which variables are described in debug information independent of the order that isel happens to visit the dbg_declare intrinsics. This fixes a bug in which the formal arguments were being printed in reverse order, now that fast isel is going bottom up. llvm-svn: 108369	2010-07-14 23:08:16 +00:00
Dan Gohman	c12a6731c5	Properly restore DebugLoc after leaving the local constant area. llvm-svn: 108364	2010-07-14 22:01:31 +00:00
Dan Gohman	042523340b	Delete fast-isel's trivial load optimization; it breaks debugging because it can look past points where a debugger might modify user variables. llvm-svn: 108336	2010-07-14 17:25:37 +00:00
Evan Cheng	d542414945	Teach ProcessImplicitDefs to transform more COPY instructions into IMPLICIT_DEF (and subsequently eliminate them). This allows machine LICM to hoist IMPLICIT_DEF's. PR7620. llvm-svn: 108304	2010-07-14 01:22:19 +00:00
Dan Gohman	1f471435f8	Don't propagate debug locations to instructions for materializing constants, since they may not be emited near the other instructions which get the same line, and this confuses debug info. llvm-svn: 108302	2010-07-14 01:07:44 +00:00
Jakob Stoklund Olesen	cd7a40f4ec	Print VNInfo flags. llvm-svn: 108277	2010-07-13 21:19:05 +00:00
Dale Johannesen	caca5488dc	In inline asm treat indirect 'X' constraint as 'm'. This may not be right in all cases, but it's better than asserting which it was doing before. PR 7528. llvm-svn: 108268	2010-07-13 20:17:05 +00:00
Jakob Stoklund Olesen	fc4b8b8e80	Add an assertion to make PR7542 fail consistently. LiveInterval::overlapsFrom dereferences end() if it is called on an empty interval. It would be reasonable to just return false - an empty interval doesn't overlap anything, but I want to know who is doing it first. llvm-svn: 108264	2010-07-13 19:56:28 +00:00
Jakob Stoklund Olesen	b43455feaf	Fix LiveInterval::overlaps so it doesn't claim touching intervals overlap. Also, one binary search is enough. llvm-svn: 108261	2010-07-13 19:42:20 +00:00
Jakob Stoklund Olesen	54e620d2c7	Don't add memory operands to storeRegToStackSlot / loadRegFromStackSlot results, they already have one. This fixes the himenobmtxpa miscompilation on ARM. The PostRA scheduler got confused by the double memoperand and hoisted a stack slot load above a store to the same slot. llvm-svn: 108219	2010-07-13 00:23:30 +00:00
Rafael Espindola	a18c5a0e5e	Fix a typo and fit in 80 columns. Found by Bob Wilson. llvm-svn: 108164	2010-07-12 18:11:17 +00:00
Duncan Sands	41b4a6b36a	Convert some tab stops into spaces. llvm-svn: 108130	2010-07-12 08:16:59 +00:00
Rafael Espindola	871c724773	Convert the last use of getPhysicalRegisterRegClass and remove it. AggressiveAntiDepBreaker should not be using getPhysicalRegisterRegClass. An instruction might be using a register that can only be replaced with one from a subclass of getPhysicalRegisterRegClass. With this patch we use getMinimalPhysRegClass. This is correct, but conservative. We should check the uses of the register and select the largest register class that can be used in all of them. llvm-svn: 108122	2010-07-12 02:55:34 +00:00
Rafael Espindola	01c5a15dde	Don't use getPhysicalRegisterRegClass in PBQP. The existing checks that the physical register can be allocated in the class of the virtual are sufficient. I think that the test for virtual registers is more strict than it needs to be, it should be possible to coalesce two virtual registers the class of one is a subclass of the other. llvm-svn: 108118	2010-07-12 01:45:38 +00:00
Rafael Espindola	e35d70fafa	Convert the last getPhysicalRegisterRegClass in VirtRegRewriter.cpp to getMinimalPhysRegClass. It was used to produce spills, and it is better to use the most specific class if possible. Update getLoadStoreRegOpcode to handle GR32_AD. llvm-svn: 108115	2010-07-12 00:52:33 +00:00
Chris Lattner	0b7ae20a35	change machinelicm to use MachineInstr::isSafeToMove. No intended functionality change. The avoidance of hoistiing implicitdef seems wrong though. llvm-svn: 108109	2010-07-12 00:00:35 +00:00
Jakob Stoklund Olesen	c4227f1362	Remove TargetInstrInfo::copyRegToReg entirely. Targets must now implement TargetInstrInfo::copyPhysReg instead. There is no longer a default implementation forwarding to copyRegToReg. llvm-svn: 108095	2010-07-11 17:01:17 +00:00
Rafael Espindola	d7c4963f2f	Convert uses of getPhysicalRegisterRegClass in VirtRegRewriter.cpp. The first one was used just to call isSafeToMoveRegClassDefs. In general, using a more specific reg class is better, in practice only x86 implements that method and the results are always the same. The second one is in FindFreeRegister and is used to check if a register is in a register class, a much more direct call to contains is better as it should cover more cases and is faster. llvm-svn: 108093	2010-07-11 16:45:17 +00:00
Chandler Carruth	34e0d14ff4	Remove two other uses of ATTRIBUTE_UNUSED for variables only used within assert()s, switching to void-casts. Removed an unneeded Compiler.h include as a result. There are two other uses in LLVM, but they're not due to assert()s, so I've left them alone. llvm-svn: 108088	2010-07-11 08:18:12 +00:00
Jakob Stoklund Olesen	51642aea77	Use COPY for fast-isel bitconvert, but don't create cross-class copies. This doesn't change the behavior of SelectBitcast for X86. llvm-svn: 108073	2010-07-11 05:16:54 +00:00
Rafael Espindola	a76eccf815	Fix va_arg for doubles. With this patch VAARG nodes always contain the correct alignment information, which simplifies ExpandRes_VAARG a bit. The patch introduces a new alignment information to TargetLoweringInfo. This is needed since the two natural candidates cannot be used: * The 's' in target data: If this is set to the minimal alignment of any argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for example. * The getTransientStackAlignment method. It is possible for an architecture to have argument less aligned than what we maintain the stack pointer. llvm-svn: 108072	2010-07-11 04:01:49 +00:00
Jakob Stoklund Olesen	7147ab9e78	Use COPY for extracting ImplicitDef'ed values from fast-isel instructions. This assumes that the registers can be copied which is probably a safe assumption. llvm-svn: 108070	2010-07-11 03:31:05 +00:00
Jakob Stoklund Olesen	3bb1267431	Use COPY in FastISel everywhere it is safe and trivial. The remaining copyRegToReg calls actually check the return value (shock!), so we cannot trivially replace them with COPY instructions. llvm-svn: 108069	2010-07-11 03:31:00 +00:00
Jakob Stoklund Olesen	0c76d6ec21	Replace copyRegToReg with COPY everywhere in lib/CodeGen except for FastISel. llvm-svn: 108062	2010-07-10 22:42:59 +00:00
Jakob Stoklund Olesen	ad89613b65	Only collect subreg extracting copies for later coalescing. This also avoids fatal copies from physregs. llvm-svn: 108061	2010-07-10 22:42:53 +00:00
Dan Gohman	a64a323564	Fix a bug in the code which re-inserts DBG_VALUE nodes after scheduling; if a block is split (by a custom inserter), the insert point may be in a different block than it was originally. This fixes 32-bit llvm-gcc bootstrap builds, and I haven't been able to reproduce it otherwise. llvm-svn: 108060	2010-07-10 22:42:31 +00:00
Jakob Stoklund Olesen	e50d30d586	Emit COPY instructions instead of using copyRegToReg in InstrEmitter, ScheduleDAGEmit, TwoAddressLowering, and PHIElimination. This switches the bulk of register copies to using COPY, but many less used copyRegToReg calls remain. llvm-svn: 108050	2010-07-10 19:08:25 +00:00
Dan Gohman	fbdba81550	Insert IMPLICIT_DEF instructions at the current insert position, not at the end of the block. llvm-svn: 108045	2010-07-10 13:55:45 +00:00
Dan Gohman	d7b5ce3312	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Devang Patel	57e72370ae	Update DBG_VALUE to refer appropriate stack slot in case of a spill. llvm-svn: 108023	2010-07-09 21:48:31 +00:00
Jakob Stoklund Olesen	b5c899d11b	Fix small bug in isMoveInstr -> COPY translation llvm-svn: 108013	2010-07-09 20:55:49 +00:00
Jakob Stoklund Olesen	7a7b55eb67	Automatically fold COPY instructions into stack load/store. llvm-svn: 108012	2010-07-09 20:43:13 +00:00
Jakob Stoklund Olesen	e9fdcaa68a	Remat uncoalescable COPY instrs llvm-svn: 108010	2010-07-09 20:43:05 +00:00
Bill Wendling	f831d86311	Clarify what mysterious check means. llvm-svn: 108005	2010-07-09 19:44:12 +00:00
Dan Gohman	7929c448fc	Fix MachineLICM to actually visit inner loops. llvm-svn: 108001	2010-07-09 18:49:45 +00:00
Jakob Stoklund Olesen	bd953d1805	Change TII::foldMemoryOperand API to require the machine instruction to be inserted in a MBB, and return an already inserted MI. This target API change is necessary to allow foldMemoryOperand to call storeToStackSlot and loadFromStackSlot when folding a COPY to a stack slot reference in a target independent way. The foldMemoryOperandImpl hook is going to change in the same way, but I'll wait until COPY folding is actually implemented. Most targets only fold copies and won't need to specialize this hook at all. llvm-svn: 107991	2010-07-09 17:29:08 +00:00
Bob Wilson	6586e9b203	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Gabor Greif	52617fc462	cache result of operator* llvm-svn: 107980	2010-07-09 16:08:33 +00:00
Jakob Stoklund Olesen	d4d9e53b20	Avoid creating %physreg:subidx operands in SimpleRegisterCoalescing::RemoveCopyByCommutingDef. This fixes PR7602. llvm-svn: 107957	2010-07-09 05:56:21 +00:00
Jakob Stoklund Olesen	cac54d6435	Deal with a few remaining spots that assume physical registers have live intervals. This fixes PR7601. llvm-svn: 107955	2010-07-09 04:35:38 +00:00
Jakob Stoklund Olesen	66b3649030	Fix broken isCopy handling in TrimLiveIntervalToLastUse. llvm-svn: 107950	2010-07-09 01:27:21 +00:00
Jakob Stoklund Olesen	5165fa1c39	Handle COPY in VirtRegRewriter. llvm-svn: 107949	2010-07-09 01:27:19 +00:00
Dan Gohman	0b5aa1cdd3	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Bob Wilson	21eed476e8	Reenable DAG combining for vector shuffles. It looks like it was temporarily disabled and then never turned back on again. Adjust some tests, one because this change avoids an unnecessary instruction, and the other to make it continue testing what it was intended to test. llvm-svn: 107941	2010-07-09 00:38:12 +00:00
Stuart Hastings	d08fb75aaa	Reverting r107918 and r107919. Radar 8063111. llvm-svn: 107930	2010-07-08 23:25:39 +00:00
Jakob Stoklund Olesen	823e90e12a	Revert "Fix broken isCopy handling in TrimLiveIntervalToLastUse" This reverts commit 107921. It broke the clang self host. llvm-svn: 107926	2010-07-08 22:52:47 +00:00
Devang Patel	4c6bd6612f	Relax assertion. In optimized code, it is possible that first instruction is coming from a inlined function. This fixes PR7596 . llvm-svn: 107923	2010-07-08 22:39:20 +00:00
Bill Wendling	a992445ff2	Extension of r107506. Make sure that we don't mark a function as having a call if the inline ASM doesn't need a stack frame. llvm-svn: 107922	2010-07-08 22:38:02 +00:00
Jakob Stoklund Olesen	75c465585a	Fix broken isCopy handling in TrimLiveIntervalToLastUse llvm-svn: 107921	2010-07-08 22:30:38 +00:00
Stuart Hastings	43d226deea	Fix decl/def debug info for template functions. Radar 8063111. llvm-svn: 107919	2010-07-08 22:28:59 +00:00
Devang Patel	9c160e1213	Reuse DIEInteger for 1. This is frequently used while emitting an attribute using dwarf::DW_FORM_flag form. llvm-svn: 107903	2010-07-08 20:10:35 +00:00
Jim Grosbach	c280fc7514	Clean up scavengeRegister() a bit to prefer available regs, which allows the simplification of frame index register scavenging to not have to check for available registers directly and instead just let scavengeRegister() handle it. llvm-svn: 107880	2010-07-08 16:49:26 +00:00
Jakob Stoklund Olesen	00264624a9	Convert EXTRACT_SUBREG to COPY when emitting machine instrs. EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead. Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg(). The isMoveInstr hook will be removed later. llvm-svn: 107879	2010-07-08 16:40:22 +00:00
Jakob Stoklund Olesen	a1e883dcf6	Remove references to INSERT_SUBREG after de-SSA. Fix X86InstrInfo::convertToThreeAddressWithLEA to generate COPY instead of INSERT_SUBREG. llvm-svn: 107878	2010-07-08 16:40:15 +00:00
Benjamin Kramer	0ae3f08c0d	Merge the duplicated iabs optimization in DAGCombiner and let it detected a few more idioms. llvm-svn: 107868	2010-07-08 12:09:56 +00:00
Jakob Stoklund Olesen	89a4e25007	Add TargetInstrInfo::copyPhysReg hook and use it from LowerSubregs. This target hook is intended to replace copyRegToReg entirely, but for now it calls copyRegToReg. Any remaining calls to copyRegToReg wil be replaced by COPY instructions. llvm-svn: 107854	2010-07-08 05:01:41 +00:00
Dan Gohman	e75704369d	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Jim Grosbach	6533f24370	When processing frame index virtual registers, consider all available registers (if there are any) and use the one which remains available for the longest rather than just using the first one. This should help enable better re-use of the loaded frame index values. rdar://7318760 llvm-svn: 107847	2010-07-08 00:38:54 +00:00
Dan Gohman	eb9164dc50	Don't forward-declare registers for static allocas, which we'll prefer to materialize as local constants. This fixes the clang bootstrap abort. llvm-svn: 107840	2010-07-07 23:52:58 +00:00
Dan Gohman	1adc499dda	Fix -fast-isel-abort to check the right instruction. llvm-svn: 107839	2010-07-07 23:47:25 +00:00
Devang Patel	a37a95ea2f	One MDNode may be used to create regular DIE as well as abstract DIE. Keep track of abstract subprogram DIEs. llvm-svn: 107822	2010-07-07 22:20:57 +00:00
Evan Cheng	1c349f18f8	Move getExtLoad() and (some) getLoad() DebugLoc argument after EVT argument for consistency sake. llvm-svn: 107820	2010-07-07 22:15:37 +00:00
Dan Gohman	25d5c1b4f8	Not all custom inserters create new basic blocks. If the inserter didn't create a new block, don't reset the insert position. llvm-svn: 107813	2010-07-07 21:18:22 +00:00
Devang Patel	9a0339fc1f	Rename couple of maps. llvm-svn: 107810	2010-07-07 20:49:57 +00:00
Devang Patel	30265c4f8b	80 cols. llvm-svn: 107807	2010-07-07 20:12:52 +00:00
Dan Gohman	e7ccc51cc1	Implement bottom-up fast-isel. This has the advantage of not requiring a separate DCE pass over MachineInstrs. llvm-svn: 107804	2010-07-07 19:20:32 +00:00
Dan Gohman	2d4d01d0de	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Dan Gohman	b792f844ad	Update the insert position after scheduling, which may change the position when emitting multiple blocks when executing a custom inserter. llvm-svn: 107797	2010-07-07 18:22:13 +00:00
Devang Patel	637ee5f149	Update comment. llvm-svn: 107796	2010-07-07 18:18:18 +00:00
Dan Gohman	769201448d	Fix debugging strings. llvm-svn: 107795	2010-07-07 17:28:45 +00:00
Dan Gohman	ffe64b1ee5	Give FunctionLoweringInfo an MBB member, avoiding the need to pass it around everywhere, and also give it an InsertPt member, to enable isel to operate at an arbitrary position within a block, rather than just appending to a block. llvm-svn: 107791	2010-07-07 16:47:08 +00:00
Dan Gohman	87fb4e8fcd	Simplify FastISel's constructor by giving it a FunctionLoweringInfo instance, rather than pointers to all of FunctionLoweringInfo's members. This eliminates an NDEBUG ABI sensitivity. llvm-svn: 107789	2010-07-07 16:29:44 +00:00
Dan Gohman	e784616fbb	Move FunctionLoweringInfo.h out into include/llvm/CodeGen. This will allow target-specific fast-isel code to make use of it directly. llvm-svn: 107787	2010-07-07 16:01:37 +00:00
Dan Gohman	fe7532a308	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Dan Gohman	498e5f899d	Move CallingConvLower.cpp out of the SelectionDAG directory. llvm-svn: 107781	2010-07-07 15:15:27 +00:00
Jakob Stoklund Olesen	8e1338eea8	Fix more places assuming subregisters have live intervals llvm-svn: 107780	2010-07-07 14:41:22 +00:00
Dan Gohman	88c547ede9	Add a getFirstNonPHI utility function. llvm-svn: 107778	2010-07-07 14:33:51 +00:00
Jakob Stoklund Olesen	f0e551d4f4	Revert "Remove references to INSERT_SUBREG after de-SSA" r107725. Buildbot breakage. llvm-svn: 107744	2010-07-07 00:32:25 +00:00
Jim Grosbach	dc0a0659be	By default, the eh.sjlj.setjmp/longjmp intrinsics should just do nothing rather than assuming a target will custom lower them. Targets which do so should exlicitly mark them as having custom lowerings. PR7454. llvm-svn: 107734	2010-07-06 23:44:52 +00:00
Jakob Stoklund Olesen	e2d3067f6b	Remove references to INSERT_SUBREG after de-SSA llvm-svn: 107732	2010-07-06 23:40:35 +00:00
Jakob Stoklund Olesen	70ee3ecd33	Convert INSERT_SUBREG to COPY in TwoAddressInstructionPass. INSERT_SUBREG will now only appear in SSA machine instructions. Fix the handling of partial redefs in ProcessImplicitDefs. This is now relevant since partial redef COPY instructions appear. llvm-svn: 107726	2010-07-06 23:26:25 +00:00
Dan Gohman	ee0cb70381	CanLowerReturn doesn't need a SelectionDAG; it just needs an LLVMContext. SelectBasicBlock doesn't needs its BasicBlock argument. llvm-svn: 107712	2010-07-06 22:19:37 +00:00
Devang Patel	a3ca21b228	Propagate debug loc. llvm-svn: 107710	2010-07-06 22:08:15 +00:00
Jakob Stoklund Olesen	15fed3bd30	One more case assuming that subregs have live ranges. llvm-svn: 107700	2010-07-06 21:13:03 +00:00
Jakob Stoklund Olesen	bcf3409107	Fix buildbot breakage where a def is missing. llvm-svn: 107698	2010-07-06 21:06:39 +00:00
Jakob Stoklund Olesen	a64c0a3d22	Be more forgiving when calculating alias interference for physreg coalescing. It is OK for an alias live range to overlap if there is a copy to or from the physical register. CoalescerPair can work out if the copy is coalescable independently of the alias. This means that we can join with the actual destination interval instead of using the getOrigDstReg() hack. It is no longer necessary to merge clobber ranges into subregisters. llvm-svn: 107695	2010-07-06 20:31:51 +00:00
Dan Gohman	3439629239	Reapply r107655 with fixes; insert the pseudo instruction into the block before calling the expansion hook. And don't put EFLAGS in a mbb's live-in list twice. llvm-svn: 107691	2010-07-06 20:24:04 +00:00
Eric Christopher	dfc8b745a2	Fix to 80-col. llvm-svn: 107684	2010-07-06 18:35:20 +00:00
Chris Lattner	dde2ba0b60	tighten up this code. llvm-svn: 107670	2010-07-06 15:59:27 +00:00
Dan Gohman	f4f04107ef	Revert r107655. llvm-svn: 107668	2010-07-06 15:49:48 +00:00
Dan Gohman	4e49b59dad	Add versions of OutputArgReg, AnalyzeReturn, and AnalyzeCallOperands which do not depend on SelectionDAG. llvm-svn: 107666	2010-07-06 15:39:54 +00:00
Anton Korobeynikov	e415230477	Fix a major regression on COFF targets introduced by r103267: 'discardable' section means that it is used only during the program load and can be discarded afterwards. This way only debug sections can be discarded, but not the opposite. Seems like the copy-and-pasto from ELF code, since there it contains the reverse flag ('alloc'). llvm-svn: 107658	2010-07-06 15:24:56 +00:00
Dan Gohman	12205645a6	Fix a bunch of custom-inserter functions to handle the case where the pseudo instruction is not at the end of the block. llvm-svn: 107655	2010-07-06 15:18:19 +00:00
Eric Christopher	2ad0c779c3	Fix up -fstack-protector on linux to use the segment registers. Split out testcases per architecture and os now. Patch from Nelson Elhage. llvm-svn: 107640	2010-07-06 05:18:56 +00:00
Chris Lattner	c4a7073db3	more tidying. llvm-svn: 107615	2010-07-05 05:53:14 +00:00
Chris Lattner	2c0315a0f3	random tidying llvm-svn: 107612	2010-07-05 05:36:21 +00:00
Jakob Stoklund Olesen	ac0a210789	Print symbolic subreg indices on REG_SEQUENCE and INSERT_SUBREG. llvm-svn: 107602	2010-07-04 23:24:23 +00:00
Evan Cheng	f3aeb2c22c	Infer alignments of fixed frame objects when they are constructed. This ensures remat'ed loads from fixed slots have the right alignments. llvm-svn: 107591	2010-07-04 18:52:05 +00:00
Bill Wendling	f844642350	Proper indentation. llvm-svn: 107581	2010-07-04 08:58:43 +00:00
Eric Christopher	128a0197bb	Fix typo. llvm-svn: 107556	2010-07-03 01:09:18 +00:00
Evan Cheng	0664a67fe1	Remove isSS argument from CreateFixedObject. Fixed objects cannot be spill slots so it's always false. llvm-svn: 107550	2010-07-03 00:40:23 +00:00
Jakob Stoklund Olesen	4c82a9e7d0	Detect and handle COPY in many places. This code is transitional, it will soon be possible to eliminate isExtractSubreg, isInsertSubreg, and isMoveInstr in most places. llvm-svn: 107547	2010-07-03 00:04:37 +00:00
Eric Christopher	5e5416056b	80-col fixup. llvm-svn: 107537	2010-07-02 23:17:38 +00:00
Jakob Stoklund Olesen	676a15bdf5	Add a new target independent COPY instruction and code to lower it. The COPY instruction is intended to replace the target specific copy instructions for virtual registers as well as the EXTRACT_SUBREG and INSERT_SUBREG instructions in MachineFunctions. It won't we used in a selection DAG. COPY is lowered to native register copies by LowerSubregs. llvm-svn: 107529	2010-07-02 22:29:50 +00:00
Jim Grosbach	3c43248560	Custom inserters (e.g., conditional moves in Thumb1 can introduce new basic blocks, and if used as a function argument, that can cause call frame setup / destroy pairs to be split across a basic block boundary. That prevents us from doing a simple assertion to check that the pairs match and alloc/ dealloc the same amount of space. Modify the assertion to only check the amount allocated when there are matching pairs in the same basic block. rdar://8022442 llvm-svn: 107517	2010-07-02 21:23:37 +00:00
Evan Cheng	0ce84486c3	- Two-address pass should not assume unfolding is always successful. - X86 unfolding should check if the instructions being unfolded has memoperands. If there is no memoperands, then it must assume conservative alignment. If this would introduce an expensive sse unaligned load / store, then unfoldMemoryOperand etc. should not unfold the instruction. llvm-svn: 107509	2010-07-02 20:36:18 +00:00
Dale Johannesen	4d887f7ca7	Propagate the AlignStack bit in InlineAsm's to the PrologEpilog code, and use it to determine whether the asm forces stack alignment or not. gcc consistently does not do this for GCC-style asms; Apple gcc inconsistently sometimes does it for asm blocks. There is no convenient place to put a bit in either the SDNode or the MachineInstr form, so I've added an extra operand to each; unlovely, but it does allow for expansion for more bits, should we need it. PR 5125. Some existing testcases are affected. The operand lists of the SDNode and MachineInstr forms are indexed with awesome mnemonics, like "2"; I may fix this someday, but not now. I'm not making it any worse. If anyone is inspired I think you can find all the right places from this patch. llvm-svn: 107506	2010-07-02 20:16:09 +00:00
Jakob Stoklund Olesen	df8429aeb4	Remove invalid assert llvm-svn: 107505	2010-07-02 19:54:47 +00:00
Jakob Stoklund Olesen	cf6c5c960f	Properly handle debug values during inline spilling. llvm-svn: 107503	2010-07-02 19:54:40 +00:00
Jakob Stoklund Olesen	96037187e5	Rematerialize as much as possible before inserting spills and reloads. This allows us to recognize the common case where all uses could be rematerialized, and no stack slot allocation is necessary. If some values could be fully rematerialized, remove them from the live range before allocating a stack slot for the rest. llvm-svn: 107492	2010-07-02 17:44:57 +00:00
Jim Grosbach	9b7755fbc6	80-column and trailing whitespace cleanup. llvm-svn: 107490	2010-07-02 17:41:59 +00:00
Jim Grosbach	64a4f3f062	grammar tweaks llvm-svn: 107489	2010-07-02 17:38:34 +00:00
Dan Gohman	93f5920914	Rename CreateReg to CreateRegs, and MakeReg to CreateReg. llvm-svn: 107451	2010-07-02 00:10:16 +00:00
Bill Wendling	504055ce9e	Make the "linker_private" linkage type emit a non-weak symbol to the file. It will still be stripped by the linker when it generates the final image. llvm-svn: 107440	2010-07-01 22:38:24 +00:00
Bill Wendling	03bcd6ecc8	Implement the "linker_private_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. However, this linkage isn't specific to Objective-C. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". Currently only supported on Darwin platforms. llvm-svn: 107433	2010-07-01 21:55:59 +00:00
Devang Patel	429397529a	Do not require line number entry for undefined local variable. This is a regression caused by r106792 and caught by gdb testsuite. llvm-svn: 107430	2010-07-01 21:38:08 +00:00
Daniel Dunbar	02877d6e85	MC: Pass the target instance to the AsmParser constructor. llvm-svn: 107426	2010-07-01 20:41:56 +00:00
Daniel Dunbar	329d202362	MC: Move COFF enumeration constants to llvm/Support/COFF.h, patch by Michael Spencer! llvm-svn: 107418	2010-07-01 20:07:24 +00:00
Dan Gohman	d2965c10a1	Temporarily disable on-demand fast-isel. llvm-svn: 107393	2010-07-01 12:15:30 +00:00
Dan Gohman	42b7ee15f5	Use FuncInfo's isExportedInst accessor method instead of doing the work manually. llvm-svn: 107384	2010-07-01 03:57:05 +00:00
Dan Gohman	85e02e9340	Rename CreateRegForValue to CreateReg, and change its argument from a Value to a Type, because it doesn't actually care about the Value. llvm-svn: 107383	2010-07-01 03:55:39 +00:00
Dan Gohman	4d29fd85f9	Fast isel no longer needs DeadMachineInstrElim to clean up after it. llvm-svn: 107381	2010-07-01 03:49:59 +00:00
Dan Gohman	aef3d140b7	Teach fast-isel to avoid loading a value from memory when it's already available in a register. This is pretty primitive, but it reduces the number of instructions in common testcases by 4%. llvm-svn: 107380	2010-07-01 03:49:38 +00:00
Dan Gohman	722f5fc567	Enable on-demand fast-isel. llvm-svn: 107377	2010-07-01 02:58:57 +00:00
Dan Gohman	d432223163	Reapply r106422, splitting the code for materializing a value out of SelectionDAGBuilder::getValue into a helper function, with fixes to use DenseMaps safely. llvm-svn: 107371	2010-07-01 01:59:43 +00:00
Dan Gohman	9576645a84	Don't use operator[] here, because it's not desirable to insert a default value if the search fails. llvm-svn: 107368	2010-07-01 01:33:21 +00:00
Mikhail Glushenkov	4721ad855e	Trailing whitespace. llvm-svn: 107360	2010-07-01 01:00:22 +00:00
Jakob Stoklund Olesen	8656a4549a	Add memory operand folding support to InlineSpiller. llvm-svn: 107355	2010-07-01 00:13:04 +00:00
Jakob Stoklund Olesen	bde96ad23e	Add support for rematerialization to InlineSpiller. llvm-svn: 107351	2010-06-30 23:03:52 +00:00
Bill Wendling	e0dfb98ea0	Use the catch-all selectors we already found when converting them to use the correct catch-all value. This saves having to iterate through all of the selectors in the program again. llvm-svn: 107345	2010-06-30 22:49:53 +00:00
Jim Grosbach	e8c97a7cd7	Handle array and vector typed parameters in sjljehprepare like we do structs. rdar://8145832 llvm-svn: 107332	2010-06-30 22:20:38 +00:00
Jim Grosbach	caf9b3ab7d	grammar tweak in comment. llvm-svn: 107321	2010-06-30 21:27:56 +00:00
Jakob Stoklund Olesen	59e1cae377	Some fool committed without testing (or even building) first. llvm-svn: 107307	2010-06-30 18:41:20 +00:00
Jakob Stoklund Olesen	c39d3497c8	Remember to track spill slot uses in VirtRegMap when inserting loads and stores. LocalRewriter::runOnMachineFunction uses this information to mark dead spill slots. This means that InlineSpiller now also works for functions that spill. llvm-svn: 107302	2010-06-30 18:19:08 +00:00
Duncan Sands	945a347478	Remove an unused variable. The call to getRoot has side-effects, so this could break something (but doesn't seem to). llvm-svn: 107295	2010-06-30 17:22:28 +00:00
Gabor Greif	647d9c9797	use ArgOperand API llvm-svn: 107282	2010-06-30 13:45:50 +00:00
Gabor Greif	f69acfe133	use ArgOperand API llvm-svn: 107279	2010-06-30 12:55:46 +00:00
Gabor Greif	3390e746fa	use CallSite::arg_end instead of CallInst::op_end llvm-svn: 107276	2010-06-30 12:39:23 +00:00
John Mosby	5364655e02	Remove trailing whitespace, no functionality changes. llvm-svn: 107244	2010-06-30 03:40:54 +00:00
Devang Patel	c5b3109bec	Do not construct DIE for already processed MDNode. llvm-svn: 107237	2010-06-30 01:40:11 +00:00
Jakob Stoklund Olesen	b3b89c3bc0	Use skipInstruction() as a simpler way of iterating over instructions using SrcReg llvm-svn: 107234	2010-06-30 00:30:36 +00:00
Jakob Stoklund Olesen	08baf59da1	Use clEnumValN macro to work around keyword clash llvm-svn: 107233	2010-06-30 00:24:51 +00:00
Devang Patel	648df7bf64	Add variables into a scope before constructing scope DIE otherwise variables won't be included DIE tree. llvm-svn: 107228	2010-06-30 00:11:08 +00:00
Jakob Stoklund Olesen	f888911932	Begin implementation of an inline spiller. InlineSpiller inserts loads and spills immediately instead of deferring to VirtRegMap. This is possible now because SlotIndexes allows instructions to be inserted and renumbered. This is work in progress, and is mostly a copy of TrivialSpiller so far. It works very well for functions that don't require spilling. llvm-svn: 107227	2010-06-29 23:58:39 +00:00
Bill Wendling	3632171750	Revert r107205 and r107207. llvm-svn: 107215	2010-06-29 22:34:52 +00:00
Devang Patel	be30551600	Print InlinedAt location. llvm-svn: 107214	2010-06-29 22:29:15 +00:00
Devang Patel	c728518bfe	Print InlinedAt location. llvm-svn: 107208	2010-06-29 21:51:32 +00:00
Bill Wendling	1767723dbe	Introducing the "linker_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". llvm-svn: 107205	2010-06-29 21:24:00 +00:00
Devang Patel	24bc1b5b2f	Do not hardcode DW_AT_stmt_list value. Inspired by Artur Pietrek. llvm-svn: 107202	2010-06-29 20:17:53 +00:00
Jakob Stoklund Olesen	dadea5b178	Fix the handling of partial redefines in the fast register allocator. A partial redefine needs to be treated like a tied operand, and the register must be reloaded while processing use operands. This fixes a bug where partially redefined registers were processed as normal defs with a reload added. The reload could clobber another use operand if it was a kill that allowed register reuse. llvm-svn: 107193	2010-06-29 19:15:30 +00:00
Bob Wilson	d91d5bfc95	Fix a register scavenger crash when dealing with undefined subregs. The LowerSubregs pass needs to preserve implicit def operands attached to EXTRACT_SUBREG instructions when it replaces those instructions with copies. llvm-svn: 107189	2010-06-29 18:42:49 +00:00
Duncan Sands	83d1dd637a	It seems clear that this should return Changed. llvm-svn: 107141	2010-06-29 14:49:35 +00:00
Rafael Espindola	38a7d7cbc3	Add a VT argument to getMinimalPhysRegClass and replace the copy related uses of getPhysicalRegisterRegClass with it. If we want to make a copy (or estimate its cost), it is better to use the smallest class as more efficient operations might be possible. llvm-svn: 107140	2010-06-29 14:02:34 +00:00
Duncan Sands	d34bb4e9b0	getMachineBasicBlockAddress returns a uintptr_t - don't truncate to unsigned only to extend back to a pointer sized value on the next line. llvm-svn: 107139	2010-06-29 13:34:20 +00:00
Gabor Greif	e73d64c2cf	use ArgOperand APIs llvm-svn: 107132	2010-06-29 13:03:46 +00:00
Duncan Sands	6d28e73acc	Remove initialized but otherwise unused variables. llvm-svn: 107127	2010-06-29 11:22:26 +00:00
Jim Grosbach	907673c48d	When processing loops for scheduling latencies (used for live outs on loop back-edges), make sure not to include dbg_value instructions in the count. Closing in on the end of rdar://7797940 llvm-svn: 107119	2010-06-29 04:48:13 +00:00
Bob Wilson	1e5da550e5	Reapply my if-conversion cleanup from svn r106939 with fixes. There are 2 changes relative to the previous version of the patch: 1) For the "simple" if-conversion case, there's no need to worry about RemoveExtraEdges not handling an unanalyzable branch. Predicated terminators are ignored in this context, so RemoveExtraEdges does the right thing. This might break someday if we ever treat indirect branches (BRIND) as predicable, but for now, I just removed this part of the patch, because in the case where we do not add an unconditional branch, we rely on keeping the fall-through edge to CvtBBI (which is empty after this transformation). The change relative to the previous patch is: @@ -1036,10 +1036,6 @@ IterIfcvt = false; } - // RemoveExtraEdges won't work if the block has an unanalyzable branch, - // which is typically the case for IfConvertSimple, so explicitly remove - // CvtBBI as a successor. - BBI.BB->removeSuccessor(CvtBBI->BB); RemoveExtraEdges(BBI); // Update block info. BB can be iteratively if-converted. 2) My patch exposed a bug in the code for merging the tail of a "diamond", which had previously never been exercised. The code was simply checking that the tail had a single predecessor, but there was a case in MultiSource/Benchmarks/VersaBench/dbms where that single predecessor was neither edge of the diamond. I added the following change to check for that: @@ -1276,7 +1276,18 @@ // tail, add a unconditional branch to it. if (TailBB) { BBInfo TailBBI = BBAnalysis[TailBB->getNumber()]; - if (TailBB->pred_size() == 1 && !TailBBI.HasFallThrough) { + bool CanMergeTail = !TailBBI.HasFallThrough; + // There may still be a fall-through edge from BBI1 or BBI2 to TailBB; + // check if there are any other predecessors besides those. + unsigned NumPreds = TailBB->pred_size(); + if (NumPreds > 1) + CanMergeTail = false; + else if (NumPreds == 1 && CanMergeTail) { + MachineBasicBlock::pred_iterator PI = TailBB->pred_begin(); + if (PI != BBI1->BB && PI != BBI2->BB) + CanMergeTail = false; + } + if (CanMergeTail) { MergeBlocks(BBI, TailBBI); TailBBI.IsDone = true; } else { With these fixes, I was able to run all the SingleSource and MultiSource tests successfully. llvm-svn: 107110	2010-06-29 00:55:23 +00:00
Bob Wilson	269a89fd3a	Unlike other targets, ARM now uses BUILD_VECTORs post-legalization so they can't be changed arbitrarily by the DAGCombiner without checking if it is running after legalization. llvm-svn: 107097	2010-06-28 23:40:25 +00:00
Devang Patel	1de21ec498	Use DW_FORM_addr for DW_AT_entry_pc. llvm-svn: 107085	2010-06-28 22:22:47 +00:00
Dale Johannesen	17feb07c53	In asm's, output operands with matching input constraints have to be registers, per gcc documentation. This affects the logic for determining what "g" should lower to. PR 7393. A couple of existing testcases are affected. llvm-svn: 107079	2010-06-28 22:09:45 +00:00
Devang Patel	d10b2af260	Include inlined function in list of processed subprograms. llvm-svn: 107065	2010-06-28 20:53:04 +00:00
Jim Grosbach	ee6e29aa72	new, no longer brain-dead, r106907 llvm-svn: 107060	2010-06-28 20:26:00 +00:00
Jakob Stoklund Olesen	ffd628ec0a	After physreg coalescing, physical registers might not have live ranges where you would expect. Don't assert on that case, just give up. This fixes PR7513. llvm-svn: 107046	2010-06-28 19:39:57 +00:00
Jakob Stoklund Olesen	0d94d7af78	Add more special treatment for inline asm in RegAllocFast. When an instruction has tied operands and physreg defines, we must take extra care that the tied operands conflict with neither physreg defs nor uses. The special treatment is given to inline asm and instructions with tied operands / early clobbers and physreg defines. This fixes PR7509. llvm-svn: 107043	2010-06-28 18:34:34 +00:00
Devang Patel	f3b2db68c6	Preserve deleted function's local variables' debug info. Radar 8122864. llvm-svn: 107027	2010-06-28 18:25:03 +00:00
Gabor Greif	cd09869dfc	simplify: we have solid argument iterator range llvm-svn: 107014	2010-06-28 16:40:52 +00:00
Daniel Dunbar	b8c058cbb0	Revert r106907, "make sure to handle dbg_value instructions in the middle of the block, not...", it caused a bunch of nightly test regressions. llvm-svn: 107009	2010-06-28 15:47:17 +00:00
Devang Patel	fb6f22f010	Remove dead code. llvm-svn: 106990	2010-06-28 05:59:13 +00:00
Rafael Espindola	2041abd958	When splitting a VAARG, remember its alignment. This produces terrible but correct code. llvm-svn: 106952	2010-06-26 18:22:20 +00:00
Bob Wilson	418e64a385	Revert my if-conversion cleanup since it caused a bunch of nightly test regressions. --- Reverse-merging r106939 into '.': U test/CodeGen/Thumb2/thumb2-ifcvt3.ll U lib/CodeGen/IfConversion.cpp llvm-svn: 106951	2010-06-26 17:47:06 +00:00
Benjamin Kramer	a000002428	VNInfos don't need to be destructed anymore. llvm-svn: 106943	2010-06-26 11:30:59 +00:00
Bob Wilson	c72da6bb56	Clean up some problems with extra CFG edges being introduced during if-conversion. The RemoveExtraEdges function doesn't work for blocks that end with unanalyzable branches, so in those cases, the "extra" edges must be explicitly removed. The CopyAndPredicateBlock and MergeBlocks methods can also avoid copying successor edges due to branches that have already been removed. The latter case is especially helpful when MergeBlocks is called for handling "diamond" if-conversions, where otherwise you can end up with some weird intermediate states in the CFG. Unfortunately I've been unable to find cases where this cleanup actually makes a significant difference in the code. There is one test where we manage to remove an empty block at the end of a function. Radar 6911268. llvm-svn: 106939	2010-06-26 04:27:33 +00:00
Jim Grosbach	c34befc78f	make sure to handle dbg_value instructions in the middle of the block, not just at the head, when doing diamond if-conversion. rdar://7797940 llvm-svn: 106907	2010-06-25 23:05:46 +00:00
Jakob Stoklund Olesen	55d738e2e1	Don't track kills in VNInfo. Use interval ends instead. The VNInfo.kills vector was almost unused except for all the code keeping it updated. The few places using it were easily rewritten to check for interval ends instead. The two new methods LiveInterval::killedAt and killedInRange are replacements. This brings us down to 3 independent data structures tracking kills. llvm-svn: 106905	2010-06-25 22:53:05 +00:00
Evan Cheng	02b184de5b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Devang Patel	5c0f85c7dd	Collect debug info for optimized variables of inlined functions. llvm-svn: 106895	2010-06-25 22:07:34 +00:00
Jim Grosbach	8a6deefec6	80 column and typo fix llvm-svn: 106894	2010-06-25 22:02:28 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Bill Wendling	e41e40f689	- Reapply r106066 now that the bzip2 build regression has been fixed. - 2010-06-25-CoalescerSubRegDefDead.ll is the testcase for r106878. llvm-svn: 106880	2010-06-25 20:48:10 +00:00
Bill Wendling	ef7acd9a24	We should remove the live range from the destination register only if all defs are dead, not just the def of this register. I.e., a register could be dead, but it's subreg isn't. Testcase to follow with a subsequent patch. llvm-svn: 106878	2010-06-25 20:42:55 +00:00
Dale Johannesen	2ac3b9cbd4	Cosmetic. llvm-svn: 106865	2010-06-25 17:41:07 +00:00
Duncan Sands	2dc70bea54	Remove variables which are assigned to but for which the value is not used. Spotted by gcc-4.6. llvm-svn: 106854	2010-06-25 14:48:39 +00:00
Gabor Greif	b890fc8023	use ArgOperand accessors and CallInst for getting hold of the intrinsic's arguments simplify along the way (at least for me this is much more legible now) Bill, Baldrick or Anton, please review\! llvm-svn: 106838	2010-06-25 11:25:30 +00:00
Gabor Greif	7dd3afdff3	use ArgOperand API (the simple part) llvm-svn: 106837	2010-06-25 09:44:37 +00:00
Gabor Greif	eba0be7dc9	use ArgOperand API llvm-svn: 106836	2010-06-25 09:38:13 +00:00
Gabor Greif	41b81ee2fb	use ArgOperand API llvm-svn: 106835	2010-06-25 09:36:23 +00:00
Gabor Greif	ed9ae7bf21	use ArgOperand API and CallSite to access arguments of CallInst llvm-svn: 106833	2010-06-25 09:03:52 +00:00
Gabor Greif	b5874dea6e	use ArgOperand API and CallSite to access arguments of CallInst llvm-svn: 106829	2010-06-25 08:48:19 +00:00
Gabor Greif	e4eed709d4	use ArgOperand API llvm-svn: 106828	2010-06-25 08:24:59 +00:00
Gabor Greif	f6207e0a80	prune an include llvm-svn: 106827	2010-06-25 08:16:50 +00:00
Dale Johannesen	e9eaaa91d8	Fix a case where an earlyclobber operand of an asm is reused as an input. PR 4118. Testcase is too big, as usual with bugs in this area, but there's one in the PR. llvm-svn: 106816	2010-06-25 00:49:43 +00:00
Jakob Stoklund Olesen	889ab7d158	Make sure all eliminated kills are removed from VNInfo lists. This fixes PR7479 and PR7485. The test cases from those PRs are big, so not included. However, PR7485 comes from self hosting on FreeBSD, so we will surely hear about any regression. llvm-svn: 106811	2010-06-24 23:57:35 +00:00
Dan Gohman	5f0bf64c0c	Add some comments. llvm-svn: 106809	2010-06-24 23:41:59 +00:00
Dan Gohman	9a2f0473b2	Teach EmitLiveInCopies to omit copies for unused virtual registers, and to clean up unused incoming physregs from the live-in list. llvm-svn: 106805	2010-06-24 22:23:02 +00:00
Bill Wendling	2d3c490026	It's possible that a flag is added to the SDNode that points back to the original SDNode. This is badness. Also, this function allows one SDNode to point multiple flags to another SDNode. Badness as well. llvm-svn: 106793	2010-06-24 22:00:37 +00:00
Devang Patel	c657c621b7	DBG_VALUE machine instruction pointing to undefined register for a variable justify a separate scope if the variable is inlined function's argument. Radar 8122864. llvm-svn: 106792	2010-06-24 21:51:19 +00:00
Jakob Stoklund Olesen	2b87d44c5d	Don't return a std::vector in the Spiller interface, but take a reference to a vector instead. This avoids needless copying and allocation. Add documentation. llvm-svn: 106788	2010-06-24 20:54:29 +00:00
Jakob Stoklund Olesen	9b659142a6	Remove the now unused LiveIntervals::getVNInfoSourceReg(). This method was always a bit too simplistic for the real world. It didn't really deal with subregisters and such. llvm-svn: 106781	2010-06-24 20:18:15 +00:00
Jakob Stoklund Olesen	487ed997d0	Teach AdjustCopiesBackFrom to also use CoalescerPair to identify compatible copies. llvm-svn: 106780	2010-06-24 20:16:00 +00:00
Jakob Stoklund Olesen	7f894d8fdc	Remove the -fast-spill option. This code path has never really been used, and we are going to be handling spilling through the Spiller interface in the future. llvm-svn: 106777	2010-06-24 19:56:08 +00:00
Bill Wendling	3f0e992af1	Loosen up the requirements in the Horrible Hack(tm) to include all selectors which don't have a catch-all associated with them not just clean-ups. This fixes the SingleSource/Benchmarks/Shootout-C++/except.cpp testcase that broke because of my change r105902. llvm-svn: 106772	2010-06-24 18:49:10 +00:00
Jakob Stoklund Olesen	45230239e4	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. This second attempt fixes some crashes that only occurred Linux. llvm-svn: 106769	2010-06-24 18:15:01 +00:00
Jakob Stoklund Olesen	a612d7c012	Print the LSBs of a SlotIndex symbolically using letters referring to the [L]oad, [u]se, [d]ef, or [S]tore slots. This makes it easier to see if two indices refer to the same instruction, avoiding mental mod 4 calculations. llvm-svn: 106766	2010-06-24 17:31:07 +00:00
Dan Gohman	8a84cd57ae	Simplify this code; switch lowering shouldn't produce cases which trivially fold away. llvm-svn: 106765	2010-06-24 17:08:31 +00:00
Jakob Stoklund Olesen	3b2b46a700	Be more strict about subreg-to-subreg copies in CoalescerPair. Also keep track of the original DstREg before subregister adjustments. llvm-svn: 106753	2010-06-24 16:19:28 +00:00
Jakob Stoklund Olesen	53ccab7d1c	Verify that VNI kills are pointing to existing instructions. In this case it is essential that the kill is real because the spiller will decide to omit a spill if it thinks there is a later kill. llvm-svn: 106751	2010-06-24 15:56:59 +00:00
Dan Gohman	463f26b4be	Eliminate the other half of the BRCOND optimization, and update as many tests as possible. llvm-svn: 106749	2010-06-24 15:24:03 +00:00
Dan Gohman	df6b33e778	Eliminate the first have of the optimization which eliminates BRCOND when the condition is constant. This optimization shouldn't be necessary, because codegen shouldn't be able to find dead control paths that the IR-level optimizer can't find. And it's undesirable, because it encourages bugpoint to leave "br i1 false" branches in its output. And it wasn't updating the CFG. I updated all the tests I could, but some tests are too reduced and I wasn't able to meaningfully preserve them. llvm-svn: 106748	2010-06-24 15:04:11 +00:00
Dan Gohman	600f62b3ba	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Dan Gohman	0695e09b09	Optimize the "bit test" code path for switch lowering in the case where the bit mask has exactly one bit. llvm-svn: 106716	2010-06-24 02:06:24 +00:00
Jakob Stoklund Olesen	dbb58d2974	Revert "Replace a big gob of old coalescer logic with the new CoalescerPair class." Whiny buildbots. llvm-svn: 106710	2010-06-24 00:52:22 +00:00
Jakob Stoklund Olesen	f38e6720cc	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. llvm-svn: 106701	2010-06-24 00:12:39 +00:00
Bill Wendling	a136521a17	MorphNodeTo doesn't preserve the memory operands. Because we're morphing a node into the same node, but with different non-memory operands, we need to replace the memory operands after it's finished morphing. llvm-svn: 106643	2010-06-23 18:16:24 +00:00
Daniel Dunbar	4df321b7ad	Revert r106263, "Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass,"... it was causing both 'file' (with clang) and 176.gcc (with llvm-gcc) to be miscompiled. llvm-svn: 106634	2010-06-23 17:09:26 +00:00
Jim Grosbach	b58c08b0ba	Some targets don't require the fencing MEMBARRIER instructions surrounding atomic intrinsics, either because the use locking instructions for the atomics, or because they perform the locking directly. Add support in the DAG combiner to fold away the fences. llvm-svn: 106630	2010-06-23 16:07:42 +00:00
Jakob Stoklund Olesen	731ea71f59	Add a few VNInfo data structure checks. llvm-svn: 106627	2010-06-23 15:34:36 +00:00
Daniel Dunbar	ef5a4383ad	Revert r106066, "Create a more targeted fix for not sinking instructions into a range where it"... it causes bzip2 to be miscompiled by Clang. Conflicts: lib/CodeGen/MachineSink.cpp llvm-svn: 106614	2010-06-23 00:48:25 +00:00
Jakob Stoklund Olesen	1023f6bd98	Also convert SUBREG_TO_REG to a KILL when relevant, like the other subreg instructions. This does not affect codegen much because SUBREG_TO_REG is only used by X86 and X86 does not use the register scavenger, but it prevents verifier errors. llvm-svn: 106583	2010-06-22 22:11:07 +00:00
Dan Gohman	3570f81b1e	Move PHIElimination's SplitCriticalEdge for MachineBasicBlocks out into a utility routine, teach it how to update MachineLoopInfo, and make use of it in MachineLICM to split critical edges on demand. llvm-svn: 106555	2010-06-22 17:25:57 +00:00
Jakob Stoklund Olesen	9c47dac677	Remove the SimpleJoin optimization from SimpleRegisterCoalescing. Measurements show that it does not speed up coalescing, so there is no reason the keep the added complexity around. Also clean out some unused methods and static functions. llvm-svn: 106548	2010-06-22 16:13:57 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Dan Gohman	2370e2fe0f	When unfolding a load, avoid assuming which instruction that kill and dead flags will end up on. llvm-svn: 106520	2010-06-22 02:07:21 +00:00
Devang Patel	b6e058da18	Use single interface, using twine, to get named metadata. getNamedMetadata(). llvm-svn: 106518	2010-06-22 01:19:38 +00:00
Evan Cheng	37bb617f8a	Tail merging pass shall not break up IT blocks. rdar://8115404 llvm-svn: 106517	2010-06-22 01:18:16 +00:00
Devang Patel	cbc6fd8493	Discard special LLVM prefix from linkage name. llvm-svn: 106516	2010-06-22 01:06:05 +00:00
Devang Patel	ad51735794	Do not rely on Twine temporaries to survive. llvm-svn: 106515	2010-06-22 01:01:58 +00:00
Dan Gohman	851e478e6b	Fix the new load-unfolding code to update LiveVariable's dead flags, in addition to the kill flags. llvm-svn: 106512	2010-06-22 00:32:04 +00:00
Dan Gohman	3c1b3c61e9	Teach two-address lowering how to unfold a load to open up commuting opportunities. For example, this lets it emit this: movq (%rax), %rcx addq %rdx, %rcx instead of this: movq %rdx, %rcx addq (%rax), %rcx in the case where %rdx has subsequent uses. It's the same number of instructions, and usually the same encoding size on x86, but it appears faster, and in general, it may allow better scheduling for the load. llvm-svn: 106493	2010-06-21 22:17:20 +00:00
Dan Gohman	dd41bba517	Use A.append(...) instead of A.insert(A.end(), ...) when A is a SmallVector, and other SmallVector simplifications. llvm-svn: 106452	2010-06-21 19:47:52 +00:00
Dan Gohman	bbc29ea821	Revert r106422, which is breaking the non-fast-isel path. llvm-svn: 106423	2010-06-21 16:02:28 +00:00
Dan Gohman	f64fdd69d0	More changes for non-top-down fast-isel. Split the code for materializing a value out of SelectionDAGBuilder::getValue into a helper function, so that it can be used in other ways. Add a new getNonRegisterValue function which uses it, for use in code which doesn't want a CopyFromReg even when FuncMap.ValueMap already has an entry for it. llvm-svn: 106422	2010-06-21 15:13:54 +00:00
Dan Gohman	f91aff5f13	Do one lookup instead of two. llvm-svn: 106415	2010-06-21 14:21:47 +00:00
Dan Gohman	7c58cf75fa	Generalize this to look in the regular ValueMap in addition to the LocalValueMap, to make it more flexible when fast-isel isn't proceding straight top-down. llvm-svn: 106414	2010-06-21 14:17:46 +00:00
Bob Wilson	4581434c27	Tidy. llvm-svn: 106383	2010-06-19 05:33:57 +00:00
Dan Gohman	8693650422	Teach regular and fast isel to set dead flags on unused implicit defs on calls and similar instructions. llvm-svn: 106353	2010-06-18 23:28:01 +00:00
Jakob Stoklund Olesen	678927e0b1	Only run CoalesceExtSubRegs when we can expect LiveIntervalAnalysis to clean up the inserted INSERT_SUBREGs after us. llvm-svn: 106345	2010-06-18 23:10:20 +00:00
Evan Cheng	2d51c7c592	Allow ARM if-converter to be run after post allocation scheduling. - This fixed a number of bugs in if-converter, tail merging, and post-allocation scheduler. If-converter now runs branch folding / tail merging first to maximize if-conversion opportunities. - Also changed the t2IT instruction slightly. It now defines the ITSTATE register which is read by instructions in the IT block. - Added Thumb2 specific hazard recognizer to ensure the scheduler doesn't change the instruction ordering in the IT block (since IT mask has been finalized). It also ensures no other instructions can be scheduled between instructions in the IT block. This is not yet enabled. llvm-svn: 106344	2010-06-18 23:09:54 +00:00
Jim Grosbach	a57c2885cf	back-end libcall handling for ATOMIC_SWAP (__sync_lock_test_and_set) llvm-svn: 106342	2010-06-18 23:03:10 +00:00
Jakob Stoklund Olesen	07f4fa8198	TwoAddressInstructionPass::CoalesceExtSubRegs can insert INSERT_SUBREG instructions, but it doesn't really understand live ranges, so the first INSERT_SUBREG uses an implicitly defined register. Fix it in LiveVariableAnalysis by adding the <undef> flag. llvm-svn: 106333	2010-06-18 22:29:44 +00:00
Evan Cheng	cf9e8a987f	Fix an inverted condition. llvm-svn: 106330	2010-06-18 22:17:13 +00:00
Evan Cheng	f5d62535a5	Fix cross initialization compilation error. llvm-svn: 106324	2010-06-18 22:01:37 +00:00
Evan Cheng	c0e0d85b18	Teach iff-converter to properly count # of dups. It was not skipping over dbg_value's which resulted in non-duplicated instructions being deleted. rdar://8104384. llvm-svn: 106323	2010-06-18 21:52:57 +00:00
Jim Grosbach	d64dfc1568	Add Expand-to-libcall support for additional atomics. This covers the usual entries used by llvm-gcc. *_[U]MIN and such can be added later if needed. This enables the front ends to simplify handling of the atomic intrinsics by removing the target-specific decision about which targets can handle the intrinsics. llvm-svn: 106321	2010-06-18 21:43:38 +00:00
Dan Gohman	e5457c275d	Don't leak RegClass2VRegMap, which is now a new[] array instead of a std::vector. llvm-svn: 106298	2010-06-18 18:54:05 +00:00
Dan Gohman	882bb2984e	Start TargetRegisterClass indices at 0 instead of 1, so that MachineRegisterInfo doesn't have to confusingly allocate an extra entry. llvm-svn: 106296	2010-06-18 18:13:55 +00:00
Bob Wilson	f82c8fcc58	Fix PR7372: Conditional branches (at least on ARM) are treated as predicated, so when IfConverter::CopyAndPredicateBlock checks to see if it should ignore an instruction because it is a branch, it should not check if the branch is predicated. This case (when IgnoreBr is true) is only relevant from IfConvertTriangle, where new branches are inserted after the block has been copied and predicated. If the original branch is not removed, we end up with multiple conditional branches (possibly conflicting) at the end of the block. Aside from any immediate errors resulting from that, this confuses the AnalyzeBranch functions so that the branches are not analyzable. That in turn causes the IfConverter to think that the "Simple" pattern can be applied, and things go downhill fast because the "Simple" pattern does _not_ apply if the block can fall through. This is pretty fragile. If there are other degenerate cases where AnalyzeBranch fails, but where the block may still fall through, the IfConverter should not perform its "Simple" if-conversion. But, I don't know how to do that with the current AnalyzeBranch interface, so for now, the best thing seems to be to avoid creating branches that AnalyzeBranch cannot handle. Evan, please review! llvm-svn: 106291	2010-06-18 17:07:23 +00:00
Dan Gohman	9f58b3e106	Don't bother calling releaseMemory before destroying the DominatorTreeBase. llvm-svn: 106287	2010-06-18 16:09:11 +00:00
Dan Gohman	7edb39cc6b	Minor code simplifications. llvm-svn: 106286	2010-06-18 16:00:29 +00:00
Dan Gohman	6e681a5fbe	Give NamedRegionTimer an Enabled flag, allowing all its clients to switch from this: if (TimePassesIsEnabled) { NamedRegionTimer T(Name, GroupName); do_something(); } else { do_something(); // duplicate the code, this time without a timer! } to this: { NamedRegionTimer T(Name, GroupName, TimePassesIsEnabled); do_something(); } llvm-svn: 106285	2010-06-18 15:56:31 +00:00
Dan Gohman	96ca25eba5	Don't replace the old Ordering object with a new one; just clear() the old one. llvm-svn: 106284	2010-06-18 15:40:58 +00:00
Dan Gohman	a4f46b3ef8	Don't call clear() on DbgInfo when it's going to be deleted anyway. Don't replace the old DbgInfo with a new one when clear() on the old one is sufficient. llvm-svn: 106283	2010-06-18 15:36:18 +00:00
Dan Gohman	92c11acdb8	Change UpdateNodeOperands' operand and return value from SDValue to SDNode *, since it doesn't care about the ResNo value. llvm-svn: 106282	2010-06-18 15:30:29 +00:00
Dan Gohman	f1d8304fe3	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Dan Gohman	35b6f9a929	isValueValidForType can be a static member function. llvm-svn: 106278	2010-06-18 14:01:07 +00:00
Dan Gohman	b92156d5e4	Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass, which is faster, simpler, and less surprising. llvm-svn: 106263	2010-06-18 01:05:21 +00:00
Dan Gohman	0883789ec4	Handle ext(ext(x)) -> ext(x) immediately, since it's simple. llvm-svn: 106256	2010-06-18 00:08:30 +00:00
Stuart Hastings	0125b6410a	Add a DebugLoc parameter to TargetInstrInfo::InsertBranch(). This addresses a longstanding deficiency noted in many FIXMEs scattered across all the targets. This effectively moves the problem up one level, replacing eleven FIXMEs in the targets with eight FIXMEs in CodeGen, plus one path through FastISel where we actually supply a DebugLoc, fixing Radar 7421831. llvm-svn: 106243	2010-06-17 22:43:56 +00:00
Jim Grosbach	0ed5b460dc	add missing break. inconsequential as the code shouldn't be reached, but for correctness' sake, it should be there. llvm-svn: 106229	2010-06-17 17:58:54 +00:00
Jim Grosbach	3aeae8aeeb	Add entries for Expanding atomic intrinsics to libcalls. Just a placeholder for the moment. The implementation of the libcall will follow. Currently, the llvm-gcc knows when the intrinsics can be correctly handled by the back end and only generates them in those cases, issuing libcalls directly otherwise. That's too much coupling. The intrinsics should always be generated and the back end decide how to handle them, be it with a libcall, inline code, or whatever. This patch is a step in that direction. rdar://8097623 llvm-svn: 106227	2010-06-17 17:50:54 +00:00
Jim Grosbach	ba451e80dc	ISD::MEMBARRIER should lower to a libcall (__sync_synchronize) if the target sets the legalize action to Expand. llvm-svn: 106203	2010-06-17 02:00:53 +00:00
Jakob Stoklund Olesen	207cd4bbd7	Allow a register to be redefined multiple times in a basic block. LiveVariableAnalysis was a bit picky about a register only being redefined once, but that really isn't necessary. Here is an example of chained INSERT_SUBREGs that we can handle now: 68 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1028<kill>, 14 register: %reg1040 +[70,134:0) 76 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1029<kill>, 13 register: %reg1040 replace range with [70,78:1) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,134:0) 0@78-(134) 1@70-(78) 84 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1030<kill>, 12 register: %reg1040 replace range with [78,86:2) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,134:0) 0@86-(134) 1@70-(78) 2@78-(86) 92 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1031<kill>, 11 register: %reg1040 replace range with [86,94:3) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,94:3)[94,134:0) 0@94-(134) 1@70-(78) 2@78-(86) 3@86-(94) rdar://problem/8096390 llvm-svn: 106152	2010-06-16 21:29:40 +00:00
Jim Grosbach	6c0da25129	add FIXME llvm-svn: 106126	2010-06-16 18:45:08 +00:00
Bill Wendling	d71bd63600	Improve comment to include that the use of a preg is also verboten in this situation. llvm-svn: 106119	2010-06-16 18:01:31 +00:00
Evan Cheng	f128bdcb55	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Devang Patel	a6d20f446f	Use separate named MDNode to hold each function's local variable info. This speeds up local variable handling in DwarfDebug. llvm-svn: 106075	2010-06-16 00:53:55 +00:00
Eric Christopher	b672ab9b53	Don't emit the linkage for initializer label for mach-o tls. llvm-svn: 106073	2010-06-16 00:27:30 +00:00
Bill Wendling	8c0cf0994d	Create a more targeted fix for not sinking instructions into a range where it will conflict with another live range. The place which creates this scenerio is the code in X86 that lowers a select instruction by splitting the MBBs. This eliminates the need to check from the bottom up in an MBB for live pregs. llvm-svn: 106066	2010-06-15 23:46:31 +00:00
Stuart Hastings	9b5005cd4b	Added a comment. llvm-svn: 106063	2010-06-15 23:06:30 +00:00
Bob Wilson	8105144fcd	Fix 80col violations, remove trailing whitespace, and clarify a comment. llvm-svn: 106057	2010-06-15 22:18:54 +00:00
Jakob Stoklund Olesen	ec2e964fd6	Remove the local register allocator. Please use the fast allocator instead. llvm-svn: 106051	2010-06-15 21:58:33 +00:00
Mon P Wang	7a84689cc5	Fixed vector widening of binary instructions that can trap. Patch by Visa Putkinen! llvm-svn: 106038	2010-06-15 20:29:05 +00:00
Bob Wilson	fc7d739422	IfConversion's AnalyzeBlocks method always returns false; clean it up. llvm-svn: 106027	2010-06-15 18:57:15 +00:00
Jim Grosbach	c964585ff8	fix naming llvm-svn: 106024	2010-06-15 18:53:34 +00:00
Jakob Stoklund Olesen	6e54c908e0	Fix an exotic bug that only showed up in an internal test case. SimpleRegisterCoalescing::JoinIntervals() uses CoalescerPair to determine if a copy is coalescable, and in very rare cases it can return true where LHS is not live - the coalescable copy can come from an alias of the physreg in LHS. llvm-svn: 106021	2010-06-15 18:49:14 +00:00
Bob Wilson	5947573f39	Fix a comment typo. llvm-svn: 106015	2010-06-15 18:19:27 +00:00
Bob Wilson	de94e66234	Add some missing checks for the case where the extract_subregs are combined to an insert_subreg, i.e., where the destination register is larger than the source. We need to check that the subregs can be composed for that case in a symmetrical way to the case when the destination is smaller. llvm-svn: 106004	2010-06-15 17:27:54 +00:00
Jakob Stoklund Olesen	246e9a07a2	Avoid processing early clobbers twice in RegAllocFast. Early clobbers defining a virtual register were first alocated to a physreg and then processed as a physreg EC, spilling the virtreg. This fixes PR7382. llvm-svn: 105998	2010-06-15 16:20:57 +00:00
Jakob Stoklund Olesen	82eca35b3e	Add CoalescerPair helper class. Given a copy instruction, CoalescerPair can determine which registers to coalesce in order to eliminate the copy. It deals with all the subreg fun to determine a tuple (DstReg, SrcReg, SubIdx) such that: - SrcReg is a virtual register that will disappear after coalescing. - DstReg is a virtual or physical register whose live range will be extended. - SubIdx is 0 when DstReg is a physical register. - SrcReg can be joined with DstReg:SubIdx. CoalescerPair::isCoalescable() determines if another copy instruction is compatible with the same tuple. This fixes some NEON miscompilations where shuffles are getting coalesced as if they were copies. The CoalescerPair class will replace a lot of the spaghetti logic in JoinCopy later. llvm-svn: 105997	2010-06-15 16:04:21 +00:00
Bob Wilson	a55b8877e6	Generalize the pre-coalescing of extract_subregs feeding reg_sequences, replacing the overly conservative checks that I had introduced recently to deal with correctness issues. This makes a pretty noticable difference in our testcases where reg_sequences are used. I've updated one test to check that we no longer emit the unnecessary subreg moves. llvm-svn: 105991	2010-06-15 05:56:31 +00:00
Ted Kremenek	d52caa5244	Update CMake build. llvm-svn: 105987	2010-06-15 04:08:14 +00:00
Jim Grosbach	412800d346	More dbg_value cleanup so the presence of debug info doesn't affect code-gen. Make sure to skip the dbg_value instructions when moving dups out of the diamond. rdar://7797940 llvm-svn: 105965	2010-06-14 21:30:32 +00:00
Evan Cheng	078f4cec21	- Do away with SimpleHazardRecognizer.h. It's not used and offers little value. - Rename ExactHazardRecognizer to PostRAHazardRecognizer and move its header to include to allow targets to extend it. llvm-svn: 105959	2010-06-14 21:06:53 +00:00
Evan Cheng	a397ada078	Avoid uncessary array copying. llvm-svn: 105955	2010-06-14 20:18:40 +00:00
Chris Lattner	0fc88efda3	fix a -Wbool-conversions warning from clang. llvm-svn: 105942	2010-06-14 18:28:34 +00:00
Bill Wendling	5d6103318a	When performing the Horrible Hack(tm-Duncan) on the EH code to convert a clean-up to a catch-all after inlining, take into account that there could be filter IDs as well. The presence of filters don't mean that the selector catches anything. It's just metadata information. llvm-svn: 105872	2010-06-12 02:34:29 +00:00
Evan Cheng	e60273fd70	Allow target to provide its own hazard recognizer to post-ra scheduler. llvm-svn: 105862	2010-06-12 00:12:18 +00:00
Evan Cheng	cb1fe56fd9	Code formatting. llvm-svn: 105861	2010-06-12 00:11:53 +00:00
Stuart Hastings	afe54f1625	Support for nested functions/classes in debug output. (Again.) Radar 7424645. llvm-svn: 105828	2010-06-11 20:08:44 +00:00
Evan Cheng	38f6560461	Code refactoring, no functionality changes. llvm-svn: 105775	2010-06-10 02:09:31 +00:00
Jakob Stoklund Olesen	8bc5eca331	Mark physregs defined by inline asm as implicit. This is a bit of a hack to make inline asm look more like call instructions. It would be better to produce correct dead flags during isel. llvm-svn: 105749	2010-06-09 20:05:00 +00:00
Evan Cheng	a0746bd50a	Allow target to place 2-address pass inserted copies in better spots. Thumb2 will use this to try to avoid breaking up IT blocks. llvm-svn: 105745	2010-06-09 19:26:01 +00:00
Bill Wendling	5ac1d23d3d	It's an error to translate this: %reg1025 = <sext> %reg1024 ... %reg1026 = SUBREG_TO_REG 0, %reg1024, 4 into this: %reg1025 = <sext> %reg1024 ... %reg1027 = EXTRACT_SUBREG %reg1025, 4 %reg1026 = SUBREG_TO_REG 0, %reg1027, 4 The problem here is that SUBREG_TO_REG is there to assert that an implicit zext occurs. It doesn't insert a zext instruction. If we allow the EXTRACT_SUBREG here, it will give us the value after the <sext>, not the original value of %reg1024 before <sext>. llvm-svn: 105741	2010-06-09 19:00:55 +00:00
Jakob Stoklund Olesen	a13b1c29b0	Add argument name comments. llvm-svn: 105665	2010-06-09 00:40:31 +00:00
Bob Wilson	7149cfcda3	Fix a mistake in my previous change r105437: don't access operand 2 and assume that it is an immediate before checking that the instruction is an EXTRACT_SUBREG. llvm-svn: 105585	2010-06-07 23:48:46 +00:00
Dan Gohman	7398758719	Add some basic debug output. llvm-svn: 105561	2010-06-07 22:32:10 +00:00
Jim Grosbach	6201b991a2	Cleanup. Process the dbg_values separately llvm-svn: 105554	2010-06-07 21:28:55 +00:00
Jim Grosbach	0f445f328e	Move exit check where it really belongs. llvm-svn: 105541	2010-06-07 19:12:21 +00:00
Stuart Hastings	3ca391027f	Revert 105492 & 105493 due to a testcase regression. Radar 7424645. llvm-svn: 105511	2010-06-05 00:39:29 +00:00
Dale Johannesen	df1a7f83bf	Fix some liveout handling related to tail calls, see comments. I don't think this ever resulted in problems on x86, but it would on ARM. llvm-svn: 105509	2010-06-05 00:30:45 +00:00
Evan Cheng	a03e6f85fe	Re-apply 105308 with fix. llvm-svn: 105502	2010-06-04 23:28:13 +00:00
Jim Grosbach	a1e08fb256	Make if-conversion ignore dbg_value instructions in its analysis. rdar://7797940 llvm-svn: 105498	2010-06-04 23:01:26 +00:00
Stuart Hastings	7c015988fe	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 105492	2010-06-04 22:36:03 +00:00
Jim Grosbach	50d229e6b3	Skip dbg_value instructions when scanning instructions in register scavenging. llvm-svn: 105481	2010-06-04 20:18:30 +00:00
Jakob Stoklund Olesen	864827afb0	Keep track of the call instructions whose clobber lists were skipped during fast register allocation. Process all of the clobber lists at the end of the function, marking the registers as used in MachineRegisterInfo. This is necessary in case the calls clobber callee-saved registers (sic). llvm-svn: 105473	2010-06-04 18:08:29 +00:00
Mon P Wang	622cdd2297	Fixed a bug during widening where we would avoid legalizing a node. When we replace an OpA with a widened OpB, it is possible to get new uses of OpA due to CSE when recursively updating nodes. Since OpA has been processed, the new uses are not examined again. The patch checks if this occurred and it it did, updates the new uses of OpA to use OpB. llvm-svn: 105453	2010-06-04 01:20:10 +00:00
Bob Wilson	a733daf18c	Add some missing checks in TwoAddressInstructionPass::CoalesceExtSubRegs. Check that all the instructions are in the same basic block, that the EXTRACT_SUBREGs write to the same subregs that are being extracted, and that the source and destination registers are in the same regclass. Some of these constraints can be relaxed with a bit more work. Jakob suggested that the loop that checks for subregs when NewSubIdx != 0 should use the "nodbg" iterator, so I made that change here, too. llvm-svn: 105437	2010-06-03 23:53:58 +00:00
Jim Grosbach	01edd68225	Cleanup 80-column and trim trailing whitespace llvm-svn: 105435	2010-06-03 23:49:57 +00:00
Dan Gohman	d83e3e7750	Fix SimplifyDemandedBits' AssertZext logic to demand all the bits. It needs to demand the high bits because it's asserting that they're zero. llvm-svn: 105406	2010-06-03 20:21:33 +00:00
Bob Wilson	30093b5d8b	Revert 105308. llvm-svn: 105399	2010-06-03 18:28:31 +00:00
Bill Wendling	f82aea634c	Machine sink could potentially sink instructions into a block where the physical registers it defines then interfere with an existing preg live range. For instance, if we had something like these machine instructions: BB#0 ... = imul ... EFLAGS<imp-def,dead> test ..., EFLAGS<imp-def> jcc BB#2 EFLAGS<imp-use> BB#1 ... ; fallthrough to BB#2 BB#2 ... ; No code that defines EFLAGS jcc ... EFLAGS<imp-use> Machine sink will come along, see that imul implicitly defines EFLAGS, but because it's "dead", it assumes that it can move imul into BB#2. But when it does, imul's "dead" imp-def of EFLAGS is raised from the dead (a zombie) and messes up the condition code for the jump (and pretty much anything else which relies upon it being correct). The solution is to know which pregs are live going into a basic block. However, that information isn't calculated at this point. Nor does the LiveVariables pass take into account non-allocatable physical registers. In lieu of this, we do a very conservative pass through the basic block to determine if a preg is live coming out of it. llvm-svn: 105387	2010-06-03 07:54:20 +00:00
Eric Christopher	f67fe3b1e8	One underscore, not two. llvm-svn: 105379	2010-06-03 04:02:59 +00:00
Eli Friedman	dbbbf73c96	Implement expansion in type legalization for add/sub with overflow. The expansion is the same as that used by LegalizeDAG. The resulting code sucks in terms of performance/codesize on x86-32 for a 64-bit operation; I haven't looked into whether different expansions might be better in general. llvm-svn: 105378	2010-06-03 03:49:50 +00:00
Jakob Stoklund Olesen	4029596f93	Use the fast register allocator by default for -O0 builds. This affects both llvm-gcc and clang. llvm-svn: 105372	2010-06-03 00:39:06 +00:00
Jakob Stoklund Olesen	818e4df2b4	Use readsWritesVirtualRegister instead of counting uses and defs when inserting spills and reloads. This means that a partial define of a register causes a reload so the other parts of the register are preserved. The reload can be prevented by adding an <imp-def> operand for the full register. This is already done by the coalescer and live interval analysis where relevant. llvm-svn: 105369	2010-06-03 00:07:47 +00:00
Jakob Stoklund Olesen	42c642cd24	Add full register <imp-def> operands when the coalescer is creating partial register updates. These operands tell the spiller that the other parts of the partially defined register are don't-care, and a reload is not necessary. llvm-svn: 105361	2010-06-02 23:22:11 +00:00
Bill Wendling	7ee730eb40	Compulsive reformating. No functionalitical changes. llvm-svn: 105359	2010-06-02 23:04:26 +00:00
Jakob Stoklund Olesen	a8ad97743d	Slightly change the meaning of the reMaterialize target hook when the original instruction defines subregisters. Any existing subreg indices on the original instruction are preserved or composed with the new subreg index. Also substitute multiple operands mentioning the original register by using the new MachineInstr::substituteRegister() function. This is necessary because there will soon be <imp-def> operands added to non read-modify-write partial definitions. This instruction: %reg1234:foo = FLAP %reg1234<imp-def> will reMaterialize(%reg3333, bar) like this: %reg3333:bar-foo = FLAP %reg333:bar<imp-def> Finally, replace the TargetRegisterInfo pointer argument with a reference to indicate that it cannot be NULL. llvm-svn: 105358	2010-06-02 22:47:25 +00:00
Rafael Espindola	f2dffcef82	Remove the TargetRegisterClass member from CalleeSavedInfo llvm-svn: 105344	2010-06-02 20:02:30 +00:00
Devang Patel	c2254f6b98	Skip identical instruction while calculating DBG_VALUE range. llvm-svn: 105340	2010-06-02 19:05:13 +00:00
Bob Wilson	2d35a9e810	Rename canCombinedSubRegIndex method to something more grammatically correct and tidy up the comment describing it. llvm-svn: 105339	2010-06-02 18:54:47 +00:00
Devang Patel	21ccf05b4c	Use local small vector. llvm-svn: 105332	2010-06-02 16:42:51 +00:00
Jim Grosbach	848548300d	Not all entries in the range will have an SUnit. Check for that when looking for debug information. llvm-svn: 105324	2010-06-02 15:29:36 +00:00
Rafael Espindola	c08ecba597	Remove uses of getCalleeSavedRegClasses from outside the backends and removes the virtual declaration. With that out of the way I should be able to cleanup one backend at a time. llvm-svn: 105321	2010-06-02 12:39:06 +00:00
Evan Cheng	a2da22734f	Enable machine cse of instructions which define physical registers. llvm-svn: 105308	2010-06-02 01:08:27 +00:00
Bob Wilson	f4a34b97b8	Fix an obvious mistake: don't change the operands until all of them have been checked and it is safe to proceed with the changes. llvm-svn: 105304	2010-06-02 00:16:08 +00:00
Jim Grosbach	12ac8f0352	Update debug information when breaking anti-dependencies. rdar://7759363 llvm-svn: 105300	2010-06-01 23:48:44 +00:00
Jakob Stoklund Olesen	7b0ac865a4	Properly compose subregister indices when coalescing. The comment about ordering of subreg indices is no longer true. This exposed a bug in the new substVirtReg method that is also fixed. llvm-svn: 105294	2010-06-01 22:39:25 +00:00
Devang Patel	d43e0ca916	Ignore line number of debug value in undefined register. llvm-svn: 105292	2010-06-01 21:43:09 +00:00
Devang Patel	b0c76394a3	Keep track of incoming debug value of unused argument. Radar 7927666. llvm-svn: 105285	2010-06-01 19:59:01 +00:00
Dan Gohman	b782caa393	Fill in missing support for ISD::FEXP, ISD::FPOWI, and friends. llvm-svn: 105283	2010-06-01 18:35:14 +00:00
Jim Grosbach	b24d5c6ce2	Add a FIXME llvm-svn: 105282	2010-06-01 18:06:35 +00:00
Jim Grosbach	74d8345512	When processing function arguments when splitting live ranges across invokes, handle structs passed by value via an extract/insert pair, as a bitcast won't work on a struct. rdar://7742824 llvm-svn: 105280	2010-06-01 18:04:09 +00:00
Chris Lattner	14c46517b5	fix PR6623: when optimizing for size, don't inline memcpy/memsets that are too large. This causes the freebsd bootloader to be too large apparently. It's unclear if this should be an -Os or -Oz thing. Thoughts welcome. llvm-svn: 105228	2010-05-31 17:30:14 +00:00
Chris Lattner	b4a773b452	the 'limit' argument to FindOptimalMemOpLowering is unsigned, not uint64_t. llvm-svn: 105226	2010-05-31 17:12:23 +00:00
Oscar Fuentes	a97311f152	Use `llvm::next' instead of `next' to make VC++ 2010 happy. llvm-svn: 105168	2010-05-30 13:14:21 +00:00
Dan Gohman	4db93c9700	Reorder some code in SelectionDAGBuilder. llvm-svn: 105105	2010-05-29 17:53:24 +00:00
Dan Gohman	d16aa541af	SelectionDAG shouldn't have a FunctionLoweringInfo member. RegsForValue shouldn't have a TargetLoweringInfo member. And FunctionLoweringInfo::set doesn't needs its EnableFastISel argument. llvm-svn: 105101	2010-05-29 17:03:36 +00:00
Benjamin Kramer	c488e92f0b	Remove unused function. llvm-svn: 105100	2010-05-29 14:03:51 +00:00
Evan Cheng	707b7cc429	Remove schedule-livein-copies. It's not being used. llvm-svn: 105095	2010-05-29 02:23:39 +00:00
Jakob Stoklund Olesen	ab6223949e	Handle composed subreg indices when processing REQ_SEQUENCE instructions. llvm-svn: 105066	2010-05-29 00:14:14 +00:00
Evan Cheng	032f3261a2	Doh. Machine LICM is re-initializing the CSE map over and over. Patch by Anna Zaks. rdar://8037934. llvm-svn: 105065	2010-05-29 00:06:36 +00:00
Evan Cheng	cc2efe11db	Fix some latency computation bugs: if the use is not a machine opcode do not just return zero. llvm-svn: 105061	2010-05-28 23:26:21 +00:00
Jakob Stoklund Olesen	64824ea99f	Add a TargetRegisterInfo::composeSubRegIndices hook with a default implementation that is correct for most targets. Tablegen will override where needed. Add MachineOperand::subst{Virt,Phys}Reg methods that correctly handle existing subreg indices when sustituting registers. llvm-svn: 104985	2010-05-28 18:18:53 +00:00
Stuart Hastings	c1e216583f	Revert 104841, 104842, 104876 due to buildbot failures. Radar 7424645. llvm-svn: 104953	2010-05-28 16:41:07 +00:00
Dan Gohman	2140a74979	Eliminate the restriction that the array size in an alloca must be i32. This will help reduce the amount of casting required on 64-bit targets. llvm-svn: 104911	2010-05-28 01:14:11 +00:00
Jakob Stoklund Olesen	b613ae2c89	Add a -regalloc=default option that chooses a register allocator based on the -O optimization level. This only really affects llc for now because both the llvm-gcc and clang front ends override the default register allocator. I intend to remove that code later. llvm-svn: 104904	2010-05-27 23:57:25 +00:00
Jim Grosbach	faa3abbe39	Update the saved stack pointer in the sjlj function context following either an alloca() or an llvm.stackrestore(). rdar://8031573 llvm-svn: 104900	2010-05-27 23:49:24 +00:00
Jim Grosbach	c9f532dddc	back out 104862/104869. Can reuse stacksave after all. Very cool. llvm-svn: 104897	2010-05-27 23:11:57 +00:00
Devang Patel	7a9dedf0ab	Do not drop location info for inlined function args. llvm-svn: 104884	2010-05-27 20:25:04 +00:00
Jim Grosbach	b68dfb45f5	hook ISD::STACKADDR to an intrinsic llvm-svn: 104869	2010-05-27 18:52:11 +00:00
Devang Patel	5e6b71ce34	inlined function's arguments need a label to mark the start point because they are not directly attached to current function. llvm-svn: 104848	2010-05-27 16:47:30 +00:00
Stuart Hastings	8e99e50d08	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 104841	2010-05-27 16:16:54 +00:00
Devang Patel	6b9a9fe207	Simplify. Eliminate unneeded debug_loc entry. llvm-svn: 104785	2010-05-26 23:55:23 +00:00
Bill Wendling	ddee3cb163	Add FIXME comment to remove this. llvm-svn: 104749	2010-05-26 21:53:50 +00:00
Daniel Dunbar	b33dfbcba4	MC: Add TargetMachine support for setting the value of MCRelaxAll with -filetype=obj. llvm-svn: 104747	2010-05-26 21:48:55 +00:00
Devang Patel	acc32a5c19	There is no need to force an line number entry (using previous location) for a temp label at unknown location. llvm-svn: 104740	2010-05-26 21:23:46 +00:00
Bill Wendling	27311269cb	Add "setjmp_syscall", "savectx", "qsetjmp", "vfork", "getcontext" to the list of usual suspects that could "return twice". llvm-svn: 104737	2010-05-26 20:39:00 +00:00
Jim Grosbach	c98892fdaa	Adjust eh.sjlj.setjmp to properly have a chain and to have an opcode entry in ISD::. No functional change. llvm-svn: 104734	2010-05-26 20:22:18 +00:00
Devang Patel	1b08572a66	Update debug info when live-in reg is copied into a vreg. llvm-svn: 104732	2010-05-26 20:18:50 +00:00
Bill Wendling	0c3bfd3fb0	Move the check for "calls setjmp" to SelectionDAGISel so that it can be used by more than just the stack slot coloring algorithm. llvm-svn: 104722	2010-05-26 19:46:12 +00:00
Devang Patel	002d54ddc9	Identify instructions, that needs a label to mark debug info entity, in advance. This simplifies beginScope(). llvm-svn: 104720	2010-05-26 19:37:24 +00:00
Dan Gohman	52c2738324	Eliminate the use of PriorityQueue and just use a std::vector, implementing pop with a linear search for a "best" element. The priority queue was a neat idea, but in practice the comparison functions depend on dynamic information. llvm-svn: 104718	2010-05-26 18:52:00 +00:00
Dan Gohman	1e5d0b0456	Delete an unused function. llvm-svn: 104716	2010-05-26 18:34:12 +00:00
Devang Patel	95fcc96752	Remove dead code. llvm-svn: 104706	2010-05-26 17:42:50 +00:00
Devang Patel	5a5e0bc3b5	Do not construct location list backword! llvm-svn: 104705	2010-05-26 17:29:32 +00:00
Eric Christopher	e805ea9e39	Temporarily revert r104655 as it's breaking the bots. llvm-svn: 104664	2010-05-26 01:59:55 +00:00
Dan Gohman	7c00576a62	Change push_all to a non-virtual function and implement it in the base class, since all the implementations are the same. llvm-svn: 104659	2010-05-26 01:10:55 +00:00
Dan Gohman	3701b3928e	Trim #include. llvm-svn: 104657	2010-05-26 00:55:59 +00:00
Bill Wendling	c5222d6c38	Dale and Evan suggested putting the "check for setjmp" much earlier in the machine code generation. That's a good idea, so I made it so. llvm-svn: 104655	2010-05-26 00:32:40 +00:00
Devang Patel	9fc11706e3	First cut at supporting .debug_loc section. This is used to track variable information. llvm-svn: 104649	2010-05-25 23:40:22 +00:00
Bill Wendling	388f638511	Constify function. llvm-svn: 104646	2010-05-25 22:02:22 +00:00
Dan Gohman	ce3269b815	Do one map lookup instead of two. llvm-svn: 104645	2010-05-25 21:59:42 +00:00
Eric Christopher	f3925438e5	Move the verbose asm output up a bit so it can be used in the special cases as well. llvm-svn: 104642	2010-05-25 21:49:43 +00:00
Bill Wendling	b04ef0cfbc	Okay, bear with me here... If you have a setjmp/longjmp situation, it's possible for stack slot coloring to reuse a stack slot before it's really dead. For instance, if we have something like this: 1: y = g; x = sigsetjmp(env, 0); switch (x) { case 1: /* ... / goto run; case 0: run: do_run(); / marked as "no return" / break; case 3: if (...) { / ... / goto run; } / ... */ break; } 2: g = y; "y" may be put onto the stack, so the expression "g = y" is relying upon the fact that the stack slot containing "y" isn't modified between (1) and (2). But it can be, because of the "no return" calls in there. A longjmp might come back with 3, modify the stack slot, and then go to case 0. And it's perfectly acceptable to reuse the stack slot there because there's no CFG flow from case 3 to (2). The fix is to disable certain optimizations in these situations. Ideally, we'd disable them for all "returns twice" functions. But we don't support that attribute. Check for "setjmp" and "sigsetjmp" instead. llvm-svn: 104640	2010-05-25 21:44:26 +00:00
Eric Christopher	19a4b843cc	Add support for initialized global data for darwin tls. Update comments and testcases accordingly. llvm-svn: 104635	2010-05-25 21:28:50 +00:00
Jakob Stoklund Olesen	1ad0d5e25b	Print symbolic SubRegIndex names on machine operands. llvm-svn: 104628	2010-05-25 19:49:38 +00:00
Dale Johannesen	60fe2cdc4f	Fix another variant of PR 7191. Also add a testcase Mon Ping provided; unfortunately bugpoint failed to reduce it, but I think it's important to have a test for this in the suite. 8023512. llvm-svn: 104624	2010-05-25 18:47:23 +00:00
Dale Johannesen	ff384ad981	Fix PR 7191. I have been unable to create a .ll file that fails, sorry. (oye, a word which should be better known to people writing tree traversals, means grandchild.) llvm-svn: 104619	2010-05-25 17:50:03 +00:00
Jakob Stoklund Olesen	adff18518a	Disable invalid coalescer assertion. llvm-svn: 104574	2010-05-25 00:15:18 +00:00
Bill Wendling	0b7488e8d5	Print out the name of the function during SSC. llvm-svn: 104572	2010-05-24 23:16:04 +00:00
Evan Cheng	1b79babdec	Avoid adding duplicate function live-in's. llvm-svn: 104560	2010-05-24 21:33:37 +00:00
Devang Patel	51b37e0bd8	Do not emit line number entries for unknown debug values. This fixes recent regression in store.exp from gdb testsuite. llvm-svn: 104524	2010-05-24 18:26:49 +00:00
Nicolas Geoffray	c5327226e4	Encode the Caml frametable by following what the comment says: the number of descriptors is first emitted, and StackOffsets are emitted in 16 bits. llvm-svn: 104488	2010-05-24 12:24:11 +00:00
Daniel Dunbar	3ff1a06de6	MC: Add an MCLoggingStreamer, for use in debugging integrated-as mismatches. llvm-svn: 104463	2010-05-23 17:44:06 +00:00
Evan Cheng	168ced94d8	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Jim Grosbach	bd9485db63	Implement eh.sjlj.longjmp for ARM. Clean up the intrinsic a bit. Followups: docs patch for the builtin and eh.sjlj.setjmp cleanup to match longjmp. llvm-svn: 104419	2010-05-22 01:06:18 +00:00
Eric Christopher	6fdea1bda8	Add full bss data support for darwin tls variables. llvm-svn: 104414	2010-05-22 00:10:22 +00:00
Devang Patel	4a8e6e83dc	Collect variable information during endFunction() instead of beginFunction(). llvm-svn: 104412	2010-05-22 00:04:14 +00:00
Bob Wilson	61438fe064	Clean up extra whitespace. llvm-svn: 104410	2010-05-21 23:53:55 +00:00
Eric Christopher	53ff992dde	Make this LookAheadLimit, not the uninitialized LookAheadLeft. Evan please verify! llvm-svn: 104408	2010-05-21 23:40:03 +00:00
Evan Cheng	2c8bdead9e	Allow machine cse to cse instructions which define physical registers. Controlled by option -machine-cse-phys-defs. llvm-svn: 104385	2010-05-21 21:22:19 +00:00
Bob Wilson	51d9ee3ff6	Change CodeGen/ARM/2009-11-02-NegativeLane.ll to use 16-bit vector elements so that it will continue to test what it was meant to test when I commit a separate change for better support of BUILD_VECTOR and VECTOR_SHUFFLE for Neon. Fix a DAG combiner crash exposed by this test change. llvm-svn: 104380	2010-05-21 21:05:32 +00:00
Evan Cheng	3858451e09	- Change MachineInstr::findRegisterDefOperandIdx so it can also look for defs that are aliases of the specified register. - Rename modifiesRegister to definesRegister since it's looking a def of the specific register or one of its super-registers. It's not looking for def of a sub-register or alias that could change the specified register. - Added modifiesRegister to look for defs of aliases. llvm-svn: 104377	2010-05-21 20:53:24 +00:00
Jakob Stoklund Olesen	7d7f604321	Add MachineInstr::readsWritesVirtualRegister() to determine if an instruction reads or writes a register. This takes partial redefines and undef uses into account. Don't actually use it yet. That caused miscompiles. llvm-svn: 104372	2010-05-21 20:02:01 +00:00
Devang Patel	1782aae355	Simplify llvm-svn: 104338	2010-05-21 18:49:09 +00:00
Chris Lattner	a81e1cab04	constify accessor. llvm-svn: 104325	2010-05-21 17:47:50 +00:00
Jakob Stoklund Olesen	b4e1687270	Revert "Use MachineInstr::readsWritesVirtualRegister to determine if a register is read." This reverts r104322. I think it was causing miscompilations. llvm-svn: 104323	2010-05-21 17:36:32 +00:00
Jakob Stoklund Olesen	8e8e090301	Use MachineInstr::readsWritesVirtualRegister to determine if a register is read. This correctly handles partial redefines and undef uses. llvm-svn: 104322	2010-05-21 16:42:30 +00:00
Jakob Stoklund Olesen	a648c6a757	Teach VirtRegRewriter to handle spilling in instructions that have multiple definitions of the virtual register. This happens when spilling the registers produced by REG_SEQUENCE: %reg1047:5<def>, %reg1047:6<def>, %reg1047:7<def> = VLD3d8 %reg1033, 0, pred:14, pred:%reg0 The rewriter would spill the register multiple times, dead store elimination tried to keep up, but ended up cutting the branch it was sitting on. llvm-svn: 104321	2010-05-21 16:36:13 +00:00
Jakob Stoklund Olesen	1f3801062d	If the first definition of a virtual register is a partial redef, add an <imp-def> operand for the full register. This ensures that the full physical register is marked live after register allocation. llvm-svn: 104320	2010-05-21 16:32:16 +00:00
Evan Cheng	725211e948	Rename -pre-RA-sched=hybrid to -pre-RA-sched=list-hybrid. llvm-svn: 104306	2010-05-21 00:42:32 +00:00
Devang Patel	fbd6c45e06	Simplify. llvm-svn: 104302	2010-05-21 00:10:20 +00:00
Evan Cheng	4401f8873c	Allow targets more controls on what nodes are scheduled by reg pressure, what for latency in hybrid mode. llvm-svn: 104293	2010-05-20 23:26:43 +00:00
Devang Patel	490c8ab76d	Refactor. llvm-svn: 104265	2010-05-20 19:57:06 +00:00
Jim Grosbach	63d4f68df4	Remove dbg_value workaround and associated command line option llvm-svn: 104254	2010-05-20 18:34:01 +00:00
Devang Patel	e1c53f29d3	Split DbgVariable. Eventually, variable info will be communicated through frame index, or DBG_VALUE instruction, or collection of DBG_VALUE instructions. Plus each DbgVariable may not need a label. llvm-svn: 104233	2010-05-20 16:36:41 +00:00
Evan Cheng	bdd062dae0	Add a hybrid bottom up scheduler that reduce register usage while avoiding pipeline stall. It's useful for targets like ARM cortex-a8. NEON has a lot of long latency instructions so a strict register pressure reduction scheduler does not work well. Early experiments show this speeds up some NEON loops by over 30%. llvm-svn: 104216	2010-05-20 06:13:19 +00:00
Nick Lewycky	c53cc4f8bf	Fix typo in comment. llvm-svn: 104209	2010-05-20 03:30:09 +00:00
Eric Christopher	27e7ffc7d4	Partial code for emitting thread local bss data. llvm-svn: 104197	2010-05-20 00:49:07 +00:00
Bob Wilson	42603958fb	Optimize away insertelement of an undef value. This shows up in test/Codegen/ARM/reg_sequence.ll but it doesn't affect the generated code because the coalescer cleans it up. Radar 7998853. llvm-svn: 104185	2010-05-19 23:42:58 +00:00
Jim Grosbach	f98511473e	Enable preserving debug information through post-RA scheduling llvm-svn: 104175	2010-05-19 22:57:47 +00:00
Jim Grosbach	604560c5fe	Fix the post-RA instruction scheduler to handle instructions referenced by more than one dbg_value instruction. rdar://7759363 llvm-svn: 104174	2010-05-19 22:57:06 +00:00
Evan Cheng	70e506e18a	Code clean up. llvm-svn: 104173	2010-05-19 22:42:23 +00:00
Devang Patel	a08130864e	Revert r104165. llvm-svn: 104172	2010-05-19 21:58:28 +00:00
Jakob Stoklund Olesen	e0eddb21f5	Add support for partial redefs to the fast register allocator. A partial redef now triggers a reload if required. Also don't add <imp-def,dead> operands for physical superregisters. Kill flags are still treated as full register kills, and <imp-use,kill> operands are added for physical superregisters as before. llvm-svn: 104167	2010-05-19 21:36:05 +00:00
Devang Patel	0fe341e2e2	There is no need to maintain InsnsBeginScopeSet separately. llvm-svn: 104165	2010-05-19 21:26:53 +00:00
Jakob Stoklund Olesen	5d4c134a94	Add MachineInstr::readsVirtualRegister() in preparation for proper handling of partial redefines. We are going to treat a partial redefine of a virtual register as a read-modify-write: %reg1024:6 = OP Unless the register is fully clobbered: %reg1024:6 = OP, %reg1024<imp-def> MachineInstr::readsVirtualRegister() knows the difference. The first case is a read, the second isn't. llvm-svn: 104149	2010-05-19 20:36:22 +00:00
Evan Cheng	738e920edf	Code refactoring: pull SchedPreference enum from TargetLowering.h to TargetMachine.h and put it in its own namespace. llvm-svn: 104147	2010-05-19 20:19:50 +00:00
Jakob Stoklund Olesen	e11cdf8cc8	TwoAddressInstructionPass doesn't really know how to merge live intervals when lowering REG_SEQUENCE instructions. Insert copies for REG_SEQUENCE sources not killed to avoid breaking later passes. llvm-svn: 104146	2010-05-19 20:08:00 +00:00
Bob Wilson	6a1bfd282b	When expanding a vector_shuffle, the element type may not be legal and may need to be promoted. The BUILD_VECTOR and EXTRACT_VECTOR_ELT nodes generated here already allow the promoted type to be used without further changes, so just do the promotion. This fixes part of pr7167. llvm-svn: 104141	2010-05-19 18:48:32 +00:00
Evan Cheng	abd0ad54a4	Intrinsics which do a vector compare (results are all zero or all ones) are modeled as icmp / fcmp + sext. This is turned into a vsetcc by dag combine (yes, not a good long term solution). The targets can then isel the vsetcc to the appropriate instruction. The trouble arises when the result of a vector cmp + sext is then and'ed with all ones. Instcombine will turn it into a vector cmp + zext, dag combiner will miss turning it into a vsetcc and hell breaks loose after that. Teach dag combine to turn a vector cpm + zest into a vsetcc + and 1. This fixes rdar://7923010. llvm-svn: 104094	2010-05-19 01:08:17 +00:00
Bob Wilson	055c01d9dc	Fix a crash when debugging the coalescer. DebugValue instructions are not in the coalescer's instruction map. llvm-svn: 104086	2010-05-18 23:19:42 +00:00
Jakob Stoklund Olesen	430b6e40ab	Remember to update VirtRegLastUse when spilling without killing before a call. llvm-svn: 104074	2010-05-18 22:20:09 +00:00
Evan Cheng	f19384d54a	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Jakob Stoklund Olesen	663543b4d7	Properly handle multiple definitions of a virtual register in the same instruction. This can happen on ARM: >> %reg1035:5<def>, %reg1035:6<def> = VLD1q16 %reg1028, 0, pred:14, pred:%reg0 Regs: Q0=%reg1032* R0=%reg1028* R1=%reg1029* R2 R3=%reg1031* Killing last use: %reg1028 Allocating %reg1035 from QPR Assigning %reg1035 to Q1 << %D2<def>, %D3<def> = VLD1q16 %R0<kill>, 0, pred:14, pred:%reg0, %Q1<imp-def> llvm-svn: 104056	2010-05-18 21:10:50 +00:00
Evan Cheng	45b3f702ab	Continuously refine the register class of REG_SEQUENCE def with all the source registers and sub-register indices. llvm-svn: 104051	2010-05-18 20:07:47 +00:00
Evan Cheng	e7fc64a5c9	Fix PR7162: Use source register classes and sub-indices to determine the correct register class of the definitions of REG_SEQUENCE. llvm-svn: 104050	2010-05-18 20:03:28 +00:00
Jakob Stoklund Olesen	4843178d6b	Teach the machine code verifier to use getSubRegisterRegClass(). The old approach was wrong. It had an off-by-one error. llvm-svn: 104034	2010-05-18 17:31:12 +00:00
Daniel Dunbar	62bc96a1a5	llc (et al): Add support for --show-encoding and --show-inst. llvm-svn: 104029	2010-05-18 17:22:19 +00:00
Evan Cheng	48f0de96d6	FIX PR7158. SimplifyVBinOp was asserting when it fails to constant fold (op (build_vector), (build_vector)). llvm-svn: 104004	2010-05-18 00:03:40 +00:00
Evan Cheng	1e4f55200d	Fix PR7175. Insert copies of a REG_SEQUENCE source if it is used by other REG_SEQUENCE instructions. llvm-svn: 103994	2010-05-17 23:24:12 +00:00
Bill Wendling	02d3368831	- Set the "HasCalls" flag after instruction selection is finished. - Change the logic DisableFramePointerElim() to check for the -disable-non-leaf-fp-elim before -disable-fp-elim. llvm-svn: 103990	2010-05-17 23:09:50 +00:00
Eric Christopher	9635b3da6b	More data/parsing support for tls directives. Add a few more testcases and cleanup comments as well. llvm-svn: 103985	2010-05-17 22:53:55 +00:00
Evan Cheng	f2c9a96f3c	Fix PR7156. If the sources of a REG_SEQUENCE are all IMPLICIT_DEF's. Replace it with an IMPLICIT_DEF rather than deleting it or else it would be left without a def. llvm-svn: 103984	2010-05-17 22:09:49 +00:00
Jakob Stoklund Olesen	585792738b	Pull the UsedInInstr.test() calls into calcSpillCost() and remember aliases. This fixes the miscompilations of MultiSource/Applications/JM/l{en,de}cod. Clang now successfully self hosts in a debug build with the fast register allocator. llvm-svn: 103975	2010-05-17 21:02:08 +00:00
Eric Christopher	bf79238599	Add some section and constant support for darwin TLS. llvm-svn: 103974	2010-05-17 21:02:07 +00:00
Evan Cheng	29c463862e	Careful with reg_sequence coalescing to not to overwrite sub-register indices. llvm-svn: 103971	2010-05-17 20:57:12 +00:00
Jakob Stoklund Olesen	70563bbba5	Remove debug option. Add comment on spill order determinism. llvm-svn: 103961	2010-05-17 20:01:22 +00:00
Jakob Stoklund Olesen	176a9c4272	Avoid allocating the same physreg to multiple virtregs in one instruction. While that approach works wonders for register pressure, it tends to break everything. This should unbreak the arm-linux builder and fix a number of miscompilations. llvm-svn: 103946	2010-05-17 17:18:59 +00:00
Jakob Stoklund Olesen	f5e8c86424	Minor optimizations. DenseMap::begin() is surprisingly slow on an empty map. llvm-svn: 103940	2010-05-17 15:30:37 +00:00
Jakob Stoklund Olesen	6649cdaa23	Extract spill cost calculation to a new method, and use definePhysReg() to clear out aliases when allocating. Clean up allocVirtReg(). Use calcSpillCost() to allow more aggressive hinting. Now the hint is always taken unless blocked by a reserved register. This leads to more coalescing, lower register pressure, and less spilling. llvm-svn: 103939	2010-05-17 15:30:32 +00:00
Zhongxing Xu	188855abef	Remove unused member variable. llvm-svn: 103936	2010-05-17 09:47:55 +00:00
Jakob Stoklund Olesen	7d22a81b61	Only use clairvoyance when defining a register, and then only if it has one use. This makes allocation independent on the ordering of use-def chains. llvm-svn: 103935	2010-05-17 04:50:57 +00:00
Jakob Stoklund Olesen	f915d14955	Eliminate a hash table probe when killing virtual registers. llvm-svn: 103934	2010-05-17 03:26:09 +00:00
Jakob Stoklund Olesen	edd3d9db13	Execute virtreg kills immediately instead of after processing all uses. This is safe to do because the physreg has been marked UsedInInstr and the kill flag will be set on the last operand using the virtreg if there are more then one. llvm-svn: 103933	2010-05-17 03:26:06 +00:00
Jakob Stoklund Olesen	e07a408afc	Sprinkle superregister <imp-def> and <imp-kill> operands when dealing with subregister indices. llvm-svn: 103931	2010-05-17 02:49:21 +00:00
Jakob Stoklund Olesen	1069a09691	Now that we don't keep live registers across calls, there is not reason to go through the very long list of call-clobbered registers. We just assume all registers are clobbered. llvm-svn: 103930	2010-05-17 02:49:18 +00:00
Jakob Stoklund Olesen	397068de06	Boldly attempt consistent capitalization. Functional changes unintended. llvm-svn: 103929	2010-05-17 02:49:15 +00:00
Jakob Stoklund Olesen	8044c989d1	Spill and kill all virtual registers across a call. Debug code doesn't use callee saved registers anyway, and the code is simpler this way. Now spillVirtReg always kills, and the isKill parameter is not needed. llvm-svn: 103927	2010-05-17 02:07:32 +00:00
Jakob Stoklund Olesen	d2ef1fbc82	Reduce hashtable probes by using DenseMap::insert() for lookup. llvm-svn: 103926	2010-05-17 02:07:29 +00:00
Jakob Stoklund Olesen	fb43e065a4	Make MBB a class member instead of passing it around everywhere. llvm-svn: 103925	2010-05-17 02:07:22 +00:00
Evan Cheng	166a7993ba	Yes, if the redef is a copy, update the old val# with the copy. But make sure to clear the copy field if the redef is not a copy. llvm-svn: 103922	2010-05-17 01:47:47 +00:00
Dale Johannesen	3a366a88f2	Fix uint64->{float, double} conversion to do rounding correctly in 32-bit. The implementation in LegalizeIntegerTypes to handle this as sint64->float + appropriate power of 2 is subject to double rounding, considered incorrect by numerics people. Use this implementation only when it is safe. This leads to using library calls in some cases that produced inline code before, but it's correct now. (EVTToAPFloatSemantics belongs somewhere else, any suggestions?) Add a correctly rounding (though not particularly fast) conversion that uses X87 80-bit computations for x86-32. 7885399, 5901940. This shows up in gcc.c-torture/execute/ieee/rbug.c in the gcc testsuite on some platforms. llvm-svn: 103883	2010-05-15 18:51:12 +00:00
Dale Johannesen	bb4656c05e	Improve assertion messages. llvm-svn: 103882	2010-05-15 18:38:02 +00:00
Chris Lattner	93cd0f1c89	improve portability to systems that don't have powf/modf (e.g. solaris 9) patch by Evzen Muller! llvm-svn: 103876	2010-05-15 17:10:24 +00:00
Chandler Carruth	75142e6bfc	Fix an GCC warning that seems to have actually caught a bug (!!!) in a condition's grouping. Every other use of Allocatable.test(Hint) groups it the same way as it is indented, so move the parentheses to agree with that grouping. llvm-svn: 103869	2010-05-15 10:23:23 +00:00
Jakob Stoklund Olesen	84ce290822	Calculate liveness on the fly for local registers. When working top-down in a basic block, substituting physregs for virtregs, the use-def chains are kept up to date. That means we can recognize a virtreg kill by the use-def chain becoming empty. This makes the fast allocator independent of incoming kill flags. llvm-svn: 103866	2010-05-15 06:09:08 +00:00
Evan Cheng	e26e56e72b	A partial re-def instruction may be a copy. llvm-svn: 103850	2010-05-15 01:35:44 +00:00
Evan Cheng	8c2d062ea6	Teach two-address pass to do some coalescing while eliminating REG_SEQUENCE instructions. e.g. %reg1026<def> = VLDMQ %reg1025<kill>, 260, pred:14, pred:%reg0 %reg1027<def> = EXTRACT_SUBREG %reg1026, 6 %reg1028<def> = EXTRACT_SUBREG %reg1026<kill>, 5 ... %reg1029<def> = REG_SEQUENCE %reg1028<kill>, 5, %reg1027<kill>, 6, %reg1028, 7, %reg1027, 8, %reg1028, 9, %reg1027, 10, %reg1030<kill>, 11, %reg1032<kill>, 12 After REG_SEQUENCE is eliminated, we are left with: %reg1026<def> = VLDMQ %reg1025<kill>, 260, pred:14, pred:%reg0 %reg1029:6<def> = EXTRACT_SUBREG %reg1026, 6 %reg1029:5<def> = EXTRACT_SUBREG %reg1026<kill>, 5 The regular coalescer will not be able to coalesce reg1026 and reg1029 because it doesn't know how to combine sub-register indices 5 and 6. Now 2-address pass will consult the target whether sub-registers 5 and 6 of reg1026 can be combined to into a larger sub-register (or combined to be reg1026 itself as is the case here). If it is possible, it will be able to replace references of reg1026 with reg1029 + the larger sub-register index. llvm-svn: 103835	2010-05-14 23:21:14 +00:00
Dan Gohman	88fb253562	Fast ISel trivially coalesces away no-op casts, so check for this when setting kill flags. llvm-svn: 103832	2010-05-14 22:53:18 +00:00
Jakob Stoklund Olesen	089e9421d2	Don't bother spilling before a return llvm-svn: 103831	2010-05-14 22:40:43 +00:00
Jakob Stoklund Olesen	cdef6bc8de	RegAllocLocal can count copies too llvm-svn: 103830	2010-05-14 22:40:40 +00:00
Jakob Stoklund Olesen	b16013936b	Track allocatable instead of reserved regs, and never take an unallocatable hint. llvm-svn: 103828	2010-05-14 22:02:56 +00:00
Dan Gohman	2f277c866d	Don't set kill flags for instructions which the scheduler has cloned. llvm-svn: 103827	2010-05-14 22:01:14 +00:00
Jakob Stoklund Olesen	e68b814c8c	Avoid scanning the long tail of physreg operands on calls llvm-svn: 103823	2010-05-14 21:55:52 +00:00
Devang Patel	36debf8046	Do not forget to mark prcessed arguments. llvm-svn: 103822	2010-05-14 21:55:50 +00:00
Jakob Stoklund Olesen	6c038e33e9	Count coalesced copies llvm-svn: 103821	2010-05-14 21:55:50 +00:00
Jakob Stoklund Olesen	33af4fcdea	Allow virtreg redefines when verifying for RegAllocFast llvm-svn: 103820	2010-05-14 21:55:44 +00:00
Jim Grosbach	866b74ba8b	Remove trailing whitespace llvm-svn: 103807	2010-05-14 21:20:46 +00:00
Jim Grosbach	d772bdeb7e	80 column and trailing whitespace cleanup llvm-svn: 103806	2010-05-14 21:19:48 +00:00
Jim Grosbach	25749ad5c2	add cmd line option to leave dbgvalues in during post-RA sceduling. Useful while debugging what's mishandled about them in the post-RA pass. llvm-svn: 103805	2010-05-14 21:18:04 +00:00
Bill Wendling	95f6ebcb37	Rename "HasCalls" in MachineFrameInfo to "AdjustsStack" to better describe what the variable actually tracks. N.B., several back-ends are using "HasCalls" as being synonymous for something that adjusts the stack. This isn't 100% correct and should be looked into. llvm-svn: 103802	2010-05-14 21:14:32 +00:00
Devang Patel	e0a94bfe9f	Add support to preserve type info for the variables that are removed by the optimizer. llvm-svn: 103798	2010-05-14 21:01:35 +00:00
Jakob Stoklund Olesen	670492c8ee	When verifying two-address instructions, check the following: - Kill is implicit when use and def registers are identical. - Only virtual registers can differ. Add a -verify-fast-regalloc to run the verifier before the fast allocator. llvm-svn: 103797	2010-05-14 20:28:32 +00:00
Jakob Stoklund Olesen	4d5c1061e3	Simplify the handling of physreg defs and uses in RegAllocFast. This adds extra security against using clobbered physregs, and it adds kill markers to physreg uses. llvm-svn: 103784	2010-05-14 18:03:25 +00:00
Daniel Dunbar	148e876ac2	XFAIL the test I added with vg_leak, apparently it is the first and only llc -filetype=obj test, and -filetype=obj leaks a few objects. Added a FIXME, we need to sort out the ownership model for the various MC objects. llvm-svn: 103769	2010-05-14 07:47:51 +00:00
Daniel Dunbar	3439ed6324	Inline Asm: Ensure buffer is newline terminated to match how the text is printed. - This is a hack, but I can't decide the best place to handle this. Chris? llvm-svn: 103765	2010-05-14 04:31:50 +00:00
Jakob Stoklund Olesen	ceb5a7ada2	Enable opportunistic coalescing llvm-svn: 103764	2010-05-14 04:30:51 +00:00
Jakob Stoklund Olesen	68c235bd4d	Trust kill flags from isel and later passes. llvm-svn: 103748	2010-05-14 00:02:23 +00:00
Jakob Stoklund Olesen	41f8dc897e	Fix an embarrassing runtime regression for RegAllocFast. This loop is quadratic in the capacity for a DenseMap: while(!map.empty()) map.erase(map.begin()); Instead we now do a normal begin() - end() iteration followed by map.clear(). That also has the nice sideeffect of shrinking the map capacity on demand. llvm-svn: 103747	2010-05-14 00:02:20 +00:00
Dale Johannesen	1ae94b9394	Implement a correct ui64->f32 conversion. The old one was subject to double rounding in extreme cases. llvm-svn: 103744	2010-05-13 23:50:42 +00:00
Jakob Stoklund Olesen	d74a564feb	Clean up RegAllocFast debug output llvm-svn: 103739	2010-05-13 20:43:17 +00:00
Dan Gohman	c90f51c00b	Teach MachineLICM and MachineSink how to clear kill flags conservatively when they move instructions. llvm-svn: 103737	2010-05-13 20:34:42 +00:00
Dan Gohman	7767d2747b	Add a utility function for conservatively clearing kill flags, and make use of it in MachineCSE. llvm-svn: 103726	2010-05-13 19:24:00 +00:00
Dan Gohman	5b510c1474	An Instruction has a trivial kill only if its use is in the same basic block. llvm-svn: 103725	2010-05-13 19:19:32 +00:00
Jakob Stoklund Olesen	0ba2e2a568	Take allocation hints from copy instructions to/from physregs. This causes way more identity copies to be generated, ripe for coalescing. llvm-svn: 103686	2010-05-13 00:19:43 +00:00
Jakob Stoklund Olesen	680b74941f	More asserts around physreg uses llvm-svn: 103685	2010-05-13 00:19:39 +00:00
Evan Cheng	4aab8b5425	If REG_SEQUENCE source is livein, copy it first. Also, update livevariables information when a copy is introduced. llvm-svn: 103680	2010-05-13 00:00:35 +00:00
Evan Cheng	ecf0166012	Do not attempt copy coalescing if the source and dest sub-register indices do not match. llvm-svn: 103679	2010-05-12 23:59:42 +00:00
Jakob Stoklund Olesen	955a0e71e9	Make sure to add kill flags to the last use of a virtreg when it is redefined. The X86 floating point stack pass and others depend on good kill flags. llvm-svn: 103635	2010-05-12 18:46:03 +00:00
Duncan Sands	2576db727b	Remove unused variable. Tweak a comment while there. llvm-svn: 103586	2010-05-12 07:11:33 +00:00
Nathan Jeffords	76a07580ad	updated support for the COFF .linkonce Now, the .linkonce directive is emitted as part of MCSectionCOFF::PrintSwitchToSection instead of AsmPrinter::EmitLinkage since it is an attribute of the section the symbol was placed into not the symbol itself. llvm-svn: 103568	2010-05-12 04:26:09 +00:00
Evan Cheng	d593448643	Teach local regalloc about virtual registers with sub-indices. llvm-svn: 103539	2010-05-12 01:29:36 +00:00
Evan Cheng	0c6ebc7d95	Code clean up. llvm-svn: 103538	2010-05-12 01:27:49 +00:00
Jakob Stoklund Olesen	f98a355f9b	Avoid scoping issues, fix buildbots llvm-svn: 103530	2010-05-12 00:11:19 +00:00
Dan Gohman	1a1b51ff59	Add initial kill flag support to FastISel. llvm-svn: 103529	2010-05-11 23:54:07 +00:00
Daniel Dunbar	69b8f42400	Make Clang happy. llvm-svn: 103528	2010-05-11 23:53:13 +00:00
Jakob Stoklund Olesen	11f1ba1535	Store the Dirty bit in the LiveReg structure instead of a bit vector. llvm-svn: 103522	2010-05-11 23:24:47 +00:00
Jakob Stoklund Olesen	132668102e	Keep track of the last place a live virtreg was used. This allows us to add accurate kill markers, something the scavenger likes. Add some more tests from ARM that needed this. llvm-svn: 103521	2010-05-11 23:24:45 +00:00
Dan Gohman	afd2b8bbb7	Don't set kill flags on uses of CopyFromReg nodes. InstrEmitter doesn't create separate virtual registers for CopyFromReg values, so uses of them don't necessarily kill the value. llvm-svn: 103519	2010-05-11 21:59:14 +00:00
Jakob Stoklund Olesen	f25be99109	Silence warning llvm-svn: 103508	2010-05-11 20:51:04 +00:00
Jakob Stoklund Olesen	3f0241e0f9	Simplify the tracking of used physregs to a bulk bitor followed by a transitive closure after allocating all blocks. Add a few more test cases for -regalloc=fast. llvm-svn: 103500	2010-05-11 20:30:28 +00:00
Duncan Sands	6c5e4355bb	I got tired of VISIBILITY_HIDDEN colliding with the gcc enum. Rename it to LLVM_LIBRARY_VISIBILITY and introduce LLVM_GLOBAL_VISIBILITY, which is the opposite, for future use by dragonegg. llvm-svn: 103495	2010-05-11 20:16:09 +00:00
Dan Gohman	9132c59d43	Trim #includes and forward declarations. llvm-svn: 103489	2010-05-11 19:11:43 +00:00
Jakob Stoklund Olesen	f1b3029a54	Mostly rewrite RegAllocFast. Sorry for the big change. The path leading up to this patch had some TableGen changes that I didn't want to commit before I knew they were useful. They weren't, and this version does not need them. The fast register allocator now does no liveness calculations. Instead it relies on kill flags provided by isel. (Currently those kill flags are also ignored due to isel bugs). The allocation algorithm is supposed to work with any subset of valid kill flags. More kill flags simply means fewer spills inserted. Registers are allocated from a working set that contains no aliases. That means most allocations can be done directly without expensive alias checks. When the working set runs out of registers we do the full alias check to find new free registers. llvm-svn: 103488	2010-05-11 18:54:45 +00:00
Dan Gohman	bb919dfb6b	Implement a bunch more TargetSelectionDAGInfo infrastructure. Move EmitTargetCodeForMemcpy, EmitTargetCodeForMemset, and EmitTargetCodeForMemmove out of TargetLowering and into SelectionDAGInfo to exercise this. llvm-svn: 103481	2010-05-11 17:31:57 +00:00
Douglas Gregor	6739a89117	Fixes for Microsoft Visual Studio 2010, from Steven Watanabe! llvm-svn: 103457	2010-05-11 06:17:44 +00:00

... 12 13 14 15 16 ...

11057 Commits