llvm-project

Commit Graph

Author	SHA1	Message	Date
Nadav Rotem	b521b6037b	Cleanup the trunc-store legalization code and add asserts. llvm-svn: 141659	2011-10-11 10:04:25 +00:00
Devang Patel	478d5bc0d0	Revert r141569 and r141576. llvm-svn: 141594	2011-10-10 23:18:02 +00:00
Jakob Stoklund Olesen	add0c43ebb	Give targets a chance to expand even standard pseudos. Allow targets to expand COPY and other standard pseudo-instructions before they are expanded with copyPhysReg(). This allows the target to examine the COPY instruction for extra operands indicating it can be widened to a preferable super-register copy. See the ARM -widen-vmovs option. llvm-svn: 141578	2011-10-10 20:34:28 +00:00
Devang Patel	2689f95875	If loop header is also loop exiting block then it may not be safe to hoist instructions. llvm-svn: 141576	2011-10-10 20:32:03 +00:00
Devang Patel	e554d5995b	Add dominance check for the instruction being hoisted. For example, MachineLICM should not hoist a load that is not guaranteed to be executed. Radar 10254254. llvm-svn: 141569	2011-10-10 19:09:20 +00:00
Bill Wendling	e9574be6a3	Use the code that lowers the arguments and spills any values which are alive across unwind edges. This is for the back-end which expects such things. The code is from the original SjLj EH pass. llvm-svn: 141463	2011-10-08 00:56:47 +00:00
Bill Wendling	7ecfbd90ef	Thread the chain through the eh.sjlj.setjmp intrinsic, like it's documented to do. This will be useful later on with the new SJLJ stuff. llvm-svn: 141416	2011-10-07 21:25:38 +00:00
Andrew Trick	35c9e51219	PostRA scheduler fix. Clear stale loop dependencies. Fixes <rdar://problem/10235725> llvm-svn: 141357	2011-10-07 06:33:09 +00:00
Andrew Trick	4ef158335b	whitespace llvm-svn: 141356	2011-10-07 06:27:02 +00:00
Eli Friedman	1456cd20b4	Remove the old atomic intrinsics. Autoupgrade functionality is included with this patch. llvm-svn: 141333	2011-10-06 23:20:49 +00:00
Bill Wendling	267f323d28	Modify the mapping from landing pad to call sites to accept more than one call site. llvm-svn: 141226	2011-10-05 22:24:35 +00:00
Bill Wendling	c2d55b6e50	Add an ivar that maps a landing pad's EH symbol to the call sites that may jump to the landing pad. This will be used by the back-end to generate the jump tables for dispatching the arriving longjmp in sjlj eh. llvm-svn: 141224	2011-10-05 22:20:38 +00:00
Bill Wendling	e61c62533e	Small refactoring. Cache the FunctionInfo->MBB into a local variable. llvm-svn: 141221	2011-10-05 22:16:11 +00:00
Jakob Stoklund Olesen	eb38bd8ced	Fix sub-register operand verification. PhysReg operands are not allowed to have sub-register indices at all. For virtual registers with sub-reg indices, check that all registers in the register class support the sub-reg index. llvm-svn: 141220	2011-10-05 22:12:57 +00:00
Bill Wendling	db1633530a	Fix comment to reflect the new EH stuff. llvm-svn: 141218	2011-10-05 22:04:08 +00:00
Jakob Stoklund Olesen	3abead76ea	Remove unused DstSubIdx argument. llvm-svn: 141214	2011-10-05 21:22:53 +00:00
Jakob Stoklund Olesen	f7957a9819	Simplify EXTRACT_SUBREG emission. EXTRACT_SUBREG is emitted as %dst = COPY %src:sub, so there is no need to constrain the %dst register class. RegisterCoalescer will apply the necessary constraints if it decides to eliminate the COPY. The %src register class does need to be constrained to something with the right sub-registers, though. This is currently done manually with COPY_TO_REGCLASS nodes. They can possibly be removed after this patch. llvm-svn: 141207	2011-10-05 20:26:40 +00:00
Jakob Stoklund Olesen	8ff52c4135	Simplify INSERT_SUBREG emission. The register class created by INSERT_SUBREG and SUBREG_TO_REG must be legal and support the SubIdx sub-registers. The new getSubClassWithSubReg() hook can compute that. This may create INSERT_SUBREG instructions defining a larger register class than the sub-register being inserted. That is OK, RegisterCoalescer will constrain the register class as needed when it eliminates the INSERT_SUBREG instructions. llvm-svn: 141198	2011-10-05 18:31:00 +00:00
Jakob Stoklund Olesen	ccdfbfb5e5	Add a FIXME. TwoAddressInstructionPass should annotate instructions with <undef> flags when it lowers REG_SEQUENCE instructions. LiveIntervals should not be in the business of modifying code (except for kill flags, perhaps). llvm-svn: 141187	2011-10-05 16:51:21 +00:00
Jakob Stoklund Olesen	d5d39bb098	Also add <imp-use,kill> flags for redefined super-registers. For example: %vreg10:dsub_0<def,undef> = COPY %vreg1 %vreg10:dsub_1<def> = COPY %vreg2 is rewritten as: %D2<def> = COPY %D0, %Q1<imp-def> %D3<def> = COPY %D1, %Q1<imp-use,kill>, %Q1<imp-def> The first COPY doesn't care about the previous value of %Q1, so it doesn't read that register. The second COPY is a partial redefinition of %Q1, so it implicitly kills and redefines that register. This makes it possible to recognize instructions that can harmlessly clobber the full super-register. They write and don't read the super-register. llvm-svn: 141139	2011-10-05 00:01:48 +00:00
Jakob Stoklund Olesen	9d5bda9be1	Also add <def,undef> flags when coalescing sub-registers. RegisterCoalescer can create sub-register defs when it is joining a register with a sub-register. Add <undef> flags to these new sub-register defs where appropriate. llvm-svn: 141138	2011-10-05 00:01:46 +00:00
Owen Anderson	0ca562ec4c	Teach the MC to output code/data region marker labels in MachO and ELF modes. These are used by disassemblers to provide better disassembly, particularly on targets like ARM Thumb that like to intermingle data in the TEXT segment. llvm-svn: 141135	2011-10-04 23:26:17 +00:00
Bill Wendling	3d11aa7e75	Create a mapping between the landing pad basic block and the call site index for later use. llvm-svn: 141125	2011-10-04 22:00:35 +00:00
Jakob Stoklund Olesen	10f2de3261	Allow <undef> flags on def operands as well as uses. The <undef> flag says that a MachineOperand doesn't read its register, or doesn't depend on the previous value of its register. A full register def never depends on the previous register value. A partial register def may depend on the previous value if it is intended to update part of a register. For example: %vreg10:dsub_0<def,undef> = COPY %vreg1 %vreg10:dsub_1<def> = COPY %vreg2 The first copy instruction defines the full %vreg10 register with the bits not covered by dsub_0 defined as <undef>. It is not considered a read of %vreg10. The second copy modifies part of %vreg10 while preserving the rest. It has an implicit read of %vreg10. This patch adds a MachineOperand::readsReg() method to determine if an operand reads its register. Previously, this was modelled by adding a full-register <imp-def> operand to the instruction. This approach makes it possible to determine directly from a MachineOperand if it reads its register. No scanning of MI operands is required. llvm-svn: 141124	2011-10-04 21:49:33 +00:00
Bill Wendling	ac3fb4c078	Generic cleanup. llvm-svn: 141050	2011-10-04 00:16:40 +00:00
Bill Wendling	97a8695fff	Don't carry over the dispatchsetup hack from the old system. llvm-svn: 141040	2011-10-03 22:42:40 +00:00
Bill Wendling	6f3e73d6ad	Move the grabbing of the jump buffer into the caller function, eliminating the need for returning a std::pair. llvm-svn: 141026	2011-10-03 21:15:28 +00:00
Eric Christopher	cead033ced	Whitespace. llvm-svn: 141005	2011-10-03 15:49:20 +00:00
Eric Christopher	f84354bfb1	Typo. llvm-svn: 141004	2011-10-03 15:49:16 +00:00
Nadav Rotem	52e8ed9214	Moved type construction out of the loop and added an assert on the legality of the type. Formatted lines to the 80 char limit. llvm-svn: 140952	2011-10-01 18:39:28 +00:00
Bill Wendling	9925f197cc	When inferring the pointer alignment, if the global doesn't have an initializer and the alignment is 0 (i.e., it's defined globally in one file and declared in another file) it could get an alignment which is larger than the ABI allows for that type, resulting in aligned moves being used for unaligned loads. For instance, in file A.c: struct S s; In file B.c: struct { // something long }; extern S s; void foo() { struct S p = s; // ... } this copy is a 'memcpy' which is turned into a series of 'movaps' instructions on X86. But this is wrong, because 'struct S' has alignment of 4, not 16. llvm-svn: 140902	2011-09-30 23:19:55 +00:00
Nick Lewycky	f40df1d46c	Promote comment to doxycomment. Adjust whitespace. No functionality change. llvm-svn: 140899	2011-09-30 22:19:53 +00:00
Jakob Stoklund Olesen	1352be2bd3	Move getCommonSubClass() into TRI. It will soon need the context. llvm-svn: 140896	2011-09-30 22:18:51 +00:00
Torok Edwin	be5020eb95	Comment grammar fixes. Thanks to Duncan. llvm-svn: 140850	2011-09-30 13:07:47 +00:00
Torok Edwin	319a1415b8	Instead of crashing when MCAsmInfo is NULL, add an assert. This helps with porting code from 2.9 to 3.0 as TargetSelect.h changed location, and if you include the old one by accident you will trigger this assert. llvm-svn: 140848	2011-09-30 12:31:57 +00:00
Eli Friedman	95031ed837	Clean up uses of switch instructions so they are not dependent on the operand ordering. Patch by Stepan Dyatkovskiy. llvm-svn: 140803	2011-09-29 20:21:17 +00:00
Duncan Sands	cac86805bf	Place this bracket according to the LLVM style. llvm-svn: 140784	2011-09-29 16:01:46 +00:00
Jakob Stoklund Olesen	463b05a2d0	Remove NumImplicitOps which is now unused. llvm-svn: 140767	2011-09-29 01:47:36 +00:00
Eric Christopher	d299dccf91	Use the local we already set up. llvm-svn: 140745	2011-09-29 00:50:59 +00:00
Jakob Stoklund Olesen	2318d1e0e9	Rewrite MachineInstr::addOperand() to avoid NumImplicitOps. The function needs to scan the implicit operands anyway, so no performance is won by caching the number of implicit operands added to an instruction. This also fixes a bug when adding operands after an implicit operand has been added manually. The NumImplicitOps count wasn't kept up to date. MachineInstr::addOperand() will now consistently place all explicit operands before all the implicit operands, regardless of the order they are added. It is possible to change an MI opcode and add additional explicit operands. They will be inserted before any existing implicit operands. The only exception is inline asm instructions where operands are never reordered. This is because of a hack that marks explicit clobber regs on inline asm as <implicit-def> to please the fast register allocator. This hack can go away when InstrEmitter and FastIsel can add exact <dead> flags to physreg defs. llvm-svn: 140744	2011-09-29 00:40:51 +00:00
Bill Wendling	899da52d60	Have the SjLjEHPrepare pass do some more heavy lifting. Upon further review, most of the EH code should remain written at the IR level. The part which breaks SSA form is the dispatch table, so that part will be moved to the back-end. llvm-svn: 140730	2011-09-28 21:56:53 +00:00
Duncan Sands	2e67937f76	A typeid of zero means a cleanup, not a catch. This case occurs when there is both a catch and a cleanup. Correct the comment. llvm-svn: 140686	2011-09-28 09:13:02 +00:00
Bill Wendling	baf3941fde	Strip off pointer casts when looking at the eh.sjlj.functioncontext's argument. llvm-svn: 140678	2011-09-28 03:52:41 +00:00
Bill Wendling	225e8481b0	Bitcast the alloca to an i8* to match the intrinsic's signature. llvm-svn: 140677	2011-09-28 03:47:11 +00:00
Bill Wendling	66b110f571	Create and use an llvm.eh.sjlj.functioncontext intrinsic. This intrinsic is used to pass the index of the function context to the back-end for further processing. The back-end is in charge of filling in the rest of the entries. llvm-svn: 140676	2011-09-28 03:36:43 +00:00
Bill Wendling	2e76ca9d9a	In the new EH model, setup the function context and the call site info. The DWARF exception pass uses the call site information, which is set up here. A pre-RA pass is too late for it to use this information. So create and setup the function context here, and then insert the call site values here (and map the call sites for the DWARF EH pass). This is simpler than the original pass, and doesn't make the CFG lose its SSA-ness. It's a win-win-win-win-lose-win-win situation. llvm-svn: 140675	2011-09-28 03:14:05 +00:00
Bill Wendling	e6138e3ad1	Don't conditionalize execution of the SjLj EH prepare pass. We may need an SjLj EH preparation pass for some call site information, at least in the short term. llvm-svn: 140674	2011-09-28 03:07:34 +00:00
Jakob Stoklund Olesen	bd5109f14d	Rename class and clean up source. No functional change intended. llvm-svn: 140664	2011-09-28 00:01:56 +00:00
Jakob Stoklund Olesen	934b7d7645	Rename SSEDomainFix -> lib/CodeGen/ExecutionDepsFix. I'll clean up the source in the next commit. llvm-svn: 140663	2011-09-28 00:01:54 +00:00
Bill Wendling	354ff9e348	This is the start of the new SjLj EH preparation pass, which will replace the current IR-level pass. The old SjLj EH pass has some problems, especially with the new EH model. Most significantly, it violates some of the new restrictions the new model has. For instance, the 'dispatch' table wants to jump to the landing pad, but we cannot allow that because only an invoke's unwind edge can jump to a landing pad. This requires us to mangle the code something awful. In addition, we need to keep the now dead landingpad instructions around instead of CSE'ing them because the DWARF emitter uses that information (they are dead because no control flow edge will execute them - the control flow edge from an invoke's unwind is superseded by the edge coming from the dispatch). Basically, this pass belongs not at the IR level where SSA is king, but at the code-gen level, where we have more flexibility. llvm-svn: 140646	2011-09-27 22:14:12 +00:00
Cameron Zwarich	7a6e8f2c5d	Remove an invalid assert that is really just asserting when the scheduler emits a suboptimal schedule. llvm-svn: 140643	2011-09-27 21:59:16 +00:00
Jim Grosbach	af136f71ec	Rename AddSelectionDAGCSEId() to addSelectionDAGCSEId(). Naming conventions consistency. No functional change. llvm-svn: 140636	2011-09-27 20:59:33 +00:00
Nadav Rotem	38b3b83362	Cleanup PromoteIntOp_EXTRACT_VECTOR_ELT and PromoteIntRes_SETCC. Add a new method: getAnyExtOrTrunc and use it to replace the manual check. llvm-svn: 140603	2011-09-27 11:16:47 +00:00
Nadav Rotem	1b857d2762	Revert r140463; The patch assumes that <4 x i1> is saved to memory as 4 x i8, while the decision is to bit-pack small values. llvm-svn: 140601	2011-09-27 10:48:29 +00:00
James Molloy	0ceb8cadd2	Fix emission of debug data for global variables. getContext() on DIGlobalVariables is not valid any more. llvm-svn: 140539	2011-09-26 17:40:42 +00:00
Jakob Stoklund Olesen	df977fedb6	Add target hook for pseudo instruction expansion. Many targets use pseudo instructions to help register allocation. Like the COPY instruction, these pseudos can be expanded after register allocation. The early expansion can make life easier for PEI and the post-ra scheduler. This patch adds a hook that is called for all remaining pseudo instructions from the ExpandPostRAPseudos pass. llvm-svn: 140472	2011-09-25 19:21:35 +00:00
Nadav Rotem	2279949129	[vector-select] Address one of the issues in pr10902. EXTRACT_VECTOR_ELEMENT SDNodes may return values which are wider than the incoming element types. In this patch we fix the integer promotion of these nodes. Fixes spill-q.ll when running -promote-elements. llvm-svn: 140471	2011-09-25 18:59:42 +00:00
Jakob Stoklund Olesen	fd719d184e	Clean up code after renaming LowerSubregs -> ExpandPostRAPseudos. No functional change intended. llvm-svn: 140470	2011-09-25 16:46:08 +00:00
Jakob Stoklund Olesen	f152df1e6b	Rename LowerSubregs to ExpandPostRAPseudos. I'll fix the file contents in the next commit. This pass is currently expanding the COPY and SUBREG_TO_REG pseudos. I am going to add a hook so targets can expand more pseudo-instructions after register allocation. Many targets have pseudo-instructions that assist the register allocator. They can be expanded after register allocation, before PEI and PostRA scheduling. llvm-svn: 140469	2011-09-25 16:46:00 +00:00
Nadav Rotem	c2deabd202	Implement Duncan's suggestion to use the result of getSetCCResultType if it is legal (this is always the case for scalars), otherwise use the promoted result type. Fix test/CodeGen/X86/vsplit-and.ll when promote-elements is enabled. llvm-svn: 140464	2011-09-24 19:48:19 +00:00
Nadav Rotem	77426a754b	[Vector-Select] Address one of the problems in 10902. When generating the trunc-store of i1's, we need to use the vector type and not the scalar type. This patch fixes the assertion in CodeGen/Generic/bool-vector.ll when running with -promote-elements. llvm-svn: 140463	2011-09-24 18:32:19 +00:00
Jakob Stoklund Olesen	3bb99bc957	Verify that terminators follow non-terminators. This exposes a -segmented-stacks bug. llvm-svn: 140429	2011-09-23 22:45:39 +00:00
Eli Friedman	8a15a5aa93	PR10998: It is not legal to sink an instruction past the terminator of a block; make sure we don't do that. llvm-svn: 140428	2011-09-23 22:41:57 +00:00
Duncan Sands	b461176cfb	Tweak the handling of MERGE_VALUES nodes: remove the need for DecomposeMERGE_VALUES to "know" that results are legalized in a particular order, by passing it the number of the result being legalized (the type legalization core provides this, it just needs to be passed on). llvm-svn: 140373	2011-09-23 13:59:22 +00:00
Nadav Rotem	57e30726ad	Vector-Select: Address one of the problems in pr10902. Add handling for the integer-promotion of CONCAT_VECTORS. Test: test/CodeGen/X86/widen_shuffle-1.ll This patch fixes the above tests (when running with -promote-elements). llvm-svn: 140372	2011-09-23 09:33:24 +00:00
Dan Gohman	e83e1b2d2c	Fix SimplifySelectCC to add newly created nodes to the DAGCombiner worklist, as it may be possible to perform further optimization on them. llvm-svn: 140349	2011-09-22 23:01:29 +00:00
Jakob Stoklund Olesen	e92e5ee81f	Constrain register classes instead of emitting copies. Sometimes register class constraints are trivial, like GR32->GR32_NOSP, or GPR->rGPR. Teach InstrEmitter to simply constrain the virtual register instead of emitting a copy in these cases. Normally, these copies are handled by the coalescer. This saves some coalescer work. llvm-svn: 140340	2011-09-22 21:39:34 +00:00
Jakob Stoklund Olesen	0f36544c08	Add a MinNumRegs argument to MRI::constrainRegClass(). The function will refuse to use a register class with fewer registers than MinNumRegs. This can be used by clients to avoid accidentally increasing register pressure too much. The default value of MinNumRegs=0 doesn't affect how constrainRegClass() works. llvm-svn: 140339	2011-09-22 21:39:31 +00:00
Bill Wendling	a58fde665a	Use the C personality function instead of the C++ personality function. llvm-svn: 140318	2011-09-22 17:56:40 +00:00
Devang Patel	5e6b65cf0d	Do not unnecessarily use AT_specification DIE because it does not add any value. A few weeks ago, llvm completely inverted the debug info graph. Earlier each debug info node used to keep track of its compile unit, now compile unit keeps track of important nodes. One impact of this change is that the global variables do not have any context, which should be checked before deciding to use AT_specification DIE. llvm-svn: 140282	2011-09-21 23:41:11 +00:00
Bill Wendling	7b3fc8ee38	Attempt to update the shadow stack GC pass to the new EH model. This inserts a cleanup landingpad instruction and a resume to mimic the old unwind instruction. llvm-svn: 140277	2011-09-21 22:14:28 +00:00
Jim Grosbach	098f5a2911	Tidy up. Whitespace. llvm-svn: 140275	2011-09-21 21:36:53 +00:00
Nadav Rotem	bc9ba30158	[VECTOR-SELECT] Address one of the bugs in pr10902. Vector SetCC result types need to be type-legalized. This code worked before because scalar result types are known to be legal. llvm-svn: 140249	2011-09-21 14:34:38 +00:00
Andrew Trick	924123acb3	Lower ARM adds/subs to add/sub after adding optional CPSR operand. This is still a hack until we can teach tblgen to generate the optional CPSR operand rather than an implicit CPSR def. But the strangeness is now limited to the selection DAG. ADD/SUB MI's no longer have implicit CPSR defs, nor do we allow flag setting variants of these opcodes in machine code. There are several corner cases to consider, and getting one wrong would previously lead to nasty miscompilation. It's not the first time I've debugged one, so this time I added enough verification to ensure it won't happen again. llvm-svn: 140228	2011-09-21 02:20:46 +00:00
Bruno Cardoso Lopes	6cb23f6e7f	Add a DAGCombine for subvector extracts to remove useless chains of subvector inserts and extracts. Initial patch by Rackover, Zvi with some tweaks done by me. llvm-svn: 140204	2011-09-20 23:19:33 +00:00
Andrew Trick	52363bdbeb	Restore hasPostISelHook tblgen flag. No functionality change. The hook makes it explicit which patterns require "special" handling. i.e. it self-documents tblgen deficiencies. I plan to add verification in ExpandISelPseudos and Thumb2SizeReduce to catch any missing hasPostISelHooks. Otherwise it's too fragile. llvm-svn: 140160	2011-09-20 18:22:31 +00:00
Andrew Trick	8586e62d91	ARM isel bug fix for adds/subs operands. Modified ARMISelLowering::AdjustInstrPostInstrSelection to handle the full gamut of CPSR defs/uses including instructions whose "optional" cc_out operand is not really optional. This allowed removal of the hasPostISelHook to simplify the .td files and make the implementation more robust. Fixes rdar://10137436: sqlite3 miscompile llvm-svn: 140134	2011-09-20 03:17:40 +00:00
Andrew Trick	53df4b6dfa	whitespace llvm-svn: 140133	2011-09-20 03:06:13 +00:00
Nadav Rotem	7aaa0aa7a7	white space cleanups llvm-svn: 139994	2011-09-18 10:29:29 +00:00
Benjamin Kramer	67b014b2c2	Namespacify. llvm-svn: 139892	2011-09-16 00:35:06 +00:00
Jakob Stoklund Olesen	e2c92a3112	Spill mode: Hoist back-copies locally. The leaveIntvAfter() function normally inserts a back-copy after the requested instruction, making the back-copy kill the live range. In spill mode, try to insert the back-copy before the last use instead. That means the last use becomes the kill instead of the back-copy. This lowers the register pressure because the last use can now redefine the same register it was reading. This will also improve compile time: The back-copy isn't a kill, so hoisting it in hoistCopiesForSize() won't force a recomputation of the source live range. Similarly, if the back-copy isn't hoisted by the splitter, the spiller will not attempt hoisting it locally. llvm-svn: 139883	2011-09-16 00:03:35 +00:00
Jakob Stoklund Olesen	e8339b2e63	Disable local spill hoisting for non-killing copies. If the source register is live after the copy being spilled, there is no point to hoisting it. Hoisting inside a basic block only serves to resolve interferences by shortening the live range of the source. llvm-svn: 139882	2011-09-16 00:03:33 +00:00
Eli Friedman	ee8f14a799	Some legalization fixes for atomic load and store. llvm-svn: 139851	2011-09-15 21:20:49 +00:00
Jakob Stoklund Olesen	bceb9e5c05	Add an option to disable spill hoisting. When -split-spill-mode is enabled, spill hoisting is performed by SplitKit instead of by InlineSpiller. This hidden command line option is for testing the splitter spill mode. llvm-svn: 139845	2011-09-15 21:06:00 +00:00
Jakob Stoklund Olesen	53e2e48de7	VirtRegMap is counting spill slots, not register spills. Fix the stats counters to reflect that. llvm-svn: 139819	2011-09-15 18:31:13 +00:00
Jakob Stoklund Olesen	c94c967656	Count correctly when a COPY turns into a spill or reload. The number of spills could go negative since a folded COPY is just a spill, and it may be eliminated. llvm-svn: 139815	2011-09-15 18:22:52 +00:00
Jakob Stoklund Olesen	37eb6962c6	Count inserted spills and reloads more accurately. Adjust counters when removing spill and reload instructions. We still don't account for reloads being removed by eliminateDeadDefs(). llvm-svn: 139806	2011-09-15 17:54:28 +00:00
Jakob Stoklund Olesen	07b3503f8b	Trace through sibling PHIs in bulk. When traceSiblingValue() encounters a PHI-def value created by live range splitting, don't look at all the predecessor blocks. That can be very expensive in a complicated CFG. Instead, consider that all the non-PHI defs jointly dominate all the PHI-defs. Tracing directly to all the non-PHI defs is much faster than zipping around in the CFG when there are many PHIs with many predecessors. This significantly improves compile time for indirectbr interpreters. llvm-svn: 139797	2011-09-15 16:41:12 +00:00
Jakob Stoklund Olesen	b8b1d4c435	Speed up LiveIntervals::shrinkToUses with some caching. Blocks with multiple PHI successors only need to go on the worklist once. Use a SmallPtrSet to track the live-out blocks that have already been handled. This is a lot faster than the two live range checks we would otherwise do. Also stop recomputing hasPHIKill flags. Like RenumberValues(), it is conservatively correct to leave them in, and they are not used for anything important. llvm-svn: 139792	2011-09-15 15:24:16 +00:00
Jakob Stoklund Olesen	fb75d78d33	Revert r139782, "RemoveCopyByCommutingDef doesn't need hasPHIKill()." It does, after all. RemoveCopyByCommutingDef rewrites the uses of one particular value number in A. It doesn't know how to rewrite phi uses, so there can't be any. llvm-svn: 139787	2011-09-15 06:27:32 +00:00
Jakob Stoklund Olesen	4c099551f9	Stop verifying hasPHIKill() flags. There is only one legitimate use remaining, in addIntervalsForSpills(). All other calls to hasPHIKill() are only used to update PHIKill flags. The addIntervalsForSpills() function is part of the old spilling framework, only used by linearscan. llvm-svn: 139783	2011-09-15 05:16:30 +00:00
Jakob Stoklund Olesen	0499e7bbd0	RemoveCopyByCommutingDef doesn't need hasPHIKill(). Instead, let HasOtherReachingDefs() test for defs in B that overlap any phi-defs in A as well. This test is slightly different, but almost identical. A perfectly precise test would only check those phi-defs in A that are reachable from AValNo. llvm-svn: 139782	2011-09-15 05:03:50 +00:00
Jakob Stoklund Olesen	dca022e377	It is safe to remat a value killed by phis. The source live range is recomputed using shrinkToUses() which does handle phis correctly. The hasPHIKill() condition was relevant in the old days when ReMaterializeTrivialDef() tried to recompute the live range itself. The shrinkToUses() function will mark the original def as dead when no more uses and phi kills remain. It is then removed by runOnMachineFunction(). llvm-svn: 139781	2011-09-15 04:52:06 +00:00
Jakob Stoklund Olesen	e7ca8ecd92	Leave hasPHIKill flags alone in LiveInterval::RenumberValues. It is conservatively correct to keep the hasPHIKill flags, even after deleting PHI-defs. The calculation can be very expensive after taildup has created a quadratic number of indirectbr edges in the CFG, and the hasPHIKill flag isn't used for anything after RenumberValues(). llvm-svn: 139780	2011-09-15 04:37:18 +00:00
Andrew Trick	76a86d3d4c	[regcoalescing] bug fix for RegistersDefinedFromSameValue. An improper SlotIndex->VNInfo lookup was leading to unsafe copy removal. Fixes PR10920 401.bzip2 miscompile with no IV rewrite. llvm-svn: 139765	2011-09-15 01:09:33 +00:00
Devang Patel	04d6d47865	Add support to emit debug info for C++0x nullptr type. llvm-svn: 139751	2011-09-14 23:13:28 +00:00
Jakob Stoklund Olesen	811b9c475d	Ignore the cloning of unknown registers. The LRE_DidCloneVirtReg callback may be called with virtual registers that RAGreedy doesn't even know about yet. In that case, there are no data structures to update. llvm-svn: 139702	2011-09-14 17:34:37 +00:00
Jakob Stoklund Olesen	a98af39856	Hoist back-copies to the least busy dominator. When a back-copy is hoisted to the nearest common dominator, keep looking up the dominator tree for a less loopy dominator, and place the back-copy there instead. Don't do this when a single existing back-copy dominates all the others. Assume the client knows what he is doing, and keep the dominating back-copy. This prevents us from hoisting back-copies into loops in most cases. If a value is defined in a loop with multiple exits, we may still hoist back-copies into that loop. That is the speed/size tradeoff. llvm-svn: 139698	2011-09-14 16:45:39 +00:00
Nadav Rotem	d748dbacb0	Add integer promotion support for vselect llvm-svn: 139692	2011-09-14 14:42:15 +00:00
Jakob Stoklund Olesen	5d4277ddfa	Distinguish complex mapped values from forced recomputation. When a ParentVNI maps to multiple defs in a new interval, its live range may still be derived directly from RegAssign by transferValues(). On the other hand, when instructions have been rematerialized or hoisted, it may be necessary to completely recompute live ranges using LiveRangeCalc::extend() to all uses. Use a bit in the value map to indicate that a live range must be recomputed. Rename markComplexMapped() to forceRecompute(). This fixes some live range verification errors when -split-spill-mode=size hoists back-copies by recomputing source ranges when RegAssign kills can't be moved. llvm-svn: 139660	2011-09-13 23:09:04 +00:00
Jakob Stoklund Olesen	a25330f0d7	Implement -split-spill-mode=size. Whenever the complement interval is defined by multiple copies of the same value, hoist those back-copies to the nearest common dominator. This ensures that at most one copy is inserted per value in the complement interval, and no phi-defs are needed. llvm-svn: 139651	2011-09-13 22:22:39 +00:00
Eli Friedman	f78c6a83ee	Fix check for unaligned load/store so it doesn't catch over-aligned load/store. llvm-svn: 139649	2011-09-13 22:19:59 +00:00
Eli Friedman	f1518216fd	Error out on CodeGen of unaligned load/store. Fix test so it isn't accidentally testing that case. llvm-svn: 139641	2011-09-13 20:50:54 +00:00
Nadav Rotem	66dc9ae08d	Fix the assertion which checks the size of the input operand. llvm-svn: 139633	2011-09-13 20:03:38 +00:00
Nadav Rotem	52202fbf2d	Add vselect target support for targets that do not support blend but do support xor/and/or (For example SSE2). llvm-svn: 139623	2011-09-13 19:17:42 +00:00
Devang Patel	f9e2ae9b05	Use a cache to maintain list of machine basic blocks for a given UserValue. llvm-svn: 139616	2011-09-13 18:40:53 +00:00
Jakob Stoklund Olesen	4484f99175	Add SplitEditor::markOverlappedComplement(). This function is used to flag values where the complement interval may overlap other intervals. Call it from overlapIntv, and use the flag to fully recompute those live ranges in transferValues(). llvm-svn: 139612	2011-09-13 18:05:29 +00:00
Jakob Stoklund Olesen	820c8fd0db	Eliminate the extendRange() wrapper. llvm-svn: 139608	2011-09-13 17:38:57 +00:00
Jakob Stoklund Olesen	0494c5c35d	Switch extendInBlock() to take a kill slot instead of the last use slot. Three out of four clients prefer this interface which is consistent with extendIntervalEndTo() and LiveRangeCalc::extend(). llvm-svn: 139604	2011-09-13 16:47:56 +00:00
Jakob Stoklund Olesen	054984d75b	Use a separate LiveRangeCalc for the complement in spill modes. The complement interval may overlap the other intervals created, so use a separate LiveRangeCalc instance to compute its live range. A LiveRangeCalc instance can only be shared among non-overlapping intervals. llvm-svn: 139603	2011-09-13 16:47:53 +00:00
NAKAMURA Takumi	cac923b556	Unbreak msvc. llvm-svn: 139581	2011-09-13 03:58:34 +00:00
Jakob Stoklund Olesen	487f2a37bf	Extract live range calculations from SplitKit. SplitKit will soon need two copies of these data structures, and the algorithms will also be useful when LiveIntervalAnalysis becomes independent of LiveVariables. llvm-svn: 139572	2011-09-13 01:34:21 +00:00
Bill Wendling	ac5a883624	Introduce a bit of a hack. Splitting a landing pad takes considerable care because of PHIs and other nasties. The problem is that the jump table needs to jump to the landing pad block. However, the landing pad block can be jumped to only by an invoke instruction. So we clone the landingpad instruction into its own basic block, have the invoke jump to there. The landingpad instruction's basic block's successor is now the target for the jump table. But because of PHI nodes, we need to create another basic block for the jump table to jump to. This is definitely a hack, because the values for the PHI nodes may not be defined on the edge from the jump table. But that's okay, because the jump table is simply a construct to mimic what is happening in the CFG. So the values are mysteriously there, even though there is no value for the PHI from the jump table's edge (hence calling this a hack). llvm-svn: 139545	2011-09-12 21:56:59 +00:00
Jakob Stoklund Olesen	45df7e0f22	Remove the -compact-regions flag. It has been enabled by default for a while, it was only there to allow performance comparisons. llvm-svn: 139501	2011-09-12 16:54:42 +00:00
Jakob Stoklund Olesen	eecb2fb183	Add an interface for SplitKit complement spill modes. SplitKit always computes a complement live range to cover the places where the original live range was live, but no explicit region has been allocated. Currently, the complement live range is created to be as small as possible - it never overlaps any of the regions. This minimizes register pressure, but if the complement is going to be spilled anyway, that is not very important. The spiller will eliminate redundant spills, and hoist others by making the spill slot live range overlap some of the regions created by splitting. Stack slots are cheap. This patch adds the interface to enable spill modes in SplitKit. In spill mode, SplitKit will assume that the complement is going to spill, so it will allow it to overlap regions in order to avoid back-copies. By doing some of the spiller's work early, the complement live range becomes simpler. In some cases, it can become much simpler because no extra PHI-defs are required. This will speed up both splitting and spilling. This is only the interface to enable spill modes, no implementation yet. llvm-svn: 139500	2011-09-12 16:49:21 +00:00
Jakob Stoklund Olesen	72c0ddfbc4	Update comments to reflect some (not so) recent changes. llvm-svn: 139498	2011-09-12 16:03:26 +00:00
Richard Trieu	78a812bf2d	Fix asserts in CodeGen from: assert("error"); to: assert(0 && "error"); llvm-svn: 139449	2011-09-10 01:07:54 +00:00
Chris Lattner	e74e0c8020	tidy up a bit llvm-svn: 139419	2011-09-09 22:06:59 +00:00
Eli Friedman	b7910b79f5	Make the SelectionDAG verify that all the operands of BUILD_VECTOR have the same type. Teach DAGCombiner::visitINSERT_VECTOR_ELT not to make invalid BUILD_VECTORs. Fixes PR10897. llvm-svn: 139407	2011-09-09 21:04:06 +00:00
Jakob Stoklund Olesen	278bf02581	Reapply r139247: Cache intermediate results during traceSiblingValue. In some cases such as interpreters using indirectbr, the CFG can be very complicated, and live range splitting may be forced to insert a large number of phi-defs. When that happens, traceSiblingValue can spend a lot of time zipping around in the CFG looking for defs and reloads. This patch causes more information to be cached in SibValues, and the cached values are used to terminate searches early. This speeds up spilling by 20x in one interpreter test case. For more typical code, this is just a 10% speedup of spilling. The previous version had bugs that caused miscompilations. They have been fixed. llvm-svn: 139378	2011-09-09 18:11:41 +00:00
Devang Patel	9d904e1a97	Directly point debug info to the stack slot of the argument, instead of trying to keep track of the vreg into which the argument is copied. The LiveDebugVariable can keep track of variable's ranges. llvm-svn: 139330	2011-09-08 22:59:09 +00:00
Jakob Stoklund Olesen	946e0a4665	Revert r139247 "Cache intermediate results during traceSiblingValue." It broke the self host and clang-x86_64-darwin10-RA. llvm-svn: 139259	2011-09-07 21:43:52 +00:00
Jakob Stoklund Olesen	b77d5c1484	Cache intermediate results during traceSiblingValue. In some cases such as interpreters using indirectbr, the CFG can be very complicated, and live range splitting may be forced to insert a large number of phi-defs. When that happens, traceSiblingValue can spend a lot of time zipping around in the CFG looking for defs and reloads. This patch causes more information to be cached in SibValues, and the cached values are used to terminate searches early. This speeds up spilling by 20x in one interpreter test case. For more typical code, this is just a 10% speedup of spilling. llvm-svn: 139247	2011-09-07 19:07:31 +00:00
James Molloy	4c493e8050	Refactor instprinter and mcdisassembler to take a SubtargetInfo. Add -mattr= handling to llvm-mc. Reviewed by Owen Anderson. llvm-svn: 139237	2011-09-07 17:24:38 +00:00
Eli Friedman	e978d2f644	Relax the MemOperands on atomics a bit. Fixes -verify-machineinstrs failures for atomic load/store on ARM. (The fix for the related failures on x86 is going to be nastier because we actually need Acquire memoperands attached to the atomic load instrs, etc.) llvm-svn: 139221	2011-09-07 02:23:42 +00:00
Devang Patel	9de7a7db26	While sinking machine instructions, also sink matching DBG_VALUEs; otherwise the live debug variable pass will drop DBG_VALUEs on the floor. llvm-svn: 139208	2011-09-07 00:07:58 +00:00
Duncan Sands	f2641e1bc1	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. llvm-svn: 139159	2011-09-06 19:07:46 +00:00
Duncan Sands	a098436b32	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. llvm-svn: 139140	2011-09-06 13:37:06 +00:00
Owen Anderson	40d756eacc	Fix a truly heinous bug in DAGCombine related to AssertZext. If we have a chain of zext -> assert_zext -> zext -> use, the first zext would get simplified away because of the later zext, and then the later zext would get simplified away because of the assert. The solution is to teach SimplifyDemandedBits that assert_zext demands all of the high bits of its input, rather than only those demanded by its users. No testcase because the only example I have manifests as llvm-gcc miscompiling LLVM, and I haven't found a smaller case that reproduces this problem. Fixes <rdar://problem/10063365>. llvm-svn: 139059	2011-09-03 00:26:49 +00:00
Jakob Stoklund Olesen	97fe09ad2e	Simplify by using isFullCopy(). llvm-svn: 139019	2011-09-02 18:18:29 +00:00
Duncan Sands	5c04c62765	Darwin wants ctors/dtors to be ordered the other way round to linux. llvm-svn: 139015	2011-09-02 18:07:19 +00:00
Dan Gohman	3767be9aee	Revert r131152, r129796, r129761. This code is currently considered to be unreliable on platforms which require memcpy calls, and it is complicating broader legalize cleanups. It is hoped that these cleanups will make memcpy byval easier to implement in the future. llvm-svn: 138977	2011-09-01 23:07:08 +00:00
Benjamin Kramer	6397051ece	Don't drop alignment info on local common symbols. - On COFF the .lcomm directive has an alignment argument. - On ELF we fall back to .local + .comm. Based on a patch by NAKAMURA Takumi. Fixes PR9337, PR9483 and PR10128. llvm-svn: 138976	2011-09-01 23:04:27 +00:00
Jakob Stoklund Olesen	5dc87d0f4d	Permit remat of partial register defs when it is safe. An instruction may define part of a register where the other bits are undefined. In that case, it is safe to rematerialize the instruction. For example: %vreg2:ssub_0<def> = VLDRS <cp#0>, 0, pred:14, pred:%noreg, %vreg2<imp-def> The extra <imp-def> operand indicates that the instruction does not read the other parts of the virtual register, so a remat is safe. This patch simply allows multiple def operands for the virtual register. It is MI->readsVirtualRegister() that determines if we depend on a previous value so remat is impossible. llvm-svn: 138953	2011-09-01 18:27:51 +00:00
Jakob Stoklund Olesen	e417273fce	Revert r138794, "Do not try to rematerialize a value from a partial definition." The problem is fixed for all register allocators by r138944, so this patch is no longer necessary. <rdar://problem/10032939> llvm-svn: 138945	2011-09-01 17:25:18 +00:00
Jakob Stoklund Olesen	6357fa2f06	Prevent remat of partial register redefinitions. An instruction that redefines only part of a larger register can never be rematerialized since the virtual register value depends on the old value in other parts of the register. This was fixed for the inline spiller in r138794. This patch fixes the problem for all register allocators, and includes a small test case. <rdar://problem/10032939> llvm-svn: 138944	2011-09-01 17:18:50 +00:00
Evan Cheng	90da66bb69	Teach MachineLICM reg pressure tracking code to deal with MVT::untyped. Sorry, I can't come up with a small test case. rdar://10043690 llvm-svn: 138934	2011-09-01 01:45:00 +00:00
Andrew Trick	832a6a1909	PreRA scheduler should avoid cloning compares. Added canClobberReachingPhysRegUse() to handle a particular pattern in which a two-address instruction could be forced to interfere with EFLAGS, causing a compare to be unnecessarily cloned. Fixes rdar://problem/5875261 llvm-svn: 138924	2011-09-01 00:54:31 +00:00
David Greene	7df940d660	Fix Size Typing. Store sizes as uint64_t to avoid possible truncation. llvm-svn: 138901	2011-08-31 21:34:20 +00:00
Eli Friedman	ae1acddb95	Misc cleanup; addresses Duncan's comments on r138877. llvm-svn: 138887	2011-08-31 20:13:26 +00:00
Eli Friedman	e839ecb70b	Fill in type legalization for MERGE_VALUES in all the various cases. Patch by Micah Villmow. (No testcase because the issue only showed up in an out-of-tree backend.) llvm-svn: 138877	2011-08-31 18:36:04 +00:00
Eli Friedman	7c3bdede25	Generic expansion for atomic load/store into cmpxchg/atomicrmw xchg; implements 64-bit atomic load/store for ARM. llvm-svn: 138872	2011-08-31 18:26:09 +00:00
David Greene	cdef71f4f3	Compress Repeated Byte Output. Emit a repeated sequence of bytes using .zero. This saves an enormous amount of asm file space for certain programs. llvm-svn: 138864	2011-08-31 17:30:56 +00:00
Rafael Espindola	6e31dfea35	Spelling and grammar fixes to problems found by Duncan. llvm-svn: 138858	2011-08-31 16:43:33 +00:00
Rafael Espindola	c21742112b	Emit segmented-stack specific code into function prologues for X86. Modify the pass added in the previous patch to call this new code. The new prologues will call a libgcc routine (__morestack) to allocate more stack space from the heap when required. Patch by Sanjoy Das. llvm-svn: 138812	2011-08-30 19:39:58 +00:00
Evan Cheng	e6fba77971	Follow up to r138791. Add an instruction flag, hasPostISelHook, which tells the pre-RA scheduler to call a target hook to adjust the instruction. For ARM, this is used to adjust instructions which may be setting the 's' flag. ADC, SBC, RSB, and RSC instructions have an implicit def of CPSR (required since it now uses CPSR physical register dependency rather than "glue"). If the carry flag is used, then the target hook will fill in the optional operand with CPSR. Otherwise, the hook will remove the CPSR implicit def from the MachineInstr. llvm-svn: 138810	2011-08-30 19:09:48 +00:00
Bob Wilson	358a5f6a72	Do not try to rematerialize a value from a partial definition. I don't currently have a good testcase for this; will try to get one tomorrow. <rdar://problem/10032939> llvm-svn: 138794	2011-08-30 05:36:02 +00:00
Jim Grosbach	ed16ec4248	Thumb2 parsing and encoding for IT blocks. llvm-svn: 138773	2011-08-29 22:24:09 +00:00
Duncan Sands	4d63542b82	Fix PR5329: pay attention to constructor/destructor priority when outputting them. With this, the entire LLVM testsuite passes when built with dragonegg. llvm-svn: 138724	2011-08-28 13:17:22 +00:00
Bill Wendling	4707d37ac9	These splits should be done whether they are critical edges or not. llvm-svn: 138697	2011-08-27 04:40:37 +00:00
Bill Wendling	71fce2c84d	Update the dominator tree with the correct dominator for the new 'unwind' block. llvm-svn: 138664	2011-08-26 21:36:12 +00:00
Bill Wendling	fee8eda35b	Split the landing pad block only if it's a critical edge. Also intelligently split it in the other place where we're splitting critical edges. llvm-svn: 138658	2011-08-26 21:18:55 +00:00
Eli Friedman	452aae6202	Atomic load/store on ARM/Thumb. I don't really like the patterns, but I'm having trouble coming up with a better way to handle them. I plan on making other targets use the same legalization ARM-without-memory-barriers is using... it's not especially efficient, but if anyone cares, it's not that hard to fix for a given target if there's some better lowering. llvm-svn: 138621	2011-08-26 02:59:24 +00:00
Bill Wendling	8ac2041a19	Look at only the terminators of the basic block. Also, if we're using the new EH scheme, return 'true' so that it doesn't try to run the old EH scheme's fixup on the new code. llvm-svn: 138605	2011-08-25 23:48:11 +00:00
Eli Friedman	342e8df0e0	Basic x86 code generation for atomic load and store instructions. llvm-svn: 138478	2011-08-24 20:50:09 +00:00
Evan Cheng	2bb4035707	Move TargetRegistry and TargetSelect from Target to Support where they belong. These are strictly utilities for registering targets and components. llvm-svn: 138450	2011-08-24 18:08:43 +00:00
Jim Grosbach	dee9e8a37c	Tidy up. Trailing whitespace. llvm-svn: 138437	2011-08-24 16:44:17 +00:00
Bill Wendling	f4ee0c0db2	Add the sentinel "no handle" value to the ResumeInst. A value of -1 at a call site tells the personality function that this call isn't handled by the current function. Since the ResumeInsts are converted to calls to _Unwind_SjLj_Resume, add a (volatile) store of -1 to its 'call site'. llvm-svn: 138416	2011-08-24 00:00:23 +00:00
Bill Wendling	2d4f0bea57	Don't replace all uses with the new stuff. This is not necessarily the first or dominating use of the EH values. The IR breaks if it's not. So replace the specific value in the instruction with the new value. llvm-svn: 138406	2011-08-23 22:55:03 +00:00
Bill Wendling	01a325a40e	Look at the end of the entry block for an invoke. The invoke could be at the end of the entry block. If it's the only one, then we won't process all of the landingpad instructions correctly. This code is currently ugly, but should be made much nicer once the new EH switch is thrown. llvm-svn: 138397	2011-08-23 22:20:16 +00:00
Bill Wendling	4eb0433672	A landingpad instruction is neither folded nor dead. llvm-svn: 138387	2011-08-23 21:33:05 +00:00
Evan Cheng	6b477b985b	Fix 80 col violations. llvm-svn: 138356	2011-08-23 19:17:21 +00:00
Bill Wendling	f0d2dfde4f	Split the landing pad's edge. Then for all uses of a landingpad instruction's value, we insert a load of the exception object and selector object from memory, which is where it actually resides. If it's used by a PHI node, we follow that to where it is being used. Eventually, all landingpad instructions should have no uses. Any PHI nodes that were associated with those landingpads should be removed. llvm-svn: 138302	2011-08-22 23:38:40 +00:00
Evan Cheng	6aa2744bed	Follow up to Jim's r138278. This fixes commuteInstruction so it handles two-address instructions correctly. I'll let Jim add a test case. :-) llvm-svn: 138289	2011-08-22 23:04:56 +00:00
Bill Wendling	3aaed0a14c	Some whitespace fixes and #include reordering. llvm-svn: 138256	2011-08-22 18:44:49 +00:00
Nick Lewycky	97f73cb449	Be less redundant. llvm-svn: 138252	2011-08-22 18:26:12 +00:00
Devang Patel	59e27c5f12	Do not use named md nodes to track variables that are completely optimized. This does not scale while doing LTO with debug info. New approach is to include list of variables in the subprogram info directly. llvm-svn: 138145	2011-08-19 23:28:12 +00:00
Benjamin Kramer	68ed46ce9a	Roll back the rest of r126557. It's a hack that will break in some obscure cases. llvm-svn: 138130	2011-08-19 22:39:31 +00:00
Nick Lewycky	c1348074ec	Eli points out that this is what report_fatal_error() is for. llvm-svn: 138091	2011-08-19 21:45:19 +00:00
Nick Lewycky	3f73184d90	This is not actually unreachable, so don't use llvm_unreachable for it. Since the intent seems to be to terminate even in Release builds, just use abort() directly. If program flow ever reaches a __builtin_unreachable (which llvm_unreachable is #define'd to on newer GCCs) then the program is undefined. llvm-svn: 138068	2011-08-19 20:14:27 +00:00
Jakob Stoklund Olesen	6949077f74	Add llc flags to disable machine DCE and CSE. This is useful for unit tests. llvm-svn: 138028	2011-08-19 02:05:35 +00:00
Benjamin Kramer	4938edb02c	Make a bunch of symbols private. llvm-svn: 138025	2011-08-19 01:42:18 +00:00
Jakob Stoklund Olesen	9eb77bf615	Don't treat a partial <def,undef> operand as a read. Normally, a partial register def is treated as reading the super-register unless it also defines the full register like this: %vreg110:sub_32bit<def> = COPY %vreg77:sub_32bit, %vreg110<imp-def> This patch also uses the <undef> flag on partial defs to recognize non-reading operands: %vreg110:sub_32bit<def,undef> = COPY %vreg77:sub_32bit This fixes a subtle bug in RegisterCoalescer where LIS->shrinkToUses would treat a coalesced copy as still reading the register, extending the live range artificially. My test case only works when I disable DCE so a dead copy is left for RegisterCoalescer, so I am not including it. <rdar://problem/9967101> llvm-svn: 138018	2011-08-19 00:30:17 +00:00
Renato Golin	c8d4065781	Make the comment of each declaration follow it, making it easier to read and compare to GCC's result. llvm-svn: 138009	2011-08-18 23:43:14 +00:00
Devang Patel	0ecbcbd12c	Eliminate unnecessary forwarding function. llvm-svn: 138006	2011-08-18 23:17:55 +00:00
Devang Patel	a6576a146d	Add new DIE into the map asap. llvm-svn: 137998	2011-08-18 22:21:50 +00:00
Ivan Krasin	d7cbd4c518	FastISel: avoid function calls between the materialization of the constant and its use. llvm-svn: 137993	2011-08-18 22:06:10 +00:00
Bill Wendling	247fd3bf59	Add the support in code-gen for the landingpad instruction lowering. The landingpad instruction is lowered into the EXCEPTIONADDR and EHSELECTION SDNodes. The information from the landingpad instruction is harvested by the 'AddLandingPadInfo' function. The new EH uses the current EH scheme in the back-end. This will change once we switch over to the new scheme. (Reviewed by Jakob!) llvm-svn: 137880	2011-08-17 21:56:44 +00:00
Bill Wendling	a408e5bf31	Revert patch. Forgot a dependent commit. llvm-svn: 137875	2011-08-17 21:28:05 +00:00
Bill Wendling	2a521948f0	Add the body of 'visitLandingPad'. This generates the SDNodes for the new exception handling scheme. It takes the two values coming from the landingpad instruction and assigns them to the EXCEPTIONADDR and EHSELECTION nodes. llvm-svn: 137873	2011-08-17 21:25:14 +00:00
Bill Wendling	1cdd7fdf54	Modify for the new EH scheme. Things are much saner now. We no longer need to modify the landing pads, because of the invariants we impose upon them. The only thing DwarfEHPrepare needs to do is convert the 'resume' instruction into a call to '_Unwind_Resume'. llvm-svn: 137855	2011-08-17 19:48:49 +00:00
Devang Patel	eb1bb4e419	Until now all debug info MDNodes referred to a root MDNode, a compile unit. This simplified handling of these needs in dwarf writer. However, one side effect of this is that during link time optimization all these MDNodes are _not_ uniqued. In other words there will be N number of MDNodes describing "int", "char" and all other types, which would suddenly grow when each object file starts using libraries like STL. Change the MDNodes graph structure such that the compile unit keeps track of important MDNodes, and update the dwarf writer to process MDNodes top-down instead of bottom-up. llvm-svn: 137778	2011-08-16 22:09:43 +00:00
Jim Grosbach	345768c9ff	Remove unused Target argument from AsmParser construction methods. The argument is unused, and is a layering violation in any case. llvm-svn: 137735	2011-08-16 18:33:49 +00:00
Devang Patel	927840458e	Remove unnecessary version check. llvm-svn: 137728	2011-08-16 17:41:41 +00:00
Nadav Rotem	b66b866f46	Revert r137562 because it caused PR10674 llvm-svn: 137719	2011-08-16 14:34:29 +00:00
Devang Patel	07bb9eea33	Refactor. llvm-svn: 137689	2011-08-15 23:47:24 +00:00
Devang Patel	1f4f98d664	Continue to hoist uses of getCompileUnit() up. The goal is to get rid of uses of getCompileUnit(). llvm-svn: 137683	2011-08-15 23:36:40 +00:00
Devang Patel	d2dfc5ec02	This is somewhat déjà-vu, but avoid using getCompileUnit() as much as possible. llvm-svn: 137668	2011-08-15 22:24:32 +00:00
Devang Patel	3acc70e536	Refactor. Variables are part of compile unit so let CompileUnit create new variable. llvm-svn: 137663	2011-08-15 22:04:40 +00:00
Devang Patel	d899444347	There is no need to maintain a set to keep track of variables that use location expressions. In such cases, AT_location attribute's value will be a label. llvm-svn: 137659	2011-08-15 21:43:21 +00:00
Devang Patel	900d97719b	Fix warning. llvm-svn: 137658	2011-08-15 21:35:16 +00:00
Devang Patel	3e4a965519	Simplify. Let DbgVariable keep track of variable's DBG_VALUE machine instruction. llvm-svn: 137656	2011-08-15 21:24:36 +00:00
Devang Patel	99819b527d	Simplify mapping to variable from its abstract variable info. When a variable is inlined in multiple places, the abstract variable keeps the name, location, type, etc., and all concrete instances of the variable refer directly to the abstract variable. llvm-svn: 137637	2011-08-15 19:01:20 +00:00
Devang Patel	d7d80aadd1	Refactor. llvm-svn: 137632	2011-08-15 18:40:16 +00:00
Devang Patel	6e4d2c9fb7	Refactor. llvm-svn: 137631	2011-08-15 18:35:42 +00:00
Devang Patel	dfd6ec3ce1	Refactor. Global variables are part of compile unit so let CompileUnit create new global variable. llvm-svn: 137621	2011-08-15 17:57:41 +00:00
Devang Patel	895437142a	Refactor. A subprogram is part of compile unit so let CompileUnit construct new subprogram. llvm-svn: 137618	2011-08-15 17:24:54 +00:00
Nadav Rotem	6858b344ed	Fix PR 10635. When generating integer constants, the constant element type may be illegal, even if the requested vector type is legal. Testcase is one of the disabled ARM tests in the vector-select patch. llvm-svn: 137562	2011-08-13 20:31:45 +00:00
Bill Wendling	fae1475823	Initial commit of the 'landingpad' instruction. This implements the 'landingpad' instruction. It's used to indicate that a basic block is a landing pad. There are several restrictions on its use (see LangRef.html for more detail). These restrictions allow the exception handling code to gather the information it needs in a much more sane way. This patch has the definition, implementation, C interface, parsing, and bitcode support in it. llvm-svn: 137501	2011-08-12 20:24:12 +00:00
Devang Patel	444034783e	Use ArrayRef. llvm-svn: 137485	2011-08-12 18:10:19 +00:00
Chris Lattner	335d399a0e	switch to use the new api for structtypes. llvm-svn: 137480	2011-08-12 18:06:37 +00:00
Devang Patel	db4374a28a	Provide fast path as Jakob suggested. llvm-svn: 137478	2011-08-12 18:01:34 +00:00
Nadav Rotem	62da15a330	Revert r137310 because it does not optimize any code on ToT llvm-svn: 137466	2011-08-12 17:15:04 +00:00
Duncan Sands	a41634e307	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. llvm-svn: 137460	2011-08-12 14:54:45 +00:00
Jakob Stoklund Olesen	1f582ba609	Simplify the interference checking code a bit. This is possible now that we no longer provide an interface to iterate the interference overlaps. llvm-svn: 137397	2011-08-12 00:22:04 +00:00
Jakob Stoklund Olesen	da0192d72b	Remove the InterferenceResult class. llvm-svn: 137381	2011-08-11 22:46:06 +00:00
Jakob Stoklund Olesen	cd14efaec2	Eliminate the last use of InterferenceResult. The Query class now holds two iterators instead of an InterferenceResult instance. The iterators are used as bookmarks for repeated collectInterferingVRegs calls. llvm-svn: 137380	2011-08-11 22:46:04 +00:00
Jakob Stoklund Olesen	da4f0eb12c	Remove more dead code. collectInterferingVRegs will be the primary function for interference checks. llvm-svn: 137354	2011-08-11 21:18:34 +00:00
Jakob Stoklund Olesen	7519336752	Privatize an unused part of the LiveIntervalUnion::Query interface. No clients are iterating over interference overlaps. llvm-svn: 137350	2011-08-11 21:00:42 +00:00
Jakob Stoklund Olesen	05ff9d1f6d	Remove some dead code. The InterferenceResult iterator turned out to be less important than we thought it would be. LiveIntervalUnion clients want higher level information, like the list of interfering virtual registers. llvm-svn: 137346	2011-08-11 20:41:41 +00:00
Benjamin Kramer	fa7e6a54b1	Plug a memory leak. llvm-svn: 137321	2011-08-11 18:39:28 +00:00
Nadav Rotem	61140e1028	[AVX] When joining two XMM registers into a YMM register, make sure that the lower XMM register gets in first. This will allow the SUBREG pattern to eliminate the first vector insertion. llvm-svn: 137310	2011-08-11 16:49:36 +00:00
Chris Lattner	96710b4308	fix PR10605 / rdar://9930964 by adding a pretty scary missed check. It's somewhat surprising anything works without this. Before we would compile the testcase into: test: # @test movl $4, 8(%rdi) movl 8(%rdi), %eax orl %esi, %eax cmpl $32, %edx movl %eax, -4(%rsp) # 4-byte Spill je .LBB0_2 now we produce: test: # @test movl 8(%rdi), %eax movl $4, 8(%rdi) orl %esi, %eax cmpl $32, %edx movl %eax, -4(%rsp) # 4-byte Spill je .LBB0_2 llvm-svn: 137303	2011-08-11 06:26:54 +00:00
Devang Patel	784077eb57	Stay within 80 columns. llvm-svn: 137283	2011-08-10 23:58:09 +00:00
Devang Patel	bb23a4a9a5	Distinguish between two copies of one inlined variable. Take 2. llvm-svn: 137253	2011-08-10 21:50:54 +00:00
Devang Patel	37a62058fe	While extending definition range of a debug variable, consult lexical scopes also. There is no point extending a debug variable outside its lexical block. This provides 6x compile time speedup in some cases. llvm-svn: 137250	2011-08-10 21:25:34 +00:00
Devang Patel	e30746c844	Revert unintentional parts of previous check-in. llvm-svn: 137249	2011-08-10 21:16:49 +00:00
Devang Patel	7e62302fae	Start using LexicalScopes utility. No intentional functionality change. llvm-svn: 137246	2011-08-10 20:55:27 +00:00
Devang Patel	e1649c31cb	Provide utility to extract and use lexical scoping information from machine instructions. llvm-svn: 137237	2011-08-10 19:04:06 +00:00
Jakob Stoklund Olesen	b91e489923	Trim an unneeded header. llvm-svn: 137184	2011-08-09 23:49:21 +00:00
Jakob Stoklund Olesen	53910d6aae	Inflate register classes after coalescing. Coalescing can remove copy-like instructions with sub-register operands that constrained the register class. Examples are: x86: GR32_ABCD:sub_8bit_hi -> GR32; arm: DPR_VFP2:ssub0 -> DPR. Recompute the register class of any virtual registers that are used by fewer instructions after coalescing. This affects code generation for the Cortex-A8 where we use NEON instructions for f32 operations, cf. fp_convert.ll: vadd.f32 d16, d1, d0 vcvt.s32.f32 d0, d16 The register allocator is now free to use d16 for the temporary, and that comes first in the allocation order because it doesn't interfere with any s-registers. llvm-svn: 137133	2011-08-09 18:19:41 +00:00
Jakob Stoklund Olesen	da96006975	Move CalculateRegClass to MRI::recomputeRegClass. This function doesn't have anything to do with spill weights, and MRI already has functions for manipulating the register class of a virtual register. llvm-svn: 137123	2011-08-09 16:46:27 +00:00
Devang Patel	6c1ed31b3b	Print variable's inline location in debug output. llvm-svn: 137096	2011-08-09 01:03:35 +00:00
Jakob Stoklund Olesen	e7dddfd7f6	Rename member variables to follow coding standards. No functional change. llvm-svn: 137094	2011-08-09 01:01:27 +00:00
Jakob Stoklund Olesen	e1f5313bc7	Move the RegisterCoalescer private to its implementation file. RegisterCoalescer.h still has the CoalescerPair class interface. llvm-svn: 137088	2011-08-09 00:43:37 +00:00
Jakob Stoklund Olesen	4c9a2fb044	Refer to the RegisterCoalescer pass by ID. A public interface is no longer needed since RegisterCoalescer is not an analysis any more. llvm-svn: 137082	2011-08-09 00:29:53 +00:00
Jakob Stoklund Olesen	daa2cad723	Hoist hasLoadFromStackSlot and hasStoreToStackSlot. These methods are target-independent since they simply scan the memory operands. They can live in TargetInstrInfoImpl. llvm-svn: 137063	2011-08-08 20:53:24 +00:00
Devang Patel	fee7cedbc9	Simplify by creating parent first. llvm-svn: 137056	2011-08-08 18:22:10 +00:00
Jakob Stoklund Olesen	22f37a1eb1	Fix typo. Thanks, Andy! llvm-svn: 137023	2011-08-06 18:20:24 +00:00
Jakob Stoklund Olesen	d4bb1d43e8	Reject RS_Spill ranges from local splitting as well. All new local ranges are marked as RS_New now, so there is no need to attempt splitting of RS_Spill ranges any more. llvm-svn: 137002	2011-08-05 23:50:33 +00:00
Jakob Stoklund Olesen	02cf10bdfd	Only mark remainder intervals as RS_Spill after per-block splitting. The local ranges created get to stay in the RS_New stage, just like for local and region splitting. This gives tryLocalSplit a bit more freedom the first time it sees one of these new local ranges. llvm-svn: 137001	2011-08-05 23:50:31 +00:00
Jakob Stoklund Olesen	0de95ef7f5	Remember to update LiveDebugVariables after per-block splitting. llvm-svn: 136996	2011-08-05 23:10:40 +00:00
Jakob Stoklund Olesen	cef5d8ff77	Extract per-block splitting into its own method. No functional change. llvm-svn: 136994	2011-08-05 23:04:18 +00:00
Jakob Stoklund Olesen	cdf9ad9107	Delete getMultiUseBlocks and splitSingleBlocks. These functions are no longer used, and they are easily replaced with a loop calling shouldSplitSingleBlock and splitSingleBlock. llvm-svn: 136993	2011-08-05 22:52:17 +00:00
Jakob Stoklund Olesen	58995bc551	Also use shouldSplitSingleBlock() in the fallback splitting mode. Drop the use of SplitAnalysis::getMultiUseBlocks, there is no need to go through a SmallPtrSet any more. llvm-svn: 136992	2011-08-05 22:43:23 +00:00
Jakob Stoklund Olesen	8627ea91cb	Split around single instructions to enable register class inflation. Normally, we don't create a live range for a single instruction in a basic block, the spiller does that anyway. However, when splitting a live range that belongs to a proper register sub-class, inserting these extra COPY instructions completely removes the constraints from the remainder interval, and it may be allocated from the larger super-class. The spiller will mop up these small live ranges if we end up spilling anyway. It calls them snippets. llvm-svn: 136989	2011-08-05 22:20:45 +00:00
Jakob Stoklund Olesen	5122467b38	Detect proper register sub-classes. Some instructions require restricted register classes, but most of the time that doesn't affect register allocation. For example, some instructions don't work with the stack pointer, but that is a reserved register anyway. Sometimes it matters, GR32_ABCD only has 4 allocatable registers. For such a proper sub-class, the register allocator should try to enable register class inflation since that makes more registers available for allocation. Make sure only legal super-classes are considered. For example, tGPR is not a proper sub-class in Thumb mode, but in ARM mode it is. llvm-svn: 136981	2011-08-05 21:28:14 +00:00
Jakob Stoklund Olesen	d633abebf6	Fix liveness computations in BranchFolding. The old code would look at kills and defs in one pass over the instruction operands, causing problems with this code: %R0<def>, %CPSR<def,dead> = tLSLri %R5<kill>, 2, pred:14, pred:%noreg %R0<def>, %CPSR<def,dead> = tADDrr %R4<kill>, %R0<kill>, pred:14, %pred:%noreg The last instruction kills and redefines %R0, so it is still live after the instruction. This caused a register scavenger crash when compiling 483.xalancbmk for armv6. I am not including a test case because it requires too much bad luck to expose this old bug. First you need to convince the register allocator to use %R0 twice on the tADDrr instruction, then you have to convince BranchFolding to do something that causes it to run the register scavenger on the bad block. <rdar://problem/9898200> llvm-svn: 136973	2011-08-05 18:47:07 +00:00
Chandler Carruth	81b7e11c89	Temporarily revert r135528 which distinguishes between two copies of one inlined variable, based on the discussion in PR10542. This explodes the runtime of several passes down the pipeline due to a large number of "copies" remaining live across a large function. This only shows up with both debug and opt, but when it does it creates a many-minute compile when self-hosting LLVM+Clang. There are several other cases that show these types of regressions. All of this is tracked in PR10542, and progress is being made on fixing the issue. Once it's addressed, the change can be reinstated, but until then this restores the performance for self-hosting and other opt+debug builds. Devang, let me know if this causes any trouble, or impedes fixing it in any way, and thanks for working on this! llvm-svn: 136953	2011-08-05 00:51:31 +00:00
Jakob Stoklund Olesen	63e3dec9ad	Count the total amount of stack space used in compiled functions. Patch by Ivan Krasin! llvm-svn: 136921	2011-08-04 21:06:09 +00:00
Devang Patel	d61b1d505c	Print DBG_VALUE variable's location info as a comment. llvm-svn: 136916	2011-08-04 20:44:26 +00:00
Devang Patel	eabc3cea33	Increment counter inside insertDebugValue(). llvm-svn: 136915	2011-08-04 20:42:11 +00:00
Devang Patel	b456866b7b	Add counter. llvm-svn: 136901	2011-08-04 18:45:38 +00:00
Jakob Stoklund Olesen	2539af600a	Correctly handle multiple DBG_VALUE instructions at the same SlotIndex. It is possible to have multiple DBG_VALUEs for the same variable: 32L TEST32rr %vreg0<kill>, %vreg0, %EFLAGS<imp-def>; GR32:%vreg0 DBG_VALUE 2, 0, !"i" DBG_VALUE %noreg, %0, !"i" When that happens, keep the last one instead of the first. llvm-svn: 136842	2011-08-03 23:44:31 +00:00
Jakob Stoklund Olesen	11b788d5be	Enable compact region splitting by default. This helps generate better code in functions with high register pressure. The previous version of compact region splitting caused regressions because the regions were a bit too large. A stronger negative bias applied in r136832 fixed this problem. llvm-svn: 136836	2011-08-03 23:16:09 +00:00
Devang Patel	aab841cf63	Do not drop undef debug values. These are used as range termination marker by live debug variable pass. llvm-svn: 136834	2011-08-03 23:13:55 +00:00
Jakob Stoklund Olesen	869545203b	Be more conservative when forming compact regions. Apply twice the negative bias on transparent blocks when computing the compact regions. This excludes loop backedges from the region when only one of the loop blocks uses the register. Previously, we would include the backedge in the region if the loop preheader and the loop latch both used the register, but the loop header didn't. When both the header and latch blocks use the register, we still keep it live on the backedge. llvm-svn: 136832	2011-08-03 23:09:38 +00:00
Chandler Carruth	77eb5a0a37	Fix some warnings from Clang in release builds: lib/CodeGen/RegAllocGreedy.cpp:1176:18: warning: unused variable 'B' [-Wunused-variable] if (unsigned B = Cand.getBundles(BundleCand, BestCand)) { ^ lib/CodeGen/RegAllocGreedy.cpp:1188:18: warning: unused variable 'B' [-Wunused-variable] if (unsigned B = Cand.getBundles(BundleCand, 0)) { ^ llvm-svn: 136831	2011-08-03 23:07:27 +00:00
Jakub Staszak	3ef20e35f9	Fix typo in #include which revealed in the case-sensitive filesystem. llvm-svn: 136828	2011-08-03 22:53:41 +00:00
Jakub Staszak	15e5b742ad	Use MachineBranchProbabilityInfo in If-Conversion instead of its own heuristics. llvm-svn: 136826	2011-08-03 22:34:43 +00:00
Jakub Staszak	a60d130f26	Add more constantness in BlockFrequencyInfo. llvm-svn: 136816	2011-08-03 21:30:57 +00:00
Eli Friedman	30a49e93e3	New approach to r136737: insert the necessary fences for atomic ops in platform-independent code, since a bunch of platforms (ARM, Mips, PPC, Alpha are the relevant targets here) need to do essentially the same thing. I think this completes the basic CodeGen for atomicrmw and cmpxchg. llvm-svn: 136813	2011-08-03 21:06:02 +00:00
Bob Wilson	0a8d5c6047	Some revisions to Devang's change r136759 for merged global debug info. llvm-svn: 136802	2011-08-03 19:42:51 +00:00
Devang Patel	dc9cbaaf23	Use byte offset, instead of element number, to access merged global. llvm-svn: 136759	2011-08-03 01:25:46 +00:00
Jakob Stoklund Olesen	3c14505164	Use the precomputed def presence in RAGreedy::calcSpillCost. llvm-svn: 136742	2011-08-02 23:04:08 +00:00
Jakob Stoklund Olesen	057f9b68de	Inform SpillPlacement about blocks with defs. This information is not used for anything yet. llvm-svn: 136741	2011-08-02 23:04:06 +00:00
Jakob Stoklund Olesen	43859a6ad2	Rename {First,Last}Use to {First,Last}Instr. With a 'FirstDef' field right there, it is very confusing that FirstUse refers to an instruction that may be a def. llvm-svn: 136739	2011-08-02 22:54:14 +00:00
Jakob Stoklund Olesen	ae8027cc95	Add a BlockInfo::FirstDef field. This is either an invalid SlotIndex, or valno->def for the first value defined inside the block. PHI values are not counted as defined inside the block. The FirstDef field will be used when estimating the cost of spilling around a block. llvm-svn: 136736	2011-08-02 22:37:22 +00:00
Jakob Stoklund Olesen	f047ff4fe1	Delete BlockInfo::LiveThrough. It wasn't used any more. llvm-svn: 136735	2011-08-02 22:37:20 +00:00
Jakob Stoklund Olesen	d2a7d1ed97	Extend the SpillPlacement interface with two new features. The PrefBoth constraint is used for blocks that ideally want a live-in value both on the stack and in a register. This would be used by a block that has a use before interference forces a spill. Secondly, add the ChangesValue flag to BlockConstraint. This tells SpillPlacement if a live-in value on the stack can be reused as a live-out stack value for free. If the block redefines the virtual register, a spill would be required for that. This extra information will be used by SpillPlacement to more accurately calculate spill costs when a value can exist both on the stack and in a register. The simplest example is a basic block that reads the virtual register, but doesn't change its value. Spilling around such a block requires a reload, but no spill in the block. The spiller already knows this, but the spill placer doesn't. That can sometimes lead to suboptimal regions. llvm-svn: 136731	2011-08-02 21:53:03 +00:00
Eli Friedman	04c5025cd5	Don't create a ridiculous EXTRACT_ELEMENT. PR10563. The testcase looks extremely fragile, so I'm adding an assertion which should catch any cases like this. llvm-svn: 136711	2011-08-02 18:38:35 +00:00
Jay Foad	8dfee5f6bf	Remove an unnecessary cast. llvm-svn: 136609	2011-08-01 12:27:15 +00:00
Bill Wendling	f891bf8b30	Add the 'resume' instruction for the new EH rewrite. This adds the 'resume' instruction class, IR parsing, and bitcode reading and writing. The 'resume' instruction resumes propagation of an existing (in-flight) exception whose unwinding was interrupted with a 'landingpad' instruction (to be added later). llvm-svn: 136589	2011-07-31 06:30:59 +00:00
Jakob Stoklund Olesen	163e7a73f1	Time the emission of debug values. llvm-svn: 136584	2011-07-31 03:53:42 +00:00
Jakob Stoklund Olesen	eb5ea833ed	Revert r136528 "Enable compact region splitting by default." While this generally helped x86-64, there was some large regressions for i386. llvm-svn: 136571	2011-07-30 17:19:14 +00:00
Bill Wendling	ad088e6724	Revert r136253, r136263, r136269, r136313, r136325, r136326, r136329, r136338, r136339, r136341, r136369, r136387, r136392, r136396, r136429, r136430, r136444, r136445, r136446, r136253 pending review. llvm-svn: 136556	2011-07-30 05:42:50 +00:00
Jakob Stoklund Olesen	5670f850c6	Revert "Don't check liveness of unallocatable registers." The ARM target depends on CPSR liveness being tracked after register allocation. llvm-svn: 136548	2011-07-30 00:57:25 +00:00
Jakob Stoklund Olesen	95cc5440e9	Don't check liveness of unallocatable registers. This includes registers like EFLAGS and ST0-ST7. We don't check for liveness issues in the verifier and scavenger because registers will never be allocated from these classes. While in SSA form, we do care about the liveness of unallocatable unreserved registers. Liveness of EFLAGS and ST0 needs to be correct for MachineDCE and MachineSinking. llvm-svn: 136541	2011-07-29 23:36:21 +00:00
Jakob Stoklund Olesen	9dd184151b	Check for multiple defs in the machine code verifier. llvm-svn: 136535	2011-07-29 23:02:48 +00:00
Jakob Stoklund Olesen	9760f04ef9	Add an isSSA() flag to MachineRegisterInfo. This flag is true from isel to register allocation when the machine function is required to be in SSA form. The TwoAddressInstructionPass and PHIElimination passes clear the flag. The SSA flag will be used by the machine code verifier to check for SSA form, and eventually an assertion can enforce it in +Asserts builds. This will catch the common target error of creating machine code with multiple defs of a virtual register. llvm-svn: 136532	2011-07-29 22:51:22 +00:00
Jakub Staszak	0480a8fbbb	Do not lose branch weights when lowering SwitchInst. llvm-svn: 136529	2011-07-29 22:25:21 +00:00
Jakob Stoklund Olesen	b5c2d3210c	Enable compact region splitting by default. This helps generate better code in functions with high register pressure. llvm-svn: 136528	2011-07-29 22:10:27 +00:00
Jakub Staszak	539db98987	Remove unneeded const_cast. llvm-svn: 136506	2011-07-29 20:05:36 +00:00
Nick Lewycky	019d255d3e	Fix a lot of typos, improve (but not necessarily fix) grammaros and reflow some lines. No functionality change. llvm-svn: 136458	2011-07-29 03:49:23 +00:00
Eli Friedman	adec587d5c	Misc optimizer+codegen work for 'cmpxchg' and 'atomicrmw'. They appear to be working on x86 (at least for trivial testcases); other architectures will need more work so that they actually emit the appropriate instructions for orderings stricter than 'monotonic'. (As far as I can tell, the ARM, PPC, Mips, and Alpha backends need such changes.) llvm-svn: 136457	2011-07-29 03:05:32 +00:00
Bill Wendling	7eadbeaf62	Use the pointer type size. With this, we can now compile a simple EH program. llvm-svn: 136446	2011-07-29 01:15:29 +00:00
Bill Wendling	6a8cac735a	And now something that compiles... llvm-svn: 136445	2011-07-29 01:11:33 +00:00
Bill Wendling	4b0a365beb	Make sure to sext or trunc the result from the register. llvm-svn: 136444	2011-07-29 01:11:14 +00:00
Chandler Carruth	9d7feab3e0	Rewrite the CMake build to use explicit dependencies between libraries, specified in the same file that the library itself is created. This is more idiomatic for CMake builds, and also allows us to correctly specify dependencies that are missed due to bugs in the GenLibDeps perl script, or change from compiler to compiler. On Linux, this returns CMake to a place where it can reliably rebuild several targets of LLVM. I have tried not to change the dependencies from the ones in the current auto-generated file. The only places I've really diverged are in places where I was seeing link failures, and added a dependency. The goal of this patch is not to start changing the dependencies, merely to move them into the correct location, and an explicit form that we can control and change when necessary. This also removes a serialization point in the build because we don't have to scan all the libraries before we begin building various tools. We no longer have a step of the build that regenerates a file inside the source tree. A few other associated cleanups fall out of this. This isn't really finished yet though. After talking to dgregor he urged switching to a single CMake macro to construct libraries with both sources and dependencies in the arguments. Migrating from the two macros to that style will be a follow-up patch. Also, llvm-config is still generated with GenLibDeps.pl, which means it still has slightly buggy dependencies. The internal CMake 'llvm-config-like' macro uses the correct explicitly specified dependencies however. A future patch will switch llvm-config generation (when using CMake) to be based on these deps as well. This may well break Windows. I'm getting a machine set up now to dig into any failures there. If anyone can chime in with problems they see or ideas of how to solve them for Windows, much appreciated. llvm-svn: 136433	2011-07-29 00:14:25 +00:00
Bill Wendling	3cc87682e1	Visit the landingpad instruction. This generates the correct SDNodes for the landingpad instruction. It makes an assumption that the result of the landingpad instruction has at least two values. And that the first value is a pointer to the exception object and the second value is the "selector." llvm-svn: 136430	2011-07-28 23:44:58 +00:00
Bill Wendling	7fa7fe6b58	Add the AddLandingPadInfo function. AddLandingPadInfo takes a landingpad instruction and grabs all of the information from it that it needs for EH table generation. llvm-svn: 136429	2011-07-28 23:42:57 +00:00
Eli Friedman	c9a551ebed	LangRef and basic memory-representation/reading/writing for 'cmpxchg' and 'atomicrmw' instructions, which allow representing all the current atomic rmw intrinsics. The allowed operands for these instructions are heavily restricted at the moment; we can probably loosen it a bit, but supporting general first-class types (where it makes sense) might get a bit complicated, given how SelectionDAG works. As an initial cut, these operations do not support specifying an alignment, but it would be possible to add if we think it's useful. Specifying an alignment lower than the natural alignment would be essentially impossible to support on anything other than x86, but specifying a greater alignment would be possible. I can't think of any useful optimizations which would use that information, but maybe someone else has ideas. Optimizer/codegen support coming soon. llvm-svn: 136404	2011-07-28 21:48:00 +00:00
Jakob Stoklund Olesen	b16081ce8c	Handle REG_SEQUENCE with implicitly defined operands. Code like that would only be produced by bugpoint, but we should still handle it correctly. When a register is defined by a REG_SEQUENCE of undefs, the register itself is undef. Previously, we would create a register with uses but no defs. Fixes part of PR10520. llvm-svn: 136401	2011-07-28 21:38:51 +00:00
Bill Wendling	f8d95bc4c6	Use ArrayRef instead of requiring an std::vector. llvm-svn: 136396	2011-07-28 21:25:33 +00:00
Bill Wendling	4f027233d2	The personality function should be a Function* and not just a Value*. llvm-svn: 136392	2011-07-28 21:14:13 +00:00
Jakob Stoklund Olesen	cad845f4c0	Reverse order of RS_Split live ranges under -compact-regions. There are two conflicting strategies in play: - Under high register pressure, we want to assign large live ranges first. Smaller live ranges are easier to place afterwards. - Live range splitting is guided by interference, so splitting should be deferred until interference is as realistic as possible. With the recent changes to the live range stages, and with compact regions enabled, it is less traumatic to split a live range too early. If some of the split products were too big, they can often be split again. By reversing the RS_Split order, we get this queue order: 1. Normal live ranges, large to small. 2. RS_Split live ranges, large to small. The large-to-small order improves RAGreedy's puzzle solving skills under high register pressure. It may cause a bit more iterated splitting, but we handle that better now. With this change, -compact-regions is mostly an improvement on SPEC. llvm-svn: 136388	2011-07-28 20:48:23 +00:00
Bill Wendling	7b563cde19	Initial code to convert ResumeInsts into calls to _Unwind_Resume. This should be the only code necessary for DWARF EH prepare. llvm-svn: 136387	2011-07-28 20:48:05 +00:00
Nadav Rotem	9708aef2dc	CR fix: The ANY_EXTEND can be removed because the input and output type must be identical. llvm-svn: 136355	2011-07-28 14:38:46 +00:00
Eli Friedman	26a484852e	Code generation for 'fence' instruction. llvm-svn: 136283	2011-07-27 22:21:52 +00:00
Jakub Staszak	da3df4302a	Use BlockFrequency instead of uint32_t in BlockFrequencyInfo. llvm-svn: 136278	2011-07-27 22:05:51 +00:00
Devang Patel	53dc616170	Remove outdated FIXME comment. llvm-svn: 136275	2011-07-27 22:00:01 +00:00
Bill Wendling	6c923bb8d9	Merge the contents from exception-handling-rewrite to the mainline. This adds the new instructions 'landingpad' and 'resume'. llvm-svn: 136253	2011-07-27 20:18:04 +00:00
Jeffrey Yasskin	6381c0100b	Explicitly cast narrowing conversions inside {}s that will become errors in C++0x. llvm-svn: 136211	2011-07-27 06:22:51 +00:00
Dan Gohman	456b1edd0d	Revert r136156, which broke several buildbots. llvm-svn: 136206	2011-07-27 01:10:27 +00:00
Devang Patel	f098ce2757	It is quite possible that inlined function body is split into multiple chunks of consecutive instructions. But, there is not any way to describe this in .debug_inline accelerator table used by gdb. However, describe non contiguous ranges of inlined function body appropriately using AT_range of DW_TAG_inlined_subroutine debug info entry. llvm-svn: 136196	2011-07-27 00:34:13 +00:00
Jakob Stoklund Olesen	dab4b9a4b2	Add support for multi-way live range splitting. When splitting global live ranges, it is now possible to split for multiple destination intervals at once. Previously, we only had the main and stack intervals. Each edge bundle is assigned to a split candidate, and splitAroundRegion will insert copies between the candidate intervals and the stack interval as needed. The multi-way splitting is used to split around compact regions when enabled with -compact-regions. The best candidate register still gets all the bundles it wants, but everything outside the main interval is first split around compact regions before we create single-block intervals. Compact region splitting still causes some regressions, so it is not enabled by default. llvm-svn: 136186	2011-07-26 23:41:46 +00:00
Jakob Stoklund Olesen	b1459dbc25	Print out the MBB live-in registers. llvm-svn: 136178	2011-07-26 23:12:08 +00:00
Jakob Stoklund Olesen	c3bcb02154	Eliminate copies of undefined values during coalescing. These copies would coalesce easily, but the resulting value would be defined by a deleted instruction. Now we also remove the undefined value number from the destination register. This fixes PR10503. llvm-svn: 136174	2011-07-26 23:00:24 +00:00
Dan Gohman	9eb62cd159	Delete unnecessarily cautious LastCALLSEQ code. llvm-svn: 136156	2011-07-26 22:00:59 +00:00
Eli Friedman	06b8b571b2	Add obvious missing case to switch. PR10497. llvm-svn: 136130	2011-07-26 20:38:49 +00:00
Devang Patel	613958c82c	While extracting lexical scopes from machine instruction stream, work on one machine basic block at a time. llvm-svn: 136106	2011-07-26 18:09:53 +00:00
Duncan Sands	3ac1836540	SrcDef is only written and never read. Remove it. llvm-svn: 136080	2011-07-26 15:05:06 +00:00
Jakob Stoklund Olesen	5387bd340b	Revert to RA_Assign when a virtreg separates into components. When dead code elimination deletes a PHI value, the virtual register may split into multiple connected components. In that case, revert each component to the RS_Assign stage. The new components are guaranteed to be smaller (the original value numbers are distributed among the components), so this will always be making progress. The components are now allowed to evict other live ranges or be split again. llvm-svn: 136034	2011-07-26 00:54:56 +00:00
Evan Cheng	3a79225b4c	Rename createCodeEmitter to createMCCodeEmitter; createObjectStreamer to createMCObjectStreamer. llvm-svn: 136031	2011-07-26 00:42:34 +00:00
Evan Cheng	1142444565	Rename TargetAsmParser to MCTargetAsmParser and TargetAsmLexer to MCTargetAsmLexer; rename createAsmLexer to createMCAsmLexer and createAsmParser to createMCAsmParser. llvm-svn: 136027	2011-07-26 00:24:13 +00:00
Evan Cheng	5928e69d20	Rename TargetAsmBackend to MCAsmBackend; rename createAsmBackend to createMCAsmBackend. llvm-svn: 136010	2011-07-25 23:24:55 +00:00
Eli Friedman	fee02c6c13	Initial implementation of 'fence' instruction, the new C++0x-style replacement for llvm.memory.barrier. This is just a LangRef entry and reading/writing/memory representation; optimizer+codegen support coming soon. llvm-svn: 136009	2011-07-25 23:16:38 +00:00
Eli Friedman	cbd3ba91b7	Make sure this DAGCombine actually returns an UNDEF of the correct type; PR10476. llvm-svn: 135993	2011-07-25 22:25:42 +00:00
Jakub Staszak	875ebd5f5d	Rename BlockFrequency to BlockFrequencyInfo and MachineBlockFrequency to MachineBlockFrequencyInfo. llvm-svn: 135937	2011-07-25 19:25:40 +00:00
Jakob Stoklund Olesen	450111718c	Add an RS_Split2 stage used for loop prevention. This mechanism already exists, but the RS_Split2 stage makes it clearer. When live range splitting creates ranges that may not be making progress, they are marked RS_Split2 instead of RS_New. These ranges may be split again, but only in a way that can be proven to make progress. For local ranges, that means they must be split into ranges used by strictly fewer instructions. For global ranges, region splitting is bypassed and the RS_Split2 ranges go straight to per-block splitting. llvm-svn: 135912	2011-07-25 15:25:43 +00:00
Jakob Stoklund Olesen	3ef8cf1370	Rename live range stages to better reflect how they are used. The stage is used to control where a live range is going, not where it is coming from. Live ranges created by splitting will usually be marked RS_New, but some are marked RS_Spill to avoid wasting time trying to split them again. The old RS_Global and RS_Local stages are merged - they are really the same thing for local and global live ranges. llvm-svn: 135911	2011-07-25 15:25:41 +00:00
Jay Foad	d1b7849d49	Convert GetElementPtrInst to use ArrayRef. llvm-svn: 135904	2011-07-25 09:48:08 +00:00
Jakob Stoklund Olesen	73a9eb9f81	Never extend live ranges for <undef> uses. llvm-svn: 135886	2011-07-24 20:33:23 +00:00
Jakob Stoklund Olesen	56a56eb80e	Correctly handle <undef> tied uses when rewriting after a split. This fixes PR10463. A two-address instruction with an <undef> use operand was incorrectly rewritten so the def and use no longer used the same register, violating the tie constraint. Fix this by always rewriting <undef> operands with the register a def operand would use. llvm-svn: 135885	2011-07-24 20:23:50 +00:00
Jakob Stoklund Olesen	ecad62f909	Add RAGreedy::calcCompactRegion. This method computes the edge bundles that should be live when splitting around a compact region. This is independent of interference. The function returns false if the live range was already a compact region, or the compact region doesn't have any live bundles - it would be the same as splitting around basic blocks. Compact regions are computed using the normal spill placement code. We pretend there is interference in all live-through blocks that don't use the live range. This removes all edges from the Hopfield network used for spill placement, so it converges instantly. llvm-svn: 135847	2011-07-23 03:41:57 +00:00
Jakob Stoklund Olesen	f500ccece7	Fix bug in SplitEditor::splitLiveThroughBlock when switching registers. If there is no interference and no last split point, we cannot enterIntvBefore(Stop) - that function needs a real instruction. Use enterIntvAtEnd instead for that very easy case. This code doesn't currently run, it is needed by multi-way splitting. llvm-svn: 135846	2011-07-23 03:32:26 +00:00
Jakob Stoklund Olesen	a953bf135f	Prepare RAGreedy::growRegion for compact regions. A split candidate can have a null PhysReg which means that it doesn't map to a real interference pattern. Instead, pretend that all through blocks have interference. This makes it possible to generate compact regions where the live range doesn't go through blocks that don't use it. The live range will still be live between directly connected blocks with uses. Splitting around a compact region tends to produce a live range with a high spill weight, so it may evict a less dense live range. llvm-svn: 135845	2011-07-23 03:22:33 +00:00
Jakob Stoklund Olesen	0ab5d0ee5b	Add a simple method for marking blocks with interference in and out. This method matches addLinks - All the listed blocks are considered to have interference, so they add a negative bias to their bundles. This could also be done by addConstraints, but that requires building a separate BlockConstraint array. llvm-svn: 135844	2011-07-23 03:10:19 +00:00
Jakob Stoklund Olesen	cacefc7dca	Allow null interference cursors to be queried. They always report 'no interference'. llvm-svn: 135843	2011-07-23 03:10:17 +00:00
Evan Cheng	f2596bc62a	Move TargetAsmParser.h TargetAsmBackend.h and TargetAsmLexer.h to MC where they belong. llvm-svn: 135833	2011-07-23 00:45:41 +00:00
Jay Foad	17bab44308	Fix more MSVC warnings caused by a cases I missed when converting ConstantExpr::getGetElementPtr to use ArrayRef. llvm-svn: 135762	2011-07-22 08:52:50 +00:00
Jay Foad	040dd82f44	Convert IRBuilder::CreateGEP and IRBuilder::CreateInBoundsGEP to use ArrayRef. llvm-svn: 135761	2011-07-22 08:16:57 +00:00
Jakub Staszak	b82bbf40bb	Allow getBlockFreq to return 0. llvm-svn: 135742	2011-07-22 02:24:57 +00:00
Jakub Staszak	7987ea7460	Revert patch which broke some IfConversion tests. llvm-svn: 135738	2011-07-22 00:55:15 +00:00
Jakub Staszak	76d711582c	Fix typo in #include which revealed in the case-sensitive filesystem. llvm-svn: 135734	2011-07-22 00:39:00 +00:00
Jakub Staszak	44860314d2	Use MachineBranchProbabilityInfo instead of MachineLoopInfo in IfConversion. llvm-svn: 135724	2011-07-21 23:48:55 +00:00
Jakub Staszak	cb7c0a4927	Add missing getAnalysisUsage in MachineBlockFrequency. llvm-svn: 135714	2011-07-21 22:59:09 +00:00
Devang Patel	ddfe66e948	Refactor. llvm-svn: 135633	2011-07-20 23:00:27 +00:00
Devang Patel	8fb9fd6769	There are two ways to map a variable to its lexical scope. Lexical scope information is embedded in MDNode describing the variable. It is also available as a part of DebugLoc attached with DBG_VALUE instruction. DebugLoc attached with an instruction is less reliable in optimized code so use information embedded in the MDNode. llvm-svn: 135629	2011-07-20 22:18:50 +00:00
Devang Patel	bcd50a10d5	While emitting constant value, look through derived type and use underlying basic type to determine size and signness of the constant value. llvm-svn: 135627	2011-07-20 21:57:04 +00:00
Evan Cheng	bbf3b0de8b	Goodbye TargetAsmInfo. This eliminate last bit of CodeGen and Target in llvm-mc. There is still a bit more refactoring left to do in Targets. But we are now very close to fixing all the layering issues in MC. llvm-svn: 135611	2011-07-20 19:50:42 +00:00
Eli Friedman	6ed783228d	PR10421: Fix a straightforward bug in the widening logic for CONCAT_VECTORS. llvm-svn: 135595	2011-07-20 18:14:33 +00:00
Evan Cheng	efd9b4240f	- Move CodeModel from a TargetMachine global option to MCCodeGenInfo. - Introduce JITDefault code model. This tells targets to set different default code model for JIT. This eliminates the ugly hack in TargetMachine where code model is changed after construction. llvm-svn: 135580	2011-07-20 07:51:56 +00:00
Evan Cheng	76792992d6	Add MCObjectFileInfo and sink the MCSections initialization code from TargetLoweringObjectFileImpl down to MCObjectFileInfo. TargetAsmInfo is done to one last method. It's almost gone! llvm-svn: 135569	2011-07-20 05:58:47 +00:00
Evan Cheng	ccf243d56b	Fix an obvious typo that's preventing x86 (32-bit) from using .literal16. llvm-svn: 135535	2011-07-19 23:14:32 +00:00
Devang Patel	a59b24b090	Distinguish between two copies of one inlined variable. llvm-svn: 135528	2011-07-19 22:31:15 +00:00
Jay Foad	bf904773bb	Convert TargetData::getIndexedOffset to use ArrayRef. llvm-svn: 135478	2011-07-19 14:01:37 +00:00
Evan Cheng	2129f59637	Introduce MCCodeGenInfo, which keeps information that can affect codegen (including compilation, assembly). Move relocation model Reloc::Model from TargetMachine to MCCodeGenInfo so it's accessible even without TargetMachine. llvm-svn: 135468	2011-07-19 06:37:02 +00:00
Devang Patel	9ab3cac694	Revert r135423. llvm-svn: 135454	2011-07-19 00:28:24 +00:00
Bill Wendling	b20453faae	Add a frame with the compact unwind encoding if it exists. llvm-svn: 135450	2011-07-19 00:02:51 +00:00
Bill Wendling	6969ed6286	Rename CompactEncoding to CompactUnwindEncoding. llvm-svn: 135448	2011-07-19 00:00:58 +00:00
Bill Wendling	353404d924	Move the compact encoding from the target-specific library to the code-gen library. llvm-svn: 135443	2011-07-18 23:38:40 +00:00
Evan Cheng	67c033e6b8	Move getInitialFrameState from TargetFrameInfo to MCAsmInfo (suggestions for better location welcome). llvm-svn: 135438	2011-07-18 22:29:13 +00:00
Jeffrey Yasskin	7a16288157	Add APInt(numBits, ArrayRef<uint64_t> bigVal) constructor to prevent future ambiguity errors like the one corrected by r135261. Migrate all LLVM callers of the old constructor to the new one. llvm-svn: 135431	2011-07-18 21:45:40 +00:00
Evan Cheng	d60fa58ba1	Sink getDwarfRegNum, getLLVMRegNum, getSEHRegNum from TargetRegisterInfo down to MCRegisterInfo. Also initialize the mapping at construction time. This patch eliminate TargetRegisterInfo from TargetAsmInfo. It's another step towards fixing the layering violation. llvm-svn: 135424	2011-07-18 20:57:22 +00:00
Devang Patel	4dc76f2438	During bottom up fast-isel, instructions emitted to materialize registers are at top of basic block and do not have debug location. This may misguide debugger while entering the basic block and sometimes debugger provides semi useful view of current location to developer by picking up previous known location as current location. Assign a sensible location to the first instruction in a basic block, if it does not have one location derived from source file, so that debugger can provide meaningful user experience to developers in edge cases. [take 2] llvm-svn: 135423	2011-07-18 20:55:23 +00:00
Jakob Stoklund Olesen	c45d38e14a	Fix a crash when building 177.mesa for armv6. When splitting a live range immediately before an LDR_POST instruction that redefines the address register, make sure to use the correct value number in leaveIntvBefore. We need the value number entering the instruction. <rdar://problem/9793765> llvm-svn: 135413	2011-07-18 18:47:13 +00:00
Frits van Bommel	717d7edd3e	Migrate LLVM and Clang to use the new makeArrayRef(...) functions where previously explicit non-default constructors were used. Mostly mechanical with some manual reformatting. llvm-svn: 135390	2011-07-18 12:00:32 +00:00
Jakob Stoklund Olesen	c0dd3da9c5	Fix PR10387. When trying to rematerialize a value before an instruction that has an early-clobber redefine of the virtual register, make sure to look up the correct value number. Early-clobber defs are moved one slot back, so getBaseIndex is needed to find the used value number. Bugpoint was unable to reduce the test case for this, see PR10388. llvm-svn: 135378	2011-07-18 05:31:59 +00:00
Chris Lattner	229907cd11	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00

12775 Commits