llvm-project

Author	SHA1	Message	Date
Nadav Rotem	8824472a25	Improve code generation for vselect on SSE2: When checking the availability of instructions using the TLI, a 'promoted' instruction IS available. It means that the value is bitcasted to another type for which there is an operation. The correct check for the availability of an instruction is to check if it should be expanded. llvm-svn: 142542	2011-10-19 20:43:16 +00:00
Nadav Rotem	6652e22bad	Add support for the vector-widening of vselect and vector-setcc llvm-svn: 142488	2011-10-19 09:45:11 +00:00
Nick Lewycky	ac4c1860a3	Missed a spot! llvm-svn: 142436	2011-10-18 22:40:18 +00:00
Nick Lewycky	5ca33ac926	Fix some typo/formatting issues. No functionality change. llvm-svn: 142435	2011-10-18 22:39:43 +00:00
Nadav Rotem	75c2229f41	Fix a bug in the legalization of vector anyext-load and trunc-store. Mem Index starts with zero. llvm-svn: 142434	2011-10-18 22:32:43 +00:00
Bob Wilson	681561901d	Fix a DAG combiner assertion failure when constant folding BUILD_VECTORS. svn r139159 caused SelectionDAG::getConstant() to promote BUILD_VECTOR operands with illegal types, even before type legalization. For this testcase, that led to one BUILD_VECTOR with i16 operands and another with promoted i32 operands, which triggered the assertion. llvm-svn: 142370	2011-10-18 17:34:47 +00:00
Duncan Sands	d278d35b13	Fix a bunch of unused variable warnings when doing a release build with gcc-4.6. llvm-svn: 142350	2011-10-18 12:44:00 +00:00
Hal Finkel	bab66789d5	Fix comment to refer to correct instruction llvm-svn: 142334	2011-10-18 03:51:57 +00:00
Nick Lewycky	479a8fe75e	Minor style cleanup, no functionality change. llvm-svn: 142307	2011-10-17 23:27:36 +00:00
Nick Lewycky	40f8f2ff24	Add support for a new extension to the .file directive: .file filenumber "directory" "filename" This removes one join+split of the directory+filename in MC internals. Because bitcode files have independent fields for directory and filenames in debug info, this patch may change the .o files written by existing .bc files. llvm-svn: 142300	2011-10-17 23:05:28 +00:00
Bill Wendling	aa9047d3f5	Now Igor, throw the switch...give my creation life! Use the custom inserter for the ARM setjmp intrinsics. Instead of creating the SjLj dispatch table in IR, where it frequently violates several assumptions -- in particular assumptions made by the landingpad instruction about what can branch to a landing pad and what cannot. Performing this in the back-end allows us to violate these assumptions without the IR getting angry at us. It also allows us to perform a small optimization. We can shove the address of the dispatch's basic block into the function context and not have to add code around the setjmp to check for the return value and jump to the dispatch. Neat, huh? <rdar://problem/10116753> llvm-svn: 142294	2011-10-17 22:26:23 +00:00
Cameron Zwarich	d85bc104ef	When deleting a phi cycle after looking through copies, constrain the register to match its final use. With this change, all of test-suite compiles for Thumb2 with -verify-coalescing enabled. llvm-svn: 142287	2011-10-17 21:54:46 +00:00
Evan Cheng	aa563df759	Constrain register class with constrainRegClass() to CSE a virtual into another. rdar://10293289 llvm-svn: 142234	2011-10-17 19:50:12 +00:00
Bill Wendling	63a4ea1859	Correct over-zealous removal of hack. Some code wants to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	2a83a71c2a	Now that we have the ReturnsTwice function attribute, this method is obsolete. Check the attribute instead. <rdar://problem/8031714> llvm-svn: 142212	2011-10-17 18:22:52 +00:00
Chad Rosier	c17257c4cb	Removed set, but unused variable. Patch by Joe Abbey <jabbey@arxan.com>. llvm-svn: 142206	2011-10-17 18:01:59 +00:00
Devang Patel	69a4565e65	It is safe to speculate load from GOT. This fixes performance regression caused by r141689. Radar 10281206. llvm-svn: 142202	2011-10-17 17:35:01 +00:00
Nadav Rotem	486ff59a9f	Enable element promotion type legalization by default. Changed tests which assumed that vectors are legalized by widening them. llvm-svn: 142152	2011-10-16 20:31:33 +00:00
Benjamin Kramer	cc863b2bb6	Let printf do the formatting instead aligning strings ourselves. While at it, merge some format strings. llvm-svn: 142140	2011-10-16 16:30:34 +00:00
Benjamin Kramer	cb6b02a086	Twinify better. llvm-svn: 142139	2011-10-16 15:46:29 +00:00
Nadav Rotem	ebe13bc3f1	Move the legalization of vector loads and stores into LegalizeVectorOps. In some cases we need the second type-legalization pass in order to support all cases. llvm-svn: 142060	2011-10-15 07:41:10 +00:00
Bill Wendling	2730a0099a	Clear out the landing pad to call site map for each function. This isn't put into the 'clear()' method because the information needs to stick around (at least for a little bit) after the selection DAG is built. llvm-svn: 142032	2011-10-15 01:00:26 +00:00
Evan Cheng	06fdaeb5d9	A few 80-col violations. llvm-svn: 141988	2011-10-14 20:36:23 +00:00
Jakob Stoklund Olesen	06b6ccfe90	Update live-in lists when splitting critical edges. Fixes PR10814. Patch by Jan Sjödin! llvm-svn: 141960	2011-10-14 17:25:46 +00:00
Jim Grosbach	400907cc41	Fix typo. "__sync_fetch_and-xor_4" should be "__sync_fetch_and_xor_4". Pointed out by George Russell. llvm-svn: 141956	2011-10-14 15:53:48 +00:00
Jakob Stoklund Olesen	7fb5632e73	Add value numbers when spilling dead defs. When spilling around an instruction with a dead def, remember to add a value number for the def. The missing value number wouldn't normally create problems since there would be an incoming live range as well. However, due to another bug we could spill a dead V_SET0 instruction which doesn't read any values. The missing value number caused an empty live range to be created which is dangerous since it doesn't interfere with anything. This fixes part of PR11125. llvm-svn: 141923	2011-10-14 00:34:31 +00:00
Eric Christopher	76933f4c0b	Don't forget to reconstruct D after changing the scope that we're looking at. llvm-svn: 141892	2011-10-13 21:43:44 +00:00
Cameron Zwarich	86f7d3556c	Use an existing method. llvm-svn: 141855	2011-10-13 07:36:41 +00:00
Nick Lewycky	594a545821	If MI is deleted then remove it from the set. If a new MI is created, it could have the same address as the one we deleted, and we don't want that in the set yet. Noticed by inspection. llvm-svn: 141849	2011-10-13 02:16:18 +00:00
Nick Lewycky	404feb9973	Tabs to spaces. llvm-svn: 141844	2011-10-13 01:09:50 +00:00
Nick Lewycky	8488225984	Add missing braces to pacify GCC's -Wparentheses. llvm-svn: 141842	2011-10-13 00:54:59 +00:00
Jakob Stoklund Olesen	068dc91de9	Also inflate register classes around inline asm. Now that MI->getRegClassConstraint() can also handle inline assembly, don't bail when recomputing the register class of a virtual register used by inline asm. This fixes PR11078. llvm-svn: 141836	2011-10-12 23:37:40 +00:00
Jakob Stoklund Olesen	35b362fab2	Add MachineInstr::getRegClassConstraint(). Most instructions have some requirements for their register operands. Usually, this is expressed as register class constraints in the MCInstrDesc, but for inline assembly the constraints are encoded in the flag words. llvm-svn: 141835	2011-10-12 23:37:36 +00:00
Jakob Stoklund Olesen	1e73716eae	Extract a method for finding the inline asm flag operand. llvm-svn: 141834	2011-10-12 23:37:33 +00:00
Jakob Stoklund Olesen	24abd9d9b6	Encode register class constraints in inline asm instructions. The inline asm operand constraint is initially encoded in the virtual register for the operand, but that register class may change during coalescing, and the original constraint is lost. Encode the original register class as part of the flag word for each inline asm operand. This makes it possible to recover the actual constraint required by inline asm, just like we can for normal instructions. llvm-svn: 141833	2011-10-12 23:37:29 +00:00
Bill Wendling	3e5409df77	We need to verify that the machine instruction we're using as a replacement for our current machine instruction defines a register with the same register class as what's being replaced. This showed up in the SPEC 403.gcc benchmark, where it would ICE because a tail call was expecting one register class but was given another. (The machine instruction verifier catches this situation.) <rdar://problem/10270968> llvm-svn: 141830	2011-10-12 23:03:40 +00:00
Eli Friedman	979009ea61	Use a utility from MathExtras to clarify a check and avoid undefined behavior. Based on patch by Ahmed Charles. llvm-svn: 141829	2011-10-12 22:46:45 +00:00
Evan Cheng	b35afcaa56	Disable machine LICM speculation check (for profitability) until I have time to investigate the regressions. llvm-svn: 141813	2011-10-12 21:33:49 +00:00
Cameron Zwarich	2dffcebf77	To find the exiting VN of a LiveInterval from a block, use the previous slot rather than the previous index. If a block has a single instruction, the previous index may be in a different basic block. I have no clue how this used to work on all of test-suite, because now this failure is seen quite often when trying to compile code with -strong-phi-elim. This fixes PR10252. llvm-svn: 141812	2011-10-12 21:24:54 +00:00
Dan Gohman	de239d2647	Fix a thinko that Nick noticed. The previous code actually worked as intended, but only by accident. llvm-svn: 141779	2011-10-12 15:56:56 +00:00
Bill Wendling	918cea2c27	Expand the check for a landing pad so that it looks at the basic block's containing loop's header to see if that's a landing pad. If it is, then we don't want to hoist instructions out of the loop and above the header. llvm-svn: 141767	2011-10-12 02:58:01 +00:00
Jakob Stoklund Olesen	35163e21dc	Use an existing function. llvm-svn: 141763	2011-10-12 01:24:51 +00:00
Evan Cheng	af1389546e	Fix r141744. 1. The speculation check may not have been performed if the BB hasn't had a load LICM candidate. 2. If the candidate would be CSE'ed, then go ahead and speculatively LICM the instruction even if it's in high register pressure situation. llvm-svn: 141747	2011-10-12 00:09:14 +00:00
Evan Cheng	f192ca0761	Refine r141689 with a tri-state variable. Also teach MachineLICM to avoid "speculation" when register pressure is high. llvm-svn: 141744	2011-10-11 23:48:44 +00:00
Eric Christopher	6647b83087	Add a new wrapper node for a DILexicalBlock that encapsulates it and a file. Since it should only be used when necessary propagate it through the backend code generation and tweak testcases accordingly. This helps with code like in clang's test/CodeGen/debug-info-line.c where we have multiple #line directives within a single lexical block and want to generate only a single block that contains each file change. Part of rdar://10246360 llvm-svn: 141729	2011-10-11 22:59:11 +00:00
Eric Christopher	57d1692750	Formatting. llvm-svn: 141728	2011-10-11 22:59:04 +00:00
Bill Wendling	579ff6c39c	N.B. This is with the new EH scheme: The blocks with invokes have branches to the dispatch block, because that more correctly models the behavior of the CFG. The dispatch of course has edges to the landing pads. Those landing pads could contain invokes, which then have branches back to the dispatch. This creates a loop. The machine LICM pass looks at this loop and thinks it can hoist elements out of it. But because the dispatch is an alternate entry point into the program, the hoisted instructions won't be executed. I wasn't able to get a testcase which was small and could reproduce all of the time. The function_try_block.cpp in llvm-test was where this showed up. llvm-svn: 141726	2011-10-11 22:42:31 +00:00
Devang Patel	453d401a51	Add dominance check for the instruction being hoisted. For example, MachineLICM should not hoist a load that is not guaranteed to be executed. Radar 10254254. llvm-svn: 141689	2011-10-11 18:09:58 +00:00
Nadav Rotem	3283793c9a	Add support for legalization of vector SHL/SRA/SRL instructions llvm-svn: 141667	2011-10-11 14:36:35 +00:00
Nadav Rotem	198fe81571	Add support for legalization of vector trunc-store where the saved scalar type is illegal (for example, v2i16 on systems where the smallest store size is i32) llvm-svn: 141661	2011-10-11 11:25:16 +00:00
Nadav Rotem	b521b6037b	Cleanup the trunc-store legalization code and add asserts. llvm-svn: 141659	2011-10-11 10:04:25 +00:00
Devang Patel	478d5bc0d0	Revert r141569 and r141576. llvm-svn: 141594	2011-10-10 23:18:02 +00:00
Jakob Stoklund Olesen	add0c43ebb	Give targets a chance to expand even standard pseudos. Allow targets to expand COPY and other standard pseudo-instructions before they are expanded with copyPhysReg(). This allows the target to examine the COPY instruction for extra operands indicating it can be widened to a preferable super-register copy. See the ARM -widen-vmovs option. llvm-svn: 141578	2011-10-10 20:34:28 +00:00
Devang Patel	2689f95875	If loop header is also loop exiting block then it may not be safe to hoist instructions. llvm-svn: 141576	2011-10-10 20:32:03 +00:00
Devang Patel	e554d5995b	Add dominance check for the instruction being hoisted. For example, MachineLICM should not hoist a load that is not guaranteed to be executed. Radar 10254254. llvm-svn: 141569	2011-10-10 19:09:20 +00:00
Bill Wendling	e9574be6a3	Use the code that lowers the arguments and spills any values which are alive across unwind edges. This is for the back-end which expects such things. The code is from the original SjLj EH pass. llvm-svn: 141463	2011-10-08 00:56:47 +00:00
Bill Wendling	7ecfbd90ef	Thread the chain through the eh.sjlj.setjmp intrinsic, like it's documented to do. This will be useful later on with the new SJLJ stuff. llvm-svn: 141416	2011-10-07 21:25:38 +00:00
Andrew Trick	35c9e51219	PostRA scheduler fix. Clear stale loop dependencies. Fixes <rdar://problem/10235725> llvm-svn: 141357	2011-10-07 06:33:09 +00:00
Andrew Trick	4ef158335b	whitespace llvm-svn: 141356	2011-10-07 06:27:02 +00:00
Eli Friedman	1456cd20b4	Remove the old atomic intrinsics. autoupgrade functionality is included with this patch. llvm-svn: 141333	2011-10-06 23:20:49 +00:00
Bill Wendling	267f323d28	Modify the mapping from landing pad to call sites to accept more than one call site. llvm-svn: 141226	2011-10-05 22:24:35 +00:00
Bill Wendling	c2d55b6e50	Add an ivar that maps a landing pad's EH symbol to the call sites that may jump to the landing pad. This will be used by the back-end to generate the jump tables for dispatching the arriving longjmp in sjlj eh. llvm-svn: 141224	2011-10-05 22:20:38 +00:00
Bill Wendling	e61c62533e	Small refactoring. Cache the FunctionInfo->MBB into a local variable. llvm-svn: 141221	2011-10-05 22:16:11 +00:00
Jakob Stoklund Olesen	eb38bd8ced	Fix sub-register operand verification. PhysReg operands are not allowed to have sub-register indices at all. For virtual registers with sub-reg indices, check that all registers in the register class support the sub-reg index. llvm-svn: 141220	2011-10-05 22:12:57 +00:00
Bill Wendling	db1633530a	Fix comment to reflect the new EH stuff. llvm-svn: 141218	2011-10-05 22:04:08 +00:00
Jakob Stoklund Olesen	3abead76ea	Remove unused DstSubIdx argument. llvm-svn: 141214	2011-10-05 21:22:53 +00:00
Jakob Stoklund Olesen	f7957a9819	Simplify EXTRACT_SUBREG emission. EXTRACT_SUBREG is emitted as %dst = COPY %src:sub, so there is no need to constrain the %dst register class. RegisterCoalescer will apply the necessary constraints if it decides to eliminate the COPY. The %src register class does need to be constrained to something with the right sub-registers, though. This is currently done manually with COPY_TO_REGCLASS nodes. They can possibly be removed after this patch. llvm-svn: 141207	2011-10-05 20:26:40 +00:00
Jakob Stoklund Olesen	8ff52c4135	Simplify INSERT_SUBREG emission. The register class created by INSERT_SUBREG and SUBREG_TO_REG must be legal and support the SubIdx sub-registers. The new getSubClassWithSubReg() hook can compute that. This may create INSERT_SUBREG instructions defining a larger register class than the sub-register being inserted. That is OK, RegisterCoalescer will constrain the register class as needed when it eliminates the INSERT_SUBREG instructions. llvm-svn: 141198	2011-10-05 18:31:00 +00:00
Jakob Stoklund Olesen	ccdfbfb5e5	Add a FIXME. TwoAddressInstructionPass should annotate instructions with <undef> flags when it lowers REG_SEQUENCE instructions. LiveIntervals should not be in the business of modifying code (except for kill flags, perhaps). llvm-svn: 141187	2011-10-05 16:51:21 +00:00
Jakob Stoklund Olesen	d5d39bb098	Also add <imp-use,kill> flags for redefined super-registers. For example: %vreg10:dsub_0<def,undef> = COPY %vreg1 %vreg10:dsub_1<def> = COPY %vreg2 is rewritten as: %D2<def> = COPY %D0, %Q1<imp-def> %D3<def> = COPY %D1, %Q1<imp-use,kill>, %Q1<imp-def> The first COPY doesn't care about the previous value of %Q1, so it doesn't read that register. The second COPY is a partial redefinition of %Q1, so it implicitly kills and redefines that register. This makes it possible to recognize instructions that can harmlessly clobber the full super-register. They write and don't read the super-register. llvm-svn: 141139	2011-10-05 00:01:48 +00:00
Jakob Stoklund Olesen	9d5bda9be1	Also add <def,undef> flags when coalescing sub-registers. RegisterCoalescer can create sub-register defs when it is joining a register with a sub-register. Add <undef> flags to these new sub-register defs where appropriate. llvm-svn: 141138	2011-10-05 00:01:46 +00:00
Owen Anderson	0ca562ec4c	Teach the MC to output code/data region marker labels in MachO and ELF modes. These are used by disassemblers to provide better disassembly, particularly on targets like ARM Thumb that like to intermingle data in the TEXT segment. llvm-svn: 141135	2011-10-04 23:26:17 +00:00
Bill Wendling	3d11aa7e75	Create a mapping between the landing pad basic block and the call site index for later use. llvm-svn: 141125	2011-10-04 22:00:35 +00:00
Jakob Stoklund Olesen	10f2de3261	Allow <undef> flags on def operands as well as uses. The <undef> flag says that a MachineOperand doesn't read its register, or doesn't depend on the previous value of its register. A full register def never depends on the previous register value. A partial register def may depend on the previous value if it is intended to update part of a register. For example: %vreg10:dsub_0<def,undef> = COPY %vreg1 %vreg10:dsub_1<def> = COPY %vreg2 The first copy instruction defines the full %vreg10 register with the bits not covered by dsub_0 defined as <undef>. It is not considered a read of %vreg10. The second copy modifies part of %vreg10 while preserving the rest. It has an implicit read of %vreg10. This patch adds a MachineOperand::readsReg() method to determine if an operand reads its register. Previously, this was modelled by adding a full-register <imp-def> operand to the instruction. This approach makes it possible to determine directly from a MachineOperand if it reads its register. No scanning of MI operands is required. llvm-svn: 141124	2011-10-04 21:49:33 +00:00
Bill Wendling	ac3fb4c078	Generic cleanup. llvm-svn: 141050	2011-10-04 00:16:40 +00:00
Bill Wendling	97a8695fff	Don't carry over the dispatchsetup hack from the old system. llvm-svn: 141040	2011-10-03 22:42:40 +00:00
Bill Wendling	6f3e73d6ad	Move the grabbing of the jump buffer into the caller function, eliminating the need for returning a std::pair. llvm-svn: 141026	2011-10-03 21:15:28 +00:00
Eric Christopher	cead033ced	Whitespace. llvm-svn: 141005	2011-10-03 15:49:20 +00:00
Eric Christopher	f84354bfb1	Typo. llvm-svn: 141004	2011-10-03 15:49:16 +00:00
Nadav Rotem	52e8ed9214	Moved type construction out of the loop and added an assert on the legality of the type. Formatted lines to the 80 char limit. llvm-svn: 140952	2011-10-01 18:39:28 +00:00
Bill Wendling	9925f197cc	When inferring the pointer alignment, if the global doesn't have an initializer and the alignment is 0 (i.e., it's defined globally in one file and declared in another file) it could get an alignment which is larger than the ABI allows for that type, resulting in aligned moves being used for unaligned loads. For instance, in file A.c: struct S s; In file B.c: struct { // something long }; extern S s; void foo() { struct S p = s; // ... } this copy is a 'memcpy' which is turned into a series of 'movaps' instructions on X86. But this is wrong, because 'struct S' has alignment of 4, not 16. llvm-svn: 140902	2011-09-30 23:19:55 +00:00
Nick Lewycky	f40df1d46c	Promote comment to doxycomment. Adjust whitespace. No functionality change. llvm-svn: 140899	2011-09-30 22:19:53 +00:00
Jakob Stoklund Olesen	1352be2bd3	Move getCommonSubClass() into TRI. It will soon need the context. llvm-svn: 140896	2011-09-30 22:18:51 +00:00
Torok Edwin	be5020eb95	Comment grammar fixes. thanks to Duncan. llvm-svn: 140850	2011-09-30 13:07:47 +00:00
Torok Edwin	319a1415b8	Instead of crashing when MCAsmInfo is NULL, add an assert. This helps with porting code from 2.9 to 3.0 as TargetSelect.h changed location, and if you include the old one by accident you will trigger this assert. llvm-svn: 140848	2011-09-30 12:31:57 +00:00
Eli Friedman	95031ed837	Clean up uses of switch instructions so they are not dependent on the operand ordering. Patch by Stepan Dyatkovskiy. llvm-svn: 140803	2011-09-29 20:21:17 +00:00
Duncan Sands	cac86805bf	Place this bracket according to the LLVM style. llvm-svn: 140784	2011-09-29 16:01:46 +00:00
Jakob Stoklund Olesen	463b05a2d0	Remove NumImplicitOps which is now unused. llvm-svn: 140767	2011-09-29 01:47:36 +00:00
Eric Christopher	d299dccf91	Use the local we already set up. llvm-svn: 140745	2011-09-29 00:50:59 +00:00
Jakob Stoklund Olesen	2318d1e0e9	Rewrite MachineInstr::addOperand() to avoid NumImplicitOps. The function needs to scan the implicit operands anyway, so no performance is won by caching the number of implicit operands added to an instruction. This also fixes a bug when adding operands after an implicit operand has been added manually. The NumImplicitOps count wasn't kept up to date. MachineInstr::addOperand() will now consistently place all explicit operands before all the implicit operands, regardless of the order they are added. It is possible to change an MI opcode and add additional explicit operands. They will be inserted before any existing implicit operands. The only exception is inline asm instructions where operands are never reordered. This is because of a hack that marks explicit clobber regs on inline asm as <implicit-def> to please the fast register allocator. This hack can go away when InstrEmitter and FastIsel can add exact <dead> flags to physreg defs. llvm-svn: 140744	2011-09-29 00:40:51 +00:00
Bill Wendling	899da52d60	Have the SjLjEHPrepare pass do some more heavy lifting. Upon further review, most of the EH code should remain written at the IR level. The part which breaks SSA form is the dispatch table, so that part will be moved to the back-end. llvm-svn: 140730	2011-09-28 21:56:53 +00:00
Duncan Sands	2e67937f76	A typeid of zero means a cleanup, not a catch. This case occurs when there is both a catch and a cleanup. Correct the comment. llvm-svn: 140686	2011-09-28 09:13:02 +00:00
Bill Wendling	baf3941fde	Strip off pointer casts when looking at the eh.sjlj.functioncontext's argument. llvm-svn: 140678	2011-09-28 03:52:41 +00:00
Bill Wendling	225e8481b0	Bitcast the alloca to an i8* to match the intrinsic's signature. llvm-svn: 140677	2011-09-28 03:47:11 +00:00
Bill Wendling	66b110f571	Create and use an llvm.eh.sjlj.functioncontext intrinsic. This intrinsic is used to pass the index of the function context to the back-end for further processing. The back-end is in charge of filling in the rest of the entries. llvm-svn: 140676	2011-09-28 03:36:43 +00:00
Bill Wendling	2e76ca9d9a	In the new EH model, setup the function context and the call site info. The DWARF exception pass uses the call site information, which is set up here. A pre-RA pass is too late for it to use this information. So create and setup the function context here, and then insert the call site values here (and map the call sites for the DWARF EH pass). This is simpler than the original pass, and doesn't make the CFG lose its SSA-ness. It's a win-win-win-win-lose-win-win situation. llvm-svn: 140675	2011-09-28 03:14:05 +00:00
Bill Wendling	e6138e3ad1	Don't conditionalize execution of the SjLj EH prepare pass. We may need an SjLj EH preparation pass for some call site information, at least in the short term. llvm-svn: 140674	2011-09-28 03:07:34 +00:00
Jakob Stoklund Olesen	bd5109f14d	Rename class and clean up source. No functional change intended. llvm-svn: 140664	2011-09-28 00:01:56 +00:00
Jakob Stoklund Olesen	934b7d7645	Rename SSEDomainFix -> lib/CodeGen/ExecutionDepsFix. I'll clean up the source in the next commit. llvm-svn: 140663	2011-09-28 00:01:54 +00:00
Bill Wendling	354ff9e348	This is the start of the new SjLj EH preparation pass, which will replace the current IR-level pass. The old SjLj EH pass has some problems, especially with the new EH model. Most significantly, it violates some of the new restrictions the new model has. For instance, the 'dispatch' table wants to jump to the landing pad, but we cannot allow that because only an invoke's unwind edge can jump to a landing pad. This requires us to mangle the code something awful. In addition, we need to keep the now dead landingpad instructions around instead of CSE'ing them because the DWARF emitter uses that information (they are dead because no control flow edge will execute them - the control flow edge from an invoke's unwind is superseded by the edge coming from the dispatch). Basically, this pass belongs not at the IR level where SSA is king, but at the code-gen level, where we have more flexibility. llvm-svn: 140646	2011-09-27 22:14:12 +00:00
Cameron Zwarich	7a6e8f2c5d	Remove an invalid assert that is really just asserting when the scheduler emits a suboptimal schedule. llvm-svn: 140643	2011-09-27 21:59:16 +00:00
Jim Grosbach	af136f71ec	Rename AddSelectionDAGCSEId() to addSelectionDAGCSEId(). Naming conventions consistency. No functional change. llvm-svn: 140636	2011-09-27 20:59:33 +00:00
Nadav Rotem	38b3b83362	Cleanup PromoteIntOp_EXTRACT_VECTOR_ELT and PromoteIntRes_SETCC. Add a new method: getAnyExtOrTrunc and use it to replace the manual check. llvm-svn: 140603	2011-09-27 11:16:47 +00:00
Nadav Rotem	1b857d2762	Revert r140463; The patch assumes that <4 x i1> is saved to memory as 4 x i8, while the decision is to bit-pack small values. llvm-svn: 140601	2011-09-27 10:48:29 +00:00
James Molloy	0ceb8cadd2	Fix emission of debug data for global variables. getContext() on DIGlobalVariables is not valid any more. llvm-svn: 140539	2011-09-26 17:40:42 +00:00
Jakob Stoklund Olesen	df977fedb6	Add target hook for pseudo instruction expansion. Many targets use pseudo instructions to help register allocation. Like the COPY instruction, these pseudos can be expanded after register allocation. The early expansion can make life easier for PEI and the post-ra scheduler. This patch adds a hook that is called for all remaining pseudo instructions from the ExpandPostRAPseudos pass. llvm-svn: 140472	2011-09-25 19:21:35 +00:00
Nadav Rotem	2279949129	[vector-select] Address one of the issues in pr10902. EXTRACT_VECTOR_ELEMENT SDNodes may return values which are wider than the incoming element types. In this patch we fix the integer promotion of these nodes. Fixes spill-q.ll when running -promote-elements. llvm-svn: 140471	2011-09-25 18:59:42 +00:00
Jakob Stoklund Olesen	fd719d184e	Clean up code after renaming LowerSubregs -> ExpandPostRAPseudos. No functional change intended. llvm-svn: 140470	2011-09-25 16:46:08 +00:00
Jakob Stoklund Olesen	f152df1e6b	Rename LowerSubregs to ExpandPostRAPseudos. I'll fix the file contents in the next commit. This pass is currently expanding the COPY and SUBREG_TO_REG pseudos. I am going to add a hook so targets can expand more pseudo-instructions after register allocation. Many targets have pseudo-instructions that assist the register allocator. They can be expanded after register allocation, before PEI and PostRA scheduling. llvm-svn: 140469	2011-09-25 16:46:00 +00:00
Nadav Rotem	c2deabd202	Implement Duncan's suggestion to use the result of getSetCCResultType if it is legal (this is always the case for scalars), otherwise use the promoted result type. Fix test/CodeGen/X86/vsplit-and.ll when promote-elements is enabled. llvm-svn: 140464	2011-09-24 19:48:19 +00:00
Nadav Rotem	77426a754b	[Vector-Select] Address one of the problems in 10902. When generating the trunc-store of i1's, we need to use the vector type and not the scalar type. This patch fixes the assertion in CodeGen/Generic/bool-vector.ll when running with -promote-elements. llvm-svn: 140463	2011-09-24 18:32:19 +00:00
Jakob Stoklund Olesen	3bb99bc957	Verify that terminators follow non-terminators. This exposes a -segmented-stacks bug. llvm-svn: 140429	2011-09-23 22:45:39 +00:00
Eli Friedman	8a15a5aa93	PR10998: It is not legal to sink an instruction past the terminator of a block; make sure we don't do that. llvm-svn: 140428	2011-09-23 22:41:57 +00:00
Duncan Sands	b461176cfb	Tweak the handling of MERGE_VALUES nodes: remove the need for DecomposeMERGE_VALUES to "know" that results are legalized in a particular order, by passing it the number of the result being legalized (the type legalization core provides this, it just needs to be passed on). llvm-svn: 140373	2011-09-23 13:59:22 +00:00
Nadav Rotem	57e30726ad	Vector-Select: Address one of the problems in pr10902. Add handling for the integer-promotion of CONCAT_VECTORS. Test: test/CodeGen/X86/widen_shuffle-1.ll This patch fixes the above tests (when running in with -promote-elements). llvm-svn: 140372	2011-09-23 09:33:24 +00:00
Dan Gohman	e83e1b2d2c	Fix SimplifySelectCC to add newly created nodes to the DAGCombiner worklist, as it may be possible to perform further optimization on them. llvm-svn: 140349	2011-09-22 23:01:29 +00:00
Jakob Stoklund Olesen	e92e5ee81f	Constrain register classes instead of emitting copies. Sometimes register class constraints are trivial, like GR32->GR32_NOSP, or GPR->rGPR. Teach InstrEmitter to simply constrain the virtual register instead of emitting a copy in these cases. Normally, these copies are handled by the coalescer. This saves some coalescer work. llvm-svn: 140340	2011-09-22 21:39:34 +00:00
Jakob Stoklund Olesen	0f36544c08	Add a MinNumRegs argument to MRI::constrainRegClass(). The function will refuse to use a register class with fewer registers than MinNumRegs. This can be used by clients to avoid accidentally increasing register pressure too much. The default value of MinNumRegs=0 doesn't affect how constrainRegClass() works. llvm-svn: 140339	2011-09-22 21:39:31 +00:00
Bill Wendling	a58fde665a	Use the C personality function instead of the C++ personality function. llvm-svn: 140318	2011-09-22 17:56:40 +00:00
Devang Patel	5e6b65cf0d	Do not unnecessarily use AT_specification DIE because it does not add any value. Few weeks ago, llvm completely inverted the debug info graph. Earlier each debug info node used to keep track of its compile unit, now compile unit keeps track of important nodes. One impact of this change is that the global variable's do not have any context, which should be checked before deciding to use AT_specification DIE. llvm-svn: 140282	2011-09-21 23:41:11 +00:00
Bill Wendling	7b3fc8ee38	Attempt to update the shadow stack GC pass to the new EH model. This inserts a cleanup landingpad instruction and a resume to mimic the old unwind instruction. llvm-svn: 140277	2011-09-21 22:14:28 +00:00
Jim Grosbach	098f5a2911	Tidy up. Whitespace. llvm-svn: 140275	2011-09-21 21:36:53 +00:00
Nadav Rotem	bc9ba30158	[VECTOR-SELECT] Address one of the bugs in pr10902. Vector SetCC result types need to be type-legalized. This code worked before because scalar result types are known to be legal. llvm-svn: 140249	2011-09-21 14:34:38 +00:00
Andrew Trick	924123acb3	Lower ARM adds/subs to add/sub after adding optional CPSR operand. This is still a hack until we can teach tblgen to generate the optional CPSR operand rather than an implicit CPSR def. But the strangeness is now limited to the selection DAG. ADD/SUB MI's no longer have implicit CPSR defs, nor do we allow flag setting variants of these opcodes in machine code. There are several corner cases to consider, and getting one wrong would previously lead to nasty miscompilation. It's not the first time I've debugged one, so this time I added enough verification to ensure it won't happen again. llvm-svn: 140228	2011-09-21 02:20:46 +00:00
Bruno Cardoso Lopes	6cb23f6e7f	Add a DAGCombine for subvector extracts to remove useless chains of subvector inserts and extracts. Initial patch by Rackover, Zvi with some tweak done by me. llvm-svn: 140204	2011-09-20 23:19:33 +00:00
Andrew Trick	52363bdbeb	Restore hasPostISelHook tblgen flag. No functionality change. The hook makes it explicit which patterns require "special" handling. i.e. it self-documents tblgen deficiencies. I plan to add verification in ExpandISelPseudos and Thumb2SizeReduce to catch any missing hasPostISelHooks. Otherwise it's too fragile. llvm-svn: 140160	2011-09-20 18:22:31 +00:00
Andrew Trick	8586e62d91	ARM isel bug fix for adds/subs operands. Modified ARMISelLowering::AdjustInstrPostInstrSelection to handle the full gamut of CPSR defs/uses including instructions whose "optional" cc_out operand is not really optional. This allowed removal of the hasPostISelHook to simplify the .td files and make the implementation more robust. Fixes rdar://10137436: sqlite3 miscompile llvm-svn: 140134	2011-09-20 03:17:40 +00:00
Andrew Trick	53df4b6dfa	whitespace llvm-svn: 140133	2011-09-20 03:06:13 +00:00
Nadav Rotem	7aaa0aa7a7	white space cleanups llvm-svn: 139994	2011-09-18 10:29:29 +00:00
Benjamin Kramer	67b014b2c2	Namespacify. llvm-svn: 139892	2011-09-16 00:35:06 +00:00
Jakob Stoklund Olesen	e2c92a3112	Spill mode: Hoist back-copies locally. The leaveIntvAfter() function normally inserts a back-copy after the requested instruction, making the back-copy kill the live range. In spill mode, try to insert the back-copy before the last use instead. That means the last use becomes the kill instead of the back-copy. This lowers the register pressure because the last use can now redefine the same register it was reading. This will also improve compile time: The back-copy isn't a kill, so hoisting it in hoistCopiesForSize() won't force a recomputation of the source live range. Similarly, if the back-copy isn't hoisted by the splitter, the spiller will not attempt hoisting it locally. llvm-svn: 139883	2011-09-16 00:03:35 +00:00
Jakob Stoklund Olesen	e8339b2e63	Disable local spill hoisting for non-killing copies. If the source register is live after the copy being spilled, there is no point to hoisting it. Hoisting inside a basic block only serves to resolve interferences by shortening the live range of the source. llvm-svn: 139882	2011-09-16 00:03:33 +00:00
Eli Friedman	ee8f14a799	Some legalization fixes for atomic load and store. llvm-svn: 139851	2011-09-15 21:20:49 +00:00
Jakob Stoklund Olesen	bceb9e5c05	Add an option to disable spill hoisting. When -split-spill-mode is enabled, spill hoisting is performed by SplitKit instead of by InlineSpiller. This hidden command line option is for testing the splitter spill mode. llvm-svn: 139845	2011-09-15 21:06:00 +00:00
Jakob Stoklund Olesen	53e2e48de7	VirtRegMap is counting spill slots, not register spills. Fix the stats counters to reflect that. llvm-svn: 139819	2011-09-15 18:31:13 +00:00
Jakob Stoklund Olesen	c94c967656	Count correctly when a COPY turns into a spill or reload. The number of spills could go negative since a folded COPY is just a spill, and it may be eliminated. llvm-svn: 139815	2011-09-15 18:22:52 +00:00
Jakob Stoklund Olesen	37eb6962c6	Count inserted spills and reloads more accurately. Adjust counters when removing spill and reload instructions. We still don't account for reloads being removed by eliminateDeadDefs(). llvm-svn: 139806	2011-09-15 17:54:28 +00:00
Jakob Stoklund Olesen	07b3503f8b	Trace through sibling PHIs in bulk. When traceSiblingValue() encounters a PHI-def value created by live range splitting, don't look at all the predecessor blocks. That can be very expensive in a complicated CFG. Instead, consider that all the non-PHI defs jointly dominate all the PHI-defs. Tracing directly to all the non-PHI defs is much faster than zipping around in the CFG when there are many PHIs with many predecessors. This significantly improves compile time for indirectbr interpreters. llvm-svn: 139797	2011-09-15 16:41:12 +00:00
Jakob Stoklund Olesen	b8b1d4c435	Speed up LiveIntervals::shrinkToUse with some caching. Blocks with multiple PHI successors only need to go on the worklist once. Use a SmallPtrSet to track the live-out blocks that have already been handled. This is a lot faster than the two live range check we would otherwise do. Also stop recomputing hasPHIKill flags. Like RenumberValues(), it is conservatively correct to leave them in, and they are not used for anything important. llvm-svn: 139792	2011-09-15 15:24:16 +00:00
Jakob Stoklund Olesen	fb75d78d33	Revert r139782, "RemoveCopyByCommutingDef doesn't need hasPHIKill()." It does, after all. RemoveCopyByCommutingDef rewrites the uses of one particular value number in A. It doesn't know how to rewrite phi uses, so there can't be any. llvm-svn: 139787	2011-09-15 06:27:32 +00:00
Jakob Stoklund Olesen	4c099551f9	Stop verifying hasPHIKill() flags. There is only one legitimate use remaining, in addIntervalsForSpills(). All other calls to hasPHIKill() are only used to update PHIKill flags. The addIntervalsForSpills() function is part of the old spilling framework, only used by linearscan. llvm-svn: 139783	2011-09-15 05:16:30 +00:00
Jakob Stoklund Olesen	0499e7bbd0	RemoveCopyByCommutingDef doesn't need hasPHIKill(). Instead, let HasOtherReachingDefs() test for defs in B that overlap any phi-defs in A as well. This test is slightly different, but almost identical. A perfectly precise test would only check those phi-defs in A that are reachable from AValNo. llvm-svn: 139782	2011-09-15 05:03:50 +00:00
Jakob Stoklund Olesen	dca022e377	It is safe to remat a value killed by phis. The source live range is recomputed using shrinkToUses() which does handle phis correctly. The hasPHIKill() condition was relevant in the old days when ReMaterializeTrivialDef() tried to recompute the live range itself. The shrinkToUses() function will mark the original def as dead when no more uses and phi kills remain. It is then removed by runOnMachineFunction(). llvm-svn: 139781	2011-09-15 04:52:06 +00:00
Jakob Stoklund Olesen	e7ca8ecd92	Leave hasPHIKill flags alone in LiveInterval::RenumberValues. It is conservatively correct to keep the hasPHIKill flags, even after deleting PHI-defs. The calculation can be very expensive after taildup has created a quadratic number of indirectbr edges in the CFG, and the hasPHIKill flag isn't used for anything after RenumberValues(). llvm-svn: 139780	2011-09-15 04:37:18 +00:00
Andrew Trick	76a86d3d4c	[regcoalescing] bug fix for RegistersDefinedFromSameValue. An improper SlotIndex->VNInfo lookup was leading to unsafe copy removal. Fixes PR10920 401.bzip2 miscompile with no IV rewrite. llvm-svn: 139765	2011-09-15 01:09:33 +00:00
Devang Patel	04d6d47865	Add support to emit debug info for C++0x nullptr type. llvm-svn: 139751	2011-09-14 23:13:28 +00:00
Jakob Stoklund Olesen	811b9c475d	Ignore the cloning of unknown registers. The LRE_DidCloneVirtReg callback may be called with virtual registers that RAGreedy doesn't even know about yet. In that case, there are no data structures to update. llvm-svn: 139702	2011-09-14 17:34:37 +00:00
Jakob Stoklund Olesen	a98af39856	Hoist back-copies to the least busy dominator. When a back-copy is hoisted to the nearest common dominator, keep looking up the dominator tree for a less loopy dominator, and place the back-copy there instead. Don't do this when a single existing back-copy dominates all the others. Assume the client knows what he is doing, and keep the dominating back-copy. This prevents us from hoisting back-copies into loops in most cases. If a value is defined in a loop with multiple exits, we may still hoist back-copies into that loop. That is the speed/size tradeoff. llvm-svn: 139698	2011-09-14 16:45:39 +00:00
Nadav Rotem	d748dbacb0	Add integer promotion support for vselect llvm-svn: 139692	2011-09-14 14:42:15 +00:00
Jakob Stoklund Olesen	5d4277ddfa	Distinguish complex mapped values from forced recomputation. When a ParentVNI maps to multiple defs in a new interval, its live range may still be derived directly from RegAssign by transferValues(). On the other hand, when instructions have been rematerialized or hoisted, it may be necessary to completely recompute live ranges using LiveRangeCalc::extend() to all uses. Use a bit in the value map to indicate that a live range must be recomputed. Rename markComplexMapped() to forceRecompute(). This fixes some live range verification errors when -split-spill-mode=size hoists back-copies by recomputing source ranges when RegAssign kills can't be moved. llvm-svn: 139660	2011-09-13 23:09:04 +00:00
Jakob Stoklund Olesen	a25330f0d7	Implement -split-spill-mode=size. Whenever the complement interval is defined by multiple copies of the same value, hoist those back-copies to the nearest common dominator. This ensures that at most one copy is inserted per value in the complement interval, and no phi-defs are needed. llvm-svn: 139651	2011-09-13 22:22:39 +00:00
Eli Friedman	f78c6a83ee	Fix check for unaligned load/store so it doesn't catch over-aligned load/store. llvm-svn: 139649	2011-09-13 22:19:59 +00:00
Eli Friedman	f1518216fd	Error out on CodeGen of unaligned load/store. Fix test so it isn't accidentally testing that case. llvm-svn: 139641	2011-09-13 20:50:54 +00:00
Nadav Rotem	66dc9ae08d	Fix the assertion which checks the size of the input operand. llvm-svn: 139633	2011-09-13 20:03:38 +00:00
Nadav Rotem	52202fbf2d	Add vselect target support for targets that do not support blend but do support xor/and/or (For example SSE2). llvm-svn: 139623	2011-09-13 19:17:42 +00:00
Devang Patel	f9e2ae9b05	Use a cache to maintain list of machine basic blocks for a given UserValue. llvm-svn: 139616	2011-09-13 18:40:53 +00:00
Jakob Stoklund Olesen	4484f99175	Add SplitEditor::markOverlappedComplement(). This function is used to flag values where the complement interval may overlap other intervals. Call it from overlapIntv, and use the flag to fully recompute those live ranges in transferValues(). llvm-svn: 139612	2011-09-13 18:05:29 +00:00
Jakob Stoklund Olesen	820c8fd0db	Eliminate the extendRange() wrapper. llvm-svn: 139608	2011-09-13 17:38:57 +00:00
Jakob Stoklund Olesen	0494c5c35d	Switch extendInBlock() to take a kill slot instead of the last use slot. Three out of four clients prefer this interface which is consistent with extendIntervalEndTo() and LiveRangeCalc::extend(). llvm-svn: 139604	2011-09-13 16:47:56 +00:00
Jakob Stoklund Olesen	054984d75b	Use a separate LiveRangeCalc for the complement in spill modes. The complement interval may overlap the other intervals created, so use a separate LiveRangeCalc instance to compute its live range. A LiveRangeCalc instance can only be shared among non-overlapping intervals. llvm-svn: 139603	2011-09-13 16:47:53 +00:00
NAKAMURA Takumi	cac923b556	Unbreak msvc. llvm-svn: 139581	2011-09-13 03:58:34 +00:00
Jakob Stoklund Olesen	487f2a37bf	Extract live range calculations from SplitKit. SplitKit will soon need two copies of these data structures, and the algorithms will also be useful when LiveIntervalAnalysis becomes independent of LiveVariables. llvm-svn: 139572	2011-09-13 01:34:21 +00:00
Bill Wendling	ac5a883624	Introduce a bit of a hack. Splitting a landing pad takes considerable care because of PHIs and other nasties. The problem is that the jump table needs to jump to the landing pad block. However, the landing pad block can be jumped to only by an invoke instruction. So we clone the landingpad instruction into its own basic block, have the invoke jump to there. The landingpad instruction's basic block's successor is now the target for the jump table. But because of PHI nodes, we need to create another basic block for the jump table to jump to. This is definitely a hack, because the values for the PHI nodes may not be defined on the edge from the jump table. But that's okay, because the jump table is simply a construct to mimic what is happening in the CFG. So the values are mysteriously there, even though there is no value for the PHI from the jump table's edge (hence calling this a hack). llvm-svn: 139545	2011-09-12 21:56:59 +00:00
Jakob Stoklund Olesen	45df7e0f22	Remove the -compact-regions flag. It has been enabled by default for a while, it was only there to allow performance comparisons. llvm-svn: 139501	2011-09-12 16:54:42 +00:00
Jakob Stoklund Olesen	eecb2fb183	Add an interface for SplitKit complement spill modes. SplitKit always computes a complement live range to cover the places where the original live range was live, but no explicit region has been allocated. Currently, the complement live range is created to be as small as possible - it never overlaps any of the regions. This minimizes register pressure, but if the complement is going to be spilled anyway, that is not very important. The spiller will eliminate redundant spills, and hoist others by making the spill slot live range overlap some of the regions created by splitting. Stack slots are cheap. This patch adds the interface to enable spill modes in SplitKit. In spill mode, SplitKit will assume that the complement is going to spill, so it will allow it to overlap regions in order to avoid back-copies. By doing some of the spiller's work early, the complement live range becomes simpler. In some cases, it can become much simpler because no extra PHI-defs are required. This will speed up both splitting and spilling. This is only the interface to enable spill modes, no implementation yet. llvm-svn: 139500	2011-09-12 16:49:21 +00:00
Jakob Stoklund Olesen	72c0ddfbc4	Update comments to reflect some (not so) recent changes. llvm-svn: 139498	2011-09-12 16:03:26 +00:00
Richard Trieu	78a812bf2d	Fix asserts in CodeGen from: assert("error"); to: assert(0 && "error"); llvm-svn: 139449	2011-09-10 01:07:54 +00:00
Chris Lattner	e74e0c8020	tidy up a bit llvm-svn: 139419	2011-09-09 22:06:59 +00:00
Eli Friedman	b7910b79f5	Make the SelectionDAG verify that all the operands of BUILD_VECTOR have the same type. Teach DAGCombiner::visitINSERT_VECTOR_ELT not to make invalid BUILD_VECTORs. Fixes PR10897. llvm-svn: 139407	2011-09-09 21:04:06 +00:00
Jakob Stoklund Olesen	278bf02581	Reapply r139247: Cache intermediate results during traceSiblingValue. In some cases such as interpreters using indirectbr, the CFG can be very complicated, and live range splitting may be forced to insert a large number of phi-defs. When that happens, traceSiblingValue can spend a lot of time zipping around in the CFG looking for defs and reloads. This patch causes more information to be cached in SibValues, and the cached values are used to terminate searches early. This speeds up spilling by 20x in one interpreter test case. For more typical code, this is just a 10% speedup of spilling. The previous version had bugs that caused miscompilations. They have been fixed. llvm-svn: 139378	2011-09-09 18:11:41 +00:00
Devang Patel	9d904e1a97	Directly point debug info to the stack slot of the argument, instead of trying to keep track of vreg in which the argument is copied. The LiveDebugVariable can keep track of variable's ranges. llvm-svn: 139330	2011-09-08 22:59:09 +00:00
Jakob Stoklund Olesen	946e0a4665	Revert r139247 "Cache intermediate results during traceSiblingValue." It broke the self host and clang-x86_64-darwin10-RA. llvm-svn: 139259	2011-09-07 21:43:52 +00:00
Jakob Stoklund Olesen	b77d5c1484	Cache intermediate results during traceSiblingValue. In some cases such as interpreters using indirectbr, the CFG can be very complicated, and live range splitting may be forced to insert a large number of phi-defs. When that happens, traceSiblingValue can spend a lot of time zipping around in the CFG looking for defs and reloads. This patch causes more information to be cached in SibValues, and the cached values are used to terminate searches early. This speeds up spilling by 20x in one interpreter test case. For more typical code, this is just a 10% speedup of spilling. llvm-svn: 139247	2011-09-07 19:07:31 +00:00
James Molloy	4c493e8050	Refactor instprinter and mcdisassembler to take a SubtargetInfo. Add -mattr= handling to llvm-mc. Reviewed by Owen Anderson. llvm-svn: 139237	2011-09-07 17:24:38 +00:00
Eli Friedman	e978d2f644	Relax the MemOperands on atomics a bit. Fixes -verify-machineinstrs failures for atomic load/store on ARM. (The fix for the related failures on x86 is going to be nastier because we actually need Acquire memoperands attached to the atomic load instrs, etc.) llvm-svn: 139221	2011-09-07 02:23:42 +00:00
Devang Patel	9de7a7db26	While sinking machine instructions, sink matching DBG_VALUEs also otherwise live debug variable pass will drop DBG_VALUEs on the floor. llvm-svn: 139208	2011-09-07 00:07:58 +00:00
Duncan Sands	f2641e1bc1	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. llvm-svn: 139159	2011-09-06 19:07:46 +00:00
Duncan Sands	a098436b32	Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. While having one combined intrinsic is tempting, it is not natural because typically the trampoline initialization needs to be done in one function, and the result of adjust trampoline is needed in a different (nested) function. To get around this llvm-gcc hacks the nested function lowering code to insert an additional parent variable holding the adjust.trampoline result that can be accessed from the child function. Dragonegg doesn't have the luxury of tweaking GCC code, so it stored the result of adjust.trampoline in the memory GCC set aside for the trampoline itself (this is always available in the child function), and set up some new memory (using an alloca) to hold the trampoline. Unfortunately this breaks Go which allocates trampoline memory on the heap and wants to use it even after the parent has exited (!). Rather than doing even more hacks to get Go working, it seemed best to just use two intrinsics like in GCC. Patch mostly by Sanjoy Das. llvm-svn: 139140	2011-09-06 13:37:06 +00:00
Owen Anderson	40d756eacc	Fix a truly heinous bug in DAGCombine related to AssertZext. If we have a chain of zext -> assert_zext -> zext -> use, the first zext would get simplified away because of the later zext, and then the later zext would get simplified away because of the assert. The solution is to teach SimplifyDemandedBits that assert_zext demands all of the high bits of its input, rather than only those demanded by its users. No testcase because the only example I have manifests as llvm-gcc miscompiling LLVM, and I haven't found a smaller case that reproduces this problem. Fixes <rdar://problem/10063365>. llvm-svn: 139059	2011-09-03 00:26:49 +00:00
Jakob Stoklund Olesen	97fe09ad2e	Simplify by using isFullCopy(). llvm-svn: 139019	2011-09-02 18:18:29 +00:00
Duncan Sands	5c04c62765	Darwin wants ctors/dtors to be ordered the other way round to linux. llvm-svn: 139015	2011-09-02 18:07:19 +00:00
Dan Gohman	3767be9aee	Revert r131152, r129796, r129761. This code is currently considered to be unreliable on platforms which require memcpy calls, and it is complicating broader legalize cleanups. It is hoped that these cleanups will make memcpy byval easier to implement in the future. llvm-svn: 138977	2011-09-01 23:07:08 +00:00
Benjamin Kramer	6397051ece	Don't drop alignment info on local common symbols. - On COFF the .lcomm directive has an alignment argument. - On ELF we fall back to .local + .comm Based on a patch by NAKAMURA Takumi. Fixes PR9337, PR9483 and PR10128. llvm-svn: 138976	2011-09-01 23:04:27 +00:00
Jakob Stoklund Olesen	5dc87d0f4d	Permit remat of partial register defs when it is safe. An instruction may define part of a register where the other bits are undefined. In that case, it is safe to rematerialize the instruction. For example: %vreg2:ssub_0<def> = VLDRS <cp#0>, 0, pred:14, pred:%noreg, %vreg2<imp-def> The extra <imp-def> operand indicates that the instruction does not read the other parts of the virtual register, so a remat is safe. This patch simply allows multiple def operands for the virtual register. It is MI->readsVirtualRegister() that determines if we depend on a previous value so remat is impossible. llvm-svn: 138953	2011-09-01 18:27:51 +00:00
Jakob Stoklund Olesen	e417273fce	Revert r138794, "Do not try to rematerialize a value from a partial definition." The problem is fixed for all register allocators by r138944, so this patch is no longer necessary. <rdar://problem/10032939> llvm-svn: 138945	2011-09-01 17:25:18 +00:00
Jakob Stoklund Olesen	6357fa2f06	Prevent remat of partial register redefinitions. An instruction that redefines only part of a larger register can never be rematerialized since the virtual register value depends on the old value in other parts of the register. This was fixed for the inline spiller in r138794. This patch fixes the problem for all register allocators, and includes a small test case. <rdar://problem/10032939> llvm-svn: 138944	2011-09-01 17:18:50 +00:00
Evan Cheng	90da66bb69	Teach MachineLICM reg pressure tracking code to deal with MVT::untyped. Sorry, I can't come up with a small test case. rdar://10043690 llvm-svn: 138934	2011-09-01 01:45:00 +00:00
Andrew Trick	832a6a1909	PreRA scheduler should avoid cloning compares. Added canClobberReachingPhysRegUse() to handle a particular pattern in which a two-address instruction could be forced to interfere with EFLAGS, causing a compare to be unnecessarily cloned. Fixes rdar://problem/5875261 llvm-svn: 138924	2011-09-01 00:54:31 +00:00
David Greene	7df940d660	Fix Size Typing Stores sizes as uint64_t to avoid possible truncation. llvm-svn: 138901	2011-08-31 21:34:20 +00:00
Eli Friedman	ae1acddb95	Misc cleanup; addresses Duncan's comments on r138877. llvm-svn: 138887	2011-08-31 20:13:26 +00:00
Eli Friedman	e839ecb70b	Fill in type legalization for MERGE_VALUES in all the various cases. Patch by Micah Villmow. (No testcase because the issue only showed up in an out-of-tree backend.) llvm-svn: 138877	2011-08-31 18:36:04 +00:00
Eli Friedman	7c3bdede25	Generic expansion for atomic load/store into cmpxchg/atomicrmw xchg; implements 64-bit atomic load/store for ARM. llvm-svn: 138872	2011-08-31 18:26:09 +00:00
David Greene	cdef71f4f3	Compress Repeated Byte Output Emit a repeated sequence of bytes using .zero. This saves an enormous amount of asm file space for certain programs. llvm-svn: 138864	2011-08-31 17:30:56 +00:00
Rafael Espindola	6e31dfea35	Spelling and grammar fixes to problems found by Duncan. llvm-svn: 138858	2011-08-31 16:43:33 +00:00
Rafael Espindola	c21742112b	Emit segmented-stack specific code into function prologues for X86. Modify the pass added in the previous patch to call this new code. This new prologues generated will call a libgcc routine (__morestack) to allocate more stack space from the heap when required Patch by Sanjoy Das. llvm-svn: 138812	2011-08-30 19:39:58 +00:00
Evan Cheng	e6fba77971	Follow up to r138791. Add a instruction flag: hasPostISelHook which tells the pre-RA scheduler to call a target hook to adjust the instruction. For ARM, this is used to adjust instructions which may be setting the 's' flag. ADC, SBC, RSB, and RSC instructions have implicit def of CPSR (required since it now uses CPSR physical register dependency rather than "glue"). If the carry flag is used, then the target hook will fill in the optional operand with CPSR. Otherwise, the hook will remove the CPSR implicit def from the MachineInstr. llvm-svn: 138810	2011-08-30 19:09:48 +00:00
Bob Wilson	358a5f6a72	Do not try to rematerialize a value from a partial definition. I don't currently have a good testcase for this; will try to get one tomorrow. <rdar://problem/10032939> llvm-svn: 138794	2011-08-30 05:36:02 +00:00
Jim Grosbach	ed16ec4248	Thumb2 parsing and encoding for IT blocks. llvm-svn: 138773	2011-08-29 22:24:09 +00:00
Duncan Sands	4d63542b82	Fix PR5329: pay attention to constructor/destructor priority when outputting them. With this, the entire LLVM testsuite passes when built with dragonegg. llvm-svn: 138724	2011-08-28 13:17:22 +00:00
Bill Wendling	4707d37ac9	These splits should be done whether they are critical edges or not. llvm-svn: 138697	2011-08-27 04:40:37 +00:00
Bill Wendling	71fce2c84d	Update the dominator tree with the correct dominator for the new 'unwind' block. llvm-svn: 138664	2011-08-26 21:36:12 +00:00
Bill Wendling	fee8eda35b	Split the landing pad block only if it's a critical edge. Also intelligently split it in the other place where we're splitting critical edges. llvm-svn: 138658	2011-08-26 21:18:55 +00:00
Eli Friedman	452aae6202	Atomic load/store on ARM/Thumb. I don't really like the patterns, but I'm having trouble coming up with a better way to handle them. I plan on making other targets use the same legalization ARM-without-memory-barriers is using... it's not especially efficient, but if anyone cares, it's not that hard to fix for a given target if there's some better lowering. llvm-svn: 138621	2011-08-26 02:59:24 +00:00
Bill Wendling	8ac2041a19	Look at only the terminators of the basic block. Also, if we're using the new EH scheme, return 'true' so that it doesn't try to run the old EH scheme's fixup on the new code. llvm-svn: 138605	2011-08-25 23:48:11 +00:00
Eli Friedman	342e8df0e0	Basic x86 code generation for atomic load and store instructions. llvm-svn: 138478	2011-08-24 20:50:09 +00:00
Evan Cheng	2bb4035707	Move TargetRegistry and TargetSelect from Target to Support where they belong. These are strictly utilities for registering targets and components. llvm-svn: 138450	2011-08-24 18:08:43 +00:00
Jim Grosbach	dee9e8a37c	Tidy up. Trailing whitespace. llvm-svn: 138437	2011-08-24 16:44:17 +00:00
Bill Wendling	f4ee0c0db2	Add the sentinel "no handle" value to the ResumeInst. A value of -1 at a call site tells the personality function that this call isn't handled by the current function. Since the ResumeInsts are converted to calls to _Unwind_SjLj_Resume, add a (volatile) store of -1 to its 'call site'. llvm-svn: 138416	2011-08-24 00:00:23 +00:00
Bill Wendling	2d4f0bea57	Don't replace all uses with the new stuff. This is not necessarily the first or dominating use of the EH values. The IR breaks if it's not. So replace the specific value in the instruction with the new value. llvm-svn: 138406	2011-08-23 22:55:03 +00:00
Bill Wendling	01a325a40e	Look at the end of the entry block for an invoke. The invoke could be at the end of the entry block. If it's the only one, then we won't process all of the landingpad instructions correctly. This code is currently ugly, but should be made much nicer once the new EH switch is thrown. llvm-svn: 138397	2011-08-23 22:20:16 +00:00
Bill Wendling	4eb0433672	A landingpad instruction is neither folded nor dead. llvm-svn: 138387	2011-08-23 21:33:05 +00:00
Evan Cheng	6b477b985b	Fix 80 col violations. llvm-svn: 138356	2011-08-23 19:17:21 +00:00
Bill Wendling	f0d2dfde4f	Split the landing pad's edge. Then for all uses of a landingpad instruction's value, we insert a load of the exception object and selector object from memory, which is where it actually resides. If it's used by a PHI node, we follow that to where it is being used. Eventually, all landingpad instructions should have no uses. Any PHI nodes that were associated with those landingpads should be removed. llvm-svn: 138302	2011-08-22 23:38:40 +00:00
Evan Cheng	6aa2744bed	Follow up to Jim's r138278. This fixes commuteInstruction so it handles two-address instructions correctly. I'll let Jim add a test case. :-) llvm-svn: 138289	2011-08-22 23:04:56 +00:00
Bill Wendling	3aaed0a14c	Some whitespace fixes and #include reordering. llvm-svn: 138256	2011-08-22 18:44:49 +00:00
Nick Lewycky	97f73cb449	Be less redundant. llvm-svn: 138252	2011-08-22 18:26:12 +00:00
Devang Patel	59e27c5f12	Do not use named md nodes to track variables that are completely optimized. This does not scale while doing LTO with debug info. New approach is to include list of variables in the subprogram info directly. llvm-svn: 138145	2011-08-19 23:28:12 +00:00
Benjamin Kramer	68ed46ce9a	Roll back the rest of r126557. It's a hack that will break in some obscure cases. llvm-svn: 138130	2011-08-19 22:39:31 +00:00
Nick Lewycky	c1348074ec	Eli points out that this is what report_fatal_error() is for. llvm-svn: 138091	2011-08-19 21:45:19 +00:00
Nick Lewycky	3f73184d90	This is not actually unreachable, so don't use llvm_unreachable for it. Since the intent seems to be to terminate even in Release builds, just use abort() directly. If program flow ever reaches a __builtin_unreachable (which llvm_unreachable is #define'd to on newer GCCs) then the program is undefined. llvm-svn: 138068	2011-08-19 20:14:27 +00:00
Jakob Stoklund Olesen	6949077f74	Add llc flags to disable machine DCE and CSE. This is useful for unit tests. llvm-svn: 138028	2011-08-19 02:05:35 +00:00
Benjamin Kramer	4938edb02c	Make a bunch of symbols private. llvm-svn: 138025	2011-08-19 01:42:18 +00:00
Jakob Stoklund Olesen	9eb77bf615	Don't treat a partial <def,undef> operand as a read. Normally, a partial register def is treated as reading the super-register unless it also defines the full register like this: %vreg110:sub_32bit<def> = COPY %vreg77:sub_32bit, %vreg110<imp-def> This patch also uses the <undef> flag on partial defs to recognize non-reading operands: %vreg110:sub_32bit<def,undef> = COPY %vreg77:sub_32bit This fixes a subtle bug in RegisterCoalescer where LIS->shrinkToUses would treat a coalesced copy as still reading the register, extending the live range artificially. My test case only works when I disable DCE so a dead copy is left for RegisterCoalescer, so I am not including it. <rdar://problem/9967101> llvm-svn: 138018	2011-08-19 00:30:17 +00:00
Renato Golin	c8d4065781	add the comments of each declaration follow it, making it easier to read and compare to GCC's result. llvm-svn: 138009	2011-08-18 23:43:14 +00:00
Devang Patel	0ecbcbd12c	Eliminate unnecessary forwarding function. llvm-svn: 138006	2011-08-18 23:17:55 +00:00
Devang Patel	a6576a146d	Add new DIE into the map asap. llvm-svn: 137998	2011-08-18 22:21:50 +00:00
Ivan Krasin	d7cbd4c518	FastISel: avoid function calls between the materialization of the constant and its use. llvm-svn: 137993	2011-08-18 22:06:10 +00:00
Bill Wendling	247fd3bf59	Add the support in code-gen for the landingpad instruction lowering. The landingpad instruction is lowered into the EXCEPTIONADDR and EHSELECTION SDNodes. The information from the landingpad instruction is harvested by the 'AddLandingPadInfo' function. The new EH uses the current EH scheme in the back-end. This will change once we switch over to the new scheme. (Reviewed by Jakob!) llvm-svn: 137880	2011-08-17 21:56:44 +00:00
Bill Wendling	a408e5bf31	Revert patch. Forgot a dependent commit. llvm-svn: 137875	2011-08-17 21:28:05 +00:00
Bill Wendling	2a521948f0	Add the body of 'visitLandingPad'. This generates the SDNodes for the new exception handling scheme. It takes the two values coming from the landingpad instruction and assigns them to the EXCEPTIONADDR and EHSELECTION nodes. llvm-svn: 137873	2011-08-17 21:25:14 +00:00
Bill Wendling	1cdd7fdf54	Modify for the new EH scheme. Things are much saner now. We no longer need to modify the landing pads, because of the invariants we impose upon them. The only thing DwarfEHPrepare needs to do is convert the 'resume' instruction into a call to '_Unwind_Resume'. llvm-svn: 137855	2011-08-17 19:48:49 +00:00
Devang Patel	eb1bb4e419	Until now all debug info MDNodes referred to a root MDNode, a compile unit. This simplified handling of these needs in dwarf writer. However, one side effect of this is that during link time optimization all these MDNodes are _not_ uniqued. In other words there will be N number of MDNodes describing "int", "char" and all other types, which would suddenly grow when each object file starts using libraries like STL. MDNodes graph structure such that compiler unit keeps track of important MDNodes and update dwarf writer to process mdnodes top-down instead of bottom up. llvm-svn: 137778	2011-08-16 22:09:43 +00:00
Jim Grosbach	345768c9ff	Remove unused Target argument from AsmParser construction methods. The argument is unused, and is a layering violation in any case. llvm-svn: 137735	2011-08-16 18:33:49 +00:00
Devang Patel	927840458e	Remove unnecessary version check. llvm-svn: 137728	2011-08-16 17:41:41 +00:00
Nadav Rotem	b66b866f46	Revert r137562 because it caused PR10674 llvm-svn: 137719	2011-08-16 14:34:29 +00:00
Devang Patel	07bb9eea33	Refactor. llvm-svn: 137689	2011-08-15 23:47:24 +00:00
Devang Patel	1f4f98d664	Continue to hoist uses of getCompileUnit() up. The goal is to get rid of uses of getCompileUnit(). llvm-svn: 137683	2011-08-15 23:36:40 +00:00
Devang Patel	d2dfc5ec02	This is somewhat déjà-vu, but avoid using getCompileUnit() as much as possible. llvm-svn: 137668	2011-08-15 22:24:32 +00:00
Devang Patel	3acc70e536	Refactor. Variables are part of compile unit so let CompileUnit create new variable. llvm-svn: 137663	2011-08-15 22:04:40 +00:00
Devang Patel	d899444347	There is no need to maintain a set to keep track of variables that use location expressions. In such cases, AT_location attribute's value will be a label. llvm-svn: 137659	2011-08-15 21:43:21 +00:00
Devang Patel	900d97719b	Fix warning. llvm-svn: 137658	2011-08-15 21:35:16 +00:00
Devang Patel	3e4a965519	Simplify. Let DbgVariable keep track of variable's DBG_VALUE machine instruction. llvm-svn: 137656	2011-08-15 21:24:36 +00:00
Devang Patel	99819b527d	Simplify mapping to variable from its abstract variable info. When a variable is inlined multiple places, abstract variable keeps name, location, type etc. info and all other concrete instances of the variable directly refer to abstract variable. llvm-svn: 137637	2011-08-15 19:01:20 +00:00
Devang Patel	d7d80aadd1	Refactor. llvm-svn: 137632	2011-08-15 18:40:16 +00:00
Devang Patel	6e4d2c9fb7	Refactor. llvm-svn: 137631	2011-08-15 18:35:42 +00:00
Devang Patel	dfd6ec3ce1	Refactor. Global variables are part of compile unit so let CompileUnit create new global variable. llvm-svn: 137621	2011-08-15 17:57:41 +00:00
Devang Patel	895437142a	Refactor. A subprogram is part of compile unit so let CompileUnit construct new subprogram. llvm-svn: 137618	2011-08-15 17:24:54 +00:00
Nadav Rotem	6858b344ed	Fix PR 10635. When generating integer constants, the constant element type may be illegal, even if the requested vector type is legal. Testcase is one of the disabled ARM tests in the vector-select patch. llvm-svn: 137562	2011-08-13 20:31:45 +00:00
Bill Wendling	fae1475823	Initial commit of the 'landingpad' instruction. This implements the 'landingpad' instruction. It's used to indicate that a basic block is a landing pad. There are several restrictions on its use (see LangRef.html for more detail). These restrictions allow the exception handling code to gather the information it needs in a much more sane way. This patch has the definition, implementation, C interface, parsing, and bitcode support in it. llvm-svn: 137501	2011-08-12 20:24:12 +00:00
Devang Patel	444034783e	Use ArrayRef. llvm-svn: 137485	2011-08-12 18:10:19 +00:00
Chris Lattner	335d399a0e	switch to use the new api for structtypes. llvm-svn: 137480	2011-08-12 18:06:37 +00:00
Devang Patel	db4374a28a	Provide fast path as Jakob suggested. llvm-svn: 137478	2011-08-12 18:01:34 +00:00
Nadav Rotem	62da15a330	Revert r137310 because it does not optimize any code on ToT llvm-svn: 137466	2011-08-12 17:15:04 +00:00
Duncan Sands	a41634e307	Silence a bunch (but not all) "variable written but not read" warnings when building with assertions disabled. llvm-svn: 137460	2011-08-12 14:54:45 +00:00
Jakob Stoklund Olesen	1f582ba609	Simplify the interference checking code a bit. This is possible now that we no longer provide an interface to iterate the interference overlaps. llvm-svn: 137397	2011-08-12 00:22:04 +00:00
Jakob Stoklund Olesen	da0192d72b	Remove the InterferenceResult class. llvm-svn: 137381	2011-08-11 22:46:06 +00:00
Jakob Stoklund Olesen	cd14efaec2	Eliminate the last use of InterferenceResult. The Query class now holds two iterators instead of an InterferenceResult instance. The iterators are used as bookmarks for repeated collectInterferingVRegs calls. llvm-svn: 137380	2011-08-11 22:46:04 +00:00
Jakob Stoklund Olesen	da4f0eb12c	Remove more dead code. collectInterferingVRegs will be the primary function for interference checks. llvm-svn: 137354	2011-08-11 21:18:34 +00:00
Jakob Stoklund Olesen	7519336752	Privatize an unused part of the LiveIntervalUnion::Query interface. No clients are iterating over interference overlaps. llvm-svn: 137350	2011-08-11 21:00:42 +00:00
Jakob Stoklund Olesen	05ff9d1f6d	Remove some dead code. The InterferenceResult iterator turned out to be less important than we thought it would be. LiveIntervalUnion clients want higher level information, like the list of interfering virtual registers. llvm-svn: 137346	2011-08-11 20:41:41 +00:00
Benjamin Kramer	fa7e6a54b1	Plug a memory leak. llvm-svn: 137321	2011-08-11 18:39:28 +00:00
Nadav Rotem	61140e1028	[AVX] When joining two XMM registers into a YMM register, make sure that the lower XMM register gets in first. This will allow the SUBREG pattern to eliminate the first vector insertion. llvm-svn: 137310	2011-08-11 16:49:36 +00:00
Chris Lattner	96710b4308	fix PR10605 / rdar://9930964 by adding a pretty scary missed check. It's somewhat surprising anything works without this. Before we would compile the testcase into: test: # @test movl $4, 8(%rdi) movl 8(%rdi), %eax orl %esi, %eax cmpl $32, %edx movl %eax, -4(%rsp) # 4-byte Spill je .LBB0_2 now we produce: test: # @test movl 8(%rdi), %eax movl $4, 8(%rdi) orl %esi, %eax cmpl $32, %edx movl %eax, -4(%rsp) # 4-byte Spill je .LBB0_2 llvm-svn: 137303	2011-08-11 06:26:54 +00:00
Devang Patel	784077eb57	Stay within 80 columns. llvm-svn: 137283	2011-08-10 23:58:09 +00:00
Devang Patel	bb23a4a9a5	Distinguish between two copies of one inlined variable. Take 2. llvm-svn: 137253	2011-08-10 21:50:54 +00:00
Devang Patel	37a62058fe	While extending definition range of a debug variable, consult lexical scopes also. There is no point extending debug variable out side its lexical block. This provides 6x compile time speedup in some cases. llvm-svn: 137250	2011-08-10 21:25:34 +00:00
Devang Patel	e30746c844	Revert unintentional parts of previous check-in. llvm-svn: 137249	2011-08-10 21:16:49 +00:00
Devang Patel	7e62302fae	Start using LexicalScopes utility. No intentional functionality change. llvm-svn: 137246	2011-08-10 20:55:27 +00:00
Devang Patel	e1649c31cb	Provide utility to extract and use lexical scoping information from machine instructions. llvm-svn: 137237	2011-08-10 19:04:06 +00:00
Jakob Stoklund Olesen	b91e489923	Trim an unneeded header. llvm-svn: 137184	2011-08-09 23:49:21 +00:00
Jakob Stoklund Olesen	53910d6aae	Inflate register classes after coalescing. Coalescing can remove copy-like instructions with sub-register operands that constrained the register class. Examples are: x86: GR32_ABCD:sub_8bit_hi -> GR32 arm: DPR_VFP2:ssub0 -> DPR Recompute the register class of any virtual registers that are used by less instructions after coalescing. This affects code generation for the Cortex-A8 where we use NEON instructions for f32 operations, c.f. fp_convert.ll: vadd.f32 d16, d1, d0 vcvt.s32.f32 d0, d16 The register allocator is now free to use d16 for the temporary, and that comes first in the allocation order because it doesn't interfere with any s-registers. llvm-svn: 137133	2011-08-09 18:19:41 +00:00
Jakob Stoklund Olesen	da96006975	Move CalculateRegClass to MRI::recomputeRegClass. This function doesn't have anything to do with spill weights, and MRI already has functions for manipulating the register class of a virtual register. llvm-svn: 137123	2011-08-09 16:46:27 +00:00
Devang Patel	6c1ed31b3b	Print variable's inline location in debug output. llvm-svn: 137096	2011-08-09 01:03:35 +00:00
Jakob Stoklund Olesen	e7dddfd7f6	Rename member variables to follow coding standards. No functional change. llvm-svn: 137094	2011-08-09 01:01:27 +00:00
Jakob Stoklund Olesen	e1f5313bc7	Move the RegisterCoalescer private to its implementation file. RegisterCoalescer.h still has the CoalescerPair class interface. llvm-svn: 137088	2011-08-09 00:43:37 +00:00
Jakob Stoklund Olesen	4c9a2fb044	Refer to the RegisterCoalescer pass by ID. A public interface is no longer needed since RegisterCoalescer is not an analysis any more. llvm-svn: 137082	2011-08-09 00:29:53 +00:00
Jakob Stoklund Olesen	daa2cad723	Hoist hasLoadFromStackSlot and hasStoreToStackSlot. These methods are target-independent since they simply scan the memory operands. They can live in TargetInstrInfoImpl. llvm-svn: 137063	2011-08-08 20:53:24 +00:00
Devang Patel	fee7cedbc9	Simplify by creating parent first. llvm-svn: 137056	2011-08-08 18:22:10 +00:00
Jakob Stoklund Olesen	22f37a1eb1	Fix typo. Thanks, Andy! llvm-svn: 137023	2011-08-06 18:20:24 +00:00
Jakob Stoklund Olesen	d4bb1d43e8	Reject RS_Spill ranges from local splitting as well. All new local ranges are marked as RS_New now, so there is no need to attempt splitting of RS_Spill ranges any more. llvm-svn: 137002	2011-08-05 23:50:33 +00:00
Jakob Stoklund Olesen	02cf10bdfd	Only mark remainder intervals as RS_Spill after per-block splitting. The local ranges created get to stay in the RS_New stage, just like for local and region splitting. This gives tryLocalSplit a bit more freedom the first time it sees one of these new local ranges. llvm-svn: 137001	2011-08-05 23:50:31 +00:00
Jakob Stoklund Olesen	0de95ef7f5	Remember to update LiveDebugVariables after per-block splitting. llvm-svn: 136996	2011-08-05 23:10:40 +00:00
Jakob Stoklund Olesen	cef5d8ff77	Extract per-block splitting into its own method. No functional change. llvm-svn: 136994	2011-08-05 23:04:18 +00:00
Jakob Stoklund Olesen	cdf9ad9107	Delete getMultiUseBlocks and splitSingleBlocks. These functions are no longer used, and they are easily replaced with a loop calling shouldSplitSingleBlock and splitSingleBlock. llvm-svn: 136993	2011-08-05 22:52:17 +00:00
Jakob Stoklund Olesen	58995bc551	Also use shouldSplitSingleBlock() in the fallback splitting mode. Drop the use of SplitAnalysis::getMultiUseBlocks, there is no need to go through a SmallPtrSet any more. llvm-svn: 136992	2011-08-05 22:43:23 +00:00
Jakob Stoklund Olesen	8627ea91cb	Split around single instructions to enable register class inflation. Normally, we don't create a live range for a single instruction in a basic block, the spiller does that anyway. However, when splitting a live range that belongs to a proper register sub-class, inserting these extra COPY instructions completely remove the constraints from the remainder interval, and it may be allocated from the larger super-class. The spiller will mop up these small live ranges if we end up spilling anyway. It calls them snippets. llvm-svn: 136989	2011-08-05 22:20:45 +00:00
Jakob Stoklund Olesen	5122467b38	Detect proper register sub-classes. Some instructions require restricted register classes, but most of the time that doesn't affect register allocation. For example, some instructions don't work with the stack pointer, but that is a reserved register anyway. Sometimes it matters, GR32_ABCD only has 4 allocatable registers. For such a proper sub-class, the register allocator should try to enable register class inflation since that makes more registers available for allocation. Make sure only legal super-classes are considered. For example, tGPR is not a proper sub-class in Thumb mode, but in ARM mode it is. llvm-svn: 136981	2011-08-05 21:28:14 +00:00
Jakob Stoklund Olesen	d633abebf6	Fix liveness computations in BranchFolding. The old code would look at kills and defs in one pass over the instruction operands, causing problems with this code: %R0<def>, %CPSR<def,dead> = tLSLri %R5<kill>, 2, pred:14, pred:%noreg %R0<def>, %CPSR<def,dead> = tADDrr %R4<kill>, %R0<kill>, pred:14, %pred:%noreg The last instruction kills and redefines %R0, so it is still live after the instruction. This caused a register scavenger crash when compiling 483.xalancbmk for armv6. I am not including a test case because it requires too much bad luck to expose this old bug. First you need to convince the register allocator to use %R0 twice on the tADDrr instruction, then you have to convince BranchFolding to do something that causes it to run the register scavenger on the bad block. <rdar://problem/9898200> llvm-svn: 136973	2011-08-05 18:47:07 +00:00
Chandler Carruth	81b7e11c89	Temporarily revert r135528 which distinguishes between two copies of one inlined variable, based on the discussion in PR10542. This explodes the runtime of several passes down the pipeline due to a large number of "copies" remaining live across a large function. This only shows up with both debug and opt, but when it does it creates a many-minute compile when self-hosting LLVM+Clang. There are several other cases that show these types of regressions. All of this is tracked in PR10542, and progress is being made on fixing the issue. Once its addressed, the re-instated, but until then this restores the performance for self-hosting and other opt+debug builds. Devang, let me know if this causes any trouble, or impedes fixing it in any way, and thanks for working on this! llvm-svn: 136953	2011-08-05 00:51:31 +00:00
Jakob Stoklund Olesen	63e3dec9ad	Count the total amount of stack space used in compiled functions. Patch by Ivan Krasin! llvm-svn: 136921	2011-08-04 21:06:09 +00:00
Devang Patel	d61b1d505c	Print DBG_VALUE variable's location info as a comment. llvm-svn: 136916	2011-08-04 20:44:26 +00:00
Devang Patel	eabc3cea33	Increment counter inside insertDebugValue(). llvm-svn: 136915	2011-08-04 20:42:11 +00:00
Devang Patel	b456866b7b	Add counter. llvm-svn: 136901	2011-08-04 18:45:38 +00:00
Jakob Stoklund Olesen	2539af600a	Correctly handle multiple DBG_VALUE instructions at the same SlotIndex. It is possible to have multiple DBG_VALUEs for the same variable: 32L TEST32rr %vreg0<kill>, %vreg0, %EFLAGS<imp-def>; GR32:%vreg0 DBG_VALUE 2, 0, !"i" DBG_VALUE %noreg, %0, !"i" When that happens, keep the last one instead of the first. llvm-svn: 136842	2011-08-03 23:44:31 +00:00
Jakob Stoklund Olesen	11b788d5be	Enable compact region splitting by default. This helps generate better code in functions with high register pressure. The previous version of compact region splitting caused regressions because the regions were a bit too large. A stronger negative bias applied in r136832 fixed this problem. llvm-svn: 136836	2011-08-03 23:16:09 +00:00
Devang Patel	aab841cf63	Do not drop undef debug values. These are used as range termination marker by live debug variable pass. llvm-svn: 136834	2011-08-03 23:13:55 +00:00
Jakob Stoklund Olesen	869545203b	Be more conservative when forming compact regions. Apply twice the negative bias on transparent blocks when computing the compact regions. This excludes loop backedges from the region when only one of the loop blocks uses the register. Previously, we would include the backedge in the region if the loop preheader and the loop latch both used the register, but the loop header didn't. When both the header and latch blocks use the register, we still keep it live on the backedge. llvm-svn: 136832	2011-08-03 23:09:38 +00:00
Chandler Carruth	77eb5a0a37	Fix some warnings from Clang in release builds: lib/CodeGen/RegAllocGreedy.cpp:1176:18: warning: unused variable 'B' [-Wunused-variable] if (unsigned B = Cand.getBundles(BundleCand, BestCand)) { ^ lib/CodeGen/RegAllocGreedy.cpp:1188:18: warning: unused variable 'B' [-Wunused-variable] if (unsigned B = Cand.getBundles(BundleCand, 0)) { ^ llvm-svn: 136831	2011-08-03 23:07:27 +00:00
Jakub Staszak	3ef20e35f9	Fix typo in #include which revealed in the case-sensitive filesystem. llvm-svn: 136828	2011-08-03 22:53:41 +00:00
Jakub Staszak	15e5b742ad	Use MachineBranchProbabilityInfo in If-Conversion instead of its own heuristics. llvm-svn: 136826	2011-08-03 22:34:43 +00:00


12775 Commits