llvm-project

Commit Graph

Author	SHA1	Message	Date
Duncan Sands	bc42c906bb	Reapply commit 112702 which was speculatively reverted by echristo. Original commit message: Use the SSAUpdater to turn calls to eh.exception that are not in a landing pad into uses of registers rather than loads from a stack slot. Doesn't touch the 'orrible hack code - Bill needs to persuade me harder :) llvm-svn: 112952	2010-09-03 08:31:48 +00:00
Devang Patel	854ad26ae2	There is no need to use .set here. Thanks Chris! llvm-svn: 112900	2010-09-02 23:01:10 +00:00
Devang Patel	3bffd52d78	Detect undef value early and save unnecessary NodeMap query. llvm-svn: 112864	2010-09-02 21:29:42 +00:00
Dan Gohman	3c9b5f394b	Don't narrow the load and store in a load+twiddle+store sequence unless there are clearly no stores between the load and the store. This fixes this miscompile reported as PR7833. This breaks the test/CodeGen/X86/narrow_op-2.ll optimization, which is safe, but awkward to prove safe. Move it to X86's README.txt. llvm-svn: 112861	2010-09-02 21:18:42 +00:00
Devang Patel	98d3edfe2a	Tidy up. llvm-svn: 112858	2010-09-02 21:02:27 +00:00
Jim Grosbach	35f3252036	The scavenger should just use getAllocatableSet() rather than reinventing it locally. llvm-svn: 112845	2010-09-02 18:29:04 +00:00
Jim Grosbach	944aece38a	Anti-dependency breaking needs to be careful not to use reserved regs llvm-svn: 112832	2010-09-02 17:12:55 +00:00
Devang Patel	da3ef85460	Fix .debug_range for linux. Patch by Krister Wombell. llvm-svn: 112830	2010-09-02 16:43:44 +00:00
Lang Hames	9a6f8ee32c	Added support for register allocators to record which intervals are spill intervals, and where the uses and defs of the original intervals were in the original code. Spill intervals can be hidden using the "-rmf-intervals=virt-nospills*" option. llvm-svn: 112811	2010-09-02 08:27:00 +00:00
Chandler Carruth	d30f8ec11e	Silence an ambiguous else warning from GCC. llvm-svn: 112809	2010-09-02 07:08:05 +00:00
Lang Hames	b59620f519	Added counters for PBQP reduction rules. llvm-svn: 112807	2010-09-02 05:37:52 +00:00
Jim Grosbach	64df92a9b2	Add a bit of debug output for register scavenging llvm-svn: 112787	2010-09-02 00:51:37 +00:00
Jim Grosbach	63a8eaf559	Tweak to ignoring reserved regs. The allocator was occasionally still looking at them since they'd end up in the register weights list. Tell it to stop doing that. llvm-svn: 112756	2010-09-01 22:48:34 +00:00
Jakob Stoklund Olesen	4b6fd48bba	Teach RemoveCopyByCommutingDef to check all aliases, not just subregisters. This caused a miscompilation in WebKit where %RAX had conflicting defs when RemoveCopyByCommutingDef was commuting a %EAX use. llvm-svn: 112751	2010-09-01 22:15:35 +00:00
Jim Grosbach	d5e72a1e84	tidy up trailing whitespace and an 80 column violation. llvm-svn: 112746	2010-09-01 21:48:06 +00:00
Jim Grosbach	9dce31438d	cleanup per feedback. use a helper function for getting the first non-reserved physical register in a register class. Make sure to assert if the register class is empty. llvm-svn: 112743	2010-09-01 21:34:41 +00:00
Jim Grosbach	b070ddf6b4	The register allocator shouldn't consider allocating reserved registers. PBQP version. llvm-svn: 112742	2010-09-01 21:23:03 +00:00
Jim Grosbach	5ccf18c2fc	The register allocator shouldn't consider allocating reserved registers. r112728 did this for fast regalloc. llvm-svn: 112741	2010-09-01 21:04:27 +00:00
Jim Grosbach	df6b67bf85	The register allocator shouldn't consider allocating reserved registers. llvm-svn: 112728	2010-09-01 19:28:41 +00:00
Jim Grosbach	cb2e56fa82	tidy up a few 80-column and trailing whitespace bits. llvm-svn: 112726	2010-09-01 19:16:29 +00:00
Eric Christopher	a5d315c665	Speculatively revert 112699 and 112702, they seem to be causing self host errors on clang-x86-64. llvm-svn: 112719	2010-09-01 17:29:10 +00:00
Duncan Sands	4d51e3fd17	Use the SSAUpdater to turn calls to eh.exception that are not in a landing pad into uses of registers rather than loads from a stack slot. Doesn't touch the 'orrible hack code - Bill needs to persuade me harder :) llvm-svn: 112702	2010-09-01 14:07:47 +00:00
Devang Patel	ea63639da5	Use absolute label for DW_AT_stmt_list if a target does not prefer offset here. This patch was developed on top of original patch by Artur Pietrek. llvm-svn: 112678	2010-08-31 23:50:19 +00:00
Devang Patel	86ec8b3a3f	Reapply r112623. Included additional check for unused byval argument. llvm-svn: 112659	2010-08-31 22:22:42 +00:00
Jakob Stoklund Olesen	7993dae7bd	Track liveness of unallocatable, unreserved registers in machine DCE. Reserved registers are unpredictable, and are treated as always live by machine DCE. Allocatable registers are never reserved, and can be used for virtual registers. Unreserved, unallocatable registers can not be used for virtual registers, but otherwise behave like a normal allocatable register. Most targets only have the flag register in this set. llvm-svn: 112649	2010-08-31 21:51:05 +00:00
Jakob Stoklund Olesen	2c325dc907	Ignore unallocatable registers in RegAllocFast. llvm-svn: 112632	2010-08-31 19:54:25 +00:00
Devang Patel	529f248eb4	Revert r112623. It is causing self host build failures. llvm-svn: 112631	2010-08-31 19:41:03 +00:00
Devang Patel	8559932d36	Remember byval argument's frame index during argument lowering and use this info to emit debug info. Fixes Radar 8367011. llvm-svn: 112623	2010-08-31 18:50:09 +00:00
Jim Grosbach	365e931f7b	Improve virtual frame base register allocation heuristics. 1. Allocate them in the entry block of the function to enable function-wide re-use. The instructions to create them should be re-materializable, so there shouldn't be additional cost compared to creating them local to the basic blocks where they are used. 2. Collect all of the frame index references for the function and sort them by the local offset referenced. Iterate over the sorted list to allocate the virtual base registers. This enables creation of base registers optimized for positive-offset access of frame references. (Note: This may be appropriate to later be a target hook to do the sorting in a target appropriate manner. For now it's done here for simplicity.) llvm-svn: 112609	2010-08-31 17:58:19 +00:00
Duncan Sands	bb8a3f9f6d	Stop using the dom frontier in DwarfEHPrepare by not promoting alloca's any more. I plan to reimplement alloca promotion using SSAUpdater later. It looks like Bill's URoR logic really always needs domtree, so the pass now always asks for domtree info. llvm-svn: 112597	2010-08-31 09:05:06 +00:00
Devang Patel	417d72823a	Offset is not always unsigned number. llvm-svn: 112584	2010-08-31 06:12:08 +00:00
Devang Patel	2cfc3af181	Simplify. llvm-svn: 112583	2010-08-31 06:11:28 +00:00
Bruno Cardoso Lopes	d9ef4a1a24	zap unused method. x86 is the only user and already has a more powerful version llvm-svn: 112571	2010-08-31 02:36:20 +00:00
Jakob Stoklund Olesen	9c39690edf	Add experimental -disable-physical-join command line option. Eventually, we want to disable physreg coalescing completely, and let the register allocator do its job using hints. This option makes it possible to measure the impact of disabling physreg coalescing. llvm-svn: 112567	2010-08-31 01:27:49 +00:00
Chris Lattner	34bfab0ad5	two changes: 1) nuke ConstDataCoalSection, which is dead. 2) revise my previous patch for rdar://8018335, which was completely wrong. Specifically, it doesn't make sense to mark __TEXT,__const_coal as PURE_INSTRUCTIONS, because it is for readonly data. templates (it turns out) go to const_coal_nt. The real fix for rdar://8018335 was to give ConstTextCoalSection a section kind of ReadOnly instead of Text. llvm-svn: 112496	2010-08-30 18:12:35 +00:00
Bill Wendling	f824489a1d	Revert r112461. It was failing on PPC... llvm-svn: 112463	2010-08-30 04:36:50 +00:00
Bill Wendling	938f299fa9	When adding a register, we should mark it as "def" if it can optionally define said (physical) register. llvm-svn: 112461	2010-08-30 01:36:05 +00:00
Chris Lattner	ea05bf2259	revert 112457, it looks like it broke selfhost. llvm-svn: 112459	2010-08-29 22:28:18 +00:00
Chris Lattner	c843fca2fd	rewrite DwarfEHPrepare to use SSAUpdater to promote its allocas instead of PromoteMemToReg. This allows it to stop using DF and DT, eliminating a computation of DT and DF from clang -O3. Clang is now down to 2 runs of DomFrontier. llvm-svn: 112457	2010-08-29 19:54:28 +00:00
Chris Lattner	d94a7c3dc1	inline function into its only caller. llvm-svn: 112455	2010-08-29 19:28:28 +00:00
Chris Lattner	13ee795c42	remove unions from LLVM IR. They are severely buggy and not being actively maintained, improved, or extended. llvm-svn: 112356	2010-08-28 04:09:24 +00:00
Chris Lattner	a5217a19a4	remove dead proto llvm-svn: 112354	2010-08-28 03:45:03 +00:00
Dan Gohman	e06905d1f0	Completely disable tail calls when fast-isel is enabled, as fast-isel doesn't currently support dealing with this. llvm-svn: 112341	2010-08-28 00:51:03 +00:00
Dan Gohman	1e06dbf881	Trim a #include. llvm-svn: 112340	2010-08-28 00:49:13 +00:00
Devang Patel	f2855b147f	Simplify. llvm-svn: 112305	2010-08-27 22:25:51 +00:00
Bill Wendling	6628431a91	Remove now unneeded command line flag that enables 'optimize compares.' llvm-svn: 112287	2010-08-27 20:39:09 +00:00
Devang Patel	b12ff5999e	Revert r112213. It is not needed. llvm-svn: 112242	2010-08-26 23:35:15 +00:00
Jim Grosbach	6a77066913	Simplify eliminateFrameIndex() interface back down now that PEI doesn't need to try to re-use scavenged frame index reference registers. rdar://8277890 llvm-svn: 112241	2010-08-26 23:32:16 +00:00
Devang Patel	ea134f56b1	If node is not available then use FuncInfo.ValueMap to emit debug info for byval parameter. llvm-svn: 112238	2010-08-26 22:53:27 +00:00
Jim Grosbach	2a1915d04b	Remove the now obsolete frame index virtual re-use algorithm from PEI. Pre-RA virtual base registers handle this function, and more. A bit more cleanup to do on the interface to eliminateFrameIndex() after this. llvm-svn: 112237	2010-08-26 22:42:12 +00:00
Devang Patel	42b4ac7ed3	Speculatively revert r112207. llvm-svn: 112216	2010-08-26 20:33:42 +00:00
Devang Patel	977057f481	80 col. llvm-svn: 112215	2010-08-26 20:32:32 +00:00
Devang Patel	384fa91deb	Update DanglingDebugInfo so that it can be used to track llvm.dbg.declare also. llvm-svn: 112213	2010-08-26 20:06:46 +00:00
Devang Patel	ab596a637c	Do not forget to resolve dangling debug info in a case where virtual register, used for a value, is initialized after a dbg intrinsic is seen. llvm-svn: 112207	2010-08-26 18:36:14 +00:00
Chris Lattner	af23e9a798	Add a hackaround for PR7993 which is causing failures on x86 builders that lack sse2. llvm-svn: 112175	2010-08-26 06:57:07 +00:00
Chris Lattner	eb2cc0ce0e	implement SplitVecOp_CONCAT_VECTORS, fixing the included testcase with SSE1. llvm-svn: 112171	2010-08-26 05:51:22 +00:00
Chris Lattner	f6418b804e	zap dead code. llvm-svn: 112155	2010-08-26 02:57:35 +00:00
Chris Lattner	8df99b523e	remove some llvmcontext arguments that are now dead post-refactoring. llvm-svn: 112104	2010-08-25 23:00:45 +00:00
Chris Lattner	75ff053497	Change handling of illegal vector types to widen when possible instead of expanding: e.g. <2 x float> -> <4 x float> instead of -> 2 floats. This affects two places in the code: handling cross block values and handling function return and arguments. Since vectors are already widened by legalizetypes, this gives us much better code and unblocks x86-64 abi and SPU abi work. For example, this (which is a silly example of a cross-block value): define <4 x float> @test2(<4 x float> %A) nounwind { %B = shufflevector <4 x float> %A, <4 x float> undef, <2 x i32> <i32 0, i32 1> %C = fadd <2 x float> %B, %B br label %BB BB: %D = fadd <2 x float> %C, %C %E = shufflevector <2 x float> %D, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef> ret <4 x float> %E } Now compiles into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 addps %xmm0, %xmm0 ret previously it compiled into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 pshufd $1, %xmm0, %xmm1 ## kill: XMM0<def> XMM0<kill> XMM0<def> insertps $0, %xmm0, %xmm0 insertps $16, %xmm1, %xmm0 addps %xmm0, %xmm0 ret This implements rdar://8230384 llvm-svn: 112101	2010-08-25 22:49:25 +00:00
Devang Patel	32a72ab072	Fix comment. llvm-svn: 112086	2010-08-25 20:41:24 +00:00
Devang Patel	3f53d6e56a	Remove dead argument. llvm-svn: 112085	2010-08-25 20:39:26 +00:00
Jim Grosbach	7c1b421ae6	Add some statistics for PEI register scavenging llvm-svn: 112084	2010-08-25 20:34:28 +00:00
Chris Lattner	05bcb488b5	split the vector case of getCopyFromParts out to its own function, no functionality change. llvm-svn: 111994	2010-08-24 23:20:40 +00:00
Chris Lattner	96a77ebd7c	split the vector case out of getCopyToParts into its own function. No functionality change. llvm-svn: 111990	2010-08-24 23:10:06 +00:00
Chris Lattner	5b8967f8a2	tidy up, reduce indentation llvm-svn: 111982	2010-08-24 22:43:11 +00:00
Jim Grosbach	2eedb7949e	Add ARM heuristic for when to allocate a virtual base register for stack access. rdar://8277890&7352504 llvm-svn: 111968	2010-08-24 21:19:33 +00:00
Jim Grosbach	b77d67f318	Move enabling the local stack allocation pass into the target where it belongs. For now it's still a command line option, but the interface to the generic code doesn't need to know that. llvm-svn: 111942	2010-08-24 19:05:43 +00:00
Devang Patel	4a213870db	Revert r107202. It is not adding any value. llvm-svn: 111870	2010-08-24 00:06:12 +00:00
Jim Grosbach	616bc356e9	Remove the MFI storage of the local allocation block size. It's not needed. llvm-svn: 111847	2010-08-23 21:29:29 +00:00
Jim Grosbach	754f8e600e	Better handling of local offsets for downwards growing stacks. This corrects relative offsets when there are offsets encoded in the instructions and simplifies final allocation in PEI. rdar://8277890 llvm-svn: 111836	2010-08-23 20:40:38 +00:00
Devang Patel	a8652674e0	Handle qualified constants that are directly folded by FE. PR 7920. llvm-svn: 111820	2010-08-23 18:25:56 +00:00
Owen Anderson	d31d82d75c	Now that PassInfo and Pass::ID have been separated, move the rest of the passes over to the new registration API. llvm-svn: 111815	2010-08-23 17:52:01 +00:00
Chandler Carruth	191c4f73b2	Fix some GCC warnings by providing a virtual destructor in the base of a class hierarchy with virtual methods and using llvm_unreachable to properly indicate unreachable states which would otherwise leave variables uninitialized. llvm-svn: 111803	2010-08-23 08:25:07 +00:00
Eli Friedman	ac305d2024	Delete dead comment. llvm-svn: 111744	2010-08-21 20:19:51 +00:00
Bill Wendling	578ee4070c	Create the new linker type "linker_private_weak_def_auto". It's similar to "linker_private_weak", but it's known that the address of the object is not taken. For instance, functions that had an inline definition, but the compiler decided not to inline it. Note, unlike linker_private and linker_private_weak, linker_private_weak_def_auto may have only default visibility. The symbols are removed by the linker from the final linked image (executable or dynamic library). llvm-svn: 111684	2010-08-20 22:05:50 +00:00
Jim Grosbach	7648a21152	Downwards growing stack allocation order reverses relative offsets llvm-svn: 111673	2010-08-20 20:25:31 +00:00
Jim Grosbach	7110941d68	Add more dbg output llvm-svn: 111670	2010-08-20 19:04:43 +00:00
Jim Grosbach	0600691fe6	properly check for whether base regs were inserted llvm-svn: 111646	2010-08-20 16:48:30 +00:00
Bob Wilson	c56fef4eac	If the target says that an extending load is not legal, regardless of whether it involves specific floating-point types, legalize should expand an extending load to a non-extending load followed by a separate extend operation. For example, we currently expand SEXTLOAD to EXTLOAD+SIGN_EXTEND_INREG (and assert that EXTLOAD should always be supported). Now we can expand that to LOAD+SIGN_EXTEND. This is needed to allow vector SIGN_EXTEND and ZERO_EXTEND to be used for NEON. llvm-svn: 111586	2010-08-19 23:52:39 +00:00
Jim Grosbach	56e56323c8	Better handling of offsets on frame index references. rdar://8277890 llvm-svn: 111585	2010-08-19 23:52:25 +00:00
Evan Cheng	e5af930156	Update debug logs. llvm-svn: 111575	2010-08-19 23:33:02 +00:00
Evan Cheng	63a868457b	Properly update MachineDominators when splitting critical edge. llvm-svn: 111574	2010-08-19 23:32:47 +00:00
Bill Wendling	68caaaf282	Correct header. llvm-svn: 111540	2010-08-19 18:52:17 +00:00
Evan Cheng	361b9be7c6	It's possible to sink a def if its local uses are PHI's. llvm-svn: 111537	2010-08-19 18:33:29 +00:00
Michael J. Spencer	abca173494	Fix the msvc 2010 build. The Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 16.00.30319.01 implements parts of C++0x based on the draft standard. An old version of the draft had a bug that makes std::pair<T1, T2>(something, 0) fail to compile. This is because the template<class U, class V> pair(U&& x, V&& y) constructor is selected, even though it later fails to implicitly convert U and V to first_type and second_type. This has been fixed in n3090, but it seems that Microsoft is not going to update msvc. llvm-svn: 111535	2010-08-19 18:16:39 +00:00
Evan Cheng	681d0c25f9	Remove disabled assertion. llvm-svn: 111531	2010-08-19 17:33:48 +00:00
Evan Cheng	ae9939c839	Teach machine-sink to break critical edges when appropriate. Work in progress. llvm-svn: 111530	2010-08-19 17:33:11 +00:00
Jim Grosbach	743d7c80e4	Update local stack block allocation to let PEI do the allocs if no additional base registers were required. This will allow for slightly better packing of the locals when alignment padding is necessary after callee saved registers. llvm-svn: 111508	2010-08-19 02:47:08 +00:00
Jim Grosbach	3ac059369b	Add a newline to debug output llvm-svn: 111453	2010-08-18 23:14:02 +00:00
Evan Cheng	25b6068b8f	If any def of a machine-sink candidate has local uses, it's obviously not safe to sink it to a successor block. This bug has been hidden because a later check for critical edges disables these illegal optimizations. This patch should significantly reduce the amount of time spent on checking dominator information for obviously unsafe sinking. llvm-svn: 111450	2010-08-18 23:09:25 +00:00
Jim Grosbach	dbfc2ce95d	Enable ARM base register reuse to local stack slot allocation. Whenever a new frame index reference to an object in the local block is seen, check if it's near enough to any previously allocated base register to re-use. rdar://8277890 llvm-svn: 111443	2010-08-18 22:44:49 +00:00
Jakob Stoklund Olesen	e98030ad58	Thinking about it, we don't need MachineDominatorTree after all. The DomValue map discovers the iterated dominance frontier for free. llvm-svn: 111400	2010-08-18 20:29:53 +00:00
Jakob Stoklund Olesen	f4088b022a	Revert r111394. It was too aggressive. We must complete the DFS, otherwise we might miss needed phi-defs, and prematurely color live ranges with a non-dominating value. This is not a big deal since we get to color more of the CFG and the next mapValue call will be faster. llvm-svn: 111397	2010-08-18 20:06:05 +00:00
Jakob Stoklund Olesen	5b4cb08471	Aggressively prune the DFS when inserting phi-defs. llvm-svn: 111394	2010-08-18 19:00:11 +00:00
Jakob Stoklund Olesen	ce6f055b4d	Add the LiveIntervalMap class. Don't hook it up yet. LiveIntervalMap maps values from a parent LiveInterval to a child interval that is a strict subset. It will create phi-def values as needed to preserve the VNInfo SSA form in the child interval. This leads to an algorithm very similar to the one in SSAUpdaterImpl.h, but with enough differences that the code can't be reused: - We don't need to manipulate PHI instructions. - LiveIntervals have kills. - We have MachineDominatorTree. - We can use df_iterator. llvm-svn: 111393	2010-08-18 19:00:08 +00:00
Bill Wendling	0d323aef46	Improve whitespace. llvm-svn: 111384	2010-08-18 18:41:13 +00:00
Jim Grosbach	e0e9b3013f	Add hook for re-using virtual base registers for local stack slot access. Nothing fancy, just ask the target if any currently available base reg is in range for the instruction under consideration and use the first one that is. Placeholder ARM implementation simply returns false for now. ongoing saga of rdar://8277890 llvm-svn: 111374	2010-08-18 17:57:37 +00:00
Jakob Stoklund Olesen	952a621d93	Preserve subregs on PHI source operands. Patch by Krister Wombell! llvm-svn: 111366	2010-08-18 16:09:47 +00:00
Jim Grosbach	3cf08661f4	Add materialization of virtual base registers for frame indices allocated into the local block. Resolve references to those indices to a new base register. For simplification and testing purposes, a new virtual base register is allocated for each frame index being resolved. The result is truly horrible, but correct, code that's good for exercising the new code paths. Next up is adding thumb1 support, which should be very simple. Following that will be adding base register re-use and implementing a reasonable ARM heuristic for when a virtual base register should be generated at all. llvm-svn: 111315	2010-08-17 22:41:55 +00:00
Dale Johannesen	16f96445c3	Make fast scheduler handle asm clobbers correctly. PR 7882. Follows suggestion by Amaury Pouly, thanks. llvm-svn: 111306	2010-08-17 22:17:24 +00:00
Evan Cheng	16bfe5b0f5	PHI elimination shouldn't require machineloopinfo since it's used at -O0. Move the requirement to LiveIntervalAnalysis instead. Note this does not change the number of times machineloopinfo is computed. llvm-svn: 111285	2010-08-17 21:00:37 +00:00
Evan Cheng	e0db9d01d9	Machine CSE preserves CFG. Pass manager was freeing machineloopinfo after machine cse before. llvm-svn: 111281	2010-08-17 20:57:42 +00:00
Jim Grosbach	1a58ce7646	silence warning llvm-svn: 111274	2010-08-17 20:21:30 +00:00
Jim Grosbach	c252ee2375	Add hook to examine an instruction referencing a frame index to determine whether to allocate a virtual frame base register to resolve the frame index reference in it. Implement a simple version for ARM to aid debugging. In LocalStackSlotAllocation, scan the function for frame index references to local frame indices and ask the target whether to allocate virtual frame base registers for any it encounters. Purely infrastructural for debug output. Next step is to actually allocate base registers, then add intelligent re-use of them. rdar://8277890 llvm-svn: 111262	2010-08-17 18:13:53 +00:00
Evan Cheng	647c559172	Move the decision logic whether it's a good idea to split a critical edge to clients. Also fixed an erroneous check. An edge is only a back edge when the from and to blocks are in the same loop. llvm-svn: 111256	2010-08-17 17:43:50 +00:00
Evan Cheng	a6848249ee	Fix debug message. llvm-svn: 111250	2010-08-17 17:15:14 +00:00
Eric Christopher	541f8012d9	Fix typo. llvm-svn: 111223	2010-08-17 01:30:33 +00:00
Evan Cheng	f259efde47	PHI elimination should not break back edge. It can cause some significant code placement issues. rdar://8263994 good: LBB0_2: mov r2, r0 . . . mov r1, r2 bne LBB0_2 bad: LBB0_2: mov r2, r0 . . . @ BB#3: mov r1, r2 b LBB0_2 llvm-svn: 111221	2010-08-17 01:20:36 +00:00
Jim Grosbach	a7c562d664	tidy up. remove unused local. llvm-svn: 111206	2010-08-16 23:26:09 +00:00
Jim Grosbach	36d5ec383e	Better handle alignment requirements for local objects in pre-regalloc frame mapping. Have the local block track its alignment requirement, and then apply that when the block itself is allocated. Previously, offsets could get adjusted in PEI to be different, relative to one another, than the block allocation thought they would be, which defeats the point of doing the allocation this way. Continuing rdar://8277890 llvm-svn: 111197	2010-08-16 22:30:41 +00:00
Eli Friedman	7e2f4ce439	Until uleb/sleb are MC-ized, add a hack to make them work with ELF object emission. llvm-svn: 111177	2010-08-16 20:08:40 +00:00
Jim Grosbach	8be0196afe	track local frame size in MFI, not local to the pass, since PEI needs it. llvm-svn: 111164	2010-08-16 18:06:15 +00:00
Jakob Stoklund Olesen	5f72a04ba7	Remove unused functions. llvm-svn: 111156	2010-08-16 17:18:20 +00:00
Ted Kremenek	da2eba58ed	Update CMake build. llvm-svn: 111063	2010-08-14 01:55:09 +00:00
Jim Grosbach	a030fa5297	Add a local stack object block allocation pass. This is still an experimental pass that allocates locals relative to one another before register allocation and then assigns them to actual stack slots as a block later in PEI. This will eventually allow targets with limited index offset range to allocate additional base registers (not just FP and SP) to more efficiently reference locals, as well as handle situations where locals cannot be referenced via SP or FP at all (dynamic stack realignment together with variable sized objects, for example). It's currently incomplete and almost certainly buggy. Work in progress. Disabled by default and gated via the -enable-local-stack-alloc command line option. rdar://8277890 llvm-svn: 111059	2010-08-14 00:15:52 +00:00
Jakob Stoklund Olesen	27e1f26534	Clean up the Spiller.h interface. The earliestStart argument is entirely specific to linear scan allocation, and can be easily calculated by RegAllocLinearScan. Replace std::vector with SmallVector. llvm-svn: 111055	2010-08-13 22:56:53 +00:00
Jakob Stoklund Olesen	d1191ee43c	Implement splitting inside a single block. When a live range is contained in a single block, we can split it around instruction clusters. The current approach is very primitive, splitting before and after the largest gap between uses. llvm-svn: 111043	2010-08-13 21:18:48 +00:00
Jim Grosbach	d1f4465df0	tidy up whitespace a bit llvm-svn: 111019	2010-08-13 16:55:08 +00:00
Jakob Stoklund Olesen	3d1027e7a1	Let LiveInterval::addRange extend existing ranges, it will verify that value numbers match. The old check could accidentally leave holes in openli. Also let useIntv add all ranges for the phi-def value inserted by enterIntvAtEnd. This works as long as the value mapping is established in enterIntvAtEnd. llvm-svn: 110995	2010-08-13 01:05:26 +00:00
Jakob Stoklund Olesen	840b81a19e	Remember to actually update SplitAnalysis statistics now that we have a fancy function to do it. llvm-svn: 110994	2010-08-13 01:05:23 +00:00
Jakob Stoklund Olesen	991e4ee860	Handle an empty dupli. This can happen if the original interval has been broken into two disconnected parts. Ideally, we should be able to detect when the graph is disconnected and create separate intervals, but that code is not implemented yet. Example: Two basic blocks are both branching to a loop header. Our interval is defined in both basic blocks, and live into the loop along both edges. We decide to split the interval around the loop. The interval is split into an inside part and an outside part. The outside part now has two disconnected segments, one in each basic block. If we later decide to split the outside interval into single blocks, we get one interval per basic block and an empty dupli for the remainder. llvm-svn: 110976	2010-08-12 23:02:57 +00:00
Jakob Stoklund Olesen	32c181c444	Update the SplitAnalysis statistics as uses are moved from curli to the new split intervals. This means the analysis can be used for multiple splits as long as curli doesn't shrink. llvm-svn: 110975	2010-08-12 23:02:55 +00:00
Jakob Stoklund Olesen	0910689353	Also recompute HasPHIKill flags in LiveInterval::RenumberValues. If a phi-def value were removed from the interval, the phi-kill flags are no longer valid. llvm-svn: 110949	2010-08-12 20:38:03 +00:00
Jakob Stoklund Olesen	073cd8004a	Remove trailing whitespace. llvm-svn: 110944	2010-08-12 20:01:23 +00:00
Jakob Stoklund Olesen	fa3ea11ae6	Clean up debug output. llvm-svn: 110940	2010-08-12 18:50:55 +00:00
Jakob Stoklund Olesen	622848b262	Implement single block splitting. Before spilling a live range, we split it into a separate range for each basic block where it is used. That way we only get one reload per basic block if the new smaller ranges can allocate to a register. This type of splitting is already present in the standard spiller. llvm-svn: 110934	2010-08-12 17:07:14 +00:00
Jakob Stoklund Olesen	852a2c19dd	Fix a FIXME. The SlotIndex::Slot enum should be private. llvm-svn: 110826	2010-08-11 16:50:17 +00:00
Bill Wendling	0757820f8f	Turn optimize compares back on with fix. We needed to test that a machine op was a register before checking if it was defined. llvm-svn: 110733	2010-08-10 21:38:11 +00:00
Jakob Stoklund Olesen	57f3db6e2e	Give up on register class recalculation when the register is used with subreg operands. We don't currently have a hook to provide "the largest super class of A where all registers' getSubReg(subidx) is valid and in B". llvm-svn: 110730	2010-08-10 21:16:16 +00:00
Dan Gohman	a53f4e23e4	Revert r110718; it broke clang-i386-darwin9. llvm-svn: 110726	2010-08-10 20:49:33 +00:00
Jakob Stoklund Olesen	3b870f045f	Avoid editing the current live interval during remat. The live interval may be used for a spill slot as well, and that spill slot could be shared by split registers. We cannot shrink it, even if we know the current register won't need the spill slot in that range. llvm-svn: 110721	2010-08-10 20:45:07 +00:00
Jakob Stoklund Olesen	62e721478b	More debug spew llvm-svn: 110720	2010-08-10 20:45:01 +00:00
Bill Wendling	558f822bc7	Turn optimize cmps on by default so that we can get some testing by the nightly ARM testers. llvm-svn: 110718	2010-08-10 20:23:02 +00:00
Devang Patel	8e06a5eb47	Do not forget debug info for enums. Use named mdnode to keep track of these types. llvm-svn: 110712	2010-08-10 20:01:20 +00:00
Jakob Stoklund Olesen	53c5022040	Implement register class inflation. When splitting a live range, the new registers have fewer uses and the permissible register class may be less constrained. Recompute the register class constraint from the uses of new registers created for a split. This may let them be allocated from a larger set, possibly avoiding a spill. llvm-svn: 110703	2010-08-10 18:37:40 +00:00
Jakob Stoklund Olesen	284c2dbfd7	Recalculate the spill weight and allocation hint for virtual registers created during live range splitting. llvm-svn: 110686	2010-08-10 17:07:22 +00:00
Devang Patel	b219746c80	Handle TAG_constant for integers. llvm-svn: 110656	2010-08-10 07:11:13 +00:00
Bill Wendling	884514066e	Update CMake...sorry for the breakage. llvm-svn: 110654	2010-08-10 05:16:06 +00:00
Devang Patel	18ba0b4ac3	Simplify. llvm-svn: 110653	2010-08-10 04:12:17 +00:00
Devang Patel	b1e07b3f2a	Drop "const". It does not add value here. llvm-svn: 110652	2010-08-10 04:09:06 +00:00
Evan Cheng	23ef829096	Add missing null check reported by Amaury Pouly. llvm-svn: 110649	2010-08-10 02:39:45 +00:00
Devang Patel	469c12d254	Do not include file static variable in pubnames list. Refactor and simplify code to avoid redundant checks. llvm-svn: 110642	2010-08-10 01:37:23 +00:00
Jakob Stoklund Olesen	e00c49da11	Transpose the calculation of spill weights such that we are calculating one register at a time. This turns out to be slightly faster than iterating over instructions, but more importantly, it allows us to compute spill weights for new registers created after the spill weight pass has run. Also compute the allocation hint at the same time as the spill weight. This allows us to use the spill weight as a cost metric for copies, and choose the most profitable hint if there is more than one possibility. The new hints provide a very small (< 0.1%) but universal code size improvement. llvm-svn: 110631	2010-08-10 00:02:26 +00:00
Bill Wendling	ca67835eaa	Merge the OptimizeExts and OptimizeCmps passes into one PeepholeOptimizer pass. This pass should expand with all of the small, fine-grained optimization passes to reduce compile time and increase happiment. llvm-svn: 110627	2010-08-09 23:59:04 +00:00
Devang Patel	394a69ed52	Undo accidental commit. llvm-svn: 110623	2010-08-09 23:28:52 +00:00
Devang Patel	4eda9abddb	Simplify. Avoid redundant checks. llvm-svn: 110621	2010-08-09 23:26:06 +00:00
Devang Patel	c7cf14f5f6	Refactor. llvm-svn: 110607	2010-08-09 21:39:24 +00:00
Devang Patel	6d9f9feb2b	Refactoring. Update DbgVariable to handle queries itself. llvm-svn: 110600	2010-08-09 21:01:39 +00:00
Devang Patel	b6511a36b4	It is ok, and convenient, to pass descriptors by value. llvm-svn: 110590	2010-08-09 20:20:05 +00:00
Jakob Stoklund Olesen	3fa110f227	A REG_SEQUENCE instruction may use the same register twice. If we are emitting COPY instructions for the REG_SEQUENCE, make sure the kill flag goes on the last COPY. Otherwise we may be using a killed register. <rdar://problem/8287792> llvm-svn: 110589	2010-08-09 20:19:16 +00:00
Devang Patel	406798a17d	Rename a method. llvm-svn: 110586	2010-08-09 18:51:29 +00:00
Bill Wendling	798617b1ab	Use the "isCompare" machine instruction attribute instead of calling the relatively expensive comparison analyzer on each instruction. Also rename the comparison analyzer method to something more in line with what it actually does. This pass will eventually be folded into the Machine CSE pass. llvm-svn: 110539	2010-08-08 05:04:59 +00:00
Dan Gohman	093b42fc7c	Tidy some #includes and forward-declarations, and move the C binding code out of PassManager.cpp and into Core.cpp with the rest of the C binding code. llvm-svn: 110494	2010-08-07 00:43:20 +00:00
Jakob Stoklund Olesen	45e07c8fc5	Lazily defer duplicating the live interval we are splitting until we know it is necessary. Sometimes, live range splitting doesn't shrink the current interval, but simply changes some instructions to use a new interval. That makes the original more suitable for spilling. In this case, we don't need to duplicate the original. llvm-svn: 110481	2010-08-06 22:17:33 +00:00
Jim Grosbach	da27eb246d	Cleanup comment wording llvm-svn: 110466	2010-08-06 18:59:07 +00:00
Jakob Stoklund Olesen	1dfca4e4bb	Keep the MachineFunctionPass pointer around. It is useful for verification. llvm-svn: 110464	2010-08-06 18:47:06 +00:00
Jakob Stoklund Olesen	8c0f693150	Add LiveInterval::RenumberValues - Garbage collection for VNInfos. After heavy editing of a live interval, it is much easier to simply renumber the live values instead of trying to keep track of the unused ones. llvm-svn: 110463	2010-08-06 18:46:59 +00:00
Owen Anderson	a7aed18624	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Jakob Stoklund Olesen	8147d7a6b9	Add more verification of LiveIntervals. llvm-svn: 110454	2010-08-06 18:04:19 +00:00
Jakob Stoklund Olesen	7e0de5ef8e	Fix swapped COPY operands. llvm-svn: 110453	2010-08-06 18:04:17 +00:00
Jakob Stoklund Olesen	0e7752407c	Don't try to verify LiveIntervals for physical registers. When a physical register is in use, some alias of that register has a live interval with a relevant live range. That is the sad state of intervals after physreg coalescing of subregs, and it is good enough for correct register allocation. llvm-svn: 110452	2010-08-06 18:04:14 +00:00
Ted Kremenek	26177d2c24	Update CMake build. llvm-svn: 110429	2010-08-06 04:05:21 +00:00
Bill Wendling	7de9d52c13	Add the Optimize Compares pass (disabled by default). This pass tries to remove comparison instructions when possible. For instance, if you have this code: sub r1, 1 cmp r1, 0 bz L1 and "sub" either sets the same flag as the "cmp" instruction or could be converted to set the same flag, then we can eliminate the "cmp" instruction altogether. This is important for ARM where the ALU instructions could set the CPSR flag, but need a special suffix ('s') to do so. llvm-svn: 110423	2010-08-06 01:32:48 +00:00
Devang Patel	8a18aee421	While emitting DBG_VALUE for registers spilled at the end of a block do not use location of MBB->end(). If a block does not have terminator then incoming iterator points to end(). llvm-svn: 110411	2010-08-06 00:26:18 +00:00
Owen Anderson	bda59bd247	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Jakob Stoklund Olesen	01a81b01bc	Be more aggressive about removing joined physreg copies. When a joined COPY changes subreg liveness, we keep it around as a KILL, otherwise it is safe to delete. llvm-svn: 110403	2010-08-05 23:51:28 +00:00
Jakob Stoklund Olesen	b4ef4a961d	Don't verify LiveVariables if LiveIntervals is available. LiveVariables becomes horribly wrong while the coalescer is running, but the analysis is not zapped until after the coalescer pass has run. This causes tons of false reports when calling verify from the coalescer. llvm-svn: 110402	2010-08-05 23:51:26 +00:00
Owen Anderson	755aceb5d0	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Jakob Stoklund Olesen	e7709ebb64	Add basic verification of LiveIntervals. We verify that the LiveInterval is live at uses and defs, and that all instructions have a SlotIndex. Stuff we don't check yet: - Is the LiveInterval minimal? - Do all defs correspond to instructions or phis? - Do all defs dominate all their live ranges? - Are all live ranges continually reachable from their def? llvm-svn: 110386	2010-08-05 22:32:21 +00:00
Jakob Stoklund Olesen	4583355a78	Remove double-def checking from MachineVerifier, so a register does not have to be killed before being redefined. These checks are usually disabled, and usually fail when enabled. We de facto allow live registers to be redefined without a kill, the corresponding assertions in RegScavenger were removed long ago. llvm-svn: 110362	2010-08-05 18:59:59 +00:00
Jakob Stoklund Olesen	d9572619e2	Avoid using a live std::multimap iterator while editing the map. It looks like we sometimes compare singular iterators, reported by ENABLE_EXPENSIVE_CHECKS. This fixes PR7825. llvm-svn: 110355	2010-08-05 18:12:19 +00:00
Bill Wendling	ca1cb13646	The lower invoke pass needs to have unreachable code elimination run after it because it could create such things. This fixes a MingW buildbot test failure. llvm-svn: 110279	2010-08-04 23:36:02 +00:00
Jakob Stoklund Olesen	7fd4905f08	Coalesce stack slot accesses that arise when spilling both sides of a COPY. This helps avoid silly code: %R0<def> = LOAD <fi#5> STORE <fi#5>, %R0<kill> llvm-svn: 110266	2010-08-04 22:35:11 +00:00
Jakob Stoklund Olesen	dc96e28d70	Checkpoint SplitKit progress. We are now at a point where we can split around simple single-entry, single-exit loops, although still with some bugs. llvm-svn: 110257	2010-08-04 22:08:39 +00:00
Devang Patel	6c378ac473	Use location entry only if the location described by DBG_VALUE is valid. llvm-svn: 110255	2010-08-04 22:07:27 +00:00
Bill Wendling	b87f3e5a7d	The EH prepare passes really want to be the last passes run before code-gen. llvm-svn: 110248	2010-08-04 21:44:13 +00:00
Devang Patel	6d21f61b3f	Fix typo in comment. llvm-svn: 110244	2010-08-04 20:32:36 +00:00
Dan Gohman	2392287306	Change this llvm_unreachable to report_fatal_error, since it can be triggered by valid, if dubious, IR. llvm-svn: 110240	2010-08-04 18:51:09 +00:00
Devang Patel	d71bc1ae4e	While spilling live registers at the end of block check whether they are used by DBG_VALUE machine instructions or not. If a spilled register is used by DBG_VALUE machine instruction then insert a new DBG_VALUE machine instruction to encode variable's new location on stack. llvm-svn: 110235	2010-08-04 18:42:02 +00:00
Devang Patel	0e60e67efb	If a variable is spilled by code generator then use DW_OP_fbreg to describe its location on stack. llvm-svn: 110234	2010-08-04 18:40:52 +00:00
Dan Gohman	5cae103392	Eliminate unnecessary empty string literals. llvm-svn: 110183	2010-08-04 01:39:08 +00:00
Jakob Stoklund Olesen	0c18757c9d	Oops. Don't normalize spill weights twice. When the normalizeSpillWeights function was introduced, I forgot to remove this normalization. This change could affect register allocation. Hopefully for the better. llvm-svn: 110119	2010-08-03 17:21:16 +00:00
Bill Wendling	44dc60ba13	Early exit and reduce indentation. No functionality change. llvm-svn: 110069	2010-08-02 22:06:08 +00:00
Devang Patel	d070128de5	Free DbgScope created for dead functions. llvm-svn: 110045	2010-08-02 17:32:15 +00:00
Oscar Fuentes	40b31ad3ee	Prefix `next' iterator operation with `llvm::'. Fixes potential ambiguity problems on VS 2010. Patch by nobled! llvm-svn: 110029	2010-08-02 06:00:15 +00:00
Eli Friedman	460ad41d6d	PR7586: Make sure we don't claim that unknown bits are actually known in the ISD::AND case of TargetLowering::SimplifyDemandedBits. llvm-svn: 110019	2010-08-02 04:42:25 +00:00
Bill Wendling	d9900542a6	Reference the personalities. Don't copy them into a new vector. llvm-svn: 109966	2010-08-01 01:34:21 +00:00
Eli Friedman	ffe64c06ef	Fix for bug reported by Evzen Muller on llvm-commits: make sure to correctly check the range of the constant when optimizing a comparison between a constant and a sign_extend_inreg node. llvm-svn: 109854	2010-07-30 06:44:31 +00:00
Benjamin Kramer	a3e0ddb564	Plug the remaining MC leaks by giving MCObjectStreamer/MCAsmStreamer ownership of the TargetAsmBackend and the MCCodeEmitter. llvm-svn: 109767	2010-07-29 17:48:06 +00:00
Dale Johannesen	329d4741a5	Comment typo. llvm-svn: 109765	2010-07-29 17:45:24 +00:00
Jakob Stoklund Olesen	36cf119049	Fix a bug in the -regalloc=fast handling of exotic two-address instruction with multiple defs, like t2LDRSB_POST. The first def could accidentally steal the physreg that the second, tied def was required to be allocated to. Now, the tied use-def is treated more like an early clobber, and the physreg is reserved before allocating the other defs. This would never be a problem when the tied def was the only def which is the usual case. This fixes MallocBench/gs for thumb2 -O0. llvm-svn: 109715	2010-07-29 00:52:19 +00:00
Jakob Stoklund Olesen	0ff2c110ad	Print out the regclass of any virtual registers used by a machine instruction. llvm-svn: 109608	2010-07-28 18:35:46 +00:00
Devang Patel	84a74779a1	It is FE's responsibility to emit proper directory name. llvm-svn: 109538	2010-07-27 20:51:15 +00:00
Jim Grosbach	7383cf06ba	Grammar llvm-svn: 109525	2010-07-27 18:36:27 +00:00
Nate Begeman	317b969ac5	Fix a crash in the dag combiner caused by ConstantFoldBIT_CONVERTofBUILD_VECTOR calling itself recursively and returning a SCALAR_TO_VECTOR node, but assuming the input was always a BUILD_VECTOR. llvm-svn: 109519	2010-07-27 18:02:18 +00:00
Jim Grosbach	2ff0e64bc3	80 column llvm-svn: 109513	2010-07-27 17:38:47 +00:00
Jim Grosbach	7639967e6c	fix typo llvm-svn: 109511	2010-07-27 17:14:29 +00:00
Bill Wendling	0ff1ef650b	It's better to have the arrays, which would trigger the creation of stack protectors, to be near the stack protectors on the stack. Accomplish this by tagging the stack object with a predicate that indicates that it would trigger this. In the prolog-epilog inserter, assign these objects to the stack after the stack protector but before the other objects. llvm-svn: 109481	2010-07-27 01:55:19 +00:00
Jakob Stoklund Olesen	c698417e52	Add SplitEditor to SplitKit. This class will be used to edit live intervals and rewrite instructions for live range splitting. Still work in progress. llvm-svn: 109469	2010-07-26 23:44:11 +00:00
Dan Gohman	c2af77f510	Fix a use-after-free. llvm-svn: 109468	2010-07-26 23:40:24 +00:00
Bill Wendling	fa60b0ee51	Using llvm.eh.catch.all.value instead of .llvm.eh.catch.all.value. llvm-svn: 109462	2010-07-26 22:36:52 +00:00
Evan Cheng	e6d6c5dd11	The "excess register pressure" returned by HighRegPressure() is not accurate enough to factor into scheduling priority. Eliminate it and add early exits to speed up scheduling. llvm-svn: 109449	2010-07-26 21:49:07 +00:00
Dan Gohman	2810bacafb	Handle Values with no value in getCopyFromRegs. llvm-svn: 109415	2010-07-26 18:15:41 +00:00
Dan Gohman	f9da3c3b88	A block dominates itself, by definition. llvm-svn: 109402	2010-07-26 17:38:15 +00:00
Duncan Sands	136a6f0dbb	Pacify gcc-4.5 which wrongly thinks that RExcess (passed as the Excess parameter) may be used uninitialized in the callers of HighRegPressure. llvm-svn: 109393	2010-07-26 07:54:17 +00:00
Lang Hames	2e3f20b9aa	Factored out a bit of common code to mark VNInfos for deletion. llvm-svn: 109388	2010-07-26 01:49:41 +00:00
Evan Cheng	8ae3ecad2b	Add comments. llvm-svn: 109383	2010-07-25 18:59:43 +00:00
Bob Wilson	280ce9984e	Fix crashes when scheduling a CopyToReg node -- getMachineOpcode asserts on those. Radar 8231572. llvm-svn: 109367	2010-07-25 05:34:27 +00:00
Anton Korobeynikov	3c8eb80d93	Add hook to insert late LLVM=>LLVM passes just before isel llvm-svn: 109354	2010-07-24 20:48:54 +00:00
Bob Wilson	56c006561c	Change ScheduleDAGInstrs::Defs and ::Uses to be variable-size vectors instead of fixed size arrays, so that increasing FirstVirtualRegister to 16K won't cause a compile time performance regression. llvm-svn: 109330	2010-07-24 06:01:53 +00:00
Devang Patel	498877d055	Use current working directory when Dirname is empty. This only happens when absolute source file path is used on compiler command line. llvm-svn: 109302	2010-07-24 00:53:22 +00:00
Evan Cheng	37b740c4bf	Add an ILP scheduler. This is a register pressure aware scheduler that's appropriate for targets without detailed instruction itineraries. The scheduler schedules for increased instruction level parallelism in low register pressure situations; it schedules to reduce register pressure when the register pressure becomes high. On x86_64, this is a win for all tests in CFP2000. It also sped up 256.bzip2 by 16%. llvm-svn: 109300	2010-07-24 00:39:05 +00:00
Jim Grosbach	ba4b1909ce	Remove too-strict assertion. We may want the vreg copy of the physical register to be of a different register class. For example, in Thumb1 if the live-in is a high register, we want the vreg to be a low register. rdar://8224931 llvm-svn: 109291	2010-07-23 23:48:02 +00:00
Devang Patel	28499f76c9	Revert r109262. llvm-svn: 109285	2010-07-23 23:04:41 +00:00
Evan Cheng	df907f4594	- Allow target to specify when is register pressure "too high". In most cases, it's too late to start backing off aggressive latency scheduling when most of the registers are in use so the threshold should be a bit tighter. - Correctly handle live out's and extract_subreg etc. - Enable register pressure aware scheduling by default for hybrid scheduler. For ARM, this is almost always a win on # of instructions. It's runtime neutral for most of the tests. But for some kernels with high register pressure it can be a huge win. e.g. 464.h264ref reduced number of spills by 54 and sped up by 20%. llvm-svn: 109279	2010-07-23 22:39:59 +00:00
Dan Gohman	55e244698a	Use the proper type for shift counts. This fixes a bootstrap error. llvm-svn: 109265	2010-07-23 21:08:12 +00:00
Devang Patel	3032354bbe	If directory name is empty then try to extract one using absolute file name. llvm-svn: 109262	2010-07-23 20:36:13 +00:00
Dan Gohman	0818684a70	DAGCombine (shl (anyext x, c)) to (anyext (shl x, c)) if the high bits are not demanded. This often allows the anyext to be folded away. llvm-svn: 109242	2010-07-23 18:03:30 +00:00
Dan Gohman	2e00e3b12d	Make SDNode::dump() print a newline at the end. llvm-svn: 109234	2010-07-23 16:37:47 +00:00
Eric Christopher	faf5c76114	80-col. llvm-svn: 109205	2010-07-23 01:05:59 +00:00
Chris Lattner	8f3adc9057	remove the JIT "NeedsExactSize" feature and supporting logic. llvm-svn: 109167	2010-07-22 21:17:55 +00:00
Gabor Greif	59f9970ba5	keep in 80 cols llvm-svn: 109122	2010-07-22 17:18:03 +00:00
Gabor Greif	dde79d8f1a	mass elimination of reliance on automatic iterator dereferencing llvm-svn: 109103	2010-07-22 13:36:47 +00:00
Gabor Greif	3e44ea1917	undo 80 column trespassing I caused llvm-svn: 109092	2010-07-22 10:37:47 +00:00
Evan Cheng	bf32e54bac	Re-apply r109079 with fix. llvm-svn: 109083	2010-07-22 06:24:48 +00:00
Owen Anderson	6c55cccf87	Revert r109079, which broke a lot of CodeGen tests. llvm-svn: 109082	2010-07-22 06:01:28 +00:00
Reid Kleckner	d85e3c5a86	Initial modifications to MCAssembler and TargetMachine for the MCJIT. Patch by Olivier Meurant! llvm-svn: 109080	2010-07-22 05:58:53 +00:00
Evan Cheng	bd81bff672	Initialize RegLimit only when register pressure is being tracked. llvm-svn: 109079	2010-07-22 05:18:41 +00:00
Evan Cheng	285903853f	More register pressure aware scheduling work. llvm-svn: 109064	2010-07-21 23:53:58 +00:00
Jim Grosbach	965a73a28c	For ARM/Darwin, add a dwarf entry indicating whether a function is arm or thumb rdar://8202967 llvm-svn: 109057	2010-07-21 23:03:52 +00:00
Owen Anderson	a57b97e7e7	Fix batch of converting RegisterPass<> to INITIALIZE_PASS(). llvm-svn: 109045	2010-07-21 22:09:45 +00:00
Jim Grosbach	a8683bb033	80 column and trailing whitespace cleanup llvm-svn: 109037	2010-07-21 21:21:52 +00:00
Dan Gohman	093cb79d4b	Disallow null as a named metadata operand. Make MDNode::destroy private. Fix the one thing that used MDNode::destroy, outside of MDNode itself. One should never delete or destroy an MDNode explicitly. MDNodes implicitly go away when there are no references to them (implementation details aside). llvm-svn: 109028	2010-07-21 18:54:18 +00:00
Lang Hames	bdafcc633d	Changed OStream templates to functions on raw_ostream, removed the unused "renderWarnings" function. llvm-svn: 109003	2010-07-21 09:02:06 +00:00
Evan Cheng	a77f3d3b37	Teach bottom up pre-ra scheduler to track register pressure. Work in progress. llvm-svn: 108991	2010-07-21 06:09:07 +00:00
Jakob Stoklund Olesen	0fef9dda8e	Change the createSpiller interface to take a MachineFunctionPass argument. The spillers can pluck the analyses they need from the pass reference. Switch some never-null pointers to references. llvm-svn: 108969	2010-07-20 23:50:15 +00:00
Jakob Stoklund Olesen	ed4075cc3b	Implement loop splitting analysis. Determine which loop exit blocks need a 'pre-exit' block inserted. Recognize when this would be impossible. llvm-svn: 108941	2010-07-20 21:46:58 +00:00
Dale Johannesen	6e5ec6263e	Fix test for switch statements and increase threshold a bit per experimentation. llvm-svn: 108935	2010-07-20 21:29:12 +00:00
Jakob Stoklund Olesen	ff095507e3	Appease the colonials. llvm-svn: 108845	2010-07-20 16:12:37 +00:00
Jakob Stoklund Olesen	36d12c679d	Beginning SplitKit - utility classes for live range splitting. This is a work in progress. So far we have some basic loop analysis to help determine where it is useful to split a live range around a loop. The actual loop splitting code from Splitter.cpp is also going to move in here. llvm-svn: 108842	2010-07-20 15:41:07 +00:00
Lang Hames	31dfb75b52	Updated css classes for the pressure table legend. llvm-svn: 108839	2010-07-20 14:35:55 +00:00
Lang Hames	2ff2193a80	Oops - I tables render poorly in Chrome without this explicit height specification. llvm-svn: 108824	2010-07-20 10:29:46 +00:00
Lang Hames	a475ab7f02	Use run-length encoding to represent identical adjacent cells in the pressure and interval table. Reduces output HTML file sizes by ~80% in my test cases. Also fix access of private member type by << operator. llvm-svn: 108823	2010-07-20 10:18:54 +00:00
Lang Hames	716b184108	Added support for turning HTML indentation on and off (indentation off by default). Reduces output file size ~20% on my test cases. llvm-svn: 108822	2010-07-20 09:13:29 +00:00
Lang Hames	a93fe2de3c	Switched to rendering after allocation (but before rewriting) in PBQP. Updated renderer to use allocation information from VirtRegMap (if available) to render spilled intervals differently. llvm-svn: 108815	2010-07-20 07:41:44 +00:00
Dale Johannesen	08645f1991	Don't hoist things out of a large switch inside a loop, for the reasons in the comments. This is a major win on 253.perlbmk on ARM Darwin. I expect it to be a good heuristic in general, but it's possible some things will regress; I'll be watching. `7940152`. llvm-svn: 108792	2010-07-20 00:50:13 +00:00
Stuart Hastings	61475c5c3c	Correct line info for declarations/definitions. Radar 8063111. llvm-svn: 108784	2010-07-19 23:56:30 +00:00
Devang Patel	d61b735d25	Fix memory leak reported by valgrind. Do not visit operands of old instruction. Visit all operands of new instruction. llvm-svn: 108767	2010-07-19 23:25:39 +00:00
Dan Gohman	b5e918dc05	After a custom inserter, in a block which has constant instructions, update the current basic block in addition to the current insert position, so that they remain consistent. This fixes rdar://8204072. llvm-svn: 108765	2010-07-19 22:48:56 +00:00
Evan Cheng	10f99a3490	ARM has to provide its own TargetLowering::findRepresentativeClass because its scalar floating point registers alias its vector registers. llvm-svn: 108761	2010-07-19 22:15:08 +00:00
Evan Cheng	7a135510e3	Teach computeRegisterProperties() to compute "representative" register class for legal value types. A "representative" register class is the largest legal super-reg register class for a value type. e.g. On i386, GR32 is the rep register class for i8 / i16 / i32; on x86_64 it would be GR64. This property will be used by the register pressure tracking instruction scheduler. llvm-svn: 108735	2010-07-19 18:47:01 +00:00
Jakob Stoklund Olesen	a58a7e7f9e	Spillers may alter MachineLoopInfo when breaking critical edges, so make it non-const. llvm-svn: 108734	2010-07-19 18:41:20 +00:00
Devang Patel	18efced1a2	Fix PR 7662. Do not try to insert local variable info to a DIE used for function declaration. llvm-svn: 108731	2010-07-19 17:53:55 +00:00
Benjamin Kramer	58c283ee85	Update CMake build. llvm-svn: 108700	2010-07-19 15:37:03 +00:00
Lang Hames	6624efb711	Render MachineFunctions to HTML pages, with options to render register pressure estimates and liveness alongside. Still experimental. llvm-svn: 108698	2010-07-19 15:22:28 +00:00
Owen Anderson	9c271e2835	Remove r108639 now that it is handled by InstCombine instead. llvm-svn: 108688	2010-07-19 08:10:24 +00:00
Daniel Dunbar	419197cc4d	Target: Give the TargetAsmParser access to the TargetMachine. - Unfortunate, but necessary for now to handle subtarget instruction matching. Eventually we should factor out the lower level target machine information so we don't need to do this. llvm-svn: 108664	2010-07-19 00:33:49 +00:00
Daniel Dunbar	7f5bf5ae2a	MC: Move several clients to using AsmParser constructor function. llvm-svn: 108645	2010-07-18 18:31:33 +00:00
Douglas Gregor	8ff89f5c02	Fix struct/class mismatch llvm-svn: 108642	2010-07-18 11:47:56 +00:00
Owen Anderson	f7f9c8a2f7	Add a DAGCombine xform to fold away redundant float->double->float conversions around sqrt instructions. I am assured by people more knowledgeable than me that there are no rounding issues in eliminating this. This fixed <rdar://problem/8197504>. llvm-svn: 108639	2010-07-18 08:47:54 +00:00
Lang Hames	1392b8eb79	Added -pbqp-pre-coalescing flag to PBQP. If enabled this will cause PBQP to require LoopSplitter be run prior to register allocation. Entirely for testing purposes at the moment. llvm-svn: 108634	2010-07-18 00:57:59 +00:00
Bill Wendling	ac67e99d53	Use isPrologLabel() instead of checking the opcode directly. llvm-svn: 108628	2010-07-17 19:18:44 +00:00
Zhongxing Xu	b653ce648d	update CMakeLists.txt llvm-svn: 108620	2010-07-17 12:12:42 +00:00
Lang Hames	5864012cc0	Removed unused inRange variable. llvm-svn: 108618	2010-07-17 11:43:07 +00:00
Lang Hames	225977d4f9	LoopSplitter - intended to split live intervals over loop boundaries. Still very much under development. Comments and fixes will be forthcoming. (This commit includes some small tweaks to LiveIntervals & LoopInfo to support the splitter) llvm-svn: 108615	2010-07-17 07:34:01 +00:00
Lang Hames	211e7ce7e7	Iterating over sets of pointers in a heuristic was a bad idea. Switching any command line parameter changed the register allocation produced by PBQP. Turns out variety is not the spice of life. Fixed some comparators, added others. All good now. llvm-svn: 108613	2010-07-17 06:31:41 +00:00
Eric Christopher	0baaa9bcc1	Propagate alloca alignment information via variable size object frame information. No functional change yet. llvm-svn: 108583	2010-07-17 00:28:22 +00:00
Bill Wendling	bf8370ff36	Consider this function: void foo() { __builtin_unreachable(); } It will output the following on Darwin X86: _func1: Leh_func_begin0: pushq %rbp Ltmp0: movq %rsp, %rbp Ltmp1: Leh_func_end0: This prolog adds a new Call Frame Information (CFI) row to the FDE with an address that is not within the address range of the code it describes -- part is equal to the end of the function -- and therefore results in an invalid EH frame. If we emit a nop in this situation, then the CFI row is now within the address range. llvm-svn: 108568	2010-07-16 22:51:10 +00:00
Bill Wendling	499f797cdd	Rename DBG_LABEL PROLOG_LABEL, because it's only used during prolog emission and thus is a much more meaningful name. llvm-svn: 108563	2010-07-16 22:20:36 +00:00
Jakob Stoklund Olesen	b15cbd343c	Remove remaining calls to TII::isMoveInstr. llvm-svn: 108556	2010-07-16 21:03:55 +00:00
Dan Gohman	1e936277c3	Revert r108369, sorting llvm.dbg.declare information by source position, since it doesn't work for front-ends which don't emit column information (which includes llvm-gcc in its present configuration), and doesn't work for clang for K&R style variables where the variables are declared in a different order from the parameter list. Instead, make a separate pass through the instructions to collect the llvm.dbg.declare instructions in order. This ensures that the debug information for variables is emitted in this order. llvm-svn: 108538	2010-07-16 17:54:27 +00:00
Eli Friedman	17c5a23559	Get rid of a bunch of duplicated ELF enum values. llvm-svn: 108520	2010-07-16 07:53:29 +00:00
Jakob Stoklund Olesen	37c42a3d02	Remove many calls to TII::isMoveInstr. Targets should be producing COPY anyway. TII::isMoveInstr is going to be completely removed. llvm-svn: 108507	2010-07-16 04:45:42 +00:00
Dan Gohman	103c4ebea5	Use the source-order scheduler instead of the "fast" scheduler at -O0, because it's more likely to keep debug line information in its original order. llvm-svn: 108496	2010-07-16 02:01:19 +00:00
Dale Johannesen	bfd4fd7bb7	The SelectionDAGBuilder's handling of debug info, on rare occasions, caused code to be generated in a different order. All cases I've seen involved float softening in the type legalizer, and this could be perhaps be fixed there, but it's better not to generate things differently in the first place. 7797940 (6/29/2010..7/15/2010). llvm-svn: 108484	2010-07-16 00:02:08 +00:00
Bill Wendling	4bda1c8e68	Revert. This isn't the correct way to go. llvm-svn: 108478	2010-07-15 23:42:21 +00:00
Bill Wendling	973dc3b1d8	Handle code gen for the unreachable instruction if it's the only instruction in the function. We'll just turn it into a "trap" instruction instead. The problem with not handling this is that it might generate a prologue without the equivalent epilogue to go with it: $ cat t.ll define void @foo() { entry: unreachable } $ llc -o - t.ll -relocation-model=pic -disable-fp-elim -unwind-tables .section __TEXT,__text,regular,pure_instructions .globl _foo .align 4, 0x90 _foo: ## @foo Leh_func_begin0: ## BB#0: ## %entry pushq %rbp Ltmp0: movq %rsp, %rbp Ltmp1: Leh_func_end0: ... The unwind tables then have bad data in them causing all sorts of problems. Fixes <rdar://problem/8096481>. llvm-svn: 108473	2010-07-15 23:32:40 +00:00
Evan Cheng	55f0c6b9fc	Split -enable-finite-only-fp-math to two options: -enable-no-nans-fp-math and -enable-no-infs-fp-math. All of the current codegen fp math optimizations only care whether the fp arithmetics arguments and results can never be NaN. llvm-svn: 108465	2010-07-15 22:07:12 +00:00
Chris Lattner	60b131654b	fix the definitions of ConstTextCoalSection/ConstDataCoalSection to keep "Text" in sync with the "pure instructions" section attribute. Lack of this attribute was preventing the assembler from emitting multibyte noops instructions for templates (and inlines, and other coalesced stuff) and was causing the assembler to mismatch .o files. This fixes rdar://8018335 llvm-svn: 108461	2010-07-15 21:22:00 +00:00
Bill Wendling	2da75ef315	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 108452	2010-07-15 20:04:36 +00:00
Bill Wendling	dd5e9d8faf	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 108450	2010-07-15 20:01:02 +00:00
Bill Wendling	51a9c0a1b3	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. This time make sure to allocate enough space in the std::vector. llvm-svn: 108449	2010-07-15 19:58:14 +00:00
Bill Wendling	5a8d15c553	Reserve a goodly amount of room for the vectors. llvm-svn: 108448	2010-07-15 19:41:20 +00:00
Devang Patel	df09db62e2	Fix crash reported in PR7653. llvm-svn: 108441	2010-07-15 18:45:27 +00:00
Bill Wendling	030b0286ec	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 108440	2010-07-15 18:43:09 +00:00
Bill Wendling	57681404b0	Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 108438	2010-07-15 18:40:50 +00:00
Chris Lattner	c48adb60ca	revert bill's patches in an attempt to fix the buildbot. llvm-svn: 108419	2010-07-15 06:51:46 +00:00
Bill Wendling	1f7071a3e4	Fix headers. llvm-svn: 108413	2010-07-15 06:05:18 +00:00
Bill Wendling	e7e6ca5c57	Use std::vector instead of a hard-coded array. The length of that array could get very large, but we only need it to be the size of the number of pregs. llvm-svn: 108412	2010-07-15 06:04:38 +00:00
Bill Wendling	d5b390189d	Use std::vector instead of a hard-coded array. The length of that array could get very large, but we only need it to be the size of the number of pregs. llvm-svn: 108411	2010-07-15 05:56:32 +00:00
Chris Lattner	28fd6785bc	a more graceful fix for test/Other/inline-asm-newline-terminator.ll, follow on to r103765 llvm-svn: 108390	2010-07-15 00:37:34 +00:00
Eric Christopher	474e56a2bf	80-col. llvm-svn: 108381	2010-07-14 23:41:32 +00:00
Dan Gohman	f10cd5c6cb	Make the order in which variables are described in debug information independent of the order that isel happens to visit the dbg_declare intrinsics. This fixes a bug in which the formal arguments were being printed in reverse order, now that fast isel is going bottom up. llvm-svn: 108369	2010-07-14 23:08:16 +00:00
Dan Gohman	c12a6731c5	Properly restore DebugLoc after leaving the local constant area. llvm-svn: 108364	2010-07-14 22:01:31 +00:00
Dan Gohman	042523340b	Delete fast-isel's trivial load optimization; it breaks debugging because it can look past points where a debugger might modify user variables. llvm-svn: 108336	2010-07-14 17:25:37 +00:00
Evan Cheng	d542414945	Teach ProcessImplicitDefs to transform more COPY instructions into IMPLICIT_DEF (and subsequently eliminate them). This allows machine LICM to hoist IMPLICIT_DEF's. PR7620. llvm-svn: 108304	2010-07-14 01:22:19 +00:00
Dan Gohman	1f471435f8	Don't propagate debug locations to instructions for materializing constants, since they may not be emitted near the other instructions which get the same line, and this confuses debug info. llvm-svn: 108302	2010-07-14 01:07:44 +00:00
Jakob Stoklund Olesen	cd7a40f4ec	Print VNInfo flags. llvm-svn: 108277	2010-07-13 21:19:05 +00:00
Dale Johannesen	caca5488dc	In inline asm treat indirect 'X' constraint as 'm'. This may not be right in all cases, but it's better than asserting which it was doing before. PR 7528. llvm-svn: 108268	2010-07-13 20:17:05 +00:00
Jakob Stoklund Olesen	fc4b8b8e80	Add an assertion to make PR7542 fail consistently. LiveInterval::overlapsFrom dereferences end() if it is called on an empty interval. It would be reasonable to just return false - an empty interval doesn't overlap anything, but I want to know who is doing it first. llvm-svn: 108264	2010-07-13 19:56:28 +00:00
Jakob Stoklund Olesen	b43455feaf	Fix LiveInterval::overlaps so it doesn't claim touching intervals overlap. Also, one binary search is enough. llvm-svn: 108261	2010-07-13 19:42:20 +00:00
Jakob Stoklund Olesen	54e620d2c7	Don't add memory operands to storeRegToStackSlot / loadRegFromStackSlot results, they already have one. This fixes the himenobmtxpa miscompilation on ARM. The PostRA scheduler got confused by the double memoperand and hoisted a stack slot load above a store to the same slot. llvm-svn: 108219	2010-07-13 00:23:30 +00:00
Rafael Espindola	a18c5a0e5e	Fix a typo and fit in 80 columns. Found by Bob Wilson. llvm-svn: 108164	2010-07-12 18:11:17 +00:00
Duncan Sands	41b4a6b36a	Convert some tab stops into spaces. llvm-svn: 108130	2010-07-12 08:16:59 +00:00
Rafael Espindola	871c724773	Convert the last use of getPhysicalRegisterRegClass and remove it. AggressiveAntiDepBreaker should not be using getPhysicalRegisterRegClass. An instruction might be using a register that can only be replaced with one from a subclass of getPhysicalRegisterRegClass. With this patch we use getMinimalPhysRegClass. This is correct, but conservative. We should check the uses of the register and select the largest register class that can be used in all of them. llvm-svn: 108122	2010-07-12 02:55:34 +00:00
Rafael Espindola	01c5a15dde	Don't use getPhysicalRegisterRegClass in PBQP. The existing checks that the physical register can be allocated in the class of the virtual are sufficient. I think that the test for virtual registers is more strict than it needs to be; it should be possible to coalesce two virtual registers if the class of one is a subclass of the other. llvm-svn: 108118	2010-07-12 01:45:38 +00:00
Rafael Espindola	e35d70fafa	Convert the last getPhysicalRegisterRegClass in VirtRegRewriter.cpp to getMinimalPhysRegClass. It was used to produce spills, and it is better to use the most specific class if possible. Update getLoadStoreRegOpcode to handle GR32_AD. llvm-svn: 108115	2010-07-12 00:52:33 +00:00
Chris Lattner	0b7ae20a35	change machinelicm to use MachineInstr::isSafeToMove. No intended functionality change. The avoidance of hoisting implicitdef seems wrong though. llvm-svn: 108109	2010-07-12 00:00:35 +00:00
Jakob Stoklund Olesen	c4227f1362	Remove TargetInstrInfo::copyRegToReg entirely. Targets must now implement TargetInstrInfo::copyPhysReg instead. There is no longer a default implementation forwarding to copyRegToReg. llvm-svn: 108095	2010-07-11 17:01:17 +00:00
Rafael Espindola	d7c4963f2f	Convert uses of getPhysicalRegisterRegClass in VirtRegRewriter.cpp. The first one was used just to call isSafeToMoveRegClassDefs. In general, using a more specific reg class is better, in practice only x86 implements that method and the results are always the same. The second one is in FindFreeRegister and is used to check if a register is in a register class, a much more direct call to contains is better as it should cover more cases and is faster. llvm-svn: 108093	2010-07-11 16:45:17 +00:00
Chandler Carruth	34e0d14ff4	Remove two other uses of ATTRIBUTE_UNUSED for variables only used within assert()s, switching to void-casts. Removed an unneeded Compiler.h include as a result. There are two other uses in LLVM, but they're not due to assert()s, so I've left them alone. llvm-svn: 108088	2010-07-11 08:18:12 +00:00
Jakob Stoklund Olesen	51642aea77	Use COPY for fast-isel bitconvert, but don't create cross-class copies. This doesn't change the behavior of SelectBitcast for X86. llvm-svn: 108073	2010-07-11 05:16:54 +00:00
Rafael Espindola	a76eccf815	Fix va_arg for doubles. With this patch VAARG nodes always contain the correct alignment information, which simplifies ExpandRes_VAARG a bit. The patch introduces a new alignment information to TargetLoweringInfo. This is needed since the two natural candidates cannot be used: * The 's' in target data: If this is set to the minimal alignment of any argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for example. * The getTransientStackAlignment method. It is possible for an architecture to have arguments less aligned than the alignment we maintain for the stack pointer. llvm-svn: 108072	2010-07-11 04:01:49 +00:00
Jakob Stoklund Olesen	7147ab9e78	Use COPY for extracting ImplicitDef'ed values from fast-isel instructions. This assumes that the registers can be copied which is probably a safe assumption. llvm-svn: 108070	2010-07-11 03:31:05 +00:00
Jakob Stoklund Olesen	3bb1267431	Use COPY in FastISel everywhere it is safe and trivial. The remaining copyRegToReg calls actually check the return value (shock!), so we cannot trivially replace them with COPY instructions. llvm-svn: 108069	2010-07-11 03:31:00 +00:00
Jakob Stoklund Olesen	0c76d6ec21	Replace copyRegToReg with COPY everywhere in lib/CodeGen except for FastISel. llvm-svn: 108062	2010-07-10 22:42:59 +00:00
Jakob Stoklund Olesen	ad89613b65	Only collect subreg extracting copies for later coalescing. This also avoids fatal copies from physregs. llvm-svn: 108061	2010-07-10 22:42:53 +00:00
Dan Gohman	a64a323564	Fix a bug in the code which re-inserts DBG_VALUE nodes after scheduling; if a block is split (by a custom inserter), the insert point may be in a different block than it was originally. This fixes 32-bit llvm-gcc bootstrap builds, and I haven't been able to reproduce it otherwise. llvm-svn: 108060	2010-07-10 22:42:31 +00:00
Jakob Stoklund Olesen	e50d30d586	Emit COPY instructions instead of using copyRegToReg in InstrEmitter, ScheduleDAGEmit, TwoAddressLowering, and PHIElimination. This switches the bulk of register copies to using COPY, but many less used copyRegToReg calls remain. llvm-svn: 108050	2010-07-10 19:08:25 +00:00
Dan Gohman	fbdba81550	Insert IMPLICIT_DEF instructions at the current insert position, not at the end of the block. llvm-svn: 108045	2010-07-10 13:55:45 +00:00
Dan Gohman	d7b5ce3312	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up at the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Devang Patel	57e72370ae	Update DBG_VALUE to refer appropriate stack slot in case of a spill. llvm-svn: 108023	2010-07-09 21:48:31 +00:00
Jakob Stoklund Olesen	b5c899d11b	Fix small bug in isMoveInstr -> COPY translation llvm-svn: 108013	2010-07-09 20:55:49 +00:00
Jakob Stoklund Olesen	7a7b55eb67	Automatically fold COPY instructions into stack load/store. llvm-svn: 108012	2010-07-09 20:43:13 +00:00
Jakob Stoklund Olesen	e9fdcaa68a	Remat uncoalescable COPY instrs llvm-svn: 108010	2010-07-09 20:43:05 +00:00
Bill Wendling	f831d86311	Clarify what mysterious check means. llvm-svn: 108005	2010-07-09 19:44:12 +00:00
Dan Gohman	7929c448fc	Fix MachineLICM to actually visit inner loops. llvm-svn: 108001	2010-07-09 18:49:45 +00:00
Jakob Stoklund Olesen	bd953d1805	Change TII::foldMemoryOperand API to require the machine instruction to be inserted in a MBB, and return an already inserted MI. This target API change is necessary to allow foldMemoryOperand to call storeToStackSlot and loadFromStackSlot when folding a COPY to a stack slot reference in a target independent way. The foldMemoryOperandImpl hook is going to change in the same way, but I'll wait until COPY folding is actually implemented. Most targets only fold copies and won't need to specialize this hook at all. llvm-svn: 107991	2010-07-09 17:29:08 +00:00
Bob Wilson	6586e9b203	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Gabor Greif	52617fc462	cache result of operator* llvm-svn: 107980	2010-07-09 16:08:33 +00:00
Jakob Stoklund Olesen	d4d9e53b20	Avoid creating %physreg:subidx operands in SimpleRegisterCoalescing::RemoveCopyByCommutingDef. This fixes PR7602. llvm-svn: 107957	2010-07-09 05:56:21 +00:00
Jakob Stoklund Olesen	cac54d6435	Deal with a few remaining spots that assume physical registers have live intervals. This fixes PR7601. llvm-svn: 107955	2010-07-09 04:35:38 +00:00
Jakob Stoklund Olesen	66b3649030	Fix broken isCopy handling in TrimLiveIntervalToLastUse. llvm-svn: 107950	2010-07-09 01:27:21 +00:00
Jakob Stoklund Olesen	5165fa1c39	Handle COPY in VirtRegRewriter. llvm-svn: 107949	2010-07-09 01:27:19 +00:00
Dan Gohman	0b5aa1cdd3	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Bob Wilson	21eed476e8	Reenable DAG combining for vector shuffles. It looks like it was temporarily disabled and then never turned back on again. Adjust some tests, one because this change avoids an unnecessary instruction, and the other to make it continue testing what it was intended to test. llvm-svn: 107941	2010-07-09 00:38:12 +00:00
Stuart Hastings	d08fb75aaa	Reverting r107918 and r107919. Radar 8063111. llvm-svn: 107930	2010-07-08 23:25:39 +00:00
Jakob Stoklund Olesen	823e90e12a	Revert "Fix broken isCopy handling in TrimLiveIntervalToLastUse" This reverts commit 107921. It broke the clang self host. llvm-svn: 107926	2010-07-08 22:52:47 +00:00
Devang Patel	4c6bd6612f	Relax assertion. In optimized code, it is possible that first instruction is coming from an inlined function. This fixes PR7596. llvm-svn: 107923	2010-07-08 22:39:20 +00:00
Bill Wendling	a992445ff2	Extension of r107506. Make sure that we don't mark a function as having a call if the inline ASM doesn't need a stack frame. llvm-svn: 107922	2010-07-08 22:38:02 +00:00
Jakob Stoklund Olesen	75c465585a	Fix broken isCopy handling in TrimLiveIntervalToLastUse llvm-svn: 107921	2010-07-08 22:30:38 +00:00
Stuart Hastings	43d226deea	Fix decl/def debug info for template functions. Radar 8063111. llvm-svn: 107919	2010-07-08 22:28:59 +00:00
Devang Patel	9c160e1213	Reuse DIEInteger for 1. This is frequently used while emitting an attribute using dwarf::DW_FORM_flag form. llvm-svn: 107903	2010-07-08 20:10:35 +00:00
Jim Grosbach	c280fc7514	Clean up scavengeRegister() a bit to prefer available regs, which allows the simplification of frame index register scavenging to not have to check for available registers directly and instead just let scavengeRegister() handle it. llvm-svn: 107880	2010-07-08 16:49:26 +00:00
Jakob Stoklund Olesen	00264624a9	Convert EXTRACT_SUBREG to COPY when emitting machine instrs. EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead. Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg(). The isMoveInstr hook will be removed later. llvm-svn: 107879	2010-07-08 16:40:22 +00:00
Jakob Stoklund Olesen	a1e883dcf6	Remove references to INSERT_SUBREG after de-SSA. Fix X86InstrInfo::convertToThreeAddressWithLEA to generate COPY instead of INSERT_SUBREG. llvm-svn: 107878	2010-07-08 16:40:15 +00:00
Benjamin Kramer	0ae3f08c0d	Merge the duplicated iabs optimization in DAGCombiner and let it detect a few more idioms. llvm-svn: 107868	2010-07-08 12:09:56 +00:00
Jakob Stoklund Olesen	89a4e25007	Add TargetInstrInfo::copyPhysReg hook and use it from LowerSubregs. This target hook is intended to replace copyRegToReg entirely, but for now it calls copyRegToReg. Any remaining calls to copyRegToReg will be replaced by COPY instructions. llvm-svn: 107854	2010-07-08 05:01:41 +00:00
Dan Gohman	e75704369d	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Jim Grosbach	6533f24370	When processing frame index virtual registers, consider all available registers (if there are any) and use the one which remains available for the longest rather than just using the first one. This should help enable better re-use of the loaded frame index values. rdar://7318760 llvm-svn: 107847	2010-07-08 00:38:54 +00:00
Dan Gohman	eb9164dc50	Don't forward-declare registers for static allocas, which we'll prefer to materialize as local constants. This fixes the clang bootstrap abort. llvm-svn: 107840	2010-07-07 23:52:58 +00:00
Dan Gohman	1adc499dda	Fix -fast-isel-abort to check the right instruction. llvm-svn: 107839	2010-07-07 23:47:25 +00:00
Devang Patel	a37a95ea2f	One MDNode may be used to create regular DIE as well as abstract DIE. Keep track of abstract subprogram DIEs. llvm-svn: 107822	2010-07-07 22:20:57 +00:00
Evan Cheng	1c349f18f8	Move getExtLoad() and (some) getLoad() DebugLoc argument after EVT argument for consistency's sake. llvm-svn: 107820	2010-07-07 22:15:37 +00:00
Dan Gohman	25d5c1b4f8	Not all custom inserters create new basic blocks. If the inserter didn't create a new block, don't reset the insert position. llvm-svn: 107813	2010-07-07 21:18:22 +00:00
Devang Patel	9a0339fc1f	Rename couple of maps. llvm-svn: 107810	2010-07-07 20:49:57 +00:00
Devang Patel	30265c4f8b	80 cols. llvm-svn: 107807	2010-07-07 20:12:52 +00:00
Dan Gohman	e7ccc51cc1	Implement bottom-up fast-isel. This has the advantage of not requiring a separate DCE pass over MachineInstrs. llvm-svn: 107804	2010-07-07 19:20:32 +00:00
Dan Gohman	2d4d01d0de	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Dan Gohman	b792f844ad	Update the insert position after scheduling, which may change the position when emitting multiple blocks when executing a custom inserter. llvm-svn: 107797	2010-07-07 18:22:13 +00:00
Devang Patel	637ee5f149	Update comment. llvm-svn: 107796	2010-07-07 18:18:18 +00:00
Dan Gohman	769201448d	Fix debugging strings. llvm-svn: 107795	2010-07-07 17:28:45 +00:00
Dan Gohman	ffe64b1ee5	Give FunctionLoweringInfo an MBB member, avoiding the need to pass it around everywhere, and also give it an InsertPt member, to enable isel to operate at an arbitrary position within a block, rather than just appending to a block. llvm-svn: 107791	2010-07-07 16:47:08 +00:00
Dan Gohman	87fb4e8fcd	Simplify FastISel's constructor by giving it a FunctionLoweringInfo instance, rather than pointers to all of FunctionLoweringInfo's members. This eliminates an NDEBUG ABI sensitivity. llvm-svn: 107789	2010-07-07 16:29:44 +00:00
Dan Gohman	e784616fbb	Move FunctionLoweringInfo.h out into include/llvm/CodeGen. This will allow target-specific fast-isel code to make use of it directly. llvm-svn: 107787	2010-07-07 16:01:37 +00:00
Dan Gohman	fe7532a308	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Dan Gohman	498e5f899d	Move CallingConvLower.cpp out of the SelectionDAG directory. llvm-svn: 107781	2010-07-07 15:15:27 +00:00
Jakob Stoklund Olesen	8e1338eea8	Fix more places assuming subregisters have live intervals llvm-svn: 107780	2010-07-07 14:41:22 +00:00
Dan Gohman	88c547ede9	Add a getFirstNonPHI utility function. llvm-svn: 107778	2010-07-07 14:33:51 +00:00
Jakob Stoklund Olesen	f0e551d4f4	Revert "Remove references to INSERT_SUBREG after de-SSA" r107725. Buildbot breakage. llvm-svn: 107744	2010-07-07 00:32:25 +00:00
Jim Grosbach	dc0a0659be	By default, the eh.sjlj.setjmp/longjmp intrinsics should just do nothing rather than assuming a target will custom lower them. Targets which do so should explicitly mark them as having custom lowerings. PR7454. llvm-svn: 107734	2010-07-06 23:44:52 +00:00
Jakob Stoklund Olesen	e2d3067f6b	Remove references to INSERT_SUBREG after de-SSA llvm-svn: 107732	2010-07-06 23:40:35 +00:00
Jakob Stoklund Olesen	70ee3ecd33	Convert INSERT_SUBREG to COPY in TwoAddressInstructionPass. INSERT_SUBREG will now only appear in SSA machine instructions. Fix the handling of partial redefs in ProcessImplicitDefs. This is now relevant since partial redef COPY instructions appear. llvm-svn: 107726	2010-07-06 23:26:25 +00:00
Dan Gohman	ee0cb70381	CanLowerReturn doesn't need a SelectionDAG; it just needs an LLVMContext. SelectBasicBlock doesn't need its BasicBlock argument. llvm-svn: 107712	2010-07-06 22:19:37 +00:00
Devang Patel	a3ca21b228	Propagate debug loc. llvm-svn: 107710	2010-07-06 22:08:15 +00:00
Jakob Stoklund Olesen	15fed3bd30	One more case assuming that subregs have live ranges. llvm-svn: 107700	2010-07-06 21:13:03 +00:00
Jakob Stoklund Olesen	bcf3409107	Fix buildbot breakage where a def is missing. llvm-svn: 107698	2010-07-06 21:06:39 +00:00
Jakob Stoklund Olesen	a64c0a3d22	Be more forgiving when calculating alias interference for physreg coalescing. It is OK for an alias live range to overlap if there is a copy to or from the physical register. CoalescerPair can work out if the copy is coalescable independently of the alias. This means that we can join with the actual destination interval instead of using the getOrigDstReg() hack. It is no longer necessary to merge clobber ranges into subregisters. llvm-svn: 107695	2010-07-06 20:31:51 +00:00
Dan Gohman	3439629239	Reapply r107655 with fixes; insert the pseudo instruction into the block before calling the expansion hook. And don't put EFLAGS in a mbb's live-in list twice. llvm-svn: 107691	2010-07-06 20:24:04 +00:00
Eric Christopher	dfc8b745a2	Fix to 80-col. llvm-svn: 107684	2010-07-06 18:35:20 +00:00
Chris Lattner	dde2ba0b60	tighten up this code. llvm-svn: 107670	2010-07-06 15:59:27 +00:00
Dan Gohman	f4f04107ef	Revert r107655. llvm-svn: 107668	2010-07-06 15:49:48 +00:00
Dan Gohman	4e49b59dad	Add versions of OutputArgReg, AnalyzeReturn, and AnalyzeCallOperands which do not depend on SelectionDAG. llvm-svn: 107666	2010-07-06 15:39:54 +00:00
Anton Korobeynikov	e415230477	Fix a major regression on COFF targets introduced by r103267: 'discardable' section means that it is used only during the program load and can be discarded afterwards. This way only debug sections can be discarded, but not the opposite. Seems like the copy-and-pasto from ELF code, since there it contains the reverse flag ('alloc'). llvm-svn: 107658	2010-07-06 15:24:56 +00:00
Dan Gohman	12205645a6	Fix a bunch of custom-inserter functions to handle the case where the pseudo instruction is not at the end of the block. llvm-svn: 107655	2010-07-06 15:18:19 +00:00
Eric Christopher	2ad0c779c3	Fix up -fstack-protector on linux to use the segment registers. Split out testcases per architecture and os now. Patch from Nelson Elhage. llvm-svn: 107640	2010-07-06 05:18:56 +00:00
Chris Lattner	c4a7073db3	more tidying. llvm-svn: 107615	2010-07-05 05:53:14 +00:00
Chris Lattner	2c0315a0f3	random tidying llvm-svn: 107612	2010-07-05 05:36:21 +00:00
Jakob Stoklund Olesen	ac0a210789	Print symbolic subreg indices on REG_SEQUENCE and INSERT_SUBREG. llvm-svn: 107602	2010-07-04 23:24:23 +00:00
Evan Cheng	f3aeb2c22c	Infer alignments of fixed frame objects when they are constructed. This ensures remat'ed loads from fixed slots have the right alignments. llvm-svn: 107591	2010-07-04 18:52:05 +00:00
Bill Wendling	f844642350	Proper indentation. llvm-svn: 107581	2010-07-04 08:58:43 +00:00
Eric Christopher	128a0197bb	Fix typo. llvm-svn: 107556	2010-07-03 01:09:18 +00:00
Evan Cheng	0664a67fe1	Remove isSS argument from CreateFixedObject. Fixed objects cannot be spill slots so it's always false. llvm-svn: 107550	2010-07-03 00:40:23 +00:00
Jakob Stoklund Olesen	4c82a9e7d0	Detect and handle COPY in many places. This code is transitional, it will soon be possible to eliminate isExtractSubreg, isInsertSubreg, and isMoveInstr in most places. llvm-svn: 107547	2010-07-03 00:04:37 +00:00
Eric Christopher	5e5416056b	80-col fixup. llvm-svn: 107537	2010-07-02 23:17:38 +00:00
Jakob Stoklund Olesen	676a15bdf5	Add a new target independent COPY instruction and code to lower it. The COPY instruction is intended to replace the target specific copy instructions for virtual registers as well as the EXTRACT_SUBREG and INSERT_SUBREG instructions in MachineFunctions. It won't be used in a selection DAG. COPY is lowered to native register copies by LowerSubregs. llvm-svn: 107529	2010-07-02 22:29:50 +00:00
Jim Grosbach	3c43248560	Custom inserters (e.g., conditional moves in Thumb1) can introduce new basic blocks, and if used as a function argument, that can cause call frame setup / destroy pairs to be split across a basic block boundary. That prevents us from doing a simple assertion to check that the pairs match and alloc/dealloc the same amount of space. Modify the assertion to only check the amount allocated when there are matching pairs in the same basic block. rdar://8022442 llvm-svn: 107517	2010-07-02 21:23:37 +00:00
Evan Cheng	0ce84486c3	- Two-address pass should not assume unfolding is always successful. - X86 unfolding should check if the instruction being unfolded has memoperands. If there are no memoperands, then it must assume conservative alignment. If this would introduce an expensive sse unaligned load / store, then unfoldMemoryOperand etc. should not unfold the instruction. llvm-svn: 107509	2010-07-02 20:36:18 +00:00
Dale Johannesen	4d887f7ca7	Propagate the AlignStack bit in InlineAsm's to the PrologEpilog code, and use it to determine whether the asm forces stack alignment or not. gcc consistently does not do this for GCC-style asms; Apple gcc inconsistently sometimes does it for asm blocks. There is no convenient place to put a bit in either the SDNode or the MachineInstr form, so I've added an extra operand to each; unlovely, but it does allow for expansion for more bits, should we need it. PR 5125. Some existing testcases are affected. The operand lists of the SDNode and MachineInstr forms are indexed with awesome mnemonics, like "2"; I may fix this someday, but not now. I'm not making it any worse. If anyone is inspired I think you can find all the right places from this patch. llvm-svn: 107506	2010-07-02 20:16:09 +00:00
Jakob Stoklund Olesen	df8429aeb4	Remove invalid assert llvm-svn: 107505	2010-07-02 19:54:47 +00:00
Jakob Stoklund Olesen	cf6c5c960f	Properly handle debug values during inline spilling. llvm-svn: 107503	2010-07-02 19:54:40 +00:00
Jakob Stoklund Olesen	96037187e5	Rematerialize as much as possible before inserting spills and reloads. This allows us to recognize the common case where all uses could be rematerialized, and no stack slot allocation is necessary. If some values could be fully rematerialized, remove them from the live range before allocating a stack slot for the rest. llvm-svn: 107492	2010-07-02 17:44:57 +00:00
Jim Grosbach	9b7755fbc6	80-column and trailing whitespace cleanup. llvm-svn: 107490	2010-07-02 17:41:59 +00:00
Jim Grosbach	64a4f3f062	grammar tweaks llvm-svn: 107489	2010-07-02 17:38:34 +00:00
Dan Gohman	93f5920914	Rename CreateReg to CreateRegs, and MakeReg to CreateReg. llvm-svn: 107451	2010-07-02 00:10:16 +00:00
Bill Wendling	504055ce9e	Make the "linker_private" linkage type emit a non-weak symbol to the file. It will still be stripped by the linker when it generates the final image. llvm-svn: 107440	2010-07-01 22:38:24 +00:00
Bill Wendling	03bcd6ecc8	Implement the "linker_private_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. However, this linkage isn't specific to Objective-C. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". Currently only supported on Darwin platforms. llvm-svn: 107433	2010-07-01 21:55:59 +00:00
Devang Patel	429397529a	Do not require line number entry for undefined local variable. This is a regression caused by r106792 and caught by gdb testsuite. llvm-svn: 107430	2010-07-01 21:38:08 +00:00
Daniel Dunbar	02877d6e85	MC: Pass the target instance to the AsmParser constructor. llvm-svn: 107426	2010-07-01 20:41:56 +00:00
Daniel Dunbar	329d202362	MC: Move COFF enumeration constants to llvm/Support/COFF.h, patch by Michael Spencer! llvm-svn: 107418	2010-07-01 20:07:24 +00:00
Dan Gohman	d2965c10a1	Temporarily disable on-demand fast-isel. llvm-svn: 107393	2010-07-01 12:15:30 +00:00
Dan Gohman	42b7ee15f5	Use FuncInfo's isExportedInst accessor method instead of doing the work manually. llvm-svn: 107384	2010-07-01 03:57:05 +00:00
Dan Gohman	85e02e9340	Rename CreateRegForValue to CreateReg, and change its argument from a Value to a Type, because it doesn't actually care about the Value. llvm-svn: 107383	2010-07-01 03:55:39 +00:00
Dan Gohman	4d29fd85f9	Fast isel no longer needs DeadMachineInstrElim to clean up after it. llvm-svn: 107381	2010-07-01 03:49:59 +00:00
Dan Gohman	aef3d140b7	Teach fast-isel to avoid loading a value from memory when it's already available in a register. This is pretty primitive, but it reduces the number of instructions in common testcases by 4%. llvm-svn: 107380	2010-07-01 03:49:38 +00:00
Dan Gohman	722f5fc567	Enable on-demand fast-isel. llvm-svn: 107377	2010-07-01 02:58:57 +00:00
Dan Gohman	d432223163	Reapply r106422, splitting the code for materializing a value out of SelectionDAGBuilder::getValue into a helper function, with fixes to use DenseMaps safely. llvm-svn: 107371	2010-07-01 01:59:43 +00:00
Dan Gohman	9576645a84	Don't use operator[] here, because it's not desirable to insert a default value if the search fails. llvm-svn: 107368	2010-07-01 01:33:21 +00:00
Mikhail Glushenkov	4721ad855e	Trailing whitespace. llvm-svn: 107360	2010-07-01 01:00:22 +00:00
Jakob Stoklund Olesen	8656a4549a	Add memory operand folding support to InlineSpiller. llvm-svn: 107355	2010-07-01 00:13:04 +00:00
Jakob Stoklund Olesen	bde96ad23e	Add support for rematerialization to InlineSpiller. llvm-svn: 107351	2010-06-30 23:03:52 +00:00
Bill Wendling	e0dfb98ea0	Use the catch-all selectors we already found when converting them to use the correct catch-all value. This saves having to iterate through all of the selectors in the program again. llvm-svn: 107345	2010-06-30 22:49:53 +00:00
Jim Grosbach	e8c97a7cd7	Handle array and vector typed parameters in sjljehprepare like we do structs. rdar://8145832 llvm-svn: 107332	2010-06-30 22:20:38 +00:00
Jim Grosbach	caf9b3ab7d	grammar tweak in comment. llvm-svn: 107321	2010-06-30 21:27:56 +00:00
Jakob Stoklund Olesen	59e1cae377	Some fool committed without testing (or even building) first. llvm-svn: 107307	2010-06-30 18:41:20 +00:00
Jakob Stoklund Olesen	c39d3497c8	Remember to track spill slot uses in VirtRegMap when inserting loads and stores. LocalRewriter::runOnMachineFunction uses this information to mark dead spill slots. This means that InlineSpiller now also works for functions that spill. llvm-svn: 107302	2010-06-30 18:19:08 +00:00
Duncan Sands	945a347478	Remove an unused variable. The call to getRoot has side-effects, so this could break something (but doesn't seem to). llvm-svn: 107295	2010-06-30 17:22:28 +00:00
Gabor Greif	647d9c9797	use ArgOperand API llvm-svn: 107282	2010-06-30 13:45:50 +00:00
Gabor Greif	f69acfe133	use ArgOperand API llvm-svn: 107279	2010-06-30 12:55:46 +00:00
Gabor Greif	3390e746fa	use CallSite::arg_end instead of CallInst::op_end llvm-svn: 107276	2010-06-30 12:39:23 +00:00
John Mosby	5364655e02	Remove trailing whitespace, no functionality changes. llvm-svn: 107244	2010-06-30 03:40:54 +00:00
Devang Patel	c5b3109bec	Do not construct DIE for already processed MDNode. llvm-svn: 107237	2010-06-30 01:40:11 +00:00
Jakob Stoklund Olesen	b3b89c3bc0	Use skipInstruction() as a simpler way of iterating over instructions using SrcReg llvm-svn: 107234	2010-06-30 00:30:36 +00:00
Jakob Stoklund Olesen	08baf59da1	Use clEnumValN macro to work around keyword clash llvm-svn: 107233	2010-06-30 00:24:51 +00:00
Devang Patel	648df7bf64	Add variables into a scope before constructing scope DIE, otherwise variables won't be included in the DIE tree. llvm-svn: 107228	2010-06-30 00:11:08 +00:00
Jakob Stoklund Olesen	f888911932	Begin implementation of an inline spiller. InlineSpiller inserts loads and spills immediately instead of deferring to VirtRegMap. This is possible now because SlotIndexes allows instructions to be inserted and renumbered. This is work in progress, and is mostly a copy of TrivialSpiller so far. It works very well for functions that don't require spilling. llvm-svn: 107227	2010-06-29 23:58:39 +00:00
Bill Wendling	3632171750	Revert r107205 and r107207. llvm-svn: 107215	2010-06-29 22:34:52 +00:00
Devang Patel	be30551600	Print InlinedAt location. llvm-svn: 107214	2010-06-29 22:29:15 +00:00
Devang Patel	c728518bfe	Print InlinedAt location. llvm-svn: 107208	2010-06-29 21:51:32 +00:00
Bill Wendling	1767723dbe	Introducing the "linker_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". llvm-svn: 107205	2010-06-29 21:24:00 +00:00
Devang Patel	24bc1b5b2f	Do not hardcode DW_AT_stmt_list value. Inspired by Artur Pietrek. llvm-svn: 107202	2010-06-29 20:17:53 +00:00
Jakob Stoklund Olesen	dadea5b178	Fix the handling of partial redefines in the fast register allocator. A partial redefine needs to be treated like a tied operand, and the register must be reloaded while processing use operands. This fixes a bug where partially redefined registers were processed as normal defs with a reload added. The reload could clobber another use operand if it was a kill that allowed register reuse. llvm-svn: 107193	2010-06-29 19:15:30 +00:00
Bob Wilson	d91d5bfc95	Fix a register scavenger crash when dealing with undefined subregs. The LowerSubregs pass needs to preserve implicit def operands attached to EXTRACT_SUBREG instructions when it replaces those instructions with copies. llvm-svn: 107189	2010-06-29 18:42:49 +00:00
Duncan Sands	83d1dd637a	It seems clear that this should return Changed. llvm-svn: 107141	2010-06-29 14:49:35 +00:00
Rafael Espindola	38a7d7cbc3	Add a VT argument to getMinimalPhysRegClass and replace the copy related uses of getPhysicalRegisterRegClass with it. If we want to make a copy (or estimate its cost), it is better to use the smallest class as more efficient operations might be possible. llvm-svn: 107140	2010-06-29 14:02:34 +00:00
Duncan Sands	d34bb4e9b0	getMachineBasicBlockAddress returns a uintptr_t - don't truncate to unsigned only to extend back to a pointer sized value on the next line. llvm-svn: 107139	2010-06-29 13:34:20 +00:00
Gabor Greif	e73d64c2cf	use ArgOperand APIs llvm-svn: 107132	2010-06-29 13:03:46 +00:00
Duncan Sands	6d28e73acc	Remove initialized but otherwise unused variables. llvm-svn: 107127	2010-06-29 11:22:26 +00:00
Jim Grosbach	907673c48d	When processing loops for scheduling latencies (used for live outs on loop back-edges), make sure not to include dbg_value instructions in the count. Closing in on the end of rdar://7797940 llvm-svn: 107119	2010-06-29 04:48:13 +00:00
Bob Wilson	1e5da550e5	Reapply my if-conversion cleanup from svn r106939 with fixes. There are 2 changes relative to the previous version of the patch: 1) For the "simple" if-conversion case, there's no need to worry about RemoveExtraEdges not handling an unanalyzable branch. Predicated terminators are ignored in this context, so RemoveExtraEdges does the right thing. This might break someday if we ever treat indirect branches (BRIND) as predicable, but for now, I just removed this part of the patch, because in the case where we do not add an unconditional branch, we rely on keeping the fall-through edge to CvtBBI (which is empty after this transformation). The change relative to the previous patch is: @@ -1036,10 +1036,6 @@ IterIfcvt = false; } - // RemoveExtraEdges won't work if the block has an unanalyzable branch, - // which is typically the case for IfConvertSimple, so explicitly remove - // CvtBBI as a successor. - BBI.BB->removeSuccessor(CvtBBI->BB); RemoveExtraEdges(BBI); // Update block info. BB can be iteratively if-converted. 2) My patch exposed a bug in the code for merging the tail of a "diamond", which had previously never been exercised. The code was simply checking that the tail had a single predecessor, but there was a case in MultiSource/Benchmarks/VersaBench/dbms where that single predecessor was neither edge of the diamond. I added the following change to check for that: @@ -1276,7 +1276,18 @@ // tail, add a unconditional branch to it. if (TailBB) { BBInfo TailBBI = BBAnalysis[TailBB->getNumber()]; - if (TailBB->pred_size() == 1 && !TailBBI.HasFallThrough) { + bool CanMergeTail = !TailBBI.HasFallThrough; + // There may still be a fall-through edge from BBI1 or BBI2 to TailBB; + // check if there are any other predecessors besides those. 
+ unsigned NumPreds = TailBB->pred_size(); + if (NumPreds > 1) + CanMergeTail = false; + else if (NumPreds == 1 && CanMergeTail) { + MachineBasicBlock::pred_iterator PI = TailBB->pred_begin(); + if (PI != BBI1->BB && PI != BBI2->BB) + CanMergeTail = false; + } + if (CanMergeTail) { MergeBlocks(BBI, TailBBI); TailBBI.IsDone = true; } else { With these fixes, I was able to run all the SingleSource and MultiSource tests successfully. llvm-svn: 107110	2010-06-29 00:55:23 +00:00
Bob Wilson	269a89fd3a	Unlike other targets, ARM now uses BUILD_VECTORs post-legalization so they can't be changed arbitrarily by the DAGCombiner without checking if it is running after legalization. llvm-svn: 107097	2010-06-28 23:40:25 +00:00
Devang Patel	1de21ec498	Use DW_FORM_addr for DW_AT_entry_pc. llvm-svn: 107085	2010-06-28 22:22:47 +00:00
Dale Johannesen	17feb07c53	In asm's, output operands with matching input constraints have to be registers, per gcc documentation. This affects the logic for determining what "g" should lower to. PR 7393. A couple of existing testcases are affected. llvm-svn: 107079	2010-06-28 22:09:45 +00:00
Devang Patel	d10b2af260	Include inlined function in list of processed subprograms. llvm-svn: 107065	2010-06-28 20:53:04 +00:00
Jim Grosbach	ee6e29aa72	new, no longer brain-dead, r106907 llvm-svn: 107060	2010-06-28 20:26:00 +00:00
Jakob Stoklund Olesen	ffd628ec0a	After physreg coalescing, physical registers might not have live ranges where you would expect. Don't assert on that case, just give up. This fixes PR7513. llvm-svn: 107046	2010-06-28 19:39:57 +00:00
Jakob Stoklund Olesen	0d94d7af78	Add more special treatment for inline asm in RegAllocFast. When an instruction has tied operands and physreg defines, we must take extra care that the tied operands conflict with neither physreg defs nor uses. The special treatment is given to inline asm and instructions with tied operands / early clobbers and physreg defines. This fixes PR7509. llvm-svn: 107043	2010-06-28 18:34:34 +00:00
Devang Patel	f3b2db68c6	Preserve deleted function's local variables' debug info. Radar 8122864. llvm-svn: 107027	2010-06-28 18:25:03 +00:00
Gabor Greif	cd09869dfc	simplify: we have solid argument iterator range llvm-svn: 107014	2010-06-28 16:40:52 +00:00
Daniel Dunbar	b8c058cbb0	Revert r106907, "make sure to handle dbg_value instructions in the middle of the block, not...", it caused a bunch of nightly test regressions. llvm-svn: 107009	2010-06-28 15:47:17 +00:00
Devang Patel	fb6f22f010	Remove dead code. llvm-svn: 106990	2010-06-28 05:59:13 +00:00
Rafael Espindola	2041abd958	When splitting a VAARG, remember its alignment. This produces terrible but correct code. llvm-svn: 106952	2010-06-26 18:22:20 +00:00
Bob Wilson	418e64a385	Revert my if-conversion cleanup since it caused a bunch of nightly test regressions. --- Reverse-merging r106939 into '.': U test/CodeGen/Thumb2/thumb2-ifcvt3.ll U lib/CodeGen/IfConversion.cpp llvm-svn: 106951	2010-06-26 17:47:06 +00:00
Benjamin Kramer	a000002428	VNInfos don't need to be destructed anymore. llvm-svn: 106943	2010-06-26 11:30:59 +00:00
Bob Wilson	c72da6bb56	Clean up some problems with extra CFG edges being introduced during if-conversion. The RemoveExtraEdges function doesn't work for blocks that end with unanalyzable branches, so in those cases, the "extra" edges must be explicitly removed. The CopyAndPredicateBlock and MergeBlocks methods can also avoid copying successor edges due to branches that have already been removed. The latter case is especially helpful when MergeBlocks is called for handling "diamond" if-conversions, where otherwise you can end up with some weird intermediate states in the CFG. Unfortunately I've been unable to find cases where this cleanup actually makes a significant difference in the code. There is one test where we manage to remove an empty block at the end of a function. Radar 6911268. llvm-svn: 106939	2010-06-26 04:27:33 +00:00
Jim Grosbach	c34befc78f	make sure to handle dbg_value instructions in the middle of the block, not just at the head, when doing diamond if-conversion. rdar://7797940 llvm-svn: 106907	2010-06-25 23:05:46 +00:00
Jakob Stoklund Olesen	55d738e2e1	Don't track kills in VNInfo. Use interval ends instead. The VNInfo.kills vector was almost unused except for all the code keeping it updated. The few places using it were easily rewritten to check for interval ends instead. The two new methods LiveInterval::killedAt and killedInRange are replacements. This brings us down to 3 independent data structures tracking kills. llvm-svn: 106905	2010-06-25 22:53:05 +00:00
Evan Cheng	02b184de5b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Devang Patel	5c0f85c7dd	Collect debug info for optimized variables of inlined functions. llvm-svn: 106895	2010-06-25 22:07:34 +00:00
Jim Grosbach	8a6deefec6	80 column and typo fix llvm-svn: 106894	2010-06-25 22:02:28 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Bill Wendling	e41e40f689	- Reapply r106066 now that the bzip2 build regression has been fixed. - 2010-06-25-CoalescerSubRegDefDead.ll is the testcase for r106878. llvm-svn: 106880	2010-06-25 20:48:10 +00:00
Bill Wendling	ef7acd9a24	We should remove the live range from the destination register only if all defs are dead, not just the def of this register. I.e., a register could be dead, but its subreg isn't. Testcase to follow with a subsequent patch. llvm-svn: 106878	2010-06-25 20:42:55 +00:00
Dale Johannesen	2ac3b9cbd4	Cosmetic. llvm-svn: 106865	2010-06-25 17:41:07 +00:00
Duncan Sands	2dc70bea54	Remove variables which are assigned to but for which the value is not used. Spotted by gcc-4.6. llvm-svn: 106854	2010-06-25 14:48:39 +00:00
Gabor Greif	b890fc8023	use ArgOperand accessors and CallInst for getting hold of the intrinsic's arguments; simplify along the way (at least for me this is much more legible now). Bill, Baldrick or Anton, please review! llvm-svn: 106838	2010-06-25 11:25:30 +00:00
Gabor Greif	7dd3afdff3	use ArgOperand API (the simple part) llvm-svn: 106837	2010-06-25 09:44:37 +00:00
Gabor Greif	eba0be7dc9	use ArgOperand API llvm-svn: 106836	2010-06-25 09:38:13 +00:00
Gabor Greif	41b81ee2fb	use ArgOperand API llvm-svn: 106835	2010-06-25 09:36:23 +00:00
Gabor Greif	ed9ae7bf21	use ArgOperand API and CallSite to access arguments of CallInst llvm-svn: 106833	2010-06-25 09:03:52 +00:00
Gabor Greif	b5874dea6e	use ArgOperand API and CallSite to access arguments of CallInst llvm-svn: 106829	2010-06-25 08:48:19 +00:00
Gabor Greif	e4eed709d4	use ArgOperand API llvm-svn: 106828	2010-06-25 08:24:59 +00:00
Gabor Greif	f6207e0a80	prune an include llvm-svn: 106827	2010-06-25 08:16:50 +00:00
Dale Johannesen	e9eaaa91d8	Fix a case where an earlyclobber operand of an asm is reused as an input. PR 4118. Testcase is too big, as usual with bugs in this area, but there's one in the PR. llvm-svn: 106816	2010-06-25 00:49:43 +00:00
Jakob Stoklund Olesen	889ab7d158	Make sure all eliminated kills are removed from VNInfo lists. This fixes PR7479 and PR7485. The test cases from those PRs are big, so not included. However, PR7485 comes from self hosting on FreeBSD, so we will surely hear about any regression. llvm-svn: 106811	2010-06-24 23:57:35 +00:00
Dan Gohman	5f0bf64c0c	Add some comments. llvm-svn: 106809	2010-06-24 23:41:59 +00:00
Dan Gohman	9a2f0473b2	Teach EmitLiveInCopies to omit copies for unused virtual registers, and to clean up unused incoming physregs from the live-in list. llvm-svn: 106805	2010-06-24 22:23:02 +00:00
Bill Wendling	2d3c490026	It's possible that a flag is added to the SDNode that points back to the original SDNode. This is badness. Also, this function allows one SDNode to point multiple flags to another SDNode. Badness as well. llvm-svn: 106793	2010-06-24 22:00:37 +00:00
Devang Patel	c657c621b7	A DBG_VALUE machine instruction pointing to an undefined register for a variable justifies a separate scope if the variable is an inlined function's argument. Radar 8122864. llvm-svn: 106792	2010-06-24 21:51:19 +00:00
Jakob Stoklund Olesen	2b87d44c5d	Don't return a std::vector in the Spiller interface, but take a reference to a vector instead. This avoids needless copying and allocation. Add documentation. llvm-svn: 106788	2010-06-24 20:54:29 +00:00
Jakob Stoklund Olesen	9b659142a6	Remove the now unused LiveIntervals::getVNInfoSourceReg(). This method was always a bit too simplistic for the real world. It didn't really deal with subregisters and such. llvm-svn: 106781	2010-06-24 20:18:15 +00:00
Jakob Stoklund Olesen	487ed997d0	Teach AdjustCopiesBackFrom to also use CoalescerPair to identify compatible copies. llvm-svn: 106780	2010-06-24 20:16:00 +00:00
Jakob Stoklund Olesen	7f894d8fdc	Remove the -fast-spill option. This code path has never really been used, and we are going to be handling spilling through the Spiller interface in the future. llvm-svn: 106777	2010-06-24 19:56:08 +00:00
Bill Wendling	3f0e992af1	Loosen up the requirements in the Horrible Hack(tm) to include all selectors which don't have a catch-all associated with them not just clean-ups. This fixes the SingleSource/Benchmarks/Shootout-C++/except.cpp testcase that broke because of my change r105902. llvm-svn: 106772	2010-06-24 18:49:10 +00:00
Jakob Stoklund Olesen	45230239e4	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. This second attempt fixes some crashes that only occurred on Linux. llvm-svn: 106769	2010-06-24 18:15:01 +00:00
Jakob Stoklund Olesen	a612d7c012	Print the LSBs of a SlotIndex symbolically using letters referring to the [L]oad, [u]se, [d]ef, or [S]tore slots. This makes it easier to see if two indices refer to the same instruction, avoiding mental mod 4 calculations. llvm-svn: 106766	2010-06-24 17:31:07 +00:00
Dan Gohman	8a84cd57ae	Simplify this code; switch lowering shouldn't produce cases which trivially fold away. llvm-svn: 106765	2010-06-24 17:08:31 +00:00
Jakob Stoklund Olesen	3b2b46a700	Be more strict about subreg-to-subreg copies in CoalescerPair. Also keep track of the original DstReg before subregister adjustments. llvm-svn: 106753	2010-06-24 16:19:28 +00:00
Jakob Stoklund Olesen	53ccab7d1c	Verify that VNI kills are pointing to existing instructions. In this case it is essential that the kill is real because the spiller will decide to omit a spill if it thinks there is a later kill. llvm-svn: 106751	2010-06-24 15:56:59 +00:00
Dan Gohman	463f26b4be	Eliminate the other half of the BRCOND optimization, and update as many tests as possible. llvm-svn: 106749	2010-06-24 15:24:03 +00:00
Dan Gohman	df6b33e778	Eliminate the first half of the optimization which eliminates BRCOND when the condition is constant. This optimization shouldn't be necessary, because codegen shouldn't be able to find dead control paths that the IR-level optimizer can't find. And it's undesirable, because it encourages bugpoint to leave "br i1 false" branches in its output. And it wasn't updating the CFG. I updated all the tests I could, but some tests are too reduced and I wasn't able to meaningfully preserve them. llvm-svn: 106748	2010-06-24 15:04:11 +00:00
Dan Gohman	600f62b3ba	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Dan Gohman	0695e09b09	Optimize the "bit test" code path for switch lowering in the case where the bit mask has exactly one bit. llvm-svn: 106716	2010-06-24 02:06:24 +00:00
Jakob Stoklund Olesen	dbb58d2974	Revert "Replace a big gob of old coalescer logic with the new CoalescerPair class." Whiny buildbots. llvm-svn: 106710	2010-06-24 00:52:22 +00:00
Jakob Stoklund Olesen	f38e6720cc	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. llvm-svn: 106701	2010-06-24 00:12:39 +00:00
Bill Wendling	a136521a17	MorphNodeTo doesn't preserve the memory operands. Because we're morphing a node into the same node, but with different non-memory operands, we need to replace the memory operands after it's finished morphing. llvm-svn: 106643	2010-06-23 18:16:24 +00:00
Daniel Dunbar	4df321b7ad	Revert r106263, "Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass,"... it was causing both 'file' (with clang) and 176.gcc (with llvm-gcc) to be miscompiled. llvm-svn: 106634	2010-06-23 17:09:26 +00:00
Jim Grosbach	b58c08b0ba	Some targets don't require the fencing MEMBARRIER instructions surrounding atomic intrinsics, either because they use locking instructions for the atomics, or because they perform the locking directly. Add support in the DAG combiner to fold away the fences. llvm-svn: 106630	2010-06-23 16:07:42 +00:00
Jakob Stoklund Olesen	731ea71f59	Add a few VNInfo data structure checks. llvm-svn: 106627	2010-06-23 15:34:36 +00:00
Daniel Dunbar	ef5a4383ad	Revert r106066, "Create a more targeted fix for not sinking instructions into a range where it"... it causes bzip2 to be miscompiled by Clang. Conflicts: lib/CodeGen/MachineSink.cpp llvm-svn: 106614	2010-06-23 00:48:25 +00:00
Jakob Stoklund Olesen	1023f6bd98	Also convert SUBREG_TO_REG to a KILL when relevant, like the other subreg instructions. This does not affect codegen much because SUBREG_TO_REG is only used by X86 and X86 does not use the register scavenger, but it prevents verifier errors. llvm-svn: 106583	2010-06-22 22:11:07 +00:00
Dan Gohman	3570f81b1e	Move PHIElimination's SplitCriticalEdge for MachineBasicBlocks out into a utility routine, teach it how to update MachineLoopInfo, and make use of it in MachineLICM to split critical edges on demand. llvm-svn: 106555	2010-06-22 17:25:57 +00:00
Jakob Stoklund Olesen	9c47dac677	Remove the SimpleJoin optimization from SimpleRegisterCoalescing. Measurements show that it does not speed up coalescing, so there is no reason to keep the added complexity around. Also clean out some unused methods and static functions. llvm-svn: 106548	2010-06-22 16:13:57 +00:00
Dan Gohman	d2d1ae105d	Use pre-increment instead of post-increment when the result is not used. llvm-svn: 106542	2010-06-22 15:08:57 +00:00
Dan Gohman	2370e2fe0f	When unfolding a load, avoid assuming which instruction that kill and dead flags will end up on. llvm-svn: 106520	2010-06-22 02:07:21 +00:00
Devang Patel	b6e058da18	Use single interface, using twine, to get named metadata. getNamedMetadata(). llvm-svn: 106518	2010-06-22 01:19:38 +00:00
Evan Cheng	37bb617f8a	Tail merging pass shall not break up IT blocks. rdar://8115404 llvm-svn: 106517	2010-06-22 01:18:16 +00:00
Devang Patel	cbc6fd8493	Discard special LLVM prefix from linkage name. llvm-svn: 106516	2010-06-22 01:06:05 +00:00
Devang Patel	ad51735794	Do not rely on Twine temporaries to survive. llvm-svn: 106515	2010-06-22 01:01:58 +00:00
Dan Gohman	851e478e6b	Fix the new load-unfolding code to update LiveVariable's dead flags, in addition to the kill flags. llvm-svn: 106512	2010-06-22 00:32:04 +00:00
Dan Gohman	3c1b3c61e9	Teach two-address lowering how to unfold a load to open up commuting opportunities. For example, this lets it emit this: movq (%rax), %rcx addq %rdx, %rcx instead of this: movq %rdx, %rcx addq (%rax), %rcx in the case where %rdx has subsequent uses. It's the same number of instructions, and usually the same encoding size on x86, but it appears faster, and in general, it may allow better scheduling for the load. llvm-svn: 106493	2010-06-21 22:17:20 +00:00
Dan Gohman	dd41bba517	Use A.append(...) instead of A.insert(A.end(), ...) when A is a SmallVector, and other SmallVector simplifications. llvm-svn: 106452	2010-06-21 19:47:52 +00:00
Dan Gohman	bbc29ea821	Revert r106422, which is breaking the non-fast-isel path. llvm-svn: 106423	2010-06-21 16:02:28 +00:00
Dan Gohman	f64fdd69d0	More changes for non-top-down fast-isel. Split the code for materializing a value out of SelectionDAGBuilder::getValue into a helper function, so that it can be used in other ways. Add a new getNonRegisterValue function which uses it, for use in code which doesn't want a CopyFromReg even when FuncMap.ValueMap already has an entry for it. llvm-svn: 106422	2010-06-21 15:13:54 +00:00
Dan Gohman	f91aff5f13	Do one lookup instead of two. llvm-svn: 106415	2010-06-21 14:21:47 +00:00
Dan Gohman	7c58cf75fa	Generalize this to look in the regular ValueMap in addition to the LocalValueMap, to make it more flexible when fast-isel isn't proceeding straight top-down. llvm-svn: 106414	2010-06-21 14:17:46 +00:00
Bob Wilson	4581434c27	Tidy. llvm-svn: 106383	2010-06-19 05:33:57 +00:00
Dan Gohman	8693650422	Teach regular and fast isel to set dead flags on unused implicit defs on calls and similar instructions. llvm-svn: 106353	2010-06-18 23:28:01 +00:00
Jakob Stoklund Olesen	678927e0b1	Only run CoalesceExtSubRegs when we can expect LiveIntervalAnalysis to clean up the inserted INSERT_SUBREGs after us. llvm-svn: 106345	2010-06-18 23:10:20 +00:00
Evan Cheng	2d51c7c592	Allow ARM if-converter to be run after post allocation scheduling. - This fixed a number of bugs in if-converter, tail merging, and post-allocation scheduler. If-converter now runs branch folding / tail merging first to maximize if-conversion opportunities. - Also changed the t2IT instruction slightly. It now defines the ITSTATE register which is read by instructions in the IT block. - Added Thumb2 specific hazard recognizer to ensure the scheduler doesn't change the instruction ordering in the IT block (since IT mask has been finalized). It also ensures no other instructions can be scheduled between instructions in the IT block. This is not yet enabled. llvm-svn: 106344	2010-06-18 23:09:54 +00:00
Jim Grosbach	a57c2885cf	back-end libcall handling for ATOMIC_SWAP (__sync_lock_test_and_set) llvm-svn: 106342	2010-06-18 23:03:10 +00:00
Jakob Stoklund Olesen	07f4fa8198	TwoAddressInstructionPass::CoalesceExtSubRegs can insert INSERT_SUBREG instructions, but it doesn't really understand live ranges, so the first INSERT_SUBREG uses an implicitly defined register. Fix it in LiveVariableAnalysis by adding the <undef> flag. llvm-svn: 106333	2010-06-18 22:29:44 +00:00
Evan Cheng	cf9e8a987f	Fix an inverted condition. llvm-svn: 106330	2010-06-18 22:17:13 +00:00
Evan Cheng	f5d62535a5	Fix cross initialization compilation error. llvm-svn: 106324	2010-06-18 22:01:37 +00:00
Evan Cheng	c0e0d85b18	Teach if-converter to properly count # of dups. It was not skipping over dbg_value's which resulted in non-duplicated instructions being deleted. rdar://8104384. llvm-svn: 106323	2010-06-18 21:52:57 +00:00
Jim Grosbach	d64dfc1568	Add Expand-to-libcall support for additional atomics. This covers the usual entries used by llvm-gcc. *_[U]MIN and such can be added later if needed. This enables the front ends to simplify handling of the atomic intrinsics by removing the target-specific decision about which targets can handle the intrinsics. llvm-svn: 106321	2010-06-18 21:43:38 +00:00
Dan Gohman	e5457c275d	Don't leak RegClass2VRegMap, which is now a new[] array instead of a std::vector. llvm-svn: 106298	2010-06-18 18:54:05 +00:00
Dan Gohman	882bb2984e	Start TargetRegisterClass indices at 0 instead of 1, so that MachineRegisterInfo doesn't have to confusingly allocate an extra entry. llvm-svn: 106296	2010-06-18 18:13:55 +00:00
Bob Wilson	f82c8fcc58	Fix PR7372: Conditional branches (at least on ARM) are treated as predicated, so when IfConverter::CopyAndPredicateBlock checks to see if it should ignore an instruction because it is a branch, it should not check if the branch is predicated. This case (when IgnoreBr is true) is only relevant from IfConvertTriangle, where new branches are inserted after the block has been copied and predicated. If the original branch is not removed, we end up with multiple conditional branches (possibly conflicting) at the end of the block. Aside from any immediate errors resulting from that, this confuses the AnalyzeBranch functions so that the branches are not analyzable. That in turn causes the IfConverter to think that the "Simple" pattern can be applied, and things go downhill fast because the "Simple" pattern does _not_ apply if the block can fall through. This is pretty fragile. If there are other degenerate cases where AnalyzeBranch fails, but where the block may still fall through, the IfConverter should not perform its "Simple" if-conversion. But, I don't know how to do that with the current AnalyzeBranch interface, so for now, the best thing seems to be to avoid creating branches that AnalyzeBranch cannot handle. Evan, please review! llvm-svn: 106291	2010-06-18 17:07:23 +00:00
Dan Gohman	9f58b3e106	Don't bother calling releaseMemory before destroying the DominatorTreeBase. llvm-svn: 106287	2010-06-18 16:09:11 +00:00
Dan Gohman	7edb39cc6b	Minor code simplifications. llvm-svn: 106286	2010-06-18 16:00:29 +00:00
Dan Gohman	6e681a5fbe	Give NamedRegionTimer an Enabled flag, allowing all its clients to switch from this: if (TimePassesIsEnabled) { NamedRegionTimer T(Name, GroupName); do_something(); } else { do_something(); // duplicate the code, this time without a timer! } to this: { NamedRegionTimer T(Name, GroupName, TimePassesIsEnabled); do_something(); } llvm-svn: 106285	2010-06-18 15:56:31 +00:00
Dan Gohman	96ca25eba5	Don't replace the old Ordering object with a new one; just clear() the old one. llvm-svn: 106284	2010-06-18 15:40:58 +00:00
Dan Gohman	a4f46b3ef8	Don't call clear() on DbgInfo when it's going to be deleted anyway. Don't replace the old DbgInfo with a new one when clear() on the old one is sufficient. llvm-svn: 106283	2010-06-18 15:36:18 +00:00
Dan Gohman	92c11acdb8	Change UpdateNodeOperands' operand and return value from SDValue to SDNode *, since it doesn't care about the ResNo value. llvm-svn: 106282	2010-06-18 15:30:29 +00:00
Dan Gohman	f1d8304fe3	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Dan Gohman	35b6f9a929	isValueValidForType can be a static member function. llvm-svn: 106278	2010-06-18 14:01:07 +00:00
Dan Gohman	b92156d5e4	Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass, which is faster, simpler, and less surprising. llvm-svn: 106263	2010-06-18 01:05:21 +00:00
Dan Gohman	0883789ec4	Handle ext(ext(x)) -> ext(x) immediately, since it's simple. llvm-svn: 106256	2010-06-18 00:08:30 +00:00
Stuart Hastings	0125b6410a	Add a DebugLoc parameter to TargetInstrInfo::InsertBranch(). This addresses a longstanding deficiency noted in many FIXMEs scattered across all the targets. This effectively moves the problem up one level, replacing eleven FIXMEs in the targets with eight FIXMEs in CodeGen, plus one path through FastISel where we actually supply a DebugLoc, fixing Radar 7421831. llvm-svn: 106243	2010-06-17 22:43:56 +00:00
Jim Grosbach	0ed5b460dc	add missing break. inconsequential as the code shouldn't be reached, but for correctness' sake, it should be there. llvm-svn: 106229	2010-06-17 17:58:54 +00:00
Jim Grosbach	3aeae8aeeb	Add entries for Expanding atomic intrinsics to libcalls. Just a placeholder for the moment. The implementation of the libcall will follow. Currently, the llvm-gcc knows when the intrinsics can be correctly handled by the back end and only generates them in those cases, issuing libcalls directly otherwise. That's too much coupling. The intrinsics should always be generated and the back end decide how to handle them, be it with a libcall, inline code, or whatever. This patch is a step in that direction. rdar://8097623 llvm-svn: 106227	2010-06-17 17:50:54 +00:00
Jim Grosbach	ba451e80dc	ISD::MEMBARRIER should lower to a libcall (__sync_synchronize) if the target sets the legalize action to Expand. llvm-svn: 106203	2010-06-17 02:00:53 +00:00
Jakob Stoklund Olesen	207cd4bbd7	Allow a register to be redefined multiple times in a basic block. LiveVariableAnalysis was a bit picky about a register only being redefined once, but that really isn't necessary. Here is an example of chained INSERT_SUBREGs that we can handle now: 68 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1028<kill>, 14 register: %reg1040 +[70,134:0) 76 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1029<kill>, 13 register: %reg1040 replace range with [70,78:1) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,134:0) 0@78-(134) 1@70-(78) 84 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1030<kill>, 12 register: %reg1040 replace range with [78,86:2) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,134:0) 0@86-(134) 1@70-(78) 2@78-(86) 92 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1031<kill>, 11 register: %reg1040 replace range with [86,94:3) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,94:3)[94,134:0) 0@94-(134) 1@70-(78) 2@78-(86) 3@86-(94) rdar://problem/8096390 llvm-svn: 106152	2010-06-16 21:29:40 +00:00
Jim Grosbach	6c0da25129	add FIXME llvm-svn: 106126	2010-06-16 18:45:08 +00:00
Bill Wendling	d71bd63600	Improve comment to include that the use of a preg is also verboten in this situation. llvm-svn: 106119	2010-06-16 18:01:31 +00:00
Evan Cheng	f128bdcb55	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Devang Patel	a6d20f446f	Use separate named MDNode to hold each function's local variable info. This speeds up local variable handling in DwarfDebug. llvm-svn: 106075	2010-06-16 00:53:55 +00:00
Eric Christopher	b672ab9b53	Don't emit the linkage for initializer label for mach-o tls. llvm-svn: 106073	2010-06-16 00:27:30 +00:00
Bill Wendling	8c0cf0994d	Create a more targeted fix for not sinking instructions into a range where it will conflict with another live range. The place which creates this scenario is the code in X86 that lowers a select instruction by splitting the MBBs. This eliminates the need to check from the bottom up in an MBB for live pregs. llvm-svn: 106066	2010-06-15 23:46:31 +00:00
Stuart Hastings	9b5005cd4b	Added a comment. llvm-svn: 106063	2010-06-15 23:06:30 +00:00
Bob Wilson	8105144fcd	Fix 80col violations, remove trailing whitespace, and clarify a comment. llvm-svn: 106057	2010-06-15 22:18:54 +00:00
Jakob Stoklund Olesen	ec2e964fd6	Remove the local register allocator. Please use the fast allocator instead. llvm-svn: 106051	2010-06-15 21:58:33 +00:00
Mon P Wang	7a84689cc5	Fixed vector widening of binary instructions that can trap. Patch by Visa Putkinen! llvm-svn: 106038	2010-06-15 20:29:05 +00:00
Bob Wilson	fc7d739422	IfConversion's AnalyzeBlocks method always returns false; clean it up. llvm-svn: 106027	2010-06-15 18:57:15 +00:00
Jim Grosbach	c964585ff8	fix naming llvm-svn: 106024	2010-06-15 18:53:34 +00:00
Jakob Stoklund Olesen	6e54c908e0	Fix an exotic bug that only showed up in an internal test case. SimpleRegisterCoalescing::JoinIntervals() uses CoalescerPair to determine if a copy is coalescable, and in very rare cases it can return true where LHS is not live - the coalescable copy can come from an alias of the physreg in LHS. llvm-svn: 106021	2010-06-15 18:49:14 +00:00
Bob Wilson	5947573f39	Fix a comment typo. llvm-svn: 106015	2010-06-15 18:19:27 +00:00
Bob Wilson	de94e66234	Add some missing checks for the case where the extract_subregs are combined to an insert_subreg, i.e., where the destination register is larger than the source. We need to check that the subregs can be composed for that case in a symmetrical way to the case when the destination is smaller. llvm-svn: 106004	2010-06-15 17:27:54 +00:00
Jakob Stoklund Olesen	246e9a07a2	Avoid processing early clobbers twice in RegAllocFast. Early clobbers defining a virtual register were first allocated to a physreg and then processed as a physreg EC, spilling the virtreg. This fixes PR7382. llvm-svn: 105998	2010-06-15 16:20:57 +00:00
Jakob Stoklund Olesen	82eca35b3e	Add CoalescerPair helper class. Given a copy instruction, CoalescerPair can determine which registers to coalesce in order to eliminate the copy. It deals with all the subreg fun to determine a tuple (DstReg, SrcReg, SubIdx) such that: - SrcReg is a virtual register that will disappear after coalescing. - DstReg is a virtual or physical register whose live range will be extended. - SubIdx is 0 when DstReg is a physical register. - SrcReg can be joined with DstReg:SubIdx. CoalescerPair::isCoalescable() determines if another copy instruction is compatible with the same tuple. This fixes some NEON miscompilations where shuffles are getting coalesced as if they were copies. The CoalescerPair class will replace a lot of the spaghetti logic in JoinCopy later. llvm-svn: 105997	2010-06-15 16:04:21 +00:00
Bob Wilson	a55b8877e6	Generalize the pre-coalescing of extract_subregs feeding reg_sequences, replacing the overly conservative checks that I had introduced recently to deal with correctness issues. This makes a pretty noticeable difference in our testcases where reg_sequences are used. I've updated one test to check that we no longer emit the unnecessary subreg moves. llvm-svn: 105991	2010-06-15 05:56:31 +00:00
Ted Kremenek	d52caa5244	Update CMake build. llvm-svn: 105987	2010-06-15 04:08:14 +00:00
Jim Grosbach	412800d346	More dbg_value cleanup so the presence of debug info doesn't affect code-gen. Make sure to skip the dbg_value instructions when moving dups out of the diamond. rdar://7797940 llvm-svn: 105965	2010-06-14 21:30:32 +00:00
Evan Cheng	078f4cec21	- Do away with SimpleHazardRecognizer.h. It's not used and offers little value. - Rename ExactHazardRecognizer to PostRAHazardRecognizer and move its header to include to allow targets to extend it. llvm-svn: 105959	2010-06-14 21:06:53 +00:00
Evan Cheng	a397ada078	Avoid unnecessary array copying. llvm-svn: 105955	2010-06-14 20:18:40 +00:00
Chris Lattner	0fc88efda3	fix a -Wbool-conversions warning from clang. llvm-svn: 105942	2010-06-14 18:28:34 +00:00
Bill Wendling	5d6103318a	When performing the Horrible Hack(tm-Duncan) on the EH code to convert a clean-up to a catch-all after inlining, take into account that there could be filter IDs as well. The presence of filters doesn't mean that the selector catches anything. It's just metadata information. llvm-svn: 105872	2010-06-12 02:34:29 +00:00
Evan Cheng	e60273fd70	Allow target to provide its own hazard recognizer to post-ra scheduler. llvm-svn: 105862	2010-06-12 00:12:18 +00:00
Evan Cheng	cb1fe56fd9	Code formatting. llvm-svn: 105861	2010-06-12 00:11:53 +00:00
Stuart Hastings	afe54f1625	Support for nested functions/classes in debug output. (Again.) Radar 7424645. llvm-svn: 105828	2010-06-11 20:08:44 +00:00
Evan Cheng	38f6560461	Code refactoring, no functionality changes. llvm-svn: 105775	2010-06-10 02:09:31 +00:00
Jakob Stoklund Olesen	8bc5eca331	Mark physregs defined by inline asm as implicit. This is a bit of a hack to make inline asm look more like call instructions. It would be better to produce correct dead flags during isel. llvm-svn: 105749	2010-06-09 20:05:00 +00:00
Evan Cheng	a0746bd50a	Allow target to place 2-address pass inserted copies in better spots. Thumb2 will use this to try to avoid breaking up IT blocks. llvm-svn: 105745	2010-06-09 19:26:01 +00:00
Bill Wendling	5ac1d23d3d	It's an error to translate this: %reg1025 = <sext> %reg1024 ... %reg1026 = SUBREG_TO_REG 0, %reg1024, 4 into this: %reg1025 = <sext> %reg1024 ... %reg1027 = EXTRACT_SUBREG %reg1025, 4 %reg1026 = SUBREG_TO_REG 0, %reg1027, 4 The problem here is that SUBREG_TO_REG is there to assert that an implicit zext occurs. It doesn't insert a zext instruction. If we allow the EXTRACT_SUBREG here, it will give us the value after the <sext>, not the original value of %reg1024 before <sext>. llvm-svn: 105741	2010-06-09 19:00:55 +00:00
Jakob Stoklund Olesen	a13b1c29b0	Add argument name comments. llvm-svn: 105665	2010-06-09 00:40:31 +00:00
Bob Wilson	7149cfcda3	Fix a mistake in my previous change r105437: don't access operand 2 and assume that it is an immediate before checking that the instruction is an EXTRACT_SUBREG. llvm-svn: 105585	2010-06-07 23:48:46 +00:00
Dan Gohman	7398758719	Add some basic debug output. llvm-svn: 105561	2010-06-07 22:32:10 +00:00
Jim Grosbach	6201b991a2	Cleanup. Process the dbg_values separately. llvm-svn: 105554	2010-06-07 21:28:55 +00:00
Jim Grosbach	0f445f328e	Move exit check where it really belongs. llvm-svn: 105541	2010-06-07 19:12:21 +00:00
Stuart Hastings	3ca391027f	Revert 105492 & 105493 due to a testcase regression. Radar 7424645. llvm-svn: 105511	2010-06-05 00:39:29 +00:00
Dale Johannesen	df1a7f83bf	Fix some liveout handling related to tail calls, see comments. I don't think this ever resulted in problems on x86, but it would on ARM. llvm-svn: 105509	2010-06-05 00:30:45 +00:00
Evan Cheng	a03e6f85fe	Re-apply 105308 with fix. llvm-svn: 105502	2010-06-04 23:28:13 +00:00
Jim Grosbach	a1e08fb256	Make if-conversion ignore dbg_value instructions in its analysis. rdar://7797940 llvm-svn: 105498	2010-06-04 23:01:26 +00:00
Stuart Hastings	7c015988fe	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 105492	2010-06-04 22:36:03 +00:00
Jim Grosbach	50d229e6b3	Skip dbg_value instructions when scanning instructions in register scavenging. llvm-svn: 105481	2010-06-04 20:18:30 +00:00
Jakob Stoklund Olesen	864827afb0	Keep track of the call instructions whose clobber lists were skipped during fast register allocation. Process all of the clobber lists at the end of the function, marking the registers as used in MachineRegisterInfo. This is necessary in case the calls clobber callee-saved registers (sic). llvm-svn: 105473	2010-06-04 18:08:29 +00:00
Mon P Wang	622cdd2297	Fixed a bug during widening where we would avoid legalizing a node. When we replace an OpA with a widened OpB, it is possible to get new uses of OpA due to CSE when recursively updating nodes. Since OpA has been processed, the new uses are not examined again. The patch checks if this occurred and if it did, updates the new uses of OpA to use OpB. llvm-svn: 105453	2010-06-04 01:20:10 +00:00
Bob Wilson	a733daf18c	Add some missing checks in TwoAddressInstructionPass::CoalesceExtSubRegs. Check that all the instructions are in the same basic block, that the EXTRACT_SUBREGs write to the same subregs that are being extracted, and that the source and destination registers are in the same regclass. Some of these constraints can be relaxed with a bit more work. Jakob suggested that the loop that checks for subregs when NewSubIdx != 0 should use the "nodbg" iterator, so I made that change here, too. llvm-svn: 105437	2010-06-03 23:53:58 +00:00
Jim Grosbach	01edd68225	Cleanup 80-column and trim trailing whitespace llvm-svn: 105435	2010-06-03 23:49:57 +00:00

11057 Commits