llvm-project

Commit Graph

Author	SHA1	Message	Date
Manman Ren	7504ed4255	Debug Info: clean up usage of Verify. No functionality change. It should suffice to check the type of a debug info metadata, instead of calling Verify. llvm-svn: 185847	2013-07-08 18:33:29 +00:00
David Blaikie	ce1960f936	DebugInfo: Correct comment & re-format a nearby loop llvm-svn: 185844	2013-07-08 17:51:28 +00:00
David Blaikie	ac569a656f	DebugInfo: Simplify Address Pool index handling. Since the pool indexes are necessarily sequential and contiguous, just insert things in the right place rather than having to sort the sequence after the fact. No functionality change. llvm-svn: 185842	2013-07-08 17:33:10 +00:00
Hal Finkel	12493bb7d5	Improve the comment from r185794 (re: PromoteIntRes_BUILD_VECTOR) In response to Duncan's review, I believe that the original comment was not as clear as it could be. Hopefully, this is better. llvm-svn: 185824	2013-07-08 14:40:04 +00:00
Hal Finkel	8cb9a0e1d3	Fix PromoteIntRes_BUILD_VECTOR crash with i1 vectors This fixes a bug (found by llvm-stress) in DAGTypeLegalizer::PromoteIntRes_BUILD_VECTOR where it assumed that the result type would always be larger than the original operands. This is not always true, however, with boolean vectors. For example, promoting a node of type v8i1 (where the operands will be of type i32, the type to which i1 is promoted) will yield a node with a result vector element type of i16 (and operands of type i32). As a result, we cannot blindly assume that we can ANY_EXTEND the operands to the result type. llvm-svn: 185794	2013-07-08 06:16:58 +00:00
Kai Nacke	42097301f6	Revert: Emit personality function and Dwarf EH data for Win64 SEH. llvm-svn: 185788	2013-07-08 04:43:23 +00:00
Hal Finkel	ec474f28e3	Add the nearbyint -> FNEARBYINT mapping to BasicTargetTransformInfo This fixes an oversight that Intrinsic::nearbyint was not being mapped to ISD::FNEARBYINT (thus fixing the over-optimistic cost we were assigning to nearbyint calls for some targets). llvm-svn: 185783	2013-07-08 03:24:07 +00:00
Stephen Lin	cfe7f352c7	Remove trailing whitespace from SelectionDAG/*.cpp llvm-svn: 185780	2013-07-08 00:37:03 +00:00
Stephen Lin	6d715e8699	SelectionDAGBuilder: style fixes (add space between end parentheses and open brace) llvm-svn: 185768	2013-07-06 21:44:25 +00:00
Kai Nacke	c947ad2a2d	Emit personality function and Dwarf EH data for Win64 SEH. Obviously the personality function should be emitted as language handler instead of the hard coded _GCC_specific_handler. The language specific data must be placed after the unwind information therefore it must not be emitted into a separate section. Reviewed by Charles Davis and Nico Rieck. llvm-svn: 185761	2013-07-06 17:17:31 +00:00
Benjamin Kramer	c7332b2796	DAGCombiner: Don't drop extension behavior when shrinking a load when unsafe. ReduceLoadWidth unconditionally drops extensions from loads. Limit it to the case when all of the bits the extension would otherwise produce are dropped by the shrink. It would be possible to shrink the load in more cases by merging the extensions, but this isn't trivial and a very rare case. I left a TODO for that case. Fixes PR16551. llvm-svn: 185755	2013-07-06 14:05:09 +00:00
Tim Northover	dab4db5372	Stop putting operations after a tail call. This prevents the emission of DAG-generated vreg definitions after a tail call be dropping them entirely (on the grounds that nothing could use them anyway, and they interfere with O0 CodeGen). llvm-svn: 185754	2013-07-06 12:58:45 +00:00
Nico Rieck	a37acf702d	MC: Implement COFF .linkonce directive llvm-svn: 185753	2013-07-06 12:13:10 +00:00
Jakob Stoklund Olesen	e6abacfb8b	Use modern API to avoid exposing LiveInterval internals. No functional change intended. llvm-svn: 185733	2013-07-05 23:48:07 +00:00
Jakob Stoklund Olesen	c8a4e3857a	Remove dead function. llvm-svn: 185731	2013-07-05 23:04:55 +00:00
Richard Sandiford	c40f27b52d	[SystemZ] Remove no-op MVCs The stack coloring pass has code to delete stores and loads that become trivially dead after coloring. Extend it to cope with single instructions that copy from one frame index to another. The testcase happens to show an example of this kicking in at the moment. It did occur in Real Code too though. llvm-svn: 185705	2013-07-05 14:38:48 +00:00
Richard Sandiford	b5d9bd6f59	Fix double renaming bug in stack coloring pass The stack coloring pass renumbered frame indexes with a loop of the form: for each frame index FI for each instruction I that uses FI for each use of FI in I rename FI to FI' This caused problems if an instruction used two frame indexes F0 and F1 and if F0 was renamed to F1 and F1 to F2. The first time we visited the instruction we changed F0 to F1, then we changed both F1s to F2. In other words, the problem was that SSRefs recorded which instructions used an FI, but not which MachineOperands and MachineMemOperands within that instruction used it. This is easily fixed for MachineOperands by walking the instructions once and processing each operand in turn. There's already a loop to do that for dead store elimination, so it seemed more efficient to fuse the two at the block level. MachineMemOperands are more tricky because they can be shared between instructions. The patch handles them by making SSRefs an array of MachineMemOperands rather than an array of MachineInstrs. We might end up processing the same MachineMemOperand twice, but that's OK because we always know from the SSRefs index what the original frame index was. llvm-svn: 185703	2013-07-05 14:24:47 +00:00
Richard Sandiford	5dd52f8c4d	[SystemZ] Clean up register scavenging code SystemZ wants normal register scavenging slots, as close to the stack or frame pointer as possible. The only reason it was using custom code was because PrologEpilogInserter assumed an x86-like layout, where the frame pointer is at the opposite end of the frame from the stack pointer. This meant that when frame pointer elimination was disabled, the slots ended up being as close as possible to the incoming stack pointer, which is the opposite of what we want on SystemZ. This patch adds a new knob to say which layout is used and converts SystemZ to use target-independent scavenging slots. It's one of the pieces needed to support frame-to-frame MVCs, where two slots might be required. The ABI requires us to allocate 160 bytes for calls, so one approach would be to use that area as temporary spill space instead. It would need some surgery to make sure that the slot isn't live across a call though. I stuck to the "isFPCloseToIncomingSP - ..." style comment on the "do what the surrounding code does" principle. The FP case is already covered by several Systemz/frame-* tests, which fail without the PrologueEpilogueInserter change, so no new ones are needed. No behavioural change intended. llvm-svn: 185696	2013-07-05 12:55:00 +00:00
Benjamin Kramer	5dbec7d961	Simplify code. No functionality change. llvm-svn: 185689	2013-07-05 10:20:57 +00:00
Nico Rieck	316c37407e	Initialize object file info before output streamer r179494 switched to using the object file info to retrieve the default text section for some MC streamers. It is possible that initializing an MC streamer can request sections before the object file info is initialized when the AutoInitSections flag is set on the streamer. llvm-svn: 185670	2013-07-04 21:37:26 +00:00
Jakob Stoklund Olesen	db429d9483	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185625	2013-07-04 13:54:20 +00:00
Jakob Stoklund Olesen	6a7d68349f	Typo. llvm-svn: 185618	2013-07-04 04:53:49 +00:00
Jakob Stoklund Olesen	fee2a20209	Simplify landing pad lowering. Stop using the ISD::EXCEPTIONADDR and ISD::EHSELECTION when lowering landing pad arguments. These nodes were previously legalized into CopyFromReg nodes, but that never worked properly because the CopyFromReg node weren't guaranteed to be scheduled at the top of the basic block. This meant the exception pointer and selector registers could be clobbered before being copied to a virtual register. This patch copies the two physical registers to virtual registers at the beginning of the basic block, and lowers the landingpad instruction directly to two CopyFromReg nodes reading the virtual registers. This is safe because virtual registers don't get clobbered. A future patch will remove the ISD::EXCEPTIONADDR and ISD::EHSELECTION nodes. llvm-svn: 185617	2013-07-04 04:53:45 +00:00
Jakob Stoklund Olesen	3d8560c382	FastISel can only apend to basic blocks. Compute the insertion point from the end of the basic block instead of skipping labels from the front. This caused failures in landing pads when live-in copies where inserted before instruction selection. llvm-svn: 185616	2013-07-04 04:32:39 +00:00
Jakob Stoklund Olesen	bbbb53262a	Live-in copies go after EH_LABELs. This will soon be tested by exception handling working at all. llvm-svn: 185615	2013-07-04 04:32:35 +00:00
Jakob Stoklund Olesen	a1f5b901a5	Revert r185595-185596 which broke buildbots. Revert "Simplify landing pad lowering." Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes." llvm-svn: 185600	2013-07-04 00:26:30 +00:00
Jakob Stoklund Olesen	f33ec531fa	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185596	2013-07-03 23:56:31 +00:00
Jakob Stoklund Olesen	fa6a7b9b02	Simplify landing pad lowering. Stop using the ISD::EXCEPTIONADDR and ISD::EHSELECTION when lowering landing pad arguments. These nodes were previously legalized into CopyFromReg nodes, but that never worked properly because the CopyFromReg node weren't guaranteed to be scheduled at the top of the basic block. This meant the exception pointer and selector registers could be clobbered before being copied to a virtual register. This patch copies the two physical registers to virtual registers at the beginning of the basic block, and lowers the landingpad instruction directly to two CopyFromReg nodes reading the virtual registers. This is safe because virtual registers don't get clobbered. A future patch will remove the ISD::EXCEPTIONADDR and ISD::EHSELECTION nodes. llvm-svn: 185595	2013-07-03 23:56:24 +00:00
Jakob Stoklund Olesen	533c3bf2d6	Add MachineBasicBlock::addLiveIn(). This function adds a live-in physical register to an MBB and ensures that it is copied to a virtual register immediately. llvm-svn: 185594	2013-07-03 23:56:20 +00:00
Eric Christopher	614a89f5b2	Hoist all of the Entry.getLoc() calls int a single variable. llvm-svn: 185589	2013-07-03 22:40:21 +00:00
Eric Christopher	25f0642afd	Make DotDebugLocEntry a class, reorder the members along with comments for them and update all uses. llvm-svn: 185588	2013-07-03 22:40:18 +00:00
Eric Christopher	270a12cff3	Elaborate on comment. llvm-svn: 185586	2013-07-03 21:37:03 +00:00
Eric Christopher	dd7b4615d1	Add names to the header file since they help in documenting the API (and for consistency). llvm-svn: 185585	2013-07-03 21:23:59 +00:00
Eric Christopher	18cf0610ef	Move typedefs inside the class that they belong to. llvm-svn: 185573	2013-07-03 20:36:36 +00:00
Eric Christopher	98341b8d63	Remove unused field. llvm-svn: 185523	2013-07-03 08:26:07 +00:00
Eric Christopher	c665af772a	Constify a few functions. llvm-svn: 185520	2013-07-03 08:13:55 +00:00
Craig Topper	af0ad9e20f	Use SmallVectorImpl::const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185514	2013-07-03 05:18:47 +00:00
Craig Topper	72cde639cb	Use SmallVectorImpl instead of SmallVector as method argument to avoid specifying vector size. llvm-svn: 185513	2013-07-03 05:16:59 +00:00
Craig Topper	e1c1d363a5	Use SmallVectorImpl instead of SmallVector for iterators and references to avoid specifying the vector size unnecessarily. llvm-svn: 185512	2013-07-03 05:11:49 +00:00
Craig Topper	24fd7eed8a	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185509	2013-07-03 04:42:33 +00:00
Craig Topper	2b4a201ba6	Introduce some typedefs for DenseMaps containing SmallVectors so the vector size doesn't have to repeated when creating iterators for the DenseMap. llvm-svn: 185508	2013-07-03 04:40:27 +00:00
Craig Topper	80170e54ad	Return SmallVectorImpl& instead of SmallVector& in a couple places to avoid having to specify the vector size in multiple places. llvm-svn: 185507	2013-07-03 04:30:58 +00:00
Craig Topper	977e9cd9c6	Use SmallVectorImpl& instead of SmallVector& to avoid needlessly respecifying the small vector size. llvm-svn: 185505	2013-07-03 04:24:43 +00:00
Craig Topper	d8e436550a	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185504	2013-07-03 04:17:25 +00:00
Eric Christopher	f94eb2bb26	Avoid doing a lot of computation when we have multiple ranges and avoid adding information for the debug_inlined section when it isn't going to be emitted anyhow. llvm-svn: 185500	2013-07-03 02:23:53 +00:00
Eric Christopher	dc42ea8fbd	Move iterator to where it's used and update comments. llvm-svn: 185498	2013-07-03 01:57:28 +00:00
Eric Christopher	070bf1685b	Move instance variable before experimental section. llvm-svn: 185497	2013-07-03 01:57:26 +00:00
Eric Christopher	808fd7bb54	Fix typo to make grep for DW_AT_comp_dir work without case-insensitive grep. llvm-svn: 185496	2013-07-03 01:57:23 +00:00
Eric Christopher	95a7d49ecd	Remove unnecessary forward declare. llvm-svn: 185495	2013-07-03 01:57:20 +00:00
Eric Christopher	c1110838d3	Add a helpful comment. llvm-svn: 185492	2013-07-03 01:22:29 +00:00
Eric Christopher	78fcf490d8	addConstantValue, addConstantFPValue never returned anything but true, so remove the return value and propagate accordingly. llvm-svn: 185490	2013-07-03 01:08:30 +00:00
Manman Ren	ac8062bb72	Debug Info: use module flag to set up Dwarf version. Correctly handles ref_addr depending on the Dwarf version. Emit Dwarf with version from module flag. TODO: turn on/off features depending on the Dwarf version. llvm-svn: 185484	2013-07-02 23:40:10 +00:00
Eric Christopher	9046f94271	Fix comment. llvm-svn: 185480	2013-07-02 21:36:07 +00:00
Ulrich Weigand	2b6fc8d613	[DebugInfo] Allow getDebugThreadLocalSymbol to return MCExpr This allows getDebugThreadLocalSymbol to return a generic MCExpr instead of just a MCSymbolRefExpr. This is in preparation for supporting debug info for TLS variables on PowerPC, where we need to describe the variable location using a more complex expression than just MCSymbolRefExpr. llvm-svn: 185460	2013-07-02 18:47:09 +00:00
Ulrich Weigand	8b3d22664e	[DebugInfo] Hold generic MCExpr in AddrPool This changes the AddrPool infrastructure to enable it to hold generic MCExpr expressions, not just MCSymbolRefExpr. This is in preparation for supporting debug info for TLS variables on PowerPC, where we need to describe the variable location using a more complex expression than just MCSymbolRefExpr. llvm-svn: 185459	2013-07-02 18:46:46 +00:00
Ulrich Weigand	396ba8b495	[DebugInfo] Introduce DIEExpr variant of DIEValue to hold MCExpr values This partially reverts r185202 and restores DIELabel to hold plain MCSymbol references. Instead, we add a new subclass DIEExpr of DIEValue that can hold generic MCExpr references. This is in preparation for supporting debug info for TLS variables on PowerPC, where we need to describe the variable location using a more complex expression than just MCSymbolRefExpr. llvm-svn: 185458	2013-07-02 18:46:26 +00:00
Jakob Stoklund Olesen	13be6bfb60	Revert (most of) r185393 and r185395. "Remove floating point computations form SpillPlacement.cpp." These commits caused test failures in lencod on clang-native-arm-lnt. I suspect these changes are only exposing an existing issue, but reverting anyway to keep the bots passing while we investigate. llvm-svn: 185447	2013-07-02 17:31:58 +00:00
Rafael Espindola	64e1af8eb9	Remove address spaces from MC. This is dead code since PIC16 was removed in 2010. The result was an odd mix, where some parts would carefully pass it along and others would assert it was zero (most of the object streamer for example). llvm-svn: 185436	2013-07-02 15:49:13 +00:00
Tim Northover	6823900e55	DAGCombiner: fix use-counting issue when forming zextload DAGCombiner was counting all uses of a load node when considering whether it's worth combining into a zextload. Really, it wants to ignore the chain and just count real uses. rdar://problem/13896307 llvm-svn: 185419	2013-07-02 09:58:53 +00:00
David Blaikie	8466ca86fe	PR14728: DebugInfo: TLS variables with -gsplit-dwarf llvm-svn: 185398	2013-07-01 23:55:52 +00:00
Jakob Stoklund Olesen	3b4e9ccfe3	Tweak some comments that referred to the old bias computations. llvm-svn: 185395	2013-07-01 23:36:37 +00:00
Jakob Stoklund Olesen	55daed27b7	Remove floating point computations form SpillPlacement.cpp. Patch by Benjamin Kramer! Use the BlockFrequency class instead of floats in the Hopfield network computations. This rescales the node Bias field from a [-2;2] float range to two block frequencies BiasN and BiasP pulling in opposite directions. This construct has a more predictable behavior when block frequencies saturate. The per-node scaling factors are no longer necessary, assuming the block frequencies around a bundle are consistent. This patch can cause the register allocator to make different spilling decisions. The differences should be small. llvm-svn: 185393	2013-07-01 23:19:39 +00:00
David Blaikie	1b01ae8648	PR16493: DebugInfo with TLS on PPC crashing due to invalid relocation Restrict the current TLS support to X86 ELF for now. Test that we don't produce it on PPC & we can flesh that test case out with the right thing once someone implements it. llvm-svn: 185389	2013-07-01 21:45:25 +00:00
Lang Hames	7d99d7977c	Make PBQP require/preserve MachineLoopInfo - the spiller requires it. llvm-svn: 185378	2013-07-01 20:47:47 +00:00
Cameron Zwarich	e35038da94	Fix the build after r185363. Use llvm::next instead of raw next. llvm-svn: 185367	2013-07-01 19:49:48 +00:00
Cameron Zwarich	867bfcd546	Fix PR16508. When phis get lowered, destination copies are inserted using an iterator that is determined once for all phis in the block, which BuildMI interprets as a request to insert an instruction directly before the iterator. In the case of a cyclic phi, source copies may also be inserted directly before this iterator, which can cause source copies to be inserted before destination copies. The fix is to keep an iterator to the last phi and then advance it while lowering each phi in order to insert destination copies directly after the phis. llvm-svn: 185363	2013-07-01 19:42:46 +00:00
Michael Gottesman	fd62bb9d3e	Added c++ mode selector to head of SelectionDAGBuilder.h so editors open it in c++ mode instead of c mode. llvm-svn: 185348	2013-07-01 16:53:41 +00:00
Benjamin Kramer	fd51092719	misched: Compress pairs returned by getUnderlyingObjectsForInstr. llvm-svn: 185266	2013-06-29 18:41:17 +00:00
David Blaikie	f269497068	DebugInfo: PR14728: TLS support Based on GCC's output for TLS variables (OP_constNu, x@dtpoff, OP_lo_user), this implements debug info support for TLS in ELF. Verified that this output is correct/sufficient on Linux (using gold - if you're using binutils-ld, you'll need something with the fix for http://sourceware.org/bugzilla/show_bug.cgi?id=15685 in it). Support on non-ELF is sort of "arbitrary" at the moment - if Apple folks want to discuss (or just go ahead & implement) how this should work in MachO, etc, I'm open. llvm-svn: 185203	2013-06-28 20:05:11 +00:00
David Blaikie	f3cd7c5115	DebugInfo: Pass MCSymbolRefExprs for labels instead of MCSymbols This is a precursor to adding debug info support for TLS which requires non-default relocations applied to TLS symbols. llvm-svn: 185202	2013-06-28 20:05:04 +00:00
David Blaikie	a67de2b1bc	DebugInfo: Revise r185189 to avoid subtle 'unsigned += bool' llvm-svn: 185190	2013-06-28 18:55:13 +00:00
David Blaikie	b8ef7851dc	DebugInfo: Simplify the AddressPool representation llvm-svn: 185189	2013-06-28 18:47:19 +00:00
David Blaikie	dea547b94d	DebugInfo: constify the AddressPool MCSymbol pointers llvm-svn: 185188	2013-06-28 18:47:14 +00:00
Lang Hames	c22e39d83d	Add missing case to switch statement - DAGTypeLegalizer::ExpandIntegerResult should expand ATOMIC_CMP_SWAP nodes the same way that it does for ATOMIC_SWAP. Since ATOMIC_LOADs on some targets (e.g. older ARM variants) get legalized to ATOMIC_CMP_SWAPs, the missing case had been causing i64 atomic loads to crash during isel. <rdar://problem/14074644> llvm-svn: 185186	2013-06-28 18:36:42 +00:00
Manman Ren	983a16c08a	Debug Info: clean up usage of Verify. No functionality change. It should suffice to check the type of a debug info metadata, instead of calling Verify. For cases where we know the type of a DI metadata, use assert. Also update testing cases to make them conform to the format of DI classes. llvm-svn: 185135	2013-06-28 05:43:10 +00:00
Benjamin Kramer	8da6468a34	Remove unused variable. llvm-svn: 185072	2013-06-27 11:26:41 +00:00
Elena Demikhovsky	fed077be03	Fixed a comment. llvm-svn: 184933	2013-06-26 12:15:53 +00:00
Elena Demikhovsky	6769c50d9e	Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector. llvm-svn: 184931	2013-06-26 10:55:03 +00:00
Manman Ren	aed30fa3de	Remove unused code. No functionality change. llvm-svn: 184866	2013-06-25 18:49:55 +00:00
Benjamin Kramer	866793109e	BlockFrequency: Bump up the entry frequency a bit. This is a band-aid to fix the most severe regressions we're seeing from basing spill decisions on block frequencies, until we have a better solution. llvm-svn: 184835	2013-06-25 13:34:40 +00:00
Eric Christopher	5fdd68e596	80-column and tab character fixes. llvm-svn: 184792	2013-06-24 23:20:02 +00:00
Eric Christopher	40b6bf6f46	Formatting. llvm-svn: 184788	2013-06-24 21:34:55 +00:00
Eric Christopher	bf2d23c0d7	Use const references instead of pointers to references that are never modified. No functional change. llvm-svn: 184781	2013-06-24 21:07:27 +00:00
Andrew Trick	97a1d7c475	Fix tail merging to assign the (more) correct BasicBlock when splitting. This makes it possible to write unit tests that are less susceptible to minor code motion, particularly copy placement. block-placement.ll covers this case with -pre-RA-sched=source which will soon be default. One incorrectly named block is already fixed, but without this fix, enabling new coalescing and scheduling would cause more failures. llvm-svn: 184680	2013-06-24 01:55:01 +00:00
David Blaikie	5acff7e691	DebugInfo: PR14404: Avoid truncating 64 bit values into 32 bits for ULEB128/SLEB128 generation llvm-svn: 184669	2013-06-23 18:31:11 +00:00
David Blaikie	2b380232c3	DebugInfo: Support (using GNU extensions) for template template parameters and parameter packs llvm-svn: 184643	2013-06-22 18:59:11 +00:00
Chad Rosier	295bd43adb	The getRegForInlineAsmConstraint function should only accept MVT value types. llvm-svn: 184642	2013-06-22 18:37:38 +00:00
Andrew Trick	cbd7305d09	Prevent LiveRangeEdit from deleting bundled instructions. We have no targets on trunk that bundle before regalloc. However, we have been advertising regalloc as bundle safe for use with out-of-tree targets. We need to at least contain the parts of the code that are still unsafe. llvm-svn: 184620	2013-06-22 00:33:48 +00:00
David Blaikie	97c6c5bd98	DebugInfo: Don't lose unreferenced non-trivial by-value parameters A FastISel optimization was causing us to emit no information for such parameters & when they go missing we end up emitting a different function type. By avoiding that shortcut we not only get types correct (very important) but also location information (handy) - even if it's only live at the start of a function & may be clobbered later. Reviewed/discussion by Evan Cheng & Dan Gohman. llvm-svn: 184604	2013-06-21 22:56:30 +00:00
Michael Liao	62ebfd8786	Fix PR16360 When (srl (anyextend x), c) is folded into (anyextend (srl x, c)), the high bits are not cleared. Add 'and' to clear off them. llvm-svn: 184575	2013-06-21 18:45:27 +00:00
Andrew Trick	5749b8be01	Update physreg live intervals during remat. llvm-svn: 184574	2013-06-21 18:33:26 +00:00
Andrew Trick	8d02e917f4	Added -precompute-phys-liveness for testing LiveIntervals updates. llvm-svn: 184573	2013-06-21 18:33:23 +00:00
Andrew Trick	6b9c49a275	Handle more cases in LiveRangeEdit::eliminateDeadDefs. Live intervals for dead physregs may be created during coalescing. We need to update these in the event that their instruction goes away. crash.ll is the unit test that catches it when MI sched is enabled on X86. llvm-svn: 184572	2013-06-21 18:33:20 +00:00
Andrew Trick	530fc1f486	Refactor LiveRangeEdit::eliminateDeadDefs. I want to add logic to handle more cases. llvm-svn: 184571	2013-06-21 18:33:17 +00:00
Andrew Trick	7df3f01703	whitespace llvm-svn: 184570	2013-06-21 18:33:14 +00:00
Andrew Trick	714aec021d	Fix a -join-globalcopies bug; handle undef operands. llvm-svn: 184569	2013-06-21 18:33:11 +00:00
Andrew Trick	75961ecc1a	Modify the -join-globalcopies option (off by default). Always coalesce in forward order to propagate rematerialization. I'm fixing this option so I can enable it by default soon. llvm-svn: 184568	2013-06-21 18:33:09 +00:00
Andrew Trick	3a851a27b8	Make rematerialization in the coalescer less sensitive to LRG order. llvm-svn: 184567	2013-06-21 18:33:06 +00:00
Andrew Trick	b55db58edf	MI-Sched: cleanup DEBUG output. llvm-svn: 184565	2013-06-21 18:33:01 +00:00
Andrew Trick	736dd9a255	MI-Sched: Adjust regpressure limits for reserved regs. llvm-svn: 184564	2013-06-21 18:32:58 +00:00
David Blaikie	ea2605dc1a	DebugInfo: don't use location lists when the location covers the whole function anyway Fix up three tests - one that was relying on abbreviation number, another relying on a location list in this case (& testing raw asm, changed that to use dwarfdump on the debug_info now that that's where the location is), and another which was added in r184368 - exposing a bug in that fix that is exposed when we emit the location inline rather than through a location list. Fix that bug while I'm here. llvm-svn: 184387	2013-06-20 00:25:24 +00:00
Bill Wendling	7ed411062b	This is now a duplicate. llvm-svn: 184376	2013-06-19 22:16:33 +00:00
Bill Wendling	cce21cc347	Make the comparison operators non-member functions. llvm-svn: 184373	2013-06-19 22:09:25 +00:00
David Blaikie	81a4dc75e9	DebugInfo: PR14763/r183329 correct the location of indirect parameters We had been papering over a problem with location info for non-trivial types passed by value by emitting their type as references (this caused the debugger to interpret the location information correctly, but broke the type of the function). r183329 corrected the type information but lead to the debugger interpreting the pointer parameter as the value - the debug info describing the location needed an extra dereference. Use a new flag in DIVariable to add the extra indirection (either by promoting an existing DW_OP_reg (parameter passed in a register) to DW_OP_breg + 0 or by adding DW_OP_deref to an existing DW_OP_breg + n (parameter passed on the stack). llvm-svn: 184368	2013-06-19 21:55:13 +00:00
Bill Wendling	a3cd350249	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184360	2013-06-19 21:36:55 +00:00
Bill Wendling	7a639ea2a4	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184352	2013-06-19 21:07:11 +00:00
Bill Wendling	afc1036f3e	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184349	2013-06-19 20:51:24 +00:00
Bill Wendling	0ccf31007f	Don't cache the TLI object since we have access to it through TargetMachine already. llvm-svn: 184346	2013-06-19 20:32:16 +00:00
Quentin Colombet	b51a68681a	During SelectionDAG building explicitly set a node to constant zero when the value is zero. This allows optmizations to kick in more easily. Fix some test cases so that they remain meaningful (i.e., not completely dead coded) when optimizations apply. <rdar://problem/14096009> superfluous multiply by high part of zero-extended value. llvm-svn: 184222	2013-06-18 20:14:39 +00:00
Timur Iskhodzhanov	302ba2314a	Hopefully fix the MSVS build after r184105 llvm-svn: 184178	2013-06-18 07:57:22 +00:00
Bill Wendling	bc07a8900c	Use pointers to the MCAsmInfo and MCRegInfo. Someone may want to do something crazy, like replace these objects if they change or something. No functionality change intended. llvm-svn: 184175	2013-06-18 07:20:20 +00:00
Bill Wendling	551a6775db	Simplify some of the code. No functionality change. llvm-svn: 184172	2013-06-18 06:07:26 +00:00
Andrew Trick	5d4861867a	MI-Sched: handle ReadAdvance latencies as used by Swift. llvm-svn: 184135	2013-06-17 21:45:18 +00:00
Andrew Trick	71f08a3e74	Give RegMax higher priority. llvm-svn: 184133	2013-06-17 21:45:13 +00:00
Andrew Trick	3c3a40e4c6	Remove compareRPDelta. A complex, expensive heuristic with little value in the current design. llvm-svn: 184132	2013-06-17 21:45:11 +00:00
Andrew Trick	d31db4b45d	MI Sched: fix a typo in RegPressure heuristics. llvm-svn: 184131	2013-06-17 21:45:09 +00:00
Andrew Trick	7e63046ce9	MI-Sched: Remove another heuristic that is sensitive to queue order. llvm-svn: 184130	2013-06-17 21:45:07 +00:00
Andrew Trick	d40d0f2c1b	MI-Sched: Track multiple candidates with the same priority level. This eliminates the MultiPressure scheduling "reason". It was sensitive to queue order. We don't like being sensitive to queue order. llvm-svn: 184129	2013-06-17 21:45:05 +00:00
Bill Wendling	626c991ce9	Directly access objects which may change during compilation. llvm-svn: 184121	2013-06-17 20:41:25 +00:00
Benjamin Kramer	e2a1d89e14	Switch spill weights from a basic loop depth estimation to BlockFrequencyInfo. The main advantages here are way better heuristics, taking into account not just loop depth but also __builtin_expect and other static heuristics and will eventually learn how to use profile info. Most of the work in this patch is pushing the MachineBlockFrequencyInfo analysis into the right places. This is good for a 5% speedup on zlib's deflate (x86_64), there were some very unfortunate spilling decisions in its hottest loop in longest_match(). Other benchmarks I tried were mostly neutral. This changes register allocation in subtle ways, update the tests for it. 2012-02-20-MachineCPBug.ll was deleted as it's very fragile and the instruction it looked for was gone already (but the FileCheck pattern picked up unrelated stuff). llvm-svn: 184105	2013-06-17 19:00:36 +00:00
David Blaikie	b735b4d6db	DebugInfo: remove target-specific Frame Index handling for DBG_VALUE MachineInstrs Frame index handling is now target-agnostic, so delete the target hooks for creation & asm printing of target-specific addressing in DBG_VALUEs and any related functions. llvm-svn: 184067	2013-06-16 20:34:27 +00:00
David Blaikie	0252265be0	Debug Info: Simplify Frame Index handling in DBG_VALUE Machine Instructions Rather than using the full power of target-specific addressing modes in DBG_VALUEs with Frame Indicies, simply use Frame Index + Offset. This reduces the complexity of debug info handling down to two representations of values (reg+offset and frame index+offset) rather than three or four. Ideally we could ensure that frame indicies had been eliminated by the time we reached an assembly or dwarf generation, but I haven't spent the time to figure out where the FIs are leaking through into that & whether there's a good place to convert them. Some FI+offset=>reg+offset conversion is done (see PrologEpilogInserter, for example) which is necessary for some SelectionDAG assumptions about registers, I believe, but it might be possible to make this a more thorough conversion & ensure there are no remaining FIs no matter how instruction selection is performed. llvm-svn: 184066	2013-06-16 20:34:15 +00:00
David Blaikie	c9380db4ba	DebugInfo: print DBG_VALUE MachineInstrs with [] for deref and drop the offset when it's zero llvm-svn: 184045	2013-06-15 15:52:58 +00:00
Andrew Trick	8e8415f5ab	Missing NDEBUGs. llvm-svn: 184039	2013-06-15 05:46:47 +00:00
Andrew Trick	f78e7fa140	MI-Sched: heuristics using the new latency and machine model. llvm-svn: 184038	2013-06-15 05:39:19 +00:00
Andrew Trick	de2109eb4c	Machine Model: Add MicroOpBufferSize and resource BufferSize. Replace the ill-defined MinLatency and ILPWindow properties with with straightforward buffer sizes: MCSchedMode::MicroOpBufferSize MCProcResourceDesc::BufferSize These can be used to more precisely model instruction execution if desired. Disabled some misched tests temporarily. They'll be reenabled in a few commits. llvm-svn: 184032	2013-06-15 04:49:57 +00:00
Andrew Trick	e2ff575bb8	MI-Sched: Rename IssueCount to CurrMOps. "Counts" refer to scaled resource counts within a region. CurrMOps is simply the number of micro-ops to be issue in the current cycle. llvm-svn: 184031	2013-06-15 04:49:49 +00:00
Andrew Trick	0cd8afc547	MI-Sched: Remove the temporary EnableCopyConstrain flag. llvm-svn: 184030	2013-06-15 04:49:46 +00:00
Andrew Trick	f5b8ef2690	MI-Sched: added tracking of dependent latency for better heuristics. Heuristics compare the critical path in the scheduled code, called ExpectedLatency, with the latency of instructions remaining to be scheduled. There are two ways to look at remaining latency: (1) Dependent latency includes the latency between unscheduled and scheduled instructions. (2) Independent latency is simply the height (bottom-up) or depth (top-down) of instructions currently in the ready Q. llvm-svn: 184029	2013-06-15 04:49:44 +00:00
Andrew Trick	5580e5ca80	MI-Sched: DEBUG: print critical resource. llvm-svn: 184028	2013-06-15 04:49:42 +00:00
Stephen Lin	605207fe75	SelectionDAG: slightly refactor DAGCombiner::visitSELECT_CC to avoid redudant checks... This doesn't really effect performance due to all the relevant calls being transparent but is clearer. llvm-svn: 184027	2013-06-15 04:03:33 +00:00
David Blaikie	2dbebcfede	Debug Info: Don't print the display name and colon prefix for DEBUG_VALUE comments if the display name is empty llvm-svn: 184026	2013-06-15 00:33:47 +00:00
Matt Arsenault	d2f0332a29	Introduce getSelect usage and use more getSelectCC llvm-svn: 184012	2013-06-14 22:04:37 +00:00
Stephen Lin	4e69d01b67	SelectionDAG: minor fix to order of operands in comments to match the code llvm-svn: 184008	2013-06-14 21:33:58 +00:00
Tim Northover	059cead5ed	Mark rematerialized super/sub registers as dead. When we're rematerializing into a not-quite-right register we already add the real definition as an imp-def, but we should also be marking the "official" register as dead, since nothing else is going to use it as a result of this remat. Not doing this can affect pressure tracking. rdar://problem/14158833 llvm-svn: 184002	2013-06-14 20:22:21 +00:00
Stephen Lin	e31f2d2d54	SelectionDAG: Fix incorrect condition checks in some cases of folding FADD/FMUL combinations; also improve accuracy of comments llvm-svn: 183993	2013-06-14 18:17:35 +00:00
Derek Schuff	1804b5795a	Make PrologEpilogInserter save/restore all callee saved registers in functions which call __builtin_unwind_init() __builtin_unwind_init() is an undocumented gcc intrinsic which has this effect, and is used in libgcc_eh. Goes part of the way toward fixing PR8541. llvm-svn: 183984	2013-06-14 16:15:29 +00:00
Jakub Staszak	80df8b837f	Move #include from .h to .cpp file. llvm-svn: 183960	2013-06-14 00:00:13 +00:00
Dan Gohman	6a0c9ae4f9	Print ConstantPool entries initialized to Values with WriteAsOperand instead of operator<< so that functions are printed as just their name instead of as their entire definition, which is excessively verbose in this context. llvm-svn: 183871	2013-06-12 22:19:19 +00:00
Eric Christopher	773c155c15	Formatting. llvm-svn: 183807	2013-06-11 23:41:38 +00:00
Eric Christopher	a2f806fd20	Remove a few fixmes, the only work we're doing is getting the string to return and this is done all over. llvm-svn: 183704	2013-06-10 22:24:10 +00:00
Eric Christopher	2280c5baa6	Fix up comment. llvm-svn: 183703	2013-06-10 22:24:07 +00:00
Eric Christopher	8c47bb1a02	Remove unused function. llvm-svn: 183698	2013-06-10 21:59:28 +00:00
Eric Christopher	b7e3b42c01	IndentCount is only used within NDEBUG code. llvm-svn: 183695	2013-06-10 20:58:53 +00:00
David Majnemer	0fc8670cb0	TargetLowering: Clean up method description comments llvm-svn: 183623	2013-06-08 23:51:45 +00:00
Quentin Colombet	249cb6756c	Reapply r183552. This time, use a standard type for the option to avoid template instantiation issue with non-standard type. Add a backend option to warn on a given stack size limit. Option: -mllvm -warn-stack-size=<limit> Output (if limit is exceeded): warning: Stack size limit exceeded (<actual size>) in <functionName>. The longer term plan is to hook that to a clang warning. PR:4072 <rdar://problem/13987214>. llvm-svn: 183595	2013-06-08 00:07:54 +00:00
Quentin Colombet	bd5a201c85	Revert commits related to stack warning. llvm-svn: 183579	2013-06-07 22:14:50 +00:00
Quentin Colombet	6baf581b93	Add a backend option to warn on a given stack size limit. Option: -mllvm -warn-stack-size=<limit> Output (if limit is exceeded): warning: Stack size limit exceeded (<actual size>) in <functionName>. The longer term plan is to hook that to a clang warning. PR:4072 <rdar://problem/13987214> llvm-svn: 183552	2013-06-07 20:18:12 +00:00
Quentin Colombet	ba366011c8	Teach AsmPrinter how to print odd constants. Fix an assertion when the compiler encounters big constants whose bit width is not a multiple of 64-bits. Although clang would never generate something like this, the backend should be able to handle any legal IR. <rdar://problem/13363576> llvm-svn: 183544	2013-06-07 18:36:03 +00:00
Rafael Espindola	aad6c24422	Support OpenBSD's native frame protection conventions. OpenBSD's stack smashing protection differs slightly from other platforms: 1. The smash handler function is "__stack_smash_handler(const char *funcname)" instead of "__stack_chk_fail(void)". 2. There's a hidden "long __guard_local" object that gets linked into each executable and DSO. Patch by Matthew Dempsky. llvm-svn: 183533	2013-06-07 16:35:57 +00:00
Duncan Sands	e0a607e376	Correct wrong register in this example, pointed out by Baoshan Pang. llvm-svn: 183495	2013-06-07 08:30:55 +00:00
David Blaikie	b272a75328	Fix break in r183446 - helps to increment the iterator in a loop llvm-svn: 183454	2013-06-06 22:28:26 +00:00
David Blaikie	36d5d2f08d	Debug Info: simplify parameter ordering preservation Seems we emit the parameter ordering number (spuriously named 'arg number') in the debug info, so there's no need to search through the variable list to figure out the parameter ordering. This implementation does 'always' do the work, even in non-optimized debug info (the previous implementation checked the existence of the 'variables' list on the subprogram which is only present in optimized builds). No intended functionality change. llvm-svn: 183446	2013-06-06 21:04:51 +00:00
Bill Wendling	f77190855d	Cache the TargetLowering info object as a pointer. Caching it as a pointer allows us to reset it if the TargetMachine object changes. llvm-svn: 183361	2013-06-06 00:43:09 +00:00
Bill Wendling	8db01cb262	Don't cache the TargetLoweringInfo object inside of the FunctionLowering object. The TargetLoweringInfo object is owned by the TargetMachine. In the future, the TargetMachine object may change, which may also change the TargetLoweringInfo object. llvm-svn: 183356	2013-06-06 00:11:39 +00:00
Bill Wendling	6fd15ca02e	Add space to assert message. llvm-svn: 183346	2013-06-05 23:13:26 +00:00
David Blaikie	6f1a8067fb	PR15662: Optimized debug info produces out of order function parameters When a function is inlined we lazily construct the variables representing the function's parameters. After that, we add any remaining unused parameters. If the function doesn't use all the parameters, or uses them out of order, then the DWARF would produce them in that order, producing a parameter order that doesn't match the source. This fix causes us to always keep the arg variables at the start of the variable list & in the original order from the source. llvm-svn: 183297	2013-06-05 05:39:59 +00:00
Shuxin Yang	8b8fd2171c	Fix a defect in code-layout pass, improving Benchmarks/Olden/em3d/em3d by about 30% (4.58s vs 3.2s on an oldish Mac Tower). The corresponding src is excerpted bellow. The lopp accounts for about 90% of execution time. -------------------- cat -n test-suite/MultiSource/Benchmarks/Olden/em3d/make_graph.c 90 91 for (k=0; k<j; k++) 92 if (other_node == cur_node->to_nodes[k]) break; The defective layout is sketched bellow, where the two branches need to swap. ------------------------------------------------------------------------ L: ... if (cond) goto out-of-loop goto L While this code sequence is defective, I don't understand why it incurs 1/3 of execution time. CPU-event-profiling indicates the poor laoyout dose not increase in br-misprediction; it dosen't increase stall cycle at all, and it dosen't prevent the CPU detect the loop (i.e. Loop-Stream-Detector seems to be working fine as well)... The root cause of the problem is that the layout pass calls AnalyzeBranch() with basic-block which is not updated to reflect its current layout. rdar://13966341 llvm-svn: 183174	2013-06-04 01:00:57 +00:00
Benjamin Kramer	7c275640e7	Move getRealLinkageName to a common place and remove all the duplicates of it. Also simplify code a bit while there. No functionality change. llvm-svn: 183076	2013-06-01 17:51:14 +00:00
Eric Christopher	65ac02ad78	Const-ify some printing and dumping code for DIEValues. llvm-svn: 183057	2013-05-31 22:50:40 +00:00
Quentin Colombet	bf490d4a32	Loop Strength Reduce: Scaling factor cost. Account for the cost of scaling factor in Loop Strength Reduce when rating the formulae. This uses a target hook. The default implementation of the hook is: if the addressing mode is legal, the scaling factor is free. <rdar://problem/13806271> llvm-svn: 183045	2013-05-31 21:29:03 +00:00
Kai Nacke	e1823b6b85	Remove useless code from transitioning to new EH scheme Removes all uses of the variable UsesNewEH. Simply return false in case that no resume instructions were found. llvm-svn: 183016	2013-05-31 16:30:36 +00:00
Tim Northover	69cd121dd9	Fix rematerialization into physical registers. r182872 introduced a bug in how the register-coalescer's rematerialization handled defining a physical register. It relied on the output of the coalescer's setRegisters method to determine whether the replacement instruction needed an implicit-def. However, this value isn't necessarily the same as the CopyMI's actual destination register which is what the rest of the basic-block expects us to be defining. The commit changes the rematerializer to use the actual register attached to CopyMI in its decision. This will be tested soon by an X86 patch which moves everything to using MOV32r0 instead of other sizes. llvm-svn: 182925	2013-05-30 12:30:50 +00:00
Eric Christopher	d1c5a31721	Rename variable to be more descriptive. llvm-svn: 182903	2013-05-30 00:43:35 +00:00
Eric Christopher	1e1c7f1b15	Formatting. llvm-svn: 182902	2013-05-30 00:43:32 +00:00
Andrew Trick	ad6d08ac6f	Order CALLSEQ_START and CALLSEQ_END nodes. Fixes PR16146: gdb.base__call-ar-st.exp fails after pre-RA-sched=source fixes. Patch by Xiaoyi Guo! This also fixes an unsupported dbg.value test case. Codegen was previously incorrect but the test was passing by luck. llvm-svn: 182885	2013-05-29 22:03:55 +00:00
Bill Wendling	70b1400e6d	Don't reach into the middle of TargetMachine and cache one of its ivars. Not only does this break encapsulation, it's gross. llvm-svn: 182876	2013-05-29 20:37:19 +00:00
Tim Northover	b65f6b0820	Teach ReMaterialization to be more cunning about subregisters This allows rematerialization during register coalescing to handle more cases involving operations like SUBREG_TO_REG which might need to be rematerialized using sub-register indices. For example, code like: v1(GPR64):sub_32 = MOVZ something v2(GPR64) = COPY v1(GPR64) should be convertable to: v2(GPR64):sub_32 = MOVZ something but previously we just gave up in places like this llvm-svn: 182872	2013-05-29 19:32:06 +00:00
Manman Ren	4213c39e3c	LTO+Debug Info: revert r182791. Since the testing case uses ref_addr, which requires version 3+ to work, we will solve the dwarf version issue first. This patch also causes failures in one of the bots. I will update the patch accordingly in my next attempt. rdar://13926659 llvm-svn: 182867	2013-05-29 17:16:59 +00:00
Manman Ren	b5b5453e61	LTO+Debug Info: correctly emit inlined_subroutine when the inlined callee is from a different CU. We used to print out an error message and fail to generate inlined_subroutine. If we use ref_addr in the generated DWARF, the DWARF version should be 3 or above. rdar://13926659 llvm-svn: 182791	2013-05-28 19:01:58 +00:00
Benjamin Kramer	262b154247	Simplify code. No functionality change. llvm-svn: 182779	2013-05-28 16:39:36 +00:00
Benjamin Kramer	351d53c225	Remove double semicolons. llvm-svn: 182778	2013-05-28 16:31:26 +00:00
Preston Gurd	048f99de11	Convert sqrt functions into sqrt instructions when -ffast-math is in effect. When -ffast-math is in effect (on Linux, at least), clang defines __FINITE_MATH_ONLY__ > 0 when including <math.h>. This causes the preprocessor to include <bits/math-finite.h>, which renames the sqrt functions. For instance, "sqrt" is renamed as "__sqrt_finite". This patch adds the 3 new names in such a way that they will be treated as equivalent to their respective original names. llvm-svn: 182739	2013-05-27 15:44:35 +00:00
Andrew Trick	c66d26adf0	Fix PR16143: Insert DEBUG_VALUE before terminator. llvm-svn: 182717	2013-05-26 08:58:50 +00:00
Andrew Trick	e2431c64bc	Track IR ordering of SelectionDAG nodes 3/4. Remove the old IR ordering mechanism and switch to new one. Fix unit test failures. llvm-svn: 182704	2013-05-25 03:08:10 +00:00
Andrew Trick	ef9de2a739	Track IR ordering of SelectionDAG nodes 2/4. Change SelectionDAG::getXXXNode() interfaces as well as call sites of these functions to pass in SDLoc instead of DebugLoc. llvm-svn: 182703	2013-05-25 02:42:55 +00:00
Andrew Trick	175143bf88	Track IR ordering of SelectionDAG nodes 1/4. Use a field in the SelectionDAGNode object to track its IR ordering. This adds fields and utility classes without changing existing interfaces or functionality. llvm-svn: 182701	2013-05-25 02:20:36 +00:00
Michael J. Spencer	df1ecbd734	Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. llvm-svn: 182680	2013-05-24 22:23:49 +00:00
Jakob Stoklund Olesen	43711c51ec	Fix PR16110: Handle DBG_VALUE in ConnectedVNInfoEqClasses::Distribute(). Now that the LiveDebugVariables pass is running after register coalescing, the ConnectedVNInfoEqClasses class needs to deal with DBG_VALUE instructions. This only comes up when rematerialization during coalescing causes the remaining live range of a virtual register to separate into two connected components. llvm-svn: 182592	2013-05-23 17:02:23 +00:00
Benjamin Kramer	bc6666bedf	InlineSpiller: Store bucket pointers instead of iterators. Lets us use a SetVector instead of an explicit set + vector combination. llvm-svn: 182586	2013-05-23 15:42:57 +00:00
David Blaikie	5174c84add	Solidify the assumption that a DW_TAG_subprogram's type is a DW_TAG_subroutine_type There were bits & pieces of code lying around that may've given the impression that debug info metadata supported the possibility that a subprogram's type could be specified by a non-subroutine type describing the return type of a void function. This support was incomplete & unnecessary. Asserts & API have been changed to make the desired usage more clear. llvm-svn: 182532	2013-05-22 23:22:18 +00:00
Chad Rosier	abdb1d69ab	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182531	2013-05-22 23:17:36 +00:00
Chad Rosier	682ae15bb9	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182527	2013-05-22 22:36:55 +00:00
Chad Rosier	c7505ef8ba	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182526	2013-05-22 22:26:05 +00:00
Adrian Prantl	0d1e5592a6	Unify formatting of debug output. llvm-svn: 182495	2013-05-22 18:02:19 +00:00
Justin Holewinski	fff1f5f5e2	Drop @llvm.annotation and @llvm.ptr.annotation intrinsics during codegen. The intrinsic calls are dropped, but the annotated value is propagated. Fixes PR 15253 Original patch by Zeng Bin! llvm-svn: 182387	2013-05-21 14:37:16 +00:00
Benjamin Kramer	8aaf197990	DAGCombine: Avoid an edge case where it tried to create an i0 type for (x & 0) == 0. Fixes PR16083. llvm-svn: 182357	2013-05-21 08:51:09 +00:00
Manman Ren	9d4c735885	Dwarf: use a single line table to generate assembly when .loc is used. This is to fix PR15408 where an undefined symbol Lline_table_start1 is used. Since we do not generate the debug_line section when .loc is used, Lline_table_start1 is not emitted and we can't refer to it when calculating at_stmt_list for a compile unit. llvm-svn: 182344	2013-05-21 00:57:22 +00:00
David Blaikie	e63d5d1633	PR14606: Debug Info for namespace aliases/DW_TAG_imported_module This resolves the last of the PR14606 failures in the GDB 7.5 test suite by implementing an optional name field for DW_TAG_imported_modules/DIImportedEntities and using that to implement C++ namespace aliases (eg: "namespace X = Y;"). llvm-svn: 182328	2013-05-20 22:50:35 +00:00
Bill Wendling	eda5418e89	The DWARF EH pass doesn't need the TargetMachine, only the TargetLoweringBase like the other EH passes. llvm-svn: 182321	2013-05-20 21:54:18 +00:00
Bill Wendling	47447589c9	No need to store the TargetMachine variable in this class. llvm-svn: 182317	2013-05-20 21:28:28 +00:00
Matt Arsenault	75865923c9	Add LLVMContext argument to getSetCCResultType llvm-svn: 182180	2013-05-18 00:21:46 +00:00
Matt Arsenault	04126234e5	Replace redundant code Use EVT::changeExtendedVectorElementTypeToInteger instead of doing the same thing that it does llvm-svn: 182165	2013-05-17 21:43:43 +00:00
Matt Arsenault	52ddb7bcdd	Add missing -- C++ -- to headers llvm-svn: 182164	2013-05-17 21:43:39 +00:00
Adrian Prantl	9c93059aa4	Generate debug info for by-value struct args even if they are not used. radar://problem/13865940 llvm-svn: 182062	2013-05-16 23:44:12 +00:00
Rafael Espindola	b08d2c2db0	Remove addFrameMove. Now that we have good testing, remove addFrameMove and create cfi instructions directly. llvm-svn: 182052	2013-05-16 21:02:15 +00:00
Benjamin Kramer	fc88c3761f	DAGCombine: Also shrink eq compares where the constant is exactly as large as the smaller type. if ((x & 255) == 255) before: movzbl %al, %eax cmpl $255, %eax after: cmpb $-1, %al llvm-svn: 182038	2013-05-16 18:47:58 +00:00
Jakob Stoklund Olesen	9ae96c7aab	Add TargetRegisterInfo::getCoveringLanes(). This lane mask provides information about which register lanes completely cover super-registers. See the block comment before getCoveringLanes(). llvm-svn: 182034	2013-05-16 18:03:08 +00:00
Hal Finkel	1f6a7f53d8	Fix legalization of SETCC with promoted integer intrinsics If the input operands to SETCC are promoted, we need to make sure that we either use the promoted form of both operands (or neither); a mixture is not allowed. This can happen, for example, if a target has a custom promoted i1-returning intrinsic (where i1 is not a legal type). In this case, we need to use the promoted form of both operands. This change only augments the behavior of the existing logic in the case where the input types (which may or may not have already been legalized) disagree, and should not affect existing target code because this case would otherwise cause an assert in the SETCC operand promotion code. This will be covered by (essentially all of the) tests for the new PPCCTRLoops infrastructure. llvm-svn: 181926	2013-05-15 21:37:27 +00:00
Derek Schuff	d2c42d766d	Fix miscompile due to StackColoring incorrectly merging stack slots (PR15707) IR optimisation passes can result in a basic block that contains: llvm.lifetime.start(%buf) ... llvm.lifetime.end(%buf) ... llvm.lifetime.start(%buf) Before this change, calculateLiveIntervals() was ignoring the second lifetime.start() and was regarding %buf as being dead from the lifetime.end() through to the end of the basic block. This can cause StackColoring to incorrectly merge %buf with another stack slot. Fix by removing the incorrect Starts[pos].isValid() and Finishes[pos].isValid() checks. Just doing: Starts[pos] = Indexes->getMBBStartIdx(MBB); Finishes[pos] = Indexes->getMBBEndIdx(MBB); unconditionally would be enough to fix the bug, but it causes some test failures due to stack slots not being merged when they were before. So, in order to keep the existing tests passing, treat LiveIn and LiveOut separately rather than approximating the live ranges by merging LiveIn and LiveOut. This fixes PR15707. Patch by Mark Seaborn. llvm-svn: 181922	2013-05-15 21:15:09 +00:00
David Blaikie	041f1aa3e2	Use only explicit bool conversion operators BitVector/SmallBitVector::reference::operator bool remain implicit since they model more exactly a bool, rather than something else that can be boolean tested. The most common (non-buggy) case are where such objects are used as return expressions in bool-returning functions or as boolean function arguments. In those cases I've used (& added if necessary) a named function to provide the equivalent (or sometimes negative, depending on convenient wording) test. One behavior change (YAMLParser) was made, though no test case is included as I'm not sure how to reach that code path. Essentially any comparison of llvm::yaml::document_iterators would be invalid if neither iterator was at the end. This helped uncover a couple of bugs in Clang - test cases provided for those in a separate commit along with similar changes to `operator bool` instances in Clang. llvm-svn: 181868	2013-05-15 07:36:59 +00:00
Eric Christopher	8fd7ab07ca	Make getCompileUnit non-const and return the current DIE if it happens to be a compile unit. Noticed on inspection and tested via calling on a newly created compile unit. No functional change. llvm-svn: 181835	2013-05-14 21:33:10 +00:00
Kai Nacke	9a224ced0f	Add bitcast to store of personality function. The personality function is user defined and may have an arbitrary result type. The code assumes always i8. This results in an assertion failure if a different type is used. A bitcast to i8 is added to prevent this failure. Reviewed by: Renato Golin, Bob Wilson llvm-svn: 181802	2013-05-14 16:30:51 +00:00
Akira Hatanaka	1f24e6a6a2	StackColoring: don't clear an instruction's mem operand if the underlying object is a PseudoSourceValue and PseudoSourceValue::isConstant returns true (i.e., points to memory that has a constant value). llvm-svn: 181751	2013-05-14 01:42:44 +00:00
David Blaikie	7b770c6aed	Assert that DIEEntries are constructed with non-null DIEs This just brings a crash a little further forward from DWARF emission to DIE construction to make errors easier to diagnose. llvm-svn: 181748	2013-05-14 00:35:19 +00:00
Bob Wilson	c5c0823724	Remove redundant variable introduced by r181682. llvm-svn: 181721	2013-05-13 19:02:31 +00:00
Hao Liu	bc60196951	Fix PR15950 A bug in DAG Combiner about undef mask llvm-svn: 181682	2013-05-13 02:07:05 +00:00
Rafael Espindola	227144c23c	Remove the MachineMove class. It was just a less powerful and more confusing version of MCCFIInstruction. A side effect is that, since MCCFIInstruction uses dwarf register numbers, calls to getDwarfRegNum are pushed out, which should allow further simplifications. I left the MachineModuleInfo::addFrameMove interface unchanged since this patch was already fairly big. llvm-svn: 181680	2013-05-13 01:16:13 +00:00
Benjamin Kramer	63e39eb09d	StringRefize some debug accel table bits. llvm-svn: 181663	2013-05-11 18:24:28 +00:00
Rafael Espindola	1b09836bc3	Change getFrameMoves to return a const reference. To add a frame now there is a dedicated addFrameMove which also takes care of constructing the move itself. llvm-svn: 181657	2013-05-11 02:38:11 +00:00
Rafael Espindola	639890222e	Remove more dead code. llvm-svn: 181656	2013-05-11 02:24:41 +00:00
David Blaikie	a1e813dcd4	PR14492: Debug Info: Support for values of non-integer non-type template parameters. This is only tested for global variables at the moment (& includes tests for the unnamed parameter case, since apparently this entire function was completely untested previously) llvm-svn: 181632	2013-05-10 21:52:07 +00:00
Benjamin Kramer	a5d59333b3	DAGCombiner: Generate a correct constant for vector types when folding (xor (and)) into (and (not)). PR15948. llvm-svn: 181597	2013-05-10 14:09:52 +00:00
Owen Anderson	32baf99b1d	Teach SelectionDAG to constant fold all-constant FMA nodes the same way that it constant folds FADD, FMUL, etc. llvm-svn: 181555	2013-05-09 22:27:13 +00:00
Eric Christopher	f20ff979e9	Revert "Make sure debug info contains linkage names (DW_AT_MIPS_linkage_name)" temporarily while investigating gdb.cp/templates.exp. This reverts commit r181471. llvm-svn: 181496	2013-05-09 00:42:33 +00:00
Eric Christopher	697fa1c8be	Make sure debug info contains linkage names (DW_AT_MIPS_linkage_name) for constructors and destructors since the original declaration given by the AT_specification both won't and can't. Patch by Yacine Belkadi, I've cleaned up the testcases. llvm-svn: 181471	2013-05-08 21:23:22 +00:00
David Majnemer	386ab7f872	DAGCombiner: Simplify inverted bit tests Fold (xor (and x, y), y) -> (and (not x), y) This removes an opportunity for a constant to appear twice. llvm-svn: 181395	2013-05-08 06:44:42 +00:00
David Blaikie	3b6038b6f3	Debug Info: Support DW_TAG_imported_declaration This provides basic functionality for imported declarations. For subprograms and types some amount of lazy construction is supported (so the definition of a function can proceed the using declaration), but it still doesn't handle declared-but-not-defined functions (since we don't generally emit function declarations). Variable support is really rudimentary at the moment - simply looking up the existing definition with no support for out of order (declaration, imported_module, then definition). llvm-svn: 181392	2013-05-08 06:01:41 +00:00
David Blaikie	4dd2de7ae7	Finish renaming constructImportedModuleDIE to constructImportedEntityDIE llvm-svn: 181391	2013-05-08 06:01:38 +00:00
Eric Christopher	c57baeeee0	Pass the MDNode in and do the insertion at compile unit creation time instead of relying upon an extra call to finish initializing. llvm-svn: 181383	2013-05-08 00:58:51 +00:00
Eric Christopher	6156011ee8	Typo. llvm-svn: 181378	2013-05-08 00:11:10 +00:00
David Blaikie	1fd4365f91	Rename DIImportedModule to DIImportedEntity and allow imported declarations DIBuilder::createImportedDeclaration isn't fully plumbed through (note, lacking in AsmPrinter/DwarfDebug support) but this seemed like a sufficiently useful division of code to make the subsequent patch(es) easier to follow. llvm-svn: 181364	2013-05-07 21:35:53 +00:00
Rafael Espindola	9a383405a7	Remove exception handling support from the old JIT. llvm-svn: 181354	2013-05-07 20:53:59 +00:00
Matt Arsenault	a5733dc97e	Fix vselect when getSetCCResultType returns a different type from the operands llvm-svn: 181348	2013-05-07 20:24:18 +00:00
David Blaikie	6baa776173	Debug Info: Fix for break due to r181271 Apparently we didn't keep an association of Compile Unit metadata nodes to DIEs so looking up that parental context failed & thus caused no DW_TAG_imported_modules to be emitted at the CU scope. Fix this by adding the mapping & sure up the test case to verify this. llvm-svn: 181339	2013-05-07 17:57:13 +00:00
Timur Iskhodzhanov	ec4afe6680	Fix the VS2010 build broken by r181271 llvm-svn: 181296	2013-05-07 07:47:47 +00:00
David Blaikie	684fc5331e	DebugInfo: Support imported modules in lexical blocks llvm-svn: 181271	2013-05-06 23:33:07 +00:00
Eric Christopher	6c6de847a8	Remove unnecessary instance variable and rework logic accordingly. llvm-svn: 181227	2013-05-06 17:50:50 +00:00
Eric Christopher	f0303324be	Grammar. llvm-svn: 181226	2013-05-06 17:50:46 +00:00
Michael Kuperstein	ac868757d0	Fix slightly too aggressive conact_vector optimization. (Would sometimes optimize away conacts used to extend a vector with undef values) llvm-svn: 181186	2013-05-06 08:06:13 +00:00
Evan Cheng	8b8e8d88ff	Teach if-converter to avoid removing BBs whose addresses are takne. rdar://13782395 llvm-svn: 181160	2013-05-05 18:03:49 +00:00
Benjamin Kramer	391f5a6e21	InlineSpiller: Remove quadratic behavior. No functionality change. llvm-svn: 181149	2013-05-05 11:29:14 +00:00
Stepan Dyatkovskiy	8c02c98259	For ARM backend, fixed "byval" attribute support. Now even the small structures could be passed within byval (small enough to be stored in GPRs). In regression tests next function prototypes are checked: PR15293: %artz = type { i32 } define void @foo(%artz* byval %s) define void @foo2(%artz* byval %s, i32 %p, %artz* byval %s2) foo: "s" stored in R0 foo2: "s" stored in R0, "s2" stored in R2. Next AAPCS rules are checked: 5.5 Parameters Passing, C.4 and C.5, "ParamSize" is parameter size in 32bit words: -- NSAA != 0, NCRN < R4 and NCRN+ParamSize > R4. Parameter should be sent to the stack; NCRN := R4. -- NSAA != 0, and NCRN < R4, NCRN+ParamSize < R4. Parameter stored in GPRs; NCRN += ParamSize. llvm-svn: 181148	2013-05-05 07:48:36 +00:00
Dmitri Gribenko	3238fb7595	Add ArrayRef constructor from None, and do the cleanups that this constructor enables Patch by Robert Wilhelm. llvm-svn: 181138	2013-05-05 00:40:33 +00:00
Evan Cheng	f85a76f477	TiedTo flag can now be placed on implicit operands. isTwoAddrUse() should look at all of the operands. Previously it was skipping over implicit operands which cause infinite looping when the two-address pass try to reschedule a two-address instruction below the kill of tied operand. I'm unable to come up with a reasonably sized test case. rdar://13747577 llvm-svn: 180906	2013-05-02 02:07:32 +00:00
Bill Wendling	8f2e6feb8e	Revert r180737. The companion patch was reverted, and this is not relevant right now. llvm-svn: 180889	2013-05-01 22:32:08 +00:00
Filip Pizlo	dec20e43c0	This patch breaks up Wrap.h so that it does not have to include all of the things, and renames it to CBindingWrapping.h. I also moved CBindingWrapping.h into Support/. This new file just contains the macros for defining different wrap/unwrap methods. The calls to those macros, as well as any custom wrap/unwrap definitions (like for array of Values for example), are put into corresponding C++ headers. Doing this required some #include surgery, since some .cpp files relied on the fact that including Wrap.h implicitly caused the inclusion of a bunch of other things. This also now means that the C++ headers will include their corresponding C API headers; for example Value.h must include llvm-c/Core.h. I think this is harmless, since the C API headers contain just external function declarations and some C types, so I don't believe there should be any nasty dependency issues here. llvm-svn: 180881	2013-05-01 20:59:00 +00:00
Chad Rosier	8e4824f350	[inline asm] Return an undef SDValue of the expected value type, rather than report a fatal error. This allows us to continue processing the translation unit. Test case to come on the clang side because we need an inline asm diagnostics handler in place. rdar://13446483 llvm-svn: 180873	2013-05-01 19:49:26 +00:00
Nadav Rotem	e5a2dda372	Optimize away nop CONCAT_VECTOR nodes. Optimize CONCAT_VECTOR nodes that merge EXTRACT_SUBVECTOR values that extract from the same vector. rdar://13402653 PR15866 llvm-svn: 180871	2013-05-01 19:18:51 +00:00
Stephen Lin	699808ceb2	Only pass 'returned' to target-specific lowering code when the value of entire register is guaranteed to be preserved. llvm-svn: 180825	2013-04-30 22:49:28 +00:00
Adrian Prantl	a2888e71eb	Temporarily revert "Change the informal convention of DBG_VALUE so that we can express a" because it breaks some buildbots. This reverts commit 180816. llvm-svn: 180819	2013-04-30 22:35:14 +00:00
Adrian Prantl	9a576644e4	Change the informal convention of DBG_VALUE so that we can express a register-indirect address with an offset of 0. It used to be that a DBG_VALUE is a register-indirect value if the offset (operand 1) is nonzero. The new convention is that a DBG_VALUE is register-indirect if the first operand is a register and the second operand is an immediate. For plain registers use the combination reg, reg. rdar://problem/13658587 llvm-svn: 180816	2013-04-30 22:16:46 +00:00
Andrew Trick	dd77014acc	MI Sched: revert a minor heuristic that snuck in with -misched-vcopy. I'll fix the heuristic in a general way in a follow-up commit. llvm-svn: 180815	2013-04-30 22:10:59 +00:00
Hal Finkel	7153251ab5	LocalStackSlotAllocation improvements First, taking advantage of the fact that the virtual base registers are allocated in order of the local frame offsets, remove the quadratic register-searching behavior. Because of the ordering, we only need to check the last virtual base register created. Second, store the frame index in the FrameRef structure, and get the frame index and the local offset from this structure at the top of the loop iteration. This allows us to de-nest the loops in insertFrameReferenceRegisters (and I think makes the code cleaner). I also moved the needsFrameBaseReg check into the first loop over instructions so that we don't bother pushing FrameRefs for instructions that don't want a virtual base register anyway. Lastly, and this is the only functionality change, avoid the creation of single-use virtual base registers. These are currently not useful because, in general, they end up replacing what would be one r+r instruction with an add and a r+i instruction. Committing this removes the XFAIL in CodeGen/PowerPC/2007-09-07-LoadStoreIdxForms.ll Jim has okayed this off-list. llvm-svn: 180799	2013-04-30 20:04:37 +00:00
Bill Wendling	fb7e32ebd6	Emit the TLS initialization function pointers into the correct section. The `llvm.tls_init_funcs' (created by the front-end) holds pointers to the TLS initialization functions. These need to be placed into the correct section so that they are run before `main()'. <rdar://problem/13733006> llvm-svn: 180737	2013-04-29 22:25:40 +00:00
Andrew Trick	85058af650	Generalize the MachineTraceMetrics public API. Naturally, we should be able to pass in extra instructions, not just extra blocks. llvm-svn: 180667	2013-04-27 03:54:20 +00:00
Eric Christopher	203e12bf9e	Use the target triple from the target machine rather than the module to determine whether or not we're on a darwin platform for debug code emitting. Solves the problem of a module with no triple on the command line and no triple in the module using non-gdb ok features on darwin. Fix up the member-pointers test to check the correct things for cross platform (DW_FORM_flag is a good prefix). Unfortunately no testcase because I have no ideas how to test something without a triple and without a triple in the module yet check precisely on two platforms. Ideas welcome. llvm-svn: 180660	2013-04-27 01:07:52 +00:00
Adrian Prantl	d4c0dd4776	Cleanup and document MachineLocation. Clarify documentation and API to make the difference between register and register-indirect addressed locations more explicit. Put in a comment to point out that with the current implementation we cannot specify a register-indirect location with offset 0 (a breg 0 in DWARF). No functionality change intended. rdar://problem/13658587 llvm-svn: 180641	2013-04-26 21:57:17 +00:00
Bill Wendling	55a9c97c9c	Micro-optimization TLVs probably won't be as common as the other types of variables. Check for them last before defaulting to "DATA". llvm-svn: 180631	2013-04-26 21:15:08 +00:00
Silviu Baranga	af7e8c367f	Re-write the address propagation code for pre-indexed loads/stores to take into account some previously misssed cases (PRE_DEC addressing mode, the offset and base address are swapped, etc). This should fix PR15581. llvm-svn: 180609	2013-04-26 15:52:24 +00:00
Benjamin Kramer	d56ffc709d	DAGCombiner: Canonicalize vector integer abs in the same way we do it for scalars. This already helps SSE2 x86 a lot because it lacks an efficient way to represent a vector select. The long term goal is to enable the backend to match a canonicalized pattern into a single instruction (e.g. vabs or pabs). llvm-svn: 180597	2013-04-26 09:19:19 +00:00
Reid Kleckner	d973ca3c51	[mc-coff] Forward Linker Option flags into the .drectve section Summary: This is modelled on the Mach-O linker options implementation and should support a Clang implementation of #pragma comment(lib/linker). Reviewers: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D724 llvm-svn: 180569	2013-04-25 19:34:41 +00:00
Silviu Baranga	4ad2bc5963	Fix constant folding for one lane vector types. Constant folding one lane vector types not returns a vector instead of a scalar. llvm-svn: 180254	2013-04-25 09:32:33 +00:00
Andrew Trick	2e87517144	Fix for r180193 - MI Sched: eliminate local vreg. Fixes PR15838. Need to check for blocks with nothing but dbg.value. I'm not sure how to force this situation with a unit test. I tried to reduce the test case in PR15838 (1k lines of metadata) but gave up. llvm-svn: 180227	2013-04-24 23:19:56 +00:00
Chad Rosier	108d5a61b7	[inline asm] Fix a crasher for an invalid value type/register class. rdar://13731657 llvm-svn: 180226	2013-04-24 22:53:10 +00:00
Andrew Trick	85a1d4cbc0	MI Sched: eliminate local vreg copies. For now, we just reschedule instructions that use the copied vregs and let regalloc elliminate it. I would really like to eliminate the copies on-the-fly during scheduling, but we need a complete implementation of repairIntervalsInRange() first. The general strategy is for the register coalescer to eliminate as many global copies as possible and shrink live ranges to be extended-basic-block local. The coalescer should not have to worry about resolving local copies (e.g. it shouldn't attemp to reorder instructions). The scheduler is a much better place to deal with local interference. The coalescer side of this equation needs work. llvm-svn: 180193	2013-04-24 15:54:43 +00:00
Andrew Trick	608a698cdf	Register Coalescing: add a flag to disable rescheduling. When MachineScheduler is enabled, this functionality can be removed. Until then, provide a way to disable it for test cases and designing MachineScheduler heuristics. llvm-svn: 180192	2013-04-24 15:54:39 +00:00
Andrew Trick	7c791a3dc4	MI Sched: regpressure tracing. llvm-svn: 180191	2013-04-24 15:54:36 +00:00
Eric Christopher	4eb5eb5bc8	Formatting. llvm-svn: 180186	2013-04-24 12:56:18 +00:00
Owen Anderson	2d4cca35c3	DAGCombine should not aggressively fold SEXT(VSETCC(...)) into a wider VSETCC without first checking the target's vector boolean contents. This exposed an issue with PowerPC AltiVec where it appears it was setting the wrong vector boolean contents. The included change fixes the PowerPC tests, and was OK'd by Hal. llvm-svn: 180129	2013-04-23 18:09:28 +00:00
Stephen Lin	6c70dc7842	Add some constraints to use of 'returned': 1) Disallow 'returned' on parameter that is also 'sret' (no sensible semantics, as far as I can tell). 2) Conservatively disallow tail calls through 'returned' parameters that also are 'zext' or 'sext' (for consistency with treatment of other zero-extending and sign-extending operations in tail call position detection...can be revised later to handle situations that can be determined to be safe). This is a new attribute that is not yet used, so there is no impact. llvm-svn: 180118	2013-04-23 16:31:56 +00:00
Matt Arsenault	034ca0fe41	Remove unused DwarfSectionOffsetDirective string The value isn't actually used, and setting it emits a COFF specific directive. llvm-svn: 180064	2013-04-22 22:49:11 +00:00
Eric Christopher	04d4e9312c	Move C++ code out of the C headers and into either C++ headers or the C++ files themselves. This enables people to use just a C compiler to interoperate with LLVM. llvm-svn: 180063	2013-04-22 22:47:22 +00:00
Eli Bendersky	58b04b7e2e	Optimize MachineBasicBlock::getSymbol by caching the symbol. Since the symbol name computation is expensive, this helps save about 25% of the time spent in this function. llvm-svn: 180049	2013-04-22 21:21:08 +00:00
Rafael Espindola	74f2e46eef	Clarify that llvm.used can contain aliases. Also add a check for llvm.used in the verifier and simplify clients now that they can assume they have a ConstantArray. llvm-svn: 180019	2013-04-22 14:58:02 +00:00
Eric Christopher	44c6aa670f	Tidy. llvm-svn: 180000	2013-04-22 07:51:08 +00:00
Eric Christopher	25e3509c78	Update comment. Whitespace. llvm-svn: 179999	2013-04-22 07:47:40 +00:00
David Blaikie	f55abeaf4c	Revert "Revert "PR14606: debug info imported_module support"" This reverts commit r179840 with a fix to test/DebugInfo/two-cus-from-same-file.ll I'm not sure why that test only failed on ARM & MIPS and not X86 Linux, even though the debug info was clearly invalid on all of them, but this ought to fix it. llvm-svn: 179996	2013-04-22 06:12:31 +00:00
Jim Grosbach	563983c8a3	Legalize vector truncates by parts rather than just splitting. Rather than just splitting the input type and hoping for the best, apply a bit more cleverness. Just splitting the types until the source is legal often leads to an illegal result time, which is then widened and a scalarization step is introduced which leads to truly horrible code generation. With the loop vectorizer, these sorts of operations are much more common, and so it's worth extra effort to do them well. Add a legalization hook for the operands of a TRUNCATE node, which will be encountered after the result type has been legalized, but if the operand type is still illegal. If simple splitting of both types ends up with the result type of each half still being legal, just do that (v16i16 -> v16i8 on ARM, for example). If, however, that would result in an illegal result type (v8i32 -> v8i8 on ARM, for example), we can get more clever with power-two vectors. Specifically, split the input type, but also widen the result element size, then concatenate the halves and truncate again. For example on ARM, To perform a "%res = v8i8 trunc v8i32 %in" we transform to: %inlo = v4i32 extract_subvector %in, 0 %inhi = v4i32 extract_subvector %in, 4 %lo16 = v4i16 trunc v4i32 %inlo %hi16 = v4i16 trunc v4i32 %inhi %in16 = v8i16 concat_vectors v4i16 %lo16, v4i16 %hi16 %res = v8i8 trunc v8i16 %in16 This allows instruction selection to generate three VMOVN instructions instead of a sequences of moves, stores and loads. Update the ARMTargetTransformInfo to take this improved legalization into account. Consider the simplified IR: define <16 x i8> @test1(<16 x i32>* %ap) { %a = load <16 x i32>* %ap %tmp = trunc <16 x i32> %a to <16 x i8> ret <16 x i8> %tmp } define <8 x i8> @test2(<8 x i32>* %ap) { %a = load <8 x i32>* %ap %tmp = trunc <8 x i32> %a to <8 x i8> ret <8 x i8> %tmp } Previously, we would generate the truly hideous: .syntax unified .section __TEXT,__text,regular,pure_instructions .globl _test1 .align 2 _test1: @ @test1 @ BB#0: push {r7} mov r7, sp sub sp, sp, #20 bic sp, sp, #7 add r1, r0, #48 add r2, r0, #32 vld1.64 {d24, d25}, [r0:128] vld1.64 {d16, d17}, [r1:128] vld1.64 {d18, d19}, [r2:128] add r1, r0, #16 vmovn.i32 d22, q8 vld1.64 {d16, d17}, [r1:128] vmovn.i32 d20, q9 vmovn.i32 d18, q12 vmov.u16 r0, d22[3] strb r0, [sp, #15] vmov.u16 r0, d22[2] strb r0, [sp, #14] vmov.u16 r0, d22[1] strb r0, [sp, #13] vmov.u16 r0, d22[0] vmovn.i32 d16, q8 strb r0, [sp, #12] vmov.u16 r0, d20[3] strb r0, [sp, #11] vmov.u16 r0, d20[2] strb r0, [sp, #10] vmov.u16 r0, d20[1] strb r0, [sp, #9] vmov.u16 r0, d20[0] strb r0, [sp, #8] vmov.u16 r0, d18[3] strb r0, [sp, #3] vmov.u16 r0, d18[2] strb r0, [sp, #2] vmov.u16 r0, d18[1] strb r0, [sp, #1] vmov.u16 r0, d18[0] strb r0, [sp] vmov.u16 r0, d16[3] strb r0, [sp, #7] vmov.u16 r0, d16[2] strb r0, [sp, #6] vmov.u16 r0, d16[1] strb r0, [sp, #5] vmov.u16 r0, d16[0] strb r0, [sp, #4] vldmia sp, {d16, d17} vmov r0, r1, d16 vmov r2, r3, d17 mov sp, r7 pop {r7} bx lr .globl _test2 .align 2 _test2: @ @test2 @ BB#0: push {r7} mov r7, sp sub sp, sp, #12 bic sp, sp, #7 vld1.64 {d16, d17}, [r0:128] add r0, r0, #16 vld1.64 {d20, d21}, [r0:128] vmovn.i32 d18, q8 vmov.u16 r0, d18[3] vmovn.i32 d16, q10 strb r0, [sp, #3] vmov.u16 r0, d18[2] strb r0, [sp, #2] vmov.u16 r0, d18[1] strb r0, [sp, #1] vmov.u16 r0, d18[0] strb r0, [sp] vmov.u16 r0, d16[3] strb r0, [sp, #7] vmov.u16 r0, d16[2] strb r0, [sp, #6] vmov.u16 r0, d16[1] strb r0, [sp, #5] vmov.u16 r0, d16[0] strb r0, [sp, #4] ldm sp, {r0, r1} mov sp, r7 pop {r7} bx lr Now, however, we generate the much more straightforward: .syntax unified .section __TEXT,__text,regular,pure_instructions .globl _test1 .align 2 _test1: @ @test1 @ BB#0: add r1, r0, #48 add r2, r0, #32 vld1.64 {d20, d21}, [r0:128] vld1.64 {d16, d17}, [r1:128] add r1, r0, #16 vld1.64 {d18, d19}, [r2:128] vld1.64 {d22, d23}, [r1:128] vmovn.i32 d17, q8 vmovn.i32 d16, q9 vmovn.i32 d18, q10 vmovn.i32 d19, q11 vmovn.i16 d17, q8 vmovn.i16 d16, q9 vmov r0, r1, d16 vmov r2, r3, d17 bx lr .globl _test2 .align 2 _test2: @ @test2 @ BB#0: vld1.64 {d16, d17}, [r0:128] add r0, r0, #16 vld1.64 {d18, d19}, [r0:128] vmovn.i32 d16, q8 vmovn.i32 d17, q9 vmovn.i16 d16, q8 vmov r0, r1, d16 bx lr llvm-svn: 179989	2013-04-21 23:47:41 +00:00
Jim Grosbach	d4db72db61	Tidy up comment grammar. llvm-svn: 179986	2013-04-21 21:23:01 +00:00
Tim Northover	16aba17024	Remove unused ShouldFoldAtomicFences flag. I think it's almost impossible to fold atomic fences profitably under LLVM/C++11 semantics. As a result, this is now unused and just cluttering up the target interface. llvm-svn: 179940	2013-04-20 12:32:43 +00:00
Tim Northover	a2b533906a	Remove unused MEMBARRIER DAG node; it's been replaced by ATOMIC_FENCE. llvm-svn: 179939	2013-04-20 12:32:17 +00:00
Stephen Lin	b8bd232a3d	Add CodeGen support for functions that always return arguments via a new parameter attribute 'returned', which is taken advantage of in target-independent tail call opportunity detection and in ARM call lowering (when placed on an integral first parameter). llvm-svn: 179925	2013-04-20 05:14:40 +00:00
Stephen Lin	ffc445492c	Allow tail call opportunity detection through nested and/or multiple iterations of extractelement/insertelement indirection llvm-svn: 179924	2013-04-20 04:27:51 +00:00
Eli Bendersky	e80691dc0a	Simplify the code in FastISel::tryToFoldLoad, add an assertion and fix a comment. llvm-svn: 179908	2013-04-19 23:26:18 +00:00
Eli Bendersky	90dd3e7dfd	Move TryToFoldFastISelLoad to FastISel, where it belongs. In general, I'm trying to move as much FastISel logic as possible out of the main path in SelectionDAGISel - intermixing them just adds confusion. llvm-svn: 179902	2013-04-19 22:29:18 +00:00
Michael Liao	b53d8963ce	ArrayRefize getMachineNode(). No functionality change. llvm-svn: 179901	2013-04-19 22:22:57 +00:00
Jakob Stoklund Olesen	e17c3fde6b	Add an MRI::verifyUseLists() function. This checks the sanity of the register use lists in the MI intermediate representation. llvm-svn: 179895	2013-04-19 21:40:57 +00:00
Eli Bendersky	dbeefaa86a	Use dbgs() consistently for -debug printouts llvm-svn: 179894	2013-04-19 21:37:07 +00:00
Eric Christopher	0e89ade8ff	Revert "PR14606: debug info imported_module support" This reverts commit r179836 as it seems to have caused test failures. llvm-svn: 179840	2013-04-19 07:47:16 +00:00
David Blaikie	88564f3cf7	PR14606: debug info imported_module support Adding another CU-wide list, in this case of imported_modules (since they should be relatively rare, it seemed better to add a list where each element had a "context" value, rather than add a (usually empty) list to every scope). This takes care of DW_TAG_imported_module, but to fully address PR14606 we'll need to expand this to cover DW_TAG_imported_declaration too. llvm-svn: 179836	2013-04-19 06:57:04 +00:00
Eli Bendersky	6084f45f38	Add some more stats for fast isel vs. SelectionDAG, w.r.t lowering function arguments in entry BBs. llvm-svn: 179824	2013-04-19 01:04:40 +00:00
Peter Collingbourne	2f495b93ee	Add support for subsections to the ELF assembler. Fixes PR8717. Differential Revision: http://llvm-reviews.chandlerc.com/D598 llvm-svn: 179725	2013-04-17 21:18:16 +00:00
Andy Gibbs	b23ea72e48	Replace uses of the deprecated std::auto_ptr with OwningPtr. This is a rework of the broken parts in r179373 which were subsequently reverted in r179374 due to incompatibility with C++98 compilers. This version should be ok under C++98. llvm-svn: 179520	2013-04-15 12:06:32 +00:00
Nadav Rotem	0db0690a70	Document the decision to assume that the cost of floats is twice as much as integers. llvm-svn: 179478	2013-04-14 05:55:18 +00:00
Andrew Trick	1f0bb69b6c	MI-Sched: DEBUG formatting. llvm-svn: 179452	2013-04-13 06:07:49 +00:00
Andrew Trick	be2bccbce9	MI-Sched cleanup. If an instruction has no valid sched class, do not attempt to check for a variant. llvm-svn: 179451	2013-04-13 06:07:45 +00:00
Andrew Trick	e833e1cd6e	MI-Sched: schedule physreg copies. The register allocator expects minimal physreg live ranges. Schedule physreg copies accordingly. This is slightly tricky when they occur in the middle of the scheduling region. For now, this is handled by rescheduling the copy when its associated instruction is scheduled. Eventually we may instead bundle them, but only if we can preserve the bundles as parallel copies during regalloc. llvm-svn: 179449	2013-04-13 06:07:40 +00:00
Nadav Rotem	87a0af6e1b	CostModel: increase the default cost of supported floating point operations from 1 to two. Fixed a few tests that changes because now the cost of one insert + a vector operation on two doubles is lower than two scalar operations on doubles. llvm-svn: 179413	2013-04-12 21:15:03 +00:00
Benjamin Kramer	dae0851237	Revert broken pieces of r179373. You can't copy an OwningPtr, and move semantics aren't available in C++98. llvm-svn: 179374	2013-04-12 12:13:51 +00:00
Andy Gibbs	95777550a9	Replace uses of the deprecated std::auto_ptr with OwningPtr. llvm-svn: 179373	2013-04-12 10:56:28 +00:00
Nadav Rotem	c0adc9fd91	Don't disable block layout when forcing block alignment. llvm-svn: 179355	2013-04-12 01:24:16 +00:00
Nadav Rotem	c3b0f50ac2	Add a flag to align all basic blocks in the function. When debugging performance regressions we often ask ourselves if the regression that we see is due to poor isel/sched/ra or due to some micro-architetural problem. When comparing two code sequences one good way to rule out front-end bottlenecks (and other the issues) is to force code alignment. This pass adds a flag that forces the alignment of all of the basic blocks in the program. llvm-svn: 179353	2013-04-12 00:48:32 +00:00
Benjamin Kramer	e7c45bc670	Add braces around \|\| in && to pacify GCC. llvm-svn: 179275	2013-04-11 11:57:01 +00:00
Hal Finkel	95081bff72	Manually remove successors in if conversion when CopyAndPredicateBlock is used In the simple and triangle if-conversion cases, when CopyAndPredicateBlock is used because the to-be-predicated block has other predecessors, we need to explicitly remove the old copied block from the successors list. Normally if conversion relies on TII->AnalyzeBranch combined with BB->CorrectExtraCFGEdges to cleanup the successors list, but if the predicated block contained an un-analyzable branch (such as a now-predicated return), then this will fail. These extra successors were causing a problem on PPC because it was causing later passes (such as PPCEarlyReturm) to leave dead return-only basic blocks in the code. llvm-svn: 179227	2013-04-10 22:05:25 +00:00
Andrew Trick	e220323c7f	Generalize the PassConfig API and remove addFinalizeRegAlloc(). The target hooks are getting out of hand. What does it mean to run before or after regalloc anyway? Allowing either Pass* or AnalysisID pass identification should make it much easier for targets to use the substitutePass and insertPass APIs, and create less need for badly named target hooks. llvm-svn: 179140	2013-04-10 01:06:56 +00:00
Eric Christopher	52ce7189c1	The .dwo section shouldn't contain the unrelocated values (and therefore not at all) of the pc or statement list. We also don't need to emit the compilation dir so save so space and time and don't bother. Fix up the testcase accordingly and verify that we don't emit the attributes or the items that they use. llvm-svn: 179114	2013-04-09 19:23:15 +00:00
Benjamin Kramer	bbae991db6	DAGCombiner: Fold a shuffle on CONCAT_VECTORS into a new CONCAT_VECTORS if possible. This pattern occurs in SROA output due to the way vector arguments are lowered on ARM. The testcase from PR15525 now compiles into this, which is better than the code we got with the old scalarrepl: _Store: ldr.w r9, [sp] vmov d17, r3, r9 vmov d16, r1, r2 vst1.8 {d16, d17}, [r0] bx lr Differential Revision: http://llvm-reviews.chandlerc.com/D647 llvm-svn: 179106	2013-04-09 17:41:43 +00:00
Eric Christopher	55863befd1	DW_FORM_sec_offset should be a relocation on platforms that use a relocation across sections. Do this for DW_AT_stmt list in the skeleton CU and check the relocations in the debug_info section. Add a FIXME for multiple CUs. llvm-svn: 178969	2013-04-07 03:43:09 +00:00
Nadav Rotem	c4bd84c1d5	typo llvm-svn: 178949	2013-04-06 04:24:12 +00:00
Manman Ren	5b22f9fe18	Dwarf: use utostr on CUID to append to SmallString. We used to do "SmallString += CUID", which is incorrect, since CUID will be truncated to a char. rdar://problem/13573833 llvm-svn: 178941	2013-04-06 01:02:38 +00:00
Hal Finkel	3005c299b5	Reapply r178845 with fix - Fix bug in PEI's virtual-register scavenging This fixes PEI as previously described, but correctly handles the case where the instruction defining the virtual register to be scavenged is the first in the block. Arnold provided me with a bugpoint-reduced test case, but even that seems too large to use as a regression test. If I'm successful in cleaning it up then I'll commit that as well. Original commit message: This change fixes a bug that I introduced in r178058. After a register is scavenged using one of the available spills slots the instruction defining the virtual register needs to be moved to after the spill code. The scavenger has already processed the defining instruction so that registers killed by that instruction are available for definition in that same instruction. Unfortunately, after this, the scavenger needs to iterate through the spill code and then visit, again, the instruction that defines the now-scavenged register. In order to avoid confusion, the register scavenger needs the ability to 'back up' through the spill code so that it can again process the instructions in the appropriate order. Prior to this fix, once the scavenger reached the just-moved instruction, it would assert if it killed any registers because, having already processed the instruction, it believed they were undefined. Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar for diagnosing the problem and testing this fix. llvm-svn: 178919	2013-04-05 22:31:56 +00:00
Bill Wendling	eb108bad50	Use the target options specified on a function to reset the back-end. During LTO, the target options on functions within the same Module may change. This would necessitate resetting some of the back-end. Do this for X86, because it's a Friday afternoon. llvm-svn: 178917	2013-04-05 21:52:40 +00:00
Hal Finkel	81c46d0809	Revert r178845 - Fix bug in PEI's virtual-register scavenging Reverting because this breaks one of the LTO builders. Original commit message: This change fixes a bug that I introduced in r178058. After a register is scavenged using one of the available spills slots the instruction defining the virtual register needs to be moved to after the spill code. The scavenger has already processed the defining instruction so that registers killed by that instruction are available for definition in that same instruction. Unfortunately, after this, the scavenger needs to iterate through the spill code and then visit, again, the instruction that defines the now-scavenged register. In order to avoid confusion, the register scavenger needs the ability to 'back up' through the spill code so that it can again process the instructions in the appropriate order. Prior to this fix, once the scavenger reached the just-moved instruction, it would assert if it killed any registers because, having already processed the instruction, it believed they were undefined. Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar for diagnosing the problem and testing this fix. llvm-svn: 178916	2013-04-05 21:30:40 +00:00
Hal Finkel	e6f48e4e2f	Fix bug in PEI's virtual-register scavenging This change fixes a bug that I introduced in r178058. After a register is scavenged using one of the available spills slots the instruction defining the virtual register needs to be moved to after the spill code. The scavenger has already processed the defining instruction so that registers killed by that instruction are available for definition in that same instruction. Unfortunately, after this, the scavenger needs to iterate through the spill code and then visit, again, the instruction that defines the now-scavenged register. In order to avoid confusion, the register scavenger needs the ability to 'back up' through the spill code so that it can again process the instructions in the appropriate order. Prior to this fix, once the scavenger reached the just-moved instruction, it would assert if it killed any registers because, having already processed the instruction, it believed they were undefined. Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar for diagnosing the problem and testing this fix. llvm-svn: 178845	2013-04-05 05:01:13 +00:00
Andrew Trick	80e66ce0b4	RegisterPressure heuristics currently require signed comparisons. llvm-svn: 178823	2013-04-05 00:31:34 +00:00
Andrew Trick	96ce3848d6	Disable DFSResult for ConvergingScheduler. For now, just save the compile time since the ConvergingScheduler heuristics don't use this analysis. We'll probably enable it later after compile-time investigation. llvm-svn: 178822	2013-04-05 00:31:31 +00:00
Andrew Trick	419d491747	MachineScheduler: format DEBUG output. I'm getting more serious about tuning and enabling on x86/ARM. Start by making the trace readable. llvm-svn: 178821	2013-04-05 00:31:29 +00:00
Arnold Schwaighofer	b977387112	CostModel: Add parameter to instruction cost to further classify operand values On certain architectures we can support efficient vectorized version of instructions if the operand value is uniform (splat) or a constant scalar. An example of this is a vector shift on x86. We can efficiently support for (i = 0 ; i < ; i += 4) w[0:3] = v[0:3] << <2, 2, 2, 2> but not for (i = 0; i < ; i += 4) w[0:3] = v[0:3] << x[0:3] This patch adds a parameter to getArithmeticInstrCost to further qualify operand values as uniform or uniform constant. Targets can then choose to return a different cost for instructions with such operand values. A follow-up commit will test this feature on x86. radar://13576547 llvm-svn: 178807	2013-04-04 23:26:21 +00:00
Manman Ren	bdcb4464e2	Debug Info: revert 178722 for now. There is a difference for FORM_ref_addr between DWARF 2 and DWARF 3+. Since Eric is against guarding DWARF 2 ref_addr with DarwinGDBCompat, we are still in discussion on how to handle this. The correct solution is to update our header to say version 4 instead of version 2 and update tool chains as well. rdar://problem/13559431 llvm-svn: 178806	2013-04-04 23:13:11 +00:00
Adrian Prantl	322f41d095	typo llvm-svn: 178804	2013-04-04 22:56:49 +00:00
Eli Bendersky	fc186358f2	Formatting llvm-svn: 178771	2013-04-04 18:03:41 +00:00
Manman Ren	5a15c9ed9f	Debug Info: according to DWARF 2, FORM_ref_addr the same size as an address on the target system. It was hard-coded to 4 bytes before. I can't get llvm to generate a ref_addr on a reasonably sized testing case. rdar://problem/13559431 llvm-svn: 178722	2013-04-04 00:22:54 +00:00
Bill Schmidt	92e26646bc	Fix PR15632: No support for ppcf128 floating-point remainder on PowerPC. For this we need to use a libcall. Previously LLVM didn't implement libcall support for frem, so I've added it in the usual straightforward manner. A test case from the bug report is included. llvm-svn: 178639	2013-04-03 13:05:44 +00:00
Eric Christopher	14c2067ca1	Fix grammar. llvm-svn: 178624	2013-04-03 05:29:58 +00:00
Eric Christopher	5590949f29	Remove ZeroOrMore from the option description. We don't need it here. llvm-svn: 178623	2013-04-03 05:26:07 +00:00
Jakob Stoklund Olesen	aeb69a5481	Allow MachineTraceMetrics to be used when the model has no resources. It it still possible to extract information from itineraries, for example. llvm-svn: 178582	2013-04-02 22:27:45 +00:00
Jakob Stoklund Olesen	8fbfc59164	Don't attempt MTM heuristics without a scheduling model present. This should fix the PPC buildbots. llvm-svn: 178558	2013-04-02 18:26:45 +00:00
Jakob Stoklund Olesen	3ca14772d0	Count processor resources individually in MachineTraceMetrics. The new instruction scheduling models provide information about the number of cycles consumed on each processor resource. This makes it possible to estimate ILP more accurately than simply counting instructions / issue width. The functions getResourceDepth() and getResourceLength() now identify the limiting processor resource, and return a cycle count based on that. This gives more precise resource information, particularly in traces that use one resource a lot more than others. llvm-svn: 178553	2013-04-02 17:49:51 +00:00
Arnold Schwaighofer	d6c6e868b2	DAGCombiner: Merge store/loads when we have extload/truncstores This is helps on architectures where i8,i16 are not legal but we have byte, and short loads/stores. Allowing us to merge copies like the one below on ARM. copy(char a, char b, int n) { do { int t0 = a[0]; int t1 = a[1]; b[0] = t0; b[1] = t1; radar://13536387 llvm-svn: 178546	2013-04-02 15:58:51 +00:00
Arnold Schwaighofer	6752366ed7	Merge load/store sequences with adresses: base + index + offset We would also like to merge sequences that involve a variable index like in the example below. int index = *idx++ int i0 = c[index+0]; int i1 = c[index+1]; b[0] = i0; b[1] = i1; By extending the parsing of the base pointer to handle dags that contain a base, index, and offset we can handle examples like the one above. The dag for the code above will look something like: (load (i64 add (i64 copyfromreg %c) (i64 signextend (i8 load %index)))) (load (i64 add (i64 copyfromreg %c) (i64 signextend (i32 add (i32 signextend (i8 load %index)) (i32 1))))) The code that parses the tree ignores the intermediate sign extensions. However, if there is a sign extension it needs to be on all indexes. (load (i64 add (i64 copyfromreg %c) (i64 signextend (add (i8 load %index) (i8 1)))) vs (load (i64 add (i64 copyfromreg %c) (i64 signextend (i32 add (i32 signextend (i8 load %index)) (i32 1))))) radar://13536387 llvm-svn: 178483	2013-04-01 18:12:58 +00:00
Benjamin Kramer	9335443236	DAGCombine: visitXOR can replace a node without returning it, bail out in that case. Fixes the crash reported in PR15608. llvm-svn: 178429	2013-03-30 21:28:18 +00:00
Eric Christopher	4887c8f4ff	Use SmallVectorImpl instead of SmallVector at the uses. llvm-svn: 178386	2013-03-29 23:34:06 +00:00
Eric Christopher	9c8414f84a	Use 12 as the magic number for our abbreviation data and our die values. A lot of DIEs have 10 attributes in C++ code (example clang), none had more than 12. Seems like a good default. llvm-svn: 178366	2013-03-29 20:23:06 +00:00
Eric Christopher	6be35037b5	Move the construction of the skeleton compile unit after the entire original compile unit has been constructed. llvm-svn: 178365	2013-03-29 20:23:02 +00:00
Benjamin Kramer	70671b9937	Remove the old CodePlacementOpt pass. It was superseded by MachineBlockPlacement and disabled by default since LLVM 3.1. llvm-svn: 178349	2013-03-29 17:14:24 +00:00
Nadav Rotem	6036f581aa	Fix a typo llvm-svn: 178346	2013-03-29 16:34:23 +00:00
Chad Rosier	dbac025d84	[fast-isel] Add a preemptive fix for the case where we fail to materialize an immediate in a register. I don't believe this should ever fail, but I see no harm in trying to make this code bullet proof. I've added an assert to ensure my assumtion is correct. If the assertion fires something is wrong and we should fix it, rather then just silently fall back to SelectionDAG isel. llvm-svn: 178305	2013-03-28 23:04:47 +00:00
Hal Finkel	35dd5c5932	Fix target-customized spilling in the register scavenger This is a follow-up to r178073 (which should actually make target-customized spilling work again). I still don't have a regression test for this (but it would be good to have one; Thumb 1 and Mips16 use this callback as well). Patch by Richard Sandiford. llvm-svn: 178137	2013-03-27 13:00:56 +00:00
Hal Finkel	1fa2f945ea	Fix the register scavenger for targets that provide custom spilling As pointed out by Richard Sandiford, my recent updates to the register scavenger broke targets that use custom spilling (because the new code assumed that if there were no valid spill slots, than spilling would be impossible). I don't have a test case, but it should be possible to create one for Thumb 1, Mips 16, etc. llvm-svn: 178073	2013-03-26 21:20:15 +00:00
Hal Finkel	4e05788cc3	Update PEI's virtual-register-based scavenging to support multiple simultaneous mappings The previous algorithm could not deal properly with scavenging multiple virtual registers because it kept only one live virtual -> physical mapping (and iterated through operands in order). Now we don't maintain a current mapping, but rather use replaceRegWith to completely remove the virtual register as soon as the mapping is established. In order to allow the register scavenger to return a physical register killed by an instruction for definition by that same instruction, we now call RS->forward(I) prior to eliminating virtual registers defined in I. This requires a minor update to forward to ignore virtual registers. These new features will be tested in forthcoming commits. llvm-svn: 178058	2013-03-26 18:56:54 +00:00
Michael Liao	bb05a1d7b5	Enhance folding of (extract_subvec (insert_subvec V1, V2, IIdx), EIdx) - Handle the case where the result of 'insert_subvect' is bitcasted before 'extract_subvec'. This removes the redundant insertf128/extractf128 pair on unaligned 256-bit vector load/store on vectors of non 64-bit integer. llvm-svn: 177945	2013-03-25 23:47:35 +00:00
Shuxin Yang	93b1f12ac1	Disable some unsafe-fp-math DAG-combine transformation after legalization. For instance, following transformation will be disabled: x + x + x => 3.0f * x; The problem of these transformations is that it introduces a FP constant, which following Instruction-Selection pass cannot handle. Reviewed by Nadav, thanks a lot! rdar://13445387 llvm-svn: 177933	2013-03-25 22:52:29 +00:00
Eric Christopher	3820184a8d	Couple more sets of tidying. llvm-svn: 177920	2013-03-25 21:26:36 +00:00
Eric Christopher	7f44037c2e	Formatting. llvm-svn: 177898	2013-03-25 20:05:35 +00:00
Duncan Sands	d58611a4cd	Teach cmake about the new Erlang GC files. llvm-svn: 177869	2013-03-25 14:12:21 +00:00
Yiannis Tsiouris	dbb4adf134	Add a GC plugin for Erlang llvm-svn: 177867	2013-03-25 13:47:46 +00:00
Owen Anderson	c81616b0a9	Remove the type legality check from the SelectionDAGBuilder when it lowers @llvm.fmuladd to ISD::FMA nodes. Performing this check unilaterally prevented us from generating FMAs when the incoming IR contained illegal vector types which would eventually be legalized to underlying types that did support FMA. For example, an @llvm.fmuladd on an OpenCL float16 should become a sequence of float4 FMAs, not float4 fmul+fadd's. NOTE: Because we still call the target-specific profitability hook, individual targets can reinstate the old behavior, if desired, by simply performing the legality check inside their callback hook. They can also perform more sophisticated legality checks, if, for example, some illegal vector types can be productively implemented as FMAs, but not others. llvm-svn: 177820	2013-03-23 08:26:53 +00:00
Hal Finkel	446122ed57	Fix comparison of mixed signedness 177774 broke the lld-x86_64-darwin11 builder; error: error: comparison of integers of different signs: 'int' and 'size_type' (aka 'unsigned long') for (SI = 0; SI < Scavenged.size(); ++SI) ~~ ^ ~~~~~~~~~~~~~~~~ Fix this by making SI also unsigned. llvm-svn: 177780	2013-03-23 00:07:29 +00:00
Hal Finkel	9e331c2f9c	Allow the register scavenger to spill multiple registers This patch lets the register scavenger make use of multiple spill slots in order to guarantee that it will be able to provide multiple registers simultaneously. To support this, the RS's API has changed slightly: setScavengingFrameIndex / getScavengingFrameIndex have been replaced by addScavengingFrameIndex / isScavengingFrameIndex / getScavengingFrameIndices. In forthcoming commits, the PowerPC backend will use this capability in order to implement the spilling of condition registers, and some special-purpose registers, without relying on r0 being reserved. In some cases, spilling these registers requires two GPRs: one for addressing and one to hold the value being transferred. llvm-svn: 177774	2013-03-22 23:32:27 +00:00
Hal Finkel	7dbe0f0636	Remove ScavengedRC from RegisterScavenging ScavengedRC was a dead private variable (set, but not otherwise used). No functionality change intended. llvm-svn: 177708	2013-03-22 07:27:44 +00:00
Matt Arsenault	4ab769f4b3	Fix missing std::. Not sure how this compiles for anyone else. llvm-svn: 177620	2013-03-21 00:57:21 +00:00
Justin Holewinski	7478f3d776	Make variable name more explicit and eliminate redundant lookup in SDNodeOrdering llvm-svn: 177600	2013-03-20 23:10:59 +00:00
Nadav Rotem	4536d582fd	When computing the demanded bits of Load SDNodes, make sure that we are looking at the loaded-value operand and not the ptr result (in case of pre-inc loads). rdar://13348420 llvm-svn: 177596	2013-03-20 22:53:44 +00:00
Christian Konig	ed34d0ef1a	Revert "pre-RA-sched: fix TargetOpcode usage" This reverts commit 06091513c283c863296f01cc7c2e86b56bb50d02. The code is obviously wrong, but the trivial fix causes inefficient code generation on X86. Somebody with more knowledge of the code needs to take a look here. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 177529	2013-03-20 15:43:00 +00:00
Justin Holewinski	c2d2c8939c	Move SDNode order propagation to SDNodeOrdering, which also fixes a missed case of order propagation during isel. Thanks Owen for the suggestion! llvm-svn: 177525	2013-03-20 14:51:01 +00:00
Christian Konig	9ce2d5b862	pre-RA-sched: fix TargetOpcode usage TargetOpcodes need to be treaded as Machine- and not ISD-Opcodes. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 177518	2013-03-20 13:49:22 +00:00
Justin Holewinski	d068943809	Propagate DAG node ordering during type legalization and instruction selection A node's ordering is only propagated during legalization if (a) the new node does not have an ordering (is not a CSE'd node), or (b) the new node has an ordering that is higher than the node being legalized. llvm-svn: 177465	2013-03-20 00:10:32 +00:00
Jakub Staszak	b6970267ad	Move #include of BitVector from .h to .cpp file. Also remove unneeded #include and forward declaration. llvm-svn: 177357	2013-03-18 23:45:45 +00:00
Jakub Staszak	26ac8a7b16	Add some constantness. llvm-svn: 177356	2013-03-18 23:40:46 +00:00
Hans Wennborg	7504cefa14	Fix integer comparison in DIEInteger::BestForm. The always-true "(int)Int == (signed)Int" comparison was found while experimenting with a potential new Clang warning. llvm-svn: 177290	2013-03-18 17:03:05 +00:00
Anton Korobeynikov	2810a0ab80	Windows TLS: Section name prefix to ensure correct order The linker sorts the .tls$<xyz> sections by name, and we need to make sure any extra sections we produce (e.g. for weak globals) always end up between .tls$AAA and .tls$ZZZ, even if the name starts with e.g. an underscore. Patch by David Nadlinger! llvm-svn: 177256	2013-03-18 08:10:10 +00:00
Andrew Trick	6057017c68	Change the default latency for implicit defs. Implicit defs are not currently positional and not modeled by the per-operand machine model. Unfortunately, we treat defs that are part of the architectural instruction description, like flags, the same as other implicit defs. Really, they should have a fixed MachineInstr layout and probably shouldn't be "implicit" at all. For now, we'll change the default latency to be the max operand latency. That will give flag setting operands full latency for x86 folded loads. Other kinds of "fake" implicit defs don't occur prior to regalloc anyway, and we would like them to go away postRegAlloc as well. llvm-svn: 177227	2013-03-16 18:58:57 +00:00
Hal Finkel	628ba12823	Move estimateStackSize from ARM into MachineFrameInfo This is a generic function (derived from PEI); moving it into MachineFrameInfo eliminates a current redundancy between the ARM and AArch64 backends, and will allow it to be used by the PowerPC target code. No functionality change intended. llvm-svn: 177111	2013-03-14 21:15:20 +00:00
Hal Finkel	5a765fddb0	Provide the register scavenger to processFunctionBeforeFrameFinalized Add the current PEI register scavenger as a parameter to the processFunctionBeforeFrameFinalized callback. This change is necessary in order to allow the PowerPC target code to set the register scavenger frame index after the save-area offset adjustments performed by processFunctionBeforeFrameFinalized. Only after these adjustments have been made is it possible to estimate the size of the stack frame. llvm-svn: 177108	2013-03-14 20:33:40 +00:00
Bill Wendling	965bd58902	Reset some of the target options which affect code generation. This doesn't reset all of the target options within the TargetOptions object. This is because some of those are ABI-specific and must be determined if it's okay to change those on the fly. llvm-svn: 176986	2013-03-13 22:26:59 +00:00
Manman Ren	11fec384de	No functionality change. Use unreachable in getCUOffset. llvm-svn: 176963	2013-03-13 18:41:27 +00:00
Manman Ren	723956fe1a	No functionality change. Use unreachable in getCompileUnit. llvm-svn: 176962	2013-03-13 18:33:41 +00:00
Jakob Stoklund Olesen	c1b566db1b	Check register classes also when changing them. We have the same assertion in createVirtualRegister. llvm-svn: 176959	2013-03-13 18:11:17 +00:00
Manman Ren	14a029d9f3	Debug Info: use DW_FORM_ref_addr instead of DW_FORM_ref4 if the referenced DIE belongs to a different compile unit. DW_FORM_ref_addr should be used for cross compile-unit reference. When compiling a large application, we got a dwarfdump verification error where abstract_origin points to nowhere. This error can't be reproduced on any testing case in MultiSource. We may have other cases where we use DW_FORM_ref4 unconditionally. rdar://problem/13370501 llvm-svn: 176882	2013-03-12 18:27:15 +00:00
Richard Relph	61046a9727	Avoid generating ISD::SELECT for vector operands to SIGN_EXTEND llvm-svn: 176881	2013-03-12 18:17:18 +00:00
David Blaikie	dc69ebb1fc	Remove support for versioned debug info. Versioned debug info support has been a burden to maintain & also compromised current debug info verification by causing test cases testing old debug info to remain rather than being updated to the latest. It also makes it hard to add or change the metadata schema by requiring various backwards-compatibility in the DI* hierarchy. So it's being removed in preparation for new changes to the schema to tidy up old/unnecessary fields and add new fields needed for new debug info (well, new to LLVM at least). The more surprising part of this is the changes to DI*::Verify - this became necessary due to the changes to AsmWriter. AsmWriter was relying on the version test to decide which bits of metadata were actually debug info when printing the comment annotations. Without the version information the tag numbers were too common & it would print debug info on random metadata that happened to start with an integer that matched a tag number. Instead this change makes the Verify functions more precise (just adding "number of operands" checks - not type checking those operands yet) & relies on that to decide which metadata is debug info metadata. llvm-svn: 176838	2013-03-11 23:39:23 +00:00
Nick Lewycky	48beb21185	Fix a crasher newly introduced in r176659/r176649, where fast-isel tries to lower an expect intrinsic that is a constant expression. llvm-svn: 176830	2013-03-11 21:44:37 +00:00
Jakub Staszak	df17ddd56b	Cleanup #includes. llvm-svn: 176787	2013-03-10 13:11:23 +00:00
Jakob Stoklund Olesen	1001673865	Remove wrong and unnecessary assertion. PHIs are allowed to have multiple operand pairs per predecessor, and this code works just fine when it happens. llvm-svn: 176734	2013-03-08 23:00:13 +00:00
Jan Wen Voung	7857a64909	Disable statistics on Release builds and move tests that depend on -stats. Summary: Statistics are still available in Release+Asserts (any +Asserts builds), and stats can also be turned on with LLVM_ENABLE_STATS. Move some of the FastISel stats that were moved under DEBUG() back out of DEBUG(), since stats are disabled across the board now. Many tests depend on grepping "-stats" output. Move those into a orig_dir/Stats/. so that they can be marked as unsupported when building without statistics. Differential Revision: http://llvm-reviews.chandlerc.com/D486 llvm-svn: 176733	2013-03-08 22:56:31 +00:00
Jakob Stoklund Olesen	8d1aaf21cf	Rewrite the physreg part of findLastUseBefore(). To find the last use of a register unit, start from the bottom and scan upwards until a user is found. <rdar://problem/13353090> llvm-svn: 176706	2013-03-08 18:08:57 +00:00
Benjamin Kramer	0f12a5c2a4	Remove default from fully covered switch. llvm-svn: 176703	2013-03-08 17:03:19 +00:00
Tom Stellard	d93ef7afaa	LegalizeDAG: Respect the result of TLI.getBooleanContents() when expanding SETCC llvm-svn: 176695	2013-03-08 15:37:02 +00:00
Tom Stellard	b1588fc057	DAGCombiner: Use correct value type for checking legality of BR_CC v3 LegalizeDAG.cpp uses the value of the comparison operands when checking the legality of BR_CC, so DAGCombiner should do the same. v2: - Expand more BR_CC value types for NVPTX v3: - Expand correct BR_CC value types for Hexagon, Mips, and XCore. llvm-svn: 176694	2013-03-08 15:36:57 +00:00
Andrew Trick	48f2a72435	Add -verify-misched option. This verifies live intervals both before and after scheduling. It's useful for anyone hacking on live interval update. Note that we don't yet pass verification all the time. We don't yet handle updating nonallocatable live intervals perfectly. llvm-svn: 176685	2013-03-08 05:40:34 +00:00
Bill Wendling	2d915e2c15	Revert r176154 in favor of a better approach. Code generation makes some basic assumptions about the IR it's been given. In particular, if there is only one 'invoke' in the function, then that invoke won't be going away. However, with the advent of the `llvm.donothing' intrinsic, those invokes may go away. If all of them go away, the landing pad no longer has any users. This confuses the back-end, which asserts. This happens with SjLj exceptions, because that's the model that modifies the IR based on there being invokes, etc. in the function. Remove any invokes of `llvm.donothing' during SjLj EH preparation. This will give us a CFG that the back-end won't be confused about. If all of the invokes in a function are removed, then the SjLj EH prepare pass won't insert the bogus code the relies upon the invokes being there. <rdar://problem/13228754&13316637> llvm-svn: 176677	2013-03-08 02:21:08 +00:00
Jakob Stoklund Olesen	bed165422f	Add a getPassName() method. llvm-svn: 176669	2013-03-07 23:55:51 +00:00
Jakob Stoklund Olesen	299cedc7ca	Rename isEarlierInSameTrace to isUsefulDominator. In very rare cases caused by irreducible control flow, the dominating block can have the same trace head without actually being part of the trace. As long as such a dominator still has valid instruction depths, it is OK to use it for computing instruction depths. Rename the function to avoid lying, and add a check that instruction depths are computed for the dominator. llvm-svn: 176668	2013-03-07 23:55:49 +00:00
Chad Rosier	3a200e1faf	[fast-isel] Seriously, add support for the expect intrinsic. rdar://13370942 llvm-svn: 176659	2013-03-07 21:38:33 +00:00
Chad Rosier	9c1796f877	[fast-isel] Add support for the expect intrinsic. rdar://13370942 llvm-svn: 176649	2013-03-07 20:42:17 +00:00
Benjamin Kramer	fdf362bd69	ArrayRefize some code. No functionality change. llvm-svn: 176648	2013-03-07 20:33:29 +00:00
Andrew Trick	0f23b763a9	pre-RA-sched debug-only fix llvm-svn: 176638	2013-03-07 19:21:08 +00:00
Andrew Trick	b2ab8a732c	pre-RA-sched assertion fix. This bug was exposed by r176037. rdar:13370002 [pre-RA-sched] assertion: released too many times I tracked this down to an earlier hack that is no longer applicable and interfered with normal scheduler logic. With the changes in r176037, it was causing an instruction to be scheduled multiple times. I have an external test case that I tried hard to reduce and failed. I can't even reproduce with llc. llvm-svn: 176636	2013-03-07 19:07:57 +00:00
Nadav Rotem	40cda80af8	No need to go through int64 and APInt when generating a new constant. llvm-svn: 176615	2013-03-07 06:34:49 +00:00
Jim Grosbach	48a91abc10	SDAG: Handle scalarizing an extend of a <1 x iN> vector. Just scalarize the element and rebuild a vector of the result type from that. rdar://13281568 llvm-svn: 176614	2013-03-07 05:47:54 +00:00
Manman Ren	1e4272085d	Debug Info: store the files and directories for each compile unit. We now emit a line table for each compile unit. To reduce the prologue size of each line table, the files and directories used by each compile unit are stored in std::map<unsigned, std::vector< > > instead of std::vector< >. The prologue for a lto'ed image can be as big as 93K. Duplicating 93K for each compile unit causes a huge increase of debug info. With this patch, each prologue will only emit the files required by the compile unit. rdar://problem/13342023 llvm-svn: 176605	2013-03-07 01:42:00 +00:00
Bill Wendling	a69d0aaa71	Remove unused #includes. llvm-svn: 176467	2013-03-05 01:00:45 +00:00
Eli Bendersky	b1caf3c30e	Remove duplicate line and move another closer to its actual use llvm-svn: 176391	2013-03-01 23:32:40 +00:00
Akira Hatanaka	3d055580a9	Set properties for f128 type. llvm-svn: 176378	2013-03-01 21:11:44 +00:00
Chad Rosier	b3864609cf	Generate an error message instead of asserting or segfaulting when we can't handle indirect register inputs. rdar://13322011 llvm-svn: 176367	2013-03-01 19:12:05 +00:00
Michael Liao	6af16fc3b7	Fix PR10475 - ISD::SHL/SRL/SRA must have either both scalar or both vector operands but TLI.getShiftAmountTy() so far only return scalar type. As a result, backend logic assuming that breaks. - Rename the original TLI.getShiftAmountTy() to TLI.getScalarShiftAmountTy() and re-define TLI.getShiftAmountTy() to return target-specificed scalar type or the same vector type as the 1st operand. - Fix most TICG logic assuming TLI.getShiftAmountTy() a simple scalar type. llvm-svn: 176364	2013-03-01 18:40:30 +00:00
Andrew Trick	f9669beacd	Scheduler diagnostics. Print the register name. llvm-svn: 176316	2013-03-01 00:19:14 +00:00
Andrew Trick	2a8edef70b	Instructions schedulers should report correct height/depth. We avoided computing DAG height/depth during Node printing because it shouldn't depend on an otherwise valid DAG. But this has become far too annoying for the common case of a valid DAG where we want to see valid values. If doing the computation on-the-fly turns out to be a problem in practice, then I'll add a mode to the diagnostics to only force it when we're likely to have a valid DAG, otherwise explicitly print INVALID instead of bogus numbers. For now, just go for it all the time. llvm-svn: 176314	2013-03-01 00:19:09 +00:00
Eli Bendersky	33ebf836bc	A small refactoring + adding comments. SelectionDAGIsel::LowerArguments needs a function, not a basic block. So it makes sense to pass it the function instead of extracting a basic-block from the function and then tossing it. This is also more self-documenting (functions have arguments, BBs don't). In addition, added comments to a couple of Select* methods. llvm-svn: 176305	2013-02-28 23:09:18 +00:00
Benjamin Kramer	f7cfac7a14	Cost model support for lowered math builtins. We make the cost for calling libm functions extremely high as emitting the calls is expensive and causes spills (on x86) so performance suffers. We still vectorize important calls like ceilf and friends on SSE4.1. and fabs. Differential Revision: http://llvm-reviews.chandlerc.com/D466 llvm-svn: 176287	2013-02-28 19:09:33 +00:00
Eli Bendersky	d0c6e7b038	Put some per-instruction statistics of fast isel under NDEBUG, together with other per-instruction statistics. llvm-svn: 176273	2013-02-28 18:05:12 +00:00
Eric Christopher	5958b46053	Remove unnecessary check against isGlobalVariable. We check it a few lines above. llvm-svn: 176224	2013-02-27 23:49:50 +00:00
Eric Christopher	3f79b8c374	Rework comment slightly and fix a few typos. llvm-svn: 176223	2013-02-27 23:49:47 +00:00
Eric Christopher	10d35e9065	Remove unnecessary cast to void. llvm-svn: 176222	2013-02-27 23:49:45 +00:00
Manman Ren	584e4c0eda	Debug Info: for static member variables, always put AT_MIPS_linkage_name to the definition DIE (TAG_variable), and put AT_MIPS_linkage_name to TAG_member when DarwinGDBCompat is true. Darwin GDB needs AT_MIPS_linkage_name at both places to work. Follow-up patch to r176143. rdar://problem/13291234 llvm-svn: 176220	2013-02-27 23:21:02 +00:00
Nadav Rotem	c29095fb50	Silence the unused variable warning. llvm-svn: 176218	2013-02-27 22:52:54 +00:00
Nadav Rotem	00b75dd3c4	The FastISEL should be fast. But when we record statistics we use atomic operations to increment the counters. This patch disables the counters on non-debug builds. This reduces the runtime of SelectionDAGISel::SelectCodeCommon by ~5%. llvm-svn: 176214	2013-02-27 21:59:43 +00:00
Michael Ilseman	ba8446c80e	Reverted: r176136 - Have a way for a target to opt-out of target-independent fast isel llvm-svn: 176204	2013-02-27 19:54:00 +00:00
Manman Ren	683f59b36c	SelectionDAG: If llvm.donothing has a landingpad, we should clear CurrentCallSite to avoid an assertion failure: assert(MMI.getCurrentCallSite() == 0 && "Overlapping call sites!"); rdar://problem/13228754 llvm-svn: 176154	2013-02-27 02:11:57 +00:00
Manman Ren	5ae44d2b75	Debug Info: for static member variables, add AT_MIPS_linkage_name to the definition DIE, to make old GDB happy. We have a regression for old GDB when Clang uses DW_TAG_member to declare static members inside a class, instead of DW_TAG_variable. This patch will fix this regression. rdar://problem/13291234 llvm-svn: 176143	2013-02-27 00:02:32 +00:00
Michael Ilseman	846c6f0a32	Have a way for a target to opt-out of target-independent fast isel llvm-svn: 176136	2013-02-26 23:15:23 +00:00
Manman Ren	fe494749e4	Revert r176120 as it caused a failure at static-member.cpp llvm-svn: 176129	2013-02-26 22:35:53 +00:00
Manman Ren	5222195831	Debug Info: for static member variables, move AT_MIPS_linkage_name from TAG_member inside a class to the specification DIE. Having AT_MIPS_linkage_name on TAG_member caused old gdb (GNU 6.3.50) to error out. Also gcc 4.7 has AT_MIPS_linkage_name on the specification DIE. rdar://problem/13291234 llvm-svn: 176120	2013-02-26 20:48:29 +00:00
Chad Rosier	0587597fb8	Fix wording. llvm-svn: 176055	2013-02-25 22:20:00 +00:00
Chad Rosier	a92ef4ba5b	[fast-isel] Add X86FastIsel::FastLowerArguments to handle functions with 6 or fewer scalar integer (i32 or i64) arguments. It completely eliminates the need for SDISel for trivial functions. Also, add the new llc -fast-isel-abort-args option, which is similar to -fast-isel-abort option, but for formal argument lowering. llvm-svn: 176052	2013-02-25 21:59:35 +00:00
Andrew Trick	7cf4361912	pre-RA-sched fix: only reevaluate physreg interferences when necessary. Fixes rdar:13279013: scheduler was blowing up on select instructions. llvm-svn: 176037	2013-02-25 19:11:48 +00:00
Matt Beaumont-Gay	0e760da5fc	'Hexadecimal' has two 'a's and only one 'i'. llvm-svn: 176031	2013-02-25 18:11:18 +00:00
Chandler Carruth	121dbf8846	Fix spelling noticed by Duncan. llvm-svn: 176023	2013-02-25 14:29:38 +00:00
Chandler Carruth	05920b1847	Fix the root cause of PR15348 by correctly handling alignment 0 on memory intrinsics in the SDAG builder. When alignment is zero, the lang ref says that no alignment assumptions can be made. This is the exact opposite of the internal API contracts of the DAG where alignment 0 indicates that the alignment can be made to be anything desired. There is another, more explicit alignment that is better suited for the role of "no alignment at all": an alignment of 1. Map the intrinsic alignment to this early so that we don't end up generating aligned DAGs. It is really terrifying that we've never seen this before, but we suddenly started generating a large number of alignment 0 memcpys due to the new code to do memcpy-based copying of POD class members. That patch contains a bug that rounds bitfield alignments down when they are the first field. This can in turn produce zero alignments. This fixes weird crashes I've seen in library users of LLVM on 32-bit hosts, etc. llvm-svn: 176022	2013-02-25 14:20:21 +00:00
Cameron Zwarich	1b4c64c23b	Add a use of an otherwise unused variable to remove a warning in non-Asserts builds. llvm-svn: 175981	2013-02-24 01:26:05 +00:00
Cameron Zwarich	6868f386c7	TwoAddressInstructionPass::tryInstructionTransform() only potentially returns true when shouldOnlyCommute is false, so we can remove code that checks otherwise. llvm-svn: 175980	2013-02-24 00:27:29 +00:00
Cameron Zwarich	f05c0cbb20	TwoAddrInstructionPass::tryInstructionTransform() has a case where it calls itself recursively with a new instruction that has not been finalized, in order to determine whether to keep the instruction. On 'make check' and test-suite the only cases where the recursive invocation made any transformations were simple instruction commutations, so I am restricting the recursive invocation to do only this. The other cases wouldn't work correctly when updating LiveIntervals, since the new instructions don't have slot indices and LiveIntervals hasn't yet been updated. If the other transformations were actually triggering in any test case it would be possible to support it with a lot of effort, but since they don't it's not worth it. llvm-svn: 175979	2013-02-24 00:27:26 +00:00
Cameron Zwarich	e6907bc0e6	TargetInstrInfo::commuteInstruction() doesn't actually return a new instruction unless it was requested to with an optional parameter that defaults to false, so we don't need to handle that case in TwoAddressInstructionPass. llvm-svn: 175974	2013-02-23 23:13:28 +00:00
Cameron Zwarich	63acc73f21	Make some fixes for LiveInterval repair with debug info. Debug value MachineInstrs don't have a slot index. llvm-svn: 175961	2013-02-23 10:25:25 +00:00
Cameron Zwarich	4e80d9e853	Fix a bug with the LiveIntervals updating in the two-address pass found by running ASCI_Purple/SMG2000 in the test-suite. llvm-svn: 175957	2013-02-23 04:49:22 +00:00
Cameron Zwarich	35c3050e7a	Make TwoAddressInstructionPass::sink3AddrInstruction() LiveIntervals-aware. llvm-svn: 175956	2013-02-23 04:49:20 +00:00
Cameron Zwarich	7d13fb4d5a	Make rescheduleMIBelowKill() and rescheduleKillAboveMI() LiveIntervals-aware in TwoAddressInstructionPass. The code in rescheduleMIBelowKill() is a bit tricky, since multiple instructions need to be moved down, one-at-a-time, in reverse order. llvm-svn: 175955	2013-02-23 04:49:13 +00:00
Eric Christopher	dae389bb98	Use getSplitDebugFilename when constructing the skeleton cu and update testcase accordingly to give the correct name to the cu. llvm-svn: 175934	2013-02-22 23:50:08 +00:00
Nadav Rotem	b7f90bd97b	SelectionDAG compile time improvement. One of the phases of SelectionDAG is LegalizeVectors. We don't need to sort the DAG and copy nodes around if there are no vector ops. Speeds up the compilation time of SelectionDAG on a big scalar workload by ~8%. llvm-svn: 175929	2013-02-22 23:33:30 +00:00
Pete Cooper	047f81a5df	Fix isa<> check which could never be true. It was incorrectly checking a Function* being an IntrinsicInst* which isn't possible. It should always have been checking the CallInst* instead. Added test case for x86 which ensures we only get one constant load. It was 2 before this change. rdar://problem/13267920 llvm-svn: 175853	2013-02-22 01:50:38 +00:00
Cameron Zwarich	384026b3d4	Stop relying on physical register kill flags in isKilled() in the two-address pass. One of the callers of isKilled() can cope with overapproximation of kills and the other can't, so I added a flag to indicate this. In theory this could pessimize code slightly, but in practice most physical register uses are kills, and most important kills of physical registers are the only uses of that register prior to register allocation, so we can recognize them as kills even without kill flags. This is relevant because LiveIntervals gets rid of all kill flags. llvm-svn: 175821	2013-02-21 22:58:42 +00:00
Lang Hames	4309020200	Re-apply r175688, with the changes suggested by Jakob in PR15320. llvm-svn: 175809	2013-02-21 22:16:43 +00:00
Eli Bendersky	8da87163ca	Move the eliminateCallFramePseudoInstr method from TargetRegisterInfo to TargetFrameLowering, where it belongs. Incidentally, this allows us to delete some duplicated (and slightly different!) code in TRI. There are potentially other layering problems that can be cleaned up as a result, or in a similar manner. The refactoring was OK'd by Anton Korobeynikov on llvmdev. Note: this touches the target interfaces, so out-of-tree targets may be affected. llvm-svn: 175788	2013-02-21 20:05:00 +00:00
Jakob Stoklund Olesen	2ff4dc0ff2	Make RAFast::UsedInInstr indexed by register units. This fixes some problems with too conservative checking where we were marking all aliases of a register as used, and then also checking all aliases when allocating a register. <rdar://problem/13249625> llvm-svn: 175782	2013-02-21 19:35:21 +00:00
Lang Hames	071890b554	Revert r175688 - It broke a test case (see PR15320). llvm-svn: 175765	2013-02-21 17:01:59 +00:00
Benjamin Kramer	3238dc0c61	DAGCombiner: Make the post-legalize vector op optimization more aggressive. A legal BUILD_VECTOR goes in and gets constant folded into another legal BUILD_VECTOR so we don't lose any legality here. The problematic PPC optimization that made this check necessary was fixed recently. llvm-svn: 175759	2013-02-21 15:24:35 +00:00
Cameron Zwarich	3ab4c4ba30	Don't rely on the isDead() MachineOperand flag when updating LiveIntervals. llvm-svn: 175732	2013-02-21 08:51:58 +00:00
Cameron Zwarich	d113292bce	Use getInterval() instead of getOrCreateInterval(). llvm-svn: 175731	2013-02-21 08:51:55 +00:00
Cameron Zwarich	9e722aed4a	Make another kill check LiveIntervals-aware. This brings the number of remaining failures in 'make check' without LiveVariables down to 39, with 1 unexpectedly passing test. llvm-svn: 175727	2013-02-21 07:02:30 +00:00
Cameron Zwarich	c896478f0d	Split part of isKilled() into a separate function for use elsewhere. llvm-svn: 175726	2013-02-21 07:02:28 +00:00
Cameron Zwarich	94b204be54	Update isKilledAt in TwoAddressInstructionPass.cpp to use LiveIntervals when available. With this commit there are no longer any assertion or verifier failures when running 'make check' without LiveVariables. There are still 56 failing tests with codegen differences and 1 unexpectedly passing test. llvm-svn: 175719	2013-02-21 04:33:02 +00:00
Jakob Stoklund Olesen	1744fd8018	Don't allocate memory in LiveInterval::join(). Rewrite value numbers directly in the 'Other' LiveInterval which is moribund anyway. This avoids allocating the OtherAssignments vector. llvm-svn: 175690	2013-02-20 23:51:10 +00:00
Lang Hames	24e3759230	Kill of TransferDeadFlag - Dead copies and subreg-to-reg instructions should just be turned into kills on the spot. llvm-svn: 175688	2013-02-20 23:36:57 +00:00
Jakob Stoklund Olesen	b389271101	Copy single reaching defs directly into the LiveInterval. When findReachingDefs() finds that only one value can reach the basic block, just copy the work list of visited blocks directly into the live interval. Sort the block list and use a LiveRangeUpdater to make the bulk add fast. When multiple reaching defs are found, transfer the work list to the updateSSA() work list as before. Also use LiveRangeUpdater in updateLiveIns() following updateSSA(). This makes live interval analysis more than 3x faster on one huge test case. llvm-svn: 175685	2013-02-20 23:08:26 +00:00
Cameron Zwarich	2ad3ca37bd	Only use LiveIntervals in TwoAddressInstructionPass, not a mix of Liveintervals and SlotIndexes. llvm-svn: 175674	2013-02-20 22:10:02 +00:00
Cameron Zwarich	caad7e10c1	Find anchoring end points for repairIntervalsInRange and repairIndexesInRange automatically. llvm-svn: 175673	2013-02-20 22:10:00 +00:00
Cameron Zwarich	8e7dc068c9	Make repairIntervalsInRange() more robust. There are now no longer any liveness- related failures when running 'make check' without LiveVariables with the verifier enabled. Some of the remaining failures elsewhere may still be fallout from incorrect updating of LiveIntervals or the few missing cases left in the two-address pass. llvm-svn: 175672	2013-02-20 22:09:57 +00:00
Arnold Schwaighofer	3f9568e921	DAGCombiner: Fold pointless truncate, bitcast, buildvector series (2xi32) (truncate ((2xi64) bitcast (buildvector i32 a, i32 x, i32 b, i32 y))) can be folded into a (2xi32) (buildvector i32 a, i32 b). Such a DAG would cause uneccessary vdup instructions followed by vmovn instructions. We generate this code on ARM NEON for a setcc olt, 2xf64, 2xf64. For example, in the vectorized version of the code below. double A[N]; double B[N]; void test_double_compare_to_double() { int i; for(i=0;i<N;i++) A[i] = (double)(A[i] < B[i]); } radar://13191881 Fixes bug 15283. llvm-svn: 175670	2013-02-20 21:33:32 +00:00
Jim Grosbach	341ad3e72a	Update TargetLowering ivars for name policy. http://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly ivars should be camel-case and start with an upper-case letter. A few in TargetLowering were starting with a lower-case letter. No functional change intended. llvm-svn: 175667	2013-02-20 21:13:59 +00:00
Jakob Stoklund Olesen	623d8329e3	Use LiveRangeUpdater instead of mergeIntervalRanges. Performance is the same, but LiveRangeUpdater has a more flexible interface. llvm-svn: 175645	2013-02-20 18:18:15 +00:00
Jakob Stoklund Olesen	521c708e6e	Add a LiveRangeUpdater class. Adding new segments to large LiveIntervals can be expensive because the LiveRange objects after the insertion point may need to be moved left or right. This can cause quadratic behavior when adding a large number of segments to a live range. The LiveRangeUpdater class allows the LIveInterval to be in a temporary invalid state while segments are being added. It maintains an internal gap in the LiveInterval when it is shrinking, and it has a spill area for new segments when the LiveInterval is growing. The behavior is similar to the existing mergeIntervalRanges() function, except it allocates less memory for the spill area, and the algorithm is turned inside out so the loop is driven by the clients. llvm-svn: 175644	2013-02-20 18:18:12 +00:00
Michael Liao	7fb39669ef	Fix PR15267 - When extloading from a vector with non-byte-addressable element, e.g. <4 x i1>, the current logic breaks. Extend the current logic to fix the case where the element type is not byte-addressable by loading all bytes, bit-extracting/packing each element. llvm-svn: 175642	2013-02-20 18:04:21 +00:00
Benjamin Kramer	5c3e21ba55	Move the SplatByte helper to APInt and generalize it a bit. llvm-svn: 175621	2013-02-20 13:00:06 +00:00
David Blaikie	9db062ebdb	Fully qualify llvm::next to avoid ambiguity when building as C++11. llvm-svn: 175608	2013-02-20 07:39:20 +00:00
Cameron Zwarich	8e60d4d240	Add support to the two-address pass for updating LiveIntervals in many of the common transformations. This includes updating repairIntervalsInRange() to handle more cases. llvm-svn: 175604	2013-02-20 06:46:48 +00:00
Cameron Zwarich	2991feb0d3	Move the computation of the IsEarlyClobber flag into its own loop, since the correct value is needed in every iteration of the loop for updating LiveIntervals. llvm-svn: 175603	2013-02-20 06:46:46 +00:00
Cameron Zwarich	2941482387	Add SlotIndexes::repairIndexesInRange(), which repairs SlotIndexes after adding and removing instructions. The implementation seems more complicated than it needs to be, but I couldn't find something simpler that dealt with all of the corner cases. Also add a call to repairIndexesInRange() from repairIntervalsInRange(). llvm-svn: 175601	2013-02-20 06:46:41 +00:00
Cameron Zwarich	3673581de9	Remove verification after PHIElimination when using LiveIntervals, and move it after the two-address pass. The remaining problems in 'make check' are occurring later. llvm-svn: 175598	2013-02-20 06:46:34 +00:00
Cameron Zwarich	68fbc4f642	Avoid recomputing an inserted instruction's SlotIndex. llvm-svn: 175597	2013-02-20 06:46:32 +00:00
Cameron Zwarich	37ca2e81d8	Add preservation of SlotIndexes to PHIElimination. llvm-svn: 175596	2013-02-20 06:46:28 +00:00
Jakub Staszak	8bc7af1a93	Fix #includes, so we include only what we really need. llvm-svn: 175581	2013-02-20 00:26:25 +00:00
Jakob Stoklund Olesen	7e28db019a	Assert that the target provided hints are in the allocation order. Target implementations of getRegAllocationHints() should use the provided allocation order, and they can never return hints outside the order. This is already documented in TargetRegisterInfo.h. <rdar://problem/13240556> llvm-svn: 175540	2013-02-19 18:41:01 +00:00
Benjamin Kramer	b3aa2b8497	Fix GCMetadaPrinter::finishAssembly not executed, patch by Yiannis Tsiouris. Due to the execution order of doFinalization functions, the GC information were deleted before AsmPrinter::doFinalization was executed. Thus, the GCMetadataPrinter::finishAssembly was never called. The patch fixes that by moving the code of the GCInfoDeleter::doFinalization to Printer::doFinalization. llvm-svn: 175528	2013-02-19 16:51:44 +00:00
Craig Topper	4ce8a909ba	More const correcting of stack coloring. llvm-svn: 175490	2013-02-19 06:02:40 +00:00
Craig Topper	b35030b229	Const-correct the stack coloring code. llvm-svn: 175488	2013-02-19 05:32:02 +00:00
Craig Topper	9a4b318491	Avoid extra DenseMap lookups in StackColoring::calculateLocalLiveness. llvm-svn: 175487	2013-02-19 04:47:31 +00:00
Craig Topper	ac82084563	Make the dump() function const and reduce the number of hash lookups it performs. llvm-svn: 175485	2013-02-19 03:14:22 +00:00
Craig Topper	85abf9ea73	Use a reference into the BlockLiveness DenseMap to avoid repeated hash lookups in collectMarkers. llvm-svn: 175484	2013-02-19 03:06:17 +00:00
Benjamin Kramer	53bc37ca2a	Support for HiPE-compatible code emission, patch by Yiannis Tsiouris. llvm-svn: 175457	2013-02-18 20:55:12 +00:00
Chad Rosier	3489bcc9ab	[ms-inline asm] Remove a redundant call to the setHasMSInlineAsm function. llvm-svn: 175456	2013-02-18 20:13:59 +00:00
NAKAMURA Takumi	68426c79db	[ms-inline asm] Fix undefined behavior to reset hasMSInlineAsm in advance of SelectAllBasicBlocks(). llvm-svn: 175422	2013-02-18 07:06:48 +00:00
Cameron Zwarich	2495596792	Remove use of reverse iterators in repairIntervalsInRange(). While they were arguably better than forward iterators for this use case, they are confusing and there are some implementation problems with reverse iterators and MI bundles. llvm-svn: 175393	2013-02-17 11:09:00 +00:00
Cameron Zwarich	1286ef9c89	Use ArrayRef instead of a reference to a SmallVectorImpl. llvm-svn: 175385	2013-02-17 03:48:23 +00:00
Cameron Zwarich	ddeabf78a0	Fix a conversion from a forward iterator to a reverse iterator in MachineBasicBlock::SplitCriticalEdge. Since this is an iterator rather than an instr_iterator, the isBundled() check only passes if getFirstTerminator() returned end() and the garbage memory happens to lean that way. Multiple successors can be present without any terminator instructions in the case of exception handling with a fallthrough. llvm-svn: 175383	2013-02-17 01:45:04 +00:00
Cameron Zwarich	bfebb41984	Add support for updating the LiveIntervals of registers used by 'exotic' terminators that actually have register uses when splitting critical edges. This commit also introduces a method repairIntervalsInRange() on LiveIntervals, which allows for repairing LiveIntervals in a small range after an arbitrary target hook modifies, inserts, and removes instructions. It's pretty limited right now, but I hope to extend it to support all of the things that are done by the convertToThreeAddress() target hooks. llvm-svn: 175382	2013-02-17 00:10:44 +00:00
Benjamin Kramer	ae526d106a	Replace erase loop with std::remove_if. This avoids unnecessary copies. No functionality change. llvm-svn: 175367	2013-02-16 17:06:38 +00:00
Benjamin Kramer	981de9a9da	Replace loop with std::find. No functionality change. llvm-svn: 175366	2013-02-16 17:06:32 +00:00
Jakub Staszak	5c262f505e	LegalizeDAG.cpp doesn't need DenseMap. llvm-svn: 175365	2013-02-16 16:15:42 +00:00
Jakub Staszak	87609f5a29	Use const reference instead of vector object when passing an argument to updateScheduledPressure method. llvm-svn: 175362	2013-02-16 15:47:26 +00:00
Chad Rosier	925c9b499e	[ms-inline asm] Do not omit the frame pointer if we have ms-inline assembly. If the frame pointer is omitted, and any stack changes occur in the inline assembly, e.g.: "pusha", then any C local variable or C argument references will be incorrect. I pass no judgement on anyone who would do such a thing. ;) rdar://13218191 llvm-svn: 175334	2013-02-16 01:25:28 +00:00
Bill Wendling	aef9c37c65	Use the 'target-features' and 'target-cpu' attributes to reset the subtarget features. If two functions require different features (e.g., `-mno-sse' vs. `-msse') then we want to honor that, especially during LTO. We can do that by resetting the subtarget's features depending upon the 'target-feature' attribute. llvm-svn: 175314	2013-02-15 22:31:27 +00:00
Paul Redmond	f29ddfe93f	enable SDISel sincos optimization for GNU environments - add sincos to runtime library if target triple environment is GNU - added canCombineSinCosLibcall() which checks that sincos is in the RTL and if the environment is GNU then unsafe fpmath is enabled (required to preserve errno) - extended sincos-opt lit test Reviewed by: Hal Finkel llvm-svn: 175283	2013-02-15 18:45:18 +00:00
Benjamin Kramer	6ecb1e78a9	Make helpers static. Add missing include so LLVMInitializeObjCARCOpts gets C linkage. llvm-svn: 175264	2013-02-15 12:30:38 +00:00
Nadav Rotem	495b1a43c1	Dont merge consecutive loads/stores into vectors when noimplicitfloat is used. llvm-svn: 175190	2013-02-14 18:28:52 +00:00
Owen Anderson	cc068993ee	Add some legality checks for SETCC before introducing it in the DAG combiner post-operand legalization. llvm-svn: 175149	2013-02-14 09:07:33 +00:00
Cameron Zwarich	8f55064a06	RegisterCoalescer::reMaterializeTrivialDef() can constrain the destination register class to match the defining instruction. llvm-svn: 175130	2013-02-14 03:25:24 +00:00
Cameron Zwarich	48ab445621	Fix RegisterCoalescer::rematerializeTrivialDef() so that it works on flipped CoalescerPairs. Also, make it take a CoalescerPair directly like other methods of RegisterCoalescer. llvm-svn: 175123	2013-02-14 02:51:05 +00:00
Cameron Zwarich	1195e819bb	Fix some issues with rematerialization in RegisterCoalescer when the destination of the copy is a subregister def. The current code assumes that it can do a full def of the destination register, but it is not checking that the def operand is read-undef. It also doesn't clear the subregister index of the destination in the new instruction to reflect the full subregister def. These issues were found running 'make check' with my next commit that enables rematerialization in more cases. llvm-svn: 175122	2013-02-14 02:51:03 +00:00
Manman Ren	7a4c8a783c	Clean up LDV, no functionality change. Remove dead functions: renameRegister Move private member variables from LDV to Impl Remove ssp/uwtable from testing case llvm-svn: 175072	2013-02-13 20:23:48 +00:00
Andrew Trick	553e0fe365	MIsched: HazardRecognizers are created for each DAG. Free them. llvm-svn: 175067	2013-02-13 19:22:27 +00:00
Eric Christopher	bc67170056	Fix comment. llvm-svn: 175024	2013-02-13 02:29:18 +00:00
Manman Ren	f019cd62da	Debug Info: LiveDebugVarible can remove DBG_VALUEs, make sure we emit them back. RegisterCoalescer used to depend on LiveDebugVariable. LDV removes DBG_VALUEs without emitting them at the end. We fix this by removing LDV from RegisterCoalescer. Also add an assertion to make sure we call emitDebugValues if DBG_VALUEs are removed at runOnMachineFunction. rdar://problem/13183203 Reviewed by Andy & Jakob llvm-svn: 175023	2013-02-13 01:14:49 +00:00
Guy Benyei	83c74e9fad	Add static cast to unsigned char whenever a character classification function is called with a signed char argument, in order to avoid assertions in Windows Debug configuration. llvm-svn: 175006	2013-02-12 21:21:59 +00:00
Krzysztof Parzyszek	228daa6986	Allow optionally generating pubnames section in DWARF info. Introduce option "generate-dwarf-pubnames" to control it, set to "false" by default. llvm-svn: 174981	2013-02-12 18:00:14 +00:00
Sergei Larin	5e76aa9714	Equal treatment of labels and other terminators in MI DAG construction. MI sched DAG construction allows targets to include terminators into scheduling DAG. Extend this functionality to labels as well. llvm-svn: 174977	2013-02-12 16:36:03 +00:00
Paul Redmond	288604ed0c	PR14562 - Truncation of left shift became undef DAGCombiner::ReduceLoadWidth was converting (trunc i32 (shl i64 v, 32)) into (shl i32 v, 32) into undef. To prevent this, check the shift count against the final result size. Patch by: Kevin Schoedel Reviewed by: Nadav Rotem llvm-svn: 174972	2013-02-12 15:21:21 +00:00
Cameron Zwarich	7c85c9433c	Don't consider definitions by other PHIs live-in when trimming a PHI source's live range after inserting a copy at the end of a block. llvm-svn: 174945	2013-02-12 05:48:58 +00:00
Cameron Zwarich	4ee9aef208	Fix indentation. llvm-svn: 174944	2013-02-12 05:48:56 +00:00
Cameron Zwarich	15eb925ed4	Add a hidden option to PHIElimination to split all critical edges. This is particularly useful for catching issues with architectures that have exotic terminators like MIPS. llvm-svn: 174938	2013-02-12 03:49:25 +00:00
Cameron Zwarich	cdcab38ff5	Add blocks to the LiveIntervalAnalysis RegMaskBlocks array when splitting a critical edge. llvm-svn: 174936	2013-02-12 03:49:20 +00:00
Cameron Zwarich	af34931c63	Fix the updating of LiveIntervals after splitting a critical edge. PHI operand live ranges should always be extended, and the only successor that should be considered for extension of other ranges is the target of the split edge. llvm-svn: 174935	2013-02-12 03:49:17 +00:00
Pete Cooper	10a3ae7039	Check type for legality before forming a select from loads. Sorry for the lack of a test case. I tried writing one for i386 as i know selects are illegal on this target, but they are actually considered legal by isel and expanded later. I can't see any targets to trigger this, but checking for the legality of a node before forming it is general goodness. llvm-svn: 174934	2013-02-12 03:14:50 +00:00
Cameron Zwarich	b47fb384fc	Add support for updating LiveIntervals to MachineBasicBlock::SplitCriticalEdge(). This is currently a bit hairier than it needs to be, since depending on where the split block resides the end ListEntry of the split block may be the end ListEntry of the original block or a new entry. Some changes to the SlotIndexes updating should make it possible to eliminate the two cases here. This also isn't as optimized as it could be. In the future Liveinterval should probably get a flag that indicates whether the LiveInterval is within a single basic block. We could ignore all such intervals when splitting an edge. llvm-svn: 174870	2013-02-11 09:24:47 +00:00
Cameron Zwarich	ba378cea72	Update SlotIndexes after updateTerminator() possibly removes instructions. I am really trying to avoid piping SlotIndexes through to RemoveBranch() and friends. llvm-svn: 174869	2013-02-11 09:24:45 +00:00
Bob Wilson	a594fab454	Revert "Rename LLVMContext diagnostic handler types and functions." This reverts my commit 171047. Now that I've removed my misguided attempt to support backend warnings, these diagnostics are only about inline assembly. It would take quite a bit more work to generalize them properly, so I'm just reverting this. llvm-svn: 174860	2013-02-11 05:37:07 +00:00
Evan Cheng	615620c9e8	Currently, codegen may spent some time in SDISel passes even if an entire function is successfully handled by fast-isel. That's because function arguments are always handled by SDISel. Introduce FastLowerArguments to allow each target to provide hook to handle formal argument lowering. As a proof-of-concept, add ARMFastIsel::FastLowerArguments to handle functions with 4 or fewer scalar integer (i8, i16, or i32) arguments. It completely eliminates the need for SDISel for trivial functions. rdar://13163905 llvm-svn: 174855	2013-02-11 01:27:15 +00:00
Evan Cheng	d1c6404250	Remove unnecessary code. llvm-svn: 174854	2013-02-11 01:18:26 +00:00
Cameron Zwarich	21beaf6789	Fix the unused but nearly correct method SlotIndexes::insertMBBInMaps() and add support for updating SlotIndexes to MachineBasicBlock::SplitCriticalEdge(). This calls renumberIndexes() every time; it should be improved to only renumber locally. llvm-svn: 174851	2013-02-10 23:29:54 +00:00
Cameron Zwarich	bb9ad311fb	Abstract the liveness checking in PHIElimination::SplitPHIEdges() to support both LiveVariables and LiveIntervals. llvm-svn: 174850	2013-02-10 23:29:49 +00:00
Cameron Zwarich	16b64cba62	Add support for updating LiveIntervals to PHIElimination. If LiveIntervals are present, it currently verifies them with the MachineVerifier, and this passed all of the test cases in 'make check' (when accounting for existing verifier errors). There were some assertion failures in the two-address pass, but they also happened on code without phis and look like they are caused by different kill flags from LiveIntervals. The only part that doesn't work is the critical edge splitting heuristic, because there isn't currently an efficient way to update LiveIntervals after splitting an edge. I'll probably start by implementing the slow fallback and test that it works before tackling the fast path for single-block ranges. The existing code that updates LiveVariables is fairly slow as it is. There isn't a command-line option for enabling this; instead, just edit PHIElimination.cpp to require LiveIntervals. llvm-svn: 174831	2013-02-10 06:42:36 +00:00
Cameron Zwarich	71f0acbeeb	Fix a typo. llvm-svn: 174830	2013-02-10 06:42:34 +00:00
Cameron Zwarich	a158d39da0	Remove ancient references to 'atomic' phis in PHIElimination that don't really make sense anymore. llvm-svn: 174829	2013-02-10 06:42:32 +00:00
Cameron Zwarich	e0966738db	Make LiveVariables an instance variable of PHIElimination. llvm-svn: 174828	2013-02-10 06:42:30 +00:00
Jakub Staszak	853136229b	Remove unneeded "TargetMachine.h" #includes. llvm-svn: 174817	2013-02-09 20:54:05 +00:00
Manman Ren	d2c95eb995	Dwarf: do not use line_table_start in at_stmt_list since we do not always emit line table entries in assembly. llvm-svn: 174785	2013-02-09 00:41:44 +00:00
Jakob Stoklund Olesen	fac770b865	Remove the old liveness algorithm. This is part of the plan to delete LiveVariables. llvm-svn: 174783	2013-02-09 00:04:07 +00:00
Jakob Stoklund Olesen	63e5b4c269	Turn on -new-live-intervals by default. This uses a liveness algorithm that does not depend on data from the LiveVariables analysis, it is the first step towards removing LiveVariables completely. llvm-svn: 174774	2013-02-08 23:18:37 +00:00
Bob Wilson	bfb44ef9cb	Revert "Add LLVMContext::emitWarning methods and use them. <rdar://problem/12867368>" This reverts r171041. This was a nice idea that didn't work out well. Clang warnings need to be associated with warning groups so that they can be selectively disabled, promoted to errors, etc. This simplistic patch didn't allow for that. Enhancing it to provide some way for the backend to specify a front-end warning type seems like overkill for the few uses of this, at least for now. llvm-svn: 174748	2013-02-08 21:48:29 +00:00
Hal Finkel	2581905f81	DAGCombiner: Constant folding around pre-increment loads/stores Previously, even when a pre-increment load or store was generated, we often needed to keep a copy of the original base register for use with other offsets. If all of these offsets are constants (including the offset which was combined into the addressing mode), then this is clearly unnecessary. This change adjusts these other offsets to use the new incremented address. llvm-svn: 174746	2013-02-08 21:35:47 +00:00
Bob Wilson	67bbf3aa0c	Revert 172027 and 174336. Remove diagnostics about over-aligned stack objects. Aside from the question of whether we report a warning or an error when we can't satisfy a requested stack object alignment, the current implementation of this is not good. We're not providing any source location in the diagnostics and the current warning is not connected to any warning group so you can't control it. We could improve the source location somewhat, but we can do a much better job if this check is implemented in the front-end, so let's do that instead. <rdar://problem/13127907> llvm-svn: 174741	2013-02-08 20:35:15 +00:00
Arnold Schwaighofer	594fa2dc2b	ARM cost model: Address computation in vector mem ops not free Adds a function to target transform info to query for the cost of address computation. The cost model analysis pass now also queries this interface. The code in LoopVectorize adds the cost of address computation as part of the memory instruction cost calculation. Only there, we know whether the instruction will be scalarized or not. Increase the penality for inserting in to D registers on swift. This becomes necessary because we now always assume that address computation has a cost and three is a closer value to the architecture. radar://13097204 llvm-svn: 174713	2013-02-08 14:50:48 +00:00
Eric Christopher	7480433dd9	"Clean up" line section symbol emission by emitting the section syms before constructing the compile units so we're not emitting section references to sections not there already. llvm-svn: 174663	2013-02-07 21:19:50 +00:00
Eric Christopher	5b33b3c3be	Clean up multiple skeleton compile units if we have multiple compile units coming in. llvm-svn: 174548	2013-02-06 21:53:56 +00:00
Eric Christopher	30be9f8cee	Remove some dead code. llvm-svn: 174547	2013-02-06 21:53:53 +00:00
Evan Cheng	a72b9709d7	Tweak check to avoid integer overflow (for insanely large alignments) llvm-svn: 174482	2013-02-06 02:06:33 +00:00
Manman Ren	d2c38d684a	Attempt to recover gdb bot after r174445. Failure: undefined symbol 'Lline_table_start0'. Root-cause: we use a symbol subtraction to calculate at_stmt_list, but the line table entries are not dumped in the assembly. Fix: use zero instead of a symbol subtraction for Compile Unit 0. llvm-svn: 174479	2013-02-06 00:59:41 +00:00
Manman Ren	4e042a6be6	Dwarf: support for LTO where a single object file can have multiple line tables We generate one line table for each compilation unit in the object file. Reviewed by Eric and Kevin. rdar://problem/13067005 llvm-svn: 174445	2013-02-05 21:52:47 +00:00
Owen Anderson	de89ecf1fc	Reapply r174343, with a fix for a scary DAG combine bug where it failed to differentiate between the alignment of the base point of a load, and the overall alignment of the load. This caused infinite loops in DAG combine with the original application of this patch. ORIGINAL COMMIT LOG: When the target-independent DAGCombiner inferred a higher alignment for a load, it would replace the load with one with the higher alignment. However, it did not place the new load in the worklist, which prevented later DAG combines in the same phase (for example, target-specific combines) from ever seeing it. This patch corrects that oversight, and updates some tests whose output changed due to slightly different DAGCombine outputs. llvm-svn: 174431	2013-02-05 19:24:39 +00:00
Jakob Stoklund Olesen	4acf7dd866	Remove liveout lists from MachineRegisterInfo. All targets are now adding return value registers as implicit uses on return instructions, and there is no longer a need for the live out lists. llvm-svn: 174417	2013-02-05 18:21:56 +00:00
Jakob Stoklund Olesen	c338679c9d	Remove special-casing of return blocks for liveness. Now that return value registers are return instruction uses, there is no need for special treatment of return blocks. llvm-svn: 174416	2013-02-05 18:21:52 +00:00
NAKAMURA Takumi	3753b28cd2	Revert r174343, "When the target-independent DAGCombiner inferred a higher alignment for a load," It caused hangups in compiling clang/lib/Parse/ParseDecl.cpp and clang/lib/Driver/Tools.cpp in stage2 on some hosts. llvm-svn: 174374	2013-02-05 14:44:16 +00:00
Eric Christopher	411e674557	Construct a skeleton cu for each compile unit in the module, not just for the first compile unit. llvm-svn: 174352	2013-02-05 07:32:03 +00:00
Eric Christopher	7a2cdf798b	Add support for emitting a stub DW_AT_GNU_dwo_id as part of the DWARF5 split dwarf proposal. llvm-svn: 174350	2013-02-05 07:31:55 +00:00
Owen Anderson	a47fdbb032	When the target-independent DAGCombiner inferred a higher alignment for a load, it would replace the load with one with the higher alignment. However, it did not place the new load in the worklist, which prevented later DAG combines in the same phase (for example, target-specific combines) from ever seeing it. This patch corrects that oversight, and updates some tests whose output changed due to slightly different DAGCombine outputs. llvm-svn: 174343	2013-02-05 06:25:30 +00:00
Manman Ren	86b1d868ba	[Stack Alignment] emit warning instead of a hard error Per discussion in rdar://13127907, we should emit a hard error only if people write code where the requested alignment is larger than achievable and assumes the low bits are zeros. A warning should be good enough when we are not sure if the source code assumes the low bits are zeros. rdar://13127907 llvm-svn: 174336	2013-02-04 23:45:08 +00:00
Benjamin Kramer	548ffa274a	SelectionDAG: Teach FoldConstantArithmetic how to deal with vectors. This required disabling a PowerPC optimization that did the following: input: x = BUILD_VECTOR <i32 16, i32 16, i32 16, i32 16> lowered to: tmp = BUILD_VECTOR <i32 8, i32 8, i32 8, i32 8> x = ADD tmp, tmp The add now gets folded immediately and we're back at the BUILD_VECTOR we started from. I don't see a way to fix this currently so I left it disabled for now. Fix some trivially foldable X86 tests too. llvm-svn: 174325	2013-02-04 15:19:18 +00:00
Shuxin Yang	cadd8a068e	rdar://13126763 Fix a bug in DAGCombine. The symptom is mistakenly optimizing expression "x + xx" into "x 3.0". llvm-svn: 174239	2013-02-02 00:22:03 +00:00
Manman Ren	e498b25bc5	Correct indentation for dumping LexicalScope. llvm-svn: 174237	2013-02-02 00:02:03 +00:00
Manman Ren	e697d3cd2e	[Dwarf] avoid emitting multiple AT_const_value for static memebers. Testing case is reduced from MultiSource/BenchMarks/Prolangs-C++/deriv1. rdar://problem/13071590 llvm-svn: 174235	2013-02-01 23:54:37 +00:00
Nadav Rotem	f04cbeb357	Fix errant fallthrough in the generation of the lifetime markers. Found by Alexander Kornienko. llvm-svn: 174207	2013-02-01 19:25:23 +00:00
Chad Rosier	12d7800a9b	Use a continue to simplify loop and reduce indentation. No functional change. llvm-svn: 174198	2013-02-01 17:49:07 +00:00
Chad Rosier	44e65a512d	Add braces, so my head doesn't explode. llvm-svn: 174088	2013-01-31 20:57:50 +00:00
Lang Hames	dd47804394	When lowering memcpys to loads and stores, make sure we don't promote alignments past the natural stack alignment. llvm-svn: 174085	2013-01-31 20:23:43 +00:00
Manman Ren	53f3f9f0cb	[Dwarf] early exit to avoid creating dangling DIEs We used to create children DIEs for a scope, then check whether ScopeDIE is null. If ScopeDIE is null, the children DIEs will be dangling. Other DIEs can link to those dangling DIEs, which are not emitted at all, causing dwarf error. The current testing case is 4k lines, from MultiSource/BenchMark/McCat/09-vor. rdar://problem/13071959 llvm-svn: 174084	2013-01-31 20:05:14 +00:00
Chad Rosier	df782d2225	[PEI] Pass the frame index operand number to the eliminateFrameIndex function. Each target implementation was needlessly recomputing the index. Part of rdar://13076458 llvm-svn: 174083	2013-01-31 20:02:54 +00:00
Weiming Zhao	4a0b4fb9a5	Add a special handling case for untyped CopyFromReg node in GetCostForDef() of ScheduleDAGRRList llvm-svn: 173833	2013-01-29 21:18:43 +00:00
David Blaikie	9a7a7a9a6f	Support artificial parameters in function types. Provides the functionality for Clang change r172911 - I just had this still lying around. llvm-svn: 173820	2013-01-29 19:35:24 +00:00
Edwin Vane	82f80d4967	Fixing warnings revealed by gcc release build Fixed set-but-not-used warnings. Reviewer: gribozavr llvm-svn: 173810	2013-01-29 17:42:24 +00:00
Andrew Trick	399c9bfcc0	MIsched: cleanup code. Use isBoundaryNode(). llvm-svn: 173775	2013-01-29 06:26:35 +00:00
Evan Cheng	0e88c7d897	Teach SDISel to combine fsin / fcos into a fsincos node if the following conditions are met: 1. They share the same operand and are in the same BB. 2. Both outputs are used. 3. The target has a native instruction that maps to ISD::FSINCOS node or the target provides a sincos library call. Implemented the generic optimization in sdisel and enabled it for Mac OSX. Also added an additional optimization for x86_64 Mac OSX by using an alternative entry point __sincos_stret which returns the two results in xmm0 / xmm1. rdar://13087969 PR13204 llvm-svn: 173755	2013-01-29 02:32:37 +00:00
Bill Schmidt	2e4ae4e154	This patch addresses bug 15031. The common code in the post-RA scheduler to break anti-dependencies on the critical path contained a flaw. In the reported case, an anti-dependency between the overlapping registers %X4 and %R4 exists: %X29<def> = OR8 %X4, %X4 %R4<def>, %X3<def,dead,tied3> = LBZU 1, %X3<kill,tied1> The unpatched code breaks the dependency by replacing %R4 and its uses with %R3, the first register on the available list. However, %R3 and %X3 overlap, so this creates two overlapping definitions on the same instruction. The fix is straightforward, preventing selection of a register that overlaps any other defined register on the same instruction. The test case is reduced from the bug report, and verifies that we no longer produce "lbzu 3, 1(3)" when breaking this anti-dependency. llvm-svn: 173706	2013-01-28 18:36:58 +00:00
Eric Christopher	59cc071165	Fix comment. llvm-svn: 173698	2013-01-28 17:33:26 +00:00
Michael Gottesman	79d8d81226	Extracted ObjCARC.cpp into its own library libLLVMObjCARCOpts in preparation for refactoring the ARC Optimizer. llvm-svn: 173647	2013-01-28 01:35:51 +00:00
Benjamin Kramer	cf9dae17b7	Legalizer: Reword comment again, per Duncan's suggestion. llvm-svn: 173625	2013-01-27 21:02:52 +00:00
Benjamin Kramer	084e675e17	Legalizer: Add an assert and tweak a comment to clarify the assumptions this code makes. llvm-svn: 173620	2013-01-27 15:04:43 +00:00
Benjamin Kramer	05cc93964a	When the legalizer is splitting vector shifts, the result may not have the right shift amount type. Fix that by adding a cast to the shift expander. This came up with vector shifts on sse-less X86 CPUs. <2 x i64> = shl <2 x i64> <2 x i64> -> i64,i64 = shl i64 i64; shl i64 i64 -> i32,i32,i32,i32 = shl_parts i32 i32 i64; shl_parts i32 i32 i64 Now we cast the last two i64s to the right type. Fixes the crash in PR14668. llvm-svn: 173615	2013-01-27 11:19:11 +00:00
Jakub Staszak	c641adae19	Use const reference instead of vector copying. llvm-svn: 173497	2013-01-25 21:44:27 +00:00
Preston Gurd	0959bb707d	This patch aims to reduce compile time in LegalizeTypes by using SmallDenseMap, with an initial number of elements, instead of DenseMap, which has zero initial elements, in order to avoid the copying of elements when the size changes and to avoid allocating space every time LegalizeTypes is run. This patch will not affect the memory footprint, because DenseMap will increase the element size to 64 when the first element is added. Patch by Wan Xiaofei. llvm-svn: 173448	2013-01-25 15:18:54 +00:00
Andrew Trick	54b2ce3812	MIsched: Print block name. No functionality. llvm-svn: 173433	2013-01-25 07:45:31 +00:00
Andrew Trick	ea9fd951a0	MachineScheduler support for viewGraph. llvm-svn: 173432	2013-01-25 07:45:29 +00:00
Andrew Trick	b36388a1cb	ScheduleDAG: colorize the DOT graph and improve formatting. llvm-svn: 173431	2013-01-25 07:45:25 +00:00
Andrew Trick	646eeb6675	ScheduleDAG: Added isBoundaryNode to conveniently detect a common corner case. This fixes DAG subtree analysis at the boundary. llvm-svn: 173427	2013-01-25 06:52:30 +00:00
Andrew Trick	ffc8097c60	SchedDFS: Complete support for nested subtrees. Maintain separate per-node and per-tree book-keeping. Track all instructions above a DAG node including nested subtrees. Seperately track instructions within a subtree. Record subtree parents. llvm-svn: 173426	2013-01-25 06:52:27 +00:00
Andrew Trick	e2c3f5c982	MIsched: Improve the interface to SchedDFS analysis (subtrees). Allow the strategy to select SchedDFS. Allow the results of SchedDFS to affect initialization of the scheduler state. llvm-svn: 173425	2013-01-25 06:33:57 +00:00
Andrew Trick	5b07eeb24a	SchedDFS: Initial support for nested subtrees. This is mostly refactoring, along with adding an instruction count within the subtrees and ensuring we only look at data edges. llvm-svn: 173420	2013-01-25 06:02:44 +00:00
Andrew Trick	44f750a3e5	MISched: Add SchedDFSResult to ScheduleDAGMI to formalize the interface and allow other strategies to select it. llvm-svn: 173413	2013-01-25 04:01:04 +00:00
Andrew Trick	b52a8564ce	SchedDFS: Refactor and tweak the subtree selection criteria. For sanity, create a root when NumDataSuccs >= 4. Splitting large subtrees will no longer be detrimental after my next checkin to handle nested tree. A magic number of 4 is fine because single subtrees seldom rejoin more than this. It makes subtrees easier to visualize and heuristics more sane. llvm-svn: 173399	2013-01-25 00:12:57 +00:00
Jakob Stoklund Olesen	e0ef47436c	Avoid creating duplicate CFG edges in the IfConversion pass. Patch by Stefan Hepp. llvm-svn: 173395	2013-01-24 23:59:08 +00:00
Andrew Trick	92da42406e	MachineScheduler: enable biasCriticalPath for all DAGs. llvm-svn: 173318	2013-01-24 02:09:57 +00:00
Andrew Trick	d3b8629a53	MIsched: Added biasCriticalPath. Allow schedulers to order DAG edges by critical path. This makes DFS-based heuristics more stable and effective. llvm-svn: 173317	2013-01-24 02:09:55 +00:00
Bill Wendling	7c8f96a91b	Add the heuristic to differentiate SSPStrong from SSPRequired. The requirements of the strong heuristic are: * A Protector is required for functions which contain an array, regardless of type or length. * A Protector is required for functions which contain a structure/union which contains an array, regardless of type or length. Note, there is no limit to the depth of nesting. * A protector is required when the address of a local variable (i.e., stack based variable) is exposed. (E.g., such as through a local whose address is taken as part of the RHS of an assignment or a local whose address is taken as part of a function argument.) llvm-svn: 173231	2013-01-23 06:43:53 +00:00
Bill Wendling	d154e283f2	Add the IR attribute 'sspstrong'. SSPStrong applies a heuristic to insert stack protectors in these situations: * A Protector is required for functions which contain an array, regardless of type or length. * A Protector is required for functions which contain a structure/union which contains an array, regardless of type or length. Note, there is no limit to the depth of nesting. * A protector is required when the address of a local variable (i.e., stack based variable) is exposed. (E.g., such as through a local whose address is taken as part of the RHS of an assignment or a local whose address is taken as part of a function argument.) This patch implements the SSPString attribute to be equivalent to SSPRequired. This will change in a subsequent patch. llvm-svn: 173230	2013-01-23 06:41:41 +00:00
Tim Northover	29178a348a	Make APFloat constructor require explicit semantics. Previously we tried to infer it from the bit width size, with an added IsIEEE argument for the PPC/IEEE 128-bit case, which had a default value. This default value allowed bugs to creep in, where it was inappropriate. llvm-svn: 173138	2013-01-22 09:46:31 +00:00
Michael Ilseman	3e3194f4ec	Introduce a new data structure, the SparseMultiSet, and changes to the MI scheduler to use it. A SparseMultiSet adds multiset behavior to SparseSet, while retaining SparseSet's desirable properties. Essentially, SparseMultiSet provides multiset behavior by storing its dense data in doubly linked lists that are inlined into the dense vector. This allows it to provide good data locality as well as vector-like constant-time clear() and fast constant time find(), insert(), and erase(). It also allows SparseMultiSet to have a builtin recycler rather than keeping SparseSet's behavior of always swapping upon removal, which allows it to preserve more iterators. It's often a better alternative to a SparseSet of a growable container or vector-of-vector. llvm-svn: 173064	2013-01-21 18:18:53 +00:00
Nadav Rotem	9450fcfff1	Revert 172708. The optimization handles esoteric cases but adds a lot of complexity both to the X86 backend and to other backends. This optimization disables an important canonicalization of chains of SEXT nodes and makes SEXT and ZEXT asymmetrical. Disabling the canonicalization of consecutive SEXT nodes into a single node disables other DAG optimizations that assume that there is only one SEXT node. The AVX mask optimizations is one example. Additionally this optimization does not update the cost model. llvm-svn: 172968	2013-01-20 08:35:56 +00:00
David Blaikie	a39a76efbc	The last of PR14471 - emission of constant floats llvm-svn: 172941	2013-01-20 01:18:01 +00:00
Eric Christopher	e9ec2458e7	Split out DW_OP_addr for the split debug info DWARF5 proposal. llvm-svn: 172857	2013-01-18 22:11:33 +00:00
Bill Wendling	658d24d211	Use AttributeSet accessor methods instead of Attribute accessor methods. Further encapsulation of the Attribute object. Don't allow direct access to the Attribute object as an aggregate. llvm-svn: 172853	2013-01-18 21:53:16 +00:00
Bill Wendling	4f972ea2d8	Remove unused parameter. Also use the AttributeSet query methods instead of the Attribute query methods. llvm-svn: 172852	2013-01-18 21:50:24 +00:00
Daniel Dunbar	9585612876	[MC/Mach-O] Implement integrated assembler support for linker options. - Also, fixup syntax errors in LangRef and missing newline in the MCAsmStreamer. llvm-svn: 172837	2013-01-18 19:37:00 +00:00
Elena Demikhovsky	f6a30e05d5	Optimization for the following SIGN_EXTEND pairs: v8i8 -> v8i64, v8i8 -> v8i32, v4i8 -> v4i64, v4i16 -> v4i64 for AVX and AVX2. Bug 14865. llvm-svn: 172708	2013-01-17 09:59:53 +00:00
Eric Christopher	4c7765f166	Fix the assembly and dissassembly of DW_FORM_sec_offset. Found this by changing both the string of the dwo_name to be correct and the type of the statement list. Testcases all around. llvm-svn: 172699	2013-01-17 03:00:04 +00:00
Eric Christopher	1826617133	Add the DW_AT_GNU_addr_base for the skeleton cu. Add support for emitting the dwarf32 version of DW_FORM_sec_offset and correct disassembler support. llvm-svn: 172698	2013-01-17 02:59:59 +00:00
Jakob Stoklund Olesen	965665bb48	Move MachineTraceMetrics.h into include/llvm/CodeGen. Let targets use it. llvm-svn: 172688	2013-01-17 01:06:04 +00:00
Jakob Stoklund Olesen	213a2f8b3f	Provide a place for targets to insert ILP optimization passes. Move the early if-conversion pass into this group. ILP optimizations usually need to find the right balance between register pressure and ILP using the MachineTraceMetrics analysis to identify critical paths and estimate other costs. Such passes should run together so they can share dominator tree and loop info analyses. Besides if-conversion, future passes to run here here could include expression height reduction and ARM's MLxExpansion pass. llvm-svn: 172687	2013-01-17 00:58:38 +00:00
Eric Christopher	4d23a4ae1f	Define metadata interfaces for describing a static data member of a class. Emit static data member declarations and definitions through correctly. Part of PR14471. Patch by Paul Robinson! llvm-svn: 172590	2013-01-16 01:22:23 +00:00
Eric Christopher	962c9089d9	Split address information for DWARF5 split dwarf proposal. This involves using the DW_FORM_GNU_addr_index and a separate .debug_addr section which stays in the executable and is fully linked. Sneak in two other small changes: a) Print out the debug_str_offsets.dwo section. b) Change form we're expecting the entries in the debug_str_offsets.dwo section to take from ULEB128 to U32. Add tests for all of this in the fission-cu.ll test. llvm-svn: 172578	2013-01-15 23:56:56 +00:00
Bill Schmidt	d006c6938b	This patch addresses an incorrect transformation in the DAG combiner. The included test case is derived from one of the GCC compatibility tests. The problem arises after the selection DAG has been converted to type-legalized form. The combiner first sees a 64-bit load that can be converted into a pre-increment form. The original load feeds into a SRL that isolates the upper 32 bits of the loaded doubleword. This looks like an opportunity for DAGCombiner::ReduceLoadWidth() to replace the 64-bit load with a 32-bit load. However, this transformation is not valid, as the replacement load is not a pre-increment load. The pre-increment load produces an extra result, which feeds a subsequent add instruction. The replacement load only has one result value, and this value is propagated to all uses of the pre- increment load, including the add. Because the add is looking for the second result value as its operand, it ends up attempting to add a constant to a token chain, resulting in a crash. So the patch simply disables this transformation for any load with more than two result values. llvm-svn: 172480	2013-01-14 22:04:38 +00:00
Benjamin Kramer	5ea0349ef5	When lowering an inreg sext first shift left, then right arithmetically. Shifting right two times will only yield zero. Should fix SingleSource/UnitTests/SignlessTypes/factor. llvm-svn: 172322	2013-01-12 19:06:44 +00:00
Jakob Stoklund Olesen	3dd236cdd8	Limit the search space in RAGreedy::tryEvict(). When tryEvict() is looking for a cheaper register in the allocation order, skip the tail of too expensive registers when possible. llvm-svn: 172281	2013-01-12 00:57:44 +00:00
Jakob Stoklund Olesen	8f644449af	Precompute some information about register costs. Remember the minimum cost of the registers in an allocation order and the number of registers at the end of the allocation order that have the same cost per use. This information can be used to limit the search space for RAGreedy::tryEvict() when looking for a cheaper register. llvm-svn: 172280	2013-01-12 00:54:59 +00:00
Nadav Rotem	dbe5c72d03	PPC: Implement efficient lowering of sign_extend_inreg. llvm-svn: 172269	2013-01-11 22:57:48 +00:00
Benjamin Kramer	fb3c009b52	Remove some accidentaly duplicated code. This needs urgent cleanup :( llvm-svn: 172248	2013-01-11 20:11:33 +00:00
Benjamin Kramer	56b31bd9d7	Split TargetLowering into a CodeGen and a SelectionDAG part. This fixes some of the cycles between libCodeGen and libSelectionDAG. It's still a complete mess but as long as the edges consist of virtual call it doesn't cause breakage. BasicTTI did static calls and thus broke some build configurations. llvm-svn: 172246	2013-01-11 20:05:37 +00:00
Nadav Rotem	e55aa3c848	ARM Cost Model: Modify the target independent cost model to ask the target if it supports the different CAST types. We didn't do this on X86 because of the different register sizes and types, but on ARM this makes sense. llvm-svn: 172245	2013-01-11 19:54:13 +00:00
Eric Christopher	0cb6fd930e	For inline asm: - recognize string "{memory}" in the MI generation - mark as mayload/maystore when there's a memory clobber constraint. PR14859. Patch by Krzysztof Parzyszek llvm-svn: 172228	2013-01-11 18:12:39 +00:00
Andrew Trick	5907292dfd	Follow-up typo correction from building the wrong branch. llvm-svn: 172224	2013-01-11 17:51:16 +00:00
Andrew Trick	ae182ce4fc	Fix typo from r170452. Affects -enable-misched heuristics. llvm-svn: 172223	2013-01-11 17:46:50 +00:00
Tim Northover	3a51aab390	Simplify writing floating types to assembly. This removes previous special cases for each floating-point type in favour of a shared codepath. llvm-svn: 172189	2013-01-11 10:36:13 +00:00
Evan Cheng	c8444b159a	PR14896: Handle memcpy from constant string where the memcpy size is larger than the string size. llvm-svn: 172124	2013-01-10 22:13:27 +00:00
Jakub Staszak	a9286e931c	Remove unneeded includes from FunctionLoweringInfo.h. llvm-svn: 172123	2013-01-10 22:13:13 +00:00
Jakob Stoklund Olesen	f0615c71fa	Allow hasProperty() to be called on bundle-internal instructions. When calling hasProperty() on an instruction inside a bundle, it should always behave as if IgnoreBundle was passed, and just return properties for the current instruction. Only attempt to aggregate bundle properties whan asked about the bundle header. The assertion fires on existing ARM test cases without this fix. llvm-svn: 172082	2013-01-10 18:42:44 +00:00
Jakob Stoklund Olesen	55a7be237e	Support headerless bundles in MachineInstr::hasProperty(). This function can still work without a BUNDLE header instruction. llvm-svn: 172029	2013-01-10 01:29:42 +00:00
Manman Ren	207bcbacca	Stack Alignment: throw error if we can't satisfy the minimal alignment requirement when creating stack objects in MachineFrameInfo. Add CreateStackObjectWithMinAlign to throw error when the minimal alignment can't be achieved and to clamp the alignment when the preferred alignment can't be achieved. Same is true for CreateVariableSizedObject. Will not emit error in CreateSpillStackObject or CreateStackObject. As long as callers of CreateStackObject do not assume the object will be aligned at the requested alignment, we should not have miscompile since later optimizations which look at the object's alignment will have the correct information. rdar://12713765 llvm-svn: 172027	2013-01-10 01:10:10 +00:00
Nadav Rotem	b1791a75cd	ARM Cost model: Use the size of vector registers and widest vectorizable instruction to determine the max vectorization factor. llvm-svn: 172010	2013-01-09 22:29:00 +00:00
Evan Cheng	5652a8df32	Fix a DAG combine bug visitBRCOND() is transforming br(xor(x, y)) to br(x != y). It cahced XOR's operands before calling visitXOR() but failed to update the operands when visitXOR changed the XOR node. rdar://12968664 llvm-svn: 171999	2013-01-09 20:56:40 +00:00
Jakob Stoklund Olesen	6922e9ca7e	Don't print bundle flags. The bundle flags are used by MachineBasicBlock::print(), they don't need to clutter up individual MachineInstrs. llvm-svn: 171986	2013-01-09 18:35:09 +00:00
Jakob Stoklund Olesen	68d752bf6b	Don't require BUNDLE headers in MachineInstr::getBundleSize(). It is possible to build MI bundles that don't begin with a BUNDLE header. Add support for such bundles, counting all instructions inside the bundle. llvm-svn: 171985	2013-01-09 18:28:16 +00:00
Sergei Larin	3b46d7ea26	Fix a typo in MachineInstr::unbundleFromSucc() method. llvm-svn: 171983	2013-01-09 17:54:33 +00:00
Tim Northover	f1450d8d7c	Refactor to expose RTLIB calls to targets. fp128 is almost but not quite completely illegal as a type on AArch64. As a result it needs to have a register class (for argument passing mainly), but all operations need to be lowered to runtime calls. Currently there's no way for targets to do this (without duplicating code), as the relevant functions are hidden in SelectionDAG. This patch changes that. llvm-svn: 171971	2013-01-09 13:18:15 +00:00
Eric Christopher	bf7bc4966c	Last in the series of removing unnecessary '0' arguments for address space. Reordered the EmitULEB128IntValue arguments to make this easier. llvm-svn: 171949	2013-01-09 03:52:05 +00:00
Andrew Trick	9f0b95f260	MIsched: add an ILP window property to machine model. This was an experimental option, but needs to be defined per-target. e.g. PPC A2 needs to aggressively hide latency. I converted some in-order scheduling tests to A2. Hal is working on more test cases. llvm-svn: 171946	2013-01-09 03:36:49 +00:00
Eric Christopher	e3ab3d0e2c	These functions have default arguments of 0 for the last arg. Use them. llvm-svn: 171933	2013-01-09 01:57:54 +00:00
Eric Christopher	ce0cfce775	These functions have default arguments of 0 for the last arg. Use them and add one where it seemed obvious that we wanted one. llvm-svn: 171932	2013-01-09 01:35:34 +00:00
Nadav Rotem	b696c36fcd	Cost Model: Move the 'max unroll factor' variable to the TTI and add initial Cost Model support on ARM. llvm-svn: 171928	2013-01-09 01:15:42 +00:00
Eric Christopher	27614589dc	Move the string pools down into the units. No functional change. llvm-svn: 171905	2013-01-08 22:22:06 +00:00
Tim Northover	4bf47bc9d7	Add fp128 rtlib function names to LLVM llvm-svn: 171867	2013-01-08 17:09:59 +00:00
Tim Northover	7bb9992cce	Allow the asm printer to print fp128 values properly. llvm-svn: 171866	2013-01-08 16:56:23 +00:00
Chandler Carruth	a7c44e6ec6	Sink a function that refers to the SelectionDAG into that library in the one file where it is called as a static function. Nuke the declaration and the definition in lib/CodeGen, along with the include of SelectionDAG.h from this file. There is no dependency edge from lib/CodeGen to lib/CodeGen/SelectionDAG, so it isn't valid for a routine in lib/CodeGen to reference the DAG. There is a dependency from lib/CodeGen/SelectionDAG on lib/CodeGen. This breaks one violation of this layering. llvm-svn: 171842	2013-01-08 05:11:57 +00:00
Eric Christopher	72a529566c	Remove the llvm-local DW_TAG_vector_type tag and add a test to make sure that vector types do work. llvm-svn: 171833	2013-01-08 01:53:52 +00:00
Jakob Stoklund Olesen	a1b246df57	Pack MachineOperand bitfields better. Previously, 4 bits were unused. llvm-svn: 171814	2013-01-07 23:21:44 +00:00
Jakob Stoklund Olesen	5adc4a12fb	Pack MachineInstr fields better. This shrinks MachineInstr to 64 bytes (from 72). llvm-svn: 171813	2013-01-07 23:21:41 +00:00
Eric Christopher	b800ff70c5	Whitespace and 80-col. llvm-svn: 171803	2013-01-07 22:40:45 +00:00
Eric Christopher	fba226004d	Add more comments to what's going on here. llvm-svn: 171780	2013-01-07 19:32:45 +00:00
Eric Christopher	2cbd5767ad	Add support for separating strings for the split debug info DWARF5 proposal. This leaves the strings in the skeleton die as strp, but in all dwo files they're accessed now via DW_FORM_GNU_str_index. Add support for dumping these sections and modify the fission-cu.ll testcase to have the correct strings and form. Fix a small bug in the fixed form sizes routine that involved out of array accesses for the table and add a FIXME in the extractFast routine to fix this up. llvm-svn: 171779	2013-01-07 19:32:41 +00:00
Chandler Carruth	95f83e0155	Sink AddrMode back into TargetLowering, removing one of the most peculiar headers under include/llvm. This struct still doesn't make a lot of sense, but it makes more sense down in TargetLowering than it did before. llvm-svn: 171739	2013-01-07 15:14:13 +00:00
Chandler Carruth	26c59fa870	Switch the SCEV expander and LoopStrengthReduce to use TargetTransformInfo rather than TargetLowering, removing one of the primary instances of the layering violation of Transforms depending directly on Target. This is a really big deal because LSR used to be a "special" pass that could only be tested fully using llc and by looking at the full output of it. It also couldn't run with any other loop passes because it had to be created by the backend. No longer is this true. LSR is now just a normal pass and we should probably lift the creation of LSR out of lib/CodeGen/Passes.cpp and into the PassManagerBuilder. =] I've not done this, or updated all of the tests to use opt and a triple, because I suspect someone more familiar with LSR would do a better job. This change should be essentially without functional impact for normal compilations, and only change behvaior of targetless compilations. The conversion required changing all of the LSR code to refer to the TTI interfaces, which fortunately are very similar to TargetLowering's interfaces. However, it also allowed us to always expect to have some implementation around. I've pushed that simplification through the pass, and leveraged it to simplify code somewhat. It required some test updates for one of two things: either we used to skip some checks altogether but now we get the default "no" answer for them, or we used to have no information about the target and now we do have some. I've also started the process of removing AddrMode, as the TTI interface doesn't use it any longer. In some cases this simplifies code, and in others it adds some complexity, but I think it's not a bad tradeoff even there. Subsequent patches will try to clean this up even further and use other (more appropriate) abstractions. Yet again, almost all of the formatting changes brought to you by clang-format. =] llvm-svn: 171735	2013-01-07 14:41:08 +00:00
David Blaikie	5d3249b554	PR14759: Debug info support for C++ member pointers. This works fine with GDB for member variable pointers, but GDB's support for member function pointers seems to be quite unrelated to DW_TAG_ptr_to_member_type. (see GDB bug 14998 for details) llvm-svn: 171698	2013-01-07 05:51:15 +00:00
Chandler Carruth	d3e73556d6	Move TargetTransformInfo to live under the Analysis library. This no longer would violate any dependency layering and it is in fact an analysis. =] llvm-svn: 171686	2013-01-07 03:08:10 +00:00
Chandler Carruth	664e354de7	Switch TargetTransformInfo from an immutable analysis pass that requires a TargetMachine to construct (and thus isn't always available), to an analysis group that supports layered implementations much like AliasAnalysis does. This is a pretty massive change, with a few parts that I was unable to easily separate (sorry), so I'll walk through it. The first step of this conversion was to make TargetTransformInfo an analysis group, and to sink the nonce implementations in ScalarTargetTransformInfo and VectorTargetTranformInfo into a NoTargetTransformInfo pass. This allows other passes to add a hard requirement on TTI, and assume they will always get at least on implementation. The TargetTransformInfo analysis group leverages the delegation chaining trick that AliasAnalysis uses, where the base class for the analysis group delegates to the previous analysis pass, allowing all but tho NoFoo analysis passes to only implement the parts of the interfaces they support. It also introduces a new trick where each pass in the group retains a pointer to the top-most pass that has been initialized. This allows passes to implement one API in terms of another API and benefit when some other pass above them in the stack has more precise results for the second API. The second step of this conversion is to create a pass that implements the TargetTransformInfo analysis using the target-independent abstractions in the code generator. This replaces the ScalarTargetTransformImpl and VectorTargetTransformImpl classes in lib/Target with a single pass in lib/CodeGen called BasicTargetTransformInfo. This class actually provides most of the TTI functionality, basing it upon the TargetLowering abstraction and other information in the target independent code generator. The third step of the conversion adds support to all TargetMachines to register custom analysis passes. This allows building those passes with access to TargetLowering or other target-specific classes, and it also allows each target to customize the set of analysis passes desired in the pass manager. The baseline LLVMTargetMachine implements this interface to add the BasicTTI pass to the pass manager, and all of the tools that want to support target-aware TTI passes call this routine on whatever target machine they end up with to add the appropriate passes. The fourth step of the conversion created target-specific TTI analysis passes for the X86 and ARM backends. These passes contain the custom logic that was previously in their extensions of the ScalarTargetTransformInfo and VectorTargetTransformInfo interfaces. I separated them into their own file, as now all of the interface bits are private and they just expose a function to create the pass itself. Then I extended these target machines to set up a custom set of analysis passes, first adding BasicTTI as a fallback, and then adding their customized TTI implementations. The fourth step required logic that was shared between the target independent layer and the specific targets to move to a different interface, as they no longer derive from each other. As a consequence, a helper functions were added to TargetLowering representing the common logic needed both in the target implementation and the codegen implementation of the TTI pass. While technically this is the only change that could have been committed separately, it would have been a nightmare to extract. The final step of the conversion was just to delete all the old boilerplate. This got rid of the ScalarTargetTransformInfo and VectorTargetTransformInfo classes, all of the support in all of the targets for producing instances of them, and all of the support in the tools for manually constructing a pass based around them. Now that TTI is a relatively normal analysis group, two things become straightforward. First, we can sink it into lib/Analysis which is a more natural layer for it to live. Second, clients of this interface can depend on it always being available which will simplify their code and behavior. These (and other) simplifications will follow in subsequent commits, this one is clearly big enough. Finally, I'm very aware that much of the comments and documentation needs to be updated. As soon as I had this working, and plausibly well commented, I wanted to get it committed and in front of the build bots. I'll be doing a few passes over documentation later if it sticks. Commits to update DragonEgg and Clang will be made presently. llvm-svn: 171681	2013-01-07 01:37:14 +00:00
Chandler Carruth	42e9611f15	Funnel the actual TargetTransformInfo pass from the SelectionDAGISel pass into the SelectionDAG itself rather than snooping on the implementation of that pass as exposed by the TargetMachine. This removes the last direct client of the ScalarTargetTransformInfo class outside of the TTI pass implementation. llvm-svn: 171625	2013-01-05 12:32:17 +00:00
David Blaikie	800a916f99	Emit DW_TAG_formal_parameter for unnamed parameters. This change essentially reverts r87069 which came without a test case. It causes no regressions in the GDB 7.5 test suite & fixes 25 xfails (commit to the test suite to follow). If anyone can present a test case that demonstrates why this check is necessary I'd be happy to account for it in one way or another. llvm-svn: 171609	2013-01-05 07:43:02 +00:00
Jakob Stoklund Olesen	dc5285f102	Don't call destructors on MachineInstr and MachineOperand. The series of patches leading up to this one makes llc -O0 run 8% faster. When deallocating a MachineFunction, there is no need to visit all MachineInstr and MachineOperand objects to deallocate them. All their memory come from a BumpPtrAllocator that is about to be purged, and they have empty destructors anyway. This only applies when deallocating the MachineFunction. DeleteMachineInstr() should still be used to recycle MI memory during the codegen passes. Remove the LeakDetector support for MachineInstr. I've never seen it used before, and now it definitely doesn't work. With this patch, leaked MachineInstrs would be much less of a problem since all of their memory will be reclaimed by ~MachineFunction(). llvm-svn: 171599	2013-01-05 05:05:51 +00:00
Jakob Stoklund Olesen	1bfeecb491	Use ArrayRecycler for MachineInstr operand lists. Instead of an std::vector<MachineOperand>, use MachineOperand arrays from an ArrayRecycler living in MachineFunction. This has several advantages: - MachineInstr now has a trivial destructor, making it possible to delete them in batches when destroying MachineFunction. This will be enabled in a later patch. - Bypassing malloc() and free() can be faster, depending on the system library. - MachineInstr objects and their operands are allocated from the same BumpPtrAllocator, so they will usually be next to each other in memory, providing better locality of reference. - Reduce MachineInstr footprint. A std::vector is 24 bytes, the new operand array representation only uses 8+4+1 bytes in MachineInstr. - Better control over operand array reallocations. In the old representation, the use-def chains would be reordered whenever a std::vector reached its capacity. The new implementation never changes the use-def chain order. Note that some decisions in the code generator depend on the use-def chain orders, so this patch may cause different assembly to be produced in a few cases. llvm-svn: 171598	2013-01-05 05:00:09 +00:00
Jakob Stoklund Olesen	fe445cd646	Add MachineRegisterInfo::moveOperands(). This function works like memmove() for MachineOperands, except it also updates any use-def chains containing the moved operands. The use-def chains are updated without affecting the order of operands in the list. That isn't possible when using the removeRegOperandFromUseList() and addRegOperandToUseList() functions. Callers to follow soon. llvm-svn: 171597	2013-01-05 04:38:12 +00:00
Jakob Stoklund Olesen	7f92b7ad0a	Move an assertion so it doesn't dereference end(). The R600 target has test cases that exercises this code. llvm-svn: 171538	2013-01-04 22:17:31 +00:00
Eric Christopher	cad9b53c02	Add a name for the anonymous type we're creating for subrange types and a FIXME for what we should be doing. Should solve the immediacy of PR12069 where our debug info is crashing another tool. llvm-svn: 171536	2013-01-04 21:51:53 +00:00
Pedro Artigas	3383225167	small fixes to enable the reuse of the pass manager across multiple modules llvm-svn: 171475	2013-01-04 18:04:42 +00:00
Jakob Stoklund Olesen	725d57682b	Fix PR14732 by handling all kinds of IMPLICIT_DEF live ranges. Most IMPLICIT_DEF instructions are removed by the ProcessImplicitDefs pass, and a few are reinserted by PHIElimination when a PHI argument is <undef>. RegisterCoalescer was assuming that all IMPLICIT_DEF live ranges look like those created by PHIElimination, and that their live range never leaves the basic block. The PR14732 test case does tricks with PHI nodes that causes a longer IMPLICIT_DEF live range to appear. This happens very rarely, but RegisterCoalescer should be able to handle it. llvm-svn: 171435	2013-01-03 00:47:51 +00:00
Tom Stellard	567f886eb0	DAGCombiner: Avoid generating illegal vector INT_TO_FP nodes DAGCombiner::reduceBuildVecConvertToConvertBuildVec() was making two mistakes: 1. It was checking the legality of scalar INT_TO_FP nodes and then generating vector nodes. 2. It was passing the result value type to TargetLoweringInfo::getOperationAction() when it should have been passing the value type of the first operand. llvm-svn: 171420	2013-01-02 22:13:01 +00:00
Chandler Carruth	9fb823bbd4	Move all of the header files which are involved in modelling the LLVM IR into their new header subdirectory: include/llvm/IR. This matches the directory structure of lib, and begins to correct a long standing point of file layout clutter in LLVM. There are still more header files to move here, but I wanted to handle them in separate commits to make tracking what files make sense at each layer easier. The only really questionable files here are the target intrinsic tablegen files. But that's a battle I'd rather not fight today. I've updated both CMake and Makefile build systems (I think, and my tests think, but I may have missed something). I've also re-sorted the includes throughout the project. I'll be committing updates to Clang, DragonEgg, and Polly momentarily. llvm-svn: 171366	2013-01-02 11:36:10 +00:00
Chandler Carruth	be81023d74	Resort the #include lines in include/... and lib/... with the utils/sort_includes.py script. Most of these are updating the new R600 target and fixing up a few regressions that have creeped in since the last time I sorted the includes. llvm-svn: 171362	2013-01-02 10:22:59 +00:00
Hal Finkel	6dbdd4307b	Support ppcf128 in SelectionDAG::getConstantFP Fixes pr14751. Patch by Kai; Thanks! llvm-svn: 171261	2012-12-30 19:03:32 +00:00
Nuno Lopes	b6ad98224a	convert a bunch of callers from DataLayout::getIndexedOffset() to GEP::accumulateConstantOffset(). The later API is nicer than the former, and is correct regarding wrap-around offsets (if anyone cares). There are a few more places left with duplicated code, which I'll remove soon. llvm-svn: 171259	2012-12-30 16:25:48 +00:00
Bill Wendling	74dba875e2	Remove the Function::getRetAttributes method in favor of using the AttributeSet accessor method. llvm-svn: 171256	2012-12-30 13:01:51 +00:00
Bill Wendling	94dcaf8e2b	Remove Function::getParamAttributes and use the AttributeSet accessor methods instead. llvm-svn: 171255	2012-12-30 12:45:13 +00:00
Bill Wendling	698e84fc4f	Remove the Function::getFnAttributes method in favor of using the AttributeSet directly. This is in preparation for removing the use of the 'Attribute' class as a collection of attributes. That will shift to the AttributeSet class instead. llvm-svn: 171253	2012-12-30 10:32:01 +00:00
Nadav Rotem	b1dd52450e	Refactor DAGCombinerInfo. Change the different booleans that indicate if we are before or after different runs of DAGCo, with the CombineLevel enum. Also, added a new API for checking if we are running before or after the LegalizeVectorOps phase. llvm-svn: 171142	2012-12-27 06:47:41 +00:00
Eric Christopher	3bf29fda91	For the dwarf5 split debug info code split out the string section per compile unit/skeleton compile unit. Update tests accordingly. llvm-svn: 171133	2012-12-27 02:14:01 +00:00
Bob Wilson	fe73ac34c5	Rename LLVMContext diagnostic handler types and functions. These are now generally used for all diagnostics from the backend, not just for inline assembly, so this drops the "InlineAsm" from the names. No functional change. (I've left aliases for the old names but only for long enough to let me switch over clang to use the new ones.) llvm-svn: 171047	2012-12-25 00:07:12 +00:00
Bob Wilson	4ed23578da	Add LLVMContext::emitWarning methods and use them. <rdar://problem/12867368> When the backend is used from clang, it should produce proper diagnostics instead of just printing messages to errs(). Other clients may also want to register their own error handlers with the LLVMContext, and the same handler should work for warnings in the same way as the existing emitError methods. llvm-svn: 171041	2012-12-24 18:15:21 +00:00
Jakob Stoklund Olesen	7bca670a8b	Remove a special case that doesn't seem necessary any longer. Back when this exception was added, it was skipping a lot more code, but now it just looks like a premature optimization. llvm-svn: 170989	2012-12-22 17:33:22 +00:00
Jakob Stoklund Olesen	b089483993	Use getNumOperands() instead of Operands.size(). The representation of the Operands array is going to change soon so it can be allocated from a BumpPtrAllocator. llvm-svn: 170988	2012-12-22 17:13:06 +00:00
Roman Divacky	a229186a82	Remove duplicate includes. llvm-svn: 170902	2012-12-21 17:06:44 +00:00
Evan Cheng	59421aee3d	Add targets to skip running the GC passes. llvm-svn: 170836	2012-12-21 02:57:04 +00:00
Jakob Stoklund Olesen	2455b58551	Require the two-argument MI::addOperand(MF, MO) for dangling instructions. Instructions that are inserted in a basic block can still be decorated with addOperand(MO). Make the two-argument addOperand() function contain the actual implementation. This function will now always have a valid MF reference that it can use for memory allocation. llvm-svn: 170798	2012-12-20 22:54:05 +00:00
Jakob Stoklund Olesen	33f5d1492d	Add an MF argument to MI::copyImplicitOps(). This function is often used to decorate dangling instructions, so a context reference is required to allocate memory for the operands. Also add a corresponding MachineInstrBuilder method. llvm-svn: 170797	2012-12-20 22:54:02 +00:00
Jakob Stoklund Olesen	ac4210eacb	Use two-arg addOperand(MF, MO) internally in MachineInstr when possible. llvm-svn: 170796	2012-12-20 22:53:58 +00:00
Eric Christopher	48fef599a4	Whitespace and 80-column cleanup. llvm-svn: 170771	2012-12-20 21:58:40 +00:00
Eric Christopher	e698f53740	Start splitting out the debug string section handling by moving it into the DwarfUnits class. llvm-svn: 170770	2012-12-20 21:58:36 +00:00
Jakob Stoklund Olesen	00b28ecfae	Remove two dead functions. llvm-svn: 170766	2012-12-20 21:12:42 +00:00
Jakob Stoklund Olesen	2705333253	Use MachineInstrBuilder for PHI nodes in SelectionDAGISel. llvm-svn: 170716	2012-12-20 18:46:29 +00:00
Jakob Stoklund Olesen	b109a7b430	Use MachineInstrBuilder in InstrEmitter. This is supposed to be a mechanical change with no functional effects. InstrEmitter can generate all types of MachineOperands which revealed that MachineInstrBuilder was missing a few methods, added by this patch. Besides providing a context pointer to MI::addOperand(), MachineInstrBuilder seems like a better fit for this code. llvm-svn: 170712	2012-12-20 18:08:09 +00:00
Jakob Stoklund Olesen	f623e9870d	Use MachineInstrBuilder in a few CodeGen passes. This automatically passes a context pointer to MI->addOperand(). llvm-svn: 170711	2012-12-20 18:08:06 +00:00
Bob Wilson	3365b80290	Do not introduce vector operations in functions marked with noimplicitfloat. <rdar://problem/12879313> llvm-svn: 170630	2012-12-20 01:36:20 +00:00
Dmitri Gribenko	349d1a35ff	Add a missing 'else'. Found by grep '} if' No testcase because it is apparently not so trivial to construct. llvm-svn: 170595	2012-12-19 22:13:01 +00:00
Eric Christopher	3c5a1914b6	Split out abbreviations for the skeleton info from the rest of the abbreviations. Part of implementing split dwarf. llvm-svn: 170589	2012-12-19 22:02:53 +00:00
Jakob Stoklund Olesen	b159b5ff0d	Remove the explicit MachineInstrBuilder(MI) constructor. Use the version that also takes an MF reference instead. It would technically be possible to extract an MF reference from the MI as MI->getParent()->getParent(), but that would not work for MIs that are not inserted into any basic block. Given the reasonably small number of places this constructor was used at all, I preferred the compile time check to a run time assertion. llvm-svn: 170588	2012-12-19 21:31:56 +00:00
Patrik Hagglund	f9934613e8	Change AsmOperandInfo::ConstraintVT to MVT, instead of EVT. Accordingly, add MVT::getVT. llvm-svn: 170550	2012-12-19 15:19:11 +00:00
Patrik Hagglund	00e7a11904	Split the usage of 'EVT PartVT' into 'MVT PartVT' and 'EVT PartEVT'. llvm-svn: 170540	2012-12-19 12:33:30 +00:00
Patrik Hagglund	4e0f828686	Change RegVT in BitTestBlock and RegsForValue, to contain MVTs, instead of EVTs. llvm-svn: 170538	2012-12-19 12:23:01 +00:00
Patrik Hagglund	e09cac9a67	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. llvm-svn: 170537	2012-12-19 12:02:25 +00:00
Patrik Hagglund	3f1905199b	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 170536	2012-12-19 11:53:21 +00:00
Patrik Hagglund	bad545ccba	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 170535	2012-12-19 11:48:16 +00:00
Patrik Hagglund	93060569ba	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 170534	2012-12-19 11:42:00 +00:00
Patrik Hagglund	f9eb168ef4	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 170532	2012-12-19 11:30:36 +00:00
Patrik Hagglund	fd41b5b969	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 170529	2012-12-19 11:21:04 +00:00
Patrik Hagglund	ffd057a3e1	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 170524	2012-12-19 10:19:55 +00:00
Patrik Hagglund	deee9003ed	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 170522	2012-12-19 10:09:26 +00:00
Patrik Hagglund	d7cdcf8cb5	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 170510	2012-12-19 08:28:51 +00:00
Elena Demikhovsky	14a4af0e66	Optimized load + SIGN_EXTEND patterns in the X86 backend. llvm-svn: 170506	2012-12-19 07:50:20 +00:00
Nadav Rotem	33360d8ae9	After reducing the size of an operation in the DAG we zero-extend the reduced bitwidth op back to the original size. If we reduce ANDs then this can cause an endless loop. This patch changes the ZEXT to ANY_EXTEND if the demanded bits are equal or smaller than the size of the reduced operation. llvm-svn: 170505	2012-12-19 07:39:08 +00:00
Bill Wendling	3d7b0b8ac7	Rename the 'Attributes' class to 'Attribute'. It's going to represent a single attribute in the future. llvm-svn: 170502	2012-12-19 07:18:57 +00:00
Craig Topper	3f194c8f4f	Remove more of 'else's after 'returns'. No functional change. llvm-svn: 170497	2012-12-19 06:43:58 +00:00
Craig Topper	5dd8291cbe	Remove a bunch of 'else's after 'returns' llvm-svn: 170496	2012-12-19 06:39:17 +00:00
Craig Topper	63f5921776	Teach SimplifySetCC that comparing AssertZext i1 against a constant 1 can be rewritten as a compare against a constant 0 with the opposite condition. llvm-svn: 170495	2012-12-19 06:12:28 +00:00
Jakob Stoklund Olesen	d742533dbc	Use bidirectional bundle flags to simplify important functions. The bundle_iterator::operator++ function now doesn't need to dig out the basic block and check against end(). It can use the isBundledWithSucc() flag to find the last bundled instruction safely. Similarly, MachineInstr::isBundled() no longer needs to look at iterators etc. It only has to look at flags. llvm-svn: 170473	2012-12-18 23:21:49 +00:00
Jakob Stoklund Olesen	00f6c7754b	Verify bundle flag consistency when setting them. Now that the bundle flag aware APIs are all in place, it is possible to continuously verify the flag consistency. llvm-svn: 170465	2012-12-18 23:00:28 +00:00
Jakob Stoklund Olesen	29c277197e	Verify bundle flags for consistency in MachineVerifier. The new bidirectional bundle flags are redundant, so inadvertent bundle tearing can be detected in the machine code verifier. llvm-svn: 170463	2012-12-18 22:55:07 +00:00
Jakob Stoklund Olesen	a33f504b3e	Don't allow the automatically updated MI flags to be set directly. The bundle-related MI flags need to be kept in sync with the neighboring instructions. Don't allow the bulk flag-setting setFlags() function to change them. Also don't copy MI flags when cloning an instruction. The clone's bundle flags will be set when it is explicitly inserted into a bundle. llvm-svn: 170459	2012-12-18 21:36:05 +00:00
Jakob Stoklund Olesen	78eaf05fa7	Tighten up the splice() API for bundled instructions. Remove the instr_iterator versions of the splice() functions. It doesn't seem useful to be able to splice sequences of instructions that don't consist of full bundles. The normal splice functions that take MBB::iterator arguments are not changed, and they can move whole bundles around without any problems. llvm-svn: 170456	2012-12-18 20:59:41 +00:00
Andrew Trick	ec2564818c	MISched: add dependence to ExitSU to model live-out latency. llvm-svn: 170454	2012-12-18 20:53:01 +00:00
Andrew Trick	ef23569858	MISched: Cleanup, redundant statement. llvm-svn: 170453	2012-12-18 20:52:58 +00:00
Andrew Trick	d6d5ad3d7b	MISched: Heuristics, compare latency more precisely. It matters more for some targets. llvm-svn: 170452	2012-12-18 20:52:56 +00:00
Andrew Trick	44f54d97a4	MISched: Remove SchedRemainder::IsResourceLimited. I don't know how to compute it. llvm-svn: 170451	2012-12-18 20:52:54 +00:00
Andrew Trick	493b867b5d	MISched: cleanup, use the proper iterator type. llvm-svn: 170450	2012-12-18 20:52:52 +00:00
Andrew Trick	ffb6168e85	MISched: minor improvement, initialize remaining resources before the first scheduling decision. llvm-svn: 170449	2012-12-18 20:52:49 +00:00
Jakob Stoklund Olesen	422e07b091	Tighten the insert() API for bundled instructions. The normal insert() function takes an MBB::iterator position, and inserts a stand-alone MachineInstr as before. The insert() function that takes an MBB::instr_iterator position can insert instructions inside a bundle, and will now update the bundle flags correctly when that happens. When the insert position is between two bundles, it is unclear whether the instruction should be appended to the previous bundle, prepended to the next bundle, or stand on its own. The MBB::insert() function doesn't bundle the instruction in that case, use the MIBundleBuilder class for that. llvm-svn: 170437	2012-12-18 17:54:53 +00:00
Hal Finkel	943f76d1b3	Check multiple register classes for inline asm tied registers A register can be associated with several distinct register classes. For example, on PPC, the floating point registers are each associated with both F4RC (which holds f32) and F8RC (which holds f64). As a result, this code would fail when provided with a floating point register and an f64 operand because it would happen to find the register in the F4RC class first and return that. From the F4RC class, SDAG would extract f32 as the register type and then assert because of the invalid implied conversion between the f64 value and the f32 register. Instead, search all register classes. If a register class containing the the requested register has the requested type, then return that register class. Otherwise, as before, return the first register class found that contains the requested register. llvm-svn: 170436	2012-12-18 17:50:58 +00:00
Jakob Stoklund Olesen	ccfb5fb472	Tighten up the erase/remove API for bundled instructions. Most code is oblivious to bundles and uses the MBB::iterator which only visits whole bundles. MBB::erase() operates on whole bundles at a time as before. MBB::remove() now refuses to remove bundled instructions. It is not safe to remove all instructions in a bundle without deleting them since there is no way of returning pointers to all the removed instructions. MBB::remove_instr() and MBB::erase_instr() will now update bundle flags correctly, lifting individual instructions out of bundles while leaving the remaining bundle intact. The MachineInstr convenience functions are updated so eraseFromParent() erases a whole bundle as before eraseFromBundle() erases a single instruction, leaving the rest of its bundle. removeFromParent() refuses to operate on bundled instructions, and removeFromBundle() lifts a single instruction out of its bundle. These functions will no longer accidentally split or coalesce bundles - bundle flags are updated to preserve the existing bundling, and explicit bundleWith* / unbundleFrom* functions should be used to change the instruction bundling. This API update is still a work in progress. I am going to update APIs first so they maintain bundle flags automatically when possible. Then I'll add stricter verification of the bundle flags. llvm-svn: 170384	2012-12-17 23:55:38 +00:00
Patrik Hagglund	c494d24a68	Revert/correct some FastISel changes in r170104 (EVT->MVT for TargetLowering::getRegClassFor). Some isSimple() guards were missing, or getSimpleVT() were hoisted too far, resulting in asserts on valid LLVM assembly input. llvm-svn: 170336	2012-12-17 14:30:06 +00:00
Craig Topper	588ceec0f7	Add debug prints for when optimizeLoadInstr folds a load. llvm-svn: 170298	2012-12-17 03:56:00 +00:00
Dmitri Gribenko	2943ce80f3	Declare class DwarfDebug before use instead of relying on a forward declaration from some other unrelated header. Patch by Kai. llvm-svn: 170284	2012-12-16 12:57:36 +00:00
Reed Kotler	aee4d5d194	This patch is needed to make c++ exceptions work for mips16. Mips16 is really a processor decoding mode (ala thumb 1) and in the same program, mips16 and mips32 functions can exist and can call each other. If a jal type instruction encounters an address with the lower bit set, then the processor switches to mips16 mode (if it is not already in it). If the lower bit is not set, then it switches to mips32 mode. The linker knows which functions are mips16 and which are mips32. When relocation is performed on code labels, this lower order bit is set if the code label is a mips16 code label. In general this works just fine, however when creating exception handling tables and dwarf, there are cases where you don't want this lower order bit added in. This has been traditionally distinguished in gas assembly source by using a different syntax for the label. lab1: ; this will cause the lower order bit to be added lab2=. ; this will not cause the lower order bit to be added In some cases, it does not matter because in dwarf and debug tables the difference of two labels is used and in that case the lower order bits subtract each other out. To fix this, I have added to mcstreamer the notion of a debuglabel. The default is for label and debug label to be the same. So calling EmitLabel and EmitDebugLabel produce the same result. For various reasons, there is only one set of labels that needs to be modified for the mips exceptions to work. These are the "$eh_func_beginXXX" labels. Mips overrides the debug label suffix from ":" to "=." . This initial patch fixes exceptions. More changes most likely will be needed to DwarfCFException to make all of this work for actual debugging. These changes will be to emit debug labels in some places where a simple label is emitted now. Some historical discussion on this from gcc can be found at: http://gcc.gnu.org/ml/gcc-patches/2008-08/msg00623.html http://gcc.gnu.org/ml/gcc-patches/2008-11/msg01273.html llvm-svn: 170279	2012-12-16 04:00:45 +00:00
Eric Christopher	a2de826d29	To simplify some code move the unit emission into the holders. Make emitDIE public accordingly. No functional change. llvm-svn: 170258	2012-12-15 00:04:07 +00:00
Eric Christopher	16485a5164	Use begin and end label names from the section for info. llvm-svn: 170257	2012-12-15 00:04:04 +00:00
Patrik Hagglund	55d6f47a37	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. llvm-svn: 170183	2012-12-14 09:05:13 +00:00
Jakob Stoklund Olesen	7bb2f97a90	Use the new MI bundling API in MachineInstrBundle itself. The new API is higher level than just manipulating the bundle flags directly, and the setIsInsideBundle() function will disappear soon. llvm-svn: 170159	2012-12-13 23:23:46 +00:00
David Blaikie	37fefc3f8d	Debug Info: add support to mark member variables as artificial This is the LLVM portion of r170154. llvm-svn: 170156	2012-12-13 22:43:07 +00:00
Patrik Hagglund	13abe5ec3c	Change TargetLowering::setTypeAction to take an MVT, instead fo EVT. llvm-svn: 170148	2012-12-13 20:42:43 +00:00
Patrik Hagglund	05394352c0	Change TargetLowering::getRepRegClassFor to take an MVT, instead of EVT. Accordingly, change RegDefIter to contain MVTs instead of EVTs. llvm-svn: 170140	2012-12-13 18:45:35 +00:00
Patrik Hagglund	5e6c361bc0	Change TargetLowering::getRegClassFor to take an MVT, instead of EVT. Accordingly, add helper funtions getSimpleValueType (in parallel to getValueType) in SDValue, SDNode, and TargetLowering. This is the first, in a series of patches. This is the second attempt. In the first attempt (r169837), a few getSimpleVT() were hoisted too far, detected by bootstrap failures. llvm-svn: 170104	2012-12-13 06:34:11 +00:00
Eric Christopher	996b2b7ae6	Use default label name for a section in emitting abbreviation section to help prep some code to be split about. llvm-svn: 170088	2012-12-13 03:00:38 +00:00
Evan Cheng	bf0baa9de7	Fix a bug in DAGCombiner::MatchBSwapHWord. Make sure the node has operands before referencing them. rdar://12868039 llvm-svn: 170078	2012-12-13 01:34:32 +00:00
Pedro Artigas	7212ee4534	Make the MCStreamer have a reset method and call that after finalization of the asm printer, also changed MCContext to a single reset only method for simplicity as requested on the list llvm-svn: 170041	2012-12-12 22:59:46 +00:00
Evan Cheng	b7d3d03bf9	Fix a logic bug in inline expansion of memcpy / memset with an overlapping load / store pair. It's not legal to use a wider load than the size of the remaining bytes if it's the first pair of load / store. llvm-svn: 170018	2012-12-12 20:43:23 +00:00
Evan Cheng	962711ee71	Sorry about the churn. One more change to getOptimalMemOpType() hook. Did I mention the inline memcpy / memset expansion code is a mess? This patch split the ZeroOrLdSrc argument into two: IsMemset and ZeroMemset. The first indicates whether it is expanding a memset or a memcpy / memmove. The later is whether the memset is a memset of zero. It's totally possible (likely even) that targets may want to do different things for memcpy and memset of zero. llvm-svn: 169959	2012-12-12 02:34:41 +00:00
Evan Cheng	c3d1aca657	- Rename isLegalMemOpType to isSafeMemOpType. "Legal" is a very overloade term. Also added more comments to explain why it is generally ok to return true. - Rename getOptimalMemOpType argument IsZeroVal to ZeroOrLdSrc. It's meant to be true for loaded source (memcpy) or zero constants (memset). The poor name choice is probably some kind of legacy issue. llvm-svn: 169954	2012-12-12 01:32:07 +00:00
Manman Ren	82751a105c	DAGCombine: clamp hi bit in APInt::getBitsSet to avoid assertion rdar://12838504 llvm-svn: 169951	2012-12-12 01:13:50 +00:00
Evan Cheng	04e5518783	Avoid using lossy load / stores for memcpy / memset expansion. e.g. f64 load / store on non-SSE2 x86 targets. llvm-svn: 169944	2012-12-12 00:42:09 +00:00
Evan Cheng	eb54240dc2	Replace TargetLowering::isIntImmLegal() with ScalarTargetTransformInfo::getIntImmCost() instead. "Legal" is a poorly defined term for something like integer immediate materialization. It is always possible to materialize an integer immediate. Whether to use it for memcpy expansion is more a "cost" conceern. llvm-svn: 169929	2012-12-11 23:26:14 +00:00
Eric Christopher	d692c1dbb7	Update some comments. llvm-svn: 169907	2012-12-11 19:42:09 +00:00
Joel Jones	24e440d045	Add comment for load folding llvm-svn: 169880	2012-12-11 16:10:25 +00:00
Patrik Hagglund	e98b7a0389	Revert EVT->MVT changes, r169836-169851, due to buildbot failures. llvm-svn: 169854	2012-12-11 11:14:33 +00:00
Patrik Hagglund	b31465b09b	Change RegVT in BitTestBlock and RegsForValue, to contain MVTs, instead of EVTs. llvm-svn: 169851	2012-12-11 10:24:48 +00:00
Patrik Hagglund	ad432a8e70	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. Accordingly, add bitsLT (and similar) to MVT. llvm-svn: 169850	2012-12-11 10:20:51 +00:00
Patrik Hagglund	d34337495e	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 169849	2012-12-11 10:16:19 +00:00
Patrik Hagglund	03e9628cfa	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 169848	2012-12-11 10:09:23 +00:00
Patrik Hagglund	c50489e203	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 169847	2012-12-11 10:05:04 +00:00
Patrik Hagglund	8d2e7cf561	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 169845	2012-12-11 09:57:18 +00:00
Patrik Hagglund	ffb60f7c08	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 169844	2012-12-11 09:54:23 +00:00
Patrik Hagglund	a970281106	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 169843	2012-12-11 09:51:27 +00:00
Patrik Hagglund	e3bec6365a	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 169842	2012-12-11 09:48:14 +00:00
Patrik Hagglund	7ffcd226dd	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 169841	2012-12-11 09:42:24 +00:00
Patrik Hagglund	cbc9d4d0f9	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. llvm-svn: 169840	2012-12-11 09:39:09 +00:00
Patrik Hagglund	40e1afe970	Change TargetLowering::setTypeAction to take an MVT, instead fo EVT. llvm-svn: 169839	2012-12-11 09:32:56 +00:00
Patrik Hagglund	57b1694df1	Change TargetLowering::getRepRegClassFor to take an MVT, instead of EVT. Accordingly, change RegDefIter to contain MVTs instead of EVTs. llvm-svn: 169838	2012-12-11 09:31:43 +00:00
Patrik Hagglund	3708e548f8	Change TargetLowering::getRegClassFor to take an MVT, instead of EVT. Accordingly, add helper funtions getSimpleValueType (in parallel to getValueType) in SDValue, SDNode, and TargetLowering. This is the first, in a series of patches. llvm-svn: 169837	2012-12-11 09:10:33 +00:00
Chandler Carruth	b27041c50b	Fix a miscompile in the DAG combiner. Previously, we would incorrectly try to reduce the width of this load, and would end up transforming: (truncate (lshr (sextload i48 <ptr> as i64), 32) to i32) to (truncate (zextload i32 <ptr+4> as i64) to i32) We lost the sext attached to the load while building the narrower i32 load, and replaced it with a zext because lshr always zext's the results. Instead, bail out of this combine when there is a conflict between a sextload and a zext narrowing. The rest of the DAG combiner still optimize the code down to the proper single instruction: movswl 6(...),%eax Which is exactly what we wanted. Previously we read past the end and missed the sign extension: movl 6(...), %eax llvm-svn: 169802	2012-12-11 00:36:57 +00:00
Chad Rosier	df42cf39ab	Fall back to the selection dag isel to select tail calls. This shouldn't affect codegen for -O0 compiles as tail call markers are not emitted in unoptimized compiles. Testing with the external/internal nightly test suite reveals no change in compile time performance. Testing with -O1, -O2 and -O3 with fast-isel enabled did not cause any compile-time or execution-time failures. All tests were performed on my x86 machine. I'll monitor our arm testers to ensure no regressions occur there. In an upcoming clang patch I will be marking the objc_autoreleaseReturnValue and objc_retainAutoreleaseReturnValue as tail calls unconditionally. While it's theoretically true that this is just an optimization, it's an optimization that we very much want to happen even at -O0, or else ARC applications become substantially harder to debug. Part of rdar://12553082 llvm-svn: 169796	2012-12-11 00:18:02 +00:00
Eric Christopher	c8a310edc1	Refactor out the abbreviation handling into a separate class that controls each of the abbreviation sets (only a single one at the moment) and computes offsets separately as well for each set of DIEs. No real function change, ordering of abbreviations for the skeleton CU changed but only because we're computing in a separate order. Fix the testcase not to care. llvm-svn: 169793	2012-12-10 23:34:43 +00:00
Evan Cheng	79e2ca90bc	Some enhancements for memcpy / memset inline expansion. 1. Teach it to use overlapping unaligned load / store to copy / set the trailing bytes. e.g. On 86, use two pairs of movups / movaps for 17 - 31 byte copies. 2. Use f64 for memcpy / memset on targets where i64 is not legal but f64 is. e.g. x86 and ARM. 3. When memcpy from a constant string, do not replace the load with a constant if it's not possible to materialize an integer immediate with a single instruction (required a new target hook: TLI.isIntImmLegal()). 4. Use unaligned load / stores more aggressively if target hooks indicates they are "fast". 5. Update ARM target hooks to use unaligned load / stores. e.g. vld1.8 / vst1.8. Also increase the threshold to something reasonable (8 for memset, 4 pairs for memcpy). This significantly improves Dhrystone, up to 50% on ARM iOS devices. rdar://12760078 llvm-svn: 169791	2012-12-10 23:21:26 +00:00
Lang Hames	517fc8b264	Defer call to InitSections until after MCContext has been initialized. If InitSections is called before the MCContext is initialized it could cause duplicate temporary symbols to be emitted later (after context initialization resets the temporary label counter). llvm-svn: 169785	2012-12-10 22:49:11 +00:00
Eric Christopher	0aa4a670ad	Rearrange vars and make comments more obvious. llvm-svn: 169780	2012-12-10 22:25:41 +00:00
Eric Christopher	81d091eed9	Remove blank line at top of file. llvm-svn: 169779	2012-12-10 22:25:38 +00:00
Eric Christopher	200dd760fa	Fix a coding style nit. llvm-svn: 169776	2012-12-10 22:00:20 +00:00
Tom Stellard	30e2aa5015	LegalizeDAG: Allow type promotion of scalar loads llvm-svn: 169773	2012-12-10 21:41:58 +00:00
Tom Stellard	b785bd776c	LegalizeDAG: Allow type promotion for scalar stores llvm-svn: 169772	2012-12-10 21:41:54 +00:00
Eric Christopher	cdf218d606	Use the somewhat semantic term "split dwarf" it more matches what's going on and makes a lot of the terminology in comments make more sense. llvm-svn: 169758	2012-12-10 19:51:21 +00:00
Eric Christopher	8afd7b6066	Delete the FissionCU. llvm-svn: 169757	2012-12-10 19:51:18 +00:00
Eric Christopher	d79f5480ac	Reorder fission variables. llvm-svn: 169756	2012-12-10 19:51:13 +00:00
Hal Finkel	66859ae0f6	Use GetUnderlyingObjects in misched misched used GetUnderlyingObject in order to break false load/store dependencies, and the -enable-aa-sched-mi feature similarly relied on GetUnderlyingObject in order to ensure it is safe to use the aliasing analysis. Unfortunately, GetUnderlyingObject does not recurse through phi nodes, and so (especially due to LSR) all of these mechanisms failed for induction-variable-dependent loads and stores inside loops. This change replaces uses of GetUnderlyingObject with GetUnderlyingObjects (which will recurse through phi and select instructions) in misched. Andy reviewed, tested and simplified this patch; Thanks! llvm-svn: 169744	2012-12-10 18:49:16 +00:00
Craig Topper	d8005db486	Teach DAG combine to handle vector add/sub with vectors of all 0s. llvm-svn: 169727	2012-12-10 08:12:29 +00:00
Craig Topper	5ea3bdd75b	Remove extra blank line. llvm-svn: 169692	2012-12-09 08:20:52 +00:00
Craig Topper	a183ddb0fe	Teach DAG combine to handle vector logical operations with vectors of all 1s or all 0s. These cases can show up when vectors are split for legalizing. Fix some tests that were dependent on these cases not being combined. llvm-svn: 169684	2012-12-08 22:49:19 +00:00
Jakob Stoklund Olesen	fead62d4f4	Add higher-level API for dealing with bundled MachineInstrs. This is still a work in progress. The purpose is to make bundling and unbundling operations explicit, and to catch errors where bundles are broken or created inadvertently. The old IsInsideBundle flag is replaced by two MI flags: BundledPred which has the same meaning as IsInsideBundle, and BundledSucc which is set on instructions that are bundled with a successor. Having two flags provdes redundancy to detect when a bundle is inadvertently torn by a splice() or insert(), and it makes it possible to write bundle iterators that don't need to peek at adjacent instructions. The new flags can't be manipulated directly (once setIsInsideBundle is gone). Instead there are MI functions to make and break bundle bonds. The setIsInsideBundle function will be removed in a future commit. It should be replaced by bundleWithPred(). llvm-svn: 169583	2012-12-07 04:23:29 +00:00
Pedro Artigas	e84b13f039	fixed valgrind issues of prior commit, this change applies r169456 changes back to the tree with fixes. on darwin no valgrind issues exist in the tests that used to fail. original change description: change MCContext to work on the doInitialization/doFinalization model reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169553	2012-12-06 22:12:44 +00:00
Evan Cheng	9ec512d768	Replace r169459 with something safer. Rather than having computeMaskedBits to understand target implementation of any_extend / extload, just generate zero_extend in place of any_extend for liveouts when the target knows the zero_extend will be implicit (e.g. ARM ldrb / ldrh) or folded (e.g. x86 movz). rdar://12771555 llvm-svn: 169536	2012-12-06 19:13:27 +00:00
Nadav Rotem	ac450eb59e	Fix a bug in the code that merges consecutive stores. Previously we did not check if loads that happen in between stores alias with the first store in the chain, only with the second store onwards. llvm-svn: 169516	2012-12-06 17:34:13 +00:00
Bill Wendling	3495f9b6dd	s/getLowerBoundDefault/getDefaultLowerBound/ for consistency. Also put the more natural check first in the if-then statement. llvm-svn: 169486	2012-12-06 07:55:19 +00:00
Bill Wendling	28fe9e7a36	Handle non-default array bounds. Some languages, e.g. Ada and Pascal, allow you to specify that the array bounds are different from the default (1 in these cases). If we have a lower bound that's non-default, then we emit the lower bound. We also calculate the correct upper bound in those cases. llvm-svn: 169484	2012-12-06 07:38:10 +00:00
NAKAMURA Takumi	d985d76040	Revert r169456, "change MCContext to work on the doInitialization/doFinalization model" It broke many builders. llvm-svn: 169462	2012-12-06 02:00:13 +00:00
Evan Cheng	5213139f48	Let targets provide hooks that compute known zero and ones for any_extend and extload's. If they are implemented as zero-extend, or implicitly zero-extend, then this can enable more demanded bits optimizations. e.g. define void @foo(i16* %ptr, i32 %a) nounwind { entry: %tmp1 = icmp ult i32 %a, 100 br i1 %tmp1, label %bb1, label %bb2 bb1: %tmp2 = load i16* %ptr, align 2 br label %bb2 bb2: %tmp3 = phi i16 [ 0, %entry ], [ %tmp2, %bb1 ] %cmp = icmp ult i16 %tmp3, 24 br i1 %cmp, label %bb3, label %exit bb3: call void @bar() nounwind br label %exit exit: ret void } This compiles to the followings before: push {lr} mov r2, #0 cmp r1, #99 bhi LBB0_2 @ BB#1: @ %bb1 ldrh r2, [r0] LBB0_2: @ %bb2 uxth r0, r2 cmp r0, #23 bhi LBB0_4 @ BB#3: @ %bb3 bl _bar LBB0_4: @ %exit pop {lr} bx lr The uxth is not needed since ldrh implicitly zero-extend the high bits. With this change it's eliminated. rdar://12771555 llvm-svn: 169459	2012-12-06 01:28:01 +00:00
Pedro Artigas	bf7d3bab26	change MCContext to work on the doInitialization/doFinalization model reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169456	2012-12-06 00:50:55 +00:00
Andrew Trick	d3226eee03	RegPressureTracker::dump(): Remove unnecessary argument. llvm-svn: 169443	2012-12-05 23:05:22 +00:00
Andrew Trick	fda7a8832d	RegisterPressureTracker: fix findUseBetween to handle DebugValue llvm-svn: 169427	2012-12-05 21:37:50 +00:00
Andrew Trick	7bbcad7bcd	RegisterPressureTracker: unify virtual registers and physical regunits. Now that live register units are tracked individually, the code can be simplified. llvm-svn: 169426	2012-12-05 21:37:47 +00:00
Andrew Trick	7f7cee39ab	RegisterPresssureTracker: Track live physical register by unit. This is much simpler to reason about, more efficient, and fixes some corner cases involving implicit super-register defs. Fixed rdar://12797931. llvm-svn: 169425	2012-12-05 21:37:42 +00:00
Jakob Stoklund Olesen	a97cec790f	Remove unused MachineInstr constructors. A MachineInstr can only ever be constructed by CreateMachineInstr() and CloneMachineInstr(), and those factories don't use the removed constructors. llvm-svn: 169395	2012-12-05 18:27:39 +00:00
Pedro Artigas	41b98843e8	- Added calls to doInitialization/doFinalization to immutable passes - fixed ordering of calls to doFinalization to be the reverse of the pass run order due to potential dependencies - fixed machine module info to operate in the doInitialization/doFinalization model, also fixes some FIXMEs reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169391	2012-12-05 17:12:22 +00:00
Andrew Trick	d52ab339cb	Added RegisterPressureTracker::dump() for debugging. llvm-svn: 169359	2012-12-05 06:47:08 +00:00
Jakob Stoklund Olesen	3cb2cb800f	Speed up the AllocationOrder class a bit. Allow the central functions to be inlined, and use the argumentless isHint() function when possible. llvm-svn: 169319	2012-12-04 22:25:16 +00:00
David Blaikie	67cb31ebdd	Comment change made in r169304 as requested by Eric Christopher. llvm-svn: 169315	2012-12-04 22:02:33 +00:00
Bill Wendling	d7767125d5	Use the 'count' attribute to calculate the upper bound of an array. The count attribute is more accurate with regards to the size of an array. It also obviates the upper bound attribute in the subrange. We can also better handle an unbound array by setting the count to -1 instead of the lower bound to 1 and upper bound to 0. llvm-svn: 169312	2012-12-04 21:34:03 +00:00
David Blaikie	5a773bb601	Reapply r160148 (reverted in r163570) fixing spurious breakpoints in modern GDB This reapplies the fix for PR13303 now with more justification. Based on my execution of the GDB 7.5 test suite this results in: expected passes: 16101 -> 20890 (+30%) unexpected failures: 4826 -> 637 (-77%) There are 23 checks that used to pass and now fail. They are all in gdb.reverse. Investigating a few looks like they were accidentally passing due to extra breakpoints being set by this bug. They're generally due to the difference in end location between gcc and clang, the test suite is trying to set breakpoints on the closing '}' that clang doesn't associate with any instructions. llvm-svn: 169304	2012-12-04 21:05:36 +00:00
Chandler Carruth	802d755533	Sort includes for all of the .h files under the 'lib' tree. These were missed in the first pass because the script didn't yet handle include guards. Note that the script is now able to handle all of these headers without manual edits. =] llvm-svn: 169224	2012-12-04 07:12:27 +00:00
Bill Wendling	bfc0e5725f	Add a 'count' field to the DWARF subrange. The count field is necessary because there isn't a difference between the 'lo' and 'hi' attributes for a one-element array and a zero-element array. When the count is '0', we know that this is a zero-element array. When it's >=1, then it's a normal constant sized array. When it's -1, then the array is unbounded. llvm-svn: 169218	2012-12-04 06:20:49 +00:00
Jakub Staszak	ae551a853d	Simplify code. No functionality change. llvm-svn: 169198	2012-12-04 01:00:52 +00:00
Manman Ren	f563941adc	Stack Alignment: when creating stack objects in MachineFrameInfo, make sure the alignment is clamped to TargetFrameLowering.getStackAlignment if the target does not support stack realignment or the option "realign-stack" is off. This will cause miscompile if the address is treated as aligned and add is replaced with or in DAGCombine. Added a bool StackRealignable to TargetFrameLowering to check whether stack realignment is implemented for the target. Also added a bool RealignOption to MachineFrameInfo to check whether the option "realign-stack" is on. rdar://12713765 llvm-svn: 169197	2012-12-04 00:52:33 +00:00
Jakub Staszak	bac8ae6506	Use dyn_cast instead of isa and cast. No functionality change. llvm-svn: 169196	2012-12-04 00:50:06 +00:00
Jakob Stoklund Olesen	084665fa6d	Remove VirtRegMap::getRegAllocPref(). Now that there can be multiple hint registers from targets, it doesn't make sense to have a function that returns 'the' preferred register. llvm-svn: 169190	2012-12-04 00:35:59 +00:00
Jakob Stoklund Olesen	1dd82dd3fc	Use MRI::getSimpleHint() instead of getRegAllocPref() in remaining cases. Targets can provide multiple hints now, so getRegAllocPref() doesn't make sense any longer because it only returns one preferred register. Replace it with getSimpleHint() in the remaining heuristics. This function only llvm-svn: 169188	2012-12-04 00:30:22 +00:00
Manman Ren	26c73f93e0	Stack Alignment: move functions from header file MachineFrameInfo.h. No functional change for this commit. The follow-up patch will add more stuff to these functions. rdar://12713765 llvm-svn: 169186	2012-12-04 00:26:44 +00:00
Jakob Stoklund Olesen	74052b041b	Add VirtRegMap::hasKnownPreference(). Virtual registers with a known preferred register are prioritized by RAGreedy. This function makes the condition explicit without depending on getRegAllocPref(). llvm-svn: 169179	2012-12-03 23:23:50 +00:00
Jakob Stoklund Olesen	c784a1f906	Use the new getRegAllocationHints() hook from AllocationOrder. This simplifies the hinting code quite a bit while making the targets easier to write at the same time. llvm-svn: 169173	2012-12-03 22:51:04 +00:00
Pedro Artigas	e4348b0412	moves doInitialization and doFinalization to the Pass class and removes some unreachable code in MachineModuleInfo reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169164	2012-12-03 21:56:57 +00:00
Jakob Stoklund Olesen	499cac486a	Add a new hook for providing register allocator hints more flexibly. The TargetRegisterInfo::getRegAllocationHints() function is going to replace the existing mechanisms for providing target-dependent hints to the register allocator: ResolveRegAllocHint() and getRawAllocationOrder(). The new hook is more flexible because it allows the target to provide multiple preferred candidate registers for each virtual register, and it is easier to use because targets are not required to return a reference to a constant array like getRawAllocationOrder(). An optional VirtRegMap argument can be used to provide target-dependent hints that depend on the provisional assignments of other virtual registers. llvm-svn: 169154	2012-12-03 21:17:00 +00:00
Eli Bendersky	b42d1466a0	Fix PR12942: Allow two CUs to be generated from the same source file. Thanks Eric for the review. llvm-svn: 169142	2012-12-03 18:45:45 +00:00
Chandler Carruth	ed0881b2a6	Use the new script to sort the includes of every file under lib. Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to a header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131	2012-12-03 16:50:05 +00:00
Nadav Rotem	1157e1410c	Allow merging multiple store sequences on the same chain. llvm-svn: 169111	2012-12-02 17:14:09 +00:00
Andrew Trick	b767d1eba8	misched: Fix RegisterPressureTracker handling of DebugVals. Assertion failed: (TopRPTracker.getPos() == RegionBegin && "bad initial Top tracker"). rdar://12790302. llvm-svn: 169072	2012-12-01 01:22:49 +00:00
Andrew Trick	d5953622ce	misched: Fix the DAG builder to handle an undef operand at ExitSU. Assertion failed: (VNI && "No value to read by operand") rdar://12790267. llvm-svn: 169071	2012-12-01 01:22:44 +00:00
Andrew Trick	a01302182c	misched: Fix LiveInterval update to better handle DebugVal. Assertion failed: (itr != mi2iMap.end() && "Instruction not found in maps.") rdar://12777252. llvm-svn: 169070	2012-12-01 01:22:41 +00:00
Andrew Trick	e7ea8aa48a	misched: fix RegionBegin when DebugValues get shuffled to the top. assert (RemainingInstrs == 0 && "Instruction count mismatch!") rdar://12776937. llvm-svn: 169069	2012-12-01 01:22:38 +00:00
Jakob Stoklund Olesen	da2b6b381a	Simplify REG_SEQUENCE lowering. The TwoAddressInstructionPass takes the machine code out of SSA form by expanding REG_SEQUENCE instructions into copies. It is no longer necessary to rewrite the registers used by a REG_SEQUENCE instruction because the new coalescer algorithm can do it now. REG_SEQUENCE is just converted to a sequence of sub-register copies now. llvm-svn: 169067	2012-12-01 01:06:44 +00:00
Eric Christopher	9c2ecd93d0	Add some first skeleton work for the DWARF5 Fission proposal. Emit part of the compile unit CU and start separating out information into the various sections that will be pulled out later. WIP. llvm-svn: 169061	2012-11-30 23:59:06 +00:00
Jakob Stoklund Olesen	bb1e98318f	Convert COPY instructions into KILLs if they have implicit defs. MachineCopyPropagation doesn't understand super-register liveness well enough to be able to remove implicit defs of super-registers. This fixes a problem in ARM/2012-01-26-CopyPropKills.ll that is exposed by an future TwoAddressInstructionPass change. The KILL instructions are removed before the machine code is emitted. llvm-svn: 169060	2012-11-30 23:53:00 +00:00
Bill Wendling	c786b31233	Replace r168930 with a more reasonable patch. The original patch removed a bunch of code that the SjLjEHPrepare pass placed into the entry block if all of the landing pads were removed during the CodeGenPrepare class. The more natural way of doing things is to run the CGP before we run the SjLjEHPrepare pass. Make it so! llvm-svn: 169044	2012-11-30 22:08:55 +00:00
Eric Christopher	42e3994e77	More comment. llvm-svn: 168952	2012-11-29 22:56:13 +00:00
Justin Holewinski	edec332437	Cleanup recent addition of DAGTypeLegalizer::SplitVecOp_VSELECT llvm-svn: 168932	2012-11-29 19:42:09 +00:00
Benjamin Kramer	aa598b3be6	misched: Recompute priority queue when DFSResults are updated. This was found by MSVC10's STL debug mode on a test from the test suite. Sadly std::is_heap isn't standard so there is no way to assert this without writing our own heap verify, which looks like overkill to me. llvm-svn: 168885	2012-11-29 14:36:26 +00:00
Justin Holewinski	0ac49bf846	Teach the legalizer how to handle operands for VSELECT nodes If we need to split the operand of a VSELECT, it must be the mask operand. We split the entire VSELECT operand with EXTRACT_SUBVECTOR. llvm-svn: 168883	2012-11-29 14:26:28 +00:00
Justin Holewinski	bc45119b44	Allow targets to prefer TypeSplitVector over TypePromoteInteger when computing the legalization method for vectors For some targets, it is desirable to prefer scalarizing <N x i1> instead of promoting to a larger legal type, such as <N x i32>. llvm-svn: 168882	2012-11-29 14:26:24 +00:00
Jakob Stoklund Olesen	bdb55e0c59	Use MCPhysReg for RegisterClassInfo allocation orders. This saves a bit of memory. llvm-svn: 168852	2012-11-29 03:34:17 +00:00
Jakob Stoklund Olesen	546e9e85f1	Avoid rewriting instructions twice. This could cause miscompilations in targets where sub-register composition is not always idempotent (ARM). <rdar://problem/12758887> llvm-svn: 168837	2012-11-29 00:26:11 +00:00
Nadav Rotem	307d767177	When combining consecutive stores allow loads in between the stores, if the loads do not alias. llvm-svn: 168832	2012-11-29 00:00:08 +00:00
Jakob Stoklund Olesen	26c9d70d28	Make the LiveRegMatrix analysis available to targets. No functional change, just moved header files. Targets can inject custom passes between register allocation and rewriting. This makes it possible to tweak the register allocation before rewriting, using the full global interference checking available from LiveRegMatrix. llvm-svn: 168806	2012-11-28 19:13:06 +00:00
Andrew Trick	48d392e81e	misched: Analysis that partitions the DAG into subtrees. This is a simple, cheap infrastructure for analyzing the shape of a DAG. It recognizes uniform DAGs that take the shape of bottom-up subtrees, such as the included matrix multiplication example. This is useful for heuristics that balance register pressure with ILP. Two canonical expressions of the heuristic are implemented in scheduling modes: -misched-ilpmin and -misched-ilpmax. llvm-svn: 168773	2012-11-28 05:13:28 +00:00
Andrew Trick	cd1c2f9fb1	misched: rename ScheduleDAGILP to ScheduleDFS to prepare for other heuristics. llvm-svn: 168772	2012-11-28 05:13:24 +00:00
Andrew Trick	0be19363d1	misched: better alias analysis. This fixes a hole in the "cheap" alias analysis logic implemented within the DAG builder itself, regardless of whether proper alias analysis is enabled. It now handles this pattern produced by LSR+CodeGenPrepare. %sunkaddr1 = ptrtoint * %obj to i64 %sunkaddr2 = add i64 %sunkaddr1, %lsr.iv %sunkaddr3 = inttoptr i64 %sunkaddr2 to i32* store i32 %v, i32* %sunkaddr3 llvm-svn: 168768	2012-11-28 03:42:49 +00:00
Andrew Trick	cf7e6971e8	misched: Debug output fix. Use an always valid iterator. llvm-svn: 168767	2012-11-28 03:42:47 +00:00
Jakob Stoklund Olesen	c351aed4b1	Move the guts of TargetInstrInfoImpl into the TargetInstrInfo class. The *Impl class no longer serves a purpose now that the super-class implementation is in CodeGen. llvm-svn: 168759	2012-11-28 02:35:13 +00:00
Jakob Stoklund Olesen	fcf14e8436	Move Target{Instr,Register}Info.cpp into lib/CodeGen. The Target library is not allowed to depend on the large CodeGen library, but the TRI and TII classes provide abstract interfaces that require both caller and callee to link to CodeGen. The implementation files for these classes provide default implementations of some of the hooks. These methods may need to reference CodeGen, so they belong in that library. We already have a number of methods implemented in the TargetInstrInfoImpl sub-class because of that. I will merge that class into the parent next. llvm-svn: 168758	2012-11-28 02:35:09 +00:00
Chad Rosier	ed119d542b	Revert r168630, r168631, and r168633 as these are causing nightly test failures. llvm-svn: 168751	2012-11-28 00:21:29 +00:00
Eric Christopher	acdcbdb17d	Attempt to make the comments for dwarf debug look more like the coding standard would like. llvm-svn: 168737	2012-11-27 22:43:45 +00:00
Eric Christopher	95198f5035	Reapply section moving, make sure string section is output last. llvm-svn: 168736	2012-11-27 22:43:42 +00:00
Manman Ren	f89406ac78	CSE: allow PerformTrivialCoalescing to check copies across basic block boundaries. Given the following case: BB0 %vreg1<def> = SUBrr %vreg0, %vreg7 %vreg2<def> = COPY %vreg7 BB1 %vreg10<def> = SUBrr %vreg0, %vreg2 We should be able to CSE between SUBrr in BB0 and SUBrr in BB1. rdar://12462006 llvm-svn: 168717	2012-11-27 18:58:41 +00:00
Jakub Staszak	38e2f52e85	Remove duplicated #includes. llvm-svn: 168712	2012-11-27 18:27:14 +00:00
Ulrich Weigand	e5f9405842	Never use .lcomm on platforms where it does not accept an alignment argument. Instead, use a pair of .local and .comm directives. This avoids spurious differences between binaries built by the integrated assembler vs. those built by the external assembler, since the external assembler may impose alignment requirements on .lcomm symbols where the integrated assembler does not. llvm-svn: 168704	2012-11-27 16:11:16 +00:00
Eric Christopher	6e20a16829	Revert rearrangement of debug info sections to unblock the bots and O0 + debug codegen. llvm-svn: 168680	2012-11-27 06:49:23 +00:00
Jakub Staszak	8262b885da	Remove unneeded #include. llvm-svn: 168670	2012-11-27 02:00:27 +00:00
Jakub Staszak	508888e446	Remove unneeded #include. llvm-svn: 168664	2012-11-27 01:22:15 +00:00
NAKAMURA Takumi	2e4a30709d	llvm/CodeGen: Remove empty files in r168659. llvm-svn: 168663	2012-11-27 01:21:50 +00:00
Jakub Staszak	08a28d248f	Remove unused forward declaration. llvm-svn: 168660	2012-11-27 01:16:37 +00:00
Jakub Staszak	0820b2a360	Remove unused MachineLoopRanges analysis. llvm-svn: 168659	2012-11-27 01:14:34 +00:00
Eric Christopher	69e328e5bd	Make comment names match function names. llvm-svn: 168644	2012-11-27 00:41:57 +00:00
Eric Christopher	4c9b119d64	Add in sections for the fission case (no change so incorrect) and add a TODO for starting. llvm-svn: 168643	2012-11-27 00:41:54 +00:00
Eric Christopher	c800b12bae	Reorder section output ordering. llvm-svn: 168638	2012-11-27 00:13:58 +00:00
Eric Christopher	735401cf29	Whitespace cleanup. llvm-svn: 168637	2012-11-27 00:13:51 +00:00
Chad Rosier	110b73e0e5	Add an assertion to ensure freezeReservedRegs() is only ever called once. llvm-svn: 168633	2012-11-26 23:37:07 +00:00
Chad Rosier	f8a3a62cdb	Now that the X86 Maximal Stack Alignment Check pass has been removed (i.e., r168627), we no longer need to call the freezeReservedRegs() function a second time. Previously, this pass was conservatively adding the FP to the set of reserved registers, requiring the second update to the reserved registers. rdar://12719844 llvm-svn: 168631	2012-11-26 23:25:41 +00:00
Chad Rosier	a44e1825a3	Now that the X86 Maximal Stack Alignment Check pass has been removed (i.e., r168627), we no longer need to call the freezeReservedRegs() function a second time. Previously, this pass was conservatively adding the FP to the set of reserved registers, requiring the second update to the reserved registers. rdar://12719844 llvm-svn: 168630	2012-11-26 23:14:37 +00:00
Jakub Staszak	f18753b8d0	Don't use iterator after being erased. llvm-svn: 168622	2012-11-26 22:14:19 +00:00
Jakub Staszak	e25344225d	Remove unneeded #includes. llvm-svn: 168608	2012-11-26 21:04:19 +00:00
Craig Topper	79bd205d8c	Refactor to make helper method static. llvm-svn: 168557	2012-11-25 08:08:58 +00:00
Craig Topper	268b62288e	Remove duplicate check of LimitFloatPrecision. It was already checked earlier before IsExp10 could be set to true. llvm-svn: 168553	2012-11-25 00:48:58 +00:00
Craig Topper	8571944cf1	Factor common code out of individual if blocks into common tail. llvm-svn: 168551	2012-11-25 00:15:07 +00:00
Craig Topper	d374694b07	Remove redundant calls to getCurDebugLoc in visitIntrinsicCall. It's already called at the start of the function and captured in a local variable. llvm-svn: 168548	2012-11-24 23:05:23 +00:00
Craig Topper	d2638c1894	Refactor a bit to make some helper methods static. llvm-svn: 168546	2012-11-24 18:52:06 +00:00
Craig Topper	4a98175800	Factor some common code out of individual if blocks. llvm-svn: 168538	2012-11-24 08:22:37 +00:00
Craig Topper	bef254ab16	Refactor a bit to make some helper functions static. llvm-svn: 168524	2012-11-23 18:38:31 +00:00
Patrik Hägglund	f77cc055cd	Cleanup: Simplify loop end logic in computeRegisterProperties(). llvm-svn: 168507	2012-11-23 08:35:04 +00:00
Eli Bendersky	26e7efeb1a	Fix 80-col violation llvm-svn: 168498	2012-11-22 14:10:40 +00:00
Lang Hames	e9541c820a	llvm.fmuladd.* lowering should be checking isOperationLegalOrCustom, rather than isOperationLegal. Thanks to Craig Topper for pointing this out. llvm-svn: 168485	2012-11-22 03:31:45 +00:00
Eric Christopher	960ac37832	Pull some code out into functions to make rearranging them a bit easier. llvm-svn: 168481	2012-11-22 00:59:49 +00:00
Eric Christopher	92331fde8c	Whitespace. llvm-svn: 168402	2012-11-21 00:34:38 +00:00
Eric Christopher	7b30f2e43b	Update for some of the coding standard before rearranging functions around. llvm-svn: 168401	2012-11-21 00:34:35 +00:00
Eric Christopher	5d1cf930df	Update some comments. llvm-svn: 168400	2012-11-21 00:17:49 +00:00
Eric Christopher	55c5181525	Update and add some comments. llvm-svn: 168399	2012-11-21 00:03:31 +00:00
Eric Christopher	27527b2b92	Whitespace. llvm-svn: 168398	2012-11-21 00:03:28 +00:00
Eric Christopher	383719592a	Remove constness from this, it modifies the output stream as does everything else underneath. llvm-svn: 168395	2012-11-20 23:30:11 +00:00
Eric Christopher	1f0cbb826f	Remove unused function argument, add a bit to the comment. llvm-svn: 168387	2012-11-20 22:14:13 +00:00
Eric Christopher	1d6bd41ee6	Formatting. llvm-svn: 168384	2012-11-20 20:34:47 +00:00
Eric Christopher	7c718e41c7	Whitespace. llvm-svn: 168383	2012-11-20 20:34:44 +00:00
Tim Northover	dd219d06c2	Fix physical register liveness calculations: + Take account of clobbers + Give outputs priority over inputs since they happen later. llvm-svn: 168360	2012-11-20 09:56:11 +00:00
Eric Christopher	58f4195942	Remove a function argument and propagate const around accordingly. llvm-svn: 168338	2012-11-19 22:42:15 +00:00
Eric Christopher	6a8413853f	Whitespace and 80-col. llvm-svn: 168337	2012-11-19 22:42:10 +00:00
Anton Korobeynikov	097b0e9d6a	Make AsmPrinter::EmitTTypeReference() more robust - put the zero GV check inside, so we won't forget it at the caller side. llvm-svn: 168328	2012-11-19 21:17:20 +00:00
Anton Korobeynikov	f65a638d94	Factor out type info emission into separate routine. It turned out that ARM wants different layout of type infos. This is yet another patch in attempt to fix PR7187 llvm-svn: 168325	2012-11-19 21:06:26 +00:00
Eric Christopher	cebb0ec764	Move section label emission to module end. Nothing should be depending on them being emitted before the text and/or data sections and testing didn't uncover any. llvm-svn: 168321	2012-11-19 19:43:59 +00:00
Jakob Stoklund Olesen	31ebe55808	Handle mixed normal and early-clobber defs on inline asm. PR14376. llvm-svn: 168320	2012-11-19 19:31:10 +00:00
Craig Topper	36f29122ef	Move else onto line with preceding closing brace. llvm-svn: 168294	2012-11-19 00:11:50 +00:00
Andrew Trick	28c000b234	Broaden isSchedulingBoundary to check aliases of SP. On PPC the stack pointer is X1, but ADJCALLSTACK writes R1. Fixes PR14315: Register regmask dependency problem with misched. llvm-svn: 168248	2012-11-17 03:35:11 +00:00
Eli Friedman	30834940ec	Mark FP_EXTEND form v2f32 to v2f64 as "expand" for ARM NEON. Patch by Pete Couperus. llvm-svn: 168240	2012-11-17 01:52:46 +00:00
Andrew Trick	9d0a1ae946	Use array_pod_sort instead of std::sort. llvm-svn: 168203	2012-11-16 21:33:38 +00:00
Craig Topper	ed756c5fc8	Remove conditions from 'else if' that were guaranteed by preceding 'if'. llvm-svn: 168191	2012-11-16 20:01:39 +00:00
Craig Topper	3669de4c97	Factor out the final FADD that's common to multiple code paths in the visitLog* functions. llvm-svn: 168183	2012-11-16 19:08:44 +00:00
Craig Topper	ae89426f07	Factor some common code to reduce compile size. llvm-svn: 168143	2012-11-16 07:48:23 +00:00
Eli Friedman	e6385e61b5	Mark FP_ROUND for converting NEON v2f64 to v2f32 as expand. Add a missing case to vector legalization so this actually works. Patch by Pete Couperus. Fixes PR12540. llvm-svn: 168107	2012-11-15 22:44:27 +00:00
Ulrich Weigand	dcee8ce8ed	Use std::stable_sort instead of std::sort when sorting stack slots to guarantee deterministic code generation. llvm-svn: 168074	2012-11-15 19:33:30 +00:00
Chad Rosier	2463f67c49	[reg scavenger] Fix the isUsed/isAliasUsed functions so as to not report a false positive. In this particular case, R6 was being spilled by the register scavenger when it was in fact dead. The isUsed function reported R6 as used because the R6_R7 alias was reserved (due to the fact that we've reserved R7 as the FP). The solution is to only check if the original register (i.e., R6) isReserved and not the aliases. The aliases are only checked to make sure they're available. The test case is derived from one of the nightly tester benchmarks and is rather intractable and difficult to reproduce, so I haven't included it. rdar://12592448 llvm-svn: 168054	2012-11-15 18:13:20 +00:00
Sergei Larin	e822148c80	Fix indeterminism in MI scheduler DAG construction. Similarly to several recent fixes throughout the code replace std::map use with the MapVector. Add find() method to the MapVector. llvm-svn: 168051	2012-11-15 17:45:50 +00:00
Craig Topper	61d045781a	Add llvm.ceil, llvm.trunc, llvm.rint, llvm.nearbyint intrinsics. llvm-svn: 168025	2012-11-15 06:51:10 +00:00
Andrew Trick	449eb3f3be	Fix an obvious merge bug in -join-globalcopies (disabled). Jakub Staszak spotted this in review. I don't notice these things until I manually rerun benchmarks. But reducing unit tests is a very high priority. llvm-svn: 168021	2012-11-15 02:32:22 +00:00
Jakub Staszak	ab0139cb90	Use reserve() to avoid vector reallocation. llvm-svn: 167991	2012-11-14 22:42:17 +00:00
Jakub Staszak	542db4a0bc	canJoinPhys method doesn't modify CoalescerPair. Make it const. llvm-svn: 167972	2012-11-14 20:31:04 +00:00
Chad Rosier	e18e4add6c	Remove dead code. llvm-svn: 167970	2012-11-14 20:25:37 +00:00
Anton Korobeynikov	b619a4138d	Fix really stupid ARM EHABI info generation bug: we should not emit eh table and handler data if there are no landing pads in the function. Patch by Logan Chien with some cleanups from me. llvm-svn: 167945	2012-11-14 19:13:30 +00:00
Craig Topper	04a5cc39f4	Add newlines to end of debug messages. llvm-svn: 167913	2012-11-14 05:20:09 +00:00
Rafael Espindola	c79532d101	Handle DAG CSE adding new uses during ReplaceAllUsesWith. Fixes PR14333. llvm-svn: 167912	2012-11-14 05:08:56 +00:00
Anton Korobeynikov	e42af3699b	Use TARGET2 relocation for TType references on ARM. Do some cleanup of the code while here. Inspired by patch by Logan Chien! llvm-svn: 167904	2012-11-14 01:47:00 +00:00
Eric Christopher	0f23b82147	Revert "Use the 'count' attribute instead of the 'upper_bound' attribute." temporarily as it is breaking the gdb bots. This reverts commit r167806/e7ff4c14b157746b3e0228d2dce9f70712d1c126. llvm-svn: 167886	2012-11-13 23:30:43 +00:00
Andrew Trick	459d891a43	Revert -join-splitedges to a boolean cmd line option. llvm-svn: 167880	2012-11-13 22:19:48 +00:00
Andrew Trick	47d58ce0df	The MachineScheduler does not currently require JoinSplitEdges. This option will eventually either be enabled unconditionally or replaced by a more general live range splitting optimization. llvm-svn: 167879	2012-11-13 22:15:40 +00:00
Michael J. Spencer	f1aef758a7	[MC][COFF] Emit weak symbols to the correct section. Patch by Dmitry Puzirev! llvm-svn: 167877	2012-11-13 22:04:09 +00:00
Ulrich Weigand	3946877f88	Do not consider a machine instruction that uses and defines the same physical register as candidate for common subexpression elimination in MachineCSE. This fixes a bug on PowerPC in MultiSource/Applications/oggenc/oggenc caused by MachineCSE invalidly merging two separate DYNALLOC insns. llvm-svn: 167855	2012-11-13 18:40:58 +00:00
Andrew Trick	449c7ad7d7	Fix -join-splitedges: my previous "cleanup" broke it. Working on reducing unit tests. This won't be enabled unless a subtarget enables misched. llvm-svn: 167851	2012-11-13 17:37:46 +00:00
Duncan Sands	b8d3caf65a	Codegen support for arbitrary vector getelementptrs. llvm-svn: 167830	2012-11-13 13:01:58 +00:00
Andrew Trick	108c88c5b7	misched: Allow subtargets to enable misched and dependent options. This allows me to begin enabling (or backing out) misched by default for one subtarget at a time. To run misched we typically want to: - Disable SelectionDAG scheduling (use the source order scheduler) - Enable more aggressive coalescing (until we decide to always run the coalescer this way) - Enable MachineScheduler pass itself. Disabling PostRA sched may follow for some subtargets. llvm-svn: 167826	2012-11-13 08:47:29 +00:00
Andrew Trick	40534fe9a5	Added RegisterCoalescer support for joining global copies first. This adds the -join-globalcopies option which can be enabled by default once misched is also enabled. Ideally, the register coalescer would be able to split local live ranges in a way that produces copies that can be easily resolved by the scheduler. Until then, this heuristic should be good enough to at least allow the scheduler to run after coalescing. llvm-svn: 167825	2012-11-13 08:47:25 +00:00
Andrew Trick	4b1f9e3bac	misched: Don't consider artificial edges weak edges. For now be more conservative in case other out-of-tree schedulers rely on the old behavior of artificial edges. llvm-svn: 167808	2012-11-13 02:35:06 +00:00
Bill Wendling	f454dfb6b5	Use the 'count' attribute instead of the 'upper_bound' attribute. If we have a type 'int a[1]' and a type 'int b[0]', the generated DWARF is the same for both of them because we use the 'upper_bound' attribute. Instead use the 'count' attrbute, which gives the correct number of elements in the array. <rdar://problem/12566646> llvm-svn: 167806	2012-11-13 02:31:47 +00:00
Andrew Trick	edac22a9f3	Cleanup the main RegisterCoalescer loop. Block priorities still apply outside loops. llvm-svn: 167793	2012-11-13 00:34:44 +00:00
Andrew Trick	c25d3fe71e	Cleanup -join-splitedges. Make the loop more obvious. llvm-svn: 167785	2012-11-12 23:59:48 +00:00
Eric Christopher	2942431175	Add an option to enable prototype "fission" capabilities and debug changes. llvm-svn: 167765	2012-11-12 22:22:20 +00:00
Andrew Trick	22d688a29c	Added a temporary option to avoid critical edges splitting. This teaches the register coalescer to be less prone to split critical edges. I am currently benchmarking this with the new (post-coalescer) scheduler. I plan to enable this by default and remove the option as soon as misched is enabled. llvm-svn: 167758	2012-11-12 21:42:40 +00:00
Andrew Trick	ec369d5316	misched: rename interfaceto avoid gcc warnings llvm-svn: 167753	2012-11-12 21:28:10 +00:00
Andrew Trick	263280248a	misched: Target-independent support for MacroFusion. Uses the infrastructure from r167742 to support clustering instructure that the target processor can "fuse". e.g. cmp+jmp. Next step: target hook implementations with test cases, and enable. llvm-svn: 167744	2012-11-12 19:52:20 +00:00
Andrew Trick	a7714a0ff9	misched: Target-independent support for load/store clustering. This infrastructure is generally useful for any target that wants to strongly prefer two instructions to be adjacent after scheduling. A following checkin will add target-specific hooks with unit tests. Then this feature will be enabled by default with misched. llvm-svn: 167742	2012-11-12 19:40:10 +00:00
Andrew Trick	f1ff84c64e	misched: Infrastructure for weak DAG edges. This adds support for weak DAG edges to the general scheduling infrastructure in preparation for MachineScheduler support for heuristics based on weak edges. llvm-svn: 167738	2012-11-12 19:28:57 +00:00
Jakob Stoklund Olesen	13d5562963	Fix assertions in updateRegMaskSlots(). The RegMaskSlots contains 'r' slots while NewIdx and OldIdx are 'B' slots. This broke the checks in the assertions. This fixes PR14302. llvm-svn: 167625	2012-11-09 19:18:49 +00:00
Benjamin Kramer	c280f41864	Silence GCC warning about falling off the end of a non-void function. llvm-svn: 167618	2012-11-09 15:45:22 +00:00
Andrew Trick	3ca33acb95	misched: Heuristics based on the machine model. misched is disabled by default. With -enable-misched, these heuristics balance the schedule to simultaneously avoid saturating processor resources, expose ILP, and minimize register pressure. I've been analyzing the performance of these heuristics on everything in the llvm test suite in addition to a few other benchmarks. I would like each heuristic check to be verified by a unit test, but I'm still trying to figure out the best way to do that. The heuristics are still in considerable flux, but as they are refined we should be rigorous about unit testing the improvements. llvm-svn: 167527	2012-11-07 07:05:09 +00:00
Andrew Trick	e145559b70	misched: handle on-the-fly regpressure queries better for 2-addr instructions without relying on liveintervals. llvm-svn: 167526	2012-11-07 07:05:05 +00:00
Bill Wendling	f720bf64d4	Add comment describing what's going on here. llvm-svn: 167525	2012-11-07 05:19:04 +00:00
Bill Wendling	d9bb9b611b	When we're updating the subprogram scope DIE, we want to determine if we're updating an abstract DIE or not. If we are, then we use that. Its children will be added on later, as well as the object pointer attribute. Otherwise, this function may be called with a concrete DIE twice and adding the children and object pointer attribute to it twice. <rdar://problem/12401423&12600340> llvm-svn: 167524	2012-11-07 04:42:18 +00:00
Chad Rosier	8d2c229006	[regallocfast] Make sure the MachineRegisterInfo is aware of clobbers from a register masks. This is an obvious and necessary fix for a soon to be committed patch. No test case possible at this time. Reviewed by Jakob. llvm-svn: 167498	2012-11-06 22:52:42 +00:00
Andrew Trick	e96390ea96	misched: TargetSchedule interface for machine resources. Expose the processor resources defined by the machine model to the scheduler and other clients through the TargetSchedule interface. Normalize each resource count with respect to other kinds of resources. This allows scheduling heuristics to balance resources against other kinds of resources and latency. llvm-svn: 167444	2012-11-06 07:10:38 +00:00
Andrew Trick	4d1fa712ac	misched: Rename RemainingCount to avoid confusion with remaining resources. llvm-svn: 167443	2012-11-06 07:10:34 +00:00
Andrew Trick	baeaabb2d0	ScheduleDAG interface. Added OrderKind to distinguish nonregister dependencies. This is in preparation for adding "weak" DAG edges, but generally simplifies the design. llvm-svn: 167435	2012-11-06 03:13:46 +00:00
Owen Anderson	15fd6ac4ba	Be careful not to optimize a SELECT_CC into a SETCC post-legalization if the SETCC node would be illegal. llvm-svn: 167344	2012-11-03 00:17:26 +00:00
Manman Ren	3d5af279b1	OutputArg: added an index of the original argument to match the change to InputArg in r165616. This will enable us to get the actual type for both InputArg and OutputArg. rdar://9932559 llvm-svn: 167265	2012-11-01 23:49:58 +00:00
Chandler Carruth	5da3f0512e	Revert the majority of the next patch in the address space series: r165941: Resubmit the changes to llvm core to update the functions to support different pointer sizes on a per address space basis. Despite this commit log, this change primarily changed stuff outside of VMCore, and those changes do not carry any tests for correctness (or even plausibility), and we have consistently found questionable or flat out incorrect cases in these changes. Most of them are probably correct, but we need to devise a system that makes it more clear when we have handled the address space concerns correctly, and ideally each pass that gets updated would receive an accompanying test case that exercises that pass specificaly w.r.t. alternate address spaces. However, from this commit, I have retained the new C API entry points. Those were an orthogonal change that probably should have been split apart, but they seem entirely good. In several places the changes were very obvious cleanups with no actual multiple address space code added; these I have not reverted when I spotted them. In a few other places there were merge conflicts due to a cleaner solution being implemented later, often not using address spaces at all. In those cases, I've preserved the new code which isn't address space dependent. This is part of my ongoing effort to clean out the partial address space code which carries high risk and low test coverage, and not likely to be finished before the 3.2 release looms closer. Duncan and I would both like to see the above issues addressed before we return to these changes. llvm-svn: 167222	2012-11-01 09:14:31 +00:00
Chandler Carruth	7ec5085e01	Revert the series of commits starting with r166578 which introduced the getIntPtrType support for multiple address spaces via a pointer type, and also introduced a crasher bug in the constant folder reported in PR14233. These commits also contained several problems that should really be addressed before they are re-committed. I have avoided reverting various cleanups to the DataLayout APIs that are reasonable to have moving forward in order to reduce the amount of churn, and minimize the number of commits that were reverted. I've also manually updated merge conflicts and manually arranged for the getIntPtrType function to stay in DataLayout and to be defined in a plausible way after this revert. Thanks to Duncan for working through this exact strategy with me, and Nick Lewycky for tracking down the really annoying crasher this triggered. (Test case to follow in its own commit.) After discussing with Duncan extensively, and based on a note from Micah, I'm going to continue to back out some more of the more problematic patches in this series in order to ensure we go into the LLVM 3.2 branch with a reasonable story here. I'll send a note to llvmdev explaining what's going on and why. Summary of reverted revisions: r166634: Fix a compiler warning with an unused variable. r166607: Add some cleanup to the DataLayout changes requested by Chandler. r166596: Revert "Back out r166591, not sure why this made it through since I cancelled the command. Bleh, sorry about this! r166591: Delete a directory that wasn't supposed to be checked in yet. r166578: Add in support for getIntPtrType to get the pointer type based on the address space. llvm-svn: 167221	2012-11-01 08:07:29 +00:00
Owen Anderson	b351c8d692	Add a few more simple fast-math constant propagations and cancellations. llvm-svn: 167200	2012-11-01 02:00:53 +00:00
Jakob Stoklund Olesen	9892a4b794	Exploit the new identity composition in composeSubRegIndices(). The static compose() function in RegisterCoalescer was doing the exact same thing. llvm-svn: 167198	2012-11-01 01:15:43 +00:00
Benjamin Kramer	1559127f6f	Replace some instances of UniqueVector with SetVector, which is slightly cheaper. No functionality change. llvm-svn: 167116	2012-10-31 13:45:49 +00:00
Akira Hatanaka	d837be780d	Change signature of function RAFast::spillAll to avoid conversion between type MachineInstr* and MachineBasicBlock::iterator. llvm-svn: 167088	2012-10-31 00:56:01 +00:00
Akira Hatanaka	ebb31e9c42	Check that iterator I is not the end iterator. llvm-svn: 167086	2012-10-31 00:50:52 +00:00
Chad Rosier	909f6a035f	[inline asm] Get the mayLoad/mayStore directly from the MIOp_ExtraInfo operand. llvm-svn: 167050	2012-10-30 20:39:19 +00:00
Chad Rosier	86f6050c54	Add a comment for r167040. llvm-svn: 167046	2012-10-30 20:01:12 +00:00
Chad Rosier	9e1274fb48	[inline asm] Implement mayLoad and mayStore for inline assembly. In general, the MachineInstr MayLoad/MayLoad flags are based on the tablegen implementation. For inline assembly, however, we need to compute these based on the constraints. Revert r166929 as this is no longer needed, but leave the test case in place. rdar://12033048 and PR13504 llvm-svn: 167040	2012-10-30 19:11:54 +00:00
Bill Wendling	10e0e2ec49	Fix grammar. llvm-svn: 167029	2012-10-30 17:51:02 +00:00
Ulrich Weigand	3abb34389d	In various places throughout the code generator, there were special checks to avoid performing compile-time arithmetic on PPCDoubleDouble. Now that APFloat supports arithmetic on PPCDoubleDouble, those checks are no longer needed, and we can treat the type like any other. llvm-svn: 166958	2012-10-29 18:35:49 +00:00
Jakob Stoklund Olesen	9a06696a77	Completely disallow partial copies in adjustCopiesBackFrom(). Partial copies can show up even when CoalescerPair.isPartial() returns false. For example: %vreg24:dsub_0<def> = COPY %vreg31:dsub_0; QPR:%vreg24,%vreg31 Such a partial-partial copy is not good enough for the transformation adjustCopiesBackFrom() needs to do. llvm-svn: 166944	2012-10-29 17:51:52 +00:00
Duncan Sands	5bdd9dda48	Remove a wrapper around getIntPtrType added to GVN by Hal in commit 166624 (the wrapper returns a vector of integers when passed a vector of pointers) by having getIntPtrType itself return a vector of integers in this case. Outside of this wrapper, I didn't find anywhere in the codebase that was relying on the old behaviour for vectors of pointers, so give this a whirl through the buildbots. llvm-svn: 166939	2012-10-29 17:31:46 +00:00
Preston Gurd	52dacca977	This patch addresses a problem with the Post RA scheduler generating an incorrect instruction sequence due to it not being aware that an inline assembly instruction may reference memory. This patch fixes the problem by causing the scheduler to always assume that any inline assembly code instruction could access memory. This is necessary because the internal representation of the inline instruction does not include any information about memory accesses. This should fix PR13504. llvm-svn: 166929	2012-10-29 15:01:23 +00:00
Lang Hames	ee6142c36b	Remove unused typedef. llvm-svn: 166910	2012-10-29 04:57:52 +00:00
Jakob Stoklund Olesen	57143f7e78	Never attempt to join an early-clobber def with a regular kill. This fixes PR14194. llvm-svn: 166880	2012-10-27 17:41:27 +00:00
Jakob Stoklund Olesen	1dfe4fc60c	Reduce indentation with early exit. No functional change. llvm-svn: 166829	2012-10-26 23:05:13 +00:00
Jakob Stoklund Olesen	7fa17d4bc8	Also make the current basic block a class member. Don't pass it around everywhere as a function argument. llvm-svn: 166828	2012-10-26 23:05:10 +00:00
Jakob Stoklund Olesen	d788e32bf5	Make the Processed set a class member. Don't pass it everywhere as an argument. llvm-svn: 166820	2012-10-26 22:06:00 +00:00
Jakob Stoklund Olesen	112a44d9af	Fix whitespace and function names to be coding standardy. No functional change. llvm-svn: 166814	2012-10-26 21:12:49 +00:00
Jakob Stoklund Olesen	09d69f5b0f	Remove the canCombineSubRegIndices() target hook. The new coalescer can already do all of this, so there is no need to duplicate the efforts. llvm-svn: 166813	2012-10-26 20:38:19 +00:00
Akira Hatanaka	6fe7acab9d	Make sure I is not the end iterator when isInsideBundle is called. llvm-svn: 166784	2012-10-26 17:11:42 +00:00
Nicolas Geoffray	457b356f3a	Remove GC roots that reference dead objects. llvm-svn: 166763	2012-10-26 09:15:55 +00:00
Nick Lewycky	1a32954279	Fix typo in comment. llvm-svn: 166750	2012-10-26 04:27:49 +00:00
Jakob Stoklund Olesen	9004798da8	Stop running the machine code verifier unconditionally. llvm-svn: 166646	2012-10-25 00:05:39 +00:00
Micah Villmow	bf3eeb2dfc	Add some cleanup to the DataLayout changes requested by Chandler. llvm-svn: 166607	2012-10-24 18:36:13 +00:00
Micah Villmow	51e7246cb4	Back out r166591, not sure why this made it through since I cancelled the command. Bleh, sorry about this! llvm-svn: 166596	2012-10-24 17:25:11 +00:00
Micah Villmow	6a8f3f9e20	Delete a directory that wasn't supposed to be checked in yet. llvm-svn: 166591	2012-10-24 17:20:04 +00:00
Micah Villmow	12d9127833	Add in support for getIntPtrType to get the pointer type based on the address space. This checkin also adds in some tests that utilize these paths and updates some of the clients. llvm-svn: 166578	2012-10-24 15:52:52 +00:00
Michael Liao	5922979e49	Teach DAG combine to fold (buildvec (Xint2fp x)) to (Xint2fp (buildvec x)) - If more than 1 elemennts are defined and target supports the vectorized conversion, use the vectorized one instead to reduce the strength on conversion operation. llvm-svn: 166546	2012-10-24 04:14:18 +00:00
Jakub Staszak	a6addc2741	Keep coding standard. Don't evaluate getNumOperands() every time. llvm-svn: 166531	2012-10-24 00:38:25 +00:00
Michael Liao	6d106b7bfd	Clean up code and put transformation on (build_vec (ext x)) into a helper func llvm-svn: 166519	2012-10-23 23:06:52 +00:00
Nadav Rotem	33e034a4b3	Make the indirect branch optimization deterministic. No functionality change. Patch by Daniel Reynaud. llvm-svn: 166501	2012-10-23 21:05:33 +00:00
Richard Smith	6289a4e85e	Per the C++ standard, we need to include the definition of llvm::Calculate in every TU where it's implicitly instantiated, even if there's an implicit instantiation for the same types available in another TU. llvm-svn: 166470	2012-10-23 06:19:46 +00:00
Jakob Stoklund Olesen	fd4ced2c52	Don't crash when the Assignments vector is empty. Reported by Vincent Lejeune using an out-of-tree target. llvm-svn: 166398	2012-10-21 19:05:03 +00:00
Benjamin Kramer	a74129adad	Symbol hygiene: Make sure declarations and definitions match, make helper functions static. llvm-svn: 166376	2012-10-20 12:53:26 +00:00
Shuxin Yang	1479fcdef1	1. Remove noreturn attribute from __builtin_debugtrap(). (The change at Clang side was committed in r166345) 2. Cosmetic change in order to conform to coding standards. llvm-svn: 166350	2012-10-19 23:00:20 +00:00
Nadav Rotem	4dc976fbcb	revert r166264 because the LTO build is still failing llvm-svn: 166340	2012-10-19 21:28:43 +00:00
Shuxin Yang	cdde059a34	This patch is to fix radar://8426430. It is about llvm support of __builtin_debugtrap() which is supposed to consistently raise SIGTRAP across all systems. In contrast, __builtin_trap() behave differently on different systems. e.g. it raises SIGTRAP on ARM, and SIGILL on X86. The purpose of __builtin_debugtrap() is to consistently provide "trap" functionality, in the mean time preserve the compatibility with on gcc on __builtin_trap(). The X86 backend is already able to handle debugtrap(). This patch is to: 1) make front-end recognize "__builtin_debugtrap()" (emboddied in the one-line change to Clang). 2) In DAG legalization phase, by default, "debugtrap" will be replaced with "trap", which make the __builtin_debugtrap() "available" to all existing ports without the hassle of changing their code. 3) If trap-function is specified (via -trap-func=xyz to llc), both __builtin_debugtrap() and __builtin_trap() will be expanded into the function call of the specified trap function. This behavior may need change in the future. The provided testing-case is to make sure 2) and 3) are working for ARM port, and we already have a testing case for x86. llvm-svn: 166300	2012-10-19 20:11:16 +00:00
Nadav Rotem	4985ddc5e0	recommit the patch that makes LSR and LowerInvoke use the TargetTransform interface. llvm-svn: 166264	2012-10-19 04:27:49 +00:00
Michael Liao	2c2358036d	Simplify condition checking as CONCAT assume all inputs of the same type. llvm-svn: 166260	2012-10-19 03:17:00 +00:00
Sebastian Pop	127777d686	Clear unknown mem ops when merging stack slots (pr14090) When merging stack slots, if StackColoring::remapInstructions gets a value back from GetUnderlyingObject that it does not know about or is not itself a stack slot, clear the memory operand in case it aliases the merged slot. This prevents the introduction of incorrect aliasing information. Author: Matthew Curtis <mcurtis@codeaurora.org> llvm-svn: 166216	2012-10-18 19:53:48 +00:00
Sebastian Pop	fdd94d4955	Change MachineFrameInfo::StackObject::Alloca from Value* to AllocaInst* This more accurately reflects what is actually being stored in the field. No functionality change intended. Author: Matthew Curtis <mcurtis@codeaurora.org> llvm-svn: 166215	2012-10-18 19:53:45 +00:00
Nadav Rotem	d5f8859672	In SimplifySelectOps we pulled two loads through a select node despite the fact that one was dependent on the other. rdar://12513091 llvm-svn: 166196	2012-10-18 18:06:48 +00:00
Bob Wilson	d6d9ccca38	Temporarily revert the TargetTransform changes. The TargetTransform changes are breaking LTO bootstraps of clang. I am working with Nadav to figure out the problem, but I am reverting it for now to get our buildbots working. This reverts svn commits: 165665 165669 165670 165786 165787 165997 and I have also reverted clang svn 165741 llvm-svn: 166168	2012-10-18 05:43:52 +00:00
Michael Liao	3ac8201ea4	Revert part of r166049 back and enable test case in r166125. - Folding (trunc (concat ... X )) to (concat ... (trunc X) ...) is valid when '...' are all 'undef's. - r166125 relies on this transformation. llvm-svn: 166155	2012-10-17 23:45:54 +00:00

... 17 18 19 20 21 ...

16046 Commits