llvm-project

Commit Graph

Author	SHA1	Message	Date
Kai Nacke	d09bb4614b	PR16726: extend rol/ror matching C-like languages promote types like unsigned short to unsigned int before performing an arithmetic operation. Currently the rotate matcher in the DAGCombiner does not consider this situation. This commit extends the DAGCombiner in the way that the pattern (or (shl ([az]ext x), (ext y)), (srl ([az]ext x), (ext (sub 32, y)))) is folded into ([az]ext (rotl x, y)) The matching is restricted to aext and zext because in this cases the upper bits are either undefined or known. Test case is included. This fixes PR16726. llvm-svn: 191049	2013-09-19 23:00:28 +00:00
Kai Nacke	2d967b2751	Revert PR16726: extend rol/ror matching There is a buildbot failure. Need to investigate this. llvm-svn: 191048	2013-09-19 22:53:36 +00:00
Kai Nacke	4eaf6444fa	PR16726: extend rol/ror matching C-like languages promote types like unsigned short to unsigned int before performing an arithmetic operation. Currently the rotate matcher in the DAGCombiner does not consider this situation. This commit extends the DAGCombiner in the way that the pattern (or (shl ([az]ext x), (ext y)), (srl ([az]ext x), (ext (sub 32, y)))) is folded into ([az]ext (rotl x, y)) The matching is restricted to aext and zext because in this cases the upper bits are either undefined or known. Test case is included. This fixes PR16726. llvm-svn: 191045	2013-09-19 22:36:39 +00:00
David Blaikie	d0a869d0bf	DebugInfo: Improve IR annotation comments for GNU pubthings. llvm-svn: 191043	2013-09-19 22:19:37 +00:00
David Blaikie	8dec407649	Unshift the GDB index/GNU pubnames constants modified in r191025 Based on code review feedback from Eric Christopher, unshifting these constants as they can appear in the gdb_index itself, shifted a further 24 bits. This means that keeping them preshifted is a bit inflexible, so let's not do that. Given the motivation, wrap up some nicer enums, more type safety, and some utility functions. llvm-svn: 191035	2013-09-19 20:40:26 +00:00
David Blaikie	b20db58a4d	DebugInfo: Simplify gnu_pubnames index computation. Names open to bikeshedding. Could switch back to the constants being unshifted, but this way seems a bit easier to work with. llvm-svn: 191025	2013-09-19 18:39:59 +00:00
David Blaikie	70a3320244	Remove unnecessary conditional operators performing bool->bool conversion. llvm-svn: 191020	2013-09-19 17:33:35 +00:00
David Blaikie	0f5ad28a9d	Fix a typo and simplify a boolean expression. llvm-svn: 191018	2013-09-19 17:27:48 +00:00
Benjamin Kramer	d443e4a080	DAGCombiner: Don't fold vector muls with constants that look like a splat of a power of 2 but differ in bit width. PR17283. llvm-svn: 191000	2013-09-19 13:28:20 +00:00
Adrian Prantl	262bcf4584	Debug info: Get rid of the VLA indirection hack in FastISel. Use the DIVariable::isIndirect() flag set by the frontend instead of guessing whether to set the machine location's indirection bit. Paired commit with CFE. llvm-svn: 190961	2013-09-18 22:08:59 +00:00
Arnold Schwaighofer	cae8735a54	Costmodel: Add support for horizontal vector reductions Upcoming SLP vectorization improvements will want to be able to estimate costs of horizontal reductions. Add infrastructure to support this. We model reductions as a series of (shufflevector,add) tuples ultimately followed by an extractelement. For example, for an add-reduction of <4 x float> we could generate the following sequence: (v0, v1, v2, v3) \ \ / / \ \ / + + (v0+v2, v1+v3, undef, undef) \ / ((v0+v2) + (v1+v3), undef, undef) %rdx.shuf = shufflevector <4 x float> %rdx, <4 x float> undef, <4 x i32> <i32 2, i32 3, i32 undef, i32 undef> %bin.rdx = fadd <4 x float> %rdx, %rdx.shuf %rdx.shuf7 = shufflevector <4 x float> %bin.rdx, <4 x float> undef, <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef> %bin.rdx8 = fadd <4 x float> %bin.rdx, %rdx.shuf7 %r = extractelement <4 x float> %bin.rdx8, i32 0 This commit adds a cost model interface "getReductionCost(Opcode, Ty, Pairwise)" that will allow clients to ask for the cost of such a reduction (as backends might generate more efficient code than the cost of the individual instructions summed up). This interface is excercised by the CostModel analysis pass which looks for reduction patterns like the one above - starting at extractelements - and if it sees a matching sequence will call the cost model interface. We will also support a second form of pairwise reduction that is well supported on common architectures (haddps, vpadd, faddp). (v0, v1, v2, v3) \ / \ / (v0+v1, v2+v3, undef, undef) \ / ((v0+v1)+(v2+v3), undef, undef, undef) %rdx.shuf.0.0 = shufflevector <4 x float> %rdx, <4 x float> undef, <4 x i32> <i32 0, i32 2 , i32 undef, i32 undef> %rdx.shuf.0.1 = shufflevector <4 x float> %rdx, <4 x float> undef, <4 x i32> <i32 1, i32 3, i32 undef, i32 undef> %bin.rdx.0 = fadd <4 x float> %rdx.shuf.0.0, %rdx.shuf.0.1 %rdx.shuf.1.0 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef, <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef> %rdx.shuf.1.1 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef, <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef> %bin.rdx.1 = fadd <4 x float> %rdx.shuf.1.0, %rdx.shuf.1.1 %r = extractelement <4 x float> %bin.rdx.1, i32 0 llvm-svn: 190876	2013-09-17 18:06:50 +00:00
Serge Pavlov	8ec39992c1	Added documentation to getMemsetStores. llvm-svn: 190866	2013-09-17 16:24:42 +00:00
Quentin Colombet	d30a9585b8	[SelectionDAG] Teach the vector scalarizer about TRUNCATE. When a truncate node defines a legal vector type but uses an illegal vector type, the legalization process was splitting the vector until <1 x vector> type, but then it was failing to scalarize the node because it did not know how to handle TRUNCATE. <rdar://problem/14989896> llvm-svn: 190830	2013-09-17 00:26:56 +00:00
Adrian Prantl	db3e26d193	Debug info: Fix PR16736 and rdar://problem/14990587. A DBG_VALUE is register-indirect iff the first operand is a register _and_ the second operand is an immediate. llvm-svn: 190821	2013-09-16 23:29:03 +00:00
Jakub Staszak	ec2ffa92d8	Use reference instead of copy. llvm-svn: 190813	2013-09-16 22:03:38 +00:00
Peter Collingbourne	3fa50f9b05	Implement function prefix data as an IR feature. Previous discussion: http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/063909.html Differential Revision: http://llvm-reviews.chandlerc.com/D1191 llvm-svn: 190773	2013-09-16 01:08:15 +00:00
Benjamin Kramer	7d6052687e	Replace some unnecessary vector copies with references. llvm-svn: 190770	2013-09-15 22:04:42 +00:00
Hal Finkel	31658834e6	Prevent assert in CombinerGlobalAA with null values DAGCombiner::isAlias can be called with SrcValue1 or SrcValue2 null, and we can't use AA in this case (if we try, then the casting code in AA will assert). llvm-svn: 190763	2013-09-15 02:19:49 +00:00
Quentin Colombet	cf71c6320b	[Peephole] Rewrite copies to avoid cross register banks copies. By definition copies across register banks are not coalescable. Still, it may be possible to get rid of such a copy when the value is available in another register of the same register file. Consider the following example, where capital and lower letters denote different register file: b = copy A <-- cross-bank copy ... C = copy b <-- cross-bank copy This could have been optimized this way: b = copy A <-- cross-bank copy ... C = copy A <-- same-bank copy Note: b and C's definitions may be in different basic blocks. This patch adds a peephole optimization that looks through a chain of copies leading to a cross-bank copy and reuses a source that is on the same register file if available. This solution could also be used to get rid of some copies (e.g., A could have been used instead of C). However, we do not do so because: - It may over constrain the coloring of the source register for coalescing. - The register allocator may not be able to find a nice split point for the longer live-range, leading to more spill. <rdar://problem/14742333> llvm-svn: 190713	2013-09-13 18:26:31 +00:00
Eric Christopher	dd1a01203d	Add initial support for handling gnu style pubnames accepted by some versions of gold. This support is designed to allow gold to produce gdb_index sections similar to the accelerator tables and consumable by gdb. llvm-svn: 190649	2013-09-13 00:35:05 +00:00
Eric Christopher	8b3737fbb0	Reformat and hoist section grabbing to top level. llvm-svn: 190648	2013-09-13 00:34:58 +00:00
Joey Gouly	0e76fa7df5	Add an instruction deprecation feature to TableGen. The 'Deprecated' class allows you to specify a SubtargetFeature that the instruction is deprecated on. The 'ComplexDeprecationPredicate' class allows you to define a custom predicate that is called to check for deprecation. For example: ComplexDeprecationPredicate<"MCR"> would mean you would have to define the following function: bool getMCRDeprecationInfo(MCInst &MI, MCSubtargetInfo &STI, std::string &Info) Which returns 'false' for not deprecated, and 'true' for deprecated and store the warning message in 'Info'. The MCTargetAsmParser constructor was chaned to take an extra argument of the MCInstrInfo class, so out-of-tree targets will need to be changed. llvm-svn: 190598	2013-09-12 10:28:05 +00:00
Hal Finkel	6f1ff8e1a8	Fix crash in AggressiveAntiDepBreaker with empty CriticalPathSet If no register classes are added to CriticalPathRCs, then the CriticalPathSet bitmask will be empty. In that case, ExcludeRegs must remain NULL or else this line will cause a segfault: } else if ((ExcludeRegs != NULL) && ExcludeRegs->test(AntiDepReg)) { I have no in-tree test case. llvm-svn: 190584	2013-09-12 04:22:31 +00:00
Matt Arsenault	bc08ddba58	Remove pointless assertion after r190376 llvm-svn: 190565	2013-09-12 01:07:49 +00:00
Manman Ren	5b2f4b0540	Debug info: add more comments. llvm-svn: 190544	2013-09-11 19:40:28 +00:00
Hal Finkel	8f2e700522	Add getUnrollingPreferences to TTI Allow targets to customize the default behavior of the generic loop unrolling transformation. This will be used by the PowerPC backend when targeting the A2 core (which is in-order with a deep pipeline), and using more aggressive defaults is important. llvm-svn: 190542	2013-09-11 19:25:43 +00:00
Benjamin Kramer	079b96e6f7	Revert "Give internal classes hidden visibility." It works with clang, but GCC has different rules so we can't make all of those hidden. This reverts commit r190534. llvm-svn: 190536	2013-09-11 18:05:11 +00:00
Benjamin Kramer	6a44af3629	Give internal classes hidden visibility. Worth 100k on a linux/x86_64 Release+Asserts clang. llvm-svn: 190534	2013-09-11 17:42:27 +00:00
Bill Wendling	62a2d14ac5	Simplify the checking of function attributes by using the simple methods. llvm-svn: 190499	2013-09-11 08:35:09 +00:00
Eli Friedman	8f06d55697	Rename variables for consistency. No functional change. llvm-svn: 190466	2013-09-11 00:41:02 +00:00
Eli Friedman	78bffa5767	Fix unused variables. llvm-svn: 190448	2013-09-10 23:18:14 +00:00
Eric Christopher	13b99d2aba	Hoist section call out of loop. llvm-svn: 190440	2013-09-10 21:49:37 +00:00
Manman Ren	2312ed35d2	Debug Info: create scope children DIEs when the scope DIE is not null. We try to create the scope children DIEs after we create the scope DIE. But to avoid emitting empty lexical block DIE, we first check whether a scope DIE is going to be null, then create the scope children if it is not null. From the number of children, we decide whether to actually create the scope DIE. This patch also removes an early exit which checks for a special condition. It also removes deletion of un-used children DIEs that are generated because we used to generate children DIEs before the scope DIE. Deletion of un-used children DIEs may cause problem because we sometimes keep created DIEs in a member variable of a CU. llvm-svn: 190421	2013-09-10 18:40:41 +00:00
Manman Ren	34b3dcc3b5	Debug Info: define a DIRef template. Specialize the constructors for DIRef<DIScope> and DIRef<DIType> to make sure the Value is indeed a scope ref and a type ref. Use DIScopeRef for DIScope::getContext and DIType::getContext and use DITypeRef for getContainingType and getClassType. DIScope::generateRef now returns a DIScopeRef instead of a "Value *" for readability and type safety. llvm-svn: 190418	2013-09-10 18:30:07 +00:00
Matt Arsenault	d232222f34	Don't use getSetCCResultType for creating a vselect The vselect mask isn't a setcc. This breaks in the case when the result of getSetCCResultType is larger than the vector operands e.g. %tmp = select i1 %cmp <2 x i8> %a, <2 x i8> %b when getSetCCResultType returns <2 x i32>, the assertion that the (MaskTy.getSizeInBits() == Op1.getValueType().getSizeInBits()) is hit. No test since I don't think I can hit this with any of the current targets. The R600/SI implementation would break, since it returns a vector of i1 for this, but it doesn't reach ExpandSELECT for other reasons. llvm-svn: 190376	2013-09-10 00:41:56 +00:00
Andrew Trick	6c88b35090	Enable -misched-cyclicpath by default. llvm-svn: 190367	2013-09-09 23:31:14 +00:00
Manman Ren	de897a369a	Debug Info: move DIScope::getContext back from DwarfDebug. This partially reverts r190330. DIScope::getContext now returns DIScopeRef instead of DIScope. We construct a DIScopeRef from DIScope when we are dealing with subprogram, lexical block or name space. llvm-svn: 190362	2013-09-09 22:35:23 +00:00
Andrew Trick	e1f7bf2c02	mi-sched: smooth out the cyclicpath heuristic. Arnold's idea. I generally try to avoid stateful heuristics because it can make debugging harder. However, we need a way to prevent the latency priority from dominating, and it somewhat makes sense to schedule aggressively for latency only within an issue group. Swift in particular likes this, and it doesn't hurt anyone else: \| Benchmarks/MiBench/consumer-lame \| 10.39% \| \| Benchmarks/Misc/himenobmtxpa \| 9.63% \| llvm-svn: 190360	2013-09-09 22:28:08 +00:00
Jack Carter	170a5f2983	white spaces and long lines llvm-svn: 190358	2013-09-09 22:02:08 +00:00
Eric Christopher	ba506db498	Always add global names. We're adding them in the rest of the code as well as types. No functional change as they're not emitted unless the option is true anyhow. llvm-svn: 190346	2013-09-09 20:03:20 +00:00
Eric Christopher	5f93bb9299	Rename for consistency. llvm-svn: 190345	2013-09-09 20:03:17 +00:00
Bill Wendling	550c76dbd6	Call generateCompactUnwindEncodings() right before we need to output the frame information. There are more than one paths to where the frame information is emitted. Place the call to generateCompactUnwindEncodings() into the method which outputs the frame information, thus ensuring that the encoding is there for every path. This involved threading the MCAsmBackend object through to this method. <rdar://problem/13623355> llvm-svn: 190335	2013-09-09 19:48:37 +00:00
Manman Ren	116868eadd	Debug Info: Use DIScopeRef for DIType::getContext. In DIBuilder, the context field of a TAG_member is updated to use the scope reference. Verifier is updated accordingly. DebugInfoFinder now needs to generate a type identifier map to have access to the actual scope. Same applies for BreakpointPrinter. processModule of DebugInfoFinder is called during initialization phase of the verifier to make sure the type identifier map is constructed early enough. We are now able to unique a simple class as demonstrated by the added testing case. llvm-svn: 190334	2013-09-09 19:47:11 +00:00
Manman Ren	33796c5e98	Debug Info: move DIScope::getContext to DwarfDebug. DIScope::getContext is a wrapper function that calls the specific getContext method on each subclass. When we switch DIType::getContext to return DIScopeRef instead of DIScope, DIScope::getContext can no longer return a DIScope without a type identifier map. DIScope::getContext is only used by DwarfDebug, so we move it to DwarfDebug to have easy access to the type identifier map. llvm-svn: 190330	2013-09-09 19:23:58 +00:00
Bob Wilson	e407736a06	Revert patches to add case-range support for PR1255. The work on this project was left in an unfinished and inconsistent state. Hopefully someone will eventually get a chance to implement this feature, but in the meantime, it is better to put things back the way the were. I have left support in the bitcode reader to handle the case-range bitcode format, so that we do not lose bitcode compatibility with the llvm 3.3 release. This reverts the following commits: 155464, 156374, 156377, 156613, 156704, 156757, 156804 156808, 156985, 157046, 157112, 157183, 157315, 157384, 157575, 157576, 157586, 157612, 157810, 157814, 157815, 157880, 157881, 157882, 157884, 157887, 157901, 158979, 157987, 157989, 158986, 158997, 159076, 159101, 159100, 159200, 159201, 159207, 159527, 159532, 159540, 159583, 159618, 159658, 159659, 159660, 159661, 159703, 159704, 160076, 167356, 172025, 186736 llvm-svn: 190328	2013-09-09 19:14:35 +00:00
Manman Ren	3eb9dffc89	Debug Info: Move isSubprogramContext from DebugInfo to DwarfDebug. This helper function needs the type identifier map when we switch DIType::getContext to return DIScopeRef instead of DIScope. Since isSubprogramContext is used by DwarfDebug only, We move it to DwarfDebug to have easy access to the map. llvm-svn: 190325	2013-09-09 19:05:21 +00:00
Manman Ren	856191b0d1	Debug Info: Rename DITypeRef to DIScopeRef. A reference to a scope is more general than a reference to a type since DIType is a subclass of DIScope. A reference to a type can be either an identifier for the type or the DIType itself, while a reference to a scope can be either an identifier for the type (when the scope is indeed a type) or the DIScope itself. A reference to a type and a reference to a scope will be resolved in the same way. The only difference is in the verifier when a field is a reference to a type (i.e. the containing type field of a DICompositeType) or a field is a reference to a scope (i.e. the context field of a DIType). This is to get ready for switching DIType::getContext to return DIScopeRef instead of DIScope. Tighten up isTypeRef and isScopeRef to make sure the identifier is not empty and the MDNode is DIType for TypeRef and DIScope for ScopeRef. llvm-svn: 190322	2013-09-09 19:03:51 +00:00
Benjamin Kramer	d93817ffe0	[stackprotector] Modernize code with IRBuilder llvm-svn: 190317	2013-09-09 17:38:01 +00:00
Joey Gouly	a5153cb025	[ARMv8] Prevent generation of deprecated IT blocks on ARMv8 in Thumb mode. IT blocks can only be one instruction lonf, and can only contain a subset of the 16 instructions. Patch by Artyom Skrobov! llvm-svn: 190309	2013-09-09 14:21:49 +00:00
Bill Wendling	58e2d3d856	Generate compact unwind encoding from CFI directives. We used to generate the compact unwind encoding from the machine instructions. However, this had the problem that if the user used `-save-temps' or compiled their hand-written `.s' file (with CFI directives), we wouldn't generate the compact unwind encoding. Move the algorithm that generates the compact unwind encoding into the MCAsmBackend. This way we can generate the encoding whether the code is from a `.ll' or `.s' file. <rdar://problem/13623355> llvm-svn: 190290	2013-09-09 02:37:14 +00:00
Manman Ren	c4ae9b3aeb	Debug Info: Use identifier to reference DIType in containing type field of a DISubprogram. Verifier is updated accordingly. llvm-svn: 190229	2013-09-07 00:04:05 +00:00
Manman Ren	d8f798ea97	Debug Info: Use identifier to reference DIType in containing type field of a DICompositeType. Verifier is updated accordingly. llvm-svn: 190190	2013-09-06 18:46:00 +00:00
Andrew Trick	b248b4a1de	mi-sched: cleanup register pressure update, remove a FIXME. llvm-svn: 190181	2013-09-06 17:32:47 +00:00
Andrew Trick	c573cd905a	mi-sched: improve regpressure tracing. llvm-svn: 190180	2013-09-06 17:32:44 +00:00
Andrew Trick	7609b7d1b5	mi-sched: print tree size in -view-misched-dags llvm-svn: 190179	2013-09-06 17:32:42 +00:00
Andrew Trick	ffdbefb90c	mi-sched: register pressure update tracing. llvm-svn: 190178	2013-09-06 17:32:39 +00:00
Andrew Trick	ddffae9027	mi-sched: Reorder Cyclicpath (latency) and CriticalMax (pressure) heuristics. The latency based scheduling could induce spills in some cases. llvm-svn: 190177	2013-09-06 17:32:36 +00:00
Andrew Trick	75e411cc8e	Added MachineSchedPolicy. Allow subtargets to customize the generic scheduling strategy. This is convenient for targets that don't need to add new heuristics by specializing the strategy. llvm-svn: 190176	2013-09-06 17:32:34 +00:00
Matthias Braun	305ef7f5b0	avoid unnecessary direct access to LiveInterval::ranges llvm-svn: 190170	2013-09-06 16:44:32 +00:00
Matthias Braun	90e0d3c03a	remove unused argument from LiveRanges::join() llvm-svn: 190169	2013-09-06 16:44:29 +00:00
Matthias Braun	c0ad7bfa62	remove pointless assert The if above it ensures the property anyway. llvm-svn: 190168	2013-09-06 16:44:27 +00:00
Matthias Braun	b348d9703c	fix comment There's no 'B3' in the example. llvm-svn: 190167	2013-09-06 16:44:25 +00:00
Tim Northover	950fcc0577	SelectionDAG: create correct BooleanContent constants Occasionally DAGCombiner can spot that a SETCC operation is completely redundant and reduce it to "all true" or "all false". If this happens to a vector, the value produced has to take account of what a normal comparison would have produced, which may be an all-1s bitmask. The fix in SelectionDAG.cpp is tested, however, as far as I can see the code in TargetLowering.cpp is possibly unreachable and almost certainly irrelevant when triggered so there are no tests. However, I believe it's still clearly the right change and may save someone else some hassle if it suddenly becomes reachable. So I'm doing it anyway. llvm-svn: 190147	2013-09-06 12:38:12 +00:00
Manman Ren	60352032bf	Debug Info: Use identifier to reference DIType in base type field of ptr_to_member. We introduce a new class DITypeRef that represents a reference to a DIType. It wraps around a Value*, which can be either an identifier in MDString or an actual MDNode. The class has a helper function "resolve" that finds the actual MDNode for a given DITypeRef. We specialize getFieldAs to return a field that is a reference to a DIType. To correctly access the base type field of ptr_to_member, getClassType now calls getFieldAs<DITypeRef> to return a DITypeRef. Also add a typedef for DITypeIdentifierMap and a helper generateDITypeIdentifierMap in DebugInfo.h. In DwarfDebug.cpp, we keep a DITypeIdentifierMap and call generateDITypeIdentifierMap to actually populate the map. Verifier is updated accordingly. llvm-svn: 190081	2013-09-05 18:48:31 +00:00
Eric Christopher	cf7289f6d9	Move accelerator table defines and constants to Dwarf.h since we're proposing it for DWARF5. No functional change intended. llvm-svn: 190074	2013-09-05 18:20:16 +00:00
Eric Christopher	b4e2cc49ef	Reformat. llvm-svn: 190064	2013-09-05 16:46:43 +00:00
Andrew Trick	ed20075d19	mi-sched: Force bottom up scheduling for generic targets. Fast register pressure tracking currently only takes effect during bottom up scheduling. Forcing this is a bit faster and simpler for targets that don't have many scheduling constraints and don't need top-down scheduling. llvm-svn: 190014	2013-09-04 23:54:00 +00:00
Eric Christopher	e31e072c33	Remove hack ensuring that darwin didn't produce dwarf > 3 for modules without a limiting factor. Update all testcases accordingly. llvm-svn: 190002	2013-09-04 22:21:24 +00:00
Eric Christopher	c9f1e785d5	Revert "Revert r189902 as the workaround shouldn't be necessary anymore." Needs testcase updates. llvm-svn: 190000	2013-09-04 21:36:52 +00:00
Eric Christopher	b72ef638f4	Revert r189902 as the workaround shouldn't be necessary anymore. llvm-svn: 189999	2013-09-04 21:26:56 +00:00
Andrew Trick	b05db8e0b9	comment typo llvm-svn: 189997	2013-09-04 21:12:05 +00:00
Andrew Trick	2a749ee0b9	Remove dead subtree limit code. llvm-svn: 189995	2013-09-04 21:00:20 +00:00
Andrew Trick	856ecd9ab3	-view-misched-dags, better pruning. llvm-svn: 189994	2013-09-04 21:00:18 +00:00
Andrew Trick	ef54c59490	mi-sched: DEBUG cleanup, call tracePick for unidirectional scheduling. llvm-svn: 189993	2013-09-04 21:00:16 +00:00
Andrew Trick	1ab16d9ecf	80 columns llvm-svn: 189992	2013-09-04 21:00:13 +00:00
Andrew Trick	66c3dfbf8c	mi-sched: Suppress register pressure tracking when the scheduling window is too small. If the instruction window is < NumRegs/2, pressure tracking is not likely to be effective. The scheduler has to process a very large number of tiny blocks. We want this to be fast. llvm-svn: 189991	2013-09-04 21:00:11 +00:00
Andrew Trick	a6e877707f	mi-sched: Load clustering is a bit to expensive to enable unconditionally. llvm-svn: 189990	2013-09-04 21:00:08 +00:00
Andrew Trick	8c699c93b2	mi-sched: Reuse an invalid HazardRecognizer to save compile time. llvm-svn: 189989	2013-09-04 21:00:05 +00:00
Andrew Trick	310190e21f	mi-sched: bypass heuristic checks when regpressure tracking is disabled. llvm-svn: 189988	2013-09-04 21:00:02 +00:00
Andrew Trick	b6e74712b6	Added -misched-regpressure option. Register pressure tracking is half the complexity of the scheduler. It's useful to be able to turn it off for compile time and performance comparisons. llvm-svn: 189987	2013-09-04 20:59:59 +00:00
Eric Christopher	9adc55faa7	Unify and clean up. llvm-svn: 189977	2013-09-04 19:53:21 +00:00
Michael Gottesman	c89466fc22	Revert "Revert "Remove the darwin gdb option, that version of gdb is now dead and the rest of the compatibility should be done on a dwarf-N level."" This reverts commit r189913. Talked with Eric on IRC. I am going to XFAIL the failing test since it is using what Eric described as "the member hack" which was needed on that old GDB. Sorry for the noise! llvm-svn: 189914	2013-09-04 04:39:38 +00:00
Michael Gottesman	a318370b8d	Revert "Remove the darwin gdb option, that version of gdb is now dead and the rest of the compatibility should be done on a dwarf-N level." This reverts commit r189903. This commit broke the phase 1 buildbot for a while. http://lab.llvm.org:8013/builders/clang-x86_64-darwin11-nobootstrap-RAincremental/builds/6684 llvm-svn: 189913	2013-09-04 04:31:56 +00:00
Eric Christopher	614dc83603	Remove the darwin gdb option, that version of gdb is now dead and the rest of the compatibility should be done on a dwarf-N level. llvm-svn: 189903	2013-09-04 02:02:10 +00:00
Eric Christopher	38f1c64098	Make the default dwarf version 3 for darwin when we can't find one in the module. Add a FIXME with a comment about darwin's ld. llvm-svn: 189902	2013-09-04 01:38:30 +00:00
Eric Christopher	25b7adc8ce	Add a hashing routine that handles hashing types. Add a test for hashing the contents of DW_FORM_data1 on top of a type with attributes. llvm-svn: 189862	2013-09-03 21:57:57 +00:00
Eric Christopher	b86e2ad819	Sentences end with periods. llvm-svn: 189861	2013-09-03 21:57:50 +00:00
Eric Christopher	e020fa7c9c	Add the rest of the stock attributes to the attribute table. This won't affect the kinds of hashes we test for as we actually do hashing based on form and attribute. Change the fission-hash testcase one last time to handle DW_AT_comp_dir. llvm-svn: 189840	2013-09-03 20:00:20 +00:00
Andrew Trick	2c4f8b7ee8	Fix my previous checkin to updatePressureDiffs. There was one case that we could hit a DebugValue where I didn't think to check. DebugValues are evil. No checkinable test case, sorry. It's an obvious fix. llvm-svn: 189717	2013-08-31 05:17:58 +00:00
Andrew Trick	3bf33075ce	Use LiveRangeQuery for instruction-level liveness queries. Remove redundant or bug-prone LiveInterval APIs. llvm-svn: 189685	2013-08-30 17:58:49 +00:00
Andrew Trick	2bc74c2887	mi-sched: update PressureDiffs on-the-fly for liveness. This removes all expensive pressure tracking logic from the scheduling critical path of node comparison. llvm-svn: 189643	2013-08-30 04:36:57 +00:00
Andrew Trick	ff60477306	Replace LiveInterval::killedAt with isKilledAtInstr. Return true for LRGs that end at EarlyClobber or Register slots. llvm-svn: 189642	2013-08-30 04:31:01 +00:00
Andrew Trick	b1a45b6c61	mi-sched: improve the generic register pressure comparison. Only compare pressure within the same set. When multiple sets are affected, we prioritize the most constrained set. llvm-svn: 189641	2013-08-30 04:27:29 +00:00
Andrew Trick	1a8313458f	mi-sched: Precompute a PressureDiff for each instruction, adjust for liveness later. Created SUPressureDiffs array to hold the per node PDiff computed during DAG building. Added a getUpwardPressureDelta API that will soon replace the old one. Compute PressureDelta here from the precomputed PressureDiffs. Updating for liveness will come next. llvm-svn: 189640	2013-08-30 03:49:48 +00:00
Andrew Trick	ef80f50058	comment typo llvm-svn: 189635	2013-08-30 02:02:12 +00:00
Eric Christopher	4b358188c6	Don't bother emitting the pubtypes section on darwin since there aren't any maintained consumers of it on that platform. llvm-svn: 189631	2013-08-30 00:40:17 +00:00
Eric Christopher	ac8199bf60	Reformat slightly. llvm-svn: 189630	2013-08-30 00:39:57 +00:00
Andrew Trick	483f4199f3	Comment and revise the cyclic critical path code. This should be much more clear now. It's still disabled pending testing. llvm-svn: 189597	2013-08-29 18:04:49 +00:00
Hal Finkel	8e83820a04	Revert: r189565 - Add getUnrollingPreferences to TTI Revert unintentional commit (of an unreviewed change). Original commit message: Add getUnrollingPreferences to TTI Allow targets to customize the default behavior of the generic loop unrolling transformation. This will be used by the PowerPC backend when targeting the A2 core (which is in-order with a deep pipeline), and using more aggressive defaults is important. llvm-svn: 189566	2013-08-29 03:33:15 +00:00
Hal Finkel	63e6c0e9fb	Add getUnrollingPreferences to TTI Allow targets to customize the default behavior of the generic loop unrolling transformation. This will be used by the PowerPC backend when targeting the A2 core (which is in-order with a deep pipeline), and using more aggressive defaults is important. llvm-svn: 189565	2013-08-29 03:29:57 +00:00
Hal Finkel	5ef4dccdce	Use TargetSubtargetInfo::useAA() in DAGCombine This uses the TargetSubtargetInfo::useAA() function to control the defaults of the -combiner-alias-analysis and -combiner-global-alias-analysis options. llvm-svn: 189564	2013-08-29 03:29:55 +00:00
Hal Finkel	b350ffd1b1	Add useAA() to TargetSubtargetInfo There are several optional (off-by-default) features in CodeGen that can make use of alias analysis. These features are important for generating code for some kinds of cores (for example the (in-order) PPC A2 core). This adds a useAA() function to TargetSubtargetInfo to allow these features to be enabled by default on a per-subtarget basis. Here is the first use of this function: To control the default of the -enable-aa-sched-mi feature. llvm-svn: 189563	2013-08-29 03:25:05 +00:00
Juergen Ributzka	11c52c601a	Fix a typo and coding style of a previous commit. No functional change. llvm-svn: 189526	2013-08-28 22:33:58 +00:00
Eric Christopher	62caa709fe	Remove support for the .debug_inlined section. No known software in use supports it. llvm-svn: 189439	2013-08-28 04:04:28 +00:00
Eric Christopher	e9fd605b41	Add a TODO here. llvm-svn: 189428	2013-08-28 00:13:08 +00:00
Eric Christopher	d033d6fb88	Add support for DW_FORM_dataN and DW_FORM_udata to the DIE hashing algorithm. Update the split dwarf hashing testcase accordingly - this should be the last time that the hash of an empty file changes. llvm-svn: 189427	2013-08-28 00:10:38 +00:00
Eric Christopher	9d1daa87e7	Use DW_FORM_sdata for signed constant values and udata on occasion when we can. Migrate from using blocks when we're adding just a single attribute and floating point values are an unsigned, not signed, bag of bits. Update all test cases accordingly. llvm-svn: 189419	2013-08-27 23:49:04 +00:00
Tim Northover	819bfb5a25	DAGCombiner: make sure or/shl/srl really has zero high bits before forming bswap We want to convert code like (or (srl N, 8), (shl N, 8)) into (srl (bswap N), const), but this is only valid if the bits above 16 on the source pattern are 0, the checks we were doing on this were slightly wrong before. llvm-svn: 189348	2013-08-27 13:46:45 +00:00
Owen Anderson	a0260f848d	Remove an over-zealous assertion. A pointer type could be illegal if the target is prepared to custom-legalize pointer operands. This assertion was evaluated before the target would have a chance to do so, making it impossible. llvm-svn: 189299	2013-08-27 00:28:23 +00:00
Eric Christopher	ca68bbf5c0	Formatting. llvm-svn: 189296	2013-08-26 23:58:22 +00:00
Eric Christopher	6b16b43ef9	Make the lifetime of the DICompileUnit we're constructing from the MDNode more clear as just for a single argument. llvm-svn: 189294	2013-08-26 23:57:03 +00:00
Eric Christopher	6fdf324f44	Have the skeleton compile unit construction method take the CU it is constructing from as an input and keep the same unique identifier. We can use this to connect items which must stay in the .o file (e.g. pubnames and pubtypes) to the skeleton cu rather than having duplicate unique numbers for the sections and needing to do lookups based on MDNode. llvm-svn: 189293	2013-08-26 23:50:43 +00:00
Eric Christopher	6d13fe007f	Remove duplicate set of CompilationDir. llvm-svn: 189292	2013-08-26 23:50:40 +00:00
Eric Christopher	bfceb2fe8f	Remove the language parameter and variable from the compile unit. We can get it via the MDNode that's passed in. Save that instead. llvm-svn: 189291	2013-08-26 23:50:38 +00:00
Eric Christopher	4d36ca009f	Treat the pubtypes section similarly to the pubnames section and emit it by default under linux or when we're trying to keep compatibility with old gdb versions. Fix testcase for option name change. llvm-svn: 189289	2013-08-26 23:24:35 +00:00
Eric Christopher	bf1ea3c727	Only emit the section sym if we're emitting the section. llvm-svn: 189288	2013-08-26 23:24:31 +00:00
Eric Christopher	5297df025c	Fix thinko. llvm-svn: 189279	2013-08-26 20:58:35 +00:00
Tom Stellard	838e2344ec	SelectionDAG: Remove unnecessary uses of TargetLowering::getPointerTy() If we have a binary operation like ISD:ADD, we can set the result type equal to the result type of one of its operands rather than using TargetLowering::getPointerTy(). Also, any use of DAG.getIntPtrConstant(C) as an operand for a binary operation can be replaced with: DAG.getConstant(C, OtherOperand.getValueType()); llvm-svn: 189227	2013-08-26 15:06:10 +00:00
Tom Stellard	7da047c9fb	SelectionDAG: Use correct pointer size when splitting vector stores llvm-svn: 189224	2013-08-26 15:05:55 +00:00
Tom Stellard	fd155828ed	SelectionDAG: Use correct pointer size when lowering function arguments v2 This adds minimal support to the SelectionDAG for handling address spaces with different pointer sizes. The SelectionDAG should now correctly lower pointer function arguments to the correct size as well as generate the correct code when lowering getelementptr. This patch also updates the R600 DataLayout to use 32-bit pointers for the local address space. v2: - Add more helper functions to TargetLoweringBase - Use CHECK-LABEL for tests llvm-svn: 189221	2013-08-26 15:05:36 +00:00
David Majnemer	b78df507c8	AsmPrinter: Get rid of llvm$workaround$fake$stub$ We currently emit labels with the prefix Lllvm$workaround$fake$stub$ if the target's MCAsmInfo has getLinkOnceDirective() mapped to something interesting. This was apparently a work around introduced in r31033 for binutils that we don't need anymore. llvm-svn: 189187	2013-08-25 09:18:19 +00:00
Benjamin Kramer	b12cf01908	Add a function object to compare the first or second component of a std::pair. Replace instances of this scattered around the code base. llvm-svn: 189169	2013-08-24 12:54:27 +00:00
Benjamin Kramer	260de74e48	Simplify code. No functionality change. llvm-svn: 189168	2013-08-24 12:15:54 +00:00
Benjamin Kramer	892daba8d3	DwarfDebug: Delete orphaned children. Leak found by valgrind. llvm-svn: 189167	2013-08-24 11:55:49 +00:00
Andrew Trick	475a9911ca	PrintVRegOrUnit llvm-svn: 189124	2013-08-23 17:48:53 +00:00
Andrew Trick	e4c1ba762d	Rename to RegPressure API parameters RegUnits. llvm-svn: 189123	2013-08-23 17:48:51 +00:00
Andrew Trick	01bc216482	Simplify RegPressure helpers. llvm-svn: 189122	2013-08-23 17:48:48 +00:00
Andrew Trick	86a7061e5d	Add a convenient PSetIterator for visiting pressure sets affected by a register. llvm-svn: 189121	2013-08-23 17:48:46 +00:00
Andrew Trick	c01b00400d	Adds cyclic critical path computation and heuristics, temporarily disabled. Estimate the cyclic critical path within a single block loop. If the acyclic critical path is longer, then the loop will exhaust OOO resources after some number of iterations. If lag between the acyclic critical path and cyclic critical path is longer the the time it takes to issue those loop iterations, then aggressively schedule for latency. llvm-svn: 189120	2013-08-23 17:48:43 +00:00
Andrew Trick	8dd26f002f	MI Sched: record local vreg uses. This will be used to compute the cyclic critical path and to update precomputed per-node pressure differences. In the longer term, it could also be used to speed up LiveInterval update by avoiding visiting all global vreg users. llvm-svn: 189118	2013-08-23 17:48:39 +00:00
Andrew Trick	a53e101627	mi-sched: Don't call MBB.size() in initSUnits. The driver already has instr count. This fixes a pathological compile time problem with very large blocks and lots of scheduling boundaries. llvm-svn: 189116	2013-08-23 17:48:33 +00:00
Richard Sandiford	37cd6cfba2	Turn MipsOptimizeMathLibCalls into a target-independent scalar transform ...so that it can be used for z too. Most of the code is the same. The only real change is to use TargetTransformInfo to test when a sqrt instruction is available. The pass is opt-in because at the moment it only handles sqrt. llvm-svn: 189097	2013-08-23 10:27:02 +00:00
Michael Gottesman	20f25eb958	[stack protector] Work around an issue with the BMOVPCB_CALL instruction on ARM by disabling does not return on __stack_chk_fail. This is to fix the bots while I look to see if there is something I can do here. rdar://14811848 llvm-svn: 189076	2013-08-22 23:45:24 +00:00
Bill Wendling	fe88aea706	Check only if we have this attribute. If it's not an attribute, then it's assumed false. llvm-svn: 189063	2013-08-22 21:16:14 +00:00
Michael Gottesman	1adac3582d	[stackprotector] When finding the split point to splice off the end of a parentmbb into a successmbb, include any DBG_VALUE MI. Fix for PR16954. llvm-svn: 188987	2013-08-22 05:40:50 +00:00
Tom Stellard	1b2c2d8414	SelectionDAG: Make sure stores are always added to the LegalizedNodes list When truncated vector stores were being custom lowered in VectorLegalizer::LegalizeOp(), the old (illegal) and new (legal) node pair was not being added to LegalizedNodes list. Instead of the legalized result being passed to VectorLegalizer::TranslateLegalizeResult(), the result was being passed back into VectorLegalizer::LegalizeOp(), which ended up adding a (new, new) pair to the list instead. This was causing an assertion failure when a custom lowered truncated vector store was the last instruction a basic block and the VectorLegalizer was unable to find it in the LegalizedNodes list when updating the DAG root. llvm-svn: 188953	2013-08-21 22:42:58 +00:00
Juergen Ributzka	3db39dc1ae	Teach BaseIndexOffset::match to identify base pointers in loops. The small utility function that pattern matches Base + Index + Offset patterns for loads and stores fails to recognize the base pointer for loads/stores from/into an array at offset 0 inside a loop. As a result DAGCombiner::MergeConsecutiveStores was not able to merge all stores. This commit fixes the issue by adding an additional pattern match and also a test case. Reviewer: Nadav llvm-svn: 188936	2013-08-21 21:53:38 +00:00
David Majnemer	ed89b5c6e7	DebugInfo: Do not use the DWARF Version for the .debug_pubnames or .debug_pubtypes version field Summary: LLVM would generate DWARF with version 3 in the .debug_pubname and .debug_pubtypes version fields. This would lead SGI dwarfdump to fail parsing the DWARF with (in the instance of .debug_pubnames) would exit with: dwarfdump ERROR: dwarf_get_globals: DW_DLE_PUBNAMES_VERSION_ERROR (123) This fixes PR16950. Reviewers: echristo, dblaikie Reviewed By: echristo CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1454 llvm-svn: 188869	2013-08-21 06:13:34 +00:00
Richard Sandiford	6f6d55161b	[SystemZ] Use SRST to optimize memchr SystemZTargetLowering::emitStringWrapper() previously loaded the character into R0 before the loop and made R0 live on entry. I'd forgotten that allocatable registers weren't allowed to be live across blocks at this stage, and it confused LiveVariables enough to cause a miscompilation of f3 in memchr-02.ll. This patch instead loads R0 in the loop and leaves LICM to hoist it after RA. This is actually what I'd tried originally, but I went for the manual optimisation after noticing that R0 often wasn't being hoisted. This bug forced me to go back and look at why, now fixed as r188774. We should also try to optimize null checks so that they test the CC result of the SRST directly. The select between null and the SRST GPR result could then usually be deleted as dead. llvm-svn: 188779	2013-08-20 09:38:48 +00:00
Richard Sandiford	96aa93d5f1	Fix overly pessimistic shortcut in post-RA MachineLICM Post-RA LICM keeps three sets of registers: PhysRegDefs, PhysRegClobbers and TermRegs. When it sees a definition of R it adds all aliases of R to the corresponding set, so that when it needs to test for membership it only needs to test a single register, rather than worrying about aliases there too. E.g. the final candidate loop just has: unsigned Def = Candidates[i].Def; if (!PhysRegClobbers.test(Def) && ...) { to test whether register Def is multiply defined. However, there was also a shortcut in ProcessMI to make sure we didn't add candidates if we already knew that they would fail the final test. This shortcut was more pessimistic than the final one because it checked whether _any alias_ of the defined register was multiply defined. This is too conservative for targets that define register pairs. E.g. on z, R0 and R1 are sometimes used as a pair, so there is a 128-bit register that aliases both R0 and R1. If a loop used R0 and R1 independently, and the definition of R0 came first, we would be able to hoist the R0 assignment (because that used the final test quoted above) but not the R1 assignment (because that meant we had two definitions of the paired R0/R1 register and would fail the shortcut in ProcessMI). This patch just uses the same check for the ProcessMI shortcut as we use in the final candidate loop. llvm-svn: 188774	2013-08-20 09:11:13 +00:00
Michael Gottesman	dc985ef0af	[stackprotector] Small cleanup. llvm-svn: 188772	2013-08-20 08:56:28 +00:00
Michael Gottesman	76c44be14a	[stackprotector] Small Bit of computation hoisting. llvm-svn: 188771	2013-08-20 08:56:26 +00:00
Michael Gottesman	1977d15e02	[stackprotector] Added significantly longer comment to FindPotentialTailCall to make clear its relationship to llvm::isInTailCallPosition. llvm-svn: 188770	2013-08-20 08:56:23 +00:00
Michael Gottesman	62c5d714a1	Removed trailing whitespace. llvm-svn: 188769	2013-08-20 08:46:16 +00:00
Michael Gottesman	56e246b1a1	[stackprotector] Removed stale TODO. llvm-svn: 188768	2013-08-20 08:46:13 +00:00
Michael Gottesman	5e57068b7a	[stackprotector] Added support for emitting the llvm intrinsic stack protector check. rdar://13935163 llvm-svn: 188766	2013-08-20 08:36:53 +00:00
Michael Gottesman	ce0e4c263b	[stackprotector] Refactor out the end of isInTailCallPosition into the function returnTypeIsEligibleForTailCall. This allows me to use returnTypeIsEligibleForTailCall in the stack protector pass. rdar://13935163 llvm-svn: 188765	2013-08-20 08:36:50 +00:00
Michael Gottesman	f7e1203d95	Remove unused variables that crept in. llvm-svn: 188761	2013-08-20 07:17:27 +00:00
Michael Gottesman	b27f0f1f6b	Teach selectiondag how to handle the stackprotectorcheck intrinsic. Previously, generation of stack protectors was done exclusively in the pre-SelectionDAG Codegen LLVM IR Pass "Stack Protector". This necessitated splitting basic blocks at the IR level to create the success/failure basic blocks in the tail of the basic block in question. As a result of this, calls that would have qualified for the sibling call optimization were no longer eligible for optimization since said calls were no longer right in the "tail position" (i.e. the immediate predecessor of a ReturnInst instruction). Then it was noticed that since the sibling call optimization causes the callee to reuse the caller's stack, if we could delay the generation of the stack protector check until later in CodeGen after the sibling call decision was made, we get both the tail call optimization and the stack protector check! A few goals in solving this problem were: 1. Preserve the architecture independence of stack protector generation. 2. Preserve the normal IR level stack protector check for platforms like OpenBSD for which we support platform specific stack protector generation. The main problem that guided the present solution is that one can not solve this problem in an architecture independent manner at the IR level only. This is because: 1. The decision on whether or not to perform a sibling call on certain platforms (for instance i386) requires lower level information related to available registers that can not be known at the IR level. 2. Even if the previous point were not true, the decision on whether to perform a tail call is done in LowerCallTo in SelectionDAG which occurs after the Stack Protector Pass. As a result, one would need to put the relevant callinst into the stack protector check success basic block (where the return inst is placed) and then move it back later at SelectionDAG/MI time before the stack protector check if the tail call optimization failed. The MI level option was nixed immediately since it would require platform specific pattern matching. The SelectionDAG level option was nixed because SelectionDAG only processes one IR level basic block at a time implying one could not create a DAG Combine to move the callinst. To get around this problem a few things were realized: 1. While one can not handle multiple IR level basic blocks at the SelectionDAG Level, one can generate multiple machine basic blocks for one IR level basic block. This is how we handle bit tests and switches. 2. At the MI level, tail calls are represented via a special return MIInst called "tcreturn". Thus if we know the basic block in which we wish to insert the stack protector check, we get the correct behavior by always inserting the stack protector check right before the return statement. This is a "magical transformation" since no matter where the stack protector check intrinsic is, we always insert the stack protector check code at the end of the BB. Given the aforementioned constraints, the following solution was devised: 1. On platforms that do not support SelectionDAG stack protector check generation, allow for the normal IR level stack protector check generation to continue. 2. On platforms that do support SelectionDAG stack protector check generation: a. Use the IR level stack protector pass to decide if a stack protector is required/which BB we insert the stack protector check in by reusing the logic already therein. If we wish to generate a stack protector check in a basic block, we place a special IR intrinsic called llvm.stackprotectorcheck right before the BB's returninst or if there is a callinst that could potentially be sibling call optimized, before the call inst. b. Then when a BB with said intrinsic is processed, we codegen the BB normally via SelectBasicBlock. In said process, when we visit the stack protector check, we do not actually emit anything into the BB. Instead, we just initialize the stack protector descriptor class (which involves stashing information/creating the success mbbb and the failure mbb if we have not created one for this function yet) and export the guard variable that we are going to compare. c. After we finish selecting the basic block, in FinishBasicBlock if the StackProtectorDescriptor attached to the SelectionDAGBuilder is initialized, we first find a splice point in the parent basic block before the terminator and then splice the terminator of said basic block into the success basic block. Then we code-gen a new tail for the parent basic block consisting of the two loads, the comparison, and finally two branches to the success/failure basic blocks. We conclude by code-gening the failure basic block if we have not code-gened it already (all stack protector checks we generate in the same function, use the same failure basic block). llvm-svn: 188755	2013-08-20 07:00:16 +00:00
Hal Finkel	0c5c01aa4a	Add a llvm.copysign intrinsic This adds a llvm.copysign intrinsic; We already have Libfunc recognition for copysign (which is turned into the FCOPYSIGN SDAG node). In order to autovectorize calls to copysign in the loop vectorizer, we need a corresponding intrinsic as well. In addition to the expected changes to the language reference, the loop vectorizer, BasicTTI, and the SDAG builder (the intrinsic is transformed into an FCOPYSIGN node, just like the function call), this also adds FCOPYSIGN to a few lists in LegalizeVector{Ops,Types} so that vector copysigns can be expanded. In TargetLoweringBase::initActions, I've made the default action for FCOPYSIGN be Expand for vector types. This seems correct for all in-tree targets, and I think is the right thing to do because, previously, there was no way to generate vector-values FCOPYSIGN nodes (and most targets don't specify an action for vector-typed FCOPYSIGN). llvm-svn: 188728	2013-08-19 23:35:46 +00:00
Eric Christopher	574b5c8885	Use less verbose code and update comments. llvm-svn: 188711	2013-08-19 21:41:38 +00:00
Eric Christopher	7da24888dd	Turn on pubnames by default on linux. Until gdb supports the new accelerator tables we should add the pubnames section so that gdb_index can be generated from gold at link time. On darwin we already emit the accelerator tables and so don't need to worry about pubnames. llvm-svn: 188708	2013-08-19 21:07:38 +00:00
Paul Redmond	62f840f46a	Improve the widening of integral binary vector operations - split WidenVecRes_Binary into WidenVecRes_Binary and WidenVecRes_BinaryCanTrap - WidenVecRes_BinaryCanTrap preserves the original behaviour for operations that can trap - WidenVecRes_Binary simply widens the operation and improves codegen for 3-element vectors by allowing widening and promotion on x86 (matches the behaviour of unary and ternary operation widening) - use WidenVecRes_Binary for operations on integers. Reviewed by: nrotem llvm-svn: 188699	2013-08-19 20:01:35 +00:00
Hal Finkel	e4eb78188c	Add ExpandFloatOp_FCOPYSIGN to handle ppcf128-related expansions We had previously been asserting when faced with a FCOPYSIGN f64, ppcf128 node because there was no way to expand the FCOPYSIGN node. Because ppcf128 is the sum of two doubles, and the first double must have the larger magnitude, we can take the sign from the first double. As a result, in addition to fixing the crash, this is also an optimization. llvm-svn: 188655	2013-08-19 06:55:37 +00:00
David Blaikie	715528be0b	DebugInfo: don't emit zero-length names for parameters We check this in many/all other cases, just missed this one it seems. Perhaps it'd be worth unifying this so we never emit zero-length DW_AT_names. llvm-svn: 188649	2013-08-19 03:34:03 +00:00
Jim Grosbach	06c2a68125	ARM: Fix more fast-isel verifier failures. Teach the generic instruction selection helper functions to constrain the register classes of their input operands. For non-physical register references, the generic code needs to be careful not to mess that up when replacing references to result registers. As the comment indicates for MachineRegisterInfo::replaceRegWith(), it's important to call constrainRegClass() first. rdar://12594152 llvm-svn: 188593	2013-08-16 23:37:31 +00:00
David Blaikie	d4e106e39d	DebugInfo: Allow the addition of other (such as static data) members to a record type after construction Plus a type cleanup & minor fix to enumerate members of declarations. llvm-svn: 188577	2013-08-16 20:42:14 +00:00
Richard Sandiford	0dec06a28c	[SystemZ] Use SRST to implement strlen and strnlen It would also make sense to use it for memchr; I'm working on that now. llvm-svn: 188547	2013-08-16 11:41:43 +00:00
Richard Sandiford	bb83a50f57	[SystemZ] Use MVST to implement strcpy and stpcpy llvm-svn: 188546	2013-08-16 11:29:37 +00:00
Richard Sandiford	ca23271010	[SystemZ] Use CLST to implement strcmp llvm-svn: 188544	2013-08-16 11:21:54 +00:00
Richard Sandiford	e3827751e2	[SystemZ] Fix handling of 64-bit memcmp results Generalize r188163 to cope with return types other than MVT::i32, just as the existing visitMemCmpCall code did. I've split this out into a subroutine so that it can be used for other upcoming patches. I also noticed that I'd used the wrong API to record the out chain. It's a load that uses DAG.getRoot() rather than getRoot(), so the out chain should go on PendingLoads. I don't have a testcase for that because we don't do any interesting scheduling on z yet. llvm-svn: 188540	2013-08-16 10:55:47 +00:00
Bill Wendling	33fae6935a	Make a few more things const. llvm-svn: 188484	2013-08-15 20:25:44 +00:00
Bill Wendling	2d092f05b4	Use a reference instead of making an unnecessary copy. Also use 'const'. llvm-svn: 188483	2013-08-15 20:21:49 +00:00
Craig Topper	d9c2783d8f	Replace getValueType().getSimpleVT() with getSimpleValueType(). llvm-svn: 188442	2013-08-15 02:44:19 +00:00
Mark Lacey	9d8103de7a	Auto-compute live intervals on demand. When new virtual registers are created during splitting/spilling, defer creation of the live interval until we need to use the live interval. Along with the recent commits to notify LiveRangeEdit when new virtual registers are created, this makes it possible for functions like TargetInstrInfo::loadRegFromStackSlot() and TargetInstrInfo::storeRegToStackSlot() to create multiple virtual registers as part of the process of generating loads/stores for different register classes, and then have the live intervals for those new registers computed when they are needed. llvm-svn: 188437	2013-08-14 23:50:16 +00:00
Mark Lacey	f367cd9239	Notify LiveRangeEdit of new virtual registers. Add a delegate class to MachineRegisterInfo with a single virtual function, MRI_NoteNewVirtualRegister(). Update LiveRangeEdit to inherit from this delegate class and override the definition of the callback with an implementation that tracks the newly created virtual registers. llvm-svn: 188435	2013-08-14 23:50:09 +00:00
Mark Lacey	f9ea88546f	Track new virtual registers by register number. Track new virtual registers by register number, rather than by the live interval created for them. This is the first step in separating the creation of new virtual registers and new live intervals. Eventually live intervals will be created and populated on demand after the virtual registers have been created and used in instructions. llvm-svn: 188434	2013-08-14 23:50:04 +00:00
David Blaikie	d0d6fcc923	DebugInfo: Prefer references over pointers, pass by const reference for a type that will grow in the future llvm-svn: 188422	2013-08-14 22:23:05 +00:00
Jakob Stoklund Olesen	4417c7b265	Remove unnecessary parameter to RenumberValues. Patch by Matthias Braun! llvm-svn: 188393	2013-08-14 17:28:52 +00:00
Jakob Stoklund Olesen	6d13b8fd85	Improve misleading comment. Patch by Matthias Braun! llvm-svn: 188391	2013-08-14 17:28:46 +00:00
Jakob Stoklund Olesen	874c412b6f	Remove declaration of nonexistant function. Patch by Matthias Braun! llvm-svn: 188390	2013-08-14 17:28:44 +00:00
Jakob Stoklund Olesen	21914ab441	LiveIntervalUnion is not used in RegAllocBase. Patch by Matthias Braun! llvm-svn: 188389	2013-08-14 17:28:42 +00:00
Jim Grosbach	327ccc787e	DAG: Combine (and (setne X, 0), (setne X, -1)) -> (setuge (add X, 1), 2) A common idiom is to use zero and all-ones as sentinal values and to check for both in a single conditional ("x != 0 && x != (unsigned)-1"). That generates code, for i32, like: testl %edi, %edi setne %al cmpl $-1, %edi setne %cl andb %al, %cl With this transform, we generate the simpler: incl %edi cmpl $1, %edi seta %al Similar improvements for other integer sizes and on other platforms. In general, combining the two setcc instructions into one is better. rdar://14689217 llvm-svn: 188315	2013-08-13 21:30:58 +00:00
Michael Gottesman	7a8017290a	Update makeLibCall to return both the call and the chain associated with the libcall instead of just the call. This allows us to specify libcalls that return void. LowerCallTo returns a pair with the return value of the call as the first element and the chain associated with the return value as the second element. If we lower a call that has a void return value, LowerCallTo returns an SDValue with a NULL SDNode and the chain for the call. Thus makeLibCall by just returning the first value makes it impossible for you to set up the chain so that the call is not eliminated as dead code. I also updated all references to makeLibCall to reflect the new return type. llvm-svn: 188300	2013-08-13 17:54:56 +00:00
Carlo Kok	bac096a614	Output DW_AT_stmt_list dwarf debug info as DW_FORM_sec_offset instead of DW_FORM_data4 as it is a section offset (fixes the coff/dwarf debug info statement locations) llvm-svn: 188297	2013-08-13 17:46:57 +00:00
Carlo Kok	fb849b0f21	For COFF only: dwarf debug info output a label reference as a section relative item only when it's one of dw_from strp, sec_offset, ref_addr or op_call_ref instead of going by size. llvm-svn: 188296	2013-08-13 17:45:53 +00:00
Evgeniy Stepanov	b59d82ac66	Pass DIEHash::collectAttributes output argument by-pointer instead of by-value. Before this, collectAttributes() was operating on a local object. llvm-svn: 188254	2013-08-13 07:57:01 +00:00
David Majnemer	3d96acb735	[-cxx-abi microsoft] Stick zero initialized symbols into the .bss section for COFF Summary: We need to do two things: - Initialize BSSSection in MCObjectFileInfo::InitCOFFMCObjectFileInfo - Teach TargetLoweringObjectFileCOFF::SelectSectionForGlobal what to do with it This fixes PR16861. Reviewers: rnk Reviewed By: rnk CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1361 llvm-svn: 188244	2013-08-13 01:23:53 +00:00
Eric Christopher	d29614f98d	Add the start of DIE hashing for DWARF4 type units and split dwarf CUs. Currently only hashes the name of CUs and the names of any children, but it's an obvious first step to show the framework. The testcase should continue to be correct, however, as it's an empty TU. llvm-svn: 188243	2013-08-13 01:21:55 +00:00
Eric Christopher	cede3db5ea	Reflow comment. llvm-svn: 188233	2013-08-12 23:59:24 +00:00
Eric Christopher	166294f37a	Remove empty constructor. llvm-svn: 188232	2013-08-12 23:59:18 +00:00
Michael Gottesman	3923bec37b	Fixed SelectionDAGBuilder.h C++ filetype declaration to use the canonical C++ instead of c++. llvm-svn: 188203	2013-08-12 21:02:02 +00:00
Michael Gottesman	f1d3b7c22e	Fixed another place in CodeGen where we had a typo in our editor C++ filetype declaration. llvm-svn: 188202	2013-08-12 20:52:06 +00:00
Michael Gottesman	1649a877e1	[branchfolding] Fix typo in C++ editor declaration. llvm-svn: 188201	2013-08-12 20:49:27 +00:00
Eric Christopher	60eb7696a9	Move the addition of the dwo_id as late as possible after everything has been finalized except for sizes and offsets. Update test accordingly. llvm-svn: 188199	2013-08-12 20:27:48 +00:00
Michael Gottesman	7dce16f69d	[stackprotector] Add in the stackprotector libcall. We support this libcall on all platforms except for OpenBSD (See lib/Codegen/StackProtector.cpp). llvm-svn: 188193	2013-08-12 18:45:38 +00:00
Richard Sandiford	564681c88d	[SystemZ] Use CLC and IPM to implement memcmp For now this is restricted to fixed-length comparisons with a length in the range [1, 256], as for memcpy() and MVC. llvm-svn: 188163	2013-08-12 10:28:10 +00:00
Tim Northover	707d68f082	Allow compatible extension attributes for tail calls If the tail-callee and caller give the same bits via the same signext/zeroext attribute then a tail-call should be allowed, since the extension has already been done by the callee. llvm-svn: 188159	2013-08-12 09:45:46 +00:00
Michael Gottesman	8afcf3a408	[stackprotector] Simplify SP Pass so that we emit different fail basic blocks for each fail condition. This patch decouples the stack protector pass so that we can support stack protector implementations that do not use the IR level generated stack protector fail basic block. No codesize increase is caused by this change since the MI level tail merge pass properly merges together the fail condition blocks (see the updated test). llvm-svn: 188105	2013-08-09 21:26:18 +00:00
Benjamin Kramer	df03449a0a	Make helper static and fix formatting. llvm-svn: 188074	2013-08-09 14:44:41 +00:00
Craig Topper	0ecb26a79e	Change asserts at the top of getVectorShuffle to check that LHS and RHS have the same type as the result. Previously the asserts were only checking that RHS and LHS were the same type and had the same element type as the result. All downstream code for ISD::VECTOR_SHUFFLE requires the types to be the same. Also removed one unnecessary check of matched element counts that was present in the code. llvm-svn: 188051	2013-08-09 04:37:24 +00:00
Hal Finkel	8ec43c6a0f	Set ISD::FROUND to Expand by default for all types For most libm ISD nodes, TargetLoweringBase::initActions sets the default scalar-type action to Expand, and leaves the vector-type action default as Legal. This is not appropriate for the new ISD::FROUND node (which no backend but PowerPC handles explicitly). Fixes PR16842. llvm-svn: 188048	2013-08-09 04:13:44 +00:00
Eric Christopher	ac886fe0f8	Update the CMake build files. llvm-svn: 188030	2013-08-08 23:51:31 +00:00
Eric Christopher	4573198b30	Move hash computation code into a separate class and file. No functional change intended. llvm-svn: 188028	2013-08-08 23:45:55 +00:00
Arnold Schwaighofer	c31c2de18b	Revert "Reapply r185872 now that the address sanitizer has been changed to support this." This reverts commit r187939. It broke an O0 build of a spec benchmark. llvm-svn: 188012	2013-08-08 21:04:16 +00:00
Eric Christopher	056b647d1f	For DW_TAG_template_type_parameter the actual passed in type could be void and therefore not have a type entry. Only add the type if it is non-void and provide a testcase. llvm-svn: 187966	2013-08-08 08:09:43 +00:00
Craig Topper	9a39b07a60	Remove AllUndef check from one of the loops in getVectorShuffle. It was already handled by the 'AllLHS && AllRHS' check after the previous loop. llvm-svn: 187965	2013-08-08 08:03:12 +00:00
Eric Christopher	49e17b2049	The conversion to bool is fine here, no need to check isType. llvm-svn: 187964	2013-08-08 07:40:42 +00:00
Eric Christopher	0df08e2ff9	Make sure that if we're going to attempt to add a type to a DIE that the type exists. Fix up cases where we weren't checking for optional types and add an assert to addType to make sure we catch this in the future. Fix up a testcase that was using the tag for DW_TAG_array_type when it meant DW_TAG_enumeration_type. llvm-svn: 187963	2013-08-08 07:40:37 +00:00
Eric Christopher	afb2c4114e	Change variable name and reflow formatting. llvm-svn: 187962	2013-08-08 07:40:31 +00:00
Craig Topper	309dfefb6f	Optimize mask generation for one of the DAG combiner shufflevector cases. llvm-svn: 187961	2013-08-08 07:38:55 +00:00
David Majnemer	f76d6b3712	Revert "coff also doesn't have a ReadOnlySection yet, (!)" This reverts commit r77814. We were sticking global constants in the .data section instead of in the .rdata section when emitting for COFF. This fixes PR16831. llvm-svn: 187956	2013-08-08 01:50:52 +00:00
Eric Christopher	d25f7fc4ae	Reflow for loop. llvm-svn: 187954	2013-08-08 01:41:05 +00:00
Eric Christopher	31b0576b01	Be more rigorous about the sizes of forms and attributes. llvm-svn: 187953	2013-08-08 01:41:00 +00:00
Bill Wendling	b80f9791e4	Reapply r185872 now that the address sanitizer has been changed to support this. Original commit message: Stop emitting weak symbols into the "coal" sections. The Mach-O linker has been able to support the weak-def bit on any symbol for quite a while now. The compiler however continued to place these symbols into a "coal" section, which required the linker to map them back to the base section name. Replace the sections like this: __TEXT/__textcoal_nt instead use __TEXT/__text __TEXT/__const_coal instead use __TEXT/__const __DATA/__datacoal_nt instead use __DATA/__data <rdar://problem/14265330> llvm-svn: 187939	2013-08-07 23:42:09 +00:00
Hal Finkel	171817ee8a	Add ISD::FROUND for libm round() All libm floating-point rounding functions, except for round(), had their own ISD nodes. Recent PowerPC cores have an instruction for round(), and so here I'm adding ISD::FROUND so that round() can be custom lowered as well. For the most part, this is straightforward. I've added an intrinsic and a matching ISD node just like those for nearbyint() and friends. The SelectionDAG pattern I've named frnd (because ISD::FP_ROUND has already claimed fround). This will be used by the PowerPC backend in a follow-up commit. llvm-svn: 187926	2013-08-07 22:49:12 +00:00
Eric Christopher	7af8baf678	Using the integrated assembler we'd fail to change section to the .tbss section for zerofill thread locals. Make sure we do this before emitting the zerofills. Fixes PR15972. llvm-svn: 187913	2013-08-07 21:13:06 +00:00
Andrew Trick	2f7667e018	Confusing comment typo. llvm-svn: 187895	2013-08-07 17:20:32 +00:00
Eric Christopher	341770d7ea	Remove some parens. No functional change. llvm-svn: 187872	2013-08-07 08:35:10 +00:00
Eric Christopher	8552e22b07	Add a way to grab a particular attribute out of a DIE. Use it when we're looking for a string in particular. Update comments as well. llvm-svn: 187844	2013-08-07 01:18:33 +00:00
Eric Christopher	af15f8dd5a	Move somewhat messy conditional out of line. No functional change. llvm-svn: 187843	2013-08-07 01:18:24 +00:00
Arnold Schwaighofer	a7cd6bf3bb	LoopVectorize: Allow vectorization of loops with lifetime markers Patch by Marc Jessome! llvm-svn: 187825	2013-08-06 22:37:52 +00:00
Tim Northover	a4415854db	Refactor isInTailCallPosition handling This change came about primarily because of two issues in the existing code. Niether of: define i64 @test1(i64 %val) { %in = trunc i64 %val to i32 tail call i32 @ret32(i32 returned %in) ret i64 %val } define i64 @test2(i64 %val) { tail call i32 @ret32(i32 returned undef) ret i32 42 } should be tail calls, and the function sameNoopInput is responsible. The main problem is that it is completely symmetric in the "tail call" and "ret" value, but in reality different things are allowed on each side. For these cases: 1. Any truncation should lead to a larger value being generated by "tail call" than needed by "ret". 2. Undef should only be allowed as a source for ret, not as a result of the call. Along the way I noticed that a mismatch between what this function treats as a valid truncation and what the backends see can lead to invalid calls as well (see x86-32 test case). This patch refactors the code so that instead of being based primarily on values which it recurses into when necessary, it starts by inspecting the type and considers each fundamental slot that the backend will see in turn. For example, given a pathological function that returned {{}, {{}, i32, {}}, i32} we would consider each "real" i32 in turn, and ask if it passes through unchanged. This is much closer to what the backend sees as a result of ComputeValueVTs. Aside from the bug fixes, this eliminates the recursion that's going on and, I believe, makes the bulk of the code significantly easier to understand. The trade-off is the nasty iterators needed to find the real types inside a returned value. llvm-svn: 187787	2013-08-06 09:12:35 +00:00
NAKAMURA Takumi	e359e85649	AsmPrinter/CMakeLists.txt: Add explicit dependency to intrinsics_gen here. llvm-svn: 187778	2013-08-06 05:56:39 +00:00
Eric Christopher	0062f2edc0	Recommit previous cleanup with a fix for c++98 ambiguity. llvm-svn: 187752	2013-08-05 22:32:28 +00:00
Tom Stellard	d42c594960	TargetLowering: Add getVectorIdxTy() function v2 This virtual function can be implemented by targets to specify the type to use for the index operand of INSERT_VECTOR_ELT, EXTRACT_VECTOR_ELT, INSERT_SUBVECTOR, EXTRACT_SUBVECTOR. The default implementation returns the result from TargetLowering::getPointerTy() The previous code was using TargetLowering::getPointerTy() for vector indices, because this is guaranteed to be legal on all targets. However, using TargetLowering::getPointerTy() can be a problem for targets with pointer sizes that differ across address spaces. On such targets, when vectors need to be loaded or stored to an address space other than the default 'zero' address space (which is the address space assumed by TargetLowering::getPointerTy()), having an index that is a different size than the pointer can lead to inefficient pointer calculations, (e.g. 64-bit adds for a 32-bit address space). There is no intended functionality change with this patch. llvm-svn: 187748	2013-08-05 22:22:01 +00:00
Eric Christopher	432c99af0b	Revert "Use existing builtin hashing functions to make this routine more" This reverts commit r187745. llvm-svn: 187747	2013-08-05 22:07:30 +00:00
Eric Christopher	d728355a1c	Use existing builtin hashing functions to make this routine more simple. llvm-svn: 187745	2013-08-05 22:00:50 +00:00
Eric Christopher	0369ad7053	Change parent hashing algorithm to be non-recursive and elaborate greatly on many comments in the code. llvm-svn: 187742	2013-08-05 21:40:57 +00:00
Benjamin Kramer	483b9fbddb	Don't leak passes if added outside of the area determined by Started/Stopped flags. llvm-svn: 187722	2013-08-05 11:11:11 +00:00
Carlo Kok	4382da983a	Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB (with fix for 64bits msvc) llvm-svn: 187656	2013-08-02 16:14:15 +00:00
NAKAMURA Takumi	6fda3b4b86	Revert r187597, "Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB." It broke x86_64-win32 builder in llvm/test/DebugInfo. llvm-svn: 187642	2013-08-02 03:46:05 +00:00
Bill Wendling	a5c536e1ee	Use function attributes to indicate that we don't want to realign the stack. Function attributes are the future! So just query whether we want to realign the stack directly from the function instead of through a random target options structure. llvm-svn: 187618	2013-08-01 21:42:05 +00:00
David Blaikie	a1ae0e6ecb	DebugInfo: Emit definitions for types with no members. The absence of members was a poor/incorrect proxy for "is definition". llvm-svn: 187607	2013-08-01 20:30:22 +00:00
Carlo Kok	afcc62024e	Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB. fixes Bug 16249 - LLVM generates broken debug info on Windows llvm-svn: 187597	2013-08-01 18:38:14 +00:00
Eric Christopher	e6656ac870	Fix crashing on invalid inline asm with matching constraints. For a testcase like the following: typedef unsigned long uint64_t; typedef struct { uint64_t lo; uint64_t hi; } blob128_t; void add_128_to_128(const blob128_t in, blob128_t res) { asm ("PAND %1, %0" : "+Q"(res) : "Q"(in)); } where we'll fail to allocate the register for the output constraint, our matching input constraint will not find a register to match, and could try to search past the end of the current operands array. On the idea that we'd like to attempt to keep compilation going to find more errors in the module, change the error cases when we're visiting inline asm IR to return immediately and avoid trying to create a node in the DAG. This leaves us with only a single error message per inline asm instruction, but allows us to safely keep going in the general case. llvm-svn: 187470	2013-07-31 01:26:24 +00:00
Eric Christopher	029af15086	Reflow this to be easier to read. llvm-svn: 187459	2013-07-30 22:50:44 +00:00
Andrew Trick	c7934b3e37	Down-scale slot index distance to save bits. llvm-svn: 187438	2013-07-30 19:59:19 +00:00
Andrew Trick	9c17eab761	MI Sched: Track live-thru registers. When registers must be live throughout the scheduling region, increase the limit for the register class. Once we exceed the original limit, they will be spilled, and there's no point further reducing pressure. This isn't a perfect heuristics but avoids a situation where the scheduler could become trapped by trying to achieve the impossible. llvm-svn: 187436	2013-07-30 19:59:12 +00:00
Andrew Trick	d9761776bc	MI Sched fix: assert "Disconnected LRG within the scheduling region." llvm-svn: 187435	2013-07-30 19:59:08 +00:00
Quentin Colombet	6bf4baa408	[DAGCombiner] insert_vector_elt: Avoid building a vector twice. This patch prevents the following combine when the input vector is used more than once. insert_vector_elt (build_vector elt0, ..., eltN), NewEltIdx, idx => build_vector elt0, ..., NewEltIdx, ..., eltN The reasons are: - Building a vector may be expensive, so try to reuse the existing part of a vector instead of creating a new one (think big vectors). - elt0 to eltN now have two users instead of one. This may prevent some other optimizations. llvm-svn: 187396	2013-07-30 00:24:09 +00:00
Eric Christopher	e414ece79a	Fix a truly egregious thinko in anonymous namespace check, update testcase to make sure we generate debug info for walrus by adding a non-trivial constructor and verify that we don't emit an ODR signature for the type. llvm-svn: 187393	2013-07-29 23:53:08 +00:00
Eric Christopher	d853ea3142	Make sure we don't emit an ODR hash for types with no name and make sure the comments for each testcase are a bit easier to distinguish. llvm-svn: 187392	2013-07-29 23:53:05 +00:00
Eric Christopher	f8542ec305	Elaborate a bit on the type unit and ODR conditional code. llvm-svn: 187385	2013-07-29 22:24:32 +00:00
Nico Rieck	7fdaee8f15	Use proper section suffix for COFF weak symbols 32-bit symbols have "_" as global prefix, but when forming the name of COMDAT sections this prefix is ignored. The current behavior assumes that this prefix is always present which is not the case for 64-bit and names are truncated. llvm-svn: 187356	2013-07-29 13:58:39 +00:00
Benjamin Kramer	409afcf174	DwarfDebug: MD5 is always little endian, bswap on big endian platforms. This makes LLVM emit the same signature regardless of host and target endianess. llvm-svn: 187304	2013-07-27 14:14:43 +00:00
Chandler Carruth	2a1c0d2c03	Fix a memory leak in the debug emission by simply not allocating memory. There doesn't appear to be any reason to put this variable on the heap. I'm suspicious of the LexicalScope above that we stuff in a map and then delete afterward, but I'm just trying to get the valgrind bot clean. llvm-svn: 187301	2013-07-27 11:09:58 +00:00
Nick Lewycky	0b68245ec8	Reimplement isPotentiallyReachable to make nocapture deduction much stronger. Adds unit tests for it too. Split BasicBlockUtils into an analysis-half and a transforms-half, and put the analysis bits into a new Analysis/CFG.{h,cpp}. Promote isPotentiallyReachable into llvm::isPotentiallyReachable and move it into Analysis/CFG. llvm-svn: 187283	2013-07-27 01:24:00 +00:00
Tom Stellard	8b1e021e85	SimplifyCFG: Use parallel-and and parallel-or mode to consolidate branch conditions Merge consecutive if-regions if they contain identical statements. Both transformations reduce number of branches. The transformation is guarded by a target-hook, and is currently enabled only for +R600, but the correctness has been tested on X86 target using a variety of CPU benchmarks. Patch by: Mei Ye llvm-svn: 187278	2013-07-27 00:01:07 +00:00
Eric Christopher	219fb91499	Remove addLetterToHash, no functional change. llvm-svn: 187245	2013-07-26 21:07:18 +00:00
Eric Christopher	67646438c9	Add preliminary support for hashing DIEs and breaking them into type units. Initially this support is used in the computation of an ODR checker for C++. For now we're attaching it to the DIE, but in the future it will be attached to the type unit. This also starts breaking out types into the separation for type units, but without actually splitting the DIEs. In preparation for hashing the DIEs this adds a DIEString type that contains a StringRef with the string contained at the label. llvm-svn: 187213	2013-07-26 17:02:41 +00:00
Justin Holewinski	d3f2035a3c	Add a target legalize hook for SplitVectorOperand (again) CustomLowerNode was not being called during SplitVectorOperand, meaning custom legalization could not be used by targets. This also adds a test case for NVPTX that depends on this custom legalization. Differential Revision: http://llvm-reviews.chandlerc.com/D1195 Attempt to fix the buildbots by making the X86 test I just added platform independent llvm-svn: 187202	2013-07-26 13:28:29 +00:00
Rafael Espindola	1d812728cc	Revert "Add a target legalize hook for SplitVectorOperand" This reverts commit 187198. It broke the bots. The soft float test probably needs a -triple because of name differences. On the hard float test I am getting a "roundss $1, %xmm0, %xmm0", instead of "vroundss $1, %xmm0, %xmm0, %xmm0". llvm-svn: 187201	2013-07-26 13:18:16 +00:00
Justin Holewinski	f848a24e50	Add a target legalize hook for SplitVectorOperand CustomLowerNode was not being called during SplitVectorOperand, meaning custom legalization could not be used by targets. This also adds a test case for NVPTX that depends on this custom legalization. Differential Revision: http://llvm-reviews.chandlerc.com/D1195 llvm-svn: 187198	2013-07-26 12:46:39 +00:00
Andrew Trick	f4b1ee3492	RegAllocGreedy comment. llvm-svn: 187141	2013-07-25 18:35:22 +00:00
Andrew Trick	8bb0a251fd	Evict local live ranges if they can be reassigned. The previous change to local live range allocation also suppressed eviction of local ranges. In rare cases, this could result in more expensive register choices. This commit actually revives a feature that I added long ago: check if live ranges can be reassigned before eviction. But now it only happens in rare cases of evicting a local live range because another local live range wants a cheaper register. The benefit is improved code size for some benchmarks on x86 and armv7. I measured no significant compile time increase and performance changes are noise. llvm-svn: 187140	2013-07-25 18:35:19 +00:00
Andrew Trick	8485257d6d	Allocate local registers in order for optimal coloring. Also avoid locals evicting locals just because they want a cheaper register. Problem: MI Sched knows exactly how many registers we have and assumes they can be colored. In cases where we have large blocks, usually from unrolled loops, greedy coloring fails. This is a source of "regressions" from the MI Scheduler on x86. I noticed this issue on x86 where we have long chains of two-address defs in the same live range. It's easy to see this in matrix multiplication benchmarks like IRSmk and even the unit test misched-matmul.ll. A fundamental difference between the LLVM register allocator and conventional graph coloring is that in our model a live range can't discover its neighbors, it can only verify its neighbors. That's why we initially went for greedy coloring and added eviction to deal with the hard cases. However, for singly defined and two-address live ranges, we can optimally color without visiting neighbors simply by processing the live ranges in instruction order. Other beneficial side effects: It is much easier to understand and debug regalloc for large blocks when the live ranges are allocated in order. Yes, global allocation is still very confusing, but it's nice to be able to comprehend what happened locally. Heuristics could be added to bias register assignment based on instruction locality (think late register pairing, banks...). Intuituvely this will make some test cases that are on the threshold of register pressure more stable. llvm-svn: 187139	2013-07-25 18:35:14 +00:00
Adrian Prantl	e4daf52a63	typo. llvm-svn: 187135	2013-07-25 17:52:30 +00:00
Andrew Trick	401b6959ae	MI Sched: Register pressure heuristics. Consider which set is being increased or decreased before comparing. llvm-svn: 187110	2013-07-25 07:26:35 +00:00
Andrew Trick	27e5fea665	MI Sched: track register pressure by importance of the set, not weight of the units. llvm-svn: 187109	2013-07-25 07:26:32 +00:00
Andrew Trick	9706496b0d	Dump LIS before regalloc. MI sched changes them. llvm-svn: 187107	2013-07-25 07:26:26 +00:00
Bill Wendling	440e9d81bf	Replace the "NoFramePointerElimNonLeaf" target option with a function attribute. There's no need to specify a flag to omit frame pointer elimination on non-leaf nodes...(Honestly, I can't parse that option out.) Use the function attribute stuff instead. llvm-svn: 187093	2013-07-25 00:34:29 +00:00
Quentin Colombet	bdab227e53	Fix a bug in IfConverter with nested predicates. Prior to this patch, IfConverter may widen the cases where a sequence of instructions were executed because of the way it uses nested predicates. This result in incorrect execution. For instance, Let A be a basic block that flows conditionally into B and B be a predicated block. B can be predicated with A.BrToBPredicate into A iff B.Predicate is less "permissive" than A.BrToBPredicate, i.e., iff A.BrToBPredicate subsumes B.Predicate. The IfConverter was checking the opposite: B.Predicate subsumes A.BrToBPredicate. <rdar://problem/14379453> llvm-svn: 187071	2013-07-24 20:20:37 +00:00
Tom Stellard	c54731aa9d	DAGCombiner: Pass the correct type to TargetLowering::isF(Abs\|Neg)Free This commit also implements these functions for R600 and removes a test case that was relying on the buggy behavior. llvm-svn: 187007	2013-07-23 23:55:03 +00:00
Eric Christopher	7f2b551d4e	Reformat options. llvm-svn: 186994	2013-07-23 22:16:41 +00:00
Michael Gottesman	c0659fad7f	[stackprotector] Changed isNoopBitcast/sameNoopInput to take TargetLoweringBase instead of TargetLowering. Both functions only use functionality from TargetLoweringBase. rdar://13935163 llvm-svn: 186874	2013-07-22 21:05:47 +00:00
Michael Gottesman	a6188f9fcd	[stackprotector] Refactored ssp prologue creation code into its own helper function. No functionality change. rdar://13935163 llvm-svn: 186868	2013-07-22 20:44:11 +00:00
Bill Wendling	c02a0aabb5	Recommit r186217 with testcase fix: Use the function attributes to pass along the stack protector buffer size. Now that we have robust function attributes, don't use a command line option to specify the stack protecto buffer size. llvm-svn: 186863	2013-07-22 20:15:21 +00:00
Eric Christopher	19d153261f	Formatting. llvm-svn: 186851	2013-07-22 18:26:15 +00:00
Michael Gottesman	da6365f4ed	Added missing - in the header of PrologEpilogInserter.h so that editors properly realize it is a c++ header and not a c header. llvm-svn: 186801	2013-07-22 00:52:55 +00:00
Alexey Samsonov	64c391dbe4	Fix uninitialized memory read found by MemorySanitizer: always set output parameter of ConvergingScheduler::SchedBoundary::getOtherResourceCount llvm-svn: 186658	2013-07-19 08:55:18 +00:00
Andrew Trick	b13ef17a14	MI Sched: Update the way resources are tracked so the current heuristics make more sense. llvm-svn: 186632	2013-07-19 00:20:07 +00:00
Michael Gottesman	f87a6ae65f	Add -- C++ -- to InstrEmitter.h. llvm-svn: 186527	2013-07-17 18:53:29 +00:00
Craig Topper	9fdc70e846	Make constant string pointer into an array to remove a pointer lookup for every access. llvm-svn: 186482	2013-07-17 03:11:32 +00:00
Manman Ren	8bfde8917e	Add getModuleFlag(StringRef Key) to query a module flag given Key. No functionality change. llvm-svn: 186470	2013-07-16 23:21:16 +00:00
Rafael Espindola	6d35481c94	Add a wrapper for open. This centralizes the handling of O_BINARY and opens the way for hiding more differences (like how open behaves with directories). llvm-svn: 186447	2013-07-16 19:44:17 +00:00
Jakob Stoklund Olesen	efeb3a1969	Remove floats from live range splitting costs. These floats all represented block frequencies anyway, so just use the BlockFrequency class directly. Some floating point computations remain in tryLocalSplit(). They are estimating spill weights which are still floats. llvm-svn: 186435	2013-07-16 18:26:18 +00:00
Jakob Stoklund Olesen	c5454ff046	Reapply r185393. Original commit message: Remove floating point computations from SpillPlacement.cpp. Patch by Benjamin Kramer! Use the BlockFrequency class instead of floats in the Hopfield network computations. This rescales the node Bias field from a [-2;2] float range to two block frequencies BiasN and BiasP pulling in opposite directions. This construct has a more predictable behavior when block frequencies saturate. The per-node scaling factors are no longer necessary, assuming the block frequencies around a bundle are consistent. This patch can cause the register allocator to make different spilling decisions. The differences should be small. llvm-svn: 186434	2013-07-16 18:26:15 +00:00
Craig Topper	d3a34f81f8	Add 'const' qualifiers to static const char* variables. llvm-svn: 186371	2013-07-16 01:17:10 +00:00
Manman Ren	b827123cf7	PEI: Support for non-zero SPAdj at beginning of a basic block. We can have a FrameSetup in one basic block and the matching FrameDestroy in a different basic block when we have struct byval. In that case, SPAdj is not zero at beginning of the basic block. Modify PEI to correctly set SPAdj at beginning of each basic block using DFS traversal. We used to assume SPAdj is 0 at beginning of each basic block. PEI had an assert SPAdjCount \|\| SPAdj == 0. If we have a Destroy <n> followed by a Setup <m>, PEI will assert failure. We can add an extra condition to make sure the pairs are matched: The pairs start with a FrameSetup. But since we are doing a much better job in the verifier, this patch removes the check in PEI. PR16393 llvm-svn: 186364	2013-07-15 23:47:29 +00:00
Manman Ren	aa6875b1f9	Machine Verifier: verify FrameSetup and FrameDestroy 1> on every path through the CFG, a FrameSetup <n> is always followed by a FrameDestroy <n> and a FrameDestroy is always followed by a FrameSetup. 2> stack adjustments are identical on all CFG edges to a merge point. 3> frame is destroyed at end of a return block. PR16393 llvm-svn: 186350	2013-07-15 21:26:31 +00:00
Hal Finkel	2f5e8e3d95	Remove invalid assert in DAGTypeLegalizer::RemapValue There is a comment at the top of DAGTypeLegalizer::PerformExpensiveChecks which, in part, says: // Note that these invariants may not hold momentarily when processing a node: // the node being processed may be put in a map before being marked Processed. Unfortunately, this assert would be valid only if the above-mentioned invariant held unconditionally. This was causing llc to assert when, in fact, everything was fine. Thanks to Richard Sandiford for investigating this issue! Fixes PR16562. llvm-svn: 186338	2013-07-15 18:57:05 +00:00
Craig Topper	06b3b6651e	Add 'const' qualifier to some arrays. llvm-svn: 186312	2013-07-15 08:02:13 +00:00
Eric Christopher	7980b957cc	Clarify comments. llvm-svn: 186297	2013-07-14 22:23:54 +00:00
Eric Christopher	666dc635c7	Collapse temporary variable into call. llvm-svn: 186295	2013-07-14 21:46:51 +00:00
Tobias Grosser	84f34be98e	Fix build by replacing '>>' with '> >' llvm-svn: 186276	2013-07-14 06:12:01 +00:00
Craig Topper	b94011fd28	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186274	2013-07-14 04:42:23 +00:00
Craig Topper	e0b711864c	Pass SmallVector by const reference instead of by value. llvm-svn: 186243	2013-07-13 07:43:40 +00:00
Chandler Carruth	86e60a36b5	Revert commit r186217 -- this is breaking bots: http://lab.llvm.org:8013/builders/clang-x86_64-darwin11-nobootstrap-RAincremental/builds/4328 Original commit log: Use the function attributes to pass along the stack protector buffer size. llvm-svn: 186234	2013-07-13 01:00:17 +00:00
Bill Wendling	4f73ff4711	Use the function attributes to pass along the stack protector buffer size. Now that we have robust function attributes, don't use a command line option to specify the stack protecto buffer size. llvm-svn: 186217	2013-07-12 22:25:20 +00:00
Arnold Schwaighofer	9da9a43af8	TargetTransformInfo: address calculation parameter for gather/scather Address calculation for gather/scather in vectorized code can incur a significant cost making vectorization unbeneficial. Add infrastructure to add cost. Tests and cost model for targets will be in follow-up commits. radar://14351991 llvm-svn: 186187	2013-07-12 19:16:02 +00:00
Manman Ren	30d6865a23	PEI: refactor replaceFrameIndices(MF) to call replaceFrameIndices(BB). replaceFrameIndices(MF) will iterate over the BBs and call replaceFrameIndices(BB). No functionality change. llvm-svn: 186141	2013-07-12 00:37:01 +00:00
Craig Topper	2cd5ff8003	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186098	2013-07-11 16:22:38 +00:00
Hal Finkel	6161b9405f	Initialize AsmPrinter::MF in the constructor MF is normally initialized in AsmPrinter::SetupMachineFunction, but if the file contains only globals (no functions), then we need this to be initialized because, when encountering an error, lowerConstant() references it. This should fix the non-deterministic failures of test/CodeGen/X86/nonconst-static-iv.ll, etc. llvm-svn: 186068	2013-07-11 06:41:14 +00:00
Hal Finkel	743b194084	RegScavenger should not exclude undef uses When computing currently-live registers, the register scavenger excludes undef uses. As a result, undef uses are ignored when computing the restore points of registers spilled into the emergency slots. While the register scavenger normally excludes from consideration, when scavenging, registers used by the current instruction, we need to not exclude undef uses. Otherwise, we might end up requiring more emergency spill slots than we have (in the case where the undef use is the currently-spilled register). Another bug found by llvm-stress. llvm-svn: 186067	2013-07-11 05:55:57 +00:00
Stephen Lin	10947502e5	Remove trailing whitespac llvm-svn: 186032	2013-07-10 20:47:39 +00:00
Adrian Prantl	d3f6fe51ab	Use the appropriate unsigned int type for the offset. llvm-svn: 186015	2013-07-10 16:56:52 +00:00
Adrian Prantl	c31ec1c948	Safeguard DBG_VALUE handling. Unbreaks the ASAN buildbot. llvm-svn: 186014	2013-07-10 16:56:47 +00:00
Adrian Prantl	a1ffd1a450	Un-break the buildbot by tweaking the indirection flag. Pulled in a testcase from the debuginfo-test suite. llvm-svn: 185993	2013-07-10 01:53:37 +00:00
Adrian Prantl	facc9f4e3e	Document a known limitation of the status quo. llvm-svn: 185992	2013-07-10 01:53:30 +00:00
Eric Christopher	93ebdd727f	Fix comment. llvm-svn: 185984	2013-07-09 23:48:45 +00:00
Adrian Prantl	19942885ba	Typo. llvm-svn: 185971	2013-07-09 21:44:06 +00:00
Adrian Prantl	418d1d1ea9	Reapply an improved version of r180816/180817. Change the informal convention of DBG_VALUE machine instructions so that we can express a register-indirect address with an offset of 0. The old convention was that a DBG_VALUE is a register-indirect value if the offset (operand 1) is nonzero. The new convention is that a DBG_VALUE is register-indirect if the first operand is a register and the second operand is an immediate. For plain register values the combination reg, reg is used. MachineInstrBuilder::BuildMI knows how to build the new DBG_VALUES. rdar://problem/13658587 llvm-svn: 185966	2013-07-09 20:28:37 +00:00
Hal Finkel	e4dd5c29f0	WidenVecRes_BUILD_VECTOR must use the first operand's type Because integer BUILD_VECTOR operands may have a larger type than the result's vector element type, and all operands must have the same type, when widening a BUILD_VECTOR node by adding UNDEFs, we cannot use the vector element type, but rather must use the type of the existing operands. Another bug found by llvm-stress. llvm-svn: 185960	2013-07-09 18:55:10 +00:00
Stephen Lin	73de7bf5de	AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in order to resolve the following issues with fmuladd (i.e. optional FMA) intrinsics: 1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd intrinsics even if the subtarget does not support FMA instructions, leading to laughably bad code generation in some situations. 2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128, resulting in a call to a software fp128 FMA implementation. 3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize, etc. to types that support hardware FMAs. The function has also been slightly renamed for consistency and to force a merge/build conflict for any out-of-tree target implementing it. To resolve, see comments and fixed in-tree examples. llvm-svn: 185956	2013-07-09 18:16:56 +00:00
Hal Finkel	6c29bd9088	DAGCombine tryFoldToZero cannot create illegal types after type legalization When folding sub x, x (and other similar constructs), where x is a vector, the result is a vector of zeros. After type legalization, make sure that the input zero elements have a legal type. This type may be larger than the result's vector element type. This was another bug found by llvm-stress. llvm-svn: 185949	2013-07-09 17:02:45 +00:00
Alexander Potapenko	8d2d79d05f	Revert r185872 - "Stop emitting weak symbols into the "coal" sections" This patch broke `make check-asan` on Mac, causing ld warnings like the following one: ld: warning: direct access in __GLOBAL__I_a to global weak symbol ___asan_mapping_scale means the weak symbol cannot be overridden at runtime. This was likely caused by different translation units being compiled with different visibility settings. The resulting test binaries crashed with incorrect ASan warnings. llvm-svn: 185923	2013-07-09 10:00:16 +00:00
Stephen Lin	8e8424eb17	Style fixes: remove unnecessary braces for one-statement if blocks, no else after return, etc. No funcionality change. llvm-svn: 185893	2013-07-09 00:44:49 +00:00
Bill Wendling	0176708e85	Stop emitting weak symbols into the "coal" sections. The Mach-O linker has been able to support the weak-def bit on any symbol for quite a while now. The compiler however continued to place these symbols into a "coal" section, which required the linker to map them back to the base section name. Replace the sections like this: __TEXT/__textcoal_nt instead use __TEXT/__text __TEXT/__const_coal instead use __TEXT/__const __DATA/__datacoal_nt instead use __DATA/__data <rdar://problem/14265330> llvm-svn: 185872	2013-07-08 21:34:52 +00:00
Eric Christopher	aba20dd603	Update comment to avoid mentioning DbgValues which is an instance variable later in the class. llvm-svn: 185866	2013-07-08 21:16:18 +00:00
Manman Ren	7504ed4255	Debug Info: clean up usage of Verify. No functionality change. It should suffice to check the type of a debug info metadata, instead of calling Verify. llvm-svn: 185847	2013-07-08 18:33:29 +00:00
David Blaikie	ce1960f936	DebugInfo: Correct comment & re-format a nearby loop llvm-svn: 185844	2013-07-08 17:51:28 +00:00
David Blaikie	ac569a656f	DebugInfo: Simplify Address Pool index handling. Since the pool indexes are necessarily sequential and contiguous, just insert things in the right place rather than having to sort the sequence after the fact. No functionality change. llvm-svn: 185842	2013-07-08 17:33:10 +00:00
Hal Finkel	12493bb7d5	Improve the comment from r185794 (re: PromoteIntRes_BUILD_VECTOR) In response to Duncan's review, I believe that the original comment was not as clear as it could be. Hopefully, this is better. llvm-svn: 185824	2013-07-08 14:40:04 +00:00
Hal Finkel	8cb9a0e1d3	Fix PromoteIntRes_BUILD_VECTOR crash with i1 vectors This fixes a bug (found by llvm-stress) in DAGTypeLegalizer::PromoteIntRes_BUILD_VECTOR where it assumed that the result type would always be larger than the original operands. This is not always true, however, with boolean vectors. For example, promoting a node of type v8i1 (where the operands will be of type i32, the type to which i1 is promoted) will yield a node with a result vector element type of i16 (and operands of type i32). As a result, we cannot blindly assume that we can ANY_EXTEND the operands to the result type. llvm-svn: 185794	2013-07-08 06:16:58 +00:00
Kai Nacke	42097301f6	Revert: Emit personality function and Dwarf EH data for Win64 SEH. llvm-svn: 185788	2013-07-08 04:43:23 +00:00
Hal Finkel	ec474f28e3	Add the nearbyint -> FNEARBYINT mapping to BasicTargetTransformInfo This fixes an oversight that Intrinsic::nearbyint was not being mapped to ISD::FNEARBYINT (thus fixing the over-optimistic cost we were assigning to nearbyint calls for some targets). llvm-svn: 185783	2013-07-08 03:24:07 +00:00
Stephen Lin	cfe7f352c7	Remove trailing whitespace from SelectionDAG/*.cpp llvm-svn: 185780	2013-07-08 00:37:03 +00:00
Stephen Lin	6d715e8699	SelectionDAGBuilder: style fixes (add space between end parentheses and open brace) llvm-svn: 185768	2013-07-06 21:44:25 +00:00
Kai Nacke	c947ad2a2d	Emit personality function and Dwarf EH data for Win64 SEH. Obviously the personality function should be emitted as language handler instead of the hard coded _GCC_specific_handler. The language specific data must be placed after the unwind information therefore it must not be emitted into a separate section. Reviewed by Charles Davis and Nico Rieck. llvm-svn: 185761	2013-07-06 17:17:31 +00:00
Benjamin Kramer	c7332b2796	DAGCombiner: Don't drop extension behavior when shrinking a load when unsafe. ReduceLoadWidth unconditionally drops extensions from loads. Limit it to the case when all of the bits the extension would otherwise produce are dropped by the shrink. It would be possible to shrink the load in more cases by merging the extensions, but this isn't trivial and a very rare case. I left a TODO for that case. Fixes PR16551. llvm-svn: 185755	2013-07-06 14:05:09 +00:00
Tim Northover	dab4db5372	Stop putting operations after a tail call. This prevents the emission of DAG-generated vreg definitions after a tail call be dropping them entirely (on the grounds that nothing could use them anyway, and they interfere with O0 CodeGen). llvm-svn: 185754	2013-07-06 12:58:45 +00:00
Nico Rieck	a37acf702d	MC: Implement COFF .linkonce directive llvm-svn: 185753	2013-07-06 12:13:10 +00:00
Jakob Stoklund Olesen	e6abacfb8b	Use modern API to avoid exposing LiveInterval internals. No functional change intended. llvm-svn: 185733	2013-07-05 23:48:07 +00:00
Jakob Stoklund Olesen	c8a4e3857a	Remove dead function. llvm-svn: 185731	2013-07-05 23:04:55 +00:00
Richard Sandiford	c40f27b52d	[SystemZ] Remove no-op MVCs The stack coloring pass has code to delete stores and loads that become trivially dead after coloring. Extend it to cope with single instructions that copy from one frame index to another. The testcase happens to show an example of this kicking in at the moment. It did occur in Real Code too though. llvm-svn: 185705	2013-07-05 14:38:48 +00:00
Richard Sandiford	b5d9bd6f59	Fix double renaming bug in stack coloring pass The stack coloring pass renumbered frame indexes with a loop of the form: for each frame index FI for each instruction I that uses FI for each use of FI in I rename FI to FI' This caused problems if an instruction used two frame indexes F0 and F1 and if F0 was renamed to F1 and F1 to F2. The first time we visited the instruction we changed F0 to F1, then we changed both F1s to F2. In other words, the problem was that SSRefs recorded which instructions used an FI, but not which MachineOperands and MachineMemOperands within that instruction used it. This is easily fixed for MachineOperands by walking the instructions once and processing each operand in turn. There's already a loop to do that for dead store elimination, so it seemed more efficient to fuse the two at the block level. MachineMemOperands are more tricky because they can be shared between instructions. The patch handles them by making SSRefs an array of MachineMemOperands rather than an array of MachineInstrs. We might end up processing the same MachineMemOperand twice, but that's OK because we always know from the SSRefs index what the original frame index was. llvm-svn: 185703	2013-07-05 14:24:47 +00:00
Richard Sandiford	5dd52f8c4d	[SystemZ] Clean up register scavenging code SystemZ wants normal register scavenging slots, as close to the stack or frame pointer as possible. The only reason it was using custom code was because PrologEpilogInserter assumed an x86-like layout, where the frame pointer is at the opposite end of the frame from the stack pointer. This meant that when frame pointer elimination was disabled, the slots ended up being as close as possible to the incoming stack pointer, which is the opposite of what we want on SystemZ. This patch adds a new knob to say which layout is used and converts SystemZ to use target-independent scavenging slots. It's one of the pieces needed to support frame-to-frame MVCs, where two slots might be required. The ABI requires us to allocate 160 bytes for calls, so one approach would be to use that area as temporary spill space instead. It would need some surgery to make sure that the slot isn't live across a call though. I stuck to the "isFPCloseToIncomingSP - ..." style comment on the "do what the surrounding code does" principle. The FP case is already covered by several Systemz/frame-* tests, which fail without the PrologueEpilogueInserter change, so no new ones are needed. No behavioural change intended. llvm-svn: 185696	2013-07-05 12:55:00 +00:00
Benjamin Kramer	5dbec7d961	Simplify code. No functionality change. llvm-svn: 185689	2013-07-05 10:20:57 +00:00
Nico Rieck	316c37407e	Initialize object file info before output streamer r179494 switched to using the object file info to retrieve the default text section for some MC streamers. It is possible that initializing an MC streamer can request sections before the object file info is initialized when the AutoInitSections flag is set on the streamer. llvm-svn: 185670	2013-07-04 21:37:26 +00:00
Jakob Stoklund Olesen	db429d9483	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185625	2013-07-04 13:54:20 +00:00
Jakob Stoklund Olesen	6a7d68349f	Typo. llvm-svn: 185618	2013-07-04 04:53:49 +00:00
Jakob Stoklund Olesen	fee2a20209	Simplify landing pad lowering. Stop using the ISD::EXCEPTIONADDR and ISD::EHSELECTION when lowering landing pad arguments. These nodes were previously legalized into CopyFromReg nodes, but that never worked properly because the CopyFromReg node weren't guaranteed to be scheduled at the top of the basic block. This meant the exception pointer and selector registers could be clobbered before being copied to a virtual register. This patch copies the two physical registers to virtual registers at the beginning of the basic block, and lowers the landingpad instruction directly to two CopyFromReg nodes reading the virtual registers. This is safe because virtual registers don't get clobbered. A future patch will remove the ISD::EXCEPTIONADDR and ISD::EHSELECTION nodes. llvm-svn: 185617	2013-07-04 04:53:45 +00:00
Jakob Stoklund Olesen	3d8560c382	FastISel can only apend to basic blocks. Compute the insertion point from the end of the basic block instead of skipping labels from the front. This caused failures in landing pads when live-in copies where inserted before instruction selection. llvm-svn: 185616	2013-07-04 04:32:39 +00:00
Jakob Stoklund Olesen	bbbb53262a	Live-in copies go after EH_LABELs. This will soon be tested by exception handling working at all. llvm-svn: 185615	2013-07-04 04:32:35 +00:00
Jakob Stoklund Olesen	a1f5b901a5	Revert r185595-185596 which broke buildbots. Revert "Simplify landing pad lowering." Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes." llvm-svn: 185600	2013-07-04 00:26:30 +00:00
Jakob Stoklund Olesen	f33ec531fa	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185596	2013-07-03 23:56:31 +00:00
Jakob Stoklund Olesen	fa6a7b9b02	Simplify landing pad lowering. Stop using the ISD::EXCEPTIONADDR and ISD::EHSELECTION when lowering landing pad arguments. These nodes were previously legalized into CopyFromReg nodes, but that never worked properly because the CopyFromReg node weren't guaranteed to be scheduled at the top of the basic block. This meant the exception pointer and selector registers could be clobbered before being copied to a virtual register. This patch copies the two physical registers to virtual registers at the beginning of the basic block, and lowers the landingpad instruction directly to two CopyFromReg nodes reading the virtual registers. This is safe because virtual registers don't get clobbered. A future patch will remove the ISD::EXCEPTIONADDR and ISD::EHSELECTION nodes. llvm-svn: 185595	2013-07-03 23:56:24 +00:00
Jakob Stoklund Olesen	533c3bf2d6	Add MachineBasicBlock::addLiveIn(). This function adds a live-in physical register to an MBB and ensures that it is copied to a virtual register immediately. llvm-svn: 185594	2013-07-03 23:56:20 +00:00
Eric Christopher	614a89f5b2	Hoist all of the Entry.getLoc() calls int a single variable. llvm-svn: 185589	2013-07-03 22:40:21 +00:00
Eric Christopher	25f0642afd	Make DotDebugLocEntry a class, reorder the members along with comments for them and update all uses. llvm-svn: 185588	2013-07-03 22:40:18 +00:00
Eric Christopher	270a12cff3	Elaborate on comment. llvm-svn: 185586	2013-07-03 21:37:03 +00:00
Eric Christopher	dd7b4615d1	Add names to the header file since they help in documenting the API (and for consistency). llvm-svn: 185585	2013-07-03 21:23:59 +00:00
Eric Christopher	18cf0610ef	Move typedefs inside the class that they belong to. llvm-svn: 185573	2013-07-03 20:36:36 +00:00
Eric Christopher	98341b8d63	Remove unused field. llvm-svn: 185523	2013-07-03 08:26:07 +00:00
Eric Christopher	c665af772a	Constify a few functions. llvm-svn: 185520	2013-07-03 08:13:55 +00:00
Craig Topper	af0ad9e20f	Use SmallVectorImpl::const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185514	2013-07-03 05:18:47 +00:00
Craig Topper	72cde639cb	Use SmallVectorImpl instead of SmallVector as method argument to avoid specifying vector size. llvm-svn: 185513	2013-07-03 05:16:59 +00:00
Craig Topper	e1c1d363a5	Use SmallVectorImpl instead of SmallVector for iterators and references to avoid specifying the vector size unnecessarily. llvm-svn: 185512	2013-07-03 05:11:49 +00:00
Craig Topper	24fd7eed8a	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185509	2013-07-03 04:42:33 +00:00
Craig Topper	2b4a201ba6	Introduce some typedefs for DenseMaps containing SmallVectors so the vector size doesn't have to repeated when creating iterators for the DenseMap. llvm-svn: 185508	2013-07-03 04:40:27 +00:00
Craig Topper	80170e54ad	Return SmallVectorImpl& instead of SmallVector& in a couple places to avoid having to specify the vector size in multiple places. llvm-svn: 185507	2013-07-03 04:30:58 +00:00
Craig Topper	977e9cd9c6	Use SmallVectorImpl& instead of SmallVector& to avoid needlessly respecifying the small vector size. llvm-svn: 185505	2013-07-03 04:24:43 +00:00
Craig Topper	d8e436550a	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185504	2013-07-03 04:17:25 +00:00
Eric Christopher	f94eb2bb26	Avoid doing a lot of computation when we have multiple ranges and avoid adding information for the debug_inlined section when it isn't going to be emitted anyhow. llvm-svn: 185500	2013-07-03 02:23:53 +00:00
Eric Christopher	dc42ea8fbd	Move iterator to where it's used and update comments. llvm-svn: 185498	2013-07-03 01:57:28 +00:00
Eric Christopher	070bf1685b	Move instance variable before experimental section. llvm-svn: 185497	2013-07-03 01:57:26 +00:00
Eric Christopher	808fd7bb54	Fix typo to make grep for DW_AT_comp_dir work without case-insensitive grep. llvm-svn: 185496	2013-07-03 01:57:23 +00:00
Eric Christopher	95a7d49ecd	Remove unnecessary forward declare. llvm-svn: 185495	2013-07-03 01:57:20 +00:00
Eric Christopher	c1110838d3	Add a helpful comment. llvm-svn: 185492	2013-07-03 01:22:29 +00:00
Eric Christopher	78fcf490d8	addConstantValue, addConstantFPValue never returned anything but true, so remove the return value and propagate accordingly. llvm-svn: 185490	2013-07-03 01:08:30 +00:00
Manman Ren	ac8062bb72	Debug Info: use module flag to set up Dwarf version. Correctly handles ref_addr depending on the Dwarf version. Emit Dwarf with version from module flag. TODO: turn on/off features depending on the Dwarf version. llvm-svn: 185484	2013-07-02 23:40:10 +00:00
Eric Christopher	9046f94271	Fix comment. llvm-svn: 185480	2013-07-02 21:36:07 +00:00
Ulrich Weigand	2b6fc8d613	[DebugInfo] Allow getDebugThreadLocalSymbol to return MCExpr This allows getDebugThreadLocalSymbol to return a generic MCExpr instead of just a MCSymbolRefExpr. This is in preparation for supporting debug info for TLS variables on PowerPC, where we need to describe the variable location using a more complex expression than just MCSymbolRefExpr. llvm-svn: 185460	2013-07-02 18:47:09 +00:00
Ulrich Weigand	8b3d22664e	[DebugInfo] Hold generic MCExpr in AddrPool This changes the AddrPool infrastructure to enable it to hold generic MCExpr expressions, not just MCSymbolRefExpr. This is in preparation for supporting debug info for TLS variables on PowerPC, where we need to describe the variable location using a more complex expression than just MCSymbolRefExpr. llvm-svn: 185459	2013-07-02 18:46:46 +00:00
Ulrich Weigand	396ba8b495	[DebugInfo] Introduce DIEExpr variant of DIEValue to hold MCExpr values This partially reverts r185202 and restores DIELabel to hold plain MCSymbol references. Instead, we add a new subclass DIEExpr of DIEValue that can hold generic MCExpr references. This is in preparation for supporting debug info for TLS variables on PowerPC, where we need to describe the variable location using a more complex expression than just MCSymbolRefExpr. llvm-svn: 185458	2013-07-02 18:46:26 +00:00
Jakob Stoklund Olesen	13be6bfb60	Revert (most of) r185393 and r185395. "Remove floating point computations form SpillPlacement.cpp." These commits caused test failures in lencod on clang-native-arm-lnt. I suspect these changes are only exposing an existing issue, but reverting anyway to keep the bots passing while we investigate. llvm-svn: 185447	2013-07-02 17:31:58 +00:00
Rafael Espindola	64e1af8eb9	Remove address spaces from MC. This is dead code since PIC16 was removed in 2010. The result was an odd mix, where some parts would carefully pass it along and others would assert it was zero (most of the object streamer for example). llvm-svn: 185436	2013-07-02 15:49:13 +00:00
Tim Northover	6823900e55	DAGCombiner: fix use-counting issue when forming zextload DAGCombiner was counting all uses of a load node when considering whether it's worth combining into a zextload. Really, it wants to ignore the chain and just count real uses. rdar://problem/13896307 llvm-svn: 185419	2013-07-02 09:58:53 +00:00
David Blaikie	8466ca86fe	PR14728: DebugInfo: TLS variables with -gsplit-dwarf llvm-svn: 185398	2013-07-01 23:55:52 +00:00
Jakob Stoklund Olesen	3b4e9ccfe3	Tweak some comments that referred to the old bias computations. llvm-svn: 185395	2013-07-01 23:36:37 +00:00
Jakob Stoklund Olesen	55daed27b7	Remove floating point computations form SpillPlacement.cpp. Patch by Benjamin Kramer! Use the BlockFrequency class instead of floats in the Hopfield network computations. This rescales the node Bias field from a [-2;2] float range to two block frequencies BiasN and BiasP pulling in opposite directions. This construct has a more predictable behavior when block frequencies saturate. The per-node scaling factors are no longer necessary, assuming the block frequencies around a bundle are consistent. This patch can cause the register allocator to make different spilling decisions. The differences should be small. llvm-svn: 185393	2013-07-01 23:19:39 +00:00
David Blaikie	1b01ae8648	PR16493: DebugInfo with TLS on PPC crashing due to invalid relocation Restrict the current TLS support to X86 ELF for now. Test that we don't produce it on PPC & we can flesh that test case out with the right thing once someone implements it. llvm-svn: 185389	2013-07-01 21:45:25 +00:00
Lang Hames	7d99d7977c	Make PBQP require/preserve MachineLoopInfo - the spiller requires it. llvm-svn: 185378	2013-07-01 20:47:47 +00:00
Cameron Zwarich	e35038da94	Fix the build after r185363. Use llvm::next instead of raw next. llvm-svn: 185367	2013-07-01 19:49:48 +00:00
Cameron Zwarich	867bfcd546	Fix PR16508. When phis get lowered, destination copies are inserted using an iterator that is determined once for all phis in the block, which BuildMI interprets as a request to insert an instruction directly before the iterator. In the case of a cyclic phi, source copies may also be inserted directly before this iterator, which can cause source copies to be inserted before destination copies. The fix is to keep an iterator to the last phi and then advance it while lowering each phi in order to insert destination copies directly after the phis. llvm-svn: 185363	2013-07-01 19:42:46 +00:00
Michael Gottesman	fd62bb9d3e	Added c++ mode selector to head of SelectionDAGBuilder.h so editors open it in c++ mode instead of c mode. llvm-svn: 185348	2013-07-01 16:53:41 +00:00
Benjamin Kramer	fd51092719	misched: Compress pairs returned by getUnderlyingObjectsForInstr. llvm-svn: 185266	2013-06-29 18:41:17 +00:00
David Blaikie	f269497068	DebugInfo: PR14728: TLS support Based on GCC's output for TLS variables (OP_constNu, x@dtpoff, OP_lo_user), this implements debug info support for TLS in ELF. Verified that this output is correct/sufficient on Linux (using gold - if you're using binutils-ld, you'll need something with the fix for http://sourceware.org/bugzilla/show_bug.cgi?id=15685 in it). Support on non-ELF is sort of "arbitrary" at the moment - if Apple folks want to discuss (or just go ahead & implement) how this should work in MachO, etc, I'm open. llvm-svn: 185203	2013-06-28 20:05:11 +00:00
David Blaikie	f3cd7c5115	DebugInfo: Pass MCSymbolRefExprs for labels instead of MCSymbols This is a precursor to adding debug info support for TLS which requires non-default relocations applied to TLS symbols. llvm-svn: 185202	2013-06-28 20:05:04 +00:00
David Blaikie	a67de2b1bc	DebugInfo: Revise r185189 to avoid subtle 'unsigned += bool' llvm-svn: 185190	2013-06-28 18:55:13 +00:00
David Blaikie	b8ef7851dc	DebugInfo: Simplify the AddressPool representation llvm-svn: 185189	2013-06-28 18:47:19 +00:00
David Blaikie	dea547b94d	DebugInfo: constify the AddressPool MCSymbol pointers llvm-svn: 185188	2013-06-28 18:47:14 +00:00
Lang Hames	c22e39d83d	Add missing case to switch statement - DAGTypeLegalizer::ExpandIntegerResult should expand ATOMIC_CMP_SWAP nodes the same way that it does for ATOMIC_SWAP. Since ATOMIC_LOADs on some targets (e.g. older ARM variants) get legalized to ATOMIC_CMP_SWAPs, the missing case had been causing i64 atomic loads to crash during isel. <rdar://problem/14074644> llvm-svn: 185186	2013-06-28 18:36:42 +00:00
Manman Ren	983a16c08a	Debug Info: clean up usage of Verify. No functionality change. It should suffice to check the type of a debug info metadata, instead of calling Verify. For cases where we know the type of a DI metadata, use assert. Also update testing cases to make them conform to the format of DI classes. llvm-svn: 185135	2013-06-28 05:43:10 +00:00
Benjamin Kramer	8da6468a34	Remove unused variable. llvm-svn: 185072	2013-06-27 11:26:41 +00:00
Elena Demikhovsky	fed077be03	Fixed a comment. llvm-svn: 184933	2013-06-26 12:15:53 +00:00
Elena Demikhovsky	6769c50d9e	Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector. llvm-svn: 184931	2013-06-26 10:55:03 +00:00
Manman Ren	aed30fa3de	Remove unused code. No functionality change. llvm-svn: 184866	2013-06-25 18:49:55 +00:00
Benjamin Kramer	866793109e	BlockFrequency: Bump up the entry frequency a bit. This is a band-aid to fix the most severe regressions we're seeing from basing spill decisions on block frequencies, until we have a better solution. llvm-svn: 184835	2013-06-25 13:34:40 +00:00
Eric Christopher	5fdd68e596	80-column and tab character fixes. llvm-svn: 184792	2013-06-24 23:20:02 +00:00
Eric Christopher	40b6bf6f46	Formatting. llvm-svn: 184788	2013-06-24 21:34:55 +00:00
Eric Christopher	bf2d23c0d7	Use const references instead of pointers to references that are never modified. No functional change. llvm-svn: 184781	2013-06-24 21:07:27 +00:00
Andrew Trick	97a1d7c475	Fix tail merging to assign the (more) correct BasicBlock when splitting. This makes it possible to write unit tests that are less susceptible to minor code motion, particularly copy placement. block-placement.ll covers this case with -pre-RA-sched=source which will soon be default. One incorrectly named block is already fixed, but without this fix, enabling new coalescing and scheduling would cause more failures. llvm-svn: 184680	2013-06-24 01:55:01 +00:00
David Blaikie	5acff7e691	DebugInfo: PR14404: Avoid truncating 64 bit values into 32 bits for ULEB128/SLEB128 generation llvm-svn: 184669	2013-06-23 18:31:11 +00:00
David Blaikie	2b380232c3	DebugInfo: Support (using GNU extensions) for template template parameters and parameter packs llvm-svn: 184643	2013-06-22 18:59:11 +00:00
Chad Rosier	295bd43adb	The getRegForInlineAsmConstraint function should only accept MVT value types. llvm-svn: 184642	2013-06-22 18:37:38 +00:00
Andrew Trick	cbd7305d09	Prevent LiveRangeEdit from deleting bundled instructions. We have no targets on trunk that bundle before regalloc. However, we have been advertising regalloc as bundle safe for use with out-of-tree targets. We need to at least contain the parts of the code that are still unsafe. llvm-svn: 184620	2013-06-22 00:33:48 +00:00
David Blaikie	97c6c5bd98	DebugInfo: Don't lose unreferenced non-trivial by-value parameters A FastISel optimization was causing us to emit no information for such parameters & when they go missing we end up emitting a different function type. By avoiding that shortcut we not only get types correct (very important) but also location information (handy) - even if it's only live at the start of a function & may be clobbered later. Reviewed/discussion by Evan Cheng & Dan Gohman. llvm-svn: 184604	2013-06-21 22:56:30 +00:00
Michael Liao	62ebfd8786	Fix PR16360 When (srl (anyextend x), c) is folded into (anyextend (srl x, c)), the high bits are not cleared. Add 'and' to clear off them. llvm-svn: 184575	2013-06-21 18:45:27 +00:00
Andrew Trick	5749b8be01	Update physreg live intervals during remat. llvm-svn: 184574	2013-06-21 18:33:26 +00:00
Andrew Trick	8d02e917f4	Added -precompute-phys-liveness for testing LiveIntervals updates. llvm-svn: 184573	2013-06-21 18:33:23 +00:00
Andrew Trick	6b9c49a275	Handle more cases in LiveRangeEdit::eliminateDeadDefs. Live intervals for dead physregs may be created during coalescing. We need to update these in the event that their instruction goes away. crash.ll is the unit test that catches it when MI sched is enabled on X86. llvm-svn: 184572	2013-06-21 18:33:20 +00:00
Andrew Trick	530fc1f486	Refactor LiveRangeEdit::eliminateDeadDefs. I want to add logic to handle more cases. llvm-svn: 184571	2013-06-21 18:33:17 +00:00
Andrew Trick	7df3f01703	whitespace llvm-svn: 184570	2013-06-21 18:33:14 +00:00
Andrew Trick	714aec021d	Fix a -join-globalcopies bug; handle undef operands. llvm-svn: 184569	2013-06-21 18:33:11 +00:00
Andrew Trick	75961ecc1a	Modify the -join-globalcopies option (off by default). Always coalesce in forward order to propagate rematerialization. I'm fixing this option so I can enable it by default soon. llvm-svn: 184568	2013-06-21 18:33:09 +00:00
Andrew Trick	3a851a27b8	Make rematerialization in the coalescer less sensitive to LRG order. llvm-svn: 184567	2013-06-21 18:33:06 +00:00
Andrew Trick	b55db58edf	MI-Sched: cleanup DEBUG output. llvm-svn: 184565	2013-06-21 18:33:01 +00:00
Andrew Trick	736dd9a255	MI-Sched: Adjust regpressure limits for reserved regs. llvm-svn: 184564	2013-06-21 18:32:58 +00:00
David Blaikie	ea2605dc1a	DebugInfo: don't use location lists when the location covers the whole function anyway Fix up three tests - one that was relying on abbreviation number, another relying on a location list in this case (& testing raw asm, changed that to use dwarfdump on the debug_info now that that's where the location is), and another which was added in r184368 - exposing a bug in that fix that is exposed when we emit the location inline rather than through a location list. Fix that bug while I'm here. llvm-svn: 184387	2013-06-20 00:25:24 +00:00
Bill Wendling	7ed411062b	This is now a duplicate. llvm-svn: 184376	2013-06-19 22:16:33 +00:00
Bill Wendling	cce21cc347	Make the comparison operators non-member functions. llvm-svn: 184373	2013-06-19 22:09:25 +00:00
David Blaikie	81a4dc75e9	DebugInfo: PR14763/r183329 correct the location of indirect parameters We had been papering over a problem with location info for non-trivial types passed by value by emitting their type as references (this caused the debugger to interpret the location information correctly, but broke the type of the function). r183329 corrected the type information but lead to the debugger interpreting the pointer parameter as the value - the debug info describing the location needed an extra dereference. Use a new flag in DIVariable to add the extra indirection (either by promoting an existing DW_OP_reg (parameter passed in a register) to DW_OP_breg + 0 or by adding DW_OP_deref to an existing DW_OP_breg + n (parameter passed on the stack). llvm-svn: 184368	2013-06-19 21:55:13 +00:00
Bill Wendling	a3cd350249	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184360	2013-06-19 21:36:55 +00:00
Bill Wendling	7a639ea2a4	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184352	2013-06-19 21:07:11 +00:00
Bill Wendling	afc1036f3e	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184349	2013-06-19 20:51:24 +00:00
Bill Wendling	0ccf31007f	Don't cache the TLI object since we have access to it through TargetMachine already. llvm-svn: 184346	2013-06-19 20:32:16 +00:00
Quentin Colombet	b51a68681a	During SelectionDAG building explicitly set a node to constant zero when the value is zero. This allows optmizations to kick in more easily. Fix some test cases so that they remain meaningful (i.e., not completely dead coded) when optimizations apply. <rdar://problem/14096009> superfluous multiply by high part of zero-extended value. llvm-svn: 184222	2013-06-18 20:14:39 +00:00
Timur Iskhodzhanov	302ba2314a	Hopefully fix the MSVS build after r184105 llvm-svn: 184178	2013-06-18 07:57:22 +00:00
Bill Wendling	bc07a8900c	Use pointers to the MCAsmInfo and MCRegInfo. Someone may want to do something crazy, like replace these objects if they change or something. No functionality change intended. llvm-svn: 184175	2013-06-18 07:20:20 +00:00
Bill Wendling	551a6775db	Simplify some of the code. No functionality change. llvm-svn: 184172	2013-06-18 06:07:26 +00:00
Andrew Trick	5d4861867a	MI-Sched: handle ReadAdvance latencies as used by Swift. llvm-svn: 184135	2013-06-17 21:45:18 +00:00
Andrew Trick	71f08a3e74	Give RegMax higher priority. llvm-svn: 184133	2013-06-17 21:45:13 +00:00
Andrew Trick	3c3a40e4c6	Remove compareRPDelta. A complex, expensive heuristic with little value in the current design. llvm-svn: 184132	2013-06-17 21:45:11 +00:00
Andrew Trick	d31db4b45d	MI Sched: fix a typo in RegPressure heuristics. llvm-svn: 184131	2013-06-17 21:45:09 +00:00
Andrew Trick	7e63046ce9	MI-Sched: Remove another heuristic that is sensitive to queue order. llvm-svn: 184130	2013-06-17 21:45:07 +00:00
Andrew Trick	d40d0f2c1b	MI-Sched: Track multiple candidates with the same priority level. This eliminates the MultiPressure scheduling "reason". It was sensitive to queue order. We don't like being sensitive to queue order. llvm-svn: 184129	2013-06-17 21:45:05 +00:00
Bill Wendling	626c991ce9	Directly access objects which may change during compilation. llvm-svn: 184121	2013-06-17 20:41:25 +00:00
Benjamin Kramer	e2a1d89e14	Switch spill weights from a basic loop depth estimation to BlockFrequencyInfo. The main advantages here are way better heuristics, taking into account not just loop depth but also __builtin_expect and other static heuristics and will eventually learn how to use profile info. Most of the work in this patch is pushing the MachineBlockFrequencyInfo analysis into the right places. This is good for a 5% speedup on zlib's deflate (x86_64), there were some very unfortunate spilling decisions in its hottest loop in longest_match(). Other benchmarks I tried were mostly neutral. This changes register allocation in subtle ways, update the tests for it. 2012-02-20-MachineCPBug.ll was deleted as it's very fragile and the instruction it looked for was gone already (but the FileCheck pattern picked up unrelated stuff). llvm-svn: 184105	2013-06-17 19:00:36 +00:00
David Blaikie	b735b4d6db	DebugInfo: remove target-specific Frame Index handling for DBG_VALUE MachineInstrs Frame index handling is now target-agnostic, so delete the target hooks for creation & asm printing of target-specific addressing in DBG_VALUEs and any related functions. llvm-svn: 184067	2013-06-16 20:34:27 +00:00
David Blaikie	0252265be0	Debug Info: Simplify Frame Index handling in DBG_VALUE Machine Instructions Rather than using the full power of target-specific addressing modes in DBG_VALUEs with Frame Indicies, simply use Frame Index + Offset. This reduces the complexity of debug info handling down to two representations of values (reg+offset and frame index+offset) rather than three or four. Ideally we could ensure that frame indicies had been eliminated by the time we reached an assembly or dwarf generation, but I haven't spent the time to figure out where the FIs are leaking through into that & whether there's a good place to convert them. Some FI+offset=>reg+offset conversion is done (see PrologEpilogInserter, for example) which is necessary for some SelectionDAG assumptions about registers, I believe, but it might be possible to make this a more thorough conversion & ensure there are no remaining FIs no matter how instruction selection is performed. llvm-svn: 184066	2013-06-16 20:34:15 +00:00
David Blaikie	c9380db4ba	DebugInfo: print DBG_VALUE MachineInstrs with [] for deref and drop the offset when it's zero llvm-svn: 184045	2013-06-15 15:52:58 +00:00
Andrew Trick	8e8415f5ab	Missing NDEBUGs. llvm-svn: 184039	2013-06-15 05:46:47 +00:00
Andrew Trick	f78e7fa140	MI-Sched: heuristics using the new latency and machine model. llvm-svn: 184038	2013-06-15 05:39:19 +00:00
Andrew Trick	de2109eb4c	Machine Model: Add MicroOpBufferSize and resource BufferSize. Replace the ill-defined MinLatency and ILPWindow properties with with straightforward buffer sizes: MCSchedMode::MicroOpBufferSize MCProcResourceDesc::BufferSize These can be used to more precisely model instruction execution if desired. Disabled some misched tests temporarily. They'll be reenabled in a few commits. llvm-svn: 184032	2013-06-15 04:49:57 +00:00
Andrew Trick	e2ff575bb8	MI-Sched: Rename IssueCount to CurrMOps. "Counts" refer to scaled resource counts within a region. CurrMOps is simply the number of micro-ops to be issue in the current cycle. llvm-svn: 184031	2013-06-15 04:49:49 +00:00
Andrew Trick	0cd8afc547	MI-Sched: Remove the temporary EnableCopyConstrain flag. llvm-svn: 184030	2013-06-15 04:49:46 +00:00
Andrew Trick	f5b8ef2690	MI-Sched: added tracking of dependent latency for better heuristics. Heuristics compare the critical path in the scheduled code, called ExpectedLatency, with the latency of instructions remaining to be scheduled. There are two ways to look at remaining latency: (1) Dependent latency includes the latency between unscheduled and scheduled instructions. (2) Independent latency is simply the height (bottom-up) or depth (top-down) of instructions currently in the ready Q. llvm-svn: 184029	2013-06-15 04:49:44 +00:00
Andrew Trick	5580e5ca80	MI-Sched: DEBUG: print critical resource. llvm-svn: 184028	2013-06-15 04:49:42 +00:00
Stephen Lin	605207fe75	SelectionDAG: slightly refactor DAGCombiner::visitSELECT_CC to avoid redudant checks... This doesn't really effect performance due to all the relevant calls being transparent but is clearer. llvm-svn: 184027	2013-06-15 04:03:33 +00:00
David Blaikie	2dbebcfede	Debug Info: Don't print the display name and colon prefix for DEBUG_VALUE comments if the display name is empty llvm-svn: 184026	2013-06-15 00:33:47 +00:00
Matt Arsenault	d2f0332a29	Introduce getSelect usage and use more getSelectCC llvm-svn: 184012	2013-06-14 22:04:37 +00:00
Stephen Lin	4e69d01b67	SelectionDAG: minor fix to order of operands in comments to match the code llvm-svn: 184008	2013-06-14 21:33:58 +00:00
Tim Northover	059cead5ed	Mark rematerialized super/sub registers as dead. When we're rematerializing into a not-quite-right register we already add the real definition as an imp-def, but we should also be marking the "official" register as dead, since nothing else is going to use it as a result of this remat. Not doing this can affect pressure tracking. rdar://problem/14158833 llvm-svn: 184002	2013-06-14 20:22:21 +00:00
Stephen Lin	e31f2d2d54	SelectionDAG: Fix incorrect condition checks in some cases of folding FADD/FMUL combinations; also improve accuracy of comments llvm-svn: 183993	2013-06-14 18:17:35 +00:00
Derek Schuff	1804b5795a	Make PrologEpilogInserter save/restore all callee saved registers in functions which call __builtin_unwind_init() __builtin_unwind_init() is an undocumented gcc intrinsic which has this effect, and is used in libgcc_eh. Goes part of the way toward fixing PR8541. llvm-svn: 183984	2013-06-14 16:15:29 +00:00
Jakub Staszak	80df8b837f	Move #include from .h to .cpp file. llvm-svn: 183960	2013-06-14 00:00:13 +00:00
Dan Gohman	6a0c9ae4f9	Print ConstantPool entries initialized to Values with WriteAsOperand instead of operator<< so that functions are printed as just their name instead of as their entire definition, which is excessively verbose in this context. llvm-svn: 183871	2013-06-12 22:19:19 +00:00
Eric Christopher	773c155c15	Formatting. llvm-svn: 183807	2013-06-11 23:41:38 +00:00
Eric Christopher	a2f806fd20	Remove a few fixmes, the only work we're doing is getting the string to return and this is done all over. llvm-svn: 183704	2013-06-10 22:24:10 +00:00
Eric Christopher	2280c5baa6	Fix up comment. llvm-svn: 183703	2013-06-10 22:24:07 +00:00
Eric Christopher	8c47bb1a02	Remove unused function. llvm-svn: 183698	2013-06-10 21:59:28 +00:00
Eric Christopher	b7e3b42c01	IndentCount is only used within NDEBUG code. llvm-svn: 183695	2013-06-10 20:58:53 +00:00
David Majnemer	0fc8670cb0	TargetLowering: Clean up method description comments llvm-svn: 183623	2013-06-08 23:51:45 +00:00
Quentin Colombet	249cb6756c	Reapply r183552. This time, use a standard type for the option to avoid template instantiation issue with non-standard type. Add a backend option to warn on a given stack size limit. Option: -mllvm -warn-stack-size=<limit> Output (if limit is exceeded): warning: Stack size limit exceeded (<actual size>) in <functionName>. The longer term plan is to hook that to a clang warning. PR:4072 <rdar://problem/13987214>. llvm-svn: 183595	2013-06-08 00:07:54 +00:00
Quentin Colombet	bd5a201c85	Revert commits related to stack warning. llvm-svn: 183579	2013-06-07 22:14:50 +00:00
Quentin Colombet	6baf581b93	Add a backend option to warn on a given stack size limit. Option: -mllvm -warn-stack-size=<limit> Output (if limit is exceeded): warning: Stack size limit exceeded (<actual size>) in <functionName>. The longer term plan is to hook that to a clang warning. PR:4072 <rdar://problem/13987214> llvm-svn: 183552	2013-06-07 20:18:12 +00:00
Quentin Colombet	ba366011c8	Teach AsmPrinter how to print odd constants. Fix an assertion when the compiler encounters big constants whose bit width is not a multiple of 64-bits. Although clang would never generate something like this, the backend should be able to handle any legal IR. <rdar://problem/13363576> llvm-svn: 183544	2013-06-07 18:36:03 +00:00
Rafael Espindola	aad6c24422	Support OpenBSD's native frame protection conventions. OpenBSD's stack smashing protection differs slightly from other platforms: 1. The smash handler function is "__stack_smash_handler(const char *funcname)" instead of "__stack_chk_fail(void)". 2. There's a hidden "long __guard_local" object that gets linked into each executable and DSO. Patch by Matthew Dempsky. llvm-svn: 183533	2013-06-07 16:35:57 +00:00
Duncan Sands	e0a607e376	Correct wrong register in this example, pointed out by Baoshan Pang. llvm-svn: 183495	2013-06-07 08:30:55 +00:00
David Blaikie	b272a75328	Fix break in r183446 - helps to increment the iterator in a loop llvm-svn: 183454	2013-06-06 22:28:26 +00:00
David Blaikie	36d5d2f08d	Debug Info: simplify parameter ordering preservation Seems we emit the parameter ordering number (spuriously named 'arg number') in the debug info, so there's no need to search through the variable list to figure out the parameter ordering. This implementation does 'always' do the work, even in non-optimized debug info (the previous implementation checked the existence of the 'variables' list on the subprogram which is only present in optimized builds). No intended functionality change. llvm-svn: 183446	2013-06-06 21:04:51 +00:00
Bill Wendling	f77190855d	Cache the TargetLowering info object as a pointer. Caching it as a pointer allows us to reset it if the TargetMachine object changes. llvm-svn: 183361	2013-06-06 00:43:09 +00:00
Bill Wendling	8db01cb262	Don't cache the TargetLoweringInfo object inside of the FunctionLowering object. The TargetLoweringInfo object is owned by the TargetMachine. In the future, the TargetMachine object may change, which may also change the TargetLoweringInfo object. llvm-svn: 183356	2013-06-06 00:11:39 +00:00
Bill Wendling	6fd15ca02e	Add space to assert message. llvm-svn: 183346	2013-06-05 23:13:26 +00:00
David Blaikie	6f1a8067fb	PR15662: Optimized debug info produces out of order function parameters When a function is inlined we lazily construct the variables representing the function's parameters. After that, we add any remaining unused parameters. If the function doesn't use all the parameters, or uses them out of order, then the DWARF would produce them in that order, producing a parameter order that doesn't match the source. This fix causes us to always keep the arg variables at the start of the variable list & in the original order from the source. llvm-svn: 183297	2013-06-05 05:39:59 +00:00
Shuxin Yang	8b8fd2171c	Fix a defect in code-layout pass, improving Benchmarks/Olden/em3d/em3d by about 30% (4.58s vs 3.2s on an oldish Mac Tower). The corresponding src is excerpted bellow. The lopp accounts for about 90% of execution time. -------------------- cat -n test-suite/MultiSource/Benchmarks/Olden/em3d/make_graph.c 90 91 for (k=0; k<j; k++) 92 if (other_node == cur_node->to_nodes[k]) break; The defective layout is sketched bellow, where the two branches need to swap. ------------------------------------------------------------------------ L: ... if (cond) goto out-of-loop goto L While this code sequence is defective, I don't understand why it incurs 1/3 of execution time. CPU-event-profiling indicates the poor laoyout dose not increase in br-misprediction; it dosen't increase stall cycle at all, and it dosen't prevent the CPU detect the loop (i.e. Loop-Stream-Detector seems to be working fine as well)... The root cause of the problem is that the layout pass calls AnalyzeBranch() with basic-block which is not updated to reflect its current layout. rdar://13966341 llvm-svn: 183174	2013-06-04 01:00:57 +00:00
Benjamin Kramer	7c275640e7	Move getRealLinkageName to a common place and remove all the duplicates of it. Also simplify code a bit while there. No functionality change. llvm-svn: 183076	2013-06-01 17:51:14 +00:00
Eric Christopher	65ac02ad78	Const-ify some printing and dumping code for DIEValues. llvm-svn: 183057	2013-05-31 22:50:40 +00:00
Quentin Colombet	bf490d4a32	Loop Strength Reduce: Scaling factor cost. Account for the cost of scaling factor in Loop Strength Reduce when rating the formulae. This uses a target hook. The default implementation of the hook is: if the addressing mode is legal, the scaling factor is free. <rdar://problem/13806271> llvm-svn: 183045	2013-05-31 21:29:03 +00:00
Kai Nacke	e1823b6b85	Remove useless code from transitioning to new EH scheme Removes all uses of the variable UsesNewEH. Simply return false in case that no resume instructions were found. llvm-svn: 183016	2013-05-31 16:30:36 +00:00
Tim Northover	69cd121dd9	Fix rematerialization into physical registers. r182872 introduced a bug in how the register-coalescer's rematerialization handled defining a physical register. It relied on the output of the coalescer's setRegisters method to determine whether the replacement instruction needed an implicit-def. However, this value isn't necessarily the same as the CopyMI's actual destination register which is what the rest of the basic-block expects us to be defining. The commit changes the rematerializer to use the actual register attached to CopyMI in its decision. This will be tested soon by an X86 patch which moves everything to using MOV32r0 instead of other sizes. llvm-svn: 182925	2013-05-30 12:30:50 +00:00
Eric Christopher	d1c5a31721	Rename variable to be more descriptive. llvm-svn: 182903	2013-05-30 00:43:35 +00:00
Eric Christopher	1e1c7f1b15	Formatting. llvm-svn: 182902	2013-05-30 00:43:32 +00:00
Andrew Trick	ad6d08ac6f	Order CALLSEQ_START and CALLSEQ_END nodes. Fixes PR16146: gdb.base__call-ar-st.exp fails after pre-RA-sched=source fixes. Patch by Xiaoyi Guo! This also fixes an unsupported dbg.value test case. Codegen was previously incorrect but the test was passing by luck. llvm-svn: 182885	2013-05-29 22:03:55 +00:00
Bill Wendling	70b1400e6d	Don't reach into the middle of TargetMachine and cache one of its ivars. Not only does this break encapsulation, it's gross. llvm-svn: 182876	2013-05-29 20:37:19 +00:00
Tim Northover	b65f6b0820	Teach ReMaterialization to be more cunning about subregisters This allows rematerialization during register coalescing to handle more cases involving operations like SUBREG_TO_REG which might need to be rematerialized using sub-register indices. For example, code like: v1(GPR64):sub_32 = MOVZ something v2(GPR64) = COPY v1(GPR64) should be convertable to: v2(GPR64):sub_32 = MOVZ something but previously we just gave up in places like this llvm-svn: 182872	2013-05-29 19:32:06 +00:00
Manman Ren	4213c39e3c	LTO+Debug Info: revert r182791. Since the testing case uses ref_addr, which requires version 3+ to work, we will solve the dwarf version issue first. This patch also causes failures in one of the bots. I will update the patch accordingly in my next attempt. rdar://13926659 llvm-svn: 182867	2013-05-29 17:16:59 +00:00
Manman Ren	b5b5453e61	LTO+Debug Info: correctly emit inlined_subroutine when the inlined callee is from a different CU. We used to print out an error message and fail to generate inlined_subroutine. If we use ref_addr in the generated DWARF, the DWARF version should be 3 or above. rdar://13926659 llvm-svn: 182791	2013-05-28 19:01:58 +00:00
Benjamin Kramer	262b154247	Simplify code. No functionality change. llvm-svn: 182779	2013-05-28 16:39:36 +00:00
Benjamin Kramer	351d53c225	Remove double semicolons. llvm-svn: 182778	2013-05-28 16:31:26 +00:00
Preston Gurd	048f99de11	Convert sqrt functions into sqrt instructions when -ffast-math is in effect. When -ffast-math is in effect (on Linux, at least), clang defines __FINITE_MATH_ONLY__ > 0 when including <math.h>. This causes the preprocessor to include <bits/math-finite.h>, which renames the sqrt functions. For instance, "sqrt" is renamed as "__sqrt_finite". This patch adds the 3 new names in such a way that they will be treated as equivalent to their respective original names. llvm-svn: 182739	2013-05-27 15:44:35 +00:00
Andrew Trick	c66d26adf0	Fix PR16143: Insert DEBUG_VALUE before terminator. llvm-svn: 182717	2013-05-26 08:58:50 +00:00
Andrew Trick	e2431c64bc	Track IR ordering of SelectionDAG nodes 3/4. Remove the old IR ordering mechanism and switch to new one. Fix unit test failures. llvm-svn: 182704	2013-05-25 03:08:10 +00:00
Andrew Trick	ef9de2a739	Track IR ordering of SelectionDAG nodes 2/4. Change SelectionDAG::getXXXNode() interfaces as well as call sites of these functions to pass in SDLoc instead of DebugLoc. llvm-svn: 182703	2013-05-25 02:42:55 +00:00
Andrew Trick	175143bf88	Track IR ordering of SelectionDAG nodes 1/4. Use a field in the SelectionDAGNode object to track its IR ordering. This adds fields and utility classes without changing existing interfaces or functionality. llvm-svn: 182701	2013-05-25 02:20:36 +00:00
Michael J. Spencer	df1ecbd734	Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. llvm-svn: 182680	2013-05-24 22:23:49 +00:00
Jakob Stoklund Olesen	43711c51ec	Fix PR16110: Handle DBG_VALUE in ConnectedVNInfoEqClasses::Distribute(). Now that the LiveDebugVariables pass is running after register coalescing, the ConnectedVNInfoEqClasses class needs to deal with DBG_VALUE instructions. This only comes up when rematerialization during coalescing causes the remaining live range of a virtual register to separate into two connected components. llvm-svn: 182592	2013-05-23 17:02:23 +00:00
Benjamin Kramer	bc6666bedf	InlineSpiller: Store bucket pointers instead of iterators. Lets us use a SetVector instead of an explicit set + vector combination. llvm-svn: 182586	2013-05-23 15:42:57 +00:00
David Blaikie	5174c84add	Solidify the assumption that a DW_TAG_subprogram's type is a DW_TAG_subroutine_type There were bits & pieces of code lying around that may've given the impression that debug info metadata supported the possibility that a subprogram's type could be specified by a non-subroutine type describing the return type of a void function. This support was incomplete & unnecessary. Asserts & API have been changed to make the desired usage more clear. llvm-svn: 182532	2013-05-22 23:22:18 +00:00
Chad Rosier	abdb1d69ab	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182531	2013-05-22 23:17:36 +00:00
Chad Rosier	682ae15bb9	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182527	2013-05-22 22:36:55 +00:00
Chad Rosier	c7505ef8ba	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182526	2013-05-22 22:26:05 +00:00
Adrian Prantl	0d1e5592a6	Unify formatting of debug output. llvm-svn: 182495	2013-05-22 18:02:19 +00:00
Justin Holewinski	fff1f5f5e2	Drop @llvm.annotation and @llvm.ptr.annotation intrinsics during codegen. The intrinsic calls are dropped, but the annotated value is propagated. Fixes PR 15253 Original patch by Zeng Bin! llvm-svn: 182387	2013-05-21 14:37:16 +00:00
Benjamin Kramer	8aaf197990	DAGCombine: Avoid an edge case where it tried to create an i0 type for (x & 0) == 0. Fixes PR16083. llvm-svn: 182357	2013-05-21 08:51:09 +00:00
Manman Ren	9d4c735885	Dwarf: use a single line table to generate assembly when .loc is used. This is to fix PR15408 where an undefined symbol Lline_table_start1 is used. Since we do not generate the debug_line section when .loc is used, Lline_table_start1 is not emitted and we can't refer to it when calculating at_stmt_list for a compile unit. llvm-svn: 182344	2013-05-21 00:57:22 +00:00
David Blaikie	e63d5d1633	PR14606: Debug Info for namespace aliases/DW_TAG_imported_module This resolves the last of the PR14606 failures in the GDB 7.5 test suite by implementing an optional name field for DW_TAG_imported_modules/DIImportedEntities and using that to implement C++ namespace aliases (eg: "namespace X = Y;"). llvm-svn: 182328	2013-05-20 22:50:35 +00:00
Bill Wendling	eda5418e89	The DWARF EH pass doesn't need the TargetMachine, only the TargetLoweringBase like the other EH passes. llvm-svn: 182321	2013-05-20 21:54:18 +00:00
Bill Wendling	47447589c9	No need to store the TargetMachine variable in this class. llvm-svn: 182317	2013-05-20 21:28:28 +00:00
Matt Arsenault	75865923c9	Add LLVMContext argument to getSetCCResultType llvm-svn: 182180	2013-05-18 00:21:46 +00:00
Matt Arsenault	04126234e5	Replace redundant code Use EVT::changeExtendedVectorElementTypeToInteger instead of doing the same thing that it does llvm-svn: 182165	2013-05-17 21:43:43 +00:00
Matt Arsenault	52ddb7bcdd	Add missing -- C++ -- to headers llvm-svn: 182164	2013-05-17 21:43:39 +00:00
Adrian Prantl	9c93059aa4	Generate debug info for by-value struct args even if they are not used. radar://problem/13865940 llvm-svn: 182062	2013-05-16 23:44:12 +00:00
Rafael Espindola	b08d2c2db0	Remove addFrameMove. Now that we have good testing, remove addFrameMove and create cfi instructions directly. llvm-svn: 182052	2013-05-16 21:02:15 +00:00
Benjamin Kramer	fc88c3761f	DAGCombine: Also shrink eq compares where the constant is exactly as large as the smaller type. if ((x & 255) == 255) before: movzbl %al, %eax cmpl $255, %eax after: cmpb $-1, %al llvm-svn: 182038	2013-05-16 18:47:58 +00:00
Jakob Stoklund Olesen	9ae96c7aab	Add TargetRegisterInfo::getCoveringLanes(). This lane mask provides information about which register lanes completely cover super-registers. See the block comment before getCoveringLanes(). llvm-svn: 182034	2013-05-16 18:03:08 +00:00
Hal Finkel	1f6a7f53d8	Fix legalization of SETCC with promoted integer intrinsics If the input operands to SETCC are promoted, we need to make sure that we either use the promoted form of both operands (or neither); a mixture is not allowed. This can happen, for example, if a target has a custom promoted i1-returning intrinsic (where i1 is not a legal type). In this case, we need to use the promoted form of both operands. This change only augments the behavior of the existing logic in the case where the input types (which may or may not have already been legalized) disagree, and should not affect existing target code because this case would otherwise cause an assert in the SETCC operand promotion code. This will be covered by (essentially all of the) tests for the new PPCCTRLoops infrastructure. llvm-svn: 181926	2013-05-15 21:37:27 +00:00
Derek Schuff	d2c42d766d	Fix miscompile due to StackColoring incorrectly merging stack slots (PR15707) IR optimisation passes can result in a basic block that contains: llvm.lifetime.start(%buf) ... llvm.lifetime.end(%buf) ... llvm.lifetime.start(%buf) Before this change, calculateLiveIntervals() was ignoring the second lifetime.start() and was regarding %buf as being dead from the lifetime.end() through to the end of the basic block. This can cause StackColoring to incorrectly merge %buf with another stack slot. Fix by removing the incorrect Starts[pos].isValid() and Finishes[pos].isValid() checks. Just doing: Starts[pos] = Indexes->getMBBStartIdx(MBB); Finishes[pos] = Indexes->getMBBEndIdx(MBB); unconditionally would be enough to fix the bug, but it causes some test failures due to stack slots not being merged when they were before. So, in order to keep the existing tests passing, treat LiveIn and LiveOut separately rather than approximating the live ranges by merging LiveIn and LiveOut. This fixes PR15707. Patch by Mark Seaborn. llvm-svn: 181922	2013-05-15 21:15:09 +00:00
David Blaikie	041f1aa3e2	Use only explicit bool conversion operators BitVector/SmallBitVector::reference::operator bool remain implicit since they model more exactly a bool, rather than something else that can be boolean tested. The most common (non-buggy) case are where such objects are used as return expressions in bool-returning functions or as boolean function arguments. In those cases I've used (& added if necessary) a named function to provide the equivalent (or sometimes negative, depending on convenient wording) test. One behavior change (YAMLParser) was made, though no test case is included as I'm not sure how to reach that code path. Essentially any comparison of llvm::yaml::document_iterators would be invalid if neither iterator was at the end. This helped uncover a couple of bugs in Clang - test cases provided for those in a separate commit along with similar changes to `operator bool` instances in Clang. llvm-svn: 181868	2013-05-15 07:36:59 +00:00
Eric Christopher	8fd7ab07ca	Make getCompileUnit non-const and return the current DIE if it happens to be a compile unit. Noticed on inspection and tested via calling on a newly created compile unit. No functional change. llvm-svn: 181835	2013-05-14 21:33:10 +00:00
Kai Nacke	9a224ced0f	Add bitcast to store of personality function. The personality function is user defined and may have an arbitrary result type. The code assumes always i8. This results in an assertion failure if a different type is used. A bitcast to i8 is added to prevent this failure. Reviewed by: Renato Golin, Bob Wilson llvm-svn: 181802	2013-05-14 16:30:51 +00:00
Akira Hatanaka	1f24e6a6a2	StackColoring: don't clear an instruction's mem operand if the underlying object is a PseudoSourceValue and PseudoSourceValue::isConstant returns true (i.e., points to memory that has a constant value). llvm-svn: 181751	2013-05-14 01:42:44 +00:00
David Blaikie	7b770c6aed	Assert that DIEEntries are constructed with non-null DIEs This just brings a crash a little further forward from DWARF emission to DIE construction to make errors easier to diagnose. llvm-svn: 181748	2013-05-14 00:35:19 +00:00
Bob Wilson	c5c0823724	Remove redundant variable introduced by r181682. llvm-svn: 181721	2013-05-13 19:02:31 +00:00
Hao Liu	bc60196951	Fix PR15950 A bug in DAG Combiner about undef mask llvm-svn: 181682	2013-05-13 02:07:05 +00:00
Rafael Espindola	227144c23c	Remove the MachineMove class. It was just a less powerful and more confusing version of MCCFIInstruction. A side effect is that, since MCCFIInstruction uses dwarf register numbers, calls to getDwarfRegNum are pushed out, which should allow further simplifications. I left the MachineModuleInfo::addFrameMove interface unchanged since this patch was already fairly big. llvm-svn: 181680	2013-05-13 01:16:13 +00:00
Benjamin Kramer	63e39eb09d	StringRefize some debug accel table bits. llvm-svn: 181663	2013-05-11 18:24:28 +00:00
Rafael Espindola	1b09836bc3	Change getFrameMoves to return a const reference. To add a frame now there is a dedicated addFrameMove which also takes care of constructing the move itself. llvm-svn: 181657	2013-05-11 02:38:11 +00:00
Rafael Espindola	639890222e	Remove more dead code. llvm-svn: 181656	2013-05-11 02:24:41 +00:00
David Blaikie	a1e813dcd4	PR14492: Debug Info: Support for values of non-integer non-type template parameters. This is only tested for global variables at the moment (& includes tests for the unnamed parameter case, since apparently this entire function was completely untested previously) llvm-svn: 181632	2013-05-10 21:52:07 +00:00
Benjamin Kramer	a5d59333b3	DAGCombiner: Generate a correct constant for vector types when folding (xor (and)) into (and (not)). PR15948. llvm-svn: 181597	2013-05-10 14:09:52 +00:00
Owen Anderson	32baf99b1d	Teach SelectionDAG to constant fold all-constant FMA nodes the same way that it constant folds FADD, FMUL, etc. llvm-svn: 181555	2013-05-09 22:27:13 +00:00
Eric Christopher	f20ff979e9	Revert "Make sure debug info contains linkage names (DW_AT_MIPS_linkage_name)" temporarily while investigating gdb.cp/templates.exp. This reverts commit r181471. llvm-svn: 181496	2013-05-09 00:42:33 +00:00
Eric Christopher	697fa1c8be	Make sure debug info contains linkage names (DW_AT_MIPS_linkage_name) for constructors and destructors since the original declaration given by the AT_specification both won't and can't. Patch by Yacine Belkadi, I've cleaned up the testcases. llvm-svn: 181471	2013-05-08 21:23:22 +00:00
David Majnemer	386ab7f872	DAGCombiner: Simplify inverted bit tests Fold (xor (and x, y), y) -> (and (not x), y) This removes an opportunity for a constant to appear twice. llvm-svn: 181395	2013-05-08 06:44:42 +00:00
David Blaikie	3b6038b6f3	Debug Info: Support DW_TAG_imported_declaration This provides basic functionality for imported declarations. For subprograms and types some amount of lazy construction is supported (so the definition of a function can proceed the using declaration), but it still doesn't handle declared-but-not-defined functions (since we don't generally emit function declarations). Variable support is really rudimentary at the moment - simply looking up the existing definition with no support for out of order (declaration, imported_module, then definition). llvm-svn: 181392	2013-05-08 06:01:41 +00:00
David Blaikie	4dd2de7ae7	Finish renaming constructImportedModuleDIE to constructImportedEntityDIE llvm-svn: 181391	2013-05-08 06:01:38 +00:00
Eric Christopher	c57baeeee0	Pass the MDNode in and do the insertion at compile unit creation time instead of relying upon an extra call to finish initializing. llvm-svn: 181383	2013-05-08 00:58:51 +00:00
Eric Christopher	6156011ee8	Typo. llvm-svn: 181378	2013-05-08 00:11:10 +00:00
David Blaikie	1fd4365f91	Rename DIImportedModule to DIImportedEntity and allow imported declarations DIBuilder::createImportedDeclaration isn't fully plumbed through (note, lacking in AsmPrinter/DwarfDebug support) but this seemed like a sufficiently useful division of code to make the subsequent patch(es) easier to follow. llvm-svn: 181364	2013-05-07 21:35:53 +00:00
Rafael Espindola	9a383405a7	Remove exception handling support from the old JIT. llvm-svn: 181354	2013-05-07 20:53:59 +00:00
Matt Arsenault	a5733dc97e	Fix vselect when getSetCCResultType returns a different type from the operands llvm-svn: 181348	2013-05-07 20:24:18 +00:00
David Blaikie	6baa776173	Debug Info: Fix for break due to r181271 Apparently we didn't keep an association of Compile Unit metadata nodes to DIEs so looking up that parental context failed & thus caused no DW_TAG_imported_modules to be emitted at the CU scope. Fix this by adding the mapping & sure up the test case to verify this. llvm-svn: 181339	2013-05-07 17:57:13 +00:00
Timur Iskhodzhanov	ec4afe6680	Fix the VS2010 build broken by r181271 llvm-svn: 181296	2013-05-07 07:47:47 +00:00
David Blaikie	684fc5331e	DebugInfo: Support imported modules in lexical blocks llvm-svn: 181271	2013-05-06 23:33:07 +00:00
Eric Christopher	6c6de847a8	Remove unnecessary instance variable and rework logic accordingly. llvm-svn: 181227	2013-05-06 17:50:50 +00:00
Eric Christopher	f0303324be	Grammar. llvm-svn: 181226	2013-05-06 17:50:46 +00:00
Michael Kuperstein	ac868757d0	Fix slightly too aggressive conact_vector optimization. (Would sometimes optimize away conacts used to extend a vector with undef values) llvm-svn: 181186	2013-05-06 08:06:13 +00:00
Evan Cheng	8b8e8d88ff	Teach if-converter to avoid removing BBs whose addresses are takne. rdar://13782395 llvm-svn: 181160	2013-05-05 18:03:49 +00:00
Benjamin Kramer	391f5a6e21	InlineSpiller: Remove quadratic behavior. No functionality change. llvm-svn: 181149	2013-05-05 11:29:14 +00:00
Stepan Dyatkovskiy	8c02c98259	For ARM backend, fixed "byval" attribute support. Now even the small structures could be passed within byval (small enough to be stored in GPRs). In regression tests next function prototypes are checked: PR15293: %artz = type { i32 } define void @foo(%artz* byval %s) define void @foo2(%artz* byval %s, i32 %p, %artz* byval %s2) foo: "s" stored in R0 foo2: "s" stored in R0, "s2" stored in R2. Next AAPCS rules are checked: 5.5 Parameters Passing, C.4 and C.5, "ParamSize" is parameter size in 32bit words: -- NSAA != 0, NCRN < R4 and NCRN+ParamSize > R4. Parameter should be sent to the stack; NCRN := R4. -- NSAA != 0, and NCRN < R4, NCRN+ParamSize < R4. Parameter stored in GPRs; NCRN += ParamSize. llvm-svn: 181148	2013-05-05 07:48:36 +00:00
Dmitri Gribenko	3238fb7595	Add ArrayRef constructor from None, and do the cleanups that this constructor enables Patch by Robert Wilhelm. llvm-svn: 181138	2013-05-05 00:40:33 +00:00
Evan Cheng	f85a76f477	TiedTo flag can now be placed on implicit operands. isTwoAddrUse() should look at all of the operands. Previously it was skipping over implicit operands which cause infinite looping when the two-address pass try to reschedule a two-address instruction below the kill of tied operand. I'm unable to come up with a reasonably sized test case. rdar://13747577 llvm-svn: 180906	2013-05-02 02:07:32 +00:00
Bill Wendling	8f2e6feb8e	Revert r180737. The companion patch was reverted, and this is not relevant right now. llvm-svn: 180889	2013-05-01 22:32:08 +00:00
Filip Pizlo	dec20e43c0	This patch breaks up Wrap.h so that it does not have to include all of the things, and renames it to CBindingWrapping.h. I also moved CBindingWrapping.h into Support/. This new file just contains the macros for defining different wrap/unwrap methods. The calls to those macros, as well as any custom wrap/unwrap definitions (like for array of Values for example), are put into corresponding C++ headers. Doing this required some #include surgery, since some .cpp files relied on the fact that including Wrap.h implicitly caused the inclusion of a bunch of other things. This also now means that the C++ headers will include their corresponding C API headers; for example Value.h must include llvm-c/Core.h. I think this is harmless, since the C API headers contain just external function declarations and some C types, so I don't believe there should be any nasty dependency issues here. llvm-svn: 180881	2013-05-01 20:59:00 +00:00
Chad Rosier	8e4824f350	[inline asm] Return an undef SDValue of the expected value type, rather than report a fatal error. This allows us to continue processing the translation unit. Test case to come on the clang side because we need an inline asm diagnostics handler in place. rdar://13446483 llvm-svn: 180873	2013-05-01 19:49:26 +00:00
Nadav Rotem	e5a2dda372	Optimize away nop CONCAT_VECTOR nodes. Optimize CONCAT_VECTOR nodes that merge EXTRACT_SUBVECTOR values that extract from the same vector. rdar://13402653 PR15866 llvm-svn: 180871	2013-05-01 19:18:51 +00:00
Stephen Lin	699808ceb2	Only pass 'returned' to target-specific lowering code when the value of entire register is guaranteed to be preserved. llvm-svn: 180825	2013-04-30 22:49:28 +00:00
Adrian Prantl	a2888e71eb	Temporarily revert "Change the informal convention of DBG_VALUE so that we can express a" because it breaks some buildbots. This reverts commit 180816. llvm-svn: 180819	2013-04-30 22:35:14 +00:00
Adrian Prantl	9a576644e4	Change the informal convention of DBG_VALUE so that we can express a register-indirect address with an offset of 0. It used to be that a DBG_VALUE is a register-indirect value if the offset (operand 1) is nonzero. The new convention is that a DBG_VALUE is register-indirect if the first operand is a register and the second operand is an immediate. For plain registers use the combination reg, reg. rdar://problem/13658587 llvm-svn: 180816	2013-04-30 22:16:46 +00:00
Andrew Trick	dd77014acc	MI Sched: revert a minor heuristic that snuck in with -misched-vcopy. I'll fix the heuristic in a general way in a follow-up commit. llvm-svn: 180815	2013-04-30 22:10:59 +00:00
Hal Finkel	7153251ab5	LocalStackSlotAllocation improvements First, taking advantage of the fact that the virtual base registers are allocated in order of the local frame offsets, remove the quadratic register-searching behavior. Because of the ordering, we only need to check the last virtual base register created. Second, store the frame index in the FrameRef structure, and get the frame index and the local offset from this structure at the top of the loop iteration. This allows us to de-nest the loops in insertFrameReferenceRegisters (and I think makes the code cleaner). I also moved the needsFrameBaseReg check into the first loop over instructions so that we don't bother pushing FrameRefs for instructions that don't want a virtual base register anyway. Lastly, and this is the only functionality change, avoid the creation of single-use virtual base registers. These are currently not useful because, in general, they end up replacing what would be one r+r instruction with an add and a r+i instruction. Committing this removes the XFAIL in CodeGen/PowerPC/2007-09-07-LoadStoreIdxForms.ll Jim has okayed this off-list. llvm-svn: 180799	2013-04-30 20:04:37 +00:00
Bill Wendling	fb7e32ebd6	Emit the TLS initialization function pointers into the correct section. The `llvm.tls_init_funcs' (created by the front-end) holds pointers to the TLS initialization functions. These need to be placed into the correct section so that they are run before `main()'. <rdar://problem/13733006> llvm-svn: 180737	2013-04-29 22:25:40 +00:00
Andrew Trick	85058af650	Generalize the MachineTraceMetrics public API. Naturally, we should be able to pass in extra instructions, not just extra blocks. llvm-svn: 180667	2013-04-27 03:54:20 +00:00
Eric Christopher	203e12bf9e	Use the target triple from the target machine rather than the module to determine whether or not we're on a darwin platform for debug code emitting. Solves the problem of a module with no triple on the command line and no triple in the module using non-gdb ok features on darwin. Fix up the member-pointers test to check the correct things for cross platform (DW_FORM_flag is a good prefix). Unfortunately no testcase because I have no ideas how to test something without a triple and without a triple in the module yet check precisely on two platforms. Ideas welcome. llvm-svn: 180660	2013-04-27 01:07:52 +00:00
Adrian Prantl	d4c0dd4776	Cleanup and document MachineLocation. Clarify documentation and API to make the difference between register and register-indirect addressed locations more explicit. Put in a comment to point out that with the current implementation we cannot specify a register-indirect location with offset 0 (a breg 0 in DWARF). No functionality change intended. rdar://problem/13658587 llvm-svn: 180641	2013-04-26 21:57:17 +00:00
Bill Wendling	55a9c97c9c	Micro-optimization TLVs probably won't be as common as the other types of variables. Check for them last before defaulting to "DATA". llvm-svn: 180631	2013-04-26 21:15:08 +00:00
Silviu Baranga	af7e8c367f	Re-write the address propagation code for pre-indexed loads/stores to take into account some previously misssed cases (PRE_DEC addressing mode, the offset and base address are swapped, etc). This should fix PR15581. llvm-svn: 180609	2013-04-26 15:52:24 +00:00
Benjamin Kramer	d56ffc709d	DAGCombiner: Canonicalize vector integer abs in the same way we do it for scalars. This already helps SSE2 x86 a lot because it lacks an efficient way to represent a vector select. The long term goal is to enable the backend to match a canonicalized pattern into a single instruction (e.g. vabs or pabs). llvm-svn: 180597	2013-04-26 09:19:19 +00:00
Reid Kleckner	d973ca3c51	[mc-coff] Forward Linker Option flags into the .drectve section Summary: This is modelled on the Mach-O linker options implementation and should support a Clang implementation of #pragma comment(lib/linker). Reviewers: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D724 llvm-svn: 180569	2013-04-25 19:34:41 +00:00
Silviu Baranga	4ad2bc5963	Fix constant folding for one lane vector types. Constant folding one lane vector types not returns a vector instead of a scalar. llvm-svn: 180254	2013-04-25 09:32:33 +00:00
Andrew Trick	2e87517144	Fix for r180193 - MI Sched: eliminate local vreg. Fixes PR15838. Need to check for blocks with nothing but dbg.value. I'm not sure how to force this situation with a unit test. I tried to reduce the test case in PR15838 (1k lines of metadata) but gave up. llvm-svn: 180227	2013-04-24 23:19:56 +00:00
Chad Rosier	108d5a61b7	[inline asm] Fix a crasher for an invalid value type/register class. rdar://13731657 llvm-svn: 180226	2013-04-24 22:53:10 +00:00
Andrew Trick	85a1d4cbc0	MI Sched: eliminate local vreg copies. For now, we just reschedule instructions that use the copied vregs and let regalloc elliminate it. I would really like to eliminate the copies on-the-fly during scheduling, but we need a complete implementation of repairIntervalsInRange() first. The general strategy is for the register coalescer to eliminate as many global copies as possible and shrink live ranges to be extended-basic-block local. The coalescer should not have to worry about resolving local copies (e.g. it shouldn't attemp to reorder instructions). The scheduler is a much better place to deal with local interference. The coalescer side of this equation needs work. llvm-svn: 180193	2013-04-24 15:54:43 +00:00
Andrew Trick	608a698cdf	Register Coalescing: add a flag to disable rescheduling. When MachineScheduler is enabled, this functionality can be removed. Until then, provide a way to disable it for test cases and designing MachineScheduler heuristics. llvm-svn: 180192	2013-04-24 15:54:39 +00:00
Andrew Trick	7c791a3dc4	MI Sched: regpressure tracing. llvm-svn: 180191	2013-04-24 15:54:36 +00:00
Eric Christopher	4eb5eb5bc8	Formatting. llvm-svn: 180186	2013-04-24 12:56:18 +00:00
Owen Anderson	2d4cca35c3	DAGCombine should not aggressively fold SEXT(VSETCC(...)) into a wider VSETCC without first checking the target's vector boolean contents. This exposed an issue with PowerPC AltiVec where it appears it was setting the wrong vector boolean contents. The included change fixes the PowerPC tests, and was OK'd by Hal. llvm-svn: 180129	2013-04-23 18:09:28 +00:00
Stephen Lin	6c70dc7842	Add some constraints to use of 'returned': 1) Disallow 'returned' on parameter that is also 'sret' (no sensible semantics, as far as I can tell). 2) Conservatively disallow tail calls through 'returned' parameters that also are 'zext' or 'sext' (for consistency with treatment of other zero-extending and sign-extending operations in tail call position detection...can be revised later to handle situations that can be determined to be safe). This is a new attribute that is not yet used, so there is no impact. llvm-svn: 180118	2013-04-23 16:31:56 +00:00
Matt Arsenault	034ca0fe41	Remove unused DwarfSectionOffsetDirective string The value isn't actually used, and setting it emits a COFF specific directive. llvm-svn: 180064	2013-04-22 22:49:11 +00:00
Eric Christopher	04d4e9312c	Move C++ code out of the C headers and into either C++ headers or the C++ files themselves. This enables people to use just a C compiler to interoperate with LLVM. llvm-svn: 180063	2013-04-22 22:47:22 +00:00
Eli Bendersky	58b04b7e2e	Optimize MachineBasicBlock::getSymbol by caching the symbol. Since the symbol name computation is expensive, this helps save about 25% of the time spent in this function. llvm-svn: 180049	2013-04-22 21:21:08 +00:00
Rafael Espindola	74f2e46eef	Clarify that llvm.used can contain aliases. Also add a check for llvm.used in the verifier and simplify clients now that they can assume they have a ConstantArray. llvm-svn: 180019	2013-04-22 14:58:02 +00:00
Eric Christopher	44c6aa670f	Tidy. llvm-svn: 180000	2013-04-22 07:51:08 +00:00
Eric Christopher	25e3509c78	Update comment. Whitespace. llvm-svn: 179999	2013-04-22 07:47:40 +00:00
David Blaikie	f55abeaf4c	Revert "Revert "PR14606: debug info imported_module support"" This reverts commit r179840 with a fix to test/DebugInfo/two-cus-from-same-file.ll I'm not sure why that test only failed on ARM & MIPS and not X86 Linux, even though the debug info was clearly invalid on all of them, but this ought to fix it. llvm-svn: 179996	2013-04-22 06:12:31 +00:00
Jim Grosbach	563983c8a3	Legalize vector truncates by parts rather than just splitting. Rather than just splitting the input type and hoping for the best, apply a bit more cleverness. Just splitting the types until the source is legal often leads to an illegal result time, which is then widened and a scalarization step is introduced which leads to truly horrible code generation. With the loop vectorizer, these sorts of operations are much more common, and so it's worth extra effort to do them well. Add a legalization hook for the operands of a TRUNCATE node, which will be encountered after the result type has been legalized, but if the operand type is still illegal. If simple splitting of both types ends up with the result type of each half still being legal, just do that (v16i16 -> v16i8 on ARM, for example). If, however, that would result in an illegal result type (v8i32 -> v8i8 on ARM, for example), we can get more clever with power-two vectors. Specifically, split the input type, but also widen the result element size, then concatenate the halves and truncate again. For example on ARM, To perform a "%res = v8i8 trunc v8i32 %in" we transform to: %inlo = v4i32 extract_subvector %in, 0 %inhi = v4i32 extract_subvector %in, 4 %lo16 = v4i16 trunc v4i32 %inlo %hi16 = v4i16 trunc v4i32 %inhi %in16 = v8i16 concat_vectors v4i16 %lo16, v4i16 %hi16 %res = v8i8 trunc v8i16 %in16 This allows instruction selection to generate three VMOVN instructions instead of a sequences of moves, stores and loads. Update the ARMTargetTransformInfo to take this improved legalization into account. Consider the simplified IR: define <16 x i8> @test1(<16 x i32>* %ap) { %a = load <16 x i32>* %ap %tmp = trunc <16 x i32> %a to <16 x i8> ret <16 x i8> %tmp } define <8 x i8> @test2(<8 x i32>* %ap) { %a = load <8 x i32>* %ap %tmp = trunc <8 x i32> %a to <8 x i8> ret <8 x i8> %tmp } Previously, we would generate the truly hideous: .syntax unified .section __TEXT,__text,regular,pure_instructions .globl _test1 .align 2 _test1: @ @test1 @ BB#0: push {r7} mov r7, sp sub sp, sp, #20 bic sp, sp, #7 add r1, r0, #48 add r2, r0, #32 vld1.64 {d24, d25}, [r0:128] vld1.64 {d16, d17}, [r1:128] vld1.64 {d18, d19}, [r2:128] add r1, r0, #16 vmovn.i32 d22, q8 vld1.64 {d16, d17}, [r1:128] vmovn.i32 d20, q9 vmovn.i32 d18, q12 vmov.u16 r0, d22[3] strb r0, [sp, #15] vmov.u16 r0, d22[2] strb r0, [sp, #14] vmov.u16 r0, d22[1] strb r0, [sp, #13] vmov.u16 r0, d22[0] vmovn.i32 d16, q8 strb r0, [sp, #12] vmov.u16 r0, d20[3] strb r0, [sp, #11] vmov.u16 r0, d20[2] strb r0, [sp, #10] vmov.u16 r0, d20[1] strb r0, [sp, #9] vmov.u16 r0, d20[0] strb r0, [sp, #8] vmov.u16 r0, d18[3] strb r0, [sp, #3] vmov.u16 r0, d18[2] strb r0, [sp, #2] vmov.u16 r0, d18[1] strb r0, [sp, #1] vmov.u16 r0, d18[0] strb r0, [sp] vmov.u16 r0, d16[3] strb r0, [sp, #7] vmov.u16 r0, d16[2] strb r0, [sp, #6] vmov.u16 r0, d16[1] strb r0, [sp, #5] vmov.u16 r0, d16[0] strb r0, [sp, #4] vldmia sp, {d16, d17} vmov r0, r1, d16 vmov r2, r3, d17 mov sp, r7 pop {r7} bx lr .globl _test2 .align 2 _test2: @ @test2 @ BB#0: push {r7} mov r7, sp sub sp, sp, #12 bic sp, sp, #7 vld1.64 {d16, d17}, [r0:128] add r0, r0, #16 vld1.64 {d20, d21}, [r0:128] vmovn.i32 d18, q8 vmov.u16 r0, d18[3] vmovn.i32 d16, q10 strb r0, [sp, #3] vmov.u16 r0, d18[2] strb r0, [sp, #2] vmov.u16 r0, d18[1] strb r0, [sp, #1] vmov.u16 r0, d18[0] strb r0, [sp] vmov.u16 r0, d16[3] strb r0, [sp, #7] vmov.u16 r0, d16[2] strb r0, [sp, #6] vmov.u16 r0, d16[1] strb r0, [sp, #5] vmov.u16 r0, d16[0] strb r0, [sp, #4] ldm sp, {r0, r1} mov sp, r7 pop {r7} bx lr Now, however, we generate the much more straightforward: .syntax unified .section __TEXT,__text,regular,pure_instructions .globl _test1 .align 2 _test1: @ @test1 @ BB#0: add r1, r0, #48 add r2, r0, #32 vld1.64 {d20, d21}, [r0:128] vld1.64 {d16, d17}, [r1:128] add r1, r0, #16 vld1.64 {d18, d19}, [r2:128] vld1.64 {d22, d23}, [r1:128] vmovn.i32 d17, q8 vmovn.i32 d16, q9 vmovn.i32 d18, q10 vmovn.i32 d19, q11 vmovn.i16 d17, q8 vmovn.i16 d16, q9 vmov r0, r1, d16 vmov r2, r3, d17 bx lr .globl _test2 .align 2 _test2: @ @test2 @ BB#0: vld1.64 {d16, d17}, [r0:128] add r0, r0, #16 vld1.64 {d18, d19}, [r0:128] vmovn.i32 d16, q8 vmovn.i32 d17, q9 vmovn.i16 d16, q8 vmov r0, r1, d16 bx lr llvm-svn: 179989	2013-04-21 23:47:41 +00:00
Jim Grosbach	d4db72db61	Tidy up comment grammar. llvm-svn: 179986	2013-04-21 21:23:01 +00:00
Tim Northover	16aba17024	Remove unused ShouldFoldAtomicFences flag. I think it's almost impossible to fold atomic fences profitably under LLVM/C++11 semantics. As a result, this is now unused and just cluttering up the target interface. llvm-svn: 179940	2013-04-20 12:32:43 +00:00
Tim Northover	a2b533906a	Remove unused MEMBARRIER DAG node; it's been replaced by ATOMIC_FENCE. llvm-svn: 179939	2013-04-20 12:32:17 +00:00
Stephen Lin	b8bd232a3d	Add CodeGen support for functions that always return arguments via a new parameter attribute 'returned', which is taken advantage of in target-independent tail call opportunity detection and in ARM call lowering (when placed on an integral first parameter). llvm-svn: 179925	2013-04-20 05:14:40 +00:00
Stephen Lin	ffc445492c	Allow tail call opportunity detection through nested and/or multiple iterations of extractelement/insertelement indirection llvm-svn: 179924	2013-04-20 04:27:51 +00:00
Eli Bendersky	e80691dc0a	Simplify the code in FastISel::tryToFoldLoad, add an assertion and fix a comment. llvm-svn: 179908	2013-04-19 23:26:18 +00:00
Eli Bendersky	90dd3e7dfd	Move TryToFoldFastISelLoad to FastISel, where it belongs. In general, I'm trying to move as much FastISel logic as possible out of the main path in SelectionDAGISel - intermixing them just adds confusion. llvm-svn: 179902	2013-04-19 22:29:18 +00:00
Michael Liao	b53d8963ce	ArrayRefize getMachineNode(). No functionality change. llvm-svn: 179901	2013-04-19 22:22:57 +00:00
Jakob Stoklund Olesen	e17c3fde6b	Add an MRI::verifyUseLists() function. This checks the sanity of the register use lists in the MI intermediate representation. llvm-svn: 179895	2013-04-19 21:40:57 +00:00
Eli Bendersky	dbeefaa86a	Use dbgs() consistently for -debug printouts llvm-svn: 179894	2013-04-19 21:37:07 +00:00
Eric Christopher	0e89ade8ff	Revert "PR14606: debug info imported_module support" This reverts commit r179836 as it seems to have caused test failures. llvm-svn: 179840	2013-04-19 07:47:16 +00:00
David Blaikie	88564f3cf7	PR14606: debug info imported_module support Adding another CU-wide list, in this case of imported_modules (since they should be relatively rare, it seemed better to add a list where each element had a "context" value, rather than add a (usually empty) list to every scope). This takes care of DW_TAG_imported_module, but to fully address PR14606 we'll need to expand this to cover DW_TAG_imported_declaration too. llvm-svn: 179836	2013-04-19 06:57:04 +00:00
Eli Bendersky	6084f45f38	Add some more stats for fast isel vs. SelectionDAG, w.r.t lowering function arguments in entry BBs. llvm-svn: 179824	2013-04-19 01:04:40 +00:00
Peter Collingbourne	2f495b93ee	Add support for subsections to the ELF assembler. Fixes PR8717. Differential Revision: http://llvm-reviews.chandlerc.com/D598 llvm-svn: 179725	2013-04-17 21:18:16 +00:00
Andy Gibbs	b23ea72e48	Replace uses of the deprecated std::auto_ptr with OwningPtr. This is a rework of the broken parts in r179373 which were subsequently reverted in r179374 due to incompatibility with C++98 compilers. This version should be ok under C++98. llvm-svn: 179520	2013-04-15 12:06:32 +00:00
Nadav Rotem	0db0690a70	Document the decision to assume that the cost of floats is twice as much as integers. llvm-svn: 179478	2013-04-14 05:55:18 +00:00
Andrew Trick	1f0bb69b6c	MI-Sched: DEBUG formatting. llvm-svn: 179452	2013-04-13 06:07:49 +00:00
Andrew Trick	be2bccbce9	MI-Sched cleanup. If an instruction has no valid sched class, do not attempt to check for a variant. llvm-svn: 179451	2013-04-13 06:07:45 +00:00
Andrew Trick	e833e1cd6e	MI-Sched: schedule physreg copies. The register allocator expects minimal physreg live ranges. Schedule physreg copies accordingly. This is slightly tricky when they occur in the middle of the scheduling region. For now, this is handled by rescheduling the copy when its associated instruction is scheduled. Eventually we may instead bundle them, but only if we can preserve the bundles as parallel copies during regalloc. llvm-svn: 179449	2013-04-13 06:07:40 +00:00
Nadav Rotem	87a0af6e1b	CostModel: increase the default cost of supported floating point operations from 1 to two. Fixed a few tests that changes because now the cost of one insert + a vector operation on two doubles is lower than two scalar operations on doubles. llvm-svn: 179413	2013-04-12 21:15:03 +00:00
Benjamin Kramer	dae0851237	Revert broken pieces of r179373. You can't copy an OwningPtr, and move semantics aren't available in C++98. llvm-svn: 179374	2013-04-12 12:13:51 +00:00
Andy Gibbs	95777550a9	Replace uses of the deprecated std::auto_ptr with OwningPtr. llvm-svn: 179373	2013-04-12 10:56:28 +00:00
Nadav Rotem	c0adc9fd91	Don't disable block layout when forcing block alignment. llvm-svn: 179355	2013-04-12 01:24:16 +00:00
Nadav Rotem	c3b0f50ac2	Add a flag to align all basic blocks in the function. When debugging performance regressions we often ask ourselves if the regression that we see is due to poor isel/sched/ra or due to some micro-architetural problem. When comparing two code sequences one good way to rule out front-end bottlenecks (and other the issues) is to force code alignment. This pass adds a flag that forces the alignment of all of the basic blocks in the program. llvm-svn: 179353	2013-04-12 00:48:32 +00:00
Benjamin Kramer	e7c45bc670	Add braces around \|\| in && to pacify GCC. llvm-svn: 179275	2013-04-11 11:57:01 +00:00
Hal Finkel	95081bff72	Manually remove successors in if conversion when CopyAndPredicateBlock is used In the simple and triangle if-conversion cases, when CopyAndPredicateBlock is used because the to-be-predicated block has other predecessors, we need to explicitly remove the old copied block from the successors list. Normally if conversion relies on TII->AnalyzeBranch combined with BB->CorrectExtraCFGEdges to cleanup the successors list, but if the predicated block contained an un-analyzable branch (such as a now-predicated return), then this will fail. These extra successors were causing a problem on PPC because it was causing later passes (such as PPCEarlyReturm) to leave dead return-only basic blocks in the code. llvm-svn: 179227	2013-04-10 22:05:25 +00:00
Andrew Trick	e220323c7f	Generalize the PassConfig API and remove addFinalizeRegAlloc(). The target hooks are getting out of hand. What does it mean to run before or after regalloc anyway? Allowing either Pass* or AnalysisID pass identification should make it much easier for targets to use the substitutePass and insertPass APIs, and create less need for badly named target hooks. llvm-svn: 179140	2013-04-10 01:06:56 +00:00
Eric Christopher	52ce7189c1	The .dwo section shouldn't contain the unrelocated values (and therefore not at all) of the pc or statement list. We also don't need to emit the compilation dir so save so space and time and don't bother. Fix up the testcase accordingly and verify that we don't emit the attributes or the items that they use. llvm-svn: 179114	2013-04-09 19:23:15 +00:00
Benjamin Kramer	bbae991db6	DAGCombiner: Fold a shuffle on CONCAT_VECTORS into a new CONCAT_VECTORS if possible. This pattern occurs in SROA output due to the way vector arguments are lowered on ARM. The testcase from PR15525 now compiles into this, which is better than the code we got with the old scalarrepl: _Store: ldr.w r9, [sp] vmov d17, r3, r9 vmov d16, r1, r2 vst1.8 {d16, d17}, [r0] bx lr Differential Revision: http://llvm-reviews.chandlerc.com/D647 llvm-svn: 179106	2013-04-09 17:41:43 +00:00
Eric Christopher	55863befd1	DW_FORM_sec_offset should be a relocation on platforms that use a relocation across sections. Do this for DW_AT_stmt list in the skeleton CU and check the relocations in the debug_info section. Add a FIXME for multiple CUs. llvm-svn: 178969	2013-04-07 03:43:09 +00:00
Nadav Rotem	c4bd84c1d5	typo llvm-svn: 178949	2013-04-06 04:24:12 +00:00
Manman Ren	5b22f9fe18	Dwarf: use utostr on CUID to append to SmallString. We used to do "SmallString += CUID", which is incorrect, since CUID will be truncated to a char. rdar://problem/13573833 llvm-svn: 178941	2013-04-06 01:02:38 +00:00
Hal Finkel	3005c299b5	Reapply r178845 with fix - Fix bug in PEI's virtual-register scavenging This fixes PEI as previously described, but correctly handles the case where the instruction defining the virtual register to be scavenged is the first in the block. Arnold provided me with a bugpoint-reduced test case, but even that seems too large to use as a regression test. If I'm successful in cleaning it up then I'll commit that as well. Original commit message: This change fixes a bug that I introduced in r178058. After a register is scavenged using one of the available spills slots the instruction defining the virtual register needs to be moved to after the spill code. The scavenger has already processed the defining instruction so that registers killed by that instruction are available for definition in that same instruction. Unfortunately, after this, the scavenger needs to iterate through the spill code and then visit, again, the instruction that defines the now-scavenged register. In order to avoid confusion, the register scavenger needs the ability to 'back up' through the spill code so that it can again process the instructions in the appropriate order. Prior to this fix, once the scavenger reached the just-moved instruction, it would assert if it killed any registers because, having already processed the instruction, it believed they were undefined. Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar for diagnosing the problem and testing this fix. llvm-svn: 178919	2013-04-05 22:31:56 +00:00
Bill Wendling	eb108bad50	Use the target options specified on a function to reset the back-end. During LTO, the target options on functions within the same Module may change. This would necessitate resetting some of the back-end. Do this for X86, because it's a Friday afternoon. llvm-svn: 178917	2013-04-05 21:52:40 +00:00
Hal Finkel	81c46d0809	Revert r178845 - Fix bug in PEI's virtual-register scavenging Reverting because this breaks one of the LTO builders. Original commit message: This change fixes a bug that I introduced in r178058. After a register is scavenged using one of the available spills slots the instruction defining the virtual register needs to be moved to after the spill code. The scavenger has already processed the defining instruction so that registers killed by that instruction are available for definition in that same instruction. Unfortunately, after this, the scavenger needs to iterate through the spill code and then visit, again, the instruction that defines the now-scavenged register. In order to avoid confusion, the register scavenger needs the ability to 'back up' through the spill code so that it can again process the instructions in the appropriate order. Prior to this fix, once the scavenger reached the just-moved instruction, it would assert if it killed any registers because, having already processed the instruction, it believed they were undefined. Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar for diagnosing the problem and testing this fix. llvm-svn: 178916	2013-04-05 21:30:40 +00:00
Hal Finkel	e6f48e4e2f	Fix bug in PEI's virtual-register scavenging This change fixes a bug that I introduced in r178058. After a register is scavenged using one of the available spills slots the instruction defining the virtual register needs to be moved to after the spill code. The scavenger has already processed the defining instruction so that registers killed by that instruction are available for definition in that same instruction. Unfortunately, after this, the scavenger needs to iterate through the spill code and then visit, again, the instruction that defines the now-scavenged register. In order to avoid confusion, the register scavenger needs the ability to 'back up' through the spill code so that it can again process the instructions in the appropriate order. Prior to this fix, once the scavenger reached the just-moved instruction, it would assert if it killed any registers because, having already processed the instruction, it believed they were undefined. Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar for diagnosing the problem and testing this fix. llvm-svn: 178845	2013-04-05 05:01:13 +00:00
Andrew Trick	80e66ce0b4	RegisterPressure heuristics currently require signed comparisons. llvm-svn: 178823	2013-04-05 00:31:34 +00:00
Andrew Trick	96ce3848d6	Disable DFSResult for ConvergingScheduler. For now, just save the compile time since the ConvergingScheduler heuristics don't use this analysis. We'll probably enable it later after compile-time investigation. llvm-svn: 178822	2013-04-05 00:31:31 +00:00
Andrew Trick	419d491747	MachineScheduler: format DEBUG output. I'm getting more serious about tuning and enabling on x86/ARM. Start by making the trace readable. llvm-svn: 178821	2013-04-05 00:31:29 +00:00
Arnold Schwaighofer	b977387112	CostModel: Add parameter to instruction cost to further classify operand values On certain architectures we can support efficient vectorized version of instructions if the operand value is uniform (splat) or a constant scalar. An example of this is a vector shift on x86. We can efficiently support for (i = 0 ; i < ; i += 4) w[0:3] = v[0:3] << <2, 2, 2, 2> but not for (i = 0; i < ; i += 4) w[0:3] = v[0:3] << x[0:3] This patch adds a parameter to getArithmeticInstrCost to further qualify operand values as uniform or uniform constant. Targets can then choose to return a different cost for instructions with such operand values. A follow-up commit will test this feature on x86. radar://13576547 llvm-svn: 178807	2013-04-04 23:26:21 +00:00
Manman Ren	bdcb4464e2	Debug Info: revert 178722 for now. There is a difference for FORM_ref_addr between DWARF 2 and DWARF 3+. Since Eric is against guarding DWARF 2 ref_addr with DarwinGDBCompat, we are still in discussion on how to handle this. The correct solution is to update our header to say version 4 instead of version 2 and update tool chains as well. rdar://problem/13559431 llvm-svn: 178806	2013-04-04 23:13:11 +00:00
Adrian Prantl	322f41d095	typo llvm-svn: 178804	2013-04-04 22:56:49 +00:00
Eli Bendersky	fc186358f2	Formatting llvm-svn: 178771	2013-04-04 18:03:41 +00:00
Manman Ren	5a15c9ed9f	Debug Info: according to DWARF 2, FORM_ref_addr the same size as an address on the target system. It was hard-coded to 4 bytes before. I can't get llvm to generate a ref_addr on a reasonably sized testing case. rdar://problem/13559431 llvm-svn: 178722	2013-04-04 00:22:54 +00:00
Bill Schmidt	92e26646bc	Fix PR15632: No support for ppcf128 floating-point remainder on PowerPC. For this we need to use a libcall. Previously LLVM didn't implement libcall support for frem, so I've added it in the usual straightforward manner. A test case from the bug report is included. llvm-svn: 178639	2013-04-03 13:05:44 +00:00
Eric Christopher	14c2067ca1	Fix grammar. llvm-svn: 178624	2013-04-03 05:29:58 +00:00
Eric Christopher	5590949f29	Remove ZeroOrMore from the option description. We don't need it here. llvm-svn: 178623	2013-04-03 05:26:07 +00:00
Jakob Stoklund Olesen	aeb69a5481	Allow MachineTraceMetrics to be used when the model has no resources. It it still possible to extract information from itineraries, for example. llvm-svn: 178582	2013-04-02 22:27:45 +00:00
Jakob Stoklund Olesen	8fbfc59164	Don't attempt MTM heuristics without a scheduling model present. This should fix the PPC buildbots. llvm-svn: 178558	2013-04-02 18:26:45 +00:00
Jakob Stoklund Olesen	3ca14772d0	Count processor resources individually in MachineTraceMetrics. The new instruction scheduling models provide information about the number of cycles consumed on each processor resource. This makes it possible to estimate ILP more accurately than simply counting instructions / issue width. The functions getResourceDepth() and getResourceLength() now identify the limiting processor resource, and return a cycle count based on that. This gives more precise resource information, particularly in traces that use one resource a lot more than others. llvm-svn: 178553	2013-04-02 17:49:51 +00:00
Arnold Schwaighofer	d6c6e868b2	DAGCombiner: Merge store/loads when we have extload/truncstores This is helps on architectures where i8,i16 are not legal but we have byte, and short loads/stores. Allowing us to merge copies like the one below on ARM. copy(char a, char b, int n) { do { int t0 = a[0]; int t1 = a[1]; b[0] = t0; b[1] = t1; radar://13536387 llvm-svn: 178546	2013-04-02 15:58:51 +00:00
Arnold Schwaighofer	6752366ed7	Merge load/store sequences with adresses: base + index + offset We would also like to merge sequences that involve a variable index like in the example below. int index = *idx++ int i0 = c[index+0]; int i1 = c[index+1]; b[0] = i0; b[1] = i1; By extending the parsing of the base pointer to handle dags that contain a base, index, and offset we can handle examples like the one above. The dag for the code above will look something like: (load (i64 add (i64 copyfromreg %c) (i64 signextend (i8 load %index)))) (load (i64 add (i64 copyfromreg %c) (i64 signextend (i32 add (i32 signextend (i8 load %index)) (i32 1))))) The code that parses the tree ignores the intermediate sign extensions. However, if there is a sign extension it needs to be on all indexes. (load (i64 add (i64 copyfromreg %c) (i64 signextend (add (i8 load %index) (i8 1)))) vs (load (i64 add (i64 copyfromreg %c) (i64 signextend (i32 add (i32 signextend (i8 load %index)) (i32 1))))) radar://13536387 llvm-svn: 178483	2013-04-01 18:12:58 +00:00
Benjamin Kramer	9335443236	DAGCombine: visitXOR can replace a node without returning it, bail out in that case. Fixes the crash reported in PR15608. llvm-svn: 178429	2013-03-30 21:28:18 +00:00
Eric Christopher	4887c8f4ff	Use SmallVectorImpl instead of SmallVector at the uses. llvm-svn: 178386	2013-03-29 23:34:06 +00:00
Eric Christopher	9c8414f84a	Use 12 as the magic number for our abbreviation data and our die values. A lot of DIEs have 10 attributes in C++ code (example clang), none had more than 12. Seems like a good default. llvm-svn: 178366	2013-03-29 20:23:06 +00:00
Eric Christopher	6be35037b5	Move the construction of the skeleton compile unit after the entire original compile unit has been constructed. llvm-svn: 178365	2013-03-29 20:23:02 +00:00
Benjamin Kramer	70671b9937	Remove the old CodePlacementOpt pass. It was superseded by MachineBlockPlacement and disabled by default since LLVM 3.1. llvm-svn: 178349	2013-03-29 17:14:24 +00:00
Nadav Rotem	6036f581aa	Fix a typo llvm-svn: 178346	2013-03-29 16:34:23 +00:00
Chad Rosier	dbac025d84	[fast-isel] Add a preemptive fix for the case where we fail to materialize an immediate in a register. I don't believe this should ever fail, but I see no harm in trying to make this code bullet proof. I've added an assert to ensure my assumtion is correct. If the assertion fires something is wrong and we should fix it, rather then just silently fall back to SelectionDAG isel. llvm-svn: 178305	2013-03-28 23:04:47 +00:00
Hal Finkel	35dd5c5932	Fix target-customized spilling in the register scavenger This is a follow-up to r178073 (which should actually make target-customized spilling work again). I still don't have a regression test for this (but it would be good to have one; Thumb 1 and Mips16 use this callback as well). Patch by Richard Sandiford. llvm-svn: 178137	2013-03-27 13:00:56 +00:00
Hal Finkel	1fa2f945ea	Fix the register scavenger for targets that provide custom spilling As pointed out by Richard Sandiford, my recent updates to the register scavenger broke targets that use custom spilling (because the new code assumed that if there were no valid spill slots, than spilling would be impossible). I don't have a test case, but it should be possible to create one for Thumb 1, Mips 16, etc. llvm-svn: 178073	2013-03-26 21:20:15 +00:00
Hal Finkel	4e05788cc3	Update PEI's virtual-register-based scavenging to support multiple simultaneous mappings The previous algorithm could not deal properly with scavenging multiple virtual registers because it kept only one live virtual -> physical mapping (and iterated through operands in order). Now we don't maintain a current mapping, but rather use replaceRegWith to completely remove the virtual register as soon as the mapping is established. In order to allow the register scavenger to return a physical register killed by an instruction for definition by that same instruction, we now call RS->forward(I) prior to eliminating virtual registers defined in I. This requires a minor update to forward to ignore virtual registers. These new features will be tested in forthcoming commits. llvm-svn: 178058	2013-03-26 18:56:54 +00:00
Michael Liao	bb05a1d7b5	Enhance folding of (extract_subvec (insert_subvec V1, V2, IIdx), EIdx) - Handle the case where the result of 'insert_subvect' is bitcasted before 'extract_subvec'. This removes the redundant insertf128/extractf128 pair on unaligned 256-bit vector load/store on vectors of non 64-bit integer. llvm-svn: 177945	2013-03-25 23:47:35 +00:00
Shuxin Yang	93b1f12ac1	Disable some unsafe-fp-math DAG-combine transformation after legalization. For instance, following transformation will be disabled: x + x + x => 3.0f * x; The problem of these transformations is that it introduces a FP constant, which following Instruction-Selection pass cannot handle. Reviewed by Nadav, thanks a lot! rdar://13445387 llvm-svn: 177933	2013-03-25 22:52:29 +00:00
Eric Christopher	3820184a8d	Couple more sets of tidying. llvm-svn: 177920	2013-03-25 21:26:36 +00:00
Eric Christopher	7f44037c2e	Formatting. llvm-svn: 177898	2013-03-25 20:05:35 +00:00
Duncan Sands	d58611a4cd	Teach cmake about the new Erlang GC files. llvm-svn: 177869	2013-03-25 14:12:21 +00:00
Yiannis Tsiouris	dbb4adf134	Add a GC plugin for Erlang llvm-svn: 177867	2013-03-25 13:47:46 +00:00
Owen Anderson	c81616b0a9	Remove the type legality check from the SelectionDAGBuilder when it lowers @llvm.fmuladd to ISD::FMA nodes. Performing this check unilaterally prevented us from generating FMAs when the incoming IR contained illegal vector types which would eventually be legalized to underlying types that did support FMA. For example, an @llvm.fmuladd on an OpenCL float16 should become a sequence of float4 FMAs, not float4 fmul+fadd's. NOTE: Because we still call the target-specific profitability hook, individual targets can reinstate the old behavior, if desired, by simply performing the legality check inside their callback hook. They can also perform more sophisticated legality checks, if, for example, some illegal vector types can be productively implemented as FMAs, but not others. llvm-svn: 177820	2013-03-23 08:26:53 +00:00
Hal Finkel	446122ed57	Fix comparison of mixed signedness 177774 broke the lld-x86_64-darwin11 builder; error: error: comparison of integers of different signs: 'int' and 'size_type' (aka 'unsigned long') for (SI = 0; SI < Scavenged.size(); ++SI) ~~ ^ ~~~~~~~~~~~~~~~~ Fix this by making SI also unsigned. llvm-svn: 177780	2013-03-23 00:07:29 +00:00
Hal Finkel	9e331c2f9c	Allow the register scavenger to spill multiple registers This patch lets the register scavenger make use of multiple spill slots in order to guarantee that it will be able to provide multiple registers simultaneously. To support this, the RS's API has changed slightly: setScavengingFrameIndex / getScavengingFrameIndex have been replaced by addScavengingFrameIndex / isScavengingFrameIndex / getScavengingFrameIndices. In forthcoming commits, the PowerPC backend will use this capability in order to implement the spilling of condition registers, and some special-purpose registers, without relying on r0 being reserved. In some cases, spilling these registers requires two GPRs: one for addressing and one to hold the value being transferred. llvm-svn: 177774	2013-03-22 23:32:27 +00:00
Hal Finkel	7dbe0f0636	Remove ScavengedRC from RegisterScavenging ScavengedRC was a dead private variable (set, but not otherwise used). No functionality change intended. llvm-svn: 177708	2013-03-22 07:27:44 +00:00
Matt Arsenault	4ab769f4b3	Fix missing std::. Not sure how this compiles for anyone else. llvm-svn: 177620	2013-03-21 00:57:21 +00:00
Justin Holewinski	7478f3d776	Make variable name more explicit and eliminate redundant lookup in SDNodeOrdering llvm-svn: 177600	2013-03-20 23:10:59 +00:00
Nadav Rotem	4536d582fd	When computing the demanded bits of Load SDNodes, make sure that we are looking at the loaded-value operand and not the ptr result (in case of pre-inc loads). rdar://13348420 llvm-svn: 177596	2013-03-20 22:53:44 +00:00
Christian Konig	ed34d0ef1a	Revert "pre-RA-sched: fix TargetOpcode usage" This reverts commit 06091513c283c863296f01cc7c2e86b56bb50d02. The code is obviously wrong, but the trivial fix causes inefficient code generation on X86. Somebody with more knowledge of the code needs to take a look here. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 177529	2013-03-20 15:43:00 +00:00
Justin Holewinski	c2d2c8939c	Move SDNode order propagation to SDNodeOrdering, which also fixes a missed case of order propagation during isel. Thanks Owen for the suggestion! llvm-svn: 177525	2013-03-20 14:51:01 +00:00
Christian Konig	9ce2d5b862	pre-RA-sched: fix TargetOpcode usage TargetOpcodes need to be treaded as Machine- and not ISD-Opcodes. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 177518	2013-03-20 13:49:22 +00:00
Justin Holewinski	d068943809	Propagate DAG node ordering during type legalization and instruction selection A node's ordering is only propagated during legalization if (a) the new node does not have an ordering (is not a CSE'd node), or (b) the new node has an ordering that is higher than the node being legalized. llvm-svn: 177465	2013-03-20 00:10:32 +00:00
Jakub Staszak	b6970267ad	Move #include of BitVector from .h to .cpp file. Also remove unneeded #include and forward declaration. llvm-svn: 177357	2013-03-18 23:45:45 +00:00
Jakub Staszak	26ac8a7b16	Add some constantness. llvm-svn: 177356	2013-03-18 23:40:46 +00:00

... 11 12 13 14 15 ...

16046 Commits