llvm-project

Commit Graph

Author	SHA1	Message	Date
Adrian Prantl	ef129fbb41	Debug info (LTO): Move the creation of accessibility flags to getOrCreateSubprogramDIE to avoid attributes being added twice when DIEs are merged. rdar://problem/15842330. llvm-svn: 199536	2014-01-18 02:12:00 +00:00
Rafael Espindola	0b694814a8	Add an emitRawComment function and use it to simplify some uses of EmitRawText. llvm-svn: 199397	2014-01-16 16:28:37 +00:00
Tim Northover	3657cb0350	ReMat: fix overly cavalier attitude to sub-register indices There are two attempted optimisations in reMaterializeTrivialDef, trying to avoid promoting the size of a register too much when rematerializing. Unfortunately, both appear to be flawed. First, we see if the original register would have worked, but this is inadequate. Consider: v1 = SOMETHING (v1 is QQ) v2:Q0 = COPY v1:Q1 (v1, v2 are QQ) ... uses of v2 In this case even though v2 could be used directly as the output of SOMETHING, this would set the wrong bits of the QQ register involved. The correct rematerialization must be: v2:Q0_Q1 = SOMETHING (v2 promoted to QQQ) ... uses of v2:Q1_Q2 For the second optimisation, if the correct remat is "v2:idx = SOMETHING" then we can't necessarily expect v2 itself to be valid for SOMETHING, but we do try to hunt for a class between v1 and v2 that works. Unfortunately, this is also wrong: v1 = SOMETHING (v1 is QQ) v2:Q0_Q1 = COPY v1 (v1 is QQ, v2 is QQQ) ... uses of v2 as a QQQ The canonical rematerialization here is "v2:Q0_Q1 = SOMETHING". However current logic would decide that v2 could be a QQ (no interest is taken in later uses). This patch, therefore, always accepts the widened register class without trying to be clever. Generally there is no penalty to this (e.g. in the common GR32 < GR64 case, expanding the width doesn't matter because it's not like you were going to do anything else with the high bits of a GR32 register). It can increase register pressure in cases like the ARM VFP regs though (multiple non-overlapping but equivalent subregisters). This situation can be spotted by the fact that both source and destination in the not-quite-coalesced pair have a sub-register index and rematerialisation is skipped in that situation. Unfortunately, no in-tree targets actually expose this as far as I can tell (there are so few isAsCheapAsAMove instructions for it to trigger on) so I've been unable to produce a test. It was exposed in our ARM64 SPEC tests though, and I will be adding a test there that we should be able to contribute soon(TM). rdar://problem/15775279 llvm-svn: 199376	2014-01-16 12:29:55 +00:00
Rafael Espindola	74c3e63193	Use a slightly smaller hack. llvm-svn: 199363	2014-01-16 07:36:00 +00:00
Andrea Di Biagio	d7c03ec348	[DAGCombiner] Fix a wrong check in method SimplifyVBinOp. This fixes a regression intruced by r199135. Revision 199135 tried to simplify part of the logic in method DAGCombiner::SimplifyVBinOp introducing calls to method BuildVectorSDNode::isConstant(). However, that revision wrongly changed the check performed by method SimplifyVBinOp to identify dag nodes that can be folded. Before revision 199135, that method only tried to simplify vector binary operations if both operands were build_vector of Constant/ConstantFP/Undef only. After revision 199135, method SimplifyVBinop tried to simplify also vector binary operations with only one constant operand. This fixes the problem restoring the old behavior of SimplifyVBinOp. llvm-svn: 199328	2014-01-15 19:51:32 +00:00
David Majnemer	dee105772c	WinCOFF: Transform IR expressions featuring __ImageBase into image relative relocations MSVC on x64 requires that we create image relative symbol references to refer to RTTI data. Seeing as how there is no way to explicitly make reference to a given relocation type in LLVM IR, pattern match expressions of the form &foo - &__ImageBase. Differential Revision: http://llvm-reviews.chandlerc.com/D2523 llvm-svn: 199312	2014-01-15 09:16:42 +00:00
Eric Christopher	1ad8457570	Make sure we emit a relocation to the debug_ranges section in the presence of CU ranges. llvm-svn: 199276	2014-01-15 00:04:29 +00:00
Eric Christopher	39cde8cc90	Enable use of ranges for translation units in the presence of -ffunction-sections and update comments and TODOs about other places that we should enable this. llvm-svn: 199263	2014-01-14 22:44:17 +00:00
Nico Rieck	7157bb765e	Decouple dllexport/dllimport from linkage Representing dllexport/dllimport as distinct linkage types prevents using these attributes on templates and inline functions. Instead of introducing further mixed linkage types to include linkonce and weak ODR, the old import/export linkage types are replaced with a new separate visibility-like specifier: define available_externally dllimport void @f() {} @Var = dllexport global i32 1, align 4 Linkage for dllexported globals and functions is now equal to their linkage without dllexport. Imported globals and functions must be either declarations with external linkage, or definitions with AvailableExternallyLinkage. llvm-svn: 199218	2014-01-14 15:22:47 +00:00
Patrik Hagglund	682a10d4cc	Fix valgrind warning for gcc builds. Sorry, I don't understand why the warning is generated (a gcc bug?). Anyhow, the change should improve readablity. No functionality change intended. llvm-svn: 199214	2014-01-14 14:09:00 +00:00
Nico Rieck	9d2e0df049	Revert "Decouple dllexport/dllimport from linkage" Revert this for now until I fix an issue in Clang with it. This reverts commit r199204. llvm-svn: 199207	2014-01-14 12:38:32 +00:00
Nico Rieck	e43aaf7967	Decouple dllexport/dllimport from linkage Representing dllexport/dllimport as distinct linkage types prevents using these attributes on templates and inline functions. Instead of introducing further mixed linkage types to include linkonce and weak ODR, the old import/export linkage types are replaced with a new separate visibility-like specifier: define available_externally dllimport void @f() {} @Var = dllexport global i32 1, align 4 Linkage for dllexported globals and functions is now equal to their linkage without dllexport. Imported globals and functions must be either declarations with external linkage, or definitions with AvailableExternallyLinkage. llvm-svn: 199204	2014-01-14 11:55:03 +00:00
Jakob Stoklund Olesen	b6b35a4955	Always let value types influence register classes. When creating a virtual register for a def, the value type should be used to pick the register class. If we only use the register class constraint on the instruction, we might pick a too large register class. Some registers can store values of different sizes. For example, the x86 xmm registers can hold f32, f64, and 128-bit vectors. The three different value sizes are represented by register classes with identical register sets: FR32, FR64, and VR128. These register classes have different spill slot sizes, so it is important to use the right one. The register class constraint on an instruction doesn't necessarily care about the size of the value its defining. The value type determines that. This fixes a problem where InstrEmitter was picking 32-bit register classes for 64-bit values on SPARC. llvm-svn: 199187	2014-01-14 06:18:38 +00:00
Rafael Espindola	4a1a360634	Make getTargetStreamer return a possibly null pointer. This will allow it to be called from target independent parts of the main streamer that don't know if there is a registered target streamer or not. This in turn will allow targets to perform extra actions at specified points in the interface: add extra flags for some labels, extra work during finalization, etc. llvm-svn: 199174	2014-01-14 01:21:46 +00:00
Juergen Ributzka	6840282c99	[DAG] Refactor ReassociateOps - no functional change intended. llvm-svn: 199146	2014-01-13 21:49:25 +00:00
Juergen Ributzka	7384405f23	[DAG] Teach DAG to also reassociate vector operations This commit teaches DAG to reassociate vector ops, which in turn enables constant folding of vector op chains that appear later on during custom lowering and DAG combine. Reviewed by Andrea Di Biagio llvm-svn: 199135	2014-01-13 20:51:35 +00:00
Andrew Trick	7daf6a45f4	Hide the pre-RA-sched= option. This is a very confusing option for a feature that will go away. -enable-misched is exposed instead to help triage issues with the new scheduler. llvm-svn: 199133	2014-01-13 20:08:27 +00:00
Chandler Carruth	73523021d0	[PM] Split DominatorTree into a concrete analysis result object which can be used by both the new pass manager and the old. This removes it from any of the virtual mess of the pass interfaces and lets it derive cleanly from the DominatorTreeBase<> template. In turn, tons of boilerplate interface can be nuked and it turns into a very straightforward extension of the base DominatorTree interface. The old analysis pass is now a simple wrapper. The names and style of this split should match the split between CallGraph and CallGraphWrapperPass. All of the users of DominatorTree have been updated to match using many of the same tricks as with CallGraph. The goal is that the common type remains the resulting DominatorTree rather than the pass. This will make subsequent work toward the new pass manager significantly easier. Also in numerous places things became cleaner because I switched from re-running the pass (!!! mid way through some other passes run!!!) to directly recomputing the domtree. llvm-svn: 199104	2014-01-13 13:07:17 +00:00
Chandler Carruth	e509db410a	[PM] Pull the generic graph algorithms and data structures for dominator trees into the Support library. These are all expressed in terms of the generic GraphTraits and CFG, with no reliance on any concrete IR types. Putting them in support clarifies that and makes the fact that the static analyzer in Clang uses them much more sane. When moving the Dominators.h file into the IR library I claimed that this was the right home for it but not something I planned to work on. Oops. So why am I doing this? It happens to be one step toward breaking the requirement that IR verification can only be performed from inside of a pass context, which completely blocks the implementation of verification for the new pass manager infrastructure. Fixing it will also allow removing the concept of the "preverify" step (WTF???) and allow the verifier to cleanly flag functions which fail verification in a way that precludes even computing dominance information. Currently, that results in a fatal error even when you ask the verifier to not fatally error. It's awesome like that. The yak shaving will continue... llvm-svn: 199095	2014-01-13 10:52:56 +00:00
Tim Northover	7fdd4857f7	Revert "ReMat: fix overly cavalier attitude to sub-register indices" Very sorry, this was a premature patch that I still need to investigate and finish off (for some reason beyond me at the moment it doesn't actually fix the issue in all cases). This reverts commit r199091. llvm-svn: 199093	2014-01-13 10:49:11 +00:00
Tim Northover	59f8d4b4ee	ReMat: fix overly cavalier attitude to sub-register indices There are two attempted optimisations in reMaterializeTrivialDef, trying to avoid promoting the size of a register too much when rematerializing. Unfortunately, both appear to be flawed. First, we see if the original register would have worked, but this is inadequate. Consider: v1 = SOMETHING (v1 is QQ) v2:Q0 = COPY v1:Q1 (v1, v2 are QQ) ... uses of v2 In this case even though v2 could be used directly as the output of SOMETHING, this would set the wrong bits of the QQ register involved. The correct rematerialization must be: v2:Q0_Q1 = SOMETHING (v2 promoted to QQQ) ... uses of v2:Q1_Q2 For the second optimisation, if the correct remat is "v2:idx = SOMETHING" then we can't necessarily expect v2 itself to be valid for SOMETHING, but we do try to hunt for a class between v1 and v2 that works. Unfortunately, this is also wrong: v1 = SOMETHING (v1 is QQ) v2:Q0_Q1 = COPY v1 (v1 is QQ, v2 is QQQ) ... uses of v2 as a QQQ The canonical rematerialization here is "v2:Q0_Q1 = SOMETHING". However current logic would decide that v2 could be a QQ (no interest is taken in later uses). This patch, therefore, always accepts the widened register class without trying to be clever. Generally there is no penalty to this (e.g. in the common GR32 < GR64 case, expanding the width doesn't matter because it's not like you were going to do anything else with the high bits of a GR32 register). It can increase register pressure in cases like the ARM VFP regs though (multiple non-overlapping but equivalent subregisters). Hopefully this situation is rare enough that it won't matter. Unfortunately, no in-tree targets actually expose this as far as I can tell (there are so few isAsCheapAsAMove instructions for it to trigger on) so I've been unable to produce a test. It was exposed in our ARM64 SPEC tests though, and I will be adding a test there that we should be able to contribute soon(TM). llvm-svn: 199091	2014-01-13 10:47:01 +00:00
Chandler Carruth	5ad5f15cff	[cleanup] Move the Dominators.h and Verifier.h headers into the IR directory. These passes are already defined in the IR library, and it doesn't make any sense to have the headers in Analysis. Long term, I think there is going to be a much better way to divide these matters. The dominators code should be fully separated into the abstract graph algorithm and have that put in Support where it becomes obvious that evn Clang's CFGBlock's can use it. Then the verifier can manually construct dominance information from the Support-driven interface while the Analysis library can provide a pass which both caches, reconstructs, and supports a nice update API. But those are very long term, and so I don't want to leave the really confusing structure until that day arrives. llvm-svn: 199082	2014-01-13 09:26:24 +00:00
Jakob Stoklund Olesen	1995b9fead	Handle bundled terminators in isBlockOnlyReachableByFallthrough. Targets like SPARC and MIPS have delay slots and normally bundle the delay slot instruction with the corresponding terminator. Teach isBlockOnlyReachableByFallthrough to find any MBB operands on bundled terminators so SPARC doesn't need to specialize this function. llvm-svn: 199061	2014-01-12 19:24:08 +00:00
Nico Rieck	b5262d6d8f	Fix non-deterministic SDNodeOrder-dependent codegen Reset SelectionDAGBuilder's SDNodeOrder to ensure deterministic code generation. llvm-svn: 199050	2014-01-12 14:09:17 +00:00
Chandler Carruth	9d805139bd	[PM] Simplify the interface exposed for IR printing passes. Nothing was using the ability of the pass to delete the raw_ostream it printed to, and nothing was trying to pass it a pointer to the raw_ostream. Also, the function variant had a different order of arguments from all of the others which was just really confusing. Now the interface accepts a reference, doesn't offer to delete it, and uses a consistent order. The implementation of the printing passes haven't been updated with this simplification, this is just the API switch. llvm-svn: 199044	2014-01-12 11:30:46 +00:00
Chandler Carruth	b8ddc7043c	[PM] Rename the IR printing pass header to a more generic and correct name to match the source file which I got earlier. Update the include sites. Also modernize the comments in the header to use the more recommended doxygen style. llvm-svn: 199041	2014-01-12 11:10:32 +00:00
Alp Toker	798060e006	Fix 'ned' typo in doc comment Patch by Jasper Neumann! llvm-svn: 199007	2014-01-11 14:01:43 +00:00
Eric Christopher	942f22c439	Revert r198979 - accidental commit. llvm-svn: 198981	2014-01-11 00:28:12 +00:00
Eric Christopher	ceec7b02fa	Reformat. llvm-svn: 198980	2014-01-11 00:23:18 +00:00
Eric Christopher	67cde9ac07	Update function name and add some helpful comments. llvm-svn: 198979	2014-01-11 00:23:16 +00:00
David Blaikie	15ed5ebfc5	Revert "Revert r198851, "Prototype of skeleton type units for fission"" This reverts commit r198865 which reverts r198851. ASan identified a use-of-uninitialized of the DwarfTypeUnit::Ty variable in skeleton type units. llvm-svn: 198908	2014-01-10 01:38:41 +00:00
NAKAMURA Takumi	c5bf572993	Revert r198851, "Prototype of skeleton type units for fission" It caused undefined behavior. DwarfTypeUnit::Ty might not be initialized properly, I guess. llvm-svn: 198865	2014-01-09 13:08:00 +00:00
Richard Sandiford	15cfc1c33c	Handle masked rotate amounts At the moment we expect rotates to have the form: (or (shl X, Y), (shr X, Z)) where Y == bitsize(X) - Z or Z == bitsize(X) - Y. This form means that the (or ...) is undefined for Y == 0 or Z == 0. This undefinedness can be avoided by using Y == (C * bitsize(X) - Z) & (bitsize(X) - 1) or Z == (C * bitsize(X) - Y) & (bitsize(X) - 1) for any integer C (including 0, the most natural choice). llvm-svn: 198861	2014-01-09 10:56:42 +00:00
Richard Sandiford	0f264db3c6	Match the InstCombine form of rotates by X+C InstCombine converts (sub 32, (add X, C)) into (sub 32-C, X), so a rotate left of a 32-bit Y by X+C could appear as either: (or (shl Y, (add X, C)), (shr Y, (sub 32, (add X, C)))) without InstCombine or: (or (shl Y, (add X, C)), (shr Y, (sub 32-C, X))) with it. We already matched the first form. This patch handles the second too. llvm-svn: 198860	2014-01-09 10:49:40 +00:00
David Blaikie	a588365df6	Prototype of skeleton type units for fission llvm-svn: 198851	2014-01-09 05:08:28 +00:00
David Blaikie	38fe6342f6	DwarfDebug: Refactor out common skeleton construction code to be reused for type unit skeletons. llvm-svn: 198846	2014-01-09 04:28:46 +00:00
David Blaikie	b334e94492	Reformatting for r198842 llvm-svn: 198843	2014-01-09 03:24:13 +00:00
David Blaikie	f645f963ff	DwarfUnit: Rename "Node" to "CUNode" and propagate it through DwarfTypeUnit as well. Since we'll now also need the split dwarf file name along with the language in DwarfTypeUnits, just use the whole DICompileUnit rather than explicitly handling each field needed. llvm-svn: 198842	2014-01-09 03:23:41 +00:00
David Blaikie	7480ae6e19	Revert "DwarfUnit: Move the DICompileUnit Node to the DwarfCompileUnit only" This reverts commit r198830. Decided to go a different way with this... llvm-svn: 198841	2014-01-09 03:03:27 +00:00
Chandler Carruth	d48cdbf0c3	Put the functionality for printing a value to a raw_ostream as an operand into the Value interface just like the core print method is. That gives a more conistent organization to the IR printing interfaces -- they are all attached to the IR objects themselves. Also, update all the users. This removes the 'Writer.h' header which contained only a single function declaration. llvm-svn: 198836	2014-01-09 02:29:41 +00:00
David Blaikie	08badfd2ba	DwarfUnit: Move the DICompileUnit Node to the DwarfCompileUnit only It's unused in DwarfTypeUnit, as is expected. llvm-svn: 198830	2014-01-09 01:20:14 +00:00
Andrew Trick	32e1be7bd0	llvm.experimental.stackmap: fix encoding of large constants. In the stackmap format we advertise the constant field as signed. However, we were determining whether to promote to a 64-bit constant pool based on an unsigned comparison. This fix allows -1 to be encoded as a small constant. llvm-svn: 198816	2014-01-09 00:22:31 +00:00
Hal Finkel	2150e3a743	Conservatively handle multiple MMOs in MIsNeedChainEdge MIsNeedChainEdge, which is used by -enable-aa-sched-mi (AA in misched), had an llvm_unreachable when -enable-aa-sched-mi is enabled and we reach an instruction with multiple MMOs. Instead, return a conservative answer. This allows testing -enable-aa-sched-mi on x86. Also, this moves the check above the isUnsafeMemoryObject checks. isUnsafeMemoryObject is currently correct only for instructions with one MMO (as noted in the comment in isUnsafeMemoryObject): // We purposefully do no check for hasOneMemOperand() here // in hope to trigger an assert downstream in order to // finish implementation. The problem with this is that, had the candidate edge passed the "!MIa->mayStore() && !MIb->mayStore()" check, the hoped-for assert would never happen (which could, in theory, lead to incorrect behavior if one of these secondary MMOs was volatile, for example). llvm-svn: 198795	2014-01-08 21:52:02 +00:00
Andrea Di Biagio	23df4e4a2d	Teach the DAGCombiner how to fold 'vselect' dag nodes according to the following two rules: 1) fold (vselect (build_vector AllOnes), A, B) -> A 2) fold (vselect (build_vector AllZeros), A, B) -> B llvm-svn: 198777	2014-01-08 18:33:04 +00:00
Richard Sandiford	95c864d9bd	[DAGCombiner] Factor duplicated rotate code into a separate function No functional change intended. llvm-svn: 198768	2014-01-08 15:40:47 +00:00
Rafael Espindola	894843cb4e	Move the llvm mangler to lib/IR. This makes it available to tools that don't link with target (like llvm-ar). llvm-svn: 198708	2014-01-07 21:19:40 +00:00
Benjamin Kramer	8a68ab3710	Emit arange padding with a single directive. llvm-svn: 198700	2014-01-07 19:28:14 +00:00
Chandler Carruth	9aca918df9	Move the LLVM IR asm writer header files into the IR directory, as they are part of the core IR library in order to support dumping and other basic functionality. Rename the 'Assembly' include directory to 'AsmParser' to match the library name and the only functionality left their -- printing has been in the core IR library for quite some time. Update all of the #includes to match. All of this started because I wanted to have the layering in good shape before I started adding support for printing LLVM IR using the new pass infrastructure, and commandline support for the new pass infrastructure. llvm-svn: 198688	2014-01-07 12:34:26 +00:00
Chandler Carruth	8a8cd2bab9	Re-sort all of the includes with ./utils/sort_includes.py so that subsequent changes are easier to review. About to fix some layering issues, and wanted to separate out the necessary churn. Also comment and sink the include of "Windows.h" in three .inc files to match the usage in Memory.inc. llvm-svn: 198685	2014-01-07 11:48:04 +00:00
Andrew Trick	dfacda3635	Fix for PR18396: Assertion: MO->isDead "Cannot fold physreg def". InlineSpiller::foldMemoryOperand needs to handle undef call operands. llvm-svn: 198679	2014-01-07 07:31:10 +00:00
Kevin Qin	5cd73c9e0a	[AArch64 NEON] Fix invalid constant used in vselect condition. There is a wrong assumption that the vector element type and the type of each ConstantSDNode in the build_vector were the same. However, when promoting the integer operand of a legally typed build_vector, the operand type and the vector element type do not need to be the same (See method 'DAGTypeLegalizer::PromoteIntOp_BUILD_VECTOR' in LegalizeIntegerTypes.cpp). in AArch64 backend, the following dag sequence: C0: i1 = Constant<0> C1: i1 = Constant<-1> V: v8i1 = BUILD_VECTOR C1, C1, C0, C0, C0, C0, C0, C0 is type-legalized into: NewC0: i32 = Constant<0> NewC1: i32 = Constant<1> V: v8i8 = BUILD_VECTOR NewC1, NewC1, NewC0, NewC0, NewC0, NewC0, NewC0, NewC0 Forcing a getZeroExtend to VTBits to ensure that the new constant is correctly. llvm-svn: 198582	2014-01-06 02:26:10 +00:00
Bill Wendling	908bf814e7	Refactor function that checks that __builtin_returnaddress's argument is constant. This moves the check up into the parent class so that all targets can use it without having to copy (and keep in sync) the same error message. llvm-svn: 198579	2014-01-06 00:43:20 +00:00
Nico Weber	7408c7066a	Add a LLVM_DUMP_METHOD macro. The motivation is to mark dump methods as used in debug builds so that they can be called from lldb, but to not do so in release builds so that they can be dead-stripped. There's lots of potential follow-up work suggested in the thread "Should dump methods be LLVM_ATTRIBUTE_USED only in debug builds?" on cfe-dev, but everyone seems to agreen on this subset. Macro name chosen by fair coin toss. llvm-svn: 198456	2014-01-03 22:53:37 +00:00
Rafael Espindola	58873566b3	Make the llvm mangler depend only on DataLayout. Before this patch any program that wanted to know the final symbol name of a GlobalValue had to link with Target. This patch implements a compromise solution where the mangler uses DataLayout. This way, any tool that already links with Target (llc, clang) gets the exact behavior as before and new IR files can be mangled without linking with Target. With this patch the mangler is constructed with just a DataLayout and DataLayout is extended to include the information the Mangler needs. llvm-svn: 198438	2014-01-03 19:21:54 +00:00
David Blaikie	cfb2115e66	Revert "Revert "Debug Info: Type Units: Simplify type hashing using IR-provided unique names."" This reverts commit r198398, thus reapplying r198397. I had accidentally introduced an endianness issue when applying the hash to the type unit. Using support::ulittle64_t in the reinterpret_cast in addDwarfTypeUnitType fixes this issue. Original commit message: Debug Info: Type Units: Simplify type hashing using IR-provided unique names. What's good for LTO metadata size problems ought to be good for non-LTO debug info size too, so let's rely on the same uniqueness in both cases. If it's insufficient for non-LTO for whatever reason (since we now won't be uniquing CU-local types or any C types - but these are likely to not be the most significant contributors to type bloat) we should consider a frontend solution that'll help both LTO and non-LTO alike, rather than using DWARF-level DIE-hashing that only helps non-LTO debug info size. It's also much simpler this way and benefits C++ even more since we can deduplicate lexically separate definitions of the same C++ type since they have the same mangled name. llvm-svn: 198436	2014-01-03 18:59:42 +00:00
David Blaikie	ab0ba24983	Revert "Debug Info: Type Units: Simplify type hashing using IR-provided unique names." Reverting due to bot failure I won't have time to investigate until tomorrow. This reverts commit r198397. llvm-svn: 198398	2014-01-03 04:49:04 +00:00
David Blaikie	ddb66281cd	Debug Info: Type Units: Simplify type hashing using IR-provided unique names. What's good for LTO metadata size problems ought to be good for non-LTO debug info size too, so let's rely on the same uniqueness in both cases. If it's insufficient for non-LTO for whatever reason (since we now won't be uniquing CU-local types or any C types - but these are likely to not be the most significant contributors to type bloat) we should consider a frontend solution that'll help both LTO and non-LTO alike, rather than using DWARF-level DIE-hashing that only helps non-LTO debug info size. It's also much simpler this way and benefits C++ even more since we can deduplicate lexically separate definitions of the same C++ type since they have the same mangled name. llvm-svn: 198397	2014-01-03 04:20:26 +00:00
Eric Christopher	4d214b9e9c	80-column. llvm-svn: 198394	2014-01-03 02:17:35 +00:00
Eric Christopher	50effa0437	Remove TextSectionSym as it is unused. llvm-svn: 198393	2014-01-03 02:16:44 +00:00
David Blaikie	22b29a5f1a	Revert "Reverting r193835 due to weirdness with Go..." The cgo problem was that it wants dwarf2 which doesn't support direct constant encoding of the location. So let's add support for dwarf2 encoding (using a location expression) of data member locations. This reverts commit r198385. llvm-svn: 198389	2014-01-03 01:30:05 +00:00
David Blaikie	2ada116a34	Reverting r193835 due to weirdness with Go... Apologies for the noise - we're seeing some Go failures with cgo interacting with Clang's debug info due to this change. llvm-svn: 198385	2014-01-03 00:48:38 +00:00
Quentin Colombet	1fb3362a6e	[RegAlloc] Make tryInstructionSplit less aggressive. The greedy register allocator tries to split a live-range around each instruction where it is used or defined to relax the constraints on the entire live-range (this is a last chance split before falling back to spill). The goal is to have a big live-range that is unconstrained (i.e., that can use the largest legal register class) and several small local live-range that carry the constraints implied by each instruction. E.g., Let csti be the constraints on operation i. V1= op1 V1(cst1) op2 V1(cst2) V1 live-range is constrained on the intersection of cst1 and cst2. tryInstructionSplit relaxes those constraints by aggressively splitting each def/use point: V1= V2 = V1 V3 = V2 op1 V3(cst1) V4 = V2 op2 V4(cst2) Because of how the coalescer infrastructure works, each new variable (V3, V4) that is alive at the same time as V1 (or its copy, here V2) interfere with V1. Thus, we end up with an uncoalescable copy for each split point. To make tryInstructionSplit less aggressive, we check if the split point actually relaxes the constraints on the whole live-range. If it does not, we do not insert it. Indeed, it will not help the global allocation problem: - V1 will have the same constraints. - V1 will have the same interference + possibly the newly added split variable VS. - VS will produce an uncoalesceable copy if alive at the same time as V1. <rdar://problem/15570057> llvm-svn: 198369	2014-01-02 22:47:22 +00:00
Eric Christopher	94932438d4	Remove comments on CU skeleton construction, they're probably obvious. llvm-svn: 198361	2014-01-02 22:04:47 +00:00
Eric Christopher	d8beca3b78	Elaborate on comment for skeleton CU construction. llvm-svn: 198358	2014-01-02 21:38:18 +00:00
Eric Christopher	40734c4c0c	Revert seemingly unnecessary section sym for the data section. llvm-svn: 198357	2014-01-02 21:38:13 +00:00
Hal Finkel	decb024c86	Disable compare sinking in CodeGenPrepare when multiple condition registers are available As noted in the comment above CodeGenPrepare::OptimizeInst, which aggressively sinks compares to reduce pressure on the condition register(s), for targets such as PowerPC with multiple condition registers, this may not be the right thing to do. This adds an HasMultipleConditionRegisters boolean to TLI, and CodeGenPrepare::OptimizeInst is skipped when HasMultipleConditionRegisters is true. This functionality will be used by the PowerPC backend in an upcoming commit. Especially when the PowerPC backend starts tracking individual condition register bits as separate allocatable entities (which will happen in this upcoming commit), this sinking from CodeGenPrepare::OptimizeInst is significantly suboptimial. llvm-svn: 198354	2014-01-02 21:13:43 +00:00
Eric Christopher	d4368fde45	Fix up a couple of review comments: Use an if statement instead of a pair of ternary operators checking the same condition. Use a cheap method call rather than returning the local symbol. llvm-svn: 198351	2014-01-02 21:03:28 +00:00
Eric Christopher	8bdb6e1d49	Simplify conditional. llvm-svn: 198350	2014-01-02 21:03:22 +00:00
Lang Hames	8e6e6abf53	Remove redundant fold call introduced in r195944. Thanks very much to Juergen for pointing this out. llvm-svn: 198341	2014-01-02 19:38:41 +00:00
Adrian Prantl	fd3279f27f	Revert "Debug info: Add enumerators to the __apple_names accelerator table." This reverts r197927 until the discussion on llvm-commits comes to a conclusion. llvm-svn: 198333	2014-01-02 18:48:24 +00:00
Rafael Espindola	6994fdf33c	Remove the 's' DataLayout specification During the years there have been some attempts at figuring out how to align byval arguments. A look at the commit log suggests that they were * Use the ABI alignment. * When that was not sufficient for x86-64, I added the 's' specification to DataLayout. * When that was not sufficient Evan added the virtual getByValTypeAlignment. * When even that was not sufficient, we just got the FE to add the alignment to the byval. This patch is just a simple cleanup that removes my first attempt at fixing the problem. I also added an AArch64 implementation of getByValTypeAlignment to make sure this patch is a nop. I also left the 's' parsing for backward compatibility. I will send a short email to llvmdev about the change for anyone maintaining an out of tree target. llvm-svn: 198287	2014-01-01 22:29:43 +00:00
Eric Christopher	05893f475b	Refactor and reduce code duplication for non-split dwarf strings. llvm-svn: 198233	2013-12-30 18:32:31 +00:00
Eric Christopher	d86672037b	Revert r198208 and reapply: r198196: Use a pointer to keep track of the skeleton unit for each normal unit and construct it up front. r198199: Reapply r198196 with a fix to zero initialize the skeleton pointer. r198202: Fix aranges and split dwarf by ensuring that the symbol and relocation back to the compile unit from the aranges section is to the skeleton unit and not the one in the dwo. with a fix to use integer 0 for DW_AT_low_pc since the relocation to the text section symbol was causing issues with COFF. Accordingly remove addLocalLabelAddress and machinery since we're not currently using it. llvm-svn: 198222	2013-12-30 17:22:27 +00:00
NAKAMURA Takumi	17b7310858	Revert r198199 (and r198202). It broke 3 DebugInfo tests for targeting i686-cygming. r198196: Use a pointer to keep track of the skeleton unit for each normal unit and construct it up front. r198199: Reapply r198196 with a fix to zero initialize the skeleton pointer. r198202: Fix aranges and split dwarf by ensuring that the symbol and relocation back to the compile unit from the aranges section is to the skeleton unit and not the one in the dwo. They could be reproducible with explicit target. llvm/lib/MC/WinCOFFObjectWriter.cpp:224: bool {anonymous}::COFFSymbol::should_keep() const: Assertion `Section->Number != -1 && "Sections with relocations must be real!"' failed. llvm-svn: 198208	2013-12-30 09:26:10 +00:00
Eric Christopher	c2d401e952	Fix aranges and split dwarf by ensuring that the symbol and relocation back to the compile unit from the aranges section is to the skeleton unit and not the one in the dwo. Do this by adding a method to grab a forwarded on local sym and local section by querying the skeleton if one exists and using that. Add a few tests to verify the relocations are back to the correct section. llvm-svn: 198202	2013-12-30 05:25:49 +00:00
Eric Christopher	d039baad05	Reapply r198196 with a fix to zero initialize the skeleton pointer. llvm-svn: 198199	2013-12-30 03:40:32 +00:00
Eric Christopher	be4c91c57c	Temporarily revert "Use a pointer to keep track of the skeleton unit for each normal unit" as it seems to be causing problems in the asan tests. llvm-svn: 198197	2013-12-30 03:12:31 +00:00
Eric Christopher	83fff3fce7	Use a pointer to keep track of the skeleton unit for each normal unit and construct it up front. Add address ranges at the end and a helper routine so that we're not needlessly using an indirction in the case of split dwarf. Update testcases according to the new ordering of attributes on the compile unit. llvm-svn: 198196	2013-12-30 03:02:12 +00:00
Kevin Qin	ede9ce1933	Fix a bug in DAGcombiner about zero-extend after setcc. For AArch64 backend, if DAGCombiner see "sext(setcc)", it will combine them together to a single setcc with extended value type. Then if it see "zext(setcc)", it assumes setcc is Vxi1, and try to create "(and (vsetcc), (1, 1, ...)". While setcc isn't Vxi1, DAGcombiner will create wrong node and get wrong code emitted. llvm-svn: 198190	2013-12-30 02:05:13 +00:00
Saleem Abdulrasool	7230b377df	CodeGen: silence a C++11 feature warning llvm-svn: 198133	2013-12-28 22:47:55 +00:00
Andrew Trick	7afe481801	Uninitialized variable (in never taken path) after factoring. llvm-svn: 198131	2013-12-28 22:25:57 +00:00
Andrew Trick	33e05d7665	Added debugging options: -misched-only-func/block llvm-svn: 198124	2013-12-28 21:57:02 +00:00
Andrew Trick	d14d7c20f5	Add a PostMachineScheduler pass with generic implementation. PostGenericScheduler uses either the new machine model or the hazard checker for top-down scheduling. Most of the infrastructure for PreRA machine scheduling is reused. With a some tuning, this should allow MachineScheduler to be default for all ARM targets, including cortex-A9, using the new machine model. Likewise, with additional tuning, it should be able to replace PostRAScheduler for all targets. The PostMachineScheduler pass does not currently run the AntiDepBreaker. There is less need for it on targets that are already running preRA MachineScheduler. I want to prove it's necessary before committing to the maintenance burden. The PostMachineScheduler also currently removes kill flags and adds them all back later. This is a bit ridiculous. I'd prefer passes to directly use a liveness utility than rely on flags. A test case that enables this scheduler will be included in a subsequent checkin that updates the A9 model. llvm-svn: 198122	2013-12-28 21:56:57 +00:00
Andrew Trick	6b104f8b9e	Move the PostRA scheduler's fixupKills function for reuse. llvm-svn: 198121	2013-12-28 21:56:55 +00:00
Andrew Trick	17080b9bf2	Stub out a PostMachineScheduler pass. Placeholder and boilerplate for a PostRA MachineScheduler pass. llvm-svn: 198120	2013-12-28 21:56:51 +00:00
Andrew Trick	d7f890edb0	Factor MI-Sched in preparation for post-ra scheduling support. Factor the MachineFunctionPass into MachineSchedulerBase. Split the DAG class into ScheduleDAGMI and SchedulerDAGMILive. llvm-svn: 198119	2013-12-28 21:56:47 +00:00
Eric Christopher	8458862f20	Remove AsmPrinter::needsRelocationsForDwarfStringPool() since it's just calling into MAI and is only abstracting for a single interface that we actually need to check in multiple places. llvm-svn: 198092	2013-12-28 01:39:17 +00:00
Andrea Di Biagio	46dcddb350	Teach DAGCombiner how to fold a SIGN_EXTEND_INREG of a BUILD_VECTOR of ConstantSDNodes (or UNDEFs) into a simple BUILD_VECTOR. For example, given the following sequence of dag nodes: i32 C = Constant<1> v4i32 V = BUILD_VECTOR C, C, C, C v4i32 Result = SIGN_EXTEND_INREG V, ValueType:v4i1 The SIGN_EXTEND_INREG node can be folded into a build_vector since the vector in input is a BUILD_VECTOR of constants. The optimized sequence is: i32 C = Constant<-1> v4i32 Result = BUILD_VECTOR C, C, C, C llvm-svn: 198084	2013-12-27 20:20:28 +00:00
Adrian Prantl	ad64aeac44	Debug info: Add enumerators to the __apple_names accelerator table. rdar://problem/11516681. llvm-svn: 197927	2013-12-23 23:50:20 +00:00
Eric Christopher	565ab11a35	Ranges in the .debug_range section need to have begin and end labels, assert that this is so. llvm-svn: 197780	2013-12-20 04:34:22 +00:00
Eric Christopher	46e2343554	Add support for a CU to output a set of ranges for the CU. This is useful when you want to have the full list of addresses for a particular CU or when you have multiple modules linked together and can't depend upon the ordering of a single CU for begin/end ranges. llvm-svn: 197776	2013-12-20 04:16:18 +00:00
Josh Magee	22b8ba2d67	[stackprotector] Use analysis from the StackProtector pass for stack layout in PEI a nd LocalStackSlot passes. This changes the MachineFrameInfo API to use the new SSPLayoutKind information produced by the StackProtector pass (instead of a boolean flag) and updates a few pass dependencies (to preserve the SSP analysis). The stack layout follows the same approach used prior to this change - i.e., only LargeArray stack objects will be placed near the canary and everything else will be laid out normally. After this change, structures containing large arrays will also be placed near the canary - a case previously missed by the old implementation. Out of tree targets will need to update their usage of MachineFrameInfo::CreateStackObject to remove the MayNeedSP argument. The next patch will implement the rules for sspstrong and sspreq. The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D2158 llvm-svn: 197653	2013-12-19 03:17:11 +00:00
Adrian Prantl	99c7af26b7	Debug info: Implement (rvalue) reference qualifiers for C++11 non-static member functions. Paired commit with CFE. rdar://problem/15356637 llvm-svn: 197613	2013-12-18 21:48:19 +00:00
David Blaikie	47f615eae5	DebugInfo: Introduce new DIValue, DIETypeSignature to encode references to type units via their signatures This simplifies type unit and type unit reference creation as well as setting the stage for inter-type hashing across type unit boundaries. llvm-svn: 197539	2013-12-17 23:32:35 +00:00
Andrew Trick	e4083f9e85	Disabled subregister copy coalescing during MachineCSE. This effectively backs out r197465 but leaves some of the general fixes in place. Not all targets are ready to handle this feature. To enable it, some infrastructure work is needed to better handle register class constraints. llvm-svn: 197514	2013-12-17 19:29:36 +00:00
Quentin Colombet	b4c44d239c	Add warning capabilities in LLVM. This reapplies r197438 and fixes the link-time circular dependency between IR and Support. The fix consists in moving the diagnostic support into IR. The patch adds a new LLVMContext::diagnose that can be used to communicate to the front-end, if any, that something of interest happened. The diagnostics are supported by a new abstraction, the DiagnosticInfo class. The base class contains the following information: - The kind of the report: What this is about. - The severity of the report: How bad this is. This patch also adds 2 classes: - DiagnosticInfoInlineAsm: For inline asm reporting. Basically, this diagnostic will be used to switch to the new diagnostic API for LLVMContext::emitError. - DiagnosticStackSize: For stack size reporting. Comes as a replacement of the hard coded warning in PEI. This patch also features dynamic diagnostic identifiers. In other words plugins can use this infrastructure for their own diagnostics (for more details, see getNextAvailablePluginDiagnosticKind). This patch introduces a new DiagnosticHandlerTy and a new DiagnosticContext in the LLVMContext that should be set by the front-end to be able to map these diagnostics in its own system. http://llvm-reviews.chandlerc.com/D2376 <rdar://problem/15515174> llvm-svn: 197508	2013-12-17 17:47:22 +00:00
Andrew Trick	e339828b90	Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies. Without this, MachineCSE is powerless to handle redundant operations with truncated source operands. This required fixing the 2-addr pass to handle tied subregisters. It isn't clear what combinations of subregisters can legally be tied, but the simple case of truncated source operands is now safely handled: %vreg11<def> = COPY %vreg1:sub_32bit; GR32:%vreg11 GR64:%vreg1 %vreg12<def> = COPY %vreg2:sub_32bit; GR32:%vreg12 GR64:%vreg2 %vreg13<def,tied1> = ADD32rr %vreg11<tied0>, %vreg12<kill>, %EFLAGS<imp-def> Test case: cse-add-with-overflow.ll. This exposed an existing bug in PPCInstrInfo::commuteInstruction. Thanks to Rafael for the test case: PowerPC/crash.ll. llvm-svn: 197465	2013-12-17 04:50:45 +00:00
Jim Grosbach	04caa27387	Make comment more explicit. Re-reading the comment I updated in previous commit, it's better to make it more explicit and avoid ambiguity more effectively. llvm-svn: 197458	2013-12-17 02:18:02 +00:00
Jim Grosbach	dde043b3fd	Typo. s/reserved/preserved/ llvm-svn: 197457	2013-12-17 02:01:13 +00:00
Jim Grosbach	ea2db453dd	Add a machine code print in DEBUG() following instruction selection. Make debugging ISel a bit easier by printing out a dump of the generated code at the end. llvm-svn: 197456	2013-12-17 02:01:10 +00:00
Quentin Colombet	382b135d92	Revert r197438 and r197447 until we figure out how to avoid circular dependency at link time llvm-svn: 197451	2013-12-17 01:19:59 +00:00
Quentin Colombet	66673f4075	Add warning capabilities in LLVM. The patch adds a new LLVMContext::diagnose that can be used to communicate to the front-end, if any, that something of interest happened. The diagnostics are supported by a new abstraction, the DiagnosticInfo class. The base class contains the following information: - The kind of the report: What this is about. - The severity of the report: How bad this is. This patch also adds 2 classes: - DiagnosticInfoInlineAsm: For inline asm reporting. Basically, this diagnostic will be used to switch to the new diagnostic API for LLVMContext::emitError. - DiagnosticStackSize: For stack size reporting. Comes as a replacement of the hard coded warning in PEI. This patch also features dynamic diagnostic identifiers. In other words plugins can use this infrastructure for their own diagnostics (for more details, see getNextAvailablePluginDiagnosticKind). This patch introduces a new DiagnosticHandlerTy and a new DiagnosticContext in the LLVMContext that should be set by the front-end to be able to map these diagnostics in its own system. http://llvm-reviews.chandlerc.com/D2376 <rdar://problem/15515174> llvm-svn: 197438	2013-12-16 23:22:51 +00:00
Rafael Espindola	f152836788	Revert "Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies." This reverts commit r197414. It broke the ppc64 bootstrap. I will post a testcase in a sec. llvm-svn: 197424	2013-12-16 20:57:09 +00:00
Andrew Trick	88bd8629b2	Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies. Without this, MachineCSE is powerless to handle redundant operations with truncated source operands. This required fixing the 2-addr pass to handle tied subregisters. It isn't clear what combinations of subregisters can legally be tied, but the simple case of truncated source operands is now safely handled: %vreg11<def> = COPY %vreg1:sub_32bit; GR32:%vreg11 GR64:%vreg1 %vreg12<def> = COPY %vreg2:sub_32bit; GR32:%vreg12 GR64:%vreg2 %vreg13<def,tied1> = ADD32rr %vreg11<tied0>, %vreg12<kill>, %EFLAGS<imp-def> llvm-svn: 197414	2013-12-16 19:36:21 +00:00
Andrew Trick	cccd82f21f	whitespace llvm-svn: 197413	2013-12-16 19:36:18 +00:00
Juergen Ributzka	c26b68a94f	[Stackmap] Refactor operand parsing. llvm-svn: 197329	2013-12-14 23:06:19 +00:00
Juergen Ributzka	db9ee00b59	Remove weak vtables. No functional change. llvm-svn: 197323	2013-12-14 12:23:14 +00:00
Juergen Ributzka	e82947539e	[Stackmap] Liveness Analysis Pass This optional register liveness analysis pass can be enabled with either -enable-stackmap-liveness, -enable-patchpoint-liveness, or both. The pass traverses each basic block in a machine function. For each basic block the instructions are processed in reversed order and if a patchpoint or stackmap instruction is encountered the current live-out register set is encoded as a register mask and attached to the instruction. Later on during stackmap generation the live-out register mask is processed and also emitted as part of the stackmap. This information is optional and intended for optimization purposes only. This will enable a client of the stackmap to reason about the registers it can use and which registers need to be preserved. Reviewed by Andy llvm-svn: 197317	2013-12-14 06:53:06 +00:00
Juergen Ributzka	310034e166	Convert register liveness tracking to work on a sub-register level instead of just register units. Reviewed by Andy llvm-svn: 197315	2013-12-14 06:52:56 +00:00
Michael Gottesman	5e985ee5b5	[block-freq] Rename getEntryFrequency() -> getEntryFreq() to match getBlockFreq() in all BlockFrequencyInfo. llvm-svn: 197304	2013-12-14 02:37:38 +00:00
Michael Gottesman	9f49d74413	[block-freq] Refactor LiveInterals::getSpillWeight to use the new MachineBlockFrequencyInfo methods. This is slightly more interesting than the previous batch of changes. Specifically: 1. We refactor getSpillWeight to take a MachineBlockFrequencyInfo (MBFI) object. This enables us to completely encapsulate the actual manner we use the MachineBlockFrequencyInfo to get our spill weights. This yields cleaner code since one does not need to fetch the actual block frequency before getting the spill weight if all one wants it the spill weight. It also gives us access to entry frequency which we need for our computation. 2. Instead of having getSpillWeight take a MachineBasicBlock (as one might think) to look up the block frequency via the MBFI object, we instead take in a MachineInstr object. The reason for this is that the method is supposed to return the spill weight for an instruction according to the comments around the function. llvm-svn: 197296	2013-12-14 00:53:32 +00:00
Michael Gottesman	092647b37a	[block-freq] Store MBFI as a field on SpillPlacement so we can access it to get the entry frequency while processing data. llvm-svn: 197291	2013-12-14 00:25:47 +00:00
Michael Gottesman	b78dec8faf	[block-freq] Update MachineBlockPlacement and RegAllocGreedy to use the new MachineBlockFrequencyInfo methods. llvm-svn: 197290	2013-12-14 00:25:45 +00:00
Michael Gottesman	b0c1ed8f4c	[block-freq] Update BlockFrequencyInfo/MachineBlockFrequencyInfo to use the new print methods. llvm-svn: 197289	2013-12-14 00:25:42 +00:00
Matt Arsenault	68c38fd6d1	Print the address space of a MachineMemOperand llvm-svn: 197288	2013-12-14 00:24:02 +00:00
Michael Gottesman	fd5c4b2c09	[block-freq] Add the equivalent methods to MachineBlockFrequencyInfo and BlockFrequencyInfo that were added to BlockFrequencyImpl in r197285 and r197284. llvm-svn: 197287	2013-12-14 00:06:03 +00:00
Andrew Trick	60cf0adeb5	comment typo. llvm-svn: 197278	2013-12-13 22:23:54 +00:00
David Blaikie	bc563276e0	DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers This commit does not complete the type units feature - there are issues around fission support (skeletal type units, pubtypes/pubnames) and hashing of some types including those containing references to types in other type units. Originally committed as r197073 and reverted in r197079. Recommitted as r197197 to reproduce the failure and reverted as r197199 Turns out there was unstable ordering in the type unit dumping code. Fixed by using MapVector in DWARFContext to store the debug_types comdat sections. Recommitted as r197210 with a fix to dumping and reverted as r197211 because I was a bit gun shy and thought I saw a failure that turned out to be unrelated. So here we go - once more with feeling! \o/ llvm-svn: 197275	2013-12-13 21:33:40 +00:00
Andrew Trick	27709d0b3c	Revert "Convert liveness tracking to work on a sub-register level instead of just register units." This reverts commit r197253. This was a great change, but Juergen should be the commit author. llvm-svn: 197262	2013-12-13 19:04:08 +00:00
Andrew Trick	7bcb0100df	Revert "Liveness Analysis Pass" This reverts commit r197254. This was an accidental merge of Juergen's patch. It will be checked in shortly, but wasn't meant to go in quite yet. Conflicts: include/llvm/CodeGen/StackMaps.h lib/CodeGen/StackMaps.cpp test/CodeGen/X86/stackmap-liveness.ll llvm-svn: 197260	2013-12-13 18:57:20 +00:00
Andrew Trick	e8cba373a3	Grow the stackmap/patchpoint format to hold 64-bit IDs. llvm-svn: 197255	2013-12-13 18:37:10 +00:00
Andrew Trick	8d6a658430	Liveness Analysis Pass llvm-svn: 197254	2013-12-13 18:37:03 +00:00
Andrew Trick	8df84fa2f2	Convert liveness tracking to work on a sub-register level instead of just register units. llvm-svn: 197253	2013-12-13 18:36:56 +00:00
David Blaikie	04adff775f	Revert "DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers" This reverts commit r197210. llvm-svn: 197211	2013-12-13 06:43:32 +00:00
David Blaikie	753c6e4eb2	DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers This commit does not complete the type units feature - there are issues around fission support (skeletal type units, pubtypes/pubnames) and hashing of some types including those containing references to types in other type units. Originally committed as r197073 and reverted in r197079. Recommitted as r197197 to reproduce the failure and reverted as r197199 Turns out there was unstable ordering in the type unit dumping code. Fixed by using MapVector in DWARFContext to store the debug_types comdat sections. llvm-svn: 197210	2013-12-13 06:27:38 +00:00
David Blaikie	6201712bb0	Revert "DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers" This reverts commit r197197. llvm-svn: 197199	2013-12-13 01:24:54 +00:00
David Blaikie	baaf74d4ca	DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers This commit does not complete the type units feature - there are issues around fission support (skeletal type units, pubtypes/pubnames) and hashing of some types including those containing references to types in other type units. Originally committed as r197073 and reverted in r197079. This commit originally got jumbled up with another build-breaking commit and I can't find the failures I thought this caused anymore. Recommitting to hopefully get some clean buildbot results to work from. I have a sneaking suspicion there's unstable output in the comdat group output of MCStreamer... llvm-svn: 197197	2013-12-13 01:06:41 +00:00
Quentin Colombet	18b779e3f4	Fix an over-constrained assertion in MachineFunction::addLiveIn. The assertion was checking that the virtual register VReg used to represent the physical register PReg uses the same register class as the one passed to MachineFunction::addLiveIn. This is over-constraining because it is sufficient to check that the register class of VReg (VRegRC) is a subclass of the register class of PReg (PRegRC) and that VRegRC contains PReg. Indeed, if VReg gets constrained because of some operation constraints between two calls of MachineFunction::addLiveIn, the original assertion cannot match. This fixes <rdar://problem/15633429>. llvm-svn: 197097	2013-12-12 00:15:47 +00:00
Hal Finkel	4fd3b1de2a	Add two additional hazard recognizer functions This adds two additional functions to the hazard recognizer interface. These are optional (in the sense that the default implementations preserve the current behavior), and used by the post-RA scheduler. Upcoming commits will use this functionality in order to improve dispatch-group formation on the POWER7 and related cores. Dispatch groups are an odd construct: sometimes we need to insert nops to force a new one to start (for performance reasons), and some instructions need to appear in certain positions within a group, but the groups are not fundamentally cycle based (they can contain instructions with data dependencies with non-trivial latencies). Motivation: unsigned PreEmitNoops(SUnit ) - Used to force the post-RA scheduler to insert nops to force a new dispatch group to begin. We already have a NoopHazard, and this is also still needed. However, NoopHazard only causes a nop to be inserted if there are no other available instructions, and so is not always sufficient. The number of nops to insert depends on state that only the hazard recognizer has, so a general callback is necessary. bool ShouldPreferAnother(SUnit ) - Used to avoid scheduling instructions that would start a new dispatch group when others are available that could be part of the current dispatch group. In this case, we don't want to issue nops, because the non-preferred instruction will implicitly start a new dispatch group regardless. Although the motivation for these functions is driven by the PowerPC backend, they are completely general. llvm-svn: 197084	2013-12-11 22:33:43 +00:00
Rafael Espindola	2b5a0c9e68	On ELF and COFF treat linker_private like private. The linkers on these systems don't have anything special to do with these symbols. Since the intent is for them to be absent from the final object, just treat them as private. llvm-svn: 197080	2013-12-11 22:18:44 +00:00
David Blaikie	727747eb29	Revert "DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers" This reverts commit r197073. The test seems to be failing on some buildbots for unknown reasons. Reverting until I can figure that out. If anyone's got a reproduction (.s and .o together would be great) - I'd really appreciate it. llvm-svn: 197079	2013-12-11 22:08:39 +00:00
David Blaikie	4fe3c00eed	DebugInfo: Move type units into the debug_types section with appropriate comdat grouping and type unit headers This commit does not complete the type units feature - there are issues around fission support (skeletal type units, pubtypes/pubnames) and hashing of some types including those containing references to types in other type units. llvm-svn: 197073	2013-12-11 21:36:27 +00:00
David Blaikie	3332d4c75f	DwarfUnit: LLVM_OVERRIDE and constify some functions llvm-svn: 197072	2013-12-11 21:14:02 +00:00
Benjamin Kramer	671a596282	SelectionDAG: Fix a typo. Found by "cppcheck". PR18208. llvm-svn: 197047	2013-12-11 16:36:09 +00:00
Richard Sandiford	d1093636cc	Extend (truncate (load)) folding DAGCombiner could fold (truncate (load)) -> smaller load if the original load was the width of the truncation result or wider. This patch extends it to handle cases where the original load was narrower (and so the extension type stays the same). llvm-svn: 197030	2013-12-11 11:37:27 +00:00
Andrew Trick	2d8826a1b5	Add TargetRegisterInfo::reverseLocalAssignment hook. This hook reverses the order of assignment for local live ranges. This will generally allocate shorter local live ranges first. For targets with many registers, this could reduce regalloc compile time by a large factor. It should still achieve optimal coloring; however, it can change register eviction decisions. It is disabled by default for two reasons: (1) Top-down allocation is simpler and easier to debug for targets that don't benefit from reversing the order. (2) Bottom-up allocation could result in poor evicition decisions on some targets affecting the performance of compiled code. llvm-svn: 197001	2013-12-11 03:40:15 +00:00
NAKAMURA Takumi	8bc9bfaa5a	Prune redundant dependencies in LLVMBuild.txt. llvm-svn: 196988	2013-12-11 00:30:57 +00:00
David Fang	1b01849f2d	on darwin<10, fallback to .weak_definition (PPC,X86) .weak_def_can_be_hidden was not yet supported by the system assembler llvm-svn: 196970	2013-12-10 21:37:41 +00:00
Matt Arsenault	0f5f015bfd	Fix gcc warnings. Unused variable and unused typedef in release build. llvm-svn: 196947	2013-12-10 18:55:37 +00:00
Reid Kleckner	ee08897fb8	Reland "Fix miscompile of MS inline assembly with stack realignment" This re-lands commit r196876, which was reverted in r196879. The tests have been fixed to pass on platforms with a stack alignment larger than 4. Update to clang side tests will land shortly. llvm-svn: 196939	2013-12-10 18:27:32 +00:00
Richard Sandiford	9afe613d12	Add TargetLowering::prepareVolatileOrAtomicLoad One unusual feature of the z architecture is that the result of a previous load can be reused indefinitely for subsequent loads, even if a cache-coherent store to that location is performed by another CPU. A special serializing instruction must be used if you want to force a load to be reattempted. Since volatile loads are not supposed to be omitted in this way, we should insert a serializing instruction before each such load. The same goes for atomic loads. The patch implements this at the IR->DAG boundary, in a similar way to atomic fences. It is a no-op for targets other than SystemZ. llvm-svn: 196905	2013-12-10 10:36:34 +00:00
NAKAMURA Takumi	396d4d3c7e	Add proper dependencies to LLVMBuild.txt in llvm/lib. I'll prune redundant deps in LLVMBuild.txt, later. llvm-svn: 196881	2013-12-10 05:39:34 +00:00
Reid Kleckner	0a9509f080	Revert "Fix miscompile of MS inline assembly with stack realignment" This reverts commit r196876. Its tests failed on the bots, so I'll figure it out tomorrow. llvm-svn: 196879	2013-12-10 05:31:27 +00:00
Reid Kleckner	7f10a8cd45	Fix miscompile of MS inline assembly with stack realignment For stack frames requiring realignment, three pointers may be needed: - ebp to address incoming arguments - esi (could be any callee-saved register) to address locals - esp to address outgoing arguments We would use esi unconditionally without verifying that it did not conflict with inline assembly. This change doesn't do the verification, it simply emits a fatal error on functions that use stack realignment, dynamic SP adjustments, and inline assembly. Because stack realignment is common on Windows, we also no longer assume that MS inline assembly clobbers esp. Instead, we analyze the inline instructions for implicit definitions and check if esp is there. If so, we require the use of a base pointer and consider it in the condition above. Mostly fixes PR16830, but we could try harder to find a non-conflicting base pointer. Reviewers: sunfish Differential Revision: http://llvm-reviews.chandlerc.com/D1317 llvm-svn: 196876	2013-12-10 05:12:23 +00:00
Nadav Rotem	6eee080450	Fix PR18162 - Incorrect assertion assumed that the SDValue resno is zero. llvm-svn: 196858	2013-12-10 01:13:59 +00:00
Eric Christopher	5090d57c24	Disable emitting DW_AT_GNU_ranges_base until we actually use it. llvm-svn: 196851	2013-12-10 00:40:03 +00:00
Eric Christopher	b95d857350	We never emit info into the macro info section, stop emitting an empty one. llvm-svn: 196849	2013-12-10 00:26:10 +00:00
Eric Christopher	4df1160536	80-col. llvm-svn: 196848	2013-12-10 00:26:06 +00:00
Eric Christopher	4287a49913	Rename CompileUnit->DwarfCompileUnit and TypeUnit->DwarfTypeUnit for clarity. No functional change. llvm-svn: 196844	2013-12-09 23:57:44 +00:00
Eric Christopher	a5a7942297	Rename Unit->DwarfUnit to match the file name and make it a bit less ambiguous. Reformat to match. llvm-svn: 196838	2013-12-09 23:32:48 +00:00
David Blaikie	1ab7c2dab4	DwarfDebug/Unit: Remove another case of label recreation by storing the gnu_ranges label in the unit. llvm-svn: 196793	2013-12-09 17:51:30 +00:00
Andrew Trick	fc127d1197	Factor out the SchedRemainder/SchedBoundary from GenericScheduler strategy. These helper classes take care of the book-keeping the drives the GenericScheduler heuristics. It is likely that developers writing target-specific schedulers that work similarly to GenericScheduler will want to use these helpers too. The immediate goal is to develop a GenericPostScheduler that can run in place of the old PostRAScheduler, but will use the new machine model. No functionality change intended. llvm-svn: 196643	2013-12-07 05:59:44 +00:00
Lang Hames	2ce64a7d9e	Correct think-o in foldPatchpoint. Thanks to Andy Trick for pointing it out. llvm-svn: 196640	2013-12-07 03:30:59 +00:00
Vincent Lejeune	92b0a64906	Add a RequireStructuredCFG Field to TargetMachine. llvm-svn: 196634	2013-12-07 01:49:19 +00:00
David Blaikie	7d73460218	DebugInfo: Move unit begin/end labels into the unit This removes another case of spooky action at a distance (building the same label names in multiple places creating an implicit dependency between those places) and helps pave the way for type units. llvm-svn: 196617	2013-12-06 22:33:05 +00:00
David Blaikie	03073f747e	DebugInfo: Include the section and start-of-section label in the unit This is a precursor to moving type units into the correct (debug_types) section with comdat groups and full type unit headers. llvm-svn: 196615	2013-12-06 22:14:48 +00:00
David Blaikie	4f623205a9	DwarfDebug: Walk skeletons during fission pubtypes/pubnames emission This more accurately represents the actual walk - pubnames/pubtypes are emitted into the .o, not the .dwo, and reference the skeletons not the full units. Use the newly established ID->index invariant to lookup the underlying full unit to retrieve its public names and types. llvm-svn: 196601	2013-12-06 19:38:49 +00:00
David Blaikie	2666e24ca5	DebugInfo: Ensure unit IDs (for non-skeletal units) match thein index in the list This simplifies reasoning about the code and enables simple navigation from a skeleton to its full unit. (currently there are no type unit skeletons, so the skeleton list doesn't have the same ID == index property) Eventually we should get rid of this ID and just store the labels we need as the IDs are allowing this code to create difficult to manage/understand associations (loops over non-skeletal units are implicitly referencing their skeletal units during pub* emission, for example). It may be necessary to have some kind of skeleton->full unit association and a more direct pointer or similar device would be preferable than an index. llvm-svn: 196600	2013-12-06 19:38:46 +00:00
Andrew Trick	f7760a24e5	comment grammar llvm-svn: 196585	2013-12-06 17:19:20 +00:00
Daniel Jasper	0d92abdfd2	Fix bug introduced in r196517. Not only does it trigger -Wparentheses, I think the assert actually relies on incorrect operator precedence. Also, the grammar as questionable, but I might not know enough about the problem at hand. llvm-svn: 196567	2013-12-06 08:58:22 +00:00
Aditya Nandakumar	73f3d33dbb	Check hint registers for interference only once before evictions llvm-svn: 196536	2013-12-05 21:18:40 +00:00
Matt Arsenault	79d55f5c1f	Revert part of GCC warning fix to fix debug build. The typedef is used inside the DEBUG(), and apparently can't be moved inside of it. llvm-svn: 196528	2013-12-05 20:02:18 +00:00
Matt Arsenault	c44a3ff638	Fix minor GCC warnings. Unused typedefs and unused variables. llvm-svn: 196526	2013-12-05 19:37:36 +00:00
Eric Christopher	f8194853ff	Rename DwarfUnits to DwarfFile to help avoid some naming confusion. llvm-svn: 196519	2013-12-05 18:06:10 +00:00
Andrew Trick	5a22df498e	MI-Sched: Model "reserved" processor resources. This allows a target to use MI-Sched as an in-order scheduler that will model strict resource conflicts without defining a processor itinerary. Instead, the target can now use the new per-operand machine model and define in-order resources with BufferSize=0. For example, this would allow restricting the type of operations that can be formed into a dispatch group. (Normally NumMicroOps is sufficient to enforce dispatch groups). If the intent is to model latency in in-order pipeline, as opposed to resource conflicts, then a resource with BufferSize=1 should be defined instead. This feature is only casually tested as there are no in-tree targets using it yet. However, Hal will be experimenting with POWER7. llvm-svn: 196517	2013-12-05 17:56:02 +00:00
Andrew Trick	880e573d98	MI-Sched: handle latency of in-order operations with the new machine model. The per-operand machine model allows the target to define "unbuffered" processor resources. This change is a quick, cheap way to model stalls caused by the latency of operations that use such resources. This only applies when the processor's micro-op buffer size is non-zero (Out-of-Order). We can't precisely model in-order stalls during out-of-order execution, but this is an easy and effective heuristic. It benefits cortex-a9 scheduling when using the new machine model, which is not yet on by default. MI-Sched for armv7 was evaluated on Swift (and only not enabled because of a performance bug related to predication). However, we never evaluated Cortex-A9 performance on MI-Sched in its current form. This change adds MI-Sched functionality to reach performance goals on A9. The only remaining change is to allow MI-Sched to run as a PostRA pass. I evaluated performance using a set of options to estimate the performance impact once MI sched is default on armv7: -mcpu=cortex-a9 -disable-post-ra -misched-bench -scheditins=false For a simple saxpy loop I see a 1.7x speedup. Here are the llvm-testsuite results: (min run time over 2 runs, filtering tiny changes) Speedups: \| Benchmarks/BenchmarkGame/recursive \| 52.39% \| \| Benchmarks/VersaBench/beamformer \| 20.80% \| \| Benchmarks/Misc/pi \| 19.97% \| \| Benchmarks/Misc/mandel-2 \| 19.95% \| \| SPEC/CFP2000/188.ammp \| 18.72% \| \| Benchmarks/McCat/08-main/main \| 18.58% \| \| Benchmarks/Misc-C++/Large/sphereflake \| 18.46% \| \| Benchmarks/Olden/power \| 17.11% \| \| Benchmarks/Misc-C++/mandel-text \| 16.47% \| \| Benchmarks/Misc/oourafft \| 15.94% \| \| Benchmarks/Misc/flops-7 \| 14.99% \| \| Benchmarks/FreeBench/distray \| 14.26% \| \| SPEC/CFP2006/470.lbm \| 14.00% \| \| mediabench/mpeg2/mpeg2dec/mpeg2decode \| 12.28% \| \| Benchmarks/SmallPT/smallpt \| 10.36% \| \| Benchmarks/Misc-C++/Large/ray \| 8.97% \| \| Benchmarks/Misc/fp-convert \| 8.75% \| \| Benchmarks/Olden/perimeter \| 7.10% \| \| Benchmarks/Bullet/bullet \| 7.03% \| \| Benchmarks/Misc/mandel \| 6.75% \| \| Benchmarks/Olden/voronoi \| 6.26% \| \| Benchmarks/Misc/flops-8 \| 5.77% \| \| Benchmarks/Misc/matmul_f64_4x4 \| 5.19% \| \| Benchmarks/MiBench/security-rijndael \| 5.15% \| \| Benchmarks/Misc/flops-6 \| 5.10% \| \| Benchmarks/Olden/tsp \| 4.46% \| \| Benchmarks/MiBench/consumer-lame \| 4.28% \| \| Benchmarks/Misc/flops-5 \| 4.27% \| \| Benchmarks/mafft/pairlocalalign \| 4.19% \| \| Benchmarks/Misc/himenobmtxpa \| 4.07% \| \| Benchmarks/Misc/lowercase \| 4.06% \| \| SPEC/CFP2006/433.milc \| 3.99% \| \| Benchmarks/tramp3d-v4 \| 3.79% \| \| Benchmarks/FreeBench/pifft \| 3.66% \| \| Benchmarks/Ptrdist/ks \| 3.21% \| \| Benchmarks/Adobe-C++/loop_unroll \| 3.12% \| \| SPEC/CINT2000/175.vpr \| 3.12% \| \| Benchmarks/nbench \| 2.98% \| \| SPEC/CFP2000/183.equake \| 2.91% \| \| Benchmarks/Misc/perlin \| 2.85% \| \| Benchmarks/Misc/flops-1 \| 2.82% \| \| Benchmarks/Misc-C++-EH/spirit \| 2.80% \| \| Benchmarks/Misc/flops-2 \| 2.77% \| \| Benchmarks/NPB-serial/is \| 2.42% \| \| Benchmarks/ASC_Sequoia/CrystalMk \| 2.33% \| \| Benchmarks/BenchmarkGame/n-body \| 2.28% \| \| Benchmarks/SciMark2-C/scimark2 \| 2.27% \| \| Benchmarks/Olden/bh \| 2.03% \| \| skidmarks10/skidmarks \| 1.81% \| \| Benchmarks/Misc/flops \| 1.72% \| Slowdowns: \| Benchmarks/llubenchmark/llu \| -14.14% \| \| Benchmarks/Polybench/stencils/seidel-2d \| -5.67% \| \| Benchmarks/Adobe-C++/functionobjects \| -5.25% \| \| Benchmarks/Misc-C++/oopack_v1p8 \| -5.00% \| \| Benchmarks/Shootout/hash \| -2.35% \| \| Benchmarks/Prolangs-C++/ocean \| -2.01% \| \| Benchmarks/Polybench/medley/floyd-warshall \| -1.98% \| \| Polybench/linear-algebra/kernels/3mm \| -1.95% \| \| Benchmarks/McCat/09-vor/vor \| -1.68% \| llvm-svn: 196516	2013-12-05 17:55:58 +00:00
Andrew Trick	bb1247b9f0	comment typo and reformat llvm-svn: 196513	2013-12-05 17:55:47 +00:00
David Blaikie	0504cdafaa	DwarfDebug/DwarfUnit: Push abbreviation structures down into DwarfUnits to reduce duplication llvm-svn: 196479	2013-12-05 07:43:55 +00:00
Alp Toker	f907b891da	Correct word hyphenations This patch tries to avoid unrelated changes other than fixing a few hyphen-related ambiguities and contractions in nearby lines. llvm-svn: 196471	2013-12-05 05:44:44 +00:00
Rafael Espindola	d50dbc783b	Try harder to get a consistent floating point results. This just extends the existing hack. It should be enough to get a reproducible bootstrap on 32 bits. I will open a bug to track getting a real fix for this. llvm-svn: 196462	2013-12-05 04:14:33 +00:00
David Blaikie	ff3ab2c222	DwarfDebug: Avoid unnecessary abbreviation lookup when emitting DIEs DIEs already contain references directly to their DIEAbbrev, use that instead of looking it up based on index. llvm-svn: 196446	2013-12-05 01:01:41 +00:00
David Blaikie	9a0b402972	DwarfDebug: Remove trivial function wrapper llvm-svn: 196445	2013-12-05 01:01:37 +00:00
Eric Christopher	b9a69f6129	80-column. llvm-svn: 196442	2013-12-05 00:36:21 +00:00
Eric Christopher	c31fe2de4a	Remove special handling for DW_AT_ranges support by constructing the values with the correct behavior. llvm-svn: 196441	2013-12-05 00:36:17 +00:00
Eric Christopher	1c70b6795b	Fix comment. llvm-svn: 196437	2013-12-05 00:13:15 +00:00
David Blaikie	6896e190cf	DwarfUnit: Correct comment by generalizing over all units, not just compilation units. Code review feedback on r196394 by Paul Robinson. llvm-svn: 196433	2013-12-04 23:39:02 +00:00
Eric Christopher	ad10cb51e3	Update comment. llvm-svn: 196431	2013-12-04 23:24:38 +00:00
Eric Christopher	5d008fed55	Update comment. llvm-svn: 196430	2013-12-04 23:24:28 +00:00
Eric Christopher	3b0ce937e5	Remove incorrect comment and pointless cast. llvm-svn: 196427	2013-12-04 23:05:21 +00:00
Eric Christopher	038a5e4630	const on its own line is confusing. llvm-svn: 196426	2013-12-04 22:54:45 +00:00
Eric Christopher	cb7119e097	Simplify check. llvm-svn: 196422	2013-12-04 22:29:02 +00:00
Eric Christopher	596077b363	Reformat slightly. llvm-svn: 196421	2013-12-04 22:26:43 +00:00
Eric Christopher	f8790646b2	Make RangeSpanList take a symbol for the beginning of the range rather than magically making the names match. llvm-svn: 196419	2013-12-04 22:04:50 +00:00
David Blaikie	155f88118b	DwarfDebug: Unconditionalize trivial asm comments While we still have a few (~4) non-trivial comments with string concatenation, etc that should remain conditionalized, these trivial literal comments can be simplified. llvm-svn: 196416	2013-12-04 21:51:05 +00:00
David Blaikie	3c842626ab	DwarfDebug: Reduce code duplication for sec offset emission llvm-svn: 196414	2013-12-04 21:31:26 +00:00
Eric Christopher	1cdb63db96	Couple of small logical cleanups to use !empty rather than other checks. No functional change. llvm-svn: 196412	2013-12-04 21:20:15 +00:00
Eric Christopher	270ba4a5d3	Use move and stack allocation for RangeSpanLists. As a result make a few things more const as well because we're now using const references to refer to iterators. llvm-svn: 196398	2013-12-04 19:06:58 +00:00
David Blaikie	91db9ab1b4	DebugInfo: Remove unused start/end labels for the debug_abbrevs section Since we always emit only one abbrevation section (shared by all the compilation units in this module) there's no need for a separate label at the start of each one (and we weren't using the CU ID anyway, so there really was only one label). Use the section label instead and drop the wholely unused debug_abbrev_end label. llvm-svn: 196394	2013-12-04 18:12:28 +00:00
David Blaikie	b7a1c4d33b	DebugInfo: Avoid recreating matching labels in disparate places. Instead, reuse the same MCSymbol - this should make the code easier to follow by avoiding hard to trace dependencies between different bits of code. llvm-svn: 196392	2013-12-04 17:55:41 +00:00
Eric Christopher	bfe7d29f7d	Update comment grammar and contents. llvm-svn: 196323	2013-12-03 22:05:55 +00:00
Michael Gottesman	748fe483a0	Fixed various whitespace/spelling/80+ issues. llvm-svn: 196310	2013-12-03 20:21:17 +00:00
Timur Iskhodzhanov	c05ef04f3d	Fix a typo in a comment llvm-svn: 196304	2013-12-03 18:57:43 +00:00
Timur Iskhodzhanov	1cd1444449	Reland 196270 "Generalize debug info / EH emission in AsmPrinter" Addressing the existense AMDGPUAsmPrinter and other subclasses of AsmPrinter llvm-svn: 196288	2013-12-03 15:10:23 +00:00
NAKAMURA Takumi	b927161274	Revert r196270, "Generalize debug info / EH emission in AsmPrinter" It broke CodeGen/R600 tests with +Asserts. llvm-svn: 196272	2013-12-03 13:15:54 +00:00
Timur Iskhodzhanov	4c719cf6c6	Generalize debug info / EH emission in AsmPrinter llvm-svn: 196270	2013-12-03 12:05:18 +00:00
Michael Gottesman	65bbcdfa57	Added MachineBlockFrequencyInfo::view for displaying the block frequency propagation graph via graphviz. This is useful for debugging issues in the BlockFrequency implementation since one can easily visualize where probability mass and other errors occur in the propagation. This is the MI version of r194654. llvm-svn: 196183	2013-12-03 00:49:33 +00:00
Eric Christopher	be2513e143	Refactor the handling of lexical block and inline scope ranges into a single function. No functional change. llvm-svn: 196181	2013-12-03 00:45:59 +00:00
Eric Christopher	44e66c1354	Update doxygen tags. llvm-svn: 196180	2013-12-03 00:45:56 +00:00
Eric Christopher	77913e039c	Reorder member function declarations to match source order. llvm-svn: 196179	2013-12-03 00:45:54 +00:00
Eric Christopher	0f63d06d64	Make ranges and range lists be a discrete entity that can be located and emitted per function and CU. Begins coalescing ranges as a first class entity through debug info. No functional change. llvm-svn: 196178	2013-12-03 00:45:45 +00:00
Rafael Espindola	04867ce9b0	Convert two char* that are only ever used as booleans to bool. llvm-svn: 196168	2013-12-02 23:04:51 +00:00
David Blaikie	68d7e762c3	Remove unnecessary/commented-out header inclusion. Review feedback from Eric Christopher on r196140 llvm-svn: 196160	2013-12-02 22:11:08 +00:00
David Blaikie	2a80e4426c	DebugInfo: Rename generic unit references to "TheU" instead of TheCU now that they might be type units instead of compile units. CR feedback from Eric Christopher on r196139. llvm-svn: 196159	2013-12-02 22:09:48 +00:00
David Blaikie	2c86a72331	DebugInfo: Rename DwarfCompileUnit.* to DwarfUnit.* to match their contents. llvm-svn: 196140	2013-12-02 19:33:15 +00:00
David Blaikie	319a05f78d	DebugInfo: Refactor CompileUnit into a Unit baseclass and CompileUnit/TypeUnit derived classes. Header/cpp file rename to follow immediately - just splitting out the commits for ease of review/reading to demonstrate that the renaming changes are entirely mechanical. llvm-svn: 196139	2013-12-02 19:33:10 +00:00
David Blaikie	3c1d33241c	DebugInfo: Type Units: Propagate the correct DW_AT_language into type units. llvm-svn: 196130	2013-12-02 18:44:29 +00:00
Rafael Espindola	f4e6b29a03	Move getSymbolWithGlobalValueBase to TargetLoweringObjectFile. This allows it to be used in TargetLoweringObjectFileImpl.cpp. llvm-svn: 196117	2013-12-02 16:25:47 +00:00
Andrew Trick	c2ab53a318	Reverse the order of eviction checks for possible compile time savings. No functionality. llvm-svn: 195969	2013-11-29 23:49:38 +00:00
Lang Hames	7468daadda	Teach LocalStackSlotAllocation that stackmaps/patchpoints don't have range constraints on their frame offsets. llvm-svn: 195950	2013-11-29 06:35:30 +00:00
Lang Hames	c8a73af391	Remove unused variable from r195944. llvm-svn: 195945	2013-11-29 03:36:53 +00:00
Lang Hames	39609996d9	Refactor a lot of patchpoint/stackmap related code to simplify and make it target independent. Most of the x86 specific stackmap/patchpoint handling was necessitated by the use of the native address-mode format for frame index operands. PEI has now been modified to treat stackmap/patchpoint similarly to DEBUG_INFO, allowing us to use a simple, platform independent register/offset pair for frame indexes on stackmap/patchpoints. Notes: - Folding is now platform independent and automatically supported. - Emiting patchpoints with direct memory references now just involves calling the TargetLoweringBase::emitPatchPoint utility method from the target's XXXTargetLowering::EmitInstrWithCustomInserter method. (See X86TargetLowering for an example). - No more ugly platform-specific operand parsers. This patch shouldn't change the generated output for X86. llvm-svn: 195944	2013-11-29 03:07:54 +00:00
Rafael Espindola	61b3d0c1fb	Remove an always true parameter. llvm-svn: 195931	2013-11-28 19:35:07 +00:00
David Blaikie	bc7e0d43bf	DebugInfo: Do not include variables only referenced by templates in aranges. ARanges included even extern variables referenced by pointer non-type template parameters even though that variable isn't part of this compilation unit. llvm-svn: 195895	2013-11-27 23:53:52 +00:00
Lang Hames	fde8e4b7c9	Show stackmap entry encodings in stackmap debug logs. This makes it easier to cross-reference debug output with encoded stack-maps, and to create stackmap test-cases. llvm-svn: 195874	2013-11-27 20:10:16 +00:00
Rafael Espindola	3c8e147a6b	Use the same tls section name as msvc. We currently error in clang with: "error: thread-local storage is unsupported for the current target", but we can start to get the llvm level ready. When compiling template<typename T> struct foo { static __declspec(thread) int bar; }; template<typename T> __declspec(therad) int foo<T>::bar; template struct foo<int>; msvc produces SECTION HEADER #3 .tls$ name 0 physical address 0 virtual address 4 size of raw data 12F file pointer to raw data (0000012F to 00000132) 0 file pointer to relocation table 0 file pointer to line numbers 0 number of relocations 0 number of line numbers C0301040 flags Initialized Data COMDAT; sym= "public: static int foo<int>::bar" (?bar@?$foo@H@@2HA) 4 byte align Read Write gcc produces a ".data$__emutls_v.<symbol>" for the testcase with __declspec(thread) replaced with thread_local. llvm-svn: 195849	2013-11-27 15:52:11 +00:00
Rafael Espindola	2d30ae2be9	Use simple section names for COMDAT sections on COFF. With this patch we use simple names for COMDAT sections (like .text or .bss). This matches the MSVC behavior. When merging it is the COMDAT symbol that is used to decide if two sections should be merged, so there is no point in building a fancy name. This survived a bootstrap on mingw32. llvm-svn: 195798	2013-11-27 01:18:37 +00:00
Eric Christopher	f52eddf9ca	80-column fixups. llvm-svn: 195790	2013-11-26 22:23:27 +00:00
David Blaikie	fd1eff5a0a	DwarfDebug: Include type units in accelerator tables. Since type units aren't in the CUMap, use the DwarfUnits list to iterate over units for tasks such as accelerator table building. llvm-svn: 195776	2013-11-26 19:14:34 +00:00
Timur Iskhodzhanov	119f307317	Rename DwarfException methods so the new names are consistent with DwarfDebug and the style guide llvm-svn: 195763	2013-11-26 13:34:55 +00:00
Andrew Trick	391dbadb51	StackMap: Implement support for DirectMemRefOp. A Direct stack map location records the address of frame index. This address is itself the value that the runtime requested. This differs from IndirectMemRefOp locations, which refer to a stack locations from which the requested values must be loaded. Direct locations can directly communicate the address if an alloca, while IndirectMemRefOp handle register spills. For example: entry: %a = alloca i64... llvm.experimental.stackmap(i32 <ID>, i32 <shadowBytes>, i64* %a) Since both the alloca and stackmap intrinsic are in the entry block, and the intrinsic takes the address of the alloca, the runtime can assume that LLVM will not substitute alloca with any intervening value. This must be verified by the runtime by checking that the stack map's location is a Direct location type. The runtime can then determine the alloca's relative location on the stack immediately after compilation, or at any time thereafter. This differs from Register and Indirect locations, because the runtime can only read the values in those locations when execution reaches the instruction address of the stack map. llvm-svn: 195712	2013-11-26 02:03:25 +00:00
David Blaikie	fbd29eb3b6	DebugInfo: Remove CompileUnit::constructTypeDIEImpl now that it's just a simple wrapper again. r195698 moved the type unit checking up into getOrCreateTypeDIE so remove the redundant check and fold the functions back together again. llvm-svn: 195700	2013-11-26 00:35:04 +00:00
David Blaikie	8a263cbc99	DebugInfo: Avoid emitting pubtype entries for type DIEs that just indirect to a type unit. llvm-svn: 195698	2013-11-26 00:22:37 +00:00
David Blaikie	9d861bed9b	DebugInfo: Pubtypes: Coelesce pubtype registration with accelerator type registration. It might be possible to eventually use one data structure, but I haven't looked at the exact criteria used for accelerator tables and pubtypes to see if there's good reason for the differences between the two or not. llvm-svn: 195696	2013-11-26 00:15:27 +00:00
Bill Wendling	9200bb08f9	Unrevert r195599 with testcase fix. I'm not sure how it was checking for the wrong values... PR18023. llvm-svn: 195670	2013-11-25 18:05:22 +00:00
Amara Emerson	f59125f5bb	Revert r195599 as it broke the builds. llvm-svn: 195636	2013-11-25 11:24:18 +00:00
Daniel Sanders	b021c6fdbd	Fixed tryFoldToZero() for vector types that need expansion. Summary: Moved the requirement for SelectionDAG::getConstant() to return legally typed nodes slightly earlier. There were two optional DAGCombine passes that were missed out and were required to produce type-legal DAGs. Simplified a code-path in tryFoldToZero() to use SelectionDAG::getConstant(). This provides support for both promoted and expanded vector types whereas the previous code only supported promoted vector types. Fixes a "Type for zero vector elements is not legal" assertion detected by an llvm-stress generated test. Reviewers: resistor CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2251 llvm-svn: 195635	2013-11-25 11:14:43 +00:00
Bill Wendling	e3c48709ed	Don't look past volatile loads. A volatile load should block us from trying to coalesce stores. PR18023 llvm-svn: 195599	2013-11-25 05:01:21 +00:00
Chandler Carruth	260258b9c0	Output a bit more information in the debug printing for MBP. This was useful when analyzing parts of zlib's behavior here. llvm-svn: 195588	2013-11-25 00:43:41 +00:00
David Blaikie	72f1a3ec76	DwarfDebug: Move ownership of CompileUnits into DwarfUnits This avoids the need for an extra list of SkeletonCUs and associated cleanup while staging things to be cleaner for further type unit improvements. Also hopefully fixes a memory leak introduced in r195166. llvm-svn: 195536	2013-11-23 01:17:34 +00:00
Eric Christopher	4751d701b7	Refactor DW_AT_ranges handling to use labels for ranges rather than a non-relocatable number offset. One fixme to make the ranges as discrete data structures and have range lists explicitly represented rather than as a list of symbols. llvm-svn: 195523	2013-11-23 00:05:29 +00:00
Eric Christopher	f8da6aa7c7	Reformat const for readability. llvm-svn: 195522	2013-11-23 00:05:06 +00:00
Paul Robinson	d89125a5d8	Teach ISel not to optimize 'optnone' functions (revised). Improvements over r195317: - Set/restore EnableFastISel flag instead of just running FastISel within SelectAllBasicBlocks; the flag is checked in various places, and FastISel won't run properly if those places don't do the right thing. - Test looks for normal ISel versus FastISel behavior, and not something more subtle that doesn't work everywhere. Based on work by Andrea Di Biagio. llvm-svn: 195491	2013-11-22 19:11:24 +00:00
Andrew Trick	059e800fda	DEBUG shouldEvict decisions llvm-svn: 195490	2013-11-22 19:07:42 +00:00
Andrew Trick	3621b8a217	Minor cleanup. EvictionCost ctor was confusing relative to the other costs floating around in the code. llvm-svn: 195489	2013-11-22 19:07:38 +00:00
Andrew Trick	4a1abb7ab5	patchpoint: factor SD builder code for live vars. Plain stackmap also optimizes Constant values now. llvm-svn: 195488	2013-11-22 19:07:36 +00:00
Andrew Trick	a2428e0f40	patchpoint: eliminate hard coded operand indices. llvm-svn: 195487	2013-11-22 19:07:33 +00:00
Tom Stellard	06c67bcbe4	SelectionDAG: Optimize expansion of vec_type = BITCAST scalar_type The legalizer can now do this type of expansion for more type combinations without loading and storing to and from the stack. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195398	2013-11-22 00:41:05 +00:00
Tom Stellard	9cbd2c5581	Split SETCC if VSELECT requires splitting too. This patch is a rewrite of the original patch commited in r194542. Instead of relying on the type legalizer to do the splitting for us, we now peform the splitting ourselves in the DAG combiner. This is necessary for the case where the vector mask is a legal type after promotion and still wouldn't require splitting. Patch by: Juergen Ributzka NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195397	2013-11-22 00:39:23 +00:00
Eric Christopher	33ff697cb1	In Dwarf 3 (and Dwarf 2) attributes whose value are offsets into a section use the form DW_FORM_data4 whilst in Dwarf 4 and later they use the form DW_FORM_sec_offset. This patch updates the places where such attributes are generated to use the appropriate form depending on the Dwarf version. The DIE entries affected have the following tags: DW_AT_stmt_list, DW_AT_ranges, DW_AT_location, DW_AT_GNU_pubnames, DW_AT_GNU_pubtypes, DW_AT_GNU_addr_base, DW_AT_GNU_ranges_base It also adds a hidden command line option "--dwarf-version=<uint>" to llc which allows the version of Dwarf to be generated to override what is specified in the metadata; this makes it possible to update existing tests to check the debugging information generated for both Dwarf 4 (the default) and Dwarf 3 using the same metadata. Patch (slightly modified) by Keith Walker! llvm-svn: 195391	2013-11-21 23:46:41 +00:00
Eric Christopher	0a13eb38c8	Move member variable up to where the rest of non-DWARF5 variables reside. llvm-svn: 195380	2013-11-21 22:56:11 +00:00
Daniel Sanders	edc071b815	Add support for legalizing SETNE/SETEQ by inverting the condition code and the result of the comparison. Summary: LegalizeSetCCCondCode can now legalize SETEQ and SETNE by returning the inverse condition and requesting that the caller invert the result of the condition. The caller of LegalizeSetCCCondCode must handle the inverted CC, and they do so as follows: SETCC, BR_CC: Invert the result of the SETCC with SelectionDAG::getNOT() SELECT_CC: Swap the true/false operands. This is necessary for MSA which lacks an integer SETNE instruction. Reviewers: resistor CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2229 llvm-svn: 195355	2013-11-21 13:24:49 +00:00
NAKAMURA Takumi	43aa939625	Revert r195317 (and r195333), "Teach ISel not to optimize 'optnone' functions." It broke, at least, i686 target. It is reproducible with "llc -mtriple=i686-unknown". FYI, it didn't appear to add either "-O0" or "-fast-isel". llvm-svn: 195339	2013-11-21 10:55:15 +00:00
Paul Robinson	b379efeb53	Teach ISel not to optimize 'optnone' functions. Based on work by Andrea Di Biagio. llvm-svn: 195317	2013-11-21 06:33:32 +00:00
Eric Christopher	a16725b6b6	Move DebugInfoOffset member near the other data member it helps describe. llvm-svn: 195299	2013-11-21 01:29:16 +00:00
Eric Christopher	4affe8ce3e	Reflow some documentation and remove whitespace comments. Move DebugInfoOffset data member up with the rest of the data members. llvm-svn: 195298	2013-11-21 01:29:13 +00:00
Eric Christopher	9f9b304caf	Add more documenation for the lookup tables data members. llvm-svn: 195297	2013-11-21 01:16:31 +00:00
Eric Christopher	bca5c63d04	Reorder language in the CompileUnit description and add a comment. Language may only be a temporary addition. llvm-svn: 195296	2013-11-21 01:14:00 +00:00
Eric Christopher	d89221e7e3	Update comment. llvm-svn: 195293	2013-11-21 01:01:30 +00:00
Eric Christopher	0fe676a243	Constify the DIEs used for pubname and pubtype tables. Propagate through findAttribute etc. llvm-svn: 195290	2013-11-21 00:48:22 +00:00
Benjamin Kramer	c8160d6523	MachineBlockPlacement: Strengthen the source order bias when picking an exit block. We now only allow breaking source order if the exit block frequency is significantly higher than the other exit block. The actual bias is currently under a flag so the best cut-off can be found; the flag defaults to the old behavior. The idea is to get some benchmark coverage over different values for the flag and pick the best one. When we require the new frequency to be at least 20% higher than the old frequency I see a 5% speedup on zlib's deflate when compressing a random file on x86_64/westmere. Hal reported a small speedup on Fhourstones on a BG/Q and no regressions in the test suite. The test case is the full long_match function from zlib's deflate. I was reluctant to add it for previous tweaks to branch probabilities because it's large and potentially fragile, but changed my mind since it's an important use case and more likely to break with all the current work going into the PGO infrastructure. Differential Revision: http://llvm-reviews.chandlerc.com/D2202 llvm-svn: 195265	2013-11-20 19:08:44 +00:00
David Blaikie	beee345ab0	DwarfCompileUnit: Initialize DebugInfoOffset. While not strictly necessary (the class has an invariant that "setDebugInfoOffset" is called before "getDebugInfoOffset" - anyone client that actually gets the default zero offset is buggy/broken) this is consistent with the code as originally written and the removal of the initialization was an accident in r195166. Suggested by Manman Ren. llvm-svn: 195263	2013-11-20 18:52:39 +00:00
David Blaikie	bcb418e56f	CR feedback for r195166: Add comments regarding type unit mapping and type units disabling cross-CU sharing. Changes suggested by Manman Ren. llvm-svn: 195262	2013-11-20 18:40:16 +00:00
Eric Christopher	3262a11680	Remove polymorphic destruction for DIE. DIEBlocks are owned elsewhere and not polymorphically deleted and they are the only thing that derive from DIE. llvm-svn: 195183	2013-11-20 00:54:31 +00:00
Eric Christopher	b7dee8a606	Remove capability for polymorphic destruction from LexicalScope and LexicalScopes, we're not using it. llvm-svn: 195182	2013-11-20 00:54:28 +00:00
Eric Christopher	9d7d5da6a1	Grammar. llvm-svn: 195181	2013-11-20 00:54:25 +00:00
Eric Christopher	6211e4b995	Formatting, 80-col, trailing whitespace. llvm-svn: 195180	2013-11-20 00:54:19 +00:00
Jack Carter	d4b22dcbf3	long line correction llvm-svn: 195179	2013-11-20 00:32:32 +00:00
Aditya Nandakumar	c1fd0dd419	Fixed an extra for(typo) in the comments llvm-svn: 195171	2013-11-19 23:51:32 +00:00
Jack Carter	5c0af48a11	long lines and white space correction llvm-svn: 195170	2013-11-19 23:43:22 +00:00
David Blaikie	409dd9c34a	DebugInfo: Partial implementation of DWARF type units. Emit DW_TAG_type_units into the debug_info section using compile unit headers. This is bogus/unusable by debuggers, but testable and provides more isolated review. Subsequent patches will include support for type unit headers and emission into the debug_types section, as well as comdat grouping the types based on their hash. Also the CompileUnit type will be renamed 'Unit' and relevant portions pulled out into respective CompileUnit and TypeUnit types. llvm-svn: 195166	2013-11-19 23:08:21 +00:00
David Blaikie	2ea848b972	DebugInfo: Constify accelerator table handling, and separate type accelarator insertion in preparation for a second use of this code from type units. llvm-svn: 195164	2013-11-19 22:51:04 +00:00
Juergen Ributzka	b34871027f	[DAG] Refactor vector splitting code in SelectionDAG. No functional change intended. Reviewed by Tom llvm-svn: 195156	2013-11-19 21:20:17 +00:00
Rafael Espindola	60ec3836a2	Support multiple COFF sections with the same name but different COMDAT. This is the first step to fix pr17918. It extends the .section directive a bit, inspired by what the ELF one looks like. The problem with using linkonce is that given .section foo .linkonce.... .section foo .linkonce we would already have switched sections when getting to .linkonce. The cleanest solution seems to be to add the comdat information in the .section itself. llvm-svn: 195148	2013-11-19 19:52:52 +00:00
Andrew Trick	e6bf45cdae	Obvious pasto survived a couple rounds of cleanup. Caught by Aaron Ballman. llvm-svn: 195138	2013-11-19 18:29:45 +00:00
Eric Christopher	a07e4f5b0f	Formatting and 80-col. llvm-svn: 195122	2013-11-19 09:28:34 +00:00
Eric Christopher	65132a8c2c	Fix comment. llvm-svn: 195121	2013-11-19 09:11:26 +00:00
Eric Christopher	9a8f5eddad	Refactor the section emission code to remove duplicates now that we can emit various sections in any order. No functional change. llvm-svn: 195120	2013-11-19 09:04:50 +00:00
Eric Christopher	b4bef6d254	Reformat file. llvm-svn: 195119	2013-11-19 09:04:36 +00:00
Andrew Trick	1f54e805f2	Fix patchpoint comments. llvm-svn: 195103	2013-11-19 05:05:43 +00:00
Andrew Trick	d4e3dc6d14	Add an abstraction to handle patchpoint operands. Hard-coded operand indices were scattered throughout lowering stages and layers. It was super bug prone. llvm-svn: 195093	2013-11-19 03:29:56 +00:00
Juergen Ributzka	d12ccbd343	[weak vtables] Remove a bunch of weak vtables This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. The memory leaks in this version have been fixed. Thanks Alexey for pointing them out. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 195064	2013-11-19 00:57:56 +00:00
David Blaikie	e26a3774c6	DwarfDebug: Move trailing else to the same line as prior closing brace llvm-svn: 195060	2013-11-18 23:59:04 +00:00
David Blaikie	5af2aca274	DwarfDebug: Remove some more redundant explicit constructions. llvm-svn: 195059	2013-11-18 23:57:26 +00:00
David Blaikie	4f6bf27ae4	DebugInfo: Simplify a few more explicit constructions, underconstrained types, and make DIType(MDNode) explicit like all the other DI node ctors. llvm-svn: 195055	2013-11-18 23:33:32 +00:00
Alexey Samsonov	49109a279c	Revert r194865 and r194874. This change is incorrect. If you delete virtual destructor of both a base class and a subclass, then the following code: Base *foo = new Child(); delete foo; will not cause the destructor for members of Child class. As a result, I observe plently of memory leaks. Notable examples I investigated are: ObjectBuffer and ObjectBufferStream, AttributeImpl and StringSAttributeImpl. llvm-svn: 194997	2013-11-18 09:31:53 +00:00
David Blaikie	2c8d5ec14c	Remove unnecessary temporary construction. llvm-svn: 194981	2013-11-17 21:59:31 +00:00
David Blaikie	3c0e6bbc37	Remove redundant explicit default initialization. llvm-svn: 194980	2013-11-17 21:57:33 +00:00
David Blaikie	a781b25ba5	DwarfCompileUnit: Add type safety to createGlobalVariableDIE llvm-svn: 194979	2013-11-17 21:55:13 +00:00
Bill Wendling	25b61dbac0	Revert "Micro-optimization" This reverts commit f1d9fe9d04ce93f6d5dcebbd2cb6a07414d7a029. This was causing PR17964. We need to use thread data before regular data. llvm-svn: 194960	2013-11-17 10:53:13 +00:00
Benjamin Kramer	bb1dd73d3e	DAGCombiner: Partially revert r192795, getNOT was fixed not to create illegal constants. llvm-svn: 194959	2013-11-17 10:40:03 +00:00
Matt Arsenault	64283bd99c	Use more getZExtOrTruncs llvm-svn: 194945	2013-11-17 02:31:26 +00:00
Matt Arsenault	873bb3ea86	Use getZExtOrTrunc instead of repeating the same logic. llvm-svn: 194944	2013-11-17 02:24:21 +00:00
Andrew Trick	10d5be4e6e	Added a size field to the stack map record to handle subregister spills. Implementing this on bigendian platforms could get strange. I added a target hook, getStackSlotRange, per Jakob's recommendation to make this as explicit as possible. llvm-svn: 194942	2013-11-17 01:36:23 +00:00
Matt Arsenault	36f5eb5949	Use right address space pointer size llvm-svn: 194940	2013-11-17 00:06:39 +00:00
Matt Arsenault	dfb3e7092e	Fix assert on unaligned access to global with different address space size. llvm-svn: 194934	2013-11-16 20:50:54 +00:00
Matt Arsenault	19231e630e	Fix codegen for null different sized pointer. llvm-svn: 194932	2013-11-16 20:24:41 +00:00
David Blaikie	52c5020dae	DwarfCompileUnit: Push type safety of DIDescriptor through CompileUnit::createAndAddDIE. llvm-svn: 194902	2013-11-16 00:29:01 +00:00
David Blaikie	eb0338feb1	DwarfCompileUnit: Remove unnecessary OwningPtr<T>::get() call llvm-svn: 194901	2013-11-16 00:28:15 +00:00
Eric Christopher	d0b82aea8c	For dwarf4 use the correct form for referencing debug_loc locations, and update test cases accordingly. This doesn't affect the output dumped using llvm-dwarfdump, but readelf does now dump the debug_loc section. llvm-svn: 194898	2013-11-16 00:18:40 +00:00
David Blaikie	b01f13ecf6	DwarfCompileUnit: Add type safety to CompileUnit::getNode by returning DICompileUnit instead of a raw MDNode*. llvm-svn: 194895	2013-11-15 23:54:45 +00:00
David Blaikie	5a15240ef7	DwarfCompileUnit: Add type safety by using DICompileUnit rather than raw MDNode* for the CU metadata node llvm-svn: 194893	2013-11-15 23:52:02 +00:00
David Blaikie	cb8e435ba4	DwarfCompileUnit: Simplify getLanguage() calls to use existing member function llvm-svn: 194892	2013-11-15 23:50:53 +00:00
Adrian Prantl	4583f7d51a	Replace the dangling context hotfix with an assertion. llvm-svn: 194883	2013-11-15 23:21:39 +00:00
David Blaikie	25bc7198b2	DwarfDebug: Push DISubprogram through updateSubprogramScopeDIE llvm-svn: 194879	2013-11-15 23:13:08 +00:00
David Blaikie	2ad0016e53	DwarfCompileUnit: Push DIDescriptors through a getDIE/insertDIE llvm-svn: 194875	2013-11-15 23:09:13 +00:00
David Blaikie	4201ddf368	DwarfCompileUnit: Push DIDescriptor usage out from isShareableAcrossCUs This is the first of a few similar patches. We'll see how far it goes/makes sense. llvm-svn: 194871	2013-11-15 22:59:36 +00:00
Juergen Ributzka	dbedae89b9	[weak vtables] Remove a bunch of weak vtables This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 194865	2013-11-15 22:34:48 +00:00
Matt Arsenault	23c9274b1a	Fix confusing machine verifier error. The error reported the number of explicit operands, but that isn't what is checked. In my case, this resulted in the confusing errors "Too few operands." followed shortly by "8 operands expected, but 8 given." llvm-svn: 194862	2013-11-15 22:18:19 +00:00
Adrian Prantl	7d828bbe46	Reimplement r194843 in a slightly less broken way. llvm-svn: 194848	2013-11-15 21:05:09 +00:00
Adrian Prantl	fc0fea0251	Restore the behaviour from before r194728. If getDIE() fails, getOrCreateContextDIE() should also return the CUDie. llvm-svn: 194843	2013-11-15 19:53:23 +00:00
Bob Wilson	9f3e6b25ee	Avoid illegal integer promotion in fastisel Stop folding constant adds into GEP when the type size doesn't match. Otherwise, the adds' operands are effectively being promoted, changing the conditions of an overflow. Results are different when: sext(a) + sext(b) != sext(a + b) Problem originally found on x86-64, but also fixed issues with ARM and PPC, which used similar code. <rdar://problem/15292280> Patch by Duncan Exon Smith! llvm-svn: 194840	2013-11-15 19:09:27 +00:00
Daniel Sanders	50b8041066	Fix illegal DAG produced by SelectionDAG::getConstant() for v2i64 type Summary: When getConstant() is called for an expanded vector type, it is split into multiple scalar constants which are then combined using appropriate build_vector and bitcast operations. In addition to the usual big/little endian differences, the case where the element-order of the vector does not have the same endianness as the elements themselves is also accounted for. For example, for v4i32 on big-endian MIPS, the byte-order of the vector is <3210,7654,BA98,FEDC>. For little-endian, it is <0123,4567,89AB,CDEF>. Handling this case turns out to be a nop since getConstant() returns a splatted vector (so reversing the element order doesn't change the value) This fixes a number of cases in MIPS MSA where calling getConstant() during operation legalization introduces illegal types (e.g. to legalize v2i64 UNDEF into a v2i64 BUILD_VECTOR of illegal i64 zeros). It should also handle bigger differences between illegal and legal types such as legalizing v2i64 into v8i16. lowerMSASplatImm() in the MIPS backend no longer needs to avoid calling getConstant() so this function has been updated in the same patch. For the sake of transparency, the steps I've taken since the review are: * Added 'virtual' to isVectorEltOrderLittleEndian() as requested. This revealed that the MIPS tests were falsely passing because a polymorphic function was not actually polymorphic in the reviewed patch. * Fixed the tests that were now failing. This involved deleting the code to handle the MIPS MSA element-order (which was previously doing an byte-order swap instead of an element-order swap). This left isVectorEltOrderLittleEndian() unused and it was deleted. * Fixed build failures caused by rebasing beyond r194467-r194472. These build failures involved the bset, bneg, and bclr instructions added in these commits using lowerMSASplatImm() in a way that was no longer valid after this patch. Some of these were fixed by calling SelectionDAG::getConstant() instead, others were fixed by a new function getBuildVectorSplat() that provided the removed functionality of lowerMSASplatImm() in a more sensible way. Reviewers: bkramer Reviewed By: bkramer CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1973 llvm-svn: 194811	2013-11-15 12:56:49 +00:00
Matt Arsenault	c5559bb14b	Add target hook to prevent folding some bitcasted loads. This is to avoid this transformation in some cases: fold (conv (load x)) -> (load (conv*)x) On architectures that don't natively support some vector loads efficiently casting the load to a smaller vector of larger types and loading is more efficient. Patch by Micah Villmow. llvm-svn: 194783	2013-11-15 04:42:23 +00:00
Eric Christopher	34a2c8718f	Use a reference rather than a pointer as we don't expect a NULL DbgVariable. No functional change. llvm-svn: 194761	2013-11-15 01:43:19 +00:00
Matt Arsenault	b03bd4d96b	Add addrspacecast instruction. Patch by Michele Scandale! llvm-svn: 194760	2013-11-15 01:34:59 +00:00
Andrew Trick	a9f4d928ab	When folding memory operands, preserve existing MachineMemOperands. This comes into play with patchpoint, which can fold multiple operands. Since the patchpoint is already treated as a call, the machine mem operands won't affect anything, and there's nothing to test. But we still want to do the right thing here to be sure that our MIs obey the rules. llvm-svn: 194750	2013-11-14 23:45:04 +00:00
David Blaikie	32887559c4	DebugInfo: Simplify/narrow null-check for getOrCreateType llvm-svn: 194737	2013-11-14 22:25:02 +00:00
David Blaikie	bd700e47ca	DwarfCompileUnit::getOrCreateContext: Return the compile unit DIE rather than null. llvm-svn: 194728	2013-11-14 21:24:34 +00:00
David Blaikie	1dbca7018e	Remove unnecessary 'else' after return. llvm-svn: 194724	2013-11-14 19:37:56 +00:00
Rafael Espindola	4929301af4	Error if we see an alias to a declaration. In ELF and COFF an alias is just another offset in a section. There is no way to represent an alias to something in another file. In MachO, the spec has the N_INDR type which should allow for exactly that, but is not currently implemented. Given that it is specified but not implemented, we error in codegen to avoid miscompiling but don't reject aliases to declarations in the verifier to leave the option open of implementing it. In the past we have used alias to declarations as a way of implementing weakref, which is why it exists in some old tests which this patch updates. llvm-svn: 194705	2013-11-14 13:58:06 +00:00
Andrew Trick	561f2218e0	Minor extension to llvm.experimental.patchpoint: don't require a call. If a null call target is provided, don't emit a dummy call. This allows the runtime to reserve as little nop space as it needs without the requirement of emitting a call. llvm-svn: 194676	2013-11-14 06:54:10 +00:00
David Blaikie	9208b5ed8e	DIEHash: Move header include to be first in the implementation file to flush out header inclusion ordering issues llvm-svn: 194588	2013-11-13 18:07:27 +00:00
Juergen Ributzka	34c652d34d	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. This patch reapplies r193676 with an additional fix for the Hexagon backend. The SystemZ backend has already been fixed by r194148. The Type Legalizer recognizes that VSELECT needs to be split, because the type is to wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask type for the given target. Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. Reviewed by Nadav llvm-svn: 194542	2013-11-13 01:57:54 +00:00
Aaron Ballman	04999041e8	Replacing HUGE_VALF with llvm::huge_valf in order to work around a warning triggered in MSVC 12. Patch reviewed by Reid Kleckner and Jim Grosbach. llvm-svn: 194533	2013-11-13 00:15:44 +00:00
Arnaud A. de Grandmaison	f5f040fa1e	CalcSpillWeights: allow overidding the spill weight normalizing function This will enable the PBQP register allocator to provide its own normalizing function. No functionnal change. llvm-svn: 194417	2013-11-11 19:56:14 +00:00
Arnaud A. de Grandmaison	ea3ac1612c	CalcSpillWeights: give a better describing name to calculateSpillWeights Besides, this relates it more obviously to the VirtRegAuxInfo::calculateSpillWeightAndHint. No functionnal change. llvm-svn: 194404	2013-11-11 19:04:45 +00:00
Eric Christopher	aeb105f9fe	Unify the adding of enumerators with the construction of the enumeration. llvm-svn: 194401	2013-11-11 18:52:39 +00:00
Eric Christopher	98b7f17c72	Formatting. llvm-svn: 194400	2013-11-11 18:52:36 +00:00
Eric Christopher	e6c6c4d36b	80-col. llvm-svn: 194399	2013-11-11 18:52:33 +00:00
Eric Christopher	df9955dd89	Just pass the DIComposite type by value instead of by pointer. llvm-svn: 194398	2013-11-11 18:52:31 +00:00
Daniel Sanders	a1840d2f88	Vector forms of SHL, SRA, and SRL can be constant folded using SimplifyVBinOp too Reviewers: dsanders Reviewed By: dsanders CC: llvm-commits, nadav Differential Revision: http://llvm-reviews.chandlerc.com/D1958 llvm-svn: 194393	2013-11-11 17:23:41 +00:00
Arnaud A. de Grandmaison	760c1e0b0a	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was though that CalcSpillWeights does not need to be a pass. This change will enable to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functionnal change. llvm-svn: 194356	2013-11-10 17:46:31 +00:00
Matt Arsenault	c900303e2f	Use type form of getIntPtrType. This should be inconsequential and is work towards removing the default address space arguments. llvm-svn: 194347	2013-11-10 04:46:57 +00:00
Lang Hames	fb82630a91	Re-apply r194300 with fixes for warnings. llvm-svn: 194311	2013-11-09 03:08:56 +00:00
Nick Lewycky	59886d00ec	Revert r194300 which broke the build. llvm-svn: 194308	2013-11-09 02:01:25 +00:00
Juergen Ributzka	87ed906b2e	[Stackmap] Materialize the jump address within the patchpoint noop slide. This patch moves the jump address materialization inside the noop slide. This enables patching of the materialization itself or its complete removal. This patch also adds the ability to define scratch registers that can be used safely by the code called from the patchpoint intrinsic. At least one scratch register is required, because that one is used for the materialization of the jump address. This patch depends on D2009. Differential Revision: http://llvm-reviews.chandlerc.com/D2074 Reviewed by Andy llvm-svn: 194306	2013-11-09 01:51:33 +00:00
Lang Hames	1662b832d9	Rewrite the PBQP graph data structure. The new graph structure replaces the node and edge linked lists with vectors. Free lists (well, free vectors) are used for fast insertion/deletion. The ultimate aim is to make PBQP graphs cheap to clone. The motivation is that the PBQP solver destructively consumes input graphs while computing a solution, forcing the graph to be fully reconstructed for each round of PBQP. This imposes a high cost on large functions, which often require several rounds of solving/spilling to find a final register allocation. If we can cheaply clone the PBQP graph and incrementally update it between rounds then hopefully we can reduce this cost. Further, once we begin pooling matrix/vector values (future work), we can cache some PBQP solver metadata and share it between cloned graphs, allowing the PBQP solver to re-use some of the computation done in earlier rounds. For now this is just a data structure update. The allocator and solver still use the graph the same way as before, fully reconstructing it between each round. I expect no material change from this update, although it may change the iteration order of the nodes, causing ties in the solver to break in different directions, and this could perturb the generated allocations (hopefully in a completely benign way). Thanks very much to Arnaud Allard de Grandmaison for encouraging me to get back to work on this, and for a lot of discussion and many useful PBQP test cases. llvm-svn: 194300	2013-11-09 00:14:07 +00:00
Juergen Ributzka	9969d3e6e8	[Stackmap] Add AnyReg calling convention support for patchpoint intrinsic. The idea of the AnyReg Calling Convention is to provide the call arguments in registers, but not to force them to be placed in a paticular order into a specified set of registers. Instead it is up tp the register allocator to assign any register as it sees fit. The same applies to the return value (if applicable). Differential Revision: http://llvm-reviews.chandlerc.com/D2009 Reviewed by Andy llvm-svn: 194293	2013-11-08 23:28:16 +00:00
Pedro Artigas	71f87cb33a	increase the accuracy of register pressure computation in the presence of dead definitions by using live intervals, if available, to identify dead definitions and proceed accordingly. llvm-svn: 194286	2013-11-08 22:46:28 +00:00
Lang Hames	8a065703ef	Fix some minor issues with r194282 to get the tree healthy again. llvm-svn: 194284	2013-11-08 22:30:52 +00:00
Lang Hames	3078977d28	Add a method to get the object-file appropriate stack map section. Thanks to Eric Christopher for the tips on the appropriate way to do this. llvm-svn: 194282	2013-11-08 22:14:49 +00:00
Arnaud A. de Grandmaison	f7a60a8e01	Revert "CalculateSpillWeights does not need to be a pass" Temporarily revert my previous commit until I understand why it breaks 3 target tests. llvm-svn: 194272	2013-11-08 18:19:19 +00:00
Quentin Colombet	b06a0ed4b0	[VirtRegMap] Fix for PR17825. Do not ignore noreturn definitions when setting isPhysRegUsed if the unwind information is required. Indeed, the runtime may need a correct stack to be able to unwind the call. llvm-svn: 194271	2013-11-08 18:14:17 +00:00
Arnaud A. de Grandmaison	ed812f6590	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was though that CalcSpillWeights does not need to be a pass. This change will enable to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functionnal change. llvm-svn: 194269	2013-11-08 17:56:29 +00:00
Arnaud A. de Grandmaison	3b52f0b135	CalculateSpillWeights cleanup: remove unneeded includes llvm-svn: 194259	2013-11-08 15:13:05 +00:00
Andrew Trick	6664df12fb	Slightly change the way stackmap and patchpoint intrinsics are lowered. MorphNodeTo is not safe to call during DAG building. It eagerly deletes dependent DAG nodes which invalidates the NodeMap. We could expose a safe interface for morphing nodes, but I don't think it's worth it. Just create a new MachineNode and replaceAllUsesWith. My understaning of the SD design has been that we want to support early target opcode selection. That isn't very well supported, but generally works. It seems reasonable to rely on this feature even if it isn't widely used. llvm-svn: 194102	2013-11-05 22:44:04 +00:00
Eric Christopher	fedfa44922	Comment some and reformat for clarity beginFunction. llvm-svn: 193894	2013-11-01 23:14:17 +00:00
Juergen Ributzka	359c532d36	[Stackmap] Remove erroneous assert. llvm-svn: 193871	2013-11-01 17:53:27 +00:00
Rafael Espindola	716e7405d3	Remove linkonce_odr_auto_hide. linkonce_odr_auto_hide was in incomplete attempt to implement a way for the linker to hide symbols that are known to be available in every TU and whose addresses are not relevant for a particular DSO. It was redundant in that it all its uses are equivalent to linkonce_odr+unnamed_addr. Unlike those, it has never been connected to clang or llvm's optimizers, so it was effectively dead. Given that nothing produces it, this patch just nukes it (other than the llvm-c enum value). llvm-svn: 193865	2013-11-01 17:09:14 +00:00
Aaron Ballman	2b7a733b16	Commenting out this assert because it is causing the build bots to fail. This effectively reverts r193861, but needs to be fixed as part of r193769. llvm-svn: 193862	2013-11-01 15:12:23 +00:00
Aaron Ballman	96321aa523	Fixing an order of evaluation error in an assert. llvm-svn: 193861	2013-11-01 14:53:14 +00:00
David Blaikie	71d34a2eef	DebugInfo: Emit member variable locations as data instead of expressions in blocks Drive by space optimization. Also makes the DIEs more regular which might speed up DWARF parsing. llvm-svn: 193835	2013-11-01 00:25:45 +00:00
Andrew Trick	c21d86f7ec	Unused variable llvm-svn: 193819	2013-10-31 22:42:20 +00:00
Andrew Trick	153ebe6d2a	Add support for stack map generation in the X86 backend. Originally implemented by Lang Hames. llvm-svn: 193811	2013-10-31 22:11:56 +00:00
Manman Ren	4dbdc9021d	Debug Info: remove duplication of DIEs when a DIE can be shared across CUs. We add a map in DwarfDebug to map MDNodes that are shareable across CUs to the corresponding DIEs: MDTypeNodeToDieMap. These DIEs can be shared across CUs, that is why we keep the maps in DwarfDebug instead of CompileUnit. We make the assumption that if a DIE is not added to an owner yet, we assume it belongs to the current CU. Since DIEs for the type system are added to their owners immediately after creation, and other DIEs belong to the current CU, the assumption should be true. A testing case is added to show that we only create a single DIE for a type MDNode and we use ref_addr to refer to the type DIE. We also add a testing case to show ref_addr relocations for non-darwin platforms. llvm-svn: 193779	2013-10-31 17:54:35 +00:00
Andrew Trick	74f4c749cf	Lower stackmap intrinsics directly to their target opcode in the DAG builder. llvm-svn: 193769	2013-10-31 17:18:24 +00:00
Andrew Trick	d4d1d9c06e	whitespace llvm-svn: 193765	2013-10-31 17:18:07 +00:00
Rafael Espindola	dbec9d9b2a	Remove the --shrink-wrap option. It had no tests, was unused and was "experimental at best". llvm-svn: 193749	2013-10-31 14:07:59 +00:00
Jim Grosbach	7236678687	Legalize: Improve legalization of long vector extends. When an extend more than doubles the size of the elements (e.g., a zext from v16i8 to v16i32), the normal legalization method of splitting the vectors will run into problems as by the time the destination vector is legal, the source vector is illegal. The end result is the operation often becoming scalarized, with the typical horrible performance. For example, on x86_64, the simple input of: define void @bar(<16 x i8> %a, <16 x i32>* %p) nounwind { %tmp = zext <16 x i8> %a to <16 x i32> store <16 x i32> %tmp, <16 x i32>*%p ret void } Generates: .section __TEXT,__text,regular,pure_instructions .section __TEXT,__const .align 5 LCPI0_0: .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .section __TEXT,__text,regular,pure_instructions .globl _bar .align 4, 0x90 _bar: vpunpckhbw %xmm0, %xmm0, %xmm1 vpunpckhwd %xmm0, %xmm1, %xmm2 vpmovzxwd %xmm1, %xmm1 vinsertf128 $1, %xmm2, %ymm1, %ymm1 vmovaps LCPI0_0(%rip), %ymm2 vandps %ymm2, %ymm1, %ymm1 vpmovzxbw %xmm0, %xmm3 vpunpckhwd %xmm0, %xmm3, %xmm3 vpmovzxbd %xmm0, %xmm0 vinsertf128 $1, %xmm3, %ymm0, %ymm0 vandps %ymm2, %ymm0, %ymm0 vmovaps %ymm0, (%rdi) vmovaps %ymm1, 32(%rdi) vzeroupper ret So instead we can check if there are legal types that enable us to split more cleverly when the input vector is already legal such that we don't turn it into an illegal type. If the extend is such that it's more than doubling the size of the input we check if - the number of vector elements is even, - the source type is legal, - the type of a split source is illegal, - the type of an extended (by doubling element size) source is legal, and - the type of that extended source when split is legal. If the conditions are met, instead of just splitting both the destination and the source types, we create an extend that only goes up one "step" (doubling the element width), and the continue legalizing the rest of the operation normally. The result is that this operates as a new, more effecient, termination condition for the loop of "split the operation until the destination type is legal." With this change, the above example now compiles to: _bar: vpxor %xmm1, %xmm1, %xmm1 vpunpcklbw %xmm1, %xmm0, %xmm2 vpunpckhwd %xmm1, %xmm2, %xmm3 vpunpcklwd %xmm1, %xmm2, %xmm2 vinsertf128 $1, %xmm3, %ymm2, %ymm2 vpunpckhbw %xmm1, %xmm0, %xmm0 vpunpckhwd %xmm1, %xmm0, %xmm3 vpunpcklwd %xmm1, %xmm0, %xmm0 vinsertf128 $1, %xmm3, %ymm0, %ymm0 vmovaps %ymm0, 32(%rdi) vmovaps %ymm2, (%rdi) vzeroupper ret This generalizes a custom lowering that was added a while back to the ARM backend. That lowering is no longer necessary, and is removed. The testcases for it, however, provide excellent ARM tests for this change and so remain. rdar://14735100 llvm-svn: 193727	2013-10-31 00:20:48 +00:00
Matt Arsenault	2ba54c3d90	Fix CodeGen for unaligned loads with address spaces llvm-svn: 193721	2013-10-30 23:30:05 +00:00
Rafael Espindola	6f1b2852fc	Produce .weak_def_can_be_hidden for some linkonce_odr values With this patch llvm produces a weak_def_can_be_hidden for linkonce_odr if they are also unnamed_addr or don't have their address taken. There is not a lot of documentation about .weak_def_can_be_hidden, but from the old discussion about linkonce_odr_auto_hide and the name of the directive this looks correct: these symbols can be hidden. Testing this with the ld64 in Xcode 5 linking clang reduces the number of exported symbols from 21053 to 19049. llvm-svn: 193718	2013-10-30 22:08:11 +00:00
David Blaikie	6b288cfa7a	DebugInfo: Push header handling down into CompileUnit This is a preliminary step to handling type units by abstracting over all (type or compile) units. llvm-svn: 193714	2013-10-30 20:42:41 +00:00
David Blaikie	2d4e11228b	DwarfDebug: Change Abbreviations member from pointer to reference llvm-svn: 193699	2013-10-30 17:14:24 +00:00
Juergen Ributzka	3bd686d493	Revert "SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too." Now Hexagon and SystemZ are not happy with it :-( llvm-svn: 193677	2013-10-30 06:36:19 +00:00
Juergen Ributzka	6ad05d6b95	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. The Type Legalizer recognizes that VSELECT needs to be split, because the type is to wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask type for the given target. This mask has usually the same size as the VSELECT return type (except for Intel KNL). Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. Reviewed by Nadav llvm-svn: 193676	2013-10-30 05:48:18 +00:00
Josh Magee	7245f1d85d	Reformat code with clang-format. Differential Revision: http://llvm-reviews.chandlerc.com/D2057 llvm-svn: 193672	2013-10-30 02:25:14 +00:00
Manman Ren	251a1bd215	Debug Info: code clean up. Use EmitLabelOffsetDifference for handling on darwin platform when non-darwin platforms use EmitLabelPlusOffset. Also fix a bug in EmitLabelOffsetDifference where the size is hard-coded to 4 even though Size is passed in as an argument. llvm-svn: 193660	2013-10-29 23:14:15 +00:00
Manman Ren	ce20d460e2	Debug Info: support for DW_FORM_ref_addr. To support ref_addr, we calculate the section offset of a DIE (i.e. offset of a DIE from beginning of the debug info section). The Offset field in DIE is currently CU-relative. To calculate the section offset, we add a DebugInfoOffset field in CompileUnit to store the offset of a CU from beginning of the debug info section. We set the value in DwarfUnits::computeSizeAndOffset for each CompileUnit. A helper function DIE::getCompileUnit is added to return the CU DIE that the input DIE belongs to. We also add a map CUDieMap in DwarfDebug to help finding the CU for a given CU DIE. For a cross-referenced DIE, we first find the CU DIE it belongs to with getCompileUnit, then we use CUDieMap to get the corresponding CU for the CU DIE. Adding the section offset of the CU with the CU-relative offset of a DIE gives us the seciton offset of the DIE. We correctly emit ref_addr with relocation using EmitLabelPlusOffset when doesDwarfUseRelocationsAcrossSections is true. This commit handles the emission of DW_FORM_ref_addr when we have an attribute with FORM_ref_addr. A follow-on patch will start using ref_addr when adding a DIEEntry. This commit will be tested and verified in the follow-on patch. Reviewed off-list by Eric, Thanks. llvm-svn: 193658	2013-10-29 22:57:10 +00:00
Manman Ren	f4c339e04a	Debug Info: instead of calling addToContextOwner which constructs the context after the DIE creation, we construct the context first. Ensure that we create the context before we create a type so that we can add the newly created type to the parent. Remove last use of addToContextOwner now that it's not needed. We use createAndAddDIE to wrap around "new DIE(". Now all shareable DIEs should be added to their parents right after the creation. Reviewed off-list by Eric, Thanks. llvm-svn: 193657	2013-10-29 22:49:29 +00:00
Josh Magee	3f1c0e35e6	[stackprotector] Update the StackProtector pass to perform datalayout analysis. This modifies the pass to classify every SSP-triggering AllocaInst according to an SSPLayoutKind (LargeArray, SmallArray, AddrOf). This analysis is collected by the pass and made available for use, but no other pass uses it yet. The next patch will make use of this analysis in PEI and StackSlot passes. The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D1789 llvm-svn: 193653	2013-10-29 21:16:16 +00:00
Rafael Espindola	e133ed88b5	Move getSymbol to TargetLoweringObjectFile. This allows constructing a Mangler with just a TargetMachine. llvm-svn: 193630	2013-10-29 17:28:26 +00:00
Rafael Espindola	79858aa3df	Add a helper getSymbol to AsmPrinter. llvm-svn: 193627	2013-10-29 17:07:16 +00:00
Manman Ren	f6b936bc06	Debug Info: instead of calling addToContextOwner which constructs the context after the DIE creation, we construct the context first. This touches creation of namespaces and global variables. The purpose is to handle all DIE creations similarly: constructs the context first, then creates the DIE and immediately adds the DIE to its parent. We use createAndAddDIE to wrap around "new DIE(". llvm-svn: 193589	2013-10-29 05:49:41 +00:00
Alp Toker	6a03374526	Fix "existant" typos llvm-svn: 193579	2013-10-29 02:35:28 +00:00
Manman Ren	4a841a86bd	Debug Info: use createAndAddDIE to wrap around "new DIE" in DwarfDebug. This commit ensures DIEs are constructed within a compile unit and immediately added to their parents. Reviewed off-list by Eric. llvm-svn: 193568	2013-10-29 01:03:01 +00:00
Manman Ren	73d697c641	Debug Info: use createAndAddDIE for newly-created Subprogram DIEs. More patches will be submitted to convert "new DIE(" to use createAddAndDIE in DwarfCompileUnit.cpp. This will simplify implementation of addDIEEntry where we have to decide between ref4 and ref_addr, because DIEs that can be shared across CU will be added to a CU already. Reviewed off-list by Eric. llvm-svn: 193567	2013-10-29 00:58:04 +00:00
Manman Ren	b987e517f2	Debug Info: add a helper function createAndAddDIE. It wraps around "new DIE(" and handles the bookkeeping part of the newly-created DIE. It adds the DIE to its parent, and calls insertDIE if necessary. It makes sure that bookkeeping is done at the earliest time and we should not see parentless DIEs if all constructions of DIEs go through this helper function. Later on, we can use an allocator for DIE allocation, and will only need to change createAndAddDIE instead of modifying all the "new DIE(". Reviewed off-list by Eric. llvm-svn: 193566	2013-10-29 00:53:03 +00:00
Richard Sandiford	981fdeb477	[DAGCombiner] Respect volatility when checking for aliases Making useAA() default to true for SystemZ showed that the combiner alias analysis wasn't handling volatile accesses. This hit many of the SystemZ tests, but I arbitrarily picked one for the purpose of this patch. llvm-svn: 193518	2013-10-28 12:00:00 +00:00
Richard Sandiford	39c1ce4dc1	Keep TBAA info when rewriting SelectionDAG loads and stores Most SelectionDAG code drops the TBAA info when creating a new form of a load and store (e.g. during legalization, or when converting a plain load to an extending one). This patch tries to catch all cases where the TBAA information can legitimately be carried over. The patch adds alternative forms of getLoad() and getExtLoad() that take a MachineMemOperand instead of individual fields. (The corresponding getTruncStore() already exists.) The idea is to use the MachineMemOperand forms when all fields are carried over (size, pointer info, isVolatile, isNonTemporal, alignment and TBAA info). If some adjustment is being made, e.g. to narrow the load, then we still pass the individual fields but also pass the TBAA info. llvm-svn: 193517	2013-10-28 11:17:59 +00:00
David Blaikie	8bc7db777d	DIEHash: Summary hashing of member functions llvm-svn: 193432	2013-10-25 20:04:25 +00:00
David Blaikie	65cc969f50	DIEHash: Summary hashing of nested types llvm-svn: 193427	2013-10-25 18:38:43 +00:00
Tim Northover	a564d329c2	LegalizeDAG: allow libcalls for max/min atomic operations ARM processors without ldrex/strex need to be able to make libcalls for all atomic operations, including the newer min/max versions. The alternative would probably be expanding these operations in terms of cmpxchg (as x86 does always), but in the configurations where this matters code-size tends to be paramount so the libcall is more desirable. llvm-svn: 193398	2013-10-25 09:30:20 +00:00
Nadav Rotem	d369d4bdf9	Optimize concat_vectors(X, undef) -> scalar_to_vector(X). This optimization is not SSE specific so I am moving it to DAGco. The new scalar_to_vector dag node exposed a missing pattern in the AArch64 target that I needed to add. llvm-svn: 193393	2013-10-25 06:41:18 +00:00
David Blaikie	d8c5b4e8ef	MCStreamer: Reimplement the virtual EmitRawText as a protected member, EmitRawTextImpl, to avoid string literal ambiguities Also improve the implementation of EmitRawText(Twine) so it doesn't bother using the SmallString buffer if the Twine is a simple StringRef anyway. llvm-svn: 193378	2013-10-24 22:43:10 +00:00
David Blaikie	68642d3118	DWARF emission: Remove unnecessary/redundant DIE reference code The default case at the end of the switch handles this just fine. llvm-svn: 193374	2013-10-24 22:00:44 +00:00
Eric Christopher	e34116750f	Fix name of variable in comment. llvm-svn: 193373	2013-10-24 21:54:58 +00:00
Eric Christopher	670ee0e941	Grammar. llvm-svn: 193372	2013-10-24 21:20:23 +00:00
Eric Christopher	b088d2d0bc	Update misleading comment. llvm-svn: 193371	2013-10-24 21:05:08 +00:00
David Blaikie	2aee7be871	DIEHash: Const correct and use references where non-null/non-rebound. llvm-svn: 193363	2013-10-24 18:29:03 +00:00
David Blaikie	32744412d2	DIEHash: Do not use shallow type hashing for unnamed types llvm-svn: 193361	2013-10-24 17:53:58 +00:00
David Blaikie	afcb9656c3	DIEHash: Refactor ref attribute hashing into smaller functions llvm-svn: 193360	2013-10-24 17:51:43 +00:00
David Blaikie	e568225fc3	Remove unused debug-only member variable. This may've been used at some point but the 'print' member function grew an Indent parameter that entirely shadows this parameter. llvm-svn: 193358	2013-10-24 17:10:13 +00:00
Manman Ren	ffc9a71866	Debug Info: code clean up. Since we never insert DIE for DITemplateTypeParameter to a map, there is no need to call getDIE in getOrCreateTemplateTypeParameterDIE. It is also renamed to constructTemplateTypeParameterDIE to match with other construct functions in CompileUnit. Same applies to getOrCreateTemplateValueParameterDIE. llvm-svn: 193287	2013-10-23 23:05:28 +00:00
Manman Ren	230ec864af	Debug Info: code clean up. Rename createMemberDIE to constructMemberDIE to match other construct functions in CompileUnit. llvm-svn: 193286	2013-10-23 23:00:44 +00:00
Manman Ren	57e6ff7e72	Debug Info: code clean up. Remove the unneeded return values from createMemberDIE, constructEnumTypeDIE, getOrCreateTemplateTypeParameterDIE, and getOrCreateTemplateValueParameterDIE. llvm-svn: 193285	2013-10-23 22:57:12 +00:00
Manman Ren	0cfd20b99e	Debug Info: code clean up. Unifying the argument ordering of private construct functions in CompileUnit to follow constructTypeDIE(DIE &, DIBasicType), constructTypeDIE(DIE &, DIDerivedType), constructTypeDIE(DIE &, DICompositeType), constructSubrangeDIE and constructArrayTypeDIE. llvm-svn: 193284	2013-10-23 22:52:22 +00:00
Manman Ren	b9512a7c57	Remove {} from one-line block. llvm-svn: 193276	2013-10-23 22:12:26 +00:00
Rafael Espindola	b02877416e	Reduce casting and use a fully covered switch. llvm-svn: 193272	2013-10-23 21:24:34 +00:00
Tom Stellard	8d7d4deafe	SelectionDAG: Pass along the original argument/element type in ISD::InputArg For some targets, it is useful to be able to look at the original type of an argument without having to dig through the original IR. This also fixes a bug in SelectionDAGBuilder where InputArg.PartOffset was not taking into account the offset of structure elements. Patch by: Justin Holewinski Tom Stellard: - Changed the type of ArgVT to EVT, so it can store non-simple types like v3i32. llvm-svn: 193214	2013-10-23 00:44:24 +00:00
Manman Ren	642a0acce2	Debug Info: code clean up. Remove unnecessary creation of LexicalScope in collectDeadVariables. The created LexicialScope was only used to get isAbstractScope, which should be false from the creation: "new LexicalScope(NULL, DIDescriptor(SP), NULL, false);". We can also remove a DenseMap that holds the created LexicalScopes. llvm-svn: 193196	2013-10-22 20:59:19 +00:00
David Blaikie	5ebc54d9ea	DIEHashing: Provide an assert for unreachable functionality regarding friends. Since (as of r190716) Clang no longer emits debug info for C++ friend declarations (and it seems GCC never has/does, which was the motivation for the Clang change), there's no actual reachable case for implementing the part of DWARF 4, Section 7.27 part 5 that pertains to friends. Leave an assert here so that if/when we do have a client producing friends and using type units, we can fill in the gap and add appropriate (unit and feature) tests. llvm-svn: 193193	2013-10-22 20:28:55 +00:00
David Blaikie	d70a055394	DWARF type hashing: pointers to members Includes a test case/FIXME demonstrating a bug/limitation in pointer to member hashing. To be honest I'm not sure why we don't just always use summary hashing for referenced types... but perhaps I'm missing something. llvm-svn: 193175	2013-10-22 18:14:41 +00:00
Wan Xiaofei	2f8dc08b8c	Using FoldingSet in SelectionDAG::getVTList. VTList has a long life cycle through the module and getVTList is frequently called. In current getVTList, sequential search over a std::vector is used, this is inefficient in big module. This patch use FoldingSet to implement hashing mechanism when searching. Reviewer: Nadav Rotem Test : Pass unit tests & LNT test suite llvm-svn: 193150	2013-10-22 08:02:02 +00:00
Eric Christopher	c798d8ad0a	Formatting/whitespace. llvm-svn: 193135	2013-10-22 00:22:39 +00:00
David Blaikie	fe3233a568	DWARF Type Hashing: Include reference and rvalue reference type in the declarable summary hashing path More support for 7.25 Part 5. llvm-svn: 193129	2013-10-21 23:06:19 +00:00
David Blaikie	6cf58c8980	DWARF type hashing: begin implementing Step 5, summary hashing in declarable contexts There are several other tag types that need similar handling but to ensure test coverage they'll be coming incrementally. llvm-svn: 193126	2013-10-21 22:36:50 +00:00
Matt Arsenault	bc4242114e	Remove unused TargetLowering field. llvm-svn: 193113	2013-10-21 20:04:01 +00:00
Matt Arsenault	b768912db8	Fix CodeGen for different size address space GEPs llvm-svn: 193111	2013-10-21 20:03:54 +00:00
Matt Arsenault	bbd24901cf	Reuse variable llvm-svn: 193107	2013-10-21 19:24:15 +00:00
Reid Kleckner	ad65f10d75	Fix the build in DIE.cpp with MSVC 2010 llvm-svn: 193106	2013-10-21 19:18:31 +00:00
David Blaikie	980d4994b2	DWARF type hashing: Handle multiple (including recursive) references to the same type This uses a map, keeping the type DIE numbering separate from the DIEs themselves - alternatively we could do things the way GCC does if we want to add an integer to the DIE type to record the numbering there. llvm-svn: 193105	2013-10-21 18:59:40 +00:00
Eric Christopher	691281be2f	Fix up some old review feedback. llvm-svn: 193095	2013-10-21 17:48:51 +00:00
David Blaikie	f244319cac	DebugInfo: Put each kind of constant (form, attribute, tag, etc) into its own enum for ease of use. This allows various variables to be more self-documenting and easier to debug by being of specific types without overlapping enum values. Precommit review by Eric Christopher. llvm-svn: 193091	2013-10-21 17:28:37 +00:00
David Blaikie	63bb3e1182	DebugInfo: Hash DW_FORM_GNU_str_index as a string. Found while adding type safety to the various DWARF enumerations (form, attribute, tag, etc) that caused Clang to warn on an incompletely covered switch. Converting the comment to a default/unreachable uncovered this case of an unsupported form encoding. Seems we were skipping fission strings entirely. llvm-svn: 193089	2013-10-21 16:37:22 +00:00
Peter Collingbourne	e9f45e25f9	Emit prefix data after debug and EH directives. This ensures that the prefix data is treated as part of the function for the purpose of debug info. This provides a better debugging experience, among other things by allowing a debug info client to correctly look up a function in debug info given a function pointer. llvm-svn: 193042	2013-10-20 02:16:21 +00:00
Benjamin Kramer	6ddca57327	Remove unused variable. llvm-svn: 193038	2013-10-19 16:32:15 +00:00
Eric Christopher	c2697f8390	Reformat. llvm-svn: 193024	2013-10-19 01:04:47 +00:00
Eric Christopher	8dba0d5ae9	Fix up a few minor performance problems spotted in code review. llvm-svn: 193023	2013-10-19 01:04:42 +00:00
Manman Ren	7cc6270262	Debug Info: add a newly-created DIE to a parent in the same function. With this commit, all DIEs created in CompileUnit will be added to parents inside the same function. Also make getOrCreateTemplateType\|Value functions private. No functionality change. llvm-svn: 193002	2013-10-18 21:14:19 +00:00
Manman Ren	8040bb58d3	Debug Info: simplify code a bit. llvm-svn: 193001	2013-10-18 20:52:22 +00:00
Eric Christopher	4d964a517f	Revert the rest of r192749 to bring back the buildbot. These two error messages should not be able to occur at the same time. llvm-svn: 192985	2013-10-18 16:56:48 +00:00
Bill Schmidt	3684fdd59f	[PATCH] Fix PR17168 (DAG scheduler inserts DBG_VALUE before PHI with fast-isel) PR17168 describes a test case that fails when compiling for debug with fast-isel. Investigation showed that the test was failing because a DBG_VALUE machine instruction was placed prior to a PHI. For this problem to occur requires the following: * Compile for debug * Compile with fast-isel * In a block B, fast-isel must partially succeed before punting to DAG-isel * B must start with a PHI * The first unhandled node in the DAG must not generate a machine instruction * A debug value with an order less than that of that first node exists When all of these circumstances apply, the existing test that an instruction was not inserted won't fire. Currently it tests whether the block is empty, or whether the last instruction generated is a phi. When fast-isel has partially succeeded, the last instruction generated will not be a phi. Instead, we need to check whether the current insert position is immediately following a phi. This patch adds that check, and adds the test case from the PR as a regression test. llvm-svn: 192976	2013-10-18 14:20:11 +00:00
David Majnemer	451b7dd1ef	CodeGen: Emit a libcall if the target doesn't support 16-byte wide atomics There are targets that support i128 sized scalars but cannot emit instructions that modify them directly. The proper thing to do is to emit a libcall. This fixes PR17481. llvm-svn: 192957	2013-10-18 08:03:43 +00:00
Eric Christopher	ffbc4decc2	Temporarily revert r192749 as it is causing problems for LTO and requires a more in depth change to the IR structure. llvm-svn: 192938	2013-10-18 01:57:30 +00:00
David Blaikie	01fae51fef	DIEHash: Add more things (and remove one character) from the COLLECT_ATTR macro Makes the uses more terse and requires that they use a semicolon at the end that helps editors indent proceeding lines correctly. llvm-svn: 192925	2013-10-17 22:14:08 +00:00
David Blaikie	ca353be652	DIEHash: Support for simple (non-recursive, non-reused) type references llvm-svn: 192924	2013-10-17 22:07:09 +00:00
Richard Sandiford	95f7ba988b	Replace sra with srl if a single sign bit is required E.g. (and (sra (i32 x) 31) 2) -> (and (srl (i32 x) 30) 2). llvm-svn: 192884	2013-10-17 11:16:57 +00:00
Andrea Di Biagio	561badf717	Fix edge condition in DAGCombiner to improve codegen of shift sequences. When canonicalizing dags according to the rule (shl (zext (shr X, c1) ), c1) ==> (zext (shl (shr X, c1), c1)) remember to add the new shl dag to the DAGCombiner worklist of nodes. If we don't explicitly add it to the worklist of nodes to visit, we may not trigger later on the rule that folds the shift left + logical shift right into a AND instruction with bitmask. llvm-svn: 192883	2013-10-17 11:02:58 +00:00
Eric Christopher	2c8b7907c3	According to the dwarf standard pubnames and pubtypes for languages like C++ should be the fully qualified names for the type. Add a routine that does a language specific context walk to build up the qualified name and use it when we add types/names to the tables. Expand the gnu pubnames testcase as it's the most complex to make sure that qualified types are also being added. llvm-svn: 192865	2013-10-17 02:06:06 +00:00
Jack Carter	d4e9615d1c	[projects/test-suite] White space and long line fixes. No functionality changes. llvm-svn: 192863	2013-10-17 01:34:33 +00:00
Eric Christopher	96eff3f393	Add the subprogram DIEs to the context they're created with only if they're a declaration, otherwise they're owned by the compile unit. llvm-svn: 192861	2013-10-17 01:31:12 +00:00
David Blaikie	8a142aaa01	DIEHash: Include the type's context in the type hash. llvm-svn: 192856	2013-10-17 00:10:34 +00:00
David Blaikie	6316ca45a7	DIEHash: Use DW_FORM_sdata for integers, per spec. This allows us to produce the same hash as GCC for at least some simple examples. llvm-svn: 192855	2013-10-16 23:36:20 +00:00
David Blaikie	920bb2a758	Remove ambiguity introduced in r192836 llvm-svn: 192840	2013-10-16 20:40:46 +00:00
David Blaikie	71a0ad66a9	DIEHash: Include the trailing zero byte after the children of a DIE llvm-svn: 192836	2013-10-16 20:29:06 +00:00
Andrew Trick	811a2ef96e	After PostRA scheduling, don't set kill flags on undef operands. This should fix the ATOM buildbot failing on break-avx-dep.ll. llvm-svn: 192824	2013-10-16 18:30:23 +00:00
Benjamin Kramer	00eb07b791	DAGCombiner: Don't fold xor into not if getNOT would introduce an illegal constant. This happens e.g. with <2 x i64> -1 on x86_32. It cannot be generated directly because i64 is illegal. It would be nice if getNOT would handle this transparently, but I don't see a way to generate a legal constant there right now. Fixes PR17487. llvm-svn: 192795	2013-10-16 14:16:19 +00:00
Richard Sandiford	374a0e50c4	Handle (shl (anyext (shr ...))) in SimpilfyDemandedBits This is really an extension of the current (shl (shr ...)) -> shl optimization. The main difference is that certain upper bits must also not be demanded. The motivating examples are the first two in the testcase, which occur in llvmpipe output. llvm-svn: 192783	2013-10-16 10:26:19 +00:00
Rafael Espindola	0018a59d01	Add support for metadata representing .ident directives. llvm-svn: 192764	2013-10-16 01:49:05 +00:00
Eric Christopher	d2b497b522	Fix a pair of bugs in the emission of pubname tables: 1) Make sure we emit static member variables by checking at the end of createGlobalVariableDIE rather than piecemeal in the function. (As a note, createGlobalVariableDIE needs rewriting.) 2) Make sure we use the definition rather than declaration DIE for two things: a) determining linkage for gnu pubnames, and b) as the address of the DIE for global variables. (As a note, createGlobalVariableDIE really needs rewriting.) Adjust the testcase to make sure we're checking the correct DIEs. llvm-svn: 192761	2013-10-16 01:37:49 +00:00
David Blaikie	94ded5f39e	Simplify zero initialization of DIEAttrs variable. llvm-svn: 192755	2013-10-16 00:47:21 +00:00
Eric Christopher	a6c38a32a9	Make sure we're not attempting to construct a subprogram DIE twice and just look up the value. Fix the one case where we were trying to create a subprogram DIE and we should already have had one. Reflow formatting in collectDeadVariables while fixing. llvm-svn: 192749	2013-10-15 23:31:38 +00:00
Adrian Prantl	5bf1d0093b	Remove some dead code. (DarwinGDBCompat was retired in r189903). llvm-svn: 192731	2013-10-15 20:26:37 +00:00
Pekka Jaaskelainen	eb4a6e7c28	Guard the debug temp variable with NDEBUG to avoid warning/error with NDEBUG defined. llvm-svn: 192709	2013-10-15 14:40:46 +00:00
Pekka Jaaskelainen	eb08e2e0c8	Do not assert when trying to add a meta data operand with MachineInstr::addOperand(). llvm-svn: 192707	2013-10-15 14:18:10 +00:00
Andrew Trick	3a99693c5a	Improve on r192635, ExeDepsFix for avx, and add a test case. rdar:15221834 False AVX register dependencies cause 5x slowdown on flops-5/6 and significant slowdown on several others. This was blocking the switch to MI-Sched. llvm-svn: 192669	2013-10-15 03:39:43 +00:00
Andrew Trick	b6d56be69d	Fix the ExecutionDepsFix pass to handle AVX instructions. This pass is needed to break false dependencies. Without it, unlucky register assignment can result in wild (5x) swings in performance. This pass was trying to handle AVX but not getting it right. AVX doesn't have partial register defs, it has unused register reads in which the high bits of a source operand are copied into the unused bits of the dest. Fixing this requires conservative liveness analysis. This is awkard because the pass already has its own pseudo-liveness. However, proper liveness is expensive, and we would like to use a generic utility to compute it. The fix only invokes liveness on-demand. It is rare to detect a case that needs undef-read dependence breaking, but when it happens, it can be needed many times within a very large block. I think the existing heuristic which uses a register window of 16 is too conservative for loop-carried false dependencies. If the loop is a reduction. The out-of-order engine may be able to execute several loop iterations in parallel. However, I'll leave this tuning exercise for next time. llvm-svn: 192635	2013-10-14 22:19:03 +00:00
Andrew Trick	e2f7cc4cf3	LiveRegUnits: Use *MBB for consistency and convenience. llvm-svn: 192634	2013-10-14 22:18:59 +00:00
Andrew Trick	3f4d6c6538	LiveRegUnits::removeRegsInMask safety. Clobbering is exclusive not inclusive on register units. For liveness, we need to consider all the preserved registers. e.g. A regmask that clobbers YMM0 may preserve XMM0. Units are only clobbered when all super-registers are clobbered. llvm-svn: 192623	2013-10-14 20:45:19 +00:00
Andrew Trick	276dd453f0	Use a SparseSet in LiveRegUnits. Some clients may add block live ins and may track liveness over a large scope. This guarantees an efficient implementation in all cases with no memory allocation/deallocation, independent of the number of target registers. It could be slightly less convenient but is fine in the expected case. llvm-svn: 192622	2013-10-14 20:45:17 +00:00
Andrew Trick	0aed0cfc44	Move LiveRegUnits implementation into .cpp. Comment and format. llvm-svn: 192621	2013-10-14 20:45:14 +00:00
Andrew Trick	ff3585c51c	Convert LiveRegUnits methods to the current convention (it's new code). llvm-svn: 192619	2013-10-14 20:45:09 +00:00
Manman Ren	c6b6392794	Debug Info: static member DIE creation. Clean up creation of static member DIEs. We can create static member DIEs from two places, so we call getOrCreateStaticMemberDIE from the two places. getOrCreateStaticMemberDIE will get or create the context DIE first, then it will check if the DIE already exists, if not, we create the static member DIE and add it to the context. Creation of static member DIEs are handled in a similar way as subprogram DIEs. llvm-svn: 192618	2013-10-14 20:33:57 +00:00
David Blaikie	6004dbc9fa	Fix indenting. That wasn't confusing /at all/... llvm-svn: 192617	2013-10-14 20:15:04 +00:00
Will Dietz	5cb7f4e3f2	MachineSink: Fix and tweak critical-edge breaking heuristic. Per original comment, the intention of this loop is to go ahead and break the critical edge (in order to sink this instruction) if there's reason to believe doing so might "unblock" the sinking of additional instructions that define registers used by this one. The idea is that if we have a few instructions to sink "together" breaking the edge might be worthwhile. This commit makes a few small changes to help better realize this goal: First, modify the loop to ignore registers defined by this instruction. We don't sink definitions of physical registers, and sinking an SSA definition isn't going to unblock an upstream instruction. Second, ignore uses of physical registers. Instructions that define physical registers are rejected for sinking, and so moving this one won't enable moving any defining instructions. As an added bonus, while virtual register use-def chains are generally small due to SSA goodness, iteration over the uses and definitions (used by hasOneNonDBGUse) for physical registers like EFLAGS can be rather expensive in practice. (This is the original reason for looking at this) Finally, to keep things simple continue to only consider this trick for registers that have a single use (via hasOneNonDBGUse), but to avoid spuriously breaking critical edges only do so if the definition resides in the same MBB and therefore this one directly blocks it from being sunk as well. If sinking them together is meant to be, let the iterative nature of this pass sink the definition into this block first. Update tests to accomodate this change, add new testcase where sinking avoids pipeline stalls. llvm-svn: 192608	2013-10-14 16:57:17 +00:00
Rafael Espindola	9770bde505	Remove the now unused strong phi elimination pass. llvm-svn: 192604	2013-10-14 16:39:04 +00:00
Elena Demikhovsky	82a46ebe0a	Fixed a bug in dynamic allocation memory on stack. The alignment of allocated space was wrong, see Bugzila 17345. Done by Zvi Rackover <zvi.rackover@intel.com>. llvm-svn: 192573	2013-10-14 07:26:51 +00:00
Will Dietz	ae726a93e3	TargetLowering: Don't index into empty string. (This is triggered by current lit tests) llvm-svn: 192549	2013-10-13 03:08:49 +00:00
Manman Ren	4c4b69c9c8	Debug Info: remove form from function addDIEEntry. The form must be a reference form in addDIEEntry. Which reference form to use will be decided by the callee. No functionality change. llvm-svn: 192517	2013-10-11 23:58:05 +00:00
Benjamin Kramer	a9767aed80	fConversion: Attempt #2 at fixing the MSVC build. llvm-svn: 192492	2013-10-11 19:49:09 +00:00
Benjamin Kramer	24906d9697	IfConversion: Try to unbreak the MSVC build. llvm-svn: 192487	2013-10-11 19:39:48 +00:00
Matthias Braun	d616ccc069	Remove kill flags after if conversion if necessary When if converting something like: true: ... = R0<kill> false: ... = R0<kill> then the instructions of the true block must not have a <kill> flag anymore, as the instruction of the false block follow and do still read the R0 value. Specifically this patch determines the set of register live-in in the false block (possibly after simulating the liveness changes of the duplicated instructions). Each of these live-in registers mustn't be killed. llvm-svn: 192482	2013-10-11 19:04:37 +00:00
Quentin Colombet	de0e06234c	[DAGCombiner] Reapply load slicing (192471) with a test that explicitly set sse4.2 support. This should fix the buildbots. Original commit message: [DAGCombiner] Slice a big load in two loads when the element are next to each other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> llvm-svn: 192476	2013-10-11 18:29:42 +00:00
Quentin Colombet	5aee63d9e3	[DAGCombiner] Revert load slicing (r192471), until I figure out why it fails on ubuntu. llvm-svn: 192474	2013-10-11 18:17:17 +00:00
Quentin Colombet	41dc258f71	[DAGCombiner] Slice a big load in two loads when the element are next to each other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> llvm-svn: 192471	2013-10-11 18:01:14 +00:00
Matthias Braun	b542fa514b	fix typo in comment llvm-svn: 192455	2013-10-11 15:40:14 +00:00
Justin Holewinski	660597d190	Make AsmPrinter::emitImplicitDef a virtual method so targets can emit custom comments for implicit defs For NVPTX, this fixes a crash where the emitImplicitDef implementation was expecting physical registers, while NVPTX uses virtual registers (with a couple of exceptions). Now, the implicit def comment will be emitted as a true PTX register name. Other targets can use this to customize the output of implicit def comments. Fixes PR17519 llvm-svn: 192444	2013-10-11 12:39:36 +00:00
NAKAMURA Takumi	d5d16d57eb	LiveRangeCalc.h: Update a description corresponding to r192396. [-Wdocumentation] llvm-svn: 192421	2013-10-11 04:52:03 +00:00
Matthias Braun	f6fe6bfffe	Print register in LiveInterval::print() llvm-svn: 192398	2013-10-10 21:29:05 +00:00
Matthias Braun	34e1be9451	Represent RegUnit liveness with LiveRange instance Previously LiveInterval has been used, but having a spill weight and register number is unnecessary for a register unit. llvm-svn: 192397	2013-10-10 21:29:02 +00:00
Matthias Braun	2d5c32b3b5	Work on LiveRange instead of LiveInterval where possible Also change some pointer arguments to references at some places where 0-pointers are not allowed. llvm-svn: 192396	2013-10-10 21:28:57 +00:00
Matthias Braun	364e6e9072	Change MachineVerifier to work on LiveRange + LiveInterval llvm-svn: 192395	2013-10-10 21:28:54 +00:00
Matthias Braun	88dd0abd2d	Pass LiveQueryResult by value This makes the API a bit more natural to use and makes it easier to make LiveRanges implementation details private. llvm-svn: 192394	2013-10-10 21:28:52 +00:00
Matthias Braun	d7df935bbc	Refactor LiveInterval: introduce new LiveRange class LiveRange just manages a list of segments and a list of value numbers now as LiveInterval did previously, but without having details like spill weight or a fixed register number. LiveInterval is now a subclass of LiveRange and simply adds the spill weight and the register number. llvm-svn: 192393	2013-10-10 21:28:47 +00:00
Matthias Braun	13ddb7cd65	Rename LiveRange to LiveInterval::Segment The Segment struct contains a single interval; multiple instances of this struct are used to construct a live range, but the struct is not a live range by itself. llvm-svn: 192392	2013-10-10 21:28:43 +00:00
Matthias Braun	1965bfa4c7	Rename parameter: defined regs are not incoming. llvm-svn: 192391	2013-10-10 21:28:38 +00:00
Matt Arsenault	a98c3b1816	Use getPointerSizeInBits() rather than 8 * getPointerSize() llvm-svn: 192386	2013-10-10 19:09:05 +00:00
Manman Ren	c50fa1114b	Debug Info: In DIBuilder, the context field of subprogram is updated to use DIScopeRef. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192378	2013-10-10 18:40:01 +00:00
Manman Ren	88b0f948f5	Debug Info: In DIBuilder, the context and type fields of template_type and template_value are updated to use DIRef. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192320	2013-10-09 19:46:28 +00:00
Reid Kleckner	cd4a25d66e	Explicitly request unsigned enum types when desired This fixes repeated -Wmicrosoft warnings when self-hosting clang on Windows, and gets us real unsigned enum types with MSVC. llvm-svn: 192227	2013-10-08 20:15:11 +00:00
Manman Ren	be5576f5f6	Add DbgVariable::resolve per Eric's suggestion. llvm-svn: 192218	2013-10-08 19:07:44 +00:00
Manman Ren	bda410f413	Debug Info: rename getOriginalTypeSize to getBaseTypeSize. llvm-svn: 192216	2013-10-08 18:46:58 +00:00
Manman Ren	93b3090a91	Debug Info: take advantage of the existing CU::resolve. llvm-svn: 192215	2013-10-08 18:42:58 +00:00
Eric Christopher	016be42362	Grammar. llvm-svn: 192199	2013-10-08 16:47:11 +00:00
Rafael Espindola	a17151ad5a	Add a MCTargetStreamer interface. This patch fixes an old FIXME by creating a MCTargetStreamer interface and moving the target specific functions for ARM, Mips and PPC to it. The ARM streamer is still declared in a common place because it is used from lib/CodeGen/ARMException.cpp, but the Mips and PPC are completely hidden in the corresponding Target directories. I will send an email to llvmdev with instructions on how to use this. llvm-svn: 192181	2013-10-08 13:08:17 +00:00
Richard Mitton	0aafb58aca	Formally added an explicit enum for DWARF TLS support. No functionality change. llvm-svn: 192118	2013-10-07 18:39:18 +00:00
Craig Topper	a7afa71494	Fix some assert messages to say the correct opcode name. Looks like one assert got copy and pasted to many places. llvm-svn: 192078	2013-10-06 22:38:19 +00:00
Rafael Espindola	78527050c2	Add support for aliases with linkonce_odr. This will be used to extend constructor aliases in clang. llvm-svn: 192066	2013-10-06 15:10:43 +00:00
Benjamin Kramer	7200a46c17	Emit a better error when running out of registers on inline asm. The most likely case where this error happens is when the user specifies too many register operands. Don't make it look like an internal LLVM bug when we can see that the error is coming from an inline asm instruction. For other instructions we keep the "ran out of registers" error. llvm-svn: 192041	2013-10-05 19:33:37 +00:00
Rafael Espindola	ac4ad25a00	Remove some really nasty uses of hasRawTextSupport. When MC was first added, targets could use hasRawTextSupport to keep features working before they were added to the MC interface. The design goal of MC is to provide an uniform api for printing assembly and object files. Short of relaxations and other corner cases, a object file is just another representation of the assembly. It was never the intention that targets would keep doing things like if (hasRawTextSupport()) Set flags in one way. else Set flags in another way. When they do that they create two code paths and the object file is no longer just another representation of the assembly. This also then requires testing with llc -filetype=obj, which is extremelly brittle. This patch removes some of these hacks by replacing them with smaller ones. The ARM flag setting is trivial, so I just moved it to the constructor. For Mips, the patch adds two temporary hack directives that allow the assembly to represent the same things as the object file was already able to. The hope is that the mips developers will replace the hack directives with the same ones that gas uses and drop the -print-hack-directives flag. I will also try to implement a target streamer interface, so that we can move this out of the common code. In summary, for any new work, two rules of the thumb are * Don't use "llc -filetype=obj" in tests. * Don't add calls to hasRawTextSupport. llvm-svn: 192035	2013-10-05 16:42:21 +00:00
Craig Topper	a1bbc323fa	Add OPC_CheckChildSame0-3 to the DAG isel matcher. This replaces sequences of MoveChild, CheckSame, MoveParent. Saves 846 bytes from the X86 DAG isel matcher, ~300 from ARM, ~840 from Hexagon. llvm-svn: 192026	2013-10-05 05:38:16 +00:00
Manman Ren	b3388601fb	Debug Info: In DIBuilder, the derived-from field of a DW_TAG_pointer_type is updated to use DITypeRef. Move isUnsignedDIType and getOriginalTypeSize from DebugInfo.h to be static helper functions in DwarfCompileUnit. We already have a static helper function "isTypeSigned" in DwarfCompileUnit, and a pointer to DwarfDebug is added to resolve the derived-from field. All three functions need to go across link for derived-from fields, so we need to get hold of a type identifier map. A pointer to DwarfDebug is also added to DbgVariable in order to resolve the derived-from field. Debug info verifier is updated to check a derived-from field is a TypeRef. Verifier will not go across link for derived-from fields, in debug info finder, we go across the link to add derived-from fields to types. Function getDICompositeType is only used by dragonegg and since dragonegg does not generate identifier for types, we use an empty map to resolve the derived-from field. When printing a derived-from field, we use DITypeRef::getName to either return the type identifier or getName of the DIType. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192018	2013-10-05 01:43:03 +00:00
Eric Christopher	3264a48a45	Reorganize some member variables and update a comment. llvm-svn: 192017	2013-10-05 00:39:55 +00:00
Eric Christopher	87b9c49c72	Fix one comment and update another. Slightly reformat. llvm-svn: 192016	2013-10-05 00:32:34 +00:00
Eric Christopher	9e429ae779	Add a resolve method on CompileUnit that forwards to DwarfDebug. llvm-svn: 192014	2013-10-05 00:27:02 +00:00
Adrian Prantl	f01b562a15	Debug info: Don't crash in SelectionDAGISel when a vreg that is being pointed to by a dbg_value belonging to a function argument is eliminated during instruction selection. rdar://problem/15094721. llvm-svn: 192011	2013-10-05 00:08:27 +00:00
Eric Christopher	fa205cad7c	Make a bunch of CompileUnit member functions private. llvm-svn: 192009	2013-10-05 00:05:51 +00:00
David Blaikie	93ff1eb5fb	Minor formatting/comment rewording/etc. llvm-svn: 192005	2013-10-04 23:52:02 +00:00
Eric Christopher	fe3ae44179	Remove odd use of this. llvm-svn: 192004	2013-10-04 23:49:31 +00:00
Eric Christopher	f0388b7b39	Reformat some odd formattings. llvm-svn: 192003	2013-10-04 23:49:29 +00:00
Eric Christopher	08f7c8f1fe	Tighten up some type arguments to functions. Where we expect a scope, pass a scope. llvm-svn: 192002	2013-10-04 23:49:26 +00:00
David Blaikie	41369b5f41	Remove some dead code. llvm-svn: 192000	2013-10-04 23:37:30 +00:00
David Blaikie	fac5612ab0	Simplify setting of DIE tag for type DIEs by setting it in one* place. * two actually due to some weird template thing... investigating that. llvm-svn: 191998	2013-10-04 23:21:16 +00:00
Eric Christopher	baf3816283	Prune includes. llvm-svn: 191994	2013-10-04 22:54:28 +00:00
Eric Christopher	6b8209b6b7	Use addFlag to add the enum class attribute. This has the side effect of using DW_FORM_flag_present on dwarf4 and above. llvm-svn: 191991	2013-10-04 22:40:10 +00:00
Eric Christopher	dccd32866b	Use Die->addValue and DIEIntegerOne directly when we want to add a flag. No functional change. llvm-svn: 191990	2013-10-04 22:40:05 +00:00
Hal Finkel	dbc7a8a8a3	Fix DAGCombiner::visitFP_EXTEND to ignore indexed loads DAGCombiner::visitFP_EXTEND will apply the following transformation: fold (fpext (load x)) -> (fpext (fptrunc (extload x))) but the implementation does not handle indexed loads (pre/post inc.), but did not specifically ignore them either (unlike for extending loads, which it already ignored), causing an assert when the transformation was applied to an indexed load. This is the minimal fix for correctness (causing the transformation to be skipped for indexed loads). Unfortunately, I don't have an in-tree test case. llvm-svn: 191989	2013-10-04 22:18:12 +00:00
Eric Christopher	c19d6f096c	Temporarily revert r176882 as it needs to be implemented in a different way for all platforms. llvm-svn: 191975	2013-10-04 19:40:33 +00:00
Eric Christopher	e595bae4a4	Temporarily revert r191792 as it is causing some LTO debug failures on platforms with relocations in debug info and also temporarily revert r191800 due to conflicts with the revert of r191792. llvm-svn: 191967	2013-10-04 17:08:38 +00:00
Matthias Braun	caff764739	Fix comment llvm-svn: 191966	2013-10-04 16:53:02 +00:00

... 8 9 10 11 12 ...

16460 Commits