llvm-project

Commit Graph

Author	SHA1	Message	Date
Arnaud A. de Grandmaison	f5f040fa1e	CalcSpillWeights: allow overidding the spill weight normalizing function This will enable the PBQP register allocator to provide its own normalizing function. No functionnal change. llvm-svn: 194417	2013-11-11 19:56:14 +00:00
Arnaud A. de Grandmaison	ea3ac1612c	CalcSpillWeights: give a better describing name to calculateSpillWeights Besides, this relates it more obviously to the VirtRegAuxInfo::calculateSpillWeightAndHint. No functionnal change. llvm-svn: 194404	2013-11-11 19:04:45 +00:00
Eric Christopher	aeb105f9fe	Unify the adding of enumerators with the construction of the enumeration. llvm-svn: 194401	2013-11-11 18:52:39 +00:00
Eric Christopher	98b7f17c72	Formatting. llvm-svn: 194400	2013-11-11 18:52:36 +00:00
Eric Christopher	e6c6c4d36b	80-col. llvm-svn: 194399	2013-11-11 18:52:33 +00:00
Eric Christopher	df9955dd89	Just pass the DIComposite type by value instead of by pointer. llvm-svn: 194398	2013-11-11 18:52:31 +00:00
Daniel Sanders	a1840d2f88	Vector forms of SHL, SRA, and SRL can be constant folded using SimplifyVBinOp too Reviewers: dsanders Reviewed By: dsanders CC: llvm-commits, nadav Differential Revision: http://llvm-reviews.chandlerc.com/D1958 llvm-svn: 194393	2013-11-11 17:23:41 +00:00
Arnaud A. de Grandmaison	760c1e0b0a	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was though that CalcSpillWeights does not need to be a pass. This change will enable to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functionnal change. llvm-svn: 194356	2013-11-10 17:46:31 +00:00
Matt Arsenault	c900303e2f	Use type form of getIntPtrType. This should be inconsequential and is work towards removing the default address space arguments. llvm-svn: 194347	2013-11-10 04:46:57 +00:00
Lang Hames	fb82630a91	Re-apply r194300 with fixes for warnings. llvm-svn: 194311	2013-11-09 03:08:56 +00:00
Nick Lewycky	59886d00ec	Revert r194300 which broke the build. llvm-svn: 194308	2013-11-09 02:01:25 +00:00
Juergen Ributzka	87ed906b2e	[Stackmap] Materialize the jump address within the patchpoint noop slide. This patch moves the jump address materialization inside the noop slide. This enables patching of the materialization itself or its complete removal. This patch also adds the ability to define scratch registers that can be used safely by the code called from the patchpoint intrinsic. At least one scratch register is required, because that one is used for the materialization of the jump address. This patch depends on D2009. Differential Revision: http://llvm-reviews.chandlerc.com/D2074 Reviewed by Andy llvm-svn: 194306	2013-11-09 01:51:33 +00:00
Lang Hames	1662b832d9	Rewrite the PBQP graph data structure. The new graph structure replaces the node and edge linked lists with vectors. Free lists (well, free vectors) are used for fast insertion/deletion. The ultimate aim is to make PBQP graphs cheap to clone. The motivation is that the PBQP solver destructively consumes input graphs while computing a solution, forcing the graph to be fully reconstructed for each round of PBQP. This imposes a high cost on large functions, which often require several rounds of solving/spilling to find a final register allocation. If we can cheaply clone the PBQP graph and incrementally update it between rounds then hopefully we can reduce this cost. Further, once we begin pooling matrix/vector values (future work), we can cache some PBQP solver metadata and share it between cloned graphs, allowing the PBQP solver to re-use some of the computation done in earlier rounds. For now this is just a data structure update. The allocator and solver still use the graph the same way as before, fully reconstructing it between each round. I expect no material change from this update, although it may change the iteration order of the nodes, causing ties in the solver to break in different directions, and this could perturb the generated allocations (hopefully in a completely benign way). Thanks very much to Arnaud Allard de Grandmaison for encouraging me to get back to work on this, and for a lot of discussion and many useful PBQP test cases. llvm-svn: 194300	2013-11-09 00:14:07 +00:00
Juergen Ributzka	9969d3e6e8	[Stackmap] Add AnyReg calling convention support for patchpoint intrinsic. The idea of the AnyReg Calling Convention is to provide the call arguments in registers, but not to force them to be placed in a paticular order into a specified set of registers. Instead it is up tp the register allocator to assign any register as it sees fit. The same applies to the return value (if applicable). Differential Revision: http://llvm-reviews.chandlerc.com/D2009 Reviewed by Andy llvm-svn: 194293	2013-11-08 23:28:16 +00:00
Pedro Artigas	71f87cb33a	increase the accuracy of register pressure computation in the presence of dead definitions by using live intervals, if available, to identify dead definitions and proceed accordingly. llvm-svn: 194286	2013-11-08 22:46:28 +00:00
Lang Hames	8a065703ef	Fix some minor issues with r194282 to get the tree healthy again. llvm-svn: 194284	2013-11-08 22:30:52 +00:00
Lang Hames	3078977d28	Add a method to get the object-file appropriate stack map section. Thanks to Eric Christopher for the tips on the appropriate way to do this. llvm-svn: 194282	2013-11-08 22:14:49 +00:00
Arnaud A. de Grandmaison	f7a60a8e01	Revert "CalculateSpillWeights does not need to be a pass" Temporarily revert my previous commit until I understand why it breaks 3 target tests. llvm-svn: 194272	2013-11-08 18:19:19 +00:00
Quentin Colombet	b06a0ed4b0	[VirtRegMap] Fix for PR17825. Do not ignore noreturn definitions when setting isPhysRegUsed if the unwind information is required. Indeed, the runtime may need a correct stack to be able to unwind the call. llvm-svn: 194271	2013-11-08 18:14:17 +00:00
Arnaud A. de Grandmaison	ed812f6590	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was though that CalcSpillWeights does not need to be a pass. This change will enable to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functionnal change. llvm-svn: 194269	2013-11-08 17:56:29 +00:00
Arnaud A. de Grandmaison	3b52f0b135	CalculateSpillWeights cleanup: remove unneeded includes llvm-svn: 194259	2013-11-08 15:13:05 +00:00
Andrew Trick	6664df12fb	Slightly change the way stackmap and patchpoint intrinsics are lowered. MorphNodeTo is not safe to call during DAG building. It eagerly deletes dependent DAG nodes which invalidates the NodeMap. We could expose a safe interface for morphing nodes, but I don't think it's worth it. Just create a new MachineNode and replaceAllUsesWith. My understaning of the SD design has been that we want to support early target opcode selection. That isn't very well supported, but generally works. It seems reasonable to rely on this feature even if it isn't widely used. llvm-svn: 194102	2013-11-05 22:44:04 +00:00
Eric Christopher	fedfa44922	Comment some and reformat for clarity beginFunction. llvm-svn: 193894	2013-11-01 23:14:17 +00:00
Juergen Ributzka	359c532d36	[Stackmap] Remove erroneous assert. llvm-svn: 193871	2013-11-01 17:53:27 +00:00
Rafael Espindola	716e7405d3	Remove linkonce_odr_auto_hide. linkonce_odr_auto_hide was in incomplete attempt to implement a way for the linker to hide symbols that are known to be available in every TU and whose addresses are not relevant for a particular DSO. It was redundant in that it all its uses are equivalent to linkonce_odr+unnamed_addr. Unlike those, it has never been connected to clang or llvm's optimizers, so it was effectively dead. Given that nothing produces it, this patch just nukes it (other than the llvm-c enum value). llvm-svn: 193865	2013-11-01 17:09:14 +00:00
Aaron Ballman	2b7a733b16	Commenting out this assert because it is causing the build bots to fail. This effectively reverts r193861, but needs to be fixed as part of r193769. llvm-svn: 193862	2013-11-01 15:12:23 +00:00
Aaron Ballman	96321aa523	Fixing an order of evaluation error in an assert. llvm-svn: 193861	2013-11-01 14:53:14 +00:00
David Blaikie	71d34a2eef	DebugInfo: Emit member variable locations as data instead of expressions in blocks Drive by space optimization. Also makes the DIEs more regular which might speed up DWARF parsing. llvm-svn: 193835	2013-11-01 00:25:45 +00:00
Andrew Trick	c21d86f7ec	Unused variable llvm-svn: 193819	2013-10-31 22:42:20 +00:00
Andrew Trick	153ebe6d2a	Add support for stack map generation in the X86 backend. Originally implemented by Lang Hames. llvm-svn: 193811	2013-10-31 22:11:56 +00:00
Manman Ren	4dbdc9021d	Debug Info: remove duplication of DIEs when a DIE can be shared across CUs. We add a map in DwarfDebug to map MDNodes that are shareable across CUs to the corresponding DIEs: MDTypeNodeToDieMap. These DIEs can be shared across CUs, that is why we keep the maps in DwarfDebug instead of CompileUnit. We make the assumption that if a DIE is not added to an owner yet, we assume it belongs to the current CU. Since DIEs for the type system are added to their owners immediately after creation, and other DIEs belong to the current CU, the assumption should be true. A testing case is added to show that we only create a single DIE for a type MDNode and we use ref_addr to refer to the type DIE. We also add a testing case to show ref_addr relocations for non-darwin platforms. llvm-svn: 193779	2013-10-31 17:54:35 +00:00
Andrew Trick	74f4c749cf	Lower stackmap intrinsics directly to their target opcode in the DAG builder. llvm-svn: 193769	2013-10-31 17:18:24 +00:00
Andrew Trick	d4d1d9c06e	whitespace llvm-svn: 193765	2013-10-31 17:18:07 +00:00
Rafael Espindola	dbec9d9b2a	Remove the --shrink-wrap option. It had no tests, was unused and was "experimental at best". llvm-svn: 193749	2013-10-31 14:07:59 +00:00
Jim Grosbach	7236678687	Legalize: Improve legalization of long vector extends. When an extend more than doubles the size of the elements (e.g., a zext from v16i8 to v16i32), the normal legalization method of splitting the vectors will run into problems as by the time the destination vector is legal, the source vector is illegal. The end result is the operation often becoming scalarized, with the typical horrible performance. For example, on x86_64, the simple input of: define void @bar(<16 x i8> %a, <16 x i32>* %p) nounwind { %tmp = zext <16 x i8> %a to <16 x i32> store <16 x i32> %tmp, <16 x i32>*%p ret void } Generates: .section __TEXT,__text,regular,pure_instructions .section __TEXT,__const .align 5 LCPI0_0: .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .section __TEXT,__text,regular,pure_instructions .globl _bar .align 4, 0x90 _bar: vpunpckhbw %xmm0, %xmm0, %xmm1 vpunpckhwd %xmm0, %xmm1, %xmm2 vpmovzxwd %xmm1, %xmm1 vinsertf128 $1, %xmm2, %ymm1, %ymm1 vmovaps LCPI0_0(%rip), %ymm2 vandps %ymm2, %ymm1, %ymm1 vpmovzxbw %xmm0, %xmm3 vpunpckhwd %xmm0, %xmm3, %xmm3 vpmovzxbd %xmm0, %xmm0 vinsertf128 $1, %xmm3, %ymm0, %ymm0 vandps %ymm2, %ymm0, %ymm0 vmovaps %ymm0, (%rdi) vmovaps %ymm1, 32(%rdi) vzeroupper ret So instead we can check if there are legal types that enable us to split more cleverly when the input vector is already legal such that we don't turn it into an illegal type. If the extend is such that it's more than doubling the size of the input we check if - the number of vector elements is even, - the source type is legal, - the type of a split source is illegal, - the type of an extended (by doubling element size) source is legal, and - the type of that extended source when split is legal. If the conditions are met, instead of just splitting both the destination and the source types, we create an extend that only goes up one "step" (doubling the element width), and the continue legalizing the rest of the operation normally. The result is that this operates as a new, more effecient, termination condition for the loop of "split the operation until the destination type is legal." With this change, the above example now compiles to: _bar: vpxor %xmm1, %xmm1, %xmm1 vpunpcklbw %xmm1, %xmm0, %xmm2 vpunpckhwd %xmm1, %xmm2, %xmm3 vpunpcklwd %xmm1, %xmm2, %xmm2 vinsertf128 $1, %xmm3, %ymm2, %ymm2 vpunpckhbw %xmm1, %xmm0, %xmm0 vpunpckhwd %xmm1, %xmm0, %xmm3 vpunpcklwd %xmm1, %xmm0, %xmm0 vinsertf128 $1, %xmm3, %ymm0, %ymm0 vmovaps %ymm0, 32(%rdi) vmovaps %ymm2, (%rdi) vzeroupper ret This generalizes a custom lowering that was added a while back to the ARM backend. That lowering is no longer necessary, and is removed. The testcases for it, however, provide excellent ARM tests for this change and so remain. rdar://14735100 llvm-svn: 193727	2013-10-31 00:20:48 +00:00
Matt Arsenault	2ba54c3d90	Fix CodeGen for unaligned loads with address spaces llvm-svn: 193721	2013-10-30 23:30:05 +00:00
Rafael Espindola	6f1b2852fc	Produce .weak_def_can_be_hidden for some linkonce_odr values With this patch llvm produces a weak_def_can_be_hidden for linkonce_odr if they are also unnamed_addr or don't have their address taken. There is not a lot of documentation about .weak_def_can_be_hidden, but from the old discussion about linkonce_odr_auto_hide and the name of the directive this looks correct: these symbols can be hidden. Testing this with the ld64 in Xcode 5 linking clang reduces the number of exported symbols from 21053 to 19049. llvm-svn: 193718	2013-10-30 22:08:11 +00:00
David Blaikie	6b288cfa7a	DebugInfo: Push header handling down into CompileUnit This is a preliminary step to handling type units by abstracting over all (type or compile) units. llvm-svn: 193714	2013-10-30 20:42:41 +00:00
David Blaikie	2d4e11228b	DwarfDebug: Change Abbreviations member from pointer to reference llvm-svn: 193699	2013-10-30 17:14:24 +00:00
Juergen Ributzka	3bd686d493	Revert "SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too." Now Hexagon and SystemZ are not happy with it :-( llvm-svn: 193677	2013-10-30 06:36:19 +00:00
Juergen Ributzka	6ad05d6b95	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. The Type Legalizer recognizes that VSELECT needs to be split, because the type is to wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask type for the given target. This mask has usually the same size as the VSELECT return type (except for Intel KNL). Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. Reviewed by Nadav llvm-svn: 193676	2013-10-30 05:48:18 +00:00
Josh Magee	7245f1d85d	Reformat code with clang-format. Differential Revision: http://llvm-reviews.chandlerc.com/D2057 llvm-svn: 193672	2013-10-30 02:25:14 +00:00
Manman Ren	251a1bd215	Debug Info: code clean up. Use EmitLabelOffsetDifference for handling on darwin platform when non-darwin platforms use EmitLabelPlusOffset. Also fix a bug in EmitLabelOffsetDifference where the size is hard-coded to 4 even though Size is passed in as an argument. llvm-svn: 193660	2013-10-29 23:14:15 +00:00
Manman Ren	ce20d460e2	Debug Info: support for DW_FORM_ref_addr. To support ref_addr, we calculate the section offset of a DIE (i.e. offset of a DIE from beginning of the debug info section). The Offset field in DIE is currently CU-relative. To calculate the section offset, we add a DebugInfoOffset field in CompileUnit to store the offset of a CU from beginning of the debug info section. We set the value in DwarfUnits::computeSizeAndOffset for each CompileUnit. A helper function DIE::getCompileUnit is added to return the CU DIE that the input DIE belongs to. We also add a map CUDieMap in DwarfDebug to help finding the CU for a given CU DIE. For a cross-referenced DIE, we first find the CU DIE it belongs to with getCompileUnit, then we use CUDieMap to get the corresponding CU for the CU DIE. Adding the section offset of the CU with the CU-relative offset of a DIE gives us the seciton offset of the DIE. We correctly emit ref_addr with relocation using EmitLabelPlusOffset when doesDwarfUseRelocationsAcrossSections is true. This commit handles the emission of DW_FORM_ref_addr when we have an attribute with FORM_ref_addr. A follow-on patch will start using ref_addr when adding a DIEEntry. This commit will be tested and verified in the follow-on patch. Reviewed off-list by Eric, Thanks. llvm-svn: 193658	2013-10-29 22:57:10 +00:00
Manman Ren	f4c339e04a	Debug Info: instead of calling addToContextOwner which constructs the context after the DIE creation, we construct the context first. Ensure that we create the context before we create a type so that we can add the newly created type to the parent. Remove last use of addToContextOwner now that it's not needed. We use createAndAddDIE to wrap around "new DIE(". Now all shareable DIEs should be added to their parents right after the creation. Reviewed off-list by Eric, Thanks. llvm-svn: 193657	2013-10-29 22:49:29 +00:00
Josh Magee	3f1c0e35e6	[stackprotector] Update the StackProtector pass to perform datalayout analysis. This modifies the pass to classify every SSP-triggering AllocaInst according to an SSPLayoutKind (LargeArray, SmallArray, AddrOf). This analysis is collected by the pass and made available for use, but no other pass uses it yet. The next patch will make use of this analysis in PEI and StackSlot passes. The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D1789 llvm-svn: 193653	2013-10-29 21:16:16 +00:00
Rafael Espindola	e133ed88b5	Move getSymbol to TargetLoweringObjectFile. This allows constructing a Mangler with just a TargetMachine. llvm-svn: 193630	2013-10-29 17:28:26 +00:00
Rafael Espindola	79858aa3df	Add a helper getSymbol to AsmPrinter. llvm-svn: 193627	2013-10-29 17:07:16 +00:00
Manman Ren	f6b936bc06	Debug Info: instead of calling addToContextOwner which constructs the context after the DIE creation, we construct the context first. This touches creation of namespaces and global variables. The purpose is to handle all DIE creations similarly: constructs the context first, then creates the DIE and immediately adds the DIE to its parent. We use createAndAddDIE to wrap around "new DIE(". llvm-svn: 193589	2013-10-29 05:49:41 +00:00
Alp Toker	6a03374526	Fix "existant" typos llvm-svn: 193579	2013-10-29 02:35:28 +00:00
Manman Ren	4a841a86bd	Debug Info: use createAndAddDIE to wrap around "new DIE" in DwarfDebug. This commit ensures DIEs are constructed within a compile unit and immediately added to their parents. Reviewed off-list by Eric. llvm-svn: 193568	2013-10-29 01:03:01 +00:00
Manman Ren	73d697c641	Debug Info: use createAndAddDIE for newly-created Subprogram DIEs. More patches will be submitted to convert "new DIE(" to use createAddAndDIE in DwarfCompileUnit.cpp. This will simplify implementation of addDIEEntry where we have to decide between ref4 and ref_addr, because DIEs that can be shared across CU will be added to a CU already. Reviewed off-list by Eric. llvm-svn: 193567	2013-10-29 00:58:04 +00:00
Manman Ren	b987e517f2	Debug Info: add a helper function createAndAddDIE. It wraps around "new DIE(" and handles the bookkeeping part of the newly-created DIE. It adds the DIE to its parent, and calls insertDIE if necessary. It makes sure that bookkeeping is done at the earliest time and we should not see parentless DIEs if all constructions of DIEs go through this helper function. Later on, we can use an allocator for DIE allocation, and will only need to change createAndAddDIE instead of modifying all the "new DIE(". Reviewed off-list by Eric. llvm-svn: 193566	2013-10-29 00:53:03 +00:00
Richard Sandiford	981fdeb477	[DAGCombiner] Respect volatility when checking for aliases Making useAA() default to true for SystemZ showed that the combiner alias analysis wasn't handling volatile accesses. This hit many of the SystemZ tests, but I arbitrarily picked one for the purpose of this patch. llvm-svn: 193518	2013-10-28 12:00:00 +00:00
Richard Sandiford	39c1ce4dc1	Keep TBAA info when rewriting SelectionDAG loads and stores Most SelectionDAG code drops the TBAA info when creating a new form of a load and store (e.g. during legalization, or when converting a plain load to an extending one). This patch tries to catch all cases where the TBAA information can legitimately be carried over. The patch adds alternative forms of getLoad() and getExtLoad() that take a MachineMemOperand instead of individual fields. (The corresponding getTruncStore() already exists.) The idea is to use the MachineMemOperand forms when all fields are carried over (size, pointer info, isVolatile, isNonTemporal, alignment and TBAA info). If some adjustment is being made, e.g. to narrow the load, then we still pass the individual fields but also pass the TBAA info. llvm-svn: 193517	2013-10-28 11:17:59 +00:00
David Blaikie	8bc7db777d	DIEHash: Summary hashing of member functions llvm-svn: 193432	2013-10-25 20:04:25 +00:00
David Blaikie	65cc969f50	DIEHash: Summary hashing of nested types llvm-svn: 193427	2013-10-25 18:38:43 +00:00
Tim Northover	a564d329c2	LegalizeDAG: allow libcalls for max/min atomic operations ARM processors without ldrex/strex need to be able to make libcalls for all atomic operations, including the newer min/max versions. The alternative would probably be expanding these operations in terms of cmpxchg (as x86 does always), but in the configurations where this matters code-size tends to be paramount so the libcall is more desirable. llvm-svn: 193398	2013-10-25 09:30:20 +00:00
Nadav Rotem	d369d4bdf9	Optimize concat_vectors(X, undef) -> scalar_to_vector(X). This optimization is not SSE specific so I am moving it to DAGco. The new scalar_to_vector dag node exposed a missing pattern in the AArch64 target that I needed to add. llvm-svn: 193393	2013-10-25 06:41:18 +00:00
David Blaikie	d8c5b4e8ef	MCStreamer: Reimplement the virtual EmitRawText as a protected member, EmitRawTextImpl, to avoid string literal ambiguities Also improve the implementation of EmitRawText(Twine) so it doesn't bother using the SmallString buffer if the Twine is a simple StringRef anyway. llvm-svn: 193378	2013-10-24 22:43:10 +00:00
David Blaikie	68642d3118	DWARF emission: Remove unnecessary/redundant DIE reference code The default case at the end of the switch handles this just fine. llvm-svn: 193374	2013-10-24 22:00:44 +00:00
Eric Christopher	e34116750f	Fix name of variable in comment. llvm-svn: 193373	2013-10-24 21:54:58 +00:00
Eric Christopher	670ee0e941	Grammar. llvm-svn: 193372	2013-10-24 21:20:23 +00:00
Eric Christopher	b088d2d0bc	Update misleading comment. llvm-svn: 193371	2013-10-24 21:05:08 +00:00
David Blaikie	2aee7be871	DIEHash: Const correct and use references where non-null/non-rebound. llvm-svn: 193363	2013-10-24 18:29:03 +00:00
David Blaikie	32744412d2	DIEHash: Do not use shallow type hashing for unnamed types llvm-svn: 193361	2013-10-24 17:53:58 +00:00
David Blaikie	afcb9656c3	DIEHash: Refactor ref attribute hashing into smaller functions llvm-svn: 193360	2013-10-24 17:51:43 +00:00
David Blaikie	e568225fc3	Remove unused debug-only member variable. This may've been used at some point but the 'print' member function grew an Indent parameter that entirely shadows this parameter. llvm-svn: 193358	2013-10-24 17:10:13 +00:00
Manman Ren	ffc9a71866	Debug Info: code clean up. Since we never insert DIE for DITemplateTypeParameter to a map, there is no need to call getDIE in getOrCreateTemplateTypeParameterDIE. It is also renamed to constructTemplateTypeParameterDIE to match with other construct functions in CompileUnit. Same applies to getOrCreateTemplateValueParameterDIE. llvm-svn: 193287	2013-10-23 23:05:28 +00:00
Manman Ren	230ec864af	Debug Info: code clean up. Rename createMemberDIE to constructMemberDIE to match other construct functions in CompileUnit. llvm-svn: 193286	2013-10-23 23:00:44 +00:00
Manman Ren	57e6ff7e72	Debug Info: code clean up. Remove the unneeded return values from createMemberDIE, constructEnumTypeDIE, getOrCreateTemplateTypeParameterDIE, and getOrCreateTemplateValueParameterDIE. llvm-svn: 193285	2013-10-23 22:57:12 +00:00
Manman Ren	0cfd20b99e	Debug Info: code clean up. Unifying the argument ordering of private construct functions in CompileUnit to follow constructTypeDIE(DIE &, DIBasicType), constructTypeDIE(DIE &, DIDerivedType), constructTypeDIE(DIE &, DICompositeType), constructSubrangeDIE and constructArrayTypeDIE. llvm-svn: 193284	2013-10-23 22:52:22 +00:00
Manman Ren	b9512a7c57	Remove {} from one-line block. llvm-svn: 193276	2013-10-23 22:12:26 +00:00
Rafael Espindola	b02877416e	Reduce casting and use a fully covered switch. llvm-svn: 193272	2013-10-23 21:24:34 +00:00
Tom Stellard	8d7d4deafe	SelectionDAG: Pass along the original argument/element type in ISD::InputArg For some targets, it is useful to be able to look at the original type of an argument without having to dig through the original IR. This also fixes a bug in SelectionDAGBuilder where InputArg.PartOffset was not taking into account the offset of structure elements. Patch by: Justin Holewinski Tom Stellard: - Changed the type of ArgVT to EVT, so it can store non-simple types like v3i32. llvm-svn: 193214	2013-10-23 00:44:24 +00:00
Manman Ren	642a0acce2	Debug Info: code clean up. Remove unnecessary creation of LexicalScope in collectDeadVariables. The created LexicialScope was only used to get isAbstractScope, which should be false from the creation: "new LexicalScope(NULL, DIDescriptor(SP), NULL, false);". We can also remove a DenseMap that holds the created LexicalScopes. llvm-svn: 193196	2013-10-22 20:59:19 +00:00
David Blaikie	5ebc54d9ea	DIEHashing: Provide an assert for unreachable functionality regarding friends. Since (as of r190716) Clang no longer emits debug info for C++ friend declarations (and it seems GCC never has/does, which was the motivation for the Clang change), there's no actual reachable case for implementing the part of DWARF 4, Section 7.27 part 5 that pertains to friends. Leave an assert here so that if/when we do have a client producing friends and using type units, we can fill in the gap and add appropriate (unit and feature) tests. llvm-svn: 193193	2013-10-22 20:28:55 +00:00
David Blaikie	d70a055394	DWARF type hashing: pointers to members Includes a test case/FIXME demonstrating a bug/limitation in pointer to member hashing. To be honest I'm not sure why we don't just always use summary hashing for referenced types... but perhaps I'm missing something. llvm-svn: 193175	2013-10-22 18:14:41 +00:00
Wan Xiaofei	2f8dc08b8c	Using FoldingSet in SelectionDAG::getVTList. VTList has a long life cycle through the module and getVTList is frequently called. In current getVTList, sequential search over a std::vector is used, this is inefficient in big module. This patch use FoldingSet to implement hashing mechanism when searching. Reviewer: Nadav Rotem Test : Pass unit tests & LNT test suite llvm-svn: 193150	2013-10-22 08:02:02 +00:00
Eric Christopher	c798d8ad0a	Formatting/whitespace. llvm-svn: 193135	2013-10-22 00:22:39 +00:00
David Blaikie	fe3233a568	DWARF Type Hashing: Include reference and rvalue reference type in the declarable summary hashing path More support for 7.25 Part 5. llvm-svn: 193129	2013-10-21 23:06:19 +00:00
David Blaikie	6cf58c8980	DWARF type hashing: begin implementing Step 5, summary hashing in declarable contexts There are several other tag types that need similar handling but to ensure test coverage they'll be coming incrementally. llvm-svn: 193126	2013-10-21 22:36:50 +00:00
Matt Arsenault	bc4242114e	Remove unused TargetLowering field. llvm-svn: 193113	2013-10-21 20:04:01 +00:00
Matt Arsenault	b768912db8	Fix CodeGen for different size address space GEPs llvm-svn: 193111	2013-10-21 20:03:54 +00:00
Matt Arsenault	bbd24901cf	Reuse variable llvm-svn: 193107	2013-10-21 19:24:15 +00:00
Reid Kleckner	ad65f10d75	Fix the build in DIE.cpp with MSVC 2010 llvm-svn: 193106	2013-10-21 19:18:31 +00:00
David Blaikie	980d4994b2	DWARF type hashing: Handle multiple (including recursive) references to the same type This uses a map, keeping the type DIE numbering separate from the DIEs themselves - alternatively we could do things the way GCC does if we want to add an integer to the DIE type to record the numbering there. llvm-svn: 193105	2013-10-21 18:59:40 +00:00
Eric Christopher	691281be2f	Fix up some old review feedback. llvm-svn: 193095	2013-10-21 17:48:51 +00:00
David Blaikie	f244319cac	DebugInfo: Put each kind of constant (form, attribute, tag, etc) into its own enum for ease of use. This allows various variables to be more self-documenting and easier to debug by being of specific types without overlapping enum values. Precommit review by Eric Christopher. llvm-svn: 193091	2013-10-21 17:28:37 +00:00
David Blaikie	63bb3e1182	DebugInfo: Hash DW_FORM_GNU_str_index as a string. Found while adding type safety to the various DWARF enumerations (form, attribute, tag, etc) that caused Clang to warn on an incompletely covered switch. Converting the comment to a default/unreachable uncovered this case of an unsupported form encoding. Seems we were skipping fission strings entirely. llvm-svn: 193089	2013-10-21 16:37:22 +00:00
Peter Collingbourne	e9f45e25f9	Emit prefix data after debug and EH directives. This ensures that the prefix data is treated as part of the function for the purpose of debug info. This provides a better debugging experience, among other things by allowing a debug info client to correctly look up a function in debug info given a function pointer. llvm-svn: 193042	2013-10-20 02:16:21 +00:00
Benjamin Kramer	6ddca57327	Remove unused variable. llvm-svn: 193038	2013-10-19 16:32:15 +00:00
Eric Christopher	c2697f8390	Reformat. llvm-svn: 193024	2013-10-19 01:04:47 +00:00
Eric Christopher	8dba0d5ae9	Fix up a few minor performance problems spotted in code review. llvm-svn: 193023	2013-10-19 01:04:42 +00:00
Manman Ren	7cc6270262	Debug Info: add a newly-created DIE to a parent in the same function. With this commit, all DIEs created in CompileUnit will be added to parents inside the same function. Also make getOrCreateTemplateType\|Value functions private. No functionality change. llvm-svn: 193002	2013-10-18 21:14:19 +00:00
Manman Ren	8040bb58d3	Debug Info: simplify code a bit. llvm-svn: 193001	2013-10-18 20:52:22 +00:00
Eric Christopher	4d964a517f	Revert the rest of r192749 to bring back the buildbot. These two error messages should not be able to occur at the same time. llvm-svn: 192985	2013-10-18 16:56:48 +00:00
Bill Schmidt	3684fdd59f	[PATCH] Fix PR17168 (DAG scheduler inserts DBG_VALUE before PHI with fast-isel) PR17168 describes a test case that fails when compiling for debug with fast-isel. Investigation showed that the test was failing because a DBG_VALUE machine instruction was placed prior to a PHI. For this problem to occur requires the following: * Compile for debug * Compile with fast-isel * In a block B, fast-isel must partially succeed before punting to DAG-isel * B must start with a PHI * The first unhandled node in the DAG must not generate a machine instruction * A debug value with an order less than that of that first node exists When all of these circumstances apply, the existing test that an instruction was not inserted won't fire. Currently it tests whether the block is empty, or whether the last instruction generated is a phi. When fast-isel has partially succeeded, the last instruction generated will not be a phi. Instead, we need to check whether the current insert position is immediately following a phi. This patch adds that check, and adds the test case from the PR as a regression test. llvm-svn: 192976	2013-10-18 14:20:11 +00:00
David Majnemer	451b7dd1ef	CodeGen: Emit a libcall if the target doesn't support 16-byte wide atomics There are targets that support i128 sized scalars but cannot emit instructions that modify them directly. The proper thing to do is to emit a libcall. This fixes PR17481. llvm-svn: 192957	2013-10-18 08:03:43 +00:00
Eric Christopher	ffbc4decc2	Temporarily revert r192749 as it is causing problems for LTO and requires a more in depth change to the IR structure. llvm-svn: 192938	2013-10-18 01:57:30 +00:00
David Blaikie	01fae51fef	DIEHash: Add more things (and remove one character) from the COLLECT_ATTR macro Makes the uses more terse and requires that they use a semicolon at the end that helps editors indent proceeding lines correctly. llvm-svn: 192925	2013-10-17 22:14:08 +00:00
David Blaikie	ca353be652	DIEHash: Support for simple (non-recursive, non-reused) type references llvm-svn: 192924	2013-10-17 22:07:09 +00:00
Richard Sandiford	95f7ba988b	Replace sra with srl if a single sign bit is required E.g. (and (sra (i32 x) 31) 2) -> (and (srl (i32 x) 30) 2). llvm-svn: 192884	2013-10-17 11:16:57 +00:00
Andrea Di Biagio	561badf717	Fix edge condition in DAGCombiner to improve codegen of shift sequences. When canonicalizing dags according to the rule (shl (zext (shr X, c1) ), c1) ==> (zext (shl (shr X, c1), c1)) remember to add the new shl dag to the DAGCombiner worklist of nodes. If we don't explicitly add it to the worklist of nodes to visit, we may not trigger later on the rule that folds the shift left + logical shift right into a AND instruction with bitmask. llvm-svn: 192883	2013-10-17 11:02:58 +00:00
Eric Christopher	2c8b7907c3	According to the dwarf standard pubnames and pubtypes for languages like C++ should be the fully qualified names for the type. Add a routine that does a language specific context walk to build up the qualified name and use it when we add types/names to the tables. Expand the gnu pubnames testcase as it's the most complex to make sure that qualified types are also being added. llvm-svn: 192865	2013-10-17 02:06:06 +00:00
Jack Carter	d4e9615d1c	[projects/test-suite] White space and long line fixes. No functionality changes. llvm-svn: 192863	2013-10-17 01:34:33 +00:00
Eric Christopher	96eff3f393	Add the subprogram DIEs to the context they're created with only if they're a declaration, otherwise they're owned by the compile unit. llvm-svn: 192861	2013-10-17 01:31:12 +00:00
David Blaikie	8a142aaa01	DIEHash: Include the type's context in the type hash. llvm-svn: 192856	2013-10-17 00:10:34 +00:00
David Blaikie	6316ca45a7	DIEHash: Use DW_FORM_sdata for integers, per spec. This allows us to produce the same hash as GCC for at least some simple examples. llvm-svn: 192855	2013-10-16 23:36:20 +00:00
David Blaikie	920bb2a758	Remove ambiguity introduced in r192836 llvm-svn: 192840	2013-10-16 20:40:46 +00:00
David Blaikie	71a0ad66a9	DIEHash: Include the trailing zero byte after the children of a DIE llvm-svn: 192836	2013-10-16 20:29:06 +00:00
Andrew Trick	811a2ef96e	After PostRA scheduling, don't set kill flags on undef operands. This should fix the ATOM buildbot failing on break-avx-dep.ll. llvm-svn: 192824	2013-10-16 18:30:23 +00:00
Benjamin Kramer	00eb07b791	DAGCombiner: Don't fold xor into not if getNOT would introduce an illegal constant. This happens e.g. with <2 x i64> -1 on x86_32. It cannot be generated directly because i64 is illegal. It would be nice if getNOT would handle this transparently, but I don't see a way to generate a legal constant there right now. Fixes PR17487. llvm-svn: 192795	2013-10-16 14:16:19 +00:00
Richard Sandiford	374a0e50c4	Handle (shl (anyext (shr ...))) in SimpilfyDemandedBits This is really an extension of the current (shl (shr ...)) -> shl optimization. The main difference is that certain upper bits must also not be demanded. The motivating examples are the first two in the testcase, which occur in llvmpipe output. llvm-svn: 192783	2013-10-16 10:26:19 +00:00
Rafael Espindola	0018a59d01	Add support for metadata representing .ident directives. llvm-svn: 192764	2013-10-16 01:49:05 +00:00
Eric Christopher	d2b497b522	Fix a pair of bugs in the emission of pubname tables: 1) Make sure we emit static member variables by checking at the end of createGlobalVariableDIE rather than piecemeal in the function. (As a note, createGlobalVariableDIE needs rewriting.) 2) Make sure we use the definition rather than declaration DIE for two things: a) determining linkage for gnu pubnames, and b) as the address of the DIE for global variables. (As a note, createGlobalVariableDIE really needs rewriting.) Adjust the testcase to make sure we're checking the correct DIEs. llvm-svn: 192761	2013-10-16 01:37:49 +00:00
David Blaikie	94ded5f39e	Simplify zero initialization of DIEAttrs variable. llvm-svn: 192755	2013-10-16 00:47:21 +00:00
Eric Christopher	a6c38a32a9	Make sure we're not attempting to construct a subprogram DIE twice and just look up the value. Fix the one case where we were trying to create a subprogram DIE and we should already have had one. Reflow formatting in collectDeadVariables while fixing. llvm-svn: 192749	2013-10-15 23:31:38 +00:00
Adrian Prantl	5bf1d0093b	Remove some dead code. (DarwinGDBCompat was retired in r189903). llvm-svn: 192731	2013-10-15 20:26:37 +00:00
Pekka Jaaskelainen	eb4a6e7c28	Guard the debug temp variable with NDEBUG to avoid warning/error with NDEBUG defined. llvm-svn: 192709	2013-10-15 14:40:46 +00:00
Pekka Jaaskelainen	eb08e2e0c8	Do not assert when trying to add a meta data operand with MachineInstr::addOperand(). llvm-svn: 192707	2013-10-15 14:18:10 +00:00
Andrew Trick	3a99693c5a	Improve on r192635, ExeDepsFix for avx, and add a test case. rdar:15221834 False AVX register dependencies cause 5x slowdown on flops-5/6 and significant slowdown on several others. This was blocking the switch to MI-Sched. llvm-svn: 192669	2013-10-15 03:39:43 +00:00
Andrew Trick	b6d56be69d	Fix the ExecutionDepsFix pass to handle AVX instructions. This pass is needed to break false dependencies. Without it, unlucky register assignment can result in wild (5x) swings in performance. This pass was trying to handle AVX but not getting it right. AVX doesn't have partial register defs, it has unused register reads in which the high bits of a source operand are copied into the unused bits of the dest. Fixing this requires conservative liveness analysis. This is awkard because the pass already has its own pseudo-liveness. However, proper liveness is expensive, and we would like to use a generic utility to compute it. The fix only invokes liveness on-demand. It is rare to detect a case that needs undef-read dependence breaking, but when it happens, it can be needed many times within a very large block. I think the existing heuristic which uses a register window of 16 is too conservative for loop-carried false dependencies. If the loop is a reduction. The out-of-order engine may be able to execute several loop iterations in parallel. However, I'll leave this tuning exercise for next time. llvm-svn: 192635	2013-10-14 22:19:03 +00:00
Andrew Trick	e2f7cc4cf3	LiveRegUnits: Use *MBB for consistency and convenience. llvm-svn: 192634	2013-10-14 22:18:59 +00:00
Andrew Trick	3f4d6c6538	LiveRegUnits::removeRegsInMask safety. Clobbering is exclusive not inclusive on register units. For liveness, we need to consider all the preserved registers. e.g. A regmask that clobbers YMM0 may preserve XMM0. Units are only clobbered when all super-registers are clobbered. llvm-svn: 192623	2013-10-14 20:45:19 +00:00
Andrew Trick	276dd453f0	Use a SparseSet in LiveRegUnits. Some clients may add block live ins and may track liveness over a large scope. This guarantees an efficient implementation in all cases with no memory allocation/deallocation, independent of the number of target registers. It could be slightly less convenient but is fine in the expected case. llvm-svn: 192622	2013-10-14 20:45:17 +00:00
Andrew Trick	0aed0cfc44	Move LiveRegUnits implementation into .cpp. Comment and format. llvm-svn: 192621	2013-10-14 20:45:14 +00:00
Andrew Trick	ff3585c51c	Convert LiveRegUnits methods to the current convention (it's new code). llvm-svn: 192619	2013-10-14 20:45:09 +00:00
Manman Ren	c6b6392794	Debug Info: static member DIE creation. Clean up creation of static member DIEs. We can create static member DIEs from two places, so we call getOrCreateStaticMemberDIE from the two places. getOrCreateStaticMemberDIE will get or create the context DIE first, then it will check if the DIE already exists, if not, we create the static member DIE and add it to the context. Creation of static member DIEs are handled in a similar way as subprogram DIEs. llvm-svn: 192618	2013-10-14 20:33:57 +00:00
David Blaikie	6004dbc9fa	Fix indenting. That wasn't confusing /at all/... llvm-svn: 192617	2013-10-14 20:15:04 +00:00
Will Dietz	5cb7f4e3f2	MachineSink: Fix and tweak critical-edge breaking heuristic. Per original comment, the intention of this loop is to go ahead and break the critical edge (in order to sink this instruction) if there's reason to believe doing so might "unblock" the sinking of additional instructions that define registers used by this one. The idea is that if we have a few instructions to sink "together" breaking the edge might be worthwhile. This commit makes a few small changes to help better realize this goal: First, modify the loop to ignore registers defined by this instruction. We don't sink definitions of physical registers, and sinking an SSA definition isn't going to unblock an upstream instruction. Second, ignore uses of physical registers. Instructions that define physical registers are rejected for sinking, and so moving this one won't enable moving any defining instructions. As an added bonus, while virtual register use-def chains are generally small due to SSA goodness, iteration over the uses and definitions (used by hasOneNonDBGUse) for physical registers like EFLAGS can be rather expensive in practice. (This is the original reason for looking at this) Finally, to keep things simple continue to only consider this trick for registers that have a single use (via hasOneNonDBGUse), but to avoid spuriously breaking critical edges only do so if the definition resides in the same MBB and therefore this one directly blocks it from being sunk as well. If sinking them together is meant to be, let the iterative nature of this pass sink the definition into this block first. Update tests to accomodate this change, add new testcase where sinking avoids pipeline stalls. llvm-svn: 192608	2013-10-14 16:57:17 +00:00
Rafael Espindola	9770bde505	Remove the now unused strong phi elimination pass. llvm-svn: 192604	2013-10-14 16:39:04 +00:00
Elena Demikhovsky	82a46ebe0a	Fixed a bug in dynamic allocation memory on stack. The alignment of allocated space was wrong, see Bugzila 17345. Done by Zvi Rackover <zvi.rackover@intel.com>. llvm-svn: 192573	2013-10-14 07:26:51 +00:00
Will Dietz	ae726a93e3	TargetLowering: Don't index into empty string. (This is triggered by current lit tests) llvm-svn: 192549	2013-10-13 03:08:49 +00:00
Manman Ren	4c4b69c9c8	Debug Info: remove form from function addDIEEntry. The form must be a reference form in addDIEEntry. Which reference form to use will be decided by the callee. No functionality change. llvm-svn: 192517	2013-10-11 23:58:05 +00:00
Benjamin Kramer	a9767aed80	fConversion: Attempt #2 at fixing the MSVC build. llvm-svn: 192492	2013-10-11 19:49:09 +00:00
Benjamin Kramer	24906d9697	IfConversion: Try to unbreak the MSVC build. llvm-svn: 192487	2013-10-11 19:39:48 +00:00
Matthias Braun	d616ccc069	Remove kill flags after if conversion if necessary When if converting something like: true: ... = R0<kill> false: ... = R0<kill> then the instructions of the true block must not have a <kill> flag anymore, as the instruction of the false block follow and do still read the R0 value. Specifically this patch determines the set of register live-in in the false block (possibly after simulating the liveness changes of the duplicated instructions). Each of these live-in registers mustn't be killed. llvm-svn: 192482	2013-10-11 19:04:37 +00:00
Quentin Colombet	de0e06234c	[DAGCombiner] Reapply load slicing (192471) with a test that explicitly set sse4.2 support. This should fix the buildbots. Original commit message: [DAGCombiner] Slice a big load in two loads when the element are next to each other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> llvm-svn: 192476	2013-10-11 18:29:42 +00:00
Quentin Colombet	5aee63d9e3	[DAGCombiner] Revert load slicing (r192471), until I figure out why it fails on ubuntu. llvm-svn: 192474	2013-10-11 18:17:17 +00:00
Quentin Colombet	41dc258f71	[DAGCombiner] Slice a big load in two loads when the element are next to each other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> llvm-svn: 192471	2013-10-11 18:01:14 +00:00
Matthias Braun	b542fa514b	fix typo in comment llvm-svn: 192455	2013-10-11 15:40:14 +00:00
Justin Holewinski	660597d190	Make AsmPrinter::emitImplicitDef a virtual method so targets can emit custom comments for implicit defs For NVPTX, this fixes a crash where the emitImplicitDef implementation was expecting physical registers, while NVPTX uses virtual registers (with a couple of exceptions). Now, the implicit def comment will be emitted as a true PTX register name. Other targets can use this to customize the output of implicit def comments. Fixes PR17519 llvm-svn: 192444	2013-10-11 12:39:36 +00:00
NAKAMURA Takumi	d5d16d57eb	LiveRangeCalc.h: Update a description corresponding to r192396. [-Wdocumentation] llvm-svn: 192421	2013-10-11 04:52:03 +00:00
Matthias Braun	f6fe6bfffe	Print register in LiveInterval::print() llvm-svn: 192398	2013-10-10 21:29:05 +00:00
Matthias Braun	34e1be9451	Represent RegUnit liveness with LiveRange instance Previously LiveInterval has been used, but having a spill weight and register number is unnecessary for a register unit. llvm-svn: 192397	2013-10-10 21:29:02 +00:00
Matthias Braun	2d5c32b3b5	Work on LiveRange instead of LiveInterval where possible Also change some pointer arguments to references at some places where 0-pointers are not allowed. llvm-svn: 192396	2013-10-10 21:28:57 +00:00
Matthias Braun	364e6e9072	Change MachineVerifier to work on LiveRange + LiveInterval llvm-svn: 192395	2013-10-10 21:28:54 +00:00
Matthias Braun	88dd0abd2d	Pass LiveQueryResult by value This makes the API a bit more natural to use and makes it easier to make LiveRanges implementation details private. llvm-svn: 192394	2013-10-10 21:28:52 +00:00
Matthias Braun	d7df935bbc	Refactor LiveInterval: introduce new LiveRange class LiveRange just manages a list of segments and a list of value numbers now as LiveInterval did previously, but without having details like spill weight or a fixed register number. LiveInterval is now a subclass of LiveRange and simply adds the spill weight and the register number. llvm-svn: 192393	2013-10-10 21:28:47 +00:00
Matthias Braun	13ddb7cd65	Rename LiveRange to LiveInterval::Segment The Segment struct contains a single interval; multiple instances of this struct are used to construct a live range, but the struct is not a live range by itself. llvm-svn: 192392	2013-10-10 21:28:43 +00:00
Matthias Braun	1965bfa4c7	Rename parameter: defined regs are not incoming. llvm-svn: 192391	2013-10-10 21:28:38 +00:00
Matt Arsenault	a98c3b1816	Use getPointerSizeInBits() rather than 8 * getPointerSize() llvm-svn: 192386	2013-10-10 19:09:05 +00:00
Manman Ren	c50fa1114b	Debug Info: In DIBuilder, the context field of subprogram is updated to use DIScopeRef. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192378	2013-10-10 18:40:01 +00:00
Manman Ren	88b0f948f5	Debug Info: In DIBuilder, the context and type fields of template_type and template_value are updated to use DIRef. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192320	2013-10-09 19:46:28 +00:00
Reid Kleckner	cd4a25d66e	Explicitly request unsigned enum types when desired This fixes repeated -Wmicrosoft warnings when self-hosting clang on Windows, and gets us real unsigned enum types with MSVC. llvm-svn: 192227	2013-10-08 20:15:11 +00:00
Manman Ren	be5576f5f6	Add DbgVariable::resolve per Eric's suggestion. llvm-svn: 192218	2013-10-08 19:07:44 +00:00
Manman Ren	bda410f413	Debug Info: rename getOriginalTypeSize to getBaseTypeSize. llvm-svn: 192216	2013-10-08 18:46:58 +00:00
Manman Ren	93b3090a91	Debug Info: take advantage of the existing CU::resolve. llvm-svn: 192215	2013-10-08 18:42:58 +00:00
Eric Christopher	016be42362	Grammar. llvm-svn: 192199	2013-10-08 16:47:11 +00:00
Rafael Espindola	a17151ad5a	Add a MCTargetStreamer interface. This patch fixes an old FIXME by creating a MCTargetStreamer interface and moving the target specific functions for ARM, Mips and PPC to it. The ARM streamer is still declared in a common place because it is used from lib/CodeGen/ARMException.cpp, but the Mips and PPC are completely hidden in the corresponding Target directories. I will send an email to llvmdev with instructions on how to use this. llvm-svn: 192181	2013-10-08 13:08:17 +00:00
Richard Mitton	0aafb58aca	Formally added an explicit enum for DWARF TLS support. No functionality change. llvm-svn: 192118	2013-10-07 18:39:18 +00:00
Craig Topper	a7afa71494	Fix some assert messages to say the correct opcode name. Looks like one assert got copy and pasted to many places. llvm-svn: 192078	2013-10-06 22:38:19 +00:00
Rafael Espindola	78527050c2	Add support for aliases with linkonce_odr. This will be used to extend constructor aliases in clang. llvm-svn: 192066	2013-10-06 15:10:43 +00:00
Benjamin Kramer	7200a46c17	Emit a better error when running out of registers on inline asm. The most likely case where this error happens is when the user specifies too many register operands. Don't make it look like an internal LLVM bug when we can see that the error is coming from an inline asm instruction. For other instructions we keep the "ran out of registers" error. llvm-svn: 192041	2013-10-05 19:33:37 +00:00
Rafael Espindola	ac4ad25a00	Remove some really nasty uses of hasRawTextSupport. When MC was first added, targets could use hasRawTextSupport to keep features working before they were added to the MC interface. The design goal of MC is to provide an uniform api for printing assembly and object files. Short of relaxations and other corner cases, a object file is just another representation of the assembly. It was never the intention that targets would keep doing things like if (hasRawTextSupport()) Set flags in one way. else Set flags in another way. When they do that they create two code paths and the object file is no longer just another representation of the assembly. This also then requires testing with llc -filetype=obj, which is extremelly brittle. This patch removes some of these hacks by replacing them with smaller ones. The ARM flag setting is trivial, so I just moved it to the constructor. For Mips, the patch adds two temporary hack directives that allow the assembly to represent the same things as the object file was already able to. The hope is that the mips developers will replace the hack directives with the same ones that gas uses and drop the -print-hack-directives flag. I will also try to implement a target streamer interface, so that we can move this out of the common code. In summary, for any new work, two rules of the thumb are * Don't use "llc -filetype=obj" in tests. * Don't add calls to hasRawTextSupport. llvm-svn: 192035	2013-10-05 16:42:21 +00:00
Craig Topper	a1bbc323fa	Add OPC_CheckChildSame0-3 to the DAG isel matcher. This replaces sequences of MoveChild, CheckSame, MoveParent. Saves 846 bytes from the X86 DAG isel matcher, ~300 from ARM, ~840 from Hexagon. llvm-svn: 192026	2013-10-05 05:38:16 +00:00
Manman Ren	b3388601fb	Debug Info: In DIBuilder, the derived-from field of a DW_TAG_pointer_type is updated to use DITypeRef. Move isUnsignedDIType and getOriginalTypeSize from DebugInfo.h to be static helper functions in DwarfCompileUnit. We already have a static helper function "isTypeSigned" in DwarfCompileUnit, and a pointer to DwarfDebug is added to resolve the derived-from field. All three functions need to go across link for derived-from fields, so we need to get hold of a type identifier map. A pointer to DwarfDebug is also added to DbgVariable in order to resolve the derived-from field. Debug info verifier is updated to check a derived-from field is a TypeRef. Verifier will not go across link for derived-from fields, in debug info finder, we go across the link to add derived-from fields to types. Function getDICompositeType is only used by dragonegg and since dragonegg does not generate identifier for types, we use an empty map to resolve the derived-from field. When printing a derived-from field, we use DITypeRef::getName to either return the type identifier or getName of the DIType. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192018	2013-10-05 01:43:03 +00:00
Eric Christopher	3264a48a45	Reorganize some member variables and update a comment. llvm-svn: 192017	2013-10-05 00:39:55 +00:00
Eric Christopher	87b9c49c72	Fix one comment and update another. Slightly reformat. llvm-svn: 192016	2013-10-05 00:32:34 +00:00
Eric Christopher	9e429ae779	Add a resolve method on CompileUnit that forwards to DwarfDebug. llvm-svn: 192014	2013-10-05 00:27:02 +00:00
Adrian Prantl	f01b562a15	Debug info: Don't crash in SelectionDAGISel when a vreg that is being pointed to by a dbg_value belonging to a function argument is eliminated during instruction selection. rdar://problem/15094721. llvm-svn: 192011	2013-10-05 00:08:27 +00:00
Eric Christopher	fa205cad7c	Make a bunch of CompileUnit member functions private. llvm-svn: 192009	2013-10-05 00:05:51 +00:00
David Blaikie	93ff1eb5fb	Minor formatting/comment rewording/etc. llvm-svn: 192005	2013-10-04 23:52:02 +00:00
Eric Christopher	fe3ae44179	Remove odd use of this. llvm-svn: 192004	2013-10-04 23:49:31 +00:00
Eric Christopher	f0388b7b39	Reformat some odd formattings. llvm-svn: 192003	2013-10-04 23:49:29 +00:00
Eric Christopher	08f7c8f1fe	Tighten up some type arguments to functions. Where we expect a scope, pass a scope. llvm-svn: 192002	2013-10-04 23:49:26 +00:00
David Blaikie	41369b5f41	Remove some dead code. llvm-svn: 192000	2013-10-04 23:37:30 +00:00
David Blaikie	fac5612ab0	Simplify setting of DIE tag for type DIEs by setting it in one* place. * two actually due to some weird template thing... investigating that. llvm-svn: 191998	2013-10-04 23:21:16 +00:00
Eric Christopher	baf3816283	Prune includes. llvm-svn: 191994	2013-10-04 22:54:28 +00:00
Eric Christopher	6b8209b6b7	Use addFlag to add the enum class attribute. This has the side effect of using DW_FORM_flag_present on dwarf4 and above. llvm-svn: 191991	2013-10-04 22:40:10 +00:00
Eric Christopher	dccd32866b	Use Die->addValue and DIEIntegerOne directly when we want to add a flag. No functional change. llvm-svn: 191990	2013-10-04 22:40:05 +00:00
Hal Finkel	dbc7a8a8a3	Fix DAGCombiner::visitFP_EXTEND to ignore indexed loads DAGCombiner::visitFP_EXTEND will apply the following transformation: fold (fpext (load x)) -> (fpext (fptrunc (extload x))) but the implementation does not handle indexed loads (pre/post inc.), but did not specifically ignore them either (unlike for extending loads, which it already ignored), causing an assert when the transformation was applied to an indexed load. This is the minimal fix for correctness (causing the transformation to be skipped for indexed loads). Unfortunately, I don't have an in-tree test case. llvm-svn: 191989	2013-10-04 22:18:12 +00:00
Eric Christopher	c19d6f096c	Temporarily revert r176882 as it needs to be implemented in a different way for all platforms. llvm-svn: 191975	2013-10-04 19:40:33 +00:00
Eric Christopher	e595bae4a4	Temporarily revert r191792 as it is causing some LTO debug failures on platforms with relocations in debug info and also temporarily revert r191800 due to conflicts with the revert of r191792. llvm-svn: 191967	2013-10-04 17:08:38 +00:00
Matthias Braun	caff764739	Fix comment llvm-svn: 191966	2013-10-04 16:53:02 +00:00
Matthias Braun	6a57acf44a	Fix indentation llvm-svn: 191965	2013-10-04 16:53:00 +00:00
Matthias Braun	c9d5c0f21d	Fix typo llvm-svn: 191964	2013-10-04 16:52:58 +00:00
Craig Topper	d9a6cc031d	Revert r191940 to see if it fixes the build bots. llvm-svn: 191941	2013-10-04 05:52:17 +00:00
Craig Topper	a2efe9ebc6	Add OPC_CheckChildSame0-3 to the DAG isel matcher. This replaces sequences of MoveChild, CheckSame, MoveParent. Saves 846 bytes from the X86 DAG isel matcher, ~300 from ARM, ~840 from Hexagon. llvm-svn: 191940	2013-10-04 05:22:20 +00:00
David Blaikie	309ffe4016	DebugInfo: Fix ordering of members after r191928 In the case (shown in the attached test) where a member function definition was emitted into debug info the following could occur: 1) build the debug info for the member function definition 2) in (1), build the debug info for the member function declaration 3) construct and add the member function declaration DIE 4) add it to its context 5) build its context (the type it is a member of) 6) construct the members and add them to the type 7) except don't add member functions because "getOrCreateSubprogram" adds the function to its parent anyway 8) except we're only partway through building this subprogram declaration so it hasn't been added yet - but we returned the partially constructed DIE (since it's already in the MDNode->DIE mapping to avoid infinitely recursing trying to create the member function DIE) 9) once the type is constructed, add the member function to it 10) now the members are out of order (the member function being defined is listed as the last member, even though it was declared as the first) To avoid this, construct the context of the subprogram DIE before we query to see if it exists. That way we never end up creating it before creating its context and ending up in this situation. Alternatively, the type construction that visits/builds all the members could call something like getOrCreateSubprogram, but that doesn't ever do the "add to context" step. Then the type building code would always be responsible for adding members (and the subprogram "addToContextDIE" would no-op because the context building would have added the subprogram declaration to the type/context DIE already). (the test cases updated were overly-sensitive to offsets or abbreviation numbers. We don't have a nice way to make these tests more robust as yet - multiline FileCheck matches would be required) llvm-svn: 191939	2013-10-04 01:39:59 +00:00
Richard Mitton	c250824772	Fixed a bug with section names containing special characters. Changed the dwarf aranges code to not use getLabelEndName, as it turns out it's not reliable to call that given user-defined section names. Section names can have characters in that aren't representable as symbol names. The dwarf-aranges test case has been updated to include a special character, to check this. This fixes pr17416. llvm-svn: 191932	2013-10-03 22:07:08 +00:00
David Blaikie	811bfe6395	DebugInfo: Avoid redundantly adding child DIEs to parents. DIE::addChild had a shortcircuit that silently no-op'd when a child was readded to the same parent. This hid some quirky/redundant code in DwarfDebug/CompileUnit. By removing that functionality and replacing it with an assert I was able to find and cleanup those cases, mostly centering around adding members to types in various circumstances. 1) The original oddity I noticed while working on type units (which actually was helping me in the short term, by accident) was the addToContextOwner call in constructTypeDIE. This call was completely bogus (why was it only done for non-virtual types? what relevance does that have at all) and redundant with the more uniform addToContextOwner made in getOrCreateTypeDIE. 2) If a member function definition was visited (createSubprogramDIE), it would attempt to build the member function declaration. The declaration DIE would then be added to its context, but in building the context (the type for which this function is a member) the members of the type would be added to the type automatically, so by the time the context was constructed, the member function was already associated with it. 3) The same as (2) but without the member function being constructed first. Whenever a type was constructed, the members would be created and member functions would be created by getOrCreateSubprogramDIE - this would lead to the subprogram being added to the (incomplete) type already, then the general member-construction code would add it again. llvm-svn: 191928	2013-10-03 20:07:20 +00:00
Matt Arsenault	40dddd7147	Rename DataLayout variables TD -> DL llvm-svn: 191927	2013-10-03 19:50:01 +00:00
Eric Christopher	c948b9df23	Make sure we emit a section for pubnames even if that section is going to be empty. This is particularly important for the gnu pubnames case since we're emitting a relocation to the section. llvm-svn: 191915	2013-10-03 17:41:20 +00:00
Eric Christopher	f976c77ed7	Fix cut and paste typo. llvm-svn: 191914	2013-10-03 17:41:16 +00:00
Jin-Gu Kang	0bf8241d4b	Added checking code whehter target supports specific dag combining about rotate or not. The corresponding dag patterns are as following: "DAGCombier::MatchRotate" function in DAGCombiner.cpp Pattern1 // fold (or (shl (ext x), (ext y)), // (srl (ext x), (ext (sub 32, y)))) -> // (ext (rotl x, y)) // fold (or (shl (ext x), (ext y)), // (srl (ext x), (ext (sub 32, y)))) -> // (ext (rotr x, (sub 32, y))) pattern2 // fold (or (shl (ext x), (ext (sub 32, y))), // (srl (ext x), (ext y))) -> // (ext (rotl x, y)) // fold (or (shl (ext x), (ext (sub 32, y))), // (srl (ext x), (ext y))) -> // (ext (rotr x, (sub 32, y))) llvm-svn: 191905	2013-10-03 15:58:48 +00:00
Alexey Samsonov	4436bf03e9	Remove wild .debug_aranges entries generated from unimportant labels r191052 added emitting .debug_aranges to Clang, but this functionality is broken: it uses all MC labels added in DWARF Asm printer, including the labels for build relocations between different DWARF sections, like .Lsection_line or .Ldebug_loc0. As a result, if any DIE .debug_info would contain "DW_AT_location=0x123" attribute, .debug_aranges would also contain a range starting from 0x123, breaking tools that rely on this section. This patch fixes this by using only MC labels that corresponds to the addresses in the user program. llvm-svn: 191884	2013-10-03 08:54:43 +00:00
Chandler Carruth	ea56494625	Remove the very substantial, largely unmaintained legacy PGO infrastructure. This was essentially work toward PGO based on a design that had several flaws, partially dating from a time when LLVM had a different architecture, and with an effort to modernize it abandoned without being completed. Since then, it has bitrotted for several years further. The result is nearly unusable, and isn't helping any of the modern PGO efforts. Instead, it is getting in the way, adding confusion about PGO in LLVM and distracting everyone with maintenance on essentially dead code. Removing it paves the way for modern efforts around PGO. Among other effects, this removes the last of the runtime libraries from LLVM. Those are being developed in the separate 'compiler-rt' project now, with somewhat different licensing specifically more approriate for runtimes. llvm-svn: 191835	2013-10-02 15:42:23 +00:00
Manman Ren	9a0a67035e	Debug Info: In DIBuilder, the derived-from field of a DW_TAG_pointer_type is updated to use DITypeRef. Move isUnsignedDIType and getOriginalTypeSize from DebugInfo.h to be static helper functions in DwarfCompileUnit. We already have a static helper function "isTypeSigned" in DwarfCompileUnit, and a pointer to DwarfDebug is added to resolve the derived-from field. All three functions need to go across link for derived-from fields, so we need to get hold of a type identifier map. A pointer to DwarfDebug is also added to DbgVariable in order to resolve the derived-from field. Debug info verifier is updated to check a derived-from field is a TypeRef. Verifier will not go across link for derived-from fields, in debug info finder, we go across the link to add derived-from fields to types. Function getDICompositeType is only used by dragonegg and since dragonegg does not generate identifier for types, we use an empty map to resolve the derived-from field. When printing a derived-from field, we use DITypeRef::getName to either return the type identifier or getName of the DIType. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 191800	2013-10-01 23:45:54 +00:00
Manman Ren	8990d7ee84	Debug Info: remove duplication of DIEs when a DIE is part of the type system and it is shared across CUs. We add a few maps in DwarfDebug to map MDNodes for the type system to the corresponding DIEs: MDTypeNodeToDieMap, MDSPNodeToDieMap, and MDStaticMemberNodeToDieMap. These DIEs can be shared across CUs, that is why we keep the maps in DwarfDebug instead of CompileUnit. Sometimes, when we try to add an attribute to a DIE, the DIE is not yet added to its owner yet, so we don't know whether we should use ref_addr or ref4. We create a worklist that will be processed during finalization to add attributes with the correct form (ref_addr or ref4). We add addDIEEntry to DwarfDebug to be a wrapper around DIE->addValue. It checks whether we know the correct form, if not, we update the worklist (DIEEntryWorklist). A testing case is added to show that we only create a single DIE for a type MDNode and we use ref_addr to refer to the type DIE. llvm-svn: 191792	2013-10-01 19:52:23 +00:00
Rafael Espindola	44fee4e0eb	Remove several unused variables. Patch by Alp Toker. llvm-svn: 191757	2013-10-01 13:32:03 +00:00
Tom Stellard	6aada32dc4	SelectionDAG: Clarify comments from r191600 llvm-svn: 191724	2013-10-01 02:09:00 +00:00
Eric Christopher	9a08f9e561	Add the DW_AT_GNU_ranges_base attribute if we've emitted any ranges into the debug_ranges section. llvm-svn: 191721	2013-10-01 00:43:36 +00:00
Eric Christopher	1d06eb5d86	Update comments. llvm-svn: 191720	2013-10-01 00:43:31 +00:00
Eric Christopher	39eebfada6	The DW_AT_GNU_pubnames/pubtypes attributes are actually form SEC_OFFSET from the beginning of the section so go ahead and emit a label at the beginning of each one. llvm-svn: 191710	2013-09-30 23:14:16 +00:00
Arnold Schwaighofer	d2f96b91ca	IfConverter: Use TargetSchedule for instruction latencies For targets that have instruction itineraries this means no change. Targets that move over to the new schedule model will use be able the new schedule module for instruction latencies in the if-converter (the logic is such that if there is no itineary we will use the new sched model for the latencies). Before, we queried "TTI->getInstructionLatency()" for the instruction latency and the extra prediction cost. Now, we query the TargetSchedule abstraction for the instruction latency and TargetInstrInfo for the extra predictation cost. The TargetSchedule abstraction will internally call "TTI->getInstructionLatency" if an itinerary exists, otherwise it will use the new schedule model. ATTENTION: Out of tree targets! (I will also send out an email later to LLVMDev) This means, if your target implements unsigned getInstrLatency(const InstrItineraryData ItinData, const MachineInstr MI, unsigned PredCost); and returns a value for "PredCost", you now also need to implement unsigned getPredictationCost(const MachineInstr MI); (if your target uses the IfConversion.cpp pass) radar://15077010 llvm-svn: 191671	2013-09-30 15:28:56 +00:00
Benjamin Kramer	c3c807b3bf	Allocate AtomicSDNode operands in SelectionDAG's allocator to stop leakage. SDNode destructors are never called. As an optimization use AtomicSDNode's internal storage if we have a small number of operands. llvm-svn: 191636	2013-09-29 11:18:56 +00:00
Robert Wilhelm	f0cfb83bb4	Fix spelling intruction -> instruction. llvm-svn: 191610	2013-09-28 11:46:15 +00:00
Tom Stellard	45015d9796	SelectionDAG: Silence unused variable warning on release builds llvm-svn: 191604	2013-09-28 03:10:17 +00:00
Tom Stellard	5694d3090a	SelectionDAG: Improve legalization of SELECT_CC with illegal condition codes SelectionDAG will now attempt to inverse an illegal conditon in order to find a legal one and if that doesn't work, it will attempt to swap the operands using the inverted condition. There are no new test cases for this, but a nubmer of the existing R600 tests hit this path. llvm-svn: 191602	2013-09-28 02:50:43 +00:00
Tom Stellard	cd42818d86	SelectionDAG: Try to expand all condition codes using getCCSwappedOperands() This is useful for targets like R600, which only support GT, GE, NE, and EQ condition codes as it removes the need to handle unsupported condition codes in target specific code. There are no tests with this commit, but R600 has been updated to take advantage of this new feature, so its existing selectcc tests are now testing the swapped operands path. llvm-svn: 191601	2013-09-28 02:50:38 +00:00
Tom Stellard	08690a146f	SelectionDAG: Clean up LegalizeSetCCCondCode() function Interpreting the results of this function is not very intuitive, so I cleaned it up to make it more clear whether or not a SETCC op was legalized and how it was legalized (either by swapping LHS and RHS or replacing with AND/OR). This patch does change functionality in the LHS and RHS swapping case, but unfortunately there are no in-tree tests for this. However, this patch is a prerequisite for R600 to take advantage of the LHS and RHS swapping, so tests will be added in subsequent commits. llvm-svn: 191600	2013-09-28 02:50:32 +00:00
Eric Christopher	a51d3fc721	Unify conditionals and reformat. llvm-svn: 191582	2013-09-27 22:50:48 +00:00
Josh Magee	8ecfb52388	[stackprotector] Refactor the StackProtector pass from a single .cpp file into StackProtector.h and StackProtector.cpp. No functionality change. Future patches will add analysis which will be used in other passes (PEI, StackSlot). The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D1521 llvm-svn: 191570	2013-09-27 21:58:43 +00:00
Andrea Di Biagio	56ce9c4e78	Re-apply the change from r191393 with fix for pr17380. This change fixes the problem reported in pr17380 and re-add the dagcombine transformation ensuring that the value types are always legal if the transformation is triggered after Legalization took place. Added the test case from pr17380. llvm-svn: 191509	2013-09-27 11:37:05 +00:00
Andrea Di Biagio	549d6605a0	Revert r191393 since it caused pr17380. llvm-svn: 191438	2013-09-26 16:54:01 +00:00
Venkatraman Govindaraju	4c0cdd734c	[Sparc] Implements exception handling in SPARC with DwarfCFI. llvm-svn: 191432	2013-09-26 15:11:00 +00:00
Venkatraman Govindaraju	3816d43a9a	Implements parsing and emitting of .cfi_window_save in MC. llvm-svn: 191431	2013-09-26 14:49:40 +00:00
Amara Emerson	b4ad2f396a	[ARM] Use the load-acquire/store-release instructions optimally in AArch32. Patch by Artyom Skrobov. llvm-svn: 191428	2013-09-26 12:22:36 +00:00
Andrew Trick	71e8bb6d1d	Added temp flag -misched-bench for staging in default changes. llvm-svn: 191423	2013-09-26 05:53:35 +00:00
Andrew Trick	6f5aad7a24	whitespace llvm-svn: 191422	2013-09-26 05:53:31 +00:00
Andrea Di Biagio	9f3313109f	Teach DAGCombiner how to canonicalize dags according to the rule (shl (zext (shr A, X)), X) => (zext (shl (shr A, X), X)). The rule only triggers when there are no other uses of the zext to avoid materializing more instructions. This helps the DAGCombiner understand that the shl/shr sequence can then be converted into an and instruction. llvm-svn: 191393	2013-09-25 19:01:01 +00:00
Andrew Trick	b6854d80e3	Mark the x86 machine model as incomplete. PR17367. Ideally, the machinel model is added at the time the instructions are defined. But many instructions in X86InstrSSE.td still need a model. Without this workaround the scheduler asserts because x86 already has itinerary classes for these instructions, indicating they should be modeled by the scheduler. Since we use the new machine model for other instructions, it expects a new machine model for these too. llvm-svn: 191391	2013-09-25 18:14:12 +00:00
Quentin Colombet	fa403ab3fb	[PR16882] Ignore noreturn definitions when setting isPhysRegUsed. PEI inserts a save/restore sequence for the link register, according to the information it gets from the MachineRegisterInfo. MachineRegisterInfo is populated by the VirtRegMap pass. This pass was not aware of noreturn calls and was registering the definitions of these calls the same way as regular operations. Modify VirtRegPass so that it does not set the isPhysRegUsed information for registers only defined by noreturn calls. The rational is that a noreturn call is the "last instruction" of the program (if it returns the behavior is undefined), so everything that is defined by it cannot be used and will not interfere with anything else. Therefore, it is pointless to account for then. llvm-svn: 191349	2013-09-25 00:26:17 +00:00
Eli Friedman	a961d694e2	Add missing check to SETCC optimization. PR17338. llvm-svn: 191337	2013-09-24 22:50:14 +00:00
Andrew Trick	dc4c1adfc7	Comment typo. llvm-svn: 191312	2013-09-24 17:11:19 +00:00
Benjamin Kramer	64bdb29a83	DAGCombiner: Unify rotate matching for extended and unextended amounts. No functionality change, lots of indentation changes. llvm-svn: 191303	2013-09-24 14:21:28 +00:00
Jiangning Liu	63dc840fc5	Initial support for Neon scalar instructions. Patch by Ana Pazos. 1.Added support for v1ix and v1fx types. 2.Added Scalar Pairwise Reduce instructions. 3.Added initial implementation of Scalar Arithmetic instructions. llvm-svn: 191263	2013-09-24 02:47:27 +00:00
Michael Gottesman	5e3600c1ce	[stackprotector] Allow for copies from vreg -> vreg to be in a terminator sequence. Sometimes a copy from a vreg -> vreg sneaks into the middle of a terminator sequence. It is safe to slice this into the stack protector success bb. This fixes PR16979. llvm-svn: 191260	2013-09-24 01:50:26 +00:00
Eric Christopher	55364d71d0	Add namespaces to the list of items that we expose via pubnames. llvm-svn: 191257	2013-09-24 00:17:57 +00:00
Eric Christopher	6d0f1e683a	Add more external types to the pubtypes table. Expand the asm checking patch until we get full dumping support. llvm-svn: 191239	2013-09-23 23:15:58 +00:00
Eric Christopher	ccac5c4bf9	Rename IsStatic variable to Linkage in order to be a bit more descriptive. llvm-svn: 191236	2013-09-23 22:59:14 +00:00
Eric Christopher	b0fc0b9a7b	Formatting. llvm-svn: 191235	2013-09-23 22:59:11 +00:00
Bill Wendling	8faa30ef4b	Reformat code with clang-format. llvm-svn: 191226	2013-09-23 20:57:47 +00:00
Eric Christopher	261d234302	Handle gnu pubtypes sections: a) Make sure we are emitting the correct section in our section labels when we begin the module. b) Make sure we are emitting the correct pubtypes section in the presence of gnu pubtypes. c) For C++ struct, union, class, and enumeration types are default external. llvm-svn: 191225	2013-09-23 20:55:35 +00:00
Kay Tiong Khoo	9195a5b081	fix typo: than -> then llvm-svn: 191214	2013-09-23 18:43:51 +00:00
Richard Mitton	089ed89e76	Fixed debug_aranges handling for common symbols. The size of common symbols is now tracked correctly, so they can be listed in the arange section without needing knowledge of other following symbols. .comm (and .lcomm) do not indicate to the system assembler any particular section to use, so we have to treat them as having no section. Test case update to account for this. llvm-svn: 191210	2013-09-23 17:56:20 +00:00
Benjamin Kramer	8817cca5ce	Provide basic type safety for array_pod_sort comparators. This makes using array_pod_sort significantly safer. The implementation relies on function pointer casting but that should be safe as we're dealing with void* here. llvm-svn: 191175	2013-09-22 14:09:50 +00:00
Tim Northover	31d093c705	ISelDAG: spot chain cycles involving MachineNodes Previously, the DAGISel function WalkChainUsers was spotting that it had entered already-selected territory by whether a node was a MachineNode (amongst other things). Since it's fairly common practice to insert MachineNodes during ISelLowering, this was not the correct check. Looking around, it seems that other nodes get their NodeId set to -1 upon selection, so this makes sure the same thing happens to all MachineNodes and uses that characteristic to determine whether we should stop looking for a loop during selection. This should fix PR15840. llvm-svn: 191165	2013-09-22 08:21:56 +00:00
Juergen Ributzka	f043a65327	Revert "SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too." This reverts commit r191130. llvm-svn: 191138	2013-09-21 15:09:46 +00:00
Juergen Ributzka	e9a80fc912	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. The Type Legalizer recognizes that VSELECT needs to be split, because the type is to wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask for the given target. This mask has usually te same size as the VSELECT return type (except for Intel KNL). Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. llvm-svn: 191130	2013-09-21 04:55:18 +00:00
Eric Christopher	9cd26af8b6	Move emission of the debug string table to early in the debug info finalization to greatly reduce the number of fixups that the assembler has to handle in order to improve compile time. llvm-svn: 191119	2013-09-20 23:22:52 +00:00
Eric Christopher	9c58f317da	Migrate addGlobalName to the .cpp file as an intermediate step to further work. llvm-svn: 191113	2013-09-20 22:20:55 +00:00
Andrew Trick	978674b2bc	Allow subtarget selection of the default MachineScheduler and document the interface. The global registry is used to allow command line override of the scheduler selection, but does not work well as the normal selection API. For example, the same LLVM process should be able to target multiple targets or subtargets. llvm-svn: 191071	2013-09-20 05:14:41 +00:00
David Blaikie	efd0bcb70f	DebugInfo: GDBIndexEntryString conversion functions now return const char for easy llvm::formating This was previously invoking UB by passing a user-defined type to format. Thanks to Jordan Rose for pointing this out. llvm-svn: 191060	2013-09-20 00:33:15 +00:00
David Blaikie	9d117ab7ef	Add braces to suppress Clang's dangling-else warning. These violations were introduced in r191049 llvm-svn: 191059	2013-09-20 00:33:11 +00:00
Richard Mitton	21101b3231	Added support for generate DWARF .debug_aranges sections automatically. llvm-svn: 191052	2013-09-19 23:21:01 +00:00
Andrew Trick	665d3ec3d3	Rename ConvergingScheduler to GenericScheduler. This was an experimental scheduler a year ago. It's now used by several subtargets, both in-order and out-of-order, and it is about to be enabled by default for x86 and armv7. It will be the new GenericScheduler for subtargets that don't provide their own SchedulingStrategy. llvm-svn: 191051	2013-09-19 23:10:59 +00:00
David Blaikie	404d3047c0	DebugInfo: llvm-dwarfdump support for gnu_pubnames section llvm-svn: 191050	2013-09-19 23:01:29 +00:00
Kai Nacke	d09bb4614b	PR16726: extend rol/ror matching C-like languages promote types like unsigned short to unsigned int before performing an arithmetic operation. Currently the rotate matcher in the DAGCombiner does not consider this situation. This commit extends the DAGCombiner in the way that the pattern (or (shl ([az]ext x), (ext y)), (srl ([az]ext x), (ext (sub 32, y)))) is folded into ([az]ext (rotl x, y)) The matching is restricted to aext and zext because in this cases the upper bits are either undefined or known. Test case is included. This fixes PR16726. llvm-svn: 191049	2013-09-19 23:00:28 +00:00
Kai Nacke	2d967b2751	Revert PR16726: extend rol/ror matching There is a buildbot failure. Need to investigate this. llvm-svn: 191048	2013-09-19 22:53:36 +00:00
Kai Nacke	4eaf6444fa	PR16726: extend rol/ror matching C-like languages promote types like unsigned short to unsigned int before performing an arithmetic operation. Currently the rotate matcher in the DAGCombiner does not consider this situation. This commit extends the DAGCombiner in the way that the pattern (or (shl ([az]ext x), (ext y)), (srl ([az]ext x), (ext (sub 32, y)))) is folded into ([az]ext (rotl x, y)) The matching is restricted to aext and zext because in this cases the upper bits are either undefined or known. Test case is included. This fixes PR16726. llvm-svn: 191045	2013-09-19 22:36:39 +00:00
David Blaikie	d0a869d0bf	DebugInfo: Improve IR annotation comments for GNU pubthings. llvm-svn: 191043	2013-09-19 22:19:37 +00:00
David Blaikie	8dec407649	Unshift the GDB index/GNU pubnames constants modified in r191025 Based on code review feedback from Eric Christopher, unshifting these constants as they can appear in the gdb_index itself, shifted a further 24 bits. This means that keeping them preshifted is a bit inflexible, so let's not do that. Given the motivation, wrap up some nicer enums, more type safety, and some utility functions. llvm-svn: 191035	2013-09-19 20:40:26 +00:00
David Blaikie	b20db58a4d	DebugInfo: Simplify gnu_pubnames index computation. Names open to bikeshedding. Could switch back to the constants being unshifted, but this way seems a bit easier to work with. llvm-svn: 191025	2013-09-19 18:39:59 +00:00
David Blaikie	70a3320244	Remove unnecessary conditional operators performing bool->bool conversion. llvm-svn: 191020	2013-09-19 17:33:35 +00:00
David Blaikie	0f5ad28a9d	Fix a typo and simplify a boolean expression. llvm-svn: 191018	2013-09-19 17:27:48 +00:00
Benjamin Kramer	d443e4a080	DAGCombiner: Don't fold vector muls with constants that look like a splat of a power of 2 but differ in bit width. PR17283. llvm-svn: 191000	2013-09-19 13:28:20 +00:00
Adrian Prantl	262bcf4584	Debug info: Get rid of the VLA indirection hack in FastISel. Use the DIVariable::isIndirect() flag set by the frontend instead of guessing whether to set the machine location's indirection bit. Paired commit with CFE. llvm-svn: 190961	2013-09-18 22:08:59 +00:00
Arnold Schwaighofer	cae8735a54	Costmodel: Add support for horizontal vector reductions Upcoming SLP vectorization improvements will want to be able to estimate costs of horizontal reductions. Add infrastructure to support this. We model reductions as a series of (shufflevector,add) tuples ultimately followed by an extractelement. For example, for an add-reduction of <4 x float> we could generate the following sequence: (v0, v1, v2, v3) \ \ / / \ \ / + + (v0+v2, v1+v3, undef, undef) \ / ((v0+v2) + (v1+v3), undef, undef) %rdx.shuf = shufflevector <4 x float> %rdx, <4 x float> undef, <4 x i32> <i32 2, i32 3, i32 undef, i32 undef> %bin.rdx = fadd <4 x float> %rdx, %rdx.shuf %rdx.shuf7 = shufflevector <4 x float> %bin.rdx, <4 x float> undef, <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef> %bin.rdx8 = fadd <4 x float> %bin.rdx, %rdx.shuf7 %r = extractelement <4 x float> %bin.rdx8, i32 0 This commit adds a cost model interface "getReductionCost(Opcode, Ty, Pairwise)" that will allow clients to ask for the cost of such a reduction (as backends might generate more efficient code than the cost of the individual instructions summed up). This interface is excercised by the CostModel analysis pass which looks for reduction patterns like the one above - starting at extractelements - and if it sees a matching sequence will call the cost model interface. We will also support a second form of pairwise reduction that is well supported on common architectures (haddps, vpadd, faddp). (v0, v1, v2, v3) \ / \ / (v0+v1, v2+v3, undef, undef) \ / ((v0+v1)+(v2+v3), undef, undef, undef) %rdx.shuf.0.0 = shufflevector <4 x float> %rdx, <4 x float> undef, <4 x i32> <i32 0, i32 2 , i32 undef, i32 undef> %rdx.shuf.0.1 = shufflevector <4 x float> %rdx, <4 x float> undef, <4 x i32> <i32 1, i32 3, i32 undef, i32 undef> %bin.rdx.0 = fadd <4 x float> %rdx.shuf.0.0, %rdx.shuf.0.1 %rdx.shuf.1.0 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef, <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef> %rdx.shuf.1.1 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef, <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef> %bin.rdx.1 = fadd <4 x float> %rdx.shuf.1.0, %rdx.shuf.1.1 %r = extractelement <4 x float> %bin.rdx.1, i32 0 llvm-svn: 190876	2013-09-17 18:06:50 +00:00
Serge Pavlov	8ec39992c1	Added documentation to getMemsetStores. llvm-svn: 190866	2013-09-17 16:24:42 +00:00
Quentin Colombet	d30a9585b8	[SelectionDAG] Teach the vector scalarizer about TRUNCATE. When a truncate node defines a legal vector type but uses an illegal vector type, the legalization process was splitting the vector until <1 x vector> type, but then it was failing to scalarize the node because it did not know how to handle TRUNCATE. <rdar://problem/14989896> llvm-svn: 190830	2013-09-17 00:26:56 +00:00
Adrian Prantl	db3e26d193	Debug info: Fix PR16736 and rdar://problem/14990587. A DBG_VALUE is register-indirect iff the first operand is a register _and_ the second operand is an immediate. llvm-svn: 190821	2013-09-16 23:29:03 +00:00
Jakub Staszak	ec2ffa92d8	Use reference instead of copy. llvm-svn: 190813	2013-09-16 22:03:38 +00:00
Peter Collingbourne	3fa50f9b05	Implement function prefix data as an IR feature. Previous discussion: http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/063909.html Differential Revision: http://llvm-reviews.chandlerc.com/D1191 llvm-svn: 190773	2013-09-16 01:08:15 +00:00
Benjamin Kramer	7d6052687e	Replace some unnecessary vector copies with references. llvm-svn: 190770	2013-09-15 22:04:42 +00:00
Hal Finkel	31658834e6	Prevent assert in CombinerGlobalAA with null values DAGCombiner::isAlias can be called with SrcValue1 or SrcValue2 null, and we can't use AA in this case (if we try, then the casting code in AA will assert). llvm-svn: 190763	2013-09-15 02:19:49 +00:00
Quentin Colombet	cf71c6320b	[Peephole] Rewrite copies to avoid cross register banks copies. By definition copies across register banks are not coalescable. Still, it may be possible to get rid of such a copy when the value is available in another register of the same register file. Consider the following example, where capital and lower letters denote different register file: b = copy A <-- cross-bank copy ... C = copy b <-- cross-bank copy This could have been optimized this way: b = copy A <-- cross-bank copy ... C = copy A <-- same-bank copy Note: b and C's definitions may be in different basic blocks. This patch adds a peephole optimization that looks through a chain of copies leading to a cross-bank copy and reuses a source that is on the same register file if available. This solution could also be used to get rid of some copies (e.g., A could have been used instead of C). However, we do not do so because: - It may over constrain the coloring of the source register for coalescing. - The register allocator may not be able to find a nice split point for the longer live-range, leading to more spill. <rdar://problem/14742333> llvm-svn: 190713	2013-09-13 18:26:31 +00:00
Eric Christopher	dd1a01203d	Add initial support for handling gnu style pubnames accepted by some versions of gold. This support is designed to allow gold to produce gdb_index sections similar to the accelerator tables and consumable by gdb. llvm-svn: 190649	2013-09-13 00:35:05 +00:00
Eric Christopher	8b3737fbb0	Reformat and hoist section grabbing to top level. llvm-svn: 190648	2013-09-13 00:34:58 +00:00
Joey Gouly	0e76fa7df5	Add an instruction deprecation feature to TableGen. The 'Deprecated' class allows you to specify a SubtargetFeature that the instruction is deprecated on. The 'ComplexDeprecationPredicate' class allows you to define a custom predicate that is called to check for deprecation. For example: ComplexDeprecationPredicate<"MCR"> would mean you would have to define the following function: bool getMCRDeprecationInfo(MCInst &MI, MCSubtargetInfo &STI, std::string &Info) Which returns 'false' for not deprecated, and 'true' for deprecated and store the warning message in 'Info'. The MCTargetAsmParser constructor was chaned to take an extra argument of the MCInstrInfo class, so out-of-tree targets will need to be changed. llvm-svn: 190598	2013-09-12 10:28:05 +00:00
Hal Finkel	6f1ff8e1a8	Fix crash in AggressiveAntiDepBreaker with empty CriticalPathSet If no register classes are added to CriticalPathRCs, then the CriticalPathSet bitmask will be empty. In that case, ExcludeRegs must remain NULL or else this line will cause a segfault: } else if ((ExcludeRegs != NULL) && ExcludeRegs->test(AntiDepReg)) { I have no in-tree test case. llvm-svn: 190584	2013-09-12 04:22:31 +00:00
Matt Arsenault	bc08ddba58	Remove pointless assertion after r190376 llvm-svn: 190565	2013-09-12 01:07:49 +00:00
Manman Ren	5b2f4b0540	Debug info: add more comments. llvm-svn: 190544	2013-09-11 19:40:28 +00:00
Hal Finkel	8f2e700522	Add getUnrollingPreferences to TTI Allow targets to customize the default behavior of the generic loop unrolling transformation. This will be used by the PowerPC backend when targeting the A2 core (which is in-order with a deep pipeline), and using more aggressive defaults is important. llvm-svn: 190542	2013-09-11 19:25:43 +00:00
Benjamin Kramer	079b96e6f7	Revert "Give internal classes hidden visibility." It works with clang, but GCC has different rules so we can't make all of those hidden. This reverts commit r190534. llvm-svn: 190536	2013-09-11 18:05:11 +00:00
Benjamin Kramer	6a44af3629	Give internal classes hidden visibility. Worth 100k on a linux/x86_64 Release+Asserts clang. llvm-svn: 190534	2013-09-11 17:42:27 +00:00
Bill Wendling	62a2d14ac5	Simplify the checking of function attributes by using the simple methods. llvm-svn: 190499	2013-09-11 08:35:09 +00:00
Eli Friedman	8f06d55697	Rename variables for consistency. No functional change. llvm-svn: 190466	2013-09-11 00:41:02 +00:00
Eli Friedman	78bffa5767	Fix unused variables. llvm-svn: 190448	2013-09-10 23:18:14 +00:00
Eric Christopher	13b99d2aba	Hoist section call out of loop. llvm-svn: 190440	2013-09-10 21:49:37 +00:00
Manman Ren	2312ed35d2	Debug Info: create scope children DIEs when the scope DIE is not null. We try to create the scope children DIEs after we create the scope DIE. But to avoid emitting empty lexical block DIE, we first check whether a scope DIE is going to be null, then create the scope children if it is not null. From the number of children, we decide whether to actually create the scope DIE. This patch also removes an early exit which checks for a special condition. It also removes deletion of un-used children DIEs that are generated because we used to generate children DIEs before the scope DIE. Deletion of un-used children DIEs may cause problem because we sometimes keep created DIEs in a member variable of a CU. llvm-svn: 190421	2013-09-10 18:40:41 +00:00
Manman Ren	34b3dcc3b5	Debug Info: define a DIRef template. Specialize the constructors for DIRef<DIScope> and DIRef<DIType> to make sure the Value is indeed a scope ref and a type ref. Use DIScopeRef for DIScope::getContext and DIType::getContext and use DITypeRef for getContainingType and getClassType. DIScope::generateRef now returns a DIScopeRef instead of a "Value *" for readability and type safety. llvm-svn: 190418	2013-09-10 18:30:07 +00:00
Matt Arsenault	d232222f34	Don't use getSetCCResultType for creating a vselect The vselect mask isn't a setcc. This breaks in the case when the result of getSetCCResultType is larger than the vector operands e.g. %tmp = select i1 %cmp <2 x i8> %a, <2 x i8> %b when getSetCCResultType returns <2 x i32>, the assertion that the (MaskTy.getSizeInBits() == Op1.getValueType().getSizeInBits()) is hit. No test since I don't think I can hit this with any of the current targets. The R600/SI implementation would break, since it returns a vector of i1 for this, but it doesn't reach ExpandSELECT for other reasons. llvm-svn: 190376	2013-09-10 00:41:56 +00:00
Andrew Trick	6c88b35090	Enable -misched-cyclicpath by default. llvm-svn: 190367	2013-09-09 23:31:14 +00:00
Manman Ren	de897a369a	Debug Info: move DIScope::getContext back from DwarfDebug. This partially reverts r190330. DIScope::getContext now returns DIScopeRef instead of DIScope. We construct a DIScopeRef from DIScope when we are dealing with subprogram, lexical block or name space. llvm-svn: 190362	2013-09-09 22:35:23 +00:00
Andrew Trick	e1f7bf2c02	mi-sched: smooth out the cyclicpath heuristic. Arnold's idea. I generally try to avoid stateful heuristics because it can make debugging harder. However, we need a way to prevent the latency priority from dominating, and it somewhat makes sense to schedule aggressively for latency only within an issue group. Swift in particular likes this, and it doesn't hurt anyone else: \| Benchmarks/MiBench/consumer-lame \| 10.39% \| \| Benchmarks/Misc/himenobmtxpa \| 9.63% \| llvm-svn: 190360	2013-09-09 22:28:08 +00:00
Jack Carter	170a5f2983	white spaces and long lines llvm-svn: 190358	2013-09-09 22:02:08 +00:00
Eric Christopher	ba506db498	Always add global names. We're adding them in the rest of the code as well as types. No functional change as they're not emitted unless the option is true anyhow. llvm-svn: 190346	2013-09-09 20:03:20 +00:00
Eric Christopher	5f93bb9299	Rename for consistency. llvm-svn: 190345	2013-09-09 20:03:17 +00:00
Bill Wendling	550c76dbd6	Call generateCompactUnwindEncodings() right before we need to output the frame information. There are more than one paths to where the frame information is emitted. Place the call to generateCompactUnwindEncodings() into the method which outputs the frame information, thus ensuring that the encoding is there for every path. This involved threading the MCAsmBackend object through to this method. <rdar://problem/13623355> llvm-svn: 190335	2013-09-09 19:48:37 +00:00
Manman Ren	116868eadd	Debug Info: Use DIScopeRef for DIType::getContext. In DIBuilder, the context field of a TAG_member is updated to use the scope reference. Verifier is updated accordingly. DebugInfoFinder now needs to generate a type identifier map to have access to the actual scope. Same applies for BreakpointPrinter. processModule of DebugInfoFinder is called during initialization phase of the verifier to make sure the type identifier map is constructed early enough. We are now able to unique a simple class as demonstrated by the added testing case. llvm-svn: 190334	2013-09-09 19:47:11 +00:00
Manman Ren	33796c5e98	Debug Info: move DIScope::getContext to DwarfDebug. DIScope::getContext is a wrapper function that calls the specific getContext method on each subclass. When we switch DIType::getContext to return DIScopeRef instead of DIScope, DIScope::getContext can no longer return a DIScope without a type identifier map. DIScope::getContext is only used by DwarfDebug, so we move it to DwarfDebug to have easy access to the type identifier map. llvm-svn: 190330	2013-09-09 19:23:58 +00:00
Bob Wilson	e407736a06	Revert patches to add case-range support for PR1255. The work on this project was left in an unfinished and inconsistent state. Hopefully someone will eventually get a chance to implement this feature, but in the meantime, it is better to put things back the way the were. I have left support in the bitcode reader to handle the case-range bitcode format, so that we do not lose bitcode compatibility with the llvm 3.3 release. This reverts the following commits: 155464, 156374, 156377, 156613, 156704, 156757, 156804 156808, 156985, 157046, 157112, 157183, 157315, 157384, 157575, 157576, 157586, 157612, 157810, 157814, 157815, 157880, 157881, 157882, 157884, 157887, 157901, 158979, 157987, 157989, 158986, 158997, 159076, 159101, 159100, 159200, 159201, 159207, 159527, 159532, 159540, 159583, 159618, 159658, 159659, 159660, 159661, 159703, 159704, 160076, 167356, 172025, 186736 llvm-svn: 190328	2013-09-09 19:14:35 +00:00
Manman Ren	3eb9dffc89	Debug Info: Move isSubprogramContext from DebugInfo to DwarfDebug. This helper function needs the type identifier map when we switch DIType::getContext to return DIScopeRef instead of DIScope. Since isSubprogramContext is used by DwarfDebug only, We move it to DwarfDebug to have easy access to the map. llvm-svn: 190325	2013-09-09 19:05:21 +00:00
Manman Ren	856191b0d1	Debug Info: Rename DITypeRef to DIScopeRef. A reference to a scope is more general than a reference to a type since DIType is a subclass of DIScope. A reference to a type can be either an identifier for the type or the DIType itself, while a reference to a scope can be either an identifier for the type (when the scope is indeed a type) or the DIScope itself. A reference to a type and a reference to a scope will be resolved in the same way. The only difference is in the verifier when a field is a reference to a type (i.e. the containing type field of a DICompositeType) or a field is a reference to a scope (i.e. the context field of a DIType). This is to get ready for switching DIType::getContext to return DIScopeRef instead of DIScope. Tighten up isTypeRef and isScopeRef to make sure the identifier is not empty and the MDNode is DIType for TypeRef and DIScope for ScopeRef. llvm-svn: 190322	2013-09-09 19:03:51 +00:00
Benjamin Kramer	d93817ffe0	[stackprotector] Modernize code with IRBuilder llvm-svn: 190317	2013-09-09 17:38:01 +00:00
Joey Gouly	a5153cb025	[ARMv8] Prevent generation of deprecated IT blocks on ARMv8 in Thumb mode. IT blocks can only be one instruction lonf, and can only contain a subset of the 16 instructions. Patch by Artyom Skrobov! llvm-svn: 190309	2013-09-09 14:21:49 +00:00
Bill Wendling	58e2d3d856	Generate compact unwind encoding from CFI directives. We used to generate the compact unwind encoding from the machine instructions. However, this had the problem that if the user used `-save-temps' or compiled their hand-written `.s' file (with CFI directives), we wouldn't generate the compact unwind encoding. Move the algorithm that generates the compact unwind encoding into the MCAsmBackend. This way we can generate the encoding whether the code is from a `.ll' or `.s' file. <rdar://problem/13623355> llvm-svn: 190290	2013-09-09 02:37:14 +00:00
Manman Ren	c4ae9b3aeb	Debug Info: Use identifier to reference DIType in containing type field of a DISubprogram. Verifier is updated accordingly. llvm-svn: 190229	2013-09-07 00:04:05 +00:00
Manman Ren	d8f798ea97	Debug Info: Use identifier to reference DIType in containing type field of a DICompositeType. Verifier is updated accordingly. llvm-svn: 190190	2013-09-06 18:46:00 +00:00
Andrew Trick	b248b4a1de	mi-sched: cleanup register pressure update, remove a FIXME. llvm-svn: 190181	2013-09-06 17:32:47 +00:00
Andrew Trick	c573cd905a	mi-sched: improve regpressure tracing. llvm-svn: 190180	2013-09-06 17:32:44 +00:00
Andrew Trick	7609b7d1b5	mi-sched: print tree size in -view-misched-dags llvm-svn: 190179	2013-09-06 17:32:42 +00:00
Andrew Trick	ffdbefb90c	mi-sched: register pressure update tracing. llvm-svn: 190178	2013-09-06 17:32:39 +00:00
Andrew Trick	ddffae9027	mi-sched: Reorder Cyclicpath (latency) and CriticalMax (pressure) heuristics. The latency based scheduling could induce spills in some cases. llvm-svn: 190177	2013-09-06 17:32:36 +00:00
Andrew Trick	75e411cc8e	Added MachineSchedPolicy. Allow subtargets to customize the generic scheduling strategy. This is convenient for targets that don't need to add new heuristics by specializing the strategy. llvm-svn: 190176	2013-09-06 17:32:34 +00:00
Matthias Braun	305ef7f5b0	avoid unnecessary direct access to LiveInterval::ranges llvm-svn: 190170	2013-09-06 16:44:32 +00:00
Matthias Braun	90e0d3c03a	remove unused argument from LiveRanges::join() llvm-svn: 190169	2013-09-06 16:44:29 +00:00
Matthias Braun	c0ad7bfa62	remove pointless assert The if above it ensures the property anyway. llvm-svn: 190168	2013-09-06 16:44:27 +00:00
Matthias Braun	b348d9703c	fix comment There's no 'B3' in the example. llvm-svn: 190167	2013-09-06 16:44:25 +00:00
Tim Northover	950fcc0577	SelectionDAG: create correct BooleanContent constants Occasionally DAGCombiner can spot that a SETCC operation is completely redundant and reduce it to "all true" or "all false". If this happens to a vector, the value produced has to take account of what a normal comparison would have produced, which may be an all-1s bitmask. The fix in SelectionDAG.cpp is tested, however, as far as I can see the code in TargetLowering.cpp is possibly unreachable and almost certainly irrelevant when triggered so there are no tests. However, I believe it's still clearly the right change and may save someone else some hassle if it suddenly becomes reachable. So I'm doing it anyway. llvm-svn: 190147	2013-09-06 12:38:12 +00:00
Manman Ren	60352032bf	Debug Info: Use identifier to reference DIType in base type field of ptr_to_member. We introduce a new class DITypeRef that represents a reference to a DIType. It wraps around a Value*, which can be either an identifier in MDString or an actual MDNode. The class has a helper function "resolve" that finds the actual MDNode for a given DITypeRef. We specialize getFieldAs to return a field that is a reference to a DIType. To correctly access the base type field of ptr_to_member, getClassType now calls getFieldAs<DITypeRef> to return a DITypeRef. Also add a typedef for DITypeIdentifierMap and a helper generateDITypeIdentifierMap in DebugInfo.h. In DwarfDebug.cpp, we keep a DITypeIdentifierMap and call generateDITypeIdentifierMap to actually populate the map. Verifier is updated accordingly. llvm-svn: 190081	2013-09-05 18:48:31 +00:00
Eric Christopher	cf7289f6d9	Move accelerator table defines and constants to Dwarf.h since we're proposing it for DWARF5. No functional change intended. llvm-svn: 190074	2013-09-05 18:20:16 +00:00
Eric Christopher	b4e2cc49ef	Reformat. llvm-svn: 190064	2013-09-05 16:46:43 +00:00
Andrew Trick	ed20075d19	mi-sched: Force bottom up scheduling for generic targets. Fast register pressure tracking currently only takes effect during bottom up scheduling. Forcing this is a bit faster and simpler for targets that don't have many scheduling constraints and don't need top-down scheduling. llvm-svn: 190014	2013-09-04 23:54:00 +00:00
Eric Christopher	e31e072c33	Remove hack ensuring that darwin didn't produce dwarf > 3 for modules without a limiting factor. Update all testcases accordingly. llvm-svn: 190002	2013-09-04 22:21:24 +00:00
Eric Christopher	c9f1e785d5	Revert "Revert r189902 as the workaround shouldn't be necessary anymore." Needs testcase updates. llvm-svn: 190000	2013-09-04 21:36:52 +00:00
Eric Christopher	b72ef638f4	Revert r189902 as the workaround shouldn't be necessary anymore. llvm-svn: 189999	2013-09-04 21:26:56 +00:00
Andrew Trick	b05db8e0b9	comment typo llvm-svn: 189997	2013-09-04 21:12:05 +00:00
Andrew Trick	2a749ee0b9	Remove dead subtree limit code. llvm-svn: 189995	2013-09-04 21:00:20 +00:00
Andrew Trick	856ecd9ab3	-view-misched-dags, better pruning. llvm-svn: 189994	2013-09-04 21:00:18 +00:00
Andrew Trick	ef54c59490	mi-sched: DEBUG cleanup, call tracePick for unidirectional scheduling. llvm-svn: 189993	2013-09-04 21:00:16 +00:00
Andrew Trick	1ab16d9ecf	80 columns llvm-svn: 189992	2013-09-04 21:00:13 +00:00
Andrew Trick	66c3dfbf8c	mi-sched: Suppress register pressure tracking when the scheduling window is too small. If the instruction window is < NumRegs/2, pressure tracking is not likely to be effective. The scheduler has to process a very large number of tiny blocks. We want this to be fast. llvm-svn: 189991	2013-09-04 21:00:11 +00:00
Andrew Trick	a6e877707f	mi-sched: Load clustering is a bit to expensive to enable unconditionally. llvm-svn: 189990	2013-09-04 21:00:08 +00:00
Andrew Trick	8c699c93b2	mi-sched: Reuse an invalid HazardRecognizer to save compile time. llvm-svn: 189989	2013-09-04 21:00:05 +00:00
Andrew Trick	310190e21f	mi-sched: bypass heuristic checks when regpressure tracking is disabled. llvm-svn: 189988	2013-09-04 21:00:02 +00:00
Andrew Trick	b6e74712b6	Added -misched-regpressure option. Register pressure tracking is half the complexity of the scheduler. It's useful to be able to turn it off for compile time and performance comparisons. llvm-svn: 189987	2013-09-04 20:59:59 +00:00
Eric Christopher	9adc55faa7	Unify and clean up. llvm-svn: 189977	2013-09-04 19:53:21 +00:00
Michael Gottesman	c89466fc22	Revert "Revert "Remove the darwin gdb option, that version of gdb is now dead and the rest of the compatibility should be done on a dwarf-N level."" This reverts commit r189913. Talked with Eric on IRC. I am going to XFAIL the failing test since it is using what Eric described as "the member hack" which was needed on that old GDB. Sorry for the noise! llvm-svn: 189914	2013-09-04 04:39:38 +00:00
Michael Gottesman	a318370b8d	Revert "Remove the darwin gdb option, that version of gdb is now dead and the rest of the compatibility should be done on a dwarf-N level." This reverts commit r189903. This commit broke the phase 1 buildbot for a while. http://lab.llvm.org:8013/builders/clang-x86_64-darwin11-nobootstrap-RAincremental/builds/6684 llvm-svn: 189913	2013-09-04 04:31:56 +00:00
Eric Christopher	614dc83603	Remove the darwin gdb option, that version of gdb is now dead and the rest of the compatibility should be done on a dwarf-N level. llvm-svn: 189903	2013-09-04 02:02:10 +00:00
Eric Christopher	38f1c64098	Make the default dwarf version 3 for darwin when we can't find one in the module. Add a FIXME with a comment about darwin's ld. llvm-svn: 189902	2013-09-04 01:38:30 +00:00
Eric Christopher	25b7adc8ce	Add a hashing routine that handles hashing types. Add a test for hashing the contents of DW_FORM_data1 on top of a type with attributes. llvm-svn: 189862	2013-09-03 21:57:57 +00:00
Eric Christopher	b86e2ad819	Sentences end with periods. llvm-svn: 189861	2013-09-03 21:57:50 +00:00
Eric Christopher	e020fa7c9c	Add the rest of the stock attributes to the attribute table. This won't affect the kinds of hashes we test for as we actually do hashing based on form and attribute. Change the fission-hash testcase one last time to handle DW_AT_comp_dir. llvm-svn: 189840	2013-09-03 20:00:20 +00:00
Andrew Trick	2c4f8b7ee8	Fix my previous checkin to updatePressureDiffs. There was one case that we could hit a DebugValue where I didn't think to check. DebugValues are evil. No checkinable test case, sorry. It's an obvious fix. llvm-svn: 189717	2013-08-31 05:17:58 +00:00
Andrew Trick	3bf33075ce	Use LiveRangeQuery for instruction-level liveness queries. Remove redundant or bug-prone LiveInterval APIs. llvm-svn: 189685	2013-08-30 17:58:49 +00:00
Andrew Trick	2bc74c2887	mi-sched: update PressureDiffs on-the-fly for liveness. This removes all expensive pressure tracking logic from the scheduling critical path of node comparison. llvm-svn: 189643	2013-08-30 04:36:57 +00:00
Andrew Trick	ff60477306	Replace LiveInterval::killedAt with isKilledAtInstr. Return true for LRGs that end at EarlyClobber or Register slots. llvm-svn: 189642	2013-08-30 04:31:01 +00:00
Andrew Trick	b1a45b6c61	mi-sched: improve the generic register pressure comparison. Only compare pressure within the same set. When multiple sets are affected, we prioritize the most constrained set. llvm-svn: 189641	2013-08-30 04:27:29 +00:00
Andrew Trick	1a8313458f	mi-sched: Precompute a PressureDiff for each instruction, adjust for liveness later. Created SUPressureDiffs array to hold the per node PDiff computed during DAG building. Added a getUpwardPressureDelta API that will soon replace the old one. Compute PressureDelta here from the precomputed PressureDiffs. Updating for liveness will come next. llvm-svn: 189640	2013-08-30 03:49:48 +00:00
Andrew Trick	ef80f50058	comment typo llvm-svn: 189635	2013-08-30 02:02:12 +00:00
Eric Christopher	4b358188c6	Don't bother emitting the pubtypes section on darwin since there aren't any maintained consumers of it on that platform. llvm-svn: 189631	2013-08-30 00:40:17 +00:00
Eric Christopher	ac8199bf60	Reformat slightly. llvm-svn: 189630	2013-08-30 00:39:57 +00:00
Andrew Trick	483f4199f3	Comment and revise the cyclic critical path code. This should be much more clear now. It's still disabled pending testing. llvm-svn: 189597	2013-08-29 18:04:49 +00:00
Hal Finkel	8e83820a04	Revert: r189565 - Add getUnrollingPreferences to TTI Revert unintentional commit (of an unreviewed change). Original commit message: Add getUnrollingPreferences to TTI Allow targets to customize the default behavior of the generic loop unrolling transformation. This will be used by the PowerPC backend when targeting the A2 core (which is in-order with a deep pipeline), and using more aggressive defaults is important. llvm-svn: 189566	2013-08-29 03:33:15 +00:00
Hal Finkel	63e6c0e9fb	Add getUnrollingPreferences to TTI Allow targets to customize the default behavior of the generic loop unrolling transformation. This will be used by the PowerPC backend when targeting the A2 core (which is in-order with a deep pipeline), and using more aggressive defaults is important. llvm-svn: 189565	2013-08-29 03:29:57 +00:00
Hal Finkel	5ef4dccdce	Use TargetSubtargetInfo::useAA() in DAGCombine This uses the TargetSubtargetInfo::useAA() function to control the defaults of the -combiner-alias-analysis and -combiner-global-alias-analysis options. llvm-svn: 189564	2013-08-29 03:29:55 +00:00
Hal Finkel	b350ffd1b1	Add useAA() to TargetSubtargetInfo There are several optional (off-by-default) features in CodeGen that can make use of alias analysis. These features are important for generating code for some kinds of cores (for example the (in-order) PPC A2 core). This adds a useAA() function to TargetSubtargetInfo to allow these features to be enabled by default on a per-subtarget basis. Here is the first use of this function: To control the default of the -enable-aa-sched-mi feature. llvm-svn: 189563	2013-08-29 03:25:05 +00:00
Juergen Ributzka	11c52c601a	Fix a typo and coding style of a previous commit. No functional change. llvm-svn: 189526	2013-08-28 22:33:58 +00:00
Eric Christopher	62caa709fe	Remove support for the .debug_inlined section. No known software in use supports it. llvm-svn: 189439	2013-08-28 04:04:28 +00:00
Eric Christopher	e9fd605b41	Add a TODO here. llvm-svn: 189428	2013-08-28 00:13:08 +00:00
Eric Christopher	d033d6fb88	Add support for DW_FORM_dataN and DW_FORM_udata to the DIE hashing algorithm. Update the split dwarf hashing testcase accordingly - this should be the last time that the hash of an empty file changes. llvm-svn: 189427	2013-08-28 00:10:38 +00:00
Eric Christopher	9d1daa87e7	Use DW_FORM_sdata for signed constant values and udata on occasion when we can. Migrate from using blocks when we're adding just a single attribute and floating point values are an unsigned, not signed, bag of bits. Update all test cases accordingly. llvm-svn: 189419	2013-08-27 23:49:04 +00:00
Tim Northover	819bfb5a25	DAGCombiner: make sure or/shl/srl really has zero high bits before forming bswap We want to convert code like (or (srl N, 8), (shl N, 8)) into (srl (bswap N), const), but this is only valid if the bits above 16 on the source pattern are 0, the checks we were doing on this were slightly wrong before. llvm-svn: 189348	2013-08-27 13:46:45 +00:00
Owen Anderson	a0260f848d	Remove an over-zealous assertion. A pointer type could be illegal if the target is prepared to custom-legalize pointer operands. This assertion was evaluated before the target would have a chance to do so, making it impossible. llvm-svn: 189299	2013-08-27 00:28:23 +00:00
Eric Christopher	ca68bbf5c0	Formatting. llvm-svn: 189296	2013-08-26 23:58:22 +00:00
Eric Christopher	6b16b43ef9	Make the lifetime of the DICompileUnit we're constructing from the MDNode more clear as just for a single argument. llvm-svn: 189294	2013-08-26 23:57:03 +00:00
Eric Christopher	6fdf324f44	Have the skeleton compile unit construction method take the CU it is constructing from as an input and keep the same unique identifier. We can use this to connect items which must stay in the .o file (e.g. pubnames and pubtypes) to the skeleton cu rather than having duplicate unique numbers for the sections and needing to do lookups based on MDNode. llvm-svn: 189293	2013-08-26 23:50:43 +00:00
Eric Christopher	6d13fe007f	Remove duplicate set of CompilationDir. llvm-svn: 189292	2013-08-26 23:50:40 +00:00
Eric Christopher	bfceb2fe8f	Remove the language parameter and variable from the compile unit. We can get it via the MDNode that's passed in. Save that instead. llvm-svn: 189291	2013-08-26 23:50:38 +00:00
Eric Christopher	4d36ca009f	Treat the pubtypes section similarly to the pubnames section and emit it by default under linux or when we're trying to keep compatibility with old gdb versions. Fix testcase for option name change. llvm-svn: 189289	2013-08-26 23:24:35 +00:00
Eric Christopher	bf1ea3c727	Only emit the section sym if we're emitting the section. llvm-svn: 189288	2013-08-26 23:24:31 +00:00
Eric Christopher	5297df025c	Fix thinko. llvm-svn: 189279	2013-08-26 20:58:35 +00:00
Tom Stellard	838e2344ec	SelectionDAG: Remove unnecessary uses of TargetLowering::getPointerTy() If we have a binary operation like ISD:ADD, we can set the result type equal to the result type of one of its operands rather than using TargetLowering::getPointerTy(). Also, any use of DAG.getIntPtrConstant(C) as an operand for a binary operation can be replaced with: DAG.getConstant(C, OtherOperand.getValueType()); llvm-svn: 189227	2013-08-26 15:06:10 +00:00
Tom Stellard	7da047c9fb	SelectionDAG: Use correct pointer size when splitting vector stores llvm-svn: 189224	2013-08-26 15:05:55 +00:00
Tom Stellard	fd155828ed	SelectionDAG: Use correct pointer size when lowering function arguments v2 This adds minimal support to the SelectionDAG for handling address spaces with different pointer sizes. The SelectionDAG should now correctly lower pointer function arguments to the correct size as well as generate the correct code when lowering getelementptr. This patch also updates the R600 DataLayout to use 32-bit pointers for the local address space. v2: - Add more helper functions to TargetLoweringBase - Use CHECK-LABEL for tests llvm-svn: 189221	2013-08-26 15:05:36 +00:00
David Majnemer	b78df507c8	AsmPrinter: Get rid of llvm$workaround$fake$stub$ We currently emit labels with the prefix Lllvm$workaround$fake$stub$ if the target's MCAsmInfo has getLinkOnceDirective() mapped to something interesting. This was apparently a work around introduced in r31033 for binutils that we don't need anymore. llvm-svn: 189187	2013-08-25 09:18:19 +00:00
Benjamin Kramer	b12cf01908	Add a function object to compare the first or second component of a std::pair. Replace instances of this scattered around the code base. llvm-svn: 189169	2013-08-24 12:54:27 +00:00
Benjamin Kramer	260de74e48	Simplify code. No functionality change. llvm-svn: 189168	2013-08-24 12:15:54 +00:00
Benjamin Kramer	892daba8d3	DwarfDebug: Delete orphaned children. Leak found by valgrind. llvm-svn: 189167	2013-08-24 11:55:49 +00:00
Andrew Trick	475a9911ca	PrintVRegOrUnit llvm-svn: 189124	2013-08-23 17:48:53 +00:00
Andrew Trick	e4c1ba762d	Rename to RegPressure API parameters RegUnits. llvm-svn: 189123	2013-08-23 17:48:51 +00:00
Andrew Trick	01bc216482	Simplify RegPressure helpers. llvm-svn: 189122	2013-08-23 17:48:48 +00:00
Andrew Trick	86a7061e5d	Add a convenient PSetIterator for visiting pressure sets affected by a register. llvm-svn: 189121	2013-08-23 17:48:46 +00:00
Andrew Trick	c01b00400d	Adds cyclic critical path computation and heuristics, temporarily disabled. Estimate the cyclic critical path within a single block loop. If the acyclic critical path is longer, then the loop will exhaust OOO resources after some number of iterations. If lag between the acyclic critical path and cyclic critical path is longer the the time it takes to issue those loop iterations, then aggressively schedule for latency. llvm-svn: 189120	2013-08-23 17:48:43 +00:00
Andrew Trick	8dd26f002f	MI Sched: record local vreg uses. This will be used to compute the cyclic critical path and to update precomputed per-node pressure differences. In the longer term, it could also be used to speed up LiveInterval update by avoiding visiting all global vreg users. llvm-svn: 189118	2013-08-23 17:48:39 +00:00
Andrew Trick	a53e101627	mi-sched: Don't call MBB.size() in initSUnits. The driver already has instr count. This fixes a pathological compile time problem with very large blocks and lots of scheduling boundaries. llvm-svn: 189116	2013-08-23 17:48:33 +00:00
Richard Sandiford	37cd6cfba2	Turn MipsOptimizeMathLibCalls into a target-independent scalar transform ...so that it can be used for z too. Most of the code is the same. The only real change is to use TargetTransformInfo to test when a sqrt instruction is available. The pass is opt-in because at the moment it only handles sqrt. llvm-svn: 189097	2013-08-23 10:27:02 +00:00
Michael Gottesman	20f25eb958	[stack protector] Work around an issue with the BMOVPCB_CALL instruction on ARM by disabling does not return on __stack_chk_fail. This is to fix the bots while I look to see if there is something I can do here. rdar://14811848 llvm-svn: 189076	2013-08-22 23:45:24 +00:00
Bill Wendling	fe88aea706	Check only if we have this attribute. If it's not an attribute, then it's assumed false. llvm-svn: 189063	2013-08-22 21:16:14 +00:00
Michael Gottesman	1adac3582d	[stackprotector] When finding the split point to splice off the end of a parentmbb into a successmbb, include any DBG_VALUE MI. Fix for PR16954. llvm-svn: 188987	2013-08-22 05:40:50 +00:00
Tom Stellard	1b2c2d8414	SelectionDAG: Make sure stores are always added to the LegalizedNodes list When truncated vector stores were being custom lowered in VectorLegalizer::LegalizeOp(), the old (illegal) and new (legal) node pair was not being added to LegalizedNodes list. Instead of the legalized result being passed to VectorLegalizer::TranslateLegalizeResult(), the result was being passed back into VectorLegalizer::LegalizeOp(), which ended up adding a (new, new) pair to the list instead. This was causing an assertion failure when a custom lowered truncated vector store was the last instruction a basic block and the VectorLegalizer was unable to find it in the LegalizedNodes list when updating the DAG root. llvm-svn: 188953	2013-08-21 22:42:58 +00:00
Juergen Ributzka	3db39dc1ae	Teach BaseIndexOffset::match to identify base pointers in loops. The small utility function that pattern matches Base + Index + Offset patterns for loads and stores fails to recognize the base pointer for loads/stores from/into an array at offset 0 inside a loop. As a result DAGCombiner::MergeConsecutiveStores was not able to merge all stores. This commit fixes the issue by adding an additional pattern match and also a test case. Reviewer: Nadav llvm-svn: 188936	2013-08-21 21:53:38 +00:00
David Majnemer	ed89b5c6e7	DebugInfo: Do not use the DWARF Version for the .debug_pubnames or .debug_pubtypes version field Summary: LLVM would generate DWARF with version 3 in the .debug_pubname and .debug_pubtypes version fields. This would lead SGI dwarfdump to fail parsing the DWARF with (in the instance of .debug_pubnames) would exit with: dwarfdump ERROR: dwarf_get_globals: DW_DLE_PUBNAMES_VERSION_ERROR (123) This fixes PR16950. Reviewers: echristo, dblaikie Reviewed By: echristo CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1454 llvm-svn: 188869	2013-08-21 06:13:34 +00:00
Richard Sandiford	6f6d55161b	[SystemZ] Use SRST to optimize memchr SystemZTargetLowering::emitStringWrapper() previously loaded the character into R0 before the loop and made R0 live on entry. I'd forgotten that allocatable registers weren't allowed to be live across blocks at this stage, and it confused LiveVariables enough to cause a miscompilation of f3 in memchr-02.ll. This patch instead loads R0 in the loop and leaves LICM to hoist it after RA. This is actually what I'd tried originally, but I went for the manual optimisation after noticing that R0 often wasn't being hoisted. This bug forced me to go back and look at why, now fixed as r188774. We should also try to optimize null checks so that they test the CC result of the SRST directly. The select between null and the SRST GPR result could then usually be deleted as dead. llvm-svn: 188779	2013-08-20 09:38:48 +00:00
Richard Sandiford	96aa93d5f1	Fix overly pessimistic shortcut in post-RA MachineLICM Post-RA LICM keeps three sets of registers: PhysRegDefs, PhysRegClobbers and TermRegs. When it sees a definition of R it adds all aliases of R to the corresponding set, so that when it needs to test for membership it only needs to test a single register, rather than worrying about aliases there too. E.g. the final candidate loop just has: unsigned Def = Candidates[i].Def; if (!PhysRegClobbers.test(Def) && ...) { to test whether register Def is multiply defined. However, there was also a shortcut in ProcessMI to make sure we didn't add candidates if we already knew that they would fail the final test. This shortcut was more pessimistic than the final one because it checked whether _any alias_ of the defined register was multiply defined. This is too conservative for targets that define register pairs. E.g. on z, R0 and R1 are sometimes used as a pair, so there is a 128-bit register that aliases both R0 and R1. If a loop used R0 and R1 independently, and the definition of R0 came first, we would be able to hoist the R0 assignment (because that used the final test quoted above) but not the R1 assignment (because that meant we had two definitions of the paired R0/R1 register and would fail the shortcut in ProcessMI). This patch just uses the same check for the ProcessMI shortcut as we use in the final candidate loop. llvm-svn: 188774	2013-08-20 09:11:13 +00:00
Michael Gottesman	dc985ef0af	[stackprotector] Small cleanup. llvm-svn: 188772	2013-08-20 08:56:28 +00:00
Michael Gottesman	76c44be14a	[stackprotector] Small Bit of computation hoisting. llvm-svn: 188771	2013-08-20 08:56:26 +00:00
Michael Gottesman	1977d15e02	[stackprotector] Added significantly longer comment to FindPotentialTailCall to make clear its relationship to llvm::isInTailCallPosition. llvm-svn: 188770	2013-08-20 08:56:23 +00:00
Michael Gottesman	62c5d714a1	Removed trailing whitespace. llvm-svn: 188769	2013-08-20 08:46:16 +00:00
Michael Gottesman	56e246b1a1	[stackprotector] Removed stale TODO. llvm-svn: 188768	2013-08-20 08:46:13 +00:00
Michael Gottesman	5e57068b7a	[stackprotector] Added support for emitting the llvm intrinsic stack protector check. rdar://13935163 llvm-svn: 188766	2013-08-20 08:36:53 +00:00
Michael Gottesman	ce0e4c263b	[stackprotector] Refactor out the end of isInTailCallPosition into the function returnTypeIsEligibleForTailCall. This allows me to use returnTypeIsEligibleForTailCall in the stack protector pass. rdar://13935163 llvm-svn: 188765	2013-08-20 08:36:50 +00:00
Michael Gottesman	f7e1203d95	Remove unused variables that crept in. llvm-svn: 188761	2013-08-20 07:17:27 +00:00
Michael Gottesman	b27f0f1f6b	Teach selectiondag how to handle the stackprotectorcheck intrinsic. Previously, generation of stack protectors was done exclusively in the pre-SelectionDAG Codegen LLVM IR Pass "Stack Protector". This necessitated splitting basic blocks at the IR level to create the success/failure basic blocks in the tail of the basic block in question. As a result of this, calls that would have qualified for the sibling call optimization were no longer eligible for optimization since said calls were no longer right in the "tail position" (i.e. the immediate predecessor of a ReturnInst instruction). Then it was noticed that since the sibling call optimization causes the callee to reuse the caller's stack, if we could delay the generation of the stack protector check until later in CodeGen after the sibling call decision was made, we get both the tail call optimization and the stack protector check! A few goals in solving this problem were: 1. Preserve the architecture independence of stack protector generation. 2. Preserve the normal IR level stack protector check for platforms like OpenBSD for which we support platform specific stack protector generation. The main problem that guided the present solution is that one can not solve this problem in an architecture independent manner at the IR level only. This is because: 1. The decision on whether or not to perform a sibling call on certain platforms (for instance i386) requires lower level information related to available registers that can not be known at the IR level. 2. Even if the previous point were not true, the decision on whether to perform a tail call is done in LowerCallTo in SelectionDAG which occurs after the Stack Protector Pass. As a result, one would need to put the relevant callinst into the stack protector check success basic block (where the return inst is placed) and then move it back later at SelectionDAG/MI time before the stack protector check if the tail call optimization failed. The MI level option was nixed immediately since it would require platform specific pattern matching. The SelectionDAG level option was nixed because SelectionDAG only processes one IR level basic block at a time implying one could not create a DAG Combine to move the callinst. To get around this problem a few things were realized: 1. While one can not handle multiple IR level basic blocks at the SelectionDAG Level, one can generate multiple machine basic blocks for one IR level basic block. This is how we handle bit tests and switches. 2. At the MI level, tail calls are represented via a special return MIInst called "tcreturn". Thus if we know the basic block in which we wish to insert the stack protector check, we get the correct behavior by always inserting the stack protector check right before the return statement. This is a "magical transformation" since no matter where the stack protector check intrinsic is, we always insert the stack protector check code at the end of the BB. Given the aforementioned constraints, the following solution was devised: 1. On platforms that do not support SelectionDAG stack protector check generation, allow for the normal IR level stack protector check generation to continue. 2. On platforms that do support SelectionDAG stack protector check generation: a. Use the IR level stack protector pass to decide if a stack protector is required/which BB we insert the stack protector check in by reusing the logic already therein. If we wish to generate a stack protector check in a basic block, we place a special IR intrinsic called llvm.stackprotectorcheck right before the BB's returninst or if there is a callinst that could potentially be sibling call optimized, before the call inst. b. Then when a BB with said intrinsic is processed, we codegen the BB normally via SelectBasicBlock. In said process, when we visit the stack protector check, we do not actually emit anything into the BB. Instead, we just initialize the stack protector descriptor class (which involves stashing information/creating the success mbbb and the failure mbb if we have not created one for this function yet) and export the guard variable that we are going to compare. c. After we finish selecting the basic block, in FinishBasicBlock if the StackProtectorDescriptor attached to the SelectionDAGBuilder is initialized, we first find a splice point in the parent basic block before the terminator and then splice the terminator of said basic block into the success basic block. Then we code-gen a new tail for the parent basic block consisting of the two loads, the comparison, and finally two branches to the success/failure basic blocks. We conclude by code-gening the failure basic block if we have not code-gened it already (all stack protector checks we generate in the same function, use the same failure basic block). llvm-svn: 188755	2013-08-20 07:00:16 +00:00
Hal Finkel	0c5c01aa4a	Add a llvm.copysign intrinsic This adds a llvm.copysign intrinsic; We already have Libfunc recognition for copysign (which is turned into the FCOPYSIGN SDAG node). In order to autovectorize calls to copysign in the loop vectorizer, we need a corresponding intrinsic as well. In addition to the expected changes to the language reference, the loop vectorizer, BasicTTI, and the SDAG builder (the intrinsic is transformed into an FCOPYSIGN node, just like the function call), this also adds FCOPYSIGN to a few lists in LegalizeVector{Ops,Types} so that vector copysigns can be expanded. In TargetLoweringBase::initActions, I've made the default action for FCOPYSIGN be Expand for vector types. This seems correct for all in-tree targets, and I think is the right thing to do because, previously, there was no way to generate vector-values FCOPYSIGN nodes (and most targets don't specify an action for vector-typed FCOPYSIGN). llvm-svn: 188728	2013-08-19 23:35:46 +00:00

... 6 7 8 9 10 ...

16046 Commits