llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	718cb0ea62	[SelectionDAG][X86] CombineBT - more aggressively determine demanded bits This patch is in 2 parts: 1 - replace combineBT's use of SimplifyDemandedBits (hasOneUse only) with SelectionDAG::GetDemandedBits to more aggressively determine the lower bits used by BT. 2 - update SelectionDAG::GetDemandedBits to support ANY_EXTEND - if the demanded bits are only in the non-extended portion, then peek through and demand from the source value and then ANY_EXTEND that if we found a match. Differential Revision: https://reviews.llvm.org/D35896 llvm-svn: 309486	2017-07-29 14:50:25 +00:00
Jessica Paquette	d87f54493d	[MachineOutliner] NFC: Change IsTailCall to a call class + frame class This commit - Removes IsTailCall and replaces it with a target-defined unsigned - Refactors getOutliningCallOverhead and getOutliningFrameOverhead so that they don't use IsTailCall - Adds a call class + frame class classification to OutlinedFunction and Candidate respectively This accomplishes a couple things. Firstly, we don't need the notion of tail call in the general outlining algorithm. Secondly, we now can have different "outlining classes" for each candidate within a set of candidates. This will make it easy to add new ways to outline sequences for certain targets and dynamically choose an appropriate cost model for a sequence depending on the context that that sequence lives in. Ultimately, this should get us closer to being able to do something like, say avoid saving the link register when outlining AArch64 instructions. llvm-svn: 309475	2017-07-29 02:55:46 +00:00
Adrian Prantl	359846fe1c	Remove the unused offset field from LiveDebugValues (NFC) Followup to r309426. rdar://problem/33580047 llvm-svn: 309455	2017-07-28 23:25:51 +00:00
Adrian Prantl	b2811e50f9	Remove the unused offset field from LiveDebugVariables (NFC) Followup to r309426. rdar://problem/33580047 llvm-svn: 309451	2017-07-28 23:06:50 +00:00
Adrian Prantl	8b9bb534a1	Remove the unused offset from DBG_VALUE (NFC) Followup to r309426. rdar://problem/33580047 llvm-svn: 309450	2017-07-28 23:00:45 +00:00
Adrian Prantl	d92ac5a259	Remove the unused DBG_VALUE offset parameter from GlobalISel (NFC) Followup to r309426. rdar://problem/33580047 llvm-svn: 309449	2017-07-28 22:46:20 +00:00
Adrian Prantl	1b63dc54b0	Remove the unused DBG_VALUE offset parameter from RegAllocFast (NFC) Followup to r309426. rdar://problem/33580047 llvm-svn: 309446	2017-07-28 22:36:55 +00:00
Adrian Prantl	a617576bb1	Remove the unused dbg.value offset from SelectionDAG (NFC) Followup to r309426. rdar://problem/33580047 llvm-svn: 309436	2017-07-28 21:27:35 +00:00
Adrian Prantl	abe04759a6	Remove the obsolete offset parameter from @llvm.dbg.value There is no situation where this rarely-used argument cannot be substituted with a DIExpression and removing it allows us to simplify the DWARF backend. Note that this patch does not yet remove any of the newly dead code. rdar://problem/33580047 Differential Revision: https://reviews.llvm.org/D35951 llvm-svn: 309426	2017-07-28 20:21:02 +00:00
Reid Kleckner	9be82c3169	Fix conditional tail call branch folding when both edges are the same The conditional tail call logic did the wrong thing when both destinations of a conditional branch were the same: BB#1: derived from LLVM BB %entry Live Ins: %EFLAGS Predecessors according to CFG: BB#0 JE_1 <BB#5>, %EFLAGS<imp-use,kill> JMP_1 <BB#5> BB#5: derived from LLVM BB %sw.epilog Predecessors according to CFG: BB#1 TCRETURNdi64 <ga:@mergeable_conditional_tailcall>, 0, ... We would fold the JE_1 to a TCRETURNdi64cc, and then remove our BB#5 successor. Then BB#5 would be deleted as it had no predecessors, leaving a dangling "JMP_1 <BB#5>" reference behind to cause assertions later. This patch checks that both conditional branch destinations are different before doing the transform. The standard branch folding logic is able to remove both the JMP_1 and the JE_1, and for my test case we end up forming a better conditional tail call later. Fixes PR33980 llvm-svn: 309422	2017-07-28 19:48:40 +00:00
Jessica Paquette	4602c3437c	[MachineOutliner] NFC: Comment tidying The comment on describing the suffix tree had some pruning stuff that was out of date in it. Also fixed some typos. llvm-svn: 309365	2017-07-28 05:59:30 +00:00
Jessica Paquette	809d708b8a	[MachineOutliner] NFC: Split up getOutliningBenefit This is some more cleanup in preparation for some actual functional changes. This splits getOutliningBenefit into two cost functions: getOutliningCallOverhead and getOutliningFrameOverhead. These functions return the number of instructions that would be required to call a specific function and the number of instructions that would be required to construct a frame for a specific funtion. The actual outlining benefit logic is moved into the outliner, which calls these functions. The goal of refactoring getOutliningBenefit is to: - Get us closer to getting rid of the IsTailCall flag - Further split up "target-specific" things and "general algorithm" things llvm-svn: 309356	2017-07-28 03:21:58 +00:00
David Blaikie	89daf77a11	DebugInfo: Consider a CU containing only local imported entities to be 'empty' This can come up in ThinLTO & wastes space & makes degenerate IR. As per the added FIXME, ultimately, local imported entities should hang off the function and that way the imported entity list on the CU can be tested for emptiness like all the other CU lists. (function-attached local imported entities are probably also the best path forward for fixing how imported entities are handled both in cross-module use (currently, while ThinLTO preserves the imported entities, they would not get used at the imported inlined location - only in the abstract origin that appears in the partial CU created by the import (which isn't emitted under Fission due to cross-CU limitations there)) and to reduce the number of points where imported entities are emitted (they're currently emitted into every inlined instance, concrete instance, and abstract origin - they should only go in teh abstract origin if there is one, otherwise in the concrete instance - but this requires lots of delayed handling and wiring up, same as abstract variables & subprograms)) llvm-svn: 309354	2017-07-28 03:06:25 +00:00
Jessica Paquette	78681be2a4	[MachineOutliner] Cleanup: move findCandidates out of suffix tree Doing some cleanup in preparation for some functional changes. This commit moves findCandidates out of the suffix tree and into the MachineOutliner class. This is much easier to follow, and removes the burden of candidate choice from the suffix tree. It also adds a couple FIXMEs and simplifies building outlined function names. llvm-svn: 309334	2017-07-27 23:24:43 +00:00
Simon Pilgrim	ac84850ea6	[SelectionDAG] Improve DAGTypeLegalizer::convertMask assertion (PR33960) Improve DAGTypeLegalizer::convertMask's isSETCCorConvertedSETCC assertion to properly check for any mixture of SETCC or BUILD_VECTOR of constants, or a logical mask op of them. llvm-svn: 309302	2017-07-27 18:15:54 +00:00
Adam Nemet	6374331a8c	[OptRemark] Allow streaming of 64-bit integers llvm-svn: 309293	2017-07-27 16:54:13 +00:00
Simon Pilgrim	64a795c5f5	[SelectionDAG] Avoid repeated calls to getNumOperands in for loop. NFCI. llvm-svn: 309283	2017-07-27 15:42:21 +00:00
Adrian Prantl	960e7663f3	remove redundant check llvm-svn: 309280	2017-07-27 15:24:20 +00:00
Simon Pilgrim	0d543b5921	[SelectionDAG] Tidyup mask creation. NFCI. Assign all concat elements to UNDEF and then just replace the first element, instead of copying everything individually. llvm-svn: 309277	2017-07-27 15:08:53 +00:00
David Blaikie	2195e13676	DebugInfo: Ensure imported entities at the top level of an inlined function don't cause degenerate concrete definitions Local imported entities at the top level of a subprogram were being handled differently from those in nested scopes - that different handling would cause pseudo concrete out-of-line definitions to be created (but without any of their attributes, nor an abstract_origin) in the case where there was no real concrete definition. These local imported entities also only appeared in the concrete definition where those imported entities in nested scopes appear in all cases (abstract, concrete, and inlined). This change at least makes top level case handle the same as the others - though there's a FIXME to improve this to /only/ emit them into the abstract origin (though this requires more plumbing - like the abstract subprogram and variable handling that must defer population until the end of the unit to discover if there is an abstract origin, or only a standalone concrete definition). llvm-svn: 309237	2017-07-27 00:06:53 +00:00
Peter Collingbourne	081ffe2ff2	Change CallLoweringInfo::CS to be an ImmutableCallSite instead of a pointer. NFCI. This was a use-after-free waiting to happen. llvm-svn: 309159	2017-07-26 19:15:29 +00:00
Andrew V. Tischenko	d1fefa3d7c	This patch returns proper value to indicate the case when instruction throughput can't be calculated. Differential revision https://reviews.llvm.org/D35831 llvm-svn: 309156	2017-07-26 18:55:14 +00:00
Adrian Prantl	833ad37c90	Do a better job at emitting prefrabricated skeleton CUs. This is a better fix than r308708 for the problem introduced in r304020. It restores the skeleton CU testcases modified by that commit to their original form and most importantly ensures that frontend-generated skeleton CUs (such as used to point to Clang modules) come after the regular CUs. This broke for DICompileUnit nodes that don't have any immediate children because they are now constructed lazily instead of the order in which they are listed in !llvm.dbg.cu. After this commit we still don't guarantee that order, but we do guarantee that empty skeletons come last. Shipping versions of LLDB are very sensitive to the ordering of CUs. I'll track a fix for LLDB to be more permissive separately. This fixes a test failure in the LLDB testsuite. rdar://problem/33357252 llvm-svn: 309154	2017-07-26 18:48:32 +00:00
Zvi Rackover	092f199188	DAGCombiner: Extend reduceBuildVecToTrunc to handle non-zero offset Summary: Adding support for combining power2-strided build_vector's where the first build_vectori's operand is extracted from a non-zero index. Example: v4i32 build_vector((extract_elt V, 1), (extract_elt V, 3), (extract_elt V, 5), (extract_elt V, 7)) --> v4i32 truncate (bitcast (shuffle<1,u,3,u,5,u,7,u> V, u) to v4i64) Reviewers: delena, RKSimon, guyblank Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35700 llvm-svn: 309108	2017-07-26 12:57:03 +00:00
Adrian Prantl	be66271f04	Debug Info: Support fragmented variables in the MMI side table This reapplies commit r309034 with a bugfix+test for inlined variables. llvm-svn: 309057	2017-07-25 23:32:59 +00:00
Adrian Prantl	b6d5faf2ea	Revert "Debug Info: Support fragmented variables in the MMI side table" This reverts commit r309034 because of a sanitizer issue. llvm-svn: 309035	2017-07-25 21:50:45 +00:00
Adrian Prantl	3d1ab0cd1e	Debug Info: Support fragmented variables in the MMI side table <rdar://problem/17816343> llvm-svn: 309034	2017-07-25 21:29:22 +00:00
Simon Pilgrim	6d59933175	[DAG] Move DAGCombiner::GetDemandedBits to SelectionDAG This patch moves the DAGCombiner::GetDemandedBits function to SelectionDAG::GetDemandedBits as a first step towards making it easier for targets to get to the source of any demanded bits without the limitations of SimplifyDemandedBits. Differential Revision: https://reviews.llvm.org/D35841 llvm-svn: 308983	2017-07-25 16:36:44 +00:00
Francois Pichet	82bf3de606	Fix endianness bug in DAGCombiner::visitTRUNCATE and visitEXTRACT_VECTOR_ELT Summary: Do not assume little endian architecture in DAGCombiner::visitTRUNCATE and DAGCombiner::visitEXTRACT_VECTOR_ELT. PR33682 Reviewers: hfinkel, sdardis, RKSimon Reviewed By: sdardis, RKSimon Subscribers: uabelho, RKSimon, sdardis, llvm-commits Differential Revision: https://reviews.llvm.org/D34990 llvm-svn: 308960	2017-07-25 09:40:35 +00:00
Matt Arsenault	5fbc87021e	RA: Replace asserts related to empty live intervals These don't exactly assert the same thing anymore, and allow empty live intervals with non-empty uses. Removed in r308808 and r308813. llvm-svn: 308906	2017-07-24 18:07:55 +00:00
Benjamin Kramer	fc638c11bb	[CodeGenPrepare] Cut off FindAllMemoryUses if there are too many uses. This avoids excessive compile time. The case I'm looking at is Function.cpp from an old version of LLVM that still had the giant memcmp string matcher in it. Before r308322 this compiled in about 2 minutes, after it, clang takes infinite* time to compile it. With this patch we're at 5 min, which is still bad but this is a pathological case. The cut off at 20 uses was chosen by looking at other cut-offs in LLVM for user scanning. It's probably too high, but does the job and is very unlikely to regress anything. Fixes PR33900. * I'm impatient and aborted after 15 minutes, on the bug report it was killed after 2h. llvm-svn: 308891	2017-07-24 16:18:09 +00:00
Reid Kleckner	898ddf61c0	[codeview] Emit 'D' as the cv source language for D code This matches DMD: `522263965c/src/ddmd/backend/cv8.c (L199)` Fixes PR33899. llvm-svn: 308890	2017-07-24 16:16:42 +00:00
Reid Kleckner	7f6b2534fb	Format some case labels and shrink an anonymous namespace NFC llvm-svn: 308889	2017-07-24 16:16:17 +00:00
Petr Hosek	710479cede	[CodeGen][X86] Fuchsia supports sincos* libcalls and sin+cos->sincos optimization Patch by Roland McGrath Differential Revision: https://reviews.llvm.org/D35748 llvm-svn: 308854	2017-07-23 22:30:00 +00:00
Nirav Dave	4e6dcf73f9	[DAG] Fix typo preventing some stores merges to truncated stores. Check the actual memory type stored and not the extended value size when considering if truncated store merge is worthwhile. Reviewers: efriedma, RKSimon, spatel, jyknight Reviewed By: efriedma Subscribers: llvm-commits, nhaehnle Differential Revision: https://reviews.llvm.org/D35623 llvm-svn: 308833	2017-07-23 02:06:28 +00:00
Matt Arsenault	c5d1e503e1	RA: Remove another assert on empty intervals This case is similar to the one fixed in r308808, except when rematerializing. Fixes bug 33884. llvm-svn: 308813	2017-07-22 00:24:01 +00:00
Matt Arsenault	6a963f76ca	RA: Remove assert on empty live intervals This is possible if there is an undef use when splitting the vreg during spilling. Fixes bug 33620. llvm-svn: 308808	2017-07-21 23:56:13 +00:00
Xin Tong	495a3022da	[DAGCombiner] Update comment. NFC llvm-svn: 308772	2017-07-21 19:10:19 +00:00
Jonas Paulsson	024e319489	[SystemZ, LoopStrengthReduce] This patch makes LSR generate better code for SystemZ in the cases of memory intrinsics, Load->Store pairs or comparison of immediate with memory. In order to achieve this, the following common code changes were made: * New TTI hook: LSRWithInstrQueries(), which defaults to false. Controls if LSR should do instruction-based addressing evaluations by calling isLegalAddressingMode() with the Instruction pointers. * In LoopStrengthReduce: handle address operands of memset, memmove and memcpy as address uses, and call isFoldableMemAccessOffset() for any LSRUse::Address, not just loads or stores. SystemZ changes: * isLSRCostLess() implemented with Insns first, and without ImmCost. * New function supportedAddressingMode() that is a helper for TTI methods looking at Instructions passed via pointers. Review: Ulrich Weigand, Quentin Colombet https://reviews.llvm.org/D35262 https://reviews.llvm.org/D35049 llvm-svn: 308729	2017-07-21 11:59:37 +00:00
Philipp Schaad	a81d23030f	Commit access test llvm-svn: 308712	2017-07-21 03:51:01 +00:00
Adrian Prantl	65e7ca995d	Debug Info: Don't strip clang module skeleton CUs. This corrects a (hopefully :-) accidental side-effect of r304020. rdar://problem/33442618 llvm-svn: 308708	2017-07-21 01:24:05 +00:00
Tim Northover	071d77a51f	GlobalISel: stop localizer putting constants before EH_LABELs If the localizer pass puts one of its constants before the label that tells the unwinder "jump here to handle your exception" then control-flow will skip it, leaving uninitialized registers at runtime. That's bad. llvm-svn: 308687	2017-07-20 22:58:26 +00:00
Matt Arsenault	db78273b6e	Add an ID field to StackObjects On AMDGPU SGPR spills are really spilled to another register. The spiller creates the spills to new frame index objects, which is used as a placeholder. This will eventually be replaced with a reference to a position in a VGPR to write to and the frame index deleted. It is most likely not a real stack location that can be shared with another stack object. This is a problem when StackSlotColoring decides it should combine a frame index used for a normal VGPR spill with a real stack location and a frame index used for an SGPR. Add an ID field so that StackSlotColoring has a way of knowing the different frame index types are incompatible. llvm-svn: 308673	2017-07-20 21:03:45 +00:00
Francis Visoiu Mistrih	39aa5dbbf5	[PEI] Fix refactoring from r308664 llvm-svn: 308666	2017-07-20 20:31:44 +00:00
Mandeep Singh Grang	d41ac895bb	[COFF, ARM64, CodeView] Add support to emit CodeView debug info for ARM64 COFF Reviewers: compnerd, ruiu, rnk, zturner Reviewed By: rnk Subscribers: majnemer, aemerson, aprantl, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D35518 llvm-svn: 308665	2017-07-20 20:20:00 +00:00
Francis Visoiu Mistrih	631f6b888c	[PEI] Separate saving and restoring CSRs into different functions. NFC Split insertCSRSpillsAndRestores into insertCSRSaves + insertCSRRestores. This is mostly useful for future shrink-wrapping improvements where we want to save / restore a specific part of the CSRs in a specific block. Differential Revision: https://reviews.llvm.org/D35644 llvm-svn: 308664	2017-07-20 20:17:17 +00:00
Krzysztof Parzyszek	f3a778d757	Implement LaneBitmask::getNumLanes and LaneBitmask::getHighestLane This should eliminate most uses of countPopulation and Log2_32 on the lane mask values. llvm-svn: 308658	2017-07-20 19:43:19 +00:00
Krzysztof Parzyszek	e9f0c1e031	Use LaneBitmask::getLane in a few more places llvm-svn: 308655	2017-07-20 19:15:56 +00:00
Nirav Dave	4aa51c3af1	[DAG] Commit missed nit cleanup from r308617. NFC. llvm-svn: 308645	2017-07-20 18:07:57 +00:00
Nirav Dave	df86d2d008	[DAG] Handle missing transform in fold of value extension case. Summary: When pushing an extension of a constant bitwise operator on a load into the load, change other uses of the load value if they exist to prevent the old load from persisting. Reviewers: spatel, RKSimon, efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35030 llvm-svn: 308618	2017-07-20 13:57:32 +00:00
Nirav Dave	77cc6f23b9	[DAG] Optimize away degenerate INSERT_VECTOR_ELT nodes. Summary: Add missing vector write of vector read reduction, i.e.: (insert_vector_elt x (extract_vector_elt x idx) idx) to x Reviewers: spatel, RKSimon, efriedma Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35563 llvm-svn: 308617	2017-07-20 13:48:17 +00:00
Simon Pilgrim	2911296f10	[DAGCombiner] Match ISD::SRL non-uniform constant vectors patterns using predicates. Use predicate matchers introduced in D35492 to match more ISD::SRL constant folds llvm-svn: 308602	2017-07-20 11:03:30 +00:00
Simon Pilgrim	b9ff25df59	Remove trailing whitespace. NFCI. llvm-svn: 308601	2017-07-20 10:43:52 +00:00
Simon Pilgrim	7ff0e49d8c	[DAGCombiner] Match ISD::SRA non-uniform constant vectors patterns using predicates. Use predicate matchers introduced in D35492 to match more ISD::SRA constant folds llvm-svn: 308600	2017-07-20 10:43:05 +00:00
Simon Pilgrim	9d7863b935	[DAGCombiner] Match non-uniform constant vectors using predicates. Most combines currently recognise scalar and splat-vector constants, but not non-uniform vector constants. This patch introduces a matching mechanism that uses predicates to check against BUILD_VECTOR of ConstantSDNode, as well as scalar ConstantSDNode cases. I've changed a couple of predicates to demonstrate - the combine-shl changes add currently unsupported cases, while the MatchRotate replaces an existing mechanism. Differential Revision: https://reviews.llvm.org/D35492 llvm-svn: 308598	2017-07-20 10:13:40 +00:00
Francis Visoiu Mistrih	185b2e3d32	Revert "[PEI] Simplify handling of targets with no phys regs. NFC" This reverts commit ce30ab6e5598f3c24f59ad016dc9526bc9a1d450. sanitizer-ppc64le-linux seems to segfault when testing the sanitizers. llvm-svn: 308581	2017-07-20 02:47:05 +00:00
Francis Visoiu Mistrih	b3ddc1686b	Revert "[PEI] Separate saving and restoring CSRs into different functions. NFC" This reverts commit 540f6a26ae932469804a379ce9a8cbe715d59c23. sanitizer-ppc64le-linux seems to segfault when testing the sanitizers. llvm-svn: 308580	2017-07-20 02:47:04 +00:00
Francis Visoiu Mistrih	303e5df4e2	[PEI] Separate saving and restoring CSRs into different functions. NFC Split insertCSRSpillsAndRestores into insertCSRSaves + insertCSRRestores. This is mostly useful for future shrink-wrapping improvements where we want to save / restore a specific part of the CSRs in a specific block. Differential Revision: https://reviews.llvm.org/D35644 llvm-svn: 308573	2017-07-20 00:58:37 +00:00
Matt Arsenault	d62fe83005	Replace -print-whole-regmask with a threshold. The previous flag/default of printing everything is not helpful when there are thousands of registers in the mask. llvm-svn: 308572	2017-07-20 00:37:31 +00:00
Francis Visoiu Mistrih	ede08ef314	Revert "[PEI] Separate saving and restoring CSRs into different functions. NFC" This reverts commit a84d1fa6847e70ebf63594d41a00b473c941bd72. llvm-svn: 308562	2017-07-20 00:08:02 +00:00
Francis Visoiu Mistrih	9b97a31870	[AsmPrinter] Constify needsCFIMoves. NFC llvm-svn: 308557	2017-07-19 23:47:33 +00:00
Francis Visoiu Mistrih	52042aa21e	[PEI] Add basic opt-remarks support Add optimization remarks support to the PrologueEpilogueInserter. For now, emit the stack size as an analysis remark, but more additions wrt shrink-wrapping may be added. https://reviews.llvm.org/D35645 llvm-svn: 308556	2017-07-19 23:47:32 +00:00
Francis Visoiu Mistrih	a1f21bca46	[PEI] Simplify handling of targets with no phys regs. NFC Make doSpillCalleeSavedRegs a member function, instead of passing most of the members of PEI as arguments. Differential Revision: https://reviews.llvm.org/D35642 llvm-svn: 308555	2017-07-19 23:47:32 +00:00
Francis Visoiu Mistrih	3b7bbdbdd5	[PEI] Separate saving and restoring CSRs into different functions. NFC Split insertCSRSpillsAndRestores into insertCSRSaves + insertCSRRestores. This is mostly useful for future shrink-wrapping improvements where we want to save / restore a specific part of the CSRs in a specific block. Differential Revision: https://reviews.llvm.org/D35644 llvm-svn: 308554	2017-07-19 23:47:31 +00:00
Derek Schuff	36454afab5	Move Runtime libcall definitions to a .def file This will allow eliminating the duplication of the names, and allow adding extra information such as signatures in a future commit. Differential Revision: https://reviews.llvm.org/D35522 llvm-svn: 308531	2017-07-19 21:53:30 +00:00
Wolfgang Pieb	e018bbd835	Fixing an issue with the initialization of LexicalScopes objects when mixing debug and non-debug units. Patch by Andrea DiBiagio. Differential Revision: https://reviews.llvm.org/D35637 llvm-svn: 308513	2017-07-19 19:36:40 +00:00
Simon Pilgrim	c77e262260	{DAGCombine] Convert (Val & Mask) == Mask to Mask.isSubsetof(Val). NFCI. llvm-svn: 308460	2017-07-19 13:39:58 +00:00
Serguei Katkov	4ea855ebe5	[CGP] Allow cycles during Phi traversal in OptimizaMemoryInst Allowing cycles in Phi traversal increases the scope of optimize memory instruction in case we are in loop. The added test shows an example of enabling optimization inside a loop. Reviewers: loladiro, spatel, efriedma Reviewed By: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35294 llvm-svn: 308419	2017-07-19 04:49:17 +00:00
Adrian Prantl	d63bfd218b	Debug Info: Add a file: field to DIImportedEntity. DIImportedEntity has a line number, but not a file field. To determine the decl_line/decl_file we combine the line number from the DIImportedEntity with the file from the DIImportedEntity's scope. This does not work correctly when the parent scope is a DINamespace or a DIModule, both of which do not have a source file. This patch adds a file field to DIImportedEntity to unambiguously identify the source location of the using/import declaration. Most testcase updates are mechanical, the interesting one is the removal of the FIXME in test/DebugInfo/Generic/namespace.ll. This fixes PR33822. See https://bugs.llvm.org/show_bug.cgi?id=33822 for more context. <rdar://problem/33357889> https://bugs.llvm.org/show_bug.cgi?id=33822 Differential Revision: https://reviews.llvm.org/D35583 llvm-svn: 308398	2017-07-19 00:09:54 +00:00
Nirav Dave	d839749ae8	[DAG] Improve Aliasing of operations to static alloca Re-recommiting after landing DAG extension-crash fix. Recommiting after adding check to avoid miscomputing alias information on addresses of the same base but different subindices. Memory accesses offset from frame indices may alias, e.g., we may merge write from function arguments passed on the stack when they are contiguous. As a result, when checking aliasing, we consider the underlying frame index's offset from the stack pointer. Static allocs are realized as stack objects in SelectionDAG, but its offset is not set until post-DAG causing DAGCombiner's alias check to consider access to static allocas to frequently alias. Modify isAlias to consider access between static allocas and access from other frame objects to be considered aliasing. Many test changes are included here. Most are fixes for tests which indirectly relied on our aliasing ability and needed to be modified to preserve their original intent. The remaining tests have minor improvements due to relaxed ordering. The exception is CodeGen/X86/2011-10-19-widen_vselect.ll which has a minor degradation dispite though the pre-legalized DAG is improved. Reviewers: rnk, mkuper, jonpa, hfinkel, uweigand Reviewed By: rnk Subscribers: sdardis, nemanjai, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33345 llvm-svn: 308350	2017-07-18 20:06:24 +00:00
Nirav Dave	041b87758a	[DAG] Reverse node replacement in extension operation. NFCI. Reorder replacements to be user first in preparation for multi-level folding to premptively avoid inadvertantly deleting later nodes from sharing found from replacement. llvm-svn: 308348	2017-07-18 19:49:20 +00:00
Nirav Dave	07871007aa	[DAG] Avoid deleting nodes before combining them. When replacing a node and it's operand, replacing the operand node may cause the deletion of the original node leading to an assertion failure. Case around these replacements to avoid this without relying on inspecting the DELETED_NODE opcode in various extend dagcombiner cases. Fixes PR32515. Reviewers: dbabokin, RKSimon, davide, chandlerc Subscribers: chandlerc, llvm-commits Differential Revision: https://reviews.llvm.org/D34095 llvm-svn: 308330	2017-07-18 17:39:15 +00:00
Nirav Dave	f87c8e82f6	[DAG] Allow base element type of store merge type to also be a vector. Correctly calculate merged vector size if MemVT is already a vector. llvm-svn: 308312	2017-07-18 14:39:09 +00:00
Simon Pilgrim	4793a11df9	[DAGCombine] Fix issue with out of bound constant rotation (PR33828) Take the modulo of rotations by a constant greater than or equal to the bit-width llvm-svn: 308302	2017-07-18 12:31:46 +00:00
Diana Picus	df4100b3d2	GlobalISel: Support G_(S\|U)REM widening in LegalizerHelper Treat widening G_SREM and G_UREM the same as G_SDIV and G_UDIV. This is going to be used in the ARM backend (and that's when the test will come too). llvm-svn: 308278	2017-07-18 09:08:47 +00:00
Chandler Carruth	a15e080b05	Revert r308025 due to uncovering a crash in SelectionDAG. This is filed with a minimal test case in http://llvm.org/PR33833. Original commit message: Improve Aliasing of operations to static alloca llvm-svn: 308271	2017-07-18 07:53:47 +00:00
Serguei Katkov	a6fba3d69f	[CGP] Cleanup - remove redundant code in OptimizeMemoryInst. NFC optimizeMemoryInst contains a vector AddrModeInsts. The only use of this vector is to check that all instructions are in the same block as memory instruction. This check is guarded by PhiSeen flag, so if we traversed through phi node then we do not need to keep information in AddrModeInsts. AddModeInsts is set first time we found some addressing mode and updated if we found new one later. We can find next addressing mode only if we traverse phi node so all code related to update of AddModeInsts can be safely removed. Reviewers: loladiro, spatel, efriedma Reviewed By: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35291 llvm-svn: 308265	2017-07-18 05:16:38 +00:00
Andrew Zhogin	67a64041b9	[DAGCombiner] Recognise vector rotations with non-splat constants Fixes PR33691. Differential revision: https://reviews.llvm.org/D35381 llvm-svn: 308150	2017-07-16 23:11:45 +00:00
Simon Pilgrim	e7a2e6bdf1	Strip trailing whitespace. NFCI llvm-svn: 308108	2017-07-15 19:29:19 +00:00
Haicheng Wu	abdef9ee7e	[TTI] Refine the cost of EXT in getUserCost() Now, getUserCost() only checks the src and dst types of EXT to decide it is free or not. This change first checks the types, then calls isExtFreeImpl(), and check if EXT can form ExtLoad at last. Currently, only AArch64 has customized implementation of isExtFreeImpl() to check if EXT can be folded into its use. Differential Revision: https://reviews.llvm.org/D34458 llvm-svn: 308076	2017-07-15 02:12:16 +00:00
Dimitry Andric	e4b97459f1	Fix mixed line terminators. NFC. llvm-svn: 308052	2017-07-14 21:14:58 +00:00
Jakub Kuderski	b292c22c8d	[Dominators] Make IsPostDominator a template parameter Summary: DominatorTreeBase used to have IsPostDominators (bool) member to indicate if the tree is a dominator or a postdominator tree. This made it possible to switch between the two 'modes' at runtime, but it isn't used in practice anywhere. This patch makes IsPostDominator a template argument. This way, it is easier to switch between different algorithms at compile-time based on this argument and design external utilities around it. It also makes it impossible to incidentally assign a postdominator tree to a dominator tree (and vice versa), and to further simplify template code in GenericDominatorTreeConstruction. Reviewers: dberlin, sanjoy, davide, grosser Reviewed By: dberlin Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D35315 llvm-svn: 308040	2017-07-14 18:26:09 +00:00
Nirav Dave	a8f63af9d1	Improve Aliasing of operations to static alloca Recommiting after adding check to avoid miscomputing alias information on addresses of the same base but different subindices. Memory accesses offset from frame indices may alias, e.g., we may merge write from function arguments passed on the stack when they are contiguous. As a result, when checking aliasing, we consider the underlying frame index's offset from the stack pointer. Static allocs are realized as stack objects in SelectionDAG, but its offset is not set until post-DAG causing DAGCombiner's alias check to consider access to static allocas to frequently alias. Modify isAlias to consider access between static allocas and access from other frame objects to be considered aliasing. Many test changes are included here. Most are fixes for tests which indirectly relied on our aliasing ability and needed to be modified to preserve their original intent. The remaining tests have minor improvements due to relaxed ordering. The exception is CodeGen/X86/2011-10-19-widen_vselect.ll which has a minor degradation dispite though the pre-legalized DAG is improved. Reviewers: rnk, mkuper, jonpa, hfinkel, uweigand Reviewed By: rnk Subscribers: sdardis, nemanjai, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D33345 llvm-svn: 308025	2017-07-14 13:56:21 +00:00
Jakub Kuderski	1d2dc681b1	[NFC] Move DEBUG_TYPE macro below includes... in MachineCombiner.cpp. llvm-svn: 307940	2017-07-13 19:30:52 +00:00
Simon Dardis	250256f9c9	Reland "[mips] Fix multiprecision arithmetic." For multiprecision arithmetic on MIPS, rather than using ISD::ADDE / ISD::ADDC, get SelectionDAG to break down the operation into ISD::ADDs and ISD::SETCCs. For MIPS, only the DSP ASE has a carry flag, so in the general case it is not useful to directly support ISD::{ADDE, ADDC, SUBE, SUBC} nodes. Also improve the generation code in such cases for targets with TargetLoweringBase::ZeroOrOneBooleanContent by directly using the result of the comparison node rather than using it in selects. Similarly for ISD::SUBE / ISD::SUBC. Address optimization breakage by moving the generation of MIPS specific integer multiply-accumulate nodes to before legalization. This revolves PR32713 and PR33424. Thanks to Simonas Kazlauskas and Pirama Arumuga Nainar for reporting the issue! Reviewers: slthakur Differential Revision: https://reviews.llvm.org/D33494 The previous version of this patch was too aggressive in producing fused integer multiple-addition instructions. llvm-svn: 307906	2017-07-13 11:28:05 +00:00
Simon Pilgrim	bb85cb16e3	[DAGCombiner] Fix issue with rotate combines asserting if the constant value types differ from the result type. llvm-svn: 307900	2017-07-13 10:41:49 +00:00
Simon Pilgrim	2dc42b7202	Use isNullConstantOrNullSplatConstant helper. NFCI. llvm-svn: 307895	2017-07-13 09:39:00 +00:00
Hiroshi Inoue	e9dea6e613	fix typos in comments and error messges; NFC llvm-svn: 307885	2017-07-13 06:48:39 +00:00
Geoff Berry	bea2e188e9	[TargetLowering] Add hook for adding target MMO flags when doing ISel. Summary: Add TargetLowering hook getMMOFlags() to add target specific MMO flags to load/store instructions created by ISel. Reviewers: bogner, hfinkel, qcolombet, MatzeB Subscribers: mcrosier, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D34962 llvm-svn: 307879	2017-07-13 03:49:42 +00:00
Geoff Berry	6748abe24d	[MIR] Add support for printing and parsing target MMO flags Summary: Add target hooks for printing and parsing target MMO flags. Targets may override getSerializableMachineMemOperandTargetFlags() to return a mapping from string to flag value for target MMO values that should be serialized/parsed in MIR output. Add implementation of this hook for AArch64 SuppressPair MMO flag. Reviewers: bogner, hfinkel, qcolombet, MatzeB Subscribers: mcrosier, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D34962 llvm-svn: 307877	2017-07-13 02:28:54 +00:00
Eli Friedman	6f7c9ad7d4	[CodeGenPrepare] Don't create dead instructions in addrmode sinking When we fail to sink an instruction, we must make sure not to modify the function; otherwise, we end up in an infinite loop because CodeGenPrepare iterates until it doesn't make any changes. Fixes https://bugs.llvm.org/show_bug.cgi?id=33608 . llvm-svn: 307866	2017-07-12 23:30:02 +00:00
Gerolf Hoflehner	3f164318e7	[SjLj] Replace recursive block marking algorithm with iterative algorithm Summary: Some programs run into a stack overflow issue. This change avoids this problem by replacing the recursive algorithm with the iterative version. Reviewers: MatzeB, t.p.northover, dblaikie Reviewed By: MatzeB Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35105 llvm-svn: 307860	2017-07-12 23:05:15 +00:00
Daniel Neilson	965613ef1b	Add element atomic memset intrinsic Summary: Continuing the work from https://reviews.llvm.org/D33240, this change introduces an element unordered-atomic memset intrinsic. This intrinsic is essentially memset with the implementation requirement that all stores used for the assignment are done with unordered-atomic stores of a given element size. Reviewers: eli.friedman, reames, mkazantsev, skatkov Reviewed By: reames Subscribers: jfb, dschuff, sbc100, jgravelle-google, aheejin, efriedma, llvm-commits Differential Revision: https://reviews.llvm.org/D34885 llvm-svn: 307854	2017-07-12 21:57:23 +00:00
Sam Clegg	fd5ab25ae1	Remove unneeded use of #undef DEBUG_TYPE. NFC Where is is needed (at the end of headers that define it), be consistent about its use. Also fix a few header guards that I found in the process. Differential Revision: https://reviews.llvm.org/D34916 llvm-svn: 307840	2017-07-12 20:49:21 +00:00
Evandro Menezes	14ba3d7730	[CodeGen] Add dependency printer Add SDep printer to make debugging sessions more productive. Differential revision: https://reviews.llvm.org/D35144 llvm-svn: 307799	2017-07-12 15:30:59 +00:00
Daniel Neilson	57226ef33c	Add element atomic memmove intrinsic Summary: Continuing the work from https://reviews.llvm.org/D33240, this change introduces an element unordered-atomic memmove intrinsic. This intrinsic is essentially memmove with the implementation requirement that all loads/stores used for the copy are done with unordered-atomic loads/stores of a given element size. Reviewers: eli.friedman, reames, mkazantsev, skatkov Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34884 llvm-svn: 307796	2017-07-12 15:25:26 +00:00
Konstantin Zhuravlyov	bb80d3e1d3	Enhance synchscope representation OpenCL 2.0 introduces the notion of memory scopes in atomic operations to global and local memory. These scopes restrict how synchronization is achieved, which can result in improved performance. This change extends existing notion of synchronization scopes in LLVM to support arbitrary scopes expressed as target-specific strings, in addition to the already defined scopes (single thread, system). The LLVM IR and MIR syntax for expressing synchronization scopes has changed to use syncscope("<scope>"), where <scope> can be "singlethread" (this replaces singlethread keyword), or a target-specific name. As before, if the scope is not specified, it defaults to CrossThread/System scope. Implementation details: - Mapping from synchronization scope name/string to synchronization scope id is stored in LLVM context; - CrossThread/System and SingleThread scopes are pre-defined to efficiently check for known scopes without comparing strings; - Synchronization scope names are stored in SYNC_SCOPE_NAMES_BLOCK in the bitcode. Differential Revision: https://reviews.llvm.org/D21723 llvm-svn: 307722	2017-07-11 22:23:00 +00:00
Evandro Menezes	0cd23f5642	[CodeGen] Rename DEBUG_TYPE to match passnames Rename missing DEBUG_TYPE "machine-scheduler" from backend files, which were absent from https://reviews.llvm.org/rL303921. Differential revision: https://reviews.llvm.org/D35231 llvm-svn: 307719	2017-07-11 22:08:28 +00:00
Serguei Katkov	0e831c996c	Revert Revert [MBP] do not rotate loop if it creates extra branch This is a second attempt to land this patch. The first one resulted in a crash of clang sanitizer buildbot. The fix is here and regression test is added. This is a last fix for the corner case of PR32214. Actually this is not really corner case in general. We should not do a loop rotation if we create an additional branch due to it. Consider the case where we have a loop chain H, M, B, C , where H is header with viable fallthrough from pre-header and exit from the loop M - some middle block B - backedge to Header but with exit from the loop also. C - some cold block of the loop. Let's H is determined as a best exit. If we do a loop rotation M, B, C, H we can introduce the extra branch. Let's compute the change in number of branches: +1 branch from pre-header to header -1 branch from header to exit +1 branch from header to middle block if there is such -1 branch from cold bock to header if there is one So if C is not a predecessor of H then we introduce extra branch. This change actually prohibits rotation of the loop if both true Best Exit has next element in chain as successor. Last element in chain is not a predecessor of first element of chain. Reviewers: iteratee, xur, sammccall, chandlerc Reviewed By: iteratee Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34745 llvm-svn: 307631	2017-07-11 08:34:58 +00:00
Serguei Katkov	0b7b59ada3	[CGP] Relax a bit restriction for optimizeMemoryInst to extend scope CodeGenPrepare::optimizeMemoryInst contains a check that we do nothing if all instructions combining the address for memory instruction is in the same block as memory instruction itself. However if any of these instruction are placed after memory instruction then address calculation will not be folded to memory instruction. The added test case shows an example. Reviewers: loladiro, spatel, efriedma Reviewed By: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34862 llvm-svn: 307628	2017-07-11 06:24:44 +00:00

1 2 3 4 5 ...

23090 Commits