llvm-project

Commit Graph

Author	SHA1	Message	Date
Eric Astor	a3ec4a3158	[ms] [llvm-ml] Allow use of locally-defined variables in expressions MASM allows variables defined by equate statements to be used in expressions. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D86946	2020-09-07 14:00:14 -04:00
Eric Astor	2feb6e9b84	[ms] [llvm-ml] Fix STRUCT field alignment MASM aligns fields to the _minimum_ of the STRUCT alignment value and the size of the next field. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D86945	2020-09-07 13:58:59 -04:00
Eric Astor	e52e7ad54d	[ms] [llvm-ml] Add support for bitwise named operators (AND, NOT, OR) in MASM Add support for expressions of the form '1 or 2', etc. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D86944	2020-09-07 13:57:54 -04:00
Simon Pilgrim	5ea9e655ef	VPlan.h - remove unnecessary forward declarations. NFCI. Already defined in includes.	2020-09-07 18:35:06 +01:00
Simon Pilgrim	4e89a0ab02	MipsISelLowering.h - remove CCState/CCValAssign forward declarations. NFCI. These are already defined in the CallingConvLower.h include.	2020-09-07 18:15:26 +01:00
Simon Pilgrim	95ca3aacf0	BTFDebug.h - reduce MachineInstr.h include to forward declaration. NFCI.	2020-09-07 17:51:13 +01:00
Simon Pilgrim	dfc333050b	LeonPasses.h - remove unnecessary includes. NFCI. Reduce to forward declarations and move includes to LeonPasses.cpp where necessary.	2020-09-07 17:51:12 +01:00
Simon Pilgrim	1c34ac03a2	LeonPasses.h - remove orphan function declarations. NFCI. The implementations no longer exist.	2020-09-07 17:51:12 +01:00
Sanjay Patel	7a6d6f0f70	[InstCombine] improve folds for icmp with multiply operands (PR47432) Check for no overflow along with an odd constant before we lose information by converting to bitwise logic. https://rise4fun.com/Alive/2Xl Pre: C1 != 0 %mx = mul nsw i8 %x, C1 %my = mul nsw i8 %y, C1 %r = icmp eq i8 %mx, %my => %r = icmp eq i8 %x, %y Name: nuw ne Pre: C1 != 0 %mx = mul nuw i8 %x, C1 %my = mul nuw i8 %y, C1 %r = icmp ne i8 %mx, %my => %r = icmp ne i8 %x, %y Name: odd ne Pre: C1 % 2 != 0 %mx = mul i8 %x, C1 %my = mul i8 %y, C1 %r = icmp ne i8 %mx, %my => %r = icmp ne i8 %x, %y	2020-09-07 12:40:37 -04:00
alex-t	2480a31e5d	[AMDGPU] SILowerControlFlow::optimizeEndCF should remove empty basic block optimizeEndCF removes the EXEC-restoring instruction in case this instruction is the only one except the branch to the single successor, and that successor contains an EXEC mask restoring instruction that was lowered from END_CF belonging to IF_ELSE. As a result of this optimization we get a basic block whose only instruction is a branch to the single successor. In case the control flow can reach such an empty block from S_CBRANCH_EXEZ/EXECNZ, it might happen that spill/reload instructions that were inserted later by the register allocator are placed under the exec == 0 condition and never execute. Removing the empty block solves the problem. This change requires further work to re-implement LIS updates. Currently, LIS is always nullptr in this pass. To enable it we need another patch to fix many places across the codegen. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D86634	2020-09-07 19:37:27 +03:00
Momchil Velikov	eb482afaf5	Reduce the number of memory allocations when displaying a warning about clobbering reserved registers (NFC). Also address some minor inefficiencies and style issues. Differential Revision: https://reviews.llvm.org/D86088	2020-09-07 17:04:00 +01:00
Simon Pilgrim	9de0a3da6a	[X86][SSE] Don't use LowerVSETCCWithSUBUS for unsigned compare with +ve operands (PR47448) We already simplify the unsigned comparisons if we've found the operands are non-negative, but we were still calling LowerVSETCCWithSUBUS which resulted in the PR47448 regressions.	2020-09-07 16:11:40 +01:00
Simon Pilgrim	60162626a5	[X86] Replace UpgradeX86AddSubSatIntrinsics with UpgradeX86BinaryIntrinsics generic helper. NFCI. Feed the Intrinsic::ID value directly instead of via the IsSigned/IsAddition bool flags.	2020-09-07 15:57:18 +01:00
Sanjay Patel	b22910daab	[InstCombine] erase instructions leading up to unreachable Normal dead code elimination ignores assume intrinsics, so we fail to delete assumes that are not meaningful (and potentially worse if they cause conflicts with other assumptions). The motivating example in https://llvm.org/PR47416 suggests that we might have problems upstream from here (difference between C and C++), but this should be a cheap way to make sure we remove more dead code. Differential Revision: https://reviews.llvm.org/D87149	2020-09-07 10:44:08 -04:00
Simon Pilgrim	96e0f34be7	[X86] Auto upgrade SSE/AVX PABS intrinsics to generic Intrinsic::abs Minor followup to D87101, we were expanding this to a neg+icmp+select pattern like we were in CGBuiltin	2020-09-07 15:07:26 +01:00
Simon Pilgrim	6670f5d1e6	MachineStableHash.h - remove MachineInstr.h include. NFC. Use forward declarations and move the include to MachineStableHash.cpp	2020-09-07 13:33:48 +01:00
Simon Wallis	79ea83e104	[SelectionDAG] memcpy expansion of const volatile struct ignores const zero In getMemcpyLoadsAndStores(), a memcpy where the source is a zero constant is expanded to a MemOp::Set instead of a MemOp::Copy, even when the memcpy is volatile. This is incorrect. The fix is to add a check for volatile, and expand to MemOp::Copy in the volatile case. Reviewed By: chill Differential Revision: https://reviews.llvm.org/D87134	2020-09-07 13:22:09 +01:00
Sanjay Patel	3ca8b9a560	[InstCombine] give a name to an intermediate value for easier tracking; NFC As noted in PR47430, we probably want to conditionally include 'nsw' here anyway, so we are going to need to fill out the optional args.	2020-09-07 08:19:42 -04:00
Simon Pilgrim	e57cbcbdc1	LegalizeTypes.h - remove orphan SplitVSETCC declaration. NFCI. The implementation no longer exists	2020-09-07 13:11:49 +01:00
Simon Pilgrim	5bb27e735d	X86AvoidStoreForwardingBlocks.cpp - use unsigned for Opcode values. NFCI. Fixes clang-tidy cppcoreguidelines-narrowing-conversions warnings.	2020-09-07 12:56:27 +01:00
Simon Pilgrim	9b645ebfff	[X86][AVX] Use lowerShuffleWithPERMV in shuffle combining to support non-VLX targets lowerShuffleWithPERMV allows us to use the ZMM variants for 128/256-bit variable shuffles on non-VLX AVX512 targets. This is another step towards shuffle combining through between vector widths - we still end up with an annoying regression (combine_vpermilvar_vperm2f128_zero_8f32) but we're going in the right direction....	2020-09-07 12:50:50 +01:00
Sam Parker	928c4b4b49	[SCEV] Refactor isHighCostExpansionHelper To enable the cost of constants, the helper function has been reorganised: - A struct has been introduced to hold SCEV operand information so that we know the user of the operand, as well as the operand index. The Worklist now uses this struct instead of a bare SCEV. - The costing of each SCEV, and collection of its operands, is now performed in a helper function. Differential Revision: https://reviews.llvm.org/D86050	2020-09-07 11:57:46 +01:00
Benjamin Kramer	7ba0f81934	[X86] Unbreak the build after `22fa6b20d9`	2020-09-07 12:24:30 +02:00
Simon Pilgrim	71dfdbe2c7	[X86] getFauxShuffleMask - handle insert_subvector(zero, sub, C) Directly use SM_SentinelZero elements if we're (widening)inserting into a zero vector.	2020-09-07 11:10:40 +01:00
Simon Pilgrim	9ad261540d	[X86] Use Register instead of unsigned. NFCI. Fixes llvm-prefer-register-over-unsigned clang-tidy warnings.	2020-09-07 10:49:29 +01:00
Simon Pilgrim	22fa6b20d9	[X86] Use Register instead of unsigned. NFCI. Fixes llvm-prefer-register-over-unsigned clang-tidy warnings.	2020-09-07 10:38:09 +01:00
Simon Pilgrim	0dbe2504af	[X86] Use Register instead of unsigned. NFCI. Fixes llvm-prefer-register-over-unsigned clang-tidy warning.	2020-09-07 10:38:08 +01:00
Sam Parker	65f78e73ad	[SimplifyCFG] Consider cost of combining predicates. Modify FoldBranchToCommonDest to consider the cost of inserting instructions when attempting to combine predicates to fold blocks. The threshold can be controlled via a new option: -simplifycfg-branch-fold-threshold which defaults to '2' to allow the insertion of a not and another logical operator. Differential Revision: https://reviews.llvm.org/D86526	2020-09-07 10:04:50 +01:00
Jay Foad	713c2ad60c	[GlobalISel] Extend not_cmp_fold to work on conditional expressions Differential Revision: https://reviews.llvm.org/D86709	2020-09-07 09:31:08 +01:00
Sam Parker	0af4147804	[ARM][CostModel] CodeSize costs for i1 arith ops When optimising for size, make the cost of i1 logical operations relatively expensive so that optimisations don't try to combine predicates. Differential Revision: https://reviews.llvm.org/D86525	2020-09-07 09:27:18 +01:00
Xing GUO	40f4131fce	[DWARFYAML] Make the debug_addr section optional. This patch makes the debug_addr section optional. When an empty debug_addr section is specified, yaml2obj only emits a section header for it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D87205	2020-09-07 16:17:18 +08:00
Jay Foad	5350e1b509	[KnownBits] Implement accurate unsigned and signed max and min Use the new implementation in ValueTracking, SelectionDAG and GlobalISel. Differential Revision: https://reviews.llvm.org/D87034	2020-09-07 09:09:01 +01:00
dongAxis	1fd7dc4074	When dumping results of StackLifetime, it will print the following log: BB [7, 8): begin {}, end {}, livein {}, liveout {} BB [1, 2): begin {}, end {}, livein {}, liveout {} ... But it is not convenient to know what the basic block is. So I add the basic block name to it. Reviewed By: vitalybuka TestPlan: check-llvm Differential Revision: https://reviews.llvm.org/D87152	2020-09-07 11:43:16 +08:00
Zi Xuan Wu	69f2c79f2a	[ELF] Add a new e_machine value EM_CSKY and add some CSKY relocation types This is the split part of D86269, which add a new ELF machine flag called EM_CSKY and related relocations. Some target-specific flags and tests for csky can be added in follow-up patches later. Differential Revision: https://reviews.llvm.org/D86610	2020-09-07 10:42:28 +08:00
Thomas Lively	caee15a0ed	[WebAssembly] Fix incorrect assumption of simple value types Fixes PR47375, in which an assertion was triggering because WebAssemblyTargetLowering::isVectorLoadExtDesirable was improperly assuming the use of simple value types. Differential Revision: https://reviews.llvm.org/D87110	2020-09-06 15:42:21 -07:00
Amy Kwan	efa57f9a7a	[PowerPC] Implement Vector Expand Mask builtins in LLVM/Clang This patch implements the vec_expandm function prototypes in altivec.h in order to utilize the vector expand with mask instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D82727	2020-09-06 17:13:21 -05:00
Nikita Popov	b536cbaac5	[ValueTracking] Avoid known bits fallback for non-zero get check (NFCI) The known bits fall back will never be able to infer a non-null value here, so don't bother.	2020-09-06 23:16:38 +02:00
Florian Hahn	16bb71fd4f	[DSE,MemorySSA] Add a few additional debug messages.	2020-09-06 20:31:00 +01:00
Benjamin Kramer	8c386c9474	[SmallVector] Move error handling out of line This reduces duplication and avoids emitting ice cold code into every instance of grow().	2020-09-06 18:06:44 +02:00
Simon Pilgrim	ecac5c2808	[X86][AVX] lowerShuffleWithPERMV - adjust binary shuffle masks to account for widening on non-VLX targets rGabd33bf5eff2 enabled us to pad 128/256-bit shuffles to 512-bit on non-VLX targets, but wasn't updating binary shuffles to account for the new vector width.	2020-09-06 14:52:25 +01:00
Nikita Popov	ff218cbc84	[InstSimplify] Fold degenerate abs of abs form This addresses the remaining issue from D87188. Due to a series of folds, we may end up with abs-of-abs represented as x == 0 ? -abs(x) : abs(x). Rather than recognizing this as a special abs pattern and doing an abs-of-abs fold on it afterwards, I'm directly folding this to one of the select operands in InstSimplify. The general pattern falls into the "select with operand replaced" category, but that fold is not powerful enough to recognize that both hands of the select are the same for value zero. Differential Revision: https://reviews.llvm.org/D87197	2020-09-06 09:43:08 +02:00
Amara Emerson	d0abc75749	[GlobalISel] Disable the indexed loads combine completely unless forced. NFC. The post-index matcher, before it queries the target legality, walks uses of some instructions which in pathological cases can be massive. Since no targets actually support indexed loads yet, disable this to stop wasting compile time on something which is going to fail anyway.	2020-09-05 21:04:03 -07:00
vnalamot	aff94ec0f4	[AMDGPU] Remove the dead spill slots while spilling FP/BP to memory During the PEI pass, the dead TargetStackID::SGPRSpill spill slots are not being removed while spilling the FP/BP to memory. Fixes: SWDEV-250393 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D87032	2020-09-06 07:04:25 +05:30
Krzysztof Parzyszek	62f89a89f3	[Hexagon] Add assertions about V6_pred_scalar2	2020-09-05 18:20:23 -05:00
Krzysztof Parzyszek	9518f032e4	[Hexagon] When widening truncate result, also widen operand if necessary	2020-09-05 18:19:32 -05:00
Krzysztof Parzyszek	8789f2bbde	[Hexagon] Resize the mem operand when widening loads and stores	2020-09-05 18:17:48 -05:00
Krzysztof Parzyszek	1387f96ab3	[Hexagon] Handle widening of vector truncate	2020-09-05 15:07:38 -05:00
Florian Hahn	1ddb3a369f	[LangRef] Adjust guarantee for llvm.memcpy to also allow equal arguments. This adjusts the description of `llvm.memcpy` to also allow operands to be equal. This is in line with what Clang currently expects. This change is intended to be temporary and followed by re-introducing a variant with the non-overlapping guarantee for cases where we can actually ensure that property in the front-end. See the links below for more details: http://lists.llvm.org/pipermail/cfe-dev/2020-August/066614.html and PR11763. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D86815	2020-09-05 19:18:23 +01:00
Nikita Popov	ac87480bd8	[SCEV] Recognize min/max intrinsics Recognize umin/umax/smin/smax intrinsics and convert them to the already existing SCEV nodes of the same name. In the future we'll want SCEVExpander to also produce the intrinsics, but we're not ready for that yet. Differential Revision: https://reviews.llvm.org/D87160	2020-09-05 16:30:11 +02:00
Nikita Popov	4892d3a198	[InstCombine] Fold abs with dominating condition Similar to D87168, but for abs. If we have a dominating x >= 0 condition, then we know that abs(x) is x. This fold is in InstCombine, because we need to create a sub instruction for the x < 0 case. Differential Revision: https://reviews.llvm.org/D87184	2020-09-05 16:18:35 +02:00
Nikita Popov	73104b0751	[InstSimplify] Fold min/max based on dominating condition If we have a dominating condition that x >= y, then umax(x, y) is x, etc. I'm doing this in InstSimplify as the corresponding transform for the select form is also done there. Differential Revision: https://reviews.llvm.org/D87168	2020-09-05 16:16:40 +02:00
Nikita Popov	ada8a17d94	[InstCombine] Fold abs intrinsic eq zero Following the same transform for the select version of abs.	2020-09-05 15:11:38 +02:00
Nikita Popov	58b28fa7a2	[InstCombine] Fold mul of abs intrinsic Same as the existing SPF_ABS fold. We don't need to explicitly handle NABS, as the negs will get folded away first.	2020-09-05 12:37:45 +02:00
Nikita Popov	10cb23c6ca	[InstCombine] Fold cttz of abs intrinsic Same as the existing fold for SPF_ABS. We don't need to explicitly handle the NABS variant, as we'll first fold away the neg in that case.	2020-09-05 12:25:41 +02:00
Jonas Paulsson	714ceefad9	[SelectionDAG] Always intersect SDNode flags during getNode() node memoization. Previously SDNodeFlags::intersectWith(Flags) would do nothing if Flags was in an undefined state, which is very bad given that this is the default when getNode() is called without passing an explicit SDNodeFlags argument. This meant that if an already existing and reused node had a flag which the second caller to getNode() did not set, that flag would remain uncleared. This was exposed by https://bugs.llvm.org/show_bug.cgi?id=47092, where an NSW flag was incorrectly set on an add instruction (which did in fact overflow in one of the two original contexts), so when SystemZElimCompare removed the compare with 0 trusting that flag, wrong-code resulted. There is more that needs to be done in this area as discussed here: Differential Revision: https://reviews.llvm.org/D86871 Review: Ulrich Weigand, Sanjay Patel	2020-09-05 10:30:38 +02:00
serge-sans-paille	3a6f3fc160	Fix return status of SimplifyCFG When a switch case is folded into default's case, that's an IR change that should be reported, update ConstantFoldTerminator accordingly. Differential Revision: https://reviews.llvm.org/D87142	2020-09-05 07:54:15 +02:00
Qiu Chaofan	705271d9cd	[PowerPC] Expand constrained ppc_fp128 to i32 conversion Libcall __gcc_qtou is not available, which breaks some tests needing it. On PowerPC, we have code to manually expand the operation, this patch applies it to constrained conversion. To keep it strict-safe, it's using the algorithm similar to expandFP_TO_UINT. For constrained operations marking FP exception behavior as 'ignore', we should set the NoFPExcept flag. However, in some custom lowering the flag is missed. This should be fixed by future patches. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D86605	2020-09-05 13:16:20 +08:00
Krzysztof Parzyszek	89a4fe79d4	[Hexagon] Unindent everything in HexagonISelLowering.h, NFC Just a shift, no other formatting changes.	2020-09-04 17:25:29 -05:00
Craig Topper	35b35a373d	[X86] Prevent shuffle combining from creating an identical X86ISD::SHUF128. This can cause an infinite loop if SimplifiedDemandedElts asks for the node to replace itself. A similar protection exists in other places in shuffle combining. Fixes ISPC https://github.com/ispc/ispc/issues/1864	2020-09-04 14:12:49 -07:00
Fangrui Song	398ba37230	[LiveDebugVariables] Delete unneeded doInitialization	2020-09-04 13:27:42 -07:00
Lang Hames	3b64052a25	[ORC] Fix some bugs in TPCDynamicLibrarySearchGenerator, use in llvm-jitlink. TPCDynamicLibrarySearchGenerator was generating errors on missing symbols, but that doesn't fit the DefinitionGenerator contract: A symbol that isn't generated by a particular generator should not cause an error. This commit fixes the error by using SymbolLookupFlags::WeaklyReferencedSymbol for all elements of the lookup, and switches llvm-jitlink to use TPCDynamicLibrarySearchGenerator.	2020-09-04 13:23:52 -07:00
Steven Wu	97866b8de8	[ThinLTO][Legacy] Fix StringRef assertion from ThinLTO bots This is a presumed fix for the Firefox ThinLTO bot, which hits an assertion failure for an invalid index when accessing a StringRef. Technically, `IRName` in the symtab should not be an empty string for the entries we care about, but this will help to fix the bot before more information can be provided. Otherwise, NFCI.	2020-09-04 12:30:09 -07:00
Florian Hahn	00eb6fef08	[DSE,MemorySSA] Check for throwing instrs between killing/killed def. We also have to check all uses between the killing & killed def and check if any of them is throwing.	2020-09-04 18:54:59 +01:00
Wei Wang	4eef14f978	[OpenMPOpt] Assume indirect call always changes ICV When checking call sites, give special handling to indirect call, as the callee may be unknown and can lead to nullptr dereference later. Assume conservatively that the ICV always changes in such case. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D87104	2020-09-04 09:05:32 -07:00
Teresa Johnson	45c3560384	[HeapProf] Address post-review comments in instrumentation code Addresses post-review comments from D85948, which can be found here: https://reviews.llvm.org/rG7ed8124d46f9.	2020-09-04 08:59:00 -07:00
Muhammad Asif Manzoor	1ffcbe35ae	[AArch64][SVE] Add lowering for rounding operations Add the functionality to lower SVE rounding operations for passthru variant. Created a new test case file for all rounding operations. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D86793	2020-09-04 11:16:57 -04:00
Xing GUO	2966d6e80b	[MachOYAML] Allow handcrafting custom contents for DWARF sections. This patch enables users to handcraft custom contents for DWARF sections. If we specify the contents of DWARF sections both in the 'DWARF' entry and the 'content', yaml2obj will emit an error message. In addition, this patch helps remove the restriction that only the content of sections whose segname are __DWARF can be specified in the "DWARF" entry. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D87126	2020-09-04 22:42:45 +08:00
Florian Hahn	6bc5e866bd	[MemCpyOpt] Account for case that MemInsertPoint == BI. In that case, the new MemoryDef needs to be inserted before MemInsertPoint.	2020-09-04 14:04:08 +01:00
Simon Pilgrim	7582c5c023	CallingConvLower.h - remove unnecessary MachineFunction.h include. NFC. Reduce to forward declaration, add the Register.h include that we still needed, move CCState::ensureMaxAlignment into CallingConvLower.cpp as it was the only function that needed the full definition of MachineFunction. Fix a few implicit dependencies further down.	2020-09-04 12:16:48 +01:00
Simon Pilgrim	740625fecd	[X86] Make lowerShuffleAsLanePermuteAndPermute use sublanes on AVX2 Extends lowerShuffleAsLanePermuteAndPermute to search for opportunities to use vpermq (64-bit cross-lane shuffle) and vpermd (32-bit cross-lane shuffle) to get elements into the correct lane, in addition to the 128-bit full-lane permutes it previously searched for. This is especially helpful in cross-lane byte shuffles, where the alternative tends to be "vpshufb both lanes separately and blend them with a vpblendvb", which is very expensive, especially on Haswell where vpblendvb uses the same execution port as all the shuffles. Addresses PR47262 Patch By: @TellowKrinkle (TellowKrinkle) Differential Revision: https://reviews.llvm.org/D86429	2020-09-04 11:41:26 +01:00
David Green	294c0cc3eb	[ARM] Fold predicate_cast(load) into vldr p0 This adds a simple tablegen pattern for folding predicate_cast(load) into vldr p0, providing the alignment and offset are correct. Differential Revision: https://reviews.llvm.org/D86702	2020-09-04 11:29:59 +01:00
David Sherwood	73a3d350a4	[SVE][CodeGen] Fix up warnings in sve-split-insert/extract tests I have fixed up some more ElementCount/TypeSize related warnings in the following tests: CodeGen/AArch64/sve-split-extract-elt.ll CodeGen/AArch64/sve-split-insert-elt.ll In SelectionDAG::CreateStackTemporary we were relying upon the implicit cast from TypeSize -> uint64_t when calling MachineFrameInfo::CreateStackObject. I've fixed this by passing in the known minimum size instead, which I believe is fine because the associated stack id indicates whether this is a scalable object or not. I've also fixed up a case in TargetLowering::SimplifyDemandedBits when extracting a vector element from a scalable vector. The result is a scalar, hence it wasn't caught at the start of the function. If the vector is scalable we just bail out for now. Differential Revision: https://reviews.llvm.org/D86431	2020-09-04 09:51:31 +01:00
Florian Hahn	e2fc6a31d3	[MemCpyOpt] Preserve MemorySSA. This patch updates MemCpyOpt to preserve MemorySSA. It uses the MemoryDef at the insertion point of the builder and inserts the new def after that def. In some cases, we just modify a memory instruction. In that case, get the defining access, then remove the memory access and add a new one. If the defining access is in a different block, insert a new def at the beginning of the current block, otherwise after the defining access. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D86651	2020-09-04 09:05:33 +01:00
Matt Arsenault	3c2a7bd286	AMDGPU: Remove code to handle tied si_else operands This has not used tied operands for a long time.	2020-09-03 19:46:05 -04:00
Craig Topper	0851350557	[X86] Update stale comment. NFC The optimization in ExpandIntOp_UINT_TO_FP was removed in D72728 in January 2020.	2020-09-03 16:19:10 -07:00
Michael Liao	bf41c4d29e	[codegen] Ensure target flags are cleared/set properly. NFC. - When an operand is changed into an immediate value or like, ensure their target flags being cleared or set properly. Differential Revision: https://reviews.llvm.org/D87109	2020-09-03 18:37:39 -04:00
Puyan Lotfi	7fff1fbd3c	[MIRVRegNamer] Experimental MachineInstr stable hashing (Fowler-Noll-Vo) This hashing scheme has been useful out of tree, and I want to start experimenting with it. Specifically I want to experiment on the MIRVRegNamer, MIRCanonicalizer, and eventually the MachineOutliner. This diff is a first step, that optionally brings stable hashing to the MIRVRegNamer (and as a result, the MIRCanonicalizer). We've tested this hashing scheme on a lot of MachineOperand types that llvm::hash_value can not handle in a stable manner. This stable hashing was also the basis for "Global Machine Outliner for ThinLTO" in EuroLLVM 2020 http://llvm.org/devmtg/2020-04/talks.html#TechTalk_58 Credits: Kyungwoo Lee, Nikolai Tillmann Differential Revision: https://reviews.llvm.org/D86952	2020-09-03 16:13:09 -04:00
Arthur Eubanks	c9771391ce	[NewPM][Lint] Port -lint to NewPM This also changes -lint from an analysis to a pass. It's similar to -verify, and that is a normal pass, and lives in llvm/IR. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D87057	2020-09-03 13:03:44 -07:00
Jamie Schmeiser	b2e65cf950	Revert "Add new hidden option -print-changed which only reports changes to IR" This reverts commit `7bc9924cb2` due to failure caused by missing a space between trailing >>, required by some versions of C++.	2020-09-03 18:41:20 +00:00
Amy Huang	5fe33f7399	[DebugInfo] Make DWARF ignore sizes on forward declared class types. Make sure the sizes for forward declared classes aren't emitted in DWARF. This comes before https://reviews.llvm.org/D87062, which adds sizes to all classes with definitions. Bug: https://bugs.llvm.org/show_bug.cgi?id=47338 Differential Revision: https://reviews.llvm.org/D87070	2020-09-03 11:01:49 -07:00
Simon Pilgrim	1673a08044	SelectionDAG.h - remove unnecessary FunctionLoweringInfo.h include. NFCI. Use forward declarations and move the include down to dependent files that actually use it. This also exposes a number of implicit dependencies on KnownBits.h	2020-09-03 18:33:25 +01:00
Simon Pilgrim	83ca548fcb	WebAssemblyUtilities.h - reduce unnecessary includes to forward declarations. NFCI.	2020-09-03 17:43:35 +01:00
Simon Pilgrim	46780cc0ee	PHIEliminationUtils.cpp - remove unnecessary MachineBasicBlock.h include. NFCI. This is already included in PHIEliminationUtils.h	2020-09-03 17:43:34 +01:00
Simon Pilgrim	6731eb644a	Fix Wdocumentation trailing comments warnings. NFCI.	2020-09-03 17:43:34 +01:00
Simon Pilgrim	b196c7192f	Fix Wdocumentation warning. NFCI. Remove \returns tag from a void function	2020-09-03 17:43:34 +01:00
Sanjay Patel	2391a34f9f	[InstCombine] canonicalize all commutative intrinsics with constant arg	2020-09-03 12:42:04 -04:00
Jamie Schmeiser	7bc9924cb2	Add new hidden option -print-changed which only reports changes to IR A new hidden option -print-changed is added along with code to support printing the IR as it passes through the opt pipeline in the new pass manager. Only those passes that change the IR are reported, with others only having the banner reported, indicating that they did not change the IR, were filtered out or ignored. Filtering of output via the -filter-print-funcs is supported and a new supporting hidden option -filter-passes is added. The latter takes a comma separated list of pass names and filters the output to only show those passes in the list that change the IR. The output can also be modified via the -print-module-scope function. The code introduces a template base class that generalizes the comparison of IRs that takes an IR representation as template parameter. The constructor takes a series of lambdas that provide an event based API for generalized reporting of IRs as they are changed in the opt pipeline through the new pass manager. The first of several instantiations is provided that prints the IR in a form similar to that produced by -print-after-all with the above mentioned filtering capabilities. This version, and the others to follow will be introduced at the upcoming developer's conference. See https://hotcrp.llvm.org/usllvm2020/paper/29 for more information. Reviewed By: yrouban (Yevgeny Rouban) Differential Revision: https://reviews.llvm.org/D86360	2020-09-03 15:52:35 +00:00
Simon Pilgrim	898e42db93	GlobalISel/Utils.h - remove unused includes. NFCI. Twine is unused, and TargetLowering can be reduced to a forward declaration and moved to Utils.cpp	2020-09-03 15:59:12 +01:00
Simon Pilgrim	58afaecdc2	X86/X86TargetObjectFile.cpp - remove unused headers. NFCI.	2020-09-03 15:17:44 +01:00
Simon Pilgrim	0563cd6739	Fix spelling mistake. NFC.	2020-09-03 15:17:44 +01:00
Sanjay Patel	bdd5bfd0e4	[IR][GVN] add/allow commutative intrinsics with >2 args Follow-up to D86798 and rGe25449f.	2020-09-03 10:14:53 -04:00
Simon Pilgrim	91848b11b4	LowerEmuTLS.cpp - remove unused TargetLowering.h include. NFC. We only needed llvm/IR/Constants.h.	2020-09-03 14:40:09 +01:00
Simon Pilgrim	890707aa01	[X86] Avoid llvm-qualified-auto warning by not using auto. NFC. Try to consistently use the actual type name in the file.	2020-09-03 14:21:17 +01:00
Simon Pilgrim	23d9f4b958	[X86] Fix llvm-qualified-auto warning by using auto*. NFC.	2020-09-03 14:21:17 +01:00
Simon Pilgrim	5b29269744	[X86] Fix llvm-qualified-auto warning by using const auto*. NFC.	2020-09-03 14:21:17 +01:00
Florian Hahn	6de51189b0	[PassManager] Move load/store motion pass after DSE in LTO pipeline. As far as I am aware, the placement of MergedLoadStoreMotion in the pipeline is not heavily tuned currently. It seems to not matter much if we do it after DSE in the LTO pipeline (no binary changes for -O3 -flto on MultiSource/SPEC2000/SPEC2006). Moving it after DSE however has a major benefit: MemorySSA is constructed by LICM and is consumed by DSE, so if MergedLoadStoreMotion happens after DSE, we do not need to preserve MemorySSA in it. If there are any concerns with this move, I can also update MergedLoadStoreMotion to preserve MemorySSA. This patch together with D86651 (preserve MemSSA in MemCpyOpt) and D86534 (preserve MemSSA in GVN) are the remaining patches to bring down compile-time for DSE + MemorySSA to the levels outlined in http://lists.llvm.org/pipermail/llvm-dev/2020-August/144417.html Once they land, we should be able to start with flipping the switch on enabling DSE + MemorySSA. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D86967	2020-09-03 13:47:50 +01:00
Simon Pilgrim	e56edb801b	[X86][SSE] Fold select(X > -1, A, B) -> select(0 > X, B, A) (PR47404) Help PBLENDVB peek through to the sign bit source of the selection mask by swapping the select condition and inputs.	2020-09-03 13:02:08 +01:00
Ben Shi	c5716447c1	[NFC][RISCV] Simplify pass arg of RISCVMergeBaseOffsetOpt Reviewed By: lenary, asb Differential Revision: https://reviews.llvm.org/D87069	2020-09-03 20:01:23 +08:00
Florian Hahn	a344b382a0	[GVN] Preserve MemorySSA if it is available. Preserve MemorySSA if it is available before running GVN. DSE with MemorySSA will run closely after GVN. If GVN and 2 other passes preserve MemorySSA, DSE can re-use MemorySSA used by LICM when doing LTO. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D86534	2020-09-03 12:28:13 +01:00
David Green	245f846c4e	[MemCpyOptimizer] Change required analysis order for BasicAA/PhiValuesAnalysis This is a followup to `1ccfb52a61`, which made a number of changes including the apparently innocuous reordering of required passes in MemCpyOptimizer. This however altered the creation order of BasicAA vs Phi Values analysis, meaning BasicAA did not pick up PhiValues as a cached result. Instead if we require MemoryDependence first it will require PhiValuesAnalysis allowing BasicAA to use it for better results. I don't claim this is an excellent design, but it fixes a nasty little regression where a query later in JumpThreading was getting worse results. Differential Revision: https://reviews.llvm.org/D87027	2020-09-03 12:01:51 +01:00
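
Note on commit 7a6d6f0f70 ([InstCombine] improve folds for icmp with multiply operands): the message states that an equality compare of two products by the same constant can be reduced to a compare of the multiplicands when the constant is odd, or when the multiplies are `nuw` and the constant is non-zero. The following is a small brute-force sketch of why that holds for i8, written in Python rather than taken from the patch; the helper names (`mul_wrap`, `fold_is_sound_for_odd`, `fold_is_sound_for_nuw`, `find_counterexample`) are hypothetical and exist only for this illustration.

```python
BITS = 8
MASK = (1 << BITS) - 1  # i8 bit patterns are 0..255


def mul_wrap(x: int, c: int) -> int:
    """Multiply two i8 bit patterns with wrap-around, like a plain `mul i8`."""
    return (x * c) & MASK


def fold_is_sound_for_odd(c: int) -> bool:
    """True if x -> x*c (mod 2^8) is injective.

    When the map is injective, equal products imply equal inputs, so
    `icmp eq/ne (mul x, c), (mul y, c)` agrees with `icmp eq/ne x, y`.
    """
    return len({mul_wrap(x, c) for x in range(1 << BITS)}) == (1 << BITS)


def fold_is_sound_for_nuw(c: int) -> bool:
    """Same check, restricted to inputs whose product does not wrap (`nuw`)."""
    no_wrap = [x for x in range(1 << BITS) if x * c <= MASK]
    return len({x * c for x in no_wrap}) == len(no_wrap)


def find_counterexample(c: int):
    """Return a pair x != y whose wrapped products collide, or None."""
    seen = {}
    for x in range(1 << BITS):
        p = mul_wrap(x, c)
        if p in seen:
            return seen[p], x
        seen[p] = x
    return None


if __name__ == "__main__":
    # Every odd constant is invertible modulo 2^8, so no collisions occur.
    print(all(fold_is_sound_for_odd(c) for c in range(1, 256, 2)))
    # An even constant without no-wrap flags loses the top bit of the input.
    print(find_counterexample(2))
```

With an even constant and no no-wrap flag, distinct inputs can collide (for example 0 and 128 both give 0 when multiplied by 2), which matches the commit's requirement of either an odd constant or a no-overflow guarantee. The `nsw` variant listed in the message is not covered by this sketch.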


138742 Commits