llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	4f64ade04c	AMDGPU/GlobalISel: Select src modifiers llvm-svn: 364782	2019-07-01 15:18:56 +00:00
Diana Picus	2ba16011c1	Fixup r364512 Fix stack-use-after-scope errors from r364512. One instance was already fixed in r364611 - this patch simplifies that fix and addresses one more instance of similar code. Discussed in: https://reviews.llvm.org/D63905 llvm-svn: 364778	2019-07-01 15:07:38 +00:00
Krzysztof Parzyszek	511ad50db4	[Hexagon] Rework VLCR algorithm Add code to catch pattern for commutative instructions for VLCR. Patch by Suyog Sarda. llvm-svn: 364770	2019-07-01 13:50:47 +00:00
Matt Arsenault	1b317685e9	AMDGPU: Convert some places to Register llvm-svn: 364769	2019-07-01 13:44:46 +00:00
Matt Arsenault	5bf850d52e	AMDGPU/GlobalISel: Fix RegBankSelect for G_FCANONICALIZE llvm-svn: 364768	2019-07-01 13:40:18 +00:00
Matt Arsenault	b5fc94f3e7	AMDGPU/GlobalISel: Fix RegBankSelect for G_BUILD_VECTOR llvm-svn: 364767	2019-07-01 13:40:17 +00:00
Matt Arsenault	89fc8bcdd6	AMDGPU/GlobalISel: Fail on store to 32-bit address space llvm-svn: 364766	2019-07-01 13:37:39 +00:00
Matt Arsenault	3b7668ae4b	AMDGPU/GlobalISel: Improve icmp selection coverage. Select s64 eq/ne scalar icmp. llvm-svn: 364765	2019-07-01 13:34:26 +00:00
Matt Arsenault	c23149f612	AMDGPU/GlobalISel: RegBankSelect for WWM/WQM llvm-svn: 364763	2019-07-01 13:30:12 +00:00
Matt Arsenault	facf69e844	AMDGPU/GlobalISel: Use vcc reg bank for amdgcn.wqm.vote llvm-svn: 364762	2019-07-01 13:30:09 +00:00
Matt Arsenault	9f992c238a	AMDGPU/GlobalISel: Fix scc->vcc copy handling This was checking the size of the register with the value of the size, which happens to be exec. Also fix assuming VCC is 64-bit to fix wave32. Also remove some untested handling for physical registers which is skipped. This doesn't insert the V_CNDMASK_B32 if SCC is the physical copy source. I'm not sure if this should be trying to handle this special case instead of dealing with this in copyPhysReg. llvm-svn: 364761	2019-07-01 13:22:07 +00:00
Matt Arsenault	5dafcb9b11	AMDGPU/GlobalISel: Use and instead of BFE with inline immediate Zext from s1 is the only case where this should do anything with the current legal extensions. llvm-svn: 364760	2019-07-01 13:22:06 +00:00
Simon Atanasyan	ceb9da5bc7	[mips] Add missing schedinfo for MSA and ASE instructions llvm-svn: 364757	2019-07-01 13:21:05 +00:00
Simon Atanasyan	c0121bf874	[mips] Add missing schedinfo for atomic instructions llvm-svn: 364756	2019-07-01 13:20:56 +00:00
Simon Atanasyan	3a10810b7a	[mips] Add missing schedinfo for ADJCALLSTACKDOWN, ADJCALLSTACKUP llvm-svn: 364755	2019-07-01 13:20:48 +00:00
Florian Hahn	33c8c0ea27	[AMDGPU] Call isLoopExiting for blocks in the loop. isLoopExiting should only be called for blocks in the loop. A follow up patch makes this requirement an assertion. I've updated the usage here, to only match for actual exit blocks. Previously, it would also match blocks not in the loop. Reviewers: arsenm, nhaehnle Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D63980 llvm-svn: 364750	2019-07-01 12:36:44 +00:00
Fangrui Song	92e78b7bed	[RISCV] Add break; to the last switch case As suggested by jrtc27 in the post-commit review of D60528. llvm-svn: 364746	2019-07-01 11:41:07 +00:00
Simon Pilgrim	172fe5dd19	[X86] CombineShuffleWithExtract - updated description comments. NFCI. CombineShuffleWithExtract no longer requires that both shuffle ops are extract_subvectors, from the same type or from the same size. llvm-svn: 364745	2019-07-01 11:33:45 +00:00
Benjamin Kramer	ed13fef477	[SelectionDAG] Do minnum->minimum at legalization time instead of building time The SDAGBuilder behavior stems from the days when we didn't have fast math flags available in SDAG. We do now and doing the transformation in the legalizer has the advantage that it also works for vector types. llvm-svn: 364743	2019-07-01 11:00:23 +00:00
Roman Lebedev	f55818e3a7	[InstCombine] Omit 'urem' where possible This was added in D63390 / rL364286 to backend, but it makes sense to also handle it in middle-end. https://rise4fun.com/Alive/Zsln llvm-svn: 364738	2019-07-01 09:41:43 +00:00
Jeremy Morse	d2b6665e33	[DebugInfo] Avoid adding too much indirection to pointer-valued variables This patch addresses PR41675, where a stack-pointer variable is dereferenced too many times by its location expression, presenting a value on the stack as the pointer to the stack. The difference between a stack pointer DBG_VALUE and one that refers to a value on the stack, is currently the indirect flag. However the DWARF backend will also try to guess whether something is a memory location or not, based on whether there is any computation in the location expression. By simply prepending the stack offset to existing expressions, we can accidentally convert a register location into a memory location, which introduces a surprise (and unintended) dereference. The solution is to add DW_OP_stack_value whenever we add a DIExpression computation to a stack pointer. It's an implicit location computed on the expression stack, thus needs to be flagged as a stack_value. For the edge case where the offset is zero and the location could be a register location, DIExpression::prepend will still generate opcodes, and thus DW_OP_stack_value must still be added. Differential Revision: https://reviews.llvm.org/D63429 llvm-svn: 364736	2019-07-01 09:38:23 +00:00
Yevgeny Rouban	d4097b4a93	[SimpleLoopUnswitch] Implement handling of prof branch_weights metadata for SwitchInst Differential Revision: https://reviews.llvm.org/D60606 llvm-svn: 364734	2019-07-01 08:43:53 +00:00
Sam Parker	98722691b0	[ARM] WLS/LE Code Generation Backend changes to enable WLS/LE low-overhead loops for armv8.1-m: 1) Use TTI to communicate to the HardwareLoop pass that we should try to generate intrinsics that guard the loop entry, as well as setting the loop trip count. 2) Lower the BRCOND that uses said intrinsic to an Arm specific node: ARMWLS. 3) ISelDAGToDAG the node to a new pseudo instruction: t2WhileLoopStart. 4) Add support in ArmLowOverheadLoops to handle the new pseudo instruction. Differential Revision: https://reviews.llvm.org/D63816 llvm-svn: 364733	2019-07-01 08:21:28 +00:00
Craig Topper	29fff0797b	[X86] Improve the type checking fast-isel handling of vector bitcasts. We had a bunch of vector size legality checks for the source type based on feature flags, but we didn't check the destination type at all beyond ensuring that it was a "simple" type. But this allowed the destination to be i128 which isn't legal. This commit changes the code to use TLI's isTypeLegal logic in place of all the subtarget checks. Then additionally checks that the source and dest are vectors. Fixes 42452 llvm-svn: 364729	2019-07-01 07:09:34 +00:00
Craig Topper	4ca81a9b99	[X86] Add a DAG combine to replace vector loads feeding a v4i32->v2f64 CVTSI2FP/CVTUI2FP node with a vzload. But only when the load isn't volatile. This improves load folding during isel where we only have vzload and scalar_to_vector+load patterns. We can't have full vector load isel patterns for the same volatile load issue. Also add some missing masked cvtsi2fp/cvtui2fp with vzload patterns. llvm-svn: 364728	2019-07-01 07:09:31 +00:00
Craig Topper	d1728f8987	[X86] Add MOVHPDrm/MOVLPDrm patterns that use VZEXT_LOAD. We already had patterns that used scalar_to_vector+load. But we can also have a vzload. Found while investigating combining scalar_to_vector+load to vzload. llvm-svn: 364726	2019-07-01 07:09:23 +00:00
Sanjay Patel	706b48251f	[InstCombine] canonicalize fcmp+select to minnum/maxnum intrinsics This is the opposite direction of D62158 (we have to choose 1 form or the other). Now that we have FMF on the select, this becomes more palatable. And the benefits of having a single IR instruction for this operation (less chances of missing folds based on extra uses, etc) overcome my previous comments about the potential advantage of larger pattern matching/analysis. Differential Revision: https://reviews.llvm.org/D62414 llvm-svn: 364721	2019-06-30 13:40:31 +00:00
Fangrui Song	78ee2fbf98	Cleanup: llvm::bsearch -> llvm::partition_point after r364719 llvm-svn: 364720	2019-06-30 11:19:56 +00:00
Craig Topper	725a8a5dc4	[X86] Custom lower AVX masked loads to masked load and vselect instead of selecting a maskmov+vblend during isel. AVX masked loads only support 0 as the value for masked off elements. So we need an extra blend to support other values. Previously we expanded the masked load to two instructions with isel patterns. With this patch we now insert the vselect during lowering and it will be separately selected as a blend. llvm-svn: 364718	2019-06-30 06:46:37 +00:00
Craig Topper	4d0feb28ec	[SelectionDAG] Use the memory VT instead of result VT for FoldingSet profiling in getMaskedLoad/getMaskedStore. This matches what is done by the Profile function. Otherwise CSE won't work properly. llvm-svn: 364717	2019-06-30 06:46:33 +00:00
Nikita Popov	8023c84433	[LFTR] Rephrase getLoopTest into "based-on" check; NFCI What we want to know here is whether we're already using this value for the loop condition, so make the query about that. We can extend this to a more general "based-on" relationship, rather than a direct icmp use later. llvm-svn: 364715	2019-06-29 15:12:59 +00:00
Sanjay Patel	77dc1e8568	[InstCombine] canonicalize fmin/fmax to LLVM intrinsics minnum/maxnum This transform came up in D62414, but we should deal with it first. We have LLVM intrinsics that correspond exactly to libm calls (unlike most libm calls, these libm calls never set errno). This holds without any fast-math-flags, so we should always canonicalize to those intrinsics directly for better optimization. Currently, we convert to fcmp+select only when we have FMF (nnan) because fcmp+select does not preserve the semantics of the call in the general case. Differential Revision: https://reviews.llvm.org/D63214 llvm-svn: 364714	2019-06-29 14:28:54 +00:00
Nikita Popov	61a8b62b4c	[LFTR] Remove unnecessary latch check; NFCI The whole indvars pass works on loops in simplified form, so there is always a unique latch. Convert the condition into an assertion in needsLFTR (though we also assert this in later LFTR functions). Additionally update the comment on getLoopTest() now that we are dealing with multiple exits. llvm-svn: 364713	2019-06-29 12:41:02 +00:00
Roman Lebedev	e3a94ba4a9	[InstCombine] Shift amount reassociation (PR42391) Summary: Given pattern: `(x shiftopcode Q) shiftopcode K` we should rewrite it as `x shiftopcode (Q+K)` iff `(Q+K) u< bitwidth(x)` This is valid for any shift, but they must be identical. * https://rise4fun.com/Alive/9E2 * exact on both lshr => exact https://rise4fun.com/Alive/plHk * exact on both ashr => exact https://rise4fun.com/Alive/QDAA * nuw on both shl => nuw https://rise4fun.com/Alive/5Uk * nsw on both shl => nsw https://rise4fun.com/Alive/0plg Should fix [[ https://bugs.llvm.org/show_bug.cgi?id=42391 | PR42391]]. Reviewers: spatel, nikic, RKSimon Reviewed By: nikic Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63812 llvm-svn: 364712	2019-06-29 11:51:50 +00:00
Dmitry Venikov	9e9eb62f9f	[APInt] Fix getBitsNeeded for INT_MIN values Summary: This patch fixes behaviour of APInt::getBitsNeeded for INT_MIN 10 bits values. Reviewers: regehr, RKSimon Reviewed By: RKSimon Subscribers: grandinj, dexonsmith, kristina, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63691 llvm-svn: 364710	2019-06-29 11:38:12 +00:00
Nikita Popov	2d756c4feb	[LFTR] Fix post-inc pointer IV with truncated exit count (PR41998) Fixes https://bugs.llvm.org/show_bug.cgi?id=41998. Usually when we have a truncated exit count we'll truncate the IV when comparing against the limit, in which case exit count overflow in post-inc form doesn't matter. However, for pointer IVs we don't do that, so we have to be careful about incrementing the IV in the wide type. I'm fixing this by removing the IVCount variable (which was ExitCount or ExitCount+1) and replacing it with a UsePostInc flag, and then moving the actual limit adjustment to the individual cases (which are: pointer IV where we add to the wide type, integer IV where we add to the narrow type, and constant integer IV where we add to the wide type). Differential Revision: https://reviews.llvm.org/D63686 llvm-svn: 364709	2019-06-29 09:24:12 +00:00
Matt Arsenault	0d45209757	AMDGPU/GlobalISel: RegBankSelect for update.dpp llvm-svn: 364701	2019-06-29 00:44:36 +00:00
Matt Arsenault	fd82cf4f4d	AMDGPU/GlobalISel: RegBankSelect for atomic.inc/atomic.dec llvm-svn: 364699	2019-06-29 00:39:20 +00:00
Matt Arsenault	adb1f21e52	AMDGPU/GlobalISel: RegBankSelect for some DS intrinsics llvm-svn: 364698	2019-06-29 00:33:13 +00:00
Matt Arsenault	b416d5fc8b	AMDGPU/GlobalISel: RegBankSelect for some easy intrinsics llvm-svn: 364697	2019-06-29 00:29:56 +00:00
Matt Arsenault	5ea3c9adb2	AMDGPU/GlobalISel: RegBankSelect for icmp/fcmp intrinsics llvm-svn: 364696	2019-06-29 00:28:52 +00:00
Matt Arsenault	6aafb3068f	AMDGPU/GlobalISel: RegBankSelect for amdgcn.div.fmas llvm-svn: 364695	2019-06-29 00:25:53 +00:00
Matt Arsenault	ade5162432	AMDGPU/GlobalISel: RegBankSelect for some simple leaf intrinsics llvm-svn: 364694	2019-06-29 00:22:28 +00:00
Philip Reames	1504b6ee7e	[IndVars] Remove a bit of manual constant folding [NFC] SCEV is more than capable of folding (add x, trunc(0)) to x. llvm-svn: 364693	2019-06-29 00:19:31 +00:00
Wouter van Oortmerssen	319c87d94f	[WebAssembly] Assembler: support .int16/32/64 directives. Reviewers: sbc100 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63959 llvm-svn: 364689	2019-06-28 22:20:33 +00:00
Wouter van Oortmerssen	35bcba4fae	[WebAssembly] Allow @object in .type directives. Reviewers: sbc100 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63955 llvm-svn: 364688	2019-06-28 21:53:11 +00:00
Sanjay Patel	9126c84f50	[x86] remove stale comment about cmov; NFC The cmov node used to sometimes return a glue result (and that's what 'flag' meant in this context), but that was removed with D38664. llvm-svn: 364687	2019-06-28 21:45:55 +00:00
Wouter van Oortmerssen	fc222e23ca	[WebAssembly] Assembler: Allow offsets and p2align in symbol load. Reviewers: sbc100 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63951 llvm-svn: 364682	2019-06-28 20:31:13 +00:00
Wouter van Oortmerssen	597ba18008	[WebAssembly] Assembler: Improve section parsing. Reviewers: sbc100 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63947 llvm-svn: 364681	2019-06-28 20:29:16 +00:00
Cameron McInally	30e5cf1d8f	[NewGVN] Add unary FNeg support to NewGVN pass Differential Revision: https://reviews.llvm.org/D63933 llvm-svn: 364680	2019-06-28 20:09:32 +00:00
Cameron McInally	ab4b2364e5	[GVNSink] Add unary FNeg support to GVNSink pass Differential Revision: https://reviews.llvm.org/D63900 llvm-svn: 364678	2019-06-28 19:57:31 +00:00
Brad Smith	4b733ca617	Default to Secure PLT on PPC for musl libc. This matches the default settings of clang. llvm-svn: 364675	2019-06-28 19:48:31 +00:00
Simon Pilgrim	978a08c885	[X86] CombineShuffleWithExtract - recurse through EXTRACT_SUBVECTOR chain llvm-svn: 364667	2019-06-28 17:57:32 +00:00
Peter Collingbourne	7108df964a	hwasan: Remove the old frame descriptor mechanism. Differential Revision: https://reviews.llvm.org/D63470 llvm-svn: 364665	2019-06-28 17:53:26 +00:00
Wouter van Oortmerssen	633d222d30	[WebAssembly] Added visibility and ident directives to WasmAsmParser. Summary: These are output by clang -S, so can now be roundtripped thru clang. (partially) fixes: https://bugs.llvm.org/show_bug.cgi?id=34544 Reviewers: dschuff Subscribers: sbc100, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63901 llvm-svn: 364658	2019-06-28 16:51:06 +00:00
Dmitry Preobrazhensky	e1eb25ff3e	[AMDGPU][MC] Fix 2 for sanitizer failure in 364645 llvm-svn: 364656	2019-06-28 16:28:46 +00:00
Sam Tebbs	e39e958da3	[ARM] Add support for the MVE long shift instructions MVE adds the lsll, lsrl and asrl instructions, which perform a shift on a 64 bit value separated into two 32 bit registers. The Expand64BitShift function is modified to accept ISD::SHL, ISD::SRL and ISD::SRA and convert it into the appropriate opcode in ARMISD. An SHL is converted into an lsll, an SRL is converted into an lsrl for the immediate form and a negation and lsll for the register form, and SRA is converted into an asrl. test/CodeGen/ARM/shift_parts.ll is added to test the logic of emitting these instructions. Differential Revision: https://reviews.llvm.org/D63430 llvm-svn: 364654	2019-06-28 15:43:31 +00:00
Dmitry Preobrazhensky	d12966c088	[AMDGPU][MC] Fix for sanitizer failure in 364645 llvm-svn: 364651	2019-06-28 15:22:47 +00:00
Dmitry Preobrazhensky	1d572ce395	[AMDGPU][MC] Enabled constant expressions as operands of sendmsg See bug 40820: https://bugs.llvm.org/show_bug.cgi?id=40820 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D62735 llvm-svn: 364645	2019-06-28 14:14:02 +00:00
Simon Pilgrim	a54e1a0f01	[X86] CombineShuffleWithExtract - only require 1 source to be EXTRACT_SUBVECTOR We were requiring that both shuffle operands were EXTRACT_SUBVECTORs, but we can relax this to only require one of them to be. Also, we shouldn't bother attempting this if both operands are from the lowest subvector (or not EXTRACT_SUBVECTOR at all). llvm-svn: 364644	2019-06-28 12:24:49 +00:00
David Green	9dbdfe6b78	[ARM] Add MVE mul patterns This simply adds integer and floating point VMUL patterns for MVE, same as we have add and sub. Differential Revision: https://reviews.llvm.org/D63866 llvm-svn: 364643	2019-06-28 11:44:03 +00:00
David Green	2883944035	[ARM] Mark math routines as non-legal for MVE This adds handling and tests for a number of floating point math routines, which have no MVE instructions. Differential Revision: https://reviews.llvm.org/D63725 llvm-svn: 364641	2019-06-28 11:17:38 +00:00
David Green	ff70cbc895	[ARM] MVE patterns for VABS and VNEG This simply adds the required patterns for fp neg and abs. Differential Revision: https://reviews.llvm.org/D63861 llvm-svn: 364640	2019-06-28 10:25:35 +00:00
Fangrui Song	493a120259	[DebugInfo] Simplify GSYM::AddressRange and GSYM::AddressRanges Delete unnecessary getters of AddressRange. Simplify AddressRange::size(): Start <= End check should be checked in an upper layer. Delete isContiguousWith() that doesn't make sense. Simplify AddressRanges::insert. Delete commented code. Fix it when more than 1 ranges are to be deleted. Delete trailing newline. llvm-svn: 364637	2019-06-28 10:06:11 +00:00
David Green	eb7080ac6e	[ARM] Widening loads and narrowing stores MVE has instructions to widen as it loads, and narrow as it stores. This adds the required patterns and legalisation to make them work including specifying that they are legal, patterns to select them and test changes. Patch by David Sherwood. Differential Revision: https://reviews.llvm.org/D63839 llvm-svn: 364636	2019-06-28 09:47:55 +00:00
Simon Tatham	29ff1b4f46	[ARM] Fix integer UB in MVE load/store immediate handling. llvm-svn: 364635	2019-06-28 09:28:39 +00:00
Fangrui Song	e662b6985a	[DebugInfo] GSYM cleanups after D63104/r364427 llvm-svn: 364634	2019-06-28 08:58:05 +00:00
David Green	07e53fee14	[ARM] MVE loads and stores This fills in the gaps for basic MVE loads and stores, allowing unaligned access and adding far too many tests. These will become important as narrowing/expanding and pre/post inc are added. Big endian might still not be handled very well, because we have not yet added bitcasts (and I'm not sure how we want it to work yet). I've included the alignment code anyway which maps with our current patterns. We plan to return to that later. Code written by Simon Tatham, with additional tests from Me and Mikhail Maltsev. Differential Revision: https://reviews.llvm.org/D63838 llvm-svn: 364633	2019-06-28 08:41:40 +00:00
Dylan McKay	2bc48f503a	[AVR] Don't look for the TargetFrameLowering in the FrameLowering implementation c.f. r364349 llvm-svn: 364632	2019-06-28 08:35:21 +00:00
David Green	fc4102417b	[ARM] Mark div and rem as expand for MVE We don't have vector operations for these, so they need to be expanded for both integer and float. Differential Revision: https://reviews.llvm.org/D63595 llvm-svn: 364631	2019-06-28 08:18:55 +00:00
David Green	62889b0ea5	[ARM] Select MVE fp add and sub The same as integer arithmetic, we can add simple floating point MVE addition and subtraction patterns. Initial code by David Sherwood Differential Revision: https://reviews.llvm.org/D63257 llvm-svn: 364629	2019-06-28 07:41:09 +00:00
Sam Parker	9a92be1b35	[HardwareLoops] Loop counter guard intrinsic Introduce llvm.test.set.loop.iterations which sets the loop counter and also produces an i1 after testing that the count is not zero. Differential Revision: https://reviews.llvm.org/D63809 llvm-svn: 364628	2019-06-28 07:38:16 +00:00
David Green	be05b85db9	[ARM] Select MVE add and sub This adds the first few patterns for MVE code generation, adding simple integer add and sub patterns. Initial code by David Sherwood Differential Revision: https://reviews.llvm.org/D63255 llvm-svn: 364627	2019-06-28 07:21:11 +00:00
David Green	8be372b190	[ARM] MVE vector shuffles This patch adds necessary shuffle vector and buildvector support for ARM MVE. It essentially adds support for VDUP, VREVs and some VMOVs, which are often required by other code (like upcoming patches). This mostly uses the same code from Neon that already generated NEONvdup/NEONvduplane/NEONvrev's. These have been renamed to ARMvdup/etc and moved to ARMInstrInfo as they are common to both architectures. Most of the selection code seems to be applicable to both, but NEON does have some more instructions making some parts specific. Most code originally by David Sherwood. Differential Revision: https://reviews.llvm.org/D63567 llvm-svn: 364626	2019-06-28 07:08:42 +00:00
Craig Topper	cbb88a5169	[X86] Connect the output chain properly when combining vzext_movl+load into vzext_load. llvm-svn: 364625	2019-06-28 06:58:50 +00:00
Craig Topper	e832adea0f	[X86] Remove some duplicate patterns that already exist as part of their instruction definition. NFC llvm-svn: 364623	2019-06-28 05:03:47 +00:00
Alex Brachet	3b715d67dd	[Support] Add fs::getUmask() function and change fs::setPermissions Summary: This patch changes fs::setPermissions to optionally set permissions while respecting the umask. It also adds the function fs::getUmask() which returns the current umask. Reviewers: jhenderson, rupprecht, aprantl, lhames Reviewed By: jhenderson, rupprecht Subscribers: sanaanajjar231288, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63583 llvm-svn: 364621	2019-06-28 03:21:00 +00:00
Zi Xuan Wu	588a170970	[NFC][PowerPC] Move XSQP series instruction apart from XSQPO series in position of td file llvm-svn: 364620	2019-06-28 02:51:03 +00:00
Stanislav Mekhanoshin	07fd88d735	[AMDGPU] Packed thread ids in function call ABI Differential Revision: https://reviews.llvm.org/D63851 llvm-svn: 364619	2019-06-28 01:52:13 +00:00
Matt Arsenault	3018d1845b	GlobalISel: Use Register llvm-svn: 364618	2019-06-28 01:47:44 +00:00
Kai Luo	c6fe8436e8	[PowerPC][NFC] Use `|=` to update `Simplified` flag llvm-svn: 364617	2019-06-28 01:38:42 +00:00
Matt Arsenault	1178dc3d0b	AMDGPU/GlobalISel: Convert to using Register llvm-svn: 364616	2019-06-28 01:16:46 +00:00
Matt Arsenault	5e66db6b8c	GlobalISel: Convert rest of MachineIRBuilder to using Register llvm-svn: 364615	2019-06-28 01:16:41 +00:00
Amara Emerson	ecb7ac35f9	[GlobalISel][IRTranslator] Fix some PHI bugs related to jump tables when optimizations are used. The new switch lowering code that tries to generate jump tables and range checks were tested at -O0 on arm64, but on -O3 the generic switch lowering code goes to town on trying to generate optimized lowerings, e.g. multiple jump tables, range checks etc. This exposed bugs in the way PHI nodes are handled because the CFG looks even stranger after all of this is done. llvm-svn: 364613	2019-06-27 23:56:34 +00:00
Rumeet Dhindsa	ddc2804e1a	Fix ASAN error caused by commit r364512. This patch intends to fix ASAN stack-use-after-scope error. This is at least a short-term fix to unbreak LLVM's mainline. Differential Revision: https://reviews.llvm.org/D63905 llvm-svn: 364611	2019-06-27 23:37:04 +00:00
Peter Collingbourne	5378afc02a	hwasan: Use llvm.read_register intrinsic to read the PC on aarch64 instead of taking the function's address. This shaves an instruction (and a GOT entry in PIC code) off prologues of functions with stack variables. Differential Revision: https://reviews.llvm.org/D63472 llvm-svn: 364608	2019-06-27 23:24:07 +00:00
Roman Lebedev	29d05c005f	[CodeGen] [SelectionDAG] More efficient code for X % C == 0 (UREM case) (try 3) Summary: I'm submitting a new revision since i don't understand how to reclaim/reopen/take over the existing one, D50222. There is no such action in "Add Action" menu... This implements an optimization described in Hacker's Delight 10-17: when `C` is constant, the result of `X % C == 0` can be computed more cheaply without actually calculating the remainder. The motivation is discussed here: https://bugs.llvm.org/show_bug.cgi?id=35479. This is a recommit, the original commit rL364563 was reverted in rL364568 because test-suite detected miscompile - the new comparison constant 'Q' was being computed incorrectly (we divided by `D0` instead of `D`). Original patch D50222 by @hermord (Dmytro Shynkevych) Notes: - In principle, it's possible to also handle the `X % C1 == C2` case, as discussed on bugzilla. This seems to require an extra branch on overflow, so I refrained from implementing this for now. - An explicit check for when the `REM` can be reduced to just its LHS is included: the `X % C` == 0 optimization breaks `test1` in `test/CodeGen/X86/jump_sign.ll` otherwise. I hadn't managed to find a better way to not generate worse output in this case. - The `test/CodeGen/X86/jump_sign.ll` regresses, and is being fixed by a followup patch D63390. Reviewers: RKSimon, craig.topper, spatel, hermord, xbolva00 Reviewed By: RKSimon, xbolva00 Subscribers: dexonsmith, kristina, xbolva00, javed.absar, llvm-commits, hermord Tags: #llvm Differential Revision: https://reviews.llvm.org/D63391 llvm-svn: 364600	2019-06-27 21:52:10 +00:00
Cameron McInally	6e62a796d5	[GVN] Add support for unary FNeg to GVN pass Differential Revision: https://reviews.llvm.org/D63896 llvm-svn: 364592	2019-06-27 21:05:02 +00:00
Sanjay Patel	a95ca2b5ff	[x86] prevent crashing from select narrowing with AVX512 llvm-svn: 364585	2019-06-27 20:16:58 +00:00
Jinsong Ji	c627aa2fa9	[PowerPC][NFC] Remove unused (and unsupported) fusion feature bits. FeatureFusion bits were first introduced in https://reviews.llvm.org/rL253724 for add/load integer fusion for P8. The only use of `hasFusion` was https://reviews.llvm.org/rL255319. However, this was removed later in https://reviews.llvm.org/rL280440. So, there is no reference to fusion in code now. Leaving it there is misleading and confusing, so remove it for now. We can always add it back if we ever support fusion in the future. llvm-svn: 364581	2019-06-27 19:35:11 +00:00
Johannes Doerfert	6ed459fd41	Use "willreturn" in isGuaranteedToTransferExecutionToSuccessor The `willreturn` function attribute guarantees that a function call will come back to the call site if the call is also known not to throw. Therefore, this attribute can be used in `isGuaranteedToTransferExecutionToSuccessor`. Patch by Hideto Ueno (@uenoku) Reviewers: jdoerfert, sstefan1 Reviewed By: jdoerfert Subscribers: hiraditya, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63372 llvm-svn: 364580	2019-06-27 19:29:48 +00:00
Philip Reames	1cf9e72cbc	Update -analyze -scalar-evolution output for multiple exit loops w/computable exit values The previous output was next to useless if any exit was not computable. If we have more than one exit, show the exit count for each so that it's easier to see what's going on with SCEV analysis when debugging. llvm-svn: 364579	2019-06-27 19:22:43 +00:00
Michael Liao	c5486b23bc	Correct the file path. NFC. llvm-svn: 364577	2019-06-27 19:05:46 +00:00
Wouter van Oortmerssen	bfd3f69480	[WebAssembly] AsmParser: better atomic inst detection Summary: Previously missed atomic.notify. Fixes https://bugs.llvm.org/show_bug.cgi?id=40728 Reviewers: aheejin Subscribers: sbc100, jgravelle-google, sunfish, jfb, llvm-commits, dschuff Tags: #llvm Differential Revision: https://reviews.llvm.org/D63747 llvm-svn: 364576	2019-06-27 18:58:26 +00:00
Djordje Todorovic	774eabd097	Revert "[LiveDebugValues] Emit the debug entry values" Appears that the 'test/DebugInfo/MIR/X86/dbginfo-entryvals.mir' does not pass on Windows. This reverts commit rL364553. llvm-svn: 364571	2019-06-27 18:12:04 +00:00
Wouter van Oortmerssen	6b3f56b65f	[WebAssembly] Fix p2align in assembler. Summary: - Match the syntax output by InstPrinter. - Fix it always emitting 0 for align. Had to work around fact that opcode is not available for GetDefaultP2Align while parsing. - Updated tests that were erroneously happy with a p2align=0 Fixes https://bugs.llvm.org/show_bug.cgi?id=40752 Reviewers: aheejin, sbc100 Subscribers: jgravelle-google, sunfish, jfb, llvm-commits, dschuff Tags: #llvm Differential Revision: https://reviews.llvm.org/D63633 llvm-svn: 364570	2019-06-27 18:11:15 +00:00
Simon Pilgrim	1fd1c60979	[X86] combineX86ShufflesRecursively - merge shuffles with more than 2 inputs We already had the infrastructure for this, but were waiting for the fix for a number of regressions which were handled by the recent shuffle(extract_subvector(),extract_subvector()) -> extract_subvector(shuffle()) shuffle combines llvm-svn: 364569	2019-06-27 17:30:51 +00:00
Roman Lebedev	0a2b7b79fa	Revert "[CodeGen] [SelectionDAG] More efficient code for X % C == 0 (UREM case) (try 2)" Appears to break test-suite on http://lab.llvm.org:8011/builders/clang-cmake-x86_64-sde-avx512-linux/builds/23790 FAIL: burg.execution_time FAIL: spiff.execution_time FAIL: employ.execution_time FAIL: llu.execution_time FAIL: gramschmidt.execution_time FAIL: fdtd-apml.execution_time This reverts commit r364563. llvm-svn: 364568	2019-06-27 17:22:31 +00:00
Nicolai Haehnle	32ef9292be	AMDGPU: Make fixing i1 copies robust against re-ordering Summary: The new test case led to incorrect code. Change-Id: Ief48b227e97aa662dd3535c9bafb27d4a184efca Reviewers: arsenm, david-salinas Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63871 llvm-svn: 364566	2019-06-27 16:56:44 +00:00
Simon Pilgrim	e9a2f4fe2c	Use getConstantOperandAPInt instead of getConstantOperandVal for comparisons. getConstantOperandAPInt avoids any large integer issues - these are unlikely but the fuzzers do like to mess around..... llvm-svn: 364564	2019-06-27 16:46:00 +00:00

124284 Commits