llvm-project

Commit Graph

Author	SHA1	Message	Date
Sander de Smalen	c33d668ab7	[AArch64][SVE] Asm: Support for indexed DUP instructions. Unpredicated copy of indexed SVE element to SVE vector, along with MOV-aliases. For example: dup z0.h, z1.h[0] duplicates the first 16-bit element from z1 to all elements in the result vector z0. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47570 llvm-svn: 333871	2018-06-04 06:40:55 +00:00
Sander de Smalen	367a53b059	[AArch64][SVE] Asm: Support for FCPY immediate instructions. Predicated copy of floating-point immediate value to SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: javed.absar Differential Revision: https://reviews.llvm.org/D47518 llvm-svn: 333869	2018-06-04 05:58:06 +00:00
Sander de Smalen	512d57f1a5	[AArch64][SVE] Asm: Support for CPY immediate instructions Predicated copy of possibly shifted immediate value into SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47517 llvm-svn: 333868	2018-06-04 05:40:46 +00:00
Craig Topper	9923eac358	[X86] Remove and autoupgrade masked avx512vnni intrinsics using the unmasked intrinsics and select instructions. llvm-svn: 333857	2018-06-03 23:24:17 +00:00
Amaury Sechet	99909e9308	Remove SETCCE use from Lanai's backend Summary: This creates a small perf regression, but after talking with Jacques Pienaar, he was good with it to get things moving toward removng SETCCE. Reviewers: jpienaar, bryant Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47626 llvm-svn: 333838	2018-06-03 12:56:24 +00:00
Ivan A. Kosarev	60a991ed1a	[NEON] Support VLD1xN intrinsics in AArch32 mode (LLVM part) We currently support them only in AArch64. The NEON Reference, however, says they are 'ARMv7, ARMv8' intrinsics. Differential Revision: https://reviews.llvm.org/D47120 llvm-svn: 333825	2018-06-02 16:40:03 +00:00
Ivan A. Kosarev	73c5337a64	Revert r333819 "[NEON] Support VLD1xN intrinsics in AArch32 mode (Clang part)" The LLVM part was committed instead of the Clang part. Differential Revision: https://reviews.llvm.org/D47121 llvm-svn: 333824	2018-06-02 16:38:38 +00:00
Craig Topper	93d8fbd8f2	[X86] Add tied source operand to AVX5124FMAPS and AVX5124VNNIW instructions. This doesn't affect the assembly or disassembly, but is more accurate. llvm-svn: 333822	2018-06-02 16:30:39 +00:00
Craig Topper	27234f1d8f	[X86] Fix warning message for AVX5124FMAPS and AVX5124VNNIW instructions in the assembly parser. The caret was positioned on the wrong operand. It's too hard to get right so just put the caret at the beginning of the instruction. llvm-svn: 333821	2018-06-02 16:30:36 +00:00
Ivan A. Kosarev	51f19b9ee1	[NEON] Support VLD1xN intrinsics in AArch32 mode (Clang part) We currently support them only in AArch64. The NEON Reference, however, says they are 'ARMv7, ARMv8' intrinsics. Differential Revision: https://reviews.llvm.org/D47121 llvm-svn: 333819	2018-06-02 16:26:42 +00:00
Craig Topper	1534929623	[X86] Add encoding information for the AVX5124FMAPS and AVX5124VNNIW instructions so they can be assembled and disassembled. These instructions are unusual in that they operate on 4 consecutive registers so supporting them in codegen will be more difficult than normal. Includes an assembler check to warn if the source register is not the first register of a 4 register group. llvm-svn: 333812	2018-06-02 02:15:10 +00:00
Craig Topper	3828ce7eab	[X86] Do something sensible when an expand load intrinsic is passed a 0 mask. Previously we just returned undef, but really we should be returning the pass thru input. We also need to make sure we preserve the chain output that the original intrinsic node had to maintain connectivity in the DAG. So we should just return the incoming chain as the output chain. llvm-svn: 333804	2018-06-01 22:59:07 +00:00
Craig Topper	aa747412b1	[X86] Add isel patterns to use vexpand with zero masking when the passthru value is a zero vector. llvm-svn: 333800	2018-06-01 22:28:28 +00:00
Simon Atanasyan	e80c3ce9cc	[mips] Support 64-bit offsets for lb/sb/ld/sd/lld ... instructions The `MipsAsmParser::loadImmediate` can load immediates of various sizes into a register. Idea of this change is to use `loadImmediate` in the `MipsAsmParser::expandMemInst` method to load offset into a register and then call required load/store instruction. The patch removes separate `expandLoadInst` and `expandStoreInst` methods and does everything in the `expandMemInst` method to escape code duplication. Differential Revision: https://reviews.llvm.org/D47316 llvm-svn: 333774	2018-06-01 16:37:53 +00:00
Simon Atanasyan	3a44bcf95a	[mips] Extend list of relocations supported by the `.reloc` directive Supporting GOT and TLS related relocations by the `.reloc` directive is useful for purpose of testing various tools like a linker, for example. llvm-svn: 333773	2018-06-01 16:37:42 +00:00
Krzysztof Parzyszek	bc68385dad	[Hexagon] Avoid UB when shifting unsigned integer left by 32 llvm-svn: 333771	2018-06-01 15:39:10 +00:00
Krzysztof Parzyszek	aec2c0c9b6	[Hexagon] Select HVX code for vector CTPOP, CTLZ, and CTTZ llvm-svn: 333760	2018-06-01 14:52:58 +00:00
Hiroshi Inoue	9796b47df1	[NFC] Zero initialize local variables This patch makes local variables zero initialized to avoid broken values in debug output. llvm-svn: 333754	2018-06-01 14:23:15 +00:00
Krzysztof Parzyszek	0b6187c1a9	[SelectionDAG] Expand UADDO/USUBO into ADD/SUBCARRY if legal for target Additionally, implement handling of ADD/SUBCARRY on Hexagon, utilizing the UADDO/USUBO expansion. Differential Revision: https://reviews.llvm.org/D47559 llvm-svn: 333751	2018-06-01 14:00:32 +00:00
Amaury Sechet	8467411dad	Set ADDE/ADDC/SUBE/SUBC to expand by default Summary: They've been deprecated in favor of UADDO/ADDCARRY or USUBO/SUBCARRY for a while. Target that uses these opcodes are changed in order to ensure their behavior doesn't change. Reviewers: efriedma, craig.topper, dblaikie, bkramer Subscribers: jholewinski, arsenm, jyknight, sdardis, nemanjai, nhaehnle, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, jordy.potman.lists, apazos, sabuasal, niosHD, jrtc27, zzheng, edward-jones, mgrang, atanasyan, llvm-commits Differential Revision: https://reviews.llvm.org/D47422 llvm-svn: 333748	2018-06-01 13:21:33 +00:00
Amara Emerson	5a3bb68e12	[AArch64][GlobalISel] Zero-extend s1 values when returning. Before we were relying on the any extend of the s1 to s32, but for AAPCS we need to zero-extend it to at least s8. Fixes PR36719 Differential Revision: https://reviews.llvm.org/D47425 llvm-svn: 333747	2018-06-01 13:20:32 +00:00
Sander de Smalen	f95ea047e5	[AArch64][SVE] Asm: Support for FDUP_ZI (copy fp immediate) instruction. Unpredicated copy of floating-point immediate value into SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47482 llvm-svn: 333744	2018-06-01 12:54:46 +00:00
Simon Dardis	351aa594f6	[mips] Guard more aliases correctly. Also, duplicate an alias for microMIPS. llvm-svn: 333741	2018-06-01 10:57:13 +00:00
Simon Dardis	54217598b6	[mips] Guard 'nop' properly and add mips16's nop instruction Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47583 llvm-svn: 333739	2018-06-01 10:46:00 +00:00
Simon Dardis	ee67dcb837	[mips] Select the correct instruction for computing frameindexes Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47582 llvm-svn: 333736	2018-06-01 10:07:10 +00:00
Sander de Smalen	97ca6b9e09	[AArch64][SVE] Asm: Support for DUPM (masked immediate) instruction. Unpredicated copy of repeating immediate pattern to SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47328 llvm-svn: 333731	2018-06-01 07:25:46 +00:00
Craig Topper	c3cf55b935	[X86][Disassembler] Make it an error to set EVEX.R' to 0 when modrm.reg encodes a GPR. This is different than the behavior of EVEX.X extending modrm.rm to 5 bits. llvm-svn: 333728	2018-06-01 06:11:29 +00:00
Craig Topper	0838c4d6bc	[X86][Disassembler] Ignore EVEX.X extension of modrm.rm to 5-bits when modrm.rm encodes a k-register. llvm-svn: 333727	2018-06-01 05:36:08 +00:00
Craig Topper	74a61b02e0	[X86][Disassembler] Clamp index to 4-bits when decoding GPR registers. A 5-bit value can occur when EVEX.X is 0 due to it being used to extend modrm.rm to encode XMM16-31. But if modrm.rm instead encodes a GPR, the Intel documentation says EVEX.X should be ignored so just mask it to 4 bits once we know its a GPR. llvm-svn: 333725	2018-06-01 05:12:44 +00:00
Craig Topper	5b1dd01e57	[X86][Disassembler] Make sure EVEX.X is not used to extend base registers of memory operations. This was an accidental side effect of EVEX.X being used to encode XMM16-XMM31 using modrm.rm with modrm.mod==0x3. I think there are still more bugs related to this. llvm-svn: 333722	2018-06-01 04:29:34 +00:00
Craig Topper	c6b2c2bb70	[X86][Disassembler] Use a local variable instead of using a field in the instruction object. NFC llvm-svn: 333721	2018-06-01 04:29:30 +00:00
Tom Stellard	e43778895c	AMDGPU/R600: Move intrinsics to IntrinsicsAMDGPU.td Reviewers: arsenm, nhaehnle, jvesely Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D47487 llvm-svn: 333720	2018-06-01 02:19:46 +00:00
Craig Topper	dc5ba1e495	[X86] Make sure the check for VEX.vvvv being all ones on instructions that don't use it doesn't ignore a bit in 32-bit mode. llvm-svn: 333717	2018-06-01 01:23:52 +00:00
Craig Topper	0179c6d0e5	[X86][Disassembler] Suppress reading of EVEX.V' and EVEX.R' in 32-bit mode. llvm-svn: 333714	2018-06-01 00:10:36 +00:00
Dan Gohman	91ab25bbe3	[WebAssembly] Update to the new names for the memory intrinsics. The WebAssembly committee has decided on the names `memory.size` and `memory.grow` for the memory intrinsics, so update the LLVM intrinsics to follow those names, keeping both sets of old names in place for compatibility. llvm-svn: 333708	2018-05-31 22:35:25 +00:00
Dan Gohman	b17de645ea	[WebAssembly] Fix the signatures for the __mulo* libcalls. The __mulo* libcalls have an extra i32* to return the overflow value. Fixes PR37401. llvm-svn: 333706	2018-05-31 22:27:24 +00:00
Heejin Ahn	5ef4d5f9c1	[WebAssembly] Support instruction selection for catching exceptions Summary: This lowers exception catching-related instructions: 1. Lowers `wasm.catch` intrinsic to `catch` instruction 2. Removes `catchpad` and `cleanuppad` instructions; they are not necessary after isel phase. (`MachineBasicBlock::isEHFuncletEntry()` or `MachineBasicBlock::isEHPad()` can be used instead.) 3. Lowers `catchret` and `cleanupret` instructions to pseudo `catchret` and `cleanupret` instructions in isel, which will be replaced with other instructions in `WebAssemblyExceptionPrepare` pass. 4. Adds 'WebAssemblyExceptionPrepare` pass, which is for running various transformation for EH. Currently this pass only replaces `catchret` and `cleanupret` instructions into appropriate wasm instructions to make this patch successfully run until the end. Currently this does not handle lowering of intrinsics related to LSDA info generation (`wasm.landingpad.index` and `wasm.lsda`), because they cannot be tested without implementing `EHStreamer`'s wasm-specific handlers. They are marked as TODO, which is needed to make isel pass. Also this does not generate `try` and `end_try` markers yet, which will be handled in later patches. This patch is based on the first wasm EH proposal. (https://github.com/WebAssembly/exception-handling/blob/master/proposals/Exceptions.md) Reviewers: dschuff, majnemer Subscribers: jfb, sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D44090 llvm-svn: 333705	2018-05-31 22:25:54 +00:00
Stanislav Mekhanoshin	739174c4be	[AMDGPU] Construct memory clauses before RA Memory clauses are formed into bundles in presence of xnack. Their source operands are marked as early-clobber. This allows to allocate distinct source and destination registers within a clause and prevent breaking the clause with s_nop in the hazard recognizer. Clauses are undone before post-RA scheduler to allow some rescheduling, which will not break the clause since artificial edges are created in the dag to keep memory operations together. Yet this allows a better ILP in some cases. Differential Revision: https://reviews.llvm.org/D47511 llvm-svn: 333691	2018-05-31 20:13:51 +00:00
Sriraman Tallam	d10c4e07f5	Relax GOTPCREL relocations for tail jmp instructions. Differential Revision: https://reviews.llvm.org/D47563 llvm-svn: 333676	2018-05-31 18:12:33 +00:00
Francis Visoiu Mistrih	90aba024c5	[MC] Fallback on DWARF when generating compact unwind on AArch64 Instead of asserting when using the def_cfa directive with a register different from fp, fallback on DWARF. Easily triggered with: .cfi_def_cfa x1, 32; rdar://40249694 Differential Revision: https://reviews.llvm.org/D47593 llvm-svn: 333667	2018-05-31 16:33:26 +00:00
Roman Tereshin	f34d7ecc15	[GlobalISel][Mips] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call for Mips Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333665	2018-05-31 16:16:49 +00:00
Roman Tereshin	76c29c68dc	[GlobalISel][AMDGPU] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call for AMDGPU Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333664	2018-05-31 16:16:48 +00:00
Roman Tereshin	667c7581ed	[GlobalISel][ARM] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call and fixing bugs exposed Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333663	2018-05-31 16:16:48 +00:00
Roman Tereshin	cc1a16fdf9	[GlobalISel][X86] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call and fixing bugs exposed Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333662	2018-05-31 16:16:47 +00:00
Simon Pilgrim	ff0623cd29	[X86][SSE] Recognise splat rotations and expand back to shift ops. Noticed while fixing PR37426, for splat rotations (rotation by an uniform value) its better to just expand back to shift ops than performing as a general non-uniform rotation. llvm-svn: 333661	2018-05-31 15:47:17 +00:00
Simon Pilgrim	c34395d889	[X86][AVX] Add peekThroughEXTRACT_SUBVECTORs helper (NFCI) We often need this for AVX1 128-bit integer ops as they may have been split from a 256-bit source. llvm-svn: 333660	2018-05-31 15:15:49 +00:00
Clement Courbet	2e41c5a79c	[X86] Introduce WriteFLDC for x87 constant loads. Summary: {FLDL2E, FLDL2T, FLDLG2, FLDLN2, FLDPI} were using WriteMicrocoded. - I've measured the values for Broadwell, Haswell, SandyBridge, Skylake. - For ZnVer1 and Atom, values were transferred form InstRWs. - For SLM and BtVer2, I've guessed some values :( Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47585 llvm-svn: 333656	2018-05-31 14:22:01 +00:00
Simon Dardis	d9a453832d	[mips] Guard all short instructions correctly. Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47533 llvm-svn: 333645	2018-05-31 12:47:01 +00:00
Clement Courbet	b78ab5097d	[X86] Extract latency of fldz/fld1 in separate classes. Summary: - I've measured the values for Broadwell, Haswell, SandyBridge, Skylake. - For ZnVer1 and Atom, values were transferred form `InstRW`s. - For SLM and BtVer2, values are from Agner. This is split off from https://reviews.llvm.org/D47377 Reviewers: RKSimon, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47523 llvm-svn: 333642	2018-05-31 11:41:27 +00:00
Simon Pilgrim	346886bc0d	[X86][SSE] Add support for detecting SUB(SPLAT_BV, SPLAT) cases for shift-rotate patterns. This improves splat rotations (rotation by an uniform value), to avoid having to use the generic non-uniform shift code (extension to PR37426). llvm-svn: 333641	2018-05-31 11:25:16 +00:00

1 2 3 4 5 ...

47768 Commits