llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	4512d0a68b	AMDGPU: Replace list of SMEM buffer opcodes llvm-svn: 318506	2017-11-17 04:18:26 +00:00
Matt Arsenault	03c67d1eb2	AMDGPU: Fix breaking SMEM clauses This was completely ignoring subregisters, so was not very useful. Also only break them if xnack is actually enabled. llvm-svn: 318505	2017-11-17 04:18:24 +00:00
David Blaikie	b3bde2ea50	Fix a bunch more layering of CodeGen headers that are in Target All these headers already depend on CodeGen headers so moving them into CodeGen fixes the layering (since CodeGen depends on Target, not the other way around). llvm-svn: 318490	2017-11-17 01:07:10 +00:00
Yi Kong	39bcd4ed3e	[ARM] 't' asm constraint should accept i32 't' constraint normally only accepts f32 operands, but for VCVT the operands can be i32. LLVM is overly restrictive and rejects asm like: float foo() { float result; __asm__ __volatile__( "vcvt.f32.s32 %[result], %[arg1]\n" : [result]"=t"(result) : [arg1]"t"(0x01020304) ); return result; } Relax the value type for 't' constraint to either f32 or i32. Differential Revision: https://reviews.llvm.org/D40137 llvm-svn: 318472	2017-11-16 23:38:17 +00:00
Craig Topper	089082378f	[X86] Add DAG combine to remove sext i32->i64 from gather/scatter instructions. Only do this pre-legalize in case we're using the sign extend to legalize for KNL. This recovers all of the tests that changed when I stopped SelectionDAGBuilder from deleting sign extends. There's more work that could be done here particularly to fix the i8->i64 test case that experienced split. llvm-svn: 318468	2017-11-16 23:09:06 +00:00
Mandeep Singh Grang	47fbc5911d	[RISCV] Fix 64-bit data layout mismatch between backend and target description Reviewers: asb Reviewed By: asb Subscribers: rbar, johnrusso, simoncook, jordy.potman.lists, llvm-commits Differential Revision: https://reviews.llvm.org/D40145 llvm-svn: 318454	2017-11-16 20:30:49 +00:00
Craig Topper	e85ff4f732	[X86] Pre-truncate gather/scatter indices that have element sizes larger than 64-bits before Legalize. The wider element type will normally cause legalize to try to split and scalarize the gather/scatter, but we can't handle that. Instead, truncate the index early so the gather/scatter node is insulated from the legalization. This really shouldn't happen in practice since InstCombine will normalize index types to the same size as pointers. llvm-svn: 318452	2017-11-16 20:23:22 +00:00
Craig Topper	04be793cec	[X86] DAGCombinerInfo is in TargetLowering not X86TargetLowering. llvm-svn: 318451	2017-11-16 20:23:17 +00:00
Daniel Sanders	170baca646	[arc] Fix ambiguous overloaded operator error lib/Target/ARC/ARCISelLowering.cpp:490:22: error: use of overloaded operator '<<' is ambiguous (with operand types 'llvm::raw_ostream' and 'llvm::MVT::SimpleValueType') << RegVT.getSimpleVT().SimpleTy << "\n"); ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ llvm-svn: 318443	2017-11-16 19:16:56 +00:00
Yonghong Song	ce96738dee	bpf: print backward branch target properly Currently, it prints the backward branch offset as unsigned value like below: 7: 7d 34 0b 00 00 00 00 00 if r4 s>= r3 goto 11 <LBB0_3> 8: b7 00 00 00 00 00 00 00 r0 = 0 LBB0_2: 9: 07 00 00 00 01 00 00 00 r0 += 1 ...... 17: bf 31 00 00 00 00 00 00 r1 = r3 18: 6d 32 f6 ff 00 00 00 00 if r2 s> r3 goto 65526 <LBB0_3+0x7FFB0> The correct print insn 18 should be: 18: 6d 32 f6 ff 00 00 00 00 if r2 s> r3 goto -10 <LBB0_2> To provide better clarity and be consistent with kernel verifier output, the insn 7 output is changed to the following with "+" added to non-negative branch offset: 7: 7d 34 0b 00 00 00 00 00 if r4 s>= r3 goto +11 <LBB0_3> Signed-off-by: Yonghong Song <yhs@fb.com> Acked-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 318442	2017-11-16 19:15:36 +00:00
Daniel Sanders	1eaf300fac	[arc] Update TargetInfo to include the new backend name argument Also update a comment about the usage of RegisterTarget() that didn't mention the new argument. llvm-svn: 318441	2017-11-16 19:10:26 +00:00
Azharuddin Mohammed	fa8420d0a1	Fix RISCV build after r318352 Reviewers: asb, apazos, mgrang Reviewed By: mgrang Subscribers: rbar, johnrusso, simoncook, jordy.potman.lists, llvm-commits Differential Revision: https://reviews.llvm.org/D40139 llvm-svn: 318437	2017-11-16 18:39:31 +00:00
Guozhi Wei	433e8d3e04	[PPC] Change i32 constant in store instruction to i64 This patch changes all i32 constant in store instruction to i64 with truncation, to increase the chance that the referenced constant can be shared with other i64 constant. Differential Revision: https://reviews.llvm.org/D39352 llvm-svn: 318436	2017-11-16 18:27:34 +00:00
Mohammed Agabaria	6e6d5326a1	[TTI][X86] update costs of interleaved load\store of i64\double This patch contains more accurate cost of interelaved load\store of stride 2 for the types int64\double on AVX2. Reviewers: delena, RKSimon, craig.topper, dorit Reviewed By: dorit Differential Revision: https://reviews.llvm.org/D40008 llvm-svn: 318385	2017-11-16 09:38:32 +00:00
Craig Topper	46a5d58b8c	[X86] Update TTI to report that v1iX/v1fX types aren't legal for masked gather/scatter/load/store. The type legalizer will try to scalarize these operations if it sees them, but there is no handling for scalarizing them. This leads to a fatal error. With this change they will now be scalarized by the mem intrinsic scalarizing pass before SelectionDAG. llvm-svn: 318380	2017-11-16 06:02:05 +00:00
Eric Christopher	6348188e87	Fix thinko in last commit. llvm-svn: 318374	2017-11-16 03:25:02 +00:00
Eric Christopher	3148a1be88	Add NDEBUG checks around LLVM_DUMP_METHOD functions for Wunused-function warnings. llvm-svn: 318373	2017-11-16 03:18:15 +00:00
Craig Topper	e6601fd30e	[X86] Custom type legalize v2f32 masked gathers instead of trying to cleanup after type legalization. llvm-svn: 318368	2017-11-16 02:07:45 +00:00
Yonghong Song	4c3ce59e61	bpf: enable llvm-objdump to print out symbolized jmp target Add hook in BPF backend so that llvm-objdump can print out the jmp target with label names, e.g., ... if r1 != 2 goto 6 <LBB0_2> ... goto 7 <LBB0_4> ... LBB0_2: ... LBB0_4: ... Signed-off-by: Yonghong Song <yhs@fb.com> Acked-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 318358	2017-11-16 00:52:30 +00:00
Daniel Sanders	f76f315436	[globalisel][tablegen] Generate rule coverage and use it to identify untested rules Summary: This patch adds a LLVM_ENABLE_GISEL_COV which, like LLVM_ENABLE_DAGISEL_COV, causes TableGen to instrument the generated table to collect rule coverage information. However, LLVM_ENABLE_GISEL_COV goes a bit further than LLVM_ENABLE_DAGISEL_COV. The information is written to files (${CMAKE_BINARY_DIR}/gisel-coverage-* by default). These files can then be concatenated into ${LLVM_GISEL_COV_PREFIX}-all after which TableGen will read this information and use it to emit warnings about untested rules. This technique could also be used by SelectionDAG and can be further extended to detect hot rules and give them priority over colder rules. Usage: * Enable LLVM_ENABLE_GISEL_COV in CMake * Build the compiler and run some tests * cat gisel-coverage-[0-9]* > gisel-coverage-all * Delete lib/Target//GenGlobalISel.inc* * Build the compiler Known issues: * ${LLVM_GISEL_COV_PREFIX}-all must be generated as a manual step due to a lack of a portable 'cat' command. It should be the concatenation of all ${LLVM_GISEL_COV_PREFIX}-[0-9]* files. * There's no mechanism to discard coverage information when the ruleset changes Depends on D39742 Reviewers: ab, qcolombet, t.p.northover, aditya_nandakumar, rovka Reviewed By: rovka Subscribers: vsk, arsenm, nhaehnle, mgorny, kristof.beyls, javed.absar, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D39747 llvm-svn: 318356	2017-11-16 00:46:35 +00:00
Reid Kleckner	8d8a8bb7ee	Try to fix WebAssembly build after r318352 llvm-svn: 318355	2017-11-16 00:32:19 +00:00
Daniel Sanders	725584e26d	Add backend name to Target to enable runtime info to be fed back into TableGen Summary: Make it possible to feed runtime information back to tablegen to enable profile-guided tablegen-eration, detection of untested tablegen definitions, etc. Being a cross-compiler by nature, LLVM will potentially collect data for multiple architectures (e.g. when running 'ninja check'). We therefore need a way for TableGen to figure out what data applies to the backend it is generating at the time. This patch achieves that by including the name of the 'def X : Target ...' for the backend in the TargetRegistry. Reviewers: qcolombet Reviewed By: qcolombet Subscribers: jholewinski, arsenm, jyknight, aditya_nandakumar, sdardis, nemanjai, ab, nhaehnle, t.p.northover, javed.absar, qcolombet, llvm-commits, fedor.sergeev Differential Revision: https://reviews.llvm.org/D39742 llvm-svn: 318352	2017-11-15 23:55:44 +00:00
Evandro Menezes	82665b1ec4	[AArch64] Adjust the cost model for Exynos M1 and M2 Fix the modeling of FP stores. llvm-svn: 318351	2017-11-15 23:49:58 +00:00
Matt Arsenault	301162c4fe	AMDGPU: Replace i64 add/sub lowering Use VOP3 add/addc like usual. This has some tradeoffs. Inline immediates fold a little better, but other constants are worse off. SIShrinkInstructions could be made smarter to handle these cases. This allows us to avoid selecting scalar adds where we need to track the carry in scc and replace its users. This makes it easier to use the carryless VALU adds. llvm-svn: 318340	2017-11-15 21:51:43 +00:00
Evandro Menezes	5ba804bc11	[AArch64] Refactor the loads and stores optimizer Move remaining inline matching of instructions of some optimizations into separate functions, like in the other optimizations. Otherwise, NFC. Differential revision: https://reviews.llvm.org/D40090 llvm-svn: 318335	2017-11-15 21:06:22 +00:00
Craig Topper	54b57b0dd8	[X86] Add a return to the end of a switch to prevent an accidental fallthrough in the future. llvm-svn: 318330	2017-11-15 20:42:47 +00:00
Sean Fertile	0f0837e84e	[PowerPC] Implement mayBeEmittedAsTailCall for PPC Implements TargetLowering callback 'mayBeEmittedAsTailCall' that enables CodeGenPrepare to duplicate returns when they might enable a tail-call. Differential Revision: https://reviews.llvm.org/D39777 llvm-svn: 318321	2017-11-15 18:58:27 +00:00
Evandro Menezes	cbf70486bc	[AArch64] Adjust the cost model for Exynos M1 and M2 Fix the modeling of loads and stores using the pre or post indexed addressing modes. llvm-svn: 318312	2017-11-15 17:39:37 +00:00
Simon Pilgrim	56415772d6	[X86] Add CBW/CDQ/CDQE/CQO/CWD/CWDE to WriteALU schedule class Some CPUs are already overriding these sign extension instructions but we should be able to use the WriteALU schedule class by default. Differential Revision: https://reviews.llvm.org/D39899 llvm-svn: 318308	2017-11-15 17:11:24 +00:00
Sean Fertile	7b056b3048	[PowerPC] Split out the tailcall calling convention checks. NFC. Move the calling convention checks for tail-call eligibility for the 64-bit SysV ABI into a separate function. This is so that it can be shared with 'mayBeEmittedAsTailCall' in a subsequent change. llvm-svn: 318305	2017-11-15 16:53:41 +00:00
Sander de Smalen	8e607346af	[AArch64][SVE] Asm: Report SVE parsing diagnostics only once Summary: Prevent an issue where a diagnostic is reported multiple times by bailing out with a ParseFail if an invalid SVE register element qualifier/suffix is specified, for example: <stdin>:10:18: error: invalid sve vector kind qualifier add z20.h, z2.h, z31.x ^ <stdin>:10:18: error: invalid sve vector kind qualifier add z20.h, z2.h, z31.x ... <stdin>:10:18: error: invalid sve vector kind qualifier add z20.h, z2.h, z31.x ^ Reviewers: fhahn, rengolin Reviewed By: rengolin Subscribers: aemerson, javed.absar, tschuett, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D39894 llvm-svn: 318297	2017-11-15 15:44:43 +00:00
Petar Jovanovic	cd729ead01	[mips] Improve genConstMult() to work with arbitrary precision APInt is now used instead of uint64_t in function genConstMult() allowing multiplication optimizations with constants of arbitrary length. Patch by Milos Stojanovic. Differential Revision: https://reviews.llvm.org/D38130 llvm-svn: 318296	2017-11-15 15:24:04 +00:00
Momchil Velikov	4a91fb93db	[ARM] Split Arm jump table branch into i12 and rs suffixed versions This is a refactoring/cleanup of Arm `addrmode2` operand class. The patch removes it completely. Differential Revision: https://reviews.llvm.org/D39832 llvm-svn: 318291	2017-11-15 12:02:55 +00:00
Craig Topper	16a91cee6c	[X86] Redefine the 128-bit version of VPGATHERQD and VGATHERQPS to use a VK2 mask instead of a VK4 mask. This allows us to remove extra extend creation during lowering and more accurately reflects the semantics of the instruction. While there add an extra output VT to X86 masked gather node to better match the isel pattern predicate. Currently we're exploiting the fact that the isel table doesn't count how many output results a node actually has if the result type of any can be inferred from the first result and the type constraints defined in tablegen. I think we might ultimately want to lower all MGATHER/MSCATTER to an X86ISD node with the extra mask result and stop relying on this hole in the isel checking. llvm-svn: 318278	2017-11-15 07:46:43 +00:00
Hiroshi Inoue	72a1f98a67	[PowerPC] fix up in redundant compare elimination This patch fixes a potential problem in my previous commit (https://reviews.llvm.org/rL312514) by introducing an additional check. llvm-svn: 318266	2017-11-15 04:23:26 +00:00
Matt Arsenault	10c472dd83	AMDGPU: Add separate definitions for DS insts without m0 use llvm-svn: 318246	2017-11-15 01:34:06 +00:00
Matt Arsenault	45b98189bd	AMDGPU: Don't use MUBUF vaddr if address may overflow Effectively revert r263964. Before we would not allow this if vaddr was not known to be positive. llvm-svn: 318240	2017-11-15 00:45:43 +00:00
Matt Arsenault	c8903125cd	AMDGPU: Handle or in multi-use shl ptr combine llvm-svn: 318223	2017-11-14 23:46:42 +00:00
Simon Dardis	de5ed0c58e	Reland "[mips][mt][6/7] Add support for mftr, mttr instructions." This adjusts the tests to hopfully pacify the llvm-clang-x86_64-expensive-checks-win buildbot. Unlike many other instructions, these instructions have aliases which take coprocessor registers, gpr register, accumulator (and dsp accumulator) registers, floating point registers, floating point control registers and coprocessor 2 data and control operands. For the moment, these aliases are treated as pseudo instructions which are expanded into the underlying instruction. As a result, disassembling these instructions shows the underlying instruction and not the alias. Reviewers: slthakur, atanasyan Differential Revision: https://reviews.llvm.org/D35253 llvm-svn: 318207	2017-11-14 22:26:42 +00:00
Richard Smith	7007f07664	Fix unused variable warning. llvm-svn: 318201	2017-11-14 21:26:46 +00:00
Matt Arsenault	9ba465a972	AMDGPU: Error on stack size overflow llvm-svn: 318189	2017-11-14 20:33:14 +00:00
Ulrich Weigand	5f4373a2fc	[SystemZ] Do not crash when selecting an OR of two constants In rare cases, common code will attempt to select an OR of two constants. This confuses the logic in splitLargeImmediate, causing an internal error during isel. Fixed by simply leaving this case to common code to handle. This fixes PR34859. llvm-svn: 318187	2017-11-14 20:00:34 +00:00
Evandro Menezes	1c94538693	[AArch64] Adjust the cost model for Exynos M1 and M2 Fix the modeling of loads and stores of registers pairs. llvm-svn: 318186	2017-11-14 19:59:43 +00:00
Martin Storsjo	4629f52312	[ARM, AArch64] Fix an assert message, Darwin isn't the only target supporting TLS. NFC. llvm-svn: 318184	2017-11-14 19:57:59 +00:00
Ulrich Weigand	55b8590e03	[SystemZ] Fix invalid codegen using RISBMux on out-of-range bits Before using the 32-bit RISBMux set of instructions we need to verify that the input bits are actually within range of the 32-bit instruction. This fixer PR35289. llvm-svn: 318177	2017-11-14 19:20:46 +00:00
Artem Belevich	55dcf5e586	Mark intrinsics operating on the whole warp as IntrInaccessibleMemOnly It's needed to model the fact that they do access data from other threads in a warp and thus can't be CSE'd. llvm-svn: 318173	2017-11-14 19:14:00 +00:00
Craig Topper	2153114227	[X86] Fix typo in comment. NFC llvm-svn: 318156	2017-11-14 16:14:00 +00:00
Tim Northover	5cdc4f9c33	ARM: correctly update CFG when splitting BB to fix branch. Because the block-splitting code is multi-purpose, we have to meddle with the branches when using it to fixup a conditional branch destination. We got the code right, but forgot to update the CFG so the verifier complained when expensive checks were on. Probably harmless since constant-islands comes so late, but best to fix it anyway. llvm-svn: 318148	2017-11-14 11:43:54 +00:00
Diana Picus	21a42bcc0b	[ARM GlobalISel] Remove C++ code for G_CONSTANT Get rid of the handwritten instruction selector code for handling G_CONSTANT. This code wasn't checking all the preconditions correctly anyway, so it's better to leave it to TableGen, which can handle at least some cases correctly (e.g. MOVi, MOVi16, folding into binary operations). Also add tests to cover those cases. llvm-svn: 318146	2017-11-14 11:20:32 +00:00
Momchil Velikov	dc86e1444d	[ARM] Fix incorrect conversion of a tail call to an ordinary call When we emit a tail call for Armv8-M, but then discover that the caller needs to save/restore `LR`, we convert the tail call to an ordinary one, since restoring `LR` takes extra instructions, which may negate the benefits of the tail call. If the callee, however, takes stack arguments, this conversion is incorrect, since nothing has been done to pass the stack arguments. Thus the patch reverts https://reviews.llvm.org/rL294000 Also, we improve the instruction sequence for popping `LR` in the case when we couldn't immediately find a scratch low register, but we can use as a temporary one of the callee-saved low registers and restore `LR` before popping other callee-saves. Differential Revision: https://reviews.llvm.org/D39599 llvm-svn: 318143	2017-11-14 10:36:52 +00:00

1 2 3 4 5 ...

44762 Commits