llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	de9d80c1c5	[llvm] LLVM_FALLTHROUGH => [[fallthrough]]. NFC With C++17 there is no Clang pedantic warning or MSVC C5051.	2022-08-08 11:24:15 -07:00
Kjetil Kjeka	ff1920d106	[NVPTX] Promote i24, i40, i48 and i56 to next power-of-two register when passing Today llc will crash when attempting to use non-power-of-two integer types as function arguments or returns. This patch enables passing non standard integer values in functions by promoting them before store and truncating after load. The main motivation of implementing this change is that rust casts small structs (less than pointer size) into an integer of the same size. As an example, if a struct contains three u8 then it will be passed as an i24. This patch is a step towards enabling rust compilation to ptx while retaining the target independent optimizations. More context can be found in https://github.com/llvm/llvm-project/issues/55764 Differential Revision: https://reviews.llvm.org/D129291	2022-07-22 14:14:12 -07:00
Igor Kudrin	32eed8828e	Reapply "[NVPTX] Use the mask() operator to initialize packed structs with pointers" The original patch revealed an issue of reading incorrect values on BE hosts. That is now changed to use `endian::read32le()` and `endian::read64le()`. Original commit message: The current implementation assumes that all pointers used in the initialization of an aggregate are aligned according to the pointer size of the target; that might not be so if the object is packed. In that case, an array of .u8 should be used and pointers should be decorated with the mask() operator. The operator was introduced in PTX ISA 7.1, so an error is issued if the case is detected for an earlier version. Differential Revision: https://reviews.llvm.org/D127504	2022-07-18 20:56:26 +04:00
Igor Kudrin	1e451369d2	Revert "[NVPTX] Use the mask() operator to initialize packed structs with pointers" The new test fails on BE hosts. This reverts commit `04e978ccba`.	2022-07-18 20:08:39 +04:00
Igor Kudrin	04e978ccba	[NVPTX] Use the mask() operator to initialize packed structs with pointers The current implementation assumes that all pointers used in the initialization of an aggregate are aligned according to the pointer size of the target; that might not be so if the object is packed. In that case, an array of .u8 should be used and pointers should be decorated with the mask() operator. The operator was introduced in PTX ISA 7.1, so an error is issued if the case is detected for an earlier version. Differential Revision: https://reviews.llvm.org/D127504	2022-07-18 04:08:59 -07:00
Igor Kudrin	e27f9214c0	[NVPTX][NFC] Simplify printing initialization of aggregates This simplifies NVPTXAsmPrinter::AggBuffer and its usage. It is also a preparation for D127504. Differential Revision: https://reviews.llvm.org/D129773	2022-07-18 04:08:59 -07:00
David Green	3e0bf1c7a9	[CodeGen] Move instruction predicate verification to emitInstruction D25618 added a method to verify the instruction predicates for an emitted instruction, through verifyInstructionPredicates added into <Target>MCCodeEmitter::encodeInstruction. This is a very useful idea, but the implementation inside MCCodeEmitter made it only fire for object files, not assembly which most of the llvm test suite uses. This patch moves the code into the <Target>_MC::verifyInstructionPredicates method, inside the InstrInfo. This allows it to be called from other places, such as in this patch where it is called from the <Target>AsmPrinter::emitInstruction methods which should trigger for both assembly and object files. It can also be called from other places such as verifyInstruction, but that is not done here (it tends to catch errors earlier, but in reality just shows all the mir tests that have incorrect feature predicates). The interface was also simplified slightly, moving computeAvailableFeatures into the function so that it does not need to be called externally. The ARM, AMDGPU (but not R600), AVR, Mips and X86 backends all currently show errors in the test-suite, so have been disabled with FIXME comments. Recommitted with some fixes for the leftover MCII variables in release builds. Differential Revision: https://reviews.llvm.org/D129506	2022-07-14 09:33:28 +01:00
Kazu Hirata	611ffcf4e4	[llvm] Use value instead of getValue (NFC)	2022-07-13 23:11:56 -07:00
David Green	95252133e1	Revert "Move instruction predicate verification to emitInstruction" This reverts commit `e2fb8c0f4b` as it does not build for Release builds, and some buildbots are giving more warning than I saw locally. Reverting to fix those issues.	2022-07-13 13:28:11 +01:00
David Green	e2fb8c0f4b	Move instruction predicate verification to emitInstruction D25618 added a method to verify the instruction predicates for an emitted instruction, through verifyInstructionPredicates added into <Target>MCCodeEmitter::encodeInstruction. This is a very useful idea, but the implementation inside MCCodeEmitter made it only fire for object files, not assembly which most of the llvm test suite uses. This patch moves the code into the <Target>_MC::verifyInstructionPredicates method, inside the InstrInfo. This allows it to be called from other places, such as in this patch where it is called from the <Target>AsmPrinter::emitInstruction methods which should trigger for both assembly and object files. It can also be called from other places such as verifyInstruction, but that is not done here (it tends to catch errors earlier, but in reality just shows all the mir tests that have incorrect feature predicates). The interface was also simplified slightly, moving computeAvailableFeatures into the function so that it does not need to be called externally. The ARM, AMDGPU (but not R600), AVR, Mips and X86 backends all currently show errors in the test-suite, so have been disabled with FIXME comments. Differential Revision: https://reviews.llvm.org/D129506	2022-07-13 12:53:32 +01:00
Igor Kudrin	9ff10a0d62	[NVPTX] Add missing pass names Differential Revision:	2022-07-12 07:58:13 -07:00
Nicolai Hähnle	ede600377c	ManagedStatic: remove many straightforward uses in llvm (Reapply after revert in `e9ce1a5880` due to Fuchsia test failures. Removed changes in lib/ExecutionEngine/ other than error categories, to be checked in more detail and reapplied separately.) Bulk remove many of the more trivial uses of ManagedStatic in the llvm directory, either by defining a new getter function or, in many cases, moving the static variable directly into the only function that uses it. Differential Revision: https://reviews.llvm.org/D129120	2022-07-10 10:29:15 +02:00
Nicolai Hähnle	e9ce1a5880	Revert "ManagedStatic: remove many straightforward uses in llvm" This reverts commit `e6f1f06245`. Reverting due to a failure on the fuchsia-x86_64-linux buildbot.	2022-07-10 09:54:30 +02:00
Nicolai Hähnle	e6f1f06245	ManagedStatic: remove many straightforward uses in llvm Bulk remove many of the more trivial uses of ManagedStatic in the llvm directory, either by defining a new getter function or, in many cases, moving the static variable directly into the only function that uses it. Differential Revision: https://reviews.llvm.org/D129120	2022-07-10 09:15:08 +02:00
Nikita Popov	7283f48a05	[IR] Remove support for insertvalue constant expression This removes the insertvalue constant expression, as part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179. This is very similar to the extractvalue removal from D125795. insertvalue is also not supported in bitcode, so no auto-upgrade is necessary. ConstantExpr::getInsertValue() can be replaced with IRBuilder::CreateInsertValue() or ConstantFoldInsertValueInstruction(), depending on whether a constant result is required (with the latter being fallible). The ConstantExpr::hasIndices() and ConstantExpr::getIndices() methods also go away here, because there are no longer any constant expressions with indices. Differential Revision: https://reviews.llvm.org/D128719	2022-07-04 09:27:22 +02:00
Nikita Popov	16033ffdd9	[ConstExpr] Remove more leftovers of extractvalue expression (NFC) Remove some leftover bits of extractvalue handling after the removal in D125795.	2022-06-29 10:45:19 +02:00
Kazu Hirata	a7938c74f1	[llvm] Don't use Optional::hasValue (NFC) This patch replaces Optional::hasValue with the implicit cast to bool in conditionals only.	2022-06-25 21:42:52 -07:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Igor Kudrin	8958e70ccb	[NVPTX] Keep metadata attached to module-scope variables This helps to preserve the debug information of global variables. Differential Revision: https://reviews.llvm.org/D127510	2022-06-22 05:51:29 -07:00
Kazu Hirata	7a47ee51a1	[llvm] Don't use Optional::getValue (NFC)	2022-06-20 22:45:45 -07:00
Guillaume Chatelet	f1255186c7	[NFC][Alignment] Remove max functions between Align and MaybeAlign `llvm::max(Align, MaybeAlign)` and `llvm::max(MaybeAlign, Align)` are not used often enough to be required. They also make the code more opaque. Differential Revision: https://reviews.llvm.org/D128121	2022-06-20 08:37:48 +00:00
Kazu Hirata	129b531c9c	[llvm] Use value_or instead of getValueOr (NFC)	2022-06-18 23:07:11 -07:00
Matt Arsenault	cc5a1b3dd9	llvm-reduce: Add cloning of target MachineFunctionInfo MIR support is totally unusable for AMDGPU without this, since the set of reserved registers is set from fields here. Add a clone method to MachineFunctionInfo. This is a subtle variant of the copy constructor that is required if there are any MIR constructs that use pointers. Specifically, at minimum fields that reference MachineBasicBlocks or the MachineFunction need to be adjusted to the values in the new function.	2022-06-07 10:14:48 -04:00
Guillaume Chatelet	0788186182	[Alignment][NFC] Remove usage of MemSDNode::getAlignment I can't remove the function just yet as it is used in the generated .inc files. I would also like to provide a way to compare alignment with TypeSize since it came up a few times. Differential Revision: https://reviews.llvm.org/D126910	2022-06-07 13:52:20 +00:00
Fangrui Song	15d82c62dc	[MC] De-capitalize MCStreamer functions Follow-up to `c031378ce0`. The class is mostly consistent now.	2022-06-07 00:31:02 -07:00
Kazu Hirata	3b9707dbc0	[llvm] Convert for_each to range-based for loops (NFC)	2022-06-05 12:07:14 -07:00
Fangrui Song	557efc9a8b	[llvm] Remove unneeded cl::ZeroOrMore for cl::opt options. NFC Some cl::ZeroOrMore were added to avoid the `may only occur zero or one times!` error. More were added due to cargo cult. Since the error has been removed, cl::ZeroOrMore is unneeded. Also remove cl::init(false) while touching the lines.	2022-06-03 21:59:05 -07:00
Zongwei Lan	ad73ce318e	[Target] use getSubtarget<> instead of static_cast<>(getSubtarget()) Differential Revision: https://reviews.llvm.org/D125391	2022-05-26 11:22:41 -07:00
Christian Sigg	c4bc416418	[LLVM] Add rcp.approx.ftz.f32 intrinsic Split out from https://reviews.llvm.org/D126158. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D126369	2022-05-25 21:05:20 +02:00
Shilei Tian	ecf5b78053	[NVPTX] Enable AtomicExpandPass for NVPTX This patch enables `AtomicExpandPass` for NVPTX. Depend on D125652. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D125639	2022-05-20 17:25:28 -04:00
Nikita Popov	ed1cb01baf	[IRBuilder] Add IsInBounds parameter to CreateGEP() We commonly want to create either an inbounds or non-inbounds GEP based on a boolean value, e.g. when preserving inbounds from existing GEPs. Directly accept such a boolean in the API, rather than requiring a ternary between CreateGEP and CreateInBoundsGEP. This change is not entirely NFC, because we now preserve an inbounds flag in a constant expression edge-case in InstCombine.	2022-05-13 14:30:55 +02:00
Dmitry Vassiliev	2e7e0975c0	[NVPTX] Prefix "$L__" for branch label names A global variable may have the same name as a label, and ptxas does not accept it. Prefix labels with $L__ to fix this. Reviewed By: MaskRay, tra Differential Revision: https://reviews.llvm.org/D119669	2022-04-30 21:55:20 +02:00
Dmitry Vassiliev	8c49ab040c	[NVPTX] Add add.cc/addc.cc/sub.cc/subc.cc for i64 PTX supports those instructions for i64 starting from 4.3. The patch also marks corresponding DAG nodes legal for both i32 and i64. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D124698	2022-04-29 15:32:22 -07:00
Andrew Savonichev	0a27622a1d	[NVPTX] Disable DWARF .file directory for PTX Default behavior for .file directory was changed in D105856, but ptxas (CUDA 11.5 release) refuses to parse it: $ llc -march=nvptx64 llvm/test/DebugInfo/NVPTX/debug-file-loc.ll $ ptxas debug-file-loc.s ptxas debug-file-loc.s, line 42; fatal : Parsing error near '"foo.h"': syntax error Added a new field to MCAsmInfo to control default value of UseDwarfDirectory. This value is used if -dwarf-directory command line option is not specified. Differential Revision: https://reviews.llvm.org/D121299	2022-04-26 21:40:36 +03:00
Jakub Chlanda	76d1f5eaa8	[NVPTX] Support float <-> 2 x half bitcasts Make sure NVPTX backend can handle bitcasting between `float` and `<2 x half>` types. This was discovered through: https://github.com/intel/llvm/issues/5969 I'm not suggesting that such bitcasts make much sense, but it feels like the compiler should not hard crash on them. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D124171	2022-04-25 14:37:41 -07:00
David Green	9727c77d58	[NFC] Rename Instrinsic to Intrinsic	2022-04-25 18:13:23 +01:00
Andrew Savonichev	52053aa94f	[NVPTX] Disable parens for identifiers starting with '$' ptxas fails to parse such syntax: mov.u64 %rd1, ($str); fatal : Parsing error near '$str': syntax error A new MCAsmInfo option was added because InParens parameter of MCExpr::print is not sufficient to disable parens completely. MCExpr::print resets it to false for a recursive call in case of unary or binary expressions. Targets that require parens around identifiers that start with '$' should always pass MCAsmInfo to MCExpr::print. Therefore 'operator<<(raw_ostream &, MCExpr&)' should be avoided because it calls MCExpr::print with nullptr MAI. Differential Revision: https://reviews.llvm.org/D123702	2022-04-17 18:02:33 +03:00
Andrew Savonichev	5193f2a558	Revert "[NVPTX] Disable parens for identifiers starting with '$'" This reverts commit `78d70a1c97`. Failed on Mips32: https://lab.llvm.org/buildbot#builders/109/builds/36628 # CHECK: # fixup A - offset: 0, value: ($tmp0), kind: fixup_Mips_26 <stdin>:580:2: note: possible intended match here # fixup A - offset: 0, value: $tmp0, kind: fixup_Mips_26	2022-04-14 21:25:31 +03:00
Andrew Savonichev	78d70a1c97	[NVPTX] Disable parens for identifiers starting with '$' ptxas fails to parse such syntax: mov.u64 %rd1, ($str); fatal : Parsing error near '$str': syntax error A new MCAsmInfo option was added because InParens parameter of MCExpr::print is not sufficient to disable parens completely. MCExpr::print resets it to false for a recursive call in case of unary or binary expressions. Differential Revision: https://reviews.llvm.org/D123702	2022-04-14 21:07:43 +03:00
Andrew Savonichev	4cef5c397d	[NVPTX] .attribute(.managed) is only supported for sm_30 and PTX 4.0 PTX ISA spec, s5.4.8. Variable Attribute Directive: .attribute PTX ISA Notes Introduced in PTX ISA version 4.0. Target ISA Notes .managed attribute requires sm_30 or higher. Differential Revision: https://reviews.llvm.org/D123040	2022-04-14 17:07:52 +03:00
Andrew Savonichev	230f326964	[NVPTX] shfl.sync is introduced in PTX 6.0 PTX ISA spec, s9.7.8.6. Data Movement and Conversion Instructions: shfl.sync PTX ISA Notes Introduced in PTX ISA version 6.0. Target ISA Notes Requires sm_30 or higher. Differential Revision: https://reviews.llvm.org/D123039	2022-04-14 17:07:51 +03:00
Andrew Savonichev	369adba043	[NVPTX] 64-bit atom.{and,or,xor,min,max} require sm_32 or higher PTX ISA spec, s9.7.12.4. Parallel Synchronization and Communication Instructions: atom Target ISA Notes 64-bit atom.{and,or,xor,min,max} require sm_32 or higher. Differential Revision: https://reviews.llvm.org/D123038	2022-04-14 17:07:51 +03:00
Johannes Doerfert	0f070bee82	[NVPTX][FIX] Allow __nvvm_reflect in the presence of opaque pointers Differential Revision: https://reviews.llvm.org/D123522	2022-04-12 16:42:50 -05:00
Matt Arsenault	39f1568633	Transforms: Split LowerAtomics into separate Utils and pass This will allow code sharing from AtomicExpandPass. Not entirely sure why these exist as separate passes though.	2022-04-06 20:54:45 -04:00
Evgeniy Brevnov	acfc785c0e	Preserve aliasing info during memory intrinsics lowering By specification, source and destination of llvm.memcpy.* must either be equal or non-overlapping. These semantics are hard or impossible to figure out once lowered. This patch explicitly marks loads from source and stores to destination as not aliasing if source and destination are known not to be equal. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D118441	2022-04-06 11:33:54 +07:00
serge-sans-paille	1e02737593	[iwyu] Fix some header include regression Running iwyu-diff from https://github.com/serge-sans-paille/preprocessor-utils makes it possible to quickly spot regression in unused includes. This patch contains the few regressions since the last header cleanup. Differential Revision: https://reviews.llvm.org/D123036	2022-04-05 15:02:03 +02:00
Shao-Ce SUN	662b9fa02c	[NFC][CodeGen] Add a setTargetDAGCombine use ArrayRef Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D122557	2022-03-29 09:53:24 +08:00
Daniil Kovalev	a8c277041a	[NVPTX] Fix poorly designed assertion introduced in D120129 NVPTXTargetLowering::getFunctionParamOptimizedAlign, which was introduced in D120129, contained a poorly designed assertion checking that a function with internal or private linkage is not a kernel. It relied on invariants that were not actually guaranteed, and that resulted in a compiler crash with some CUDA versions (see discussion with @jdoerfert in D120129). This patch changes that assertion and makes it use isKernelFunction which is designed exactly for such checks. This patch also includes a test with IR that caused a compiler crash before. Differential Revision: https://reviews.llvm.org/D122562	2022-03-28 17:34:58 +03:00
Kazu Hirata	6212871968	[Target] Apply clang-tidy fixes for readability-redundant-member-init (NFC)	2022-03-27 22:22:37 -07:00


1113 Commits