llvm-project

Commit Graph

Author	SHA1	Message	Date
Juan Manuel MARTINEZ CAAMAÑO	df7606a066	[DebugInfo][LICM] Drop DebugLoc from IntrinsicInst when hoisting The DebugLoc is conserved when hoisting function calls, to ensure the DIScope is preserved if inlining occurs. This commit drops the DebugLoc in the case the call is an intrinsic call that won't be lowered into a function call. Differential Revision: https://reviews.llvm.org/D134429	2022-09-30 09:12:35 +00:00
Craig Topper	90b695d1f2	[IR][InstCombine] Support scalable vector splats ConstantExprs in Constant::getUniqueInteger(). I've left the getAggregateElement as a fast path for non-ConstantExprs to avoid a call to getSplatValue in release builds. Fixes PR57989. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D134670	2022-09-26 14:55:15 -07:00
Caroline Concatto	5431bf27bd	[AArch64] Remove svget/svset/svcreate from llvm This patch removes the aarch64 intrinsics svget/svset/svcreate from llvm. It also implements the InstCombine for vector.extract that used to be in svget. Depends on: D131547 Differential Revision: https://reviews.llvm.org/D131548	2022-09-23 10:48:43 +01:00
Nikita Popov	14947cc4cd	[IR] Handle assume intrinsics in hasClobberingOperandBundle() Operand bundles on assumes do not read or write -- we correctly modelled the read side of this, but not the write side. In practice this did not matter because of how the method is used, but this will become relevant for a future patch.	2022-09-23 10:26:58 +02:00
Jonathan Camilleri	4cd7529e4c	[clang][DebugInfo] Emit access specifiers for typedefs The accessibility level of a typedef or using declaration in a struct or class was being lost when producing debug information. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D134339	2022-09-22 17:08:41 +00:00
Juan Manuel MARTINEZ CAAMAÑO	b1f76e8c57	[NFC] Remove unused set construction from DILocation::getMergedLocation Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D134357	2022-09-22 07:00:02 +00:00
Caroline Concatto	d32b8fdbdb	[LLVM][AArch64] Replace aarch64.sve.ld by aarch64.sve.ldN.sret This patch removes the intrinsic aarch64.sve.ldN from tablegen in favour of using aarch64.sve.ldN.sret. Depends on: D133023 Differential Revision: https://reviews.llvm.org/D133025	2022-09-20 13:15:07 +01:00
Vitaly Buka	bbef90ace4	[IRBuilder] Use PoisonValue in CreateMasked* Followup to `72b776168c` Reviewed By: nlopes Differential Revision: https://reviews.llvm.org/D133967	2022-09-19 11:01:41 -07:00
Sander de Smalen	45d28779c5	[AArch64][SME] Fix lowering of llvm.aarch64.get.pstatesm() A thread may not have access to SME or TPIDR2_EL0, so in order to safely query PSTATE.SM in a streaming-compatible function, the code should call `__arm_sme_state()`, as described in the ABI: `c2bb09c4d4` This means that the value of pstate.sm is: * 0 if the function is non-streaming. * 1 if the function has `arm_streaming` or `arm_locally_streaming`. * evaluated at runtime by a call to __arm_sme_state() otherwise. This patch also adds a calling convention for calls to SME support routines. At some point we can remove the need for the llvm.aarch64.get.pstatesm() intrinsic and use function calls (with the corresponding cc) directly instead. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D131571	2022-09-15 15:14:13 +00:00
Fangrui Song	057fb8153a	[IRBuilder] Fix -Wunused-variable in non-assertion build. NFC	2022-09-14 22:14:36 -07:00
Vitaly Buka	72b776168c	[IRBuilder] Add CreateMaskedExpandLoad and CreateMaskedCompressStore	2022-09-14 19:18:52 -07:00
Alexander Shaposhnikov	b6965f7246	[IR] Add alignment for llvm.threadlocal.address This diff sets the alignment attribute for the return value and the argument of llvm.threadlocal.address. (https://github.com/llvm/llvm-project/issues/57438) Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D133741	2022-09-13 23:10:55 +00:00
Alexey Bataev	dfe1e9dd79	[SLP]Improve reordering of clustered reused scalars. If the reused scalars are clustered, i.e. each part of the reused mask contains all elements of the original scalars exactly once, we can reorder those clusters to improve the whole ordering of the clustered vectors. Differential Revision: https://reviews.llvm.org/D133524	2022-09-12 06:52:25 -07:00
Sander de Smalen	cf72dddaef	[AArch64][SME] Add utility class for handling SME attributes. This patch adds a utility class that will be used in subsequent patches for parsing the function/callsite attributes and determining whether changes to PSTATE.SM are needed, or whether a lazy-save mechanism is required. It also implements some of the restrictions on the SME attributes in the IR Verifier pass. More details about the SME attributes and design can be found in D131562. Reviewed By: david-arm, aemerson Differential Revision: https://reviews.llvm.org/D131570	2022-09-12 12:41:30 +00:00
Junduo Dong	6975ab7126	[Clang] Reimplement time tracing of NewPassManager by PassInstrumentation framework The previous implementation of time tracing in NewPassManager is direct but messy. The key code looks like the example below: ``` /// Runs the function pass across every function in the module. PreservedAnalyses run(LazyCallGraph::SCC &C, CGSCCAnalysisManager &AM, LazyCallGraph &CG, CGSCCUpdateResult &UR) { /// ... PreservedAnalyses PassPA; { TimeTraceScope TimeScope(Pass.name()); PassPA = Pass.run(F, FAM); } /// ... } ``` It is tedious to decide by hand where the tracing code should be added. With the PassInstrumentation framework, we can easily add `Before/After` callback functions that contain the time tracing code. Differential Revision: https://reviews.llvm.org/D131960	2022-09-11 05:42:55 -07:00
Sebastian Neubauer	c7750c522e	Add helper func to get first non-alloca position The LLVM performance tips suggest that allocas should be placed at the beginning of the entry block. So far, llvm doesn’t provide any helper to find that position. Add BasicBlock::getFirstNonPHIOrDbgOrAlloca and IRBuilder::SetInsertPointPastAllocas(Function*) that get an insert position after the (static) allocas at the start of a function and use it in ShadowStackGCLowering. Differential Revision: https://reviews.llvm.org/D132554	2022-09-09 15:39:53 +02:00
Joe Loser	5e96cea1db	[llvm] Use std::size instead of llvm::array_lengthof LLVM contains a helpful function for getting the size of a C-style array: `llvm::array_lengthof`. This is useful prior to C++17, but not as helpful for C++17 or later: `std::size` already has support for C-style arrays. Change call sites to use `std::size` instead. Differential Revision: https://reviews.llvm.org/D133429	2022-09-08 09:01:53 -06:00
Nikita Popov	96cb7c2273	[ConstantExpr] Remove fneg expression As part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179, this removes the fneg constant expression (which is, incidentally, the only unary operator expression). Differential Revision: https://reviews.llvm.org/D133418	2022-09-08 10:24:55 +02:00
Fangrui Song	f48931f3a8	[NewPM] Switch -filter-passes from ClassName to pass-name NewPM -filter-passes (D86360) uses ClassName instead of pass-name as used in `-passes`, `-print-after`, etc. D87216 has added a mechanism to map ClassName to pass-name. Adopt it for -filter-passes. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D133263	2022-09-07 22:02:26 -07:00
Nikita Popov	e047a4ab55	[ConstantFold] Avoid unary ConstantExpr::get() Call ConstantFoldUnaryInstruction() instead, to only produce a result if it folds.	2022-09-07 12:00:26 +02:00
Fangrui Song	97d00b72a2	[Verifier] Allow dllexport protected after D133267 I have noticed that this combo makes sense (D133266) but rejected it for simplicity. It turns out to be used by PlayStation, so let's allow it.	2022-09-06 15:28:29 -07:00
Marco Elver	c70f6e1362	[Metadata] Introduce MD_pcsections Introduces MD_pcsections metadata kind. See added documentation for more details. Subsequent patches enable propagating PC sections metadata through code generation to the AsmPrinter. RFC: https://discourse.llvm.org/t/rfc-pc-keyed-metadata-at-runtime/64191 Reviewed By: dvyukov, vitalybuka Differential Revision: https://reviews.llvm.org/D130875	2022-09-06 15:52:44 +02:00
Kazu Hirata	5a68d6721d	Revert "Use llvm::none_of (NFC)" This partially reverts commit `f5a68feab3`. Fixes https://github.com/llvm/llvm-project/issues/57557	2022-09-05 14:42:34 -07:00
Fangrui Song	2417618d5c	[Verifier] Reject dllexport with non-default visibility Add a visibility check for dllimport and dllexport. Note: dllimport with a non-default visibility (implicit dso_local) is already rejected, but with a less clear dso_local error. The MC level visibility `MCSA_Exported` (D123951) is mapped from IR level default visibility when dllexport is specified. The D123951 error is now very difficult to trigger (needs to disable the IR verifier). Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D133267	2022-09-05 10:53:41 -07:00
Yuanfang Chen	041ec82242	[Verifier] Skip debug location check for some non-inlinable functions If a callee function is not interposable, skip debug location check for its callsites. Doing this is instrumentation-friendly otherwise under some conditions this check triggers for some un-inlinable call sites. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D133060	2022-09-02 11:03:55 -07:00
Rong Xu	0caa4a9559	[PGO] Support PGO annotation of CallBrInst We currently instrument CallBrInst but do not annotate it with the branch weight. This patch enables PGO annotation of CallBrInst. Differential Revision: https://reviews.llvm.org/D133040	2022-09-01 14:13:50 -07:00
Fangrui Song	8d95fd7e56	[MachineFunctionPass] Support -filter-passes for -print-changed -filter-passes specifies a `PassID` (a lower-case dash-separated pass name, also used by -print-after, -stop-after, etc) instead of a CamelCasePass. `-filter-passes=CamelCaseNewPMPass` seems like a workaround for new PM passes before we can use lower-case dash-separated pass names (as used by `-passes=`). Example: ``` # getPassName() is "IRTranslator". PassID is "irtranslator" llc -mtriple=aarch64 -print-changed -filter-passes=irtranslator < print-changed-machine.ll ``` Close https://github.com/llvm/llvm-project/issues/57453 Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D133055	2022-09-01 11:06:06 -07:00
Nikita Popov	972840aa3b	[IR] Add Instruction::getInsertionPointAfterDef() Transforms occasionally want to insert an instruction directly after the definition point of a value. This involves quite a few different edge cases, e.g. for phi nodes the next insertion point is not the next instruction, and for invokes and callbrs it's not even in the same block. Additionally, the insertion point may not exist at all if catchswitch is involved. This adds a general Instruction::getInsertionPointAfterDef() API to implement the necessary logic. For now it is used in two places where this should be mostly NFC. I will follow up with additional uses where this fixes specific bugs in the existing implementations. Differential Revision: https://reviews.llvm.org/D129660	2022-08-31 10:50:10 +02:00
Sanjay Patel	15090afd66	[Verifier] remove stale comment about PHI with no operands; NFC The code was changed with: `9eb2c0113d` ...but missed the corresponding code comment.	2022-08-30 15:38:29 -04:00
Kazu Hirata	0e9d37ff95	[llvm] Qualify auto in range-based for loops (NFC)	2022-08-28 23:29:00 -07:00
Kazu Hirata	21de2888a4	Use llvm::is_contained (NFC)	2022-08-27 09:53:11 -07:00
Paul Kirth	deef5b8c0d	[llvm][ir][NFC] Clean up "if after return" in ProfDataUtils.cpp Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D131236	2022-08-26 20:27:39 +00:00
Sami Tolvanen	cff5bef948	KCFI sanitizer The KCFI sanitizer, enabled with `-fsanitize=kcfi`, implements a forward-edge control flow integrity scheme for indirect calls. It uses a !kcfi_type metadata node to attach a type identifier for each function and injects verification code before indirect calls. Unlike the current CFI schemes implemented in LLVM, KCFI does not require LTO, does not alter function references to point to a jump table, and never breaks function address equality. KCFI is intended to be used in low-level code, such as operating system kernels, where the existing schemes can cause undue complications because of the aforementioned properties. However, unlike the existing schemes, KCFI is limited to validating only function pointers and is not compatible with executable-only memory. KCFI does not provide runtime support, but always traps when a type mismatch is encountered. Users of the scheme are expected to handle the trap. With `-fsanitize=kcfi`, Clang emits a `kcfi` operand bundle to indirect calls, and LLVM lowers this to a known architecture-specific sequence of instructions for each callsite to make runtime patching easier for users who require this functionality. A KCFI type identifier is a 32-bit constant produced by taking the lower half of xxHash64 from a C++ mangled typename. If a program contains indirect calls to assembly functions, they must be manually annotated with the expected type identifiers to prevent errors. To make this easier, Clang generates a weak SHN_ABS `__kcfi_typeid_<function>` symbol for each address-taken function declaration, which can be used to annotate functions in assembly as long as at least one C translation unit linked into the program takes the function address. For example on AArch64, we might have the following code: ``` .c: int f(void); int (*p)(void) = f; p(); .s: .4byte __kcfi_typeid_f .global f f: ... ``` Note that X86 uses a different preamble format for compatibility with Linux kernel tooling. 
See the comments in `X86AsmPrinter::emitKCFITypeId` for details. As users of KCFI may need to locate trap locations for binary validation and error handling, LLVM can additionally emit the locations of traps to a `.kcfi_traps` section. Similarly to other sanitizers, KCFI checking can be disabled for a function with a `no_sanitize("kcfi")` function attribute. Relands `67504c9549` with a fix for 32-bit builds. Reviewed By: nickdesaulniers, kees, joaomoreira, MaskRay Differential Revision: https://reviews.llvm.org/D119296	2022-08-24 22:41:38 +00:00
Sami Tolvanen	a79060e275	Revert "KCFI sanitizer" This reverts commit `67504c9549` as using PointerEmbeddedInt to store 32 bits breaks 32-bit arm builds.	2022-08-24 19:30:13 +00:00
Sami Tolvanen	67504c9549	KCFI sanitizer The KCFI sanitizer, enabled with `-fsanitize=kcfi`, implements a forward-edge control flow integrity scheme for indirect calls. It uses a !kcfi_type metadata node to attach a type identifier for each function and injects verification code before indirect calls. Unlike the current CFI schemes implemented in LLVM, KCFI does not require LTO, does not alter function references to point to a jump table, and never breaks function address equality. KCFI is intended to be used in low-level code, such as operating system kernels, where the existing schemes can cause undue complications because of the aforementioned properties. However, unlike the existing schemes, KCFI is limited to validating only function pointers and is not compatible with executable-only memory. KCFI does not provide runtime support, but always traps when a type mismatch is encountered. Users of the scheme are expected to handle the trap. With `-fsanitize=kcfi`, Clang emits a `kcfi` operand bundle to indirect calls, and LLVM lowers this to a known architecture-specific sequence of instructions for each callsite to make runtime patching easier for users who require this functionality. A KCFI type identifier is a 32-bit constant produced by taking the lower half of xxHash64 from a C++ mangled typename. If a program contains indirect calls to assembly functions, they must be manually annotated with the expected type identifiers to prevent errors. To make this easier, Clang generates a weak SHN_ABS `__kcfi_typeid_<function>` symbol for each address-taken function declaration, which can be used to annotate functions in assembly as long as at least one C translation unit linked into the program takes the function address. For example on AArch64, we might have the following code: ``` .c: int f(void); int (*p)(void) = f; p(); .s: .4byte __kcfi_typeid_f .global f f: ... ``` Note that X86 uses a different preamble format for compatibility with Linux kernel tooling. 
See the comments in `X86AsmPrinter::emitKCFITypeId` for details. As users of KCFI may need to locate trap locations for binary validation and error handling, LLVM can additionally emit the locations of traps to a `.kcfi_traps` section. Similarly to other sanitizers, KCFI checking can be disabled for a function with a `no_sanitize("kcfi")` function attribute. Reviewed By: nickdesaulniers, kees, joaomoreira, MaskRay Differential Revision: https://reviews.llvm.org/D119296	2022-08-24 18:52:42 +00:00
Simon Pilgrim	9317e6311f	[TTI] Add SK_Splice shuffle mask detection and X86 costs Enables fixed sized vectors to detect SK_Splice shuffle patterns and provides basic X86 cost support Differential Revision: https://reviews.llvm.org/D132374	2022-08-23 20:07:30 +01:00
Jakub Kuderski	6fa87ec10f	[ADT] Deprecate is_splat and replace all uses with all_equal See the discussion thread for more details: https://discourse.llvm.org/t/adt-is-splat-and-empty-ranges/64692 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D132335	2022-08-23 11:36:27 -04:00
Florian Hahn	5913d77056	[Globals] Treat nobuiltin fns as maybe-derefined. Callsites could be marked as `builtin` while calling `nobuiltin` functions. This can lead to problems if local optimizations apply transformations based on the semantics of the builtin, but then IPO treats the function as `nobuiltin` and applies a transform that breaks builtin semantics (assumed earlier). To avoid this, mark such functions as maybe-derefined, to avoid IPO transforms on them that may break assumptions of earlier calls. Fixes #57075 Fixes #48366 Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D97735	2022-08-23 13:45:10 +01:00
liqinweng	9181ab9223	[NFC] Use llvm::all_of instead of std::all_of Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D131886	2022-08-23 12:21:53 +08:00
Kazu Hirata	258531b7ac	Remove redundant initialization of Optional (NFC)	2022-08-20 21:18:28 -07:00
Adrian Prantl	2d48b403f6	Fix UB in DIExpression::appendOffset() The absolute value of 0x8000000000000000 does not fit into an int64_t and UBSan tells us that by crashing. rdar://98799670 Differential Revision: https://reviews.llvm.org/D132194	2022-08-19 09:16:59 -07:00
Archibald Elliott	3a729069e4	[IR] Update llvm.prefetch to match docs The current llvm.prefetch intrinsic docs state "The rw, locality and cache type arguments must be constant integers." This change: - Makes arg 3 (cache type) an ImmArg - Improves the verifier error messages to reference the incorrect argument. - Fixes two tests which contradict the docs. This is needed as the lowering to GlobalISel is different for ImmArgs compared to other constants. The non-ImmArgs create a G_CONSTANT MIR instruction, while for ImmArgs the constant is put directly on the intrinsic's MIR instruction as an immediate. Differential Revision: https://reviews.llvm.org/D132042	2022-08-19 09:11:17 +01:00
Fangrui Song	c2a3888793	[IR] Use Min behavior for module flag "PIC Level" Using Max for both "PIC Level" and "PIE Level" is inconsistent. PIC imposes less restriction while PIE imposes more restriction. The result generally picks the more restrictive behavior: Min for PIC. This choice matches `ld -r`: a non-pic object and a pic object merge into a result which should be treated as non-pic. To allow linking "PIC Level" using Error/Max from old bitcode files, upgrade Error/Max to Min. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D130531	2022-08-18 16:28:55 -07:00
Nick Desaulniers	6b0e2fa6f0	[SelectionDAG] make INLINEASM_BR use MachineBasicBlocks instead of BlockAddresses As part of re-architecting callbr to no longer use blockaddresses (https://reviews.llvm.org/D129288), we don't really need them in MIR. They make comparing MachineBasicBlocks of indirect targets during MachineVerifier a PITA. Suggested by @efriedma from the discussion: https://reviews.llvm.org/D130290#3669531 Reviewed By: efriedma, void Differential Revision: https://reviews.llvm.org/D130316	2022-08-17 09:34:31 -07:00
Paul Kirth	656c5d652c	[clang][llvm][NFC] Change misexpect's tolerance option to be 32-bit In D131869 we noticed that we jump through some hoops because we parse the tolerance option used in MisExpect.cpp into a 64-bit integer. This is unnecessary, since the value can only be in the range [0, 100). This patch changes the underlying type to be 32-bit from where it is parsed in Clang through to its use in LLVM. Reviewed By: jloser Differential Revision: https://reviews.llvm.org/D131935	2022-08-17 14:38:53 +00:00
Kazu Hirata	f5a68feab3	Use llvm::none_of (NFC)	2022-08-14 16:25:39 -07:00
Kazu Hirata	6d9cd9199a	Use llvm::all_of (NFC)	2022-08-14 16:25:36 -07:00
Kazu Hirata	109df7f9a4	[llvm] Qualify auto in range-based for loops (NFC) Identified with readability-qualified-auto.	2022-08-13 12:55:42 -07:00
Martin Storsjö	c5b3de6745	[COFF] Emit embedded -exclude-symbols: directives for hidden visibility for MinGW This works with the automatic export of all symbols; in MinGW mode, when a DLL has no explicit dllexports, it exports all symbols (except for some that are hardcoded to be excluded, including some toolchain libraries). By hooking up the hidden visibility to the -exclude-symbols: directive, the automatic export of all symbols can be controlled in an easier way (with a mechanism that doesn't require strict annotation of every single symbol, but which allows gradually marking more unnecessary symbols as hidden). The primary use case is dylib builds of LLVM/Clang. These can be done in MinGW mode but not in MSVC mode, as MinGW builds can export all symbols (and the calling code can use APIs without corresponding dllimport directives). However, as all symbols are exported, it can easily overflow the max number of exported symbols in a DLL (65536). In the llvm-mingw distribution, only the X86, ARM and AArch64 backends are enabled; for the LLVM 13.0.0 release, libLLVM-13.dll ended up with 58112 exported symbols. For LLVM 14.0.0, it was 62015 symbols. Current builds of the 15.x branch end up at around 64650 symbols - i.e. extremely close to the limit. The msys2 packages of LLVM have had to progressively disable more of their backends in their builds, to be able to keep building with a dylib. This allows improving the current mingw dylib situation significantly, by using the same hidden visibility options and attributes as on Unix. With those in place, a current build of LLVM git main ends up at 35142 symbols instead of 64650. For code using hidden visibility, this now requires linking with either a current git lld or ld.bfd. (Older lld error out on the unknown directives, older ld.bfd will successfully link, but will print huge amounts of warnings.) Differential Revision: https://reviews.llvm.org/D130121	2022-08-11 12:00:08 +03:00
Fangrui Song	de9d80c1c5	[llvm] LLVM_FALLTHROUGH => [[fallthrough]]. NFC With C++17 there is no Clang pedantic warning or MSVC C5051.	2022-08-08 11:24:15 -07:00
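
As a rough illustration of the last entry above (de9d80c1c5), the sketch below shows the standard C++17 `[[fallthrough]]` attribute used where a macro such as `LLVM_FALLTHROUGH` would previously have appeared. The `Level` enum and the `describe` and `checkDescribe` functions are hypothetical names invented for this example; they are not taken from the LLVM sources.

```cpp
#include <cassert>
#include <string>

// Hypothetical enum, for illustration only (not an LLVM type).
enum class Level { Low, Medium, High };

// Builds a description in which each level also includes every lower level.
// The cases deliberately fall through; [[fallthrough]] documents that intent
// and silences implicit-fallthrough warnings in C++17 and later.
std::string describe(Level L) {
  std::string Out;
  switch (L) {
  case Level::High:
    Out += "high;";
    [[fallthrough]];
  case Level::Medium:
    Out += "medium;";
    [[fallthrough]];
  case Level::Low:
    Out += "low;";
    break;
  }
  return Out;
}

// Small self-check showing the expected accumulation for the highest level.
inline void checkDescribe() {
  assert(describe(Level::High) == "high;medium;low;");
}
```

The attribute must be written as its own statement immediately before the next `case` label, which is why each one is followed by a semicolon.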

5441 Commits