llvm-project

Commit Graph

Author	SHA1	Message	Date
Serge Pavlov	9424497e43	[Clang] Use virtual FS in processing config files Clang has support of virtual file system for the purpose of testing, but treatment of config files did not use it. This change enables VFS in it as well. Differential Revision: https://reviews.llvm.org/D132867	2022-09-09 16:28:51 +07:00
Fangrui Song	781dea021a	[Support] Rename DebugCompressionType::Z to Zlib "Z" was so named when we had both gABI ELFCOMPRESS_ZLIB and the legacy .zdebug support. Now we have just one zlib format, we should use the more descriptive name.	2022-09-08 16:11:29 -07:00
raghavmedicherla	5d3cf8267f	Revert "Support: Add mapped_file_region::sync(), equivalent to msync" This reverts commit `142f51fc2f`. This shouldn't be committed, it got committed accidentally.	2022-09-08 12:49:52 -04:00
Thomas Lively	ac3b8df8f2	[WebAssembly] Prototype `f32x4.relaxed_dot_bf16x8_add_f32` As proposed in https://github.com/WebAssembly/relaxed-simd/issues/77. Only an LLVM intrinsic and a clang builtin are implemented. Since there is no bfloat16 type, use u16 to represent the bfloats in the builtin function arguments. Differential Revision: https://reviews.llvm.org/D133428	2022-09-08 08:07:49 -07:00
Joe Loser	5e96cea1db	[llvm] Use std::size instead of llvm::array_lengthof LLVM contains a helpful function for getting the size of a C-style array: `llvm::array_lengthof`. This is useful prior to C++17, but not as helpful for C++17 or later: `std::size` already has support for C-style arrays. Change call sites to use `std::size` instead. Differential Revision: https://reviews.llvm.org/D133429	2022-09-08 09:01:53 -06:00
Eric Wang	d8a2d3f7d4	[NFC][Regalloc] Introduce the RegAllocPriorityAdvisorAnalysis This patch introduces the priority analysis and the priority advisor, the default implementation, and the scaffolding for introducing the other implementations of the advisor. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D132835	2022-09-08 07:50:03 -07:00
David Spickett	e428baf001	[LLVM][ARM] Remove options for armv2, 2A, 3 and 3M Fixes #57486 These pre v4 architectures are not specifically supported by codegen. As demonstrated in the linked issue. GCC has not supported 3M since GCC 9 and presumably 2 and 2A earlier than that. So we are aligned in that sense. (see https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=2abd6e34fcf3bd9f9ffafcaa47cdc3ed443f9add) This removes the options and associated testing. The Pre_v4 build attribute remains mainly because its absence would be more confusing. It will not be used other than to complete the list of build attributes as shown in the ABI. https://github.com/ARM-software/abi-aa/blob/main/addenda32/addenda32.rst#3352the-target-related-attributes Reviewed By: nickdesaulniers, peter.smith, rengolin Differential Revision: https://reviews.llvm.org/D133109	2022-09-08 09:49:48 +00:00
Nikita Popov	96cb7c2273	[ConstantExpr] Remove fneg expression As part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179, this removes the fneg constant expression (which is, incidentally, the only unary operator expression). Differential Revision: https://reviews.llvm.org/D133418	2022-09-08 10:24:55 +02:00
Fangrui Song	b6e1fd761d	[llvm-objcopy] Support --{,de}compress-debug-sections for zstd Also, add ELFCOMPRESS_ZSTD (2) from the approved generic-abi proposal: https://groups.google.com/g/generic-abi/c/satyPkuMisk ("Add new ch_type value: ELFCOMPRESS_ZSTD") Link: https://discourse.llvm.org/t/rfc-zstandard-as-a-second-compression-method-to-llvm/63399 ("[RFC] Zstandard as a second compression method to LLVM") Reviewed By: jhenderson, dblaikie Differential Revision: https://reviews.llvm.org/D130458	2022-09-08 00:59:14 -07:00
Fangrui Song	a41977dd0f	[Support] Add llvm::compression::{getReasonIfUnsupported,compress,decompress} as high-level API on top of `llvm::compression::{zlib,zstd}::`: getReasonIfUnsupported: return nullptr if the specified format is supported, or (if unsupported) a string like `LLVM was not built with LLVM_ENABLE_ZLIB ...` * compress: dispatch to zlib::uncompress or zstd::uncompress * decompress: dispatch to zlib::uncompress or zstd::uncompress Move `llvm::DebugCompressionType` from MC to Support to avoid Support->MC cyclic dependency. There are 40+ uses in llvm-project. Add another enum class `llvm::compression::Format` to represent supported compression formats, which may be a superset of ELF compression formats. See D130458 (llvm-objcopy --{,de}compress-debug-sections for zstd) for a use case. Link: https://discourse.llvm.org/t/rfc-zstandard-as-a-second-compression-method-to-llvm/63399 ("[RFC] Zstandard as a second compression method to LLVM") --- Note: this patch alone will cause -Wswitch to llvm/lib/ObjCopy/ELF/ELFObject.cpp Reviewed By: ckissane, dblaikie Differential Revision: https://reviews.llvm.org/D130506	2022-09-08 00:58:55 -07:00
Nikita Popov	0444b40ed3	Revert "[Support] Add llvm::compression::{getReasonIfUnsupported,compress,decompress}" This reverts commit `19dc3cff0f`. This reverts commit `5b19a1f8e8`. This reverts commit `9397648ac8`. This reverts commit `10842b4475`. Breaks the GCC build, as reported here: https://reviews.llvm.org/D130506#3776415	2022-09-08 09:33:12 +02:00
Fangrui Song	10842b4475	[Support] Work around GCC's enum support	2022-09-08 00:13:25 -07:00
Fangrui Song	5b19a1f8e8	[llvm-objcopy] Support --{,de}compress-debug-sections for zstd Also, add ELFCOMPRESS_ZSTD (2) from the approved generic-abi proposal: https://groups.google.com/g/generic-abi/c/satyPkuMisk ("Add new ch_type value: ELFCOMPRESS_ZSTD") Link: https://discourse.llvm.org/t/rfc-zstandard-as-a-second-compression-method-to-llvm/63399 ("[RFC] Zstandard as a second compression method to LLVM") Reviewed By: jhenderson, dblaikie Differential Revision: https://reviews.llvm.org/D130458	2022-09-07 23:53:40 -07:00
Fangrui Song	19dc3cff0f	[Support] Add llvm::compression::{getReasonIfUnsupported,compress,decompress} as high-level API on top of `llvm::compression::{zlib,zstd}::`: getReasonIfUnsupported: return nullptr if the specified format is supported, or (if unsupported) a string like `LLVM was not built with LLVM_ENABLE_ZLIB ...` * compress: dispatch to zlib::uncompress or zstd::uncompress * decompress: dispatch to zlib::uncompress or zstd::uncompress Move `llvm::DebugCompressionType` from MC to Support to avoid Support->MC cyclic dependency. There are 40+ uses in llvm-project. Add another enum class `llvm::compression::Format` to represent supported compression formats, which may be a superset of ELF compression formats. See D130458 (llvm-objcopy --{,de}compress-debug-sections for zstd) for a use case. Link: https://discourse.llvm.org/t/rfc-zstandard-as-a-second-compression-method-to-llvm/63399 ("[RFC] Zstandard as a second compression method to LLVM") Differential Revision: https://reviews.llvm.org/D130506	2022-09-07 23:53:14 -07:00
gonglingqin	d5f7a2182d	[LoongArch] Add codegen support for atomicrmw xchg operation on LA32 Depends on D131228 Differential Revision: https://reviews.llvm.org/D131229	2022-09-08 13:57:53 +08:00
gonglingqin	b60f801607	[LoongArch] Add codegen support for atomicrmw xchg operation on LA64 In order to avoid the patch being too large, the atomicrmw xchg operation on LA32 will be added later Differential Revision: https://reviews.llvm.org/D131228	2022-09-08 13:57:26 +08:00
Fangrui Song	f48931f3a8	[NewPM] Switch -filter-passes from ClassName to pass-name NewPM -filter-passes (D86360) uses ClassName instead of pass-name as used in `-passes`, `-print-after`, etc. D87216 has added a mechanism to map ClassName to pass-name. Adopt it for -filter-passes. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D133263	2022-09-07 22:02:26 -07:00
Marco Elver	97c2220565	[SanitizerBinaryMetadata] Introduce SanitizerBinaryMetadata instrumentation pass Introduces the SanitizerBinaryMetadata instrumentation pass which uses the new MD_pcsections metadata kinds to instrument certain types of instructions and functions required for breakpoint-based sanitizers. The first intended user of the binary metadata emitted will be a variant of GWP-TSan [1]. GWP-TSan will require information about atomic accesses; to unambiguously determine if an access is atomic or not, we also require "covered" information which code has been compiled with SanitizerBinaryMetadata instrumentation enabled. [1] https://llvm.org/devmtg/2020-09/slides/Morehouse-GWP-Tsan.pdf Reviewed By: dvyukov Differential Revision: https://reviews.llvm.org/D130887	2022-09-07 21:25:40 +02:00
Andrea Di Biagio	3262794804	[MCA] Correctly check pipeline availability for partially overlapping resource groups. This patch mostly reverts commit `70b37f4c03` which fixed PR50725. In case of explicit consumption of multiple partially overlapping group resources, the ResourceManager was not correctly checking pipeline esources availability. The fix for PR50725 only partially addressed a few instances of that issue. This is a more general (although, technically slower) fix for that same issue. It also fixes Issue #57548 Thanks to Haohai Wen for the small reproducible.	2022-09-07 12:17:59 +01:00
Marco Elver	343700358f	[AsmPrinter] Emit PCs into requested PCSections Interpret MD_pcsections in AsmPrinter emitting the requested metadata to the associated sections. Functions and normal instructions are handled. Differential Revision: https://reviews.llvm.org/D130879	2022-09-07 11:36:02 +02:00
Marco Elver	31a548021b	[GlobalISel] Propagate PCSections metadata to MachineInstr Propagate (most) PC sections metadata to MachineInstr when GlobalISel is doing instruction selection. This change results in support for architectures using GlobalISel (such as -O0 with AArch64). Not all instructions may be supported yet, and requires further target-specific handling (such as done for AArch64 pseudo-atomics). Expanding supported instructions is planned on a case-by-case basis and new use cases for PC sections metadata. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130886	2022-09-07 11:36:02 +02:00
Marco Elver	0ba8886af5	[FastISel] Propagate PCSections metadata to MachineInstr Propagate PC sections metadata to MachineInstr when FastISel is doing instruction selection. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130884	2022-09-07 11:36:01 +02:00
Nikita Popov	98a3a340c3	[ConstantExpr] Don't create fneg expressions Don't create fneg expressions unless explicitly requested by IR or bitcode.	2022-09-07 11:27:25 +02:00
Marco Elver	da695de628	[MachineInstrBuilder] Introduce MIMetadata to simplify metadata propagation In many places DebugLoc and PCSections metadata are just copied along to propagate them through MachineInstrs. Simplify doing so by bundling them up in a MIMetadata class that replaces the DebugLoc argument to most BuildMI() variants. The DebugLoc-only constructors allow implicit construction, so that existing usage of `BuildMI(.., DL, ..)` works as before, and the rest of the codebase using BuildMI() does not require changes. NFC. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130883	2022-09-07 11:22:50 +02:00
Marco Elver	4c58b00801	[SelectionDAG] Propagate PCSections through SDNodes Add a new entry to SDNodeExtraInfo to propagate PCSections through SelectionDAG. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130882	2022-09-07 11:22:50 +02:00
Jay Foad	1427d55d70	[TableGen] Document sequence with stride Document (in comments) the optional fourth "stride" argument to the sequence operator, which was added in svn r157416. Differential Revision: https://reviews.llvm.org/D133297	2022-09-07 09:58:22 +01:00
Vitaly Buka	4c18670776	[NFC][sancov] Rename ModuleSanitizerCoveragePass	2022-09-06 20:55:39 -07:00
Vitaly Buka	5e38b2a456	[NFC][msan] Rename ModuleMemorySanitizerPass	2022-09-06 20:30:35 -07:00
Vitaly Buka	93600eb50c	[NFC][asan] Rename ModuleAddressSanitizerPass	2022-09-06 15:02:11 -07:00
Vitaly Buka	e7bac3b9fa	[msan] Convert Msan to ModulePass MemorySanitizerPass function pass violatied requirement 4 of function pass to do not insert globals. Msan nees to insert globals for origin tracking, and paramereters tracking. https://llvm.org/docs/WritingAnLLVMPass.html#the-functionpass-class Reviewed By: kstoimenov, fmayer Differential Revision: https://reviews.llvm.org/D133336	2022-09-06 15:01:04 -07:00
Arthur Eubanks	7f57c97d30	[ThinLTOBitcodeWriter] Mark pass as required Or else with -opt-bisect-limit we don't write ThinLTO bitcode. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D133378	2022-09-06 14:47:34 -07:00
bzcheeseman	716b9f7a1a	[LLVM][Support/ADT] Add assert for isPresent to dyn_cast. This change adds an assert to dyn_cast that the value passed-in is present. In the past, this relied on the isa_impl assertion (which still works in many cases) but which we can tighten up for a better QoI. The PointerUnion change is because it seems like (based on the call sites) the semantics of the member dyn_cast are actually dyn_cast_if_present. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D133221	2022-09-06 13:58:56 -07:00
raghavmedicherla	142f51fc2f	Support: Add mapped_file_region::sync(), equivalent to msync Add mapped_file_region::sync(), equivalent to POSIX msync, synchronizing written content to disk without unmapping the region. Asserts if the mode is not mapped_file_region::readwrite. Note that I don't have access to a Windows machine, so I can't easily run those unit tests. Change by dexonsmith Differential Revision: https://reviews.llvm.org/D95494	2022-09-06 16:46:37 -04:00
Markus Böck	f049b2c3fc	[MC] Emit Stackmaps before debug info This patch is essentially an alternative to https://reviews.llvm.org/D75836 and was mentioned by @lhames in a comment. The gist of the issue is that Mach-O has restrictions on which kind of sections are allowed after debug info has been emitted, which is also properly asserted within LLVM. Problem is that stack maps are currently emitted as one of the last sections in each target-specific AsmPrinter so far, which would cause the assertion to trigger. The current approach of special casing for the `__LLVM_STACKMAPS` section is not viable either, as downstream users can overwrite the stackmap format using plugins, which may want to use different sections. This patch fixes the issue by emitting the stack map earlier, right before debug info is emitted. The way this is implemented is by taking the choice when to emit the StackMap away from the target AsmPrinter and doing so in the base class. The only disadvantage of this approach is that the `StackMaps` member is now part of the base class, even for targets that do not support them. This is functionaly not a problem however, as emitting an empty `StackMaps` is a no-op. Differential Revision: https://reviews.llvm.org/D132708	2022-09-06 20:20:56 +02:00
Joseph Huber	58645d3252	[OpenMP] Fix `omp_get_wtime` function being marked incorrectly as readonly OpenMP has a list of of optimistic attributes that can be attached to known runtime functions to aid some analysis. The `omp_get_wtime` function incorrectly used the `readonly` attribute. This is not correct at the `omp_get_wtime` function changes values depending on some external state. This is more correctly modeled with `inaccessiblememonly` meaning that the value does not depend on anything within the module, but can not be removes as it depends on external state. Fixes #57578 Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D133360	2022-09-06 12:59:00 -05:00
Jakub Kuderski	20573d11b7	[ADT] Remove is_splat `is_splat` is superseded by `all_equal` and marked as deprecated. See the discussion thread for more details: https://discourse.llvm.org/t/adt-is-splat-and-empty-ranges/64692 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D132336	2022-09-06 13:49:26 -04:00
Matthias Gehre	7948d89afe	Fix "[llvm/CodeGen] Enable the ExpandLargeDivRem pass for X86, Arm and AArch64" compilation on Windows	2022-09-06 16:11:14 +01:00
Marco Elver	7d63983c65	[SelectionDAG] Properly copy ExtraInfo on RAUW During SelectionDAG legalization SDNodes with associated extra info may be replaced with a new SDNode. Preserve associated extra info on ReplaceAllUsesWith and remove entries in DeallocateNode. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130881	2022-09-06 16:32:50 +02:00
Marco Elver	cc3faf4226	[SelectionDAG] Rename CallSiteDbgInfo to NodeExtraInfo For information infrequently attached to SDNodes, it is useful to provide a way to add this information out-of-line. This is already done for call-site specific information. Rename CallSiteDbgInfo to NodeExtraInfo in preparation of adding additional information not necessarily related to call sites only. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130880	2022-09-06 16:32:50 +02:00
Matthias Gehre	2090e85fee	[llvm/CodeGen] Enable the ExpandLargeDivRem pass for X86, Arm and AArch64 This adds the ExpandLargeDivRem to the default pass pipeline. The limit at which it expands div/rem instructions is configured via a new TargetTransformInfo hook (default: no expansion) X86, Arm and AArch64 backends implement this hook to expand div/rem instructions with more than 128 bits. Differential Revision: https://reviews.llvm.org/D130076	2022-09-06 15:32:04 +01:00
Joseph Huber	5dbc7cf7ca	[Object] Refactor code for extracting offload binaries We currently extract offload binaries inside of the linker wrapper. Other tools may wish to do the same extraction operation. This patch simply factors out this handling into the `OffloadBinary.h` interface. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D132689	2022-09-06 08:55:16 -05:00
Marco Elver	42836e283f	[MachineInstr] Allow setting PCSections in ExtraInfo Provide MachineInstr::setPCSection(), to propagate relevant metadata through the backend. Use ExtraInfo to store the metadata. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130876	2022-09-06 15:52:44 +02:00
Marco Elver	c70f6e1362	[Metadata] Introduce MD_pcsections Introduces MD_pcsections metadata kind. See added documentation for more details. Subsequent patches enable propagating PC sections metadata through code generation to the AsmPrinter. RFC: https://discourse.llvm.org/t/rfc-pc-keyed-metadata-at-runtime/64191 Reviewed By: dvyukov, vitalybuka Differential Revision: https://reviews.llvm.org/D130875	2022-09-06 15:52:44 +02:00
Amara Emerson	3dd861818a	[GlobalISel] Combine G_INSERT/EXTRACT_VECTOR_ELT with out of bounds indices to undef. Differential Revision: https://reviews.llvm.org/D133309	2022-09-06 13:45:04 +01:00
luxufan	2e7aed1947	[MemorySSA][NFC] Simplify if condition Differential Revision: https://reviews.llvm.org/D133332	2022-09-05 10:43:17 +00:00
Eli Friedman	63335afb4e	[ARM64EC 2/?] Add target triple, and allow targeting it. Part of patchset to add initial support for ARM64EC. Per discussion on review, using the triple arm64ec-pc-windows-msvc. The parsing works the same way as Apple's alternate Arm ABI "arm64e". Differential Revision: https://reviews.llvm.org/D125412	2022-09-05 12:27:10 -07:00
Eli Friedman	488ad99ecf	[ARM64EC 1/?] Add parsing support to llvm-objdump/llvm-readobj. This is the first patch of a patchset to add initial support for ARM64EC. Basic documentation is available at https://docs.microsoft.com/en-us/windows/uwp/porting/arm64ec-abi . (Discourse post: https://discourse.llvm.org/t/initial-patches-for-arm64ec-windows-11-now-posted/62449 .) The file format for ARM64EC is basically identical to normal ARM64. There are a few extra sections, but the existing code for reading ARM64 object files just works. Differential Revision: https://reviews.llvm.org/D125411	2022-09-05 12:25:08 -07:00
Joseph Huber	c1d19a8489	[ELF] Provide the GNU hash function in libObject GNU uses a different hashing function compared to the sys-V standard function already provided in libObject. This is already used internally in LLD for generating synthetic sections. This patch simply extracts this definition and makes it availible to other users of `libObject`. This is done in preparation for supporting symbol name lookups via the GNU hash table. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D132696	2022-09-05 11:04:57 -05:00
Kazu Hirata	2bb43d72d9	[ADT] Use std::tuple_element_t (NFC)	2022-09-03 23:27:24 -07:00
Kazu Hirata	03c3c2db10	[llvm] Use std::remove_reference_t (NFC)	2022-09-03 23:27:22 -07:00
Kazu Hirata	230e57d221	[ADT] Use std::add_pointer_t (NFC)	2022-09-03 23:27:18 -07:00
Kazu Hirata	9dc6223117	[ADT] Use std::add_lvalue_reference_t (NFC)	2022-09-03 23:27:17 -07:00
Kazu Hirata	2423cf4f88	[Support] Simplify reverseBits with constexpr if (NFC) Differential Revision: https://reviews.llvm.org/D132814	2022-09-03 23:27:15 -07:00
Kazu Hirata	ee40ef7aaf	[Support] Simplify isInt and isUInt with constexpr if (NFC) Differential Revision: https://reviews.llvm.org/D132813	2022-09-03 23:27:13 -07:00
Kazu Hirata	86e8164a8f	[llvm] Qualify auto in range-based for loops (NFC) Identified with readability-qualified-auto.	2022-09-03 11:17:49 -07:00
Kazu Hirata	32aa35b504	Drop empty string literals from static_assert (NFC) Identified with modernize-unary-static-assert.	2022-09-03 11:17:47 -07:00
Kazu Hirata	baee196abb	[llvm] Use std::remove_const_t (NFC)	2022-09-03 11:17:45 -07:00
Kazu Hirata	9eca5ed790	[llvm] Use std::enable_if_t (NFC)	2022-09-03 11:17:44 -07:00
Kazu Hirata	a7a2872bb7	[ADT] Use std::add_const_t (NFC)	2022-09-03 11:17:42 -07:00
Simon Pilgrim	e2d140e9c3	[TTI] Add isExpensiveToSpeculativelyExecute wrapper CGP uses a raw `getInstructionCost(I, TargetTransformInfo::TCK_SizeAndLatency) >= TCC_Expensive` check to see if its better to move an expensive instruction used in a select behind a branch instead. This is causing issues with upcoming improvements to TCK_SizeAndLatency costs on X86 as we need to use TCK_SizeAndLatency as an uop count (so its compatible with various target-specific buffer sizes - see D132288), but we can have instructions that have a low TCK_SizeAndLatency value but should still be treated as 'expensive' (FDIV for example) - by adding a isExpensiveToSpeculativelyExecute wrapper we can keep the current behaviour but still add an x86 override in a future patch when the cost tables are updated to compensate.	2022-09-03 13:12:22 +01:00
Alexey Lapshin	79c8f51c34	[DWARFLinker] Refactor clang modules loading code. Current implementation of registerModuleReference() function not only "registers" module reference, but also clones referenced module (inside loadClangModule()). That may lead to cloning the module with incorrect options (registerModuleReference() examines module references and additionally accumulates MaxDwarfVersion and accel tables info). Since accumulated options may differ from the current values, it is incorrect to clone modules before options are fully accumulated. This patch separates "cloning" code from "registering" code. So, that accumulating option is done in the "registering stage" and "cloning" is done after all modules are registered and options accumulated. It also adds a callback for loaded compile units which can be used for D132755 and D132371(to allow doing options accumulation outside of DWARFLinker). Differential Revision: https://reviews.llvm.org/D133047	2022-09-03 11:23:52 +03:00
Craig Topper	5cf510115a	[VP] Correct LEGALPOS for more VP nodes. LEGALPOS appears to only be used by LegalizeVectorOps. It needs to point at a vector operand. Stores need to point at the second operand since the result and the first operand are MVT::Other. Reductions need to point at the second operand since the result and the first operand are scalsrs. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D133048	2022-09-02 08:04:28 -07:00
Kadir Cetinkaya	4940f205d4	[llvm][Support] Add DenseMapInfo for std::variant Differential Revision: https://reviews.llvm.org/D133200	2022-09-02 15:36:10 +02:00
Simon Pilgrim	7338f9709b	[TTI] Improve description of TargetCostKind enums to aid targets in choosing cost values I'm not sure how much to add to the description as we've tried to allow targets to interpret the TargetCostKind enums in their own way. But we need to make it clear that certain cost kinds need to match threshold numbers used by various passes (and vice-versa when passes are determining a cost-benefit threshold). I'm not keen on the "The weighted sum of size and latency" description, but its very difficult to come up with anything else that's suitably generic (e.g. X86 will use uop counts here to easily work with LoopMicroOpBufferSize thresholds, even though high latency fdiv/fsqrt instructions still often have low uop counts). Differential Revision: https://reviews.llvm.org/D132288	2022-09-02 11:09:06 +01:00
Lang Hames	6ca9f42189	[ORC][ORC-RT] Consistently use pointed-to type as template arg to wrap/unwrap. Saves wrap/unwrap implementers from having to use std::remove_pointer_t to get at the pointed-to type.	2022-09-01 20:54:24 -07:00
Lang Hames	06c4634483	[JITLink] Sink ELFX86RelocationKind into implementation file (ELF_x86_64.cpp). The ELF/x86-64 backend uses the generic x86_64 edges now, so the ELFX86RelocationKind is just an implementation detail.	2022-09-01 13:36:49 -07:00
Simon Pilgrim	e5804a5a61	[ADT] bit.h - replace <stdint.h> with <cstdint> This is a C++ header after all.	2022-09-01 20:44:56 +01:00
Fangrui Song	8d95fd7e56	[MachineFunctionPass] Support -filter-passes for -print-changed [MachineFunctionPass] Support -filter-passes for -print-changed -filter-passes specifies a `PassID` (a lower-case dashed-separated pass name, also used by -print-after, -stop-after, etc) instead of a CamelCasePass. `-filter-passes=CamelCaseNewPMPass` seems like a workaround for new PM passes before we can use lower-case dashed-separated pass names (as used by `-passes=`). Example: ``` # getPassName() is "IRTranslator". PassID is "irtranslator" llc -mtriple=aarch64 -print-changed -filter-passes=irtranslator < print-changed-machine.ll ``` Close https://github.com/llvm/llvm-project/issues/57453 Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D133055	2022-09-01 11:06:06 -07:00
Wei Yi Tee	f6b66cbc7d	[llvm][Testing/ADT] Implement `IsStringMapEntry` testing matcher for verifying the entries in a `StringMap`. Reviewed By: gribozavr2, ymandel, sgatev Differential Revision: https://reviews.llvm.org/D132753	2022-09-01 17:30:41 +00:00
Ilia Diachkov	698c800142	[SPIRV] support builtin types and ExtInsts selection The patch adds the support of OpenCL and SPIR-V built-in types. It also implements ExtInst selection and adds spv_unreachable and spv_alloca intrinsics which improve the generation of the corresponding SPIR-V code. Five LIT tests are included to demonstrate the improvement. Differential Revision: https://reviews.llvm.org/D132648 Co-authored-by: Aleksandr Bezzubikov <zuban32s@gmail.com> Co-authored-by: Michal Paszkowski <michal.paszkowski@outlook.com> Co-authored-by: Andrey Tretyakov <andrey1.tretyakov@intel.com> Co-authored-by: Konrad Trifunovic <konrad.trifunovic@intel.com>	2022-09-01 16:44:54 +03:00
Amara Emerson	4cf3db41da	[GlobalISel] Add sdiv exact (X, constant) -> mul combine. This port of the SDAG optimization is only for exact sdiv case. Differential Revision: https://reviews.llvm.org/D130517	2022-09-01 13:34:00 +01:00
David Spickett	6829cd17b5	[LLVM] Add missing stdint include to Bit.h To fix failing builds on Windows on Arm: https://lab.llvm.org/staging/#/builders/59/builds/928/steps/4/logs/stdio <...>/ADT/bit.h(50,5): error: unknown type name 'uint32_t' uint32_t v = Value; ^	2022-09-01 09:17:35 +00:00
Sam Clegg	92920c4fe3	[MC][WebAssembly] Allow accurate errors in doBeforeLabelEmit Although we only currently have one error produced in this function I am working on changes right now that add some more. This change makes the error location more accurate. Differential Revision: https://reviews.llvm.org/D133016	2022-09-01 01:26:33 -07:00
Arthur Eubanks	04f3c20989	[NFC][LICM] Stop passing around unused BFI Uses of this were removed in `1a25d0bfbb`.	2022-08-31 19:15:34 -07:00
Mark Zhuang	62454e83b0	[NFC] Fix typo Reviewed By: eopXD Differential Revision: https://reviews.llvm.org/D133079	2022-08-31 19:08:46 -07:00
Craig Topper	8dce3507a0	[VP] Correct the LEGALPOS for VP_STORE. VP_STORE has a Chain for operand 0, so the LEGALPOS should be 1. VP_STORE is always considered Legal for MVT::Other. So I suspect this was causing vp_store to be ignored by LegalizeVectorOps and instead handled in LegalizeDAG. VP_LOAD is Custom expanded in LegalizeVectorOps for RISC-V. Differential Revision: https://reviews.llvm.org/D132972	2022-08-31 11:15:47 -07:00
Wei Yi Tee	d45c04da7c	[llvm][ADT] Overload output stream operator `<<` for `StringMapEntry` and `StringMap`. Printing support enables the production of more useful error messages in unit testing e.g. when using matchers such as `UnorderedElementsAre()` to inspect the contents of a `StringMap`. Reviewed By: gribozavr2, sgatev, ymandel Differential Revision: https://reviews.llvm.org/D132747	2022-08-31 17:37:58 +00:00
Arthur Eubanks	d0b9c9c0a3	[NFC] clang-format Any.h To trigger some bots. Differential Revision: https://reviews.llvm.org/D133033	2022-08-31 10:21:30 -07:00
Daniel Thornburgh	ea99225521	[Symbolizer] Handle {{{bt}}} symbolizer markup element. This adds support for backtrace generation to the llvm-symbolizer markup filter, which is likely the largest use case. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D132706	2022-08-31 09:49:32 -07:00
Daniel Bertalan	f7b752d277	[lld-macho] Set the SG_READ_ONLY flag on __DATA_CONST This flag instructs dyld to make the segment read-only after fixups have been performed. I'm not sure why this flag is needed, as on macOS 13 beta at least, __DATA_CONST is read-only even without this flag; but ld64 sets it as well. Differential Revision: https://reviews.llvm.org/D133010	2022-08-31 17:04:20 +02:00
Hassnaa Hamdi	a6d9c944df	[AArch64 - SVE]: Use SVE to lower reduce.fadd. Differential Revision: https://reviews.llvm.org/D132573 skip custom-lowering for v1f64 to be expanded instead, because it has only one lane Differential Revision: https://reviews.llvm.org/D132959	2022-08-31 12:31:06 +00:00
Alvin Wong	12d865415f	[COFF] Use the more accurate GuardFlags definition everywhere This also modifies llvm-readobj to be more future-proof when printing the guard FIDs table by calculating the entry size correctly according to MS docs. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D132924	2022-08-31 15:11:34 +03:00
Alvin Wong	94baaa6a5c	[llvm-readobj][COFF] Print load config GuardFlags as enum flags Print flags as documented in MS docs. https://docs.microsoft.com/en-us/windows/win32/debug/pe-format#load-configuration-layout https://docs.microsoft.com/en-us/windows/win32/secbp/pe-metadata EH_CONTINUATION_TABLE_PRESENT is not mentioned in the docs but is instead taken from Windows SDK headers. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D132823	2022-08-31 15:01:57 +03:00
Nikita Popov	972840aa3b	[IR] Add Instruction::getInsertionPointAfterDef() Transforms occasionally want to insert an instruction directly after the definition point of a value. This involves quite a few different edge cases, e.g. for phi nodes the next insertion point is not the next instruction, and for invokes and callbrs its not even in the same block. Additionally, the insertion point may not exist at all if catchswitch is involved. This adds a general Instruction::getInsertionPointAfterDef() API to implement the necessary logic. For now it is used in two places where this should be mostly NFC. I will follow up with additional uses where this fixes specific bugs in the existing implementations. Differential Revision: https://reviews.llvm.org/D129660	2022-08-31 10:50:10 +02:00
Daniel Bertalan	389e0a81a1	[lld-macho] Support synthesizing __TEXT,__init_offsets This section stores 32-bit `__TEXT` segment offsets of initializer functions, and is used instead of `__mod_init_func` when chained fixups are enabled. Storing the offsets lets us avoid emitting fixups for the initializers. Differential Revision: https://reviews.llvm.org/D132947	2022-08-31 10:13:45 +02:00
Greg Clayton	ea9ac3519c	An upcoming patch to LLDB will require the ability to decode base64. This patch adds support for decoding base64 and adds tests. Resubmission of https://reviews.llvm.org/D126254 with where decodeBase64Byte is no longer a lambda but a static function. Some compilers have different errors or warnings with respect to what needs to be captured and what doesn't (see comments in https://reviews.llvm.org/D126254 for details). Differential Revision: https://reviews.llvm.org/D128560	2022-08-30 15:52:08 -07:00
Pavel Samolysov	88581db62f	[LazyCallGraph] Reformat the code in accordance with the code style. NFC Also, some local variables were renamed in accordance with the code style as well as `std::tie` occurrences and `.first`/`.second` member uses were replaced with structure bindings. Differential Revision: https://reviews.llvm.org/D132806	2022-08-30 11:06:42 +03:00
Rong Xu	7bc182ed8a	fix buildbot build error.	2022-08-29 17:01:27 -07:00
Rong Xu	d7ef0c3970	[llvm-profdata] Improve profile supplementation Current implementation promotes a non-cold function in the SampleFDO profile into a hot function in the FDO profile. This is too aggressive. This patch promotes a hot functions in the SampleFDO profile into a hot function, and a warm function in SampleFDO into a warm function in FDO. Differential Revision: https://reviews.llvm.org/D132601	2022-08-29 16:50:42 -07:00
Rong Xu	db18f26567	[llvm-profdata] Handle internal linkage functions in profile supplementation This patch has the following changes: (1) Handling of internal linkage functions (static functions) Static functions in FDO have a prefix of source file name, while they do not have one in SampleFDO. Current implementation does not handle this and we are not updating the profile for static functions. This patch fixes this. (2) Handling of -funique-internal-linakge-symbols Again this is for the internal linkage functions. Option -funique-internal-linakge-symbols can now be applied to both FDO and SampleFDO compilation. When it is used, it demangles internal linkage function names and adds a hash value as the postfix. When both SampleFDO and FDO profiles use this option, or both not use this option, changes in (1) should handle this. Here we also handle when the SampleFDO profile using this option while FDO profile not using this option, or vice versa. There is one case where this patch won't work: If one of the profiles used mangled name and the other does not. For example, if the SampleFDO profile uses clang c-compiler and without -funique-internal-linakge-symbols, while the FDO profile uses -funique-internal-linakge-symbols. The SampleFDO profile contains unmangled names while the FDO profile contains mangled names. If both profiles use c++ compiler, this won't happen. We think this use case is rare and does not justify the effort to fix. Differential Revision: https://reviews.llvm.org/D132600	2022-08-29 16:15:12 -07:00
Craig Topper	2f811a6c7f	[VP][RISCV] Add vp.fabs intrinsic and RISC-V support. Mostly just modeled after vp.fneg except there is a "functional instruction" for fneg while fabs is always an intrinsic. Reviewed By: fakepaper56 Differential Revision: https://reviews.llvm.org/D132793	2022-08-29 09:32:06 -07:00
Wei Yi Tee	72ebcf1a53	[llvm][ADT] Fix formatting for files relevant to `StringMap`. Differential Revision: https://reviews.llvm.org/D132744	2022-08-29 06:57:29 +00:00
Wei Yi Tee	af6a35597f	Revert "[llvm][ADT] Fix formatting for files relevant to `StringMap`." This reverts commit `d23df9c9e8`. Revert due to missing review link.	2022-08-29 06:43:48 +00:00
Wei Yi Tee	d23df9c9e8	[llvm][ADT] Fix formatting for files relevant to `StringMap`.	2022-08-29 06:40:07 +00:00
Kazu Hirata	8feb60756c	[llvm] Use range-based for loops (NFC)	2022-08-28 23:28:58 -07:00
Kazu Hirata	87c38323a2	[Support] Remove greatestCommonDivisor and GreatestCommonDivisor64 (NFC) This patch removes greatestCommonDivisor and GreatestCommonDivisor64 as I've migrated all the uses to std::gcd.	2022-08-28 17:35:08 -07:00
Kazu Hirata	ec8605ff52	[llvm] Use std::is_unsigned instead of std::numeric_limits (NFC)	2022-08-28 17:35:06 -07:00
Kazu Hirata	ce9f007c7c	[llvm] Use llvm::find_if (NFC)	2022-08-28 10:41:48 -07:00
Daniel Bertalan	47e4663c4e	[llvm-objdump] Add -dyld_info to llvm-otool This option outputs the location, encoded value and target of chained fixups, using the same format as `otool -dyld_info`. This initial implementation only supports the DYLD_CHAINED_PTR_64 and DYLD_CHAINED_PTR_64_OFFSET pointer encodings, which are used in x86_64 and arm64 userspace binaries. When Apple's effort to upstream their chained fixups code continues, we'll replace this code with the then-upstreamed code. But we need something in the meantime for testing ld64.lld's chained fixups code. Differential Revision: https://reviews.llvm.org/D132036	2022-08-28 09:22:41 +02:00
Benjamin Kramer	1bcf21ca7f	Use std::uninitialized_move where appropriate. NFCI.	2022-08-27 14:56:43 +02:00
Anubhab Ghosh	c69df92b4f	[Orc] Use MapperJITLinkMemoryManager with InProcessMapper in llvm-jitlink tool MapperJITLinkMemoryManager has slab allocation. Combined with InProcessMapper, it can replace InProcessMemoryManager. It can also replace JITLinkSlabAllocator through the InProcessDeltaMapper that adds an offset to the executor addresses for use in tests. Differential Revision: https://reviews.llvm.org/D132315	2022-08-27 11:07:09 +05:30
Lang Hames	f828135f91	Reapply "[ORC] Add "wrap" and "unwrap" steps to ExecutorAddr..." with fixes. Reapplies `f14cb494a3` (which was reverted in `2f08f8426c`) with a fix for UB in the ExecutorAddr::Unwrap::Unwrap constructor (which caused failures on some bots).	2022-08-26 14:53:51 -07:00
Lang Hames	2f08f8426c	Revert "[ORC] Add "wrap" and "unwrap" steps to ExecutorAddr toPtr/fromPtr." This reverts commit `f14cb494a3`. Reverting while I investigate bot failures, e.g. https://lab.llvm.org/buildbot#builders/117/builds/8701	2022-08-26 13:54:30 -07:00
Florian Hahn	9405af1c85	[LAA] Require AddRecs to be in the innermost loop for diff-checks. The simpler diff-checks require pointers with add-recs from the same innermost loop, but this property wasn't check completely. Add the missing check to ensure both addrecs are in the innermost loop. Fixes #57315.	2022-08-26 20:39:52 +01:00
Lang Hames	f14cb494a3	[ORC] Add "wrap" and "unwrap" steps to ExecutorAddr toPtr/fromPtr. The wrap/unwrap operations are applied to pointers after/before conversion to/from raw addresses. They can be used to tag, untag, sign, or strip signing from pointers. They currently default to 'rawPtr' (identity) on all platforms, but it is expected that the default will be set based on the host architecture, e.g. they would default to signing/stripping for arm64e.	2022-08-26 12:32:44 -07:00
Daniil Fukalov	9c710ebbdb	[TTI] NFC: Reduce InstructionCost::getValue() usage... in order to propagate `InstructionCost` value upper. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D103406	2022-08-26 16:37:32 +03:00
Benjamin Kramer	2c1796b3d6	[ADT] GCC 7 doesn't have constexpr char_traits, add a workaround LLVM still supports GCC 7. This workaround can be removed when GCC 8 becomes the oldest supported GCC version. Fixes #57057	2022-08-26 14:11:21 +02:00
Matthias Gehre	3e39b27101	[llvm/CodeGen] Add ExpandLargeDivRem pass Adds a pass ExpandLargeDivRem to expand div/rem instructions with more than 128 bits into a loop computing that value. As discussed on https://reviews.llvm.org/D120327, this approach has the advantage that it is independent of the runtime library. This also helps the clang driver, which otherwise would need to understand enough about the runtime library to know whether to allow _BitInts with more than 128 bits. Targets are still free to disable this pass and instead provide a faster implementation in a runtime library. Fixes https://github.com/llvm/llvm-project/issues/44994 Differential Revision: https://reviews.llvm.org/D126644	2022-08-26 11:55:15 +01:00
Matthias Gehre	6d13b80fcb	Revert "[SelectionDAG] Emit calls to __divei4 and friends for division/remainder of large integers" This reverts https://reviews.llvm.org/D120329. I abandoned the PR [0] to add __divei4 functions to compiler-rt in favor of adding a pass to transform div/rem [1]. This removes the backend code that was supposed to emit calls to the __divei4 functions. [0] https://reviews.llvm.org/D120327 [1] https://reviews.llvm.org/D130076 Differential Revision: https://reviews.llvm.org/D130079	2022-08-26 10:52:56 +01:00
Alex Richardson	0483b00875	Mark the $local function begin symbol as a function While this does not matter for most targets, when building for Arm Morello, we have to mark the symbol as a function and add size information, so that LLD can correctly evaluate relocations against the local symbol. Since Morello is an out-of-tree target, I tried to reproduce this with in-tree backends and with the previous reviews applied this results in a noticeable difference when targeting Thumb. Background: Morello uses a method similar Thumb where the encoding mode is specified in the LSB of the symbol. If we don't mark the target as a function, the relocation will not have the LSB set and calls will end up using the wrong encoding mode (which will almost certainly crash). Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D131429	2022-08-26 09:34:04 +00:00
Nicolai Hähnle	5e812e9580	Revert "ManagedStatic: remove from DebugCounter" This reverts commit `b5b6ef1500`.	2022-08-26 11:02:58 +02:00
Nicolai Hähnle	b5b6ef1500	ManagedStatic: remove from DebugCounter Follow the pattern used in MLIR for the cl::opt instances. v2: - make DebugCounter::isCountingEnabled public so that the DebugCounterOwner doesn't have to be a nested class. This simplifies later changes v3: - remove the indirection via DebugCounterOwner::instance() Differential Revision: https://reviews.llvm.org/D129116	2022-08-26 09:22:11 +02:00
Joe Loser	77eac32716	[ADT] Make `llvm::identity` a transparent function object `llvm::identity` is similar to `std::identity` from C++20, but one surprising thing is that `llvm::identity` is not a transparent function object. Add the `is_transparent` type alias to denote it can be used as a transparent function object. Differential Revision: https://reviews.llvm.org/D132628	2022-08-25 21:06:42 -06:00
Nicolai Hähnle	a0a2ddfcc5	Revert "ManagedStatic: remove from DebugCounter" This reverts commit `51d82502d9`. There is a regression in the flang-aarch64-dylib buildbot which is most likely caused by this change. Reverting until I can investigate.	2022-08-25 19:45:04 +02:00
Nicolai Hähnle	af2e54992d	[Timer][Statistics] Make global constructor ordering more robust It was observed in D129117 that the subtle dependency between statistic and timer code is not entirely robust: the global destructor ~StatisticInfo indirectly calls CreateInfoOutputFile, which requires the LibSupportInfoOutputFilename to not have been destructed. By constructing LibSupportInfoOutputFilename before the StatisticInfo object, the order of destruction is guaranteed. Differential Revision: https://reviews.llvm.org/D131059	2022-08-25 19:09:49 +02:00
Nicolai Hähnle	51d82502d9	ManagedStatic: remove from DebugCounter Follow the pattern used in MLIR for the cl::opt instances. v2: - make DebugCounter::isCountingEnabled public so that the DebugCounterOwner doesn't have to be a nested class. This simplifies later changes Differential Revision: https://reviews.llvm.org/D129116	2022-08-25 19:09:48 +02:00
Dan McGregor	3922ec46b8	[MCContext] Reverse order of DebugPrefixMap sort for generated assembly debug info Match Clang's sorting, so that longer (more specific) prefix paths will match before less specific paths. Reviewed By: MaskRay, raj.khem, #debug-info Differential Revision: https://reviews.llvm.org/D132390	2022-08-24 21:43:41 -07:00
Valery N Dmitriev	a4c8fb9d1f	[SLP][NFC] Refactor SLPVectorizerPass::vectorizeRootInstruction method. The goal is to separate collecting items for post-processing and processing them. Post processing also outlined as dedicated method. Differential Revision: https://reviews.llvm.org/D132603	2022-08-24 17:07:53 -07:00
Mircea Trofin	5ce4c9aa04	[mlgo] Use TFLite for 'development' mode. TLite is a lightweight, statically linkable[1], model evaluator, supporting a subset of what the full tensorflow library does, sufficient for the types of scenarios we envision having. It is also faster. We still use saved models as "source of truth" - 'release' mode's AOT starts from a saved model; and the ML training side operates in terms of saved models. Using TFLite solves the following problems compared to using the full TF C API: - a compiler-friendly implementation for runtime-loadable (as opposed to AOT-embedded) models: it's statically linked; it can be built via cmake; - solves an issue we had when building the compiler with both AOT and full TF C API support, whereby, due to a packaging issue on the TF side, we needed to have the pip package and the TF C API library at the same version. We have no such constraints now. The main liability is it supporting a subset of what the full TF framework does. We do not expect that to cause an issue, but should that be the case, we can always revert back to using the full framework (after also figuring out a way to address the problems that motivated the move to TFLite). Details: This change switches the development mode to TFLite. Models are still expected to be placed in a directory - i.e. the parameters to clang don't change; what changes is the directory content: we still need an `output_spec.json` file; but instead of the saved_model protobuf and the `variables` directory, we now just have one file, `model.tflite`. The change includes a utility showing how to take a saved model and convert it to TFLite, which it uses for testing. The full TF implementation can still be built (not side-by-side). We intend to remove it shortly, after patching downstream dependencies. The build behavior, however, prioritizes TFLite - i.e. trying to enable both full TF C API and TFLite will just pick TFLite. [1] thanks to @petrhosek's changes to TFLite's cmake support and its deps!	2022-08-24 16:07:24 -07:00
Sami Tolvanen	cff5bef948	KCFI sanitizer The KCFI sanitizer, enabled with `-fsanitize=kcfi`, implements a forward-edge control flow integrity scheme for indirect calls. It uses a !kcfi_type metadata node to attach a type identifier for each function and injects verification code before indirect calls. Unlike the current CFI schemes implemented in LLVM, KCFI does not require LTO, does not alter function references to point to a jump table, and never breaks function address equality. KCFI is intended to be used in low-level code, such as operating system kernels, where the existing schemes can cause undue complications because of the aforementioned properties. However, unlike the existing schemes, KCFI is limited to validating only function pointers and is not compatible with executable-only memory. KCFI does not provide runtime support, but always traps when a type mismatch is encountered. Users of the scheme are expected to handle the trap. With `-fsanitize=kcfi`, Clang emits a `kcfi` operand bundle to indirect calls, and LLVM lowers this to a known architecture-specific sequence of instructions for each callsite to make runtime patching easier for users who require this functionality. A KCFI type identifier is a 32-bit constant produced by taking the lower half of xxHash64 from a C++ mangled typename. If a program contains indirect calls to assembly functions, they must be manually annotated with the expected type identifiers to prevent errors. To make this easier, Clang generates a weak SHN_ABS `__kcfi_typeid_<function>` symbol for each address-taken function declaration, which can be used to annotate functions in assembly as long as at least one C translation unit linked into the program takes the function address. For example on AArch64, we might have the following code: ``` .c: int f(void); int (*p)(void) = f; p(); .s: .4byte __kcfi_typeid_f .global f f: ... ``` Note that X86 uses a different preamble format for compatibility with Linux kernel tooling. See the comments in `X86AsmPrinter::emitKCFITypeId` for details. As users of KCFI may need to locate trap locations for binary validation and error handling, LLVM can additionally emit the locations of traps to a `.kcfi_traps` section. Similarly to other sanitizers, KCFI checking can be disabled for a function with a `no_sanitize("kcfi")` function attribute. Relands `67504c9549` with a fix for 32-bit builds. Reviewed By: nickdesaulniers, kees, joaomoreira, MaskRay Differential Revision: https://reviews.llvm.org/D119296	2022-08-24 22:41:38 +00:00
Peter Cooper	6113998069	Add MachO MH_FILESET support to objdump https://reviews.llvm.org/D131909	2022-08-24 13:34:43 -07:00
Sami Tolvanen	a79060e275	Revert "KCFI sanitizer" This reverts commit `67504c9549` as using PointerEmbeddedInt to store 32 bits breaks 32-bit arm builds.	2022-08-24 19:30:13 +00:00
Sami Tolvanen	67504c9549	KCFI sanitizer The KCFI sanitizer, enabled with `-fsanitize=kcfi`, implements a forward-edge control flow integrity scheme for indirect calls. It uses a !kcfi_type metadata node to attach a type identifier for each function and injects verification code before indirect calls. Unlike the current CFI schemes implemented in LLVM, KCFI does not require LTO, does not alter function references to point to a jump table, and never breaks function address equality. KCFI is intended to be used in low-level code, such as operating system kernels, where the existing schemes can cause undue complications because of the aforementioned properties. However, unlike the existing schemes, KCFI is limited to validating only function pointers and is not compatible with executable-only memory. KCFI does not provide runtime support, but always traps when a type mismatch is encountered. Users of the scheme are expected to handle the trap. With `-fsanitize=kcfi`, Clang emits a `kcfi` operand bundle to indirect calls, and LLVM lowers this to a known architecture-specific sequence of instructions for each callsite to make runtime patching easier for users who require this functionality. A KCFI type identifier is a 32-bit constant produced by taking the lower half of xxHash64 from a C++ mangled typename. If a program contains indirect calls to assembly functions, they must be manually annotated with the expected type identifiers to prevent errors. To make this easier, Clang generates a weak SHN_ABS `__kcfi_typeid_<function>` symbol for each address-taken function declaration, which can be used to annotate functions in assembly as long as at least one C translation unit linked into the program takes the function address. For example on AArch64, we might have the following code: ``` .c: int f(void); int (*p)(void) = f; p(); .s: .4byte __kcfi_typeid_f .global f f: ... ``` Note that X86 uses a different preamble format for compatibility with Linux kernel tooling. See the comments in `X86AsmPrinter::emitKCFITypeId` for details. As users of KCFI may need to locate trap locations for binary validation and error handling, LLVM can additionally emit the locations of traps to a `.kcfi_traps` section. Similarly to other sanitizers, KCFI checking can be disabled for a function with a `no_sanitize("kcfi")` function attribute. Reviewed By: nickdesaulniers, kees, joaomoreira, MaskRay Differential Revision: https://reviews.llvm.org/D119296	2022-08-24 18:52:42 +00:00
Daniel Bertalan	686d8ce1ab	[llvm-objdump] Complete -chained_fixups support This commit adds definitions for the `dyld_chained_import*` structs. The imports array is now printed with `llvm-otool -chained_fixups`. This completes this option's implementation. A slight difference from cctools otool is that we don't yet dump the raw bytes of the imports entries. When Apple's effort to upstream their chained fixups code continues, we'll replace this code with the then-upstreamed code. But we need something in the meantime for testing ld64.lld's chained fixups code. Differential Revision: https://reviews.llvm.org/D131982	2022-08-24 19:29:11 +02:00
spupyrev	8d5b694da1	extending code layout alg The diff modifies ext-tsp code layout algorithm in the following ways: (i) fixes merging of cold block chains (this is a port of D129397); (ii) adjusts the cost model utilized for optimization; (iii) adjusts some APIs so that the implementation can be used in BOLT; this is a prerequisite for D129895. The only non-trivial change is (ii). Here we introduce different weights for conditional and unconditional branches in the cost model. Based on the new model it is slightly more important to increase the number of "fall-through unconditional" jumps, which makes sense, as placing two blocks with an unconditional jump next to each other reduces the number of jump instructions in the generated code. Experimentally, this makes a mild impact on the performance; I've seen up to 0.2%-0.3% perf win on some benchmarks. Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D129893	2022-08-24 09:40:25 -07:00
Fangrui Song	3b4d800911	[ELF] Parallelize writes of different OutputSections We currently process one OutputSection at a time and for each OutputSection write contained input sections in parallel. This strategy does not leverage multi-threading well. Instead, parallelize writes of different OutputSections. The default TaskSize for parallelFor often leads to inferior sharding. We prepare the task in the caller instead. * Move llvm::parallel::detail::TaskGroup to llvm::parallel::TaskGroup * Add llvm::parallel::TaskGroup::execute. * Change writeSections to declare TaskGroup and pass it to writeTo. Speed-up with --threads=8: * clang -DCMAKE_BUILD_TYPE=Release: 1.11x as fast * clang -DCMAKE_BUILD_TYPE=Debug: 1.10x as fast * chrome -DCMAKE_BUILD_TYPE=Release: 1.04x as fast * scylladb build/release: 1.09x as fast On M1, many benchmarks are a small fraction of a percentage faster. Mozilla showed the largest difference with the patch being about 1.03x as fast. Differential Revision: https://reviews.llvm.org/D131247	2022-08-24 09:40:03 -07:00
Jonas Devlieghere	e854c17b02	[llvm] Teach LLVM about filesets Teach LLVM about filesets. Filesets were added in macOS 11 (Big Sur) to combine multiple Mach-O files. They introduce a new load command (LC_FILESET_ENTRY) consisting of a fileset_entry_command. struct fileset_entry_command { uint32_t cmd; /* LC_FILESET_ENTRY / uint32_t cmdsize; / includes entry_id string / uint64_t vmaddr; / memory address of the entry / uint64_t fileoff; / file offset of the entry / union lc_str entry_id; / contained entry id / uint32_t reserved; / reserved */ }; This patch teaches LLVM about the new load command and the corresponding data. Differential revision: https://reviews.llvm.org/D132432	2022-08-24 09:33:45 -07:00
Simon Pilgrim	f9de13232f	[X86] Promote i8/i16 CTTZ (BSF) instructions and remove speculation branch This patch adds a Type operand to the TLI isCheapToSpeculateCttz/isCheapToSpeculateCtlz callbacks, allowing targets to decide whether branches should occur on a type-by-type/legality basis. For X86, this patch proposes to allow CTTZ speculation for i8/i16 types that will lower to promoted i32 BSF instructions by masking the operand above the msb (we already do something similar for i8/i16 TZCNT). This required a minor tweak to CTTZ lowering - if the src operand is known never zero (i.e. due to the promotion masking) we can remove the CMOV zero src handling. Although BSF isn't very fast, most CPUs from the last 20 years don't do that bad a job with it, although there are some annoying passthrough EFLAGS dependencies. Additionally, now that we emit 'REP BSF' in most cases, we are tending towards assuming this will most likely be executed as a TZCNT instruction on any semi-modern CPU. Differential Revision: https://reviews.llvm.org/D132520	2022-08-24 17:28:18 +01:00
Pierre van Houtryve	59cf9dd923	[AMDGPU][GISel] Enable Selection of ADD3 for G_PTR_ADD Allows things like `(G_PTR_ADD (G_PTR_ADD a, b), c)` to be simplified into a single ADD3 instruction instead of two adds. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D131254	2022-08-24 14:44:19 +00:00
Alex Richardson	38107171ed	[RegisterInfoEmitter] Generate isConstantPhysReg(). NFCI This commit moves the information on whether a register is constant into the Tablegen files to allow generating the implementaiton of isConstantPhysReg(). I've marked isConstantPhysReg() as final in this generated file to ensure that changes are made to tablegen instead of overriding this function, but if that turns out to be too restrictive, we can remove the qualifier. This should be pretty much NFC, but I did notice that e.g. the AMDGPU generated file also includes the LO16/HI16 registers now. The new isConstant flag will also be used by D131958 to ensure that constant registers are marked as call-preserved. Differential Revision: https://reviews.llvm.org/D131962	2022-08-24 14:16:20 +00:00
Teresa Johnson	d10c1b88f0	[memprof] Correct max size and access count computations The existing code resulted in the max size and access counts being equal to the min. Compute the max instead (max lifetime was already correct). Differential Revision: https://reviews.llvm.org/D132515	2022-08-23 16:53:46 -07:00
Simon Pilgrim	9317e6311f	[TTI] Add SK_Splice shuffle mask detection and X86 costs Enables fixed sized vectors to detect SK_Splice shuffle patterns and provides basic X86 cost support Differential Revision: https://reviews.llvm.org/D132374	2022-08-23 20:07:30 +01:00
Simon Pilgrim	336a4e03a4	[ADT] Add llvm::has_single_bit helper similar to the c++20 std::has_single_bit implementation Converted the llvm::isPowerOf2_32/64 helpers into wrappers	2022-08-23 19:51:05 +01:00
Simon Pilgrim	75767a0f9a	[Support] MathExtras.h - use llvm::bitcast<> for float-bits cast helpers. NFCI.	2022-08-23 18:27:13 +01:00
Jakub Kuderski	6fa87ec10f	[ADT] Deprecate is_splat and replace all uses with all_equal See the discussion thread for more details: https://discourse.llvm.org/t/adt-is-splat-and-empty-ranges/64692 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D132335	2022-08-23 11:36:27 -04:00
Philip Reames	c9608d57b8	[TTI] Plumb through OperandValueInfo in getMemoryOpCost [NFC] This has the effect of exposing the power-of-two property for use in memory op costing, but no target actually uses it yet. The main point of this change is simple consistency with the recently changes getArithmeticInstrCost, and to remove the last (interface) use of OperandValueKind.	2022-08-23 07:55:42 -07:00
Stephen Tozer	89d0cc99ec	[DebugInfo][InstrRef] Handle transfers of variadic debug values in LDV This patch adds the last of the changes required to enable DBG_VALUE_LIST handling in InstrRefLDV, handling variadic debug values during the transfer tracking step. Most of the changes are fairly straightforward, and based around tracking multiple locations per variable in TransferTracker::VLocTracker. Differential Revision: https://reviews.llvm.org/D128211	2022-08-23 15:01:28 +01:00
Florian Hahn	5913d77056	[Globals] Treat nobuiltin fns as maybe-derefined. Callsites could be marked as `builtin` while calling `nobuiltin` functions. This can lead to problems, if local optimizations apply transformations based on the semantics of the builtin, but then IPO treats the function as `nobuiltin` and applies a transform that breaks builtin semantics (assumed earlier). To avoid this, mark such functions as maybey-derefined, to avoid IPO transforms on them that may break assumptions of earlier calls. Fixes #57075 Fixes #48366 Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D97735	2022-08-23 13:45:10 +01:00
Simon Pilgrim	42a9c1819c	[ADT] Add llvm::popcount to <bit> helper wrapper This patch proposes to move the llvm::detail::PopulationCounter internal helpers into ADT/bit.h and provide a llvm::popcount implementation. I've left the countPopulation implementation in place in MathExtras.h for now, but updated it to use llvm::popcount. Hopefully I've got the type_traits correct - I don't use them very often. Someday we'll move to C++20 with an actual <bit> std header, and we already have this header in place to simplify matters. We'd probably benefit from moving the other <bit> helpers here at some point, but this is a first step. Differential Revision: https://reviews.llvm.org/D132407	2022-08-23 10:36:43 +01:00
Florian Hahn	ff34432649	[LoopUtils] Remove unused Loop arg from addDiffRuntimeChecks (NFC). The argument is no longer used, remove it.	2022-08-23 10:15:28 +01:00
liqinweng	eaa539afa1	[LV][NFC] Modify code comments Reviewed By: jacquesguan Differential Revision: https://reviews.llvm.org/D132093	2022-08-23 12:21:53 +08:00
Jakub Kuderski	c9e52fbe4d	[ADT] Add all_equal predicate `llvm::all_equal` checks if all values in the given range are equal, i.e., there are no two elements that are not equal. Similar to `llvm::all_of`, it returns `true` when the range is empty. `llvm::all_equal` is intended to supersede `llvm::is_splat`, which will be deprecated and removed in future patches. See the discussion thread for more details: https://discourse.llvm.org/t/adt-is-splat-and-empty-ranges/64692. Reviewed By: dblaikie, shchenz Differential Revision: https://reviews.llvm.org/D132334	2022-08-22 23:55:23 -04:00
Philip Reames	104fa367ee	[TTI] Use OperandValueInfo in getArithmeticInstrCost implementation [NFC] This change completes the process of replacing OperandValueKind and OperandValueProperties which were previously passed independently in this API with a single container class which contains both. This is the change which motivated the whole sequence which preceeded it. In an original spike version of this change, I'd noticed a nasty bug: I'd changed the signature without changing names, and as result, we silently passed additional information through a callsite which previously dropped the power-of-two fact. This might be harmless in most cases, but at least a couple clearly dependend for correctness on not passing that property through. I did my best to split off prior changes which reduced the scope of this one, and which made it possible to use compiler assistance. For instance, every parameter which changes type in this change also changes name. This was intentional to make sure that every call site possible effected must show up in the diff. This let me audit each one closely.	2022-08-22 15:16:39 -07:00
Philip Reames	478cf94378	[X86][AArch64][WebAsm][RISCV] Query operand properties instead of using enums directly [nfc] This is part of an ongoing transition to use OperandValueInfo which combines OperandValueKind and OperandValueProperties. This change adds some accessor methods and uses them to simplify backend code. The primary motivation of doing so is removing uses of the parameters so that an upcoming api change is less error prone.	2022-08-22 13:37:59 -07:00
David Penry	ced705c440	[ModuloSchedule] Add interface call to accept/reject SMS schedules This interface allows a target to reject a proposed SMS schedule. For Hexagon/PowerPC, all schedules are accepted, leaving behavior unchanged. For ARM, schedules which exceed register pressure limits are rejected. Also, two RegisterPressureTracker methods now need to be public so that register pressure can be computed by more callers. Reapplication of D128941/(reversion:D132037) with small fix. Differential Revision: https://reviews.llvm.org/D132170	2022-08-22 12:10:13 -07:00
Philip Reames	27d3321c4f	[TTI] Use OperandValueInfo in getMemoryOpCost client api [nfc] This removes the last use of OperandValueKind from the client side API, and (once this is fully plumbed through TTI implementation) allow use of the same properties in store costing as arithmetic costing.	2022-08-22 11:26:31 -07:00
Philip Reames	274f86e7a6	[TTI] Remove OperandValueKind/Properties from getArithmeticInstrCost interface [nfc] This completes the client side transition to the OperandValueInfo version of this routine. Backend TTI implementations still use the prior versions for now.	2022-08-22 11:06:32 -07:00
Philip Reames	c42a5f1cc2	[TTI] Migrate getOperandInfo to OperandVaueInfo [nfc] This is part of merging OperandValueKind and OperandValueProperties.	2022-08-22 10:19:02 -07:00
Philip Reames	5cd427106d	[TTI] Start process of merging OperandValueKind and OperandValueProperties [nfc] OperandValueKind and OperandValueProperties both provide facts about the operands of an instruction for purposes of cost modeling. We've discussed merging them several times; before I plumb through more flags, let's go ahead and do so. This change only adds the client side interface for getArithmeticInstrCost and makes a couple of minor changes in client code to prove that it works. Target TTI implementations still use the split flags. I'm deliberately splitting what could be one big change into a series of smaller ones so that I can lean on the compiler to catch errors along the way.	2022-08-22 09:48:15 -07:00
Matthias Braun	b2542c40b9	RegisterClassInfo: Fix CSR cache invalidation `RegisterClassInfo` caches information like allocation orders and reuses it for multiple machine functions where possible. However the `MCPhysReg *CalleeSavedRegs` field used to test whether the set of callee saved registers changed did not work: After D28566 `MachineRegisterInfo::getCalleeSavedRegs()` can return dynamically computed CSR sets that are only valid while the `MachineRegisterInfo` object of the current function exists. This changes the code to make a copy of the CSR list instead of keeping a possibly invalid pointer around. Differential Revision: https://reviews.llvm.org/D132080	2022-08-22 09:28:26 -07:00
Victor Campos	1d66c5ebbc	[ARM] Fix bug in also_compatible_with attribute parser Check ScopedPrinter pointer before attempting to print the attribute's parsed information. Patch by Michael Platings and Victor Campos Reviewed By: pratlucas Differential Revision: https://reviews.llvm.org/D132214	2022-08-22 09:40:37 +01:00
Max Kazantsev	e587199a50	[SCEV] Prove condition invariance via context, try 2 Initial implementation had too weak requirements to positive/negative range crossings. Not crossing zero with nuw is not enough for two reasons: - If ArLHS has negative step, it may turn from positive to negative without crossing 0 boundary from left to right (and crossing right to left doesn't count for unsigned); - If ArLHS crosses SINT_MAX boundary, it still turns from positive to negative; In fact we require that ArLHS always stays non-negative or negative, which an be enforced by the following set of preconditions: - both nuw and nsw; - positive step (looks liftable); Because of positive step, boundary crossing is only possible from left part to the right part. And because of no-wrap flags, it is guaranteed to never happen.	2022-08-22 14:31:19 +07:00
Ting Wang	d2d77e050b	[PowerPC][Coroutines] Add tail-call check with call information for coroutines Fixes #56679. Reviewed By: ChuanqiXu, shchenz Differential Revision: https://reviews.llvm.org/D131953	2022-08-21 22:20:40 -04:00
Joe Loser	4a51b0c05b	[ADT] Remove `is_invocable` from `STLExtras.h` As a follow-up of https://reviews.llvm.org/D132318, now that the callers have been adjusted to use `std::is_invocable`, remove `llvm::is_invocable` and its tests. Differential Revision: https://reviews.llvm.org/D132321	2022-08-21 18:15:38 -06:00
Joe Loser	7e2cf2679e	[ADT] Clarify llvm::bit_cast implementation comment When reviewing https://reviews.llvm.org/D132330, I noticed a few pre-existing comments regarding the implementation of `llvm::bit_cast`. One comment is a bit misleading since `std::bit_cast` is a C++20 standard library thing, not a C++17 one (otherwise we could use it directly). Clarify that in the comment. Differential Revision: https://reviews.llvm.org/D132332	2022-08-21 18:13:41 -06:00
Kazu Hirata	be35870dc8	[ADT] Simplify llvm::bit_cast (NFC) This patch removes macro tricks to check GCC versions. The commit message from `19262fc596` states that "is_trivially_copyable is only in GCC 5.1 and later". Note that we now require GCC 7.1 or higher. Since both std::is_trivially_constructible and std::is_trivially_copyable are C++11 features, and we now require C++17, we probably don't need to worry about the availability of the C++11 features. Differential Revision: https://reviews.llvm.org/D132330	2022-08-21 10:39:21 -07:00
Kazu Hirata	36357c967c	Remove llvm::is_trivially_copyable (NFC) This patch removes llvm::is_trivially_copyable as it seems to be dead. Once I remove it, HAVE_STD_IS_TRIVIALLY_COPYABLE has no users, so this patch removes the macro also. The comment on llvm::is_trivially_copyable mentions GCC 4.9, but note that we now require GCC 7.1 or higher. Differential Revision: https://reviews.llvm.org/D132328	2022-08-21 10:39:19 -07:00
Sesha Kalyur	d9ff670330	[flang][OpenMP] Parser support for Target directive and Device clause This patch adds support for the device clause on `Target` directive. Device clause was added in OpenMP specification version 4.5 to create a device data environment for the extent of a region. On target construct, the device expression be either be `ancestor` (taking after the parent) or assign a new `device_num`. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D126441	2022-08-21 22:26:02 +05:30
Joe Loser	99694a5d1d	[ADT] Replace `void_t` equivalent with `std::void_t` Use `std::void_t` instead of defining our own equivalent in `STLExtras.h` now that C++17 is available for use. Differential Revision: https://reviews.llvm.org/D132319	2022-08-21 10:37:38 -06:00
Simon Pilgrim	5263155d5b	[CostModel] Add CostKind argument to getShuffleCost Defaults to TCK_RecipThroughput - as most explicit calls were assuming TCK_RecipThroughput (vectorizers) or was just doing a before-vs-after comparison (vectorcombiner). Calls via getInstructionCost were just dropping the CostKind, so again there should be no change at this time (as getShuffleCost and its expansions don't use CostKind yet) - but it will make it easier for us to better account for size/latency shuffle costs in inline/unroll passes in the future. Differential Revision: https://reviews.llvm.org/D132287	2022-08-21 10:54:51 +01:00
Kazu Hirata	8b1b0d1d81	Revert "Use std::is_same_v instead of std::is_same (NFC)" This reverts commit `c5da37e42d`. This patch seems to break builds with some versions of MSVC.	2022-08-20 23:00:39 -07:00
Kazu Hirata	c5da37e42d	Use std::is_same_v instead of std::is_same (NFC)	2022-08-20 22:36:26 -07:00
Kazu Hirata	ce377df57e	Ensure newlines at the end of files (NFC)	2022-08-20 21:18:23 -07:00
Kazu Hirata	01ffe31cbb	[llvm] Remove llvm::is_trivially_{copy/move}_constructible (NFC) This patch removes llvm::is_trivially_{copy/move}_constructible in favor of std::is_trivially_{copy/move}_constructible. The previous attempt to remove them in Dec 2020, `c8d406c93c`, broke builds with "some versions of GCC" according to `6cd9608fb3`. It's been 20 months since then, and the minimum requirement for GCC has been updated to 7.1 from 5.1. FWIW, I was able to build llvm with gcc 8.4.0. Differential Revision: https://reviews.llvm.org/D132311	2022-08-20 14:06:42 -07:00
Lang Hames	e0fc85e092	[JITLink] Fix LinkGraph::makeAbsolute, add unit test. makeAbsolute was not updating the symbol address when applied to external symbols. This commit adds a unit test for makeAbsolute, and updates the makeExternal unit test to check that makeExternal works correctly for absolute symbols.	2022-08-20 13:43:21 -07:00
Kazu Hirata	7dec4648c4	[ADT] Simplify llvm::sort with constexpr if (NFC) Differential Revision: https://reviews.llvm.org/D132305	2022-08-20 09:34:36 -07:00
Kazu Hirata	abb6271d80	[ADT] Deprecate Any::hasValue This patch deprecates Any::hasValue as I've migrated all known uses of it to Any::has_value. I'm planning to remove the deprecated method in 3 months or so. Differential Revision: https://reviews.llvm.org/D132304	2022-08-20 09:34:35 -07:00
Kazu Hirata	e15359debf	[ADT] Implement Any::has_value This patch implements Any::has_value for consistency with std::any in C++17. My plan is to deprecate Any::hasValue after migrating all of its uses to Any::has_value. Since I am about to do so, this patch simply replaces hasValue with has_value in the unit test instead of adding tests for has_value. Differential Revision: https://reviews.llvm.org/D132278	2022-08-20 07:28:04 -07:00
Kazu Hirata	0e0e638249	[ADT] Simplify llvm::reverse with constexpr if (NFC) Differential Revision: https://reviews.llvm.org/D132279	2022-08-20 07:28:03 -07:00
Philip Reames	b0a2c48e9f	[tti] Consolidate getOperandInfo without OperandValueProperties copies [nfc]	2022-08-19 16:22:22 -07:00
Austin Kerbow	b0f4678b90	[AMDGPU] Add iglp_opt builtin and MFMA GEMM Opt strategy Adds a builtin that serves as an optimization hint to apply specific optimized DAG mutations during scheduling. This also disables any other mutations or clustering that may interfere with the desired pipeline. The first optimization strategy that is added here is designed to improve the performance of small gemm kernels on gfx90a. Reviewed By: jrbyrnes Differential Revision: https://reviews.llvm.org/D132079	2022-08-19 15:38:36 -07:00
Alexey Bataev	c167028684	[SLP]Delay vectorization of postponable values for instructions with no users. SLP vectorizer tries to find the reductions starting the operands of the instructions with no-users/void returns/etc. But such operands can be postponable instructions, like Cmp, InsertElement or InsertValue. Such operands still must be postponed, vectorizer should not try to vectorize them immediately. Differential Revision: https://reviews.llvm.org/D131965	2022-08-19 08:39:16 -07:00
Alexey Bataev	d53e245951	[COST][NFC]Introduce OperandValueKind in getMemoryOpCost, NFC. Added OperandValueKind OpdInfo parameter to getMemoryOpCost functions to better estimate cost with immediate values. Part of D126885.	2022-08-19 07:33:00 -07:00
Max Kazantsev	f798c042f4	Revert "[SCEV] Prove condition invariance via context" This reverts commit `a3d1fb3b59`. Reverting until investigation of https://github.com/llvm/llvm-project/issues/57247 has concluded.	2022-08-19 21:02:06 +07:00
Archibald Elliott	3a729069e4	[IR] Update llvm.prefetch to match docs The current llvm.prefetch intrinsic docs state "The rw, locality and cache type arguments must be constant integers." This change: - Makes arg 3 (cache type) an ImmArg - Improves the verifier error messages to reference the incorrect argument. - Fixes two tests which contradict the docs. This is needed as the lowering to GlobalISel is different for ImmArgs compared to other constants. The non-ImmArgs create a G_CONSTANT MIR instruction, the for ImmArgs the constant is put directly on the intrinsic's MIR instruction as an immediate. Differential Revision: https://reviews.llvm.org/D132042	2022-08-19 09:11:17 +01:00
Craig Topper	37c47b2cac	[RISCV] Change how mtune aliases are implemented. The previous implementation translated from names like sifive-7-series to sifive-7-rv32 or sifive-7-rv64. This also required sifive-7-rv32 and sifive-7-rv64 to be valid CPU names. As those are not real CPUs it doesn't make sense to accept them in -mcpu. This patch does away with the translation and adds sifive-7-series directly to RISCV.td. Removing sifive-7-rv32 and sifive-7-rv64. sifive-7-series is only allowed in -mtune. I've also added "rocket" to RISCV.td but have not removed rocket-rv32 or rocket-rv64. To prevent -mcpu=sifive-7-series or -mcpu=rocket being used with llc, I've added a Feature32Bit to all rv32 CPUs. And made it an error to have an rv32 triple without Feature32Bit. sifive-7-series and rocket do not have Feature32Bit or Feature64Bit set so the user would need to provide -mattr=+32bit or -mattr=+64bit along with the -mcpu to avoid the error. SiFive no longer names their newer products with 3, 5, or 7 series. Instead we have p200 series, x200 series, p500 series, and p600 series. Following the previous behavior would require a sifive-p500-rv32 and sifive-p500-rv64 in order to support -mtune=sifive-p500-series. There is currently no p500 product, but it could start getting confusing if there was in the future. I'm open to hearing alternatives for how to achieve my main goal of removing sifive-7-rv32/rv64 as a CPU name. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D131708	2022-08-18 16:22:25 -07:00
Prabhdeep Singh Soni	bce94ea551	[OMPIRBuilder] Add support for safelen clause This patch adds OMPIRBuilder support for the safelen clause for the simd directive. Reviewed By: shraiysh, Meinersbur Differential Revision: https://reviews.llvm.org/D131526	2022-08-18 15:43:08 -04:00
Florian Hahn	b8709a9d03	[LV] Support fixed order recurrences. If the incoming previous value of a fixed-order recurrence is a phi in the header, go through incoming values from the latch until we find a non-phi value. Use this as the new Previous, all uses in the header will be dominated by the original phi, but need to be moved after the non-phi previous value. At the moment, fixed-order recurrences are modeled as a chain of first-order recurrences. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D119661	2022-08-18 19:15:52 +01:00
Philip Reames	1436adae2c	[LV-L] Add const and move method body out of line [nfc]	2022-08-18 11:10:19 -07:00
Vitaly Buka	a0e402d41f	[test] Disable DynamicLibrary with HWASAN Re-enabled when https://github.com/llvm/llvm-project/issues/57206 fixed.	2022-08-18 10:19:18 -07:00
Paul Walker	96c8d615d6	[SVE] Extend findMoreOptimalIndexType so BUILD_VECTORs do not force 64bit indices. Extends findMoreOptimalIndexType to allow ISD::BUILD_VECTOR based indices to be truncated when such truncation is lossless. This can enable the use of 32bit gather/scatter indices thus making it less likely to have to split a gather/scatter in two. Depends on D125194 Differential Revision: https://reviews.llvm.org/D130533	2022-08-18 18:00:53 +01:00
Simon Pilgrim	fdec50182d	[CostModel] Replace getUserCost with getInstructionCost * Replace getUserCost with getInstructionCost, covering all cost kinds. * Remove getInstructionLatency, it's not implemented by any backends, and we should fold the functionality into getUserCost (now getInstructionCost) to make it easier for targets to handle the cost kinds with their existing cost callbacks. Original Patch by @samparker (Sam Parker) Differential Revision: https://reviews.llvm.org/D79483	2022-08-18 11:55:23 +01:00
Daniel Bertalan	11443ef85d	[llvm-objdump] Support dumping segment information with -chained_fixups This commit adds the definitions for `dyld_chained_starts_in_image`, `dyld_chained_starts_in_segment`, and related enums. Dumping their contents is possible with the -chained_fixups flag of llvm-otool. The chained-fixups.yaml test was changed to cover bindings/rebases, as well as weak imports, weak symbols and flat namespace symbols. Now that we have actual fixup entries, the __DATA segment contains data that would need to be hexdumped in YAML. We also test empty pages (to look for the "DYLD_CHAINED_PTR_START_NONE" annotation), so the YAML would end up quite large. So instead, this commit includes a binary file. When Apple's effort to upstream their chained fixups code continues, we'll replace this code with the then-upstreamed code. But we need something in the meantime for testing ld64.lld's chained fixups code. Differential Revision: https://reviews.llvm.org/D131961	2022-08-18 09:29:27 +02:00
Lang Hames	6494920987	[JITLink] Pass Allocator (rather than storage) into Symbol named constructors. Also switch from orc::ExecutorAddrDiff to uint64_t for the Symbol::Size field. These changes help to prepare for the introduction of symbol alias support: Aliases will require an auxiliary data structure which will also need to be allocated (hence the need to pass the allocator down). The Size field will be re-tasked to track the auxiliary data (which will hold a replacement Size field) if the symbol is either an alias, or aliased by some other symbol.	2022-08-17 15:55:42 -07:00
Daniil Fukalov	7ed3d81333	[NFCI] Move cost estimation from TargetLowering to TargetTransformInfo. TragetLowering had two last InstructionCost related `getTypeLegalizationCost()` and `getScalingFactorCost()` members, but all other costs are processed in TTI. E.g. it is not comfortable to use other TTI members in these two functions overrided in a target. Minor refactoring: `getTypeLegalizationCost()` now doesn't need DataLayout parameter - it was always passed from TTI. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D117723	2022-08-18 00:38:55 +03:00
Nick Desaulniers	6b0e2fa6f0	[SelectionDAG] make INLINEASM_BR use MachineBasicBlocks instead of BlockAddresses As part of re-architecting callbr to no longer use blockaddresses (https://reviews.llvm.org/D129288), we don't really need them in MIR. They make comparing MachineBasicBlocks of indirect targets during MachineVerifier a PITA. Suggested by @efriedma from the discussion: https://reviews.llvm.org/D130290#3669531 Reviewed By: efriedma, void Differential Revision: https://reviews.llvm.org/D130316	2022-08-17 09:34:31 -07:00
David Penry	1c9f0408bc	Revert "[ModuloSchedule] Add interface call to accept/reject SMS schedules" This reverts commit `8c4aea438c`. Needed because buildbot failures (warnings) gave a clue that there was a functional bug in the ARM rejection logic. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D132037	2022-08-17 09:32:43 -07:00
David Penry	8c4aea438c	[ModuloSchedule] Add interface call to accept/reject SMS schedules This interface allows a target to reject a proposed SMS schedule. For Hexagon/PowerPC, all schedules are accepted, leaving behavior unchanged. For ARM, schedules which exceed register pressure limits are rejected. Also, two RegisterPressureTracker methods now need to be public so that register pressure can be computed by more callers. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D128941	2022-08-17 08:13:26 -07:00
Paul Kirth	656c5d652c	[clang][llvm][NFC] Change misexpect's tolerance option to be 32-bit In D131869 we noticed that we jump through some hoops because we parse the tolerance option used in MisExpect.cpp into a 64-bit integer. This is unnecessary, since the value can only be in the range [0, 100). This patch changes the underlying type to be 32-bit from where it is parsed in Clang through to it's use in LLVM. Reviewed By: jloser Differential Revision: https://reviews.llvm.org/D131935	2022-08-17 14:38:53 +00:00
Simon Pilgrim	1d522a39f7	[TTI] Remove getInstructionThroughput cost helper. Pulled out of D79483 - we can just as easily use getUserCost directly	2022-08-17 11:41:47 +01:00
Eli Friedman	cfd2c5ce58	Untangle the mess which is MachineBasicBlock::hasAddressTaken(). There are two different senses in which a block can be "address-taken". There can be a BlockAddress involved, which means we need to map the IR-level value to some specific block of machine code. Or there can be constructs inside a function which involve using the address of a basic block to implement certain kinds of control flow. Mixing these together causes a problem: if target-specific passes are marking random blocks "address-taken", if we have a BlockAddress, we can't actually tell which MachineBasicBlock corresponds to the BlockAddress. So split this into two separate bits: one for BlockAddress, and one for the machine-specific bits. Discovered while trying to sort out related stuff on D102817. Differential Revision: https://reviews.llvm.org/D124697	2022-08-16 16:15:44 -07:00
Martin Sebor	345514e991	[InstCombine] Add support for strlcpy folding Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D130666	2022-08-16 16:43:40 -06:00
Alexander Shaposhnikov	d68ba43ad2	[Intrinsics] Add initial support for NonNull attribute Add initial support for NonNull attribute. (https://github.com/llvm/llvm-project/issues/57113) Test plan: verify that for __thread int x; int main() { int* y = &x; return *y; } (with this patch) clang -O -fsanitize=null -S -emit-llvm -o - doesn't emit a null-pointer check Differential revision: https://reviews.llvm.org/D131872	2022-08-16 21:28:23 +00:00
Victor Campos	784da8a722	[ARM] Simplify the creation of escaped build attribute values There is an existing mechanism to escape strings, therefore the functions created to escape Tag_also_compatible_with values are not really needed. We can simply use the pre-existing utilities. Reviewed By: pratlucas Differential Revision: https://reviews.llvm.org/D131680	2022-08-16 11:49:33 +01:00
Victor Campos	08c6840f25	[ARM] Parse Tag_also_compatible_with attribute The ARM Attribute Parser used to parse the value of also_compatible_with as it is, disregarding the way it is encoded. This patch does a context aware parsing of the also_compatible_with attribute. Additionally, some error handling is also done for incorrect cases. Reviewed By: pratlucas Differential Revision: https://reviews.llvm.org/D130913	2022-08-16 11:22:56 +01:00
Max Kazantsev	ebabd6bf18	Return "[SCEV] Use context to strengthen flags of BinOps" This reverts commit `354fa0b480`. Returning as is. The patch was reverted due to a miscompile, but this patch is not causing it. This patch made it possible to infer some nuw flags in code guarded by `false` condition, and then someone else to managed to propagate the flag from dead code outside. Returning the patch to be able to reproduce the issue.	2022-08-16 14:12:36 +07:00
Kshitij Jain	29fe204b4e	Re-apply "[JITLink] Introduce ELF/i386 backend " with correct authorship. I (lhames) accidentally pushed `5f300397c6` on Kshitij Jain's behalf without updating the patch author first (my apologies Kshitij!). Re-applying with correct authorship. https://reviews.llvm.org/D131347	2022-08-15 18:44:43 -07:00
Lang Hames	73600b7c8a	Revert "[JITLink] Introduce ELF/i386 backend support for JITLink." This reverts commit `5f300397c6`. No functional issues, I just failed to correctly set authorship on the patch.	2022-08-15 18:44:43 -07:00
Lang Hames	5f300397c6	[JITLink] Introduce ELF/i386 backend support for JITLink. This initial ELF/i386 JITLink backend enables JIT-linking of minimal ELF i386 object files. No relocations are supported yet. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D131347	2022-08-15 18:35:51 -07:00
David Blaikie	c63f2581f4	Enable -Wctad-maybe-unsupported in LLVM build Warns on potentially unintended use of C++17 Class Template Argument Deduction. Use of this feature with types that aren't intended to support it may may future refactorings of those types more difficult - so this warning fires whenever the feature is used with a type that may not have intended to be used with CTAD (the warning uses the existence of at least one explicit deduction guide to indicate that a type intentionally supports CTAD - absent that, it's assumed to not be intended to support CTAD & produces a warning). This is disabled in libcxx because lots of the standard library is assumed to provide ctad-usable APIs and the false positive suppression in the diagnostic is based on system header classification which doesn't apply in the libcxx build itself. Differential Revision: https://reviews.llvm.org/D131727	2022-08-15 23:28:51 +00:00
Martin Sebor	65967708d2	[InstCombine] Adjust snprintf folding of constant strings (PR #56598 ) Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D130494	2022-08-15 15:59:21 -06:00
Arthur Eubanks	633f5663c3	[LegacyPM] Remove ThinLTO bitcode writer legacy pass Using the legacy PM for the optimization pipeline is deprecated and in the process of being removed. This is a small step in that direction. For an example of migrating to the new PM: `853b57fe80`	2022-08-15 14:21:16 -07:00
Sunho Kim	0c69f9f32c	[ORC][COFF] Introduce DLLImportDefinitionGenerator. This class will be used to properly solve the `__imp_` symbol and jump-thunk generation issues. It is assumed to be the last definition generator to be called, and as it's the last generator the only symbols remaining in the lookup set are the symbols that are supposed to be queried outside this jitdylib. Instead of just letting them through, we issue another lookup invocation and fetch the allocated addresses, and then create jitlink graph containing `__imp_` GOT symbols and jump-thunks targetting the fetched addresses. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D131833	2022-08-16 02:06:57 +09:00
Nico Weber	940e178c00	[llvm-objdump] Start on -chained_fixups for llvm-otool And --chained-fixups for llvm-objdump. For now, this only prints the dyld_chained_fixups_header and adds plumbing for the flag. This will be expanded in future commits. When Apple's effort to upstream their chained fixups code continues, we'll replace this code with the then-upstreamed code. But we need something in the meantime for testing ld64.lld's chained fixups code. Update chained-fixups.yaml with a file that actually contains the chained fixup data (`LinkEditData` doesn't encode it yet, so use `__LINKEDIT` via `--raw-segment=data`). Differential Revision: https://reviews.llvm.org/D131890	2022-08-15 10:58:52 -04:00
wangyihan	91d784a021	[NFC][SmallVector] Use std::conditional_t instead of std::conditional Signed-off-by: wangyihan <yihan.wang@intel.com>	2022-08-15 21:51:13 +08:00
Dmitry Vassiliev	5371ab4456	[IR] Change access rights of PredIterator members These members were made private here `6177386b05` without an explanation. Our customers have an own implementation inherited from PredIterator with updated advancePastNonTerminators(). The access specifier protected looks resonable and safe here. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D131608	2022-08-15 14:25:58 +02:00
Max Kazantsev	354fa0b480	Revert "[SCEV] Use context to strengthen flags of BinOps" This reverts commit `34ae308c73`. Our internal testing found a miscompile. Not sure if it's caused by this patch or it revealed something else. Reverting while investigating.	2022-08-15 18:51:59 +07:00
Zijia Zhu	8719faafdb	[ADT] Make SmallSet::insert(const T &) return const_iterator This patch makes `SmallSet::insert(const T &)` return `std::pair<const_iterator, bool>` instead of `std::pair<NoneType, bool>`. This will exactly match std::set's behavior and make deduplicating items with SmallSet easier. Reviewed By: dblaikie, lattner Differential Revision: https://reviews.llvm.org/D131549	2022-08-15 13:53:34 +08:00
Fangrui Song	d797c2ffdb	[DebugInfo] -fdebug-prefix-map: handle '#line "file"' for asm source `getContext().setMCLineTableRootFile` (from D62074) sets `RootFile.Name` to `FirstCppHashFilename`. `RootFile.Name` is not processed by -fdebug-prefix-map and will go to DW_TAG_compile_unit's DT_AT_name and DW_TAG_label's DW_AT_decl_file. Remap `RootFile.Name`. Fix another issue reported by https://github.com/llvm/llvm-project/issues/56609 Reviewed By: #debug-info, dblaikie, raj.khem Differential Revision: https://reviews.llvm.org/D131848	2022-08-14 20:58:23 -07:00
Kazu Hirata	eeac9e9232	[ADT] Deprecate Optional::map This patch deprecates Optional::map in favor of Optional::transform for consistency with std::optional::transform in C++23. Note that I've migrated all known users of Optional::map. Differential Revision: https://reviews.llvm.org/D131842	2022-08-14 17:51:59 -07:00
Kazu Hirata	f5a68feab3	Use llvm::none_of (NFC)	2022-08-14 16:25:39 -07:00
Kazu Hirata	9144e49334	[Support] Drop unnecessary const from a return type (NFC) Identified with readability-const-return-type.	2022-08-14 12:51:56 -07:00
Lang Hames	1cf81274f4	[JITLink] Add eh-frame CFI inspector, fix crash on malformed FDEs. Add a fix to check that FDE pc-begin targets are defined before calling getBlock (which will crash if the target is not defined). FDE pc-begins pointing at undefined symbols are expected to arise only in obscure circumstances (malformed objects, or removal of targets by JITLink passes), but we want to handle them gracefully. With this patch the FDE will be retained, but without any keepalive edge to it. Unless some pass takes action to mark it as live it will be dead-stripped. To make it easier for passes to connect FDEs to their targets a new EHFrameCFIBlockInspector utility is added. This allows clients to quickly determine whether a CFI record is a CIE or an FDE (assuming that it's valid), and retrieve any personality, pc-begin, cie, or LSDA edges associated with it.	2022-08-14 10:49:26 -07:00
Anubhab Ghosh	23d0e71fcb	[Orc] Use IntervalMap to store free memory regions in MapperJITLinkMemoryManager MapperJITLinkMemoryManager uses a free list to keep track of available memory regions. Using an IntervalMap instead of vector allow automatic coalescing of memory regions as they are freed. Differential Revision: https://reviews.llvm.org/D131831	2022-08-14 14:35:08 +05:30
Alexey Baturo	b2f31cac28	[Triple] Add llvm::Triple::isRISCV{32,64} Reviewed By: vitalybuka, MaskRay, craig.topper Differential Revision: https://reviews.llvm.org/D131339	2022-08-13 18:51:35 -07:00
Joe Loser	9a75033402	[MC] Leverage constexpr `std::array` in `SubtargetFeature.h` Replace C-style array with `std::array` since `std::array<T, N>::operator[]` is `constexpr` in C++17. This also allows us to replace `array_lengthof` calls with member `size()` function. Differential Revision: https://reviews.llvm.org/D131826	2022-08-13 12:54:32 -06:00
Kazu Hirata	2a4748576e	[ADT] Implement Optional::transform This patch implements Optional::transform for consistency with std::optional::transform in C++23. Note that the new function is identical to Optional::map. My plan is to deprecate Optional::map after migrating all of its uses to Optional::transform. Differential Revision: https://reviews.llvm.org/D131829	2022-08-13 11:48:25 -07:00
Anubhab Ghosh	a31af32183	Reapply [Orc] Properly deallocate mapped memory in MapperJITLinkMemoryManager When memory is deallocated from MapperJITLinkMemoryManager deinitialize actions are run through mapper and in case of InProcessMapper, memory protections of the region are reset to read/write as they were previously changed and can be reused in future. Differential Revision: https://reviews.llvm.org/D131768	2022-08-13 13:07:50 +05:30
Anubhab Ghosh	8180105143	Revert "[Orc] Properly deallocate mapped memory in MapperJITLinkMemoryManager" This reverts commit `143555b2ed`.	2022-08-13 10:22:31 +05:30
Sunho Kim	9189a26664	[ORC_RT][COFF] Initial platform support for COFF/x86_64. Initial platform support for COFF/x86_64. Completed features: * Statically linked orc runtime. * Full linking/initialization of static/dynamic vc runtimes and microsoft stl libraries. * SEH exception handling. * Full static initializers support * dlfns * JIT side symbol lookup/dispatch Things to note: * It uses vc runtime libraries found in vc toolchain installations. * Bootstrapping state is separated because when statically linking orc runtime it needs microsoft stl functions to initialize the orc runtime, but static initializers need to be ran in order to fully initialize stl libraries. * Process symbols can't be used blidnly on msvc platform; otherwise duplicate definition error gets generated. If process symbols are used, it's destined to get out-of-reach error at some point. * Atexit currently not handled -- will be handled in the follow-up patches. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D130479	2022-08-13 13:48:40 +09:00
Anubhab Ghosh	143555b2ed	[Orc] Properly deallocate mapped memory in MapperJITLinkMemoryManager When memory is deallocated from MapperJITLinkMemoryManager deinitialize actions are run through mapper and in case of InProcessMapper, memory protections of the region are reset to read/write as they were previously changed and can be reused in future. Differential Revision: https://reviews.llvm.org/D131768	2022-08-13 10:08:25 +05:30
Fangrui Song	f62e60fb23	[MCDwarf] Respect -fdebug-prefix-map= for generated assembly debug info (DWARF v5) For generated assembly debug info, MCDwarfLineTableHeader::CompilationDir is an unmapped path set in MCContext::setGenDwarfRootFile. Remap it. A relative destination path of -fdebug-prefix-map= exposes a llvm-dwarfdump bug which joins relative DW_AT_comp_dir and directories[0]. Fix https://github.com/llvm/llvm-project/issues/56609 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D131749	2022-08-12 12:52:36 -07:00
Joe Loser	ec7e7797b1	[ADT] Mark variable inline to avoid ODR violations in Sequence.h Mark `force_iteration_on_noniterable_enum` as an `inline` variable to avoid ODR violations. Differential Revision: https://reviews.llvm.org/D131777	2022-08-12 12:55:07 -06:00
Joe Loser	7e521ed1ac	[ADT] Remove STLForwardCompat.h's C++17 equivalents As a follow-up of `e8578968f6` which replaced the callers to use the C++17 equivalents, remove the equivalents from `STLForwardCompat.h` entirely and their corresponding tests. Differential Revision: https://reviews.llvm.org/D131769	2022-08-12 12:50:52 -06:00
Wolfgang Pieb	7ddfb4dfeb	[Inlining] Introduce the function attribute "inline-max-stacksize" The value of the attribute is a size in bytes. It has the effect of suppressing inlining of functions whose stacksizes exceed the given value. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D129904	2022-08-12 11:07:18 -07:00
James Y Knight	20451cb06b	Update license on Unicode.org's ConvertUTF code. The code was relicensed by its owner (Unicode.org) a long time back, but we still had the old (problematic) license in our fork. Note that the source files have not been distributed from unicode.org since 2009 (due to being buggy and unmaintained upstream), but they were given this license before that. Fixes https://github.com/llvm/llvm-project/issues/32309 Differential Revision: https://reviews.llvm.org/D66390	2022-08-12 16:51:08 +00:00
Dawid Jurczak	8a17e74ca9	[NFC] Introduce llvm::to_vector_of to allow creation of SmallVector<T> from range of items convertible to type T It's https://reviews.llvm.org/D129565 follow-up. Differential Revision: https://reviews.llvm.org/D129781	2022-08-12 15:22:12 +02:00
Joe Loser	e8578968f6	[ADT] Replace STLForwardCompat.h's C++17 equivalents STLForwardCompat.h defines several utilities and type traits to mimic that of the ones in the C++17 standard library. Now that LLVM is built with the C++17 standards mode, remove use of these equivalents in favor of the ones from the standard library. Differential Revision: https://reviews.llvm.org/D131717	2022-08-12 06:55:59 -06:00
Max Kazantsev	a3d1fb3b59	[SCEV] Prove condition invariance via context Contextual knowledge may be used to prove invariance of some conditions. For example, in this case: ``` ; %len >= 0 guard(%iv = {start,+,1}<nuw> <s %len) guard(%iv = {start,+,1}<nuw> <u %len) ``` the 2nd check always fails if `start` is negative and always passes otherwise. It looks like there are more opportunities of this kind that are still to be implemented in the future. Differential Revision: https://reviews.llvm.org/D129753 Reviewed By: apilipenko	2022-08-12 14:23:35 +07:00
Arthur Eubanks	eddcfe3a9e	[NFC] Format ilist_node_options.h to cycle bots	2022-08-11 15:39:12 -07:00
Martin Storsjö	2c2fb0c737	[llvm] Use hidden visibility when building for MinGW with Clang Since `c5b3de6745` (git main, August 11th), Clang does generate working hidden visibility on MinGW targets. Using that reduces the number of exports from a dylib build of LLVM significantly, which is vital for fitting within the limit of 64k exported symbols from a DLL. It's essential that if we set CMAKE_CXX_VISIBILITY_PRESET=hidden (which passes -fvisibility=hidden on the command line), we also must define LLVM_EXTERNAL_VISIBILITY consistently to override it. (If there are mismatches, e.g. setting hidden visibility generally but never overriding it back to default for the symbols that do need to be exported, we'd get broken builds in such configurations.) We don't want to be using __attribute__((visibility("hidden"))) on MinGW with GCC, because GCC produces a warning about it. (GCC hasn't warned about the command line options that set hidden visibility though.) Clang has historically not warned about either of them, so it is harmless to use the hidden visibility when building with older Clang (so we don't need to detect the exact version of Clang/LLVM where it has an effect). This reduces the number of exported symbols for a dylib build of LLVM; previously libLLVM exported around 64650 symbols (when the maximum is 65536) when the ARM, AArch64 and X86 targets were enabled. If enabling more targets (or if building with e.g. assertions enabled), it would exceed the limit. Now with visibility flags in use, the same build with ARM, AArch64 and X86 ends up at around 35k exported symbols. Differential Revision: https://reviews.llvm.org/D131661	2022-08-12 00:57:05 +03:00
Arnold Schwaighofer	6ef223c041	[coro async] Mark async suspend function and its resume function pointer intrinsic as nomerge Coroutine splitting is not possible if the one-to-one mapping between the two is lost. Every suspend point must have a matching continuation function pointer. rdar://98404664 Differential Revision: https://reviews.llvm.org/D131684	2022-08-11 11:43:30 -07:00
Fangrui Song	c2d293ea25	Compiler.h: remove unused LLVM_NODISCARD Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D131695	2022-08-11 11:06:24 -07:00
Fangrui Song	57f334d817	[Support] Remove Log2 workaround for Android API level < 18 The function added by D9467 is unneeded. https://github.com/android/ndk/wiki/Changelog-r24 shows that the NDK has moved forward to at least a minimum target API of 19. Reviewed By: srhines Differential Revision: https://reviews.llvm.org/D131656	2022-08-11 17:39:41 +00:00
Fangrui Song	1ca5fee228	[Support] Remove some #if __cplusplus > 201402L	2022-08-11 17:35:02 +00:00
Marc Auberer	84b7055afc	[Docs] Fix duplicate enum item name Removes duplicated names as recommended here: https://llvm.org/docs/CodingStandards.html#doxygen-use-in-documentation-comments Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D131193	2022-08-11 09:59:08 -07:00
Marco Elver	c47ec95531	[MemorySanitizer] Support memcpy.inline and memset.inline Other sanitizers (ASan, TSan, see added tests) already handle memcpy.inline and memset.inline by not relying on InstVisitor to turn the intrinsics into calls. Only MSan instrumentation currently does not support them due to missing InstVisitor callbacks. Fix it by actually making InstVisitor handle MemInlineInst. While the mem.inline intrinsics promise no calls to external functions as an optimization, for the sanitizers we need to break this guarantee since access into the runtime is required either way, and performance can no longer be guaranteed. All other cases, where generating a call is incorrect, should instead use no_sanitize. Fixes: https://github.com/llvm/llvm-project/issues/57048 Reviewed By: vitalybuka, dvyukov Differential Revision: https://reviews.llvm.org/D131577	2022-08-11 10:43:49 +02:00
Martin Storsjö	5563c38fde	[JITLink] Silence GCC warnings about parentheses around && and \|\| operators This silences the following warnings: ../include/llvm/ExecutionEngine/JITLink/JITLink.h:1108:56: warning: suggest parentheses around ‘&&’ within ‘\|\|’ [-Wparentheses] 1105 \| assert(S == Scope::Local \|\| llvm::count_if(AbsoluteSymbols, \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1106 \| [&](const Symbol *Sym) { \| ~~~~~~~~~~~~~~~~~~~~~~~~ 1107 \| return Sym->getName() == Name; \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1108 \| }) == 0 && \| ~~~~~~~~^~ 1109 \| "Duplicate absolute symbol"); \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~	2022-08-11 09:58:11 +03:00
Sunho Kim	7260cdd2e1	[ORC][COFF] Introduce COFFVCRuntimeBootstrapper. Introduces COFFVCRuntimeBootstrapper that loads/initialize vc runtime libraries. In COFF, we must jit-link vc runtime libraries as COFF relocation types have no proper way to deal with out-of-reach data symbols ragardless of linking mode. (even dynamic version msvcrt.lib have tons of static data symbols that must be jit-linked) This class tries to load vc runtime library files from msvc installations with an option to override the path. There are some complications when dealing with static version of vc runtimes. First, they need static initializers to be ran that requires COFFPlatform support but orc runtime will not be usable before vc runtimes are fully initialized. (as orc runtime will use msvc stl libraries) COFFPlatform that will be introduced in a following up patch will collect static initializers and run them manually in host before boostrapping itself. So, the user will have to do the following. 1. Create COFFPlatform that addes static initializer collecting passes. 2. LoadVCRuntime 3. InitializeVCRuntime 4. COFFPlatform.bootstrap() Second, the internal crt initialization function had to be reimplemented in orc side. There are other ways of doing this, but this is the simplest implementation that makes platform fully responsible for static initializer. The complication comes from the fact that crt initialization functions (such as acrt_initialize or dllmain_crt_process_attach) actually run all static initializers by traversing from `__xi_a` symbol to `__xi_z`. This requires symbols to be contiguously allocated in sections alphabetically sorted in memory, which is not possible right now and not practical in jit setting. We might ignore emission of `__xi_a` and `__xi_z` symbol and allocate them ourselves, but we have to take extra care after orc runtime boostrap has been done -- as that point orc runtime should be the one running the static initializers. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D130456	2022-08-11 15:27:47 +09:00
Sunho Kim	5cf0082ae3	[JITLink][COFF][x86_64] Implement SECTION/SECREL relocation. Implements SECTION/SECREL relocation. These are used by debug info (pdb) data. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D130275	2022-08-11 15:12:24 +09:00
Craig Topper	bc1f78cc3b	[RISCV] Rename PROC_ALIAS to TUNE_ALIAS to reflect it's usage. NFC This is not used as general CPU alias. Only to support -mtune. Name it as such. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D131602	2022-08-10 21:44:08 -07:00
aqjune	02e56e2533	[CodeGen] Generate efficient assembly for freeze(poison) version of `mm_cast` intel intrinsics This patch makes the variants of `mm_cast` intel intrinsics that use `shufflevector(freeze(poison), ..)` emit efficient assembly. (These intrinsics are planned to use `shufflevector(freeze(poison), ..)` after shufflevector's semantics update; relevant thread: D103874) To do so, this patch 1. Updates `LowerAVXCONCAT_VECTORS` in X86ISelLowering.cpp to recognize `FREEZE(UNDEF)` operand of `CONCAT_VECTOR` in addition to `UNDEF` 2. Updates X86InstrVecCompiler.td to recognize `insert_subvector` of `FREEZE(UNDEF)` vector as its first operand. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D130339	2022-08-11 13:36:21 +09:00
WANG Xuerui	0c8bfbb374	[LoongArch] Define the new-style reloc types Differential Revision: https://reviews.llvm.org/D131467	2022-08-11 10:37:30 +08:00
Martin Sebor	0dcfe7aa35	[InstCombine] Tighten up known library function signature tests (PR #56463 ) Replace a switch statement used to validate arguments to known library functions with a more consistent table-driven approach and tighten it up.	2022-08-10 14:15:46 -06:00
Freddy Ye	e4888a37d3	[X86][BF16] Enable __bf16 for x86 targets. X86 psABI has updated to support __bf16 type, the ABI of which is the same as FP16. See https://discourse.llvm.org/t/patch-add-optional-bfloat16-support/63149 Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D130964	2022-08-10 09:00:47 +08:00
Adrian Prantl	af4a39f800	Fix modeline	2022-08-09 16:35:24 -07:00
Kazu Hirata	a044d0491e	[llvm-profdata] Support JSON as as an output-only format This patch teaches llvm-profdata to output the sample profile in the JSON format. The new option is intended to be used for research and development purposes. For example, one can write a Python script to take a JSON file and analyze how similar different inline instances of a given function are to each other. I've chosen JSON because Python can parse it reasonably fast, and it just takes a couple of lines to read the whole data: import json with open ('profile.json') as f: profile = json.load(f) Differential Revision: https://reviews.llvm.org/D130944	2022-08-09 16:24:53 -07:00
Dinar Temirbulatov	cab6cd6834	[AArch64][LoopVectorize] Introduce trip count minimal value threshold to ignore tail-folding. After D121595 was commited, I noticed regressions assosicated with small trip count numbersvectorisation by tail folding with scalable vectors. As a solution for those issues I propose to introduce the minimal trip count threshold value. Differential Revision: https://reviews.llvm.org/D130755	2022-08-09 22:10:17 +01:00
Archibald Elliott	b20fe2c25b	[docs][AArch64] Label Features with Arm ARM Names This patch adds the names of the Arm Architecture Reference Manual (ARM) features to the corresponding Subtarget Features in the AArch64 backend and target parser. The aim of this is to make it clearer what architectural features a subtarget feature might enable (so, which features a CPU must provide to support that subtarget feature), and so make it easier to add new CPUs in the future. Differential Revision: https://reviews.llvm.org/D131257	2022-08-09 18:45:50 +01:00
Markus Böck	205701fd47	[llvm][ADT] Allow using structured bindings with `llvm::enumerate` This patch adds the ability to deconstruct the `value_type` returned by `llvm::enumarate` into index and value of the wrapping range. Main use case is the common occurence of using it during loop iteration. After this patch it'd then be possible to write code such as: ``` for (auto [index, value] : enumerate(container)) { ... } ``` where `index` is the current index and `value` a reference to elements in the given container. Differential Revision: https://reviews.llvm.org/D131486	2022-08-09 18:12:40 +02:00

... 3 4 5 6 7 ...

49278 Commits