llvm-project

Commit Graph

Author	SHA1	Message	Date
Sunho Kim	5cf0082ae3	[JITLink][COFF][x86_64] Implement SECTION/SECREL relocation. Implements SECTION/SECREL relocation. These are used by debug info (pdb) data. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D130275	2022-08-11 15:12:24 +09:00
Craig Topper	bc1f78cc3b	[RISCV] Rename PROC_ALIAS to TUNE_ALIAS to reflect it's usage. NFC This is not used as general CPU alias. Only to support -mtune. Name it as such. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D131602	2022-08-10 21:44:08 -07:00
aqjune	02e56e2533	[CodeGen] Generate efficient assembly for freeze(poison) version of `mm_cast` intel intrinsics This patch makes the variants of `mm_cast` intel intrinsics that use `shufflevector(freeze(poison), ..)` emit efficient assembly. (These intrinsics are planned to use `shufflevector(freeze(poison), ..)` after shufflevector's semantics update; relevant thread: D103874) To do so, this patch 1. Updates `LowerAVXCONCAT_VECTORS` in X86ISelLowering.cpp to recognize `FREEZE(UNDEF)` operand of `CONCAT_VECTOR` in addition to `UNDEF` 2. Updates X86InstrVecCompiler.td to recognize `insert_subvector` of `FREEZE(UNDEF)` vector as its first operand. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D130339	2022-08-11 13:36:21 +09:00
WANG Xuerui	0c8bfbb374	[LoongArch] Define the new-style reloc types Differential Revision: https://reviews.llvm.org/D131467	2022-08-11 10:37:30 +08:00
Martin Sebor	0dcfe7aa35	[InstCombine] Tighten up known library function signature tests (PR #56463 ) Replace a switch statement used to validate arguments to known library functions with a more consistent table-driven approach and tighten it up.	2022-08-10 14:15:46 -06:00
Freddy Ye	e4888a37d3	[X86][BF16] Enable __bf16 for x86 targets. X86 psABI has updated to support __bf16 type, the ABI of which is the same as FP16. See https://discourse.llvm.org/t/patch-add-optional-bfloat16-support/63149 Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D130964	2022-08-10 09:00:47 +08:00
Adrian Prantl	af4a39f800	Fix modeline	2022-08-09 16:35:24 -07:00
Kazu Hirata	a044d0491e	[llvm-profdata] Support JSON as as an output-only format This patch teaches llvm-profdata to output the sample profile in the JSON format. The new option is intended to be used for research and development purposes. For example, one can write a Python script to take a JSON file and analyze how similar different inline instances of a given function are to each other. I've chosen JSON because Python can parse it reasonably fast, and it just takes a couple of lines to read the whole data: import json with open ('profile.json') as f: profile = json.load(f) Differential Revision: https://reviews.llvm.org/D130944	2022-08-09 16:24:53 -07:00
Dinar Temirbulatov	cab6cd6834	[AArch64][LoopVectorize] Introduce trip count minimal value threshold to ignore tail-folding. After D121595 was commited, I noticed regressions assosicated with small trip count numbersvectorisation by tail folding with scalable vectors. As a solution for those issues I propose to introduce the minimal trip count threshold value. Differential Revision: https://reviews.llvm.org/D130755	2022-08-09 22:10:17 +01:00
Archibald Elliott	b20fe2c25b	[docs][AArch64] Label Features with Arm ARM Names This patch adds the names of the Arm Architecture Reference Manual (ARM) features to the corresponding Subtarget Features in the AArch64 backend and target parser. The aim of this is to make it clearer what architectural features a subtarget feature might enable (so, which features a CPU must provide to support that subtarget feature), and so make it easier to add new CPUs in the future. Differential Revision: https://reviews.llvm.org/D131257	2022-08-09 18:45:50 +01:00
Markus Böck	205701fd47	[llvm][ADT] Allow using structured bindings with `llvm::enumerate` This patch adds the ability to deconstruct the `value_type` returned by `llvm::enumarate` into index and value of the wrapping range. Main use case is the common occurence of using it during loop iteration. After this patch it'd then be possible to write code such as: ``` for (auto [index, value] : enumerate(container)) { ... } ``` where `index` is the current index and `value` a reference to elements in the given container. Differential Revision: https://reviews.llvm.org/D131486	2022-08-09 18:12:40 +02:00
Jun Zhang	0981975ad0	[LLVM] Use range based for loop, NFC Signed-off-by: Jun Zhang <jun@junz.org>	2022-08-10 00:04:26 +08:00
Simon Pilgrim	adb0cd62a9	[Support] TaskQueue.h - replace std::result_of_t with std::invoke_result_t std::result_of_t is deprecated in C++17 Fixes #57023	2022-08-09 10:52:39 +01:00
Jonas Devlieghere	e8c807fade	[llvm] Don't rely on C++17 deduction guide for array creation Seems like at least one bot (clang-ppc64-aix) is having trouble with the C++17 deduction guide for array creation. Specify the template arguments explicitly.	2022-08-08 22:29:26 -07:00
Jonas Devlieghere	2db6b34ea8	[llvm] Alternative attempt at fixing the modules build with C++17 My initial attempt in `db008af501` resulted in "error: no viable constructor or deduction guide for deduction of template arguments of 'array'". Let's see if we can work around that by using an ArrayRef with an explicit template argument.	2022-08-08 21:59:22 -07:00
Yuta Mukai	5357dd2f43	[MachinePipeliner] Fix Phi generation failure for large stages The previous code overwrites VRMap for prologue stages during Phi generation if a register spans many stages. As a result, the wrong register is used as the one coming from the prologue in Phis at later stages. (A process exists to correct this, but it does not work in all cases.) In addition, VRMap for prologue must be preserved until addBranches(). This patch fixes them by separating the map for Phis into a different variable (VRMapPhi). Reviewed By: bcahoon Differential Revision: https://reviews.llvm.org/D127840	2022-08-09 13:14:26 +09:00
Jonas Devlieghere	860efb10b4	Partially revert "[llvm] Repair the modules build with C++17" This reverts commit `db008af501` because this now breaks the non-module build...	2022-08-08 15:25:10 -07:00
Jonas Devlieghere	db008af501	[llvm] Repair the modules build with C++17 I'm honestly not sure if this is a legitimate issue or not, but after switching from C++14 to C++17, the modules build started confusing arrays and initializer lists. Work around the issue by being explicit.	2022-08-08 15:04:46 -07:00
Alex Brachet	dbd04b853b	[ELF] Support --package-metadata This was recently introduced in GNU linkers and it makes sense for ld.lld to have the same support. This implementation omits checking if the input string is valid json to reduce size bloat. Differential Revision: https://reviews.llvm.org/D131439	2022-08-08 21:31:58 +00:00
Fangrui Song	de9d80c1c5	[llvm] LLVM_FALLTHROUGH => [[fallthrough]]. NFC With C++17 there is no Clang pedantic warning or MSVC C5051.	2022-08-08 11:24:15 -07:00
Daniel Thornburgh	bf48b128b0	[Symbolizer] Implement pc element in symbolizing filter. Implements the pc element for the symbolizing filter, including it's "ra" and "pc" modes. Return addresses ("ra") are adjusted by decrementing one. By default, {{{pc}}} elements are assumed to point to precise code ("pc") locations. Backtrace elements will adopt the opposite convention. Along the way, some minor refactors of value printing and colorization. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D131115	2022-08-08 11:08:48 -07:00
Benjamin Kramer	2960299986	[ADT] Retire llvm::apply_tuple in favor of C++17 std::apply	2022-08-08 18:23:38 +02:00
Jakub Kuderski	ba9dc5f577	[ADT] Add is_splat overload accepting initializer_list Allow for `is_splat` to be used inline, similar to `is_contained`, e.g., ``` if (is_splat({type1, type2, type3, type4})) ... ``` which is much more concise and less typo-prone than an equivalent chain of equality comparisons. My immediate motivation is to clean up some code in the SPIR-V dialect that currently needs to either construct a temporary container or use `makeArrayRef` before calling `is_splat`. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D131289	2022-08-08 10:20:02 -04:00
Simon Pilgrim	9641a201a5	[DAG] Add initial SelectionDAG::canCreateUndefOrPoison support This patch adds basic support for a DAG variant of the canCreateUndefOrPoison call and updates DAGCombiner::visitFREEZE to use it, further Opcodes (including target specific Opcodes) can be handled when we have test coverage. So far, I've left visitFREEZE to just use this for unary nodes (which currently means the existing BITCAST/FREEZE cases) - later patches will add other unary opcodes (with test coverage) and we can also refactor visitFREEZE to support a general number of operands like we do in InstCombinerImpl::pushFreezeToPreventPoisonFromPropagating. I'm not aware of any vector test freeze coverage so the DemandedElts (and the Depth) args are not being used yet - but they are in place. Similarly we will be able to handle poison generating SDNodeFlags as and when it becomes an issue. Part of the work for D106675 / PR50468 Differential Revision: https://reviews.llvm.org/D130646	2022-08-08 15:16:06 +01:00
Tobias Hieta	576375a2d6	[LLD][COFF] Ignore DEBUG_S_XFGHASH_TYPE/VIRTUAL These are new debug types that ships with the latest Windows SDK and would warn and finally fail lld-link. The symbols seems to be related to Microsoft's XFG which is their version of CFG. We can't handle any of this yet, so for now we can just ignore these types so that lld doesn't fail with a new version of Windows SDK. Fixes: #56285 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D129378	2022-08-08 15:53:52 +02:00
Benjamin Kramer	090bdaad34	[Support] Use std::shared_mutex when we're not on old MacOS In C++17 mode this is always available.	2022-08-08 13:31:05 +02:00
Stefan Gränitz	7a66fe1075	Wrap `llvm_unreachable` macro in do-while loop Macros that expand into multiple terms can cause interesting preprocessor hickups depending on the context they are used in. https://github.com/llvm/llvm-project/issues/56867 reported a miscompilation of `llvm_unreachable(msg)` inside a `LLVM_DEBUG({ ... })` block. We were able to fix it by wrapping the expansion in a `do {} while(false)`. Differential Revision: https://reviews.llvm.org/D131337	2022-08-08 13:15:32 +02:00
Shubham Narlawar	ab4fc87a9d	[DAG] Emit table lookup from TargetLowering::expandCTTZ() This patch emits table lookup in expandCTTZ. Context - https://reviews.llvm.org/D113291 transforms set of IR instructions to cttz intrinsic but there are some targets which does not support CTTZ or CTLZ. Hence, I generate a table lookup in TargetLowering::expandCTTZ(). Differential Revision: https://reviews.llvm.org/D128911	2022-08-08 12:08:05 +01:00
Benjamin Kramer	b4e9977fc1	Remove C++17 #ifdefs around the implicit conversion between StringRef and string_view This is no longer needed as LLVM is built with C++17 now. Also drop the explicit conversion to std::string as the implicit conversion to std::string_view gets picked first anyways.	2022-08-08 12:59:23 +02:00
Simon Tatham	72017e9b16	[llvm-objdump,ARM] Fix big-endian AArch32 disassembly. The ABI for big-endian AArch32, as specified by AAELF32, is above- averagely complicated. Relocatable object files are expected to store instruction encodings in byte order matching the ELF file's endianness (so, big-endian for a BE ELF file). But executable images can //either// do that //or// store instructions little-endian regardless of data and ELF endianness (to support BE32 and BE8 platforms respectively). They signal the latter by setting the EF_ARM_BE8 flag in the ELF header. (In the case of the Thumb instruction set, this all means that each 16-bit halfword of a Thumb instruction is stored in one or other endianness. The two halfwords of a 32-bit Thumb instruction must appear in the same order no matter what, because the first halfword is the one that must avoid overlapping the encoding of any 16-bit Thumb instruction.) llvm-objdump was unconditionally expecting Arm instructions to be stored little-endian. So it would correctly disassemble a BE8 image, but if you gave it a BE32 image or a BE object file, it would retrieve every instruction in byte-swapped form and disassemble it to nonsense. (Even an object file output by LLVM itself, because ARMMCCodeEmitter outputs instructions big-endian in big-endian mode, which is correct for writing an object file.) This patch allows llvm-objdump to correctly disassemble all three of those classes of Arm ELF file. It does it by introducing a new SubtargetFeature for big-endian instructions, setting it from the ELF image type and flags during llvm-objdump setup, and teaching both ARMDisassembler and llvm-objdump itself to pay attention to it when retrieving instruction data from a section being disassembled. Differential Revision: https://reviews.llvm.org/D130902	2022-08-08 10:49:51 +01:00
Anubhab Ghosh	1eee6de873	[Orc][JITLink] Slab based memory allocator to reduce RPC calls Calling reserve() used to require an RPC call. This commit allows large ranges of executor address space to be reserved. Subsequent calls to reserve() will return subranges of already reserved address space while there is still space available. Differential Revision: https://reviews.llvm.org/D130392	2022-08-08 15:13:41 +05:30
Nathan James	5512f398a0	[ADT] Update Optional Deprecation with fix-it When Optional accessors were deprecated, in D131349, the standard c++14 style attribute was used. Clang has a slightly better deprecated attribute that enables simpler migration by embedding fix-its. Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D131381	2022-08-08 10:39:31 +01:00
Carlos Alberto Enciso	a3f7a2c183	[CodeView] Add function to get size in bytes for TypeIndex/CVType. Given a TypeIndex or CVType return its size in bytes. Basically it is the inverse to 'CodeViewDebug::lowerTypeBasic', that returns a TypeIndex based in a size. Reviewed By: rnk, djtodoro Differential Revision: https://reviews.llvm.org/D129846	2022-08-08 08:48:23 +01:00
Kazu Hirata	e20d210eef	[llvm] Qualify auto (NFC) Identified with readability-qualified-auto.	2022-08-07 23:55:27 -07:00
Jun Zhang	98339ac7af	[Support] move llvm::llvm_is_multithread to header, NFC This allow optimization without LTO. Also remove some useless else-ifs. Signed-off-by: Jun Zhang <jun@junz.org> Differential Revision: https://reviews.llvm.org/D131313	2022-08-08 08:49:02 +08:00
Kazu Hirata	b5f8d42efe	[ADT] Deprecate Optional::{hasValue,getValue} (NFC) Differential Revision: https://reviews.llvm.org/D131349	2022-08-07 11:30:58 -07:00
Alexey Bader	f0f1bcadc7	[demangler] Add getters for Qual/Vector/Pointer types These are useful for downstream tool aligning the mangling of data types which differ between different languages/targets. Patch by Steffen Larsen <steffen.larsen@intel.com> Differential Revision: https://reviews.llvm.org/D130909	2022-08-07 02:10:05 -07:00
Kazu Hirata	ba0407ba86	[llvm] Use range-based for loops (NFC)	2022-08-07 00:16:21 -07:00
Kazu Hirata	a2d4501718	[llvm] Fix comment typos (NFC)	2022-08-07 00:16:14 -07:00
Kazu Hirata	3b114087c3	[llvm] Drop unnecessary const from return types (NFC) Identified with readability-const-return-type.	2022-08-07 00:16:11 -07:00
Fangrui Song	fa66789d06	[llvm] LLVM_NODISCARD => [[nodiscard]]. NFC With C++17 there is no Clang pedantic warning.	2022-08-07 00:26:33 +00:00
Fangrui Song	bf5550b679	[ADT] Fix signature of StringSet::insert to match StringMap and unordered_set.	2022-08-06 22:48:41 +00:00
Krzysztof Parzyszek	2bc390bdd6	[RDF] Use default TargetOperandInfo if not given in constructor All current in-tree users use the default implementation.	2022-08-06 14:32:52 -05:00
Markus Böck	f7b73b7e8e	[llvm] Remove uses of deprecated `std::iterator` std::iterator has been deprecated in C++17 and some standard library implementations such as MS STL or libc++ emit deperecation messages when using the class. Since LLVM has now switched to C++17 these will emit warnings on these implementations, or worse, errors in build configurations using -Werror. This patch fixes these issues by replacing them with LLVMs own llvm::iterator_facade_base which offers a superset of functionality of std::iterator. Differential Revision: https://reviews.llvm.org/D131320	2022-08-06 14:07:37 +02:00
Tobias Hieta	4b8db17c32	[llvm][macos] Fix usage of std::shared_mutex on old macOS SDK versions When setting CMAKE_CXX_STANDARD to 17 and targeting a macOS version under 10.12 the ifdefs would try to use std::shared_mutex because the of the C++ standard. This should also check the targeted SDK. See discussion in: https://reviews.llvm.org/D130689 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D131063	2022-08-05 21:45:23 +02:00
Zhaoshi Zheng	99e50e5838	[WinEH][ARM64] Split Unwind Info for Fucntions Larger than 1MB Create function segments and emit unwind info of them. A segment must be less than 1MB and no prolog or epilog is splitted between two segments. This patch should generate correct, though not optimal, unwind info for large functions. Currently it only generate pacted info (.pdata) only for functions that are less than 1MB (single-segment functions). This is NFC from before this patch. The next step is to enable (.pdata) only unwind info for the first segment or segments that have neither prolog or epilog in a multi-segment function. Another future work item is to further split segments that require more than 255 code words or have more than 65535 epilogs. Reference: https://docs.microsoft.com/en-us/cpp/build/arm64-exception-handling#function-fragments Differential Revision: https://reviews.llvm.org/D130049	2022-08-05 11:46:41 -07:00
Dawid Jurczak	1bd31a6898	[NFC] Add SmallVector constructor to allow creation of SmallVector<T> from ArrayRef of items convertible to type T Extracted from https://reviews.llvm.org/D129781 and address comment: https://reviews.llvm.org/D129781#3655571 Differential Revision: https://reviews.llvm.org/D130268	2022-08-05 13:35:41 +02:00
Paul Kirth	a812b39e8c	[llvm][ir] Add missing license to ProfDataUtils We failed to add these in D128860 or D128858 Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D131226	2022-08-05 03:39:13 +00:00
Fangrui Song	7d6017fd31	[TTI] Change new getVectorInstrCost overload to use const reference after D131114 A const reference is preferred over a non-null const pointer. `Type *` is kept as is to match the other overload. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D131197	2022-08-04 15:16:51 -07:00
Mingming Liu	bc8f2f3649	[AArch64][TTI][NFC] Overload method 'getVectorInstrCost' to provide vector instruction itself, as a context information for cost estimation. 1) Overloaded (instruction-based) method is a wrapper around the current (opcode-based) method. 2) This patch also changes a few callsites (VectorCombine.cpp, SLPVectorizer.cpp, CodeGenPrepare.cpp) to call the overloaded method. 3) This is a split of D128302. Differential Revision: https://reviews.llvm.org/D131114	2022-08-04 12:58:25 -07:00
Marc Auberer	9dbe839627	[Docs] Fix missing docs strings for CallingConv.h Replaces ``` // ``` with ``` /// ``` for some code lines to make it visible in the auto-generated documentation. Reviewed By: dblaikie, MaskRay Differential Revision: https://reviews.llvm.org/D131152	2022-08-04 19:14:53 +00:00
Daniel Thornburgh	22df238d4a	[Symbolizer] Implement data symbolizer markup element. This connects the Symbolizer to the markup filter and enables the first working end-to-end flow using the filter. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D130187	2022-08-04 10:20:29 -07:00
Ellis Hoag	12e78ff881	[InstrProf] Add the skipprofile attribute As discussed in [0], this diff adds the `skipprofile` attribute to prevent the function from being profiled while allowing profiled functions to be inlined into it. The `noprofile` attribute remains unchanged. The `noprofile` attribute is used for functions where it is dangerous to add instrumentation to while the `skipprofile` attribute is used to reduce code size or performance overhead. [0] https://discourse.llvm.org/t/why-does-the-noprofile-attribute-restrict-inlining/64108 Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D130807	2022-08-04 08:45:27 -07:00
Joshua Cranmer	2138c90645	[IR] Move support for dxil::TypedPointerType to LLVM core IR. This allows the construct to be shared between different backends. However, it still remains illegal to use TypedPointerType in LLVM IR--the type is intended to remain an auxiliary type, not a real LLVM type. So no support is provided for LLVM-C, nor bitcode, nor LLVM assembly (besides the bare minimum needed to make Type->dump() work properly). Reviewed By: beanz, nikic, aeubanks Differential Revision: https://reviews.llvm.org/D130592	2022-08-04 10:41:11 -04:00
Lang Hames	b5f76d83ff	[ORC] Ensure that llvm_orc_registerJITLoaderGDBAllocAction is linked into tools. Add a reference to llvm_orc_registerJITLoaderGDBAllocAction from the linkComponents function in the lli, llvm-jitlink, and llvm-jitlink-executor tools. This ensures that llvm_orc_registerJITLoaderGDBAllocAction is not dead-stripped in optimized builds, which may cause failures in these tools. The llvm_orc_registerJITLoaderGDBAllocAction function was originally added with MachO debugging support in `69be352a19`, but that patch failed to update the linkComponents functions. http://llvm.org/PR56817	2022-08-03 17:51:45 -07:00
Arthur Eubanks	203296d642	[BoundsChecking] Fix merging of sizes BoundsChecking uses ObjectSizeOffsetEvaluator to keep track of the underlying size/offset of pointers in allocations. However, ObjectSizeOffsetVisitor (something ObjectSizeOffsetEvaluator uses to check for constant sizes/offsets) doesn't quite treat sizes and offsets the same way as BoundsChecking. BoundsChecking wants to know the size of the underlying allocation and the current pointer's offset within it, but ObjectSizeOffsetVisitor only cares about the size from the pointer to the end of the underlying allocation. This only comes up when merging two size/offset pairs. Add a new mode to ObjectSizeOffsetVisitor which cares about the underlying size/offset rather than the size from the current pointer to the end of the allocation. Fixes a false positive with -fsanitize=bounds. Reviewed By: vitalybuka, asbirlea Differential Revision: https://reviews.llvm.org/D131001	2022-08-03 17:21:19 -07:00
Vitaly Buka	a2aa6809a8	[NFC][Inliner] Add cl::opt<int> to tune InstrCost The plan is tune this for sanitizers. Differential Revision: https://reviews.llvm.org/D131123	2022-08-03 17:14:10 -07:00
Congzhe Cao	76be554931	[DependenceAnalysis][PR56275] Normalize negative dependence analysis results This patch is the first of the two-patch series (D130188, D130179) that resolve PR56275 (https://github.com/llvm/llvm-project/issues/56275) which is a missed opportunity, where a perfrectly valid case for loop interchange failed interchange legality. If the distance/direction vector produced by dependence analysis (DA) is negative, it needs to be normalized (reversed). This patch provides helper functions `isDirectionNegative()` and `normalize()` in DA that does the normalization, and clients can query DA to do normalization if needed. A pass option `<normalized-results>` is added to DependenceAnalysisPrinterPass, and we leverage it to update DA test cases to make sure of test coverage. The test cases added in `Banerjee.ll` shows that negative vectors are normalized with `print<da><normalized-results>`. Reviewed By: bmahjour, Meinersbur, #loopoptwg Differential Revision: https://reviews.llvm.org/D130188	2022-08-03 19:59:00 -04:00
Bill Wendling	239c831de4	Add switch to use "source_filename" instead of a hash ID for globally promoted local During LTO a local promoted to a global gets a unique suffix based on a hash of the module IR. This means that changes in the local's module can affect the contents in another module that imported it (because the name of the imported promoted local is changed, but that doesn't reflect a real change in the importing module). So any tool that's validating changes to the importing module will see a superficial change. Instead of using the module hash, we can use the "source_filename" if it exists to generate a unique identifier that doesn't change due to LTO shenanigans. Differential Revision: https://reviews.llvm.org/D128863	2022-08-03 16:41:56 -07:00
Mircea Trofin	0cb9746a7d	[nfc][mlgo] Separate logger and training-mode model evaluator This just shuffles implementations and declarations around. Now the logger and the TF C API-based model evaluator are separate. Differential Revision: https://reviews.llvm.org/D131116	2022-08-03 16:20:28 -07:00
Vitaly Buka	e056e74dda	[NFC][inline] Add const to an argument	2022-08-03 13:20:47 -07:00
Fraser Cormack	646e2f4803	[VP] Rename VP int<->float conversion ISD opcodes These should be named like the non-VP versions for consistency. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D130967	2022-08-03 10:04:38 +01:00
Jannik Silvanus	d0bfebda5b	[Docs] Improve cycle and closed path definitions Improve the cycle definition, by avoiding usage of not yet defined or only vaguely defined terminology inside definitions. More precisely, the existing definition defined "outermost cycles", and then proceeded to use the term "cycles" for further definitions, which in turn were used to actually define "cycles". Now, instead only define "cycles". This does not change the meaning of a cycle, which depends on the chosen surrounding (subgraph) of a CFG. Also mention the function CFG in the first definition, because later later definitions require it anyways. Also slightly improve the definition of a closed path, by explicitly requiring the inner nodes to be distinct. Differential Revision: https://reviews.llvm.org/D130891	2022-08-03 10:28:13 +02:00
Nikita Popov	b128e057c1	[AA] Make ModRefInfo a bitmask enum (NFC) Mark ModRefInfo as a bitmask enum, which allows using normal & and \| operators on it. This supersedes various functions like unionModRef() and intersectModRef(). I think this makes the code cleaner than going through helper functions... Differential Revision: https://reviews.llvm.org/D130870	2022-08-03 10:05:55 +02:00
Max Kazantsev	34ae308c73	[SCEV] Use context to strengthen flags of BinOps Sometimes SCEV cannot infer nuw/nsw from something as simple as ``` len in [0, MAX_INT] ... iv = phi(0, iv.next) guard(iv <s len) guard(iv <u len) iv.next = iv + 1 ``` just because flag strenthening only relies on definition and does not use local facts. This patch adds support for the simplest case: inference of flags of `add(x, constant)` if we can contextually prove that `x <= max_int - constant`. In case if it has negative CT impact, we can add an option to switch it off. I woudln't expect that though. Differential Revision: https://reviews.llvm.org/D129643 Reviewed By: apilipenko	2022-08-03 14:08:57 +07:00
Paul Kirth	d434e40f39	[llvm][NFC] Refactor code to use ProfDataUtils In this patch we replace common code patterns with the use of utility functions for dealing with profiling metadata. There should be no change in functionality, as the existing checks should be preserved in all cases. Reviewed By: bogner, davidxl Differential Revision: https://reviews.llvm.org/D128860	2022-08-03 00:09:45 +00:00
Chris Bieneman	ee4d815008	[DX] Remove IntrNoMem from create handle intrinsic The create handle intrinsic calls can't be removed, so it was incorrect to mark them as IntrNoMem.	2022-08-02 16:57:22 -05:00
Nicolai Hähnle	f7872cdce1	CommandLine: add and use cl::SubCommand::get{All,TopLevel} Prefer using these accessors to access the special sub-commands corresponding to the top-level (no subcommand) and all sub-commands. This is a preparatory step towards removing the use of ManagedStatic: with a subsequent change, these global instances will be moved to be regular function-scope statics. It is split up to give downstream projects a (albeit short) window in which they can switch to using the accessors in a forward-compatible way. Differential Revision: https://reviews.llvm.org/D129118	2022-08-02 23:49:16 +02:00
Mircea Trofin	4146c1756d	[nfc] Remove unused parameter in TailDuplicator::duplicateSimpleBB Differential Revision: https://reviews.llvm.org/D131008	2022-08-02 13:39:34 -07:00
Vladislav Dzhidzhoev	f6d9f00031	[DebugInfo] Test commit: update irrelevant comments Differential Revision: https://reviews.llvm.org/D130998	2022-08-02 20:21:24 +03:00
Jay Foad	bb2832410e	[IRBuilder] CreateIntrinsic with implicit mangling Add a new IRBuilderBase::CreateIntrinsic which takes the return type and argument values for the intrinsic call but does not take an explicit list of types to mangle. Instead the builder works this out from the intrinsic declaration and the types of the supplied arguments. This means that the mangling is hidden from the client, which in turn means that intrinsic definitions can change which arguments are mangled without requiring any changes to the client code. Differential Revision: https://reviews.llvm.org/D130776	2022-08-02 13:08:35 +01:00
Dawid Jurczak	78650b7861	[NFC] Remove some boilerplate from SmallVector header In SmallVector header we use couple of times exactly same enable_if, in this change we de-duplicate it and declare only once. Extracted from: https://reviews.llvm.org/D129990 Differential Revision: https://reviews.llvm.org/D130779	2022-08-02 11:45:55 +02:00
David Sherwood	4ef9cb6c17	[AArch64][LoopVectorize] Disable tail-folding for SVE when loop has interleaved accesses If we have interleave groups in the loop we want to vectorise then we should fall back on normal vectorisation with a scalar epilogue. In such cases when tail-folding is enabled we'll almost certainly go on to create vplans with very high costs for all vector VFs and fall back on VF=1 anyway. This is likely to be worse than if we'd just used an unpredicated vector loop in the first place. Once the vectoriser has proper support for analysing all the costs for each combination of VF and vectorisation style, then we should be able to remove this. Added an extra test here: Transforms/LoopVectorize/AArch64/sve-tail-folding-option.ll Differential Revision: https://reviews.llvm.org/D128342	2022-08-02 09:52:33 +01:00
jacquesguan	e38af7ba95	[LV] Refactor getExtendedAddReductionCost to support other extended reduction more than Add. Now the API getExtendedAddReductionCost is used to determine the cost of extended Add reduction with optional Mul. For Arm, it could cover the cases. But for other target, for example: RISCV, they support other kinds of extended recution, such as FAdd. This patch does the following changes: 1, Split getExtendedAddReductionCost into 2 new API: getExtendedReductionCost which handles the extended reduction with addtional input of Opcode; getMulAccReductionCost which handle the MLA cases the getExtendedAddReductionCost. 2, Refactor getReductionPatternCost, add some contraint condition to make sure the getMulAccReductionCost should only handle the reuction of Add + Mul. Differential Revision: https://reviews.llvm.org/D130868	2022-08-02 16:02:38 +08:00
Fangrui Song	2b70bebc6d	[MachineFunctionPass] Support -print-changed={,c}diff{,-quiet} Follow-up to D130434. Move doSystemDiff to PrintPasses.cpp and call it in MachineFunctionPass.cpp. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D130833	2022-08-01 12:56:15 -07:00
Simon Pilgrim	630a65f3da	SelectionDAGNodes.h - fix Wdocumentation warnings. NFC.	2022-08-01 15:09:16 +01:00
Simon Pilgrim	27105e2f30	MisExpect.h - fix Wdocumentation warnings. NFC.	2022-08-01 15:06:30 +01:00
Anubhab Ghosh	ac3cb4ecd0	[Orc] Disable use of shared memory on Android shm_open and shm_unlink are not available on Android. This commit disables SharedMemoryMapper on Android until a better solution is available. https://android.googlesource.com/platform/bionic/+/refs/heads/master/docs/status.md https://github.com/llvm/llvm-project/issues/56812 Differential Revision: https://reviews.llvm.org/D130814	2022-08-01 18:48:39 +05:30
Lucas Prates	ba9caf9170	[Arm] Fix parsing and emission of Tag_also_compatible_with eabi attribute According to the ABI for the Arm Architecture, the value for the Tag_also_compatible_with eabi attribute is represented by an NTBS entry. This string value, in turn, is composed of a pair of tag+value encoded in one of two formats: - ULEB128: tag, ULEB128: value, 0. - ULEB128: tag, NBTS: data. (See [[ `60a8eb8c55/addenda32/addenda32.rst (3373secondary-compatibility-tag)` \| section 3.3.7.3 on the Addenda to, and Errata in, the ABI for the Arm Architecture ]].) Currently the Arm assembly parser and streamer ignore the encoding of the attribute's NTBS value, which can result in incorrect attributes being emitted in both assembly and object file outputs. This patch fixes these issues by properly handing the value's encoding. An update to llvm-readobj to properly handle the attribute's value will be covered by a separate patch. Patch by Victor Campos and Lucas Prates. Reviewed By: vhscampos Differential Revision: https://reviews.llvm.org/D129500	2022-08-01 13:28:01 +01:00
Dominik Adamski	d90b7bf2c5	Add support for lowering simd if clause to LLVM IR Scope of changes: 1) Added new function to generate loop versioning 2) Added support for if clause to applySimd function 2) Added tests which confirm that lowering is successful If ifCond is specified, then collapsed loop is duplicated and if branch is added. Duplicated loop is executed if simd ifCond is evaluated to false. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D129368 Signed-off-by: Dominik Adamski <dominik.adamski@amd.com>	2022-08-01 04:43:32 -05:00
David Sherwood	41119a0f52	[DAGCombiner] Extend visitAND to include EXTRACT_SUBVECTOR Eliminate an AND by redefining an anyext\|sext\|zext. (and (extract_subvector (anyext\|sext\|zext v) _) iN_mask) => (extract_subvector (zeroext_iN v)) Differential Revision: https://reviews.llvm.org/D130782	2022-08-01 10:32:32 +01:00
Vladislav Dzhidzhoev	facb3ac385	[GlobalISel][DebugInfo] salvageDebugInfo analogue for gMIR Salvage debug info of instruction that is about to be deleted as dead in Combiner pass. Currently supported instructions are COPY and G_TRUNC. It allows to salvage debug info of some dead arguments of functions, by putting DWARF expression corresponding to the instruction being deleted into related DBG_VALUE instruction. Here is an example of missing variables location https://godbolt.org/z/K48osb9dK. We see that arguments x, y of function foo are not available in debugger, and corresponding DBG_VALUE instructions have undefined register operand instead of variables locaton after Aarch64PreLegalizerCombiner pass. The reason is that registers where variables are located are removed as dead (with instruction G_TRUNC). We can use salvageDebugInfo analogue for gMIR to preserve debug locations of dead variables. Statistics of llvm object files built with vs without this commit on -O2 optimization level (CMAKE_BUILD_TYPE=RelWithDebInfo, -fglobal-isel) on Aarch64 (macOS): Number of variables with 100% of parent scope covered by DW_AT_location has been increased by 7,9%. Number of variables with 0% coverage of parent scope has been decreased by 1,2%. Number of variables processed by location statistics has been increased by 2,9%. Average PC ranges coverage has been increased by 1,8 percentage points. Coverage can be improved by supporting more instructions, or by calling salvageDebugInfo for instructions that are deleted during Combiner rules exection. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D129909	2022-08-01 11:14:53 +02:00
Nikita Popov	5b1d10bda6	[AA] Drop setModAndRef() function (NFC) Without the "must" state, this function is pointless, because we can just directly create a ModRef instead.	2022-08-01 07:55:39 +02:00
Nikita Popov	f96ea53e89	[AA] Do not track Must in ModRefInfo getModRefInfo() queries currently track whether the result is a MustAlias on a best-effort basis. The only user of this functionality is the optimized memory access type in MemorySSA -- which in turn has no users. Given that this functionality has not found a user since it was introduced five years ago (in D38862), I think we should drop it again. The context is that I'm working to separate FunctionModRefBehavior to track mod/ref for different location kinds (like argmem or inaccessiblemem) separately, and the fact that ModRefInfo also has an unrelated Must flag makes this quite awkward, especially as this means that NoModRef is not a zero value. If we want to retain the functionality, I would probably split getModRefInfo() results into a part that just contains the ModRef information, and a separate part containing a (best-effort) AliasResult. Differential Revision: https://reviews.llvm.org/D130713	2022-08-01 07:14:31 +02:00
Chuanqi Xu	9701053517	Introduce @llvm.threadlocal.address intrinsic to access TLS variable This belongs to a series of patches which try to solve the thread identification problem in coroutines. See https://discourse.llvm.org/t/address-thread-identification-problems-with-coroutine/62015 for a full background. The problem consists of two concrete problems: TLS variable and readnone functions. This patch tries to convert the TLS problem to readnone problem by converting the access of TLS variable to an intrinsic which is marked as readnone. The readnone problem would be addressed in following patches. Reviewed By: nikic, jyknight, nhaehnle, ychen Differential Revision: https://reviews.llvm.org/D125291	2022-08-01 10:51:30 +08:00
Luís Marques	260a641068	[RISCV] Pre-RA expand pseudos pass Expand load address pseudo-instructions earlier (pre-ra) to allow follow-up patches to fold the addi of PseudoLLA instructions into the immediate operand of load/store instructions. Differential Revision: https://reviews.llvm.org/D123264	2022-07-31 23:19:00 +02:00
Dawid Jurczak	50eb5bcfcd	[NFC] Remove redundant CalculateSmallVectorDefaultInlinedElements usage from to_vector utility CalculateSmallVectorDefaultInlinedElements<..>::value is already used as default value for second template parameter in SmallVector class declaration. There is no need to pass it explicitly in to_vector. Extracted from: https://reviews.llvm.org/D129781 Differential Revision: https://reviews.llvm.org/D130774	2022-07-31 10:51:01 +02:00
Sunho Kim	e781451140	[JITLink] Relax zero-fill edge assertions. Relax zero-fill edge assertions to only consider relocation edges. Keep-alive edges to zero-fill blocks can cause this assertion which is too strict. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D130450	2022-07-31 08:34:10 +09:00
Kazu Hirata	5dd78c3608	[IR] Fix a header guard (NFC) Identified with llvm-header-guard.	2022-07-30 10:35:45 -07:00
Simon Pilgrim	2f08872d81	OMPIRBuilder.h - fix Wdocumentation warning. NFC.	2022-07-30 15:43:41 +01:00
Simon Pilgrim	caa971f216	SelectionDAGNodes.h - fix Wdocumentation warnings. NFC.	2022-07-30 11:05:33 +01:00
Dawid Jurczak	65053fbc0d	[NFC] Use more appropriate SmallVectorImpl::append call in std::initializer_list SmallVector constructor Since we are in constructor there is no need to perform redundant call to SmallVectorImpl::clear() inside assign function. Although calling cheaper append function instead assign doesn't make any difference on optimized builds (DSE does the job removing stores), we still save some cycles for debug binaries. Differential Revision: https://reviews.llvm.org/D130361	2022-07-30 09:17:35 +02:00
Fangrui Song	ce6dd4e835	Revert D130458 "[llvm-objcopy] Support --{,de}compress-debug-sections for zstd" This reverts commit `c26dc2904b`. The new Zstd dispatch has an ongoing design discussion related to https://reviews.llvm.org/D130516#3688123 . Revert for now before it is resolved.	2022-07-29 15:46:51 -07:00
Jay Foad	9436a85eb6	[IRBuilder] Make createCallHelper a member function. NFC. This just avoids explicitly passing in 'this' as an argument in a bunch of places. Differential Revision: https://reviews.llvm.org/D130752	2022-07-29 21:17:26 +01:00
Alex Bradbury	85c6fab8d3	[RISCV][doc] Improve documentation comments on atomics intrinsics Previously, it was necessary to check the atomics lowering or expansion code to determine which argument was which. This patch additionally tweaks the documentation comment in TargetLowering to clarify the return value of the intrinsic and that the intrinsic isn't required to mask and shift the result (this is handled by the target-independent code in AtomicExpandPass).	2022-07-29 15:09:12 +01:00
Alexey Lapshin	ece341f598	[Debuginfo][DWARF][NFC] Add paired methods working with DWARFDebugInfoEntry. This review is extracted from D96035. DWARF Debuginfo classes have two representations for DIEs: DWARFDebugInfoEntry (short) and DWARFDie(extended). Depending on the task, it might be more convenient to use DWARFDebugInfoEntry or/and DWARFDie. DWARFUnit class already has methods working with DWARFDie and DWARFDebugInfoEntry. This patch adds more methods working with DWARFDebugInfoEntry to have paired functionality. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D126059	2022-07-29 16:40:17 +03:00
Simon Pilgrim	63bdff3eb8	VirtualFileSystem.h - don't use \param in general description - use \p instead to fix Wdocumentation warnings.	2022-07-29 12:21:44 +01:00
Simon Pilgrim	9f68bb1da5	Fix unknown parameter Wdocumentation warning. NFC.	2022-07-29 12:17:30 +01:00
Simon Pilgrim	9082c13106	[Support] Add KnownBits::concat method Add a method for the various cases where we need to concatenate 2 KnownBits together (BUILD_PAIR and SHIFT_PARTS in particular) - uses the existing APInt::concat 'HiBits.concat(LoBits)' convention Differential Revision: https://reviews.llvm.org/D130557	2022-07-29 11:06:39 +01:00
Florian Hahn	214e2d8fe5	[SCEV] Avoid repeated proveNoSignedWrapViaInduction calls. At the moment, proveNoSignedWrapViaInduction may be called for the same AddRec a large number of times via getSignExtendExpr. This can have a severe compile-time impact for very loop-heavy code. If proveNoSignedWrapViaInduction failed to prove NSW the first time, it is unlikely to succeed on subsequent tries and the cost doesn't seem to be justified. This is the signed version of `8daa338297` / D130648. This can drastically improve compile-time in some excessive cases and also has a slightly positive compile-time impact on CTMark: NewPM-O3: -0.06% NewPM-ReleaseThinLTO: -0.04% NewPM-ReleaseLTO-g: -0.04% https://llvm-compile-time-tracker.com/compare.php?from=8daa338297d533db4d1ae8d3770613eb25c29688&to=aed126a196e7a5a9803543d9b4d6bdb233d0009c&stat=instructions Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D130694	2022-07-29 09:15:03 +01:00

1 2 3 4 5 ...

48889 Commits