llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	7120c67c04	Revert rG5adc94bb8a23eb819f6ca80e722f5b0e6e41401d "New regression test against expandMemCpyAsLoop utility" This was causing link errors on buildbots (and locally)	2022-04-05 08:11:21 +01:00
Evgeniy Brevnov	5adc94bb8a	New regression test against expandMemCpyAsLoop utility Unit test for functionality going to be added by D118441 Differential Revision: https://reviews.llvm.org/D118440	2022-04-05 13:28:13 +07:00
Alex Brachet	47f59df892	Revert "Reland "[lit] Use sharding for GoogleTest format"" This reverts commit `948f3deca9`.	2022-04-04 16:34:28 +00:00
Yuanfang Chen	948f3deca9	Reland "[lit] Use sharding for GoogleTest format" This relands commit `a87ba5c86d`. Adjust llvm/utils/lit/tests/googletest-timeout.py for new test output.	2022-04-03 22:35:45 -07:00
Yuanfang Chen	c0f90c84b1	Revert "[lit] Use sharding for GoogleTest format" This reverts commit `a87ba5c86d`. Breaks bots: https://lab.llvm.org/buildbot/#/builders/196/builds/10454	2022-04-03 20:04:55 -07:00
Yuanfang Chen	a87ba5c86d	[lit] Use sharding for GoogleTest format This helps lit unit test performance by a lot, especially on windows. The performance gain comes from launching one gtest executable for many subtests instead of one (this is the current situation). The shards are executed by the test runner and the results are stored in the json format supported by the GoogleTest. Later in the test reporting stage, all test results in the json file are retrieved to continue the test results summary etc. On my Win10 desktop, before this patch: `check-clang-unit`: 177s, `check-llvm-unit`: 38s; after this patch: `check-clang-unit`: 37s, `check-llvm-unit`: 11s. On my Linux machine, before this patch: `check-clang-unit`: 46s, `check-llvm-unit`: 8s; after this patch: `check-clang-unit`: 7s, `check-llvm-unit`: 4s. Reviewed By: yln, rnk Differential Revision: https://reviews.llvm.org/D122251	2022-04-03 19:47:02 -07:00
Nathan Sidwell	d7692c0f9b	[demangler] Fix node matcher test Move node matcher compilation test to non-anonymous namespace and avoid using attribute.	2022-04-01 05:51:47 -07:00
Nathan Sidwell	abffdd8876	[demangler] Fix node matchers * Add instantiation tests to ItaniumDemangleTest, to make sure all match functions provide constructor arguments to the provided functor. * Fix the Node constructors that lost const qualification on arguments. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D122665	2022-04-01 05:19:34 -07:00
Adrian Kuegel	ba8f45f642	[OMPIRBuilder] Remove unused variable in test (NFC)	2022-04-01 10:33:08 +02:00
Peixin-Qiao	3e7415a0ff	[OMPIRBuilder] Support ordered clause specified without parameter This patch supports ordered clause specified without parameter in worksharing-loop directive in the OpenMPIRBuilder and lowering MLIR to LLVM IR. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D114940	2022-04-01 16:17:29 +08:00
yanming	a7c0b7504c	[VP] Add more cast VPintrinsic and docs. Add vp.fptoui, vp.uitofp, vp.fptrunc, vp.fpext, vp.trunc, vp.zext, vp.sext, vp.ptrtoint, vp.inttoptr intrinsic and docs. Reviewed By: frasercrmck, craig.topper Differential Revision: https://reviews.llvm.org/D122291	2022-04-01 09:16:10 +08:00
Serge Pavlov	881350a92d	Mapping of FP operations to constrained intrinsics A new function 'getConstrainedIntrinsic' is added, which for any gived instruction returns id of the corresponding constrained intrinsic. If there is no constrained counterpart for the instruction or the instruction is already a constrained intrinsic, the function returns zero. This is recommit of `115b3ace36`, reverted in `8160dd582b`. Differential Revision: https://reviews.llvm.org/D69562	2022-03-31 11:07:47 +07:00
Eli Friedman	72517e27c1	[AArch64] Fix AArch64TargetParser.def to match AArch64.td. Currently, we have two different lists of features each CPU supports... and those lists aren't consistent. This patch assumes AArch64.td is right, and tries to fix AArch64TargetParser to match. It's hard to find documentation for the right features, but reviewers have confirmed these changes. Probably we should try to unify the two lists at some point, but synchronizing them seems like a prerequisite to that anyway. Differential Revision: https://reviews.llvm.org/D122274	2022-03-30 12:15:39 -07:00
Ben Barham	3fda0edc51	[VFS] RedirectingFileSystem only replace path if not already mapped If the `ExternalFS` has already remapped a path then the `RedirectingFileSystem` should not change it to the originally provided path. This fixes the original path always being used if multiple VFS overlays were provided and the path wasn't found in the highest (ie. first in the chain). This also renames `IsVFSMapped` to `ExposesExternalVFSPath` and only sets it if `UseExternalName` is true. This flag then represents that the `Status` has an external path that's different from its virtual path. Right now the contained path is still the external path, but further PRs will change this to always be the virtual path. Clients that need the external can then request it specifically. Note that even though `ExposesExternalVFSPath` isn't set for all VFS-mapped paths, `IsVFSMapped` was only being used by a hack in `FileManager` that was specific to module searching. In that case `UseExternalNames` is always `true` and so that hack still applies. Resolves rdar://90578880 and llvm-project#53306. Differential Revision: https://reviews.llvm.org/D122549	2022-03-30 11:52:41 -07:00
Fraser Cormack	73244e8f85	[VP] Add vp.icmp comparison intrinsic and docs This patch mostly follows up on D121292 which introduced the vp.fcmp intrinsic. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D122729	2022-03-30 17:05:11 +01:00
Fraser Cormack	da6131f20a	[VP] Add vp.fcmp comparison intrinsic and docs This patch adds the first support for vector-predicated comparison intrinsics, starting with vp.fcmp. It uses metadata to encode its condition code, like the llvm.experimental.constrained.fcmp intrinsic. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D121292	2022-03-30 14:39:18 +01:00
Serge Pavlov	8160dd582b	Revert "Mapping of FP operations to constrained intrinsics" This reverts commit `115b3ace36`. Starting from this commit the buildbot sanitizer-x86_64-linux-bootstrap-msan starts failing (build 10071). Reverted for investigation.	2022-03-30 16:46:43 +07:00
Serge Pavlov	115b3ace36	Mapping of FP operations to constrained intrinsics A new function 'getConstrainedIntrinsic' is added, which for any gived instruction returns id of the corresponding constrained intrinsic. If there is no constrained counterpart for the instruction or the instruction is already a constrained intrinsic, the function returns zero. Differential Revision: https://reviews.llvm.org/D69562	2022-03-30 12:21:30 +07:00
Eli Friedman	a8ebd85e46	[MC] Make MCAsmInfo::isAcceptableChar reflect MCAsmInfo::doesAllowAtInName On targets which don't allow "@" in unquoted identifiers, make sure we don't emit them; otherwise, we can't parse our own output. Differential Revision: https://reviews.llvm.org/D122516	2022-03-29 14:01:32 -07:00
Chris Bieneman	9130e471fe	Add DXContainer DXIL is wrapped in a container format defined by the DirectX 11 specification. Codebases differ in calling this format either DXBC or DXILContainer. Since eventually we want to add support for DXBC as a target architecture and the format is used by DXBC and DXIL, I've termed it DXContainer here. Most of the changes in this patch are just adding cases to switch statements to address warnings. Reviewed By: pete Differential Revision: https://reviews.llvm.org/D122062	2022-03-29 14:34:23 -05:00
Chris Bieneman	b39f437757	[ADT] add initializer list specialization for is_contained Adding an initializer list specialization for is_contained allows for compile-time evaluation when called with a constant or runtime evaluation for non-constant values. This patch doesn't add any uses of this template, but that is coming in a subsequent patch. Reviewed By: pete Differential Revision: https://reviews.llvm.org/D122079	2022-03-29 12:39:39 -05:00
Chris Bieneman	5b6207f3cd	[ADT] Flesh out HLSL raytracing environments Fleshing this out now allows me to rely on enum math to translate values rather than having to translate the off cases. I should have added this in the first pass, but wasn't thinking about it.	2022-03-29 09:43:03 -05:00
Nathan Sidwell	1066e397fa	[demangler] Add StringView conversion operator The OutputBuffer class tries to present a NUL-terminated string API to consumers. But several of them would prefer a StringView. In particular the Microsoft demangler, juggles between NUL-terminated and StringView, which is confusing. This adds a StringView conversion, and adjusts the Demanglers that can benefit from that. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D120990	2022-03-28 11:19:55 -07:00
Johannes Doerfert	6c2be885ff	Revert "[OpenMP][NFC] Add missing virtual destructor to silence warning" This reverts commit `b9fd8f34ae` as it accidentally contained a unit test change that is not finished (and unrelated).	2022-03-25 16:07:11 -05:00
Johannes Doerfert	b9fd8f34ae	[OpenMP][NFC] Add missing virtual destructor to silence warning	2022-03-25 16:00:53 -05:00
Argyrios Kyrtzidis	9aa701984d	[Support] Introduce the BLAKE3 hashing function implementation BLAKE3 is a cryptographic hash function that is secure and very performant. The C implementation originates from https://github.com/BLAKE3-team/BLAKE3/tree/1.3.1/c License is at https://github.com/BLAKE3-team/BLAKE3/blob/1.3.1/LICENSE This patch adds: * `llvm/include/llvm-c/blake3.h`: The BLAKE3 C API * `llvm/include/llvm/Support/BLAKE3.h`: C++ wrapper of the C API * `llvm/lib/Support/BLAKE3`: Directory containing the BLAKE3 C implementation files, including the `LICENSE` file * `llvm/unittests/Support/BLAKE3Test.cpp`: unit tests for the BLAKE3 C++ wrapper This initial patch contains the pristine BLAKE3 sources, a follow-up patch will introduce LLVM-specific prefixes to avoid conflicts if a client also links with its own BLAKE3 version. And here's some timings comparing BLAKE3 with LLVM's SHA1/SHA256/MD5. Timings include `AVX512`, `AVX2`, `neon`, and the generic/portable implementations. The table shows the speed-up multiplier of BLAKE3 for hashing 100 MBs: \| Processor \| SHA1 \| SHA256 \| MD5 \| \|-------------------------\|-------\|--------\|------\| \| Intel Xeon W (AVX512) \| 10.4x \| 27x \| 9.4x \| \| Intel Xeon W (AVX2) \| 6.5x \| 17x \| 5.9x \| \| Intel Xeon W (portable) \| 1.3x \| 3.3x \| 1.1x \| \| M1Pro (neon) \| 2.1x \| 4.7x \| 2.8x \| \| M1Pro (portable) \| 1.1x \| 2.4x \| 1.5x \| Differential Revision: https://reviews.llvm.org/D121510	2022-03-24 10:26:39 -07:00
Shraiysh Vaishay	8722c12c12	[mlir][OpenMP][IRBuilder] Add support for nowait on single construct This patch adds the nowait parameter to `createSingle` in OpenMPIRBuilder and handling for IR generation from OpenMP Dialect. Also added tests for the same. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D122371	2022-03-24 22:51:52 +05:30
Marcus Johnson	d14ccbc2e8	Re-land `c346068928` with fixes It was previously reverted in `a6beb18b84` due to test failures.	2022-03-23 08:13:17 -04:00
Snehasish Kumar	27a4f2545f	Reland "[memprof] Store callsite metadata with memprof records." This reverts commit `f4b794427e`. Reland with underlying msan issue fixed in D122260.	2022-03-22 14:40:02 -07:00
Aaron Ballman	a6beb18b84	Revert "Add UTF32 to/from UTF8 conversion functions" This reverts commit `c346068928`. It broke at least one of the builders: https://lab.llvm.org/buildbot#builders/100/builds/13947	2022-03-22 15:00:40 -04:00
Marcus Johnson	c346068928	Add UTF32 to/from UTF8 conversion functions This is anticipated to be used in new format specifier checking code.	2022-03-22 13:41:43 -04:00
Craig Topper	49c2206b3b	[VP] Preserve address space of pointer for strided load/store intrinsics. This adds LLVMAnyPointerToElt to use instead of LLVMPointerToElt. This allows us to preserve the address space as part of the type overload for the intrinsic, but still require the vector element type to match the pointer type. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D122042	2022-03-22 09:52:54 -07:00
Djordje Todorovic	73777b4c35	[Debugify] Optimize debugify original mode Before we start addressing the issue with having a lot of false positives when using debugify in the original mode, we have made a few patches that should speed up the execution of the testing utility Passes. For example, when testing a large project (let's say LLVM project itself), we can face a lot of potential DI issues. Usually, we use -verify-each-debuginfo-preserve (that is very similar to -debugify-each) -- it collects DI metadata before each Pass, and after the Pass it checks if the Pass preserved the DI metadata. However, we can speed up this process, since we don't need to collect DI metadata before each Pass -- we could use the DI metadata that are collected after the previous Pass from the pipeline as an input for the next Pass. This patch speeds up the utility for ~2x. Differential Revision: https://reviews.llvm.org/D115622	2022-03-22 12:14:00 +01:00
Mitch Phillips	f4b794427e	Revert "[memprof] Store callsite metadata with memprof records." This reverts commit `0d362c90d3`. Reason: Causes the MSan buildbot to fail (see comments on https://reviews.llvm.org/D121179 for more information	2022-03-21 15:59:13 -07:00
Snehasish Kumar	0d362c90d3	[memprof] Store callsite metadata with memprof records. To ease profile annotation, each of the callsites in a function can be annotated with profile data - "IR metadata format for MemProf" [1]. This patch extends the on-disk serialized record format to store the debug information for allocation callsites incl inline frames. This change is incompatible with the existing format i.e. indexed profiles must be regenerated, raw profiles are unaffected. [1] https://groups.google.com/g/llvm-dev/c/aWHsdMxKAfE/m/WtEmRqyhAgAJ Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D121179	2022-03-21 13:58:29 -07:00
Snehasish Kumar	c9a3d29613	[memprof] Update the frame is inline logic and unittests. Since DI frames are enumerated with the leaf function at index 0, this patch fixes the logic when IsInlineFrame is set. Also update the unittests to check that only the last frame is marked as non-inline from a set of DI Frames for a PC address. Differential Revision: https://reviews.llvm.org/D121830	2022-03-21 10:41:05 -07:00
Daniel Thornburgh	7917b3c695	[Debuginfod] Don't depend on Content-Length. The present implementation of debuginfod lookups requires the Content-Length field to be populated in the HTTP server response. Unfortunately, Content-Length is optional, and there are some real scenarios where it's missing. (For example, a Google Cloud Storage server doing on-the-fly gunzipping.) This changes the debuginfod response handler to directly stream the output to the cache file as it is received. In addition to allowing lookups to proceed without a Content-Lenght, it seems somewhat more straightforward to implement, and it allows the disk I/O to be interleaved with the network I/O. Reviewed By: noajshu Differential Revision: https://reviews.llvm.org/D121720	2022-03-21 17:27:45 +00:00
Shraiysh Vaishay	423e3edc27	[OpenMP][IRBuilder] Fix emitAtomicUpdate conditions This patch fixes the condition for emitting atomic update using `atomicrmw` instruction or compare-exchange loop. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D121546	2022-03-21 16:19:43 +05:30
Marek Kurdej	df4da5f37d	[ADT] Add drop_end. This patch adds drop_end that is analogical to drop_begin. It tries to fill the functional gap where one could drop first elements but not the last ones. The need for it came in when refactoring clang-format. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D122009	2022-03-21 09:43:19 +01:00
Chris Bieneman	95871187bf	Add DXIL triple This patch adds triple support for: * dxil architecture * shadermodel OS (with version parsing) * shader stages as environment Reviewed By: MaskRay, pete Differential Revision: https://reviews.llvm.org/D122031	2022-03-19 00:17:43 -05:00
Nikita Popov	f96428e16d	[MemorySSA] Don't optimize uses during construction This changes MemorySSA to be constructed in unoptimized form. MemorySSA::ensureOptimizedUses() can be called to optimize all uses (once). This should be done by passes where having optimized uses is beneficial, either because we're going to query all uses anyway, or because we're doing def-use walks. This should help reduce the compile-time impact of MemorySSA for some use cases (the reason why I started looking into this is D117926), which can avoid optimizing all uses upfront, and instead only optimize those that are actually queried. Actually, we have an existing use-case for this, which is EarlyCSE. Disabling eager use optimization there gives a significant compile-time improvement, because EarlyCSE will generally only query clobbers for a subset of all uses (this change is not included in this patch). Differential Revision: https://reviews.llvm.org/D121381	2022-03-18 09:56:16 +01:00
Shengchen Kan	1a70febf82	[X86] Set Int_MemBarrier as a meta-instruction Compiler only emits a comment for `Int_MemBarrier`, so it should be marked as a meta-instruction, which can help improve accuracy of debug location. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D121879	2022-03-18 13:12:28 +08:00
Shengchen Kan	9e832a67fe	[Codegen][tablgen][NFC] Allow meta instruction to be target dependent An instruction is a meta-instruction if it doesn't produce any output in the form of executable instructions. So in the concept, a meta-instruction does not have to be target independent. Before this patch, `isMetaInstruction` is implemented by checking the opcode of the instruction, add we have no way to add target dependent opcode to the list, which does not make sense. After this patch, a bit `isMeta` is added for class `Instruction` in tablegen, which is used to indicate whether it's a meta instruction. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D121600	2022-03-18 13:09:01 +08:00
Ben Barham	4125524112	[VFS] Add print/dump to the whole FileSystem hierarchy For now most are implemented by printing out the name of the filesystem, but this can be expanded in the future. Only `OverlayFileSystem` and `RedirectingFileSystem` are properly implemented in this patch. - `OverlayFileSystem`: Prints each filesystem in the order that any operations are actually run on them. Optionally prints recursively. - `RedirectingFileSystem`: Prints out all mappings, as well as the `ExternalFS`. Most of this was already implemented other than the handling for the `DirectoryRemap` case and to actually print out the mapping. Each FS should implement `printImpl` rather than `print`, where the latter just fowards to the former. This is to avoid spreading the default arguments through to the subclasses (where we may miss updating in the future). Differential Revision: https://reviews.llvm.org/D121421	2022-03-17 13:02:40 -07:00
Nikita Popov	0958450251	Revert "[FuzzMutate] Don't insert instructions after musttail call" This reverts commit `6a23d27644`. The newly added tests fail on the llvm-clang-x86_64-sie-win buildbot. Not sure why a failure only occurs there, possibly differen PRNG sequence?	2022-03-16 17:29:27 +01:00
Nikita Popov	6a23d27644	[FuzzMutate] Don't insert instructions after musttail call	2022-03-16 16:58:33 +01:00
serge-sans-paille	989f1c72e0	Cleanup codegen includes This is a (fixed) recommit of https://reviews.llvm.org/D121169 after: 1061034926 before: 1063332844 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121681	2022-03-16 08:43:00 +01:00
Fangrui Song	407c721ceb	[Support] Change zlib::compress to return void With a sufficiently large output buffer, the only failure is Z_MEM_ERROR. Check it and call the noreturn report_bad_alloc_error if applicable. resize_for_overwrite may call report_bad_alloc_error as well. Now that there is no other error type, we can replace the return type with void and simplify call sites. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D121512	2022-03-14 11:38:04 -07:00
sstwcw	65a3712af6	[yamlio] Allow parsing an entire mapping as an enumeration For when we want to change a configuration option from an enum into a struct. The need arose when working on D119599. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D120363	2022-03-14 04:41:40 +00:00
Andrew Litteken	0c4bbd293e	[IRSim] Make sure the first instruction of a block doesn't get missed if it is the first valid instruction in Module. If an instruction is first legal instruction in the module, and is the only legal instruction in its basic block, it will be ignored by the outliner due to a length check inherited from the older version of the outliner that was restricted to outlining within a single basic block. This removes that check, and updates any tests that broke because of it. Reviewer: paquette Differential Revision: https://reviews.llvm.org/D120786	2022-03-13 23:13:09 -05:00
serge-sans-paille	ed98c1b376	Cleanup includes: DebugInfo & CodeGen Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121332	2022-03-12 17:26:40 +01:00
Fangrui Song	bd0bddc1ea	[CommandLine] Remove `may only occur zero or one times!` error Early adoption of new technologies or adjusting certain code generation/IR optimization thresholds is often available through some cl::opt options (which have unstable surfaces). Specifying such an option twice will lead to an error. ``` % clang -c a.c -mllvm -disable-binop-extract-shuffle -mllvm -disable-binop-extract-shuffle clang (LLVM option parsing): for the --disable-binop-extract-shuffle option: may only occur zero or one times! % clang -c a.c -mllvm -hwasan-instrument-reads=0 -mllvm -hwasan-instrument-reads=0 clang (LLVM option parsing): for the --hwasan-instrument-reads option: may only occur zero or one times! % clang -c a.c -mllvm --scalar-evolution-max-arith-depth=32 -mllvm --scalar-evolution-max-arith-depth=16 clang (LLVM option parsing): for the --scalar-evolution-max-arith-depth option: may only occur zero or one times! ``` The option is specified twice, because there is sometimes a global setting and a specific file or project may need to override (or duplicately specify) the value. The error is contrary to the common practice of getopt/getopt_long command line utilities that let the last option win and the `getLastArg` behavior used by Clang driver options. I have seen such errors for several times. I think the error just makes users inconvenient, while providing very little value on discouraging production usage of unstable surfaces (this goal is itself controversial, because developers might not want to commit to a stable surface too early, or there is just some subtle codegen toggle which is infeasible to have a driver option). Therefore, I suggest we drop the diagnostic, at least before the diagnostic gets sufficiently better support for the overridding needs. Removing the error is a degraded error checking experience. I think this error checking behavior, if desirable, should be enabled explicitly by tools. Users preferring the behavior can figure out a way to do so. Reviewed By: jhenderson, rnk Differential Revision: https://reviews.llvm.org/D120455	2022-03-11 11:25:04 -08:00
Pavel Labath	161bddf3af	[ADT] Make BitmaskEnum operations constant expressions This avoids runtime initialization (a global constructor) whenever they appear in the initializer. The patch just adds the constexpr keyword to a couple of functions. Differential Revision: https://reviews.llvm.org/D121281	2022-03-11 11:11:55 +01:00
Dmitry Vassiliev	dfeb978155	Fixed a roll-over on size_t in getNewUninitMemBuffer() Reviewed By: serge-sans-paille Differential Revision: https://reviews.llvm.org/D121399	2022-03-11 13:16:58 +04:00
Yevgeny Rouban	c5f34d1692	[CommandLine] Keep option default value unset if no cl::init() is used Current declaration of cl::opt is incoherent between class and non-class specializations of the opt_storage template. There is an inconsistency in the initialization of the Default field: for inClass instances the default constructor is used - it sets the Optional Default field to None; though for non-inClass instances the Default field is set to the type's default value. For non-inClass instances it is impossible to know if the option is defined with cl::init() initializer or not: cl::opt<int> i1("option-i1"); cl::opt<int> i2("option-i2", cl::init(0)); cl::opt<std::string> s1("option-s1"); cl::opt<std::string> s2("option-s2", cl::init("")); assert(s1.Default.hasValue() != s2.Default.hasValue()); // Ok assert(i1.Default.hasValue() != i2.Default.hasValue()); // Fails This patch changes constructor of the non-class specializations to keep the Default field unset (that is None) rather than initialize it with DataType(). Reviewed By: lattner Differential Revision: https://reviews.llvm.org/D114645	2022-03-11 14:24:25 +07:00
Lorenzo Albano	28cfa764c2	[VP] Strided loads/stores This patch introduces two new experimental IR intrinsics and SDAG nodes to represent vector strided loads and stores. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D114884	2022-03-10 18:46:54 +01:00
Nico Weber	a278250b0f	Revert "Cleanup codegen includes" This reverts commit `7f230feeea`. Breaks CodeGenCUDA/link-device-bitcode.cu in check-clang, and many LLVM tests, see comments on https://reviews.llvm.org/D121169	2022-03-10 07:59:22 -05:00
serge-sans-paille	7f230feeea	Cleanup codegen includes after: 1061034926 before: 1063332844 Differential Revision: https://reviews.llvm.org/D121169	2022-03-10 10:00:30 +01:00
serge-sans-paille	3c4410dfca	Cleanup includes: LLVMTarget Most notably, Pass.h is no longer included by TargetMachine.h before: 1063570306 after: 1063332844 Differential Revision: https://reviews.llvm.org/D121168	2022-03-10 10:00:29 +01:00
Yevgeny Rouban	fcd9fa416d	[Support] Try 2: Reset option to its default if its Default field is undefined opt::setDefaultImpl() is changed to set the option value to the option type's default if the Default field is not set. This results in option value reset by Option::reset() or ResetAllOptionOccurrences() even if the cl::init() is not specified. Example: StackOption<std::string> Str("str"); // No cl::init(). Str = "some value"; cl::ResetAllOptionOccurrences(); EXPECT_EQ("", Str); // The Str is reset. Reviewed By: lattner Differential Revision: https://reviews.llvm.org/D115433	2022-03-10 08:26:34 +07:00
Shraiysh Vaishay	7c385c4b2f	[mlir][OpenMP] Generating enums in accordance with the guidelines This patch changes the enums generated from `OMP.td` for MLIR according to the enum naming guidelines in LLVM Coding Standards. This also helps the issues we had with `static` being a C++ keyword and also a value for the schedule clause. Enumerator naming guidelines: https://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D120825	2022-03-09 20:10:45 +05:30
Alexey Lapshin	dedf006fa9	Fix for buildbot failure: https://lab.llvm.org/buildbot#builders/110/builds/11051 after https://reviews.llvm.org/D121005	2022-03-09 14:26:46 +03:00
Alexey Lapshin	2dc4a80ec0	[objcopy][NFC] Move NameOrPattern::create() into CommonConfig.h While moving objcopy into separate library(D88827), NameOrPattern::create() was mistakenly placed into ObjcopyOptions.cpp. This patch moves the NameOrPattern::create() into CommonConfig.h. Additionally it adds test for using NameOrPattern. Differential Revision: https://reviews.llvm.org/D121005	2022-03-09 13:03:28 +03:00
Maksim Panchenko	cf9b3ef941	Revert "[X86] Fix MCSymbolizer interface for X86Disassembler" This reverts commit `0c2b43ab8c`.	2022-03-07 10:40:48 -08:00
Maksim Panchenko	0c2b43ab8c	[X86] Fix MCSymbolizer interface for X86Disassembler Fix a number of issues with MCSymbolizer::tryAddingSymbolicOperand() in X86Disassembler: * Pass instruction size instead of immediate size. * Correctly adjust the value of PC-relative operands. * Set operand offset to zero when the operand is specified implicitly. Reviewed By: Amir, skan Differential Revision: https://reviews.llvm.org/D121065	2022-03-07 10:27:28 -08:00
Simon Moll	5f62156762	[VP] Introducing VectorBuilder, the VP intrinsic builder VectorBuilder wraps around an IRBuilder and VectorBuilder::createVectorInstructions emits VP intrinsics as if they were regular instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D105283	2022-03-07 10:02:07 +01:00
Stella Laurenzo	38151a08c2	Reapply "[cmake] Prefix gtest and gtest_main with "llvm_"." This reverts commit `7cdda6b8ce`. Differential Revision: https://reviews.llvm.org/D121020	2022-03-04 13:45:43 -08:00
Snehasish Kumar	11314f4059	[memprof] Filter out callstack frames which cannot be symbolized. This patch filters out callstack frames which can't be symbolized or if the frames belong to the runtime. Symbolization may not be possible if debug information is unavailable or if the addresses are from a shared library. For now we only support optimization of the main binary which is statically linked to the compiler runtime. Differential Revision: https://reviews.llvm.org/D120860	2022-03-04 11:10:08 -08:00
River Riddle	e865fa7530	[TableGen] Add a library-based entry point for parsing td files This commit adds a new `TableGenParseFile` entry point for tablegen that parses an input buffer and invokes a callback function with a record keeper (notably without an output buffer). This kind of entry point is very useful for tablegen consuming tools that don't create output, and want invoke tablegen multiple times. The current way that we interact with tablegen is via relative includes to TGParser(not great). Differential Revision: https://reviews.llvm.org/D119899	2022-03-03 16:14:03 -08:00
Snehasish Kumar	dda7b74967	[memprof] Symbolize and cache stack frames. Currently, symbolization of stack frames occurs on demand when the instrprof writer iterates over all the records in the raw memprof reader. With this change we symbolize and cache the frames immediately after reading the raw profiles. For a large internal binary this results in a runtime reduction of ~50% (2m -> 48s) when merging a memprof raw profile with a raw instr profile to generate an indexed profile. This change also makes it simpler in the future to generate additional calling context metadata to attach to each memprof record. Differential Revision: https://reviews.llvm.org/D120430	2022-03-03 11:00:37 -08:00
Simon Moll	8de8731591	Revert "[VP] Introducing VectorBuilder, the VP intrinsic builder" This reverts commit `8bcbfb50e8`. Taking this patch offline to fix breakage: https://lab.llvm.org/buildbot/#/builders/110/builds/10912	2022-03-03 13:34:37 +01:00
Simon Moll	8bcbfb50e8	[VP] Introducing VectorBuilder, the VP intrinsic builder VectorBuilder wraps around an IRBuilder and VectorBuilder::createVectorInstructions emits VP intrinsics as if they were regular instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D105283	2022-03-03 11:31:57 +01:00
Stella Laurenzo	7cdda6b8ce	Revert "[cmake] Prefix gtest and gtest_main with "llvm_"." lldb buildbot failure. will investigate and roll forward. This reverts commit `9f37775472`.	2022-03-02 11:13:46 -08:00
Stella Laurenzo	9f37775472	[cmake] Prefix gtest and gtest_main with "llvm_". The upstream project ships CMake rules for building vanilla gtest/gmock which conflict with the names chosen by LLVM. Since LLVM's build rules here are quite specific to LLVM, prefixing them to avoid collision is the right thing (i.e. there does not appear to be a path to letting someone replace LLVM's googletest with one they bring, so co-existence should be the goal). This allows LLVM to be included with testing enabled within projects that themselves have a dependency on an official gtest release. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D120789	2022-03-02 10:53:32 -08:00
Simon Moll	d05ddb86f6	[VP] vp.sitofp cast intrinsic and docs Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D119922	2022-03-02 10:16:19 +01:00
Duncan P. N. Exon Smith	15ab7bc3af	Testing: Make TempFile safe to move; test Temp{Dir,File,Link} Default the moves and delete the copies for TempFile, matching TempDir and TempLink, and add tests for all of them to confirm that the destructor is not harmful after it has been moved from. Differential Revision: https://reviews.llvm.org/D120691	2022-03-01 13:45:51 -08:00
serge-sans-paille	a494ae43be	Cleanup includes: TransformsUtils Estimation on the impact on preprocessor output: before: 1065307662 after: 1064800684 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120741	2022-03-01 21:00:07 +01:00
serge-sans-paille	71c3a5519d	Cleanup includes: LLVMAnalysis Number of lines output by preprocessor: before: 1065940348 after: 1065307662 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120659	2022-03-01 18:01:54 +01:00
Jeremy Morse	ab49dce01f	[DebugInfo][InstrRef][NFC] Use unique_ptr instead of raw pointers InstrRefBasedLDV allocates some big tables of ValueIDNum, to store live-in and live-out block values in, that then get passed around as pointers everywhere. This patch wraps the allocation in a std::unique_ptr, names some types based on unique_ptr, and passes references to those around instead. There's no functional change, but it makes it clearer to the reader that references to these tables are borrowed rather than owned, and we get some extra validity assertions too. Differential Revision: https://reviews.llvm.org/D118774	2022-03-01 12:49:50 +00:00
Nathan Sidwell	024495e626	[demangler] Improve buffer hysteresis Improve demangler buffer hysteresis. If we needed more than double the buffer, the original code would allocate exactly the amount needed, and thus consequently the next request would also realloc. We're very unlikely to get into wanting more than double, after the first allocation, as it would require the user to have used an identifier larger than the hysteresis. With machine generated code that's possible, but unlikely. Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D119972	2022-03-01 04:37:24 -08:00
Alexey Lapshin	a6f3fedc3f	[objcopy] Refactor CommonConfig to add posibility to specify added/updated sections as MemoryBuffer. Current objcopy implementation has a possibility to add or update sections. The incoming section is specified as a pair: section name and name of the file containing section data. The interface does not allow to specify incoming section as a memory buffer. This patch adds possibility to specify incoming section as a memory buffer. Differential Revision: https://reviews.llvm.org/D120486	2022-03-01 14:49:41 +03:00
Michael Kruse	96ae86bd29	[test] Silence compiler warning. signed/unsigned comparison in gtest macro. Also fix buidlbots: https://lab.llvm.org/buildbot#builders/36/builds/18337 https://lab.llvm.org/buildbot#builders/57/builds/15594	2022-02-28 18:46:55 -06:00
Michael Kruse	a66f7769a3	[OpenMPIRBuilder] Implement static-chunked workshare-loop schedules. Add applyStaticChunkedWorkshareLoop method implementing static schedule when chunk-size is specified. Unlike a static schedule without chunk-size (where chunk-size is chosen by the runtime such that each thread receives one chunk), we need two nested loops: one for looping over the iterations of a chunk, and a second for looping over all chunks assigned to the threads. This patch includes the following related changes: * Adapt applyWorkshareLoop to triage between the schedule types, now possible since all schedules have been implemented. The default schedule is assumed to be non-chunked static, as without OpenMPIRBuilder. * Remove the chunk parameter from applyStaticWorkshareLoop, it is ignored by the runtime. Change the value for the value passed to the init function to 0, as without OpenMPIRBuilder. * Refactor CanonicalLoopInfo::setTripCount and CanonicalLoopInfo::mapIndVar as used by both, applyStaticWorkshareLoop and applyStaticChunkedWorkshareLoop. * Enable Clang to use the OpenMPIRBuilder in the presence of the schedule clause. Differential Revision: https://reviews.llvm.org/D114413	2022-02-28 18:18:33 -06:00
Zi Xuan Wu	21bce9007a	[Support] Add CSKY target parser and attributes parser Construct LLVM Support module about CSKY target parser and attribute parser. It refers CSKY ABIv2 and implementation of GNU binutils and GCC. https://github.com/c-sky/csky-doc/blob/master/C-SKY_V2_CPU_Applications_Binary_Interface_Standards_Manual.pdf Now we only support CSKY 800 series cpus and newer cpus in the future undering CSKYv2 ABI specification. There are 11 archs including ck801, ck802, ck803, ck803s, ck804, ck805, ck807, ck810, ck810v, ck860, ck860v. Every arch has base extensions, the cpus of that arch family have more extended extensions than base extensions. We need specify extended extensions for every cpu. Every extension has its enum value, name and related llvm feature string with +/-. Every enum value represents a bit of uint64_t integer. Differential Revision: https://reviews.llvm.org/D119917	2022-02-28 11:35:07 +08:00
Yevgeny Rouban	beb92af01b	Revert "[Support] Reset option to its default if its Default field is undefined" This reverts commit `7fb39fb6d6` as clang buildbots failed.	2022-02-27 21:46:10 +07:00
Yevgeny Rouban	7fb39fb6d6	[Support] Reset option to its default if its Default field is undefined opt::setDefaultImpl() is changed to set the option value to the option type's default if the Default field is not set. This results in option value reset by Option::reset() or ResetAllOptionOccurrences() even if the cl::init() is not specified. Example: StackOption<std::string> Str("str"); // No cl::init(). Str = "some value"; cl::ResetAllOptionOccurrences(); EXPECT_EQ("", Str); // The Str is reset. Reviewed By: lattner Differential Revision: https://reviews.llvm.org/D115433	2022-02-27 20:57:28 +07:00
serge-sans-paille	db29f4374d	Cleanup include: DebugInfo/Symbolize Estimation of the impact on preprocessor output after: 1067349756 before:1067487786 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120433	2022-02-24 13:25:11 +01:00
Matthias Braun	6a383369f9	PGOInstrumentation, GCOVProfiling: Split indirectbr critical edges regardless of PHIs The `SplitIndirectBrCriticalEdges` function was originally designed for `CodeGenPrepare` and skipped splitting of edges when the destination block didn't contain any `PHI` instructions. This only makes sense when reducing COPYs like `CodeGenPrepare`. In the case of `PGOInstrumentation` or `GCOVProfiling` it would result in missed counters and wrong result in functions with computed goto. Differential Revision: https://reviews.llvm.org/D120096	2022-02-23 16:27:37 -08:00
Matthias Braun	d7a307320e	Simplify/cleanup BasicBlockUtilsTest Cleanup BasicBolckUtilsTest using C++ raw string literals, remove duplicated block functions and smaller style changes. Differential Revision: https://reviews.llvm.org/D120095	2022-02-23 16:27:37 -08:00
serge-sans-paille	eb4c860811	Cleanup llvm/DebugInfo/PDB headers accumulated preprocessed size: before: 1065515095 after: 1065629059 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120195	2022-02-23 10:31:34 +01:00
Bill Wendling	823b32fbfb	[NFC] Add #include for constants	2022-02-23 01:26:53 -08:00
Bill Wendling	c1b9672534	[NFC] Add #include for constants	2022-02-23 01:23:45 -08:00
Egor Zhdan	3a1cb36237	Add DriverKit support This patch is the first in a series of patches to upstream the support for Apple's DriverKit. Once complete, it will allow targeting DriverKit platform with Clang similarly to AppleClang. This code was originally authored by JF Bastien. Differential Revision: https://reviews.llvm.org/D118046	2022-02-22 13:42:53 +00:00
Whitney Tsang	e7afbea8ca	[MemorySSA] Clear VisitedBlocks per query The problem can be shown from the newly added test case. There are two invocations to MemorySSAUpdater::moveToPlace, and the internal data structure VisitedBlocks is changed in the first invocation, and reused in the second invocation. In between the two invocations, there is a change to the CFG, and MemorySSAUpdater is notified about the change. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D119898	2022-02-18 15:36:19 -05:00
Snehasish Kumar	0a4184909a	Reland "[memprof] Extend the index prof format to include memory profiles." This patch adds support for optional memory profile information to be included with and indexed profile. The indexed profile header adds a new field which points to the offset of the memory profile section (if present) in the indexed profile. For users who do not utilize this feature the only overhead is a 64-bit offset in the header. The memory profile section contains (1) profile metadata describing the information recorded for each entry (2) an on-disk hashtable containing the profile records indexed via llvm::md5(function_name). We chose to introduce a separate hash table instead of the existing one since the indexing for the instrumented fdo hash table is based on a CFG hash which itself is perturbed by memprof instrumentation. This commit also includes the changes reviewed separately in D120093. Differential Revision: https://reviews.llvm.org/D120103	2022-02-17 22:09:52 -08:00
Snehasish Kumar	19bdf44d85	Revert "Reland "[memprof] Extend the index prof format to include memory profiles."" This reverts commit `807ba7aace`.	2022-02-17 15:51:04 -08:00
Snehasish Kumar	807ba7aace	Reland "[memprof] Extend the index prof format to include memory profiles." This reverts commit `85355a560a`. This patch adds support for optional memory profile information to be included with and indexed profile. The indexed profile header adds a new field which points to the offset of the memory profile section (if present) in the indexed profile. For users who do not utilize this feature the only overhead is a 64-bit offset in the header. The memory profile section contains (1) profile metadata describing the information recorded for each entry (2) an on-disk hashtable containing the profile records indexed via llvm::md5(function_name). We chose to introduce a separate hash table instead of the existing one since the indexing for the instrumented fdo hash table is based on a CFG hash which itself is perturbed by memprof instrumentation. Differential Revision: https://reviews.llvm.org/D118653	2022-02-17 13:14:17 -08:00
Alexey Lapshin	0b57e6c46b	[objcopy] followup patch after `f75da0c8e6`	2022-02-17 19:32:10 +03:00
Alexey Lapshin	f75da0c8e6	[llvm-objcopy][NFC] Move core implementation of llvm-objcopy into separate library. This patch moves core implementation of llvm-objcopy into Object library (http://lists.llvm.org/pipermail/llvm-dev/2020-September/145075.html). The functionality for parsing input options is left inside tools/llvm-objcopy. The interface of ObjCopy library: ObjCopy/ELF/ELFObjcopy.h ``` Error executeObjcopyOnIHex(const CopyConfig &Config, MemoryBuffer &In, Buffer &Out); Error executeObjcopyOnRawBinary(const CopyConfig &Config, MemoryBuffer &In, Buffer &Out); Error executeObjcopyOnBinary(const CopyConfig &Config, object::ELFObjectFileBase &In, Buffer &Out); ``` ObjCopy/COFF/COFFObjcopy.h ``` Error executeObjcopyOnBinary(const CopyConfig &Config, object::COFFObjectFile &In, Buffer &Out); ``` ObjCopy/MachO/MachOObjcopy.h ``` Error executeObjcopyOnBinary(const CopyConfig &Config, object::MachOObjectFile &In, Buffer &Out); ``` ObjCopy/wasm/WasmObjcopy.h ``` Error executeObjcopyOnBinary(const CopyConfig &Config, object::WasmObjectFile &In, Buffer &Out); ``` Differential Revision: https://reviews.llvm.org/D88827	2022-02-17 13:11:42 +03:00
Chuanqi Xu	a2609be0b2	[ValueTracking] Checking haveNoCommonBitsSet for (x & y) and ~(x \| y) This one tries to fix: https://github.com/llvm/llvm-project/issues/53357. Simply, this one would check (x & y) and ~(x \| y) in haveNoCommonBitsSet. Since they shouldn't have common bits (we could traverse the case by enumerating), and we could convert this one to (x & y) \| ~(x \| y) . Then the compiler could handle it in InstCombineAndOrXor. Further more, since ((x & y) + (~x & ~y)) would be converted to ((x & y) + ~(x \| y)), this patch would fix it too. https://alive2.llvm.org/ce/z/qsKzRS Reviewed By: spatel, xbolva00, RKSimon, lebedev.ri Differential Revision: https://reviews.llvm.org/D118094	2022-02-16 13:42:52 +08:00
Shao-Ce SUN	2aed07e96c	[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter` Reviewed By: skan Differential Revision: https://reviews.llvm.org/D119846	2022-02-16 13:10:09 +08:00
Shao-Ce SUN	9cc49c1951	Revert "[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter`" This reverts commit `fe25c06cc5`.	2022-02-16 11:57:49 +08:00
Shao-Ce SUN	fe25c06cc5	[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter` For ten years, it seems that `MCRegisterInfo` is not used by any target. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D119846	2022-02-16 11:47:17 +08:00
Simon Moll	03e83cc8eb	[VP] vp.fptosi cast intrinsic and docs Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D119535	2022-02-15 18:17:19 +01:00
serge-sans-paille	290e482342	Cleanup LLVMDWARFDebugInfo As usual with that header cleanup series, some implicit dependencies now need to be explicit: llvm/DebugInfo/DWARF/DWARFContext.h no longer includes: - "llvm/DebugInfo/DWARF/DWARFAcceleratorTable.h" - "llvm/DebugInfo/DWARF/DWARFCompileUnit.h" - "llvm/DebugInfo/DWARF/DWARFDebugAbbrev.h" - "llvm/DebugInfo/DWARF/DWARFDebugAranges.h" - "llvm/DebugInfo/DWARF/DWARFDebugFrame.h" - "llvm/DebugInfo/DWARF/DWARFDebugLoc.h" - "llvm/DebugInfo/DWARF/DWARFDebugMacro.h" - "llvm/DebugInfo/DWARF/DWARFGdbIndex.h" - "llvm/DebugInfo/DWARF/DWARFSection.h" - "llvm/DebugInfo/DWARF/DWARFTypeUnit.h" - "llvm/DebugInfo/DWARF/DWARFUnitIndex.h" Plus llvm/Support/Errc.h not included by a bunch of llvm/DebugInfo/DWARF/DWARF*.h files Preprocessed lines to build llvm on my setup: after: 1065629059 before: 1066621848 Which is a great diff! Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D119723	2022-02-15 09:16:03 +01:00
Snehasish Kumar	50713461d4	Reland "[memprof] Introduce a wrapper around MemInfoBlock." This reverts commit `e6999040f5`. Update test to fix signed int comparison warning, fix whitespace in compiler-rt MIBEntryDef.inc file. Differential Revision: https://reviews.llvm.org/D117256	2022-02-14 19:04:36 -08:00
Snehasish Kumar	f89319b841	Reland "[memprof] Refactor out the MemInfoBlock into a macro based def." This reverts commit `857ec0d01f`. Fixes -DLLVM_ENABLE_MODULES=On build by adding the new textual header to the modulemap file. Reviewed in https://reviews.llvm.org/D117722	2022-02-14 16:05:05 -08:00
Snehasish Kumar	857ec0d01f	Revert "[memprof] Refactor out the MemInfoBlock into a macro based def." This reverts commit `9def83c6d0`. [4/4]	2022-02-14 11:42:58 -08:00
Snehasish Kumar	e6999040f5	Revert "[memprof] Introduce a wrapper around MemInfoBlock." This reverts commit `9b67165285`. [3/4]	2022-02-14 11:42:58 -08:00
Snehasish Kumar	85355a560a	Revert "Reland "[memprof] Extend the index prof format to include memory profiles."" This reverts commit `de54e4ab78` [1/4]	2022-02-14 11:42:58 -08:00
Snehasish Kumar	de54e4ab78	Reland "[memprof] Extend the index prof format to include memory profiles." This reverts commit `0f73fb18ca`. Use llvm/Profile/MIBEntryDef.inc instead of relative path. Generated the raw profile data with `-mllvm -enable-name-compression=false` so that builbots where the reader is built without zlib do not fail. Also updated the test build instructions.	2022-02-14 10:52:13 -08:00
Snehasish Kumar	0f73fb18ca	Revert "[memprof] Extend the index prof format to include memory profiles." This reverts commit `43c2348c5b`. Buildbots are failing with an error on reading memprof testdata. "Inputs/basic.profraw: profile uses zlib compression but the profile reader was built without zlib support" https://lab.llvm.org/buildbot/#/builders/16/builds/24490	2022-02-14 10:25:01 -08:00
Snehasish Kumar	43c2348c5b	[memprof] Extend the index prof format to include memory profiles. This patch adds support for optional memory profile information to be included with and indexed profile. The indexed profile header adds a new field which points to the offset of the memory profile section (if present) in the indexed profile. For users who do not utilize this feature the only overhead is a 64-bit offset in the header. The memory profile section contains (1) profile metadata describing the information recorded for each entry (2) an on-disk hashtable containing the profile records indexed via llvm::md5(function_name). We chose to introduce a separate hash table instead of the existing one since the indexing for the instrumented fdo hash table is based on a CFG hash which itself is perturbed by memprof instrumentation. Differential Revision: https://reviews.llvm.org/D118653	2022-02-14 09:53:45 -08:00
Snehasish Kumar	9b67165285	[memprof] Introduce a wrapper around MemInfoBlock. Use the macro based format to add a wrapper around the MemInfoBlock when stored in the MemProfRecord. This wrapped block can then be serialized/deserialized based on a schema specified by a list of enums. Differential Revision: https://reviews.llvm.org/D117256	2022-02-14 09:53:45 -08:00
Snehasish Kumar	9def83c6d0	[memprof] Refactor out the MemInfoBlock into a macro based def. This patch refactors out the MemInfoBlock definition into a macro based header which can be included to generate enums, structus and code for each field recorded by the memprof profiling runtime. Differential Revision: https://reviews.llvm.org/D117722	2022-02-14 09:53:45 -08:00
Momchil Velikov	6398903ac8	Extend the `uwtable` attribute with unwind table kind We have the `clang -cc1` command-line option `-funwind-tables=1\|2` and the codegen option `VALUE_CODEGENOPT(UnwindTables, 2, 0) ///< Unwind tables (1) or asynchronous unwind tables (2)`. However, this is encoded in LLVM IR by the presence or the absence of the `uwtable` attribute, i.e. we lose the information whether to generate want just some unwind tables or asynchronous unwind tables. Asynchronous unwind tables take more space in the runtime image, I'd estimate something like 80-90% more, as the difference is adding roughly the same number of CFI directives as for prologues, only a bit simpler (e.g. `.cfi_offset reg, off` vs. `.cfi_restore reg`). Or even more, if you consider tail duplication of epilogue blocks. Asynchronous unwind tables could also restrict code generation to having only a finite number of frame pointer adjustments (an example of not having a finite number of `SP` adjustments is on AArch64 when untagging the stack (MTE) in some cases the compiler can modify `SP` in a loop). Having the CFI precise up to an instruction generally also means one cannot bundle together CFI instructions once the prologue is done, they need to be interspersed with ordinary instructions, which means extra `DW_CFA_advance_loc` commands, further increasing the unwind tables size. That is to say, async unwind tables impose a non-negligible overhead, yet for the most common use cases (like C++ exceptions), they are not even needed. This patch extends the `uwtable` attribute with an optional value: - `uwtable` (default to `async`) - `uwtable(sync)`, synchronous unwind tables - `uwtable(async)`, asynchronous (instruction precise) unwind tables Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114543	2022-02-14 14:35:02 +00:00
Chuanqi Xu	e59d6dc063	[NFC] Precommit for PR53357 Due to there are other required changes in https://reviews.llvm.org/D118094, precommit these changes to ease reviewing. Including: - Remove *_thwart tests. - Remove test for (x & y) + (~x & ~y) - Fix incorrect uniitest committeed before	2022-02-14 14:37:12 +08:00
Chuanqi Xu	4ee240b860	[NFC] [ValueTracking] Add unittest for haveNoCommonBitsSet	2022-02-14 14:10:30 +08:00
YASHASVI KHATAVKAR	70fdbf35de	Adding DiBuilder interface for assumed length strings	2022-02-11 14:40:02 -05:00
Paul Robinson	ce5588fdf4	[RGT] Refactor environment-specific checks to use GTEST_SKIP() This allows using GTEST_SKIP() to identify un-executed tests. Found by the Rotten Green Tests project.	2022-02-11 10:47:07 -08:00
Paul Robinson	a0ac6a9212	[RGT] Refactor Windows-specific checks into their own test This allows using GTEST_SKIP() to identify un-executed tests. Found by the Rotten Green Tests project.	2022-02-11 10:47:04 -08:00
Paul Robinson	d2495b69f2	[RGT] Exercise both paths through a test BitcastToGEP had an opaque/typed pointer decision point, make sure it exercises both sides. Found by the Rotten Green Tests project.	2022-02-11 10:47:00 -08:00
serge-sans-paille	06943537d9	Cleanup MCParser headers As usual with that header cleanup series, some implicit dependencies now need to be explicit: llvm/MC/MCParser/MCAsmParser.h no longer includes llvm/MC/MCParser/MCAsmLexer.h Preprocessed lines to build llvm on my setup: after: 1068185081 before: 1068324320 So no compile time benefit to expect, but we still get the looser coupling between files which is great. Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D119359	2022-02-11 10:39:29 +01:00
YASHASVI KHATAVKAR	93d1a623ce	Reverting an entire stack of changes causing build failures	2022-02-10 17:58:22 -05:00
YASHASVI KHATAVKAR	0e7341b7b1	worked on review comments	2022-02-10 15:24:51 -05:00
YASHASVI KHATAVKAR	c26a0d1cda	Updated the test to include proper string get functions	2022-02-10 15:24:50 -05:00
YASHASVI KHATAVKAR	929499eb64	Updated the test to include addtional details	2022-02-10 15:24:50 -05:00
YASHASVI KHATAVKAR	99f990be64	Added StringLocationExp to the new apis	2022-02-10 15:24:50 -05:00
YASHASVI KHATAVKAR	43d421cda3	Adding DIBuilder interface for assumed length string	2022-02-10 15:24:50 -05:00
serge-sans-paille	e72c195fdc	Cleanup LLVMObject headers Most notably, llvm/Object/Binary.h no longer includes llvm/Support/MemoryBuffer.h llvm/Object/MachOUniversal*.h no longer include llvm/Object/Archive.h llvm/Object/TapiUniversal.h no longer includes llvm/Object/TapiFile.h llvm-project preprocessed size: before: 1068185081 after: 1068324320 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D119457	2022-02-10 21:13:44 +01:00
Lu Weining	e53e6ec6ef	[LoongArch 2/6] Add ELF machine flag and relocs for upcoming LoongArch target This patch adds necessary definitions for LoongArch ELF files, including relocation types. Also adds initial support to ELFYaml, llvm-objdump, and llvm-readobj in order to work with LoongArch ELFs. Differential revision: https://reviews.llvm.org/D115859	2022-02-10 10:23:34 +00:00
Lu Weining	42fd2bfc90	[LoongArch 1/6] Add triples loongarch{32,64} for the upcoming LoongArch target This is the first patch to incrementally add an MC layer for LoongArch to LLVM. This patch also adds unit testcases for these new triples. RFC for adding this new backend: https://lists.llvm.org/pipermail/llvm-dev/2021-December/154371.html Differential revision: https://reviews.llvm.org/D115857	2022-02-10 10:23:34 +00:00
David Sherwood	1badfbb4fc	Fix incorrect TypeSize->uint64_t cast in InductionDescriptor::isInductionPHI The code was relying upon the implicit conversion of TypeSize to uint64_t and assuming the type in question was always fixed. However, I discovered an issue when running the canon-freeze pass with some IR loops that contains scalable vector types. I've changed the code to bail out if the size is unknown at compile time, since we cannot compute whether the step is a multiple of the type size or not. I added a test here: Transforms/CanonicalizeFreezeInLoops/phis.ll Differential Revision: https://reviews.llvm.org/D118696	2022-02-10 09:39:12 +00:00
Shraiysh Vaishay	35dfa78ff8	[OpenMP][IRBuilder] Handle floats for atomic update and fix AllocaIP for update/capture This patch fixes `createAtomicUpdate` for lowering with float types. Test added for the same. This patch also changes the alloca argument for createAtomicUpdate and createAtomicCapture from `Instruction*` to `InsertPointTy`. This is in line with the other functions of the OpenMPIRBuilder class which take AllocaIP as an `InsertPointTy`. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D118227	2022-02-10 13:16:10 +05:30
Philip Reames	d334fec140	[SCEV] Make SCEVUnionPredicate externally immutable [NFC] This is the last major stepping stone before being able to allocate the node via the folding set allocator. That will in turn allow more general SCEV predicate expression trees.	2022-02-09 13:47:28 -08:00
Craig Topper	60745fb16f	[VP] llvm.vp.fneg intrinsic and LangRef Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D119262	2022-02-09 07:54:36 -08:00
serge-sans-paille	ef736a1c39	Cleanup LLVMMC headers There's a few relevant forward declarations in there that may require downstream adding explicit includes: llvm/MC/MCContext.h no longer includes llvm/BinaryFormat/ELF.h, llvm/MC/MCSubtargetInfo.h, llvm/MC/MCTargetOptions.h llvm/MC/MCObjectStreamer.h no longer include llvm/MC/MCAssembler.h llvm/MC/MCAssembler.h no longer includes llvm/MC/MCFixup.h, llvm/MC/MCFragment.h Counting preprocessed lines required to rebuild llvm-project on my setup: before: 1052436830 after: 1049293745 Which is significant and backs up the change in addition to the usual benefits of decreasing coupling between headers and compilation units. Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D119244	2022-02-09 11:09:17 +01:00
Roman Lebedev	3564551400	ProfileDataTests: fix BUILD_SHARED_LIBS build	2022-02-09 12:27:13 +03:00
RVP	62e4a77746	[Support] Fix for two issues with clearing of the internal storage for cl::bits This patch fixes two issues with clearing of the internal storage for cl::bits 1. The internal bits storage for cl::bits is uninitialized. This is a problem if a cl::bits option is not defined with static lifetime. 2. ResetAllOptionOccurrences does not reset cl::bits options. The latter is also discussed in: https://lists.llvm.org/pipermail/llvm-dev/2021-February/148299.html Differential Revision: https://reviews.llvm.org/D119066	2022-02-09 09:46:46 +01:00
Snehasish Kumar	216575e581	Revert "Revert "[ProfileData] Read and symbolize raw memprof profiles."" This reverts commit `dbf47d227d`. Reapply https://reviews.llvm.org/D116784 now that https://reviews.llvm.org/D118413 has landed with a couple of fixes: * fix raw profile reader unaligned access identified by ubsan * fix windows build by using MOCK_CONST_METHOD3 instead of MOCK_METHOD.	2022-02-08 13:37:27 -08:00
Daniel Thornburgh	4a6553f4c2	[Debuginfod] [Symbolizer] Break debuginfod out of libLLVM. Debuginfod can pull in libcurl as a dependency, which isn't appropriate for libLLVM. (See https://gitlab.freedesktop.org/mesa/mesa/-/issues/5732). This change breaks out debuginfod into a separate non-component library that can be used directly in llvm-symbolizer. The tool can inject debuginfod into the Symbolizer library via an abstract DebugInfoFetcher interface, breaking the dependency of Symbolizer on debuinfod. See https://github.com/llvm/llvm-project/issues/52731 Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D118413	2022-02-08 19:14:18 +00:00
Andy Yankovsky	3df88ec335	[Support] Don't print stacktrace if DbgHelp.dll hasn't been loaded yet On Windows certain function from `Signals.h` require that `DbgHelp.dll` is loaded. This typically happens when the main program calls `llvm::InitLLVM`, however in some cases main program doesn't do that (e.g. when the application is using LLDB via `liblldb.dll`). This patch adds a safe guard to prevent crashes. More discussion in https://reviews.llvm.org/D119009. Reviewed By: aganea Differential Revision: https://reviews.llvm.org/D119181	2022-02-08 16:37:36 +00:00
Simon Pilgrim	d1d7188b04	Fix signed/unsigned comparison warnings on ppc buildbots	2022-02-08 14:15:28 +00:00
Simon Pilgrim	58f944cf0b	Fix signed/unsigned comparison warnings on ppc buildbots	2022-02-08 13:56:41 +00:00
Simon Pilgrim	fd2bb51f1e	[ADT] Add APInt/MathExtras isShiftedMask variant returning mask offset/length In many cases, calls to isShiftedMask are immediately followed with checks to determine the size and position of the bitmask. This patch adds variants of APInt::isShiftedMask, isShiftedMask_32 and isShiftedMask_64 that return these values as additional arguments. I've updated a number of cases that were either performing seperate size/position calculations or had created their own local wrapper versions of these. Differential Revision: https://reviews.llvm.org/D119019	2022-02-08 12:04:13 +00:00
Nikita Popov	179540299a	[OpenMPIRBuilderTest] Remove getPointerElementType() uses (NFC) Use the type of the stored value instead.	2022-02-08 11:38:17 +01:00
Craig Topper	cef177d186	[VP] llvm.vp.fma intrinsic and LangRef Differential Revision: https://reviews.llvm.org/D119185	2022-02-07 15:53:27 -08:00
Sebastian Neubauer	4a02562275	[AMDGPU] Lazily init pal metadata on first function Delay reading global metadata until the first function or the end of the file is emitted. That way, earlier module passes can set metadata that is emitted in the ELF. `emitStartOfAsmFile` gets called when the passes are initialized, which prevented earlier passes from changing the metadata. This fixes issues encountered after converting AMDGPUResourceUsageAnalysis to a Module pass in D117504. Differential Revision: https://reviews.llvm.org/D118492	2022-02-04 18:39:35 +01:00
Simon Pilgrim	0b989a03b9	[Support] Add isShiftedMask_32/isShiftedMask_64 unit test coverage	2022-02-04 17:07:02 +00:00
serge-sans-paille	ffe8720aa0	Reduce dependencies on llvm/BinaryFormat/Dwarf.h This header is very large (3M Lines once expended) and was included in location where dwarf-specific information were not needed. More specifically, this commit suppresses the dependencies on llvm/BinaryFormat/Dwarf.h in two headers: llvm/IR/IRBuilder.h and llvm/IR/DebugInfoMetadata.h. As these headers (esp. the former) are widely used, this has a decent impact on number of preprocessed lines generated during compilation of LLVM, as showcased below. This is achieved by moving some definitions back to the .cpp file, no performance impact implied[0]. As a consequence of that patch, downstream user may need to manually some extra files: llvm/IR/IRBuilder.h no longer includes llvm/BinaryFormat/Dwarf.h llvm/IR/DebugInfoMetadata.h no longer includes llvm/BinaryFormat/Dwarf.h In some situations, codes maybe relying on the fact that llvm/BinaryFormat/Dwarf.h was including llvm/ADT/Triple.h, this hidden dependency now needs to be explicit. $ clang++ -E -Iinclude -I../llvm/include ../llvm/lib/Transforms/Scalar/*.cpp -std=c++14 -fno-rtti -fno-exceptions \| wc -l after: 10978519 before: 11245451 Related Discourse thread: https://llvm.discourse.group/t/include-what-you-use-include-cleanup [0] https://llvm-compile-time-tracker.com/compare.php?from=fa7145dfbf94cb93b1c3e610582c495cb806569b&to=995d3e326ee1d9489145e20762c65465a9caeab4&stat=instructions Differential Revision: https://reviews.llvm.org/D118781	2022-02-04 11:44:03 +01:00
Snehasish Kumar	dbf47d227d	Revert "[ProfileData] Read and symbolize raw memprof profiles." This reverts commit `26f978d4c5`. This patch added a transitive dependency on libcurl via symbolize. See discussion https://reviews.llvm.org/D116784#inline-1137928 https://reviews.llvm.org/D113717#3295350	2022-02-03 16:14:05 -08:00
Snehasish Kumar	26f978d4c5	[ProfileData] Read and symbolize raw memprof profiles. This change extends the RawMemProfReader to read all the sections of the raw profile and symbolize the virtual addresses recorded as part of the callstack for each allocation. For now the symbolization is used to display the contents of the profile with llvm-profdata. Differential Revision: https://reviews.llvm.org/D116784	2022-02-03 14:33:50 -08:00
Ben Barham	502f14d6f2	[VFS] Add a "redirecting-with" field to overlays Extend "fallthrough" to allow a third option: "fallback". Fallthrough allows the original path to used if the redirected (or mapped) path fails. Fallback is the reverse of this, ie. use the original path and fallback to the mapped path otherwise. While this result can be achieved today using multiple overlays, this adds a much more intuitive option. As an example, take two directories "A" and "B". We would like files from "A" to be used, unless they don't exist, in which case the VFS should fallback to those in "B". With the current fallthrough option this is possible by adding two overlays: one mapping from A -> B and another mapping from B -> A. Since the frontend nests the two RedirectingFileSystems, the result will be that "A" is mapped to "B" and back to "A", unless it isn't in "A" in which case it fallsthrough to "B" (or fails if it exists in neither). Using "fallback" semantics allows a single overlay instead: one mapping from "A" to "B" but only using that mapping if the operation in "A" fails first. "redirect-only" is used to represent the current "fallthrough: false" case. Differential Revision: https://reviews.llvm.org/D117937	2022-02-03 13:10:23 -08:00
Shilei Tian	1a547a94c1	[OMPIRBuilder] Add support for atomic compare This patch adds the support for `atomic compare` in `OMPIRBuilder`. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D118547	2022-02-02 15:39:14 -05:00
Andrew Litteken	30420bc344	[IRSim] Make sure that commutative intrinsics are treated as function calls without commutativity Created to fix: https://github.com/llvm/llvm-project/issues/53537 Some intrinsics functions are considered commutative since they are performing operations like addition or multiplication. Some of these have extra parameters to provide extra information that are not part of the operation itself and are not commutative. This makes sure that if an instruction that is an intrinsic takes the non commutative path to handle this case. Reviewer: paquette Closes Issue #53537 Differential Revision: https://reviews.llvm.org/D118807	2022-02-02 13:24:56 -06:00
Simon Pilgrim	675a557316	Remove redundant LLVM_HAS_RVALUE_REFERENCE_THIS and LLVM_LVALUE_FUNCTION defines Now that VS2017 support has been dropped (D114639), the LLVM_HAS_RVALUE_REFERENCE_THIS define is always true and the LLVM_LVALUE_FUNCTION define is always enabled for ref-qualifiers. This patch proposes we remove the defines and use the qualifiers directly. Differential Revision: https://reviews.llvm.org/D118609	2022-02-02 16:56:26 +00:00
Jeremy Morse	14aaaa1236	Re-apply `3fab2d138e`, now with a triple added Was reverted in `1c1b670a73` as it broke all non-x86 bots. Original commit message: [DebugInfo][InstrRef] Add a max-stack-slots-to-track cut-out In certain circumstances with things like autogenerated code and asan, you can end up with thousands of Values live at the same time, causing a large working set and a lot of information spilled to the stack. Unfortunately InstrRefBasedLDV doesn't cope well with this and consumes a lot of memory when there are many many stack slots. See the reproducer in D116821. It seems very unlikely that a developer would be able to reason about hundreds of live named local variables at the same time, so a huge working set and many stack slots is an indicator that we're likely analysing autogenerated or instrumented code. In those cases: gracefully degrade by setting an upper bound on the amount of stack slots to track. This limits peak memory consumption, at the cost of dropping some variable locations, but in a rare scenario where it's unlikely someone is actually going to use them. In terms of the patch, this adds a cl::opt for max number of stack slots to track, and has the stack-slot-numbering code optionally return None. That then filters through a number of code paths, which can then chose to not track a spill / restore if it touches an untracked spill slot. The added test checks that we drop variable locations that are on the stack, if we set the limit to zero. Differential Revision: https://reviews.llvm.org/D118601	2022-02-02 11:04:00 +00:00
serge-sans-paille	e188aae406	Cleanup header dependencies in LLVMCore Based on the output of include-what-you-use. This is a big chunk of changes. It is very likely to break downstream code unless they took a lot of care in avoiding hidden ehader dependencies, something the LLVM codebase doesn't do that well :-/ I've tried to summarize the biggest change below: - llvm/include/llvm-c/Core.h: no longer includes llvm-c/ErrorHandling.h - llvm/IR/DIBuilder.h no longer includes llvm/IR/DebugInfo.h - llvm/IR/IRBuilder.h no longer includes llvm/IR/IntrinsicInst.h - llvm/IR/LLVMRemarkStreamer.h no longer includes llvm/Support/ToolOutputFile.h - llvm/IR/LegacyPassManager.h no longer include llvm/Pass.h - llvm/IR/Type.h no longer includes llvm/ADT/SmallPtrSet.h - llvm/IR/PassManager.h no longer includes llvm/Pass.h nor llvm/Support/Debug.h And the usual count of preprocessed lines: $ clang++ -E -Iinclude -I../llvm/include ../llvm/lib/IR/*.cpp -std=c++14 -fno-rtti -fno-exceptions \| wc -l before: 6400831 after: 6189948 200k lines less to process is no that bad ;-) Discourse thread on the topic: https://llvm.discourse.group/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D118652	2022-02-02 06:54:20 +01:00
Kevin Athey	1c1b670a73	Revert "[DebugInfo][InstrRef] Add a max-stack-slots-to-track cut-out" This reverts commit `3fab2d138e`. Breaking PPC sanitizer build: https://lab.llvm.org/buildbot/#/builders/105/builds/20857	2022-02-01 18:37:02 -08:00
Nemanja Ivanovic	ec00c9cdeb	[ARM] Fix build break after `762f0b5463` The commit adds a unit test that uses the facilities of libLLVMCore without adding it to link components. This causes failures with the shared libraries builds. This patch just adds the missing library to the link step.	2022-02-01 20:14:32 -06:00
Jeremy Morse	3fab2d138e	[DebugInfo][InstrRef] Add a max-stack-slots-to-track cut-out In certain circumstances with things like autogenerated code and asan, you can end up with thousands of Values live at the same time, causing a large working set and a lot of information spilled to the stack. Unfortunately InstrRefBasedLDV doesn't cope well with this and consumes a lot of memory when there are many many stack slots. See the reproducer in D116821. It seems very unlikely that a developer would be able to reason about hundreds of live named local variables at the same time, so a huge working set and many stack slots is an indicator that we're likely analysing autogenerated or instrumented code. In those cases: gracefully degrade by setting an upper bound on the amount of stack slots to track. This limits peak memory consumption, at the cost of dropping some variable locations, but in a rare scenario where it's unlikely someone is actually going to use them. In terms of the patch, this adds a cl::opt for max number of stack slots to track, and has the stack-slot-numbering code optionally return None. That then filters through a number of code paths, which can then chose to not track a spill / restore if it touches an untracked spill slot. The added test checks that we drop variable locations that are on the stack, if we set the limit to zero. Differential Revision: https://reviews.llvm.org/D118601	2022-02-01 19:25:29 +00:00
tyb0807	762f0b5463	[ARM] Make getInstSizeInBytes() use instruction size from InstrInfo.td Currently, ARMBaseInstrInfo::getInstSizeInBytes() uses hard-coded instruction size for some pseudo-instructions, while this information should ideally be found in ARMInstrInfo.td, ARMInstrThumb(2).td files (which can be accessed via MCInstrDesc). Hence, the .td files should be updated and no hard-coded instruction sizes should be used by getInstSizeInBytes() anymore. Differential Revision: https://reviews.llvm.org/D118009	2022-02-01 10:39:14 +00:00
tyb0807	dd88f40c80	[AArch64] Make getInstSizeInBytes() use instruction size from InstrInfo.td Currently, AArch64InstrInfo::getInstSizeInBytes() uses hard-coded instruction size for some pseudo-instructions, while this information should ideally be found in AArch64InstrInfo.td file (which can be accessed via MCInstrDesc). Hence, the .td file should be updated and no hard-coded instruction sizes should be used by getInstSizeInBytes() anymore. Differential Revision: https://reviews.llvm.org/D117970	2022-02-01 10:39:14 +00:00
Johannes Doerfert	783544bd16	[Attributor][FIX] Repair broken unit test	2022-02-01 02:13:17 -06:00
Kuter Dinel	b2d1ae0611	[Attributor] AAFunctionReachability, Instruction reachability. This patch implement instruction reachability for AAFunctionReachability attribute. It is used to tell if a certain instruction can reach a function transitively. NOTE: I created a new commit based of D106720 and set the author back to Kuter. Other metadata, etc. is wrong. I also addressed the remaining review comments and fixed the unit test. Differential Revision: https://reviews.llvm.org/D106720	2022-02-01 01:40:44 -06:00
Chris Bieneman	4f710515fb	Fix memory leak I introduced in `2d66ed370a` This should fix the asan issue identified on the Linux asan bot.	2022-01-31 19:44:47 -06:00
Chris Bieneman	2d66ed370a	[NFC] Skip PassBuilderCTests if no default triple This fixes the unit tests so that it is skipped if there is no default target triple set. Unset default target triple is a supported build configuration for LLVM.	2022-01-31 16:24:42 -06:00
tyb0807	78fd413cf7	[AArch64][GlobalISel] CodeGen for Armv8.8/9.3 MOPS This implements codegen for Armv8.8/9.3 Memory Operations extension (MOPS). Any memcpy/memset/memmov intrinsics will always be emitted as a series of three consecutive instructions P, M and E which perform the operation. The SelectionDAG implementation is split into a separate patch. AArch64LegalizerInfo will now consider the following generic opcodes if +mops is available, instead of legalising by expanding them to libcalls: G_BZERO, G_MEMCPY_INLINE, G_MEMCPY, G_MEMMOVE, G_MEMSET The s8 value of memset is legalised to s64 to match the pseudos. AArch64O0PreLegalizerCombinerInfo will still be able to combine G_MEMCPY_INLINE even if +mops is present, as it is unclear whether it is better to generate fixed length copies or MOPS instructions for the inline code of small or zero-sized memory operations, so we choose to be conservative for now. AArch64InstructionSelector will select the above as new pseudo instructions: AArch64::MOPSMemory{Copy/Move/Set/SetTagging} These are each expanded to a series of three instructions (e.g. SETP/SETM/SETE) which must be emitted together during code emission to avoid scheduler reordering. This is part 3/4 of a series of patches split from https://reviews.llvm.org/D117405 to facilitate reviewing. Patch by Tomas Matheson and Son Tuan Vu Differential Revision: https://reviews.llvm.org/D117763	2022-01-31 20:54:41 +00:00
Chris Bieneman	9f4f729207	[NFC] Fix build when LLVM_DEFAULT_TARGET_TRIPLE="" We do support building with a default target unspecified. This fixes two small build issues that prevented LLVM's unit tests from building and libSupport from building on Windows.	2022-01-31 13:31:55 -06:00
Ties Stuij	6b1e844b69	[ARM] Add Cortex-X1C Support for Clang and LLVM This patch upstreams support for the Arm-v8 Cortex-X1C processor for AArch64 and ARM. For more information, see: - https://community.arm.com/arm-community-blogs/b/announcements/posts/arm-cortex-x1c - https://developer.arm.com/documentation/101968/0002/Functional-description/Technical-overview/Components The following people contributed to this patch: - Simon Tatham - Ties Stuij Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D117202	2022-01-31 14:23:35 +00:00
Andrew Litteken	3785c1d055	[IRSim][IROutliner] Allowing Intrinsic Calls to be Used in Similarity Matching and Outlined Regions Due to some complications with lifetime, and assume-like intrinsics, intrinsics were not included as outlinable instructions. This patch opens up most intrinsics, excluding lifetime and assume-like intrinsics, to be outlined. For similarity, it is required that the intrinsic IDs, and the intrinsics names match exactly, as well as the function type. This puts intrinsics in a different class than normal call instructions (https://reviews.llvm.org/D109448), where the name will no longer have to match. This also adds an additional command line flag debug option to disable outlining intrinsics. Recommit of: `8de76bd569` Adds extra checking of intrinsic function calls names to avoid taking the address of intrinsic calls when extracting function calls. Reviewers: paquette, jroelofs Differential Revision: https://reviews.llvm.org/D109450	2022-01-28 13:52:21 -06:00
Aaron Ballman	f489e86a24	Remove Waymarking.h as it is unused This file was added in https://reviews.llvm.org/D74415. There was no justification as to why it was added, and after about a year of being in-tree, it's still unused, so this removes it.	2022-01-28 14:20:06 -05:00
Yuanfang Chen	a41c8b8fd5	[ADT] support fixed-width output with `utohexstr` Will use it to output a hash value that needs fixed-width. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D118427	2022-01-28 10:07:54 -08:00
Amilendra Kodithuwakku	1f08b08674	[clang][ARM] Emit warnings when PACBTI-M is used with unsupported architectures Branch protection in M-class is supported by - Armv8.1-M.Main - Armv8-M.Main - Armv7-M Attempting to enable this for other architectures, either by command-line (e.g -mbranch-protection=bti) or by target attribute in source code (e.g. __attribute__((target("branch-protection=..."))) ) will generate a warning. In both cases function attributes related to branch protection will not be emitted. Regardless of the warning, module level attributes related to branch protection will be emitted when it is enabled via the command-line. The following people also contributed to this patch: - Victor Campos Reviewed By: chill Differential Revision: https://reviews.llvm.org/D115501	2022-01-28 09:59:58 +00:00
Nikita Popov	f2c2a31dd7	[OpenMPIRBuilder] Store element type in AtomicOpValue With opaque pointers, we can no longer derive this from the pointer type, so we need to explicitly provide the element type the atomic operation should work with. Differential Revision: https://reviews.llvm.org/D118359	2022-01-28 09:35:11 +01:00
Benjamin Kramer	f15014ff54	Revert "Rename llvm::array_lengthof into llvm::size to match std::size from C++17" This reverts commit `ef82063207`. - It conflicts with the existing llvm::size in STLExtras, which will now never be called. - Calling it without llvm:: breaks C++17 compat	2022-01-26 16:55:53 +01:00
serge-sans-paille	ef82063207	Rename llvm::array_lengthof into llvm::size to match std::size from C++17 As a conquence move llvm::array_lengthof from STLExtras.h to STLForwardCompat.h (which is included by STLExtras.h so no build breakage expected).	2022-01-26 16:17:45 +01:00
Nikita Popov	2461aee6db	[OpenMPIRBuilderTest] Avoid pointer element type access This was reintroduced by D110114.	2022-01-26 11:02:43 +01:00
Giorgis Georgakoudis	7cb4c26173	[OMPIRBuilder] Generate aggregate argument for parallel region outlined functions Summary: This patch modifies code generation in OpenMPIRBuilder to pass arguments to the parallel region outlined function in an aggregate (struct), besides the global_tid and bound_tid arguments. It depends on the updated CodeExtractor (see D96854) for support. It mirrors functionality of Clang codegen (see D102107). Differential Revision: https://reviews.llvm.org/D110114	2022-01-25 20:53:45 -05:00
Giorgis Georgakoudis	95b981ca2a	[CodeExtractor] Enable partial aggregate arguments Summary: Enable CodeExtractor to construct output functions that partially aggregate inputs/outputs in their argument list. A use case is the OMPIRBuilder to create outlined functions for parallel regions that aggregate in a struct the payload variables for the region while passing as scalars thread and bound identifiers. Differential Revision: https://reviews.llvm.org/D96854	2022-01-25 20:50:34 -05:00
Andrew Litteken	e8f4e41b6b	[IRSim][IROutliner] Add support for outlining PHINodes with the rest of the region. We use the same similarity scheme we used for branch instructions for phi nodes, and allow them to be outlined. There is not a lot of special handling needed for these phi nodes when outlining, as they simply act as outputs. The code extractor does not currently allow for non entry blocks within the extracted region to have predecessors, so there are not conflicts to handle with respect to predecessors no longer contained in the function. Recommit of `515eec3553` Reviewers: paquette Differential Revision: https://reviews.llvm.org/D106997	2022-01-25 18:25:50 -06:00
Andrew Litteken	e50b217b4e	Revert "[IRSim][IROutliner] Add support for outlining PHINodes with the rest of the region." This reverts commit `515eec3553`. By mistake, commit message was not complete.	2022-01-25 18:24:19 -06:00
Andrew Litteken	515eec3553	[IRSim][IROutliner] Add support for outlining PHINodes with the rest of the region.	2022-01-25 18:20:10 -06:00
Andrew Litteken	9c2daf648c	Revert "[IRSim][IROutliner] Allowing Intrinsic Calls to be Used in Similarity Matching and Outlined Regions" This reverts commit `8de76bd569`. Reverting due to failure of different-intrinsics.ll on lld-x86_64-win buildbot.	2022-01-25 18:19:33 -06:00
Andrew Litteken	8de76bd569	[IRSim][IROutliner] Allowing Intrinsic Calls to be Used in Similarity Matching and Outlined Regions Due to some complications with lifetime, and assume-like intrinsics, intrinsics were not included as outlinable instructions. This patch opens up most intrinsics, excluding lifetime and assume-like intrinsics, to be outlined. For similarity, it is required that the intrinsic IDs, and the intrinsics names match exactly, as well as the function type. This puts intrinsics in a different class than normal call instructions (https://reviews.llvm.org/D109448), where the name will no longer have to match. This also adds an additional command line flag debug option to disable outlining intrinsics. Reviewers: paquette, jroelofs Differential Revision: https://reviews.llvm.org/D109450	2022-01-25 17:06:09 -06:00
Andrew Litteken	f5f377d1fc	[IRSim][IROutliner] Adding support for recognizing and outlining indirect function calls, and function calls with different names, but the same type The outliner currently requires that function calls not be indirect calls, and have that the function name, and function type must match, as well as other attributes such as calling conventions. This patch treats called functions as values, and just another operand, and named function calls as constants. This allows functions to be treated like any other constant, or input and output into the outlined functions. There are also debugging flags added to enforce the old behaviors where indirect calls not be allowed, and to enforce the old rule that function calls names must also match. Reviewers: paquette, jroelofs Differential Revision: https://reviews.llvm.org/D109448	2022-01-25 15:19:28 -06:00
Nikita Popov	8e3e772f84	[OpenMPIRBuilderTest] Avoid some pointer element type accesses (NFC) Use isOpaqueOrPointeeTypeMatches() instead, where possible.	2022-01-25 14:40:01 +01:00
Nikita Popov	475927d046	[AsmParserTest] Avoid pointer element type accesses (NFC) Use isOpaqueOrPointeeTypeEquals() instead.	2022-01-25 14:31:51 +01:00
Nikita Popov	22487280dc	[NFC] Remove more uses of PointerType::getElementType() (NFC) Replace more uses which I missed in the first pass with Type::getPointerElementType().	2022-01-25 10:13:53 +01:00
Matt Arsenault	99e8e17313	Reapply "Revert "GlobalISel: Add G_ASSERT_ALIGN hint instruction" This reverts commit `a97e20a3a8`.	2022-01-24 09:26:52 -05:00
serge-sans-paille	5f290c090a	Move STLFunctionalExtras out of STLExtras Only using that change in StringRef already decreases the number of preoprocessed lines from 7837621 to 7776151 for LLVMSupport Perhaps more interestingly, it shows that many files were relying on the inclusion of StringRef.h to have the declaration from STLExtras.h. This patch tries hard to patch relevant part of llvm-project impacted by this hidden dependency removal. Potential impact: - "llvm/ADT/StringRef.h" no longer includes <memory>, "llvm/ADT/Optional.h" nor "llvm/ADT/STLExtras.h" Related Discourse thread: https://llvm.discourse.group/t/include-what-you-use-include-cleanup/5831	2022-01-24 14:13:21 +01:00
Phoebe Wang	37d1d02200	[X86][MS] Change the alignment of f80 to 16 bytes on Windows 32bits to match with ICC MSVC currently doesn't support 80 bits long double. ICC supports it when the option `/Qlong-double` is specified. Changing the alignment of f80 to 16 bytes so that we can be compatible with ICC's option. Reviewed By: rnk, craig.topper Differential Revision: https://reviews.llvm.org/D115942	2022-01-23 09:58:46 +08:00
Chris Bieneman	9d3437fbf3	[ADT] [NFC] Add StringRef::detectEOL This change moves EOL detection out of the clang::InclusionRewriter into llvm::StringRef so that it can be easily reused elsewhere. It also adds additional explicit test cases to verify the correct and expected return results. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D117626	2022-01-21 09:47:02 -06:00
serge-sans-paille	e9211e0393	Remove dependency from raw_ostream on <chrono> The tryLockFor method from raw_fd_sotreamis the sole user of that header, and it's not referenced in the mono repo. I still chose to keep it (may be useful for downstream user) but added a transient type that's forward declared to hold the duration parameter. Notable changes: - "llvm/Support/Duration.h" must be included in order to use tryLockFor. - "llvm/Support/raw_ostream.h" no longer includes <chrono> This sole change has an interesting impact on the number of processed line, as measured by: clang++ -E -Iinclude -I../llvm/include ../llvm/lib/Support/*.cpp -std=c++14 -fno-rtti -fno-exceptions \| wc -l before: 7917500 after: 7835142 Discourse thread on the topic: https://llvm.discourse.group/t/include-what-you-use-include-cleanup/5831	2022-01-21 15:17:39 +01:00
Jan Svoboda	622354a522	[llvm][ADT] Implement `BitVector::{pop_,}back` LLVM Programmer’s Manual strongly discourages the use of `std::vector<bool>` and suggests `llvm::BitVector` as a possible replacement. Currently, some users of `std::vector<bool>` cannot switch to `llvm::BitVector` because it doesn't implement the `pop_back()` and `back()` functions. To enable easy transition of `std::vector<bool>` users, this patch implements `llvm::BitVector::pop_back()` and `llvm::BitVector::back()`. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D117115	2022-01-21 14:50:53 +01:00
Archibald Elliott	38ac4093d9	[NFCI][Support] Avoid ASSERT_/EXPECT_TRUE(A <op> B) The error messages in tests are far better when a test fails if the test is written using ASSERT_/EXPECT_<operator>(A, B) rather than ASSERT_/EXPECT_TRUE(A <operator> B). This commit updates all of llvm/unittests/Support to use these macros where possible. This change has not been possible in: - llvm/unittests/Support/FSUniqueIDTest.cpp - due to not overloading operators beyond ==, != and <. - llvm/unittests/Support/BranchProbabilityTest.cpp - where the unchanged tests are of the operator overloads themselves. There are other possibilities of this conversion not being valid, which have not applied in these tests, as they do not use NULL (they use nullptr), and they do not use const char* (they use std::string or StringRef). Reviewed By: mubashar_ Differential Revision: https://reviews.llvm.org/D117319	2022-01-21 13:15:04 +00:00
serge-sans-paille	75e164f61d	[llvm] Cleanup header dependencies in ADT and Support The cleanup was manual, but assisted by "include-what-you-use". It consists in 1. Removing unused forward declaration. No impact expected. 2. Removing unused headers in .cpp files. No impact expected. 3. Removing unused headers in .h files. This removes implicit dependencies and is generally considered a good thing, but this may break downstream builds. I've updated llvm, clang, lld, lldb and mlir deps, and included a list of the modification in the second part of the commit. 4. Replacing header inclusion by forward declaration. This has the same impact as 3. Notable changes: - llvm/Support/TargetParser.h no longer includes llvm/Support/AArch64TargetParser.h nor llvm/Support/ARMTargetParser.h - llvm/Support/TypeSize.h no longer includes llvm/Support/WithColor.h - llvm/Support/YAMLTraits.h no longer includes llvm/Support/Regex.h - llvm/ADT/SmallVector.h no longer includes llvm/Support/MemAlloc.h nor llvm/Support/ErrorHandling.h You may need to add some of these headers in your compilation units, if needs be. As an hint to the impact of the cleanup, running clang++ -E -Iinclude -I../llvm/include ../llvm/lib/Support/*.cpp -std=c++14 -fno-rtti -fno-exceptions \| wc -l before: 8000919 lines after: 7917500 lines Reduced dependencies also helps incremental rebuilds and is more ccache friendly, something not shown by the above metric :-) Discourse thread on the topic: https://llvm.discourse.group/t/include-what-you-use-include-cleanup/5831	2022-01-21 13:54:49 +01:00
Daniel Thornburgh	6b92bb4790	[Support] [DebugInfo] Lazily create cache dir. This change defers creating Support/Caching.cpp's cache directory until it actually writes to the cache. This allows using Caching library in a read-only fashion. If read-only, the cache is guaranteed not to write to disk. This keeps tools using DebugInfod (currently llvm-symbolizer) hermetic when not configured to perform remote lookups. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D117589	2022-01-20 19:27:15 +00:00
Michael Kruse	616f77172f	[OpenMPIRBuilder] Detect and fix ambiguous InsertPoints for createParallel. When a Builder methods accepts multiple InsertPoints, when both point to the same position, inserting instructions at one position will "move" the other after the inserted position since the InsertPoint is pegged to the instruction following the intended InsertPoint. For instance, when creating a parallel region at Loc and passing the same position as AllocaIP, creating instructions at Loc will "move" the AllocIP behind the Loc position. To avoid this ambiguity, add an assertion checking this condition and fix the unittests. In case of AllocaIP, an alternative solution could be to implicitly split BasicBlock at InsertPoint, using the first as AllocaIP, the second for inserting the instructions themselves. However, this solution is specific to AllocaIP since AllocaIP will always have to be first. Hence, this is an argument to generally handling ambiguous InsertPoints as API sage error. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D117226	2022-01-20 10:13:44 -06:00
Lang Hames	9eb4939b86	[ORC] Allow JITDylib::getDFSLinkOrder and friends to fail for defunct JITDylibs. Calls to JITDylib's getDFSLinkOrder and getReverseDFSLinkOrder methods (both static an non-static versions) are now valid to make on defunct JITDylibs, but will return an error if any JITDylib in the link order is defunct. This means that platforms can safely lookup link orders by name in response to jit-dlopen calls from the ORC runtime, even if the call names a defunct JITDylib -- the call will just fail with an error.	2022-01-20 17:45:32 +11:00

... 2 3 4 5 6 ...

7736 Commits