llvm-project

Commit Graph

Author	SHA1	Message	Date
Nikita Popov	e1d47d86d8	[IR] Report whether replaceUsesOfWith() changed something (NFC) With change reporting in transformation passes in mind.	2022-05-18 11:46:28 +02:00
Alexander Shaposhnikov	0f4d9f9b71	[ConstantRange] Improve the implementation of binaryAnd This diff adjusts binaryAnd to take advantage of the analysis based on KnownBits. Differential revision: https://reviews.llvm.org/D125603 Test plan: 1/ ninja check-llvm 2/ ninja check-llvm-unit	2022-05-17 22:06:03 +00:00
Nikita Popov	2db4dc7ec0	[ConstantRange] Implement binaryXor() using known bits This allows us to compute known high bits. It's not optimal, but better than nothing.	2022-05-17 10:05:12 +02:00
Nikita Popov	8ab819ad90	[ConstantRange] Add toKnownBits() method Add toKnownBits() method to mirror fromKnownBits(). We know the top bits that are constant between min and max. The return value for an empty range is chosen to be conservative.	2022-05-16 16:12:25 +02:00
Nicolas Abram Lujan	436bbce765	[llvm-c] Add functions for enabling and creating opaque pointers This is based on https://reviews.llvm.org/D125168 which adds a wrapper to allow use of opaque pointers from the C API. I added an opaque pointer mode test to echo.ll, and to fix assertions that forbid the use of mixed typed and opaque pointers that were triggering in it I had to also add wrappers for setOpaquePointers() and isOpaquePointer(). I also changed echo.ll to remove a bitcast i32* %x to i8*, because passing it through llvm-as and llvm-dis was generating a %0 = bitcast ptr %x to ptr, but when building that same bitcast in echo.cpp it was getting elided by IRBuilderBase::CreateCast (`08ac661248/llvm/include/llvm/IR/IRBuilder.h (L1998-L1999)`). Differential Revision: https://reviews.llvm.org/D125183	2022-05-16 10:53:46 +02:00
Wolfgang Pieb	2740c1875d	[NFC][Metadata] Refactor allocation, initalization and deletion of MDNodes. This patch is refactoring the allocation, initialization and deletion of MDNodes. It is intended as a preparatory patch for the upcoming addition of dynamic resizability of MDNodes. It is fundamentally NFC, but removes the necessity for suppressing the memory sanitizer for MDNode's operator delete. Reviewers: dexonsmith Differential Revision: https://reviews.llvm.org/D125489	2022-05-13 16:05:29 -07:00
Craig Topper	39e63bd2d8	[IR][CostModel] A scalable vector shuffle can't be an identity or reverse shuffle. Even if the minimum number of elements is 1 and the length doesn't change, we don't know what vscale is so we can't classify it as identity mask. Instead it is a zero element splat. For reverse, we shouldn't classify it as a reverse unless there are at least 2 elements in the mask. This applies to both fixed and scalable vectors. For fixed vectors, a single element would be an identity shuffle. For scalable vector it's a zero elt splat. Reviewed By: sdesmalen, liaolucy Differential Revision: https://reviews.llvm.org/D124655	2022-05-09 21:37:25 -07:00
Benjamin Kramer	17d27d926b	[IR] Simplify code. NFCI.	2022-05-05 16:06:59 +02:00
Benjamin Kramer	08b20f20d2	[ConstantFold] Use getFltSemantics instead of manually checking the type Simplifies the code and makes fpext/fptrunc constant folding not crash when the result is bf16.	2022-05-05 15:52:19 +02:00
Nikita Popov	95fedfab6c	[InstCombine] Handle non-canonical GEP index in indexed compare fold (PR55228) Normally the index type will already be canonicalized here, but this is not guaranteed depending on visitation order. The code was already accounting for a potentially needed sext, but a trunc may also be needed. Add a ConstantExpr::getSExtOrTrunc() helper method to make this simpler. This matches the corresponding IRBuilder method in behavior. Fixes https://github.com/llvm/llvm-project/issues/55228.	2022-05-02 17:56:01 +02:00
Phoebe Wang	7c04454227	[ArgPromotion][Attributor] Update min-legal-vector-width when do promotion X86 codegen uses function attribute `min-legal-vector-width` to select the proper ABI. The intention of the attribute is to reflect user's requirement when they passing or returning vector arguments. So Clang front-end will iterate the vector arguments and set `min-legal-vector-width` to the width of the maximum for both caller and callee. It is assumed any middle end optimizations won't care of the attribute expect inlining and argument promotion. - For inlining, we will propagate the attribute of inlined functions because the inlining functions become the newer caller. - For argument promotion, we check the `min-legal-vector-width` of the caller and callee and refuse to promote when they don't match. The problem comes from the optimizations' combination, as shown by https://godbolt.org/z/zo3hba8xW. The caller `foo` has two callees `bar` and `baz`. When doing argument promotion, both `foo` and `bar` has the same `min-legal-vector-width`. So the argument was promoted to vector. Then the inlining inlines `baz` to `foo` and updates `min-legal-vector-width`, which results in ABI mismatch between `foo` and `bar`. This patch fixes the problem by expanding the concept of `min-legal-vector-width` to indicator of functions arguments. That says, any passes touch functions arguments have to set `min-legal-vector-width` to the value reflects the width of vector arguments. It makes sense to me because any arguments modifications are ABI related and should response for the ABI compatibility. Differential Revision: https://reviews.llvm.org/D123284	2022-05-02 14:13:05 +08:00
Jack Andersen	09325d3606	[CAPI] Expose CastInst::getCastOpcode in C API Reviewed By: deadalnix Differential Revision: https://reviews.llvm.org/D91514	2022-04-30 18:40:04 -04:00
Augie Fackler	a907d36cfe	Attributes: add a new `allocptr` attribute This continues the push away from hard-coded knowledge about functions towards attributes. We'll use this to annotate free(), realloc() and cousins and obviate the hard-coded list of free functions. Differential Revision: https://reviews.llvm.org/D123083	2022-04-26 13:57:11 -04:00
YASHASVI KHATAVKAR	e83543f8c2	Don't replace Undef with null value for Constants Differential Revision:https://reviews.llvm.org/D124098	2022-04-25 20:50:00 -04:00
Chris Bieneman	e6f44a3cd2	Add PointerType analysis for DirectX backend As implemented this patch assumes that Typed pointer support remains in the llvm::PointerType class, however this could be modified to use a different subclass of llvm::Type that could be disallowed from use in other contexts. This does not rely on inserting typed pointers into the Module, it just uses the llvm::PointerType class to track and unique types. Fixes #54918 Reviewed By: kuhar Differential Revision: https://reviews.llvm.org/D122268	2022-04-25 17:49:43 -05:00
Vitaly Buka	9be90748f1	Revert "[asan] Emit .size directive for global object size before redzone" Revert "[docs] Fix underline" Breaks a lot of asan tests in google. This reverts commit `365c3e85bc`. This reverts commit `78a784bea4`.	2022-04-21 16:21:17 -07:00
Alex Brachet	78a784bea4	[asan] Emit .size directive for global object size before redzone This emits an `st_size` that represents the actual useable size of an object before the redzone is added. Reviewed By: vitalybuka, MaskRay, hctim Differential Revision: https://reviews.llvm.org/D123010	2022-04-21 20:46:38 +00:00
Vitaly Buka	700442dee3	[msan] Destroy ConstantTokenNone before types above ~ConstantTokenNone access them, so it should be destroyed first.	2022-04-19 16:57:32 -07:00
Paul Kirth	bac6cd5bf8	[misexpect] Re-implement MisExpect Diagnostics Reimplements MisExpect diagnostics from D66324 to reconstruct its original checking methodology only using MD_prof branch_weights metadata. New checks rely on 2 invariants: 1) For frontend instrumentation, MD_prof branch_weights will always be populated before llvm.expect intrinsics are lowered. 2) for IR and sample profiling, llvm.expect intrinsics will always be lowered before branch_weights are populated from the IR profiles. These invariants allow the checking to assume how the existing branch weights are populated depending on the profiling method used, and emit the correct diagnostics. If these invariants are ever invalidated, the MisExpect related checks would need to be updated, potentially by re-introducing MD_misexpect metadata, and ensuring it always will be transformed the same way as branch_weights in other optimization passes. Frontend based profiling is now enabled without using LLVM Args, by introducing a new CodeGen option, and checking if the -Wmisexpect flag has been passed on the command line. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D115907	2022-04-19 21:23:48 +00:00
Craig Topper	ac8c720d48	[IR] Allow constant folding (insertelement <vscale x 2 x i32> zeroinitializer, i32 0, i32 i32 0. Most of insertelement constant folding is blocked if the vector type is scalable. I believe we can make an exception for inserting null into an all zeros vector. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D123413	2022-04-15 17:44:32 -07:00
Chih-Ping Chen	eab6e94f91	[DebugInfo] Add a TargetFuncName field in DISubprogram for specifying DW_AT_trampoline as a string. Also update the signature of DIBuilder::createFunction to reflect this addition. Differential Revision: https://reviews.llvm.org/D123697	2022-04-15 16:38:23 -04:00
Alex Richardson	9107cd632d	[AutoUpgrade] Don't lose attributes when upgrading mem intrinsics The original AutoUpgrade code from `1e68724d24` did not retain existing attributes. I noticed this in some downstream test cases, but it turns out there are also two affected testcase upstream. Differential Revision: https://reviews.llvm.org/D121971	2022-04-13 09:30:10 +00:00
Daniel Kiss	b0343a38a5	Support the min of module flags when linking, use for AArch64 BTI/PAC-RET LTO objects might compiled with different `mbranch-protection` flags which will cause an error in the linker. Such a setup is allowed in the normal build with this change that is possible. Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D123493	2022-04-13 09:31:51 +02:00
Fangrui Song	982247dce5	Value::isTransitiveUsedByMetadataOnly: Don't repeatedly add an element to the worklist. NFC	2022-04-11 13:35:25 -07:00
Augie Fackler	5f09498a11	MemoryBuiltins: also check function definition for allocalign This got changed to use hasAttrSomewhere() during review, and I didn't notice until today when I was writing some tests for another part of this system that using hasAttrSomewhere only checked the callsite for allocalign, rather than both the callsite and the definition. This fixes that by introducing a helper method. Differential Revision: https://reviews.llvm.org/D121641	2022-04-07 12:38:44 -04:00
Artur Pilipenko	857d699667	Move BasicBlock::getTerminator definition to the header This way it can be inlined to its caller. This method shows up in the profile and it is essentially a fancy getter. It would benefit from inlining into its callers. NFC.	2022-04-05 13:11:38 -07:00
Tom Honermann	c54ad13602	[Lint][Verifier] NFC: Rename 'Assert' macros to 'Check'. The LLVM IR verifier and analysis linter defines and uses several macros in code that performs validation of IR expectations. Previously, these macros were named with an 'Assert' prefix. These names were misleading since the macro definitions are not conditioned on build kind; they are defined identically in builds that have asserts enabled and those that do not. This was confusing since an LLVM developer might expect these macros to be conditionally enabled as 'assert' is. Further confusion was possible since the LLVM IR verifier is implicitly disabled (in Clang::ConstructJob()) for builds without asserts enabled, but only for Clang driver invocations; not for clang -cc1 invocations. This could make it appear that the macros were not active for builds without asserts enabled, e.g. when investigating behavior using the Clang driver, and thus lead to surprises when running tests that exercise the clang -cc1 interface. This change renames this set of macros as follows: Assert -> Check AssertDI -> CheckDI AssertTBAA -> CheckTBAA	2022-04-05 15:34:35 -04:00
serge-sans-paille	1e02737593	[iwyu] Fix some header include regression Running iwyu-diff from https://github.com/serge-sans-paille/preprocessor-utils makes it possible to quickly spot regression in unused includes. This patch contains the few regressions since the last header cleanup. Differential Revision: https://reviews.llvm.org/D123036	2022-04-05 15:02:03 +02:00
Nikita Popov	46cfbe561b	[LLVMContext] Replace enableOpaquePointers() with setOpaquePointers() This allows both explicitly enabling and explicitly disabling opaque pointers, in anticipation of the default switching at some point. This also slightly changes the rules by allowing calls if either the opaque pointer mode has not yet been set (explicitly or implicitly) or if the value remains unchanged.	2022-04-05 12:02:48 +02:00
Nikita Popov	3c9f3f76f1	[ConstantFold] Fold zero-index GEPs with opaque pointers With opaque pointers, we can eliminate zero-index GEPs even if they have multiple indices, as this no longer impacts the result type of the GEP. This optimization is already done for instructions in InstSimplify, but we were missing the corresponding constant expression handling. The constexpr transform is a bit more powerful, because it can produce a vector splat constant and also handles undef values -- it is an extension of an existing single-index transform.	2022-04-04 13:04:27 +02:00
Augie Fackler	e90bce8f91	CallBase: fix getFnAttr so it also checks the function Prior to this change, CallBase::hasFnAttr checked the called function to see if it had an attribute if it wasn't set on the CallBase, but getFnAttr didn't do the same delegation, which led to very confusing behavior. This patch fixes the issue by making CallBase::getFnAttr also check the function under the same circumstances. Test changes look (to me) like they're cleaning up redundant attributes which no longer get specified both on the callee and call. We also clean up the one ad-hoc implementation of this getter over in InlineCost.cpp. Differential Revision: https://reviews.llvm.org/D122821	2022-04-03 23:19:23 -04:00
Kazu Hirata	d3684c3359	[IR] Remove unused forward declarations (NFC)	2022-04-03 12:54:54 -07:00
Vitaly Buka	0f37afc60f	Destroy ValueNames after all unique_ptr<Value> This UB detected by -fsanitize-memory-use-after-dtor in tensorflow/MLIR.	2022-03-31 21:22:07 -07:00
yanming	a7c0b7504c	[VP] Add more cast VPintrinsic and docs. Add vp.fptoui, vp.uitofp, vp.fptrunc, vp.fpext, vp.trunc, vp.zext, vp.sext, vp.ptrtoint, vp.inttoptr intrinsic and docs. Reviewed By: frasercrmck, craig.topper Differential Revision: https://reviews.llvm.org/D122291	2022-04-01 09:16:10 +08:00
Jorge Gorbe Moya	fc7573f29c	Revert "[misexpect] Re-implement MisExpect Diagnostics" This reverts commit `46774df307`.	2022-03-31 14:54:41 -07:00
Paul Kirth	46774df307	[misexpect] Re-implement MisExpect Diagnostics Reimplements MisExpect diagnostics from D66324 to reconstruct its original checking methodology only using MD_prof branch_weights metadata. New checks rely on 2 invariants: 1) For frontend instrumentation, MD_prof branch_weights will always be populated before llvm.expect intrinsics are lowered. 2) for IR and sample profiling, llvm.expect intrinsics will always be lowered before branch_weights are populated from the IR profiles. These invariants allow the checking to assume how the existing branch weights are populated depending on the profiling method used, and emit the correct diagnostics. If these invariants are ever invalidated, the MisExpect related checks would need to be updated, potentially by re-introducing MD_misexpect metadata, and ensuring it always will be transformed the same way as branch_weights in other optimization passes. Frontend based profiling is now enabled without using LLVM Args, by introducing a new CodeGen option, and checking if the -Wmisexpect flag has been passed on the command line. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D115907	2022-03-31 17:38:21 +00:00
Serge Pavlov	881350a92d	Mapping of FP operations to constrained intrinsics A new function 'getConstrainedIntrinsic' is added, which for any gived instruction returns id of the corresponding constrained intrinsic. If there is no constrained counterpart for the instruction or the instruction is already a constrained intrinsic, the function returns zero. This is recommit of `115b3ace36`, reverted in `8160dd582b`. Differential Revision: https://reviews.llvm.org/D69562	2022-03-31 11:07:47 +07:00
Fangrui Song	e572927f63	[AutoUpgrade] Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=off builds	2022-03-30 13:31:18 -07:00
Fraser Cormack	73244e8f85	[VP] Add vp.icmp comparison intrinsic and docs This patch mostly follows up on D121292 which introduced the vp.fcmp intrinsic. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D122729	2022-03-30 17:05:11 +01:00
Nikita Popov	d6887256c2	[AutoUpgrade] Don't upgrade intrinsics returning overloaded struct type We only want to do the upgrade from named to anonymous struct return if the intrinsic is declared to return a struct, but not if it has an overloaded return type that just happens to be a struct. In that case the struct type will be mangled into the intrinsic name and there is no problem. This should address the problem reported in https://reviews.llvm.org/D122471#3416598.	2022-03-30 17:27:26 +02:00
Fraser Cormack	da6131f20a	[VP] Add vp.fcmp comparison intrinsic and docs This patch adds the first support for vector-predicated comparison intrinsics, starting with vp.fcmp. It uses metadata to encode its condition code, like the llvm.experimental.constrained.fcmp intrinsic. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D121292	2022-03-30 14:39:18 +01:00
Serge Pavlov	8160dd582b	Revert "Mapping of FP operations to constrained intrinsics" This reverts commit `115b3ace36`. Starting from this commit the buildbot sanitizer-x86_64-linux-bootstrap-msan starts failing (build 10071). Reverted for investigation.	2022-03-30 16:46:43 +07:00
Nikita Popov	8a72391f60	[IR] Require intrinsic struct return type to be anonymous This is an alternative to D122376. Rather than working around the problem, this patch requires that struct return types in intrinsics are anonymous/literal and adds auto-upgrade code to convert existing uses of intrinsics with named struct types. This ensures that the mapping between intrinsic name and intrinsic function type is actually bijective, as it is supposed to be. This also fixes https://github.com/llvm/llvm-project/issues/37891. Differential Revision: https://reviews.llvm.org/D122471	2022-03-30 09:51:24 +02:00
Serge Pavlov	115b3ace36	Mapping of FP operations to constrained intrinsics A new function 'getConstrainedIntrinsic' is added, which for any gived instruction returns id of the corresponding constrained intrinsic. If there is no constrained counterpart for the instruction or the instruction is already a constrained intrinsic, the function returns zero. Differential Revision: https://reviews.llvm.org/D69562	2022-03-30 12:21:30 +07:00
Paul Kirth	90cb325abd	Revert "[misexpect] Re-implement MisExpect Diagnostics" This reverts commit `2add3fbd97`.	2022-03-29 06:20:30 +00:00
Johannes Doerfert	7df2eba7fa	[Attributor][OpenMP] Add assumption for non-call assembly instructions Inline assembly is scary but we need to support it for the OpenMP GPU device runtime. The new assumption expresses the fact that it may not have call semantics, that is, it will not call another function but simply perform an operation or side-effect. This is important for reachability in the presence of inline assembly. Differential Revision: https://reviews.llvm.org/D109986	2022-03-28 20:57:52 -05:00
Johannes Doerfert	bb0b23174e	[InstCombineCalls] Optimize call of bitcast even w/ parameter attributes Before we gave up if a call through bitcast had parameter attributes. Interestingly, we allowed attributes for the return value already. We now handle both the same way, namely, we drop the ones that are incompatible with the new type and keep the rest. This cannot cause "more UB" than initially present. Differential Revision: https://reviews.llvm.org/D119967	2022-03-28 20:57:52 -05:00
Paul Kirth	2add3fbd97	[misexpect] Re-implement MisExpect Diagnostics Reimplements MisExpect diagnostics from D66324 to reconstruct its original checking methodology only using MD_prof branch_weights metadata. New checks rely on 2 invariants: 1) For frontend instrumentation, MD_prof branch_weights will always be populated before llvm.expect intrinsics are lowered. 2) for IR and sample profiling, llvm.expect intrinsics will always be lowered before branch_weights are populated from the IR profiles. These invariants allow the checking to assume how the existing branch weights are populated depending on the profiling method used, and emit the correct diagnostics. If these invariants are ever invalidated, the MisExpect related checks would need to be updated, potentially by re-introducing MD_misexpect metadata, and ensuring it always will be transformed the same way as branch_weights in other optimization passes. Frontend based profiling is now enabled without using LLVM Args, by introducing a new CodeGen option, and checking if the -Wmisexpect flag has been passed on the command line. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D115907	2022-03-28 23:30:04 +00:00
Kazu Hirata	2bc684cb6c	Apply clang-tidy fixes for readability-redundant-member-init in Module.cpp (NFC)	2022-03-28 09:18:27 -07:00
Luo, Yuanke	1fd118ffc4	Verify parameter alignment attribute In DAGISel, the parameter alignment only have 4 bits to hold the value. The encode(alignment) would plus the value by 1, so the max aligment that ISel can support is 2^14. This patch verify align attribute for parameter. Differential Revision: https://reviews.llvm.org/D122130	2022-03-27 09:03:22 +08:00
Luo, Yuanke	321cbf75be	[Verifier] Verify parameter alignment. In DAGISel, the parameter alignment only have 4 bits to hold the value. The encode(alignment) would plus the shift value by 1, so the max aligment ISel can support is 2^14. This patch verify the parameter and return value for alignment. Differential Revision: https://reviews.llvm.org/D121898	2022-03-27 08:35:05 +08:00
Nikita Popov	cde6003ae0	[LLVMContext] Respect default value of -opaque-pointers option (NFC) If the option is edited to use true as the default, we should respect that, rather than hardcoding false here.	2022-03-23 12:59:42 +01:00
Craig Topper	49c2206b3b	[VP] Preserve address space of pointer for strided load/store intrinsics. This adds LLVMAnyPointerToElt to use instead of LLVMPointerToElt. This allows us to preserve the address space as part of the type overload for the intrinsic, but still require the vector element type to match the pointer type. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D122042	2022-03-22 09:52:54 -07:00
Hendrik Greving	510a2bbda4	[IR] Allow matching pointer to vector with opaque pointers. Allows for skipping the pointer to vector type if opaque pointers are enabled and the matching pointer is a vector pointer when matching an intrinsic signature in the verifier. No test added since lacking a target using intrinsic with pointer to vector arguments. Differential Revision: https://reviews.llvm.org/D122203	2022-03-22 09:34:48 -07:00
Simon Moll	7de383c892	[VP] Fix VPintrinsic::getStaticVectorLength for vp.merge\|select VPIntrinsic::getStaticVectorLength infers the operational vector length of a VPIntrinsic instance from a type that is used with the intrinsic. The function used the mask operand before. Yet, vp.merge\|select do not have a mask operand (in the predicating sense that the other VP intrinsics are using them - it is a selection mask for them). Fallback to the return type to fix this. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D121913	2022-03-22 11:41:23 +01:00
Arthur Eubanks	2362c4ecdc	Revert "Revert "[OpaquePtr][LLParser] Automatically detect opaque pointers in .ll files"" This reverts commit `9c96a6bbfd`. Issues were already fixed at head.	2022-03-21 17:24:56 -07:00
Mitch Phillips	9c96a6bbfd	Revert "[OpaquePtr][LLParser] Automatically detect opaque pointers in .ll files" This reverts commit `295172ef51`. Reason: Broke the ASan buildbot. More details are available on the original Phab review at https://reviews.llvm.org/D119482.	2022-03-21 16:04:36 -07:00
Paul Kirth	964398ccb1	Revert "Revert "Revert "[misexpect] Re-implement MisExpect Diagnostics""" This reverts commit `6cf560d69a`.	2022-03-18 00:21:33 +00:00
Paul Kirth	6cf560d69a	Revert "Revert "[misexpect] Re-implement MisExpect Diagnostics"" I mistakenly reverted my commit, so I'm relanding it. This reverts commit `10866a1df4`.	2022-03-18 00:04:22 +00:00
Paul Kirth	10866a1df4	Revert "[misexpect] Re-implement MisExpect Diagnostics" This reverts commit `e7749d4713`.	2022-03-17 23:54:26 +00:00
Paul Kirth	e7749d4713	[misexpect] Re-implement MisExpect Diagnostics Reimplements MisExpect diagnostics from D66324 to reconstruct its original checking methodology only using MD_prof branch_weights metadata. New checks rely on 2 invariants: 1) For frontend instrumentation, MD_prof branch_weights will always be populated before llvm.expect intrinsics are lowered. 2) for IR and sample profiling, llvm.expect intrinsics will always be lowered before branch_weights are populated from the IR profiles. These invariants allow the checking to assume how the existing branch weights are populated depending on the profiling method used, and emit the correct diagnostics. If these invariants are ever invalidated, the MisExpect related checks would need to be updated, potentially by re-introducing MD_misexpect metadata, and ensuring it always will be transformed the same way as branch_weights in other optimization passes. Frontend based profiling is now enabled without using LLVM Args, by introducing a new CodeGen option, and checking if the -Wmisexpect flag has been passed on the command line. Differential Revision: https://reviews.llvm.org/D115907	2022-03-17 23:46:23 +00:00
Arthur Eubanks	295172ef51	[OpaquePtr][LLParser] Automatically detect opaque pointers in .ll files This allows us to not have to specify -opaque-pointers when updating IR tests from typed pointers to opaque pointers. We detect opaque pointers in .ll files by looking for relevant tokens, either "ptr" or "*". Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D119482	2022-03-17 08:37:18 -07:00
Jay Foad	a3a4591856	[LegacyPassManager] Move structural hashing into Pass classes. NFC. Move structural hashing into virtual methods on Pass. This will allow MachineFunctionPass to override the method to add hashing of the MachineFunction. Differential Revision: https://reviews.llvm.org/D120123	2022-03-17 09:51:12 +00:00
Arthur Eubanks	2371c5a0e0	[OpaquePtr][ARM] Use elementtype on ldrex/ldaex/stlex/strex Includes verifier changes checking the elementtype, clang codegen changes to emit the elementtype, and ISel changes using the elementtype. Basically the same as D120527. Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D121847	2022-03-16 14:11:53 -07:00
Arthur Eubanks	250620f76e	[OpaquePtr][AArch64] Use elementtype on ldxr/stxr Includes verifier changes checking the elementtype, clang codegen changes to emit the elementtype, and ISel changes using the elementtype. Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D120527	2022-03-14 10:09:59 -07:00
Nikita Popov	f00cd27646	[Verifier] Verify llvm.access.group metadata According to LangRef, an access scope must have zero operands and be distinct. The access group may either be a single access scope or a list of access scopes. LoopInfo may assert if this is not the case.	2022-03-14 16:16:36 +01:00
Nikita Popov	da48f08abf	[SCCP][IR] Landing pads are not safe to remove For landingpads with {} type, SCCP ended up dropping them, because we considered them as safe to remove.	2022-03-14 14:59:32 +01:00
serge-sans-paille	ed98c1b376	Cleanup includes: DebugInfo & CodeGen Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121332	2022-03-12 17:26:40 +01:00
Nikita Popov	237df15c08	[Verifier] Check type of swifterror alloca Per LangRef, swifterror alloca must be a pointer. Not checking this may result in a verifier error after transforms instead, so make sure it's discarded early.	2022-03-11 14:52:56 +01:00
Nikita Popov	7781f61efa	[ConstantFold] Fix scalable shufflevector fold with all-undef mask If the input is scalable, we should not be returning a fixed-width vector as a result.	2022-03-11 14:30:02 +01:00
Nikita Popov	dcc4b94d94	[llvm-c] Document that LLVMGetElementType on pointers is deprecated (NFC) We can't actually deprecate the function, because it is also used for arrays and vectors, so we can only document this.	2022-03-11 09:28:18 +01:00
Lorenzo Albano	28cfa764c2	[VP] Strided loads/stores This patch introduces two new experimental IR intrinsics and SDAG nodes to represent vector strided loads and stores. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D114884	2022-03-10 18:46:54 +01:00
Florian Hahn	f98125abb2	Revert "[PassManager] Add pretty stack entries before P->run() call." This reverts commit `128745cc26`. This increased compile-time unnecessarily. Revert this change and follow ups `2c7afadb47` & `add0c5856d`. http://llvm-compile-time-tracker.com/compare.php?from=338dfcd60f843082bb589b287d890dbd9394eb82&to=128745cc2681c284bc6d0150a319673a6d6e8424&stat=instructions	2022-03-09 18:46:32 +00:00
Florian Hahn	128745cc26	[PassManager] Add pretty stack entries before P->run() call. This patch adds PrettyStackEntries before running passes. The entries include the pass name and the IR unit the pass runs on. The information is used the print additional information when a pass crashes, including the name and a reference to the IR unit on which it crashed. This is similar to the behavior of the legacy pass manager. The improved stack trace now includes: Stack dump: 0. Program arguments: bin/opt -loop-vectorize -force-vector-width=4 crash.ll 1. Running pass 'ModuleToFunctionPassAdaptor' on module 'crash.ll' 2. Running pass 'LoopVectorizePass' on function '@a' Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D120993	2022-03-09 13:01:09 +00:00
Nikita Popov	e3d87fd6e5	[IR][IPSCCP] Treat different function type as address taken (PR54258) Without opaque pointers, this code currently treats a call through a bitcast as the function being address taken, and IPSCCP relies on this for correctness. Match the same behavior under opaque pointers by checking that the function types are the same. Fixes https://github.com/llvm/llvm-project/issues/54258.	2022-03-09 10:46:51 +01:00
Rong Xu	1712254b3f	[SampleFDO] Allow multiple of --enable-fs-discrimintor option [NFC] Allow users to use multiple of --enable-fs-discriminator option. When this option is specified multiple times, the last instance wins.	2022-03-08 11:31:20 -08:00
Simon Moll	5f62156762	[VP] Introducing VectorBuilder, the VP intrinsic builder VectorBuilder wraps around an IRBuilder and VectorBuilder::createVectorInstructions emits VP intrinsics as if they were regular instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D105283	2022-03-07 10:02:07 +01:00
Nikita Popov	a9b03d9e2e	[Attributor] Remove function pointer restriction for AAAlign This check is not compatible with opaque pointers. We can avoid it by adjusting the getPointerAlignment() implementation to avoid creating unnecessary ptrtoint expressions for bitcasted pointers. The code already uses OnlyIfReduced to not create an expression if it does not simplify, and this makes sure that folding a bitcast and ptrtoint into a ptrtoint doesn't count as a simplification. Differential Revision: https://reviews.llvm.org/D120904	2022-03-07 10:02:45 +01:00
Augie Fackler	d664c4b73c	Attributes: add a new allocalign attribute This will let us start moving away from hard-coded attributes in MemoryBuiltins.cpp and put the knowledge about various attribute functions in the compilers that emit those calls where it probably belongs. Differential Revision: https://reviews.llvm.org/D117921	2022-03-04 15:57:53 -05:00
Nikita Popov	7a258c6a37	[Bitcode] Move x86_intrcc upgrade to bitcode reader This upgrade requires access the legacy pointer element type, so it needs to happen inside the bitcode reader.	2022-03-04 10:30:50 +01:00
Simon Moll	8de8731591	Revert "[VP] Introducing VectorBuilder, the VP intrinsic builder" This reverts commit `8bcbfb50e8`. Taking this patch offline to fix breakage: https://lab.llvm.org/buildbot/#/builders/110/builds/10912	2022-03-03 13:34:37 +01:00
Simon Moll	8bcbfb50e8	[VP] Introducing VectorBuilder, the VP intrinsic builder VectorBuilder wraps around an IRBuilder and VectorBuilder::createVectorInstructions emits VP intrinsics as if they were regular instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D105283	2022-03-03 11:31:57 +01:00
Simon Moll	d05ddb86f6	[VP] vp.sitofp cast intrinsic and docs Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D119922	2022-03-02 10:16:19 +01:00
Itay Bookstein	7ca7d8126d	[Verifier] Restore defined-resolver verification for IFuncs Now that clang no longer emits GlobalIFunc-s with a declaration for a resolver, we can restore that check. In addition, add a linkage check like the one we have on GlobalAlias-es, and a Verifier test for ifuncs. Signed-off-by: Itay Bookstein <ibookstein@gmail.com> Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D120267	2022-02-26 12:56:14 +02:00
Amanieu d'Antras	54b909de68	[Mangler] Mangle aliases to fastcall/vectorcall functions correctly These aliases are produced by MergeFunctions and need to be mangled according to the calling convention of the function they are pointing to instead of defaulting to the C calling convention. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D120382	2022-02-25 22:06:47 +00:00
Nikita Popov	87ebd9a36f	[IR] Use CallBase::getParamElementType() (NFC) As this method now exists on CallBase, use it rather than the one on AttributeList.	2022-02-25 10:01:58 +01:00
Bill Wendling	a5bbc6ef99	[NFC] Remove unnecessary "#include"s from header files	2022-02-23 01:20:48 -08:00
Momchil Velikov	030503e17c	Remove duplicated code for printing the `uwtable` attribute (NFC) Committed as obvious. Reviewed By: chill Differential Revision: https://reviews.llvm.org/D120030	2022-02-17 12:24:41 +00:00
Simon Moll	03e83cc8eb	[VP] vp.fptosi cast intrinsic and docs Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D119535	2022-02-15 18:17:19 +01:00
Serguei Katkov	cd16836ce2	[Safepoint Verifier] Add a missed comment to previous commit.	2022-02-15 12:21:33 +07:00
Serguei Katkov	57092d4f4f	[Safepoint Verifier] gc.relocate does not change the constant property. Add traverse through gc.relocate in determining whether base is isExclusivelyDerivedFromNull OR ExclusivelyNull. Reviewers: reames, anna Reviewed By: reames, anna Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D119712	2022-02-15 12:18:46 +07:00
Ahmed Bougacha	c703f852c9	[IR] Define "ptrauth" operand bundle. This introduces a new "ptrauth" operand bundle to be used in call/invoke. At the IR level, it's semantically equivalent to an @llvm.ptrauth.auth followed by an indirect call, but it additionally provides additional hardening, by preventing the intermediate raw pointer from being exposed. This mostly adds the IR definition, verifier checks, and support in a couple of general helper functions. Clang IRGen and backend support will come separately. Note that we'll eventually want to support this bundle in indirectbr as well, for similar reasons. indirectbr currently doesn't support bundles at all, and the IR data structures need to be updated to allow that. Differential Revision: https://reviews.llvm.org/D113685	2022-02-14 11:27:35 -08:00
Momchil Velikov	6398903ac8	Extend the `uwtable` attribute with unwind table kind We have the `clang -cc1` command-line option `-funwind-tables=1\|2` and the codegen option `VALUE_CODEGENOPT(UnwindTables, 2, 0) ///< Unwind tables (1) or asynchronous unwind tables (2)`. However, this is encoded in LLVM IR by the presence or the absence of the `uwtable` attribute, i.e. we lose the information whether to generate want just some unwind tables or asynchronous unwind tables. Asynchronous unwind tables take more space in the runtime image, I'd estimate something like 80-90% more, as the difference is adding roughly the same number of CFI directives as for prologues, only a bit simpler (e.g. `.cfi_offset reg, off` vs. `.cfi_restore reg`). Or even more, if you consider tail duplication of epilogue blocks. Asynchronous unwind tables could also restrict code generation to having only a finite number of frame pointer adjustments (an example of not having a finite number of `SP` adjustments is on AArch64 when untagging the stack (MTE) in some cases the compiler can modify `SP` in a loop). Having the CFI precise up to an instruction generally also means one cannot bundle together CFI instructions once the prologue is done, they need to be interspersed with ordinary instructions, which means extra `DW_CFA_advance_loc` commands, further increasing the unwind tables size. That is to say, async unwind tables impose a non-negligible overhead, yet for the most common use cases (like C++ exceptions), they are not even needed. This patch extends the `uwtable` attribute with an optional value: - `uwtable` (default to `async`) - `uwtable(sync)`, synchronous unwind tables - `uwtable(async)`, asynchronous (instruction precise) unwind tables Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D114543	2022-02-14 14:35:02 +00:00
Dmitry Vassiliev	d97d4d8d75	[NFC][IR] Value: assert this->takeName(this) Need to add an assert about this->takeName(this). This restriction is already documented, so this is just an NFC check. Without this assertion (as prescribed by original comments for this API), name deletion or down-stream assert failures may occur in other routines: e.g. at the beginning of replaceAllUsesWith() below. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D119636	2022-02-13 21:47:37 +03:00
YASHASVI KHATAVKAR	70fdbf35de	Adding DiBuilder interface for assumed length strings	2022-02-11 14:40:02 -05:00
Julien Pages	dcb2da13f1	[AMDGPU] Add a new intrinsic to control fp_trunc rounding mode Add a new llvm.fptrunc.round intrinsic to precisely control the rounding mode when converting from f32 to f16. Differential Revision: https://reviews.llvm.org/D110579	2022-02-11 12:08:23 -05:00
Nikita Popov	8f1350e03a	[IR] Check GEP source type when comparing instructions Two GEPs with same indices but different source type are not the same. Worth noting that FunctionComparator already handles this correctly.	2022-02-11 12:32:04 +01:00
YASHASVI KHATAVKAR	93d1a623ce	Reverting an entire stack of changes causing build failures	2022-02-10 17:58:22 -05:00
YASHASVI KHATAVKAR	e4f9d4a5ee	updated local branch to incorporate latest changes	2022-02-10 15:24:51 -05:00
YASHASVI KHATAVKAR	0e7341b7b1	worked on review comments	2022-02-10 15:24:51 -05:00
YASHASVI KHATAVKAR	929499eb64	Updated the test to include addtional details	2022-02-10 15:24:50 -05:00
YASHASVI KHATAVKAR	99f990be64	Added StringLocationExp to the new apis	2022-02-10 15:24:50 -05:00
YASHASVI KHATAVKAR	2c5dfeed2f	Addressed review comments	2022-02-10 15:24:50 -05:00
YASHASVI KHATAVKAR	43d421cda3	Adding DIBuilder interface for assumed length string	2022-02-10 15:24:50 -05:00
Nikita Popov	48eeefe59f	[AutoUpgrade] Handle remangling upgrade for ptr.annotation The code assumed that the upgrade would happen due to the argument count changing from 4 to 5. However, a remangling upgrade is also possible here.	2022-02-08 16:52:05 +01:00
Nikita Popov	8398e61f93	[AutoUpgrade] Also upgrade intrinsics in invokes We currently don't have any specialized upgrades for intrinsics that can be used in invokes, but they can still be subject to a generic remangling upgrade. In particular, this happens when upgrading statepoint intrinsics under -opaque-pointers. This patch just changes the upgrade code to work on CallBase instead of CallInst in particular.	2022-02-08 15:59:52 +01:00
Kazu Hirata	3a3cb929ab	[llvm] Use = default (NFC)	2022-02-06 22:18:35 -08:00
Nikita Popov	8f8e13056a	[Verifier] Require elementtype on gc.statepoint intrinsics This enforces the requirement specified in D117890.	2022-02-04 14:29:53 +01:00
serge-sans-paille	ffe8720aa0	Reduce dependencies on llvm/BinaryFormat/Dwarf.h This header is very large (3M Lines once expended) and was included in location where dwarf-specific information were not needed. More specifically, this commit suppresses the dependencies on llvm/BinaryFormat/Dwarf.h in two headers: llvm/IR/IRBuilder.h and llvm/IR/DebugInfoMetadata.h. As these headers (esp. the former) are widely used, this has a decent impact on number of preprocessed lines generated during compilation of LLVM, as showcased below. This is achieved by moving some definitions back to the .cpp file, no performance impact implied[0]. As a consequence of that patch, downstream user may need to manually some extra files: llvm/IR/IRBuilder.h no longer includes llvm/BinaryFormat/Dwarf.h llvm/IR/DebugInfoMetadata.h no longer includes llvm/BinaryFormat/Dwarf.h In some situations, codes maybe relying on the fact that llvm/BinaryFormat/Dwarf.h was including llvm/ADT/Triple.h, this hidden dependency now needs to be explicit. $ clang++ -E -Iinclude -I../llvm/include ../llvm/lib/Transforms/Scalar/*.cpp -std=c++14 -fno-rtti -fno-exceptions \| wc -l after: 10978519 before: 11245451 Related Discourse thread: https://llvm.discourse.group/t/include-what-you-use-include-cleanup [0] https://llvm-compile-time-tracker.com/compare.php?from=fa7145dfbf94cb93b1c3e610582c495cb806569b&to=995d3e326ee1d9489145e20762c65465a9caeab4&stat=instructions Differential Revision: https://reviews.llvm.org/D118781	2022-02-04 11:44:03 +01:00
Nikita Popov	c680eeab30	[IRBuilder][RS4GC] Require FunctionCallee when creating statepoint This makes the statepoint methods in IRBuilder accept a FunctionCallee, which carries both the callee and function type. This is used to add the elementtype attribute to the statepoint call. RS4GC requires an additional tweak to actually preserve that attribute -- previously the attributes on the call were completely overwritten. Differential Revision: https://reviews.llvm.org/D118886	2022-02-04 09:47:32 +01:00
Alex Lorenz	116c1bea65	[clang][macho] add clang frontend support for emitting macho files with two build version load commands This patch extends clang frontend to add metadata that can be used to emit macho files with two build version load commands. It utilizes "darwin.target_variant.triple" and "darwin.target_variant.SDK Version" metadata names for that. MachO uses two build version load commands to represent an object file / binary that is targeting both the macOS target, and the Mac Catalyst target. At runtime, a dynamic library that supports both targets can be loaded from either a native macOS or a Mac Catalyst app on a macOS system. We want to add support to this to upstream to LLVM to be able to build compiler-rt for both targets, to finish the complete support for the Mac Catalyst platform, which is right now targetable by upstream clang, but the compiler-rt bits aren't supported because of the lack of this multiple build version support. Differential Revision: https://reviews.llvm.org/D115415	2022-02-02 08:30:39 -08:00
Nikita Popov	b82a3a8ef3	[IRBuilder] Reformat two functions (NFC) These were using 1-space indentation.	2022-02-02 17:09:23 +01:00
serge-sans-paille	fa7145dfbf	Add missing includes after LLVMCore header cleanup - conditionally include header only used for expensive check - have Core.h always include llvm-c/ErrorHandling.h	2022-02-02 07:51:13 +01:00
serge-sans-paille	e188aae406	Cleanup header dependencies in LLVMCore Based on the output of include-what-you-use. This is a big chunk of changes. It is very likely to break downstream code unless they took a lot of care in avoiding hidden ehader dependencies, something the LLVM codebase doesn't do that well :-/ I've tried to summarize the biggest change below: - llvm/include/llvm-c/Core.h: no longer includes llvm-c/ErrorHandling.h - llvm/IR/DIBuilder.h no longer includes llvm/IR/DebugInfo.h - llvm/IR/IRBuilder.h no longer includes llvm/IR/IntrinsicInst.h - llvm/IR/LLVMRemarkStreamer.h no longer includes llvm/Support/ToolOutputFile.h - llvm/IR/LegacyPassManager.h no longer include llvm/Pass.h - llvm/IR/Type.h no longer includes llvm/ADT/SmallPtrSet.h - llvm/IR/PassManager.h no longer includes llvm/Pass.h nor llvm/Support/Debug.h And the usual count of preprocessed lines: $ clang++ -E -Iinclude -I../llvm/include ../llvm/lib/IR/*.cpp -std=c++14 -fno-rtti -fno-exceptions \| wc -l before: 6400831 after: 6189948 200k lines less to process is no that bad ;-) Discourse thread on the topic: https://llvm.discourse.group/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D118652	2022-02-02 06:54:20 +01:00
Momchil Velikov	5a90b1e4e5	Save some `std::string` allocations/deallocations when formatting attributes (NFC) Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D118451	2022-01-31 12:13:50 +00:00
Ahmed Bougacha	634ca7349d	[ObjCARC] Require the function argument in the clang.arc.attachedcall bundle. Currently, the clang.arc.attachedcall bundle takes an optional function argument. Depending on whether the argument is present, calls with this bundle have the following semantics: - on x86, with the argument present, the call is lowered to: call _target mov rax, rdi call _objc_retainAutoreleasedReturnValue - on AArch64, without the argument, the call is lowered to: bl _target mov x29, x29 and the objc runtime call is expected to be emitted separately. That's because, on x86, the objc runtime checks for both the mov and the call on x86, and treats the combination as the ARC autorelease elision marker. But on AArch64, it only checks for the dedicated NOP marker, as that's historically been sufficiently unique. Thanks to that, the runtime call wasn't required to be adjacent to the NOP marker, so it wasn't emitted as part of the bundle sequence. This patch unifies both architectures: on AArch64, we now emit all 3 instructions for the bundle. This guarantees that the runtime call is adjacent to the marker in the sequence, and that's information the runtime can use to further optimize this. This helps simplify some of the handling, in particular BundledRetainClaimRVs, which no longer needs to know whether the bundle is sufficient or not: it now always should be. Note that this does not include an AutoUpgrade for the nullary bundles, as they are only produced in ObjCContract as part of the obj/asm emission pipeline, and are not expected to be in bitcode. Differential Revision: https://reviews.llvm.org/D118214	2022-01-28 12:41:45 -08:00
Nikita Popov	97916673d4	[IR] Support ifuncs in opaque pointer mode Relax the type assertion for opaque pointers, and enumerate the value type in TypeFinder and ValueEnumerator.	2022-01-27 13:01:33 +01:00
Nikita Popov	4d9f6ab305	[IR] Handle opaque pointers in PtrToArgument mangling It appears that this mangling type is currently unused. Make it compatible with opaque pointers in case it becomes used again...	2022-01-27 12:36:25 +01:00
Nikita Popov	0f0e699776	[ConstantFold] Disable gep of array bitcast fold with opaque pointers Once again, this fold is meaningless with opaque pointers, as there is no pointer element type to canonicalize. At some point, we may want to do GEP type canonicalizations.	2022-01-27 11:52:52 +01:00
Chih-Ping Chen	28bfa57a73	[DebugInfo] Add stringLocationExp field to DIStringType DIStringType is used to encode the debug info of a character object in Fortran. A Fortran deferred-length character object is typically implemented as a pair of the following two pieces of info: An address of the raw storage of the characters, and the length of the object. The stringLocationExp field contains the DIExpression to get to the raw storage. This patch also enables the emission of DW_AT_data_location attribute in a DW_TAG_string_type debug info entry based on stringLocationExp in DIStringType. A test is also added to ensure that the bitcode reader is backward compatible with the old DIStringType format. Differential Revision: https://reviews.llvm.org/D117586	2022-01-26 11:56:57 -05:00
Benjamin Kramer	f15014ff54	Revert "Rename llvm::array_lengthof into llvm::size to match std::size from C++17" This reverts commit `ef82063207`. - It conflicts with the existing llvm::size in STLExtras, which will now never be called. - Calling it without llvm:: breaks C++17 compat	2022-01-26 16:55:53 +01:00
serge-sans-paille	ef82063207	Rename llvm::array_lengthof into llvm::size to match std::size from C++17 As a conquence move llvm::array_lengthof from STLExtras.h to STLForwardCompat.h (which is included by STLExtras.h so no build breakage expected).	2022-01-26 16:17:45 +01:00
Nikita Popov	d8962b4139	[llvm-c] Deprecate LLVMBuildPtrDiff() In favor of LLVMBuildPtrDiff2(), which accepts an explicit element type and is compatible with opaque pointers.	2022-01-25 12:47:50 +01:00
Nikita Popov	30d4a7e295	[IRBuilder] Require explicit element type in CreatePtrDiff() For opaque pointer compatibility, we cannot derive the element type from the pointer type.	2022-01-25 12:43:57 +01:00
Nikita Popov	aa97bc116d	[NFC] Remove uses of PointerType::getElementType() Instead use either Type::getPointerElementType() or Type::getNonOpaquePointerElementType(). This is part of D117885, in preparation for deprecating the API.	2022-01-25 09:44:52 +01:00
Stephen Tozer	ea17d29a6c	[llvm] Do not replace dead constant references in metadata with undef This patch removes an incorrect behaviour in Constants.cpp, which would replace dead constant references in metadata with an undef value. This blanket replacement resulted in undef values being inserted into metadata that would not accept them. The replacement was intended for debug info metadata, but this is now instead handled in the RAUW handler. Differential Revision: https://reviews.llvm.org/D117300	2022-01-24 17:36:33 +00:00
Nikita Popov	d29e319263	[OpaquePtrs] Add getNonOpaquePointerElementType() method (NFC) This method is intended for use in places that cannot be reached with opaque pointers, or part of deprecated methods. This makes it easier to see that some uses of getPointerElementType() don't need further action. Differential Revision: https://reviews.llvm.org/D117870	2022-01-24 10:03:49 +01:00
Phoebe Wang	37d1d02200	[X86][MS] Change the alignment of f80 to 16 bytes on Windows 32bits to match with ICC MSVC currently doesn't support 80 bits long double. ICC supports it when the option `/Qlong-double` is specified. Changing the alignment of f80 to 16 bytes so that we can be compatible with ICC's option. Reviewed By: rnk, craig.topper Differential Revision: https://reviews.llvm.org/D115942	2022-01-23 09:58:46 +08:00
Adrian Prantl	24bc072edb	Fix modules build by moving implementation into .cpp file	2022-01-19 15:33:59 -08:00
Jakob Bornecrantz	bfed654e98	[LLVM-C] Use NameLen in LLVMGetNamedGlobalAlias I tried to look over the file and didn't see any other non-use of *Len variables. Reviewed By: deadalnix Differential Revision: https://reviews.llvm.org/D116482	2022-01-19 08:58:57 -08:00
Nikita Popov	42a68215a1	[AttrBuilder] Change storage to sorted vector (NFC) This follows up on the work in D116599, which changed AttrBuilder to store string attributes as SmallVector<Attribute>. This patch changes the implementation to store all attributes as a sorted vector. This both makes the implementation simpler and improves compile-time. We get a -0.5% geomean compile-time improvement on CTMark at O0. Differential Revision: https://reviews.llvm.org/D117558	2022-01-19 12:29:04 +01:00
Nikita Popov	da61cb019e	[Attributes] Make attribute addition behavior consistent Currently, the behavior when adding an attribute with the same key as an existing attribute is inconsistent, depending on the type of the attribute and the method used to add it. When going through AttrBuilder::addAttribute(), the new attribute always overwrites the old one. When going through AttrBuilder::merge() the new attribute overwrites the existing one if it is a string attribute, but keeps the existing one for int and type attributes. One particular API also asserts that you can't overwrite an align attribute, but does not handle any of the other int, type or string attributes. This patch makes the behavior consistent by always overwriting with the new attribute, which is the behavior I would intuitively expect. Two tests are affected, which now make a different (but equally valid) choice. Those tests could be improved by taking the maximum deref bytes, but I haven't bothered with that, since this is testing a degenerate case -- the important bit is that it doesn't crash. Differential Revision: https://reviews.llvm.org/D117552	2022-01-19 12:05:27 +01:00
Nikita Popov	ed0cdb2939	[Constants] Remove unused isGEPWithNoNotionalOverIndexing() method Since `d56b0ad441`, this method is no longer used -- and shouldn't be used.	2022-01-19 11:36:40 +01:00
Michael Gottesman	7ed95d1577	[debug-info] Add support for llvm.dbg.addr in DIBuilder. I based this off of the API already create for llvm.dbg.value since both intrinsics have the same arguments at the API level. I added some tests exercising the API a little as well as an additional small test that shows how one can use llvm.dbg.addr to limit the PC range where an address value is available in the debugger. This is done by calling llvm.dbg.value with undef and the same metadata info as one used to create the llvm.dbg.addr. rdar://83957028 Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D117442	2022-01-18 18:26:50 -08:00
Ellis Hoag	5b9358d774	[InstrProf][NFC] Add InstrProfInstBase base The `InstrProfInstBase` class is for all `llvm.instrprof.*` intrinsics. In a later diff we will add new instrinsic of this type. Also refactor some logic in `InstrProfiling.cpp`. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D117261	2022-01-18 11:12:00 -08:00
Matt Arsenault	82de129ab8	AMDGPU: Remove llvm.amdgcn.alignbit and handle bitcode upgrade to fshr	2022-01-18 14:08:36 -05:00
Nikita Popov	541322540e	[AttrBuilder] Add string attribute getter (NFC) This avoids the need to scan through td_attrs() in AutoUpgrade, decoupling it from AttrBuilder implementation details.	2022-01-18 12:20:30 +01:00
Nikita Popov	0d7fbb0737	[AttrBuilder] Remove unused removeAttributes() overload The idiomatic way would be to call remove() with an AttributeMask constructed from an AttributeSet.	2022-01-16 21:32:54 +01:00
Nikita Popov	7cbbef5bbc	[AttrBuilder] Remove unused hasAttributes() overload This is unused, and doesn't make a lot of sense as an API. The usual pattern would be to combine the AttrBuilder(AttributeSet) constructor with the overlaps() method.	2022-01-16 21:00:18 +01:00
Nikita Popov	c63a3175c2	[AttrBuilder] Remove ctor accepting AttributeList and Index Use the AttributeSet constructor instead. There's no good reason why AttrBuilder itself should exact the AttributeSet from the AttributeList. Moving this out of the AttrBuilder generally results in cleaner code.	2022-01-15 22:39:31 +01:00
Florian Hahn	ba3198cfd1	[IRBuilder] Migrate select-folding to value-based FoldSelect. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D117228	2022-01-15 11:26:44 +00:00
Phoebe Wang	f63a805a4e	Revert "[X86][MS] Change the alignment of f80 to 16 bytes on Windows 32bits to match with ICC" This reverts commit `1bb0caf561`.	2022-01-15 10:54:38 +08:00
Nikita Popov	ed30a968b5	[Verifier] Avoid asserting on invalid cleanuppad chain The invalid undef value already triggers a verifier failure, but then the upwards scan from the cleanuppad ends up asserting. Make sure this is handled gacefully instead.	2022-01-14 12:10:41 +01:00
Fangrui Song	bc56097817	[GlobalValue] Make dso_local function work with comdat nodeduplicate This fixes -fno-semantic-interposition -fsanitize-coverage incompatibility. -fPIC -fno-semantic-interposition may add dso_local to an external linkage function. -fsanitize-coverage instrumentation does not clear dso_local when adding comdat nodeduplicate. This causes a compatibility issue: the function symbol may be referenced by a PC-relative relocation without using the local alias. In -shared mode, ld will report a relocation error. The fix is to either clear dso_local when adding comdat nodeduplicate, or supporting comdat nodeduplicate. The latter is more appropriate, because a comdat nodeduplicate is like not using comdat. Note: The comdat condition was originally added by D77429 to not use local alias for a hidden external linkage function in a deduplicate comdat. The condition has been unused since the code was refactored to only use local alias for default visibility symbols. Note: `canBenefitFromLocalAlias` is used by clang/lib/CodeGen/CodeGenModule.cpp and we don't want to add dso_local to default visibility external linkage comdat any (clang/test/CodeGenCUDA/usual-deallocators.cu). Differential Revision: https://reviews.llvm.org/D117190	2022-01-13 16:37:14 -08:00
Arthur Eubanks	757e044dce	[Inliner] Don't removeDeadConstantUsers() when checking if a function is dead If a function has many uses, this can take a good chunk of compile times. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D117236	2022-01-13 14:29:45 -08:00
Hans Wennborg	2bc57d85eb	Don't override __attribute__((no_stack_protector)) by inlining (PR52886) Since `26c6a3e736`, LLVM's inliner will "upgrade" the caller's stack protector attribute based on the callee. This lead to surprising results with Clang's no_stack_protector attribute added in `4fbf84c173` (D46300). Consider the following code compiled with clang -fstack-protector-strong -Os (https://godbolt.org/z/7s3rW7a1q). extern void h(int* p); inline __attribute__((always_inline)) int g() { return 0; } int __attribute__((__no_stack_protector__)) f() { int a[1]; h(a); return g(); } LLVM will inline g() into f(), and f() would get a stack protector, against the users explicit wishes, potentially breaking the program e.g. if h() changes the value of the stack cookie. That's a miscompile. More recently, `bc044a88ee` (D91816) addressed this problem by preventing inlining when the stack protector is disabled in the caller and enabled in the callee or vice versa. However, the problem remained if the callee is marked always_inline as in the example above. This affected users, see e.g. http://crbug.com/1274129 and http://llvm.org/pr52886. One way to fix this would be to prevent inlining also in the always_inline case. Despite the name, always_inline does not guarantee inlining, so this would be legal but potentially surprising to users. However, I think the better fix is to not enable the stack protector in a caller based on the callee. The motivation for the old behaviour is unclear, it seems counter-intuitive, and causes real problems as we've seen. This commit implements that fix, which means in the example above, g() gets inlined into f() (also without always_inline), and f() is emitted without stack protector. I think that matches most developers' expectations, and that's also what GCC does. Another effect of this change is that a no_stack_protector function can now be inlined into a stack protected function, e.g. (https://godbolt.org/z/hafP6W856): extern void h(int* p); inline int __attribute__((__no_stack_protector__)) __attribute__((always_inline)) g() { return 0; } int f() { int a[1]; h(a); return g(); } I think that's fine. Such code would be unusual since no_stack_protector is normally applied to a program entry point which sets up the stack canary. And even if such code exists, inlining doesn't change the semantics: there is still no stack cookie setup/check around entry/exit of the g() code region, but there may be in the surrounding context, as there was before inlining. This also matches GCC. See also the discussion at https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94722 Differential revision: https://reviews.llvm.org/D116589	2022-01-13 12:04:49 +01:00
Simon Moll	33efbc8184	[VP] llvm.vp.merge intrinsic and LangRef llvm.vp.merge interprets the %evl operand differently than the other vp intrinsics: all lanes at positions greater or equal than the %evl operand are passed through from the second vector input. Otherwise it behaves like llvm.vp.select. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D116725	2022-01-12 14:06:56 +01:00
Phoebe Wang	1bb0caf561	[X86][MS] Change the alignment of f80 to 16 bytes on Windows 32bits to match with ICC MSVC currently doesn't support 80 bits long double. ICC supports it when the option `/Qlong-double` is specified. Changing the alignment of f80 to 16 bytes so that we can be compatible with ICC's option. Reviewed By: rnk, craig.topper Differential Revision: https://reviews.llvm.org/D115942	2022-01-12 17:50:37 +08:00
David Sherwood	51497dc0b2	[IR] Change vector.splice intrinsic to reject out-of-bounds indices I've changed the definition of the experimental.vector.splice instrinsic to reject indices that are known to be or possibly out-of-bounds. In practice, this means changing the definition so that the index is now only valid in the range [-VL, VL-1] where VL is the known minimum vector length. We use the vscale_range attribute to take the minimum vscale value into account so that we can permit more indices when the attribute is present. The splice intrinsic is currently only ever generated by the vectoriser, which will never attempt to splice vectors with out-of-bounds values. Changing the definition also makes things simpler for codegen since we can always assume that the index is valid. This patch was created in response to review comments on D115863 Differential Revision: https://reviews.llvm.org/D115933	2022-01-11 09:37:39 +00:00
Serge Guelton	d2cc6c2d0c	Use a sorted array instead of a map to store AttrBuilder string attributes Using and std::map<SmallString, SmallString> for target dependent attributes is inefficient: it makes its constructor slightly heavier, and involves extra allocation for each new string attribute. Storing the attribute key/value as strings implies extra allocation/copy step. Use a sorted vector instead. Given the low number of attributes generally involved, this is cheaper, as showcased by https://llvm-compile-time-tracker.com/compare.php?from=5de322295f4ade692dc4f1823ae4450ad3c48af2&to=05bc480bf641a9e3b466619af43a2d123ee3f71d&stat=instructions Differential Revision: https://reviews.llvm.org/D116599	2022-01-10 14:49:53 +01:00
Nikita Popov	2c0fb96254	[TypeFinder] Support opaque pointers We need to explicitly visit a number of types, as these are no longer reachable through the pointer type if opaque pointers are enabled. This is similar to ValueEnumerator changes that have been done previously.	2022-01-10 14:46:45 +01:00
Kazu Hirata	b932bdf59f	[llvm] Remove redundant member initialization (NFC) Identified with readability-redundant-member-init.	2022-01-07 17:45:09 -08:00
Nikita Popov	e4d1779990	[IR] Add ConstraintInfo::hasArg() helper (NFC) Checking whether a constraint corresponds to an argument is a recurring pattern.	2022-01-07 10:44:38 +01:00
Nikita Popov	bec726f5d2	[Verifier] Enforce elementtype attr for inline asm indirect constraints This enforces the LangRef change from D116531 in the Verifier, now that clang and tests have been updated.	2022-01-06 15:22:00 +01:00
Nikita Popov	c41aa41957	[ConstFold] Add missing check for inbounds gep If the gep is not inbounds, then the gep might compute a null value even if the base pointer is non-null.	2022-01-06 09:59:40 +01:00
Nikita Popov	32808cfb24	[IR] Track users of comdats Track all GlobalObjects that reference a given comdat, which allows determining whether a function in a comdat is dead without scanning the whole module. In particular, this makes filterDeadComdatFunctions() have complexity O(#DeadFunctions) rather than O(#SymbolsInModule), which addresses half of the compile-time issue exposed by D115545. Differential Revision: https://reviews.llvm.org/D115864	2022-01-06 09:13:58 +01:00
Luís Ferreira	34435fd105	[llvm] Add support for DW_TAG_immutable_type Added documentation about DW_TAG_immutable_type too. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D113633	2022-01-05 19:17:08 +00:00
Philip Reames	c16fd6a376	Rename doesNotReadMemory to onlyWritesMemory globally [NFC] The naming has come up as a source of confusion in several recent reviews. onlyWritesMemory is consist with onlyReadsMemory which we use for the corresponding readonly case as well.	2022-01-05 08:52:55 -08:00
Nikita Popov	6c031780aa	[ConstantFold] Remove another incorrect icmp of gep fold This folded (null + X) == g to false, but of course this is incorrect if X == g. Possibly this got confused with the null == g case, which is already handled elsewhere.	2022-01-04 16:08:09 +01:00
serge-sans-paille	9290ccc3c1	Introduce the AttributeMask class This class is solely used as a lightweight and clean way to build a set of attributes to be removed from an AttrBuilder. Previously AttrBuilder was used both for building and removing, which introduced odd situation like creation of Attribute with dummy value because the only relevant part was the attribute kind. Differential Revision: https://reviews.llvm.org/D116110	2022-01-04 15:37:46 +01:00
Nikita Popov	d74212987b	[ConstantFold] Remove unnecessary bounded index restriction The fold for merging a GEP of GEP into a single GEP currently bails if doing so would result in notional overindexing. The justification given in the comment above this check is dangerously incorrect: GEPs with notional overindexing are perfectly fine, and if some code treats them incorrectly, then that code is broken, not the GEP. Such a GEP might legally appear in source IR, so only preventing its creation cannot be sufficient. (The constant folder also ends up canonicalizing the GEP to remove the notional overindexing, but that's neither here nor there.) This check dates back to `bd4fef4a89`, and as far as I can tell the original issue this was trying to patch around has since been resolved. Differential Revision: https://reviews.llvm.org/D116587	2022-01-04 15:23:09 +01:00
Nikita Popov	1379eb5776	[ConstFold] Slightly clean up icmp of two geps fold (NFC) As we're only dealing with one type of constant expression here, try to directly cast to GEPOperator.	2022-01-04 12:33:38 +01:00
Nikita Popov	75db002725	[ConstantFold] Remove another incorrect icmp of GEP fold This fold is not correct, because indices might evaluate to zero even if they are not a literal zero integer. Additionally, this fold would be wrong (in the general case) for non-i8 types as well, due to index overflow. Drop this fold and instead let the target-dependent constant folder compute the actual offset and fold the comparison based on that.	2022-01-04 12:27:40 +01:00
Nikita Popov	8484bab9cd	[LangRef] Require elementtype attribute for indirect inline asm operands Indirect inline asm operands may require the materialization of a memory access according to the pointer element type. As this will no longer be available with opaque pointers, we require it to be explicitly annotated using the elementtype attribute, for example: define void @test(i32* %p, i32 %x) { call void asm "addl $1, $0", "=rm,r"(i32 elementtype(i32) %p, i32 %x) ret void } This patch only includes the LangRef change and Verifier updates to allow adding the elementtype attribute in this position. It does not yet enforce this, as this will require changes on the clang side (and test updates) first. Something I'm a bit unsure about is whether we really need the elementtype for all indirect constraints, rather than only indirect register constraints. I think indirect memory constraints might not strictly need it (though the backend code is written in a way that does require it). I think it's okay to just make this a general requirement though, as this means we don't need to carefully deal with multiple or alternative constraints. In addition, I believe that MemorySanitizer benefits from having the element type even in cases where it may not be strictly necessary for normal lowering (`cd2b050fa4/llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp (L4066)`). Differential Revision: https://reviews.llvm.org/D116531	2022-01-04 10:02:06 +01:00
Kazu Hirata	e5947760c2	Revert "[llvm] Remove redundant member initialization (NFC)" This reverts commit `fd4808887e`. This patch causes gcc to issue a lot of warnings like: warning: base class ‘class llvm::MCParsedAsmOperand’ should be explicitly initialized in the copy constructor [-Wextra]	2022-01-03 11:28:47 -08:00
Fraser Cormack	d762794040	[IR] Allow the 'align' param attr on vectors of pointers This patch extends the available uses of the 'align' parameter attribute to include vectors of pointers. The attribute specifies pointer alignment element-wise. This change was previously requested and discussed in D87304. The vector predication (VP) intrinsics intend to use this for scatter and gather operations, as they lack the explicit alignment parameter that the masked versions use. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D115161	2022-01-03 12:32:46 +00:00
Nikita Popov	127d955441	[ConstantFold] Drop unused function (NFC) isMaybeZeroSizeType() is no longer used after `5afbfe33e7`.	2022-01-03 10:14:52 +01:00
Nikita Popov	5afbfe33e7	[ConstantFold] Make icmp of gep fold offset based We can fold an equality or unsigned icmp between base+offset1 and base+offset2 with inbounds offsets by comparing the offsets directly. This replaces a pair of specialized folds that tried to reason based on the GEP structure instead. One of those folds was plain wrong (because it does not account for negative offsets), while the other is unnecessarily complicated and limited (e.g. it will fail with bitcasts involved). The disadvantage of this change is that it requires data layout, so the fold is no longer performed by datalayout-independent constant folding. I don't think this is a loss in practice, but it does regress the ConstantExprFold.ll test, which checks folding without running any passes. Differential Revision: https://reviews.llvm.org/D116332	2022-01-03 09:41:37 +01:00
Kazu Hirata	fd4808887e	[llvm] Remove redundant member initialization (NFC) Identified with readability-redundant-member-init.	2022-01-01 16:18:18 -08:00
Serge Pavlov	ecfd9196d5	[ConstantFolding] Use ICmpInst::Predicate instead of plain integer The function `ConstantFoldCompareInstruction` uses `unsigned short` to represent compare predicate, although all usesrs of the respective include file use definition of CmpInst also. This change replaces predicate argument type in this function to `ICmpInst::Predicate`, which allows to make code a bit clearer and simpler. No functional changes. Differential Revision: https://reviews.llvm.org/D116379	2021-12-30 14:31:44 +07:00
Kazu Hirata	5a667c0e74	[llvm] Use nullptr instead of 0 (NFC) Identified with modernize-use-nullptr.	2021-12-28 08:52:25 -08:00
Nikita Popov	23de66d163	[ConstFold] Don't fold signed comparison of gep of global An inbounds GEP may still cross the sign boundary, so signed icmps cannot be folded (https://alive2.llvm.org/ce/z/XSgi4D). This was previously fixed for other folds in this function, but this one was missed.	2021-12-28 14:13:33 +01:00
Shao-Ce SUN	ec501f15a8	[clang][CodeGen] Remove the signed version of createExpression Fix a TODO. Remove the callers of this signed version and delete. Reviewed By: CodaFi Differential Revision: https://reviews.llvm.org/D116014	2021-12-27 14:16:08 +08:00
Serge Pavlov	d86e2cc2e3	[NFC] Method for evaluation of FCmpInst for constant operands New method `FCmpInst::compare` is added, which evaluates the given compare predicate for constant operands. Interface is made similar to `ICmpInst::compare`. Differential Revision: https://reviews.llvm.org/D116168	2021-12-25 17:37:38 +07:00
Kazu Hirata	2d303e6781	Remove redundant return and continue statements (NFC) Identified with readability-redundant-control-flow.	2021-12-24 23:17:54 -08:00
Kazu Hirata	9c0a4227a9	Use Optional::getValueOr (NFC)	2021-12-24 20:57:40 -08:00
Florian Hahn	5d68dc184e	[Verifier] Iteratively traverse all indirect users. The recursive implementation can run into stack overflows, e.g. like in PR52844. The order the users are visited changes, but for the current use case this only impacts the order error messages are emitted.	2021-12-23 23:20:12 +01:00
Kazu Hirata	500c4b68dc	[llvm] Construct SmallVector with iterator ranges (NFC)	2021-12-20 23:43:24 -08:00
Sami Tolvanen	5dc8aaac39	[llvm][IR] Add no_cfi constant With Control-Flow Integrity (CFI), the LowerTypeTests pass replaces function references with CFI jump table references, which is a problem for low-level code that needs the address of the actual function body. For example, in the Linux kernel, the code that sets up interrupt handlers needs to take the address of the interrupt handler function instead of the CFI jump table, as the jump table may not even be mapped into memory when an interrupt is triggered. This change adds the no_cfi constant type, which wraps function references in a value that LowerTypeTestsModule::replaceCfiUses does not replace. Link: https://github.com/ClangBuiltLinux/linux/issues/1353 Reviewed By: nickdesaulniers, pcc Differential Revision: https://reviews.llvm.org/D108478	2021-12-20 12:55:32 -08:00
Serge Guelton	9cd55c7c34	Prevent copy of AttrBuilder It's a relatively heavy data structure, make sure it's not copied. Differential Revision: https://reviews.llvm.org/D116034	2021-12-20 10:33:32 -05:00
Nikita Popov	6e30cb7673	[Attributes] Add AttributeList ctor from AttributeSet (NFC) It was already possible to create an AttributeList from an Index and an AttributeSet. However, this would actually end up using the implicit constructor on AttrBuilder, thus doing an unnecessary conversion from AttributeSet to AttrBuilder to AttributeSet. Instead we can accept the AttributeSet directly, as that is what we need anyway.	2021-12-20 11:37:01 +01:00
Nikita Popov	65777addbd	[llvm-c] Accept GEP operators in some APIs As requested in D115787, I've added a test for LLVMConstGEP2 and LLVMConstInBoundsGEP2. However, to make this work in the echo test, I also had to change a couple of APIs to work on GEP operators, rather than only GEP instructions. Differential Revision: https://reviews.llvm.org/D115858	2021-12-17 08:54:18 +01:00
Nikita Popov	68cb111f3a	[llvm-c] Make LLVMConstGEP/LLVMConstInBoundsGEP opaque pointer compatible Weirdly, the opaque pointer compatible variants LLVMConstGEP2 and LLVMConstInBoundsGEP2 were already declared in the header, but not actually implemented. This adds the missing implementations and deprecates the incompatible functions. Differential Revision: https://reviews.llvm.org/D115787	2021-12-16 09:38:52 +01:00
Yuanfang Chen	ebf65d4842	[Verifier] Make error message precise about which variable is being diagnosed. NFCI.	2021-12-15 16:05:31 -08:00
Arthur Eubanks	5a81a60391	[NFC] Remove more calls to getAlignment() These are deprecated and should be replaced with getAlign(). Some of these asserts don't do anything because Load/Store/AllocaInst never have a 0 align value.	2021-12-15 14:40:57 -08:00
Mingming Liu	09a704c5ef	[LTO] Ignore unreachable virtual functions in WPD in hybrid LTO. Differential Revision: https://reviews.llvm.org/D115492	2021-12-14 20:18:04 +00:00
Philip Reames	423f19680a	Add FMF to hasPoisonGeneratingFlags/dropPoisonGeneratingFlags These flags are documented as generating poison values for particular input values. As such, we should really be consistent about their handling with how we handle nsw/nuw/exact/inbounds. Differential Revision: https://reviews.llvm.org/D115460	2021-12-14 08:43:00 -08:00
Nikita Popov	6213f1dd03	[IR] Make VPIntrinsic::getDeclarationForParams() opaque pointer compatible The vp.load and vp.gather intrinsics require the intrinsic return type to determine the correct function signature. With opaque pointers, it cannot be derived from the parameter pointee types. Differential Revision: https://reviews.llvm.org/D115632	2021-12-14 14:20:59 +01:00
Augie Fackler	b575405cc3	Verifier: accept enums as scopes Rust allows enums to be scopes, as shown by the previous change. Sadly, D111770 disallowed enums-as-scopes in the LLVM Verifier, which means that LLVM HEAD stopped working for Rust compiles. As a result, we back out the verifier part of D111770 with a modification to the testcase so we don't break this in the future. The testcase is now actual IR from rustc at commit 8f8092cc3, which is the nightly as of 2021-09-28. I would expect rustc 1.57 to produce similar or identical IR if someone wants to reproduce this IR in the future with minimal changes. A recipe for reproducing the IR using rustc is included in the test file. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D115353	2021-12-10 12:19:56 -08:00
Nikita Popov	1d1e29ba6c	[IR] Extract method to get single GEP index from offset (NFC) This exposes the core logic of getGEPIndicesForOffset() as a getGEPIndexForOffset() method that only returns a single offset, instead of following the whole chain.	2021-12-10 17:22:46 +01:00
Sameer Sahasrabuddhe	1d0244aed7	Reapply CycleInfo: Introduce cycles as a generalization of loops Reverts `02940d6d22`. Fixes breakage in the modules build. LLVM loops cannot represent irreducible structures in the CFG. This change introduce the concept of cycles as a generalization of loops, along with a CycleInfo analysis that discovers a nested hierarchy of such cycles. This is based on Havlak (1997), Nesting of Reducible and Irreducible Loops. The cycle analysis is implemented as a generic template and then instatiated for LLVM IR and Machine IR. The template relies on a new GenericSSAContext template which must be specialized when used for each IR. This review is a restart of an older review request: https://reviews.llvm.org/D83094 Original implementation by Nicolai Hähnle <nicolai.haehnle@amd.com>, with recent refactoring by Sameer Sahasrabuddhe <sameer.sahasrabuddhe@amd.com> Differential Revision: https://reviews.llvm.org/D112696	2021-12-10 14:36:43 +05:30
Arthur Eubanks	f5687e0fd0	[NFC] Use getAlign() instead of getAlignment() in haveSameSpecialState() getAlignment() is deprecated.	2021-12-09 13:19:42 -08:00
Kazu Hirata	ccdd5bb2c2	[llvm] Use range-based for loops (NFC)	2021-12-09 09:37:29 -08:00
Arthur Eubanks	1172712f46	[NFC] Replace some deprecated getAlignment() calls with getAlign() Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D115370	2021-12-09 08:43:19 -08:00
Arthur Eubanks	cd11312607	[NFC][Verifier] Remove checks for atomic loads/stores that alignment is non-zero The alignment is never 0 since getAlign() returns 1 << bits. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D115388	2021-12-08 23:17:08 -08:00
Kazu Hirata	c23ebf1714	[llvm] Use range-based for loops (NFC)	2021-12-08 20:35:39 -08:00
Stephen Neuendorffer	0fcb16eeb2	Allow DataLayout to support arbitrary pointer sizes Currently, it is impossible to specify a DataLayout with pointer size and index size that is not a whole number of bytes. This patch modifies the DataLayout class to accept arbitrary pointer sizes and to store the size as a number of bits, rather than as a number of bytes. Generally speaking, the external interface of the class as used by in-tree architectures remains the same and shouldn't affect the behavior of architecures with pointer sizes equal to a whole number of bytes. Note the interface of setPointerAlignment has changed and takes a pointer and index size that is a number of bits, rather than a number of bytes. Patch originally by Ajit Kumar Agarwal Differential Revision: https://reviews.llvm.org/D114141	2021-12-07 23:20:17 -08:00
Alex Lorenz	0756aa3978	[macho] add support for emitting macho files with two build version load commands This patch extends LLVM IR to add metadata that can be used to emit macho files with two build version load commands. It utilizes "darwin.target_variant.triple" and "darwin.target_variant.SDK Version" metadata names for that, which will be set by a future patch in clang. MachO uses two build version load commands to represent an object file / binary that is targeting both the macOS target, and the Mac Catalyst target. At runtime, a dynamic library that supports both targets can be loaded from either a native macOS or a Mac Catalyst app on a macOS system. We want to add support to this to upstream to LLVM to be able to build compiler-rt for both targets, to finish the complete support for the Mac Catalyst platform, which is right now targetable by upstream clang, but the compiler-rt bits aren't supported because of the lack of this multiple build version support. Differential Revision: https://reviews.llvm.org/D112189	2021-12-07 18:17:47 -08:00
Jonas Devlieghere	02940d6d22	Revert "CycleInfo: Introduce cycles as a generalization of loops" This reverts commit `0fe61ecc2c` because it breaks the modules build. https://green.lab.llvm.org/green/job/clang-stage2-rthinlto/4858/ https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/39112/	2021-12-07 13:06:34 -08:00
Cullen Rhodes	0395e01583	[IR] Split vscale_range interface Interface is split from: std::pair<unsigned, unsigned> getVScaleRangeArgs() into separate functions for min/max: unsigned getVScaleRangeMin(); Optional<unsigned> getVScaleRangeMax(); Reviewed By: sdesmalen, paulwalker-arm Differential Revision: https://reviews.llvm.org/D114075	2021-12-07 10:38:26 +00:00

... 2 3 4 5 6 ...

5441 Commits