llvm-project

Commit Graph

Author	SHA1	Message	Date
Juan Manuel MARTINEZ CAAMAÑO	5cf6fbc5a0	[NFC][ASAN][AMDGPU] Use CreateNot/CreateIsNotNull instead of more verbose CreateICMP methods Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D138706	2022-11-29 06:36:27 -06:00
bipmis	e9393789a9	[AggressiveInstCombine] Handle the insert point of the merged load correctly. This patch updates the load insert point of the merged load in AggressiveInstCombine(). This is done to handle the reported test breaks by handling Alias Analysis correctly. Differential Revision: https://reviews.llvm.org/D137201	2022-11-29 10:53:51 +00:00
Stanislav Mekhanoshin	c46634554d	[LoadStoreVectorizer] Consider if operation is faster than before Compare a relative speed of misaligned accesses before and after vectorization, not just check the new instruction is not going to be slower. Since no target now returns anything but 0 or 1 for Fast argument of the allowsMisalignedMemoryAccesses this is still NFCI. The subsequent patch will tune actual vaues of Fast on AMDGPU. Differential Revision: https://reviews.llvm.org/D124218	2022-11-28 15:52:32 -08:00
Florian Hahn	bf15f1e489	Revert "[VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe." This reverts commit `0fa666eced`. This triggers an assertion during AArch64 stage2 builds. Revert while I investigate. See https://lab.llvm.org/buildbot/#/builders/179/builds/4967/steps/11/logs/stdio	2022-11-28 22:43:11 +00:00
Sanjay Patel	a00936484b	[InstCombine] improve readability of combineLoadToOperationType(); NFC	2022-11-28 16:00:06 -05:00
Matt Arsenault	a2f9ca8875	Utils: Use StringRef and rename variable for clarity	2022-11-28 15:25:45 -05:00
Arthur Eubanks	7ae6838def	[LegacyPM] Remove pipeline extension mechanism Part of gradually removing the legacy PM optimization pipeline. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D136622	2022-11-28 12:23:15 -08:00
Florian Hahn	6a834d2f2b	[ConstraintElim] Use collectOffset instead of custom GEP idx iteration. Use collectOffset to collect scaled indices and constant offset for GEP instead of custom code. This simplifies the logic in decomposeGEP and allows to handle all cases supported by the generic helper.	2022-11-28 16:54:58 +00:00
Florian Hahn	0fa666eced	[VPlan] Add VPDerivedIVRecipe, use for VPScalarIVStepsRecipe. This patch splits off the logic to transform the canonical IV to a a value for an induction with a different start and step. This transformation only needs to be done once (independent of VF/UF) and enables sinking of VPScalarIVStepsRecipe as follow-up. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D133758	2022-11-28 16:32:31 +00:00
OCHyams	9517806064	Revert "[Assignment Tracking][13/*] Account for assignment tracking in SROA" This reverts commit `e16d59973f`. Buildbot failure: https://lab.llvm.org/buildbot/#/builders/236/builds/1205	2022-11-28 16:07:34 +00:00
OCHyams	5e0b29bf23	Revert "[Assignment Tracking][SROA] Follow-up for failing test" This reverts commit `285d46ef4b`. Failing buildbot: https://lab.llvm.org/buildbot/#/builders/236/builds/1205	2022-11-28 16:07:34 +00:00
Qiongsi Wu	f946c70130	[SLPVectorizer] Do Not Move Loads/Stores Beyond Stacksave/Stackrestore Boundaries If left unchecked, the SLPVecrtorizer can move loads/stores below a stackrestore. The move can cause issues if the loads/stores have pointer operands from `alloca`s that are reset by the stackrestores. This patch adds the dependency check. The check is conservative, in that it does not check if the pointer operands of the loads/stores are actually from `alloca`s that may be reset. We did not observe any SPECCPU2017 performance degradation so this simple fix seems sufficient. The test could have been added to `llvm/test/Transforms/SLPVectorizer/X86/stacksave-dependence.ll`, but that test has not been updated to use opaque pointers. I am not inclined to add tests that still use typed pointers, or to refactor `llvm/test/Transforms/SLPVectorizer/X86/stacksave-dependence.ll` to use opaque pointers in this patch. If desired, I will open a different patch to refactor and consolidate the tests. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D138585	2022-11-28 10:00:29 -05:00
OCHyams	285d46ef4b	[Assignment Tracking][SROA] Follow-up for failing test Follow-up for D133296 / `e16d59973f` Buildbot example: https://lab.llvm.org/buildbot/#/builders/6/builds/16989	2022-11-28 12:16:37 +00:00
OCHyams	e16d59973f	[Assignment Tracking][13/*] Account for assignment tracking in SROA The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir Split dbg.assign intrinsics into fragments similarly to what SROA already does for dbg.declares, except that there's many more intrinsics to split. The function migrateDebugInfo generates new dbg.assigns intrinsic for each part of a split store. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D133296	2022-11-28 11:31:59 +00:00
Matt Arsenault	1ab6fa5152	HWAsan: Fix creating unnecessary bitcast with opaque pointers This was creating a ptr to ptr bitcast. Convert a relevant test to opaque pointers.	2022-11-27 21:24:24 -05:00
Matt Arsenault	d1c0092163	SeparateConstOffsetFromGEP: Fix creating pointless bitcasts This was directly creating new BitCastInsts, so under opaque pointers, would end up producing bitcast from ptr to ptr.	2022-11-27 20:53:48 -05:00
Matt Arsenault	16663d8ab3	IndirectCallPromotion: Fix illegal promotion with opaque pointers This was doing a type check on the argument types and skipping all other safety checks if they matched.	2022-11-27 09:39:42 -05:00
Kazu Hirata	41c638875e	[Vectorize] Use std::optional in VPlanSLP.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 18:11:32 -08:00
Kazu Hirata	5fc8f6c37c	[Vectorize] Use std::optional in SLPVectorizer.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 18:03:49 -08:00
Kazu Hirata	2d1b093a37	[Utils] Use std::optional in SimplifyCFG.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:55:33 -08:00
Kazu Hirata	88988c50f8	[Utils] Use std::optional in LoopPeel.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:53:17 -08:00
Kazu Hirata	10bab60ddb	[Utils] Use std::optional in Local.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:49:07 -08:00
Kazu Hirata	12c80ef3f5	[Utils] Use std::optional in CloneFunction.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:44:44 -08:00
Kazu Hirata	d90a14f87d	[Scalar] Use std::optional in SimpleLoopUnswitch.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:41:52 -08:00
Kazu Hirata	6288a09c94	[Scalar] Use std::optional in ScalarizeMaskedMemIntrin.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:39:36 -08:00
Kazu Hirata	85cee3a4ab	[Scalar] Use std::optional in PartiallyInlineLibCalls.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:36:26 -08:00
Kazu Hirata	60d644554b	[Scalar] Use std::optional in LowerConstantIntrinsics.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:33:11 -08:00
Kazu Hirata	3bb0c7075b	[Scalar] Use std::optional in LoopUnrollPass.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:29:32 -08:00
Kazu Hirata	f207cf1d93	[Scalar] Use std::optional in LoopStrengthReduce.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:26:57 -08:00
Kazu Hirata	42c40f7e35	[Scalar] Use std::optional in LoopSimplifyCFG.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:24:20 -08:00
Kazu Hirata	27ae2354a0	[Scalar] Use std::optional in LoopRotation.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:21:19 -08:00
Kazu Hirata	3e3be2955b	[Scalar] Use std::optional in LoopPredication.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:18:54 -08:00
Kazu Hirata	d3a58c48f4	[Scalar] Use std::optional in LoopInstSimplify.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 17:14:23 -08:00
Florian Hahn	bf0bd85f9d	[LV] Move trunc codegen to buildScalarSteps (NFCI). This moves the code to truncate step and IV into buildScalarSteps, closer to the place where they are actually used. Suggested in D133758.	2022-11-26 23:48:46 +00:00
Florian Hahn	12bb5535d2	[VPlan] Move cast codegen to emitTransformedIndex (NFCI). This reduces duplication a bit. Suggested as simplification in D133758.	2022-11-26 22:47:13 +00:00
Kazu Hirata	589725f6e8	[llvm] Use std::size (NFC) std::size, introduced in C++17, allows us to directly obtain the number of elements of an array.	2022-11-26 13:47:32 -08:00
Roman Lebedev	25f01d593c	Revert "[SROA] `isVectorPromotionViable()`: memory intrinsics operate on vectors of bytes (take 2)" TableGen is still getting miscompiled on PPC buildbots. Sent a mail with request for help. This reverts commit `3c4d2a0396`.	2022-11-27 00:00:06 +03:00
Roman Lebedev	3c4d2a0396	[SROA] `isVectorPromotionViable()`: memory intrinsics operate on vectors of bytes (take 2) This is a recommit of `cf624b23bc`, which was reverted in `5cfc22cafe`, because the cut-off on the number of vector elements was not low enough, and it triggered both SDAG SDNode operand number assertions, and caused compile time explosions in some cases. Let's try with something really REALLY conservative first, just to get somewhere, and try to bump it (to 64/128) later. FIXME: should this respect TTI reg width * num vec regs? Original commit message: Now, there's a big caveat here - these bytes are abstract bytes, not the i8 we have in LLVM, so strictly speaking this is not exactly legal, see e.g. https://github.com/AliveToolkit/alive2/issues/860 ^ the "bytes" "could" have been a pointer, and loading it as an integer inserts an implicit ptrtoint. But at the same time, InstCombine's `InstCombinerImpl::SimplifyAnyMemTransfer()` would expand a memtransfer of 1/2/4/8 bytes into integer-typed load+store, so this isn't exactly a new problem. Note that in memory, poison is byte-wise, so we really can't widen elements, but SROA seems to be inconsistent here. Fixes #59116.	2022-11-26 23:19:15 +03:00
Kazu Hirata	bba55813fc	[Scalar] Use std::optional in LoopFlatten.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 00:02:40 -08:00
Kazu Hirata	7fc772bbdc	[Scalar] Use std::optional in InductiveRangeCheckElimination.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:59:45 -08:00
Kazu Hirata	b54cd3f09c	[Scalar] Use std::optional in GVN.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:55:44 -08:00
Kazu Hirata	edace862f6	[Scalar] Use std::optional in CorrelatedValuePropagation.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:52:49 -08:00
Kazu Hirata	3da96e0361	[InstCombine] Use std::optional in InstructionCombining.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:49:50 -08:00
Kazu Hirata	881076cde2	[InstCombine] Use std::optional in InstCombinePHI.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:46:38 -08:00
Kazu Hirata	c7fa15c0f2	[IPO] Use std::optional in IROutliner.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:44:08 -08:00
Kazu Hirata	a5f8a36d02	[IPO] Use std::optional in GlobalOpt.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:38:32 -08:00
Kazu Hirata	ffba7ee778	[IPO] Use std::optional in Attributor.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:35:05 -08:00
Kazu Hirata	629819cb66	[IPO] Use std::optional in AttributorAttributes.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:23:40 -08:00
Kazu Hirata	8dd2416e44	[Coroutines] Use std::optional in CoroFrame.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:18:17 -08:00
Kazu Hirata	d7fdb5d87b	[Coroutines] Use std::optional in CoroElide.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:15:51 -08:00
Florian Hahn	ed2fdace89	[LV] Use separate index to access StoredValues in vectorizeInterleave. StoredValues only has entries for members of the interleave group. If there are gaps, then using the index i here will either access a wrong entry or be out-of-bounds. Instead use a dedicated index that only gets incremented for members of the interleave group. Fixes #59090.	2022-11-25 15:28:05 +00:00
Jamie Schmeiser	be1ff1fe58	[NFC] Refactor loop peeling code for calculating phi invariance. Summary: Refactor loop peeling code by moving code for calculating phi invariance into a separate class that does the calculation. Redescribe and rework the algorithm in preparation for adding increased functionality. Add test case that does not exhibit peeling that will be subsequently supported. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: mkazantsev (Max Kazantsev) Differential Revision: https://reviews.llvm.org/D138232	2022-11-25 09:07:14 -05:00
Matthias Gehre	5a1d92fa3e	[InstCombine] Update debug intrinsics when rewriting allocas	2022-11-25 08:20:54 +01:00
Fangrui Song	fa71c16455	[Inliner] Move cl::opt inside llvm::	2022-11-24 20:31:13 -08:00
Sanjay Patel	535c5d56a7	[InstCombine] ease restriction for extractelt (bitcast X) fold We were checking for a desirable integer type even when there is no shift in the transform. This is unnecessary since we are truncating directly to the destination type. This removes an extractelt in more cases and seems to make the canonicalization more uniform overall. There's still a potential difference between patterns that need a shift vs. trunc-only. I'm not sure if that is worth keeping at this point, but it can be adjusted in another step (assuming this change does not cause trouble). In the most basic case where I noticed this, we missed a fold that would have completely removed vector ops from a pattern like: https://alive2.llvm.org/ce/z/y4Qdte	2022-11-24 13:27:19 -05:00
Sanjay Patel	bf7f87e62c	[InstCombine] reduce code duplication in foldBitcastExtElt(); NFC	2022-11-24 10:16:37 -05:00
Guillaume Chatelet	e647b4f519	[reland][Alignment][NFC] Use the Align type in MCSection Differential Revision: https://reviews.llvm.org/D138653	2022-11-24 13:19:18 +00:00
Guillaume Chatelet	3467f9c7d6	Revert D138653 [Alignment][NFC] Use the Align type in MCSection" This breaks the bolt project. This reverts commit `409f0dc4a4`.	2022-11-24 12:42:30 +00:00
Guillaume Chatelet	409f0dc4a4	[Alignment][NFC] Use the Align type in MCSection Differential Revision: https://reviews.llvm.org/D138653	2022-11-24 12:32:58 +00:00
Anton Sidorenko	9ee8d2e081	[Debugify] Strip llvm.mir.debugify metadata We don't strip llvm.mir.debugify metadata in `llvm::stripDebugifyMetadata`. This may lead to incorrect number of lines and variables in the metadata when we run debugify twice, e.g. -run-pass=mir-debugify,...,mir-strip-debug,...,mir-debugify. Differential Revision: https://reviews.llvm.org/D138417	2022-11-24 12:20:21 +03:00
Fangrui Song	fa36d72305	[LoopVectorize] Internalize some cl::opt	2022-11-23 23:03:02 -08:00
Vasileios Porpodas	af4e856fa7	[NFC] Replaced BB->getInstList().{erase(),pop_front(),pop_back()} with eraseFromParent(). Differential Revision: https://reviews.llvm.org/D138617	2022-11-23 22:47:46 -08:00
Vasileios Porpodas	8b9a62ee49	[NFC] Use BB->size() instead of BB->getInstList().size(). Differential Revision: https://reviews.llvm.org/D138616	2022-11-23 17:25:53 -08:00
Matt Arsenault	f0693277c7	CloneModule: Handling cloning ifuncs This is tested in a future llvm-reduce patch.	2022-11-23 12:22:06 -05:00
Matt Arsenault	cb0d2887ab	Utils: Fix deleting calls to null in non-0 address spaces	2022-11-23 08:49:44 -05:00
Matt Devereau	ee4d6c8bf0	[VectorCombine] Enable scalarizeBinopOrCmp for scalable vectors This reverts a change to exclude scalarizeBinopOrCmp in VectorCombine for scalable vectors which caused poor scalable Binop codegen. Differential Revision: https://reviews.llvm.org/D138545	2022-11-23 13:17:21 +00:00
Benjamin Kramer	5cfc22cafe	Revert "[SROA] `isVectorPromotionViable()`: memory intrinsics operate on vectors of bytes" This reverts commit `cf624b23bc`. It triggers crashes in clang, see the comments on github on the original change.	2022-11-23 13:11:16 +01:00
Stefan Gränitz	c20a80092c	[ObjC][ARC] Fix UB in ObjCARCOpt with -enable-objc-arc-opts=false When ObjCARCOpt::run() returned early, Changed and CFGChanged were never initialized. CFGChanged is read unconditionally afterwards. This came up in the course of D137942.	2022-11-23 11:30:39 +01:00
Matt Arsenault	6463961941	InstCombine: Fold some identities for canonicalize Equality is directly stated as true in the LangRef, and I believe this works for every compare type.	2022-11-22 21:42:44 -05:00
Fangrui Song	297a183022	[asan] Don't demangle __odr_asan_gen_* symbols This relands the ODR indicator part of D138095 (reverted by `06c74b5e73`): a `__odr_asan_gen_*` symbol should use a mangled name as its associated symbol does.	2022-11-22 16:47:33 -08:00
Fangrui Song	06c74b5e73	Revert D138095 Use InernalAlloc in DemangleCXXABI Broke 2/3 tests on macOS which seem to be related to `free(demangled_name)` in DemangleCXXABI.	2022-11-22 16:29:24 -08:00
Roman Lebedev	655d857325	[SROA] `isVectorPromotionViable()`: avoid allowing overly large vectors Otherwise, `compiler-rt/test/asan/TestCases/pr33372.cpp` fails with an assertion: ``` clang-16: /repositories/llvm-project/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp:11988: void llvm::SelectionDAG::createOperands(llvm::SDNode *, ArrayRef<llvm::SDValue>): Assertion `SDNode::getMaxNumOperands() >= Vals.size() && "too many operands to fit into SDNode"' failed. ``` I'm not sure if this should be even more conservative, or if we have a named constant for this in middle-end.	2022-11-23 03:23:08 +03:00
Roman Lebedev	cf624b23bc	[SROA] `isVectorPromotionViable()`: memory intrinsics operate on vectors of bytes Now, there's a big caveat here - these bytes are abstract bytes, not the i8 we have in LLVM, so strictly speaking this is not exactly legal, see e.g. https://github.com/AliveToolkit/alive2/issues/860 ^ the "bytes" "could" have been a pointer, and loading it as an integer inserts an implicit ptrtoint. But at the same time, InstCombine's `InstCombinerImpl::SimplifyAnyMemTransfer()` would expand a memtransfer of 1/2/4/8 bytes into integer-typed load+store, so this isn't exactly a new problem. Note that in memory, poison is byte-wise, so we really can't widen elements, but SROA seems to be inconsistent here. Fixes #59116.	2022-11-23 02:38:25 +03:00
Sami Tolvanen	cacd3e73d7	Add generic KCFI operand bundle lowering The KCFI sanitizer emits "kcfi" operand bundles to indirect call instructions, which the LLVM back-end lowers into an architecture-specific type check with a known machine instruction sequence. Currently, KCFI operand bundle lowering is supported only on 64-bit X86 and AArch64 architectures. As a lightweight forward-edge CFI implementation that doesn't require LTO is also useful for non-Linux low-level targets on other machine architectures, add a generic KCFI operand bundle lowering pass that's only used when back-end lowering support is not available and allows -fsanitize=kcfi to be enabled in Clang on all architectures. This relands commit `eb2a57ebc7` with fixes. Reviewed By: nickdesaulniers, MaskRay Differential Revision: https://reviews.llvm.org/D135411	2022-11-22 23:01:18 +00:00
Roman Lebedev	529eafd9be	[SROA] `isVectorPromotionViable()`: integer-ify non-pointer non-common types This rectifies a FIXME that dates all the way back to 2014 about not doing so due to the backend issues. Presumably sufficient amount of time has passes and all the known issues have been addressed, or at least we will find out of there are some left...	2022-11-23 00:23:00 +03:00
Roman Lebedev	4e18d51ac5	[SROA] `isVectorPromotionViable()`: pointer-ness is sticky As it has been established previously by precedent, if we see a pointer type, then that is the type we must use. Essentially, we don't want to introduce `inttoptr`'s.	2022-11-23 00:23:00 +03:00
Benjamin Kramer	f116107f2d	[VectorCombine] Don't touch instruction after foldSingleElementStore, it might be deleted Use after free found by asan.	2022-11-22 21:12:42 +01:00
Rong Xu	6327d263f5	[CHR] Add a threshold for the code duplication ControlHeightReduction (CHR) clones the code region to reduce the branches in the hot code path. The number of clones is linear to the depth of the region. Currently it does not have control over the code size increase. We are seeing one ~9000 BB functions get expanded to ~250000 BBs, an 25x increase. This creates a big compile time issue for the downstream optimizations. This patch adds a cap for number of clones for one region. Differential Revision: https://reviews.llvm.org/D138333	2022-11-22 11:36:40 -08:00
Matt Arsenault	afb3509113	LoopDeletion: Fix missing newlines in debug printing	2022-11-22 11:12:00 -05:00
Sanjay Patel	ede6d608f4	[VectorCombine] switch on opcode to compile faster This follows `87debdadaf` to further eliminate wasting time calling helper functions only to early return to the main run loop. Once again, this results in significant savings based on experimental data: https://llvm-compile-time-tracker.com/compare.php?from=01023bfcd33f922ed8c934ce563e54abe8bfe246&to=3dce4f70b73e48ccb045decb634c185e6b4c67d5&stat=instructions:u This is NFCI other than making the pass faster. The total cost of VectorCombine runs in an -O3 build appears to be well under 0.1% of compile-time now, so there's not much left to do AFAICT. There's a TODO about making the code cleaner, but it probably doesn't change timing much. I didn't include those changes here because it requires updating much more code.	2022-11-22 10:23:32 -05:00
Thomas Symalla	470aea5ed4	[InstCombine] Fold extractelt with select of constants An extractelt with a constant index which extracts an element from the two vector operands of a select can be directly folded into a select. extractelt (select %x, %vec1, %vec2), %const -> select %x, %vec1[%const], %vec2[%const] Note: the implementation currently only works for constant vector operands. Reviewed By: foad, spatel Differential Revision: https://reviews.llvm.org/D137934	2022-11-22 14:07:06 +01:00
David Green	8e9e22f07b	[LoopFlatten] Fix IV increment use count The add from the IV in the inner loop was always checking for 2 uses, the phi and the compare. The compare could be based on the phi though, leaving one valid use of the compare. In the testcase we could be left with the phi and a lcssa phi as the two users, invalidly allowing flattening where we shouldn't. Fixes 58441 Differential Revision: https://reviews.llvm.org/D138404	2022-11-22 07:23:56 +00:00
Max Kazantsev	57fd7ffeff	[IndVarSimplify] Lift limitations on IV being a Phi for turn-to-invariant These limitations are too strict, and their only purpose is to avoid code size explosion. These restrictions seem obsolete, and the size problem is solved in other places through cheap expansion limits. The motivation is that the old code cannot deal with comparisons against induction variant's increment. Differential Revision: https://reviews.llvm.org/D138412 Reviewed By: lebedev.ri, reames	2022-11-22 12:53:37 +07:00
Kazu Hirata	1f914944b6	Don't use Optional::getPointer (NFC) Since std::optional does not offer getPointer(), this patch replaces X.getPointer() with &*X to make the migration from llvm::Optional to std::optional easier. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716 Differential Revision: https://reviews.llvm.org/D138466	2022-11-21 19:03:40 -08:00
Fangrui Song	db7c82231c	Restore global descriptor demangling after D138095 "[asan] Keep Itanium mangled names in global metadata" This amends commit `00be3578e0` to demangle symbol names in global descriptors. We keep the mangled name for the `__odr_gen_asan_*` variables and the runtime __cxa_demangle call site change (which fixed possible leaks for other scenarios: non-fatal diagnostics). compiler-rt/lib/sanitizer_common/sanitizer_symbolizer_posix_libcdep.cpp uses an undefined weak `__cxa_demangle` which does not pull in an archive definition. A -static-libstdc++ executable link does not get demangled names. Unfortunately this means we cannot rely on runtime demangling. See compiler-rt/test/asan/TestCases/global-demangle.cpp	2022-11-21 20:51:52 +00:00
Sanjay Patel	163bb6d64e	[Passes][VectorCombine] enable early run generally and try load folds An early run of VectorCombine was added with D102496 specifically to deal with unnecessary vector ops produced with the C matrix extension. This patch is proposing to try those folds in general and add a pair of load folds to the menu. The load transform will partly solve (see PhaseOrdering diffs) a longstanding vectorization perf bug by removing redundant loads via GVN: issue #17113 The main reason for not enabling the extra pass generally in the initial patch was compile-time cost. The cost of VectorCombine was significantly (surprisingly) improved with: `87debdadaf` https://llvm-compile-time-tracker.com/compare.php?from=ffe05b8f57d97bc4340f791cb386c8d00e0739f2&to=87debdadaf18f8a5c7e5d563889e10731dc3554d&stat=instructions:u ...so the extra run is going to cost very little now - the total cost of the 2 runs should be less than the 1 run before that micro-optimization: https://llvm-compile-time-tracker.com/compare.php?from=5e8c2026d10e8e2c93c038c776853bed0e7c8fc1&to=2c4b68eab5ae969811f422714e0eba44c5f7eefb&stat=instructions:u It may be possible to reduce the cost slightly more with a few more earlier-exits like that, but it's probably in the noise based on timing experiments. Differential Revision: https://reviews.llvm.org/D138353	2022-11-21 13:57:55 -05:00
Sanjay Patel	8f337f8ffe	[VectorCombine] generalize pass param name for early combines; NFC The option was added with https://reviews.llvm.org/D102496, and currently the name is accurate, but I am hoping to add a load transform that is not a scalarization. See issue #17113.	2022-11-21 13:57:55 -05:00
Manuel Brito	1e55d5b1f2	Use poison instead of undef as placeholder for vector construction [NFC] Differential Revision: https://reviews.llvm.org/D138450	2022-11-21 18:43:23 +00:00
Alexey Bataev	ac93b61165	[SLP]Fix PR59098: check if the vector type is scalarized for extractelements. If the resulting type is going to be scalarized, no need to adjust the cost of removed extractelement and insert/extract subvector costs. Otherwise, the compiler can crash because of the wrong type sizes.	2022-11-21 10:26:01 -08:00
Max Kazantsev	2a3ac7fd0c	[NFC][IndVars] Add LLVM_DEBUG printout to replaceExitCond	2022-11-21 19:33:26 +07:00
Kazu Hirata	31b6093434	[Scalar] Teach matchExpandedRem to return std::optional (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-20 22:38:43 -08:00
Bjorn Pettersson	1c308d6641	[LV] Clean up LoopVectorizationCostModel::calculateRegisterUsage. NFC Minor refactoring in LoopVectorizationCostModel::calculateRegisterUsage. Also adding some FIXME:s related to what appears to be some short comings related to how the register usage is calculated. Differential Revision: https://reviews.llvm.org/D138342	2022-11-20 20:52:13 +01:00
Kazu Hirata	7524db4d44	[llvm] Remove unused forward declarations (NFC)	2022-11-20 09:59:36 -08:00
Kazu Hirata	1fa870b1bd	Use None consistently (NFC) This patch replaces NoneType() and NoneType::None with None in preparation for migration from llvm::Optional to std::optional. In the std::optional world, we are not guranteed to be able to default-construct std::nullopt_t or peek what's inside it, so neither NoneType() nor NoneType::None has a corresponding expression in the std::optional world. Once we consistently use None, we should even be able to replace the contents of llvm/include/llvm/ADT/None.h with something like: using NoneType = std::nullopt_t; inline constexpr std::nullopt_t None = std::nullopt; to ease the migration from llvm::Optional to std::optional. Differential Revision: https://reviews.llvm.org/D138376	2022-11-20 00:24:40 -08:00
Kazu Hirata	5d1ae6346b	[Analysis] Teach getOptionalIntLoopAttribute to return std::optional (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-19 15:36:50 -08:00
Fangrui Song	00be3578e0	[asan] Keep Itanium mangled names in global metadata The runtime calls `MaybeDemangleGlobalName` for error reporting and `__cxxabiv1::__cxa_demangle` is called if available, so demanging Itanium mangled names in global metadata is unnecessary and wastes data size. Add `MaybeDemangleGlobalName` in ODR violation detection to support demangled names in a suppressions file. `MaybeDemangleGlobalName` may call `DemangleCXXABI` and leak memory. Use an internal allocation to prevent lsan leak (in case there is no fatal asan error). The debug feature `report_globals=2` prints information for all instrumented global variables. `MaybeDemangleGlobalName` would be slow, so don't do that. The output looks like `Added Global[0x56448f092d60]: beg=0x56448fa66d60 size=4/32 name=_ZL13test_global_2` and I think the mangled name is fine. Other mangled schemes e.g. Windows (see win-string-literal.ll) remain the current behavior. Reviewed By: hctim Differential Revision: https://reviews.llvm.org/D138095	2022-11-19 01:06:26 +00:00
Sanjay Patel	87debdadaf	[VectorCombine] check instruction type before dispatching to folds This is no externally visible change intended, but appears to be a noticeable (surprising) improvement in compile-time based on: https://llvm-compile-time-tracker.com/compare.php?from=0f3e72e86c8c7c6bf0ec24bf1e2acd74b4123e7b&to=5e8c2026d10e8e2c93c038c776853bed0e7c8fc1&stat=instructions:u The early returns in the individual fold functions are not good enough to avoid the overhead of the many "fold*" calls, so this speeds up the main instruction loop enough to make a difference.	2022-11-18 16:03:18 -05:00
OCHyams	4ba08d512c	[Assignment Tracking][24/*] Always RemoveRedundantDbgInstrs in instcombine in assignment tracking builds The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir This reduces peak memory overhead by 15% when building CTMark's tramp3d-v4 with -O2 -g with assignment tracking enabled. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D133321	2022-11-18 12:36:41 +00:00
OCHyams	e3cd498ff7	[Assignment Tracking][21/*] Account for assignment tracking in inliner The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir The inliner requires two additions: fixupAssignments - Update inlined instructions' DIAssignID metadata so that inlined DIAssignID attachments are unique to the inlined instance. trackInlinedStores - Treat inlined stores to caller-local variables (i.e. callee stores to argument pointers that point to the caller's allocas) as assignments. Track them using trackAssignments, which is the same method as is used by the AssignmentTrackingPass. This means that we're able to detect stale memory locations due to DSE after inlining. Because the stores are only tracked _after_ inlining, any DSE or movement of stores _before_ inlining will not be accounted for. This is an accepted limitation mentioned in the RFC. One change is also required: Update CloneBlock to preserve debug use-before-defs. Otherwise the assignments will be dropped due to having the intrinsic operands replaced with empty metadata (see use-before-def.ll in this patch and this related discourse post. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D133318	2022-11-18 11:55:05 +00:00
OCHyams	86464ed3df	[Assignment Tracking][15/*] Account for assignment tracking in simplifycfg The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir Update simplifycfg: sinkLastInstruction - preserve debug use-before-defs. SpeculativelyExecuteBB - replace the value component of dbg.assign intrinsics when stores are hoisted and merged using a select, and don't delete them. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D133310	2022-11-18 10:15:55 +00:00

1 2 3 4 5 ...

32159 Commits