llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazu Hirata	b6a01caa64	[llvm/unittests] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 22:10:37 -08:00
Mircea Trofin	1ee3bb17c3	[mlgo][nfc] Make `LoggedFeatureSpec` an implementation detail It's an artifact very specific to using TFAgents during training, so it belongs with ModelUnderTrainingRunner. Differential Revision: https://reviews.llvm.org/D139031	2022-11-30 15:57:58 -08:00
Teresa Johnson	9eacbba290	Restore "[MemProf] ThinLTO summary support" with more fixes This restores commit `98ed423361` and follow on fix `00c22351ba`, which were reverted in `5d938eb6f7` due to an MSVC bot failure. I've included a fix for that failure. Differential Revision: https://reviews.llvm.org/D135714	2022-11-16 09:42:41 -08:00
Jeremy Morse	5d938eb6f7	Revert "Restore "[MemProf] ThinLTO summary support" with fixes" This reverts commit `00c22351ba`. This reverts commit `98ed423361`. Seemingly MSVC has some kind of issue with this patch, in terms of linking: https://lab.llvm.org/buildbot/#/builders/123/builds/14137 I'll post more detail on D135714 momentarily.	2022-11-16 11:21:02 +00:00
Teresa Johnson	98ed423361	Restore "[MemProf] ThinLTO summary support" with fixes This restores `4745945500`, which was reverted in commit `452a14efc8`, along with fixes for a couple of bot failures.	2022-11-15 08:55:17 -08:00
Teresa Johnson	452a14efc8	Revert "[MemProf] ThinLTO summary support" This reverts commit `4745945500`. Revert while I try to fix a couple of non-Linux build failures.	2022-11-15 07:39:40 -08:00
Teresa Johnson	4745945500	[MemProf] ThinLTO summary support Implements the ThinLTO summary support for memprof related metadata. This includes support for the assembly format, and for building the summary from IR during ModuleSummaryAnalysis. To reduce space in both the bitcode format and the in memory index, we do 2 things: 1. We keep a single vector of all uniq stack id hashes, and record the index into this vector in the callsite and allocation memprof summaries. 2. When building the combined index during the LTO link, the callsite and allocation memprof summaries are only kept on the FunctionSummary of the prevailing copy. Differential Revision: https://reviews.llvm.org/D135714	2022-11-15 06:45:12 -08:00
Nathan James	6aa050a690	Reland "[llvm][NFC] Use c++17 style variable type traits" This reverts commit `632a389f96`. This relands commit `1834a310d0`. Differential Revision: https://reviews.llvm.org/D137493	2022-11-08 14:15:15 +00:00
Nathan James	632a389f96	Revert "[llvm][NFC] Use c++17 style variable type traits" This reverts commit `1834a310d0`.	2022-11-08 13:11:41 +00:00
Nathan James	1834a310d0	[llvm][NFC] Use c++17 style variable type traits This was done as a test for D137302 and it makes sense to push these changes Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D137493	2022-11-08 12:22:52 +00:00
Karthik Senthil	d9c52c31a0	[LV][IVDescriptors] Fix recurrence identity element for FMin and FMax reductions For a min and max reduction idioms, the identity (i.e. neutral) element should be datatype's highest and lowest possible values respectively. Current implementation in IVDescriptors incorrectly returns -Inf for FMin reduction and +Inf for FMax reduction. This patch fixes this bug which was causing incorrect reduction computation results in loops vectorized by LV. Differential Revision: https://reviews.llvm.org/D137220	2022-11-04 10:39:37 -04:00
Nikita Popov	134bda4b61	[ValueLattice] Use DL-aware folding in getCompare() Use DL-aware ConstantFoldCompareInstOperands() API instead of ConstantExpr API. The practical effect of this is that SCCP can now fold comparisons that require DL.	2022-11-02 10:41:11 +01:00
Nikita Popov	7ad3bb527e	Reapply [ValueLattice] Fix getCompare() for undef values Relative to the previous attempt, this also updates the ValueLattice unit tests. ----- Resolve the TODO about incorrect getCompare() behavior. This can be made more precise (e.g. by materializing the undef value and performing constant folding on it), but for now just return an unknown result to fix the correctness issue. This should be NFC in terms of user-visible behavior, because the only user of this method (SCCP) was already guarding against UndefValue results.	2022-11-02 10:23:06 +01:00
Kazu Hirata	4d594d810d	[Analysis] Fix warnings This patch fixes: llvm/utils/unittest/googletest/include/gtest/gtest.h:1526:11: error: comparison of integers of different signs: 'const unsigned long' and 'const int' [-Werror,-Wsign-compare]	2022-10-30 12:52:06 -07:00
Simon Pilgrim	55a11b542e	[VectorUtils] Add getShuffleDemandedElts helper We have similar code to translate a demanded elements mask for a shuffle's operands in multiple places - this patch adds a helper function to VectorUtils and updates a number of locations to use it directly. Differential Revision: https://reviews.llvm.org/D136832	2022-10-30 17:03:55 +00:00
Nikita Popov	747f27d97d	[AA] Rename getModRefBehavior() to getMemoryEffects() (NFC) Follow up on D135962, renaming the method name to match the new type name.	2022-10-19 11:03:54 +02:00
Nikita Popov	1a9d9823c5	[AA] Rename uses of FunctionModRefBehavior (NFC) Followup to D135962 to rename remaining uses of FunctionModRefBehavior to MemoryEffects. Does not touch API names yet, but also updates variables names FMRB/MRB to ME, to match the new type name.	2022-10-19 10:54:47 +02:00
Nikita Popov	8d569e638b	[AA] Update unit test missed in previous commit (NFC) Missed this unit test use in `3d0b5f019e`.	2022-10-06 10:28:05 +02:00
Nikita Popov	c5bf452022	[AA] Pass AAResults through AAQueryInfo Currently, AAResultBase (from which alias analysis providers inherit) stores a reference back to the AAResults aggregation it is part of, so it can perform recursive alias analysis queries via getBestAAResults(). This patch removes the back-reference from AAResultBase to AAResults, and instead passes the used aggregation through the AAQueryInfo. This can be used to perform recursive AA queries using the full aggregation. Differential Revision: https://reviews.llvm.org/D94363	2022-10-06 10:10:19 +02:00
Arthur Eubanks	a8f1da128d	[LazyCallGraph] Handle spurious ref edges when deleting a dead function Spurious ref edges are ref edges that still exist in the call graph even though the corresponding IR reference no longer exists. This can cause issues when deleting a dead function which has a spurious ref edge pointed at it because currently we expect the dead function's RefSCC to be trivial. In the case that the dead function's RefSCC is not trivial, remove all ref edges from other nodes in the RefSCC to it. Removing a ref edge can result in splitting RefSCCs. There's actually no reason to revisit those RefSCCs because currently we only run passes on SCCs, and we've already added all SCCs in the RefSCC to the worklist. (as opposed to removing the ref edge in updateCGAndAnalysisManagerForPass() which can modify the call graph of SCCs we have not visited yet). We also don't expect that RefSCC refinement will allow us to glean any more information for optimization use. Also, doing so would drastically increase the complexity of LazyCallGraph::removeDeadFunction(), requiring us to return a list of invalidated RefSCCs and new RefSCCs to add to the worklist. Fixes #56503 Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D133907	2022-09-22 15:01:15 -07:00
Kazu Hirata	1cd4563013	[llvm] Use has_value instead of hasValue (NFC)	2022-09-18 19:45:34 -07:00
Nikita Popov	b1cd393f9e	[AA] Tracking per-location ModRef info in FunctionModRefBehavior (NFCI) Currently, FunctionModRefBehavior tracks whether the function reads or writes memory (ModRefInfo) and which locations it can access (argmem, inaccessiblemem and other). This patch changes it to track ModRef information per-location instead. To give two examples of why this is useful: * D117095 highlights a weakness of ModRef modelling in the presence of operand bundles. For a memcpy call with deopt operand bundle, we want to say that it can read any memory, but only write argument memory. This would allow them to be treated like any other calls. However, we currently can't express this and have to say that it can read or write any memory. * D127383 would ideally be modelled as a separate threadid location, where threadid Refs outside pre-split coroutines can be ignored (like other accesses to constant memory). The current representation does not allow modelling this precisely. The patch as implemented is intended to be NFC, but there are some obvious opportunities for improvements and simplification. To fully capitalize on this we would also want to change the way we represent memory attributes on functions, but that's a larger change, and I think it makes sense to separate out the FunctionModRefBehavior refactoring. Differential Revision: https://reviews.llvm.org/D130896	2022-09-14 16:34:41 +02:00
Aiden Grossman	ec83c7e358	[MLGO] Make TFLiteUtils throw an error if some features haven't been passed to the model In the Tensorflow C lib utilities, an error gets thrown if some features haven't gotten passed into the model (due to differences in ordering which now don't exist with the transition to TFLite). However, this is not currently the case when using TFLiteUtils. This patch makes some minor changes to throw an error when not all inputs of the model have been passed, which when not handled will result in a seg fault within TFLite. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D133451	2022-09-10 22:59:03 +00:00
Nikita Popov	ebbac868b5	[AST] Fix unit test to use BatchAA (NFC)	2022-09-09 16:07:04 +02:00
Sanjay Patel	4e44c22c97	[ValueTracking][InstCombine] restrict FP min/max matching to avoid miscompile This is a long-standing FIXME with a non-FMF test that exposes the bug as shown in issue #57357. It's possible that there's still a way to miscompile by mis-identifying/mis-folding FP min/max patterns, but this patch only exposes a couple of seemingly minor regressions while preventing the broken transform.	2022-08-25 16:52:40 -04:00
Mircea Trofin	5ce4c9aa04	[mlgo] Use TFLite for 'development' mode. TLite is a lightweight, statically linkable[1], model evaluator, supporting a subset of what the full tensorflow library does, sufficient for the types of scenarios we envision having. It is also faster. We still use saved models as "source of truth" - 'release' mode's AOT starts from a saved model; and the ML training side operates in terms of saved models. Using TFLite solves the following problems compared to using the full TF C API: - a compiler-friendly implementation for runtime-loadable (as opposed to AOT-embedded) models: it's statically linked; it can be built via cmake; - solves an issue we had when building the compiler with both AOT and full TF C API support, whereby, due to a packaging issue on the TF side, we needed to have the pip package and the TF C API library at the same version. We have no such constraints now. The main liability is it supporting a subset of what the full TF framework does. We do not expect that to cause an issue, but should that be the case, we can always revert back to using the full framework (after also figuring out a way to address the problems that motivated the move to TFLite). Details: This change switches the development mode to TFLite. Models are still expected to be placed in a directory - i.e. the parameters to clang don't change; what changes is the directory content: we still need an `output_spec.json` file; but instead of the saved_model protobuf and the `variables` directory, we now just have one file, `model.tflite`. The change includes a utility showing how to take a saved model and convert it to TFLite, which it uses for testing. The full TF implementation can still be built (not side-by-side). We intend to remove it shortly, after patching downstream dependencies. The build behavior, however, prioritizes TFLite - i.e. trying to enable both full TF C API and TFLite will just pick TFLite. [1] thanks to @petrhosek's changes to TFLite's cmake support and its deps!	2022-08-24 16:07:24 -07:00
Chuanqi Xu	00fa513c03	[NFC] Add tests for MemorySSA shouldn't contain debug instructions Extracted from D130153	2022-08-19 15:03:29 +08:00
Martin Sebor	0dcfe7aa35	[InstCombine] Tighten up known library function signature tests (PR #56463 ) Replace a switch statement used to validate arguments to known library functions with a more consistent table-driven approach and tighten it up.	2022-08-10 14:15:46 -06:00
Mircea Trofin	0cb9746a7d	[nfc][mlgo] Separate logger and training-mode model evaluator This just shuffles implementations and declarations around. Now the logger and the TF C API-based model evaluator are separate. Differential Revision: https://reviews.llvm.org/D131116	2022-08-03 16:20:28 -07:00
Nikita Popov	f96ea53e89	[AA] Do not track Must in ModRefInfo getModRefInfo() queries currently track whether the result is a MustAlias on a best-effort basis. The only user of this functionality is the optimized memory access type in MemorySSA -- which in turn has no users. Given that this functionality has not found a user since it was introduced five years ago (in D38862), I think we should drop it again. The context is that I'm working to separate FunctionModRefBehavior to track mod/ref for different location kinds (like argmem or inaccessiblemem) separately, and the fact that ModRefInfo also has an unrelated Must flag makes this quite awkward, especially as this means that NoModRef is not a zero value. If we want to retain the functionality, I would probably split getModRefInfo() results into a part that just contains the ModRef information, and a separate part containing a (best-effort) AliasResult. Differential Revision: https://reviews.llvm.org/D130713	2022-08-01 07:14:31 +02:00
Sanjay Patel	a925bef70c	[ValueTracking] allow vector types in isImpliedCondition() The matching of constants assumed integers, but we can handle splat vector constants seamlessly with m_APInt.	2022-07-24 17:46:48 -04:00
Sanjay Patel	4da47bee48	[ValueTracking] add test for isImpliedCondition with vector types; NFC	2022-07-24 17:07:17 -04:00
Teresa Johnson	1dad6247d2	[MemProf] Add memprof metadata related analysis utilities Adds a number of utilities that are used to help create and update memprof related metadata. These will be used during profile matching and annotation, as well as by the inliner when updating the metadata. Also adds unit tests for the utilities. See also related RFCs: RFC: Sanitizer-based Heap Profiler [1] RFC: A binary serialization format for MemProf [2] RFC: IR metadata format for MemProf [3] (Note that the IR metadata format has changed from the RFC during implementation, as described in the preceeding patch adding the basic metadata and verification support.) Depends on D128141. Differential Revision: https://reviews.llvm.org/D128854	2022-07-21 13:46:01 -07:00
Chuanqi Xu	645d2dd3a9	Revert "Don't treat readnone call in presplit coroutine as not access memory" This reverts commit `57224ff4a6`. This commit may trigger crashes on some workloads. Revert it for clearness.	2022-07-20 17:00:58 +08:00
Chuanqi Xu	57224ff4a6	Don't treat readnone call in presplit coroutine as not access memory To solve the readnone problems in coroutines. See https://discourse.llvm.org/t/address-thread-identification-problems-with-coroutine/62015 for details. According to the discussion, we decide to fix the problem by inserting isPresplitCoroutine() checks in different passes instead of wrapping/unwrapping readnone attributes in CoroEarly/CoroCleanup passes. In this direction, we might not be able to cover every case at first. Let's take a "find and fix" strategy. Reviewed By: nikic, nhaehnle, jyknight Differential Revision: https://reviews.llvm.org/D127383	2022-07-20 10:37:23 +08:00
Max Kazantsev	51f837a680	[NFC] Introduce API to detect tokens penetrating LCSSA form Following discussion in PR56243, we need to somehow detect the situation when token values penetrate LCSSA form for transforms that require that it is maintained by all values (for example, to sustain use-def dominance invarians). This patch introduces a parameter to LCSSA checkers to control their ignorance about tokens. Differential Revision: https://reviews.llvm.org/D129983 Reviewed By: efriedma	2022-07-19 13:52:30 +07:00
Vitaly Buka	00b5a6eba1	[test] Fix old memory leak in a test	2022-07-15 16:56:20 -07:00
Kazu Hirata	611ffcf4e4	[llvm] Use value instead of getValue (NFC)	2022-07-13 23:11:56 -07:00
Kazu Hirata	3361a364e6	[llvm] Use has_value instead of hasValue (NFC)	2022-07-12 22:25:42 -07:00
Nikita Popov	11950efe06	[ConstExpr] Remove div/rem constant expressions D128820 stopped creating div/rem constant expressions by default; this patch removes support for them entirely. The getUDiv(), getExactUDiv(), getSDiv(), getExactSDiv(), getURem() and getSRem() on ConstantExpr are removed, and ConstantExpr::get() now only accepts binary operators for which ConstantExpr::isSupportedBinOp() returns true. Uses of these methods may be replaced either by corresponding IRBuilder methods, or ConstantFoldBinaryOpOperands (if a constant result is required). On the C API side, LLVMConstUDiv, LLVMConstExactUDiv, LLVMConstSDiv, LLVMConstExactSDiv, LLVMConstURem and LLVMConstSRem are removed and corresponding LLVMBuild methods should be used. Importantly, this also means that constant expressions can no longer trap! This patch still keeps the canTrap() method to minimize diff -- I plan to drop it in a separate NFC patch. Differential Revision: https://reviews.llvm.org/D129148	2022-07-06 10:11:34 +02:00
Kazu Hirata	a7938c74f1	[llvm] Don't use Optional::hasValue (NFC) This patch replaces Optional::hasValue with the implicit cast to bool in conditionals only.	2022-06-25 21:42:52 -07:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Kazu Hirata	d152e50c15	[llvm] Don't use Optional::{hasValue,getValue} (NFC)	2022-06-25 11:24:23 -07:00
Mircea Trofin	3f8e4169c1	[FunctionPropertiesAnalysis] Generalize support for unreachable Generalized support for subgraphs that get rendered unreachable, for both `call` and `invoke` cases. Differential Revision: https://reviews.llvm.org/D127921	2022-06-21 08:18:01 -07:00
Mircea Trofin	22a1f998f7	FunctionPropertiesAnalysis: handle callsite BBs that lose edges There could be successors that were reached before but now are only reachable from elsewhere in the CFG. Suppose the following diamond CFG (lines are arrows pointing down): A / \ B C \ / D There's a call site in C that is inlined. Upon doing that, it turns out it expands to: call void @llvm.trap() unreachable D isn't reachable from C anymore, but we did discount it when we set up FunctionPropertiesUpdater, so we need to re-include it here. The patch also updates loop accounting to use LoopInfo rather than traverse BBs. Differential Revision: https://reviews.llvm.org/D127353	2022-06-14 15:19:44 -07:00
Mircea Trofin	b8c39eb275	Fix FunctionPropertiesAnalysis updating callsite in 1-BB loop If the callsite is in a single BB loop, we need to exclude the BB from the successor set (in which it'd be a member), because that set forms a boundary at which we stop traversing the CFG, when re-ingesting BBs after inlining; but after inlining, the callsite BB's new successors should be visited. Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D127178	2022-06-08 14:32:00 -07:00
Mircea Trofin	f46dd19b48	[mlgo] Incrementally update FunctionPropertiesInfo during inlining Re-computing FunctionPropertiesInfo after each inlining may be very time consuming: in certain cases, e.g. large caller with lots of callsites, and when the overall IR doesn't increase (thus not tripping a size bloat threshold). This patch addresses this by incrementally updating FunctionPropertiesInfo. Differential Revision: https://reviews.llvm.org/D125841	2022-05-31 17:27:32 -07:00
Simon Pilgrim	345ed58ed5	Fix implicit double -> float truncation warnings. NFCI.	2022-05-13 19:07:00 +01:00
Arthur Eubanks	7e0802aeb5	[BasicAA] Fix order in which we pass MemoryLocations to alias() D98718 caused the order of Values/MemoryLocations we pass to alias() to be significant due to storing the offset in the PartialAlias case. But some callers weren't audited and were still passing swapped arguments, causing the returned PartialAlias offset to be negative in some cases. For example, the newly added unittests would return -1 instead of 1. Fixes #55343, a miscompile. Reviewed By: asbirlea, nikic Differential Revision: https://reviews.llvm.org/D125328	2022-05-10 12:05:38 -07:00

1 2 3 4 5 ...

674 Commits