llvm-project

Commit Graph

Author	SHA1	Message	Date
Dmitry Makogon	361034ba78	[NFC] Add LazyValueInfo::clear method This method just calls LazyValueInfoImpl::clear	2022-03-15 17:52:50 +07:00
Arthur Eubanks	4fc7c55fff	[NewPM] Actually recompute GlobalsAA before module optimization pipeline RequireAnalysis<GlobalsAA> doesn't actually recompute GlobalsAA. GlobalsAA isn't invalidated (unless specifically invalidated) because it's self-updating via ValueHandles, but can be imprecise during the self-updates. Rather than invalidating GlobalsAA, which would invalidate AAManager and any analyses that use AAManager, create a new pass that recomputes GlobalsAA. Fixes #53131. Differential Revision: https://reviews.llvm.org/D121167	2022-03-14 09:42:34 -07:00
Arthur Eubanks	55cf09ae26	[ValueTracking] Simplify llvm::isPointerOffset() We still need the code after stripAndAccumulateConstantOffsets() since it doesn't handle GEPs of scalable types and non-constant but identical indexes. Differential Revision: https://reviews.llvm.org/D120523	2022-03-14 09:32:36 -07:00
Nikita Popov	04b717c423	[TLI] Check that malloc argument has type size_t DSE assumes that this is the case when forming a calloc from a malloc + memset pair. For tests, either update the malloc signature or change the data layout.	2022-03-14 17:22:24 +01:00
Andrew Litteken	0c4bbd293e	[IRSim] Make sure the first instruction of a block doesn't get missed if it is the first valid instruction in Module. If an instruction is first legal instruction in the module, and is the only legal instruction in its basic block, it will be ignored by the outliner due to a length check inherited from the older version of the outliner that was restricted to outlining within a single basic block. This removes that check, and updates any tests that broke because of it. Reviewer: paquette Differential Revision: https://reviews.llvm.org/D120786	2022-03-13 23:13:09 -05:00
Andrew Litteken	1643f01232	[IRSim][IROutliner] Ignoring Musttail Function Musttail calls require extra handling to properly propagate the calling convention information and tail call information. The outliner does not currently do this, so we ignore call instructions that utilize the swifttailcc and tailcc calling convention as well as functions marked with the attribute musttail. Reviewers: paquette, aschwaighofer Differential Revision: https://reviews.llvm.org/D120733	2022-03-13 19:27:25 -05:00
Andrew Litteken	66f90fdff1	Revert "[IRSim][IROutliner] Ignoring Musttail Function" This reverts commit `c7037c7257`. Pushed too soon	2022-03-13 19:26:51 -05:00
Andrew Litteken	c7037c7257	[IRSim][IROutliner] Ignoring Musttail Function	2022-03-13 18:57:24 -05:00
serge-sans-paille	3d219d805c	Add missing include under EXPENSIVE_CHECKS	2022-03-12 18:54:29 +01:00
serge-sans-paille	ed98c1b376	Cleanup includes: DebugInfo & CodeGen Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D121332	2022-03-12 17:26:40 +01:00
Johannes Doerfert	d6e09ce86f	[CaptureTracking][NFCI] Expose capture tracking logic The logic exposed by this patch via `llvm::DetermineUseCaptureKind` was part of `llvm::PointerMayBeCaptured`. In the Attributor we want to keep track of the work list items but still reuse the logic if a use might capture a value. A follow up for the Attributor removes ~100 lines of code and complexity while making future handling of simplified values possible. Differential Revision: https://reviews.llvm.org/D121272	2022-03-11 22:56:16 -06:00
Anna Thomas	a4aa97d578	[InlineCost] Add cl::opt for target attributes compatibility check. NFC This patch adds a CL option for avoiding the attribute compatibility check between caller and callee in TTI. TTI attribute compatibility checks for target CPU and target features. In our downstream compiler, this attribute always remains the same between callee and caller. By avoiding the addition of this attribute to each of our inline candidate (and then checking them here during inline cost), we save some compile time. The option is kept false, so this change is an NFC upstream.	2022-03-11 18:05:16 -05:00
Nikita Popov	806450805d	[ConstFold] Don't fold calls with mismatching function type With opaque pointers, this is no longer ensured through pointer type identity.	2022-03-11 14:09:23 +01:00
Nikita Popov	02c2106002	[InstSimplify] Handle vector GEP when simplifying zero indices If the base is a scalar and the index is a vector, we can't simplify, as this is effectively a splat operation.	2022-03-11 10:56:44 +01:00
Sanjay Patel	b48fe158e0	[Analysis] remove bogus smin/smax pattern detection This is a revert of `cfcc42bdc`. The analysis is wrong as shown by the minimal tests for instcombine: https://alive2.llvm.org/ce/z/y9Dp8A There may be a way to salvage some of the other tests, but that can be done as follow-ups. This avoids a miscompile and fixes #54311.	2022-03-09 17:50:34 -05:00
Florian Hahn	f98125abb2	Revert "[PassManager] Add pretty stack entries before P->run() call." This reverts commit `128745cc26`. This increased compile-time unnecessarily. Revert this change and follow ups `2c7afadb47` & `add0c5856d`. http://llvm-compile-time-tracker.com/compare.php?from=338dfcd60f843082bb589b287d890dbd9394eb82&to=128745cc2681c284bc6d0150a319673a6d6e8424&stat=instructions	2022-03-09 18:46:32 +00:00
Florian Hahn	128745cc26	[PassManager] Add pretty stack entries before P->run() call. This patch adds PrettyStackEntries before running passes. The entries include the pass name and the IR unit the pass runs on. The information is used the print additional information when a pass crashes, including the name and a reference to the IR unit on which it crashed. This is similar to the behavior of the legacy pass manager. The improved stack trace now includes: Stack dump: 0. Program arguments: bin/opt -loop-vectorize -force-vector-width=4 crash.ll 1. Running pass 'ModuleToFunctionPassAdaptor' on module 'crash.ll' 2. Running pass 'LoopVectorizePass' on function '@a' Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D120993	2022-03-09 13:01:09 +00:00
Nikita Popov	ba8ee4a43e	[SCEV] Verify all IR -> SCEV mappings This extends SCEV verification to check not only backedge-taken counts, but all entries in the IR -> SCEV cache. The restrictions are the same as for the BECount case, i.e. we ignore expressions based on undef, we only diagnose constant deltas (there are way too many false positives otherwise) and we limit to reachable code. Differential Revision: https://reviews.llvm.org/D121104	2022-03-09 09:33:22 +01:00
Arthur Eubanks	53e5e58670	[NewPM][Inliner] Make inlined calls to functions in same SCC as callee exponentially expensive Introduce a new attribute "function-inline-cost-multiplier" which multiplies the inline cost of a call site (or all calls to a callee) by the multiplier. When processing the list of calls created by inlining, check each call to see if the new call's callee is in the same SCC as the original callee. If so, set the "function-inline-cost-multiplier" attribute of the new call site to double the original call site's attribute value. This does not happen when the original call site is intra-SCC. This is an alternative to D120584, which marks the call sites as noinline. Hopefully fixes PR45253. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D121084	2022-03-07 23:51:09 -08:00
Florian Hahn	a2979c8399	[IVDescriptors] Bail out instead of asserting that order is expected. When dealing with multiple phis that depend on each other, the order might have been changed and may not match the expectation. If that happens, bail out, rather than asserting. Fixes https://github.com/llvm/llvm-project/issues/54218 Fixes https://github.com/llvm/llvm-project/issues/54233 Fixes https://github.com/llvm/llvm-project/issues/54254	2022-03-07 19:57:26 +00:00
Nikita Popov	81b43b23e4	[SCEV] Enable verification under EXPENSIVE_CHECKS SCEV verification should no longer affect results of subsequent queries, and our lit tests as well as llvm-test-suite pass with SCEV verification enabled, so I think we can enable it by default under EXPENSIVE_CHECKS now. Differential Revision: https://reviews.llvm.org/D120708	2022-03-07 09:53:00 +01:00
Nikita Popov	d1e880acaa	[SCEV] Enable verification in LoopPM Currently, we hardly ever actually run SCEV verification, even in tests with -verify-scev. This is because the NewPM LPM does not verify SCEV. The reason for this is that SCEV verification can actually change the result of subsequent SCEV queries, which means that you see different transformations depending on whether verification is enabled or not. To allow verification in the LPM, this limits verification to BECounts that have actually been cached. It will not calculate new BECounts. BackedgeTakenInfo::getExact() is still not entirely readonly, it still calls getUMinFromMismatchedTypes(). But I hope that this is not problematic in the same way. (This could be avoided by performing the umin in the other SCEV instance, but this would require duplicating some of the code.) Differential Revision: https://reviews.llvm.org/D120551	2022-03-07 09:46:20 +01:00
Nikita Popov	8133778d3c	[SCEV] Fully invalidate SCEVUnknown on RAUW When a SCEVUnknown gets RAUWd, we currently drop it from the folding set, but don't forget memoized values. I believe we should be treating RAUW the same way as deletion here and invalidate all caches and dependent expressions. I don't have any specific cases where this causes issues right now, but it does address the FIXME in https://reviews.llvm.org/D119488. Differential Revision: https://reviews.llvm.org/D120033	2022-03-07 09:28:28 +01:00
Florian Hahn	de8ac485e5	[IVDescriptor] Remove SinkCandidate from SinkAfter before re-sinking. This ensures the right order in the sink-after map is maintained. If we re-sink an instruction, it must be sunk after all earlier instructions have been sunk. Fixes https://github.com/llvm/llvm-project/issues/54223	2022-03-05 19:48:26 +00:00
Arthur Eubanks	f909aed671	Revert "[SCEV] Infer ranges for SCC consisting of cycled Phis" This reverts commit `fc539b0004`. Causes miscompiles, see D110620.	2022-03-04 19:52:44 -08:00
Augie Fackler	dba73135c8	getAllocAlignment: respect allocalign attribute if present As with allocsize(), we prefer the table data to attributes. Differential Revision: https://reviews.llvm.org/D118263	2022-03-04 15:57:54 -05:00
Augie Fackler	5e4c75db3b	InstructionCombining: avoid eliding mismatched alloc/free pairs Prior to this change LLVM would happily elide a call to any allocation function and a call to any free function operating on the same unused pointer. This can cause problems in some obscure cases, for example if the body of operator::new can be inlined but the body of operator::delete can't, as in this example from jyknight: #include <stdlib.h> #include <stdio.h> int allocs = 0; void operator new(size_t n) { allocs++; void mem = malloc(n); if (!mem) abort(); return mem; } __attribute__((noinline)) void operator delete(void mem) noexcept { allocs--; free(mem); } void deleteit(inti) { delete i; } int main() { int*i = new int; deleteit(i); if (allocs != 0) printf("MEMORY LEAK! allocs: %d\n", allocs); } This patch addresses the issue by introducing the concept of an allocator function family and uses it to make sure that alloc/free function pairs are only removed if they're in the same family. Differential Revision: https://reviews.llvm.org/D117356	2022-03-04 10:41:10 -05:00
Florian Hahn	5a60260efe	[IVDescriptor] Use DT to check order of Previous, OtherPrev. Previous and OhterPrev may not be in the same block. Use DT::dominates instead of local comesBefore. DT::dominates is already used earlier to check the order of Previous and SinkCandidate. Fixes https://github.com/llvm/llvm-project/issues/54195	2022-03-04 11:07:42 +00:00
Jez Ng	dd29597e10	[LTO] Initialize canAutoHide() using canBeOmittedFromSymbolTable() Per discussion on https://reviews.llvm.org/D59709#inline-1148734, this seems like the right course of action. `canBeOmittedFromSymbolTable()` subsumes and generalizes the previous logic. In addition to handling `linkonce_odr` `unnamed_addr` globals, we now also internalize `linkonce_odr` + `local_unnamed_addr` constants. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D120173	2022-03-03 19:04:11 -05:00
Arthur Eubanks	41e792d725	[CostModel] Change printer pass wording to work with update_analyze_test_checks.py update_analyze_test_checks.py looks for very specific wording, update the printer pass to match the legacy `-analyze -cost-model` wording.	2022-03-03 10:10:48 -08:00
Craig Topper	608161225e	[InstCombine][Analysis] Move getFCmpCode and getPredForFCmpCode to CmpInstAnalysis. NFC The similar getICmpCode and getPredForICmpCode are already there. This moves FP for consistency. I think InstCombine is currently the only user of both. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D120754	2022-03-03 09:33:24 -08:00
Florian Hahn	139215af8e	[IVDescriptor] Find original 'Previous' for first-order recurrences. This patch extends first-order recurrence handling to support cases where we already sunk an instruction for a different recurrence, but LastPrev comes before Previous. To handle those cases correctly, we need to find the earliest entry for the sink-after chain, because this is references the Previous from the original recurrence. This is needed to ensure we use the correct instruction as sink point. Depends on D118558. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D118642	2022-03-03 16:41:26 +00:00
serge-sans-paille	81a1760cac	Revert "Add missing include under EXPENSIVE_CHECK" This reverts commit `eeaca53df7`. It's a duplicate of https://reviews.llvm.org/rG50874a188b94a25827963956887b878d3701509a	2022-03-03 07:56:34 +01:00
serge-sans-paille	a494ae43be	Cleanup includes: TransformsUtils Estimation on the impact on preprocessor output: before: 1065307662 after: 1064800684 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120741	2022-03-01 21:00:07 +01:00
serge-sans-paille	eeaca53df7	Add missing include under EXPENSIVE_CHECK This is a followup to 344f8ec3048b6eeef94569800acb012f794ad372 It should fix https://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-expensive/21961/console	2022-03-01 21:00:06 +01:00
Fangrui Song	50874a188b	Fix -DLLVM_ENABLE_EXPENSIVE_CHECKS=on build after D120659	2022-03-01 11:36:25 -08:00
Mircea Trofin	261419273a	Fix build breaks on ml-* bots introduced by include cleanups	2022-03-01 11:29:18 -08:00
Craig Topper	7bc6667845	[Analysis] Simplify the interface to llvm::getICmpCode. NFC Instead of passing an InstCmpInt * and a bool just pass the predicate from the caller. I'm considering moving the similar FCmp functions from InstCombine over here and this makes the interface consistent with what is used for FCmp. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D120609	2022-03-01 09:53:27 -08:00
serge-sans-paille	71c3a5519d	Cleanup includes: LLVMAnalysis Number of lines output by preprocessor: before: 1065940348 after: 1065307662 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120659	2022-03-01 18:01:54 +01:00
Nikita Popov	aeab6167b0	[SCEV] Only verify BECounts for reachable loops (PR50523) For unreachable loops, any BECount is legal, and since D98706 SCEV can make use of this for loops that are unreachable due to constant branches. To avoid false positives, adjust SCEV verification to only check BECounts in reachable loops. Fixes https://github.com/llvm/llvm-project/issues/50523. Differential Revision: https://reviews.llvm.org/D120651	2022-03-01 11:52:35 +01:00
Nikita Popov	3c53d3a733	[InlineCost] Use SmallPtrSet for DeadBlocks (NFC) This set is only used with contains operations, so there is no need to use a SetVector.	2022-02-28 15:26:22 +01:00
Serge Pavlov	6982c38cb1	[ConstantFolding] Fix folding of constrained compare intrinsics The change fixes treatment of constrained compare intrinsics if compared values are of vector type. Differential revision: https://reviews.llvm.org/D110322	2022-02-27 10:19:19 +07:00
Nikita Popov	2d0fc3e46f	[SCEV] Return ArrayRef from getSCEVValues() (NFC) Return a read-only view on this set. For the one internal use, directly access ExprValueMap.	2022-02-25 09:32:22 +01:00
Nikita Popov	d9715a7266	[SCEV] Don't try to reuse expressions with offset SCEVs ExprValueMap currently tracks not only which IR Values correspond to a given SCEV expression, but additionally stores that it may be expanded in the form X+Offset. In theory, this allows reusing existing IR Values in more cases. In practice, this doesn't seem to be particularly useful (the test changes are rather underwhelming) and adds a good bit of complexity. Per https://github.com/llvm/llvm-project/issues/53905, we have an invalidation issue with these offseted expressions. Differential Revision: https://reviews.llvm.org/D120311	2022-02-25 09:16:48 +01:00
Mircea Trofin	7e3606f43c	[ScalarEvolution] Control flag for nonstrict inequalities in finite loops D118090 causes a pretty significant (19%) regression in some Eigen benchmarks. Investigating is a bit time consuming as the compilation unit where this occurs is large. Rather than revert, this patch adds a flag controlling that behavior (enabled by default).	2022-02-23 17:56:35 -08:00
Malhar Jajoo	9f1c6fbf11	[LAA] Add remarks for unbounded array access Adds new optimization remarks when loop vectorization fails due to the compiler being unable to find bound of an array access inside a loop Differential Revision: https://reviews.llvm.org/D115873	2022-02-23 15:57:39 +00:00
Sanjay Patel	fc3b34c508	[InstSimplify] remove shift that is redundant with part of funnel shift In D111530, I suggested that we add some relatively basic pattern-matching folds for shifts and funnel shifts and avoid a more specialized solution if possible. We can start by implementing at least one of these in IR because it's easier to write the code and verify with Alive2: https://alive2.llvm.org/ce/z/qHpmNn This will need to be adapted/extended for SDAG to handle the motivating bug ( #49541 ) because the patterns only appear later with that example (added some tests: `bb850d422b`) This can be extended within InstSimplify to handle cases where we 'and' with a shift too (in that case, kill the funnel shift). We could also handle patterns where the shift and funnel shift directions are inverted, but I think it's better to canonicalize that instead to avoid pattern-match case explosion. Differential Revision: https://reviews.llvm.org/D120253	2022-02-23 09:10:01 -05:00
Thomas Preud'homme	40f9081958	[LAA] Add missing newline in debug print	2022-02-23 13:25:16 +00:00
Nikita Popov	6777ec9e4d	[ValueTracking] Support signed intrinsic clamp This is the same special logic we apply for SPF signed clamps when computing the number of sign bits, just for intrinsics. This just uses the same logic as the select case, but there's multiple directions this could be improved in: We could also use the num sign bits from the clamped value, we could do this during constant range calculation, and there's probably unsigned analogues for the constant range case at least.	2022-02-23 12:45:16 +01:00
Bill Wendling	a5bbc6ef99	[NFC] Remove unnecessary "#include"s from header files	2022-02-23 01:20:48 -08:00

1 2 3 4 5 ...

11364 Commits