llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazu Hirata	3a3cb929ab	[llvm] Use = default (NFC)	2022-02-06 22:18:35 -08:00
Joseph Huber	9d3a47576c	[PassBuilder] Add OpenMPOpt to default LTO pipeline The LTO support for OpenMP offloading allows us to run the OpenMPOpt pass during the LTO pipeline. This patch introduces an early run of the Module pass and a late run of the CGSCC pass. These are quick no-ops if there is no OpenMP in the module. Depends on D118198 Differential Revision: https://reviews.llvm.org/D118611	2022-01-31 23:11:43 -05:00
Bjorn Pettersson	464be7af28	[NewPM] Add debugify and check-debugify to the PassRegistry Adding -debugify and -check-debugify in the PassRegistry will make sure the passes are listed properly by -print-pipeline-passes as well as -print-passes. It also allows removal of the custom pipeline parsing callback that has been used in the NewPMDriver. Differential Revision: https://reviews.llvm.org/D118369	2022-01-27 19:23:56 +01:00
Sjoerd Meijer	f269ec230e	[LoopFlatten] Move it from LPM2 to LPM1 In D110057 we moved LoopFlatten to a LoopPassManager. This caused a performance regression for our 64-bit targets (the 32-bit were unaffected), the pass is no longer triggering for a motivating example. The reason is that the IR is just very different than expected; we try to match loop statements and particular uses of induction variables. The easiest is to just move LoopFlatten to a place in the pipeline where the IR is as expected, which is just before IndVarSimplify. This means we move it from LPM2 to LPM1, so that it actually runs just a bit earlier from where it was running before. IndVarSimplify is responsible for significant rewrites that are difficult to "look through" in LoopFlatten. Differential Revision: https://reviews.llvm.org/D116612	2022-01-19 14:38:05 +00:00
Sjoerd Meijer	016022e5da	Recommit "[LoopFlatten] Move it to a LoopPassManager" This was reverted because of a performance regression, which is fixed by D116612 that I will commit directly after this change. This reverts commit `e92d63b467`.	2022-01-19 14:38:05 +00:00
Mircea Trofin	3e8553aab4	[mlgo][inline] Improve global state tracking The global state refers to the number of the nodes currently in the module, and the number of direct calls between nodes, across the module. Node counts are not a problem; edge counts are because we want strictly the kind of edges that affect inlining (direct calls), and that is not easily obtainable without iteration over the whole module. This patch avoids relying on analysis invalidation because it turned out to be too aggressive in some cases. It leverages the fact that Node objects are stable - they do not get deleted while cgscc passes are run over the module; and cgscc pass manager invariants. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D115847	2022-01-18 17:45:34 +00:00
Kevin Athey	7ea175d1c6	Add 'eager-checks' as a module parameter to MSAN. This creates a way to configure MSAN to for eager checks that will be leveraged by the introduction of a clang flag (-fsanitize-memory-param-retval). This is redundant with the existing flag: -mllvm -msan-eager-checks. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D116855	2022-01-11 14:30:49 -08:00
David Green	e92d63b467	Revert "[LoopFlatten] Move it to a LoopPassManager" This commit caused performance regressions due to differences in the expected code during loop flattening. Reverting it until the fix is ready, which hopefully wont take too long. This reverts commit `86825fc2fb`.	2022-01-10 11:03:49 +00:00
Evgeny Mandrikov	2ccf0b76bc	Fix build failure with GCC 11 in C++20 mode See https://wg21.link/cwg2237 Reviewed By: shafik, dexonsmith Differential Revision: https://reviews.llvm.org/D115355	2022-01-06 17:20:26 +01:00
Daniil Suchkov	524abc68f2	Introduce NewPM .dot printers for DomTree This patch adds a couple of NewPM function passes (dot-dom and dot-dom-only) that dump DomTree into .dot files. Reviewed-By: aeubanks Differential Revision: https://reviews.llvm.org/D116629	2022-01-05 23:25:40 +00:00
Sjoerd Meijer	86825fc2fb	[LoopFlatten] Move it to a LoopPassManager In D109958 it was noticed that we could optimise the pipeline and avoid rerunning LoopSimplify/LCSSA for LoopFlatten by moving it to a LoopPassManager. Differential Revision: https://reviews.llvm.org/D110057	2021-12-30 12:32:14 +00:00
Jamie Schmeiser	864ca42483	Revert "A new hidden option exec-on-ir-change=exe that calls exe each time IR changes" This reverts commit `dff0e8b4ff`. Failing build: https://lab.llvm.org/buildbot/#/builders/42/builds/2800	2021-12-16 10:54:11 -05:00
Jamie Schmeiser	dff0e8b4ff	A new hidden option exec-on-ir-change=exe that calls exe each time IR changes Summary: A new option exec-on-ir-changed is defined that allows one to specify an exe that is called after each pass in the opt pipeline that changes the IR. The exec-on-ir-change=exe option saves the IR in a temporary file and calls exe with the name of the file and the name of the pass that just changed it after each pass alters the IR. exe is also called with the initial IR. This can be used, for example, to determine which pass corrupts the IR by having exe as a script that calls llc and runs a test to see after which pass the results change. The print-changed filtering options are respected. Note that this is only supported with the new pass manager. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: aeubanks (Arthur Eubanks) Differential Revision: https://reviews.llvm.org/D110776	2021-12-16 09:01:49 -05:00
Florian Hahn	acea6e9cfa	[Passes] Only run extra vector passes if loops have been vectorized. This patch uses a similar trick as in D113947 to only run the extra passes after vectorization on functions where loops have been vectorized. The reason for running the 'extra vector passes' is simplification/unswitching of the runtime checks created by LV, there should be no need to run them if nothing got vectorized To do that, a new dummy analysis ShouldRunExtraVectorPasses has been added. If loops have been vectorized for a function, LV will cache the analysis. At the moment it uses MadeCFGChanges as proxy for loop vectorized, which isn't perfect (it could be too aggressive, e.g. because no runtime checks have been added), but should be good enough for now. The extra passes are now managed by a new FunctionPassManager that runs its passes only if ShouldRunExtraVectorPasses has been cached. Without this patch, `-extra-vectorizer-passes` has the following compile-time impact: NewPM-O3: +4.86% NewPM-ReleaseThinLTO: +3.56% NewPM-ReleaseLTO-g: +7.17% http://llvm-compile-time-tracker.com/compare.php?from=ead3979a92fc33add4710c4510d6906260dcb4ad&to=c292da649e2c6e88a31e702fdc474727d09c72bc&stat=instructions With this patch, that gets reduced to NewPM-O3: +1.43% NewPM-ReleaseThinLTO: +1.00% NewPM-ReleaseLTO-g: +1.58% http://llvm-compile-time-tracker.com/compare.php?from=ead3979a92fc33add4710c4510d6906260dcb4ad&to=e67d86b57810011cf285eb9aa1944781be6096f0&stat=instructions It is probably still too high to enable by default, but much better. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D115052	2021-12-10 11:42:45 +00:00
Sameer Sahasrabuddhe	1d0244aed7	Reapply CycleInfo: Introduce cycles as a generalization of loops Reverts `02940d6d22`. Fixes breakage in the modules build. LLVM loops cannot represent irreducible structures in the CFG. This change introduce the concept of cycles as a generalization of loops, along with a CycleInfo analysis that discovers a nested hierarchy of such cycles. This is based on Havlak (1997), Nesting of Reducible and Irreducible Loops. The cycle analysis is implemented as a generic template and then instatiated for LLVM IR and Machine IR. The template relies on a new GenericSSAContext template which must be specialized when used for each IR. This review is a restart of an older review request: https://reviews.llvm.org/D83094 Original implementation by Nicolai Hähnle <nicolai.haehnle@amd.com>, with recent refactoring by Sameer Sahasrabuddhe <sameer.sahasrabuddhe@amd.com> Differential Revision: https://reviews.llvm.org/D112696	2021-12-10 14:36:43 +05:30
Fraser Cormack	eb87f668fe	[NewPM] Port FlattenCFGPass to NPM Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D115361	2021-12-09 07:55:02 +00:00
Arthur Eubanks	c4ce426504	Revert "A new hidden option test-changed=exe that calls exe after each time IR changes" This reverts commit `f9235e45fd`. Causes breakages on Windows: http://45.33.8.238/win/50453/step_11.txt.	2021-12-08 13:58:10 -08:00
Jamie Schmeiser	f9235e45fd	A new hidden option test-changed=exe that calls exe after each time IR changes Summary: A new option test-changed is defined that allows one to specify an exe that is called after each pass in the opt pipeline that changes the IR. The test-changed=exe option saves the IR in a temporary file and calls exe with the name of the file and the name of the pass that just changed it after each pass alters the IR. exe is also called with the initial IR. This can be used, for example, to determine which pass corrupts the IR by having exe as a script that calls llc and runs a test to see after which pass the results change. The print-changed filtering options are respected. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: aeubanks (Arthur Eubanks) Differential Revision: https://reviews.llvm.org/D110776	2021-12-08 14:23:31 -05:00
Jamie Schmeiser	84687405ce	Remove Colours array in -print-changed=dot-cfg Summary: The Colours array is apparently the source of TSAN errors. It is unnecessary and was there to ease readability of the code. Remove it to clean up the TSAN errors. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: aeubanks (Arthur Eubanks) Differential Revision: https://reviews.llvm.org/D115175	2021-12-08 09:50:51 -05:00
Jonas Devlieghere	02940d6d22	Revert "CycleInfo: Introduce cycles as a generalization of loops" This reverts commit `0fe61ecc2c` because it breaks the modules build. https://green.lab.llvm.org/green/job/clang-stage2-rthinlto/4858/ https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/39112/	2021-12-07 13:06:34 -08:00
Arthur Eubanks	5c7e783ebe	[NFC] Clarify comment about LoopDeletionPass in the optimization pipeline Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D115179	2021-12-07 09:58:12 -08:00
Sameer Sahasrabuddhe	0fe61ecc2c	CycleInfo: Introduce cycles as a generalization of loops LLVM loops cannot represent irreducible structures in the CFG. This change introduce the concept of cycles as a generalization of loops, along with a CycleInfo analysis that discovers a nested hierarchy of such cycles. This is based on Havlak (1997), Nesting of Reducible and Irreducible Loops. The cycle analysis is implemented as a generic template and then instatiated for LLVM IR and Machine IR. The template relies on a new GenericSSAContext template which must be specialized when used for each IR. This review is a restart of an older review request: https://reviews.llvm.org/D83094 Original implementation by Nicolai Hähnle <nicolai.haehnle@amd.com>, with recent refactoring by Sameer Sahasrabuddhe <sameer.sahasrabuddhe@amd.com> Differential Revision: https://reviews.llvm.org/D112696	2021-12-07 12:02:34 +05:30
Nikita Popov	ae7f468073	[NewPM] Fix MergeFunctions scheduling MergeFunctions (as well as HotColdSplitting an IROutliner) are incorrectly scheduled under the new pass manager. The code makes it look like they run towards the end of the module optimization pipeline (as they should), while in reality the run at the start. This is because the OptimizePM populated around them is only scheduled later. I'm fixing this by moving these three passes until after OptimizePM to avoid splitting the function pass pipeline. It doesn't seem important to me that some of the function passes run after these late module passes. Differential Revision: https://reviews.llvm.org/D115098	2021-12-04 17:30:30 +01:00
Anton Afanasyev	c34d157fc7	[Passes] Move AggressiveInstCombine after InstCombine Swap AIC and IC neighbouring in pipeline. This looks more natural and even almost has no effect for now (three slightly touched tests of test-suite). Also this could be the first step towards merging AIC (or its part) to -O2 pipeline. After several changes in AIC (like D108091, D108201, D107766, D109515, D109236) there've been observed several regressions (like PR52078, PR52253, PR52289) that were fixed in different passes (see D111330, D112721) by extending their functionality, but these regressions were exposed since changed AIC prevents IC from making some of early optimizations. This is common problem and it should be fixed by just moving AIC after IC which looks more logically by itself: make aggressive instruction combining only after failed ordinary one. Fixes PR52289 Reviewed By: spatel, RKSimon Differential Revision: https://reviews.llvm.org/D113179	2021-12-04 14:22:43 +03:00
Nikita Popov	5b94037a30	[PhaseOrdering] Add test for incorrect merge function scheduling Add an -enable-merge-functions option to allow testing of function merging as it will actually happen in the optimization pipeline. Based on that add a test where we currently produce two identical functions without merging them due to incorrect pass scheduling under the new pass manager.	2021-12-04 10:12:04 +01:00
Liqiang Tao	7e8f9d6b38	[llvm][Inline] Add FunctionSimplificationPipeline to module inliner pipeline The FunctionSimplificationPipeline could effectively reduce the size of .text section when module inliner is enabled. Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D114704	2021-12-03 12:13:31 +09:00
Florian Hahn	770a50b28c	[AnnotationRemarks] Support generating annotation remarks with -O0. This matches the legacy pass manager behavior. If remarks are not enabled the pass is effectively a no-op.	2021-12-02 15:01:02 +00:00
Kazu Hirata	f240e528ce	[llvm] Use range-based for loops (NFC)	2021-11-29 09:04:44 -08:00
Arthur Eubanks	e3e25b5112	[NewPM] Add option to prevent rerunning function pipeline on functions in CGSCC adaptor In a CGSCC pass manager, we may visit the same function multiple times due to SCC mutations. In the inliner pipeline, this results in running the function simplification pipeline on a function multiple times even if it hasn't been changed since the last function simplification pipeline run. We use a newly introduced analysis to keep track of whether or not a function has changed since the last time the function simplification pipeline has run on it. If we see this analysis available for a function in a CGSCCToFunctionPassAdaptor, we skip running the function passes on the function. The analysis is queried at the end of the function passes so that it's available after the first time the function simplification pipeline runs on a function. This is a per-adaptor option so it doesn't apply to every adaptor. The goal of this is to improve compile times. However, currently we can't turn this on by default at least for the higher optimization levels since the function simplification pipeline is not robust enough to be idempotent in many cases, resulting in performance regressions if we stop running the function simplification pipeline on a function multiple times. We may be able to turn this on for -O1 in the near future, but turning this on for higher optimization levels would require more investment in the function simplification pipeline. Heavily inspired by D98103. Example compile time improvements with flag turned on: https://llvm-compile-time-tracker.com/compare.php?from=998dc4a5d3491d2ae8cbe742d2e13bc1b0cacc5f&to=5c27c913687d3d5559ef3ab42b5a3d513531d61c&stat=instructions Reviewed By: asbirlea, nikic Differential Revision: https://reviews.llvm.org/D113947	2021-11-17 09:06:46 -08:00
Arthur Eubanks	19867de9e7	[NewPM] Only invalidate modified functions' analyses in CGSCC passes + turn on eagerly invalidate analyses Previously, any change in any function in an SCC would cause all analyses for all functions in the SCC to be invalidated. With this change, we now manually invalidate analyses for functions we modify, then let the pass manager know that all function analyses should be preserved since we've already handled function analysis invalidation. So far this only touches the inliner, argpromotion, function-attrs, and updateCGAndAnalysisManager(), since they are the most used. This is part of an effort to investigate running the function simplification pipeline less on functions we visit multiple times in the inliner pipeline. However, this causes major memory regressions especially on larger IR. To counteract this, turn on the option to eagerly invalidate function analyses. This invalidates analyses on functions immediately after they're processed in a module or scc to function adaptor for specific parts of the pipeline. Within an SCC, if a pass only modifies one function, other functions in the SCC do not have their analyses invalidated, so in later function passes in the SCC pass manager the analyses may still be cached. It is only after the function passes that the eager invalidation takes effect. For the default pipelines this makes sense because the inliner pipeline runs the function simplification pipeline after all other SCC passes (except CoroSplit which doesn't request any analyses). Overall this has mostly positive effects on compile time and positive effects on memory usage. https://llvm-compile-time-tracker.com/compare.php?from=7f627596977624730f9298a1b69883af1555765e&to=39e824e0d3ca8a517502f13032dfa67304841c90&stat=instructions https://llvm-compile-time-tracker.com/compare.php?from=7f627596977624730f9298a1b69883af1555765e&to=39e824e0d3ca8a517502f13032dfa67304841c90&stat=max-rss D113196 shows that we slightly regressed compile times in exchange for some memory improvements when turning on eager invalidation. D100917 shows that we slightly improved compile times in exchange for major memory regressions in some cases when invalidating less in SCC passes. Turning these on at the same time keeps the memory improvements while keeping compile times neutral/slightly positive. Reviewed By: asbirlea, nikic Differential Revision: https://reviews.llvm.org/D113304	2021-11-15 14:44:53 -08:00
Arthur Eubanks	1d8750c3da	[NFC] Rename GVN -> GVNPass and SROA -> SROAPass To be more consistent with other pass struct names. There are still more passes that don't end with "Pass", but these are the important ones. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D112935	2021-11-09 10:35:58 -08:00
Liqiang Tao	6cad45d5c6	[llvm][Inline] Add a module level inliner Add module level inliner, which is a minimum viable product at this point. Also add some tests for it. RFC: https://lists.llvm.org/pipermail/llvm-dev/2021-August/152297.html Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D106448	2021-11-09 11:03:29 +08:00
Whitney Tsang	93421108d2	Add NoOpLoopNestPass and LOOPNEST_PASS macro Having a NoOpLoopNestPass can ensure that only outermost loop is invoked for a LoopNestPass with a lit test. There are some existing passes that are implemented as LoopNestPass, but they are still using LOOP_PASS macro. It would be easier to identify LoopNestPasses with a LOOPNEST_PASS macro. Differential Revision: https://reviews.llvm.org/D113185	2021-11-05 16:11:48 +00:00
Arthur Eubanks	7175886a0f	[NewPM] Make eager analysis invalidation per-adaptor Follow-up change to D111575. We don't need eager invalidation on every adaptor. Most notably, adaptors running passes that use very few analyses, or passes that purely invalidate specific analyses. Also allow testing of this via a pipeline string "function<eager-inv>()". The compile time/memory impact of this is very comparable to D111575. https://llvm-compile-time-tracker.com/compare.php?from=9a2eec512a29df45c90c2fcb741e9d5c693b1383&to=b9f20bcdea138060967d95a98eab87ce725b22bb&stat=instructions Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D113196	2021-11-04 17:16:11 -07:00
Arthur Eubanks	13317286f8	[NewPM] Use the default AA pipeline by default We almost always want to use the default AA pipeline. It's very easy for users of PassBuilder to forget to customize the AAManager to use the default AA pipeline (for example, the NewPM C API forgets to do this). If somebody wants a custom AA pipeline, similar to what is being done now with the default AA pipeline registration, they can FAM.registerPass([&] { return std::move(MyAA); }); before calling PB.registerFunctionAnalyses(FAM); For example, LTOBackend.cpp and NewPMDriver.cpp do this. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D113210	2021-11-04 15:10:34 -07:00
Jamie Schmeiser	8720149d9b	Remove unused function from print-changed=dot-cfg code Summary: Remove unused function from print-changed=dot-cfg code to silence a gcc compiler warning. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: uabelho(Mikael Holmen) Differential Revision: https://reviews.llvm.org/D113188	2021-11-04 10:40:50 -04:00
Sjoerd Meijer	3fd1902ad8	[FuncSpec] Enable it only with -O3 Function specialisation was running at all optimisation levels (if enabled on the command line, it is not on by default). That was an oversight and not something we want to do. Function specialisation duplicates functions when it triggers, so the backend is processing more functions/instructions resulting in compile-time increases, which seems more appropriate with -O3 and inline with GCC. Please note that since function specialisation is not enabled by default, this didn't require updating any pass manager tests. Differential Revision: https://reviews.llvm.org/D112129	2021-11-04 13:59:00 +00:00
Vitaly Buka	32eb697c0a	[PassBuilder] Remove unused function after D113072	2021-11-03 12:03:17 -07:00
Vitaly Buka	3131714f8d	[NFC][asan] Use AddressSanitizerOptions in ModuleAddressSanitizerPass Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D113072	2021-11-03 11:32:14 -07:00
Kirill Stoimenov	b3145323b5	Revert "[ASan] Process functions in Asan module pass" This reverts commit `76ea87b94e`. Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D113129	2021-11-03 18:01:01 +00:00
Kirill Stoimenov	76ea87b94e	[ASan] Process functions in Asan module pass This came up as recommendation while reviewing D112098. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D112732	2021-11-03 17:51:01 +00:00
Roman Lebedev	9c2469c1dd	[PassManager] `buildModuleOptimizationPipeline()`: schedule `LoopDeletion` pass run before vectorization passes Test thanks to Michael Kuklinski from `#llvm`: https://godbolt.org/z/bdrah5Goo originally inspired by Daniel Lemire's https://lemire.me/blog/2021/10/26/in-c-is-empty-faster-than-comparing-the-size-with-zero/ We manage to deduce that the answer does not require looping, but we do that after the last `LoopDeletion` pass run, so we end up being stuck with a dead loop. Now, as with all things SCEV, this has a very expected ~`+0.12%` compile time performance regression: https://llvm-compile-time-tracker.com/compare.php?from=0ae7bf124a9bca76dd9a91b2f7379168ff13f562&to=c2ae57c9b961aeb4a28c747266949340613a6d84&stat=instructions (for comparison, doing that in function simplification pipeline would have been ~`+0.5` compile time performance regression, D112840) Looking at the transformation stats over vanilla test-suite, i think it's rather expected: ``` \| statistic name \| baseline \| proposed \| Δ \| % \| \|%\| \| \|--------------------------------------------------\|----------:\|----------:\|------:\|-------:\|-------:\| \| scalar-evolution.NumBruteForceTripCountsComputed \| 789 \| 888 \| 99 \| 12.55% \| 12.55% \| \| scalar-evolution.NumTripCountsNotComputed \| 105592 \| 117900 \| 12308 \| 11.66% \| 11.66% \| \| loop-delete.NumBackedgesBroken \| 542 \| 559 \| 17 \| 3.14% \| 3.14% \| \| regalloc.numExtends \| 81 \| 79 \| -2 \| -2.47% \| 2.47% \| \| indvars.NumFoldedUser \| 408 \| 400 \| -8 \| -1.96% \| 1.96% \| \| indvars.NumElimCmp \| 3831 \| 3758 \| -73 \| -1.91% \| 1.91% \| \| scalar-evolution.NumTripCountsComputed \| 299759 \| 304278 \| 4519 \| 1.51% \| 1.51% \| \| loop-delete.NumDeleted \| 8055 \| 8128 \| 73 \| 0.91% \| 0.91% \| \| machine-cse.NumCommutes \| 111 \| 110 \| -1 \| -0.90% \| 0.90% \| \| globaldce.NumFunctions \| 1187 \| 1192 \| 5 \| 0.42% \| 0.42% \| \| codegenprepare.NumSelectsExpanded \| 277 \| 278 \| 1 \| 0.36% \| 0.36% \| \| loop-unroll.NumRuntimeUnrolled \| 13841 \| 13791 \| -50 \| -0.36% \| 0.36% \| \| machinelicm.NumPostRAHoisted \| 1168 \| 1172 \| 4 \| 0.34% \| 0.34% \| \| phi-node-elimination.NumCriticalEdgesSplit \| 83054 \| 82879 \| -175 \| -0.21% \| 0.21% \| \| machine-cse.NumPREs \| 3085 \| 3079 \| -6 \| -0.19% \| 0.19% \| \| branch-folder.NumBranchOpts \| 108122 \| 107942 \| -180 \| -0.17% \| 0.17% \| \| loop-unroll.NumUnrolled \| 40136 \| 40067 \| -69 \| -0.17% \| 0.17% \| \| branch-folder.NumDeadBlocks \| 130818 \| 130607 \| -211 \| -0.16% \| 0.16% \| \| codegenprepare.NumBlocksElim \| 92856 \| 92714 \| -142 \| -0.15% \| 0.15% \| \| instsimplify.NumSimplified \| 103263 \| 103129 \| -134 \| -0.13% \| 0.13% \| \| instcombine.NumConstProp \| 26070 \| 26102 \| 32 \| 0.12% \| 0.12% \| \| instsimplify.NumExpand \| 1716 \| 1718 \| 2 \| 0.12% \| 0.12% \| \| loop-unroll.NumCompletelyUnrolled \| 9236 \| 9225 \| -11 \| -0.12% \| 0.12% \| \| branch-folder.NumHoist \| 2773 \| 2770 \| -3 \| -0.11% \| 0.11% \| \| regalloc.NumReloadsRemoved \| 10822 \| 10834 \| 12 \| 0.11% \| 0.11% \| \| regalloc.NumSnippets \| 11394 \| 11406 \| 12 \| 0.11% \| 0.11% \| \| machine-cse.NumCrossBBCSEs \| 1052 \| 1053 \| 1 \| 0.10% \| 0.10% \| \| machinelicm.NumCSEed \| 99887 \| 99784 \| -103 \| -0.10% \| 0.10% \| \| branch-folder.NumTailMerge \| 72501 \| 72435 \| -66 \| -0.09% \| 0.09% \| \| codegenprepare.NumExtUses \| 22007 \| 21987 \| -20 \| -0.09% \| 0.09% \| \| local.NumRemoved \| 68232 \| 68294 \| 62 \| 0.09% \| 0.09% \| \| loop-vectorize.LoopsAnalyzed \| 75483 \| 75413 \| -70 \| -0.09% \| 0.09% \| ``` Note that i'm only changing current PM, and not touching obsolete PM. This is an alternative to the function simplification pipeline variant of the same change, D112840. It has both less compile time impact (since the additional number of SCEV trip count calculations is way lass less than with the D112840), and it is much more powerful/impactful (almost 2x more loops deleted). I have checked, and doing this after loop rotation is favorable (more loops deleted). Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D112851	2021-11-03 19:24:49 +03:00
Christopher Tetreault	5718b9f128	[NFC] Reformat VerifyPreservedCFG for non-CPP-aware syntax highlighters * Move `);` outside the #ENDIF. Syntax highlighters that highlight missed closing parens, but are not aware of the C Preprocessor saw the original code as having missed parens.	2021-11-02 11:35:38 -07:00
Jamie Schmeiser	816761f044	Add new choices dot-cfg and dot-cfg-quiet to print-changed. Summary: Add new options -print-changed=[dot-cfg \| dot-cfg-quiet] which create a website of DOT files showing colourized changes as the IR is changed by passes in the new pass manager pipeline. A new change reporter is introduced that creates a website of changes made by passes in the opt pipeline that change the IR. The hidden option -dot-cfg-dir=<dir> specifies a directory (defaulting to "./") into which the website will be created. A file passes.html is created that contains a list of all the passes that act on the IR. Those that do not change the IR are listed as omitted because of no change, ignored or filtered out (using -filter-print-func and -filter-passes) or not listed in quiet mode. Those that do change the IR are listed as a link to a DOT file which contains a CFG depiction of the IR (ala -dot-cfg) except that the instructions, basic blocks and links that are only in the IR before the pass (ie, removed) and those that are only in the IR after the pass (ie, added) are shown in red and green, respectively, while the aspects of the CFG that do not change are shown in black. Additional hidden options -dot-cfg-before-color=<dot named color>, -dot-cfg-after-color=<dot named color> and -dot-cfg-common-color=<dot named color> are defined that allow the customization of the colors used in colorizing the CFG. -change-printer-dot-path=<path to dot exe> is also added. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: aeubanks (Arthur Eubanks) Differential Revision: https://reviews.llvm.org/D87202	2021-11-02 12:06:25 -04:00
Jamie Schmeiser	3af474c0a1	Changes to print-changed classes in preparation for DotCfg change printer Summary: Break out non-functional changes to the print-changed classes that are needed for reuse with the DotCfg change printer in https://reviews.llvm.org/D87202. Various changes to the change printers to facilitate reuse with the upcoming DotCfg change printer. This includes changing several of the classes and their support classes to being templates. Also, some template parameter names were simplified to avoid confusion with planned identifiers in the DotCfg change printer to come. A virtual function in the class for comparing functions was changed to a lambda. The virtual function same was replaced with calls to operator==. The only intentional functional change was to add the exe name as the first parameter to llvm::sys::ExecuteAndWait Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: aeubanks (Arthur Eubanks) Differential Revision: https://reviews.llvm.org/D110737	2021-10-19 10:58:40 -04:00
Florian Hahn	4a1d63d7d0	[VectorCombine] Add option to only run scalarization transforms. This patch adds a pass option to only run transforms that scalarize vector operations and do not create new vector instructions. When running VectorCombine early in the pipeline introducing new vector operations can have negative effects, like blocking loop or SLP vectorization. To avoid regressions, restrict the early VectorCombine run (when using -enable-matrix) to only perform scalarization and not introduce new vector operations. This is done as option to the pass directly, which is then set when adding the pass to the pipeline. This is done for the new pass manager only. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D111800	2021-10-15 20:35:58 +01:00
Hongtao Yu	42ad7e1bc9	[CSSPGO] Turn off PseudoProbeUpdatePass for non-FDO builds. PseudoProbeUpdatePass is used to distribute sample counts among dulplicated probes. It doesn't make sense for it to run without a sample profile. The pass takes 1% of the build time. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D111847	2021-10-14 17:12:49 -07:00
Arthur Eubanks	a4095df44c	Don't print uselistorder in --print-changed Using uselistorders is fairly niche, it shouldn't be on by default and mostly just clutters the output. Reviewed By: jamieschmeiser Differential Revision: https://reviews.llvm.org/D111282	2021-10-07 14:48:18 -07:00
Jamie Schmeiser	bd5befb550	Revert "Reland A new option -print-on-crash that prints the IR as it was upon entering the last pass when there is a crash." This reverts commit `13d1592716`.	2021-10-07 15:23:48 -04:00
Jamie Schmeiser	13d1592716	Reland A new option -print-on-crash that prints the IR as it was upon entering the last pass when there is a crash. Summary: The IR is saved in its print form before each pass is started and a signal handler is registered. If the compilation crashes, the signal handler will print the saved IR to dbgs(). This option can be modified using -print-module-scope to get the IR for the complete module. Filtering options can be used to improve performance by limiting which passes (or functions) save the IR. Note that this option only works with the new pass manager. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: aeubanks (Arthur Eubanks) yrouban (Yevgeny Rouban) Differential Revision: https://reviews.llvm.org/D86657	2021-10-07 15:02:19 -04:00

1 2 3 4 5 ...

828 Commits