llvm-project

Commit Graph

Author	SHA1	Message	Date
Hongtao Yu	23191a4ffe	[CSSPGO][llvm-profgen] Do not duplicate context profiles into base profile when converting CS flat profile to nested. Recent experiments with our two large internal services showed that duplicating context profiles into base profile caused code size inflation and didn't deliver good performance compared to no such duplication. It was a trick we made to catch up with the CS flat profile and I'm now turning it off by default. The code size inflation mainly comes from the enriched based profiles. A base profile for a function represents the uninlined (or outlined) portion of the whole function running time. Such portion could be very small if a function is inlined into most of its hot callsites. Duplicating context profiles of the function into its base profiles could cause the outlined body to be hot enough and in turn get many of its callees inlined, thus increases the code size. The size inflation could further cause perf regression. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D124796	2022-05-12 09:29:25 -07:00
Teresa Johnson	655294866c	[memprof] Use unknown_function error type for missing functions Switch the error type when a function is not found in the memprof profile to unknown_function. This gives compatibility with normal PGO function matching, and also prevents issuing large numbers of additional matching errors since pgo-warn-missing-function is off by default. Differential Revision: https://reviews.llvm.org/D124953	2022-05-04 13:02:30 -07:00
Hongtao Yu	e36786d15f	[CSSPGO] Rename ProfileIsCSNested and ProfileIsCSFlat To be more clear and definitive, I'm renaming `ProfileIsCSFlat` back to `ProfileIsCS` which stands for full context-sensitive flat profiles. `ProfileIsCSNested` is now renamed to `ProfileIsPreInlined` and is extended to be applicable for CS flat profiles too. More specifically, `ProfileIsPreInlined` is for any kind of profiles (flat or nested) that contain 'ShouldBeInlined' contexts. The flag is encoded in the profile summary section for extbinary profiles and is computed on-the-fly for text profiles. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D122602	2022-04-29 17:03:52 -07:00
Snehasish Kumar	6dd6a6161f	[memprof] Deduplicate and outline frame storage in the memprof profile. The current implementation of memprof information in the indexed profile format stores the representation of each calling context fram inline. This patch uses an interned representation where the frame contents are stored in a separate on-disk hash table. The table is indexed via a hash of the contents of the frame. With this patch, the compressed size of a large memprof profile reduces by ~22%. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D123094	2022-04-08 09:15:20 -07:00
serge-sans-paille	01be9be2f2	Cleanup includes: final pass Cleanup a few extra files, this closes the work on libLLVM dependencies on my side. Impact on libLLVM preprocessed output: -35876 lines Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D122576	2022-03-29 09:00:21 +02:00
Kazu Hirata	d3e5f0ab01	Apply clang-tidy fixes for readability-redundant-smartptr-get in SampleProfReader.cpp (NFC)	2022-03-28 09:18:39 -07:00
Kazu Hirata	14415a3a5a	Apply clang-tidy fixes for readability-redundant-smartptr-get in InstrProfReader.cpp (NFC)	2022-03-28 09:18:38 -07:00
Snehasish Kumar	27a4f2545f	Reland "[memprof] Store callsite metadata with memprof records." This reverts commit `f4b794427e`. Reland with underlying msan issue fixed in D122260.	2022-03-22 14:40:02 -07:00
Mitch Phillips	f4b794427e	Revert "[memprof] Store callsite metadata with memprof records." This reverts commit `0d362c90d3`. Reason: Causes the MSan buildbot to fail (see comments on https://reviews.llvm.org/D121179 for more information	2022-03-21 15:59:13 -07:00
Snehasish Kumar	0d362c90d3	[memprof] Store callsite metadata with memprof records. To ease profile annotation, each of the callsites in a function can be annotated with profile data - "IR metadata format for MemProf" [1]. This patch extends the on-disk serialized record format to store the debug information for allocation callsites incl inline frames. This change is incompatible with the existing format i.e. indexed profiles must be regenerated, raw profiles are unaffected. [1] https://groups.google.com/g/llvm-dev/c/aWHsdMxKAfE/m/WtEmRqyhAgAJ Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D121179	2022-03-21 13:58:29 -07:00
Snehasish Kumar	c9a3d29613	[memprof] Update the frame is inline logic and unittests. Since DI frames are enumerated with the leaf function at index 0, this patch fixes the logic when IsInlineFrame is set. Also update the unittests to check that only the last frame is marked as non-inline from a set of DI Frames for a PC address. Differential Revision: https://reviews.llvm.org/D121830	2022-03-21 10:41:05 -07:00
Snehasish Kumar	49c048add4	[memprof] Add a test to verify callstack order. We add to ensure that we are observing the correct callstack order in memprof during symbolization. There was some confusion whether the order of DIFrame objects were reversed but in reality the leaf function is at index 0 so no code changes are required. Differential Revision: https://reviews.llvm.org/D121759	2022-03-16 10:10:57 -07:00
Igor Kudrin	1c99f650a7	[llvm-cov gcov] Fix calculating coverage of template functions Template functions share the same lines in source files, so the common container of lines' properties cannot be used to calculate the coverage statistics of individual functions. > cat tmpl.cpp template <int N> int test() { return N; } int main() { return test<1>() + test<2>(); } > clang++ --coverage tmpl.cpp -o tmpl > ./tmpl > llvm-cov gcov tmpl.cpp -f ... Function '_Z4testILi1EEiv' Lines executed:100.00% of 1 Function '_Z4testILi2EEiv' Lines executed:-nan% of 0 ... > llvm-cov-patched gcov tmpl.cpp -f ... Function '_Z4testILi1EEiv' Lines executed:100.00% of 1 Function '_Z4testILi2EEiv' Lines executed:100.00% of 1 ... Differential Revision: https://reviews.llvm.org/D121390	2022-03-15 20:46:22 +04:00
Fangrui Song	407c721ceb	[Support] Change zlib::compress to return void With a sufficiently large output buffer, the only failure is Z_MEM_ERROR. Check it and call the noreturn report_bad_alloc_error if applicable. resize_for_overwrite may call report_bad_alloc_error as well. Now that there is no other error type, we can replace the return type with void and simplify call sites. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D121512	2022-03-14 11:38:04 -07:00
Snehasish Kumar	11314f4059	[memprof] Filter out callstack frames which cannot be symbolized. This patch filters out callstack frames which can't be symbolized or if the frames belong to the runtime. Symbolization may not be possible if debug information is unavailable or if the addresses are from a shared library. For now we only support optimization of the main binary which is statically linked to the compiler runtime. Differential Revision: https://reviews.llvm.org/D120860	2022-03-04 11:10:08 -08:00
Snehasish Kumar	dda7b74967	[memprof] Symbolize and cache stack frames. Currently, symbolization of stack frames occurs on demand when the instrprof writer iterates over all the records in the raw memprof reader. With this change we symbolize and cache the frames immediately after reading the raw profiles. For a large internal binary this results in a runtime reduction of ~50% (2m -> 48s) when merging a memprof raw profile with a raw instr profile to generate an indexed profile. This change also makes it simpler in the future to generate additional calling context metadata to attach to each memprof record. Differential Revision: https://reviews.llvm.org/D120430	2022-03-03 11:00:37 -08:00
serge-sans-paille	fc97efa409	Cleanup includes: ProfileData Estimation of the impact on preprocessor output: before: 1067349756 after: 1065940348 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120434	2022-02-24 13:25:11 +01:00
Snehasish Kumar	b681799938	[instrprof] Rename the profile kind types to be more descriptive. Based on the discussion in D115393, I've updated the names to be more descriptive. Reviewed By: ellis, MaskRay Differential Revision: https://reviews.llvm.org/D120092	2022-02-23 13:15:56 -08:00
Fangrui Song	045f07b7dc	[ProfileData] Remove unused and racy FunctionSamples::Format after D51643 The write may be racy if ThinLTO creates multiple `InProcessThinBackend` instances.	2022-02-22 20:29:08 -08:00
Snehasish Kumar	0a4184909a	Reland "[memprof] Extend the index prof format to include memory profiles." This patch adds support for optional memory profile information to be included with and indexed profile. The indexed profile header adds a new field which points to the offset of the memory profile section (if present) in the indexed profile. For users who do not utilize this feature the only overhead is a 64-bit offset in the header. The memory profile section contains (1) profile metadata describing the information recorded for each entry (2) an on-disk hashtable containing the profile records indexed via llvm::md5(function_name). We chose to introduce a separate hash table instead of the existing one since the indexing for the instrumented fdo hash table is based on a CFG hash which itself is perturbed by memprof instrumentation. This commit also includes the changes reviewed separately in D120093. Differential Revision: https://reviews.llvm.org/D120103	2022-02-17 22:09:52 -08:00
Snehasish Kumar	19bdf44d85	Revert "Reland "[memprof] Extend the index prof format to include memory profiles."" This reverts commit `807ba7aace`.	2022-02-17 15:51:04 -08:00
Snehasish Kumar	27b7c1e3f5	Revert "[memprof] Fix frame deserialization on big endian systems." This reverts commit `c74389b4b5`. This broke the ml-opt-x86-64 build. https://lab.llvm.org/buildbot#builders/9/builds/4127	2022-02-17 15:51:04 -08:00
Snehasish Kumar	c74389b4b5	[memprof] Fix frame deserialization on big endian systems. We write the memprof internal call frame data in little endian format. However when reading the frame information we were casting it directly to a MemProfRecord::Frame pointer. In this change we add a separate deserialization method which uses an endian reader to read the bytes as little endian. This fixes https://lab.llvm.org/buildbot/#/builders/100/builds/12940 Differential Revision: https://reviews.llvm.org/D120093	2022-02-17 15:31:22 -08:00
Snehasish Kumar	807ba7aace	Reland "[memprof] Extend the index prof format to include memory profiles." This reverts commit `85355a560a`. This patch adds support for optional memory profile information to be included with and indexed profile. The indexed profile header adds a new field which points to the offset of the memory profile section (if present) in the indexed profile. For users who do not utilize this feature the only overhead is a 64-bit offset in the header. The memory profile section contains (1) profile metadata describing the information recorded for each entry (2) an on-disk hashtable containing the profile records indexed via llvm::md5(function_name). We chose to introduce a separate hash table instead of the existing one since the indexing for the instrumented fdo hash table is based on a CFG hash which itself is perturbed by memprof instrumentation. Differential Revision: https://reviews.llvm.org/D118653	2022-02-17 13:14:17 -08:00
Snehasish Kumar	a3beb34015	Reland "[InstrProf] Make the IndexedInstrProf header backwards compatible." This reverts commit `9fd2cb21fb`. Fixes an issue on big endian systems where the format version was not converted to little endian prior to passing to GET_VERSION. Differential Revision: https://reviews.llvm.org/D118390	2022-02-17 11:40:32 -08:00
minglotus-6	3940f1e237	[ProfData] Change type of options from int to uint64_t. - Reader uses option values to override uint64_t values. Differential Revision: https://reviews.llvm.org/D119810	2022-02-15 10:59:06 -08:00
serge-sans-paille	290e482342	Cleanup LLVMDWARFDebugInfo As usual with that header cleanup series, some implicit dependencies now need to be explicit: llvm/DebugInfo/DWARF/DWARFContext.h no longer includes: - "llvm/DebugInfo/DWARF/DWARFAcceleratorTable.h" - "llvm/DebugInfo/DWARF/DWARFCompileUnit.h" - "llvm/DebugInfo/DWARF/DWARFDebugAbbrev.h" - "llvm/DebugInfo/DWARF/DWARFDebugAranges.h" - "llvm/DebugInfo/DWARF/DWARFDebugFrame.h" - "llvm/DebugInfo/DWARF/DWARFDebugLoc.h" - "llvm/DebugInfo/DWARF/DWARFDebugMacro.h" - "llvm/DebugInfo/DWARF/DWARFGdbIndex.h" - "llvm/DebugInfo/DWARF/DWARFSection.h" - "llvm/DebugInfo/DWARF/DWARFTypeUnit.h" - "llvm/DebugInfo/DWARF/DWARFUnitIndex.h" Plus llvm/Support/Errc.h not included by a bunch of llvm/DebugInfo/DWARF/DWARF*.h files Preprocessed lines to build llvm on my setup: after: 1065629059 before: 1066621848 Which is a great diff! Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D119723	2022-02-15 09:16:03 +01:00
Snehasish Kumar	50713461d4	Reland "[memprof] Introduce a wrapper around MemInfoBlock." This reverts commit `e6999040f5`. Update test to fix signed int comparison warning, fix whitespace in compiler-rt MIBEntryDef.inc file. Differential Revision: https://reviews.llvm.org/D117256	2022-02-14 19:04:36 -08:00
Hongtao Yu	62ef77ca63	[CSSPGO] Do not merge a context that is already duplicated into the base profile. Do not merge a context that is already duplicated into the base profile. Also fixing a typo caused by previous refactoring. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D119735	2022-02-14 18:07:11 -08:00
Snehasish Kumar	e6999040f5	Revert "[memprof] Introduce a wrapper around MemInfoBlock." This reverts commit `9b67165285`. [3/4]	2022-02-14 11:42:58 -08:00
Snehasish Kumar	9fd2cb21fb	Revert "[InstrProf] Make the IndexedInstrProf header backwards compatible." This reverts commit `14cc41a020`. [2/4]	2022-02-14 11:42:58 -08:00
Snehasish Kumar	85355a560a	Revert "Reland "[memprof] Extend the index prof format to include memory profiles."" This reverts commit `de54e4ab78` [1/4]	2022-02-14 11:42:58 -08:00
Snehasish Kumar	de54e4ab78	Reland "[memprof] Extend the index prof format to include memory profiles." This reverts commit `0f73fb18ca`. Use llvm/Profile/MIBEntryDef.inc instead of relative path. Generated the raw profile data with `-mllvm -enable-name-compression=false` so that builbots where the reader is built without zlib do not fail. Also updated the test build instructions.	2022-02-14 10:52:13 -08:00
Snehasish Kumar	0f73fb18ca	Revert "[memprof] Extend the index prof format to include memory profiles." This reverts commit `43c2348c5b`. Buildbots are failing with an error on reading memprof testdata. "Inputs/basic.profraw: profile uses zlib compression but the profile reader was built without zlib support" https://lab.llvm.org/buildbot/#/builders/16/builds/24490	2022-02-14 10:25:01 -08:00
Snehasish Kumar	43c2348c5b	[memprof] Extend the index prof format to include memory profiles. This patch adds support for optional memory profile information to be included with and indexed profile. The indexed profile header adds a new field which points to the offset of the memory profile section (if present) in the indexed profile. For users who do not utilize this feature the only overhead is a 64-bit offset in the header. The memory profile section contains (1) profile metadata describing the information recorded for each entry (2) an on-disk hashtable containing the profile records indexed via llvm::md5(function_name). We chose to introduce a separate hash table instead of the existing one since the indexing for the instrumented fdo hash table is based on a CFG hash which itself is perturbed by memprof instrumentation. Differential Revision: https://reviews.llvm.org/D118653	2022-02-14 09:53:45 -08:00
Snehasish Kumar	14cc41a020	[InstrProf] Make the IndexedInstrProf header backwards compatible. While the contents of the profile are backwards compatible the header itself is not. For example, when adding new fields to the header results in significant issues. This change adds allows for portable instantiation of the header across indexed format versions. Differential Revision: https://reviews.llvm.org/D118390	2022-02-14 09:53:45 -08:00
Snehasish Kumar	9b67165285	[memprof] Introduce a wrapper around MemInfoBlock. Use the macro based format to add a wrapper around the MemInfoBlock when stored in the MemProfRecord. This wrapped block can then be serialized/deserialized based on a schema specified by a list of enums. Differential Revision: https://reviews.llvm.org/D117256	2022-02-14 09:53:45 -08:00
Hongtao Yu	f0f70ae674	[CSSPGO] Do not recount callee samples when computing profile summary for nested CS profile. When generating nested CS profile with all calling contexts of a function duplicated into a base profile under `--generate-merged-base-profiles`, do not recount callee samples when computing profile summary. This fixes the profile summary mismatch between flat cs profile and nested cs profile, for both extbinary and text format. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D119494	2022-02-11 09:05:51 -08:00
serge-sans-paille	e72c195fdc	Cleanup LLVMObject headers Most notably, llvm/Object/Binary.h no longer includes llvm/Support/MemoryBuffer.h llvm/Object/MachOUniversal*.h no longer include llvm/Object/Archive.h llvm/Object/TapiUniversal.h no longer includes llvm/Object/TapiFile.h llvm-project preprocessed size: before: 1068185081 after: 1068324320 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D119457	2022-02-10 21:13:44 +01:00
Snehasish Kumar	cb81545e7d	[memprof] Add LLVM_DEBUG for unused var in RawMemProfReader.cpp. The ContainingSegment variable is only used to check whether we found the address at this time. When building without Asserts this emits a warning. So for now wrap this code in LLVM_DEBUG to avoid the warning.	2022-02-08 16:02:24 -08:00
Snehasish Kumar	216575e581	Revert "Revert "[ProfileData] Read and symbolize raw memprof profiles."" This reverts commit `dbf47d227d`. Reapply https://reviews.llvm.org/D116784 now that https://reviews.llvm.org/D118413 has landed with a couple of fixes: * fix raw profile reader unaligned access identified by ubsan * fix windows build by using MOCK_CONST_METHOD3 instead of MOCK_METHOD.	2022-02-08 13:37:27 -08:00
Snehasish Kumar	dbf47d227d	Revert "[ProfileData] Read and symbolize raw memprof profiles." This reverts commit `26f978d4c5`. This patch added a transitive dependency on libcurl via symbolize. See discussion https://reviews.llvm.org/D116784#inline-1137928 https://reviews.llvm.org/D113717#3295350	2022-02-03 16:14:05 -08:00
Snehasish Kumar	55de669660	Revert "[instrprof][NFC] Sort link components and dedupe." This reverts commit `28ba0b9f6d`. clang ppc build failed https://lab.llvm.org/buildbot#builders/121/builds/16080	2022-02-03 15:42:50 -08:00
Snehasish Kumar	28ba0b9f6d	[instrprof][NFC] Sort link components and dedupe. Accidentally added a duplicate link component in D116784.	2022-02-03 15:35:26 -08:00
Snehasish Kumar	26f978d4c5	[ProfileData] Read and symbolize raw memprof profiles. This change extends the RawMemProfReader to read all the sections of the raw profile and symbolize the virtual addresses recorded as part of the callstack for each allocation. For now the symbolization is used to display the contents of the profile with llvm-profdata. Differential Revision: https://reviews.llvm.org/D116784	2022-02-03 14:33:50 -08:00
Snehasish Kumar	14f4f63af5	[memprof] Print out the summary in YAML format. Print out the profile summary in YAML format to make it easier to for tools and tests to read in the contents of the raw profile. Differential Revision: https://reviews.llvm.org/D116783	2022-02-03 14:33:50 -08:00
Snehasish Kumar	d2df8d5a78	[instrprof][NFC] Templatize the instrprof iterator. This change templatizes the InstrProfIterator where the default specialization is based on the current usage, i.e. the reader_type is InstrProfReader and the record_type (value_type) is NamedInstrProfRecord. A subsequent patch will use the same iterator template to implement an iterator for the RawMemProfReader. Differential Revision: https://reviews.llvm.org/D116782	2022-02-03 14:33:49 -08:00
Ellis Hoag	7756b34ef2	[InstrProf][NFC] Remove stray option in InstrProfWriter This variable was added to `InstrProfWriter.cpp` in D115693 by mistake and it isn't needed. Reviewed By: kyulee Differential Revision: https://reviews.llvm.org/D118664	2022-02-02 14:29:15 -08:00
Snehasish Kumar	186dcd4aab	[instrprof][NFC] Refactor out the common logic for getProfileKind. The logic for getProfileKind for RawInstrProfReader and InstrProfReaderIndex is similar. To avoid duplication, move the logic from the header to InstrProfReader.cpp and introduce a static method which implements the common code. Differential Revision: https://reviews.llvm.org/D118656	2022-01-31 15:04:42 -08:00
Ellis Hoag	eea002a9c4	[InstrProf][NFC] Move function out of InstrProf.h `createIRLevelProfileFlagVar()` seems to be only used in `PGOInstrumentation.cpp` so we move it to that file. Then it can also take advantage of directly using options rather than passing them as arguments. Reviewed By: kyulee, phosek Differential Revision: https://reviews.llvm.org/D118097	2022-01-28 09:24:26 -08:00

1 2 3 4 5 ...

715 Commits