llvm-project

Commit Graph

Author	SHA1	Message	Date
Vitaly Buka	6318001209	[sanitizer] Support IsRssLimitExceeded in all sanitizers Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D115000	2021-12-03 12:45:44 -08:00
Vitaly Buka	36e6a259c8	[NFC][sanitizer] Remove SetSoftRssLimitExceededCallback According comments on D44404, something like that was the goal. Reviewed By: morehouse, kstoimenov Differential Revision: https://reviews.llvm.org/D114991	2021-12-02 14:37:02 -08:00
Vitaly Buka	d48d8670b5	[NFC][sanitizer] Rename RssLimitExceeded -> IsRssLimitExceeded	2021-12-02 12:52:00 -08:00
Dmitry Vyukov	97b4e63117	tsan: fix false positives in dynamic libs with static tls The added test demonstrates loading a dynamic library with static TLS. Such static TLS is a hack that allows a dynamic library to have faster TLS, but it can be loaded only iff all threads happened to allocate some excess of static TLS space for whatever reason. If it's not the case loading fails with: dlopen: cannot load any more object with static TLS We used to produce a false positive because dlopen will write into TLS of all existing threads to initialize/zero TLS region for the loaded library. And this appears to be racing with initialization of TLS in the thread since we model a write into the whole static TLS region (we don't know what part of it is currently unused): WARNING: ThreadSanitizer: data race (pid=2317365) Write of size 1 at 0x7f1fa9bfcdd7 by main thread: 0 memset 1 init_one_static_tls 2 __pthread_init_static_tls [[ this is where main calls dlopen ]] 3 main Previous write of size 8 at 0x7f1fa9bfcdd0 by thread T1: 0 __tsan_tls_initialization Fix this by ignoring accesses during dlopen. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D114953	2021-12-02 17:47:05 +01:00
Snehasish Kumar	3a4d373ec2	[memprof] Align each rawprofile section to 8b. The first 8b of each raw profile section need to be aligned to 8b since the first item in each section is a u64 count of the number of items in the section. Summary of changes: * Assert alignment when reading counts. * Update test to check alignment, relax some size checks to allow padding. * Update raw binary inputs for llvm-profdata tests. Differential Revision: https://reviews.llvm.org/D114826	2021-11-30 20:12:43 -08:00
Snehasish Kumar	1cf11df591	[memprof] Disable pedantic warnings, suppress variadic macro warning. The memprof unit tests use an older version of gmock (included in the repo) which does not build cleanly with -pedantic: https://github.com/google/googletest/issues/2650 For now just silence the warning by disabling pedantic and add the appropriate flags for gcc and clang.	2021-11-30 12:03:19 -08:00
Snehasish Kumar	a2ce97cc3f	[memprof] Fix unit test build after refactoring shared header. The memprof unittest also needs to include the MemProfData.inc header directly to have access to MEMPROF_RAW_MAGIC and MEMPROF_RAW_VERSION globals.	2021-11-30 11:35:42 -08:00
Snehasish Kumar	7cca33b40f	[memprof] Extend llvm-profdata to display MemProf profile summaries. This commit adds initial support to llvm-profdata to read and print summaries of raw memprof profiles. Summary of changes: * Refactor shared defs to MemProfData.inc * Extend show_main to display memprof profile summaries. * Add a simple raw memprof profile reader. * Add a couple of tests to tools/llvm-profdata. Differential Revision: https://reviews.llvm.org/D114286	2021-11-30 10:45:26 -08:00
Snehasish Kumar	a4b92d6158	[memprof] Remove the "Live on exit:" print for text format. We dropped the printing of live on exit blocks in rG1243cef245f6 - the commit changed the insertOrMerge logic. Remove the message since it is no longer needed (all live blocks are inserted into the hashmap) before serializing/printing the profile. Furthermore, the original intent was to capture evicted blocks so it wasn't entirely correct. Also update the binary format test invocation to remove the redundant print_text directive now that it is the default. Differential Revision: https://reviews.llvm.org/D114285	2021-11-22 13:30:48 -08:00
Haowei Wu	692131f41a	[compiler-rt] Explicitly set dependency on libcxx for MemProfUnitTest MemProfUnitTest now depends on libcxx but the dependency is not explicitly expressed in build system, causing build races. This patch addresses this issue. Differential Revision: https://reviews.llvm.org/D114267	2021-11-22 12:47:12 -08:00
Dmitry Vyukov	d1f72f02d0	memprof: don't use thread user_id memprof does not use user_id for anything, so don't pass it to ThreadCreate. Passing a random field of MemprofThread as user_id does not make much sense anyway. Depends on D113920. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D113921	2021-11-17 14:49:25 +01:00
Vitaly Buka	cb0e14ce6d	[sanitizer] Switch dlsym hack to internal_allocator Since glibc 2.34, dlsym does 1. malloc 1 2. malloc 2 3. free pointer from malloc 1 4. free pointer from malloc 2 This sequence was not handled by trivial dlsym hack. This fixes https://bugs.llvm.org/show_bug.cgi?id=52278 Reviewed By: eugenis, morehouse Differential Revision: https://reviews.llvm.org/D112588	2021-11-12 16:11:10 -08:00
Snehasish Kumar	aacaebc6c2	[memprof] Make the raw binary format the default. Set the default memprof serialization format as binary. 9 tests are updated to use print_text=true. Also fixed an issue with concatenation of default and test specified options (missing separator). Differential Revision: https://reviews.llvm.org/D113617	2021-11-11 11:29:36 -08:00
Snehasish Kumar	545866cb05	[memprof] Add a raw binary format to serialize memprof profiles. This change implements the raw binary format discussed in https://lists.llvm.org/pipermail/llvm-dev/2021-September/153007.html Summary of changes * Add a new memprof option to choose binary or text (default) format. * Add a rawprofile library which serializes the MIB map to profile. * Add a unit test for rawprofile. * Mark sanitizer procmaps methods as virtual to be able to mock them. * Extend memprof_profile_dump regression test. Differential Revision: https://reviews.llvm.org/D113317	2021-11-11 11:29:36 -08:00
Snehasish Kumar	1243cef245	[memprof] Replace the block cache with a hashmap. The existing implementation uses a cache + eviction based scheme to record heap profile information. This design was adopted to ensure a constant memory overhead (due to fixed number of cache entries) along with incremental write-to-disk for evictions. We find that since the number of entries to track is O(unique-allocation-contexts) the overhead of keeping all contexts in memory is not very high. On a clang workload, the max number of unique allocation contexts was ~35K, median ~11K. For each context, we (currently) store 64 bytes of data - this amounts to 5.5MB (max). Given the low overheads for a complex workload, we can simplify the implementation by using a hashmap without eviction. Other changes: * Memory map is dumped at the end rather than startup. The relative order in the profile dump is unchanged since we no longer have evicted entries at runtime. * Added a test to check meminfoblocks are merged. Differential Revision: https://reviews.llvm.org/D111676	2021-11-11 11:29:36 -08:00
Snehasish Kumar	fc7162414e	[memprof] Move the MemInfoBlock definition to a separate header. Move the memprof MemInfoBlock struct to its own header as requested during the review of D111676. Differential Revision: https://reviews.llvm.org/D113315	2021-11-11 11:29:36 -08:00
Vitaly Buka	1da33a51f1	[NFC][asan][memprov] Remove dlsym hack from posix_memalign It was added for RTEMS which was removed at D104279.	2021-11-10 16:35:14 -08:00
Vitaly Buka	651797f488	[NFC][sanitizer] Move GET_MALLOC_STACK_TRACE closer to the use	2021-11-10 15:42:07 -08:00
Martin Liska	13a442ca49	Enable -Wformat-pedantic and fix fallout. Differential Revision: https://reviews.llvm.org/D113172	2021-11-05 13:12:35 +01:00
Teresa Johnson	0d8bdc1786	[MemProf] Record accesses for all words touched in mem intrinsic Previously for mem* intrinsics we only incremented the access count for the first word in the range. However, after thinking it through I think it makes more sense to record an access for every word in the range. This better matches the behavior of inlined memory intrinsics, and also allows better analysis of utilization at a future date. Differential Revision: https://reviews.llvm.org/D110799	2021-09-30 15:07:55 -07:00
Vitaly Buka	7c1128f3bb	[NFC][sanitizer] Return StackDepotStats by value Differential Revision: https://reviews.llvm.org/D110644	2021-09-28 15:42:21 -07:00
Teresa Johnson	242524741f	[MemProf] Fix format warnings Fix the warnings that show up with -Wformat in this file.	2021-09-15 16:27:21 -07:00
Teresa Johnson	37a5a3ae55	[MemProf] Avoid global lock when updating MIB cache Previously we used a global Allocator-scope mutex to lock when adding a deallocation to the MIB cache. This resulted in a lot of contention. Instead add and use per-set mutexes. Along with this, we now need to remove the global miss and access count variables and instead utilize the per-set statistics to report the overall miss rate. Differential Revision: https://reviews.llvm.org/D109853	2021-09-15 16:06:11 -07:00
Vitaly Buka	c92631a59a	[sanitizer] Fix VReport of symbol version Version is already a string and does not need stringizing.	2021-08-25 14:32:15 -07:00
Michael Jones	40067b88c0	[compiler-rt] change internal libc invariants llvm-libc is expected to be built with sanitizers and not use interceptors in the long run. For now though, we have a hybrid process, where functions implemented in llvm-libc are instrumented, and glibc and sanitizer interceptors fill in the rest. Current sanitizers have an invariant that the REAL(...) function called from inside of an interceptor is uninstrumented. A lot of interceptors call strlen() in order to figure out the size of the region to check/poison. Switch these callsites over to the internal, unsanitized implementation. Reviewed By: hctim, vitalybuka Differential Revision: https://reviews.llvm.org/D108316	2021-08-24 16:41:15 +00:00
Dmitry Vyukov	123c58ea26	sanitizer_common: enable format string checking Enable -Wformat in sanitizer_common now that it's cleaned up from existing warnings. But disable it in all sanitizers for now since they are not cleaned up yet, but inherit sanitizer_common CFLAGS. Depends on D107980. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D107981	2021-08-13 13:44:52 +02:00
Dmitry Vyukov	56debbf52e	sanitizers: switch BlockingMutex(LINKER_INITIALIZED) to Mutex Mutex does not support LINKER_INITIALIZED. As preparation to switching BlockingMutex to Mutex, proactively replace all BlockingMutex(LINKER_INITIALIZED) with Mutex. All of these are objects with static storage duration and Mutex ctor is constexpr, so it should be equivalent. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D106944	2021-07-28 15:09:34 +02:00
Dmitry Vyukov	dfd9808b6c	sanitizer_common: add simpler ThreadRegistry ctor Currently ThreadRegistry is overcomplicated because of tsan, it needs tid quarantine and reuse counters. Other sanitizers don't need that. It also seems that no other sanitizer now needs max number of threads. Asan used to need 2^24 limit, but it does not seem to be needed now. Other sanitizers blindly copy-pasted that without reasons. Lsan also uses quarantine, but I don't see why that may be potentially needed. Add a ThreadRegistry ctor that does not require any sizes and use it in all sanitizers except for tsan. In preparation for new tsan runtime, which won't need any of these parameters as well. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D105713	2021-07-13 22:52:25 -07:00
Vitaly Buka	b4a6fa12d1	Revert "sanitizer_common: add simpler ThreadRegistry ctor" Breaks https://lab.llvm.org/buildbot/#/builders/sanitizer-x86_64-linux-android This reverts commit `6062c672bc`. This reverts commit `8e489b4b96`.	2021-07-12 12:04:12 -07:00
Dmitry Vyukov	0da172b176	sanitizer_common: add thread safety annotations Enable clang Thread Safety Analysis for sanitizers: https://clang.llvm.org/docs/ThreadSafetyAnalysis.html Thread Safety Analysis can detect inconsistent locking, deadlocks and data races. Without GUARDED_BY annotations it has limited value. But this does all the heavy lifting to enable analysis and allows to add GUARDED_BY incrementally. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D105716	2021-07-12 11:46:49 +02:00
Dmitry Vyukov	8e489b4b96	sanitizer_common: add simpler ThreadRegistry ctor Currently ThreadRegistry is overcomplicated because of tsan, it needs tid quarantine and reuse counters. Other sanitizers don't need that. It also seems that no other sanitizer now needs max number of threads. Asan used to need 2^24 limit, but it does not seem to be needed now. Other sanitizers blindly copy-pasted that without reasons. Lsan also uses quarantine, but I don't see why that may be potentially needed. Add a ThreadRegistry ctor that does not require any sizes and use it in all sanitizers except for tsan. In preparation for new tsan runtime, which won't need any of these parameters as well. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D105713	2021-07-11 12:36:28 +02:00
Dmitry Vyukov	2721e27c3a	sanitizer_common: deduplicate CheckFailed We have some significant amount of duplication around CheckFailed functionality. Each sanitizer copy-pasted a chunk of code. Some got random improvements like dealing with recursive failures better. These improvements could benefit all sanitizers, but they don't. Deduplicate CheckFailed logic across sanitizers and let each sanitizer only print the current stack trace. I've tried to dedup stack printing as well, but this got me into cmake hell. So let's keep this part duplicated in each sanitizer for now. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102221	2021-05-12 08:50:53 +02:00
Dmitry Vyukov	92a3a2dc3e	sanitizer_common: introduce kInvalidTid/kMainTid Currently we have a bit of a mess related to tids: - sanitizers re-declare kInvalidTid multiple times - some call it kUnknownTid - implicit assumptions that main tid is 0 - asan/memprof claim their tids need to fit into 24 bits, but this does not seem to be true anymore - inconsistent use of u32/int to store tids Introduce kInvalidTid/kMainTid in sanitizer_common and use them consistently. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D101428	2021-04-30 15:58:05 +02:00
Jianzhou Zhao	75be3681d1	[msan] Remove dead function/fields To see how to extract a shared allocator interface for D101204, found some unused code. Tests passed. Are they safe to remove? Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D101559	2021-04-29 23:08:39 +00:00
Nico Weber	0e92cbd6a6	Revert "[sanitizer] Simplify GetTls with dl_iterate_phdr on Linux" This reverts commit `ec575e3b0a`. Still doesn't work, see https://crbug.com/1196037	2021-04-05 19:00:18 -04:00
Fangrui Song	ec575e3b0a	[sanitizer] Simplify GetTls with dl_iterate_phdr on Linux This was reverted by `f176803ef1` due to Ubuntu 16.04 x86-64 glibc 2.23 problems. This commit additionally calls `__tls_get_addr({modid,0})` to work around the dlpi_tls_data==NULL issues for glibc<2.25 (https://sourceware.org/bugzilla/show_bug.cgi?id=19826) GetTls is the range of * thread control block and optional TLS_PRE_TCB_SIZE * static TLS blocks plus static TLS surplus On glibc, lsan requires the range to include `pthread::{specific_1stblock,specific}` so that allocations only referenced by `pthread_setspecific` can be scanned. This patch uses `dl_iterate_phdr` to collect TLS blocks. Find the one with `dlpi_tls_modid==1` as one of the initially loaded module, then find consecutive ranges. The boundaries give us addr and size. This allows us to drop the glibc internal `_dl_get_tls_static_info` and `InitTlsSize` entirely. Use the simplified method with non-Android Linux for now, but in theory this can be used with *BSD and potentially other ELF OSes. This simplification enables D99566 for TLS Variant I architectures. See https://reviews.llvm.org/D93972#2480556 for analysis on GetTls usage across various sanitizers. Differential Revision: https://reviews.llvm.org/D98926	2021-04-04 15:35:53 -07:00
Nico Weber	f176803ef1	Revert "[sanitizer] Simplify GetTls with dl_iterate_phdr" This reverts commit `9be8f8b34d`. This breaks tsan on Ubuntu 16.04: $ cat tiny_race.c #include <pthread.h> int Global; void *Thread1(void *x) { Global = 42; return x; } int main() { pthread_t t; pthread_create(&t, NULL, Thread1, NULL); Global = 43; pthread_join(t, NULL); return Global; } $ out/gn/bin/clang -fsanitize=thread -g -O1 tiny_race.c --sysroot ~/src/chrome/src/build/linux/debian_sid_amd64-sysroot/ $ docker run -v $PWD:/foo ubuntu:xenial /foo/a.out FATAL: ThreadSanitizer CHECK failed: ../../compiler-rt/lib/tsan/rtl/tsan_platform_linux.cpp:447 "((thr_beg)) >= ((tls_addr))" (0x7fddd76beb80, 0xfffffffffffff980) #0 <null> <null> (a.out+0x4960b6) #1 <null> <null> (a.out+0x4b677f) #2 <null> <null> (a.out+0x49cf94) #3 <null> <null> (a.out+0x499bd2) #4 <null> <null> (a.out+0x42aaf1) #5 <null> <null> (libpthread.so.0+0x76b9) #6 <null> <null> (libc.so.6+0x1074dc) (Get the sysroot from here: https://commondatastorage.googleapis.com/chrome-linux-sysroot/toolchain/500976182686961e34974ea7bdc0a21fca32be06/debian_sid_amd64_sysroot.tar.xz) Also reverts follow-on commits: This reverts commit `58c62fd976`. This reverts commit `31e541e375`.	2021-04-02 18:19:17 -04:00
Fangrui Song	9be8f8b34d	[sanitizer] Simplify GetTls with dl_iterate_phdr GetTls is the range of * thread control block and optional TLS_PRE_TCB_SIZE * static TLS blocks plus static TLS surplus On glibc, lsan requires the range to include `pthread::{specific_1stblock,specific}` so that allocations only referenced by `pthread_setspecific` can be scanned. This patch uses `dl_iterate_phdr` to collect TLS ranges. Find the one with `dlpi_tls_modid==1` as one of the initially loaded module, then find consecutive ranges. The boundaries give us addr and size. This allows us to drop the glibc internal `_dl_get_tls_static_info` and `InitTlsSize` entirely. Use the simplified method with non-Android Linux for now, but in theory this can be used with *BSD and potentially other ELF OSes. In the future, we can move `ThreadDescriptorSize` code to lsan (and consider intercepting `pthread_setspecific`) to avoid hacks in generic code. See https://reviews.llvm.org/D93972#2480556 for analysis on GetTls usage across various sanitizers. Differential Revision: https://reviews.llvm.org/D98926	2021-03-25 21:55:27 -07:00
Vitaly Buka	e0dadf3de2	[sanitizer] Remove max_len parameter from InternalScopedString InternalScopedString uses InternalMmapVector internally so it can be resized dynamically as needed. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D98751	2021-03-17 16:57:09 -07:00
Alex Richardson	ad294e572b	[sanitizers] Fix interception of GLibc regexec Previously, on GLibc systems, the interceptor was calling __compat_regexec (regexec@GLIBC_2.2.5) instead of the newer __regexec (regexec@GLIBC_2.3.4). The __compat_regexec strips the REG_STARTEND flag but does not report an error if other flags are present. This can result in infinite loops for programs that use REG_STARTEND to find all matches inside a buffer (since ignoring REG_STARTEND means that the search always starts from the first character). The underlying issue is that GLibc's dlsym(RTLD_NEXT, ...) appears to always return the oldest versioned symbol instead of the default. This means it does not match the behaviour of dlsym(RTLD_DEFAULT, ...) or the behaviour documented in the manpage. It appears a similar issue was encountered with realpath and worked around in `77ef78a0a5`. See also https://sourceware.org/bugzilla/show_bug.cgi?id=14932 and https://sourceware.org/bugzilla/show_bug.cgi?id=1319. Fixes https://github.com/google/sanitizers/issues/1371 Reviewed By: #sanitizers, vitalybuka, marxin Differential Revision: https://reviews.llvm.org/D96348	2021-03-08 10:53:55 +00:00
Fangrui Song	fde3ae88ee	[asan][memprof] Declare _DYNAMIC and fix -Wparentheses Declare `extern ElfW(Dyn) _DYNAMIC[];` so that it will trivially work on musl.	2020-12-27 20:28:59 -08:00
Jeroen Dobbelaere	d7e71b5db8	[compiler-rt sanitizer] Use clock_gettime instead of timespec_get On RH66, timespec_get is not available. Use clock_gettime instead. This problem was introduced with D87120. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D91687	2020-12-08 10:10:17 -08:00
Teresa Johnson	a75b2e87e6	[MemProf] Add interface to dump profile Add an interface so that the profile can be dumped on demand. Differential Revision: https://reviews.llvm.org/D91768	2020-11-19 10:21:53 -08:00
Vitaly Buka	4cd0927306	[memprof] Don't protect destructor in final	2020-11-03 11:33:33 -08:00
Vitaly Buka	d48f2d7c02	[sanitizer] Cleanup -Wnon-virtual-dtor warnings	2020-11-02 20:30:50 -08:00
Vitaly Buka	8b37a4e6ca	[sanitizer] Make destructors protected	2020-11-02 18:00:43 -08:00
Petr Hosek	6db314e86b	[CMake] Remove cxx-headers from runtime deps Part of D88922	2020-10-30 20:03:38 -07:00
Petr Hosek	ed4fbe6d9c	[CMake] Replace ctime with time.h in memprof Part of D88922	2020-10-30 20:02:53 -07:00
Teresa Johnson	5c20d7db9f	[MemProf] Allow the binary to specify the profile output filename This will allow the output directory to be specified by a build time option, similar to the directory specified for regular PGO profiles via -fprofile-generate=. The memory profiling instrumentation pass will set up the variable. This is the same mechanism used by the PGO instrumentation and runtime. Depends on D87120 and D89629. Differential Revision: https://reviews.llvm.org/D89086	2020-10-22 08:30:19 -07:00
Teresa Johnson	3d4bba302d	[MemProf] Memory profiling runtime support See RFC for background: http://lists.llvm.org/pipermail/llvm-dev/2020-June/142744.html Follow on companion to the clang/llvm instrumentation support in D85948 and committed earlier. This patch adds the compiler-rt runtime support for the memory profiling. Note that much of this support was cloned from asan (and then greatly simplified and renamed). For example the interactions with the sanitizer_common allocators, error handling, interception, etc. The bulk of the memory profiling specific code can be found in the MemInfoBlock, MemInfoBlockCache, and related classes defined and used in memprof_allocator.cpp. For now, the memory profile is dumped to text (stderr by default, but honors the sanitizer_common log_path flag). It is dumped in either a default verbose format, or an optional terse format. This patch also adds a set of tests for the core functionality. Differential Revision: https://reviews.llvm.org/D87120	2020-10-16 09:47:02 -07:00

50 Commits