This change switches tsan to the new runtime which features:
- 2x smaller shadow memory (2x of app memory)
- faster fully vectorized race detection
- small fixed-size vector clocks (512b)
- fast vectorized vector clock operations
- unlimited number of alive threads/goroutines
Depends on D112602.
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D112603
Start the background thread only after fork, but not after clone.
For fork we always did this and it's known to work (or user code has adapted).
But if we do this for the new clone interceptor some code (sandbox2) fails.
So keep the model we used for years and don't start the background thread after clone.
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D113744
Clone does not exist on Mac.
There is a chance it will break on other OSes.
Enable it incrementally, starting with Linux only;
other OSes can enable it later as needed.
Reviewed By: melver, thakis
Differential Revision: https://reviews.llvm.org/D113693
gtest uses clone for death tests and it needs the same
handling as fork to prevent deadlock (take runtime mutexes
before and release them after).
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D113677
All tsan interceptors check for initialization and/or initialize things
as necessary lazily, so we can pretend everything is initialized in the
COMMON_INTERCEPTOR_NOTHING_IS_INITIALIZED check to avoid double-checking
for initialization (this is only necessary for sanitizers that don't
handle initialization on common grounds).
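A minimal sketch of what this enables (illustrative; the real definition lives in tsan's interceptor setup and may be spelled differently):
```
// Since tsan interceptors initialize lazily on their own, the
// common-interceptor "not initialized yet" escape hatch can compile away:
#define COMMON_INTERCEPTOR_NOTHING_IS_INITIALIZED false
```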
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D112446
Whenever we call cur_thread_init, we call cur_thread on the next line.
So make cur_thread_init return the current thread directly.
Makes code a bit shorter, does not affect codegen.
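A sketch of the resulting call-site pattern (signature assumed):
```
// before:
//   cur_thread_init();
//   ThreadState *thr = cur_thread();
// after (cur_thread_init initializes lazily, then returns cur_thread()):
//   ThreadState *thr = cur_thread_init();
```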
Reviewed By: vitalybuka, melver
Differential Revision: https://reviews.llvm.org/D110384
ScopedInterceptor::Enable/DisableIgnores is only used for some special cases.
Uninline them from the common interceptor handling.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D110157
The CallUserSignalHandler function is quite large and complex.
Move the errno-spoiling reporting into a separate function.
No logical changes.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D110159
Some of the DPrintf's currently produce -Wformat warnings if enabled.
Fix these format strings.
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D110131
Set SA_SIGINFO only if we install our own sighandler; otherwise we
can set the flag and return it as 'old' without an actual sigaction set.
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D108616
Change 636428c727 enabled BlockingRegion hooks for pthread_once().
Unfortunately this seems to cause crashes on Mac OS X, which calls
pthread_once() from locations that are not ready for full instrumentation:
| ThreadSanitizer:DEADLYSIGNAL
| ==31465==ERROR: ThreadSanitizer: stack-overflow on address 0x7ffee73fffd8 (pc 0x00010807fd2a bp 0x7ffee7400050 sp 0x7ffee73fffb0 T93815)
| #0 __tsan::MetaMap::GetSync(__tsan::ThreadState*, unsigned long, unsigned long, bool, bool) tsan_sync.cpp:195 (libclang_rt.tsan_osx_dynamic.dylib:x86_64+0x78d2a)
| #1 __tsan::MutexPreLock(__tsan::ThreadState*, unsigned long, unsigned long, unsigned int) tsan_rtl_mutex.cpp:143 (libclang_rt.tsan_osx_dynamic.dylib:x86_64+0x6cefc)
| #2 wrap_pthread_mutex_lock sanitizer_common_interceptors.inc:4240 (libclang_rt.tsan_osx_dynamic.dylib:x86_64+0x3dae0)
| #3 flockfile <null>:2 (libsystem_c.dylib:x86_64+0x38a69)
| #4 puts <null>:2 (libsystem_c.dylib:x86_64+0x3f69b)
| #5 wrap_puts sanitizer_common_interceptors.inc (libclang_rt.tsan_osx_dynamic.dylib:x86_64+0x34d83)
| #6 __tsan::OnPotentiallyBlockingRegionBegin() cxa_guard_acquire.cpp:8 (foo:x86_64+0x100000e48)
| #7 wrap_pthread_once tsan_interceptors_posix.cpp:1512 (libclang_rt.tsan_osx_dynamic.dylib:x86_64+0x2f6e6)
From the stack trace it can be seen that the caller is unknown, and the
resulting stack-overflow seems to indicate that whoever the caller is
does not have enough stack space or otherwise is running in a limited
environment not yet ready for full instrumentation.
Fix it by reverting behaviour on Mac OS X to not call BlockingRegion
hooks from pthread_once().
Reported-by: azharudd
Reviewed By: glider
Differential Revision: https://reviews.llvm.org/D108305
For the symbolizer we only process SIGSEGV signals synchronously
(such a signal means a bug in the symbolizer or in tsan).
But we still want to reset in_symbolizer to fail gracefully.
The symbolizer and user code use different memory allocators,
so if we don't reset in_symbolizer we can get memory allocated
with one allocator being freed with the other, which can cause more crashes.
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D107564
Currently we hardcode u64 type for shadow everywhere
and do lots of uptr<->u64* casts. It makes it hard to
change u64 to another type (e.g. u32) and makes it easy
to introduce bugs.
Introduce RawShadow type and use it in MemToShadow, ShadowToMem,
IsShadowMem and throughout the code base as u64 replacement.
This makes it possible to change u64 to something else in future
and generally improves static typing.
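A sketch of the strong-typedef idea (the actual definition in tsan_defs.h may differ, e.g. it could start out as a plain typedef):
```
typedef unsigned long long u64;  // stand-ins for sanitizer_common types
typedef unsigned long uptr;

// A distinct type: not implicitly convertible to/from u64, so accidental
// mixing with other u64 values fails to compile.
enum class RawShadow : u64 {};

RawShadow *MemToShadow(uptr addr);    // was: u64 *
uptr ShadowToMem(RawShadow *shadow);  // was: u64 *
bool IsShadowMem(RawShadow *p);
```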
Depends on D107481.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D107482
Currently we use passive spinning with internal_sched_yield to wait
in __cxa_guard_acquire/pthread_once. Passive spinning tends to degrade
ungracefully under high load. Use FutexWait/Wake instead.
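A self-contained Linux sketch of the futex-based wait (state constants and names are illustrative; the real code uses sanitizer_common's FutexWait/FutexWake wrappers):
```
#include <linux/futex.h>
#include <sys/syscall.h>
#include <unistd.h>
#include <climits>
#include <atomic>

enum : unsigned {
  kGuardUninitialized = 0,
  kGuardRunning = 1,
  kGuardDone = 2,
  kGuardWaiter = 1 << 16,  // somebody is sleeping on the futex
};

// Returns true if the caller must run the init and then call GuardRelease.
bool GuardAcquire(std::atomic<unsigned> *g) {
  unsigned s = g->load();
  for (;;) {
    if (s == kGuardDone)
      return false;  // already initialized
    if (s == kGuardUninitialized && g->compare_exchange_weak(s, kGuardRunning))
      return true;  // we won the race; run the init
    if (s & kGuardRunning) {
      // Mark that there is a waiter so the releaser knows to wake us,
      // then sleep instead of sched_yield-spinning.
      if (!(s & kGuardWaiter) && !g->compare_exchange_weak(s, s | kGuardWaiter))
        continue;
      syscall(SYS_futex, g, FUTEX_WAIT_PRIVATE, s | kGuardWaiter, nullptr,
              nullptr, 0);
    }
    s = g->load();
  }
}

void GuardRelease(std::atomic<unsigned> *g) {
  // Publish "done"; wake sleepers only if somebody actually went to sleep.
  if (g->exchange(kGuardDone) & kGuardWaiter)
    syscall(SYS_futex, g, FUTEX_WAKE_PRIVATE, INT_MAX, nullptr, nullptr, 0);
}
```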
Depends on D107359.
Reviewed By: vitalybuka, melver
Differential Revision: https://reviews.llvm.org/D107360
Currently we effectively duplicate "once" logic for __cxa_guard_acquire
and pthread_once. Unify the implementations.
This is not a no-op change:
- constants used for pthread_once are changed to match __cxa_guard_acquire
(__cxa_guard_acquire constants are tied to ABI, but it does not seem
to be the case for pthread_once)
- pthread_once now also uses PotentiallyBlockingRegion annotations
- __cxa_guard_acquire checks thr->in_ignored_lib to skip user synchronization
It's unclear if these two differences are intentional or a mere sloppy inconsistency.
Since all tests still pass, let's assume the latter.
Reviewed By: vitalybuka, melver
Differential Revision: https://reviews.llvm.org/D107359
mallopt calls are left-over from the times we used
__libc_malloc/__libc_free for internal allocations.
Now we have our own internal allocator, so this is not needed.
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D107342
We currently use ad-hoc spin waiting to synchronize thread creation
and thread start both ways. But spinning tends to degrade ungracefully
under high contention (lots of threads are created at the same time).
Use semaphores for synchronization instead.
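A self-contained sketch of the handshake using POSIX semaphores (the runtime uses sanitizer_common's own Semaphore, not sem_t):
```
#include <semaphore.h>
#include <pthread.h>

struct ThreadArgs {
  sem_t created;  // parent -> child: thread is registered with the runtime
  sem_t started;  // child -> parent: child has entered its start routine
};

static void *ThreadStart(void *p) {
  ThreadArgs *args = static_cast<ThreadArgs *>(p);
  sem_wait(&args->created);  // block (don't spin) until registration is done
  sem_post(&args->started);  // then tell the parent we are running
  // ... user thread function runs here ...
  return nullptr;
}
```
Unlike a spin loop, a blocked sem_wait costs nothing while hundreds of threads start concurrently.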
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D107337
Currently we have MemoryAccess function that accepts
"bool kAccessIsWrite, bool kIsAtomic" and 4 wrappers:
MemoryRead/MemoryWrite/MemoryReadAtomic/MemoryWriteAtomic.
Such a scheme with bool flags is not particularly scalable or extendable.
Because of that we did not have Read/Write wrappers for UnalignedMemoryAccess,
and "true, false" or "false, true" at call sites is not very readable.
Moreover, the new tsan runtime will introduce more flags
(e.g. move "freed" and "vptr access" to memory access flags).
We can't have 16 wrappers, and each flag also takes a whole
64-bit register for non-inlined calls.
Introduce AccessType enum that contains bit mask of
read/write, atomic/non-atomic, and later free/non-free,
vptr/non-vptr.
Such a scheme is more scalable, more readable, and more efficient
(it doesn't consume multiple registers for these flags during calls),
and it lets us cover the unaligned and range variations of memory
access functions as well.
Also switch from size log to just size.
The new tsan runtime won't have the limitation of supporting
only 1/2/4/8 access sizes, so we don't need the logarithms.
Also add an inline thunk that converts the new interface to the old one.
For inlined calls it should not add any overhead because
all flags/sizes can be computed at compile time.
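A sketch of the flag scheme (modeled on the text above; the actual names and values live in tsan_defs.h and may differ):
```
typedef unsigned long uptr;  // stand-in for the sanitizer uptr type
typedef uptr AccessType;

enum : AccessType {
  kAccessWrite  = 0,       // write is the default and needs no bit
  kAccessRead   = 1 << 0,
  kAccessAtomic = 1 << 1,
  kAccessFree   = 1 << 2,  // future flags mentioned above
  kAccessVptr   = 1 << 3,
};

// One entry point with a plain size (no size log) replaces the
// MemoryRead/MemoryWrite/MemoryReadAtomic/MemoryWriteAtomic wrappers:
void MemoryAccess(uptr pc, uptr addr, uptr size, AccessType typ);

// Call sites become self-describing:
//   MemoryAccess(pc, addr, 8, kAccessRead | kAccessAtomic);
```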
Reviewed By: vitalybuka, melver
Differential Revision: https://reviews.llvm.org/D107276
ProcessPendingSignals is called in all interceptors
and user atomic operations. Make the fast-path check
(no pending signals) inlinable.
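A sketch of the shape (field and function names assumed):
```
#include <atomic>

struct ThreadState {
  std::atomic<int> pending_signal_count{0};
};

// Non-inlined slow path: delivers whatever was queued.
__attribute__((noinline)) void ProcessPendingSignalsImpl(ThreadState *thr) {
  // ... walk the queue and deliver each pending signal ...
}

// Inlinable fast path: a single relaxed load in the common case, so every
// interceptor pays only one well-predicted branch when nothing is pending.
inline void ProcessPendingSignals(ThreadState *thr) {
  if (__builtin_expect(
          thr->pending_signal_count.load(std::memory_order_relaxed) != 0, 0))
    ProcessPendingSignalsImpl(thr);
}
```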
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D107217
Currently we inconsistently use u32 and int for thread ids;
there are also "unique tid" and "os tid", and lots of other
things identified by integers.
Additionally, the new tsan runtime will introduce yet another
thread identifier that is very different from the current tids.
Similarly for stack IDs: it's easy to confuse u32 with other
integer identifiers, and when a function accepts u32 or a struct
contains a u32 field, it's not always clear what it is.
Add Tid and StackID typedefs to make it clear what is what.
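A sketch (underlying types assumed; the real typedefs are in tsan_defs.h):
```
typedef unsigned u32;

typedef u32 Tid;      // thread id
typedef u32 StackID;  // id of a stored stack trace

// Signatures now say what they mean:
//   void ThreadJoin(ThreadState *thr, uptr pc, Tid tid);
// instead of an opaque:
//   void ThreadJoin(ThreadState *thr, uptr pc, int tid);
```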
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D107152
Currently we set up either a sigaction signal handler with 3 arguments
or an old-style signal handler with 1 argument, depending on the user handler type.
This unnecessarily complicates the code. Always set up a sigaction handler.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D107186
We frequently allocate sizeof(T) memory and call the T ctor on that
memory (effectively the C++ new keyword). Currently it's quite verbose
and usually takes 2 lines of code.
Add New<T>() helper that does it much more concisely.
Rename internal_free to Free, which also sets the pointer to nullptr.
Shorter and safer.
Rename internal_alloc to Alloc, just shorter.
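A sketch of the helpers (malloc/free stand in for the internal allocator here):
```
#include <new>      // placement new
#include <cstdlib>

void *Alloc(size_t size) { return malloc(size); }  // stand-in for internal_alloc

template <typename T, typename... Args>
T *New(Args &&...args) {
  // sizeof(T) allocation + ctor call in one expression (the old 2-line pattern).
  return new (Alloc(sizeof(T))) T(static_cast<Args &&>(args)...);
}

template <typename T>
void Free(T *&p) {
  free(p);      // stand-in for internal_free
  p = nullptr;  // the extra safety the rename adds: no dangling pointer left
}
```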
Reviewed By: vitalybuka, melver
Differential Revision: https://reviews.llvm.org/D107085
The updated lots_of_threads.c test with 300 threads
started taking too long on machines with low
hardware parallelism (e.g. under taskset -c 0-1).
On lots of CPUs it finishes in ~2 secs. But with
taskset -c 0-1 it runs for hundreds of seconds
effectively spinning in the barrier in the sleep loop.
We now have the handy futex API in sanitizer_common.
Use it instead of the passive spin loop.
With the futex the test only gets faster: with taskset -c 0-1
it runs for ~1.5 secs, while with full parallelism
it still runs for ~2 secs (but consumes less CPU time).
Depends on D107131.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D107132
We call the non-inlinable Initialize from all interceptors/syscalls,
but most of the time the runtime is already initialized and this just
introduces unnecessary overhead.
Add LazyInitialize that is (1) inlinable and (2) does nothing if
.preinit_array is enabled (the expected case on Linux).
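A sketch of the idea (SANITIZER_CAN_USE_PREINIT_ARRAY is the sanitizer_common config macro; the rest of the names are assumed):
```
struct ThreadState;
bool is_initialized;                   // set once the runtime is up
void Initialize(ThreadState *) { /* full, non-inlined runtime init */ }

inline void LazyInitialize(ThreadState *thr) {
#if !SANITIZER_CAN_USE_PREINIT_ARRAY
  // Only platforms without .preinit_array need a per-call check;
  // elsewhere this whole function compiles to nothing.
  if (__builtin_expect(!is_initialized, 0))
    Initialize(thr);
#endif
}
```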
Depends on D107071.
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D107072
Remove the pc argument of the ThreadIgnoreEnd, ThreadIgnoreSyncEnd
and AcquireGlobal functions. It's unused, and in some places
we don't even have a pc and pass 0 anyway.
Don't confuse readers and don't pretend that pc is needed
and that passing 0 is somehow deficient.
Use simpler convention for ThreadIgnoreBegin and ThreadIgnoreSyncBegin:
accept only pc instead of pc+save_stack. A pc of 0 means "don't save stack".
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D106973
Currently the barrier supports only 256 threads,
which does not allow writing reliable tests that use more threads.
Bump max number of threads to 1024 to support writing
good stress tests.
Also replace sched_yield() with usleep(100) on the wait path.
If we write tests that create hundreds of threads (and dozens
of tests can run in parallel), yield would consume massive
amounts of CPU time for spinning.
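The wait path's illustrative shape (the actual barrier lives in the tsan test harness and differs):
```
#include <unistd.h>
#include <atomic>

void barrier_wait(std::atomic<unsigned> *epoch, unsigned target) {
  while (epoch->load() < target)
    usleep(100);  // was sched_yield(): yielding keeps the CPU busy, sleeping doesn't
}
```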
Depends on D106952.
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D106953
Now that Mutex is blocking there is no point in using BlockingMutex.
Switch to Mutex.
Depends on D106379.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D106560
Now that sanitizer_common mutex has feature-parity with tsan mutex,
switch tsan to the sanitizer_common mutex and remove tsan's custom mutex.
Reviewed By: vitalybuka, melver
Differential Revision: https://reviews.llvm.org/D106379
This is preparation to switching to the sanitizer_common Mutex.
Without this change, after the switch we would start failing
on exiting from the runtime with runtime mutexes held.
Previously it worked because CheckNoLocks did not see sanitizer_common mutexes.
Depends on D106547.
Reviewed By: vitalybuka, melver
Differential Revision: https://reviews.llvm.org/D106558
signal(2) and sigaction(2) have defined behavior for invalid signal numbers
(they fail with EINVAL) and some programs rely on it.
The added test case also reveals that MSAN is too strict in this regard.
Test case passed on x86_64 Linux and AArch64 Linux.
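The relied-upon behavior, as a minimal check (illustrative; the actual lit test differs):
```
#include <csignal>
#include <cerrno>
#include <cassert>

int main() {
  errno = 0;
  // 100000 is not a valid signal number; signal() must fail with EINVAL.
  assert(signal(100000, SIG_IGN) == SIG_ERR);
  assert(errno == EINVAL);
  struct sigaction sa = {};
  errno = 0;
  assert(sigaction(100000, &sa, nullptr) == -1 && errno == EINVAL);
  return 0;
}
```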
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D106468
We obtain the current PC in all interceptors, and collectively the
common interceptor code contributes to the overall slowdown
(in particular for the cheaper str/mem* functions).
The current way to obtain the current PC involves:
4493e1: e8 3a f3 fe ff callq 438720 <_ZN11__sanitizer10StackTrace12GetCurrentPcEv>
4493e9: 48 89 c6 mov %rax,%rsi
and the called function is:
uptr StackTrace::GetCurrentPc() {
438720: 48 8b 04 24 mov (%rsp),%rax
438724: c3 retq
The new way uses address of a local label and involves just:
44a888: 48 8d 35 fa ff ff ff lea -0x6(%rip),%rsi
I am not switching all uses of StackTrace::GetCurrentPc to GET_CURRENT_PC
because it may lead to some differences in the produced reports and break tests.
The difference comes from the fact that currently we have PC pointing
to the CALL instruction, but the new way does not yield any code on its own
so the PC points to a random instruction in the function and symbolizing
that instruction can produce additional inlined frames (if the random
instruction happens to relate to some inlined function).
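A sketch of the label-address technique (GNU extension; the real GET_CURRENT_PC macro in sanitizer_common may be spelled differently, e.g. with inline asm):
```
typedef unsigned long uptr;

// Taking the address of a local label compiles to a single rip-relative lea,
// with no call and no extra code in the containing function.
#define GET_CURRENT_PC()  \
  ({                      \
    __label__ current_pc; \
  current_pc:             \
    (uptr)&&current_pc;   \
  })
```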
Reviewed By: melver
Differential Revision: https://reviews.llvm.org/D106046
Reuse the assembly glue code from sanitizer_common_interceptors.inc and
the handling logic from the __tls_get_addr interceptor.
Reviewed By: dvyukov
Differential Revision: https://reviews.llvm.org/D105629
Commit efd254b636 ("tsan: fix deadlock in pthread_atfork callbacks")
fixed another deadlock related to atfork handling.
But builders with DCHECKs enabled reported failures of
pthread_atfork_deadlock2.c and pthread_atfork_deadlock3.c tests
related to the fact that we hold runtime locks on interceptor exit:
https://lab.llvm.org/buildbot/#/builders/70/builds/6727
This issue is somewhat inherent to the current approach,
we indeed execute user code (atfork callbacks) with runtime lock held.
Refactor fork handling to not run user code (atfork callbacks)
with runtime locks held. This change does so by installing
our own atfork callbacks during runtime initialization.
Atfork callbacks run in LIFO order, so the expectation is that
our callbacks run last, right before the actual fork.
This way we lock runtime mutexes around fork, but not around
user callbacks.
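A sketch of the registration scheme (registration point and callback names are illustrative):
```
#include <pthread.h>

static void ForkPrepare() { /* lock runtime mutexes */ }
static void ForkParent()  { /* unlock them in the parent */ }
static void ForkChild()   { /* unlock/reinit them in the child */ }

void InitializePlatform() {
  // Registered during runtime init, i.e. before any user pthread_atfork call.
  // Prepare handlers run in reverse registration order (LIFO), so ours runs
  // last: user callbacks execute before we take the runtime mutexes.
  pthread_atfork(ForkPrepare, ForkParent, ForkChild);
}
```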
Extend tests to also install after-fork callbacks just to cover
more scenarios. Some tests also started reporting real races
that we previously suppressed.
Also extend tests to cover fork syscall support.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D101517
Commit efd254b636 ("tsan: fix deadlock in pthread_atfork callbacks")
fixed another deadlock related to atfork handling.
But builders with DCHECKs enabled reported failures of
pthread_atfork_deadlock2.c and pthread_atfork_deadlock3.c tests
related to the fact that we hold runtime locks on interceptor exit:
https://lab.llvm.org/buildbot/#/builders/70/builds/6727
This issue is somewhat inherent to the current approach,
we indeed execute user code (atfork callbacks) with runtime lock held.
Refactor fork handling to not run user code (atfork callbacks)
with runtime locks held. This change does so by installing
our own atfork callbacks during runtime initialization.
Atfork callbacks run in LIFO order, so the expectation is that
our callbacks run last, right before the actual fork.
This way we lock runtime mutexes around fork, but not around
user callbacks.
Extend tests to also install after-fork callbacks just to cover
more scenarios. Some tests also started reporting real races
that we previously suppressed.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D101385
We take report/thread_registry locks around fork.
This means we cannot report any bugs in atfork handlers.
We resolved this by enabling per-thread ignores around fork.
This resolved some of the cases, but not all.
The added test triggers a race report from a signal handler
called from an atfork callback; since we reset per-thread ignores
around signal handlers, we tried to report it and deadlocked.
But there are more cases: a signal handler can be called
synchronously if the thread sends the signal to itself. And any other
report type would cause a deadlock as well: mutex misuse,
a signal handler spoiling errno, etc.
Disable all reports for the duration of fork with
thr->suppress_reports and don't re-enable them around
signal handlers.
Reviewed By: vitalybuka
Differential Revision: https://reviews.llvm.org/D101154
... so that the FreeBSD-specific GetTls/glibc-specific pthread_self code can be
removed. This also helps FreeBSD arm64/powerpc64, which don't have a GetTls
implementation yet.
GetTls is the range of
* thread control block and optional TLS_PRE_TCB_SIZE
* static TLS blocks plus static TLS surplus
On glibc, lsan requires the range to include
`pthread::{specific_1stblock,specific}` so that allocations only referenced by
`pthread_setspecific` can be scanned.
This patch uses `dl_iterate_phdr` to collect TLS blocks. Find the one
with `dlpi_tls_modid==1`, which is one of the initially loaded modules, then find
consecutive ranges. The boundaries give us the addr and size.
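A hedged sketch of the collection step (the real implementation is in sanitizer_linux_libcdep.cpp and, as noted below, avoids dlpi_tls_data on musl/FreeBSD):
```
#include <link.h>
#include <cstdint>
#include <vector>

struct TlsBlock {
  uintptr_t begin, end;
  size_t modid;
};

static int CollectTlsBlocks(struct dl_phdr_info *info, size_t, void *arg) {
  if (info->dlpi_tls_modid == 0)
    return 0;  // module has no TLS segment
  uintptr_t begin = reinterpret_cast<uintptr_t>(info->dlpi_tls_data);
  if (!begin)
    return 0;  // TLS block not allocated for this thread yet
  for (unsigned i = 0; i < info->dlpi_phnum; i++)
    if (info->dlpi_phdr[i].p_type == PT_TLS) {
      static_cast<std::vector<TlsBlock> *>(arg)->push_back(
          {begin, begin + info->dlpi_phdr[i].p_memsz, info->dlpi_tls_modid});
      break;
    }
  return 0;
}

// Usage: dl_iterate_phdr(CollectTlsBlocks, &blocks); sort blocks by address,
// locate the one with modid == 1, and extend over adjacent ranges to get the
// static TLS boundary.
```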
This allows us to drop the glibc internal `_dl_get_tls_static_info` and
`InitTlsSize`. However, huge glibc x86-64 binaries with numerous shared objects
may observe a time complexity penalty, so exclude them for now. Use the simplified
method on non-Android Linux for now, but in theory this can be used with *BSD
and potentially other ELF OSes.
The removal of the RISC-V `__builtin_thread_pointer` use makes the code compilable
with more compiler versions (the builtin was added to Clang in 2020-03 and to GCC in 2020-07).
This simplification enables D99566 for TLS Variant I architectures.
Note: as of musl 1.2.2 and FreeBSD 12.2, dlpi_tls_data returned by
dl_iterate_phdr is not desired: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=254774
This can be worked around by using `__tls_get_addr({modid,0})` instead
of `dlpi_tls_data`. The workaround can be shared with the workaround for glibc<2.25.
This fixes some tests on Alpine Linux x86-64 (musl)
```
test/lsan/Linux/cleanup_in_tsd_destructor.c
test/lsan/Linux/fork.cpp
test/lsan/Linux/fork_threaded.cpp
test/lsan/Linux/use_tls_static.cpp
test/lsan/many_tls_keys_thread.cpp
test/msan/tls_reuse.cpp
```
and `test/lsan/TestCases/many_tls_keys_pthread.cpp` on glibc aarch64.
The number of sanitizer test failures does not change on FreeBSD/amd64 12.2.
Differential Revision: https://reviews.llvm.org/D98926