llvm-project

Commit Graph

Author	SHA1	Message	Date
Than McIntosh	b6374437af	tsan: capture shadow map start/end on init and reuse in reset Capture the computed shadow begin/end values at the point where the shadow is first created and reuse those values on reset. Introduce new windows-specific function "ZeroMmapFixedRegion" for zeroing out an address space region previously returned by one of the MmapFixed* routines; call this function (on windows) from DoResetImpl tsan_rtl.cpp instead of MmapFixedSuperNoReserve. See https://github.com/golang/go/issues/53539#issuecomment-1168778740 for context; intended to help with updating the syso for Go's windows/amd64 race detector. Differential Revision: https://reviews.llvm.org/D128909	2022-07-26 10:04:39 -04:00
Dmitry Vyukov	7ec308715c	tsan: prevent pathological slowdown for spurious races Prevent the following pathological behavior: Since memory access handling is not synchronized with DoReset, a thread running concurrently with DoReset can leave a bogus shadow value that will be later falsely detected as a race. For such false races RestoreStack will return false and we will not report it. However, consider that a thread leaves a whole lot of such bogus values and these values are later read by a whole lot of threads. This will cause massive amounts of ReportRace calls and lots of serialization. In very pathological cases the resulting slowdown can be >100x. This is very unlikely, but it was presumably observed in practice: https://github.com/google/sanitizers/issues/1552 If this happens, previous access sid+epoch will be the same for all of these false races b/c if the thread will try to increment epoch, it will notice that DoReset has happened and will stop producing bogus shadow values. So, last_spurious_race is used to remember the last sid+epoch for which RestoreStack returned false. Then it is used to filter out races with the same sid+epoch very early and quickly. It is of course possible that multiple threads left multiple bogus shadow values and all of them are read by lots of threads at the same time. In such case last_spurious_race will only be able to deduplicate a few races from one thread, then few from another and so on. An alternative would be to hold an array of such sid+epoch, but we consider such scenario as even less likely. Note: this can lead to some rare false negatives as well: 1. When a legit access with the same sid+epoch participates in a race as the "previous" memory access, it will be wrongly filtered out. 2. When RestoreStack returns false for a legit memory access because it was already evicted from the thread trace, we will still remember it in last_spurious_race. Then if there is another racing memory access from the same thread that happened in the same epoch, but was stored in the next thread trace part (which is still preserved in the thread trace), we will also wrongly filter it out while RestoreStack would actually succeed for that second memory access. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D130269	2022-07-25 10:40:11 +02:00
Dmitry Vyukov	7505cc301f	tsan: remove tracking of racy addresses We used to deduplicate based on the race address to prevent lots of repeated reports about the same race. But now we clear the shadow for the racy address in DoReportRace: // This prevents trapping on this address in future. for (uptr i = 0; i < kShadowCnt; i++) StoreShadow(&shadow_mem[i], i == 0 ? Shadow::kRodata : Shadow::kEmpty); It should have the same effect of not reporting duplicates (and actually better because it's automatically reset when the memory is reallocated). So drop the address deduplication code. Both simpler and faster. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D130240	2022-07-25 10:33:26 +02:00
Mariusz Borsa	8246b2e156	[Sanitizers][Darwin] Replace SANITIZER_MAC with SANITIZER_APPLE in source files This is a follow up to [Sanitizers][Darwin] Rename Apple macro SANITIZER_MAC -> SANITIZER_APPLE (D125816) Performed a global search/replace as in title against LLVM sources Differential Revision: https://reviews.llvm.org/D126263	2022-05-24 12:59:27 -07:00
Dmitry Vyukov	9e66e5872c	tsan: print signal num in errno spoiling reports For errno spoiling reports we only print the stack where the signal handler is invoked. And the top frame is the signal handler function, which is supposed to give the info for debugging. But in same cases the top frame can be some common thunk, which does not give much info. E.g. for Go/cgo it's always runtime.cgoSigtramp. Print the signal number. This is what we can easily gather and it may give at least some hints regarding the issue. Reviewed By: melver, vitalybuka Differential Revision: https://reviews.llvm.org/D121979	2022-03-18 16:12:11 +01:00
Dmitry Vyukov	765921de5b	sanitizer_common: prefix thread-safety macros with SANITIZER_ Currently we use very common names for macros like ACQUIRE/RELEASE, which cause conflicts with system headers. Prefix all macros with SANITIZER_ to avoid conflicts. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D116652	2022-01-07 15:11:00 +01:00
Dmitry Vyukov	f78d49e068	tsan: remove old vector clocks They are unused in the new tsan runtime. Depends on D112604. Reviewed By: vitalybuka, melver Differential Revision: https://reviews.llvm.org/D112605	2021-12-21 19:54:27 +01:00
Dmitry Vyukov	22a251c3d0	tsan: remove hacky call It's unused in the new tsan runtime. Depends on D112603. Reviewed By: vitalybuka, melver Differential Revision: https://reviews.llvm.org/D112604	2021-12-21 19:53:49 +01:00
Dmitry Vyukov	2eb3e20461	tsan: fix deadlock during race reporting SlotPairLocker calls SlotLock under ctx->multi_slot_mtx. SlotLock can invoke global reset DoReset if we are out of slots/epochs. But DoReset locks ctx->multi_slot_mtx as well, which leads to deadlock. Resolve the deadlock by removing SlotPairLocker/multi_slot_mtx and only lock one slot for which we will do RestoreStack. We need to lock that slot because RestoreStack accesses the slot journal. But it's unclear why we need to lock the current slot. Initially I did it just to be on the safer side (but at that time we dit not lock the second slot, so it was easy just to lock the current slot). Reviewed By: melver Differential Revision: https://reviews.llvm.org/D116040	2021-12-20 18:52:48 +01:00
Dmitry Vyukov	b332134921	tsan: new runtime (v3) This change switches tsan to the new runtime which features: - 2x smaller shadow memory (2x of app memory) - faster fully vectorized race detection - small fixed-size vector clocks (512b) - fast vectorized vector clock operations - unlimited number of alive threads/goroutimes Depends on D112602. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D112603	2021-12-13 12:48:34 +01:00
Jonas Devlieghere	396113c19f	Revert "tsan: new runtime (v3)" This reverts commit `5a33e41281` becuase it breaks LLDB. https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/39208/	2021-12-09 09:18:10 -08:00
Dmitry Vyukov	5a33e41281	tsan: new runtime (v3) This change switches tsan to the new runtime which features: - 2x smaller shadow memory (2x of app memory) - faster fully vectorized race detection - small fixed-size vector clocks (512b) - fast vectorized vector clock operations - unlimited number of alive threads/goroutimes Depends on D112602. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D112603	2021-12-09 09:09:52 +01:00
Dmitry Vyukov	09859113ed	Revert "tsan: new runtime (v3)" This reverts commit `66d4ce7e26`. Chromium tests started failing: https://bugs.chromium.org/p/chromium/issues/detail?id=1275581	2021-12-01 18:00:46 +01:00
Dmitry Vyukov	66d4ce7e26	tsan: new runtime (v3) This change switches tsan to the new runtime which features: - 2x smaller shadow memory (2x of app memory) - faster fully vectorized race detection - small fixed-size vector clocks (512b) - fast vectorized vector clock operations - unlimited number of alive threads/goroutimes Depends on D112602. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D112603	2021-11-25 18:32:04 +01:00
Weverything	1150f02c77	Revert "tsan: new runtime (v3)" This reverts commit `ebd47b0fb7`. This was causing unexpected behavior in programs.	2021-11-23 18:32:32 -08:00
Dmitry Vyukov	ebd47b0fb7	tsan: new runtime (v3) This change switches tsan to the new runtime which features: - 2x smaller shadow memory (2x of app memory) - faster fully vectorized race detection - small fixed-size vector clocks (512b) - fast vectorized vector clock operations - unlimited number of alive threads/goroutimes Differential Revision: https://reviews.llvm.org/D112603	2021-11-23 11:44:59 +01:00
Dmitry Vyukov	5f18ae3988	Revert "tsan: new runtime (v3)" Summary: This reverts commit `1784fe0532`. Broke some bots: https://lab.llvm.org/buildbot#builders/57/builds/12365 http://green.lab.llvm.org/green/job/clang-stage1-RA/25658/ Reviewers: vitalybuka, melver Subscribers:	2021-11-22 19:08:48 +01:00
Dmitry Vyukov	1784fe0532	tsan: new runtime (v3) This change switches tsan to the new runtime which features: - 2x smaller shadow memory (2x of app memory) - faster fully vectorized race detection - small fixed-size vector clocks (512b) - fast vectorized vector clock operations - unlimited number of alive threads/goroutimes Depends on D112602. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D112603	2021-11-22 15:55:39 +01:00
Dmitry Vyukov	b5ff187b7b	tsan: mmap shadow stack We used to mmap C++ shadow stack as part of the trace region before `ed7f3f5bc9` ("tsan: move shadow stack into ThreadState"), which moved the shadow stack into TLS. This started causing timeouts and OOMs on some of our internal tests that repeatedly create and destroy thousands of threads. Allocate C++ shadow stack with mmap and small pages again. This prevents the observed timeouts and OOMs. But we now need to be more careful with interceptors that run after thread finalization because FuncEntry/Exit and TraceAddEvent all need the shadow stack. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D113786	2021-11-13 09:12:55 +01:00
Dmitry Vyukov	79fbba9b79	Revert "tsan: new runtime (v3)" Summary: This reverts commit `ac95b8d954`. There is a number of bot failures: http://45.33.8.238/mac/38755/step_4.txt https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/38135/consoleFull#-148886289949ba4694-19c4-4d7e-bec5-911270d8a58c Reviewers: vitalybuka, melver Subscribers:	2021-11-12 17:49:47 +01:00
Dmitry Vyukov	ac95b8d954	tsan: new runtime (v3) This change switches tsan to the new runtime which features: - 2x smaller shadow memory (2x of app memory) - faster fully vectorized race detection - small fixed-size vector clocks (512b) - fast vectorized vector clock operations - unlimited number of alive threads/goroutimes Depends on D112602. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D112603	2021-11-12 14:31:49 +01:00
Dmitry Vyukov	e91595bf94	tsan: don't start background thread after clone Start the background thread only after fork, but not after clone. For fork we did this always and it's known to work (or user code has adopted). But if we do this for the new clone interceptor some code (sandbox2) fails. So model we used to do for years and don't start the background thread after clone. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D113744	2021-11-12 12:58:49 +01:00
Dmitry Vyukov	d736002e90	tsan: move memory access functions to a separate file tsan_rtl.cpp is huge and does lots of things. Move everything related to memory access and tracing to a separate tsan_rtl_access.cpp file. No functional changes, only code movement. Reviewed By: vitalybuka, melver Differential Revision: https://reviews.llvm.org/D112625	2021-10-28 13:31:10 +02:00
Dmitry Vyukov	a0ed71ff29	tsan: make cur_thread_init return cur_thread Whenever we call cur_thread_init, we call cur_thread on the next line. So make cur_thread_init return the current thread directly. Makes code a bit shorter, does not affect codegen. Reviewed By: vitalybuka, melver Differential Revision: https://reviews.llvm.org/D110384	2021-10-05 15:24:52 +02:00
Dmitry Vyukov	354ded67b3	tsan: align ThreadState to cache line There are 2 reasons to do this: 1. We place hot data in the first cache line of ThreadState, this assumed that it's cache-line-aligned but we never actually enforced it (or it was lost at some point). 2. The new vector clock uses vector instructions and requires data alignment. Later the new vector clock will be embedded in ThreadState, then ensuring vector clock alignment will be impossible w/o ThreadState alignment. Depends on D110519. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D110520	2021-09-27 12:54:09 +02:00
Dmitry Vyukov	ed7f3f5bc9	tsan: move shadow stack into ThreadState Currently the shadow stack is located in the trace memory mapping. The new tsan runtime will remove the trace memory mapping. Move the shadow stack into ThreadState as a preparation step. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D110519	2021-09-27 12:53:02 +02:00
Dmitry Vyukov	b02938439d	tsan: uninline RacyStacks::operator== It's only used during race reporting. There is no point in polluting the main header file with it. Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D110470	2021-09-25 12:08:51 +02:00
Dmitry Vyukov	7faf1285f2	tsan: remove expected race leftover Remove nmissed_expected variable. It's a leftover from removed "expected race" feature and is never incremented. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D110321	2021-09-23 14:13:26 +02:00
Dmitry Vyukov	702cb7afe9	tsan: move shadow declaration into a separate header file (NFC) tsan_rtl.h is very huge and contains too many things. Move FastState and Shadow types into a new tsan_shadow.h file. This also allows to use FastState/Shadow in other header files without creating circular dependencies (which most likely will happen today). Reviewed By: melver Differential Revision: https://reviews.llvm.org/D110272	2021-09-23 11:04:43 +02:00
Dmitry Vyukov	e8101f2149	tsan: move mem profile initialization into separate function BackgroundThread function is quite large, move mem profile initialization into a separate function. Depends on D110151. Reviewed By: melver, vitalybuka Differential Revision: https://reviews.llvm.org/D110152	2021-09-22 10:18:08 +02:00
Dmitry Vyukov	20ee72d4cc	tsan: don't call dlsym during exit dlsym calls into dynamic linker which calls malloc and other things. It's problematic to do it during the actual exit, because it can happen from a singal handler or from within the runtime after we reported the first bug, etc. See https://github.com/google/sanitizers/issues/1440 for an example (captured in the added test). Initialize the callbacks during startup instead. Depends on D110159. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D110166	2021-09-22 07:11:59 +02:00
Dmitry Vyukov	c97318996f	tsan: add new trace Add structures for the new trace format, functions that serialize and add events to the trace and trace replaying logic. Differential Revision: https://reviews.llvm.org/D107911	2021-08-16 10:24:11 +02:00
Dmitry Vyukov	959076c596	tsan: remove dependencies on HAS_48_BIT_ADDRESS_SPACE and Mapping Remove direct uses of Mapping in preperation for removing Mapping type (which we already don't have for all platforms). Remove dependence on HAS_48_BIT_ADDRESS_SPACE in preparation for removing it. As far as I see for Apple/Mac platforms !HAS_48_BIT_ADDRESS_SPACE simply means SANITIZER_IOS. Depends on D107741. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107742	2021-08-10 20:07:23 +02:00
Dmitry Vyukov	a82c7476a7	tsan: introduce RawShadow type Currently we hardcode u64 type for shadow everywhere and do lots of uptr<->u64* casts. It makes it hard to change u64 to another type (e.g. u32) and makes it easy to introduce bugs. Introduce RawShadow type and use it in MemToShadow, ShadowToMem, IsShadowMem and throughout the code base as u64 replacement. This makes it possible to change u64 to something else in future and generally improves static typing. Depends on D107481. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D107482	2021-08-05 13:37:10 +02:00
Dmitry Vyukov	bdeb15c34e	tsan: remove non-existent MemoryAccessRangeStep Probably was used for Go at some point... Depends on D107466. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107467	2021-08-04 18:04:06 +02:00
Dmitry Vyukov	c2598be8bc	tsan: move AccessType to tsan_defs.h It will be needed in more functions like ReportRace (the plan is to pass it through MemoryAccess to ReportRace) and this move will allow to split the huge tsan_rtl.h into parts (e.g. move FastState/Shadow definitions to a separate header). Depends on D107465. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107466	2021-08-04 18:03:56 +02:00
Dmitry Vyukov	2ddaffdc74	tsan: introduce kAccessExternalPC Add kAccessExternal memory access flag that denotes memory accesses with PCs that may have kExternalPCBit set. In preparation for MemoryAccess refactoring. Currently unused, but will allow to skip a branch. Depends on D107464. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107465	2021-08-04 18:03:49 +02:00
Dmitry Vyukov	d41233e9cf	tsan: introduce kAccessFree Add kAccessFree memory access flag (similar to kAccessVptr). In preparation for MemoryAccess refactoring. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107464	2021-08-04 18:03:41 +02:00
Dmitry Vyukov	18c6ed2f0f	tsan: add AccessVptr Add AccessVptr access type. For now it's converted to the same thr->is_vptr_access, but later it will be passed directly to ReportRace and will enable efficient tail calling in MemoryAccess function (currently __tsan_vptr_update/__tsan_vptr_read can't use tail calls in MemoryAccess because of the trailing assignment to thr->is_vptr_access). Depends on D107276. Reviewed By: vitalybuka, melver Differential Revision: https://reviews.llvm.org/D107282	2021-08-03 11:03:36 +02:00
Dmitry Vyukov	831910c5c4	tsan: new MemoryAccess interface Currently we have MemoryAccess function that accepts "bool kAccessIsWrite, bool kIsAtomic" and 4 wrappers: MemoryRead/MemoryWrite/MemoryReadAtomic/MemoryWriteAtomic. Such scheme with bool flags is not particularly scalable/extendable. Because of that we did not have Read/Write wrappers for UnalignedMemoryAccess, and "true, false" or "false, true" at call sites is not very readable. Moreover, the new tsan runtime will introduce more flags (e.g. move "freed" and "vptr access" to memory acccess flags). We can't have 16 wrappers and each flag also takes whole 64-bit register for non-inlined calls. Introduce AccessType enum that contains bit mask of read/write, atomic/non-atomic, and later free/non-free, vptr/non-vptr. Such scheme is more scalable, more readble, more efficient (don't consume multiple registers for these flags during calls) and allows to cover unaligned and range variations of memory access functions as well. Also switch from size log to just size. The new tsan runtime won't have the limitation of supporting only 1/2/4/8 access sizes, so we don't need the logarithms. Also add an inline thunk that converts the new interface to the old one. For inlined calls it should not add any overhead because all flags/size can be computed as compile time. Reviewed By: vitalybuka, melver Differential Revision: https://reviews.llvm.org/D107276	2021-08-03 11:03:23 +02:00
Dmitry Vyukov	8a49e053ca	tsan: inline ProcessPendingSignals check ProcessPendingSignals is called in all interceptors and user atomic operations. Make the fast-path check (no pending signals) inlinable. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107217	2021-08-02 11:05:45 +02:00
Dmitry Vyukov	103d075b05	tsan: introduce Tid and StackID typedefs Currently we inconsistently use u32 and int for thread ids, there are also "unique tid" and "os tid" and just lots of other things identified by integers. Additionally new tsan runtime will introduce yet another thread identifier that is very different from current tids. Similarly for stack IDs, it's easy to confuse u32 with other integer identifiers. And when a function accepts u32 or a struct contains u32 field, it's not always clear what it is. Add Tid and StackID typedefs to make it clear what is what. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107152	2021-07-31 09:05:31 +02:00
Dmitry Vyukov	5b30ebed96	tsan: remove "expected" races "Expected" races is a very ancient facility used in tsanv1 tests. It's not used/needed anymore. Remove it. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D107175	2021-07-31 08:54:49 +02:00
Dmitry Vyukov	9e9599ef78	tsan: introduce LazyInitialize We call non-inlinable Initialize from all interceptors/syscalls, but most of the time runtime is already initialized and this just introduces unnecessary overhead. Add LazyInitialize that (1) inlinable, (2) does nothing if .preinit_array is enabled (expected case on Linux). Depends on D107071. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107072	2021-07-29 17:19:29 +02:00
Dmitry Vyukov	0d68cfc996	tsan: store ThreadRegistry in Context by value It's unclear why we allocate ThreadRegistry separately, I assume it's some historical leftover. Embed ThreadRegistry into Context. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D107045	2021-07-29 12:44:44 +02:00
Dmitry Vyukov	5237b14087	tsan: print alloc stack for Java objects We maintain information about Java allocations, but for some reason never printed it in reports. Print it. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D106956	2021-07-28 20:25:11 +02:00
Dmitry Vyukov	b5bc386ca1	tsan: remove mblock types We used to count number of allocations/bytes based on the type and maybe record them in heap block headers. But that's all in the past, now it's not used for anything. Remove the mblock type. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D106971	2021-07-28 20:09:25 +02:00
Dmitry Vyukov	5acdfb7eda	tsan: remove unused pc arguments Remove pc argument of ThreadIgnoreEnd, ThreadIgnoreSyncEnd and AcquireGlobal functions. It's unused and in some places we don't even have a pc and pass 0 anyway. Don't confuse readers and don't pretend that pc is needed and that passing 0 is somehow deficient. Use simpler convention for ThreadIgnoreBegin and ThreadIgnoreSyncBegin: accept only pc instread of pc+save_stack. 0 pc means "don't save stack". Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D106973	2021-07-28 20:07:49 +02:00
Dmitry Vyukov	c3044a5db7	tsan: fix SANITIZER_DEBUG build Remove left-over debug field after moving tsan deadlock detector into sanitizer_common. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D106638	2021-07-23 10:56:58 +02:00
Dmitry Vyukov	adb55d7c32	tsan: remove the stats subsystem I don't think the stat subsystem was ever used since tsan development in 2012. But it adds lots of code and this effectively dead code needs to be updated if the runtime code changes, which adds maintanance cost for no benefit. Normal profiler usually gives enough info and that info is more trustworthy. Remove the stats subsystem. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D106276	2021-07-20 07:47:38 +02:00

1 2 3 4 5 ...

258 Commits