llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazu Hirata	4db2e4cebe	Use {DenseSet,SetVector,SmallPtrSet}::contains (NFC)	2021-10-30 19:00:19 -07:00
Zarko Todorovski	8659b241ae	[clang][NFC] Inclusive terms: Replace uses of whitelist in clang/lib/StaticAnalyzer Replace variable and functions names, as well as comments that contain whitelist with more inclusive terms. Reviewed By: aaron.ballman, martong Differential Revision: https://reviews.llvm.org/D112642	2021-10-29 16:51:36 -04:00
Balazs Benics	49285f43e5	[analyzer] sprintf is a taint propagator not a source Due to a typo, `sprintf()` was recognized as a taint source instead of a taint propagator. It was because an empty taint source list - which is the first parameter of the `TaintPropagationRule` - encoded the unconditional taint sources. This typo effectively turned the `sprintf()` into an unconditional taint source. This patch fixes that typo and demonstrated the correct behavior with tests. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D112558	2021-10-28 11:03:02 +02:00
Balazs Benics	c18407217e	[analyzer] Fix StringChecker for Unknown params It seems like protobuf crashed the `std::string` checker. Somehow it acquired `UnknownVal` as the sole `std::string` constructor parameter, causing a crash in the `castAs<Loc>()`. This patch addresses this. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D112551	2021-10-26 18:15:00 +02:00
Balazs Benics	e1fdec875f	[analyzer] Add std::string checker This patch adds a checker checking `std::string` operations. At first, it only checks the `std::string` single `const char *` constructor for nullness. If It might be `null`, it will constrain it to non-null and place a note tag there. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D111247	2021-10-25 11:15:40 +02:00
Kazu Hirata	d8e4170b0a	Ensure newlines at the end of files (NFC)	2021-10-23 08:45:29 -07:00
Simon Pilgrim	7562f3df89	InvalidPtrChecker - don't dereference a dyn_cast<> - use cast<> instead. Avoid dereferencing a nullptr returned by dyn_cast<>, by using cast<> instead which asserts that the cast is valid.	2021-10-20 18:06:00 +01:00
Balazs Benics	16be17ad4b	[analyzer][NFC] Refactor llvm::isa<> usages in the StaticAnalyzer It turns out llvm::isa<> is variadic, and we could have used this at a lot of places. The following patterns: x && isa<T1>(x) \|\| isa<T2>(x) ... Will be replaced by: isa_and_non_null<T1, T2, ...>(x) Sometimes it caused further simplifications, when it would cause even more code smell. Aside from this, keep in mind that within `assert()` or any macro functions, we need to wrap the isa<> expression within a parenthesis, due to the parsing of the comma. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D111982	2021-10-20 17:43:31 +02:00
Kazu Hirata	0abb5d293c	[Sema, StaticAnalyzer] Use StringRef::contains (NFC)	2021-10-20 08:02:36 -07:00
Kazu Hirata	d245f2e859	[clang] Use llvm::erase_if (NFC)	2021-10-17 13:50:29 -07:00
Kazu Hirata	6a154e606e	[clang] Use llvm::is_contained (NFC)	2021-10-15 10:07:08 -07:00
Kazu Hirata	e567f37dab	[clang] Use llvm::is_contained (NFC)	2021-10-13 20:41:55 -07:00
Balazs Benics	edde4efc66	[analyzer] Introduce the assume-controlled-environment config option If the `assume-controlled-environment` is `true`, we should expect `getenv()` to succeed, and the result should not be considered tainted. By default, the option will be `false`. Reviewed By: NoQ, martong Differential Revision: https://reviews.llvm.org/D111296	2021-10-13 10:50:26 +02:00
Balazs Benics	7fc150309d	[analyzer] Bifurcate on getenv() calls The `getenv()` function might return `NULL` just like any other function. However, in case of `getenv()` a state-split seems justified since the programmer should expect the failure of this function. `secure_getenv(const char *name)` behaves the same way but is not handled right now. Note that `std::getenv()` is also not handled. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D111245	2021-10-13 10:50:26 +02:00
Artem Dergachev	f3ec9d8501	[analyzer] Fix non-obvious analyzer warning: Use of zero-allocated memory. Clarify the message provided when the analyzer catches the use of memory that is allocated with size zero. Differential Revision: https://reviews.llvm.org/D111655	2021-10-12 10:41:00 -07:00
Corentin Jabot	424733c12a	Implement if consteval (P1938) Modify the IfStmt node to suppoort constant evaluated expressions. Add a new ExpressionEvaluationContext::ImmediateFunctionContext to keep track of immediate function contexts. This proved easier/better/probably more efficient than walking the AST backward as it allows diagnosing nested if consteval statements.	2021-10-05 08:04:14 -04:00
Zurab Tsinadze	811b1736d9	[analyzer] Add InvalidPtrChecker This patch introduces a new checker: `alpha.security.cert.env.InvalidPtr` Checker finds usage of invalidated pointers related to environment. Based on the following SEI CERT Rules: ENV34-C: https://wiki.sei.cmu.edu/confluence/x/8tYxBQ ENV31-C: https://wiki.sei.cmu.edu/confluence/x/5NUxBQ Reviewed By: martong Differential Revision: https://reviews.llvm.org/D97699	2021-10-04 17:08:34 +02:00
Gabor Marton	96ec9b6ff2	[Analyzer] ConversionChecker: track back the cast expression Adding trackExpressionValue to the checker so it tracks the value of the implicit cast's DeclRefExpression up to initialization/assignment. This way the report becomes cleaner. Differential Revision: https://reviews.llvm.org/D109836	2021-09-16 11:42:54 +02:00
Kristóf Umann	9d359f6c73	[analyzer] MallocChecker: Add notes from NoOwnershipChangeVisitor only when a function "intents", but doesn't change ownership, enable by default D105819 Added NoOwnershipChangeVisitor, but it is only registered when an off-by-default, hidden checker option was enabled. The reason behind this was that it grossly overestimated the set of functions that really needed a note: std::string getTrainName(const Train T) { return T->name; } // note: Retuning without changing the ownership of or deallocating memory // Umm... I mean duh? Nor would I expect this function to do anything like that... void foo() { Train T = new Train("Land Plane"); print(getTrainName(T)); // note: calling getTrainName / returning from getTrainName } // warn: Memory leak This patch adds a heuristic that guesses that any function that has an explicit operator delete call could have be responsible for deallocating the memory that ended up leaking. This is waaaay too conservative (see the TODOs in the new function), but it safer to err on the side of too little than too much, and would allow us to enable the option by default now, and add refinements one-by-one. Differential Revision: https://reviews.llvm.org/D108753	2021-09-13 15:01:20 +02:00
Kristóf Umann	0213d7ec0c	[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it Fix a compilation error due to a missing 'template' keyword. Differential Revision: https://reviews.llvm.org/D108695	2021-09-13 13:50:01 +02:00
Jessica Paquette	b9e57e0305	Revert "[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it" This reverts commit `a375bfb5b7`. This was causing a bot to crash: https://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/23380/	2021-09-03 10:28:07 -07:00
Kristóf Umann	a375bfb5b7	[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it D105553 added NoStateChangeFuncVisitor, an abstract class to aid in creating notes such as "Returning without writing to 'x'", or "Returning without changing the ownership status of allocated memory". Its clients need to define, among other things, what a change of state is. For code like this: f() { g(); } foo() { f(); h(); } We'd have a path in the ExplodedGraph that looks like this: -- <g> --> / \ --- <f> --------> --- <h> ---> / \ / \ -------- <foo> ------ <foo> --> When we're interested in whether f neglected to change some property, NoStateChangeFuncVisitor asks these questions: ÷×~ -- <g> --> ß / \$ @&#* --- <f> --------> --- <h> ---> / \ / \ -------- <foo> ------ <foo> --> Has anything changed in between # and ? Has anything changed in between & and ? Has anything changed in between @ and ? ... Has anything changed in between $ and ? Has anything changed in between × and ~? Has anything changed in between ÷ and ~? ... Has anything changed in between ß and ? ... This is a rather thorough line of questioning, which is why in D105819, I was only interested in whether state right before* and right after a function call changed, and early returned to the CallEnter location: if (!CurrN->getLocationAs<CallEnter>()) return; Except that I made a typo, and forgot to negate the condition. So, in this patch, I'm fixing that, and under the same hood allow all clients to decide to do this whole-function check instead of the thorough one. Differential Revision: https://reviews.llvm.org/D108695	2021-09-03 13:50:18 +02:00
Kristóf Umann	3891b45a06	Revert "[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it" This reverts commit `7d0e62bfb7`.	2021-09-02 17:19:49 +02:00
Kristóf Umann	7d0e62bfb7	[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it D105553 added NoStateChangeFuncVisitor, an abstract class to aid in creating notes such as "Returning without writing to 'x'", or "Returning without changing the ownership status of allocated memory". Its clients need to define, among other things, what a change of state is. For code like this: f() { g(); } foo() { f(); h(); } We'd have a path in the ExplodedGraph that looks like this: -- <g> --> / \ --- <f> --------> --- <h> ---> / \ / \ -------- <foo> ------ <foo> --> When we're interested in whether f neglected to change some property, NoStateChangeFuncVisitor asks these questions: ÷×~ -- <g> --> ß / \$ @&#* --- <f> --------> --- <h> ---> / \ / \ -------- <foo> ------ <foo> --> Has anything changed in between # and ? Has anything changed in between & and ? Has anything changed in between @ and ? ... Has anything changed in between $ and ? Has anything changed in between × and ~? Has anything changed in between ÷ and ~? ... Has anything changed in between ß and ? ... This is a rather thorough line of questioning, which is why in D105819, I was only interested in whether state right before* and right after a function call changed, and early returned to the CallEnter location: if (!CurrN->getLocationAs<CallEnter>()) return; Except that I made a typo, and forgot to negate the condition. So, in this patch, I'm fixing that, and under the same hood allow all clients to decide to do this whole-function check instead of the thorough one. Differential Revision: https://reviews.llvm.org/D108695	2021-09-02 16:56:32 +02:00
Balazs Benics	68088563fb	[analyzer] MallocOverflow should consider comparisons only preceding malloc MallocOverflow works in two phases: 1) Collects suspicious malloc calls, whose argument is a multiplication 2) Filters the aggregated list of suspicious malloc calls by iterating over the BasicBlocks of the CFG looking for comparison binary operators over the variable constituting in any suspicious malloc. Consequently, it suppressed true-positive cases when the comparison check was after the malloc call. In this patch the checker will consider the relative position of the relation check to the malloc call. E.g.: ```lang=C++ void check_after_malloc(int n, int x) { int p = NULL; if (x == 42) p = malloc(n * sizeof(int)); // Previously no warning, now it // warns about this. // The check is after the allocation! if (n > 10) { // Do something conditionally. } return p; } ``` Reviewed By: martong Differential Revision: https://reviews.llvm.org/D107804	2021-08-27 14:41:26 +02:00
Balazs Benics	6ad47e1c4f	[analyzer] Catch leaking stack addresses via stack variables Not only global variables can hold references to dead stack variables. Consider this example: void write_stack_address_to(char *q) { char local; q = &local; } void test_stack() { char p; write_stack_address_to(&p); } The address of 'local' is assigned to 'p', which becomes a dangling pointer after 'write_stack_address_to()' returns. The StackAddrEscapeChecker was looking for bindings in the store which referred to variables of the popped stack frame, but it only considered global variables in this regard. This patch relaxes this, catching stack variable bindings as well. --- This patch also works for temporary objects like: struct Bar { const int &ref; explicit Bar(int y) : ref(y) { // Okay. } // End of the constructor call, `ref` is dangling now. Warning! }; void test() { Bar{33}; // Temporary object, so the corresponding memregion is // not* a VarRegion. } --- The return value optimization aka. copy-elision might kick in but that is modeled by passing an imaginary CXXThisRegion which refers to the parent stack frame which is supposed to be the 'return slot'. Objects residing in the 'return slot' outlive the scope of the inner call, thus we should expect no warning about them - except if we explicitly disable copy-elision. Reviewed By: NoQ, martong Differential Revision: https://reviews.llvm.org/D107078	2021-08-27 11:31:16 +02:00
Rong Xu	9b8425e42c	Reapply commit `b7425e956` The commit b7425e956: [NFC] fix typos is harmless but was reverted by accident. Reapply.	2021-08-16 12:18:40 -07:00
Kostya Kortchinsky	80ed75e7fb	Revert "[NFC] Fix typos" This reverts commit `b7425e956b`.	2021-08-16 11:13:05 -07:00
Rong Xu	b7425e956b	[NFC] Fix typos s/senstive/senstive/g	2021-08-16 10:15:30 -07:00
Kristóf Umann	2d3668c997	[analyzer] MallocChecker: Add a visitor to leave a note on functions that could have, but did not change ownership on leaked memory This is a rather common feedback we get from out leak checkers: bug reports are really short, and are contain barely any usable information on what the analyzer did to conclude that a leak actually happened. This happens because of our bug report minimizing effort. We construct bug reports by inspecting the ExplodedNodes that lead to the error from the bottom up (from the error node all the way to the root of the exploded graph), and mark entities that were the cause of a bug, or have interacted with it as interesting. In order to make the bug report a bit less verbose, whenever we find an entire function call (from CallEnter to CallExitEnd) that didn't talk about any interesting entity, we prune it (click here for more info on bug report generation). Even if the event to highlight is exactly this lack of interaction with interesting entities. D105553 generalized the visitor that creates notes for these cases. This patch adds a new kind of NoStateChangeVisitor that leaves notes in functions that took a piece of dynamically allocated memory that later leaked as parameter, and didn't change its ownership status. Differential Revision: https://reviews.llvm.org/D105553	2021-08-16 16:19:00 +02:00
Balázs Kéri	9f517fd11e	[clang][analyzer] Improve bug report in alpha.security.ReturnPtrRange Add some notes and track of bad return value. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D107051	2021-08-11 13:04:55 +02:00
Deep Majumder	80068ca623	[analyzer] Fix for faulty namespace test in SmartPtrModelling This patch: - Fixes how the std-namespace test is written in SmartPtrModelling (now accounts for functions with no Decl available) - Adds the smart pointer checker flag check where it was missing Differential Revision: https://reviews.llvm.org/D106296	2021-07-21 18:23:35 +05:30
Balázs Kéri	90cb5297ad	[clang][analyzer] Improve report of file read at EOF condition (alpha.unix.Stream checker). The checker warns if a stream is read that is already in end-of-file (EOF) state. The commit adds indication of the last location where the EOF flag is set on the stream. Reviewed By: Szelethus Differential Revision: https://reviews.llvm.org/D104925	2021-07-21 08:54:11 +02:00
Deep Majumder	d825309352	[analyzer] Handle std::make_unique Differential Revision: https://reviews.llvm.org/D103750	2021-07-18 19:54:28 +05:30
Deep Majumder	0cd98bef1b	[analyzer] Handle std::swap for std::unique_ptr This patch handles the `std::swap` function specialization for `std::unique_ptr`. Implemented to be very similar to how `swap` method is handled Differential Revision: https://reviews.llvm.org/D104300	2021-07-18 14:38:55 +05:30
Deep Majumder	13fe78212f	[analyzer] Handle << operator for std::unique_ptr This patch handles the `<<` operator defined for `std::unique_ptr` in the std namespace (ignores custom overloads of the operator). Differential Revision: https://reviews.llvm.org/D105421	2021-07-16 12:34:30 +05:30
Deep Majumder	48688257c5	[analyzer] Model comparision methods of std::unique_ptr This patch handles all the comparision methods (defined via overloaded operators) on std::unique_ptr. These operators compare the underlying pointers, which is modelled by comparing the corresponding inner-pointer SVal. There is also a special case for comparing the same pointer. Differential Revision: https://reviews.llvm.org/D104616	2021-07-16 09:54:05 +05:30
David Blaikie	1def2579e1	PR51018: Remove explicit conversions from SmallString to StringRef to future-proof against C++23 C++23 will make these conversions ambiguous - so fix them to make the codebase forward-compatible with C++23 (& a follow-up change I've made will make this ambiguous/invalid even in <C++23 so we don't regress this & it generally improves the code anyway)	2021-07-08 13:37:57 -07:00
Georgy Komarov	c558b1fca7	[analyzer] Fix calculating offset for fields with an empty type Fix offset calculation routines in padding checker to avoid assertion errors described in bugzilla issue 50426. The fields that are subojbects of zero size, marked with [[no_unique_address]] or empty bitfields will be excluded from padding calculation routines. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D104097	2021-07-04 06:57:11 +03:00
Martin Storsjö	e5c7c171e5	[clang] Rename StringRef _lower() method calls to _insensitive() This is mostly a mechanical change, but a testcase that contains parts of the StringRef class (clang/test/Analysis/llvm-conventions.cpp) isn't touched.	2021-06-25 00:22:01 +03:00
Balázs Kéri	d7227a5bc7	[clang][Analyzer] Track null stream argument in alpha.unix.Stream . The checker contains check for passing a NULL stream argument. This change should make more easy to identify where the passed pointer becomes NULL. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D104640	2021-06-22 11:16:56 +02:00
Valeriy Savchenko	57006d2f6d	[analyzer] Refactor trackExpressionValue to accept TrackingOptions Differential Revision: https://reviews.llvm.org/D103633	2021-06-11 12:49:04 +03:00
Valeriy Savchenko	b6bcf95322	[analyzer] Change FindLastStoreBRVisitor to use Tracker Additionally, this commit completely removes any uses of FindLastStoreBRVisitor from the analyzer except for the one in Tracker. The next step is actually removing this class altogether from the header file. Differential Revision: https://reviews.llvm.org/D103618	2021-06-11 12:49:03 +03:00
Matheus Izvekov	aef5d8fdc7	[clang] NFC: Rename rvalue to prvalue This renames the expression value categories from rvalue to prvalue, keeping nomenclature consistent with C++11 onwards. C++ has the most complicated taxonomy here, and every other language only uses a subset of it, so it's less confusing to use the C++ names consistently, and mentally remap to the C names when working on that context (prvalue -> rvalue, no xvalues, etc). Renames: * VK_RValue -> VK_PRValue * Expr::isRValue -> Expr::isPRValue * SK_QualificationConversionRValue -> SK_QualificationConversionPRValue * JSON AST Dumper Expression nodes value category: "rvalue" -> "prvalue" Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D103720	2021-06-09 12:27:10 +02:00
Valeriy Savchenko	92d03c20ea	[analyzer] Add forwarding `addVisitor` method The majority of all `addVisitor` callers follow the same pattern: addVisitor(std::make_unique<SomeVisitor>(arg1, arg2, ...)); This patches introduces additional overload for `addVisitor` to simplify that pattern: addVisitor<SomeVisitor>(arg1, arg2, ...); Differential Revision: https://reviews.llvm.org/D103457	2021-06-03 17:10:16 +03:00
Xuanda Yang	620cef9120	[analyzer] MallocSizeof: sizeof pointer type is compatible with void* source: https://bugs.llvm.org/show_bug.cgi?id=50214 Make sizeof pointer type compatible with void* in MallocSizeofChecker. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D103358	2021-05-30 09:51:41 +08:00
Valeriy Savchenko	e273918038	[analyzer] Track leaking object through stores Since we can report memory leaks on one variable, while the originally allocated object was stored into another one, we should explain how did it get there. rdar://76645710 Differential Revision: https://reviews.llvm.org/D100852	2021-04-28 18:37:38 +03:00
Valeriy Savchenko	61ae2db2d7	[analyzer] Adjust the reported variable name in retain count checker When reporting leaks, we try to attach the leaking object to some variable, so it's easier to understand. Before the patch, we always tried to use the first variable that stored the object in question. This can get very confusing for the user, if that variable doesn't contain that object at the moment of the actual leak. In many cases, the warning is dismissed as false positive and it is effectively a false positive when we fail to properly explain the warning to the user. This patch addresses the bigest issue in cases like this. Now we check if the variable still contains the leaking symbolic object. If not, we look for the last variable to actually hold it and use that variable instead. rdar://76645710 Differential Revision: https://reviews.llvm.org/D100839	2021-04-28 18:37:37 +03:00
Valeriy Savchenko	1dad8c5036	[analyzer][NFC] Remove duplicated work from retain count leak report Allocation site is the key location for the leak checker. It is a uniqueing location for the report and a source of information for the warning's message. Before this patch, we calculated and used it twice in bug report and in bug report visitor. Such duplication is not only harmful performance-wise (not much, but still), but also design-wise. Because changing something about the end piece of the report should've been repeated for description as well. Differential Revision: https://reviews.llvm.org/D100626	2021-04-28 18:37:37 +03:00
Gabor Marton	4b99f9c7db	[analyzer][StdLibraryFunctionsChecker] Track dependent arguments When we report an argument constraint violation, we should track those other arguments that participate in the evaluation of the violation. By default, we depend only on the argument that is constrained, however, there are some special cases like the buffer size constraint that might be encoded in another argument(s). Differential Revision: https://reviews.llvm.org/D101358	2021-04-27 15:35:58 +02:00

1 2 3 4 5 ...

2127 Commits