llvm-project

Commit Graph

Author	SHA1	Message	Date
Anna Zaks	4278234360	[analyzer] Teach the analyzer about implicit initialization of statics in ObjCMethods. Extend FunctionTextRegion to represent ObjC methods as well as functions. Note, it is not clear what type ObjCMethod region should return. Since the type of the FunctionText region is not currently used, defer solving this issue. llvm-svn: 164046	2012-09-17 19:13:56 +00:00
Anna Zaks	f6a5d793d2	[analyzer] Don't reimplement an existing function. Thanks Jordan. llvm-svn: 163762	2012-09-13 00:37:12 +00:00
Ted Kremenek	8b3f938697	Refactor logic in ExprEngine for detecting 'noreturn' methods in NSException to a helper object in libAnalysis that can also be used by Sema. Not sure if the predicate name 'isImplicitNoReturn' is the best one, but we can massage that later. No functionality change. llvm-svn: 163759	2012-09-13 00:21:31 +00:00
Anna Zaks	5d2964e770	[analyzer] Do not report use of undef on "return foo();" when the return type is void. Fixes a false positive found by analyzing LLVM code base. llvm-svn: 163750	2012-09-12 22:57:40 +00:00
Anna Zaks	e663b80975	[analyzer] Teach UndefOrNullArgVisitor to track parent regions. llvm-svn: 163748	2012-09-12 22:57:30 +00:00
Jordan Rose	5297748e3f	[analyzer] Fix another use of the address of a temporary, like r163402. Again, GCC is more aggressive about reusing temporary space than we are, leading to Release build crashes for this undefined behavior. PR13710 (though it may not be the only problem there) llvm-svn: 163747	2012-09-12 22:48:08 +00:00
Jordan Rose	d44977ef64	[analyzer] Handle when the dynamic type is worse than the static type. Currently we don't update the dynamic type of a C++ object when it is cast. This can cause the situation above, where the static type of the region is now known to be a subclass of the dynamic type. Once we start updating DynamicTypeInfo in response to the various kinds of casts in C++, we can re-add this assert to make sure we don't miss any cases. This work is tracked by <rdar://problem/12287087>. In -Asserts builds, we will simply not return any runtime definition when our DynamicTypeInfo is known to be incorrect like this. llvm-svn: 163745	2012-09-12 21:48:17 +00:00
Jordan Rose	99c6c2b4e2	Revert "[analyzer] Use the static type for a virtual call if the dynamic type is worse." Using the static type may be inconsistent with later calls. We should just report that there is no inlining definition available if the static type is better than the dynamic type. See next commit. This reverts r163644 / 19d5886d1704e24282c86217b09d5c6d35ba604d. llvm-svn: 163744	2012-09-12 21:48:13 +00:00
Ted Kremenek	ba22a035ad	Fix regression where "looping back to the head of" PathDiagnosticEvents were not emitted. Fixes <rdar://problem/12280665>. llvm-svn: 163683	2012-09-12 06:22:18 +00:00
Richard Smith	b15fe3a5e4	PR13811: Add a FunctionParmPackExpr node to handle references to function parameter packs where the reference is not being expanded but the pack has been. Previously, Clang would segfault in such cases. llvm-svn: 163672	2012-09-12 00:56:43 +00:00
Jordan Rose	a522f1cf8b	Revert "[analyzer] Disable STL inlining. Blocked by PR13724." While PR13724 is still an issue, it's not actually an issue in the STL. We can keep this option around in case there turn out to be widespread false positives due to poor modeling of the C++ standard library functions, but for now we'd like to get more data. This reverts r163633 / c6baadceec1d5148c20ee6c902a102233c547f62. llvm-svn: 163647	2012-09-11 20:26:49 +00:00
Jordan Rose	e35fdeb330	[analyzer] Use the static type for a virtual call if the dynamic type is worse. reinterpret_cast does not provide any of the usual type information that static_cast or dynamic_cast provide -- only the new type. This can get us in a situation where the dynamic type info for an object is actually a superclass of the static type, which does not match what CodeGen does at all. In these cases, just fall back to the static type as the best possible type for devirtualization. Should fix the crashes on our internal buildbot. llvm-svn: 163644	2012-09-11 18:47:13 +00:00
Anna Zaks	464493fbf4	[analyzer] Disable STL inlining. Blocked by PR13724. llvm-svn: 163633	2012-09-11 17:15:39 +00:00
Jordan Rose	12f669e3cd	[analyzer] Member function calls that use qualified names are non-virtual. C++11 [expr.call]p1: ...If the selected function is non-virtual, or if the id-expression in the class member access expression is a qualified-id, that function is called. Otherwise, its final overrider in the dynamic type of the object expression is called. <rdar://problem/12255556> llvm-svn: 163577	2012-09-11 00:31:02 +00:00
Anna Zaks	1ded453e36	[analyzer] Turn stl inlining back on. The one reported bug, which was exposed by stl inlining, is addressed in r163558. llvm-svn: 163574	2012-09-10 23:59:02 +00:00
Anna Zaks	4f9c460874	[analyzer] Do not count calls to small functions when computing stack depth. We only want to count how many substantial functions we inlined. This is an improvement to r163558. llvm-svn: 163571	2012-09-10 23:35:11 +00:00
Anna Zaks	5446f4dfb1	[analyzer] Add an option to enable/disable objc inlining. llvm-svn: 163562	2012-09-10 22:56:41 +00:00
Anna Zaks	14ce52492f	[analyzer] Add ipa-always-inline-size option (with 3 as the default). The option allows to always inline very small functions, whose size (in number of basic blocks) is set using -analyzer-config ipa-always-inline-size option. llvm-svn: 163558	2012-09-10 22:37:19 +00:00
Jordan Rose	c6fcbf06a6	[analyzer] Make the defaults explicit for each of the new config options. Also, document both new inlining options in IPA.txt. llvm-svn: 163551	2012-09-10 21:54:24 +00:00
Jordan Rose	1e0e4001c8	[analyzer] For now, don't inline C++ standard library functions. This is a (heavy-handed) solution to PR13724 -- until we know we can do a good job inlining the STL, it's best to be consistent and not generate more false positives than we did before. We can selectively whitelist certain parts of the 'std' namespace that are known to be safe. This is controlled by analyzer config option 'c++-stdlib-inlining', which can be set to "true" or "false". This commit also adds control for whether or not to inline any templated functions (member or non-member), under the config option 'c++-template-inlining'. This option is currently on by default. llvm-svn: 163548	2012-09-10 21:27:35 +00:00
Ted Kremenek	a0fa5d6564	Fix another case where we should be using isBeforeInTranslationUnit(). llvm-svn: 163533	2012-09-10 19:07:56 +00:00
Ted Kremenek	54fd079265	Add a few more cases where we should be using isBeforeInTranslationUnit(). llvm-svn: 163531	2012-09-10 19:02:33 +00:00
Ted Kremenek	6c7a5eae6d	Revert "Revert Ted's r163489 and r163490, due to breakage." I need to see how this breaks on other platforms when I fix the issue that Benjamin Kramer pointed out. This includes r163489 and r163490, plus a two line change. llvm-svn: 163512	2012-09-10 14:50:55 +00:00
NAKAMURA Takumi	6eb1399088	Revert Ted's r163489 and r163490, due to breakage. r163489, "Take another crack at stabilizing the emission order of analyzer" r163490, "Use isBeforeInTranslationUnitThan() instead of operator<." llvm-svn: 163497	2012-09-10 09:17:27 +00:00
Ted Kremenek	f1fc8ce65d	Use isBeforeInTranslationUnitThan() instead of operator<. llvm-svn: 163490	2012-09-10 06:56:07 +00:00
Ted Kremenek	3d92699d3c	Take another crack at stabilizing the emission order of analyzer diagnostics without using FoldingSetNodeIDs. This is done by doing a complete recursive comparison of the PathDiagnostics. Note that the previous method of comparing FoldingSetNodeIDs did not end up relying on unstable things such as pointer addresses, so I suspect this may still have some issues on various buildbots because I'm not sure if the true source of non-determinism has been eliminated. The tests pass for me, so the only way to know is to commit this change and see what happens. llvm-svn: 163489	2012-09-10 06:20:06 +00:00
Ted Kremenek	9b9ee2a616	Indent the "message" key in analyzer plist output. llvm-svn: 163487	2012-09-10 06:19:43 +00:00
Ted Kremenek	e9764d8f91	Remove dead method ProgramState::MarshalState(). llvm-svn: 163479	2012-09-09 14:55:59 +00:00
Ted Kremenek	e7ec4ef48d	Fix bug in BugReporter::RemoveUneededCalls() where "prunable" PathDiagnosticEventPieces were always pruned. Instead, they are suppose to only be pruned if the entire call gets pruned. llvm-svn: 163460	2012-09-08 07:18:18 +00:00
Ted Kremenek	b0d1c70258	Attempt (again) to stabilize the order of the emission of diagnostics of the analyzer by using the FullProfile() of a PathDiagnostic for ordering them. llvm-svn: 163455	2012-09-08 04:26:37 +00:00
Jordan Rose	5481cfefa6	[analyzer] ObjCSelfInitChecker should always clean up in postCall checks. ObjCSelfInitChecker stashes information in the GDM to persist it across function calls; it is stored in pre-call checks and retrieved post-call. The post-call check is supposed to clear out the stored state, but was failing to do so in cases where the call did not have a symbolic return value. This was actually causing the inappropriate cache-out from r163361. Per discussion with Anna, we should never actually cache out when assuming the receiver of an Objective-C message is non-nil, because we guarded that node generation by checking that the state has changed. Therefore, the only states that could reach this exact ExplodedNode are ones that should have merged /before/ making this assumption. r163361 has been reverted and the test case removed, since it won't actually test anything interesting now. llvm-svn: 163449	2012-09-08 01:47:28 +00:00
Ted Kremenek	1fdcfcdf13	Revert "Attempt to make the PathDiagnostic emission order more deterministic by" llvm-svn: 163446	2012-09-08 01:25:00 +00:00
Ted Kremenek	af4cc7eab1	Revert "Further tweaks to hopefully make the PathDiagnostic emission more deterministic." llvm-svn: 163445	2012-09-08 01:24:53 +00:00
Jordan Rose	5860e329a4	[analyzer] Remove constraints on dead symbols as part of removeDeadBindings. Previously, we'd just keep constraints around forever, which means we'd never be able to merge paths that differed only in constraints on dead symbols. Because we now allow constraints on symbolic expressions, not just single symbols, this requires changing SymExpr::symbol_iterator to include intermediate symbol nodes in its traversal, not just the SymbolData leaf nodes. llvm-svn: 163444	2012-09-08 01:24:53 +00:00
Jordan Rose	dd5e8c4975	[analyzer] Symbolic regions are live if any subregions are live. RegionStoreManager was only treating a SymbolicRegion's symbel as live if there was a binding referring to the region itself. No test case because constraints are currently not being cleaned out of the constraint manager at all (even if the symbol is legitimately dead). llvm-svn: 163443	2012-09-08 01:24:49 +00:00
Jordan Rose	aaf8318480	[analyzer] Cast the result of a placement new-expression to the correct type. This is necessary because further analysis will assume that the SVal's type matches the AST type. This caused a crash when trying to perform a derived-to-base cast on a C++ object that had been new'd to be another object type. Yet another crash in PR13763. llvm-svn: 163442	2012-09-08 01:24:38 +00:00
Ted Kremenek	a11a741e2f	Further tweaks to hopefully make the PathDiagnostic emission more deterministic. llvm-svn: 163430	2012-09-07 23:13:11 +00:00
Ted Kremenek	244e1d7d0f	Remove ProgramState::getSymVal(). It was being misused by Checkers, with at least one subtle bug in MacOSXKeyChainAPIChecker where the calling the method was a substitute for assuming a symbolic value was null (which is not the case). We still keep ConstraintManager::getSymVal(), but we use that as an optimization in SValBuilder and ProgramState::getSVal() to constant-fold SVals. This is only if the ConstraintManager can provide us with that information, which is no longer a requirement. As part of this, introduce a default implementation of ConstraintManager::getSymVal() which returns null. For Checkers, introduce ConstraintManager::isNull(), which queries the state to see if the symbolic value is constrained to be a null value. It does this without assuming it has been implicitly constant folded. llvm-svn: 163428	2012-09-07 22:31:01 +00:00
Ted Kremenek	334ad6ac13	Attempt to make the PathDiagnostic emission order more deterministic by looking at PathPieces. llvm-svn: 163427	2012-09-07 22:24:24 +00:00
Ted Kremenek	58ec11612c	Remove ConstraintManager:isEqual(). It is no longer used. llvm-svn: 163425	2012-09-07 22:24:18 +00:00
Jordan Rose	8dc77398a1	[analyzer] Use cast<> instead of getAs<> for a CFGElement known to be a CFGStmt. When adding the next statement to the CoreEngine's work list, we take care of all the special cases first. We certainly shouldn't be building PostStmts with null statements (the diagnostics machinery assumes such StmtPoints do not exist), and we should find out sooner if we're missing a special case. A refinement of r163402 that should help prevent further issues like PR13760. llvm-svn: 163409	2012-09-07 19:48:09 +00:00
Jordan Rose	3c2713accf	[analyzer] Don't use the address of a temporary CFGElement. GCC destroys temporary objects more aggressively than clang, so this results in incorrect behavior when compiling GCC Release builds. We could avoid this issue under C++11 by preventing getAs from being called when 'this' is an rvalue: template<class ElemTy> const ElemTy getAs() const & { ... } template<class ElemTy> const ElemTy getAs() const && = delete; Unfortunately, we do not have compatibility macros for this behavior yet. This will hopefully fix PR13760 and PR13762. llvm-svn: 163402	2012-09-07 18:36:17 +00:00
Anna Zaks	67e0062b7c	[analyzer] Explain why we need condition 8. llvm-svn: 163394	2012-09-07 16:22:09 +00:00
Ted Kremenek	891bcdb644	ExplodedGraph::shouldCollectNode() should not collect nodes for non-Expr Stmts (as this previously was the case before this was refactored). We also shouldn't need to specially handle BinaryOperators since the eagerly-assume heuristic tags such nodes. llvm-svn: 163374	2012-09-07 06:56:18 +00:00
Ted Kremenek	7c15040e98	Fix bug in ConditionBRVisitor where for C++ (and not C) we were not ignoring implicit pointer-to-boolean conversions in condition expressions. This would result in inconsistent diagnostic emission between C and C++. A consequence of this is now ConditionBRVisitor and TrackConstraintBRVisitor may emit redundant diagnostics, for example: "Assuming pointer value is null" (TrackConstraintBRVisitor) "Assuming 'p' is null" (ConditionBRVisitor) We need to reconcile the two, and perhaps prefer one over the other in some cases. llvm-svn: 163372	2012-09-07 06:51:37 +00:00
Jordan Rose	81456d9f6d	[analyzer] Fail gracefully when the dynamic type is outside the hierarchy. With some particularly evil casts, we can get an object whose dynamic type is not actually a subclass of its static type. In this case, we won't even find the statically-resolved method as a devirtualization candidate. Rather than assert that this situation cannot occur, we now simply check that the dynamic type is not an ancestor or descendent of the static type, and leave it at that. This error actually occurred analyzing LLVM: CallEventManager uses a BumpPtrAllocator to allocate a concrete subclass of CallEvent (FunctionCall), but then casts it to the actual subclass requested (such as ObjCMethodCall) to perform the constructor. Yet another crash in PR13763. llvm-svn: 163367	2012-09-07 01:19:42 +00:00
Jordan Rose	7e97996f4e	[analyzer] Don't crash if we cache out while evaluating an ObjC message. A bizarre series of coincidences led us to generate a previously-seen node in the middle of processing an Objective-C message, where we assume the receiver is non-nil. We were assuming that such an assumption would never "cache out" like this, and blithely went on using a null ExplodedNode as the predecessor for the next step in evaluation. Although the test case committed here is complicated, this could in theory happen in other ways as well, so the correct fix is just to test if the non-nil assumption results in an ExplodedNode we've seen before. <rdar://problem/12243648> llvm-svn: 163361	2012-09-06 23:44:36 +00:00
Jordan Rose	2bc9674b0a	[analyzer] Don't attempt to devirtualize calls to base class destructors. CXXDestructorCall now has a flag for when it is a base destructor call. Other kinds of destructor calls (locals, fields, temporaries, and 'delete') all behave as "whole-object" destructors and do not behave differently from one another (specifically, in these cases we /should/ try to devirtualize a call to a virtual destructor). This was causing crashes in both our internal buildbot, the crash still being tracked in PR13765, and some of the crashes being tracked in PR13763, due to a assertion failure. (The behavior under -Asserts happened to be correct anyway.) Adding this knowledge also allows our DynamicTypePropagation checker to do a bit less work; the special rules about virtual method calls during a destructor only require extra handling during base destructors. llvm-svn: 163348	2012-09-06 20:37:08 +00:00
Roman Divacky	e637711ae0	Dont cast away const needlessly. Found by gcc48 -Wcast-qual. llvm-svn: 163325	2012-09-06 15:59:27 +00:00
Anna Zaks	3245e584db	[analyzer] Enhance the member expr tracking to account for references. As per Jordan's suggestion. (Came out of code review for r163261.) llvm-svn: 163269	2012-09-05 23:41:54 +00:00
Jordan Rose	6d671cc34a	[analyzer] Always include destructors in the analysis CFG. While destructors will continue to not be inlined (unless the analyzer config option 'c++-inlining' is set to 'destructors'), leaving them out of the CFG is an incomplete model of the behavior of an object, and can cause false positive warnings (like PR13751, now working). Destructors for temporaries are still not on by default, since (a) we haven't actually checked this code to be sure it's fully correct (in particular, we probably need to be very careful with regard to lifetime-extension when a temporary is bound to a reference, C++11 [class.temporary]p5), and (b) ExprEngine doesn't actually do anything when it sees a temporary destructor in the CFG -- not even invalidate the object region. To enable temporary destructors, set the 'cfg-temporary-dtors' analyzer config option to '1'. The old -cfg-add-implicit-dtors cc1 option, which controlled all implicit destructors, has been removed. llvm-svn: 163264	2012-09-05 22:55:23 +00:00
Anna Zaks	e5cb4981d0	[analyzer] Fix a crash PR13762. llvm-svn: 163262	2012-09-05 22:31:58 +00:00
Anna Zaks	b4b2b57ee0	[analyzer] NullOrUndef diagnostics: track symbols binded to regions. If a region is binded to a symbolic value, we should track the symbol. (The code I changed was not previously exercised by the regression tests.) llvm-svn: 163261	2012-09-05 22:31:55 +00:00
Jordan Rose	fcdda36149	[analyzer] Be more forgiving about calling methods on struct rvalues. The problem is that the value of 'this' in a C++ member function call should always be a region (or NULL). However, if the object is an rvalue, it has no associated region (only a conjured symbol or LazyCompoundVal). For now, we handle this in two ways: 1) Actually respect MaterializeTemporaryExpr. Before, it was relying on CXXConstructExpr to create temporary regions for all struct values. Now it just does the right thing: if the value is not in a temporary region, create one. 2) Have CallEvent recognize the case where its 'this' pointer is a non-region, and just return UnknownVal to keep from confusing clients. The long-term problem is being tracked internally in <rdar://problem/12137950>, but this makes many test cases pass. llvm-svn: 163220	2012-09-05 17:11:26 +00:00
Jordan Rose	d1a08b6e43	[analyzer] Clean up a couple uses of getPointeeType(). No intended functionality change. llvm-svn: 163219	2012-09-05 17:11:22 +00:00
Jordan Rose	bc009d4493	Revert "[analyzer] Treat all struct values as regions (even rvalues)." This turned out to have many implications, but what eventually seemed to make it unworkable was the fact that we can get struct values (as LazyCompoundVals) from other places besides return-by-value function calls; that is, we weren't actually able to "treat all struct values as regions" consistently across the entire analyzer core. Hopefully we'll be able to come up with an alternate solution soon. This reverts r163066 / 02df4f0aef142f00d4637cd851e54da2a123ca8e. llvm-svn: 163218	2012-09-05 17:11:15 +00:00
Jordan Rose	7523d1a847	[analyzer] Don't use makeIntVal to create a floating-point value. SimpleSValBuilder processes a couple trivial identities, including 'x - x' and 'x ^ x' (both 0). However, the former could appear with arguments of floating-point type, and we weren't checking for that. This started triggering an assert with r163069, which checks that a constant value is actually going to be used as an integer or pointer. llvm-svn: 163159	2012-09-04 19:34:58 +00:00
Joao Matos	566359c0bf	Revert r163083 per chandlerc's request. llvm-svn: 163149	2012-09-04 17:49:35 +00:00
Joao Matos	c32a7e4d8e	Implemented parsing and AST support for the MS __leave exception statement. Also a minor fix to __except printing in StmtPrinter.cpp. Thanks to Aaron Ballman for review. llvm-svn: 163083	2012-09-02 03:45:41 +00:00
Jordan Rose	d229e39a9a	[analyzer] Silence unused variable warnings in NDEBUG builds. No functionality change. llvm-svn: 163073	2012-09-01 19:15:13 +00:00
Jordan Rose	21580c2f92	[analyzer] Disallow creation of int vals with explicit bit width / signedness. All clients of BasicValueFactory should be using QualTypes instead, and indeed it seems they are. This caught the (fortunately harmless) bug fixed in the previous commit. No intended functionality change. llvm-svn: 163069	2012-09-01 17:39:24 +00:00
Jordan Rose	a44ad1b35c	[analyzer] Don't attempt to create a floating-point value of "1" for ++/--. The current logic would actually create a float- or double-sized signed integer value of 1, which is not at all the same. No test because the value would be swallowed by an Unknown as soon as it gets added or subtracted to the original value, but it enables the cleanup in the next patch. llvm-svn: 163068	2012-09-01 17:39:17 +00:00
Jordan Rose	82ae9898ef	[analyzer] Treat all struct values as regions (even rvalues). This allows us to correctly symbolicate the fields of structs returned by value, as well as get the proper 'this' value for when methods are called on structs returned by value. This does require a moderately ugly hack in the StoreManager: if we assign a "struct value" to a struct region, that now appears as a Loc value being bound to a region of struct type. We handle this by simply "dereferencing" the struct value region, which should create a LazyCompoundVal. This should fix recent crashes analyzing LLVM and on our internal buildbot. <rdar://problem/12137950> llvm-svn: 163066	2012-09-01 17:39:09 +00:00
Jordan Rose	2da564380a	[analyzer] Always derive a CallEvent's return type from its origin expr. Previously, we preferred to get a result type by looking at the callee's declared result type. This allowed us to handlereferences, which are represented in the AST as lvalues of their pointee type. (That is, a call to a function returning 'int &' has type 'int' and value kind 'lvalue'.) However, this results in us preferring the original type of a function over a casted type. This is a problem when a function pointer is casted to another type, because the conjured result value will have the wrong type. AdjustedReturnValueChecker is supposed to handle this, but still doesn't handle the case where there is no "original function" at all, i.e. where the callee is unknown. Now, we instead look at the call expression's value kind (lvalue, xvalue, or prvalue), and adjust the expr's type accordingly. This will have no effect when the function is inlined, and will conjure the value that will actually be used when it is not. This makes AdjustedReturnValueChecker /nearly/ unnecessary; unfortunately, the cases where it would still be useful are where we need to cast the result of an inlined function or a checker-evaluated function, and in these cases we don't know what we're casting /from/ by the time we can do post- call checks. In light of that, remove AdjustedReturnValueChecker, which was already not checking quite a few calls. llvm-svn: 163065	2012-09-01 17:39:00 +00:00
Ted Kremenek	cdf814900d	Split library clangRewrite into clangRewriteCore and clangRewriteFrontend. This is similar to how we divide up the StaticAnalyzer libraries to separate core functionality to what is clearly associated with Frontend actions. llvm-svn: 163050	2012-09-01 05:09:24 +00:00
Jordan Rose	219c9d0dd3	[analyzer] Though C++ inlining is enabled, don't inline ctors and dtors. More generally, this adds a new configuration option 'c++-inlining', which controls which C++ member functions can be considered for inlining. This uses the new -analyzer-config table, so the cc1 arguments will look like this: ... -analyzer-config c++-inlining=[none\|methods\|constructors\|destructors] Note that each mode implies that all the previous member function kinds will be inlined as well; it doesn't make sense to inline destructors without inlining constructors, for example. The default mode is 'methods'. llvm-svn: 163004	2012-08-31 17:06:49 +00:00
Jordan Rose	cc0b1bfa56	[analyzer] Ensure that PathDiagnostics profile the same regardless of path. PathDiagnostics are actually profiled and uniqued independently of the path on which the bug occurred. This is used to merge diagnostics that refer to the same issue along different paths, as well as by the plist diagnostics to reference files created by the HTML diagnostics. However, there are two problems with the current implementation: 1) The bug description is included in the profile, but some PathDiagnosticConsumers prefer abbreviated descriptions and some prefer verbose descriptions. Fixed by including both descriptions in the PathDiagnostic objects and always using the verbose one in the profile. 2) The "minimal" path generation scheme provides extra information about which events came from macros that the "extensive" scheme does not. This resulted not only in different locations for the plist and HTML diagnostics, but also in diagnostics being uniqued in the plist output but not in the HTML output. Fixed by storing the "end path" location explicitly in the PathDiagnostic object, rather than trying to find the last piece of the path when the diagnostic is requested. This should hopefully finish unsticking our internal buildbot. llvm-svn: 162965	2012-08-31 00:36:26 +00:00
Jordan Rose	7444f5d826	[analyzer] Fix a crash in plist-html generation introduced in r162939. Basically, do the correct thing to fix the XML generation error, rather than making it even worse by unilaterally dereferencing a null pointer. llvm-svn: 162964	2012-08-31 00:36:20 +00:00
Eli Friedman	34866c7719	Change the representation of builtin functions in the AST (__builtin_* etc.) so that it isn't possible to take their address. Specifically, introduce a new type to represent a reference to a builtin function, and a new cast kind to convert it to a function pointer in the operand of a call. Fixes PR13195. llvm-svn: 162962	2012-08-31 00:14:07 +00:00
Anna Zaks	a8017eca1a	[analyzer] Refactor the logic that determines if a functions should be reanalyzed. The policy on what to reanalyze should be in AnalysisConsumer with the rest of visitation order logic. There is no reason why ExprEngine needs to pass the Visited set to CoreEngine, it can populate it itself. llvm-svn: 162957	2012-08-30 23:42:02 +00:00
Jordan Rose	03fac27bab	[analyzer] Plist diagnostics: Fix a case where we fail to close an XML tag. If the current path diagnostic does /not/ have files associated with it, we were simply skipping on to the next diagnostic with 'continue'. But that also skipped the close tag for the diagnostic's <dict> node. Part of fixing our internal analyzer buildbot. llvm-svn: 162939	2012-08-30 20:43:09 +00:00
Ted Kremenek	efca7a7e1b	Rename 'MaxLoop' to 'maxBlockVisitOnPath' to reflect reality. We should consider renaming the command line option as well. llvm-svn: 162932	2012-08-30 19:26:56 +00:00
Ted Kremenek	6f5131f149	Rename AnalyzerOptions 'EagerlyAssume' to 'eagerlyAssumeBinOpBifurcation'. llvm-svn: 162930	2012-08-30 19:26:48 +00:00
Ted Kremenek	8756c4a1a9	Store const& to AnalyzerOptions in AnalysisManager instead of copying individual flags. llvm-svn: 162929	2012-08-30 19:26:43 +00:00
Anna Zaks	07a821fb17	[analyzer] Fixup 162863. Thanks Jordan. llvm-svn: 162875	2012-08-29 23:23:39 +00:00
Anna Zaks	5d4ec36323	[analyzer] Improved diagnostic pruning for calls initializing values. This heuristic addresses the case when a pointer (or ref) is passed to a function, which initializes the variable (or sets it to something other than '0'). On the branch where the inlined function does not set the value, we report use of undefined value (or NULL pointer dereference). The access happens in the caller and the path through the callee would get pruned away with regular path pruning. To solve this issue, we previously disabled diagnostic pruning completely on undefined and null pointer dereference checks, which entailed very verbose diagnostics in most cases. Furthermore, not all of the undef value checks had the diagnostic pruning disabled. This patch implements the following heuristic: if we pass a pointer (or ref) to the region (on which the error is reported) into a function and it's value is either undef or 'NULL' (and is a pointer), do not prune the function. llvm-svn: 162863	2012-08-29 21:22:37 +00:00
Ted Kremenek	fb5351eed3	Add new -cc1 driver option -analyzer-config, which allows one to specify a comma separated collection of key:value pairs (which are strings). This allows a general way to provide analyzer configuration data from the command line. No clients yet. llvm-svn: 162827	2012-08-29 05:55:00 +00:00
Jordan Rose	8d48938bf3	[analyzer] Teach CallEventManager that CXXTemporaryObjectExpr is also a ctor. Specifically, CallEventManager::getCaller was looking at the call site for an inlined call and trying to see what kind of call it was, but it only checked for CXXConstructExprClass. (It's not using an isa<> here to avoid doing three more checks on the the statement class.) This caused an unreachable when we actually did inline the constructor of a temporary object. PR13717 llvm-svn: 162792	2012-08-28 20:52:21 +00:00
Jordan Rose	2be6e30d96	[analyzer] When we look for the last stmt in a function, skip implicit dtors. When exiting a function, the analyzer looks for the last statement in the function to see if it's a return statement (and thus bind the return value). However, the search for "the last statement" was accepting statements that were in implicitly-generated inlined functions (i.e. destructors). So we'd go and get the statement from the destructor, and then say "oh look, this function had no explicit return...guess there's no return value". And /that/ led to the value being returned being declared dead, and all our leak checkers complaining. llvm-svn: 162791	2012-08-28 20:52:13 +00:00
Jordan Rose	595c131460	[analyzer] Don't purge dead symbols at the end of calls if -analyzer-purge=none. No test case since this is a debug option that we will never turn on by default since it makes the leak checkers much less useful. (We'll only report leaks at the end of analysis if -analyzer-purge=none.) llvm-svn: 162772	2012-08-28 18:16:45 +00:00
Jordan Rose	a0f7d35afe	[analyzer] Rename addTrackNullOrUndefValueVisitor to trackNullOrUndefValue. This helper function (in the clang::ento::bugreporter namespace) may add more than one visitor, but conceptually it's tracking a single use of a null or undefined value and should do so as best it can. Also, the BugReport parameter has been made a reference to underscore that it is non-optional. llvm-svn: 162720	2012-08-28 00:50:51 +00:00
Jordan Rose	72c5515bab	[analyzer] Refactor FindLastStoreBRVisitor to not find the store ahead of time. As Anna pointed out to me offline, it's a little silly to walk backwards through the graph to find the store site when BugReporter will do the exact same walk as part of path diagnostic generation. llvm-svn: 162719	2012-08-28 00:50:45 +00:00
Jordan Rose	5090904d6c	[analyzer] If the last store into a region came from a function, step into it. Previously, if we were tracking stores to a variable 'x', and came across this: x = foo(); ...we would simply emit a note here and stop. Now, we'll step into 'foo' and continue tracking the returned value from there. <rdar://problem/12114689> llvm-svn: 162718	2012-08-28 00:50:42 +00:00
Jordan Rose	e537cc05f5	[analyzer] Rename CallEvent::mayBeInlined to CallEvent::isCallStmt. The two callers are using this in order to be conservative, so let's just clarify the information that's actually being provided here. This is not related to inlining decisions in any way. No functionality change. llvm-svn: 162717	2012-08-28 00:50:38 +00:00
Jordan Rose	1a61674f5a	[analyzer] Look through casts when trying to track a null pointer dereference. Also, add comments to addTrackNullOrUndefValueVisitor. Thanks for the review, Anna! llvm-svn: 162695	2012-08-27 20:18:30 +00:00
Jordan Rose	561919e5bd	[analyzer] Don't inline constructors for objects allocated with operator new. Because the CXXNewExpr appears after the CXXConstructExpr in the CFG, we don't actually have the correct region to construct into at the time we decide whether or not to inline. The long-term fix (discussed in PR12014) might be to introduce a new CFG node (CFGAllocator) that appears before the constructor. Tracking the short-term fix in <rdar://problem/12180598>. llvm-svn: 162689	2012-08-27 18:39:22 +00:00
Anna Zaks	7d2babc046	[analyzer] More internal stats collection. llvm-svn: 162687	2012-08-27 18:38:32 +00:00
Jordan Rose	c93183042f	[analyzer] Inline constructors for any object with a trivial destructor. This allows us to better reason about status objects, like Clang's own llvm::Optional (when its contents are trivially destructible), which are often intended to be passed around by value. We still don't inline constructors for temporaries in the general case. <rdar://problem/11986434> llvm-svn: 162681	2012-08-27 17:50:07 +00:00
Jordan Rose	0a0aa84da3	[analyzer] Use the common evalBind infrastructure for initializers. This allows checkers (like the MallocChecker) to process the effects of the bind. Previously, using a memory-allocating function (like strdup()) in an initializer would result in a leak warning. This does bend the expectations of checkBind a bit; since there is no assignment expression, the statement being used is the initializer value. In most cases this shouldn't matter because we'll use a PostInitializer program point (rather than PostStmt) for any checker-generated nodes, though we /will/ generate a PostStore node referencing the internal statement. (In theory this could have funny effects if someone actually does an assignment within an initializer; in practice, that seems like it would be very rare.) <rdar://problem/12171711> llvm-svn: 162637	2012-08-25 01:06:23 +00:00
Chad Rosier	de70e0ef45	[ms-inline asm] As part of a larger refactoring, rename AsmStmt to GCCAsmStmt. No functional change intended. llvm-svn: 162632	2012-08-25 00:11:56 +00:00
Ted Kremenek	5bc38bad73	Rework how PathDiagnosticConsumers pass knowledge of what files they generated for a given diagnostic to another. Because PathDiagnostics are specific to a give PathDiagnosticConsumer, store in a FoldingSet a unique hash for a PathDiagnostic (that will be the same for the same bug for different PathDiagnosticConsumers) that stores a list of files generated. This can then be read by the other PathDiagnosticConsumers. This fixes breakage in the PLIST-HTML output. llvm-svn: 162580	2012-08-24 19:35:19 +00:00
Jordan Rose	51c27163c0	[analyzer] If we dereference a NULL that came from a function, show the return. More generally, any time we try to track where a null value came from, we should show if it came from a function. This usually isn't necessary if the value is symbolic, but if the value is just a constant we previously just ignored its origin entirely. Now, we'll step into the function and recursively add a visitor to the returned expression. <rdar://problem/12114609> llvm-svn: 162563	2012-08-24 16:34:31 +00:00
Anna Zaks	3d5d3d3e2c	[analyzer] Make analyzer less aggressive when dealing with [self init]. With inlining, retain count checker starts tracking 'self' through the init methods. The analyser results were too noisy if the developer did not follow 'self = [super init]' pattern (which is common especially in older code bases) - we reported self init anti-pattern AND possible use-after-free. This patch teaches the retain count checker to assume that [super init] does not fail when it's not consumed by another expression. This silences the retain count warning that warns about possibility of use-after-free when init fails, while preserving all the other checking on 'self'. llvm-svn: 162508	2012-08-24 00:06:12 +00:00
Jordan Rose	434f132060	[analyzer] For now, treat pointers-to-members as non-null void * symbols. Until we have full support for pointers-to-members, we can at least approximate some of their use by tracking null and non-null values. We thus treat &A::m_ptr as a non-null void * symbol, and MemberPointer(0) as a pointer-sized null constant. This enables support for what is sometimes called the "safe bool" idiom, demonstrated in the test case. llvm-svn: 162495	2012-08-23 23:01:43 +00:00
Jordan Rose	081af085eb	[analyzer] Handle UserDefinedConversion casts in C++. This is trivial; the UserDefinedConversion always wraps a CXXMemberCallExpr for the appropriate conversion function, so it's just a matter of propagating that value to the CastExpr itself. llvm-svn: 162494	2012-08-23 23:01:39 +00:00
Jordan Rose	e5d5393efc	[analyzer] Support C++ default arguments if they are literal values. A CXXDefaultArgExpr wraps an Expr owned by a ParmVarDecl belonging to the called function. In general, ExprEngine and Environment ought to treat this like a ParenExpr or other transparent wrapper expression, with the inside expression evaluated first. However, if we call the same function twice, we'd produce a CFG that contains the same wrapped expression twice, and we're not set up to handle that. I've added a FIXME to the CFG builder to come back to that, but meanwhile we can at least handle expressions that don't need to be explicitly evaluated: literals. This probably handles many common uses of default parameters: true/false, null, etc. Part of PR13385 / <rdar://problem/12156507> llvm-svn: 162453	2012-08-23 18:10:53 +00:00
Richard Smith	802c4b7015	Fix undefined behavior: member function calls where 'this' is a null pointer. llvm-svn: 162430	2012-08-23 06:16:52 +00:00
Ted Kremenek	78094caa56	Fix an assortment of doxygen comment issues found by -Wdocumentation. llvm-svn: 162412	2012-08-22 23:50:41 +00:00
Ted Kremenek	326702f1a1	Despite me asking Jordan to do r162313, revert it. We can provide another way to whitelist these special cases. This is an intermediate patch. llvm-svn: 162386	2012-08-22 19:58:20 +00:00
Ted Kremenek	a056d62961	Remove BasicConstraintManager. It hasn't been in active service for a while. As part of this change, I discovered that a few of our tests were not testing the RangeConstraintManager. Luckily all of those passed when I moved them over to use that constraint manager. llvm-svn: 162384	2012-08-22 19:47:13 +00:00
Ted Kremenek	6269888166	Rename 'unbindLoc()' (in ProgramState) and 'Remove()' to 'killBinding()'. The name is more specific, and one just forwarded to the other. Add some doxygen comments along the way. llvm-svn: 162350	2012-08-22 06:37:46 +00:00
Ted Kremenek	d94854a42e	Rename 'currentX' to 'currX' throughout analyzer and libAnalysis. Also rename 'getCurrentBlockCounter()' to 'blockCount()'. This ripples a bunch of code simplifications; mostly aesthetic, but makes the code a bit tighter. llvm-svn: 162349	2012-08-22 06:26:15 +00:00
Ted Kremenek	d227833cba	Rename 'getConjuredSymbol' to 'conjureSymbol'. No need to have the "get", the word "conjure" is a verb too! Getting a conjured symbol is the same as conjuring one up. This shortening is largely cosmetic, but just this simple changed cleaned up a handful of lines, making them less verbose. llvm-svn: 162348	2012-08-22 06:26:06 +00:00
Ted Kremenek	1afcb7442f	Remove Store::bindDecl() and Store::bindDeclWithNoInit(), and all forwarding methods. This functionality is already covered by bindLoc(). llvm-svn: 162346	2012-08-22 06:00:18 +00:00
Ted Kremenek	2cd56c4c6e	Rename 'BindCompoundLiteral' to 'bindCompoundLiteral' and add doxygen comments. llvm-svn: 162345	2012-08-22 06:00:12 +00:00
Ted Kremenek	34d39287b5	Consilidate SmallPtrSet count() followed by insert() into a single insert(). llvm-svn: 162330	2012-08-22 00:02:08 +00:00
Matt Beaumont-Gay	64621ea530	Add an llvm_unreachable to pacify GCC's -Wreturn-type. llvm-svn: 162325	2012-08-21 22:27:18 +00:00
Jordan Rose	e3e95cdf27	[analyzer] Set the default IPA mode to 'basic-inlining', which excludes C++. Under -analyzer-ipa=basic-inlining, only C functions, blocks, and C++ static member functions are inlined -- essentially, the calls that behave like simple C function calls. This is essentially the behavior in Xcode 4.4. C++ support still has some rough edges, and we don't want users to be worried about them if they download and run their own checker. (In particular, the massive number of false positives for analyzing LLVM comes from inlining defensively-written code in contexts where more aggressive assumptions are implicitly made. This problem is not unique to C++, but it is exacerbated by the higher proportion of code that lives in header files in C++.) The eventual goal is to be comfortable enough with C++ support (and simple Objective-C support) to advance to -analyzer-ipa=inlining as the default behavior. See the IPA design notes for more details. llvm-svn: 162318	2012-08-21 21:44:21 +00:00
Jordan Rose	81125c4497	[analyzer] Push "references are non-null" knowledge up to the common parent. This reduces duplication across the Basic and Range constraint managers, and keeps their internals free of dealing with the semantics of C++. It's still a little unfortunate that the constraint manager is dealing with this at all, but this is pretty much the only place to put it so that it will apply to all symbolic values, even when embedded in larger expressions. llvm-svn: 162313	2012-08-21 20:52:19 +00:00
Jordan Rose	075d5d2e99	[analyzer] Assume that reference symbols are non-null. By doing this in the constraint managers, we can ensure that ANY reference whose value we don't know gets the effect, even if it's not a top-level parameter. llvm-svn: 162246	2012-08-21 00:27:33 +00:00
Jordan Rose	2b10f3f8a9	[analyzer] Add comments to ExplodedNode::NodeGroup. No functionality change. llvm-svn: 162216	2012-08-20 18:59:46 +00:00
Jordan Rose	4b4613cbec	[analyzer] Replace boolean IsSink parameters with 'generateSink' methods. Generating a sink is significantly different behavior from generating a normal node, and a simple boolean parameter can be rather opaque. Per offline discussion with Anna, adding new generation methods is the clearest way to communicate intent. No functionality change. llvm-svn: 162215	2012-08-20 18:43:42 +00:00
Jordan Rose	0a9ea7c70d	[analyzer] The result of && or \|\| is always a 1 or 0. Forgetting to at least cast the result was giving us Loc/NonLoc problems in SValBuilder (hitting an assertion). But the standard (both C and C++) does actually guarantee that && and \|\| will result in the actual values 1 and 0, typed as 'int' in C and 'bool' in C++, and we can easily model that. PR13461 llvm-svn: 162209	2012-08-20 17:04:45 +00:00
Jordan Rose	a4309c941c	[analyzer] Treat C++ 'throw' as a sink. Our current handling of 'throw' is all CFG-based: it jumps to a 'catch' block if there is one and the function exit block if not. But this doesn't really get the right behavior when a function is inlined: execution will continue on the caller's side, which is always the wrong thing to do. Even within a single function, 'throw' completely skips any destructors that are to be run. This is essentially the same problem as @finally -- a CFGBlock that can have multiple entry points, whose exit points depend on whether it was entered normally or exceptionally. Representing 'throw' as a sink matches our current (non-)handling of @throw. It's not a perfect solution, but it's better than continuing analysis in an inconsistent or even impossible state. <rdar://problem/12113713> llvm-svn: 162157	2012-08-18 00:30:23 +00:00
Jordan Rose	a97a99736e	[analyzer] Treat @throw as a sink (stop processing). The CFG approximates @throw as a return statement, but that's not good enough in inlined functions. Moreover, since Objective-C exceptions are usually considered fatal, we should be suppressing leak warnings like we do for calls to noreturn functions (like abort()). The comments indicate that we were probably intending to do this all along; it may have been inadvertantly changed during a refactor at one point. llvm-svn: 162156	2012-08-18 00:30:20 +00:00
Jordan Rose	80547386b8	[analyzer] Use PointerUnion to implement ExplodedNode::NodeGroup. We shouldn't be reinventing our own wheels. This also paves the way for marking different kinds of sinks. No functionality change. llvm-svn: 162154	2012-08-18 00:30:10 +00:00
Ted Kremenek	9dcf671d13	Remove #if 0 that has been around for a long time. llvm-svn: 162030	2012-08-16 17:45:32 +00:00
Ted Kremenek	1e60273eed	Remove "range_iterator" from PathDiagnosticPiece and just use ArrayRef<SourceRange> for ranges. This removes conceptual clutter, and can allow us to easy migrate to C++11 style for-range loops if we ever move to using C++11 in Clang. llvm-svn: 162029	2012-08-16 17:45:29 +00:00
Ted Kremenek	9bf9af92a4	Allow multiple PathDiagnosticConsumers to be used with a BugReporter at the same time. This fixes several issues: - removes egregious hack where PlistDiagnosticConsumer would forward to HTMLDiagnosticConsumer, but diagnostics wouldn't be generated consistently in the same way if PlistDiagnosticConsumer was used by itself. - emitting diagnostics to the terminal (using clang's diagnostic machinery) is no longer a special case, just another PathDiagnosticConsumer. This also magically resolved some duplicate warnings, as we now use PathDiagnosticConsumer's diagnostic pruning, which has scope for the entire translation unit, not just the scope of a BugReporter (which is limited to a particular ExprEngine). As an interesting side-effect, diagnostics emitted to the terminal also have their trailing "." stripped, just like with diagnostics emitted to plists and HTML. This required some tests to be updated, but now the tests have higher fidelity with what users will see. There are some inefficiencies in this patch. We currently generate the report graph (from the ExplodedGraph) once per PathDiagnosticConsumer, which is a bit wasteful, but that could be pulled up higher in the logic stack. There is some intended duplication, however, as we now generate different PathDiagnostics (for the same issue) for different PathDiagnosticConsumers. This is necessary to produce the diagnostics that a particular consumer expects. llvm-svn: 162028	2012-08-16 17:45:23 +00:00
Richard Smith	235341bc88	Store SourceManager pointer on PrintingPolicy in the case where we're dumping, and remove ASTContext reference (which was frequently bound to a dereferenced null pointer) from the recursive lump of printPretty functions. In so doing, fix (at least) one case where we intended to use the 'dump' mode, but that failed because a null ASTContext reference had been passed in. llvm-svn: 162011	2012-08-16 03:56:14 +00:00
Jordan Rose	6ee44e1f03	[analyzer] Look through all casts when trying to track constraints. Previously, we were losing path notes (in both text and plist form) because the interesting DeclRefExpr was buried in a cast. llvm-svn: 161999	2012-08-16 00:03:33 +00:00
Jordan Rose	e9753b0640	[analyzer] Even if we are not inlining a virtual call, still invalidate! Fixes a mistake introduced in r161916. llvm-svn: 161987	2012-08-15 21:05:15 +00:00
Jordan Rose	5fc5da0578	[analyzer] Correctly devirtualize virtual method calls in constructors. This is the other half of C++11 [class.cdtor]p4 (the destructor side was added in r161915). This also fixes an issue with post-call checks where the 'this' value was already being cleaned out of the state, thus being omitted from a reconstructed CXXConstructorCall. llvm-svn: 161981	2012-08-15 20:07:17 +00:00
Jordan Rose	9910720851	[analyzer] Don't try to devirtualize if the class is incomplete. A similar issue to the previous commit, introduced by r161915. llvm-svn: 161961	2012-08-15 17:33:37 +00:00
Jordan Rose	31c3fa9c24	[analyzer] Only adjust the type of 'this' when we devirtualize a method call. With reinterpret_cast, we can get completely unrelated types in a region hierarchy together; this was resulting in CXXBaseObjectRegions being layered directly on an (untyped) SymbolicRegion, whose symbol was from a completely different type hierarchy. This was what was causing the internal buildbot to fail. Reverts r161911, which merely masked the problem. llvm-svn: 161960	2012-08-15 17:33:34 +00:00
Jordan Rose	5132aaeb04	[analyzer] Don't inline dynamic-dispatch methods unless -analyzer-ipa=dynamic. Previously we were checking -analyzer-ipa=dynamic-bifurcate only, and unconditionally inlining everything else that had an available definition, even under -analyzer-ipa=inlining (but not under -analyzer-ipa=none). llvm-svn: 161916	2012-08-15 00:52:00 +00:00
Jordan Rose	0f6d63be06	[analyzer] Correctly devirtualize virtual method calls in destructors. C++11 [class.cdtor]p4: When a virtual function is called directly or indirectly from a constructor or from a destructor, including during the construction or destruction of the class’s non-static data members, and the object to which the call applies is the object under construction or destruction, the function called is the final overrider in the constructor's or destructor's class and not one overriding it in a more-derived class. llvm-svn: 161915	2012-08-15 00:51:56 +00:00
Jordan Rose	95c841eaa0	[analyzer] A base class needs a complete definition to provide offsets. No test case yet; trying to reduce one from a failing internal buildbot. llvm-svn: 161911	2012-08-15 00:36:44 +00:00
Anna Zaks	6ddb6b1a9a	[analyzer]Assume that the properties cannot be overridden when dot syntax is used. llvm-svn: 161889	2012-08-14 19:19:18 +00:00
Benjamin Kramer	9299d8c298	Do NOT use inline functions with LLVM_ATTRIBUTE_USED. The function will be emitted into every single TU including the header! llvm-svn: 161872	2012-08-14 14:50:32 +00:00
Jordan Rose	e521f93225	[analyzer] Look up DynamicTypeInfo by region instead of symbol. This allows us to store type info for non-symbolic regions. No functionality change. llvm-svn: 161811	2012-08-13 23:59:07 +00:00
Jordan Rose	ce6c99a559	[analyzer] Reduce code duplication: make CXXDestructorCall a CXXInstanceCall. While there is now some duplication between SimpleCall and the CXXInstanceCall sub-hierarchy, this is much better than copy-and-pasting the devirtualization logic shared by both instance methods and destructors. An unfortunate side effect is that there is no longer a single CallEvent type that corresponds to "calls written as CallExprs". For the most part this is a good thing, but the checker callback eval::Call still takes a CallExpr rather than a CallEvent (since we're not sure if we want to allow checkers to evaluate other kinds of calls). A mistake here will be caught by a cast<> in CheckerManager::runCheckersForEvalCall. No functionality change. llvm-svn: 161809	2012-08-13 23:46:05 +00:00
Jordan Rose	710f6b1259	[analyzer] Be more careful when downcasting for devirtualization. Virtual base regions are never layered, so simply stripping them off won't necessarily get you to the correct casted class. Instead, what we want is the same logic for evaluating dynamic_cast: strip off base regions if possible, but add new base regions if necessary. llvm-svn: 161808	2012-08-13 23:46:01 +00:00
Jordan Rose	574ef152fc	[analyzer] Handle dynamic_casts that turn out to be upcasts. This can occur with multiple inheritance, which jumps from one parent to the other, and with virtual inheritance, since virtual base regions always wrap the actual object and can't be nested within other base regions. This also exposed some incorrect logic for multiple inheritance: even if B is known not to derive from C, D might still derive from both of them. llvm-svn: 161798	2012-08-13 22:11:42 +00:00
Jordan Rose	07a7ed80cb	[analyzer] Don't strip CXXBaseObjectRegions when checking dynamic_casts. ...and /do/ strip CXXBaseObjectRegions when casting to a virtual base class. This allows us to enforce the invariant that a CXXBaseObjectRegion can always provide an offset for its base region if its base region has a known class type, by only allowing virtual bases and direct non-virtual bases to form CXXBaseObjectRegions. This does mean some slight problems for our modeling of dynamic_cast, which needs to be resolved by finding a path from the current region to the class we're trying to cast to. llvm-svn: 161797	2012-08-13 22:11:34 +00:00
Jordan Rose	02e5309b35	[analyzer] Strip CXXBaseObjectRegions when devirtualizing method calls. This was causing a crash when we tried to re-apply a base object region to itself. It probably also caused incorrect offset calculations in RegionStore. PR13569 / <rdar://problem/12076683> llvm-svn: 161710	2012-08-10 22:26:46 +00:00
Jordan Rose	51bcb226a2	[analyzer] Try to devirtualize even if the static callee has no definition. This mostly affects pure virtual methods, but would also affect parent methods defined inline in the header when analyzing the child's source file. llvm-svn: 161709	2012-08-10 22:26:43 +00:00
Anna Zaks	75f49a9c07	[analyzer] Track if a region can be a subclass in the dynamic type info. When object is allocated with alloc or init, we assume it cannot be a subclass (currently used only for bifurcation purposes). llvm-svn: 161682	2012-08-10 18:55:58 +00:00
Anna Zaks	920af014c1	[analyzer] Optimize dynamic dispatch bifurcation by detecting the cases when we don't need to split. In some cases we know that a method cannot have a different implementation in a subclass: - the class is declared in the main file (private) - all the method declarations (including the ones coming from super classes) are in the main file. This can be improved further, but might be enough for the heuristic. (When we are too aggressive splitting the state, efficiency suffers. When we fail to split the state coverage might suffer.) llvm-svn: 161681	2012-08-10 18:55:53 +00:00
Benjamin Kramer	3a913ed805	Fix a couple of pedantic gcc warnings. llvm-svn: 161656	2012-08-10 10:06:13 +00:00
Jordan Rose	637ff0cc0f	[analyzer] Merge RegionStore's KillStruct and CopyLazyBindings: BindAggregate. Both methods need to clear out existing bindings and provide a new default binding. Originally KillStruct always provided UnknownVal as the default, but it's allowed symbolic values for quite some time (for handling returned structs in C). No functionality change. llvm-svn: 161637	2012-08-09 22:55:54 +00:00
Jordan Rose	a44a55a8f2	[analyzer] Cluster bindings in RegionStore by base region. This should speed up activities that need to access bindings by cluster, such as invalidation and dead-bindings cleaning. In some cases all we save is the cost of building the region cluster map, but other times we can actually avoid traversing the rest of the store. In casual testing, this produced a speedup of nearly 10% analyzing SQLite, with /less/ memory used. llvm-svn: 161636	2012-08-09 22:55:51 +00:00
Jordan Rose	c91e01bc11	[analyzer] Cache the "concrete offset base" for regions with symbolic offsets. This makes it faster to access and invalidate bindings with symbolic offsets by only computing this information once. No intended functionality change. llvm-svn: 161635	2012-08-09 22:55:37 +00:00
Jordan Rose	996d309fb7	[analyzer] A CXXBaseObjectRegion should correspond to a DIRECT base. An ASTContext's RecordLayoutInfo can only be used to look up offsets of direct base classes, and we need the offset to make non-symbolic bindings in RegionStore. This change makes sure that we have one layer of CXXBaseObjectRegion for each base we are casting through. This was causing crashes on an internal buildbot. llvm-svn: 161621	2012-08-09 21:24:02 +00:00
Anna Zaks	a0105b2320	[analyzer] Rename the function to better reflect what it actually does. llvm-svn: 161617	2012-08-09 21:02:45 +00:00
Anna Zaks	8d1f1f3b06	[analyzer] Clarify the values in Dyn. Dispatch Bifurcation map. llvm-svn: 161616	2012-08-09 21:02:41 +00:00
Anna Zaks	85383182ec	[analyzer] Improve readability of the dyn. dispatch bifurcation patch r161552. As per Jordan's feedback. llvm-svn: 161603	2012-08-09 18:43:00 +00:00
Anna Zaks	bc6d0ccf92	Unbreak the build. Declaring "const Decl *Decl" is not a good idea. llvm-svn: 161567	2012-08-09 02:57:02 +00:00
Anna Zaks	123af098b8	[analyzer] Bifurcate the path with dynamic dispatch. This is an initial (unoptimized) version. We split the path when inlining ObjC instance methods. On one branch we always assume that the type information for the given memory region is precise. On the other we assume that we don't have the exact type info. It is important to check since the class could be subclassed and the method can be overridden. If we always inline we can loose coverage. Had to refactor some of the call eval functions. llvm-svn: 161552	2012-08-09 00:21:33 +00:00
Jordan Rose	d86b3bdb7a	[analyzer] Clean up the printing of FieldRegions for leaks. Unfortunately, generalized region printing is very difficult: - ElementRegions are used both for casting and as actual elements. - Accessing values through a pointer means going through an intermediate SymbolRegionValue; symbolic regions are untyped. - Referring to implicitly-defined variables like 'this' and 'self' could be very confusing if they come from another stack frame. We fall back to simply not printing the region name if we can't be sure it will print well. This will allow us to improve in the future. llvm-svn: 161512	2012-08-08 18:23:36 +00:00
Jordan Rose	356279ca2d	[analyzer] Track malloc'd regions stored in structs. The main blocker on this (besides the previous commit) was that ScanReachableSymbols was not looking through LazyCompoundVals. Once that was fixed, it's easy enough to clear out malloc data on return, just like we do when we bind to a global region. <rdar://problem/10872635> llvm-svn: 161511	2012-08-08 18:23:31 +00:00
Jordan Rose	3a80cec5e9	[analyzer] Revamp RegionStore to distinguish regions with symbolic offsets. RegionStore currently uses a (Region, Offset) pair to describe the locations of memory bindings. However, this representation breaks down when we have regions like 'array[index]', where 'index' is unknown. We used to store this as (SubRegion, 0); now we mark them specially as (SubRegion, SYMBOLIC). Furthermore, ProgramState::scanReachableSymbols depended on the existence of a sub-region map, but RegionStore's implementation doesn't provide for such a thing. Moving the store-traversing logic of scanReachableSymbols into the StoreManager allows us to eliminate the notion of SubRegionMap altogether. This fixes some particularly awkward broken test cases, now in array-struct-region.c. llvm-svn: 161510	2012-08-08 18:23:27 +00:00
Anna Zaks	75930b65b4	[analyzer] Address Jordan's review of DynamicTypePropagation. llvm-svn: 161391	2012-08-07 05:12:24 +00:00
Anna Zaks	472dbcf156	[analyzer] Add a checker to manage dynamic type propagation. Instead of sprinkling dynamic type info propagation throughout ExprEngine, the added checker would add the more precise type information on known APIs (Ex: ObjC alloc, new) and propagate the type info in other cases (ex: ObjC init method, casts (the second is not implemented yet)). Add handling of ObjC alloc, new and init to the checker. llvm-svn: 161357	2012-08-06 23:25:39 +00:00
Jordan Rose	17a8757a46	[analyzer] Update initializer assertion for delegating constructors. Like base constructors, delegating constructors require no further processing in the CFGInitializer node. Also, add PrettyStackTraceLoc to the initializer and destructor logic so we can get better stack traces in the future. llvm-svn: 161283	2012-08-03 23:31:15 +00:00
Jordan Rose	cfb4eb293f	[analyzer] When a symbol is null, we should track its constraints. Because of this, we would previously emit NO path notes when a parameter is constrained to null (because there are no stores). Now we show where we made the assumption, which is much more useful. llvm-svn: 161280	2012-08-03 23:09:01 +00:00
Jordan Rose	3eb3cd45b8	[analyzer] Flatten path diagnostics for text output like we do for HTML. llvm-svn: 161279	2012-08-03 23:08:54 +00:00
Jordan Rose	92e1449b55	[analyzer] Track null/uninitialized C++ objects used in method calls. llvm-svn: 161278	2012-08-03 23:08:49 +00:00
Jordan Rose	80880ac7ee	[analyzer] Provide useful PathDiagnosticLocations for CallEnter/Exit events. llvm-svn: 161277	2012-08-03 23:08:44 +00:00
Jordan Rose	adec516f4e	[analyzer] FindLastStoreBRVisitor was not actually finding stores. The visitor walks back through the ExplodedGraph as expected, but it wasn't actually keeping track of when a value was assigned. This meant that it only worked when the value was assigned when the variable was defined. Tests in the next commit (dependent on another change). llvm-svn: 161276	2012-08-03 23:08:42 +00:00
Anna Zaks	afc13b9ec5	[analyzer] Fixup: remove the extra whitespace llvm-svn: 161265	2012-08-03 21:49:42 +00:00
Anna Zaks	150843b87e	[analyzer] ObjC Inlining: Start tracking dynamic type info in the GDM In the following code, find the type of the symbolic receiver by following it and updating the dynamic type info in the state when we cast the symbol from id to MyClass . MyClass a = [[self alloc] init]; return 5/[a testSelf]; llvm-svn: 161264	2012-08-03 21:43:37 +00:00
Anna Zaks	4bd96c4469	[analyzer] Fix a typo. Thanks Jordan. llvm-svn: 161249	2012-08-03 18:30:20 +00:00
Anna Zaks	4c03dfd4b1	[analyzer] Solve another source of non-determinism in the diagnostic engine. The code that was supposed to split the tie in a deterministic way is not deterministic. Most likely one of the profile methods uses a pointer. After this change we do finally get the consistent diagnostic output. Testing this requires running the analyzer on large code bases and diffing the results. llvm-svn: 161224	2012-08-02 23:41:05 +00:00
Jordan Rose	fa49c92b5c	[analyzer] Also emit Prev/Next links for macros in HTML output. Oops. llvm-svn: 161154	2012-08-02 02:43:42 +00:00
Jordan Rose	11790a4810	[analyzer] Add Prev/Next links to the HTML output. llvm-svn: 161153	2012-08-02 02:26:19 +00:00
Anna Zaks	4c4fe84b25	[analyzer] Flush bug reports in deterministic order. This makes the diagnostic output order deterministic. 1) This makes order of text diagnostics consistent from run to run. 2) Also resulted in different bugs being reported (from one run to another) with plist-html output. llvm-svn: 161151	2012-08-02 00:41:43 +00:00
Jordan Rose	69bd4e803b	[analyzer] Control C++ inlining with a macro in ExprEngineCallAndReturn.cpp. For now this will stay on, but this way it's easy to switch off if we need to pull back our support for a while. llvm-svn: 161064	2012-07-31 18:22:40 +00:00
Jordan Rose	a765bac7a1	[analyzer] Turn -cfg-add-initializers on by default, and remove the flag. llvm-svn: 161060	2012-07-31 18:04:59 +00:00
Jordan Rose	6a97d92ef5	[analyzer] Don't try to inline if there's no region for a message receiver. While usually we'd use a symbolic region rather than a straight-up Unknown, we can still generate unknowns via array subscripts with symbolic indexes. (And if this ever changes in the future, we still shouldn't crash.) llvm-svn: 161059	2012-07-31 18:04:53 +00:00
Jordan Rose	1f8c0b4587	[analyzer] Add a FIXME about devirtualization in ctors/dtors. llvm-svn: 161058	2012-07-31 18:04:49 +00:00
Jordan Rose	e8a21b73ac	[analyzer] Getting an lvalue for a reference field still requires a load. This was causing a crash in our array-to-pointer logic, since the region was clearly not an array. PR13440 / <rdar://problem/11977113> llvm-svn: 161051	2012-07-31 16:34:07 +00:00
Jordan Rose	42e8d6497d	[analyzer] Let CallEvent decide what goes in an inital stack frame. This removes explicit checks for 'this' and 'self' from Store::enterStackFrame. It also removes getCXXThisRegion() as a virtual method on all CallEvents; it's now only implemented in the parts of the hierarchy where it is relevant. Finally, it removes the option to ask for the ParmVarDecls attached to the definition of an inlined function, saving a recomputation of the result of getRuntimeDefinition(). No visible functionality change! llvm-svn: 161017	2012-07-31 01:07:55 +00:00
Anna Zaks	5808eb8029	[analyzer] Handle inlining of instance calls to super. Use self-init.m for testing. (It used to have a bunch of failing tests with dynamic inlining turned on.) llvm-svn: 161012	2012-07-30 23:48:36 +00:00
Jordan Rose	c2d249ce2c	[analyzer] Perform post-call checks for all inlined calls. Previously, we were only checking the origin expressions of inlined calls. Checkers using the generic postCall and older postObjCMessage callbacks were ignored. Now that we have CallEventManager, it is much easier to create a CallEvent generically when exiting an inlined function, which we can then use for post-call checks. No test case because we don't (yet) have any checkers that depend on this behavior (which is why it hadn't been fixed before now). llvm-svn: 161005	2012-07-30 23:39:47 +00:00
Anna Zaks	63282aefb9	[analyzer] Very simple ObjC instance method inlining - Retrieves the type of the object/receiver from the state. - Binds self during stack setup. - Only explores the path on which the method is inlined (no bifurcation to explore the path on which the method is not inlined). llvm-svn: 160991	2012-07-30 20:31:29 +00:00
Anna Zaks	e49190984c	[analyzer] Add -analyzer-ipa=dynamic option for inlining dynamically dispatched methods. Disabled by default for now. llvm-svn: 160988	2012-07-30 20:31:18 +00:00
Jordan Rose	fcd016e57e	[analyzer] Only allow CallEvents to be created by CallEventManager. This ensures that it is valid to reference-count any CallEvents, and we won't accidentally try to reclaim a CallEvent that lives on the stack. It also hides an ugly switch statement for handling CallExprs! There should be no functionality change here. llvm-svn: 160986	2012-07-30 20:22:09 +00:00
Jordan Rose	d457ca92ce	[analyzer] Introduce a CallEventManager to keep a pool of CallEvents. This allows us to get around the C++ "virtual constructor" problem when we'd like to create a CallEvent from an ExplodedNode, an inlined StackFrameContext, or another CallEvent. The solution has three parts: - CallEventManager uses a BumpPtrAllocator to allocate CallEvent-sized memory blocks. It also keeps a cache of freed CallEvents for reuse. - CallEvents all have protected copy constructors, along with cloneTo() methods that use placement new to copy into CallEventManager-managed memory, vtables intact. - CallEvents owned by CallEventManager are now wrapped in an IntrusiveRefCntPtr. Going forwards, it's probably a good idea to create ALL CallEvents through the CallEventManager, so that we don't accidentally try to reclaim a stack-allocated CallEvent. All of this machinery is currently unused but will be put into use shortly. llvm-svn: 160983	2012-07-30 20:21:55 +00:00
NAKAMURA Takumi	836926dbdf	clang/lib: [CMake] Update tblgen'd dependencies. llvm-svn: 160851	2012-07-27 06:18:33 +00:00
Jordan Rose	41c98d9dc3	[analyzer] Look through SubstNonTypeTemplateParmExprs. We were treating this like a CXXDefaultArgExpr, but SubstNonTypeTemplateParmExpr actually appears when a template is instantiated, i.e. we have all the information necessary to evaluate it. This allows us to inline functions like llvm::array_lengthof. <rdar://problem/11949235> llvm-svn: 160846	2012-07-27 01:15:02 +00:00
Jordan Rose	de76c92b15	[analyzer] Use a stack-based local AGAIN to fix the build for real. It's a good thing CallEvents aren't created all over the place yet. I checked all the uses this time and the private copy constructor /really/ shouldn't cause any more problems. llvm-svn: 160845	2012-07-27 00:47:52 +00:00
Jordan Rose	7aab2295be	[analyzer] Use a stack-based local instead of a temporary to fix build. Passing a temporary via reference parameter still requires a visible copy constructor. llvm-svn: 160840	2012-07-26 23:24:15 +00:00
Ted Kremenek	313c2ff375	Look at the preceding CFGBlock for the expression to load from in ExprEngine::VisitGuardedExpr instead of walking to the preceding PostStmt node. There are cases where the last evaluated expression does not appear in the ExplodedGraph. Fixes PR 13466. llvm-svn: 160819	2012-07-26 22:23:41 +00:00
Jordan Rose	72ce8e2d42	[analyzer] CallEvent is no longer a value object. After discussion, the type-based dispatch was decided to be bad for maintenance and made it very easy for subtle bugs to creep in. Instead, we'll just be very careful when we do have to allocate these on the heap. llvm-svn: 160817	2012-07-26 21:41:15 +00:00
Jordan Rose	4f7df9be69	[analyzer] Rename Calls.{h,cpp} to CallEvent.{h,cpp}. No functionality change. llvm-svn: 160815	2012-07-26 21:39:41 +00:00
Jordan Rose	25bc20f846	[analyzer] Don't crash on implicit statements inside initializers. Our BugReporter knows how to deal with implicit statements: it looks in the ParentMap until it finds a parent with a valid location. However, since initializers are not in the body of a constructor, their sub-expressions are not in the ParentMap. That was easy enough to fix in AnalysisDeclContext. ...and then even once THAT was fixed, there's still an extra funny case of Objective-C object pointer fields under ARC, which are initialized with a top-level ImplicitValueInitExpr. To catch these cases, PathDiagnosticLocation will now fall back to the start of the current function if it can't find any other valid SourceLocations. This isn't great, but it's miles better than a crash. (All of this is only relevant when constructors and destructors are being inlined, i.e. under -cfg-add-initializers and -cfg-add-implicit-dtors.) llvm-svn: 160810	2012-07-26 20:04:30 +00:00
Jordan Rose	20edae8749	[analyzer] Don't crash on array constructors and destructors. This workaround is fairly lame: we simulate the first element's constructor and destructor and rely on the region invalidation to "initialize" the rest of the elements. llvm-svn: 160809	2012-07-26 20:04:25 +00:00
Jordan Rose	54529a347e	[analyzer] Handle C++ member initializers and destructors. This uses CFG to tell if a constructor call is for a member, and uses the member's region appropriately. llvm-svn: 160808	2012-07-26 20:04:21 +00:00
Jordan Rose	05375eb4ec	[analyzer] Use the CFG to see if a constructor is for a local variable. Previously we were using ParentMap and crawling through the parent DeclStmt. This should be at least slightly cheaper (and is also more flexible). No (intended) functionality change. llvm-svn: 160807	2012-07-26 20:04:16 +00:00
Jordan Rose	b970505d0d	[analyzer] Handle base class initializers and destructors. Most of the logic here is fairly simple; the interesting thing is that we now distinguish complete constructors from base or delegate constructors. We also make sure to cast to the base class before evaluating a constructor or destructor, since non-virtual base classes may behave differently. This includes some refactoring of VisitCXXConstructExpr and VisitCXXDestructor in order to keep ExprEngine.cpp as clean as possible (leaving the details for ExprEngineCXX.cpp). llvm-svn: 160806	2012-07-26 20:04:13 +00:00
Jordan Rose	a4c0d21f42	[analyzer] Show paths for destructor calls. This modifies BugReporter and friends to handle CallEnter and CallExitEnd program points that came from implicit call CFG nodes (read: destructors). This required some extra handling for nested implicit calls. For example, the added multiple-inheritance test case has a call graph that looks like this: testMultipleInheritance3 ~MultipleInheritance ~SmartPointer ~Subclass ~SmartPointer *bug here* In this case we correctly notice that we started in an inlined function when we reach the CallEnter program point for the second ~SmartPointer. However, when we reach the next CallEnter (for ~Subclass), we were accidentally re-using the inner ~SmartPointer call in the diagnostics. Rather than guess if we saw the corresponding CallExitEnd based on the contents of the active path, we now just ask the PathDiagnostic if there's any known stack before popping off the top path. (A similar issue could have occured without multiple inheritance, but there wasn't a test case for it.) llvm-svn: 160804	2012-07-26 20:04:05 +00:00
Jordan Rose	c5d852447b	[analyzer] Inline ctors + dtors when the CFG is built for them. At the very least this means initializer nodes for constructors and automatic object destructors are present in the CFG. llvm-svn: 160803	2012-07-26 20:04:00 +00:00
Jordan Rose	443ec10e2d	[analyzer] PostImplicitCall can also occur between CFGElements. This avoids an assertion crash when we invalidate on a destructor call instead of inlining it. llvm-svn: 160802	2012-07-26 20:03:56 +00:00
Anna Zaks	83f1495fcb	[analyzer] Inline ObjC class methods. - Some cleanup(the TODOs) will be done after ObjC method inlining is complete. - Simplified CallEvent::getDefinition not to require ISDynamicDispatch parameter. - Also addressed Jordan's comments from r160530. llvm-svn: 160768	2012-07-26 00:27:51 +00:00
Ted Kremenek	80b4ac76c5	Remove the ability to stash arbitrary pointers into UndefinedVal (no longer needed). llvm-svn: 160764	2012-07-25 22:09:19 +00:00
Ted Kremenek	b5a18d5881	Remove ExprEngine::MarkBranch(), as it is no longer needed. llvm-svn: 160761	2012-07-25 21:58:29 +00:00
Ted Kremenek	bb81ffb342	Update ExprEngine's handling of ternary operators to find the ternary expression value by scanning the path, rather than assuming we have visited the '?:' operator as a terminator (which sets a value indicating which expression to grab the final ternary expression value from). llvm-svn: 160760	2012-07-25 21:58:25 +00:00
Sylvestre Ledru	830885ca64	Fix a typo (the the => the) llvm-svn: 160622	2012-07-23 08:59:39 +00:00
Benjamin Kramer	f473cd4b6a	Remove unused private member variable uncovered by the recent changes to clang's -Wunused-private-field. llvm-svn: 160584	2012-07-20 22:06:30 +00:00

... 2 3 4 5 6 ...

981 Commits