llvm-project

Commit Graph

Author	SHA1	Message	Date
Douglas Gregor	197ac203af	I predict that HeaderSearch will need the ability to generate diagnostics in the future. Make it so. llvm-svn: 144347	2011-11-11 00:35:06 +00:00
David Blaikie	21bfbf8d7c	Fixing 80 col violations (& removing any trailing whitespace on files I was touching anyway) llvm-svn: 144171	2011-11-09 06:07:30 +00:00
Eli Friedman	fcec630a57	Fix the representation of wide strings in the AST and IR so that it uses the native representation of integers for the elements. This fixes a bunch of nastiness involving treating wide strings as a series of bytes. Patch by Seth Cantrell. llvm-svn: 143417	2011-11-01 02:23:42 +00:00
Eli Friedman	703e7153af	Perform proper conversion for strings encoded in the source file as UTF-8. (For now, we are assuming the source character set is always UTF-8; this can be easily extended if necessary.) Tests will be coming up in a subsequent commit. Patch by Seth Cantrell. llvm-svn: 143416	2011-11-01 02:14:50 +00:00
Douglas Gregor	935bc7a214	Make the loading of information attached to an IdentifierInfo from an AST file more lazy, so that we don't eagerly load that information for all known identifiers each time a new AST file is loaded. The eager reloading made some sense in the context of precompiled headers, since very few identifiers were defined before PCH load time. With modules, however, a huge amount of code can get parsed before we see an @import, so laziness becomes important here. The approach taken to make this information lazy is fairly simple: when we load a new AST file, we mark all of the existing identifiers as being out-of-date. Whenever we want to access information that may come from an AST (e.g., whether the identifier has a macro definition, or what top-level declarations have that name), we check the out-of-date bit and, if it's set, ask the AST reader to update the IdentifierInfo from the AST files. The update is a merge, and we now take care to merge declarations before/after imports with declarations from multiple imports. The results of this optimization are fairly dramatic. On a small application that brings in 14 non-trivial modules, this takes modules from being > 3x slower than a "perfect" PCH file down to 30% slower for a full rebuild. A partial rebuild (where the PCH file or modules can be re-used) is down to 7% slower. Making the PCH file just a little imperfect (e.g., adding two smallish modules used by a bunch of .m files that aren't in the PCH file) tips the scales in favor of the modules approach, with 24% faster partial rebuilds. This is just a first step; the lazy scheme could possibly be improved by adding versioning, so we don't search into modules we already searched. Moreover, we'll need similar lazy schemes for all of the other lookup data structures, such as DeclContexts. llvm-svn: 143100	2011-10-27 09:33:13 +00:00
Argyrios Kyrtzidis	429ec024f8	[PCH] When visiting preprocessed entities, make it possible to avoid deserializing preprocessed entities that are #included in the range that we are interested. This is useful when we are interested in preprocessed entities of a specific file, e.g when we are annotating tokens. There is also an optimization where we cache the last result of PreprocessingRecord::getPreprocessedEntitiesInRange and we re-use it if there is a call with the same range as before. rdar://10313365 llvm-svn: 142887	2011-10-25 00:29:50 +00:00
Douglas Gregor	ebf0049901	For modules, all macros that aren't include guards are implicitly public. Add a __private_macro__ directive to hide a macro, similar to the __module_private__ declaration specifier. llvm-svn: 142188	2011-10-17 15:32:29 +00:00
Douglas Gregor	82e0d728e8	Add a preprocessor callback that is invoked every time the 'defined' operator is seen, from Jason Haslam! llvm-svn: 141926	2011-10-14 00:49:43 +00:00
Ted Kremenek	a35d67dfd9	Implement built-in macro '__has_warning', which allows one to query if a warning flag is valid. Fixes <rdar://problem/10263428>. llvm-svn: 141802	2011-10-12 19:46:30 +00:00
Richard Smith	a9e33d44a6	Handle Perforce-style conflict markers like normal conflict markers. Perforce swaps over the <<<< and >>>> markers, and uses shorter markers than traditional tools. llvm-svn: 141751	2011-10-12 00:37:51 +00:00
Richard Smith	4dd85d6fa1	Add a -Wc++0x-compat warning for C++11 keywords used as identifiers when in C++98 mode. Only the first occurrence of each keyword will produce a warning. llvm-svn: 141700	2011-10-11 19:57:52 +00:00
Argyrios Kyrtzidis	7a70d2f11b	For the FileChanged Preprocessor callback, when exiting a file, pass its FileID. llvm-svn: 141681	2011-10-11 17:29:44 +00:00
Peter Collingbourne	d937a99465	Clang-side build system infrastructure for multiple tblgens. llvm-svn: 141267	2011-10-06 01:52:10 +00:00
Abramo Bagnara	e398e60611	Fixed exapnsion range for # and ##. llvm-svn: 141012	2011-10-03 18:39:03 +00:00
John McCall	32f5fe1467	Add explicit attributes to mark functions as having had their CoreFoundation object-transfer properties audited, and add a #pragma to cause them to be automatically applied to functions in a particular span of code. This has to be implemented largely in the preprocessor because of the requirement that the region be entirely contained in a single file; that's hard to impose from the parser without registering for a ton of callbacks. llvm-svn: 140846	2011-09-30 05:12:12 +00:00
Daniel Dunbar	6de48ee860	Basic/Diagnostics: Split out the default warning "no-Werror" and "show-in-system-header" bits, which is part of teasing them apart from the diagnostic mapping kind. llvm-svn: 140742	2011-09-29 00:34:06 +00:00
Argyrios Kyrtzidis	18bcfd5595	Introduce a callback to PPCallbacks for lines skipped by the preprocessor. Patch by Jason Haslam! llvm-svn: 140612	2011-09-27 17:32:05 +00:00
David Blaikie	9c902b5502	Rename Diagnostic to DiagnosticsEngine as per issue 5397 llvm-svn: 140478	2011-09-25 23:23:43 +00:00
Argyrios Kyrtzidis	b89327ec84	Remove PreprocessingDirectiveKind since it's not necessary. llvm-svn: 140191	2011-09-20 22:14:52 +00:00
Argyrios Kyrtzidis	0d48fb89c0	The location of the name in MacroDefinition is the beginning of its range, don't store an extra location for it. llvm-svn: 140190	2011-09-20 22:14:48 +00:00
Argyrios Kyrtzidis	5733271925	In libclang, when visiting preprocessed entities in a source range, use PreprocessingRecord's getPreprocessedEntitiesInRange. Also remove all the stuff that were added in ASTUnit that are unnecessary now that we do a binary search for preprocessed entities and deserialize only what is necessary. llvm-svn: 140063	2011-09-19 20:40:48 +00:00
Argyrios Kyrtzidis	7f44836998	Introduce local_begin()/local_end() methods in PreprocessingRecord which return iterators for local, non-loaded, preprocessed entities. llvm-svn: 140062	2011-09-19 20:40:42 +00:00
Argyrios Kyrtzidis	64f6381097	Introduce PreprocessingRecord::getPreprocessedEntitiesInRange() which will do a binary search and return a pair of iterators for preprocessed entities in the given source range. Source ranges of preprocessed entities are stored twice currently in the PCH/Module file but this will be fixed in a subsequent commit. llvm-svn: 140058	2011-09-19 20:40:25 +00:00
Douglas Gregor	97eec24b0b	Add an experimental flag -fauto-module-import that automatically turns #include or #import direcctives of framework headers into module imports of the corresponding framework module. llvm-svn: 139860	2011-09-15 22:00:41 +00:00
Argyrios Kyrtzidis	03c40c5182	[PCH] Overhaul how preprocessed entities are [de]serialized. -Use an array of offsets for all preprocessed entities -Get rid of the separate array of offsets for just macro definitions; for references to macro definitions use an index inside the preprocessed entities array. -Deserialize each preprocessed entity lazily, at first request; not in bulk. Paves the way for binary searching of preprocessed entities that will offer efficiency and will simplify things on the libclang side a lot. llvm-svn: 139809	2011-09-15 18:02:56 +00:00
Douglas Gregor	1735f4e752	For modules, use a hash of the compiler version, language options, and target triple to separate modules built under different conditions. The hash is used to create a subdirectory in the module cache path where other invocations of the compiler (with the same version, language options, etc.) can find the precompiled modules. llvm-svn: 139662	2011-09-13 23:15:45 +00:00
Douglas Gregor	faeb1d4658	When an import statement fails to find a module in the module cache, but there is a corresponding umbrella header in a framework, build the module on-the-fly so it can be immediately loaded at the import statement. This is very much proof-of-concept code, with details to be fleshed out over time. llvm-svn: 139558	2011-09-12 23:31:24 +00:00
Douglas Gregor	1e44e02292	Introduce a cc1-level option to provide the path to the module cache, where the compiler will look for module files. Eliminates the egregious hack where we looked into the header search paths for modules. llvm-svn: 139538	2011-09-12 20:41:59 +00:00
Argyrios Kyrtzidis	80f78b961a	[libclang] Fix annotation and getting a "macro expansion" cursor for a builtin macro expansion. llvm-svn: 139298	2011-09-08 17:18:41 +00:00
Douglas Gregor	af5c48490e	Optimize the preprocessor's handling of the __import_module__ keyword. We now handle this keyword in HandleIdentifier, making a note for ourselves when we've seen the __import_module__ keyword so that the next lexed token can trigger a module import (if needed). This greatly simplifies Preprocessor::Lex(), and completely erases the 5.5% -Eonly slowdown Argiris noted when I originally implemented __import_module__. Big thanks to Argiris for noting that horrible regression! llvm-svn: 139265	2011-09-07 23:11:54 +00:00
Argyrios Kyrtzidis	933725c59e	Revert r139222, operator->() in PreprocessingRecord::iterator. It is useful to meet the requirements of the InputIterator concept. llvm-svn: 139261	2011-09-07 21:50:10 +00:00
Argyrios Kyrtzidis	7d500b3457	operator->() in PreprocessingRecord::iterator is useless since we are returning a pointer to pointer. llvm-svn: 139222	2011-09-07 03:43:39 +00:00
Argyrios Kyrtzidis	5cec2aea3f	Support code-completion for C++ inline methods and ObjC buffering methods. Previously we would cut off the source file buffer at the code-completion point; this impeded code-completion inside C++ inline methods and, recently, with buffering ObjC methods. Have the code-completion inserted into the source buffer so that it can be buffered along with a method body. When we actually hit the code-completion point the cut-off lexing or parsing. Fixes rdar://10056932&8319466 llvm-svn: 139086	2011-09-04 03:32:15 +00:00
Douglas Gregor	83297dfc7e	Allow the preprocessor to be constructed without performing target- and language-specific initialization. Use this to allow ASTUnit to create a preprocessor object before loading the AST file. No actual functionality change. llvm-svn: 138983	2011-09-01 23:39:15 +00:00
Argyrios Kyrtzidis	43ea78b48d	Don't try keeping a 'LeadingEmptyMacroLoc' in NullStmt. This fails in the face of buffering C++/ObjC method bodies. llvm-svn: 138972	2011-09-01 21:53:45 +00:00
Douglas Gregor	7018d5bcfb	Teach ASTContext and Preprocessor to hold on to references to the same LangOptions, rather than making distinct copies of LangOptions. Granted, LangOptions doesn't actually get modified, but this will eventually make it easier to construct ASTContext and Preprocessor before we know all of the LangOptions. llvm-svn: 138959	2011-09-01 20:23:19 +00:00
Douglas Gregor	4a69c2e6c5	Modules hide macro definitions by default, so that silly things like include guards don't show up as macro definitions in every translation unit that imports a module. Macro definitions can, however, be exported with the intentionally-ugly #__export_macro__ directive. Implement this feature by not even bothering to serialize non-exported macros to a module, because clients of that module need not (should not) know that these macros even exist. llvm-svn: 138943	2011-09-01 17:04:32 +00:00
Douglas Gregor	ca97589f7d	Switch __import__ over to __import_module__, so we don't conflict with existing practice with Python extension modules. Not that Python extension modules should be using a double-underscored identifier anyway, but... llvm-svn: 138870	2011-08-31 18:19:09 +00:00
Eli Friedman	3781a36238	Change err_pp_file_not_found back to an Error; when it's a Warning, we suppress it in system headers. And it is not a good idea to suppress it in system headers. (This was originally changed in r134996 to implement -MG.) Fixes <rdar://10041960>. And also brings down the number of warnings without a flag by one :) llvm-svn: 138842	2011-08-30 23:07:51 +00:00
Douglas Gregor	d90c3c92d5	Take an entirely different approach to handling the "parsing" of __import__ within the preprocessor, since the prior one foolishly assumed that Preprocessor::Lex() was re-entrant. We now handle __import__ at the top level (only), after macro expansion. This should fix the buildbot failures. llvm-svn: 138704	2011-08-27 06:37:51 +00:00
Douglas Gregor	081425343b	Introduce support for a simple module import declaration, which loads the named module. The syntax itself is intentionally hideous and will be replaced at some later point with something more palatable. For now, we're focusing on the semantics: - Module imports are handled first by the preprocessor (to get macro definitions) and then the same tokens are also handled by the parser (to get declarations). If both happen (as in normal compilation), the second one is redundant, because we currently have no way to hide macros or declarations when loading a module. Chris gets credit for this mad-but-workable scheme. - The Preprocessor now holds on to a reference to a module loader, which is responsible for loading named modules. CompilerInstance is the only important module loader: it now knows how to create and wire up an AST reader on demand to actually perform the module load. - We search for modules in the include path, using the module name with the suffix ".pcm" (precompiled module) for the file name. This is a temporary hack; we hope to improve the situation in the future. llvm-svn: 138679	2011-08-26 23:56:07 +00:00
Douglas Gregor	4e4c83e995	Teach the ASTReader how to avoid cycles when loading declarations that are lexically within a particular DeclContext. Test forthcoming. llvm-svn: 138668	2011-08-26 22:04:51 +00:00
Argyrios Kyrtzidis	7aecbc7661	Make Lexer::ComputePreamble accept a LangOptions parameter, otherwise it may be out-of-sync how a file is compiled. Patch by Matthias Kleine! llvm-svn: 138580	2011-08-25 20:39:19 +00:00
Argyrios Kyrtzidis	e7f7516148	Introduce SourceManager::isInSLocAddrSpace and use it in TokenLexer instead of isInFileID since it is a bit more efficient. llvm-svn: 138379	2011-08-23 21:02:38 +00:00
Argyrios Kyrtzidis	61ef3db222	Boost the efficiency of SourceManager::getMacroArgExpandedLocation. Currently getMacroArgExpandedLocation is very inefficient and for the case of a location pointing at the main file it will end up checking almost all of the SLocEntries. Make it faster: -Use a map of macro argument chunks to their expanded source location. The map is for a single source file, it's stored in the file's ContentCache and lazily computed, like the source lines cache. -In SLocEntry's FileInfo add an 'unsigned NumCreatedFIDs' field that keeps track of the number of FileIDs (files and macros) that were created during preprocessing of that particular file SLocEntry. This is useful when computing the macro argument map in skipping included files while scanning for macro arg FileIDs that lexed from a specific source file. Due to padding, the new field does not increase the size of SLocEntry. llvm-svn: 138225	2011-08-21 23:33:04 +00:00
Argyrios Kyrtzidis	eeca36fe9a	For assigning SourceLocations to macro arg tokens, reserve a single SLocEntry for tokens that are lexed consecutively from the same FileID, instead of creating a SLocEntry for each token. e.g for assert(foo == bar); there will be a single SLocEntry for the "foo == bar" chunk and locations for the 'foo', '==', 'bar' tokens will point inside that chunk. For parsing SemaExpr.cpp, this reduced the number of SLocEntries by 25%. llvm-svn: 138129	2011-08-19 22:34:17 +00:00
Argyrios Kyrtzidis	60617128e6	Rename TokenLexer::getMacroExpansionLocation -> getExpansionLocForMacroDefLoc, no functionality change. llvm-svn: 138128	2011-08-19 22:34:14 +00:00
Argyrios Kyrtzidis	75f6cd2b79	[libclang] Support code-completion inside macro arguments. llvm-svn: 137973	2011-08-18 19:41:28 +00:00
Argyrios Kyrtzidis	85a14bbd31	For the MacroExpands preprocessor callback, also pass the SourceRange of expansion (for function macros it includes the right paren). llvm-svn: 137909	2011-08-18 01:05:45 +00:00
Craig Topper	5265bb211d	Raw string followup. Pass a couple StringRefs by value. llvm-svn: 137301	2011-08-11 05:10:55 +00:00
Craig Topper	54edccafc5	Add support for C++0x raw string literals. llvm-svn: 137298	2011-08-11 04:06:15 +00:00
Eli Friedman	b3bfd84ebb	A couple fixes for preprocessor expressions: 1. Be more tolerant of comments in -CC (comment-preserving) mode. We were missing a few cases. 2. Make sure to expand the second FOO in "#if defined FOO FOO". (See also r97253, which addressed the case of "#if defined(FOO FOO".) Fixes PR10286. llvm-svn: 136748	2011-08-03 00:04:13 +00:00
Douglas Gregor	9f93e38aaf	Introduce the "-index-header-map" option, to give special semantics for quoted header lookup when dealing with not-yet-installed frameworks. Fixes <rdar://problem/9824020>. llvm-svn: 136331	2011-07-28 04:45:53 +00:00
Anna Zaks	59a3c80717	Add a utility function to the Lexer, which makes it easier to find a token after the given location. (It is a generalized version of trans::findLocationAfterSemi from ArcMigrate, which will be changed to use the Lexer utility). llvm-svn: 136268	2011-07-27 21:43:43 +00:00
Douglas Gregor	fb65e592e0	Add support for C++0x unicode string and character literals, from Craig Topper! llvm-svn: 136210	2011-07-27 05:40:30 +00:00
Ted Kremenek	fbcce6fba3	clang_getCXTUResourceUsage: report memory used by HeaderSearch. This required converting the StringMaps to use a BumpPtrAllocator. I measured the compile time and saw no observable regression. llvm-svn: 136190	2011-07-26 23:46:11 +00:00
Ted Kremenek	182543aba2	Report more memory using in Preprocessor::getTotalMemory() and PreprocessingRecord::getTotalMemory(). Most of the memory was already reported; but now we report more memory from side data structures. Fixes <rdar://problem/9379717>. llvm-svn: 136150	2011-07-26 21:17:24 +00:00
Chris Lattner	54b1677d23	Move ArrayRef to LLVM.h and eliminate now-redundant qualifiers, patch by Jon Mulder! llvm-svn: 135855	2011-07-23 17:14:25 +00:00
Chris Lattner	95c664b300	clean up forward declarations of raw_ostream to use the new LLVM.h patch by Jon Mulder! llvm-svn: 135851	2011-07-23 10:35:09 +00:00
Douglas Gregor	f676fdc8ea	One last RandomAccessIterator operator for PreprocessingRecord::iterator llvm-svn: 135680	2011-07-21 16:37:44 +00:00
Douglas Gregor	cbd50c3201	Add the remaining RandomAccessIterator operations to PreprocessingRecord::iterator. Where's concept_map when I need it? llvm-svn: 135679	2011-07-21 16:35:41 +00:00
Francois Pichet	14c7ed4b0b	For some reason I don't fully comprehend, the MSVC debug build will fail with a huge 50+ lines template error message if PreprocessingRecord::iterator has no operator<() llvm-svn: 135670	2011-07-21 06:26:00 +00:00
Douglas Gregor	4a9c39a2f6	Rework the detailed preprocessing record to separate preprocessing entities generated directly by the preprocessor from those loaded from the external source (e.g., the ASTReader). By separating these two sets of entities into different vectors, we allow both to grow independently, and eliminate the need for preallocating all of the loaded preprocessing entities. This is similar to the way the recent SourceManager refactoring treats FileIDs and the source location address space. As part of this, switch over to building a continuous range map to track preprocessing entities. llvm-svn: 135646	2011-07-21 00:47:40 +00:00
Chris Lattner	01cf8db38b	now that we have a centralized place to do so, add some using declarations for some common llvm types: stringref and smallvector. This cleans up the codebase quite a bit. llvm-svn: 135576	2011-07-20 06:58:45 +00:00
Chandler Carruth	a88a221855	Move the rest of the preprocessor terminology from 'instantiate' and variants to 'expand'. This changed a couple of public APIs, including one public type "MacroInstantiation" which is now "MacroExpansion". The rest of the codebase was updated to reflect this, especially the libclang code. Two of the C++ (and thus easily changed) libclang APIs were updated as well because they pertained directly to the old MacroInstantiation class. No functionality changed. llvm-svn: 135139	2011-07-14 08:20:46 +00:00
Chandler Carruth	e2c09ebcaa	Convert terminology in the Lexer from 'instantiate' and variants to 'expand'. Also update the public API it provides to the new term, and propagate that update to the various clients. No functionality changed. llvm-svn: 135138	2011-07-14 08:20:40 +00:00
Chandler Carruth	c9c8419c38	Switch the TokenLexer's terminology from various forms of 'instantiate' to 'expand' for macros. Only comments and uses local to the TokenLexer are updated. No functionality changed. llvm-svn: 135137	2011-07-14 08:20:34 +00:00
Argyrios Kyrtzidis	61c58f7f43	Move SourceManager::isAt[Start/End]OfMacroInstantiation functions to the Lexer, since they depend on it now. llvm-svn: 134644	2011-07-07 21:54:45 +00:00
Argyrios Kyrtzidis	41fb2d95a3	Make the Preprocessor more memory efficient and improve macro instantiation diagnostics. When a macro instantiation occurs, reserve a SLocEntry chunk with length the full length of the macro definition source. Set the spelling location of this chunk to point to the start of the macro definition and any tokens that are lexed directly from the macro definition will get a location from this chunk with the appropriate offset. For any tokens that come from argument expansion, '##' paste operator, etc. have their instantiation location point at the appropriate place in the instantiated macro definition (the argument identifier and the '##' token respectively). This improves macro instantiation diagnostics: Before: t.c:5:9: error: invalid operands to binary expression ('struct S' and 'int') int y = M(/); ^~~~ t.c:5:11: note: instantiated from: int y = M(/); ^ After: t.c:5:9: error: invalid operands to binary expression ('struct S' and 'int') int y = M(/); ^~~~ t.c:3:20: note: instantiated from: \#define M(op) (foo op 3); ~~~ ^ ~ t.c:5:11: note: instantiated from: int y = M(/); ^ The memory savings for a candidate boost library that abuses the preprocessor are: - 32% less SLocEntries (37M -> 25M) - 30% reduction in PCH file size (900M -> 635M) - 50% reduction in memory usage for the SLocEntry table (1.6G -> 800M) llvm-svn: 134587	2011-07-07 03:40:34 +00:00
Argyrios Kyrtzidis	8cc0459907	Introduce a caching mechanism for macro expanded tokens. Previously macro expanded tokens were added to Preprocessor's bump allocator and never released, even after the TokenLexer that were lexing them was finished, thus they were wasting memory. A very "useful" boost library was causing clang to eat 1 GB just for the expanded macro tokens. Introduce a special cache that works like a stack; a TokenLexer can add the macro expanded tokens in the cache, and when it finishes, the tokens are removed from the end of the cache. Now consumed memory by expanded tokens for that library is ~ 1.5 MB. Part of rdar://9327049. llvm-svn: 134105	2011-06-29 22:20:11 +00:00
Argyrios Kyrtzidis	e379ee31c0	Introduce Preprocessor::getTotalMemory() and use it in CIndex.cpp, no functionality change. llvm-svn: 134103	2011-06-29 22:20:04 +00:00
Douglas Gregor	a919bd5cee	Remove superfluous comment llvm-svn: 133757	2011-06-23 20:49:34 +00:00
Douglas Gregor	2034c85125	Bump Token::Kind from an unsigned char to an unsigned short, from Anton Lokhmotov llvm-svn: 133750	2011-06-23 20:15:25 +00:00
Douglas Gregor	3bde9b15a1	Copy diagnostic pragmas to the preprocessed output, from Richard Osborne! llvm-svn: 133633	2011-06-22 19:41:48 +00:00
Jay Foad	9a6b09874d	Make more use of llvm::StringRef in various APIs. In particular, don't use the deprecated forms of llvm::StringMap::GetOrCreateValue(). llvm-svn: 133515	2011-06-21 15:13:30 +00:00
Nico Weber	3b1d1217f8	Make it possible for external tools to distinguish between paths that come from -I and paths that come from -system. Patch from Paul Holden! llvm-svn: 131955	2011-05-24 04:31:14 +00:00
Argyrios Kyrtzidis	8b7252a8b3	Fix a nasty bug where inside StringLiteralParser: 1. We would assume that the length of the string literal token was at least 2 2. We would allocate a buffer with size length-2 And when the stars aligned (one of which would be an invalid source location due to stale PCH) The length would be 0 and we would try to allocate a 4GB buffer. Add checks for this corner case and a bunch of asserts. (We really really should have had an assert for 1.). Note that there's no test case since I couldn't get one (it was major PITA to reproduce), maybe later. llvm-svn: 131492	2011-05-17 22:09:56 +00:00
Peter Collingbourne	d5d410faa8	Introduce __has_extension macro __has_extension is a function-like macro which takes the same set of feature identifiers as __has_feature. It evaluates to 1 if the feature is supported by Clang in the current language (either as a language extension or a standard language feature) or 0 if not. At the same time, add support for the C1X feature identifiers c_generic_selections (renamed from generic_selections) and c_static_assert, and document them. Patch by myself and Jean-Daniel Dupas. llvm-svn: 131308	2011-05-13 20:54:45 +00:00
Douglas Gregor	998caead70	Introduce a new libclang parsing flag, CXTranslationUnit_NestedMacroInstantiations, which indicates whether we want to see "nested" macro instantiations (e.g., those that occur inside other macro instantiations) within the detailed preprocessing record. Many clients (e.g., those that only care about visible tokens) don't care about this information, and in code that uses preprocessor metaprogramming, this information can have a very high cost. Addresses <rdar://problem/9389320>. llvm-svn: 130990	2011-05-06 16:33:08 +00:00
Ted Kremenek	2160a0d3d7	Enhance clang_getCXTUResourceUsage() to return the amount of memory used by the Preprocessor's bump allocator as well as those from the PreprocessingRecord. llvm-svn: 130823	2011-05-04 01:38:46 +00:00
Douglas Gregor	37aa4938c8	Introduce a new libclang API, clang_isFileMultipleIncludeGuarded(), which determines whether a particular file is actually a header that is intended to be guarded from multiple inclusions within the same translation unit. llvm-svn: 130808	2011-05-04 00:14:37 +00:00
John Wiegley	1c0675e155	Parsing/AST support for Structured Exception Handling Patch authored by Sohail Somani. Provide parsing and AST support for Windows structured exception handling. llvm-svn: 130366	2011-04-28 01:08:34 +00:00
Argyrios Kyrtzidis	f7620e4d49	If a null statement was preceded by an empty macro keep its instantiation source location in NullStmt. llvm-svn: 130289	2011-04-27 05:04:02 +00:00
Manuel Klimek	0c69fd2760	To be able to replay compilations we need to accurately remodel how includes get resolved, especially when they are found relatively to another include file. We also try to get it working for framework includes, but that part of the code is untested, as I don't have a code base that uses it. llvm-svn: 130246	2011-04-26 21:50:03 +00:00
Jay Foad	1a180156b6	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Douglas Gregor	46ce91a964	Initial work to improve documentation for Clang's diagnostics, from Matthieu Monrocq llvm-svn: 129614	2011-04-15 22:04:17 +00:00
Chris Lattner	57540c5be0	fix a bunch of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129559	2011-04-15 05:22:18 +00:00
Ted Kremenek	5e14d39a88	Improve crash recovery cleanup to recovery CompilerInstances during crash recovery. This was a huge resource "root" during crashes. This change requires making a bunch of fundamental Clang structures (optionally) reference counted to allow correct ownership semantics of these objects (e.g., ASTContext) to play out between an active ASTUnit and CompilerInstance object. llvm-svn: 128011	2011-03-21 18:40:17 +00:00
Chandler Carruth	3cc331a160	Add a 'RawPath' parameter to the PPCallbacks interface. This allows clients to observe the exact path through which an #included file was located. This is very useful when trying to record and replay inclusion operations without it beind influenced by the aggressive caching done inside the FileManager to avoid redundant system calls and filesystem operations. The work to compute and return this is only done in the presence of callbacks, so it should have no effect on normal compilation. Patch by Manuel Klimek. llvm-svn: 127742	2011-03-16 18:34:36 +00:00
John McCall	462c055d85	Fix my earlier commit to work with escaped newlines and leave breadcrumbs in case we want to make a world where we can check intermediate instantiations for this kind of breadcrumb. llvm-svn: 127221	2011-03-08 07:59:04 +00:00
John McCall	cff9bcfbd3	Add an API call to retrieve the spelling data of a token from its SourceLocation. llvm-svn: 127216	2011-03-08 04:06:57 +00:00
Douglas Gregor	71a2f72f1f	xpose HeaderSearch::SearchDirs to tools,s, from Paul Holden llvm-svn: 127122	2011-03-06 17:33:53 +00:00
Peter Collingbourne	2f1e36bfd0	Rename tok::eom to tok::eod. The previous name was inaccurate as this token in fact appears at the end of every preprocessing directive, not just macro definitions. No functionality change, except for a diagnostic tweak. llvm-svn: 126631	2011-02-28 02:37:51 +00:00
Peter Collingbourne	f29ce975ba	Reimplement __pragma support using a TokenLexer llvm-svn: 126221	2011-02-22 13:49:06 +00:00
Peter Collingbourne	2c9f966600	Make TokenLexer capable of storing preprocessor directive tokens llvm-svn: 126220	2011-02-22 13:49:00 +00:00
Oscar Fuentes	6f72540e46	New function for tablegenning: clang_tablegen. llvm-svn: 126093	2011-02-20 22:06:32 +00:00
Peter Collingbourne	5de2850efb	Reimplement Token::isAnnotation() using TokenKinds.def. No functionality change. llvm-svn: 126045	2011-02-19 20:06:59 +00:00
Douglas Gregor	012b69d5bb	Teach PPChainedCallbacks to forward the InclusionDirective() callback. llvm-svn: 125669	2011-02-16 18:15:35 +00:00
Peter Collingbourne	3bffa52933	Make LexOnOffSwitch a Preprocessor member function llvm-svn: 125473	2011-02-14 01:42:24 +00:00
Douglas Gregor	46c50012ca	Rename the operation that loads a preprocessed entity from a given offset to indicate that we're loading from an offset, not an index, lest one be confused. No functionality change. llvm-svn: 125394	2011-02-11 19:46:30 +00:00
Douglas Gregor	09b6989ef0	Implement two related optimizations that make de-serialization of AST/PCH files more lazy: - Don't preload all of the file source-location entries when reading the AST file. Instead, load them lazily, when needed. - Only look up header-search information (whether a header was already #import'd, how many times it's been included, etc.) when it's needed by the preprocessor, rather than pre-populating it. Previously, we would pre-load all of the file source-location entries, which also populated the header-search information structure. This was a relatively minor performance issue, since we would end up stat()'ing all of the headers stored within a AST/PCH file when the AST/PCH file was loaded. In the normal PCH use case, the stat()s were cached, so the cost--of preloading ~860 source-location entries in the Cocoa.h case---was relatively low. However, the recent optimization that replaced stat+open with open+fstat turned this into a major problem, since the preloading of source-location entries would now end up opening those files. Worse, those files wouldn't be closed until the file manager was destroyed, so just opening a Cocoa.h PCH file would hold on to ~860 file descriptors, and it was easy to blow through the process's limit on the number of open file descriptors. By eliminating the preloading of these files, we neither open nor stat the headers stored in the PCH/AST file until they're actually needed for something. Concretely, we went from * HeaderSearch Stats: 835 files tracked. 364 #import/#pragma once files. 823 included exactly once. 6 max times a file is included. 3 #include/#include_next/#import. 0 #includes skipped due to the multi-include optimization. 1 framework lookups. 0 subframework lookups. * Source Manager Stats: 835 files mapped, 3 mem buffers mapped. 37460 SLocEntry's allocated, 11215575B of Sloc address space used. 62 bytes of files mapped, 0 files with line #'s computed. with a trivial program that uses a chained PCH including a Cocoa PCH to * HeaderSearch Stats: 4 files tracked. 1 #import/#pragma once files. 3 included exactly once. 2 max times a file is included. 3 #include/#include_next/#import. 0 #includes skipped due to the multi-include optimization. 1 framework lookups. 0 subframework lookups. * Source Manager Stats: 3 files mapped, 3 mem buffers mapped. 37460 SLocEntry's allocated, 11215575B of Sloc address space used. 62 bytes of files mapped, 0 files with line #'s computed. for the same program. llvm-svn: 125286	2011-02-10 17:09:37 +00:00
Douglas Gregor	61b7f5e3fd	Separate the access-control diagnostics from other diagnostics that do not have SFINAE behavior. llvm-svn: 124441	2011-01-27 21:06:28 +00:00
Argyrios Kyrtzidis	76dbe8c800	Speed up code-completion by skipping function bodies. When we are in code-completion mode, skip parsing of all function bodies except the one where the code-completion point resides. For big .cpp files like 'SemaExpr.cpp' the improvement makes a huge difference, in some cases cutting down code-completion time -62% ! We don't get diagnostics for the bodies though, so modify the code-completion tests that check for errors. See rdar://8814203. llvm-svn: 122765	2011-01-03 19:44:02 +00:00
Abramo Bagnara	ea4f7c7761	Introduced raw_identifier token kind. llvm-svn: 122394	2010-12-22 08:23:18 +00:00
Nick Lewycky	2e998b7a6e	Add missing standard includes. Patch by Joerg Sonnenberger! llvm-svn: 122194	2010-12-19 20:49:25 +00:00
Argyrios Kyrtzidis	1cb0de1d4c	Fix diagnostic pragmas. Diagnostic pragmas are broken because we don't keep track of the diagnostic state changes and we only check the current/latest state. Problems manifest if a diagnostic is emitted for a source line that has different diagnostic state than the current state; this can affect a lot of places, like C++ inline methods, template instantiations, the lexer, etc. Fix the issue by having the Diagnostic object keep track of the source location of the pragmas so that it is able to know what is the diagnostic state at any given source location. Fixes rdar://8365684. llvm-svn: 121873	2010-12-15 18:44:22 +00:00
Douglas Gregor	f88e35ba0b	When using a precompiled preamble with detailed preprocessing records, trap the serialized preprocessing records (macro definitions, macro instantiations, macro definitions) from the generation of the precompiled preamble, then replay those when walking the list of preprocessed entities. This eliminates a bug where clang_getCursor() wasn't able to find preprocessed-entity cursors in the preamble. llvm-svn: 120396	2010-11-30 06:16:57 +00:00
Michael J. Spencer	8aaf49959c	Merge System into Support. llvm-svn: 120297	2010-11-29 18:12:39 +00:00
Chris Lattner	226efd356c	rework the stat cache, pulling it out of FileManager.h into its own header and giving it some more structure. No functionality change. llvm-svn: 120030	2010-11-23 19:19:34 +00:00
Chris Lattner	7219a5db6e	don't allow remapping PTH file paths with -fworking-directory, the client should just pass in absolute paths. llvm-svn: 120012	2010-11-23 09:01:31 +00:00
Chris Lattner	5159f6162e	now the FileManager has a FileSystemOpts ivar, stop threading FileSystemOpts through a ton of apis, simplifying a lot of code. This also fixes a latent bug in ASTUnit where it would invoke methods on FileManager without creating one in some code paths in cindextext. llvm-svn: 120010	2010-11-23 08:35:12 +00:00
Argyrios Kyrtzidis	de2bdf637e	Revert r119838 "Don't warn for empty 'if' body if there is a macro that expands to nothing" and use a better and more general approach, where NullStmt has a flag to indicate whether it was preceded by an empty macro. Thanks to Abramo Bagnara for the hint! llvm-svn: 119887	2010-11-20 02:04:01 +00:00
Craig Silverstein	1a9ca21881	Several PPCallbacks take an SourceLocation + IdentifierInfo, rather than a Token that holds the same information all in one easy-to-use package. There's no technical reason to prefer the former -- the information comes from a Token originally -- and it's clumsier to use, so I've changed the code to use tokens everywhere. Approved by clattner llvm-svn: 119845	2010-11-19 21:33:15 +00:00
Argyrios Kyrtzidis	90ee2a4ecf	Don't warn for empty 'if' body if there is a macro that expands to nothing, e.g: if (condition) CALL(0); // empty macro but don't warn for empty body. Fixes rdar://8436021. llvm-svn: 119838	2010-11-19 20:54:25 +00:00
Argyrios Kyrtzidis	d004064864	Refactoring of Diagnostic class. -Move the stuff of Diagnostic related to creating/querying diagnostic IDs into a new DiagnosticIDs class. -DiagnosticIDs can be shared among multiple Diagnostics for multiple translation units. -The rest of the state in Diagnostic object is considered related and tied to one translation unit. -Have Diagnostic point to the SourceManager that is related with. Diagnostic can now accept just a SourceLocation instead of a FullSourceLoc. -Reflect the changes to various interfaces. llvm-svn: 119730	2010-11-18 20:06:41 +00:00
Chris Lattner	39720111e0	move getSpelling from Preprocessor to Lexer, which it is more conceptually related to. llvm-svn: 119479	2010-11-17 07:26:20 +00:00
Chris Lattner	6bab435db6	propagate preprocessor out of StringLiteralParser. It is now possible to create one without a preprocessor. llvm-svn: 119476	2010-11-17 07:21:13 +00:00
Chris Lattner	2be8aa9611	push the preprocessor out of EncodeUCNEscape llvm-svn: 119475	2010-11-17 07:12:42 +00:00
Chris Lattner	2a6ee91619	move AdvanceToTokenCharacter and getLocForEndOfToken from Preprocessor to Lexer where they make more sense. llvm-svn: 119474	2010-11-17 07:05:50 +00:00
Chris Lattner	b1ab2c2d3d	add a static version of PP::AdvanceToTokenCharacter. llvm-svn: 119472	2010-11-17 06:55:10 +00:00
Chris Lattner	bde1b81eb8	push use of Preprocessor out farther. llvm-svn: 119471	2010-11-17 06:46:14 +00:00
Chris Lattner	3a324d3232	push use of Preprocessor out of getOffsetOfStringByte llvm-svn: 119470	2010-11-17 06:35:43 +00:00
Chris Lattner	30d4c928ac	add a static form of the efficient PP::getSpelling method. llvm-svn: 119469	2010-11-17 06:31:48 +00:00
Chris Lattner	7a02bfdfce	refactor the interface to StringLiteralParser::getOffsetOfStringByte, pushing the dependency on the preprocessor out a bit. llvm-svn: 119468	2010-11-17 06:26:08 +00:00
Craig Silverstein	174241e9a0	1) Fix a typo in PPCallbacks: It's elif, not elfif. :-) This is contentful, since the typo was in the method-name... 2) Clarify some comments in RecursiveASTVisitor. llvm-svn: 118448	2010-11-08 21:43:51 +00:00
Craig Silverstein	8e3d95e7df	Add PPCallbacks for #if/#ifdef/etc. The callback info for #if/#elif is not great -- ideally it would give us a list of tokens in the #if, or even better, a little parse tree. But that's a lot more work. Instead, clients can retokenize using Lexer::LexFromRawLexer(). Reviewed by nlewycky. llvm-svn: 118318	2010-11-06 01:19:03 +00:00
Argyrios Kyrtzidis	71731d6b05	Implement -working-directory. When -working-directory is passed in command line, file paths are resolved relative to the specified directory. This helps both when using libclang (where we can't require the user to actually change the working directory) and to help reproduce test cases when the reproduction work comes along. --FileSystemOptions is introduced which controls how file system operations are performed (currently it just contains the working directory value if set). --FileSystemOptions are passed around to various interfaces that perform file operations. --Opening & reading the content of files should be done only through FileManager. This is useful in general since file operations will be abstracted in the future for the reproduction mechanism. FileSystemOptions is independent of FileManager so that we can have multiple translation units sharing the same FileManager but with different FileSystemOptions. Addresses rdar://8583824. llvm-svn: 118203	2010-11-03 22:45:23 +00:00
Douglas Gregor	f09b6c9c85	Plug a leak in the preprocessing record's handling of inclusion directives. We had a std::string in an object that was allocated via a BumpPtrAllocator. llvm-svn: 117912	2010-11-01 15:03:47 +00:00
Douglas Gregor	5ef9e33137	Make the deserialization of macro definitions lazy, so that we can load identifiers without loading their corresponding macro definitions. This is likely to improve PCH performance slightly, and reduces deserialization stack depth considerably when using preprocessor metaprogramming. llvm-svn: 117750	2010-10-30 00:23:06 +00:00
Douglas Gregor	796d76a663	Extend the preprocessing record and libclang with support for inclusion directives, keeping track of every #include, #import, etc. in the translation unit. We keep track of the source location and kind of the inclusion, how the file name was spelled, and the underlying file to which the inclusion resolved. llvm-svn: 116952	2010-10-20 22:00:55 +00:00
Anders Carlsson	274a70ed7f	Add a __has_attribute macro that works much like __has_feature and __has_builtin. llvm-svn: 116906	2010-10-20 02:31:43 +00:00
Ted Kremenek	c8456f8c59	Really^2 fix <rdar://problem/8361834>, this time without crashing. Now MICache is a linked list (per the FIXME), where we tradeoff between MacroInfo objects being in MICache and MIChainHead. MacroInfo objects in the MICache chain are already "Destroy()'ed", so they can be reused. When inserting into MICache, we need to remove them from the regular linked list so that they aren't destroyed more than once. llvm-svn: 116869	2010-10-19 22:15:20 +00:00
Ted Kremenek	1f1e4bdbf7	Simplify lifetime management of MacroInfo objects in Preprocessor by having the Preprocessor maintain them in a linked list of allocated MacroInfos. This requires only 1 extra pointer per MacroInfo object, and allows us to blow them away in one place. This fixes an elusive memory leak with MacroInfos (whose exact location I couldn't still figure out despite substantial digging). Fixes <rdar://problem/8361834>. llvm-svn: 116842	2010-10-19 18:16:54 +00:00
Douglas Gregor	6769922d8c	Add iteration over the preprocessor conditional stack to PreprocessorLexer llvm-svn: 116703	2010-10-18 14:43:21 +00:00
Sebastian Redl	9609b4f1a8	When chaining PCHs, only write PPRecords that don't come from PCH, and give them the correct IDs. Fixes a crash in XCode. llvm-svn: 114913	2010-09-27 22:18:47 +00:00
Douglas Gregor	c7d6576d54	When we parse a pragma, keep track of how that pragma was originally spelled (#pragma, _Pragma, __pragma). In -E mode, use that information to add appropriate newlines when translating _Pragma and __pragma into #pragma, like GCC does. Fixes <rdar://problem/8412013>. llvm-svn: 113553	2010-09-09 22:45:38 +00:00
Fariborz Jahanian	9e42a952d7	Use getSpelling to get original text of the c++ operator token. (radar 8328250). llvm-svn: 112977	2010-09-03 17:33:04 +00:00
Fariborz Jahanian	0389df4a45	Patch to allow alternative representation of c++ operators (and, or, etc.) to be used as selectors to match g++'s behavior. llvm-svn: 112935	2010-09-03 01:26:16 +00:00
Alexis Hunt	3b7918625c	Revert my user-defined literal commits - r1124{58,60,67} pending some issues being sorted out. llvm-svn: 112493	2010-08-30 17:47:05 +00:00
Alexis Hunt	e3675ef0f3	Two minor fixes to user-defined literals: - Zero-initialize UDLData so that crashes stop - Stop complaining that we can't emit them (we most certainly can) llvm-svn: 112467	2010-08-30 09:27:16 +00:00
Alexis Hunt	8591e9e06f	Fix some test-breaking that snuck into my previous commit llvm-svn: 112460	2010-08-29 22:39:32 +00:00
Alexis Hunt	79eb5469e0	Implement C++0x user-defined string literals. The extra data stored on user-defined literal Tokens is stored in extra allocated memory, which is managed by the PreprocessorLexer because there isn't a better place to put it that makes sure it gets deallocated, but only after it's used up. My testing has shown no significant slowdown as a result, but independent testing would be appreciated. llvm-svn: 112458	2010-08-29 21:26:48 +00:00
John McCall	89e925d78e	Add support for Microsoft's __pragma in the preprocessor. Patch by Francois Pichet! llvm-svn: 112391	2010-08-28 22:34:47 +00:00
Douglas Gregor	115837041e	Introduce a preprocessor code-completion hook for contexts where we expect "natural" language and should not provide any completions, e.g., comments, string literals, #error. llvm-svn: 112054	2010-08-25 17:04:25 +00:00
Douglas Gregor	ec00a26855	Implement code completion for preprocessor expressions and in macro arguments. llvm-svn: 111976	2010-08-24 22:20:20 +00:00
Douglas Gregor	127851084d	Implement preprocessor code completion where a macro name is expected, e.g., after #ifdef/#ifndef or #undef, or inside a defined <macroname> expression in a preprocessor conditional. llvm-svn: 111954	2010-08-24 20:21:13 +00:00
Douglas Gregor	3a7ad25eb6	Introduce basic code-completion support for preprocessor directives, e.g., after a "#" we'll suggest #if, #ifdef, etc. llvm-svn: 111943	2010-08-24 19:08:16 +00:00
Sebastian Redl	d44cd6adba	More PCH -> AST renaming. llvm-svn: 111472	2010-08-18 23:57:06 +00:00
Sebastian Redl	2c499f6561	Rename PCHReader to ASTReader. llvm-svn: 111467	2010-08-18 23:56:43 +00:00
Chris Lattner	66b67d209e	no need to pass bumppointer allocator into macroinfo::destroy llvm-svn: 111364	2010-08-18 16:08:51 +00:00
Chris Lattner	c0a585d63c	Implement #pragma push_macro, patch by Francois Pichet! llvm-svn: 111234	2010-08-17 15:55:45 +00:00
Douglas Gregor	028d3e4d0f	Use precompiled preambles for in-process code completion. llvm-svn: 110596	2010-08-09 20:45:32 +00:00
Douglas Gregor	618e64a23b	Revert r110440, the fix for PR4897. Chris claims to have a better way. llvm-svn: 110544	2010-08-08 07:49:23 +00:00
Benjamin Kramer	d05f31d059	Push location through the MacroUndefined PPCallback and use it to print #undefs in -dD mode. (PR7818) llvm-svn: 110523	2010-08-07 22:27:00 +00:00
Douglas Gregor	d26129a98a	Fix the #include search path when reading from stdin, from Jon Simons! Fixes PR4897. llvm-svn: 110440	2010-08-06 12:06:13 +00:00
Sebastian Redl	9891212476	Record macros in dependent PCHs. Also add various info tables to dependent PCHs; tests for this to follow. llvm-svn: 109554	2010-07-27 23:01:28 +00:00
Ted Kremenek	2bd41d1e30	Add PTHLexer::LexEndOfFile() to emit diagnostics at end-of-file similar to those by Lexer::LexEndOfFile(). llvm-svn: 109486	2010-07-27 02:59:02 +00:00
Ted Kremenek	3d625eb2bf	Fix predicate in 'InCachingLexMode' to include 'CurPTHLexer'. llvm-svn: 109485	2010-07-27 02:58:59 +00:00
Douglas Gregor	3f4bea0646	Introduce basic support for loading a precompiled preamble while reparsing an ASTUnit. When saving a preamble, create a buffer larger than the actual file we're working with but fill everything from the end of the preamble to the end of the file with spaces (so the lexer will quickly skip them). When we load the file, create a buffer of the same size, filling it with the file and then spaces. Then, instruct the lexer to start lexing after the preamble, therefore continuing the parse from the spot where the preamble left off. It's now possible to perform a simple preamble build + parse (+ reparse) with ASTUnit. However, one has to disable a bunch of checking in the PCH reader to do so. That part isn't committed; it will likely be handled with some other kind of flag (e.g., -fno-validate-pch). As part of this, fix some issues with null termination of the memory buffers created for the preamble; we were trying to explicitly NULL-terminate them, even though they were also getting implicitly NULL terminated, leading to excess warnings about NULL characters in source files. llvm-svn: 109445	2010-07-26 21:36:20 +00:00
Douglas Gregor	cd8bdd025f	Improve performance during cursor traversal when a region of interest is present. Rather than using clang_getCursorExtent(), which requires us to lex the token at the ending position to determine its length. Then, we'd be comparing [a, b) source ranges that cover the characters in the range rather than the normal behavior for Clang's source ranges, which covers the tokens in the range. However, relexing causes us to read the source file (which may come from a precompiled header), which is rather unfortunate and affects performance. In the new scheme, we only use Clang-style source ranges that cover the tokens in the range. At the entry points where this matters (clang_annotateTokens, clang_getCursor), we make sure to move source locations to the start of the token. Addresses most of <rdar://problem/8049381>. llvm-svn: 109134	2010-07-22 20:22:31 +00:00
Douglas Gregor	af82e3510b	Introduce a new lexer function to compute the "preamble" of a file, which is the part of the file that contains all of the initial comments, includes, and preprocessor directives that occur before any of the actual code. Added a new -print-preamble cc1 action that is only used for testing. llvm-svn: 108913	2010-07-20 20:18:03 +00:00
Argyrios Kyrtzidis	36745fda34	Modify the pragma handlers to accept and use StringRefs instead of IdentifierInfos. When loading the PCH, IdentifierInfos that are associated with pragmas cause declarations that use these identifiers to be deserialized (e.g. the "clang" pragma causes the "clang" namespace to be loaded). We can avoid this if we just use StringRefs for the pragmas. As a bonus, since we don't have to create and pass IdentifierInfos, the pragma interfaces get a bit more simplified. llvm-svn: 108237	2010-07-13 09:07:17 +00:00
Argyrios Kyrtzidis	c2924de667	If we are past tok::eof and in caching lex mode, avoid caching repeated tok::eofs. llvm-svn: 108175	2010-07-12 18:49:30 +00:00
Chris Lattner	30c924b3e8	Implement support for #pragma message, patch by Michael Spencer! llvm-svn: 106950	2010-06-26 17:11:39 +00:00
Gabor Greif	2cd6c7bd70	fix some more gcc3.4 constness warnings llvm-svn: 106216	2010-06-17 11:29:31 +00:00
Chris Lattner	bba37f4dea	fix the various buildbot failures by ensuring that tokens are really completely initialized. llvm-svn: 106043	2010-06-15 21:06:38 +00:00
Daniel Dunbar	d839e77b12	Preprocessor: Ignore unknown pragmas in -E -dM and -Eonly modes. llvm-svn: 105830	2010-06-11 20:10:12 +00:00
Benjamin Kramer	0591124608	Token is POD-like. llvm-svn: 105604	2010-06-08 11:23:26 +00:00
Douglas Gregor	9af03022ff	Tell the string literal parser when it's not permitted to emit diagnostics. That would be while we're parsing string literals for the sole purpose of producing a diagnostic about them. Fixes <rdar://problem/8026030>. llvm-svn: 104684	2010-05-26 05:35:51 +00:00
Chris Lattner	d3e4ba18ac	fit in 80 cols, remove prototypes for handling #assert since apparently noone cares. llvm-svn: 103782	2010-05-14 17:35:07 +00:00
Chris Lattner	216c33beba	add the ability to associate 'category' names with diagnostics and diagnostic groups. This allows the compiler to group diagnostics together (e.g. "Logic Warning", "Format String Warning", etc) like the static analyzer does. This is not exposed through anything in the compiler yet. llvm-svn: 103051	2010-05-04 20:44:26 +00:00
Chris Lattner	fb24a3a4ec	push some source location information down through the compiler, into ContentCache::getBuffer. This allows it to produce diagnostics on the broken #include line instead of without a location. llvm-svn: 101939	2010-04-20 20:35:58 +00:00
Chris Lattner	72286d6129	add a PPCallback handler for a skipped #include, patch by Zhanyong Wan! llvm-svn: 101813	2010-04-19 20:44:31 +00:00
Chris Lattner	0384e63501	make the token paste avoidance logic turn "..." into ".. ." instead of ". . ." when avoiding paste. Patch by David Peixotto! llvm-svn: 101218	2010-04-14 03:57:19 +00:00
Daniel Dunbar	642c02d257	Fix FileEntry declaration. llvm-svn: 99896	2010-03-30 17:57:47 +00:00
Daniel Dunbar	cb9eaf59fb	PPCallbacks: Add hook for reaching the end of the main file, and fix DependencyFile to not do work in its destructor. llvm-svn: 99257	2010-03-23 05:09:10 +00:00
Douglas Gregor	8aaca67b0a	Robustify PreprocessingRecord slightly, by only creating macro instantiations when we have the corresponding macro definition and by removing macro definition information from our table when the macro is undefined. llvm-svn: 99004	2010-03-19 21:58:23 +00:00
Douglas Gregor	aae9224e49	Implement serialization and lazy deserialization of the preprocessing record (which includes all macro instantiations and definitions). As with all lay deserialization, this introduces a new external source (here, an external preprocessing record source) that loads all of the preprocessed entities prior to iterating over the entities. The preprocessing record is an optional part of the precompiled header that is disabled by default (enabled with -detailed-preprocessing-record). When the preprocessor given to the PCH writer has a preprocessing record, that record is written into the PCH file. When the PCH reader is given a PCH file that contains a preprocessing record, it will be lazily loaded (which, effectively, implicitly adds -detailed-preprocessing-record). This is the first case where we have sections of the precompiled header that are added/removed based on a compilation flag, which is unfortunate. However, this data consumes ~550k in the PCH file for Cocoa.h (out of ~9.9MB), and there is a non-trivial cost to gathering this detailed preprocessing information, so it's too expensive to turn on by default. In the future, we should investigate a better encoding of this information. llvm-svn: 99002	2010-03-19 21:51:54 +00:00
Douglas Gregor	7dc8722bd3	Make the preprocessing record a PPCallbacks subclass itself, eliminating the extra PopulatePreprocessingRecord object. This will become useful once we start writing the preprocessing record to precompiled headers. llvm-svn: 98966	2010-03-19 17:12:43 +00:00
Douglas Gregor	7f6d60dcc2	Optionally store a PreprocessingRecord in the preprocessor itself, and tie its creation to a CC1 flag -detailed-preprocessing-record. llvm-svn: 98963	2010-03-19 16:15:56 +00:00
Douglas Gregor	78ae2481b6	Explicitly link macro instantiations to macro definitions in the preprocessing record. Use that link with clang_getCursorReferenced() and clang_getCursorDefinition() to match instantiations of a macro to the definition of the macro. llvm-svn: 98842	2010-03-18 18:23:03 +00:00
Douglas Gregor	06d6d32762	Expose macro definitions as CIndex cursors. These can still only be generated by clang_annotateTokens(). llvm-svn: 98837	2010-03-18 18:04:21 +00:00
Douglas Gregor	065f8d11ca	Introduce the notion of a "preprocessing record", which keeps track of the macro definitions and macro instantiations that are found during preprocessing. Preprocessing records are not generated by default; rather, we provide a PPCallbacks subclass that hooks into the existing callback mechanism to record this activity. The only client of preprocessing records is CIndex, which keeps track of macro definitions and instantations so that they can be exposed via cursors. At present, only token annotation uses these facilities, and only for macro instantiations; both will change in the near future. However, with this change, token annotation properly annotates macro instantiations that do not produce any tokens and instantiations of macros that are later undef'd, improving our consistency. Preprocessing directives that are not macro definitions are still handled by clang_annotateTokens() via re-lexing, so that we don't have to track every preprocessing directive in the preprocessing record. Performance impact of preprocessing records is still TBD, although it is limited to CIndex and therefore out of the path of the main compiler. llvm-svn: 98836	2010-03-18 17:52:52 +00:00
Douglas Gregor	4ad3da2843	Entering the main source file in the preprocessor can fail if the source file has been changed. Handle that failure more gracefully. llvm-svn: 98727	2010-03-17 15:44:30 +00:00
Douglas Gregor	42fe858cd6	Audit all callers of SourceManager::getCharacterData(); update some of them to recover more gracefully on failure. llvm-svn: 98672	2010-03-16 20:46:42 +00:00
Douglas Gregor	5712ebced0	Fix header-search problems with precompiled headers, where the presence or absence of header map arguments when using the precompiled header would cause Clang to get confused about which headers had already been included/imported, along with their controlling macros. The fundamental problem is that the serialization of the header search information was relying on the UIDs of FileEntry objects at PCH generation time and PCH load time to be equivalent, which effectively means that we had to probe the same files in the same order. Differing header map arguments caused an extra FileEntry lookup, but it's easy to imagine other minor command-line arguments triggering this problem. Header-search information is now encoded along with the source-location entry for a file, so that we register information about a file's properties as a header at the same time we create the FileEntry for that file. Fixes <rdar://problem/7743243>. llvm-svn: 98636	2010-03-16 16:35:32 +00:00
Douglas Gregor	7bda4b8310	Introduce optional "Invalid" parameters to routines that invoke the SourceManager's getBuffer() and, therefore, could fail, along with Preprocessor::getSpelling(). Use the Invalid parameters in the literal parsers (string, floating point, integral, character) to make them robust against errors that stem from, e.g., PCH files that are not consistent with the underlying file system. I still need to audit every use caller to all of these routines, to determine which ones need specific handling of error conditions. llvm-svn: 98608	2010-03-16 05:20:39 +00:00
Kovarththanan Rajaratnam	ba2c65277a	Use SmallString instead of SmallVector llvm-svn: 98436	2010-03-13 10:17:05 +00:00
Kovarththanan Rajaratnam	661a309933	Switch over IdentifierInfoLookup to StringRef llvm-svn: 98337	2010-03-12 08:23:34 +00:00
Kovarththanan Rajaratnam	752a124aeb	Rename to addPPCallbacks since we're effectively adding a callback and maybe chaining it to an existing one llvm-svn: 97913	2010-03-07 07:30:06 +00:00
Benjamin Kramer	a197fb6731	Move method out-of-line. I thought this would be a candidate for inlining but I was wrong. llvm-svn: 97330	2010-02-27 17:05:45 +00:00
Benjamin Kramer	c5c65d2993	Fix thinko. llvm-svn: 97323	2010-02-27 14:15:42 +00:00
Benjamin Kramer	0a1abd4088	Add an overload of Preprocessor::getSpelling which takes a SmallVector and returns a StringRef. Use it to simplify some repetitive code. llvm-svn: 97322	2010-02-27 13:44:12 +00:00
Sebastian Redl	b0e3e1bf67	When placing an annotation token over an existing annotation token, make sure that the new token's range extends to the end of the old token. Assert that in AnnotateCachedTokens. Fixes PR6248. llvm-svn: 95555	2010-02-08 19:35:18 +00:00
Douglas Gregor	27b4fa994d	Introduce a CIndex API for lexing the raw tokens within a given source range. The token-annotation function does nothing, yet. llvm-svn: 94551	2010-01-26 17:06:03 +00:00
Douglas Gregor	562c1f9365	Teach CIndex's cursor visitor to restrict its traversal to a specific region of interest (if provided). Implement clang_getCursor() in terms of this traversal rather than using the Index library; the unified cursor visitor is more complete, and will be The Way Forward. Minor other tweaks needed to make this work: - Extend Preprocessor::getLocForEndOfToken() to accept an offset from the end, making it easy to move to the last character in the token (rather than just past the end of the token). - In Lexer::MeasureTokenLength(), the length of whitespace is zero. llvm-svn: 94200	2010-01-22 19:49:59 +00:00
Chris Lattner	fde85356c6	revert my patch for rdar://7520940 that warns when a published header is #included with "foo.h" style syntax instead of framework syntax. It produced too much noise. llvm-svn: 94120	2010-01-22 00:14:44 +00:00
Chris Lattner	87d0208c41	allow the HandlerComment callback to push tokens into the preprocessor. This could be used by an OpenMP implementation or something. Patch by Abramo Bagnara! llvm-svn: 93795	2010-01-18 22:35:47 +00:00
Chris Lattner	d081f8c851	stringref'ize a bunch of filename handling logic. Much nicer than passing around two const char*'s. llvm-svn: 93094	2010-01-10 01:35:12 +00:00
Chris Lattner	2ceb625f59	implement rdar://7520940: published framework headers should import other headers within the same framework with the full framework path, not with a relative include. llvm-svn: 93083	2010-01-10 00:24:58 +00:00
Daniel Dunbar	699e014588	Add missing newline (which breaks MSVC build???) llvm-svn: 92522	2010-01-04 22:04:01 +00:00
Douglas Gregor	9882a5aac6	Teach Preprocessor::macro_begin/macro_end to lazily load all macro definitions from a precompiled header. This ensures that code-completion with macro names behaves the same with or without precompiled headers. llvm-svn: 92497	2010-01-04 19:18:44 +00:00
John McCall	53b93a091e	Diagnose out-of-bounds floating-point constants. Fixes rdar://problem/6974641 llvm-svn: 92127	2009-12-24 09:08:04 +00:00
Chris Lattner	4af1aadeb5	update comments llvm-svn: 92022	2009-12-23 19:08:19 +00:00
Chris Lattner	d19564b109	set up the machinery for a MacroArgs cache hanging off Preprocessor. We creating and free thousands of MacroArgs objects (and the related std::vectors hanging off them) for the testcase in PR5610 even though there are only ~20 live at a time. This doesn't actually use the cache yet. llvm-svn: 91391	2009-12-15 01:51:03 +00:00
Chris Lattner	7c027ee4c2	teach clang to recover gracefully from conflict markers left in source files: PR5238. llvm-svn: 91270	2009-12-14 06:16:57 +00:00
Chris Lattner	8cf1f935c2	formatting changes. llvm-svn: 91263	2009-12-14 04:54:40 +00:00
Daniel Dunbar	1776679e71	Change Preprocessor::EnterSourceFile to make ErrorStr non-optional, clients should be forced to deal with error conditions. llvm-svn: 90700	2009-12-06 09:19:12 +00:00
Douglas Gregor	5f49883488	Minor cleanup to the code-completion-point logic suggested by Chris. llvm-svn: 90459	2009-12-03 17:05:59 +00:00
Douglas Gregor	53ad6b94b0	Extend the source manager with the ability to override the contents of files with the contents of an arbitrary memory buffer. Use this new functionality to drastically clean up the way in which we handle file truncation for code-completion: all of the truncation/completion logic is now encapsulated in the preprocessor where it belongs (<rdar://problem/7434737>). llvm-svn: 90300	2009-12-02 06:49:09 +00:00
Chris Lattner	ed3b360290	pass the reason for failure up from MemoryBuffer and report it in diagnostics when we fail to open a file. This allows us to report things like: $ clang test.c -I. test.c:2:10: fatal error: error opening file './foo.h': Permission denied #include "foo.h" ^ llvm-svn: 90276	2009-12-01 22:52:33 +00:00
Chris Lattner	710bb87147	Fix PR5633 by making the preprocessor handle the case where we can stat a file but where mmaping it fails. In this case, we emit an error like: t.c:1:10: fatal error: error opening file '../../foo.h' instead of "cannot find file". llvm-svn: 90110	2009-11-30 04:18:44 +00:00
Daniel Dunbar	bf410c6fc2	Add static version of Preprocessor::getSpelling. llvm-svn: 88732	2009-11-14 01:20:48 +00:00
Daniel Dunbar	1b4441915a	Wherein the TargetInfo argument to Preprocessor is made 'const' and propogated. llvm-svn: 87087	2009-11-13 05:51:54 +00:00
Daniel Dunbar	1a54e3fbb9	Switch PTHManager to using diagnostics for most errors. Also, always give errors on a token-cache PTH failure. llvm-svn: 86939	2009-11-12 02:53:48 +00:00
Daniel Dunbar	0c6c930f05	Allow Preprocessor to take ownership of the HeaderSearch object. I think it should probably always own the header search object, but I'm not sure... llvm-svn: 86882	2009-11-11 21:44:21 +00:00
Douglas Gregor	b53edfb8dc	Improve parsing of template arguments to lay the foundation for handling template template parameters properly. This refactoring: - Parses template template arguments as id-expressions, representing the result of the parse as a template name (Action::TemplateTy) rather than as an expression (lame!). - Represents all parsed template arguments via a new parser-specific type, ParsedTemplateArgument, which stores the kind of template argument (type, non-type, template) along with all of the source information about the template argument. This replaces an ad hoc set of 3 vectors (one for a void*, which was either a type or an expression; one for a bit telling whether the first was a type or an expression; and one for a single source location pointing at the template argument). - Moves TemplateIdAnnotation into the new Parse/Template.h. It never belonged in the Basic library anyway. llvm-svn: 86708	2009-11-10 19:49:08 +00:00
Daniel Dunbar	07dcd8b9d8	Make LookUpIdentifierInfo const. This makes the Identifiers table mutable and is a little fuzzy, but conceptually it's just uniquing the identifier. Chris, please review. I debated splitting into const/non-const versions where the const one propogated constness to the resulting IdentifierInfo*. llvm-svn: 86106	2009-11-05 01:53:52 +00:00
Daniel Dunbar	f539bfeb4d	StringRefize Preprocessor::getIdentifierInfo. llvm-svn: 86105	2009-11-05 01:53:39 +00:00
Daniel Dunbar	d0ba0e6108	Kill PreprocessorFactory, which was both morally repugnant and totally unused. llvm-svn: 86076	2009-11-04 23:56:25 +00:00
Daniel Dunbar	caa00adafd	Use unsigned char instead of unsigned : 8 to make the optimizer happier. llvm-svn: 85985	2009-11-04 00:34:40 +00:00
Douglas Gregor	3cf81317e4	Parsing and semantic analysis for template-ids that name overloaded operators, e.g., operator+<int> which now works in declarators, id-expressions, and member access expressions. This commit only implements the non-dependent case, where we can resolve the template-id to an actual declaration. llvm-svn: 85966	2009-11-03 23:16:33 +00:00
John Thompson	ac0b098d4d	Added __has_include and __has_include_next. llvm-svn: 85834	2009-11-02 22:28:12 +00:00
John Thompson	b535352681	Re-arranged some internal functions for coming __has_include changes. llvm-svn: 85589	2009-10-30 13:49:06 +00:00
Chandler Carruth	a3f084ce16	Update location of DataTypes.h to reflect move in LLVM with r85086. llvm-svn: 85087	2009-10-26 01:37:10 +00:00
Mike Stump	c99c022841	This fixes support for complex literals, reworked to avoid a goto, and to add a flag noting the presence of a Microsoft extension suffix (i8, i16, i32, i64). Patch by John Thompson. llvm-svn: 83591	2009-10-08 22:55:36 +00:00
Douglas Gregor	ea9b03e6e2	Replace the -code-completion-dump option with -code-completion-at=filename:line:column which performs code completion at the specified location by truncating the file at that position and enabling code completion. This approach makes it possible to run multiple tests from a single test file, and gives a more natural command-line interface. llvm-svn: 82571	2009-09-22 21:11:38 +00:00
Chris Lattner	3e8b4f6637	allow clearing this value. llvm-svn: 82271	2009-09-18 20:39:46 +00:00
Douglas Gregor	2436e7116b	Initial implementation of a code-completion interface in Clang. In essence, code completion is triggered by a magic "code completion" token produced by the lexer [], which the parser recognizes at certain points in the grammar. The parser then calls into the Action object with the appropriate CodeCompletionXXX action. Sema implements the CodeCompletionXXX callbacks by performing minimal translation, then forwarding them to a CodeCompletionConsumer subclass, which uses the results of semantic analysis to provide code-completion results. At present, only a single, "printing" code completion consumer is available, for regression testing and debugging. However, the design is meant to permit other code-completion consumers. This initial commit contains two code-completion actions: one for member access, e.g., "x." or "p->", and one for nested-name-specifiers, e.g., "std::". More code-completion actions will follow, along with improved gathering of code-completion results for the various contexts. [] In the current -code-completion-dump testing/debugging mode, the file is truncated at the completion point and EOF is translated into "code completion". llvm-svn: 82166	2009-09-17 21:32:03 +00:00
Benjamin Kramer	c1330af0b9	SmallVectorize preprocessor's token cache. Testing shows there is almost never more than one token in the cache. llvm-svn: 81612	2009-09-12 09:45:28 +00:00
Mike Stump	11289f4280	Remove tabs, and whitespace cleanups. llvm-svn: 81346	2009-09-09 15:08:12 +00:00
Mike Stump	030867b97e	Remove tab characters. llvm-svn: 81340	2009-09-09 13:12:01 +00:00
Daniel Dunbar	dc7f7b7ac3	Add missing include. llvm-svn: 81059	2009-09-05 00:48:32 +00:00
Argyrios Kyrtzidis	3a24c4aa00	Change Preprocessor to keep a copy of LangOptions instead of reference, like ASTContext. Now when creating a Preprocessor we can pass it a temporary LangOptions object instead of having to remember to keep it around. llvm-svn: 76815	2009-07-22 23:13:42 +00:00
Douglas Gregor	c6d5edd2ed	Add support for retrieving the Doxygen comment associated with a given declaration in the AST. The new ASTContext::getCommentForDecl function searches for a comment that is attached to the given declaration, and returns that comment, which may be composed of several comment blocks. Comments are always available in an AST. However, to avoid harming performance, we don't actually parse the comments. Rather, we keep the source ranges of all of the comments within a large, sorted vector, then lazily extract comments via a binary search in that vector only when needed (which never occurs in a "normal" compile). Comments are written to a precompiled header/AST file as a blob of source ranges. That blob is only lazily loaded when one requests a comment for a declaration (this never occurs in a "normal" compile). The indexer testbed now supports comment extraction. When the -point-at location points to a declaration with a Doxygen-style comment, the indexer testbed prints the associated comment block(s). See test/Index/comments.c for an example. Some notes: - We don't actually attempt to parse the comment blocks themselves, beyond identifying them as Doxygen comment blocks to associate them with a declaration. - We won't find comment blocks that aren't adjacent to the declaration, because we start our search based on the location of the declaration. - We don't go through the necessary hops to find, for example, whether some redeclaration of a declaration has comments when our current declaration does not. Similarly, we don't attempt to associate a \param Foo marker in a function body comment with the parameter named Foo (although that is certainly possible). - Verification of my "no performance impact" claims is still "to be done". llvm-svn: 74704	2009-07-02 17:08:52 +00:00
Douglas Gregor	33834516f3	Update LLVM. Implement support for C++ Substitution Failure Is Not An Error (SFINAE), which says that errors that occur during template argument deduction do not produce diagnostics and do not necessarily make a program ill-formed. Instead, template argument deduction silently fails. This is currently implemented for template argument deduction during matching of class template partial specializations, although the mechanism will also apply to template argument deduction for function templates. The scheme is simple: - If we are in a template argument deduction context, any diagnostic that is considered a SFINAE error (or warning) will be suppressed. The error will be propagated up the call stack via the normal means. - By default, all warnings and errors are SFINAE errors. Add the NoSFINAE class to a diagnostic in the .td file to make it a hard error (e.g., for access-control violations). Note that, to make this fully work, every place in Sema that emits an error and then immediately recovers will need to check Sema::isSFINAEContext() to determine whether it must immediately return an error rather than recovering. llvm-svn: 73332	2009-06-14 07:33:30 +00:00
Chris Lattner	15ba94987a	Sink the BuiltinInfo object from ASTContext into the preprocessor and initialize it early in clang-cc. This ensures that __has_builtin works in all modes, not just when ASTContext is around. llvm-svn: 73319	2009-06-14 01:54:56 +00:00
Chris Lattner	b6f77af532	implement and document a new __has_feature and __has_builtin magic builtin preprocessor macro. This appears to work with two caveats: 1) builtins are registered in -E mode, and 2) target-specific builtins are unconditionally registered even if they aren't supported by the target (e.g. SSE4 builtin when only SSE1 is enabled). llvm-svn: 73289	2009-06-13 07:13:28 +00:00
Eli Friedman	d8cec57b9d	PR4283: Don't truncate multibyte character constants in the preprocessor. llvm-svn: 72686	2009-06-01 05:25:02 +00:00
Douglas Gregor	8533490df9	Add forward declaration of Token. Thanks to Martin Doucha for pointing this out llvm-svn: 71772	2009-05-14 15:47:53 +00:00
Daniel Dunbar	d58929be46	PR4063, with feeling: Chain PP callbacks by default. - This is somewhat cleaner and also fixes PR4063 for real, I had the order wrong so we were just creating an empty dependency file. llvm-svn: 70687	2009-05-03 10:04:17 +00:00
Douglas Gregor	99734e7669	Lazily load the controlling macros for all of the headers known in the PCH file. In the Cocoa-prefixed "Hello, World" benchmark, this takes us from reading 503 identifiers down to 37 and from 470 macros down to 4. It also results in an 8% performance improvement. llvm-svn: 70094	2009-04-25 23:30:02 +00:00
Steve Naroff	3fa455a1aa	Add PCH support for #import. llvm-svn: 69987	2009-04-24 20:03:17 +00:00
Chris Lattner	d6e97af74a	improve MacroInfo to track the source range of the macro definition, patch by Alexei Svitkine! llvm-svn: 69659	2009-04-21 04:46:33 +00:00
Chris Lattner	cd6d4b105a	add a preprocessor callback function for #undef, patch by Alexei Svitkine! llvm-svn: 69656	2009-04-21 03:42:09 +00:00
Sanjiv Gupta	f09cb95236	Use an APInt of target int size to detect overflow while parsing multichars. So 'abc' on i16 platforms will warn but not on i32 platforms. llvm-svn: 69653	2009-04-21 02:21:29 +00:00
Chris Lattner	38b2cde4c4	add a new Lexer::SkipEscapedNewLines method. llvm-svn: 69483	2009-04-18 22:27:02 +00:00
Chris Lattner	fbce7aa1f4	factor escape newline measuring out into its own helper function. llvm-svn: 69482	2009-04-18 22:05:41 +00:00
Chris Lattner	0003c27f5e	#line is allowed to have macros that expand to nothing after them. llvm-svn: 69401	2009-04-17 23:30:53 +00:00
Chris Lattner	a538967177	tblgen is now passing diagnostic group information in the .inc file, ignore it everywhere. llvm-svn: 69269	2009-04-16 05:52:14 +00:00
Chris Lattner	1b595624a8	Tblgen now passes the default mapping explicitly, instead of having it be tied to the diag class. This requires an LLVM tree update. llvm-svn: 69175	2009-04-15 16:44:12 +00:00
Chris Lattner	184e65d363	Change Lexer::MeasureTokenLength to take a LangOptions reference. This allows it to accurately measure tokens, so that we get: t.cpp:8:13: error: unknown type name 'X' static foo::X P; ~~~~~^ instead of the woefully inferior: t.cpp:8:13: error: unknown type name 'X' static foo::X P; ~~~~ ^ Most of this is just plumbing to push the reference around. llvm-svn: 69099	2009-04-14 23:22:57 +00:00
Chris Lattner	0af3ba1748	implement the microsoft/gnu "__COUNTER__" macro: rdar://4329310 llvm-svn: 68933	2009-04-13 01:29:17 +00:00
Chris Lattner	928e9096cb	add a ppcallback hook for macro definitions. llvm-svn: 68883	2009-04-12 01:39:54 +00:00
Douglas Gregor	92863e475e	Compare the predefines buffer in the PCH file with the predefines buffer generated for the current translation unit. If they are different, complain and then ignore the PCH file. This effectively checks for all compilation options that somehow would affect preprocessor state (-D, -U, -include, the dreaded -imacros, etc.). When we do accept the PCH file, throw away the contents of the predefines buffer rather than parsing them, since all of the results of that parsing are already stored in the PCH file. This eliminates the ugliness with the redefinition of __builtin_va_list, among other things. llvm-svn: 68838	2009-04-10 23:10:45 +00:00
Chris Lattner	d959d753bc	do a dance with predefines, and finally enable reading of macros from PCH. This works now, except for limitations not being able to do things with identifiers. The basic example in the testcase works though. llvm-svn: 68832	2009-04-10 22:13:17 +00:00
Chris Lattner	1156565c48	make a method public llvm-svn: 68827	2009-04-10 21:40:09 +00:00
Chris Lattner	baa52f47c1	emit function-like and object-like macros to the PCH file. Note that we don't do anything useful with identifier infos yet and don't emit the tokens that the macros are defined to. llvm-svn: 68797	2009-04-10 18:00:12 +00:00
Douglas Gregor	a7f71a91c5	PCH serialization/deserialization of the source manager. With this improvement, source locations read from the PCH file will properly resolve to the source files that were used to build the PCH file itself. Once we have the preprocessor state stored in the PCH file, source locations that refer to macro instantiations that occur in the PCH file should have the appropriate instantiation information. llvm-svn: 68758	2009-04-10 03:52:48 +00:00
Chris Lattner	58a1eb0ba0	reject the #__include_macros directive unless it comes from the predefines buffer. llvm-svn: 68627	2009-04-08 18:46:40 +00:00
Ted Kremenek	dc637a0be9	PTHManager::Create(): - Make the Diagnostic::Level for PTH errors to be specified by the caller clang (driver): - Set the PTHManager diagnostic level to "Diagnostic::Error" for -include-pth (a hard error) and Diagnostic::Warning for -token-cache (we can still proceed). llvm-svn: 67462	2009-03-22 06:42:39 +00:00
Ted Kremenek	22c3dd4c1c	Add accessor Preprocessor::getPTHManager(). llvm-svn: 67351	2009-03-20 00:24:49 +00:00
Sebastian Redl	90b6edda75	Bindir and Win32 builds work, so switch to .inc files. Leave the .def files in the tree for a day or so longer. llvm-svn: 67346	2009-03-19 23:18:26 +00:00
Ted Kremenek	f9ccd5cdc2	Add PTHManager::getOriginalSourceFile(), a method that returns the name of the original source file (if any) that was used to generate the PTH cache. llvm-svn: 67343	2009-03-19 22:19:30 +00:00
Sebastian Redl	4ae9b126fe	Revert the switch to the tablegen diags. It fails for seperate objdir builds and cmake builds, and I have no clue what to do about it. Revisit this after someone with a clue about the build systems has looked at it. llvm-svn: 67009	2009-03-14 15:58:54 +00:00
Sebastian Redl	51e037e3c4	Switch diagnostics from .def to tablegen files. Please validate the Windows build. llvm-svn: 67007	2009-03-14 12:00:12 +00:00
Chris Lattner	e07ea35817	fix PR3798 by ignoring all diagnostics generated while repreprocessing a file in rewrite macros. llvm-svn: 66961	2009-03-13 21:44:46 +00:00
Chris Lattner	83aba00ee8	make Preprocessor::Diags be a pointer instead of a reference. llvm-svn: 66955	2009-03-13 21:17:43 +00:00
Chris Lattner	cf35ce9d0d	add a callback for macro expansion, based on a patch by Paolo Bolzoni! llvm-svn: 66799	2009-03-12 17:31:43 +00:00
Chris Lattner	fa217bda40	simplify some logic by making ScratchBuffer handle the application of trailing \0's to created tokens instead of making all clients do it. No functionality change. llvm-svn: 66373	2009-03-08 08:08:45 +00:00
Chris Lattner	8322dc809e	make the token lexer allocate its temporary token buffers for preexpanded macro arguments from the preprocessor's bump pointer. This reduces # mallocs from 12444 to 11792. llvm-svn: 66025	2009-03-04 06:50:57 +00:00
Douglas Gregor	96977da72c	Clean up and document code modification hints. llvm-svn: 65641	2009-02-27 17:53:17 +00:00
Chris Lattner	d42c29f9a2	fix some sema problems with wide strings and hook up basic codegen for them. llvm-svn: 65582	2009-02-26 23:01:51 +00:00
Douglas Gregor	441f24118a	Include the appropriate header for malloc llvm-svn: 65471	2009-02-25 19:48:02 +00:00
Douglas Gregor	7f74112756	Implement parsing of nested-name-specifiers that involve template-ids, e.g., std::vector<int>::allocator_type When we parse a template-id that names a type, it will become either a template-id annotation (which is a parsed representation of a template-id that has not yet been through semantic analysis) or a typename annotation (where semantic analysis has resolved the template-id to an actual type), depending on the context. We only produce a type in contexts where we know that we only need type information, e.g., in a type specifier. Otherwise, we create a template-id annotation that can later be "upgraded" by transforming it into a typename annotation when the parser needs a type. This occurs, for example, when we've parsed "std::vector<int>" above and then see the '::' after it. However, it means that when writing something like this: template<> class Outer::Inner<int> { ... }; We have two tokens to represent Outer::Inner<int>: one token for the nested name specifier Outer::, and one template-id annotation token for Inner<int>, which will be passed to semantic analysis to define the class template specialization. Most of the churn in the template tests in this patch come from an improvement in our error recovery from ill-formed template-ids. llvm-svn: 65467	2009-02-25 19:37:18 +00:00
Chris Lattner	70946da73a	switch the macroinfo argument lists from being allocated off the heap to being allocated from the same bumpptr that the MacroInfo objects themselves are. This speeds up -Eonly cocoa.h pth by ~4%, fsyntax-only is barely measurable. llvm-svn: 65195	2009-02-20 22:46:43 +00:00
Chris Lattner	f87c510cc9	detemplatify setArgumentList and some other cleanups. llvm-svn: 65187	2009-02-20 22:31:31 +00:00
Chris Lattner	666f7a42d6	require the MAcroInfo objects are explcitly destroyed. llvm-svn: 65179	2009-02-20 22:19:20 +00:00
Chris Lattner	ddb7191920	Next step toward making string diagnostics correct: handle escapes in the string for subtoken positioning. This gives us working examples like: t.m:5:16: warning: field width should have type 'int', but argument has type 'unsigned int' printf("\n\n%*d", (unsigned) 1, 1); ^ ~~~~~~~~~~~~ where before the caret pointed two spaces to the left. llvm-svn: 64940	2009-02-18 19:21:10 +00:00
Chris Lattner	9dc9c206d3	track "just a little more" location information for macro instantiations. Now instead of just tracking the expansion history, also track the full range of the macro that got replaced. For object-like macros, this doesn't change anything. For _Pragma and function-like macros, this means we track the locations of the ')'. This is required for PR3579 because apparently GCC uses the line of the ')' of a function-like macro as the location to expand __LINE__ to. llvm-svn: 64601	2009-02-15 20:52:18 +00:00
Ted Kremenek	29942a349c	Add some boilerplate to the PTH file to prepare for the caching of stats for directories (and negative stats too). llvm-svn: 64477	2009-02-13 19:13:46 +00:00
Chris Lattner	644d452de5	factor token concatenation avoidance logic out of PrintPreprocessedOutput into its own file. No functionality change. llvm-svn: 64418	2009-02-13 00:46:04 +00:00
Ted Kremenek	a5c2c27ebd	PTH: Cache stat information for files in the PTH file. Hook up FileManager to use this stat information in the PTH file using a 'StatSysCallCache' object. Performance impact (Cocoa.h, PTH): - number of stat calls reduces from 1230 to 425 - fsyntax-only: time improves by 4.2% We can reduce the number of stat calls to almost zero by caching negative stat calls and directory stat calls in the PTH file as well. llvm-svn: 64353	2009-02-12 03:26:59 +00:00
Ted Kremenek	4c1d41f2b1	PTH: Have meta data be at the beginning of the PTH file, not the end. llvm-svn: 64338	2009-02-11 23:34:32 +00:00
Ted Kremenek	e5554deb45	PTH: Replace string identifier to persistent ID lookup with a hashtable. This is actually slightly slower than the binary search. Since this is algorithmically better, further performance tuning should be able to make this faster. llvm-svn: 64326	2009-02-11 21:29:16 +00:00
Ted Kremenek	8527e3a727	PTH: Don't emit the PTH offset of the IdentifierInfo string data as that data is referenced by other tables. llvm-svn: 64304	2009-02-11 16:06:55 +00:00
Ted Kremenek	f70f089286	Bump PTH version. llvm-svn: 64249	2009-02-10 22:37:57 +00:00
Chris Lattner	1630c3c4f0	Add an implementation of -dM that follows GCC closely enough to permit diffing the output of: clang -dM -o - -E -x c foo.c \| sort llvm-svn: 63926	2009-02-06 06:45:26 +00:00
Chris Lattner	5c7cd6043e	add interface for walking macro table. llvm-svn: 63925	2009-02-06 06:26:42 +00:00
Chris Lattner	4b6713ef20	next round of diagnostics cleanups, moving some diags around, eliminating #defines, etc. Patch by Anders Johnsen! llvm-svn: 63318	2009-01-29 17:46:13 +00:00
Chris Lattner	36790cf29a	Fix -Wimplicit-function-declaration, which required some refactoring and changes in various diagnostics code. llvm-svn: 63282	2009-01-29 06:55:46 +00:00
Chris Lattner	60f36223a9	move library-specific diagnostic headers into library private dirs. Reduce redundant #includes. Patch by Anders Johnsen! llvm-svn: 63271	2009-01-29 05:15:15 +00:00
Ted Kremenek	3b0589e4b4	Enhance PTHManager::Create() to take an optional Diagnostic* argument that can be used to report issues such as a missing PTH file. llvm-svn: 63231	2009-01-28 20:49:33 +00:00
Ted Kremenek	8d178f4357	PTH: Use Token::setLiteralData() to directly store a pointer to cached spelling data in the PTH file. This removes a ton of code for looking up spellings using sourcelocations in the PTH file. This simplifies both PTH-generation and reading. Performance impact for -fsyntax-only on Cocoa.h (with Cocoa.h in the PTH file): - PTH generation time improves by 5% - PTH reading improves by 0.3%. llvm-svn: 63072	2009-01-27 00:01:05 +00:00
Chris Lattner	9240b3eccb	rename getSpelledCharacterAt to getSpellingOfSingleCharacterNumericConstant, optimize it to use the LiteralData when possible. llvm-svn: 63060	2009-01-26 22:36:52 +00:00
Ted Kremenek	2483a6126f	Add version number to PTH files. llvm-svn: 63052	2009-01-26 22:15:00 +00:00
Chris Lattner	5a7971e0c3	This change refactors some of the low-level lexer interfaces a bit. Token now has a class of kinds for "literals", which include numeric constants, strings, etc. These tokens can optionally have a pointer to the start of the token in the lexer buffer. This makes it faster to get spelling and do other gymnastics, because we don't have to go through source locations. This change is performance neutral, but will make other changes more feasible down the road. llvm-svn: 63028	2009-01-26 19:29:26 +00:00
Chris Lattner	76e689636b	add parsing and constraint enforcement for GNU line marker directives. llvm-svn: 63003	2009-01-26 06:19:46 +00:00
Chris Lattner	100c65e810	parse and enforce required constraints on #line directives. Right now we just discard them. llvm-svn: 62999	2009-01-26 05:29:08 +00:00
Chris Lattner	4fa23625ab	Check in the long promised SourceLocation rewrite. This lays the ground work for implementing #line, and fixes the "out of macro ID's" problem. There is nothing particularly tricky about the code, other than the very performance sensitive SourceManager::getFileID() method. llvm-svn: 62978	2009-01-26 00:43:02 +00:00
Chris Lattner	9db60a38e9	Preprocessor doesn't require and IdentifierInfoLookup object. Patch by Axel Naumann! llvm-svn: 62854	2009-01-23 18:00:48 +00:00
Chris Lattner	29a2a191f2	Make SourceLocation::getFileLoc private to reduce the API exposure of SourceLocation. This requires making some cleanups to token pasting and _Pragma expansion. llvm-svn: 62490	2009-01-19 06:46:35 +00:00
Chris Lattner	144aacd19e	rearrange GetIdentifierInfo so that the fast path can be partially inlined into PTHLexer::Lex. This speeds up the user time of PTH -Eonly by another 2ms (4.4%) llvm-svn: 62454	2009-01-18 02:57:21 +00:00
Chris Lattner	428613f4fc	Avoid malloc thrashing on the std::vector for ConditionalStack. Because there is one of these per header, this almost always gets alloc/free'd for each #ifdef. llvm-svn: 62451	2009-01-18 02:52:26 +00:00
Chris Lattner	eb09754a9d	switch PTH lexer from using "const char"s to "const unsigned char"s internally. This is just a cleanup that reduces the need to cast to unsigned char before assembling a larger integer. llvm-svn: 62442	2009-01-18 01:57:14 +00:00
Chris Lattner	757169b60f	Change the Lexer ctor used to lex _Pragma directives into a static factory method. This lets us clean up the interface and make it more obvious that this method is really really _Pragma specific. Note that _Pragma handling uglifies the Lexer in the critical path. It would be very interesting to consider making _Pragma remapping be a new special lexer class of its own. llvm-svn: 62425	2009-01-17 08:27:52 +00:00
Chris Lattner	ab1d4b8abd	simplify PTHManager::CreateLexer llvm-svn: 62424	2009-01-17 08:06:50 +00:00
Chris Lattner	c809089b26	Change the Lexer ctor used in the non _Pragma case to take a FileID instead of a SourceLocation. This should speed it up and definitely simplifies it. llvm-svn: 62422	2009-01-17 08:03:42 +00:00
Chris Lattner	5965a28a4b	More simplifications to the lexer ctors. llvm-svn: 62419	2009-01-17 07:56:59 +00:00
Chris Lattner	fcf6452eb4	make the verbose raw-lexer ctor fully explicit instead of having embedded magic. llvm-svn: 62417	2009-01-17 07:42:27 +00:00
Chris Lattner	08354fef13	add a simplified lexer ctor that sets up the lexer to raw-lex an entire file. llvm-svn: 62414	2009-01-17 07:35:14 +00:00
Chris Lattner	f76b92092e	refactor some common initialization code out of the two lexer ctors. llvm-svn: 62411	2009-01-17 06:55:17 +00:00
Chris Lattner	3793bba26f	suck the call to "getSpellingLoc" that all clients do into the implementation of PTHManager::getSpelling. llvm-svn: 62408	2009-01-17 06:29:33 +00:00
Chris Lattner	d32480d3db	this massive patch introduces a simple new abstraction: it makes "FileID" a concept that is now enforced by the compiler's type checker instead of yet-another-random-unsigned floating around. This is an important distinction from the "FileID" currently tracked by SourceLocation. That FileID may refer to the start of a file or to a chunk within it. The new FileID only refers to the file (and its #include stack and eventually #line data), it cannot refer to a chunk. FileID is a completely opaque datatype to all clients, only SourceManager is allowed to poke and prod it. llvm-svn: 62407	2009-01-17 06:22:33 +00:00
Chris Lattner	262d4e31b9	Improve #pragma comment support by building the string argument and notifying PPCallbacks about it. llvm-svn: 62333	2009-01-16 18:59:23 +00:00
Chris Lattner	8a24e588d7	minor cleanups to StringLiteralParser: no need to pass target info into its ctor. Also, make it handle validity checking of pascal strings instead of making clients do it. llvm-svn: 62332	2009-01-16 18:51:42 +00:00
Chris Lattner	2ff698df60	Implement basic support for parsing #pragma comment, a microsoft extension documented here: http://msdn.microsoft.com/en-us/library/7f0aews7(VS.80).aspx This is according to my understanding reading the docs, I don't know if it really agrees fully with what VC++ allows. llvm-svn: 62317	2009-01-16 08:21:25 +00:00
Chris Lattner	c4c181902e	rename PP::getPhysicalCharacterAt -> PP::getSpelledCharacterAt. Slightly speed up sema of numbers like '1' by going directly to TargetInfo instead of through ASTContext. llvm-svn: 62314	2009-01-16 07:10:29 +00:00
Chris Lattner	53e384f633	Change some terminology in SourceLocation: instead of referring to the "physical" location of tokens, refer to the "spelling" location. This is more concrete and useful, tokens aren't really physical objects! llvm-svn: 62309	2009-01-16 07:00:02 +00:00
Ted Kremenek	a705b04d7f	IdentifierInfo: - IdentifierInfo can now (optionally) have its string data not be co-located with itself. This is for use with PTH. This aspect is a little gross, as getName() and getLength() now make assumptions about a possible alternate representation of IdentifierInfo. Perhaps we should make IdentifierInfo have virtual methods? IdentifierTable: - Added class "IdentifierInfoLookup" that can be used by IdentifierTable to perform "string -> IdentifierInfo" lookups using an auxilliary data structure. This is used by PTH. - Perform tests show that IdentifierTable::get() does not slow down because of the extra check for the IdentiferInfoLookup object (the regular StringMap lookup does enough work to mitigate the impact of an extra null pointer check). - The upshot is that now that some IdentifierInfo objects might be owned by the IdentiferInfoLookup object. This should be reviewed. PTH: - Modified PTHManager::GetIdentifierInfo to not insert entries in IdentifierTable's string map, and instead create IdentifierInfo objects on the fly when mapping from persistent IDs to IdentifierInfos. This saves a ton of work with string copies, hashing, and StringMap lookup and resizing. This change was motivated because when processing source files in the PTH cache we don't need to do any string -> IdentifierInfo lookups. - PTHManager now subclasses IdentifierInfoLookup, allowing clients of IdentifierTable to transparently use IdentifierInfo objects managed by the PTH file. PTHManager resolves "string -> IdentifierInfo" queries by doing a binary search over a sorted table of identifier strings in the PTH file (the exact algorithm we use can be changed as needed). These changes lead to the following performance changes when using PTH on Cocoa.h: - fsyntax-only: 10% performance improvement - Eonly: 30% performance improvement llvm-svn: 62273	2009-01-15 18:47:46 +00:00
Ted Kremenek	e9814186ac	PTH: - Use canonical FileID when using getSpelling() caching. This addresses some cache misses we were seeing with -fsyntax-only on Cocoa.h - Added Preprocessor::getPhysicalCharacterAt() utility method for clients to grab the first character at a specified sourcelocation. This uses the PTH spelling cache. - Modified Sema::ActOnNumericConstant() to use Preprocessor::getPhysicalCharacterAt() instead of SourceManager::getCharacterData() (to get PTH hits). These changes cause -fsyntax-only to not page in any sources from Cocoa.h. We see a speedup of 27%. llvm-svn: 62193	2009-01-13 23:19:12 +00:00
Ted Kremenek	b0b4f74b6b	PTH: Fix remaining cases where the spelling cache in the PTH file was being missed when it shouldn't. This shaves another 7% off PTH time for -Eonly on Cocoa.h llvm-svn: 62186	2009-01-13 22:05:50 +00:00
Ted Kremenek	47b8cf6deb	Enhance PTH 'getSpelling' caching: - Refactor caching logic into a helper class PTHSpellingSearch - Allow "random accesses" in the spelling cache, thus catching the remaining cases where 'getSpelling' wasn't hitting the PTH cache For -Eonly, PTH, Cocoa.h: - This reduces wall time by 3% (user time unchanged, sys time reduced) - This reduces the amount of paged source by 1112K. The remaining 1112K still being paged in is from somewhere else (investigating). llvm-svn: 62009	2009-01-09 22:05:30 +00:00
Ted Kremenek	d5e6e16d0d	PTH: Hook up getSpelling() caching in PTHLexer. This results in a nice performance gain. Here's what we see for -Eonly on Cocoa.h (using PTH): - wall time decreases by 21% (26% speedup overall) - system time decreases by 35% - user time decreases by 6% These reductions are due to not paging source files just to get spellings for literals. The solution in place doesn't appear to be 100% yet, as we still see some of the pages for source files getting mapped in. Using -print-stats, we see that SourceManager maps in 7179K less bytes of source text (reduction of 75%). Will investigate why the remaining 25% are getting paged in. With these changes, here's how PTH compares to non-PTH on Cocoa.h: -Eonly: PTH takes 64% of the time as non-PTH (54% speedup) -fsyntax-only: PTH takes 89% of the time as non-PTH (11% speedup) llvm-svn: 61913	2009-01-08 04:30:32 +00:00
Ted Kremenek	884a558441	PTH: - Added stub PTHLexer::getSpelling() that will be used for fetching cached spellings from the PTH file. This doesn't do anything yet. - Added a hook in Preprocessor::getSpelling() to call PTHLexer::getSpelling() when using a PTHLexer. - Updated PTHLexer to read the offsets of spelling tables in the PTH file. llvm-svn: 61911	2009-01-08 02:47:16 +00:00
Chris Lattner	f96b08302f	Make Token::setLength assert that the token is not an annotation token. llvm-svn: 61791	2009-01-06 05:25:04 +00:00
Chris Lattner	a8a3f73a47	rename tok::annot_qualtypename -> tok::annot_typename, which is both shorter and more accurate. The type name might not be qualified. llvm-svn: 61788	2009-01-06 05:06:21 +00:00
Chris Lattner	c69537feb5	Fix a bug where we'd try to look beyond the current cached tokens when not in backtracking mode. This was just using the wrong predicate. llvm-svn: 61666	2009-01-05 01:42:04 +00:00
Ted Kremenek	78cc24730e	PTH: Remove some methods and simplify some conditions in PTHLexer::Lex(). No big functionality change. llvm-svn: 61381	2008-12-23 19:24:24 +00:00
Ted Kremenek	66076a964b	PTH: - In PTHLexer::Lex read all of the token data from PTH file before constructing the token. The idea is to enhance locality. - Do not use Read8/Read32 in PTHLexer::Lex. Inline these operations manually. - Change PTHManager::ReadIdentifierInfo() to PTHManager::GetIdentifierInfo(). They are functionally the same except that PTHLexer::Lex() reads the persistent id. These changes result in a 3.3% speedup for PTH on Cocoa.h (-Eonly). llvm-svn: 61363	2008-12-23 02:30:15 +00:00
Ted Kremenek	1b18ad240c	PTH: - Embed 'eom' tokens in PTH file. - Use embedded 'eom' tokens to not lazily generate them in the PTHLexer. This means that PTHLexer can always advance to the next token after reading a token (instead of buffering tokens using a copy). - Moved logic of 'ReadToken' into Lex. GetToken & ReadToken no longer exist. - These changes result in a 3.3% speedup (-Eonly) on Cocoa.h. - The code is a little gross. Many cleanups are possible and should be done. llvm-svn: 61360	2008-12-23 01:30:52 +00:00
Chris Lattner	9fa1552ca8	avoid using a typedef that isn't always included from headers. llvm-svn: 61269	2008-12-19 23:51:20 +00:00
Douglas Gregor	55ad91fecb	Ultrasimplistic sketch for the parsing of C++ template-ids. This won't become useful or correct until we (1) parse template arguments correctly, (2) have some way to turn template-ids into types, declarators, etc., and (3) have a real representation of templates. llvm-svn: 61208	2008-12-18 19:37:40 +00:00
Ted Kremenek	63ff81c4e1	Change PTHLexer::getSourceLocation() to not call GetToken() and instead just read the file offset in the token data buffer directly. llvm-svn: 61170	2008-12-17 23:36:32 +00:00
Ted Kremenek	8c4bb56219	PTHLexer::isNextPPTokenLParen() no longer calls GetToken() and just reads the token kind from the token data buffer. This results in a minor speedup and reduces the dependency on GetToken(). llvm-svn: 61168	2008-12-17 23:08:31 +00:00
Ted Kremenek	6c7ea11300	Preprocessor: Allocate MacroInfo objects using a BumpPtrAllocator instead using new/delete. This speeds up -Eonly on Cocoa.h using the regular lexer by 1.8% and the PTHLexer by 3%. llvm-svn: 61042	2008-12-15 19:56:42 +00:00
Ted Kremenek	56572ab9e9	Added PTH optimization to not process entire blocks of tokens that appear in skipped preprocessor blocks. This improves PTH speed by 6%. The code for this optimization itself is not very optimized, and will get cleaned up. llvm-svn: 60956	2008-12-12 18:34:08 +00:00
Ted Kremenek	ca153f7349	PTHLexer: Keep track of the location of the last '#' token and provide the means to jump ahead in the token stream. llvm-svn: 60905	2008-12-11 22:41:47 +00:00
Ted Kremenek	67ab296d5c	Remove unused ivar CurTokenIdx. llvm-svn: 60896	2008-12-11 20:39:48 +00:00
Ted Kremenek	8e1f05fc26	PreprocessorLexer (and subclasses): - Added virtual method 'getSourceLocation()' (no arguments) that gets the location of the next "observable" location (e.g., next character, next token). PPLexerChange.cpp: - Implemented FIXME by using PreprocessorLexer::getSourceLocation() to get the location in the file we are returning to after lexing a #included file. This appears to be slightly faster than having the branch (i.e., 'if(CurLexer)'). It's also not a really hot part of the Preprocessor. llvm-svn: 60860	2008-12-10 23:20:59 +00:00
Ted Kremenek	d40ab7b72a	Declare PerIDCache as IdentifierInfo** instead of void*. This is just cleaner. No performance change. llvm-svn: 60843	2008-12-10 19:40:23 +00:00
Ted Kremenek	73a4d28758	PTH: Use an array instead of a DenseMap to cache persistent IDs -> IdentifierInfo*. This leads to a 4% speedup at -fsyntax-only using PTH. llvm-svn: 60452	2008-12-03 01:16:39 +00:00
Ted Kremenek	33eeabda61	- Remove PTHManager.cpp. Move all of its functions to PTHLexer.cpp since some of the internal methods are used by PTHLexer (their implementations are intertwined.) This enables some important inlining opportunities at -O3. - Don't construct an std::vector<Token> prior to feeding PTH tokens to the Preprocessor. Stream them off the PTH file directly. llvm-svn: 60447	2008-12-03 00:38:03 +00:00
Ted Kremenek	af058b5696	Preprocessor: - Added method "setPTHManager" that will be called by the driver to install a PTHManager for the Preprocessor. - Fixed some comments. - Added EnterSourceFileWithPTH to mirror EnterSourceFileWithLexer. llvm-svn: 60437	2008-12-02 19:46:31 +00:00
Ted Kremenek	498b6210fc	Added PTHManager, a utility class that will be used by Preprocessor to lazily create PTHLexer objects for pre-tokenized files. llvm-svn: 60436	2008-12-02 19:45:05 +00:00
Ted Kremenek	1f50dc899f	PTHLexer now owns the Token vector. llvm-svn: 60136	2008-11-27 00:38:24 +00:00
Ted Kremenek	e6847594ef	Add setter method PreprocessorLexer::setParsingPreprocessorDirective(). This will be used by the mechanism to generate cached tokens. llvm-svn: 60070	2008-11-26 00:57:02 +00:00
Chris Lattner	59acca5874	remove the NumericLiteralParser::Diag helper method, inlining it into its call sites. This makes it more explicit when the hasError flag is getting set and removes a confusing difference in behavior between PP.Diag and Diag in this code. llvm-svn: 59863	2008-11-22 07:23:31 +00:00
Chris Lattner	1ef2028205	Move the Preprocessor::Diag methods inline. This has the interesting (and carefully calculated) effect of allowing the compiler to reason about the aliasing properties of DiagnosticBuilder object better, allowing the whole thing to be promoted to registers instead of resulting in a ton of stack traffic. While I'm not very concerned about the performance of the Diag() method invocations, I am more concerned about their code size and impact on the non-diagnostic code. This patch shrinks the clang executable (in release-asserts mode with gcc-4.2) from 14523980 to 14519816 bytes. This isn't much, but it shrinks the lexer from 38192 to 37776, PPDirectives.o from 31116 to 28868 bytes, etc. llvm-svn: 59862	2008-11-22 07:03:46 +00:00
Chris Lattner	fdabe83c67	inline a method into its only two call sites. llvm-svn: 59860	2008-11-22 06:42:31 +00:00
Chris Lattner	014156e108	actually, this version isn't really needed. llvm-svn: 59859	2008-11-22 06:22:39 +00:00
Chris Lattner	57dab26be1	remove a sneaky version of Diag hiding in PreprocessorLexer. llvm-svn: 59858	2008-11-22 06:20:42 +00:00
Chris Lattner	6d27a16b95	Change the Lexer::Diag method to not magically silence warnings, force the caller to check instead. This eliminates the need (and the risk!) of weird null DiagnosticBuilder's floating around. llvm-svn: 59856	2008-11-22 02:02:22 +00:00
Chris Lattner	427c9c1763	Split the DiagnosticInfo class into two disjoint classes: one for building up the diagnostic that is in flight (DiagnosticBuilder) and one for pulling structured information out of the diagnostic when formatting and presenting it. There is no functionality change with this patch. llvm-svn: 59849	2008-11-22 00:59:29 +00:00
Ted Kremenek	a3148f16e4	Fix predicate: we're not in caching mode if CurPPLexer == 0, not CurLexer == 0. llvm-svn: 59848	2008-11-22 00:41:34 +00:00
Ted Kremenek	473d76e81d	Add comment to IsFileLexer, clean up indentation, and tighten how it's written. llvm-svn: 59773	2008-11-21 01:07:52 +00:00
Ted Kremenek	53ab374d9f	PTHLexer: - Move out logic for handling the end-of-file to LexEndOfFile (to match the Lexer) class. The logic now mirrors the Lexer class more, which allows us to pass most of the Preprocessor test cases. llvm-svn: 59768	2008-11-21 00:58:35 +00:00
Ted Kremenek	111caaac58	PTHLexer: - Move PTHLexer::GetToken() to be inside PTHLexer.cpp. - When lexing in raw mode, null out identifiers. llvm-svn: 59744	2008-11-20 19:49:00 +00:00
Ted Kremenek	94981e1f23	PTHLexer: - Rename 'CurToken' and 'LastToken' to 'CurTokenIdx' and 'LastTokenIdx' respectively. - Add helper methods GetToken(), AdvanceToken(), AtLastToken() to abstract away details of the token stream. This also allows us to easily replace their implementation later. llvm-svn: 59733	2008-11-20 16:32:22 +00:00
Ted Kremenek	6bc5f3ec90	Rename IsNonPragmaNonMacroLexer to IsFileLexer. llvm-svn: 59731	2008-11-20 16:19:53 +00:00
Daniel Dunbar	203bfdfcf3	De-unionize fields in Token class. - This is fairly gross but although the code is conceptually the same, introducting the union causes gcc 4.2 on x86 (darwin, if that matters) to pessimize LexTokenInternal which is critical to our preprocessor performance. This speeds up -Eonly lexing of Cocoa.h by ~4.7% in my timings and reduces the code size of LexTokenInternal by 8.6%. llvm-svn: 59725	2008-11-20 08:01:39 +00:00
Ted Kremenek	f4fd9b7ec4	Added virtual method 'IndirectLex' to PTHLexer. This will likely get removed in the future when we correctly handle #include processing. llvm-svn: 59722	2008-11-20 07:54:06 +00:00
Ted Kremenek	49e4579a86	Preprocessor::isCurrentLexer() now takes a PreprocessorLexer* argument to match against CurPPLexer instead of CurLexer. llvm-svn: 59721	2008-11-20 07:53:31 +00:00
Ted Kremenek	b33ce32bda	Preprocessor::getCurrentFileLexer() now returns a PreprocessorLexer* instead of a Lexer*. This means it will either return the current (normal) file Lexer or a PTHLexer. llvm-svn: 59694	2008-11-20 01:49:44 +00:00
Ted Kremenek	b0262c1e64	- Default initialize ParsingPreprocessorDirective, ParsingFilename, and LexingRawMode in the ctor of PreprocessorLexer. - PTHLexer: Use "LastToken" instead of "NumToken" llvm-svn: 59690	2008-11-20 01:29:45 +00:00
Ted Kremenek	5b75170014	Fix comment. llvm-svn: 59673	2008-11-19 22:55:55 +00:00
Ted Kremenek	11cfbb473e	Add stub for PTHLexer::isNextPPTokenLParen(). llvm-svn: 59670	2008-11-19 22:42:26 +00:00
Ted Kremenek	76c3441a4e	When using a PTHLexer, use DiscardToEndOfLine() instead of ReadToEndOfLine(). llvm-svn: 59668	2008-11-19 22:21:33 +00:00
Ted Kremenek	45245217bc	- Move static function IsNonPragmaNonMacroLexer into Preprocessor.h. - Add variants of IsNonPragmaNonMacroLexer to accept an IncludeMacroStack entry (simplifies some uses). - Use IsNonPragmaNonMacroLexer in Preprocessor::LookupFile. - Add 'FileID' to PreprocessorLexer, and have Preprocessor query this fileid when looking up the FileEntry for a file Performance testing of -Eonly on Cocoa.h shows no performance regression because of this patch. llvm-svn: 59666	2008-11-19 21:57:25 +00:00
Argyrios Kyrtzidis	89709ac2cc	Fix this: With this snippet: void f(a::b); An assert is hit: Assertion failed: CachedTokens[CachedLexPos-1].getLocation() == Tok.getAnnotationEndLoc() && "The annotation should be until the most recent cached token", file ..\..\lib\Lex\PPCaching.cpp, line 98 Introduce Preprocessor::RevertCachedTokens that reverts a specific number of tokens when backtracking is enabled. llvm-svn: 59636	2008-11-19 15:22:16 +00:00
Argyrios Kyrtzidis	f5e2812e69	Remove Preprocessor::CacheTokens boolean data member. The same functionality can be provided by using Preprocessor::isBacktrackEnabled(). llvm-svn: 59631	2008-11-19 14:23:14 +00:00
Ted Kremenek	a7c279ba40	Revert 59574 (caused tests to fail). llvm-svn: 59579	2008-11-19 01:54:47 +00:00
Ted Kremenek	1b167108bb	- Move static function IsNonPragmaNonMacroLexer into Preprocessor.h. - Add variants of IsNonPragmaNonMacroLexer to accept an IncludeMacroStack entry (simplifies some uses). - Use IsNonPragmaNonMacroLexer in Preprocessor::LookupFile. Performance testing of -Eonly on Cocoa.h shows no performance regression because of this patch. llvm-svn: 59574	2008-11-19 00:46:18 +00:00
Chris Lattner	e05c4dfc42	Remove the last of the old-style Preprocessor::Diag methods. llvm-svn: 59554	2008-11-18 21:48:13 +00:00
Chris Lattner	97b8e84bd7	remove one more Preprocessor::Diag method. llvm-svn: 59512	2008-11-18 08:02:48 +00:00
Chris Lattner	907dfe94e1	Convert the lexer and start converting the PP over to using canonical Diag methods. llvm-svn: 59511	2008-11-18 07:59:24 +00:00
Ted Kremenek	4c5888c67a	Preprocessor::PushIncludeMacroStack() should always zero out CurPPLexer. llvm-svn: 59490	2008-11-18 03:28:24 +00:00
Ted Kremenek	3d9740da29	- Add Lexer::isPragma() accessor for clients of Lexer that aren't friends. - Add static method to test if the current lexer is a non-macro/non-pragma lexer. - Refactor some code in PPLexerChange to use this static method. - No performance change. llvm-svn: 59486	2008-11-18 01:33:13 +00:00
Ted Kremenek	6b73291462	Add hooks to use PTHLexer::Lex instead of Lexer::Lex when CurLexer is null. Performance tests on Cocoa.h (using the regular Lexer) shows no performance difference. llvm-svn: 59479	2008-11-18 01:04:47 +00:00
Ted Kremenek	68ef9fc6ae	- Add 'CurPPLexer' to Preprocessor to keep track of the current PreprocessorLexer, which will either be a 'Lexer' or 'PTHLexer'. - Added stub field 'CurPTHLexer' to keep track of the current PTHLexer. - Modified IncludeStackInfo to track both the current PTHLexer and current PreprocessorLexer. llvm-svn: 59472	2008-11-18 00:12:49 +00:00
Douglas Gregor	77324f3854	Introduction the DeclarationName class, as a single, general method of representing the names of declarations in the C family of languages. DeclarationName is used in NamedDecl to store the name of the declaration (naturally), and ObjCMethodDecl is now a NamedDecl. llvm-svn: 59441	2008-11-17 14:58:09 +00:00
Ted Kremenek	a0d2a1661a	Using llvm::OwningPtr<> for CurLexer and CurTokenLexer. This makes both the ownership semantics of these objects explicit within the Preprocessor and also tightens up the code (explicit deletes not needed). llvm-svn: 59249	2008-11-13 17:11:24 +00:00
Ted Kremenek	66312a3ff4	Move some diagnostic handling to PreprocessorLexer. llvm-svn: 59191	2008-11-12 23:13:54 +00:00
Ted Kremenek	3060634fba	Add virtual dtor to PreprocessorLexer. llvm-svn: 59188	2008-11-12 22:48:57 +00:00
Ted Kremenek	9d39e377df	Add LexIncludeFilename. llvm-svn: 59187	2008-11-12 22:46:33 +00:00
Ted Kremenek	6c90efb923	Move LexIncludeFilename from Lexer to PreprocessorLexer. PreprocessorLexer now has a virtual method "IndirectLex" which allows it to call the lex method of its subclasses. This is not for performance intensive operations. llvm-svn: 59185	2008-11-12 22:43:05 +00:00
Ted Kremenek	50b84bb5f6	Unbreak last commit. llvm-svn: 59180	2008-11-12 22:19:18 +00:00
Ted Kremenek	f5518cc95b	Add Preprocessor::PushIncludeMacroStack() and Preprocessor::PopIncludeMacroStack(), two utility methods for manipulating the Preprocessor stack. These will be used to remove manually manipulation of IncludeMacroStack from the rest of the Preprocessor implementation. llvm-svn: 59179	2008-11-12 22:10:22 +00:00
Ted Kremenek	7cd62457c4	Add skeleton for PTH lexer. llvm-svn: 59169	2008-11-12 21:37:15 +00:00
Ted Kremenek	3a460b1106	Move pieces of Lexer that the Preprocessor mutates to a new base class 'PreprocessorLexer'. This will also be the base class of the new Preprocessed-Token-Header (PTH) lexer. No functionality change. llvm-svn: 59168	2008-11-12 21:33:59 +00:00
Argyrios Kyrtzidis	c7e67a04c3	Introduce annotation tokens, a special kind of token, created and used only by the parser to replace a group of tokens with a single token encoding semantic information. Will be fully utilized later for C++ nested-name-specifiers. llvm-svn: 58911	2008-11-08 16:17:04 +00:00
Chris Lattner	546200529b	clarify comment, rename argument to avoid a subtle conflict with an ivar that wasn't a bug but was confusing. llvm-svn: 58311	2008-10-28 01:02:17 +00:00
Chris Lattner	66a740e66e	Rename Characteristic_t to CharacteristicKind llvm-svn: 58224	2008-10-27 01:19:25 +00:00
Ted Kremenek	4e0f205145	Added method to access the raw flags of Token. llvm-svn: 57877	2008-10-21 03:32:15 +00:00
Chris Lattner	b11c3233d8	Change FormTokenWithChars to take the token kind to form, since all clients were setting a kind and then forming it. This is just a minor API cleanup, no functionality change. llvm-svn: 57404	2008-10-12 04:51:35 +00:00
Chris Lattner	4d96344c19	Add a new mode to the lexer which enables it to return all characters, even whitespace, as tokens from the file. This is enabled with L->SetKeepWhitespaceMode(true) on a raw lexer. In this mode, you too can use clang as a really complex version of 'cat' with code like this: Lexer RawLex(SourceLocation::getFileLoc(SM.getMainFileID(), 0), PP.getLangOptions(), File.first, File.second); RawLex.SetKeepWhitespaceMode(true); Token RawTok; RawLex.LexFromRawLexer(RawTok); while (RawTok.isNot(tok::eof)) { std::cout << PP.getSpelling(RawTok); RawLex.LexFromRawLexer(RawTok); } This will emit exactly the input file, with no canonicalization or other translation. Realistic clients actually do something with the tokens of course :) llvm-svn: 57401	2008-10-12 04:05:48 +00:00
Chris Lattner	8637abd333	add a new inKeepCommentMode() accessor to abstract the KeepCommentMode ivar. llvm-svn: 57397	2008-10-12 03:22:02 +00:00
Chris Lattner	7c2e9809b1	Simplify raw mode lexing by treating an unterminate /**/ comment the same we we do an unterminated string or character literal. This makes it so we can guarantee that the lexer never calls into the preprocessor (which would be suicide for a raw lexer). llvm-svn: 57395	2008-10-12 01:31:51 +00:00
Chris Lattner	50c9050037	Change how raw lexers are handled: instead of creating them and then using LexRawToken, create one and use LexFromRawLexer. This avoids twiddling the RawLexer flag around and simplifies some code (even speeding raw lexing up a tiny bit). This change also improves the token paster to use a Lexer on the stack instead of new/deleting it. llvm-svn: 57393	2008-10-12 01:15:46 +00:00
Daniel Dunbar	405965323d	Add Preprocessor::RemovePragmaHandler. - No functionality change. llvm-svn: 57065	2008-10-04 19:17:46 +00:00
Chris Lattner	b03dc76499	clean up a bunch of fixme's I added, by moving DirectoryLookup::DirType into SourceManager.h llvm-svn: 56692	2008-09-26 21:18:42 +00:00
Chris Lattner	c88a23e8d7	Fix the rest of rdar://6243860 hopefully. This requires changing FileIDInfo to whether the fileid is a 'extern c system header' in addition to whether it is a system header, most of this is spreading plumbing around. Once we have that, PPLexerChange bases its "file enter/exit" notifications to PPCallbacks to base the system header state on FileIDInfo instead of HeaderSearch. Finally, in Preprocessor::HandleIncludeDirective, mirror logic in GCC: the system headerness of a file being entered can be set due to the #includer or the #includee. llvm-svn: 56688	2008-09-26 20:12:23 +00:00
Daniel Dunbar	2c61b32beb	Comment fix. llvm-svn: 56634	2008-09-26 01:11:51 +00:00
Argyrios Kyrtzidis	75b34536d0	Rename Preprocessor::DisableBacktrack -> Preprocessor::CommitBacktrackedTokens. llvm-svn: 55281	2008-08-24 12:29:43 +00:00
Chris Lattner	909d0881ff	Comment tweak. llvm-svn: 55272	2008-08-24 00:12:08 +00:00
Argyrios Kyrtzidis	a65490c5df	Allow nested backtracks. llvm-svn: 55204	2008-08-22 21:27:50 +00:00
Daniel Dunbar	12c9ddced1	Change Parser & Sema to use interned "super" for comparions. - Added as private members for each because it is not clear where to put the common definition. Perhaps the IdentifierInfos all of these "pseudo-keywords" should be collected into one place (this would KnownFunctionIDs and Objective-C property IDs, for example). Remove Token::isNamedIdentifier. - There isn't a good reason to use strcmp when we have interned strings, and there isn't a good reason to encourage clients to do so. llvm-svn: 54794	2008-08-14 22:04:54 +00:00
Nico Weber	4c3116437c	* Remove isInSystemHeader() from DiagClient, move it to SourceManager * Move FormatError() from TextDiagnostic up to DiagClient, remove now empty class TextDiagnostic * Make DiagClient optional for Diagnostic This fixes the following problems: * -html-diags (and probably others) does now output the same set of warnings as console clang does * nothing crashes if one forgets to call setHeaderSearch() on TextDiagnostic * some code duplication is removed llvm-svn: 54620	2008-08-10 19:59:06 +00:00
Argyrios Kyrtzidis	b3dd1e0889	Allow the preprocessor to cache the lexed tokens, so that we can do efficient lookahead and backtracking. 1) New public methods added: -EnableBacktrackAtThisPos -DisableBacktrack -Backtrack -isBacktrackEnabled 2) LookAhead() implementation is replaced with a more efficient one. 3) LookNext() is removed. llvm-svn: 54611	2008-08-10 13:15:22 +00:00
Chris Lattner	a580a92508	Add an accessor, patch by Csaba Hruska. llvm-svn: 53391	2008-07-10 05:26:30 +00:00
Argyrios Kyrtzidis	80b77ac394	Add Preprocessor::LookNext method, which implements an efficient way to 'take a peek' at the next token without consuming it. llvm-svn: 53375	2008-07-09 22:46:46 +00:00
Chris Lattner	6016a515e5	refactor some code out into a new method. llvm-svn: 52889	2008-06-30 06:39:54 +00:00
Chris Lattner	3565c8e343	Neil pointed out that clang doesn't generate ranges from diagnostics related to pp-expressions. Doing so is pretty simple and this patch implements it, yielding nice diagnostics like: t.c:2:7: error: division by zero in preprocessor expression #if 1 / (0 + 0) ~ ^ ~~~~~~~ t.c:5:14: error: expected ')' in preprocessor expression #if (412 + 42 ~~~~~~~~^ t.c:5:5: error: to match this '(' #if (412 + 42 ^ t.c:10:10: warning: left side of operator converted from negative value to unsigned: -42 to 18446744073709551574 #if (-42 + 0U) / -2 ~~~ ^ ~~ t.c:10:16: warning: right side of operator converted from negative value to unsigned: -2 to 18446744073709551614 #if (-42 + 0U) / -2 ~~~~~~~~~~ ^ ~~ 5 diagnostics generated. llvm-svn: 50638	2008-05-05 06:45:50 +00:00
Chris Lattner	ba1f37bfdf	simplify ownership of the predefines buffer. llvm-svn: 49973	2008-04-19 23:09:31 +00:00
Ted Kremenek	f42f3fb47d	class Preprocessor: Now owns the "predefines" char; it deletes [] it in its dstor. clang.cpp: InitializePreprocessor now makes a copy of the contents of PredefinesBuffer and passes it to the preprocessor object. clang.cpp: DriverPreprocessorFactory now calls "InitializePreprocessor" instead of this being done in main(). html::HighlightMacros() now takes a PreprocessorFactory, allowing it to conjure up a new Preprocessor to highlight macros. class HTMLDiagnostics now takes a PreprocessorFactory that it can use for html::HighlightMacros(). Updated clients of HTMLDiagnostics to use this new interface. llvm-svn: 49875	2008-04-17 22:31:54 +00:00
Ted Kremenek	219bab3be9	Added "PreprocessorFactory", an interface for lazily creating Preprocessor objects on-demand. llvm-svn: 49868	2008-04-17 21:23:07 +00:00
Chris Lattner	e9786c3199	reenable highlighting of (the first line of) comments llvm-svn: 49816	2008-04-16 20:54:51 +00:00
Chris Lattner	221929310b	Make the preprocessor own its PPCallbacks, fixing a memory leak. Patch by Sam Bishop! llvm-svn: 48357	2008-03-14 06:07:05 +00:00
Chris Lattner	3e4683262e	implement simple support for arbitrary token lookahead. Change the objc @try parser to use it, fixing a FIXME. Update the objc-try-catch-1.m file to pass now that we get more reasonable errors. llvm-svn: 48129	2008-03-10 06:06:04 +00:00
Chris Lattner	1d4000ba50	rename HandleEndOfMacro -> HandleEndOfTokenLexer llvm-svn: 48076	2008-03-09 03:04:16 +00:00
Chris Lattner	7ff66fb91e	split the MacroArgs class out of TokenLexer.cpp/h into MacroArgs.cpp/h llvm-svn: 48075	2008-03-09 02:55:12 +00:00
Chris Lattner	285c0c1150	rename some MacroExpander-related ivars to TokenLexer. llvm-svn: 48073	2008-03-09 02:26:03 +00:00
Chris Lattner	5bb36002be	Rename MacroExpander.cpp/h -> TokenLexer.cpp/h llvm-svn: 48072	2008-03-09 02:22:57 +00:00
Chris Lattner	95d72cdf0f	rename the MacroExpander class to TokenLexer. It handles both token streams and macro lexing, so a more generic name is useful. llvm-svn: 48071	2008-03-09 02:18:51 +00:00
Chris Lattner	d7daed1478	rename MacroTokens -> Tokens. When this is a token stream, there is no macro involved. llvm-svn: 48070	2008-03-09 02:07:49 +00:00
Chris Lattner	855d024a83	Remove the first layer of support for "portability" warnings. This is theoretically useful, but not useful in practice. It adds a bunch of complexity, and not much value. It's best to nuke it. One big advantage is that it means the target interfaces will soon lose their SLoc arguments and target queries can never emit diagnostics anymore (yay). Removing this also simplifies some of the core preprocessor which should make it slightly faster. Ted, I didn't simplify TripleProcessor, which can now have at most one triple, and can probably just be removed. Please poke at it when you have time. llvm-svn: 47930	2008-03-05 01:18:20 +00:00
Lauro Ramos Venancio	8983891531	Fix PR2086. llvm-svn: 47551	2008-02-25 19:03:15 +00:00
Ted Kremenek	72be068ab3	Two more Windows-related fixes: - More enum signeness bitfield fixes (MSVC treats enums as signed). - Fixed in Lex/HeaderSearch.cpp an unsafe copy between two HeaderSearch::PerFileInfo entries in a common vector. The copy involved two calls to getFileInfo() within the assignment; these calls could have side-effects that enlarged the internal vector, and with MSVC this would invalidate one of the values in the assignment. Patch by Argiris Kirtzidis! llvm-svn: 47536	2008-02-24 03:55:14 +00:00
Ted Kremenek	f948d2a75f	Change encoding of TokenKind in IdentifierTable to be of type "unsigned" instead of TokenKind because of signedness issues with MSVC and enums. Patch from Argiris Kirtzidis. llvm-svn: 47515	2008-02-23 01:05:54 +00:00
Chris Lattner	3b5054dda0	Implement support for the extremely atrocious MS /##/ extension, which pastes together a comment. This is only enabled with -fms-extensions of course. llvm-svn: 46845	2008-02-07 06:03:59 +00:00
Steve Naroff	72df405dc5	Cleanup previous patch (based on feedback from Ted). Since this behavior is useful for most classes, we might consider adding a simple 3 method class that implements the behavior. Ted said that Boost has such a class. llvm-svn: 46654	2008-02-02 00:10:46 +00:00
Steve Naroff	f7fe5b372f	Make sure SourceManager/HeaderSearch don't support default copy constructors (since they result in bad runtime behavior). I'm sure there are other classes that might need this "guard", however I was bitten by these 2 recently (so I thought I'd fix them). llvm-svn: 46653	2008-02-01 23:31:13 +00:00
Chris Lattner	f6df8e9702	fix comment typo llvm-svn: 46505	2008-01-29 07:59:54 +00:00
Chris Lattner	e3e358c317	readability improvement suggested by Sam Bishop, thanks! llvm-svn: 45735	2008-01-08 04:13:21 +00:00
Chris Lattner	a30be59fa2	Fix a nasty corner case that Neil noticed in PR1900, where we would incorrectly apply the multiple include optimization to files with guards like: #if !defined(x) MACRO where MACRO could expand to different things in different contexts. Thanks Neil! llvm-svn: 45716	2008-01-07 19:50:27 +00:00
Chris Lattner	5b12ab8c93	Don't attribute in file headers anymore. See llvmdev for the discussion of this change. llvm-svn: 45410	2007-12-29 19:59:25 +00:00
Seo Sanghyeon	58bdfda87c	Add newline llvm-svn: 45257	2007-12-20 07:22:20 +00:00
Ted Kremenek	230bd918b2	Interned MainFileID within SourceManager. Since SourceManager is referenced by both Preprocessor and ASTContext, we no longer need to explicitly pass MainFileID around in function calls that also pass either Preprocessor or ASTContext. This resulted in some nice cleanups in the ASTConsumers and the driver. llvm-svn: 45228	2007-12-19 22:51:13 +00:00
Ted Kremenek	f7bfae6b45	Typo fix. llvm-svn: 45227	2007-12-19 22:32:34 +00:00
Chris Lattner	c238331377	Add support for #pragma mark, which shouldn't warn about bogus tokens. llvm-svn: 45212	2007-12-19 19:38:36 +00:00
Chris Lattner	9f9a619a9f	implement enough helper functions to successfully dump out the contents of the header map. Look ma, no assumptions about input data here (aka, corrupt header maps can't crash the compiler - crazy thought). llvm-svn: 45122	2007-12-17 21:06:11 +00:00
Chris Lattner	79764a6bee	Finish hooking up the scaffolding for headermaps. They can now do everything except resolve lookups. llvm-svn: 45111	2007-12-17 18:44:09 +00:00
Chris Lattner	4ffe46cbdf	Start reading the headermap header, drop the 'errorstr' argument to the create method. llvm-svn: 45109	2007-12-17 18:34:53 +00:00
Chris Lattner	8d720d083a	Sink getName into DirectoryLookup to simplify the client in clang. llvm-svn: 45106	2007-12-17 17:57:27 +00:00
Chris Lattner	1587e6db01	add headermap.cpp llvm-svn: 45095	2007-12-17 08:22:46 +00:00
Chris Lattner	44bd21b7c1	finish stubbing out support for HeaderMap. Now we just need an implementation! llvm-svn: 45094	2007-12-17 08:17:39 +00:00
Chris Lattner	712e3873a0	refactor an better comment framework lookup code. This moves it from HeaderSearch into DirectoryLookup, as a particular framework lookup is specific to the directory we are currently querying. llvm-svn: 45093	2007-12-17 08:13:48 +00:00
Chris Lattner	f62f75895f	as it turns out, frameworks and headermaps are orthogonal. Make this so in the internal representation. This also fixes a bug where -I foo -F foo would not search foo as both a normal and framework include dir. llvm-svn: 45092	2007-12-17 07:52:39 +00:00
Chris Lattner	3e206b3a0e	teach RemoveDuplicates about header maps. llvm-svn: 45090	2007-12-17 06:44:29 +00:00
Chris Lattner	c4ba38ed1e	Step #1 in adding headermap support to clang. llvm-svn: 45089	2007-12-17 06:36:45 +00:00
Chris Lattner	67671ed4b7	add a helper method. llvm-svn: 44976	2007-12-13 01:59:49 +00:00
Ted Kremenek	f82942d04c	constified getFullLoc(). llvm-svn: 44951	2007-12-12 18:55:29 +00:00
Ted Kremenek	50d23f007f	Renamed getFullSourceLoc() -> getFullLoc(). llvm-svn: 44949	2007-12-12 18:46:37 +00:00
Ted Kremenek	d8bcfe27cc	Added method: Preprocessor::getFullSourceLoc. Used by clients of Preprocessor to get a FullSourceLoc from a SourceLocation. llvm-svn: 44948	2007-12-12 18:41:40 +00:00
Chris Lattner	615315f307	Add dumping support for locations, make -dumptokens print out the location info of each token. llvm-svn: 44741	2007-12-09 20:31:55 +00:00
Ted Kremenek	fbb08bc2e2	Added optional pass-by-reference argument "isExact" to NumericLiteralParser::GetFloatValue(). Upon method return, this flag has the value true if the returned APFloat can exactly represent the number in the parsed text, and false otherwise. Modified the implementation of GetFloatValue() to parse literals using APFloat's convertFromString method (which allows us to set the value of isExact). llvm-svn: 44339	2007-11-26 23:12:30 +00:00
Chris Lattner	4dcbc2090a	add header file I forgot to check in llvm-svn: 44179	2007-11-15 19:22:18 +00:00
Chris Lattner	fd64ebd3e4	Fix assertion for raw lexer. llvm-svn: 43091	2007-10-17 21:22:38 +00:00
Chris Lattner	8e129c23c8	Move token length calculation out of the diagnostics machinery into the lexer, where it can be shared. llvm-svn: 43090	2007-10-17 21:18:47 +00:00
Chris Lattner	02b436a05a	Add a new type of lexer: a raw lexer, which does not require a preprocessor object in order to do its thing. llvm-svn: 43084	2007-10-17 20:41:00 +00:00
Chris Lattner	caecf03215	add some comments. llvm-svn: 43079	2007-10-17 18:28:59 +00:00
Anders Carlsson	cbfc4b8824	Add support for Pascal strings. llvm-svn: 42974	2007-10-15 02:50:23 +00:00
Chris Lattner	1f1b0dbc28	Make a significant change to invert the control flow handling predefined macros. Previously, these were handled by the driver, now they are handled by the preprocessor. Some fallout of this: 1. Instead of preprocessing two buffers (the predefines, then the main source file) we now start preprocessing the main source file and inject the predefines as a "psuedo #include" from the main source file. 2. #1 allows us to nuke the Lexer::IsMainFile flag and simplify Preprocessor::isInPrimaryFile. 3. The driver doesn't have to know about standard #defines, the preprocessor knows, which is nice for people wanting to define their own drivers. 4. This allows us to put normal tokens in the predefine buffer, for example a definition for __builtin_va_list that is target-specific, and a typedef for id in objc. llvm-svn: 42818	2007-10-09 22:10:18 +00:00
Chris Lattner	0ab032aa8f	Add two new Token helper functions, "is" and "isNot". This allows us to write stuff like this: // If we don't have a comma, it is either the end of the list (a ';') or // an error, bail out. if (Tok.isNot(tok::comma)) break; instead of: // If we don't have a comma, it is either the end of the list (a ';') or // an error, bail out. if (Tok.getKind() != tok::comma) break; There is obviously no functionality change, but the code reads a bit better and is more terse. llvm-svn: 42795	2007-10-09 17:23:58 +00:00
Chris Lattner	ef6b136781	move IdentifierTable.h from liblex to libbasic. llvm-svn: 42730	2007-10-07 08:58:51 +00:00
Chris Lattner	c43ddc84a3	improve layering: Now instead of IdentifierInfo knowing anything about MacroInfo, only the preprocessor knows. This makes MacroInfo truly private to the Lex library (and its direct clients) instead of being accessed in the Basic library. llvm-svn: 42727	2007-10-07 08:44:20 +00:00
Chris Lattner	d7b971bf3d	add a hasMacroDefinition() method to IdentifierInfo, strength reduce a call to getMacroInfo to call it. llvm-svn: 42725	2007-10-07 07:57:27 +00:00
Chris Lattner	f49523d6ea	update comment. llvm-svn: 42724	2007-10-07 07:54:23 +00:00
Chris Lattner	ff067ce555	Remove the PPID bitfield from IdentifierInfo, shrinking it by a word (because all bitfields now fit in 32 bits). This shrinks the identifier table for carbon.h from 1634428 to 1451424 bytes (12%) and has no impact on compile time. llvm-svn: 42723	2007-10-07 07:52:34 +00:00
Chris Lattner	a441ca651f	First step to fixing a long lived layering violation: this moves the MacroInfo pointer to a side hash table (which currently lives in IdentifierTable.cpp). This removes a pointer from Identifier info, but doesn't shrink it, as it requires a new bit be added. This strange approach with the 'hasmacro' bit is needed to not lose preprocessor performance. llvm-svn: 42722	2007-10-07 07:09:52 +00:00
Chris Lattner	730160d32f	Shrink the builtinID down by 3 bits, allowing all the bitfields to fit in 32-bits, shrinking IdentifierInfo by a word. This shrinks the total size of the identifier pool from 1817264 to 1634428 bytes (11%) on carbon.h. llvm-svn: 42719	2007-10-07 06:29:32 +00:00
Chris Lattner	5700fab189	simplify the interfaces to create selectors: getSelector can take any number of arguments now and does the right thing, but the nullary/unary accessors are preserved as convenience functions. This allows us to slightly simplify clients. llvm-svn: 42716	2007-10-07 02:00:24 +00:00
Chris Lattner	f7f34d09e4	simplify some Selector interfaces. llvm-svn: 42715	2007-10-07 01:33:16 +00:00
Chris Lattner	dadc762ac5	Implement DenseMapInfo for Selector, allowing use of DenseMap/DenseSet of Selector's instead of requiring void* to be used. I converted one use of DenseSet<void*> over to use DenseSet<Selector> but the others should change as well. llvm-svn: 42645	2007-10-05 20:15:24 +00:00
Steve Naroff	e61bfa8bb4	Layering refinements for selectors (suggested by Chris). Specifics... - Add SelectorTable, which enables us to remove MultiKeywordSelector from the public header. - Remove FoldingSet from IdentifierInfo.h and Preprocessor.h. - Remove Parser::ObjcGetUnarySelector and Parser::ObjcGetKeywordSelector, they are subsumed by SelectorTable. - Add MultiKeywordSelector to IdentifierInfo.cpp. - Move a bunch of selector related methods from ParseObjC.cpp to IdentifierInfo.cpp. - Added some comments. llvm-svn: 42643	2007-10-05 18:42:47 +00:00
Chris Lattner	01d7f489a9	minor cleanups, make code more defensive, less branchy in Selector ctor. llvm-svn: 42603	2007-10-04 05:21:22 +00:00
Chris Lattner	0f26288998	fix an incorrect assertion llvm-svn: 42602	2007-10-04 05:16:42 +00:00
Chris Lattner	2b6abdce50	Add a new getLength() method to IdentifierInfo, which relies on a newly added method to StringMapEntry. Steve, please use this to remove the strlen calls in selector processing. llvm-svn: 42481	2007-09-30 08:32:27 +00:00
Steve Naroff	92866f4fbb	Add some comments to MultiKeywordSelector, make all methods private, add a friend, move some methods around. llvm-svn: 42456	2007-09-28 23:39:26 +00:00
Steve Naroff	8017506d9c	Yesterday I discovered that 78% of all selectors in "Cocoa.h" take 0/1 argument. This motivated implementing a devious clattner inspired solution:-) This approach uses a small value "Selector" class to point to an IdentifierInfo for the 0/1 case. For multi-keyword selectors, we instantiate a MultiKeywordSelector object (previously known as SelectorInfo). Now, the incremental cost for selectors is only 24,800 for Cocoa.h! This saves 156,592 bytes, or 86%!! The size reduction is also the result of getting rid of the AST slot, which was not strictly necessary (we will associate a selector with it's method using another table...most likely in Sema). This change was critical to make now, before we have too many clients. I still need to add some comments to the Selector class...will likely add later today/tomorrow. llvm-svn: 42452	2007-09-28 22:22:11 +00:00
Steve Naroff	65ca537b55	Fix bug in SelectorInfo::getName() - method buffer needs to be passed by reference. llvm-svn: 42411	2007-09-27 18:52:21 +00:00
Steve Naroff	f73590dbb1	Add SelectorInfo (similar in spirit to IdentifierInfo). The key difference is SelectorInfo is not string-oriented, it is a unique aggregate of IdentifierInfo's (using a folding set). SelectorInfo also has a richer API that simplifies the parser/action interface. 3 noteworthy benefits: #1: It is cleaner. I never "liked" storing keyword selectors (i.e. foo:bar:baz) in the IdentifierTable. #2: It is more space efficient. Since Cocoa keyword selectors can be quite long, this technique is space saving. For Cocoa.h, pulling the keyword selectors out saves ~180k. The cost of the SelectorInfo data is ~100k. Saves ~80k, or 43%. #3: It results in many API simplifications. Here are some highlights: - Removed 3 actions (ActOnKeywordMessage, ActOnUnaryMessage, & one flavor of ObjcBuildMethodDeclaration that was specific to unary messages). - Removed 3 funky structs from DeclSpec.h (ObjcKeywordMessage, ObjcKeywordDecl, and ObjcKeywordInfo). - Removed 2 ivars and 2 constructors from ObjCMessageExpr (fyi, this space savings has not been measured). I am happy with the way it turned out (though it took a bit more hacking than I expected). Given the central role of selectors in ObjC, making sure this is "right" will pay dividends later. Thanks to Chris for talking this through with me and suggesting this approach. llvm-svn: 42395	2007-09-27 14:38:14 +00:00
Chris Lattner	ec0a6d9be5	Use APFloat for the representation of FP immediates, ask the target for which apfloat to use for a particular type. llvm-svn: 42234	2007-09-22 18:29:59 +00:00
Steve Naroff	2cd263ff71	Remove SelectorTable/SelectorInfo, simply store all selectors in the central IdentifierTable. Rationale: We currently have a separate table to unique ObjC selectors. Since I don't need all the instance data in IdentifierInfo, I thought this would save space (and make more sense conceptually). It turns out the cost of having duplicate entries for unary selectors (i.e. names without colons) outweighs the cost difference between the IdentifierInfo & SelectorInfo structures. Here is the data: Two tables: * Selector/Identifier Stats: # Selectors/Identifiers: 51635 Bytes allocated: 1999824 One table: * Identifier Table Stats: # Identifiers: 49500 Bytes allocated: 1990316 llvm-svn: 42139	2007-09-19 16:18:46 +00:00
Steve Naroff	73d534a2e0	Add support for ObjC keyword selectors. - Add SelectorInfo/SelectorTable classes, modeled after IdentifierInfo/IdentifierTable. - Add SelectorTable instance to ASTContext, created lazily through ASTContext::getSelectorInfo(). - Add SelectorInfo slot to ObjcMethodDecl. - Add helper function to derive a SelectorInfo from ObjcKeywordInfo. Misc: Got the Decl stats stuff up and running again...it was missing support for ObjC AST's. llvm-svn: 42023	2007-09-17 14:16:13 +00:00
Chris Lattner	f8a3dadd40	Don't rely on ADL to find this member, patch by Justin Handville llvm-svn: 41783	2007-09-08 01:29:20 +00:00
Chris Lattner	78b6221ff9	Eliminate some VC++ warnings, patch by Hartmut Kaiser! llvm-svn: 41685	2007-09-03 18:28:41 +00:00
Chris Lattner	ed045421a8	1.0 is double, 1.0F is a float. llvm-svn: 41412	2007-08-26 03:29:23 +00:00
Chris Lattner	f55ab18663	1) refactor some code. 2) Add support for lexing imaginary constants (a GCC extension): t.c:5:10: warning: imaginary constants are an extension A = 1.0iF; ^ 3) Make the 'invalid suffix' diagnostic pointer more accurate: t.c:6:10: error: invalid suffix 'qF' on floating constant A = 1.0qF; ^ instead of: t.c:6:10: error: invalid suffix 'qF' on floating constant A = 1.0qF; ^ llvm-svn: 41411	2007-08-26 01:58:14 +00:00
Steve Naroff	7c34817902	Add helper functions Token::isObjCAtKeyword() and Token::getObjCKeywordID(). Convert all clients to the new cleaner, more robust API. llvm-svn: 41330	2007-08-23 18:16:40 +00:00
Chris Lattner	4c4a245475	Use a smallstring instead of an std::string in FileChanged to avoid some malloc traffic. This speeds up -E on xalancbmk by 2.4% llvm-svn: 40461	2007-07-24 06:57:14 +00:00
Chris Lattner	93ab9f134e	refactor the interface to Preprocessor::GetIncludeFilenameSpelling, no functionality changes. llvm-svn: 40414	2007-07-23 04:15:27 +00:00
Chris Lattner	5d1c02748f	Change hte lexer to start a start pointer to the underlying memorybuffer instead of a pointer to the memorybuffer itself. This reduces coupling and eliminates a pointer dereference on a hot path. This speeds up -Eonly on 483.xalancbmk by 2.1% llvm-svn: 40394	2007-07-22 18:44:36 +00:00
Chris Lattner	d427542a9b	Implement a simple cache in headersearch. This speeds up preprocessing 483.xalancbmk by about 10%, reducing the number of file lookup queries from 2139411 to 199466 (over 10x) llvm-svn: 40390	2007-07-22 07:28:00 +00:00
Chris Lattner	9c724c48ea	Fix a really subtle bug in the macro expander caching code, where redefinition of a macro could cause invalid memory to be deleted. Found preprocessing 253.perlbmk. llvm-svn: 40380	2007-07-22 01:16:55 +00:00
Chris Lattner	146762e7a4	At one point there were going to be lexer and parser tokens. Since that point is now long gone, we should rename LexerToken to Token, as it is the only kind of token we have. llvm-svn: 40105	2007-07-20 16:59:19 +00:00
Chris Lattner	77e9de50a1	simplify the lexer ctor to take a SLoc instead of a sloc and a redundant buffer*. llvm-svn: 40104	2007-07-20 16:52:03 +00:00
Chris Lattner	dc5c055fd1	Reimplement SourceLocation. Instead of having a fileid/offset pair, it now contains a bit discriminating between mapped locations and file locations. This separates the tables for macros and files in SourceManager, and allows better separation of concepts in the rest of the compiler. This allows us to have many macro instantiations before running out of 'addressing space'. This is also more efficient, because testing whether something is a macro expansion is now a bit test instead of a table lookup (which also used to require having a srcmgr around, now it doesn't). This is fully functional, but there are several refinements and optimizations left. llvm-svn: 40103	2007-07-20 16:37:10 +00:00
Chris Lattner	8a7003cd9f	Add a new Preprocessor::AdvanceToTokenCharacter method which, given a sloc specifying the start of a token and a logical (phase 3) character number, returns a sloc representing the input character corresponding to it. llvm-svn: 39905	2007-07-16 06:48:38 +00:00
Chris Lattner	b0f5d55912	factor a common predicate into a static method. llvm-svn: 39903	2007-07-16 06:16:59 +00:00
Chris Lattner	c02c4abe5b	Cache macro expander objects to avoid thrashing malloc in heavy expansion situations. This doesn't significantly improve carbon.h, but it does speed up INPUTS/macro_pounder_obj.c by 48% llvm-svn: 39864	2007-07-15 00:25:26 +00:00
Chris Lattner	564f478595	switch function-like macros from using a vector for their arguments to an explicitly new'd array. The array never mutates once created, so a vector is overkill. llvm-svn: 39862	2007-07-14 22:46:43 +00:00
Chris Lattner	819f2ef77b	switch from using a vector to a smallvector for macro replacement tokens This speeds up parsing carbon.h by 3.3% by avoiding some malloc traffic for small macros. llvm-svn: 39861	2007-07-14 22:15:50 +00:00
Chris Lattner	f40fe99118	expose an iterator interface to getReplacementTokens instead of the datastructure itself. llvm-svn: 39860	2007-07-14 22:11:41 +00:00
Chris Lattner	bd4de5df77	Fix "no newline at end of file" warnings. Patch contributed by Benoit Boissinot! llvm-svn: 39780	2007-07-12 15:43:07 +00:00
Chris Lattner	20c4ff2845	Improve portability to compilers where <cassert> is not implicitly included. Patch contributed by Benoit Boissinot! llvm-svn: 39779	2007-07-12 15:32:57 +00:00
Steve Naroff	97b9e91eb7	Bug #: Submitted by: Reviewed by: Added primitive support for 32-bit floating point literals. llvm-svn: 39719	2007-07-09 23:53:58 +00:00
Chris Lattner	23b7eb677d	Finally bite the bullet and make the major change: split the clang namespace out of the llvm namespace. This makes the clang namespace be a sibling of llvm instead of being a child. The good thing about this is that it makes many things unambiguous. The bad things is that many things in the llvm namespace (notably data structures like smallvector) now require an llvm:: qualifier. IMO, libsystem and libsupport should be split out of llvm into their own namespace in the future, which will fix this issue. llvm-svn: 39659	2007-06-15 23:05:46 +00:00
Chris Lattner	de12ae2fd7	Add support for binary literals: http://gcc.gnu.org/onlinedocs/gcc/Binary-constants.html#Binary-constants llvm-svn: 39621	2007-06-08 22:42:30 +00:00
Steve Naroff	98d153c730	Bug #: Submitted by: Reviewed by: Fixed a bug in the parser's handling of attributes on pointer declarators. For example, the following code was producing a syntax error... int __attribute(()) foo; attrib.c:10:25: error: expected identifier or '(' int __attribute(()) foo; ^ Changed Parser::ParseTypeQualifierListOpt to not consume the token following an attribute declaration. Also added LexerToken::getName() convenience method...useful when tracking down errors like this. llvm-svn: 39602	2007-06-06 23:19:11 +00:00
Bill Wendling	f06487034e	Bug #: Submitted by: Bill Wendling Reviewed by: Chris Lattner - Comment fix. llvm-svn: 39482	2007-05-23 08:03:10 +00:00
Chris Lattner	67ca9252f8	Implement Sema::ParseNumericConstant for integer constants in terms of APInt and correctly in terms of C99 6.4.4.1p5. llvm-svn: 39473	2007-05-21 01:08:44 +00:00
Chris Lattner	36982e4367	Add support for inserting up to 10 strings in a diagnostic, with %0, %1, %2, etc. llvm-svn: 39447	2007-05-16 17:49:37 +00:00
Chris Lattner	739e739b81	Remove the clang::SourceBuffer class, switch to the llvm::MemoryBuffer class. llvm-svn: 39426	2007-04-29 07:12:06 +00:00
Chris Lattner	2f5add6272	Implement support for performing semantic analysis of character literals. llvm-svn: 39390	2007-04-05 06:57:15 +00:00
Chris Lattner	871b4e101c	Minor enhancements to GetIntegerValue(APInt): * Detect overflow correctly. When it occurs, return the truncated value. * Add fixme for radix analysis. llvm-svn: 39382	2007-04-04 06:36:34 +00:00
Chris Lattner	5b743d3801	Add some really simplistic code for turning a ppnumber into an APInt. Much improvement is needed! llvm-svn: 39381	2007-04-04 05:52:58 +00:00
Steve Naroff	4f88b3113e	Bug #: Submitted by: Reviewed by: Move string literal parsing from Sema=>LiteralSupport. This consolidates all the quirky parsing code within the Lexer subsystem (yeah!). This simplifies Sema and (more importantly) allows future parsers (i.e. subclasses of Action) to benefit from this code. llvm-svn: 39354	2007-03-13 22:37:02 +00:00
Steve Naroff	f2fb89e759	Bug #: Submitted by: Reviewed by: Misc. cleanup/polish of NumericLiteralParser and it's two clients, the C preprocessor and AST builder... llvm-svn: 39353	2007-03-13 20:29:44 +00:00
Steve Naroff	451d8f1626	Bug #: Submitted by: Reviewed by: -Converted the preprocessor to use NumericLiteralParser. -Several minor changes to LiteralSupport interface/implementation. -Added an error diagnostic for floating point usage in pp expr's. llvm-svn: 39352	2007-03-12 23:22:38 +00:00
Steve Naroff	09ef474197	Bug #: Submitted by: Reviewed by: Moved numeric literal support from SemaExpr.cpp to LiteralSupport.[h,cpp] in Lex. This will allow it to be used by both Sema and Preprocessor (and should be the last major refactoring of this sub-system).. Over time, it will be reused by anyone implementing an actions module (i.e. any subclass of llvm::clang::Action. Minor changes to IntegerLiteral in Expr.h. More to come... llvm-svn: 39351	2007-03-09 23:16:33 +00:00
Chris Lattner	b055f2ddfc	switch to using iterators instead of stringmap visitors. llvm-svn: 39336	2007-02-11 08:19:57 +00:00
Chris Lattner	54d032bb76	CStringMap -> StringMap. llvm-svn: 39334	2007-02-08 19:24:25 +00:00
Chris Lattner	34d1f5a8de	adjust to CStringMap interface change. llvm-svn: 39333	2007-02-08 19:08:49 +00:00
Chris Lattner	9561a0b3e7	Add support for target-independent builtin functions (like __builtin_abs), whose decl objects are lazily created the first time they are referenced. Builtin functions are described by the clang/AST/Builtins.def file, which makes it easy to add new ones. This is missing two important pieces: 1. Support for the rest of the gcc builtins. 2. Support for target-specific builtins (e.g. __builtin_ia32_emms). Just adding this builtins reduces the number of implicit function definitions by 6, reducing the # diagnostics from 550 to 544 when parsing carbon.h. I need to add all the i386-specific ones to eliminate several hundred more. ugh. llvm-svn: 39327	2007-01-28 08:20:04 +00:00
Chris Lattner	b6738ec361	make LookupScopedDecl a method instead of a static function llvm-svn: 39326	2007-01-28 00:38:24 +00:00
Chris Lattner	5b9f4891d7	Add support for C++ operator keywords. Patch by Bill Wendling. llvm-svn: 39214	2006-11-21 17:23:33 +00:00
Chris Lattner	00348acace	clear file info after processing one file, it shouldn't carry over to the next. llvm-svn: 39211	2006-11-21 06:34:57 +00:00
Chris Lattner	b352e3edb5	Change KeepComments/KeepMacroComments modes to be facets of the preprocessor state, not aspects of the language standard being parsed. llvm-svn: 39209	2006-11-21 06:17:10 +00:00
Chris Lattner	ad7cdd37b3	simplify the Preprocessor ctor. llvm-svn: 39208	2006-11-21 06:08:20 +00:00
Chris Lattner	a92809b1ab	add an accessor llvm-svn: 39206	2006-11-21 05:52:33 +00:00
Chris Lattner	b8d6d5a81d	Formalize preprocessor callbacks together into a PPCallbacks structure, instead of having a loose collection of function pointers. This also allows clients to maintain state, and reduces the size of the Preprocessor.h interface. llvm-svn: 39203	2006-11-21 04:09:30 +00:00
Chris Lattner	f89b50c38d	init std::string with it's default ctor instead of "". llvm-svn: 39141	2006-11-06 06:37:47 +00:00
Chris Lattner	c07ba1fe2f	Refactor the paths used for checking and getting the spelling of #include filenames (and also '#pragma GCC dependency' of course). Now, assuming no cleaning is needed, we can go all the way from lexing the filename to doing filename lookups with no mallocs. This speeds up user PP time from 0.077 to 0.075s for Cocoa.h (2.6%). llvm-svn: 39092	2006-10-30 05:58:32 +00:00
Chris Lattner	b8b94f1e9b	Make Preprocessor::LookupFile take a character range instead of a string. This avoids some copying in its clients. llvm-svn: 39091	2006-10-30 05:38:06 +00:00
Chris Lattner	7cdbad945d	Push strings out of the HeaderSearch interface, it now deals solely with character ranges. llvm-svn: 39090	2006-10-30 05:33:15 +00:00
Chris Lattner	ee7bf89cd6	Change framework cache map from map to CStringMap. This speeds up PP user time on Cocoa.h from 0.078 to 0.077s. llvm-svn: 39089	2006-10-30 05:19:23 +00:00
Chris Lattner	2b9e19be87	Pull the string hashtable out of the IdentifierTable, moving into LLVM's libsupport. Now it can be used for other things besides identifier hashing. llvm-svn: 39079	2006-10-29 23:43:13 +00:00
Chris Lattner	ec659fce46	move memory allocation abstraction stuff out into LLVM's libsupport llvm-svn: 39078	2006-10-29 22:09:44 +00:00
Chris Lattner	cf163aa407	no need for these classes to be so friendly anymore. llvm-svn: 39077	2006-10-29 21:37:52 +00:00
Chris Lattner	3bc804ed3d	genericize IdentifierInfo interface to make it work more naturally. llvm-svn: 39076	2006-10-28 23:46:24 +00:00
Chris Lattner	bcb416bbd5	Implement test/Preprocessor/comment_save_if.c llvm-svn: 39069	2006-10-27 05:43:50 +00:00
Chris Lattner	1eb290b2e9	remove namelen field, it is now dead llvm-svn: 39064	2006-10-27 05:07:16 +00:00
Chris Lattner	56bdb9a9a1	Remove identifier length field from IdentifierInfo, it is now dead. llvm-svn: 39063	2006-10-27 05:06:38 +00:00
Chris Lattner	f2e3ac3b54	reimplement identifier hash table in terms of a probed table instead of a chained table. This is about 25% faster for identifier lookup. This also implements resizing of the hash table. llvm-svn: 39058	2006-10-27 03:59:10 +00:00
Chris Lattner	893f272c39	Track the full (not mod the hash table size) hash value for each token. This lets us find interesting properties of the hash distribution. llvm-svn: 39056	2006-10-26 05:12:31 +00:00
Chris Lattner	5c3ac11bf5	Reduce amount #included llvm-svn: 39035	2006-10-22 07:29:01 +00:00
Chris Lattner	25246dfeb0	Split the DirectoryLookup class out to its own header. llvm-svn: 39033	2006-10-22 07:26:52 +00:00
Chris Lattner	5ed76da296	Implement framework filesystem caching. llvm-svn: 39031	2006-10-22 07:24:13 +00:00
Chris Lattner	641a0be31b	count # framework lookups llvm-svn: 39026	2006-10-20 06:23:14 +00:00

... 9 10 11 12 13 ...

1150 Commits