llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	9db60a38e9	Preprocessor doesn't require and IdentifierInfoLookup object. Patch by Axel Naumann! llvm-svn: 62854	2009-01-23 18:00:48 +00:00
Chris Lattner	29a2a191f2	Make SourceLocation::getFileLoc private to reduce the API exposure of SourceLocation. This requires making some cleanups to token pasting and _Pragma expansion. llvm-svn: 62490	2009-01-19 06:46:35 +00:00
Chris Lattner	144aacd19e	rearrange GetIdentifierInfo so that the fast path can be partially inlined into PTHLexer::Lex. This speeds up the user time of PTH -Eonly by another 2ms (4.4%) llvm-svn: 62454	2009-01-18 02:57:21 +00:00
Chris Lattner	428613f4fc	Avoid malloc thrashing on the std::vector for ConditionalStack. Because there is one of these per header, this almost always gets alloc/free'd for each #ifdef. llvm-svn: 62451	2009-01-18 02:52:26 +00:00
Chris Lattner	eb09754a9d	switch PTH lexer from using "const char"s to "const unsigned char"s internally. This is just a cleanup that reduces the need to cast to unsigned char before assembling a larger integer. llvm-svn: 62442	2009-01-18 01:57:14 +00:00
Chris Lattner	757169b60f	Change the Lexer ctor used to lex _Pragma directives into a static factory method. This lets us clean up the interface and make it more obvious that this method is really really _Pragma specific. Note that _Pragma handling uglifies the Lexer in the critical path. It would be very interesting to consider making _Pragma remapping be a new special lexer class of its own. llvm-svn: 62425	2009-01-17 08:27:52 +00:00
Chris Lattner	ab1d4b8abd	simplify PTHManager::CreateLexer llvm-svn: 62424	2009-01-17 08:06:50 +00:00
Chris Lattner	c809089b26	Change the Lexer ctor used in the non _Pragma case to take a FileID instead of a SourceLocation. This should speed it up and definitely simplifies it. llvm-svn: 62422	2009-01-17 08:03:42 +00:00
Chris Lattner	5965a28a4b	More simplifications to the lexer ctors. llvm-svn: 62419	2009-01-17 07:56:59 +00:00
Chris Lattner	fcf6452eb4	make the verbose raw-lexer ctor fully explicit instead of having embedded magic. llvm-svn: 62417	2009-01-17 07:42:27 +00:00
Chris Lattner	08354fef13	add a simplified lexer ctor that sets up the lexer to raw-lex an entire file. llvm-svn: 62414	2009-01-17 07:35:14 +00:00
Chris Lattner	f76b92092e	refactor some common initialization code out of the two lexer ctors. llvm-svn: 62411	2009-01-17 06:55:17 +00:00
Chris Lattner	3793bba26f	suck the call to "getSpellingLoc" that all clients do into the implementation of PTHManager::getSpelling. llvm-svn: 62408	2009-01-17 06:29:33 +00:00
Chris Lattner	d32480d3db	this massive patch introduces a simple new abstraction: it makes "FileID" a concept that is now enforced by the compiler's type checker instead of yet-another-random-unsigned floating around. This is an important distinction from the "FileID" currently tracked by SourceLocation. That FileID may refer to the start of a file or to a chunk within it. The new FileID only refers to the file (and its #include stack and eventually #line data), it cannot refer to a chunk. FileID is a completely opaque datatype to all clients, only SourceManager is allowed to poke and prod it. llvm-svn: 62407	2009-01-17 06:22:33 +00:00
Chris Lattner	262d4e31b9	Improve #pragma comment support by building the string argument and notifying PPCallbacks about it. llvm-svn: 62333	2009-01-16 18:59:23 +00:00
Chris Lattner	8a24e588d7	minor cleanups to StringLiteralParser: no need to pass target info into its ctor. Also, make it handle validity checking of pascal strings instead of making clients do it. llvm-svn: 62332	2009-01-16 18:51:42 +00:00
Chris Lattner	2ff698df60	Implement basic support for parsing #pragma comment, a microsoft extension documented here: http://msdn.microsoft.com/en-us/library/7f0aews7(VS.80).aspx This is according to my understanding reading the docs, I don't know if it really agrees fully with what VC++ allows. llvm-svn: 62317	2009-01-16 08:21:25 +00:00
Chris Lattner	c4c181902e	rename PP::getPhysicalCharacterAt -> PP::getSpelledCharacterAt. Slightly speed up sema of numbers like '1' by going directly to TargetInfo instead of through ASTContext. llvm-svn: 62314	2009-01-16 07:10:29 +00:00
Chris Lattner	53e384f633	Change some terminology in SourceLocation: instead of referring to the "physical" location of tokens, refer to the "spelling" location. This is more concrete and useful, tokens aren't really physical objects! llvm-svn: 62309	2009-01-16 07:00:02 +00:00
Ted Kremenek	a705b04d7f	IdentifierInfo: - IdentifierInfo can now (optionally) have its string data not be co-located with itself. This is for use with PTH. This aspect is a little gross, as getName() and getLength() now make assumptions about a possible alternate representation of IdentifierInfo. Perhaps we should make IdentifierInfo have virtual methods? IdentifierTable: - Added class "IdentifierInfoLookup" that can be used by IdentifierTable to perform "string -> IdentifierInfo" lookups using an auxilliary data structure. This is used by PTH. - Perform tests show that IdentifierTable::get() does not slow down because of the extra check for the IdentiferInfoLookup object (the regular StringMap lookup does enough work to mitigate the impact of an extra null pointer check). - The upshot is that now that some IdentifierInfo objects might be owned by the IdentiferInfoLookup object. This should be reviewed. PTH: - Modified PTHManager::GetIdentifierInfo to not insert entries in IdentifierTable's string map, and instead create IdentifierInfo objects on the fly when mapping from persistent IDs to IdentifierInfos. This saves a ton of work with string copies, hashing, and StringMap lookup and resizing. This change was motivated because when processing source files in the PTH cache we don't need to do any string -> IdentifierInfo lookups. - PTHManager now subclasses IdentifierInfoLookup, allowing clients of IdentifierTable to transparently use IdentifierInfo objects managed by the PTH file. PTHManager resolves "string -> IdentifierInfo" queries by doing a binary search over a sorted table of identifier strings in the PTH file (the exact algorithm we use can be changed as needed). These changes lead to the following performance changes when using PTH on Cocoa.h: - fsyntax-only: 10% performance improvement - Eonly: 30% performance improvement llvm-svn: 62273	2009-01-15 18:47:46 +00:00
Ted Kremenek	e9814186ac	PTH: - Use canonical FileID when using getSpelling() caching. This addresses some cache misses we were seeing with -fsyntax-only on Cocoa.h - Added Preprocessor::getPhysicalCharacterAt() utility method for clients to grab the first character at a specified sourcelocation. This uses the PTH spelling cache. - Modified Sema::ActOnNumericConstant() to use Preprocessor::getPhysicalCharacterAt() instead of SourceManager::getCharacterData() (to get PTH hits). These changes cause -fsyntax-only to not page in any sources from Cocoa.h. We see a speedup of 27%. llvm-svn: 62193	2009-01-13 23:19:12 +00:00
Ted Kremenek	b0b4f74b6b	PTH: Fix remaining cases where the spelling cache in the PTH file was being missed when it shouldn't. This shaves another 7% off PTH time for -Eonly on Cocoa.h llvm-svn: 62186	2009-01-13 22:05:50 +00:00
Ted Kremenek	47b8cf6deb	Enhance PTH 'getSpelling' caching: - Refactor caching logic into a helper class PTHSpellingSearch - Allow "random accesses" in the spelling cache, thus catching the remaining cases where 'getSpelling' wasn't hitting the PTH cache For -Eonly, PTH, Cocoa.h: - This reduces wall time by 3% (user time unchanged, sys time reduced) - This reduces the amount of paged source by 1112K. The remaining 1112K still being paged in is from somewhere else (investigating). llvm-svn: 62009	2009-01-09 22:05:30 +00:00
Ted Kremenek	d5e6e16d0d	PTH: Hook up getSpelling() caching in PTHLexer. This results in a nice performance gain. Here's what we see for -Eonly on Cocoa.h (using PTH): - wall time decreases by 21% (26% speedup overall) - system time decreases by 35% - user time decreases by 6% These reductions are due to not paging source files just to get spellings for literals. The solution in place doesn't appear to be 100% yet, as we still see some of the pages for source files getting mapped in. Using -print-stats, we see that SourceManager maps in 7179K less bytes of source text (reduction of 75%). Will investigate why the remaining 25% are getting paged in. With these changes, here's how PTH compares to non-PTH on Cocoa.h: -Eonly: PTH takes 64% of the time as non-PTH (54% speedup) -fsyntax-only: PTH takes 89% of the time as non-PTH (11% speedup) llvm-svn: 61913	2009-01-08 04:30:32 +00:00
Ted Kremenek	884a558441	PTH: - Added stub PTHLexer::getSpelling() that will be used for fetching cached spellings from the PTH file. This doesn't do anything yet. - Added a hook in Preprocessor::getSpelling() to call PTHLexer::getSpelling() when using a PTHLexer. - Updated PTHLexer to read the offsets of spelling tables in the PTH file. llvm-svn: 61911	2009-01-08 02:47:16 +00:00
Chris Lattner	f96b08302f	Make Token::setLength assert that the token is not an annotation token. llvm-svn: 61791	2009-01-06 05:25:04 +00:00
Chris Lattner	a8a3f73a47	rename tok::annot_qualtypename -> tok::annot_typename, which is both shorter and more accurate. The type name might not be qualified. llvm-svn: 61788	2009-01-06 05:06:21 +00:00
Chris Lattner	c69537feb5	Fix a bug where we'd try to look beyond the current cached tokens when not in backtracking mode. This was just using the wrong predicate. llvm-svn: 61666	2009-01-05 01:42:04 +00:00
Ted Kremenek	78cc24730e	PTH: Remove some methods and simplify some conditions in PTHLexer::Lex(). No big functionality change. llvm-svn: 61381	2008-12-23 19:24:24 +00:00
Ted Kremenek	66076a964b	PTH: - In PTHLexer::Lex read all of the token data from PTH file before constructing the token. The idea is to enhance locality. - Do not use Read8/Read32 in PTHLexer::Lex. Inline these operations manually. - Change PTHManager::ReadIdentifierInfo() to PTHManager::GetIdentifierInfo(). They are functionally the same except that PTHLexer::Lex() reads the persistent id. These changes result in a 3.3% speedup for PTH on Cocoa.h (-Eonly). llvm-svn: 61363	2008-12-23 02:30:15 +00:00
Ted Kremenek	1b18ad240c	PTH: - Embed 'eom' tokens in PTH file. - Use embedded 'eom' tokens to not lazily generate them in the PTHLexer. This means that PTHLexer can always advance to the next token after reading a token (instead of buffering tokens using a copy). - Moved logic of 'ReadToken' into Lex. GetToken & ReadToken no longer exist. - These changes result in a 3.3% speedup (-Eonly) on Cocoa.h. - The code is a little gross. Many cleanups are possible and should be done. llvm-svn: 61360	2008-12-23 01:30:52 +00:00
Chris Lattner	9fa1552ca8	avoid using a typedef that isn't always included from headers. llvm-svn: 61269	2008-12-19 23:51:20 +00:00
Douglas Gregor	55ad91fecb	Ultrasimplistic sketch for the parsing of C++ template-ids. This won't become useful or correct until we (1) parse template arguments correctly, (2) have some way to turn template-ids into types, declarators, etc., and (3) have a real representation of templates. llvm-svn: 61208	2008-12-18 19:37:40 +00:00
Ted Kremenek	63ff81c4e1	Change PTHLexer::getSourceLocation() to not call GetToken() and instead just read the file offset in the token data buffer directly. llvm-svn: 61170	2008-12-17 23:36:32 +00:00
Ted Kremenek	8c4bb56219	PTHLexer::isNextPPTokenLParen() no longer calls GetToken() and just reads the token kind from the token data buffer. This results in a minor speedup and reduces the dependency on GetToken(). llvm-svn: 61168	2008-12-17 23:08:31 +00:00
Ted Kremenek	6c7ea11300	Preprocessor: Allocate MacroInfo objects using a BumpPtrAllocator instead using new/delete. This speeds up -Eonly on Cocoa.h using the regular lexer by 1.8% and the PTHLexer by 3%. llvm-svn: 61042	2008-12-15 19:56:42 +00:00
Ted Kremenek	56572ab9e9	Added PTH optimization to not process entire blocks of tokens that appear in skipped preprocessor blocks. This improves PTH speed by 6%. The code for this optimization itself is not very optimized, and will get cleaned up. llvm-svn: 60956	2008-12-12 18:34:08 +00:00
Ted Kremenek	ca153f7349	PTHLexer: Keep track of the location of the last '#' token and provide the means to jump ahead in the token stream. llvm-svn: 60905	2008-12-11 22:41:47 +00:00
Ted Kremenek	67ab296d5c	Remove unused ivar CurTokenIdx. llvm-svn: 60896	2008-12-11 20:39:48 +00:00
Ted Kremenek	8e1f05fc26	PreprocessorLexer (and subclasses): - Added virtual method 'getSourceLocation()' (no arguments) that gets the location of the next "observable" location (e.g., next character, next token). PPLexerChange.cpp: - Implemented FIXME by using PreprocessorLexer::getSourceLocation() to get the location in the file we are returning to after lexing a #included file. This appears to be slightly faster than having the branch (i.e., 'if(CurLexer)'). It's also not a really hot part of the Preprocessor. llvm-svn: 60860	2008-12-10 23:20:59 +00:00
Ted Kremenek	d40ab7b72a	Declare PerIDCache as IdentifierInfo** instead of void*. This is just cleaner. No performance change. llvm-svn: 60843	2008-12-10 19:40:23 +00:00
Ted Kremenek	73a4d28758	PTH: Use an array instead of a DenseMap to cache persistent IDs -> IdentifierInfo*. This leads to a 4% speedup at -fsyntax-only using PTH. llvm-svn: 60452	2008-12-03 01:16:39 +00:00
Ted Kremenek	33eeabda61	- Remove PTHManager.cpp. Move all of its functions to PTHLexer.cpp since some of the internal methods are used by PTHLexer (their implementations are intertwined.) This enables some important inlining opportunities at -O3. - Don't construct an std::vector<Token> prior to feeding PTH tokens to the Preprocessor. Stream them off the PTH file directly. llvm-svn: 60447	2008-12-03 00:38:03 +00:00
Ted Kremenek	af058b5696	Preprocessor: - Added method "setPTHManager" that will be called by the driver to install a PTHManager for the Preprocessor. - Fixed some comments. - Added EnterSourceFileWithPTH to mirror EnterSourceFileWithLexer. llvm-svn: 60437	2008-12-02 19:46:31 +00:00
Ted Kremenek	498b6210fc	Added PTHManager, a utility class that will be used by Preprocessor to lazily create PTHLexer objects for pre-tokenized files. llvm-svn: 60436	2008-12-02 19:45:05 +00:00
Ted Kremenek	1f50dc899f	PTHLexer now owns the Token vector. llvm-svn: 60136	2008-11-27 00:38:24 +00:00
Ted Kremenek	e6847594ef	Add setter method PreprocessorLexer::setParsingPreprocessorDirective(). This will be used by the mechanism to generate cached tokens. llvm-svn: 60070	2008-11-26 00:57:02 +00:00
Chris Lattner	59acca5874	remove the NumericLiteralParser::Diag helper method, inlining it into its call sites. This makes it more explicit when the hasError flag is getting set and removes a confusing difference in behavior between PP.Diag and Diag in this code. llvm-svn: 59863	2008-11-22 07:23:31 +00:00
Chris Lattner	1ef2028205	Move the Preprocessor::Diag methods inline. This has the interesting (and carefully calculated) effect of allowing the compiler to reason about the aliasing properties of DiagnosticBuilder object better, allowing the whole thing to be promoted to registers instead of resulting in a ton of stack traffic. While I'm not very concerned about the performance of the Diag() method invocations, I am more concerned about their code size and impact on the non-diagnostic code. This patch shrinks the clang executable (in release-asserts mode with gcc-4.2) from 14523980 to 14519816 bytes. This isn't much, but it shrinks the lexer from 38192 to 37776, PPDirectives.o from 31116 to 28868 bytes, etc. llvm-svn: 59862	2008-11-22 07:03:46 +00:00
Chris Lattner	fdabe83c67	inline a method into its only two call sites. llvm-svn: 59860	2008-11-22 06:42:31 +00:00
Chris Lattner	014156e108	actually, this version isn't really needed. llvm-svn: 59859	2008-11-22 06:22:39 +00:00
Chris Lattner	57dab26be1	remove a sneaky version of Diag hiding in PreprocessorLexer. llvm-svn: 59858	2008-11-22 06:20:42 +00:00
Chris Lattner	6d27a16b95	Change the Lexer::Diag method to not magically silence warnings, force the caller to check instead. This eliminates the need (and the risk!) of weird null DiagnosticBuilder's floating around. llvm-svn: 59856	2008-11-22 02:02:22 +00:00
Chris Lattner	427c9c1763	Split the DiagnosticInfo class into two disjoint classes: one for building up the diagnostic that is in flight (DiagnosticBuilder) and one for pulling structured information out of the diagnostic when formatting and presenting it. There is no functionality change with this patch. llvm-svn: 59849	2008-11-22 00:59:29 +00:00
Ted Kremenek	a3148f16e4	Fix predicate: we're not in caching mode if CurPPLexer == 0, not CurLexer == 0. llvm-svn: 59848	2008-11-22 00:41:34 +00:00
Ted Kremenek	473d76e81d	Add comment to IsFileLexer, clean up indentation, and tighten how it's written. llvm-svn: 59773	2008-11-21 01:07:52 +00:00
Ted Kremenek	53ab374d9f	PTHLexer: - Move out logic for handling the end-of-file to LexEndOfFile (to match the Lexer) class. The logic now mirrors the Lexer class more, which allows us to pass most of the Preprocessor test cases. llvm-svn: 59768	2008-11-21 00:58:35 +00:00
Ted Kremenek	111caaac58	PTHLexer: - Move PTHLexer::GetToken() to be inside PTHLexer.cpp. - When lexing in raw mode, null out identifiers. llvm-svn: 59744	2008-11-20 19:49:00 +00:00
Ted Kremenek	94981e1f23	PTHLexer: - Rename 'CurToken' and 'LastToken' to 'CurTokenIdx' and 'LastTokenIdx' respectively. - Add helper methods GetToken(), AdvanceToken(), AtLastToken() to abstract away details of the token stream. This also allows us to easily replace their implementation later. llvm-svn: 59733	2008-11-20 16:32:22 +00:00
Ted Kremenek	6bc5f3ec90	Rename IsNonPragmaNonMacroLexer to IsFileLexer. llvm-svn: 59731	2008-11-20 16:19:53 +00:00
Daniel Dunbar	203bfdfcf3	De-unionize fields in Token class. - This is fairly gross but although the code is conceptually the same, introducting the union causes gcc 4.2 on x86 (darwin, if that matters) to pessimize LexTokenInternal which is critical to our preprocessor performance. This speeds up -Eonly lexing of Cocoa.h by ~4.7% in my timings and reduces the code size of LexTokenInternal by 8.6%. llvm-svn: 59725	2008-11-20 08:01:39 +00:00
Ted Kremenek	f4fd9b7ec4	Added virtual method 'IndirectLex' to PTHLexer. This will likely get removed in the future when we correctly handle #include processing. llvm-svn: 59722	2008-11-20 07:54:06 +00:00
Ted Kremenek	49e4579a86	Preprocessor::isCurrentLexer() now takes a PreprocessorLexer* argument to match against CurPPLexer instead of CurLexer. llvm-svn: 59721	2008-11-20 07:53:31 +00:00
Ted Kremenek	b33ce32bda	Preprocessor::getCurrentFileLexer() now returns a PreprocessorLexer* instead of a Lexer*. This means it will either return the current (normal) file Lexer or a PTHLexer. llvm-svn: 59694	2008-11-20 01:49:44 +00:00
Ted Kremenek	b0262c1e64	- Default initialize ParsingPreprocessorDirective, ParsingFilename, and LexingRawMode in the ctor of PreprocessorLexer. - PTHLexer: Use "LastToken" instead of "NumToken" llvm-svn: 59690	2008-11-20 01:29:45 +00:00
Ted Kremenek	5b75170014	Fix comment. llvm-svn: 59673	2008-11-19 22:55:55 +00:00
Ted Kremenek	11cfbb473e	Add stub for PTHLexer::isNextPPTokenLParen(). llvm-svn: 59670	2008-11-19 22:42:26 +00:00
Ted Kremenek	76c3441a4e	When using a PTHLexer, use DiscardToEndOfLine() instead of ReadToEndOfLine(). llvm-svn: 59668	2008-11-19 22:21:33 +00:00
Ted Kremenek	45245217bc	- Move static function IsNonPragmaNonMacroLexer into Preprocessor.h. - Add variants of IsNonPragmaNonMacroLexer to accept an IncludeMacroStack entry (simplifies some uses). - Use IsNonPragmaNonMacroLexer in Preprocessor::LookupFile. - Add 'FileID' to PreprocessorLexer, and have Preprocessor query this fileid when looking up the FileEntry for a file Performance testing of -Eonly on Cocoa.h shows no performance regression because of this patch. llvm-svn: 59666	2008-11-19 21:57:25 +00:00
Argyrios Kyrtzidis	89709ac2cc	Fix this: With this snippet: void f(a::b); An assert is hit: Assertion failed: CachedTokens[CachedLexPos-1].getLocation() == Tok.getAnnotationEndLoc() && "The annotation should be until the most recent cached token", file ..\..\lib\Lex\PPCaching.cpp, line 98 Introduce Preprocessor::RevertCachedTokens that reverts a specific number of tokens when backtracking is enabled. llvm-svn: 59636	2008-11-19 15:22:16 +00:00
Argyrios Kyrtzidis	f5e2812e69	Remove Preprocessor::CacheTokens boolean data member. The same functionality can be provided by using Preprocessor::isBacktrackEnabled(). llvm-svn: 59631	2008-11-19 14:23:14 +00:00
Ted Kremenek	a7c279ba40	Revert 59574 (caused tests to fail). llvm-svn: 59579	2008-11-19 01:54:47 +00:00
Ted Kremenek	1b167108bb	- Move static function IsNonPragmaNonMacroLexer into Preprocessor.h. - Add variants of IsNonPragmaNonMacroLexer to accept an IncludeMacroStack entry (simplifies some uses). - Use IsNonPragmaNonMacroLexer in Preprocessor::LookupFile. Performance testing of -Eonly on Cocoa.h shows no performance regression because of this patch. llvm-svn: 59574	2008-11-19 00:46:18 +00:00
Chris Lattner	e05c4dfc42	Remove the last of the old-style Preprocessor::Diag methods. llvm-svn: 59554	2008-11-18 21:48:13 +00:00
Chris Lattner	97b8e84bd7	remove one more Preprocessor::Diag method. llvm-svn: 59512	2008-11-18 08:02:48 +00:00
Chris Lattner	907dfe94e1	Convert the lexer and start converting the PP over to using canonical Diag methods. llvm-svn: 59511	2008-11-18 07:59:24 +00:00
Ted Kremenek	4c5888c67a	Preprocessor::PushIncludeMacroStack() should always zero out CurPPLexer. llvm-svn: 59490	2008-11-18 03:28:24 +00:00
Ted Kremenek	3d9740da29	- Add Lexer::isPragma() accessor for clients of Lexer that aren't friends. - Add static method to test if the current lexer is a non-macro/non-pragma lexer. - Refactor some code in PPLexerChange to use this static method. - No performance change. llvm-svn: 59486	2008-11-18 01:33:13 +00:00
Ted Kremenek	6b73291462	Add hooks to use PTHLexer::Lex instead of Lexer::Lex when CurLexer is null. Performance tests on Cocoa.h (using the regular Lexer) shows no performance difference. llvm-svn: 59479	2008-11-18 01:04:47 +00:00
Ted Kremenek	68ef9fc6ae	- Add 'CurPPLexer' to Preprocessor to keep track of the current PreprocessorLexer, which will either be a 'Lexer' or 'PTHLexer'. - Added stub field 'CurPTHLexer' to keep track of the current PTHLexer. - Modified IncludeStackInfo to track both the current PTHLexer and current PreprocessorLexer. llvm-svn: 59472	2008-11-18 00:12:49 +00:00
Douglas Gregor	77324f3854	Introduction the DeclarationName class, as a single, general method of representing the names of declarations in the C family of languages. DeclarationName is used in NamedDecl to store the name of the declaration (naturally), and ObjCMethodDecl is now a NamedDecl. llvm-svn: 59441	2008-11-17 14:58:09 +00:00
Ted Kremenek	a0d2a1661a	Using llvm::OwningPtr<> for CurLexer and CurTokenLexer. This makes both the ownership semantics of these objects explicit within the Preprocessor and also tightens up the code (explicit deletes not needed). llvm-svn: 59249	2008-11-13 17:11:24 +00:00
Ted Kremenek	66312a3ff4	Move some diagnostic handling to PreprocessorLexer. llvm-svn: 59191	2008-11-12 23:13:54 +00:00
Ted Kremenek	3060634fba	Add virtual dtor to PreprocessorLexer. llvm-svn: 59188	2008-11-12 22:48:57 +00:00
Ted Kremenek	9d39e377df	Add LexIncludeFilename. llvm-svn: 59187	2008-11-12 22:46:33 +00:00
Ted Kremenek	6c90efb923	Move LexIncludeFilename from Lexer to PreprocessorLexer. PreprocessorLexer now has a virtual method "IndirectLex" which allows it to call the lex method of its subclasses. This is not for performance intensive operations. llvm-svn: 59185	2008-11-12 22:43:05 +00:00
Ted Kremenek	50b84bb5f6	Unbreak last commit. llvm-svn: 59180	2008-11-12 22:19:18 +00:00
Ted Kremenek	f5518cc95b	Add Preprocessor::PushIncludeMacroStack() and Preprocessor::PopIncludeMacroStack(), two utility methods for manipulating the Preprocessor stack. These will be used to remove manually manipulation of IncludeMacroStack from the rest of the Preprocessor implementation. llvm-svn: 59179	2008-11-12 22:10:22 +00:00
Ted Kremenek	7cd62457c4	Add skeleton for PTH lexer. llvm-svn: 59169	2008-11-12 21:37:15 +00:00
Ted Kremenek	3a460b1106	Move pieces of Lexer that the Preprocessor mutates to a new base class 'PreprocessorLexer'. This will also be the base class of the new Preprocessed-Token-Header (PTH) lexer. No functionality change. llvm-svn: 59168	2008-11-12 21:33:59 +00:00
Argyrios Kyrtzidis	c7e67a04c3	Introduce annotation tokens, a special kind of token, created and used only by the parser to replace a group of tokens with a single token encoding semantic information. Will be fully utilized later for C++ nested-name-specifiers. llvm-svn: 58911	2008-11-08 16:17:04 +00:00
Chris Lattner	546200529b	clarify comment, rename argument to avoid a subtle conflict with an ivar that wasn't a bug but was confusing. llvm-svn: 58311	2008-10-28 01:02:17 +00:00
Chris Lattner	66a740e66e	Rename Characteristic_t to CharacteristicKind llvm-svn: 58224	2008-10-27 01:19:25 +00:00
Ted Kremenek	4e0f205145	Added method to access the raw flags of Token. llvm-svn: 57877	2008-10-21 03:32:15 +00:00
Chris Lattner	b11c3233d8	Change FormTokenWithChars to take the token kind to form, since all clients were setting a kind and then forming it. This is just a minor API cleanup, no functionality change. llvm-svn: 57404	2008-10-12 04:51:35 +00:00
Chris Lattner	4d96344c19	Add a new mode to the lexer which enables it to return all characters, even whitespace, as tokens from the file. This is enabled with L->SetKeepWhitespaceMode(true) on a raw lexer. In this mode, you too can use clang as a really complex version of 'cat' with code like this: Lexer RawLex(SourceLocation::getFileLoc(SM.getMainFileID(), 0), PP.getLangOptions(), File.first, File.second); RawLex.SetKeepWhitespaceMode(true); Token RawTok; RawLex.LexFromRawLexer(RawTok); while (RawTok.isNot(tok::eof)) { std::cout << PP.getSpelling(RawTok); RawLex.LexFromRawLexer(RawTok); } This will emit exactly the input file, with no canonicalization or other translation. Realistic clients actually do something with the tokens of course :) llvm-svn: 57401	2008-10-12 04:05:48 +00:00
Chris Lattner	8637abd333	add a new inKeepCommentMode() accessor to abstract the KeepCommentMode ivar. llvm-svn: 57397	2008-10-12 03:22:02 +00:00
Chris Lattner	7c2e9809b1	Simplify raw mode lexing by treating an unterminate /**/ comment the same we we do an unterminated string or character literal. This makes it so we can guarantee that the lexer never calls into the preprocessor (which would be suicide for a raw lexer). llvm-svn: 57395	2008-10-12 01:31:51 +00:00
Chris Lattner	50c9050037	Change how raw lexers are handled: instead of creating them and then using LexRawToken, create one and use LexFromRawLexer. This avoids twiddling the RawLexer flag around and simplifies some code (even speeding raw lexing up a tiny bit). This change also improves the token paster to use a Lexer on the stack instead of new/deleting it. llvm-svn: 57393	2008-10-12 01:15:46 +00:00
Daniel Dunbar	405965323d	Add Preprocessor::RemovePragmaHandler. - No functionality change. llvm-svn: 57065	2008-10-04 19:17:46 +00:00
Chris Lattner	b03dc76499	clean up a bunch of fixme's I added, by moving DirectoryLookup::DirType into SourceManager.h llvm-svn: 56692	2008-09-26 21:18:42 +00:00
Chris Lattner	c88a23e8d7	Fix the rest of rdar://6243860 hopefully. This requires changing FileIDInfo to whether the fileid is a 'extern c system header' in addition to whether it is a system header, most of this is spreading plumbing around. Once we have that, PPLexerChange bases its "file enter/exit" notifications to PPCallbacks to base the system header state on FileIDInfo instead of HeaderSearch. Finally, in Preprocessor::HandleIncludeDirective, mirror logic in GCC: the system headerness of a file being entered can be set due to the #includer or the #includee. llvm-svn: 56688	2008-09-26 20:12:23 +00:00
Daniel Dunbar	2c61b32beb	Comment fix. llvm-svn: 56634	2008-09-26 01:11:51 +00:00
Argyrios Kyrtzidis	75b34536d0	Rename Preprocessor::DisableBacktrack -> Preprocessor::CommitBacktrackedTokens. llvm-svn: 55281	2008-08-24 12:29:43 +00:00
Chris Lattner	909d0881ff	Comment tweak. llvm-svn: 55272	2008-08-24 00:12:08 +00:00
Argyrios Kyrtzidis	a65490c5df	Allow nested backtracks. llvm-svn: 55204	2008-08-22 21:27:50 +00:00
Daniel Dunbar	12c9ddced1	Change Parser & Sema to use interned "super" for comparions. - Added as private members for each because it is not clear where to put the common definition. Perhaps the IdentifierInfos all of these "pseudo-keywords" should be collected into one place (this would KnownFunctionIDs and Objective-C property IDs, for example). Remove Token::isNamedIdentifier. - There isn't a good reason to use strcmp when we have interned strings, and there isn't a good reason to encourage clients to do so. llvm-svn: 54794	2008-08-14 22:04:54 +00:00
Nico Weber	4c3116437c	* Remove isInSystemHeader() from DiagClient, move it to SourceManager * Move FormatError() from TextDiagnostic up to DiagClient, remove now empty class TextDiagnostic * Make DiagClient optional for Diagnostic This fixes the following problems: * -html-diags (and probably others) does now output the same set of warnings as console clang does * nothing crashes if one forgets to call setHeaderSearch() on TextDiagnostic * some code duplication is removed llvm-svn: 54620	2008-08-10 19:59:06 +00:00
Argyrios Kyrtzidis	b3dd1e0889	Allow the preprocessor to cache the lexed tokens, so that we can do efficient lookahead and backtracking. 1) New public methods added: -EnableBacktrackAtThisPos -DisableBacktrack -Backtrack -isBacktrackEnabled 2) LookAhead() implementation is replaced with a more efficient one. 3) LookNext() is removed. llvm-svn: 54611	2008-08-10 13:15:22 +00:00
Chris Lattner	a580a92508	Add an accessor, patch by Csaba Hruska. llvm-svn: 53391	2008-07-10 05:26:30 +00:00
Argyrios Kyrtzidis	80b77ac394	Add Preprocessor::LookNext method, which implements an efficient way to 'take a peek' at the next token without consuming it. llvm-svn: 53375	2008-07-09 22:46:46 +00:00
Chris Lattner	6016a515e5	refactor some code out into a new method. llvm-svn: 52889	2008-06-30 06:39:54 +00:00
Chris Lattner	3565c8e343	Neil pointed out that clang doesn't generate ranges from diagnostics related to pp-expressions. Doing so is pretty simple and this patch implements it, yielding nice diagnostics like: t.c:2:7: error: division by zero in preprocessor expression #if 1 / (0 + 0) ~ ^ ~~~~~~~ t.c:5:14: error: expected ')' in preprocessor expression #if (412 + 42 ~~~~~~~~^ t.c:5:5: error: to match this '(' #if (412 + 42 ^ t.c:10:10: warning: left side of operator converted from negative value to unsigned: -42 to 18446744073709551574 #if (-42 + 0U) / -2 ~~~ ^ ~~ t.c:10:16: warning: right side of operator converted from negative value to unsigned: -2 to 18446744073709551614 #if (-42 + 0U) / -2 ~~~~~~~~~~ ^ ~~ 5 diagnostics generated. llvm-svn: 50638	2008-05-05 06:45:50 +00:00
Chris Lattner	ba1f37bfdf	simplify ownership of the predefines buffer. llvm-svn: 49973	2008-04-19 23:09:31 +00:00
Ted Kremenek	f42f3fb47d	class Preprocessor: Now owns the "predefines" char; it deletes [] it in its dstor. clang.cpp: InitializePreprocessor now makes a copy of the contents of PredefinesBuffer and passes it to the preprocessor object. clang.cpp: DriverPreprocessorFactory now calls "InitializePreprocessor" instead of this being done in main(). html::HighlightMacros() now takes a PreprocessorFactory, allowing it to conjure up a new Preprocessor to highlight macros. class HTMLDiagnostics now takes a PreprocessorFactory that it can use for html::HighlightMacros(). Updated clients of HTMLDiagnostics to use this new interface. llvm-svn: 49875	2008-04-17 22:31:54 +00:00
Ted Kremenek	219bab3be9	Added "PreprocessorFactory", an interface for lazily creating Preprocessor objects on-demand. llvm-svn: 49868	2008-04-17 21:23:07 +00:00
Chris Lattner	e9786c3199	reenable highlighting of (the first line of) comments llvm-svn: 49816	2008-04-16 20:54:51 +00:00
Chris Lattner	221929310b	Make the preprocessor own its PPCallbacks, fixing a memory leak. Patch by Sam Bishop! llvm-svn: 48357	2008-03-14 06:07:05 +00:00
Chris Lattner	3e4683262e	implement simple support for arbitrary token lookahead. Change the objc @try parser to use it, fixing a FIXME. Update the objc-try-catch-1.m file to pass now that we get more reasonable errors. llvm-svn: 48129	2008-03-10 06:06:04 +00:00
Chris Lattner	1d4000ba50	rename HandleEndOfMacro -> HandleEndOfTokenLexer llvm-svn: 48076	2008-03-09 03:04:16 +00:00
Chris Lattner	7ff66fb91e	split the MacroArgs class out of TokenLexer.cpp/h into MacroArgs.cpp/h llvm-svn: 48075	2008-03-09 02:55:12 +00:00
Chris Lattner	285c0c1150	rename some MacroExpander-related ivars to TokenLexer. llvm-svn: 48073	2008-03-09 02:26:03 +00:00
Chris Lattner	5bb36002be	Rename MacroExpander.cpp/h -> TokenLexer.cpp/h llvm-svn: 48072	2008-03-09 02:22:57 +00:00
Chris Lattner	95d72cdf0f	rename the MacroExpander class to TokenLexer. It handles both token streams and macro lexing, so a more generic name is useful. llvm-svn: 48071	2008-03-09 02:18:51 +00:00
Chris Lattner	d7daed1478	rename MacroTokens -> Tokens. When this is a token stream, there is no macro involved. llvm-svn: 48070	2008-03-09 02:07:49 +00:00
Chris Lattner	855d024a83	Remove the first layer of support for "portability" warnings. This is theoretically useful, but not useful in practice. It adds a bunch of complexity, and not much value. It's best to nuke it. One big advantage is that it means the target interfaces will soon lose their SLoc arguments and target queries can never emit diagnostics anymore (yay). Removing this also simplifies some of the core preprocessor which should make it slightly faster. Ted, I didn't simplify TripleProcessor, which can now have at most one triple, and can probably just be removed. Please poke at it when you have time. llvm-svn: 47930	2008-03-05 01:18:20 +00:00
Lauro Ramos Venancio	8983891531	Fix PR2086. llvm-svn: 47551	2008-02-25 19:03:15 +00:00
Ted Kremenek	72be068ab3	Two more Windows-related fixes: - More enum signeness bitfield fixes (MSVC treats enums as signed). - Fixed in Lex/HeaderSearch.cpp an unsafe copy between two HeaderSearch::PerFileInfo entries in a common vector. The copy involved two calls to getFileInfo() within the assignment; these calls could have side-effects that enlarged the internal vector, and with MSVC this would invalidate one of the values in the assignment. Patch by Argiris Kirtzidis! llvm-svn: 47536	2008-02-24 03:55:14 +00:00
Ted Kremenek	f948d2a75f	Change encoding of TokenKind in IdentifierTable to be of type "unsigned" instead of TokenKind because of signedness issues with MSVC and enums. Patch from Argiris Kirtzidis. llvm-svn: 47515	2008-02-23 01:05:54 +00:00
Chris Lattner	3b5054dda0	Implement support for the extremely atrocious MS /##/ extension, which pastes together a comment. This is only enabled with -fms-extensions of course. llvm-svn: 46845	2008-02-07 06:03:59 +00:00
Steve Naroff	72df405dc5	Cleanup previous patch (based on feedback from Ted). Since this behavior is useful for most classes, we might consider adding a simple 3 method class that implements the behavior. Ted said that Boost has such a class. llvm-svn: 46654	2008-02-02 00:10:46 +00:00
Steve Naroff	f7fe5b372f	Make sure SourceManager/HeaderSearch don't support default copy constructors (since they result in bad runtime behavior). I'm sure there are other classes that might need this "guard", however I was bitten by these 2 recently (so I thought I'd fix them). llvm-svn: 46653	2008-02-01 23:31:13 +00:00
Chris Lattner	f6df8e9702	fix comment typo llvm-svn: 46505	2008-01-29 07:59:54 +00:00
Chris Lattner	e3e358c317	readability improvement suggested by Sam Bishop, thanks! llvm-svn: 45735	2008-01-08 04:13:21 +00:00
Chris Lattner	a30be59fa2	Fix a nasty corner case that Neil noticed in PR1900, where we would incorrectly apply the multiple include optimization to files with guards like: #if !defined(x) MACRO where MACRO could expand to different things in different contexts. Thanks Neil! llvm-svn: 45716	2008-01-07 19:50:27 +00:00
Chris Lattner	5b12ab8c93	Don't attribute in file headers anymore. See llvmdev for the discussion of this change. llvm-svn: 45410	2007-12-29 19:59:25 +00:00
Seo Sanghyeon	58bdfda87c	Add newline llvm-svn: 45257	2007-12-20 07:22:20 +00:00
Ted Kremenek	230bd918b2	Interned MainFileID within SourceManager. Since SourceManager is referenced by both Preprocessor and ASTContext, we no longer need to explicitly pass MainFileID around in function calls that also pass either Preprocessor or ASTContext. This resulted in some nice cleanups in the ASTConsumers and the driver. llvm-svn: 45228	2007-12-19 22:51:13 +00:00
Ted Kremenek	f7bfae6b45	Typo fix. llvm-svn: 45227	2007-12-19 22:32:34 +00:00
Chris Lattner	c238331377	Add support for #pragma mark, which shouldn't warn about bogus tokens. llvm-svn: 45212	2007-12-19 19:38:36 +00:00
Chris Lattner	9f9a619a9f	implement enough helper functions to successfully dump out the contents of the header map. Look ma, no assumptions about input data here (aka, corrupt header maps can't crash the compiler - crazy thought). llvm-svn: 45122	2007-12-17 21:06:11 +00:00
Chris Lattner	79764a6bee	Finish hooking up the scaffolding for headermaps. They can now do everything except resolve lookups. llvm-svn: 45111	2007-12-17 18:44:09 +00:00
Chris Lattner	4ffe46cbdf	Start reading the headermap header, drop the 'errorstr' argument to the create method. llvm-svn: 45109	2007-12-17 18:34:53 +00:00
Chris Lattner	8d720d083a	Sink getName into DirectoryLookup to simplify the client in clang. llvm-svn: 45106	2007-12-17 17:57:27 +00:00
Chris Lattner	1587e6db01	add headermap.cpp llvm-svn: 45095	2007-12-17 08:22:46 +00:00
Chris Lattner	44bd21b7c1	finish stubbing out support for HeaderMap. Now we just need an implementation! llvm-svn: 45094	2007-12-17 08:17:39 +00:00
Chris Lattner	712e3873a0	refactor an better comment framework lookup code. This moves it from HeaderSearch into DirectoryLookup, as a particular framework lookup is specific to the directory we are currently querying. llvm-svn: 45093	2007-12-17 08:13:48 +00:00
Chris Lattner	f62f75895f	as it turns out, frameworks and headermaps are orthogonal. Make this so in the internal representation. This also fixes a bug where -I foo -F foo would not search foo as both a normal and framework include dir. llvm-svn: 45092	2007-12-17 07:52:39 +00:00
Chris Lattner	3e206b3a0e	teach RemoveDuplicates about header maps. llvm-svn: 45090	2007-12-17 06:44:29 +00:00
Chris Lattner	c4ba38ed1e	Step #1 in adding headermap support to clang. llvm-svn: 45089	2007-12-17 06:36:45 +00:00
Chris Lattner	67671ed4b7	add a helper method. llvm-svn: 44976	2007-12-13 01:59:49 +00:00
Ted Kremenek	f82942d04c	constified getFullLoc(). llvm-svn: 44951	2007-12-12 18:55:29 +00:00
Ted Kremenek	50d23f007f	Renamed getFullSourceLoc() -> getFullLoc(). llvm-svn: 44949	2007-12-12 18:46:37 +00:00
Ted Kremenek	d8bcfe27cc	Added method: Preprocessor::getFullSourceLoc. Used by clients of Preprocessor to get a FullSourceLoc from a SourceLocation. llvm-svn: 44948	2007-12-12 18:41:40 +00:00
Chris Lattner	615315f307	Add dumping support for locations, make -dumptokens print out the location info of each token. llvm-svn: 44741	2007-12-09 20:31:55 +00:00
Ted Kremenek	fbb08bc2e2	Added optional pass-by-reference argument "isExact" to NumericLiteralParser::GetFloatValue(). Upon method return, this flag has the value true if the returned APFloat can exactly represent the number in the parsed text, and false otherwise. Modified the implementation of GetFloatValue() to parse literals using APFloat's convertFromString method (which allows us to set the value of isExact). llvm-svn: 44339	2007-11-26 23:12:30 +00:00
Chris Lattner	4dcbc2090a	add header file I forgot to check in llvm-svn: 44179	2007-11-15 19:22:18 +00:00
Chris Lattner	fd64ebd3e4	Fix assertion for raw lexer. llvm-svn: 43091	2007-10-17 21:22:38 +00:00
Chris Lattner	8e129c23c8	Move token length calculation out of the diagnostics machinery into the lexer, where it can be shared. llvm-svn: 43090	2007-10-17 21:18:47 +00:00
Chris Lattner	02b436a05a	Add a new type of lexer: a raw lexer, which does not require a preprocessor object in order to do its thing. llvm-svn: 43084	2007-10-17 20:41:00 +00:00
Chris Lattner	caecf03215	add some comments. llvm-svn: 43079	2007-10-17 18:28:59 +00:00
Anders Carlsson	cbfc4b8824	Add support for Pascal strings. llvm-svn: 42974	2007-10-15 02:50:23 +00:00
Chris Lattner	1f1b0dbc28	Make a significant change to invert the control flow handling predefined macros. Previously, these were handled by the driver, now they are handled by the preprocessor. Some fallout of this: 1. Instead of preprocessing two buffers (the predefines, then the main source file) we now start preprocessing the main source file and inject the predefines as a "psuedo #include" from the main source file. 2. #1 allows us to nuke the Lexer::IsMainFile flag and simplify Preprocessor::isInPrimaryFile. 3. The driver doesn't have to know about standard #defines, the preprocessor knows, which is nice for people wanting to define their own drivers. 4. This allows us to put normal tokens in the predefine buffer, for example a definition for __builtin_va_list that is target-specific, and a typedef for id in objc. llvm-svn: 42818	2007-10-09 22:10:18 +00:00
Chris Lattner	0ab032aa8f	Add two new Token helper functions, "is" and "isNot". This allows us to write stuff like this: // If we don't have a comma, it is either the end of the list (a ';') or // an error, bail out. if (Tok.isNot(tok::comma)) break; instead of: // If we don't have a comma, it is either the end of the list (a ';') or // an error, bail out. if (Tok.getKind() != tok::comma) break; There is obviously no functionality change, but the code reads a bit better and is more terse. llvm-svn: 42795	2007-10-09 17:23:58 +00:00
Chris Lattner	ef6b136781	move IdentifierTable.h from liblex to libbasic. llvm-svn: 42730	2007-10-07 08:58:51 +00:00
Chris Lattner	c43ddc84a3	improve layering: Now instead of IdentifierInfo knowing anything about MacroInfo, only the preprocessor knows. This makes MacroInfo truly private to the Lex library (and its direct clients) instead of being accessed in the Basic library. llvm-svn: 42727	2007-10-07 08:44:20 +00:00
Chris Lattner	d7b971bf3d	add a hasMacroDefinition() method to IdentifierInfo, strength reduce a call to getMacroInfo to call it. llvm-svn: 42725	2007-10-07 07:57:27 +00:00
Chris Lattner	f49523d6ea	update comment. llvm-svn: 42724	2007-10-07 07:54:23 +00:00
Chris Lattner	ff067ce555	Remove the PPID bitfield from IdentifierInfo, shrinking it by a word (because all bitfields now fit in 32 bits). This shrinks the identifier table for carbon.h from 1634428 to 1451424 bytes (12%) and has no impact on compile time. llvm-svn: 42723	2007-10-07 07:52:34 +00:00
Chris Lattner	a441ca651f	First step to fixing a long lived layering violation: this moves the MacroInfo pointer to a side hash table (which currently lives in IdentifierTable.cpp). This removes a pointer from Identifier info, but doesn't shrink it, as it requires a new bit be added. This strange approach with the 'hasmacro' bit is needed to not lose preprocessor performance. llvm-svn: 42722	2007-10-07 07:09:52 +00:00
Chris Lattner	730160d32f	Shrink the builtinID down by 3 bits, allowing all the bitfields to fit in 32-bits, shrinking IdentifierInfo by a word. This shrinks the total size of the identifier pool from 1817264 to 1634428 bytes (11%) on carbon.h. llvm-svn: 42719	2007-10-07 06:29:32 +00:00
Chris Lattner	5700fab189	simplify the interfaces to create selectors: getSelector can take any number of arguments now and does the right thing, but the nullary/unary accessors are preserved as convenience functions. This allows us to slightly simplify clients. llvm-svn: 42716	2007-10-07 02:00:24 +00:00
Chris Lattner	f7f34d09e4	simplify some Selector interfaces. llvm-svn: 42715	2007-10-07 01:33:16 +00:00
Chris Lattner	dadc762ac5	Implement DenseMapInfo for Selector, allowing use of DenseMap/DenseSet of Selector's instead of requiring void* to be used. I converted one use of DenseSet<void*> over to use DenseSet<Selector> but the others should change as well. llvm-svn: 42645	2007-10-05 20:15:24 +00:00
Steve Naroff	e61bfa8bb4	Layering refinements for selectors (suggested by Chris). Specifics... - Add SelectorTable, which enables us to remove MultiKeywordSelector from the public header. - Remove FoldingSet from IdentifierInfo.h and Preprocessor.h. - Remove Parser::ObjcGetUnarySelector and Parser::ObjcGetKeywordSelector, they are subsumed by SelectorTable. - Add MultiKeywordSelector to IdentifierInfo.cpp. - Move a bunch of selector related methods from ParseObjC.cpp to IdentifierInfo.cpp. - Added some comments. llvm-svn: 42643	2007-10-05 18:42:47 +00:00
Chris Lattner	01d7f489a9	minor cleanups, make code more defensive, less branchy in Selector ctor. llvm-svn: 42603	2007-10-04 05:21:22 +00:00
Chris Lattner	0f26288998	fix an incorrect assertion llvm-svn: 42602	2007-10-04 05:16:42 +00:00
Chris Lattner	2b6abdce50	Add a new getLength() method to IdentifierInfo, which relies on a newly added method to StringMapEntry. Steve, please use this to remove the strlen calls in selector processing. llvm-svn: 42481	2007-09-30 08:32:27 +00:00
Steve Naroff	92866f4fbb	Add some comments to MultiKeywordSelector, make all methods private, add a friend, move some methods around. llvm-svn: 42456	2007-09-28 23:39:26 +00:00
Steve Naroff	8017506d9c	Yesterday I discovered that 78% of all selectors in "Cocoa.h" take 0/1 argument. This motivated implementing a devious clattner inspired solution:-) This approach uses a small value "Selector" class to point to an IdentifierInfo for the 0/1 case. For multi-keyword selectors, we instantiate a MultiKeywordSelector object (previously known as SelectorInfo). Now, the incremental cost for selectors is only 24,800 for Cocoa.h! This saves 156,592 bytes, or 86%!! The size reduction is also the result of getting rid of the AST slot, which was not strictly necessary (we will associate a selector with it's method using another table...most likely in Sema). This change was critical to make now, before we have too many clients. I still need to add some comments to the Selector class...will likely add later today/tomorrow. llvm-svn: 42452	2007-09-28 22:22:11 +00:00
Steve Naroff	65ca537b55	Fix bug in SelectorInfo::getName() - method buffer needs to be passed by reference. llvm-svn: 42411	2007-09-27 18:52:21 +00:00
Steve Naroff	f73590dbb1	Add SelectorInfo (similar in spirit to IdentifierInfo). The key difference is SelectorInfo is not string-oriented, it is a unique aggregate of IdentifierInfo's (using a folding set). SelectorInfo also has a richer API that simplifies the parser/action interface. 3 noteworthy benefits: #1: It is cleaner. I never "liked" storing keyword selectors (i.e. foo:bar:baz) in the IdentifierTable. #2: It is more space efficient. Since Cocoa keyword selectors can be quite long, this technique is space saving. For Cocoa.h, pulling the keyword selectors out saves ~180k. The cost of the SelectorInfo data is ~100k. Saves ~80k, or 43%. #3: It results in many API simplifications. Here are some highlights: - Removed 3 actions (ActOnKeywordMessage, ActOnUnaryMessage, & one flavor of ObjcBuildMethodDeclaration that was specific to unary messages). - Removed 3 funky structs from DeclSpec.h (ObjcKeywordMessage, ObjcKeywordDecl, and ObjcKeywordInfo). - Removed 2 ivars and 2 constructors from ObjCMessageExpr (fyi, this space savings has not been measured). I am happy with the way it turned out (though it took a bit more hacking than I expected). Given the central role of selectors in ObjC, making sure this is "right" will pay dividends later. Thanks to Chris for talking this through with me and suggesting this approach. llvm-svn: 42395	2007-09-27 14:38:14 +00:00
Chris Lattner	ec0a6d9be5	Use APFloat for the representation of FP immediates, ask the target for which apfloat to use for a particular type. llvm-svn: 42234	2007-09-22 18:29:59 +00:00
Steve Naroff	2cd263ff71	Remove SelectorTable/SelectorInfo, simply store all selectors in the central IdentifierTable. Rationale: We currently have a separate table to unique ObjC selectors. Since I don't need all the instance data in IdentifierInfo, I thought this would save space (and make more sense conceptually). It turns out the cost of having duplicate entries for unary selectors (i.e. names without colons) outweighs the cost difference between the IdentifierInfo & SelectorInfo structures. Here is the data: Two tables: * Selector/Identifier Stats: # Selectors/Identifiers: 51635 Bytes allocated: 1999824 One table: * Identifier Table Stats: # Identifiers: 49500 Bytes allocated: 1990316 llvm-svn: 42139	2007-09-19 16:18:46 +00:00
Steve Naroff	73d534a2e0	Add support for ObjC keyword selectors. - Add SelectorInfo/SelectorTable classes, modeled after IdentifierInfo/IdentifierTable. - Add SelectorTable instance to ASTContext, created lazily through ASTContext::getSelectorInfo(). - Add SelectorInfo slot to ObjcMethodDecl. - Add helper function to derive a SelectorInfo from ObjcKeywordInfo. Misc: Got the Decl stats stuff up and running again...it was missing support for ObjC AST's. llvm-svn: 42023	2007-09-17 14:16:13 +00:00
Chris Lattner	f8a3dadd40	Don't rely on ADL to find this member, patch by Justin Handville llvm-svn: 41783	2007-09-08 01:29:20 +00:00
Chris Lattner	78b6221ff9	Eliminate some VC++ warnings, patch by Hartmut Kaiser! llvm-svn: 41685	2007-09-03 18:28:41 +00:00
Chris Lattner	ed045421a8	1.0 is double, 1.0F is a float. llvm-svn: 41412	2007-08-26 03:29:23 +00:00
Chris Lattner	f55ab18663	1) refactor some code. 2) Add support for lexing imaginary constants (a GCC extension): t.c:5:10: warning: imaginary constants are an extension A = 1.0iF; ^ 3) Make the 'invalid suffix' diagnostic pointer more accurate: t.c:6:10: error: invalid suffix 'qF' on floating constant A = 1.0qF; ^ instead of: t.c:6:10: error: invalid suffix 'qF' on floating constant A = 1.0qF; ^ llvm-svn: 41411	2007-08-26 01:58:14 +00:00
Steve Naroff	7c34817902	Add helper functions Token::isObjCAtKeyword() and Token::getObjCKeywordID(). Convert all clients to the new cleaner, more robust API. llvm-svn: 41330	2007-08-23 18:16:40 +00:00
Chris Lattner	4c4a245475	Use a smallstring instead of an std::string in FileChanged to avoid some malloc traffic. This speeds up -E on xalancbmk by 2.4% llvm-svn: 40461	2007-07-24 06:57:14 +00:00
Chris Lattner	93ab9f134e	refactor the interface to Preprocessor::GetIncludeFilenameSpelling, no functionality changes. llvm-svn: 40414	2007-07-23 04:15:27 +00:00
Chris Lattner	5d1c02748f	Change hte lexer to start a start pointer to the underlying memorybuffer instead of a pointer to the memorybuffer itself. This reduces coupling and eliminates a pointer dereference on a hot path. This speeds up -Eonly on 483.xalancbmk by 2.1% llvm-svn: 40394	2007-07-22 18:44:36 +00:00
Chris Lattner	d427542a9b	Implement a simple cache in headersearch. This speeds up preprocessing 483.xalancbmk by about 10%, reducing the number of file lookup queries from 2139411 to 199466 (over 10x) llvm-svn: 40390	2007-07-22 07:28:00 +00:00
Chris Lattner	9c724c48ea	Fix a really subtle bug in the macro expander caching code, where redefinition of a macro could cause invalid memory to be deleted. Found preprocessing 253.perlbmk. llvm-svn: 40380	2007-07-22 01:16:55 +00:00
Chris Lattner	146762e7a4	At one point there were going to be lexer and parser tokens. Since that point is now long gone, we should rename LexerToken to Token, as it is the only kind of token we have. llvm-svn: 40105	2007-07-20 16:59:19 +00:00
Chris Lattner	77e9de50a1	simplify the lexer ctor to take a SLoc instead of a sloc and a redundant buffer*. llvm-svn: 40104	2007-07-20 16:52:03 +00:00
Chris Lattner	dc5c055fd1	Reimplement SourceLocation. Instead of having a fileid/offset pair, it now contains a bit discriminating between mapped locations and file locations. This separates the tables for macros and files in SourceManager, and allows better separation of concepts in the rest of the compiler. This allows us to have many macro instantiations before running out of 'addressing space'. This is also more efficient, because testing whether something is a macro expansion is now a bit test instead of a table lookup (which also used to require having a srcmgr around, now it doesn't). This is fully functional, but there are several refinements and optimizations left. llvm-svn: 40103	2007-07-20 16:37:10 +00:00
Chris Lattner	8a7003cd9f	Add a new Preprocessor::AdvanceToTokenCharacter method which, given a sloc specifying the start of a token and a logical (phase 3) character number, returns a sloc representing the input character corresponding to it. llvm-svn: 39905	2007-07-16 06:48:38 +00:00
Chris Lattner	b0f5d55912	factor a common predicate into a static method. llvm-svn: 39903	2007-07-16 06:16:59 +00:00
Chris Lattner	c02c4abe5b	Cache macro expander objects to avoid thrashing malloc in heavy expansion situations. This doesn't significantly improve carbon.h, but it does speed up INPUTS/macro_pounder_obj.c by 48% llvm-svn: 39864	2007-07-15 00:25:26 +00:00
Chris Lattner	564f478595	switch function-like macros from using a vector for their arguments to an explicitly new'd array. The array never mutates once created, so a vector is overkill. llvm-svn: 39862	2007-07-14 22:46:43 +00:00
Chris Lattner	819f2ef77b	switch from using a vector to a smallvector for macro replacement tokens This speeds up parsing carbon.h by 3.3% by avoiding some malloc traffic for small macros. llvm-svn: 39861	2007-07-14 22:15:50 +00:00
Chris Lattner	f40fe99118	expose an iterator interface to getReplacementTokens instead of the datastructure itself. llvm-svn: 39860	2007-07-14 22:11:41 +00:00
Chris Lattner	bd4de5df77	Fix "no newline at end of file" warnings. Patch contributed by Benoit Boissinot! llvm-svn: 39780	2007-07-12 15:43:07 +00:00
Chris Lattner	20c4ff2845	Improve portability to compilers where <cassert> is not implicitly included. Patch contributed by Benoit Boissinot! llvm-svn: 39779	2007-07-12 15:32:57 +00:00
Steve Naroff	97b9e91eb7	Bug #: Submitted by: Reviewed by: Added primitive support for 32-bit floating point literals. llvm-svn: 39719	2007-07-09 23:53:58 +00:00
Chris Lattner	23b7eb677d	Finally bite the bullet and make the major change: split the clang namespace out of the llvm namespace. This makes the clang namespace be a sibling of llvm instead of being a child. The good thing about this is that it makes many things unambiguous. The bad things is that many things in the llvm namespace (notably data structures like smallvector) now require an llvm:: qualifier. IMO, libsystem and libsupport should be split out of llvm into their own namespace in the future, which will fix this issue. llvm-svn: 39659	2007-06-15 23:05:46 +00:00
Chris Lattner	de12ae2fd7	Add support for binary literals: http://gcc.gnu.org/onlinedocs/gcc/Binary-constants.html#Binary-constants llvm-svn: 39621	2007-06-08 22:42:30 +00:00
Steve Naroff	98d153c730	Bug #: Submitted by: Reviewed by: Fixed a bug in the parser's handling of attributes on pointer declarators. For example, the following code was producing a syntax error... int __attribute(()) foo; attrib.c:10:25: error: expected identifier or '(' int __attribute(()) foo; ^ Changed Parser::ParseTypeQualifierListOpt to not consume the token following an attribute declaration. Also added LexerToken::getName() convenience method...useful when tracking down errors like this. llvm-svn: 39602	2007-06-06 23:19:11 +00:00
Bill Wendling	f06487034e	Bug #: Submitted by: Bill Wendling Reviewed by: Chris Lattner - Comment fix. llvm-svn: 39482	2007-05-23 08:03:10 +00:00
Chris Lattner	67ca9252f8	Implement Sema::ParseNumericConstant for integer constants in terms of APInt and correctly in terms of C99 6.4.4.1p5. llvm-svn: 39473	2007-05-21 01:08:44 +00:00
Chris Lattner	36982e4367	Add support for inserting up to 10 strings in a diagnostic, with %0, %1, %2, etc. llvm-svn: 39447	2007-05-16 17:49:37 +00:00
Chris Lattner	739e739b81	Remove the clang::SourceBuffer class, switch to the llvm::MemoryBuffer class. llvm-svn: 39426	2007-04-29 07:12:06 +00:00
Chris Lattner	2f5add6272	Implement support for performing semantic analysis of character literals. llvm-svn: 39390	2007-04-05 06:57:15 +00:00
Chris Lattner	871b4e101c	Minor enhancements to GetIntegerValue(APInt): * Detect overflow correctly. When it occurs, return the truncated value. * Add fixme for radix analysis. llvm-svn: 39382	2007-04-04 06:36:34 +00:00
Chris Lattner	5b743d3801	Add some really simplistic code for turning a ppnumber into an APInt. Much improvement is needed! llvm-svn: 39381	2007-04-04 05:52:58 +00:00
Steve Naroff	4f88b3113e	Bug #: Submitted by: Reviewed by: Move string literal parsing from Sema=>LiteralSupport. This consolidates all the quirky parsing code within the Lexer subsystem (yeah!). This simplifies Sema and (more importantly) allows future parsers (i.e. subclasses of Action) to benefit from this code. llvm-svn: 39354	2007-03-13 22:37:02 +00:00
Steve Naroff	f2fb89e759	Bug #: Submitted by: Reviewed by: Misc. cleanup/polish of NumericLiteralParser and it's two clients, the C preprocessor and AST builder... llvm-svn: 39353	2007-03-13 20:29:44 +00:00
Steve Naroff	451d8f1626	Bug #: Submitted by: Reviewed by: -Converted the preprocessor to use NumericLiteralParser. -Several minor changes to LiteralSupport interface/implementation. -Added an error diagnostic for floating point usage in pp expr's. llvm-svn: 39352	2007-03-12 23:22:38 +00:00
Steve Naroff	09ef474197	Bug #: Submitted by: Reviewed by: Moved numeric literal support from SemaExpr.cpp to LiteralSupport.[h,cpp] in Lex. This will allow it to be used by both Sema and Preprocessor (and should be the last major refactoring of this sub-system).. Over time, it will be reused by anyone implementing an actions module (i.e. any subclass of llvm::clang::Action. Minor changes to IntegerLiteral in Expr.h. More to come... llvm-svn: 39351	2007-03-09 23:16:33 +00:00
Chris Lattner	b055f2ddfc	switch to using iterators instead of stringmap visitors. llvm-svn: 39336	2007-02-11 08:19:57 +00:00
Chris Lattner	54d032bb76	CStringMap -> StringMap. llvm-svn: 39334	2007-02-08 19:24:25 +00:00
Chris Lattner	34d1f5a8de	adjust to CStringMap interface change. llvm-svn: 39333	2007-02-08 19:08:49 +00:00
Chris Lattner	9561a0b3e7	Add support for target-independent builtin functions (like __builtin_abs), whose decl objects are lazily created the first time they are referenced. Builtin functions are described by the clang/AST/Builtins.def file, which makes it easy to add new ones. This is missing two important pieces: 1. Support for the rest of the gcc builtins. 2. Support for target-specific builtins (e.g. __builtin_ia32_emms). Just adding this builtins reduces the number of implicit function definitions by 6, reducing the # diagnostics from 550 to 544 when parsing carbon.h. I need to add all the i386-specific ones to eliminate several hundred more. ugh. llvm-svn: 39327	2007-01-28 08:20:04 +00:00
Chris Lattner	b6738ec361	make LookupScopedDecl a method instead of a static function llvm-svn: 39326	2007-01-28 00:38:24 +00:00
Chris Lattner	5b9f4891d7	Add support for C++ operator keywords. Patch by Bill Wendling. llvm-svn: 39214	2006-11-21 17:23:33 +00:00
Chris Lattner	00348acace	clear file info after processing one file, it shouldn't carry over to the next. llvm-svn: 39211	2006-11-21 06:34:57 +00:00
Chris Lattner	b352e3edb5	Change KeepComments/KeepMacroComments modes to be facets of the preprocessor state, not aspects of the language standard being parsed. llvm-svn: 39209	2006-11-21 06:17:10 +00:00
Chris Lattner	ad7cdd37b3	simplify the Preprocessor ctor. llvm-svn: 39208	2006-11-21 06:08:20 +00:00
Chris Lattner	a92809b1ab	add an accessor llvm-svn: 39206	2006-11-21 05:52:33 +00:00
Chris Lattner	b8d6d5a81d	Formalize preprocessor callbacks together into a PPCallbacks structure, instead of having a loose collection of function pointers. This also allows clients to maintain state, and reduces the size of the Preprocessor.h interface. llvm-svn: 39203	2006-11-21 04:09:30 +00:00
Chris Lattner	f89b50c38d	init std::string with it's default ctor instead of "". llvm-svn: 39141	2006-11-06 06:37:47 +00:00
Chris Lattner	c07ba1fe2f	Refactor the paths used for checking and getting the spelling of #include filenames (and also '#pragma GCC dependency' of course). Now, assuming no cleaning is needed, we can go all the way from lexing the filename to doing filename lookups with no mallocs. This speeds up user PP time from 0.077 to 0.075s for Cocoa.h (2.6%). llvm-svn: 39092	2006-10-30 05:58:32 +00:00
Chris Lattner	b8b94f1e9b	Make Preprocessor::LookupFile take a character range instead of a string. This avoids some copying in its clients. llvm-svn: 39091	2006-10-30 05:38:06 +00:00
Chris Lattner	7cdbad945d	Push strings out of the HeaderSearch interface, it now deals solely with character ranges. llvm-svn: 39090	2006-10-30 05:33:15 +00:00
Chris Lattner	ee7bf89cd6	Change framework cache map from map to CStringMap. This speeds up PP user time on Cocoa.h from 0.078 to 0.077s. llvm-svn: 39089	2006-10-30 05:19:23 +00:00
Chris Lattner	2b9e19be87	Pull the string hashtable out of the IdentifierTable, moving into LLVM's libsupport. Now it can be used for other things besides identifier hashing. llvm-svn: 39079	2006-10-29 23:43:13 +00:00
Chris Lattner	ec659fce46	move memory allocation abstraction stuff out into LLVM's libsupport llvm-svn: 39078	2006-10-29 22:09:44 +00:00
Chris Lattner	cf163aa407	no need for these classes to be so friendly anymore. llvm-svn: 39077	2006-10-29 21:37:52 +00:00
Chris Lattner	3bc804ed3d	genericize IdentifierInfo interface to make it work more naturally. llvm-svn: 39076	2006-10-28 23:46:24 +00:00
Chris Lattner	bcb416bbd5	Implement test/Preprocessor/comment_save_if.c llvm-svn: 39069	2006-10-27 05:43:50 +00:00
Chris Lattner	1eb290b2e9	remove namelen field, it is now dead llvm-svn: 39064	2006-10-27 05:07:16 +00:00
Chris Lattner	56bdb9a9a1	Remove identifier length field from IdentifierInfo, it is now dead. llvm-svn: 39063	2006-10-27 05:06:38 +00:00
Chris Lattner	f2e3ac3b54	reimplement identifier hash table in terms of a probed table instead of a chained table. This is about 25% faster for identifier lookup. This also implements resizing of the hash table. llvm-svn: 39058	2006-10-27 03:59:10 +00:00
Chris Lattner	893f272c39	Track the full (not mod the hash table size) hash value for each token. This lets us find interesting properties of the hash distribution. llvm-svn: 39056	2006-10-26 05:12:31 +00:00
Chris Lattner	5c3ac11bf5	Reduce amount #included llvm-svn: 39035	2006-10-22 07:29:01 +00:00
Chris Lattner	25246dfeb0	Split the DirectoryLookup class out to its own header. llvm-svn: 39033	2006-10-22 07:26:52 +00:00
Chris Lattner	5ed76da296	Implement framework filesystem caching. llvm-svn: 39031	2006-10-22 07:24:13 +00:00
Chris Lattner	641a0be31b	count # framework lookups llvm-svn: 39026	2006-10-20 06:23:14 +00:00
Chris Lattner	63dd32b656	Implement subframework lookup llvm-svn: 39015	2006-10-20 04:42:40 +00:00
Chris Lattner	25e0d54a0e	Move keyword setup from the preprocessor into the IdentifierTable class. llvm-svn: 39014	2006-10-18 06:07:05 +00:00
Chris Lattner	59a9ebdb17	refactor header searching stuff out of the main Preprocessor object into it's own HeaderSearch object. This makes Preprocessor simpler and easier to understand. llvm-svn: 39012	2006-10-18 05:34:33 +00:00
Chris Lattner	04d1f3f75f	track whether DirectoryLookup dirs are framework dirs. llvm-svn: 39006	2006-10-17 06:20:32 +00:00
Chris Lattner	720f2700b1	Make the identifier table track objc keywords llvm-svn: 39003	2006-10-17 04:03:44 +00:00
Chris Lattner	87d3bec423	Make preprocessor keywords like 'define' first class citizens in the IdentifierTable, instead of making them resort to strcmp'ing. llvm-svn: 39002	2006-10-17 03:44:32 +00:00
Chris Lattner	063400e46e	Implement the #define_other_target directive. llvm-svn: 38984	2006-10-14 19:54:15 +00:00
Chris Lattner	81278c6356	Implement the #define_target preprocessor directive. llvm-svn: 38980	2006-10-14 19:03:49 +00:00
Chris Lattner	02dffbda3b	Write up TargetInfo so that use of wchar_t strings results in a warning if used in a target set where the size is not identical. llvm-svn: 38975	2006-10-14 07:50:21 +00:00
Chris Lattner	509d3c00ed	Rename LexerToken methods to be more consistent llvm-svn: 38970	2006-10-14 05:19:39 +00:00
Chris Lattner	d3e9895b9a	Initial support for semantic analysis and AST building for StringExpr nodes. llvm-svn: 38960	2006-10-06 05:22:26 +00:00
Chris Lattner	22dc378630	Split LangOptions out into its own header llvm-svn: 38806	2006-08-04 04:44:06 +00:00
Chris Lattner	34dfaee634	Misc cleanups llvm-svn: 38801	2006-07-31 01:57:54 +00:00
Chris Lattner	95a06b34f7	Simplify implementation of varargs macros by adding the __VA_ARGS__ token to the formal argument list of a C99 varargs macro. llvm-svn: 38800	2006-07-30 08:40:43 +00:00
Chris Lattner	457fc15bc5	Implement comment saving mode: the -C and -CC options. llvm-svn: 38783	2006-07-29 06:30:25 +00:00
Chris Lattner	775d832110	Implement the GNU comma swallowing extension. This implements test/Preprocessor/macro_fn_comma_swallow.c llvm-svn: 38780	2006-07-29 04:39:41 +00:00
Chris Lattner	21c8b8f71e	Implement support for __VA_ARGS__, allowing test/Preprocessor/macro_fn_varargs_iso.c to pass. llvm-svn: 38776	2006-07-29 04:04:22 +00:00
Chris Lattner	f9771166ed	add method to unpoison something for __VA_ARGS__. llvm-svn: 38772	2006-07-29 03:47:18 +00:00
Chris Lattner	6e4bf523e6	Implement C99 6.10.3.4p2, testcase here: Preprocessor/macro_disable3.c. llvm-svn: 38760	2006-07-27 06:59:25 +00:00
Chris Lattner	7a4af3b73d	Change Preprocessor::ReadFunctionLikeMacroArgs to use a SmallVector to lex argument tokens into instead of a real vector. This avoids some malloc traffic in common cases. In an "abusive macro expansion" testcase, this reduced -Eonly time by 25%. llvm-svn: 38757	2006-07-26 06:26:52 +00:00
Chris Lattner	c1410dc1e6	Change MacroArgs to allocate space for the unexpanded tokens immediately after the MacroArgs object itself. This is a bit more efficient and will be even more so shortly. llvm-svn: 38756	2006-07-26 05:22:49 +00:00
Chris Lattner	6fc08bc77d	Add a new getArgLength method and refactor some code to use it llvm-svn: 38755	2006-07-26 04:55:32 +00:00
Chris Lattner	7021657a0f	Implement a FIXME: don't copy token array into a token vector, instead, macroexpander should expand from an array directly. llvm-svn: 38754	2006-07-26 03:50:40 +00:00
Chris Lattner	36b6e8134e	speed up a brutal macro-expansion torture test by about 30% (1.5 -> 1.0s) by turning vectors of vectors into a single vector, reducing pressure on malloc. This can still be improved. llvm-svn: 38753	2006-07-21 06:38:30 +00:00
Chris Lattner	a5f4c882e2	disable malformed string/character errors when in raw mode. This fixes test/Lexer/badstring_in_if0.c llvm-svn: 38751	2006-07-20 06:08:47 +00:00
Chris Lattner	510ab61fd3	Add optimization for identifier##identifier -> identifier, the most common case of token pasting. llvm-svn: 38747	2006-07-20 04:47:30 +00:00
Chris Lattner	538d7f3c27	Simplify "raw lexing mode" even further. Now the preprocessor is only called into when a hard error is found. This simplifies logic and eliminates the need for the preprocessor to know about raw mode. llvm-svn: 38746	2006-07-20 04:31:52 +00:00
Chris Lattner	22549fe6a5	With recent simplifications, this check can be removed from a fastpath. llvm-svn: 38745	2006-07-20 04:20:02 +00:00
Chris Lattner	01ecf835c2	Implement basic token pasting (## operator). This implements test/Preprocessor/macro_paste_simple.c and macro_paste_bad.c. There are several known bugs still. llvm-svn: 38733	2006-07-19 05:42:48 +00:00
Chris Lattner	2183a6e8f4	Make end-of-file handling much less recursive. This reduces the worst case stack depth sampled by shark from ~34 to ~17 frames when preprocessing <iostream>. llvm-svn: 38726	2006-07-18 06:36:12 +00:00
Chris Lattner	7667d0d942	Implement support for lexing from a pre-constructed token stream. Use this support to implement function-like macro argument preexpansion. This implements test/Preprocessor/macro_fn_preexpand.c llvm-svn: 38724	2006-07-16 18:16:58 +00:00
Chris Lattner	203b4568e2	Implement basic argument substitution. This implements test/Preprocessor/macro_fn_disable_expand.c llvm-svn: 38720	2006-07-15 21:07:40 +00:00
Chris Lattner	ee8760b21b	Rename macroformalargs -> MacroArgs, as it represents the actual arguments, not the formal arguments, to a macro. llvm-svn: 38716	2006-07-15 07:42:55 +00:00
Chris Lattner	c653246bfb	Eliminate the IdentifierInfo::IsMacroArg flag. llvm-svn: 38715	2006-07-15 06:55:18 +00:00
Chris Lattner	c783d1dff9	Implement the microsoft charize extension #@ llvm-svn: 38712	2006-07-15 06:11:25 +00:00
Chris Lattner	2b271db205	Lex the microsoft 'charize' extension. llvm-svn: 38711	2006-07-15 05:41:09 +00:00
Chris Lattner	ecc39e9325	Change Lexer::Stringify to not add ""'s around the string. llvm-svn: 38708	2006-07-15 05:23:31 +00:00
Chris Lattner	b935d8cd90	Set up infrastructure for function-like macro expansion with preexpansion stringizing, etc. llvm-svn: 38707	2006-07-14 06:54:44 +00:00
Chris Lattner	b94ec7b668	Add an API so that external clients can create strings in the scratch buffer. llvm-svn: 38706	2006-07-14 06:54:10 +00:00
Chris Lattner	678c880a69	Move Preprocessor::isNextPPTokenLParen to Lexer::isNextPPTokenLParen, where it more rightly belongs. llvm-svn: 38702	2006-07-11 05:46:12 +00:00
Chris Lattner	3ebcf4e2cd	Change Preprocessor::SkippingContents into Lexer::LexingRawMode. Raw mode is an intra-lexer property, not a inter-lexer property, so it makes sense for it to be define here. It also makes no sense for macros, and allows us to define it more carefully in the header. While I'm at it, improve comments and structuring in Lexer.h llvm-svn: 38701	2006-07-11 05:39:23 +00:00
Chris Lattner	d8aee0e81b	Implement "lparen scanning" for lexer buffers, by making "skipping lexing" completely reversible. This implements tests 3/4 of test/Preprocessor/macro_fn_lparen_scan.c llvm-svn: 38699	2006-07-11 05:04:55 +00:00
Chris Lattner	370c135dce	improve comment llvm-svn: 38696	2006-07-11 04:03:32 +00:00
Chris Lattner	2acdac4200	Implement scanning-for-( more correctly. This implements test/Preprocessor/macro_fn_lparen_scan.c, but is not yet complete. llvm-svn: 38695	2006-07-11 04:02:48 +00:00
Chris Lattner	7818605f83	Read, remember, and validate the arguments provided the a function-style macro invocation. llvm-svn: 38685	2006-07-09 00:45:31 +00:00
Chris Lattner	8eede3e6c0	Remove pointless comments. llvm-svn: 38684	2006-07-08 23:24:05 +00:00
Chris Lattner	6e0d42c6f8	Add identifiers for macro arguments to MacroInfo, check for duplicates, enhance macro equality testing to verify argument lists match. llvm-svn: 38682	2006-07-08 20:32:52 +00:00
Chris Lattner	cefc768f5b	Start reading/validating the argument list for a function-like macro. llvm-svn: 38681	2006-07-08 08:28:12 +00:00
Chris Lattner	21284dfdd1	Implement checking for macro equality, C99 6.10.3.2 llvm-svn: 38680	2006-07-08 07:16:08 +00:00
Chris Lattner	e8eef3207b	add infrastructure for warning if redef'd macro bodies differ, but don't fully implement it. Fix warning on #define __LINE__ to warn about redefinition, not #undef. llvm-svn: 38679	2006-07-08 07:01:00 +00:00
Chris Lattner	8ff7199e4b	Warn about __VA_ARGS__ when used outside of a macro expansion llvm-svn: 38678	2006-07-06 05:17:39 +00:00
Chris Lattner	ef9eae1c44	Change the Preprocessor::getSpelling interface to let it be zero-copy in the common case. llvm-svn: 38671	2006-07-04 22:33:12 +00:00
Chris Lattner	5f084fd5fc	Add newline at end of file llvm-svn: 38657	2006-07-04 19:03:36 +00:00
Chris Lattner	e3519cc948	Change EvaluateValue/EvaluateDirectiveSubExpr to be static functions in PPExpressions.cpp instead of methods. llvm-svn: 38652	2006-07-04 18:11:39 +00:00
Chris Lattner	c79f6fb108	Rename IdentifierTokenInfo -> IdentifierInfo. llvm-svn: 38650	2006-07-04 17:53:21 +00:00
Chris Lattner	a8654ca2cf	Eliminate MultipleIncludeOpt::ReadDirective and all calls to it. Any directives that are lexed are made up of tokens, so the calls are just ugly and redundant. Hook up the MIOpt for the #if case. PPCExpressions doesn't currently implement the hook though, so we still don't handle #if !defined(X) with the MIOpt. llvm-svn: 38649	2006-07-04 17:42:08 +00:00
Chris Lattner	3665f161ca	Implement the multiple-include file optimization. llvm-svn: 38647	2006-07-04 07:26:10 +00:00
Chris Lattner	371ac8a9b7	Implement the automaton for recognizing files with controlling macros. llvm-svn: 38646	2006-07-04 07:11:10 +00:00
Chris Lattner	01d66cc891	Implement #ident and #sccs llvm-svn: 38643	2006-07-03 22:16:27 +00:00
Chris Lattner	d7de629c32	Move a method inline llvm-svn: 38639	2006-07-03 06:05:41 +00:00
Chris Lattner	bd07659488	Move a PragmaNamespace method out of line, add class comment for PragmaNamespace. llvm-svn: 38637	2006-07-03 05:34:49 +00:00
Chris Lattner	13044d942d	Implement -Wunused-macros functionality. llvm-svn: 38632	2006-07-03 05:16:44 +00:00
Chris Lattner	4ec473f871	Add support to track the real top-level file. llvm-svn: 38630	2006-07-03 05:16:05 +00:00
Chris Lattner	91cbf11c10	Add a new IdentifierVisitor class and a new IdentifierTable::VisitIdentifiers method to support iteration over all identifiers. llvm-svn: 38628	2006-07-03 04:28:52 +00:00
Chris Lattner	44f8a66bcc	Fix test/Preprocessor/macro_defined.c, factor some code. llvm-svn: 38627	2006-07-03 01:27:27 +00:00
Chris Lattner	e3e81ea8aa	Refactor some code into a new Lexer::Stringify method. llvm-svn: 38624	2006-07-03 01:13:26 +00:00
Chris Lattner	a37664b3fb	Move a virtual method out-of-line llvm-svn: 38617	2006-07-02 22:45:51 +00:00
Chris Lattner	1840e491dc	Remove Lexer::BufferStart, an unneeded instance var. llvm-svn: 38615	2006-07-02 22:30:01 +00:00
Chris Lattner	ecfeafe3ba	Fix some minor bugs handling _Pragma, including test/Preprocessor/_Pragma_syshdr.c llvm-svn: 38609	2006-07-02 21:26:45 +00:00
Chris Lattner	ef0dbae5ab	remove dead ivar llvm-svn: 38607	2006-07-02 21:17:13 +00:00
Chris Lattner	69772b026e	Implement the _Pragma-style of pragma handling, implementing test/Preprocessor/_Pragma-poison.c. This unifies the MacroStack and IncludeStack together into IncludeMacroStack. llvm-svn: 38606	2006-07-02 20:34:39 +00:00
Chris Lattner	4cca5ba7da	Allow the buffer start/end positions to be optionally specified. Make sure to use them instead of the current buffer start/end when computing diagnostics. llvm-svn: 38603	2006-07-02 20:05:54 +00:00
Chris Lattner	847e0e4552	Implement __TIMESTAMP__ llvm-svn: 38602	2006-07-01 23:49:16 +00:00
Chris Lattner	c1283b90a0	Implement __INCLUDE_LEVEL__ and __BASE_FILE__ llvm-svn: 38601	2006-07-01 23:16:30 +00:00
Chris Lattner	630b33c39e	Implement __FILE__ llvm-svn: 38600	2006-07-01 22:46:53 +00:00
Chris Lattner	c673f905d8	Implement the __TIME__ and __DATE__ builtin macros. llvm-svn: 38597	2006-06-30 06:10:41 +00:00
Chris Lattner	098dfc5e7e	Expose a new form of the getToken method. llvm-svn: 38595	2006-06-30 06:09:36 +00:00
Chris Lattner	0b8cfc2e69	Implement the __LINE__ builtin macro. llvm-svn: 38589	2006-06-28 06:49:17 +00:00
Chris Lattner	3690f1513a	Initial implementation of the ScratchBuffer class. llvm-svn: 38588	2006-06-28 06:48:36 +00:00
Chris Lattner	677757a2c0	Remove dead variables. Add initial support for builtin macros, including warning if they are defined or undefined. Register __LINE__ as a builtin macro. llvm-svn: 38586	2006-06-28 05:26:32 +00:00
Chris Lattner	274690ce76	Reindent comments. llvm-svn: 38585	2006-06-28 05:25:35 +00:00
Chris Lattner	f373a4af56	Refactor HandleIdentifier to pull macro expansion into its own method. llvm-svn: 38583	2006-06-26 06:16:29 +00:00
Chris Lattner	e9a5e18e47	remove some obsolete comments llvm-svn: 38582	2006-06-26 06:08:38 +00:00
Chris Lattner	67b07cb6fe	Implement Preprocessor/macro_expandloc.c by giving the optimized macro expansion case a correct source location. llvm-svn: 38580	2006-06-26 02:03:42 +00:00
Chris Lattner	269c232e67	implement #pragma GCC dependency llvm-svn: 38574	2006-06-25 06:23:00 +00:00
Chris Lattner	55a60954f9	Implement #pragma GCC system_header llvm-svn: 38569	2006-06-25 04:20:34 +00:00
Chris Lattner	1786217e0b	Finish implementation of #pragma once. Implement #pragma GCC poison. llvm-svn: 38568	2006-06-24 22:12:56 +00:00
Chris Lattner	b876183219	implement the pragma handling infrastructure. The only pragma so far is #pragma once, and it is not completely implemented. llvm-svn: 38566	2006-06-24 21:31:03 +00:00
Chris Lattner	c899718274	Track which headers are system and non-C++-clean-system headers. Use this information to print the 3/4 flags correctly on #line directives emitted in -E mode. llvm-svn: 38562	2006-06-22 05:52:16 +00:00
Chris Lattner	0c885f5581	Improve #line emission in -E mode to include file entry/exits. This is still pretty hacky because it doesn't compute the 3/4 markers correctly. llvm-svn: 38561	2006-06-21 06:50:18 +00:00
Chris Lattner	8bb4edb236	remove an extraneous method llvm-svn: 38554	2006-06-18 16:37:30 +00:00
Chris Lattner	50b497e072	Rename LexerToken::getSourceLocation -> getLocation llvm-svn: 38553	2006-06-18 16:32:35 +00:00
Chris Lattner	5b4876807a	Move the LexerToken definition out to LexerToken.h llvm-svn: 38552	2006-06-18 16:28:59 +00:00
Chris Lattner	d01e291332	Make a fundamental change to the way we represent the location of LexerToken's. Now, instead of keeping a pointer to the start of the token in memory, we keep the start of the token as a SourceLocation node. This means that each LexerToken knows the full include stack it was created with, and means that the LexerToken isn't reliant on a "CurLexer" member to be around (lexer tokens would previously go out of scope when their lexers were deallocated). This simplifies several things, and forces good cleanup elsewhere. Now the Preprocessor is the one that knows how to dump tokens/macros and is the one that knows how to get the spelling of a token (it has all the context). llvm-svn: 38551	2006-06-18 16:22:51 +00:00
Chris Lattner	7e0dd2b11f	Fix a fixme by passing language options into LexerToken::dump, instead of relying on TheLexer. llvm-svn: 38549	2006-06-18 07:44:41 +00:00
Chris Lattner	33ce7283ee	Change the token representation to take a Start and Length instead of a Start/End pointer. llvm-svn: 38548	2006-06-18 07:35:33 +00:00
Chris Lattner	1f5830546a	Make a method a static function llvm-svn: 38543	2006-06-18 06:53:56 +00:00
Chris Lattner	7966aafd9b	Simplify an API llvm-svn: 38541	2006-06-18 06:50:36 +00:00
Chris Lattner	cb28334ea4	Remove manual conditional error handling code. llvm-svn: 38540	2006-06-18 06:48:37 +00:00
Chris Lattner	22eb972f38	Initial checkin of c-language parser llvm-svn: 38539	2006-06-18 05:43:12 +00:00

... 15 16 17 18 19 ...

1150 Commits