llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	2946cd7010	Update the file headers across all of the LLVM projects in the monorepo to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636	2019-01-19 08:50:56 +00:00
Raphael Isemann	b23ccecbb0	Misc typos fixes in ./lib folder Summary: Found via `codespell -q 3 -I ../clang-whitelist.txt -L uint,importd,crasher,gonna,cant,ue,ons,orign,ned` Reviewers: teemperor Reviewed By: teemperor Subscribers: teemperor, jholewinski, jvesely, nhaehnle, whisperity, jfb, cfe-commits Differential Revision: https://reviews.llvm.org/D55475 llvm-svn: 348755	2018-12-10 12:37:46 +00:00
Erik Pilkington	fa98390b3c	NFC: Remove the ObjC1/ObjC2 distinction from clang (and related projects) We haven't supported compiling ObjC1 for a long time (and never will again), so there isn't any reason to keep these separate. This patch replaces LangOpts::ObjC1 and LangOpts::ObjC2 with LangOpts::ObjC. Differential revision: https://reviews.llvm.org/D53547 llvm-svn: 345637	2018-10-30 20:31:30 +00:00
Richard Smith	4e966e8135	Don't emit "will be treated as an identifier character" warning for UTF-8 characters that aren't identifier characters in the current language mode. llvm-svn: 343040	2018-09-25 22:34:45 +00:00
Sam McCall	3d8051abb8	[CodeComplete] Add completions for filenames in #include directives. Summary: The dir component ("somedir" in #include <somedir/fo...>) is considered fixed. We append "foo" to each directory on the include path, and then list its files. Completions are of the forms: #include <somedir/fo^ foo.h> fox/ The filter is set to the filename part ("fo"), so fuzzy matching can be applied to the filename only. No fancy scoring/priorities are set, and no information is added to CodeCompleteResult to make smart scoring possible. Could be in future. Reviewers: ilya-biryukov Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D52076 llvm-svn: 342449	2018-09-18 08:40:41 +00:00
Richard Smith	8ed7776bc4	PR38870: Add warning for zero-width unicode characters appearing in identifiers. llvm-svn: 341700	2018-09-07 19:25:39 +00:00
Adrian Prantl	9fc8faf9e6	Remove \brief commands from doxygen comments. This is similar to the LLVM change https://reviews.llvm.org/D46290. We've been running doxygen with the autobrief option for a couple of years now. This makes the \brief markers into our comments redundant. Since they are a visual distraction and we don't want to encourage more \brief markers in new code either, this patch removes them all. Patch produced by for i in $(git grep -l '\@brief'); do perl -pi -e 's/\@brief //g' $i & done for i in $(git grep -l '\\brief'); do perl -pi -e 's/\\brief //g' $i & done Differential Revision: https://reviews.llvm.org/D46320 llvm-svn: 331834	2018-05-09 01:00:01 +00:00
Richard Smith	b5f8171a1b	PR37189 Fix incorrect end source location and spelling for a split '>>' token. When a '>>' token is split into two '>' tokens (in C++11 onwards), or (as an extension) when we do the same for other tokens starting with a '>', we can't just use a location pointing to the first '>' as the location of the split token, because that would result in our miscomputing the length and spelling for the token. As a consequence, for example, a refactoring replacing 'A<X>' with something else would sometimes replace one character too many, and similarly diagnostics highlighting a template-id source range would highlight one character too many. Fix this by creating an expansion range covering the first character of the '>>' token, whose spelling is '>'. For this to work, we generalize the expansion range of a macro FileID to be either a token range (the common case) or a character range (used in this new case). llvm-svn: 331155	2018-04-30 05:25:48 +00:00
Ilya Biryukov	ef4ece75fd	[CodeComplete] Fix completion in the middle of ident in ctor lists. Summary: The example that was broken before (^ designates completion points): class Foo { Foo() : fie^ld^() {} // no completions were provided here. int field; }; To fix it we don't cut off lexing after an identifier followed by code completion token is lexed. Instead we skip the rest of identifier and continue lexing. This is consistent with behavior of completion when completion token is right before the identifier. Reviewers: sammccall, aaron.ballman, bkramer, sepavloff, arphaman, rsmith Reviewed By: aaron.ballman Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D44932 llvm-svn: 330833	2018-04-25 15:13:34 +00:00
Ilya Biryukov	b3510c4254	[CodeComplete] Fix completion at the end of keywords Summary: Make completion behave consistently no matter if it is run at the start, in the middle or at the end of an identifier that happens to be a keyword or a macro name. Since completion is often ran on incomplete identifiers, they may turn into keywords by accident. For example, we should produce same results for all of these completion points: // ^ is completion point. ^class cla^ss class^ Previously clang produced different results for the last case (as if the completion point was after a space: `class ^`). This change also updates some offsets in tests that (unintentionally?) relied on the old behavior. Reviewers: sammccall, bkramer, arphaman, aaron.ballman Reviewed By: sammccall Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D45887 llvm-svn: 330717	2018-04-24 13:48:53 +00:00
Alexander Kornienko	2a8c18d991	Fix typos in clang Found via codespell -q 3 -I ../clang-whitelist.txt Where whitelist consists of: archtype cas classs checkk compres definit frome iff inteval ith lod methode nd optin ot pres statics te thru Patch by luzpaz! (This is a subset of D44188 that applies cleanly with a few files that have dubious fixes reverted.) Differential revision: https://reviews.llvm.org/D44188 llvm-svn: 329399	2018-04-06 15:14:32 +00:00
Volodymyr Sapsai	abb8dfc114	[Lex] Avoid out-of-bounds dereference in LexAngledStringLiteral. Fix makes the loop in LexAngledStringLiteral more like the loops in LexStringLiteral, LexCharConstant. When we skip a character after backslash, we need to check if we reached the end of the file instead of reading the next character unconditionally. Discovered by OSS-Fuzz: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=3832 rdar://problem/35572754 Reviewers: arphaman, kcc, rsmith, dexonsmith Reviewed By: rsmith, dexonsmith Subscribers: cfe-commits, rsmith, dexonsmith Differential Revision: https://reviews.llvm.org/D41423 llvm-svn: 322390	2018-01-12 18:54:35 +00:00
Richard Smith	77091b167f	Warn if we find a Unicode homoglyph for a symbol in an identifier. Specifically, warn if: * we find a character that the language standard says we must treat as an identifier, and * that character is not reasonably an identifier character (it's a punctuation character or similar), and * it renders identically to a valid non-identifier character in common fixed-width fonts. Some tools "helpfully" substitute the surprising characters for the expected characters, and replacing semicolons with Greek question marks is a common "prank". llvm-svn: 320697	2017-12-14 13:15:08 +00:00
Eugene Zelenko	cb96ac64b0	[Lex] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 320207	2017-12-08 22:39:26 +00:00
Taewook Oh	cebac48bf7	Stringizing raw string literals containing newline Summary: This patch implements 4.3 of http://open-std.org/jtc1/sc22/wg21/docs/papers/2014/n4220.pdf. If a raw string contains a newline character, replace each newline character with the \n escape code. Without this patch, included test case (macro_raw_string.cpp) results compilation failure. Reviewers: rsmith, doug.gregor, jkorous-apple Reviewed By: jkorous-apple Subscribers: jkorous-apple, vsapsai, cfe-commits Differential Revision: https://reviews.llvm.org/D39279 llvm-svn: 319904	2017-12-06 17:00:53 +00:00
Aaron Ballman	c351fba69e	Now that C++17 is official (https://www.iso.org/standard/68564.html ), start changing the C++1z terminology over to C++17. NFC intended, these are all mechanical changes. llvm-svn: 319688	2017-12-04 20:27:34 +00:00
Richard Smith	edbf5972a4	[c++2a] P0515R3: lexer support for new <=> token. llvm-svn: 319509	2017-12-01 01:07:10 +00:00
Alex Lorenz	ebbbb81266	[refactor][extract] insert semicolons into extracted/inserted code when needed This commit implements the semicolon insertion logic into the extract refactoring. The following rules are used: - extracting expression: add terminating ';' to the extracted function. - extracting statements that don't require terminating ';' (e.g. switch): add terminating ';' to the callee. - extracting statements with ';': move (if possible) the original ';' from the callee and add terminating ';'. - otherwise, add ';' to both places. Differential Revision: https://reviews.llvm.org/D39441 llvm-svn: 317343	2017-11-03 18:11:22 +00:00
Aaron Ballman	606093a53b	Add -f[no-]double-square-bracket-attributes as new driver options to control use of [[]] attributes in all language modes. This is the initial implementation of WG14 N2165, which is a proposal to add [[]] attributes to C2x, but also allows you to enable these attributes in C++98, or disable them in C++11 or later. llvm-svn: 315856	2017-10-15 15:01:42 +00:00
Alex Lorenz	d5bf436d3a	[Lex] Avoid out-of-bounds dereference in SkipLineComment Credit to OSS-Fuzz for discovery: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=3145 rdar://34526482 llvm-svn: 315785	2017-10-14 01:18:30 +00:00
Alex Lorenz	c1e32fca96	A '<' with a trigraph '#' is not a valid editor placeholder Credit to OSS-Fuzz for discovery: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=3137#c5 rdar://34923985 llvm-svn: 315398	2017-10-11 00:41:20 +00:00
Cameron Desrochers	531aec2e90	Fixed unused variable warning introduced in r313796 causing build failure llvm-svn: 313802	2017-09-20 19:37:37 +00:00
Cameron Desrochers	84fd064ef9	[PCH] Fixed preamble breaking with BOM presence (and particularly, fluctuating BOM presence) This patch fixes broken preamble-skipping when the preamble region includes a byte order mark (BOM). Previously, parsing would fail if preamble PCH generation was enabled and a BOM was present. This also fixes preamble invalidation when a BOM appears or disappears. This may seem to be an obscure edge case, but it happens regularly with IDEs that pass buffer overrides that never (or always) have a BOM, yet the underlying file from the initial parse that generated a PCH might (or might not) have a BOM. I've included a test case for these scenarios. Differential Revision: https://reviews.llvm.org/D37491 llvm-svn: 313796	2017-09-20 19:03:37 +00:00
Erich Keane	e916d54614	[Preprocessor] Correct internal token parsing of newline characters in CRLF Correct implementation: Apparently I managed in r311683 to submit the wrong version of the patch for this, so I'm correcting it now. Differential Revision: https://reviews.llvm.org/D37079 llvm-svn: 312542	2017-09-05 17:32:36 +00:00
Erich Keane	5a2b322e0d	[Preprocessor] Correct internal token parsing of newline characters in CRLF Discovered due to a goofy git setup, the test system-headerline-directive.c (and a few others) failed because the token-consumption will consume only the '\r' in CRLF, making the preprocessor's printed value give the wrong line number when returning from an include. For example: (line 1):#include <noline.h>\r\n The "file exit" code causes the printer to try to print the 'returned to the main file' line. It looks up what the current line number is. However, since the current 'token' is the '\n' (since only the \r was consumed), it will give the line number as '1", not '2'. This results in a few failed tests, but more importantly, results in error messages being incorrect when compiling a previously preprocessed file. Differential Revision: https://reviews.llvm.org/D37079 llvm-svn: 311683	2017-08-24 18:36:07 +00:00
Alexander Kornienko	cf007a7614	[Lexer] Finding beginning of token with escaped new line Summary: Lexer::GetBeginningOfToken produced invalid location when backtracking across escaped new lines. This fixes PR26228 Reviewers: akyrtzi, alexfh, rsmith, doug.gregor Reviewed By: alexfh Subscribers: alexfh, cfe-commits Patch by Paweł Żukowski! Differential Revision: https://reviews.llvm.org/D30748 llvm-svn: 310576	2017-08-10 10:06:16 +00:00
Erik Verbruggen	795eee96b3	Fix invalid warnings for header guards in preambles Fixes https://bugs.llvm.org/show_bug.cgi?id=33574 Differential Revision: https://reviews.llvm.org/D34882 llvm-svn: 307134	2017-07-05 09:44:07 +00:00
Alex Lorenz	fb7654a8fc	[PR33394] Avoid lexing editor placeholders when Clang is used only for preprocessing r300667 added support for editor placeholder to Clang. That commit didn’t take into account that users who use Clang for preprocessing only (-E) will get the "editor placeholder in source file" error when preprocessing their source (PR33394). This commit ensures that Clang doesn't lex editor placeholders when running a preprocessor only action. rdar://32718000 Differential Revision: https://reviews.llvm.org/D34256 llvm-svn: 305576	2017-06-16 20:13:39 +00:00
Galina Kistanova	39edaaa65c	Added LLVM_FALLTHROUGH to address warning: this statement may fall through. NFC. llvm-svn: 304643	2017-06-03 06:25:47 +00:00
Erik Verbruggen	b34c79ff27	Allow for unfinished #if blocks in preambles Previously, a preamble only included #if blocks (and friends like ifdef) if there was a corresponding #endif before any declaration or definition. The problem is that any header file that uses include guards will not have a preamble generated, which can make code-completion very slow. To prevent errors about unbalanced preprocessor conditionals in the preamble, and unbalanced preprocessor conditionals after a preamble containing unfinished conditionals, the conditional stack is stored in the pch file. This fixes PR26045. Differential Revision: http://reviews.llvm.org/D15994 llvm-svn: 304207	2017-05-30 11:54:55 +00:00
Alex Lorenz	d47546612d	[Lexer] Ensure that the token is not an annotation token when retrieving the identifer info for an Objective-C keyword This commit fixes an assertion that's triggered in getIdentifier when the token is an annotation token. rdar://32225463 llvm-svn: 303246	2017-05-17 11:08:36 +00:00
Alex Lorenz	9c5c2bfe54	Add a fix-it for -Wunguarded-availability This patch adds a fix-it for the -Wunguarded-availability warning. This fix-it is similar to the Swift one: it suggests that you wrap the statement in an `if (@available)` check. The produced fixits are indented (just like the Swift ones) to make them look nice in Xcode's fix-it preview. rdar://31680358 Differential Revision: https://reviews.llvm.org/D32424 llvm-svn: 302253	2017-05-05 16:42:44 +00:00
Alex Lorenz	1be800c511	Add support for editor placeholders to Clang This commit teaches Clang to recognize editor placeholders that are produced when an IDE like Xcode inserts a code-completion result that includes a placeholder. Now when the lexer sees a placeholder token, it emits an 'editor placeholder in source file' error and creates an identifier token that represents the placeholder. The parser/sema can now recognize the placeholders and can suppress the diagnostics related to the placeholders. This ensures that live issues in an IDE like Xcode won't get spurious diagnostics related to placeholders. This commit also adds a new compiler option named '-fallow-editor-placeholders' that silences the 'editor placeholder in source file' error. This is useful for an IDE like Xcode as we don't want to display those errors in live issues. rdar://31581400 Differential Revision: https://reviews.llvm.org/D32081 llvm-svn: 300667	2017-04-19 08:58:56 +00:00
Richard Smith	4c132e576b	Do not warn about whitespace between ??/ trigraph and newline in line comments if trigraphs are disabled in the current language. llvm-svn: 300609	2017-04-18 21:45:04 +00:00
Richard Smith	1d2ae94b5d	Fix mishandling of escaped newlines followed by newlines or nuls. Previously, if an escaped newline was followed by a newline or a nul, we'd lex the escaped newline as a bogus space character. This led to a bunch of different broken corner cases: For the pattern "\\\n\0#", we would then have a (horizontal) space whose spelling ends in a newline, and would decide that the '#' is at the start of a line, and incorrectly start preprocessing a directive in the middle of a logical source line. If we were already in the middle of a directive, this would result in our attempting to process multiple directives at the same time! This resulted in crashes, asserts, and hangs on invalid input, as discovered by fuzz-testing. For the pattern "\\\n" at EOF (with an implicit following nul byte), we would produce a bogus trailing space character with spelling "\\\n". This was mostly harmless, but would lead to clang-format getting confused and misformatting in rare cases. We now produce a trailing EOF token with spelling "\\\n", consistent with our handling for other similar cases -- an escaped newline is always part of the token containing the next character, if any. For the pattern "\\\n\n", this was somewhat more benign, but would produce an extraneous whitespace token to clients who care about preserving whitespace. However, it turns out that our lexing for line comments was relying on this bug due to an off-by-one error in its computation of the end of the comment, on the slow path where the comment might contain escaped newlines. llvm-svn: 300515	2017-04-17 23:44:51 +00:00
Sanne Wouda	db1bdf472a	Skip Unicode character expansion in assembly files Summary: When using the C preprocessor with assembly files, either with a capital `S` file extension, or with `-xassembler-with-cpp`, the Unicode escape sequence `\u` is ignored. The `\u` pattern can be used for expanding a macro argument that starts with `u`. Author: Salman Arif <salman.arif@arm.com> Reviewers: rengolin, olista01 Reviewed By: olista01 Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D31765 llvm-svn: 299754	2017-04-07 10:13:00 +00:00
Eric Fiselier	cb2f326a75	Allow lexer to handle string_view literals. Patch from Anton Bikineev. This implements the compiler side of p0403r0. This patch was reviewed as https://reviews.llvm.org/D26829. llvm-svn: 290744	2016-12-30 04:51:10 +00:00
Justin Lebar	9091055efa	Move UTF functions into namespace llvm. Summary: This lets people link against LLVM and their own version of the UTF library. I determined this only affects llvm, clang, lld, and lldb by running $ git grep -wl 'UTF[0-9]\+\\|\bConvertUTF\bisLegalUTF\\|getNumBytesFor' \| cut -f 1 -d '/' \| sort \| uniq clang lld lldb llvm Tested with ninja lldb ninja check-clang check-llvm check-lld (ninja check-lldb doesn't complete for me with or without this patch.) Reviewers: rnk Subscribers: klimek, beanz, mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D24996 llvm-svn: 282822	2016-09-30 00:38:45 +00:00
Eugene Zelenko	e95e7d5d64	Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes. Differential revision: https://reviews.llvm.org/D24115 llvm-svn: 280870	2016-09-07 21:53:17 +00:00
Vassil Vassilev	644ea61d2d	Implement filtering for code completion of identifiers. Patch by Cristina Cristescu and Axel Naumann! Agreed on post commit review (D17820). llvm-svn: 276878	2016-07-27 14:56:59 +00:00
Benjamin Kramer	22f24f6815	[Lexer] Let the compiler infer string lengths. No functionality change intended. llvm-svn: 265126	2016-04-01 10:04:07 +00:00
Benjamin Kramer	e550bbdf9d	[Lexer] Don't read out of bounds if a conflict marker is at the end of a file This can happen as we look for '<<<<' while scanning tokens but then expect '<<<<\n' to tell apart perforce from diff3 conflict markers. Just harden the pointer arithmetic. Found by libfuzzer + asan! llvm-svn: 265125	2016-04-01 09:58:45 +00:00
Richard Smith	560a3579b2	Update diagnostics now that hexadecimal literals look likely to be part of C++17. llvm-svn: 262753	2016-03-04 22:32:06 +00:00
Richard Trieu	cc3949d99a	Remove use of builtin comma operator. Cleanup for upcoming Clang warning -Wcomma. No functionality change intended. llvm-svn: 261271	2016-02-18 22:34:54 +00:00
Anastasia Stulova	735c6cdebd	[OpenCL] Adding reserved operator logical xor for OpenCL This patch adds the reserved operator ^^ when compiling for OpenCL (spec v1.1 s6.3.g), which results in a more meaningful error message. Patch by Neil Hickey! Review: http://reviews.llvm.org/D13280 M test/SemaOpenCL/unsupported.cl M include/clang/Basic/TokenKinds.def M include/clang/Basic/DiagnosticParseKinds.td M lib/Basic/OperatorPrecedence.cpp M lib/Lex/Lexer.cpp M lib/Parse/ParseExpr.cpp llvm-svn: 259651	2016-02-03 15:17:14 +00:00
Richard Trieu	3a5c958182	Fix -Wnull-conversion for long macros. Move the function to get a macro name from DiagnosticRenderer.cpp to Lexer.cpp so that other files can use it. Lexer now has two functions to get the immediate macro name, the newly added one is better for diagnostic purposes. Make -Wnull-conversion use this function for better NULL macro detection. llvm-svn: 258778	2016-01-26 02:51:55 +00:00
Nico Weber	de2310bddf	Emit a -Wmicrosoft warning when treating ^Z as EOF in MS mode. llvm-svn: 256596	2015-12-29 23:17:27 +00:00
Vinicius Tinti	92e68c2766	[clang] Disable Unicode in asm files Clang should not convert tokens to Unicode when preprocessing assembly files. Fixes PR25558. llvm-svn: 253738	2015-11-20 23:42:39 +00:00
Craig Topper	7f5ff2175f	Use %select to merge similar diagnostics. NFC llvm-svn: 253119	2015-11-14 02:09:55 +00:00
Craig Topper	a6324c9463	Disable trigraph and escaped newline expansion on all types of raw string literals not just ASCII type. llvm-svn: 251025	2015-10-22 15:35:21 +00:00
Rafael Espindola	c0f18a91ac	Replace a few std::string& with StringRef. NFC. Patch by Косов Евгений! llvm-svn: 238774	2015-06-01 20:00:16 +00:00
Kostya Serebryany	6c2479bee4	Fix buffer overflow in Lexer Summary: Fix PR22407, where the Lexer overflows the buffer when parsing #include<\ (end of file after slash) Test Plan: Added a test that will trigger in asan build. This case is also covered by the clang-fuzzer bot. Reviewers: rnk Reviewed By: rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D9489 llvm-svn: 236466	2015-05-04 22:30:29 +00:00
Benjamin Kramer	f04f98d543	Use delegating ctors to reduce code duplication. NFC. llvm-svn: 231476	2015-03-06 14:15:57 +00:00
David Majnemer	5a54977ea8	Lex: Don't crash if both conflict markers are on the same line We would check if the terminator marker is on a newline. However, the logic would end up out-of-bounds if the terminator marker immediately follows the start marker. This fixes PR21820. llvm-svn: 224210	2014-12-14 04:53:11 +00:00
Richard Smith	3e3a705062	[c++1z] Support for u8 character literals. llvm-svn: 221576	2014-11-08 06:08:42 +00:00
Jay Foad	6af95d3864	Fix warning in Altivec code when building with GCC 4.8.2 on Ubuntu 14.04. llvm-svn: 220855	2014-10-29 14:42:12 +00:00
Aaron Ballman	dd69ef38db	C++1y is now C++14! Changes diagnostic options, language standard options, diagnostic identifiers, diagnostic wording to use c++14 instead of c++1y. It also modifies related test cases to use the updated diagnostic wording. llvm-svn: 215982	2014-08-19 15:55:55 +00:00
Rafael Espindola	cd0b380a3c	Use StringRef instead of MemoryBuffer&. This code doesn't care where the data it is processing comes from, so a StringRef is probably the most natural interface. llvm-svn: 215448	2014-08-12 15:46:24 +00:00
David Blaikie	3d95d858f9	Change MemoryBuffer* to MemoryBuffer& parameter to Lexer::ComputePreamble (dropping const from the reference as MemoryBuffer is immutable already, so const is just redundant - and while I'd personally put const everywhere, that's not the LLVM Way (see llvm::Type for another example of an immutable type where "const" is omitted for brevity)) Changing the pointer argument to a reference parameter makes call sites identical between callers with unique_ptrs or raw pointers, minimizing the churn in a pending unique_ptr migrations. llvm-svn: 215391	2014-08-11 22:08:06 +00:00
Alp Toker	d4a3f0e894	Hide the concept of diagnostic levels from lex, parse and sema The compilation pipeline doesn't actually need to know about the high-level concept of diagnostic mappings, and hiding the final computed level presents several simplifications and other potential benefits. The only exceptions are opportunistic checks to see whether expensive code paths can be avoided for diagnostics that are guaranteed to be ignored at a certain SourceLocation. This commit formalizes that invariant by introducing and using DiagnosticsEngine::isIgnored() in place of individual level checks throughout lex, parse and sema. llvm-svn: 211005	2014-06-15 23:30:39 +00:00
Alp Toker	7755aff556	Remove historical Unicode TODOs There's no immediate demand or plan to work on these. llvm-svn: 209090	2014-05-18 18:37:59 +00:00
Craig Topper	d2d442ca73	[C++11] Use 'nullptr'. Lex edition. llvm-svn: 209083	2014-05-17 23:10:59 +00:00
Alp Toker	2d57cea256	Provide and use a safe Token::getRawIdentifier() accessor llvm-svn: 209061	2014-05-17 04:53:25 +00:00
Roman Divacky	6150990d59	Revert r205436: Extend the SSE2 comment lexing to AVX2. Only 16byte align when not on AVX2. This provides some 3% speedup when preprocessing gcc.c as a single file. The patch is wrong, it always uses SSE2, and when I fix that there's no speedup at all. I am not sure where the 3% came from previously. --Thi lie, and those below, will be ignored-- M Lex/Lexer.cpp llvm-svn: 205548	2014-04-03 18:04:52 +00:00
Roman Divacky	071e830bbb	Extend the SSE2 comment lexing to AVX2. Only 16byte align when not on AVX2. This provides some 3% speedup when preprocessing gcc.c as a single file. llvm-svn: 205436	2014-04-02 17:27:03 +00:00
Benjamin Kramer	867ea1d426	[C++11] Replace llvm::tie with std::tie. llvm-svn: 202639	2014-03-02 13:01:17 +00:00
Richard Smith	35ddad0723	Fix a minor bug in lexing pp-numbers with digit separators: if a pp-number contains "'e+", the pp-number ends between the 'e' and the '+'. llvm-svn: 202533	2014-02-28 20:06:02 +00:00
Richard Smith	8b7258bdb3	PR18855: Add support for UCNs and UTF-8 encoding within ud-suffixes. llvm-svn: 201532	2014-02-17 21:52:30 +00:00
Alp Toker	bfa3934f27	Rename language option MicrosoftMode to MSVCCompat There's been long-standing confusion over the role of these two options. This commit makes the necessary changes to differentiate them clearly, following up from r198936. MicrosoftExt (aka. fms-extensions): Enable largely unobjectionable Microsoft language extensions to ease portability. This mode, also supported by gcc, is used for building software like FreeBSD and Linux kernel extensions that share code with Windows drivers. MSVCCompat (aka. -fms-compatibility, formerly MicrosoftMode): Turn on a special mode supporting 'heinous' extensions for drop-in compatibility with the Microsoft Visual C++ product. Standards-compilant C and C++ code isn't guaranteed to work in this mode. Implies MicrosoftExt. Note that full -fms-compatibility mode is currently enabled by default on the Windows target, which may need tuning to serve as a reasonable default. See cfe-commits for the full discourse, thread 'r198497 - Move MS predefined type_info out of InitializePredefinedMacros' No change in behaviour. llvm-svn: 199209	2014-01-14 12:51:41 +00:00
Chandler Carruth	5553d0d4ca	Sort all the #include lines with LLVM's utils/sort_includes.py which encodes the canonical rules for LLVM's style. I noticed this had drifted quite a bit when cleaning up LLVM, so wanted to clean up Clang as well. llvm-svn: 198686	2014-01-07 11:51:46 +00:00
Alp Toker	6de6da603e	Lexer: Issue -Wbackslash-newline-escape for line comments The warning for backslash and newline separated by whitespace was missed in this code path. backslash<whitespace><newline> is handled differently from compiler to compiler so it's important to warn consistently where there's ambiguity. Matches similar handling of block comments and non-comment lines. llvm-svn: 197331	2013-12-14 23:32:31 +00:00
Alp Toker	08c2500f9c	Fix raw lex crash and -frewrite-includes noeol-at-eof failure Raw lexers don't have a preprocessor so we need to null check. llvm-svn: 197245	2013-12-13 17:04:55 +00:00
Justin Bogner	5353513058	Lex: Don't restrict legal UCNs when preprocessing assembly The C and C++ standards disallow using universal character names to refer to some characters, such as basic ascii and control characters, so we reject these sequences in the lexer. However, when the preprocessor isn't being used on C or C++, it doesn't make sense to apply these restrictions. Notably, accepting these characters avoids issues with unicode escapes when GHC uses the compiler as a preprocessor on haskell sources. Fixes rdar://problem/14742289 llvm-svn: 193067	2013-10-21 05:02:28 +00:00
Richard Smith	7f2707a7f4	Per updates to D3781, allow underscore under ' in a pp-number, and allow ' in a #line directive. llvm-svn: 191443	2013-09-26 18:13:20 +00:00
Richard Smith	fde9485297	Implement C++1y digit separator proposal (' as a digit separator). This is not yet approved by full committee, but was unanimously supported by EWG. llvm-svn: 191417	2013-09-26 03:33:06 +00:00
Richard Smith	5acb759f39	Avoid a signed/unsigned comparison warning with compilers that don't know how to handle constant expressions. llvm-svn: 191336	2013-09-24 22:13:21 +00:00
Richard Smith	2a98862be2	Handle standard libraries that miss out the space when defining the standard literal operators. Also, for now, allow the proposed C++1y "il", "i", and "if" suffixes too. (Will revert the latter if LWG decides not to go ahead with that change after all.) llvm-svn: 191274	2013-09-24 04:06:10 +00:00
Eli Friedman	29749d2e3b	Fix use-after-free in r190980. llvm-svn: 190984	2013-09-19 01:51:23 +00:00
Eli Friedman	0834a4b901	Make Preprocessor::Lex non-recursive. Before this patch, Lex() would recurse whenever the current lexer changed (e.g. upon entry into a macro). This patch turns the recursion into a loop: the various lex routines now don't return a token when the current lexer changes, and at the top level Preprocessor::Lex() now loops until it finds a token. Normally, the recursion wouldn't end up being very deep, but the recursion depth can explode in edge cases like a bunch of consecutive macros which expand to nothing (like in the testcase test/Preprocessor/macro_expand_empty.c in this patch). <rdar://problem/14569770> llvm-svn: 190980	2013-09-19 00:41:32 +00:00
Alexander Kornienko	37d6b18633	Use new UnicodeCharSet interface. Summary: This is a Clang part of http://llvm-reviews.chandlerc.com/D1534 Reviewers: jordan_rose, klimek, rsmith Reviewed By: rsmith CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1535 llvm-svn: 189583	2013-08-29 12:12:31 +00:00
Eli Friedman	cefc7eafc6	Fix "//" comments with -traditional-cpp in C++. Apparently, gcc's -traditional-cpp behaves slightly differently in C++ mode; specifically, it discards "//" comments. Match gcc's behavior. <rdar://problem/14808126> llvm-svn: 189515	2013-08-28 20:53:32 +00:00
Jordan Rose	4c55d45b13	Respect -Wnewline-eof even in C++11 mode. If the user has requested this warning, we should emit it, even if it's not an extension in the current language mode. However, being an extension is more important, so prefer the pedantic warning or the pedantic-compatibility warning if those are enabled. <rdar://problem/12922063> llvm-svn: 189110	2013-08-23 15:42:01 +00:00
Fariborz Jahanian	d38ad47cfa	ObjectiveC migrator: More work towards insertion of ObjC audit pragmas. llvm-svn: 188733	2013-08-20 00:07:23 +00:00
Richard Smith	f4198b7598	C++1y literal suffix support: * Allow ns, us, ms, s, min, h as numeric ud-suffixes * Allow s as string ud-suffix llvm-svn: 186933	2013-07-23 08:14:48 +00:00
Michael J. Spencer	8c39840087	Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. llvm-svn: 182675	2013-05-24 21:42:04 +00:00
Argyrios Kyrtzidis	dc9fdaf217	[modules] If we hit a failure while loading a PCH/module, abort parsing instead of trying to continue in an invalid state. Also don't let libclang create a PCH with such an error. Fixes rdar://13953768 llvm-svn: 182629	2013-05-24 05:44:08 +00:00
Argyrios Kyrtzidis	065d720c31	[Lexer] Improve Lexer::getSourceText() when the given range deals with function macro arguments. This is a modified version of a patch by Manuel Klimek. llvm-svn: 182055	2013-05-16 21:37:39 +00:00
Richard Smith	0f7f6f1abc	Typo and misc comment fix. llvm-svn: 181583	2013-05-10 02:36:35 +00:00
Argyrios Kyrtzidis	0903f8dac5	[libclang] Make sure the preable does not truncate comments. rdar://13647445 llvm-svn: 179907	2013-04-19 23:24:25 +00:00
Richard Smith	06d274fdb7	Add -Wc99-compat warning for C11 unicode string and character literals. llvm-svn: 176817	2013-03-11 18:01:42 +00:00
Richard Smith	9b36209e31	When lexing in C11 mode, accept unicode character and string literals, per C11 6.4.4.4/1 and 6.4.5/1. llvm-svn: 176780	2013-03-09 23:56:02 +00:00
Jordan Rose	864b810739	Preprocessor: don't consider // to be a line comment in -E -std=c89 mode. It's beneficial when compiling to treat // as the start of a line comment even in -std=c89 mode, since it's not valid C code (with a few rare exceptions) and is usually intended as such. We emit a pedantic warning and then continue on as if line comments were enabled. This has been our behavior for quite some time. However, people use the preprocessor for things besides C source files. In today's prompting example, the input contains (unquoted) URLs, which contain // but should still be preserved. This change instructs the lexer to treat // as a plain token if Clang is in C90 mode and generating preprocessed output rather than actually compiling. <rdar://problem/13338743> llvm-svn: 176526	2013-03-05 22:51:04 +00:00
Jordan Rose	cb8a1aca35	Preprocessor: preserve whitespace in -traditional-cpp mode. Note that unlike GNU cpp we currently do not preserve whitespace in macros (even in -traditional-cpp mode). <rdar://problem/12897179> llvm-svn: 175778	2013-02-21 18:53:19 +00:00
Jordan Rose	58c61e006f	Properly validate UCNs for C99 and C++03 (both more restrictive than C(++)11). Add warnings under -Wc++11-compat, -Wc++98-compat, and -Wc99-compat when a particular UCN is incompatible with a different standard, and -Wunicode when a UCN refers to a surrogate character in C++03. llvm-svn: 174788	2013-02-09 01:10:25 +00:00
Jordan Rose	a2100d755a	Pull Lexer's CharInfo table out for general use throughout Clang. Rewriting the same predicates over and over again is bad for code size and code maintainence. Using the functions in <ctype.h> is generally unsafe unless they are specified to be locale-independent (i.e. only isdigit and isxdigit). The next commit will try to clean up uses of <ctype.h> functions within Clang. llvm-svn: 174765	2013-02-08 22:30:22 +00:00
Jordan Rose	cc538345be	Lexer: Don't warn about Unicode in preprocessor directives. This allows people to use Unicode in their #pragma mark and in macros that exist only to be string-ized. <rdar://problem/13107323&13121362> llvm-svn: 174081	2013-01-31 19:48:48 +00:00
Jordan Rose	f649795f84	Fix r173881 to properly skip invalid UTF-8 characters in raw lexing and -E. This caused hangs as we processed the same invalid byte over and over. <rdar://problem/13115651> llvm-svn: 173959	2013-01-30 19:21:12 +00:00
Dmitri Gribenko	9feeef40f5	Move UTF conversion routines from clang/lib/Basic to llvm/lib/Support This is required to use them in TableGen. llvm-svn: 173924	2013-01-30 12:06:08 +00:00
Jordan Rose	17441589c3	Don't warn about Unicode characters in -E mode. People use the C preprocessor for things other than C files. Some of them have Unicode characters. We shouldn't warn about Unicode characters appearing outside of identifiers in this case. There's not currently a way for the preprocessor to tell if it's in -E mode, so I added a new flag, derived from the PreprocessorOutputOptions. This is only used by the Unicode warnings for now, but could conceivably be used by other warnings or even behavioral differences later. <rdar://problem/13107323> llvm-svn: 173881	2013-01-30 01:52:57 +00:00
Jordan Rose	cccbdbf0db	PR15067 (again): Don't warn about UCNs in C90 if we're raw-lexing. Fixes a crash. Thanks, Richard. llvm-svn: 173701	2013-01-28 17:49:02 +00:00

1 2 3 4 5 ...

372 Commits