llvm-project

Commit Graph

Author	SHA1	Message	Date
Jan Svoboda	1d8882b5e4	[clang][deps] Fix failing modules.cpp test Commit `d8bab69ead` updated the ClangScanDeps/modules.cpp test. The new `{{.*}}` regex is supposed to only match `modules_cdb_input.o`, `a.o` or `b.o`. However, due to non-determinism, this can sometimes also match `modules_cdb_input2.o`, causing match failure on the next line. This commit changes the regex to only match one of the three valid cases. Buildbot failure: https://lab.llvm.org/buildbot/#/builders/109/builds/16675	2021-06-14 12:55:56 +02:00
Jan Svoboda	d8bab69ead	[clang][deps] Move invocation adjustments from `clang-scan-deps` to `DependencyScanning` library The `clang-scan-deps` tool has some logic that parses and modifies the original Clang command-line. The goal is to setup `DependencyOutputOptions` by injecting `-M -MT <target>` and prevent the creation of output files. This patch moves the logic into the `DependencyScanning` library, and uses the parsed `CompilerInvocation` instead of the raw command-line. The code simpler and can be used from the C++ API as well. The `-o /dev/null` arguments are not necessary, since the `DependencyScanning` library only runs a preprocessing action, so there's no way it'll produce an actual object file. Related: The `-M` argument implies `-w`, which would appear on the command-line of modular dependencies even though it was not on the original TU command line (see D104036). Some related tests were updated. Reviewed By: arphaman Differential Revision: https://reviews.llvm.org/D104030	2021-06-14 12:23:33 +02:00
Jan Svoboda	35fa3e60d1	[clang][deps] Move stripping of diagnostic serialization from `clang-scan-deps` to `DependencyScanning` library To prevent the creation of diagnostics file, `clang-scan-deps` strips the corresponding command-line argument. This behavior is useful even when using the C++ `DependencyScanner` library. This patch transforms stripping of command-line in `clang-scan-deps` into stripping of `CompilerInvocation` in `DependencyScanning`. AFAIK, the `clang-cl` driver doesn't even accept `--serialize-diagnostics`, so I've removed the test. (It would fail with an unknown command-line argument otherwise.) Note: Since we're generating command-lines for modular dependencies from `CompilerInvocation`, the `--serialize-diagnostics` will be dropped. This was already happening in `clang-scan-deps` before this patch, but it will now happen also when using `DependencyScanning` library directly. This is resolved in D104036. Reviewed By: dexonsmith, arphaman Differential Revision: https://reviews.llvm.org/D104012	2021-06-14 12:23:32 +02:00
Gabor Marton	8ddbb442b6	[Analyzer][solver] Simplify existing eq classes and constraints when a new constraint is added Update `setConstraint` to simplify existing equivalence classes when a new constraint is added. In this patch we iterate over all existing equivalence classes and constraints and try to simplfy them with simplifySVal. This solves problematic cases where we have two symbols in the tree, e.g.: ``` int test_rhs_further_constrained(int x, int y) { if (x + y != 0) return 0; if (y != 0) return 0; clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}} clang_analyzer_eval(y == 0); // expected-warning{{TRUE}} return 0; } ``` Differential Revision: https://reviews.llvm.org/D103314	2021-06-14 12:19:09 +02:00
Jan Svoboda	4629554f0b	[clang][deps] Handle modular dependencies present in PCH When a translation unit uses a PCH and imports the same modules as the PCH, we'd prefer to resolve to those modules instead of inventing new modules and reporting them as modular dependencies. Since the PCH modules have already been built nudge the compiler to reuse them when deciding whether to build a new module and don't report them as regular modular dependencies. Depends on D103524 & D103802. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D103526	2021-06-14 11:59:35 +02:00
Jan Svoboda	9223209be1	[clang][deps] Handle precompiled headers' AST files The `PreprocessOnlyAction` doesn't support loading the AST file of a precompiled header. This is problematic for dependency scanning, since the `#include` manufactured for the PCH is treated as textual. This means the PCH contents get scanned with each TU, which is redundant. Moreover, dependencies of the PCH end up being considered dependency of the TU. To handle AST file of PCH properly, this patch creates new `FrontendAction` that behaves the same way `PreprocessorOnlyAction` does, but treats the manufactured PCH `#include` as a normal compilation would (by not claiming it only uses a preprocessor and creating the default AST consumer). The AST file is now reported as a file dependency of the TU. Depends on D103519. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D103524	2021-06-14 11:28:39 +02:00
Jan Svoboda	78668c822a	[clang][modules][pch] Allow loading PCH with different modules cache path It's useful to be able to load explicitly-built PCH files into an implicit build (e.g. during dependency scanning). That's currently impossible, since the explicitly-built PCH has an empty modules cache path, while the current compilation has (and needs to have) a valid path, triggering an error in the `PCHValidator`. This patch adds a preprocessor option and command-line flag that can be used to omit this check. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D103802	2021-06-14 11:04:56 +02:00
Stephan Bergmann	b5b9489b24	Only consider built-in compound assignment operators for -Wunused-but-set-* At least LibreOffice has, for mainly historic reasons that would be hard to change now, a class Any with an overloaded operator >>= that semantically does not assign to the LHS but rather extracts into the (by-reference) RHS. Which thus caused false positive -Wunused-but-set-parameter and -Wunused-but-set-variable after those have been introduced recently. This change is more conservative about the assumed semantics of overloaded operators, excluding compound assignment operators but keeping plain operator = ones. At least for LibreOffice, that strikes a good balance of not producing false positives but still finding lots of true ones. (The change to the BinaryOperator case in MaybeDecrementCount is necessary because e.g. the template f4 test code in warn-unused-but-set-variables-cpp.cpp turns the += into a BinaryOperator.) Differential Revision: https://reviews.llvm.org/D103949	2021-06-14 08:04:03 +02:00
Fred Grim	673c5ba584	[clang-format] Adds a formatter for aligning arrays of structs This adds a new formatter to arrange array of struct initializers into neat columns. Differential Revision: https://reviews.llvm.org/D101868	2021-06-13 21:14:37 +02:00
Matheus Izvekov	bf20631782	[clang] Implement P2266 Simpler implicit move This Implements [[http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2021/p2266r1.html\|P2266 Simpler implicit move]]. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D99005	2021-06-13 12:10:56 +02:00
Matheus Izvekov	1e50c3d785	[clang] NRVO: Improvements and handling of more cases. This expands NRVO propagation for more cases: Parse analysis improvement: * Lambdas and Blocks with dependent return type can have their variables marked as NRVO Candidates. Variable instantiation improvements: * Fixes crash when instantiating NRVO variables in Blocks. * Functions, Lambdas, and Blocks which have auto return type have their variables' NRVO status propagated. For Blocks with non-auto return type, as a limitation, this propagation does not consider the actual return type. This also implements exclusion of VarDecls which are references to dependent types. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D99696	2021-06-12 16:43:32 +02:00
Alex Lorenz	22dea69231	[clang][ObjC] allow the use of NSAttributedString * argument type with format attribute This is useful for APIs that want to accept an attributed NSString as their format string rdar://79163229	2021-06-11 13:24:32 -07:00
Kevin Athey	e0b469ffa1	[clang-cl][sanitizer] Add -fsanitize-address-use-after-return to clang. Also: - add driver test (fsanitize-use-after-return.c) - add basic IR test (asan-use-after-return.cpp) - (NFC) cleaned up logic for generating table of __asan_stack_malloc depending on flag. for issue: https://github.com/google/sanitizers/issues/1394 Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D104076	2021-06-11 12:07:35 -07:00
eahcmrh	ce44fe199b	[Sema] Address-space sensitive check for unbounded arrays (v2) Check applied to unbounded (incomplete) arrays and pointers to spot cases where the computed address is beyond the largest possible addressable extent of the array, based on the address space in which the array is delcared, or which the pointer refers to. Check helps to avoid cases of nonsense pointer math and array indexing which could lead to linker failures or runtime exceptions. Of particular interest when building for embedded systems with small address spaces. This is version 2 of this patch -- version 1 had some testing issues due to a sign error in existing code. That error is corrected and lit test for this chagne is extended to verify the fix. Originally reviewed/accepted by: aaron.ballman Original revision: https://reviews.llvm.org/D86796 Reviewed By: aaron.ballman, ebevhan Differential Revision: https://reviews.llvm.org/D88174	2021-06-11 19:34:03 +02:00
Aaron En Ye Shi	f2cc0427b1	[HIP] Fix --hip-version flag with 0 as component Allow the usage of minor version 0, for hip versions such as 4.0. Change the default values when performing version checks. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D104062	2021-06-11 16:25:03 +00:00
eahcmrh	7e9822cc55	Revert "[Sema] Address-space sensitive check for unbounded arrays (v2)" This reverts commit `e42a347b74`.	2021-06-11 17:44:06 +02:00
eahcmrh	e42a347b74	[Sema] Address-space sensitive check for unbounded arrays (v2) Check applied to unbounded (incomplete) arrays and pointers to spot cases where the computed address is beyond the largest possible addressable extent of the array, based on the address space in which the array is delcared, or which the pointer refers to. Check helps to avoid cases of nonsense pointer math and array indexing which could lead to linker failures or runtime exceptions. Of particular interest when building for embedded systems with small address spaces. This is version 2 of this patch -- version 1 had some testing issues due to a sign error in existing code. That error is corrected and lit test for this chagne is extended to verify the fix. Originally reviewed/accepted by: aaron.ballman Original revision: https://reviews.llvm.org/D86796 Reviewed By: aaron.ballman, ebevhan Differential Revision: https://reviews.llvm.org/D88174	2021-06-11 17:36:16 +02:00
Matt Morehouse	0867edfc64	[HWASan] Add basic stack tagging support for LAM. Adds the basic instrumentation needed for stack tagging. Currently does not support stack short granules or TLS stack histories, since a different code path is followed for the callback instrumentation we use. We may simply wait to support these two features until we switch to a custom calling convention. Patch By: xiangzhangllvm, morehouse Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102901	2021-06-11 08:21:17 -07:00
Zahira Ammarguellat	150f7cedfb	Referencing a static function defined in an opnemp clause, is generating an erroneous warning. See here: https://godbolt.org/z/ajKPc36M7	2021-06-11 06:56:01 -07:00
Ole Strohm	b102e6880a	[OpenCL] Fix overloading resolution of addrspace constructors This fixes the prioritization of address spaces when choosing a constructor, stopping them from being considered equally good, which made the construction of types that could be constructed by more than one of the constructors. It does this by preferring the most specific address space, which is decided by seeing if one of the address spaces is a superset of the other, and preferring the other. Fixes: PR50329 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D102850	2021-06-11 11:12:33 +01:00
Bing1 Yu	56d5c46b49	[X86] Support __tile_stream_loadd intrinsic for new AMX interface Adding support for __tile_stream_loadd intrinsic. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D103784	2021-06-11 17:28:43 +08:00
Petr Hosek	22f194909a	Revert "[Driver] Support libc++ in MSVC" This reverts commit `9625d61eb6` since libc++ currently has issues with disabled exceptions which breaks the runtimes build.	2021-06-11 00:45:56 -07:00
Arthur Eubanks	85ca7e424f	Revert "[clang] NRVO: Improvements and handling of more cases." This reverts commit `667fbcdd0b`. Causes crashes on a stage 2 build on Windows.	2021-06-10 20:37:01 -07:00
Arthur Eubanks	db26615aa6	Revert "[clang] Implement P2266 Simpler implicit move" This reverts commit `cbd0054b9e`.	2021-06-10 19:54:50 -07:00
Nick Desaulniers	fc018ebb60	[IR] make -warn-frame-size into a module attr -Wframe-larger-than= is an interesting warning; we can't know the frame size until PrologueEpilogueInsertion (PEI); very late in the compilation pipeline. -Wframe-larger-than= was propagated through CC1 as an -mllvm flag, then was a cl::opt in LLVM's PEI pass; this meant it was dropped during LTO and needed to be re-specified via -plugin-opt. Instead, make it part of the IR proper as a module level attribute, similar to D103048. Introduce -fwarn-stack-size CC1 option. Reviewed By: rsmith, qcolombet Differential Revision: https://reviews.llvm.org/D103928	2021-06-10 16:15:27 -07:00
Matheus Izvekov	cbd0054b9e	[clang] Implement P2266 Simpler implicit move This Implements [[http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2021/p2266r1.html\|P2266 Simpler implicit move]]. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D99005	2021-06-11 00:56:06 +02:00
Matheus Izvekov	667fbcdd0b	[clang] NRVO: Improvements and handling of more cases. This expands NRVO propagation for more cases: Parse analysis improvement: * Lambdas and Blocks with dependent return type can have their variables marked as NRVO Candidates. Variable instantiation improvements: * Fixes crash when instantiating NRVO variables in Blocks. * Functions, Lambdas, and Blocks which have auto return type have their variables' NRVO status propagated. For Blocks with non-auto return type, as a limitation, this propagation does not consider the actual return type. This also implements exclusion of VarDecls which are references to dependent types. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D99696	2021-06-10 23:02:51 +02:00
Michael Kruse	a22236120f	[OpenMP] Implement '#pragma omp unroll'. Implementation of the unroll directive introduced in OpenMP 5.1. Follows the approach from D76342 for the tile directive (i.e. AST-based, not using the OpenMPIRBuilder). Tries to use `llvm.loop.unroll.*` metadata where possible, but has to fall back to an AST representation of the outer loop if the partially unrolled generated loop is associated with another directive (because it needs to compute the number of iterations). Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D99459	2021-06-10 14:30:17 -05:00
Melanie Blower	c3cc14f87f	Revert "[clang][FPEnv] Clang floatng point model ffp-model=precise enables ffp-contract=on" This reverts commit `8daac37140`. The build bots are showing some fails on broadwell and arm. Fix to LNT test suite needs work.	2021-06-10 12:19:02 -04:00
Markus Böck	c70b0e808d	[clang-cl] Add /permissive and /permissive- This patch adds the command line options /permissive and /permissive- to clang-cl. These flags are used in MSVC to enable various /Zc language conformance options at once. In particular, /permissive is used to enable the various non standard behaviour of MSVC, while /permissive- is the opposite. When either of two command lines are specified they are simply expanded to the various underlying /Zc options. In particular when /permissive is passed it currently expands to: /Zc:twoPhase- (disable two phase lookup) -fno-operator-names (disable C++ operator keywords) /permissive- expands to the opposites of these flags + /Zc:strictStrings (/Zc:strictStrings- does not currently exist). In the future, if any more MSVC workarounds are ever added they can easily be added to the expansion. One is also able to override settings done by permissive. Specifying /permissive- /Zc:twoPhase- will apply the settings from permissive minus, but disables two phase lookup. Motivation for this patch was mainly parity with MSVC as well as compatibility with Windows SDK headers. The /permissive page from MSVC documents various workarounds that have to be done for the Windows SDK headers [1], when MSVC is used with /permissive-. In these, Microsoft often recommends simply compiling with /permissive for the specified source files. Since some of these also apply to clang-cl (which acts like /permissive- by default mostly), and some are currently implemented as "hacks" within clang that I'd like to remove, adding /permissive and /permissive- to be in full parity with MSVC and Microsofts documentation made sense to me. [1] https://docs.microsoft.com/en-us/cpp/build/reference/permissive-standards-conformance?view=msvc-160#windows-header-issues Differential Revision: https://reviews.llvm.org/D103773	2021-06-10 17:06:19 +02:00
Markus Böck	936d6756cc	[clang][msvc] Define _HAS_STATIC_RTTI to 0, when compiling with -fno-rtti When using the -fno-rtti option of the GCC style clang++, using typeid results in an error. The MSVC STL however kindly provides a define flag called _HAS_STATIC_RTTI, which either enables or disables uses of typeid throughout the STL. By default, if undefined, it is set to 1, enabling the use of typeid. With this patch, _HAS_STATIC_RTTI is set to 0 when -fno-rtti is specified. This way various headers of the MSVC STL like functional can be consumed without compilation failures. Differential Revision: https://reviews.llvm.org/D103771	2021-06-10 17:02:44 +02:00
Markus Böck	9833b57981	[clang][driver] Add -foperator-names This patch adds the command line option -foperator-names which acts as the opposite of -fno-operator-names. With this command line option it is possible to reenable C++ operator keywords on the command line if -fno-operator-names had previously been passed. Differential Revision: https://reviews.llvm.org/D103749	2021-06-10 17:01:35 +02:00
Adam Czachorowski	49eba8bf17	[clang] Do not crash when ArgTy is null in CheckArgAlignment This can happen around RecoveryExpr. Differential Revision: https://reviews.llvm.org/D103825	2021-06-10 16:54:15 +02:00
Melanie Blower	8daac37140	[clang][FPEnv] Clang floatng point model ffp-model=precise enables ffp-contract=on This patch changes the ffp-model=precise to enables -ffp-contract=on (previously -ffp-model=precise enabled -ffp-contract=fast). This is a follow-up to Andy Kaylor's comments in the llvm-dev discussion "Floating Point semantic modes". From the same email thread, I put Andy's distillation of floating point options and floating point modes into UsersManual.rst Differential Revision: https://reviews.llvm.org/D74436	2021-06-10 09:30:41 -04:00
Yaxun (Sam) Liu	734213d7b5	Fix test hip-device-compile.hip Add stricter check for 'ld'.	2021-06-10 09:13:27 -04:00
Joseph Huber	0c32ffceed	[OpenMP] Add type to firstprivate symbol for const firstprivate values Clang will create a global value put in constant memory if an aggregate value is declared firstprivate in the target device. The symbol name only uses the name of the firstprivate variable, so symbol name conflicts will occur if the variable is allowed to have different types through templates. An example of this behvaiour is shown in https://godbolt.org/z/EsMjYh47n. This patch adds the mangled type name to the symbol to avoid such naming conflicts. This fixes https://bugs.llvm.org/show_bug.cgi?id=50642. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D103995	2021-06-10 09:02:20 -04:00
Adam Czachorowski	a95937452f	[clang] Make CXXDefaultArgExpr inherit dependence from the inner Expr Before this change, CXXDefaultArgExpr would always have ExprDependence::None. This can lead to issues when, for example, the inner expression is RecoveryExpr and yet containsErrors() on the default expression is false. Differential Revision: https://reviews.llvm.org/D103982	2021-06-10 14:51:08 +02:00
Sanjay Patel	cc86b87a57	[CodeGen] limit tests to current pass manager to avoid variability; NFC Post-commit feedback for `d69c4372bf` says the output may vary between pass managers. This is hopefully a quick fix, but we might want to investigate how to better solve this type of problem.	2021-06-10 08:50:06 -04:00
David Spickett	7fb1f62d12	[clang][Arm] Require arm and aarch64 target for bf16 intrinsics test	2021-06-10 09:45:10 +00:00
Ole Strohm	ac677e69bd	[OpenCL] Relax test implicit members in C++ for OpenCL Addresses the issue from a comment in D103252 Relaxes the test to account for some targets with added attributes to inside the pattern.	2021-06-10 10:18:52 +01:00
Sven van Haastregt	c5ffc6f8bd	[OpenCL] Add builtin header test Add a test to verify OpenCL builtin declarations using OpenCLBuiltins.td. This test consists of parsing a 60k line generated input file. The entire test takes about 60s with a debug build on a decent machine. Admittedly this is not the fastest test, but doesn't seem excessive compared to other tests in clang/test/Headers (with one of the tests taking 85s for example). RFC: https://lists.llvm.org/pipermail/cfe-dev/2021-April/067973.html Differential Revision: https://reviews.llvm.org/D97869	2021-06-10 10:05:53 +01:00
Yaxun (Sam) Liu	5fc2673fbc	[HIP] Add --gpu-bundle-output Added --gpu-bundle-output to control bundling/unbundling output of HIP device compilation. By default preprocessor expansion, llvm bitcode and assembly are unbundled, code objects are bundled. Reviewed by: Artem Belevich, Jan Svoboda Differential Revision: https://reviews.llvm.org/D101630	2021-06-09 23:31:43 -04:00
Ben Shi	b0eb391983	[clang][RISCV][test] Add more tests of the -mabi and -march options 1. There is no tests for mabi=ilp32e, and my patch covers that. 2. The tests in riscv-abi.c will show default ABI changes for special archs in the future, especially the arch with the F but without the D extension. 3. The tests in riscv-arch.c will show default arch changes for abi=ilp32, which is rv32imacfd currently, but it is better to be rv32imac. And it is also better for abi=ilp32f defaults to arch=imacf. Reviewed By: MaskRay, luismarques Differential Revision: https://reviews.llvm.org/D103878	2021-06-10 09:14:14 +08:00
Hongtao Yu	64b2fb7967	[CSSPGO] Emit mangled dwarf names for line tables debug option under -fpseudo-probe-for-profiling Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D103909	2021-06-09 10:46:03 -07:00
AndreyChurbanov	9ce2e5e700	Revert "[OpenMP] libomp: implement OpenMP 5.1 inoutset task dependence type" This reverts commit `a1f550e052`. Revert in order to fix backwards compatibility breakage caused by type size change for task dependence flag.	2021-06-09 17:38:38 +03:00
Nathan Sidwell	c1cd743519	[clang] p1099 using-enum feature macro & web page This completes the series implementing p1099, by adding the feature macro and updating the web page. Differential Revision: https://reviews.llvm.org/D102242	2021-06-09 05:40:53 -07:00
Hans Wennborg	64dbd649cf	[clang-cl] Parse /await:strict, new in MSVC 16.10	2021-06-09 14:27:05 +02:00
Aaron Ballman	c92f505346	Correct the behavior of va_arg checking in C++ Clang checks whether the type given to va_arg will automatically cause undefined behavior, but this check was issuing false positives for enumerations in C++. The issue turned out to be because typesAreCompatible() in C++ checks whether the types are the same, so this uses custom logic if the type compatibility check fails. This issue was found by a user on code like: typedef enum { CURLINFO_NONE, CURLINFO_EFFECTIVE_URL, CURLINFO_LASTONE = 60 } CURLINFO; ... __builtin_va_arg(list, CURLINFO); // false positive warning Given that C++ defers to C for the rules around va_arg, the behavior should be the same in both C and C++ and not diagnose because int and CURLINFO are "compatible enough" types for va_arg.	2021-06-09 07:18:32 -04:00
Matheus Izvekov	aef5d8fdc7	[clang] NFC: Rename rvalue to prvalue This renames the expression value categories from rvalue to prvalue, keeping nomenclature consistent with C++11 onwards. C++ has the most complicated taxonomy here, and every other language only uses a subset of it, so it's less confusing to use the C++ names consistently, and mentally remap to the C names when working on that context (prvalue -> rvalue, no xvalues, etc). Renames: * VK_RValue -> VK_PRValue * Expr::isRValue -> Expr::isPRValue * SK_QualificationConversionRValue -> SK_QualificationConversionPRValue * JSON AST Dumper Expression nodes value category: "rvalue" -> "prvalue" Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D103720	2021-06-09 12:27:10 +02:00
Brendon Cahoon	294efbbd3e	Reland "[AMDGPU] Add gfx1013 target" This reverts commit `211e584fa2`. Fixed a use-after-free error that caused the sanitizers to fail.	2021-06-08 21:15:35 -04:00
Brendon Cahoon	211e584fa2	Revert "[AMDGPU] Add gfx1013 target" This reverts commit `ea10a86984`. A sanitizer buildbot reports an error.	2021-06-08 16:29:41 -04:00
Sanjay Patel	d69c4372bf	[CodeGen] remove instcombine from codegen tests; NFC The FileCheck lines in these files are auto-generated and complete, so there's very little upside (less CHECK lines) from running -instcombine on them and violating the expected test layering (optimizer developers shouldn't have to be aware of clang tests). Running opt passes like this makes it harder to make changes such as: D93817	2021-06-08 15:31:15 -04:00
Nathan Sidwell	b2d0c16e91	[clang] p1099 using enum part 2 This implements the 'using enum maybe-qualified-enum-tag ;' part of 1099. It introduces a new 'UsingEnumDecl', subclassed from 'BaseUsingDecl'. Much of the diff is the boilerplate needed to get the new class set up. There is one case where we accept ill-formed, but I believe this is merely an extended case of an existing bug, so consider it orthogonal. AFAICT in class-scope the c++20 rule is that no 2 using decls can bring in the same target decl ([namespace.udecl]/8). But we already accept: struct A { enum { a }; }; struct B : A { using A::a; }; struct C : B { using A::a; using B::a; }; // same enumerator this patch permits mixtures of 'using enum Bob;' and 'using Bob::member;' in the same way. Differential Revision: https://reviews.llvm.org/D102241	2021-06-08 11:11:46 -07:00
Nick Desaulniers	3787ee4571	reland [IR] make -stack-alignment= into a module attr Relands commit `433c8d950c` with fixes for MIPS. Similar to D102742, specifying the stack alignment via CodegenOpts means that this flag gets dropped during LTO, unless the command line is re-specified as a plugin opt. Instead, encode this information as a module level attribute so that we don't have to expose this llvm internal flag when linking the Linux kernel with LTO. Looks like external dependencies might need a fix: * https://github.com/llvm-hs/llvm-hs/issues/345 * https://github.com/halide/Halide/issues/6079 Link: https://github.com/ClangBuiltLinux/linux/issues/1377 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D103048	2021-06-08 10:59:46 -07:00
Brendon Cahoon	ea10a86984	[AMDGPU] Add gfx1013 target Differential Revision: https://reviews.llvm.org/D103663	2021-06-08 12:49:49 -04:00
Hans Wennborg	172fcd9600	[clang-cl] Parse the /external: flags (PR36003) They are still unsupported, but at least this makes clang-cl not mistake them for being filenames. As pointed out in the bug, VS 16.10 now uses these flags in new projects by default.	2021-06-08 18:00:03 +02:00
Nick Desaulniers	a596b54d47	Revert "[IR] make -stack-alignment= into a module attr" This reverts commit `433c8d950c`. Breaks the MIPS build.	2021-06-08 08:55:50 -07:00
Nick Desaulniers	433c8d950c	[IR] make -stack-alignment= into a module attr Similar to D102742, specifying the stack alignment via CodegenOpts means that this flag gets dropped during LTO, unless the command line is re-specified as a plugin opt. Instead, encode this information as a module level attribute so that we don't have to expose this llvm internal flag when linking the Linux kernel with LTO. Looks like external dependencies might need a fix: * https://github.com/llvm-hs/llvm-hs/issues/345 * https://github.com/halide/Halide/issues/6079 Link: https://github.com/ClangBuiltLinux/linux/issues/1377 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D103048	2021-06-08 08:31:04 -07:00
Yaxun (Sam) Liu	054cc3b1b4	[CUDA][HIP] Fix store of vtbl in ctor vtbl itself is in default global address space. When clang emits ctor, it gets a pointer to the vtbl field based on the this pointer, then stores vtbl to the pointer. Since this pointer can point to any address space (e.g. an object created in stack), this pointer points to default address space, therefore the pointer to vtbl field in this object should also be in default address space. Currently, clang incorrectly casts the pointer to vtbl field in this object to global address space. This caused assertions in backend. This patch fixes that by removing the incorrect addr space cast. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D103835	2021-06-08 10:24:44 -04:00
Nathan Sidwell	012898b92c	[clang] p1099 using enum part 1 This adds support for p1099's 'using SCOPED_ENUM::MEMBER;' functionality, bringing a member of an enumerator into the current scope. The novel feature here, is that there need not be a class hierarchical relationship between the current scope and the scope of the SCOPED_ENUM. That's a new thing, the closest equivalent is a typedef or alias declaration. But this means that Sema::CheckUsingDeclQualifier needs adjustment. (a) one can't call it until one knows the set of decls that are being referenced -- if exactly one is an enumerator, we're in the new territory. Thus it needs calling later in some cases. Also (b) there are two ways we hold the set of such decls. During parsing (or instantiating a dependent scope) we have a lookup result, and during instantiation we have a set of shadow decls. Thus two optional arguments, at most one of which should be non-null. Differential Revision: https://reviews.llvm.org/D100276	2021-06-08 04:40:42 -07:00
Sven van Haastregt	d54e7b731e	[OpenCL] Add memory_scope_all_devices Add the `memory_scope_all_devices` enum value, which is restricted to OpenCL 3.0 or newer and the `__opencl_c_atomic_scope_all_devices` feature. Also guard `memory_scope_all_svm_devices` accordingly, which is already available in OpenCL 2.0. The `__opencl_c_atomic_scope_all_devices` feature is header-only, so set its define to 1 in `opencl-c-base.h`. This is done unconditionally at the moment, as the mechanism for disabling header-only options hasn't been decided yet. This patch only adds a negative test for now. Ideally adding a CL3.0 run line to atomic-ops.cl should suffice as a positive test, but we cannot do that yet until (at least) generic address spaces and program scope variables are supported in OpenCL 3.0 mode. Differential Revision: https://reviews.llvm.org/D103241	2021-06-08 11:51:12 +01:00
Martin Storsjö	b34da6ff9c	[clang] Apply MS ABI details on __builtin_ms_va_list on non-windows platforms on x86_64 This fixes inconsistencies in the ms_abi.c testcase. Also add a couple cases of missing double pointers in the windows part of the testcase; the outcome of building that testcase on windows hasn't changed, but the previous form of the test was imprecise (checking for "%[[STRUCT_FOO]]" when clang actually generates "%[[STRUCT_FOO]]*"), which still used to match. Ideally this would share code with the native Windows case, but X86_64ABIInfo and WinX86_64ABIInfo aren't superclasses/subclasses of each other so it's impractical, and the code to share currently only consists of a couple lines. Differential Revision: https://reviews.llvm.org/D103837	2021-06-08 12:14:12 +03:00
Petr Hosek	9625d61eb6	[Driver] Support libc++ in MSVC This implements support for using libc++ headers and library in the MSVC toolchain. We only support libc++ that is a part of the toolchain, and not headers installed elsewhere on the system. Differential Revision: https://reviews.llvm.org/D101479	2021-06-07 23:36:10 -07:00
Jake.Egan	f38eff777e	[AIX] Define __STDC_NO_ATOMICS__ and __STDC_NO_THREADS__ Revert/reapply to fix Git authorship metadata Differential Revision: https://reviews.llvm.org/D103707	2021-06-07 22:45:41 -04:00
Chris Bowler	f97e01e61a	Revert "[AIX] Define __STDC_NO_ATOMICS__ and __STDC_NO_THREADS__ predefined macros" This reverts commit `e6629be31e`.	2021-06-07 22:45:41 -04:00
=Jake Egan	e6629be31e	[AIX] Define __STDC_NO_ATOMICS__ and __STDC_NO_THREADS__ predefined macros Differential Revision: https://reviews.llvm.org/D103707	2021-06-07 22:04:18 -04:00
Jian Cai	9145a3d4ab	Revert "[AArch64] handle -Wa,-march=" This reverts commit `fd11a26d36`.	2021-06-07 14:31:07 -07:00
Harald van Dijk	75521bd9d8	[X32] Add Triple::isX32(), use it. So far, support for x86_64-linux-gnux32 has been handled by explicit comparisons of Triple.getEnvironment() to GNUX32. This worked as long as x86_64-linux-gnux32 was the only X32 environment to worry about, but we now have x86_64-linux-muslx32 as well. To support this, this change adds an isX32() function and uses it. It replaces all checks for GNUX32 or MuslX32 by isX32(), except for the following: - Triple::isGNUEnvironment() and Triple::isMusl() are supposed to treat GNUX32 and MuslX32 differently. - computeTargetTriple() needs to be able to transform triples to add or remove X32 from the environment and needs to map GNU to GNUX32, and Musl to MuslX32. - getMultiarchTriple() completely lacks any Musl support and retains the explicit check for GNUX32 as it can only return x86_64-linux-gnux32. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D103777	2021-06-07 20:48:39 +01:00
Martin Storsjö	6de45b9e6a	[clang] Fix reading long doubles with va_arg on x86_64 mingw On x86_64 mingw, long doubles are always passed indirectly as arguments (see an existing case in WinX86_64ABIInfo::classify); generalize the existing code for reading varargs - any non-aggregate type that is larger than 64 bits (which would be both long double in mingw, and __int128) are passed indirectly too. This makes reading varargs consistent with how they're passed, fixing interop with both gcc and clang callers, for long double and __int128. Differential Revision: https://reviews.llvm.org/D103452	2021-06-07 22:34:10 +03:00
AndreyChurbanov	a1f550e052	[OpenMP] libomp: implement OpenMP 5.1 inoutset task dependence type Refactored code of dependence processing and added new inoutset dependence type. Compiler can set dependence flag to 0x8 when call __kmpc_omp_task_with_deps. Size of type of the dependence flag changed from 1 to 4 bytes in clang. All dependence flags library gets so far and corresponding dependence types: 1 - IN, 2 - OUT, 3 - INOUT, 4 - MUTEXINOUTSET, 8 - INOUTSET. Differential Revision: https://reviews.llvm.org/D97085	2021-06-07 21:42:51 +03:00
Jian Cai	fd11a26d36	[AArch64] handle -Wa,-march= This fixed PR#48894 for AArch64. The issue has been fixed for Arm in https://reviews.llvm.org/D95872 The following rules apply to -Wa,-march with this change: - Only compiler options apply to non assembly files - Compiler and assembler options apply to assembly files - For assembly files, we prefer the assembler option(s) if we have both kinds of option - Of the options that apply (or are preferred), the last value wins (it's not additive) Reviewed By: DavidSpickett, nickdesaulniers Differential Revision: https://reviews.llvm.org/D103184	2021-06-07 10:15:53 -07:00
Hsiangkai Wang	2b13ff6979	[Clang][CodeGen] Set the size of llvm.lifetime to unknown for scalable types. If the memory object is scalable type, we do not know the exact size of it at compile time. Set the size of lifetime marker to unknown if the object is scalable one. Differential Revision: https://reviews.llvm.org/D102822	2021-06-07 23:30:13 +08:00
Bradley Smith	60c9b5f35c	[AArch64][SVE] Improve codegen for dupq SVE ACLE intrinsics Use llvm.experimental.vector.insert instead of storing into an alloca when generating code for these intrinsics. This defers the codegen of the generated vector to instruction selection, allowing existing shufflevector style optimizations to apply. Additionally, introduce a new target transform that can recognise fixed predicate patterns in the svbool variants of these intrinsics. Differential Revision: https://reviews.llvm.org/D103082	2021-06-07 12:21:38 +01:00
Andrew Savonichev	b31f41e78b	[Clang] Support a user-defined __dso_handle This fixes PR49198: Wrong usage of __dso_handle in user code leads to a compiler crash. When Init is an address of the global itself, we need to track it across RAUW. Otherwise the initializer can be destroyed if the global is replaced. Differential Revision: https://reviews.llvm.org/D101156	2021-06-07 12:54:08 +03:00
Ole Strohm	438cf5577e	[OpenCL] Fix missing addrspace on implicit move assignment operator This fixes the missing address space on `this` in the implicit move assignment operator. The function called here is an abstraction around the lines that have been removed which also sets the address space correctly. This is copied from CopyConstructor, CopyAssignment and MoveConstructor, all of which use this function, and now MoveAssignment does too. Fixes: PR50259 Reviewed By: svenvh Differential Revision: https://reviews.llvm.org/D103252	2021-06-07 09:37:53 +01:00
Aaron Ballman	ca68f3bc48	Fix a diagnoses-valid bug with using declarations The following was found by a customer and is accepted by the other primary C++ compilers, but fails to compile in Clang: namespace sss { double foo(int, double); template <class T> T foo(T); // note: target of using declaration } // namespace sss namespace oad { void foo(); } namespace oad { using ::sss::foo; } namespace sss { using oad::foo; // note: using declaration } namespace sss { double foo(int, double) { return 0; } template <class T> T foo(T t) { // error: declaration conflicts with target of using return t; } } // namespace sss I believe the issue is that MergeFunctionDecl() was calling checkUsingShadowRedecl() but only considering a FunctionDecl as a possible shadow and not FunctionTemplateDecl. The changes in this patch largely mirror how variable declarations were being handled by also catching FunctionTemplateDecl.	2021-06-04 15:52:07 -04:00
Jason Zheng	333987b045	[OpenCL] Add DWARF address spaces mapping for SPIR Extend debug info handling by adding DWARF address space mapping for SPIR, with corresponding test case. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D103097	2021-06-04 18:10:54 +01:00
Konstantin Zhuravlyov	4d9f8527db	CUDA/HIP: Change device-use-host-var.cu's NOT "external" check to include variable name Otherwise it is causing one of our build jobs to fail, it is using "external" as directory, and NOT is failing because "external" is found in ModuleID. Differential Revision: https://reviews.llvm.org/D103658	2021-06-04 13:10:00 -04:00
Mikhail Goncharov	b109172d99	[clang] use a different name for generated test cdb if build system copied source files as readonly, then override of db_tu.json will fail	2021-06-04 18:12:58 +02:00
Alexey Bataev	c84a5448b5	[OPENMP]Fix PR50129: omp cancel parallel not working as expected. Need to emit a call for __kmpc_cancel_barrier in the exit block for __kmpc_cancel function call if cancellation of the parallel block is requested. Differential Revision: https://reviews.llvm.org/D103646	2021-06-04 08:24:55 -07:00
Nico Weber	f917c5b8d4	Revert test fixups after `e9a9c85098` (which reverted `a14fc74`). This reverts commit `da3ed58b97`. This reverts commit `ba1fb0ff8c`.	2021-06-04 10:42:25 -04:00
Alexey Bataev	827b5c2154	[OPENMP]Fix PR49790: Constexpr values not handled in `omp declare mapper` clause. Patch allows using of constexpr vars evaluatable to constant calue to be used in declare mapper construct. Differential Revision: https://reviews.llvm.org/D103642	2021-06-04 07:32:14 -07:00
Yaxun (Sam) Liu	b5dea8701b	[HIP] Fix spack HIP device lib detection spack HIP device library is installed at amdgcn directory under llvm/clang directory. This patch fixes detection of HIP device library for spack. Reviewed by: Artem Belevich, Harmen Stoppels Differential Revision: https://reviews.llvm.org/D103281	2021-06-04 09:12:41 -04:00
Jan Svoboda	de07b1e84d	[clang][deps] Support object files When a project uses PCH with explicit modules, the build will look like this: 1. scan PCH dependencies 2. explicitly build PCH 3. scan TU dependencies 4. explicitly build TU Step 2 produces an object file for the PCH, which the dependency scanner needs to read in step 3. This patch adds support for this. The `clang-scan-deps` invocation in the attached test would fail without this change. Depends on D103516. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D103519	2021-06-04 14:58:42 +02:00
Jan Svoboda	93a058190c	[clang][deps] Add argument for customizing PCM paths Dependency scanning currently performs an implicit build. When testing that Clang can build modules with the command-lines generated by `clang-scan-deps`, the actual compilation would overwrite artifacts created during the scan, which makes debugging harder than it should be and can lead to errors in multi-step builds. To prevent this, this patch adds new flag to `clang-scan-deps` that allows developers to customize the directory to use when generating module map paths, instead of always using the module cache. Moreover, the explicit context hash in now part of the PCM path, which will be useful in D102488, where the context hash can change due to command-line pruning. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D103516	2021-06-04 14:45:18 +02:00
ShihPo Hung	fcf8827a98	[Sema][RISCV][SVE] Allow ?: to select Typedef BuiltinType in C This patch solves an error such as: incompatible operand types ('vbool4_t' (aka '__rvv_bool4_t') and '__rvv_bool4_t') when one of the value is a TypedefType of the other value in ?:. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D103603	2021-06-04 15:33:14 +08:00
Yaxun (Sam) Liu	e42def62d8	[HIP] Fix amdgcn builtin for long type Currently some amdgcn builtins are defined with long int type, which causes invalid IR on Windows since long int is 32 bit on Windows whereas these builtins have 64 bit arguments. long long int type cannot be used since it is 128 bit in OpenCL. This patch uses 64 bit int type instead of long int to define 64 bit int arguments or return for amdgcn builtins. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D103563	2021-06-03 19:05:56 -04:00
Teresa Johnson	d0ee8b64ec	[LTO] Fix -fwhole-program-vtables handling after HIP ThinLTO patch A recent change (D99683) to support ThinLTO for HIP caused a regression when compiling cuda code with -flto=thin -fwhole-program-vtables. Specifically, we now get an error: error: invalid argument '-fwhole-program-vtables' only allowed with '-flto' This error is coming from the device offload cc1 action being set up for the cuda compile, for which -flto=thin doesn't apply and gets dropped. This is a regression, but points to a potential issue that was silently occurring before the patch, details below. Before D99683, the check for fwhole-program-vtables in the driver looked like: if (WholeProgramVTables) { if (!D.isUsingLTO()) D.Diag(diag::err_drv_argument_only_allowed_with) << "-fwhole-program-vtables" << "-flto"; CmdArgs.push_back("-fwhole-program-vtables"); } And D.isUsingLTO() returned true since we have -flto=thin. However, because the cuda cc1 compile is doing device offloading, which didn't support any LTO, there was other code that suppressed -flto* options from being passed to the cc1 invocation. So the cc1 invocation silently had -fwhole-program-vtables without any -flto. This seems potentially problematic, since if we had any virtual calls we would get type test assume sequences without the corresponding LTO pass that handles them. However, with the patch, which adds support for device offloading LTO option -foffload-lto=thin, the code has changed so that we set a bool IsUsingLTO based on either -flto or -foffload-lto, depending on whether this is the device offloading action. For the device offload action in our compile, since we don't have -foffload-lto, IsUsingLTO is false, and the check for LTO with -fwhole-program-vtables now fails. What we should do is only pass through -fwhole-program-vtables to the cc1 invocation that has LTO enabled (either the device offload action with -foffload-lto, or the non-device offload action with -flto), and otherwise drop the -fwhole-program-vtables for the non-LTO action. Then we should error only if we have -fwhole-program-vtables without any -flto* options. Differential Revision: https://reviews.llvm.org/D103579	2021-06-03 14:25:03 -07:00
Fangrui Song	da3ed58b97	[test] Update more clang/test/Profile tests	2021-06-03 13:43:19 -07:00
Fangrui Song	ba1fb0ff8c	[test] Update c-linkage-available_externally.c	2021-06-03 13:40:01 -07:00
Chris Bieneman	13a9b2220f	Don't delete the module you're inspecting Prior to this patch when you used `clang -module-file-info` clang would delete the module on completion because the module was treated as an output file. This fixes the issue so you don't need to invoke cc1 directly to get module file information. Reviewed By: steven_wu, phosek Differential Revision: https://reviews.llvm.org/D103547	2021-06-03 13:00:09 -05:00
Yi Kong	dcd7664f92	Add -fno-visibility-inlines-hidden option This allows overriding -fvisibility-inlines-hidden. Differential Revision: https://reviews.llvm.org/D103537	2021-06-03 17:07:53 +08:00
David Spickett	f4543dce5d	[clang][ARM] Remove arm2/3/6/7m CPU names These legacy CPUs are known to clang but not llvm. Their use was ignored by llvm and it would print a warning saying it did not recognise them. However because some of them are default CPUs for their architecture, you would get those warnings even if you didn't choose a cpu explicitly. (now those architectures will default to a "generic" CPU) Information is thin on the ground for these older chips so this is the best I could find: https://en.wikichip.org/wiki/acorn/microarchitectures/arm2 https://en.wikichip.org/wiki/acorn/microarchitectures/arm3 https://en.wikichip.org/wiki/arm_holdings/microarchitectures/arm6 https://en.wikichip.org/wiki/arm_holdings/microarchitectures/arm7 Final part of fixing https://bugs.llvm.org/show_bug.cgi?id=50454. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D103028	2021-06-03 08:55:44 +00:00
Michael Kruse	64e5a3bbdd	[clang] Fix fail of OpenMP/tile_codegen_tile_for.cpp. Clang's version string can be customized using CLANG_VENDOR which the test did not consider. Change the test to accept any version string.	2021-06-02 21:02:05 -05:00
Michael Kruse	07a6beb402	[Clang][OpenMP] Emit dependent PreInits before directive. The PreInits of a loop transformation (atm moment only tile) include the computation of the trip count. The trip count is needed by any loop-associated directives that consumes the transformation-generated loop. Hence, we must ensure that the PreInits of consumed loop transformations are emitted with the consuming directive. This is done by addinging the inner loop transformation's PreInits to the outer loop-directive's PreInits. The outer loop-directive will consume the de-sugared AST such that the inner PreInits are not emitted twice. The PreInits of a loop transformation are still emitted directly if its generated loop(s) are not associated with another loop-associated directive. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D102180	2021-06-02 16:59:35 -05:00
Erich Keane	d8e0ae9a76	[SYCL] Fix __builtin_sycl_unique_stable_name to work on windows/spir In the case where the device is an itanium target, and the host is a windows target, we were getting the names wrong, since in the itanium case we filter by lambda-signature. The fix is to always filter by the signature rather than just on non-windows builds. I considered doing the reverse (that is, checking the aux-triple), but doing so would result in duplicate lambda mangling numbers (from linux reusing the same number for different signatures).	2021-06-02 13:16:14 -07:00
Richard Smith	13659f48a1	PR50337, PR50561: Fix determination of whether a template parameter list contains constrained parameters.	2021-06-02 13:06:40 -07:00
Erik Pilkington	369c648399	[clang] Implement the using_if_exists attribute This attribute applies to a using declaration, and permits importing a declaration without knowing if that declaration exists. This is useful for libc++ C wrapper headers that re-export declarations in std::, in cases where the base C library doesn't provide all declarations. This attribute was proposed in http://lists.llvm.org/pipermail/cfe-dev/2020-June/066038.html. rdar://69313357 Differential Revision: https://reviews.llvm.org/D90188	2021-06-02 10:30:24 -04:00
Yaxun (Sam) Liu	61c65d8e4a	Fix comments in test cuda-kernel-call.cu	2021-06-02 10:21:26 -04:00
Yaxun (Sam) Liu	04caa7c3e0	[CUDA][HIP] Promote const variables to constant Recently we added diagnosing ODR-use of host variables in device functions, which includes ODR-use of const host variables since they are not really emitted on device side. This caused regressions since we used to allow ODR-use of const host variables in device functions. This patch allows ODR-use of const variables in device functions if the const variables can be statically initialized and have an empty dtor. Such variables are marked with implicit constant attrs and emitted on device side. This is in line with what clang does for constexpr variables. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D103108	2021-06-01 21:28:41 -04:00
Yaxun (Sam) Liu	f7e87dd6ff	[CUDA][HIP] Change default lang std to c++14 Currently clang and nvcc use c++14 as default std for C++. gcc 11 even uses c++17 as default std for C++. However, clang uses c++98 as default std for CUDA/HIP. As c++14 has been well adopted and became default for clang, it seems reasonable to use c++14 as default std for CUDA/HIP. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D103221	2021-06-01 20:45:10 -04:00
Leonard Chan	e6f88dc01a	[clang][Fuchsia] Turn on relative-vtables by default for Fuchsia All fuchsia targets will now use the relative-vtables ABI by default. Also remove -fexperimental-relative-c++-abi-vtables from test RUNs targeting fuchsia. Differential Revision: https://reviews.llvm.org/D102374	2021-06-01 15:46:09 -07:00
Michael Benfield	cf49cae278	[Clang] -Wunused-but-set-parameter and -Wunused-but-set-variable These are intended to mimic warnings available in gcc. Differential Revision: https://reviews.llvm.org/D100581	2021-06-01 15:38:48 -07:00
Aaron Ballman	d7f846fc6b	Fix a Clang diagnostic to start with a lowercase letter; NFC	2021-06-01 17:03:31 -04:00
Nathan Sidwell	c138f3ce5c	[clang] Fix ICE with typeid & polymorphic class (pr50497) This addresses pr50497. The argument of a typeid expression is unevaluated, except when it's a polymorphic type. We handle this by parsing as unevaluated and then transforming to evaluated if we discover it should have been an evaluated context. We do the same in TreeTransform<Derived>::TransformCXXTypeidExpr, entering unevaluated context before transforming and rebuilding the typeid. But that's incorrect and can lead us to converting to evaluated context twice -- and hitting an assert. During normal template instantiation we're always cloning the expression, but during generic lambda processing we do not necessarily AlwaysRebuild, and end up with TransformDeclRefExpr unconditionally calling MarkDeclRefReferenced around line 10226. That triggers the assert. // Mark it referenced in the new context regardless. // FIXME: this is a bit instantiation-specific. SemaRef.MarkDeclRefReferenced(E); This patch makes 2 changes. a) TreeTransform<Derived>::TransformCXXTypeidExpr only enters unevaluated context if the typeid's operand is not a polymorphic glvalue. If it is, it keeps the same evaluation context. b) Sema::BuildCXXTypeId is altered to only transform to evaluated, if the current context is unevaluated. Differential Revision: https://reviews.llvm.org/D103258	2021-06-01 12:55:29 -07:00
David Goldman	13a8aa3ee1	[clang] RecursiveASTVisitor visits ObjCPropertyRefExpr's class receiver We now make up a TypeLoc for the class receiver to simplify visiting, notably for indexing, availability, and clangd. Differential Revision: https://reviews.llvm.org/D101645	2021-06-01 14:45:25 -04:00
Aaron Ballman	baa2b8d085	Fix a git apply that went bad somehow. When applying the changes in `8edd3464af`, it seems that this bit got merged incorrectly and no test coverage caught the issue. This fixes the diagnostic and adds a test.	2021-06-01 14:06:39 -04:00
Louis Dionne	97d234935f	[clang][Parse] Add parsing support for C++ attributes on using-declarations This is a re-application of `dc67299` which was reverted in `f63adf5b` because it broke the build. The issue should now be fixed. Attribution note: The original author of this patch is Erik Pilkington. I'm only trying to land it after rebasing. Differential Revision: https://reviews.llvm.org/D91630	2021-06-01 08:47:50 -04:00
Ole Strohm	94b0aec0f5	[OpenCL] Fix ICE with invalid use of half Because half is limited to the `cl_khr_fp16` extension being enabled, `DefaultLvalueConversion` can fail when it's not enabled. The original assumption that it will never fail is therefore wrong now. Fixes: PR47976 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D103175	2021-06-01 13:43:07 +01:00
Bjorn Pettersson	f0e10cc91b	[HIP] Relax checks in hip-include-path.hip test case again The tightened checks from commit `722c39fef5` did not work fully for buildbots using symlinks in repo paths. This patch is not fully reverting `722c39fef5`, as we still match that there is a "/lib" somewhere in the path before "/clang/". So this is once again a bit fragile in case someone would put their repo in a base directory, for example, named "/scratch/lib/foo/clang/llvm-project/". But it is atleast a bit better than the original checks (avoiding the problem that commit `722c39fef5` was solving).	2021-05-31 22:40:21 +02:00
Qiu Chaofan	c0b3071833	[PowerPC] Fix x86 vector intrinsics wrapper compilation under C++ Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D103386	2021-06-01 01:19:12 +08:00
Abbas Sabra	116179c2ee	Re-commit [clang] Add support for the "abstract" contextual keyword of MSVC https://docs.microsoft.com/en-us/cpp/extensions/abstract-cpp-component-extensions?view=msvc-160 Note: like the already supported "sealed" keyword, the "abstract" keyword is supported by MSVC by default. This re-commits `818338add7` with added initialization of Parser::Ident_abstract. Differential revision: https://reviews.llvm.org/D102517	2021-05-31 18:45:26 +02:00
Mikhail Goncharov	c36ff6424f	Revert "[clang] Add support for the "abstract" contextual keyword of MSVC" This reverts commit `818338add7`. Tests fail under sanitizer: https://lab.llvm.org/buildbot/#/builders/5/builds/8150	2021-05-31 15:34:20 +02:00
Mikhail Goncharov	8006ac7fe4	fix test for symlinked clang `c1ee4fb5af`	2021-05-31 13:29:33 +02:00
Juneyoung Lee	a723ca32af	fix broken clang tests after `7161bb87c9`	2021-05-31 19:25:14 +09:00
Abbas Sabra	818338add7	[clang] Add support for the "abstract" contextual keyword of MSVC https://docs.microsoft.com/en-us/cpp/extensions/abstract-cpp-component-extensions?view=msvc-160 Note: like the already supported "sealed" keyword, the "abstract" keyword is supported by MSVC by default. Differential revision: https://reviews.llvm.org/D102517	2021-05-31 10:44:53 +02:00
Ben Shi	c1ee4fb5af	[clang][AVR] Add avr-libc/include to clang system include paths Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D97669	2021-05-30 22:39:07 +08:00
Xuanda Yang	620cef9120	[analyzer] MallocSizeof: sizeof pointer type is compatible with void* source: https://bugs.llvm.org/show_bug.cgi?id=50214 Make sizeof pointer type compatible with void* in MallocSizeofChecker. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D103358	2021-05-30 09:51:41 +08:00
Denys Petrov	fae3534b30	[analyzer] Use Optional as a return type of StoreManager::castRegion Summary: Make StoreManager::castRegion function usage safier. Replace `const MemRegion ` with `Optional<const MemRegion >`. Simplified one of related test cases due to suggestions in D101635. Differential Revision: https://reviews.llvm.org/D103319	2021-05-29 15:16:56 +03:00
Nico Weber	f63adf5b67	Revert "[clang][Parse] Add parsing support for C++ attributes on using-declarations" This reverts commit `dc672999a9`. Breaks check-clang everywhere, see https://reviews.llvm.org/D91630	2021-05-28 14:49:18 -04:00
Erik Pilkington	dc672999a9	[clang][Parse] Add parsing support for C++ attributes on using-declarations Differential Revision: https://reviews.llvm.org/D91630	2021-05-28 12:00:33 -04:00
Florian Hahn	5bccdde070	[Matrix] Move C++ matrix cast checks to TryStaticCast. At the moment, the matrix support in CheckCXXCStyleCast (added in D101696) breaks function-style constructor calls that take a single matrix value, because it is treated as matrix cast. Instead, unify the C++ matrix cast handling by moving the logic to TryStaticCast and only handle the case where both types are matrix types. Otherwise, fall back to the generic mis-match detection. Suggested by @rjmccall Reviewed By: SaurabhJha Differential Revision: https://reviews.llvm.org/D103163	2021-05-28 13:00:28 +01:00
Tim Northover	e94fada045	SwiftAsync: add Clang attribute to apply the LLVM `swiftasync` one. Expected to be used by Swift runtime developers.	2021-05-28 12:31:12 +01:00
Martin Storsjö	f59cd8a4a6	[clang] [MinGW] Fix gcc version detection/picking Actually compare each version to the version of the last chosen one. There's no guarantee that the added test case does showcase the previous issue (it depends on the order that directory entries are returned when iterating), but with the issue fixed it should behave deterministically in any case. Also improve the match patterns in the mingw-sysroot.cpp test a bit. Differential Revision: https://reviews.llvm.org/D102873	2021-05-28 11:44:20 +03:00
Zequan Wu	59b8afe502	[clang-cl] Bump default -fms-compatibility-version to 19.14 MSVC required version is 19.14 now (https://reviews.llvm.org/D92515). Update the default -fms-compatibility-version to 19.14. Differential Revision: https://reviews.llvm.org/D103293	2021-05-27 20:40:37 -07:00
Quinn Pham	62b5df7fe2	[PowerPC] Added multiple PowerPC builtins This is the first in a series of patches to provide builtins for compatibility with the XL compiler. Most of the builtins already had intrinsics and only needed to be implemented in the front end. Intrinsics were created for the three iospace builtins, eieio, and icbt. Pseudo instructions were created for eieio and iospace_eieio to ensure that nops were inserted before the eieio instruction. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D102443	2021-05-27 16:23:03 -05:00
Martin Storsjö	0e4cf807ae	[clang] [MinGW] Don't mark emutls variables as DSO local These actually can be automatically imported from another DLL. (This works properly as long as the actual implementation of emutls is linked dynamically from e.g. libgcc; if the implementation comes from compiler-rt or a statically linked libgcc, it doesn't work as intended.) This fixes PR50146 and https://github.com/msys2/MINGW-packages/issues/8706 (fixing calling std::call_once in a dynamically linked libstdc++); since `f731839584` the dso_local attribute on the TLS variable affected the actual generated code for accessing the emutls variable. The dso_local attribute on the emutls variable made those accesses to use 32 bit relative addressing in code, which requires runtime pseudo relocations in the text section, and breaks entirely if the actual other variable ends up loaded too far away in the virtual address space. Differential Revision: https://reviews.llvm.org/D102970	2021-05-27 23:51:22 +03:00
Erich Keane	cb66bf2c6d	Replace 'magic static' with a member variable for SCYL kernel names I discovered when merging the __builtin_sycl_unique_stable_name into my downstream that it is actually possible for the cc1 invocation to have more than 1 Sema instance, if you pass it multiple input files, each gets its own Sema instance and thus ASTContext instance. The result was that the call to Filter the SYCL kernels was using an ItaniumMangleContext stored via a 'magic static', so it had an invalid reference to ASTContext when processing the 2nd failure. The failure is unfortunately flakey/transient, but the test that fails was added anyway. The magic-static was switched to a unique_ptr member variable in ASTContext that is initialized when needed.	2021-05-27 13:46:31 -07:00
Georgeta Igna	50f17e9d31	[analyzer] RetainCountChecker: Disable reference counting for OSMetaClass. It is a reference-counted class but it uses different methods for that and the checker doesn't understand them yet. Differential Revision: https://reviews.llvm.org/D103081	2021-05-27 13:12:19 -07:00
Marco Elver	4fbc66cd6d	[Clang] Enable __has_feature(coverage_sanitizer) Like other sanitizers, enable __has_feature(coverage_sanitizer) if clang has enabled at least one SanitizerCoverage instrumentation type. Because coverage instrumentation selection is not handled via normal -fsanitize= (and thus not in SanitizeSet), passing this information through to LangOptions required propagating the already parsed -fsanitize-coverage= options from CodeGenOptions through to LangOptions in FixupInvocation(). Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D103159	2021-05-27 18:24:21 +02:00
Qiu Chaofan	5c18d11366	[SPE] Disable strict-fp for SPE by default As discussed in PR50385, strict-fp on PowerPC SPE has not been handled well. This patch disables it by default for SPE. Reviewed By: nemanjai, vit9696, jhibbits Differential Revision: https://reviews.llvm.org/D103235	2021-05-28 00:14:35 +08:00
Yaxun (Sam) Liu	6d2c095020	[HIP] Check compatibility of -fgpu-sanitize with offload arch -fgpu-sanitize is incompatible with offload arch containing xnack-. This patch checks that. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D102975	2021-05-27 12:06:42 -04:00
jasonliu	7922ff6010	[AIX] Add -lc++abi and -lunwind for linking Summary: We are going to have libc++abi.a and libunwind.a on AIX. Add the necessary linking command to pick the libraries up. Reviewed By: daltenty Differential Revision: https://reviews.llvm.org/D102813	2021-05-27 15:48:53 +00:00
Aaron Puchert	cf0b337c1b	Thread safety analysis: Allow exlusive/shared joins for managed and asserted capabilities Similar to how we allow managed and asserted locks to be held and not held in joining branches, we also allow them to be held shared and exclusive. The scoped lock should restore the original state at the end of the scope in any event, and asserted locks need not be released. We should probably only allow asserted locks to be subsumed by managed, not by (directly) acquired locks, but that's for another change. Reviewed By: delesley Differential Revision: https://reviews.llvm.org/D102026	2021-05-27 17:46:04 +02:00
Sven van Haastregt	85f5272ffc	[OpenCL][NFC] Fix typos in test	2021-05-27 16:06:33 +01:00
Erich Keane	eba69b59d1	Reimplement __builtin_unique_stable_name- The original version of this was reverted, and @rjmcall provided some advice to architect a new solution. This is that solution. This implements a builtin to provide a unique name that is stable across compilations of this TU for the purposes of implementing the library component of the unnamed kernel feature of SYCL. It does this by running the Itanium mangler with a few modifications. Because it is somewhat common to wrap non-kernel-related lambdas in macros that aren't present on the device (such as for logging), this uniquely generates an ID for all lambdas involved in the naming of a kernel. It uses the lambda-mangling number to do this, except replaces this with its own number (starting at 10000 for readabililty reasons) for lambdas used to name a kernel. Additionally, this implements itself as constexpr with a slight catch: if a name would be invalidated by the use of this lambda in a later kernel invocation, it is diagnosed as an error (see the Sema tests). Differential Revision: https://reviews.llvm.org/D103112	2021-05-27 07:12:20 -07:00
Aaron Ballman	8edd3464af	Add support for #elifdef and #elifndef WG14 adopted N2645 and WG21 EWG has accepted P2334 in principle (still subject to full EWG vote + CWG review + plenary vote), which add support for #elifdef as shorthand for #elif defined and #elifndef as shorthand for #elif !defined. This patch adds support for the new preprocessor directives.	2021-05-27 08:57:47 -04:00
Zahira Ammarguellat	a4b61c82cf	The compiler is crashing when compiling a coroutine intrinsic without the use of the option fcoroutines-ts. This is a patch to fix this. Fix for https://bugs.llvm.org/show_bug.cgi?id=50406	2021-05-26 18:07:31 -07:00
Mitch Phillips	f7c5c0d87b	Revert "[Scudo] Make -fsanitize=scudo use standalone. Migrate tests." This reverts commit `6911114d8c`. Broke the QEMU sanitizer bots due to a missing header dependency. This actually needs to be fixed on the bot-side, but for now reverting this patch until I can fix up the bot.	2021-05-26 10:50:26 -07:00
Mitch Phillips	6911114d8c	[Scudo] Make -fsanitize=scudo use standalone. Migrate tests. This patch moves -fsanitize=scudo to link the standalone scudo library, rather than the original compiler-rt based library. This is one of the major remaining roadblocks to deleting the compiler-rt based scudo, which should not be used any more. The standalone Scudo is better in pretty much every way and is much more suitable for production usage. As well as patching the litmus tests for checking that the scudo_standalone lib is linked instead of the scudo lib, this patch also ports all the scudo lit tests to run under scudo standalone. This patch also adds a feature to scudo standalone that was under test in the original scudo - that arguments passed to an aligned operator new were checked that the alignment was a power of two. Some lit tests could not be migrated, due to the following issues: 1. Features that aren't supported in scudo standalone, like the rss limit. 2. Different quarantine implementation where the test needs some more thought. 3. Small bugs in scudo standalone that should probably be fixed, like the Secondary allocator having a full page on the LHS of an allocation that only contains the chunk header, so underflows by <= a page aren't caught. 4. Slight differences in behaviour that's technically correct, like 'realloc(malloc(1), 0)' returns nullptr in standalone, but a real pointer in old scudo. 5. Some tests that might be migratable, but not easily. Tests that are obviously not applicable to scudo standalone (like testing that no sanitizer symbols made it into the DSO) have been deleted. After this patch, the remaining work is: 1. Update the Scudo documentation. The flags have changed, etc. 2. Delete the old version of scudo. 3. Patch up the tests in lit-unmigrated, or fix Scudo standalone. Reviewed By: cryptoad, vitalybuka Differential Revision: https://reviews.llvm.org/D102543	2021-05-26 10:03:17 -07:00
Hans Wennborg	a8f75d497d	[clang-cl] Add driver support for /std:c++20 and bump /std:c++latest (PR50465) VS 2019 16.11 (just released in Preview) is adding support for the /std:c++20 option and bumping /std:c++latest to "post-c++20". This updates clang-cl to match. Differential revision: https://reviews.llvm.org/D103155	2021-05-26 16:05:52 +02:00
Sven van Haastregt	ba0fe85ec0	[OpenCL] Include header for atomic-ops test Avoid duplicating the memory_order and memory_scope enum definitions.	2021-05-26 12:32:07 +01:00
Bjorn Pettersson	a3b3f7e631	[HIP] Adjust check in hip-include-path.hip test case The changes in commit `722c39fef5` caused the test case to fail when building with -DLLVM_LIBDIR_SUFFIX=64. This patch makes the checks a bit more relaxed to support libdir suffixes again. Also adjusting the regular expressions to avoid mathes including double quotes.	2021-05-26 11:08:05 +02:00
Jake Egan	5bc644aeca	Revert "[AIX] Avoid structor alias; die before bad alias codegen" Avoiding structor alias is no longer needed because AIX now has an alias implementation here: https://reviews.llvm.org/D83252. This reverts commit `b116ded57d`. Reviewed By: jasonliu Differential Revision: https://reviews.llvm.org/D102724	2021-05-25 15:07:40 -04:00
Momchil Velikov	21aa107eb7	Reland "Do not create LLVM IR `constant`s for objects with dynamic initialisation" This relands commit `13dd65b3a1`. The original commit contained a test, which failed when compiled for a MACH-O target. This patch changes the test to run for x86_64-linux instead of `%itanium_abi_triple`, to avoid having invalid syntax for MACH-O sections. The patch itself does not care about section attribute syntax and a x86 backend does not even need to be included in the build. Differential Revision: https://reviews.llvm.org/D102693	2021-05-25 15:54:40 +01:00
David Spickett	8427053f81	[clang][ARM] When handling multiple -mimplicit-it mark all as used Since `4468e5b899` clang will prefer the last one it finds of "-mimplicit-it" or "-Wa,-mimplicit-it". Due to a mistake in that patch the compiler argument "-mimplicit-it" was never marked as used, even if it was the last one and was passed to llvm. Move the Claim call back to the start of the loop and update the testing to check we don't get any unused argument warnings. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D103086	2021-05-25 14:53:07 +00:00
Sanjay Patel	16e78ec0b4	[Headers][WASM] adjust test that runs the optimizer; NFC This broke with the LLVM change in `0bab0f6161`	2021-05-25 09:17:10 -04:00
Marco Elver	280333021e	[SanitizeCoverage] Add support for NoSanitizeCoverage function attribute We really ought to support no_sanitize("coverage") in line with other sanitizers. This came up again in discussions on the Linux-kernel mailing lists, because we currently do workarounds using objtool to remove coverage instrumentation. Since that support is only on x86, to continue support coverage instrumentation on other architectures, we must support selectively disabling coverage instrumentation via function attributes. Unfortunately, for SanitizeCoverage, it has not been implemented as a sanitizer via fsanitize= and associated options in Sanitizers.def, but rolls its own option fsanitize-coverage. This meant that we never got "automatic" no_sanitize attribute support. Implement no_sanitize attribute support by special-casing the string "coverage" in the NoSanitizeAttr implementation. To keep the feature as unintrusive to existing IR generation as possible, define a new negative function attribute NoSanitizeCoverage to propagate the information through to the instrumentation pass. Fixes: https://bugs.llvm.org/show_bug.cgi?id=49035 Reviewed By: vitalybuka, morehouse Differential Revision: https://reviews.llvm.org/D102772	2021-05-25 12:57:14 +02:00
Marco Elver	85feebf5a3	[NFC][SanitizeCoverage] Test always_inline functions work Test that always_inline functions are instrumented as expected. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102929	2021-05-25 12:57:14 +02:00
Balazs Benics	f05b70c236	Revert "[analyzer][ctu] Avoid parsing invocation list again and again during on-demand parsing of CTU" This reverts commit `db8af0f21d`. clang-x86_64-debian-fast fails on this. + : 'RUN: at line 4' + /usr/bin/ccache /b/1/clang-x86_64-debian-fast/llvm.src/clang/test/Analysis/ctu-on-demand-parsing-multiple-invocation-list-parsing.cpp -fPIC -shared -o /b/1/clang-x86_64-debian-fast/llvm.obj/tools/clang/test/Analysis/Output/ctu-on-demand-parsing-multiple-invocation-list-parsing.cpp.tmp/mock_open.so ccache: error: execv of /b/1/clang-x86_64-debian-fast/llvm.src/clang/test/Analysis/ctu-on-demand-parsing-multiple-invocation-list-parsing.cpp failed: Permission denied	2021-05-25 09:29:56 +02:00
Ella Ma	db8af0f21d	[analyzer][ctu] Avoid parsing invocation list again and again during on-demand parsing of CTU During CTU, the on-demand parsing will read and parse the invocation list to know how to compile the file being imported. However, it seems that the invocation list will be parsed again if a previous parsing has failed. Then, parse again and fail again. This patch tries to overcome the problem by storing the error code during the first parsing, and re-create the stored error during the later parsings. Reviewed By: steakhal Patch By: OikawaKirie! Differential Revision: https://reviews.llvm.org/D101763	2021-05-25 09:19:14 +02:00
Min-Yih Hsu	6685a3f3e4	[cfe] Support target-specific escaped character in inline asm GCC allows each target to define a set of non-letter and non-digit escaped characters for inline assembly that will be replaced by another string (They call this "punctuation" characters. The existing "%%" and "%{" -- replaced by '%' and '{' at the end -- can be seen as special cases shared by all targets). This patch implements this feature by adding a new hook in `TargetInfo`. Differential Revision: https://reviews.llvm.org/D103036	2021-05-24 21:39:21 -07:00
Logan Smith	a5a3efa82a	[Sema] Always search the full function scope context if a potential availability violation is encountered This fixes both https://bugs.llvm.org/show_bug.cgi?id=50309 and https://bugs.llvm.org/show_bug.cgi?id=50310. Previously, lambdas inside functions would mark their own bodies for later analysis when encountering a potentially unavailable decl, without taking into consideration that the entire lambda itself might be correctly guarded inside an @available check. The same applied to inner class member functions. Blocks happened to work as expected already, since Sema::getEnclosingFunction() skips through block scopes. This patch instead simply and conservatively marks the entire outermost function scope for search, and removes some special-case logic that prevented DiagnoseUnguardedAvailabilityViolations from traversing down into lambdas and nested functions. This correctly accounts for arbitrarily nested lambdas, inner classes, and blocks that may be inside appropriate @available checks at any ancestor level. It also treats all potential availability violations inside functions consistently, without being overly sensitive to the current DeclContext, which previously caused issues where e.g. nested struct members were warned about twice. DiagnoseUnguardedAvailabilityViolations now has more work to do in some cases, particularly in functions with many (possibly deeply) nested lambdas and classes, but the big-O is the same, and the simplicity of the approach and the fact that it fixes at least two bugs feels like a strong win. Differential Revision: https://reviews.llvm.org/D102338	2021-05-24 21:13:30 -07:00
Nico Weber	d881319cc5	Revert "Do not create LLVM IR `constant`s for objects with dynamic initialisation" This reverts commit `13dd65b3a1`. Breaks check-clang on macOS, see https://reviews.llvm.org/D102693	2021-05-24 21:22:07 -04:00
Richard Smith	de6164ec4d	PR50456: Properly handle multiple escaped newlines in a '*/'.	2021-05-24 16:21:03 -07:00
Momchil Velikov	13dd65b3a1	Do not create LLVM IR `constant`s for objects with dynamic initialisation When a const-qualified object has a section attribute, that section is set to read-only and clang outputs a LLVM IR constant for that object. This is incorrect for dynamically initialised objects. For example: int init() { return 15; } __attribute__((section("SA"))) const int a = init(); a is allocated to a read-only section and is left unintialised (zero-initialised). This patch adds checks if an initialiser is a constant expression and allocates objects to sections as follows: * const-qualified objects - no initialiser or constant initialiser: .rodata - dynamic initializer: .bss * non const-qualified objects - no initialiser or dynamic initialiser: .bss - constant initialiser: .data (".rodata", ".data", and ".bss" names used just for explanatory purpose) Differential Revision: https://reviews.llvm.org/D102693	2021-05-24 22:04:15 +01:00
serge-sans-paille	4ab3041acb	Revert "[NFC] remove explicit default value for strboolattr attribute in tests" This reverts commit `bda6e5bee0`. See https://lab.llvm.org/buildbot/#/builders/109/builds/15424 for instance	2021-05-24 19:43:40 +02:00
serge-sans-paille	bda6e5bee0	[NFC] remove explicit default value for strboolattr attribute in tests Since `d6de1e1a71`, no attributes is quivalent to setting attribute to false. This is a preliminary commit for https://reviews.llvm.org/D99080	2021-05-24 19:31:04 +02:00
Anastasia Stulova	626e9641a2	[OpenCL] Fix test by adding SPIR triple	2021-05-24 13:03:50 +01:00
Anastasia Stulova	237c6924bd	[OpenCL] Add clang extension for bit-fields. Allow use of bit-fields as a clang extension in OpenCL. The extension can be enabled using pragma directives. This fixes PR45339! Differential Revision: https://reviews.llvm.org/D101843	2021-05-24 12:42:17 +01:00
Tomasz Kamiński	058f384ae9	[analyzer] Correctly propagate ConstructionContextLayer thru ParenExpr Previously, information about `ConstructionContextLayer` was not propagated thru causing the expression like: Var c = (createVar()); To produce unrelated temporary for the `createVar()` result and conjure a new symbol for the value of `c` in C++17 mode. Reviewed By: steakhal Patch By: tomasz-kaminski-sonarsource! Differential Revision: https://reviews.llvm.org/D102835	2021-05-24 10:16:52 +02:00
Chen Zheng	99d45ed22f	[Debug-Info] handle DW_TAG_rvalue_reference_type at strict DWARF. When -gstrict-dwarf is specified, generate DW_TAG_rvalue_reference_type at DWARF 4 or above Reviewed By: dblaikie, aprantl Differential Revision: https://reviews.llvm.org/D100630	2021-05-23 21:24:13 -04:00
Petr Hosek	5ff79f001f	Revert "[Driver] Support libc++ in MSVC" This reverts commit `b604301be3` since it caused compilation failure in sanitizer_unwind_win.cpp when using the runtimes build.	2021-05-22 15:49:46 -07:00
Petr Hosek	b604301be3	[Driver] Support libc++ in MSVC This implements support for using libc++ headers and library in the MSVC toolchain. We only support libc++ that is a part of the toolchain, and not headers installed elsewhere on the system. Differential Revision: https://reviews.llvm.org/D101479	2021-05-22 13:32:23 -07:00
Yaxun (Sam) Liu	bf6124580d	[HIP] support ThinLTO Add options -[no-]offload-lto and -foffload-lto=[thin,full] for controlling LTO for offload compilation. Allow LTO for AMDGPU target. AMDGPU target does not support codegen of object files containing call of external functions, therefore the LLVM module passed to AMDGPU backend needs to contain definitions of all the callees. An LLVM option is added to allow function importer to import functions with noinline attribute. HIP toolchain passes proper LLVM options to lld to make sure function importer imports definitions of all the callees. Reviewed by: Teresa Johnson, Artem Belevich Differential Revision: https://reviews.llvm.org/D99683	2021-05-22 10:48:34 -04:00
Arthur Eubanks	f7788e1bff	Revert "[NewPM] Only invalidate modified functions' analyses in CGSCC passes" This reverts commit `d14d84af2f`. Causes unacceptable memory regressions.	2021-05-21 16:38:03 -07:00
Nick Desaulniers	033138ea45	[IR] make stack-protector-guard-* flags into module attrs D88631 added initial support for: - -mstack-protector-guard= - -mstack-protector-guard-reg= - -mstack-protector-guard-offset= flags, and D100919 extended these to AArch64. Unfortunately, these flags aren't retained for LTO. Make them module attributes rather than TargetOptions. Link: https://github.com/ClangBuiltLinux/linux/issues/1378 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D102742	2021-05-21 15:53:30 -07:00
Martin Storsjö	4468e5b899	[clang] Don't pass multiple backend options if mixing -mimplicit-it and -Wa,-mimplicit-it If multiple instances of the -arm-implicit-it option is passed to the backend, it errors out. Also fix cases where there are multiple -Wa,-mimplicit-it; the existing tests indicate that the last one specified takes effect, while in practice it passed double options, which didn't work as intended. Differential Revision: https://reviews.llvm.org/D102812	2021-05-22 00:05:31 +03:00
Anton Zabaznov	826905787a	[OpenCL] Add support of OpenCL C 3.0 __opencl_c_fp64 There already exists cl_khr_fp64 extension. So OpenCL C 3.0 and higher should use the feature, earlier versions still use the extension. OpenCL C 3.0 API spec states that extension will be not described in the option string if corresponding optional functionality is not supported (see 4.2. Querying Devices). Due to that fact the usage of features for OpenCL C 3.0 must be as follows: ``` $ clang -Xclang -cl-ext=+cl_khr_fp64,+__opencl_c_fp64 ... $ clang -Xclang -cl-ext=-cl_khr_fp64,-__opencl_c_fp64 ... ``` e.g. the feature and the equivalent extension (if exists) must be set to the same values Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D96524	2021-05-21 15:01:19 +03:00
Melvin Fox	b5b3843f8d	[clang] Fix for "Bug 27113 - MSVC-compat __identifier implementation incomplete" this patch fixes Bug 27113 by adding support for string literals to the implementation of the MS extension __identifier. Differential revision: https://reviews.llvm.org/D100252	2021-05-21 11:14:01 +02:00
Jan Svoboda	89ea60f392	[clang][deps] Explicitly enable C++ modules in a test In Apple Clang, enabling modules for C++ input requires additional flag: -fcxx-modules.	2021-05-21 10:25:02 +02:00
Timm Bäder	95423c7c99	[clang][driver] Treat -flto=[auto,jobserver] as -flto Instead of ignoring flto=auto and -flto=jobserver, treat them as -flto and pass -flto=full along. Differential Revision: https://reviews.llvm.org/D102479	2021-05-21 08:38:41 +02:00
Jinsong Ji	edf4d69d38	[AIX] Print printable byte list as quoted string .byte supports string, so if the whole byte list are printable, we can actually print the string for readability and LIT tests maintainence. .byte 'H,'e,'l,'l,'o,',,' ,'w,'o,'r,'l,'d -> .byte "Hello, world" Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D102814	2021-05-21 02:37:55 +00:00
Yaxun (Sam) Liu	4cb42564ec	[CUDA][HIP] Fix device variables used by host variables emitted on both host and device side with different addresses when ODR-used by host function should not cause device side counter-part to be force emitted. This fixes the regression caused by https://reviews.llvm.org/D102237 Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D102801	2021-05-20 17:04:29 -04:00
Min-Yih Hsu	dccf5c7dfb	[M68k] Support for inline asm operands w/ simple constraints This patch adds supports for inline assembly operands and some simple operand constraints, including register and constant operands. Differential Revision: https://reviews.llvm.org/D102585	2021-05-20 14:00:09 -07:00
Min-Yih Hsu	e620bea211	[M68k] Allow user to preserve certain registers Add `-ffixed-a[0-6]` and `-ffixed-d[0-7]` and the corresponding subtarget features to prevent certain register from being allocated. Differential Revision: https://reviews.llvm.org/D102805	2021-05-20 13:57:22 -07:00
Heejin Ahn	3eb12b0ae1	[WebAssembly] Warn on exception spec for Emscripten EH It turns out we have not correctly supported exception spec all along in Emscripten EH. Emscripten EH supports `throw()` but not `throw` with types. See https://bugs.llvm.org/show_bug.cgi?id=50396. Wasm EH also only supports `throw()` but not `throw` with types, and we have been printing a warning message for the latter. This prints the same warning message for `throw` with types when Emscripten EH is used, or more precisely, when Wasm EH is not used. (So this will print the warning messsage even when `-fno-exceptions` is used but I think that should be fine. It's cumbersome to do a complilcated option checking in CGException.cpp and options checkings are mostly done in elsewhere.) Reviewed By: dschuff, kripken Differential Revision: https://reviews.llvm.org/D102791	2021-05-20 13:00:20 -07:00
Jamie Schmeiser	136ced498b	When vector is found as a type or non-type id, check if it is really the altivec vector token. Summary: Call TryAltiVecVectorToken when an identifier is seen in the parser before annotating the token. This checks the next token where necessary to ensure that vector is properly handled as the altivec token. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: ZarkoCA (Zarko Todorovski) Differential Revision: https://reviews.llvm.org/D100991	2021-05-20 12:39:04 -04:00
Peter Waller	2d574a1104	[CodeGen][AArch64][SVE] Canonicalize intrinsic rdffr{ => _z} Follow up to D101357 / `3fa6510f6`. Supersedes D102330. Goal: Use flags setting rdffrs instead of rdffr + ptest. Problem: RDFFR_P doesn't have have a flags setting equivalent. Solution: in instcombine, canonicalize to RDFFR_PP at the IR level, and rely on RDFFR_PP+PTEST => RDFFRS_PP optimization in AArch64InstrInfo::optimizePTestInstr. While here: * Test that rdffr.z+ptest generates a rdffrs. * Use update_{test,llc}_checks.py on the tests. * Use sve attribute on functions. Differential Revision: https://reviews.llvm.org/D102623	2021-05-20 16:22:50 +00:00
Aaron Ballman	beb5a3a298	Correct some thread safety analysis diagnostics; NFC. The diagnostics were not following the usual style rules.	2021-05-20 11:30:21 -04:00
Daniel Kiss	801ab71032	[ARM][AArch64] SLSHardening: make non-comdat thunks possible Linker scripts might not handle COMDAT sections. SLSHardeing adds new section for each __llvm_slsblr_thunk_xN. This new option allows the generation of the thunks into the normal text section to handle these exceptional cases. ,comdat or ,noncomdat can be added to harden-sls to control the codegen. -mharden-sls=[all\|retbr\|blr],nocomdat. Reviewed By: kristof.beyls Differential Revision: https://reviews.llvm.org/D100546	2021-05-20 17:07:05 +02:00
Hubert Tong	603818b97c	[test] Fix pre-ra-sched.c to check for error message from stderr The test previous accidentally passed because it was looking for a lack of specific input from the binary(!) output being sent to stdout.	2021-05-20 09:51:21 -04:00
Xiangling Liao	d74b6635ef	Fix LIT failure on native aix On AIX, char bitfields have the same alignment as unsigned int. Reference: https://reviews.llvm.org/D87029 Differential Revision: https://reviews.llvm.org/D102715	2021-05-20 09:38:52 -04:00
Haojian Wu	80c1adfd18	[clang] Invalidate a non-dependent-type RecordDecl when it has any dependent-type base class specifier. This happens during the error-recovery, and it would esacpe all dependent-type check guards in getTypeInfo/constexpr-evaluator code paths, which lead to crashes. Differential Revision: https://reviews.llvm.org/D102773	2021-05-20 15:33:05 +02:00
Michael Spencer	7defab0820	Reapply "[clang][deps] Support inferred modules" This reverts commit `76b8754d` and ensures the PCM files are created in the correct directory (not in the current working directory).	2021-05-20 12:41:52 +02:00
Sergey Dmitriev	f8444a8e94	[clang-offload-bundler] Delimit input/output file names by '--' for llvm-objcopy That fixes a problem of using bundler with file names starting with dash. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D102752	2021-05-19 20:25:05 -07:00
Johannes Doerfert	6ff380f439	[OpenMP][NFC] Remove SIMD check lines for non-simd tests If a test does not contain an " simd" but -fopenmp-simd RUN lines we can just check that we do not create __kmpc\|__tgt calls. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D101973	2021-05-19 21:35:33 -05:00
Zhiwei Chen	dbc641deb9	[sanitizer] Reduce redzone size for small size global objects Currently 1 byte global object has a ridiculous 63 bytes redzone. This patch reduces the redzone size to be less than 32 if the size of global object is less than or equal to half of 32 (the minimal size of redzone). A 12 bytes object has a 20 bytes redzone, a 20 bytes object has a 44 bytes redzone. Reviewed By: MaskRay, #sanitizers, vitalybuka Differential Revision: https://reviews.llvm.org/D102469	2021-05-19 19:18:50 -07:00
Fangrui Song	ac06f6d06d	[test] Fix test	2021-05-19 16:23:52 -07:00
Fangrui Song	37561ba89b	-fno-semantic-interposition: Don't set dso_local on GlobalVariable `clang -fpic -fno-semantic-interposition` may set dso_local on variables for -fpic. GCC folks consider there are 'address interposition' and 'semantic interposition', and 'disabling semantic interposition' can optimize function calls but cannot change variable references to use local aliases (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100483). This patch removes dso_local for variables in `clang -fpic -fno-semantic-interposition` mode so that the built shared objects can work with copy relocations. Building llvm-project tiself with -fno-semantic-interposition (D102453) should now be safe with trunk Clang. Example: ``` // a.c int var; int *addr() { return var; } // old: cannot be interposed movslq .Lvar$local(%rip), %rax // new: can be interposed movq var@GOTPCREL(%rip), %rax movslq (%rax), %rax ``` The local alias lowering for `GlobalVariable`s is kept in case there is a future option allowing local aliases. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D102583	2021-05-19 16:08:28 -07:00
Richard Smith	2f8ac0758b	PR50402: Use proper constant evaluation rules for checking constraint satisfaction. Previously we used the rules for constant folding in a non-constant context, meaning that we'd incorrectly accept foldable non-constant expressions and that std::is_constant_evaluated() would evaluate to false.	2021-05-19 16:02:53 -07:00
Martin Storsjö	688b917b4b	Revert "[Driver] Delete -mimplicit-it=" This reverts commit `2919222d80`. That commit broke backwards compatibility. Additionally, the replacement, -Wa,-mimplicit-it, isn't yet supported by any stable release of Clang. See D102812 for a fix for the error cases when callers specify both -mimplicit-it and -Wa,-mimplicit-it.	2021-05-20 00:17:50 +03:00
Richard Smith	d38057f3ec	Treat implicit deduction guides as being equivalent to their corresponding constructor for access checking purposes.	2021-05-19 13:31:53 -07:00
Alex Lorenz	50be48b0f3	[clang][ObjC] Allow different availability annotation on a method when implementing an optional protocol requirement When an Objective-C method implements an optional protocol requirement, allow the method to use a newer introduced or older obsoleted availability version than what's specified on the method in the protocol itself. This allows SDK adopters to adopt an optional method from a protocol later than when the method is introduced in the protocol. The users that call an optional method on an object that conforms to this protocol are supposed to check whether the object implements the method or not, so a lack of appropriate `if (@available)` check for a new OS version is not a cause of concern as there's already another runtime check that's required. Differential Revision: https://reviews.llvm.org/D102459	2021-05-19 12:13:57 -07:00
Joseph Huber	2db182ff8d	[Diagnostics] Allow emitting analysis and missed remarks on functions Summary: Currently, only `OptimizationRemarks` can be emitted using a Function. Add constructors to allow this for `OptimizationRemarksAnalysis` and `OptimizationRemarkMissed` as well. Reviewed By: jdoerfert thegameg Differential Revision: https://reviews.llvm.org/D102784	2021-05-19 15:10:20 -04:00
Frederik Gossen	76b8754d1b	Revert "Reapply "[clang][deps] Support inferred modules"" This reverts commit `c98833cdaa`. The test `ClangScanDeps/modules-inferred-explicit-build.m` creates files in the current directory.	2021-05-19 19:19:37 +02:00
Melanie Blower	d30dfa8676	[clang][patch] Add support for option -fextend-arguments={32,64}: widen integer arguments to int64 in unprototyped function calls Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D101640	2021-05-19 10:59:56 -04:00
Michael Spencer	c98833cdaa	Reapply "[clang][deps] Support inferred modules" This reapplies commit 95033eb3 that reverted commit `1d9e8e13`. The tests were failing on Windows due to spaces and backslashes in paths not being handled carefully.	2021-05-19 13:35:51 +02:00
Haojian Wu	f5b5426433	[clang] Fix a crash on CheckArgAlignment. We might encounter an undeduced type before calling getTypeAlignInChars. NOTE: this retrieves the fix from `8f80c66bd2`, which was removed in Adam's followup fix `fbfcfdbf68`. We originally thought the crash was caused by recovery-ast, but it turns out it can occur for other cases, e.g. typo-correction. Differential Revision: https://reviews.llvm.org/D102750	2021-05-19 13:29:28 +02:00
Bjorn Pettersson	722c39fef5	[HIP] Tighten checks in hip-include-path.hip test case The checks (both positive and negative checks) in the test case hip-include-path.hip could mistakenly end up matching the string "clang" from the InstalledDir in case the build dir for example was named "/home/username/build-clang/". Intention with this patch is to tighten up the checks a bit to filter our the part of the paths that match with InstalledDir when doing the checks, as well as matching "/lib/clang/" rather than just "clang/". Problem was found when building with -DCLANG_DEFAULT_RTLIB=compiler-rt -DCLANG_DEFAULT_CXX_STDLIB=libc++ and having "clang/" in the path to the build dir. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D102723	2021-05-19 13:11:57 +02:00
Kristóf Umann	479ea2a8ed	[analyzer] Check the checker name, rather than the ProgramPointTag when silencing a checker The program point created by the checker, even if it is an error node, might not be the same as the name under which the report is emitted. Make sure we're checking the name of the checker, because thats what we're silencing after all. Differential Revision: https://reviews.llvm.org/D102683	2021-05-19 12:40:09 +02:00
Arthur Eubanks	0c509dbc7e	[NewPM] Add options to PrintPassInstrumentation To bring D99599's implementation in line with the existing PrintPassInstrumentation, and to fix a FIXME, add more customizability to PrintPassInstrumentation. Introduce three new options. The first takes over the existing "-debug-pass-manager-verbose" cl::opt. The second and third option are specific to -fdebug-pass-structure. They allow indentation, and also don't print analysis queries. To avoid more golden file tests than necessary, prune down the -fdebug-pass-structure tests. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D102196	2021-05-18 20:59:35 -07:00
Mike Rice	ff99fdf63f	[OpenMP] Stabilize OpenMP/parallel_for_codegen.cpp test (NFC) Revert recent commit to require x86-registered-target (`e4b790c5e3`). Remove -O1 from the run lines so they are less dependent on backend passes. Update the CHECK6 and CHECK10 lines with script. Differential Revision: https://reviews.llvm.org/D102720	2021-05-18 16:07:22 -07:00
Fangrui Song	2919222d80	[Driver] Delete -mimplicit-it= This is a GNU as and Clang cc1as option, not a GCC option. Users should specify `-Wa,-mimplicit-it=` instead. Note: mixing the -m option and the -Wa, option doesn't work `-Wa,-mimplicit-it=never -mimplicit-it=always` => `clang (LLVM option parsing): for the --arm-implicit-it option: may only occur zero or one times!` Reviewed By: nickdesaulniers, raj.khem Differential Revision: https://reviews.llvm.org/D102568	2021-05-18 10:57:24 -07:00
Aaron Ballman	ccbac06a07	Speculatively fix failing tests from `6381664580` This was causing some Mac-specific build failures: http://45.33.8.238/macm1/9739/step_7.txt http://45.33.8.238/mac/31615/step_7.txt As best I can tell with psychic debugging, the /Users/blah path to the source file is being treated as a macro undef with the clang-cl driver. This splits the filename off explicitly so hopefully the rest of the command line arguments will be read properly.	2021-05-18 12:44:58 -04:00
Sergey Dmitriev	8998a8aa97	[clang-offload-bundler] Add sections and set section flags using one llvm-objcopy invocation llvm-objcopy has been changed to support adding a section and updating section flags in one run (D90438), so we can now change clang-offload-bundler to run llvm-objcopy tool only once when creating fat object. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D102670	2021-05-18 08:44:41 -07:00
Aaron Ballman	6381664580	Introduce SYCL 2020 mode Currently, we have support for SYCL 1.2.1 (also known as SYCL 2017). This patch introduces the start of support for SYCL 2020 mode, which is the latest SYCL standard available at (https://www.khronos.org/registry/SYCL/specs/sycl-2020/html/sycl-2020.html). This sets the default SYCL to be 2020 in the driver, and introduces the notion of a "default" version (set to 2020) when cc1 is in SYCL mode but there was no explicit -sycl-std= specified on the command line.	2021-05-18 10:34:14 -04:00
Jinsong Ji	7d6449322e	[DebugInfo][test] Check specific func name to ignore codegen differences We use `CHECK-LABEL: define` to divide input stream into functions, this works well on most platforms. But there are cases that some platforms (eg: AIX) may have different codegen , especially for global constructor and descructors. On AIX, the codegen will have two more functions: __dtor_b, __finalize_b, which will fail the test. The fix is to use specific function name so that we can safely ignore those unrelated codegen differences. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102654	2021-05-18 14:03:27 +00:00
Alexey Bader	2ab513cd3e	[SYCL] Enable `opencl_global_[host,device]` attributes for SYCL Differential Revision: https://reviews.llvm.org/D100396	2021-05-18 10:27:35 +03:00
Ole Strohm	642d2f000b	[OpenCL] Fix initialization of __constant constructors without arguments This fixes the initialization of objects in the __constant address space that occurs when declaring the object. Fixes part of PR42566 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D102248	2021-05-18 10:59:53 +01:00
Ten Tzen	797ad70152	[Windows SEH]: HARDWARE EXCEPTION HANDLING (MSVC -EHa) - Part 1 This patch is the Part-1 (FE Clang) implementation of HW Exception handling. This new feature adds the support of Hardware Exception for Microsoft Windows SEH (Structured Exception Handling). This is the first step of this project; only X86_64 target is enabled in this patch. Compiler options: For clang-cl.exe, the option is -EHa, the same as MSVC. For clang.exe, the extra option is -fasync-exceptions, plus -triple x86_64-windows -fexceptions and -fcxx-exceptions as usual. NOTE:: Without the -EHa or -fasync-exceptions, this patch is a NO-DIFF change. The rules for C code: For C-code, one way (MSVC approach) to achieve SEH -EHa semantic is to follow three rules: * First, no exception can move in or out of _try region., i.e., no "potential faulty instruction can be moved across _try boundary. * Second, the order of exceptions for instructions 'directly' under a _try must be preserved (not applied to those in callees). * Finally, global states (local/global/heap variables) that can be read outside of _try region must be updated in memory (not just in register) before the subsequent exception occurs. The impact to C++ code: Although SEH is a feature for C code, -EHa does have a profound effect on C++ side. When a C++ function (in the same compilation unit with option -EHa ) is called by a SEH C function, a hardware exception occurs in C++ code can also be handled properly by an upstream SEH _try-handler or a C++ catch(...). As such, when that happens in the middle of an object's life scope, the dtor must be invoked the same way as C++ Synchronous Exception during unwinding process. Design: A natural way to achieve the rules above in LLVM today is to allow an EH edge added on memory/computation instruction (previous iload/istore idea) so that exception path is modeled in Flow graph preciously. However, tracking every single memory instruction and potential faulty instruction can create many Invokes, complicate flow graph and possibly result in negative performance impact for downstream optimization and code generation. Making all optimizations be aware of the new semantic is also substantial. This design does not intend to model exception path at instruction level. Instead, the proposed design tracks and reports EH state at BLOCK-level to reduce the complexity of flow graph and minimize the performance-impact on CPP code under -EHa option. One key element of this design is the ability to compute State number at block-level. Our algorithm is based on the following rationales: A _try scope is always a SEME (Single Entry Multiple Exits) region as jumping into a _try is not allowed. The single entry must start with a seh_try_begin() invoke with a correct State number that is the initial state of the SEME. Through control-flow, state number is propagated into all blocks. Side exits marked by seh_try_end() will unwind to parent state based on existing SEHUnwindMap[]. Note side exits can ONLY jump into parent scopes (lower state number). Thus, when a block succeeds various states from its predecessors, the lowest State triumphs others. If some exits flow to unreachable, propagation on those paths terminate, not affecting remaining blocks. For CPP code, object lifetime region is usually a SEME as SEH _try. However there is one rare exception: jumping into a lifetime that has Dtor but has no Ctor is warned, but allowed: Warning: jump bypasses variable with a non-trivial destructor In that case, the region is actually a MEME (multiple entry multiple exits). Our solution is to inject a eha_scope_begin() invoke in the side entry block to ensure a correct State. Implementation: Part-1: Clang implementation described below. Two intrinsic are created to track CPP object scopes; eha_scope_begin() and eha_scope_end(). _scope_begin() is immediately added after ctor() is called and EHStack is pushed. So it must be an invoke, not a call. With that it's also guaranteed an EH-cleanup-pad is created regardless whether there exists a call in this scope. _scope_end is added before dtor(). These two intrinsics make the computation of Block-State possible in downstream code gen pass, even in the presence of ctor/dtor inlining. Two intrinsic, seh_try_begin() and seh_try_end(), are added for C-code to mark _try boundary and to prevent from exceptions being moved across _try boundary. All memory instructions inside a _try are considered as 'volatile' to assure 2nd and 3rd rules for C-code above. This is a little sub-optimized. But it's acceptable as the amount of code directly under _try is very small. Part-2 (will be in Part-2 patch): LLVM implementation described below. For both C++ & C-code, the state of each block is computed at the same place in BE (WinEHPreparing pass) where all other EH tables/maps are calculated. In addition to _scope_begin & _scope_end, the computation of block state also rely on the existing State tracking code (UnwindMap and InvokeStateMap). For both C++ & C-code, the state of each block with potential trap instruction is marked and reported in DAG Instruction Selection pass, the same place where the state for -EHsc (synchronous exceptions) is done. If the first instruction in a reported block scope can trap, a Nop is injected before this instruction. This nop is needed to accommodate LLVM Windows EH implementation, in which the address in IPToState table is offset by +1. (note the purpose of that is to ensure the return address of a call is in the same scope as the call address. The handler for catch(...) for -EHa must handle HW exception. So it is 'adjective' flag is reset (it cannot be IsStdDotDot (0x40) that only catches C++ exceptions). Suppress push/popTerminate() scope (from noexcept/noTHrow) so that HW exceptions can be passed through. Original llvm-dev [RFC] discussions can be found in these two threads below: https://lists.llvm.org/pipermail/llvm-dev/2020-March/140541.html https://lists.llvm.org/pipermail/llvm-dev/2020-April/141338.html Differential Revision: https://reviews.llvm.org/D80344/new/	2021-05-17 22:42:17 -07:00
Jinsong Ji	82b5281247	[Driver][test] Don't assume integrated-as The tests of fdebug-compilation-dir and -ffile-compilation-dir for `-x assembler` are assuming integrated-as. If the platform set the no-itegrated-as by default (eg: AIX for now), then this test will fail. Add the -integrated-as to aviod relying on the platform defaults. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D102647	2021-05-17 20:24:21 +00:00
Eli Friedman	698568b74c	[clang CodeGen] Don't crash on large atomic function parameter. I wouldn't recommend writing code like the testcase; a function parameter isn't atomic, so using an atomic type doesn't really make sense. But it's valid, so clang shouldn't crash on it. The code was assuming hasAggregateEvaluationKind(Ty) implies Ty is a RecordType, which isn't true. Just use isRecordType() instead. Differential Revision: https://reviews.llvm.org/D102015	2021-05-17 13:18:23 -07:00
Arthur Eubanks	3a0b6dc3e8	Revert "[Clang] -Wunused-but-set-parameter and -Wunused-but-set-variable" This reverts commit `14dfb3831c`. More false positives, see D100581.	2021-05-17 12:16:10 -07:00
Nick Desaulniers	0f41778919	[AArch64] Support customizing stack protector guard Follow up to D88631 but for aarch64; the Linux kernel uses the command line flags: 1. -mstack-protector-guard=sysreg 2. -mstack-protector-guard-reg=sp_el0 3. -mstack-protector-guard-offset=0 to use the system register sp_el0 for the stack canary, enabling the kernel to have a unique stack canary per task (like a thread, but not limited to userspace as the kernel can preempt itself). Address pr/47341 for aarch64. Fixes: https://github.com/ClangBuiltLinux/linux/issues/289 Signed-off-by: Nick Desaulniers <ndesaulniers@google.com> Reviewed By: xiangzhangllvm, DavidSpickett, dmgreen Differential Revision: https://reviews.llvm.org/D100919	2021-05-17 11:49:22 -07:00
Abbas Sabra	ebcf030efc	[analyzer] Engine: fix crash with SEH __leave keyword MSVC has a `try-except` statement. This statement could containt a `__leave` keyword, which is similar to `goto` to the end of the try block. The semantic of this keyword is not implemented. We should at least parse such code without crashing. https://docs.microsoft.com/en-us/cpp/cpp/try-except-statement?view=msvc-160 Patch By: AbbasSabra! Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D102280	2021-05-17 20:10:26 +02:00
Michael Benfield	14dfb3831c	[Clang] -Wunused-but-set-parameter and -Wunused-but-set-variable These are intended to mimic warnings available in gcc. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D100581	2021-05-17 11:02:26 -07:00
Yaxun (Sam) Liu	18cb17ce4c	[HIP] Fix spack detection Missing or duplicate spack package should not cause error, since users may only installed llvm/clang package, or users may installed duplicate HIP package but will use environment variable or compiler option to choose HIP path. The message about missing or duplicate spack package is informational, therefore should be emitted only when -v is specified. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D102556	2021-05-17 13:24:05 -04:00
Steffen Larsen	f226e28a88	[Clang][NVPTX] Add NVPTX intrinsics and builtins for CUDA PTX redux.sync instructions Adds NVPTX builtins and intrinsics for the CUDA PTX `redux.sync` instructions for `sm_80` architecture or newer. PTX ISA description of `redux.sync`: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#parallel-synchronization-and-communication-instructions-redux-sync Authored-by: Steffen Larsen <steffen.larsen@codeplay.com> Differential Revision: https://reviews.llvm.org/D100124	2021-05-17 09:46:59 -07:00
Stuart Adams	02c2468864	[Clang][NVPTX] Add NVPTX intrinsics and builtins for CUDA PTX cp.async instructions Adds NVPTX builtins and intrinsics for the CUDA PTX `cp.async` instructions for `sm_80` architecture or newer. PTX ISA description of `cp.async`: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-asynchronous-copy https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#parallel-synchronization-and-communication-instructions-cp-async-mbarrier-arrive Authored-by: Stuart Adams <stuart.adams@codeplay.com> Co-Authored-by: Alexander Johnston <alexander@codeplay.com> Differential Revision: https://reviews.llvm.org/D100394	2021-05-17 09:46:59 -07:00
Xiangling Liao	e0921655b1	[AIX] Implement AIX special bitfield related alignment rules 1.[bool, char, short] bitfields have the same alignment as unsigned int 2.Adjust alignment on typedef field decls/honor align attribute 3.Fix alignment for scoped enum class 4.Long long bitfield has 4bytes alignment and StorageUnitSize under 32 bit compile mode Differential Revision: https://reviews.llvm.org/D87029	2021-05-17 11:30:29 -04:00
Sylvain Audi	6052a8a535	[clang] In DependencyCollector on Windows, ignore case and separators when discarding duplicate dependency file paths. This patch removes duplicates also encountered in the output of clang-scan-deps when one same header file is encountered with different casing and/or different separators ('/' vs '\'). The case of separators can appear when the same file is included externally by `#include <folder/file.h>` whereas a file from the same folder does `#include "file.h"` Under Windows, clang computes the paths using '/' from the include directive, the `\` from the -I options, and the concatenations use the native `\`, leading to internal paths containing a mix of both separators. Differential Revision: https://reviews.llvm.org/D102339	2021-05-17 10:32:52 -04:00
Irina Dobrescu	50511df32e	[AArch64] Lower bitreverse in ISel Adding lowering support for bitreverse. Previously, lowering bitreverse would expand it into a series of other instructions. This patch makes it so this produces a single rbit instruction instead. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D102397	2021-05-17 13:35:27 +01:00
Raphael Isemann	888ce70af2	[DebugInfo] Fix DWARF expressions for __block vars that are not on the heap `__block` variables used to be always stored on the head instead of stack. D51564 allowed `__block` variables to the stored on the stack like normal variablesif they not captured by any escaping block, but the debug-info generation code wasn't made aware of it so we still unconditionally emit DWARF expressions pointing to the heap. This patch makes CGDebugInfo use the `EscapingByref` introduced in D51564 that tracks whether the `__block` variable is actually on the heap. If it's stored on the stack instead we just use the debug info we would generate for normal variables instead. Reviewed By: ahatanak, aprantl Differential Revision: https://reviews.llvm.org/D99946	2021-05-17 14:32:07 +02:00
Ole Strohm	66b112d624	[OpenCL] Fix reinterpret_cast of vectors Fixes issues with vectors in reinterpret_cast in C++ for OpenCL and adds tests to make sure they both pass without errors and generate the correct code. Fixes: PR47977 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D101519	2021-05-17 12:42:21 +01:00
Jan Svoboda	ccd485e554	Revert "[clang][deps] Support inferred modules" This reverts commit `1d9e8e13` Something is off with handling Windows paths: http://45.33.8.238/win/38908/step_7.txt Most likely introduced in https://reviews.llvm.org/D102491	2021-05-17 13:27:14 +02:00
Anastasia Stulova	3549466ac0	[OpenCL] Drop pragma handling for extension types/decls. Drop non-conformant extension pragma implementation as it does not properly disable anything and therefore enabling non-disabled logic has no meaning. This simplifies clang code and user interface to the extension functionality. With this patch extension pragma 'begin'/'end' and 'enable'/'disable' are only accepted for backward compatibility and no longer have any default behavior. Differential Revision: https://reviews.llvm.org/D101043	2021-05-17 12:09:43 +01:00
Michael Spencer	1d9e8e13dd	[clang][deps] Support inferred modules This patch adds support for inferred modules to the dependency scanner. Effectively a cherry-pick of https://github.com/apple/llvm-project/pull/699 authored by @Bigcheese with libclang and other changes omitted. Contains following changes: 1. [Clang][ScanDeps] Ignore __inferred_module.map dependency. * This shows up with inferred modules, but it doesn't exist on disk, so don't report it as a dependency. 2. [Clang][ScanDeps] Use the module map a module was inferred from for inferred modules. Also includes a smoke test that uses clang-scan-deps output to perform an explicit build. There's no intention to duplicate whatever `test/Modules` contains, just to verify the produced command-line does "work" (with very loose definition of work). Split from D100934. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D102495	2021-05-17 11:41:54 +02:00
Michael Spencer	d3676d4b66	[clang][modules] Build inferred modules This patch enables explicitly building inferred modules. Effectively a cherry-pick of https://github.com/apple/llvm-project/pull/699 authored by @Bigcheese with libclang and dependency scanner changes omitted. Contains the following changes: 1. [Clang] Fix the header paths in clang::Module for inferred modules. * The UmbrellaAsWritten and NameAsWritten fields in clang::Module are a lie for framework modules. For those they actually are the path to the header or umbrella relative to the clang::Module::Directory. * The exception to this case is for inferred modules. Here it actually is the name as written, because we print out the module and read it back in when implicitly building modules. This causes a problem when explicitly building an inferred module, as we skip the printing out step. * In order to fix this issue this patch adds a new field for the path we want to use in getInputBufferForModule. It also makes NameAsWritten actually be the name written in the module map file (or that would be, in the case of an inferred module). 2. [Clang] Allow explicitly building an inferred module. * Building the actual module still fails, but make sure it fails for the right reason. Split from D100934. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D102491	2021-05-17 10:40:51 +02:00
Pan, Tao	976a3e5f61	[SelectionDAG] Make fast and linearize visible by clang -pre-RA-sched ScheduleDAGFast.cpp is compiled to object file, but the ScheduleDAGFast object file isn't linked into clang executable file as no symbol is referred by outside. Add calling to createXxx of ScheduleDAGFast.cpp, then the ScheduleDAGFast object file will be linked into clang executable file. The static RegisterScheduler will register scheduler fast and linearize at clang boot time. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D101601	2021-05-17 11:25:15 +08:00
Alessandro Decina	833e9b2ea7	[BPF] add support for 32 bit registers in inline asm Add "w" constraint type which allows selecting 32 bit registers. 32 bit registers were added in https://reviews.llvm.org/rGca31c3bb3ff149850b664838fbbc7d40ce571879. Differential Revision: https://reviews.llvm.org/D102118	2021-05-16 11:01:47 -07:00
Florian Hahn	803c52d0db	Recommit "[Clang,Driver] Add -fveclib=Darwin_libsystem_m support." Recommit D102489, with the test case requiring the AArch64 backend. This reverts the revert `59b419adc6`.	2021-05-16 18:49:53 +01:00
Danila Kutenin	d29f7f1a7b	[clang] Fix ternary operator in the second for loop statement Fix ternary operator in for loop argument, it was by mistake not set as CanBeForRangeDecl and led to incorrect codegen. It fixes https://bugs.llvm.org/show_bug.cgi?id=50038. I don't have commit rights. Danila Kutenin. kutdanila@yandex.ru Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D102502	2021-05-16 10:42:52 -07:00
Nikita Popov	656296b1c2	Reapply [CaptureTracking] Do not check domination Reapply after adjusting the synchronized.m test case, where the TODO is now resolved. The pointer is only captured on the exception handling path. ----- For the CapturesBefore tracker, it is sufficient to check that I can not reach BeforeHere. This does not necessarily require that BeforeHere dominates I, it can also occur if the capture happens on an entirely disjoint path. This change was previously accepted in D90688, but had to be reverted due to large compile-time impact in some cases: It increases the number of reachability queries that are performed. After recent changes, the compile-time impact is largely mitigated, so I'm reapplying this patch. The remaining compile-time impact is largely proportional to changes in code-size.	2021-05-16 15:46:31 +02:00
Saurabh Jha	ecb235d940	[Matrix] Implement static cast for matrix types This patch implements static casts for matrix types. This patch finishes all the work needed for https://bugs.llvm.org/show_bug.cgi?id=47141 Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D102125	2021-05-16 08:09:12 +01:00
Hubert Tong	9ae529d0db	[AIX][clang-repl][test] Mark unsupported pending XCOFF64 integrated-as This patch replaces the `powerpc64` token with the `system-aix` one in the UNSUPPORTED line of a test. The `powerpc64` token was originally added temporarily in `71a0609a2b`. If AIX uses integrated-as by default and it works both for 32-bit and 64-bit objects, then the issues encountered so far (see comments in D96033) would be mostly solved. As it is, marking the test as expected-to-fail (as opposed to unsupported) on AIX might cause more trouble in the form of 32-bit versus 64-bit differences. I am not aware of other situations where LIT tests are dependent on whether the LLVM build is 64-bit or 32-bit. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D102560	2021-05-15 22:41:45 -04:00
Pengxuan Zheng	c9b36a041f	Support GCC's -fstack-usage flag This patch adds support for GCC's -fstack-usage flag. With this flag, a stack usage file (i.e., .su file) is generated for each input source file. The format of the stack usage file is also similar to what is used by GCC. For each function defined in the source file, a line with the following information is produced in the .su file. <source_file>:<line_number>:<function_name> <size_in_byte> <static/dynamic> "Static" means that the function's frame size is static and the size info is an accurate reflection of the frame size. While "dynamic" means the function's frame size can only be determined at run-time because the function manipulates the stack dynamically (e.g., due to variable size objects). The size info only reflects the size of the fixed size frame objects in this case and therefore is not a reliable measure of the total frame size. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100509	2021-05-15 10:22:49 -07:00
Douglas Yung	59b419adc6	Revert "[Clang,Driver] Add -fveclib=Darwin_libsystem_m support." This reverts commit `187a14e1f3`. The test added in this commit is failing on several build bots: https://lab.llvm.org/buildbot/#/builders/139/builds/4059 https://lab.llvm.org/buildbot/#/builders/132/builds/5605	2021-05-14 22:39:12 -07:00
Florian Hahn	187a14e1f3	[Clang,Driver] Add -fveclib=Darwin_libsystem_m support. Support for Darwin's libsystem_m's vector functions has been added to LLVM in `93a9a8a8d9`. This patch adds support for -fveclib=Darwin_libsystem_m to Clang. Reviewed By: arphaman Differential Revision: https://reviews.llvm.org/D102489	2021-05-14 21:00:13 +01:00
Anastasia Stulova	769cc335e6	[OpenCL] Simplify use of C11 atomic types. Remove requirements on extension pragma in atomic types because it has not respected the spec wrt disabling types and hasn't been useful either. With this change, the developers can use atomic types from the extensions if they are supported without enabling the pragma just like the builtin functions This patch does not break backward compatibility since the extension pragma is still supported and it makes the behavior of the compiler less strict by accepting code without needless and inconsistent pragma statements. Differential Revision: https://reviews.llvm.org/D100976	2021-05-14 17:43:00 +01:00
Nemanja Ivanovic	7cd2833311	[PowerPC] Add vec_vupkhpx and vec_vupklpx for XL compatibility These are old names for these functions that XL still supports.	2021-05-14 08:02:00 -05:00
David Spickett	2db090a2eb	[llvm][AsmPrinter] Restore source location to register clobber warning Since `5de2d189e6` this particular warning hasn't had the location of the source file containing the inline assembly. Fix this by reporting via LLVMContext. Which means that we no longer have the "instantiated into assembly here" lines but they were going to point to the start of the inline asm string anyway. This message is already tested via IR in llvm. However we won't have the required location info there so I've added a C file test in clang to cover it. (though strictly, this is testing llvm code) Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D102244	2021-05-14 08:22:57 +00:00
Michael Kruse	83ff0ff463	[Clang][OpenMP] Allow unified_shared_memory for Pascal-generation GPUs. The Pascal architecture supports the page migration engine required for unified_shared_memory, as indicated by NVIDIA: * https://developer.nvidia.com/blog/unified-memory-cuda-beginners/ * https://developer.nvidia.com/blog/beyond-gpu-memory-limits-unified-memory-pascal/ * https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#um-requirements The limitation was introduced in D54493 which justified the cut-off by the requirement for unified addressing. However, Unified Virtual Addressing (UVA) is already available with sm20 (Fermi, Kepler, Maxwell): * https://docs.nvidia.com/cuda/gpudirect-rdma/index.html#basics-of-uva-cuda-memory-management Unified shared memory might even be possible with these, but with migration of entire allocations on kernel startup. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D101595	2021-05-13 17:15:34 -05:00
Lang Hames	71a0609a2b	[clang-repl] Temporarily disable the execute.cpp test on ppc64. This test is failing on some builders (see [1]) with the following error: error: Added modules have incompatible data layouts: e-m:e-i64:64-n32:64-S128-v256:256:256-v512:512:512 (module) vs E-m:a-i64:64-n32:64-S128-v256:256:256-v512:512:512 (jit) The JIT layout is correct, but some IR module added to the JIT is using a little-endian layout instead. This commit disables the test on ppc64 until we can investigate further and fix the bug. [1] https://lab.llvm.org/staging/#/builders/126/builds/371	2021-05-13 14:39:12 -07:00
Aakanksha Patil	464e4dc50f	[AMDGPU] Add gfx1034 target Differential Revision: https://reviews.llvm.org/D102306	2021-05-13 14:25:18 -04:00
Roman Lebedev	0d8f91d2a9	[NFC] Delete two newly-added test cases Failing on bots in unobvious ways.	2021-05-13 21:23:01 +03:00
Roman Lebedev	ecc4e9e8f4	[NFC] Try to fix CodeGenCXX/thunk-wrong-return-type.cpp test	2021-05-13 21:17:31 +03:00
cynecx	8ec9fd4839	Support unwinding from inline assembly I've taken the following steps to add unwinding support from inline assembly: 1) Add a new `unwind` "attribute" (like `sideeffect`) to the asm syntax: ``` invoke void asm sideeffect unwind "call thrower", "~{dirflag},~{fpsr},~{flags}"() to label %exit unwind label %uexit ``` 2.) Add Bitcode writing/reading support + LLVM-IR parsing. 3.) Emit EHLabels around inline assembly lowering (SelectionDAGBuilder + GlobalISel) when `InlineAsm::canThrow` is enabled. 4.) Tweak InstCombineCalls/InlineFunction pass to not mark inline assembly "calls" as nounwind. 5.) Add clang support by introducing a new clobber: "unwind", which lower to the `canThrow` being enabled. 6.) Don't allow unwinding callbr. Reviewed By: Amanieu Differential Revision: https://reviews.llvm.org/D95745	2021-05-13 19:13:03 +01:00
Roman Lebedev	9d3eb7885d	[NFC] Try to fix CodeGenCXX/thunk-wrong-this.cpp test	2021-05-13 21:10:14 +03:00
Roman Lebedev	16d0381841	Return "[CGCall] Annotate `this` argument with alignment" The original change was reverted because it was discovered that clang mishandles thunks, and they receive wrong attributes for their this/return types - the ones for the function they will call, not the ones they have. While i have tried to fix this in https://reviews.llvm.org/D100388 that patch has been up and stuck for a month now, with little signs of progress. So while it will be good to solve this for real, for now we can simply avoid introducing the bug, by not annotating this/return for thunks. This reverts commit `6270b3a1ea`, relanding `0aa0458f14`.	2021-05-13 20:33:14 +03:00
Roman Lebedev	a624cec56d	[Clang][Codegen] Do not annotate thunk's this/return types with align/deref/nonnull attrs As it was discovered in post-commit feedback for `0aa0458f14`, we handle thunks incorrectly, and end up annotating their this/return with attributes that are valid for their callees, not for thunks themselves. While it would be good to fix this properly, and keep annotating them on thunks, i've tried doing that in https://reviews.llvm.org/D100388 with little success, and the patch is stuck for a month now. So for now, as a stopgap measure, subj.	2021-05-13 20:33:08 +03:00
Roman Lebedev	70aa4623de	[NFC][Clang][Codegen] Add tests with wrong attributes on this/return of thunks From https://reviews.llvm.org/D100388	2021-05-13 20:32:40 +03:00
Valeriy Savchenko	45212dec01	[analyzer][solver] Prevent use of a null state rdar://77686137 Differential Revision: https://reviews.llvm.org/D102240	2021-05-13 20:16:29 +03:00
Lei Huang	9469ff15b7	[PowerPC] Add clang option -m[no-]prefixed Add user-facing front end option to turn off power10 prefixed instructions. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D102191	2021-05-13 12:02:10 -05:00
Joe Ellis	2ed7db0d20	[InstSimplify] Remove redundant {insert,extract}_vector intrinsic chains This commit removes some redundant {insert,extract}_vector intrinsic chains by implementing the following patterns as instsimplifies: (insert_vector _, (extract_vector X, 0), 0) -> X (extract_vector (insert_vector _, X, 0), 0) -> X Reviewed By: peterwaller-arm Differential Revision: https://reviews.llvm.org/D101986	2021-05-13 16:09:50 +00:00
Aaron En Ye Shi	6a67e05a26	[HIP] Add __builtin_amdgcn_groupstaticsize Differential Revision: https://reviews.llvm.org/D102403	2021-05-13 15:50:08 +00:00
Zarko Todorovski	8fa168fc50	Parse vector bool when stdbool.h and altivec.h are included Currently when including stdbool.h and altivec.h declaration of `vector bool` leads to errors due to `bool` being expanded to '_Bool`. This patch allows the parser to recognize `_Bool`. Reviewed By: hubert.reinterpretcast, Everybody0523 Differential Revision: https://reviews.llvm.org/D102064	2021-05-13 11:48:32 -04:00
Juneyoung Lee	395607af3c	Reapply [ConstantFold] Fold more operations to poison This was reverted to mitigate mitigate miscompiles caused by the logical and/or to bitwise and/or fold. Reapply it now that the underlying issue has been fixed by D101191. ----- This patch folds more operations to poison. Alive2 proof: https://alive2.llvm.org/ce/z/mxcb9G (it does not contain tests about div/rem because they fold to poison when raising UB) Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D92270	2021-05-13 16:04:12 +02:00
Nemanja Ivanovic	39e4676ca7	[PowerPC] Provide doubleword vector predicate form comparisons on Power7 There are two reasons this shouldn't be restricted to Power8 and up: 1. For XL compatibility 2. Because clang will expand comparison operators to these intrinsics* *Without this patch, the following causes a selection error: int test(vector signed long a, vector signed long b) { return a < b; } This patch provides the handling for the intrinsics in the back end and removes the Power8 guards from the predicate functions (vec_{all\|any}_{eq\|ne\|gt\|ge\|lt\|le}).	2021-05-13 04:56:56 -05:00
serge-sans-paille	6045cb89e5	Use an allow list on reserved macro identifiers The allow list is based on various official sources (see in-code comment). This fixes https://bugs.llvm.org/show_bug.cgi?id=50248 Differential Revision: https://reviews.llvm.org/D102168	2021-05-13 09:23:47 +02:00
Vassil Vassilev	92f9852fc9	[clang-repl] Recommit "Land initial infrastructure for incremental parsing" Original commit message: In http://lists.llvm.org/pipermail/llvm-dev/2020-July/143257.html we have mentioned our plans to make some of the incremental compilation facilities available in llvm mainline. This patch proposes a minimal version of a repl, clang-repl, which enables interpreter-like interaction for C++. For instance: ./bin/clang-repl clang-repl> int i = 42; clang-repl> extern "C" int printf(const char*,...); clang-repl> auto r1 = printf("i=%d\n", i); i=42 clang-repl> quit The patch allows very limited functionality, for example, it crashes on invalid C++. The design of the proposed patch follows closely the design of cling. The idea is to gather feedback and gradually evolve both clang-repl and cling to what the community agrees upon. The IncrementalParser class is responsible for driving the clang parser and codegen and allows the compiler infrastructure to process more than one input. Every input adds to the “ever-growing” translation unit. That model is enabled by an IncrementalAction which prevents teardown when HandleTranslationUnit. The IncrementalExecutor class hides some of the underlying implementation details of the concrete JIT infrastructure. It exposes the minimal set of functionality required by our incremental compiler/interpreter. The Transaction class keeps track of the AST and the LLVM IR for each incremental input. That tracking information will be later used to implement error recovery. The Interpreter class orchestrates the IncrementalParser and the IncrementalExecutor to model interpreter-like behavior. It provides the public API which can be used (in future) when using the interpreter library. Differential revision: https://reviews.llvm.org/D96033	2021-05-13 06:30:29 +00:00
Vassil Vassilev	f6907152db	Revert "[clang-repl] Land initial infrastructure for incremental parsing" This reverts commit `44a4000181`. We are seeing build failures due to missing dependency to libSupport and CMake Error at tools/clang/tools/clang-repl/cmake_install.cmake file INSTALL cannot find	2021-05-13 04:44:19 +00:00
Vassil Vassilev	44a4000181	[clang-repl] Land initial infrastructure for incremental parsing In http://lists.llvm.org/pipermail/llvm-dev/2020-July/143257.html we have mentioned our plans to make some of the incremental compilation facilities available in llvm mainline. This patch proposes a minimal version of a repl, clang-repl, which enables interpreter-like interaction for C++. For instance: ./bin/clang-repl clang-repl> int i = 42; clang-repl> extern "C" int printf(const char*,...); clang-repl> auto r1 = printf("i=%d\n", i); i=42 clang-repl> quit The patch allows very limited functionality, for example, it crashes on invalid C++. The design of the proposed patch follows closely the design of cling. The idea is to gather feedback and gradually evolve both clang-repl and cling to what the community agrees upon. The IncrementalParser class is responsible for driving the clang parser and codegen and allows the compiler infrastructure to process more than one input. Every input adds to the “ever-growing” translation unit. That model is enabled by an IncrementalAction which prevents teardown when HandleTranslationUnit. The IncrementalExecutor class hides some of the underlying implementation details of the concrete JIT infrastructure. It exposes the minimal set of functionality required by our incremental compiler/interpreter. The Transaction class keeps track of the AST and the LLVM IR for each incremental input. That tracking information will be later used to implement error recovery. The Interpreter class orchestrates the IncrementalParser and the IncrementalExecutor to model interpreter-like behavior. It provides the public API which can be used (in future) when using the interpreter library. Differential revision: https://reviews.llvm.org/D96033	2021-05-13 04:23:24 +00:00
Richard Smith	e1aa528d3a	Handle unexpanded packs appearing in type-constraints. For a type-constraint in a lambda signature, this makes the lambda contain an unexpanded pack; for requirements in a requires-expressions it makes the requires-expression contain an unexpanded pack; otherwise it's invalid.	2021-05-12 18:45:34 -07:00
Richard Smith	2f9d8b08ea	PR50306: When instantiating a generic lambda with a constrained 'auto', properly track that it has constraints. Previously an instantiation of a constrained generic lambda would behave as if unconstrained because we incorrectly cached a "has no constraints" value that we computed before the constraints from 'auto' parameters were attached.	2021-05-12 18:45:33 -07:00
Richard Smith	4c88cfb1dc	Add test for substitutability of variable templates in closure type mangling.	2021-05-12 18:45:33 -07:00
Pushpinder Singh	10c779d206	[AMDGPU][OpenMP] Emit textual IR for -emit-llvm -S Previously clang would print a binary blob into the bundled file for amdgcn. With this patch, it will instead print textual IR as expected. Reviewed By: JonChesterfield, ronlieb Differential Revision: https://reviews.llvm.org/D102065 Change-Id: I10c0127ab7357787769fdf9a2edd4b3071e790a1	2021-05-13 01:34:03 +00:00
Richard Smith	5bb7e81c64	Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope. This implements the direction proposed in https://github.com/itanium-cxx-abi/cxx-abi/pull/126. Differential Revision: https://reviews.llvm.org/D101968	2021-05-12 13:13:21 -07:00
Erich Keane	08ba9ce1ef	Suppress Deferred Diagnostics in discarded statements. It doesn't really make sense to emit language specific diagnostics in a discarded statement, and suppressing these diagnostics results in a programming pattern that many users will feel is quite useful. Basically, this makes sure we only emit errors from the 'true' side of a 'constexpr if'. It does this by making the ExprEvaluatorBase type have an opt-in option as to whether it should visit discarded cases. Differential Revision: https://reviews.llvm.org/D102251	2021-05-12 12:48:47 -07:00
Pratyush Das	99d63ccff0	Add type information to integral template argument if required. Non-comprehensive list of cases: * Dumping template arguments; * Corresponding parameter contains a deduced type; * Template arguments are for a DeclRefExpr that hadMultipleCandidates() Type information is added in the form of prefixes (u8, u, U, L), suffixes (U, L, UL, LL, ULL) or explicit casts to printed integral template argument, if MSVC codeview mode is disabled. Differential revision: https://reviews.llvm.org/D77598	2021-05-12 19:00:08 +00:00
Nico Weber	d8c227ba05	Revert "Produce warning for performing pointer arithmetic on a null pointer." This reverts commit `dfc1e31d49`. See discussion on https://reviews.llvm.org/D98798	2021-05-12 14:53:50 -04:00
Anastasia Stulova	58d18dde5c	[OpenCL] Remove pragma requirement from Arm dot extension. This removed the pointless need for extension pragma since it doesn't disable anything properly and it doesn't need to enable anything that is not possible to disable. The change doesn't break existing kernels since it allows to compile more cases i.e. without pragma statements but the pragma continues to be accepted. Differential Revision: https://reviews.llvm.org/D100985	2021-05-12 16:25:33 +01:00
Yaxun (Sam) Liu	98575708da	[CUDA][HIP] Fix device template variables Currently clang does not emit device template variables instantiated only in host functions, however, nvcc is able to do that: https://godbolt.org/z/fneEfferY This patch fixes this issue by refactoring and extending the existing mechanism for emitting static device var ODR-used by host only. Basically clang records device variables ODR-used by host code and force them to be emitted in device compilation. The existing mechanism makes sure these device variables ODR-used by host code are added to llvm.compiler-used, therefore they are guaranteed not to be deleted. It also fixes non-ODR-use of static device variable by host code causing static device variable to be emitted and registered, which should not. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D102237	2021-05-12 11:13:29 -04:00
Ben Shi	892c56eabe	[clang][AVR] Redefine some types to be compatible with avr-gcc Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D100701	2021-05-12 22:05:26 +08:00
Qiu Chaofan	cbd93cee9b	Revert "[PowerPC] [Clang] Enable float128 feature on VSX targets" This commit brought build break in some f128 related tests. But that's not the root cause. There exists some differences between Clang and GCC's definition for 128-bit float types on PPC, so macros/functions in glibc may not work with clang -mfloat128 well. We need to handle this carefully and reland it.	2021-05-12 16:51:52 +08:00
Qiu Chaofan	febbe4b5a0	[PowerPC] [Clang] Enable float128 feature on VSX targets Reviewed By: nemanjai, steven.zhang Differential Revision: https://reviews.llvm.org/D92815	2021-05-12 14:33:41 +08:00
Richard Smith	bb726383ac	Revert "Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope." This reverts commit `697ac15a0f`, for which review was not complete. That change was accidentally pushed when an unrelated change was pushed.	2021-05-11 17:46:18 -07:00
Richard Smith	3978333b71	Add test for PR50039. I believe Clang's behavior is correct according to the standard here, but this is an unusual situation for which we had no test coverage, so I'm adding some.	2021-05-11 17:35:34 -07:00
Richard Smith	697ac15a0f	Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope. This implements the direction proposed in https://github.com/itanium-cxx-abi/cxx-abi/pull/126. Differential Revision: https://reviews.llvm.org/D101968	2021-05-11 17:35:33 -07:00
Leonard Chan	5cb17728d1	[clang][Fuchsia] Introduce compat multilibs These are GCC-compatible multilibs that use the generic Itanium C++ ABI instead of the Fuchsia C++ ABI. Differential Revision: https://reviews.llvm.org/D102030	2021-05-11 15:45:38 -07:00
Victor Huang	46475a79f8	[AIX][TLS] Diagnose use of unimplemented TLS models Add front end diagnostics to report error for unimplemented TLS models set by - compiler option `-ftls-model` - attributes like `__thread int __attribute__((tls_model("local-exec"))) var_name;` Reviewed by: aaron.ballman, nemanjai, PowerPC Differential Revision: https://reviews.llvm.org/D102070	2021-05-11 17:21:08 -05:00
Mike Rice	f90abac6ca	[OpenMP] Use compound operators for reduction combiner if available. The OpenMP spec seems to require the compound operators be used for +, *, &, \|, and ^ reduction. So use these if a class has those operators. If not try the simple operators as we did previously to limit the impact to existing code. Fixes: https://bugs.llvm.org/show_bug.cgi?id=48584 Differential Revision: https://reviews.llvm.org/D101941	2021-05-11 11:39:12 -07:00
Fangrui Song	2075f2b296	[clang] Support -fpic -fno-semantic-interposition for RISCV -fno-semantic-interposition (only effective with -fpic) can optimize default visibility external linkage (non-ifunc-non-COMDAT) variable access and function calls to avoid GOT/PLT, by using local aliases, e.g. ``` int var; __attribute__((optnone)) int fun(int x) { return x * x; } int test() { return fun(var); } ``` -fpic (var and fun are dso_preemptable) ``` test: .LBB1_1: auipc a0, %got_pcrel_hi(var) ld a0, %pcrel_lo(.LBB1_1)(a0) lw a0, 0(a0) // fun is preemptible by default in ld -shared mode. ld will create a PLT. tail fun@plt ``` vs -fpic -fno-semantic-interposition (var and fun are dso_local) ``` test: .Ltest$local: .LBB1_1: auipc a0, %pcrel_hi(.Lvar$local) addi a0, a0, %pcrel_lo(.LBB1_1) lw a0, 0(a0) // The assembler either resolves .Lfun$local at assembly time (-mno-relax // -fno-function-sections), or produces a relocation referencing a non-preemptible // local symbol (which can avoid PLT). tail .Lfun$local ``` Note: Clang's default -fpic is more aggressive than GCC -fpic: interprocedural optimizations (including inlining) are available but local aliases are not used. -fpic -fsemantic-interposition can disable interprocedural optimizations. Depends on D101875 Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D101876	2021-05-11 11:38:32 -07:00
Jamie Schmeiser	dfc1e31d49	Produce warning for performing pointer arithmetic on a null pointer. Summary: Test and produce warning for subtracting a pointer from null or subtracting null from a pointer. Reuse existing warning that this is undefined behaviour. Also add unit test for both warnings. Reformat to satisfy clang-format. Respond to review comments: add additional test. Respond to review comments: Do not issue warning for nullptr - nullptr in C++. Fix indenting to satisfy clang-format. Respond to review comments: Add C++ tests. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: efriedma (Eli Friedman), nickdesaulniers (Nick Desaulniers) Differential Revision: https://reviews.llvm.org/D98798	2021-05-11 11:29:50 -04:00
Pushpinder Singh	eca3d68399	Revert "[AMDGPU][OpenMP] Emit textual IR for -emit-llvm -S" This reverts commit `7f78e409d0`.	2021-05-11 10:07:13 -05:00
Anastasia Stulova	13ea238b1e	[OpenCL] Allow use of double type without extension pragma. Simply use of extensions by allowing the use of supported double types without the pragma. Since earlier standards instructed that the pragma is used explicitly a new warning is introduced in pedantic mode to indicate that use of type without extension pragma enable can be non-portable. This patch does not break backward compatibility since the extension pragma is still supported and it makes the behavior of the compiler less strict by accepting code without extra pragma statements. Differential Revision: https://reviews.llvm.org/D100980	2021-05-11 12:54:38 +01:00
Paulo Matos	d7086af214	[WebAssembly] Support for WebAssembly globals in LLVM IR This patch adds support for WebAssembly globals in LLVM IR, representing them as pointers to global values, in a non-default, non-integral address space. Instruction selection legalizes loads and stores to these pointers to new WebAssemblyISD nodes GLOBAL_GET and GLOBAL_SET. Once the lowering creates the new nodes, tablegen pattern matches those and converts them to Wasm global.get/set of the appropriate type. Based on work by Paulo Matos in https://reviews.llvm.org/D95425. Reviewed By: pmatos Differential Revision: https://reviews.llvm.org/D101608	2021-05-11 11:19:29 +02:00
Craig Topper	18f3a14e13	[RISCV] Validate the SEW and LMUL operands to __builtin_rvv_vsetvli(max) These are required to be constants, this patch makes sure they are in the accepted range of values. These are usually created by wrappers in the riscv_vector.h header which should always be correct. This patch protects against a user using the builtin directly. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D102086	2021-05-10 12:11:13 -07:00
Fangrui Song	68a20c7f36	[clang] Support -fpic -fno-semantic-interposition for AArch64 -fno-semantic-interposition (only effective with -fpic) can optimize default visibility external linkage (non-ifunc-non-COMDAT) variable access and function calls to avoid GOT/PLT, by using local aliases, e.g. ``` int var; __attribute__((optnone)) int fun(int x) { return x * x; } int test() { return fun(var); } ``` -fpic (var and fun are dso_preemptable) ``` test: // @test adrp x8, :got:var ldr x8, [x8, :got_lo12:var] ldr w0, [x8] // fun is preemptible by default in ld -shared mode. ld will create a PLT. b fun ``` vs -fpic -fno-semantic-interposition (var and fun are dso_local) ``` test: // @test .Ltest$local: adrp x8, .Lvar$local ldr w0, [x8, :lo12:.Lvar$local] // The assembler either resolves .Lfun$local at assembly time, or produces a // relocation referencing a non-preemptible section symbol (which can avoid PLT). b .Lfun$local ``` Note: Clang's default -fpic is more aggressive than GCC -fpic: interprocedural optimizations (including inlining) are available but local aliases are not used. -fpic -fsemantic-interposition can disable interprocedural optimizations. Depends on D101872 Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D101873	2021-05-10 09:43:33 -07:00
Momchil Velikov	5c7b43aa82	[clang][AArch32] Correctly align HA arguments when passed on the stack Analogously to https://reviews.llvm.org/D98794 this patch uses the `alignstack` attribute to fix incorrect passing of homogeneous aggregate (HA) arguments on AArch32. The EABI/AAPCS was recently updated to clarify how VFP co-processor candidates are aligned: `4488e34998` Differential Revision: https://reviews.llvm.org/D100853	2021-05-10 16:28:46 +01:00
Alexey Bataev	230953d577	[OPENMP]Fix PR48851: the locals are not globalized in SPMD mode. Follow the more general patch for now, do not try to SPMDize the kernel if the variable is used and local. Differential Revision: https://reviews.llvm.org/D101911	2021-05-10 06:34:11 -07:00
Nico Weber	08de6e3ada	clang: Fix tests after `7f78e409d0` if clang is not called clang-13 We might release a new version at some point after all. In fact, use the same pattern the other CHECK lines in this test use, for consistency.	2021-05-10 08:49:26 -04:00
Kadir Cetinkaya	761f3d1675	[clang][PreProcessor] Cutoff parsing after hitting completion point This fixes a crash caused by Lexers being invalidated at code completion points in https://github.com/llvm/llvm-project/blob/main/clang/lib/Lex/PPLexerChange.cpp#L520. Differential Revision: https://reviews.llvm.org/D102069	2021-05-10 11:24:27 +02:00
Pushpinder Singh	7f78e409d0	[AMDGPU][OpenMP] Emit textual IR for -emit-llvm -S Previously clang would print a binary blob into the bundled file for amdgcn. With this patch, it will instead print textual IR as expected. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D102065	2021-05-10 07:54:23 +00:00
Yuanfang Chen	9ffd4924e8	[NFC][Coroutines] Fix two tests by removing hardcoded SSA value.	2021-05-09 19:06:16 -07:00
Arthur Eubanks	34a8a437bf	[NewPM] Hide pass manager debug logging behind -debug-pass-manager-verbose Printing pass manager invocations is fairly verbose and not super useful. This allows us to remove DebugLogging from pass managers and PassBuilder since all logging (aside from analysis managers) goes through instrumentation now. This has the downside of never being able to print the top level pass manager via instrumentation, but that seems like a minor downside. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D101797	2021-05-07 21:51:47 -07:00
Petr Hosek	167906c109	[BareMetal] Ensure that sysroot always comes after library paths This addresses an issue introduced in D91559. We would invoke the compiler with -Lpath/to/lib --sysroot=path/to/sysroot where both locations contain libraries with the same name, but we expect linker to pick up the library in path/to/lib since that version is more specialized. This was the case before D91559 where the sysroot path would be ignored, but after that change linker would now pick up the library from the sysroot which resulted in unexpected behavior. The sysroot path should always come after any user provided library paths, followed by compiler runtime paths. We want for libraries in user provided library paths to always take precedence over sysroot libraries. This matches the behavior of other toolchains used with other targets. Differential Revision: https://reviews.llvm.org/D102049	2021-05-07 14:42:02 -07:00
Petr Hosek	f97ada27aa	Revert "[BareMetal] Ensure that sysroot always comes after library paths" This reverts commit `6b00b34b8a`.	2021-05-07 13:38:04 -07:00
Olivier Goffart	c4adc49a1c	[SEH] Fix regression with SEH in noexpect functions Commit `5baea05601` set the CurCodeDecl because it was needed to pass the assert in CodeGenFunction::EmitLValueForLambdaField, But this was not right to do as CodeGenFunction::FinishFunction passes it to EmitEndEHSpec and cause corruption of the EHStack. Revert the part of the commit that changes the CurCodeDecl, and instead adjust the assert to check for a null CurCodeDecl. Differential Revision: https://reviews.llvm.org/D102027	2021-05-07 13:27:59 -07:00
Petr Hosek	6b00b34b8a	[BareMetal] Ensure that sysroot always comes after library paths This addresses an issue introduced in D91559. We would invoke the compiler with -Lpath/to/lib --sysroot=path/to/sysroot where both locations contain libraries with the same name, but we expect linker to pick up the library in path/to/lib since that version is more specialized. This was the case before D91559 where the sysroot path would be ignored, but after that change linker would now pick up the library from the sysroot which resulted in unexpected behavior. The sysroot path should always come after any user provided library paths, followed by compiler runtime paths. We want for libraries in user provided library paths to always take precedence over sysroot libraries. This matches the behavior of other toolchains used with other targets. Differential Revision: https://reviews.llvm.org/D102049	2021-05-07 13:21:07 -07:00
Thomas Lively	1e9c39a3f9	[WebAssembly] Use functions instead of macros for const SIMD intrinsics To improve hygiene, consistency, and usability, it would be good to replace all the macro intrinsics in wasm_simd128.h with functions. The reason for using macros in the first place was to enforce the use of constants for some arguments using `_Static_assert` with `__builtin_constant_p`. This commit switches to using functions and uses the `__diagnose_if__` attribute rather than `_Static_assert` to enforce constantness. The remaining macro intrinsics cannot be made into functions until the builtin functions they are implemented with can be replaced with normal code patterns because the builtin functions themselves require that their arguments are constants. This commit also fixes a bug with the const_splat intrinsics in which the f32x4 and f64x2 variants were incorrectly producing integer vectors. Differential Revision: https://reviews.llvm.org/D102018	2021-05-07 11:50:19 -07:00
Ahsan Saghir	25bbff632d	[PowerPC] Provide MMA builtins for compatibility Vector pair intrinsics and builtins were renamed in https://reviews.llvm.org/D91974 to replace the _mma_ prefix by _vsx_. However, some projects used the _mma_ version, so this patch adds these intrinsics to provide compatibility. Fixes Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=50159 Reviewed By: nemanjai, amyk Differential Revision: https://reviews.llvm.org/D100482	2021-05-07 09:10:16 -05:00
Anastasia Stulova	76f1de10f4	[OpenCL] Fix optional image types. This change allows the use of identifiers for image types from `cl_khr_gl_msaa_sharing` freely in the kernel code if the extension is not supported since they are not in the list of the reserved identifiers. This change also removed the need for pragma for the types in the extensions since the spec does not require the pragma uses. Differential Revision: https://reviews.llvm.org/D100983	2021-05-07 13:29:28 +01:00
Bruno Cardoso Lopes	819e0d105e	[CGAtomic] Lift strong requirement for remaining compare_exchange combinations Follow up on `431e3138a` and complete the other possible combinations. Besides enforcing the new behavior, it also mitigates TSAN false positives when combining orders that used to be stronger.	2021-05-06 21:05:20 -07:00
Stanislav Mekhanoshin	c714d03785	[AMDGPU] Expose __builtin_amdgcn_perm for v_perm_b32 Differential Revision: https://reviews.llvm.org/D102022	2021-05-06 16:17:33 -07:00
Thomas Lively	b198b9b897	[WebAssembly] Fix argument types in SIMD narrowing intrinsics The builtins were updated to take signed parameters in `627a526955`, but the intrinsics that use those builtins were not updated as well. The intrinsic test did not catch this sign mismatch because it is only reported as an error under -fno-lax-vector-conversions. This commit fixes the type mismatch and adds -fno-lax-vector-conversions to the test to catch similar problems in the future. Differential Revision: https://reviews.llvm.org/D101979	2021-05-06 10:07:45 -07:00
Nemanja Ivanovic	1faf3b195e	[PowerPC] Re-commit `ed87f512bb` This was reverted in `3761b9a234` just as I was about to commit the fix. This patch inlcudes the necessary fix.	2021-05-06 09:50:12 -05:00
David Spickett	e4b790c5e3	[OpenMP] Temporarily require X86 target for parallel_for_codegen.cpp test Since https://reviews.llvm.org/D101849 this test has been failing on bots that only enable either Arm or AArch64 targets. See: https://lab.llvm.org/buildbot/#/builders/107/builds/7601 Temporarily requires X86 for this test while the difference is figured out.	2021-05-06 14:16:43 +00:00
Nico Weber	3761b9a234	Revert "[PowerPC] Provide some P8-specific altivec overloads for P7" This reverts commit `ed87f512bb`. Breaks check-clang, see e.g. https://lab.llvm.org/buildbot/#/builders/139/builds/3818	2021-05-06 10:01:16 -04:00
Nemanja Ivanovic	ed87f512bb	[PowerPC] Provide some P8-specific altivec overloads for P7 This adds additional support for XL compatibility. There are a number of functions in altivec.h that produce a single instruction (or a very short sequence) for Power8 but can be done on Power7 without scalarization. XL provides these implementations. This patch adds the following overloads for doubleword vectors: vec_add vec_cmpeq vec_cmpgt vec_cmpge vec_cmplt vec_cmple vec_sl vec_sr vec_sra	2021-05-06 08:37:36 -05:00
Anastasia Stulova	c28a602329	[OpenCL] Remove subgroups pragma in enqueue kernel and pipe builtins. This patch simplifies the parser and makes the language semantics consistent. There is no extension pragma requirement in the spec for the subgroup functions in enqueue kernel or pipes and all other builtin functions are available without the pragama. Differential Revision: https://reviews.llvm.org/D100984	2021-05-06 13:59:38 +01:00
Johannes Doerfert	df729e2b82	[OpenMP] Overhaul `declare target` handling This patch fixes various issues with our prior `declare target` handling and extends it to support `omp begin declare target` as well. This started with PR49649 in mind, trying to provide a way for users to avoid the "ref" global use introduced for globals with internal linkage. From there it went down the rabbit hole, e.g., all variables, even `nohost` ones, were emitted into the device code so it was impossible to determine if "ref" was needed late in the game (based on the name only). To make it really useful, `begin declare target` was needed as it can carry the `device_type`. Not emitting variables eagerly had a ripple effect. Finally, the precedence of the (explicit) declare target list items needed to be taken into account, that meant we cannot just look for any declare target attribute to make a decision. This caused the handling of functions to require fixup as well. I tried to clean up things while I was at it, e.g., we should not "parse declarations and defintions" as part of OpenMP parsing, this will always break at some point. Instead, we keep track what region we are in and act on definitions and declarations instead, this is what we do for declare variant and other begin/end directives already. Highlights: - new diagnosis for restrictions specificed in the standard, - delayed emission of globals not mentioned in an explicit list of a declare target, - omission of `nohost` globals on the host and `host` globals on the device, - no explicit parsing of declarations in-between `omp [begin] declare variant` and the corresponding end anymore, regular parsing instead, - precedence for explicit mentions in `declare target` lists over implicit mentions in the declaration-definition-seq, and - `omp allocate` declarations will now replace an earlier emitted global, if necessary. --- Notes: The patch is larger than I hoped but it turns out that most changes do on their own lead to "inconsistent states", which seem less desirable overall. After working through this I feel the standard should remove the explicit declare target forms as the delayed emission is horrible. That said, while we delay things anyway, it seems to me we check too often for the current status even though that is often not sufficient to act upon. There seems to be a lot of duplication that can probably be trimmed down. Eagerly emitting some things seems pretty weak as an argument to keep so much logic around. --- Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D101030	2021-05-06 02:10:41 -05:00
Johannes Doerfert	5d8d994dfb	[OpenMP] Make sure classes work on the device as they do on the host We do provide `operator delete(void*)` in `<new>` but it should be available by default. This is mostly boilerplate to test it and the unconditional include of `<new>` in the header we always in include on the device. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D100620	2021-05-06 02:10:30 -05:00
Giorgis Georgakoudis	207b08a913	[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks This patch refactors a subset of Clang OpenMP tests, generating checklines using the update_cc_test_checks script. This refactoring facilitates updating the Clang OpenMP code generation codebase by automating test generation. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101849	2021-05-05 20:08:38 -07:00
Juneyoung Lee	8a156d1c27	[InstCombine] Fully disable select to and/or i1 folding This is a patch that disables the poison-unsafe select -> and/or i1 folding. It has been blocking D72396 and also has been the source of a few miscompilations described in llvm.org/pr49688 . D99674 conditionally blocked this folding and successfully fixed the latter one. The former one was still blocked, and this patch addresses it. Note that a few test functions that has `_logical` suffix are now deoptimized. These are created by @nikic to check the impact of disabling this optimization by copying existing original functions and replacing and/or with select. I can see that most of these are poison-unsafe; they can be revived by introducing freeze instruction. I left comments at fcmp + select optimizations (or-fcmp.ll, and-fcmp.ll) because I think they are good targets for freeze fix. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D101191	2021-05-06 09:29:52 +09:00
Petr Hosek	9d3dbcd24c	[Driver] Move -print-runtime-dir and -print-resource-dir tests Put these into a separate files to match other -print-* options tests. Differential Revision: https://reviews.llvm.org/D101813	2021-05-05 15:23:49 -07:00
Richard Smith	6bbfa0fd40	When performing template argument deduction to select a partial specialization while substituting a partial template parameter pack, don't try to extend the existing deduction. This caused us to select the wrong partial specialization in some rare cases. A recent change to libc++ caused this to happen in practice for code using std::conjunction.	2021-05-05 14:47:18 -07:00
Giorgis Georgakoudis	78a7d8c4dd	[Utils][NFC] Rename replace-function-regex in update_cc_test_checks This patch renames the replace-function-regex to replace-value-regex to indicate that the existing regex replacement functionality can replace any IR value besides functions. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101934	2021-05-05 14:19:30 -07:00
Thomas Lively	81fce29d6e	[WebAssembly] Add SIMD const_splat intrinsics These intrinsics do not correspond to their own underlying instruction, but are a convenience for the common case of materializing a constant vector that has the same value in each lane. Differential Revision: https://reviews.llvm.org/D101885	2021-05-05 13:46:45 -07:00
Thomas Lively	602f318cfd	[WebAssembly] Fix constness of pointer params to load intrinsics Update the SIMD builtin load functions to take pointers to const data and update the intrinsics themselves to not cast away constness. Differential Revision: https://reviews.llvm.org/D101884	2021-05-05 13:16:56 -07:00
Thomas Lively	627a526955	[WebAssembly] Update narrowing builtin function operand types Make the inputs to all narrowing builtins signed, which is how they are interpreted by the underlying instructions (only the result changes sign between instructions). Differential Revision: https://reviews.llvm.org/D101883	2021-05-05 13:04:04 -07:00
Nick Desaulniers	aefbfbcbd7	[Clang] remove text extension from diag::err_drv_invalid_value_with_suggestion This hinders translations, as per: https://clang.llvm.org/docs/InternalsManual.html#the-format-string Reviewed By: MaskRay, xbolva00 Differential Revision: https://reviews.llvm.org/D101387	2021-05-05 11:01:43 -07:00
Nico Weber	f16afcd9b5	[clang] remove an incremental build workaround This cleaned up an oversight over a year ago. Should no longer be needed.	2021-05-05 12:21:56 -04:00
Jinsong Ji	20d0aca430	[clang][Driver] Add -fintegrate-as to debug-pass-structure test CGProfilePass is not always on, it will be disabled when using non-intergrated assemblers. // Only enable CGProfilePass when using integrated assembler, since // non-integrated assemblers don't recognize .cgprofile section. PMBuilder.CallGraphProfile = !CodeGenOpts.DisableIntegratedAS; Add -fintegrate-as to make sure the output don't rely on the platform default. Reviewed By: evgeny777 Differential Revision: https://reviews.llvm.org/D101918	2021-05-05 16:10:57 +00:00
Pushpinder Singh	1f5cacfcb8	[AMDGPU][OpenMP] Fix clang driver crash when provided -c The offload action is used in four different ways as explained in Driver.cpp:4495. When -c is present, the final phase will be assemble (linker when -c is not present). However, this phase is skipped according to D96769 for amdgcn. So, offload action arrives into following situation, compile (device) ---> offload ---> offload without -c the chain looks like, compile (device) ---> offload ---> linker (device) ---> offload The former situation creates an unhandled case which causes problem. The solution presented in this patch delays the D96769 logic until job creation time. This keeps the offload action in the 1 of the 4 specified situations. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D101901	2021-05-05 14:26:58 +00:00
Anastasia Stulova	e994e74bca	[OpenCL] Add clang extension for non-portable kernel parameters. Added __cl_clang_non_portable_kernel_param_types extension that allows using non-portable types as kernel parameters. This allows bypassing the portability guarantees from the restrictions specified in C++ for OpenCL v1.0 s2.4. Currently this only disables the restrictions related to the data layout. The programmer should ensure the compiler generates the same layout for host and device or otherwise the argument should only be accessed on the device side. This extension could be extended to other case (e.g. permitting size_t) if desired in the future. Patch by olestrohm (Ole Strohm)! https://reviews.llvm.org/D101168	2021-05-05 14:58:23 +01:00
Hans Wennborg	4f4aa7b78d	Require asserts for clang/test/Headers/wasm.c The test doesn't pass in no-asserts builds, see comment on https://reviews.llvm.org/D101805	2021-05-05 11:42:18 +02:00
Giorgis Georgakoudis	f016c06abb	Revert "[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks" This reverts commit `956cae2f09`.	2021-05-04 17:12:32 -07:00
Giorgis Georgakoudis	956cae2f09	[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks This patch refactors a subset of Clang OpenMP tests, generating checklines using the update_cc_test_checks script. This refactoring facilitates updating the Clang OpenMP code generation codebase by automating test generation. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101849	2021-05-04 16:58:45 -07:00
Thomas Lively	f3b769e82f	[WebAssembly] Add codegen test for wasm_simd128.h We previously did not have tests demonstrating that the intrinsics in wasm_simd128.h lower to reasonable LLVM IR. This commit adds such a test. Differential Revision: https://reviews.llvm.org/D101805	2021-05-04 16:11:00 -07:00
Leonard Chan	0277a24f4b	[clang][test] Update -fc++-abi tests This attempts to move driver tests out of Frontend and to Driver, separates RUNs that should fail from RUNs that should succeed, and prevent creating output files or dumping output. Differential Revision: https://reviews.llvm.org/D101867	2021-05-04 15:53:00 -07:00
Giorgis Georgakoudis	92f2c39f91	[Utils] Run non-filecheck runlines in-order in update_cc_test_checks The script update_cc_test_checks runs all non-filechecked runlines before the filechecked ones. This creates problems since outputs of those non-filechecked runlines may conflict and that will fail the execution of update_cc_test_checks. This patch executes non-filechecked in the order specified in the test file to avoid this issue. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101683	2021-05-04 12:06:03 -07:00
Leonard Chan	9c72a210b5	Fix for test failure caused by `84c4754372`. Reduces the number of targets/triples for this test since not all cmake invocations will build for those targets.	2021-05-04 11:45:32 -07:00
Dan Liew	1971823ecb	[Driver] Fix `ToolChain::getCompilerRTPath()` to return the correct path on Apple platforms. When the target triple was an Apple platform `ToolChain::getOSLibName()` (called by `getCompilerRTPath()`) would return the full OS name including the version number (e.g. `darwin20.3.0`). This is not correct because the library directory for all Apple platforms is `darwin`. This in turn caused * `-print-runtime-dir` to return a non-existant path. * `-print-file-name=<any compiler-rt library>` to return the filename instead of the full path to the library. Two regression tests are included. rdar://77417317 Differential Revision: https://reviews.llvm.org/D101682	2021-05-04 11:28:26 -07:00
Leonard Chan	84c4754372	[clang] Add -fc++-abi= flag for specifying which C++ ABI to use This implements the flag proposed in RFC http://lists.llvm.org/pipermail/cfe-dev/2020-August/066437.html. The goal is to add a way to override the default target C++ ABI through a compiler flag. This makes it easier to test and transition between different C++ ABIs through compile flags rather than build flags. In this patch: - Store -fc++-abi= in a LangOpt. This isn't stored in a CodeGenOpt because there are instances outside of codegen where Clang needs to know what the ABI is (particularly through ASTContext::createCXXABI), and we should be able to override the target default if the flag is provided at that point. - Expose the existing ABIs in TargetCXXABI as values that can be passed through this flag. - Create a .def file for these ABIs to make it easier to check flag values. - Add an error for diagnosing bad ABI flag values. Differential Revision: https://reviews.llvm.org/D85802	2021-05-04 10:52:13 -07:00
Andrew Savonichev	b451ecd86e	[Clang][AArch64] Disable rounding of return values for AArch64 If a return value is explicitly rounded to 64 bits, an additional zext instruction is emitted, and in some cases it prevents tail call optimization. As discussed in D100225, this rounding is not necessary and can be disabled. Differential Revision: https://reviews.llvm.org/D100591	2021-05-04 20:29:01 +03:00
Jennifer Yu	5285748c2c	Fix assert on the variable which is used in omp clause is not marked as used. The problem only happens with constexpr variable, for constexpr variable, variable is not marked during parser variable. This is because compiler might find some var's associate expressions may not actully an odr-used later, the variables get kept in MaybeODRUseExprs, in normal case, at end of process fullExpr, the variable will be marked during the call to CleanupVarDeclMarking(). Since we are processing expression of OpenMP clauses, and the ActOnFinishFullExpr is not getting called that casue variable is not get marked. One way to fix this is to call CleanupVarDeclMarking() in EndOpenMPClause for each omp directive. This to fix https://bugs.llvm.org/show_bug.cgi?id=50206 Differential Revision: https://reviews.llvm.org/D101781	2021-05-04 09:07:35 -07:00
Ella Ma	d882750f11	[analyzer] Fix a crash for dereferencing an empty llvm::Optional variable in SMTConstraintManager.h. The first crash reported in the bug report 44338. Condition `!isSat.hasValue() \|\| isNotSat.getValue()` here should be `!isNotSat.hasValue() \|\| isNotSat.getValue()`. `getValue()` here crashed when we used the static analyzer to analyze postgresql-12.0. Patch By: OikawaKirie Reviewed By: steakhal, martong Differential Revision: https://reviews.llvm.org/D83660	2021-05-04 16:50:21 +02:00
Saurabh Jha	db210bc69b	[Matrix] Implement C-style explicit type conversions in CXX for matrix types This patch implements C-style explicit type conversions in CXX for matrix types. It is part of fixing https://bugs.llvm.org/show_bug.cgi?id=47141 Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D101696	2021-05-04 15:27:57 +01:00
Nico Weber	d7ec48d71b	[clang] accept -fsanitize-ignorelist= in addition to -fsanitize-blacklist= Use that for internal names (including the default ignorelists of the sanitizers). Differential Revision: https://reviews.llvm.org/D101832	2021-05-04 10:24:00 -04:00
Anastasia Stulova	64911eec75	[OpenCL] Allow pipe as a valid identifier prior to OpenCL 2.0. Pipe has not been a reserved keyword in the earlier OpenCL standards. However we failed to allow its use as an identifier in the original commit. This issues is fixed now and testing is improved accordingly. Differential Revision: https://reviews.llvm.org/D101052	2021-05-04 14:30:42 +01:00
serge-sans-paille	b83b23275b	Introduce -Wreserved-identifier Warn when a declaration uses an identifier that doesn't obey the reserved identifier rule from C and/or C++. Differential Revision: https://reviews.llvm.org/D93095	2021-05-04 11:19:01 +02:00
Arthur Eubanks	d14d84af2f	[NewPM] Only invalidate modified functions' analyses in CGSCC passes Previously, any change in any function in an SCC would cause all analyses for all functions in the SCC to be invalidated. With this change, we now manually invalidate analyses for functions we modify, then let the pass manager know that all function analyses should be preserved. So far this only touches the inliner, argpromotion, funcattrs, and updateCGAndAnalysisManager(), since they are the most used. Slight compile time improvements: http://llvm-compile-time-tracker.com/compare.php?from=326da4adcb8def2abdd530299d87ce951c0edec9&to=8942c7669f330082ef159f3c6c57c3c28484f4be&stat=instructions Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D100917	2021-05-03 17:21:44 -07:00
Arthur Eubanks	2df3426fd1	[NewPM] Invalidate AAManager after populating GlobalsAA GlobalsAA is only created at the beginning of the inliner pipeline. If an AAManager is cached from previous passes, it won't get rebuilt to include the newly created GlobalsAA. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D101379	2021-05-03 16:37:32 -07:00
Heejin Ahn	1c1406f24d	[WebAssembly] Reenable end-to-end test in wasm-eh.cpp This was temporarily disabled while we were reimplementing the new spec. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D101735	2021-05-03 14:42:12 -07:00
Saurabh Jha	696becbd13	[Matrix] Remove bitcast when casting between matrices of the same size In matrix type casts, we were doing bitcast when the matrices had the same size. This was incorrect and this patch fixes that. Also added some new CodeGen tests for signed <-> usigned conversions Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D101754	2021-05-03 15:31:43 +01:00
Nathan Sidwell	ab7316f1c6	[clang] Spell correct variable fix Trailling -> Trailing (two ll-> one l) Differential Revision: https://reviews.llvm.org/D101753	2021-05-03 05:33:47 -07:00
Aaron Puchert	daca6edb31	Thread safety analysis: Fix false negative on break We weren't modifying the lock set when intersecting with one coming from a break-terminated block. This is inconsistent, since break isn't a back edge, and it leads to false negatives with scoped locks. We usually don't warn for those when joining locksets aren't the same, we just silently remove locks that are not in the intersection. But not warning and not removing them isn't right. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D101202	2021-05-03 14:03:17 +02:00
Hans Wennborg	876bf516e7	[clang-cl] Add parsing support for a bunch of new flags MSVC has added some new flags. Although they're not supported, this adds parsing support for them so clang-cl doesn't treat them as filenames. Except for /fsanitize=address which we do support. (clang-cl already exposes the -fsanitize= option, but this allows using the MSVC-spelling with a slash.) Differential revision: https://reviews.llvm.org/D101439	2021-05-03 13:51:27 +02:00
Nathan Sidwell	fe4c9b3cb0	[clang] Remove libstdc++ friend template hack this hack is for a now-unsupported version of libstdc++ Differential Revision: https://reviews.llvm.org/D101392	2021-05-03 04:19:30 -07:00
Craig Topper	cfe3b0005f	[RISCV] Reorder masked builtin operands. Use clang_builtin_alias for all overloaded vector builtins. This patch makes the builtin operand order match the C operand order for all intrinsics. With this we can use clang_builtin_alias for all overloaded intrinsics. This should further reduce the test time for vector intrinsics. Differential Revision: https://reviews.llvm.org/D101700	2021-05-02 10:57:25 -07:00
Yaxun (Sam) Liu	c58a6a6fb4	[HIP] Fix device lib selection Choose optimized device lib bitcode by fp options for performance. Reviewed by: Artem Belevich, Fangrui Song Differential Revision: https://reviews.llvm.org/D101654	2021-05-01 20:31:11 -04:00
Craig Topper	f36e6e16a8	[RISCV] Add missing frontend tests for vcompress intrinsics.	2021-05-01 13:07:28 -07:00
Pratyush Das	8518742104	Fix type printing of array template args The code example: ``` constexpr const char kEta[] = "Eta"; template <const char*, typename T> class Column {}; using quick = Column<kEta,double>; void lookup() { quick c1; c1.ls(); } ``` emits error: no member named 'ls' in 'Column<&kEta, double>'. The patch fixes the printed type name by not printing the ampersand for array types. Differential Revision: https://reviews.llvm.org/D36368	2021-05-01 18:50:09 +00:00
Yaxun (Sam) Liu	0175999805	[AMDGPU] Add options -mamdgpu-ieee -mno-amdgpu-ieee AMDGPU backend need to know whether floating point opcodes that support exception flag gathering quiet and propagate signaling NaN inputs per IEEE754-2008, which is conveyed by a function attribute "amdgpu-ieee". "amdgpu-ieee"="false" turns this off. Without this function attribute backend assumes it is on for compute functions. -mamdgpu-ieee and -mno-amdgpu-ieee are added to Clang to control this function attribute. By default it is on. -mno-amdgpu-ieee requires -fno-honor-nans or equivalent. Reviewed by: Matt Arsenault Differential Revision: https://reviews.llvm.org/D77013	2021-05-01 09:02:55 -04:00
Alex Lorenz	8fc5f07fc0	[clang][driver][darwin] use the deployment target version as the SDK version when passing -platform_version to the linker The use of a valid SDK version is preferred over an empty SDK version (0.0.0) as the system's runtime might expect the linked binary to contain a valid SDK version in order for the binary to work correctly rdar://66795188	2021-04-30 18:54:02 -07:00
Nemanja Ivanovic	bfd60b36f8	[PowerPC] Add floating point overloads for vec_sldw These are added for compatibility with XLC.	2021-04-30 20:29:03 -05:00
Nemanja Ivanovic	c3da07d216	[PowerPC] Provide fastmath sqrt and div functions in altivec.h This adds the long overdue implementations of these functions that have been part of the ABI document and are now part of the "Power Vector Intrinsic Programming Reference" (PVIPR). The approach is to add new builtins and to emit code with the fast flag regardless of whether fastmath was specified on the command line. Differential revision: https://reviews.llvm.org/D101209	2021-04-30 19:17:48 -05:00
Joel E. Denny	82e99f5035	[OpenMP] Fix second debug name from map clause This patch fixes a bug from D89802. For example, without it, Clang generates x as the debug map name for both x and y in the following example: ``` #pragma omp target map(to: x, y) x = y = 1; ``` Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D101564	2021-04-30 16:26:59 -04:00
Vince Bridgers	a27af1d816	[analyzer] Fix assertion in SVals.h Fix assertion in SVals.h apparently caused by https://reviews.llvm.org/D89055. clang:clang/include/clang/StaticAnalyzer/Core/PathSensitive/SVals.h:596: clang::ento::loc::MemRegionVal::MemRegionVal(const clang::ento::MemRegion *): Assertion `r' failed. Backtrace: ... clang/include/clang/StaticAnalyzer/Core/PathSensitive/SVals.h:597:3 clang::QualType, clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:773:18 clang::QualType, clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:612:12 clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:587:12 namespace)::RegionBindingsRef const&, clang::ento::Loc, clang::QualType) clang/lib/StaticAnalyzer/Core/RegionStore.cpp:1510:24 ... Reviewed By: ASDenysPetrov Differential Revision: https://reviews.llvm.org/D101635	2021-04-30 11:00:43 -05:00
Adam Czachorowski	fbfcfdbf68	[clang] Fix assert() crash when checking undeduced arg alignment There already was a check for undeduced and incomplete types, but it failed to trigger when outer type (SubstTemplateTypeParm in test) looked fine, but inner type was not. Differential Revision: https://reviews.llvm.org/D100667	2021-04-30 16:24:33 +02:00
Anastasia Stulova	3ec82e5195	[OpenCL] Prevent adding vendor extensions for all targets Removed extension begin/end pragma as it has no effect and it is added unconditionally for all targets. Differential Revision: https://reviews.llvm.org/D92244	2021-04-30 14:42:51 +01:00
Florian Hahn	6c31295493	[clang] Refactor mustprogress handling, add it to all loops in c++11+. Currently Clang does not add mustprogress to inifinite loops with a known constant condition, matching C11 behavior. The forward progress guarantee in C++11 and later should allow us to add mustprogress to any loop (http://eel.is/c++draft/intro.progress#1). This allows us to simplify the code dealing with adding mustprogress a bit. Reviewed By: aaron.ballman, lebedev.ri Differential Revision: https://reviews.llvm.org/D96418	2021-04-30 14:13:47 +01:00
Tomas Matheson	b14a6f06cc	[ARM][MVE] vcreateq lane ordering for big endian Use of bitcast resulted in lanes being swapped for vcreateq with big endian. Fix this by using vreinterpret. No code change for little endian. Adds IR lit test. Differential Revision: https://reviews.llvm.org/D101606	2021-04-30 13:48:05 +01:00
Keith Walker	109bf25e2c	[AArch64] Change __ARM_FEATURE_FP16FML macro name to __ARM_FEATURE_FP16_FML The "Arm C Language extensions" document (the current version can be found at https://developer.arm.com/documentation/101028/0012/?lang=en) states that the name of the feature test macro for the FP16 FML extension is __ARM_FEATURE_FP16_FML. Differential Revision: https://reviews.llvm.org/D101532	2021-04-30 11:03:15 +01:00
Evgeny Leviant	c81ec19fba	Fix -fdebug-pass-structure test case Pass structure can change when -O0 is given and extensions are used.	2021-04-30 10:18:23 +03:00
Wang, Pengfei	e0c7db7d8c	[MS] Preserve base register %rbx around cpuid This patch copies implementation from cpuid.h, which preserve base register %rbx around cpuid. It fixes PR50133. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D101338	2021-04-30 10:16:25 +08:00
Akira Hatanaka	809435e390	[Sema] Don't set BlockDecl's DoesNotEscape bit if the parameter type of the function the block is passed to isn't a block pointer type This patch fixes a bug where a block passed to a function taking a parameter that doesn't have a block pointer type (e.g., id or reference to a block pointer) was marked as noescape. This partially fixes PR50043. rdar://77030453 Differential Revision: https://reviews.llvm.org/D101097	2021-04-29 16:19:48 -07:00
Akira Hatanaka	2e1d9ebd46	[ObjC][ARC] Don't enter the cleanup scope if the initializer expression isn't an ExprWithCleanups This patch fixes a bug where a temporary ObjC pointer is released before the end of the full expression. This fixes PR50043. rdar://77030453 Differential Revision: https://reviews.llvm.org/D101502	2021-04-29 16:04:30 -07:00
Alex Lorenz	6b938d2ead	Recommit "[clang][driver] Use the provided arch name for a Darwin target triple This ensures that the Darwin driver uses a consistent target triple representation when the triple is printed out to the user. This reverts the revert commit `ab0df6c034`. Differential Revision: https://reviews.llvm.org/D100807	2021-04-29 15:00:40 -07:00
Dan Liew	2d42b2ee7b	[ASan] Rename `-fsanitize-address-destructor-kind=` to drop the `-kind` suffix. Renaming the option is based on discussions in https://reviews.llvm.org/D101122. It is normally not a good idea to rename driver flags but this flag is new enough and obscure enough that it is very unlikely to have adopters. While we're here also drop the `<kind>` metavar. It's not necessary and is actually inconsistent with the documentation in `clang/docs/ClangCommandLineReference.rst`. Differential Revision: https://reviews.llvm.org/D101491	2021-04-29 11:55:42 -07:00
Chirag Khandelwal	c204106188	[Clang][OpenMP] Frontend work for sections - D89671 This patch is child of D89671, contains the clang implementation to use the OpenMP IRBuilder's section construct. Co-author: @anchu-rajendran Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D91054	2021-04-29 19:52:27 +05:30
David Zarzycki	3eb2be67b9	Unbreak no-asserts testing	2021-04-29 10:01:37 -04:00
Evgeny Leviant	6a0283d0d2	[NewPM] Add an option to dump pass structure Patch adds -debug-pass-structure option to dump pass structure when new pass manager is used. Differential revision: https://reviews.llvm.org/D99599	2021-04-29 10:29:42 +03:00
Denys Petrov	b30521c28a	[analyzer] Wrong type cast occurs during pointer dereferencing after type punning Summary: During pointer dereferencing CastRetrievedVal uses wrong type from the Store after type punning. Namely, the pointer casts to another type and then assigns with a value of one more another type. It produces NonLoc value when Loc is expected. Differential Revision: https://reviews.llvm.org/D89055 Fixes: https://bugs.llvm.org/show_bug.cgi?id=37503 https://bugs.llvm.org/show_bug.cgi?id=49007	2021-04-29 01:03:38 +03:00
Petr Hosek	ea12d779bc	[libc++] Support per-target __config_site in per-target runtime build When using the per-target runtime build, it may be desirable to have different __config_site headers for each target where all targets cannot share a single configuration. The layout used for libc++ headers after this change is: ``` include/ c++/ v1/ <libc++ headers except for __config_site> <target1>/ c++/ v1/ __config_site <target2>/ c++/ v1/ __config_site <other targets> ``` This is the most optimal layout since it avoids duplication, the only headers that's per-target is __config_site, all other headers are shared across targets. This also means that we no need two -isystem flags: one for the target-agnostic headers and one for the target specific headers. Differential Revision: https://reviews.llvm.org/D89013	2021-04-28 14:27:16 -07:00
Arthur Eubanks	6d8d133862	Revert "[Clang] -Wunused-but-set-parameter and -Wunused-but-set-variable" This reverts commit `9b0501abc7`. False positives reported in D100581.	2021-04-28 12:47:18 -07:00
Ryan Santhirarajan	0395f9e70b	[ARM] Neon Polynomial vadd Intrinsic fix The Neon vadd intrinsics were added to the ARMSIMD intrinsic map, however due to being defined under an AArch64 guard in arm_neon.td, were not previously useable on ARM. This change rectifies that. It is important to note that poly128 is not valid on ARM, thus it was extracted out of the original arm_neon.td definition and separated for the sake of AArch64. Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D100772	2021-04-28 11:59:40 -07:00
Valeriy Savchenko	602c8b4db5	[analyzer][NFC] Fix tests failing after a rebase	2021-04-28 18:55:20 +03:00
Valeriy Savchenko	ab5823867c	[analyzer] Find better description for tracked symbolic values When searching for stores and creating corresponding notes, the analyzer is more specific about the target region of the store as opposed to the stored value. While this description was tweaked for constant and undefined values, it lacked in the most general case of symbolic values. This patch tries to find a memory region, where this value is stored, to use it as a better alias for the value. rdar://76645710 Differential Revision: https://reviews.llvm.org/D101041	2021-04-28 18:37:38 +03:00
Valeriy Savchenko	e273918038	[analyzer] Track leaking object through stores Since we can report memory leaks on one variable, while the originally allocated object was stored into another one, we should explain how did it get there. rdar://76645710 Differential Revision: https://reviews.llvm.org/D100852	2021-04-28 18:37:38 +03:00
Valeriy Savchenko	61ae2db2d7	[analyzer] Adjust the reported variable name in retain count checker When reporting leaks, we try to attach the leaking object to some variable, so it's easier to understand. Before the patch, we always tried to use the first variable that stored the object in question. This can get very confusing for the user, if that variable doesn't contain that object at the moment of the actual leak. In many cases, the warning is dismissed as false positive and it is effectively a false positive when we fail to properly explain the warning to the user. This patch addresses the bigest issue in cases like this. Now we check if the variable still contains the leaking symbolic object. If not, we look for the last variable to actually hold it and use that variable instead. rdar://76645710 Differential Revision: https://reviews.llvm.org/D100839	2021-04-28 18:37:37 +03:00
David Candler	b8baa2a913	[ARM][AArch64] Require appropriate features for crypto algorithms This patch changes the AArch32 crypto instructions (sha2 and aes) to require the specific sha2 or aes features. These features have already been implemented and can be controlled through the command line, but do not have the expected result (i.e. `+noaes` will not disable aes instructions). The crypto feature retains its existing meaning of both sha2 and aes. Several small changes are included due to the knock-on effect this has: - The AArch32 driver has been modified to ensure sha2/aes is correctly set based on arch/cpu/fpu selection and feature ordering. - Crypto extensions are permitted for AArch32 v8-R profile, but not enabled by default. - ACLE feature macros have been updated with the fine grained crypto algorithms. These are also used by AArch64. - Various tests updated due to the change in feature lists and macros. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D99079	2021-04-28 16:26:18 +01:00
Anton Zabaznov	f0efc00751	[OpenCL] Introduce new method for validating OpenCL target Language options are not available when a target is being created, thus, a new method is introduced. Also, some refactoring is done, such as removing OpenCL feature macros setting from TargetInfo. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D101087	2021-04-28 16:00:02 +03:00
Alexey Bataev	c835630c25	[OPENMP]Fix PR49098: respect firstprivate of declare target variable. Need to respect mapping/privatization of declare target variables in the target regions if explicitly specified by the user. Differential Revision: https://reviews.llvm.org/D99530	2021-04-28 05:39:10 -07:00
Hans Wennborg	2d37f2115d	Try to fix clang/test/Driver/cl-options.c on non-x86 hosts The /QIntel-jcc-erratum flag only works when targeting x86, so pass --target to the driver to do that also on non-x86 hosts.	2021-04-28 11:57:09 +02:00
Hans Wennborg	789549bea4	[clang-cl] Map /QIntel-jcc-erratum to -mbranches-within-32B-boundaries	2021-04-28 11:10:08 +02:00
Hsiangkai Wang	b358a2be52	[RISCV] Remove riscv32 test cases for vector intrinsics.	2021-04-28 15:54:25 +08:00
Petr Hosek	36430d44ed	[Driver] Use normalized triples for per-target runtimes This is a partial revert of `b4537c3f51` based on the discussion in https://reviews.llvm.org/D101194. Rather than using the getMultiarchTriple, we use the getTripleString.	2021-04-27 22:31:36 -07:00
Petr Hosek	a921d2d2fb	[Driver] Add -print-multiarch This is useful in runtimes build for example which currently try to guess the correct triple where to place libraries in the multiarch layout. Using this flag, the build system can get the correct triple directly by querying Clang. Differential Revision: https://reviews.llvm.org/D101400	2021-04-27 16:04:54 -07:00
Samuel Thibault	b13e913b31	hurd: Clean up test - Unsupported Windows to drop backslashes code - Upgrade to current gcc 10 version Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101347	2021-04-27 13:19:17 -07:00
Samuel Thibault	932e8c3241	hurd: Detect libstdc++ include paths on Debian Hurd i386 This is a follow-up of `e92d2b80c6` ("[Driver] Detect libstdc++ include paths for native gcc (-m32 and -m64) on Debian i386") for the Debian Hurd case, which has the same multiarch name reduction from i686 to i386. i386-linux-gnu is actually Linux-only, so this moves the code of that commit to Linux.cpp, and adds the same to Hurd.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101331	2021-04-27 13:04:41 -07:00
Samuel Thibault	9c552d27ee	hurd: Fix i386 research path `f263418402` ("[Driver] Gnu.cpp: remove obsoleted i386 triple detection from end-of-life distribution versions") dropped the i686-gnu gcc path, but GNU/Hurd's gcc is actually using it, and not i386. This fixes the gcc path and update the tests to reflect it. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101317	2021-04-27 12:41:18 -07:00
Petr Hosek	887bdff8ba	[Driver] Fix tests failing in per-target multiarch layout These failures were revealed by `b4537c3f51`. Differential Revision: https://reviews.llvm.org/D101348	2021-04-27 10:19:00 -07:00
Gabor Marton	4b99f9c7db	[analyzer][StdLibraryFunctionsChecker] Track dependent arguments When we report an argument constraint violation, we should track those other arguments that participate in the evaluation of the violation. By default, we depend only on the argument that is constrained, however, there are some special cases like the buffer size constraint that might be encoded in another argument(s). Differential Revision: https://reviews.llvm.org/D101358	2021-04-27 15:35:58 +02:00
Anastasia Stulova	6a92c19f3b	[C++4OpenCL] Add diagnostics for OpenCL types in templates. Refactored diagnostics for OpenCL types to allow their reuse for templates. Patch by olestrohm (Ole Strohm)! Differential Revision: https://reviews.llvm.org/D100860	2021-04-27 13:04:25 +01:00
Pushpinder Singh	59ad4e0f01	Reapply "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `93604305bb`.	2021-04-27 10:47:05 +00:00
Pushpinder Singh	93604305bb	Revert "Reapply "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed"" This reverts commit `15be0c41d2`.	2021-04-27 02:23:44 +00:00
Alex Lorenz	2509f9fbad	[clang] Don't crash when loading invalid VFS for the module dep collector The VFS is null when it's invalid so return early in collectVFSFromYAML.	2021-04-26 17:05:22 -07:00
Yonghong Song	a2a3ca8d97	BPF: emit debuginfo for Function of DeclRefExpr if requested Commit `e3d8ee35e4` ("reland "[DebugInfo] Support to emit debugInfo for extern variables"") added support to emit debugInfo for extern variables if requested by the target. Currently, only BPF target enables this feature by default. As BPF ecosystem grows, callback function started to get support, e.g., recently bpf_for_each_map_elem() is introduced (https://lwn.net/Articles/846504/) with a callback function as an argument. In the future we may have something like below as a demonstration of use case : extern int do_work(int); long bpf_helper(void callback_fn, void callback_ctx, ...); long prog_main() { struct { ... } ctx = { ... }; return bpf_helper(&do_work, &ctx, ...); } Basically bpf helper may have a callback function and the callback function is defined in another file or in the kernel. In this case, we would like to know the debuginfo types for do_work(), so the verifier can proper verify the safety of bpf_helper() call. For the following example, extern int do_work(int); long bpf_helper(void callback_fn); long prog() { return bpf_helper(&do_work); } Currently, there is no debuginfo generated for extern function do_work(). In the IR, we have, ... define dso_local i64 @prog() local_unnamed_addr #0 !dbg !7 { entry: %call = tail call i64 @bpf_helper(i8 bitcast (i32 (i32)* @do_work to i8*)) #2, !dbg !11 ret i64 %call, !dbg !12 } ... declare dso_local i32 @do_work(i32) #1 ... This patch added support for the above callback function use case, and the generated IR looks like below: ... declare !dbg !17 dso_local i32 @do_work(i32) #1 ... !17 = !DISubprogram(name: "do_work", scope: !1, file: !1, line: 1, type: !18, flags: DIFlagPrototyped, spFlags: DISPFlagOptimized, retainedNodes: !2) !18 = !DISubroutineType(types: !19) !19 = !{!20, !20} !20 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed) The TargetInfo.allowDebugInfoForExternalVar is renamed to TargetInfo.allowDebugInfoForExternalRef as now it guards both extern variable and extern function debuginfo generation. Differential Revision: https://reviews.llvm.org/D100567	2021-04-26 16:53:25 -07:00
Michael Benfield	9b0501abc7	[Clang] -Wunused-but-set-parameter and -Wunused-but-set-variable These are intended to mimic warnings available in gcc. -Wunused-but-set-variable is triggered in the case of a variable which appears on the LHS of an assignment but not otherwise used. For instance: void f() { int x; x = 0; } -Wunused-but-set-parameter works similarly, but for function parameters instead of variables. In C++, they are triggered only for scalar types; otherwise, they are triggered for all types. This is gcc's behavior. -Wunused-but-set-parameter is controlled by -Wextra, while -Wunused-but-set-variable is controlled by -Wunused. This is slightly different from gcc's behavior, but seems most consistent with clang's behavior for -Wunused-parameter and -Wunused-variable. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D100581	2021-04-26 15:09:03 -07:00
Alex Lorenz	ab0df6c034	Revert "[clang][driver] Use the provided arch name for a Darwin target triple" This reverts commit `6cc62043c8`. This caused a test failure on a M1 mac CI job (https://reviews.llvm.org/D100807#2718006), I will recommit this with a fix.	2021-04-26 14:57:00 -07:00
Martin Storsjö	f8de9aaef2	[llvm-rc] Add a GNU windres-like frontend to llvm-rc This primarily parses a different set of options and invokes the same resource compiler as llvm-rc normally. Additionally, it can convert directly to an object file (which in MSVC style setups is done with the separate cvtres tool, or by the linker). (GNU windres also supports other conversions; from coff object file back to .res, and from .res or object file back to .rc form; that's not yet implemented.) The other bigger complication lies in being able to imply or pass the intended target triple, to let clang find the corresponding mingw sysroot for finding include files, and for specifying the default output object machine format. It can be implied from the tool triple prefix, like `<triple>-[llvm-]windres` or picked up from the windres option e.g. `-F pe-x86-64`. In GNU windres, that option takes BFD style format names such as pe-i386 or pe-x86-64. As libbfd in binutils doesn't support Windows on ARM, there's no such canonical name for the ARM targets. Therefore, as an LLVM specific extension, this option is extended to allow passing full triples, too. Differential Revision: https://reviews.llvm.org/D100756	2021-04-26 22:04:29 +03:00
Alex Lorenz	6cc62043c8	[clang][driver] Use the provided arch name for a Darwin target triple This ensures that the Darwin driver uses a consistent target triple representation when the triple is printed out to the user. Differential Revision: https://reviews.llvm.org/D100807	2021-04-26 11:31:50 -07:00
Wael Yehia	91db11c98d	XFAIL thinlto testcases on AIX In D100350, we switched off thinLTO on the AIX OS. So, XFAIL tests that invoke clang with -flto=thin on AIX.	2021-04-26 17:02:06 +00:00
Nigel Perks	e7b6c0f398	[clang][XCore] Define __xcore__ for XCore target. The headers shipped with the XMOS XCore compiler expect __xcore__ to be defined. The __XS1B__ macro, already defined, is for the default subtarget. No other targets affected.	2021-04-26 15:06:04 +01:00
Alexey Bader	7818906ca1	[SYCL] Implement SYCL address space attributes handling Default address space (applies when no explicit address space was specified) maps to generic (4) address space. Added SYCL named address spaces `sycl_global`, `sycl_local` and `sycl_private` defined as sub-sets of the default address space. Static variables without address space now reside in global address space when compile for SPIR target, unless they have an explicit address space qualifier in source code. Differential Revision: https://reviews.llvm.org/D89909	2021-04-26 13:44:10 +03:00
Jeremy Morse	3c9bcf0e35	[Clang][Coroutine][DebugInfo] Relax test ordering requirement The test added in D97533 (and modified by this patch) has some overly strict printed metadata ordering requirements, specifically the interleaving of DILocalVariable nodes and DILocation nodes. Slight changes in metadata emission can easily break this unfortunately. This patch stops after clang codegen rather than allowing the coro splitter to run, and reduces the need for ordering: it picks out the DILocalVariable nodes being sought, in any order (CHECK-DAG), and doesn't examine any DILocations. The implicit CHECK-NOT is what's important: the test seeks to ensure a duplicate set of DILocalVariables aren't emitted in the same scope. Differential Revision: https://reviews.llvm.org/D100298	2021-04-26 10:07:22 +01:00
Hsiangkai Wang	16dc9afa09	[RISCV] Add REQUIRES: riscv-registered-target for RISC-V test cases.	2021-04-26 17:02:27 +08:00
Jan Svoboda	0f7d4105c6	[clang][deps] Only generate absolute paths when asked to Add option to `clang-scan-deps` to enable/disable generation of command-line arguments with absolute paths. This is essentially a revert of D100533, but with improved naming and added test. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D101051	2021-04-26 10:53:41 +02:00
Hsiangkai Wang	bd32c2d648	[RISCV] Implement the vwcvt{u}.x.x.v/vncvt.x.x.w builtin. Differential Revision: https://reviews.llvm.org/D100824	2021-04-26 15:38:31 +08:00
Hsiangkai Wang	645c5f27a8	[RISCV] Implement the pseudo compare builtin. Differential Revision: https://reviews.llvm.org/D100823	2021-04-26 15:38:31 +08:00
Hsiangkai Wang	bfb3fca7e1	[RISCV] Implement the vfabs.v/vfneg.v builtin. Differential Revision: https://reviews.llvm.org/D100822	2021-04-26 15:38:31 +08:00
Hsiangkai Wang	4b2434172c	[RISCV] Implement the vmmv.m/vmnot.m builtin. Differential Revision: https://reviews.llvm.org/D100821	2021-04-26 15:38:31 +08:00
Levy Hsu	8cf54c7ff5	[RISCV] [1/2] Add IR intrinsic for Zbe extension RV32/64: bcompress bdecompress RV64 ONLY: bcompressw bdecompressw Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D101143	2021-04-25 19:14:34 -07:00
Hsiangkai Wang	d8fa5ef6a2	[RISCV] Apply clang_builtin_alias to overloaded builtins. We only apply `clang_builtin_alias` to non-masked builtins. Masked builtins could not use `clang_builtin_alias` because the operand order is different between overloaded intrinsics and builtins. A bunch of test cases need to be updated. Differential Revision: https://reviews.llvm.org/D100658	2021-04-25 18:12:19 +08:00
Hsiangkai Wang	108864397d	[Clang] Add clang attribute `clang_builtin_alias`. In some cases, we want to provide the alias name for the clang builtins. For example, the arguments must be constant integers for some RISC-V builtins. If we use wrapper functions, we could not constrain the arguments be constant integer. This attribute is used to achieve the purpose. Besides this, use `clang_builtin_alias` is more efficient than using wrapper functions. We use this attribute to deal with test time issue reported in https://bugs.llvm.org/show_bug.cgi?id=49962. In our downstream testing, it could decrease the testing time from 6.3 seconds to 3.7 seconds for vloxei.c test. Differential Revision: https://reviews.llvm.org/D100611	2021-04-25 08:49:19 +08:00
David Tolnay	967ebad125	Fix null ptr crash dumping TemplateTemplateParmDecl The following program winds up with D->getDefaultArgStorage().getInheritedFrom() == nullptr during dumping the TemplateTemplateParmDecl corresponding to the template parameter of i. template <typename> struct R; template <template <typename> class = R> void i(); This patch fixes the null pointer dereference.	2021-04-24 12:28:10 -04:00
Jon Chesterfield	fc88d927e3	[clang][amdgpu] Use implicit code object version [clang][amdgpu] Use implicit code object version At present, clang always passes amdhsa-code-object-version on to -cc1. That is great for certainty over what object version is being used when debugging. Unfortunately, the command line argument is in AMDGPUBaseInfo.cpp in the amdgpu target. If clang is used with an llvm compiled with DLLVM_TARGETS_TO_BUILD that excludes amdgpu, this will be diagnosed (as discovered via D98658): - Unknown command line argument '--amdhsa-code-object-version=4' This means that clang, built only for X86, can be used to compile the nvptx devicertl for openmp but not the amdgpu one. That would shortly spawn fragile logic in the devicertl cmake to try to guess whether the clang used will work. This change omits the amdhsa-code-object-version parameter when it matches the default that AMDGPUBaseInfo.cpp specifies, with a comment to indicate why. As this is the only part of clang's codegen for amdgpu that depends on the target in the back end it suffices to build the openmp runtime on most (all?) systems. It is a non-functional change, though observable in the updated tests and when compiling with -###. It may cause minor disruption to the amd-stg-open branch. Revision of D98746, builds on refactor in D101077 Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D101095	2021-04-23 23:52:50 +01:00
Thomas Lively	502f54049d	[WebAssembly] Finalize wasm_simd128.h intrinsics Adds new intrinsics for instructions that are in the final SIMD spec but did not previously have intrinsics. Also updates the names of existing intrinsics to reflect the final names of the underlying instructions in the spec. Keeps the old names as deprecated functions to ease the transition to the new names. Differential Revision: https://reviews.llvm.org/D101112	2021-04-23 13:37:27 -07:00
Nemanja Ivanovic	19b29b1ed1	[PowerPC] Provide XL-compatible builtins in altivec.h There are some interfaces in altivec.h that are not compatible between Clang and XL (although Clang is compatible with GCC). Currently, we have found 3 but there may be others. Clang/GCC signatures: vector double vec_ctf(vector signed long long) vector double vec_ctf(vector unsigned long long) vector signed long long vec_cts(vector double) vector unsigned long long vec_ctu(vector double) XL signatures: vector float vec_ctf(vector signed long long) vector float vec_ctf(vector unsigned long long) vector signed int vec_cts(vector double) vector unsigned int vec_ctu(vector double) This patch provides the XL behaviour under the __XL_COMPAT_ALTIVEC__ macro for users that rely on XL behaviour. Differential revision: https://reviews.llvm.org/D101130	2021-04-23 15:13:46 -05:00
Johannes Doerfert	cbe8b57a67	[Clang] Allow the combination of loader_uninitialized and address spaces When an object is allocated in a non-default address space we do not need to check for a constructor if it is not initialized and has a trivial constructor (which we won't call then). Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D100929	2021-04-23 11:21:52 -05:00
Nemanja Ivanovic	6725b90a02	[PowerPC] Add vec_ctsl and vec_ctul to altivec.h These are added for compatibility with XLC. They are similar to vec_cts and vec_ctu except that the result is a doubleword vector regardless of the parameter type.	2021-04-23 11:03:38 -05:00
Gabor Marton	a7cb951fa4	[Analyzer][StdLibraryFunctionsChecker] Describe arg constraints In this patch, I provide a detailed explanation for each argument constraint. This explanation is added in an extra 'note' tag, which is displayed alongside the warning. Since these new notes describe clearly the constraint, there is no need to provide the number of the argument (e.g. 'Arg3') within the warning. However, I decided to keep the name of the constraint in the warning (but this could be a subject of discussion) in order to be able to identify the different kind of constraint violations easily in a bug database (e.g. CodeChecker). Differential Revision: https://reviews.llvm.org/D101060	2021-04-23 17:27:54 +02:00
Anastasia Stulova	fcb45b544d	[OpenCL] Fix typo in the test.	2021-04-23 14:36:36 +01:00
Dávid Bolvanský	2cae7025c1	Reland "[Clang] Propagate guaranteed alignment for malloc and others" This relands commit `6914a0ed2b`. Crash in InstCombine was fixed.	2021-04-23 14:05:57 +02:00
Adam Czachorowski	ddfbdbfefa	[clang] Do not crash on template specialization following a fatal error There was a missing isInvalid() check leading to an attempt to instantiate template with an empty instantiation stack. Differential Revision: https://reviews.llvm.org/D100675	2021-04-23 13:34:05 +02:00
Dávid Bolvanský	6914a0ed2b	Revert "[Clang] Propagate guaranteed alignment for malloc and others" This reverts commit `c2297544c0`. Some buildbots are broken.	2021-04-23 11:33:33 +02:00
Dávid Bolvanský	c2297544c0	[Clang] Propagate guaranteed alignment for malloc and others LLVM should be smarter about known malloc's alignment and this knowledge may enable other optimizations. Originally started as LLVM patch - https://reviews.llvm.org/D100862 but this logic should be really in Clang. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D100879	2021-04-23 11:07:14 +02:00
Hsiangkai Wang	43cd588c2d	[RISCV] Implement the vnot.v builtin. Differential Revision: https://reviews.llvm.org/D100820	2021-04-23 13:16:23 +08:00
Hsiangkai Wang	14cc1cb222	[RISCV] Implement the vneg.v builtin. Differential Revision: https://reviews.llvm.org/D100819	2021-04-23 11:40:18 +08:00
Fangrui Song	2786e673c7	[IR][sanitizer] Add module flag "frame-pointer" and set it for cc1 -mframe-pointer={non-leaf,all} The Linux kernel objtool diagnostic `call without frame pointer save/setup` arise in multiple instrumentation passes (asan/tsan/gcov). With the mechanism introduced in D100251, it's trivial to respect the command line -m[no-]omit-leaf-frame-pointer/-f[no-]omit-frame-pointer, so let's do it. Fix: https://github.com/ClangBuiltLinux/linux/issues/1236 (tsan) Fix: https://github.com/ClangBuiltLinux/linux/issues/1238 (asan) Also document the function attribute "frame-pointer" which is long overdue. Differential Revision: https://reviews.llvm.org/D101016	2021-04-22 18:07:30 -07:00
Jon Chesterfield	15be0c41d2	Reapply "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `24c1ed3b34`.	2021-04-23 01:07:16 +01:00
Richard Smith	40beb1f84a	CWG2076: Permit implicit conversions within a single level of braces during overload resolution, even when calling a copy constructor.	2021-04-22 16:49:39 -07:00
Levy Hsu	b49337bbb9	[RISCV] [1/2] Add IR intrinsic for Zbp extension RV32/64: grev grevi gorc gorci shfl shfli unshfl unshfli RV64 ONLY: grevw greviw gorcw gorciw shflw shfli (For non-existing shfliw) unshfli (For non-existing unshfliw) Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100830	2021-04-22 16:34:51 -07:00
Yaxun (Sam) Liu	8baba6890d	[HIP] Support overloaded math functions for hipRTC Remove the dependence on standard C++ header for overloaded math functions in HIP header since standard C++ header is not available for hipRTC. Reviewed by: Artem Belevich, Justin Lebar Differential Revision: https://reviews.llvm.org/D100794	2021-04-22 19:06:51 -04:00
Petr Hosek	d5f433d330	Revert "Re-land "[Driver] Support default libc++ library location on Darwin"" This reverts commit `6331680ad2` because this breaks the compiler-rt build.	2021-04-22 14:04:24 -07:00
Jon Chesterfield	24c1ed3b34	Revert "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `722d4d8e75`. Unclear where hsa.h should be included from, see report in D99949	2021-04-22 19:39:37 +01:00
Dávid Bolvanský	abf3ca61e3	[Diagnostics] Restore -Wdeprecated warning when user-declared copy assignment operator is defined as deleted (PR45634) Solves https://bugs.llvm.org/show_bug.cgi?id=45634 Be more agressive than GCC with -Wdeprecated-copy. Also provide -W(no-)deprecated-copy-user-provided-copy/dtor options to on/off this behaviour. Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D79714	2021-04-22 20:35:41 +02:00
Petr Hosek	45340efb4c	[Driver] Specify -ccc-install-dir for linux-cross test This avoids test failures where extra files exist in the tree, such as the standard library built using the runtimes build. Differential Revision: https://reviews.llvm.org/D101023	2021-04-22 10:58:26 -07:00
Nemanja Ivanovic	e43963db24	[PowerPC] Add vec_load_splats to altivec.h Add these overloads for compatibility with XLC. This is a word load-and-splat.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	a0e6189712	[PowerPC] Add vec_xlds to altivec.h Add these overloads for compatibility with XLC. This is a doubleword load-and-splat.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	a1d325af67	[PowerPC] Add vec_roundz as alias for vec_trunc in altivec.h Add the overloads for compatibility with XLC.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	1550c47c18	[PowerPC] Add vec_roundp as alias for vec_ceil Add the overloads for compatibility with XLC.	2021-04-22 10:30:59 -05:00
Nemanja Ivanovic	3a46667059	[PowerPC] Add vec_roundm as alias for vec_floor in altivec.h Add the overloads for compatibility with XLC.	2021-04-22 10:30:59 -05:00
Anastasia Stulova	362958ac73	[C++4OpenCL] Add extra diagnostics for kernel argument types Add restrictions on type layout (PR48099): - Types passed by pointer or reference must be standard layout types. - Types passed by value must be POD types. Patch by olestrohm (Ole Strohm)! Differential Revision: https://reviews.llvm.org/D100471	2021-04-22 15:28:04 +01:00
Sven van Haastregt	4138e7bd76	[OpenCL] Add missing C++ legacy atomics with generic https://reviews.llvm.org/D62335 added some C++ for OpenCL specific builtins to opencl-c.h, but these were not mirrored to the TableGen builtin functions yet. The TableGen builtins machinery does not have dedicated version handling for C++ for OpenCL at the moment: all builtin versioning is tied to `LangOpts.OpenCLVersion` (i.e., the OpenCL C version). As a workaround, to add builtins that are only available in C++ for OpenCL, we define a function extension guarded by the __cplusplus macro. Differential Revision: https://reviews.llvm.org/D100935 Fixes PR50041.	2021-04-22 15:08:36 +01:00
Nathan Sidwell	6ad7e87806	clang: libstdc++ LWM is 4.8.3 Document oldest libstdc++ as 4.8.3, remove a hack for a 4.6 issue. Differential Revision: https://reviews.llvm.org/D100465	2021-04-22 05:26:07 -07:00
Sander de Smalen	e951b045bf	[AArch64][SVE] Regression test all ACLE tests with C++ We found issues with a number of intrinsics when building them with C++, so it makes sense to guard these tests with some extra RUN lines to build the tests in C++ mode.	2021-04-22 13:24:04 +01:00
Valeriy Savchenko	5780dbeee6	[-Wcalled-once] Do not run analysis on Obj-C++ Objective-C++ is not yet suppoerted. rdar://76729552 Differential Revision: https://reviews.llvm.org/D100955	2021-04-22 15:20:52 +03:00
Jan Svoboda	850e01a34d	[clang][deps] Check extra args in tests These flags are being generated by `clang-scan-deps` and it makes sense to ensure it keeps doing so.	2021-04-22 14:10:08 +02:00
Hamza Mahfooz	be2277fbf2	[Matrix] Support #pragma clang fp From https://bugs.llvm.org/show_bug.cgi?id=49739: Currently, `#pragma clang fp` are ignored for matrix types. For the code below, the `contract` fast-math flag should be added to the generated call to `llvm.matrix.multiply` and `fadd` ``` typedef float fx2x2_t __attribute__((matrix_type(2, 2))); void foo(fx2x2_t &A, fx2x2_t &C, fx2x2_t &B) { #pragma clang fp contract(fast) C = A*B + C; } ``` Reviewed By: fhahn, mibintc Differential Revision: https://reviews.llvm.org/D100834	2021-04-22 11:45:34 +01:00
Nemanja Ivanovic	3bcd0ece43	[PowerPC] Add vec_roundc as alias for vec_rint in altivec.h For compatibility with XLC, add these overloads.	2021-04-22 05:31:38 -05:00
Jan Svoboda	398f253400	[clang][deps] Include "-cc1" in the arguments To simplify tools consuming dependency scanning results, prepend the "-cc1" argument by default. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D100942	2021-04-22 10:57:56 +02:00
Pushpinder Singh	722d4d8e75	[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed This patch adds new clang tool named amdgpu-arch which uses HSA to detect installed AMDGPU and report back latter's march. This tool is built only if system has HSA installed. The value printed by amdgpu-arch is used to fill -march when latter is not explicitly provided in -Xopenmp-target. Reviewed By: JonChesterfield, gregrodgers Differential Revision: https://reviews.llvm.org/D99949	2021-04-22 05:20:28 +00:00
Chen Zheng	26f138eed4	[Debug-Info] implement -gstrict-dwarf This patch implements -gstrict-dwarf option in clang FE. Reviewed By: dblaikie, probinson, aprantl Differential Revision: https://reviews.llvm.org/D100809	2021-04-22 00:41:25 -04:00
Giorgis Georgakoudis	a2dbfb6b72	[OpenMP] Simplify offloading parallel call codegen This revision simplifies Clang codegen for parallel regions in OpenMP GPU target offloading and corresponding changes in libomptarget: SPMD/non-SPMD parallel calls are unified under a single `kmpc_parallel_51` runtime entry point for parallel regions (which will be commonized between target, host-side parallel regions), data sharing is internalized to the runtime. Tests have been auto-generated using `update_cc_test_checks.py`. Also, the revision contains changes to OpenMPOpt for remark creation on target offloading regions. Reviewed By: jdoerfert, Meinersbur Differential Revision: https://reviews.llvm.org/D95976	2021-04-21 18:46:07 -07:00
Fangrui Song	77ac823fd2	Delete le32/le64 targets They are unused now. Note: NaCl is still used and is currently expected to be needed until 2022-06 (https://blog.chromium.org/2020/08/changes-to-chrome-app-support-timeline.html). Differential Revision: https://reviews.llvm.org/D100981	2021-04-21 18:44:12 -07:00
Fangrui Song	775a9483e5	[IR][sanitizer] Set nounwind on module ctor/dtor, additionally set uwtable if -fasynchronous-unwind-tables On ELF targets, if a function has uwtable or personality, or does not have nounwind (`needsUnwindTableEntry`), it marks that `.eh_frame` is needed in the module. Then, a function gets `.eh_frame` if `needsUnwindTableEntry` or `-g[123]` is specified. (i.e. If -g[123], every function gets `.eh_frame`. This behavior is strange but that is the status quo on GCC and Clang.) Let's take asan as an example. Other sanitizers are similar. `asan.module_[cd]tor` has no attribute. `needsUnwindTableEntry` returns true, so every function gets `.eh_frame` if `-g[123]` is specified. This is the root cause that `-fno-exceptions -fno-asynchronous-unwind-tables -g` produces .debug_frame while `-fno-exceptions -fno-asynchronous-unwind-tables -g -fsanitize=address` produces .eh_frame. This patch * sets the nounwind attribute on sanitizer module ctor/dtor. * let Clang emit a module flag metadata "uwtable" for -fasynchronous-unwind-tables. If "uwtable" is set, sanitizer module ctor/dtor additionally get the uwtable attribute. The "uwtable" mechanism is generic: synthesized functions not cloned/specialized from existing ones should consider `Function::createWithDefaultAttr` instead of `Function::create` if they want to get some default attributes which have more of module semantics. Other candidates: "frame-pointer" (https://github.com/ClangBuiltLinux/linux/issues/955 https://github.com/ClangBuiltLinux/linux/issues/1238), dso_local, etc. Differential Revision: https://reviews.llvm.org/D100251	2021-04-21 15:58:20 -07:00
Petr Hosek	f749550cfe	[libcxx] Stop using use c++ subdirectory for libc++ library The new layout more closely matches the layout used by other compilers. This is only used when LLVM_ENABLE_PER_TARGET_RUNTIME_DIR is enabled. Differential Revision: https://reviews.llvm.org/D100869	2021-04-21 15:39:03 -07:00
Jonas Devlieghere	6331680ad2	Re-land "[Driver] Support default libc++ library location on Darwin" This reverts commit `05eeed9691` and after fixing the impacted lldb tests in `5d1c43f333`. [Driver] Support default libc++ library location on Darwin Darwin driver currently uses libc++ headers that are part of Clang toolchain when available (by default ../include/c++/v1 relative to executable), but it completely ignores the libc++ library itself because it doesn't pass the location of libc++ library that's part of Clang (by default ../lib relative to the exceutable) to the linker always using the system copy of libc++. This may lead to subtle issues when the compilation fails because the headers that are part of Clang toolchain are incompatible with the system library. Either the driver should ignore both headers as well as the library, or it should always try to use both when available. This patch changes the driver behavior to do the latter which seems more reasonable, it makes it easy to test and use custom libc++ build on Darwin while still allowing the use of system version. This also matches the Clang driver behavior on other systems. Differential Revision: https://reviews.llvm.org/D45639	2021-04-21 14:22:13 -07:00
Yaxun (Sam) Liu	5a2d78b163	[HIP] Add option -fgpu-inline-threshold Add option -fgpu-inline-threshold for inline threshold for device compilation only. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D99233	2021-04-21 17:18:18 -04:00
Alexey Bataev	079884225a	[OPENMP]Fix PR49698: OpenMP declare mapper causes segmentation fault. The implicitly generated mappings for allocation/deallocation in mappers runtime should be mapped as implicit, also no need to clear member_of flag to avoid ref counter increment. Also, the ref counter should not be incremented for the very first element that comes from the mapper function. Differential Revision: https://reviews.llvm.org/D100673	2021-04-21 10:38:31 -07:00
Victor Campos	ee3e01627f	[Clang][ARM] Define __VFP_FP__ macro unconditionally Clang only defines __VFP_FP__ when the FPU is enabled. However, gcc defines it unconditionally. This patch aligns Clang with gcc. Reviewed By: peter.smith, rengolin Differential Revision: https://reviews.llvm.org/D100372	2021-04-21 15:03:59 +01:00
LemonBoy	8e336511f1	[clang] Recognize ppc32 as valid mcpu value The `ppc32` cpu model was introduced a while ago in `a9321059b9` as an independent copy of the `ppc` one but was never wired into clang. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D100933	2021-04-21 15:11:01 +02:00
John Brawn	5f02c308e6	Fix clang/test/Frontend/plugin-annotate-functions.c FileCheck now gives an error when there's a check for an undefined variable, which this test does in one of its NOT checks. Fix this by being a bit looser in what the test checks.	2021-04-21 14:00:38 +01:00
Sylvain Audi	8c16c8b7ef	Reland "[clang-scan-deps] Add support for clang-cl" This reverts commit `199c397482`. This time, clang-scan-deps's search for output argument in clang-cl command line will now ignore arguments preceded by "-Xclang". That way, it won't detect a /o argument in "-Xclang -ivfsoverlay -Xclang /opt/subpath" Initial patch description: clang-scan-deps contains some command line parsing and modifications. This patch adds support for clang-cl command options. Differential Revision: https://reviews.llvm.org/D92191	2021-04-21 07:56:39 -04:00
Sven van Haastregt	e2b3b89bf1	[OpenCL] Do not add builtins with unavailable types Add functionality to assign extensions to types in OpenCLBuiltins.td and use that information to filter candidates that should not be exposed if a type is not available. Differential Revision: https://reviews.llvm.org/D100209	2021-04-21 11:59:29 +01:00
Martin Storsjö	174e796c7d	[llvm-rc] Fix a new test to disambiguate macOS paths like /Users/... from options starting with a slash This should fix test failures on macOS.	2021-04-21 13:34:33 +03:00
Simon Tatham	77e170db86	[ARM][Driver][Windows] Allow command-line upgrade to Armv8. If you gave clang the options `--target=arm-pc-windows-msvc` and `-march=armv8-a+crypto` together, the crypto extension would not be enabled in the compilation, and you'd see the following warning message suggesting that the 'armv8-a' had been ignored: clang: warning: ignoring extension 'crypto' because the 'armv7-a' architecture does not support it [-Winvalid-command-line-argument] This happens because Triple::getARMCPUForArch(), for the Win32 OS, unconditionally returns "cortex-a9" (an Armv7 CPU) regardless of MArch, which overrides the architecture setting on the command line. I don't think that the combination of Windows and AArch32 _should_ unconditionally outlaw the use of the crypto extension. MSVC itself doesn't think so: you can perfectly well compile Thumb crypto code using its AArch32-targeted compiler. All the other default CPUs in the same switch statement are conditional on a particular MArch setting; this is the only one that returns a particular CPU _regardless_ of MArch. So I've fixed this one by adding a condition, so that if you ask for an architecture above v7, the default of Cortex-A9 no longer overrides it. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D100937	2021-04-21 11:20:05 +01:00
Martin Storsjö	066b8f2fc6	[llvm-rc] Try to fix the Preprocessor/llvm-rc.rc test on non arm/x86 architectures When llvm-rc invokes clang for preprocessing, it uses a target triple derived from the default target. The test verifies that e.g. _WIN32 is defined when preprocessing. If running clang with e.g. -target ppc64le-windows-msvc, that particular arch/OS combination isn't hooked up, so _WIN32 doesn't get defined in that configuration. Therefore, the preprocessing test fails. Instead make llvm-rc inspect the architecture of the default target. If it's one of the known supported architectures, use it as such, otherwise set a default one (x86_64). (Clang can run preprocessing with an x86_64 target triple, even if the x86 backend isn't enabled.) Also remove superfluous llvm:: specifications on enums in llvm-rc.cpp.	2021-04-21 12:47:33 +03:00
Martin Storsjö	64bc44f5dd	[llvm-rc] Run clang to preprocess input files Allow opting out from preprocessing with a command line argument. Update tests to pass -no-preprocess to make it not try to use clang (which isn't a build level dependency of llvm-rc), but add a test that does preprocessing under clang/test/Preprocessor. Update a few options to allow them both joined (as -DFOO) and separate (-D BR), as rc.exe allows both forms of them. With the verbose flag set, this prints the preprocessing command used (which differs from what rc.exe does). Tests under llvm/test/tools/llvm-rc only test constructing the preprocessor commands, while tests under clang/test/Preprocessor test actually running the preprocessor. Differential Revision: https://reviews.llvm.org/D100755	2021-04-21 11:50:10 +03:00
ShihPo Hung	11072a0bdb	[RISCV][Clang] Add RVV AMO builtins Add vamo[swap/add/xor/and/or/min/max/minu/maxu] builtins. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D100448	2021-04-21 01:48:02 -07:00
Pushpinder Singh	0ad50bf27f	Revert "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `3194761d27`.	2021-04-21 08:05:38 +00:00
Pushpinder Singh	3194761d27	[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed This patch adds new clang tool named amdgpu-arch which uses HSA to detect installed AMDGPU and report back latter's march. This tool is built only if system has HSA installed. The value printed by amdgpu-arch is used to fill -march when latter is not explicitly provided in -Xopenmp-target. Reviewed By: JonChesterfield, gregrodgers Differential Revision: https://reviews.llvm.org/D99949	2021-04-21 05:05:49 +00:00
Jonas Devlieghere	05eeed9691	Revert "[Driver] Support default libc++ library location on Darwin" This reverts the following commits because it breaks TestAppleSimulatorOSType.py on GreenDragon [1]. `caff17e503` `f5efe0aa04` `ae8b2cab67` [1] http://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/31346/	2021-04-20 20:42:50 -07:00
Liu, Chen3	72e4bf12ee	[X86] Support some missing intrinsics Support for _mm512_i32logather_pd, _mm512_mask_i32logather_pd, _mm512_i32logather_epi64, _mm512_mask_i32logather_epi64, _mm512_i32loscatter_pd, _mm512_mask_i32loscatter_pd, _mm512_i32loscatter_epi64, _mm512_mask_i32loscatter_epi64. Differential Revision: https://reviews.llvm.org/D100368	2021-04-21 10:50:37 +08:00
Dávid Bolvanský	9f1e2ee462	[Clang, builtins] Added aligned_alloc, memalign support	2021-04-21 00:11:54 +02:00
Petr Hosek	caff17e503	[Driver] Don't use capture for InstalledDir This is another attempt to address the issue introduced in `ae8b2cab67`. We cannot capture InstalledDir because FileCheck doesn't handle the backslashes correctly, so instead we just consume the entire path prefix which is what other tests are doing.	2021-04-20 13:43:56 -07:00
Petr Hosek	f5efe0aa04	[Driver] Support both slashes This addresses Windows breakage introduced by `ae8b2cab67`.	2021-04-20 13:25:38 -07:00
Petr Hosek	ae8b2cab67	[Driver] Support default libc++ library location on Darwin Darwin driver currently uses libc++ headers that are part of Clang toolchain when available (by default ../include/c++/v1 relative to executable), but it completely ignores the libc++ library itself because it doesn't pass the location of libc++ library that's part of Clang (by default ../lib relative to the exceutable) to the linker always using the system copy of libc++. This may lead to subtle issues when the compilation fails because the headers that are part of Clang toolchain are incompatible with the system library. Either the driver should ignore both headers as well as the library, or it should always try to use both when available. This patch changes the driver behavior to do the latter which seems more reasonable, it makes it easy to test and use custom libc++ build on Darwin while still allowing the use of system version. This also matches the Clang driver behavior on other systems. Differential Revision: https://reviews.llvm.org/D45639	2021-04-20 12:30:35 -07:00
Ahmed Bougacha	cedb5b06df	[AArch64] Don't always override CPU for arm64e. This demotes the apple-a12 CPU selection for arm64e to just be the last-resort default. Concretely, this means: - an explicitly-specified -mcpu will override the arm64e default; a user could potentially pick an invalid CPU that doesn't have v8.3a support, but that's not a major problem anymore - arm64e-apple-macos (and variants) will pick apple-m1 instead of being forced to apple-a12.	2021-04-20 08:41:04 -07:00
Ahmed Bougacha	a8a3a43792	[AArch64] Add apple-m1 CPU, and default to it for macOS. apple-m1 has the same level of ISA support as apple-a14, so this is a straightforward mechanical change. However, that also means this inherits apple-a14's v8.5a+nobti quirkiness. rdar://68287159	2021-04-20 08:41:04 -07:00
Erich Keane	0ed613612c	Ensure target-multiversioning emits deferred declarations As reported in PR50025, sometimes we would end up not emitting functions needed by inline multiversioned variants. This is because we typically use the 'deferred decl' mechanism to emit these. However, the variants are emitted after that typically happens. This fixes that by ensuring we re-run deferred decls after this happens. Also, the multiversion emission is done recursively to ensure that MV functions that require other MV functions to be emitted get emitted.	2021-04-20 08:10:26 -07:00
Thomas Preud'homme	8cee150e9a	[C++, test] Fix typo in NSS* vars The NSS FileCheck variables at the end of the CodeGenCXX/split-stacks.cpp clang testcase are off by 1, resulting in the use of an undefined variable (NSS3). One of the CHECK-NOT is also redundant because _Z8tnosplitIiEiv uses the same attribute as _Z3foov without split stack. This commit fixes that. Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D99839	2021-04-20 12:07:41 +01:00
Serguei Katkov	70193bdfc0	Re-land [GreedyRA ORE] Add Cost of spill locations into remark Re-land the patch with a fix of clang test. Cost of spill location is computed basing on relative branch frequency where corresponding spill/reload/copy are located. While the number itself is highly depends on incoming IR, the total cost can be used when do some changes in RA. Revert "Revert "[GreedyRA ORE] Add Cost of spill locations into remark"" This reverts commit `680f3d6de7`.	2021-04-20 16:21:07 +07:00
Alexandre Ganea	199c397482	Revert "[clang-scan-deps] Add support for clang-cl" This reverts commit `bb26fa8c28`.	2021-04-19 17:45:18 -04:00
Yaxun (Sam) Liu	d8805574c1	[CUDA][HIP] Allow non-ODR use of host var in device Reviewed by: Artem Belevich, Richard Smith Differential Revision: https://reviews.llvm.org/D98193	2021-04-19 14:45:24 -04:00
Wael Yehia	369c0e0f48	[AIX] Diagnose thinLTO usage in clang on AIX. Reviewed By: Xiangling Liao Differential Revision: https://reviews.llvm.org/D100350	2021-04-19 16:39:48 +00:00
Hans Wennborg	aa1e391402	Fix test/Frontend/print-header-includes.c I managed to run the wrong tests before committing D100759.	2021-04-19 15:39:09 +02:00
Jan Svoboda	0a92e09c07	[clang][deps] Generate the full command-line for modules This patch uses the new `CompilerInvocation::generateCC1CommandLine` to generate the full canonical command line for modular dependencies, instead of only appending additional arguments. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D100534	2021-04-19 14:32:49 +02:00
Timm Bäder	320311a01b	[clang][parser] Unify rejecting (non) decl stmts with gnu attributes The comment here was introduced in `a3e01cf822` and suggests that we should handle declaration statements and non-declaration statements the same, but don't because ProhibitAttributes() can't handle GNU attributes. That has recently changed, so remove the comment and handle all statements the same. Differential Revision: https://reviews.llvm.org/D99936	2021-04-19 12:43:55 +02:00
Jan Svoboda	2b73565210	[clang][deps] Remove the -full-command-line flag This patch removes the `-full-command-line` option from `clang-scan-deps`. It's only used with `-format=experimental-full`, where omitting the command lines doesn't make much sense. There are no tests without `-full-command-line`. Depends on D100531. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D100533	2021-04-19 12:28:02 +02:00
Jan Svoboda	32219c8c44	[clang][deps] Simplify function discovering .pcm and .modulemap files This patch simplifies (and renames) the `appendCommonModuleArguments` function. It no longer tries to construct the command line for explicitly building modules. Instead, it only performs the DFS traversal of modular dependencies and queries the callbacks to collect paths to `.pcm` and `.modulemap` files. This makes it more flexible and usable in two contexts: * Generating additional command line arguments for the main TU in modular build. The `std::vector<std::string>` output parameters can be used to manually generate appropriate command line flags. * Generate full command line for a module. The output parameters can be the corresponding parts of `CompilerInvocation`. (In a follow-up patch.) Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D100531	2021-04-19 12:11:39 +02:00
ShihPo Hung	27edaee84e	[RISCV][Driver] Make the ordering of CmdArgs consistent between RISCV::Linker and baremetal::Linker In baremetal::Linker::ConstructJob, LinkerInput is handled prior to T_Group options, but on the other side in RISCV::Linker::ConstructJob, it is opposite. We want it to be consistent whether users are using RISCV::Linker or baremetal::Linker. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100615	2021-04-18 19:05:20 -07:00
Xun Li	5faba87938	Revert "[Coroutines] Set presplit attribute in Clang instead of CoroEarly pass" This reverts commit `fa6b54c44a`. The commited patch broke mlir tests. It seems that mlir tests depend on coroutine function properties set in CoroEarly pass.	2021-04-18 17:22:28 -07:00
Xun Li	fa6b54c44a	[Coroutines] Set presplit attribute in Clang instead of CoroEarly pass Presplit coroutines cannot be inlined. During AlwaysInliner we check if a function is a presplit coroutine, if so we skip inlining. The presplit coroutine attributes are set in CoroEarly pass. However in O0 pipeline, AlwaysInliner runs before CoroEarly, so the attribute isn't set yet and will still inline the coroutine. This causes Clang to crash: https://bugs.llvm.org/show_bug.cgi?id=49920 To fix this, we set the attributes in the Clang front-end instead of in CoroEarly pass. Reviewed By: rjmccall, ChuanqiXu Differential Revision: https://reviews.llvm.org/D100282	2021-04-18 15:41:09 -07:00
Xun Li	c0211e8d7d	Revert "[Coroutines] Move CoroEarly pass to before AlwaysInliner" This reverts commit `2b50f5a434`. Forgot to update the description of the commit to sync with phabricator. Going to redo the commit.	2021-04-18 15:38:19 -07:00
Xun Li	2b50f5a434	[Coroutines] Move CoroEarly pass to before AlwaysInliner Presplit coroutines cannot be inlined. During AlwaysInliner we check if a function is a presplit coroutine, if so we skip inlining. The presplit coroutine attributes are set in CoroEarly pass. However in O0 pipeline, AlwaysInliner runs before CoroEarly, so the attribute isn't set yet and will still inline the coroutine. This causes Clang to crash: https://bugs.llvm.org/show_bug.cgi?id=49920 Differential Revision: https://reviews.llvm.org/D100282	2021-04-18 14:54:04 -07:00
Sylvain Audi	bb26fa8c28	[clang-scan-deps] Add support for clang-cl clang-scan-deps contains some command line parsing and modifications. This patch adds support for clang-cl command options. Differential Revision: https://reviews.llvm.org/D92191	2021-04-17 14:22:51 -04:00
Sylvain Audi	488a19d00c	[clang-scan-deps] Support double-dashes in clang command lines This fixes argument injection in clang command lines, by adding them before "--". Previously, the arguments were injected at the end of the command line and could be added after "--", which would be wrongly interpreted as input file paths. This fix is needed for a subsequent patch, see D92191. Differential Revision: https://reviews.llvm.org/D95099	2021-04-17 14:22:51 -04:00
Yaxun (Sam) Liu	6823af0ca8	[HIP] Support hipRTC in header hipRTC compiles HIP device code at run time. Since the system may not have development tools installed, when a HIP program is compiled through hipRTC, there is no standard C or C++ header available. As such, the HIP headers should not depend on standard C or C++ headers when used with hipRTC. Basically when hipRTC is used, HIP headers only provides definitions of HIP device API functions. This is in line with what nvRTC does. This patch adds support of hipRTC to HIP headers in clang. Basically hipRTC defines a macro __HIPCC_RTC__ when compile HIP code at run time. When this macro is defined, HIP headers do not include standard C/C++ headers. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D100652	2021-04-17 11:34:52 -04:00
Dávid Bolvanský	12a1f1d9d7	[Pragma] Added support for GCC unroll/nounroll GCC 8 introduced these new pragmas to control loop unrolling. We should support them for compatibility reasons and the implementation itself requires few lines of code, since everything needed is already implemented for #pragma unroll/nounroll.	2021-04-17 17:29:55 +02:00
Yaxun (Sam) Liu	d5c0f00e21	[CUDA][HIP] Mark device var used by host only Add device variables to llvm.compiler.used if they are ODR-used by either host or device functions. This is necessary to prevent them from being eliminated by whole-program optimization where the compiler has no way to know a device variable is used by some host code. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D98814	2021-04-17 11:25:25 -04:00
Yaxun (Sam) Liu	3597f02fd5	[AMDGPU] Add GlobalDCE before internalization pass The internalization pass only internalizes global variables with no users. If the global variable has some dead user, the internalization pass will not internalize it. To be able to internalize global variables with dead users, a global dce pass is needed before the internalization pass. This patch adds that. Reviewed by: Artem Belevich, Matt Arsenault Differential Revision: https://reviews.llvm.org/D98783	2021-04-17 11:25:25 -04:00
Ben Barham	1206b95e07	[ASTReader] Only mark module out of date if not already compiled If a module contains errors (ie. it was built with -fallow-pcm-with-compiler-errors and had errors) and was from the module cache, it is marked as out of date - see `a2c1054c30`. When a module is imported multiple times in the one compile, this caused it to be recompiled each time - removing the existing buffer from the module cache and replacing it. This results in various errors further down the line. Instead, only mark the module as out of date if it isn't already finalized in the module cache. Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D100619	2021-04-16 17:57:03 -07:00
Philip Reames	f549176ad9	[funcattrs] Add the maximal set of implied attributes to definitions Have funcattrs expand all implied attributes into the IR. This expands the infrastructure from D100400, but for definitions not declarations this time. Somewhat subtly, this mostly isn't semantic. Because the accessors did the inference, any client which used the accessor was already getting the stronger result. Clients that directly checked presence of attributes (there are some), will see a stronger result now. The old behavior can end up quite confusing for two reasons: * Without this change, we have situations where function-attrs appears to fail when inferring an attribute (as seen by a human reading IR), but that consuming code will see that it should have been implied. As a human trying to sanity check test results and study IR for optimization possibilities, this is exceeding error prone and confusing. (I'll note that I wasted several hours recently because of this.) * We can have transforms which trigger without the IR appearing (on inspection) to meet the preconditions. This change doesn't prevent this from happening (as the accessors still involve multiple checks), but it should make it less frequent. I'd argue in favor of deleting the extra checks out of the accessors after this lands, but I want that in it's own review as a) it's purely stylistic, and b) I already know there's some disagreement. Once this lands, I'm also going to do a cleanup change which will delete some now redundant duplicate predicates in the inference code, but again, that deserves to be a change of it's own. Differential Revision: https://reviews.llvm.org/D100226	2021-04-16 14:22:19 -07:00
Thomas Lively	5c729750a6	[WebAssembly] Remove saturating fp-to-int target intrinsics Use the target-independent @llvm.fptosi and @llvm.fptoui intrinsics instead. This includes removing the instrinsics for i32x4.trunc_sat_zero_f64x2_{s,u}, which are now represented in IR as a saturating truncation to a v2i32 followed by a concatenation with a zero vector. Differential Revision: https://reviews.llvm.org/D100596	2021-04-16 12:11:20 -07:00
Dávid Bolvanský	0daf273025	[Builtins] Add memory allocation builtins (PR12543)	2021-04-16 20:36:46 +02:00
Artem Belevich	eaa9ef075d	[CUDA, FDO] Filter out profiling options from GPU-side compilations. Differential Revision: https://reviews.llvm.org/D100598	2021-04-16 11:35:28 -07:00
Zakk Chen	8f683366af	[RISCV][Clang] Add RVV miscellaneous intrinsic functions. 1. vreinterpret 2. vundefined 3. LMUL truncation and extension. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100391	2021-04-16 09:41:19 -07:00
Zakk Chen	ca9e52f67c	[RISCV][Clang] Drop the assembly tests for RVV intrinsics. We had verified the correctness of all intrinsics in downstream, so dropping the assembly tests to decrease the check-clang time. It would remove 1/3 of the RUN lines. https://reviews.llvm.org/D99151#2654154 mentions why we need to have the ASM tests before. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100617	2021-04-16 09:30:12 -07:00
Troy Johnson	8628ed0310	[Driver] Allow both lib64 and lib in rocm-detect test. Differential Revision: https://reviews.llvm.org/D100502	2021-04-16 09:55:57 -05:00
Alexey Bataev	10c7b9f64f	[OPENMP]Fix PR49115: Incorrect results for scan directive. For combined worksharing directives need to emit the temp arrays outside of the parallel region and update them in the master thread only. Differential Revision: https://reviews.llvm.org/D100187	2021-04-16 06:25:35 -07:00
Pushpinder Singh	efc013ec4d	Revert "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `7029cffc4e`.	2021-04-16 09:16:58 +00:00
Pushpinder Singh	7029cffc4e	[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed This patch adds new clang tool named amdgpu-arch which uses HSA to detect installed AMDGPU and report back latter's march. This tool is built only if system has HSA installed. The value printed by amdgpu-arch is used to fill -march when latter is not explicitly provided in -Xopenmp-target. Reviewed By: JonChesterfield, gregrodgers Differential Revision: https://reviews.llvm.org/D99949	2021-04-16 05:26:20 +00:00
Richard Smith	f7c9de0de5	Add triple to fix test failure. This test uses `__regcall`, support for which is target-specific.	2021-04-15 18:08:35 -07:00
Joshua Haberman	8344675908	Implemented [[clang::musttail]] attribute for guaranteed tail calls. This is a Clang-only change and depends on the existing "musttail" support already implemented in LLVM. The [[clang::musttail]] attribute goes on a return statement, not a function definition. There are several constraints that the user must follow when using [[clang::musttail]], and these constraints are verified by Sema. Tail calls are supported on regular function calls, calls through a function pointer, member function calls, and even pointer to member. Future work would be to throw a warning if a users tries to pass a pointer or reference to a local variable through a musttail call. Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D99517	2021-04-15 17:12:21 -07:00
Momchil Velikov	f9d932e673	[clang][AArch64] Correctly align HFA arguments when passed on the stack When we pass a AArch64 Homogeneous Floating-Point Aggregate (HFA) argument with increased alignment requirements, for example struct S { __attribute__ ((__aligned__(16))) double v[4]; }; Clang uses `[4 x double]` for the parameter, which is passed on the stack at alignment 8, whereas it should be at alignment 16, following Rule C.4 in AAPCS (https://github.com/ARM-software/abi-aa/blob/master/aapcs64/aapcs64.rst#642parameter-passing-rules) Currently we don't have a way to express in LLVM IR the alignment requirements of the function arguments. The align attribute is applicable to pointers only, and only for some special ways of passing arguments (e..g byval). When implementing AAPCS32/AAPCS64, clang resorts to dubious hacks of coercing to types, which naturally have the needed alignment. We don't have enough types to cover all the cases, though. This patch introduces a new use of the stackalign attribute to control stack slot alignment, when and if an argument is passed in memory. The attribute align is left as an optimizer hint - it still applies to pointer types only and pertains to the content of the pointer, whereas the alignment of the pointer itself is determined by the stackalign attribute. For byval arguments, the stackalign attribute assumes the role, previously perfomed by align, falling back to align if stackalign` is absent. On the clang side, when passing arguments using the "direct" style (cf. `ABIArgInfo::Kind`), now we can optionally specify an alignment, which is emitted as the new `stackalign` attribute. Patch by Momchil Velikov and Lucas Prates. Differential Revision: https://reviews.llvm.org/D98794	2021-04-15 22:58:14 +01:00
Martin Storsjö	8e0f2e89ff	[clang] [AArch64] Fix handling of HFAs passed to Windows variadic functions The documentation says that for variadic functions, all composites are treated similarly, no special handling of HFAs/HVAs, not even for the fixed arguments of a variadic function. Differential Revision: https://reviews.llvm.org/D100467	2021-04-15 22:21:27 +03:00
cchen	e0c2125d1d	[OpenMP] Added codegen for masked directive Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D100514	2021-04-15 12:55:07 -05:00
Melanie Blower	938b863bb5	[clang][patch] Modify diagnostic level from err to warn: anyx86_interrupt_regsave Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D100511	2021-04-15 13:11:33 -04:00
Arthur Eubanks	c8f0a7c215	[NewPM] Cleanup IR printing instrumentation Being lazy with printing the banner seems hard to reason with, we should print it unconditionally first (it could also lead to duplicate banners if we have multiple functions in -filter-print-funcs). The printIR() functions were doing too many things. I separated out the call from PrintPassInstrumentation since we were essentially doing two completely separate things in printIR() from different callers. There were multiple ways to generate the name of some IR. That's all been moved to getIRName(). The printing of the IR name was also inconsistent, now it's always "IR Dump on $foo" where "$foo" is the name. For a function, it's the function name. For a loop, it's what's printed by Loop::print(), which is more detailed. For an SCC, it's the list of functions in parentheses. For a module it's "[module]", to differentiate between a possible SCC with a function called "module". To preserve D74814, we have to check if we're going to print anything at all first. This is unfortunate, but I would consider this a special case that shouldn't be handled in the core logic. Reviewed By: jamieschmeiser Differential Revision: https://reviews.llvm.org/D100231	2021-04-15 09:50:55 -07:00
Mark Johnston	99eca1bd9c	[Driver] Enable kernel address and memory sanitizers on FreeBSD Test Plan: using kernel ASAN and MSAN implementations in FreeBSD Reviewed By: emaste, dim, arichardson Differential Revision: https://reviews.llvm.org/D98286	2021-04-15 17:49:00 +01:00
Aaron Ballman	ad2d6bbb14	Fix potential infinite loop with malformed attribute syntax Double square bracket attribute arguments can be arbitrarily complex, and the attribute argument parsing logic recovers by skipping tokens. As a fallback recovery mechanism, parse recovery stops before reading a semicolon. This could lead to an infinite loop in the attribute list parsing logic.	2021-04-15 10:47:32 -04:00
Matthias Klose	56cb214b38	add test case for ignoring -flto=auto and -flto=jobserver as requested in https://reviews.llvm.org/D99501, test that the two new options are ignored. Reviewed By: tejohnson, fhahn Differential Revision: https://reviews.llvm.org/D100484	2021-04-15 12:19:14 +02:00
Martin Storsjö	ee570e2153	[clang] [test] Share patterns in CodeGen/ms_abi_aarch64.c between cases. NFC. Differential Revision: https://reviews.llvm.org/D100468	2021-04-15 11:02:14 +03:00
Zakk Chen	ea5d33dbc1	[RISCV][Clang] Add vmv and vfmv series intrinsic functions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper, Jim Differential Revision: https://reviews.llvm.org/D100266	2021-04-14 22:22:39 -07:00
Eli Friedman	dc1ab590a0	[Sema] Fold VLA types in compound literals to constant arrays. Similar to variables with an initializer, this is never valid in standard C, so we can safely constant-fold as an extension. I ran into this construct in a couple proprietary codebases. While I'm here, drive-by fix for 090dd647: we should only fold variables with VLA types, not arbitrary variably modified types. Differential Revision: https://reviews.llvm.org/D98363	2021-04-14 17:09:59 -07:00
Philip Reames	dd985551c2	Reapply "[InferAttributes] Materialize all infered attributes for declaration"" and follow on patches. This reverts commit `ab98f2c712` and `98eea392cd`. It includes a fix for the clang test which triggered the revert. I failed to notice this one because there was another AMDGPU llvm test with a similiar name and the exact same text in the error message. Odd. Since only one build bot reported the clang test, I didn't notice that one.	2021-04-14 16:38:07 -07:00
Thomas Lively	6a18cc23ef	[WebAssembly] Codegen for i64x2.extend_{low,high}_i32x4_{s,u} Removes the builtins and intrinsics used to opt in to using these instructions and replaces them with normal ISel patterns now that they are no longer prototypes. Differential Revision: https://reviews.llvm.org/D100402	2021-04-14 13:43:09 -07:00
Sterling Augustine	d2bb3cbbf8	Make test runnable on read-only file systems.	2021-04-14 13:29:51 -07:00
Alex Lorenz	c1554f32e3	[clang][FileManager] Support empty file name in getVirtualFileRef for serialized diagnostics After https://reviews.llvm.org/D90484 libclang is unable to read a serialized diagnostic file which contains a diagnostic which came from a file with an empty filename. The reason being is that the serialized diagnostic reader is creating a virtual file for the "" filename, which now fails after the changes in https://reviews.llvm.org/D90484. This patch restores the previous behavior in getVirtualFileRef by allowing it to construct a file entry ref with an empty name by pretending its name is "." so that the directory entry can be created. Differential Revision: https://reviews.llvm.org/D100428	2021-04-14 11:29:25 -07:00
Thomas Lively	af7925b4dd	[WebAssembly] Codegen for f64x2.convert_low_i32x4_{s,u} Add a custom DAG combine and ISD opcode for detecting patterns like (uint_to_fp (extract_subvector ...)) before the extract_subvector is expanded to ensure that they will ultimately lower to f64x2.convert_low_i32x4_{s,u} instructions. Since these instructions are no longer prototypes and can now be produced via standard IR, this commit also removes the target intrinsics and builtins that had been used to prototype the instructions. Differential Revision: https://reviews.llvm.org/D100425	2021-04-14 10:42:45 -07:00
Thomas Lively	af7ab81ce3	[WebAssembly] Use standard intrinsics for f32x4 and f64x2 ops Now that these instructions are no longer prototypes, we do not need to be careful about keeping them opt-in and can use the standard LLVM infrastructure for them. This commit removes the bespoke intrinsics we were using to represent these operations in favor of the corresponding target-independent intrinsics. The clang builtins are preserved because there is no standard way to easily represent these operations in C/C++. For consistency with the scalar codegen in the Wasm backend, the intrinsic used to represent {f32x4,f64x2}.nearest is @llvm.nearbyint even though @llvm.roundeven better captures the semantics of the underlying Wasm instruction. Replacing our use of @llvm.nearbyint with use of @llvm.roundeven is left to a potential future patch. Differential Revision: https://reviews.llvm.org/D100411	2021-04-14 09:19:27 -07:00
Hans Wennborg	f29dcbdde1	Add flag for showing skipped headers in -H / --show-includes output Consider the following set of files: a.cc: #include "a.h" a.h: #ifndef A_H #define A_H #include "b.h" #include "c.h" // This gets "skipped". #endif b.h: #ifndef B_H #define B_H #include "c.h" #endif c.h: #ifndef C_H #define C_H void c(); #endif And the output of the -H option: $ clang -c -H a.cc . ./a.h .. ./b.h ... ./c.h Note that the include of c.h in a.h is not shown in the output (GCC does the same). This is because of the include guard optimization: clang knows c.h is covered by an include guard which is already defined, so when it sees the include in a.h, it skips it. The same would have happened if #pragma once were used instead of include guards. However, a.h does include c.h, and it may be useful to show that in the -H output. This patch adds a flag for doing that. Differential revision: https://reviews.llvm.org/D100480	2021-04-14 17:01:51 +02:00
Erich Keane	92aba5ae49	CPUDispatch- allow out of line member definitions ICC permits this, and after some extensive testing it looks like we can support this with very little trouble. We intentionally don't choose to do this with attribute-target (despite it likely working as well!) because GCC does not support that, and introducing said incompatibility doesn't seem worth it.	2021-04-14 06:19:49 -07:00
Martin Storsjö	3637c5c8ec	[clang] [AArch64] Fix Windows va_arg handling for larger structs Aggregate types over 16 bytes are passed by reference. Contrary to the x86_64 ABI, smaller structs with an odd (non power of two) are padded and passed in registers. Differential Revision: https://reviews.llvm.org/D100374	2021-04-14 14:51:53 +03:00
Liu, Chen3	1c4108ab66	[i386] Modify the alignment of __m128/__m256/__m512 vector type according i386 abi. According to i386 System V ABI: 1. when __m256 are required to be passed on the stack, the stack pointer must be aligned on a 0 mod 32 byte boundary at the time of the call. 2. when __m512 are required to be passed on the stack, the stack pointer must be aligned on a 0 mod 64 byte boundary at the time of the call. The current method of clang passing __m512 parameter are as follow: 1. when target supports avx512, passing it with 64 byte alignment; 2. when target supports avx, passing it with 32 byte alignment; 3. Otherwise, passing it with 16 byte alignment. Passing __m256 parameter are as follow: 1. when target supports avx or avx512, passing it with 32 byte alignment; 2. Otherwise, passing it with 16 byte alignment. This pach will passing __m128/__m256/__m512 following i386 System V ABI and apply it to Linux only since other System V OS (e.g Darwin, PS4 and FreeBSD) don't want to spend any effort dealing with the ramifications of ABI breaks at present. Differential Revision: https://reviews.llvm.org/D78564	2021-04-14 16:44:54 +08:00
Anton Bikineev	69545154cc	[Sema] Move 'char-expression-as-unsigned < 0' into a separate diagnostic This change splits '-Wtautological-unsigned-zero-compare' by reporting char-expressions-interpreted-as-unsigned under a separate diagnostic '-Wtautological-unsigned-char-zero-compare'. This is beneficial for projects that want to enable '-Wtautological-unsigned-zero-compare' but at the same time want to keep code portable for platforms with char being signed or unsigned, such as Chromium. Differential Revision: https://reviews.llvm.org/D99808	2021-04-14 01:01:40 +02:00
Sander de Smalen	204aaf8795	[AArch64][SVE] Always use overloaded methods instead of preprocessor macro. This fixes a subtle issue where: svprf(pg, ptr, SV_ALL /is sv_pattern instead of sv_prfop/) would be quietly accepted. With this change, the function declaration guards that the third parameter is a `enum sv_prfop`. Previously `svprf` would map directly to `__builtin_sve_svprfb`, which accepts the enum operand as a signed integer and only checks that the incoming range is valid, meaning that SV_ALL would be discarded as being outside the valid immediate range, but would have allowed SV_VL1 without issuing a warning (C) or error (C++). Reviewed By: c-rhodes Differential Revision: https://reviews.llvm.org/D100297	2021-04-13 21:12:53 +01:00
Hana Dusíková	64c24f493e	Remove warning "suggest braces" for aggregate initialization of an empty class with an aggregate base class. I recently ran into issues with aggregates and inheritance, I'm using it for creating a type-safe library where most of the types are build over "tagged" std::array. After bit of cleaning and enabling -Wall -Wextra -pedantic I noticed clang only in my pipeline gives me warning. After a bit of focusing on it I found it's not helpful, and contemplate disabling the warning all together. After a discussion with other library authors I found it's bothering more people and decided to fix it. Removes this warning: template<typename T, int N> struct StdArray { T contents[N]; }; template<typename T, int N> struct AggregateAndEmpty : StdArray<T,N> { }; AggregateAndEmpty<int, 3> p = {1, 2, 3}; // <-- warning here about omitted braces	2021-04-13 15:45:09 -04:00
Aaron Ballman	c058a71227	Correct the tablegen for checking mutually exclusive stmt attrs The previous implementation was insufficient for checking statement attribute mutual exclusion because attributed statements do not collect their attributes one-at-a-time in the same way that declarations do. So the design that was attempting to check for mutual exclusion as each attribute was processed would not ever catch a mutual exclusion in a statement. This was missed due to insufficient test coverage, which has now been added for the [[likely]] and [[unlikely]] attributes. The new approach is to check all of attributes that are to be applied to the attributed statement in a group. This required generating another DiagnoseMutualExclusions() function into AttrParsedAttrImpl.inc.	2021-04-13 15:20:30 -04:00
ThePhD	701d70d4c2	String Literal and Wide String Literal Encoding from the Preprocessor Adds the __clang_literal_encoding__ and __clang_wide_literal_encoding__ predefined macros to expose the encoding used for string literals to the preprocessor.	2021-04-13 14:18:07 -04:00
Aaron Ballman	62328f2f29	Implement WG21 P2156R1/WG14 N2557 on duplicate attributes These proposals make the same changes to both C++ and C and remove a restriction on standard attributes appearing multiple times in the same attribute list. We could warn on the duplicate attributes, but do not. This is for consistency as we do not warn on attributes duplicated within the attribute specifier sequence. If we want to warn on duplicated standard attributes, we should do so both for both situations: [[foo, foo]] and [[foo]][[foo]].	2021-04-13 12:30:04 -04:00
Aaron Ballman	5ad15f4d1c	Require commas between double square bracket attributes. Clang currently has a bug where it allows you to write [[foo bar]] and both attributes are silently accepted. This patch corrects the comma parsing rules for such attributes and handles the test case fallout, as a few tests were accidentally doing this.	2021-04-13 06:43:01 -04:00
Ben Dunbobbin	eae2d4b852	[Windows Itanium][PS4] handle dllimport/export w.r.t vtables/rtti The existing Windows Itanium patches for dllimport/export behaviour w.r.t vtables/rtti can't be adopted for PS4 due to backwards compatibility reasons (see comments on https://reviews.llvm.org/D90299). This commit adds our PS4 scheme for this to Clang. Differential Revision: https://reviews.llvm.org/D93203	2021-04-13 11:41:10 +01:00
Sander de Smalen	fa936b610f	[AArch64][SVE] Fix dup/dupq intrinsics for C++. This patch changes the builtin prototype to use 'b' (boolean) instead of the default integer element type. That fixes the dup/dupq intrinsics when compiling with C++. This patch also fixes one of the defines for __ARM_FEATURE_SVE2_BITPERM. Reviewed By: kmclaughlin Differential Revision: https://reviews.llvm.org/D100294	2021-04-13 10:55:20 +01:00
Alexey Bader	95c614afcd	[NFC][SYCL] Drop idle triple component from regression tests.	2021-04-13 08:00:21 +03:00
Freddy Ye	3fc1fe8db8	[X86] Support -march=rocketlake Reviewed By: skan, craig.topper, MaskRay Differential Revision: https://reviews.llvm.org/D100085	2021-04-13 09:48:13 +08:00

... 9 10 11 12 13 ...

44094 Commits