llvm-project

Commit Graph

Author	SHA1	Message	Date
Christopher Di Bella	0871954197	Revert "Revert "[clang][pp] adds '#pragma include_instead'"" Includes regression test for problem noted by @hans. This reverts commit `973de71856`. Differential Revision: https://reviews.llvm.org/D106898	2021-07-29 19:21:43 +00:00
Anjan Kumar	109954410c	[AIX] Pass the -b option to linker on AIX Parse the -b option in the driver and pass it to the linker if the target OS is AIX. This will establish compatibility with the other AIX compilers. Reviewed By: Zarko Todorovski Differential Revision: https://reviews.llvm.org/D106688	2021-07-29 18:14:41 +00:00
Chris Bieneman	26c695b789	Support macro deprecation #pragma clang deprecated This patch adds `#pragma clang deprecated` to enable deprecation of preprocessor macros. The macro must be defined before `#pragma clang deprecated`. When deprecating a macro a custom message may be optionally provided. Warnings are emitted at the use site of a deprecated macro, and can be controlled via the `-Wdeprecated` warning group. This patch takes some rough inspiration and a few lines of code from https://reviews.llvm.org/D67935. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D106732	2021-07-29 12:40:53 -05:00
Melanie Blower	fd251d903b	[clang][patch] Remove erroneous run line committed in D102343	2021-07-29 12:42:04 -04:00
Melanie Blower	bc5b5ea037	[clang][patch][FPEnv] Make initialization of C++ globals strictfp aware @kpn pointed out that the global variable initialization functions didn't have the "strictfp" metadata set correctly, and @rjmccall said that there was buggy code in SetFPModel and StartFunction. This patch solves those problems. When Sema creates a FunctionDecl, it sets the FunctionDeclBits.UsesFPIntrin to "true" if the lexical FP settings (i.e. a combination of command line options and #pragma float_control settings) correspond to ConstrainedFP mode. That bit is used when CodeGen starts codegen for an LLVM function, and it translates into the "strictfp" function attribute. See bugs.llvm.org/show_bug.cgi?id=44571 Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D102343	2021-07-29 12:02:37 -04:00
Kai Luo	e4902e69e9	[PowerPC] Fix return type of XL compat CAS `__compare_and_swap*` should return `i32` rather than `i1`. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D107077	2021-07-29 14:49:26 +00:00
Jamie Schmeiser	c3c1826c31	Set TargetCPUName for AIX to default to pwr7. Summary: Set the TargetCPUName for AIX to default to pwr7, removing the setting of it based on the major/minor of the OS version, which previously set it to pwr4 for AIX 7.1 and earlier. The old code would also set it to pwr4 when the OS version was not specified and with the change, it will default it to pwr7 in all cases. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By:hubert.reinterpretcast (Hubert Tong) Differential Revision: https://reviews.llvm.org/D107063	2021-07-29 09:59:24 -04:00
Freddy Ye	58712987e5	[NFC][X86] add missing tests in clang/test/CodeGen/attr-target-mv.c Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D106849	2021-07-29 13:28:10 +08:00
Michael Kruse	c6b0b16c0f	[Preprocessor] -E -P: Ensure newline after 8 skipped lines. The implementation of -fminimize-whitespace (D104601) revised the logic when to emit newlines. There was no case to handle when more than 8 lines were skipped in -P (DisableLineMarkers) mode and instead fell through the case intended for -fminimize-whitespace, i.e. emit nothing. This patch will emit one newline in this case. The newline logic is slightly reorganized. The `-P -fminimize-whitespace` case is handled explicitly and emitting at least one newline is the new fallback case. The choice between emitting a line marker or up to 7 empty lines is now a choice only with enabled line markers. The up to 8 newlines likely are fewer characters than a line directive, but in -P mode this had the paradoxical effect that it would print up to 7 empty lines, but none at all if more than 8 lines had to be skipped. Now with DisableLineMarkers, we don't consider printing empty lines (just start a new line) which matches gcc's behavior. The line-directive-output-mincol.c test is replaced with a more comprehensive test skip-empty-lines.c also testing the more than 8 skipped lines behaviour with all flag combinations. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D106924	2021-07-28 22:50:54 -05:00
Matheus Izvekov	87aa31827b	[clang] fix concepts crash on substitution failure during normalization When substitution failed on the first constrained template argument (but only the first), we would assert / crash. Checking for failure was only being performed from the second constraint on. This changes it so the checking is performed in that case, and the code is also now simplified a little bit to hopefully avoid this confusion. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D106907	2021-07-28 23:28:45 +02:00
Michael Benfield	e12e02df09	[clang] Evaluate strlen of strcpy argument for -Wfortify-source. Also introduce Expr::tryEvaluateStrLen. Differential Revision: https://reviews.llvm.org/D104887	2021-07-28 20:52:57 +00:00
Fangrui Song	828767f325	COFF/ELF: Place llvm.global_ctors elements in llvm.used if comdat is used On ELF, an SHT_INIT_ARRAY outside a section group is a GC root. The current codegen abuses SHT_INIT_ARRAY in a section group to mean a GC root. On PE/COFF, the dynamic initialization for `__declspec(selectany)` in a comdat can be garbage collected by `-opt:ref`. Call `addUsedGlobal` for the two cases to fix the abuse/bug. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D106925	2021-07-28 11:44:19 -07:00
Jessica Clarke	0e79a94836	[Utils] Support class template specializations in update_cc_test_checks ClassTemplateSpecializationDecl not within a ClassTemplateDecl represents an explicit instantiation of a template and so should be handled as if it were a normal CXXRecordDecl. Unfortunately, having an equivalent for FunctionTemplateDecl remains a TODO in ASTDumper's VisitFunctionTemplateDecl, with all the explicit instantiations just being emitted inside the FunctionTemplateDecl along with all the other specializations, meaning we can't easily support explicit function instantiations in update_cc_test_checks. Reviewed By: arichardson Differential Revision: https://reviews.llvm.org/D106243	2021-07-28 16:03:41 +01:00
Melanie Blower	66ddac22e2	[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly The Intel compiler ICC supports the option "-fp-model=(source|double|extended)" which causes the compiler to use a wider type for intermediate floating point calculations. Also supported is a way to embed this effect in the source program with #pragma float_control(source|double|extended). This patch extends pragma float_control syntax, and also adds support for a new floating point option "-ffp-eval-method=(source|double|extended)". source: intermediate results use source precision double: intermediate results use double precision extended: intermediate results use extended precision Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D93769	2021-07-28 10:50:32 -04:00
Aaron Ballman	b0ef3d8f66	Allow #pragma float_control(push|pop) within a language linkage specification Currently, we prohibit this pragma from appearing within a language linkage specification, but this is useful functionality that is supported by MSVC (which is where we inherited this feature from). This patch allows you to use the pragma within an extern "C" {} (etc) block.	2021-07-28 07:37:56 -04:00
Jinsong Ji	edbdf8e5b5	[AIX] Update fetch_and_add type It turns out that the AIX kernel is defining int instead of unsigned int for fetch_and_add. Legacy XL also defines this to be signed. https://www.ibm.com/docs/en/aix/7.2?topic=f-fetch-add-kernel-services So update the type for compat. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D106920	2021-07-27 22:13:29 +00:00
Jose M Monsalve Diaz	0276db1416	[OpenMP] Creating the `omp_target_num_teams` and `omp_target_thread_limit` attributes to outlined functions The device runtime contains several calls to __kmpc_get_hardware_num_threads_in_block and __kmpc_get_hardware_num_blocks. If the thread_limit and the num_teams are constant, these calls can be folded to the constant value. In commit D106033 we have the optimization phase. This commit adds the attributes to the outlined function for the grid size. The two attributes are `omp_target_num_teams` and `omp_target_thread_limit`. These values are added as long as they are constant. Two functions are created `getNumThreadsExprForTargetDirective` and `getNumTeamsExprForTargetDirective`. The original functions `emitNumTeamsForTargetDirective` and `emitNumThreadsForTargetDirective` identify the expression and emit the code. However, for the Device version of the outlined function, we cannot emit anything. Therefore, this is a first attempt to separate emission of code from deduction of the values. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106298	2021-07-27 17:21:04 -04:00
Florian Mayer	835ef6f93d	[hwasan] Fix stack safety test for old PM. With the old PM, the stub for __hwasan_generate_tag is still generated in the IR, but never called. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D106858	2021-07-27 20:50:46 +01:00
Melanie Blower	48ad446a0f	[clang][fpenv][patch] Change clang option -ffp-model=precise to select ffp-contract=on Change the ffp-model=precise to enable -ffp-contract=on (previously -ffp-model=precise enabled -ffp-contract=fast). This is a follow-up to Andy Kaylor's comments in the llvm-dev discussion "Floating Point semantic modes". From the same email thread, I put Andy's distillation of floating point options and floating point modes into UsersManual.rst Also fixes bugs.llvm.org/show_bug.cgi?id=50222 I had to revert this a few times because of failures on the x86-64 buildbot but I think we finally have that fixed by LNT/79f2b03c51. Reviewed By: rjmccall, andrew.kaylor Differential Revision: https://reviews.llvm.org/D74436	2021-07-27 13:55:31 -04:00
Thomas Lively	33786576fd	[WebAssembly] Codegen for extmul SIMD instructions Replace the clang builtins and LLVM intrinsics for the SIMD extmul instructions with normal codegen patterns. Differential Revision: https://reviews.llvm.org/D106724	2021-07-27 08:41:30 -07:00
Anastasia Stulova	e5f47eedeb	[OpenCL] NULL redefined as nullptr in C++ mode. Redefines NULL as nullptr instead of ((void*)0) in C++ for OpenCL. Such internal representation of NULL provides compatibility with C++11 and later language standards. Patch by Topotuna (Justas Janickas)! Differential Revision: https://reviews.llvm.org/D105987	2021-07-27 16:33:50 +01:00
Hans Wennborg	973de71856	Revert "[clang][pp] adds '#pragma include_instead'" > `#pragma clang include_instead(<header>)` is a pragma that can be used > by system headers (and only system headers) to indicate to a tool that > the file containing said pragma is an implementation-detail header and > should not be directly included by user code. > > The library alternative is very messy code that can be seen in the first > diff of D106124, and we'd rather avoid that with something more > universal. > > This patch takes the first step by warning a user when they include a > detail header in their code, and suggests alternative headers that the > user should include instead. Future work will involve adding a fixit to > automate the process, as well as cleaning up modules diagnostics to not > suggest said detail headers. Other tools, such as clangd can also take > advantage of this pragma to add the correct user headers. > > Differential Revision: https://reviews.llvm.org/D106394 This caused compiler crashes in Chromium builds involving PCH and an include directive with macro expansion, when Token::getLiteralData() returned null. See the code review for details. This reverts commit `e8a64e5491`.	2021-07-27 17:29:48 +02:00
Nico Weber	452095fe2f	[clang/darwin] Pass libclang_rt.profile last on linker command This reverts the functional change of https://reviews.llvm.org/D35385 because it sounds like this is no longer necessary (https://bugs.llvm.org/show_bug.cgi?id=51135#c11) and makes clang's behavior more uniform across platforms. Differential Revision: https://reviews.llvm.org/D106733	2021-07-27 07:51:06 -04:00
Hans Wennborg	a648f34342	[clang-cl] Expose -fmodules and related flags in the driver (PR43391) I don't know how well this works with clang-cl, but people want to try it out, and I think we want to make it work, so exposing the flags seems reasonable. Differential revision: https://reviews.llvm.org/D106791	2021-07-27 11:27:16 +02:00
Jan Svoboda	11ee699b3c	[clang][tooling] Accept Clang invocations with multiple jobs When `-fno-integrated-as` is passed to the Clang driver (or set by default by a specific toolchain), it will construct an assembler job in addition to the cc1 job. Similarly, the `-fembed-bitcode` driver flag will create an additional cc1 job that reads an LLVM IR file. The Clang tooling library only cares about the job that reads a source file. Instead of relying on the fact that the client injected `-fsyntax-only` to the driver invocation to get a single `-cc1` invocation that reads the source file, this patch filters out such jobs from `Compilation` automatically and ignores the rest. This fixes a test failure in `ClangScanDeps/headerwithname.cpp` and `ClangScanDeps/headerwithnamefollowedbyinclude.cpp` on AIX reported here: https://reviews.llvm.org/D103461#2841918 and `clang-scan-deps` failures with `-fembed-bitcode`. Depends on D106788. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D105695	2021-07-27 10:47:55 +02:00
Amy Huang	1a3bf2953a	[DebugInfo] Switch to using constructor homing (-debug-info-kind=constructor) by default when debug info is enabled Constructor homing reduces the amount of class type info that is emitted by emitting complete type info for a class only when a constructor for that class is emitted. This will mainly reduce the amount of duplicate debug info in object files. In Chrome enabling ctor homing decreased total build directory sizes by about 30%. It's also expected that some class types (such as unused classes) will no longer be emitted in the debug info. This is fine, since we wouldn't expect to need these types when debugging. In some cases (e.g. libc++, https://reviews.llvm.org/D98750), classes are used without calling the constructor. Since this is technically undefined behavior, enabling constructor homing should be fine. However Clang now has an attribute `__attribute__((standalone_debug))` that can be used on classes to ignore ctor homing. Bug: https://bugs.llvm.org/show_bug.cgi?id=46537 Differential Revision: https://reviews.llvm.org/D106084	2021-07-26 17:24:42 -07:00
Albion Fung	18526b0d66	[PowerPC] Changed sema checking range for tdw td builtin To match xlc behaviour and definition in the PowerPC ISA3.1, it is a better idea to have ibm-clang produce an error when a 0 is passed to the builtin, which will match xlc's behaviour. This patch changes the accepted range from 0 to 31 to 1 to 31. Differential revision: https://reviews.llvm.org/D106817	2021-07-26 18:44:33 -05:00
Tom Stellard	c7b3a91017	libclang.so: Make SONAME independent from LLVM version Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D105527	2021-07-26 16:37:26 -07:00
Reid Kleckner	f9f56488e0	[DebugInfo] Use per-enumerator signedness for DIEnumerator Allegedly the DWARF backend ignores this field of DIEnumerator, but we set it nonetheless in case we decide to use it in the future. Alternatively, we could remove it, but it is simpler to pass down the signed bit as it is in the AST for now. Implemented to address comments on D106585	2021-07-26 16:14:28 -07:00
Reid Kleckner	a9b114c5dd	Disable the new enum i128 test under ASan, it uncovers an existing leak See llvm.org/pr51221	2021-07-26 15:48:32 -07:00
Joseph Huber	af000197c4	[OpenMP] Always inline the OpenMP outlined function This patch adds the always inline attribute to the outlined functions generated by OpenMP regions. Because there is only a single instance of this function and it always has internal linkage it is safe to inline in every instance it is created. This could potentially lead to performance degradation due to inflated register counts in the parallel region. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106799	2021-07-26 17:27:59 -04:00
Joseph Huber	d297211692	[OpenMP] Add a driver flag to enable the new device runtime library This patch adds a driver flag `-fopenmp-target-new-runtime` to optionally enable the new device runtime bitcode library. This allows users to enable the new experimental runtime before it becomes the default in the future. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106793	2021-07-26 16:35:56 -04:00
Matheus Izvekov	20555a15a5	[clang] P2266 implicit moves STL workaround This patch replaces the workaround for simpler implicit moves implemented in D105518. The Microsoft STL currently has some issues with P2266. Where before, with -fms-compatibility, we would disable simpler implicit moves globally, with this change, we disable it only when the returned expression is in a context contained by std namespace and is located within a system header. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: aaron.ballman, mibintc Differential Revision: https://reviews.llvm.org/D105951	2021-07-26 22:21:31 +02:00
Reid Kleckner	3230493299	Fix clang debug info irgen of i128 enums DIEnumerator stores an APInt as of April 2020, so now we don't need to truncate the enumerator value to 64 bits. Fixes assertions during IRGen. Split from D105320, thanks to Matheus Izvekov for the test case and report. Differential Revision: https://reviews.llvm.org/D106585	2021-07-26 12:25:29 -07:00
Eli Friedman	0fb16d5ad1	Fix clang regression test after `5c486ce0`	2021-07-26 11:59:40 -07:00
Nemanja Ivanovic	1c50a5da36	[PowerPC] Implement partial vector ld/st builtins for XL compatibility XL provides functions __vec_ldrmb/__vec_strmb for loading/storing a sequence of 1 to 16 bytes in big endian order, right justified in the vector register (regardless of target endianness). This is equivalent to vec_xl_len_r/vec_xst_len_r which are only available on Power9. This patch simply uses the Power9 functions when compiled for Power9, but provides a more general implementation for Power8. Differential revision: https://reviews.llvm.org/D106757	2021-07-26 13:19:52 -05:00
Qiu Chaofan	240dde9482	[PowerPC] Change altivec indexed load/store builtins argument type This patch changes the index argument of lvxl?/lve[bhw]x and stvxl?/stve[bhw]x builtins from int to long, because on 64-bit subtargets an extra extsw will always be generated, which is incorrect. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D106530	2021-07-27 00:26:50 +08:00
Christopher Di Bella	e8a64e5491	[clang][pp] adds '#pragma include_instead' `#pragma clang include_instead(<header>)` is a pragma that can be used by system headers (and only system headers) to indicate to a tool that the file containing said pragma is an implementation-detail header and should not be directly included by user code. The library alternative is very messy code that can be seen in the first diff of D106124, and we'd rather avoid that with something more universal. This patch takes the first step by warning a user when they include a detail header in their code, and suggests alternative headers that the user should include instead. Future work will involve adding a fixit to automate the process, as well as cleaning up modules diagnostics to not suggest said detail headers. Other tools, such as clangd can also take advantage of this pragma to add the correct user headers. Differential Revision: https://reviews.llvm.org/D106394	2021-07-26 16:07:45 +00:00
Shilei Tian	3274cdc83e	[Clang][OpenMP] Remove the mandatory flush for capture for OpenMP 5.1 In OpenMP 5.1: > If the `write` or `update` clause is specified, the atomic operation is not an atomic conditional update for which the comparison fails, and the effective memory ordering is `release`, `acq_rel`, or `seq_cst`, the strong flush on entry to the atomic operation is also a release flush. If the `read` or `update` clause is specified and the effective memory ordering is `acquire`, `acq_rel`, or `seq_cst` then the strong flush on exit from the atomic operation is also an acquire flush. In OpenMP 5.0: > If the `write`, `update`, or `capture` clause is specified and the `release`, `acq_rel`, or `seq_cst` clause is specified then the strong flush on entry to the atomic operation is also a release flush. If the `read` or `capture` clause is specified and the `acquire`, `acq_rel`, or `seq_cst` clause is specified then the strong flush on exit from the atomic operation is also an acquire flush. From my understanding, in OpenMP 5.1, `capture` is removed from the requirement for flush, therefore we don't have to enforce it. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D100768	2021-07-26 11:00:44 -04:00
Ulrich Weigand	8cd8120a7b	[SystemZ] Add support for new cpu architecture - arch14 This patch adds support for the next-generation arch14 CPU architecture to the SystemZ backend. This includes: - Basic support for the new processor and its features. - Detection of arch14 as host processor. - Assembler/disassembler support for new instructions. - New LLVM intrinsics for certain new instructions. - Support for low-level builtins mapped to new LLVM intrinsics. - New high-level intrinsics in vecintrin.h. - Indicate support by defining __VEC__ == 10304. Note: No currently available Z system supports the arch14 architecture. Once new systems become available, the official system name will be added as supported -march name.	2021-07-26 16:57:28 +02:00
Anastasia Stulova	81600160b3	[OpenCL] Change default standard version to CL1.2 Set default version for OpenCL C to 1.2. This means that the absence of any standard flag will be equivalent to passing '-cl-std=CL1.2'. Note that this patch also fixes incorrect version check for the pointer to pointer kernel arguments diagnostic and atomic test. Differential Revision: https://reviews.llvm.org/D106504	2021-07-26 15:04:34 +01:00
Michael Kruse	ae6b400002	[Preprocessor] Implement -fminimize-whitespace. This patch adds the -fminimize-whitespace with the following effects: * If combined with -E, remove as much non-line-breaking whitespace as possible. * If combined with -E -P, removes as much whitespace as possible, including line-breaks. The motivation is to reduce the amount of insignificant changes in the preprocessed output with source files where only whitespace has been changed (add/remove comments, clang-format, etc.) which is in particular useful with ccache. A patch for ccache for using this flag has been proposed to ccache as well: https://github.com/ccache/ccache/pull/815, which will use -fnormalize-whitespace when clang-13 has been detected, and additionally uses -P in "unify_mode". ccache already had a unify_mode in an older version which was removed because of problems that using the preprocessor itself does not have (such that the custom tokenizer did not recognize C++11 raw strings). This patch slightly reorganizes which part is responsible for adding newlines that are required for semantics. It is now either startNewLineIfNeeded() or MoveToLine() but never both; this avoids the ShouldUpdateCurrentLine workaround and avoids redundant lines being inserted in some cases. It also fixes a mandatory newline not inserted after a _Pragma("...") that is expanded into a #pragma. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D104601	2021-07-25 23:30:57 -05:00
Melanie Blower	05ae303555	[clang][patch] Remove test artifact before running test for consistent results Fix non-deterministic test behavior by removing previously-created test directory, see comments in D95159	2021-07-24 07:55:10 -04:00
Thomas Lively	85157c0079	[WebAssembly] Codegen for pmin and pmax Replace the clang builtins and LLVM intrinsics for {f32x4,f64x2}.{pmin,pmax} with standard codegen patterns. Since wasm_simd128.h uses an integer vector as the standard single vector type, the IR for the pmin and pmax intrinsic functions contains bitcasts that would not be there otherwise. Add extra codegen patterns that can still select the pmin and pmax instructions in the presence of these bitcasts. Differential Revision: https://reviews.llvm.org/D106612	2021-07-23 14:49:21 -07:00
Yaxun (Sam) Liu	44dbbe6106	[HIP] Preserve ASAN bitcode library functions Address sanitizer passes may generate calls to ASAN bitcode library functions after bitcode linking in lld, therefore lld cannot add those symbols since it does not know they will be used later. To solve this issue, clang emits a reference to a bitcode library function which calls all ASAN functions which need to be preserved. This basically forces all ASAN functions to be linked in. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D106315	2021-07-23 10:35:52 -04:00
Yaxun (Sam) Liu	9a977daaf6	Fix __hip_fatbin visibility In -fgpu-rdc case, fat binary is embedded as global variable __hip_fatbin. It needs to have protected visibility to avoid conflict between shared libraries. Reviewed by: Siu Chi Chan Differential Revision: https://reviews.llvm.org/D106571 Fixes: SWDEV-292290	2021-07-23 10:14:29 -04:00
Gabor Marton	44fa31fa6d	[Analyzer][solver] Fix inconsistent equivalence class data https://bugs.llvm.org/show_bug.cgi?id=51109 When we merged two classes, `this` became an obsolete representation of the new `State`. This is b/c the member relations had changed during the previous merge of another member of the same class in a way that `this` had no longer any members. (`mergeImpl` might keep the member relations to `Other` and could dissolve `*this`.) Differential Revision: https://reviews.llvm.org/D106285	2021-07-23 14:25:32 +02:00
Anastasia Stulova	5c63bf3abd	[OpenCL] Add NULL to standards prior to v2.0. NULL was undefined in OpenCL prior to version 2.0. However, the language specification states that "macro names defined by the C99 specification but not currently supported by OpenCL are reserved for future use". Therefore, application developers cannot redefine NULL. The change is supposed to resolve inconsistency between language versions. Currently there is no apparent reason why NULL should be kept undefined. Patch by Topotuna (Justas Janickas)! Differential Revision: https://reviews.llvm.org/D105988	2021-07-23 11:54:36 +01:00
Sven van Haastregt	989bedec7a	[OpenCL] Add cl_khr_integer_dot_product Add the builtins defined by Section 42 "Integer dot product" in the OpenCL Extension Specification. Differential Revision: https://reviews.llvm.org/D106434	2021-07-23 10:10:16 +01:00
namazso	91bc85b1eb	[MS] Preserve base register %esi around movs[bwl] fix for behavior reported in https://bugs.llvm.org/show_bug.cgi?id=51100 workaround for root cause https://bugs.llvm.org/show_bug.cgi?id=16830 similar to https://reviews.llvm.org/D101338 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D106210	2021-07-23 16:28:32 +08:00
Kai Luo	e4ed93cb25	[PowerPC] Implement XL compatible behavior of __compare_and_swap According to https://www.ibm.com/docs/en/xl-c-and-cpp-aix/16.1?topic=functions-compare-swap-compare-swaplp XL's `__compare_and_swap` has a weird behavior that > In either case, the contents of the memory location specified by addr are copied into the memory location specified by old_val_addr. (unlike c11 `atomic_compare_exchange` specified in http://www.open-std.org/jtc1/sc22/wg14/www/docs/n1548.pdf) This patch lets clang's implementation follow this behavior. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D106344	2021-07-23 01:16:02 +00:00
Thomas Lively	481084f669	[WebAssembly][NFC] Update test expectations labels after `db7efcab7d` Commit `db7efcab7d` changed the implementations of the wasm_*_extract_lane and wasm_*_replace_lane intrinsics from using builtin functions to using the standard vector extensions. This did not change the resulting IR, but it changes how update_cc_test_checks.py labels values in the IR. This commit simply updates those labels. Differential Revision: https://reviews.llvm.org/D106611	2021-07-22 16:31:12 -07:00
Florian Mayer	96c63492cb	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-22 16:20:27 -07:00
Amy Huang	3e2ad26b08	[DebugInfo] Add -fno-ctor-homing as a counterpart to -fuse-ctor-homing Add an opt out flag for constructor homing. Differential Revision: https://reviews.llvm.org/D106582	2021-07-22 14:52:36 -07:00
David Blaikie	83225936af	PR51158: Don't emit -Wswitch or -Wcovered-switch-default for empty enums An empty enum is used to implement C++'s new-ish "byte" type (to make sure it's a separate type for overloading, etc - compared to a typedef) - without any enumerators. Some clang warnings don't make sense in this sort of situation, so let's skip them for empty enums. It's arguable that possibly some situations of enumerations without enumerators might want the previous-to-this-patch behavior (if the enum is autogenerated and in some cases comes up empty, then maybe a default in an empty switch would still be considered problematic - so that when you add the first enumeration you do get a -Wswitch warning). But I think that's niche enough & this std::byte case is mainstream enough that we should prioritize the latter over the former. If someone's got a middle ground proposal to account for both of those situations, I'm open to patches/suggestions/etc.	2021-07-22 14:51:56 -07:00
Paulo Matos	46667a1003	[WebAssembly] Implementation of global.get/set for reftypes in LLVM IR Reland of `31859f896`. This change implements new DAG nodes GLOBAL_GET/GLOBAL_SET, and lowering methods for load and stores of reference types from IR globals. Once the lowering creates the new nodes, tablegen pattern matches those and converts them to Wasm global.get/set. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D104797	2021-07-22 22:07:24 +02:00
Jake Egan	1b52e9bac2	[AIX] Define __LONGDOUBLE64 macro This patch defines the macro __LONGDOUBLE64 for AIX when long double is 8 bytes. Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D105477	2021-07-22 16:05:14 -04:00
Anjan Kumar Guttahalli Krishna	7d669e6666	[AIX] Generate large code model relocations when mcmodel=medium on AIX This patch makes the changes in the driver that converts the medium code model to large. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D106371	2021-07-22 15:47:22 -04:00
Anjan Kumar Guttahalli Krishna	f719dff043	[AIX] Clang's library integration support for 128-bit long double is incomplete on AIX. Emit the unsupported option error until the Clang's library integration support for 128-bit long double is available for AIX. Reviewed By: Whitney, cebowleratibm Differential Revision: https://reviews.llvm.org/D106074	2021-07-22 15:32:48 -04:00
Aaron Ballman	178c2b4c1e	Correctly diagnose taking the address of a register variable in C We caught the cases where the user would explicitly use the & operator, but we were missing implicit conversions such as array decay. Fixes PR26336. Thanks to Samuel Neves for inspiration for the patch.	2021-07-22 14:53:23 -04:00
Alex Lorenz	40d2d0c412	[clang][test] Add -fuse-ld= to test case added in `2542c1a5a1` to resolve test failure with CLANG_DEFAULT_LINKER=lld	2021-07-22 11:12:38 -07:00
Alex Lorenz	2542c1a5a1	[clang][driver][darwin] Add driver support for Mac Catalyst This commit adds driver support for the Mac Catalyst target, as supported by the Apple clang compiler. Differential Revision: https://reviews.llvm.org/D105960	2021-07-22 10:20:19 -07:00
Victor Huang	26ea4a4432	[PowerPC] Add PowerPC "__stbcx" builtin and intrinsic for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtin and intrinsic for "__stbcx". Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D106484	2021-07-22 10:48:46 -05:00
Anastasia Stulova	b510e0127d	[OpenCL][NFC] Refactors lang version check in test. Fixed test to use the predefined version macro instead of passing an extra macro on the command line. Patch by Topotuna (Justas Janickas)! Differential Revision: https://reviews.llvm.org/D106254	2021-07-22 16:47:38 +01:00
Alexey Bataev	b88a68c45e	[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments. Added missed arguments in __tgt_target_teams_nowait_mapper/__tgt_target_nowait_mapper runtime functions calls. Differential Revision: https://reviews.llvm.org/D106542	2021-07-22 08:44:37 -07:00
Alexey Bataev	f828f0a90f	Revert "[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments." This reverts commit `b455f7f225` to fix buildbots.	2021-07-22 08:06:29 -07:00
Alexey Bataev	b455f7f225	[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments. Added missed arguments in __tgt_target_teams_nowait_mapper/__tgt_target_nowait_mapper runtime functions calls. Differential Revision: https://reviews.llvm.org/D106542	2021-07-22 07:53:37 -07:00
Melanie Blower	4296d633b0	Revert "[clang][fpenv][patch] Change clang option -ffp-model=precise to select ffp-contract=on" This reverts commit `b9b696bba6`. Buildbot failures see https://lab.llvm.org/buildbot#builders/118/builds/4138 and https://lab.llvm.org/buildbot#builders/110/builds/5112	2021-07-22 09:40:54 -04:00
Aaron Ballman	6bb042e700	Implement _ExtInt conversion rules Clang implemented the _ExtInt datatype as a bit-precise integer type, which was then proposed to WG14. WG14 has accepted the proposal (http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2709.pdf), but Clang requires some additional work as a result. In the original Clang implementation, we elected to disallow implicit conversions involving these types until after WG14 finalized the rules. This patch implements the rules decided by WG14: no integer promotion for bit-precise types, conversions prefer the larger of the two types and in the event of a tie (say _ExtInt(32) and a 32-bit int), the standard type wins. There are more changes still needed to conform to N2709, but those will be handled in follow-up patches.	2021-07-22 09:10:36 -04:00
Melanie Blower	b9b696bba6	[clang][fpenv][patch] Change clang option -ffp-model=precise to select ffp-contract=on Change -ffp-model=precise to enable -ffp-contract=on (previously -ffp-model=precise enabled -ffp-contract=fast). This is a follow-up to Andy Kaylor's comments in the llvm-dev discussion "Floating Point semantic modes". From the same email thread, I put Andy's distillation of floating point options and floating point modes into UsersManual.rst. Also fixes bugs.llvm.org/show_bug.cgi?id=50222 Reviewed By: rjmccall, andrew.kaylor Differential Revision: https://reviews.llvm.org/D74436	2021-07-22 07:59:18 -04:00
Florian Mayer	789a4a2e5c	Revert "[hwasan] Use stack safety analysis." This reverts commit `bde9415fef`.	2021-07-22 12:16:16 +01:00
Florian Mayer	bde9415fef	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-22 12:04:54 +01:00
Jun Ma	599b2f0037	[AArch64][SVE] Handle svbool_t VLST <-> VLAT/GNUT conversion According to https://godbolt.org/z/q5rME1naY and acle, we found that there are different SVE conversion behaviours between clang and gcc. It turns out that llvm does not handle SVE predicates width properly. This patch 1) checks SVE predicates width rightly with svbool_t type. 2) removes warning on svbool_t VLST <-> VLAT/GNUT conversion. 3) disables VLST <-> VLAT/GNUT conversion between SVE vectors and predicates due to different width. Differential Revision: https://reviews.llvm.org/D106333	2021-07-22 13:55:08 +08:00
Hsiangkai Wang	698f288fa1	[Clang][RISCV] Implement vsoxseg and vsuxseg. Differential Revision: https://reviews.llvm.org/D103873	2021-07-22 09:24:41 +08:00
Hsiangkai Wang	915e6dc09c	[Clang][RISCV] Implement vssseg. Differential Revision: https://reviews.llvm.org/D103872	2021-07-22 09:24:40 +08:00
Hsiangkai Wang	d1a401b35b	[Clang][RISCV] Implement vsseg. Differential Revision: https://reviews.llvm.org/D103871	2021-07-22 09:24:39 +08:00
Hsiangkai Wang	e08825b0fc	[Clang][RISCV] Add vloxseg and vluxseg test cases.	2021-07-22 09:24:27 +08:00
Hsiangkai Wang	1c55033ea1	[Clang][RISCV] Implement vloxseg and vluxseg. Differential Revision: https://reviews.llvm.org/D103809	2021-07-22 09:23:47 +08:00
Hsiangkai Wang	a9de8f7a53	[Clang][RISCV] Implement vlsseg. Differential Revision: https://reviews.llvm.org/D103796	2021-07-22 09:23:47 +08:00
Joseph Huber	754eb1c210	[OpenMP] Change `__kmpc_free_shared` to include the paired allocation size This patch changes `__kmpc_free_shared` to take an additional argument corresponding to the associated allocation's size. This makes it easier to implement the allocator in the runtime. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106496	2021-07-21 20:56:21 -04:00
Thomas Lively	8af333cf1a	[WebAssembly] Replace @llvm.wasm.popcnt with @llvm.ctpop.v16i8 Use the standard target-independent intrinsic to take advantage of standard optimizations. Differential Revision: https://reviews.llvm.org/D106506	2021-07-21 16:45:54 -07:00
Thomas Lively	db7efcab7d	[WebAssembly] Remove clang builtins for extract_lane and replace_lane These builtins were added to capture the fact that the underlying Wasm instructions return i32s and implicitly sign or zero extend the extracted lanes in the case of the i8x16 and i16x8 variants. But we do sufficient optimizations during code gen that these low-level details do not need to be exposed to users. This commit replaces the use of the builtins in wasm_simd128.h with normal target-independent vector code. As a result, we can switch the relevant intrinsics to use functions rather than macros and can use more user-friendly return types rather than trying to precisely expose the underlying Wasm types. Note, however, that the generated LLVM IR is no different after this change. Differential Revision: https://reviews.llvm.org/D106500	2021-07-21 16:11:00 -07:00
Christopher Di Bella	9a72580a54	[clang][Sema] removes -Wfree-nonheap-object reference param false positive Taking the address of a reference parameter might be valid, and without CFA, false positives are going to be more trouble than they're worth. Differential Revision: https://reviews.llvm.org/D102728	2021-07-21 21:30:16 +00:00
Alex Lorenz	eb26ba9da8	[clang][darwin] add support for remapping macOS availability to Mac Catalyst availability This commit adds support for clang to remap macOS availability attributes that have introduced, deprecated or obsoleted versions to appropriate Mac Catalyst availability attributes. This mapping is done using the version mapping provided in the macOS SDK, in the SDKSettings.json file. The mappings in the SDKSettings.json file will also be used in the clang driver for the driver Mac Catalyst patch, and they could also be used in the future for other platforms as well. Differential Revision: https://reviews.llvm.org/D105257	2021-07-21 11:32:25 -07:00
Jon Chesterfield	d71062fbda	Revert "[OpenMP][AMDGCN] Initial math headers support" This reverts commit `968899ad9c`.	2021-07-21 17:35:40 +01:00
Thomas Lively	1a57ee1276	[WebAssembly] Codegen for v128.load{32,64}_zero Replace the experimental clang builtins and LLVM intrinsics for these instructions with normal instruction selection patterns. The wasm_simd128.h intrinsics header was already using portable code for the corresponding intrinsics, so now it produces the correct instructions. Differential Revision: https://reviews.llvm.org/D106400	2021-07-21 09:02:12 -07:00
Pushpinder Singh	968899ad9c	[OpenMP][AMDGCN] Initial math headers support With this patch, OpenMP on AMDGCN will use the math functions provided by ROCm ocml library. Linking device code to the ocml will be done in the next patch. Reviewed By: JonChesterfield, jdoerfert, scchan Differential Revision: https://reviews.llvm.org/D104904	2021-07-21 16:15:39 +01:00
Quinn Pham	e002d251dd	[PowerPC] Floating Point Builtins for XL Compat. This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds builtins related to floating point operations Reviewed By: #powerpc, nemanjai, amyk, NeHuang Differential Revision: https://reviews.llvm.org/D103986	2021-07-21 08:33:39 -05:00
Deep Majumder	80068ca623	[analyzer] Fix for faulty namespace test in SmartPtrModelling This patch: - Fixes how the std-namespace test is written in SmartPtrModelling (now accounts for functions with no Decl available) - Adds the smart pointer checker flag check where it was missing Differential Revision: https://reviews.llvm.org/D106296	2021-07-21 18:23:35 +05:30
Sven van Haastregt	724f0e2abb	[OpenCL] Add cl_khr_extended_bit_ops Add the builtins defined by Section 40 "Extended Bit Operations" in the OpenCL Extension Specification. Differential Revision: https://reviews.llvm.org/D106267	2021-07-21 10:01:19 +01:00
Balázs Kéri	90cb5297ad	[clang][analyzer] Improve report of file read at EOF condition (alpha.unix.Stream checker). The checker warns if a stream is read that is already in end-of-file (EOF) state. The commit adds indication of the last location where the EOF flag is set on the stream. Reviewed By: Szelethus Differential Revision: https://reviews.llvm.org/D104925	2021-07-21 08:54:11 +02:00
Hsiangkai Wang	89ce644902	[Clang][RISCV] Add half-precision FP for vle16/vse16. I missed adding half-precision FP types for vle16/vse16 in the previous patches. Added them in this patch. Differential Revision: https://reviews.llvm.org/D106340	2021-07-21 09:55:21 +08:00
Albion Fung	2fd1520247	[PowerPC] Implemented mtmsr, mfspr, mtspr Builtins Implemented builtins for mtmsr, mfspr, mtspr on PowerPC; the patch is intended for XL Compatibility. Differential revision: https://reviews.llvm.org/D106130	2021-07-20 17:51:00 -05:00
Matheus Izvekov	1d68ecafd6	[clang] fix oops: enable implicit moves in MSVC compatibility mode When disabling simpler implicit moves in MSVC compatibility mode as a workaround in D105518, we forgot to make the opposite change and enable regular (P1825) implicit moves in the same mode. As a result, we were not doing any implicit moves at all. OOPS! This fixes it and adds test for this. This is a fix to a temporary workaround, there is ongoing work to replace this, applying the workaround only to system headers and the ::stl namespace. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D106303	2021-07-20 23:32:05 +02:00
Albion Fung	3434ac9e39	[PowerPC] Store, load, move from and to registers related builtins This patch implements store, load, move from and to registers related builtins, as well as the builtin for stfiw. The patch aims to provide feature parity with xlC on AIX. Differential revision: https://reviews.llvm.org/D105946	2021-07-20 15:46:14 -05:00
Melanie Blower	d48ad358b1	Revert "[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly" This reverts commit `ce8024e8ff`. There are a couple buildbot problems	2021-07-20 16:40:55 -04:00
Melanie Blower	ce8024e8ff	[CLANG][PATCH][FPEnv] Add support for option -ffp-eval-method and extend #pragma float_control similarly The Intel compiler ICC supports the option "-fp-model=(source|double|extended)" which causes the compiler to use a wider type for intermediate floating point calculations. Also supported is a way to embed this effect in the source program with #pragma float_control(source|double|extended). This patch extends pragma float_control syntax, and also adds support for a new floating point option "-ffp-eval-method=(source|double|extended)". source: intermediate results use source precision; double: intermediate results use double precision; extended: intermediate results use extended precision. Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D93769	2021-07-20 16:02:09 -04:00
Alex Lorenz	a8262a383b	[clang][darwin] add support for Mac Catalyst availability This commit adds support for Mac Catalyst availability attribute, as supported by the Apple clang compiler. A follow-up commit will provide additional support for inferring Mac Catalyst availability from macOS availability using the mapping in the SDKSettings.json. Differential Revision: https://reviews.llvm.org/D105052	2021-07-20 12:51:57 -07:00
Alex Lorenz	c68f247275	[clang-scan-deps] ignore top-level module dependencies that aren't actually imported Whenever -fmodule-name=top_level_module name is parsed, and clang actually tries to import top_level_module, the headers are imported textually and the module isn't actually built. However, the dependency scanner could still record it as a potential dependency if the module was reimported and thus recorded by the preprocessor callbacks. This change avoids collecting this kind of module as a dependency by verifying that we don't collect top level modules without actual PCM files. Differential Revision: https://reviews.llvm.org/D106100	2021-07-20 11:11:28 -07:00
Victor Huang	1a762f93f8	[PowerPC] Add PowerPC cmpb builtin and emit target independent code for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtin and emits target independent code for __cmpb. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D105194	2021-07-20 13:06:22 -05:00
Fangrui Song	e8bc871ca2	[PowerPC][test] Don't write to srcdir	2021-07-20 10:50:11 -07:00
Fangrui Song	5b899c22f3	[Driver] Detect libstdc++ include paths for native gcc on 32-bit non-Debian Linux Fixes https://bugs.llvm.org/show_bug.cgi?id=50303 Differential Revision: https://reviews.llvm.org/D106119	2021-07-20 09:18:24 -07:00
Quinn Pham	59d2ba2a3d	[PowerPC] Semachecking for XL compat builtin icbt This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds semachecking for an already implemented builtin, `__icbt`. `__icbt` is only valid for Power8 and up. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D105834	2021-07-20 11:05:22 -05:00
Joel E. Denny	5b0a948a81	[UpdateCCTestChecks] Implement --global-hex-value-regex For example, in OpenMP offload codegen tests, global variables like `.offload_maptypes*` are much easier to read in hex. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D104743	2021-07-20 11:23:20 -04:00
Joel E. Denny	2f5b2ea6cd	[UpdateCCTestChecks] Implement --global-value-regex `--check-globals` activates checks for all global values, and `--global-value-regex` filters them. For example, I'd like to use it in OpenMP offload codegen tests to check only global variables like `.offload_maptypes*`. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D104742	2021-07-20 11:23:20 -04:00
Quinn Pham	fd855c24c7	[PowerPC] Restore FastMathFlags of Builder for Vector FDiv Builtins This patch fixes `__builtin_ppc_recipdivf`, `__builtin_ppc_recipdivd`, `__builtin_ppc_rsqrtf`, and `__builtin_ppc_rsqrtd`. FastMathFlags are set to fast immediately before emitting these builtins. Now the flags are restored to their previous values after the builtins are emitted. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D105984	2021-07-20 09:41:00 -05:00
Jamie Schmeiser	9cb00b9ecb	Reland Produce warning for performing pointer arithmetic on a null pointer. Summary: Test and produce warning for subtracting a pointer from null or subtracting null from a pointer. This reland adds the functionality that the warning is no longer reusing an existing warning, it has different wording for C vs C++ to reflect the fact that nullptr-nullptr has defined behaviour in C++, it is suppressed when the warning is triggered by a system header and adds -Wnull-pointer-subtraction to allow the warning to be controlled. -Wextra implies -Wnull-pointer-subtraction. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: efriedma (Eli Friedman), nickdesaulniers (Nick Desaulniers) Differential Revision: https://reviews.llvm.org/D98798	2021-07-20 10:12:20 -04:00
Stefan Pintilie	02cd937945	[PowerPC][Builtins] Added a number of builtins for compatibility with XL. Added a number of different builtins that exist in the XL compiler. Most of these builtins already exist in clang under a different name. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D104386	2021-07-20 08:57:55 -05:00
Jan Svoboda	e564fd93ab	[clang][deps] Avoid minimizing PCH input files This patch avoids minimizing input files that contributed to a PCH or its modules. This prevents the implicit modular build from failing on unexpected file size. Depends on D106146. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D104536	2021-07-20 12:20:10 +02:00
Florian Mayer	5f08219322	Revert "[hwasan] Use stack safety analysis." This reverts commit `e9c63ed10b`.	2021-07-20 10:36:46 +01:00
Florian Mayer	e9c63ed10b	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-20 10:06:35 +01:00
Albion Fung	0d4f63e1b7	Revert "[PowerPC] Extra test case for LDARX" This reverts commit `1d3e77e7a8` as some buildbots seem to be unable to obtain the target powerpc64le-unknown-linux-gnu.	2021-07-19 21:27:02 -05:00
Hsiangkai Wang	0d22dee2ca	[Clang][RISCV] Correct the alignment of stores generated by vlseg/vlsegff. Differential Revision: https://reviews.llvm.org/D106255	2021-07-20 09:29:06 +08:00
Albion Fung	1d3e77e7a8	[PowerPC] Extra test case for LDARX An extra test case added for the builtin __LDARX. Differential revision: https://reviews.llvm.org/D105926	2021-07-19 20:03:45 -05:00
Quinn Pham	0268e123be	[PowerPC] swdiv_nochk Builtins for XL Compat This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds software divide builtins with no checking. These builtins are each emitted as a fast fdiv. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D106150	2021-07-19 16:51:10 -05:00
Haowei Wu	6103fdfab4	[ifs][elfabi] Merge llvm-ifs/elfabi tools This change merges llvm-elfabi and llvm-ifs tools. Differential Revision: https://reviews.llvm.org/D100139	2021-07-19 11:23:19 -07:00
Haowei Wu	61fa9afe4c	[ifs] Prepare llvm-ifs for elfabi/ifs merging. This diff changes llvm-ifs to use unified IFS file format and perform other renaming changes in preparation for the merging between elfabi/ifs. Differential Revision: https://reviews.llvm.org/D99810	2021-07-19 11:23:00 -07:00
Amy Kwan	356300a351	[NFC][PowerPC] Update builtins-ppc-altivec.c to be run under `-faltivec-src-compat=mixed` This patch adds the `-faltivec-src-compat=mixed` option to the `builtins-ppc-altivec.c` test. Currently, the default for `-faltivec-src-compat` is `mixed`. The reason we explicitly specify `mixed` to the RUN lines of this test is because eventually, the default will set to `xl`. Having the default as `xl` changes the CHECKs of this test slightly, as it reorders some of the `vector bool` and `vector pixel` CHECKs (since under the `xl` option, `vector bool` and `vector pixel` are treated in the same way as other vector scalars). Explicitly specifying `mixed` ensures that we are testing pre-existing Clang behaviour. Differential Revision: https://reviews.llvm.org/D106282	2021-07-19 11:20:21 -05:00
Hsiangkai Wang	77bb82d068	[Clang][RISCV] Support half-precision floating point for RVV intrinsics. Use _Float16 as the half-precision floating point type. Define a new type specifier 'x' for the _Float16 type. Differential Revision: https://reviews.llvm.org/D105001	2021-07-19 23:17:01 +08:00
Giorgis Georgakoudis	fb0cf01795	Revert "[OpenMP] Codegen aggregate for outlined function captures" This reverts commit `e9c7291cb2`. Fix failing tests	2021-07-19 07:54:26 -07:00
Amy Kwan	dd5aa657a5	[PowerPC] Implement vector bool/pixel initialization under -faltivec-src-compat=xl This patch implements the initialization of vectors under the -faltivec-src-compat=xl option introduced in https://reviews.llvm.org/D103615. Under this option, the initialization of scalar vectors, vector bool, and vector pixel are treated the same, where the initialization value is splatted across the whole vector. This patch does not change the behaviour of the -faltivec-src-compat=mixed option, which is the current default for Clang. Differential Revision: https://reviews.llvm.org/D106120	2021-07-19 09:10:06 -05:00
Jamie Schmeiser	73840f9f81	thread_local support for AIX Summary: The AIX linker will produce errors on unresolved weak symbols. Change the generated code to not check for the initialization function but just call it and ensure that it always exists. Also, the AIX atexit routine has a different name (and signature) so call it correctly. Update the lit tests to test on AIX appropriately. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: hubert.reinterpretcast (Hubert Tong) Differential Revision: https://reviews.llvm.org/D104420	2021-07-19 10:03:22 -04:00
Florian Mayer	807d50100c	Revert "[hwasan] Use stack safety analysis." This reverts commit `12268fe14a`.	2021-07-19 12:08:32 +01:00
Florian Mayer	12268fe14a	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-19 11:54:44 +01:00
Deep Majumder	d825309352	[analyzer] Handle std::make_unique Differential Revision: https://reviews.llvm.org/D103750	2021-07-18 19:54:28 +05:30
Deep Majumder	0cd98bef1b	[analyzer] Handle std::swap for std::unique_ptr This patch handles the `std::swap` function specialization for `std::unique_ptr`. Implemented to be very similar to how `swap` method is handled Differential Revision: https://reviews.llvm.org/D104300	2021-07-18 14:38:55 +05:30
David Blaikie	dac582ad3a	DebugInfo: Name class templates with default arguments consistently (both direct naming, and as a template argument for a function template) It's noteworthy that GCC has the same bug here, which is a bit surprising. Both Clang and GCC's bug is only for function template arguments that are themselves templates with default template arguments (f1<t1<int[, missing_default_here]>>). Probably because function name matching isn't generally necessary - whereas type matching is necessary for DWARF consumers to associate declarations and definitions across translation units, so the bug's been addressed there already - but continued to exist for function templates since it's fairly benign there. I came across this while working on a change that could reconstitute these pretty printed names based on the rest of the DWARF, reducing the size of the DWARF by not having to encode all the template parameters in the name string. That reconstitution code can't tell the difference between a defaulted argument or not, so couldn't create the current buggy-ish output. Making the names more consistent between direct and indirect references, and between function and class templates seems all to the good. (I fixed the function template version of this a few years back in `9fdd09a4cc` - clearly I should've looked more closely and generalized the code better so it only had to be fixed once - well, doing that here now)	2021-07-17 23:58:15 -07:00
Nikita Popov	be5af50e7d	[BPF] Use elementtype attribute for preserve.array/struct.index intrinsics Use the elementtype attribute introduced in D105407 for the llvm.preserve.array/struct.index intrinsics. It carries the element type of the GEP these intrinsics effectively encode. This patch: * Adds a verifier check that the attribute is required. * Adds it in the IRBuilder methods for these intrinsics. * Autoupgrades old bitcode without the attribute. * Updates the lowering code to use the attribute rather than the pointer element type. * Updates lots of tests to specify the attribute. * Adds -force-opaque-pointers to the intrinsic-array.ll test to demonstrate they work now. https://reviews.llvm.org/D106184	2021-07-17 11:09:18 +02:00
Giorgis Georgakoudis	e9c7291cb2	[OpenMP] Codegen aggregate for outlined function captures Parallel regions are outlined as functions with capture variables explicitly generated as distinct parameters in the function's argument list. That complicates the fork_call interface in the OpenMP runtime: (1) the fork_call is variadic since there is a variable number of arguments to forward to the outlined function, (2) wrapping/unwrapping arguments happens in the OpenMP runtime, which is sub-optimal, has been a source of ABI bugs, and has a hardcoded limit (16) in the number of arguments, (3) forwarded arguments must cast to pointer types, which complicates debugging. This patch avoids those issues by aggregating captured arguments in a struct to pass to the fork_call. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102107	2021-07-16 23:27:44 -07:00
Hongtao Yu	77aec978a9	[CSSPGO] Turn on unique linkage name by default for pseudo probe. Turning on -funique-internal-linkage-names when -fpseudo-probe-for-profiling is on, unless -fno-unique-internal-linkage-names is specified. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D106193	2021-07-16 16:43:23 -07:00
Nemanja Ivanovic	35a18a981f	[PowerPC] Implement intrinsics for mtfsf[i] This provides intrinsics for emitting instructions that set the FPSCR (`mtfsf/mtfsfi`). The patch also conservatively marks the rounding mode as an implicit def for both since they both may set the rounding mode depending on the operands. Reviewed By: #powerpc, qiucf Differential Revision: https://reviews.llvm.org/D105957	2021-07-16 16:26:11 -05:00
Lei Huang	c8937b6cb9	[PowerPC] Implement XL compact math builtins Implement a subset of builtins required for compatibility with AIX XL compiler. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D105930	2021-07-16 13:21:13 -05:00
Joseph Huber	2c31d5ebfb	[OpenMP] Add IDs to OpenMP remarks This patch adds unique identifiers to the existing OpenMP remarks. This makes it easier to identify the corresponding documentation for each remark that will be hosted in the OpenMP webpage. Depends on D105898 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D105939	2021-07-16 14:07:03 -04:00
Joseph Huber	eef6601b0f	[OpenMP] Rework OpenMP remarks This patch rewrites and reworks a few of the existing remarks to make them more concise and consistent prior to writing the documentation for them. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D105898	2021-07-16 14:07:00 -04:00
Stefan Pintilie	0bf4b81d57	[Clang] Add an empty builtins.h file. On Power PC some legacy compilers included a number of builtins in a builtins.h header file. While this header file is not required to hold builtins for clang some legacy code does try to include this file and so this patch provides an empty version of that file. Differential Revision: https://reviews.llvm.org/D106065	2021-07-16 12:50:04 -05:00
serge-sans-paille	8ada884cbc	SubstTemplateTypeParmType can contain an 'auto' type in their replacement type This fixes bug 36064 Differential Revision: https://reviews.llvm.org/D106093	2021-07-16 14:35:55 +02:00
Zarko Todorovski	66225db98d	[PowerPC][AIX] Add warning when alignment is incompatible with XL https://reviews.llvm.org/D105659 implements ByVal handling in llc but some cases are not compatible with existing XL compiler on AIX. Adding a clang warning for such cases. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D105660	2021-07-16 07:52:47 -04:00
Vince Bridgers	918bda1241	[analyzer] Do not assume that all pointers have the same bitwidth as void* This change addresses this assertion that occurs in a downstream compiler with a custom target. ```APInt.h:1151: bool llvm::APInt::operator==(const llvm::APInt &) const: Assertion `BitWidth == RHS.BitWidth && "Comparison requires equal bit widths"'``` No covering test case is submitted with this change since this crash cannot be reproduced using any upstream supported target. The test case that exposes this issue is as simple as: ```lang=c++ void test(int * p) { int * q = p-1; if (q) {} if (q) {} // crash (void)q; } ``` The custom target that exposes this problem supports two address spaces, 16-bit `char`s, and a `_Bool` type that maps to 16-bits. There are no upstream supported targets with similar attributes. The assertion appears to be happening as a result of evaluating the `SymIntExpr` `(reg_$0<int * p>) != 0U` in `VisitSymIntExpr` located in `SimpleSValBuilder.cpp`. The `LHS` is evaluated to `32b` and the `RHS` is evaluated to `16b`. This eventually leads to the assertion in `APInt.h`. While this change addresses the crash and passes LITs, two follow-ups are required: 1) The remainder of `getZeroWithPtrWidth()` and `getIntWithPtrWidth()` should be cleaned up following this model to prevent future confusion. 2) We're not sure why references are found along with the modified code path, that should not be the case. A more principled fix may be found after some further comprehension of why this is the case. Acks: Thanks to @steakhal and @martong for the discussions leading to this fix. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D105974	2021-07-16 03:22:57 -05:00
Deep Majumder	13fe78212f	[analyzer] Handle << operator for std::unique_ptr This patch handles the `<<` operator defined for `std::unique_ptr` in the std namespace (ignores custom overloads of the operator). Differential Revision: https://reviews.llvm.org/D105421	2021-07-16 12:34:30 +05:30
Deep Majumder	48688257c5	[analyzer] Model comparison methods of std::unique_ptr This patch handles all the comparison methods (defined via overloaded operators) on std::unique_ptr. These operators compare the underlying pointers, which is modelled by comparing the corresponding inner-pointer SVal. There is also a special case for comparing the same pointer. Differential Revision: https://reviews.llvm.org/D104616	2021-07-16 09:54:05 +05:30
Victor Huang	4eb107ccba	[PowerPC] Add PowerPC population count, reversed load and store related builtins and intrinsics for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtins and intrinsics for population count, reversed load and store related operations. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D106021	2021-07-15 17:23:56 -05:00
Victor Huang	803cf7ac0c	[PowerPC][NFC] Add the missing 'REQUIRES: powerpc-registered-target.' in the builtins' front end test cases for XL compatibility	2021-07-15 16:09:45 -05:00
Harald van Dijk	66ab8568c4	[Driver] Fix compiler-rt lookup for x32 x86_64-linux-gnu and x86_64-linux-gnux32 use different ABIs and objects built for one cannot be used for the other. In order to build and use compiler-rt for x32, we need to treat x32 as a new arch there. This updates the driver to search using the new arch name. Reviewed By: glaubitz Differential Revision: https://reviews.llvm.org/D100148	2021-07-15 20:52:25 +01:00
Artem Belevich	d774b4aa5e	[NVPTX, CUDA] Add .and.popc variant of the b1 MMA instruction. That should allow clang to compile mma.h from CUDA-11.3. Differential Revision: https://reviews.llvm.org/D105384	2021-07-15 12:02:09 -07:00
Quinn Pham	de3956605a	[PowerPC] Fix popcntb XL Compat Builtin for 32bit This patch implements the `__popcntb` XL compatibility builtin for 32bit in the frontend and backend. This patch also updates tests for `__popcntb` and other XL Compat sync related builtins. Reviewed By: #powerpc, nemanjai, amyk Differential Revision: https://reviews.llvm.org/D105360	2021-07-15 13:19:47 -05:00
Victor Huang	d40e8091bd	[PowerPC] Add PowerPC rotate related builtins and emit target independent code for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtins and emits target independent code for rotate related operations. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D104744	2021-07-15 10:23:54 -05:00
Anton Zabaznov	05eb59e1d0	[OpenCL] Add support of __opencl_c_program_scope_global_variables feature macro Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D103191	2021-07-15 17:21:19 +03:00
Tim Northover	f24335c69e	MachO: fix Clang test broken by dropping private labels in LLVM. LLVM changed to not emit L... labels for things marked "do_not_dead_strip" because the linker can sometimes drop the flag if there's no proper symbol. This Clang test checked for the old behaviour, but doesn't actually care about that bit.	2021-07-15 15:05:08 +01:00
serge-sans-paille	4b219051a3	Fix undeduced type assert If the instantiation of a member variable makes it possible to compute a previously undeduced type, we should use that piece of information. Fix bug#50590 Differential Revision: https://reviews.llvm.org/D103849	2021-07-15 10:52:25 +02:00
Chuanqi Xu	8a1727ba51	[Coroutines] Run coroutine passes by default This patch makes coroutine passes run by default in the LLVM pipeline. Now clang and opt can handle IR inputs containing coroutine intrinsics without special options. It should be fine. On the one hand, the coroutine passes seem to be stable since there are already many projects using the coroutine feature. On the other hand, the coroutine passes should do nothing for IR that doesn't contain coroutine intrinsics. Test Plan: check-llvm Reviewed by: lxfind, aeubanks Differential Revision: https://reviews.llvm.org/D105877	2021-07-15 14:33:40 +08:00
Thomas Lively	4a4229f70f	[WebAssembly] Codegen for v128.storeX_lane instructions Replace the experimental clang builtins and LLVM intrinsics for these instructions with normal codegen patterns. Resolves PR50435. Differential Revision: https://reviews.llvm.org/D106019	2021-07-14 16:15:25 -07:00
Kirill Stoimenov	ac500fd18f	[asan][clang] Add flag to outline instrumentation Summary This option can be used to reduce the size of the binary. The trade-off in this case would be the run-time performance. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D105726	2021-07-14 13:36:34 -07:00
Thomas Lively	970e090010	[WebAssembly] Codegen for v128.loadX_lane instructions Replace the experimental clang builtin and LLVM intrinsics for these instructions with normal codegen patterns. Resolves PR50433. Differential Revision: https://reviews.llvm.org/D105950	2021-07-14 11:31:53 -07:00
Aaron Ballman	aefd6c615c	Combine two diagnostics into one and correct grammar The anonymous and non-anonymous bit-field diagnostics are easily combined into one diagnostic. However, the diagnostic was missing a "the" that is present in the almost-identically worded warn_bitfield_width_exceeds_type_width diagnostic, hence the changes to test cases.	2021-07-14 11:43:28 -04:00
Gabor Marton	bdf31471c7	[Analyzer][solver] Add dump methods for (dis)equality classes. This proved to be very useful during debugging. Differential Revision: https://reviews.llvm.org/D103967	2021-07-14 13:45:02 +02:00
Kito Cheng	5635d2a56d	[RISCV] Pass -u to linker correctly. `-u` is a linker option used to pretend a symbol is undefined; this option is commonly used for forcing archive member extraction. This option should be passed to `ld`, as many other toolchains in Clang, such as `tools::gnutools`, already do. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D105091	2021-07-14 14:25:02 +08:00
Zakk Chen	08cf69c31f	[RISCV] Support overloading for RVV miscellaneous functions. Based on this update to the intrinsic doc https://github.com/riscv/rvv-intrinsic-doc/pull/103 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D105611	2021-07-13 21:35:37 -07:00
Richard Smith	8a0f1163d0	Fix test trying to write a spurious output file into the source directory. This causes test failures if the source directory is read-only.	2021-07-13 18:58:24 -07:00
Victor Huang	18c19414eb	[PowerPC] Add PowerPC compare and multiply related builtins and intrinsics for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtins and intrinsics for compare and multiply related operations. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D102875	2021-07-13 16:55:09 -05:00
Artem Belevich	25629bb45f	Fix cuda-bad-arch.cu test. Tests for correctness of HIP architecture need `-x hip`	2021-07-13 11:57:25 -07:00
Victor Huang	781929b423	[PowerPC][NFC] Power ISA features for Semachecking [NFC] This patch adds features for pwr7, pwr8, and pwr9 that can be used for semachecking builtin functions that are only valid for certain versions of ppc. Reviewed By: nemanjai, #powerpc Authored By: Quinn Pham <Quinn.Pham@ibm.com> Differential revision: https://reviews.llvm.org/D105501	2021-07-13 13:13:34 -05:00
Artem Belevich	01d3a3dcab	[CUDA] Only allow NVIDIA offload-arch during CUDA compilation. Otherwise, if someone specifies a valid AMD arch, we may end up triggering an assertion on unexpected arch later on. Differential Revision: https://reviews.llvm.org/D105295	2021-07-13 11:09:14 -07:00
Valeriy Savchenko	60bd8cbc0c	[analyzer][solver][NFC] Refactor how we detect (dis)equalities This patch simplifies the way we deal with (dis)equalities. Due to the symmetry between constraint handler and range inferrer, we can have very similar implementations of logic handling questions about (dis)equality and assumptions involving (dis)equality. It also helps us to remove one more visitor, and removes uncertainty that we got all the right places to put `trackNE` and `trackEQ`. Differential Revision: https://reviews.llvm.org/D105693	2021-07-13 21:00:30 +03:00
Tom Stellard	303ddb60a2	Fix utils/update_cc_test_checks/check-globals.test on stand-alone builds We want to use LLVM_EXTERNAL_LIT if defined for the %lit substitution. Reviewed By: jdenny Differential Revision: https://reviews.llvm.org/D105873	2021-07-13 10:47:30 -07:00
Matheus Izvekov	03282f2fe1	[clang] C++98 implicit moves are back with a vengeance After taking C++98 implicit moves out in D104500, we put it back in, but now in a new form which preserves compatibility with pure C++98 programs, while at the same time giving almost all the goodies from P1825. * We use the exact same rules as C++20 with regards to which id-expressions are move eligible. The previous incarnation would only benefit from the proper subset which is copy elidable. This means we can implicit move, in addition: * Parameters. * RValue references. * Exception variables. * Variables with higher-than-natural required alignment. * Objects with different type from the function return type. * We preserve the two-overload resolution, with one small tweak to the first one: If we either pick a (possibly converting) constructor which does not take an rvalue reference, or a user conversion operator which is not ref-qualified, we abort into the second overload resolution. This gives C++98 almost all the implicit move patterns which we had created test cases for, while at the same time preserving the meaning of these three patterns, which are found in pure C++98 programs: * Classes with both const and non-const copy constructors, but no move constructors, continue to have their non-const copy constructor selected. * We continue to reject as ambiguous the following pattern: ``` struct A { A(B &); }; struct B { operator A(); }; A foo(B x) { return x; } ``` * We continue to pick the copy constructor in the following pattern: ``` class AutoPtrRef { }; struct AutoPtr { AutoPtr(AutoPtr &); AutoPtr(); AutoPtr(AutoPtrRef); operator AutoPtrRef(); }; AutoPtr test_auto_ptr() { AutoPtr p; return p; } ``` Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D105756	2021-07-13 19:16:49 +02:00
Fangrui Song	3d89fb4d13	[RISCV] Support machine constraint "S" Similar to D46745, "S" represents an absolute symbolic operand, which can be used to specify the access models, e.g. extern int var; void *addr_via_asm() { void *ret; asm("lui %0, %%hi(%1)\naddi %0,%0,%%lo(%1)" : "=r"(ret) : "S"(&var)); return ret; } 'S' is documented in trunk GCC: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101275 Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D105254	2021-07-13 09:30:09 -07:00
Albion Fung	f1aca5ac96	[PowerPC] Fix L[D|W]ARX Implementation LDARX and LWARX sometimes get optimized out by the compiler when they are critical to the correctness of the code. This inline asm generation ensures that they are preserved. Differential Revision: https://reviews.llvm.org/D105754	2021-07-13 11:02:07 -05:00
Dave MacLachlan	45ffe6341d	[clang/objc] Optimize getters for non-atomic, copied properties Properties that were declared `@property(copy, nonatomic) id foo` make an unnecessary call to objc_get_property(). This call can be replaced with a direct access to the backing variable identical to how a `@property(nonatomic) id foo` would do it. This reduces codegen by 4 bytes (x86_64/arm64) and removes a cross linkage unit function call per property declared as copy/nonatomic. Differential Revision: https://reviews.llvm.org/D105311	2021-07-13 09:22:13 -04:00
Anton Zabaznov	ab76101f40	[OpenCL] Add support of __opencl_c_read_write_images feature macro This feature requires support of __opencl_c_images, so diagnostics for that is provided as well Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D104915	2021-07-13 15:38:23 +03:00
Anton Zabaznov	78463ebde2	[OpenCL] Add support of __opencl_c_generic_address_space feature macro Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D103401	2021-07-13 13:14:10 +03:00
SharmaRithik	cad9b7f708	[analyzer] Print time taken to analyze each function Summary: This patch is a part of an attempt to obtain more timer data from the analyzer. In this patch, we try to use LLVM::TimeRecord to save time before starting the analysis and to print the time that a specific function takes while getting analyzed. The timer data is printed along with the -analyzer-display-progress outputs. ANALYZE (Syntax): test.c functionName : 0.4 ms ANALYZE (Path, Inline_Regular): test.c functionName : 2.6 ms Authored By: RithikSharma Reviewer: NoQ, xazax.hun, teemperor, vsavchenko Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D105565	2021-07-13 04:52:47 +00:00
Fangrui Song	51fc742ce7	[Driver] Let -fno-integrated-as -gdwarf-5 use -fdwarf-directory-asm While GNU as only allows the directory form of the .file directive for DWARF v5, the integrated assembler prefers the directory form on all DWARF versions (-fdwarf-directory-asm). We currently set CC1 -fno-dwarf-directory-asm for -fno-integrated-as -gdwarf-5 which may cause the directory entry 0 and the filename entry 0 to be incorrect (see D105662 and the example below). This patch makes -fno-integrated-as -gdwarf-5 use -fdwarf-directory-asm as well. ``` cd /tmp/c before % clang -g -gdwarf-5 -fno-integrated-as e/a.c -S -o - | grep '\.file.0' .file 0 "/tmp/c/e/a.c" md5 0x97e31cee64b4e58a4af8787512d735b6 % clang -g -gdwarf-5 -fno-integrated-as e/a.c -c % llvm-dwarfdump a.o | grep include_directories include_directories[ 0] = "/tmp/c/e" after % clang -g -gdwarf-5 -fno-integrated-as e/a.c -S -o - | grep '\.file.0' .file 0 "/tmp/c" "e/a.c" md5 0x97e31cee64b4e58a4af8787512d735b6 % clang -g -gdwarf-5 -fno-integrated-as e/a.c -c % llvm-dwarfdump a.o | grep include_directories include_directories[ 0] = "/tmp/c" ``` Reviewed By: #debug-info, dblaikie, osandov Differential Revision: https://reviews.llvm.org/D105835	2021-07-12 15:46:20 -07:00
Steven Wan	798fe3c774	[PowerPC][AIX] Fix Zero-width bit fields wrt MaxFieldAlign. On AIX, when a pragma pack or pragma align is in effect, zero-width bitfields should pad out to the end of the bitfield container but not increase the alignment requirements of the struct beyond the max field align. Reviewed By: ZarkoCA Differential Revision: https://reviews.llvm.org/D105635	2021-07-12 15:31:15 -04:00
Thomas Lively	cbabfc63b1	[WebAssembly] Custom combines for f32x4.demote_zero_f64x2 Replace the clang builtin function and LLVM intrinsic for f32x4.demote_zero_f64x2 with combines from normal SDNodes. Also add missing combines for i32x4.trunc_sat_zero_f64x2_{s,u}, which share the same pattern. Differential Revision: https://reviews.llvm.org/D105755	2021-07-12 10:32:18 -07:00
Albion Fung	ef49d925e2	[PowerPC] Implement trap and conversion builtins for XL compatibility This patch implements trap and FP to and from double conversions. The builtins generate code that mirror what is generated from the XL compiler. Intrinsics are named conventionally with builtin_ppc, but are aliased to provide the same builtin names as the XL compiler. Differential Revision: https://reviews.llvm.org/D103668	2021-07-12 11:04:17 -05:00
Bardia Mahjour	2071ce9d45	[Altivec] Use signed comparison for vec_all_* and vec_any_* interfaces We are currently being inconsistent in using signed vs unsigned comparisons for vec_all_* and vec_any_* interfaces that use vector bool types. For example we use signed comparison for vec_all_ge(vector signed char, vector bool char) but unsigned comparison for when the arguments are swapped. GCC and XL use signed comparison instead. This patch makes clang consistent with itself and with XL and GCC. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D105666	2021-07-12 11:41:16 -04:00
Abbas Sabra	1af97c9d0b	[analyzer] LoopUnrolling: fix crash when a loop counter is captured in a lambda by reference Reviewed By: vsavchenko Differential Revision: https://reviews.llvm.org/D102273	2021-07-12 17:06:07 +03:00
Corentin Jabot	8747234032	Partially implement P1401R5 (Narrowing contextual conversions to bool) Support Narrowing conversions to bool in if constexpr condition under C++23 language mode. Only if constexpr is implemented as the behavior of static_assert is already conforming. Still need to work on explicit(bool) to complete support.	2021-07-12 08:06:27 -04:00
Nemanja Ivanovic	84e429693f	[PowerPC] Fix rounding mode for vec_round in altivec.h The function is supposed to be the equivalent of rint() (as in round to nearest, ties to even) rather than round() (round to nearest, ties away from zero). In fact, the instruction we emit without VSX is vrfin which is correct. However, with VSX we emit xvrspi which is the equivalent of round() and therefore incorrect. Since there is no equivalent VSX instruction, simply use vrfin regardless of availability of VSX.	2021-07-12 06:11:27 -05:00
Aaron Ballman	de59f56440	[OpenMP] Support OpenMP 5.1 attributes OpenMP 5.1 added support for writing OpenMP directives using [[]] syntax in addition to using #pragma and this introduces support for the new syntax. In OpenMP, the attributes take one of two forms: [[omp::directive(...)]] or [[omp::sequence(...)]]. A directive attribute contains an OpenMP directive clause that is identical to the analogous #pragma syntax. A sequence attribute can contain either sequence or directive arguments and is used to ensure that the attributes are processed sequentially for situations where the order of the attributes matters (remember: https://eel.is/c++draft/dcl.attr.grammar#4.sentence-4). The approach taken here is somewhat novel and deserves mention. We could refactor much of the OpenMP parsing logic to work for either pragma annotation tokens or for attribute clauses. It would be a fair amount of effort to share the logic for both, but it's certainly doable. However, the semantic attribute system is not designed to handle the arbitrarily complex arguments that OpenMP directives contain. Adding support to thread the novel parsed information until we can produce a semantic attribute would be considerably more effort. What's more, existing OpenMP constructs are not (often) represented as semantic attributes. So doing this through Attr.td would be a massive undertaking that would likely only benefit OpenMP and comes with additional risks. Rather than walk down that path, I am taking advantage of the fact that the syntax of the directives within the directive clause is identical to that of the #pragma form. Once the parser recognizes that we're processing an OpenMP attribute, it caches all of the directive argument tokens and then replays them as though the user wrote a pragma. This reuses the same OpenMP parsing and semantic logic directly, but does come with a risk if the OpenMP committee decides to purposefully diverge their pragma and attribute syntaxes.
So, despite this being a novel approach that does token replay, I think it's actually a better approach than trying to do this through the declarative syntax in Attr.td.	2021-07-12 06:51:19 -04:00
Nemanja Ivanovic	41ce5ec5f6	[PowerPC] Remove unnecessary 64-bit guards from altivec.h A number of functions in the header have guards for 64-bit only that were presumably added as some of the functions in the blocks use vector __int128 which is only available in 64-bit mode. A more appropriate guard (__SIZEOF_INT128__) has been added for those functions since, making the 64-bit guards redundant. This patch removes those guards as they inadvertently guard code that uses vector long long which does not actually require 64-bit mode.	2021-07-12 04:59:00 -05:00
Balazs Benics	d3e14fafc6	[analyzer][NFC] Display the correct function name even in crash dumps The `-analyzer-display-progress` displayed the function name of the currently analyzed function. It differs in C and C++. In C++, it prints the argument types as well in a comma-separated list. While in C, only the function name is displayed, without the brackets. E.g.: C++: foo(), foo(int, float) C: foo In crash traces, the analyzer dumps the location contexts, but the string is not enough for `-analyze-function` in C++ mode. This patch addresses the issue by dumping the proper function names even in stack traces. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D105708	2021-07-12 09:06:46 +02:00
Johannes Doerfert	514c033db1	[OpenMP] Detect SPMD compatible kernels and execute them as such In the spirit of TRegions [0], this patch analyzes a kernel and tracks if it can be executed in SPMD-mode. If so, we flip the arguments of the __kmpc_target_init and deinit call to enable the mode. We also update the `<kernel>_exec_mode` flag to indicate to the runtime we changed the mode to SPMD. The code analysis is done interprocedurally by extending the AAKernelInfo abstract attribute to track SPMD compatibility as well. [0] https://link.springer.com/chapter/10.1007/978-3-030-28596-8_11 Differential Revision: https://reviews.llvm.org/D102307	2021-07-10 18:44:25 -05:00
Johannes Doerfert	a706b94ea5	[OpenMP][NFCI] Re-enable two remarks tests after D101977 landed	2021-07-10 18:18:34 -05:00
Johannes Doerfert	e2cfbfcc0c	[OpenMP] Unified entry point for SPMD & generic kernels in the device RTL In the spirit of TRegions [0], this patch provides a simpler and uniform interface for a kernel to set up the device runtime. The OMPIRBuilder is used for reuse in Flang. A custom state machine will be generated in the follow up patch. The "surplus" threads of the "master warp" will not exit early anymore so we need to use non-aligned barriers. The new runtime will not have an extra warp but also require these non-aligned barriers. [0] https://link.springer.com/chapter/10.1007/978-3-030-28596-8_11 This was in parts extracted from D59319. Reviewed By: ABataev, JonChesterfield Differential Revision: https://reviews.llvm.org/D101976	2021-07-10 17:53:56 -05:00
Vassil Vassilev	f01d45c378	Reland "[clang-repl] Allow passing in code as positional arguments." This reverts commit `3ec88ca60b` which reverted `e386871e1d` due to an ASan build failure. This patch removes the new lines in the test case which seem to introduce the failure. Differential revision: https://reviews.llvm.org/D104898	2021-07-10 17:54:00 +00:00
Thomas Lively	e5220104d0	[WebAssembly] Custom combines for f64x2.promote_low_f32x4 Replace the clang builtin function and LLVM intrinsic previously used to select the f64x2.promote_low_f32x4 instruction with custom combines from standard SelectionDAG nodes. Implement the new combines to share code with the similar combines for f64x2.convert_low_i32x4_{s,u}. Resolves PR50232. Differential Revision: https://reviews.llvm.org/D105675	2021-07-09 18:59:29 -07:00
Aaron En Ye Shi	ccb10266f5	[HIP] Move std headers after device malloc/free Set the device malloc and free functions as weak, and move the std headers after device malloc/free to avoid issues with std malloc/free. Fixes: SWDEV-293590 Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D105707	2021-07-09 21:20:16 +00:00
Alexey Bataev	ab8989ab87	[OPENMP]Fix overlapped mapping for dereferenced pointer members. If the base is used in a map clause and later we have a memberexpr with this base, and the member is a pointer, and this pointer is dereferenced anyhow (subscript, array section, dereference, etc.), such components should be considered as overlapped, otherwise it may lead to incorrect size computations, since we try to map a pointee as a part of the whole struct, which is not true for the pointer members. Differential Revision: https://reviews.llvm.org/D105562	2021-07-09 12:51:26 -07:00
David Blaikie	768e3af634	PR51034: Debug Info: Remove 'prototyped' from K&R function declarations Regression caused by `6c9559b67b`.	2021-07-09 12:07:36 -07:00
Nikita Popov	ff8b1b1b9c	Reapply [IR] Don't mark mustprogress as type attribute Reapply with fixes for clang tests. ----- This is a simple enum attribute. Test changes are because enum attributes are sorted before type attributes, so mustprogress is now in a different position.	2021-07-09 20:57:44 +02:00
Varun Gandhi	92dcb1d2db	[Clang] Introduce Swift async calling convention. This change is intended as initial setup. The plan is to add more semantic checks later. I plan to update the documentation as more semantic checks are added (instead of documenting the details up front). Most of the code closely mirrors that for the Swift calling convention. Three places are marked as [FIXME: swiftasynccc]; those will be addressed once the corresponding convention is introduced in LLVM. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D95561	2021-07-09 11:50:10 -07:00
Nico Weber	97c675d3d4	Revert "Revert "Temporarily do not drop volatile stores before unreachable"" This reverts commit `52aeacfbf5`. There isn't full agreement on a path forward yet, but there is agreement that this shouldn't land as-is. See discussion on https://reviews.llvm.org/D105338 Also reverts unreviewed "[clang] Improve `-Wnull-dereference` diag to be more in-line with reality" This reverts commit `f4877c78c0`. And all the related changes to tests: This reverts commit `9a0152799f`. This reverts commit `3f7c9cc274`. This reverts commit `329f8197ef`. This reverts commit `aa9f58cc2c`. This reverts commit `2df37d5ddd`. This reverts commit `a72a441812`.	2021-07-09 11:44:34 -04:00
Roman Lebedev	329f8197ef	[NFC][Clang][CodegenOpenCL] Fix test not to rely on volatile store not being removed	2021-07-09 14:16:54 +03:00
Haojian Wu	47653db6d2	[clang] Fix an infinite loop during typo-correction See https://bugs.llvm.org/show_bug.cgi?id=50797#c6 Differential Revision: https://reviews.llvm.org/D105533	2021-07-09 12:03:57 +02:00
Roman Lebedev	f4877c78c0	[clang] Improve `-Wnull-dereference` diag to be more in-line with reality * Drop any mention of `volatile`. Please refer to https://reviews.llvm.org/D105338 * Drop address space check - it really doesn't affect the behavior, the store will still be dropped: https://godbolt.org/z/dP8fevxG4	2021-07-09 12:51:12 +03:00
jacquesguan	88326bbce3	[RISCV][clang] Add macro __riscv_zvlsseg for RVV Zvlsseg builtins Add extension macro __riscv_zvlsseg to enable Zvlsseg builtins only with target feature Zvlsseg. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D105626	2021-07-09 13:18:42 +08:00
Alexey Bataev	f57d396dca	[OPENMP]Do not privatize const firstprivates in target regions. No need to emit a private copy for firstprivate constants in target regions; we can use the original copy instead. Differential Revision: https://reviews.llvm.org/D105647	2021-07-08 11:55:37 -07:00
Matheus Izvekov	5a1c50410c	[clang] fix constexpr code generation for user conversions. When building the member call to a user conversion function during an implicit cast, the expression was not being checked for immediate invocation, so we were never adding the ConstantExpr node to AST. This would cause the call to the user conversion operator to be emitted even if it was constantexpr evaluated, and this would even trip an assert when said user conversion was declared consteval: `Assertion failed: !cast<FunctionDecl>(GD.getDecl())->isConsteval() && "consteval function should never be emitted", file clang\lib\CodeGen\CodeGenModule.cpp, line 3530` Fixes PR48855. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D105446	2021-07-08 20:23:19 +02:00
Michael Liao	4e5d9c8803	[Internalize] Preserve variables externally initialized. - ``externally_initialized`` variables would be initialized or modified elsewhere. Particularly, CUDA or HIP may have host code to initialize or modify ``externally_initialized`` device variables, which may not be explicitly referenced on the device side but may still be used through the host side interfaces. Not preserving them triggers the elimination of them in the GlobalDCE and breaks the user code. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D105135	2021-07-08 10:48:19 -04:00


44094 Commits